Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.2 KiB |
Average record size in memory | 53.3 B |
Variable types
Categorical | 3 |
---|---|
Numeric | 3 |
Dataset
Description | Sample |
---|---|
Author | 국립중앙도서관 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=36673890-3e8c-11eb-af9a-4b03f0a582d6 |
anals_trget_year is highly overall correlated with kdc_nm | High correlation |
kdc_nm is highly overall correlated with anals_trget_year | High correlation |
book_co is highly overall correlated with lon_co | High correlation |
lon_co is highly overall correlated with book_co and 1 other fields | High correlation |
rate_value is highly overall correlated with lon_co | High correlation |
anals_trget_year is highly imbalanced (80.6%) | Imbalance |
book_co has unique values | Unique |
lon_co has unique values | Unique |
rate_value has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 10:18:03.722695 |
---|---|
Analysis finished | 2023-12-10 10:18:05.619178 |
Duration | 1.9 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
anals_trget_year
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2010 | |
---|---|
2019 | 3 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2010 |
---|---|
2nd row | 2019 |
3rd row | 2010 |
4th row | 2010 |
5th row | 2010 |
Common Values
Value | Count | Frequency (%) |
2010 | 97 | |
2019 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2010 | 97 | |
2019 | 3 | 3.0% |
kdc_nm
Categorical
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 8.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
200 | |
---|---|
300 | |
400 | |
500 | |
100 | |
Other values (3) |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.97 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 000 |
---|---|
2nd row | 미상 |
3rd row | 000 |
4th row | 000 |
5th row | 000 |
Common Values
Value | Count | Frequency (%) |
200 | 16 | |
300 | 16 | |
400 | 16 | |
500 | 16 | |
100 | 15 | |
000 | 14 | |
600 | 4 | 4.0% |
미상 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
200 | 16 | |
300 | 16 | |
400 | 16 | |
500 | 16 | |
100 | 15 | |
000 | 14 | |
600 | 4 | 4.0% |
미상 | 3 | 3.0% |
area_nm
Categorical
Distinct | 16 |
---|---|
Distinct (%) | 16.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
강원도 | |
---|---|
제주특별자치도 | |
경상남도 | |
경상북도 | |
충청남도 | |
Other values (11) |
Length
Max length | 7 |
---|---|
Median length | 5 |
Mean length | 4.61 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강원도 |
---|---|
2nd row | 제주특별자치도 |
3rd row | 경상남도 |
4th row | 경상북도 |
5th row | 광주광역시 |
Common Values
Value | Count | Frequency (%) |
강원도 | 7 | 7.0% |
제주특별자치도 | 7 | 7.0% |
경상남도 | 7 | 7.0% |
경상북도 | 7 | 7.0% |
충청남도 | 7 | 7.0% |
광주광역시 | 6 | 6.0% |
대구광역시 | 6 | 6.0% |
대전광역시 | 6 | 6.0% |
세종특별자치시 | 6 | 6.0% |
울산광역시 | 6 | 6.0% |
Other values (6) | 35 |
Length
Value | Count | Frequency (%) |
강원도 | 7 | 7.0% |
제주특별자치도 | 7 | 7.0% |
경상남도 | 7 | 7.0% |
경상북도 | 7 | 7.0% |
충청남도 | 7 | 7.0% |
광주광역시 | 6 | 6.0% |
대구광역시 | 6 | 6.0% |
대전광역시 | 6 | 6.0% |
세종특별자치시 | 6 | 6.0% |
울산광역시 | 6 | 6.0% |
Other values (6) | 35 |
book_co
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 211812.14 |
Minimum | 2491 |
---|---|
Maximum | 2549281 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 2491 |
---|---|
5-th percentile | 13060.7 |
Q1 | 60505.5 |
median | 119790 |
Q3 | 183071 |
95-th percentile | 661997.1 |
Maximum | 2549281 |
Range | 2546790 |
Interquartile range (IQR) | 122565.5 |
Descriptive statistics
Standard deviation | 336167.36 |
---|---|
Coefficient of variation (CV) | 1.5871015 |
Kurtosis | 25.448746 |
Mean | 211812.14 |
Median Absolute Deviation (MAD) | 61638 |
Skewness | 4.499044 |
Sum | 21181214 |
Variance | 1.1300849 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
131593 | 1 | 1.0% |
182471 | 1 | 1.0% |
124420 | 1 | 1.0% |
64206 | 1 | 1.0% |
6185 | 1 | 1.0% |
503756 | 1 | 1.0% |
142236 | 1 | 1.0% |
205762 | 1 | 1.0% |
53816 | 1 | 1.0% |
73529 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
2491 | 1 | |
3961 | 1 | |
5824 | 1 | |
6185 | 1 | |
6481 | 1 | |
13407 | 1 | |
27238 | 1 | |
29939 | 1 | |
31968 | 1 | |
32997 | 1 |
Value | Count | Frequency (%) |
2549281 | 1 | |
1463156 | 1 | |
1114933 | 1 | |
1044755 | 1 | |
1006260 | 1 | |
643878 | 1 | |
600249 | 1 | |
527380 | 1 | |
503756 | 1 | |
427089 | 1 |
lon_co
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 277399.62 |
Minimum | 1879 |
---|---|
Maximum | 3859151 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1879 |
---|---|
5-th percentile | 12971.65 |
Q1 | 63393 |
median | 123309.5 |
Q3 | 250345 |
95-th percentile | 1000155.8 |
Maximum | 3859151 |
Range | 3857272 |
Interquartile range (IQR) | 186952 |
Descriptive statistics
Standard deviation | 528355.9 |
---|---|
Coefficient of variation (CV) | 1.9046742 |
Kurtosis | 25.129748 |
Mean | 277399.62 |
Median Absolute Deviation (MAD) | 77413 |
Skewness | 4.6242985 |
Sum | 27739962 |
Variance | 2.7915995 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
45670 | 1 | 1.0% |
155476 | 1 | 1.0% |
274678 | 1 | 1.0% |
168421 | 1 | 1.0% |
9108 | 1 | 1.0% |
1274018 | 1 | 1.0% |
259709 | 1 | 1.0% |
393365 | 1 | 1.0% |
140808 | 1 | 1.0% |
209962 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
1879 | 1 | |
2905 | 1 | |
3235 | 1 | |
5021 | 1 | |
9108 | 1 | |
13175 | 1 | |
19652 | 1 | |
32706 | 1 | |
35498 | 1 | |
37607 | 1 |
Value | Count | Frequency (%) |
3859151 | 1 | |
2508004 | 1 | |
2173193 | 1 | |
1290228 | 1 | |
1274018 | 1 | |
985742 | 1 | |
919482 | 1 | |
780364 | 1 | |
627264 | 1 | |
546130 | 1 |
rate_value
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 121.84165 |
Minimum | 24.07 |
---|---|
Maximum | 285.55 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 24.07 |
---|---|
5-th percentile | 47.00085 |
Q1 | 80.1185 |
median | 116.0475 |
Q3 | 151.936 |
95-th percentile | 253.09635 |
Maximum | 285.55 |
Range | 261.48 |
Interquartile range (IQR) | 71.8175 |
Descriptive statistics
Standard deviation | 56.350337 |
---|---|
Coefficient of variation (CV) | 0.4624883 |
Kurtosis | 0.53442747 |
Mean | 121.84165 |
Median Absolute Deviation (MAD) | 36.0055 |
Skewness | 0.82676679 |
Sum | 12184.165 |
Variance | 3175.3605 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
34.705 | 1 | 1.0% |
85.206 | 1 | 1.0% |
220.767 | 1 | 1.0% |
262.313 | 1 | 1.0% |
147.259 | 1 | 1.0% |
252.904 | 1 | 1.0% |
182.59 | 1 | 1.0% |
191.175 | 1 | 1.0% |
261.647 | 1 | 1.0% |
285.55 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
24.07 | 1 | |
31.646 | 1 | |
34.705 | 1 | |
44.823 | 1 | |
46.941 | 1 | |
47.004 | 1 | |
51.68 | 1 | |
53.582 | 1 | |
55.358 | 1 | |
55.635 | 1 |
Value | Count | Frequency (%) |
285.55 | 1 | |
264.471 | 1 | |
262.313 | 1 | |
261.647 | 1 | |
256.751 | 1 | |
252.904 | 1 | |
224.947 | 1 | |
220.767 | 1 | |
191.175 | 1 | |
187.243 | 1 |
anals_trget_year | kdc_nm | area_nm | book_co | lon_co | rate_value | |
---|---|---|---|---|---|---|
anals_trget_year | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.367 |
kdc_nm | 1.000 | 1.000 | 0.000 | 0.367 | 0.000 | 0.502 |
area_nm | 0.000 | 0.000 | 1.000 | 0.436 | 0.463 | 0.595 |
book_co | 0.000 | 0.367 | 0.436 | 1.000 | 0.981 | 0.098 |
lon_co | 0.000 | 0.000 | 0.463 | 0.981 | 1.000 | 0.516 |
rate_value | 0.367 | 0.502 | 0.595 | 0.098 | 0.516 | 1.000 |
anals_trget_year | kdc_nm | area_nm | |
---|---|---|---|
anals_trget_year | 1.000 | 0.969 | 0.000 |
kdc_nm | 0.969 | 1.000 | 0.000 |
area_nm | 0.000 | 0.000 | 1.000 |
book_co | lon_co | rate_value | anals_trget_year | kdc_nm | area_nm | |
---|---|---|---|---|---|---|
book_co | 1.000 | 0.839 | 0.014 | 0.000 | 0.206 | 0.206 |
lon_co | 0.839 | 1.000 | 0.500 | 0.000 | 0.000 | 0.221 |
rate_value | 0.014 | 0.500 | 1.000 | 0.269 | 0.263 | 0.269 |
anals_trget_year | 0.000 | 0.000 | 0.269 | 1.000 | 0.969 | 0.000 |
kdc_nm | 0.206 | 0.000 | 0.263 | 0.969 | 1.000 | 0.000 |
area_nm | 0.206 | 0.221 | 0.269 | 0.000 | 0.000 | 1.000 |
anals_trget_year | kdc_nm | area_nm | book_co | lon_co | rate_value | |
---|---|---|---|---|---|---|
0 | 2010 | 000 | 강원도 | 131593 | 45670 | 34.705 |
1 | 2019 | 미상 | 제주특별자치도 | 174714 | 327140 | 187.243 |
2 | 2010 | 000 | 경상남도 | 105093 | 122747 | 116.798 |
3 | 2010 | 000 | 경상북도 | 62492 | 80863 | 129.397 |
4 | 2010 | 000 | 광주광역시 | 33447 | 35498 | 106.132 |
5 | 2010 | 000 | 대구광역시 | 171159 | 129626 | 75.734 |
6 | 2010 | 000 | 대전광역시 | 116226 | 94988 | 81.727 |
7 | 2019 | 미상 | 충청남도 | 223241 | 162800 | 72.926 |
8 | 2010 | 000 | 세종특별자치시 | 6481 | 2905 | 44.823 |
9 | 2010 | 000 | 울산광역시 | 40609 | 46365 | 114.174 |
anals_trget_year | kdc_nm | area_nm | book_co | lon_co | rate_value | |
---|---|---|---|---|---|---|
90 | 2010 | 500 | 인천광역시 | 131603 | 154389 | 117.314 |
91 | 2010 | 500 | 전라남도 | 154907 | 137140 | 88.531 |
92 | 2010 | 500 | 전라북도 | 74560 | 106030 | 142.208 |
93 | 2010 | 500 | 제주특별자치도 | 90610 | 51225 | 56.533 |
94 | 2010 | 500 | 충청남도 | 156534 | 111865 | 71.464 |
95 | 2010 | 500 | 충청북도 | 156149 | 101341 | 64.9 |
96 | 2010 | 600 | 강원도 | 185217 | 58614 | 31.646 |
97 | 2010 | 600 | 경기도 | 1006260 | 919482 | 91.376 |
98 | 2010 | 600 | 경상남도 | 140916 | 113127 | 80.28 |
99 | 2010 | 600 | 경상북도 | 65765 | 118445 | 180.103 |