Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.9 KiB |
Average record size in memory | 70.3 B |
Variable types
Categorical | 5 |
---|---|
Numeric | 3 |
Dataset
Description | Sample |
---|---|
Author | 국립중앙도서관 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=befff0c0-3e8c-11eb-af9a-4b03f0a582d6 |
anals_trget_year has constant value "" | Constant |
anals_trget_mt is highly overall correlated with area_nm | High correlation |
area_nm is highly overall correlated with anals_trget_mt | High correlation |
lon_co is highly overall correlated with lon_mber_co | High correlation |
lon_mber_co is highly overall correlated with lon_co | High correlation |
read_qy is highly overall correlated with age_flag_nm | High correlation |
age_flag_nm is highly overall correlated with read_qy | High correlation |
anals_trget_mt is highly imbalanced (80.6%) | Imbalance |
lon_co has unique values | Unique |
lon_mber_co has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 09:57:20.033648 |
---|---|
Analysis finished | 2023-12-10 09:57:23.662427 |
Duration | 3.63 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
anals_trget_year
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2019 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2019 |
---|---|
2nd row | 2019 |
3rd row | 2019 |
4th row | 2019 |
5th row | 2019 |
Common Values
Value | Count | Frequency (%) |
2019 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2019 | 100 |
anals_trget_mt
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
1 | |
---|---|
12 | 3 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.03 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 12 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 97 | |
12 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 97 | |
12 | 3 | 3.0% |
area_nm
Categorical
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
경상남도 | |
---|---|
경상북도 | |
광주광역시 | |
경기도 | |
강원도 | |
Other values (2) |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 3.95 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강원도 |
---|---|
2nd row | 충청북도 |
3rd row | 강원도 |
4th row | 강원도 |
5th row | 강원도 |
Common Values
Value | Count | Frequency (%) |
경상남도 | 18 | |
경상북도 | 18 | |
광주광역시 | 18 | |
경기도 | 17 | |
강원도 | 16 | |
대구광역시 | 10 | |
충청북도 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
경상남도 | 18 | |
경상북도 | 18 | |
광주광역시 | 18 | |
경기도 | 17 | |
강원도 | 16 | |
대구광역시 | 10 | |
충청북도 | 3 | 3.0% |
age_flag_nm
Categorical
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
유아(6-7) | |
---|---|
초등(8-13) | |
60대이상 | |
20대 | |
영유아(0-5) | |
Other values (4) |
Length
Max length | 10 |
---|---|
Median length | 8 |
Mean length | 5.64 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 영유아(0-5) |
---|---|
2nd row | 50대 |
3rd row | 유아(6-7) |
4th row | 유아(6-7) |
5th row | 초등(8-13) |
Common Values
Value | Count | Frequency (%) |
유아(6-7) | 12 | |
초등(8-13) | 12 | |
60대이상 | 12 | |
20대 | 12 | |
영유아(0-5) | 11 | |
50대 | 11 | |
청소년(14-19) | 11 | |
30대 | 10 | |
40대 | 9 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
유아(6-7 | 12 | |
초등(8-13 | 12 | |
60대이상 | 12 | |
20대 | 12 | |
영유아(0-5 | 11 | |
50대 | 11 | |
청소년(14-19 | 11 | |
30대 | 10 | |
40대 | 9 |
sexdstn_flag_nm
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
남자 | |
---|---|
여자 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 남자 |
---|---|
2nd row | 여자 |
3rd row | 남자 |
4th row | 여자 |
5th row | 남자 |
Common Values
Value | Count | Frequency (%) |
남자 | 51 | |
여자 | 49 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
남자 | 51 | |
여자 | 49 |
lon_co
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 36095.2 |
Minimum | 1244 |
---|---|
Maximum | 474241 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1244 |
---|---|
5-th percentile | 2454.1 |
Q1 | 5575.75 |
median | 11456 |
Q3 | 37086.5 |
95-th percentile | 123230.55 |
Maximum | 474241 |
Range | 472997 |
Interquartile range (IQR) | 31510.75 |
Descriptive statistics
Standard deviation | 72584.984 |
---|---|
Coefficient of variation (CV) | 2.0109318 |
Kurtosis | 19.701627 |
Mean | 36095.2 |
Median Absolute Deviation (MAD) | 7061 |
Skewness | 4.1939077 |
Sum | 3609520 |
Variance | 5.2685799 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1909 | 1 | 1.0% |
12734 | 1 | 1.0% |
3025 | 1 | 1.0% |
1244 | 1 | 1.0% |
1670 | 1 | 1.0% |
4831 | 1 | 1.0% |
7635 | 1 | 1.0% |
9853 | 1 | 1.0% |
8047 | 1 | 1.0% |
42872 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
1244 | 1 | |
1670 | 1 | |
1909 | 1 | |
2102 | 1 | |
2437 | 1 | |
2455 | 1 | |
2772 | 1 | |
2876 | 1 | |
3025 | 1 | |
3216 | 1 |
Value | Count | Frequency (%) |
474241 | 1 | |
391536 | 1 | |
275445 | 1 | |
261094 | 1 | |
136522 | 1 | |
122531 | 1 | |
112470 | 1 | |
111277 | 1 | |
101173 | 1 | |
84250 | 1 |
lon_mber_co
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4929.08 |
Minimum | 143 |
---|---|
Maximum | 54100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 143 |
---|---|
5-th percentile | 281.55 |
Q1 | 900.5 |
median | 1754 |
Q3 | 4265.25 |
95-th percentile | 21597.65 |
Maximum | 54100 |
Range | 53957 |
Interquartile range (IQR) | 3364.75 |
Descriptive statistics
Standard deviation | 8789.6808 |
---|---|
Coefficient of variation (CV) | 1.7832295 |
Kurtosis | 14.195331 |
Mean | 4929.08 |
Median Absolute Deviation (MAD) | 1125.5 |
Skewness | 3.5073081 |
Sum | 492908 |
Variance | 77258488 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
240 | 1 | 1.0% |
1832 | 1 | 1.0% |
319 | 1 | 1.0% |
143 | 1 | 1.0% |
178 | 1 | 1.0% |
649 | 1 | 1.0% |
1058 | 1 | 1.0% |
1676 | 1 | 1.0% |
1341 | 1 | 1.0% |
6113 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
143 | 1 | |
178 | 1 | |
240 | 1 | |
253 | 1 | |
273 | 1 | |
282 | 1 | |
319 | 1 | |
339 | 1 | |
369 | 1 | |
383 | 1 |
Value | Count | Frequency (%) |
54100 | 1 | |
46160 | 1 | |
27486 | 1 | |
26201 | 1 | |
24783 | 1 | |
21430 | 1 | |
20989 | 1 | |
19782 | 1 | |
18692 | 1 | |
14795 | 1 |
read_qy
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 99 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.40772 |
Minimum | 3.813 |
---|---|
Maximum | 14.761 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 3.813 |
---|---|
5-th percentile | 4.04705 |
Q1 | 5.411 |
median | 6.8975 |
Q3 | 8.7065 |
95-th percentile | 13.0322 |
Maximum | 14.761 |
Range | 10.948 |
Interquartile range (IQR) | 3.2955 |
Descriptive statistics
Standard deviation | 2.5957488 |
---|---|
Coefficient of variation (CV) | 0.3504113 |
Kurtosis | 0.50705404 |
Mean | 7.40772 |
Median Absolute Deviation (MAD) | 1.672 |
Skewness | 0.89471649 |
Sum | 740.772 |
Variance | 6.7379118 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6.901 | 2 | 2.0% |
7.954 | 1 | 1.0% |
7.561 | 1 | 1.0% |
9.483 | 1 | 1.0% |
8.699 | 1 | 1.0% |
9.382 | 1 | 1.0% |
7.444 | 1 | 1.0% |
7.216 | 1 | 1.0% |
5.879 | 1 | 1.0% |
6.001 | 1 | 1.0% |
Other values (89) | 89 |
Value | Count | Frequency (%) |
3.813 | 1 | |
3.818 | 1 | |
3.915 | 1 | |
3.973 | 1 | |
4.029 | 1 | |
4.048 | 1 | |
4.156 | 1 | |
4.219 | 1 | |
4.237 | 1 | |
4.402 | 1 |
Value | Count | Frequency (%) |
14.761 | 1 | |
14.345 | 1 | |
13.589 | 1 | |
13.536 | 1 | |
13.492 | 1 | |
13.008 | 1 | |
12.836 | 1 | |
12.772 | 1 | |
11.599 | 1 | |
11.403 | 1 |
anals_trget_mt | area_nm | age_flag_nm | sexdstn_flag_nm | lon_co | lon_mber_co | read_qy | |
---|---|---|---|---|---|---|---|
anals_trget_mt | 1.000 | 1.000 | 0.195 | 0.000 | 0.000 | 0.000 | 0.000 |
area_nm | 1.000 | 1.000 | 0.000 | 0.000 | 0.472 | 0.422 | 0.445 |
age_flag_nm | 0.195 | 0.000 | 1.000 | 0.000 | 0.221 | 0.346 | 0.799 |
sexdstn_flag_nm | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.133 | 0.000 |
lon_co | 0.000 | 0.472 | 0.221 | 0.000 | 1.000 | 0.979 | 0.579 |
lon_mber_co | 0.000 | 0.422 | 0.346 | 0.133 | 0.979 | 1.000 | 0.468 |
read_qy | 0.000 | 0.445 | 0.799 | 0.000 | 0.579 | 0.468 | 1.000 |
anals_trget_mt | area_nm | sexdstn_flag_nm | age_flag_nm | |
---|---|---|---|---|
anals_trget_mt | 1.000 | 0.974 | 0.000 | 0.185 |
area_nm | 0.974 | 1.000 | 0.000 | 0.000 |
sexdstn_flag_nm | 0.000 | 0.000 | 1.000 | 0.000 |
age_flag_nm | 0.185 | 0.000 | 0.000 | 1.000 |
lon_co | lon_mber_co | read_qy | anals_trget_mt | area_nm | age_flag_nm | sexdstn_flag_nm | |
---|---|---|---|---|---|---|---|
lon_co | 1.000 | 0.961 | 0.042 | 0.000 | 0.303 | 0.112 | 0.000 |
lon_mber_co | 0.961 | 1.000 | -0.187 | 0.000 | 0.241 | 0.177 | 0.090 |
read_qy | 0.042 | -0.187 | 1.000 | 0.000 | 0.238 | 0.524 | 0.000 |
anals_trget_mt | 0.000 | 0.000 | 0.000 | 1.000 | 0.974 | 0.185 | 0.000 |
area_nm | 0.303 | 0.241 | 0.238 | 0.974 | 1.000 | 0.000 | 0.000 |
age_flag_nm | 0.112 | 0.177 | 0.524 | 0.185 | 0.000 | 1.000 | 0.000 |
sexdstn_flag_nm | 0.000 | 0.090 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
anals_trget_year | anals_trget_mt | area_nm | age_flag_nm | sexdstn_flag_nm | lon_co | lon_mber_co | read_qy | |
---|---|---|---|---|---|---|---|---|
0 | 2019 | 1 | 강원도 | 영유아(0-5) | 남자 | 1909 | 240 | 7.954 |
1 | 2019 | 12 | 충청북도 | 50대 | 여자 | 9665 | 1448 | 6.675 |
2 | 2019 | 1 | 강원도 | 유아(6-7) | 남자 | 2437 | 282 | 8.642 |
3 | 2019 | 1 | 강원도 | 유아(6-7) | 여자 | 2102 | 273 | 7.7 |
4 | 2019 | 1 | 강원도 | 초등(8-13) | 남자 | 13338 | 1606 | 8.305 |
5 | 2019 | 1 | 강원도 | 초등(8-13) | 여자 | 14905 | 1862 | 8.005 |
6 | 2019 | 1 | 강원도 | 청소년(14-19) | 남자 | 5019 | 1121 | 4.477 |
7 | 2019 | 12 | 충청북도 | 60대이상 | 남자 | 7763 | 937 | 8.285 |
8 | 2019 | 1 | 강원도 | 20대 | 남자 | 4040 | 904 | 4.469 |
9 | 2019 | 1 | 강원도 | 20대 | 여자 | 6616 | 1503 | 4.402 |
anals_trget_year | anals_trget_mt | area_nm | age_flag_nm | sexdstn_flag_nm | lon_co | lon_mber_co | read_qy | |
---|---|---|---|---|---|---|---|---|
90 | 2019 | 1 | 대구광역시 | 영유아(0-5) | 남자 | 5875 | 398 | 14.761 |
91 | 2019 | 1 | 대구광역시 | 영유아(0-5) | 여자 | 4863 | 339 | 14.345 |
92 | 2019 | 1 | 대구광역시 | 유아(6-7) | 남자 | 10416 | 772 | 13.492 |
93 | 2019 | 1 | 대구광역시 | 유아(6-7) | 여자 | 8893 | 657 | 13.536 |
94 | 2019 | 1 | 대구광역시 | 초등(8-13) | 남자 | 50805 | 4380 | 11.599 |
95 | 2019 | 1 | 대구광역시 | 초등(8-13) | 여자 | 50038 | 4388 | 11.403 |
96 | 2019 | 1 | 대구광역시 | 청소년(14-19) | 남자 | 12956 | 2483 | 5.218 |
97 | 2019 | 1 | 대구광역시 | 청소년(14-19) | 여자 | 20096 | 3905 | 5.146 |
98 | 2019 | 1 | 대구광역시 | 20대 | 남자 | 10778 | 2675 | 4.029 |
99 | 2019 | 1 | 대구광역시 | 20대 | 여자 | 20552 | 4871 | 4.219 |