Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.9 KiB |
Average record size in memory | 70.3 B |
Variable types
Categorical | 5 |
---|---|
Numeric | 3 |
Dataset
Description | Sample |
---|---|
Author | 국립중앙도서관 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=b51a8610-3e8c-11eb-af9a-4b03f0a582d6 |
anals_trget_year has constant value "" | Constant |
anals_trget_mt is highly overall correlated with area_nm | High correlation |
area_nm is highly overall correlated with anals_trget_mt | High correlation |
lon_mber_co is highly overall correlated with all_mber_co | High correlation |
all_mber_co is highly overall correlated with lon_mber_co and 1 other fields | High correlation |
read_rt is highly overall correlated with all_mber_co | High correlation |
anals_trget_mt is highly imbalanced (80.6%) | Imbalance |
all_mber_co has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 10:14:58.625178 |
---|---|
Analysis finished | 2023-12-10 10:15:01.604543 |
Duration | 2.98 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
anals_trget_year
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2020 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020 |
---|---|
2nd row | 2020 |
3rd row | 2020 |
4th row | 2020 |
5th row | 2020 |
Common Values
Value | Count | Frequency (%) |
2020 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020 | 100 |
anals_trget_mt
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
1 | |
---|---|
12 | 3 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.03 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 12 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 97 | |
12 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 97 | |
12 | 3 | 3.0% |
area_nm
Categorical
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
경상남도 | |
---|---|
경상북도 | |
광주광역시 | |
경기도 | |
강원도 | |
Other values (2) |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 3.95 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강원도 |
---|---|
2nd row | 충청북도 |
3rd row | 강원도 |
4th row | 강원도 |
5th row | 강원도 |
Common Values
Value | Count | Frequency (%) |
경상남도 | 18 | |
경상북도 | 18 | |
광주광역시 | 18 | |
경기도 | 17 | |
강원도 | 16 | |
대구광역시 | 10 | |
충청북도 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
경상남도 | 18 | |
경상북도 | 18 | |
광주광역시 | 18 | |
경기도 | 17 | |
강원도 | 16 | |
대구광역시 | 10 | |
충청북도 | 3 | 3.0% |
age_flag_nm
Categorical
Distinct | 9 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
유아(6-7) | |
---|---|
초등(8-13) | |
60대이상 | |
20대 | |
영유아(0-5) | |
Other values (4) |
Length
Max length | 10 |
---|---|
Median length | 8 |
Mean length | 5.64 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 영유아(0-5) |
---|---|
2nd row | 50대 |
3rd row | 유아(6-7) |
4th row | 유아(6-7) |
5th row | 초등(8-13) |
Common Values
Value | Count | Frequency (%) |
유아(6-7) | 12 | |
초등(8-13) | 12 | |
60대이상 | 12 | |
20대 | 12 | |
영유아(0-5) | 11 | |
50대 | 11 | |
청소년(14-19) | 11 | |
30대 | 10 | |
40대 | 9 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
유아(6-7 | 12 | |
초등(8-13 | 12 | |
60대이상 | 12 | |
20대 | 12 | |
영유아(0-5 | 11 | |
50대 | 11 | |
청소년(14-19 | 11 | |
30대 | 10 | |
40대 | 9 |
sexdstn_flag_nm
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
남자 | |
---|---|
여자 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 남자 |
---|---|
2nd row | 여자 |
3rd row | 남자 |
4th row | 여자 |
5th row | 남자 |
Common Values
Value | Count | Frequency (%) |
남자 | 51 | |
여자 | 49 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
남자 | 51 | |
여자 | 49 |
lon_mber_co
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 98 |
---|---|
Distinct (%) | 98.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4517.23 |
Minimum | 149 |
---|---|
Maximum | 47021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 149 |
---|---|
5-th percentile | 245.95 |
Q1 | 785.75 |
median | 1649.5 |
Q3 | 3714.25 |
95-th percentile | 21714.25 |
Maximum | 47021 |
Range | 46872 |
Interquartile range (IQR) | 2928.5 |
Descriptive statistics
Standard deviation | 7969.3694 |
---|---|
Coefficient of variation (CV) | 1.764216 |
Kurtosis | 13.197307 |
Mean | 4517.23 |
Median Absolute Deviation (MAD) | 1108.5 |
Skewness | 3.4115903 |
Sum | 451723 |
Variance | 63510849 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1097 | 2 | 2.0% |
537 | 2 | 2.0% |
182 | 1 | 1.0% |
1828 | 1 | 1.0% |
149 | 1 | 1.0% |
231 | 1 | 1.0% |
744 | 1 | 1.0% |
1753 | 1 | 1.0% |
1368 | 1 | 1.0% |
5931 | 1 | 1.0% |
Other values (88) | 88 |
Value | Count | Frequency (%) |
149 | 1 | |
182 | 1 | |
208 | 1 | |
231 | 1 | |
245 | 1 | |
246 | 1 | |
325 | 1 | |
341 | 1 | |
352 | 1 | |
360 | 1 |
Value | Count | Frequency (%) |
47021 | 1 | |
43087 | 1 | |
25058 | 1 | |
24017 | 1 | |
23315 | 1 | |
21630 | 1 | |
18422 | 1 | |
17558 | 1 | |
16944 | 1 | |
12833 | 1 |
all_mber_co
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 91588.43 |
Minimum | 239 |
---|---|
Maximum | 886851 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 239 |
---|---|
5-th percentile | 760.35 |
Q1 | 11544.25 |
median | 28754 |
Q3 | 61825.75 |
95-th percentile | 564609.7 |
Maximum | 886851 |
Range | 886612 |
Interquartile range (IQR) | 50281.5 |
Descriptive statistics
Standard deviation | 181144.62 |
---|---|
Coefficient of variation (CV) | 1.9778112 |
Kurtosis | 9.1447058 |
Mean | 91588.43 |
Median Absolute Deviation (MAD) | 21328.5 |
Skewness | 3.0952621 |
Sum | 9158843 |
Variance | 3.2813374 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
873 | 1 | 1.0% |
38033 | 1 | 1.0% |
766 | 1 | 1.0% |
239 | 1 | 1.0% |
292 | 1 | 1.0% |
11619 | 1 | 1.0% |
12408 | 1 | 1.0% |
33037 | 1 | 1.0% |
22193 | 1 | 1.0% |
63381 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
239 | 1 | |
292 | 1 | |
596 | 1 | |
637 | 1 | |
653 | 1 | |
766 | 1 | |
873 | 1 | |
1021 | 1 | |
1174 | 1 | |
1207 | 1 |
Value | Count | Frequency (%) |
886851 | 1 | |
838395 | 1 | |
765872 | 1 | |
682848 | 1 | |
667755 | 1 | |
559181 | 1 | |
459320 | 1 | |
363170 | 1 | |
327289 | 1 | |
228854 | 1 |
read_rt
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 99 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12.4559 |
Minimum | 1.405 |
---|---|
Maximum | 79.11 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1.405 |
---|---|
5-th percentile | 2.02575 |
Q1 | 4.062 |
median | 6.1715 |
Q3 | 14.72075 |
95-th percentile | 41.41195 |
Maximum | 79.11 |
Range | 77.705 |
Interquartile range (IQR) | 10.65875 |
Descriptive statistics
Standard deviation | 14.270117 |
---|---|
Coefficient of variation (CV) | 1.1456512 |
Kurtosis | 6.1783564 |
Mean | 12.4559 |
Median Absolute Deviation (MAD) | 3.5735 |
Skewness | 2.3718639 |
Sum | 1245.59 |
Variance | 203.63624 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5.608 | 2 | 2.0% |
20.848 | 1 | 1.0% |
6.936 | 1 | 1.0% |
46.997 | 1 | 1.0% |
62.343 | 1 | 1.0% |
79.11 | 1 | 1.0% |
6.403 | 1 | 1.0% |
8.841 | 1 | 1.0% |
5.306 | 1 | 1.0% |
6.164 | 1 | 1.0% |
Other values (89) | 89 |
Value | Count | Frequency (%) |
1.405 | 1 | |
1.496 | 1 | |
1.604 | 1 | |
1.722 | 1 | |
1.888 | 1 | |
2.033 | 1 | |
2.352 | 1 | |
2.438 | 1 | |
2.484 | 1 | |
2.492 | 1 |
Value | Count | Frequency (%) |
79.11 | 1 | |
62.343 | 1 | |
55.259 | 1 | |
49.77 | 1 | |
46.997 | 1 | |
41.118 | 1 | |
41.107 | 1 | |
39.934 | 1 | |
38.386 | 1 | |
35.459 | 1 |
anals_trget_mt | area_nm | age_flag_nm | sexdstn_flag_nm | lon_mber_co | all_mber_co | read_rt | |
---|---|---|---|---|---|---|---|
anals_trget_mt | 1.000 | 1.000 | 0.195 | 0.000 | 0.000 | 0.000 | 0.000 |
area_nm | 1.000 | 1.000 | 0.000 | 0.000 | 0.604 | 0.426 | 0.374 |
age_flag_nm | 0.195 | 0.000 | 1.000 | 0.000 | 0.161 | 0.254 | 0.807 |
sexdstn_flag_nm | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
lon_mber_co | 0.000 | 0.604 | 0.161 | 0.000 | 1.000 | 0.871 | 0.000 |
all_mber_co | 0.000 | 0.426 | 0.254 | 0.000 | 0.871 | 1.000 | 0.000 |
read_rt | 0.000 | 0.374 | 0.807 | 0.000 | 0.000 | 0.000 | 1.000 |
sexdstn_flag_nm | age_flag_nm | anals_trget_mt | area_nm | |
---|---|---|---|---|
sexdstn_flag_nm | 1.000 | 0.000 | 0.000 | 0.000 |
age_flag_nm | 0.000 | 1.000 | 0.185 | 0.000 |
anals_trget_mt | 0.000 | 0.185 | 1.000 | 0.974 |
area_nm | 0.000 | 0.000 | 0.974 | 1.000 |
lon_mber_co | all_mber_co | read_rt | anals_trget_mt | area_nm | age_flag_nm | sexdstn_flag_nm | |
---|---|---|---|---|---|---|---|
lon_mber_co | 1.000 | 0.819 | -0.249 | 0.000 | 0.249 | 0.086 | 0.000 |
all_mber_co | 0.819 | 1.000 | -0.692 | 0.000 | 0.228 | 0.119 | 0.000 |
read_rt | -0.249 | -0.692 | 1.000 | 0.000 | 0.203 | 0.381 | 0.000 |
anals_trget_mt | 0.000 | 0.000 | 0.000 | 1.000 | 0.974 | 0.185 | 0.000 |
area_nm | 0.249 | 0.228 | 0.203 | 0.974 | 1.000 | 0.000 | 0.000 |
age_flag_nm | 0.086 | 0.119 | 0.381 | 0.185 | 0.000 | 1.000 | 0.000 |
sexdstn_flag_nm | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
anals_trget_year | anals_trget_mt | area_nm | age_flag_nm | sexdstn_flag_nm | lon_mber_co | all_mber_co | read_rt | |
---|---|---|---|---|---|---|---|---|
0 | 2020 | 1 | 강원도 | 영유아(0-5) | 남자 | 182 | 873 | 20.848 |
1 | 2020 | 12 | 충청북도 | 50대 | 여자 | 814 | 32660 | 2.492 |
2 | 2020 | 1 | 강원도 | 유아(6-7) | 남자 | 246 | 1243 | 19.791 |
3 | 2020 | 1 | 강원도 | 유아(6-7) | 여자 | 208 | 1021 | 20.372 |
4 | 2020 | 1 | 강원도 | 초등(8-13) | 남자 | 1515 | 11320 | 13.383 |
5 | 2020 | 1 | 강원도 | 초등(8-13) | 여자 | 1775 | 9675 | 18.346 |
6 | 2020 | 1 | 강원도 | 청소년(14-19) | 남자 | 987 | 18049 | 5.468 |
7 | 2020 | 12 | 충청북도 | 60대이상 | 남자 | 537 | 12458 | 4.31 |
8 | 2020 | 1 | 강원도 | 20대 | 남자 | 793 | 33721 | 2.352 |
9 | 2020 | 1 | 강원도 | 20대 | 여자 | 1386 | 44900 | 3.087 |
anals_trget_year | anals_trget_mt | area_nm | age_flag_nm | sexdstn_flag_nm | lon_mber_co | all_mber_co | read_rt | |
---|---|---|---|---|---|---|---|---|
90 | 2020 | 1 | 대구광역시 | 영유아(0-5) | 남자 | 325 | 653 | 49.77 |
91 | 2020 | 1 | 대구광역시 | 영유아(0-5) | 여자 | 352 | 637 | 55.259 |
92 | 2020 | 1 | 대구광역시 | 유아(6-7) | 남자 | 634 | 1788 | 35.459 |
93 | 2020 | 1 | 대구광역시 | 유아(6-7) | 여자 | 623 | 1623 | 38.386 |
94 | 2020 | 1 | 대구광역시 | 초등(8-13) | 남자 | 3614 | 18420 | 19.62 |
95 | 2020 | 1 | 대구광역시 | 초등(8-13) | 여자 | 3820 | 17948 | 21.284 |
96 | 2020 | 1 | 대구광역시 | 청소년(14-19) | 남자 | 1862 | 21435 | 8.687 |
97 | 2020 | 1 | 대구광역시 | 청소년(14-19) | 여자 | 3083 | 26881 | 11.469 |
98 | 2020 | 1 | 대구광역시 | 20대 | 남자 | 1899 | 37890 | 5.012 |
99 | 2020 | 1 | 대구광역시 | 20대 | 여자 | 3679 | 81396 | 4.52 |