Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.0 KiB |
Average record size in memory | 61.3 B |
Variable types
Categorical | 3 |
---|---|
Text | 1 |
Numeric | 3 |
Dataset
Description | Sample |
---|---|
Author | 한국문화정보원 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=e7aee7a5-10e6-4c9e-b615-0658e9a83be1 |
FILE_NAME has constant value "" | Constant |
base_ymd has constant value "" | Constant |
hadm_cd is highly overall correlated with book_str_cnt and 2 other fields | High correlation |
book_str_cnt is highly overall correlated with hadm_cd and 1 other fields | High correlation |
residnt_cnt_sum is highly overall correlated with hadm_cd and 1 other fields | High correlation |
sido_nm is highly overall correlated with hadm_cd | High correlation |
hadm_cd has unique values | Unique |
residnt_cnt_sum has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 09:52:41.004196 |
---|---|
Analysis finished | 2023-12-10 09:52:43.746755 |
Duration | 2.74 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
sido_nm
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
경기도 | |
---|---|
경상남도 | |
강원도 | |
경상북도 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.4 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강원도 |
---|---|
2nd row | 강원도 |
3rd row | 강원도 |
4th row | 강원도 |
5th row | 강원도 |
Common Values
Value | Count | Frequency (%) |
경기도 | 42 | |
경상남도 | 22 | |
강원도 | 18 | |
경상북도 | 18 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
경기도 | 42 | |
경상남도 | 22 | |
강원도 | 18 | |
경상북도 | 18 |
sgg_nm
Text
Distinct | 99 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
창원시 | 5 | 4.1% |
수원시 | 4 | 3.3% |
고양시 | 3 | 2.5% |
성남시 | 3 | 2.5% |
용인시 | 3 | 2.5% |
안양시 | 2 | 1.6% |
안산시 | 2 | 1.6% |
고성군 | 2 | 1.6% |
영덕군 | 1 | 0.8% |
안동시 | 1 | 0.8% |
Other values (96) | 96 |
Most occurring characters
Value | Count | Frequency (%) |
시 | 68 | 17.1% |
군 | 35 | 8.8% |
구 | 25 | 6.3% |
22 | 5.5% | |
양 | 16 | 4.0% |
천 | 14 | 3.5% |
원 | 14 | 3.5% |
산 | 11 | 2.8% |
주 | 11 | 2.8% |
안 | 10 | 2.5% |
Other values (83) | 171 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 375 | |
Space Separator | 22 | 5.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 68 | |
군 | 35 | 9.3% |
구 | 25 | 6.7% |
양 | 16 | 4.3% |
천 | 14 | 3.7% |
원 | 14 | 3.7% |
산 | 11 | 2.9% |
주 | 11 | 2.9% |
안 | 10 | 2.7% |
성 | 10 | 2.7% |
Other values (82) | 161 |
Space Separator
Value | Count | Frequency (%) |
22 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 375 | |
Common | 22 | 5.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 68 | |
군 | 35 | 9.3% |
구 | 25 | 6.7% |
양 | 16 | 4.3% |
천 | 14 | 3.7% |
원 | 14 | 3.7% |
산 | 11 | 2.9% |
주 | 11 | 2.9% |
안 | 10 | 2.7% |
성 | 10 | 2.7% |
Other values (82) | 161 |
Common
Value | Count | Frequency (%) |
22 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 375 | |
ASCII | 22 | 5.5% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 68 | |
군 | 35 | 9.3% |
구 | 25 | 6.7% |
양 | 16 | 4.3% |
천 | 14 | 3.7% |
원 | 14 | 3.7% |
산 | 11 | 2.9% |
주 | 11 | 2.9% |
안 | 10 | 2.7% |
성 | 10 | 2.7% |
Other values (82) | 161 |
ASCII
Value | Count | Frequency (%) |
22 |
hadm_cd
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 44255.9 |
Minimum | 41111 |
---|---|
Maximum | 48890 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 41111 |
---|---|
5-th percentile | 41132.9 |
Q1 | 41425 |
median | 42725 |
Q3 | 47905 |
95-th percentile | 48840.5 |
Maximum | 48890 |
Range | 7779 |
Interquartile range (IQR) | 6480 |
Descriptive statistics
Standard deviation | 3170.1308 |
---|---|
Coefficient of variation (CV) | 0.071631822 |
Kurtosis | -1.7528404 |
Mean | 44255.9 |
Median Absolute Deviation (MAD) | 1543.5 |
Skewness | 0.39435263 |
Sum | 4425590 |
Variance | 10049729 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
42150 | 1 | 1.0% |
48840 | 1 | 1.0% |
48123 | 1 | 1.0% |
48127 | 1 | 1.0% |
48125 | 1 | 1.0% |
48740 | 1 | 1.0% |
48170 | 1 | 1.0% |
48720 | 1 | 1.0% |
48330 | 1 | 1.0% |
48860 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
41111 | 1 | |
41113 | 1 | |
41115 | 1 | |
41117 | 1 | |
41131 | 1 | |
41133 | 1 | |
41135 | 1 | |
41150 | 1 | |
41171 | 1 | |
41173 | 1 |
Value | Count | Frequency (%) |
48890 | 1 | |
48880 | 1 | |
48870 | 1 | |
48860 | 1 | |
48850 | 1 | |
48840 | 1 | |
48820 | 1 | |
48740 | 1 | |
48730 | 1 | |
48720 | 1 |
book_str_cnt
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 52 |
---|---|
Distinct (%) | 52.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26.17 |
Minimum | 0 |
---|---|
Maximum | 107 |
Zeros | 1 |
Zeros (%) | 1.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 5 |
median | 19 |
Q3 | 45.25 |
95-th percentile | 66.1 |
Maximum | 107 |
Range | 107 |
Interquartile range (IQR) | 40.25 |
Descriptive statistics
Standard deviation | 24.164443 |
---|---|
Coefficient of variation (CV) | 0.92336427 |
Kurtosis | 0.14284925 |
Mean | 26.17 |
Median Absolute Deviation (MAD) | 16.5 |
Skewness | 0.86324801 |
Sum | 2617 |
Variance | 583.9203 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 8 | 8.0% |
3 | 6 | 6.0% |
46 | 5 | 5.0% |
5 | 5 | 5.0% |
2 | 5 | 5.0% |
8 | 4 | 4.0% |
7 | 4 | 4.0% |
4 | 4 | 4.0% |
9 | 3 | 3.0% |
50 | 3 | 3.0% |
Other values (42) | 53 |
Value | Count | Frequency (%) |
0 | 1 | 1.0% |
1 | 8 | |
2 | 5 | |
3 | 6 | |
4 | 4 | |
5 | 5 | |
6 | 1 | 1.0% |
7 | 4 | |
8 | 4 | |
9 | 3 | 3.0% |
Value | Count | Frequency (%) |
107 | 1 | |
85 | 1 | |
83 | 1 | |
81 | 1 | |
68 | 1 | |
66 | 1 | |
63 | 2 | |
59 | 1 | |
58 | 1 | |
56 | 1 |
residnt_cnt_sum
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 161769.75 |
Minimum | 5198 |
---|---|
Maximum | 730558 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 5198 |
---|---|
5-th percentile | 17990.55 |
Q1 | 35464.5 |
median | 132815.5 |
Q3 | 246408.75 |
95-th percentile | 417416.75 |
Maximum | 730558 |
Range | 725360 |
Interquartile range (IQR) | 210944.25 |
Descriptive statistics
Standard deviation | 145624.98 |
---|---|
Coefficient of variation (CV) | 0.90019905 |
Kurtosis | 1.8646798 |
Mean | 161769.75 |
Median Absolute Deviation (MAD) | 100570 |
Skewness | 1.2584857 |
Sum | 16176975 |
Variance | 2.1206633 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
162905 | 1 | 1.0% |
30561 | 1 | 1.0% |
191489 | 1 | 1.0% |
162616 | 1 | 1.0% |
144573 | 1 | 1.0% |
45069 | 1 | 1.0% |
280079 | 1 | 1.0% |
18082 | 1 | 1.0% |
269055 | 1 | 1.0% |
24133 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
5198 | 1 | |
11314 | 1 | |
14731 | 1 | |
15570 | 1 | |
16253 | 1 | |
18082 | 1 | |
18628 | 1 | |
19462 | 1 | |
20388 | 1 | |
20917 | 1 |
Value | Count | Frequency (%) |
730558 | 1 | |
618926 | 1 | |
521159 | 1 | |
447439 | 1 | |
423150 | 1 | |
417115 | 1 | |
414376 | 1 | |
364214 | 1 | |
352389 | 1 | |
351337 | 1 |
FILE_NAME
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
KC_623_CLT_SALE_BOOK_STR_MAP_2019 |
---|
Length
Max length | 33 |
---|---|
Median length | 33 |
Mean length | 33 |
Min length | 33 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | KC_623_CLT_SALE_BOOK_STR_MAP_2019 |
---|---|
2nd row | KC_623_CLT_SALE_BOOK_STR_MAP_2019 |
3rd row | KC_623_CLT_SALE_BOOK_STR_MAP_2019 |
4th row | KC_623_CLT_SALE_BOOK_STR_MAP_2019 |
5th row | KC_623_CLT_SALE_BOOK_STR_MAP_2019 |
Common Values
Value | Count | Frequency (%) |
KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
kc_623_clt_sale_book_str_map_2019 | 100 |
base_ymd
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
20200214 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20200214 |
---|---|
2nd row | 20200214 |
3rd row | 20200214 |
4th row | 20200214 |
5th row | 20200214 |
Common Values
Value | Count | Frequency (%) |
20200214 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20200214 | 100 |
sido_nm | sgg_nm | hadm_cd | book_str_cnt | residnt_cnt_sum | |
---|---|---|---|---|---|
sido_nm | 1.000 | 0.675 | 1.000 | 0.406 | 0.487 |
sgg_nm | 0.675 | 1.000 | 0.862 | 1.000 | 1.000 |
hadm_cd | 1.000 | 0.862 | 1.000 | 0.449 | 0.400 |
book_str_cnt | 0.406 | 1.000 | 0.449 | 1.000 | 0.835 |
residnt_cnt_sum | 0.487 | 1.000 | 0.400 | 0.835 | 1.000 |
hadm_cd | book_str_cnt | residnt_cnt_sum | sido_nm | |
---|---|---|---|---|
hadm_cd | 1.000 | -0.545 | -0.567 | 0.990 |
book_str_cnt | -0.545 | 1.000 | 0.928 | 0.262 |
residnt_cnt_sum | -0.567 | 0.928 | 1.000 | 0.304 |
sido_nm | 0.990 | 0.262 | 0.304 | 1.000 |
sido_nm | sgg_nm | hadm_cd | book_str_cnt | residnt_cnt_sum | FILE_NAME | base_ymd | |
---|---|---|---|---|---|---|---|
0 | 강원도 | 강릉시 | 42150 | 52 | 162905 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
1 | 강원도 | 고성군 | 42820 | 2 | 18628 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
2 | 강원도 | 동해시 | 42170 | 9 | 68772 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
3 | 강원도 | 삼척시 | 42230 | 5 | 45409 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
4 | 강원도 | 속초시 | 42210 | 7 | 63203 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
5 | 강원도 | 양구군 | 42800 | 3 | 14731 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
6 | 강원도 | 양양군 | 42830 | 1 | 19462 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
7 | 강원도 | 영월군 | 42750 | 3 | 25832 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
8 | 강원도 | 원주시 | 42130 | 55 | 267203 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
9 | 강원도 | 인제군 | 42810 | 5 | 20917 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
sido_nm | sgg_nm | hadm_cd | book_str_cnt | residnt_cnt_sum | FILE_NAME | base_ymd | |
---|---|---|---|---|---|---|---|
90 | 경상북도 | 상주시 | 47250 | 10 | 68854 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
91 | 경상북도 | 성주군 | 47840 | 1 | 32010 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
92 | 경상북도 | 안동시 | 47170 | 32 | 119201 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
93 | 경상북도 | 영덕군 | 47770 | 2 | 25962 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
94 | 경상북도 | 영양군 | 47760 | 1 | 11314 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
95 | 경상북도 | 영주시 | 47210 | 23 | 79658 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
96 | 경상북도 | 영천시 | 47230 | 12 | 73702 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
97 | 경상북도 | 예천군 | 47900 | 7 | 36291 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
98 | 경상북도 | 울릉군 | 47940 | 0 | 5198 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |
99 | 경상북도 | 울진군 | 47930 | 9 | 36399 | KC_623_CLT_SALE_BOOK_STR_MAP_2019 | 20200214 |