Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 250 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 18.9 KiB |
Average record size in memory | 77.5 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 5 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 한국문화정보원 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=29b8fa51-3c9b-4c79-87ca-20038fd2a70e |
SETLE_PRICE has constant value "" | Constant |
FILE_NM has constant value "" | Constant |
BASE_DE has constant value "" | Constant |
SIGNGU_CD is highly overall correlated with CTPRVN_NM | High correlation |
MRHST_CO is highly overall correlated with OFFLN_MRHST_CO | High correlation |
OFFLN_MRHST_CO is highly overall correlated with MRHST_CO | High correlation |
CTPRVN_NM is highly overall correlated with SIGNGU_CD | High correlation |
SIGNGU_CD has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 09:50:27.733731 |
---|---|
Analysis finished | 2023-12-10 09:50:30.812582 |
Duration | 3.08 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
SIGNGU_CD
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 250 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 29868.48 |
Minimum | 11010 |
---|---|
Maximum | 39020 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.3 KiB |
Quantile statistics
Minimum | 11010 |
---|---|
5-th percentile | 11134.5 |
Q1 | 24042.5 |
median | 32315 |
Q3 | 36327.5 |
95-th percentile | 38114.55 |
Maximum | 39020 |
Range | 28010 |
Interquartile range (IQR) | 12285 |
Descriptive statistics
Standard deviation | 8085.1571 |
---|---|
Coefficient of variation (CV) | 0.27069195 |
Kurtosis | 0.32057893 |
Mean | 29868.48 |
Median Absolute Deviation (MAD) | 4140 |
Skewness | -1.1477138 |
Sum | 7467120 |
Variance | 65369765 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
11010 | 1 | 0.4% |
35310 | 1 | 0.4% |
34330 | 1 | 0.4% |
34340 | 1 | 0.4% |
34350 | 1 | 0.4% |
34360 | 1 | 0.4% |
34370 | 1 | 0.4% |
34380 | 1 | 0.4% |
35011 | 1 | 0.4% |
35012 | 1 | 0.4% |
Other values (240) | 240 |
Value | Count | Frequency (%) |
11010 | 1 | |
11020 | 1 | |
11030 | 1 | |
11040 | 1 | |
11050 | 1 | |
11060 | 1 | |
11070 | 1 | |
11080 | 1 | |
11090 | 1 | |
11100 | 1 |
Value | Count | Frequency (%) |
39020 | 1 | |
39010 | 1 | |
38400 | 1 | |
38390 | 1 | |
38380 | 1 | |
38370 | 1 | |
38360 | 1 | |
38350 | 1 | |
38340 | 1 | |
38330 | 1 |
CTPRVN_NM
Categorical
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 6.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
경기도 | |
---|---|
서울특별시 | |
경상북도 | |
전라남도 | |
경상남도 | |
Other values (12) |
Length
Max length | 7 |
---|---|
Median length | 5 |
Mean length | 4.092 |
Min length | 3 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | 서울특별시 |
---|---|
2nd row | 서울특별시 |
3rd row | 서울특별시 |
4th row | 서울특별시 |
5th row | 서울특별시 |
Common Values
Value | Count | Frequency (%) |
경기도 | 42 | |
서울특별시 | 25 | |
경상북도 | 24 | |
전라남도 | 22 | |
경상남도 | 22 | |
강원도 | 18 | |
부산광역시 | 16 | 6.4% |
충청남도 | 16 | 6.4% |
전라북도 | 15 | 6.0% |
충청북도 | 14 | 5.6% |
Other values (7) | 36 |
Length
Value | Count | Frequency (%) |
경기도 | 42 | |
서울특별시 | 25 | |
경상북도 | 24 | |
전라남도 | 22 | |
경상남도 | 22 | |
강원도 | 18 | |
부산광역시 | 16 | 6.4% |
충청남도 | 16 | 6.4% |
전라북도 | 15 | 6.0% |
충청북도 | 14 | 5.6% |
Other values (7) | 36 |
SIGNGU_NM
Text
Distinct | 228 |
---|---|
Distinct (%) | 91.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
Value | Count | Frequency (%) |
동구 | 6 | 2.1% |
중구 | 6 | 2.1% |
서구 | 5 | 1.8% |
북구 | 5 | 1.8% |
창원시 | 5 | 1.8% |
남구 | 5 | 1.8% |
수원시 | 4 | 1.4% |
청주시 | 4 | 1.4% |
용인시 | 3 | 1.1% |
고양시 | 3 | 1.1% |
Other values (227) | 236 |
Most occurring characters
Value | Count | Frequency (%) |
구 | 106 | 12.2% |
시 | 100 | 11.5% |
군 | 85 | 9.8% |
32 | 3.7% | |
주 | 24 | 2.8% |
천 | 23 | 2.6% |
산 | 23 | 2.6% |
양 | 22 | 2.5% |
성 | 21 | 2.4% |
동 | 20 | 2.3% |
Other values (137) | 415 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 839 | |
Space Separator | 32 | 3.7% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
구 | 106 | 12.6% |
시 | 100 | 11.9% |
군 | 85 | 10.1% |
주 | 24 | 2.9% |
천 | 23 | 2.7% |
산 | 23 | 2.7% |
양 | 22 | 2.6% |
성 | 21 | 2.5% |
동 | 20 | 2.4% |
원 | 18 | 2.1% |
Other values (136) | 397 |
Space Separator
Value | Count | Frequency (%) |
32 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 839 | |
Common | 32 | 3.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
구 | 106 | 12.6% |
시 | 100 | 11.9% |
군 | 85 | 10.1% |
주 | 24 | 2.9% |
천 | 23 | 2.7% |
산 | 23 | 2.7% |
양 | 22 | 2.6% |
성 | 21 | 2.5% |
동 | 20 | 2.4% |
원 | 18 | 2.1% |
Other values (136) | 397 |
Common
Value | Count | Frequency (%) |
32 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 839 | |
ASCII | 32 | 3.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
구 | 106 | 12.6% |
시 | 100 | 11.9% |
군 | 85 | 10.1% |
주 | 24 | 2.9% |
천 | 23 | 2.7% |
산 | 23 | 2.7% |
양 | 22 | 2.6% |
성 | 21 | 2.5% |
동 | 20 | 2.4% |
원 | 18 | 2.1% |
Other values (136) | 397 |
ASCII
Value | Count | Frequency (%) |
32 |
MRHST_CO
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 10 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 55 |
Minimum | 10 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.3 KiB |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 10 |
Q1 | 30 |
median | 55 |
Q3 | 80 |
95-th percentile | 100 |
Maximum | 100 |
Range | 90 |
Interquartile range (IQR) | 50 |
Descriptive statistics
Standard deviation | 28.780432 |
---|---|
Coefficient of variation (CV) | 0.52328058 |
Kurtosis | -1.2246952 |
Mean | 55 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 0 |
Sum | 13750 |
Variance | 828.31325 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10 | 25 | |
20 | 25 | |
30 | 25 | |
40 | 25 | |
50 | 25 | |
60 | 25 | |
70 | 25 | |
80 | 25 | |
90 | 25 | |
100 | 25 |
Value | Count | Frequency (%) |
10 | 25 | |
20 | 25 | |
30 | 25 | |
40 | 25 | |
50 | 25 | |
60 | 25 | |
70 | 25 | |
80 | 25 | |
90 | 25 | |
100 | 25 |
Value | Count | Frequency (%) |
100 | 25 | |
90 | 25 | |
80 | 25 | |
70 | 25 | |
60 | 25 | |
50 | 25 | |
40 | 25 | |
30 | 25 | |
20 | 25 | |
10 | 25 |
ONLINE_MRHST_CO
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
0 | |
---|---|
1 | |
2 | |
3 | 7 |
6 | 2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 3 |
3rd row | 1 |
4th row | 3 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
0 | 153 | |
1 | 60 | 24.0% |
2 | 28 | 11.2% |
3 | 7 | 2.8% |
6 | 2 | 0.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 153 | |
1 | 60 | 24.0% |
2 | 28 | 11.2% |
3 | 7 | 2.8% |
6 | 2 | 0.8% |
OFFLN_MRHST_CO
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 35 |
---|---|
Distinct (%) | 14.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 54.404 |
Minimum | 7 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.3 KiB |
Quantile statistics
Minimum | 7 |
---|---|
5-th percentile | 10 |
Q1 | 30 |
median | 54 |
Q3 | 80 |
95-th percentile | 99 |
Maximum | 100 |
Range | 93 |
Interquartile range (IQR) | 50 |
Descriptive statistics
Standard deviation | 28.746098 |
---|---|
Coefficient of variation (CV) | 0.52838206 |
Kurtosis | -1.229975 |
Mean | 54.404 |
Median Absolute Deviation (MAD) | 25 |
Skewness | -0.0068166876 |
Sum | 13601 |
Variance | 826.33814 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
50 | 21 | 8.4% |
20 | 17 | 6.8% |
90 | 16 | 6.4% |
10 | 16 | 6.4% |
70 | 16 | 6.4% |
80 | 16 | 6.4% |
30 | 15 | 6.0% |
60 | 15 | 6.0% |
40 | 11 | 4.4% |
100 | 10 | 4.0% |
Other values (25) | 97 |
Value | Count | Frequency (%) |
7 | 2 | 0.8% |
8 | 2 | 0.8% |
9 | 5 | 2.0% |
10 | 16 | |
17 | 2 | 0.8% |
18 | 1 | 0.4% |
19 | 5 | 2.0% |
20 | 17 | |
28 | 1 | 0.4% |
29 | 9 |
Value | Count | Frequency (%) |
100 | 10 | |
99 | 8 | |
98 | 6 | 2.4% |
94 | 1 | 0.4% |
90 | 16 | |
89 | 8 | |
88 | 1 | 0.4% |
80 | 16 | |
79 | 4 | 1.6% |
78 | 5 | 2.0% |
SETLE_PRICE
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
100000 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 100000 |
---|---|
2nd row | 100000 |
3rd row | 100000 |
4th row | 100000 |
5th row | 100000 |
Common Values
Value | Count | Frequency (%) |
100000 | 250 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
100000 | 250 |
FILE_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
KC_616_CLT_NURI_SOC_FOCUS_2019 |
---|
Length
Max length | 30 |
---|---|
Median length | 30 |
Mean length | 30 |
Min length | 30 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | KC_616_CLT_NURI_SOC_FOCUS_2019 |
---|---|
2nd row | KC_616_CLT_NURI_SOC_FOCUS_2019 |
3rd row | KC_616_CLT_NURI_SOC_FOCUS_2019 |
4th row | KC_616_CLT_NURI_SOC_FOCUS_2019 |
5th row | KC_616_CLT_NURI_SOC_FOCUS_2019 |
Common Values
Value | Count | Frequency (%) |
KC_616_CLT_NURI_SOC_FOCUS_2019 | 250 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
kc_616_clt_nuri_soc_focus_2019 | 250 |
BASE_DE
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
sample |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | sample |
---|---|
2nd row | sample |
3rd row | sample |
4th row | sample |
5th row | sample |
Common Values
Value | Count | Frequency (%) |
sample | 250 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
sample | 250 |
SIGNGU_CD | CTPRVN_NM | MRHST_CO | ONLINE_MRHST_CO | OFFLN_MRHST_CO | |
---|---|---|---|---|---|
SIGNGU_CD | 1.000 | 0.993 | 0.000 | 0.280 | 0.000 |
CTPRVN_NM | 0.993 | 1.000 | 0.000 | 0.384 | 0.000 |
MRHST_CO | 0.000 | 0.000 | 1.000 | 0.231 | 1.000 |
ONLINE_MRHST_CO | 0.280 | 0.384 | 0.231 | 1.000 | 0.229 |
OFFLN_MRHST_CO | 0.000 | 0.000 | 1.000 | 0.229 | 1.000 |
CTPRVN_NM | ONLINE_MRHST_CO | |
---|---|---|
CTPRVN_NM | 1.000 | 0.203 |
ONLINE_MRHST_CO | 0.203 | 1.000 |
SIGNGU_CD | MRHST_CO | OFFLN_MRHST_CO | CTPRVN_NM | ONLINE_MRHST_CO | |
---|---|---|---|---|---|
SIGNGU_CD | 1.000 | 0.040 | 0.071 | 0.951 | 0.174 |
MRHST_CO | 0.040 | 1.000 | 0.996 | 0.000 | 0.096 |
OFFLN_MRHST_CO | 0.071 | 0.996 | 1.000 | 0.000 | 0.095 |
CTPRVN_NM | 0.951 | 0.000 | 0.000 | 1.000 | 0.203 |
ONLINE_MRHST_CO | 0.174 | 0.096 | 0.095 | 0.203 | 1.000 |
SIGNGU_CD | CTPRVN_NM | SIGNGU_NM | MRHST_CO | ONLINE_MRHST_CO | OFFLN_MRHST_CO | SETLE_PRICE | FILE_NM | BASE_DE | |
---|---|---|---|---|---|---|---|---|---|
0 | 11010 | 서울특별시 | 종로구 | 10 | 1 | 9 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
1 | 11020 | 서울특별시 | 중구 | 20 | 3 | 17 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
2 | 11030 | 서울특별시 | 용산구 | 30 | 1 | 29 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
3 | 11040 | 서울특별시 | 성동구 | 40 | 3 | 37 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
4 | 11050 | 서울특별시 | 광진구 | 50 | 1 | 49 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
5 | 11060 | 서울특별시 | 동대문구 | 60 | 1 | 59 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
6 | 11070 | 서울특별시 | 중랑구 | 70 | 0 | 70 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
7 | 11080 | 서울특별시 | 성북구 | 80 | 0 | 80 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
8 | 11090 | 서울특별시 | 강북구 | 90 | 1 | 89 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
9 | 11100 | 서울특별시 | 도봉구 | 100 | 0 | 100 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
SIGNGU_CD | CTPRVN_NM | SIGNGU_NM | MRHST_CO | ONLINE_MRHST_CO | OFFLN_MRHST_CO | SETLE_PRICE | FILE_NM | BASE_DE | |
---|---|---|---|---|---|---|---|---|---|
240 | 38330 | 경상남도 | 창녕군 | 10 | 0 | 10 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
241 | 38340 | 경상남도 | 고성군 | 20 | 0 | 20 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
242 | 38350 | 경상남도 | 남해군 | 30 | 0 | 30 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
243 | 38360 | 경상남도 | 하동군 | 40 | 0 | 40 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
244 | 38370 | 경상남도 | 산청군 | 50 | 0 | 50 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
245 | 38380 | 경상남도 | 함양군 | 60 | 0 | 60 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
246 | 38390 | 경상남도 | 거창군 | 70 | 0 | 70 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
247 | 38400 | 경상남도 | 합천군 | 80 | 0 | 80 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
248 | 39010 | 제주특별자치도 | 제주시 | 90 | 2 | 88 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |
249 | 39020 | 제주특별자치도 | 서귀포시 | 100 | 1 | 99 | 100000 | KC_616_CLT_NURI_SOC_FOCUS_2019 | sample |