Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.0 KiB |
Average record size in memory | 61.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 2 |
Text | 2 |
Dataset
Description | Sample |
---|---|
Author | 한국문화예술위원회 (Arts Council Korea) |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=9ce671ae-6203-4303-be49-442689873f42 |
Alerts
ctprvn_cd is highly overall correlated with signgu_cd and 2 other fields | High correlation |
---|---|
signgu_cd is highly overall correlated with ctprvn_cd and 2 other fields | High correlation |
adstrd_cd is highly overall correlated with ctprvn_cd and 2 other fields | High correlation |
ctprvn_nm is highly overall correlated with ctprvn_cd and 2 other fields | High correlation |
co is highly imbalanced (75.8%) | Imbalance |
adstrd_cd has unique values | Unique |
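The 75.8% imbalance reported for co is consistent with an entropy-based score: one minus the Shannon entropy of the class distribution divided by its maximum, log2 of the number of classes. The report does not state the formula, so the sketch below treats that definition as an assumption. The counts (96 and 4) come from the co Common Values table, and `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts."""
    n_classes = len(counts)
    if n_classes < 2:
        # A single class has no distribution to balance; 0.0 is an assumed convention.
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(n_classes)

co_counts = [96, 4]  # counts of the values 1 and 2 in the co column
score = imbalance_score(co_counts)
print(f"{score:.1%}")  # 75.8%
```

A perfectly balanced column scores 0 under this definition, and the score approaches 1 as one class dominates.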
Reproduction
Analysis started | 2023-12-10 10:13:22.728954 |
---|---|
Analysis finished | 2023-12-10 10:13:25.773987 |
Duration | 3.05 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
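A report like this one can be regenerated with ydata-profiling. The following is a minimal sketch, assuming the dataset has been downloaded as a CSV file; the file names and the `build_report` helper are hypothetical, and the title is taken from the Description field above. The settings actually used for this report are in config.json and are not reproduced here.

```python
def build_report(csv_path, html_path="report.html"):
    """Profile a CSV file and write the HTML report; returns the output path."""
    # Imported lazily so this module still loads where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Sample")
    report.to_file(html_path)
    return html_path
```

Calling `build_report("sample.csv")` would write `report.html` next to the script. Inferred variable types may differ from the ones shown here if the configuration differs.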
ctprvn_cd
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 17.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 29.76 |
Minimum | 11 |
---|---|
Maximum | 39 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 11 |
---|---|
5-th percentile | 11 |
Q1 | 23.75 |
median | 32 |
Q3 | 36 |
95-th percentile | 38 |
Maximum | 39 |
Range | 28 |
Interquartile range (IQR) | 12.25 |
Descriptive statistics
Standard deviation | 8.1415756 |
---|---|
Coefficient of variation (CV) | 0.27357445 |
Kurtosis | 0.32903147 |
Mean | 29.76 |
Median Absolute Deviation (MAD) | 5 |
Skewness | -1.1166454 |
Sum | 2976 |
Variance | 66.285253 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
31 | 16 | 16.0% |
35 | 10 | 10.0% |
11 | 10 | 10.0% |
37 | 9 | 9.0% |
38 | 9 | 9.0% |
36 | 8 | 8.0% |
32 | 7 | 7.0% |
23 | 7 | 7.0% |
22 | 5 | 5.0% |
21 | 3 | 3.0% |
Other values (7) | 16 | 16.0% |
Minimum 10 values
Value | Count | Frequency (%) |
11 | 10 | 10.0% |
21 | 3 | 3.0% |
22 | 5 | 5.0% |
23 | 7 | 7.0% |
24 | 1 | 1.0% |
25 | 2 | 2.0% |
26 | 3 | 3.0% |
29 | 1 | 1.0% |
31 | 16 | 16.0% |
32 | 7 | 7.0% |
Maximum 10 values
Value | Count | Frequency (%) |
39 | 3 | 3.0% |
38 | 9 | 9.0% |
37 | 9 | 9.0% |
36 | 8 | 8.0% |
35 | 10 | 10.0% |
34 | 3 | 3.0% |
33 | 3 | 3.0% |
32 | 7 | 7.0% |
31 | 16 | 16.0% |
29 | 1 | 1.0% |
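The ten smallest and ten largest values listed above together cover all 17 distinct values of ctprvn_cd, so the full column can be rebuilt from the counts and the summary statistics rechecked. The sketch below uses only the Python standard library. It assumes linear interpolation between order statistics for the quartiles (`method="inclusive"`) and the sample (n - 1) standard deviation, which are the conventions that reproduce the figures in this report.

```python
import statistics

# Value -> count for ctprvn_cd, read off the two extreme-value tables above.
freq = {11: 10, 21: 3, 22: 5, 23: 7, 24: 1, 25: 2, 26: 3, 29: 1, 31: 16,
        32: 7, 33: 3, 34: 3, 35: 10, 36: 8, 37: 9, 38: 9, 39: 3}

# Expand the frequency table back into the 100 individual observations.
values = sorted(v for v, n in freq.items() for _ in range(n))

mean = statistics.fmean(values)
median = statistics.median(values)
stdev = statistics.stdev(values)  # sample standard deviation (divides by n - 1)
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
mad = statistics.median(abs(v - median) for v in values)

summary = {
    "n": len(values), "sum": sum(values), "mean": mean, "median": median,
    "std": stdev, "variance": stdev ** 2, "cv": stdev / mean,
    "q1": q1, "q3": q3, "iqr": q3 - q1,
    "range": max(values) - min(values), "mad": mad,
}
for name, value in summary.items():
    print(f"{name:>8}: {value}")
```

The derived quantities follow directly from the others: the coefficient of variation is the standard deviation divided by the mean, the IQR is Q3 minus Q1, and the range is the maximum minus the minimum.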
ctprvn_nm
Categorical
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 17.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 7 |
---|---|
Median length | 5 |
Mean length | 4.2 |
Min length | 3 |
Unique
Unique | 2 |
---|---|
Unique (%) | 2.0% |
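Distinct and Unique measure different things here: 17 different values occur, but only 2 of them occur exactly once. The sketch below illustrates that reading, which is an assumption since the report does not define the terms. Counts are keyed by ctprvn_cd as a stand-in for the province names, assuming one name per code, as the matching distinct counts and the 1.000 correlation suggest.

```python
from collections import Counter

# Rows per province, keyed by ctprvn_cd (assumed to map one-to-one to ctprvn_nm).
counts = Counter({11: 10, 21: 3, 22: 5, 23: 7, 24: 1, 25: 2, 26: 3, 29: 1,
                  31: 16, 32: 7, 33: 3, 34: 3, 35: 10, 36: 8, 37: 9, 38: 9,
                  39: 3})

distinct = len(counts)                               # number of different values
unique = sum(1 for n in counts.values() if n == 1)   # values that occur exactly once

print(distinct, unique)  # 17 2
```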
Sample
1st row | 경상북도 |
---|---|
2nd row | 인천광역시 |
3rd row | 전라북도 |
4th row | 인천광역시 |
5th row | 서울특별시 |
Common Values
Value | Count | Frequency (%) |
경기도 | 16 | 16.0% |
전라북도 | 10 | 10.0% |
서울특별시 | 10 | 10.0% |
경상북도 | 9 | 9.0% |
경상남도 | 9 | 9.0% |
전라남도 | 8 | 8.0% |
강원도 | 7 | 7.0% |
인천광역시 | 7 | 7.0% |
대구광역시 | 5 | 5.0% |
부산광역시 | 3 | 3.0% |
Other values (7) | 16 | 16.0% |
signgu_cd
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 94 |
---|---|
Distinct (%) | 94.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 29910.37 |
Minimum | 11010 |
---|---|
Maximum | 39020 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 11010 |
---|---|
5-th percentile | 11077 |
Q1 | 23787.5 |
median | 32045 |
Q3 | 36382.5 |
95-th percentile | 38322 |
Maximum | 39020 |
Range | 28010 |
Interquartile range (IQR) | 12595 |
Descriptive statistics
Standard deviation | 8191.2265 |
---|---|
Coefficient of variation (CV) | 0.27385908 |
Kurtosis | 0.30031795 |
Mean | 29910.37 |
Median Absolute Deviation (MAD) | 5030 |
Skewness | -1.1069241 |
Sum | 2991037 |
Variance | 67096192 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
11010 | 4 | 4.0% |
39010 | 2 | 2.0% |
35310 | 2 | 2.0% |
31070 | 2 | 2.0% |
37100 | 1 | 1.0% |
23050 | 1 | 1.0% |
24020 | 1 | 1.0% |
31160 | 1 | 1.0% |
34310 | 1 | 1.0% |
36460 | 1 | 1.0% |
Other values (84) | 84 | 84.0% |
Minimum 10 values
Value | Count | Frequency (%) |
11010 | 4 | 4.0% |
11020 | 1 | 1.0% |
11080 | 1 | 1.0% |
11140 | 1 | 1.0% |
11170 | 1 | 1.0% |
11190 | 1 | 1.0% |
11220 | 1 | 1.0% |
21070 | 1 | 1.0% |
21080 | 1 | 1.0% |
21090 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
39020 | 1 | 1.0% |
39010 | 2 | 2.0% |
38390 | 1 | 1.0% |
38360 | 1 | 1.0% |
38320 | 1 | 1.0% |
38310 | 1 | 1.0% |
38114 | 1 | 1.0% |
38111 | 1 | 1.0% |
38100 | 1 | 1.0% |
38090 | 1 | 1.0% |
signgu_nm
Text
Distinct | 87 |
---|---|
Distinct (%) | 87.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
종로구 | 4 | 3.7% |
북구 | 3 | 2.8% |
서구 | 3 | 2.8% |
중구 | 3 | 2.8% |
안양시 | 2 | 1.9% |
제주시 | 2 | 1.9% |
남구 | 2 | 1.9% |
완주군 | 2 | 1.9% |
평택시 | 2 | 1.9% |
전주시 | 2 | 1.9% |
Other values (82) | 83 | 76.9% |
Most occurring characters
Value | Count | Frequency (%) |
시 | 44 | 13.3% |
구 | 39 | 11.7% |
군 | 29 | 8.7% |
주 | 12 | 3.6% |
천 | 9 | 2.7% |
(space) | 8 | 2.4% |
산 | 8 | 2.4% |
동 | 7 | 2.1% |
양 | 7 | 2.1% |
성 | 7 | 2.1% |
Other values (82) | 162 | 48.8% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 324 | 97.6% |
Space Separator | 8 | 2.4% |
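The category names in this table are Unicode general categories: "Other Letter" is `Lo` (which includes Hangul syllables), "Space Separator" is `Zs`, and "Decimal Number" is `Nd`. As a rough illustration, the sketch below counts categories with the standard library's `unicodedata` module. This is a stand-in and not necessarily the routine the profiler uses, and `category_counts` is a hypothetical helper name. The two strings are taken from the sample rows of this report.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# A signgu_nm value containing a space, and an adstrd_nm value containing a digit.
district = category_counts("전주시 덕진구")
neighbourhood = category_counts("계산1동")

print(dict(district))        # {'Lo': 6, 'Zs': 1}
print(dict(neighbourhood))   # {'Lo': 3, 'Nd': 1}
```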
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 44 | 13.6% |
구 | 39 | 12.0% |
군 | 29 | 9.0% |
주 | 12 | 3.7% |
천 | 9 | 2.8% |
산 | 8 | 2.5% |
동 | 7 | 2.2% |
양 | 7 | 2.2% |
성 | 7 | 2.2% |
안 | 7 | 2.2% |
Other values (81) | 155 | 47.8% |
Space Separator
Value | Count | Frequency (%) |
(space) | 8 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 324 | 97.6% |
Common | 8 | 2.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 44 | 13.6% |
구 | 39 | 12.0% |
군 | 29 | 9.0% |
주 | 12 | 3.7% |
천 | 9 | 2.8% |
산 | 8 | 2.5% |
동 | 7 | 2.2% |
양 | 7 | 2.2% |
성 | 7 | 2.2% |
안 | 7 | 2.2% |
Other values (81) | 155 | 47.8% |
Common
Value | Count | Frequency (%) |
(space) | 8 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 324 | 97.6% |
ASCII | 8 | 2.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 44 | 13.6% |
구 | 39 | 12.0% |
군 | 29 | 9.0% |
주 | 12 | 3.7% |
천 | 9 | 2.8% |
산 | 8 | 2.5% |
동 | 7 | 2.2% |
양 | 7 | 2.2% |
성 | 7 | 2.2% |
안 | 7 | 2.2% |
Other values (81) | 155 | 47.8% |
ASCII
Value | Count | Frequency (%) |
(space) | 8 | 100.0% |
adstrd_cd
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2991082.3 |
Minimum | 1101063 |
---|---|
Maximum | 3902054 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1101063 |
---|---|
5-th percentile | 1107781.6 |
Q1 | 2378819.8 |
median | 3204556.5 |
Q3 | 3638261 |
95-th percentile | 3832211 |
Maximum | 3902054 |
Range | 2800991 |
Interquartile range (IQR) | 1259441.2 |
Descriptive statistics
Standard deviation | 819112 |
---|---|
Coefficient of variation (CV) | 0.27385138 |
Kurtosis | 0.3003565 |
Mean | 2991082.3 |
Median Absolute Deviation (MAD) | 502995.5 |
Skewness | -1.1069377 |
Sum | 2.9910823 × 10^8 |
Variance | 6.7094448 × 10^11 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
3710055 | 1 | 1.0% |
3105089 | 1 | 1.0% |
2402068 | 1 | 1.0% |
3116054 | 1 | 1.0% |
3431011 | 1 | 1.0% |
3646011 | 1 | 1.0% |
3123053 | 1 | 1.0% |
2205077 | 1 | 1.0% |
2505053 | 1 | 1.0% |
3110356 | 1 | 1.0% |
Other values (90) | 90 | 90.0% |
Minimum 10 values
Value | Count | Frequency (%) |
1101063 | 1 | 1.0% |
1101064 | 1 | 1.0% |
1101072 | 1 | 1.0% |
1101073 | 1 | 1.0% |
1102055 | 1 | 1.0% |
1108083 | 1 | 1.0% |
1114060 | 1 | 1.0% |
1117052 | 1 | 1.0% |
1119055 | 1 | 1.0% |
1122051 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
3902054 | 1 | 1.0% |
3901064 | 1 | 1.0% |
3901052 | 1 | 1.0% |
3839011 | 1 | 1.0% |
3836011 | 1 | 1.0% |
3832011 | 1 | 1.0% |
3831011 | 1 | 1.0% |
3811456 | 1 | 1.0% |
3811155 | 1 | 1.0% |
3810058 | 1 | 1.0% |
adstrd_nm
Text
Distinct | 97 |
---|---|
Distinct (%) | 97.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
중앙동 | 4 | 4.0% |
북부동 | 1 | 1.0% |
당산1동 | 1 | 1.0% |
산본1동 | 1 | 1.0% |
금산읍 | 1 | 1.0% |
완도읍 | 1 | 1.0% |
사우동 | 1 | 1.0% |
관음동 | 1 | 1.0% |
회덕동 | 1 | 1.0% |
마두1동 | 1 | 1.0% |
Other values (87) | 87 | 87.0% |
Most occurring characters
Value | Count | Frequency (%) |
동 | 71 | 21.0% |
읍 | 29 | 8.6% |
1 | 23 | 6.8% |
성 | 7 | 2.1% |
중 | 6 | 1.8% |
진 | 6 | 1.8% |
산 | 5 | 1.5% |
덕 | 5 | 1.5% |
앙 | 4 | 1.2% |
흥 | 4 | 1.2% |
Other values (105) | 178 | 52.7% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 304 | 89.9% |
Decimal Number | 31 | 9.2% |
Other Punctuation | 3 | 0.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 71 | 23.4% |
읍 | 29 | 9.5% |
성 | 7 | 2.3% |
중 | 6 | 2.0% |
진 | 6 | 2.0% |
산 | 5 | 1.6% |
덕 | 5 | 1.6% |
앙 | 4 | 1.3% |
흥 | 4 | 1.3% |
장 | 4 | 1.3% |
Other values (99) | 163 | 53.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 23 | 74.2% |
2 | 3 | 9.7% |
3 | 3 | 9.7% |
5 | 1 | 3.2% |
6 | 1 | 3.2% |
Other Punctuation
Value | Count | Frequency (%) |
· | 3 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 304 | 89.9% |
Common | 34 | 10.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 71 | 23.4% |
읍 | 29 | 9.5% |
성 | 7 | 2.3% |
중 | 6 | 2.0% |
진 | 6 | 2.0% |
산 | 5 | 1.6% |
덕 | 5 | 1.6% |
앙 | 4 | 1.3% |
흥 | 4 | 1.3% |
장 | 4 | 1.3% |
Other values (99) | 163 | 53.6% |
Common
Value | Count | Frequency (%) |
1 | 23 | 67.6% |
· | 3 | 8.8% |
2 | 3 | 8.8% |
3 | 3 | 8.8% |
5 | 1 | 2.9% |
6 | 1 | 2.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 304 | 89.9% |
ASCII | 31 | 9.2% |
None | 3 | 0.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 71 | 23.4% |
읍 | 29 | 9.5% |
성 | 7 | 2.3% |
중 | 6 | 2.0% |
진 | 6 | 2.0% |
산 | 5 | 1.6% |
덕 | 5 | 1.6% |
앙 | 4 | 1.3% |
흥 | 4 | 1.3% |
장 | 4 | 1.3% |
Other values (99) | 163 | 53.6% |
ASCII
Value | Count | Frequency (%) |
1 | 23 | 74.2% |
2 | 3 | 9.7% |
3 | 3 | 9.7% |
5 | 1 | 3.2% |
6 | 1 | 3.2% |
None
Value | Count | Frequency (%) |
· | 3 | 100.0% |
co
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 1 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
1 | 96 | 96.0% |
2 | 4 | 4.0% |
Variable | ctprvn_cd | ctprvn_nm | signgu_cd | signgu_nm | adstrd_cd | adstrd_nm | co |
---|---|---|---|---|---|---|---|
ctprvn_cd | 1.000 | 1.000 | 0.999 | 0.854 | 0.999 | 0.978 | 0.000 |
ctprvn_nm | 1.000 | 1.000 | 0.994 | 0.000 | 0.994 | 0.986 | 0.000 |
signgu_cd | 0.999 | 0.994 | 1.000 | 0.562 | 1.000 | 0.984 | 0.000 |
signgu_nm | 0.854 | 0.000 | 0.562 | 1.000 | 0.820 | 0.986 | 1.000 |
adstrd_cd | 0.999 | 0.994 | 1.000 | 0.820 | 1.000 | 0.988 | 0.000 |
adstrd_nm | 0.978 | 0.986 | 0.984 | 0.986 | 0.988 | 1.000 | 1.000 |
co | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000 |
Variable | ctprvn_nm | co |
---|---|---|
ctprvn_nm | 1.000 | 0.000 |
co | 0.000 | 1.000 |
Variable | ctprvn_cd | signgu_cd | adstrd_cd | ctprvn_nm | co |
---|---|---|---|---|---|
ctprvn_cd | 1.000 | 0.996 | 0.996 | 0.950 | 0.000 |
signgu_cd | 0.996 | 1.000 | 1.000 | 0.925 | 0.000 |
adstrd_cd | 0.996 | 1.000 | 1.000 | 0.925 | 0.000 |
ctprvn_nm | 0.950 | 0.925 | 0.925 | 1.000 | 0.000 |
co | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
First rows
Row | ctprvn_cd | ctprvn_nm | signgu_cd | signgu_nm | adstrd_cd | adstrd_nm | co |
---|---|---|---|---|---|---|---|
0 | 37 | 경상북도 | 37100 | 경산시 | 3710055 | 북부동 | 2 |
1 | 23 | 인천광역시 | 23070 | 계양구 | 2307053 | 계산1동 | 1 |
2 | 35 | 전라북도 | 35012 | 전주시 덕진구 | 3501257 | 덕진동 | 2 |
3 | 23 | 인천광역시 | 23040 | 연수구 | 2304066 | 송도2동 | 2 |
4 | 11 | 서울특별시 | 11220 | 서초구 | 1122051 | 서초1동 | 2 |
5 | 34 | 충청남도 | 34080 | 당진시 | 3408051 | 당진1동 | 1 |
6 | 35 | 전라북도 | 35380 | 부안군 | 3538011 | 부안읍 | 1 |
7 | 35 | 전라북도 | 35310 | 완주군 | 3531013 | 용진읍 | 1 |
8 | 23 | 인천광역시 | 23060 | 부평구 | 2306053 | 부평3동 | 1 |
9 | 33 | 충청북도 | 33370 | 음성군 | 3337011 | 음성읍 | 1 |
Last rows
Row | ctprvn_cd | ctprvn_nm | signgu_cd | signgu_nm | adstrd_cd | adstrd_nm | co |
---|---|---|---|---|---|---|---|
90 | 21 | 부산광역시 | 21080 | 북구 | 2108056 | 덕천1동 | 1 |
91 | 37 | 경상북도 | 37070 | 영천시 | 3707052 | 중앙동 | 1 |
92 | 23 | 인천광역시 | 23080 | 서구 | 2308056 | 가정3동 | 1 |
93 | 32 | 강원도 | 32030 | 강릉시 | 3203062 | 강남동 | 1 |
94 | 32 | 강원도 | 32070 | 삼척시 | 3207052 | 성내동 | 1 |
95 | 33 | 충청북도 | 33030 | 제천시 | 3303060 | 화산동 | 1 |
96 | 11 | 서울특별시 | 11140 | 마포구 | 1114060 | 대흥동 | 1 |
97 | 31 | 경기도 | 31270 | 포천시 | 3127031 | 군내면 | 1 |
98 | 35 | 전라북도 | 35011 | 전주시 완산구 | 3501175 | 풍남동 | 1 |
99 | 37 | 경상북도 | 37330 | 청송군 | 3733011 | 청송읍 | 1 |
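In the sample rows above, the three code columns are nested: dropping the last two digits of adstrd_cd gives signgu_cd, and dropping the last three digits of signgu_cd gives ctprvn_cd. That would explain the near-perfect correlations among them. The sketch below checks this on a few of the sample rows. Whether it holds for every row of the dataset is an assumption, and `codes_are_nested` is a hypothetical helper name.

```python
# (ctprvn_cd, signgu_cd, adstrd_cd) for a few of the sample rows above.
rows = [
    (37, 37100, 3710055),
    (23, 23070, 2307053),
    (35, 35012, 3501257),
    (11, 11220, 1122051),
    (21, 21080, 2108056),
    (31, 31270, 3127031),
]

def codes_are_nested(ctprvn_cd, signgu_cd, adstrd_cd):
    """True if each code is a digit prefix of the next, finer-grained code."""
    return adstrd_cd // 100 == signgu_cd and signgu_cd // 1000 == ctprvn_cd

all_nested = all(codes_are_nested(*row) for row in rows)
print(all_nested)  # True
```

If the nesting holds throughout, ctprvn_cd and signgu_cd carry no information beyond adstrd_cd and can be derived from it when needed.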