Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 242 |
Missing cells (%) | 0.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 752.0 KiB |
Average record size in memory | 77.0 B |
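The two derived figures in this table follow from the raw counts. A minimal sketch of that arithmetic, assuming the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative and not part of the report:

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 8
n_observations = 10_000
missing_cells = 242
total_size_kib = 752.0

# Share of missing cells over all cells (variables x observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average bytes per record, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results round to the values shown in the table (0.3% and 77.0 B).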
Variable types
Categorical | 2 |
---|---|
Text | 3 |
Numeric | 3 |
Dataset
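Description | Address-to-map-sheet matching information drawn from the metadata for the digital maps (digital topographic maps) of the National Geographic Information Institute (국토지리정보원). Fields include scale (축척), city/county/district (시군구), city/county/district code (시군구코드), map sheet code (도엽코드), and map sheet name (도엽명), among others. |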
---|---|
Author | Ministry of Land, Infrastructure and Transport, National Geographic Information Institute (국토교통부 국토지리정보원) |
URL | https://www.data.go.kr/data/15067686/fileData.do |
Reproduction
Analysis started | 2023-12-12 11:58:13.300795 |
---|---|
Analysis finished | 2023-12-12 11:58:15.514618 |
Duration | 2.21 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
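The reported duration can be checked against the two timestamps. A small sketch using only the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2023-12-12 11:58:13.300795")
finished = datetime.fromisoformat("2023-12-12 11:58:15.514618")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```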
축척
Categorical
Alerts: HIGH CORRELATION, IMBALANCE
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 4.0083 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 5000 |
---|---|
2nd row | 1000 |
3rd row | 1000 |
4th row | 1000 |
5th row | 5000 |
Common Values
Value | Count | Frequency (%) |
1000 | 7807 | 78.1% |
5000 | 2148 | 21.5% |
250000 | 38 | 0.4% |
25000 | 7 | 0.1% |
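The frequency column is each count divided by the total number of observations. A minimal sketch of that calculation from the counts in the table; the helper name `frequency_table` is illustrative and is not a ydata-profiling function:

```python
from collections import Counter

# Counts copied from the "Common Values" table for 축척 (scale).
counts = Counter({"1000": 7807, "5000": 2148, "250000": 38, "25000": 7})


def frequency_table(counts):
    """Return (value, count, percent) rows, most common first."""
    total = sum(counts.values())
    return [(value, n, 100 * n / total) for value, n in counts.most_common()]


for value, n, pct in frequency_table(counts):
    print(f"{value} | {n} | {pct:.1f}% |")
```

The four counts sum to 10000, which matches the number of observations and the zero missing values reported for this variable.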
시군구
Text
Distinct | 233 |
---|---|
Distinct (%) | 2.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
청주시 | 700 | 6.3% |
창원시 | 307 | 2.8% |
울주군 | 296 | 2.7% |
고양시 | 258 | 2.3% |
북구 | 247 | 2.2% |
서귀포시 | 246 | 2.2% |
제주시 | 229 | 2.1% |
화성시 | 223 | 2.0% |
서구 | 216 | 1.9% |
동구 | 205 | 1.8% |
Other values (219) | 8224 | 73.8% |
Most occurring characters
Value | Count | Frequency (%) |
시 | 6052 | 17.6% |
구 | 3654 | 10.6% |
주 | 2298 | 6.7% |
군 | 1904 | 5.5% |
(space) | 1151 | 3.3% |
양 | 1043 | 3.0% |
산 | 987 | 2.9% |
청 | 832 | 2.4% |
원 | 822 | 2.4% |
서 | 792 | 2.3% |
Other values (133) | 14948 | 43.3% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 33332 | 96.7% |
Space Separator | 1151 | 3.3% |
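The category names in this table correspond to Unicode general categories. A minimal sketch of how such counts could be produced with Python's standard `unicodedata` module; this is an illustration only, and ydata-profiling's own implementation may differ. The helper name and the three input strings (taken from sample rows of this report) are chosen for the example:

```python
import unicodedata
from collections import Counter

# Readable names for the general categories that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Pd": "Dash Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in values:
        for ch in text:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unlisted categories.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


sample = ["청주시 서원구", "해남군", "전주시 완산구"]
for name, n in category_counts(sample).most_common():
    print(f"{name} | {n} |")
```

Hangul syllables fall under "Other Letter" and the ASCII space under "Space Separator", which is why only those two categories appear for this variable.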
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 6052 | 18.2% |
구 | 3654 | 11.0% |
주 | 2298 | 6.9% |
군 | 1904 | 5.7% |
양 | 1043 | 3.1% |
산 | 987 | 3.0% |
청 | 832 | 2.5% |
원 | 822 | 2.5% |
서 | 792 | 2.4% |
성 | 741 | 2.2% |
Other values (132) | 14207 | 42.6% |
Space Separator
Value | Count | Frequency (%) |
(space) | 1151 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 33332 | 96.7% |
Common | 1151 | 3.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 6052 | 18.2% |
구 | 3654 | 11.0% |
주 | 2298 | 6.9% |
군 | 1904 | 5.7% |
양 | 1043 | 3.1% |
산 | 987 | 3.0% |
청 | 832 | 2.5% |
원 | 822 | 2.5% |
서 | 792 | 2.4% |
성 | 741 | 2.2% |
Other values (132) | 14207 | 42.6% |
Common
Value | Count | Frequency (%) |
(space) | 1151 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 33332 | 96.7% |
ASCII | 1151 | 3.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 6052 | 18.2% |
구 | 3654 | 11.0% |
주 | 2298 | 6.9% |
군 | 1904 | 5.7% |
양 | 1043 | 3.1% |
산 | 987 | 3.0% |
청 | 832 | 2.5% |
원 | 822 | 2.5% |
서 | 792 | 2.4% |
성 | 741 | 2.2% |
Other values (132) | 14207 | 42.6% |
ASCII
Value | Count | Frequency (%) |
(space) | 1151 | 100.0% |
시군구코드
Real number (ℝ)
Distinct | 255 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.9100168 × 10^9 |
Minimum | 1.111 × 10^9 |
---|---|
Maximum | 5.013 × 10^9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.111 × 10^9 |
---|---|
5-th percentile | 1.162 × 10^9 |
Q1 | 3.171 × 10^9 |
median | 4.223 × 10^9 |
Q3 | 4.677 × 10^9 |
95-th percentile | 4.888 × 10^9 |
Maximum | 5.013 × 10^9 |
Range | 3.902 × 10^9 |
Interquartile range (IQR) | 1.506 × 10^9 |
Descriptive statistics
Standard deviation | 1.0286851 × 10^9 |
---|---|
Coefficient of variation (CV) | 0.26308968 |
Kurtosis | 1.2094834 |
Mean | 3.9100168 × 10^9 |
Median Absolute Deviation (MAD) | 4.68 × 10^8 |
Skewness | -1.4100795 |
Sum | 3.9100168 × 10^13 |
Variance | 1.058193 × 10^18 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4311000000 | 325 | 3.2% |
3171000000 | 296 | 3.0% |
5013000000 | 246 | 2.5% |
5011000000 | 229 | 2.3% |
4159000000 | 223 | 2.2% |
4719000000 | 195 | 1.9% |
4136000000 | 179 | 1.8% |
2771000000 | 152 | 1.5% |
2920000000 | 133 | 1.3% |
4812000000 | 132 | 1.3% |
Other values (245) | 7890 | 78.9% |
Value | Count | Frequency (%) |
1111000000 | 24 | 0.2% |
1114000000 | 9 | 0.1% |
1117000000 | 29 | 0.3% |
1120000000 | 24 | 0.2% |
1121500000 | 32 | 0.3% |
1123000000 | 26 | 0.3% |
1126000000 | 22 | 0.2% |
1129000000 | 27 | 0.3% |
1130500000 | 19 | 0.2% |
1132000000 | 18 | 0.2% |
Value | Count | Frequency (%) |
5013000000 | 246 | 2.5% |
5011000000 | 229 | 2.3% |
4889000000 | 19 | 0.2% |
4888000000 | 30 | 0.3% |
4887000000 | 8 | 0.1% |
4886000000 | 18 | 0.2% |
4885000000 | 12 | 0.1% |
4882000000 | 1 | < 0.1% |
4874000000 | 1 | < 0.1% |
4873000000 | 13 | 0.1% |
도엽코드
Text
Distinct | 9429 |
---|---|
Distinct (%) | 94.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
ni52-2 | 8 | 0.1% |
nj52-10 | 8 | 0.1% |
nj52-7 | 7 | 0.1% |
377092206 | 4 | < 0.1% |
377051646 | 4 | < 0.1% |
377090366 | 4 | < 0.1% |
376121693 | 4 | < 0.1% |
nj52-4 | 3 | < 0.1% |
367061909 | 3 | < 0.1% |
358031171 | 3 | < 0.1% |
Other values (9419) | 9952 | 99.5% |
Most occurring characters
Value | Count | Frequency (%) |
3 | 14587 | 16.6% |
0 | 13837 | 15.8% |
1 | 11721 | 13.4% |
7 | 10345 | 11.8% |
6 | 9871 | 11.3% |
5 | 7323 | 8.3% |
2 | 5816 | 6.6% |
8 | 5453 | 6.2% |
9 | 4460 | 5.1% |
4 | 4201 | 4.8% |
Other values (4) | 114 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 87614 | 99.9% |
Uppercase Letter | 76 | 0.1% |
Dash Punctuation | 38 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
3 | 14587 | 16.6% |
0 | 13837 | 15.8% |
1 | 11721 | 13.4% |
7 | 10345 | 11.8% |
6 | 9871 | 11.3% |
5 | 7323 | 8.4% |
2 | 5816 | 6.6% |
8 | 5453 | 6.2% |
9 | 4460 | 5.1% |
4 | 4201 | 4.8% |
Uppercase Letter
Value | Count | Frequency (%) |
N | 38 | 50.0% |
J | 25 | 32.9% |
I | 13 | 17.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 38 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 87652 | 99.9% |
Latin | 76 | 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
3 | 14587 | 16.6% |
0 | 13837 | 15.8% |
1 | 11721 | 13.4% |
7 | 10345 | 11.8% |
6 | 9871 | 11.3% |
5 | 7323 | 8.4% |
2 | 5816 | 6.6% |
8 | 5453 | 6.2% |
9 | 4460 | 5.1% |
4 | 4201 | 4.8% |
Latin
Value | Count | Frequency (%) |
N | 38 | 50.0% |
J | 25 | 32.9% |
I | 13 | 17.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 87728 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3 | 14587 | 16.6% |
0 | 13837 | 15.8% |
1 | 11721 | 13.4% |
7 | 10345 | 11.8% |
6 | 9871 | 11.3% |
5 | 7323 | 8.3% |
2 | 5816 | 6.6% |
8 | 5453 | 6.2% |
9 | 4460 | 5.1% |
4 | 4201 | 4.8% |
Other values (4) | 114 | 0.1% |
도엽명
Text
Alerts: MISSING
Distinct | 8956 |
---|---|
Distinct (%) | 91.6% |
Missing | 228 |
Missing (%) | 2.3% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
마산 | 41 | 0.4% |
창원 | 29 | 0.3% |
안산시 | 26 | 0.3% |
원주 | 22 | 0.2% |
부산 | 21 | 0.2% |
화성 | 21 | 0.2% |
서귀포시 | 20 | 0.2% |
안양 | 17 | 0.2% |
예안 | 16 | 0.2% |
광주 | 13 | 0.1% |
Other values (8947) | 9587 | 97.7% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 6878 | 12.4% |
1 | 6108 | 11.0% |
2 | 4316 | 7.8% |
3 | 2789 | 5.0% |
4 | 2779 | 5.0% |
5 | 2748 | 4.9% |
8 | 2503 | 4.5% |
9 | 2496 | 4.5% |
7 | 2477 | 4.5% |
6 | 2423 | 4.4% |
Other values (173) | 20023 | 36.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 35517 | 63.9% |
Other Letter | 19914 | 35.9% |
Space Separator | 109 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 1499 | 7.5% |
양 | 1262 | 6.3% |
산 | 1096 | 5.5% |
원 | 803 | 4.0% |
울 | 727 | 3.7% |
성 | 716 | 3.6% |
안 | 690 | 3.5% |
서 | 686 | 3.4% |
천 | 650 | 3.3% |
동 | 648 | 3.3% |
Other values (162) | 11137 | 55.9% |
Decimal Number
Value | Count | Frequency (%) |
0 | 6878 | 19.4% |
1 | 6108 | 17.2% |
2 | 4316 | 12.2% |
3 | 2789 | 7.9% |
4 | 2779 | 7.8% |
5 | 2748 | 7.7% |
8 | 2503 | 7.0% |
9 | 2496 | 7.0% |
7 | 2477 | 7.0% |
6 | 2423 | 6.8% |
Space Separator
Value | Count | Frequency (%) |
(space) | 109 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 35626 | 64.1% |
Hangul | 19914 | 35.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 1499 | 7.5% |
양 | 1262 | 6.3% |
산 | 1096 | 5.5% |
원 | 803 | 4.0% |
울 | 727 | 3.7% |
성 | 716 | 3.6% |
안 | 690 | 3.5% |
서 | 686 | 3.4% |
천 | 650 | 3.3% |
동 | 648 | 3.3% |
Other values (162) | 11137 | 55.9% |
Common
Value | Count | Frequency (%) |
0 | 6878 | 19.3% |
1 | 6108 | 17.1% |
2 | 4316 | 12.1% |
3 | 2789 | 7.8% |
4 | 2779 | 7.8% |
5 | 2748 | 7.7% |
8 | 2503 | 7.0% |
9 | 2496 | 7.0% |
7 | 2477 | 7.0% |
6 | 2423 | 6.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35626 | 64.1% |
Hangul | 19914 | 35.9% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 6878 | 19.3% |
1 | 6108 | 17.1% |
2 | 4316 | 12.1% |
3 | 2789 | 7.8% |
4 | 2779 | 7.8% |
5 | 2748 | 7.7% |
8 | 2503 | 7.0% |
9 | 2496 | 7.0% |
7 | 2477 | 7.0% |
6 | 2423 | 6.8% |
Hangul
Value | Count | Frequency (%) |
주 | 1499 | 7.5% |
양 | 1262 | 6.3% |
산 | 1096 | 5.5% |
원 | 803 | 4.0% |
울 | 727 | 3.7% |
성 | 716 | 3.6% |
안 | 690 | 3.5% |
서 | 686 | 3.4% |
천 | 650 | 3.3% |
동 | 648 | 3.3% |
Other values (162) | 11137 | 55.9% |
중간X값
Real number (ℝ)
Distinct | 276 |
---|---|
Distinct (%) | 2.8% |
Missing | 7 |
Missing (%) | 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1005413.3 |
Minimum | 837025 |
---|---|
Maximum | 1339430 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 837025 |
---|---|
5-th percentile | 910860 |
Q1 | 943118 |
median | 981368 |
Q3 | 1076298 |
95-th percentile | 1152038 |
Maximum | 1339430 |
Range | 502405 |
Interquartile range (IQR) | 133180 |
Descriptive statistics
Standard deviation | 81376.264 |
---|---|
Coefficient of variation (CV) | 0.080938125 |
Kurtosis | -0.80944729 |
Mean | 1005413.3 |
Median Absolute Deviation (MAD) | 49439 |
Skewness | 0.54433312 |
Sum | 1.0047095 × 10^10 |
Variance | 6.6220964 × 10^9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1002039 | 325 | 3.2% |
1152038 | 296 | 3.0% |
912313 | 246 | 2.5% |
912423 | 229 | 2.3% |
941859 | 222 | 2.2% |
1076298 | 195 | 1.9% |
976250 | 179 | 1.8% |
1092502 | 152 | 1.5% |
931929 | 133 | 1.3% |
1099843 | 132 | 1.3% |
Other values (266) | 7884 | 78.8% |
Value | Count | Frequency (%) |
837025 | 87 | 0.9% |
864750 | 38 | 0.4% |
870538 | 52 | 0.5% |
879452 | 24 | 0.2% |
892285 | 21 | 0.2% |
896410 | 47 | 0.5% |
896979 | 38 | 0.4% |
900746 | 25 | 0.2% |
904663 | 97 | 1.0% |
904664 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1339430 | 1 | < 0.1% |
1174374 | 35 | |
1173992 | 42 | 0.4% |
1173990 | 1 | < 0.1% |
1170048 | 86 | 0.9% |
1164497 | 45 | |
1163388 | 29 | 0.3% |
1161258 | 4 | < 0.1% |
1160927 | 50 | 0.5% |
1157768 | 67 | 0.7% |
중간Y값
Real number (ℝ)
Distinct | 277 |
---|---|
Distinct (%) | 2.8% |
Missing | 7 |
Missing (%) | 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1809339.6 |
Minimum | 1478904 |
---|---|
Maximum | 2033039 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1478904 |
---|---|
5-th percentile | 1597927 |
Q1 | 1715100 |
median | 1827831 |
Q3 | 1925193 |
95-th percentile | 1962468 |
Maximum | 2033039 |
Range | 554135 |
Interquartile range (IQR) | 210093 |
Descriptive statistics
Standard deviation | 126898.19 |
---|---|
Coefficient of variation (CV) | 0.070135086 |
Kurtosis | -0.30305282 |
Mean | 1809339.6 |
Median Absolute Deviation (MAD) | 103186 |
Skewness | -0.57339783 |
Sum | 1.808073 × 10^10 |
Variance | 1.610315 × 10^10 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1844048 | 326 | 3.3% |
1727047 | 296 | 3.0% |
1478904 | 246 | 2.5% |
1517440 | 229 | 2.3% |
1906899 | 222 | 2.2% |
1962468 | 205 | 2.1% |
1801900 | 195 | 1.9% |
1960416 | 179 | 1.8% |
1753601 | 152 | 1.5% |
1685716 | 133 | 1.3% |
Other values (267) | 7810 | 78.1% |
Value | Count | Frequency (%) |
1478904 | 246 | 2.5% |
1517440 | 229 | 2.3% |
1580278 | 19 | 0.2% |
1597927 | 52 | 0.5% |
1606482 | 84 | 0.8% |
1615360 | 23 | 0.2% |
1619844 | 49 | 0.5% |
1624922 | 16 | 0.2% |
1628068 | 87 | 0.9% |
1628566 | 10 | 0.1% |
Value | Count | Frequency (%) |
2033039 | 7 | 0.1% |
2019906 | 15 | 0.1% |
2012719 | 6 | 0.1% |
2005105 | 30 | 0.3% |
2004423 | 1 | < 0.1% |
2001880 | 21 | 0.2% |
1995709 | 77 | 0.8% |
1991281 | 17 | 0.2% |
1986726 | 56 | 0.6% |
1986724 | 1 | < 0.1% |
지도종류
Categorical
Alerts: HIGH CORRELATION
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 102 |
---|---|
2nd row | 101 |
3rd row | 101 |
4th row | 101 |
5th row | 101 |
Common Values
Value | Count | Frequency (%) |
101 | 7083 | 70.8% |
102 | 1556 | 15.6% |
103 | 1354 | 13.5% |
105 | 7 | 0.1% |
Variable | 축척 | 시군구코드 | 중간X값 | 중간Y값 | 지도종류 |
---|---|---|---|---|---|
축척 | 1.000 | 0.426 | 0.287 | 0.358 | 0.915 |
시군구코드 | 0.426 | 1.000 | 0.835 | 0.753 | 0.481 |
중간X값 | 0.287 | 0.835 | 1.000 | 0.622 | 0.308 |
중간Y값 | 0.358 | 0.753 | 0.622 | 1.000 | 0.437 |
지도종류 | 0.915 | 0.481 | 0.308 | 0.437 | 1.000 |
Variable | 축척 | 지도종류 |
---|---|---|
축척 | 1.000 | 0.616 |
지도종류 | 0.616 | 1.000 |
Variable | 시군구코드 | 중간X값 | 중간Y값 | 축척 | 지도종류 |
---|---|---|---|---|---|
시군구코드 | 1.000 | 0.055 | -0.492 | 0.201 | 0.232 |
중간X값 | 0.055 | 1.000 | -0.123 | 0.224 | 0.212 |
중간Y값 | -0.492 | -0.123 | 1.000 | 0.231 | 0.296 |
축척 | 0.201 | 0.224 | 0.231 | 1.000 | 0.616 |
지도종류 | 0.232 | 0.212 | 0.296 | 0.616 | 1.000 |
Row index | 축척 | 시군구 | 시군구코드 | 도엽코드 | 도엽명 | 중간X값 | 중간Y값 | 지도종류 |
---|---|---|---|---|---|---|---|---|
80511 | 5000 | 해남군 | 4682000000 | 34607057 | 해남057 | 907123 | 1615360 | 102 |
73501 | 1000 | 청주시 서원구 | 4311200000 | 367062517 | 청주2517 | 994259 | 1838909 | 101 |
44208 | 1000 | 서귀포시 | 5013000000 | 336120525 | 표선0525 | 912313 | 1478904 | 101 |
77105 | 1000 | 전주시 완산구 | 4511100000 | 357012375 | 전주2375 | 966883 | 1754732 | 101 |
88230 | 5000 | 양양군 | 4283000000 | 38815055 | 속초055 | 1097216 | 2001880 | 101 |
73927 | 1000 | 용인시 기흥구 | 4146300000 | 377130303 | 용인0303 | 966393 | 1919086 | 101 |
21133 | 1000 | 천안시 | 4413000000 | 367012434 | 평택2434 | 974479 | 1866532 | 103 |
78886 | 5000 | 보성군 | 4678000000 | 34705013 | 회천013 | 974904 | 1646371 | 102 |
61891 | 1000 | 이천시 | 4150000000 | 377102597 | 이천2597 | 998579 | 1911410 | 101 |
33897 | 1000 | 여수시 | 4613000000 | 347031311 | 광양1311 | 1005636 | 1606482 | 101 |
Row index | 축척 | 시군구 | 시군구코드 | 도엽코드 | 도엽명 | 중간X값 | 중간Y값 | 지도종류 |
---|---|---|---|---|---|---|---|---|
85457 | 5000 | 서구 | 2826000000 | 37611018 | 인천018 | 925385 | 1950975 | 101 |
55165 | 1000 | 화성시 | 4159000000 | 376160365 | 남양0365 | 941859 | 1906899 | 101 |
53619 | 1000 | 유성구 | 3020000000 | 367101231 | 대전1231 | 985051 | 1820544 | 101 |
43309 | 1000 | 양평군 | 4183000000 | 377100505 | 이천0505 | 1006813 | 1946639 | 101 |
75765 | 1000 | 중구 | 2711000000 | 358031372 | 대구1372 | 1098785 | 1763960 | 101 |
90614 | 5000 | 단양군 | 4380000000 | 36802019 | 단양019 | 1083106 | 1887217 | 101 |
38466 | 1000 | 서귀포시 | 5013000000 | 336101449 | 모슬포1449 | 912313 | 1478904 | 101 |
29019 | 1000 | 구미시 | 4719000000 | 368151111 | <NA> | 1076298 | 1801900 | 101 |
18750 | 1000 | 시흥시 | 4139000000 | 376120662 | 안양0662 | 932975 | 1932624 | 103 |
48822 | 1000 | 남양주시 | 4136000000 | 377061711 | 양수1711 | 976250 | 1960416 | 101 |