Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 4488 |
Missing cells (%) | 5.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 761.7 KiB |
Average record size in memory | 78.0 B |
Variable types
Numeric | 5 |
---|---|
Text | 1 |
Unsupported | 1 |
Categorical | 1 |
Dataset
Description | 국토지리정보원의 수치지도(수치지형도) 관련 메타데이터 중 도엽주소매칭 정보입니다. (축척, 도엽명, 도엽번호, 최대값X, 최대값Y 등) |
---|---|
Author | 국토교통부 국토지리정보원 |
URL | https://www.data.go.kr/data/15067688/fileData.do |
최대값X is highly overall correlated with 최소값X | High correlation |
최대값Y is highly overall correlated with 최소값Y | High correlation |
최소값X is highly overall correlated with 최대값X | High correlation |
최소값Y is highly overall correlated with 최대값Y | High correlation |
도엽명 has 444 (4.4%) missing values | Missing |
최대값X has 1011 (10.1%) missing values | Missing |
최대값Y has 1011 (10.1%) missing values | Missing |
최소값X has 1011 (10.1%) missing values | Missing |
최소값Y has 1011 (10.1%) missing values | Missing |
축척 is highly skewed (γ1 = 25.01129627) | Skewed |
도엽번호 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-12 08:57:17.155998 |
---|---|
Analysis finished | 2023-12-12 08:57:22.778101 |
Duration | 5.62 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
축척
Real number (ℝ)
SKEWED
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3253.25 |
Minimum | 1000 |
---|---|
Maximum | 250000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1000 |
---|---|
5-th percentile | 1000 |
Q1 | 1000 |
median | 1000 |
Q3 | 5000 |
95-th percentile | 5000 |
Maximum | 250000 |
Range | 249000 |
Interquartile range (IQR) | 4000 |
Descriptive statistics
Standard deviation | 6351.0844 |
---|---|
Coefficient of variation (CV) | 1.9522276 |
Kurtosis | 918.41393 |
Mean | 3253.25 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 25.011296 |
Sum | 32532500 |
Variance | 40336273 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1000 | 5515 | |
5000 | 4253 | |
25000 | 131 | 1.3% |
2500 | 71 | 0.7% |
50000 | 26 | 0.3% |
250000 | 4 | < 0.1% |
Value | Count | Frequency (%) |
1000 | 5515 | |
2500 | 71 | 0.7% |
5000 | 4253 | |
25000 | 131 | 1.3% |
50000 | 26 | 0.3% |
250000 | 4 | < 0.1% |
Value | Count | Frequency (%) |
250000 | 4 | < 0.1% |
50000 | 26 | 0.3% |
25000 | 131 | 1.3% |
5000 | 4253 | |
2500 | 71 | 0.7% |
1000 | 5515 |
도엽명
Text
MISSING
 
Distinct | 6539 |
---|---|
Distinct (%) | 68.4% |
Missing | 444 |
Missing (%) | 4.4% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
원주 | 68 | 0.7% |
양산 | 59 | 0.6% |
김포 | 50 | 0.5% |
창원 | 48 | 0.5% |
언양 | 46 | 0.5% |
마산 | 45 | 0.5% |
화성 | 38 | 0.4% |
광양 | 36 | 0.4% |
구정 | 35 | 0.4% |
광주 | 34 | 0.4% |
Other values (6529) | 9133 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 5245 | 11.9% |
1 | 3853 | 8.7% |
2 | 2954 | 6.7% |
3 | 2036 | 4.6% |
5 | 1815 | 4.1% |
4 | 1803 | 4.1% |
8 | 1783 | 4.0% |
7 | 1724 | 3.9% |
9 | 1713 | 3.9% |
6 | 1605 | 3.6% |
Other values (191) | 19595 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 24531 | |
Other Letter | 19448 | |
Space Separator | 146 | 0.3% |
Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 1206 | 6.2% |
주 | 1059 | 5.4% |
양 | 1020 | 5.2% |
천 | 819 | 4.2% |
성 | 610 | 3.1% |
안 | 604 | 3.1% |
원 | 599 | 3.1% |
포 | 513 | 2.6% |
전 | 490 | 2.5% |
동 | 476 | 2.4% |
Other values (179) | 12052 |
Decimal Number
Value | Count | Frequency (%) |
0 | 5245 | |
1 | 3853 | |
2 | 2954 | |
3 | 2036 | 8.3% |
5 | 1815 | 7.4% |
4 | 1803 | 7.3% |
8 | 1783 | 7.3% |
7 | 1724 | 7.0% |
9 | 1713 | 7.0% |
6 | 1605 | 6.5% |
Space Separator
Value | Count | Frequency (%) |
146 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 24678 | |
Hangul | 19448 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 1206 | 6.2% |
주 | 1059 | 5.4% |
양 | 1020 | 5.2% |
천 | 819 | 4.2% |
성 | 610 | 3.1% |
안 | 604 | 3.1% |
원 | 599 | 3.1% |
포 | 513 | 2.6% |
전 | 490 | 2.5% |
동 | 476 | 2.4% |
Other values (179) | 12052 |
Common
Value | Count | Frequency (%) |
0 | 5245 | |
1 | 3853 | |
2 | 2954 | |
3 | 2036 | 8.3% |
5 | 1815 | 7.4% |
4 | 1803 | 7.3% |
8 | 1783 | 7.2% |
7 | 1724 | 7.0% |
9 | 1713 | 6.9% |
6 | 1605 | 6.5% |
Other values (2) | 147 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 24678 | |
Hangul | 19448 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 5245 | |
1 | 3853 | |
2 | 2954 | |
3 | 2036 | 8.3% |
5 | 1815 | 7.4% |
4 | 1803 | 7.3% |
8 | 1783 | 7.2% |
7 | 1724 | 7.0% |
9 | 1713 | 6.9% |
6 | 1605 | 6.5% |
Other values (2) | 147 | 0.6% |
Hangul
Value | Count | Frequency (%) |
산 | 1206 | 6.2% |
주 | 1059 | 5.4% |
양 | 1020 | 5.2% |
천 | 819 | 4.2% |
성 | 610 | 3.1% |
안 | 604 | 3.1% |
원 | 599 | 3.1% |
포 | 513 | 2.6% |
전 | 490 | 2.5% |
동 | 476 | 2.4% |
Other values (179) | 12052 |
도엽번호
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
최대값X
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 8149 |
---|---|
Distinct (%) | 90.7% |
Missing | 1011 |
Missing (%) | 10.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1020824 |
Minimum | 780779 |
---|---|
Maximum | 1388222 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 780779 |
---|---|
5-th percentile | 901213 |
Q1 | 944723 |
median | 1004533 |
Q3 | 1098080 |
95-th percentile | 1159094.4 |
Maximum | 1388222 |
Range | 607443 |
Interquartile range (IQR) | 153357 |
Descriptive statistics
Standard deviation | 85983.462 |
---|---|
Coefficient of variation (CV) | 0.084229471 |
Kurtosis | -1.1349459 |
Mean | 1020824 |
Median Absolute Deviation (MAD) | 70979 |
Skewness | 0.20791729 |
Sum | 9.1761866 × 109 |
Variance | 7.3931558 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1000000 | 56 | 0.6% |
999553 | 5 | 0.1% |
1000457 | 5 | 0.1% |
1000913 | 5 | 0.1% |
998659 | 4 | < 0.1% |
999557 | 4 | < 0.1% |
1003653 | 4 | < 0.1% |
1071228 | 3 | < 0.1% |
1002219 | 3 | < 0.1% |
993294 | 3 | < 0.1% |
Other values (8139) | 8897 | |
(Missing) | 1011 | 10.1% |
Value | Count | Frequency (%) |
780779 | 1 | |
789275 | 2 | |
805487 | 1 | |
809892 | 1 | |
814586 | 2 | |
816710 | 1 | |
827930 | 1 | |
846452 | 1 | |
848066 | 1 | |
848698 | 1 |
Value | Count | Frequency (%) |
1388222 | 1 | |
1300773 | 1 | |
1296250 | 2 | |
1293940 | 1 | |
1293842 | 1 | |
1222460 | 1 | |
1207051 | 1 | |
1189221 | 2 | |
1187087 | 1 | |
1185722 | 1 |
최대값Y
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 7859 |
---|---|
Distinct (%) | 87.4% |
Missing | 1011 |
Missing (%) | 10.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1791197.6 |
Minimum | 1465550 |
---|---|
Maximum | 2558033 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1465550 |
---|---|
5-th percentile | 1507269 |
Q1 | 1690803 |
median | 1792722 |
Q3 | 1914299 |
95-th percentile | 1966722.8 |
Maximum | 2558033 |
Range | 1092483 |
Interquartile range (IQR) | 223496 |
Descriptive statistics
Standard deviation | 130171.3 |
---|---|
Coefficient of variation (CV) | 0.072672772 |
Kurtosis | -0.38114472 |
Mean | 1791197.6 |
Median Absolute Deviation (MAD) | 108612 |
Skewness | -0.36020926 |
Sum | 1.6101076 × 1010 |
Variance | 1.6944566 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1914018 | 6 | 0.1% |
1915682 | 6 | 0.1% |
1661129 | 6 | 0.1% |
1920119 | 5 | 0.1% |
1695210 | 4 | < 0.1% |
1664456 | 4 | < 0.1% |
1921229 | 4 | < 0.1% |
1825278 | 4 | < 0.1% |
1708880 | 4 | < 0.1% |
1642853 | 4 | < 0.1% |
Other values (7849) | 8942 | |
(Missing) | 1011 | 10.1% |
Value | Count | Frequency (%) |
1465550 | 1 | |
1471121 | 1 | |
1471432 | 1 | |
1471437 | 1 | |
1471441 | 1 | |
1471446 | 1 | |
1471610 | 1 | |
1471676 | 1 | |
1471964 | 1 | |
1471982 | 1 |
Value | Count | Frequency (%) |
2558033 | 1 | |
2042082 | 1 | |
2041971 | 1 | |
2036607 | 1 | |
2036582 | 1 | |
2036557 | 1 | |
2036533 | 1 | |
2036487 | 1 | |
2033833 | 1 | |
2033670 | 1 |
최소값X
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 8157 |
---|---|
Distinct (%) | 90.7% |
Missing | 1011 |
Missing (%) | 10.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1019205.1 |
Minimum | 680551 |
---|---|
Maximum | 1385875 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 680551 |
---|---|
5-th percentile | 899582.8 |
Q1 | 943723 |
median | 1002283 |
Q3 | 1096088 |
95-th percentile | 1157767 |
Maximum | 1385875 |
Range | 705324 |
Interquartile range (IQR) | 152365 |
Descriptive statistics
Standard deviation | 86025.71 |
---|---|
Coefficient of variation (CV) | 0.084404711 |
Kurtosis | -1.1046918 |
Mean | 1019205.1 |
Median Absolute Deviation (MAD) | 70326 |
Skewness | 0.2032475 |
Sum | 9.1616344 × 109 |
Variance | 7.4004228 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1000000 | 69 | 0.7% |
1000913 | 5 | 0.1% |
999113 | 5 | 0.1% |
999106 | 5 | 0.1% |
999544 | 4 | < 0.1% |
1000456 | 4 | < 0.1% |
998212 | 4 | < 0.1% |
999543 | 4 | < 0.1% |
1003196 | 4 | < 0.1% |
1098365 | 3 | < 0.1% |
Other values (8147) | 8882 | |
(Missing) | 1011 | 10.1% |
Value | Count | Frequency (%) |
680551 | 1 | |
778406 | 1 | |
786920 | 2 | |
793733 | 1 | |
807544 | 1 | |
812240 | 2 | |
814363 | 1 | |
825639 | 1 | |
838682 | 1 | |
840127 | 1 |
Value | Count | Frequency (%) |
1385875 | 1 | |
1298461 | 1 | |
1293940 | 2 | |
1291632 | 1 | |
1291534 | 1 | |
1186909 | 2 | |
1185260 | 1 | |
1185248 | 1 | |
1184774 | 1 | |
1182405 | 1 |
최소값Y
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 7803 |
---|---|
Distinct (%) | 86.8% |
Missing | 1011 |
Missing (%) | 10.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1789254.2 |
Minimum | 1462619 |
---|---|
Maximum | 2444082 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1462619 |
---|---|
5-th percentile | 1506680.2 |
Q1 | 1689863 |
median | 1791630 |
Q3 | 1912384 |
95-th percentile | 1964151.4 |
Maximum | 2444082 |
Range | 981463 |
Interquartile range (IQR) | 222521 |
Descriptive statistics
Standard deviation | 130005.71 |
---|---|
Coefficient of variation (CV) | 0.072659163 |
Kurtosis | -0.44566627 |
Mean | 1789254.2 |
Median Absolute Deviation (MAD) | 108428 |
Skewness | -0.36628596 |
Sum | 1.6083606 × 1010 |
Variance | 1.6901486 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1656139 | 6 | 0.1% |
1915127 | 6 | 0.1% |
1660575 | 6 | 0.1% |
1711588 | 5 | 0.1% |
1844686 | 5 | 0.1% |
1660020 | 5 | 0.1% |
1919565 | 5 | 0.1% |
1846904 | 4 | < 0.1% |
1847459 | 4 | < 0.1% |
1661683 | 4 | < 0.1% |
Other values (7793) | 8939 | |
(Missing) | 1011 | 10.1% |
Value | Count | Frequency (%) |
1462619 | 1 | |
1462751 | 1 | |
1470561 | 1 | |
1470783 | 1 | |
1470847 | 1 | |
1470873 | 1 | |
1470878 | 1 | |
1470883 | 1 | |
1470887 | 1 | |
1470939 | 1 |
Value | Count | Frequency (%) |
2444082 | 1 | |
2039284 | 1 | |
2033808 | 1 | |
2033783 | 1 | |
2033759 | 1 | |
2033736 | 1 | |
2033691 | 1 | |
2031034 | 1 | |
2030875 | 1 | |
2028260 | 1 |
지도종류
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
101 | |
---|---|
102 | |
103 | |
104 | 44 |
105 | 2 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 101 |
---|---|
2nd row | 102 |
3rd row | 101 |
4th row | 102 |
5th row | 103 |
Common Values
Value | Count | Frequency (%) |
101 | 4283 | |
102 | 3987 | |
103 | 1684 | 16.8% |
104 | 44 | 0.4% |
105 | 2 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
101 | 4283 | |
102 | 3987 | |
103 | 1684 | 16.8% |
104 | 44 | 0.4% |
105 | 2 | < 0.1% |
축척 | 최대값X | 최대값Y | 최소값X | 최소값Y | 지도종류 | |
---|---|---|---|---|---|---|
축척 | 1.000 | 0.651 | 0.464 | 0.084 | 0.464 | 0.080 |
최대값X | 0.651 | 1.000 | 0.619 | 0.958 | 0.621 | 0.074 |
최대값Y | 0.464 | 0.619 | 1.000 | 0.385 | 0.980 | 0.085 |
최소값X | 0.084 | 0.958 | 0.385 | 1.000 | 0.402 | 0.071 |
최소값Y | 0.464 | 0.621 | 0.980 | 0.402 | 1.000 | 0.062 |
지도종류 | 0.080 | 0.074 | 0.085 | 0.071 | 0.062 | 1.000 |
축척 | 최대값X | 최대값Y | 최소값X | 최소값Y | 지도종류 | |
---|---|---|---|---|---|---|
축척 | 1.000 | 0.027 | 0.042 | 0.012 | 0.028 | 0.040 |
최대값X | 0.027 | 1.000 | 0.051 | 0.999 | 0.052 | 0.045 |
최대값Y | 0.042 | 0.051 | 1.000 | 0.051 | 1.000 | 0.059 |
최소값X | 0.012 | 0.999 | 0.051 | 1.000 | 0.052 | 0.045 |
최소값Y | 0.028 | 0.052 | 1.000 | 0.052 | 1.000 | 0.043 |
지도종류 | 0.040 | 0.045 | 0.059 | 0.045 | 0.043 | 1.000 |
축척 | 도엽명 | 도엽번호 | 최대값X | 최대값Y | 최소값X | 최소값Y | 지도종류 | |
---|---|---|---|---|---|---|---|---|
23554 | 1000 | 당진0430 | 366030430 | 928809 | 1888251 | 928360 | 1887692 | 101 |
49346 | 5000 | 공주077 | 36709077 | 970829 | 1814238 | 968575 | 1811458 | 102 |
787 | 5000 | 삼가 | 35809092 | 1050036 | 1697864 | 1047747 | 1695079 | 101 |
11480 | 1000 | 김포2561 | 376072561 | 929748 | 1947046 | 929301 | 1946487 | 102 |
45618 | 1000 | 서울2374 | 376082374 | <NA> | <NA> | <NA> | <NA> | 103 |
68902 | 5000 | 동곡 | 35808033 | 1119948 | 1742896 | 1117648 | 1740093 | 101 |
22891 | 1000 | 밀양2144 | 358122144 | 1115504 | 1699019 | 1115042 | 1698459 | 102 |
70365 | 1000 | 관기2063 | 367122063 | 1041742 | 1813727 | 1041291 | 1813171 | 102 |
66485 | 1000 | 창원2190 | 358112190 | 1095518 | 1696567 | 1095057 | 1696008 | 102 |
75086 | 1000 | 대부0198 | 376150198 | 914812 | 1912235 | 914362 | 1911676 | 102 |
축척 | 도엽명 | 도엽번호 | 최대값X | 최대값Y | 최소값X | 최소값Y | 지도종류 | |
---|---|---|---|---|---|---|---|---|
33810 | 1000 | 울산1814 | 359061814 | 1169417 | 1734824 | 1168953 | 1734261 | 101 |
93333 | 5000 | 익산076 | 35604076 | 945804 | 1758904 | 943528 | 1756117 | 102 |
77969 | 1000 | 이천2583 | 377102583 | 996896 | 1917901 | 996453 | 1917347 | 102 |
73688 | 1000 | 안양1039 | 376121039 | 955324 | 1937437 | 954879 | 1936880 | 102 |
60810 | 5000 | 순천 | 34702097 | 993137 | 1642281 | 990847 | 1639507 | 101 |
17766 | 1000 | 구미1339 | 368141339 | 1080533 | 1793478 | 1080078 | 1792919 | 101 |
7184 | 5000 | 김천 | 36813062 | 1049523 | 1789365 | 1047257 | 1786580 | 101 |
72454 | 1000 | 거제1802 | 348031802 | 1101482 | 1651156 | 1101019 | 1650596 | 102 |
95269 | 5000 | 함안 | 35814012 | 1072824 | 1692473 | 1070527 | 1689683 | 102 |
35670 | 1000 | 엄정0828 | 377160828 | 1034616 | 1910206 | 1034170 | 1909650 | 101 |