Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 585.9 KiB |
Average record size in memory | 60.0 B |
Variable types
Numeric | 4 |
---|---|
Text | 2 |
Dataset
Description | Postal code (new address) data held in the system DB of 도립거창대학교 (Provincial Geochang College). It includes number, postal code, address, and province/city (시도) fields. |
---|---|
Author | 경상남도 |
URL | https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15049414 |
Reproduction
Analysis started | 2024-04-16 22:38:12.028072 |
---|---|
Analysis finished | 2024-04-16 22:38:14.585891 |
Duration | 2.56 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
번호
Real number (ℝ)
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 25063.939 |
Minimum | 5 |
---|---|
Maximum | 492019 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 2572.85 |
Q1 | 12446 |
median | 24861 |
Q3 | 37119.5 |
95-th percentile | 46956.45 |
Maximum | 492019 |
Range | 492014 |
Interquartile range (IQR) | 24673.5 |
Descriptive statistics
Standard deviation | 18521.1 |
---|---|
Coefficient of variation (CV) | 0.73895409 |
Kurtosis | 245.06809 |
Mean | 25063.939 |
Median Absolute Deviation (MAD) | 12330 |
Skewness | 10.058258 |
Sum | 2.5063939 × 10^8 |
Variance | 3.4303116 × 10^8 |
Monotonicity | Not monotonic |
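Several of the figures above are derived from the more basic ones by simple arithmetic. As a quick consistency check (a sketch, not part of the generated report), the snippet below recomputes the range, IQR, coefficient of variation, and variance for 번호 from the reported minimum, maximum, quartiles, mean, and standard deviation. The variable names are illustrative. Because the report shows the standard deviation rounded to one decimal place, the recomputed variance agrees only to about six significant digits.

```python
import math

# Values copied from the report for the 번호 column.
minimum, maximum = 5, 492019
q1, q3 = 12446, 37119.5
mean, std = 25063.939, 18521.1  # std is shown rounded in the report

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (Q3 - Q1)
cv = std / mean                  # Coefficient of variation
variance = std ** 2              # Variance (square of the standard deviation)

print(value_range, iqr, round(cv, 4), f"{variance:.4e}")
```

The same relations hold for the other numeric columns (우편번호1, 우편번호2, 시도), so the check can be repeated by substituting their reported values.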
Value | Count | Frequency (%) |
16316 | 1 | < 0.1% |
46470 | 1 | < 0.1% |
33807 | 1 | < 0.1% |
16824 | 1 | < 0.1% |
18699 | 1 | < 0.1% |
36202 | 1 | < 0.1% |
31911 | 1 | < 0.1% |
2910 | 1 | < 0.1% |
44826 | 1 | < 0.1% |
31626 | 1 | < 0.1% |
Other values (9990) | 9990 | 99.9% |
Value | Count | Frequency (%) |
5 | 1 | < 0.1% |
7 | 1 | < 0.1% |
10 | 1 | < 0.1% |
12 | 1 | < 0.1% |
13 | 1 | < 0.1% |
15 | 1 | < 0.1% |
21 | 1 | < 0.1% |
22 | 1 | < 0.1% |
26 | 1 | < 0.1% |
37 | 1 | < 0.1% |
Value | Count | Frequency (%) |
492019 | 1 | < 0.1% |
492018 | 1 | < 0.1% |
492015 | 1 | < 0.1% |
492008 | 1 | < 0.1% |
492005 | 1 | < 0.1% |
429022 | 1 | < 0.1% |
429019 | 1 | < 0.1% |
49199 | 1 | < 0.1% |
49194 | 1 | < 0.1% |
49181 | 1 | < 0.1% |
우편번호1
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 254 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 460.7341 |
Minimum | 100 |
---|---|
Maximum | 799 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 100 |
---|---|
5-th percentile | 133 |
Q1 | 325 |
median | 472 |
Q3 | 617 |
95-th percentile | 755 |
Maximum | 799 |
Range | 699 |
Interquartile range (IQR) | 292 |
Descriptive statistics
Standard deviation | 198.20943 |
---|---|
Coefficient of variation (CV) | 0.43020352 |
Kurtosis | -1.0134427 |
Mean | 460.7341 |
Median Absolute Deviation (MAD) | 145 |
Skewness | -0.26832714 |
Sum | 4607341 |
Variance | 39286.979 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
135 | 117 | 1.2% |
702 | 96 | 1.0% |
139 | 95 | 0.9% |
706 | 88 | 0.9% |
472 | 85 | 0.9% |
151 | 85 | 0.9% |
100 | 82 | 0.8% |
704 | 80 | 0.8% |
363 | 79 | 0.8% |
132 | 76 | 0.8% |
Other values (244) | 9117 |
Value | Count | Frequency (%) |
100 | 82 | |
110 | 55 | |
120 | 49 | |
121 | 69 | |
122 | 50 | |
130 | 47 | |
131 | 56 | |
132 | 76 | |
133 | 39 | |
134 | 76 |
Value | Count | Frequency (%) |
799 | 3 | < 0.1% |
791 | 59 | |
790 | 41 | |
780 | 68 | |
770 | 63 | |
769 | 56 | |
767 | 21 | 0.2% |
766 | 38 | |
764 | 26 | 0.3% |
763 | 21 | 0.2% |
우편번호2
Real number (ℝ)
Distinct | 577 |
---|---|
Distinct (%) | 5.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 760.2692 |
Minimum | 3 |
---|---|
Maximum | 999 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 112 |
Q1 | 762 |
median | 821 |
Q3 | 862 |
95-th percentile | 931 |
Maximum | 999 |
Range | 996 |
Interquartile range (IQR) | 100 |
Descriptive statistics
Standard deviation | 213.61543 |
---|---|
Coefficient of variation (CV) | 0.28097341 |
Kurtosis | 4.9617542 |
Mean | 760.2692 |
Median Absolute Deviation (MAD) | 49 |
Skewness | -2.430658 |
Sum | 7602692 |
Variance | 45631.553 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
821 | 150 | 1.5% |
822 | 146 | 1.5% |
841 | 137 | 1.4% |
832 | 128 | 1.3% |
831 | 127 | 1.3% |
811 | 125 | 1.2% |
842 | 116 | 1.2% |
812 | 113 | 1.1% |
801 | 113 | 1.1% |
804 | 113 | 1.1% |
Other values (567) | 8732 |
Value | Count | Frequency (%) |
3 | 4 | < 0.1% |
10 | 23 | |
11 | 11 | |
12 | 10 | |
13 | 9 | 0.1% |
14 | 3 | < 0.1% |
15 | 1 | < 0.1% |
16 | 2 | < 0.1% |
17 | 3 | < 0.1% |
18 | 3 | < 0.1% |
Value | Count | Frequency (%) |
999 | 1 | < 0.1% |
998 | 1 | < 0.1% |
997 | 1 | < 0.1% |
996 | 1 | < 0.1% |
992 | 3 | |
991 | 4 | |
990 | 3 | |
989 | 1 | < 0.1% |
988 | 2 | |
987 | 2 |
우편번호
Text
Distinct | 8500 |
---|---|
Distinct (%) | 85.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
611-839 | 8 | 0.1% |
536-911 | 6 | 0.1% |
138-873 | 6 | 0.1% |
537-873 | 6 | 0.1% |
487-839 | 6 | 0.1% |
601-803 | 6 | 0.1% |
535-841 | 5 | < 0.1% |
413-911 | 5 | < 0.1% |
395-812 | 5 | < 0.1% |
703-816 | 5 | < 0.1% |
Other values (8490) | 9942 |
Most occurring characters
Value | Count | Frequency (%) |
- | 9561 | |
8 | 7586 | |
1 | 7211 | |
0 | 7063 | |
7 | 6341 | |
3 | 6294 | |
2 | 5643 | |
5 | 5417 | |
6 | 5368 | |
4 | 5319 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 59557 | |
Dash Punctuation | 9561 | 13.8% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
8 | 7586 | |
1 | 7211 | |
0 | 7063 | |
7 | 6341 | |
3 | 6294 | |
2 | 5643 | |
5 | 5417 | |
6 | 5368 | |
4 | 5319 | |
9 | 3315 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9561 |
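The category names in these tables ("Decimal Number", "Dash Punctuation") correspond to Unicode general categories. As a rough illustration of how such counts can be obtained (an assumption about the approach, not the profiler's actual code), the sketch below tallies characters by category using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the two input strings are taken from the sample rows of this report.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count characters by Unicode general category code (e.g. 'Nd', 'Pd')."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

# Two postal codes taken from the sample rows of this report.
counts = category_counts(["719-814", "13461"])
print(counts)  # 'Nd' = Decimal Number, 'Pd' = Dash Punctuation
```

Applied to the 주소 column, the same function would also report codes such as `Lo` (Other Letter) for Hangul syllables and `Zs` (Space Separator) for spaces.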
Most occurring scripts
Value | Count | Frequency (%) |
Common | 69118 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 9561 | |
8 | 7586 | |
1 | 7211 | |
0 | 7063 | |
7 | 6341 | |
3 | 6294 | |
2 | 5643 | |
5 | 5417 | |
6 | 5368 | |
4 | 5319 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 69118 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 9561 | |
8 | 7586 | |
1 | 7211 | |
0 | 7063 | |
7 | 6341 | |
3 | 6294 | |
2 | 5643 | |
5 | 5417 | |
6 | 5368 | |
4 | 5319 |
주소
Text
Distinct | 8399 |
---|---|
Distinct (%) | 84.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 41 |
---|---|
Median length | 38 |
Mean length | 14.8534 |
Min length | 8 |
Characters and Unicode
Total characters | 148534 |
---|---|
Distinct characters | 571 |
Distinct categories | 9 |
Distinct scripts | 3 |
Distinct blocks | 3 |
Unique
Unique | 7585 |
---|---|
Unique (%) | 75.8% |
Sample
1st row | 경북 성주군 초전면 월곡2리 |
---|---|
2nd row | 서울 강동구 둔촌1동 |
3rd row | 경기 김포시 대곶면 상마리 |
4th row | 서울 강서구 방화2동 |
5th row | 경기 수원시 권선구 서둔동 |
Value | Count | Frequency (%) |
서울 | 1577 | 4.2% |
경기 | 1576 | 4.2% |
경북 | 973 | 2.6% |
전남 | 798 | 2.1% |
경남 | 728 | 1.9% |
부산 | 654 | 1.7% |
충남 | 618 | 1.6% |
전북 | 579 | 1.5% |
강원 | 526 | 1.4% |
충북 | 509 | 1.3% |
Other values (8564) | 29317 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 30113 | 20.3% |
동 | 7104 | 4.8% |
구 | 5389 | 3.6% |
시 | 4131 | 2.8% |
리 | 4041 | 2.7% |
경 | 3613 | 2.4% |
남 | 3427 | 2.3% |
면 | 3299 | 2.2% |
산 | 2948 | 2.0% |
1 | 2935 | 2.0% |
Other values (561) | 81534 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 109426 | |
Space Separator | 30113 | 20.3% |
Decimal Number | 8119 | 5.5% |
Dash Punctuation | 668 | 0.4% |
Uppercase Letter | 131 | 0.1% |
Close Punctuation | 28 | < 0.1% |
Open Punctuation | 28 | < 0.1% |
Other Punctuation | 13 | < 0.1% |
Lowercase Letter | 8 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 7104 | 6.5% |
구 | 5389 | 4.9% |
시 | 4131 | 3.8% |
리 | 4041 | 3.7% |
경 | 3613 | 3.3% |
남 | 3427 | 3.1% |
면 | 3299 | 3.0% |
산 | 2948 | 2.7% |
서 | 2883 | 2.6% |
군 | 2825 | 2.6% |
Other values (525) | 69766 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 23 | |
S | 20 | |
L | 18 | |
G | 18 | |
T | 14 | |
C | 11 | |
A | 7 | 5.3% |
B | 6 | 4.6% |
D | 3 | 2.3% |
N | 2 | 1.5% |
Other values (8) | 9 | 6.9% |
Decimal Number
Value | Count | Frequency (%) |
1 | 2935 | |
2 | 1702 | |
0 | 1038 | 12.8% |
3 | 865 | 10.7% |
4 | 509 | 6.3% |
5 | 299 | 3.7% |
6 | 275 | 3.4% |
8 | 185 | 2.3% |
7 | 179 | 2.2% |
9 | 132 | 1.6% |
Other Punctuation
Value | Count | Frequency (%) |
. | 8 | |
· | 3 | 23.1% |
& | 2 | 15.4% |
Space Separator
Value | Count | Frequency (%) |
(space) | 30113 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 668 |
Close Punctuation
Value | Count | Frequency (%) |
) | 28 |
Open Punctuation
Value | Count | Frequency (%) |
( | 28 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 8 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 109426 | |
Common | 38969 | 26.2% |
Latin | 139 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 7104 | 6.5% |
구 | 5389 | 4.9% |
시 | 4131 | 3.8% |
리 | 4041 | 3.7% |
경 | 3613 | 3.3% |
남 | 3427 | 3.1% |
면 | 3299 | 3.0% |
산 | 2948 | 2.7% |
서 | 2883 | 2.6% |
군 | 2825 | 2.6% |
Other values (525) | 69766 |
Latin
Value | Count | Frequency (%) |
K | 23 | |
S | 20 | |
L | 18 | |
G | 18 | |
T | 14 | |
C | 11 | |
e | 8 | 5.8% |
A | 7 | 5.0% |
B | 6 | 4.3% |
D | 3 | 2.2% |
Other values (9) | 11 |
Common
Value | Count | Frequency (%) |
(space) | 30113 | |
1 | 2935 | 7.5% |
2 | 1702 | 4.4% |
0 | 1038 | 2.7% |
3 | 865 | 2.2% |
- | 668 | 1.7% |
4 | 509 | 1.3% |
5 | 299 | 0.8% |
6 | 275 | 0.7% |
8 | 185 | 0.5% |
Other values (7) | 380 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 109426 | |
ASCII | 39105 | 26.3% |
None | 3 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 30113 | |
1 | 2935 | 7.5% |
2 | 1702 | 4.4% |
0 | 1038 | 2.7% |
3 | 865 | 2.2% |
- | 668 | 1.7% |
4 | 509 | 1.3% |
5 | 299 | 0.8% |
6 | 275 | 0.7% |
8 | 185 | 0.5% |
Other values (25) | 516 | 1.3% |
Hangul
Value | Count | Frequency (%) |
동 | 7104 | 6.5% |
구 | 5389 | 4.9% |
시 | 4131 | 3.8% |
리 | 4041 | 3.7% |
경 | 3613 | 3.3% |
남 | 3427 | 3.1% |
면 | 3299 | 3.0% |
산 | 2948 | 2.7% |
서 | 2883 | 2.6% |
군 | 2825 | 2.6% |
Other values (525) | 69766 |
None
Value | Count | Frequency (%) |
· | 3 |
시도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.0519 |
Minimum | 1 |
---|---|
Maximum | 19 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 11 |
Q3 | 15 |
95-th percentile | 18 |
Maximum | 19 |
Range | 18 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 6.1491127 |
---|---|
Coefficient of variation (CV) | 0.61173636 |
Kurtosis | -1.4123621 |
Mean | 10.0519 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.31620787 |
Sum | 100519 |
Variance | 37.811588 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 1577 | |
11 | 1576 | |
18 | 973 | |
15 | 798 | |
17 | 728 | |
2 | 654 | |
13 | 618 | 6.2% |
16 | 579 | 5.8% |
12 | 526 | 5.3% |
14 | 509 | 5.1% |
Other values (6) | 1462 |
Value | Count | Frequency (%) |
1 | 1577 | |
2 | 654 | |
3 | 502 | 5.0% |
4 | 290 | 2.9% |
5 | 241 | 2.4% |
6 | 192 | 1.9% |
7 | 152 | 1.5% |
11 | 1576 | |
12 | 526 | 5.3% |
13 | 618 | 6.2% |
Value | Count | Frequency (%) |
19 | 85 | 0.9% |
18 | 973 | |
17 | 728 | |
16 | 579 | 5.8% |
15 | 798 | |
14 | 509 | 5.1% |
13 | 618 | 6.2% |
12 | 526 | 5.3% |
11 | 1576 | |
7 | 152 | 1.5% |
Variable | 번호 | 우편번호1 | 우편번호2 | 시도 |
---|---|---|---|---|
번호 | 1.000 | 0.053 | 0.108 | 0.072 |
우편번호1 | 0.053 | 1.000 | 0.325 | 0.890 |
우편번호2 | 0.108 | 0.325 | 1.000 | 0.280 |
시도 | 0.072 | 0.890 | 0.280 | 1.000 |
Variable | 번호 | 우편번호1 | 우편번호2 | 시도 |
---|---|---|---|---|
번호 | 1.000 | -0.179 | 0.115 | -0.008 |
우편번호1 | -0.179 | 1.000 | 0.070 | 0.593 |
우편번호2 | 0.115 | 0.070 | 1.000 | 0.241 |
시도 | -0.008 | 0.593 | 0.241 | 1.000 |
Index | 번호 | 우편번호1 | 우편번호2 | 우편번호 | 주소 | 시도 |
---|---|---|---|---|---|---|
45445 | 16316 | 719 | 814 | 719-814 | 경북 성주군 초전면 월곡2리 | 18 |
30279 | 27349 | 134 | 61 | 13461 | 서울 강동구 둔촌1동 | 1 |
1275 | 4149 | 415 | 854 | 415-854 | 경기 김포시 대곶면 상마리 | 11 |
28323 | 28041 | 157 | 854 | 157-854 | 서울 강서구 방화2동 | 1 |
3418 | 5739 | 441 | 855 | 441-855 | 경기 수원시 권선구 서둔동 | 11 |
36673 | 20990 | 711 | 820 | 711-820 | 대구 달성군 하빈면 | 3 |
31991 | 27109 | 135 | 794 | 135-794 | 서울 강남구 압구정2동 한양아파트 1-11 | 1 |
13117 | 45828 | 339 | 801 | 339-801 | 충남 연기군 조치원읍 교리 | 13 |
10074 | 46813 | 350 | 892 | 350-892 | 충남 홍성군 장곡면 광성리 | 13 |
9539 | 46735 | 350 | 870 | 350-870 | 충남 홍성군 결성면 | 13 |
Index | 번호 | 우편번호1 | 우편번호2 | 우편번호 | 주소 | 시도 |
---|---|---|---|---|---|---|
20427 | 34638 | 682 | 712 | 682-712 | 울산 동구 방어동 현대미포조선 | 7 |
47975 | 10091 | 487 | 868 | 487-868 | 경기 포천시 일동면 길명리 사서함 | 11 |
40084 | 15491 | 740 | 833 | 740-833 | 경북 김천시 어모면 도암리 | 18 |
32493 | 29429 | 139 | 815 | 139-815 | 서울 노원구 상계5동 | 1 |
7095 | 4741 | 483 | 30 | 48330 | 경기 동두천시 생연동 | 11 |
33237 | 26838 | 135 | 819 | 135-819 | 서울 강남구 논현2동 | 1 |
8086 | 294 | 210 | 774 | 210-774 | 강원 강릉시 회산동 회산주공아파트 101-106 | 12 |
46912 | 8478 | 449 | 885 | 449-885 | 경기 용인시 처인구 남사면 창리 | 11 |
25498 | 37949 | 534 | 902 | 534-902 | 전남 무안군 일로읍 광암리 | 15 |
7574 | 6606 | 426 | 894 | 426-894 | 경기 안산시 상록구 사1동 | 11 |
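The sample rows suggest that 우편번호 is the concatenation of 우편번호1 and 우편번호2. Most rows join them with a hyphen, but in the two sampled rows where 우편번호2 has only two digits (134 and 61, 483 and 30) the value is written without one (13461, 48330). Below is a minimal sketch for recovering the two parts, assuming the first three digits always form 우편번호1. The function name `split_postal_code` is hypothetical, and the rule is inferred from these samples only.

```python
def split_postal_code(code: str) -> tuple[int, int]:
    """Split a postal code string into (우편번호1, 우편번호2).

    Assumption: the first three digits are 우편번호1 and the remaining
    digits are 우편번호2, whether or not a hyphen separates them.
    """
    digits = code.replace("-", "").strip()
    if not digits.isdigit() or len(digits) < 4:
        raise ValueError(f"unexpected postal code: {code!r}")
    return int(digits[:3]), int(digits[3:])

print(split_postal_code("719-814"))  # (719, 814)
print(split_postal_code("13461"))    # (134, 61)
```

Comparing the output against the stored 우편번호1 and 우편번호2 columns would be one way to confirm whether the assumption holds for all 10000 rows.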