Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 585.9 KiB |
Average record size in memory | 60.0 B |
Variable types
Numeric | 4 |
---|---|
Text | 2 |
Dataset
Description | 도립거창대학교 시스템 DB 내 우편번호(신주소)데이터입니다 .(번호, 우편번호, 주소, 시도 데이터를 포함하고 있습니다.) |
---|---|
URL | https://www.data.go.kr/data/15049414/fileData.do |
Reproduction
Analysis started | 2023-12-12 10:45:59.066949 |
---|---|
Analysis finished | 2023-12-12 10:46:03.369486 |
Duration | 4.3 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
번호
Real number (ℝ)
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 25000.798 |
Minimum | 1 |
---|---|
Maximum | 492019 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2403.45 |
Q1 | 12187.75 |
median | 24757.5 |
Q3 | 36878 |
95-th percentile | 46937.05 |
Maximum | 492019 |
Range | 492018 |
Interquartile range (IQR) | 24690.25 |
Descriptive statistics
Standard deviation | 19715.236 |
---|---|
Coefficient of variation (CV) | 0.78858426 |
Kurtosis | 229.41546 |
Mean | 25000.798 |
Median Absolute Deviation (MAD) | 12369.5 |
Skewness | 10.449019 |
Sum | 2.5000798 × 108 |
Variance | 3.8869051 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12084 | 1 | < 0.1% |
20235 | 1 | < 0.1% |
43274 | 1 | < 0.1% |
6163 | 1 | < 0.1% |
15739 | 1 | < 0.1% |
25751 | 1 | < 0.1% |
19673 | 1 | < 0.1% |
18602 | 1 | < 0.1% |
18348 | 1 | < 0.1% |
21061 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
1 | 1 | |
6 | 1 | |
7 | 1 | |
12 | 1 | |
13 | 1 | |
14 | 1 | |
15 | 1 | |
16 | 1 | |
29 | 1 | |
32 | 1 |
Value | Count | Frequency (%) |
492019 | 1 | |
492017 | 1 | |
492009 | 1 | |
492006 | 1 | |
429031 | 1 | |
429030 | 1 | |
429023 | 1 | |
429022 | 1 | |
429021 | 1 | |
429014 | 1 |
우편번호1
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 254 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 459.2415 |
Minimum | 100 |
---|---|
Maximum | 799 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 100 |
---|---|
5-th percentile | 133 |
Q1 | 325 |
median | 472 |
Q3 | 617 |
95-th percentile | 750 |
Maximum | 799 |
Range | 699 |
Interquartile range (IQR) | 292 |
Descriptive statistics
Standard deviation | 199.10884 |
---|---|
Coefficient of variation (CV) | 0.4335602 |
Kurtosis | -1.0331437 |
Mean | 459.2415 |
Median Absolute Deviation (MAD) | 147 |
Skewness | -0.26269304 |
Sum | 4592415 |
Variance | 39644.329 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
135 | 132 | 1.3% |
472 | 104 | 1.0% |
151 | 99 | 1.0% |
702 | 97 | 1.0% |
560 | 89 | 0.9% |
706 | 87 | 0.9% |
139 | 85 | 0.9% |
100 | 84 | 0.8% |
704 | 84 | 0.8% |
140 | 82 | 0.8% |
Other values (244) | 9057 |
Value | Count | Frequency (%) |
100 | 84 | |
110 | 67 | |
120 | 46 | |
121 | 53 | |
122 | 48 | |
130 | 62 | |
131 | 66 | |
132 | 56 | |
133 | 44 | |
134 | 69 |
Value | Count | Frequency (%) |
799 | 5 | 0.1% |
791 | 54 | |
790 | 49 | |
780 | 68 | |
770 | 59 | |
769 | 52 | |
767 | 26 | 0.3% |
766 | 30 | |
764 | 15 | 0.1% |
763 | 20 | 0.2% |
우편번호2
Real number (ℝ)
Distinct | 564 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 758.7337 |
Minimum | 3 |
---|---|
Maximum | 999 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 120 |
Q1 | 760 |
median | 821 |
Q3 | 861 |
95-th percentile | 931 |
Maximum | 999 |
Range | 996 |
Interquartile range (IQR) | 101 |
Descriptive statistics
Standard deviation | 213.90496 |
---|---|
Coefficient of variation (CV) | 0.28192364 |
Kurtosis | 4.8634057 |
Mean | 758.7337 |
Median Absolute Deviation (MAD) | 48 |
Skewness | -2.4102093 |
Sum | 7587337 |
Variance | 45755.333 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
811 | 153 | 1.5% |
841 | 149 | 1.5% |
831 | 147 | 1.5% |
822 | 138 | 1.4% |
821 | 130 | 1.3% |
812 | 126 | 1.3% |
801 | 121 | 1.2% |
842 | 117 | 1.2% |
803 | 115 | 1.1% |
851 | 115 | 1.1% |
Other values (554) | 8689 |
Value | Count | Frequency (%) |
3 | 8 | 0.1% |
10 | 38 | |
11 | 10 | 0.1% |
12 | 9 | 0.1% |
13 | 8 | 0.1% |
14 | 6 | 0.1% |
15 | 5 | 0.1% |
16 | 2 | < 0.1% |
17 | 2 | < 0.1% |
19 | 2 | < 0.1% |
Value | Count | Frequency (%) |
999 | 2 | < 0.1% |
998 | 3 | |
994 | 2 | < 0.1% |
993 | 3 | |
992 | 3 | |
991 | 1 | < 0.1% |
990 | 5 | |
989 | 3 | |
988 | 2 | < 0.1% |
987 | 3 |
우편번호
Text
Distinct | 8468 |
---|---|
Distinct (%) | 84.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
487-839 | 8 | 0.1% |
701-819 | 7 | 0.1% |
611-839 | 7 | 0.1% |
601-803 | 7 | 0.1% |
138-873 | 7 | 0.1% |
138-820 | 6 | 0.1% |
482-839 | 6 | 0.1% |
476-809 | 6 | 0.1% |
537-874 | 6 | 0.1% |
616-844 | 5 | < 0.1% |
Other values (8458) | 9935 |
Most occurring characters
Value | Count | Frequency (%) |
- | 9588 | |
8 | 7520 | |
1 | 7346 | |
0 | 7164 | |
3 | 6297 | |
7 | 6286 | |
2 | 5538 | |
4 | 5443 | |
5 | 5377 | |
6 | 5334 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 59580 | |
Dash Punctuation | 9588 | 13.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
8 | 7520 | |
1 | 7346 | |
0 | 7164 | |
3 | 6297 | |
7 | 6286 | |
2 | 5538 | |
4 | 5443 | |
5 | 5377 | |
6 | 5334 | |
9 | 3275 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9588 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 69168 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 9588 | |
8 | 7520 | |
1 | 7346 | |
0 | 7164 | |
3 | 6297 | |
7 | 6286 | |
2 | 5538 | |
4 | 5443 | |
5 | 5377 | |
6 | 5334 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 69168 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 9588 | |
8 | 7520 | |
1 | 7346 | |
0 | 7164 | |
3 | 6297 | |
7 | 6286 | |
2 | 5538 | |
4 | 5443 | |
5 | 5377 | |
6 | 5334 |
주소
Text
Distinct | 8355 |
---|---|
Distinct (%) | 83.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 41 |
---|---|
Median length | 38 |
Mean length | 14.9328 |
Min length | 8 |
Characters and Unicode
Total characters | 149328 |
---|---|
Distinct characters | 558 |
Distinct categories | 9 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 7523 ? |
---|---|
Unique (%) | 75.2% |
Sample
1st row | 경남 산청군 생비량면 |
---|---|
2nd row | 전남 장성군 장성읍 영천리 |
3rd row | 충남 당진군 면천면 성상리 |
4th row | 경북 상주시 냉림동 주공4단지아파트 |
5th row | 서울 강북구 수유3동 |
Value | Count | Frequency (%) |
서울 | 1610 | 4.2% |
경기 | 1605 | 4.2% |
경북 | 964 | 2.5% |
전남 | 783 | 2.1% |
경남 | 734 | 1.9% |
부산 | 630 | 1.7% |
충남 | 609 | 1.6% |
전북 | 573 | 1.5% |
강원 | 560 | 1.5% |
대구 | 507 | 1.3% |
Other values (8588) | 29330 |
Most occurring characters
Value | Count | Frequency (%) |
30198 | 20.2% | |
동 | 7126 | 4.8% |
구 | 5411 | 3.6% |
시 | 4254 | 2.8% |
리 | 3979 | 2.7% |
경 | 3612 | 2.4% |
남 | 3457 | 2.3% |
면 | 3196 | 2.1% |
산 | 3097 | 2.1% |
1 | 3036 | 2.0% |
Other values (548) | 81962 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 109707 | |
Space Separator | 30198 | 20.2% |
Decimal Number | 8478 | 5.7% |
Dash Punctuation | 700 | 0.5% |
Uppercase Letter | 172 | 0.1% |
Open Punctuation | 28 | < 0.1% |
Close Punctuation | 28 | < 0.1% |
Other Punctuation | 12 | < 0.1% |
Lowercase Letter | 5 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 7126 | 6.5% |
구 | 5411 | 4.9% |
시 | 4254 | 3.9% |
리 | 3979 | 3.6% |
경 | 3612 | 3.3% |
남 | 3457 | 3.2% |
면 | 3196 | 2.9% |
산 | 3097 | 2.8% |
서 | 2913 | 2.7% |
북 | 2732 | 2.5% |
Other values (508) | 69930 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 27 | |
K | 22 | |
L | 19 | |
G | 18 | |
T | 14 | |
A | 12 | 7.0% |
B | 9 | 5.2% |
C | 9 | 5.2% |
D | 6 | 3.5% |
I | 5 | 2.9% |
Other values (13) | 31 |
Decimal Number
Value | Count | Frequency (%) |
1 | 3036 | |
2 | 1797 | |
0 | 1097 | 12.9% |
3 | 877 | 10.3% |
4 | 511 | 6.0% |
5 | 376 | 4.4% |
6 | 294 | 3.5% |
7 | 200 | 2.4% |
8 | 158 | 1.9% |
9 | 132 | 1.6% |
Other Punctuation
Value | Count | Frequency (%) |
. | 11 | |
& | 1 | 8.3% |
Space Separator
Value | Count | Frequency (%) |
30198 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 700 |
Open Punctuation
Value | Count | Frequency (%) |
( | 28 |
Close Punctuation
Value | Count | Frequency (%) |
) | 28 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 5 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 109707 | |
Common | 39444 | 26.4% |
Latin | 177 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 7126 | 6.5% |
구 | 5411 | 4.9% |
시 | 4254 | 3.9% |
리 | 3979 | 3.6% |
경 | 3612 | 3.3% |
남 | 3457 | 3.2% |
면 | 3196 | 2.9% |
산 | 3097 | 2.8% |
서 | 2913 | 2.7% |
북 | 2732 | 2.5% |
Other values (508) | 69930 |
Latin
Value | Count | Frequency (%) |
S | 27 | |
K | 22 | |
L | 19 | |
G | 18 | |
T | 14 | 7.9% |
A | 12 | 6.8% |
B | 9 | 5.1% |
C | 9 | 5.1% |
D | 6 | 3.4% |
e | 5 | 2.8% |
Other values (14) | 36 |
Common
Value | Count | Frequency (%) |
30198 | ||
1 | 3036 | 7.7% |
2 | 1797 | 4.6% |
0 | 1097 | 2.8% |
3 | 877 | 2.2% |
- | 700 | 1.8% |
4 | 511 | 1.3% |
5 | 376 | 1.0% |
6 | 294 | 0.7% |
7 | 200 | 0.5% |
Other values (6) | 358 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 109707 | |
ASCII | 39621 | 26.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
30198 | ||
1 | 3036 | 7.7% |
2 | 1797 | 4.5% |
0 | 1097 | 2.8% |
3 | 877 | 2.2% |
- | 700 | 1.8% |
4 | 511 | 1.3% |
5 | 376 | 0.9% |
6 | 294 | 0.7% |
7 | 200 | 0.5% |
Other values (30) | 535 | 1.4% |
Hangul
Value | Count | Frequency (%) |
동 | 7126 | 6.5% |
구 | 5411 | 4.9% |
시 | 4254 | 3.9% |
리 | 3979 | 3.6% |
경 | 3612 | 3.3% |
남 | 3457 | 3.2% |
면 | 3196 | 2.9% |
산 | 3097 | 2.8% |
서 | 2913 | 2.7% |
북 | 2732 | 2.5% |
Other values (508) | 69930 |
시도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.0342 |
Minimum | 1 |
---|---|
Maximum | 19 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 11 |
Q3 | 15 |
95-th percentile | 18 |
Maximum | 19 |
Range | 18 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 6.1432894 |
---|---|
Coefficient of variation (CV) | 0.61223509 |
Kurtosis | -1.4072111 |
Mean | 10.0342 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.3171108 |
Sum | 100342 |
Variance | 37.740004 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 1610 | |
11 | 1605 | |
18 | 964 | |
15 | 783 | |
17 | 734 | |
2 | 630 | 6.3% |
13 | 609 | 6.1% |
16 | 573 | 5.7% |
12 | 560 | 5.6% |
3 | 507 | 5.1% |
Other values (6) | 1425 |
Value | Count | Frequency (%) |
1 | 1610 | |
2 | 630 | 6.3% |
3 | 507 | 5.1% |
4 | 284 | 2.8% |
5 | 234 | 2.3% |
6 | 164 | 1.6% |
7 | 168 | 1.7% |
11 | 1605 | |
12 | 560 | 5.6% |
13 | 609 | 6.1% |
Value | Count | Frequency (%) |
19 | 80 | 0.8% |
18 | 964 | |
17 | 734 | |
16 | 573 | 5.7% |
15 | 783 | |
14 | 495 | 5.0% |
13 | 609 | 6.1% |
12 | 560 | 5.6% |
11 | 1605 | |
7 | 168 | 1.7% |
번호 | 우편번호1 | 우편번호2 | 시도 | |
---|---|---|---|---|
번호 | 1.000 | 0.078 | 0.096 | 0.103 |
우편번호1 | 0.078 | 1.000 | 0.308 | 0.889 |
우편번호2 | 0.096 | 0.308 | 1.000 | 0.265 |
시도 | 0.103 | 0.889 | 0.265 | 1.000 |
번호 | 우편번호1 | 우편번호2 | 시도 | |
---|---|---|---|---|
번호 | 1.000 | -0.167 | 0.093 | -0.009 |
우편번호1 | -0.167 | 1.000 | 0.073 | 0.594 |
우편번호2 | 0.093 | 0.073 | 1.000 | 0.219 |
시도 | -0.009 | 0.594 | 0.219 | 1.000 |
번호 | 우편번호1 | 우편번호2 | 우편번호 | 주소 | 시도 | |
---|---|---|---|---|---|---|
44559 | 12084 | 666 | 970 | 666-970 | 경남 산청군 생비량면 | 17 |
9900 | 39673 | 515 | 805 | 515-805 | 전남 장성군 장성읍 영천리 | 15 |
8484 | 44503 | 343 | 884 | 343-884 | 충남 당진군 면천면 성상리 | 13 |
38302 | 15962 | 742 | 753 | 742-753 | 경북 상주시 냉림동 주공4단지아파트 | 18 |
28420 | 27841 | 142 | 872 | 142-872 | 서울 강북구 수유3동 | 1 |
2209 | 5895 | 443 | 756 | 443-756 | 경기 수원시 영통구 원천동 주공아파트 201-221 | 11 |
44473 | 11998 | 664 | 951 | 664-951 | 경남 사천시 용현면 통양리 | 17 |
14867 | 45635 | 336 | 823 | 336-823 | 충남 아산시 영인면 아산리 | 13 |
25560 | 35881 | 401 | 60 | 40160 | 인천 동구 금곡동 | 4 |
17654 | 46995 | 367 | 847 | 367-847 | 충북 괴산군 청천면 삼송리 | 14 |
번호 | 우편번호1 | 우편번호2 | 우편번호 | 주소 | 시도 | |
---|---|---|---|---|---|---|
939 | 429031 | 642 | 370 | 642-370 | 경남 창원시 성산구 신촌동 | 17 |
45993 | 16429 | 760 | 380 | 760-380 | 경북 안동시 송천동 | 18 |
21312 | 29647 | 139 | 869 | 139-869 | 서울 노원구 하계1동 | 1 |
12523 | 45988 | 340 | 864 | 340-864 | 충남 예산군 신암면 예림리 | 13 |
15542 | 45277 | 325 | 861 | 325-861 | 충남 서천군 마산면 요곡리 | 13 |
24460 | 35967 | 403 | 808 | 403-808 | 인천 부평구 부개2동 | 4 |
17550 | 39926 | 539 | 915 | 539-915 | 전남 진도군 조도면 내병도리 | 15 |
18005 | 35234 | 417 | 871 | 417-871 | 인천 강화군 하점면 삼거리 | 4 |
37483 | 11207 | 668 | 883 | 668-883 | 경남 남해군 고현면 도마리 | 17 |
4294 | 1359 | 220 | 50 | 22050 | 강원 원주시 일산동 원주우체국 | 12 |