Dataset statistics
Number of variables | 14 |
---|---|
Number of observations | 10000 |
Missing cells | 20147 |
Missing cells (%) | 14.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.2 MiB |
Average record size in memory | 127.0 B |
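The missing-cell percentage can be cross-checked from the counts in this table: 14 variables times 10000 observations gives the total number of cells. The snippet below is only a sanity check on the reported figures; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 14
n_observations = 10_000
missing_cells = 20_147

# Total cells = variables x observations; the percentage is relative to that.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{missing_pct:.1f}%")
```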
Variable types
Numeric | 5 |
---|---|
Categorical | 4 |
Text | 4 |
Unsupported | 1 |
Dataset
Description | 부산광역시영도구_옥외광고물새주소관리_20211231 (Yeongdo-gu, Busan: outdoor advertisement new-address management, as of 2021-12-31) |
---|---|
Author | Yeongdo-gu, Busan Metropolitan City (부산광역시 영도구) |
URL | http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15072284 |
시도 has constant value "부산광역시" | Constant |
시군구 has constant value "영도구" | Constant |
읍면동 is highly overall correlated with 순번 and 3 other fields | High correlation |
우편번호1 is highly overall correlated with 순번 and 5 other fields | High correlation |
순번 is highly overall correlated with 우편번호2 and 2 other fields | High correlation |
우편번호2 is highly overall correlated with 순번 and 2 other fields | High correlation |
건물번호 is highly overall correlated with 우편번호1 | High correlation |
건물번호2 is highly overall correlated with 우편번호1 | High correlation |
신우편코드 is highly overall correlated with 우편번호1 and 1 other fields | High correlation |
우편번호1 is highly imbalanced (92.7%) | Imbalance |
건물명 has 9264 (92.6%) missing values | Missing |
신우편코드 has 499 (5.0%) missing values | Missing |
Unnamed: 13 has 10000 (100.0%) missing values | Missing |
순번 has unique values | Unique |
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
건물번호2 has 7151 (71.5%) zeros | Zeros |
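Several of the alerts above (Constant, Unique, Missing, Zeros) come down to simple per-column counts. The following is a minimal sketch of such checks in plain Python. It is not ydata-profiling's implementation; the function name `column_alerts` and the `zero_threshold` cutoff are hypothetical. The rows reuse a few values from the sample table of this report.

```python
def column_alerts(rows, zero_threshold=0.5):
    """Flag simple data-quality alerts per column.

    rows: list of dicts sharing the same keys; None marks a missing cell.
    zero_threshold: hypothetical cutoff (fraction of rows) for the Zeros alert.
    """
    n = len(rows)
    alerts = {}
    for col in rows[0]:
        values = [row[col] for row in rows]
        present = [v for v in values if v is not None]
        found = []
        if len(present) < n:
            found.append(f"Missing ({100 * (n - len(present)) / n:.1f}%)")
        if present and len(set(present)) == 1:
            found.append("Constant")
        if len(present) == n and len(set(present)) == n:
            found.append("Unique")
        zeros = sum(1 for v in present if v == 0)
        if zeros / n > zero_threshold:
            found.append(f"Zeros ({100 * zeros / n:.1f}%)")
        if found:
            alerts[col] = found
    return alerts


# Five rows with a subset of columns, values taken from the sample rows.
rows = [
    {"순번": 184, "시도": "부산광역시", "건물번호2": 3, "Unnamed: 13": None},
    {"순번": 20133, "시도": "부산광역시", "건물번호2": 1, "Unnamed: 13": None},
    {"순번": 2995, "시도": "부산광역시", "건물번호2": 0, "Unnamed: 13": None},
    {"순번": 7486, "시도": "부산광역시", "건물번호2": 0, "Unnamed: 13": None},
    {"순번": 19733, "시도": "부산광역시", "건물번호2": 0, "Unnamed: 13": None},
]
alerts = column_alerts(rows)
for column, found in alerts.items():
    print(column, found)
```

On the full dataset the same counts would be taken over all 10000 rows; the cutoffs the profiler itself uses are configurable and are not stated in this report.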
Reproduction
Analysis started | 2023-12-10 17:08:19.054528 |
---|---|
Analysis finished | 2023-12-10 17:08:25.796202 |
Duration | 6.74 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
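The reported duration is the difference between the two timestamps above. A short check using only the Python standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2023-12-10 17:08:19.054528")
finished = datetime.fromisoformat("2023-12-10 17:08:25.796202")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```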
순번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12189.779 |
Minimum | 1 |
---|---|
Maximum | 24468 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1243.95 |
Q1 | 6087.75 |
median | 12150.5 |
Q3 | 18300.5 |
95-th percentile | 23212.05 |
Maximum | 24468 |
Range | 24467 |
Interquartile range (IQR) | 12212.75 |
Descriptive statistics
Standard deviation | 7040.4615 |
---|---|
Coefficient of variation (CV) | 0.5775709 |
Kurtosis | -1.1910282 |
Mean | 12189.779 |
Median Absolute Deviation (MAD) | 6108.5 |
Skewness | 0.0092008237 |
Sum | 1.2189779 × 10^8 |
Variance | 49568098 |
Monotonicity | Not monotonic |
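Several of the statistics for 순번 are derived from others in the same tables, which makes them easy to cross-check. The sketch below recomputes the coefficient of variation, variance, interquartile range and range from the reported values. It works from the rounded figures shown here, so small rounding differences are expected.

```python
# Reported values for 순번, copied from the tables above.
mean = 12189.779
std = 7040.4615
q1, q3 = 6087.75, 18300.5
minimum, maximum = 1, 24468

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV={cv:.7f} variance={variance:.0f} IQR={iqr} range={value_range}")
```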
Value | Count | Frequency (%) |
184 | 1 | < 0.1% |
6281 | 1 | < 0.1% |
107 | 1 | < 0.1% |
15909 | 1 | < 0.1% |
20492 | 1 | < 0.1% |
5656 | 1 | < 0.1% |
8486 | 1 | < 0.1% |
13883 | 1 | < 0.1% |
12616 | 1 | < 0.1% |
8912 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
10 | 1 | |
13 | 1 | |
17 | 1 | |
22 | 1 | |
25 | 1 | |
26 | 1 |
Value | Count | Frequency (%) |
24468 | 1 | |
24466 | 1 | |
24460 | 1 | |
24459 | 1 | |
24458 | 1 | |
24457 | 1 | |
24450 | 1 | |
24448 | 1 | |
24443 | 1 | |
24439 | 1 |
우편번호1
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0088 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 606 |
---|---|
2nd row | 606 |
3rd row | 606 |
4th row | 606 |
5th row | 606 |
Common Values
Value | Count | Frequency (%) |
606 | 9912 | 99.1% |
<NA> | 88 | 0.9% |
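The 92.7% imbalance reported for 우편번호1 is consistent with a score of one minus the normalized Shannon entropy of the category counts. The report does not state the formula, so treat this definition as an assumption that happens to reproduce the figure. The function name `imbalance_score` is illustrative.

```python
import math


def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for category counts.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    Assumed definition, not confirmed by the report.
    """
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    if len(probs) < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(probs))


# Counts from the Common Values table: 606 -> 9912, <NA> -> 88.
score = imbalance_score([9912, 88])
print(f"{100 * score:.1f}%")
```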
우편번호2
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 71 |
---|---|
Distinct (%) | 0.7% |
Missing | 88 |
Missing (%) | 0.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 580.94653 |
Minimum | 11 |
---|---|
Maximum | 825 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 11 |
---|---|
5-th percentile | 22 |
Q1 | 61 |
median | 808 |
Q3 | 818 |
95-th percentile | 822 |
Maximum | 825 |
Range | 814 |
Interquartile range (IQR) | 757 |
Descriptive statistics
Standard deviation | 353.136 |
---|---|
Coefficient of variation (CV) | 0.60786317 |
Kurtosis | -1.2468566 |
Mean | 580.94653 |
Median Absolute Deviation (MAD) | 11 |
Skewness | -0.86493229 |
Sum | 5758342 |
Variance | 124705.03 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
822 | 699 | 7.0% |
53 | 614 | 6.1% |
806 | 508 | 5.1% |
812 | 480 | 4.8% |
814 | 449 | 4.5% |
818 | 432 | 4.3% |
823 | 409 | 4.1% |
804 | 389 | 3.9% |
820 | 389 | 3.9% |
51 | 365 | 3.6% |
Other values (61) | 5178 |
Value | Count | Frequency (%) |
11 | 115 | 1.1% |
12 | 116 | 1.2% |
21 | 155 | |
22 | 197 | |
33 | 333 | |
41 | 145 | 1.5% |
42 | 194 | |
43 | 171 | |
44 | 3 | < 0.1% |
51 | 365 |
Value | Count | Frequency (%) |
825 | 13 | 0.1% |
823 | 409 | |
822 | 699 | |
821 | 268 | 2.7% |
820 | 389 | |
819 | 273 | 2.7% |
818 | 432 | |
817 | 324 | |
816 | 262 | 2.6% |
815 | 177 | 1.8% |
시도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 부산광역시 |
---|---|
2nd row | 부산광역시 |
3rd row | 부산광역시 |
4th row | 부산광역시 |
5th row | 부산광역시 |
Common Values
Value | Count | Frequency (%) |
부산광역시 | 10000 | 100.0% |
시군구
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 영도구 |
---|---|
2nd row | 영도구 |
3rd row | 영도구 |
4th row | 영도구 |
5th row | 영도구 |
Common Values
Value | Count | Frequency (%) |
영도구 | 10000 | 100.0% |
읍면동
Categorical
HIGH CORRELATION
 
Distinct | 21 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.1294 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 남항동1가 |
---|---|
2nd row | 청학동 |
3rd row | 대평동2가 |
4th row | 동삼동 |
5th row | 청학동 |
Common Values
Value | Count | Frequency (%) |
청학동 | 2579 | 25.8% |
동삼동 | 1774 | 17.7% |
신선동2가 | 627 | 6.3% |
신선동3가 | 615 | 6.2% |
영선동4가 | 598 | 6.0% |
봉래동5가 | 524 | 5.2% |
봉래동4가 | 505 | 5.1% |
신선동1가 | 364 | 3.6% |
남항동3가 | 336 | 3.4% |
남항동1가 | 246 | 2.5% |
Other values (11) | 1832 |
번지
Text
Distinct | 6964 |
---|---|
Distinct (%) | 69.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
01월 | 112 | 1.1% |
04월 | 63 | 0.6% |
12월 | 58 | 0.5% |
01일 | 56 | 0.5% |
05월 | 56 | 0.5% |
02월 | 52 | 0.5% |
06월 | 52 | 0.5% |
03월 | 50 | 0.5% |
279-2 | 38 | 0.4% |
07월 | 37 | 0.3% |
Other values (6753) | 10007 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 9795 | |
- | 7498 | |
2 | 7423 | |
3 | 4971 | |
4 | 4036 | |
0 | 3425 | 6.3% |
5 | 3337 | 6.1% |
6 | 3242 | 6.0% |
7 | 2978 | 5.5% |
8 | 2953 | 5.4% |
Other values (5) | 4685 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 45101 | |
Dash Punctuation | 7498 | 13.8% |
Other Letter | 1163 | 2.1% |
Space Separator | 581 | 1.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 9795 | |
2 | 7423 | |
3 | 4971 | |
4 | 4036 | |
0 | 3425 | 7.6% |
5 | 3337 | 7.4% |
6 | 3242 | 7.2% |
7 | 2978 | 6.6% |
8 | 2953 | 6.5% |
9 | 2941 | 6.5% |
Other Letter
Value | Count | Frequency (%) |
월 | 581 | |
일 | 581 | |
산 | 1 | 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 7498 |
Space Separator
Value | Count | Frequency (%) |
(space) | 581 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 53180 | |
Hangul | 1163 | 2.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 9795 | |
- | 7498 | |
2 | 7423 | |
3 | 4971 | |
4 | 4036 | |
0 | 3425 | 6.4% |
5 | 3337 | 6.3% |
6 | 3242 | 6.1% |
7 | 2978 | 5.6% |
8 | 2953 | 5.6% |
Other values (2) | 3522 | 6.6% |
Hangul
Value | Count | Frequency (%) |
월 | 581 | |
일 | 581 | |
산 | 1 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 53180 | |
Hangul | 1163 | 2.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 9795 | |
- | 7498 | |
2 | 7423 | |
3 | 4971 | |
4 | 4036 | |
0 | 3425 | 6.4% |
5 | 3337 | 6.3% |
6 | 3242 | 6.1% |
7 | 2978 | 5.6% |
8 | 2953 | 5.6% |
Other values (2) | 3522 | 6.6% |
Hangul
Value | Count | Frequency (%) |
월 | 581 | |
일 | 581 | |
산 | 1 | 0.1% |
도로명
Text
Distinct | 430 |
---|---|
Distinct (%) | 4.3% |
Missing | 74 |
Missing (%) | 0.7% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
태종로 | 418 | 4.2% |
절영로 | 274 | 2.8% |
하나길 | 248 | 2.5% |
해양로 | 112 | 1.1% |
중복길 | 111 | 1.1% |
청학로 | 108 | 1.1% |
웃서발로 | 104 | 1.0% |
청학동로 | 103 | 1.0% |
새천년길 | 88 | 0.9% |
아리랑길 | 87 | 0.9% |
Other values (420) | 8273 |
Most occurring characters
Value | Count | Frequency (%) |
길 | 7795 | 15.2% |
로 | 5782 | 11.3% |
번 | 3781 | 7.4% |
남 | 1529 | 3.0% |
1 | 1425 | 2.8% |
3 | 1384 | 2.7% |
학 | 1315 | 2.6% |
청 | 1307 | 2.6% |
2 | 1178 | 2.3% |
영 | 1147 | 2.2% |
Other values (148) | 24563 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 42533 | |
Decimal Number | 8673 | 16.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
길 | 7795 | |
로 | 5782 | 13.6% |
번 | 3781 | 8.9% |
남 | 1529 | 3.6% |
학 | 1315 | 3.1% |
청 | 1307 | 3.1% |
영 | 1147 | 2.7% |
태 | 991 | 2.3% |
종 | 991 | 2.3% |
항 | 951 | 2.2% |
Other values (138) | 16944 |
Decimal Number
Value | Count | Frequency (%) |
1 | 1425 | |
3 | 1384 | |
2 | 1178 | |
4 | 814 | |
9 | 804 | |
7 | 729 | |
6 | 689 | |
5 | 673 | |
0 | 528 | 6.1% |
8 | 449 | 5.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 42533 | |
Common | 8673 | 16.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
길 | 7795 | |
로 | 5782 | 13.6% |
번 | 3781 | 8.9% |
남 | 1529 | 3.6% |
학 | 1315 | 3.1% |
청 | 1307 | 3.1% |
영 | 1147 | 2.7% |
태 | 991 | 2.3% |
종 | 991 | 2.3% |
항 | 951 | 2.2% |
Other values (138) | 16944 |
Common
Value | Count | Frequency (%) |
1 | 1425 | |
3 | 1384 | |
2 | 1178 | |
4 | 814 | |
9 | 804 | |
7 | 729 | |
6 | 689 | |
5 | 673 | |
0 | 528 | 6.1% |
8 | 449 | 5.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 42533 | |
ASCII | 8673 | 16.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
길 | 7795 | |
로 | 5782 | 13.6% |
번 | 3781 | 8.9% |
남 | 1529 | 3.6% |
학 | 1315 | 3.1% |
청 | 1307 | 3.1% |
영 | 1147 | 2.7% |
태 | 991 | 2.3% |
종 | 991 | 2.3% |
항 | 951 | 2.2% |
Other values (138) | 16944 |
ASCII
Value | Count | Frequency (%) |
1 | 1425 | |
3 | 1384 | |
2 | 1178 | |
4 | 814 | |
9 | 804 | |
7 | 729 | |
6 | 689 | |
5 | 673 | |
0 | 528 | 6.1% |
8 | 449 | 5.2% |
건물번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 621 |
---|---|
Distinct (%) | 6.3% |
Missing | 74 |
Missing (%) | 0.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 86.88374 |
Minimum | 1 |
---|---|
Maximum | 950 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5 |
Q1 | 17 |
median | 39 |
Q3 | 87 |
95-th percentile | 371.5 |
Maximum | 950 |
Range | 949 |
Interquartile range (IQR) | 70 |
Descriptive statistics
Standard deviation | 136.75852 |
---|---|
Coefficient of variation (CV) | 1.5740404 |
Kurtosis | 11.526463 |
Mean | 86.88374 |
Median Absolute Deviation (MAD) | 27 |
Skewness | 3.2165036 |
Sum | 862408 |
Variance | 18702.892 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7 | 188 | 1.9% |
14 | 185 | 1.8% |
6 | 180 | 1.8% |
10 | 178 | 1.8% |
16 | 174 | 1.7% |
12 | 174 | 1.7% |
8 | 173 | 1.7% |
11 | 167 | 1.7% |
5 | 159 | 1.6% |
13 | 157 | 1.6% |
Other values (611) | 8191 |
Value | Count | Frequency (%) |
1 | 92 | |
2 | 120 | |
3 | 118 | |
4 | 122 | |
5 | 159 | |
6 | 180 | |
7 | 188 | |
8 | 173 | |
9 | 152 | |
10 | 178 |
Value | Count | Frequency (%) |
950 | 1 | |
948 | 1 | |
946 | 1 | |
942 | 1 | |
928 | 1 | |
922 | 1 | |
904 | 1 | |
898 | 1 | |
894 | 1 | |
890 | 1 |
건물번호2
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 52 |
---|---|
Distinct (%) | 0.5% |
Missing | 74 |
Missing (%) | 0.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.083518 |
Minimum | 0 |
---|---|
Maximum | 70 |
Zeros | 7151 |
Zeros (%) | 71.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 12 |
Maximum | 70 |
Range | 70 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 5.1670313 |
---|---|
Coefficient of variation (CV) | 2.4799552 |
Kurtosis | 25.88065 |
Mean | 2.083518 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.2120576 |
Sum | 20681 |
Variance | 26.698213 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 7151 | |
1 | 500 | 5.0% |
3 | 255 | 2.5% |
4 | 250 | 2.5% |
5 | 248 | 2.5% |
6 | 225 | 2.2% |
2 | 168 | 1.7% |
7 | 158 | 1.6% |
8 | 158 | 1.6% |
9 | 114 | 1.1% |
Other values (42) | 699 | 7.0% |
Value | Count | Frequency (%) |
0 | 7151 | |
1 | 500 | 5.0% |
2 | 168 | 1.7% |
3 | 255 | 2.5% |
4 | 250 | 2.5% |
5 | 248 | 2.5% |
6 | 225 | 2.2% |
7 | 158 | 1.6% |
8 | 158 | 1.6% |
9 | 114 | 1.1% |
Value | Count | Frequency (%) |
70 | 1 | < 0.1% |
68 | 1 | < 0.1% |
64 | 1 | < 0.1% |
60 | 1 | < 0.1% |
55 | 1 | < 0.1% |
53 | 1 | < 0.1% |
52 | 1 | < 0.1% |
51 | 1 | < 0.1% |
43 | 3 | |
42 | 1 | < 0.1% |
건물명
Text
MISSING
 
Distinct | 505 |
---|---|
Distinct (%) | 68.6% |
Missing | 9264 |
Missing (%) | 92.6% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
영선미니아파트 | 13 | 1.6% |
주)한진중공업 | 11 | 1.4% |
한국해양대학교 | 8 | 1.0% |
동삼그린힐아파트 | 7 | 0.9% |
절영아파트 | 6 | 0.7% |
조양비취맨션 | 6 | 0.7% |
주식회사 | 5 | 0.6% |
국보 | 5 | 0.6% |
동삼주공영구임대아파트 | 5 | 0.6% |
a동 | 5 | 0.6% |
Other values (528) | 737 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 159 | 3.5% |
영 | 152 | 3.3% |
아 | 142 | 3.1% |
빌 | 137 | 3.0% |
트 | 132 | 2.9% |
파 | 124 | 2.7% |
라 | 113 | 2.5% |
교 | 103 | 2.2% |
도 | 101 | 2.2% |
주 | 95 | 2.1% |
Other values (318) | 3342 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 4351 | |
Space Separator | 72 | 1.6% |
Open Punctuation | 48 | 1.0% |
Close Punctuation | 48 | 1.0% |
Decimal Number | 45 | 1.0% |
Uppercase Letter | 32 | 0.7% |
Other Punctuation | 2 | < 0.1% |
Lowercase Letter | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 159 | 3.7% |
영 | 152 | 3.5% |
아 | 142 | 3.3% |
빌 | 137 | 3.1% |
트 | 132 | 3.0% |
파 | 124 | 2.8% |
라 | 113 | 2.6% |
교 | 103 | 2.4% |
도 | 101 | 2.3% |
주 | 95 | 2.2% |
Other values (294) | 3093 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 6 | |
A | 6 | |
S | 5 | |
B | 5 | |
K | 4 | |
T | 2 | 6.2% |
G | 1 | 3.1% |
I | 1 | 3.1% |
X | 1 | 3.1% |
Z | 1 | 3.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 19 | |
2 | 15 | |
3 | 4 | 8.9% |
5 | 2 | 4.4% |
4 | 2 | 4.4% |
0 | 1 | 2.2% |
8 | 1 | 2.2% |
9 | 1 | 2.2% |
Lowercase Letter
Value | Count | Frequency (%) |
t | 1 | |
o | 1 |
Space Separator
Value | Count | Frequency (%) |
(space) | 72 |
Open Punctuation
Value | Count | Frequency (%) |
( | 48 |
Close Punctuation
Value | Count | Frequency (%) |
) | 48 |
Other Punctuation
Value | Count | Frequency (%) |
. | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 4351 | |
Common | 215 | 4.7% |
Latin | 34 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 159 | 3.7% |
영 | 152 | 3.5% |
아 | 142 | 3.3% |
빌 | 137 | 3.1% |
트 | 132 | 3.0% |
파 | 124 | 2.8% |
라 | 113 | 2.6% |
교 | 103 | 2.4% |
도 | 101 | 2.3% |
주 | 95 | 2.2% |
Other values (294) | 3093 |
Common
Value | Count | Frequency (%) |
(space) | 72 | |
( | 48 | |
) | 48 | |
1 | 19 | 8.8% |
2 | 15 | 7.0% |
3 | 4 | 1.9% |
5 | 2 | 0.9% |
4 | 2 | 0.9% |
. | 2 | 0.9% |
0 | 1 | 0.5% |
Other values (2) | 2 | 0.9% |
Latin
Value | Count | Frequency (%) |
C | 6 | |
A | 6 | |
S | 5 | |
B | 5 | |
K | 4 | |
T | 2 | 5.9% |
G | 1 | 2.9% |
I | 1 | 2.9% |
X | 1 | 2.9% |
t | 1 | 2.9% |
Other values (2) | 2 | 5.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 4351 | |
ASCII | 249 | 5.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 159 | 3.7% |
영 | 152 | 3.5% |
아 | 142 | 3.3% |
빌 | 137 | 3.1% |
트 | 132 | 3.0% |
파 | 124 | 2.8% |
라 | 113 | 2.6% |
교 | 103 | 2.4% |
도 | 101 | 2.3% |
주 | 95 | 2.2% |
Other values (294) | 3093 |
ASCII
Value | Count | Frequency (%) |
(space) | 72 | |
( | 48 | |
) | 48 | |
1 | 19 | 7.6% |
2 | 15 | 6.0% |
C | 6 | 2.4% |
A | 6 | 2.4% |
S | 5 | 2.0% |
B | 5 | 2.0% |
3 | 4 | 1.6% |
Other values (14) | 21 | 8.4% |
지번
Text
Distinct | 8615 |
---|---|
Distinct (%) | 86.8% |
Missing | 74 |
Missing (%) | 0.7% |
Memory size | 156.2 KiB |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 10.294076 |
Min length | 5 |
Characters and Unicode
Total characters | 102179 |
---|---|
Distinct characters | 28 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 7656 |
---|---|
Unique (%) | 77.1% |
Sample
1st row | 남항동1가 201-12 |
---|---|
2nd row | 청학동 21-30 |
3rd row | 대평동2가 4 |
4th row | 동삼동 1148 |
5th row | 청학동 95-8 |
Value | Count | Frequency (%) |
청학동 | 2536 | 12.8% |
동삼동 | 1762 | 8.9% |
신선동2가 | 627 | 3.2% |
신선동3가 | 615 | 3.1% |
영선동4가 | 598 | 3.0% |
봉래동5가 | 523 | 2.6% |
봉래동4가 | 505 | 2.5% |
신선동1가 | 362 | 1.8% |
남항동3가 | 332 | 1.7% |
남항동1가 | 246 | 1.2% |
Other values (6964) | 11746 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 11688 | 11.4% |
1 | 10585 | 10.4% |
(space) | 9926 | 9.7% |
- | 9100 | 8.9% |
2 | 8315 | 8.1% |
3 | 5804 | 5.7% |
가 | 5628 | 5.5% |
4 | 4984 | 4.9% |
5 | 3703 | 3.6% |
6 | 3130 | 3.1% |
Other values (18) | 29316 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 47669 | |
Other Letter | 35484 | |
Space Separator | 9926 | 9.7% |
Dash Punctuation | 9100 | 8.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 11688 | |
가 | 5628 | |
선 | 2716 | 7.7% |
청 | 2536 | 7.1% |
학 | 2536 | 7.1% |
삼 | 1762 | 5.0% |
신 | 1604 | 4.5% |
래 | 1519 | 4.3% |
봉 | 1519 | 4.3% |
영 | 1112 | 3.1% |
Other values (6) | 2864 | 8.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 10585 | |
2 | 8315 | |
3 | 5804 | |
4 | 4984 | |
5 | 3703 | 7.8% |
6 | 3130 | 6.6% |
7 | 2892 | 6.1% |
9 | 2880 | 6.0% |
8 | 2878 | 6.0% |
0 | 2498 | 5.2% |
Space Separator
Value | Count | Frequency (%) |
(space) | 9926 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9100 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 66695 | |
Hangul | 35484 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 11688 | |
가 | 5628 | |
선 | 2716 | 7.7% |
청 | 2536 | 7.1% |
학 | 2536 | 7.1% |
삼 | 1762 | 5.0% |
신 | 1604 | 4.5% |
래 | 1519 | 4.3% |
봉 | 1519 | 4.3% |
영 | 1112 | 3.1% |
Other values (6) | 2864 | 8.1% |
Common
Value | Count | Frequency (%) |
1 | 10585 | |
(space) | 9926 | |
- | 9100 | |
2 | 8315 | |
3 | 5804 | |
4 | 4984 | |
5 | 3703 | 5.6% |
6 | 3130 | 4.7% |
7 | 2892 | 4.3% |
9 | 2880 | 4.3% |
Other values (2) | 5376 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 66695 | |
Hangul | 35484 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 11688 | |
가 | 5628 | |
선 | 2716 | 7.7% |
청 | 2536 | 7.1% |
학 | 2536 | 7.1% |
삼 | 1762 | 5.0% |
신 | 1604 | 4.5% |
래 | 1519 | 4.3% |
봉 | 1519 | 4.3% |
영 | 1112 | 3.1% |
Other values (6) | 2864 | 8.1% |
ASCII
Value | Count | Frequency (%) |
1 | 10585 | |
(space) | 9926 | |
- | 9100 | |
2 | 8315 | |
3 | 5804 | |
4 | 4984 | |
5 | 3703 | 5.6% |
6 | 3130 | 4.7% |
7 | 2892 | 4.3% |
9 | 2880 | 4.3% |
Other values (2) | 5376 |
신우편코드
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 126 |
---|---|
Distinct (%) | 1.3% |
Missing | 499 |
Missing (%) | 5.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 49057.827 |
Minimum | 49000 |
---|---|
Maximum | 49127 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 49000 |
---|---|
5-th percentile | 49010 |
Q1 | 49029 |
median | 49058 |
Q3 | 49079 |
95-th percentile | 49121 |
Maximum | 49127 |
Range | 127 |
Interquartile range (IQR) | 50 |
Descriptive statistics
Standard deviation | 32.535063 |
---|---|
Coefficient of variation (CV) | 0.00066319821 |
Kurtosis | -0.78773997 |
Mean | 49057.827 |
Median Absolute Deviation (MAD) | 26 |
Skewness | 0.27629532 |
Sum | 4.6609842 × 10^8 |
Variance | 1058.5303 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
49079 | 346 | 3.5% |
49024 | 215 | 2.1% |
49061 | 197 | 2.0% |
49076 | 194 | 1.9% |
49027 | 194 | 1.9% |
49031 | 180 | 1.8% |
49102 | 173 | 1.7% |
49014 | 166 | 1.7% |
49126 | 164 | 1.6% |
49017 | 159 | 1.6% |
Other values (116) | 7513 | |
(Missing) | 499 | 5.0% |
Value | Count | Frequency (%) |
49000 | 44 | |
49001 | 7 | 0.1% |
49002 | 1 | < 0.1% |
49003 | 72 | |
49004 | 27 | 0.3% |
49005 | 101 | |
49006 | 30 | 0.3% |
49007 | 44 | |
49008 | 69 | |
49009 | 59 |
Value | Count | Frequency (%) |
49127 | 50 | 0.5% |
49126 | 164 | |
49125 | 105 | |
49124 | 84 | |
49123 | 56 | 0.6% |
49122 | 14 | 0.1% |
49121 | 8 | 0.1% |
49120 | 2 | < 0.1% |
49119 | 4 | < 0.1% |
49118 | 84 |
Unnamed: 13
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
Variable | 순번 | 우편번호2 | 읍면동 | 건물번호 | 건물번호2 | 신우편코드 |
---|---|---|---|---|---|---|
순번 | 1.000 | 0.542 | 0.955 | 0.561 | 0.277 | 0.932 |
우편번호2 | 0.542 | 1.000 | 0.908 | 0.163 | 0.018 | 0.522 |
읍면동 | 0.955 | 0.908 | 1.000 | 0.457 | 0.234 | 0.918 |
건물번호 | 0.561 | 0.163 | 0.457 | 1.000 | 0.251 | 0.505 |
건물번호2 | 0.277 | 0.018 | 0.234 | 0.251 | 1.000 | 0.267 |
신우편코드 | 0.932 | 0.522 | 0.918 | 0.505 | 0.267 | 1.000 |
Variable | 읍면동 | 우편번호1 |
---|---|---|
읍면동 | 1.000 | 1.000 |
우편번호1 | 1.000 | 1.000 |
Variable | 순번 | 우편번호2 | 건물번호 | 건물번호2 | 신우편코드 | 우편번호1 | 읍면동 |
---|---|---|---|---|---|---|---|
순번 | 1.000 | 0.700 | 0.078 | -0.016 | -0.372 | 1.000 | 0.775 |
우편번호2 | 0.700 | 1.000 | -0.021 | 0.043 | -0.269 | 1.000 | 0.689 |
건물번호 | 0.078 | -0.021 | 1.000 | -0.162 | 0.090 | 1.000 | 0.187 |
건물번호2 | -0.016 | 0.043 | -0.162 | 1.000 | -0.115 | 1.000 | 0.082 |
신우편코드 | -0.372 | -0.269 | 0.090 | -0.115 | 1.000 | 1.000 | 0.659 |
우편번호1 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
읍면동 | 0.775 | 0.689 | 0.187 | 0.082 | 0.659 | 1.000 | 1.000 |
Row index | 순번 | 우편번호1 | 우편번호2 | 시도 | 시군구 | 읍면동 | 번지 | 도로명 | 건물번호 | 건물번호2 | 건물명 | 지번 | 신우편코드 | Unnamed: 13 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
278 | 184 | 606 | 801 | 부산광역시 | 영도구 | 남항동1가 | 201-12 | 남항로31번길 | 14 | 3 | <NA> | 남항동1가 201-12 | 49053 | <NA> |
19327 | 20133 | 606 | 818 | 부산광역시 | 영도구 | 청학동 | 21-30 | 청학동로 | 76 | 1 | <NA> | 청학동 21-30 | 49016 | <NA> |
771 | 2995 | 606 | 22 | 부산광역시 | 영도구 | 대평동2가 | 4 | 대평남로 | 59 | 0 | <NA> | 대평동2가 4 | 49044 | <NA> |
9378 | 7486 | 606 | 80 | 부산광역시 | 영도구 | 동삼동 | 1148 | 하리동길 | 37 | 0 | <NA> | 동삼동 1148 | 49125 | <NA> |
20043 | 19733 | 606 | 819 | 부산광역시 | 영도구 | 청학동 | 34912 | 청학남로60번길 | 10 | 0 | <NA> | 청학동 95-8 | <NA> | <NA> |
15046 | 17201 | 606 | 817 | 부산광역시 | 영도구 | 영선동4가 | 238-16 | 에움길 | 140 | 0 | <NA> | 영선동4가 238-16 | 49078 | <NA> |
16287 | 15965 | 606 | 42 | 부산광역시 | 영도구 | 영선동2가 | 16528 | 절영로101번길 | 32 | 0 | <NA> | 영선동2가 45-4 | 49056 | <NA> |
5579 | 4376 | 606 | 806 | 부산광역시 | 영도구 | 동삼동 | 227-205 | 동삼서로 | 6 | 5 | <NA> | 동삼동 227-205 | 49098 | <NA> |
20743 | 18894 | 606 | 823 | 부산광역시 | 영도구 | 청학동 | 391-679 | 우정길 | 29 | 0 | <NA> | 청학동 391-679 | 49030 | <NA> |
20737 | 18888 | 606 | 823 | 부산광역시 | 영도구 | 청학동 | 391-95 | 우정길 | 20 | 0 | <NA> | 청학동 391-95 | <NA> | <NA> |
Row index | 순번 | 우편번호1 | 우편번호2 | 시도 | 시군구 | 읍면동 | 번지 | 도로명 | 건물번호 | 건물번호2 | 건물명 | 지번 | 신우편코드 | Unnamed: 13 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
17510 | 15551 | 606 | 41 | 부산광역시 | 영도구 | 영선동1가 | 08월 02일 | 태종로113번길 | 13 | 0 | <NA> | 영선동1가 8-2 | 49036 | <NA> |
22786 | 23191 | 606 | 821 | 부산광역시 | 영도구 | 청학동 | 126-49 | 태종로352번길 | 40 | 4 | <NA> | 청학동 126-49 | 49015 | <NA> |
10176 | 9033 | 606 | 811 | 부산광역시 | 영도구 | 봉래동4가 | 260-6 | 개량2길 | 37 | 0 | <NA> | 봉래동4가 260-6 | 49065 | <NA> |
21074 | 21317 | 606 | 821 | 부산광역시 | 영도구 | 청학동 | 159-12 | 청학북로 | 45 | 3 | <NA> | 청학동 159-12 | 49014 | <NA> |
12094 | 11848 | 606 | 51 | 부산광역시 | 영도구 | 신선동1가 | 218-11 | 진달래길 | 10 | 0 | <NA> | 신선동1가 218-11 | 49061 | <NA> |
20181 | 17274 | 606 | 816 | 부산광역시 | 영도구 | 영선동4가 | 58 | 영선대로 | 31 | 0 | <NA> | 영선동4가 58 | 49051 | <NA> |
756 | 2860 | 606 | 21 | 부산광역시 | 영도구 | 대평동1가 | 11355 | 대평로28번길 | 9 | 4 | <NA> | 대평동1가 31-2 | 49040 | <NA> |
5680 | 4148 | 606 | 80 | 부산광역시 | 영도구 | 동삼동 | 116-41 | 동삼로43번길 | 16 | 10 | <NA> | 동삼동 산116-41 | 49106 | <NA> |
13928 | 14115 | 606 | 53 | 부산광역시 | 영도구 | 신선동3가 | 131-12 | 상록수길 | 86 | 0 | <NA> | 신선동3가 131-12 | 49073 | <NA> |
22628 | 23828 | 606 | 823 | 부산광역시 | 영도구 | 청학동 | 468-429 | 해돋이3길 | 244 | 0 | <NA> | 청학동 468-429 | 49032 | <NA> |
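Some 번지 values in the sample rows look like lot numbers that a spreadsheet converted to dates. Examples are 34912 where 지번 shows 95-8, 16528 where it shows 45-4, 11355 where it shows 31-2, and "08월 02일" where it shows 8-2. The frequent "01월" and "01일" tokens in the 번지 frequency table fit the same pattern. The sketch below is one possible repair, assuming the five-digit values are Excel 1900-system date serials for the first day of a month. That assumption is inferred from these few rows, not confirmed by the data publisher, and the function name `repair_lot_number` is hypothetical.

```python
import re
from datetime import date, timedelta

# Day 0 of the Excel 1900 date system, valid for serials after February 1900.
EXCEL_EPOCH = date(1899, 12, 30)

# Matches values such as "08월 02일" (month 08, day 02).
MONTH_DAY = re.compile(r"(\d{1,2})월 (\d{1,2})일")


def repair_lot_number(raw):
    """Undo an assumed spreadsheet date conversion of a lot number.

    "08월 02일" -> "8-2"   (month-day text back to main-sub lot number)
    "34912"    -> "95-8"  (serial for 1995-08-01 back to yy-m)
    Anything else is returned unchanged.
    """
    match = MONTH_DAY.fullmatch(raw)
    if match:
        return f"{int(match.group(1))}-{int(match.group(2))}"
    if re.fullmatch(r"\d{5}", raw):
        day = EXCEL_EPOCH + timedelta(days=int(raw))
        # Only treat it as a mangled "yy-m" value if it lands on the 1st.
        if day.day == 1:
            return f"{day.year % 100}-{day.month}"
    return raw


for value in ["34912", "16528", "11355", "08월 02일", "201-12", "1148"]:
    print(value, "->", repair_lot_number(value))
```

A genuine five-digit lot number could be misread by this heuristic, so in practice the result should be compared against the lot number embedded in 지번 before overwriting anything.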