Dataset statistics
Number of variables | 17 |
---|---|
Number of observations | 10000 |
Missing cells | 18959 |
Missing cells (%) | 11.2% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.5 MiB |
Average record size in memory | 153.0 B |
Variable types
Numeric | 6 |
---|---|
Categorical | 5 |
Text | 4 |
Boolean | 1 |
Unsupported | 1 |
Dataset
Description | 부산광역시_북구_U옥외광고물통합관리시스템_새주소관리_20221108 |
---|---|
Author | 부산광역시 북구 |
URL | http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15050087 |
시도 has constant value "" | Constant |
시군구 has constant value "" | Constant |
도로위계 is highly overall correlated with 우편번호1 | High correlation |
읍면동 is highly overall correlated with 순번 and 3 other fields | High correlation |
우편번호1 is highly overall correlated with 순번 and 8 other fields | High correlation |
산(확인) is highly overall correlated with 우편번호1 | High correlation |
순번 is highly overall correlated with 우편번호2 and 3 other fields | High correlation |
우편번호2 is highly overall correlated with 순번 and 2 other fields | High correlation |
건물번호 is highly overall correlated with 우편번호1 | High correlation |
건물번호2 is highly overall correlated with 우편번호1 | High correlation |
연결이미지 is highly overall correlated with 순번 and 3 other fields | High correlation |
신우편번호 is highly overall correlated with 우편번호1 and 1 other fields | High correlation |
우편번호1 is highly imbalanced (84.7%) | Imbalance |
산(확인) is highly imbalanced (93.8%) | Imbalance |
도로위계 is highly imbalanced (57.3%) | Imbalance |
우편번호2 has 221 (2.2%) missing values | Missing |
도로명 has 156 (1.6%) missing values | Missing |
건물번호 has 154 (1.5%) missing values | Missing |
건물번호2 has 154 (1.5%) missing values | Missing |
건물명 has 7686 (76.9%) missing values | Missing |
지번 has 154 (1.5%) missing values | Missing |
연결이미지 has 154 (1.5%) missing values | Missing |
리 has 10000 (100.0%) missing values | Missing |
신우편번호 has 280 (2.8%) missing values | Missing |
순번 has unique values | Unique |
리 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
건물번호2 has 6668 (66.7%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 16:53:33.297807 |
---|---|
Analysis finished | 2023-12-10 16:53:44.733151 |
Duration | 11.44 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
순번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8814.7524 |
Minimum | 4 |
---|---|
Maximum | 17635 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 4 |
---|---|
5-th percentile | 873.65 |
Q1 | 4460.75 |
median | 8794.5 |
Q3 | 13144 |
95-th percentile | 16744.05 |
Maximum | 17635 |
Range | 17631 |
Interquartile range (IQR) | 8683.25 |
Descriptive statistics
Standard deviation | 5065.4618 |
---|---|
Coefficient of variation (CV) | 0.5746573 |
Kurtosis | -1.1797701 |
Mean | 8814.7524 |
Median Absolute Deviation (MAD) | 4338 |
Skewness | -0.004177846 |
Sum | 88147524 |
Variance | 25658904 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
17197 | 1 | < 0.1% |
4196 | 1 | < 0.1% |
7951 | 1 | < 0.1% |
2504 | 1 | < 0.1% |
10957 | 1 | < 0.1% |
3825 | 1 | < 0.1% |
15557 | 1 | < 0.1% |
15733 | 1 | < 0.1% |
14639 | 1 | < 0.1% |
2500 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
4 | 1 | |
7 | 1 | |
9 | 1 | |
10 | 1 | |
11 | 1 | |
13 | 1 | |
14 | 1 | |
17 | 1 | |
18 | 1 | |
20 | 1 |
Value | Count | Frequency (%) |
17635 | 1 | |
17632 | 1 | |
17631 | 1 | |
17629 | 1 | |
17628 | 1 | |
17627 | 1 | |
17626 | 1 | |
17624 | 1 | |
17623 | 1 | |
17622 | 1 |
우편번호1
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
616 | |
---|---|
<NA> | 221 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0221 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 616 |
---|---|
2nd row | 616 |
3rd row | 616 |
4th row | 616 |
5th row | 616 |
Common Values
Value | Count | Frequency (%) |
616 | 9779 | |
<NA> | 221 | 2.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
616 | 9779 | |
na | 221 | 2.2% |
우편번호2
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 133 |
---|---|
Distinct (%) | 1.4% |
Missing | 221 |
Missing (%) | 2.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 813.90101 |
Minimum | 90 |
---|---|
Maximum | 861 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 90 |
---|---|
5-th percentile | 800 |
Q1 | 804 |
median | 816 |
Q3 | 828 |
95-th percentile | 835 |
Maximum | 861 |
Range | 771 |
Interquartile range (IQR) | 24 |
Descriptive statistics
Standard deviation | 33.581339 |
---|---|
Coefficient of variation (CV) | 0.041259733 |
Kurtosis | 334.98949 |
Mean | 813.90101 |
Median Absolute Deviation (MAD) | 12 |
Skewness | -15.988812 |
Sum | 7959138 |
Variance | 1127.7063 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
830 | 697 | 7.0% |
802 | 552 | 5.5% |
800 | 525 | 5.2% |
831 | 490 | 4.9% |
801 | 463 | 4.6% |
806 | 451 | 4.5% |
803 | 409 | 4.1% |
817 | 407 | 4.1% |
807 | 400 | 4.0% |
805 | 385 | 3.9% |
Other values (123) | 5000 |
Value | Count | Frequency (%) |
90 | 10 | |
110 | 4 | < 0.1% |
120 | 2 | < 0.1% |
701 | 2 | < 0.1% |
702 | 6 | |
703 | 4 | < 0.1% |
706 | 2 | < 0.1% |
715 | 3 | < 0.1% |
716 | 4 | < 0.1% |
718 | 2 | < 0.1% |
Value | Count | Frequency (%) |
861 | 4 | < 0.1% |
854 | 63 | |
853 | 25 | 0.2% |
852 | 98 | |
851 | 6 | 0.1% |
849 | 9 | 0.1% |
848 | 2 | < 0.1% |
847 | 2 | < 0.1% |
846 | 121 | |
845 | 35 | 0.4% |
시도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
부산광역시 |
---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 부산광역시 |
---|---|
2nd row | 부산광역시 |
3rd row | 부산광역시 |
4th row | 부산광역시 |
5th row | 부산광역시 |
Common Values
Value | Count | Frequency (%) |
부산광역시 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
부산광역시 | 10000 |
시군구
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
북구 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 북구 |
---|---|
2nd row | 북구 |
3rd row | 북구 |
4th row | 북구 |
5th row | 북구 |
Common Values
Value | Count | Frequency (%) |
북구 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
북구 | 10000 |
읍면동
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
구포동 | |
---|---|
만덕동 | |
덕천동 | |
화명동 | |
금곡동 | 413 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 화명동 |
---|---|
2nd row | 구포동 |
3rd row | 만덕동 |
4th row | 만덕동 |
5th row | 구포동 |
Common Values
Value | Count | Frequency (%) |
구포동 | 4155 | |
만덕동 | 2400 | |
덕천동 | 2125 | |
화명동 | 907 | 9.1% |
금곡동 | 413 | 4.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
구포동 | 4155 | |
만덕동 | 2400 | |
덕천동 | 2125 | |
화명동 | 907 | 9.1% |
금곡동 | 413 | 4.1% |
번지
Text
Distinct | 7739 |
---|---|
Distinct (%) | 77.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
825-10 | 16 | 0.2% |
825-11 | 13 | 0.1% |
216-7 | 12 | 0.1% |
824 | 10 | 0.1% |
1060-316 | 9 | 0.1% |
1150 | 9 | 0.1% |
1170-1 | 8 | 0.1% |
35796 | 8 | 0.1% |
774 | 8 | 0.1% |
2306 | 8 | 0.1% |
Other values (7734) | 9919 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 9662 | |
- | 8723 | |
2 | 6479 | |
3 | 5240 | |
8 | 4406 | |
4 | 4382 | |
0 | 3912 | |
5 | 3391 | 6.0% |
6 | 3268 | 5.8% |
9 | 3268 | 5.8% |
Other values (4) | 3326 | 5.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 47274 | |
Dash Punctuation | 8723 | 15.6% |
Other Letter | 40 | 0.1% |
Space Separator | 20 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 9662 | |
2 | 6479 | |
3 | 5240 | |
8 | 4406 | |
4 | 4382 | |
0 | 3912 | |
5 | 3391 | 7.2% |
6 | 3268 | 6.9% |
9 | 3268 | 6.9% |
7 | 3266 | 6.9% |
Other Letter
Value | Count | Frequency (%) |
월 | 20 | |
일 | 20 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 8723 |
Space Separator
Value | Count | Frequency (%) |
20 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 56017 | |
Hangul | 40 | 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 9662 | |
- | 8723 | |
2 | 6479 | |
3 | 5240 | |
8 | 4406 | |
4 | 4382 | |
0 | 3912 | |
5 | 3391 | 6.1% |
6 | 3268 | 5.8% |
9 | 3268 | 5.8% |
Other values (2) | 3286 | 5.9% |
Hangul
Value | Count | Frequency (%) |
월 | 20 | |
일 | 20 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 56017 | |
Hangul | 40 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 9662 | |
- | 8723 | |
2 | 6479 | |
3 | 5240 | |
8 | 4406 | |
4 | 4382 | |
0 | 3912 | |
5 | 3391 | 6.1% |
6 | 3268 | 5.8% |
9 | 3268 | 5.8% |
Other values (2) | 3286 | 5.9% |
Hangul
Value | Count | Frequency (%) |
월 | 20 | |
일 | 20 |
도로명
Text
MISSING
 
Distinct | 423 |
---|---|
Distinct (%) | 4.3% |
Missing | 156 |
Missing (%) | 1.6% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
금곡대로 | 219 | 2.2% |
덕천로 | 201 | 2.0% |
백양대로 | 199 | 2.0% |
시랑로 | 180 | 1.8% |
모분재로 | 161 | 1.6% |
상학로 | 154 | 1.6% |
만덕대로 | 122 | 1.2% |
만덕1로42번길 | 121 | 1.2% |
의성로121번길 | 120 | 1.2% |
만덕1로 | 115 | 1.2% |
Other values (413) | 8252 |
Most occurring characters
Value | Count | Frequency (%) |
로 | 9299 | 13.7% |
길 | 7403 | 10.9% |
번 | 6858 | 10.1% |
1 | 4619 | 6.8% |
대 | 2727 | 4.0% |
덕 | 2460 | 3.6% |
2 | 2438 | 3.6% |
3 | 1814 | 2.7% |
6 | 1747 | 2.6% |
만 | 1572 | 2.3% |
Other values (72) | 27156 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 49642 | |
Decimal Number | 18451 | 27.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
로 | 9299 | |
길 | 7403 | |
번 | 6858 | |
대 | 2727 | 5.5% |
덕 | 2460 | 5.0% |
만 | 1572 | 3.2% |
시 | 1167 | 2.4% |
천 | 960 | 1.9% |
성 | 948 | 1.9% |
상 | 947 | 1.9% |
Other values (62) | 15301 |
Decimal Number
Value | Count | Frequency (%) |
1 | 4619 | |
2 | 2438 | |
3 | 1814 | 9.8% |
6 | 1747 | 9.5% |
0 | 1544 | 8.4% |
5 | 1450 | 7.9% |
7 | 1445 | 7.8% |
4 | 1386 | 7.5% |
8 | 1299 | 7.0% |
9 | 709 | 3.8% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 49642 | |
Common | 18451 | 27.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
로 | 9299 | |
길 | 7403 | |
번 | 6858 | |
대 | 2727 | 5.5% |
덕 | 2460 | 5.0% |
만 | 1572 | 3.2% |
시 | 1167 | 2.4% |
천 | 960 | 1.9% |
성 | 948 | 1.9% |
상 | 947 | 1.9% |
Other values (62) | 15301 |
Common
Value | Count | Frequency (%) |
1 | 4619 | |
2 | 2438 | |
3 | 1814 | 9.8% |
6 | 1747 | 9.5% |
0 | 1544 | 8.4% |
5 | 1450 | 7.9% |
7 | 1445 | 7.8% |
4 | 1386 | 7.5% |
8 | 1299 | 7.0% |
9 | 709 | 3.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 49642 | |
ASCII | 18451 | 27.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
로 | 9299 | |
길 | 7403 | |
번 | 6858 | |
대 | 2727 | 5.5% |
덕 | 2460 | 5.0% |
만 | 1572 | 3.2% |
시 | 1167 | 2.4% |
천 | 960 | 1.9% |
성 | 948 | 1.9% |
상 | 947 | 1.9% |
Other values (62) | 15301 |
ASCII
Value | Count | Frequency (%) |
1 | 4619 | |
2 | 2438 | |
3 | 1814 | 9.8% |
6 | 1747 | 9.5% |
0 | 1544 | 8.4% |
5 | 1450 | 7.9% |
7 | 1445 | 7.8% |
4 | 1386 | 7.5% |
8 | 1299 | 7.0% |
9 | 709 | 3.8% |
건물번호
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 573 |
---|---|
Distinct (%) | 5.8% |
Missing | 154 |
Missing (%) | 1.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 90.708511 |
Minimum | 1 |
---|---|
Maximum | 1791 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5 |
Q1 | 15 |
median | 32 |
Q3 | 63 |
95-th percentile | 316 |
Maximum | 1791 |
Range | 1790 |
Interquartile range (IQR) | 48 |
Descriptive statistics
Standard deviation | 228.8445 |
---|---|
Coefficient of variation (CV) | 2.5228558 |
Kurtosis | 27.220526 |
Mean | 90.708511 |
Median Absolute Deviation (MAD) | 20 |
Skewness | 5.045645 |
Sum | 893116 |
Variance | 52369.803 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9 | 235 | 2.4% |
15 | 219 | 2.2% |
7 | 215 | 2.1% |
13 | 215 | 2.1% |
14 | 213 | 2.1% |
11 | 204 | 2.0% |
10 | 202 | 2.0% |
20 | 199 | 2.0% |
16 | 196 | 2.0% |
21 | 195 | 1.9% |
Other values (563) | 7753 |
Value | Count | Frequency (%) |
1 | 40 | 0.4% |
2 | 69 | 0.7% |
3 | 116 | |
4 | 120 | |
5 | 170 | |
6 | 172 | |
7 | 215 | |
8 | 177 | |
9 | 235 | |
10 | 202 |
Value | Count | Frequency (%) |
1791 | 1 | < 0.1% |
1790 | 1 | < 0.1% |
1787 | 1 | < 0.1% |
1780 | 1 | < 0.1% |
1778 | 1 | < 0.1% |
1773 | 1 | < 0.1% |
1772 | 2 | |
1771 | 1 | < 0.1% |
1770 | 1 | < 0.1% |
1768 | 3 |
건물번호2
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 61 |
---|---|
Distinct (%) | 0.6% |
Missing | 154 |
Missing (%) | 1.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.202925 |
Minimum | -1 |
---|---|
Maximum | 138 |
Zeros | 6668 |
Zeros (%) | 66.7% |
Negative | 1 |
Negative (%) | < 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -1 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 13 |
Maximum | 138 |
Range | 139 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 6.255379 |
---|---|
Coefficient of variation (CV) | 2.8395787 |
Kurtosis | 97.831195 |
Mean | 2.202925 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 7.4849132 |
Sum | 21690 |
Variance | 39.129767 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 6668 | |
1 | 1131 | 11.3% |
2 | 250 | 2.5% |
5 | 167 | 1.7% |
6 | 157 | 1.6% |
4 | 151 | 1.5% |
7 | 148 | 1.5% |
8 | 142 | 1.4% |
3 | 139 | 1.4% |
10 | 112 | 1.1% |
Other values (51) | 781 | 7.8% |
(Missing) | 154 | 1.5% |
Value | Count | Frequency (%) |
-1 | 1 | < 0.1% |
0 | 6668 | |
1 | 1131 | 11.3% |
2 | 250 | 2.5% |
3 | 139 | 1.4% |
4 | 151 | 1.5% |
5 | 167 | 1.7% |
6 | 157 | 1.6% |
7 | 148 | 1.5% |
8 | 142 | 1.4% |
Value | Count | Frequency (%) |
138 | 1 | < 0.1% |
118 | 1 | < 0.1% |
116 | 1 | < 0.1% |
115 | 1 | < 0.1% |
100 | 1 | < 0.1% |
98 | 1 | < 0.1% |
96 | 1 | < 0.1% |
87 | 1 | < 0.1% |
81 | 1 | < 0.1% |
79 | 3 |
건물명
Text
MISSING
 
Distinct | 1515 |
---|---|
Distinct (%) | 65.5% |
Missing | 7686 |
Missing (%) | 76.9% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
시영아파트 | 19 | 0.8% |
주공아파트 | 18 | 0.7% |
빌라 | 13 | 0.5% |
그린코아 | 12 | 0.5% |
롯데아파트 | 12 | 0.5% |
주민센터 | 11 | 0.4% |
럭키만덕아파트 | 10 | 0.4% |
동원로얄듀크 | 9 | 0.4% |
금강빌라 | 9 | 0.4% |
유림아파트 | 8 | 0.3% |
Other values (1581) | 2403 |
Most occurring characters
Value | Count | Frequency (%) |
빌 | 622 | 4.7% |
라 | 515 | 3.9% |
아 | 420 | 3.1% |
트 | 376 | 2.8% |
파 | 350 | 2.6% |
원 | 235 | 1.8% |
동 | 234 | 1.8% |
210 | 1.6% | |
대 | 197 | 1.5% |
이 | 185 | 1.4% |
Other values (525) | 10013 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 12848 | |
Space Separator | 210 | 1.6% |
Decimal Number | 125 | 0.9% |
Uppercase Letter | 99 | 0.7% |
Open Punctuation | 23 | 0.2% |
Close Punctuation | 23 | 0.2% |
Other Punctuation | 15 | 0.1% |
Lowercase Letter | 11 | 0.1% |
Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
빌 | 622 | 4.8% |
라 | 515 | 4.0% |
아 | 420 | 3.3% |
트 | 376 | 2.9% |
파 | 350 | 2.7% |
원 | 235 | 1.8% |
동 | 234 | 1.8% |
대 | 197 | 1.5% |
이 | 185 | 1.4% |
화 | 181 | 1.4% |
Other values (482) | 9533 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 18 | |
G | 15 | |
S | 15 | |
V | 10 | |
L | 10 | |
K | 8 | |
P | 4 | 4.0% |
M | 3 | 3.0% |
O | 2 | 2.0% |
T | 2 | 2.0% |
Other values (9) | 12 |
Decimal Number
Value | Count | Frequency (%) |
2 | 55 | |
3 | 25 | |
1 | 17 | 13.6% |
8 | 8 | 6.4% |
4 | 8 | 6.4% |
5 | 6 | 4.8% |
7 | 4 | 3.2% |
6 | 2 | 1.6% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 3 | |
x | 2 | |
t | 1 | 9.1% |
a | 1 | 9.1% |
m | 1 | 9.1% |
d | 1 | 9.1% |
p | 1 | 9.1% |
i | 1 | 9.1% |
Other Punctuation
Value | Count | Frequency (%) |
· | 8 | |
. | 3 | 20.0% |
, | 2 | 13.3% |
& | 2 | 13.3% |
Space Separator
Value | Count | Frequency (%) |
210 |
Open Punctuation
Value | Count | Frequency (%) |
( | 23 |
Close Punctuation
Value | Count | Frequency (%) |
) | 23 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 12848 | |
Common | 399 | 3.0% |
Latin | 110 | 0.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
빌 | 622 | 4.8% |
라 | 515 | 4.0% |
아 | 420 | 3.3% |
트 | 376 | 2.9% |
파 | 350 | 2.7% |
원 | 235 | 1.8% |
동 | 234 | 1.8% |
대 | 197 | 1.5% |
이 | 185 | 1.4% |
화 | 181 | 1.4% |
Other values (482) | 9533 |
Latin
Value | Count | Frequency (%) |
I | 18 | |
G | 15 | |
S | 15 | |
V | 10 | |
L | 10 | |
K | 8 | 7.3% |
P | 4 | 3.6% |
e | 3 | 2.7% |
M | 3 | 2.7% |
x | 2 | 1.8% |
Other values (17) | 22 |
Common
Value | Count | Frequency (%) |
210 | ||
2 | 55 | 13.8% |
3 | 25 | 6.3% |
( | 23 | 5.8% |
) | 23 | 5.8% |
1 | 17 | 4.3% |
8 | 8 | 2.0% |
· | 8 | 2.0% |
4 | 8 | 2.0% |
5 | 6 | 1.5% |
Other values (6) | 16 | 4.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 12848 | |
ASCII | 501 | 3.8% |
None | 8 | 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
빌 | 622 | 4.8% |
라 | 515 | 4.0% |
아 | 420 | 3.3% |
트 | 376 | 2.9% |
파 | 350 | 2.7% |
원 | 235 | 1.8% |
동 | 234 | 1.8% |
대 | 197 | 1.5% |
이 | 185 | 1.4% |
화 | 181 | 1.4% |
Other values (482) | 9533 |
ASCII
Value | Count | Frequency (%) |
210 | ||
2 | 55 | 11.0% |
3 | 25 | 5.0% |
( | 23 | 4.6% |
) | 23 | 4.6% |
I | 18 | 3.6% |
1 | 17 | 3.4% |
G | 15 | 3.0% |
S | 15 | 3.0% |
V | 10 | 2.0% |
Other values (32) | 90 |
None
Value | Count | Frequency (%) |
· | 8 |
산(확인)
Boolean
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 87.9 KiB |
False | |
---|---|
True | 72 |
Value | Count | Frequency (%) |
False | 9928 | |
True | 72 | 0.7% |
도로위계
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
4 | |
---|---|
2 | |
1 | 678 |
<NA> | 156 |
5 | 11 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0468 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4 |
---|---|
2nd row | 4 |
3rd row | 4 |
4th row | 4 |
5th row | 4 |
Common Values
Value | Count | Frequency (%) |
4 | 7543 | |
2 | 1607 | 16.1% |
1 | 678 | 6.8% |
<NA> | 156 | 1.6% |
5 | 11 | 0.1% |
3 | 5 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
4 | 7543 | |
2 | 1607 | 16.1% |
1 | 678 | 6.8% |
na | 156 | 1.6% |
5 | 11 | 0.1% |
3 | 5 | < 0.1% |
지번
Text
MISSING
 
Distinct | 8039 |
---|---|
Distinct (%) | 81.6% |
Missing | 154 |
Missing (%) | 1.5% |
Memory size | 156.2 KiB |
Length
Max length | 12 |
---|---|
Median length | 11 |
Mean length | 9.6073532 |
Min length | 5 |
Characters and Unicode
Total characters | 94594 |
---|---|
Distinct characters | 23 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 6733 ? |
---|---|
Unique (%) | 68.4% |
Sample
1st row | 화명동 803 |
---|---|
2nd row | 구포동 1251-6 |
3rd row | 만덕동 888-3 |
4th row | 만덕동 883-7 |
5th row | 구포동 1240-4 |
Value | Count | Frequency (%) |
구포동 | 4101 | |
만덕동 | 2371 | 12.0% |
덕천동 | 2105 | 10.7% |
화명동 | 869 | 4.4% |
금곡동 | 400 | 2.0% |
825-10 | 16 | 0.1% |
825-11 | 13 | 0.1% |
216-7 | 12 | 0.1% |
824 | 10 | 0.1% |
1150 | 9 | < 0.1% |
Other values (7627) | 9786 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 9846 | 10.4% |
9846 | 10.4% | |
1 | 9577 | 10.1% |
- | 9015 | 9.5% |
2 | 6365 | 6.7% |
3 | 5061 | 5.4% |
덕 | 4476 | 4.7% |
8 | 4286 | 4.5% |
4 | 4191 | 4.4% |
포 | 4101 | 4.3% |
Other values (13) | 27830 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 46123 | |
Other Letter | 29610 | |
Space Separator | 9846 | 10.4% |
Dash Punctuation | 9015 | 9.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 9846 | |
덕 | 4476 | |
포 | 4101 | |
구 | 4101 | |
만 | 2371 | 8.0% |
천 | 2105 | 7.1% |
명 | 869 | 2.9% |
화 | 869 | 2.9% |
금 | 400 | 1.4% |
곡 | 400 | 1.4% |
Decimal Number
Value | Count | Frequency (%) |
1 | 9577 | |
2 | 6365 | |
3 | 5061 | |
8 | 4286 | |
4 | 4191 | |
0 | 3762 | 8.2% |
5 | 3273 | 7.1% |
7 | 3256 | 7.1% |
9 | 3197 | 6.9% |
6 | 3155 | 6.8% |
Space Separator
Value | Count | Frequency (%) |
9846 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9015 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 64984 | |
Hangul | 29610 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
9846 | ||
1 | 9577 | |
- | 9015 | |
2 | 6365 | |
3 | 5061 | |
8 | 4286 | |
4 | 4191 | |
0 | 3762 | 5.8% |
5 | 3273 | 5.0% |
7 | 3256 | 5.0% |
Other values (2) | 6352 |
Hangul
Value | Count | Frequency (%) |
동 | 9846 | |
덕 | 4476 | |
포 | 4101 | |
구 | 4101 | |
만 | 2371 | 8.0% |
천 | 2105 | 7.1% |
명 | 869 | 2.9% |
화 | 869 | 2.9% |
금 | 400 | 1.4% |
곡 | 400 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 64984 | |
Hangul | 29610 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 9846 | |
덕 | 4476 | |
포 | 4101 | |
구 | 4101 | |
만 | 2371 | 8.0% |
천 | 2105 | 7.1% |
명 | 869 | 2.9% |
화 | 869 | 2.9% |
금 | 400 | 1.4% |
곡 | 400 | 1.4% |
ASCII
Value | Count | Frequency (%) |
9846 | ||
1 | 9577 | |
- | 9015 | |
2 | 6365 | |
3 | 5061 | |
8 | 4286 | |
4 | 4191 | |
0 | 3762 | 5.8% |
5 | 3273 | 5.0% |
7 | 3256 | 5.0% |
Other values (2) | 6352 |
연결이미지
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 770 |
---|---|
Distinct (%) | 7.8% |
Missing | 154 |
Missing (%) | 1.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 24893.541 |
Minimum | 4061 |
---|---|
Maximum | 58037 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 4061 |
---|---|
5-th percentile | 10056 |
Q1 | 17054 |
median | 22046 |
Q3 | 32023.75 |
95-th percentile | 45054 |
Maximum | 58037 |
Range | 53976 |
Interquartile range (IQR) | 14969.75 |
Descriptive statistics
Standard deviation | 11439.557 |
---|---|
Coefficient of variation (CV) | 0.45953917 |
Kurtosis | -0.6585328 |
Mean | 24893.541 |
Median Absolute Deviation (MAD) | 5993 |
Skewness | 0.61916485 |
Sum | 2.4510181 × 108 |
Variance | 1.3086347 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
41046 | 72 | 0.7% |
43046 | 72 | 0.7% |
44045 | 68 | 0.7% |
40046 | 66 | 0.7% |
42046 | 65 | 0.7% |
16053 | 62 | 0.6% |
40047 | 61 | 0.6% |
43047 | 60 | 0.6% |
43045 | 59 | 0.6% |
20053 | 55 | 0.5% |
Other values (760) | 9206 | |
(Missing) | 154 | 1.5% |
Value | Count | Frequency (%) |
4061 | 3 | < 0.1% |
4062 | 8 | |
5061 | 13 | |
5062 | 16 | |
5063 | 2 | < 0.1% |
6059 | 3 | < 0.1% |
6060 | 8 | |
6061 | 10 | |
6062 | 15 | |
6063 | 15 |
Value | Count | Frequency (%) |
58037 | 3 | |
58036 | 3 | |
53049 | 2 | |
53046 | 1 | < 0.1% |
53042 | 1 | < 0.1% |
52050 | 1 | < 0.1% |
52049 | 3 | |
52047 | 1 | < 0.1% |
52042 | 1 | < 0.1% |
52041 | 1 | < 0.1% |
리
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
신우편번호
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 154 |
---|---|
Distinct (%) | 1.6% |
Missing | 280 |
Missing (%) | 2.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 46580.604 |
Minimum | 46500 |
---|---|
Maximum | 46653 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 46500 |
---|---|
5-th percentile | 46506 |
Q1 | 46554 |
median | 46579 |
Q3 | 46614 |
95-th percentile | 46646.05 |
Maximum | 46653 |
Range | 153 |
Interquartile range (IQR) | 60 |
Descriptive statistics
Standard deviation | 40.129383 |
---|---|
Coefficient of variation (CV) | 0.00086150413 |
Kurtosis | -0.85514101 |
Mean | 46580.604 |
Median Absolute Deviation (MAD) | 30 |
Skewness | -0.064482346 |
Sum | 4.5276347 × 108 |
Variance | 1610.3674 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
46557 | 848 | 8.5% |
46585 | 246 | 2.5% |
46609 | 166 | 1.7% |
46615 | 153 | 1.5% |
46552 | 147 | 1.5% |
46574 | 146 | 1.5% |
46578 | 146 | 1.5% |
46595 | 137 | 1.4% |
46596 | 134 | 1.3% |
46546 | 132 | 1.3% |
Other values (144) | 7465 | |
(Missing) | 280 | 2.8% |
Value | Count | Frequency (%) |
46500 | 14 | 0.1% |
46501 | 103 | |
46502 | 69 | |
46503 | 66 | |
46504 | 57 | |
46505 | 67 | |
46506 | 122 | |
46507 | 18 | 0.2% |
46508 | 25 | 0.2% |
46509 | 12 | 0.1% |
Value | Count | Frequency (%) |
46653 | 85 | |
46652 | 59 | |
46651 | 3 | < 0.1% |
46650 | 39 | 0.4% |
46649 | 124 | |
46648 | 81 | |
46647 | 95 | |
46646 | 24 | 0.2% |
46645 | 39 | 0.4% |
46644 | 46 | 0.5% |
순번 | 우편번호2 | 읍면동 | 건물번호 | 건물번호2 | 산(확인) | 도로위계 | 연결이미지 | 신우편번호 | |
---|---|---|---|---|---|---|---|---|---|
순번 | 1.000 | 0.142 | 0.987 | 0.402 | 0.097 | 0.065 | 0.386 | 0.880 | 0.895 |
우편번호2 | 0.142 | 1.000 | 0.167 | 0.284 | 0.052 | 0.000 | 0.086 | 0.144 | 0.152 |
읍면동 | 0.987 | 0.167 | 1.000 | 0.326 | 0.122 | 0.029 | 0.352 | 0.912 | 0.956 |
건물번호 | 0.402 | 0.284 | 0.326 | 1.000 | 0.000 | 0.000 | 0.574 | 0.324 | 0.361 |
건물번호2 | 0.097 | 0.052 | 0.122 | 0.000 | 1.000 | 0.268 | 0.653 | 0.248 | 0.131 |
산(확인) | 0.065 | 0.000 | 0.029 | 0.000 | 0.268 | 1.000 | 0.039 | 0.403 | 0.072 |
도로위계 | 0.386 | 0.086 | 0.352 | 0.574 | 0.653 | 0.039 | 1.000 | 0.316 | 0.409 |
연결이미지 | 0.880 | 0.144 | 0.912 | 0.324 | 0.248 | 0.403 | 0.316 | 1.000 | 0.825 |
신우편번호 | 0.895 | 0.152 | 0.956 | 0.361 | 0.131 | 0.072 | 0.409 | 0.825 | 1.000 |
도로위계 | 읍면동 | 우편번호1 | 산(확인) | |
---|---|---|---|---|
도로위계 | 1.000 | 0.137 | 1.000 | 0.048 |
읍면동 | 0.137 | 1.000 | 1.000 | 0.036 |
우편번호1 | 1.000 | 1.000 | 1.000 | 1.000 |
산(확인) | 0.048 | 0.036 | 1.000 | 1.000 |
순번 | 우편번호2 | 건물번호 | 건물번호2 | 연결이미지 | 신우편번호 | 우편번호1 | 읍면동 | 산(확인) | 도로위계 | |
---|---|---|---|---|---|---|---|---|---|---|
순번 | 1.000 | 0.802 | -0.030 | -0.036 | 0.802 | -0.449 | 1.000 | 0.836 | 0.049 | 0.170 |
우편번호2 | 0.802 | 1.000 | -0.022 | -0.016 | 0.674 | -0.346 | 1.000 | 0.137 | 0.000 | 0.070 |
건물번호 | -0.030 | -0.022 | 1.000 | -0.021 | -0.017 | -0.097 | 1.000 | 0.195 | 0.000 | 0.378 |
건물번호2 | -0.036 | -0.016 | -0.021 | 1.000 | -0.020 | 0.014 | 1.000 | 0.051 | 0.205 | 0.328 |
연결이미지 | 0.802 | 0.674 | -0.017 | -0.020 | 1.000 | -0.243 | 1.000 | 0.610 | 0.309 | 0.137 |
신우편번호 | -0.449 | -0.346 | -0.097 | 0.014 | -0.243 | 1.000 | 1.000 | 0.710 | 0.055 | 0.183 |
우편번호1 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
읍면동 | 0.836 | 0.137 | 0.195 | 0.051 | 0.610 | 0.710 | 1.000 | 1.000 | 0.036 | 0.137 |
산(확인) | 0.049 | 0.000 | 0.000 | 0.205 | 0.309 | 0.055 | 1.000 | 0.036 | 1.000 | 0.048 |
도로위계 | 0.170 | 0.070 | 0.378 | 0.328 | 0.137 | 0.183 | 1.000 | 0.137 | 0.048 | 1.000 |
순번 | 우편번호1 | 우편번호2 | 시도 | 시군구 | 읍면동 | 번지 | 도로명 | 건물번호 | 건물번호2 | 건물명 | 산(확인) | 도로위계 | 지번 | 연결이미지 | 리 | 신우편번호 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
17518 | 17197 | 616 | 845 | 부산광역시 | 북구 | 화명동 | 803 | 화명대로64번길 | 40 | 7 | <NA> | N | 4 | 화명동 803 | 27033 | <NA> | 46533 |
7141 | 5782 | 616 | 809 | 부산광역시 | 북구 | 구포동 | 1251-6 | 시랑로138번길 | 20 | 1 | <NA> | N | 4 | 구포동 1251-6 | 22063 | <NA> | 46644 |
11587 | 12369 | 616 | 831 | 부산광역시 | 북구 | 만덕동 | 888-3 | 덕천로380번길 | 43 | 0 | <NA> | N | 4 | 만덕동 888-3 | 49052 | <NA> | 46609 |
12753 | 12195 | 616 | 831 | 부산광역시 | 북구 | 만덕동 | 883-7 | 덕천로352번길 | 5 | 0 | <NA> | N | 4 | 만덕동 883-7 | 47052 | <NA> | 46609 |
5538 | 5895 | 616 | 808 | 부산광역시 | 북구 | 구포동 | 1240-4 | 시랑로163번길 | 28 | 7 | <NA> | N | 4 | 구포동 1240-4 | 25061 | <NA> | 46625 |
13324 | 11675 | 616 | 824 | 부산광역시 | 북구 | 만덕동 | 24898 | 구만덕로80번길 | 39 | 0 | <NA> | N | 4 | 만덕동 68-3 | 50048 | <NA> | 46608 |
10707 | 10051 | 616 | 816 | 부산광역시 | 북구 | 덕천동 | 303-10 | 만덕대로65번길 | 102 | 0 | <NA> | N | 4 | 덕천동 303-10 | 22046 | <NA> | 46549 |
7836 | 9875 | 616 | 817 | 부산광역시 | 북구 | 덕천동 | 316-5 | 만덕대로27번길 | 126 | 0 | 부광주택 | N | 4 | 덕천동 316-5 | 20047 | <NA> | 46546 |
15443 | 16683 | 616 | 834 | 부산광역시 | 북구 | 화명동 | 373-6 | 산성로48번길 | 9 | 9 | <NA> | N | 4 | 화명동 373-6 | 28027 | <NA> | 46529 |
11749 | 14606 | 616 | 830 | 부산광역시 | 북구 | 만덕동 | 817-58 | 상학로15번길 | 48 | 0 | <NA> | N | 4 | 만덕동 817-58 | 41046 | <NA> | 46557 |
순번 | 우편번호1 | 우편번호2 | 시도 | 시군구 | 읍면동 | 번지 | 도로명 | 건물번호 | 건물번호2 | 건물명 | 산(확인) | 도로위계 | 지번 | 연결이미지 | 리 | 신우편번호 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
13554 | 12507 | 616 | 831 | 부산광역시 | 북구 | 만덕동 | 887-16 | 덕천로386번길 | 84 | 0 | <NA> | N | 4 | 만덕동 887-16 | 48053 | <NA> | 46609 |
7993 | 10684 | 616 | 822 | 부산광역시 | 북구 | 덕천동 | 428-10 | 의성로121번길 | 34 | 0 | <NA> | N | 4 | 덕천동 428-10 | 24051 | <NA> | 46574 |
15729 | 16239 | 616 | 834 | 부산광역시 | 북구 | 화명동 | 1429-2 | 금곡대로324번길 | 45 | 0 | 우신아파트 | N | 4 | 화명동 1429-2 | 27029 | <NA> | 46531 |
13680 | 12027 | 616 | 827 | 부산광역시 | 북구 | 만덕동 | 840-37 | 덕천로298번길 | 10 | 0 | <NA> | N | 4 | 만덕동 840-37 | 42052 | <NA> | 46612 |
16213 | 16030 | 616 | 846 | 부산광역시 | 북구 | 화명동 | 899-2 | 금곡대로200번길 | 46 | 0 | <NA> | N | 4 | 화명동 899-2 | 24037 | <NA> | 46538 |
15832 | 16773 | 616 | 833 | 부산광역시 | 북구 | 화명동 | 188-4 | 양달로 | 3 | 0 | <NA> | N | 2 | 화명동 188-4 | 24026 | <NA> | 46518 |
13305 | 11656 | 616 | 827 | 부산광역시 | 북구 | 만덕동 | 447-8 | 구만덕로60번길 | 50 | 0 | <NA> | N | 4 | 만덕동 447-8 | 48047 | <NA> | 46607 |
13053 | 14010 | 616 | 831 | 부산광역시 | 북구 | 만덕동 | 868-8 | 상리로36번가길 | 9 | 0 | <NA> | N | 4 | 만덕동 868-8 | 44053 | <NA> | 46614 |
8411 | 8834 | 616 | 816 | 부산광역시 | 북구 | 덕천동 | 253-8 | 덕천로 | 66 | 7 | <NA> | N | 2 | 덕천동 253-8 | 23052 | <NA> | 46594 |
11030 | 10495 | 616 | 821 | 부산광역시 | 북구 | 덕천동 | 412-11 | 의성로115번길 | 15 | 6 | <NA> | N | 4 | 덕천동 412-11 | 23050 | <NA> | 46567 |