Overview

Dataset statistics

Number of variables17
Number of observations10000
Missing cells18959
Missing cells (%)11.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 MiB
Average record size in memory153.0 B

Variable types

Numeric6
Categorical5
Text4
Boolean1
Unsupported1

Dataset

Description부산광역시_북구_U옥외광고물통합관리시스템_새주소관리_20221108
Author부산광역시 북구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15050087

Alerts

시도 has constant value ""Constant
시군구 has constant value ""Constant
도로위계 is highly overall correlated with 우편번호1High correlation
읍면동 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
우편번호1 is highly overall correlated with 순번 and 8 other fieldsHigh correlation
산(확인) is highly overall correlated with 우편번호1High correlation
순번 is highly overall correlated with 우편번호2 and 3 other fieldsHigh correlation
우편번호2 is highly overall correlated with 순번 and 2 other fieldsHigh correlation
건물번호 is highly overall correlated with 우편번호1High correlation
건물번호2 is highly overall correlated with 우편번호1High correlation
연결이미지 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
신우편번호 is highly overall correlated with 우편번호1 and 1 other fieldsHigh correlation
우편번호1 is highly imbalanced (84.7%)Imbalance
산(확인) is highly imbalanced (93.8%)Imbalance
도로위계 is highly imbalanced (57.3%)Imbalance
우편번호2 has 221 (2.2%) missing valuesMissing
도로명 has 156 (1.6%) missing valuesMissing
건물번호 has 154 (1.5%) missing valuesMissing
건물번호2 has 154 (1.5%) missing valuesMissing
건물명 has 7686 (76.9%) missing valuesMissing
지번 has 154 (1.5%) missing valuesMissing
연결이미지 has 154 (1.5%) missing valuesMissing
has 10000 (100.0%) missing valuesMissing
신우편번호 has 280 (2.8%) missing valuesMissing
순번 has unique valuesUnique
is an unsupported type, check if it needs cleaning or further analysisUnsupported
건물번호2 has 6668 (66.7%) zerosZeros

Reproduction

Analysis started2023-12-10 16:53:33.297807
Analysis finished2023-12-10 16:53:44.733151
Duration11.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8814.7524
Minimum4
Maximum17635
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:53:44.870856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile873.65
Q14460.75
median8794.5
Q313144
95-th percentile16744.05
Maximum17635
Range17631
Interquartile range (IQR)8683.25

Descriptive statistics

Standard deviation5065.4618
Coefficient of variation (CV)0.5746573
Kurtosis-1.1797701
Mean8814.7524
Median Absolute Deviation (MAD)4338
Skewness-0.004177846
Sum88147524
Variance25658904
MonotonicityNot monotonic
2023-12-11T01:53:45.140829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
17197 1
 
< 0.1%
4196 1
 
< 0.1%
7951 1
 
< 0.1%
2504 1
 
< 0.1%
10957 1
 
< 0.1%
3825 1
 
< 0.1%
15557 1
 
< 0.1%
15733 1
 
< 0.1%
14639 1
 
< 0.1%
2500 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
4 1
< 0.1%
7 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
17 1
< 0.1%
18 1
< 0.1%
20 1
< 0.1%
ValueCountFrequency (%)
17635 1
< 0.1%
17632 1
< 0.1%
17631 1
< 0.1%
17629 1
< 0.1%
17628 1
< 0.1%
17627 1
< 0.1%
17626 1
< 0.1%
17624 1
< 0.1%
17623 1
< 0.1%
17622 1
< 0.1%

우편번호1
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
616
9779 
<NA>
 
221

Length

Max length4
Median length3
Mean length3.0221
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row616
2nd row616
3rd row616
4th row616
5th row616

Common Values

ValueCountFrequency (%)
616 9779
97.8%
<NA> 221
 
2.2%

Length

2023-12-11T01:53:45.382660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:53:45.544287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
616 9779
97.8%
na 221
 
2.2%

우편번호2
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct133
Distinct (%)1.4%
Missing221
Missing (%)2.2%
Infinite0
Infinite (%)0.0%
Mean813.90101
Minimum90
Maximum861
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:53:45.727869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum90
5-th percentile800
Q1804
median816
Q3828
95-th percentile835
Maximum861
Range771
Interquartile range (IQR)24

Descriptive statistics

Standard deviation33.581339
Coefficient of variation (CV)0.041259733
Kurtosis334.98949
Mean813.90101
Median Absolute Deviation (MAD)12
Skewness-15.988812
Sum7959138
Variance1127.7063
MonotonicityNot monotonic
2023-12-11T01:53:45.983903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
830 697
 
7.0%
802 552
 
5.5%
800 525
 
5.2%
831 490
 
4.9%
801 463
 
4.6%
806 451
 
4.5%
803 409
 
4.1%
817 407
 
4.1%
807 400
 
4.0%
805 385
 
3.9%
Other values (123) 5000
50.0%
ValueCountFrequency (%)
90 10
0.1%
110 4
 
< 0.1%
120 2
 
< 0.1%
701 2
 
< 0.1%
702 6
0.1%
703 4
 
< 0.1%
706 2
 
< 0.1%
715 3
 
< 0.1%
716 4
 
< 0.1%
718 2
 
< 0.1%
ValueCountFrequency (%)
861 4
 
< 0.1%
854 63
0.6%
853 25
 
0.2%
852 98
1.0%
851 6
 
0.1%
849 9
 
0.1%
848 2
 
< 0.1%
847 2
 
< 0.1%
846 121
1.2%
845 35
 
0.4%

시도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
부산광역시
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 10000
100.0%

Length

2023-12-11T01:53:46.196572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:53:46.338565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 10000
100.0%

시군구
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
북구
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row북구
2nd row북구
3rd row북구
4th row북구
5th row북구

Common Values

ValueCountFrequency (%)
북구 10000
100.0%

Length

2023-12-11T01:53:46.498496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:53:46.701169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
북구 10000
100.0%

읍면동
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
구포동
4155 
만덕동
2400 
덕천동
2125 
화명동
907 
금곡동
 
413

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row화명동
2nd row구포동
3rd row만덕동
4th row만덕동
5th row구포동

Common Values

ValueCountFrequency (%)
구포동 4155
41.5%
만덕동 2400
24.0%
덕천동 2125
21.2%
화명동 907
 
9.1%
금곡동 413
 
4.1%

Length

2023-12-11T01:53:46.852104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:53:47.015133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
구포동 4155
41.5%
만덕동 2400
24.0%
덕천동 2125
21.2%
화명동 907
 
9.1%
금곡동 413
 
4.1%

번지
Text

Distinct7739
Distinct (%)77.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:53:47.554555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length5.6057
Min length1

Characters and Unicode

Total characters56057
Distinct characters14
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6156 ?
Unique (%)61.6%

Sample

1st row803
2nd row1251-6
3rd row888-3
4th row883-7
5th row1240-4
ValueCountFrequency (%)
825-10 16
 
0.2%
825-11 13
 
0.1%
216-7 12
 
0.1%
824 10
 
0.1%
1060-316 9
 
0.1%
1150 9
 
0.1%
1170-1 8
 
0.1%
35796 8
 
0.1%
774 8
 
0.1%
2306 8
 
0.1%
Other values (7734) 9919
99.0%
2023-12-11T01:53:48.302106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 9662
17.2%
- 8723
15.6%
2 6479
11.6%
3 5240
9.3%
8 4406
7.9%
4 4382
7.8%
0 3912
7.0%
5 3391
 
6.0%
6 3268
 
5.8%
9 3268
 
5.8%
Other values (4) 3326
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 47274
84.3%
Dash Punctuation 8723
 
15.6%
Other Letter 40
 
0.1%
Space Separator 20
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 9662
20.4%
2 6479
13.7%
3 5240
11.1%
8 4406
9.3%
4 4382
9.3%
0 3912
8.3%
5 3391
 
7.2%
6 3268
 
6.9%
9 3268
 
6.9%
7 3266
 
6.9%
Other Letter
ValueCountFrequency (%)
20
50.0%
20
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 8723
100.0%
Space Separator
ValueCountFrequency (%)
20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 56017
99.9%
Hangul 40
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 9662
17.2%
- 8723
15.6%
2 6479
11.6%
3 5240
9.4%
8 4406
7.9%
4 4382
7.8%
0 3912
7.0%
5 3391
 
6.1%
6 3268
 
5.8%
9 3268
 
5.8%
Other values (2) 3286
 
5.9%
Hangul
ValueCountFrequency (%)
20
50.0%
20
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 56017
99.9%
Hangul 40
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 9662
17.2%
- 8723
15.6%
2 6479
11.6%
3 5240
9.4%
8 4406
7.9%
4 4382
7.8%
0 3912
7.0%
5 3391
 
6.1%
6 3268
 
5.8%
9 3268
 
5.8%
Other values (2) 3286
 
5.9%
Hangul
ValueCountFrequency (%)
20
50.0%
20
50.0%

도로명
Text

MISSING 

Distinct423
Distinct (%)4.3%
Missing156
Missing (%)1.6%
Memory size156.2 KiB
2023-12-11T01:53:48.748264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length6.9172085
Min length3

Characters and Unicode

Total characters68093
Distinct characters82
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row화명대로64번길
2nd row시랑로138번길
3rd row덕천로380번길
4th row덕천로352번길
5th row시랑로163번길
ValueCountFrequency (%)
금곡대로 219
 
2.2%
덕천로 201
 
2.0%
백양대로 199
 
2.0%
시랑로 180
 
1.8%
모분재로 161
 
1.6%
상학로 154
 
1.6%
만덕대로 122
 
1.2%
만덕1로42번길 121
 
1.2%
의성로121번길 120
 
1.2%
만덕1로 115
 
1.2%
Other values (413) 8252
83.8%
2023-12-11T01:53:49.343247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9299
 
13.7%
7403
 
10.9%
6858
 
10.1%
1 4619
 
6.8%
2727
 
4.0%
2460
 
3.6%
2 2438
 
3.6%
3 1814
 
2.7%
6 1747
 
2.6%
1572
 
2.3%
Other values (72) 27156
39.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 49642
72.9%
Decimal Number 18451
 
27.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9299
18.7%
7403
14.9%
6858
13.8%
2727
 
5.5%
2460
 
5.0%
1572
 
3.2%
1167
 
2.4%
960
 
1.9%
948
 
1.9%
947
 
1.9%
Other values (62) 15301
30.8%
Decimal Number
ValueCountFrequency (%)
1 4619
25.0%
2 2438
13.2%
3 1814
 
9.8%
6 1747
 
9.5%
0 1544
 
8.4%
5 1450
 
7.9%
7 1445
 
7.8%
4 1386
 
7.5%
8 1299
 
7.0%
9 709
 
3.8%

Most occurring scripts

ValueCountFrequency (%)
Hangul 49642
72.9%
Common 18451
 
27.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9299
18.7%
7403
14.9%
6858
13.8%
2727
 
5.5%
2460
 
5.0%
1572
 
3.2%
1167
 
2.4%
960
 
1.9%
948
 
1.9%
947
 
1.9%
Other values (62) 15301
30.8%
Common
ValueCountFrequency (%)
1 4619
25.0%
2 2438
13.2%
3 1814
 
9.8%
6 1747
 
9.5%
0 1544
 
8.4%
5 1450
 
7.9%
7 1445
 
7.8%
4 1386
 
7.5%
8 1299
 
7.0%
9 709
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 49642
72.9%
ASCII 18451
 
27.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9299
18.7%
7403
14.9%
6858
13.8%
2727
 
5.5%
2460
 
5.0%
1572
 
3.2%
1167
 
2.4%
960
 
1.9%
948
 
1.9%
947
 
1.9%
Other values (62) 15301
30.8%
ASCII
ValueCountFrequency (%)
1 4619
25.0%
2 2438
13.2%
3 1814
 
9.8%
6 1747
 
9.5%
0 1544
 
8.4%
5 1450
 
7.9%
7 1445
 
7.8%
4 1386
 
7.5%
8 1299
 
7.0%
9 709
 
3.8%

건물번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct573
Distinct (%)5.8%
Missing154
Missing (%)1.5%
Infinite0
Infinite (%)0.0%
Mean90.708511
Minimum1
Maximum1791
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:53:49.555659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5
Q115
median32
Q363
95-th percentile316
Maximum1791
Range1790
Interquartile range (IQR)48

Descriptive statistics

Standard deviation228.8445
Coefficient of variation (CV)2.5228558
Kurtosis27.220526
Mean90.708511
Median Absolute Deviation (MAD)20
Skewness5.045645
Sum893116
Variance52369.803
MonotonicityNot monotonic
2023-12-11T01:53:49.761816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9 235
 
2.4%
15 219
 
2.2%
7 215
 
2.1%
13 215
 
2.1%
14 213
 
2.1%
11 204
 
2.0%
10 202
 
2.0%
20 199
 
2.0%
16 196
 
2.0%
21 195
 
1.9%
Other values (563) 7753
77.5%
ValueCountFrequency (%)
1 40
 
0.4%
2 69
 
0.7%
3 116
1.2%
4 120
1.2%
5 170
1.7%
6 172
1.7%
7 215
2.1%
8 177
1.8%
9 235
2.4%
10 202
2.0%
ValueCountFrequency (%)
1791 1
 
< 0.1%
1790 1
 
< 0.1%
1787 1
 
< 0.1%
1780 1
 
< 0.1%
1778 1
 
< 0.1%
1773 1
 
< 0.1%
1772 2
< 0.1%
1771 1
 
< 0.1%
1770 1
 
< 0.1%
1768 3
< 0.1%

건물번호2
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct61
Distinct (%)0.6%
Missing154
Missing (%)1.5%
Infinite0
Infinite (%)0.0%
Mean2.202925
Minimum-1
Maximum138
Zeros6668
Zeros (%)66.7%
Negative1
Negative (%)< 0.1%
Memory size166.0 KiB
2023-12-11T01:53:49.996312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-1
5-th percentile0
Q10
median0
Q31
95-th percentile13
Maximum138
Range139
Interquartile range (IQR)1

Descriptive statistics

Standard deviation6.255379
Coefficient of variation (CV)2.8395787
Kurtosis97.831195
Mean2.202925
Median Absolute Deviation (MAD)0
Skewness7.4849132
Sum21690
Variance39.129767
MonotonicityNot monotonic
2023-12-11T01:53:50.162085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 6668
66.7%
1 1131
 
11.3%
2 250
 
2.5%
5 167
 
1.7%
6 157
 
1.6%
4 151
 
1.5%
7 148
 
1.5%
8 142
 
1.4%
3 139
 
1.4%
10 112
 
1.1%
Other values (51) 781
 
7.8%
(Missing) 154
 
1.5%
ValueCountFrequency (%)
-1 1
 
< 0.1%
0 6668
66.7%
1 1131
 
11.3%
2 250
 
2.5%
3 139
 
1.4%
4 151
 
1.5%
5 167
 
1.7%
6 157
 
1.6%
7 148
 
1.5%
8 142
 
1.4%
ValueCountFrequency (%)
138 1
 
< 0.1%
118 1
 
< 0.1%
116 1
 
< 0.1%
115 1
 
< 0.1%
100 1
 
< 0.1%
98 1
 
< 0.1%
96 1
 
< 0.1%
87 1
 
< 0.1%
81 1
 
< 0.1%
79 3
< 0.1%

건물명
Text

MISSING 

Distinct1515
Distinct (%)65.5%
Missing7686
Missing (%)76.9%
Memory size156.2 KiB
2023-12-11T01:53:50.501744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length5.7722558
Min length2

Characters and Unicode

Total characters13357
Distinct characters535
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1092 ?
Unique (%)47.2%

Sample

1st row부광주택
2nd row부산빌딩
3rd row고려주차장
4th row동진노인독거요양원
5th row골목집영양탕
ValueCountFrequency (%)
시영아파트 19
 
0.8%
주공아파트 18
 
0.7%
빌라 13
 
0.5%
그린코아 12
 
0.5%
롯데아파트 12
 
0.5%
주민센터 11
 
0.4%
럭키만덕아파트 10
 
0.4%
동원로얄듀크 9
 
0.4%
금강빌라 9
 
0.4%
유림아파트 8
 
0.3%
Other values (1581) 2403
95.2%
2023-12-11T01:53:50.931357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
622
 
4.7%
515
 
3.9%
420
 
3.1%
376
 
2.8%
350
 
2.6%
235
 
1.8%
234
 
1.8%
210
 
1.6%
197
 
1.5%
185
 
1.4%
Other values (525) 10013
75.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12848
96.2%
Space Separator 210
 
1.6%
Decimal Number 125
 
0.9%
Uppercase Letter 99
 
0.7%
Open Punctuation 23
 
0.2%
Close Punctuation 23
 
0.2%
Other Punctuation 15
 
0.1%
Lowercase Letter 11
 
0.1%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
622
 
4.8%
515
 
4.0%
420
 
3.3%
376
 
2.9%
350
 
2.7%
235
 
1.8%
234
 
1.8%
197
 
1.5%
185
 
1.4%
181
 
1.4%
Other values (482) 9533
74.2%
Uppercase Letter
ValueCountFrequency (%)
I 18
18.2%
G 15
15.2%
S 15
15.2%
V 10
10.1%
L 10
10.1%
K 8
8.1%
P 4
 
4.0%
M 3
 
3.0%
O 2
 
2.0%
T 2
 
2.0%
Other values (9) 12
12.1%
Decimal Number
ValueCountFrequency (%)
2 55
44.0%
3 25
20.0%
1 17
 
13.6%
8 8
 
6.4%
4 8
 
6.4%
5 6
 
4.8%
7 4
 
3.2%
6 2
 
1.6%
Lowercase Letter
ValueCountFrequency (%)
e 3
27.3%
x 2
18.2%
t 1
 
9.1%
a 1
 
9.1%
m 1
 
9.1%
d 1
 
9.1%
p 1
 
9.1%
i 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
· 8
53.3%
. 3
 
20.0%
, 2
 
13.3%
& 2
 
13.3%
Space Separator
ValueCountFrequency (%)
210
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12848
96.2%
Common 399
 
3.0%
Latin 110
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
622
 
4.8%
515
 
4.0%
420
 
3.3%
376
 
2.9%
350
 
2.7%
235
 
1.8%
234
 
1.8%
197
 
1.5%
185
 
1.4%
181
 
1.4%
Other values (482) 9533
74.2%
Latin
ValueCountFrequency (%)
I 18
16.4%
G 15
13.6%
S 15
13.6%
V 10
9.1%
L 10
9.1%
K 8
 
7.3%
P 4
 
3.6%
e 3
 
2.7%
M 3
 
2.7%
x 2
 
1.8%
Other values (17) 22
20.0%
Common
ValueCountFrequency (%)
210
52.6%
2 55
 
13.8%
3 25
 
6.3%
( 23
 
5.8%
) 23
 
5.8%
1 17
 
4.3%
8 8
 
2.0%
· 8
 
2.0%
4 8
 
2.0%
5 6
 
1.5%
Other values (6) 16
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12848
96.2%
ASCII 501
 
3.8%
None 8
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
622
 
4.8%
515
 
4.0%
420
 
3.3%
376
 
2.9%
350
 
2.7%
235
 
1.8%
234
 
1.8%
197
 
1.5%
185
 
1.4%
181
 
1.4%
Other values (482) 9533
74.2%
ASCII
ValueCountFrequency (%)
210
41.9%
2 55
 
11.0%
3 25
 
5.0%
( 23
 
4.6%
) 23
 
4.6%
I 18
 
3.6%
1 17
 
3.4%
G 15
 
3.0%
S 15
 
3.0%
V 10
 
2.0%
Other values (32) 90
18.0%
None
ValueCountFrequency (%)
· 8
100.0%

산(확인)
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
9928 
True
 
72
ValueCountFrequency (%)
False 9928
99.3%
True 72
 
0.7%
2023-12-11T01:53:51.039772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

도로위계
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
4
7543 
2
1607 
1
 
678
<NA>
 
156
5
 
11

Length

Max length4
Median length1
Mean length1.0468
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4
2nd row4
3rd row4
4th row4
5th row4

Common Values

ValueCountFrequency (%)
4 7543
75.4%
2 1607
 
16.1%
1 678
 
6.8%
<NA> 156
 
1.6%
5 11
 
0.1%
3 5
 
0.1%

Length

2023-12-11T01:53:51.132283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:53:51.240487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4 7543
75.4%
2 1607
 
16.1%
1 678
 
6.8%
na 156
 
1.6%
5 11
 
0.1%
3 5
 
< 0.1%

지번
Text

MISSING 

Distinct8039
Distinct (%)81.6%
Missing154
Missing (%)1.5%
Memory size156.2 KiB
2023-12-11T01:53:51.592338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length9.6073532
Min length5

Characters and Unicode

Total characters94594
Distinct characters23
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6733 ?
Unique (%)68.4%

Sample

1st row화명동 803
2nd row구포동 1251-6
3rd row만덕동 888-3
4th row만덕동 883-7
5th row구포동 1240-4
ValueCountFrequency (%)
구포동 4101
20.8%
만덕동 2371
 
12.0%
덕천동 2105
 
10.7%
화명동 869
 
4.4%
금곡동 400
 
2.0%
825-10 16
 
0.1%
825-11 13
 
0.1%
216-7 12
 
0.1%
824 10
 
0.1%
1150 9
 
< 0.1%
Other values (7627) 9786
49.7%
2023-12-11T01:53:52.139366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9846
 
10.4%
9846
 
10.4%
1 9577
 
10.1%
- 9015
 
9.5%
2 6365
 
6.7%
3 5061
 
5.4%
4476
 
4.7%
8 4286
 
4.5%
4 4191
 
4.4%
4101
 
4.3%
Other values (13) 27830
29.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 46123
48.8%
Other Letter 29610
31.3%
Space Separator 9846
 
10.4%
Dash Punctuation 9015
 
9.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9846
33.3%
4476
15.1%
4101
13.9%
4101
13.9%
2371
 
8.0%
2105
 
7.1%
869
 
2.9%
869
 
2.9%
400
 
1.4%
400
 
1.4%
Decimal Number
ValueCountFrequency (%)
1 9577
20.8%
2 6365
13.8%
3 5061
11.0%
8 4286
9.3%
4 4191
9.1%
0 3762
 
8.2%
5 3273
 
7.1%
7 3256
 
7.1%
9 3197
 
6.9%
6 3155
 
6.8%
Space Separator
ValueCountFrequency (%)
9846
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9015
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 64984
68.7%
Hangul 29610
31.3%

Most frequent character per script

Common
ValueCountFrequency (%)
9846
15.2%
1 9577
14.7%
- 9015
13.9%
2 6365
9.8%
3 5061
7.8%
8 4286
6.6%
4 4191
6.4%
0 3762
 
5.8%
5 3273
 
5.0%
7 3256
 
5.0%
Other values (2) 6352
9.8%
Hangul
ValueCountFrequency (%)
9846
33.3%
4476
15.1%
4101
13.9%
4101
13.9%
2371
 
8.0%
2105
 
7.1%
869
 
2.9%
869
 
2.9%
400
 
1.4%
400
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 64984
68.7%
Hangul 29610
31.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9846
33.3%
4476
15.1%
4101
13.9%
4101
13.9%
2371
 
8.0%
2105
 
7.1%
869
 
2.9%
869
 
2.9%
400
 
1.4%
400
 
1.4%
ASCII
ValueCountFrequency (%)
9846
15.2%
1 9577
14.7%
- 9015
13.9%
2 6365
9.8%
3 5061
7.8%
8 4286
6.6%
4 4191
6.4%
0 3762
 
5.8%
5 3273
 
5.0%
7 3256
 
5.0%
Other values (2) 6352
9.8%

연결이미지
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct770
Distinct (%)7.8%
Missing154
Missing (%)1.5%
Infinite0
Infinite (%)0.0%
Mean24893.541
Minimum4061
Maximum58037
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:53:52.304568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4061
5-th percentile10056
Q117054
median22046
Q332023.75
95-th percentile45054
Maximum58037
Range53976
Interquartile range (IQR)14969.75

Descriptive statistics

Standard deviation11439.557
Coefficient of variation (CV)0.45953917
Kurtosis-0.6585328
Mean24893.541
Median Absolute Deviation (MAD)5993
Skewness0.61916485
Sum2.4510181 × 108
Variance1.3086347 × 108
MonotonicityNot monotonic
2023-12-11T01:53:52.431946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
41046 72
 
0.7%
43046 72
 
0.7%
44045 68
 
0.7%
40046 66
 
0.7%
42046 65
 
0.7%
16053 62
 
0.6%
40047 61
 
0.6%
43047 60
 
0.6%
43045 59
 
0.6%
20053 55
 
0.5%
Other values (760) 9206
92.1%
(Missing) 154
 
1.5%
ValueCountFrequency (%)
4061 3
 
< 0.1%
4062 8
0.1%
5061 13
0.1%
5062 16
0.2%
5063 2
 
< 0.1%
6059 3
 
< 0.1%
6060 8
0.1%
6061 10
0.1%
6062 15
0.1%
6063 15
0.1%
ValueCountFrequency (%)
58037 3
< 0.1%
58036 3
< 0.1%
53049 2
< 0.1%
53046 1
 
< 0.1%
53042 1
 
< 0.1%
52050 1
 
< 0.1%
52049 3
< 0.1%
52047 1
 
< 0.1%
52042 1
 
< 0.1%
52041 1
 
< 0.1%


Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

신우편번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct154
Distinct (%)1.6%
Missing280
Missing (%)2.8%
Infinite0
Infinite (%)0.0%
Mean46580.604
Minimum46500
Maximum46653
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:53:52.581217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum46500
5-th percentile46506
Q146554
median46579
Q346614
95-th percentile46646.05
Maximum46653
Range153
Interquartile range (IQR)60

Descriptive statistics

Standard deviation40.129383
Coefficient of variation (CV)0.00086150413
Kurtosis-0.85514101
Mean46580.604
Median Absolute Deviation (MAD)30
Skewness-0.064482346
Sum4.5276347 × 108
Variance1610.3674
MonotonicityNot monotonic
2023-12-11T01:53:52.746385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
46557 848
 
8.5%
46585 246
 
2.5%
46609 166
 
1.7%
46615 153
 
1.5%
46552 147
 
1.5%
46574 146
 
1.5%
46578 146
 
1.5%
46595 137
 
1.4%
46596 134
 
1.3%
46546 132
 
1.3%
Other values (144) 7465
74.7%
(Missing) 280
 
2.8%
ValueCountFrequency (%)
46500 14
 
0.1%
46501 103
1.0%
46502 69
0.7%
46503 66
0.7%
46504 57
0.6%
46505 67
0.7%
46506 122
1.2%
46507 18
 
0.2%
46508 25
 
0.2%
46509 12
 
0.1%
ValueCountFrequency (%)
46653 85
0.9%
46652 59
0.6%
46651 3
 
< 0.1%
46650 39
 
0.4%
46649 124
1.2%
46648 81
0.8%
46647 95
0.9%
46646 24
 
0.2%
46645 39
 
0.4%
46644 46
 
0.5%

Interactions

2023-12-11T01:53:42.172539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:36.229045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:37.283334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:38.667776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:39.788656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:41.045870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:42.353556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:36.407923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:37.569494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:38.841255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:40.007772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:41.222382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:42.534341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:36.596871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:37.823619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:39.022085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:40.184787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:41.397160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:42.715990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:36.757999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:38.077219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:39.205513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:40.353747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:41.561478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:42.909083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:36.921130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:38.258003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:39.395991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:40.697126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:41.750577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:43.125177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:37.075065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:38.476145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:39.554031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:40.864912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:53:41.944048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:53:52.867386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번우편번호2읍면동건물번호건물번호2산(확인)도로위계연결이미지신우편번호
순번1.0000.1420.9870.4020.0970.0650.3860.8800.895
우편번호20.1421.0000.1670.2840.0520.0000.0860.1440.152
읍면동0.9870.1671.0000.3260.1220.0290.3520.9120.956
건물번호0.4020.2840.3261.0000.0000.0000.5740.3240.361
건물번호20.0970.0520.1220.0001.0000.2680.6530.2480.131
산(확인)0.0650.0000.0290.0000.2681.0000.0390.4030.072
도로위계0.3860.0860.3520.5740.6530.0391.0000.3160.409
연결이미지0.8800.1440.9120.3240.2480.4030.3161.0000.825
신우편번호0.8950.1520.9560.3610.1310.0720.4090.8251.000
2023-12-11T01:53:53.225816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도로위계읍면동우편번호1산(확인)
도로위계1.0000.1371.0000.048
읍면동0.1371.0001.0000.036
우편번호11.0001.0001.0001.000
산(확인)0.0480.0361.0001.000
2023-12-11T01:53:53.326663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번우편번호2건물번호건물번호2연결이미지신우편번호우편번호1읍면동산(확인)도로위계
순번1.0000.802-0.030-0.0360.802-0.4491.0000.8360.0490.170
우편번호20.8021.000-0.022-0.0160.674-0.3461.0000.1370.0000.070
건물번호-0.030-0.0221.000-0.021-0.017-0.0971.0000.1950.0000.378
건물번호2-0.036-0.016-0.0211.000-0.0200.0141.0000.0510.2050.328
연결이미지0.8020.674-0.017-0.0201.000-0.2431.0000.6100.3090.137
신우편번호-0.449-0.346-0.0970.014-0.2431.0001.0000.7100.0550.183
우편번호11.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
읍면동0.8360.1370.1950.0510.6100.7101.0001.0000.0360.137
산(확인)0.0490.0000.0000.2050.3090.0551.0000.0361.0000.048
도로위계0.1700.0700.3780.3280.1370.1831.0000.1370.0481.000

Missing values

2023-12-11T01:53:43.433968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:53:44.151887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:53:44.494910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번우편번호1우편번호2시도시군구읍면동번지도로명건물번호건물번호2건물명산(확인)도로위계지번연결이미지신우편번호
1751817197616845부산광역시북구화명동803화명대로64번길407<NA>N4화명동 80327033<NA>46533
71415782616809부산광역시북구구포동1251-6시랑로138번길201<NA>N4구포동 1251-622063<NA>46644
1158712369616831부산광역시북구만덕동888-3덕천로380번길430<NA>N4만덕동 888-349052<NA>46609
1275312195616831부산광역시북구만덕동883-7덕천로352번길50<NA>N4만덕동 883-747052<NA>46609
55385895616808부산광역시북구구포동1240-4시랑로163번길287<NA>N4구포동 1240-425061<NA>46625
1332411675616824부산광역시북구만덕동24898구만덕로80번길390<NA>N4만덕동 68-350048<NA>46608
1070710051616816부산광역시북구덕천동303-10만덕대로65번길1020<NA>N4덕천동 303-1022046<NA>46549
78369875616817부산광역시북구덕천동316-5만덕대로27번길1260부광주택N4덕천동 316-520047<NA>46546
1544316683616834부산광역시북구화명동373-6산성로48번길99<NA>N4화명동 373-628027<NA>46529
1174914606616830부산광역시북구만덕동817-58상학로15번길480<NA>N4만덕동 817-5841046<NA>46557
순번우편번호1우편번호2시도시군구읍면동번지도로명건물번호건물번호2건물명산(확인)도로위계지번연결이미지신우편번호
1355412507616831부산광역시북구만덕동887-16덕천로386번길840<NA>N4만덕동 887-1648053<NA>46609
799310684616822부산광역시북구덕천동428-10의성로121번길340<NA>N4덕천동 428-1024051<NA>46574
1572916239616834부산광역시북구화명동1429-2금곡대로324번길450우신아파트N4화명동 1429-227029<NA>46531
1368012027616827부산광역시북구만덕동840-37덕천로298번길100<NA>N4만덕동 840-3742052<NA>46612
1621316030616846부산광역시북구화명동899-2금곡대로200번길460<NA>N4화명동 899-224037<NA>46538
1583216773616833부산광역시북구화명동188-4양달로30<NA>N2화명동 188-424026<NA>46518
1330511656616827부산광역시북구만덕동447-8구만덕로60번길500<NA>N4만덕동 447-848047<NA>46607
1305314010616831부산광역시북구만덕동868-8상리로36번가길90<NA>N4만덕동 868-844053<NA>46614
84118834616816부산광역시북구덕천동253-8덕천로667<NA>N2덕천동 253-823052<NA>46594
1103010495616821부산광역시북구덕천동412-11의성로115번길156<NA>N4덕천동 412-1123050<NA>46567