Overview

Dataset statistics

Number of variables: 25
Number of observations: 10000
Missing cells: 34592
Missing cells (%): 13.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.1 MiB
Average record size in memory: 222.0 B
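
The missing-cell percentage follows from the counts above: missing cells divided by the total number of cells (variables × observations). A minimal Python check, using only the figures reported here:

```python
# Figures copied from the dataset statistics above.
n_variables = 25
n_observations = 10_000
missing_cells = 34_592

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{missing_pct:.1f}%")  # 13.8%
```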

Variable types

Numeric: 13
Categorical: 7
Text: 4
Unsupported: 1

Dataset

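Description: Attribute table for the thematic maps of the public property status survey and analysis system of Gumi-si, Gyeongsangbuk-do; it provides road-name addresses and related information for each parcel.
Author: Gumi-si, Gyeongsangbuk-do (경상북도 구미시)
URL: https://www.data.go.kr/data/15089494/fileData.do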

Alerts

시도명 has constant value "경상북도" [Constant]
시군구명 has constant value "구미시" [Constant]
지하여부 has constant value "0" [Constant]
산여부 is highly imbalanced (88.8%) [Imbalance]
공동주택여부 is highly imbalanced (75.5%) [Imbalance]
법정리명 has 5336 (53.4%) missing values [Missing]
건축물대장건물명 has 9900 (99.0%) missing values [Missing]
상세건물명 has 10000 (100.0%) missing values [Missing]
시군구용건물명 has 9356 (93.6%) missing values [Missing]
행정동코드 is highly skewed (γ1 = 70.66521398) [Skewed]
상세건물명 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
지번부번 has 2745 (27.5%) zeros [Zeros]
건물부번 has 4163 (41.6%) zeros [Zeros]
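
Alerts of the Constant, Missing and Zeros kind can be approximated with simple per-column checks. The sketch below is illustrative only: `column_alerts` and its thresholds are hypothetical and are not taken from the profiler's configuration, and the toy columns merely imitate the shape of the flagged variables.

```python
from typing import Optional, Sequence


def column_alerts(values: Sequence[Optional[object]],
                  missing_threshold: float = 0.05,
                  zeros_threshold: float = 0.25) -> list:
    """Return alert labels for one column (thresholds are assumptions)."""
    n = len(values)
    present = [v for v in values if v is not None]
    alerts = []
    # Share of missing entries (None stands in for a missing cell).
    if n and (n - len(present)) / n > missing_threshold:
        alerts.append("Missing")
    # A single distinct non-missing value means the column is constant.
    if present and len(set(present)) == 1:
        alerts.append("Constant")
    # Share of numeric zeros among all rows.
    zeros = sum(1 for v in present if isinstance(v, (int, float)) and v == 0)
    if n and zeros / n > zeros_threshold:
        alerts.append("Zeros")
    return alerts


# Toy columns shaped like the ones flagged above (not the real data).
sido = ["경상북도"] * 10
sub_number = [0, 0, 0, 1, 2, 3, 0, 5, 9, 12]
building_name = [None] * 8 + ["정우빌라", "우원빌라"]

print(column_alerts(sido))           # ['Constant']
print(column_alerts(sub_number))     # ['Zeros']
print(column_alerts(building_name))  # ['Missing']
```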

Reproduction

Analysis started: 2023-12-11 23:46:43.670860
Analysis finished: 2023-12-11 23:46:44.648994
Duration: 0.98 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

매칭코드
Real number (ℝ)

Distinct: 1145
Distinct (%): 11.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.7190429 × 10^49
Minimum: 4.71901 × 10^49
Maximum: 4.7190486 × 10^49
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4.71901 × 10^49
5-th percentile: 4.7190302 × 10^49
Q1: 4.7190331 × 10^49
median: 4.7190472 × 10^49
Q3: 4.7190472 × 10^49
95-th percentile: 4.7190472 × 10^49
Maximum: 4.7190486 × 10^49
Range: 3.857026 × 10^44
Interquartile range (IQR): 1.416547 × 10^44

Descriptive statistics

Standard deviation: 7.3441687 × 10^43
Coefficient of variation (CV): 1.5562835 × 10^-6
Kurtosis: 0.22568881
Mean: 4.7190429 × 10^49
Median Absolute Deviation (MAD): 3.7549995 × 10^40
Skewness: -1.2846977
Sum: 4.7190429 × 10^53
Variance: 5.3936813 × 10^87
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.7190230800303e+49 127
 
1.3%
4.7190330500303e+49 102
 
1.0%
4.719030180390199e+49 79
 
0.8%
4.7190472475001e+49 77
 
0.8%
4.719033050030599e+49 70
 
0.7%
4.719047245710099e+49 67
 
0.7%
4.7190472457602e+49 59
 
0.6%
4.7190330500305e+49 54
 
0.5%
4.7190330800601e+49 54
 
0.5%
4.719033080680099e+49 53
 
0.5%
Other values (1135) 9258
92.6%
ValueCountFrequency (%)
4.7190100000101e+49 1
 
< 0.1%
4.7190100000103e+49 1
 
< 0.1%
4.7190100002001e+49 1
 
< 0.1%
4.719010000200199e+49 1
 
< 0.1%
4.7190201800401e+49 1
 
< 0.1%
4.7190201800402e+49 2
 
< 0.1%
4.7190201800403e+49 18
0.2%
4.7190230800101e+49 14
0.1%
4.7190230800102e+49 1
 
< 0.1%
4.7190230800103e+49 17
0.2%
ValueCountFrequency (%)
4.7190485702701e+49 1
 
< 0.1%
4.719048546480099e+49 2
< 0.1%
4.719048546460199e+49 2
< 0.1%
4.7190485369001e+49 1
 
< 0.1%
4.719048536890099e+49 1
 
< 0.1%
4.7190485368801e+49 3
< 0.1%
4.7190485349001e+49 1
 
< 0.1%
4.7190485348801e+49 1
 
< 0.1%
4.7190485348701e+49 3
< 0.1%
4.719048534860099e+49 2
< 0.1%
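
The frequency tables above print values such as 4.719030180390199e+49, which suggests that this identifier code was parsed as a floating-point number. A float64 holds only about 15 to 17 significant decimal digits, so longer codes can lose their trailing digits and distinct codes can collapse into one value. The sketch below illustrates the effect with two hypothetical 19-digit codes invented for the example; it is not a statement about how this dataset was actually loaded.

```python
# Two distinct 19-digit identifiers (hypothetical values for illustration).
code_a = 4719011100100700001
code_b = 4719011100100700002

# As Python integers or strings they remain distinct.
assert code_a != code_b
assert str(code_a) != str(code_b)

# As float64 they collapse to the same value, because adjacent
# floats near 4.7e18 are 1024 apart.
print(float(code_a) == float(code_b))  # True

# Keeping identifiers as strings preserves every digit.
codes = [str(code_a), str(code_b)]
print(len(set(codes)))  # 2
```

If the same holds for the real data, reading the code columns as text rather than as numbers would avoid the problem and would also make the mean, variance and skewness rows (which carry little meaning for identifiers) disappear from the profile.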

법정필지코드
Real number (ℝ)

Distinct: 6382
Distinct (%): 63.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.7190199 × 10^18
Minimum: 4.7190101 × 10^18
Maximum: 4.719036 × 10^18
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4.7190101 × 10^18
5-th percentile: 4.7190101 × 10^18
Q1: 4.7190111 × 10^18
median: 4.7190126 × 10^18
Q3: 4.719031 × 10^18
95-th percentile: 4.719035 × 10^18
Maximum: 4.719036 × 10^18
Range: 2.59291 × 10^13
Interquartile range (IQR): 1.9922003 × 10^13

Descriptive statistics

Standard deviation: 9.7764351 × 10^12
Coefficient of variation (CV): 2.0717088 × 10^-6
Kurtosis: -1.5421684
Mean: 4.7190199 × 10^18
Median Absolute Deviation (MAD): 2.5000499 × 10^12
Skewness: 0.3940879
Sum: 3.4280449 × 10^18
Variance: 9.5578683 × 10^25
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4719011100100700000 86
 
0.9%
4719010100109640000 72
 
0.7%
4719010100109370000 37
 
0.4%
4719025336109110000 35
 
0.4%
4719012600105910000 32
 
0.3%
4719010300107920000 27
 
0.3%
4719034035100010000 26
 
0.3%
4719025334111190000 23
 
0.2%
4719010100100070000 18
 
0.2%
4719025334111120000 18
 
0.2%
Other values (6372) 9626
96.3%
ValueCountFrequency (%)
4719010100100000000 7
 
0.1%
4719010100100060000 13
0.1%
4719010100100070000 18
0.2%
4719010100100080000 12
0.1%
4719010100100100000 5
 
0.1%
4719010100100140000 1
 
< 0.1%
4719010100100150000 1
 
< 0.1%
4719010100100190000 2
 
< 0.1%
4719010100100240000 1
 
< 0.1%
4719010100100270000 4
 
< 0.1%
ValueCountFrequency (%)
4719036029200440000 3
 
< 0.1%
4719036029110180000 2
 
< 0.1%
4719036029109940000 1
 
< 0.1%
4719036029109890000 1
 
< 0.1%
4719036029109760000 1
 
< 0.1%
4719036029109700000 1
 
< 0.1%
4719036029109360000 1
 
< 0.1%
4719036029109340000 1
 
< 0.1%
4719036029109200000 8
0.1%
4719036029109010000 2
 
< 0.1%

도로명주소코드
Real number (ℝ)

Distinct: 1145
Distinct (%): 11.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.7190429 × 10^24
Minimum: 4.71901 × 10^24
Maximum: 4.7190486 × 10^24
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4.71901 × 10^24
5-th percentile: 4.7190302 × 10^24
Q1: 4.7190331 × 10^24
median: 4.7190472 × 10^24
Q3: 4.7190472 × 10^24
95-th percentile: 4.7190472 × 10^24
Maximum: 4.7190486 × 10^24
Range: 3.857026 × 10^19
Interquartile range (IQR): 1.416547 × 10^19

Descriptive statistics

Standard deviation: 7.3441687 × 10^18
Coefficient of variation (CV): 1.5562835 × 10^-6
Kurtosis: 0.22568881
Mean: 4.7190429 × 10^24
Median Absolute Deviation (MAD): 3.7550002 × 10^15
Skewness: -1.2846977
Sum: 4.7190429 × 10^28
Variance: 5.3936813 × 10^37
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.7190230800303e+24 127
 
1.3%
4.7190330500303e+24 102
 
1.0%
4.7190301803902e+24 79
 
0.8%
4.7190472475001e+24 77
 
0.8%
4.7190330500306e+24 70
 
0.7%
4.7190472457101e+24 67
 
0.7%
4.7190472457602e+24 59
 
0.6%
4.7190330500305e+24 54
 
0.5%
4.7190330800601e+24 54
 
0.5%
4.7190330806801e+24 53
 
0.5%
Other values (1135) 9258
92.6%
ValueCountFrequency (%)
4.7190100000101e+24 1
 
< 0.1%
4.7190100000103e+24 1
 
< 0.1%
4.7190100002001e+24 1
 
< 0.1%
4.7190100002002e+24 1
 
< 0.1%
4.7190201800401e+24 1
 
< 0.1%
4.7190201800402e+24 2
 
< 0.1%
4.7190201800403e+24 18
0.2%
4.7190230800101e+24 14
0.1%
4.7190230800102e+24 1
 
< 0.1%
4.7190230800103e+24 17
0.2%
ValueCountFrequency (%)
4.7190485702701e+24 1
 
< 0.1%
4.7190485464801e+24 2
< 0.1%
4.7190485464602e+24 2
< 0.1%
4.7190485369001e+24 1
 
< 0.1%
4.7190485368901e+24 1
 
< 0.1%
4.7190485368801e+24 3
< 0.1%
4.7190485349001e+24 1
 
< 0.1%
4.7190485348801e+24 1
 
< 0.1%
4.7190485348701e+24 3
< 0.1%
4.7190485348601e+24 2
< 0.1%

법정동코드
Real number (ℝ)

Distinct: 128
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.7190195 × 10^9
Minimum: 4.7190101 × 10^9
Maximum: 4.719036 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4.7190101 × 10^9
5-th percentile: 4.7190101 × 10^9
Q1: 4.7190111 × 10^9
median: 4.7190126 × 10^9
Q3: 4.7190256 × 10^9
95-th percentile: 4.719034 × 10^9
Maximum: 4.719036 × 10^9
Range: 25929
Interquartile range (IQR): 14523

Descriptive statistics

Standard deviation: 9326.7897
Coefficient of variation (CV): 1.9764253 × 10^-6
Kurtosis: -1.4697965
Mean: 4.7190195 × 10^9
Median Absolute Deviation (MAD): 2500
Skewness: 0.41051767
Sum: 4.7190195 × 10^13
Variance: 86989005
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4719010100 697
 
7.0%
4719012300 410
 
4.1%
4719010400 394
 
3.9%
4719012200 373
 
3.7%
4719010300 335
 
3.4%
4719010900 301
 
3.0%
4719011000 235
 
2.4%
4719012800 230
 
2.3%
4719011100 225
 
2.2%
4719011500 216
 
2.2%
Other values (118) 6584
65.8%
ValueCountFrequency (%)
4719010100 697
7.0%
4719010200 175
 
1.8%
4719010300 335
3.4%
4719010400 394
3.9%
4719010500 59
 
0.6%
4719010600 92
 
0.9%
4719010700 8
 
0.1%
4719010800 59
 
0.6%
4719010900 301
3.0%
4719011000 235
 
2.4%
ValueCountFrequency (%)
4719036029 86
0.9%
4719036028 21
 
0.2%
4719036027 40
0.4%
4719036026 50
0.5%
4719036025 35
 
0.4%
4719036024 36
 
0.4%
4719036023 36
 
0.4%
4719036022 23
 
0.2%
4719036021 95
0.9%
4719034035 28
 
0.3%

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
경상북도 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경상북도
2nd row: 경상북도
3rd row: 경상북도
4th row: 경상북도
5th row: 경상북도

Common Values

ValueCountFrequency (%)
경상북도 10000
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
경상북도 10000
100.0%

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
구미시 10000

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 구미시
2nd row: 구미시
3rd row: 구미시
4th row: 구미시
5th row: 구미시

Common Values

ValueCountFrequency (%)
구미시 10000
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
구미시 10000
100.0%
Distinct: 38
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
고아읍 1047
선산읍 1046
원평동 697
해평면 632
산동읍 435
Other values (33) 6143

Length

Max length: 3
Median length: 3
Mean length: 2.9958
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공단동
2nd row: 해평면
3rd row: 형곡동
4th row: 산동읍
5th row: 해평면

Common Values

ValueCountFrequency (%)
고아읍 1047
 
10.5%
선산읍 1046
 
10.5%
원평동 697
 
7.0%
해평면 632
 
6.3%
산동읍 435
 
4.3%
장천면 422
 
4.2%
도개면 415
 
4.2%
진평동 410
 
4.1%
봉곡동 394
 
3.9%
인의동 373
 
3.7%
Other values (28) 4129
41.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
고아읍 1047
 
10.5%
선산읍 1046
 
10.5%
원평동 697
 
7.0%
해평면 632
 
6.3%
산동읍 435
 
4.3%
장천면 422
 
4.2%
도개면 415
 
4.2%
진평동 410
 
4.1%
봉곡동 394
 
3.9%
인의동 373
 
3.7%
Other values (28) 4129
41.3%

법정리명
Text

MISSING 

Distinct: 95
Distinct (%): 2.0%
Missing: 5336
Missing (%): 53.4%
Memory size: 156.2 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.9614065
Min length: 2

Characters and Unicode

Total characters: 13812
Distinct characters: 96
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일선리
2nd row: 동곡리
3rd row: 송곡리
4th row: 상장리
5th row: 월림리
ValueCountFrequency (%)
이문리 144
 
3.1%
문성리 138
 
3.0%
완전리 128
 
2.7%
오로리 117
 
2.5%
동부리 107
 
2.3%
원호리 104
 
2.2%
봉한리 102
 
2.2%
상장리 95
 
2.0%
교리 95
 
2.0%
관심리 91
 
2.0%
Other values (85) 3543
76.0%

Most occurring characters

ValueCountFrequency (%)
4664
33.8%
564
 
4.1%
459
 
3.3%
380
 
2.8%
292
 
2.1%
279
 
2.0%
278
 
2.0%
275
 
2.0%
258
 
1.9%
231
 
1.7%
Other values (86) 6132
44.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13812
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4664
33.8%
564
 
4.1%
459
 
3.3%
380
 
2.8%
292
 
2.1%
279
 
2.0%
278
 
2.0%
275
 
2.0%
258
 
1.9%
231
 
1.7%
Other values (86) 6132
44.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13812
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4664
33.8%
564
 
4.1%
459
 
3.3%
380
 
2.8%
292
 
2.1%
279
 
2.0%
278
 
2.0%
275
 
2.0%
258
 
1.9%
231
 
1.7%
Other values (86) 6132
44.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13812
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4664
33.8%
564
 
4.1%
459
 
3.3%
380
 
2.8%
292
 
2.1%
279
 
2.0%
278
 
2.0%
275
 
2.0%
258
 
1.9%
231
 
1.7%
Other values (86) 6132
44.4%
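
The Distinct (%) value reported for 법정리명 (2.0%) matches the reported counts only if the denominator is the number of non-missing rows rather than all 10000 observations. This is an inference from the numbers in this report, not a statement about the profiler's documented behaviour. A quick check in Python:

```python
# Counts copied from the 법정리명 summary in this report.
n_observations = 10_000
missing = 5_336
distinct = 95

non_missing = n_observations - missing            # 4664
pct_of_non_missing = 100 * distinct / non_missing
pct_of_all_rows = 100 * distinct / n_observations

print(f"{pct_of_non_missing:.1f}%")  # 2.0%, the figure shown in the report
print(f"{pct_of_all_rows:.2f}%")     # noticeably lower when all rows are used
```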

산여부
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
0 9851
1 149

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

ValueCountFrequency (%)
0 9851
98.5%
1 149
 
1.5%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 9851
98.5%
1 149
 
1.5%
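
The 88.8% imbalance figure reported for 산여부 in the Alerts section is consistent with a score of one minus the normalised Shannon entropy of the class frequencies. The sketch below reproduces the number from the counts above; treating this as the profiler's formula is an assumption based on that match, and `imbalance_score` is a name chosen for this example.

```python
import math


def imbalance_score(counts):
    """1 - (Shannon entropy / maximum entropy).

    0 means perfectly balanced classes; values near 1 mean that a
    single class dominates.
    """
    if len(counts) < 2:
        # Undefined for fewer than two classes; 0.0 is a convention chosen here.
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(counts))


# Class counts for 산여부: 9851 zeros and 149 ones.
score = imbalance_score([9851, 149])
print(f"{score:.1%}")  # 88.8%
```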

지번본번
Real number (ℝ)

Distinct: 1218
Distinct (%): 12.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 435.2278
Minimum: 0
Maximum: 1859
Zeros: 5
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 27
Q1: 170
median: 397
Q3: 646.25
95-th percentile: 1027
Maximum: 1859
Range: 1859
Interquartile range (IQR): 476.25

Descriptive statistics

Standard deviation: 312.33531
Coefficient of variation (CV): 0.7176364
Kurtosis: 0.045500709
Mean: 435.2278
Median Absolute Deviation (MAD): 234
Skewness: 0.70070543
Sum: 4352278
Variance: 97553.347
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
70 98
 
1.0%
964 75
 
0.8%
1 57
 
0.6%
591 43
 
0.4%
911 42
 
0.4%
6 40
 
0.4%
937 39
 
0.4%
792 35
 
0.4%
427 31
 
0.3%
320 30
 
0.3%
Other values (1208) 9510
95.1%
ValueCountFrequency (%)
0 5
 
0.1%
1 57
0.6%
2 15
 
0.1%
3 9
 
0.1%
4 17
 
0.2%
5 7
 
0.1%
6 40
0.4%
7 29
0.3%
8 20
 
0.2%
9 20
 
0.2%
ValueCountFrequency (%)
1859 1
< 0.1%
1855 1
< 0.1%
1845 1
< 0.1%
1796 1
< 0.1%
1790 1
< 0.1%
1779 1
< 0.1%
1754 1
< 0.1%
1726 1
< 0.1%
1673 1
< 0.1%
1672 1
< 0.1%
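
Several of the derived statistics for 지번본번 can be recomputed from the others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. A small Python check using the reported (rounded) values:

```python
import math

# Values copied from the 지번본번 statistics in this report.
minimum, maximum = 0, 1859
q1, q3 = 170, 646.25
mean, std = 435.2278, 312.33531
variance = 97553.347

value_range = maximum - minimum   # 1859
iqr = q3 - q1                     # 476.25
cv = std / mean                   # about 0.7176

print(value_range, iqr, round(cv, 4))
# The reported variance agrees with std squared up to rounding.
print(math.isclose(std ** 2, variance, rel_tol=1e-6))  # True
```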

지번부번
Real number (ℝ)

ZEROS 

Distinct: 339
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16.2505
Minimum: 0
Maximum: 869
Zeros: 2745
Zeros (%): 27.5%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 3
Q3: 9
95-th percentile: 57
Maximum: 869
Range: 869
Interquartile range (IQR): 9

Descriptive statistics

Standard deviation: 60.706519
Coefficient of variation (CV): 3.7356709
Kurtosis: 64.230882
Mean: 16.2505
Median Absolute Deviation (MAD): 3
Skewness: 7.3743292
Sum: 162505
Variance: 3685.2815
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2745
27.5%
1 1204
12.0%
2 834
 
8.3%
3 647
 
6.5%
4 527
 
5.3%
5 473
 
4.7%
6 405
 
4.0%
7 353
 
3.5%
8 311
 
3.1%
9 257
 
2.6%
Other values (329) 2244
22.4%
ValueCountFrequency (%)
0 2745
27.5%
1 1204
12.0%
2 834
 
8.3%
3 647
 
6.5%
4 527
 
5.3%
5 473
 
4.7%
6 405
 
4.0%
7 353
 
3.5%
8 311
 
3.1%
9 257
 
2.6%
ValueCountFrequency (%)
869 1
< 0.1%
858 1
< 0.1%
841 2
< 0.1%
795 1
< 0.1%
707 1
< 0.1%
705 1
< 0.1%
696 1
< 0.1%
683 1
< 0.1%
672 1
< 0.1%
670 1
< 0.1%

도로명코드
Real number (ℝ)

Distinct: 1018
Distinct (%): 10.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.7190429 × 10^11
Minimum: 4.71901 × 10^11
Maximum: 4.7190486 × 10^11
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4.71901 × 10^11
5-th percentile: 4.7190302 × 10^11
Q1: 4.7190331 × 10^11
median: 4.7190472 × 10^11
Q3: 4.7190472 × 10^11
95-th percentile: 4.7190472 × 10^11
Maximum: 4.7190486 × 10^11
Range: 3857026
Interquartile range (IQR): 1416547

Descriptive statistics

Standard deviation: 734416.87
Coefficient of variation (CV): 1.5562835 × 10^-6
Kurtosis: 0.22568881
Mean: 4.7190429 × 10^11
Median Absolute Deviation (MAD): 375.5
Skewness: -1.2846977
Sum: 4.7190429 × 10^15
Variance: 5.3936814 × 10^11
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
471903305003 235
 
2.4%
471902308003 200
 
2.0%
471903018039 90
 
0.9%
471904724750 77
 
0.8%
471903308081 69
 
0.7%
471904724571 67
 
0.7%
471903308006 62
 
0.6%
471904724576 60
 
0.6%
471903308082 57
 
0.6%
471902308002 57
 
0.6%
Other values (1008) 9026
90.3%
ValueCountFrequency (%)
471901000001 2
 
< 0.1%
471901000020 2
 
< 0.1%
471902018004 21
 
0.2%
471902308001 33
 
0.3%
471902308002 57
 
0.6%
471902308003 200
2.0%
471902308004 16
 
0.2%
471902308005 35
 
0.4%
471902308006 4
 
< 0.1%
471903018003 19
 
0.2%
ValueCountFrequency (%)
471904857027 1
 
< 0.1%
471904854648 2
< 0.1%
471904854646 2
< 0.1%
471904853690 1
 
< 0.1%
471904853689 1
 
< 0.1%
471904853688 3
< 0.1%
471904853490 1
 
< 0.1%
471904853488 1
 
< 0.1%
471904853487 3
< 0.1%
471904853486 2
< 0.1%
Distinct: 1018
Distinct (%): 10.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 8
Median length: 7
Mean length: 4.6657
Min length: 3

Characters and Unicode

Total characters: 46657
Distinct characters: 158
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 67
Unique (%): 0.7%

Sample

1st row: 수출대로3길
2nd row: 수류길
3rd row: 형곡동로1길
4th row: 동곡3길
5th row: 송곡6길
ValueCountFrequency (%)
강동로 235
 
2.4%
선산대로 200
 
2.0%
상무로 90
 
0.9%
인동32길 77
 
0.8%
선상동로 69
 
0.7%
신비로3길 67
 
0.7%
구미중앙로 62
 
0.6%
신시로10길 60
 
0.6%
선상서로 57
 
0.6%
산호대로 57
 
0.6%
Other values (1008) 9026
90.3%

Most occurring characters

ValueCountFrequency (%)
7262
 
15.6%
5928
 
12.7%
1 2276
 
4.9%
2 1899
 
4.1%
1557
 
3.3%
1527
 
3.3%
3 1291
 
2.8%
1040
 
2.2%
967
 
2.1%
4 852
 
1.8%
Other values (148) 22058
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37751
80.9%
Decimal Number 8906
 
19.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7262
19.2%
5928
 
15.7%
1557
 
4.1%
1527
 
4.0%
1040
 
2.8%
967
 
2.6%
826
 
2.2%
788
 
2.1%
704
 
1.9%
630
 
1.7%
Other values (138) 16522
43.8%
Decimal Number
ValueCountFrequency (%)
1 2276
25.6%
2 1899
21.3%
3 1291
14.5%
4 852
 
9.6%
5 798
 
9.0%
6 497
 
5.6%
7 356
 
4.0%
0 342
 
3.8%
8 332
 
3.7%
9 263
 
3.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37751
80.9%
Common 8906
 
19.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7262
19.2%
5928
 
15.7%
1557
 
4.1%
1527
 
4.0%
1040
 
2.8%
967
 
2.6%
826
 
2.2%
788
 
2.1%
704
 
1.9%
630
 
1.7%
Other values (138) 16522
43.8%
Common
ValueCountFrequency (%)
1 2276
25.6%
2 1899
21.3%
3 1291
14.5%
4 852
 
9.6%
5 798
 
9.0%
6 497
 
5.6%
7 356
 
4.0%
0 342
 
3.8%
8 332
 
3.7%
9 263
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37751
80.9%
ASCII 8906
 
19.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7262
19.2%
5928
 
15.7%
1557
 
4.1%
1527
 
4.0%
1040
 
2.8%
967
 
2.6%
826
 
2.2%
788
 
2.1%
704
 
1.9%
630
 
1.7%
Other values (138) 16522
43.8%
ASCII
ValueCountFrequency (%)
1 2276
25.6%
2 1899
21.3%
3 1291
14.5%
4 852
 
9.6%
5 798
 
9.0%
6 497
 
5.6%
7 356
 
4.0%
0 342
 
3.8%
8 332
 
3.7%
9 263
 
3.0%

지하여부
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
0 10000

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

ValueCountFrequency (%)
0 10000
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0 10000
100.0%

건물본번
Real number (ℝ)

Distinct: 863
Distinct (%): 8.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 134.3023
Minimum: 1
Maximum: 3872
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 5
Q1: 14
median: 32
Q3: 97
95-th percentile: 724
Maximum: 3872
Range: 3871
Interquartile range (IQR): 83

Descriptive statistics

Standard deviation: 294.70332
Coefficient of variation (CV): 2.1943282
Kurtosis: 24.74586
Mean: 134.3023
Median Absolute Deviation (MAD): 23
Skewness: 4.3793926
Sum: 1343023
Variance: 86850.047
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5 276
 
2.8%
6 258
 
2.6%
14 230
 
2.3%
8 229
 
2.3%
11 223
 
2.2%
15 200
 
2.0%
10 197
 
2.0%
12 196
 
2.0%
9 189
 
1.9%
13 179
 
1.8%
Other values (853) 7823
78.2%
ValueCountFrequency (%)
1 42
 
0.4%
2 37
 
0.4%
3 159
1.6%
4 134
1.3%
5 276
2.8%
6 258
2.6%
7 170
1.7%
8 229
2.3%
9 189
1.9%
10 197
2.0%
ValueCountFrequency (%)
3872 1
 
< 0.1%
3318 1
 
< 0.1%
2906 1
 
< 0.1%
2858 3
< 0.1%
2840 1
 
< 0.1%
2830 1
 
< 0.1%
2814 1
 
< 0.1%
2764 1
 
< 0.1%
2694 1
 
< 0.1%
2637 1
 
< 0.1%

건물부번
Real number (ℝ)

ZEROS 

Distinct: 136
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.3493
Minimum: 0
Maximum: 398
Zeros: 4163
Zeros (%): 41.6%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 2
Q3: 11
95-th percentile: 43
Maximum: 398
Range: 398
Interquartile range (IQR): 11

Descriptive statistics

Standard deviation: 17.856572
Coefficient of variation (CV): 1.9099368
Kurtosis: 51.762473
Mean: 9.3493
Median Absolute Deviation (MAD): 2
Skewness: 4.9537841
Sum: 93493
Variance: 318.85718
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 4163
41.6%
1 672
 
6.7%
3 386
 
3.9%
4 343
 
3.4%
5 332
 
3.3%
2 293
 
2.9%
6 282
 
2.8%
8 261
 
2.6%
7 252
 
2.5%
10 224
 
2.2%
Other values (126) 2792
27.9%
ValueCountFrequency (%)
0 4163
41.6%
1 672
 
6.7%
2 293
 
2.9%
3 386
 
3.9%
4 343
 
3.4%
5 332
 
3.3%
6 282
 
2.8%
7 252
 
2.5%
8 261
 
2.6%
9 201
 
2.0%
ValueCountFrequency (%)
398 1
< 0.1%
340 1
< 0.1%
207 1
< 0.1%
193 1
< 0.1%
189 1
< 0.1%
187 1
< 0.1%
183 1
< 0.1%
179 1
< 0.1%
176 1
< 0.1%
171 1
< 0.1%

건축물대장건물명
Text

MISSING 

Distinct: 97
Distinct (%): 97.0%
Missing: 9900
Missing (%): 99.0%
Memory size: 156.2 KiB

Length

Max length: 40
Median length: 40
Mean length: 40
Min length: 40

Characters and Unicode

Total characters: 4000
Distinct characters: 180
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 95
Unique (%): 95.0%

Sample

1st row: 정우빌라
2nd row: 구미옥계우미린
3rd row: 구미원호대우아파트
4th row: 미성빌라23차
5th row: 금오공대아파트
ValueCountFrequency (%)
에비앙힐스 3
 
2.5%
구미옥계 2
 
1.7%
3차 2
 
1.7%
프라임 2
 
1.7%
정우빌라 2
 
1.7%
청산빌라 1
 
0.8%
화성스위트빌 1
 
0.8%
우원빌라 1
 
0.8%
헬리오폴리스1단지 1
 
0.8%
올레 1
 
0.8%
Other values (103) 103
86.6%

Most occurring characters

ValueCountFrequency (%)
3409
85.2%
35
 
0.9%
24
 
0.6%
21
 
0.5%
18
 
0.4%
18
 
0.4%
17
 
0.4%
16
 
0.4%
15
 
0.4%
15
 
0.4%
Other values (170) 412
 
10.3%

Most occurring categories

ValueCountFrequency (%)
Space Separator 3409
85.2%
Other Letter 556
 
13.9%
Decimal Number 23
 
0.6%
Uppercase Letter 4
 
0.1%
Close Punctuation 3
 
0.1%
Open Punctuation 3
 
0.1%
Dash Punctuation 1
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
6.3%
24
 
4.3%
21
 
3.8%
18
 
3.2%
18
 
3.2%
17
 
3.1%
16
 
2.9%
15
 
2.7%
15
 
2.7%
15
 
2.7%
Other values (155) 362
65.1%
Decimal Number
ValueCountFrequency (%)
3 7
30.4%
1 6
26.1%
2 3
13.0%
4 3
13.0%
5 2
 
8.7%
9 1
 
4.3%
6 1
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
C 2
50.0%
A 1
25.0%
H 1
25.0%
Space Separator
ValueCountFrequency (%)
3409
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3439
86.0%
Hangul 556
 
13.9%
Latin 5
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
6.3%
24
 
4.3%
21
 
3.8%
18
 
3.2%
18
 
3.2%
17
 
3.1%
16
 
2.9%
15
 
2.7%
15
 
2.7%
15
 
2.7%
Other values (155) 362
65.1%
Common
ValueCountFrequency (%)
3409
99.1%
3 7
 
0.2%
1 6
 
0.2%
2 3
 
0.1%
4 3
 
0.1%
) 3
 
0.1%
( 3
 
0.1%
5 2
 
0.1%
- 1
 
< 0.1%
9 1
 
< 0.1%
Latin
ValueCountFrequency (%)
C 2
40.0%
A 1
20.0%
e 1
20.0%
H 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3444
86.1%
Hangul 556
 
13.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3409
99.0%
3 7
 
0.2%
1 6
 
0.2%
2 3
 
0.1%
4 3
 
0.1%
) 3
 
0.1%
( 3
 
0.1%
5 2
 
0.1%
C 2
 
0.1%
A 1
 
< 0.1%
Other values (5) 5
 
0.1%
Hangul
ValueCountFrequency (%)
35
 
6.3%
24
 
4.3%
21
 
3.8%
18
 
3.2%
18
 
3.2%
17
 
3.1%
16
 
2.9%
15
 
2.7%
15
 
2.7%
15
 
2.7%
Other values (155) 362
65.1%

상세건물명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

건물관리번호
Real number (ℝ)

Distinct: 6382
Distinct (%): 63.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.7190199 × 10^24
Minimum: 4.7190101 × 10^24
Maximum: 4.719036 × 10^24
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4.7190101 × 10^24
5-th percentile: 4.7190101 × 10^24
Q1: 4.7190111 × 10^24
median: 4.7190126 × 10^24
Q3: 4.719031 × 10^24
95-th percentile: 4.719035 × 10^24
Maximum: 4.719036 × 10^24
Range: 2.59291 × 10^19
Interquartile range (IQR): 1.9922003 × 10^19

Descriptive statistics

Standard deviation: 9.7764351 × 10^18
Coefficient of variation (CV): 2.0717088 × 10^-6
Kurtosis: -1.5421684
Mean: 4.7190199 × 10^24
Median Absolute Deviation (MAD): 2.5000499 × 10^18
Skewness: 0.3940879
Sum: 4.7190199 × 10^28
Variance: 9.5578683 × 10^37
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.7190111001007e+24 86
 
0.9%
4.71901010010964e+24 72
 
0.7%
4.7190101001093695e+24 37
 
0.4%
4.7190253361091103e+24 35
 
0.4%
4.7190126001059104e+24 32
 
0.3%
4.71901030010792e+24 27
 
0.3%
4.71903403510001e+24 26
 
0.3%
4.7190253341111903e+24 23
 
0.2%
4.7190101001000703e+24 18
 
0.2%
4.71902533411112e+24 18
 
0.2%
Other values (6372) 9626
96.3%
ValueCountFrequency (%)
4.7190101001e+24 7
 
0.1%
4.71901010010006e+24 13
0.1%
4.7190101001000703e+24 18
0.2%
4.71901010010008e+24 12
0.1%
4.7190101001001e+24 5
 
0.1%
4.71901010010014e+24 1
 
< 0.1%
4.7190101001001503e+24 1
 
< 0.1%
4.7190101001001906e+24 2
 
< 0.1%
4.71901010010024e+24 1
 
< 0.1%
4.7190101001002706e+24 4
 
< 0.1%
ValueCountFrequency (%)
4.71903602920044e+24 3
 
< 0.1%
4.71903602911018e+24 2
 
< 0.1%
4.71903602910994e+24 1
 
< 0.1%
4.7190360291098895e+24 1
 
< 0.1%
4.71903602910976e+24 1
 
< 0.1%
4.7190360291097e+24 1
 
< 0.1%
4.71903602910936e+24 1
 
< 0.1%
4.71903602910934e+24 1
 
< 0.1%
4.7190360291092e+24 8
0.1%
4.7190360291090096e+24 2
 
< 0.1%

읍면동일련번호
Real number (ℝ)

Distinct: 9
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.2972
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
median: 1
Q3: 1
95-th percentile: 3
Maximum: 9
Range: 8
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.80357746
Coefficient of variation (CV): 0.61947075
Kurtosis: 19.557994
Mean: 1.2972
Median Absolute Deviation (MAD): 0
Skewness: 3.9107312
Sum: 12972
Variance: 0.64573673
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
1 8234
82.3%
2 1070
 
10.7%
3 480
 
4.8%
6 85
 
0.9%
5 77
 
0.8%
4 44
 
0.4%
9 8
 
0.1%
8 1
 
< 0.1%
7 1
 
< 0.1%
ValueCountFrequency (%)
1 8234
82.3%
2 1070
 
10.7%
3 480
 
4.8%
4 44
 
0.4%
5 77
 
0.8%
6 85
 
0.9%
7 1
 
< 0.1%
8 1
 
< 0.1%
9 8
 
0.1%
ValueCountFrequency (%)
9 8
 
0.1%
8 1
 
< 0.1%
7 1
 
< 0.1%
6 85
 
0.9%
5 77
 
0.8%
4 44
 
0.4%
3 480
 
4.8%
2 1070
 
10.7%
1 8234
82.3%

행정동코드
Real number (ℝ)

SKEWED 

Distinct: 26
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.7190594 × 10^9
Minimum: 4.719025 × 10^9
Maximum: 4.7850256 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4.719025 × 10^9
5-th percentile: 4.719025 × 10^9
Q1: 4.7190256 × 10^9
median: 4.7190535 × 10^9
Q3: 4.719063 × 10^9
95-th percentile: 4.719069 × 10^9
Maximum: 4.7850256 × 10^9
Range: 66000600
Interquartile range (IQR): 37400

Descriptive statistics

Standard deviation: 933196.78
Coefficient of variation (CV): 0.00019775059
Kurtosis: 4994.2139
Mean: 4.7190594 × 10^9
Median Absolute Deviation (MAD): 15500
Skewness: 70.665214
Sum: 4.7190594 × 10^13
Variance: 8.7085623 × 10^11
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
4719025000 1046
 
10.5%
4719025300 1046
 
10.5%
4719067000 814
 
8.1%
4719053500 696
 
7.0%
4719034000 632
 
6.3%
4719056500 613
 
6.1%
4719025600 435
 
4.3%
4719068000 428
 
4.3%
4719036000 422
 
4.2%
4719033000 415
 
4.2%
Other values (16) 3453
34.5%
ValueCountFrequency (%)
4719025000 1046
10.5%
4719025300 1046
10.5%
4719025600 435
4.3%
4719031000 360
 
3.6%
4719032000 307
 
3.1%
4719033000 415
 
4.2%
4719034000 632
6.3%
4719036000 422
4.2%
4719051000 235
 
2.4%
4719053500 696
7.0%
ValueCountFrequency (%)
4785025600 2
 
< 0.1%
4719070000 144
 
1.4%
4719069000 412
4.1%
4719068000 428
4.3%
4719067000 814
8.1%
4719066000 297
 
3.0%
4719064500 358
3.6%
4719063000 124
 
1.2%
4719061000 172
 
1.7%
4719060000 112
 
1.1%

행정동명
Categorical

Distinct: 26
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
선산읍 1046
고아읍 1046
인동동 814
원평동 696
해평면 632
Other values (21) 5766

Length

Max length: 5
Median length: 3
Mean length: 3.2473
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 비산동
2nd row: 해평면
3rd row: 형곡2동
4th row: 산동읍
5th row: 해평면

Common Values

ValueCountFrequency (%)
선산읍 1046
 
10.5%
고아읍 1046
 
10.5%
인동동 814
 
8.1%
원평동 696
 
7.0%
해평면 632
 
6.3%
선주원남동 613
 
6.1%
산동읍 435
 
4.3%
진미동 428
 
4.3%
장천면 422
 
4.2%
도개면 415
 
4.2%
Other values (16) 3453
34.5%

Length

Histogram of lengths of the category

우편번호
Real number (ℝ)

Distinct 342
Distinct (%) 3.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Mean 39229.956
Minimum 39100
Maximum 39465
Zeros 0
Zeros (%) 0.0%
Negative 0
Negative (%) 0.0%
Memory size 166.0 KiB

Quantile statistics

Minimum 39100
5-th percentile 39102
Q1 39139
median 39198
Q3 39322
95-th percentile 39442
Maximum 39465
Range 365
Interquartile range (IQR) 183

Descriptive statistics

Standard deviation 113.17532
Coefficient of variation (CV) 0.0028849208
Kurtosis -0.90967648
Mean 39229.956
Median Absolute Deviation (MAD) 80
Skewness 0.65526578
Sum 3.9229956 × 10⁸
Variance 12808.652
Monotonicity Not monotonic
Histogram with fixed size bins (bins=50)
Value  Count  Frequency (%)
39102  313  3.1%
39103  287  2.9%
39133  224  2.2%
39101  219  2.2%
39155  213  2.1%
39105  197  2.0%
39139  179  1.8%
39154  156  1.6%
39100  141  1.4%
39104  128  1.3%
Other values (332)  7943  79.4%
Value  Count  Frequency (%)
39100  141  1.4%
39101  219  2.2%
39102  313  3.1%
39103  287  2.9%
39104  128  1.3%
39105  197  2.0%
39106  63  0.6%
39107  54  0.5%
39108  69  0.7%
39109  36  0.4%
Value  Count  Frequency (%)
39465  2  < 0.1%
39464  6  0.1%
39463  1  < 0.1%
39462  5  0.1%
39461  3  < 0.1%
39460  1  < 0.1%
39459  3  < 0.1%
39458  61  0.6%
39457  38  0.4%
39456  46  0.5%

시군구용건물명
Text

MISSING 

Distinct 611
Distinct (%) 94.9%
Missing 9356
Missing (%) 93.6%
Memory size 156.2 KiB

Length

Max length 17
Median length 15
Mean length 6.2096273
Min length 2

Characters and Unicode

Total characters 3999
Distinct characters 406
Distinct categories 9
Distinct scripts 3
Distinct blocks 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
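
As a rough illustration of that kind of analysis, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module, using the five sample values listed for this column. The `CATEGORY_NAMES` mapping and the `category_counts` helper are hypothetical names introduced here, not the profiler's own code.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)  # two-letter code such as "Lo"
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# The five sample rows shown for 시군구용건물명.
sample = ["대풍빌라", "경운대학교기숙사", "오태중학교", "(주)영인테크", "정우빌라"]
counts = category_counts(sample)
print(counts)
```

Hangul syllables fall in the "Other Letter" (Lo) category, which is why the Other Letter count and the Hangul script count in this report are both 3666.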

Unique

Unique 584
Unique (%) 90.7%

Sample

1st row 대풍빌라
2nd row 경운대학교기숙사
3rd row 오태중학교
4th row (주)영인테크
5th row 정우빌라
Value  Count  Frequency (%)
에비앙힐스  4  0.6%
카운티  4  0.6%
백산빌리지3차  4  0.6%
주공아파트  4  0.6%
밀턴힐  3  0.4%
두산맨션  3  0.4%
프라임  3  0.4%
구미공장  3  0.4%
장원빌  3  0.4%
송정  2  0.3%
Other values (653)  680  95.4%

Most occurring characters

ValueCountFrequency (%)
140
 
3.5%
88
 
2.2%
78
 
2.0%
75
 
1.9%
72
 
1.8%
70
 
1.8%
69
 
1.7%
69
 
1.7%
66
 
1.7%
66
 
1.7%
Other values (396) 3206
80.2%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  3666  91.7%
Decimal Number  96  2.4%
Space Separator  69  1.7%
Uppercase Letter  56  1.4%
Close Punctuation  43  1.1%
Open Punctuation  43  1.1%
Lowercase Letter  19  0.5%
Other Punctuation  5  0.1%
Dash Punctuation  2  0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
140
 
3.8%
88
 
2.4%
78
 
2.1%
75
 
2.0%
72
 
2.0%
70
 
1.9%
69
 
1.9%
66
 
1.8%
66
 
1.8%
62
 
1.7%
Other values (354) 2880
78.6%
Uppercase Letter
Value  Count  Frequency (%)
A  9  16.1%
K  6  10.7%
C  5  8.9%
S  5  8.9%
P  5  8.9%
T  5  8.9%
G  3  5.4%
I  3  5.4%
B  3  5.4%
D  3  5.4%
Other values (7)  9  16.1%
Decimal Number
Value  Count  Frequency (%)
1  31  32.3%
2  25  26.0%
3  18  18.8%
4  5  5.2%
5  5  5.2%
7  3  3.1%
0  3  3.1%
8  2  2.1%
9  2  2.1%
6  2  2.1%
Lowercase Letter
Value  Count  Frequency (%)
e  5  26.3%
i  4  21.1%
l  3  15.8%
y  1  5.3%
a  1  5.3%
p  1  5.3%
s  1  5.3%
o  1  5.3%
d  1  5.3%
n  1  5.3%
Space Separator
Value  Count  Frequency (%)
(space)  69  100.0%
Close Punctuation
Value  Count  Frequency (%)
)  43  100.0%
Open Punctuation
Value  Count  Frequency (%)
(  43  100.0%
Other Punctuation
Value  Count  Frequency (%)
.  5  100.0%
Dash Punctuation
Value  Count  Frequency (%)
-  2  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  3666  91.7%
Common  258  6.5%
Latin  75  1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
140
 
3.8%
88
 
2.4%
78
 
2.1%
75
 
2.0%
72
 
2.0%
70
 
1.9%
69
 
1.9%
66
 
1.8%
66
 
1.8%
62
 
1.7%
Other values (354) 2880
78.6%
Latin
Value  Count  Frequency (%)
A  9  12.0%
K  6  8.0%
C  5  6.7%
S  5  6.7%
e  5  6.7%
P  5  6.7%
T  5  6.7%
i  4  5.3%
G  3  4.0%
I  3  4.0%
Other values (17)  25  33.3%
Common
Value  Count  Frequency (%)
(space)  69  26.7%
)  43  16.7%
(  43  16.7%
1  31  12.0%
2  25  9.7%
3  18  7.0%
.  5  1.9%
4  5  1.9%
5  5  1.9%
7  3  1.2%
Other values (5)  11  4.3%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  3666  91.7%
ASCII  333  8.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
140
 
3.8%
88
 
2.4%
78
 
2.1%
75
 
2.0%
72
 
2.0%
70
 
1.9%
69
 
1.9%
66
 
1.8%
66
 
1.8%
62
 
1.7%
Other values (354) 2880
78.6%
ASCII
Value  Count  Frequency (%)
(space)  69  20.7%
)  43  12.9%
(  43  12.9%
1  31  9.3%
2  25  7.5%
3  18  5.4%
A  9  2.7%
K  6  1.8%
.  5  1.5%
4  5  1.5%
Other values (32)  79  23.7%

공동주택여부
Categorical

IMBALANCE 

Distinct 2
Distinct (%) < 0.1%
Missing 0
Missing (%) 0.0%
Memory size 156.2 KiB
0 9593
1 407

Length

Max length 1
Median length 1
Mean length 1
Min length 1

Unique

Unique 0
Unique (%) 0.0%

Sample

1st row 1
2nd row 0
3rd row 1
4th row 0
5th row 0

Common Values

Value  Count  Frequency (%)
0  9593  95.9%
1  407  4.1%
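
The IMBALANCE flag on this column can be related to the two counts above. The sketch below assumes the imbalance score is one minus the Shannon entropy of the class frequencies, normalized by the maximum entropy for that number of classes. That definition is an assumption about how the profiler scores imbalance, and `imbalance_score` is a hypothetical helper name. Under this assumption the counts reproduce the 75.5% quoted in the report's alerts.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (bits) of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Class counts for 공동주택여부 taken from the table above: 0 -> 9593, 1 -> 407.
score = imbalance_score([9593, 407])
print(f"{score:.1%}")
```

A perfectly balanced column scores 0 under this definition, and a column dominated by a single value approaches 1.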

Length

Histogram of lengths of the category


Sample

First rows

매칭코드 법정필지코드 도로명주소코드 법정동코드 시도명 시군구명 법정읍면동명 법정리명 산여부 지번본번 지번부번 도로명코드 도로명 지하여부 건물본번 건물부번 건축물대장건물명 상세건물명 건물관리번호 읍면동일련번호 행정동코드 행정동명 우편번호 시군구용건물명 공동주택여부
3981247190472455201000000000000000000000000000000000000471901130010113000047190472455201000000000004719011300경상북도구미시공단동<NA>01130471904724552수출대로3길06821<NA><NA>471901130010113000000000014719061000비산동39257<NA>1
3482047190472453801000000000000000000000000000000000000471903403510001000047190472453801000000000004719034035경상북도구미시해평면일선리0175471904724538수류길0288<NA><NA>471903403510001000000000014719034000해평면39105<NA>0
4129747190472490901000000000000000000000000000000000000471901090010375000047190472490901000000000004719010900경상북도구미시형곡동<NA>03752471904724909형곡동로1길0510<NA><NA>471901090010375000000000014719058300형곡2동39320대풍빌라1
2890547190472421202000000000000000000000000000000000000471903502110582000047190472421202000000000004719025621경상북도구미시산동읍동곡리05820471904724212동곡3길070<NA><NA>471903502110582000000000024719025600산동읍39159<NA>0
3923247190472451001000000000000000000000000000000000000471903403110803000047190472451001000000000004719034031경상북도구미시해평면송곡리080398471904724510송곡6길051<NA><NA>471903403110803000000000014719034000해평면39105<NA>0
3362647190472457201000000000000000000000000000000000000471901110010168000047190472457201000000000004719011100경상북도구미시신평동<NA>016819471904724572신비로4길02015<NA><NA>471901110010168000000000014719059000신평1동39252<NA>0
380347190472445001000000000000000000000000000000000000471903602110503000047190472445001000000000004719036021경상북도구미시장천면상장리05030471904724450상장1길0500<NA><NA>471903602110503000000000014719036000장천면39455<NA>0
2577047190472470301000000000000000000000000000000000000471903302310110000047190472470301000000000004719033023경상북도구미시도개면월림리01100471904724703월림4길0116<NA><NA>471903302310110000000000014719033000도개면39103<NA>0
897847190472437001000000000000000000000000000000000000471901120010488000047190472437001000000000004719011200경상북도구미시비산동<NA>04882471904724370비산로1안길0140<NA><NA>471901120010488000000000014719061000비산동39258<NA>0
2353847190472411501000000000000000000000000000000000000471901010010433000047190472411501000000000004719010100경상북도구미시원평동<NA>043325471904724115금오산로16길0131<NA><NA>471901010010433000000000014719053500원평동39302<NA>0

Last rows

매칭코드 법정필지코드 도로명주소코드 법정동코드 시도명 시군구명 법정읍면동명 법정리명 산여부 지번본번 지번부번 도로명코드 도로명 지하여부 건물본번 건물부번 건축물대장건물명 상세건물명 건물관리번호 읍면동일련번호 행정동코드 행정동명 우편번호 시군구용건물명 공동주택여부
1112747190472422801000000000000000000000000000000000000471902533211348000047190472422801000000000004719025332경상북도구미시고아읍문성리013480471904724228들성로15길0117<NA><NA>471902533211348000000000014719025300고아읍39146<NA>0
3026747190472474901000000000000000000000000000000000000471901200010019000047190472474901000000000004719012200경상북도구미시인의동<NA>010043471904724749인동31길01010<NA><NA>471901200010019000000000014719067000인동동39440<NA>0
2139347190330808701000000000000000000000000000000000000471901190010676000047190330808701000000000004719011900경상북도구미시신동<NA>06760471903308087인동가산로04660<NA><NA>471901190010676000000000014719067000인동동39444<NA>0
1397047190472442801000000000000000000000000000000000000471901150010657000047190472442801000000000004719011500경상북도구미시사곡동<NA>06571471904724428상사동로26길0110<NA><NA>471901150010657000000000014719064500상모사곡동39328<NA>0
1724347190301809502000000000000000000000000000000000000471901240010170000047190301809502000000000004719012400경상북도구미시시미동<NA>017034719030180953공단1로011218<NA><NA>471901240010170000000000024785025600석적읍39403<NA>0
1543647190230800303000000000000000000000000000000000000471902532110665000047190230800303000000000004719025321경상북도구미시고아읍관심리06656471902308003선산대로09480<NA><NA>471902532110665000000000034719025300고아읍39142<NA>0
1376947190472435601000000000000000000000000000000000000471902533610911000047190472435601000000000004719025336경상북도구미시고아읍봉한리0911166471904724356봉한1안길0613<NA><NA>471902533610911000000000014719025300고아읍39144<NA>0
2913247190472421302000000000000000000000000000000000000471903502110406000047190472421302000000000004719025621경상북도구미시산동읍동곡리04065471904724213동곡4길0900<NA><NA>471903502110406000000000024719025600산동읍39159<NA>0
2469447190472408701000000000000000000000000000000000000471901010010118000047190472408701000000000004719010100경상북도구미시원평동<NA>011831471904724087구미중앙로9길070<NA><NA>471901010010118000000000014719053500원평동39221<NA>0
1345247190472429502000000000000000000000000000000000000471903502910942000047190472429502000000000004719025629경상북도구미시산동읍백현리09420471904724295백현1길04130<NA><NA>471903502910942000000000024719025600산동읍39161<NA>0