Overview

Dataset statistics

Number of variables14
Number of observations183
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.9 KiB
Average record size in memory116.7 B

Variable types

Text5
Numeric5
Categorical4

Alerts

동/읍/면/리사무소 is highly overall correlated with P02000 and 2 other fieldsHigh correlation
P02000 is highly overall correlated with 행정기관 and 2 other fieldsHigh correlation
행정기관 is highly overall correlated with P02000 and 2 other fieldsHigh correlation
P02003 is highly overall correlated with P02000 and 2 other fieldsHigh correlation
313742 is highly overall correlated with 557432 and 2 other fieldsHigh correlation
557432 is highly overall correlated with 313742 and 2 other fieldsHigh correlation
1130510100008130003 is highly overall correlated with 313742 and 2 other fieldsHigh correlation
1130510100108130003000001 is highly overall correlated with 313742 and 2 other fieldsHigh correlation
P0202904 has unique valuesUnique
삼각산동주민센터 has unique valuesUnique

Reproduction

Analysis started2023-12-10 06:13:52.931647
Analysis finished2023-12-10 06:14:07.983666
Duration15.05 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

P0202904
Text

UNIQUE 

Distinct183
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-10T15:14:08.339785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters1464
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)100.0%

Sample

1st rowP0109522
2nd rowP0110016
3rd rowP0110641
4th rowP0202988
5th rowP0106984
ValueCountFrequency (%)
p0109522 1
 
0.5%
p0203313 1
 
0.5%
p0109509 1
 
0.5%
p0202615 1
 
0.5%
p0110965 1
 
0.5%
p0112706 1
 
0.5%
p0106981 1
 
0.5%
p0106104 1
 
0.5%
p0110967 1
 
0.5%
p0109524 1
 
0.5%
Other values (173) 173
94.5%
2023-12-10T15:14:09.067027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 414
28.3%
1 249
17.0%
P 183
12.5%
2 157
 
10.7%
9 117
 
8.0%
6 81
 
5.5%
8 59
 
4.0%
3 57
 
3.9%
7 55
 
3.8%
5 54
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1281
87.5%
Uppercase Letter 183
 
12.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 414
32.3%
1 249
19.4%
2 157
 
12.3%
9 117
 
9.1%
6 81
 
6.3%
8 59
 
4.6%
3 57
 
4.4%
7 55
 
4.3%
5 54
 
4.2%
4 38
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
P 183
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1281
87.5%
Latin 183
 
12.5%

Most frequent character per script

Common
ValueCountFrequency (%)
0 414
32.3%
1 249
19.4%
2 157
 
12.3%
9 117
 
9.1%
6 81
 
6.3%
8 59
 
4.6%
3 57
 
4.4%
7 55
 
4.3%
5 54
 
4.2%
4 38
 
3.0%
Latin
ValueCountFrequency (%)
P 183
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1464
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 414
28.3%
1 249
17.0%
P 183
12.5%
2 157
 
10.7%
9 117
 
8.0%
6 81
 
5.5%
8 59
 
4.0%
3 57
 
3.9%
7 55
 
3.8%
5 54
 
3.7%
Distinct183
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-10T15:14:09.456643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length7.8032787
Min length4

Characters and Unicode

Total characters1428
Distinct characters117
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)100.0%

Sample

1st row서울도곡한신우편취급국
2nd row서울번3동우체국
3rd row서울공항동우체국
4th row인수동주민센터
5th row강서소방서
ValueCountFrequency (%)
서울도곡한신우편취급국 1
 
0.5%
화곡4동주민센터 1
 
0.5%
서울역삼성보우편취급국 1
 
0.5%
대치1동주민센터 1
 
0.5%
번3파출소 1
 
0.5%
화곡지구대 1
 
0.5%
강남소방서 1
 
0.5%
강남세무서 1
 
0.5%
삼양파출소 1
 
0.5%
서울대치4동우편취급국 1
 
0.5%
Other values (173) 173
94.5%
2023-12-10T15:14:09.976009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
93
 
6.5%
91
 
6.4%
70
 
4.9%
69
 
4.8%
69
 
4.8%
66
 
4.6%
64
 
4.5%
55
 
3.9%
53
 
3.7%
1 50
 
3.5%
Other values (107) 748
52.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1320
92.4%
Decimal Number 105
 
7.4%
Uppercase Letter 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
93
 
7.0%
91
 
6.9%
70
 
5.3%
69
 
5.2%
69
 
5.2%
66
 
5.0%
64
 
4.8%
55
 
4.2%
53
 
4.0%
34
 
2.6%
Other values (96) 656
49.7%
Decimal Number
ValueCountFrequency (%)
1 50
47.6%
2 20
 
19.0%
9 14
 
13.3%
3 13
 
12.4%
4 5
 
4.8%
5 1
 
1.0%
6 1
 
1.0%
8 1
 
1.0%
Uppercase Letter
ValueCountFrequency (%)
D 1
33.3%
I 1
33.3%
A 1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1320
92.4%
Common 105
 
7.4%
Latin 3
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
93
 
7.0%
91
 
6.9%
70
 
5.3%
69
 
5.2%
69
 
5.2%
66
 
5.0%
64
 
4.8%
55
 
4.2%
53
 
4.0%
34
 
2.6%
Other values (96) 656
49.7%
Common
ValueCountFrequency (%)
1 50
47.6%
2 20
 
19.0%
9 14
 
13.3%
3 13
 
12.4%
4 5
 
4.8%
5 1
 
1.0%
6 1
 
1.0%
8 1
 
1.0%
Latin
ValueCountFrequency (%)
D 1
33.3%
I 1
33.3%
A 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1320
92.4%
ASCII 108
 
7.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
93
 
7.0%
91
 
6.9%
70
 
5.3%
69
 
5.2%
69
 
5.2%
66
 
5.0%
64
 
4.8%
55
 
4.2%
53
 
4.0%
34
 
2.6%
Other values (96) 656
49.7%
ASCII
ValueCountFrequency (%)
1 50
46.3%
2 20
 
18.5%
9 14
 
13.0%
3 13
 
12.0%
4 5
 
4.6%
5 1
 
0.9%
6 1
 
0.9%
D 1
 
0.9%
I 1
 
0.9%
8 1
 
0.9%
Distinct177
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-10T15:14:10.543961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length18.076503
Min length1

Characters and Unicode

Total characters3308
Distinct characters104
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique172 ?
Unique (%)94.0%

Sample

1st row서울특별시 강남구 논현로38길 38-4
2nd row서울특별시 강북구 한천로105길 27
3rd row서울특별시 강서구 공항대로 41
4th row서울특별시 강북구 인수봉로 255
5th row서울특별시 강서구 양천로 550
ValueCountFrequency (%)
서울특별시 182
25.0%
강남구 87
 
11.9%
강서구 56
 
7.7%
강북구 39
 
5.3%
삼양로 7
 
1.0%
테헤란로 6
 
0.8%
개포로 6
 
0.8%
학동로 5
 
0.7%
선릉로 5
 
0.7%
한천로 5
 
0.7%
Other values (237) 331
45.4%
2023-12-10T15:14:11.387181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
546
16.5%
248
 
7.5%
193
 
5.8%
189
 
5.7%
182
 
5.5%
182
 
5.5%
182
 
5.5%
182
 
5.5%
180
 
5.4%
1 108
 
3.3%
Other values (94) 1116
33.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2153
65.1%
Decimal Number 603
 
18.2%
Space Separator 546
 
16.5%
Dash Punctuation 5
 
0.2%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
248
11.5%
193
9.0%
189
8.8%
182
8.5%
182
8.5%
182
8.5%
182
8.5%
180
8.4%
93
 
4.3%
76
 
3.5%
Other values (81) 446
20.7%
Decimal Number
ValueCountFrequency (%)
1 108
17.9%
2 88
14.6%
3 67
11.1%
5 61
10.1%
6 61
10.1%
4 52
8.6%
9 47
7.8%
0 43
 
7.1%
7 43
 
7.1%
8 33
 
5.5%
Space Separator
ValueCountFrequency (%)
546
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Uppercase Letter
ValueCountFrequency (%)
X 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2153
65.1%
Common 1154
34.9%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
248
11.5%
193
9.0%
189
8.8%
182
8.5%
182
8.5%
182
8.5%
182
8.5%
180
8.4%
93
 
4.3%
76
 
3.5%
Other values (81) 446
20.7%
Common
ValueCountFrequency (%)
546
47.3%
1 108
 
9.4%
2 88
 
7.6%
3 67
 
5.8%
5 61
 
5.3%
6 61
 
5.3%
4 52
 
4.5%
9 47
 
4.1%
0 43
 
3.7%
7 43
 
3.7%
Other values (2) 38
 
3.3%
Latin
ValueCountFrequency (%)
X 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2153
65.1%
ASCII 1155
34.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
546
47.3%
1 108
 
9.4%
2 88
 
7.6%
3 67
 
5.8%
5 61
 
5.3%
6 61
 
5.3%
4 52
 
4.5%
9 47
 
4.1%
0 43
 
3.7%
7 43
 
3.7%
Other values (3) 39
 
3.4%
Hangul
ValueCountFrequency (%)
248
11.5%
193
9.0%
189
8.8%
182
8.5%
182
8.5%
182
8.5%
182
8.5%
180
8.4%
93
 
4.3%
76
 
3.5%
Other values (81) 446
20.7%

313742
Real number (ℝ)

HIGH CORRELATION 

Distinct177
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean310180.2
Minimum294475
Maximum321232
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-10T15:14:12.038136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum294475
5-th percentile295468.6
Q1299427
median314429
Q3315981
95-th percentile319377.5
Maximum321232
Range26757
Interquartile range (IQR)16554

Descriptive statistics

Standard deviation8546.678
Coefficient of variation (CV)0.027553913
Kurtosis-1.1753482
Mean310180.2
Median Absolute Deviation (MAD)2013
Skewness-0.76390585
Sum56762976
Variance73045705
MonotonicityNot monotonic
2023-12-10T15:14:12.319951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
314435 3
 
1.6%
299479 2
 
1.1%
315302 2
 
1.1%
317696 2
 
1.1%
314783 2
 
1.1%
315721 1
 
0.5%
297098 1
 
0.5%
316788 1
 
0.5%
316170 1
 
0.5%
298163 1
 
0.5%
Other values (167) 167
91.3%
ValueCountFrequency (%)
294475 1
0.5%
294781 1
0.5%
294911 1
0.5%
294920 1
0.5%
294990 1
0.5%
295072 1
0.5%
295221 1
0.5%
295399 1
0.5%
295406 1
0.5%
295465 1
0.5%
ValueCountFrequency (%)
321232 1
0.5%
321192 1
0.5%
321051 1
0.5%
321047 1
0.5%
321031 1
0.5%
319968 1
0.5%
319550 1
0.5%
319530 1
0.5%
319407 1
0.5%
319404 1
0.5%

557432
Real number (ℝ)

HIGH CORRELATION 

Distinct175
Distinct (%)95.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean549807.33
Minimum540712
Maximum562599
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-10T15:14:12.646772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum540712
5-th percentile542772.6
Q1545133
median548410
Q3552626
95-th percentile560132
Maximum562599
Range21887
Interquartile range (IQR)7493

Descriptive statistics

Standard deviation5748.9971
Coefficient of variation (CV)0.010456385
Kurtosis-0.8355743
Mean549807.33
Median Absolute Deviation (MAD)3759
Skewness0.61700073
Sum1.0061474 × 108
Variance33050968
MonotonicityNot monotonic
2023-12-10T15:14:12.879426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
544454 3
 
1.6%
546672 2
 
1.1%
545752 2
 
1.1%
551277 2
 
1.1%
559403 2
 
1.1%
553718 2
 
1.1%
551399 2
 
1.1%
547896 1
 
0.5%
549688 1
 
0.5%
546706 1
 
0.5%
Other values (165) 165
90.2%
ValueCountFrequency (%)
540712 1
0.5%
541144 1
0.5%
541819 1
0.5%
542274 1
0.5%
542295 1
0.5%
542381 1
0.5%
542718 1
0.5%
542730 1
0.5%
542750 1
0.5%
542767 1
0.5%
ValueCountFrequency (%)
562599 1
0.5%
561079 1
0.5%
560743 1
0.5%
560707 1
0.5%
560666 1
0.5%
560365 1
0.5%
560320 1
0.5%
560300 1
0.5%
560296 1
0.5%
560136 1
0.5%

220765
Real number (ℝ)

Distinct161
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean197403.65
Minimum9735
Maximum518897
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-10T15:14:13.095627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9735
5-th percentile14723.8
Q127922.5
median220101
Q3282956.5
95-th percentile414852.9
Maximum518897
Range509162
Interquartile range (IQR)255034

Descriptive statistics

Standard deviation147946.2
Coefficient of variation (CV)0.74946028
Kurtosis-1.176075
Mean197403.65
Median Absolute Deviation (MAD)135380
Skewness0.082259655
Sum36124868
Variance2.1888077 × 1010
MonotonicityNot monotonic
2023-12-10T15:14:13.321524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15248 3
 
1.6%
15515 3
 
1.6%
142113 3
 
1.6%
355917 3
 
1.6%
220809 3
 
1.6%
323900 2
 
1.1%
414643 2
 
1.1%
13785 2
 
1.1%
414149 2
 
1.1%
278605 2
 
1.1%
Other values (151) 158
86.3%
ValueCountFrequency (%)
9735 1
0.5%
10717 1
0.5%
13779 1
0.5%
13785 2
1.1%
14138 1
0.5%
14469 1
0.5%
14537 1
0.5%
14632 1
0.5%
14711 1
0.5%
14839 1
0.5%
ValueCountFrequency (%)
518897 1
0.5%
518516 1
0.5%
508825 2
1.1%
502544 1
0.5%
418721 1
0.5%
417617 1
0.5%
417062 1
0.5%
415152 1
0.5%
414862 1
0.5%
414771 1
0.5%

P02000
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
P01000
127 
P02000
56 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowP01000
2nd rowP01000
3rd rowP01000
4th rowP02000
5th rowP01000

Common Values

ValueCountFrequency (%)
P01000 127
69.4%
P02000 56
30.6%

Length

2023-12-10T15:14:13.536859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:14:13.701119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
p01000 127
69.4%
p02000 56
30.6%

행정기관
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
공공기관
127 
행정기관
56 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공기관
2nd row공공기관
3rd row공공기관
4th row행정기관
5th row공공기관

Common Values

ValueCountFrequency (%)
공공기관 127
69.4%
행정기관 56
30.6%

Length

2023-12-10T15:14:13.881552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:14:14.147229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공공기관 127
69.4%
행정기관 56
30.6%

P02003
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
P01010
64 
P02003
53 
P01012
19 
P01009
17 
P01014
14 
Other values (4)
16 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowP01010
2nd rowP01010
3rd rowP01010
4th rowP02003
5th rowP01009

Common Values

ValueCountFrequency (%)
P01010 64
35.0%
P02003 53
29.0%
P01012 19
 
10.4%
P01009 17
 
9.3%
P01014 14
 
7.7%
P01008 6
 
3.3%
P01003 4
 
2.2%
P01007 3
 
1.6%
P02002 3
 
1.6%

Length

2023-12-10T15:14:14.316470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:14:14.507138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
p01010 64
35.0%
p02003 53
29.0%
p01012 19
 
10.4%
p01009 17
 
9.3%
p01014 14
 
7.7%
p01008 6
 
3.3%
p01003 4
 
2.2%
p01007 3
 
1.6%
p02002 3
 
1.6%

동/읍/면/리사무소
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
우체국
64 
동/읍/면/리사무소
53 
파출소
19 
소방서
17 
지구대
14 
Other values (4)
16 

Length

Max length10
Median length3
Mean length5.0928962
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row우체국
2nd row우체국
3rd row우체국
4th row동/읍/면/리사무소
5th row소방서

Common Values

ValueCountFrequency (%)
우체국 64
35.0%
동/읍/면/리사무소 53
29.0%
파출소 19
 
10.4%
소방서 17
 
9.3%
지구대 14
 
7.7%
세무서 6
 
3.3%
경찰서 4
 
2.2%
세관 3
 
1.6%
도/시/구/군청 3
 
1.6%

Length

2023-12-10T15:14:14.741932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:14:14.948620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
우체국 64
35.0%
동/읍/면/리사무소 53
29.0%
파출소 19
 
10.4%
소방서 17
 
9.3%
지구대 14
 
7.7%
세무서 6
 
3.3%
경찰서 4
 
2.2%
세관 3
 
1.6%
도/시/구/군청 3
 
1.6%
Distinct177
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-10T15:14:15.577925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length28
Mean length22.234973
Min length13

Characters and Unicode

Total characters4069
Distinct characters132
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique172 ?
Unique (%)94.0%

Sample

1st row서울특별시 강남구 논현로38길 38-4
2nd row서울특별시 강북구 한천로 105길 27
3rd row서울특별시 강서구 공항대로 41 서울공항동우체국
4th row서울특별시 강북구 인수봉로 255(수유동)
5th row서울특별시 강서구 양천로 550 (등촌동)
ValueCountFrequency (%)
서울특별시 179
20.9%
강남구 88
 
10.3%
강서구 56
 
6.5%
강북구 39
 
4.6%
화곡동 15
 
1.8%
삼양로 9
 
1.1%
개포로 6
 
0.7%
한천로 6
 
0.7%
미아동 6
 
0.7%
등촌동 6
 
0.7%
Other values (300) 447
52.2%
2023-12-10T15:14:16.484759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
678
 
16.7%
260
 
6.4%
195
 
4.8%
191
 
4.7%
191
 
4.7%
181
 
4.4%
179
 
4.4%
179
 
4.4%
179
 
4.4%
121
 
3.0%
Other values (122) 1715
42.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2561
62.9%
Space Separator 678
 
16.7%
Decimal Number 633
 
15.6%
Open Punctuation 96
 
2.4%
Close Punctuation 96
 
2.4%
Dash Punctuation 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
260
 
10.2%
195
 
7.6%
191
 
7.5%
191
 
7.5%
181
 
7.1%
179
 
7.0%
179
 
7.0%
179
 
7.0%
121
 
4.7%
95
 
3.7%
Other values (108) 790
30.8%
Decimal Number
ValueCountFrequency (%)
1 114
18.0%
2 95
15.0%
3 71
11.2%
6 64
10.1%
5 62
9.8%
4 54
8.5%
0 50
7.9%
9 47
7.4%
7 44
 
7.0%
8 32
 
5.1%
Space Separator
ValueCountFrequency (%)
678
100.0%
Open Punctuation
ValueCountFrequency (%)
( 96
100.0%
Close Punctuation
ValueCountFrequency (%)
) 96
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2561
62.9%
Common 1508
37.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
260
 
10.2%
195
 
7.6%
191
 
7.5%
191
 
7.5%
181
 
7.1%
179
 
7.0%
179
 
7.0%
179
 
7.0%
121
 
4.7%
95
 
3.7%
Other values (108) 790
30.8%
Common
ValueCountFrequency (%)
678
45.0%
1 114
 
7.6%
( 96
 
6.4%
) 96
 
6.4%
2 95
 
6.3%
3 71
 
4.7%
6 64
 
4.2%
5 62
 
4.1%
4 54
 
3.6%
0 50
 
3.3%
Other values (4) 128
 
8.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2561
62.9%
ASCII 1508
37.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
678
45.0%
1 114
 
7.6%
( 96
 
6.4%
) 96
 
6.4%
2 95
 
6.3%
3 71
 
4.7%
6 64
 
4.2%
5 62
 
4.1%
4 54
 
3.6%
0 50
 
3.3%
Other values (4) 128
 
8.5%
Hangul
ValueCountFrequency (%)
260
 
10.2%
195
 
7.6%
191
 
7.5%
191
 
7.5%
181
 
7.1%
179
 
7.0%
179
 
7.0%
179
 
7.0%
121
 
4.7%
95
 
3.7%
Other values (108) 790
30.8%

1130510100008130003
Real number (ℝ)

HIGH CORRELATION 

Distinct176
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1545105 × 1018
Minimum1.1305101 × 1018
Maximum1.1680118 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-10T15:14:16.819126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1305101 × 1018
5-th percentile1.1305101 × 1018
Q11.1500102 × 1018
median1.1500109 × 1018
Q31.1680105 × 1018
95-th percentile1.1680114 × 1018
Maximum1.1680118 × 1018
Range3.75017 × 1016
Interquartile range (IQR)1.80003 × 1016

Descriptive statistics

Standard deviation1.4757562 × 1016
Coefficient of variation (CV)0.012782527
Kurtosis-1.1481347
Mean1.1545105 × 1018
Median Absolute Deviation (MAD)1.79996 × 1016
Skewness-0.56647912
Sum8.3612366 × 1018
Variance2.1778564 × 1032
MonotonicityNot monotonic
2023-12-10T15:14:17.066504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1168010100008240000 3
 
1.6%
1150010800013730005 2
 
1.1%
1150010200006300002 2
 
1.1%
1168010500001710003 2
 
1.1%
1130510200003650001 2
 
1.1%
1168010800000710000 2
 
1.1%
1168011800004170001 1
 
0.5%
1150010600006810005 1
 
0.5%
1168010600006460002 1
 
0.5%
1130510200002420006 1
 
0.5%
Other values (166) 166
90.7%
ValueCountFrequency (%)
1130510100000620006 1
0.5%
1130510100000870141 1
0.5%
1130510100001270009 1
0.5%
1130510100001970001 1
0.5%
1130510100003270001 1
0.5%
1130510100003270002 1
0.5%
1130510100003270005 1
0.5%
1130510100007760005 1
0.5%
1130510100007910350 1
0.5%
1130510100008600056 1
0.5%
ValueCountFrequency (%)
1168011800008920006 1
0.5%
1168011800005430007 1
0.5%
1168011800004670012 1
0.5%
1168011800004590000 1
0.5%
1168011800004170001 1
0.5%
1168011500007410001 1
0.5%
1168011500007180003 1
0.5%
1168011500007180002 1
0.5%
1168011500007180001 1
0.5%
1168011400007350004 1
0.5%

1130510100108130003000001
Real number (ℝ)

HIGH CORRELATION 

Distinct159
Distinct (%)86.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1545105 × 1024
Minimum1.1305101 × 1024
Maximum1.1680118 × 1024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-10T15:14:17.321509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1305101 × 1024
5-th percentile1.1305101 × 1024
Q11.1500102 × 1024
median1.1500109 × 1024
Q31.1680105 × 1024
95-th percentile1.1680114 × 1024
Maximum1.1680118 × 1024
Range3.75017 × 1022
Interquartile range (IQR)1.80003 × 1022

Descriptive statistics

Standard deviation1.4757562 × 1022
Coefficient of variation (CV)0.012782527
Kurtosis-1.1481347
Mean1.1545105 × 1024
Median Absolute Deviation (MAD)1.79996 × 1022
Skewness-0.56647912
Sum2.1127542 × 1026
Variance2.1778564 × 1044
MonotonicityNot monotonic
2023-12-10T15:14:17.586262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.16801010010824e+24 3
 
1.6%
1.1500103001098e+24 3
 
1.6%
1.1305103001060504e+24 3
 
1.6%
1.16801150010718e+24 3
 
1.6%
1.13051020010242e+24 3
 
1.6%
1.13051010010327e+24 3
 
1.6%
1.1500103001002402e+24 2
 
1.1%
1.1500108001015e+24 2
 
1.1%
1.16801030010014e+24 2
 
1.1%
1.16801140010712e+24 2
 
1.1%
Other values (149) 157
85.8%
ValueCountFrequency (%)
1.13051010010062e+24 1
 
0.5%
1.13051010010087e+24 1
 
0.5%
1.13051010010127e+24 1
 
0.5%
1.13051010010197e+24 1
 
0.5%
1.13051010010327e+24 3
1.6%
1.13051010010776e+24 1
 
0.5%
1.1305101001079104e+24 1
 
0.5%
1.1305101001086e+24 2
1.1%
1.13051010010867e+24 1
 
0.5%
1.13051010011267e+24 1
 
0.5%
ValueCountFrequency (%)
1.16801180010892e+24 1
 
0.5%
1.16801180010869e+24 1
 
0.5%
1.16801180010467e+24 1
 
0.5%
1.16801180010459e+24 1
 
0.5%
1.16801180010417e+24 1
 
0.5%
1.16801150010741e+24 1
 
0.5%
1.16801150010718e+24 3
1.6%
1.16801140010735e+24 2
1.1%
1.16801140010712e+24 2
1.1%
1.16801140010621e+24 1
 
0.5%
Distinct176
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-10T15:14:18.180048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length22
Mean length20.972678
Min length17

Characters and Unicode

Total characters3838
Distinct characters65
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique170 ?
Unique (%)92.9%

Sample

1st row서울특별시 강남구 도곡동 417-1번지
2nd row서울특별시 강북구 번동 242-2번지
3rd row서울특별시 강서구 공항동 28-1번지
4th row서울특별시 강북구 수유동 527-19번지
5th row서울특별시 강서구 등촌동 630-2번지
ValueCountFrequency (%)
서울특별시 183
25.0%
강남구 88
 
12.0%
강서구 56
 
7.7%
강북구 39
 
5.3%
화곡동 21
 
2.9%
역삼동 15
 
2.0%
미아동 14
 
1.9%
수유동 14
 
1.9%
삼성동 13
 
1.8%
논현동 12
 
1.6%
Other values (196) 277
37.8%
2023-12-10T15:14:18.910495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
549
 
14.3%
243
 
6.3%
193
 
5.0%
186
 
4.8%
183
 
4.8%
183
 
4.8%
183
 
4.8%
183
 
4.8%
183
 
4.8%
183
 
4.8%
Other values (55) 1569
40.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2376
61.9%
Decimal Number 761
 
19.8%
Space Separator 549
 
14.3%
Dash Punctuation 152
 
4.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
243
10.2%
193
8.1%
186
 
7.8%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
Other values (43) 473
19.9%
Decimal Number
ValueCountFrequency (%)
1 157
20.6%
2 103
13.5%
6 75
9.9%
4 74
9.7%
3 69
9.1%
7 68
8.9%
5 64
8.4%
8 57
 
7.5%
0 49
 
6.4%
9 45
 
5.9%
Space Separator
ValueCountFrequency (%)
549
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 152
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2376
61.9%
Common 1462
38.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
243
10.2%
193
8.1%
186
 
7.8%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
Other values (43) 473
19.9%
Common
ValueCountFrequency (%)
549
37.6%
1 157
 
10.7%
- 152
 
10.4%
2 103
 
7.0%
6 75
 
5.1%
4 74
 
5.1%
3 69
 
4.7%
7 68
 
4.7%
5 64
 
4.4%
8 57
 
3.9%
Other values (2) 94
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2376
61.9%
ASCII 1462
38.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
549
37.6%
1 157
 
10.7%
- 152
 
10.4%
2 103
 
7.0%
6 75
 
5.1%
4 74
 
5.1%
3 69
 
4.7%
7 68
 
4.7%
5 64
 
4.4%
8 57
 
3.9%
Other values (2) 94
 
6.4%
Hangul
ValueCountFrequency (%)
243
10.2%
193
8.1%
186
 
7.8%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
183
 
7.7%
Other values (43) 473
19.9%

Interactions

2023-12-10T15:14:01.244309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:54.021937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:55.972754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:58.013582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:59.419803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:02.321224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:54.161957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:56.095148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:58.130406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:59.541773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:03.727322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:54.299662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:56.223169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:58.248503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:59.647162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:04.679540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:54.439122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:56.341099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:58.358582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:59.775536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:05.757545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:54.594664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:56.512279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:58.464896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:59.913451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:14:19.052104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
313742557432220765P02000행정기관P02003동/읍/면/리사무소11305101000081300031130510100108130003000001
3137421.0000.7230.5960.0000.0000.0000.0000.8150.815
5574320.7231.0000.6850.0000.0000.0000.0000.9990.999
2207650.5960.6851.0000.0000.0000.0000.0000.8560.856
P020000.0000.0000.0001.0001.0001.0001.0000.0000.000
행정기관0.0000.0000.0001.0001.0001.0001.0000.0000.000
P020030.0000.0000.0001.0001.0001.0001.0000.0000.000
동/읍/면/리사무소0.0000.0000.0001.0001.0001.0001.0000.0000.000
11305101000081300030.8150.9990.8560.0000.0000.0000.0001.0001.000
11305101001081300030000010.8150.9990.8560.0000.0000.0000.0001.0001.000
2023-12-10T15:14:19.234201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동/읍/면/리사무소P02000행정기관P02003
동/읍/면/리사무소1.0000.9800.9801.000
P020000.9801.0000.9870.980
행정기관0.9800.9871.0000.980
P020031.0000.9800.9801.000
2023-12-10T15:14:19.394627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
31374255743222076511305101000081300031130510100108130003000001P02000행정기관P02003동/읍/면/리사무소
3137421.000-0.6340.2890.5270.5270.0000.0000.0000.000
557432-0.6341.000-0.201-0.832-0.8320.0000.0000.0000.000
2207650.289-0.2011.0000.0720.0730.0000.0000.0000.000
11305101000081300030.527-0.8320.0721.0001.0000.0000.0000.0000.000
11305101001081300030000010.527-0.8320.0731.0001.0000.1820.1820.2130.213
P020000.0000.0000.0000.0000.1821.0000.9870.9800.980
행정기관0.0000.0000.0000.0000.1820.9871.0000.9800.980
P020030.0000.0000.0000.0000.2130.9800.9801.0001.000
동/읍/면/리사무소0.0000.0000.0000.0000.2130.9800.9801.0001.000

Missing values

2023-12-10T15:14:07.536806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:14:07.844659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

P0202904삼각산동주민센터서울특별시 강북구 삼양로19길 34313742557432220765P02000행정기관P02003동/읍/면/리사무소서울특별시 강북구 삼양로19길 34(미아동)11305101000081300031130510100108130003000001서울특별시 강북구 미아동 813-3번지
0P0109522서울도곡한신우편취급국서울특별시 강남구 논현로38길 38-431572154300219106P01000공공기관P01010우체국서울특별시 강남구 논현로38길 38-411680118000041700011168011800104170001000973서울특별시 강남구 도곡동 417-1번지
1P0110016서울번3동우체국서울특별시 강북구 한천로105길 27316005558582219195P01000공공기관P01010우체국서울특별시 강북구 한천로 105길 2711305102000024200021130510200102420002035152서울특별시 강북구 번동 242-2번지
2P0110641서울공항동우체국서울특별시 강서구 공항대로 4129522155169715885P01000공공기관P01010우체국서울특별시 강서구 공항대로 41 서울공항동우체국11500108000002800011150010800100280001005979서울특별시 강서구 공항동 28-1번지
3P0202988인수동주민센터서울특별시 강북구 인수봉로 255312877560365221245P02000행정기관P02003동/읍/면/리사무소서울특별시 강북구 인수봉로 255(수유동)11305103000052700191130510300105270019001502서울특별시 강북구 수유동 527-19번지
4P0106984강서소방서서울특별시 강서구 양천로 550299479551277282922P01000공공기관P01009소방서서울특별시 강서구 양천로 550 (등촌동)11500102000063000021150010200106300002026625서울특별시 강서구 등촌동 630-2번지
5P0106868수서119안전센터서울특별시 강남구 광평로31길 6319968542916264903P01000공공기관P01009소방서서울특별시 강남구 광평로31길 6 (수서동)11680115000074100011168011500107410001001709서울특별시 강남구 수서동 741-1번지
6P0202986수유3동주민센터서울특별시 강북구 노해로 36313982560046220132P02000행정기관P02003동/읍/면/리사무소서울특별시 강북구 노해로 36(수유동)11305103000022300081130510300102230008008871서울특별시 강북구 수유동 223-8번지
7P0203003화곡2동주민센터서울특별시 강서구 곰달래로37길 1329895254835115353P02000행정기관P02003동/읍/면/리사무소서울특별시 강서구 곰달래로37길 13 (화곡동)11500103000086000041150010300108600004023074서울특별시 강서구 화곡동 860-4번지
8P0109523서울대치우성우편취급국서울특별시 강남구 남부순환로 2917316999543860269766P01000공공기관P01010우체국서울특별시 강남구 남부순환로 2917 청실상가 2층11680106000062600001168010600106260000014407서울특별시 강남구 대치동 626번지
9P0202621삼성2동주민센터서울특별시 강남구 봉은사로 41931585354587424415P02000행정기관P02003동/읍/면/리사무소서울특별시 강남구 봉은사로 41911680105000003800341168010500100380034016359서울특별시 강남구 삼성동 38-34번지
P0202904삼각산동주민센터서울특별시 강북구 삼양로19길 34313742557432220765P02000행정기관P02003동/읍/면/리사무소서울특별시 강북구 삼양로19길 34(미아동)11305101000081300031130510100108130003000001서울특별시 강북구 미아동 813-3번지
173P0109518서울압구정우편취급국서울특별시 강남구 압구정로 332315153547772283723P01000공공기관P01010우체국서울특별시 강남구 압구정로 332 동도상가 2층11680107000066000071168010700106600007009696서울특별시 강남구 신사동 660-7번지
174P0110647서울화곡4동우체국서울특별시 강서구 곰달래로 265299662548410355481P01000공공기관P01010우체국서울특별시 강서구 곰달래로 265 서울화곡4동우체국11500103000077800051150010300107780005019710서울특별시 강서구 화곡동 778-5번지
175P0100233서울강서경찰서서울특별시 강서구 화곡로 30829855255054615515P01000공공기관P01003경찰서서울 강서구 화곡로 30811500103000098000271150010300109800027014891서울특별시 강서구 화곡동 980-27번지
176P0110648김포공항우편취급국서울특별시 강서구 하늘길 112294920551168508825P01000공공기관P01010우체국서울특별시 강서구 하늘길 11211500108000137300051150010800101500000007416서울특별시 강서구 공항동 1373-5번지
177P0110974화곡3파출소서울특별시 강서구 강서로45다길 7129726254979814999P01000공공기관P01012파출소서울특별시 강서구 강서로45다길 71 (화곡동)11500103000101900001150010300110190000029859서울특별시 강서구 화곡동 1019번지
178P0109387서울수서동우체국서울특별시 강남구 광평로 303321051543325142113P01000공공기관P01010우체국서울특별시 강남구 광평로 30311680115000071800031168011500107180003001451서울특별시 강남구 수서동 718-3번지
179P0110957삼성2파출소서울특별시 강남구 선릉로 62631563754600924411P01000공공기관P01012파출소서울특별시 강남구 선릉로 626 (삼성동)11680105000003500001168010500100350000016012서울특별시 강남구 삼성동 35번지
180P0112701곰달래지구대서울특별시 강서구 곰달래로59길 12299718548462355476P01000공공기관P01014지구대서울특별시 강서구 곰달래로59길 12 (화곡동)11500103000077600041150010300107760004020809서울특별시 강서구 화곡동 776-4번지
181P0200197강서구청서울특별시 강서구 화곡로 30229854055048015515P02000행정기관P02002도/시/구/군청서울특별시 강서구 화곡로 302 (화곡동)11500103000098000161150010300109800016015086서울특별시 강서구 화곡동 980-16번지
182P0112848대치지구대서울특별시 강남구 삼성로 203317306544046271945P01000공공기관P01014지구대서울특별시 강남구 삼성로 203 (대치동)11680106000059800001168010600105980000000001서울특별시 강남구 대치동 598번지