Overview

Dataset statistics

Number of variables: 5
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 478.5 KiB
Average record size in memory: 49.0 B

Variable types

Numeric: 1
Text: 2
Categorical: 2

Dataset

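Description: 순번 (sequence number), ID, 도시계획코드 (urban planning code), 분류명 (category name), 라벨명 (label name)
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15528/S/1/datasetView.do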

Alerts

분류명 is highly overall correlated with 도시계획코드 (High correlation)
도시계획코드 is highly overall correlated with 분류명 (High correlation)
순번 has unique values (Unique)
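
The high-correlation alert can be illustrated with Cramér's V, a common association measure for two categorical columns. The sketch below is a minimal, uncorrected version written for illustration; it is not necessarily the exact estimator the profiling tool uses. The function name `cramers_v` is hypothetical, and the code-to-name pairing is an assumption inferred from the identical counts reported for the two columns.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    rows = Counter(xs)
    cols = Counter(ys)
    chi2 = 0.0
    for x, row_total in rows.items():
        for y, col_total in cols.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(rows), len(cols)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Assumed one-to-one pairing, inferred from matching counts in the report.
pairs = (
    [("ZON234", "시설_보육시설")] * 6468
    + [("ZON224", "시설_도서관")] * 948
    + [("ZON244", "시설_주차장")] * 771
)
codes = [code for code, _ in pairs]
names = [name for _, name in pairs]
v = cramers_v(codes, names)
print(round(v, 3))
```

When every code maps to exactly one category name, the statistic reaches its maximum of 1, which suggests one of the two columns is redundant for modelling purposes.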

Reproduction

Analysis started: 2024-05-03 22:52:32.496782
Analysis finished: 2024-05-03 22:52:35.163232
Duration: 2.67 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 363483.94
Minimum: 358331
Maximum: 368567
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 358331
5-th percentile: 358900.95
Q1: 360932.75
Median: 363492.5
Q3: 366032.25
95-th percentile: 368061.05
Maximum: 368567
Range: 10236
Interquartile range (IQR): 5099.5

Descriptive statistics

Standard deviation: 2939.6246
Coefficient of variation (CV): 0.0080873574
Kurtosis: -1.2022323
Mean: 363483.94
Median Absolute Deviation (MAD): 2550
Skewness: -0.0024162463
Sum: 3.6348394 × 10^9
Variance: 8641392.7
Monotonicity: Not monotonic
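
Several of the figures above are derived from others, so they can be cross-checked with simple arithmetic. The sketch below recomputes them from the rounded values printed in this report; the variable names are illustrative, and small differences in the last digits are expected because the inputs are already rounded.

```python
# Values copied from the report (rounded as displayed).
minimum, maximum = 358331, 368567
q1, q3 = 360932.75, 366032.25
mean, std = 363483.94, 2939.6246
n = 10000

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
total = mean * n                  # Sum

print(value_range, iqr)
print(f"{cv:.10f} {variance:.1f} {total:.4e}")
```

The very small coefficient of variation (under 1%) reflects that 순번 is a run of nearly consecutive sequence numbers clustered tightly around its mean.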
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
359129 | 1 | < 0.1%
359262 | 1 | < 0.1%
361882 | 1 | < 0.1%
361525 | 1 | < 0.1%
358663 | 1 | < 0.1%
367714 | 1 | < 0.1%
363290 | 1 | < 0.1%
366254 | 1 | < 0.1%
367030 | 1 | < 0.1%
359563 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
358331 | 1 | < 0.1%
358332 | 1 | < 0.1%
358333 | 1 | < 0.1%
358334 | 1 | < 0.1%
358335 | 1 | < 0.1%
358336 | 1 | < 0.1%
358401 | 1 | < 0.1%
358402 | 1 | < 0.1%
358403 | 1 | < 0.1%
358404 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
368567 | 1 | < 0.1%
368566 | 1 | < 0.1%
368565 | 1 | < 0.1%
368564 | 1 | < 0.1%
368563 | 1 | < 0.1%
368562 | 1 | < 0.1%
368561 | 1 | < 0.1%
368560 | 1 | < 0.1%
368559 | 1 | < 0.1%
368558 | 1 | < 0.1%

ID
Text

Distinct: 9743
Distinct (%): 97.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 120000
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
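
As a rough illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module to count general categories in one ID value taken from this report. The helper `category_counts` and the `CATEGORY_NAMES` mapping are illustrative names, not part of the profiling tool.

```python
import unicodedata
from collections import Counter

# Long names for the three general categories that occur in the ID column.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Pc": "Connector Punctuation",
}

def category_counts(text):
    """Count characters of `text` by Unicode general category."""
    counts = Counter()
    for ch in text:
        code = unicodedata.category(ch)
        counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

counts = category_counts("생활서비스시설_1691")
print(dict(counts))
```

Each ID has 7 Hangul letters, 1 underscore, and 4 digits; multiplied by 10000 rows this gives the 70000 / 10000 / 40000 split reported for the column.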

Unique

Unique: 9486
Unique (%): 94.9%
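
The report distinguishes "Distinct" (number of different values) from "Unique" (values that occur exactly once). A minimal sketch of that distinction, using a toy list with made-up IDs rather than the real column:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Hypothetical toy data: two IDs are repeated, two occur once.
toy_ids = ["a_0001", "a_0002", "a_0002", "a_0003", "a_0003", "a_0004"]
distinct, unique = distinct_and_unique(toy_ids)
print(distinct, unique)

# Applying the same logic to the figures in the report:
repeated_values = 9743 - 9486      # distinct IDs that occur more than once
rows_in_repeats = 10000 - 9486     # rows taken up by those IDs
print(repeated_values, rows_in_repeats, rows_in_repeats / repeated_values)
```

With the reported figures, 257 IDs account for 514 rows, so each repeated ID appears exactly twice, so ID is not a strict key for this sample.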

Sample

1st row: 생활서비스시설_1691
2nd row: 생활서비스시설_7473
3rd row: 생활서비스시설_1804
4th row: 생활서비스시설_7810
5th row: 생활서비스시설_4436

Value | Count | Frequency (%)
생활서비스시설_0064 | 2 | < 0.1%
생활서비스시설_0120 | 2 | < 0.1%
생활서비스시설_1444 | 2 | < 0.1%
생활서비스시설_1398 | 2 | < 0.1%
생활서비스시설_0161 | 2 | < 0.1%
생활서비스시설_0109 | 2 | < 0.1%
생활서비스시설_0051 | 2 | < 0.1%
생활서비스시설_0152 | 2 | < 0.1%
생활서비스시설_1373 | 2 | < 0.1%
생활서비스시설_0107 | 2 | < 0.1%
Other values (9733) | 9980 | 99.8%

Most occurring characters

Value | Count | Frequency (%)
생 | 10000 | 8.3%
활 | 10000 | 8.3%
서 | 10000 | 8.3%
비 | 10000 | 8.3%
스 | 10000 | 8.3%
시 | 10000 | 8.3%
_ | 10000 | 8.3%
설 | 10000 | 8.3%
0 | 4242 | 3.5%
1 | 4137 | 3.4%
Other values (8) | 31621 | 26.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 70000 | 58.3%
Decimal Number | 40000 | 33.3%
Connector Punctuation | 10000 | 8.3%

Most frequent character per category

Decimal Number

Value | Count | Frequency (%)
0 | 4242 | 10.6%
1 | 4137 | 10.3%
3 | 3992 | 10.0%
4 | 3990 | 10.0%
7 | 3967 | 9.9%
2 | 3966 | 9.9%
6 | 3965 | 9.9%
9 | 3946 | 9.9%
5 | 3936 | 9.8%
8 | 3859 | 9.6%

Other Letter

Value | Count | Frequency (%)
생 | 10000 | 14.3%
활 | 10000 | 14.3%
서 | 10000 | 14.3%
비 | 10000 | 14.3%
스 | 10000 | 14.3%
시 | 10000 | 14.3%
설 | 10000 | 14.3%

Connector Punctuation

Value | Count | Frequency (%)
_ | 10000 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 70000 | 58.3%
Common | 50000 | 41.7%

Most frequent character per script

Common

Value | Count | Frequency (%)
_ | 10000 | 20.0%
0 | 4242 | 8.5%
1 | 4137 | 8.3%
3 | 3992 | 8.0%
4 | 3990 | 8.0%
7 | 3967 | 7.9%
2 | 3966 | 7.9%
6 | 3965 | 7.9%
9 | 3946 | 7.9%
5 | 3936 | 7.9%

Hangul

Value | Count | Frequency (%)
생 | 10000 | 14.3%
활 | 10000 | 14.3%
서 | 10000 | 14.3%
비 | 10000 | 14.3%
스 | 10000 | 14.3%
시 | 10000 | 14.3%
설 | 10000 | 14.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 70000 | 58.3%
ASCII | 50000 | 41.7%

Most frequent character per block

Hangul

Value | Count | Frequency (%)
생 | 10000 | 14.3%
활 | 10000 | 14.3%
서 | 10000 | 14.3%
비 | 10000 | 14.3%
스 | 10000 | 14.3%
시 | 10000 | 14.3%
설 | 10000 | 14.3%

ASCII

Value | Count | Frequency (%)
_ | 10000 | 20.0%
0 | 4242 | 8.5%
1 | 4137 | 8.3%
3 | 3992 | 8.0%
4 | 3990 | 8.0%
7 | 3967 | 7.9%
2 | 3966 | 7.9%
6 | 3965 | 7.9%
9 | 3946 | 7.9%
5 | 3936 | 7.9%

도시계획코드
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

ZON234: 6468
ZON224: 948
ZON244: 771
ZON252: 580
ZON240: 491
Other values (5): 742

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ZON234
2nd row: ZON234
3rd row: ZON234
4th row: ZON234
5th row: ZON234

Common Values

Value | Count | Frequency (%)
ZON234 | 6468 | 64.7%
ZON224 | 948 | 9.5%
ZON244 | 771 | 7.7%
ZON252 | 580 | 5.8%
ZON240 | 491 | 4.9%
ZON212 | 418 | 4.2%
ZON228 | 116 | 1.2%
ZON248 | 93 | 0.9%
ZON220 | 63 | 0.6%
ZON232 | 52 | 0.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
zon234 | 6468 | 64.7%
zon224 | 948 | 9.5%
zon244 | 771 | 7.7%
zon252 | 580 | 5.8%
zon240 | 491 | 4.9%
zon212 | 418 | 4.2%
zon228 | 116 | 1.2%
zon248 | 93 | 0.9%
zon220 | 63 | 0.6%
zon232 | 52 | 0.5%

분류명
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

시설_보육시설: 6468
시설_도서관: 948
시설_주차장: 771
시설_청소년아동: 580
시설_장애인복지: 491
Other values (5): 742

Length

Max length: 9
Median length: 7
Mean length: 7.0136
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 시설_보육시설
2nd row: 시설_보육시설
3rd row: 시설_보육시설
4th row: 시설_보육시설
5th row: 시설_보육시설

Common Values

Value | Count | Frequency (%)
시설_보육시설 | 6468 | 64.7%
시설_도서관 | 948 | 9.5%
시설_주차장 | 771 | 7.7%
시설_청소년아동 | 580 | 5.8%
시설_장애인복지 | 491 | 4.9%
시설_공공체육시설 | 418 | 4.2%
시설_문화복지 | 116 | 1.2%
시설_지역주민 | 93 | 0.9%
시설_노인여가 | 63 | 0.6%
시설_보건소 | 52 | 0.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
시설_보육시설 | 6468 | 64.7%
시설_도서관 | 948 | 9.5%
시설_주차장 | 771 | 7.7%
시설_청소년아동 | 580 | 5.8%
시설_장애인복지 | 491 | 4.9%
시설_공공체육시설 | 418 | 4.2%
시설_문화복지 | 116 | 1.2%
시설_지역주민 | 93 | 0.9%
시설_노인여가 | 63 | 0.6%
시설_보건소 | 52 | 0.5%

라벨명
Text

Distinct: 7514
Distinct (%): 75.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 29
Median length: 26
Mean length: 8.0598
Min length: 2

Characters and Unicode

Total characters: 80598
Distinct characters: 767
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6546
Unique (%): 65.5%

Sample

1st row: 은광 어린이집
2nd row: 고은별어린이집
3rd row: 파크빌티움아이어린이집
4th row: 하람어린이집
5th row: 쌍용어린이집

Value | Count | Frequency (%)
어린이집 | 395 | 3.4%
구립 | 121 | 1.0%
작은도서관 | 109 | 0.9%
공영주차장(구 | 87 | 0.8%
노상주차장(구 | 53 | 0.5%
새마을문고 | 45 | 0.4%
마을문고 | 39 | 0.3%
노상공영주차장(구 | 33 | 0.3%
도서관 | 25 | 0.2%
박물관 | 21 | 0.2%
Other values (7650) | 10607 | 92.0%

Most occurring characters

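Value | Count | Frequency (%)
(not captured) | 7284 | 9.0%
(not captured) | 6623 | 8.2%
(not captured) | 6541 | 8.1%
(not captured) | 6510 | 8.1%
(space) | 1548 | 1.9%
(not captured) | 1512 | 1.9%
(not captured) | 1327 | 1.6%
(not captured) | 1259 | 1.6%
(not captured) | 1078 | 1.3%
(not captured) | 1041 | 1.3%
Other values (757) | 45875 | 56.9%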

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 75981 | 94.3%
Space Separator | 1548 | 1.9%
Open Punctuation | 911 | 1.1%
Close Punctuation | 910 | 1.1%
Decimal Number | 902 | 1.1%
Uppercase Letter | 167 | 0.2%
Lowercase Letter | 99 | 0.1%
Dash Punctuation | 44 | 0.1%
Other Punctuation | 23 | < 0.1%
Math Symbol | 13 | < 0.1%

Most frequent character per category

Other Letter

Value | Count | Frequency (%)
(not captured) | 7284 | 9.6%
(not captured) | 6623 | 8.7%
(not captured) | 6541 | 8.6%
(not captured) | 6510 | 8.6%
(not captured) | 1512 | 2.0%
(not captured) | 1327 | 1.7%
(not captured) | 1259 | 1.7%
(not captured) | 1078 | 1.4%
(not captured) | 1041 | 1.4%
(not captured) | 937 | 1.2%
Other values (694) | 41869 | 55.1%

Uppercase Letter

Value | Count | Frequency (%)
S | 22 | 13.2%
A | 21 | 12.6%
B | 18 | 10.8%
C | 16 | 9.6%
Y | 11 | 6.6%
L | 11 | 6.6%
K | 11 | 6.6%
M | 10 | 6.0%
G | 10 | 6.0%
E | 8 | 4.8%
Other values (11) | 29 | 17.4%

Lowercase Letter

Value | Count | Frequency (%)
i | 32 | 32.3%
e | 16 | 16.2%
s | 8 | 8.1%
k | 5 | 5.1%
a | 5 | 5.1%
t | 4 | 4.0%
c | 3 | 3.0%
o | 3 | 3.0%
d | 3 | 3.0%
r | 3 | 3.0%
Other values (10) | 17 | 17.2%

Decimal Number

Value | Count | Frequency (%)
1 | 269 | 29.8%
2 | 227 | 25.2%
3 | 139 | 15.4%
4 | 84 | 9.3%
5 | 61 | 6.8%
6 | 36 | 4.0%
7 | 35 | 3.9%
8 | 22 | 2.4%
9 | 15 | 1.7%
0 | 14 | 1.6%

Other Punctuation

Value | Count | Frequency (%)
? | 13 | 56.5%
. | 4 | 17.4%
, | 3 | 13.0%
& | 2 | 8.7%
/ | 1 | 4.3%

Open Punctuation

Value | Count | Frequency (%)
( | 910 | 99.9%
[ | 1 | 0.1%

Close Punctuation

Value | Count | Frequency (%)
) | 909 | 99.9%
] | 1 | 0.1%

Space Separator

Value | Count | Frequency (%)
(space) | 1548 | 100.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 44 | 100.0%

Math Symbol

Value | Count | Frequency (%)
~ | 13 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 75977 | 94.3%
Common | 4351 | 5.4%
Latin | 266 | 0.3%
Han | 4 | < 0.1%

Most frequent character per script

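Hangul

Value | Count | Frequency (%)
(not captured) | 7284 | 9.6%
(not captured) | 6623 | 8.7%
(not captured) | 6541 | 8.6%
(not captured) | 6510 | 8.6%
(not captured) | 1512 | 2.0%
(not captured) | 1327 | 1.7%
(not captured) | 1259 | 1.7%
(not captured) | 1078 | 1.4%
(not captured) | 1041 | 1.4%
(not captured) | 937 | 1.2%
Other values (693) | 41865 | 55.1%

Latin

Value | Count | Frequency (%)
i | 32 | 12.0%
S | 22 | 8.3%
A | 21 | 7.9%
B | 18 | 6.8%
C | 16 | 6.0%
e | 16 | 6.0%
Y | 11 | 4.1%
L | 11 | 4.1%
K | 11 | 4.1%
M | 10 | 3.8%
Other values (31) | 98 | 36.8%

Common

Value | Count | Frequency (%)
(space) | 1548 | 35.6%
( | 910 | 20.9%
) | 909 | 20.9%
1 | 269 | 6.2%
2 | 227 | 5.2%
3 | 139 | 3.2%
4 | 84 | 1.9%
5 | 61 | 1.4%
- | 44 | 1.0%
6 | 36 | 0.8%
Other values (12) | 124 | 2.8%

Han

Value | Count | Frequency (%)
(not captured) | 4 | 100.0%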

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 75977 | 94.3%
ASCII | 4617 | 5.7%
CJK | 4 | < 0.1%

Most frequent character per block

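Hangul

Value | Count | Frequency (%)
(not captured) | 7284 | 9.6%
(not captured) | 6623 | 8.7%
(not captured) | 6541 | 8.6%
(not captured) | 6510 | 8.6%
(not captured) | 1512 | 2.0%
(not captured) | 1327 | 1.7%
(not captured) | 1259 | 1.7%
(not captured) | 1078 | 1.4%
(not captured) | 1041 | 1.4%
(not captured) | 937 | 1.2%
Other values (693) | 41865 | 55.1%

ASCII

Value | Count | Frequency (%)
(space) | 1548 | 33.5%
( | 910 | 19.7%
) | 909 | 19.7%
1 | 269 | 5.8%
2 | 227 | 4.9%
3 | 139 | 3.0%
4 | 84 | 1.8%
5 | 61 | 1.3%
- | 44 | 1.0%
6 | 36 | 0.8%
Other values (53) | 390 | 8.4%

CJK

Value | Count | Frequency (%)
(not captured) | 4 | 100.0%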

Interactions


Correlations

Variable | 순번 | 도시계획코드 | 분류명
순번 | 1.000 | 0.647 | 0.647
도시계획코드 | 0.647 | 1.000 | 1.000
분류명 | 0.647 | 1.000 | 1.000

Variable | 분류명 | 도시계획코드
분류명 | 1.000 | 1.000
도시계획코드 | 1.000 | 1.000

Variable | 순번 | 도시계획코드 | 분류명
순번 | 1.000 | 0.248 | 0.248
도시계획코드 | 0.248 | 1.000 | 1.000
분류명 | 0.248 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 순번 | ID | 도시계획코드 | 분류명 | 라벨명
3826 | 359129 | 생활서비스시설_1691 | ZON234 | 시설_보육시설 | 은광 어린이집
9828 | 360802 | 생활서비스시설_7473 | ZON234 | 시설_보육시설 | 고은별어린이집
3206 | 359022 | 생활서비스시설_1804 | ZON234 | 시설_보육시설 | 파크빌티움아이어린이집
8792 | 363033 | 생활서비스시설_7810 | ZON234 | 시설_보육시설 | 하람어린이집
8536 | 368471 | 생활서비스시설_4436 | ZON234 | 시설_보육시설 | 쌍용어린이집
6711 | 359951 | 생활서비스시설_7592 | ZON234 | 시설_보육시설 | 송파제일어린이집
7434 | 362706 | 생활서비스시설_9877 | ZON224 | 시설_도서관 | 네이처힐5단지 작은도서관
9921 | 363444 | 생활서비스시설_3151 | ZON234 | 시설_보육시설 | 예나동산 어린이집
8658 | 360355 | 생활서비스시설_9180 | ZON224 | 시설_도서관 | 하늘씨앗문고
3081 | 367129 | 생활서비스시설_5825 | ZON234 | 시설_보육시설 | 태양어린이집

Last rows

Index | 순번 | ID | 도시계획코드 | 분류명 | 라벨명
7708 | 365557 | 생활서비스시설_3986 | ZON234 | 시설_보육시설 | 해솔어린이집
5410 | 362236 | 생활서비스시설_8686 | ZON212 | 시설_공공체육시설 | 대치유수지체육공원
5145 | 367646 | 생활서비스시설_3128 | ZON234 | 시설_보육시설 | 럭키어린이집
7808 | 368255 | 생활서비스시설_9772 | ZON224 | 시설_도서관 | 은평뉴타운 상림마을14단지마을문고
9968 | 363491 | 생활서비스시설_4861 | ZON234 | 시설_보육시설 | 보금자리어린이집
7427 | 362699 | 생활서비스시설_6357 | ZON234 | 시설_보육시설 | 킨더어린이집
6295 | 365176 | 생활서비스시설_9344 | ZON224 | 시설_도서관 | 꿈꾸는 도서관
1645 | 366723 | 생활서비스시설_5366 | ZON234 | 시설_보육시설 | 레인보우어린이집
5120 | 367621 | 생활서비스시설_3058 | ZON234 | 시설_보육시설 | 노아어린이집
3970 | 361831 | 생활서비스시설_1207 | ZON252 | 시설_청소년아동 | 기쁨지역아동센터