Overview

Dataset statistics

Number of variables5
Number of observations240
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.5 KiB
Average record size in memory40.5 B

Variable types

Text3
Categorical2

Dataset

Description메인 키,명칭,행정 시,행정 구,행정 동
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-13079/S/1/datasetView.do

Alerts

행정 시 has constant value ""Constant
메인 키 has unique valuesUnique

Reproduction

Analysis started2023-12-11 10:11:28.393070
Analysis finished2023-12-11 10:11:28.874204
Duration0.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

메인 키
Text

UNIQUE 

Distinct240
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T19:11:29.106803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length14
Min length14

Characters and Unicode

Total characters3360
Distinct characters18
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique240 ?
Unique (%)100.0%

Sample

1st rowBE_LiST36-0206
2nd rowBE_LiST36-0207
3rd rowBE_LiST36-0208
4th rowBE_LiST36-0209
5th rowBE_LiST36-0210
ValueCountFrequency (%)
be_list36-0206 1
 
0.4%
be_list36-0207 1
 
0.4%
be_list36-0130 1
 
0.4%
be_list36-0118 1
 
0.4%
be_list36-0119 1
 
0.4%
be_list36-0120 1
 
0.4%
be_list36-0121 1
 
0.4%
be_list36-0122 1
 
0.4%
be_list36-0123 1
 
0.4%
be_list36-0124 1
 
0.4%
Other values (230) 230
95.8%
2023-12-11T19:11:29.566174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 392
11.7%
3 294
8.8%
6 284
 
8.5%
B 240
 
7.1%
T 240
 
7.1%
E 240
 
7.1%
- 240
 
7.1%
S 240
 
7.1%
i 240
 
7.1%
L 240
 
7.1%
Other values (8) 710
21.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1440
42.9%
Uppercase Letter 1200
35.7%
Dash Punctuation 240
 
7.1%
Lowercase Letter 240
 
7.1%
Connector Punctuation 240
 
7.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 392
27.2%
3 294
20.4%
6 284
19.7%
1 154
 
10.7%
2 95
 
6.6%
4 45
 
3.1%
7 44
 
3.1%
8 44
 
3.1%
9 44
 
3.1%
5 44
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
B 240
20.0%
T 240
20.0%
E 240
20.0%
S 240
20.0%
L 240
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 240
100.0%
Lowercase Letter
ValueCountFrequency (%)
i 240
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 240
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1920
57.1%
Latin 1440
42.9%

Most frequent character per script

Common
ValueCountFrequency (%)
0 392
20.4%
3 294
15.3%
6 284
14.8%
- 240
12.5%
_ 240
12.5%
1 154
 
8.0%
2 95
 
4.9%
4 45
 
2.3%
7 44
 
2.3%
8 44
 
2.3%
Other values (2) 88
 
4.6%
Latin
ValueCountFrequency (%)
B 240
16.7%
T 240
16.7%
E 240
16.7%
S 240
16.7%
i 240
16.7%
L 240
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3360
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 392
11.7%
3 294
8.8%
6 284
 
8.5%
B 240
 
7.1%
T 240
 
7.1%
E 240
 
7.1%
- 240
 
7.1%
S 240
 
7.1%
i 240
 
7.1%
L 240
 
7.1%
Other values (8) 710
21.1%

명칭
Text

Distinct235
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T19:11:29.947676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length36
Mean length19.645833
Min length3

Characters and Unicode

Total characters4715
Distinct characters63
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique230 ?
Unique (%)95.8%

Sample

1st rowMongchon Museum of History
2nd rowFolk Museum
3rd rowPark Eul Bok Embroidery Museum
4th rowOwl Antiques Museum
5th rowBukchon Traditional Culture Center
ValueCountFrequency (%)
gallery 65
 
9.0%
art 52
 
7.2%
museum 51
 
7.1%
center 37
 
5.1%
culture 25
 
3.5%
of 19
 
2.6%
theater 17
 
2.4%
hall 12
 
1.7%
broadcasting 9
 
1.2%
and 9
 
1.2%
Other values (326) 426
59.0%
2023-12-11T19:11:30.469423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
489
 
10.4%
e 462
 
9.8%
r 336
 
7.1%
a 335
 
7.1%
n 278
 
5.9%
o 276
 
5.9%
l 271
 
5.7%
u 263
 
5.6%
t 233
 
4.9%
i 157
 
3.3%
Other values (53) 1615
34.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3475
73.7%
Uppercase Letter 724
 
15.4%
Space Separator 489
 
10.4%
Other Punctuation 20
 
0.4%
Dash Punctuation 4
 
0.1%
Decimal Number 3
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C 102
14.1%
G 90
12.4%
M 81
11.2%
S 80
11.0%
A 76
10.5%
T 41
 
5.7%
H 38
 
5.2%
K 27
 
3.7%
B 27
 
3.7%
P 25
 
3.5%
Other values (16) 137
18.9%
Lowercase Letter
ValueCountFrequency (%)
e 462
13.3%
r 336
9.7%
a 335
9.6%
n 278
 
8.0%
o 276
 
7.9%
l 271
 
7.8%
u 263
 
7.6%
t 233
 
6.7%
i 157
 
4.5%
s 142
 
4.1%
Other values (15) 722
20.8%
Other Punctuation
ValueCountFrequency (%)
. 6
30.0%
& 5
25.0%
? 3
15.0%
' 2
 
10.0%
, 2
 
10.0%
1
 
5.0%
: 1
 
5.0%
Decimal Number
ValueCountFrequency (%)
1 1
33.3%
5 1
33.3%
3 1
33.3%
Space Separator
ValueCountFrequency (%)
489
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4199
89.1%
Common 516
 
10.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 462
 
11.0%
r 336
 
8.0%
a 335
 
8.0%
n 278
 
6.6%
o 276
 
6.6%
l 271
 
6.5%
u 263
 
6.3%
t 233
 
5.5%
i 157
 
3.7%
s 142
 
3.4%
Other values (41) 1446
34.4%
Common
ValueCountFrequency (%)
489
94.8%
. 6
 
1.2%
& 5
 
1.0%
- 4
 
0.8%
? 3
 
0.6%
' 2
 
0.4%
, 2
 
0.4%
1
 
0.2%
: 1
 
0.2%
1 1
 
0.2%
Other values (2) 2
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4714
> 99.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
489
 
10.4%
e 462
 
9.8%
r 336
 
7.1%
a 335
 
7.1%
n 278
 
5.9%
o 276
 
5.9%
l 271
 
5.7%
u 263
 
5.6%
t 233
 
4.9%
i 157
 
3.3%
Other values (52) 1614
34.2%
None
ValueCountFrequency (%)
1
100.0%

행정 시
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Seoul
240 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSeoul
2nd rowSeoul
3rd rowSeoul
4th rowSeoul
5th rowSeoul

Common Values

ValueCountFrequency (%)
Seoul 240
100.0%

Length

2023-12-11T19:11:30.617586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T19:11:30.723941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
seoul 240
100.0%

행정 구
Categorical

Distinct23
Distinct (%)9.6%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Jongno-gu
110 
Gangnam-gu
31 
Mapo-gu
13 
Jung-gu
12 
Yongsan-gu
 
10
Other values (18)
64 

Length

Max length15
Median length9
Mean length9.3958333
Min length7

Unique

Unique4 ?
Unique (%)1.7%

Sample

1st rowSongpa-gu
2nd rowJongno-gu
3rd rowGangbuk-gu
4th rowJongno-gu
5th rowJongno-gu

Common Values

ValueCountFrequency (%)
Jongno-gu 110
45.8%
Gangnam-gu 31
 
12.9%
Mapo-gu 13
 
5.4%
Jung-gu 12
 
5.0%
Yongsan-gu 10
 
4.2%
Seocho-gu 9
 
3.8%
Songpa-gu 8
 
3.3%
Yeongdeungpo-gu 6
 
2.5%
Seongbuk-gu 6
 
2.5%
Yangcheon-gu 5
 
2.1%
Other values (13) 30
 
12.5%

Length

2023-12-11T19:11:30.862821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
jongno-gu 110
45.8%
gangnam-gu 31
 
12.9%
mapo-gu 13
 
5.4%
jung-gu 12
 
5.0%
yongsan-gu 10
 
4.2%
seocho-gu 9
 
3.8%
songpa-gu 8
 
3.3%
yeongdeungpo-gu 6
 
2.5%
seongbuk-gu 6
 
2.5%
yangcheon-gu 5
 
2.1%
Other values (13) 30
 
12.5%
Distinct91
Distinct (%)37.9%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T19:11:31.156578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length13.1125
Min length7

Characters and Unicode

Total characters3147
Distinct characters45
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique63 ?
Unique (%)26.2%

Sample

1st rowOryun-dong
2nd rowCheongunhyoja-dong
3rd rowUi-dong
4th rowSamcheong-dong
5th rowSamcheong-dong
ValueCountFrequency (%)
jongno1.2.3.4ga-dong 23
 
9.6%
samcheong-dong 20
 
8.3%
pyeongchang-dong 18
 
7.5%
gahoe-dong 10
 
4.2%
ihwa-dong 10
 
4.2%
sajik-dong 9
 
3.8%
cheongunhyoja-dong 8
 
3.3%
cheongdam-dong 8
 
3.3%
hyehwa-dong 7
 
2.9%
sinsa-dong 6
 
2.5%
Other values (81) 121
50.4%
2023-12-11T19:11:31.644063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
n 483
15.3%
o 450
14.3%
g 444
14.1%
d 253
 
8.0%
- 240
 
7.6%
a 197
 
6.3%
e 145
 
4.6%
h 107
 
3.4%
. 71
 
2.3%
S 68
 
2.2%
Other values (35) 689
21.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2449
77.8%
Dash Punctuation 240
 
7.6%
Uppercase Letter 240
 
7.6%
Decimal Number 147
 
4.7%
Other Punctuation 71
 
2.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 483
19.7%
o 450
18.4%
g 444
18.1%
d 253
10.3%
a 197
8.0%
e 145
 
5.9%
h 107
 
4.4%
y 56
 
2.3%
m 51
 
2.1%
c 48
 
2.0%
Other values (11) 215
8.8%
Uppercase Letter
ValueCountFrequency (%)
S 68
28.3%
J 31
12.9%
C 20
 
8.3%
P 19
 
7.9%
H 16
 
6.7%
G 15
 
6.2%
I 14
 
5.8%
Y 13
 
5.4%
D 11
 
4.6%
M 10
 
4.2%
Other values (6) 23
 
9.6%
Decimal Number
ValueCountFrequency (%)
2 49
33.3%
1 40
27.2%
3 28
19.0%
4 26
17.7%
5 2
 
1.4%
6 2
 
1.4%
Dash Punctuation
ValueCountFrequency (%)
- 240
100.0%
Other Punctuation
ValueCountFrequency (%)
. 71
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2689
85.4%
Common 458
 
14.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 483
18.0%
o 450
16.7%
g 444
16.5%
d 253
9.4%
a 197
7.3%
e 145
 
5.4%
h 107
 
4.0%
S 68
 
2.5%
y 56
 
2.1%
m 51
 
1.9%
Other values (27) 435
16.2%
Common
ValueCountFrequency (%)
- 240
52.4%
. 71
 
15.5%
2 49
 
10.7%
1 40
 
8.7%
3 28
 
6.1%
4 26
 
5.7%
5 2
 
0.4%
6 2
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3147
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 483
15.3%
o 450
14.3%
g 444
14.1%
d 253
 
8.0%
- 240
 
7.6%
a 197
 
6.3%
e 145
 
4.6%
h 107
 
3.4%
. 71
 
2.3%
S 68
 
2.2%
Other values (35) 689
21.9%

Correlations

2023-12-11T19:11:31.780648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정 구행정 동
행정 구1.0001.000
행정 동1.0001.000

Missing values

2023-12-11T19:11:28.705622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T19:11:28.824291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

메인 키명칭행정 시행정 구행정 동
0BE_LiST36-0206Mongchon Museum of HistorySeoulSongpa-guOryun-dong
1BE_LiST36-0207Folk MuseumSeoulJongno-guCheongunhyoja-dong
2BE_LiST36-0208Park Eul Bok Embroidery MuseumSeoulGangbuk-guUi-dong
3BE_LiST36-0209Owl Antiques MuseumSeoulJongno-guSamcheong-dong
4BE_LiST36-0210Bukchon Traditional Culture CenterSeoulJongno-guSamcheong-dong
5BE_LiST36-0211Samsung Museum of PublishingSeoulJongno-guPyeongchang-dong
6BE_LiST36-0212Sangmyeong University MuseumSeoulJongno-guPyeongchang-dong
7BE_LiST36-0213Life Science MuseumSeoulYangcheon-guMok1-dong
8BE_LiST36-0214The Seodaemun Museum of Natural HistorySeoulSeodaemun-guYeonhui-dong
9BE_LiST36-0215Seodaemun Prison History HallSeoulSeodaemun-guCheonyeon-dong
메인 키명칭행정 시행정 구행정 동
230BE_LiST36-0196Gangbuk Culture & Art CenterSeoulGangbuk-guInsu-dong
231BE_LiST36-0197Olympic CenterSeoulSongpa-guOryun-dong
232BE_LiST36-0198Yongsan Art CenterSeoulYongsan-guItaewon1-dong
233BE_LiST36-0199Police Heritage MuseumSeoulJongno-guSajik-dong
234BE_LiST36-0200The National Palace Museum of KoreaSeoulJongno-guCheongunhyoja-dong
235BE_LiST36-0201The National Museum of KoreaSeoulYongsan-guSeobinggo-dong
236BE_LiST36-0202National Museum of Korean Contemporary HistorySeoulJongno-guSajik-dong
237BE_LiST36-0203Sword MuseumSeoulJongno-guPyeongchang-dong
238BE_LiST36-0204The Memorial to the Patriot Yun BonggilSeoulSeocho-guYangjae2-dong
239BE_LiST36-0205Myeongin MuseumSeoulJongno-guGahoe-dong