Overview

Dataset statistics

Number of variables15
Number of observations1953
Missing cells4269
Missing cells (%)14.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory230.9 KiB
Average record size in memory121.1 B

Variable types

Numeric1
Categorical2
Text12

Dataset

Description고유번호,언어,상호명,콘텐츠URL,주소,신주소,전화번호,팩스번호,웹사이트,운영시간,운영요일,휴무일,교통정보,태그,장애인편의시설
Author서울관광재단
URLhttps://data.seoul.go.kr/dataList/OA-21050/S/1/datasetView.do

Alerts

팩스번호 has a high cardinality: 51 distinct valuesHigh cardinality
고유번호 is highly overall correlated with 팩스번호High correlation
팩스번호 is highly overall correlated with 고유번호High correlation
팩스번호 is highly imbalanced (89.7%)Imbalance
전화번호 has 113 (5.8%) missing valuesMissing
웹사이트 has 614 (31.4%) missing valuesMissing
운영시간 has 259 (13.3%) missing valuesMissing
운영요일 has 916 (46.9%) missing valuesMissing
휴무일 has 555 (28.4%) missing valuesMissing
교통정보 has 79 (4.0%) missing valuesMissing
장애인편의시설 has 1733 (88.7%) missing valuesMissing
고유번호 has unique valuesUnique
콘텐츠URL has unique valuesUnique

Reproduction

Analysis started2024-05-11 05:30:58.386272
Analysis finished2024-05-11 05:31:05.511557
Duration7.13 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

고유번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1953
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23097.266
Minimum36
Maximum45594
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.3 KiB
2024-05-11T14:31:05.691775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36
5-th percentile1307.2
Q16253
median24734
Q334888
95-th percentile44067.4
Maximum45594
Range45558
Interquartile range (IQR)28635

Descriptive statistics

Standard deviation14926.848
Coefficient of variation (CV)0.64626036
Kurtosis-1.2346261
Mean23097.266
Median Absolute Deviation (MAD)13413
Skewness-0.11637546
Sum45108961
Variance2.2281078 × 108
MonotonicityNot monotonic
2024-05-11T14:31:05.945546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
45520 1
 
0.1%
1417 1
 
0.1%
2731 1
 
0.1%
44048 1
 
0.1%
2248 1
 
0.1%
22070 1
 
0.1%
27131 1
 
0.1%
6995 1
 
0.1%
24806 1
 
0.1%
28356 1
 
0.1%
Other values (1943) 1943
99.5%
ValueCountFrequency (%)
36 1
0.1%
37 1
0.1%
72 1
0.1%
73 1
0.1%
74 1
0.1%
75 1
0.1%
76 1
0.1%
77 1
0.1%
78 1
0.1%
79 1
0.1%
ValueCountFrequency (%)
45594 1
0.1%
45576 1
0.1%
45575 1
0.1%
45574 1
0.1%
45573 1
0.1%
45568 1
0.1%
45567 1
0.1%
45564 1
0.1%
45563 1
0.1%
45562 1
0.1%

언어
Categorical

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size15.4 KiB
zh-TW
398 
en
395 
ja
394 
zh-CN
385 
ko
381 

Length

Max length5
Median length2
Mean length3.202765
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowen
2nd rowen
3rd rowen
4th rowen
5th rowen

Common Values

ValueCountFrequency (%)
zh-TW 398
20.4%
en 395
20.2%
ja 394
20.2%
zh-CN 385
19.7%
ko 381
19.5%

Length

2024-05-11T14:31:06.169537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T14:31:06.521726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
zh-tw 398
20.4%
en 395
20.2%
ja 394
20.2%
zh-cn 385
19.7%
ko 381
19.5%
Distinct1869
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size15.4 KiB
2024-05-11T14:31:07.123412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length74
Median length58
Mean length11.168971
Min length2

Characters and Unicode

Total characters21813
Distinct characters1231
Distinct categories12 ?
Distinct scripts6 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1795 ?
Unique (%)91.9%

Sample

1st rowBaekyang Laundry
2nd rowChangsin-Sungin Quarry Observatory
3rd rowChangsin-dong's Cliff Village
4th rowChoong Ang High School
5th rowChoong Ang Store
ValueCountFrequency (%)
museum 80
 
2.6%
seoul 39
 
1.3%
of 38
 
1.2%
center 36
 
1.2%
art 30
 
1.0%
gallery 28
 
0.9%
information 20
 
0.7%
tourist 18
 
0.6%
national 18
 
0.6%
korea 13
 
0.4%
Other values (2190) 2733
89.5%
2024-05-11T14:31:07.984711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1149
 
5.3%
? 1132
 
5.2%
e 825
 
3.8%
o 748
 
3.4%
a 740
 
3.4%
n 739
 
3.4%
r 493
 
2.3%
u 485
 
2.2%
i 460
 
2.1%
419
 
1.9%
Other values (1221) 14623
67.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9528
43.7%
Lowercase Letter 7327
33.6%
Uppercase Letter 1704
 
7.8%
Other Punctuation 1220
 
5.6%
Space Separator 1150
 
5.3%
Close Punctuation 351
 
1.6%
Open Punctuation 351
 
1.6%
Decimal Number 142
 
0.7%
Dash Punctuation 29
 
0.1%
Control 6
 
< 0.1%
Other values (2) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
419
 
4.4%
238
 
2.5%
162
 
1.7%
161
 
1.7%
136
 
1.4%
96
 
1.0%
93
 
1.0%
89
 
0.9%
84
 
0.9%
84
 
0.9%
Other values (1136) 7966
83.6%
Lowercase Letter
ValueCountFrequency (%)
e 825
11.3%
o 748
10.2%
a 740
10.1%
n 739
10.1%
r 493
 
6.7%
u 485
 
6.6%
i 460
 
6.3%
l 391
 
5.3%
t 373
 
5.1%
g 334
 
4.6%
Other values (16) 1739
23.7%
Uppercase Letter
ValueCountFrequency (%)
S 223
13.1%
M 173
 
10.2%
C 151
 
8.9%
A 126
 
7.4%
H 100
 
5.9%
G 96
 
5.6%
T 91
 
5.3%
B 77
 
4.5%
P 74
 
4.3%
I 66
 
3.9%
Other values (16) 527
30.9%
Other Punctuation
ValueCountFrequency (%)
? 1132
92.8%
· 32
 
2.6%
' 16
 
1.3%
& 15
 
1.2%
. 13
 
1.1%
3
 
0.2%
2
 
0.2%
: 2
 
0.2%
, 2
 
0.2%
1
 
0.1%
Other values (2) 2
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 29
20.4%
3 18
12.7%
7 18
12.7%
8 17
12.0%
2 15
10.6%
9 15
10.6%
6 15
10.6%
4 9
 
6.3%
0 6
 
4.2%
Close Punctuation
ValueCountFrequency (%)
) 333
94.9%
16
 
4.6%
2
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 333
94.9%
16
 
4.6%
2
 
0.6%
Space Separator
ValueCountFrequency (%)
1149
99.9%
  1
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%
Control
ValueCountFrequency (%)
6
100.0%
Final Punctuation
ValueCountFrequency (%)
4
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 9031
41.4%
Han 4956
22.7%
Common 3254
 
14.9%
Hangul 2392
 
11.0%
Katakana 2143
 
9.8%
Hiragana 37
 
0.2%

Most frequent character per script

Han
ValueCountFrequency (%)
238
 
4.8%
162
 
3.3%
161
 
3.2%
93
 
1.9%
84
 
1.7%
64
 
1.3%
62
 
1.3%
62
 
1.3%
60
 
1.2%
55
 
1.1%
Other values (652) 3915
79.0%
Hangul
ValueCountFrequency (%)
136
 
5.7%
63
 
2.6%
52
 
2.2%
51
 
2.1%
51
 
2.1%
45
 
1.9%
44
 
1.8%
36
 
1.5%
34
 
1.4%
32
 
1.3%
Other values (377) 1848
77.3%
Katakana
ValueCountFrequency (%)
419
19.6%
96
 
4.5%
89
 
4.2%
84
 
3.9%
84
 
3.9%
80
 
3.7%
79
 
3.7%
70
 
3.3%
48
 
2.2%
47
 
2.2%
Other values (66) 1047
48.9%
Latin
ValueCountFrequency (%)
e 825
 
9.1%
o 748
 
8.3%
a 740
 
8.2%
n 739
 
8.2%
r 493
 
5.5%
u 485
 
5.4%
i 460
 
5.1%
l 391
 
4.3%
t 373
 
4.1%
g 334
 
3.7%
Other values (42) 3443
38.1%
Common
ValueCountFrequency (%)
1149
35.3%
? 1132
34.8%
) 333
 
10.2%
( 333
 
10.2%
· 32
 
1.0%
- 29
 
0.9%
1 29
 
0.9%
3 18
 
0.6%
7 18
 
0.6%
8 17
 
0.5%
Other values (23) 164
 
5.0%
Hiragana
ValueCountFrequency (%)
7
18.9%
6
16.2%
2
 
5.4%
2
 
5.4%
2
 
5.4%
2
 
5.4%
2
 
5.4%
1
 
2.7%
1
 
2.7%
1
 
2.7%
Other values (11) 11
29.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12204
55.9%
CJK 4955
22.7%
Hangul 2392
 
11.0%
Katakana 2143
 
9.8%
None 76
 
0.3%
Hiragana 37
 
0.2%
Punctuation 4
 
< 0.1%
Box Drawing 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1149
 
9.4%
? 1132
 
9.3%
e 825
 
6.8%
o 748
 
6.1%
a 740
 
6.1%
n 739
 
6.1%
r 493
 
4.0%
u 485
 
4.0%
i 460
 
3.8%
l 391
 
3.2%
Other values (63) 5042
41.3%
Katakana
ValueCountFrequency (%)
419
19.6%
96
 
4.5%
89
 
4.2%
84
 
3.9%
84
 
3.9%
80
 
3.7%
79
 
3.7%
70
 
3.3%
48
 
2.2%
47
 
2.2%
Other values (66) 1047
48.9%
CJK
ValueCountFrequency (%)
238
 
4.8%
162
 
3.3%
161
 
3.2%
93
 
1.9%
84
 
1.7%
64
 
1.3%
62
 
1.3%
62
 
1.3%
60
 
1.2%
55
 
1.1%
Other values (651) 3914
79.0%
Hangul
ValueCountFrequency (%)
136
 
5.7%
63
 
2.6%
52
 
2.2%
51
 
2.1%
51
 
2.1%
45
 
1.9%
44
 
1.8%
36
 
1.5%
34
 
1.4%
32
 
1.3%
Other values (377) 1848
77.3%
None
ValueCountFrequency (%)
· 32
42.1%
16
21.1%
16
21.1%
3
 
3.9%
2
 
2.6%
2
 
2.6%
2
 
2.6%
1
 
1.3%
  1
 
1.3%
1
 
1.3%
Hiragana
ValueCountFrequency (%)
7
18.9%
6
16.2%
2
 
5.4%
2
 
5.4%
2
 
5.4%
2
 
5.4%
2
 
5.4%
1
 
2.7%
1
 
2.7%
1
 
2.7%
Other values (11) 11
29.7%
Punctuation
ValueCountFrequency (%)
4
100.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

콘텐츠URL
Text

UNIQUE 

Distinct1953
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size15.4 KiB
2024-05-11T14:31:08.547322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length204
Median length190
Mean length136.55556
Min length124

Characters and Unicode

Total characters266693
Distinct characters1077
Distinct categories14 ?
Distinct scripts7 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1953 ?
Unique (%)100.0%

Sample

1st rowhttps://english.visitseoul.net/attractions/Baekyang-2024/ENP8onuvv?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENP8onuvv
2nd rowhttps://english.visitseoul.net/attractions/2024-Chaeseokjangjeonmangdae/ENPauov7d?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENPauov7d
3rd rowhttps://english.visitseoul.net/attractions/2024-changsincliff/ENPgvo4y2?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENPgvo4y2
4th rowhttps://english.visitseoul.net/attractions/ChoongAngHighSchool/ENPgcblme?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENPgcblme
5th rowhttps://english.visitseoul.net/attractions/ChoongAngStore/ENPl7gype?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENPl7gype
ValueCountFrequency (%)
https://english.visitseoul.net/attractions/baekyang-2024/enp8onuvv?utm_source=seoulopendata&utm_medium=attractions&utm_content=enp8onuvv 1
 
0.1%
https://chinese.visitseoul.net/attractions/2023--028/cnpyubahz?utm_source=seoulopendata&utm_medium=attractions&utm_content=cnpyubahz 1
 
0.1%
https://chinese.visitseoul.net/attractions/2023043/cnpw01l8t?utm_source=seoulopendata&utm_medium=attractions&utm_content=cnpw01l8t 1
 
0.1%
https://chinese.visitseoul.net/attractions/?浦大?月光彩虹?泉/cnp002220?utm_source=seoulopendata&utm_medium=attractions&utm_content=cnp002220 1
 
0.1%
https://tchinese.visitseoul.net/attractions/死六臣公園/tcp004513?utm_source=seoulopendata&utm_medium=attractions&utm_content=tcp004513 1
 
0.1%
https://chinese.visitseoul.net/attractions/文化理容院1/cnp026953?utm_source=seoulopendata&utm_medium=attractions&utm_content=cnp026953 1
 
0.1%
https://tchinese.visitseoul.net/attractions/孫基禎紀念館/tcp006995?utm_source=seoulopendata&utm_medium=attractions&utm_content=tcp006995 1
 
0.1%
https://chinese.visitseoul.net/attractions/?路西服店/cnp024708?utm_source=seoulopendata&utm_medium=attractions&utm_content=cnp024708 1
 
0.1%
https://japanese.visitseoul.net/attractions/hongik-bookstore-jp/jpp028332?utm_source=seoulopendata&utm_medium=attractions&utm_content=jpp028332 1
 
0.1%
https://chinese.visitseoul.net/attractions/弘智?和?春台城(홍지문탕춘대성)/cnp004101?utm_source=seoulopendata&utm_medium=attractions&utm_content=cnp004101 1
 
0.1%
Other values (1943) 1943
99.5%
2024-05-11T14:31:09.447318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
t 32471
 
12.2%
e 18402
 
6.9%
s 15988
 
6.0%
o 15498
 
5.8%
n 15120
 
5.7%
u 14652
 
5.5%
a 14214
 
5.3%
i 11836
 
4.4%
m 10377
 
3.9%
/ 9767
 
3.7%
Other values (1067) 108368
40.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 189001
70.9%
Decimal Number 22485
 
8.4%
Other Punctuation 22229
 
8.3%
Uppercase Letter 13635
 
5.1%
Math Symbol 5859
 
2.2%
Connector Punctuation 5859
 
2.2%
Other Letter 5546
 
2.1%
Dash Punctuation 2019
 
0.8%
Open Punctuation 25
 
< 0.1%
Close Punctuation 24
 
< 0.1%
Other values (4) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
142
 
2.6%
141
 
2.5%
116
 
2.1%
115
 
2.1%
67
 
1.2%
59
 
1.1%
57
 
1.0%
57
 
1.0%
55
 
1.0%
53
 
1.0%
Other values (965) 4684
84.5%
Lowercase Letter
ValueCountFrequency (%)
t 32471
17.2%
e 18402
9.7%
s 15988
8.5%
o 15498
8.2%
n 15120
8.0%
u 14652
7.8%
a 14214
7.5%
i 11836
 
6.3%
m 10377
 
5.5%
c 9086
 
4.8%
Other values (32) 31357
16.6%
Uppercase Letter
ValueCountFrequency (%)
P 4822
35.4%
C 1745
 
12.8%
N 1636
 
12.0%
T 908
 
6.7%
J 874
 
6.4%
E 843
 
6.2%
K 830
 
6.1%
O 803
 
5.9%
S 248
 
1.8%
M 175
 
1.3%
Other values (18) 751
 
5.5%
Decimal Number
ValueCountFrequency (%)
0 6598
29.3%
2 3281
14.6%
1 2195
 
9.8%
3 2113
 
9.4%
5 1587
 
7.1%
7 1400
 
6.2%
4 1363
 
6.1%
6 1361
 
6.1%
9 1323
 
5.9%
8 1264
 
5.6%
Other Punctuation
ValueCountFrequency (%)
/ 9767
43.9%
. 3906
 
17.6%
& 3906
 
17.6%
? 2690
 
12.1%
: 1953
 
8.8%
· 5
 
< 0.1%
2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
23
92.0%
1
 
4.0%
( 1
 
4.0%
Close Punctuation
ValueCountFrequency (%)
22
91.7%
1
 
4.2%
) 1
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 2017
99.9%
2
 
0.1%
Final Punctuation
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Math Symbol
ValueCountFrequency (%)
= 5859
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5859
100.0%
Format
ValueCountFrequency (%)
­ 5
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 202608
76.0%
Common 58511
 
21.9%
Han 3105
 
1.2%
Hangul 2182
 
0.8%
Katakana 251
 
0.1%
Cyrillic 28
 
< 0.1%
Hiragana 8
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
142
 
4.6%
116
 
3.7%
115
 
3.7%
67
 
2.2%
59
 
1.9%
47
 
1.5%
41
 
1.3%
39
 
1.3%
38
 
1.2%
36
 
1.2%
Other values (571) 2405
77.5%
Hangul
ValueCountFrequency (%)
141
 
6.5%
57
 
2.6%
57
 
2.6%
55
 
2.5%
53
 
2.4%
48
 
2.2%
37
 
1.7%
37
 
1.7%
33
 
1.5%
32
 
1.5%
Other values (325) 1632
74.8%
Katakana
ValueCountFrequency (%)
32
 
12.7%
20
 
8.0%
16
 
6.4%
16
 
6.4%
10
 
4.0%
9
 
3.6%
9
 
3.6%
9
 
3.6%
9
 
3.6%
9
 
3.6%
Other values (42) 112
44.6%
Latin
ValueCountFrequency (%)
t 32471
16.0%
e 18402
9.1%
s 15988
 
7.9%
o 15498
 
7.6%
n 15120
 
7.5%
u 14652
 
7.2%
a 14214
 
7.0%
i 11836
 
5.8%
m 10377
 
5.1%
c 9086
 
4.5%
Other values (41) 44964
22.2%
Common
ValueCountFrequency (%)
/ 9767
16.7%
0 6598
11.3%
= 5859
10.0%
_ 5859
10.0%
. 3906
 
6.7%
& 3906
 
6.7%
2 3281
 
5.6%
? 2690
 
4.6%
1 2195
 
3.8%
3 2113
 
3.6%
Other values (22) 12337
21.1%
Cyrillic
ValueCountFrequency (%)
у 3
 
10.7%
т 3
 
10.7%
л 2
 
7.1%
к 2
 
7.1%
а 2
 
7.1%
е 2
 
7.1%
р 2
 
7.1%
з 1
 
3.6%
й 1
 
3.6%
ы 1
 
3.6%
Other values (9) 9
32.1%
Hiragana
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 261052
97.9%
CJK 3105
 
1.2%
Hangul 2182
 
0.8%
Katakana 251
 
0.1%
None 59
 
< 0.1%
Cyrillic 28
 
< 0.1%
Hiragana 8
 
< 0.1%
Punctuation 7
 
< 0.1%
Box Drawing 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t 32471
 
12.4%
e 18402
 
7.0%
s 15988
 
6.1%
o 15498
 
5.9%
n 15120
 
5.8%
u 14652
 
5.6%
a 14214
 
5.4%
i 11836
 
4.5%
m 10377
 
4.0%
/ 9767
 
3.7%
Other values (61) 102727
39.4%
CJK
ValueCountFrequency (%)
142
 
4.6%
116
 
3.7%
115
 
3.7%
67
 
2.2%
59
 
1.9%
47
 
1.5%
41
 
1.3%
39
 
1.3%
38
 
1.2%
36
 
1.2%
Other values (571) 2405
77.5%
Hangul
ValueCountFrequency (%)
141
 
6.5%
57
 
2.6%
57
 
2.6%
55
 
2.5%
53
 
2.4%
48
 
2.2%
37
 
1.7%
37
 
1.7%
33
 
1.5%
32
 
1.5%
Other values (325) 1632
74.8%
Katakana
ValueCountFrequency (%)
32
 
12.7%
20
 
8.0%
16
 
6.4%
16
 
6.4%
10
 
4.0%
9
 
3.6%
9
 
3.6%
9
 
3.6%
9
 
3.6%
9
 
3.6%
Other values (42) 112
44.6%
None
ValueCountFrequency (%)
23
39.0%
22
37.3%
· 5
 
8.5%
­ 5
 
8.5%
2
 
3.4%
1
 
1.7%
1
 
1.7%
Cyrillic
ValueCountFrequency (%)
у 3
 
10.7%
т 3
 
10.7%
л 2
 
7.1%
к 2
 
7.1%
а 2
 
7.1%
е 2
 
7.1%
р 2
 
7.1%
з 1
 
3.6%
й 1
 
3.6%
ы 1
 
3.6%
Other values (9) 9
32.1%
Punctuation
ValueCountFrequency (%)
3
42.9%
2
28.6%
1
 
14.3%
1
 
14.3%
Hiragana
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Box Drawing
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct1320
Distinct (%)67.6%
Missing0
Missing (%)0.0%
Memory size15.4 KiB
2024-05-11T14:31:09.935779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length65
Mean length19.770609
Min length2

Characters and Unicode

Total characters38612
Distinct characters685
Distinct categories12 ?
Distinct scripts6 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1267 ?
Unique (%)64.9%

Sample

1st row 140-24, Gye-dong, Jongno-gu, Seoul, Korea
2nd row 서울 종로구 창신동 23-322
3rd row 서울 종로구 창신동 23-322
4th row 1, Gye-dong, Jongno-gu, Seoul, Korea
5th row 2-105, Gye-dong, Jongno-gu, Seoul, Korea
ValueCountFrequency (%)
서울 331
 
6.6%
seoul 276
 
5.5%
종로구 129
 
2.6%
jongno-gu 114
 
2.3%
중구 65
 
1.3%
jung-gu 43
 
0.9%
korea 43
 
0.9%
100-120 29
 
0.6%
1-1 29
 
0.6%
110-062 27
 
0.5%
Other values (1781) 3897
78.2%
2024-05-11T14:31:10.748382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6112
 
15.8%
1 2969
 
7.7%
- 2435
 
6.3%
0 2012
 
5.2%
2 1262
 
3.3%
? 1181
 
3.1%
o 1172
 
3.0%
g 997
 
2.6%
n 967
 
2.5%
3 845
 
2.2%
Other values (675) 18660
48.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10736
27.8%
Other Letter 10032
26.0%
Lowercase Letter 6174
16.0%
Space Separator 6113
15.8%
Dash Punctuation 2435
 
6.3%
Other Punctuation 1990
 
5.2%
Uppercase Letter 951
 
2.5%
Close Punctuation 87
 
0.2%
Open Punctuation 87
 
0.2%
Control 4
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
638
 
6.4%
622
 
6.2%
493
 
4.9%
440
 
4.4%
421
 
4.2%
365
 
3.6%
362
 
3.6%
332
 
3.3%
276
 
2.8%
269
 
2.7%
Other values (604) 5814
58.0%
Lowercase Letter
ValueCountFrequency (%)
o 1172
19.0%
g 997
16.1%
n 967
15.7%
u 717
11.6%
e 579
9.4%
a 369
 
6.0%
l 329
 
5.3%
d 272
 
4.4%
i 112
 
1.8%
r 92
 
1.5%
Other values (13) 568
9.2%
Uppercase Letter
ValueCountFrequency (%)
S 402
42.3%
J 179
18.8%
G 71
 
7.5%
Y 45
 
4.7%
K 44
 
4.6%
B 31
 
3.3%
H 29
 
3.0%
M 24
 
2.5%
D 21
 
2.2%
N 18
 
1.9%
Other values (13) 87
 
9.1%
Decimal Number
ValueCountFrequency (%)
1 2969
27.7%
0 2012
18.7%
2 1262
11.8%
3 845
 
7.9%
8 722
 
6.7%
4 713
 
6.6%
5 681
 
6.3%
7 627
 
5.8%
6 519
 
4.8%
9 386
 
3.6%
Other Punctuation
ValueCountFrequency (%)
? 1181
59.3%
, 805
40.5%
· 2
 
0.1%
. 1
 
0.1%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
6112
> 99.9%
  1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 83
95.4%
4
 
4.6%
Open Punctuation
ValueCountFrequency (%)
( 83
95.4%
4
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 2435
100.0%
Control
ValueCountFrequency (%)
4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Format
ValueCountFrequency (%)
­ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 21455
55.6%
Latin 7125
 
18.5%
Han 5700
 
14.8%
Hangul 3332
 
8.6%
Katakana 997
 
2.6%
Hiragana 3
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
638
 
11.2%
622
 
10.9%
493
 
8.6%
440
 
7.7%
266
 
4.7%
260
 
4.6%
169
 
3.0%
147
 
2.6%
136
 
2.4%
117
 
2.1%
Other values (333) 2412
42.3%
Hangul
ValueCountFrequency (%)
421
 
12.6%
365
 
11.0%
362
 
10.9%
332
 
10.0%
186
 
5.6%
143
 
4.3%
83
 
2.5%
68
 
2.0%
57
 
1.7%
42
 
1.3%
Other values (206) 1273
38.2%
Katakana
ValueCountFrequency (%)
276
27.7%
269
27.0%
268
26.9%
42
 
4.2%
11
 
1.1%
8
 
0.8%
7
 
0.7%
7
 
0.7%
7
 
0.7%
6
 
0.6%
Other values (42) 96
 
9.6%
Latin
ValueCountFrequency (%)
o 1172
16.4%
g 997
14.0%
n 967
13.6%
u 717
10.1%
e 579
8.1%
S 402
 
5.6%
a 369
 
5.2%
l 329
 
4.6%
d 272
 
3.8%
J 179
 
2.5%
Other values (36) 1142
16.0%
Common
ValueCountFrequency (%)
6112
28.5%
1 2969
13.8%
- 2435
 
11.3%
0 2012
 
9.4%
2 1262
 
5.9%
? 1181
 
5.5%
3 845
 
3.9%
, 805
 
3.8%
8 722
 
3.4%
4 713
 
3.3%
Other values (15) 2399
 
11.2%
Hiragana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 28567
74.0%
CJK 5685
 
14.7%
Hangul 3332
 
8.6%
Katakana 997
 
2.6%
CJK Compat Ideographs 15
 
< 0.1%
None 13
 
< 0.1%
Hiragana 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6112
21.4%
1 2969
 
10.4%
- 2435
 
8.5%
0 2012
 
7.0%
2 1262
 
4.4%
? 1181
 
4.1%
o 1172
 
4.1%
g 997
 
3.5%
n 967
 
3.4%
3 845
 
3.0%
Other values (55) 8615
30.2%
CJK
ValueCountFrequency (%)
638
 
11.2%
622
 
10.9%
493
 
8.7%
440
 
7.7%
266
 
4.7%
260
 
4.6%
169
 
3.0%
147
 
2.6%
136
 
2.4%
117
 
2.1%
Other values (326) 2397
42.2%
Hangul
ValueCountFrequency (%)
421
 
12.6%
365
 
11.0%
362
 
10.9%
332
 
10.0%
186
 
5.6%
143
 
4.3%
83
 
2.5%
68
 
2.0%
57
 
1.7%
42
 
1.3%
Other values (206) 1273
38.2%
Katakana
ValueCountFrequency (%)
276
27.7%
269
27.0%
268
26.9%
42
 
4.2%
11
 
1.1%
8
 
0.8%
7
 
0.7%
7
 
0.7%
7
 
0.7%
6
 
0.6%
Other values (42) 96
 
9.6%
CJK Compat Ideographs
ValueCountFrequency (%)
7
46.7%
2
 
13.3%
2
 
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
None
ValueCountFrequency (%)
4
30.8%
4
30.8%
· 2
15.4%
­ 1
 
7.7%
1
 
7.7%
  1
 
7.7%
Hiragana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Distinct1909
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size15.4 KiB
2024-05-11T14:31:11.396656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length102
Median length72
Mean length31.637993
Min length2

Characters and Unicode

Total characters61789
Distinct characters999
Distinct categories12 ?
Distinct scripts6 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1878 ?
Unique (%)96.2%

Sample

1st row03057 54 Gyedong-gil, Jongno-gu, Seoul
2nd row03091 51 Naksan 5-gil, Jongno-gu, Seoul
3rd row03091 23-322 Changsin-dong, Dongdaemun-gu, Seoul
4th row03051 164 Changdeokgung-gil, Jongno-gu, Seoul
5th row03051 162 Changdeokgung-gil, Jongno-gu, Seoul
ValueCountFrequency (%)
seoul 396
 
5.1%
서울 329
 
4.2%
jongno-gu 139
 
1.8%
종로구 137
 
1.8%
중구 59
 
0.8%
jung-gu 59
 
0.8%
서울특별시 43
 
0.6%
03056 31
 
0.4%
yongsan-gu 29
 
0.4%
03145 29
 
0.4%
Other values (3137) 6529
83.9%
2024-05-11T14:31:12.354947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7746
 
12.5%
0 3269
 
5.3%
1 2534
 
4.1%
? 2095
 
3.4%
3 1880
 
3.0%
o 1663
 
2.7%
2 1507
 
2.4%
4 1435
 
2.3%
5 1384
 
2.2%
- 1379
 
2.2%
Other values (989) 36897
59.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20227
32.7%
Decimal Number 15449
25.0%
Lowercase Letter 9515
15.4%
Space Separator 7746
 
12.5%
Other Punctuation 3581
 
5.8%
Uppercase Letter 1553
 
2.5%
Dash Punctuation 1379
 
2.2%
Open Punctuation 1166
 
1.9%
Close Punctuation 1165
 
1.9%
Math Symbol 5
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1114
 
5.5%
867
 
4.3%
788
 
3.9%
575
 
2.8%
571
 
2.8%
488
 
2.4%
468
 
2.3%
444
 
2.2%
441
 
2.2%
405
 
2.0%
Other values (908) 14066
69.5%
Lowercase Letter
ValueCountFrequency (%)
o 1663
17.5%
g 1344
14.1%
n 1177
12.4%
u 1104
11.6%
e 918
9.6%
l 684
7.2%
a 614
 
6.5%
r 429
 
4.5%
i 322
 
3.4%
d 208
 
2.2%
Other values (15) 1052
11.1%
Uppercase Letter
ValueCountFrequency (%)
S 574
37.0%
J 228
 
14.7%
Y 84
 
5.4%
G 82
 
5.3%
B 75
 
4.8%
D 61
 
3.9%
I 54
 
3.5%
H 46
 
3.0%
C 40
 
2.6%
M 36
 
2.3%
Other values (14) 273
17.6%
Decimal Number
ValueCountFrequency (%)
0 3269
21.2%
1 2534
16.4%
3 1880
12.2%
2 1507
9.8%
4 1435
9.3%
5 1384
9.0%
6 1006
 
6.5%
7 1003
 
6.5%
8 782
 
5.1%
9 648
 
4.2%
Other Punctuation
ValueCountFrequency (%)
? 2095
58.5%
, 1295
36.2%
97
 
2.7%
53
 
1.5%
. 15
 
0.4%
# 14
 
0.4%
· 7
 
0.2%
& 3
 
0.1%
: 1
 
< 0.1%
' 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 1127
96.7%
37
 
3.2%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1124
96.4%
41
 
3.5%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
7746
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1379
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Control
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 30494
49.4%
Latin 11068
 
17.9%
Han 9437
 
15.3%
Hangul 5716
 
9.3%
Katakana 5060
 
8.2%
Hiragana 14
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
1114
 
11.8%
788
 
8.4%
575
 
6.1%
444
 
4.7%
441
 
4.7%
390
 
4.1%
293
 
3.1%
276
 
2.9%
216
 
2.3%
168
 
1.8%
Other values (507) 4732
50.1%
Hangul
ValueCountFrequency (%)
488
 
8.5%
468
 
8.2%
405
 
7.1%
388
 
6.8%
300
 
5.2%
181
 
3.2%
172
 
3.0%
132
 
2.3%
87
 
1.5%
78
 
1.4%
Other values (313) 3017
52.8%
Katakana
ValueCountFrequency (%)
867
17.1%
571
 
11.3%
393
 
7.8%
365
 
7.2%
313
 
6.2%
290
 
5.7%
278
 
5.5%
195
 
3.9%
125
 
2.5%
104
 
2.1%
Other values (62) 1559
30.8%
Latin
ValueCountFrequency (%)
o 1663
15.0%
g 1344
12.1%
n 1177
10.6%
u 1104
10.0%
e 918
8.3%
l 684
 
6.2%
a 614
 
5.5%
S 574
 
5.2%
r 429
 
3.9%
i 322
 
2.9%
Other values (39) 2239
20.2%
Common
ValueCountFrequency (%)
7746
25.4%
0 3269
10.7%
1 2534
 
8.3%
? 2095
 
6.9%
3 1880
 
6.2%
2 1507
 
4.9%
4 1435
 
4.7%
5 1384
 
4.5%
- 1379
 
4.5%
, 1295
 
4.2%
Other values (22) 5970
19.6%
Hiragana
ValueCountFrequency (%)
9
64.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 41323
66.9%
CJK 9421
 
15.2%
Hangul 5716
 
9.3%
Katakana 5060
 
8.2%
None 238
 
0.4%
CJK Compat Ideographs 16
 
< 0.1%
Hiragana 14
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7746
18.7%
0 3269
 
7.9%
1 2534
 
6.1%
? 2095
 
5.1%
3 1880
 
4.5%
o 1663
 
4.0%
2 1507
 
3.6%
4 1435
 
3.5%
5 1384
 
3.3%
- 1379
 
3.3%
Other values (62) 16431
39.8%
CJK
ValueCountFrequency (%)
1114
 
11.8%
788
 
8.4%
575
 
6.1%
444
 
4.7%
441
 
4.7%
390
 
4.1%
293
 
3.1%
276
 
2.9%
216
 
2.3%
168
 
1.8%
Other values (500) 4716
50.1%
Katakana
ValueCountFrequency (%)
867
17.1%
571
 
11.3%
393
 
7.8%
365
 
7.2%
313
 
6.2%
290
 
5.7%
278
 
5.5%
195
 
3.9%
125
 
2.5%
104
 
2.1%
Other values (62) 1559
30.8%
Hangul
ValueCountFrequency (%)
488
 
8.5%
468
 
8.2%
405
 
7.1%
388
 
6.8%
300
 
5.2%
181
 
3.2%
172
 
3.0%
132
 
2.3%
87
 
1.5%
78
 
1.4%
Other values (313) 3017
52.8%
None
ValueCountFrequency (%)
97
40.8%
53
22.3%
41
17.2%
37
 
15.5%
· 7
 
2.9%
1
 
0.4%
1
 
0.4%
1
 
0.4%
CJK Compat Ideographs
ValueCountFrequency (%)
9
56.2%
2
 
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Hiragana
ValueCountFrequency (%)
9
64.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Punctuation
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct768
Distinct (%)41.7%
Missing113
Missing (%)5.8%
Memory size15.4 KiB
2024-05-11T14:31:12.812349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length67
Mean length14.063043
Min length6

Characters and Unicode

Total characters25876
Distinct characters115
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique398 ?
Unique (%)21.6%

Sample

1st row+82-2-762-1261
2nd row+82-507-1330-5416
3rd row+82-2-742-1321
4th row+82-2-3780-0578
5th row02-774-1784
ValueCountFrequency (%)
82-2-120 38
 
2.0%
02-120 12
 
0.6%
82-2-724-0274 9
 
0.5%
82-2)2133-5695 8
 
0.4%
82-2-970-4500 8
 
0.4%
82-2-793-8249 8
 
0.4%
82-2-3780-0578 8
 
0.4%
82-2-2077-9000 8
 
0.4%
82-2-762-4868 7
 
0.4%
82-2-731-0412 7
 
0.4%
Other values (763) 1742
93.9%
2024-05-11T14:31:13.519362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 5120
19.8%
2 4838
18.7%
8 2483
9.6%
0 2226
8.6%
3 1630
 
6.3%
7 1624
 
6.3%
+ 1490
 
5.8%
1 1472
 
5.7%
4 1263
 
4.9%
6 1146
 
4.4%
Other values (105) 2584
10.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18896
73.0%
Dash Punctuation 5120
 
19.8%
Math Symbol 1526
 
5.9%
Other Letter 103
 
0.4%
Lowercase Letter 81
 
0.3%
Other Punctuation 53
 
0.2%
Space Separator 27
 
0.1%
Close Punctuation 26
 
0.1%
Open Punctuation 16
 
0.1%
Uppercase Letter 14
 
0.1%
Other values (2) 14
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
5.8%
6
 
5.8%
5
 
4.9%
5
 
4.9%
4
 
3.9%
4
 
3.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
Other values (48) 61
59.2%
Lowercase Letter
ValueCountFrequency (%)
s 11
13.6%
e 10
12.3%
u 9
11.1%
r 8
9.9%
t 8
9.9%
i 6
7.4%
o 6
7.4%
n 5
6.2%
a 5
6.2%
m 4
 
4.9%
Other values (7) 9
11.1%
Decimal Number
ValueCountFrequency (%)
2 4838
25.6%
8 2483
13.1%
0 2226
11.8%
3 1630
 
8.6%
7 1624
 
8.6%
1 1472
 
7.8%
4 1263
 
6.7%
6 1146
 
6.1%
9 1118
 
5.9%
5 1096
 
5.8%
Other Punctuation
ValueCountFrequency (%)
? 15
28.3%
/ 14
26.4%
, 7
13.2%
6
 
11.3%
: 4
 
7.5%
3
 
5.7%
2
 
3.8%
1
 
1.9%
. 1
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
M 3
21.4%
C 3
21.4%
T 2
14.3%
D 1
 
7.1%
I 1
 
7.1%
S 1
 
7.1%
A 1
 
7.1%
N 1
 
7.1%
H 1
 
7.1%
Math Symbol
ValueCountFrequency (%)
+ 1490
97.6%
~ 34
 
2.2%
< 1
 
0.1%
> 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 22
84.6%
4
 
15.4%
Open Punctuation
ValueCountFrequency (%)
( 12
75.0%
4
 
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 5120
100.0%
Space Separator
ValueCountFrequency (%)
27
100.0%
Control
ValueCountFrequency (%)
12
100.0%
Format
ValueCountFrequency (%)
­ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 25678
99.2%
Latin 95
 
0.4%
Han 69
 
0.3%
Hangul 31
 
0.1%
Katakana 3
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 5120
19.9%
2 4838
18.8%
8 2483
9.7%
0 2226
8.7%
3 1630
 
6.3%
7 1624
 
6.3%
+ 1490
 
5.8%
1 1472
 
5.7%
4 1263
 
4.9%
6 1146
 
4.5%
Other values (21) 2386
9.3%
Han
ValueCountFrequency (%)
6
 
8.7%
6
 
8.7%
5
 
7.2%
5
 
7.2%
4
 
5.8%
3
 
4.3%
3
 
4.3%
3
 
4.3%
3
 
4.3%
3
 
4.3%
Other values (21) 28
40.6%
Latin
ValueCountFrequency (%)
s 11
11.6%
e 10
10.5%
u 9
 
9.5%
r 8
 
8.4%
t 8
 
8.4%
i 6
 
6.3%
o 6
 
6.3%
n 5
 
5.3%
a 5
 
5.3%
m 4
 
4.2%
Other values (16) 23
24.2%
Hangul
ValueCountFrequency (%)
4
 
12.9%
2
 
6.5%
2
 
6.5%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (14) 14
45.2%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25751
99.5%
CJK 69
 
0.3%
Hangul 31
 
0.1%
None 22
 
0.1%
Katakana 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 5120
19.9%
2 4838
18.8%
8 2483
9.6%
0 2226
8.6%
3 1630
 
6.3%
7 1624
 
6.3%
+ 1490
 
5.8%
1 1472
 
5.7%
4 1263
 
4.9%
6 1146
 
4.5%
Other values (40) 2459
9.5%
CJK
ValueCountFrequency (%)
6
 
8.7%
6
 
8.7%
5
 
7.2%
5
 
7.2%
4
 
5.8%
3
 
4.3%
3
 
4.3%
3
 
4.3%
3
 
4.3%
3
 
4.3%
Other values (21) 28
40.6%
None
ValueCountFrequency (%)
6
27.3%
4
18.2%
4
18.2%
3
13.6%
2
 
9.1%
­ 2
 
9.1%
1
 
4.5%
Hangul
ValueCountFrequency (%)
4
 
12.9%
2
 
6.5%
2
 
6.5%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (14) 14
45.2%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

팩스번호
Categorical

HIGH CARDINALITY  HIGH CORRELATION  IMBALANCE 

Distinct51
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size15.4 KiB
<NA>
1850 
+82-2-743-8786
 
4
+82-2-732-9928
 
4
+82-2-766-8643
 
4
+82-2-2660-2488
 
4
Other values (46)
 
87

Length

Max length18
Median length4
Mean length4.5202253
Min length4

Unique

Unique26 ?
Unique (%)1.3%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 1850
94.7%
+82-2-743-8786 4
 
0.2%
+82-2-732-9928 4
 
0.2%
+82-2-766-8643 4
 
0.2%
+82-2-2660-2488 4
 
0.2%
+82-2-2022-0644 4
 
0.2%
+82-2-2147-3874 4
 
0.2%
+82-2-969-9245 4
 
0.2%
+82-2-753-4254 4
 
0.2%
+82-2-957-2569 4
 
0.2%
Other values (41) 67
 
3.4%

Length

2024-05-11T14:31:13.766485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 1850
94.3%
9470 5
 
0.3%
82-2-732-9928 4
 
0.2%
82-2-766-8643 4
 
0.2%
82-2-2660-2488 4
 
0.2%
82-2-2022-0644 4
 
0.2%
82-2-2147-3874 4
 
0.2%
82-2-969-9245 4
 
0.2%
82-2-753-4254 4
 
0.2%
82-2-957-2569 4
 
0.2%
Other values (42) 74
 
3.8%

웹사이트
Text

MISSING 

Distinct490
Distinct (%)36.6%
Missing614
Missing (%)31.4%
Memory size15.4 KiB
2024-05-11T14:31:14.207521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length233
Median length90
Mean length35.094847
Min length14

Characters and Unicode

Total characters46992
Distinct characters76
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique207 ?
Unique (%)15.5%

Sample

1st rowhttps://sema.seoul.go.kr/en/index
2nd rowhttps://sema.seoul.go.kr/en/index
3rd rowhttps://lib.seoul.go.kr/rwww/html/ko/seoulArchRoom.jsp
4th rowhttps://sema.seoul.go.kr/
5th rowhttps://www.ssangma.net/
ValueCountFrequency (%)
http://www.sta.or.kr 40
 
3.0%
http://www.mmca.go.kr 9
 
0.7%
http://www.deoksugung.go.kr 8
 
0.6%
http://dmvillage.info 7
 
0.5%
http://sewoon.org 6
 
0.4%
http://plaza.seoul.go.kr/gwanghwamun 6
 
0.4%
http://www.museum.seoul.kr 6
 
0.4%
https://sema.seoul.go.kr/en/index 6
 
0.4%
http://hakrim.pe.kr 5
 
0.4%
http://taeil.org 5
 
0.4%
Other values (477) 1253
92.7%
2024-05-11T14:31:15.046944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 4172
 
8.9%
. 3683
 
7.8%
t 3650
 
7.8%
o 3066
 
6.5%
w 2684
 
5.7%
e 2381
 
5.1%
r 2146
 
4.6%
s 1961
 
4.2%
a 1912
 
4.1%
h 1894
 
4.0%
Other values (66) 19443
41.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 34881
74.2%
Other Punctuation 9388
 
20.0%
Decimal Number 1297
 
2.8%
Uppercase Letter 712
 
1.5%
Math Symbol 313
 
0.7%
Connector Punctuation 285
 
0.6%
Dash Punctuation 48
 
0.1%
Space Separator 35
 
0.1%
Other Letter 25
 
0.1%
Control 8
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 3650
 
10.5%
o 3066
 
8.8%
w 2684
 
7.7%
e 2381
 
6.8%
r 2146
 
6.2%
s 1961
 
5.6%
a 1912
 
5.5%
h 1894
 
5.4%
n 1803
 
5.2%
p 1770
 
5.1%
Other values (16) 11614
33.3%
Uppercase Letter
ValueCountFrequency (%)
I 121
17.0%
H 86
12.1%
R 64
9.0%
P 54
 
7.6%
C 53
 
7.4%
V 47
 
6.6%
T 46
 
6.5%
N 42
 
5.9%
G 29
 
4.1%
A 26
 
3.7%
Other values (11) 144
20.2%
Decimal Number
ValueCountFrequency (%)
0 410
31.6%
1 268
20.7%
2 166
12.8%
3 86
 
6.6%
6 84
 
6.5%
5 80
 
6.2%
4 67
 
5.2%
7 52
 
4.0%
9 46
 
3.5%
8 38
 
2.9%
Other Punctuation
ValueCountFrequency (%)
/ 4172
44.4%
. 3683
39.2%
: 1195
 
12.7%
? 183
 
1.9%
& 129
 
1.4%
# 17
 
0.2%
, 6
 
0.1%
; 2
 
< 0.1%
' 1
 
< 0.1%
Other Letter
ValueCountFrequency (%)
5
20.0%
5
20.0%
5
20.0%
5
20.0%
5
20.0%
Math Symbol
ValueCountFrequency (%)
= 313
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 285
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 48
100.0%
Space Separator
ValueCountFrequency (%)
35
100.0%
Control
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 35593
75.7%
Common 11374
 
24.2%
Hangul 25
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 3650
 
10.3%
o 3066
 
8.6%
w 2684
 
7.5%
e 2381
 
6.7%
r 2146
 
6.0%
s 1961
 
5.5%
a 1912
 
5.4%
h 1894
 
5.3%
n 1803
 
5.1%
p 1770
 
5.0%
Other values (37) 12326
34.6%
Common
ValueCountFrequency (%)
/ 4172
36.7%
. 3683
32.4%
: 1195
 
10.5%
0 410
 
3.6%
= 313
 
2.8%
_ 285
 
2.5%
1 268
 
2.4%
? 183
 
1.6%
2 166
 
1.5%
& 129
 
1.1%
Other values (14) 570
 
5.0%
Hangul
ValueCountFrequency (%)
5
20.0%
5
20.0%
5
20.0%
5
20.0%
5
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 46967
99.9%
Hangul 25
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 4172
 
8.9%
. 3683
 
7.8%
t 3650
 
7.8%
o 3066
 
6.5%
w 2684
 
5.7%
e 2381
 
5.1%
r 2146
 
4.6%
s 1961
 
4.2%
a 1912
 
4.1%
h 1894
 
4.0%
Other values (61) 19418
41.3%
Hangul
ValueCountFrequency (%)
5
20.0%
5
20.0%
5
20.0%
5
20.0%
5
20.0%

운영시간
Text

MISSING 

Distinct1244
Distinct (%)73.4%
Missing259
Missing (%)13.3%
Memory size15.4 KiB
2024-05-11T14:31:15.575954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length396
Median length202
Mean length34.649351
Min length2

Characters and Unicode

Total characters58696
Distinct characters865
Distinct categories14 ?
Distinct scripts6 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1105 ?
Unique (%)65.2%

Sample

1st rowTue-Fri 10:00 - 20:00 / Sat-Sun 10:00 - 22:00
2nd row성당사무실 화 ~ 금 | 09:00 ~ 20:30 토 요 일 | 09:00 ~ 20:00 일 요 일 | 09:00 ~ 21:00
3rd rowTuesday - Friday 10:00 - 20:00 KST Sat, Holiday 10:00 - 19:00 KST
4th row週二 - 週五 10:00 - 20:00 週六、公休日 10:00 - 19:00
5th rowEvery Tue-Sun, 09:00-18:00 (Closed on Mondays and public holidays)
ValueCountFrequency (%)
1308
 
15.2%
kst 541
 
6.3%
10:00 351
 
4.1%
18:00 287
 
3.3%
09:00 241
 
2.8%
17:00 143
 
1.7%
19:00 114
 
1.3%
daily 101
 
1.2%
10:00~18:00 90
 
1.0%
20:00 76
 
0.9%
Other values (2039) 5360
62.2%
2024-05-11T14:31:16.779774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 11264
19.2%
7311
 
12.5%
: 4986
 
8.5%
1 4185
 
7.1%
~ 1736
 
3.0%
9 1184
 
2.0%
? 1149
 
2.0%
2 1138
 
1.9%
- 1032
 
1.8%
3 1008
 
1.7%
Other values (855) 23703
40.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 20917
35.6%
Other Letter 9689
16.5%
Other Punctuation 7334
 
12.5%
Space Separator 7311
 
12.5%
Lowercase Letter 6094
 
10.4%
Uppercase Letter 2648
 
4.5%
Math Symbol 1788
 
3.0%
Dash Punctuation 1033
 
1.8%
Close Punctuation 941
 
1.6%
Open Punctuation 926
 
1.6%
Other values (4) 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
474
 
4.9%
458
 
4.7%
260
 
2.7%
220
 
2.3%
213
 
2.2%
206
 
2.1%
196
 
2.0%
193
 
2.0%
159
 
1.6%
154
 
1.6%
Other values (759) 7156
73.9%
Lowercase Letter
ValueCountFrequency (%)
e 752
12.3%
a 644
10.6%
r 505
 
8.3%
s 452
 
7.4%
t 449
 
7.4%
o 402
 
6.6%
n 393
 
6.4%
y 385
 
6.3%
i 385
 
6.3%
u 289
 
4.7%
Other values (16) 1438
23.6%
Uppercase Letter
ValueCountFrequency (%)
S 704
26.6%
T 637
24.1%
K 583
22.0%
M 127
 
4.8%
D 112
 
4.2%
F 71
 
2.7%
W 63
 
2.4%
L 60
 
2.3%
O 44
 
1.7%
N 40
 
1.5%
Other values (13) 207
 
7.8%
Other Punctuation
ValueCountFrequency (%)
: 4986
68.0%
? 1149
 
15.7%
, 241
 
3.3%
203
 
2.8%
* 179
 
2.4%
/ 113
 
1.5%
97
 
1.3%
93
 
1.3%
78
 
1.1%
. 54
 
0.7%
Other values (5) 141
 
1.9%
Decimal Number
ValueCountFrequency (%)
0 11264
53.9%
1 4185
 
20.0%
9 1184
 
5.7%
2 1138
 
5.4%
3 1008
 
4.8%
8 941
 
4.5%
7 621
 
3.0%
6 269
 
1.3%
4 181
 
0.9%
5 125
 
0.6%
Math Symbol
ValueCountFrequency (%)
~ 1736
97.1%
> 19
 
1.1%
< 19
 
1.1%
| 8
 
0.4%
3
 
0.2%
+ 3
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 795
84.5%
119
 
12.6%
] 26
 
2.8%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 786
84.9%
113
 
12.2%
[ 26
 
2.8%
1
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 1032
99.9%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
7311
100.0%
Control
ValueCountFrequency (%)
9
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Format
ValueCountFrequency (%)
­ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40265
68.6%
Latin 8742
 
14.9%
Han 6065
 
10.3%
Hangul 2783
 
4.7%
Hiragana 607
 
1.0%
Katakana 234
 
0.4%

Most frequent character per script

Han
ValueCountFrequency (%)
474
 
7.8%
458
 
7.6%
260
 
4.3%
213
 
3.5%
206
 
3.4%
196
 
3.2%
159
 
2.6%
154
 
2.5%
137
 
2.3%
126
 
2.1%
Other values (412) 3682
60.7%
Hangul
ValueCountFrequency (%)
220
 
7.9%
193
 
6.9%
140
 
5.0%
97
 
3.5%
95
 
3.4%
81
 
2.9%
68
 
2.4%
67
 
2.4%
64
 
2.3%
55
 
2.0%
Other values (242) 1703
61.2%
Katakana
ValueCountFrequency (%)
21
 
9.0%
21
 
9.0%
13
 
5.6%
13
 
5.6%
11
 
4.7%
11
 
4.7%
10
 
4.3%
9
 
3.8%
8
 
3.4%
8
 
3.4%
Other values (42) 109
46.6%
Latin
ValueCountFrequency (%)
e 752
 
8.6%
S 704
 
8.1%
a 644
 
7.4%
T 637
 
7.3%
K 583
 
6.7%
r 505
 
5.8%
s 452
 
5.2%
t 449
 
5.1%
o 402
 
4.6%
n 393
 
4.5%
Other values (39) 3221
36.8%
Common
ValueCountFrequency (%)
0 11264
28.0%
7311
18.2%
: 4986
12.4%
1 4185
 
10.4%
~ 1736
 
4.3%
9 1184
 
2.9%
? 1149
 
2.9%
2 1138
 
2.8%
- 1032
 
2.6%
3 1008
 
2.5%
Other values (37) 5272
13.1%
Hiragana
ValueCountFrequency (%)
88
14.5%
85
14.0%
76
12.5%
48
 
7.9%
29
 
4.8%
23
 
3.8%
21
 
3.5%
20
 
3.3%
19
 
3.1%
17
 
2.8%
Other values (33) 181
29.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 48202
82.1%
CJK 6062
 
10.3%
Hangul 2783
 
4.7%
None 704
 
1.2%
Hiragana 607
 
1.0%
Katakana 234
 
0.4%
Punctuation 98
 
0.2%
Math Operators 3
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11264
23.4%
7311
15.2%
: 4986
 
10.3%
1 4185
 
8.7%
~ 1736
 
3.6%
9 1184
 
2.5%
? 1149
 
2.4%
2 1138
 
2.4%
- 1032
 
2.1%
3 1008
 
2.1%
Other values (69) 13209
27.4%
CJK
ValueCountFrequency (%)
474
 
7.8%
458
 
7.6%
260
 
4.3%
213
 
3.5%
206
 
3.4%
196
 
3.2%
159
 
2.6%
154
 
2.5%
137
 
2.3%
126
 
2.1%
Other values (410) 3679
60.7%
Hangul
ValueCountFrequency (%)
220
 
7.9%
193
 
6.9%
140
 
5.0%
97
 
3.5%
95
 
3.4%
81
 
2.9%
68
 
2.4%
67
 
2.4%
64
 
2.3%
55
 
2.0%
Other values (242) 1703
61.2%
None
ValueCountFrequency (%)
203
28.8%
119
16.9%
113
16.1%
97
13.8%
78
 
11.1%
37
 
5.3%
33
 
4.7%
· 19
 
2.7%
­ 2
 
0.3%
1
 
0.1%
Other values (2) 2
 
0.3%
Punctuation
ValueCountFrequency (%)
93
94.9%
2
 
2.0%
2
 
2.0%
1
 
1.0%
Hiragana
ValueCountFrequency (%)
88
14.5%
85
14.0%
76
12.5%
48
 
7.9%
29
 
4.8%
23
 
3.8%
21
 
3.5%
20
 
3.3%
19
 
3.1%
17
 
2.8%
Other values (33) 181
29.8%
Katakana
ValueCountFrequency (%)
21
 
9.0%
21
 
9.0%
13
 
5.6%
13
 
5.6%
11
 
4.7%
11
 
4.7%
10
 
4.3%
9
 
3.8%
8
 
3.4%
8
 
3.4%
Other values (42) 109
46.6%
Math Operators
ValueCountFrequency (%)
3
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
2
66.7%
1
33.3%

운영요일
Text

MISSING 

Distinct225
Distinct (%)21.7%
Missing916
Missing (%)46.9%
Memory size15.4 KiB
2024-05-11T14:31:17.248883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length58
Mean length5.6827387
Min length2

Characters and Unicode

Total characters5893
Distinct characters249
Distinct categories11 ?
Distinct scripts6 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique137 ?
Unique (%)13.2%

Sample

1st rowTue-Sun
2nd row週二 - 週日
3rd rowTuesday, Wednesday, Thursday, Friday, Saturday, Sunday
4th rowDaily
5th rowDaily
ValueCountFrequency (%)
화~일 78
 
5.8%
每天 73
 
5.4%
매일 70
 
5.2%
69
 
5.1%
67
 
5.0%
daily 62
 
4.6%
週二~週日 61
 
4.5%
每日 60
 
4.5%
周二~周日 39
 
2.9%
週一~週六 26
 
1.9%
Other values (225) 740
55.0%
2024-05-11T14:31:18.148497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
475
 
8.1%
~ 426
 
7.2%
344
 
5.8%
323
 
5.5%
258
 
4.4%
220
 
3.7%
a 204
 
3.5%
189
 
3.2%
y 179
 
3.0%
? 178
 
3.0%
Other values (239) 3097
52.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3124
53.0%
Lowercase Letter 1240
 
21.0%
Math Symbol 426
 
7.2%
Space Separator 344
 
5.8%
Other Punctuation 290
 
4.9%
Uppercase Letter 248
 
4.2%
Decimal Number 134
 
2.3%
Dash Punctuation 67
 
1.1%
Close Punctuation 9
 
0.2%
Open Punctuation 9
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
475
15.2%
323
 
10.3%
258
 
8.3%
220
 
7.0%
189
 
6.0%
138
 
4.4%
132
 
4.2%
110
 
3.5%
93
 
3.0%
92
 
2.9%
Other values (176) 1094
35.0%
Lowercase Letter
ValueCountFrequency (%)
a 204
16.5%
y 179
14.4%
d 125
10.1%
u 111
9.0%
n 93
7.5%
e 84
6.8%
i 77
 
6.2%
l 70
 
5.6%
r 67
 
5.4%
o 65
 
5.2%
Other values (12) 165
13.3%
Uppercase Letter
ValueCountFrequency (%)
S 67
27.0%
D 60
24.2%
T 44
17.7%
M 37
14.9%
F 11
 
4.4%
W 9
 
3.6%
K 6
 
2.4%
O 6
 
2.4%
E 4
 
1.6%
A 1
 
0.4%
Other values (3) 3
 
1.2%
Other Punctuation
ValueCountFrequency (%)
? 178
61.4%
: 32
 
11.0%
, 31
 
10.7%
28
 
9.7%
6
 
2.1%
5
 
1.7%
3
 
1.0%
3
 
1.0%
* 2
 
0.7%
. 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
0 74
55.2%
1 30
22.4%
2 10
 
7.5%
8 8
 
6.0%
9 4
 
3.0%
3 3
 
2.2%
4 3
 
2.2%
7 1
 
0.7%
5 1
 
0.7%
Close Punctuation
ValueCountFrequency (%)
) 5
55.6%
4
44.4%
Open Punctuation
ValueCountFrequency (%)
( 5
55.6%
4
44.4%
Math Symbol
ValueCountFrequency (%)
~ 426
100.0%
Space Separator
ValueCountFrequency (%)
344
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 67
100.0%
Control
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 2449
41.6%
Latin 1488
25.3%
Common 1281
21.7%
Hangul 639
 
10.8%
Hiragana 24
 
0.4%
Katakana 12
 
0.2%

Most frequent character per script

Han
ValueCountFrequency (%)
475
19.4%
323
13.2%
258
10.5%
189
 
7.7%
138
 
5.6%
132
 
5.4%
93
 
3.8%
92
 
3.8%
91
 
3.7%
87
 
3.6%
Other values (103) 571
23.3%
Hangul
ValueCountFrequency (%)
220
34.4%
110
17.2%
74
 
11.6%
70
 
11.0%
50
 
7.8%
22
 
3.4%
22
 
3.4%
12
 
1.9%
12
 
1.9%
3
 
0.5%
Other values (40) 44
 
6.9%
Latin
ValueCountFrequency (%)
a 204
13.7%
y 179
12.0%
d 125
 
8.4%
u 111
 
7.5%
n 93
 
6.2%
e 84
 
5.6%
i 77
 
5.2%
l 70
 
4.7%
r 67
 
4.5%
S 67
 
4.5%
Other values (25) 411
27.6%
Common
ValueCountFrequency (%)
~ 426
33.3%
344
26.9%
? 178
13.9%
0 74
 
5.8%
- 67
 
5.2%
: 32
 
2.5%
, 31
 
2.4%
1 30
 
2.3%
28
 
2.2%
2 10
 
0.8%
Other values (18) 61
 
4.8%
Hiragana
ValueCountFrequency (%)
4
16.7%
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (7) 7
29.2%
Katakana
ValueCountFrequency (%)
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2715
46.1%
CJK 2449
41.6%
Hangul 639
 
10.8%
None 51
 
0.9%
Hiragana 24
 
0.4%
Katakana 12
 
0.2%
Punctuation 3
 
0.1%

Most frequent character per block

CJK
ValueCountFrequency (%)
475
19.4%
323
13.2%
258
10.5%
189
 
7.7%
138
 
5.6%
132
 
5.4%
93
 
3.8%
92
 
3.8%
91
 
3.7%
87
 
3.6%
Other values (103) 571
23.3%
ASCII
ValueCountFrequency (%)
~ 426
15.7%
344
 
12.7%
a 204
 
7.5%
y 179
 
6.6%
? 178
 
6.6%
d 125
 
4.6%
u 111
 
4.1%
n 93
 
3.4%
e 84
 
3.1%
i 77
 
2.8%
Other values (45) 894
32.9%
Hangul
ValueCountFrequency (%)
220
34.4%
110
17.2%
74
 
11.6%
70
 
11.0%
50
 
7.8%
22
 
3.4%
22
 
3.4%
12
 
1.9%
12
 
1.9%
3
 
0.5%
Other values (40) 44
 
6.9%
None
ValueCountFrequency (%)
28
54.9%
6
 
11.8%
5
 
9.8%
4
 
7.8%
4
 
7.8%
3
 
5.9%
· 1
 
2.0%
Hiragana
ValueCountFrequency (%)
4
16.7%
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (7) 7
29.2%
Punctuation
ValueCountFrequency (%)
3
100.0%
Katakana
ValueCountFrequency (%)
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%

휴무일
Text

MISSING 

Distinct663
Distinct (%)47.4%
Missing555
Missing (%)28.4%
Memory size15.4 KiB
2024-05-11T14:31:18.584359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length258
Median length124
Mean length13.603004
Min length1

Characters and Unicode

Total characters19017
Distinct characters528
Distinct categories12 ?
Distinct scripts6 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique546 ?
Unique (%)39.1%

Sample

1st rowMondays
2nd row설날, 추석 당일 (성당사무실: 월요일 휴무)
3rd rowMondays
4th row週一
5th rowMondays and public holidays
ValueCountFrequency (%)
closed 141
 
4.4%
118
 
3.7%
mondays 101
 
3.2%
월요일 95
 
3.0%
new 81
 
2.5%
なし 59
 
1.8%
56
 
1.8%
54
 
1.7%
없음 52
 
1.6%
day 52
 
1.6%
Other values (776) 2382
74.6%
2024-05-11T14:31:19.467806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1884
 
9.9%
818
 
4.3%
? 669
 
3.5%
a 635
 
3.3%
e 583
 
3.1%
s 549
 
2.9%
o 509
 
2.7%
503
 
2.6%
d 501
 
2.6%
n 459
 
2.4%
Other values (518) 11907
62.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8059
42.4%
Lowercase Letter 5195
27.3%
Space Separator 1884
 
9.9%
Other Punctuation 1818
 
9.6%
Uppercase Letter 767
 
4.0%
Decimal Number 750
 
3.9%
Close Punctuation 196
 
1.0%
Open Punctuation 194
 
1.0%
Dash Punctuation 143
 
0.8%
Math Symbol 6
 
< 0.1%
Other values (2) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
818
 
10.2%
422
 
5.2%
398
 
4.9%
291
 
3.6%
286
 
3.5%
223
 
2.8%
213
 
2.6%
208
 
2.6%
208
 
2.6%
192
 
2.4%
Other values (438) 4800
59.6%
Lowercase Letter
ValueCountFrequency (%)
a 635
12.2%
e 583
11.2%
s 549
10.6%
o 509
9.8%
d 501
9.6%
n 459
8.8%
y 368
7.1%
l 288
 
5.5%
r 256
 
4.9%
u 240
 
4.6%
Other values (13) 807
15.5%
Uppercase Letter
ValueCountFrequency (%)
C 190
24.8%
M 137
17.9%
N 87
11.3%
Y 81
10.6%
S 62
 
8.1%
L 48
 
6.3%
D 46
 
6.0%
O 21
 
2.7%
J 19
 
2.5%
H 18
 
2.3%
Other values (12) 58
 
7.6%
Other Punctuation
ValueCountFrequency (%)
? 669
36.8%
503
27.7%
, 305
16.8%
& 81
 
4.5%
77
 
4.2%
' 44
 
2.4%
. 30
 
1.7%
27
 
1.5%
/ 21
 
1.2%
: 16
 
0.9%
Other values (5) 45
 
2.5%
Decimal Number
ValueCountFrequency (%)
1 458
61.1%
5 82
 
10.9%
8 63
 
8.4%
3 51
 
6.8%
2 42
 
5.6%
0 27
 
3.6%
4 14
 
1.9%
6 7
 
0.9%
9 2
 
0.3%
2
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 182
92.9%
14
 
7.1%
Open Punctuation
ValueCountFrequency (%)
( 180
92.8%
14
 
7.2%
Space Separator
ValueCountFrequency (%)
1884
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 143
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%
Final Punctuation
ValueCountFrequency (%)
4
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5962
31.4%
Han 5440
28.6%
Common 4996
26.3%
Hangul 2216
 
11.7%
Hiragana 370
 
1.9%
Katakana 33
 
0.2%

Most frequent character per script

Han
ValueCountFrequency (%)
818
 
15.0%
398
 
7.3%
291
 
5.3%
286
 
5.3%
223
 
4.1%
213
 
3.9%
208
 
3.8%
192
 
3.5%
162
 
3.0%
124
 
2.3%
Other values (231) 2525
46.4%
Hangul
ValueCountFrequency (%)
422
19.0%
208
 
9.4%
190
 
8.6%
160
 
7.2%
64
 
2.9%
59
 
2.7%
59
 
2.7%
56
 
2.5%
55
 
2.5%
52
 
2.3%
Other values (154) 891
40.2%
Latin
ValueCountFrequency (%)
a 635
10.7%
e 583
 
9.8%
s 549
 
9.2%
o 509
 
8.5%
d 501
 
8.4%
n 459
 
7.7%
y 368
 
6.2%
l 288
 
4.8%
r 256
 
4.3%
u 240
 
4.0%
Other values (35) 1574
26.4%
Common
ValueCountFrequency (%)
1884
37.7%
? 669
 
13.4%
503
 
10.1%
1 458
 
9.2%
, 305
 
6.1%
) 182
 
3.6%
( 180
 
3.6%
- 143
 
2.9%
5 82
 
1.6%
& 81
 
1.6%
Other values (25) 509
 
10.2%
Hiragana
ValueCountFrequency (%)
68
18.4%
64
17.3%
63
17.0%
36
9.7%
31
8.4%
21
 
5.7%
15
 
4.1%
13
 
3.5%
8
 
2.2%
6
 
1.6%
Other values (19) 45
12.2%
Katakana
ValueCountFrequency (%)
4
12.1%
4
12.1%
3
9.1%
3
9.1%
3
9.1%
3
9.1%
3
9.1%
2
6.1%
2
6.1%
2
6.1%
Other values (4) 4
12.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10287
54.1%
CJK 5440
28.6%
Hangul 2216
 
11.7%
None 640
 
3.4%
Hiragana 370
 
1.9%
Katakana 33
 
0.2%
Punctuation 31
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1884
18.3%
? 669
 
6.5%
a 635
 
6.2%
e 583
 
5.7%
s 549
 
5.3%
o 509
 
4.9%
d 501
 
4.9%
n 459
 
4.5%
1 458
 
4.5%
y 368
 
3.6%
Other values (59) 3672
35.7%
CJK
ValueCountFrequency (%)
818
 
15.0%
398
 
7.3%
291
 
5.3%
286
 
5.3%
223
 
4.1%
213
 
3.9%
208
 
3.8%
192
 
3.5%
162
 
3.0%
124
 
2.3%
Other values (231) 2525
46.4%
None
ValueCountFrequency (%)
503
78.6%
77
 
12.0%
14
 
2.2%
14
 
2.2%
11
 
1.7%
11
 
1.7%
· 7
 
1.1%
2
 
0.3%
1
 
0.2%
Hangul
ValueCountFrequency (%)
422
19.0%
208
 
9.4%
190
 
8.6%
160
 
7.2%
64
 
2.9%
59
 
2.7%
59
 
2.7%
56
 
2.5%
55
 
2.5%
52
 
2.3%
Other values (154) 891
40.2%
Hiragana
ValueCountFrequency (%)
68
18.4%
64
17.3%
63
17.0%
36
9.7%
31
8.4%
21
 
5.7%
15
 
4.1%
13
 
3.5%
8
 
2.2%
6
 
1.6%
Other values (19) 45
12.2%
Punctuation
ValueCountFrequency (%)
27
87.1%
4
 
12.9%
Katakana
ValueCountFrequency (%)
4
12.1%
4
12.1%
3
9.1%
3
9.1%
3
9.1%
3
9.1%
3
9.1%
2
6.1%
2
6.1%
2
6.1%
Other values (4) 4
12.1%

교통정보
Text

MISSING 

Distinct1804
Distinct (%)96.3%
Missing79
Missing (%)4.0%
Memory size15.4 KiB
2024-05-11T14:31:20.080383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length402
Median length184
Mean length38.433298
Min length7

Characters and Unicode

Total characters72024
Distinct characters968
Distinct categories13 ?
Distinct scripts6 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1753 ?
Unique (%)93.5%

Sample

1st rowSubway Line 3, Anguk Station, Exit 3
2nd rowSubway Line 6, Changsin Station, Exit 1
3rd rowSubway Line 6, Changsin Station, Exit 1
4th rowSubway Line 3, Anguk Station, Exit 3
5th rowSubway Line 3, Anguk Station, Exit 3
ValueCountFrequency (%)
station 477
 
4.7%
line 443
 
4.4%
exit 429
 
4.2%
subway 361
 
3.6%
출구 323
 
3.2%
3 285
 
2.8%
257
 
2.5%
on 224
 
2.2%
2 209
 
2.1%
1 207
 
2.0%
Other values (2594) 6940
68.3%
2024-05-11T14:31:20.916064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9200
 
12.8%
? 4604
 
6.4%
n 2171
 
3.0%
1 2139
 
3.0%
t 1976
 
2.7%
i 1922
 
2.7%
o 1825
 
2.5%
3 1578
 
2.2%
1562
 
2.2%
1509
 
2.1%
Other values (958) 43538
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24584
34.1%
Lowercase Letter 17308
24.0%
Decimal Number 10038
13.9%
Space Separator 9200
 
12.8%
Other Punctuation 6502
 
9.0%
Uppercase Letter 2652
 
3.7%
Open Punctuation 726
 
1.0%
Close Punctuation 725
 
1.0%
Dash Punctuation 182
 
0.3%
Math Symbol 89
 
0.1%
Other values (3) 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1562
 
6.4%
1509
 
6.1%
1088
 
4.4%
968
 
3.9%
967
 
3.9%
637
 
2.6%
597
 
2.4%
533
 
2.2%
532
 
2.2%
502
 
2.0%
Other values (861) 15689
63.8%
Lowercase Letter
ValueCountFrequency (%)
n 2171
12.5%
t 1976
11.4%
i 1922
11.1%
o 1825
10.5%
a 1439
 
8.3%
e 1284
 
7.4%
m 929
 
5.4%
u 782
 
4.5%
g 607
 
3.5%
s 588
 
3.4%
Other values (16) 3785
21.9%
Uppercase Letter
ValueCountFrequency (%)
S 922
34.8%
L 470
17.7%
E 468
17.6%
G 118
 
4.4%
A 98
 
3.7%
H 92
 
3.5%
J 71
 
2.7%
C 65
 
2.5%
T 52
 
2.0%
U 39
 
1.5%
Other values (14) 257
 
9.7%
Other Punctuation
ValueCountFrequency (%)
? 4604
70.8%
, 895
 
13.8%
340
 
5.2%
/ 144
 
2.2%
* 141
 
2.2%
130
 
2.0%
& 89
 
1.4%
· 58
 
0.9%
. 58
 
0.9%
' 19
 
0.3%
Other values (4) 24
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 2139
21.3%
3 1578
15.7%
2 1476
14.7%
5 1252
12.5%
4 957
9.5%
0 940
9.4%
6 634
 
6.3%
7 553
 
5.5%
8 262
 
2.6%
9 243
 
2.4%
Other values (4) 4
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 370
51.0%
340
46.9%
11
 
1.5%
] 4
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 370
51.0%
341
47.0%
11
 
1.5%
[ 4
 
0.6%
Math Symbol
ValueCountFrequency (%)
64
71.9%
~ 16
 
18.0%
> 5
 
5.6%
< 4
 
4.5%
Initial Punctuation
ValueCountFrequency (%)
4
57.1%
3
42.9%
Final Punctuation
ValueCountFrequency (%)
4
57.1%
3
42.9%
Space Separator
ValueCountFrequency (%)
9200
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 182
100.0%
Control
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 27480
38.2%
Latin 19960
27.7%
Han 14513
20.2%
Hangul 6242
 
8.7%
Katakana 3024
 
4.2%
Hiragana 805
 
1.1%

Most frequent character per script

Han
ValueCountFrequency (%)
1562
 
10.8%
1509
 
10.4%
1088
 
7.5%
968
 
6.7%
967
 
6.7%
597
 
4.1%
473
 
3.3%
445
 
3.1%
389
 
2.7%
295
 
2.0%
Other values (491) 6220
42.9%
Hangul
ValueCountFrequency (%)
533
 
8.5%
532
 
8.5%
502
 
8.0%
484
 
7.8%
464
 
7.4%
369
 
5.9%
187
 
3.0%
156
 
2.5%
154
 
2.5%
149
 
2.4%
Other values (251) 2712
43.4%
Katakana
ValueCountFrequency (%)
637
21.1%
177
 
5.9%
172
 
5.7%
144
 
4.8%
121
 
4.0%
110
 
3.6%
90
 
3.0%
85
 
2.8%
80
 
2.6%
77
 
2.5%
Other values (63) 1331
44.0%
Latin
ValueCountFrequency (%)
n 2171
 
10.9%
t 1976
 
9.9%
i 1922
 
9.6%
o 1825
 
9.1%
a 1439
 
7.2%
e 1284
 
6.4%
m 929
 
4.7%
S 922
 
4.6%
u 782
 
3.9%
g 607
 
3.0%
Other values (40) 6103
30.6%
Common
ValueCountFrequency (%)
9200
33.5%
? 4604
16.8%
1 2139
 
7.8%
3 1578
 
5.7%
2 1476
 
5.4%
5 1252
 
4.6%
4 957
 
3.5%
0 940
 
3.4%
, 895
 
3.3%
6 634
 
2.3%
Other values (37) 3805
13.8%
Hiragana
ValueCountFrequency (%)
230
28.6%
227
28.2%
52
 
6.5%
50
 
6.2%
32
 
4.0%
29
 
3.6%
29
 
3.6%
19
 
2.4%
18
 
2.2%
16
 
2.0%
Other values (26) 103
12.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 46106
64.0%
CJK 14513
 
20.2%
Hangul 6242
 
8.7%
Katakana 3024
 
4.2%
None 1255
 
1.7%
Hiragana 805
 
1.1%
Arrows 64
 
0.1%
Punctuation 15
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9200
20.0%
? 4604
 
10.0%
n 2171
 
4.7%
1 2139
 
4.6%
t 1976
 
4.3%
i 1922
 
4.2%
o 1825
 
4.0%
3 1578
 
3.4%
2 1476
 
3.2%
a 1439
 
3.1%
Other values (68) 17776
38.6%
CJK
ValueCountFrequency (%)
1562
 
10.8%
1509
 
10.4%
1088
 
7.5%
968
 
6.7%
967
 
6.7%
597
 
4.1%
473
 
3.3%
445
 
3.1%
389
 
2.7%
295
 
2.0%
Other values (491) 6220
42.9%
Katakana
ValueCountFrequency (%)
637
21.1%
177
 
5.9%
172
 
5.7%
144
 
4.8%
121
 
4.0%
110
 
3.6%
90
 
3.0%
85
 
2.8%
80
 
2.6%
77
 
2.5%
Other values (63) 1331
44.0%
Hangul
ValueCountFrequency (%)
533
 
8.5%
532
 
8.5%
502
 
8.0%
484
 
7.8%
464
 
7.4%
369
 
5.9%
187
 
3.0%
156
 
2.5%
154
 
2.5%
149
 
2.4%
Other values (251) 2712
43.4%
None
ValueCountFrequency (%)
341
27.2%
340
27.1%
340
27.1%
130
 
10.4%
· 58
 
4.6%
18
 
1.4%
11
 
0.9%
11
 
0.9%
2
 
0.2%
1
 
0.1%
Other values (3) 3
 
0.2%
Hiragana
ValueCountFrequency (%)
230
28.6%
227
28.2%
52
 
6.5%
50
 
6.2%
32
 
4.0%
29
 
3.6%
29
 
3.6%
19
 
2.4%
18
 
2.2%
16
 
2.0%
Other values (26) 103
12.8%
Arrows
ValueCountFrequency (%)
64
100.0%
Punctuation
ValueCountFrequency (%)
4
26.7%
4
26.7%
3
20.0%
3
20.0%
1
 
6.7%

태그
Text

Distinct1951
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size15.4 KiB
2024-05-11T14:31:21.483252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length243
Median length174
Mean length56.275474
Min length3

Characters and Unicode

Total characters109906
Distinct characters1936
Distinct categories11 ?
Distinct scripts6 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1949 ?
Unique (%)99.8%

Sample

1st rowJongro,BaekyangLaundry
2nd rowobservatory, Changsin-dong, cafe,Quarry observatory
3rd rowChangsin-dong Cliff Village, Changsin-dong, modern history of Korea,places to visit in Seoul
4th rowChoongAngHighSchool, WinterSonata, History, Jongno
5th rowChoongAngStore, Jongno,Hallyu
ValueCountFrequency (%)
203
 
2.9%
老店 132
 
1.9%
oraegage 58
 
0.8%
オレガゲ(老 51
 
0.7%
오래가게 47
 
0.7%
information 38
 
0.5%
전시 31
 
0.4%
박물관 26
 
0.4%
광화문 22
 
0.3%
역사 22
 
0.3%
Other values (4689) 6395
91.0%
2024-05-11T14:31:22.321126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 13889
 
12.6%
? 5977
 
5.4%
5265
 
4.8%
e 3693
 
3.4%
o 3250
 
3.0%
a 3169
 
2.9%
n 3089
 
2.8%
i 2573
 
2.3%
t 2412
 
2.2%
r 2267
 
2.1%
Other values (1926) 64322
58.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 44102
40.1%
Lowercase Letter 33066
30.1%
Other Punctuation 20163
18.3%
Uppercase Letter 6481
 
5.9%
Space Separator 5265
 
4.8%
Decimal Number 368
 
0.3%
Open Punctuation 168
 
0.2%
Close Punctuation 168
 
0.2%
Dash Punctuation 121
 
0.1%
Final Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
683
 
1.5%
594
 
1.3%
536
 
1.2%
534
 
1.2%
486
 
1.1%
471
 
1.1%
453
 
1.0%
421
 
1.0%
416
 
0.9%
411
 
0.9%
Other values (1844) 39097
88.7%
Lowercase Letter
ValueCountFrequency (%)
e 3693
11.2%
o 3250
9.8%
a 3169
9.6%
n 3089
9.3%
i 2573
 
7.8%
t 2412
 
7.3%
r 2267
 
6.9%
u 2093
 
6.3%
l 1623
 
4.9%
g 1544
 
4.7%
Other values (16) 7353
22.2%
Uppercase Letter
ValueCountFrequency (%)
S 1095
16.9%
C 558
 
8.6%
M 464
 
7.2%
H 458
 
7.1%
G 442
 
6.8%
A 391
 
6.0%
T 373
 
5.8%
P 301
 
4.6%
O 270
 
4.2%
D 265
 
4.1%
Other values (16) 1864
28.8%
Other Punctuation
ValueCountFrequency (%)
, 13889
68.9%
? 5977
29.6%
· 95
 
0.5%
81
 
0.4%
# 39
 
0.2%
28
 
0.1%
. 27
 
0.1%
' 13
 
0.1%
& 10
 
< 0.1%
/ 3
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
3 80
21.7%
1 60
16.3%
0 44
12.0%
6 36
9.8%
4 33
9.0%
2 31
 
8.4%
8 28
 
7.6%
7 23
 
6.2%
9 19
 
5.2%
5 14
 
3.8%
Open Punctuation
ValueCountFrequency (%)
( 152
90.5%
16
 
9.5%
Close Punctuation
ValueCountFrequency (%)
) 152
90.5%
16
 
9.5%
Final Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
5265
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 121
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 39547
36.0%
Common 26257
23.9%
Han 24928
22.7%
Hangul 12827
 
11.7%
Katakana 5876
 
5.3%
Hiragana 471
 
0.4%

Most frequent character per script

Han
ValueCountFrequency (%)
683
 
2.7%
594
 
2.4%
536
 
2.2%
471
 
1.9%
453
 
1.8%
421
 
1.7%
411
 
1.6%
405
 
1.6%
377
 
1.5%
353
 
1.4%
Other values (1150) 20224
81.1%
Hangul
ValueCountFrequency (%)
416
 
3.2%
391
 
3.0%
333
 
2.6%
317
 
2.5%
313
 
2.4%
251
 
2.0%
247
 
1.9%
220
 
1.7%
220
 
1.7%
218
 
1.7%
Other values (554) 9901
77.2%
Katakana
ValueCountFrequency (%)
534
 
9.1%
486
 
8.3%
406
 
6.9%
401
 
6.8%
200
 
3.4%
191
 
3.3%
162
 
2.8%
159
 
2.7%
150
 
2.6%
148
 
2.5%
Other values (69) 3039
51.7%
Latin
ValueCountFrequency (%)
e 3693
 
9.3%
o 3250
 
8.2%
a 3169
 
8.0%
n 3089
 
7.8%
i 2573
 
6.5%
t 2412
 
6.1%
r 2267
 
5.7%
u 2093
 
5.3%
l 1623
 
4.1%
g 1544
 
3.9%
Other values (42) 13834
35.0%
Hiragana
ValueCountFrequency (%)
104
22.1%
46
 
9.8%
30
 
6.4%
29
 
6.2%
26
 
5.5%
19
 
4.0%
19
 
4.0%
18
 
3.8%
13
 
2.8%
12
 
2.5%
Other values (41) 155
32.9%
Common
ValueCountFrequency (%)
, 13889
52.9%
? 5977
22.8%
5265
 
20.1%
( 152
 
0.6%
) 152
 
0.6%
- 121
 
0.5%
· 95
 
0.4%
81
 
0.3%
3 80
 
0.3%
1 60
 
0.2%
Other values (20) 385
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 65564
59.7%
CJK 24924
 
22.7%
Hangul 12827
 
11.7%
Katakana 5876
 
5.3%
Hiragana 471
 
0.4%
None 236
 
0.2%
CJK Compat Ideographs 4
 
< 0.1%
Punctuation 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 13889
21.2%
? 5977
 
9.1%
5265
 
8.0%
e 3693
 
5.6%
o 3250
 
5.0%
a 3169
 
4.8%
n 3089
 
4.7%
i 2573
 
3.9%
t 2412
 
3.7%
r 2267
 
3.5%
Other values (64) 19980
30.5%
CJK
ValueCountFrequency (%)
683
 
2.7%
594
 
2.4%
536
 
2.2%
471
 
1.9%
453
 
1.8%
421
 
1.7%
411
 
1.6%
405
 
1.6%
377
 
1.5%
353
 
1.4%
Other values (1147) 20220
81.1%
Katakana
ValueCountFrequency (%)
534
 
9.1%
486
 
8.3%
406
 
6.9%
401
 
6.8%
200
 
3.4%
191
 
3.3%
162
 
2.8%
159
 
2.7%
150
 
2.6%
148
 
2.5%
Other values (69) 3039
51.7%
Hangul
ValueCountFrequency (%)
416
 
3.2%
391
 
3.0%
333
 
2.6%
317
 
2.5%
313
 
2.4%
251
 
2.0%
247
 
1.9%
220
 
1.7%
220
 
1.7%
218
 
1.7%
Other values (554) 9901
77.2%
Hiragana
ValueCountFrequency (%)
104
22.1%
46
 
9.8%
30
 
6.4%
29
 
6.2%
26
 
5.5%
19
 
4.0%
19
 
4.0%
18
 
3.8%
13
 
2.8%
12
 
2.5%
Other values (41) 155
32.9%
None
ValueCountFrequency (%)
· 95
40.3%
81
34.3%
28
 
11.9%
16
 
6.8%
16
 
6.8%
CJK Compat Ideographs
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Punctuation
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

장애인편의시설
Text

MISSING 

Distinct157
Distinct (%)71.4%
Missing1733
Missing (%)88.7%
Memory size15.4 KiB
2024-05-11T14:31:22.652710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length137
Median length61
Mean length44.681818
Min length4

Characters and Unicode

Total characters9830
Distinct characters125
Distinct categories7 ?
Distinct scripts6 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)50.5%

Sample

1st rowAccessible Pathways,Accessible Restrooms,Accessible Information Centers & Services(Wheelchair rentals, etc.),Elevators
2nd rowAccessible Restrooms,Accessible Pathways,Elevators,Accessible Information Centers & Services(Wheelchair rentals, etc.)
3rd rowAccessible Restrooms,Accessible Pathways
4th rowバリアフリ?トイレ,アクセシビリティ,?の不自由な方のための案?所(車椅子レンタルなど),エレベ?タ?
5th rowAccessible Restrooms,Accessible Information Centers & Services(Wheelchair rentals, etc.)
ValueCountFrequency (%)
accessible 41
 
6.0%
대여 33
 
4.9%
33
 
4.9%
전용 33
 
4.9%
안내(휠체어 33
 
4.9%
restrooms,accessible 31
 
4.6%
28
 
4.1%
information 28
 
4.1%
rentals 28
 
4.1%
services(wheelchair 28
 
4.1%
Other values (141) 363
53.5%
2024-05-11T14:31:23.408902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 582
 
5.9%
e 524
 
5.3%
? 491
 
5.0%
459
 
4.7%
s 451
 
4.6%
c 326
 
3.3%
i 231
 
2.3%
r 224
 
2.3%
t 200
 
2.0%
l 198
 
2.0%
Other values (115) 6144
62.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4496
45.7%
Lowercase Letter 3095
31.5%
Other Punctuation 1129
 
11.5%
Space Separator 459
 
4.7%
Uppercase Letter 347
 
3.5%
Close Punctuation 152
 
1.5%
Open Punctuation 152
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
189
 
4.2%
150
 
3.3%
139
 
3.1%
131
 
2.9%
131
 
2.9%
111
 
2.5%
111
 
2.5%
104
 
2.3%
103
 
2.3%
103
 
2.3%
Other values (81) 3224
71.7%
Lowercase Letter
ValueCountFrequency (%)
e 524
16.9%
s 451
14.6%
c 326
10.5%
i 231
7.5%
r 224
7.2%
t 200
 
6.5%
l 198
 
6.4%
a 191
 
6.2%
o 151
 
4.9%
n 138
 
4.5%
Other values (9) 461
14.9%
Uppercase Letter
ValueCountFrequency (%)
A 121
34.9%
P 56
16.1%
R 37
 
10.7%
I 28
 
8.1%
C 28
 
8.1%
S 28
 
8.1%
W 28
 
8.1%
E 21
 
6.1%
Other Punctuation
ValueCountFrequency (%)
, 582
51.6%
? 491
43.5%
& 28
 
2.5%
. 28
 
2.5%
Space Separator
ValueCountFrequency (%)
459
100.0%
Close Punctuation
ValueCountFrequency (%)
) 152
100.0%
Open Punctuation
ValueCountFrequency (%)
( 152
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3442
35.0%
Han 2038
20.7%
Common 1892
19.2%
Hangul 1370
 
13.9%
Katakana 732
 
7.4%
Hiragana 356
 
3.6%

Most frequent character per script

Han
ValueCountFrequency (%)
139
 
6.8%
131
 
6.4%
131
 
6.4%
103
 
5.1%
91
 
4.5%
77
 
3.8%
72
 
3.5%
72
 
3.5%
68
 
3.3%
63
 
3.1%
Other values (29) 1091
53.5%
Hangul
ValueCountFrequency (%)
189
 
13.8%
111
 
8.1%
111
 
8.1%
103
 
7.5%
45
 
3.3%
45
 
3.3%
41
 
3.0%
41
 
3.0%
41
 
3.0%
41
 
3.0%
Other values (19) 602
43.9%
Latin
ValueCountFrequency (%)
e 524
15.2%
s 451
13.1%
c 326
9.5%
i 231
 
6.7%
r 224
 
6.5%
t 200
 
5.8%
l 198
 
5.8%
a 191
 
5.5%
o 151
 
4.4%
n 138
 
4.0%
Other values (17) 808
23.5%
Katakana
ValueCountFrequency (%)
104
14.2%
83
 
11.3%
68
 
9.3%
47
 
6.4%
36
 
4.9%
36
 
4.9%
36
 
4.9%
36
 
4.9%
32
 
4.4%
32
 
4.4%
Other values (8) 222
30.3%
Common
ValueCountFrequency (%)
, 582
30.8%
? 491
26.0%
459
24.3%
) 152
 
8.0%
( 152
 
8.0%
& 28
 
1.5%
. 28
 
1.5%
Hiragana
ValueCountFrequency (%)
150
42.1%
78
21.9%
50
 
14.0%
50
 
14.0%
28
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5334
54.3%
CJK 2038
 
20.7%
Hangul 1370
 
13.9%
Katakana 732
 
7.4%
Hiragana 356
 
3.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 582
 
10.9%
e 524
 
9.8%
? 491
 
9.2%
459
 
8.6%
s 451
 
8.5%
c 326
 
6.1%
i 231
 
4.3%
r 224
 
4.2%
t 200
 
3.7%
l 198
 
3.7%
Other values (24) 1648
30.9%
Hangul
ValueCountFrequency (%)
189
 
13.8%
111
 
8.1%
111
 
8.1%
103
 
7.5%
45
 
3.3%
45
 
3.3%
41
 
3.0%
41
 
3.0%
41
 
3.0%
41
 
3.0%
Other values (19) 602
43.9%
Hiragana
ValueCountFrequency (%)
150
42.1%
78
21.9%
50
 
14.0%
50
 
14.0%
28
 
7.9%
CJK
ValueCountFrequency (%)
139
 
6.8%
131
 
6.4%
131
 
6.4%
103
 
5.1%
91
 
4.5%
77
 
3.8%
72
 
3.5%
72
 
3.5%
68
 
3.3%
63
 
3.1%
Other values (29) 1091
53.5%
Katakana
ValueCountFrequency (%)
104
14.2%
83
 
11.3%
68
 
9.3%
47
 
6.4%
36
 
4.9%
36
 
4.9%
36
 
4.9%
36
 
4.9%
32
 
4.4%
32
 
4.4%
Other values (8) 222
30.3%

Interactions

2024-05-11T14:31:04.496802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T14:31:23.593333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
고유번호언어팩스번호
고유번호1.0000.1240.993
언어0.1241.0000.000
팩스번호0.9930.0001.000
2024-05-11T14:31:23.730763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
팩스번호언어
팩스번호1.0000.000
언어0.0001.000
2024-05-11T14:31:23.878361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
고유번호언어팩스번호
고유번호1.0000.0520.700
언어0.0521.0000.000
팩스번호0.7000.0001.000

Missing values

2024-05-11T14:31:04.762618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T14:31:05.034295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-05-11T14:31:05.299686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

고유번호언어상호명콘텐츠URL주소신주소전화번호팩스번호웹사이트운영시간운영요일휴무일교통정보태그장애인편의시설
045520enBaekyang Laundryhttps://english.visitseoul.net/attractions/Baekyang-2024/ENP8onuvv?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENP8onuvv140-24, Gye-dong, Jongno-gu, Seoul, Korea03057 54 Gyedong-gil, Jongno-gu, Seoul+82-2-762-1261<NA><NA><NA><NA><NA>Subway Line 3, Anguk Station, Exit 3Jongro,BaekyangLaundry<NA>
145492enChangsin-Sungin Quarry Observatoryhttps://english.visitseoul.net/attractions/2024-Chaeseokjangjeonmangdae/ENPauov7d?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENPauov7d서울 종로구 창신동 23-32203091 51 Naksan 5-gil, Jongno-gu, Seoul+82-507-1330-5416<NA><NA>Tue-Fri 10:00 - 20:00 / Sat-Sun 10:00 - 22:00Tue-SunMondaysSubway Line 6, Changsin Station, Exit 1observatory, Changsin-dong, cafe,Quarry observatory<NA>
245482enChangsin-dong's Cliff Villagehttps://english.visitseoul.net/attractions/2024-changsincliff/ENPgvo4y2?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENPgvo4y2서울 종로구 창신동 23-32203091 23-322 Changsin-dong, Dongdaemun-gu, Seoul<NA><NA><NA><NA><NA><NA>Subway Line 6, Changsin Station, Exit 1Changsin-dong Cliff Village, Changsin-dong, modern history of Korea,places to visit in Seoul<NA>
345530enChoong Ang High Schoolhttps://english.visitseoul.net/attractions/ChoongAngHighSchool/ENPgcblme?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENPgcblme1, Gye-dong, Jongno-gu, Seoul, Korea03051 164 Changdeokgung-gil, Jongno-gu, Seoul+82-2-742-1321<NA><NA><NA><NA><NA>Subway Line 3, Anguk Station, Exit 3ChoongAngHighSchool, WinterSonata, History, Jongno<NA>
445535enChoong Ang Storehttps://english.visitseoul.net/attractions/ChoongAngStore/ENPl7gype?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENPl7gype2-105, Gye-dong, Jongno-gu, Seoul, Korea03051 162 Changdeokgung-gil, Jongno-gu, Seoul<NA><NA><NA><NA><NA><NA>Subway Line 3, Anguk Station, Exit 3ChoongAngStore, Jongno,Hallyu<NA>
545563enMyeongdong Jaemi-rohttps://english.visitseoul.net/attractions/2024-jaemiro/ENPhvgb2b?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENPhvgb2b서울 중구 남산동2가 30-404631 24, Toegye-ro 20-gil, Jung-gu, Seoul, Republic of Korea<NA><NA><NA><NA><NA><NA>Subway Line 4, Myeongdong Station, Exit 3Jaemiro,KoreanComics,Animation,Myeongdong,MyeongdongStation, Manwha<NA>
616406enNight Views at Banpodaegyo Bridgehttps://english.visitseoul.net/attractions/Night-Views-at-Banpodaegyo-Bridge/ENP016325?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENP016325649, Banpo-dong, Seocho-gu, Seoul (North end of Banpodaegyo Bridge Bridge)2085-14, Olympic-daero, Seocho-gu, Seoul+82-2-3780-0578<NA><NA><NA><NA><NA><NA>SeoulNightViewSpots,BanpoNightView,HangangDate,SeoulTravel,BanpoBridgeMoonlightRainbowFountain,SeoulAtNight,NightView,SeoulScenery,Park,DateNight,RainbowFountain,BanpoHangangPark,HangangParkBanpodaegyoBridge,BanpodaegyoBridge,Seoul,Walking<NA>
745594enSite of Concubine Jang Huibin’s Wellhttps://english.visitseoul.net/attractions/2024-jangheebin/ENPcyd45f?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENPcyd45f서울 서대문구 연희동 120-2103702 74 Yeonhui-ro 15-gil, Seodaemun-gu, Seoul<NA><NA><NA><NA><NA><NA>A 30-minute walk from Subway Line 2, Hongik University Station, Exit 3JoseonDynasty, Palace,Yeonhui-dong, KoreanHistory<NA>
815338ko1898 명동성당https://korean.visitseoul.net/attractions/1898-명동성당/KOP015338?utm_source=seoulopendata&utm_medium=attractions&utm_content=KOP015338100-809 서울 중구 명동2가 1-104537 서울 중구 명동길 74 (명동2가, 명동성당)02-774-1784<NA><NA>성당사무실 화 ~ 금 | 09:00 ~ 20:30 토 요 일 | 09:00 ~ 20:00 일 요 일 | 09:00 ~ 21:00<NA>설날, 추석 당일 (성당사무실: 월요일 휴무)2호선 을지로입구역 5번 출구 3호선 을지로3가역 12번 출구 4호선 명동역 9번 출구명동대성당, 고딕,명동, 1898광장,복합문화공간,명동나들이, 성당<NA>
923961enSeMA Bunkerhttps://english.visitseoul.net/attractions/SeMA-Bunker/ENP023532?utm_source=seoulopendata&utm_medium=attractions&utm_content=ENP023532150-010 B-101, Uisadangdaero, Yeongdeungpo-gu, Seoul07327 101, Uisadang-daero, Yeongdeungpo-gu, Seoul+82-2-2124-8941<NA>https://sema.seoul.go.kr/en/indexTuesday - Friday 10:00 - 20:00 KST Sat, Holiday 10:00 - 19:00 KST<NA>MondaysSubway Lines 5 & 9, Yeouido Station, Exit 3SeMABunker,Seoul20,YeouidoStation,SeMA,SeoulMuseumOfArt,Gallery,Yeouido,Culture,UndergroundBunker,History,Art<NA>
고유번호언어상호명콘텐츠URL주소신주소전화번호팩스번호웹사이트운영시간운영요일휴무일교통정보태그장애인편의시설
19433266ko창의문https://korean.visitseoul.net/attractions/창의문/KOP003266?utm_source=seoulopendata&utm_medium=attractions&utm_content=KOP003266110-030 서울시 종로구 부암동 277-1103020 서울특별시 종로구 창의문로 118 (부암동)02-730-9924<NA><NA><NA><NA><NA>3호선 경복궁 3번 출구 도보 30분자하문, 한양도성, 성곽길, 창의문, 광화문, 성곽, 북악산, 북문, 경복궁역,유적지<NA>
194428053ko코끼리분식https://korean.visitseoul.net/attractions/2023029/KOP028053?utm_source=seoulopendata&utm_medium=attractions&utm_content=KOP028053서울 마포구 도화동 345-404172 서울 마포구 도화2길 3 (도화동)02-717-9061<NA>https://www.instagram.com/koggiri_mapo_line5/09:30~21:00(라스트오더 20:30)매일매달 1, 3번째 월요일 정기휴무5호선 마포역 3번 출구에서 270m오래가게, 분식, 마포떡볶이골목,즉석떡볶이, 공덕, 볶음밥<NA>
19455242ko헌인릉https://korean.visitseoul.net/attractions/헌인릉/KOP005240?utm_source=seoulopendata&utm_medium=attractions&utm_content=KOP005240137-180 서울 서초구 내곡동 산 13-106795 서울 서초구 헌인릉길 36-10 (헌인릉)02-445-0347<NA>http://royaltombs.cha.go.kr/html/HtmlPage.do?pg=/new/html/portal_01_09_01.jsp&mn=RT_01_092월 ~ 5월, 9월 ~ 10월 09:00 ~ 18:00 6월 ~ 8월 09:00 ~ 18:30 11월 ~ 1월 09:00 ~ 17:30 (매표는 관람종료 1시간 전까지만 가능)화~일월요일3호선 양재역 7번 출구 버스 환승(407, 408, 440, 462, 471) 2호선 강남역 3번 출구 버스 환승(407, 408, 440, 462, 471) ※ 하차지점 : 헌인릉 버스정류장관광코스, 세계문화유산,강남, 헌릉,서울명소, 문화재청,헌인릉, 조선왕릉, 인릉, 유네스코, 서울문화재<NA>
194643442ko효성한의원https://korean.visitseoul.net/attractions/2023021/KOPvsi1e5?utm_source=seoulopendata&utm_medium=attractions&utm_content=KOPvsi1e5서울 동대문구 제기동 1140-5502569 서울 동대문구 약령중앙로 5 (제기동, 대산빌딩)02-961-5544<NA>blog.naver.com/cjswldls6010:00~18:00월~목, 토, 일금요일1호선 제기동역 2번 출구에서 113m오래가게, 제기동, 한방치료, 한의학, 서울약령시, 한약재<NA>
194719657ko효자베이커리https://korean.visitseoul.net/attractions/효자베이커리/KOP019657?utm_source=seoulopendata&utm_medium=attractions&utm_content=KOP01965703036 서울 종로구 필운대로 5402-736-7629<NA><NA>화~일 8:00 ~ 20:20 *빵 소진시 일찍 닫습니다화~일월요일3호선 경복궁역 2번 출구에서 697m발효빵,오래가게, 빵집, 빵, 도넛, 베이글,고로케<NA>
194828330ko훼드라https://korean.visitseoul.net/attractions/훼드라/KOP028330?utm_source=seoulopendata&utm_medium=attractions&utm_content=KOP02833003789 서울 서대문구 연세로5길 3202-323-3201<NA><NA>12:00~02:00매일없음2호선 신촌역 1번 출구에서 172m오래가게, 연세대, 신촌, 라면, 해장, 분식, 매운라면<NA>
194943491ko휘가로https://korean.visitseoul.net/attractions/2023014/KOPuel8q4?utm_source=seoulopendata&utm_medium=attractions&utm_content=KOPuel8q4서울 관악구 신림동 241-2408814 서울 관악구 신림로11길 20 (신림동)02-889-1722<NA><NA>16:00~손님들이 원하는 시간까지매일없음신림선 서울대벤처타운역 2번 출구에서 614m호프, 관악구, 서울대, 맥주,오래가게, 녹두거리<NA>
195011115ko흑석동 효사길https://korean.visitseoul.net/attractions/흑석동-효사길/KOP011112?utm_source=seoulopendata&utm_medium=attractions&utm_content=KOP011112156-860 서울 동작구 흑석동 173-191 173-19106910 서울 동작구 흑석동 효사 4, 5길<NA><NA><NA><NA><NA><NA>9호선 흑석역 4번출구서울골목투어,효사길전망대,흑석동효사길,골목산책,보고싶다촬영지,서울아름다운야경명소,서울여행,서울낭만명소,서울시티투어,서울가볼만한곳,서울야경명소,서울골목길<NA>
19511999ko흥인지문(동대문)https://korean.visitseoul.net/attractions/흥인지문동대문/KOP001999?utm_source=seoulopendata&utm_medium=attractions&utm_content=KOP001999110-126 서울 종로구 종로6가 6903119 서울 종로구 종로 288 (종로6가, 흥인지문)02-731-0412<NA><NA>09:00 ~ 18:00화~일4호선 동대문역 6번 출구 1호선 동대문역 6번 출구 2호선 동대문역사문화공원역 1번 출구서울동대문볼거리,서울문화재,성곽, 동대문, 역사,서울명소,흥인지문,서울데이트, 문화재,서울사대문,동대문야시장,서울여행,동대문명소<NA>
195243455ko힐스트링https://korean.visitseoul.net/attractions/2023044/KOPht2ct9?utm_source=seoulopendata&utm_medium=attractions&utm_content=KOPht2ct906711 서울 서초구 반포대로1길 8 성호빌딩 1층0507-1407-8195<NA>hillstring.modoo.at09:30~18:00월~금주말, 공휴일3호선 남부터미널역 5번 출구에서 757m오래가게, 현악기제작, 악기수리, 서초, 예술의전당, 바이올린, 명장<NA>