Overview

Dataset statistics

Number of variables18
Number of observations10000
Missing cells149156
Missing cells (%)82.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 MiB
Average record size in memory152.0 B

Variable types

Text18

Dataset

Description충남도서관 소장 도서에 대한 정보로, 개별 도서에 대한 메타데이터(서명, 저자명, 발행처, 청구기호, ISBN 등의 정보)가 포함되어 있습니다.
Author충청남도
URLhttps://www.data.go.kr/data/15119625/fileData.do

Alerts

Unnamed: 17 has constant value ""Constant
Unnamed: 3 has 9432 (94.3%) missing valuesMissing
Unnamed: 4 has 9848 (98.5%) missing valuesMissing
Unnamed: 5 has 9952 (99.5%) missing valuesMissing
Unnamed: 6 has 9978 (99.8%) missing valuesMissing
Unnamed: 7 has 9992 (99.9%) missing valuesMissing
Unnamed: 8 has 9992 (99.9%) missing valuesMissing
Unnamed: 9 has 9993 (99.9%) missing valuesMissing
Unnamed: 10 has 9993 (99.9%) missing valuesMissing
Unnamed: 11 has 9994 (99.9%) missing valuesMissing
Unnamed: 12 has 9995 (> 99.9%) missing valuesMissing
Unnamed: 13 has 9995 (> 99.9%) missing valuesMissing
Unnamed: 14 has 9997 (> 99.9%) missing valuesMissing
Unnamed: 15 has 9998 (> 99.9%) missing valuesMissing
Unnamed: 16 has 9998 (> 99.9%) missing valuesMissing
Unnamed: 17 has 9999 (> 99.9%) missing valuesMissing
등록번호 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:56:44.344465
Analysis finished2023-12-11 23:56:47.635674
Duration3.29 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T08:56:47.809304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length9
Min length9

Characters and Unicode

Total characters90000
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowKM0014734
2nd rowKM0037125
3rd rowKM0014384
4th rowKM0070897
5th rowKM0050375
ValueCountFrequency (%)
km0014734 1
 
< 0.1%
km0072056 1
 
< 0.1%
km0067012 1
 
< 0.1%
km0024686 1
 
< 0.1%
km0072750 1
 
< 0.1%
km0044749 1
 
< 0.1%
km0060830 1
 
< 0.1%
km0017991 1
 
< 0.1%
km0075184 1
 
< 0.1%
km0070269 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-12T08:56:48.145976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 25155
28.0%
K 10000
 
11.1%
M 10000
 
11.1%
5 5465
 
6.1%
6 5437
 
6.0%
3 5333
 
5.9%
2 5302
 
5.9%
1 5291
 
5.9%
4 5268
 
5.9%
7 5104
 
5.7%
Other values (2) 7645
 
8.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 70000
77.8%
Uppercase Letter 20000
 
22.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 25155
35.9%
5 5465
 
7.8%
6 5437
 
7.8%
3 5333
 
7.6%
2 5302
 
7.6%
1 5291
 
7.6%
4 5268
 
7.5%
7 5104
 
7.3%
8 3833
 
5.5%
9 3812
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
K 10000
50.0%
M 10000
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 70000
77.8%
Latin 20000
 
22.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 25155
35.9%
5 5465
 
7.8%
6 5437
 
7.8%
3 5333
 
7.6%
2 5302
 
7.6%
1 5291
 
7.6%
4 5268
 
7.5%
7 5104
 
7.3%
8 3833
 
5.5%
9 3812
 
5.4%
Latin
ValueCountFrequency (%)
K 10000
50.0%
M 10000
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 25155
28.0%
K 10000
 
11.1%
M 10000
 
11.1%
5 5465
 
6.1%
6 5437
 
6.0%
3 5333
 
5.9%
2 5302
 
5.9%
1 5291
 
5.9%
4 5268
 
5.9%
7 5104
 
5.7%
Other values (2) 7645
 
8.5%

서명
Text

Distinct9332
Distinct (%)93.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T08:56:48.421986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length235
Median length152
Mean length21.1572
Min length1

Characters and Unicode

Total characters211572
Distinct characters2206
Distinct categories16 ?
Distinct scripts5 ?
Distinct blocks14 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8752 ?
Unique (%)87.5%

Sample

1st row고구려왕조 700년사
2nd row知的財産權의 刑事的 理解
3rd row한국행정의 과제와 개혁
4th row모두의 파이썬 : 20일 만에 배우는 프로그래밍 기초 : 개정 2판
5th row(만들면서 배우는)Android game programming : 기초부터 배우는 게임 프로그래밍의 원리
ValueCountFrequency (%)
5193
 
10.7%
2 300
 
0.6%
1 299
 
0.6%
장편소설 288
 
0.6%
of 285
 
0.6%
the 206
 
0.4%
연구 192
 
0.4%
위한 185
 
0.4%
171
 
0.4%
3 151
 
0.3%
Other values (21318) 41424
85.1%
2023-12-12T08:56:48.839011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39169
 
18.5%
: 4257
 
2.0%
3889
 
1.8%
e 3099
 
1.5%
2502
 
1.2%
2490
 
1.2%
i 2463
 
1.2%
n 2452
 
1.2%
o 2449
 
1.2%
a 2276
 
1.1%
Other values (2196) 146526
69.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 120109
56.8%
Space Separator 39169
 
18.5%
Lowercase Letter 27067
 
12.8%
Decimal Number 7474
 
3.5%
Other Punctuation 7091
 
3.4%
Uppercase Letter 4967
 
2.3%
Open Punctuation 2054
 
1.0%
Close Punctuation 2041
 
1.0%
Math Symbol 974
 
0.5%
Dash Punctuation 514
 
0.2%
Other values (6) 112
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3889
 
3.2%
2502
 
2.1%
2490
 
2.1%
2086
 
1.7%
1807
 
1.5%
1500
 
1.2%
1483
 
1.2%
1373
 
1.1%
1371
 
1.1%
1302
 
1.1%
Other values (2074) 100306
83.5%
Lowercase Letter
ValueCountFrequency (%)
e 3099
11.4%
i 2463
 
9.1%
n 2452
 
9.1%
o 2449
 
9.0%
a 2276
 
8.4%
t 2181
 
8.1%
r 1851
 
6.8%
s 1502
 
5.5%
l 1190
 
4.4%
c 1058
 
3.9%
Other values (16) 6546
24.2%
Uppercase Letter
ValueCountFrequency (%)
S 456
 
9.2%
C 416
 
8.4%
T 391
 
7.9%
A 357
 
7.2%
I 339
 
6.8%
E 321
 
6.5%
P 270
 
5.4%
D 251
 
5.1%
O 244
 
4.9%
N 231
 
4.7%
Other values (16) 1691
34.0%
Other Punctuation
ValueCountFrequency (%)
: 4257
60.0%
. 1894
26.7%
· 397
 
5.6%
' 143
 
2.0%
/ 94
 
1.3%
! 92
 
1.3%
? 80
 
1.1%
& 64
 
0.9%
25
 
0.4%
" 16
 
0.2%
Other values (11) 29
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 1739
23.3%
1 1646
22.0%
0 1573
21.0%
3 556
 
7.4%
9 525
 
7.0%
4 386
 
5.2%
5 300
 
4.0%
6 268
 
3.6%
7 256
 
3.4%
8 225
 
3.0%
Math Symbol
ValueCountFrequency (%)
= 821
84.3%
+ 75
 
7.7%
~ 64
 
6.6%
< 4
 
0.4%
> 4
 
0.4%
3
 
0.3%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1953
95.1%
[ 79
 
3.8%
15
 
0.7%
4
 
0.2%
1
 
< 0.1%
1
 
< 0.1%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 1940
95.1%
] 79
 
3.9%
15
 
0.7%
4
 
0.2%
1
 
< 0.1%
1
 
< 0.1%
1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
42
45.2%
39
41.9%
7
 
7.5%
4
 
4.3%
1
 
1.1%
Other Symbol
ValueCountFrequency (%)
5
62.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
Other Number
ValueCountFrequency (%)
3
60.0%
2
40.0%
Space Separator
ValueCountFrequency (%)
39169
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 514
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Control
ValueCountFrequency (%)
 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 112952
53.4%
Common 59336
28.0%
Latin 32127
 
15.2%
Han 7142
 
3.4%
Katakana 15
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3889
 
3.4%
2502
 
2.2%
2490
 
2.2%
2086
 
1.8%
1807
 
1.6%
1500
 
1.3%
1483
 
1.3%
1373
 
1.2%
1371
 
1.2%
1302
 
1.2%
Other values (1123) 93149
82.5%
Han
ValueCountFrequency (%)
270
 
3.8%
178
 
2.5%
174
 
2.4%
135
 
1.9%
112
 
1.6%
108
 
1.5%
94
 
1.3%
86
 
1.2%
85
 
1.2%
84
 
1.2%
Other values (927) 5816
81.4%
Common
ValueCountFrequency (%)
39169
66.0%
: 4257
 
7.2%
( 1953
 
3.3%
) 1940
 
3.3%
. 1894
 
3.2%
2 1739
 
2.9%
1 1646
 
2.8%
0 1573
 
2.7%
= 821
 
1.4%
3 556
 
0.9%
Other values (55) 3788
 
6.4%
Latin
ValueCountFrequency (%)
e 3099
 
9.6%
i 2463
 
7.7%
n 2452
 
7.6%
o 2449
 
7.6%
a 2276
 
7.1%
t 2181
 
6.8%
r 1851
 
5.8%
s 1502
 
4.7%
l 1190
 
3.7%
c 1058
 
3.3%
Other values (47) 11606
36.1%
Katakana
ValueCountFrequency (%)
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (4) 4
26.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 112849
53.3%
ASCII 90880
43.0%
CJK 6955
 
3.3%
None 471
 
0.2%
CJK Compat Ideographs 187
 
0.1%
Compat Jamo 103
 
< 0.1%
Number Forms 93
 
< 0.1%
Katakana 15
 
< 0.1%
Enclosed Alphanum 6
 
< 0.1%
Misc Symbols 5
 
< 0.1%
Other values (4) 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39169
43.1%
: 4257
 
4.7%
e 3099
 
3.4%
i 2463
 
2.7%
n 2452
 
2.7%
o 2449
 
2.7%
a 2276
 
2.5%
t 2181
 
2.4%
( 1953
 
2.1%
) 1940
 
2.1%
Other values (80) 28641
31.5%
Hangul
ValueCountFrequency (%)
3889
 
3.4%
2502
 
2.2%
2490
 
2.2%
2086
 
1.8%
1807
 
1.6%
1500
 
1.3%
1483
 
1.3%
1373
 
1.2%
1371
 
1.2%
1302
 
1.2%
Other values (1117) 93046
82.5%
None
ValueCountFrequency (%)
· 397
84.3%
25
 
5.3%
15
 
3.2%
15
 
3.2%
4
 
0.8%
4
 
0.8%
1
 
0.2%
1
 
0.2%
1
 
0.2%
1
 
0.2%
Other values (7) 7
 
1.5%
CJK
ValueCountFrequency (%)
270
 
3.9%
178
 
2.6%
174
 
2.5%
135
 
1.9%
112
 
1.6%
108
 
1.6%
94
 
1.4%
86
 
1.2%
85
 
1.2%
84
 
1.2%
Other values (885) 5629
80.9%
Compat Jamo
ValueCountFrequency (%)
95
92.2%
3
 
2.9%
2
 
1.9%
1
 
1.0%
1
 
1.0%
1
 
1.0%
Number Forms
ValueCountFrequency (%)
42
45.2%
39
41.9%
7
 
7.5%
4
 
4.3%
1
 
1.1%
CJK Compat Ideographs
ValueCountFrequency (%)
32
17.1%
26
13.9%
17
 
9.1%
12
 
6.4%
8
 
4.3%
8
 
4.3%
8
 
4.3%
7
 
3.7%
5
 
2.7%
5
 
2.7%
Other values (32) 59
31.6%
Misc Symbols
ValueCountFrequency (%)
5
100.0%
Math Operators
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Enclosed Alphanum
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Katakana
ValueCountFrequency (%)
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (4) 4
26.7%
Punctuation
ValueCountFrequency (%)
1
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct8558
Distinct (%)85.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T08:56:49.082723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length114
Median length81
Mean length12.1173
Min length1

Characters and Unicode

Total characters121173
Distinct characters813
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7679 ?
Unique (%)76.8%

Sample

1st row911.032-조738ㄱ
2nd row365.23-강713ㅈ
3rd row351.1-한744ㅎㄱ
4th row005.133-이436ㅁ
5th row005.319-황644ㅇ
ValueCountFrequency (%)
134
 
1.2%
080-대383-r 35
 
0.3%
그리고 25
 
0.2%
337-충85ㅊ-g 20
 
0.2%
322.004-국464ㄱ-g 18
 
0.2%
811.608-미735 15
 
0.1%
320.911-한419ㅇ-g 14
 
0.1%
13
 
0.1%
the 13
 
0.1%
위한 12
 
0.1%
Other values (9448) 11087
97.4%
2023-12-12T08:56:49.498385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 11119
 
9.2%
3 10071
 
8.3%
1 9898
 
8.2%
5 7573
 
6.2%
. 7562
 
6.2%
8 7280
 
6.0%
9 7218
 
6.0%
6 6891
 
5.7%
7 6765
 
5.6%
2 6155
 
5.1%
Other values (803) 40641
33.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 72287
59.7%
Other Letter 25059
 
20.7%
Dash Punctuation 11119
 
9.2%
Other Punctuation 7711
 
6.4%
Uppercase Letter 1968
 
1.6%
Space Separator 1913
 
1.6%
Lowercase Letter 1047
 
0.9%
Math Symbol 28
 
< 0.1%
Close Punctuation 22
 
< 0.1%
Open Punctuation 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1578
 
6.3%
1460
 
5.8%
1372
 
5.5%
1258
 
5.0%
1242
 
5.0%
1223
 
4.9%
1069
 
4.3%
595
 
2.4%
538
 
2.1%
509
 
2.0%
Other values (718) 14215
56.7%
Uppercase Letter
ValueCountFrequency (%)
G 483
24.5%
R 401
20.4%
C 323
16.4%
E 230
11.7%
X 221
11.2%
U 103
 
5.2%
T 40
 
2.0%
A 32
 
1.6%
O 13
 
0.7%
I 12
 
0.6%
Other values (16) 110
 
5.6%
Lowercase Letter
ValueCountFrequency (%)
o 119
11.4%
e 118
11.3%
a 92
 
8.8%
r 89
 
8.5%
n 87
 
8.3%
t 80
 
7.6%
i 77
 
7.4%
s 60
 
5.7%
d 46
 
4.4%
c 38
 
3.6%
Other values (16) 241
23.0%
Decimal Number
ValueCountFrequency (%)
3 10071
13.9%
1 9898
13.7%
5 7573
10.5%
8 7280
10.1%
9 7218
10.0%
6 6891
9.5%
7 6765
9.4%
2 6155
8.5%
4 5799
8.0%
0 4637
6.4%
Other Punctuation
ValueCountFrequency (%)
. 7562
98.1%
: 109
 
1.4%
? 16
 
0.2%
! 8
 
0.1%
' 7
 
0.1%
· 5
 
0.1%
& 3
 
< 0.1%
1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 20
71.4%
~ 6
 
21.4%
< 1
 
3.6%
> 1
 
3.6%
Close Punctuation
ValueCountFrequency (%)
) 20
90.9%
] 1
 
4.5%
1
 
4.5%
Open Punctuation
ValueCountFrequency (%)
( 12
85.7%
[ 1
 
7.1%
1
 
7.1%
Letter Number
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 11119
100.0%
Space Separator
ValueCountFrequency (%)
1913
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 93094
76.8%
Hangul 25044
 
20.7%
Latin 3020
 
2.5%
Han 15
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1578
 
6.3%
1460
 
5.8%
1372
 
5.5%
1258
 
5.0%
1242
 
5.0%
1223
 
4.9%
1069
 
4.3%
595
 
2.4%
538
 
2.1%
509
 
2.0%
Other values (703) 14200
56.7%
Latin
ValueCountFrequency (%)
G 483
16.0%
R 401
13.3%
C 323
 
10.7%
E 230
 
7.6%
X 221
 
7.3%
o 119
 
3.9%
e 118
 
3.9%
U 103
 
3.4%
a 92
 
3.0%
r 89
 
2.9%
Other values (45) 841
27.8%
Common
ValueCountFrequency (%)
- 11119
11.9%
3 10071
10.8%
1 9898
10.6%
5 7573
8.1%
. 7562
8.1%
8 7280
7.8%
9 7218
7.8%
6 6891
7.4%
7 6765
7.3%
2 6155
6.6%
Other values (20) 12562
13.5%
Han
ValueCountFrequency (%)
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (5) 5
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 96101
79.3%
Hangul 15433
 
12.7%
Compat Jamo 9611
 
7.9%
CJK 15
 
< 0.1%
None 8
 
< 0.1%
Number Forms 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 11119
11.6%
3 10071
10.5%
1 9898
10.3%
5 7573
7.9%
. 7562
7.9%
8 7280
7.6%
9 7218
7.5%
6 6891
7.2%
7 6765
7.0%
2 6155
 
6.4%
Other values (68) 15569
16.2%
Compat Jamo
ValueCountFrequency (%)
1578
16.4%
1258
13.1%
1242
12.9%
1223
12.7%
1069
11.1%
595
 
6.2%
509
 
5.3%
475
 
4.9%
422
 
4.4%
386
 
4.0%
Other values (9) 854
8.9%
Hangul
ValueCountFrequency (%)
1460
 
9.5%
1372
 
8.9%
538
 
3.5%
493
 
3.2%
349
 
2.3%
307
 
2.0%
303
 
2.0%
222
 
1.4%
202
 
1.3%
193
 
1.3%
Other values (684) 9994
64.8%
None
ValueCountFrequency (%)
· 5
62.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
Number Forms
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
CJK
ValueCountFrequency (%)
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (5) 5
33.3%

Unnamed: 3
Text

MISSING 

Distinct530
Distinct (%)93.3%
Missing9432
Missing (%)94.3%
Memory size156.2 KiB
2023-12-12T08:56:49.823055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length42
Mean length11.700704
Min length1

Characters and Unicode

Total characters6646
Distinct characters429
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique502 ?
Unique (%)88.4%

Sample

1st row911.075-최645ㅂ-C
2nd row680.01-남463ㅇ
3rd row610.9205-승293ㄱ
4th row340.911-월323ㅂ
5th row 미래-
ValueCountFrequency (%)
23
 
2.8%
그리고 15
 
1.8%
대한 5
 
0.6%
건축 4
 
0.5%
the 4
 
0.5%
future 4
 
0.5%
toward 4
 
0.5%
and 4
 
0.5%
젠더 3
 
0.4%
접근 3
 
0.4%
Other values (706) 758
91.7%
2023-12-12T08:56:50.328343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 514
 
7.7%
- 463
 
7.0%
397
 
6.0%
3 386
 
5.8%
8 349
 
5.3%
. 341
 
5.1%
9 326
 
4.9%
2 296
 
4.5%
4 287
 
4.3%
6 286
 
4.3%
Other values (419) 3001
45.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3180
47.8%
Other Letter 1861
28.0%
Dash Punctuation 463
 
7.0%
Space Separator 397
 
6.0%
Other Punctuation 367
 
5.5%
Lowercase Letter 280
 
4.2%
Uppercase Letter 75
 
1.1%
Close Punctuation 9
 
0.1%
Open Punctuation 7
 
0.1%
Math Symbol 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
77
 
4.1%
73
 
3.9%
71
 
3.8%
65
 
3.5%
55
 
3.0%
37
 
2.0%
35
 
1.9%
33
 
1.8%
31
 
1.7%
31
 
1.7%
Other values (358) 1353
72.7%
Lowercase Letter
ValueCountFrequency (%)
t 30
10.7%
e 30
10.7%
o 24
 
8.6%
a 23
 
8.2%
u 20
 
7.1%
r 19
 
6.8%
h 17
 
6.1%
i 16
 
5.7%
c 16
 
5.7%
s 16
 
5.7%
Other values (12) 69
24.6%
Uppercase Letter
ValueCountFrequency (%)
G 19
25.3%
R 17
22.7%
C 7
 
9.3%
F 6
 
8.0%
P 5
 
6.7%
A 5
 
6.7%
T 5
 
6.7%
S 4
 
5.3%
J 2
 
2.7%
M 1
 
1.3%
Other values (4) 4
 
5.3%
Decimal Number
ValueCountFrequency (%)
1 514
16.2%
3 386
12.1%
8 349
11.0%
9 326
10.3%
2 296
9.3%
4 287
9.0%
6 286
9.0%
5 281
8.8%
7 235
7.4%
0 220
6.9%
Other Punctuation
ValueCountFrequency (%)
. 341
92.9%
: 19
 
5.2%
! 4
 
1.1%
· 2
 
0.5%
/ 1
 
0.3%
Math Symbol
ValueCountFrequency (%)
= 4
66.7%
< 1
 
16.7%
> 1
 
16.7%
Close Punctuation
ValueCountFrequency (%)
) 8
88.9%
1
 
11.1%
Open Punctuation
ValueCountFrequency (%)
( 6
85.7%
1
 
14.3%
Dash Punctuation
ValueCountFrequency (%)
- 463
100.0%
Space Separator
ValueCountFrequency (%)
397
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4429
66.6%
Hangul 1848
27.8%
Latin 356
 
5.4%
Han 13
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
77
 
4.2%
73
 
4.0%
71
 
3.8%
65
 
3.5%
55
 
3.0%
37
 
2.0%
35
 
1.9%
33
 
1.8%
31
 
1.7%
31
 
1.7%
Other values (345) 1340
72.5%
Latin
ValueCountFrequency (%)
t 30
 
8.4%
e 30
 
8.4%
o 24
 
6.7%
a 23
 
6.5%
u 20
 
5.6%
G 19
 
5.3%
r 19
 
5.3%
R 17
 
4.8%
h 17
 
4.8%
i 16
 
4.5%
Other values (27) 141
39.6%
Common
ValueCountFrequency (%)
1 514
11.6%
- 463
10.5%
397
9.0%
3 386
8.7%
8 349
7.9%
. 341
7.7%
9 326
7.4%
2 296
 
6.7%
4 287
 
6.5%
6 286
 
6.5%
Other values (14) 784
17.7%
Han
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4780
71.9%
Hangul 1435
 
21.6%
Compat Jamo 413
 
6.2%
CJK 13
 
0.2%
None 4
 
0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 514
10.8%
- 463
9.7%
397
 
8.3%
3 386
 
8.1%
8 349
 
7.3%
. 341
 
7.1%
9 326
 
6.8%
2 296
 
6.2%
4 287
 
6.0%
6 286
 
6.0%
Other values (47) 1135
23.7%
Compat Jamo
ValueCountFrequency (%)
77
18.6%
65
15.7%
55
13.3%
37
9.0%
35
8.5%
31
7.5%
27
 
6.5%
21
 
5.1%
18
 
4.4%
16
 
3.9%
Other values (6) 31
7.5%
Hangul
ValueCountFrequency (%)
73
 
5.1%
71
 
4.9%
33
 
2.3%
31
 
2.2%
27
 
1.9%
27
 
1.9%
24
 
1.7%
22
 
1.5%
22
 
1.5%
20
 
1.4%
Other values (329) 1085
75.6%
None
ValueCountFrequency (%)
· 2
50.0%
1
25.0%
1
25.0%
CJK
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
Number Forms
ValueCountFrequency (%)
1
100.0%

Unnamed: 4
Text

MISSING 

Distinct144
Distinct (%)94.7%
Missing9848
Missing (%)98.5%
Memory size156.2 KiB
2023-12-12T08:56:50.645741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length23
Mean length11.796053
Min length2

Characters and Unicode

Total characters1793
Distinct characters238
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique140 ?
Unique (%)92.1%

Sample

1st row181.3-이611ㅇ
2nd row기계설비 전량수록
3rd row 그밖의 중세 발명품들
4th row004.588-나215ㅋ
5th row511.49-페352ㅁ대
ValueCountFrequency (%)
기계설비 4
 
1.9%
377.604-박547ㅅ 4
 
1.9%
전량수록 4
 
1.9%
of 3
 
1.4%
발명품들 2
 
0.9%
중심으로 2
 
0.9%
2
 
0.9%
그밖의 2
 
0.9%
politiker 2
 
0.9%
중세 2
 
0.9%
Other values (184) 184
87.2%
2023-12-12T08:56:51.187187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 126
 
7.0%
3 118
 
6.6%
- 116
 
6.5%
99
 
5.5%
5 98
 
5.5%
. 91
 
5.1%
2 83
 
4.6%
4 79
 
4.4%
9 78
 
4.4%
8 73
 
4.1%
Other values (228) 832
46.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 835
46.6%
Other Letter 501
27.9%
Dash Punctuation 116
 
6.5%
Lowercase Letter 113
 
6.3%
Space Separator 99
 
5.5%
Other Punctuation 96
 
5.4%
Uppercase Letter 19
 
1.1%
Close Punctuation 8
 
0.4%
Open Punctuation 6
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
4.6%
19
 
3.8%
15
 
3.0%
15
 
3.0%
13
 
2.6%
12
 
2.4%
9
 
1.8%
9
 
1.8%
8
 
1.6%
8
 
1.6%
Other values (181) 370
73.9%
Lowercase Letter
ValueCountFrequency (%)
e 15
13.3%
i 14
12.4%
o 13
11.5%
t 8
 
7.1%
r 8
 
7.1%
p 6
 
5.3%
l 6
 
5.3%
s 6
 
5.3%
n 5
 
4.4%
a 5
 
4.4%
Other values (9) 27
23.9%
Decimal Number
ValueCountFrequency (%)
1 126
15.1%
3 118
14.1%
5 98
11.7%
2 83
9.9%
4 79
9.5%
9 78
9.3%
8 73
8.7%
7 65
7.8%
6 62
7.4%
0 53
6.3%
Uppercase Letter
ValueCountFrequency (%)
R 6
31.6%
C 3
15.8%
G 2
 
10.5%
P 2
 
10.5%
H 1
 
5.3%
U 1
 
5.3%
E 1
 
5.3%
F 1
 
5.3%
K 1
 
5.3%
T 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 91
94.8%
: 2
 
2.1%
? 2
 
2.1%
· 1
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 116
100.0%
Space Separator
ValueCountFrequency (%)
99
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1160
64.7%
Hangul 496
27.7%
Latin 132
 
7.4%
Han 5
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
4.6%
19
 
3.8%
15
 
3.0%
15
 
3.0%
13
 
2.6%
12
 
2.4%
9
 
1.8%
9
 
1.8%
8
 
1.6%
8
 
1.6%
Other values (176) 365
73.6%
Latin
ValueCountFrequency (%)
e 15
 
11.4%
i 14
 
10.6%
o 13
 
9.8%
t 8
 
6.1%
r 8
 
6.1%
p 6
 
4.5%
l 6
 
4.5%
R 6
 
4.5%
s 6
 
4.5%
n 5
 
3.8%
Other values (19) 45
34.1%
Common
ValueCountFrequency (%)
1 126
10.9%
3 118
10.2%
- 116
10.0%
99
8.5%
5 98
8.4%
. 91
7.8%
2 83
7.2%
4 79
 
6.8%
9 78
 
6.7%
8 73
 
6.3%
Other values (8) 199
17.2%
Han
ValueCountFrequency (%)
1
20.0%
調 1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1291
72.0%
Hangul 391
 
21.8%
Compat Jamo 105
 
5.9%
CJK 5
 
0.3%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 126
 
9.8%
3 118
 
9.1%
- 116
 
9.0%
99
 
7.7%
5 98
 
7.6%
. 91
 
7.0%
2 83
 
6.4%
4 79
 
6.1%
9 78
 
6.0%
8 73
 
5.7%
Other values (36) 330
25.6%
Hangul
ValueCountFrequency (%)
23
 
5.9%
15
 
3.8%
9
 
2.3%
9
 
2.3%
8
 
2.0%
7
 
1.8%
6
 
1.5%
6
 
1.5%
5
 
1.3%
5
 
1.3%
Other values (161) 298
76.2%
Compat Jamo
ValueCountFrequency (%)
19
18.1%
15
14.3%
13
12.4%
12
11.4%
8
7.6%
7
 
6.7%
6
 
5.7%
6
 
5.7%
4
 
3.8%
4
 
3.8%
Other values (5) 11
10.5%
CJK
ValueCountFrequency (%)
1
20.0%
調 1
20.0%
1
20.0%
1
20.0%
1
20.0%
None
ValueCountFrequency (%)
· 1
100.0%

Unnamed: 5
Text

MISSING 

Distinct43
Distinct (%)89.6%
Missing9952
Missing (%)99.5%
Memory size156.2 KiB
2023-12-12T08:56:51.493146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length80
Median length17.5
Mean length13.458333
Min length2

Characters and Unicode

Total characters646
Distinct characters145
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)83.3%

Sample

1st row532.8-건947ㄱ
2nd row920.3-프868ㅋ
3rd row 수능 완벽대비
4th row532.8-건947ㄱ
5th row 머신러닝까지
ValueCountFrequency (%)
532.8-건947ㄱ 4
 
5.1%
and 3
 
3.8%
of 2
 
2.5%
920.3-프868ㅋ 2
 
2.5%
2
 
2.5%
strafrechtsreformer 2
 
2.5%
350.01-나251ㅎ 1
 
1.3%
양자 1
 
1.3%
아이덴티티 1
 
1.3%
811.7-김719ㅅ 1
 
1.3%
Other values (60) 60
75.9%
2023-12-12T08:56:51.986281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52
 
8.0%
- 28
 
4.3%
1 25
 
3.9%
3 23
 
3.6%
. 23
 
3.6%
5 22
 
3.4%
2 22
 
3.4%
a 21
 
3.3%
e 21
 
3.3%
7 20
 
3.1%
Other values (135) 389
60.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 199
30.8%
Lowercase Letter 175
27.1%
Other Letter 145
22.4%
Space Separator 52
 
8.0%
Dash Punctuation 28
 
4.3%
Other Punctuation 26
 
4.0%
Uppercase Letter 11
 
1.7%
Open Punctuation 6
 
0.9%
Close Punctuation 4
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
4.8%
6
 
4.1%
5
 
3.4%
5
 
3.4%
4
 
2.8%
4
 
2.8%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (87) 102
70.3%
Lowercase Letter
ValueCountFrequency (%)
a 21
12.0%
e 21
12.0%
r 17
9.7%
t 15
8.6%
n 14
8.0%
o 14
8.0%
i 12
 
6.9%
s 12
 
6.9%
d 8
 
4.6%
l 8
 
4.6%
Other values (10) 33
18.9%
Decimal Number
ValueCountFrequency (%)
1 25
12.6%
3 23
11.6%
5 22
11.1%
2 22
11.1%
7 20
10.1%
8 20
10.1%
0 19
9.5%
4 18
9.0%
9 18
9.0%
6 12
6.0%
Uppercase Letter
ValueCountFrequency (%)
G 2
18.2%
P 1
9.1%
A 1
9.1%
S 1
9.1%
L 1
9.1%
B 1
9.1%
N 1
9.1%
K 1
9.1%
D 1
9.1%
F 1
9.1%
Other Punctuation
ValueCountFrequency (%)
. 23
88.5%
& 1
 
3.8%
: 1
 
3.8%
· 1
 
3.8%
Space Separator
ValueCountFrequency (%)
52
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 315
48.8%
Latin 186
28.8%
Hangul 145
22.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
4.8%
6
 
4.1%
5
 
3.4%
5
 
3.4%
4
 
2.8%
4
 
2.8%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (87) 102
70.3%
Latin
ValueCountFrequency (%)
a 21
11.3%
e 21
11.3%
r 17
 
9.1%
t 15
 
8.1%
n 14
 
7.5%
o 14
 
7.5%
i 12
 
6.5%
s 12
 
6.5%
d 8
 
4.3%
l 8
 
4.3%
Other values (20) 44
23.7%
Common
ValueCountFrequency (%)
52
16.5%
- 28
8.9%
1 25
7.9%
3 23
 
7.3%
. 23
 
7.3%
5 22
 
7.0%
2 22
 
7.0%
7 20
 
6.3%
8 20
 
6.3%
0 19
 
6.0%
Other values (8) 61
19.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 500
77.4%
Hangul 119
 
18.4%
Compat Jamo 26
 
4.0%
None 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
52
 
10.4%
- 28
 
5.6%
1 25
 
5.0%
3 23
 
4.6%
. 23
 
4.6%
5 22
 
4.4%
2 22
 
4.4%
a 21
 
4.2%
e 21
 
4.2%
7 20
 
4.0%
Other values (37) 243
48.6%
Compat Jamo
ValueCountFrequency (%)
7
26.9%
5
19.2%
3
11.5%
3
11.5%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Hangul
ValueCountFrequency (%)
6
 
5.0%
5
 
4.2%
4
 
3.4%
4
 
3.4%
3
 
2.5%
3
 
2.5%
3
 
2.5%
2
 
1.7%
2
 
1.7%
2
 
1.7%
Other values (77) 85
71.4%
None
ValueCountFrequency (%)
· 1
100.0%

Unnamed: 6
Text

MISSING 

Distinct21
Distinct (%)95.5%
Missing9978
Missing (%)99.8%
Memory size156.2 KiB
2023-12-12T08:56:52.263685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length9.6818182
Min length2

Characters and Unicode

Total characters213
Distinct characters65
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)90.9%

Sample

1st row747-박642ㅂ
2nd row005.76-김123ㅂ
3rd row 이광수(무정)
4th row 붉은방)
5th row 김소진(자전거 도둑)
ValueCountFrequency (%)
360.1-노766ㄱ윤 2
 
8.3%
517-권662ㅇ 1
 
4.2%
747-박642ㅂ 1
 
4.2%
830.9-손527ㅈ 1
 
4.2%
대전 1
 
4.2%
322.8-이187ㄱ 1
 
4.2%
195.3-박621ㅈ 1
 
4.2%
1
 
4.2%
뫼비우스의 1
 
4.2%
안수길(북간도 1
 
4.2%
Other values (13) 13
54.2%
2023-12-12T08:56:52.666725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 15
 
7.0%
6 14
 
6.6%
7 13
 
6.1%
5 12
 
5.6%
. 11
 
5.2%
1 11
 
5.2%
3 10
 
4.7%
0 10
 
4.7%
2 10
 
4.7%
9
 
4.2%
Other values (55) 98
46.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 101
47.4%
Other Letter 66
31.0%
Dash Punctuation 15
 
7.0%
Other Punctuation 11
 
5.2%
Space Separator 9
 
4.2%
Close Punctuation 6
 
2.8%
Open Punctuation 3
 
1.4%
Uppercase Letter 2
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
6.1%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
Other values (38) 40
60.6%
Decimal Number
ValueCountFrequency (%)
6 14
13.9%
7 13
12.9%
5 12
11.9%
1 11
10.9%
3 10
9.9%
0 10
9.9%
2 10
9.9%
9 8
7.9%
8 8
7.9%
4 5
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
T 1
50.0%
R 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Other Punctuation
ValueCountFrequency (%)
. 11
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 145
68.1%
Hangul 66
31.0%
Latin 2
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
6.1%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
Other values (38) 40
60.6%
Common
ValueCountFrequency (%)
- 15
10.3%
6 14
9.7%
7 13
9.0%
5 12
 
8.3%
. 11
 
7.6%
1 11
 
7.6%
3 10
 
6.9%
0 10
 
6.9%
2 10
 
6.9%
9
 
6.2%
Other values (5) 30
20.7%
Latin
ValueCountFrequency (%)
T 1
50.0%
R 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 147
69.0%
Hangul 52
 
24.4%
Compat Jamo 14
 
6.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 15
10.2%
6 14
9.5%
7 13
8.8%
5 12
 
8.2%
. 11
 
7.5%
1 11
 
7.5%
3 10
 
6.8%
0 10
 
6.8%
2 10
 
6.8%
9
 
6.1%
Other values (7) 32
21.8%
Hangul
ValueCountFrequency (%)
4
 
7.7%
3
 
5.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
1
 
1.9%
1
 
1.9%
Other values (31) 31
59.6%
Compat Jamo
ValueCountFrequency (%)
3
21.4%
3
21.4%
3
21.4%
2
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%

Unnamed: 7
Text

MISSING 

Distinct8
Distinct (%)100.0%
Missing9992
Missing (%)99.9%
Memory size156.2 KiB
2023-12-12T08:56:52.884148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length9.5
Mean length7.875
Min length2

Characters and Unicode

Total characters63
Distinct characters42
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)100.0%

Sample

1st row 김동인(배따라기
2nd row 강석경(숲속의 방)
3rd row 박상률(봄바람)
4th row 강경애(인간문제)
5th row영화
ValueCountFrequency (%)
김동인(배따라기 1
10.0%
강석경(숲속의 1
10.0%
1
10.0%
박상률(봄바람 1
10.0%
강경애(인간문제 1
10.0%
영화 1
10.0%
전광용(사수 1
10.0%
전상국(우리들의 1
10.0%
날개 1
10.0%
울산 1
10.0%
2023-12-12T08:56:53.250239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
14.3%
( 6
 
9.5%
) 3
 
4.8%
2
 
3.2%
2
 
3.2%
2
 
3.2%
2
 
3.2%
2
 
3.2%
2
 
3.2%
1
 
1.6%
Other values (32) 32
50.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45
71.4%
Space Separator 9
 
14.3%
Open Punctuation 6
 
9.5%
Close Punctuation 3
 
4.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (29) 29
64.4%
Space Separator
ValueCountFrequency (%)
9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45
71.4%
Common 18
 
28.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (29) 29
64.4%
Common
ValueCountFrequency (%)
9
50.0%
( 6
33.3%
) 3
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45
71.4%
ASCII 18
 
28.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9
50.0%
( 6
33.3%
) 3
 
16.7%
Hangul
ValueCountFrequency (%)
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (29) 29
64.4%

Unnamed: 8
Text

MISSING 

Distinct8
Distinct (%)100.0%
Missing9992
Missing (%)99.9%
Memory size156.2 KiB
2023-12-12T08:56:53.472867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length7.375
Min length3

Characters and Unicode

Total characters59
Distinct characters40
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)100.0%

Sample

1st row 감자
2nd row 최일남(흐르는 북)
3rd row 신경숙(외딴 방
4th row 박태원(천변풍경
5th row101-김696ㅅ
ValueCountFrequency (%)
감자 1
8.3%
최일남(흐르는 1
8.3%
1
8.3%
신경숙(외딴 1
8.3%
1
8.3%
박태원(천변풍경 1
8.3%
101-김696ㅅ 1
8.3%
끼삐딴 1
8.3%
1
8.3%
우상의 1
8.3%
Other values (2) 2
16.7%
2023-12-12T08:56:54.156352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
18.6%
( 3
 
5.1%
3
 
5.1%
) 3
 
5.1%
1 2
 
3.4%
2
 
3.4%
6 2
 
3.4%
1
 
1.7%
1
 
1.7%
0 1
 
1.7%
Other values (30) 30
50.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35
59.3%
Space Separator 11
 
18.6%
Decimal Number 6
 
10.2%
Open Punctuation 3
 
5.1%
Close Punctuation 3
 
5.1%
Dash Punctuation 1
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
 
8.6%
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (22) 22
62.9%
Decimal Number
ValueCountFrequency (%)
1 2
33.3%
6 2
33.3%
0 1
16.7%
9 1
16.7%
Space Separator
ValueCountFrequency (%)
11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35
59.3%
Common 24
40.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
 
8.6%
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (22) 22
62.9%
Common
ValueCountFrequency (%)
11
45.8%
( 3
 
12.5%
) 3
 
12.5%
1 2
 
8.3%
6 2
 
8.3%
0 1
 
4.2%
- 1
 
4.2%
9 1
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34
57.6%
ASCII 24
40.7%
Compat Jamo 1
 
1.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11
45.8%
( 3
 
12.5%
) 3
 
12.5%
1 2
 
8.3%
6 2
 
8.3%
0 1
 
4.2%
- 1
 
4.2%
9 1
 
4.2%
Hangul
ValueCountFrequency (%)
3
 
8.8%
2
 
5.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (21) 21
61.8%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Unnamed: 9
Text

MISSING 

Distinct7
Distinct (%)100.0%
Missing9993
Missing (%)99.9%
Memory size156.2 KiB
2023-12-12T08:56:54.353471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length10.571429
Min length3

Characters and Unicode

Total characters74
Distinct characters46
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)100.0%

Sample

1st row 광염 소나타)
2nd row 이인성(당신에 대해서)
3rd row 감자 먹는 사람들)
4th row 소설가 구보 씨의 일일)
5th row 강신재(젊은 느티나무)
ValueCountFrequency (%)
광염 1
 
6.2%
소나타 1
 
6.2%
이인성(당신에 1
 
6.2%
대해서 1
 
6.2%
감자 1
 
6.2%
먹는 1
 
6.2%
사람들 1
 
6.2%
소설가 1
 
6.2%
구보 1
 
6.2%
씨의 1
 
6.2%
Other values (6) 6
37.5%
2023-12-12T08:56:54.675198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
21.6%
) 6
 
8.1%
( 3
 
4.1%
2
 
2.7%
2
 
2.7%
2
 
2.7%
2
 
2.7%
2
 
2.7%
2
 
2.7%
1
 
1.4%
Other values (36) 36
48.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 49
66.2%
Space Separator 16
 
21.6%
Close Punctuation 6
 
8.1%
Open Punctuation 3
 
4.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
1
 
2.0%
1
 
2.0%
1
 
2.0%
1
 
2.0%
Other values (33) 33
67.3%
Space Separator
ValueCountFrequency (%)
16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 49
66.2%
Common 25
33.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
1
 
2.0%
1
 
2.0%
1
 
2.0%
1
 
2.0%
Other values (33) 33
67.3%
Common
ValueCountFrequency (%)
16
64.0%
) 6
 
24.0%
( 3
 
12.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 49
66.2%
ASCII 25
33.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16
64.0%
) 6
 
24.0%
( 3
 
12.0%
Hangul
ValueCountFrequency (%)
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
1
 
2.0%
1
 
2.0%
1
 
2.0%
1
 
2.0%
Other values (33) 33
67.3%

Unnamed: 10
Text

MISSING 

Distinct7
Distinct (%)100.0%
Missing9993
Missing (%)99.9%
Memory size156.2 KiB
2023-12-12T08:56:54.855604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length10.571429
Min length3

Characters and Unicode

Total characters74
Distinct characters53
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)100.0%

Sample

1st row 현진건(할머니의 죽음
2nd row 김학철(종횡만리)
3rd row 성석제(오랜지 맛 오렌지)
4th row 박영준(모범 경작생)
5th row 최인훈(광장)
ValueCountFrequency (%)
현진건(할머니의 1
9.1%
죽음 1
9.1%
김학철(종횡만리 1
9.1%
성석제(오랜지 1
9.1%
1
9.1%
오렌지 1
9.1%
박영준(모범 1
9.1%
경작생 1
9.1%
최인훈(광장 1
9.1%
813.6082-김964ㄲ 1
9.1%
2023-12-12T08:56:55.207063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
13.5%
( 5
 
6.8%
) 4
 
5.4%
2
 
2.7%
2
 
2.7%
6 2
 
2.7%
2
 
2.7%
8 2
 
2.7%
1
 
1.4%
1
 
1.4%
Other values (43) 43
58.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43
58.1%
Space Separator 10
 
13.5%
Decimal Number 10
 
13.5%
Open Punctuation 5
 
6.8%
Close Punctuation 4
 
5.4%
Other Punctuation 1
 
1.4%
Dash Punctuation 1
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
4.7%
2
 
4.7%
2
 
4.7%
1
 
2.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
Other values (30) 30
69.8%
Decimal Number
ValueCountFrequency (%)
6 2
20.0%
8 2
20.0%
1 1
10.0%
3 1
10.0%
0 1
10.0%
2 1
10.0%
9 1
10.0%
4 1
10.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43
58.1%
Common 31
41.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
 
4.7%
2
 
4.7%
2
 
4.7%
1
 
2.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
Other values (30) 30
69.8%
Common
ValueCountFrequency (%)
10
32.3%
( 5
16.1%
) 4
 
12.9%
6 2
 
6.5%
8 2
 
6.5%
1 1
 
3.2%
3 1
 
3.2%
. 1
 
3.2%
0 1
 
3.2%
2 1
 
3.2%
Other values (3) 3
 
9.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42
56.8%
ASCII 31
41.9%
Compat Jamo 1
 
1.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10
32.3%
( 5
16.1%
) 4
 
12.9%
6 2
 
6.5%
8 2
 
6.5%
1 1
 
3.2%
3 1
 
3.2%
. 1
 
3.2%
0 1
 
3.2%
2 1
 
3.2%
Other values (3) 3
 
9.7%
Hangul
ValueCountFrequency (%)
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (29) 29
69.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Unnamed: 11
Text

MISSING 

Distinct6
Distinct (%)100.0%
Missing9994
Missing (%)99.9%
Memory size156.2 KiB
2023-12-12T08:56:55.376612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length9.5
Min length3

Characters and Unicode

Total characters57
Distinct characters40
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)100.0%

Sample

1st row 운수 좋은 날
2nd row813.6082-김964ㄲ
3rd row 이영도(드래곤 라자)
4th row 김유정(만무방
5th row 김정한(모래톱 이야기
ValueCountFrequency (%)
운수 1
10.0%
좋은 1
10.0%
1
10.0%
813.6082-김964ㄲ 1
10.0%
이영도(드래곤 1
10.0%
라자 1
10.0%
김유정(만무방 1
10.0%
김정한(모래톱 1
10.0%
이야기 1
10.0%
충남 1
10.0%
2023-12-12T08:56:55.697823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
15.8%
3
 
5.3%
( 3
 
5.3%
8 2
 
3.5%
2
 
3.5%
6 2
 
3.5%
2
 
3.5%
2
 
3.5%
1
 
1.8%
1
 
1.8%
Other values (30) 30
52.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32
56.1%
Decimal Number 10
 
17.5%
Space Separator 9
 
15.8%
Open Punctuation 3
 
5.3%
Close Punctuation 1
 
1.8%
Other Punctuation 1
 
1.8%
Dash Punctuation 1
 
1.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
 
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (17) 17
53.1%
Decimal Number
ValueCountFrequency (%)
8 2
20.0%
6 2
20.0%
1 1
10.0%
3 1
10.0%
0 1
10.0%
2 1
10.0%
9 1
10.0%
4 1
10.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32
56.1%
Common 25
43.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
 
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (17) 17
53.1%
Common
ValueCountFrequency (%)
9
36.0%
( 3
 
12.0%
8 2
 
8.0%
6 2
 
8.0%
) 1
 
4.0%
. 1
 
4.0%
1 1
 
4.0%
3 1
 
4.0%
0 1
 
4.0%
2 1
 
4.0%
Other values (3) 3
 
12.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31
54.4%
ASCII 25
43.9%
Compat Jamo 1
 
1.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9
36.0%
( 3
 
12.0%
8 2
 
8.0%
6 2
 
8.0%
) 1
 
4.0%
. 1
 
4.0%
1 1
 
4.0%
3 1
 
4.0%
0 1
 
4.0%
2 1
 
4.0%
Other values (3) 3
 
12.0%
Hangul
ValueCountFrequency (%)
3
 
9.7%
2
 
6.5%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (16) 16
51.6%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Unnamed: 12
Text

MISSING 

Distinct5
Distinct (%)100.0%
Missing9995
Missing (%)> 99.9%
Memory size156.2 KiB
2023-12-12T08:56:55.864391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length5
Mean length7
Min length3

Characters and Unicode

Total characters35
Distinct characters23
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)100.0%

Sample

1st row 고향)
2nd row 윤영수(민사95다 6008호 사건)
3rd row 봄봄
4th row 수라도)
5th row 전북
ValueCountFrequency (%)
고향 1
14.3%
윤영수(민사95다 1
14.3%
6008호 1
14.3%
사건 1
14.3%
봄봄 1
14.3%
수라도 1
14.3%
전북 1
14.3%
2023-12-12T08:56:56.164153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7
20.0%
) 3
 
8.6%
2
 
5.7%
2
 
5.7%
2
 
5.7%
0 2
 
5.7%
6 1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (13) 13
37.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18
51.4%
Space Separator 7
 
20.0%
Decimal Number 6
 
17.1%
Close Punctuation 3
 
8.6%
Open Punctuation 1
 
2.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
11.1%
2
 
11.1%
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (5) 5
27.8%
Decimal Number
ValueCountFrequency (%)
0 2
33.3%
6 1
16.7%
8 1
16.7%
5 1
16.7%
9 1
16.7%
Space Separator
ValueCountFrequency (%)
7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18
51.4%
Common 17
48.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
 
11.1%
2
 
11.1%
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (5) 5
27.8%
Common
ValueCountFrequency (%)
7
41.2%
) 3
17.6%
0 2
 
11.8%
6 1
 
5.9%
8 1
 
5.9%
5 1
 
5.9%
9 1
 
5.9%
( 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18
51.4%
ASCII 17
48.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7
41.2%
) 3
17.6%
0 2
 
11.8%
6 1
 
5.9%
8 1
 
5.9%
5 1
 
5.9%
9 1
 
5.9%
( 1
 
5.9%
Hangul
ValueCountFrequency (%)
2
 
11.1%
2
 
11.1%
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (5) 5
27.8%

Unnamed: 13
Text

MISSING 

Distinct4
Distinct (%)80.0%
Missing9995
Missing (%)> 99.9%
Memory size156.2 KiB
2023-12-12T08:56:56.357078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length9.6
Min length3

Characters and Unicode

Total characters48
Distinct characters28
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)60.0%

Sample

1st row 나도향(벙어리 삼룡)
2nd row813.6082-김964ㄲ
3rd row 동백꽃)
4th row813.6082-김964ㄲ
5th row 전남
ValueCountFrequency (%)
813.6082-김964ㄲ 2
33.3%
나도향(벙어리 1
16.7%
삼룡 1
16.7%
동백꽃 1
16.7%
전남 1
16.7%
2023-12-12T08:56:56.616262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 4
 
8.3%
6 4
 
8.3%
4
 
8.3%
2
 
4.2%
) 2
 
4.2%
1 2
 
4.2%
2
 
4.2%
9 2
 
4.2%
4 2
 
4.2%
- 2
 
4.2%
Other values (18) 22
45.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 20
41.7%
Other Letter 17
35.4%
Space Separator 4
 
8.3%
Close Punctuation 2
 
4.2%
Dash Punctuation 2
 
4.2%
Other Punctuation 2
 
4.2%
Open Punctuation 1
 
2.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
11.8%
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (5) 5
29.4%
Decimal Number
ValueCountFrequency (%)
8 4
20.0%
6 4
20.0%
1 2
10.0%
9 2
10.0%
4 2
10.0%
2 2
10.0%
0 2
10.0%
3 2
10.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 31
64.6%
Hangul 17
35.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
 
11.8%
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (5) 5
29.4%
Common
ValueCountFrequency (%)
8 4
12.9%
6 4
12.9%
4
12.9%
) 2
 
6.5%
1 2
 
6.5%
9 2
 
6.5%
4 2
 
6.5%
- 2
 
6.5%
2 2
 
6.5%
0 2
 
6.5%
Other values (3) 5
16.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 31
64.6%
Hangul 15
31.2%
Compat Jamo 2
 
4.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 4
12.9%
6 4
12.9%
4
12.9%
) 2
 
6.5%
1 2
 
6.5%
9 2
 
6.5%
4 2
 
6.5%
- 2
 
6.5%
2 2
 
6.5%
0 2
 
6.5%
Other values (3) 5
16.1%
Hangul
ValueCountFrequency (%)
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (4) 4
26.7%
Compat Jamo
ValueCountFrequency (%)
2
100.0%

Unnamed: 14
Text

MISSING 

Distinct3
Distinct (%)100.0%
Missing9997
Missing (%)> 99.9%
Memory size156.2 KiB
2023-12-12T08:56:56.768380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length10
Min length3

Characters and Unicode

Total characters30
Distinct characters25
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)100.0%

Sample

1st row813.6082-김964ㄲ
2nd row 이효석(메밀꽃 필 무렵
3rd row 경북
ValueCountFrequency (%)
813.6082-김964ㄲ 1
20.0%
이효석(메밀꽃 1
20.0%
1
20.0%
무렵 1
20.0%
경북 1
20.0%
2023-12-12T08:56:57.069074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
 
13.3%
6 2
 
6.7%
8 2
 
6.7%
. 1
 
3.3%
3 1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (15) 15
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13
43.3%
Decimal Number 10
33.3%
Space Separator 4
 
13.3%
Other Punctuation 1
 
3.3%
Open Punctuation 1
 
3.3%
Dash Punctuation 1
 
3.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
Decimal Number
ValueCountFrequency (%)
6 2
20.0%
8 2
20.0%
3 1
10.0%
1 1
10.0%
4 1
10.0%
9 1
10.0%
2 1
10.0%
0 1
10.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 17
56.7%
Hangul 13
43.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
Common
ValueCountFrequency (%)
4
23.5%
6 2
11.8%
8 2
11.8%
. 1
 
5.9%
3 1
 
5.9%
( 1
 
5.9%
1 1
 
5.9%
4 1
 
5.9%
9 1
 
5.9%
- 1
 
5.9%
Other values (2) 2
11.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17
56.7%
Hangul 12
40.0%
Compat Jamo 1
 
3.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4
23.5%
6 2
11.8%
8 2
11.8%
. 1
 
5.9%
3 1
 
5.9%
( 1
 
5.9%
1 1
 
5.9%
4 1
 
5.9%
9 1
 
5.9%
- 1
 
5.9%
Other values (2) 2
11.8%
Hangul
ValueCountFrequency (%)
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Other values (2) 2
16.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Unnamed: 15
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing9998
Missing (%)> 99.9%
Memory size156.2 KiB
2023-12-12T08:56:57.206090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters6
Distinct characters5
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row 산)
2nd row 경남
ValueCountFrequency (%)
1
50.0%
경남 1
50.0%
2023-12-12T08:56:57.435034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
33.3%
1
16.7%
) 1
16.7%
1
16.7%
1
16.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3
50.0%
Space Separator 2
33.3%
Close Punctuation 1
 
16.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Space Separator
ValueCountFrequency (%)
2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3
50.0%
Hangul 3
50.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Common
ValueCountFrequency (%)
2
66.7%
) 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3
50.0%
Hangul 3
50.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2
66.7%
) 1
33.3%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Unnamed: 16
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing9998
Missing (%)> 99.9%
Memory size156.2 KiB
2023-12-12T08:56:57.593063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length8.5
Mean length8.5
Min length3

Characters and Unicode

Total characters17
Distinct characters15
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row813.6082-김964ㄲ
2nd row 제주
ValueCountFrequency (%)
813.6082-김964ㄲ 1
50.0%
제주 1
50.0%
2023-12-12T08:56:57.866404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 2
 
11.8%
6 2
 
11.8%
1 1
 
5.9%
3 1
 
5.9%
. 1
 
5.9%
0 1
 
5.9%
2 1
 
5.9%
- 1
 
5.9%
1
 
5.9%
9 1
 
5.9%
Other values (5) 5
29.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10
58.8%
Other Letter 4
 
23.5%
Other Punctuation 1
 
5.9%
Dash Punctuation 1
 
5.9%
Space Separator 1
 
5.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 2
20.0%
6 2
20.0%
1 1
10.0%
3 1
10.0%
0 1
10.0%
2 1
10.0%
9 1
10.0%
4 1
10.0%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 13
76.5%
Hangul 4
 
23.5%

Most frequent character per script

Common
ValueCountFrequency (%)
8 2
15.4%
6 2
15.4%
1 1
7.7%
3 1
7.7%
. 1
7.7%
0 1
7.7%
2 1
7.7%
- 1
7.7%
9 1
7.7%
4 1
7.7%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13
76.5%
Hangul 3
 
17.6%
Compat Jamo 1
 
5.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 2
15.4%
6 2
15.4%
1 1
7.7%
3 1
7.7%
. 1
7.7%
0 1
7.7%
2 1
7.7%
- 1
7.7%
9 1
7.7%
4 1
7.7%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Unnamed: 17
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing9999
Missing (%)> 99.9%
Memory size156.2 KiB
2023-12-12T08:56:58.027217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length11
Mean length11
Min length11

Characters and Unicode

Total characters11
Distinct characters9
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row539.9-경228ㅈ
ValueCountFrequency (%)
539.9-경228ㅈ 1
100.0%
2023-12-12T08:56:58.290967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 2
18.2%
2 2
18.2%
5 1
9.1%
3 1
9.1%
. 1
9.1%
- 1
9.1%
1
9.1%
8 1
9.1%
1
9.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7
63.6%
Other Letter 2
 
18.2%
Other Punctuation 1
 
9.1%
Dash Punctuation 1
 
9.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 2
28.6%
2 2
28.6%
5 1
14.3%
3 1
14.3%
8 1
14.3%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9
81.8%
Hangul 2
 
18.2%

Most frequent character per script

Common
ValueCountFrequency (%)
9 2
22.2%
2 2
22.2%
5 1
11.1%
3 1
11.1%
. 1
11.1%
- 1
11.1%
8 1
11.1%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9
81.8%
Hangul 1
 
9.1%
Compat Jamo 1
 
9.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 2
22.2%
2 2
22.2%
5 1
11.1%
3 1
11.1%
. 1
11.1%
- 1
11.1%
8 1
11.1%
Hangul
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Correlations

2023-12-12T08:56:58.385038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16
Unnamed: 51.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0000.000
Unnamed: 61.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0000.000
Unnamed: 71.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0000.000
Unnamed: 81.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0000.000
Unnamed: 91.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0000.000
Unnamed: 101.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0000.000
Unnamed: 111.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0000.000
Unnamed: 121.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0000.000
Unnamed: 131.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0000.000
Unnamed: 141.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0000.000
Unnamed: 150.0000.0000.0000.0000.0000.0000.0000.0000.0000.0001.0000.000
Unnamed: 160.0000.0000.0000.0000.0000.0000.0000.0000.0000.0000.0001.000

Missing values

2023-12-12T08:56:47.023966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:56:47.249121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T08:56:47.460785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

등록번호서명청구기호Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17
11707KM0014734고구려왕조 700년사911.032-조738ㄱ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
30433KM0037125知的財産權의 刑事的 理解365.23-강713ㅈ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
11432KM0014384한국행정의 과제와 개혁351.1-한744ㅎㄱ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
60780KM0070897모두의 파이썬 : 20일 만에 배우는 프로그래밍 기초 : 개정 2판005.133-이436ㅁ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
41701KM0050375(만들면서 배우는)Android game programming : 기초부터 배우는 게임 프로그래밍의 원리005.319-황644ㅇ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
306KM0000481航空工學 問題叢書558.076-김826ㅎㄱ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
13284KM0016620(최신)소방설비539.99-박514ㅅ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
4933KM0006387디자인 재료학600.23-임942ㄷ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
53218KM0063173100℃ : 뜨거운 기억6월민주항쟁911.075-최645ㅂ-C<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
47934KM0057439보조공학총론 = Assistive technology510.74-한515ㅂ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
등록번호서명청구기호Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17
43086KM0052016지방분산 · 분권과 국토균형발전의 대응과제 (Ⅲ) 자료집 (1) : 전략거점과 주변지역 간 연계를 통한 개발효과 확대방안359.005-국464ㄱㅈ-G<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
43163KM0052101(2011) 국가정보화백서 = National Informatization White Paper020.13-행981ㄱ-G<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
13631KM0017008하수도시설기준539.2-환919ㅎ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
62274KM0072392애썼다오늘의 공무원 : 오늘도 국가 뒤에서 묵묵히 일하고 있는 공무원들에게818-영413ㅇ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
60908KM0071025(2021 필기) 지적기사·산업기사 = Cadastral surveying533.077-송648ㅈ-EX<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
57378KM0067459현대 행정의 가치와 윤리350.019-김967ㅎ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
56660KM0066712왜요그 말이 어때서요? : 나도 모르게 쓰는 차별의 언어330.911-김882ㅇ<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1458KM0001867地下水工學537.4-이731ㅈ김<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
3493KM0004510韓國 圖書館 法令集021.311-한479ㅎ-R<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
7813KM0009811(圖解式)物理ㆍ化學實驗大辭典 = Physics chemistry experimental dictionary431.03-문592ㅁ-R<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>