Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells166
Missing cells (%)0.6%
Duplicate rows412
Duplicate rows (%)4.1%
Total size in memory312.5 KiB
Average record size in memory32.0 B

Variable types

Text3

Dataset

Description문경시 중앙도서관이 소장하고 있는 도서의 서지정보에 대한 데이터로 도서의 서명, 저자, 출판사 등의 항목을 제공합니다.
Author경상북도 문경시
URLhttps://www.data.go.kr/data/15052141/fileData.do

Alerts

Dataset has 412 (4.1%) duplicate rowsDuplicates
저자 has 159 (1.6%) missing valuesMissing

Reproduction

Analysis started2024-04-29 22:32:39.286685
Analysis finished2024-04-29 22:32:41.509067
Duration2.22 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

서명
Text

Distinct9196
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-30T07:32:41.801259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length153
Median length94
Mean length14.122
Min length1

Characters and Unicode

Total characters141220
Distinct characters1991
Distinct categories12 ?
Distinct scripts6 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8701 ?
Unique (%)87.0%

Sample

1st row나의 고전 읽기
2nd row꽃 피우는 아이 티스투
3rd row지구는 대단해
4th row약용식물
5th row날마다 홍차
ValueCountFrequency (%)
이야기 285
 
0.9%
위한 124
 
0.4%
장편소설 121
 
0.4%
우리 104
 
0.3%
the 96
 
0.3%
92
 
0.3%
92
 
0.3%
of 90
 
0.3%
세계 89
 
0.3%
역사 75
 
0.2%
Other values (17780) 31946
96.5%
2024-04-30T07:32:42.310303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23752
 
16.8%
2785
 
2.0%
2626
 
1.9%
1730
 
1.2%
1711
 
1.2%
) 1570
 
1.1%
( 1569
 
1.1%
1552
 
1.1%
e 1482
 
1.0%
1437
 
1.0%
Other values (1981) 101006
71.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 93790
66.4%
Space Separator 23752
 
16.8%
Lowercase Letter 12535
 
8.9%
Other Punctuation 3057
 
2.2%
Uppercase Letter 2305
 
1.6%
Decimal Number 1978
 
1.4%
Close Punctuation 1575
 
1.1%
Open Punctuation 1573
 
1.1%
Math Symbol 567
 
0.4%
Dash Punctuation 77
 
0.1%
Other values (2) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2785
 
3.0%
2626
 
2.8%
1730
 
1.8%
1711
 
1.8%
1552
 
1.7%
1437
 
1.5%
1287
 
1.4%
1267
 
1.4%
1156
 
1.2%
1083
 
1.2%
Other values (1880) 77156
82.3%
Lowercase Letter
ValueCountFrequency (%)
e 1482
11.8%
o 1115
 
8.9%
n 1024
 
8.2%
i 1016
 
8.1%
a 1016
 
8.1%
r 895
 
7.1%
t 872
 
7.0%
s 771
 
6.2%
l 567
 
4.5%
h 507
 
4.0%
Other values (17) 3270
26.1%
Uppercase Letter
ValueCountFrequency (%)
T 213
 
9.2%
S 188
 
8.2%
C 162
 
7.0%
A 145
 
6.3%
I 137
 
5.9%
E 130
 
5.6%
P 126
 
5.5%
M 119
 
5.2%
D 114
 
4.9%
B 110
 
4.8%
Other values (16) 861
37.4%
Other Punctuation
ValueCountFrequency (%)
: 1338
43.8%
, 584
19.1%
? 399
 
13.1%
! 288
 
9.4%
. 176
 
5.8%
' 80
 
2.6%
· 75
 
2.5%
& 36
 
1.2%
/ 31
 
1.0%
% 11
 
0.4%
Other values (10) 39
 
1.3%
Decimal Number
ValueCountFrequency (%)
0 559
28.3%
1 402
20.3%
2 271
13.7%
3 186
 
9.4%
5 154
 
7.8%
9 110
 
5.6%
4 78
 
3.9%
7 77
 
3.9%
6 72
 
3.6%
8 69
 
3.5%
Math Symbol
ValueCountFrequency (%)
= 525
92.6%
~ 20
 
3.5%
+ 20
 
3.5%
1
 
0.2%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1570
99.7%
] 2
 
0.1%
2
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1569
99.7%
2
 
0.1%
1
 
0.1%
[ 1
 
0.1%
Letter Number
ValueCountFrequency (%)
7
70.0%
3
30.0%
Space Separator
ValueCountFrequency (%)
23752
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 77
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 91440
64.8%
Common 32580
 
23.1%
Latin 14850
 
10.5%
Han 2242
 
1.6%
Hiragana 81
 
0.1%
Katakana 27
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2785
 
3.0%
2626
 
2.9%
1730
 
1.9%
1711
 
1.9%
1552
 
1.7%
1437
 
1.6%
1287
 
1.4%
1267
 
1.4%
1156
 
1.3%
1083
 
1.2%
Other values (1216) 74806
81.8%
Han
ValueCountFrequency (%)
82
 
3.7%
67
 
3.0%
48
 
2.1%
48
 
2.1%
40
 
1.8%
38
 
1.7%
34
 
1.5%
23
 
1.0%
21
 
0.9%
21
 
0.9%
Other values (603) 1820
81.2%
Latin
ValueCountFrequency (%)
e 1482
 
10.0%
o 1115
 
7.5%
n 1024
 
6.9%
i 1016
 
6.8%
a 1016
 
6.8%
r 895
 
6.0%
t 872
 
5.9%
s 771
 
5.2%
l 567
 
3.8%
h 507
 
3.4%
Other values (45) 5585
37.6%
Common
ValueCountFrequency (%)
23752
72.9%
) 1570
 
4.8%
( 1569
 
4.8%
: 1338
 
4.1%
, 584
 
1.8%
0 559
 
1.7%
= 525
 
1.6%
1 402
 
1.2%
? 399
 
1.2%
! 288
 
0.9%
Other values (36) 1594
 
4.9%
Hiragana
ValueCountFrequency (%)
10
 
12.3%
5
 
6.2%
5
 
6.2%
5
 
6.2%
4
 
4.9%
4
 
4.9%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
Other values (26) 36
44.4%
Katakana
ValueCountFrequency (%)
5
18.5%
3
11.1%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
Other values (5) 5
18.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 91425
64.7%
ASCII 47317
33.5%
CJK 2192
 
1.6%
None 102
 
0.1%
Hiragana 81
 
0.1%
CJK Compat Ideographs 50
 
< 0.1%
Katakana 27
 
< 0.1%
Compat Jamo 15
 
< 0.1%
Number Forms 10
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23752
50.2%
) 1570
 
3.3%
( 1569
 
3.3%
e 1482
 
3.1%
: 1338
 
2.8%
o 1115
 
2.4%
n 1024
 
2.2%
i 1016
 
2.1%
a 1016
 
2.1%
r 895
 
1.9%
Other values (76) 12540
26.5%
Hangul
ValueCountFrequency (%)
2785
 
3.0%
2626
 
2.9%
1730
 
1.9%
1711
 
1.9%
1552
 
1.7%
1437
 
1.6%
1287
 
1.4%
1267
 
1.4%
1156
 
1.3%
1083
 
1.2%
Other values (1210) 74791
81.8%
CJK
ValueCountFrequency (%)
82
 
3.7%
67
 
3.1%
48
 
2.2%
48
 
2.2%
40
 
1.8%
38
 
1.7%
34
 
1.6%
23
 
1.0%
21
 
1.0%
21
 
1.0%
Other values (578) 1770
80.7%
None
ValueCountFrequency (%)
· 75
73.5%
6
 
5.9%
6
 
5.9%
3
 
2.9%
2
 
2.0%
đ 2
 
2.0%
2
 
2.0%
2
 
2.0%
1
 
1.0%
1
 
1.0%
Other values (2) 2
 
2.0%
Hiragana
ValueCountFrequency (%)
10
 
12.3%
5
 
6.2%
5
 
6.2%
5
 
6.2%
4
 
4.9%
4
 
4.9%
3
 
3.7%
3
 
3.7%
3
 
3.7%
3
 
3.7%
Other values (26) 36
44.4%
Number Forms
ValueCountFrequency (%)
7
70.0%
3
30.0%
CJK Compat Ideographs
ValueCountFrequency (%)
7
14.0%
5
 
10.0%
5
 
10.0%
4
 
8.0%
4
 
8.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
2
 
4.0%
Other values (15) 15
30.0%
Katakana
ValueCountFrequency (%)
5
18.5%
3
11.1%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
Other values (5) 5
18.5%
Compat Jamo
ValueCountFrequency (%)
4
26.7%
4
26.7%
4
26.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Math Operators
ValueCountFrequency (%)
1
100.0%

저자
Text

MISSING 

Distinct8496
Distinct (%)86.3%
Missing159
Missing (%)1.6%
Memory size156.2 KiB
2024-04-30T07:32:42.607170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length114
Median length76
Mean length12.651661
Min length2

Characters and Unicode

Total characters124505
Distinct characters1506
Distinct categories13 ?
Distinct scripts7 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7713 ?
Unique (%)78.4%

Sample

1st row공지영 외 지음
2nd row모리스 드뤼옹 지음;나선희 옮김
3rd row고하라 도모유키 글;마츠오카 다츠히데 그림;신미원 옮김
4th row김태정 글,사진
5th row김유나 지음
ValueCountFrequency (%)
지음 2929
 
9.9%
옮김 2215
 
7.5%
그림 1640
 
5.5%
402
 
1.4%
255
 
0.9%
249
 
0.8%
엮음 229
 
0.8%
by 162
 
0.5%
159
 
0.5%
152
 
0.5%
Other values (13302) 21303
71.7%
2024-04-30T07:32:43.372216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19895
 
16.0%
; 6241
 
5.0%
5713
 
4.6%
5592
 
4.5%
4374
 
3.5%
2757
 
2.2%
2556
 
2.1%
2474
 
2.0%
2309
 
1.9%
2018
 
1.6%
Other values (1496) 70576
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 87846
70.6%
Space Separator 19895
 
16.0%
Other Punctuation 7806
 
6.3%
Lowercase Letter 5724
 
4.6%
Uppercase Letter 1495
 
1.2%
Open Punctuation 796
 
0.6%
Close Punctuation 796
 
0.6%
Decimal Number 90
 
0.1%
Dash Punctuation 38
 
< 0.1%
Math Symbol 14
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5713
 
6.5%
5592
 
6.4%
4374
 
5.0%
2757
 
3.1%
2556
 
2.9%
2474
 
2.8%
2309
 
2.6%
2018
 
2.3%
1257
 
1.4%
1256
 
1.4%
Other values (1409) 57540
65.5%
Uppercase Letter
ValueCountFrequency (%)
M 137
 
9.2%
S 121
 
8.1%
J 111
 
7.4%
B 106
 
7.1%
C 102
 
6.8%
A 87
 
5.8%
L 78
 
5.2%
K 77
 
5.2%
E 72
 
4.8%
H 64
 
4.3%
Other values (17) 540
36.1%
Lowercase Letter
ValueCountFrequency (%)
e 608
10.6%
a 577
10.1%
i 491
 
8.6%
r 463
 
8.1%
n 455
 
7.9%
t 398
 
7.0%
o 386
 
6.7%
l 349
 
6.1%
s 297
 
5.2%
y 267
 
4.7%
Other values (16) 1433
25.0%
Other Punctuation
ValueCountFrequency (%)
; 6241
80.0%
. 982
 
12.6%
, 177
 
2.3%
· 175
 
2.2%
? 137
 
1.8%
: 64
 
0.8%
& 11
 
0.1%
' 7
 
0.1%
* 6
 
0.1%
2
 
< 0.1%
Other values (3) 4
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 25
27.8%
2 22
24.4%
5 10
 
11.1%
0 7
 
7.8%
8 7
 
7.8%
3 6
 
6.7%
4 5
 
5.6%
7 4
 
4.4%
6 3
 
3.3%
9 1
 
1.1%
Open Punctuation
ValueCountFrequency (%)
[ 793
99.6%
( 3
 
0.4%
Close Punctuation
ValueCountFrequency (%)
] 793
99.6%
) 3
 
0.4%
Math Symbol
ValueCountFrequency (%)
< 7
50.0%
> 7
50.0%
Space Separator
ValueCountFrequency (%)
19895
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%
Modifier Symbol
ValueCountFrequency (%)
^ 3
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 85872
69.0%
Common 29439
 
23.6%
Latin 7219
 
5.8%
Han 1814
 
1.5%
Katakana 138
 
0.1%
Hiragana 22
 
< 0.1%
Cyrillic 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5713
 
6.7%
5592
 
6.5%
4374
 
5.1%
2757
 
3.2%
2556
 
3.0%
2474
 
2.9%
2309
 
2.7%
2018
 
2.4%
1257
 
1.5%
1256
 
1.5%
Other values (855) 55566
64.7%
Han
ValueCountFrequency (%)
236
 
13.0%
59
 
3.3%
54
 
3.0%
51
 
2.8%
48
 
2.6%
26
 
1.4%
24
 
1.3%
22
 
1.2%
21
 
1.2%
19
 
1.0%
Other values (482) 1254
69.1%
Latin
ValueCountFrequency (%)
e 608
 
8.4%
a 577
 
8.0%
i 491
 
6.8%
r 463
 
6.4%
n 455
 
6.3%
t 398
 
5.5%
o 386
 
5.3%
l 349
 
4.8%
s 297
 
4.1%
y 267
 
3.7%
Other values (43) 2928
40.6%
Katakana
ValueCountFrequency (%)
12
 
8.7%
9
 
6.5%
6
 
4.3%
6
 
4.3%
6
 
4.3%
6
 
4.3%
6
 
4.3%
6
 
4.3%
5
 
3.6%
4
 
2.9%
Other values (39) 72
52.2%
Common
ValueCountFrequency (%)
19895
67.6%
; 6241
 
21.2%
. 982
 
3.3%
[ 793
 
2.7%
] 793
 
2.7%
, 177
 
0.6%
· 175
 
0.6%
? 137
 
0.5%
: 64
 
0.2%
- 38
 
0.1%
Other values (23) 144
 
0.5%
Hiragana
ValueCountFrequency (%)
4
18.2%
2
9.1%
2
9.1%
2
9.1%
2
9.1%
2
9.1%
2
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (3) 3
13.6%
Cyrillic
ValueCountFrequency (%)
Ф 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 85864
69.0%
ASCII 36479
29.3%
CJK 1734
 
1.4%
None 177
 
0.1%
Katakana 138
 
0.1%
CJK Compat Ideographs 80
 
0.1%
Hiragana 22
 
< 0.1%
Compat Jamo 8
 
< 0.1%
Cyrillic 1
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
19895
54.5%
; 6241
 
17.1%
. 982
 
2.7%
[ 793
 
2.2%
] 793
 
2.2%
e 608
 
1.7%
a 577
 
1.6%
i 491
 
1.3%
r 463
 
1.3%
n 455
 
1.2%
Other values (72) 5181
 
14.2%
Hangul
ValueCountFrequency (%)
5713
 
6.7%
5592
 
6.5%
4374
 
5.1%
2757
 
3.2%
2556
 
3.0%
2474
 
2.9%
2309
 
2.7%
2018
 
2.4%
1257
 
1.5%
1256
 
1.5%
Other values (852) 55558
64.7%
CJK
ValueCountFrequency (%)
236
 
13.6%
59
 
3.4%
51
 
2.9%
48
 
2.8%
26
 
1.5%
24
 
1.4%
22
 
1.3%
21
 
1.2%
19
 
1.1%
18
 
1.0%
Other values (463) 1210
69.8%
None
ValueCountFrequency (%)
· 175
98.9%
2
 
1.1%
CJK Compat Ideographs
ValueCountFrequency (%)
54
67.5%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
1
 
1.2%
1
 
1.2%
1
 
1.2%
Other values (9) 9
 
11.2%
Katakana
ValueCountFrequency (%)
12
 
8.7%
9
 
6.5%
6
 
4.3%
6
 
4.3%
6
 
4.3%
6
 
4.3%
6
 
4.3%
6
 
4.3%
5
 
3.6%
4
 
2.9%
Other values (39) 72
52.2%
Compat Jamo
ValueCountFrequency (%)
6
75.0%
1
 
12.5%
1
 
12.5%
Hiragana
ValueCountFrequency (%)
4
18.2%
2
9.1%
2
9.1%
2
9.1%
2
9.1%
2
9.1%
2
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (3) 3
13.6%
Cyrillic
ValueCountFrequency (%)
Ф 1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct2841
Distinct (%)28.4%
Missing7
Missing (%)0.1%
Memory size156.2 KiB
2024-04-30T07:32:43.684277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length35
Mean length4.3712599
Min length1

Characters and Unicode

Total characters43682
Distinct characters911
Distinct categories10 ?
Distinct scripts6 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1616 ?
Unique (%)16.2%

Sample

1st row북섬
2nd row길벗어린이
3rd row아이세움
4th row대원사
5th row장서가:청어람M&B
ValueCountFrequency (%)
문학동네 157
 
1.5%
김영사 135
 
1.3%
교원 135
 
1.3%
시공사 107
 
1.0%
비룡소 102
 
1.0%
웅진닷컴 91
 
0.9%
민음사 80
 
0.8%
창비 75
 
0.7%
프뢰벨 70
 
0.7%
문학과지성사 67
 
0.6%
Other values (2891) 9463
90.3%
2024-04-30T07:32:44.133322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2608
 
6.0%
1170
 
2.7%
848
 
1.9%
848
 
1.9%
814
 
1.9%
814
 
1.9%
812
 
1.9%
654
 
1.5%
594
 
1.4%
569
 
1.3%
Other values (901) 33951
77.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39419
90.2%
Lowercase Letter 2266
 
5.2%
Uppercase Letter 866
 
2.0%
Space Separator 532
 
1.2%
Other Punctuation 404
 
0.9%
Decimal Number 139
 
0.3%
Close Punctuation 21
 
< 0.1%
Open Punctuation 21
 
< 0.1%
Dash Punctuation 12
 
< 0.1%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2608
 
6.6%
1170
 
3.0%
848
 
2.2%
848
 
2.2%
814
 
2.1%
814
 
2.1%
812
 
2.1%
654
 
1.7%
594
 
1.5%
569
 
1.4%
Other values (822) 29688
75.3%
Lowercase Letter
ValueCountFrequency (%)
o 349
15.4%
a 200
 
8.8%
i 188
 
8.3%
e 170
 
7.5%
s 155
 
6.8%
n 153
 
6.8%
r 141
 
6.2%
l 136
 
6.0%
c 100
 
4.4%
t 100
 
4.4%
Other values (15) 574
25.3%
Uppercase Letter
ValueCountFrequency (%)
B 197
22.7%
M 127
14.7%
S 76
 
8.8%
P 52
 
6.0%
H 49
 
5.7%
K 44
 
5.1%
O 42
 
4.8%
C 38
 
4.4%
I 30
 
3.5%
L 25
 
2.9%
Other values (14) 186
21.5%
Other Punctuation
ValueCountFrequency (%)
: 162
40.1%
& 79
19.6%
? 70
17.3%
* 23
 
5.7%
. 21
 
5.2%
, 16
 
4.0%
12
 
3.0%
' 7
 
1.7%
; 6
 
1.5%
@ 3
 
0.7%
Other values (3) 5
 
1.2%
Decimal Number
ValueCountFrequency (%)
2 60
43.2%
1 55
39.6%
0 13
 
9.4%
5 3
 
2.2%
8 2
 
1.4%
4 2
 
1.4%
9 1
 
0.7%
6 1
 
0.7%
3 1
 
0.7%
7 1
 
0.7%
Close Punctuation
ValueCountFrequency (%)
) 20
95.2%
] 1
 
4.8%
Open Punctuation
ValueCountFrequency (%)
( 20
95.2%
[ 1
 
4.8%
Space Separator
ValueCountFrequency (%)
532
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37995
87.0%
Latin 3132
 
7.2%
Han 1408
 
3.2%
Common 1131
 
2.6%
Hiragana 8
 
< 0.1%
Katakana 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2608
 
6.9%
1170
 
3.1%
848
 
2.2%
848
 
2.2%
814
 
2.1%
814
 
2.1%
812
 
2.1%
654
 
1.7%
594
 
1.6%
569
 
1.5%
Other values (587) 28264
74.4%
Han
ValueCountFrequency (%)
209
 
14.8%
101
 
7.2%
67
 
4.8%
67
 
4.8%
37
 
2.6%
33
 
2.3%
32
 
2.3%
24
 
1.7%
23
 
1.6%
23
 
1.6%
Other values (211) 792
56.2%
Latin
ValueCountFrequency (%)
o 349
 
11.1%
a 200
 
6.4%
B 197
 
6.3%
i 188
 
6.0%
e 170
 
5.4%
s 155
 
4.9%
n 153
 
4.9%
r 141
 
4.5%
l 136
 
4.3%
M 127
 
4.1%
Other values (39) 1316
42.0%
Common
ValueCountFrequency (%)
532
47.0%
: 162
 
14.3%
& 79
 
7.0%
? 70
 
6.2%
2 60
 
5.3%
1 55
 
4.9%
* 23
 
2.0%
. 21
 
1.9%
) 20
 
1.8%
( 20
 
1.8%
Other values (20) 89
 
7.9%
Katakana
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Hiragana
ValueCountFrequency (%)
3
37.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37995
87.0%
ASCII 4248
 
9.7%
CJK 1387
 
3.2%
CJK Compat Ideographs 21
 
< 0.1%
None 15
 
< 0.1%
Hiragana 8
 
< 0.1%
Katakana 8
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2608
 
6.9%
1170
 
3.1%
848
 
2.2%
848
 
2.2%
814
 
2.1%
814
 
2.1%
812
 
2.1%
654
 
1.7%
594
 
1.6%
569
 
1.5%
Other values (587) 28264
74.4%
ASCII
ValueCountFrequency (%)
532
 
12.5%
o 349
 
8.2%
a 200
 
4.7%
B 197
 
4.6%
i 188
 
4.4%
e 170
 
4.0%
: 162
 
3.8%
s 155
 
3.6%
n 153
 
3.6%
r 141
 
3.3%
Other values (66) 2001
47.1%
CJK
ValueCountFrequency (%)
209
 
15.1%
101
 
7.3%
67
 
4.8%
67
 
4.8%
37
 
2.7%
33
 
2.4%
32
 
2.3%
24
 
1.7%
23
 
1.7%
23
 
1.7%
Other values (206) 771
55.6%
CJK Compat Ideographs
ValueCountFrequency (%)
12
57.1%
6
28.6%
1
 
4.8%
1
 
4.8%
1
 
4.8%
None
ValueCountFrequency (%)
12
80.0%
· 2
 
13.3%
đ 1
 
6.7%
Hiragana
ValueCountFrequency (%)
3
37.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
Katakana
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Missing values

2024-04-30T07:32:41.303726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T07:32:41.380054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-30T07:32:41.466416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

서명저자출판사
61854나의 고전 읽기공지영 외 지음북섬
23671꽃 피우는 아이 티스투모리스 드뤼옹 지음;나선희 옮김길벗어린이
62071지구는 대단해고하라 도모유키 글;마츠오카 다츠히데 그림;신미원 옮김아이세움
10743약용식물김태정 글,사진대원사
76817날마다 홍차김유나 지음장서가:청어람M&B
41191식충식물의 세계전의식;김정환 공저도요새
36704미스 론리하트너새네이얼 웨스트 지음;이종인 옮김마음산책
2949윤리학의 역사강재륜 지음大旺社
56690(사진으로 보는)문경의 근대 100년사문경시 [편]문경시
813062012 신춘문예 당선시집김민철;류성훈;안미옥;여성민 [공]지음문학세계사
서명저자출판사
75519다빈치 푸드:판타지 요리과학만화스튜디오 애니멀 지음아울북:북이십일
59612(코믹 메이플스토리) 수학도둑송도수 글;서정 엔터테인먼트 그림;여운방 콘텐츠서울문화사
45838베짱이 할아버지김나무 글;강전희 그림문학동네어린이
27269아기옷<NA>홍익
88178무민과 아빠의 첫 운전토베 얀손 지음;이지영 옮김어린이작가정신
94798이건희의 서재:고독 몰입 독서로 미래를 창조하라안상헌 지음책비
5497唐詩三百首 I채지충 만화;황병국 번역대현
20755원감국사가송고대민족문화연구소 [편];이종찬 역주고대민족문화연구소
66497아웃=Out기리노 나쓰오 지음;김수현 옮김황금가지
66905천 년의 사랑 직지조경희 글;박철민 그림대교출판

Duplicate rows

Most frequently occurring

서명저자출판사# duplicates
330이조실록사회과학원 민족고전연구소 번역여강42
399헤밍웨이 테마위인<NA>한국헤밍웨이12
310옥스퍼드 원어 성경대전=(The) Oxford Bible interpreter제자원 편제자원11
75(프뢰벨) 자연관찰/<NA>프뢰벨10
224바투바투 인물이야기=:<NA>웅진닷컴9
8(戰略) 삼국지요코야마 미쓰테루 지음;박영 옮김대현8
43(월간) 도예<NA>월간 세라믹스,8
62(코믹 메이플스토리) 수학도둑송도수 글;서정은 그림서울문화사8
71(프뢰벨) 뉴 컨셉동화<NA>프뢰벨8
76(프뢰벨) 테마영어동화;프뢰벨 유아교육연구소 옮김프뢰벨8