Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells19
Missing cells (%)< 0.1%
Duplicate rows38
Duplicate rows (%)0.4%
Total size in memory468.8 KiB
Average record size in memory48.0 B

Variable types

Text4
DateTime1

Dataset

Description대구광역시 관내 공공도서관으로 시민들이 신청한 희망도서신청 현황자료 입니다.서명, 저자, 출판사, 신청일자, 신청도서관 항목을 포함합니다.
Author대구광역시
URLhttps://www.data.go.kr/data/15089207/fileData.do

Alerts

Dataset has 38 (0.4%) duplicate rowsDuplicates

Reproduction

Analysis started2024-04-18 00:19:29.151098
Analysis finished2024-04-18 00:19:31.733815
Duration2.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

서명
Text

Distinct8844
Distinct (%)88.4%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-18T09:19:31.996347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length172
Median length81
Mean length19.086309
Min length1

Characters and Unicode

Total characters190844
Distinct characters1501
Distinct categories16 ?
Distinct scripts7 ?
Distinct blocks13 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8035 ?
Unique (%)80.4%

Sample

1st row빈대 가족의 덜렁이도 같이 펀딩 (대한민국 공식 짠돌이 빈대 가족에게 배우는 경제 지혜)
2nd row작별하지 않는다 (한강 장편소설)
3rd row역사의 오른편 옳은편
4th row묘한 서점
5th row새들에 관한 짧은 철학
ValueCountFrequency (%)
419
 
0.9%
1 331
 
0.7%
위한 297
 
0.6%
2 252
 
0.5%
이야기 191
 
0.4%
나는 177
 
0.4%
장편소설 156
 
0.3%
the 143
 
0.3%
137
 
0.3%
126
 
0.3%
Other values (18527) 45346
95.3%
2024-04-18T09:19:32.495726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38091
 
20.0%
3771
 
2.0%
3612
 
1.9%
( 3534
 
1.9%
) 3364
 
1.8%
2836
 
1.5%
1967
 
1.0%
1890
 
1.0%
1828
 
1.0%
1712
 
0.9%
Other values (1491) 128239
67.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 125535
65.8%
Space Separator 38091
 
20.0%
Lowercase Letter 8372
 
4.4%
Decimal Number 4926
 
2.6%
Open Punctuation 3604
 
1.9%
Close Punctuation 3434
 
1.8%
Uppercase Letter 3256
 
1.7%
Other Punctuation 3140
 
1.6%
Dash Punctuation 253
 
0.1%
Math Symbol 133
 
0.1%
Other values (6) 100
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3771
 
3.0%
3612
 
2.9%
2836
 
2.3%
1967
 
1.6%
1890
 
1.5%
1828
 
1.5%
1712
 
1.4%
1647
 
1.3%
1586
 
1.3%
1532
 
1.2%
Other values (1362) 103154
82.2%
Lowercase Letter
ValueCountFrequency (%)
e 1062
12.7%
o 871
10.4%
a 759
 
9.1%
r 751
 
9.0%
t 556
 
6.6%
i 553
 
6.6%
n 505
 
6.0%
s 412
 
4.9%
h 376
 
4.5%
d 367
 
4.4%
Other values (33) 2160
25.8%
Uppercase Letter
ValueCountFrequency (%)
C 276
 
8.5%
T 273
 
8.4%
H 255
 
7.8%
S 248
 
7.6%
A 206
 
6.3%
B 192
 
5.9%
P 182
 
5.6%
E 165
 
5.1%
D 151
 
4.6%
M 147
 
4.5%
Other values (21) 1161
35.7%
Other Punctuation
ValueCountFrequency (%)
, 1189
37.9%
. 763
24.3%
! 327
 
10.4%
: 287
 
9.1%
? 264
 
8.4%
& 101
 
3.2%
; 60
 
1.9%
# 55
 
1.8%
· 43
 
1.4%
/ 22
 
0.7%
Other values (4) 29
 
0.9%
Math Symbol
ValueCountFrequency (%)
+ 62
46.6%
~ 55
41.4%
× 4
 
3.0%
= 3
 
2.3%
| 2
 
1.5%
1
 
0.8%
1
 
0.8%
1
 
0.8%
1
 
0.8%
1
 
0.8%
Other values (2) 2
 
1.5%
Decimal Number
ValueCountFrequency (%)
1 1350
27.4%
2 973
19.8%
0 879
17.8%
3 482
 
9.8%
5 307
 
6.2%
4 272
 
5.5%
9 213
 
4.3%
7 161
 
3.3%
6 161
 
3.3%
8 128
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 3534
98.1%
[ 63
 
1.7%
6
 
0.2%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 3364
98.0%
] 63
 
1.8%
6
 
0.2%
1
 
< 0.1%
Final Punctuation
ValueCountFrequency (%)
46
95.8%
2
 
4.2%
Initial Punctuation
ValueCountFrequency (%)
40
95.2%
2
 
4.8%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
38091
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 253
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 125218
65.6%
Common 53679
28.1%
Latin 11595
 
6.1%
Han 155
 
0.1%
Hiragana 129
 
0.1%
Cyrillic 35
 
< 0.1%
Katakana 33
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3771
 
3.0%
3612
 
2.9%
2836
 
2.3%
1967
 
1.6%
1890
 
1.5%
1828
 
1.5%
1712
 
1.4%
1647
 
1.3%
1586
 
1.3%
1532
 
1.2%
Other values (1225) 102837
82.1%
Han
ValueCountFrequency (%)
11
 
7.1%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
4
 
2.6%
4
 
2.6%
Other values (70) 101
65.2%
Latin
ValueCountFrequency (%)
e 1062
 
9.2%
o 871
 
7.5%
a 759
 
6.5%
r 751
 
6.5%
t 556
 
4.8%
i 553
 
4.8%
n 505
 
4.4%
s 412
 
3.6%
h 376
 
3.2%
d 367
 
3.2%
Other values (44) 5383
46.4%
Common
ValueCountFrequency (%)
38091
71.0%
( 3534
 
6.6%
) 3364
 
6.3%
1 1350
 
2.5%
, 1189
 
2.2%
2 973
 
1.8%
0 879
 
1.6%
. 763
 
1.4%
3 482
 
0.9%
! 327
 
0.6%
Other values (43) 2727
 
5.1%
Hiragana
ValueCountFrequency (%)
20
 
15.5%
8
 
6.2%
7
 
5.4%
6
 
4.7%
5
 
3.9%
5
 
3.9%
5
 
3.9%
5
 
3.9%
5
 
3.9%
4
 
3.1%
Other values (33) 59
45.7%
Cyrillic
ValueCountFrequency (%)
и 4
 
11.4%
г 3
 
8.6%
е 3
 
8.6%
т 3
 
8.6%
н 2
 
5.7%
л 2
 
5.7%
ь 2
 
5.7%
о 2
 
5.7%
р 1
 
2.9%
я 1
 
2.9%
Other values (12) 12
34.3%
Katakana
ValueCountFrequency (%)
5
15.2%
4
12.1%
3
9.1%
3
9.1%
3
9.1%
3
9.1%
3
9.1%
3
9.1%
1
 
3.0%
1
 
3.0%
Other values (4) 4
12.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 125207
65.6%
ASCII 65113
34.1%
CJK 154
 
0.1%
Hiragana 129
 
0.1%
Punctuation 90
 
< 0.1%
None 64
 
< 0.1%
Cyrillic 35
 
< 0.1%
Katakana 33
 
< 0.1%
Compat Jamo 11
 
< 0.1%
Math Operators 3
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38091
58.5%
( 3534
 
5.4%
) 3364
 
5.2%
1 1350
 
2.1%
, 1189
 
1.8%
e 1062
 
1.6%
2 973
 
1.5%
0 879
 
1.3%
o 871
 
1.3%
. 763
 
1.2%
Other values (78) 13037
 
20.0%
Hangul
ValueCountFrequency (%)
3771
 
3.0%
3612
 
2.9%
2836
 
2.3%
1967
 
1.6%
1890
 
1.5%
1828
 
1.5%
1712
 
1.4%
1647
 
1.3%
1586
 
1.3%
1532
 
1.2%
Other values (1218) 102826
82.1%
Punctuation
ValueCountFrequency (%)
46
51.1%
40
44.4%
2
 
2.2%
2
 
2.2%
None
ValueCountFrequency (%)
· 43
67.2%
6
 
9.4%
6
 
9.4%
× 4
 
6.2%
1
 
1.6%
1
 
1.6%
1
 
1.6%
1
 
1.6%
1
 
1.6%
Hiragana
ValueCountFrequency (%)
20
 
15.5%
8
 
6.2%
7
 
5.4%
6
 
4.7%
5
 
3.9%
5
 
3.9%
5
 
3.9%
5
 
3.9%
5
 
3.9%
4
 
3.1%
Other values (33) 59
45.7%
CJK
ValueCountFrequency (%)
11
 
7.1%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
5
 
3.2%
4
 
2.6%
4
 
2.6%
Other values (69) 100
64.9%
Katakana
ValueCountFrequency (%)
5
15.2%
4
12.1%
3
9.1%
3
9.1%
3
9.1%
3
9.1%
3
9.1%
3
9.1%
1
 
3.0%
1
 
3.0%
Other values (4) 4
12.1%
Cyrillic
ValueCountFrequency (%)
и 4
 
11.4%
г 3
 
8.6%
е 3
 
8.6%
т 3
 
8.6%
н 2
 
5.7%
л 2
 
5.7%
ь 2
 
5.7%
о 2
 
5.7%
р 1
 
2.9%
я 1
 
2.9%
Other values (12) 12
34.3%
Compat Jamo
ValueCountFrequency (%)
2
18.2%
2
18.2%
2
18.2%
2
18.2%
1
9.1%
1
9.1%
1
9.1%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
Math Operators
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct6836
Distinct (%)68.4%
Missing7
Missing (%)0.1%
Memory size156.2 KiB
2024-04-18T09:19:32.787751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length106
Median length92
Mean length6.030021
Min length1

Characters and Unicode

Total characters60258
Distinct characters937
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5379 ?
Unique (%)53.8%

Sample

1st row임창호
2nd row한강
3rd row벤 샤피로 저
4th row김지선, 이선아
5th row필리프 J. 뒤부아|엘리즈 루소
ValueCountFrequency (%)
지은이 157
 
1.0%
지음 140
 
0.9%
91
 
0.6%
편집부 64
 
0.4%
히로시마 59
 
0.4%
레이코 59
 
0.4%
옮긴이 49
 
0.3%
43
 
0.3%
게이고 43
 
0.3%
히가시노 43
 
0.3%
Other values (8515) 14741
95.2%
2024-04-18T09:19:33.349207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5685
 
9.4%
2364
 
3.9%
1543
 
2.6%
| 1182
 
2.0%
1093
 
1.8%
934
 
1.6%
891
 
1.5%
856
 
1.4%
, 725
 
1.2%
684
 
1.1%
Other values (927) 44301
73.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46929
77.9%
Space Separator 5685
 
9.4%
Lowercase Letter 3172
 
5.3%
Uppercase Letter 1351
 
2.2%
Math Symbol 1200
 
2.0%
Other Punctuation 1046
 
1.7%
Close Punctuation 411
 
0.7%
Open Punctuation 411
 
0.7%
Decimal Number 36
 
0.1%
Dash Punctuation 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2364
 
5.0%
1543
 
3.3%
1093
 
2.3%
934
 
2.0%
891
 
1.9%
856
 
1.8%
684
 
1.5%
604
 
1.3%
583
 
1.2%
555
 
1.2%
Other values (846) 36822
78.5%
Lowercase Letter
ValueCountFrequency (%)
a 389
12.3%
e 349
11.0%
n 306
9.6%
i 277
 
8.7%
r 251
 
7.9%
o 244
 
7.7%
l 196
 
6.2%
t 178
 
5.6%
s 148
 
4.7%
h 101
 
3.2%
Other values (16) 733
23.1%
Uppercase Letter
ValueCountFrequency (%)
M 104
 
7.7%
L 96
 
7.1%
J 94
 
7.0%
A 90
 
6.7%
T 82
 
6.1%
B 80
 
5.9%
S 79
 
5.8%
C 78
 
5.8%
D 72
 
5.3%
R 67
 
5.0%
Other values (15) 509
37.7%
Decimal Number
ValueCountFrequency (%)
3 8
22.2%
1 8
22.2%
4 5
13.9%
9 4
11.1%
0 3
 
8.3%
8 2
 
5.6%
6 2
 
5.6%
5 2
 
5.6%
7 1
 
2.8%
2 1
 
2.8%
Other Punctuation
ValueCountFrequency (%)
, 725
69.3%
. 253
 
24.2%
/ 22
 
2.1%
· 15
 
1.4%
; 11
 
1.1%
& 8
 
0.8%
: 6
 
0.6%
? 4
 
0.4%
# 2
 
0.2%
Math Symbol
ValueCountFrequency (%)
| 1182
98.5%
> 9
 
0.8%
< 9
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 408
99.3%
] 2
 
0.5%
1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 408
99.3%
[ 2
 
0.5%
1
 
0.2%
Space Separator
ValueCountFrequency (%)
5685
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46917
77.9%
Common 8806
 
14.6%
Latin 4523
 
7.5%
Han 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2364
 
5.0%
1543
 
3.3%
1093
 
2.3%
934
 
2.0%
891
 
1.9%
856
 
1.8%
684
 
1.5%
604
 
1.3%
583
 
1.2%
555
 
1.2%
Other values (838) 36810
78.5%
Latin
ValueCountFrequency (%)
a 389
 
8.6%
e 349
 
7.7%
n 306
 
6.8%
i 277
 
6.1%
r 251
 
5.5%
o 244
 
5.4%
l 196
 
4.3%
t 178
 
3.9%
s 148
 
3.3%
M 104
 
2.3%
Other values (41) 2081
46.0%
Common
ValueCountFrequency (%)
5685
64.6%
| 1182
 
13.4%
, 725
 
8.2%
) 408
 
4.6%
( 408
 
4.6%
. 253
 
2.9%
/ 22
 
0.2%
- 17
 
0.2%
· 15
 
0.2%
; 11
 
0.1%
Other values (20) 80
 
0.9%
Han
ValueCountFrequency (%)
2
16.7%
2
16.7%
2
16.7%
2
16.7%
1
8.3%
1
8.3%
1
8.3%
1
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46913
77.9%
ASCII 13312
 
22.1%
None 17
 
< 0.1%
CJK 12
 
< 0.1%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5685
42.7%
| 1182
 
8.9%
, 725
 
5.4%
) 408
 
3.1%
( 408
 
3.1%
a 389
 
2.9%
e 349
 
2.6%
n 306
 
2.3%
i 277
 
2.1%
. 253
 
1.9%
Other values (68) 3330
25.0%
Hangul
ValueCountFrequency (%)
2364
 
5.0%
1543
 
3.3%
1093
 
2.3%
934
 
2.0%
891
 
1.9%
856
 
1.8%
684
 
1.5%
604
 
1.3%
583
 
1.2%
555
 
1.2%
Other values (835) 36806
78.5%
None
ValueCountFrequency (%)
· 15
88.2%
1
 
5.9%
1
 
5.9%
CJK
ValueCountFrequency (%)
2
16.7%
2
16.7%
2
16.7%
2
16.7%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Compat Jamo
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Distinct2729
Distinct (%)27.3%
Missing9
Missing (%)0.1%
Memory size156.2 KiB
2024-04-18T09:19:33.638825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length32
Mean length4.7746972
Min length1

Characters and Unicode

Total characters47704
Distinct characters758
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1529 ?
Unique (%)15.3%

Sample

1st row재미북스
2nd row문학동네
3rd row기파랑
4th row새벽감성
5th row다른
ValueCountFrequency (%)
위즈덤하우스 197
 
1.9%
문학동네 184
 
1.8%
창비 160
 
1.6%
비룡소 97
 
0.9%
김영사 94
 
0.9%
민음사 90
 
0.9%
알에이치코리아 89
 
0.9%
아이세움 81
 
0.8%
아울북 75
 
0.7%
길벗 72
 
0.7%
Other values (2700) 9107
88.9%
2024-04-18T09:19:34.808166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2245
 
4.7%
1493
 
3.1%
1367
 
2.9%
1357
 
2.8%
808
 
1.7%
775
 
1.6%
684
 
1.4%
649
 
1.4%
647
 
1.4%
610
 
1.3%
Other values (748) 37069
77.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40516
84.9%
Lowercase Letter 3915
 
8.2%
Uppercase Letter 1733
 
3.6%
Close Punctuation 377
 
0.8%
Open Punctuation 377
 
0.8%
Space Separator 368
 
0.8%
Decimal Number 319
 
0.7%
Other Punctuation 91
 
0.2%
Math Symbol 4
 
< 0.1%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2245
 
5.5%
1493
 
3.7%
1367
 
3.4%
1357
 
3.3%
808
 
2.0%
775
 
1.9%
684
 
1.7%
649
 
1.6%
647
 
1.6%
610
 
1.5%
Other values (670) 29881
73.8%
Uppercase Letter
ValueCountFrequency (%)
B 231
13.3%
O 182
 
10.5%
K 159
 
9.2%
S 153
 
8.8%
P 123
 
7.1%
H 108
 
6.2%
C 98
 
5.7%
R 81
 
4.7%
E 65
 
3.8%
A 61
 
3.5%
Other values (16) 472
27.2%
Lowercase Letter
ValueCountFrequency (%)
o 523
13.4%
e 364
 
9.3%
s 321
 
8.2%
r 308
 
7.9%
i 294
 
7.5%
n 284
 
7.3%
a 283
 
7.2%
l 216
 
5.5%
t 170
 
4.3%
k 164
 
4.2%
Other values (15) 988
25.2%
Decimal Number
ValueCountFrequency (%)
2 165
51.7%
1 85
26.6%
6 18
 
5.6%
3 13
 
4.1%
9 9
 
2.8%
4 9
 
2.8%
0 8
 
2.5%
5 7
 
2.2%
8 3
 
0.9%
7 2
 
0.6%
Other Punctuation
ValueCountFrequency (%)
& 47
51.6%
. 16
 
17.6%
# 9
 
9.9%
; 8
 
8.8%
, 3
 
3.3%
: 3
 
3.3%
/ 2
 
2.2%
? 1
 
1.1%
' 1
 
1.1%
! 1
 
1.1%
Math Symbol
ValueCountFrequency (%)
| 3
75.0%
+ 1
 
25.0%
Close Punctuation
ValueCountFrequency (%)
) 377
100.0%
Open Punctuation
ValueCountFrequency (%)
( 377
100.0%
Space Separator
ValueCountFrequency (%)
368
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40509
84.9%
Latin 5648
 
11.8%
Common 1540
 
3.2%
Han 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2245
 
5.5%
1493
 
3.7%
1367
 
3.4%
1357
 
3.3%
808
 
2.0%
775
 
1.9%
684
 
1.7%
649
 
1.6%
647
 
1.6%
610
 
1.5%
Other values (663) 29874
73.7%
Latin
ValueCountFrequency (%)
o 523
 
9.3%
e 364
 
6.4%
s 321
 
5.7%
r 308
 
5.5%
i 294
 
5.2%
n 284
 
5.0%
a 283
 
5.0%
B 231
 
4.1%
l 216
 
3.8%
O 182
 
3.2%
Other values (41) 2642
46.8%
Common
ValueCountFrequency (%)
) 377
24.5%
( 377
24.5%
368
23.9%
2 165
10.7%
1 85
 
5.5%
& 47
 
3.1%
6 18
 
1.2%
. 16
 
1.0%
3 13
 
0.8%
# 9
 
0.6%
Other values (17) 65
 
4.2%
Han
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40503
84.9%
ASCII 7188
 
15.1%
CJK 7
 
< 0.1%
Compat Jamo 6
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2245
 
5.5%
1493
 
3.7%
1367
 
3.4%
1357
 
3.4%
808
 
2.0%
775
 
1.9%
684
 
1.7%
649
 
1.6%
647
 
1.6%
610
 
1.5%
Other values (658) 29868
73.7%
ASCII
ValueCountFrequency (%)
o 523
 
7.3%
) 377
 
5.2%
( 377
 
5.2%
368
 
5.1%
e 364
 
5.1%
s 321
 
4.5%
r 308
 
4.3%
i 294
 
4.1%
n 284
 
4.0%
a 283
 
3.9%
Other values (68) 3689
51.3%
Compat Jamo
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
CJK
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Distinct647
Distinct (%)6.5%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
Minimum2020-01-01 00:00:00
Maximum2021-10-10 00:00:00
2024-04-18T09:19:34.959538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T09:19:35.085768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct75
Distinct (%)0.8%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-18T09:19:35.280274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length10.129313
Min length4

Characters and Unicode

Total characters101283
Distinct characters134
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row달서구립 성서도서관
2nd row동구 안심도서관
3rd row삼국유사군위도서관
4th row대구광역시립두류도서관
5th row대구광역시립수성도서관
ValueCountFrequency (%)
수성구립 2165
 
13.2%
달서구립 1831
 
11.2%
북구 1156
 
7.1%
동구 826
 
5.1%
범어도서관 663
 
4.1%
대구광역시립북부도서관 590
 
3.6%
국채보상운동기념도서관 567
 
3.5%
용학도서관 532
 
3.3%
고산도서관 489
 
3.0%
가족문화도서관 466
 
2.9%
Other values (71) 7062
43.2%
2024-04-18T09:19:35.655342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12601
 
12.4%
10423
 
10.3%
10007
 
9.9%
9003
 
8.9%
6589
 
6.5%
6348
 
6.3%
3415
 
3.4%
2968
 
2.9%
2816
 
2.8%
2381
 
2.4%
Other values (124) 34732
34.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 94078
92.9%
Space Separator 6348
 
6.3%
Decimal Number 632
 
0.6%
Other Punctuation 181
 
0.2%
Open Punctuation 22
 
< 0.1%
Close Punctuation 22
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12601
 
13.4%
10423
 
11.1%
10007
 
10.6%
9003
 
9.6%
6589
 
7.0%
3415
 
3.6%
2968
 
3.2%
2816
 
3.0%
2381
 
2.5%
2370
 
2.5%
Other values (114) 31505
33.5%
Decimal Number
ValueCountFrequency (%)
2 383
60.6%
8 180
28.5%
1 41
 
6.5%
3 24
 
3.8%
4 4
 
0.6%
Other Punctuation
ValueCountFrequency (%)
· 180
99.4%
. 1
 
0.6%
Space Separator
ValueCountFrequency (%)
6348
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 94078
92.9%
Common 7205
 
7.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12601
 
13.4%
10423
 
11.1%
10007
 
10.6%
9003
 
9.6%
6589
 
7.0%
3415
 
3.6%
2968
 
3.2%
2816
 
3.0%
2381
 
2.5%
2370
 
2.5%
Other values (114) 31505
33.5%
Common
ValueCountFrequency (%)
6348
88.1%
2 383
 
5.3%
8 180
 
2.5%
· 180
 
2.5%
1 41
 
0.6%
3 24
 
0.3%
( 22
 
0.3%
) 22
 
0.3%
4 4
 
0.1%
. 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 94078
92.9%
ASCII 7025
 
6.9%
None 180
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12601
 
13.4%
10423
 
11.1%
10007
 
10.6%
9003
 
9.6%
6589
 
7.0%
3415
 
3.6%
2968
 
3.2%
2816
 
3.0%
2381
 
2.5%
2370
 
2.5%
Other values (114) 31505
33.5%
ASCII
ValueCountFrequency (%)
6348
90.4%
2 383
 
5.5%
8 180
 
2.6%
1 41
 
0.6%
3 24
 
0.3%
( 22
 
0.3%
) 22
 
0.3%
4 4
 
0.1%
. 1
 
< 0.1%
None
ValueCountFrequency (%)
· 180
100.0%

Missing values

2024-04-18T09:19:31.554300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-18T09:19:31.661044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

서명저자출판사신청일자도서관명
51407빈대 가족의 덜렁이도 같이 펀딩 (대한민국 공식 짠돌이 빈대 가족에게 배우는 경제 지혜)임창호재미북스2021-02-06달서구립 성서도서관
92981작별하지 않는다 (한강 장편소설)한강문학동네2021-09-02동구 안심도서관
59171역사의 오른편 옳은편벤 샤피로 저기파랑2021-03-20삼국유사군위도서관
29867묘한 서점김지선, 이선아새벽감성2020-08-23대구광역시립두류도서관
66440새들에 관한 짧은 철학필리프 J. 뒤부아|엘리즈 루소다른2021-04-28대구광역시립수성도서관
58259사랑이 달리다심윤경문학동네2021-03-14달서구립 가족문화도서관
91648달콤한 복수 주식회사 (Hamnden ar ljuv AB)요나스 요나손 지음|임호경열린책들2021-08-28북구 한강공원부키도서관
14804여자 주인공만 모른다 (재미있는 영화 클리셰 사전)듀나제우미디어2020-05-14북구 대현도서관
44225재즈 피아노 교본박소연동락(도서출판)2020-12-10수성구립 범어도서관
85812안녕한, 가 (삶이 버겁다고 느끼는 이들에게 전하는 소박하고 성실한 일상의 기록)무과수위즈덤하우스2021-08-02달서구립 성서도서관
서명저자출판사신청일자도서관명
38626화장실 좀 써도 돼?세르지오 루치에르미디어창비2020-10-29수성구립 무학숲도서관
41483주식 네 이놈2(기법편)문제룡지서연2020-11-20대구광역시립수성도서관
13511마법천자문 11 (참는 마음 참을 인)스튜디오 시리얼아울북2020-05-04달서구립 본리도서관
511모든 공간에는 비밀이 있다최경철웨일북2020-01-03국채보상운동기념도서관
76312질문이 멈춰지면 스스로 답이 된다 (나와 세상에 속지 않고 사는 법)원제불광출판사2021-06-16이천어울림도서관
40470그림으로보는만병통치장습관에다아카시매경출판2020-11-12북구 구수산도서관
69554따님이 기가 세요하말넘많포르체2021-05-14달서구립 가족문화도서관
42100운의 힘박성준소미미디어2020-11-26국채보상운동기념도서관
68381아들아, 돈 공부해야 한다정선용알에이치코리아2021-05-08수성구립 고산도서관
80542종교적 경험의 다양성 (한길그레이트북스 040)윌리엄 제임스한길사2021-07-06동구 안심도서관

Duplicate rows

Most frequently occurring

서명저자출판사신청일자도서관명# duplicates
01년1억 짠테크티티새스마트북스2021-01-21북구 태전도서관2
12030축의 전환마우로F.기옌리더스북2021-03-07달서구립 도원도서관2
290년대생이 온다임홍택whale books2021-07-06달서구립 도원도서관2
3gogo 카카오프렌즈 21 캐나다김미영아울북2021-09-02달서구립 도원도서관2
4간니닌니 마법의 도서관2안성훈아울북2020-09-25대구광역시립북부도서관2
5경성탐정이상 5권김재희시공사2021-02-16동구 안심도서관2
6그건 쓰레기가아니라고요홍수열슬로비2021-01-11대구광역시립북부도서관2
7꿈꾸는 엄마의 미라클모닝김연지유노라이프2021-06-07동구 안심도서관2
8당신의 문해력김윤정Ebs books2021-09-15대구광역시립북부도서관2
9떠난 후에 남겨진 것들김새별 전애원청림출판2021-07-25대구광역시립달성도서관2