Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells7
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory712.9 KiB
Average record size in memory73.0 B

Variable types

Categorical2
Text5
Numeric1

Dataset

Description대전광역시 유성구 진잠도서관에서 보유하고 있는 도서목록에 대한 데이터로 소장처, 자료실, 등록번호, 설명, 저자, 출판사, 출판년, 청구기호 등의 항목을 제공합니다.
Author대전광역시 유성구
URLhttps://www.data.go.kr/data/15053382/fileData.do

Alerts

소장처 has constant value ""Constant
등록번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:19:57.287201
Analysis finished2023-12-12 21:19:59.959304
Duration2.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

소장처
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
진잠도서관
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row진잠도서관
2nd row진잠도서관
3rd row진잠도서관
4th row진잠도서관
5th row진잠도서관

Common Values

ValueCountFrequency (%)
진잠도서관 10000
100.0%

Length

2023-12-13T06:20:00.036553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:20:00.134162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
진잠도서관 10000
100.0%

자료실
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
어린이실
5471 
종합자료실
4529 

Length

Max length5
Median length4
Mean length4.4529
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종합자료실
2nd row어린이실
3rd row어린이실
4th row종합자료실
5th row어린이실

Common Values

ValueCountFrequency (%)
어린이실 5471
54.7%
종합자료실 4529
45.3%

Length

2023-12-13T06:20:00.234545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:20:00.330866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
어린이실 5471
54.7%
종합자료실 4529
45.3%

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:20:00.628246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.5319
Min length4

Characters and Unicode

Total characters75319
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowCEM87950
2nd rowCEM82135
3rd rowCEM68971
4th rowEM23565
5th rowEM51941
ValueCountFrequency (%)
cem87950 1
 
< 0.1%
em45939 1
 
< 0.1%
cem83597 1
 
< 0.1%
em42536 1
 
< 0.1%
cem84842 1
 
< 0.1%
cem71774 1
 
< 0.1%
cem66680 1
 
< 0.1%
cem86347 1
 
< 0.1%
cem62689 1
 
< 0.1%
em8584 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-13T06:20:01.384892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 10000
13.3%
M 10000
13.3%
C 5656
7.5%
8 5426
7.2%
5 5309
 
7.0%
7 5287
 
7.0%
6 5264
 
7.0%
4 5080
 
6.7%
3 5011
 
6.7%
9 4959
 
6.6%
Other values (3) 13327
17.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 49663
65.9%
Uppercase Letter 25656
34.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 5426
10.9%
5 5309
10.7%
7 5287
10.6%
6 5264
10.6%
4 5080
10.2%
3 5011
10.1%
9 4959
10.0%
2 4920
9.9%
1 4511
9.1%
0 3896
7.8%
Uppercase Letter
ValueCountFrequency (%)
E 10000
39.0%
M 10000
39.0%
C 5656
22.0%

Most occurring scripts

ValueCountFrequency (%)
Common 49663
65.9%
Latin 25656
34.1%

Most frequent character per script

Common
ValueCountFrequency (%)
8 5426
10.9%
5 5309
10.7%
7 5287
10.6%
6 5264
10.6%
4 5080
10.2%
3 5011
10.1%
9 4959
10.0%
2 4920
9.9%
1 4511
9.1%
0 3896
7.8%
Latin
ValueCountFrequency (%)
E 10000
39.0%
M 10000
39.0%
C 5656
22.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 75319
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 10000
13.3%
M 10000
13.3%
C 5656
7.5%
8 5426
7.2%
5 5309
 
7.0%
7 5287
 
7.0%
6 5264
 
7.0%
4 5080
 
6.7%
3 5011
 
6.7%
9 4959
 
6.6%
Other values (3) 13327
17.7%

서명
Text

Distinct9861
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:20:01.768982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length134
Median length80
Mean length21.2187
Min length1

Characters and Unicode

Total characters212187
Distinct characters1682
Distinct categories17 ?
Distinct scripts7 ?
Distinct blocks13 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9732 ?
Unique (%)97.3%

Sample

1st row베토벤 : 절망의 심연에서 불러낸 환희의 선율 = Ludwig Van Beethoven
2nd rowShirley homes and the lithuanian case
3rd row우리 모두 이웃이야! :서로 친친! 지구 마을 사람들
4th row(네이피어가 들려주는) 로그 이야기
5th rowBambi`s Hide-and-Seek
ValueCountFrequency (%)
3911
 
7.1%
이야기 526
 
1.0%
the 304
 
0.6%
장편소설 301
 
0.5%
1 287
 
0.5%
위한 273
 
0.5%
2 267
 
0.5%
우리 198
 
0.4%
173
 
0.3%
나는 138
 
0.2%
Other values (22376) 48840
88.4%
2023-12-13T06:20:02.318733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
46916
 
22.1%
: 3823
 
1.8%
3801
 
1.8%
3722
 
1.8%
2867
 
1.4%
2231
 
1.1%
e 2023
 
1.0%
2014
 
0.9%
1830
 
0.9%
1795
 
0.8%
Other values (1672) 141165
66.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 129448
61.0%
Space Separator 46916
 
22.1%
Lowercase Letter 17295
 
8.2%
Other Punctuation 7945
 
3.7%
Decimal Number 3688
 
1.7%
Uppercase Letter 2848
 
1.3%
Open Punctuation 1655
 
0.8%
Close Punctuation 1655
 
0.8%
Math Symbol 486
 
0.2%
Dash Punctuation 174
 
0.1%
Other values (7) 77
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3801
 
2.9%
3722
 
2.9%
2867
 
2.2%
2231
 
1.7%
2014
 
1.6%
1830
 
1.4%
1795
 
1.4%
1731
 
1.3%
1696
 
1.3%
1660
 
1.3%
Other values (1521) 106101
82.0%
Lowercase Letter
ValueCountFrequency (%)
e 2023
11.7%
a 1485
 
8.6%
o 1436
 
8.3%
n 1335
 
7.7%
t 1333
 
7.7%
i 1259
 
7.3%
r 1196
 
6.9%
s 1073
 
6.2%
h 841
 
4.9%
l 692
 
4.0%
Other values (45) 4622
26.7%
Uppercase Letter
ValueCountFrequency (%)
T 310
 
10.9%
S 272
 
9.6%
A 192
 
6.7%
C 191
 
6.7%
M 163
 
5.7%
I 154
 
5.4%
D 143
 
5.0%
B 143
 
5.0%
E 133
 
4.7%
H 123
 
4.3%
Other values (25) 1024
36.0%
Other Punctuation
ValueCountFrequency (%)
: 3823
48.1%
, 1787
22.5%
. 1322
 
16.6%
! 635
 
8.0%
· 149
 
1.9%
' 107
 
1.3%
23
 
0.3%
& 19
 
0.2%
% 16
 
0.2%
15
 
0.2%
Other values (11) 49
 
0.6%
Decimal Number
ValueCountFrequency (%)
1 998
27.1%
0 645
17.5%
2 626
17.0%
3 402
10.9%
4 262
 
7.1%
5 237
 
6.4%
6 144
 
3.9%
7 134
 
3.6%
9 125
 
3.4%
8 115
 
3.1%
Math Symbol
ValueCountFrequency (%)
= 416
85.6%
~ 40
 
8.2%
+ 17
 
3.5%
5
 
1.0%
× 3
 
0.6%
< 2
 
0.4%
> 2
 
0.4%
1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 1571
94.9%
[ 72
 
4.4%
5
 
0.3%
4
 
0.2%
3
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1571
94.9%
] 72
 
4.4%
5
 
0.3%
4
 
0.2%
3
 
0.2%
Dash Punctuation
ValueCountFrequency (%)
- 173
99.4%
1
 
0.6%
Modifier Symbol
ValueCountFrequency (%)
` 44
84.6%
´ 8
 
15.4%
Letter Number
ValueCountFrequency (%)
5
71.4%
2
 
28.6%
Space Separator
ValueCountFrequency (%)
46916
100.0%
Final Punctuation
ValueCountFrequency (%)
7
100.0%
Format
ValueCountFrequency (%)
­ 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
4
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 129068
60.8%
Common 62589
29.5%
Latin 19877
 
9.4%
Cyrillic 273
 
0.1%
Han 262
 
0.1%
Hiragana 104
 
< 0.1%
Katakana 14
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3801
 
2.9%
3722
 
2.9%
2867
 
2.2%
2231
 
1.7%
2014
 
1.6%
1830
 
1.4%
1795
 
1.4%
1731
 
1.3%
1696
 
1.3%
1660
 
1.3%
Other values (1293) 105721
81.9%
Han
ValueCountFrequency (%)
11
 
4.2%
5
 
1.9%
5
 
1.9%
5
 
1.9%
4
 
1.5%
4
 
1.5%
4
 
1.5%
4
 
1.5%
4
 
1.5%
4
 
1.5%
Other values (163) 212
80.9%
Common
ValueCountFrequency (%)
46916
75.0%
: 3823
 
6.1%
, 1787
 
2.9%
( 1571
 
2.5%
) 1571
 
2.5%
. 1322
 
2.1%
1 998
 
1.6%
0 645
 
1.0%
! 635
 
1.0%
2 626
 
1.0%
Other values (49) 2695
 
4.3%
Latin
ValueCountFrequency (%)
e 2023
 
10.2%
a 1485
 
7.5%
o 1436
 
7.2%
n 1335
 
6.7%
t 1333
 
6.7%
i 1259
 
6.3%
r 1196
 
6.0%
s 1073
 
5.4%
h 841
 
4.2%
l 692
 
3.5%
Other values (46) 7204
36.2%
Hiragana
ValueCountFrequency (%)
6
 
5.8%
6
 
5.8%
5
 
4.8%
5
 
4.8%
5
 
4.8%
5
 
4.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
Other values (34) 56
53.8%
Cyrillic
ValueCountFrequency (%)
а 33
 
12.1%
о 27
 
9.9%
н 24
 
8.8%
г 19
 
7.0%
э 17
 
6.2%
л 16
 
5.9%
р 14
 
5.1%
д 14
 
5.1%
и 9
 
3.3%
к 8
 
2.9%
Other values (26) 92
33.7%
Katakana
ValueCountFrequency (%)
3
21.4%
2
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 129046
60.8%
ASCII 82183
38.7%
Cyrillic 273
 
0.1%
None 255
 
0.1%
CJK 255
 
0.1%
Hiragana 104
 
< 0.1%
Compat Jamo 22
 
< 0.1%
Punctuation 18
 
< 0.1%
Katakana 14
 
< 0.1%
Number Forms 7
 
< 0.1%
Other values (3) 10
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
46916
57.1%
: 3823
 
4.7%
e 2023
 
2.5%
, 1787
 
2.2%
( 1571
 
1.9%
) 1571
 
1.9%
a 1485
 
1.8%
o 1436
 
1.7%
n 1335
 
1.6%
t 1333
 
1.6%
Other values (75) 18903
23.0%
Hangul
ValueCountFrequency (%)
3801
 
2.9%
3722
 
2.9%
2867
 
2.2%
2231
 
1.7%
2014
 
1.6%
1830
 
1.4%
1795
 
1.4%
1731
 
1.3%
1696
 
1.3%
1660
 
1.3%
Other values (1286) 105699
81.9%
None
ValueCountFrequency (%)
· 149
58.4%
23
 
9.0%
15
 
5.9%
´ 8
 
3.1%
6
 
2.4%
5
 
2.0%
5
 
2.0%
5
 
2.0%
5
 
2.0%
đ 4
 
1.6%
Other values (12) 30
 
11.8%
Cyrillic
ValueCountFrequency (%)
а 33
 
12.1%
о 27
 
9.9%
н 24
 
8.8%
г 19
 
7.0%
э 17
 
6.2%
л 16
 
5.9%
р 14
 
5.1%
д 14
 
5.1%
и 9
 
3.3%
к 8
 
2.9%
Other values (26) 92
33.7%
CJK
ValueCountFrequency (%)
11
 
4.3%
5
 
2.0%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
4
 
1.6%
4
 
1.6%
3
 
1.2%
Other values (160) 206
80.8%
Punctuation
ValueCountFrequency (%)
7
38.9%
6
33.3%
4
22.2%
1
 
5.6%
Compat Jamo
ValueCountFrequency (%)
6
27.3%
5
22.7%
4
18.2%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
Hiragana
ValueCountFrequency (%)
6
 
5.8%
6
 
5.8%
5
 
4.8%
5
 
4.8%
5
 
4.8%
5
 
4.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
Other values (34) 56
53.8%
Number Forms
ValueCountFrequency (%)
5
71.4%
2
 
28.6%
CJK Compat Ideographs
ValueCountFrequency (%)
4
57.1%
2
28.6%
1
 
14.3%
Katakana
ValueCountFrequency (%)
3
21.4%
2
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct8328
Distinct (%)83.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:20:02.747865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length138
Median length105
Mean length11.3602
Min length2

Characters and Unicode

Total characters113602
Distinct characters1185
Distinct categories14 ?
Distinct scripts7 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7379 ?
Unique (%)73.8%

Sample

1st row최은규
2nd rowBassett, Jennifer
3rd row김성은
4th row김승태 지음
5th rowby Andrea Posner-Sanchez ; illustrated by Isidre Mones
ValueCountFrequency (%)
4239
 
13.0%
지음 1845
 
5.6%
그림 1829
 
5.6%
1424
 
4.4%
옮김 1374
 
4.2%
by 372
 
1.1%
글·그림 146
 
0.4%
엮음 130
 
0.4%
illustrated 107
 
0.3%
79
 
0.2%
Other values (11291) 21121
64.7%
2023-12-13T06:20:03.287998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23681
 
20.8%
; 4220
 
3.7%
3132
 
2.8%
2608
 
2.3%
2567
 
2.3%
2246
 
2.0%
2134
 
1.9%
2048
 
1.8%
, 1853
 
1.6%
1734
 
1.5%
Other values (1175) 67379
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 68479
60.3%
Space Separator 23681
 
20.8%
Lowercase Letter 11494
 
10.1%
Other Punctuation 7016
 
6.2%
Uppercase Letter 2177
 
1.9%
Open Punctuation 328
 
0.3%
Close Punctuation 326
 
0.3%
Dash Punctuation 56
 
< 0.1%
Decimal Number 26
 
< 0.1%
Math Symbol 13
 
< 0.1%
Other values (4) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3132
 
4.6%
2608
 
3.8%
2567
 
3.7%
2246
 
3.3%
2134
 
3.1%
2048
 
3.0%
1734
 
2.5%
1427
 
2.1%
1323
 
1.9%
1178
 
1.7%
Other values (1042) 48082
70.2%
Lowercase Letter
ValueCountFrequency (%)
e 1196
10.4%
a 1160
 
10.1%
n 965
 
8.4%
r 924
 
8.0%
i 878
 
7.6%
l 796
 
6.9%
t 745
 
6.5%
o 713
 
6.2%
y 621
 
5.4%
s 542
 
4.7%
Other values (46) 2954
25.7%
Uppercase Letter
ValueCountFrequency (%)
S 225
 
10.3%
M 194
 
8.9%
J 176
 
8.1%
B 154
 
7.1%
R 138
 
6.3%
A 133
 
6.1%
L 127
 
5.8%
C 113
 
5.2%
D 107
 
4.9%
K 94
 
4.3%
Other values (35) 716
32.9%
Other Punctuation
ValueCountFrequency (%)
; 4220
60.1%
, 1853
26.4%
. 386
 
5.5%
: 326
 
4.6%
· 204
 
2.9%
& 11
 
0.2%
' 7
 
0.1%
/ 6
 
0.1%
2
 
< 0.1%
1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
3 6
23.1%
0 6
23.1%
2 5
19.2%
1 5
19.2%
9 1
 
3.8%
8 1
 
3.8%
4 1
 
3.8%
7 1
 
3.8%
Open Punctuation
ValueCountFrequency (%)
[ 210
64.0%
( 116
35.4%
2
 
0.6%
Close Punctuation
ValueCountFrequency (%)
] 210
64.4%
) 115
35.3%
1
 
0.3%
Math Symbol
ValueCountFrequency (%)
> 7
53.8%
< 6
46.2%
Space Separator
ValueCountFrequency (%)
23681
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 56
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 68237
60.1%
Common 31451
27.7%
Latin 13439
 
11.8%
Cyrillic 233
 
0.2%
Han 188
 
0.2%
Hiragana 41
 
< 0.1%
Katakana 13
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3132
 
4.6%
2608
 
3.8%
2567
 
3.8%
2246
 
3.3%
2134
 
3.1%
2048
 
3.0%
1734
 
2.5%
1427
 
2.1%
1323
 
1.9%
1178
 
1.7%
Other values (871) 47840
70.1%
Han
ValueCountFrequency (%)
14
 
7.4%
6
 
3.2%
5
 
2.7%
4
 
2.1%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
Other values (117) 141
75.0%
Latin
ValueCountFrequency (%)
e 1196
 
8.9%
a 1160
 
8.6%
n 965
 
7.2%
r 924
 
6.9%
i 878
 
6.5%
l 796
 
5.9%
t 745
 
5.5%
o 713
 
5.3%
y 621
 
4.6%
s 542
 
4.0%
Other values (44) 4899
36.5%
Cyrillic
ValueCountFrequency (%)
а 28
 
12.0%
н 20
 
8.6%
и 19
 
8.2%
р 17
 
7.3%
о 16
 
6.9%
л 11
 
4.7%
с 9
 
3.9%
д 8
 
3.4%
г 8
 
3.4%
э 7
 
3.0%
Other values (38) 90
38.6%
Common
ValueCountFrequency (%)
23681
75.3%
; 4220
 
13.4%
, 1853
 
5.9%
. 386
 
1.2%
: 326
 
1.0%
[ 210
 
0.7%
] 210
 
0.7%
· 204
 
0.6%
( 116
 
0.4%
) 115
 
0.4%
Other values (21) 130
 
0.4%
Hiragana
ValueCountFrequency (%)
4
 
9.8%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
1
 
2.4%
1
 
2.4%
Other values (21) 21
51.2%
Katakana
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 68210
60.0%
ASCII 44677
39.3%
Cyrillic 233
 
0.2%
None 211
 
0.2%
CJK 185
 
0.2%
Hiragana 41
 
< 0.1%
Compat Jamo 27
 
< 0.1%
Katakana 13
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23681
53.0%
; 4220
 
9.4%
, 1853
 
4.1%
e 1196
 
2.7%
a 1160
 
2.6%
n 965
 
2.2%
r 924
 
2.1%
i 878
 
2.0%
l 796
 
1.8%
t 745
 
1.7%
Other values (67) 8259
 
18.5%
Hangul
ValueCountFrequency (%)
3132
 
4.6%
2608
 
3.8%
2567
 
3.8%
2246
 
3.3%
2134
 
3.1%
2048
 
3.0%
1734
 
2.5%
1427
 
2.1%
1323
 
1.9%
1178
 
1.7%
Other values (868) 47813
70.1%
None
ValueCountFrequency (%)
· 204
96.7%
2
 
0.9%
2
 
0.9%
ø 1
 
0.5%
1
 
0.5%
1
 
0.5%
Cyrillic
ValueCountFrequency (%)
а 28
 
12.0%
н 20
 
8.6%
и 19
 
8.2%
р 17
 
7.3%
о 16
 
6.9%
л 11
 
4.7%
с 9
 
3.9%
д 8
 
3.4%
г 8
 
3.4%
э 7
 
3.0%
Other values (38) 90
38.6%
CJK
ValueCountFrequency (%)
14
 
7.6%
6
 
3.2%
5
 
2.7%
4
 
2.2%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
Other values (115) 138
74.6%
Compat Jamo
ValueCountFrequency (%)
12
44.4%
12
44.4%
3
 
11.1%
Hiragana
ValueCountFrequency (%)
4
 
9.8%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
1
 
2.4%
1
 
2.4%
Other values (21) 21
51.2%
CJK Compat Ideographs
ValueCountFrequency (%)
2
66.7%
1
33.3%
Katakana
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct2485
Distinct (%)24.9%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T06:20:03.594758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length40
Mean length4.824965
Min length1

Characters and Unicode

Total characters48240
Distinct characters793
Distinct categories11 ?
Distinct scripts6 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1333 ?
Unique (%)13.3%

Sample

1st row아르테
2nd rowOxford University Press
3rd row토토북
4th row자음과모음
5th rowMoonjinmedia
ValueCountFrequency (%)
비룡소 196
 
1.8%
시공주니어 178
 
1.7%
문학동네 177
 
1.6%
창비 175
 
1.6%
주니어김영사 96
 
0.9%
김영사 95
 
0.9%
아이세움 95
 
0.9%
books 85
 
0.8%
자음과모음 85
 
0.8%
위즈덤하우스 82
 
0.8%
Other values (2522) 9489
88.2%
2023-12-13T06:20:04.039243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1473
 
3.1%
1422
 
2.9%
1239
 
2.6%
1120
 
2.3%
1100
 
2.3%
o 926
 
1.9%
755
 
1.6%
696
 
1.4%
s 684
 
1.4%
e 673
 
1.4%
Other values (783) 38152
79.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37911
78.6%
Lowercase Letter 7184
 
14.9%
Uppercase Letter 1915
 
4.0%
Space Separator 755
 
1.6%
Decimal Number 158
 
0.3%
Other Punctuation 133
 
0.3%
Open Punctuation 83
 
0.2%
Close Punctuation 82
 
0.2%
Dash Punctuation 10
 
< 0.1%
Modifier Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1473
 
3.9%
1422
 
3.8%
1239
 
3.3%
1120
 
3.0%
1100
 
2.9%
696
 
1.8%
610
 
1.6%
583
 
1.5%
582
 
1.5%
568
 
1.5%
Other values (674) 28518
75.2%
Lowercase Letter
ValueCountFrequency (%)
o 926
12.9%
s 684
 
9.5%
e 673
 
9.4%
i 561
 
7.8%
a 548
 
7.6%
n 542
 
7.5%
r 528
 
7.3%
t 292
 
4.1%
l 289
 
4.0%
d 260
 
3.6%
Other values (34) 1881
26.2%
Uppercase Letter
ValueCountFrequency (%)
B 233
12.2%
H 179
 
9.3%
P 153
 
8.0%
S 150
 
7.8%
C 147
 
7.7%
M 137
 
7.2%
R 122
 
6.4%
O 112
 
5.8%
T 93
 
4.9%
K 93
 
4.9%
Other values (25) 496
25.9%
Other Punctuation
ValueCountFrequency (%)
& 29
21.8%
24
18.0%
. 21
15.8%
' 18
13.5%
, 11
 
8.3%
· 11
 
8.3%
; 7
 
5.3%
# 6
 
4.5%
: 3
 
2.3%
/ 1
 
0.8%
Other values (2) 2
 
1.5%
Decimal Number
ValueCountFrequency (%)
1 67
42.4%
2 62
39.2%
0 10
 
6.3%
3 8
 
5.1%
4 3
 
1.9%
6 3
 
1.9%
8 2
 
1.3%
7 2
 
1.3%
5 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 78
94.0%
[ 5
 
6.0%
Close Punctuation
ValueCountFrequency (%)
) 78
95.1%
] 4
 
4.9%
Math Symbol
ValueCountFrequency (%)
+ 1
50.0%
| 1
50.0%
Space Separator
ValueCountFrequency (%)
755
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37777
78.3%
Latin 9019
 
18.7%
Common 1230
 
2.5%
Han 128
 
0.3%
Cyrillic 80
 
0.2%
Katakana 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1473
 
3.9%
1422
 
3.8%
1239
 
3.3%
1120
 
3.0%
1100
 
2.9%
696
 
1.8%
610
 
1.6%
583
 
1.5%
582
 
1.5%
568
 
1.5%
Other values (618) 28384
75.1%
Latin
ValueCountFrequency (%)
o 926
 
10.3%
s 684
 
7.6%
e 673
 
7.5%
i 561
 
6.2%
a 548
 
6.1%
n 542
 
6.0%
r 528
 
5.9%
t 292
 
3.2%
l 289
 
3.2%
d 260
 
2.9%
Other values (42) 3716
41.2%
Han
ValueCountFrequency (%)
19
 
14.8%
15
 
11.7%
15
 
11.7%
6
 
4.7%
6
 
4.7%
5
 
3.9%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
Other values (40) 50
39.1%
Common
ValueCountFrequency (%)
755
61.4%
( 78
 
6.3%
) 78
 
6.3%
1 67
 
5.4%
2 62
 
5.0%
& 29
 
2.4%
24
 
2.0%
. 21
 
1.7%
' 18
 
1.5%
, 11
 
0.9%
Other values (20) 87
 
7.1%
Cyrillic
ValueCountFrequency (%)
с 10
 
12.5%
н 8
 
10.0%
а 7
 
8.8%
е 7
 
8.8%
р 6
 
7.5%
т 4
 
5.0%
к 4
 
5.0%
п 3
 
3.8%
э 3
 
3.8%
И 3
 
3.8%
Other values (17) 25
31.2%
Katakana
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37777
78.3%
ASCII 10213
 
21.2%
CJK 128
 
0.3%
Cyrillic 80
 
0.2%
None 36
 
0.1%
Katakana 6
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1473
 
3.9%
1422
 
3.8%
1239
 
3.3%
1120
 
3.0%
1100
 
2.9%
696
 
1.8%
610
 
1.6%
583
 
1.5%
582
 
1.5%
568
 
1.5%
Other values (618) 28384
75.1%
ASCII
ValueCountFrequency (%)
o 926
 
9.1%
755
 
7.4%
s 684
 
6.7%
e 673
 
6.6%
i 561
 
5.5%
a 548
 
5.4%
n 542
 
5.3%
r 528
 
5.2%
t 292
 
2.9%
l 289
 
2.8%
Other values (69) 4415
43.2%
None
ValueCountFrequency (%)
24
66.7%
· 11
30.6%
1
 
2.8%
CJK
ValueCountFrequency (%)
19
 
14.8%
15
 
11.7%
15
 
11.7%
6
 
4.7%
6
 
4.7%
5
 
3.9%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
Other values (40) 50
39.1%
Cyrillic
ValueCountFrequency (%)
с 10
 
12.5%
н 8
 
10.0%
а 7
 
8.8%
е 7
 
8.8%
р 6
 
7.5%
т 4
 
5.0%
к 4
 
5.0%
п 3
 
3.8%
э 3
 
3.8%
И 3
 
3.8%
Other values (17) 25
31.2%
Katakana
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

출판년
Real number (ℝ)

Distinct40
Distinct (%)0.4%
Missing5
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean2013.1859
Minimum1958
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:20:04.249426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1958
5-th percentile2003
Q12010
median2014
Q32017
95-th percentile2021
Maximum2022
Range64
Interquartile range (IQR)7

Descriptive statistics

Standard deviation5.4537792
Coefficient of variation (CV)0.0027090291
Kurtosis2.4880081
Mean2013.1859
Median Absolute Deviation (MAD)4
Skewness-0.95990311
Sum20121793
Variance29.743707
MonotonicityNot monotonic
2023-12-13T06:20:04.412424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
2017 770
 
7.7%
2016 731
 
7.3%
2010 731
 
7.3%
2011 730
 
7.3%
2015 688
 
6.9%
2012 652
 
6.5%
2019 641
 
6.4%
2013 595
 
5.9%
2018 578
 
5.8%
2014 560
 
5.6%
Other values (30) 3319
33.2%
ValueCountFrequency (%)
1958 1
 
< 0.1%
1980 1
 
< 0.1%
1981 4
 
< 0.1%
1984 2
 
< 0.1%
1986 1
 
< 0.1%
1987 1
 
< 0.1%
1989 2
 
< 0.1%
1990 4
 
< 0.1%
1991 11
0.1%
1992 4
 
< 0.1%
ValueCountFrequency (%)
2022 175
 
1.8%
2021 460
4.6%
2020 477
4.8%
2019 641
6.4%
2018 578
5.8%
2017 770
7.7%
2016 731
7.3%
2015 688
6.9%
2014 560
5.6%
2013 595
5.9%
Distinct9885
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:20:04.815914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length21
Mean length12.5752
Min length3

Characters and Unicode

Total characters125752
Distinct characters634
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9777 ?
Unique (%)97.8%

Sample

1st row082 클589ㅇ v.17
2nd row808 O98o v.1-30
3rd row331.5 김567ㅇ
4th row410 수922ㅈ 39
5th row747 F982m K-6
ValueCountFrequency (%)
808.9 551
 
2.2%
843 482
 
1.9%
813.8 451
 
1.8%
c.2 341
 
1.4%
082 273
 
1.1%
2 246
 
1.0%
408 240
 
1.0%
813.7 238
 
0.9%
1 232
 
0.9%
808.91 231
 
0.9%
Other values (8451) 21855
86.9%
2023-12-13T06:20:05.393824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15144
12.0%
8 12683
 
10.1%
1 10688
 
8.5%
3 9418
 
7.5%
. 9173
 
7.3%
9 7644
 
6.1%
2 7609
 
6.1%
4 7487
 
6.0%
5 6456
 
5.1%
7 5737
 
4.6%
Other values (624) 33713
26.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 78435
62.4%
Other Letter 18901
 
15.0%
Space Separator 15144
 
12.0%
Other Punctuation 9185
 
7.3%
Lowercase Letter 3106
 
2.5%
Uppercase Letter 546
 
0.4%
Dash Punctuation 427
 
0.3%
Open Punctuation 4
 
< 0.1%
Close Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1816
 
9.6%
1144
 
6.1%
910
 
4.8%
722
 
3.8%
687
 
3.6%
644
 
3.4%
644
 
3.4%
643
 
3.4%
595
 
3.1%
584
 
3.1%
Other values (564) 10512
55.6%
Uppercase Letter
ValueCountFrequency (%)
B 73
13.4%
S 60
11.0%
M 52
 
9.5%
L 42
 
7.7%
P 41
 
7.5%
F 34
 
6.2%
C 31
 
5.7%
O 30
 
5.5%
N 26
 
4.8%
H 25
 
4.6%
Other values (13) 132
24.2%
Lowercase Letter
ValueCountFrequency (%)
v 2185
70.3%
c 468
 
15.1%
m 65
 
2.1%
p 52
 
1.7%
s 46
 
1.5%
j 40
 
1.3%
r 34
 
1.1%
o 33
 
1.1%
t 26
 
0.8%
a 21
 
0.7%
Other values (11) 136
 
4.4%
Decimal Number
ValueCountFrequency (%)
8 12683
16.2%
1 10688
13.6%
3 9418
12.0%
9 7644
9.7%
2 7609
9.7%
4 7487
9.5%
5 6456
8.2%
7 5737
7.3%
6 5717
7.3%
0 4996
 
6.4%
Other Punctuation
ValueCountFrequency (%)
. 9173
99.9%
, 12
 
0.1%
Space Separator
ValueCountFrequency (%)
15144
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 427
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 4
100.0%
Close Punctuation
ValueCountFrequency (%)
] 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 103199
82.1%
Hangul 18899
 
15.0%
Latin 3652
 
2.9%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1816
 
9.6%
1144
 
6.1%
910
 
4.8%
722
 
3.8%
687
 
3.6%
644
 
3.4%
644
 
3.4%
643
 
3.4%
595
 
3.1%
584
 
3.1%
Other values (562) 10510
55.6%
Latin
ValueCountFrequency (%)
v 2185
59.8%
c 468
 
12.8%
B 73
 
2.0%
m 65
 
1.8%
S 60
 
1.6%
p 52
 
1.4%
M 52
 
1.4%
s 46
 
1.3%
L 42
 
1.2%
P 41
 
1.1%
Other values (34) 568
 
15.6%
Common
ValueCountFrequency (%)
15144
14.7%
8 12683
12.3%
1 10688
10.4%
3 9418
9.1%
. 9173
8.9%
9 7644
7.4%
2 7609
7.4%
4 7487
7.3%
5 6456
6.3%
7 5737
 
5.6%
Other values (6) 11160
10.8%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 106851
85.0%
Hangul 9762
 
7.8%
Compat Jamo 9137
 
7.3%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15144
14.2%
8 12683
11.9%
1 10688
10.0%
3 9418
8.8%
. 9173
8.6%
9 7644
7.2%
2 7609
7.1%
4 7487
7.0%
5 6456
6.0%
7 5737
 
5.4%
Other values (50) 14812
13.9%
Compat Jamo
ValueCountFrequency (%)
1816
19.9%
1144
12.5%
910
10.0%
722
 
7.9%
687
 
7.5%
644
 
7.0%
595
 
6.5%
584
 
6.4%
554
 
6.1%
426
 
4.7%
Other values (9) 1055
11.5%
Hangul
ValueCountFrequency (%)
644
 
6.6%
643
 
6.6%
277
 
2.8%
189
 
1.9%
156
 
1.6%
152
 
1.6%
146
 
1.5%
143
 
1.5%
133
 
1.4%
132
 
1.4%
Other values (543) 7147
73.2%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

Interactions

2023-12-13T06:19:59.556132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:20:05.501362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자료실출판년
자료실1.0000.163
출판년0.1631.000
2023-12-13T06:20:05.587946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출판년자료실
출판년1.0000.174
자료실0.1741.000

Missing values

2023-12-13T06:19:59.680060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:19:59.806956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T06:19:59.904684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

소장처자료실등록번호서명저자출판사출판년청구기호
8781진잠도서관종합자료실CEM87950베토벤 : 절망의 심연에서 불러낸 환희의 선율 = Ludwig Van Beethoven최은규아르테2020082 클589ㅇ v.17
14357진잠도서관어린이실CEM82135Shirley homes and the lithuanian caseBassett, JenniferOxford University Press2012808 O98o v.1-30
26216진잠도서관어린이실CEM68971우리 모두 이웃이야! :서로 친친! 지구 마을 사람들김성은토토북2015331.5 김567ㅇ
60702진잠도서관종합자료실EM23565(네이피어가 들려주는) 로그 이야기김승태 지음자음과모음2008410 수922ㅈ 39
41591진잠도서관어린이실EM51941Bambi`s Hide-and-Seekby Andrea Posner-Sanchez ; illustrated by Isidre MonesMoonjinmedia2011747 F982m K-6
33919진잠도서관어린이실CEM59976이슬람의 황금시대를 연 칼리프들김지항그레이트북스2013909 으933ㄱ v.16
41847진잠도서관종합자료실EM51615인플레이션과 세계경제 대예측아사쿠라 게이 지음 ; 이연재 옮김매일경제신문사2013321.97 아275ㅇ
3471진잠도서관종합자료실CEM93346사라진 내일차일드, 리Openhouse(오픈하우스)2013843.5 차165ㅅ
64503진잠도서관어린이실EM16499음악,아름다운 소리의 세계호세 루이스 코르테스 지음,신승혜 옮김을파소2003670.4 호392ㅇ
10638진잠도서관어린이실CEM86012초등영어 문장만들기가 먼저다 . 4 , 수식어로 문장 꾸미기박광희사람in2019746 박213ㅊ v.4
소장처자료실등록번호서명저자출판사출판년청구기호
55048진잠도서관어린이실EM33365비버 벤이 집을 지었어비키 이건 글 ; 다니엘라 데 루카 그림 ; 신혜정 옮김다섯수레2009491 이125ㅂ
32593진잠도서관어린이실CEM61557살꽃이야기이현주한겨레아이들2014808.9 징985ㅎ v.18
63119진잠도서관어린이실EM19339(피에르 오귀스트) 르누아르마이크 버네치어 글·그림 ; 오정환 번역한국몬테소리2002609.9 버112ㄹ 9
1787진잠도서관종합자료실CEM95042그해, 선셋 비치에서 : 니콜라스 스파크스 장편소설스파크스, 니콜라스문학사상2022843.6 스236ㄱ
33769진잠도서관어린이실CEM60127같은 병을 앓는 사람끼리 가엾게 여긴다 :동병상련최유성통큰세상2010711.47 하138ㅌ v.3
1447진잠도서관어린이실CEM95382(처음 읽는) 그리스 로마 신화 . 3 , 인간의 탄생과 판도라최설희아이세움2020219.2 최759ㄱ v.3
55710진잠도서관종합자료실EM32412과학자가 말하는, 환경 문제의 진실과 거짓말이케다 기요히코 지음 ; 한석호 옮김소와당2011539.9 이732ㄱ
66277진잠도서관종합자료실EM10319생명의 아픔 : 박경리 생명 에세이박경리 지음이룸2004814.6 박173ㅅ
20773진잠도서관어린이실CEM75191응가가 쑴풍조은수한울림어린이2018375.1 쭈999ㅎ v.8
8810진잠도서관종합자료실CEM87920나는 독일인입니다 : 전쟁과 역사와 죄의식에 대하여크루크, 노라엘리2020909.54 크567ㄴ