Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells9
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory712.9 KiB
Average record size in memory73.0 B

Variable types

Categorical2
Text5
Numeric1

Dataset

Description대전광역시 유성구 노은도서관에서 보유하고 있는 도서목록 정보에 대한 데이터로 서명, 저자, 출판사, 출판년, 대출횟수, 데이터기준일자 등의 항목을 제공합니다.
Author대전광역시 유성구
URLhttps://www.data.go.kr/data/15053383/fileData.do

Alerts

소장처 has constant value ""Constant
등록번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:42:41.240496
Analysis finished2023-12-12 23:42:43.694021
Duration2.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

소장처
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
노은도서관
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row노은도서관
2nd row노은도서관
3rd row노은도서관
4th row노은도서관
5th row노은도서관

Common Values

ValueCountFrequency (%)
노은도서관 10000
100.0%

Length

2023-12-13T08:42:43.753911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:42:43.844765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
노은도서관 10000
100.0%

자료실
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
어린이실
4686 
종합자료실
4627 
보존서고
621 
장기연체도서
 
66

Length

Max length6
Median length4
Mean length4.4759
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row어린이실
2nd row어린이실
3rd row어린이실
4th row보존서고
5th row어린이실

Common Values

ValueCountFrequency (%)
어린이실 4686
46.9%
종합자료실 4627
46.3%
보존서고 621
 
6.2%
장기연체도서 66
 
0.7%

Length

2023-12-13T08:42:43.953755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:42:44.067735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
어린이실 4686
46.9%
종합자료실 4627
46.3%
보존서고 621
 
6.2%
장기연체도서 66
 
0.7%

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:42:44.278972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters120000
Distinct characters16
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowANM000086358
2nd rowANM000087643
3rd rowNM0000027266
4th rowNM0000008660
5th rowNM0000038207
ValueCountFrequency (%)
anm000086358 1
 
< 0.1%
anm000059099 1
 
< 0.1%
anm000066534 1
 
< 0.1%
nm0000015231 1
 
< 0.1%
anm000083006 1
 
< 0.1%
anm000080611 1
 
< 0.1%
anm000094786 1
 
< 0.1%
nm0000012349 1
 
< 0.1%
anm000088200 1
 
< 0.1%
anm000060413 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-13T08:42:44.790715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 49201
41.0%
N 9669
 
8.1%
M 9669
 
8.1%
A 5742
 
4.8%
7 5208
 
4.3%
8 5201
 
4.3%
5 5119
 
4.3%
1 5028
 
4.2%
3 5025
 
4.2%
4 5000
 
4.2%
Other values (6) 15138
 
12.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 94258
78.5%
Uppercase Letter 25742
 
21.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 49201
52.2%
7 5208
 
5.5%
8 5201
 
5.5%
5 5119
 
5.4%
1 5028
 
5.3%
3 5025
 
5.3%
4 5000
 
5.3%
6 4952
 
5.3%
2 4851
 
5.1%
9 4673
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
N 9669
37.6%
M 9669
37.6%
A 5742
22.3%
S 331
 
1.3%
J 239
 
0.9%
U 92
 
0.4%

Most occurring scripts

ValueCountFrequency (%)
Common 94258
78.5%
Latin 25742
 
21.5%

Most frequent character per script

Common
ValueCountFrequency (%)
0 49201
52.2%
7 5208
 
5.5%
8 5201
 
5.5%
5 5119
 
5.4%
1 5028
 
5.3%
3 5025
 
5.3%
4 5000
 
5.3%
6 4952
 
5.3%
2 4851
 
5.1%
9 4673
 
5.0%
Latin
ValueCountFrequency (%)
N 9669
37.6%
M 9669
37.6%
A 5742
22.3%
S 331
 
1.3%
J 239
 
0.9%
U 92
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 49201
41.0%
N 9669
 
8.1%
M 9669
 
8.1%
A 5742
 
4.8%
7 5208
 
4.3%
8 5201
 
4.3%
5 5119
 
4.3%
1 5028
 
4.2%
3 5025
 
4.2%
4 5000
 
4.2%
Other values (6) 15138
 
12.6%

서명
Text

Distinct9823
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:42:45.114637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length184
Median length78
Mean length21.6229
Min length1

Characters and Unicode

Total characters216229
Distinct characters1662
Distinct categories15 ?
Distinct scripts8 ?
Distinct blocks15 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9652 ?
Unique (%)96.5%

Sample

1st row찰방찰방 밤을 건너
2nd row그냥 아무것도 하기 싫은데 어떡해요
3rd row아차산 : 고구려의 힘찬 기상을 찾아 떠나요
4th row청소의 여왕
5th row노아 박사의 우주선
ValueCountFrequency (%)
3946
 
7.1%
이야기 463
 
0.8%
1 339
 
0.6%
the 308
 
0.6%
장편소설 306
 
0.6%
2 276
 
0.5%
위한 231
 
0.4%
우리 203
 
0.4%
200
 
0.4%
3 140
 
0.3%
Other values (22688) 49094
88.4%
2023-12-13T08:42:45.587710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
47154
 
21.8%
: 3880
 
1.8%
3849
 
1.8%
3830
 
1.8%
2828
 
1.3%
e 2285
 
1.1%
2191
 
1.0%
1902
 
0.9%
, 1900
 
0.9%
1806
 
0.8%
Other values (1652) 144604
66.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 130277
60.2%
Space Separator 47154
 
21.8%
Lowercase Letter 19408
 
9.0%
Other Punctuation 8272
 
3.8%
Decimal Number 3780
 
1.7%
Uppercase Letter 3178
 
1.5%
Close Punctuation 1711
 
0.8%
Open Punctuation 1711
 
0.8%
Math Symbol 543
 
0.3%
Dash Punctuation 134
 
0.1%
Other values (5) 61
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3849
 
3.0%
3830
 
2.9%
2828
 
2.2%
2191
 
1.7%
1902
 
1.5%
1806
 
1.4%
1805
 
1.4%
1790
 
1.4%
1685
 
1.3%
1663
 
1.3%
Other values (1506) 106928
82.1%
Lowercase Letter
ValueCountFrequency (%)
e 2285
11.8%
a 1682
 
8.7%
o 1620
 
8.3%
n 1509
 
7.8%
i 1474
 
7.6%
t 1414
 
7.3%
r 1332
 
6.9%
s 1156
 
6.0%
h 966
 
5.0%
l 862
 
4.4%
Other values (43) 5108
26.3%
Uppercase Letter
ValueCountFrequency (%)
T 363
 
11.4%
S 270
 
8.5%
C 197
 
6.2%
B 192
 
6.0%
A 180
 
5.7%
W 174
 
5.5%
D 147
 
4.6%
M 146
 
4.6%
I 145
 
4.6%
G 141
 
4.4%
Other values (22) 1223
38.5%
Other Punctuation
ValueCountFrequency (%)
: 3880
46.9%
, 1900
23.0%
. 1458
 
17.6%
! 597
 
7.2%
· 163
 
2.0%
' 150
 
1.8%
& 27
 
0.3%
23
 
0.3%
/ 17
 
0.2%
15
 
0.2%
Other values (10) 42
 
0.5%
Decimal Number
ValueCountFrequency (%)
1 1039
27.5%
2 656
17.4%
0 641
17.0%
3 392
 
10.4%
4 261
 
6.9%
5 226
 
6.0%
7 147
 
3.9%
6 145
 
3.8%
9 139
 
3.7%
8 134
 
3.5%
Math Symbol
ValueCountFrequency (%)
= 470
86.6%
~ 32
 
5.9%
+ 17
 
3.1%
< 7
 
1.3%
> 7
 
1.3%
3
 
0.6%
3
 
0.6%
2
 
0.4%
| 1
 
0.2%
× 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1640
95.9%
] 58
 
3.4%
8
 
0.5%
5
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 1640
95.9%
[ 58
 
3.4%
8
 
0.5%
5
 
0.3%
Letter Number
ValueCountFrequency (%)
4
44.4%
3
33.3%
1
 
11.1%
1
 
11.1%
Other Symbol
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Modifier Symbol
ValueCountFrequency (%)
` 30
69.8%
´ 13
30.2%
Space Separator
ValueCountFrequency (%)
47154
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 134
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 129847
60.1%
Common 63357
29.3%
Latin 22464
 
10.4%
Han 391
 
0.2%
Cyrillic 129
 
0.1%
Hiragana 35
 
< 0.1%
Katakana 4
 
< 0.1%
Greek 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3849
 
3.0%
3830
 
2.9%
2828
 
2.2%
2191
 
1.7%
1902
 
1.5%
1806
 
1.4%
1805
 
1.4%
1790
 
1.4%
1685
 
1.3%
1663
 
1.3%
Other values (1264) 106498
82.0%
Han
ValueCountFrequency (%)
17
 
4.3%
14
 
3.6%
10
 
2.6%
9
 
2.3%
8
 
2.0%
6
 
1.5%
6
 
1.5%
6
 
1.5%
6
 
1.5%
6
 
1.5%
Other values (203) 303
77.5%
Common
ValueCountFrequency (%)
47154
74.4%
: 3880
 
6.1%
, 1900
 
3.0%
) 1640
 
2.6%
( 1640
 
2.6%
. 1458
 
2.3%
1 1039
 
1.6%
2 656
 
1.0%
0 641
 
1.0%
! 597
 
0.9%
Other values (47) 2752
 
4.3%
Latin
ValueCountFrequency (%)
e 2285
 
10.2%
a 1682
 
7.5%
o 1620
 
7.2%
n 1509
 
6.7%
i 1474
 
6.6%
t 1414
 
6.3%
r 1332
 
5.9%
s 1156
 
5.1%
h 966
 
4.3%
l 862
 
3.8%
Other values (47) 8164
36.3%
Cyrillic
ValueCountFrequency (%)
н 16
12.4%
а 13
 
10.1%
э 11
 
8.5%
и 11
 
8.5%
о 10
 
7.8%
р 9
 
7.0%
с 8
 
6.2%
д 6
 
4.7%
г 4
 
3.1%
ч 4
 
3.1%
Other values (20) 37
28.7%
Hiragana
ValueCountFrequency (%)
4
 
11.4%
4
 
11.4%
3
 
8.6%
2
 
5.7%
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (15) 15
42.9%
Katakana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Greek
ValueCountFrequency (%)
π 1
50.0%
α 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 129819
60.0%
ASCII 85542
39.6%
CJK 377
 
0.2%
None 258
 
0.1%
Cyrillic 129
 
0.1%
Hiragana 35
 
< 0.1%
Compat Jamo 28
 
< 0.1%
CJK Compat Ideographs 14
 
< 0.1%
Number Forms 9
 
< 0.1%
Punctuation 7
 
< 0.1%
Other values (5) 11
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
47154
55.1%
: 3880
 
4.5%
e 2285
 
2.7%
, 1900
 
2.2%
a 1682
 
2.0%
) 1640
 
1.9%
( 1640
 
1.9%
o 1620
 
1.9%
n 1509
 
1.8%
i 1474
 
1.7%
Other values (78) 20758
24.3%
Hangul
ValueCountFrequency (%)
3849
 
3.0%
3830
 
3.0%
2828
 
2.2%
2191
 
1.7%
1902
 
1.5%
1806
 
1.4%
1805
 
1.4%
1790
 
1.4%
1685
 
1.3%
1663
 
1.3%
Other values (1258) 106470
82.0%
None
ValueCountFrequency (%)
· 163
63.2%
23
 
8.9%
15
 
5.8%
´ 13
 
5.0%
8
 
3.1%
8
 
3.1%
5
 
1.9%
5
 
1.9%
3
 
1.2%
3
 
1.2%
Other values (7) 12
 
4.7%
CJK
ValueCountFrequency (%)
17
 
4.5%
14
 
3.7%
10
 
2.7%
9
 
2.4%
8
 
2.1%
6
 
1.6%
6
 
1.6%
6
 
1.6%
6
 
1.6%
6
 
1.6%
Other values (194) 289
76.7%
Cyrillic
ValueCountFrequency (%)
н 16
12.4%
а 13
 
10.1%
э 11
 
8.5%
и 11
 
8.5%
о 10
 
7.8%
р 9
 
7.0%
с 8
 
6.2%
д 6
 
4.7%
г 4
 
3.1%
ч 4
 
3.1%
Other values (20) 37
28.7%
Compat Jamo
ValueCountFrequency (%)
11
39.3%
5
17.9%
5
17.9%
5
17.9%
1
 
3.6%
1
 
3.6%
Hiragana
ValueCountFrequency (%)
4
 
11.4%
4
 
11.4%
3
 
8.6%
2
 
5.7%
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (15) 15
42.9%
Number Forms
ValueCountFrequency (%)
4
44.4%
3
33.3%
1
 
11.1%
1
 
11.1%
Punctuation
ValueCountFrequency (%)
3
42.9%
2
28.6%
2
28.6%
CJK Compat Ideographs
ValueCountFrequency (%)
3
21.4%
3
21.4%
2
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Letterlike Symbols
ValueCountFrequency (%)
3
100.0%
Math Operators
ValueCountFrequency (%)
2
100.0%
Katakana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct8414
Distinct (%)84.2%
Missing6
Missing (%)0.1%
Memory size156.2 KiB
2023-12-13T08:42:45.936155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length114
Median length107
Mean length11.951171
Min length1

Characters and Unicode

Total characters119440
Distinct characters1176
Distinct categories12 ?
Distinct scripts7 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7509 ?
Unique (%)75.1%

Sample

1st row이상교
2nd row제성은
3rd row최원근 지음, 김순남 그림
4th row린다 코브 지음 ; 김태윤 옮김
5th row브라이언 와일드스미스 지음 ; 서애경 옮김
ValueCountFrequency (%)
4263
 
12.7%
지음 2222
 
6.6%
옮김 1611
 
4.8%
그림 1489
 
4.4%
1132
 
3.4%
by 553
 
1.6%
글·그림 169
 
0.5%
illustrated 153
 
0.5%
116
 
0.3%
엮음 105
 
0.3%
Other values (11779) 21780
64.8%
2023-12-13T08:42:46.527866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24666
 
20.7%
; 4241
 
3.6%
3435
 
2.9%
2899
 
2.4%
2615
 
2.2%
2458
 
2.1%
1942
 
1.6%
1840
 
1.5%
, 1725
 
1.4%
1663
 
1.4%
Other values (1166) 71956
60.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 68444
57.3%
Space Separator 24666
 
20.7%
Lowercase Letter 15369
 
12.9%
Other Punctuation 7110
 
6.0%
Uppercase Letter 2921
 
2.4%
Open Punctuation 429
 
0.4%
Close Punctuation 421
 
0.4%
Dash Punctuation 46
 
< 0.1%
Decimal Number 19
 
< 0.1%
Math Symbol 13
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3435
 
5.0%
2899
 
4.2%
2615
 
3.8%
2458
 
3.6%
1942
 
2.8%
1840
 
2.7%
1663
 
2.4%
1495
 
2.2%
1334
 
1.9%
1156
 
1.7%
Other values (1053) 47607
69.6%
Lowercase Letter
ValueCountFrequency (%)
e 1571
10.2%
a 1497
 
9.7%
r 1336
 
8.7%
i 1220
 
7.9%
n 1219
 
7.9%
t 1065
 
6.9%
l 1010
 
6.6%
y 961
 
6.3%
o 887
 
5.8%
s 744
 
4.8%
Other values (38) 3859
25.1%
Uppercase Letter
ValueCountFrequency (%)
M 285
 
9.8%
S 280
 
9.6%
D 217
 
7.4%
R 208
 
7.1%
B 203
 
6.9%
J 187
 
6.4%
L 160
 
5.5%
H 153
 
5.2%
A 147
 
5.0%
P 145
 
5.0%
Other values (22) 936
32.0%
Other Punctuation
ValueCountFrequency (%)
; 4241
59.6%
, 1725
24.3%
. 502
 
7.1%
: 359
 
5.0%
· 249
 
3.5%
' 14
 
0.2%
/ 10
 
0.1%
& 8
 
0.1%
1
 
< 0.1%
@ 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
3 4
21.1%
1 3
15.8%
4 2
10.5%
2 2
10.5%
9 2
10.5%
6 2
10.5%
0 2
10.5%
5 1
 
5.3%
7 1
 
5.3%
Close Punctuation
ValueCountFrequency (%)
] 294
69.8%
) 125
29.7%
1
 
0.2%
1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
[ 295
68.8%
( 133
31.0%
1
 
0.2%
Math Symbol
ValueCountFrequency (%)
> 6
46.2%
< 6
46.2%
| 1
 
7.7%
Space Separator
ValueCountFrequency (%)
24666
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 46
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 68166
57.1%
Common 32706
27.4%
Latin 18206
 
15.2%
Han 243
 
0.2%
Cyrillic 84
 
0.1%
Hiragana 19
 
< 0.1%
Katakana 16
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3435
 
5.0%
2899
 
4.3%
2615
 
3.8%
2458
 
3.6%
1942
 
2.8%
1840
 
2.7%
1663
 
2.4%
1495
 
2.2%
1334
 
2.0%
1156
 
1.7%
Other values (897) 47329
69.4%
Han
ValueCountFrequency (%)
17
 
7.0%
10
 
4.1%
8
 
3.3%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.6%
Other values (117) 173
71.2%
Latin
ValueCountFrequency (%)
e 1571
 
8.6%
a 1497
 
8.2%
r 1336
 
7.3%
i 1220
 
6.7%
n 1219
 
6.7%
t 1065
 
5.8%
l 1010
 
5.5%
y 961
 
5.3%
o 887
 
4.9%
s 744
 
4.1%
Other values (42) 6696
36.8%
Common
ValueCountFrequency (%)
24666
75.4%
; 4241
 
13.0%
, 1725
 
5.3%
. 502
 
1.5%
: 359
 
1.1%
[ 295
 
0.9%
] 294
 
0.9%
· 249
 
0.8%
( 133
 
0.4%
) 125
 
0.4%
Other values (23) 117
 
0.4%
Cyrillic
ValueCountFrequency (%)
а 9
 
10.7%
н 8
 
9.5%
р 6
 
7.1%
е 6
 
7.1%
с 5
 
6.0%
д 5
 
6.0%
л 5
 
6.0%
и 4
 
4.8%
г 4
 
4.8%
у 3
 
3.6%
Other values (18) 29
34.5%
Hiragana
ValueCountFrequency (%)
3
15.8%
2
 
10.5%
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (5) 5
26.3%
Katakana
ValueCountFrequency (%)
2
12.5%
2
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (4) 4
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 68155
57.1%
ASCII 50656
42.4%
None 255
 
0.2%
CJK 243
 
0.2%
Cyrillic 84
 
0.1%
Hiragana 19
 
< 0.1%
Katakana 16
 
< 0.1%
Compat Jamo 11
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24666
48.7%
; 4241
 
8.4%
, 1725
 
3.4%
e 1571
 
3.1%
a 1497
 
3.0%
r 1336
 
2.6%
i 1220
 
2.4%
n 1219
 
2.4%
t 1065
 
2.1%
l 1010
 
2.0%
Other values (68) 11106
21.9%
Hangul
ValueCountFrequency (%)
3435
 
5.0%
2899
 
4.3%
2615
 
3.8%
2458
 
3.6%
1942
 
2.8%
1840
 
2.7%
1663
 
2.4%
1495
 
2.2%
1334
 
2.0%
1156
 
1.7%
Other values (896) 47318
69.4%
None
ValueCountFrequency (%)
· 249
97.6%
ł 2
 
0.8%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
CJK
ValueCountFrequency (%)
17
 
7.0%
10
 
4.1%
8
 
3.3%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.6%
Other values (117) 173
71.2%
Compat Jamo
ValueCountFrequency (%)
11
100.0%
Cyrillic
ValueCountFrequency (%)
а 9
 
10.7%
н 8
 
9.5%
р 6
 
7.1%
е 6
 
7.1%
с 5
 
6.0%
д 5
 
6.0%
л 5
 
6.0%
и 4
 
4.8%
г 4
 
4.8%
у 3
 
3.6%
Other values (18) 29
34.5%
Hiragana
ValueCountFrequency (%)
3
15.8%
2
 
10.5%
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (5) 5
26.3%
Katakana
ValueCountFrequency (%)
2
12.5%
2
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (4) 4
25.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Distinct2614
Distinct (%)26.1%
Missing3
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T08:42:46.843954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length31
Mean length4.8258478
Min length1

Characters and Unicode

Total characters48244
Distinct characters800
Distinct categories12 ?
Distinct scripts6 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1452 ?
Unique (%)14.5%

Sample

1st row문학동네
2nd row팜파스
3rd row김영사
4th row럭스미디어
5th row현북스
ValueCountFrequency (%)
문학동네 197
 
1.8%
비룡소 185
 
1.7%
창비 132
 
1.2%
주니어김영사 114
 
1.1%
시공주니어 112
 
1.0%
house 101
 
0.9%
민음사 97
 
0.9%
books 95
 
0.9%
위즈덤하우스 87
 
0.8%
김영사 87
 
0.8%
Other values (2636) 9630
88.9%
2023-12-13T08:42:47.362669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1589
 
3.3%
1267
 
2.6%
1173
 
2.4%
o 1070
 
2.2%
1039
 
2.2%
953
 
2.0%
840
 
1.7%
s 781
 
1.6%
717
 
1.5%
679
 
1.4%
Other values (790) 38136
79.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37479
77.7%
Lowercase Letter 7487
 
15.5%
Uppercase Letter 2008
 
4.2%
Space Separator 840
 
1.7%
Other Punctuation 168
 
0.3%
Decimal Number 141
 
0.3%
Open Punctuation 50
 
0.1%
Close Punctuation 49
 
0.1%
Dash Punctuation 13
 
< 0.1%
Modifier Symbol 7
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1589
 
4.2%
1267
 
3.4%
1173
 
3.1%
1039
 
2.8%
953
 
2.5%
717
 
1.9%
679
 
1.8%
576
 
1.5%
521
 
1.4%
514
 
1.4%
Other values (688) 28451
75.9%
Lowercase Letter
ValueCountFrequency (%)
o 1070
14.3%
s 781
 
10.4%
r 575
 
7.7%
e 552
 
7.4%
a 533
 
7.1%
n 510
 
6.8%
i 484
 
6.5%
u 282
 
3.8%
t 281
 
3.8%
d 272
 
3.6%
Other values (33) 2147
28.7%
Uppercase Letter
ValueCountFrequency (%)
B 288
14.3%
H 259
12.9%
S 180
 
9.0%
P 153
 
7.6%
R 145
 
7.2%
O 145
 
7.2%
C 126
 
6.3%
M 89
 
4.4%
U 79
 
3.9%
D 75
 
3.7%
Other values (19) 469
23.4%
Other Punctuation
ValueCountFrequency (%)
& 56
33.3%
42
25.0%
. 28
16.7%
, 15
 
8.9%
' 13
 
7.7%
· 8
 
4.8%
: 2
 
1.2%
# 1
 
0.6%
1
 
0.6%
@ 1
 
0.6%
Decimal Number
ValueCountFrequency (%)
2 55
39.0%
1 54
38.3%
0 11
 
7.8%
4 6
 
4.3%
8 5
 
3.5%
3 4
 
2.8%
5 2
 
1.4%
9 2
 
1.4%
6 2
 
1.4%
Open Punctuation
ValueCountFrequency (%)
( 43
86.0%
[ 7
 
14.0%
Close Punctuation
ValueCountFrequency (%)
) 43
87.8%
] 6
 
12.2%
Modifier Symbol
ValueCountFrequency (%)
` 6
85.7%
´ 1
 
14.3%
Space Separator
ValueCountFrequency (%)
840
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37318
77.4%
Latin 9451
 
19.6%
Common 1270
 
2.6%
Han 154
 
0.3%
Cyrillic 44
 
0.1%
Katakana 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1589
 
4.3%
1267
 
3.4%
1173
 
3.1%
1039
 
2.8%
953
 
2.6%
717
 
1.9%
679
 
1.8%
576
 
1.5%
521
 
1.4%
514
 
1.4%
Other values (633) 28290
75.8%
Latin
ValueCountFrequency (%)
o 1070
 
11.3%
s 781
 
8.3%
r 575
 
6.1%
e 552
 
5.8%
a 533
 
5.6%
n 510
 
5.4%
i 484
 
5.1%
B 288
 
3.0%
u 282
 
3.0%
t 281
 
3.0%
Other values (42) 4095
43.3%
Han
ValueCountFrequency (%)
30
19.5%
17
 
11.0%
17
 
11.0%
8
 
5.2%
7
 
4.5%
6
 
3.9%
5
 
3.2%
4
 
2.6%
4
 
2.6%
3
 
1.9%
Other values (38) 53
34.4%
Common
ValueCountFrequency (%)
840
66.1%
& 56
 
4.4%
2 55
 
4.3%
1 54
 
4.3%
( 43
 
3.4%
) 43
 
3.4%
42
 
3.3%
. 28
 
2.2%
, 15
 
1.2%
- 13
 
1.0%
Other values (20) 81
 
6.4%
Cyrillic
ValueCountFrequency (%)
э 6
13.6%
н 5
11.4%
а 4
 
9.1%
х 3
 
6.8%
т 3
 
6.8%
р 3
 
6.8%
с 3
 
6.8%
й 2
 
4.5%
г 2
 
4.5%
л 2
 
4.5%
Other values (10) 11
25.0%
Katakana
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37317
77.4%
ASCII 10669
 
22.1%
CJK 154
 
0.3%
None 52
 
0.1%
Cyrillic 44
 
0.1%
Katakana 7
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1589
 
4.3%
1267
 
3.4%
1173
 
3.1%
1039
 
2.8%
953
 
2.6%
717
 
1.9%
679
 
1.8%
576
 
1.5%
521
 
1.4%
514
 
1.4%
Other values (632) 28289
75.8%
ASCII
ValueCountFrequency (%)
o 1070
 
10.0%
840
 
7.9%
s 781
 
7.3%
r 575
 
5.4%
e 552
 
5.2%
a 533
 
5.0%
n 510
 
4.8%
i 484
 
4.5%
B 288
 
2.7%
u 282
 
2.6%
Other values (68) 4754
44.6%
None
ValueCountFrequency (%)
42
80.8%
· 8
 
15.4%
1
 
1.9%
´ 1
 
1.9%
CJK
ValueCountFrequency (%)
30
19.5%
17
 
11.0%
17
 
11.0%
8
 
5.2%
7
 
4.5%
6
 
3.9%
5
 
3.2%
4
 
2.6%
4
 
2.6%
3
 
1.9%
Other values (38) 53
34.4%
Cyrillic
ValueCountFrequency (%)
э 6
13.6%
н 5
11.4%
а 4
 
9.1%
х 3
 
6.8%
т 3
 
6.8%
р 3
 
6.8%
с 3
 
6.8%
й 2
 
4.5%
г 2
 
4.5%
л 2
 
4.5%
Other values (10) 11
25.0%
Katakana
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

출판년
Real number (ℝ)

Distinct38
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2012.2843
Minimum1975
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T08:42:47.500545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1975
5-th percentile2002
Q12009
median2012
Q32017
95-th percentile2021
Maximum2022
Range47
Interquartile range (IQR)8

Descriptive statistics

Standard deviation5.8815247
Coefficient of variation (CV)0.00292281
Kurtosis0.71286231
Mean2012.2843
Median Absolute Deviation (MAD)4
Skewness-0.63744324
Sum20122843
Variance34.592333
MonotonicityNot monotonic
2023-12-13T08:42:47.643606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=38)
ValueCountFrequency (%)
2009 764
 
7.6%
2010 733
 
7.3%
2011 663
 
6.6%
2012 632
 
6.3%
2015 632
 
6.3%
2016 625
 
6.2%
2019 560
 
5.6%
2017 558
 
5.6%
2008 536
 
5.4%
2013 529
 
5.3%
Other values (28) 3768
37.7%
ValueCountFrequency (%)
1975 1
 
< 0.1%
1976 2
 
< 0.1%
1984 1
 
< 0.1%
1986 2
 
< 0.1%
1987 1
 
< 0.1%
1988 2
 
< 0.1%
1991 6
 
0.1%
1992 9
 
0.1%
1993 15
0.1%
1994 27
0.3%
ValueCountFrequency (%)
2022 205
 
2.1%
2021 429
4.3%
2020 501
5.0%
2019 560
5.6%
2018 434
4.3%
2017 558
5.6%
2016 625
6.2%
2015 632
6.3%
2014 433
4.3%
2013 529
5.3%
Distinct9820
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:42:47.996454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length19
Mean length12.2457
Min length8

Characters and Unicode

Total characters122457
Distinct characters645
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9647 ?
Unique (%)96.5%

Sample

1st row811.8 문637ㅁ 71
2nd row813.8 제562ㄱ
3rd row373 신595ㅎ 78
4th row597.3 코375ㅊ
5th row843 알713ㅎ 13
ValueCountFrequency (%)
843 586
 
2.4%
1 481
 
2.0%
813.8 460
 
1.9%
808.9 415
 
1.7%
2 408
 
1.7%
808.91 296
 
1.2%
3 281
 
1.2%
813.7 253
 
1.0%
082 239
 
1.0%
408 227
 
0.9%
Other values (8811) 20604
85.0%
2023-12-13T08:42:48.474979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18267
14.9%
8 12092
 
9.9%
1 10764
 
8.8%
3 9636
 
7.9%
9 7515
 
6.1%
4 7396
 
6.0%
2 7279
 
5.9%
. 6630
 
5.4%
5 6326
 
5.2%
7 6081
 
5.0%
Other values (635) 30471
24.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 77093
63.0%
Other Letter 18646
 
15.2%
Space Separator 18267
 
14.9%
Other Punctuation 6653
 
5.4%
Uppercase Letter 710
 
0.6%
Lowercase Letter 661
 
0.5%
Dash Punctuation 375
 
0.3%
Close Punctuation 26
 
< 0.1%
Open Punctuation 26
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1715
 
9.2%
1160
 
6.2%
960
 
5.1%
741
 
4.0%
696
 
3.7%
674
 
3.6%
641
 
3.4%
634
 
3.4%
627
 
3.4%
593
 
3.2%
Other values (570) 10205
54.7%
Uppercase Letter
ValueCountFrequency (%)
S 127
17.9%
H 72
10.1%
M 71
10.0%
O 53
 
7.5%
D 47
 
6.6%
C 44
 
6.2%
A 38
 
5.4%
P 34
 
4.8%
W 27
 
3.8%
U 27
 
3.8%
Other values (15) 170
23.9%
Lowercase Letter
ValueCountFrequency (%)
c 90
13.6%
s 72
10.9%
r 64
 
9.7%
h 48
 
7.3%
o 46
 
7.0%
p 40
 
6.1%
m 33
 
5.0%
i 32
 
4.8%
j 29
 
4.4%
g 28
 
4.2%
Other values (13) 179
27.1%
Decimal Number
ValueCountFrequency (%)
8 12092
15.7%
1 10764
14.0%
3 9636
12.5%
9 7515
9.7%
4 7396
9.6%
2 7279
9.4%
5 6326
8.2%
7 6081
7.9%
6 5330
6.9%
0 4674
 
6.1%
Other Punctuation
ValueCountFrequency (%)
. 6630
99.7%
, 22
 
0.3%
# 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
18267
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 375
100.0%
Close Punctuation
ValueCountFrequency (%)
] 26
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 102440
83.7%
Hangul 18646
 
15.2%
Latin 1371
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1715
 
9.2%
1160
 
6.2%
960
 
5.1%
741
 
4.0%
696
 
3.7%
674
 
3.6%
641
 
3.4%
634
 
3.4%
627
 
3.4%
593
 
3.2%
Other values (570) 10205
54.7%
Latin
ValueCountFrequency (%)
S 127
 
9.3%
c 90
 
6.6%
H 72
 
5.3%
s 72
 
5.3%
M 71
 
5.2%
r 64
 
4.7%
O 53
 
3.9%
h 48
 
3.5%
D 47
 
3.4%
o 46
 
3.4%
Other values (38) 681
49.7%
Common
ValueCountFrequency (%)
18267
17.8%
8 12092
11.8%
1 10764
10.5%
3 9636
9.4%
9 7515
7.3%
4 7396
7.2%
2 7279
 
7.1%
. 6630
 
6.5%
5 6326
 
6.2%
7 6081
 
5.9%
Other values (7) 10454
10.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 103811
84.8%
Hangul 9697
 
7.9%
Compat Jamo 8949
 
7.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18267
17.6%
8 12092
11.6%
1 10764
10.4%
3 9636
9.3%
9 7515
7.2%
4 7396
7.1%
2 7279
 
7.0%
. 6630
 
6.4%
5 6326
 
6.1%
7 6081
 
5.9%
Other values (55) 11825
11.4%
Compat Jamo
ValueCountFrequency (%)
1715
19.2%
1160
13.0%
960
10.7%
674
 
7.5%
641
 
7.2%
634
 
7.1%
627
 
7.0%
593
 
6.6%
482
 
5.4%
481
 
5.4%
Other values (9) 982
11.0%
Hangul
ValueCountFrequency (%)
741
 
7.6%
696
 
7.2%
307
 
3.2%
194
 
2.0%
157
 
1.6%
149
 
1.5%
146
 
1.5%
141
 
1.5%
140
 
1.4%
138
 
1.4%
Other values (551) 6888
71.0%

Interactions

2023-12-13T08:42:43.273832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:42:48.583900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자료실출판년
자료실1.0000.381
출판년0.3811.000
2023-12-13T08:42:48.679313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출판년자료실
출판년1.0000.237
자료실0.2371.000

Missing values

2023-12-13T08:42:43.387411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:42:43.519018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T08:42:43.633912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

소장처자료실등록번호서명저자출판사출판년청구기호
73190노은도서관어린이실ANM000086358찰방찰방 밤을 건너이상교문학동네2019811.8 문637ㅁ 71
73784노은도서관어린이실ANM000087643그냥 아무것도 하기 싫은데 어떡해요제성은팜파스2021813.8 제562ㄱ
47001노은도서관어린이실NM0000027266아차산 : 고구려의 힘찬 기상을 찾아 떠나요최원근 지음, 김순남 그림김영사2007373 신595ㅎ 78
79177노은도서관보존서고NM0000008660청소의 여왕린다 코브 지음 ; 김태윤 옮김럭스미디어2003597.3 코375ㅊ
52089노은도서관어린이실NM0000038207노아 박사의 우주선브라이언 와일드스미스 지음 ; 서애경 옮김현북스2012843 알713ㅎ 13
18147노은도서관종합자료실ANM000055539팽목항에서 불어오는 바람 :세월호 이후 인문학의 기록노명우현실문화2015331.54 노716ㅍ
44295노은도서관어린이실NM0000018101Junie B. Jones and her big fat mouthBarbara Park ; illustrated by Denise BrunkusRandom House1993843 P235j 3
19086노은도서관종합자료실ANM000058780여성의 몸 여성의 지혜강현주한문화2011516.2 노746ㅇ
30077노은도서관종합자료실ANM000080016곁에 두고 읽는 장자 : 내 인생에 희망이 되어준 장자의 말김태관홍익출판사2015152.226 김912ㄱ
21556노은도서관종합자료실ANM000064731IS는 왜 :IS는 '테러 괴물'인가, 객관적인 우리 시각으로 파헤친 IS 심층 파일한상용서해문집2016349.97 한714i
소장처자료실등록번호서명저자출판사출판년청구기호
36496노은도서관종합자료실ANM000092266완벽한 생애조해진창비2021813.7 조935ㅇ
39919노은도서관어린이실NM0000012144세종대왕 : 초등학교 고학년과 중학생을 위한차원재 글씀 ; 김순자 감수파랑새어린이2001991.108 역292ㅍ 18
7393노은도서관종합자료실NM0000025944아웅 산 수 치의 평화지은이: 아웅 산 수 치 ; 옮긴이: 이문희 ; 그린이: 헤잉 텟공존2007340.99 아342ㅇ
51921노은도서관어린이실NM0000038699(어린이를 위한)주강현의 우리 문화. 2:, 구들에서 방아까지주강현 글아이세움2002082 아351ㅇ 3
72156노은도서관어린이실ANM000083972파도가 온다안효림반달2019813.8 안665ㅍ
74353노은도서관어린이실ANM000088503진화가 뭐예요 : 지구 생명체 탄생의 기원과 비밀루니. 앤빅북2021476.01 루692ㅈ
60257노은도서관어린이실ANM000056832너무 심심해블랙스톤, 스텔라여원미디어2013808.9 탄348ㅇ 63
41800노은도서관어린이실NM0000004137괴물 셀리반 : 러시아문학니콜라이 레스코프 ; 비탈리 콘스탄티노프 그림다림2009808.9 다183ㄷ 9
21047노은도서관종합자료실ANM000063323드래곤 라자 :이영도 판타지 장편소설 .8 ,석양을 향해 나는 드래곤이영도황금가지2016813.7 이472ㄷ 8
9256노은도서관종합자료실NM0000032028인디자인 CS5 무작정 따라하기이민기 지음길벗2011004.76 이317ㅇ