Overview

Dataset statistics

Number of variables12
Number of observations10000
Missing cells40008
Missing cells (%)33.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.0 MiB
Average record size in memory109.0 B

Variable types

Categorical2
Text5
Numeric1
Unsupported4

Dataset

Description대전광역시 유성구 구즉도서관에서 보유하고 있는 도서목록 정보(소장처, 자료실, 등록번호, 설명, 저자, 출판사, 출판년, 청구기호 등)
Author대전광역시 유성구
URLhttps://www.data.go.kr/data/15053384/fileData.do

Alerts

소장처 has constant value ""Constant
Unnamed: 8 has 10000 (100.0%) missing valuesMissing
Unnamed: 9 has 10000 (100.0%) missing valuesMissing
Unnamed: 10 has 10000 (100.0%) missing valuesMissing
Unnamed: 11 has 10000 (100.0%) missing valuesMissing
등록번호 has unique valuesUnique
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 02:55:05.569117
Analysis finished2023-12-12 02:55:09.416476
Duration3.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

소장처
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
구즉도서관
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row구즉도서관
2nd row구즉도서관
3rd row구즉도서관
4th row구즉도서관
5th row구즉도서관

Common Values

ValueCountFrequency (%)
구즉도서관 10000
100.0%

Length

2023-12-12T11:55:09.511457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:55:09.644012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
구즉도서관 10000
100.0%

자료실
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
어린이실
5127 
종합자료실
4533 
지역정보실
 
340

Length

Max length5
Median length4
Mean length4.4873
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종합자료실
2nd row어린이실
3rd row종합자료실
4th row어린이실
5th row종합자료실

Common Values

ValueCountFrequency (%)
어린이실 5127
51.3%
종합자료실 4533
45.3%
지역정보실 340
 
3.4%

Length

2023-12-12T11:55:09.788627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:55:09.921852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
어린이실 5127
51.3%
종합자료실 4533
45.3%
지역정보실 340
 
3.4%

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T11:55:10.478303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters120000
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowDGM000141443
2nd rowGM0000052843
3rd rowGM0000090836
4th rowDGM000119219
5th rowDGM000139160
ValueCountFrequency (%)
dgm000141443 1
 
< 0.1%
gm0000050306 1
 
< 0.1%
gm0000103592 1
 
< 0.1%
dgm000116608 1
 
< 0.1%
gm0000034260 1
 
< 0.1%
gm0000082799 1
 
< 0.1%
dgm000126436 1
 
< 0.1%
dgm000126400 1
 
< 0.1%
dgm000118866 1
 
< 0.1%
dgm000140111 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-12T11:55:10.998755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 45601
38.0%
1 10742
 
9.0%
M 10000
 
8.3%
G 9660
 
8.1%
3 5600
 
4.7%
2 5430
 
4.5%
D 5195
 
4.3%
4 5167
 
4.3%
8 4728
 
3.9%
9 4656
 
3.9%
Other values (3) 13221
 
11.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 95145
79.3%
Uppercase Letter 24855
 
20.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 45601
47.9%
1 10742
 
11.3%
3 5600
 
5.9%
2 5430
 
5.7%
4 5167
 
5.4%
8 4728
 
5.0%
9 4656
 
4.9%
7 4445
 
4.7%
6 4427
 
4.7%
5 4349
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
M 10000
40.2%
G 9660
38.9%
D 5195
20.9%

Most occurring scripts

ValueCountFrequency (%)
Common 95145
79.3%
Latin 24855
 
20.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 45601
47.9%
1 10742
 
11.3%
3 5600
 
5.9%
2 5430
 
5.7%
4 5167
 
5.4%
8 4728
 
5.0%
9 4656
 
4.9%
7 4445
 
4.7%
6 4427
 
4.7%
5 4349
 
4.6%
Latin
ValueCountFrequency (%)
M 10000
40.2%
G 9660
38.9%
D 5195
20.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 45601
38.0%
1 10742
 
9.0%
M 10000
 
8.3%
G 9660
 
8.1%
3 5600
 
4.7%
2 5430
 
4.5%
D 5195
 
4.3%
4 5167
 
4.3%
8 4728
 
3.9%
9 4656
 
3.9%
Other values (3) 13221
 
11.0%

서명
Text

Distinct9848
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T11:55:11.513748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length144
Median length78
Mean length19.9131
Min length1

Characters and Unicode

Total characters199131
Distinct characters1663
Distinct categories16 ?
Distinct scripts7 ?
Distinct blocks17 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9711 ?
Unique (%)97.1%

Sample

1st row청소년 인문학 수업 : 공부와 삶을 연결하는 인문학 . 1 , 역사 예술 문학
2nd row리더 : 성공한 위인들의 리더 방법
3rd row유럽동화마을여행
4th row노인과 소년
5th row들개를 위한 변론
ValueCountFrequency (%)
3561
 
6.8%
이야기 489
 
0.9%
2 290
 
0.6%
1 271
 
0.5%
장편소설 252
 
0.5%
위한 215
 
0.4%
the 211
 
0.4%
209
 
0.4%
우리 195
 
0.4%
3 138
 
0.3%
Other values (22028) 46271
88.8%
2023-12-12T11:55:12.189601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
43661
 
21.9%
3700
 
1.9%
3624
 
1.8%
: 3561
 
1.8%
2753
 
1.4%
2081
 
1.0%
1860
 
0.9%
1822
 
0.9%
1757
 
0.9%
1696
 
0.9%
Other values (1653) 132616
66.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 125129
62.8%
Space Separator 43661
 
21.9%
Lowercase Letter 12764
 
6.4%
Other Punctuation 7606
 
3.8%
Decimal Number 3638
 
1.8%
Uppercase Letter 2449
 
1.2%
Open Punctuation 1660
 
0.8%
Close Punctuation 1660
 
0.8%
Math Symbol 305
 
0.2%
Dash Punctuation 171
 
0.1%
Other values (6) 88
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3700
 
3.0%
3624
 
2.9%
2753
 
2.2%
2081
 
1.7%
1860
 
1.5%
1822
 
1.5%
1757
 
1.4%
1696
 
1.4%
1614
 
1.3%
1606
 
1.3%
Other values (1490) 102616
82.0%
Lowercase Letter
ValueCountFrequency (%)
e 1523
11.9%
o 1053
 
8.2%
a 1033
 
8.1%
i 926
 
7.3%
n 922
 
7.2%
t 869
 
6.8%
r 859
 
6.7%
s 791
 
6.2%
h 605
 
4.7%
l 572
 
4.5%
Other values (46) 3611
28.3%
Uppercase Letter
ValueCountFrequency (%)
D 279
 
11.4%
T 245
 
10.0%
S 221
 
9.0%
V 137
 
5.6%
B 128
 
5.2%
E 127
 
5.2%
C 127
 
5.2%
A 127
 
5.2%
M 121
 
4.9%
W 100
 
4.1%
Other values (26) 837
34.2%
Other Punctuation
ValueCountFrequency (%)
: 3561
46.8%
, 1636
21.5%
. 1449
19.1%
! 586
 
7.7%
· 159
 
2.1%
' 99
 
1.3%
& 21
 
0.3%
18
 
0.2%
/ 16
 
0.2%
% 15
 
0.2%
Other values (10) 46
 
0.6%
Decimal Number
ValueCountFrequency (%)
1 945
26.0%
2 696
19.1%
0 572
15.7%
3 380
10.4%
5 244
 
6.7%
4 225
 
6.2%
6 159
 
4.4%
8 144
 
4.0%
7 142
 
3.9%
9 131
 
3.6%
Math Symbol
ValueCountFrequency (%)
= 251
82.3%
~ 35
 
11.5%
+ 6
 
2.0%
< 4
 
1.3%
> 4
 
1.3%
× 2
 
0.7%
1
 
0.3%
1
 
0.3%
1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 1460
88.0%
[ 188
 
11.3%
5
 
0.3%
4
 
0.2%
2
 
0.1%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1460
88.0%
] 188
 
11.3%
5
 
0.3%
4
 
0.2%
2
 
0.1%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
5
35.7%
3
21.4%
3
21.4%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other Symbol
ValueCountFrequency (%)
3
30.0%
3
30.0%
2
20.0%
1
 
10.0%
1
 
10.0%
Dash Punctuation
ValueCountFrequency (%)
- 170
99.4%
1
 
0.6%
Modifier Symbol
ValueCountFrequency (%)
` 47
79.7%
´ 12
 
20.3%
Other Number
ValueCountFrequency (%)
1
50.0%
² 1
50.0%
Space Separator
ValueCountFrequency (%)
43661
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 124856
62.7%
Common 58775
29.5%
Latin 14939
 
7.5%
Cyrillic 288
 
0.1%
Han 260
 
0.1%
Hiragana 8
 
< 0.1%
Katakana 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3700
 
3.0%
3624
 
2.9%
2753
 
2.2%
2081
 
1.7%
1860
 
1.5%
1822
 
1.5%
1757
 
1.4%
1696
 
1.4%
1614
 
1.3%
1606
 
1.3%
Other values (1301) 102343
82.0%
Han
ValueCountFrequency (%)
7
 
2.7%
7
 
2.7%
7
 
2.7%
6
 
2.3%
5
 
1.9%
5
 
1.9%
5
 
1.9%
4
 
1.5%
4
 
1.5%
4
 
1.5%
Other values (166) 206
79.2%
Common
ValueCountFrequency (%)
43661
74.3%
: 3561
 
6.1%
, 1636
 
2.8%
( 1460
 
2.5%
) 1460
 
2.5%
. 1449
 
2.5%
1 945
 
1.6%
2 696
 
1.2%
! 586
 
1.0%
0 572
 
1.0%
Other values (55) 2749
 
4.7%
Latin
ValueCountFrequency (%)
e 1523
 
10.2%
o 1053
 
7.0%
a 1033
 
6.9%
i 926
 
6.2%
n 922
 
6.2%
t 869
 
5.8%
r 859
 
5.8%
s 791
 
5.3%
h 605
 
4.0%
l 572
 
3.8%
Other values (49) 5786
38.7%
Cyrillic
ValueCountFrequency (%)
а 34
 
11.8%
н 29
 
10.1%
р 21
 
7.3%
и 18
 
6.2%
о 17
 
5.9%
л 16
 
5.6%
э 14
 
4.9%
у 14
 
4.9%
г 11
 
3.8%
й 10
 
3.5%
Other values (29) 104
36.1%
Hiragana
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 124838
62.7%
ASCII 73433
36.9%
Cyrillic 288
 
0.1%
CJK 253
 
0.1%
None 245
 
0.1%
Compat Jamo 18
 
< 0.1%
Number Forms 14
 
< 0.1%
Punctuation 10
 
< 0.1%
Hiragana 8
 
< 0.1%
CJK Compat Ideographs 7
 
< 0.1%
Other values (7) 17
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43661
59.5%
: 3561
 
4.8%
, 1636
 
2.2%
e 1523
 
2.1%
( 1460
 
2.0%
) 1460
 
2.0%
. 1449
 
2.0%
o 1053
 
1.4%
a 1033
 
1.4%
1 945
 
1.3%
Other values (75) 15652
 
21.3%
Hangul
ValueCountFrequency (%)
3700
 
3.0%
3624
 
2.9%
2753
 
2.2%
2081
 
1.7%
1860
 
1.5%
1822
 
1.5%
1757
 
1.4%
1696
 
1.4%
1614
 
1.3%
1606
 
1.3%
Other values (1297) 102325
82.0%
None
ValueCountFrequency (%)
· 159
64.9%
18
 
7.3%
´ 12
 
4.9%
11
 
4.5%
7
 
2.9%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
2
 
0.8%
Other values (13) 18
 
7.3%
Cyrillic
ValueCountFrequency (%)
а 34
 
11.8%
н 29
 
10.1%
р 21
 
7.3%
и 18
 
6.2%
о 17
 
5.9%
л 16
 
5.6%
э 14
 
4.9%
у 14
 
4.9%
г 11
 
3.8%
й 10
 
3.5%
Other values (29) 104
36.1%
CJK
ValueCountFrequency (%)
7
 
2.8%
7
 
2.8%
7
 
2.8%
6
 
2.4%
5
 
2.0%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (160) 199
78.7%
Punctuation
ValueCountFrequency (%)
7
70.0%
2
 
20.0%
1
 
10.0%
Compat Jamo
ValueCountFrequency (%)
6
33.3%
4
22.2%
4
22.2%
4
22.2%
Number Forms
ValueCountFrequency (%)
5
35.7%
3
21.4%
3
21.4%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Geometric Shapes
ValueCountFrequency (%)
3
100.0%
Misc Symbols
ValueCountFrequency (%)
3
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Enclosed Alphanum
ValueCountFrequency (%)
2
66.7%
1
33.3%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Hiragana
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct8638
Distinct (%)86.4%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T11:55:12.679543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length114
Median length109
Mean length12.067113
Min length1

Characters and Unicode

Total characters120647
Distinct characters1186
Distinct categories11 ?
Distinct scripts6 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7821 ?
Unique (%)78.2%

Sample

1st row백상경제연구원
2nd row김현민 글 ; 나일영 그림
3rd row이형준 글·사진
4th row박완서
5th row우재욱
ValueCountFrequency (%)
4652
 
13.3%
지음 2125
 
6.1%
그림 1671
 
4.8%
옮김 1627
 
4.7%
1312
 
3.8%
by 255
 
0.7%
감독 180
 
0.5%
글·그림 152
 
0.4%
130
 
0.4%
엮음 108
 
0.3%
Other values (12595) 22667
65.0%
2023-12-12T11:55:13.520028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25892
 
21.5%
; 4815
 
4.0%
3506
 
2.9%
2870
 
2.4%
2737
 
2.3%
2382
 
2.0%
2249
 
1.9%
2075
 
1.7%
, 1840
 
1.5%
1782
 
1.5%
Other values (1176) 70499
58.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 74895
62.1%
Space Separator 25892
 
21.5%
Lowercase Letter 8954
 
7.4%
Other Punctuation 7752
 
6.4%
Uppercase Letter 1886
 
1.6%
Open Punctuation 587
 
0.5%
Close Punctuation 578
 
0.5%
Decimal Number 45
 
< 0.1%
Dash Punctuation 43
 
< 0.1%
Math Symbol 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3506
 
4.7%
2870
 
3.8%
2737
 
3.7%
2382
 
3.2%
2249
 
3.0%
2075
 
2.8%
1782
 
2.4%
1711
 
2.3%
1520
 
2.0%
1284
 
1.7%
Other values (1040) 52779
70.5%
Lowercase Letter
ValueCountFrequency (%)
e 922
 
10.3%
a 820
 
9.2%
n 726
 
8.1%
r 700
 
7.8%
i 696
 
7.8%
t 625
 
7.0%
l 598
 
6.7%
o 588
 
6.6%
y 467
 
5.2%
s 360
 
4.0%
Other values (47) 2452
27.4%
Uppercase Letter
ValueCountFrequency (%)
S 170
 
9.0%
J 160
 
8.5%
M 139
 
7.4%
R 138
 
7.3%
A 128
 
6.8%
B 119
 
6.3%
D 114
 
6.0%
C 108
 
5.7%
H 91
 
4.8%
L 85
 
4.5%
Other values (33) 634
33.6%
Other Punctuation
ValueCountFrequency (%)
; 4815
62.1%
, 1840
 
23.7%
. 506
 
6.5%
: 334
 
4.3%
· 211
 
2.7%
/ 29
 
0.4%
& 6
 
0.1%
' 4
 
0.1%
* 4
 
0.1%
1
 
< 0.1%
Other values (2) 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 11
24.4%
0 10
22.2%
1 6
13.3%
3 5
11.1%
6 5
11.1%
5 3
 
6.7%
4 2
 
4.4%
8 1
 
2.2%
9 1
 
2.2%
7 1
 
2.2%
Open Punctuation
ValueCountFrequency (%)
[ 479
81.6%
( 105
 
17.9%
2
 
0.3%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
] 475
82.2%
) 100
 
17.3%
2
 
0.3%
1
 
0.2%
Math Symbol
ValueCountFrequency (%)
| 7
53.8%
< 3
23.1%
> 3
23.1%
Space Separator
ValueCountFrequency (%)
25892
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 43
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 74707
61.9%
Common 34912
28.9%
Latin 10419
 
8.6%
Cyrillic 421
 
0.3%
Han 171
 
0.1%
Hiragana 17
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3506
 
4.7%
2870
 
3.8%
2737
 
3.7%
2382
 
3.2%
2249
 
3.0%
2075
 
2.8%
1782
 
2.4%
1711
 
2.3%
1520
 
2.0%
1284
 
1.7%
Other values (918) 52591
70.4%
Han
ValueCountFrequency (%)
17
 
9.9%
6
 
3.5%
4
 
2.3%
4
 
2.3%
4
 
2.3%
4
 
2.3%
3
 
1.8%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (96) 120
70.2%
Latin
ValueCountFrequency (%)
e 922
 
8.8%
a 820
 
7.9%
n 726
 
7.0%
r 700
 
6.7%
i 696
 
6.7%
t 625
 
6.0%
l 598
 
5.7%
o 588
 
5.6%
y 467
 
4.5%
s 360
 
3.5%
Other values (42) 3917
37.6%
Cyrillic
ValueCountFrequency (%)
а 46
 
10.9%
н 38
 
9.0%
о 33
 
7.8%
р 32
 
7.6%
и 25
 
5.9%
у 22
 
5.2%
д 17
 
4.0%
л 16
 
3.8%
э 15
 
3.6%
г 13
 
3.1%
Other values (38) 164
39.0%
Common
ValueCountFrequency (%)
25892
74.2%
; 4815
 
13.8%
, 1840
 
5.3%
. 506
 
1.4%
[ 479
 
1.4%
] 475
 
1.4%
: 334
 
1.0%
· 211
 
0.6%
( 105
 
0.3%
) 100
 
0.3%
Other values (26) 155
 
0.4%
Hiragana
ValueCountFrequency (%)
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (6) 6
35.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 74700
61.9%
ASCII 45110
37.4%
Cyrillic 421
 
0.3%
None 221
 
0.2%
CJK 163
 
0.1%
Hiragana 17
 
< 0.1%
CJK Compat Ideographs 8
 
< 0.1%
Compat Jamo 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
25892
57.4%
; 4815
 
10.7%
, 1840
 
4.1%
e 922
 
2.0%
a 820
 
1.8%
n 726
 
1.6%
r 700
 
1.6%
i 696
 
1.5%
t 625
 
1.4%
l 598
 
1.3%
Other values (70) 7476
 
16.6%
Hangul
ValueCountFrequency (%)
3506
 
4.7%
2870
 
3.8%
2737
 
3.7%
2382
 
3.2%
2249
 
3.0%
2075
 
2.8%
1782
 
2.4%
1711
 
2.3%
1520
 
2.0%
1284
 
1.7%
Other values (916) 52584
70.4%
None
ValueCountFrequency (%)
· 211
95.5%
2
 
0.9%
2
 
0.9%
ł 2
 
0.9%
1
 
0.5%
1
 
0.5%
1
 
0.5%
1
 
0.5%
Cyrillic
ValueCountFrequency (%)
а 46
 
10.9%
н 38
 
9.0%
о 33
 
7.8%
р 32
 
7.6%
и 25
 
5.9%
у 22
 
5.2%
д 17
 
4.0%
л 16
 
3.8%
э 15
 
3.6%
г 13
 
3.1%
Other values (38) 164
39.0%
CJK
ValueCountFrequency (%)
17
 
10.4%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
3
 
1.8%
3
 
1.8%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (93) 115
70.6%
Compat Jamo
ValueCountFrequency (%)
6
85.7%
1
 
14.3%
CJK Compat Ideographs
ValueCountFrequency (%)
6
75.0%
1
 
12.5%
1
 
12.5%
Hiragana
ValueCountFrequency (%)
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (6) 6
35.3%
Distinct2698
Distinct (%)27.0%
Missing3
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T11:55:13.945333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length40
Mean length4.7554266
Min length1

Characters and Unicode

Total characters47540
Distinct characters814
Distinct categories11 ?
Distinct scripts6 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1565 ?
Unique (%)15.7%

Sample

1st row한빛비즈
2nd row아이앤북
3rd row즐거운상상
4th row어린이작가정신
5th row지성사
ValueCountFrequency (%)
비룡소 170
 
1.6%
문학동네 159
 
1.5%
창비 131
 
1.2%
자음과모음 107
 
1.0%
아이세움 91
 
0.8%
주니어김영사 85
 
0.8%
시공주니어 80
 
0.7%
위즈덤하우스 78
 
0.7%
웅진주니어 76
 
0.7%
민음사 75
 
0.7%
Other values (2763) 9719
90.2%
2023-12-12T11:55:14.537582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1552
 
3.3%
1396
 
2.9%
1316
 
2.8%
1065
 
2.2%
910
 
1.9%
774
 
1.6%
676
 
1.4%
663
 
1.4%
o 647
 
1.4%
610
 
1.3%
Other values (804) 37931
79.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39471
83.0%
Lowercase Letter 5110
 
10.7%
Uppercase Letter 1553
 
3.3%
Space Separator 774
 
1.6%
Other Punctuation 213
 
0.4%
Decimal Number 170
 
0.4%
Open Punctuation 116
 
0.2%
Close Punctuation 115
 
0.2%
Dash Punctuation 12
 
< 0.1%
Modifier Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1552
 
3.9%
1396
 
3.5%
1316
 
3.3%
1065
 
2.7%
910
 
2.3%
676
 
1.7%
663
 
1.7%
610
 
1.5%
587
 
1.5%
582
 
1.5%
Other values (698) 30114
76.3%
Lowercase Letter
ValueCountFrequency (%)
o 647
12.7%
s 485
 
9.5%
i 416
 
8.1%
e 413
 
8.1%
n 380
 
7.4%
r 338
 
6.6%
a 329
 
6.4%
l 259
 
5.1%
k 205
 
4.0%
t 203
 
4.0%
Other values (35) 1435
28.1%
Uppercase Letter
ValueCountFrequency (%)
B 234
15.1%
M 135
 
8.7%
S 130
 
8.4%
H 110
 
7.1%
C 107
 
6.9%
P 90
 
5.8%
K 88
 
5.7%
R 85
 
5.5%
A 67
 
4.3%
L 63
 
4.1%
Other values (22) 444
28.6%
Other Punctuation
ValueCountFrequency (%)
& 90
42.3%
· 37
17.4%
24
 
11.3%
. 20
 
9.4%
' 16
 
7.5%
, 13
 
6.1%
* 4
 
1.9%
: 3
 
1.4%
@ 3
 
1.4%
; 2
 
0.9%
Decimal Number
ValueCountFrequency (%)
2 75
44.1%
1 70
41.2%
0 9
 
5.3%
3 6
 
3.5%
9 3
 
1.8%
4 3
 
1.8%
8 1
 
0.6%
6 1
 
0.6%
5 1
 
0.6%
7 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 77
66.4%
[ 39
33.6%
Close Punctuation
ValueCountFrequency (%)
) 76
66.1%
] 39
33.9%
Space Separator
ValueCountFrequency (%)
774
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 4
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39350
82.8%
Latin 6515
 
13.7%
Common 1406
 
3.0%
Cyrillic 148
 
0.3%
Han 118
 
0.2%
Katakana 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1552
 
3.9%
1396
 
3.5%
1316
 
3.3%
1065
 
2.7%
910
 
2.3%
676
 
1.7%
663
 
1.7%
610
 
1.6%
587
 
1.5%
582
 
1.5%
Other values (638) 29993
76.2%
Han
ValueCountFrequency (%)
16
 
13.6%
11
 
9.3%
11
 
9.3%
6
 
5.1%
4
 
3.4%
4
 
3.4%
3
 
2.5%
3
 
2.5%
3
 
2.5%
2
 
1.7%
Other values (47) 55
46.6%
Latin
ValueCountFrequency (%)
o 647
 
9.9%
s 485
 
7.4%
i 416
 
6.4%
e 413
 
6.3%
n 380
 
5.8%
r 338
 
5.2%
a 329
 
5.0%
l 259
 
4.0%
B 234
 
3.6%
k 205
 
3.1%
Other values (42) 2809
43.1%
Common
ValueCountFrequency (%)
774
55.0%
& 90
 
6.4%
( 77
 
5.5%
) 76
 
5.4%
2 75
 
5.3%
1 70
 
5.0%
] 39
 
2.8%
[ 39
 
2.8%
· 37
 
2.6%
24
 
1.7%
Other values (19) 105
 
7.5%
Cyrillic
ValueCountFrequency (%)
н 17
11.5%
с 16
 
10.8%
а 12
 
8.1%
э 12
 
8.1%
р 11
 
7.4%
М 9
 
6.1%
г 8
 
5.4%
о 8
 
5.4%
х 7
 
4.7%
п 7
 
4.7%
Other values (15) 41
27.7%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39347
82.8%
ASCII 7859
 
16.5%
Cyrillic 148
 
0.3%
CJK 118
 
0.2%
None 62
 
0.1%
Katakana 3
 
< 0.1%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1552
 
3.9%
1396
 
3.5%
1316
 
3.3%
1065
 
2.7%
910
 
2.3%
676
 
1.7%
663
 
1.7%
610
 
1.6%
587
 
1.5%
582
 
1.5%
Other values (635) 29990
76.2%
ASCII
ValueCountFrequency (%)
774
 
9.8%
o 647
 
8.2%
s 485
 
6.2%
i 416
 
5.3%
e 413
 
5.3%
n 380
 
4.8%
r 338
 
4.3%
a 329
 
4.2%
l 259
 
3.3%
B 234
 
3.0%
Other values (68) 3584
45.6%
None
ValueCountFrequency (%)
· 37
59.7%
24
38.7%
đ 1
 
1.6%
Cyrillic
ValueCountFrequency (%)
н 17
11.5%
с 16
 
10.8%
а 12
 
8.1%
э 12
 
8.1%
р 11
 
7.4%
М 9
 
6.1%
г 8
 
5.4%
о 8
 
5.4%
х 7
 
4.7%
п 7
 
4.7%
Other values (15) 41
27.7%
CJK
ValueCountFrequency (%)
16
 
13.6%
11
 
9.3%
11
 
9.3%
6
 
5.1%
4
 
3.4%
4
 
3.4%
3
 
2.5%
3
 
2.5%
3
 
2.5%
2
 
1.7%
Other values (47) 55
46.6%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

출판년
Real number (ℝ)

Distinct36
Distinct (%)0.4%
Missing3
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean2012.5088
Minimum1983
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T11:55:14.744077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1983
5-th percentile2003
Q12008
median2013
Q32017
95-th percentile2021
Maximum2022
Range39
Interquartile range (IQR)9

Descriptive statistics

Standard deviation5.5426832
Coefficient of variation (CV)0.0027541163
Kurtosis-0.68649076
Mean2012.5088
Median Absolute Deviation (MAD)4
Skewness-0.28226562
Sum20119050
Variance30.721337
MonotonicityNot monotonic
2023-12-12T11:55:14.903597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
2018 676
 
6.8%
2017 641
 
6.4%
2016 628
 
6.3%
2011 624
 
6.2%
2019 599
 
6.0%
2010 583
 
5.8%
2013 566
 
5.7%
2012 563
 
5.6%
2020 553
 
5.5%
2009 538
 
5.4%
Other values (26) 4026
40.3%
ValueCountFrequency (%)
1983 1
 
< 0.1%
1986 1
 
< 0.1%
1989 1
 
< 0.1%
1990 1
 
< 0.1%
1991 1
 
< 0.1%
1992 1
 
< 0.1%
1993 2
 
< 0.1%
1994 3
 
< 0.1%
1995 1
 
< 0.1%
1996 8
0.1%
ValueCountFrequency (%)
2022 138
 
1.4%
2021 384
3.8%
2020 553
5.5%
2019 599
6.0%
2018 676
6.8%
2017 641
6.4%
2016 628
6.3%
2015 464
4.6%
2014 470
4.7%
2013 566
5.7%
Distinct9760
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T11:55:15.321491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length11.9246
Min length3

Characters and Unicode

Total characters119246
Distinct characters631
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9590 ?
Unique (%)95.9%

Sample

1st row342.1 백966ㅊ 1
2nd row199.1 미735ㅇ 1
3rd row982.02 이787ㅇ
4th row813.8 박513ㄴ
5th row527.41 우178ㄷ
ValueCountFrequency (%)
808.9 566
 
2.4%
813.8 548
 
2.3%
843 475
 
2.0%
1 421
 
1.8%
2 391
 
1.6%
688 322
 
1.4%
3 261
 
1.1%
408 222
 
0.9%
813.6 213
 
0.9%
4 196
 
0.8%
Other values (8871) 20115
84.8%
2023-12-12T11:55:15.950353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17711
14.9%
8 12167
10.2%
1 10384
 
8.7%
3 9386
 
7.9%
9 7562
 
6.3%
4 7254
 
6.1%
2 6825
 
5.7%
. 6394
 
5.4%
5 6203
 
5.2%
6 5765
 
4.8%
Other values (621) 29595
24.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 75591
63.4%
Other Letter 18492
 
15.5%
Space Separator 17711
 
14.9%
Other Punctuation 6431
 
5.4%
Uppercase Letter 499
 
0.4%
Lowercase Letter 293
 
0.2%
Dash Punctuation 212
 
0.2%
Close Punctuation 8
 
< 0.1%
Open Punctuation 8
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1728
 
9.3%
1194
 
6.5%
922
 
5.0%
732
 
4.0%
718
 
3.9%
693
 
3.7%
671
 
3.6%
628
 
3.4%
616
 
3.3%
596
 
3.2%
Other values (554) 9994
54.0%
Uppercase Letter
ValueCountFrequency (%)
U 50
 
10.0%
S 47
 
9.4%
A 46
 
9.2%
C 33
 
6.6%
R 33
 
6.6%
G 31
 
6.2%
B 30
 
6.0%
O 29
 
5.8%
M 28
 
5.6%
H 25
 
5.0%
Other values (15) 147
29.5%
Lowercase Letter
ValueCountFrequency (%)
m 37
 
12.6%
s 28
 
9.6%
c 27
 
9.2%
a 21
 
7.2%
r 16
 
5.5%
p 16
 
5.5%
o 16
 
5.5%
t 15
 
5.1%
g 13
 
4.4%
b 11
 
3.8%
Other values (15) 93
31.7%
Decimal Number
ValueCountFrequency (%)
8 12167
16.1%
1 10384
13.7%
3 9386
12.4%
9 7562
10.0%
4 7254
9.6%
2 6825
9.0%
5 6203
8.2%
6 5765
7.6%
7 5540
7.3%
0 4505
 
6.0%
Other Punctuation
ValueCountFrequency (%)
. 6394
99.4%
, 37
 
0.6%
Space Separator
ValueCountFrequency (%)
17711
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 212
100.0%
Close Punctuation
ValueCountFrequency (%)
] 8
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 8
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 99961
83.8%
Hangul 18492
 
15.5%
Latin 793
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1728
 
9.3%
1194
 
6.5%
922
 
5.0%
732
 
4.0%
718
 
3.9%
693
 
3.7%
671
 
3.6%
628
 
3.4%
616
 
3.3%
596
 
3.2%
Other values (554) 9994
54.0%
Latin
ValueCountFrequency (%)
U 50
 
6.3%
S 47
 
5.9%
A 46
 
5.8%
m 37
 
4.7%
C 33
 
4.2%
R 33
 
4.2%
G 31
 
3.9%
B 30
 
3.8%
O 29
 
3.7%
M 28
 
3.5%
Other values (41) 429
54.1%
Common
ValueCountFrequency (%)
17711
17.7%
8 12167
12.2%
1 10384
10.4%
3 9386
9.4%
9 7562
7.6%
4 7254
7.3%
2 6825
 
6.8%
. 6394
 
6.4%
5 6203
 
6.2%
6 5765
 
5.8%
Other values (6) 10310
10.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100753
84.5%
Hangul 9601
 
8.1%
Compat Jamo 8891
 
7.5%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17711
17.6%
8 12167
12.1%
1 10384
10.3%
3 9386
9.3%
9 7562
7.5%
4 7254
7.2%
2 6825
 
6.8%
. 6394
 
6.3%
5 6203
 
6.2%
6 5765
 
5.7%
Other values (56) 11102
11.0%
Compat Jamo
ValueCountFrequency (%)
1728
19.4%
1194
13.4%
922
10.4%
718
8.1%
671
 
7.5%
628
 
7.1%
616
 
6.9%
596
 
6.7%
437
 
4.9%
422
 
4.7%
Other values (9) 959
10.8%
Hangul
ValueCountFrequency (%)
732
 
7.6%
693
 
7.2%
253
 
2.6%
211
 
2.2%
163
 
1.7%
152
 
1.6%
147
 
1.5%
141
 
1.5%
130
 
1.4%
122
 
1.3%
Other values (535) 6857
71.4%
Number Forms
ValueCountFrequency (%)
1
100.0%

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Interactions

2023-12-12T11:55:08.670612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:55:16.102789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자료실출판년
자료실1.0000.126
출판년0.1261.000
2023-12-12T11:55:16.236415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출판년자료실
출판년1.0000.075
자료실0.0751.000

Missing values

2023-12-12T11:55:08.873950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:55:09.126901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T11:55:09.317060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

소장처자료실등록번호서명저자출판사출판년청구기호Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11
56721구즉도서관종합자료실DGM000141443청소년 인문학 수업 : 공부와 삶을 연결하는 인문학 . 1 , 역사 예술 문학백상경제연구원한빛비즈2020342.1 백966ㅊ 1<NA><NA><NA><NA>
25186구즉도서관어린이실GM0000052843리더 : 성공한 위인들의 리더 방법김현민 글 ; 나일영 그림아이앤북2007199.1 미735ㅇ 1<NA><NA><NA><NA>
72572구즉도서관종합자료실GM0000090836유럽동화마을여행이형준 글·사진즐거운상상2009982.02 이787ㅇ<NA><NA><NA><NA>
4653구즉도서관어린이실DGM000119219노인과 소년박완서어린이작가정신2017813.8 박513ㄴ<NA><NA><NA><NA>
55563구즉도서관종합자료실DGM000139160들개를 위한 변론우재욱지성사2020527.41 우178ㄷ<NA><NA><NA><NA>
19187구즉도서관어린이실GM0000006445후박나무 우리집고은명 ; 김윤주창작과비평사2003082 창281ㅊ<NA><NA><NA><NA>
62372구즉도서관종합자료실GM0000032910인더풀오쿠다 히데오 지음 ; 양억관 옮김은행나무2005833.6 오776ㅇ<NA><NA><NA><NA>
6832구즉도서관어린이실DGM000123711Hey! Get off our trainBurningham, JohnBragonfly Books2012843 B966h<NA><NA><NA><NA>
3094구즉도서관어린이실DGM000114821Making Tens :Groups of GollywomplesBurstein, JohnWeekly Reader Early Learning Library2012410 B972m<NA><NA><NA><NA>
33515구즉도서관어린이실GM0000088538Twice as nicewritten by Margaret Allen ; illus. by Megan HalseyMoonjinmedia2007747 앨756T<NA><NA><NA><NA>
소장처자료실등록번호서명저자출판사출판년청구기호Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11
51675구즉도서관종합자료실DGM000131438나는 습관을 조금 바꾸기로 했다사사키 후미오쌤앤파커스2019325.211 사243ㄴ<NA><NA><NA><NA>
17173구즉도서관어린이실DGM000144160엔들링 . 3 , 오직 하나애플게이트, 캐서린가람어린이2021843 애751ㅇ 3<NA><NA><NA><NA>
31311구즉도서관어린이실GM0000082342찬성!미야니시 타츠야시공주니어2011808.9 네672ㅅ<NA><NA><NA><NA>
42333구즉도서관종합자료실DGM000108405박물관의 탄생풀로, 도미니크돌베개2014069 풀776ㅂ<NA><NA><NA><NA>
72154구즉도서관종합자료실GM0000089837달려라, 토끼지은이: 존 업다이크 ; 옮긴이: 정영목문학동네2011843.5 업184ㄷ<NA><NA><NA><NA>
40541구즉도서관어린이실GM0000105884슬픔을 꽉 안아줘마리 프랑신 에베르 글 ; 이자벨 말앙팡 그림 ; 임은경 옮김걸음동무2013863 에215ㅅ<NA><NA><NA><NA>
62629구즉도서관종합자료실GM0000035323김사량·허 준 외. 12김사량 외 엮음창비2005813.6082 이452ㅊ 12<NA><NA><NA><NA>
13826구즉도서관어린이실DGM000137154다 푼다 카카오프렌즈 : 재미Up 실력Up 수학 체험 만화 . 3 , 길이·들이·무게의 비교김혜성대원키즈2020410 김964ㄷ 3<NA><NA><NA><NA>
17125구즉도서관어린이실DGM000143992오경수의 비밀혜련다림2021813.8 혜354ㅇ<NA><NA><NA><NA>
59447구즉도서관종합자료실DGM000146865오늘부터 시작하는 탄소중립 : 기후위기 시대, 우리는 무엇을 입고 먹고 탈까권승문곰곰2022539.9 권593ㅇ<NA><NA><NA><NA>