Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells19
Missing cells (%)< 0.1%
Duplicate rows24
Duplicate rows (%)0.2%
Total size in memory703.1 KiB
Average record size in memory72.0 B

Variable types

Categorical4
Text4

Dataset

Description대구광역시 달서구 내 구립도서관에 배치되어 있는 도서보유목록입니다. (도서관명, 청구기호, 도서명, 저작자, 출판사, 발행연도 등)
URLhttps://www.data.go.kr/data/15100216/fileData.do

Alerts

도서관명 has constant value ""Constant
관리부서 has constant value ""Constant
기준일자 has constant value ""Constant
Dataset has 24 (0.2%) duplicate rowsDuplicates
발행연도 has a high cardinality: 51 distinct valuesHigh cardinality

Reproduction

Analysis started2023-12-12 09:05:50.811592
Analysis finished2023-12-12 09:05:54.001102
Duration3.19 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

도서관명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
도원도서관
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row도원도서관
2nd row도원도서관
3rd row도원도서관
4th row도원도서관
5th row도원도서관

Common Values

ValueCountFrequency (%)
도원도서관 10000
100.0%

Length

2023-12-12T18:05:54.101101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:05:54.220454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
도원도서관 10000
100.0%
Distinct9739
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:05:54.474651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length11.4671
Min length6

Characters and Unicode

Total characters114671
Distinct characters618
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9601 ?
Unique (%)96.0%

Sample

1st rowI 375.1-키897ㅎ
2nd row911.058-설38ㅅ
3rd row818-최66ㅇ
4th rowI 457.279-상52ㅇ-13
5th row813.6-정55ㅇ
ValueCountFrequency (%)
i 2227
 
17.6%
dv 316
 
2.5%
b 97
 
0.8%
688 62
 
0.5%
j 43
 
0.3%
688.2 14
 
0.1%
688.6 13
 
0.1%
320.8-통877ㅎ=2 12
 
0.1%
747-f954m 10
 
0.1%
808.9-아521ㅎ 9
 
0.1%
Other values (9726) 9885
77.9%
2023-12-12T18:05:55.072189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 12724
 
11.1%
8 10851
 
9.5%
1 8411
 
7.3%
2 7820
 
6.8%
. 7667
 
6.7%
3 7456
 
6.5%
4 6708
 
5.8%
5 6166
 
5.4%
6 6117
 
5.3%
9 5714
 
5.0%
Other values (608) 35037
30.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 68497
59.7%
Other Letter 18180
 
15.9%
Dash Punctuation 12724
 
11.1%
Other Punctuation 7668
 
6.7%
Uppercase Letter 3304
 
2.9%
Space Separator 2688
 
2.3%
Math Symbol 1135
 
1.0%
Lowercase Letter 437
 
0.4%
Open Punctuation 19
 
< 0.1%
Close Punctuation 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1753
 
9.6%
1080
 
5.9%
980
 
5.4%
895
 
4.9%
853
 
4.7%
659
 
3.6%
651
 
3.6%
562
 
3.1%
557
 
3.1%
548
 
3.0%
Other values (544) 9642
53.0%
Uppercase Letter
ValueCountFrequency (%)
I 2236
67.7%
D 337
 
10.2%
V 316
 
9.6%
B 129
 
3.9%
J 46
 
1.4%
S 32
 
1.0%
F 26
 
0.8%
C 25
 
0.8%
R 24
 
0.7%
H 19
 
0.6%
Other values (13) 114
 
3.5%
Lowercase Letter
ValueCountFrequency (%)
v 214
49.0%
m 30
 
6.9%
s 20
 
4.6%
r 17
 
3.9%
w 16
 
3.7%
c 16
 
3.7%
l 14
 
3.2%
a 13
 
3.0%
b 13
 
3.0%
d 12
 
2.7%
Other values (12) 72
 
16.5%
Decimal Number
ValueCountFrequency (%)
8 10851
15.8%
1 8411
12.3%
2 7820
11.4%
3 7456
10.9%
4 6708
9.8%
5 6166
9.0%
6 6117
8.9%
9 5714
8.3%
7 5652
8.3%
0 3602
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 7667
> 99.9%
, 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
[ 14
73.7%
( 5
 
26.3%
Close Punctuation
ValueCountFrequency (%)
] 14
73.7%
) 5
 
26.3%
Dash Punctuation
ValueCountFrequency (%)
- 12724
100.0%
Space Separator
ValueCountFrequency (%)
2688
100.0%
Math Symbol
ValueCountFrequency (%)
= 1135
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 92750
80.9%
Hangul 18180
 
15.9%
Latin 3741
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1753
 
9.6%
1080
 
5.9%
980
 
5.4%
895
 
4.9%
853
 
4.7%
659
 
3.6%
651
 
3.6%
562
 
3.1%
557
 
3.1%
548
 
3.0%
Other values (544) 9642
53.0%
Latin
ValueCountFrequency (%)
I 2236
59.8%
D 337
 
9.0%
V 316
 
8.4%
v 214
 
5.7%
B 129
 
3.4%
J 46
 
1.2%
S 32
 
0.9%
m 30
 
0.8%
F 26
 
0.7%
C 25
 
0.7%
Other values (35) 350
 
9.4%
Common
ValueCountFrequency (%)
- 12724
13.7%
8 10851
11.7%
1 8411
9.1%
2 7820
8.4%
. 7667
8.3%
3 7456
8.0%
4 6708
7.2%
5 6166
6.6%
6 6117
6.6%
9 5714
6.2%
Other values (9) 13116
14.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 96491
84.1%
Hangul 9692
 
8.5%
Compat Jamo 8488
 
7.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 12724
13.2%
8 10851
11.2%
1 8411
8.7%
2 7820
8.1%
. 7667
7.9%
3 7456
7.7%
4 6708
 
7.0%
5 6166
 
6.4%
6 6117
 
6.3%
9 5714
 
5.9%
Other values (54) 16857
17.5%
Compat Jamo
ValueCountFrequency (%)
1753
20.7%
1080
12.7%
980
11.5%
659
 
7.8%
651
 
7.7%
562
 
6.6%
557
 
6.6%
548
 
6.5%
424
 
5.0%
327
 
3.9%
Other values (9) 947
11.2%
Hangul
ValueCountFrequency (%)
895
 
9.2%
853
 
8.8%
368
 
3.8%
221
 
2.3%
188
 
1.9%
184
 
1.9%
157
 
1.6%
152
 
1.6%
136
 
1.4%
119
 
1.2%
Other values (525) 6419
66.2%
Distinct9810
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:05:55.542937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length150
Median length82
Mean length19.6141
Min length1

Characters and Unicode

Total characters196141
Distinct characters1680
Distinct categories16 ?
Distinct scripts6 ?
Distinct blocks15 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9629 ?
Unique (%)96.3%

Sample

1st row코뿔소 가죽은 왜 주름이 졌을까
2nd row(버림받은 왕자) 사도
3rd row(최윤희의)웃음 비타민 : 인생을 바꾸는 유쾌한 촌철살인 명언 719
4th row할아버지 원시 공룡. 2
5th row옆집의 영희 씨 : 정소연 소설집
ValueCountFrequency (%)
4403
 
8.5%
1 339
 
0.7%
장편소설 323
 
0.6%
2 313
 
0.6%
이야기 263
 
0.5%
211
 
0.4%
위한 209
 
0.4%
the 150
 
0.3%
나는 140
 
0.3%
우리 125
 
0.2%
Other values (22130) 45173
87.5%
2023-12-12T18:05:56.262392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42369
 
21.6%
3826
 
2.0%
: 3766
 
1.9%
3434
 
1.8%
2624
 
1.3%
2007
 
1.0%
1748
 
0.9%
1723
 
0.9%
1713
 
0.9%
1707
 
0.9%
Other values (1670) 131224
66.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 123702
63.1%
Space Separator 42369
 
21.6%
Lowercase Letter 11904
 
6.1%
Other Punctuation 7386
 
3.8%
Decimal Number 4435
 
2.3%
Uppercase Letter 2594
 
1.3%
Close Punctuation 1522
 
0.8%
Open Punctuation 1522
 
0.8%
Math Symbol 462
 
0.2%
Dash Punctuation 157
 
0.1%
Other values (6) 88
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3826
 
3.1%
3434
 
2.8%
2624
 
2.1%
2007
 
1.6%
1748
 
1.4%
1723
 
1.4%
1713
 
1.4%
1707
 
1.4%
1675
 
1.4%
1550
 
1.3%
Other values (1555) 101695
82.2%
Lowercase Letter
ValueCountFrequency (%)
e 1476
12.4%
o 1038
 
8.7%
i 987
 
8.3%
a 980
 
8.2%
n 835
 
7.0%
t 814
 
6.8%
r 809
 
6.8%
s 792
 
6.7%
h 526
 
4.4%
l 522
 
4.4%
Other values (16) 3125
26.3%
Uppercase Letter
ValueCountFrequency (%)
T 255
 
9.8%
S 250
 
9.6%
D 186
 
7.2%
A 163
 
6.3%
C 163
 
6.3%
E 154
 
5.9%
B 146
 
5.6%
M 131
 
5.1%
I 116
 
4.5%
L 99
 
3.8%
Other values (16) 931
35.9%
Other Punctuation
ValueCountFrequency (%)
: 3766
51.0%
. 1467
 
19.9%
, 1271
 
17.2%
! 378
 
5.1%
· 194
 
2.6%
' 129
 
1.7%
& 42
 
0.6%
% 30
 
0.4%
28
 
0.4%
" 16
 
0.2%
Other values (12) 65
 
0.9%
Decimal Number
ValueCountFrequency (%)
1 1180
26.6%
0 882
19.9%
2 776
17.5%
3 409
 
9.2%
5 280
 
6.3%
4 236
 
5.3%
6 197
 
4.4%
9 190
 
4.3%
7 143
 
3.2%
8 142
 
3.2%
Math Symbol
ValueCountFrequency (%)
= 364
78.8%
~ 38
 
8.2%
+ 25
 
5.4%
14
 
3.0%
< 5
 
1.1%
> 5
 
1.1%
4
 
0.9%
4
 
0.9%
× 2
 
0.4%
| 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1442
94.7%
] 77
 
5.1%
2
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1442
94.7%
[ 77
 
5.1%
2
 
0.1%
1
 
0.1%
Other Symbol
ValueCountFrequency (%)
3
42.9%
2
28.6%
1
 
14.3%
° 1
 
14.3%
Letter Number
ValueCountFrequency (%)
16
50.0%
10
31.2%
6
 
18.8%
Space Separator
ValueCountFrequency (%)
42369
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 157
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 44
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 123199
62.8%
Common 57909
29.5%
Latin 14530
 
7.4%
Han 492
 
0.3%
Hiragana 9
 
< 0.1%
Katakana 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3826
 
3.1%
3434
 
2.8%
2624
 
2.1%
2007
 
1.6%
1748
 
1.4%
1723
 
1.4%
1713
 
1.4%
1707
 
1.4%
1675
 
1.4%
1550
 
1.3%
Other values (1302) 101192
82.1%
Han
ValueCountFrequency (%)
14
 
2.8%
10
 
2.0%
9
 
1.8%
8
 
1.6%
8
 
1.6%
8
 
1.6%
8
 
1.6%
7
 
1.4%
7
 
1.4%
6
 
1.2%
Other values (234) 407
82.7%
Common
ValueCountFrequency (%)
42369
73.2%
: 3766
 
6.5%
. 1467
 
2.5%
) 1442
 
2.5%
( 1442
 
2.5%
, 1271
 
2.2%
1 1180
 
2.0%
0 882
 
1.5%
2 776
 
1.3%
3 409
 
0.7%
Other values (50) 2905
 
5.0%
Latin
ValueCountFrequency (%)
e 1476
 
10.2%
o 1038
 
7.1%
i 987
 
6.8%
a 980
 
6.7%
n 835
 
5.7%
t 814
 
5.6%
r 809
 
5.6%
s 792
 
5.5%
h 526
 
3.6%
l 522
 
3.6%
Other values (45) 5751
39.6%
Hiragana
ValueCountFrequency (%)
2
22.2%
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 123185
62.8%
ASCII 72112
36.8%
CJK 475
 
0.2%
None 274
 
0.1%
Number Forms 32
 
< 0.1%
CJK Compat Ideographs 17
 
< 0.1%
Compat Jamo 14
 
< 0.1%
Punctuation 10
 
< 0.1%
Hiragana 9
 
< 0.1%
Math Operators 4
 
< 0.1%
Other values (5) 9
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
42369
58.8%
: 3766
 
5.2%
e 1476
 
2.0%
. 1467
 
2.0%
) 1442
 
2.0%
( 1442
 
2.0%
, 1271
 
1.8%
1 1180
 
1.6%
o 1038
 
1.4%
i 987
 
1.4%
Other values (77) 15674
 
21.7%
Hangul
ValueCountFrequency (%)
3826
 
3.1%
3434
 
2.8%
2624
 
2.1%
2007
 
1.6%
1748
 
1.4%
1723
 
1.4%
1713
 
1.4%
1707
 
1.4%
1675
 
1.4%
1550
 
1.3%
Other values (1296) 101178
82.1%
None
ValueCountFrequency (%)
· 194
70.8%
28
 
10.2%
15
 
5.5%
14
 
5.1%
4
 
1.5%
3
 
1.1%
2
 
0.7%
2
 
0.7%
2
 
0.7%
2
 
0.7%
Other values (7) 8
 
2.9%
Number Forms
ValueCountFrequency (%)
16
50.0%
10
31.2%
6
 
18.8%
CJK
ValueCountFrequency (%)
14
 
2.9%
10
 
2.1%
9
 
1.9%
8
 
1.7%
8
 
1.7%
8
 
1.7%
8
 
1.7%
7
 
1.5%
7
 
1.5%
6
 
1.3%
Other values (222) 390
82.1%
Punctuation
ValueCountFrequency (%)
6
60.0%
2
 
20.0%
2
 
20.0%
Compat Jamo
ValueCountFrequency (%)
6
42.9%
2
 
14.3%
2
 
14.3%
2
 
14.3%
1
 
7.1%
1
 
7.1%
Math Operators
ValueCountFrequency (%)
4
100.0%
Misc Symbols
ValueCountFrequency (%)
3
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
3
17.6%
2
11.8%
2
11.8%
2
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (2) 2
11.8%
Hiragana
ValueCountFrequency (%)
2
22.2%
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Letterlike Symbols
ValueCountFrequency (%)
2
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Distinct8873
Distinct (%)88.8%
Missing12
Missing (%)0.1%
Memory size156.2 KiB
2023-12-12T18:05:57.021088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length118
Median length98
Mean length14.930416
Min length2

Characters and Unicode

Total characters149125
Distinct characters1107
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8148 ?
Unique (%)81.6%

Sample

1st row러디어드 키플링 글 ; 장 자크 프룬 그림 ; 함춘성 옮김
2nd row설민석 지음
3rd row최윤희 지음
4th row상상도깨비 글·기획 ; 토드랩 그림
5th row정소연 지음
ValueCountFrequency (%)
지음 6205
 
14.0%
6094
 
13.7%
옮김 3127
 
7.1%
그림 1496
 
3.4%
1293
 
2.9%
글·그림 405
 
0.9%
by 361
 
0.8%
283
 
0.6%
공]지음 275
 
0.6%
엮음 221
 
0.5%
Other values (13314) 24581
55.4%
2023-12-12T18:05:57.615949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35835
24.0%
7228
 
4.8%
6866
 
4.6%
; 5929
 
4.0%
5445
 
3.7%
3215
 
2.2%
2925
 
2.0%
2234
 
1.5%
2125
 
1.4%
1946
 
1.3%
Other values (1097) 75377
50.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 93306
62.6%
Space Separator 35835
 
24.0%
Lowercase Letter 8167
 
5.5%
Other Punctuation 8072
 
5.4%
Uppercase Letter 1912
 
1.3%
Close Punctuation 859
 
0.6%
Open Punctuation 858
 
0.6%
Decimal Number 53
 
< 0.1%
Dash Punctuation 34
 
< 0.1%
Math Symbol 23
 
< 0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7228
 
7.7%
6866
 
7.4%
5445
 
5.8%
3215
 
3.4%
2925
 
3.1%
2234
 
2.4%
2125
 
2.3%
1946
 
2.1%
1490
 
1.6%
1308
 
1.4%
Other values (1008) 58524
62.7%
Uppercase Letter
ValueCountFrequency (%)
S 193
 
10.1%
B 149
 
7.8%
A 141
 
7.4%
J 117
 
6.1%
M 116
 
6.1%
R 114
 
6.0%
D 113
 
5.9%
K 109
 
5.7%
C 98
 
5.1%
P 97
 
5.1%
Other values (16) 665
34.8%
Lowercase Letter
ValueCountFrequency (%)
e 898
11.0%
a 714
 
8.7%
r 651
 
8.0%
n 631
 
7.7%
t 620
 
7.6%
i 613
 
7.5%
l 558
 
6.8%
y 546
 
6.7%
o 504
 
6.2%
b 424
 
5.2%
Other values (15) 2008
24.6%
Other Punctuation
ValueCountFrequency (%)
; 5929
73.5%
, 895
 
11.1%
. 654
 
8.1%
· 530
 
6.6%
: 31
 
0.4%
& 15
 
0.2%
' 9
 
0.1%
3
 
< 0.1%
" 2
 
< 0.1%
/ 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 19
35.8%
2 10
18.9%
1 8
15.1%
8 3
 
5.7%
5 3
 
5.7%
4 3
 
5.7%
3 2
 
3.8%
6 2
 
3.8%
9 2
 
3.8%
7 1
 
1.9%
Open Punctuation
ValueCountFrequency (%)
[ 829
96.6%
( 22
 
2.6%
5
 
0.6%
2
 
0.2%
Close Punctuation
ValueCountFrequency (%)
] 829
96.5%
) 22
 
2.6%
6
 
0.7%
2
 
0.2%
Math Symbol
ValueCountFrequency (%)
> 10
43.5%
< 10
43.5%
= 2
 
8.7%
+ 1
 
4.3%
Modifier Symbol
ValueCountFrequency (%)
´ 4
80.0%
1
 
20.0%
Space Separator
ValueCountFrequency (%)
35835
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 34
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 93127
62.4%
Common 45740
30.7%
Latin 10079
 
6.8%
Han 179
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7228
 
7.8%
6866
 
7.4%
5445
 
5.8%
3215
 
3.5%
2925
 
3.1%
2234
 
2.4%
2125
 
2.3%
1946
 
2.1%
1490
 
1.6%
1308
 
1.4%
Other values (891) 58345
62.7%
Han
ValueCountFrequency (%)
10
 
5.6%
9
 
5.0%
4
 
2.2%
4
 
2.2%
4
 
2.2%
4
 
2.2%
3
 
1.7%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (107) 132
73.7%
Latin
ValueCountFrequency (%)
e 898
 
8.9%
a 714
 
7.1%
r 651
 
6.5%
n 631
 
6.3%
t 620
 
6.2%
i 613
 
6.1%
l 558
 
5.5%
y 546
 
5.4%
o 504
 
5.0%
b 424
 
4.2%
Other values (41) 3920
38.9%
Common
ValueCountFrequency (%)
35835
78.3%
; 5929
 
13.0%
, 895
 
2.0%
[ 829
 
1.8%
] 829
 
1.8%
. 654
 
1.4%
· 530
 
1.2%
- 34
 
0.1%
: 31
 
0.1%
) 22
 
< 0.1%
Other values (28) 152
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 93122
62.4%
ASCII 55265
37.1%
None 553
 
0.4%
CJK 165
 
0.1%
CJK Compat Ideographs 14
 
< 0.1%
Compat Jamo 5
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
35835
64.8%
; 5929
 
10.7%
e 898
 
1.6%
, 895
 
1.6%
[ 829
 
1.5%
] 829
 
1.5%
a 714
 
1.3%
. 654
 
1.2%
r 651
 
1.2%
n 631
 
1.1%
Other values (70) 7400
 
13.4%
Hangul
ValueCountFrequency (%)
7228
 
7.8%
6866
 
7.4%
5445
 
5.8%
3215
 
3.5%
2925
 
3.1%
2234
 
2.4%
2125
 
2.3%
1946
 
2.1%
1490
 
1.6%
1308
 
1.4%
Other values (890) 58340
62.6%
None
ValueCountFrequency (%)
· 530
95.8%
6
 
1.1%
5
 
0.9%
´ 4
 
0.7%
3
 
0.5%
2
 
0.4%
2
 
0.4%
1
 
0.2%
CJK
ValueCountFrequency (%)
10
 
6.1%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
3
 
1.8%
3
 
1.8%
3
 
1.8%
3
 
1.8%
3
 
1.8%
Other values (101) 124
75.2%
CJK Compat Ideographs
ValueCountFrequency (%)
9
64.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Compat Jamo
ValueCountFrequency (%)
5
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Distinct2996
Distinct (%)30.0%
Missing7
Missing (%)0.1%
Memory size156.2 KiB
2023-12-12T18:05:57.899109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length29
Mean length4.679976
Min length1

Characters and Unicode

Total characters46767
Distinct characters809
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1703 ?
Unique (%)17.0%

Sample

1st row블루앤트리
2nd row휴먼큐브
3rd row원앤원북스
4th row어깨동무
5th row창비
ValueCountFrequency (%)
문학동네 173
 
1.6%
민음사 150
 
1.4%
창비 127
 
1.2%
비룡소 102
 
1.0%
김영사 88
 
0.8%
교원 84
 
0.8%
시공주니어 75
 
0.7%
위즈덤하우스 72
 
0.7%
황금가지 67
 
0.6%
열린책들 66
 
0.6%
Other values (3057) 9511
90.5%
2023-12-12T18:05:58.344917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1753
 
3.7%
1566
 
3.3%
1030
 
2.2%
996
 
2.1%
827
 
1.8%
754
 
1.6%
690
 
1.5%
634
 
1.4%
o 604
 
1.3%
574
 
1.2%
Other values (799) 37339
79.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39655
84.8%
Lowercase Letter 4157
 
8.9%
Uppercase Letter 1552
 
3.3%
Space Separator 525
 
1.1%
Close Punctuation 237
 
0.5%
Open Punctuation 237
 
0.5%
Decimal Number 202
 
0.4%
Other Punctuation 185
 
0.4%
Dash Punctuation 8
 
< 0.1%
Math Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1753
 
4.4%
1566
 
3.9%
1030
 
2.6%
996
 
2.5%
827
 
2.1%
754
 
1.9%
690
 
1.7%
634
 
1.6%
574
 
1.4%
545
 
1.4%
Other values (720) 30286
76.4%
Lowercase Letter
ValueCountFrequency (%)
o 604
14.5%
s 410
9.9%
e 395
 
9.5%
i 341
 
8.2%
n 339
 
8.2%
r 284
 
6.8%
a 271
 
6.5%
k 226
 
5.4%
l 149
 
3.6%
d 139
 
3.3%
Other values (16) 999
24.0%
Uppercase Letter
ValueCountFrequency (%)
B 236
15.2%
M 126
 
8.1%
P 119
 
7.7%
K 114
 
7.3%
H 107
 
6.9%
S 103
 
6.6%
R 79
 
5.1%
E 73
 
4.7%
C 71
 
4.6%
D 67
 
4.3%
Other values (16) 457
29.4%
Other Punctuation
ValueCountFrequency (%)
: 58
31.4%
& 55
29.7%
. 19
 
10.3%
16
 
8.6%
· 15
 
8.1%
; 11
 
5.9%
' 4
 
2.2%
, 4
 
2.2%
# 2
 
1.1%
/ 1
 
0.5%
Decimal Number
ValueCountFrequency (%)
1 79
39.1%
2 75
37.1%
0 33
16.3%
4 4
 
2.0%
6 3
 
1.5%
5 2
 
1.0%
3 2
 
1.0%
8 2
 
1.0%
7 2
 
1.0%
Close Punctuation
ValueCountFrequency (%)
) 165
69.6%
] 72
30.4%
Open Punctuation
ValueCountFrequency (%)
( 165
69.6%
[ 72
30.4%
Space Separator
ValueCountFrequency (%)
525
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Math Symbol
ValueCountFrequency (%)
+ 7
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39536
84.5%
Latin 5709
 
12.2%
Common 1403
 
3.0%
Han 119
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1753
 
4.4%
1566
 
4.0%
1030
 
2.6%
996
 
2.5%
827
 
2.1%
754
 
1.9%
690
 
1.7%
634
 
1.6%
574
 
1.5%
545
 
1.4%
Other values (663) 30167
76.3%
Han
ValueCountFrequency (%)
16
 
13.4%
12
 
10.1%
7
 
5.9%
7
 
5.9%
4
 
3.4%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
2
 
1.7%
Other values (47) 59
49.6%
Latin
ValueCountFrequency (%)
o 604
 
10.6%
s 410
 
7.2%
e 395
 
6.9%
i 341
 
6.0%
n 339
 
5.9%
r 284
 
5.0%
a 271
 
4.7%
B 236
 
4.1%
k 226
 
4.0%
l 149
 
2.6%
Other values (42) 2454
43.0%
Common
ValueCountFrequency (%)
525
37.4%
) 165
 
11.8%
( 165
 
11.8%
1 79
 
5.6%
2 75
 
5.3%
] 72
 
5.1%
[ 72
 
5.1%
: 58
 
4.1%
& 55
 
3.9%
0 33
 
2.4%
Other values (17) 104
 
7.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39529
84.5%
ASCII 7081
 
15.1%
CJK 119
 
0.3%
None 31
 
0.1%
Compat Jamo 7
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1753
 
4.4%
1566
 
4.0%
1030
 
2.6%
996
 
2.5%
827
 
2.1%
754
 
1.9%
690
 
1.7%
634
 
1.6%
574
 
1.5%
545
 
1.4%
Other values (658) 30160
76.3%
ASCII
ValueCountFrequency (%)
o 604
 
8.5%
525
 
7.4%
s 410
 
5.8%
e 395
 
5.6%
i 341
 
4.8%
n 339
 
4.8%
r 284
 
4.0%
a 271
 
3.8%
B 236
 
3.3%
k 226
 
3.2%
Other values (67) 3450
48.7%
None
ValueCountFrequency (%)
16
51.6%
· 15
48.4%
CJK
ValueCountFrequency (%)
16
 
13.4%
12
 
10.1%
7
 
5.9%
7
 
5.9%
4
 
3.4%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
2
 
1.7%
Other values (47) 59
49.6%
Compat Jamo
ValueCountFrequency (%)
3
42.9%
1
 
14.3%
1
 
14.3%
1
 
14.3%
1
 
14.3%

발행연도
Categorical

HIGH CARDINALITY 

Distinct51
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2005
914 
2006
851 
2007
760 
2004
 
595
2015
 
589
Other values (46)
6291 

Length

Max length9
Median length4
Mean length4.0096
Min length1

Unique

Unique13 ?
Unique (%)0.1%

Sample

1st row2013
2nd row2015
3rd row2010
4th row2018
5th row2015

Common Values

ValueCountFrequency (%)
2005 914
 
9.1%
2006 851
 
8.5%
2007 760
 
7.6%
2004 595
 
5.9%
2015 589
 
5.9%
2018 535
 
5.3%
2008 531
 
5.3%
2016 464
 
4.6%
2010 461
 
4.6%
2014 461
 
4.6%
Other values (41) 3839
38.4%

Length

2023-12-12T18:05:58.488260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2005 914
 
9.1%
2006 851
 
8.5%
2007 760
 
7.6%
2004 595
 
5.9%
2015 589
 
5.9%
2018 535
 
5.3%
2008 531
 
5.3%
2016 464
 
4.6%
2010 461
 
4.6%
2014 461
 
4.6%
Other values (38) 3839
38.4%

관리부서
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
평생교육과
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row평생교육과
2nd row평생교육과
3rd row평생교육과
4th row평생교육과
5th row평생교육과

Common Values

ValueCountFrequency (%)
평생교육과 10000
100.0%

Length

2023-12-12T18:05:58.599819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:05:58.684901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
평생교육과 10000
100.0%

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-03-01
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-03-01
2nd row2023-03-01
3rd row2023-03-01
4th row2023-03-01
5th row2023-03-01

Common Values

ValueCountFrequency (%)
2023-03-01 10000
100.0%

Length

2023-12-12T18:05:58.779309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:05:58.910005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-03-01 10000
100.0%

Missing values

2023-12-12T18:05:53.554592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:05:53.739746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T18:05:53.898838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

도서관명청구기호도서명저작자출판사발행연도관리부서기준일자
66124도원도서관I 375.1-키897ㅎ코뿔소 가죽은 왜 주름이 졌을까러디어드 키플링 글 ; 장 자크 프룬 그림 ; 함춘성 옮김블루앤트리2013평생교육과2023-03-01
56180도원도서관911.058-설38ㅅ(버림받은 왕자) 사도설민석 지음휴먼큐브2015평생교육과2023-03-01
46383도원도서관818-최66ㅇ(최윤희의)웃음 비타민 : 인생을 바꾸는 유쾌한 촌철살인 명언 719최윤희 지음원앤원북스2010평생교육과2023-03-01
69142도원도서관I 457.279-상52ㅇ-13할아버지 원시 공룡. 2상상도깨비 글·기획 ; 토드랩 그림어깨동무2018평생교육과2023-03-01
41110도원도서관813.6-정55ㅇ옆집의 영희 씨 : 정소연 소설집정소연 지음창비2015평생교육과2023-03-01
53850도원도서관863-루62ㄴ내 생애의 아이들가브리엘 루아 지음 ; 김화영 옮김현대문학2006평생교육과2023-03-01
51834도원도서관843-챈27ㅎ호수의 여인레이먼드 챈들러 지음 ; 박현주 옮김북하우스2004평생교육과2023-03-01
45297도원도서관818-박54ㅂ별은 스스로 빛나지 않는다 : 스타를 부탁해박성혜 지음씨네21북스2010평생교육과2023-03-01
20090도원도서관510.19-헤295ㅇ아무도 죽지 않는 세상 : 트랜스휴머니즘의 현재와 미래이브 헤롤드 지음 ; 강병철 옮김꿈꿀자유(꿈꿀자유 서울의학서적)2020평생교육과2023-03-01
70286도원도서관I 691-시34시-1(신나게 두뇌회전!)시멘토 똑똑하고 기발한 미로찾기. 1시멘토 교육연구소 지음시멘토2020평생교육과2023-03-01
도서관명청구기호도서명저작자출판사발행연도관리부서기준일자
75058도원도서관I 813.8-김54ㅇ웃음이 퐁퐁퐁김성은 글 ; 조미자 그림천개의바람2019평생교육과2023-03-01
64524도원도서관I 375.1-노293ㄱ-35노래하는 솜사탕. 35 : 부릉부릉 어떤 차일까야마모토 쇼우조 글 ; 이치하라 쥰 그림 ; 권순주 옮김교원2008평생교육과2023-03-01
77424도원도서관I 833.8-후876ㄴ내가 먹어 줄게후쿠베 아키히로 글 ; 오노 코헤이 그림 ; 사과나무 옮김크레용하우스2013평생교육과2023-03-01
75767도원도서관I 813.8-안194ㅅ수박 수영장안녕달 글.그림창비2015평생교육과2023-03-01
47855도원도서관833.6-미63ㄱ괴수전미야베 미유키 지음 ; 이규원 옮김북스피어2015평생교육과2023-03-01
63600도원도서관I 219-지58-23돌이 된 왕비 니오베토머스 불핀치 원작 ; 이붕 엮음 ; 최현주 그림한국톨스토이2014평생교육과2023-03-01
74820도원도서관I 813.8-강38ㄴ=2나는 너무나 소중해강민석 글 ; 김문수 그림열린생각2006평생교육과2023-03-01
58290도원도서관980.24-큐298-v.35아일랜드패트리샤 레비 지음 ; 이동진 옮김휘슬러2005평생교육과2023-03-01
49313도원도서관834-하58ㄴ나답게 살다 나답게 죽고 싶다 : 품위 있는 죽음을 위한 종활 일기하시다 스가코 지음 ; 김정환 옮김21세기북스:북이십일 21세기북스2018평생교육과2023-03-01
80727도원도서관I 980-세14ㅇ-39돈조아 임금님의 퀴즈김미연 글 ; 심보영 그림이수2013평생교육과2023-03-01

Duplicate rows

Most frequently occurring

도서관명청구기호도서명저작자출판사발행연도관리부서기준일자# duplicates
7도원도서관512.57-최68ㅇ5분의 기적 EFT : 건강 행복 성공의 테크닉최인원 ; 김원영 ; 정유진 [공]지음정신세계사2008평생교육과2023-03-013
0도원도서관004.46-리887ㅎ하드디스크 포맷+복구 지존에 도전하자 : 초보자도 100%성공하는 엄청 쉬운 윈도우 설치리트머스 저영진닷컴2005평생교육과2023-03-012
1도원도서관004.76-김64ㅇ(모든 걸 알켜주마) 일러스트레이터CS :, 아마추어에서 그래픽 전문가로의 도약김영원 외 지음제우미디어2004평생교육과2023-03-012
2도원도서관005.72-송15ㄷ(예제로 배우는)드림위버4송관호 ; 마현철 [공]지음글로벌2001평생교육과2023-03-012
3도원도서관005.756-정54ㄷ데이터베이스 개론과 실습 : ERwin과 오라클정선호 지음한빛미디어2004평생교육과2023-03-012
4도원도서관327.856-부72ㅈ(경제적 자유를 위한)주식투자 시크릿부자아빠 지음모든국민은주주다2012평생교육과2023-03-012
5도원도서관331.54-카198ㅅ생각하지 않는 사람들니콜라스 카 지음 ; 최지향 옮김청림출판2011평생교육과2023-03-012
6도원도서관410-민74ㅅ(중학교 필수공식이 통째로 외워지는) 수학쇼 show민정범 지음살림Math2007평생교육과2023-03-012
8도원도서관592.3-안55ㅇ(너무 예뻐 꼭 한번 입고 싶은) 영화 속 옷+소품 만들기안소영 지음미디어윌2009평생교육과2023-03-012
9도원도서관592.3-이68ㄹ리넨으로 만드는 엄마와 딸의 커플룩 36이인자 저Handis(핸디스 소잉스토리)2019평생교육과2023-03-012