Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells147
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory556.6 KiB
Average record size in memory57.0 B

Variable types

Numeric1
Text5

Dataset

Description인천광역시 인재개발원 자료실 도서 목록 현황 자료 데이터 입니다.(청구 기호, 서명, 저작자, 발행자, 발행년 등의 항목을 제공합니다.)
URLhttps://www.data.go.kr/data/15052866/fileData.do

Alerts

저작자 has 132 (1.3%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:22:49.404706
Analysis finished2023-12-12 09:22:52.742578
Duration3.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8110.6155
Minimum1
Maximum16190
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:22:52.835102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile814.95
Q14083.75
median8092.5
Q312144.25
95-th percentile15378.05
Maximum16190
Range16189
Interquartile range (IQR)8060.5

Descriptive statistics

Standard deviation4671.5974
Coefficient of variation (CV)0.57598555
Kurtosis-1.1997509
Mean8110.6155
Median Absolute Deviation (MAD)4033
Skewness-0.0010680358
Sum81106155
Variance21823822
MonotonicityNot monotonic
2023-12-12T18:22:53.022171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15 1
 
< 0.1%
1458 1
 
< 0.1%
10343 1
 
< 0.1%
1176 1
 
< 0.1%
11503 1
 
< 0.1%
12431 1
 
< 0.1%
15452 1
 
< 0.1%
9809 1
 
< 0.1%
8213 1
 
< 0.1%
9807 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
10 1
< 0.1%
13 1
< 0.1%
15 1
< 0.1%
16 1
< 0.1%
ValueCountFrequency (%)
16190 1
< 0.1%
16188 1
< 0.1%
16187 1
< 0.1%
16185 1
< 0.1%
16184 1
< 0.1%
16182 1
< 0.1%
16181 1
< 0.1%
16177 1
< 0.1%
16176 1
< 0.1%
16174 1
< 0.1%
Distinct9517
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:22:53.351019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length20
Mean length9.5446
Min length1

Characters and Unicode

Total characters95446
Distinct characters529
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9197 ?
Unique (%)92.0%

Sample

1st row813-김36ㄴ-2
2nd row452-하75ㅋ
3rd row713-오76ㄱ
4th row594-고94ㄱ
5th row813.6-김74ㄱ-1
ValueCountFrequency (%)
408 36
 
0.4%
아동 20
 
0.2%
608-편78ㅇ 13
 
0.1%
811.32-신14ㅇ 9
 
0.1%
9
 
0.1%
470.8 8
 
0.1%
810.9-kㅇ 7
 
0.1%
811.33-김79ㅊ 7
 
0.1%
0정65ㅈ 6
 
0.1%
031-한16ㅎ 6
 
0.1%
Other values (9518) 9924
98.8%
2023-12-12T18:22:53.813353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 12087
12.7%
1 8761
 
9.2%
8 8111
 
8.5%
3 7866
 
8.2%
6 5665
 
5.9%
4 5627
 
5.9%
2 5547
 
5.8%
5 5496
 
5.8%
9 4683
 
4.9%
. 4088
 
4.3%
Other values (519) 27515
28.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 59128
61.9%
Other Letter 19475
 
20.4%
Dash Punctuation 12087
 
12.7%
Other Punctuation 4106
 
4.3%
Math Symbol 292
 
0.3%
Uppercase Letter 271
 
0.3%
Space Separator 45
 
< 0.1%
Lowercase Letter 30
 
< 0.1%
Open Punctuation 5
 
< 0.1%
Close Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1781
 
9.1%
1315
 
6.8%
1083
 
5.6%
1060
 
5.4%
975
 
5.0%
926
 
4.8%
880
 
4.5%
630
 
3.2%
630
 
3.2%
564
 
2.9%
Other values (460) 9631
49.5%
Uppercase Letter
ValueCountFrequency (%)
W 63
23.2%
K 41
15.1%
B 27
10.0%
A 19
 
7.0%
T 14
 
5.2%
E 13
 
4.8%
S 13
 
4.8%
C 12
 
4.4%
M 11
 
4.1%
H 10
 
3.7%
Other values (12) 48
17.7%
Lowercase Letter
ValueCountFrequency (%)
v 13
43.3%
c 3
 
10.0%
g 2
 
6.7%
w 2
 
6.7%
s 2
 
6.7%
o 2
 
6.7%
d 2
 
6.7%
y 1
 
3.3%
p 1
 
3.3%
l 1
 
3.3%
Decimal Number
ValueCountFrequency (%)
1 8761
14.8%
8 8111
13.7%
3 7866
13.3%
6 5665
9.6%
4 5627
9.5%
2 5547
9.4%
5 5496
9.3%
9 4683
7.9%
7 3926
6.6%
0 3446
 
5.8%
Other Punctuation
ValueCountFrequency (%)
. 4088
99.6%
: 13
 
0.3%
# 2
 
< 0.1%
, 2
 
< 0.1%
? 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 290
99.3%
~ 1
 
0.3%
1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 3
60.0%
[ 2
40.0%
Close Punctuation
ValueCountFrequency (%)
) 3
60.0%
] 2
40.0%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 12087
100.0%
Space Separator
ValueCountFrequency (%)
45
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 75668
79.3%
Hangul 19471
 
20.4%
Latin 303
 
0.3%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1781
 
9.1%
1315
 
6.8%
1083
 
5.6%
1060
 
5.4%
975
 
5.0%
926
 
4.8%
880
 
4.5%
630
 
3.2%
630
 
3.2%
564
 
2.9%
Other values (458) 9627
49.4%
Latin
ValueCountFrequency (%)
W 63
20.8%
K 41
13.5%
B 27
 
8.9%
A 19
 
6.3%
T 14
 
4.6%
E 13
 
4.3%
v 13
 
4.3%
S 13
 
4.3%
C 12
 
4.0%
M 11
 
3.6%
Other values (25) 77
25.4%
Common
ValueCountFrequency (%)
- 12087
16.0%
1 8761
11.6%
8 8111
10.7%
3 7866
10.4%
6 5665
7.5%
4 5627
7.4%
2 5547
7.3%
5 5496
7.3%
9 4683
 
6.2%
. 4088
 
5.4%
Other values (14) 7737
10.2%
Han
ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 75968
79.6%
Hangul 9926
 
10.4%
Compat Jamo 9545
 
10.0%
CJK 4
 
< 0.1%
Number Forms 2
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 12087
15.9%
1 8761
11.5%
8 8111
10.7%
3 7866
10.4%
6 5665
7.5%
4 5627
7.4%
2 5547
7.3%
5 5496
7.2%
9 4683
 
6.2%
. 4088
 
5.4%
Other values (46) 8037
10.6%
Compat Jamo
ValueCountFrequency (%)
1781
18.7%
1315
13.8%
1083
11.3%
926
9.7%
880
9.2%
630
 
6.6%
630
 
6.6%
564
 
5.9%
526
 
5.5%
410
 
4.3%
Other values (9) 800
8.4%
Hangul
ValueCountFrequency (%)
1060
 
10.7%
975
 
9.8%
420
 
4.2%
350
 
3.5%
267
 
2.7%
249
 
2.5%
246
 
2.5%
138
 
1.4%
131
 
1.3%
129
 
1.3%
Other values (439) 5961
60.1%
CJK
ValueCountFrequency (%)
2
50.0%
2
50.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

서명
Text

Distinct9436
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:22:54.252019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length187
Median length109
Mean length13.6371
Min length1

Characters and Unicode

Total characters136371
Distinct characters1618
Distinct categories15 ?
Distinct scripts4 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9175 ?
Unique (%)91.8%

Sample

1st row눈물꽃2 / . 2
2nd row퀴즈! 과학상식
3rd row공부가재미있어지는 교과서속담
4th row고형욱의맛있는이야기 /
5th row길없는 사람들. 1
ValueCountFrequency (%)
5990
 
18.1%
1 570
 
1.7%
2 378
 
1.1%
3 156
 
0.5%
장편소설 97
 
0.3%
4 97
 
0.3%
5 90
 
0.3%
이야기 89
 
0.3%
위한 73
 
0.2%
6 70
 
0.2%
Other values (15662) 25511
77.0%
2023-12-12T18:22:54.935401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23789
 
17.4%
/ 3863
 
2.8%
2971
 
2.2%
. 2385
 
1.7%
2282
 
1.7%
1711
 
1.3%
1708
 
1.3%
1 1618
 
1.2%
1392
 
1.0%
1349
 
1.0%
Other values (1608) 93303
68.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 91893
67.4%
Space Separator 23789
 
17.4%
Other Punctuation 8597
 
6.3%
Decimal Number 5308
 
3.9%
Lowercase Letter 3920
 
2.9%
Uppercase Letter 1022
 
0.7%
Open Punctuation 683
 
0.5%
Close Punctuation 682
 
0.5%
Dash Punctuation 306
 
0.2%
Math Symbol 161
 
0.1%
Other values (5) 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2971
 
3.2%
2282
 
2.5%
1711
 
1.9%
1708
 
1.9%
1392
 
1.5%
1349
 
1.5%
1326
 
1.4%
1301
 
1.4%
1244
 
1.4%
1186
 
1.3%
Other values (1505) 75423
82.1%
Lowercase Letter
ValueCountFrequency (%)
e 434
11.1%
o 378
 
9.6%
n 352
 
9.0%
a 313
 
8.0%
i 311
 
7.9%
t 282
 
7.2%
r 241
 
6.1%
s 239
 
6.1%
l 173
 
4.4%
h 167
 
4.3%
Other values (16) 1030
26.3%
Uppercase Letter
ValueCountFrequency (%)
E 88
 
8.6%
S 75
 
7.3%
O 73
 
7.1%
T 71
 
6.9%
W 62
 
6.1%
I 60
 
5.9%
A 60
 
5.9%
C 54
 
5.3%
R 46
 
4.5%
H 44
 
4.3%
Other values (16) 389
38.1%
Other Punctuation
ValueCountFrequency (%)
/ 3863
44.9%
. 2385
27.7%
: 1174
 
13.7%
, 683
 
7.9%
? 184
 
2.1%
! 132
 
1.5%
· 84
 
1.0%
' 32
 
0.4%
% 20
 
0.2%
& 18
 
0.2%
Other values (9) 22
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 1618
30.5%
2 1023
19.3%
0 698
13.1%
3 451
 
8.5%
4 354
 
6.7%
5 335
 
6.3%
9 305
 
5.7%
6 199
 
3.7%
7 172
 
3.2%
8 153
 
2.9%
Math Symbol
ValueCountFrequency (%)
= 133
82.6%
~ 10
 
6.2%
9
 
5.6%
+ 5
 
3.1%
× 2
 
1.2%
< 1
 
0.6%
> 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 673
98.5%
[ 9
 
1.3%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 671
98.4%
] 10
 
1.5%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Space Separator
ValueCountFrequency (%)
23789
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 306
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 90535
66.4%
Common 39531
29.0%
Latin 4947
 
3.6%
Han 1358
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2971
 
3.3%
2282
 
2.5%
1711
 
1.9%
1708
 
1.9%
1392
 
1.5%
1349
 
1.5%
1326
 
1.5%
1301
 
1.4%
1244
 
1.4%
1186
 
1.3%
Other values (1162) 74065
81.8%
Han
ValueCountFrequency (%)
35
 
2.6%
32
 
2.4%
32
 
2.4%
29
 
2.1%
26
 
1.9%
24
 
1.8%
24
 
1.8%
23
 
1.7%
21
 
1.5%
20
 
1.5%
Other values (333) 1092
80.4%
Latin
ValueCountFrequency (%)
e 434
 
8.8%
o 378
 
7.6%
n 352
 
7.1%
a 313
 
6.3%
i 311
 
6.3%
t 282
 
5.7%
r 241
 
4.9%
s 239
 
4.8%
l 173
 
3.5%
h 167
 
3.4%
Other values (45) 2057
41.6%
Common
ValueCountFrequency (%)
23789
60.2%
/ 3863
 
9.8%
. 2385
 
6.0%
1 1618
 
4.1%
: 1174
 
3.0%
2 1023
 
2.6%
0 698
 
1.8%
, 683
 
1.7%
( 673
 
1.7%
) 671
 
1.7%
Other values (38) 2954
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 90525
66.4%
ASCII 44365
32.5%
CJK 1330
 
1.0%
None 94
 
0.1%
CJK Compat Ideographs 28
 
< 0.1%
Compat Jamo 10
 
< 0.1%
Math Operators 9
 
< 0.1%
Number Forms 5
 
< 0.1%
Punctuation 3
 
< 0.1%
Misc Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23789
53.6%
/ 3863
 
8.7%
. 2385
 
5.4%
1 1618
 
3.6%
: 1174
 
2.6%
2 1023
 
2.3%
0 698
 
1.6%
, 683
 
1.5%
( 673
 
1.5%
) 671
 
1.5%
Other values (78) 7788
 
17.6%
Hangul
ValueCountFrequency (%)
2971
 
3.3%
2282
 
2.5%
1711
 
1.9%
1708
 
1.9%
1392
 
1.5%
1349
 
1.5%
1326
 
1.5%
1301
 
1.4%
1244
 
1.4%
1186
 
1.3%
Other values (1154) 74055
81.8%
None
ValueCountFrequency (%)
· 84
89.4%
4
 
4.3%
× 2
 
2.1%
1
 
1.1%
1
 
1.1%
1
 
1.1%
1
 
1.1%
CJK
ValueCountFrequency (%)
35
 
2.6%
32
 
2.4%
32
 
2.4%
29
 
2.2%
26
 
2.0%
24
 
1.8%
24
 
1.8%
23
 
1.7%
21
 
1.6%
20
 
1.5%
Other values (319) 1064
80.0%
Math Operators
ValueCountFrequency (%)
9
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
6
21.4%
5
17.9%
4
14.3%
3
10.7%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
Other values (4) 4
14.3%
Number Forms
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Compat Jamo
ValueCountFrequency (%)
2
20.0%
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
Punctuation
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

저작자
Text

MISSING 

Distinct7434
Distinct (%)75.3%
Missing132
Missing (%)1.3%
Memory size156.2 KiB
2023-12-12T18:22:55.336047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length86
Median length66
Mean length7.539927
Min length1

Characters and Unicode

Total characters74404
Distinct characters1235
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6418 ?
Unique (%)65.0%

Sample

1st row김민기.
2nd row하종준 글, 오수진 그림
3rd row오주영
4th row고형욱
5th row김정현
ValueCountFrequency (%)
1867
 
9.2%
지음 1375
 
6.8%
옮김 528
 
2.6%
그림 370
 
1.8%
302
 
1.5%
지은이 300
 
1.5%
290
 
1.4%
편집부 164
 
0.8%
107
 
0.5%
엮음 81
 
0.4%
Other values (9068) 14830
73.4%
2023-12-12T18:22:55.884969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10898
 
14.6%
2345
 
3.2%
2148
 
2.9%
1951
 
2.6%
; 1864
 
2.5%
1535
 
2.1%
. 1209
 
1.6%
991
 
1.3%
880
 
1.2%
847
 
1.1%
Other values (1225) 49736
66.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56622
76.1%
Space Separator 10898
 
14.6%
Other Punctuation 4198
 
5.6%
Lowercase Letter 692
 
0.9%
Uppercase Letter 687
 
0.9%
Open Punctuation 612
 
0.8%
Close Punctuation 612
 
0.8%
Decimal Number 64
 
0.1%
Math Symbol 14
 
< 0.1%
Dash Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2345
 
4.1%
2148
 
3.8%
1951
 
3.4%
1535
 
2.7%
991
 
1.8%
880
 
1.6%
847
 
1.5%
770
 
1.4%
624
 
1.1%
574
 
1.0%
Other values (1147) 43957
77.6%
Uppercase Letter
ValueCountFrequency (%)
K 88
12.8%
B 80
11.6%
S 71
10.3%
C 59
 
8.6%
J 49
 
7.1%
A 46
 
6.7%
R 39
 
5.7%
M 34
 
4.9%
E 31
 
4.5%
H 23
 
3.3%
Other values (14) 167
24.3%
Lowercase Letter
ValueCountFrequency (%)
a 80
11.6%
o 70
 
10.1%
i 63
 
9.1%
e 59
 
8.5%
r 48
 
6.9%
n 46
 
6.6%
s 43
 
6.2%
h 35
 
5.1%
t 31
 
4.5%
u 30
 
4.3%
Other values (14) 187
27.0%
Other Punctuation
ValueCountFrequency (%)
; 1864
44.4%
. 1209
28.8%
, 767
18.3%
: 252
 
6.0%
· 50
 
1.2%
18
 
0.4%
/ 17
 
0.4%
& 17
 
0.4%
! 2
 
< 0.1%
? 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 17
26.6%
2 13
20.3%
9 6
 
9.4%
4 6
 
9.4%
0 6
 
9.4%
3 5
 
7.8%
8 4
 
6.2%
7 3
 
4.7%
5 2
 
3.1%
6 2
 
3.1%
Open Punctuation
ValueCountFrequency (%)
[ 607
99.2%
( 3
 
0.5%
2
 
0.3%
Close Punctuation
ValueCountFrequency (%)
] 607
99.2%
) 3
 
0.5%
2
 
0.3%
Math Symbol
ValueCountFrequency (%)
> 7
50.0%
< 7
50.0%
Space Separator
ValueCountFrequency (%)
10898
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 55460
74.5%
Common 16403
 
22.0%
Latin 1379
 
1.9%
Han 1162
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2345
 
4.2%
2148
 
3.9%
1951
 
3.5%
1535
 
2.8%
991
 
1.8%
880
 
1.6%
847
 
1.5%
770
 
1.4%
624
 
1.1%
574
 
1.0%
Other values (824) 42795
77.2%
Han
ValueCountFrequency (%)
106
 
9.1%
90
 
7.7%
31
 
2.7%
23
 
2.0%
23
 
2.0%
22
 
1.9%
22
 
1.9%
21
 
1.8%
21
 
1.8%
20
 
1.7%
Other values (313) 783
67.4%
Latin
ValueCountFrequency (%)
K 88
 
6.4%
B 80
 
5.8%
a 80
 
5.8%
S 71
 
5.1%
o 70
 
5.1%
i 63
 
4.6%
C 59
 
4.3%
e 59
 
4.3%
J 49
 
3.6%
r 48
 
3.5%
Other values (38) 712
51.6%
Common
ValueCountFrequency (%)
10898
66.4%
; 1864
 
11.4%
. 1209
 
7.4%
, 767
 
4.7%
[ 607
 
3.7%
] 607
 
3.7%
: 252
 
1.5%
· 50
 
0.3%
18
 
0.1%
/ 17
 
0.1%
Other values (20) 114
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 55458
74.5%
ASCII 17710
 
23.8%
CJK 1127
 
1.5%
None 72
 
0.1%
CJK Compat Ideographs 35
 
< 0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10898
61.5%
; 1864
 
10.5%
. 1209
 
6.8%
, 767
 
4.3%
[ 607
 
3.4%
] 607
 
3.4%
: 252
 
1.4%
K 88
 
0.5%
B 80
 
0.5%
a 80
 
0.5%
Other values (64) 1258
 
7.1%
Hangul
ValueCountFrequency (%)
2345
 
4.2%
2148
 
3.9%
1951
 
3.5%
1535
 
2.8%
991
 
1.8%
880
 
1.6%
847
 
1.5%
770
 
1.4%
624
 
1.1%
574
 
1.0%
Other values (823) 42793
77.2%
CJK
ValueCountFrequency (%)
106
 
9.4%
90
 
8.0%
31
 
2.8%
23
 
2.0%
22
 
2.0%
22
 
2.0%
21
 
1.9%
21
 
1.9%
20
 
1.8%
17
 
1.5%
Other values (302) 754
66.9%
None
ValueCountFrequency (%)
· 50
69.4%
18
 
25.0%
2
 
2.8%
2
 
2.8%
CJK Compat Ideographs
ValueCountFrequency (%)
23
65.7%
2
 
5.7%
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Distinct3293
Distinct (%)33.0%
Missing7
Missing (%)0.1%
Memory size156.2 KiB
2023-12-12T18:22:56.285995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length22
Mean length4.8160712
Min length1

Characters and Unicode

Total characters48127
Distinct characters867
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1974 ?
Unique (%)19.8%

Sample

1st row은행나무,
2nd row글송이
3rd row은하수미디어
4th row롱셀러,
5th row문이당
ValueCountFrequency (%)
문학동네 160
 
1.6%
민음사 153
 
1.5%
김영사 138
 
1.4%
21세기북스 99
 
1.0%
박영사 87
 
0.9%
위즈덤하우스 81
 
0.8%
웅진출판사 78
 
0.8%
시공사 72
 
0.7%
해냄 68
 
0.7%
예림당 66
 
0.6%
Other values (2943) 9209
90.2%
2023-12-12T18:22:56.815069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 4079
 
8.5%
2859
 
5.9%
1264
 
2.6%
1235
 
2.6%
1226
 
2.5%
1212
 
2.5%
799
 
1.7%
699
 
1.5%
697
 
1.4%
652
 
1.4%
Other values (857) 33405
69.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42554
88.4%
Other Punctuation 4188
 
8.7%
Lowercase Letter 369
 
0.8%
Decimal Number 337
 
0.7%
Uppercase Letter 326
 
0.7%
Space Separator 219
 
0.5%
Open Punctuation 67
 
0.1%
Close Punctuation 64
 
0.1%
Dash Punctuation 2
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2859
 
6.7%
1264
 
3.0%
1235
 
2.9%
1226
 
2.9%
1212
 
2.8%
799
 
1.9%
699
 
1.6%
697
 
1.6%
652
 
1.5%
556
 
1.3%
Other values (783) 31355
73.7%
Uppercase Letter
ValueCountFrequency (%)
B 79
24.2%
M 52
16.0%
K 30
 
9.2%
S 25
 
7.7%
C 15
 
4.6%
P 15
 
4.6%
I 13
 
4.0%
R 11
 
3.4%
N 11
 
3.4%
O 11
 
3.4%
Other values (14) 64
19.6%
Lowercase Letter
ValueCountFrequency (%)
o 51
13.8%
e 42
11.4%
i 32
 
8.7%
r 30
 
8.1%
s 28
 
7.6%
n 24
 
6.5%
k 22
 
6.0%
a 22
 
6.0%
b 19
 
5.1%
t 16
 
4.3%
Other values (14) 83
22.5%
Decimal Number
ValueCountFrequency (%)
1 132
39.2%
2 130
38.6%
8 16
 
4.7%
0 14
 
4.2%
6 9
 
2.7%
4 9
 
2.7%
9 8
 
2.4%
7 8
 
2.4%
3 8
 
2.4%
5 3
 
0.9%
Other Punctuation
ValueCountFrequency (%)
, 4079
97.4%
: 48
 
1.1%
& 39
 
0.9%
. 10
 
0.2%
5
 
0.1%
· 3
 
0.1%
/ 2
 
< 0.1%
@ 1
 
< 0.1%
; 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 62
92.5%
[ 5
 
7.5%
Close Punctuation
ValueCountFrequency (%)
) 58
90.6%
] 6
 
9.4%
Space Separator
ValueCountFrequency (%)
219
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41721
86.7%
Common 4878
 
10.1%
Han 833
 
1.7%
Latin 695
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2859
 
6.9%
1264
 
3.0%
1235
 
3.0%
1226
 
2.9%
1212
 
2.9%
799
 
1.9%
699
 
1.7%
697
 
1.7%
652
 
1.6%
556
 
1.3%
Other values (621) 30522
73.2%
Han
ValueCountFrequency (%)
124
 
14.9%
41
 
4.9%
40
 
4.8%
31
 
3.7%
27
 
3.2%
20
 
2.4%
19
 
2.3%
18
 
2.2%
18
 
2.2%
18
 
2.2%
Other values (152) 477
57.3%
Latin
ValueCountFrequency (%)
B 79
 
11.4%
M 52
 
7.5%
o 51
 
7.3%
e 42
 
6.0%
i 32
 
4.6%
K 30
 
4.3%
r 30
 
4.3%
s 28
 
4.0%
S 25
 
3.6%
n 24
 
3.5%
Other values (38) 302
43.5%
Common
ValueCountFrequency (%)
, 4079
83.6%
219
 
4.5%
1 132
 
2.7%
2 130
 
2.7%
( 62
 
1.3%
) 58
 
1.2%
: 48
 
1.0%
& 39
 
0.8%
8 16
 
0.3%
0 14
 
0.3%
Other values (16) 81
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41719
86.7%
ASCII 5565
 
11.6%
CJK 832
 
1.7%
None 8
 
< 0.1%
Compat Jamo 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 4079
73.3%
219
 
3.9%
1 132
 
2.4%
2 130
 
2.3%
B 79
 
1.4%
( 62
 
1.1%
) 58
 
1.0%
M 52
 
0.9%
o 51
 
0.9%
: 48
 
0.9%
Other values (62) 655
 
11.8%
Hangul
ValueCountFrequency (%)
2859
 
6.9%
1264
 
3.0%
1235
 
3.0%
1226
 
2.9%
1212
 
2.9%
799
 
1.9%
699
 
1.7%
697
 
1.7%
652
 
1.6%
556
 
1.3%
Other values (619) 30520
73.2%
CJK
ValueCountFrequency (%)
124
 
14.9%
41
 
4.9%
40
 
4.8%
31
 
3.7%
27
 
3.2%
20
 
2.4%
19
 
2.3%
18
 
2.2%
18
 
2.2%
18
 
2.2%
Other values (151) 476
57.2%
None
ValueCountFrequency (%)
5
62.5%
· 3
37.5%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct86
Distinct (%)0.9%
Missing8
Missing (%)0.1%
Memory size156.2 KiB
2023-12-12T18:22:57.064393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length4
Mean length4.010008
Min length3

Characters and Unicode

Total characters40068
Distinct characters35
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)0.3%

Sample

1st row2003
2nd row2007
3rd row2012
4th row2001
5th row2003
ValueCountFrequency (%)
2007 743
 
7.4%
2008 691
 
6.9%
2010 503
 
5.0%
2012 493
 
4.9%
2011 492
 
4.9%
2001 452
 
4.5%
2013 399
 
4.0%
1997 397
 
4.0%
2009 393
 
3.9%
2000 370
 
3.7%
Other values (63) 5059
50.6%
2023-12-12T18:22:57.477993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 11916
29.7%
2 8169
20.4%
1 7017
17.5%
9 6224
15.5%
8 2049
 
5.1%
7 1494
 
3.7%
3 892
 
2.2%
6 845
 
2.1%
4 719
 
1.8%
5 652
 
1.6%
Other values (25) 91
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 39977
99.8%
Other Letter 31
 
0.1%
Open Punctuation 27
 
0.1%
Close Punctuation 27
 
0.1%
Dash Punctuation 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
 
9.7%
3
 
9.7%
3
 
9.7%
3
 
9.7%
2
 
6.5%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Other values (10) 10
32.3%
Decimal Number
ValueCountFrequency (%)
0 11916
29.8%
2 8169
20.4%
1 7017
17.6%
9 6224
15.6%
8 2049
 
5.1%
7 1494
 
3.7%
3 892
 
2.2%
6 845
 
2.1%
4 719
 
1.8%
5 652
 
1.6%
Open Punctuation
ValueCountFrequency (%)
[ 23
85.2%
( 4
 
14.8%
Close Punctuation
ValueCountFrequency (%)
] 23
85.2%
) 4
 
14.8%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40037
99.9%
Hangul 29
 
0.1%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
 
10.3%
3
 
10.3%
3
 
10.3%
3
 
10.3%
2
 
6.9%
2
 
6.9%
2
 
6.9%
1
 
3.4%
1
 
3.4%
1
 
3.4%
Other values (8) 8
27.6%
Common
ValueCountFrequency (%)
0 11916
29.8%
2 8169
20.4%
1 7017
17.5%
9 6224
15.5%
8 2049
 
5.1%
7 1494
 
3.7%
3 892
 
2.2%
6 845
 
2.1%
4 719
 
1.8%
5 652
 
1.6%
Other values (5) 60
 
0.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40037
99.9%
Hangul 29
 
0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11916
29.8%
2 8169
20.4%
1 7017
17.5%
9 6224
15.5%
8 2049
 
5.1%
7 1494
 
3.7%
3 892
 
2.2%
6 845
 
2.1%
4 719
 
1.8%
5 652
 
1.6%
Other values (5) 60
 
0.1%
Hangul
ValueCountFrequency (%)
3
 
10.3%
3
 
10.3%
3
 
10.3%
3
 
10.3%
2
 
6.9%
2
 
6.9%
2
 
6.9%
1
 
3.4%
1
 
3.4%
1
 
3.4%
Other values (8) 8
27.6%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

Interactions

2023-12-12T18:22:52.158781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:22:57.593845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호발행년
번호1.0000.754
발행년0.7541.000

Missing values

2023-12-12T18:22:52.358686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:22:52.506782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T18:22:52.658028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호청구기호서명저작자발행자발행년
1415813-김36ㄴ-2눈물꽃2 / . 2김민기.은행나무,2003
1049710498452-하75ㅋ퀴즈! 과학상식하종준 글, 오수진 그림글송이2007
42064207713-오76ㄱ공부가재미있어지는 교과서속담오주영은하수미디어2012
39503951594-고94ㄱ고형욱의맛있는이야기 /고형욱롱셀러,2001
32023203813.6-김74ㄱ-1길없는 사람들. 1김정현문이당2003
61826183808-호55ㅇ일곱 박공의 집너새니얼 호손민음사2012
1528315284108-세14-4世界大思想全集. . 1-8 /[양지당 편]양지당,1980
74257426325-댄98ㅇ이모셔노믹스댄힐마젤란2011
1340313404337.2-민54ㅆ썅년의 미학민서영 글·그림위즈덤하우스2018
442443326-탭57ㄷ디지털캐피털 /돈 탭스콧 ; ; 데이비드 니콜 [공 지은이]도서출판물푸레,2000
번호청구기호서명저작자발행자발행년
80998100220-원74ㅊ침묵의 깊은 뜻을 마음으로 보게나 /圓淨 지음맑은소리,1997
1194911950911-안23ㅈ조선을 사로잡은꾼들안대회한겨레출판2010
1485614857377.68-서66ㅎ행정논총 .20권 제1호 [OR 4927]서울대학교 행정대학원서울대학교 행정대학원1994
1330813309808-괴833ㅈ젊은 베르테르의 슬픔요한 볼프강 폰 괴테민음사1999
1013110132843-페68ㅇ에덴의악녀페이웰던쿠오레2008
38023803843-톨88ㅂ-4반지의제왕 / . 4, : 두개의탑 하J.R.R.톨킨.황금가지,2001
1143911440813.6-최68ㅁ머저리클럽최인호랜덤하우스2008
41944195813-이68ㅎ한없이낮은숨결 /이인성문학과지성사,1989
1301313014598-미63ㅊ차근차근가치육아미야자키쇼코마고북스2010
1554315544029-최75ㅅ시골에서 책 읽는 즐거움 : 시골에서 책을 고르고·읽고·쓴다는 것최종규 지음스토리닷2017