Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells144
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory556.6 KiB
Average record size in memory57.0 B

Variable types

Numeric1
Text5

Dataset

Description인천광역시 인재개발원 자료실 도서 목록 현황 자료 데이터 입니다.(청구 기호, 서명, 저작자, 발행자, 발행년 등의 항목을 제공합니다.)
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15052866&srcSe=7661IVAWM27C61E190

Alerts

저작자 has 130 (1.3%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2024-01-28 16:46:16.469604
Analysis finished2024-01-28 16:46:19.375532
Duration2.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8113.9921
Minimum2
Maximum16190
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-29T01:46:19.462561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile800.95
Q14057.5
median8153.5
Q312182.75
95-th percentile15355.05
Maximum16190
Range16188
Interquartile range (IQR)8125.25

Descriptive statistics

Standard deviation4671.6129
Coefficient of variation (CV)0.57574778
Kurtosis-1.1968144
Mean8113.9921
Median Absolute Deviation (MAD)4061.5
Skewness-0.013369532
Sum81139921
Variance21823967
MonotonicityNot monotonic
2024-01-29T01:46:19.997507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3513 1
 
< 0.1%
6583 1
 
< 0.1%
13084 1
 
< 0.1%
6707 1
 
< 0.1%
13954 1
 
< 0.1%
10388 1
 
< 0.1%
5763 1
 
< 0.1%
2936 1
 
< 0.1%
4874 1
 
< 0.1%
15815 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
2 1
< 0.1%
3 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
14 1
< 0.1%
ValueCountFrequency (%)
16190 1
< 0.1%
16189 1
< 0.1%
16182 1
< 0.1%
16181 1
< 0.1%
16179 1
< 0.1%
16178 1
< 0.1%
16175 1
< 0.1%
16174 1
< 0.1%
16171 1
< 0.1%
16170 1
< 0.1%
Distinct9487
Distinct (%)94.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-29T01:46:20.299675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length22
Mean length9.5668
Min length1

Characters and Unicode

Total characters95668
Distinct characters522
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9150 ?
Unique (%)91.5%

Sample

1st row567-남16ㅇ
2nd row029-웬24ㅅ
3rd row102-랜27ㅊ
4th row221-한16ㅂ-1
5th row911-김57ㄷ-9:
ValueCountFrequency (%)
408 31
 
0.3%
아동 24
 
0.2%
470.8 14
 
0.1%
13
 
0.1%
608-편78ㅇ 12
 
0.1%
909-정52ㅌ-1 9
 
0.1%
811.33-김79ㅊ 8
 
0.1%
810.9-kㅇ 8
 
0.1%
0정65ㅈ 7
 
0.1%
912-나15ㅁ 7
 
0.1%
Other values (9491) 9928
98.7%
2024-01-29T01:46:20.743863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 12083
12.6%
1 8771
 
9.2%
8 8219
 
8.6%
3 7896
 
8.3%
6 5702
 
6.0%
4 5610
 
5.9%
2 5594
 
5.8%
5 5475
 
5.7%
9 4651
 
4.9%
. 4128
 
4.3%
Other values (512) 27539
28.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 59305
62.0%
Other Letter 19450
 
20.3%
Dash Punctuation 12083
 
12.6%
Other Punctuation 4151
 
4.3%
Math Symbol 295
 
0.3%
Uppercase Letter 282
 
0.3%
Space Separator 61
 
0.1%
Lowercase Letter 30
 
< 0.1%
Close Punctuation 4
 
< 0.1%
Open Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1771
 
9.1%
1348
 
6.9%
1117
 
5.7%
1041
 
5.4%
958
 
4.9%
882
 
4.5%
879
 
4.5%
619
 
3.2%
614
 
3.2%
577
 
3.0%
Other values (452) 9644
49.6%
Uppercase Letter
ValueCountFrequency (%)
W 59
20.9%
K 40
14.2%
B 26
9.2%
A 22
 
7.8%
E 18
 
6.4%
T 13
 
4.6%
S 12
 
4.3%
H 12
 
4.3%
C 11
 
3.9%
M 10
 
3.5%
Other values (11) 59
20.9%
Lowercase Letter
ValueCountFrequency (%)
v 9
30.0%
e 4
13.3%
c 3
 
10.0%
w 2
 
6.7%
l 2
 
6.7%
p 2
 
6.7%
o 2
 
6.7%
s 2
 
6.7%
g 1
 
3.3%
y 1
 
3.3%
Other values (2) 2
 
6.7%
Decimal Number
ValueCountFrequency (%)
1 8771
14.8%
8 8219
13.9%
3 7896
13.3%
6 5702
9.6%
4 5610
9.5%
2 5594
9.4%
5 5475
9.2%
9 4651
7.8%
7 3906
6.6%
0 3481
 
5.9%
Other Punctuation
ValueCountFrequency (%)
. 4128
99.4%
: 15
 
0.4%
, 3
 
0.1%
# 3
 
0.1%
? 2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 293
99.3%
1
 
0.3%
~ 1
 
0.3%
Letter Number
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 3
75.0%
] 1
 
25.0%
Open Punctuation
ValueCountFrequency (%)
( 3
75.0%
[ 1
 
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 12083
100.0%
Space Separator
ValueCountFrequency (%)
61
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 75903
79.3%
Hangul 19445
 
20.3%
Latin 315
 
0.3%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1771
 
9.1%
1348
 
6.9%
1117
 
5.7%
1041
 
5.4%
958
 
4.9%
882
 
4.5%
879
 
4.5%
619
 
3.2%
614
 
3.2%
577
 
3.0%
Other values (449) 9639
49.6%
Latin
ValueCountFrequency (%)
W 59
18.7%
K 40
12.7%
B 26
 
8.3%
A 22
 
7.0%
E 18
 
5.7%
T 13
 
4.1%
S 12
 
3.8%
H 12
 
3.8%
C 11
 
3.5%
M 10
 
3.2%
Other values (26) 92
29.2%
Common
ValueCountFrequency (%)
- 12083
15.9%
1 8771
11.6%
8 8219
10.8%
3 7896
10.4%
6 5702
7.5%
4 5610
7.4%
2 5594
7.4%
5 5475
7.2%
9 4651
 
6.1%
. 4128
 
5.4%
Other values (14) 7774
10.2%
Han
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 76214
79.7%
Hangul 9934
 
10.4%
Compat Jamo 9511
 
9.9%
CJK 5
 
< 0.1%
Number Forms 3
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 12083
15.9%
1 8771
11.5%
8 8219
10.8%
3 7896
10.4%
6 5702
7.5%
4 5610
7.4%
2 5594
7.3%
5 5475
7.2%
9 4651
 
6.1%
. 4128
 
5.4%
Other values (46) 8085
10.6%
Compat Jamo
ValueCountFrequency (%)
1771
18.6%
1348
14.2%
1117
11.7%
882
9.3%
879
9.2%
619
 
6.5%
614
 
6.5%
577
 
6.1%
529
 
5.6%
393
 
4.1%
Other values (9) 782
8.2%
Hangul
ValueCountFrequency (%)
1041
 
10.5%
958
 
9.6%
412
 
4.1%
361
 
3.6%
280
 
2.8%
255
 
2.6%
223
 
2.2%
135
 
1.4%
134
 
1.3%
133
 
1.3%
Other values (430) 6002
60.4%
CJK
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

서명
Text

Distinct9439
Distinct (%)94.4%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-01-29T01:46:21.063638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length130
Median length109
Mean length13.691169
Min length1

Characters and Unicode

Total characters136898
Distinct characters1656
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9185 ?
Unique (%)91.9%

Sample

1st row음향영상설비 /
2nd row새로운시스코인증#640-607형식에맞춘완벽한 /
3rd row철학,누가그것을필요로하는가 /
4th row불교사상의 새로운 발견. . 1 /
5th row(통째로 한국사)두근두근 라이벌 열전. 9:, 붕당 정치부터 세도 정치까지
ValueCountFrequency (%)
6016
 
18.1%
1 573
 
1.7%
2 369
 
1.1%
3 174
 
0.5%
장편소설 92
 
0.3%
4 89
 
0.3%
이야기 84
 
0.3%
5 79
 
0.2%
위한 73
 
0.2%
6 63
 
0.2%
Other values (15716) 25643
77.1%
2024-01-29T01:46:21.602406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23946
 
17.5%
/ 3866
 
2.8%
2984
 
2.2%
. 2416
 
1.8%
2270
 
1.7%
1711
 
1.2%
1 1685
 
1.2%
1673
 
1.2%
1404
 
1.0%
1362
 
1.0%
Other values (1646) 93581
68.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 91927
67.1%
Space Separator 23946
 
17.5%
Other Punctuation 8632
 
6.3%
Decimal Number 5522
 
4.0%
Lowercase Letter 3856
 
2.8%
Uppercase Letter 1175
 
0.9%
Open Punctuation 679
 
0.5%
Close Punctuation 677
 
0.5%
Dash Punctuation 316
 
0.2%
Math Symbol 157
 
0.1%
Other values (4) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2984
 
3.2%
2270
 
2.5%
1711
 
1.9%
1673
 
1.8%
1404
 
1.5%
1362
 
1.5%
1301
 
1.4%
1281
 
1.4%
1229
 
1.3%
1203
 
1.3%
Other values (1544) 75509
82.1%
Lowercase Letter
ValueCountFrequency (%)
e 423
11.0%
o 360
 
9.3%
n 353
 
9.2%
i 315
 
8.2%
a 296
 
7.7%
t 263
 
6.8%
s 239
 
6.2%
r 232
 
6.0%
h 163
 
4.2%
l 161
 
4.2%
Other values (16) 1051
27.3%
Uppercase Letter
ValueCountFrequency (%)
E 94
 
8.0%
S 91
 
7.7%
T 84
 
7.1%
I 82
 
7.0%
A 81
 
6.9%
O 79
 
6.7%
C 68
 
5.8%
W 62
 
5.3%
R 57
 
4.9%
H 55
 
4.7%
Other values (16) 422
35.9%
Other Punctuation
ValueCountFrequency (%)
/ 3866
44.8%
. 2416
28.0%
: 1190
 
13.8%
, 664
 
7.7%
? 183
 
2.1%
! 132
 
1.5%
· 85
 
1.0%
' 39
 
0.5%
& 23
 
0.3%
% 12
 
0.1%
Other values (10) 22
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 1685
30.5%
2 1063
19.3%
0 777
14.1%
3 497
 
9.0%
5 337
 
6.1%
4 336
 
6.1%
9 299
 
5.4%
6 191
 
3.5%
7 186
 
3.4%
8 151
 
2.7%
Math Symbol
ValueCountFrequency (%)
= 131
83.4%
~ 12
 
7.6%
8
 
5.1%
+ 4
 
2.5%
× 2
 
1.3%
Letter Number
ValueCountFrequency (%)
2
28.6%
2
28.6%
2
28.6%
1
14.3%
Open Punctuation
ValueCountFrequency (%)
( 675
99.4%
[ 3
 
0.4%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 673
99.4%
] 3
 
0.4%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
23946
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 316
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 90458
66.1%
Common 39933
29.2%
Latin 5038
 
3.7%
Han 1469
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2984
 
3.3%
2270
 
2.5%
1711
 
1.9%
1673
 
1.8%
1404
 
1.6%
1362
 
1.5%
1301
 
1.4%
1281
 
1.4%
1229
 
1.4%
1203
 
1.3%
Other values (1169) 74040
81.9%
Han
ValueCountFrequency (%)
32
 
2.2%
30
 
2.0%
30
 
2.0%
29
 
2.0%
28
 
1.9%
27
 
1.8%
25
 
1.7%
25
 
1.7%
24
 
1.6%
23
 
1.6%
Other values (365) 1196
81.4%
Latin
ValueCountFrequency (%)
e 423
 
8.4%
o 360
 
7.1%
n 353
 
7.0%
i 315
 
6.3%
a 296
 
5.9%
t 263
 
5.2%
s 239
 
4.7%
r 232
 
4.6%
h 163
 
3.2%
l 161
 
3.2%
Other values (46) 2233
44.3%
Common
ValueCountFrequency (%)
23946
60.0%
/ 3866
 
9.7%
. 2416
 
6.1%
1 1685
 
4.2%
: 1190
 
3.0%
2 1063
 
2.7%
0 777
 
1.9%
( 675
 
1.7%
) 673
 
1.7%
, 664
 
1.7%
Other values (36) 2978
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 90445
66.1%
ASCII 44855
32.8%
CJK 1443
 
1.1%
None 96
 
0.1%
CJK Compat Ideographs 26
 
< 0.1%
Compat Jamo 13
 
< 0.1%
Math Operators 8
 
< 0.1%
Number Forms 7
 
< 0.1%
Punctuation 3
 
< 0.1%
Misc Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23946
53.4%
/ 3866
 
8.6%
. 2416
 
5.4%
1 1685
 
3.8%
: 1190
 
2.7%
2 1063
 
2.4%
0 777
 
1.7%
( 675
 
1.5%
) 673
 
1.5%
, 664
 
1.5%
Other values (75) 7900
 
17.6%
Hangul
ValueCountFrequency (%)
2984
 
3.3%
2270
 
2.5%
1711
 
1.9%
1673
 
1.8%
1404
 
1.6%
1362
 
1.5%
1301
 
1.4%
1281
 
1.4%
1229
 
1.4%
1203
 
1.3%
Other values (1161) 74027
81.8%
None
ValueCountFrequency (%)
· 85
88.5%
3
 
3.1%
2
 
2.1%
× 2
 
2.1%
1
 
1.0%
1
 
1.0%
1
 
1.0%
1
 
1.0%
CJK
ValueCountFrequency (%)
32
 
2.2%
30
 
2.1%
30
 
2.1%
29
 
2.0%
28
 
1.9%
27
 
1.9%
25
 
1.7%
25
 
1.7%
24
 
1.7%
23
 
1.6%
Other values (350) 1170
81.1%
Math Operators
ValueCountFrequency (%)
8
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
4
15.4%
4
15.4%
3
11.5%
3
11.5%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (5) 5
19.2%
Compat Jamo
ValueCountFrequency (%)
4
30.8%
2
15.4%
2
15.4%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Number Forms
ValueCountFrequency (%)
2
28.6%
2
28.6%
2
28.6%
1
14.3%
Misc Symbols
ValueCountFrequency (%)
2
100.0%
Punctuation
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

저작자
Text

MISSING 

Distinct7418
Distinct (%)75.2%
Missing130
Missing (%)1.3%
Memory size156.2 KiB
2024-01-29T01:46:21.919755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length106
Median length79
Mean length7.6088146
Min length1

Characters and Unicode

Total characters75099
Distinct characters1230
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6420 ?
Unique (%)65.0%

Sample

1st row남궁재한
2nd rowwendell odom
3rd row아인 랜드 [지은이]
4th row한국불교사회교육원 엮음
5th row김승현 글 ; 장인호 그림
ValueCountFrequency (%)
1867
 
9.2%
지음 1363
 
6.7%
옮김 524
 
2.6%
그림 378
 
1.9%
313
 
1.5%
299
 
1.5%
지은이 296
 
1.5%
편집부 160
 
0.8%
100
 
0.5%
88
 
0.4%
Other values (9088) 14949
73.5%
2024-01-29T01:46:22.445215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11004
 
14.7%
2344
 
3.1%
2139
 
2.8%
1938
 
2.6%
; 1871
 
2.5%
1531
 
2.0%
. 1227
 
1.6%
1003
 
1.3%
916
 
1.2%
854
 
1.1%
Other values (1220) 50272
66.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 57167
76.1%
Space Separator 11004
 
14.7%
Other Punctuation 4222
 
5.6%
Lowercase Letter 692
 
0.9%
Uppercase Letter 658
 
0.9%
Close Punctuation 635
 
0.8%
Open Punctuation 635
 
0.8%
Decimal Number 60
 
0.1%
Math Symbol 18
 
< 0.1%
Dash Punctuation 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2344
 
4.1%
2139
 
3.7%
1938
 
3.4%
1531
 
2.7%
1003
 
1.8%
916
 
1.6%
854
 
1.5%
747
 
1.3%
654
 
1.1%
583
 
1.0%
Other values (1141) 44458
77.8%
Uppercase Letter
ValueCountFrequency (%)
K 76
11.6%
B 74
11.2%
S 66
 
10.0%
C 55
 
8.4%
J 47
 
7.1%
R 42
 
6.4%
A 39
 
5.9%
E 38
 
5.8%
M 32
 
4.9%
H 24
 
3.6%
Other values (14) 165
25.1%
Lowercase Letter
ValueCountFrequency (%)
a 72
 
10.4%
o 70
 
10.1%
e 64
 
9.2%
i 59
 
8.5%
n 55
 
7.9%
r 47
 
6.8%
s 45
 
6.5%
u 34
 
4.9%
t 30
 
4.3%
l 30
 
4.3%
Other values (14) 186
26.9%
Other Punctuation
ValueCountFrequency (%)
; 1871
44.3%
. 1227
29.1%
, 772
18.3%
: 239
 
5.7%
· 51
 
1.2%
/ 23
 
0.5%
19
 
0.5%
& 15
 
0.4%
! 2
 
< 0.1%
' 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 17
28.3%
2 11
18.3%
0 8
13.3%
6 4
 
6.7%
9 4
 
6.7%
8 4
 
6.7%
3 4
 
6.7%
5 3
 
5.0%
7 3
 
5.0%
4 2
 
3.3%
Close Punctuation
ValueCountFrequency (%)
] 629
99.1%
) 5
 
0.8%
1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
[ 629
99.1%
( 5
 
0.8%
1
 
0.2%
Math Symbol
ValueCountFrequency (%)
< 9
50.0%
> 9
50.0%
Space Separator
ValueCountFrequency (%)
11004
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56018
74.6%
Common 16582
 
22.1%
Latin 1350
 
1.8%
Han 1149
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2344
 
4.2%
2139
 
3.8%
1938
 
3.5%
1531
 
2.7%
1003
 
1.8%
916
 
1.6%
854
 
1.5%
747
 
1.3%
654
 
1.2%
583
 
1.0%
Other values (816) 43309
77.3%
Han
ValueCountFrequency (%)
118
 
10.3%
88
 
7.7%
30
 
2.6%
28
 
2.4%
27
 
2.3%
22
 
1.9%
19
 
1.7%
18
 
1.6%
17
 
1.5%
17
 
1.5%
Other values (315) 765
66.6%
Latin
ValueCountFrequency (%)
K 76
 
5.6%
B 74
 
5.5%
a 72
 
5.3%
o 70
 
5.2%
S 66
 
4.9%
e 64
 
4.7%
i 59
 
4.4%
C 55
 
4.1%
n 55
 
4.1%
J 47
 
3.5%
Other values (38) 712
52.7%
Common
ValueCountFrequency (%)
11004
66.4%
; 1871
 
11.3%
. 1227
 
7.4%
, 772
 
4.7%
] 629
 
3.8%
[ 629
 
3.8%
: 239
 
1.4%
· 51
 
0.3%
/ 23
 
0.1%
19
 
0.1%
Other values (21) 118
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56016
74.6%
ASCII 17860
 
23.8%
CJK 1103
 
1.5%
None 72
 
0.1%
CJK Compat Ideographs 46
 
0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11004
61.6%
; 1871
 
10.5%
. 1227
 
6.9%
, 772
 
4.3%
] 629
 
3.5%
[ 629
 
3.5%
: 239
 
1.3%
K 76
 
0.4%
B 74
 
0.4%
a 72
 
0.4%
Other values (65) 1267
 
7.1%
Hangul
ValueCountFrequency (%)
2344
 
4.2%
2139
 
3.8%
1938
 
3.5%
1531
 
2.7%
1003
 
1.8%
916
 
1.6%
854
 
1.5%
747
 
1.3%
654
 
1.2%
583
 
1.0%
Other values (815) 43307
77.3%
CJK
ValueCountFrequency (%)
118
 
10.7%
88
 
8.0%
28
 
2.5%
27
 
2.4%
22
 
2.0%
19
 
1.7%
18
 
1.6%
17
 
1.5%
17
 
1.5%
15
 
1.4%
Other values (302) 734
66.5%
None
ValueCountFrequency (%)
· 51
70.8%
19
 
26.4%
1
 
1.4%
1
 
1.4%
CJK Compat Ideographs
ValueCountFrequency (%)
30
65.2%
3
 
6.5%
2
 
4.3%
2
 
4.3%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (3) 3
 
6.5%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Distinct3270
Distinct (%)32.7%
Missing6
Missing (%)0.1%
Memory size156.2 KiB
2024-01-29T01:46:22.762405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length25
Mean length4.8294977
Min length1

Characters and Unicode

Total characters48266
Distinct characters870
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1983 ?
Unique (%)19.8%

Sample

1st row기전연구사,
2nd rowcisco press,
3rd row자유기업센터,
4th row정토,
5th row휘슬러
ValueCountFrequency (%)
민음사 184
 
1.8%
문학동네 168
 
1.6%
김영사 146
 
1.4%
21세기북스 99
 
1.0%
위즈덤하우스 88
 
0.9%
박영사 82
 
0.8%
시공사 75
 
0.7%
웅진출판사 72
 
0.7%
예림당 67
 
0.7%
문학과지성사 67
 
0.7%
Other values (2917) 9156
89.7%
2024-01-29T01:46:23.241365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 4066
 
8.4%
2877
 
6.0%
1281
 
2.7%
1230
 
2.5%
1224
 
2.5%
1220
 
2.5%
809
 
1.7%
704
 
1.5%
687
 
1.4%
661
 
1.4%
Other values (860) 33507
69.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42612
88.3%
Other Punctuation 4178
 
8.7%
Lowercase Letter 460
 
1.0%
Uppercase Letter 338
 
0.7%
Decimal Number 318
 
0.7%
Space Separator 211
 
0.4%
Open Punctuation 73
 
0.2%
Close Punctuation 71
 
0.1%
Dash Punctuation 4
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2877
 
6.8%
1281
 
3.0%
1230
 
2.9%
1224
 
2.9%
1220
 
2.9%
809
 
1.9%
704
 
1.7%
687
 
1.6%
661
 
1.6%
544
 
1.3%
Other values (787) 31375
73.6%
Uppercase Letter
ValueCountFrequency (%)
B 78
23.1%
M 51
15.1%
K 32
9.5%
S 20
 
5.9%
C 19
 
5.6%
R 17
 
5.0%
I 14
 
4.1%
N 13
 
3.8%
P 13
 
3.8%
H 12
 
3.6%
Other values (14) 69
20.4%
Lowercase Letter
ValueCountFrequency (%)
o 68
14.8%
e 52
11.3%
s 38
 
8.3%
i 37
 
8.0%
r 35
 
7.6%
k 28
 
6.1%
a 28
 
6.1%
b 25
 
5.4%
n 25
 
5.4%
t 18
 
3.9%
Other values (14) 106
23.0%
Decimal Number
ValueCountFrequency (%)
1 125
39.3%
2 125
39.3%
8 14
 
4.4%
4 13
 
4.1%
0 10
 
3.1%
3 8
 
2.5%
9 7
 
2.2%
6 7
 
2.2%
5 5
 
1.6%
7 4
 
1.3%
Other Punctuation
ValueCountFrequency (%)
, 4066
97.3%
: 54
 
1.3%
& 36
 
0.9%
. 12
 
0.3%
5
 
0.1%
/ 2
 
< 0.1%
· 2
 
< 0.1%
; 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 71
97.3%
[ 2
 
2.7%
Close Punctuation
ValueCountFrequency (%)
) 68
95.8%
] 3
 
4.2%
Space Separator
ValueCountFrequency (%)
211
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41719
86.4%
Common 4856
 
10.1%
Han 893
 
1.9%
Latin 798
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2877
 
6.9%
1281
 
3.1%
1230
 
2.9%
1224
 
2.9%
1220
 
2.9%
809
 
1.9%
704
 
1.7%
687
 
1.6%
661
 
1.6%
544
 
1.3%
Other values (611) 30482
73.1%
Han
ValueCountFrequency (%)
129
 
14.4%
45
 
5.0%
45
 
5.0%
34
 
3.8%
27
 
3.0%
20
 
2.2%
19
 
2.1%
18
 
2.0%
17
 
1.9%
16
 
1.8%
Other values (166) 523
58.6%
Latin
ValueCountFrequency (%)
B 78
 
9.8%
o 68
 
8.5%
e 52
 
6.5%
M 51
 
6.4%
s 38
 
4.8%
i 37
 
4.6%
r 35
 
4.4%
K 32
 
4.0%
k 28
 
3.5%
a 28
 
3.5%
Other values (38) 351
44.0%
Common
ValueCountFrequency (%)
, 4066
83.7%
211
 
4.3%
1 125
 
2.6%
2 125
 
2.6%
( 71
 
1.5%
) 68
 
1.4%
: 54
 
1.1%
& 36
 
0.7%
8 14
 
0.3%
4 13
 
0.3%
Other values (15) 73
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41718
86.4%
ASCII 5647
 
11.7%
CJK 889
 
1.8%
None 7
 
< 0.1%
CJK Compat Ideographs 4
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 4066
72.0%
211
 
3.7%
1 125
 
2.2%
2 125
 
2.2%
B 78
 
1.4%
( 71
 
1.3%
o 68
 
1.2%
) 68
 
1.2%
: 54
 
1.0%
e 52
 
0.9%
Other values (61) 729
 
12.9%
Hangul
ValueCountFrequency (%)
2877
 
6.9%
1281
 
3.1%
1230
 
2.9%
1224
 
2.9%
1220
 
2.9%
809
 
1.9%
704
 
1.7%
687
 
1.6%
661
 
1.6%
544
 
1.3%
Other values (610) 30481
73.1%
CJK
ValueCountFrequency (%)
129
 
14.5%
45
 
5.1%
45
 
5.1%
34
 
3.8%
27
 
3.0%
20
 
2.2%
19
 
2.1%
18
 
2.0%
17
 
1.9%
16
 
1.8%
Other values (163) 519
58.4%
None
ValueCountFrequency (%)
5
71.4%
· 2
 
28.6%
CJK Compat Ideographs
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct90
Distinct (%)0.9%
Missing7
Missing (%)0.1%
Memory size156.2 KiB
2024-01-29T01:46:23.502247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length4
Mean length4.0159111
Min length3

Characters and Unicode

Total characters40131
Distinct characters29
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)0.3%

Sample

1st row1989
2nd row2002
3rd row1998
4th row1989
5th row2008
ValueCountFrequency (%)
2007 746
 
7.5%
2008 683
 
6.8%
2011 522
 
5.2%
2012 500
 
5.0%
2010 489
 
4.9%
2001 453
 
4.5%
1997 420
 
4.2%
2009 399
 
4.0%
2013 378
 
3.8%
2000 374
 
3.7%
Other values (68) 5029
50.3%
2024-01-29T01:46:23.866477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 11948
29.8%
2 8159
20.3%
1 7023
17.5%
9 6160
15.3%
8 2079
 
5.2%
7 1536
 
3.8%
3 891
 
2.2%
6 859
 
2.1%
4 706
 
1.8%
5 656
 
1.6%
Other values (19) 114
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 40017
99.7%
Open Punctuation 34
 
0.1%
Close Punctuation 34
 
0.1%
Other Letter 34
 
0.1%
Dash Punctuation 10
 
< 0.1%
Other Punctuation 1
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
11.8%
4
11.8%
4
11.8%
4
11.8%
4
11.8%
4
11.8%
3
8.8%
3
8.8%
1
 
2.9%
1
 
2.9%
Other values (2) 2
5.9%
Decimal Number
ValueCountFrequency (%)
0 11948
29.9%
2 8159
20.4%
1 7023
17.6%
9 6160
15.4%
8 2079
 
5.2%
7 1536
 
3.8%
3 891
 
2.2%
6 859
 
2.1%
4 706
 
1.8%
5 656
 
1.6%
Open Punctuation
ValueCountFrequency (%)
[ 25
73.5%
( 9
 
26.5%
Close Punctuation
ValueCountFrequency (%)
] 25
73.5%
) 9
 
26.5%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40096
99.9%
Hangul 32
 
0.1%
Han 2
 
< 0.1%
Latin 1
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 11948
29.8%
2 8159
20.3%
1 7023
17.5%
9 6160
15.4%
8 2079
 
5.2%
7 1536
 
3.8%
3 891
 
2.2%
6 859
 
2.1%
4 706
 
1.8%
5 656
 
1.6%
Other values (6) 79
 
0.2%
Hangul
ValueCountFrequency (%)
4
12.5%
4
12.5%
4
12.5%
4
12.5%
4
12.5%
4
12.5%
3
9.4%
3
9.4%
1
 
3.1%
1
 
3.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%
Latin
ValueCountFrequency (%)
c 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40097
99.9%
Hangul 32
 
0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11948
29.8%
2 8159
20.3%
1 7023
17.5%
9 6160
15.4%
8 2079
 
5.2%
7 1536
 
3.8%
3 891
 
2.2%
6 859
 
2.1%
4 706
 
1.8%
5 656
 
1.6%
Other values (7) 80
 
0.2%
Hangul
ValueCountFrequency (%)
4
12.5%
4
12.5%
4
12.5%
4
12.5%
4
12.5%
4
12.5%
3
9.4%
3
9.4%
1
 
3.1%
1
 
3.1%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

Interactions

2024-01-29T01:46:18.892512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-29T01:46:23.969288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호발행년
번호1.0000.771
발행년0.7711.000

Missing values

2024-01-29T01:46:19.038929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-29T01:46:19.161717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-29T01:46:19.305141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호청구기호서명저작자발행자발행년
35123513567-남16ㅇ음향영상설비 /남궁재한기전연구사,1989
74837484029-웬24ㅅ새로운시스코인증#640-607형식에맞춘완벽한 /wendell odomcisco press,2002
565566102-랜27ㅊ철학,누가그것을필요로하는가 /아인 랜드 [지은이]자유기업센터,1998
83498350221-한16ㅂ-1불교사상의 새로운 발견. . 1 /한국불교사회교육원 엮음정토,1989
39983999911-김57ㄷ-9:(통째로 한국사)두근두근 라이벌 열전. 9:, 붕당 정치부터 세도 정치까지김승현 글 ; 장인호 그림휘슬러2008
1273112732358.4-민72ㅅ생각을 경영하라민재형청림출판2014
36643665808-야32ㄷ-12대망 /야마오까소하찌중앙출판사,1997
85608561913-마29ㅂ번역과 일본의 근대 /마루야마 마사오 ; ; 가토 슈이치 [공]지음 ; ; 임성모 옮김이산,2000
1486514866912.03-사32ㅅ-2:사기. 2:, 진실로 용기있는 자는 가볍게 죽지 않는다사마천 지음 ; 김진연 편역서해문집2007
24832484189.1-김52ㅁ마음을비우면 얻어지는것들김상운21세기북스2012
번호청구기호서명저작자발행자발행년
90359036375.1-조66ㄸ따르릉 따르릉조우영사계절2007
1488514886340.911-통68ㅌ통일교육원 40년사통일교육원 교육총괄과 [편]통일교육원 교육총괄과2012
49414942823-마69ㅅ신더마리사 마이어더난콘텐츠그룹2013
22972298911-장69ㅎ-1한국전쟁. 1 : 불길한징조들 /장문평,이동식,김재구.아이템뱅크,1980
27082709594.5-하32ㅃ빨간토마토레시피57하마우치치나미아르고나인2011
1059310594813.6-허64ㅅ-12사랑해. 12허영만김영사2007
996997811-강67ㅂ붉은강 /강은교풀빛,1984
1511015111859.7-요192ㅊ(큰글자)창문 넘어 도망친 100세 노인 : 요나스 요나손 장편소설요나스 요나손 지음 ; 임호경 옮김열린책들2017
66396640380-최82ㅎ한국의풍수사상 /최창조민음사,1986
1360013601808.3-임82ㄱ귀신이 들려주는 세계공포괴담(미국)임창호, 정현희(주)재미북스2013