Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells9035
Missing cells (%)12.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory634.8 KiB
Average record size in memory65.0 B

Variable types

Text6
Numeric1

Dataset

Description국방기술품질원 전자도서관이 보유한 연구도서 현황입니다. 고유서가번호, 볼륨(판), 서명, 저자, 출판사, 출판년도가 포함되어 있습니다.
Author국방기술품질원
URLhttps://www.data.go.kr/data/15052693/fileData.do

Alerts

볼륨 has 8883 (88.8%) missing valuesMissing
출판년 is highly skewed (γ1 = 58.11418899)Skewed
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 00:16:25.772430
Analysis finished2023-12-12 00:16:28.237544
Duration2.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:16:28.554441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4
Mean length4.0236
Min length1

Characters and Unicode

Total characters40236
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st row3770
2nd row11210
3rd row6984
4th row526
5th row7538
ValueCountFrequency (%)
3770 1
 
< 0.1%
2438 1
 
< 0.1%
7788 1
 
< 0.1%
7289 1
 
< 0.1%
7869 1
 
< 0.1%
6512 1
 
< 0.1%
7792 1
 
< 0.1%
10126 1
 
< 0.1%
8659 1
 
< 0.1%
10605 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-12T09:16:29.098638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 5527
13.7%
3 3908
9.7%
2 3904
9.7%
0 3867
9.6%
8 3855
9.6%
6 3850
9.6%
4 3847
9.6%
7 3841
9.5%
9 3821
9.5%
5 3814
9.5%
Other values (2) 2
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 40234
> 99.9%
Close Punctuation 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 5527
13.7%
3 3908
9.7%
2 3904
9.7%
0 3867
9.6%
8 3855
9.6%
6 3850
9.6%
4 3847
9.6%
7 3841
9.5%
9 3821
9.5%
5 3814
9.5%
Close Punctuation
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
" 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40236
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 5527
13.7%
3 3908
9.7%
2 3904
9.7%
0 3867
9.6%
8 3855
9.6%
6 3850
9.6%
4 3847
9.6%
7 3841
9.5%
9 3821
9.5%
5 3814
9.5%
Other values (2) 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40235
> 99.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 5527
13.7%
3 3908
9.7%
2 3904
9.7%
0 3867
9.6%
8 3855
9.6%
6 3850
9.6%
4 3847
9.6%
7 3841
9.5%
9 3821
9.5%
5 3814
9.5%
None
ValueCountFrequency (%)
1
100.0%
Distinct7144
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:16:29.507097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length11.0043
Min length4

Characters and Unicode

Total characters110043
Distinct characters349
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5894 ?
Unique (%)58.9%

Sample

1st rowTA357.5 K5f
2nd rowTV152 부74ㅈ
3rd rowTK7825 D8h
4th rowQA76.15.E48 R3e2
5th rowTK7876 C4h
ValueCountFrequency (%)
ts156 383
 
1.9%
qa76.76 182
 
0.9%
qa76 106
 
0.5%
qa76.25 96
 
0.5%
ts173 96
 
0.5%
한16ㄱ 95
 
0.5%
한16ㅈ 89
 
0.4%
한16ㅍ 84
 
0.4%
ta459 83
 
0.4%
ta409 77
 
0.4%
Other values (7406) 18802
93.6%
2023-12-12T09:16:30.113582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10111
 
9.2%
5 8492
 
7.7%
6 7972
 
7.2%
T 7916
 
7.2%
7 7759
 
7.1%
1 6964
 
6.3%
4 5258
 
4.8%
8 4777
 
4.3%
2 4728
 
4.3%
. 4549
 
4.1%
Other values (339) 41517
37.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 55625
50.5%
Uppercase Letter 25293
23.0%
Other Letter 10656
 
9.7%
Space Separator 10111
 
9.2%
Other Punctuation 4549
 
4.1%
Lowercase Letter 3808
 
3.5%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
835
 
7.8%
727
 
6.8%
718
 
6.7%
708
 
6.6%
704
 
6.6%
700
 
6.6%
687
 
6.4%
421
 
4.0%
402
 
3.8%
268
 
2.5%
Other values (274) 4486
42.1%
Uppercase Letter
ValueCountFrequency (%)
T 7916
31.3%
A 3482
13.8%
Q 2393
 
9.5%
S 2065
 
8.2%
K 1943
 
7.7%
L 991
 
3.9%
P 885
 
3.5%
J 866
 
3.4%
D 728
 
2.9%
C 693
 
2.7%
Other values (16) 3331
13.2%
Lowercase Letter
ValueCountFrequency (%)
a 530
13.9%
c 369
9.7%
m 331
 
8.7%
s 266
 
7.0%
p 263
 
6.9%
e 257
 
6.7%
i 246
 
6.5%
d 236
 
6.2%
h 233
 
6.1%
f 200
 
5.3%
Other values (16) 877
23.0%
Decimal Number
ValueCountFrequency (%)
5 8492
15.3%
6 7972
14.3%
7 7759
13.9%
1 6964
12.5%
4 5258
9.5%
8 4777
8.6%
2 4728
8.5%
3 4264
7.7%
9 3108
 
5.6%
0 2303
 
4.1%
Space Separator
ValueCountFrequency (%)
10111
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4549
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 70286
63.9%
Latin 29101
26.4%
Hangul 10656
 
9.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
835
 
7.8%
727
 
6.8%
718
 
6.7%
708
 
6.6%
704
 
6.6%
700
 
6.6%
687
 
6.4%
421
 
4.0%
402
 
3.8%
268
 
2.5%
Other values (274) 4486
42.1%
Latin
ValueCountFrequency (%)
T 7916
27.2%
A 3482
12.0%
Q 2393
 
8.2%
S 2065
 
7.1%
K 1943
 
6.7%
L 991
 
3.4%
P 885
 
3.0%
J 866
 
3.0%
D 728
 
2.5%
C 693
 
2.4%
Other values (42) 7139
24.5%
Common
ValueCountFrequency (%)
10111
14.4%
5 8492
12.1%
6 7972
11.3%
7 7759
11.0%
1 6964
9.9%
4 5258
7.5%
8 4777
6.8%
2 4728
6.7%
. 4549
6.5%
3 4264
6.1%
Other values (3) 5412
7.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 99387
90.3%
Hangul 5607
 
5.1%
Compat Jamo 5049
 
4.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10111
 
10.2%
5 8492
 
8.5%
6 7972
 
8.0%
T 7916
 
8.0%
7 7759
 
7.8%
1 6964
 
7.0%
4 5258
 
5.3%
8 4777
 
4.8%
2 4728
 
4.8%
. 4549
 
4.6%
Other values (55) 30861
31.1%
Compat Jamo
ValueCountFrequency (%)
835
16.5%
727
14.4%
718
14.2%
708
14.0%
421
8.3%
402
8.0%
246
 
4.9%
201
 
4.0%
182
 
3.6%
173
 
3.4%
Other values (9) 436
8.6%
Hangul
ValueCountFrequency (%)
704
 
12.6%
700
 
12.5%
687
 
12.3%
268
 
4.8%
137
 
2.4%
132
 
2.4%
115
 
2.1%
109
 
1.9%
108
 
1.9%
103
 
1.8%
Other values (255) 2544
45.4%

볼륨
Text

MISSING 

Distinct142
Distinct (%)12.7%
Missing8883
Missing (%)88.8%
Memory size156.2 KiB
2023-12-12T09:16:30.489778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.2470904
Min length1

Characters and Unicode

Total characters3627
Distinct characters61
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)6.5%

Sample

1st rowv.1
2nd rowv.3
3rd rowv.2
4th rowv.3
5th rowv.3
ValueCountFrequency (%)
v.2 207
18.8%
v.1 141
 
12.8%
v.3 107
 
9.7%
v.4 68
 
6.2%
v.5 48
 
4.4%
v.6 40
 
3.6%
v.8 32
 
2.9%
v.7 31
 
2.8%
vol.1 24
 
2.2%
v.9 24
 
2.2%
Other values (115) 381
34.5%
2023-12-12T09:16:31.003493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 987
27.2%
v 947
26.1%
1 335
 
9.2%
2 299
 
8.2%
3 157
 
4.3%
o 109
 
3.0%
l 109
 
3.0%
4 106
 
2.9%
5 78
 
2.2%
6 66
 
1.8%
Other values (51) 434
12.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1203
33.2%
Decimal Number 1203
33.2%
Other Punctuation 987
27.2%
Other Letter 127
 
3.5%
Uppercase Letter 63
 
1.7%
Space Separator 30
 
0.8%
Dash Punctuation 6
 
0.2%
Close Punctuation 4
 
0.1%
Open Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
26.0%
23
18.1%
21
16.5%
6
 
4.7%
6
 
4.7%
4
 
3.1%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
Other values (14) 23
18.1%
Uppercase Letter
ValueCountFrequency (%)
V 48
76.2%
I 3
 
4.8%
A 2
 
3.2%
D 2
 
3.2%
M 1
 
1.6%
R 1
 
1.6%
F 1
 
1.6%
H 1
 
1.6%
P 1
 
1.6%
Z 1
 
1.6%
Other values (2) 2
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
v 947
78.7%
o 109
 
9.1%
l 109
 
9.1%
t 12
 
1.0%
p 10
 
0.8%
d 6
 
0.5%
e 4
 
0.3%
n 3
 
0.2%
h 2
 
0.2%
x 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 335
27.8%
2 299
24.9%
3 157
13.1%
4 106
 
8.8%
5 78
 
6.5%
6 66
 
5.5%
7 51
 
4.2%
8 43
 
3.6%
9 35
 
2.9%
0 33
 
2.7%
Other Punctuation
ValueCountFrequency (%)
. 987
100.0%
Space Separator
ValueCountFrequency (%)
30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Close Punctuation
ValueCountFrequency (%)
] 4
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2234
61.6%
Latin 1266
34.9%
Hangul 110
 
3.0%
Han 17
 
0.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
v 947
74.8%
o 109
 
8.6%
l 109
 
8.6%
V 48
 
3.8%
t 12
 
0.9%
p 10
 
0.8%
d 6
 
0.5%
e 4
 
0.3%
n 3
 
0.2%
I 3
 
0.2%
Other values (12) 15
 
1.2%
Hangul
ValueCountFrequency (%)
33
30.0%
23
20.9%
21
19.1%
6
 
5.5%
4
 
3.6%
3
 
2.7%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
Other values (7) 12
 
10.9%
Common
ValueCountFrequency (%)
. 987
44.2%
1 335
 
15.0%
2 299
 
13.4%
3 157
 
7.0%
4 106
 
4.7%
5 78
 
3.5%
6 66
 
3.0%
7 51
 
2.3%
8 43
 
1.9%
9 35
 
1.6%
Other values (5) 77
 
3.4%
Han
ValueCountFrequency (%)
6
35.3%
3
17.6%
3
17.6%
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3500
96.5%
Hangul 110
 
3.0%
CJK 17
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 987
28.2%
v 947
27.1%
1 335
 
9.6%
2 299
 
8.5%
3 157
 
4.5%
o 109
 
3.1%
l 109
 
3.1%
4 106
 
3.0%
5 78
 
2.2%
6 66
 
1.9%
Other values (27) 307
 
8.8%
Hangul
ValueCountFrequency (%)
33
30.0%
23
20.9%
21
19.1%
6
 
5.5%
4
 
3.6%
3
 
2.7%
2
 
1.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
Other values (7) 12
 
10.9%
CJK
ValueCountFrequency (%)
6
35.3%
3
17.6%
3
17.6%
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%

서명
Text

Distinct7423
Distinct (%)74.2%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T09:16:31.289129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length193
Median length127
Mean length26.002
Min length1

Characters and Unicode

Total characters259994
Distinct characters1304
Distinct categories13 ?
Distinct scripts6 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6179 ?
Unique (%)61.8%

Sample

1st rowFluid mechanics of mixing: modelling, operations and experimental techniques
2nd row전자제어 엔진 고장 탐구
3rd rowA Handbook series on electromagnetic interference and compatibility
4th rowEncyclopedia of computer science and engineering
5th rowHandbook of microwave and optical components-Microwave passive and antenna components
ValueCountFrequency (%)
and 1561
 
3.9%
of 1116
 
2.8%
1063
 
2.6%
handbook 434
 
1.1%
in 425
 
1.1%
for 411
 
1.0%
the 353
 
0.9%
engineering 349
 
0.9%
design 328
 
0.8%
systems 327
 
0.8%
Other values (9304) 33897
84.2%
2023-12-12T09:16:31.811938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30337
 
11.7%
e 16334
 
6.3%
i 14194
 
5.5%
n 14170
 
5.5%
a 12784
 
4.9%
o 12696
 
4.9%
t 11753
 
4.5%
r 10689
 
4.1%
s 10591
 
4.1%
c 8556
 
3.3%
Other values (1294) 117890
45.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 154781
59.5%
Other Letter 52032
 
20.0%
Space Separator 30337
 
11.7%
Uppercase Letter 13475
 
5.2%
Other Punctuation 3123
 
1.2%
Decimal Number 2880
 
1.1%
Open Punctuation 1097
 
0.4%
Close Punctuation 1094
 
0.4%
Dash Punctuation 850
 
0.3%
Math Symbol 311
 
0.1%
Other values (3) 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1327
 
2.6%
967
 
1.9%
858
 
1.6%
814
 
1.6%
765
 
1.5%
755
 
1.5%
689
 
1.3%
659
 
1.3%
656
 
1.3%
634
 
1.2%
Other values (1201) 43908
84.4%
Lowercase Letter
ValueCountFrequency (%)
e 16334
10.6%
i 14194
 
9.2%
n 14170
 
9.2%
a 12784
 
8.3%
o 12696
 
8.2%
t 11753
 
7.6%
r 10689
 
6.9%
s 10591
 
6.8%
c 8556
 
5.5%
l 6889
 
4.5%
Other values (16) 36125
23.3%
Uppercase Letter
ValueCountFrequency (%)
A 1561
11.6%
C 1362
 
10.1%
S 1168
 
8.7%
I 939
 
7.0%
P 894
 
6.6%
T 889
 
6.6%
M 885
 
6.6%
E 746
 
5.5%
D 741
 
5.5%
R 557
 
4.1%
Other values (16) 3733
27.7%
Other Punctuation
ValueCountFrequency (%)
: 824
26.4%
, 784
25.1%
. 496
15.9%
; 374
12.0%
/ 202
 
6.5%
& 197
 
6.3%
' 154
 
4.9%
· 69
 
2.2%
! 12
 
0.4%
# 5
 
0.2%
Other values (4) 6
 
0.2%
Decimal Number
ValueCountFrequency (%)
0 813
28.2%
1 536
18.6%
2 424
14.7%
9 251
 
8.7%
5 208
 
7.2%
3 166
 
5.8%
8 133
 
4.6%
7 125
 
4.3%
4 117
 
4.1%
6 107
 
3.7%
Letter Number
ValueCountFrequency (%)
4
36.4%
3
27.3%
2
18.2%
1
 
9.1%
1
 
9.1%
Math Symbol
ValueCountFrequency (%)
= 201
64.6%
+ 99
31.8%
~ 9
 
2.9%
> 2
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 845
77.0%
[ 252
 
23.0%
Close Punctuation
ValueCountFrequency (%)
) 844
77.1%
] 250
 
22.9%
Space Separator
ValueCountFrequency (%)
30337
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 850
100.0%
Other Symbol
ValueCountFrequency (%)
® 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 168267
64.7%
Hangul 41453
 
15.9%
Common 39695
 
15.3%
Han 10534
 
4.1%
Katakana 33
 
< 0.1%
Hiragana 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1327
 
3.2%
967
 
2.3%
858
 
2.1%
814
 
2.0%
765
 
1.8%
755
 
1.8%
689
 
1.7%
659
 
1.6%
656
 
1.6%
634
 
1.5%
Other values (659) 33329
80.4%
Han
ValueCountFrequency (%)
599
 
5.7%
478
 
4.5%
216
 
2.1%
212
 
2.0%
200
 
1.9%
182
 
1.7%
148
 
1.4%
147
 
1.4%
145
 
1.4%
144
 
1.4%
Other values (517) 8063
76.5%
Latin
ValueCountFrequency (%)
e 16334
 
9.7%
i 14194
 
8.4%
n 14170
 
8.4%
a 12784
 
7.6%
o 12696
 
7.5%
t 11753
 
7.0%
r 10689
 
6.4%
s 10591
 
6.3%
c 8556
 
5.1%
l 6889
 
4.1%
Other values (47) 49611
29.5%
Common
ValueCountFrequency (%)
30337
76.4%
- 850
 
2.1%
( 845
 
2.1%
) 844
 
2.1%
: 824
 
2.1%
0 813
 
2.0%
, 784
 
2.0%
1 536
 
1.4%
. 496
 
1.2%
2 424
 
1.1%
Other values (26) 2942
 
7.4%
Katakana
ValueCountFrequency (%)
8
24.2%
8
24.2%
7
21.2%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
Other values (3) 3
 
9.1%
Hiragana
ValueCountFrequency (%)
10
83.3%
2
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 207879
80.0%
Hangul 41453
 
15.9%
CJK 10283
 
4.0%
CJK Compat Ideographs 251
 
0.1%
None 71
 
< 0.1%
Katakana 33
 
< 0.1%
Hiragana 12
 
< 0.1%
Number Forms 11
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30337
14.6%
e 16334
 
7.9%
i 14194
 
6.8%
n 14170
 
6.8%
a 12784
 
6.1%
o 12696
 
6.1%
t 11753
 
5.7%
r 10689
 
5.1%
s 10591
 
5.1%
c 8556
 
4.1%
Other values (75) 65775
31.6%
Hangul
ValueCountFrequency (%)
1327
 
3.2%
967
 
2.3%
858
 
2.1%
814
 
2.0%
765
 
1.8%
755
 
1.8%
689
 
1.7%
659
 
1.6%
656
 
1.6%
634
 
1.5%
Other values (659) 33329
80.4%
CJK
ValueCountFrequency (%)
599
 
5.8%
478
 
4.6%
216
 
2.1%
212
 
2.1%
200
 
1.9%
182
 
1.8%
148
 
1.4%
147
 
1.4%
145
 
1.4%
144
 
1.4%
Other values (499) 7812
76.0%
CJK Compat Ideographs
ValueCountFrequency (%)
106
42.2%
71
28.3%
46
18.3%
7
 
2.8%
3
 
1.2%
2
 
0.8%
2
 
0.8%
2
 
0.8%
2
 
0.8%
2
 
0.8%
Other values (8) 8
 
3.2%
None
ValueCountFrequency (%)
· 69
97.2%
® 2
 
2.8%
Hiragana
ValueCountFrequency (%)
10
83.3%
2
 
16.7%
Katakana
ValueCountFrequency (%)
8
24.2%
8
24.2%
7
21.2%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
Other values (3) 3
 
9.1%
Number Forms
ValueCountFrequency (%)
4
36.4%
3
27.3%
2
18.2%
1
 
9.1%
1
 
9.1%
Punctuation
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct6421
Distinct (%)64.3%
Missing8
Missing (%)0.1%
Memory size156.2 KiB
2023-12-12T09:16:32.137095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length100
Median length72
Mean length9.4385508
Min length1

Characters and Unicode

Total characters94310
Distinct characters1267
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5036 ?
Unique (%)50.4%

Sample

1st rowKing, R. ed
2nd row富田照義
3rd rowDuff, William G
4th rowRalston, Anthony
5th rowChang, Kai ed
ValueCountFrequency (%)
ed 372
 
1.9%
370
 
1.9%
j 298
 
1.5%
한국공업표준협회 251
 
1.3%
a 238
 
1.2%
r 216
 
1.1%
共著 206
 
1.1%
h 195
 
1.0%
m 192
 
1.0%
w 185
 
1.0%
Other values (7140) 16812
87.0%
2023-12-12T09:16:32.604581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9828
 
10.4%
e 4685
 
5.0%
a 4028
 
4.3%
, 3841
 
4.1%
r 3476
 
3.7%
n 3338
 
3.5%
i 2992
 
3.2%
o 2949
 
3.1%
l 2446
 
2.6%
s 1926
 
2.0%
Other values (1257) 54801
58.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 38401
40.7%
Other Letter 27672
29.3%
Uppercase Letter 12497
 
13.3%
Space Separator 9828
 
10.4%
Other Punctuation 5640
 
6.0%
Dash Punctuation 91
 
0.1%
Decimal Number 82
 
0.1%
Close Punctuation 45
 
< 0.1%
Open Punctuation 45
 
< 0.1%
Letter Number 4
 
< 0.1%
Other values (2) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
772
 
2.8%
670
 
2.4%
624
 
2.3%
596
 
2.2%
594
 
2.1%
530
 
1.9%
496
 
1.8%
475
 
1.7%
429
 
1.6%
411
 
1.5%
Other values (1180) 22075
79.8%
Lowercase Letter
ValueCountFrequency (%)
e 4685
12.2%
a 4028
10.5%
r 3476
 
9.1%
n 3338
 
8.7%
i 2992
 
7.8%
o 2949
 
7.7%
l 2446
 
6.4%
s 1926
 
5.0%
t 1905
 
5.0%
d 1607
 
4.2%
Other values (16) 9049
23.6%
Uppercase Letter
ValueCountFrequency (%)
S 1042
 
8.3%
M 946
 
7.6%
A 918
 
7.3%
R 865
 
6.9%
J 798
 
6.4%
C 727
 
5.8%
H 669
 
5.4%
B 647
 
5.2%
D 630
 
5.0%
E 606
 
4.8%
Other values (16) 4649
37.2%
Other Punctuation
ValueCountFrequency (%)
, 3841
68.1%
. 1743
30.9%
& 23
 
0.4%
; 13
 
0.2%
' 11
 
0.2%
: 4
 
0.1%
/ 2
 
< 0.1%
" 2
 
< 0.1%
· 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 28
34.1%
2 19
23.2%
5 11
 
13.4%
0 8
 
9.8%
3 7
 
8.5%
6 6
 
7.3%
7 1
 
1.2%
4 1
 
1.2%
8 1
 
1.2%
Space Separator
ValueCountFrequency (%)
9828
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 91
100.0%
Close Punctuation
ValueCountFrequency (%)
) 45
100.0%
Open Punctuation
ValueCountFrequency (%)
( 45
100.0%
Letter Number
ValueCountFrequency (%)
4
100.0%
Math Symbol
ValueCountFrequency (%)
| 4
100.0%
Control
ValueCountFrequency (%)
 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 50902
54.0%
Hangul 20192
 
21.4%
Common 15736
 
16.7%
Han 7480
 
7.9%

Most frequent character per script

Han
ValueCountFrequency (%)
624
 
8.3%
393
 
5.3%
293
 
3.9%
242
 
3.2%
227
 
3.0%
224
 
3.0%
178
 
2.4%
121
 
1.6%
113
 
1.5%
113
 
1.5%
Other values (678) 4952
66.2%
Hangul
ValueCountFrequency (%)
772
 
3.8%
670
 
3.3%
596
 
3.0%
594
 
2.9%
530
 
2.6%
496
 
2.5%
475
 
2.4%
429
 
2.1%
411
 
2.0%
383
 
1.9%
Other values (492) 14836
73.5%
Latin
ValueCountFrequency (%)
e 4685
 
9.2%
a 4028
 
7.9%
r 3476
 
6.8%
n 3338
 
6.6%
i 2992
 
5.9%
o 2949
 
5.8%
l 2446
 
4.8%
s 1926
 
3.8%
t 1905
 
3.7%
d 1607
 
3.2%
Other values (43) 21550
42.3%
Common
ValueCountFrequency (%)
9828
62.5%
, 3841
 
24.4%
. 1743
 
11.1%
- 91
 
0.6%
) 45
 
0.3%
( 45
 
0.3%
1 28
 
0.2%
& 23
 
0.1%
2 19
 
0.1%
; 13
 
0.1%
Other values (14) 60
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 66633
70.7%
Hangul 20192
 
21.4%
CJK 7133
 
7.6%
CJK Compat Ideographs 347
 
0.4%
Number Forms 4
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9828
 
14.7%
e 4685
 
7.0%
a 4028
 
6.0%
, 3841
 
5.8%
r 3476
 
5.2%
n 3338
 
5.0%
i 2992
 
4.5%
o 2949
 
4.4%
l 2446
 
3.7%
s 1926
 
2.9%
Other values (65) 27124
40.7%
Hangul
ValueCountFrequency (%)
772
 
3.8%
670
 
3.3%
596
 
3.0%
594
 
2.9%
530
 
2.6%
496
 
2.5%
475
 
2.4%
429
 
2.1%
411
 
2.0%
383
 
1.9%
Other values (492) 14836
73.5%
CJK
ValueCountFrequency (%)
624
 
8.7%
393
 
5.5%
293
 
4.1%
242
 
3.4%
227
 
3.2%
178
 
2.5%
121
 
1.7%
113
 
1.6%
113
 
1.6%
104
 
1.5%
Other values (651) 4725
66.2%
CJK Compat Ideographs
ValueCountFrequency (%)
224
64.6%
13
 
3.7%
13
 
3.7%
12
 
3.5%
11
 
3.2%
10
 
2.9%
10
 
2.9%
9
 
2.6%
9
 
2.6%
7
 
2.0%
Other values (17) 29
 
8.4%
Number Forms
ValueCountFrequency (%)
4
100.0%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct1889
Distinct (%)19.0%
Missing50
Missing (%)0.5%
Memory size156.2 KiB
2023-12-12T09:16:32.966126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length58
Mean length8.2540704
Min length2

Characters and Unicode

Total characters82128
Distinct characters642
Distinct categories11 ?
Distinct scripts5 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1022 ?
Unique (%)10.3%

Sample

1st rowKluwer Academic Pub
2nd row골든벨
3rd rowI.C.T
4th rowVan Nostrand Reinhold Co
5th rowJohn Wiley & Sons
ValueCountFrequency (%)
press 492
 
3.4%
463
 
3.2%
wiley 365
 
2.5%
mcgraw-hill 339
 
2.4%
john 335
 
2.3%
sons 328
 
2.3%
한국공업표준협회 311
 
2.2%
pub 241
 
1.7%
기전연구사 213
 
1.5%
academic 208
 
1.4%
Other values (1788) 11103
77.1%
2023-12-12T09:16:33.481228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 4842
 
5.9%
4448
 
5.4%
r 3452
 
4.2%
i 3313
 
4.0%
l 3292
 
4.0%
n 3252
 
4.0%
a 2732
 
3.3%
s 2539
 
3.1%
o 2492
 
3.0%
2159
 
2.6%
Other values (632) 49607
60.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 37596
45.8%
Other Letter 27019
32.9%
Uppercase Letter 11113
 
13.5%
Space Separator 4448
 
5.4%
Other Punctuation 906
 
1.1%
Dash Punctuation 793
 
1.0%
Open Punctuation 103
 
0.1%
Close Punctuation 100
 
0.1%
Decimal Number 44
 
0.1%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2159
 
8.0%
1005
 
3.7%
935
 
3.5%
935
 
3.5%
770
 
2.8%
765
 
2.8%
610
 
2.3%
570
 
2.1%
528
 
2.0%
502
 
1.9%
Other values (561) 18240
67.5%
Uppercase Letter
ValueCountFrequency (%)
P 1300
11.7%
S 1177
10.6%
A 1062
 
9.6%
M 998
 
9.0%
H 923
 
8.3%
E 705
 
6.3%
C 648
 
5.8%
W 597
 
5.4%
I 554
 
5.0%
G 448
 
4.0%
Other values (15) 2701
24.3%
Lowercase Letter
ValueCountFrequency (%)
e 4842
12.9%
r 3452
9.2%
i 3313
8.8%
l 3292
8.8%
n 3252
8.6%
a 2732
 
7.3%
s 2539
 
6.8%
o 2492
 
6.6%
c 2135
 
5.7%
t 1579
 
4.2%
Other values (14) 7968
21.2%
Other Punctuation
ValueCountFrequency (%)
& 485
53.5%
. 331
36.5%
, 34
 
3.8%
/ 30
 
3.3%
' 16
 
1.8%
4
 
0.4%
; 3
 
0.3%
# 1
 
0.1%
· 1
 
0.1%
: 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 21
47.7%
2 17
38.6%
9 3
 
6.8%
5 1
 
2.3%
8 1
 
2.3%
7 1
 
2.3%
Space Separator
ValueCountFrequency (%)
4448
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 793
100.0%
Open Punctuation
ValueCountFrequency (%)
( 103
100.0%
Close Punctuation
ValueCountFrequency (%)
) 100
100.0%
Math Symbol
ValueCountFrequency (%)
+ 4
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 48709
59.3%
Hangul 24487
29.8%
Common 6398
 
7.8%
Han 2531
 
3.1%
Katakana 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2159
 
8.8%
1005
 
4.1%
935
 
3.8%
935
 
3.8%
770
 
3.1%
765
 
3.1%
610
 
2.5%
570
 
2.3%
528
 
2.2%
502
 
2.1%
Other values (366) 15708
64.1%
Han
ValueCountFrequency (%)
346
 
13.7%
114
 
4.5%
88
 
3.5%
81
 
3.2%
81
 
3.2%
80
 
3.2%
78
 
3.1%
69
 
2.7%
66
 
2.6%
61
 
2.4%
Other values (183) 1467
58.0%
Latin
ValueCountFrequency (%)
e 4842
 
9.9%
r 3452
 
7.1%
i 3313
 
6.8%
l 3292
 
6.8%
n 3252
 
6.7%
a 2732
 
5.6%
s 2539
 
5.2%
o 2492
 
5.1%
c 2135
 
4.4%
t 1579
 
3.2%
Other values (39) 19081
39.2%
Common
ValueCountFrequency (%)
4448
69.5%
- 793
 
12.4%
& 485
 
7.6%
. 331
 
5.2%
( 103
 
1.6%
) 100
 
1.6%
, 34
 
0.5%
/ 30
 
0.5%
1 21
 
0.3%
2 17
 
0.3%
Other values (11) 36
 
0.6%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 55102
67.1%
Hangul 24485
29.8%
CJK 2509
 
3.1%
CJK Compat Ideographs 22
 
< 0.1%
None 7
 
< 0.1%
Katakana 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 4842
 
8.8%
4448
 
8.1%
r 3452
 
6.3%
i 3313
 
6.0%
l 3292
 
6.0%
n 3252
 
5.9%
a 2732
 
5.0%
s 2539
 
4.6%
o 2492
 
4.5%
c 2135
 
3.9%
Other values (58) 22605
41.0%
Hangul
ValueCountFrequency (%)
2159
 
8.8%
1005
 
4.1%
935
 
3.8%
935
 
3.8%
770
 
3.1%
765
 
3.1%
610
 
2.5%
570
 
2.3%
528
 
2.2%
502
 
2.1%
Other values (365) 15706
64.1%
CJK
ValueCountFrequency (%)
346
 
13.8%
114
 
4.5%
88
 
3.5%
81
 
3.2%
81
 
3.2%
80
 
3.2%
78
 
3.1%
69
 
2.8%
66
 
2.6%
61
 
2.4%
Other values (179) 1445
57.6%
CJK Compat Ideographs
ValueCountFrequency (%)
18
81.8%
2
 
9.1%
1
 
4.5%
1
 
4.5%
None
ValueCountFrequency (%)
4
57.1%
2
28.6%
· 1
 
14.3%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

출판년
Real number (ℝ)

SKEWED 

Distinct72
Distinct (%)0.7%
Missing93
Missing (%)0.9%
Infinite0
Infinite (%)0.0%
Mean1992.5308
Minimum194
Maximum9106
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T09:16:33.619992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum194
5-th percentile1977
Q11984
median1991
Q31997
95-th percentile2013
Maximum9106
Range8912
Interquartile range (IQR)13

Descriptive statistics

Standard deviation106.82002
Coefficient of variation (CV)0.053610221
Kurtosis3994.8975
Mean1992.5308
Median Absolute Deviation (MAD)7
Skewness58.114189
Sum19740003
Variance11410.516
MonotonicityNot monotonic
2023-12-12T09:16:33.763159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1993 511
 
5.1%
1991 466
 
4.7%
1982 461
 
4.6%
1992 438
 
4.4%
1990 413
 
4.1%
1994 411
 
4.1%
1989 398
 
4.0%
1981 397
 
4.0%
1988 388
 
3.9%
1987 371
 
3.7%
Other values (62) 5653
56.5%
ValueCountFrequency (%)
194 1
 
< 0.1%
197 1
 
< 0.1%
199 1
 
< 0.1%
995 1
 
< 0.1%
1945 1
 
< 0.1%
1949 1
 
< 0.1%
1950 3
< 0.1%
1951 1
 
< 0.1%
1954 2
< 0.1%
1955 2
< 0.1%
ValueCountFrequency (%)
9106 2
 
< 0.1%
2019 9
 
0.1%
2018 20
 
0.2%
2017 53
 
0.5%
2016 82
0.8%
2015 171
1.7%
2014 148
1.5%
2013 79
0.8%
2012 81
0.8%
2011 86
0.9%

Interactions

2023-12-12T09:16:27.850102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T09:16:27.957585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:16:28.074451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T09:16:28.172969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번서가번호볼륨서명저자출판사출판년
37713770TA357.5 K5f<NA>Fluid mechanics of mixing: modelling, operations and experimental techniquesKing, R. edKluwer Academic Pub1992
1121111210TV152 부74ㅈ<NA>전자제어 엔진 고장 탐구富田照義골든벨2002
69856984TK7825 D8hv.1A Handbook series on electromagnetic interference and compatibilityDuff, William GI.C.T1988
525526QA76.15.E48 R3e2<NA>Encyclopedia of computer science and engineeringRalston, AnthonyVan Nostrand Reinhold Co1983
75397538TK7876 C4hv.3Handbook of microwave and optical components-Microwave passive and antenna componentsChang, Kai edJohn Wiley & Sons1989
38663865TA403.9 조76ㄱ<NA>건축재료학 = Architecture materials조준현기문당2015
570571QA76.25 세66ㅇ<NA>WordStar;입문에서 실제의 사용법까지세운편집실도서출판세운1985
61206119TK2901 C7s<NA>Small batteriesCrompton, T.RJ-W1982
90909089TP1955 강82ㅅ<NA>食肉生産과 加工의 科學강창기 共著선진문화사<NA>
34693470TA169.5 N5f<NA>Failure analysis in engineering applicationsNishida, Shin-ichiButterwoths1992
순번서가번호볼륨서명저자출판사출판년
17401741QA76.95 김65ㅂ<NA>BASIC프로그램과컴퓨터응용김용득희중당1984
32483249T59.I7 한16ㅇv.2ISO 9000한국표준협회 譯한국표준협회1993
39193918TA405 구54ㄱ<NA>各種 形鋼構造設計 및 DATA구성모기전연구사1980
38503849TA403.6 이65ㅈ<NA>材料力學李完益喜重堂1988
44574456TA455.P58 D9e<NA>Engineering polymersDyson, R. W. edBlackie1990
74607459TK7874 E4l<NA>Laser and particle-beam chemical processing for microelectronicsEhrlich, Daniel J. edMaterials Research Society1988
98369835TS156 S3u<NA>Understanding ISO 9000 and implementing the basic to qualityStamatis, D.HMarcel Dekker1995
67196718TK6565.T73 G976m<NA>Microwave transmission-line impedance dataGunston, M. A. RNoble Pub1997
48144813TA656.5 T48a<NA>Application of structural systems reliability theoryMurctsu, YoshisadaSpringer-Verlag1986
82238222TL509 이34ㅎ<NA>항공우주시대 항공력 운용(이론과 실제)이명환오름2010