Overview

Dataset statistics

Number of variables: 7
Number of observations: 6560
Missing cells: 333
Missing cells (%): 0.7%
Duplicate rows: 350
Duplicate rows (%): 5.3%
Total size in memory: 365.3 KiB
Average record size in memory: 57.0 B
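
The percentages and the average record size follow from the raw counts. A minimal Python sketch of that arithmetic, using only figures reported in this overview; the variable names are illustrative, and rounding to one decimal place is an assumption about how the report formats its values:

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 7
n_observations = 6560
missing_cells = 333
duplicate_rows = 350
total_memory_kib = 365.3

# The total number of cells is rows times columns.
total_cells = n_variables * n_observations

missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_bytes = total_memory_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Note that the missing-cell percentage is taken over all 45920 cells, not over the 6560 rows, which is why 333 missing cells amount to only 0.7%.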

Variable types

Text: 3
Categorical: 2
Numeric: 1
DateTime: 1

Dataset

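Description: Provides information on the books held by 한국산업기술기획평가원: title (도서명), book type (도서구분(명)), author (저자), publisher (출판사), publication year (발행년도), registration date (도서등록일자), and location (도서위치).
Author: 한국산업기술기획평가원
URL: https://www.data.go.kr/data/15039776/fileData.do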

Alerts

도서구분(명) has constant value "단행본" (Constant)
Dataset has 350 (5.3%) duplicate rows (Duplicates)
도서위치 is highly imbalanced (68.2%) (Imbalance)
저자 has 285 (4.3%) missing values (Missing)
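
The 68.2% imbalance figure for 도서위치 can be reproduced from the category counts listed for that variable later in the report. The sketch below computes one minus the normalized Shannon entropy of the counts. Whether ydata-profiling computes the score exactly this way is an assumption here, and the function name `imbalance_score` is illustrative:

```python
import math

# Category counts for 도서위치, copied from its "Common Values" table.
counts = {"대구본원": 5733, "북카페": 674, "대전분원": 151, "자료실": 2}

def imbalance_score(class_counts):
    """Return 1 - H / log(k): 0 for a uniform split, near 1 when one class dominates."""
    total = sum(class_counts)
    k = len(class_counts)
    if k < 2:
        return 0.0  # a single class has no distribution to compare against
    entropy = -sum((c / total) * math.log(c / total) for c in class_counts if c > 0)
    return 1 - entropy / math.log(k)

score = imbalance_score(list(counts.values()))
print(f"{score:.1%}")
```

Under this definition a perfectly balanced variable scores 0%, so 68.2% reflects that a single location (대구본원) holds 87.4% of the rows.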

Reproduction

Analysis started: 2023-12-11 23:02:22.206769
Analysis finished: 2023-12-11 23:02:24.031815
Duration: 1.83 seconds
Software version: ydata-profiling v4.5.1
Configuration: config.json

Variables

도서명
Text

Distinct: 5866
Distinct (%): 89.4%
Missing: 0
Missing (%): 0.0%
Memory size: 51.4 KiB

Length

Max length: 155
Median length: 122
Mean length: 23.452287
Min length: 1

Characters and Unicode

Total characters: 153847
Distinct characters: 1210
Distinct categories: 15
Distinct scripts: 6
Distinct blocks: 10
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
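
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. This is a minimal sketch, not the report's own implementation; the two strings are titles from this dataset's sample rows, and the helper name `category_counts` is illustrative:

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# Two titles from the sample rows of this dataset.
for title in ["법전,1993", "SPSS/PC+ II"]:
    print(title, dict(category_counts(title)))
```

The two-letter codes map onto the category names used in this report: `Lo` is Other Letter (Hangul syllables fall here), `Lu` is Uppercase Letter, `Nd` is Decimal Number, `Po` is Other Punctuation, `Sm` is Math Symbol, and `Zs` is Space Separator.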

Unique

Unique: 5253
Unique (%): 80.1%
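
The report gives both a Distinct count (5866) and a smaller Unique count (5253). A reading consistent with those numbers is that Distinct counts the different values present, while Unique counts only the values that occur exactly once. That interpretation is an assumption, sketched below on five titles from the dataset's first rows:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values occurring exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

# Titles of rows 0, 1, 2, 5 and 6 in the Sample table; the last two are identical.
titles = ["법전,1993", "SPSS/PC+ II", "SPSS/PC+ I", "일본어한자읽기사전", "일본어한자읽기사전"]
distinct, unique = distinct_and_unique(titles)
print(distinct, unique)
```

Here the repeated title still counts once toward Distinct but not at all toward Unique.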

Sample

1st row: 법전,1993
2nd row: SPSS/PC+ II
3rd row: SPSS/PC+ I
4th row: 90년대의 전자산업비젼
5th row: 소법전,1993
Value | Count | Frequency (%)
(not recoverable) | 1021 | 3.5%
(not recoverable) | 276 | 1.0%
위한 | 245 | 0.8%
and | 214 | 0.7%
the | 192 | 0.7%
of | 178 | 0.6%
연구 | 140 | 0.5%
관한 | 104 | 0.4%
technology | 102 | 0.4%
전략 | 90 | 0.3%
Other values (11888) | 26478 | 91.2%

Most occurring characters

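Value | Count | Frequency (%)
(space) | 22600 | 14.7%
0 | 3557 | 2.3%
(not recoverable) | 2976 | 1.9%
2 | 2490 | 1.6%
e | 2486 | 1.6%
n | 2108 | 1.4%
o | 1944 | 1.3%
(not recoverable) | 1939 | 1.3%
a | 1779 | 1.2%
(not recoverable) | 1766 | 1.1%
Other values (1200) | 110202 | 71.6%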

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 80974 | 52.6%
Space Separator | 22600 | 14.7%
Lowercase Letter | 21024 | 13.7%
Uppercase Letter | 10868 | 7.1%
Decimal Number | 10232 | 6.7%
Other Punctuation | 3522 | 2.3%
Open Punctuation | 1667 | 1.1%
Close Punctuation | 1664 | 1.1%
Dash Punctuation | 896 | 0.6%
Math Symbol | 279 | 0.2%
Other values (5) | 121 | 0.1%

Most frequent character per category

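Other Letter
Value | Count | Frequency (%)
(not recoverable) | 2976 | 3.7%
(not recoverable) | 1939 | 2.4%
(not recoverable) | 1766 | 2.2%
(not recoverable) | 1717 | 2.1%
(not recoverable) | 1411 | 1.7%
(not recoverable) | 1314 | 1.6%
(not recoverable) | 1257 | 1.6%
(not recoverable) | 1235 | 1.5%
(not recoverable) | 1133 | 1.4%
(not recoverable) | 1111 | 1.4%
Other values (1094) | 65115 | 80.4%

Lowercase Letter
Value | Count | Frequency (%)
e | 2486 | 11.8%
n | 2108 | 10.0%
o | 1944 | 9.2%
a | 1779 | 8.5%
i | 1704 | 8.1%
t | 1611 | 7.7%
r | 1410 | 6.7%
s | 1218 | 5.8%
c | 924 | 4.4%
l | 868 | 4.1%
Other values (16) | 4972 | 23.6%

Uppercase Letter
Value | Count | Frequency (%)
I | 1034 | 9.5%
T | 903 | 8.3%
E | 883 | 8.1%
S | 797 | 7.3%
A | 790 | 7.3%
O | 729 | 6.7%
C | 691 | 6.4%
R | 680 | 6.3%
N | 614 | 5.6%
D | 494 | 4.5%
Other values (16) | 3253 | 29.9%

Other Punctuation
Value | Count | Frequency (%)
. | 1124 | 31.9%
: | 991 | 28.1%
, | 518 | 14.7%
; | 263 | 7.5%
/ | 257 | 7.3%
& | 142 | 4.0%
' | 69 | 2.0%
· | 47 | 1.3%
? | 38 | 1.1%
! | 33 | 0.9%
Other values (5) | 40 | 1.1%

Decimal Number
Value | Count | Frequency (%)
0 | 3557 | 34.8%
2 | 2490 | 24.3%
1 | 1446 | 14.1%
9 | 588 | 5.7%
3 | 530 | 5.2%
4 | 402 | 3.9%
5 | 397 | 3.9%
6 | 306 | 3.0%
7 | 267 | 2.6%
8 | 249 | 2.4%

Letter Number
Value | Count | Frequency (%)
(not recoverable) | 29 | 28.4%
(not recoverable) | 26 | 25.5%
(not recoverable) | 15 | 14.7%
(not recoverable) | 11 | 10.8%
(not recoverable) | 6 | 5.9%
(not recoverable) | 4 | 3.9%
(not recoverable) | 3 | 2.9%
(not recoverable) | 3 | 2.9%
(not recoverable) | 3 | 2.9%
(not recoverable) | 2 | 2.0%

Math Symbol
Value | Count | Frequency (%)
= | 172 | 61.6%
~ | 75 | 26.9%
+ | 21 | 7.5%
< | 4 | 1.4%
> | 4 | 1.4%
"|" (vertical bar) | 3 | 1.1%

Open Punctuation
Value | Count | Frequency (%)
( | 1604 | 96.2%
[ | 60 | 3.6%
(not recoverable) | 3 | 0.2%

Close Punctuation
Value | Count | Frequency (%)
) | 1602 | 96.3%
] | 59 | 3.5%
(not recoverable) | 3 | 0.2%

Dash Punctuation
Value | Count | Frequency (%)
- | 895 | 99.9%
(not recoverable) | 1 | 0.1%

Space Separator
Value | Count | Frequency (%)
(space) | 22600 | 100.0%

Modifier Symbol
Value | Count | Frequency (%)
` | 11 | 100.0%

Final Punctuation
Value | Count | Frequency (%)
(not recoverable) | 4 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 3 | 100.0%

Other Number
Value | Count | Frequency (%)
(not recoverable) | 1 | 100.0%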

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 80581 | 52.4%
Common | 40879 | 26.6%
Latin | 31994 | 20.8%
Han | 338 | 0.2%
Hiragana | 41 | < 0.1%
Katakana | 14 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2976
 
3.7%
1939
 
2.4%
1766
 
2.2%
1717
 
2.1%
1411
 
1.8%
1314
 
1.6%
1257
 
1.6%
1235
 
1.5%
1133
 
1.4%
1111
 
1.4%
Other values (893) 64722
80.3%
Han
ValueCountFrequency (%)
14
 
4.1%
13
 
3.8%
13
 
3.8%
11
 
3.3%
11
 
3.3%
10
 
3.0%
9
 
2.7%
7
 
2.1%
6
 
1.8%
5
 
1.5%
Other values (155) 239
70.7%
Latin
ValueCountFrequency (%)
e 2486
 
7.8%
n 2108
 
6.6%
o 1944
 
6.1%
a 1779
 
5.6%
i 1704
 
5.3%
t 1611
 
5.0%
r 1410
 
4.4%
s 1218
 
3.8%
I 1034
 
3.2%
c 924
 
2.9%
Other values (52) 15776
49.3%
Common
ValueCountFrequency (%)
22600
55.3%
0 3557
 
8.7%
2 2490
 
6.1%
( 1604
 
3.9%
) 1602
 
3.9%
1 1446
 
3.5%
. 1124
 
2.7%
: 991
 
2.4%
- 895
 
2.2%
9 588
 
1.4%
Other values (34) 3982
 
9.7%
Hiragana
ValueCountFrequency (%)
6
14.6%
4
 
9.8%
3
 
7.3%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
Other values (14) 14
34.1%
Katakana
ValueCountFrequency (%)
2
14.3%
2
14.3%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
Other values (2) 2
14.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 80514 | 52.3%
ASCII | 72707 | 47.3%
CJK | 332 | 0.2%
Number Forms | 102 | 0.1%
Compat Jamo | 67 | < 0.1%
None | 60 | < 0.1%
Hiragana | 41 | < 0.1%
Katakana | 14 | < 0.1%
CJK Compat Ideographs | 6 | < 0.1%
Punctuation | 4 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
22600
31.1%
0 3557
 
4.9%
2 2490
 
3.4%
e 2486
 
3.4%
n 2108
 
2.9%
o 1944
 
2.7%
a 1779
 
2.4%
i 1704
 
2.3%
t 1611
 
2.2%
( 1604
 
2.2%
Other values (79) 30824
42.4%
Hangul
ValueCountFrequency (%)
2976
 
3.7%
1939
 
2.4%
1766
 
2.2%
1717
 
2.1%
1411
 
1.8%
1314
 
1.6%
1257
 
1.6%
1235
 
1.5%
1133
 
1.4%
1111
 
1.4%
Other values (892) 64655
80.3%
Compat Jamo
ValueCountFrequency (%)
67
100.0%
None
ValueCountFrequency (%)
· 47
78.3%
5
 
8.3%
3
 
5.0%
3
 
5.0%
1
 
1.7%
1
 
1.7%
Number Forms
ValueCountFrequency (%)
29
28.4%
26
25.5%
15
14.7%
11
 
10.8%
6
 
5.9%
4
 
3.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
2
 
2.0%
CJK
ValueCountFrequency (%)
14
 
4.2%
13
 
3.9%
13
 
3.9%
11
 
3.3%
11
 
3.3%
10
 
3.0%
9
 
2.7%
7
 
2.1%
6
 
1.8%
5
 
1.5%
Other values (152) 233
70.2%
Hiragana
ValueCountFrequency (%)
6
14.6%
4
 
9.8%
3
 
7.3%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
Other values (14) 14
34.1%
Punctuation
ValueCountFrequency (%)
4
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Katakana
ValueCountFrequency (%)
2
14.3%
2
14.3%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
1
7.1%
Other values (2) 2
14.3%

도서구분(명)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 51.4 KiB

Value | Count
단행본 | 6560

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 단행본
2nd row: 단행본
3rd row: 단행본
4th row: 단행본
5th row: 단행본

Common Values

Value | Count | Frequency (%)
단행본 | 6560 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; 단행본 accounts for 6560 rows (100.0%)]

저자
Text

MISSING 

Distinct: 3748
Distinct (%): 59.7%
Missing: 285
Missing (%): 4.3%
Memory size: 51.4 KiB

Length

Max length: 77
Median length: 69
Mean length: 10.432829
Min length: 1

Characters and Unicode

Total characters: 65466
Distinct characters: 804
Distinct categories: 10
Distinct scripts: 5
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2928
Unique (%): 46.7%

Sample

1st row: 조상원 편저
2nd row: 이용구,원태연,정성원 공저
3rd row: 이용구,원태연,정성원 공저
4th row: 신용태 편저
5th row: 신용태 편저

Value | Count | Frequency (%)
지음 | 244 | 2.1%
한국산업기술평가원 | 220 | 1.9%
(not recoverable) | 206 | 1.8%
옮김 | 200 | 1.7%
(not recoverable) | 163 | 1.4%
(not recoverable) | 152 | 1.3%
(not recoverable) | 129 | 1.1%
by | 117 | 1.0%
산업자원부 | 103 | 0.9%
공저 | 92 | 0.8%
Other values (5441) | 10091 | 86.1%

Most occurring characters

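Value | Count | Frequency (%)
(space) | 5482 | 8.4%
/ | 1619 | 2.5%
, | 1555 | 2.4%
(not recoverable) | 1519 | 2.3%
(not recoverable) | 1469 | 2.2%
(not recoverable) | 1415 | 2.2%
(not recoverable) | 1380 | 2.1%
(not recoverable) | 1159 | 1.8%
(not recoverable) | 1149 | 1.8%
(not recoverable) | 1093 | 1.7%
Other values (794) | 47626 | 72.7%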

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 45814 | 70.0%
Space Separator | 5482 | 8.4%
Lowercase Letter | 5299 | 8.1%
Other Punctuation | 4170 | 6.4%
Uppercase Letter | 4003 | 6.1%
Open Punctuation | 274 | 0.4%
Close Punctuation | 271 | 0.4%
Decimal Number | 124 | 0.2%
Dash Punctuation | 27 | < 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1519
 
3.3%
1469
 
3.2%
1415
 
3.1%
1380
 
3.0%
1159
 
2.5%
1149
 
2.5%
1093
 
2.4%
1065
 
2.3%
1000
 
2.2%
765
 
1.7%
Other values (717) 33800
73.8%
Lowercase Letter
ValueCountFrequency (%)
e 673
12.7%
a 552
10.4%
n 478
 
9.0%
i 431
 
8.1%
o 425
 
8.0%
r 422
 
8.0%
l 301
 
5.7%
t 298
 
5.6%
s 234
 
4.4%
h 212
 
4.0%
Other values (17) 1273
24.0%
Uppercase Letter
ValueCountFrequency (%)
A 363
 
9.1%
E 314
 
7.8%
R 302
 
7.5%
S 265
 
6.6%
C 220
 
5.5%
I 215
 
5.4%
T 209
 
5.2%
B 198
 
4.9%
L 196
 
4.9%
O 193
 
4.8%
Other values (16) 1528
38.2%
Decimal Number
ValueCountFrequency (%)
2 31
25.0%
1 28
22.6%
0 22
17.7%
7 11
 
8.9%
3 9
 
7.3%
6 8
 
6.5%
8 7
 
5.6%
4 5
 
4.0%
5 3
 
2.4%
Other Punctuation
ValueCountFrequency (%)
/ 1619
38.8%
, 1555
37.3%
. 738
17.7%
; 144
 
3.5%
& 74
 
1.8%
: 24
 
0.6%
· 14
 
0.3%
' 2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 197
71.9%
[ 77
 
28.1%
Close Punctuation
ValueCountFrequency (%)
) 193
71.2%
] 78
28.8%
Space Separator
ValueCountFrequency (%)
5482
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Math Symbol
ValueCountFrequency (%)
= 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 45706 | 69.8%
Common | 10350 | 15.8%
Latin | 9302 | 14.2%
Han | 103 | 0.2%
Hiragana | 5 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1519
 
3.3%
1469
 
3.2%
1415
 
3.1%
1380
 
3.0%
1159
 
2.5%
1149
 
2.5%
1093
 
2.4%
1065
 
2.3%
1000
 
2.2%
765
 
1.7%
Other values (645) 33692
73.7%
Han
ValueCountFrequency (%)
16
 
15.5%
3
 
2.9%
3
 
2.9%
3
 
2.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (58) 66
64.1%
Latin
ValueCountFrequency (%)
e 673
 
7.2%
a 552
 
5.9%
n 478
 
5.1%
i 431
 
4.6%
o 425
 
4.6%
r 422
 
4.5%
A 363
 
3.9%
E 314
 
3.4%
R 302
 
3.2%
l 301
 
3.2%
Other values (43) 5041
54.2%
Common
ValueCountFrequency (%)
5482
53.0%
/ 1619
 
15.6%
, 1555
 
15.0%
. 738
 
7.1%
( 197
 
1.9%
) 193
 
1.9%
; 144
 
1.4%
] 78
 
0.8%
[ 77
 
0.7%
& 74
 
0.7%
Other values (14) 193
 
1.9%
Hiragana
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 45582 | 69.6%
ASCII | 19637 | 30.0%
Compat Jamo | 124 | 0.2%
CJK | 102 | 0.2%
None | 15 | < 0.1%
Hiragana | 5 | < 0.1%
CJK Compat Ideographs | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5482
27.9%
/ 1619
 
8.2%
, 1555
 
7.9%
. 738
 
3.8%
e 673
 
3.4%
a 552
 
2.8%
n 478
 
2.4%
i 431
 
2.2%
o 425
 
2.2%
r 422
 
2.1%
Other values (65) 7262
37.0%
Hangul
ValueCountFrequency (%)
1519
 
3.3%
1469
 
3.2%
1415
 
3.1%
1380
 
3.0%
1159
 
2.5%
1149
 
2.5%
1093
 
2.4%
1065
 
2.3%
1000
 
2.2%
765
 
1.7%
Other values (643) 33568
73.6%
Compat Jamo
ValueCountFrequency (%)
123
99.2%
1
 
0.8%
CJK
ValueCountFrequency (%)
16
 
15.7%
3
 
2.9%
3
 
2.9%
3
 
2.9%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (57) 65
63.7%
None
ValueCountFrequency (%)
· 14
93.3%
ø 1
 
6.7%
Hiragana
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

출판사
Text

Distinct: 1957
Distinct (%): 29.8%
Missing: 1
Missing (%): < 0.1%
Memory size: 51.4 KiB

Length

Max length: 50
Median length: 38
Mean length: 7.6340906
Min length: 1

Characters and Unicode

Total characters: 50072
Distinct characters: 633
Distinct categories: 9
Distinct scripts: 5
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
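
The mean lengths reported for the three text variables are consistent with dividing each variable's total character count by its number of non-missing rows. The sketch below checks that relationship using figures copied from this report. Treating the mean as taken over non-missing rows only is an inference from the numbers, not something the report states, and the helper name `mean_length` is illustrative:

```python
# (total characters, rows, missing rows) per text variable, copied from this report.
text_stats = {
    "도서명": (153847, 6560, 0),
    "저자": (65466, 6560, 285),
    "출판사": (50072, 6560, 1),
}
# Mean lengths as printed in the report.
reported_mean = {"도서명": 23.452287, "저자": 10.432829, "출판사": 7.6340906}

def mean_length(total_chars, n_rows, n_missing):
    """Average characters per non-missing value."""
    return total_chars / (n_rows - n_missing)

for name, stats in text_stats.items():
    print(name, round(mean_length(*stats), 6), reported_mean[name])
```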

Unique

Unique: 1121
Unique (%): 17.1%

Sample

1st row: 현암사
2nd row: 자유아카데미
3rd row: 자유아카데미
4th row: 통상산업성전자기기과
5th row: 법전출판사

Value | Count | Frequency (%)
한국산업기술평가원 | 210 | 2.8%
한국과학기술정보연구원(kisti | 178 | 2.4%
산업자원부 | 145 | 1.9%
한국산업기술재단 | 125 | 1.7%
한국산업기술평가관리원 | 104 | 1.4%
특허청 | 103 | 1.4%
한국산업기술평가원(itep | 92 | 1.2%
산업연구원 | 87 | 1.2%
매일경제신문사 | 66 | 0.9%
press | 63 | 0.8%
Other values (2013) | 6336 | 84.4%

Most occurring characters

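Value | Count | Frequency (%)
(not recoverable) | 1932 | 3.9%
(not recoverable) | 1918 | 3.8%
(not recoverable) | 1881 | 3.8%
(not recoverable) | 1864 | 3.7%
(not recoverable) | 1497 | 3.0%
(not recoverable) | 1390 | 2.8%
(not recoverable) | 1320 | 2.6%
(not recoverable) | 988 | 2.0%
(not recoverable) | 984 | 2.0%
(space) | 962 | 1.9%
Other values (623) | 35336 | 70.6%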

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 39463 | 78.8%
Uppercase Letter | 4867 | 9.7%
Lowercase Letter | 2943 | 5.9%
Space Separator | 962 | 1.9%
Open Punctuation | 591 | 1.2%
Close Punctuation | 590 | 1.2%
Other Punctuation | 424 | 0.8%
Decimal Number | 205 | 0.4%
Dash Punctuation | 27 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1932
 
4.9%
1918
 
4.9%
1881
 
4.8%
1864
 
4.7%
1497
 
3.8%
1390
 
3.5%
1320
 
3.3%
988
 
2.5%
984
 
2.5%
917
 
2.3%
Other values (548) 24772
62.8%
Uppercase Letter
ValueCountFrequency (%)
I 799
16.4%
S 543
11.2%
T 539
11.1%
E 481
9.9%
P 358
 
7.4%
K 324
 
6.7%
A 291
 
6.0%
R 236
 
4.8%
C 185
 
3.8%
O 177
 
3.6%
Other values (16) 934
19.2%
Lowercase Letter
ValueCountFrequency (%)
e 354
12.0%
n 263
8.9%
i 255
 
8.7%
a 243
 
8.3%
r 237
 
8.1%
s 218
 
7.4%
o 213
 
7.2%
l 188
 
6.4%
t 176
 
6.0%
c 142
 
4.8%
Other values (16) 654
22.2%
Decimal Number
ValueCountFrequency (%)
1 81
39.5%
2 68
33.2%
4 14
 
6.8%
3 10
 
4.9%
6 8
 
3.9%
0 8
 
3.9%
7 7
 
3.4%
8 5
 
2.4%
9 2
 
1.0%
5 2
 
1.0%
Other Punctuation
ValueCountFrequency (%)
, 141
33.3%
/ 125
29.5%
. 81
19.1%
& 65
15.3%
' 5
 
1.2%
; 5
 
1.2%
: 2
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 589
99.7%
[ 2
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 588
99.7%
] 2
 
0.3%
Space Separator
ValueCountFrequency (%)
962
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 39355 | 78.6%
Latin | 7810 | 15.6%
Common | 2799 | 5.6%
Han | 103 | 0.2%
Katakana | 5 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1932
 
4.9%
1918
 
4.9%
1881
 
4.8%
1864
 
4.7%
1497
 
3.8%
1390
 
3.5%
1320
 
3.4%
988
 
2.5%
984
 
2.5%
917
 
2.3%
Other values (500) 24664
62.7%
Latin
ValueCountFrequency (%)
I 799
 
10.2%
S 543
 
7.0%
T 539
 
6.9%
E 481
 
6.2%
P 358
 
4.6%
e 354
 
4.5%
K 324
 
4.1%
A 291
 
3.7%
n 263
 
3.4%
i 255
 
3.3%
Other values (42) 3603
46.1%
Han
ValueCountFrequency (%)
19
18.4%
11
 
10.7%
8
 
7.8%
5
 
4.9%
4
 
3.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
Other values (33) 41
39.8%
Common
ValueCountFrequency (%)
962
34.4%
( 589
21.0%
) 588
21.0%
, 141
 
5.0%
/ 125
 
4.5%
1 81
 
2.9%
. 81
 
2.9%
2 68
 
2.4%
& 65
 
2.3%
- 27
 
1.0%
Other values (13) 72
 
2.6%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 39338 | 78.6%
ASCII | 10609 | 21.2%
CJK | 103 | 0.2%
Compat Jamo | 17 | < 0.1%
Katakana | 5 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1932
 
4.9%
1918
 
4.9%
1881
 
4.8%
1864
 
4.7%
1497
 
3.8%
1390
 
3.5%
1320
 
3.4%
988
 
2.5%
984
 
2.5%
917
 
2.3%
Other values (499) 24647
62.7%
ASCII
ValueCountFrequency (%)
962
 
9.1%
I 799
 
7.5%
( 589
 
5.6%
) 588
 
5.5%
S 543
 
5.1%
T 539
 
5.1%
E 481
 
4.5%
P 358
 
3.4%
e 354
 
3.3%
K 324
 
3.1%
Other values (65) 5072
47.8%
CJK
ValueCountFrequency (%)
19
18.4%
11
 
10.7%
8
 
7.8%
5
 
4.9%
4
 
3.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
Other values (33) 41
39.8%
Compat Jamo
ValueCountFrequency (%)
17
100.0%
Katakana
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

발행년도
Real number (ℝ)

Distinct: 42
Distinct (%): 0.6%
Missing: 47
Missing (%): 0.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 2007.0969
Minimum: 1976
Maximum: 2023
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 57.8 KiB

Quantile statistics

Minimum: 1976
5th percentile: 1995
Q1: 2002
Median: 2006
Q3: 2013
95th percentile: 2021
Maximum: 2023
Range: 47
Interquartile range (IQR): 11

Descriptive statistics

Standard deviation: 7.9915867
Coefficient of variation (CV): 0.0039816647
Kurtosis: -0.63996945
Mean: 2007.0969
Median Absolute Deviation (MAD): 5
Skewness: 0.17819647
Sum: 13072222
Variance: 63.865459
Monotonicity: Not monotonic
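
Several of the statistics reported for 발행년도 are derived from one another. The sketch below re-derives them from figures copied out of this section, as a consistency check rather than a reproduction of how the report computes them. The variable names are illustrative:

```python
# Figures copied from the statistics reported for 발행년도.
std = 7.9915867
mean = 2007.0969
total = 13072222
q1, q3 = 2002, 2013
minimum, maximum = 1976, 2023
n_rows, n_missing = 6560, 47

variance = std ** 2               # variance is the squared standard deviation
cv = std / mean                   # coefficient of variation
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum   # range
n_valid = n_rows - n_missing      # rows with a publication year
mean_from_sum = total / n_valid   # mean recomputed from the reported sum

print(variance, cv, iqr, value_range, n_valid, mean_from_sum)
```

The recomputed mean matches the reported one only when the sum is divided by the 6513 non-missing rows, which suggests missing years are excluded from the statistics.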
Histogram with fixed size bins (bins=42)

Common values

Value | Count | Frequency (%)
2004 | 510 | 7.8%
2005 | 375 | 5.7%
2002 | 353 | 5.4%
2000 | 342 | 5.2%
2020 | 341 | 5.2%
2003 | 329 | 5.0%
2006 | 321 | 4.9%
2001 | 320 | 4.9%
2008 | 297 | 4.5%
2007 | 295 | 4.5%
Other values (32) | 3030 | 46.2%

Minimum 10 values

Value | Count | Frequency (%)
1976 | 1 | < 0.1%
1979 | 1 | < 0.1%
1983 | 1 | < 0.1%
1985 | 5 | 0.1%
1986 | 2 | < 0.1%
1987 | 3 | < 0.1%
1988 | 3 | < 0.1%
1989 | 11 | 0.2%
1990 | 16 | 0.2%
1991 | 37 | 0.6%

Maximum 10 values

Value | Count | Frequency (%)
2023 | 41 | 0.6%
2022 | 65 | 1.0%
2021 | 293 | 4.5%
2020 | 341 | 5.2%
2019 | 256 | 3.9%
2018 | 91 | 1.4%
2017 | 52 | 0.8%
2016 | 72 | 1.1%
2015 | 99 | 1.5%
2014 | 106 | 1.6%

도서등록일자
DateTime

Distinct: 1560
Distinct (%): 23.8%
Missing: 0
Missing (%): 0.0%
Memory size: 51.4 KiB
Minimum: 1993-07-13 00:00:00
Maximum: 2023-08-02 00:00:00
Histogram with fixed size bins (bins=50)

도서위치
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 51.4 KiB

Value | Count
대구본원 | 5733
북카페 | 674
대전분원 | 151
자료실 | 2

Length

Max length: 4
Median length: 4
Mean length: 3.8969512
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대구본원
2nd row: 대구본원
3rd row: 대구본원
4th row: 대구본원
5th row: 대구본원

Common Values

Value | Count | Frequency (%)
대구본원 | 5733 | 87.4%
북카페 | 674 | 10.3%
대전분원 | 151 | 2.3%
자료실 | 2 | < 0.1%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values tabulated under Common Values]

Interactions


Correlations

Variable | 발행년도 | 도서위치
발행년도 | 1.000 | 0.672
도서위치 | 0.672 | 1.000

Variable | 발행년도 | 도서위치
발행년도 | 1.000 | 0.472
도서위치 | 0.472 | 1.000

Missing values

[Figure: count] A simple visualization of nullity by column.
[Figure: matrix] The nullity matrix is a data-dense display that lets you quickly spot patterns in data completeness.
[Figure: heatmap] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

Index | 도서명 | 도서구분(명) | 저자 | 출판사 | 발행년도 | 도서등록일자 | 도서위치
0 | 법전,1993 | 단행본 | 조상원 편저 | 현암사 | 1993 | 1993-07-13 | 대구본원
1 | SPSS/PC+ II | 단행본 | 이용구,원태연,정성원 공저 | 자유아카데미 | 1991 | 1993-09-16 | 대구본원
2 | SPSS/PC+ I | 단행본 | 이용구,원태연,정성원 공저 | 자유아카데미 | 1991 | 1993-09-16 | 대구본원
3 | 90년대의 전자산업비젼 | 단행본 | <NA> | 통상산업성전자기기과 | 1989 | 1993-10-11 | 대구본원
4 | 소법전,1993 | 단행본 | <NA> | 법전출판사 | 1993 | 1993-11-12 | 대구본원
5 | 일본어한자읽기사전 | 단행본 | 신용태 편저 | 해문출판사 | 1992 | 1993-11-12 | 대구본원
6 | 일본어한자읽기사전 | 단행본 | 신용태 편저 | 해문출판사 | 1992 | 1993-11-12 | 대구본원
7 | 10년후 일본의 선진기술시장 | 단행본 | 청유전 저 | 다이아몬드사 | 1992 | 1993-11-12 | 대구본원
8 | 일본의 오리지널기술 아메리카에 공포한다 | 단행본 | 소림기흥 저 | 조도전출판 | 1990 | 1993-11-12 | 대구본원
9 | 산업과학기술의 동향과 과제 | 단행본 | <NA> | 통상산업성 | 1992 | 1993-11-26 | 대구본원

Last rows

Index | 도서명 | 도서구분(명) | 저자 | 출판사 | 발행년도 | 도서등록일자 | 도서위치
6550 | 128호실의 원고 | 단행본 | 카티 보니당 | 한스미디어 | 2020 | 2023-07-21 | 북카페
6551 | 일의 격 | 단행본 | 신수정 | 턴어라운드 | 2021 | 2023-07-21 | 북카페
6552 | 거인의 어깨 위에서 | 단행본 | 선우휘 외 | 조선뉴스프레스 | 2023 | 2023-07-21 | 북카페
6553 | 세이노의 가르침 | 단행본 | 세이노 | 데이원 | 2023 | 2023-07-21 | 대전분원
6554 | 카구야 프로젝트 | 단행본 | 원샨 | 아작 | 2020 | 2023-07-21 | 북카페
6555 | 과학이 필요한 시간 : 빅뱅에서 다중우주로 가는 초광속 초밀착 길 안내서 | 단행본 | 궤도 | 동아시아 | 2022 | 2023-07-21 | 북카페
6556 | 소르본 철학 수업 | 단행본 | 전진 | 나무의철학 | 2020 | 2023-07-21 | 북카페
6557 | 세계를 품다 2023 | 단행본 | 글로벌 리더 선정자 22인 | 매일경제신문사 | 2023 | 2023-07-21 | 북카페
6558 | ESS사업 아는 만큼 성공한다 | 단행본 | 최동배 | 산경E뉴스신문사 | 2023 | 2023-08-02 | 북카페
6559 | ESS사업 아는 만큼 성공한다 | 단행본 | 최동배 | 산경E뉴스신문사 | 2023 | 2023-08-02 | 북카페

Duplicate rows

Most frequently occurring

Index | 도서명 | 도서구분(명) | 저자 | 출판사 | 발행년도 | 도서등록일자 | 도서위치 | # duplicates
110 | Turning Points : GLOBAL AGENDA 2018 | 단행본 | 뉴스1 | 뉴스1 | 2018 | 2018-03-12 | 대구본원 | 10
230 | 산업집적의 공간구조와 지역혁신 거버넌스 | 단행본 | 정준호,김선배,변창욱 공저 | 산업연구원 | 2004 | 2007-03-26 | 대구본원 | 8
94 | High-Office Program : 2007 Microsoft Office System | 단행본 | 한국마이크로소프트(유) | 한국마이크로소프트(유) | 2007 | 2007-10-31 | 대구본원 | 5
153 | 김대중 대통령의 시스템 사고 | 단행본 | 김동환 | 집문당 | 2000 | 2018-02-12 | 대구본원 | 5
157 | 나를 성장시키는 생각의 기술 | 단행본 | 이창후 | 소울메이트 | 2011 | 2018-02-13 | 대구본원 | 5
160 | 논리적 사고와 글쓰기 | 단행본 | 가톨릭관동대학교 글쓰기 교재 편찬위원회 | 경진출판 | 2017 | 2018-02-13 | 대구본원 | 5
173 | 디베이트와 논리적 사고 | 단행본 | Dr.Z | 성숙한삶 | 2013 | 2018-02-12 | 대구본원 | 5
1 | (11가지 질문도구의)비판적 사고력 연습 | 단행본 | M. 닐 브라운 | 돈키호테 | 2016 | 2018-02-12 | 대구본원 | 4
86 | 4차산업 투자지도 | 단행본 | 한국비즈니스정보 | 어바웃어북 | 2017 | 2020-04-28 | 대구본원 | 4
325 | 초일류 기업의 합리적 사고력 | 단행본 | 강관수 | 세화 | 2007 | 2018-02-12 | 대구본원 | 4
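
A row counts as a duplicate only when every column matches another row. The sketch below shows one way to find such rows with the standard library, using the last three rows of the dataset sample. It counts the extra copies beyond the first occurrence. Whether the report's total of 350 counts extra copies or every member of a duplicate group is not stated, so that choice is an assumption, and the function name `count_duplicate_rows` is illustrative:

```python
from collections import Counter

def count_duplicate_rows(rows):
    """Return (number of rows repeating an earlier row, {row: occurrences} for repeated rows)."""
    counts = Counter(rows)
    extra = sum(n - 1 for n in counts.values() if n > 1)
    groups = {row: n for row, n in counts.items() if n > 1}
    return extra, groups

# Rows 6557 to 6559 of the dataset; each tuple holds all seven columns.
rows = [
    ("세계를 품다 2023", "단행본", "글로벌 리더 선정자 22인", "매일경제신문사", 2023, "2023-07-21", "북카페"),
    ("ESS사업 아는 만큼 성공한다", "단행본", "최동배", "산경E뉴스신문사", 2023, "2023-08-02", "북카페"),
    ("ESS사업 아는 만큼 성공한다", "단행본", "최동배", "산경E뉴스신문사", 2023, "2023-08-02", "북카페"),
]

extra, groups = count_duplicate_rows(rows)
print(extra, list(groups.values()))
```

Because the dataset has no copy or accession number column, multiple physical copies of one book registered on the same day at the same location are indistinguishable and show up as duplicates.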