Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells350
Missing cells (%)0.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Text3
Categorical1

Dataset

Description기초과학연구원 과학문화센터 전자도서관 소장 도서정보입니다. 해당 데이터가 보유한 컬럼은 다음과 같습니다.컬럼명: 서명, 저자, 출판사, 출판년, 매체
Author기초과학연구원
URLhttps://www.data.go.kr/data/15053238/fileData.do

Alerts

매체 has constant value ""Constant
출판년 has 272 (2.7%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:59:53.358969
Analysis finished2023-12-12 13:59:56.931402
Duration3.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9412.8339
Minimum1
Maximum18850
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T22:59:57.084235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile913.85
Q14690.5
median9507
Q314065.25
95-th percentile17868.05
Maximum18850
Range18849
Interquartile range (IQR)9374.75

Descriptive statistics

Standard deviation5432.6281
Coefficient of variation (CV)0.57715117
Kurtosis-1.1953912
Mean9412.8339
Median Absolute Deviation (MAD)4686
Skewness-0.011646075
Sum94128339
Variance29513448
MonotonicityNot monotonic
2023-12-12T22:59:57.404231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18719 1
 
< 0.1%
1187 1
 
< 0.1%
10475 1
 
< 0.1%
7878 1
 
< 0.1%
2963 1
 
< 0.1%
13703 1
 
< 0.1%
11187 1
 
< 0.1%
13407 1
 
< 0.1%
3810 1
 
< 0.1%
12266 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
8 1
< 0.1%
10 1
< 0.1%
12 1
< 0.1%
14 1
< 0.1%
16 1
< 0.1%
17 1
< 0.1%
18 1
< 0.1%
19 1
< 0.1%
24 1
< 0.1%
ValueCountFrequency (%)
18850 1
< 0.1%
18847 1
< 0.1%
18846 1
< 0.1%
18845 1
< 0.1%
18844 1
< 0.1%
18843 1
< 0.1%
18840 1
< 0.1%
18839 1
< 0.1%
18838 1
< 0.1%
18837 1
< 0.1%

서명
Text

Distinct9187
Distinct (%)91.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T22:59:57.930912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length313
Median length136
Mean length32.566
Min length1

Characters and Unicode

Total characters325660
Distinct characters1494
Distinct categories18 ?
Distinct scripts8 ?
Distinct blocks16 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8689 ?
Unique (%)86.9%

Sample

1st row작은 별이지만 빛나고 있어: 소윤 에세이
2nd rowMe, myself, and why : searching for the science of self
3rd row1분 경영
4th rowRace Tech's motorcycle suspension bible
5th row잠중록: 처처칭한 장편소설. 4
ValueCountFrequency (%)
4256
 
6.2%
the 1973
 
2.9%
of 1340
 
1.9%
and 987
 
1.4%
a 571
 
0.8%
이야기 476
 
0.7%
in 378
 
0.5%
to 362
 
0.5%
science 357
 
0.5%
위한 265
 
0.4%
Other values (21473) 58224
84.2%
2023-12-12T22:59:58.552324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
59200
 
18.2%
e 14070
 
4.3%
o 10109
 
3.1%
i 9908
 
3.0%
n 9799
 
3.0%
t 9622
 
3.0%
a 9275
 
2.8%
r 7569
 
2.3%
s 7511
 
2.3%
: 5474
 
1.7%
Other values (1484) 183123
56.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 122129
37.5%
Lowercase Letter 119330
36.6%
Space Separator 59200
18.2%
Other Punctuation 10070
 
3.1%
Uppercase Letter 6367
 
2.0%
Decimal Number 3985
 
1.2%
Close Punctuation 1603
 
0.5%
Open Punctuation 1603
 
0.5%
Math Symbol 835
 
0.3%
Dash Punctuation 397
 
0.1%
Other values (8) 141
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4629
 
3.8%
3132
 
2.6%
3076
 
2.5%
2873
 
2.4%
2361
 
1.9%
2161
 
1.8%
1801
 
1.5%
1730
 
1.4%
1728
 
1.4%
1688
 
1.4%
Other values (1364) 96950
79.4%
Lowercase Letter
ValueCountFrequency (%)
e 14070
11.8%
o 10109
 
8.5%
i 9908
 
8.3%
n 9799
 
8.2%
t 9622
 
8.1%
a 9275
 
7.8%
r 7569
 
6.3%
s 7511
 
6.3%
h 5325
 
4.5%
c 5009
 
4.2%
Other values (18) 31133
26.1%
Uppercase Letter
ValueCountFrequency (%)
T 969
15.2%
S 629
 
9.9%
A 604
 
9.5%
D 353
 
5.5%
E 314
 
4.9%
C 311
 
4.9%
M 307
 
4.8%
P 296
 
4.6%
I 290
 
4.6%
B 269
 
4.2%
Other values (16) 2025
31.8%
Other Punctuation
ValueCountFrequency (%)
: 5474
54.4%
, 2004
 
19.9%
. 834
 
8.3%
? 403
 
4.0%
' 357
 
3.5%
/ 313
 
3.1%
! 274
 
2.7%
· 211
 
2.1%
57
 
0.6%
& 57
 
0.6%
Other values (8) 86
 
0.9%
Decimal Number
ValueCountFrequency (%)
1 998
25.0%
0 850
21.3%
2 730
18.3%
3 327
 
8.2%
5 269
 
6.8%
4 245
 
6.1%
9 143
 
3.6%
8 142
 
3.6%
7 141
 
3.5%
6 140
 
3.5%
Math Symbol
ValueCountFrequency (%)
= 745
89.2%
+ 27
 
3.2%
~ 25
 
3.0%
< 12
 
1.4%
> 12
 
1.4%
7
 
0.8%
× 3
 
0.4%
2
 
0.2%
| 1
 
0.1%
÷ 1
 
0.1%
Other Symbol
ValueCountFrequency (%)
77
82.8%
9
 
9.7%
2
 
2.2%
2
 
2.2%
® 2
 
2.2%
1
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 1575
98.3%
] 21
 
1.3%
5
 
0.3%
2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1575
98.3%
[ 21
 
1.3%
5
 
0.3%
2
 
0.1%
Letter Number
ValueCountFrequency (%)
13
44.8%
9
31.0%
5
 
17.2%
2
 
6.9%
Modifier Symbol
ValueCountFrequency (%)
` 7
77.8%
˚ 2
 
22.2%
Other Number
ValueCountFrequency (%)
² 3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
59200
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 397
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Nonspacing Mark
ValueCountFrequency (%)
́ 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 125723
38.6%
Hangul 121651
37.4%
Common 77804
23.9%
Han 453
 
0.1%
Katakana 19
 
< 0.1%
Hiragana 6
 
< 0.1%
Greek 3
 
< 0.1%
Inherited 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4629
 
3.8%
3132
 
2.6%
3076
 
2.5%
2873
 
2.4%
2361
 
1.9%
2161
 
1.8%
1801
 
1.5%
1730
 
1.4%
1728
 
1.4%
1688
 
1.4%
Other values (1151) 96472
79.3%
Han
ValueCountFrequency (%)
20
 
4.4%
19
 
4.2%
13
 
2.9%
12
 
2.6%
12
 
2.6%
11
 
2.4%
11
 
2.4%
11
 
2.4%
11
 
2.4%
11
 
2.4%
Other values (184) 322
71.1%
Common
ValueCountFrequency (%)
59200
76.1%
: 5474
 
7.0%
, 2004
 
2.6%
) 1575
 
2.0%
( 1575
 
2.0%
1 998
 
1.3%
0 850
 
1.1%
. 834
 
1.1%
= 745
 
1.0%
2 730
 
0.9%
Other values (51) 3819
 
4.9%
Latin
ValueCountFrequency (%)
e 14070
 
11.2%
o 10109
 
8.0%
i 9908
 
7.9%
n 9799
 
7.8%
t 9622
 
7.7%
a 9275
 
7.4%
r 7569
 
6.0%
s 7511
 
6.0%
h 5325
 
4.2%
c 5009
 
4.0%
Other values (47) 37526
29.8%
Katakana
ValueCountFrequency (%)
3
15.8%
2
10.5%
2
10.5%
2
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (4) 4
21.1%
Hiragana
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Greek
ValueCountFrequency (%)
π 3
100.0%
Inherited
ValueCountFrequency (%)
́ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 203074
62.4%
Hangul 121604
37.3%
CJK 441
 
0.1%
None 326
 
0.1%
Box Drawing 79
 
< 0.1%
Compat Jamo 47
 
< 0.1%
Number Forms 29
 
< 0.1%
Katakana 19
 
< 0.1%
CJK Compat Ideographs 12
 
< 0.1%
Enclosed Alphanum 9
 
< 0.1%
Other values (6) 20
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
59200
29.2%
e 14070
 
6.9%
o 10109
 
5.0%
i 9908
 
4.9%
n 9799
 
4.8%
t 9622
 
4.7%
a 9275
 
4.6%
r 7569
 
3.7%
s 7511
 
3.7%
: 5474
 
2.7%
Other values (78) 60537
29.8%
Hangul
ValueCountFrequency (%)
4629
 
3.8%
3132
 
2.6%
3076
 
2.5%
2873
 
2.4%
2361
 
1.9%
2161
 
1.8%
1801
 
1.5%
1730
 
1.4%
1728
 
1.4%
1688
 
1.4%
Other values (1145) 96425
79.3%
None
ValueCountFrequency (%)
· 211
64.7%
57
 
17.5%
11
 
3.4%
7
 
2.1%
7
 
2.1%
5
 
1.5%
5
 
1.5%
× 3
 
0.9%
² 3
 
0.9%
π 3
 
0.9%
Other values (8) 14
 
4.3%
Box Drawing
ValueCountFrequency (%)
77
97.5%
2
 
2.5%
Compat Jamo
ValueCountFrequency (%)
29
61.7%
7
 
14.9%
6
 
12.8%
2
 
4.3%
2
 
4.3%
1
 
2.1%
CJK
ValueCountFrequency (%)
20
 
4.5%
19
 
4.3%
13
 
2.9%
12
 
2.7%
12
 
2.7%
11
 
2.5%
11
 
2.5%
11
 
2.5%
11
 
2.5%
11
 
2.5%
Other values (177) 310
70.3%
Number Forms
ValueCountFrequency (%)
13
44.8%
9
31.0%
5
 
17.2%
2
 
6.9%
Enclosed Alphanum
ValueCountFrequency (%)
9
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
5
41.7%
2
 
16.7%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
Punctuation
ValueCountFrequency (%)
4
50.0%
2
25.0%
2
25.0%
Katakana
ValueCountFrequency (%)
3
15.8%
2
10.5%
2
10.5%
2
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (4) 4
21.1%
Letterlike Symbols
ValueCountFrequency (%)
2
100.0%
Hiragana
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Modifier Letters
ValueCountFrequency (%)
˚ 2
100.0%
Diacriticals
ValueCountFrequency (%)
́ 1
100.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct6657
Distinct (%)67.0%
Missing64
Missing (%)0.6%
Memory size156.2 KiB
2023-12-12T22:59:58.995483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length36
Mean length8.8144122
Min length2

Characters and Unicode

Total characters87580
Distinct characters717
Distinct categories15 ?
Distinct scripts6 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5247 ?
Unique (%)52.8%

Sample

1st row소윤
2nd rowOuellette, Jennifer
3rd rowBlanchard, Ken
4th rowParks, Lee
5th row서미영
ValueCountFrequency (%)
j 161
 
1.0%
동아사이언스 129
 
0.8%
m 126
 
0.8%
john 121
 
0.7%
david 115
 
0.7%
a 115
 
0.7%
richard 111
 
0.7%
michael 107
 
0.6%
r 95
 
0.6%
정완상 93
 
0.6%
Other values (7798) 15347
92.9%
2023-12-12T22:59:59.991709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6586
 
7.5%
e 5519
 
6.3%
a 5378
 
6.1%
, 4699
 
5.4%
n 4279
 
4.9%
r 4179
 
4.8%
i 3652
 
4.2%
o 3078
 
3.5%
l 2859
 
3.3%
t 2086
 
2.4%
Other values (707) 45265
51.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 44998
51.4%
Other Letter 20424
23.3%
Uppercase Letter 10436
 
11.9%
Space Separator 6586
 
7.5%
Other Punctuation 4971
 
5.7%
Dash Punctuation 125
 
0.1%
Decimal Number 11
 
< 0.1%
Other Symbol 7
 
< 0.1%
Open Punctuation 6
 
< 0.1%
Close Punctuation 6
 
< 0.1%
Other values (5) 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1011
 
5.0%
792
 
3.9%
554
 
2.7%
416
 
2.0%
405
 
2.0%
330
 
1.6%
310
 
1.5%
302
 
1.5%
294
 
1.4%
248
 
1.2%
Other values (600) 15762
77.2%
Lowercase Letter
ValueCountFrequency (%)
e 5519
12.3%
a 5378
12.0%
n 4279
9.5%
r 4179
9.3%
i 3652
 
8.1%
o 3078
 
6.8%
l 2859
 
6.4%
t 2086
 
4.6%
s 2069
 
4.6%
h 1886
 
4.2%
Other values (35) 10013
22.3%
Uppercase Letter
ValueCountFrequency (%)
S 940
 
9.0%
M 928
 
8.9%
J 810
 
7.8%
B 689
 
6.6%
R 684
 
6.6%
D 659
 
6.3%
C 637
 
6.1%
A 582
 
5.6%
H 520
 
5.0%
P 508
 
4.9%
Other values (22) 3479
33.3%
Decimal Number
ValueCountFrequency (%)
2 3
27.3%
0 2
18.2%
1 2
18.2%
5 1
 
9.1%
9 1
 
9.1%
8 1
 
9.1%
3 1
 
9.1%
Other Symbol
ValueCountFrequency (%)
2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Other Punctuation
ValueCountFrequency (%)
, 4699
94.5%
. 241
 
4.8%
' 21
 
0.4%
? 10
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 5
83.3%
1
 
16.7%
Close Punctuation
ValueCountFrequency (%)
) 5
83.3%
1
 
16.7%
Math Symbol
ValueCountFrequency (%)
< 2
50.0%
> 2
50.0%
Modifier Symbol
ValueCountFrequency (%)
´ 1
50.0%
¨ 1
50.0%
Space Separator
ValueCountFrequency (%)
6586
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 125
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Nonspacing Mark
ValueCountFrequency (%)
̈ 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 55392
63.2%
Hangul 20398
 
23.3%
Common 11720
 
13.4%
Cyrillic 43
 
< 0.1%
Han 26
 
< 0.1%
Inherited 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1011
 
5.0%
792
 
3.9%
554
 
2.7%
416
 
2.0%
405
 
2.0%
330
 
1.6%
310
 
1.5%
302
 
1.5%
294
 
1.4%
248
 
1.2%
Other values (575) 15736
77.1%
Latin
ValueCountFrequency (%)
e 5519
 
10.0%
a 5378
 
9.7%
n 4279
 
7.7%
r 4179
 
7.5%
i 3652
 
6.6%
o 3078
 
5.6%
l 2859
 
5.2%
t 2086
 
3.8%
s 2069
 
3.7%
h 1886
 
3.4%
Other values (47) 20407
36.8%
Common
ValueCountFrequency (%)
6586
56.2%
, 4699
40.1%
. 241
 
2.1%
- 125
 
1.1%
' 21
 
0.2%
? 10
 
0.1%
( 5
 
< 0.1%
) 5
 
< 0.1%
2 3
 
< 0.1%
2
 
< 0.1%
Other values (18) 23
 
0.2%
Han
ValueCountFrequency (%)
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (15) 15
57.7%
Cyrillic
ValueCountFrequency (%)
и 6
14.0%
о 5
11.6%
р 3
 
7.0%
д 3
 
7.0%
в 3
 
7.0%
а 3
 
7.0%
к 2
 
4.7%
с 2
 
4.7%
В 2
 
4.7%
л 2
 
4.7%
Other values (11) 12
27.9%
Inherited
ValueCountFrequency (%)
̈ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 67091
76.6%
Hangul 20398
 
23.3%
Cyrillic 43
 
< 0.1%
CJK 25
 
< 0.1%
None 11
 
< 0.1%
Box Drawing 7
 
< 0.1%
Punctuation 2
 
< 0.1%
Diacriticals 1
 
< 0.1%
Number Forms 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6586
 
9.8%
e 5519
 
8.2%
a 5378
 
8.0%
, 4699
 
7.0%
n 4279
 
6.4%
r 4179
 
6.2%
i 3652
 
5.4%
o 3078
 
4.6%
l 2859
 
4.3%
t 2086
 
3.1%
Other values (59) 24776
36.9%
Hangul
ValueCountFrequency (%)
1011
 
5.0%
792
 
3.9%
554
 
2.7%
416
 
2.0%
405
 
2.0%
330
 
1.6%
310
 
1.5%
302
 
1.5%
294
 
1.4%
248
 
1.2%
Other values (575) 15736
77.1%
Cyrillic
ValueCountFrequency (%)
и 6
14.0%
о 5
11.6%
р 3
 
7.0%
д 3
 
7.0%
в 3
 
7.0%
а 3
 
7.0%
к 2
 
4.7%
с 2
 
4.7%
В 2
 
4.7%
л 2
 
4.7%
Other values (11) 12
27.9%
Punctuation
ValueCountFrequency (%)
2
100.0%
Box Drawing
ValueCountFrequency (%)
2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
None
ValueCountFrequency (%)
ł 2
18.2%
ø 2
18.2%
ü 2
18.2%
1
9.1%
1
9.1%
´ 1
9.1%
Ø 1
9.1%
¨ 1
9.1%
CJK
ValueCountFrequency (%)
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (14) 14
56.0%
Diacriticals
ValueCountFrequency (%)
̈ 1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct2525
Distinct (%)25.3%
Missing14
Missing (%)0.1%
Memory size156.2 KiB
2023-12-12T23:00:00.400424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length74
Median length67
Mean length7.4137793
Min length1

Characters and Unicode

Total characters74034
Distinct characters711
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1397 ?
Unique (%)14.0%

Sample

1st row북로망스
2nd rowPenguin Books
3rd row21세기북스
4th rowMotorbooks
5th row아르테
ValueCountFrequency (%)
press 657
 
4.8%
university 452
 
3.3%
books 393
 
2.9%
동아사이언스 194
 
1.4%
자음과모음 193
 
1.4%
사이언스북스 190
 
1.4%
oxford 163
 
1.2%
김영사 145
 
1.1%
of 143
 
1.0%
114
 
0.8%
Other values (2547) 11052
80.7%
2023-12-12T23:00:00.972758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3711
 
5.0%
e 3446
 
4.7%
r 3302
 
4.5%
i 3245
 
4.4%
s 3239
 
4.4%
o 2819
 
3.8%
n 2736
 
3.7%
a 1966
 
2.7%
1806
 
2.4%
1766
 
2.4%
Other values (701) 45998
62.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 32075
43.3%
Other Letter 30787
41.6%
Uppercase Letter 6465
 
8.7%
Space Separator 3711
 
5.0%
Other Punctuation 653
 
0.9%
Decimal Number 190
 
0.3%
Dash Punctuation 70
 
0.1%
Open Punctuation 41
 
0.1%
Close Punctuation 41
 
0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1806
 
5.9%
1766
 
5.7%
1213
 
3.9%
954
 
3.1%
904
 
2.9%
575
 
1.9%
552
 
1.8%
515
 
1.7%
496
 
1.6%
487
 
1.6%
Other values (621) 21519
69.9%
Lowercase Letter
ValueCountFrequency (%)
e 3446
10.7%
r 3302
10.3%
i 3245
10.1%
s 3239
10.1%
o 2819
 
8.8%
n 2736
 
8.5%
a 1966
 
6.1%
t 1556
 
4.9%
l 1257
 
3.9%
c 902
 
2.8%
Other values (16) 7607
23.7%
Uppercase Letter
ValueCountFrequency (%)
P 1236
19.1%
B 798
12.3%
C 490
 
7.6%
U 469
 
7.3%
S 462
 
7.1%
W 352
 
5.4%
H 337
 
5.2%
M 267
 
4.1%
A 250
 
3.9%
O 231
 
3.6%
Other values (16) 1573
24.3%
Other Punctuation
ValueCountFrequency (%)
. 285
43.6%
124
19.0%
, 109
 
16.7%
& 55
 
8.4%
' 34
 
5.2%
/ 30
 
4.6%
? 5
 
0.8%
# 4
 
0.6%
· 3
 
0.5%
: 2
 
0.3%
Other values (2) 2
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 72
37.9%
1 64
33.7%
0 18
 
9.5%
8 14
 
7.4%
3 9
 
4.7%
5 5
 
2.6%
4 4
 
2.1%
9 3
 
1.6%
7 1
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 22
53.7%
[ 19
46.3%
Close Punctuation
ValueCountFrequency (%)
) 22
53.7%
] 19
46.3%
Space Separator
ValueCountFrequency (%)
3711
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 70
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 38540
52.1%
Hangul 30537
41.2%
Common 4707
 
6.4%
Han 250
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1806
 
5.9%
1766
 
5.8%
1213
 
4.0%
954
 
3.1%
904
 
3.0%
575
 
1.9%
552
 
1.8%
515
 
1.7%
496
 
1.6%
487
 
1.6%
Other values (554) 21269
69.6%
Han
ValueCountFrequency (%)
40
 
16.0%
26
 
10.4%
10
 
4.0%
10
 
4.0%
8
 
3.2%
8
 
3.2%
7
 
2.8%
6
 
2.4%
6
 
2.4%
6
 
2.4%
Other values (57) 123
49.2%
Latin
ValueCountFrequency (%)
e 3446
 
8.9%
r 3302
 
8.6%
i 3245
 
8.4%
s 3239
 
8.4%
o 2819
 
7.3%
n 2736
 
7.1%
a 1966
 
5.1%
t 1556
 
4.0%
l 1257
 
3.3%
P 1236
 
3.2%
Other values (42) 13738
35.6%
Common
ValueCountFrequency (%)
3711
78.8%
. 285
 
6.1%
124
 
2.6%
, 109
 
2.3%
2 72
 
1.5%
- 70
 
1.5%
1 64
 
1.4%
& 55
 
1.2%
' 34
 
0.7%
/ 30
 
0.6%
Other values (18) 153
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 43120
58.2%
Hangul 30537
41.2%
CJK 250
 
0.3%
None 127
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3711
 
8.6%
e 3446
 
8.0%
r 3302
 
7.7%
i 3245
 
7.5%
s 3239
 
7.5%
o 2819
 
6.5%
n 2736
 
6.3%
a 1966
 
4.6%
t 1556
 
3.6%
l 1257
 
2.9%
Other values (68) 15843
36.7%
Hangul
ValueCountFrequency (%)
1806
 
5.9%
1766
 
5.8%
1213
 
4.0%
954
 
3.1%
904
 
3.0%
575
 
1.9%
552
 
1.8%
515
 
1.7%
496
 
1.6%
487
 
1.6%
Other values (554) 21269
69.6%
None
ValueCountFrequency (%)
124
97.6%
· 3
 
2.4%
CJK
ValueCountFrequency (%)
40
 
16.0%
26
 
10.4%
10
 
4.0%
10
 
4.0%
8
 
3.2%
8
 
3.2%
7
 
2.8%
6
 
2.4%
6
 
2.4%
6
 
2.4%
Other values (57) 123
49.2%

출판년
Real number (ℝ)

MISSING 

Distinct62
Distinct (%)0.6%
Missing272
Missing (%)2.7%
Infinite0
Infinite (%)0.0%
Mean2013.2685
Minimum1952
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T23:00:01.160142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1952
5-th percentile2000
Q12011
median2015
Q32018
95-th percentile2021
Maximum2023
Range71
Interquartile range (IQR)7

Descriptive statistics

Standard deviation6.9672579
Coefficient of variation (CV)0.00346067
Kurtosis8.3113101
Mean2013.2685
Median Absolute Deviation (MAD)3
Skewness-2.1341914
Sum19585076
Variance48.542682
MonotonicityNot monotonic
2023-12-12T23:00:01.340008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2016 1033
 
10.3%
2018 890
 
8.9%
2017 830
 
8.3%
2019 771
 
7.7%
2015 758
 
7.6%
2014 616
 
6.2%
2013 593
 
5.9%
2012 516
 
5.2%
2020 469
 
4.7%
2011 363
 
3.6%
Other values (52) 2889
28.9%
ValueCountFrequency (%)
1952 2
< 0.1%
1954 1
 
< 0.1%
1957 1
 
< 0.1%
1963 3
< 0.1%
1964 1
 
< 0.1%
1965 1
 
< 0.1%
1967 2
< 0.1%
1968 3
< 0.1%
1969 3
< 0.1%
1970 3
< 0.1%
ValueCountFrequency (%)
2023 222
 
2.2%
2022 166
 
1.7%
2021 145
 
1.5%
2020 469
4.7%
2019 771
7.7%
2018 890
8.9%
2017 830
8.3%
2016 1033
10.3%
2015 758
7.6%
2014 616
6.2%

매체
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인쇄
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인쇄
2nd row인쇄
3rd row인쇄
4th row인쇄
5th row인쇄

Common Values

ValueCountFrequency (%)
인쇄 10000
100.0%

Length

2023-12-12T23:00:01.535746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:00:01.637831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인쇄 10000
100.0%

Interactions

2023-12-12T22:59:55.869664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:59:55.390603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:59:56.073750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:59:55.644740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:00:01.701803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번출판년
순번1.0000.580
출판년0.5801.000
2023-12-12T23:00:01.812195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번출판년
순번1.0000.497
출판년0.4971.000

Missing values

2023-12-12T22:59:56.361274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:59:56.575274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T22:59:56.793741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번서명저자출판사출판년매체
1871818719작은 별이지만 빛나고 있어: 소윤 에세이소윤북로망스2021인쇄
1229012291Me, myself, and why : searching for the science of selfOuellette, JenniferPenguin Books2014인쇄
483748381분 경영Blanchard, Ken21세기북스2016인쇄
1337513376Race Tech's motorcycle suspension bibleParks, LeeMotorbooks2010인쇄
1549315494잠중록: 처처칭한 장편소설. 4서미영아르테2019인쇄
50465047두 글자 : 일상과 운동을 엿보다이학준시간의물레2016인쇄
41694170아프리카, 중국의 두 번째 대륙 : 100만 이주자의 아프리카 새 왕국 건설기French, Howard W지식의날개2015인쇄
43434344스타트업 바이블 : 세계 최초로 공개되는 24단계 MIT 창업 프로그램Aulet, Bill비즈니스북스2015인쇄
54755476(뇌과학으로 읽는) 트라우마와 통증 : 우리 몸의 생존법Haines, Steve푸른지식2016인쇄
1578115782공부, 이래도 안되면 포기하세요: 무조건 합격을 부르는 최강의 멘탈 솔루션이지훈위즈덤하우스2020인쇄
순번서명저자출판사출판년매체
1256312564The exact sciences in antiquityNeugebauer, ODover Publications1969인쇄
722723High dimensional probability IIIHoffmann-Jørgensen, JBirkhauser2004인쇄
1736617367하룻밤에 읽는 경제학Montousse, Marc랜덤하우스코리아2011인쇄
1736817369통섭의 식탁최재천움직이는서재2015인쇄
1243312434Testosterone rex : myths of sex, science, and societyFine, CordeliaW.W. Norton & Company2018인쇄
97029703과학공화국 화학법정. 6, 신기한 금속정완상자음과모음2016인쇄
1287512876Scientific practice : theories and stories of doing physicsBuchwald, Jed ZThe University of Chicago Press1995인쇄
1409614097The moral arc : how science makes us better peopleShermer, MichaelSt. Martin's Griffin2016인쇄
75857586일상적이지만 절대적인 뇌과학지식 50Costandi, Moheb반니2016인쇄
53685369일곱 가지 이야기가노 도모코피니스아프리카에2016인쇄