Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 350 |
Missing cells (%) | 0.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 566.4 KiB |
Average record size in memory | 58.0 B |
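The two derived figures in this table follow from the others by simple arithmetic. The sketch below assumes the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is the total memory divided by the number of observations. The function names are illustrative and are not part of ydata-profiling.

```python
def missing_cell_pct(missing_cells: int, n_vars: int, n_obs: int) -> float:
    """Share of missing cells among all cells, as a percentage."""
    return 100.0 * missing_cells / (n_vars * n_obs)


def avg_record_size_bytes(total_kib: float, n_obs: int) -> float:
    """Average bytes per row, given the total in-memory size in KiB."""
    return total_kib * 1024 / n_obs


# Inputs copied from the Dataset statistics table.
print(f"{missing_cell_pct(350, 6, 10000):.1f}%")       # 0.6%
print(f"{avg_record_size_bytes(566.4, 10000):.1f} B")  # 58.0 B
```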
Variable types
Numeric | 2 |
---|---|
Text | 3 |
Categorical | 1 |
Dataset
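Description | Book holdings of the electronic library at the Science Culture Center of the Institute for Basic Science (기초과학연구원). The dataset contains the following columns: 서명 (title), 저자 (author), 출판사 (publisher), 출판년 (publication year), 매체 (medium). |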
---|---|
Author | Institute for Basic Science (기초과학연구원) |
URL | https://www.data.go.kr/data/15053238/fileData.do |
Reproduction
Analysis started | 2023-12-12 13:59:53.358969 |
---|---|
Analysis finished | 2023-12-12 13:59:56.931402 |
Duration | 3.57 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
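The reported duration is the difference between the two timestamps. A minimal sketch of that calculation, assuming both timestamps share the format shown in the table (the helper name is hypothetical):

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"


def duration_seconds(started: str, finished: str) -> float:
    """Elapsed seconds between two timestamps written in FMT."""
    delta = datetime.strptime(finished, FMT) - datetime.strptime(started, FMT)
    return delta.total_seconds()


elapsed = duration_seconds("2023-12-12 13:59:53.358969",
                           "2023-12-12 13:59:56.931402")
print(f"{elapsed:.2f} seconds")  # 3.57 seconds
```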
순번
Real number (ℝ)
UNIQUE
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9412.8339 |
Minimum | 1 |
---|---|
Maximum | 18850 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 913.85 |
Q1 | 4690.5 |
median | 9507 |
Q3 | 14065.25 |
95-th percentile | 17868.05 |
Maximum | 18850 |
Range | 18849 |
Interquartile range (IQR) | 9374.75 |
Descriptive statistics
Standard deviation | 5432.6281 |
---|---|
Coefficient of variation (CV) | 0.57715117 |
Kurtosis | -1.1953912 |
Mean | 9412.8339 |
Median Absolute Deviation (MAD) | 4686 |
Skewness | -0.011646075 |
Sum | 94128339 |
Variance | 29513448 |
Monotonicity | Not monotonic |
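Several of the statistics for 순번 are derived from others in the same tables. The sketch below recomputes them from the reported values as a consistency check. It assumes the usual definitions (CV = standard deviation / mean, IQR = Q3 − Q1, variance = standard deviation squared, sum = mean × count). Small differences are expected because the inputs are already rounded.

```python
import math

# Values reported for 순번 in the tables above.
mean, std = 9412.8339, 5432.6281
q1, q3 = 4690.5, 14065.25
minimum, maximum = 1, 18850
n_obs = 10000

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
variance = std ** 2              # variance from the standard deviation
total = mean * n_obs             # sum from the mean

print(f"CV       = {cv:.8f}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")
print(f"Variance = {variance:.0f}")
print(f"Sum      = {total:.0f}")

# Each derived value should agree with the report up to rounding.
assert math.isclose(cv, 0.57715117, rel_tol=1e-6)
assert math.isclose(variance, 29513448, rel_tol=1e-6)
```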
Common values
Value | Count | Frequency (%) |
18719 | 1 | < 0.1% |
1187 | 1 | < 0.1% |
10475 | 1 | < 0.1% |
7878 | 1 | < 0.1% |
2963 | 1 | < 0.1% |
13703 | 1 | < 0.1% |
11187 | 1 | < 0.1% |
13407 | 1 | < 0.1% |
3810 | 1 | < 0.1% |
12266 | 1 | < 0.1% |
Other values (9990) | 9990 | 99.9% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
8 | 1 | < 0.1% |
10 | 1 | < 0.1% |
12 | 1 | < 0.1% |
14 | 1 | < 0.1% |
16 | 1 | < 0.1% |
17 | 1 | < 0.1% |
18 | 1 | < 0.1% |
19 | 1 | < 0.1% |
24 | 1 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
18850 | 1 | < 0.1% |
18847 | 1 | < 0.1% |
18846 | 1 | < 0.1% |
18845 | 1 | < 0.1% |
18844 | 1 | < 0.1% |
18843 | 1 | < 0.1% |
18840 | 1 | < 0.1% |
18839 | 1 | < 0.1% |
18838 | 1 | < 0.1% |
18837 | 1 | < 0.1% |
서명
Text
Distinct | 9187 |
---|---|
Distinct (%) | 91.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 313 |
---|---|
Median length | 136 |
Mean length | 32.566 |
Min length | 1 |
Characters and Unicode
Total characters | 325660 |
---|---|
Distinct characters | 1494 |
Distinct categories | 18 |
Distinct scripts | 8 |
Distinct blocks | 16 |
Unique
Unique | 8689 |
---|---|
Unique (%) | 86.9% |
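"Distinct" and "Unique" measure different things: the figures above are consistent with distinct meaning the number of different values and unique meaning the number of values that occur exactly once. A minimal sketch of that distinction, using a toy list in which one title is repeated on purpose (the repetition is made up for illustration):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(v for v in values if v is not None)  # ignore missing entries
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Toy data: "1분 경영" appears twice, so it is distinct but not unique.
titles = ["1분 경영", "통섭의 식탁", "1분 경영", "일곱 가지 이야기"]
print(distinct_and_unique(titles))  # (3, 2)
```

With 10000 observations, 9187 distinct values give 91.9% and 8689 unique values give 86.9%, matching the tables.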
Sample
1st row | 작은 별이지만 빛나고 있어: 소윤 에세이 |
---|---|
2nd row | Me, myself, and why : searching for the science of self |
3rd row | 1분 경영 |
4th row | Race Tech's motorcycle suspension bible |
5th row | 잠중록: 처처칭한 장편소설. 4 |
Value | Count | Frequency (%) |
(blank) | 4256 | 6.2% |
the | 1973 | 2.9% |
of | 1340 | 1.9% |
and | 987 | 1.4% |
a | 571 | 0.8% |
이야기 | 476 | 0.7% |
in | 378 | 0.5% |
to | 362 | 0.5% |
science | 357 | 0.5% |
위한 | 265 | 0.4% |
Other values (21473) | 58224 |
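A word-frequency table of this kind can be approximated as follows. This is a sketch under stated assumptions, not the report's actual tokenizer: it lowercases each title, splits on whitespace, strips ASCII punctuation from both ends of each token, and drops tokens that become empty. The three titles are taken from the sample rows in this report.

```python
import string
from collections import Counter


def word_counts(texts):
    """Count lowercased, punctuation-trimmed, whitespace-separated tokens."""
    counts = Counter()
    for text in texts:
        for raw in text.lower().split():
            token = raw.strip(string.punctuation)  # trims only the token's ends
            if token:                              # skip punctuation-only tokens
                counts[token] += 1
    return counts


titles = [
    "Me, myself, and why : searching for the science of self",
    "Race Tech's motorcycle suspension bible",
    "The moral arc : how science makes us better people",
]
counts = word_counts(titles)
print(counts["science"], counts["the"])  # 2 2
```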
Most occurring characters
Value | Count | Frequency (%) |
(space) | 59200 | 18.2% |
e | 14070 | 4.3% |
o | 10109 | 3.1% |
i | 9908 | 3.0% |
n | 9799 | 3.0% |
t | 9622 | 3.0% |
a | 9275 | 2.8% |
r | 7569 | 2.3% |
s | 7511 | 2.3% |
: | 5474 | 1.7% |
Other values (1484) | 183123 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 122129 | 37.5% |
Lowercase Letter | 119330 | 36.6% |
Space Separator | 59200 | 18.2% |
Other Punctuation | 10070 | 3.1% |
Uppercase Letter | 6367 | 2.0% |
Decimal Number | 3985 | 1.2% |
Close Punctuation | 1603 | 0.5% |
Open Punctuation | 1603 | 0.5% |
Math Symbol | 835 | 0.3% |
Dash Punctuation | 397 | 0.1% |
Other values (8) | 141 | < 0.1% |
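The category names in this table are Unicode general categories. A minimal sketch of how such counts can be produced with Python's standard `unicodedata` module, which returns two-letter codes (`Lo`, `Ll`, `Zs`, ...) rather than the long names used in the report. The `LONG_NAMES` mapping is a partial, illustrative lookup, and the input is one title from the sample rows.

```python
import unicodedata
from collections import Counter

# Partial mapping from two-letter codes to the long names used in the report.
LONG_NAMES = {
    "Lo": "Other Letter",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
}


def category_counts(texts):
    """Count characters by Unicode general category across all texts."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)


for code, n in category_counts(["1분 경영"]).items():
    print(LONG_NAMES.get(code, code), n)
# Decimal Number 1
# Other Letter 3
# Space Separator 1
```

Hangul syllables fall under `Lo` (Other Letter), which is why that category leads the table for a mostly Korean title column.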
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 4629 | 3.8% |
학 | 3132 | 2.6% |
이 | 3076 | 2.5% |
는 | 2873 | 2.4% |
과 | 2361 | 1.9% |
기 | 2161 | 1.8% |
가 | 1801 | 1.5% |
한 | 1730 | 1.4% |
지 | 1728 | 1.4% |
리 | 1688 | 1.4% |
Other values (1364) | 96950 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 14070 | |
o | 10109 | 8.5% |
i | 9908 | 8.3% |
n | 9799 | 8.2% |
t | 9622 | 8.1% |
a | 9275 | 7.8% |
r | 7569 | 6.3% |
s | 7511 | 6.3% |
h | 5325 | 4.5% |
c | 5009 | 4.2% |
Other values (18) | 31133 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 969 | |
S | 629 | 9.9% |
A | 604 | 9.5% |
D | 353 | 5.5% |
E | 314 | 4.9% |
C | 311 | 4.9% |
M | 307 | 4.8% |
P | 296 | 4.6% |
I | 290 | 4.6% |
B | 269 | 4.2% |
Other values (16) | 2025 |
Other Punctuation
Value | Count | Frequency (%) |
: | 5474 | |
, | 2004 | 19.9% |
. | 834 | 8.3% |
? | 403 | 4.0% |
' | 357 | 3.5% |
/ | 313 | 3.1% |
! | 274 | 2.7% |
· | 211 | 2.1% |
& | 57 | 0.6% |
& | 57 | 0.6% |
Other values (8) | 86 | 0.9% |
Decimal Number
Value | Count | Frequency (%) |
1 | 998 | |
0 | 850 | |
2 | 730 | |
3 | 327 | 8.2% |
5 | 269 | 6.8% |
4 | 245 | 6.1% |
9 | 143 | 3.6% |
8 | 142 | 3.6% |
7 | 141 | 3.5% |
6 | 140 | 3.5% |
Math Symbol
Value | Count | Frequency (%) |
= | 745 | |
+ | 27 | 3.2% |
~ | 25 | 3.0% |
< | 12 | 1.4% |
> | 12 | 1.4% |
| | 7 | 0.8% |
× | 3 | 0.4% |
+ | 2 | 0.2% |
| | 1 | 0.1% |
÷ | 1 | 0.1% |
Other Symbol
Value | Count | Frequency (%) |
│ | 77 | |
ⓔ | 9 | 9.7% |
℃ | 2 | 2.2% |
┃ | 2 | 2.2% |
® | 2 | 2.2% |
★ | 1 | 1.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1575 | |
] | 21 | 1.3% |
』 | 5 | 0.3% |
》 | 2 | 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1575 | |
[ | 21 | 1.3% |
『 | 5 | 0.3% |
《 | 2 | 0.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 13 | |
Ⅱ | 9 | |
Ⅲ | 5 | 17.2% |
Ⅳ | 2 | 6.9% |
Modifier Symbol
Value | Count | Frequency (%) |
` | 7 | |
˚ | 2 | 22.2% |
Other Number
Value | Count | Frequency (%) |
² | 3 | |
₂ | 1 | 25.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 59200 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 397 |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 2 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 2 |
Nonspacing Mark
Value | Count | Frequency (%) |
́ | 1 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 125723 | 38.6% |
Hangul | 121651 | 37.4% |
Common | 77804 | 23.9% |
Han | 453 | 0.1% |
Katakana | 19 | < 0.1% |
Hiragana | 6 | < 0.1% |
Greek | 3 | < 0.1% |
Inherited | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
의 | 4629 | 3.8% |
학 | 3132 | 2.6% |
이 | 3076 | 2.5% |
는 | 2873 | 2.4% |
과 | 2361 | 1.9% |
기 | 2161 | 1.8% |
가 | 1801 | 1.5% |
한 | 1730 | 1.4% |
지 | 1728 | 1.4% |
리 | 1688 | 1.4% |
Other values (1151) | 96472 |
Han
Value | Count | Frequency (%) |
食 | 20 | 4.4% |
客 | 19 | 4.2% |
大 | 13 | 2.9% |
說 | 12 | 2.6% |
國 | 12 | 2.6% |
來 | 11 | 2.4% |
趙 | 11 | 2.4% |
河 | 11 | 2.4% |
小 | 11 | 2.4% |
廷 | 11 | 2.4% |
Other values (184) | 322 |
Common
Value | Count | Frequency (%) |
(space) | 59200 | 76.1% |
: | 5474 | 7.0% |
, | 2004 | 2.6% |
) | 1575 | 2.0% |
( | 1575 | 2.0% |
1 | 998 | 1.3% |
0 | 850 | 1.1% |
. | 834 | 1.1% |
= | 745 | 1.0% |
2 | 730 | 0.9% |
Other values (51) | 3819 | 4.9% |
Latin
Value | Count | Frequency (%) |
e | 14070 | 11.2% |
o | 10109 | 8.0% |
i | 9908 | 7.9% |
n | 9799 | 7.8% |
t | 9622 | 7.7% |
a | 9275 | 7.4% |
r | 7569 | 6.0% |
s | 7511 | 6.0% |
h | 5325 | 4.2% |
c | 5009 | 4.0% |
Other values (47) | 37526 |
Katakana
Value | Count | Frequency (%) |
ト | 3 | |
ッ | 2 | |
ン | 2 | |
ス | 2 | |
プ | 1 | 5.3% |
ワ | 1 | 5.3% |
チ | 1 | 5.3% |
キ | 1 | 5.3% |
リ | 1 | 5.3% |
エ | 1 | 5.3% |
Other values (4) | 4 |
Hiragana
Value | Count | Frequency (%) |
で | 2 | |
を | 1 | |
の | 1 | |
い | 1 | |
し | 1 |
Greek
Value | Count | Frequency (%) |
π | 3 |
Inherited
Value | Count | Frequency (%) |
́ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 203074 | 62.4% |
Hangul | 121604 | 37.3% |
CJK | 441 | 0.1% |
None | 326 | 0.1% |
Box Drawing | 79 | < 0.1% |
Compat Jamo | 47 | < 0.1% |
Number Forms | 29 | < 0.1% |
Katakana | 19 | < 0.1% |
CJK Compat Ideographs | 12 | < 0.1% |
Enclosed Alphanum | 9 | < 0.1% |
Other values (6) | 20 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 59200 | 29.2% |
e | 14070 | 6.9% |
o | 10109 | 5.0% |
i | 9908 | 4.9% |
n | 9799 | 4.8% |
t | 9622 | 4.7% |
a | 9275 | 4.6% |
r | 7569 | 3.7% |
s | 7511 | 3.7% |
: | 5474 | 2.7% |
Other values (78) | 60537 |
Hangul
Value | Count | Frequency (%) |
의 | 4629 | 3.8% |
학 | 3132 | 2.6% |
이 | 3076 | 2.5% |
는 | 2873 | 2.4% |
과 | 2361 | 1.9% |
기 | 2161 | 1.8% |
가 | 1801 | 1.5% |
한 | 1730 | 1.4% |
지 | 1728 | 1.4% |
리 | 1688 | 1.4% |
Other values (1145) | 96425 |
None
Value | Count | Frequency (%) |
· | 211 | |
& | 57 | 17.5% |
% | 11 | 3.4% |
| | 7 | 2.1% |
? | 7 | 2.1% |
』 | 5 | 1.5% |
『 | 5 | 1.5% |
× | 3 | 0.9% |
² | 3 | 0.9% |
π | 3 | 0.9% |
Other values (8) | 14 | 4.3% |
Box Drawing
Value | Count | Frequency (%) |
│ | 77 | |
┃ | 2 | 2.5% |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 29 | |
ㅅ | 7 | 14.9% |
ㄱ | 6 | 12.8% |
ㅈ | 2 | 4.3% |
ㅎ | 2 | 4.3% |
ㅇ | 1 | 2.1% |
CJK
Value | Count | Frequency (%) |
食 | 20 | 4.5% |
客 | 19 | 4.3% |
大 | 13 | 2.9% |
說 | 12 | 2.7% |
國 | 12 | 2.7% |
來 | 11 | 2.5% |
趙 | 11 | 2.5% |
河 | 11 | 2.5% |
小 | 11 | 2.5% |
廷 | 11 | 2.5% |
Other values (177) | 310 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 13 | |
Ⅱ | 9 | |
Ⅲ | 5 | 17.2% |
Ⅳ | 2 | 6.9% |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓔ | 9 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
理 | 5 | |
不 | 2 | 16.7% |
沈 | 1 | 8.3% |
臨 | 1 | 8.3% |
論 | 1 | 8.3% |
若 | 1 | 8.3% |
金 | 1 | 8.3% |
Punctuation
Value | Count | Frequency (%) |
… | 4 | |
‘ | 2 | |
’ | 2 |
Katakana
Value | Count | Frequency (%) |
ト | 3 | |
ッ | 2 | |
ン | 2 | |
ス | 2 | |
プ | 1 | 5.3% |
ワ | 1 | 5.3% |
チ | 1 | 5.3% |
キ | 1 | 5.3% |
リ | 1 | 5.3% |
エ | 1 | 5.3% |
Other values (4) | 4 |
Letterlike Symbols
Value | Count | Frequency (%) |
℃ | 2 |
Hiragana
Value | Count | Frequency (%) |
で | 2 | |
を | 1 | |
の | 1 | |
い | 1 | |
し | 1 |
Modifier Letters
Value | Count | Frequency (%) |
˚ | 2 |
Diacriticals
Value | Count | Frequency (%) |
́ | 1 |
Misc Symbols
Value | Count | Frequency (%) |
★ | 1 |
저자
Text
Distinct | 6657 |
---|---|
Distinct (%) | 67.0% |
Missing | 64 |
Missing (%) | 0.6% |
Memory size | 156.2 KiB |
Length
Max length | 55 |
---|---|
Median length | 36 |
Mean length | 8.8144122 |
Min length | 2 |
Characters and Unicode
Total characters | 87580 |
---|---|
Distinct characters | 717 |
Distinct categories | 15 |
Distinct scripts | 6 |
Distinct blocks | 10 |
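The mean length reported for a text column can be cross-checked against its total character count. The sketch below assumes the mean is taken over non-missing rows only, an assumption the reported numbers support: 87580 characters over 10000 − 64 rows for 저자, and 325660 characters over 10000 rows for 서명.

```python
def mean_length(total_chars: int, n_rows: int, n_missing: int = 0) -> float:
    """Average string length over the rows that actually hold a value."""
    return total_chars / (n_rows - n_missing)


print(round(mean_length(87580, 10000, 64), 7))  # 저자: 8.8144122
print(mean_length(325660, 10000))               # 서명: 32.566
```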
Unique
Unique | 5247 |
---|---|
Unique (%) | 52.8% |
Sample
1st row | 소윤 |
---|---|
2nd row | Ouellette, Jennifer |
3rd row | Blanchard, Ken |
4th row | Parks, Lee |
5th row | 서미영 |
Value | Count | Frequency (%) |
j | 161 | 1.0% |
동아사이언스 | 129 | 0.8% |
m | 126 | 0.8% |
john | 121 | 0.7% |
david | 115 | 0.7% |
a | 115 | 0.7% |
richard | 111 | 0.7% |
michael | 107 | 0.6% |
r | 95 | 0.6% |
정완상 | 93 | 0.6% |
Other values (7798) | 15347 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 6586 | 7.5% |
e | 5519 | 6.3% |
a | 5378 | 6.1% |
, | 4699 | 5.4% |
n | 4279 | 4.9% |
r | 4179 | 4.8% |
i | 3652 | 4.2% |
o | 3078 | 3.5% |
l | 2859 | 3.3% |
t | 2086 | 2.4% |
Other values (707) | 45265 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 44998 | 51.4% |
Other Letter | 20424 | 23.3% |
Uppercase Letter | 10436 | 11.9% |
Space Separator | 6586 | 7.5% |
Other Punctuation | 4971 | 5.7% |
Dash Punctuation | 125 | 0.1% |
Decimal Number | 11 | < 0.1% |
Other Symbol | 7 | < 0.1% |
Open Punctuation | 6 | < 0.1% |
Close Punctuation | 6 | < 0.1% |
Other values (5) | 10 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 1011 | 5.0% |
김 | 792 | 3.9% |
정 | 554 | 2.7% |
스 | 416 | 2.0% |
영 | 405 | 2.0% |
사 | 330 | 1.6% |
동 | 310 | 1.5% |
아 | 302 | 1.5% |
박 | 294 | 1.4% |
성 | 248 | 1.2% |
Other values (600) | 15762 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 5519 | |
a | 5378 | |
n | 4279 | |
r | 4179 | |
i | 3652 | 8.1% |
o | 3078 | 6.8% |
l | 2859 | 6.4% |
t | 2086 | 4.6% |
s | 2069 | 4.6% |
h | 1886 | 4.2% |
Other values (35) | 10013 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 940 | 9.0% |
M | 928 | 8.9% |
J | 810 | 7.8% |
B | 689 | 6.6% |
R | 684 | 6.6% |
D | 659 | 6.3% |
C | 637 | 6.1% |
A | 582 | 5.6% |
H | 520 | 5.0% |
P | 508 | 4.9% |
Other values (22) | 3479 |
Decimal Number
Value | Count | Frequency (%) |
2 | 3 | |
0 | 2 | |
1 | 2 | |
5 | 1 | 9.1% |
9 | 1 | 9.1% |
8 | 1 | 9.1% |
3 | 1 | 9.1% |
Other Symbol
Value | Count | Frequency (%) |
┭ | 2 | |
╂ | 1 | |
┷ | 1 | |
╈ | 1 | |
┾ | 1 | |
┿ | 1 |
Other Punctuation
Value | Count | Frequency (%) |
, | 4699 | |
. | 241 | 4.8% |
' | 21 | 0.4% |
? | 10 | 0.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 5 | |
『 | 1 | 16.7% |
Close Punctuation
Value | Count | Frequency (%) |
) | 5 | |
』 | 1 | 16.7% |
Math Symbol
Value | Count | Frequency (%) |
< | 2 | |
> | 2 |
Modifier Symbol
Value | Count | Frequency (%) |
´ | 1 | |
¨ | 1 |
Space Separator
Value | Count | Frequency (%) |
(space) | 6586 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 125 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 2 |
Nonspacing Mark
Value | Count | Frequency (%) |
̈ | 1 |
Letter Number
Value | Count | Frequency (%) |
Ⅲ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 55392 | 63.2% |
Hangul | 20398 | 23.3% |
Common | 11720 | 13.4% |
Cyrillic | 43 | < 0.1% |
Han | 26 | < 0.1% |
Inherited | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 1011 | 5.0% |
김 | 792 | 3.9% |
정 | 554 | 2.7% |
스 | 416 | 2.0% |
영 | 405 | 2.0% |
사 | 330 | 1.6% |
동 | 310 | 1.5% |
아 | 302 | 1.5% |
박 | 294 | 1.4% |
성 | 248 | 1.2% |
Other values (575) | 15736 |
Latin
Value | Count | Frequency (%) |
e | 5519 | 10.0% |
a | 5378 | 9.7% |
n | 4279 | 7.7% |
r | 4179 | 7.5% |
i | 3652 | 6.6% |
o | 3078 | 5.6% |
l | 2859 | 5.2% |
t | 2086 | 3.8% |
s | 2069 | 3.7% |
h | 1886 | 3.4% |
Other values (47) | 20407 |
Common
Value | Count | Frequency (%) |
(space) | 6586 | 56.2% |
, | 4699 | 40.1% |
. | 241 | 2.1% |
- | 125 | 1.1% |
' | 21 | 0.2% |
? | 10 | 0.1% |
( | 5 | < 0.1% |
) | 5 | < 0.1% |
2 | 3 | < 0.1% |
’ | 2 | < 0.1% |
Other values (18) | 23 | 0.2% |
Han
Value | Count | Frequency (%) |
見 | 2 | 7.7% |
隆 | 1 | 3.8% |
藤 | 1 | 3.8% |
村 | 1 | 3.8% |
上 | 1 | 3.8% |
春 | 1 | 3.8% |
樹 | 1 | 3.8% |
本 | 1 | 3.8% |
宏 | 1 | 3.8% |
志 | 1 | 3.8% |
Other values (15) | 15 |
Cyrillic
Value | Count | Frequency (%) |
и | 6 | |
о | 5 | |
р | 3 | 7.0% |
д | 3 | 7.0% |
в | 3 | 7.0% |
а | 3 | 7.0% |
к | 2 | 4.7% |
с | 2 | 4.7% |
В | 2 | 4.7% |
л | 2 | 4.7% |
Other values (11) | 12 |
Inherited
Value | Count | Frequency (%) |
̈ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 67091 | 76.6% |
Hangul | 20398 | 23.3% |
Cyrillic | 43 | < 0.1% |
CJK | 25 | < 0.1% |
None | 11 | < 0.1% |
Box Drawing | 7 | < 0.1% |
Punctuation | 2 | < 0.1% |
Diacriticals | 1 | < 0.1% |
Number Forms | 1 | < 0.1% |
CJK Compat Ideographs | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 6586 | 9.8% |
e | 5519 | 8.2% |
a | 5378 | 8.0% |
, | 4699 | 7.0% |
n | 4279 | 6.4% |
r | 4179 | 6.2% |
i | 3652 | 5.4% |
o | 3078 | 4.6% |
l | 2859 | 4.3% |
t | 2086 | 3.1% |
Other values (59) | 24776 |
Hangul
Value | Count | Frequency (%) |
이 | 1011 | 5.0% |
김 | 792 | 3.9% |
정 | 554 | 2.7% |
스 | 416 | 2.0% |
영 | 405 | 2.0% |
사 | 330 | 1.6% |
동 | 310 | 1.5% |
아 | 302 | 1.5% |
박 | 294 | 1.4% |
성 | 248 | 1.2% |
Other values (575) | 15736 |
Cyrillic
Value | Count | Frequency (%) |
и | 6 | |
о | 5 | |
р | 3 | 7.0% |
д | 3 | 7.0% |
в | 3 | 7.0% |
а | 3 | 7.0% |
к | 2 | 4.7% |
с | 2 | 4.7% |
В | 2 | 4.7% |
л | 2 | 4.7% |
Other values (11) | 12 |
Punctuation
Value | Count | Frequency (%) |
’ | 2 |
Box Drawing
Value | Count | Frequency (%) |
┭ | 2 | |
╂ | 1 | |
┷ | 1 | |
╈ | 1 | |
┾ | 1 | |
┿ | 1 |
None
Value | Count | Frequency (%) |
ł | 2 | |
ø | 2 | |
ü | 2 | |
『 | 1 | |
』 | 1 | |
´ | 1 | |
Ø | 1 | |
¨ | 1 |
CJK
Value | Count | Frequency (%) |
見 | 2 | 8.0% |
隆 | 1 | 4.0% |
藤 | 1 | 4.0% |
村 | 1 | 4.0% |
上 | 1 | 4.0% |
春 | 1 | 4.0% |
樹 | 1 | 4.0% |
本 | 1 | 4.0% |
宏 | 1 | 4.0% |
志 | 1 | 4.0% |
Other values (14) | 14 |
Diacriticals
Value | Count | Frequency (%) |
̈ | 1 |
Number Forms
Value | Count | Frequency (%) |
Ⅲ | 1 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
立 | 1 |
출판사
Text
Distinct | 2525 |
---|---|
Distinct (%) | 25.3% |
Missing | 14 |
Missing (%) | 0.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
press | 657 | 4.8% |
university | 452 | 3.3% |
books | 393 | 2.9% |
동아사이언스 | 194 | 1.4% |
자음과모음 | 193 | 1.4% |
사이언스북스 | 190 | 1.4% |
oxford | 163 | 1.2% |
김영사 | 145 | 1.1% |
of | 143 | 1.0% |
& | 114 | 0.8% |
Other values (2547) | 11052 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 3711 | 5.0% |
e | 3446 | 4.7% |
r | 3302 | 4.5% |
i | 3245 | 4.4% |
s | 3239 | 4.4% |
o | 2819 | 3.8% |
n | 2736 | 3.7% |
a | 1966 | 2.7% |
스 | 1806 | 2.4% |
사 | 1766 | 2.4% |
Other values (701) | 45998 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 32075 | 43.3% |
Other Letter | 30787 | 41.6% |
Uppercase Letter | 6465 | 8.7% |
Space Separator | 3711 | 5.0% |
Other Punctuation | 653 | 0.9% |
Decimal Number | 190 | 0.3% |
Dash Punctuation | 70 | 0.1% |
Open Punctuation | 41 | 0.1% |
Close Punctuation | 41 | 0.1% |
Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
스 | 1806 | 5.9% |
사 | 1766 | 5.7% |
이 | 1213 | 3.9% |
북 | 954 | 3.1% |
아 | 904 | 2.9% |
문 | 575 | 1.9% |
음 | 552 | 1.8% |
동 | 515 | 1.7% |
리 | 496 | 1.6% |
언 | 487 | 1.6% |
Other values (621) | 21519 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 3446 | |
r | 3302 | |
i | 3245 | |
s | 3239 | |
o | 2819 | 8.8% |
n | 2736 | 8.5% |
a | 1966 | 6.1% |
t | 1556 | 4.9% |
l | 1257 | 3.9% |
c | 902 | 2.8% |
Other values (16) | 7607 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 1236 | |
B | 798 | |
C | 490 | 7.6% |
U | 469 | 7.3% |
S | 462 | 7.1% |
W | 352 | 5.4% |
H | 337 | 5.2% |
M | 267 | 4.1% |
A | 250 | 3.9% |
O | 231 | 3.6% |
Other values (16) | 1573 |
Other Punctuation
Value | Count | Frequency (%) |
. | 285 | |
& | 124 | |
, | 109 | 16.7% |
& | 55 | 8.4% |
' | 34 | 5.2% |
/ | 30 | 4.6% |
? | 5 | 0.8% |
# | 4 | 0.6% |
· | 3 | 0.5% |
: | 2 | 0.3% |
Other values (2) | 2 | 0.3% |
Decimal Number
Value | Count | Frequency (%) |
2 | 72 | |
1 | 64 | |
0 | 18 | 9.5% |
8 | 14 | 7.4% |
3 | 9 | 4.7% |
5 | 5 | 2.6% |
4 | 4 | 2.1% |
9 | 3 | 1.6% |
7 | 1 | 0.5% |
Open Punctuation
Value | Count | Frequency (%) |
( | 22 | |
[ | 19 |
Close Punctuation
Value | Count | Frequency (%) |
) | 22 | |
] | 19 |
Space Separator
Value | Count | Frequency (%) |
(space) | 3711 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 70 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 38540 | 52.1% |
Hangul | 30537 | 41.2% |
Common | 4707 | 6.4% |
Han | 250 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
스 | 1806 | 5.9% |
사 | 1766 | 5.8% |
이 | 1213 | 4.0% |
북 | 954 | 3.1% |
아 | 904 | 3.0% |
문 | 575 | 1.9% |
음 | 552 | 1.8% |
동 | 515 | 1.7% |
리 | 496 | 1.6% |
언 | 487 | 1.6% |
Other values (554) | 21269 |
Han
Value | Count | Frequency (%) |
社 | 40 | 16.0% |
文 | 26 | 10.4% |
出 | 10 | 4.0% |
版 | 10 | 4.0% |
大 | 8 | 3.2% |
明 | 8 | 3.2% |
敎 | 7 | 2.8% |
學 | 6 | 2.4% |
法 | 6 | 2.4% |
京 | 6 | 2.4% |
Other values (57) | 123 |
Latin
Value | Count | Frequency (%) |
e | 3446 | 8.9% |
r | 3302 | 8.6% |
i | 3245 | 8.4% |
s | 3239 | 8.4% |
o | 2819 | 7.3% |
n | 2736 | 7.1% |
a | 1966 | 5.1% |
t | 1556 | 4.0% |
l | 1257 | 3.3% |
P | 1236 | 3.2% |
Other values (42) | 13738 |
Common
Value | Count | Frequency (%) |
(space) | 3711 | 78.8% |
. | 285 | 6.1% |
& | 124 | 2.6% |
, | 109 | 2.3% |
2 | 72 | 1.5% |
- | 70 | 1.5% |
1 | 64 | 1.4% |
& | 55 | 1.2% |
' | 34 | 0.7% |
/ | 30 | 0.6% |
Other values (18) | 153 | 3.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 43120 | 58.2% |
Hangul | 30537 | 41.2% |
CJK | 250 | 0.3% |
None | 127 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 3711 | 8.6% |
e | 3446 | 8.0% |
r | 3302 | 7.7% |
i | 3245 | 7.5% |
s | 3239 | 7.5% |
o | 2819 | 6.5% |
n | 2736 | 6.3% |
a | 1966 | 4.6% |
t | 1556 | 3.6% |
l | 1257 | 2.9% |
Other values (68) | 15843 |
Hangul
Value | Count | Frequency (%) |
스 | 1806 | 5.9% |
사 | 1766 | 5.8% |
이 | 1213 | 4.0% |
북 | 954 | 3.1% |
아 | 904 | 3.0% |
문 | 575 | 1.9% |
음 | 552 | 1.8% |
동 | 515 | 1.7% |
리 | 496 | 1.6% |
언 | 487 | 1.6% |
Other values (554) | 21269 |
None
Value | Count | Frequency (%) |
& | 124 | |
· | 3 | 2.4% |
CJK
Value | Count | Frequency (%) |
社 | 40 | 16.0% |
文 | 26 | 10.4% |
出 | 10 | 4.0% |
版 | 10 | 4.0% |
大 | 8 | 3.2% |
明 | 8 | 3.2% |
敎 | 7 | 2.8% |
學 | 6 | 2.4% |
法 | 6 | 2.4% |
京 | 6 | 2.4% |
Other values (57) | 123 |
출판년
Real number (ℝ)
MISSING
Distinct | 62 |
---|---|
Distinct (%) | 0.6% |
Missing | 272 |
Missing (%) | 2.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2013.2685 |
Minimum | 1952 |
---|---|
Maximum | 2023 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1952 |
---|---|
5-th percentile | 2000 |
Q1 | 2011 |
median | 2015 |
Q3 | 2018 |
95-th percentile | 2021 |
Maximum | 2023 |
Range | 71 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 6.9672579 |
---|---|
Coefficient of variation (CV) | 0.00346067 |
Kurtosis | 8.3113101 |
Mean | 2013.2685 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -2.1341914 |
Sum | 19585076 |
Variance | 48.542682 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
2016 | 1033 | 10.3% |
2018 | 890 | 8.9% |
2017 | 830 | 8.3% |
2019 | 771 | 7.7% |
2015 | 758 | 7.6% |
2014 | 616 | 6.2% |
2013 | 593 | 5.9% |
2012 | 516 | 5.2% |
2020 | 469 | 4.7% |
2011 | 363 | 3.6% |
Other values (52) | 2889 | 28.9% |
Minimum 10 values
Value | Count | Frequency (%) |
1952 | 2 | < 0.1% |
1954 | 1 | < 0.1% |
1957 | 1 | < 0.1% |
1963 | 3 | < 0.1% |
1964 | 1 | < 0.1% |
1965 | 1 | < 0.1% |
1967 | 2 | < 0.1% |
1968 | 3 | < 0.1% |
1969 | 3 | < 0.1% |
1970 | 3 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
2023 | 222 | 2.2% |
2022 | 166 | 1.7% |
2021 | 145 | 1.5% |
2020 | 469 | 4.7% |
2019 | 771 | 7.7% |
2018 | 890 | 8.9% |
2017 | 830 | 8.3% |
2016 | 1033 | 10.3% |
2015 | 758 | 7.6% |
2014 | 616 | 6.2% |
매체
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 인쇄 |
---|---|
2nd row | 인쇄 |
3rd row | 인쇄 |
4th row | 인쇄 |
5th row | 인쇄 |
Common Values
Value | Count | Frequency (%) |
인쇄 | 10000 | 100.0% |
Variable | 순번 | 출판년 |
---|---|---|
순번 | 1.000 | 0.580 |
출판년 | 0.580 | 1.000 |
Variable | 순번 | 출판년 |
---|---|---|
순번 | 1.000 | 0.497 |
출판년 | 0.497 | 1.000 |
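The extracted text does not say which correlation coefficient each matrix reports. Purely for illustration, the sketch below shows two common choices for a pair of numeric columns: Pearson (linear) and Spearman (rank-based, computed here as Pearson on average ranks). The function names and the toy data are assumptions and are not taken from the report's configuration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman correlation: Pearson applied to average ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


# Toy data: a monotone but non-linear relation between an ID and a year.
# Rows with a missing year would need to be dropped before calling these.
ids = [1, 2, 3, 4, 5]
years = [1969, 2004, 2011, 2015, 2016]
print(round(spearman(ids, years), 3), round(pearson(ids, years), 3))
```

On monotone data like this the rank-based coefficient reaches 1 while the linear one stays below it, which is one reason two coefficients for the same pair of columns can differ.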
Index | 순번 | 서명 | 저자 | 출판사 | 출판년 | 매체 |
---|---|---|---|---|---|---|
18718 | 18719 | 작은 별이지만 빛나고 있어: 소윤 에세이 | 소윤 | 북로망스 | 2021 | 인쇄 |
12290 | 12291 | Me, myself, and why : searching for the science of self | Ouellette, Jennifer | Penguin Books | 2014 | 인쇄 |
4837 | 4838 | 1분 경영 | Blanchard, Ken | 21세기북스 | 2016 | 인쇄 |
13375 | 13376 | Race Tech's motorcycle suspension bible | Parks, Lee | Motorbooks | 2010 | 인쇄 |
15493 | 15494 | 잠중록: 처처칭한 장편소설. 4 | 서미영 | 아르테 | 2019 | 인쇄 |
5046 | 5047 | 두 글자 : 일상과 운동을 엿보다 | 이학준 | 시간의물레 | 2016 | 인쇄 |
4169 | 4170 | 아프리카, 중국의 두 번째 대륙 : 100만 이주자의 아프리카 새 왕국 건설기 | French, Howard W | 지식의날개 | 2015 | 인쇄 |
4343 | 4344 | 스타트업 바이블 : 세계 최초로 공개되는 24단계 MIT 창업 프로그램 | Aulet, Bill | 비즈니스북스 | 2015 | 인쇄 |
5475 | 5476 | (뇌과학으로 읽는) 트라우마와 통증 : 우리 몸의 생존법 | Haines, Steve | 푸른지식 | 2016 | 인쇄 |
15781 | 15782 | 공부, 이래도 안되면 포기하세요: 무조건 합격을 부르는 최강의 멘탈 솔루션 | 이지훈 | 위즈덤하우스 | 2020 | 인쇄 |
Index | 순번 | 서명 | 저자 | 출판사 | 출판년 | 매체 |
---|---|---|---|---|---|---|
12563 | 12564 | The exact sciences in antiquity | Neugebauer, O | Dover Publications | 1969 | 인쇄 |
722 | 723 | High dimensional probability III | Hoffmann-Jørgensen, J | Birkhauser | 2004 | 인쇄 |
17366 | 17367 | 하룻밤에 읽는 경제학 | Montousse, Marc | 랜덤하우스코리아 | 2011 | 인쇄 |
17368 | 17369 | 통섭의 식탁 | 최재천 | 움직이는서재 | 2015 | 인쇄 |
12433 | 12434 | Testosterone rex : myths of sex, science, and society | Fine, Cordelia | W.W. Norton & Company | 2018 | 인쇄 |
9702 | 9703 | 과학공화국 화학법정. 6, 신기한 금속 | 정완상 | 자음과모음 | 2016 | 인쇄 |
12875 | 12876 | Scientific practice : theories and stories of doing physics | Buchwald, Jed Z | The University of Chicago Press | 1995 | 인쇄 |
14096 | 14097 | The moral arc : how science makes us better people | Shermer, Michael | St. Martin's Griffin | 2016 | 인쇄 |
7585 | 7586 | 일상적이지만 절대적인 뇌과학지식 50 | Costandi, Moheb | 반니 | 2016 | 인쇄 |
5368 | 5369 | 일곱 가지 이야기 | 가노 도모코 | 피니스아프리카에 | 2016 | 인쇄 |