Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 10000 |
Missing cells | 7250 |
Missing cells (%) | 8.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 800.8 KiB |
Average record size in memory | 82.0 B |
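The derived figures in this table follow from the raw counts. A minimal sketch of that arithmetic, using only numbers copied from the table above (the variable names are illustrative, not part of the report):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 9
n_observations = 10_000
missing_cells = 7_250
total_size_kib = 800.8

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# 1 KiB = 1024 bytes; spread the total size over all records.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```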
Variable types
Numeric | 2 |
---|---|
Text | 6 |
Categorical | 1 |
Dataset
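Description | List of books held by the library of the Korea Research Institute of Bioscience and Biotechnology (한국생명공학연구원) |
---|---|
Author | Korea Research Institute of Bioscience and Biotechnology (한국생명공학연구원) |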
URL | https://www.data.go.kr/data/3034126/fileData.do |
등록번호 is highly overall correlated with 출판년 | High correlation |
출판년 is highly overall correlated with 등록번호 | High correlation |
복본 is highly imbalanced (82.5%) | Imbalance |
저자 has 194 (1.9%) missing values | Missing |
도서 has 181 (1.8%) missing values | Missing |
권년차 has 6862 (68.6%) missing values | Missing |
등록번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 18:25:11.812118 |
---|---|
Analysis finished | 2023-12-12 18:25:15.445858 |
Duration | 3.63 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
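A report like this one could be regenerated roughly as follows. This is a sketch, not the script that produced the report: the function name, file names, title and encoding are assumptions, and only `pandas.read_csv`, `ProfileReport(df, title=...)` and `ProfileReport.to_file` are taken from the libraries themselves.

```python
def build_profile(csv_path: str, html_path: str, encoding: str = "utf-8") -> None:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so this module still loads where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="KRIBB library book list")
    report.to_file(html_path)


# Hypothetical usage. The file names are placeholders, and the CP949 encoding
# is an assumption about the source file, not something the report states:
# build_profile("kribb_books.csv", "kribb_books_report.html", encoding="cp949")
```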
등록번호 (registration number)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9921.3135 |
Minimum | 1 |
---|---|
Maximum | 18822 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 788.9 |
Q1 | 5560.5 |
median | 9975 |
Q3 | 14748.25 |
95-th percentile | 17996.15 |
Maximum | 18822 |
Range | 18821 |
Interquartile range (IQR) | 9187.75 |
Descriptive statistics
Standard deviation | 5442.2651 |
---|---|
Coefficient of variation (CV) | 0.5485428 |
Kurtosis | -1.1542962 |
Mean | 9921.3135 |
Median Absolute Deviation (MAD) | 4560.5 |
Skewness | -0.13318522 |
Sum | 99213135 |
Variance | 29618250 |
Monotonicity | Not monotonic |
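Several of the figures above can be cross-checked from the others. A minimal sketch using only values copied from the quantile and descriptive tables (variable names are illustrative):

```python
import math

# Values copied from the tables above.
n = 10_000
mean, std = 9921.3135, 5442.2651
q1, q3 = 5560.5, 14748.25
minimum, maximum = 1, 18822

value_range = maximum - minimum   # reported Range: 18821
iqr = q3 - q1                     # reported IQR: 9187.75
cv = std / mean                   # reported CV: 0.5485428
variance = std ** 2               # reported Variance: 29618250 (rounded)
total = mean * n                  # reported Sum: 99213135

print(value_range, iqr, round(cv, 7), round(variance), round(total))
```

The reported standard deviation and mean are themselves rounded, so the recomputed values agree with the report only to about seven significant digits.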
Value | Count | Frequency (%) |
2583 | 1 | < 0.1% |
8876 | 1 | < 0.1% |
691 | 1 | < 0.1% |
9725 | 1 | < 0.1% |
9390 | 1 | < 0.1% |
15350 | 1 | < 0.1% |
11495 | 1 | < 0.1% |
9883 | 1 | < 0.1% |
2952 | 1 | < 0.1% |
18498 | 1 | < 0.1% |
Other values (9990) | 9990 | 99.9% |
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
2 | 1 | < 0.1% |
4 | 1 | < 0.1% |
5 | 1 | < 0.1% |
11 | 1 | < 0.1% |
12 | 1 | < 0.1% |
13 | 1 | < 0.1% |
14 | 1 | < 0.1% |
15 | 1 | < 0.1% |
16 | 1 | < 0.1% |
Value | Count | Frequency (%) |
18822 | 1 | < 0.1% |
18821 | 1 | < 0.1% |
18820 | 1 | < 0.1% |
18819 | 1 | < 0.1% |
18818 | 1 | < 0.1% |
18816 | 1 | < 0.1% |
18814 | 1 | < 0.1% |
18813 | 1 | < 0.1% |
18812 | 1 | < 0.1% |
18811 | 1 | < 0.1% |
서명 (title)
Text
Distinct | 7714 |
---|---|
Distinct (%) | 77.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 200 |
---|---|
Median length | 160 |
Mean length | 37.6851 |
Min length | 1 |
Characters and Unicode
Total characters | 376851 |
---|---|
Distinct characters | 1972 |
Distinct categories | 17 |
Distinct scripts | 7 |
Distinct blocks | 15 |
Unique
Unique | 6683 |
---|---|
Unique (%) | 66.8% |
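"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. A minimal sketch of the distinction, assuming missing values are represented as `None` (the function name and sample titles are made up for illustration):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(v for v in values if v is not None)  # ignore missing values
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Illustrative data, not taken from the dataset.
titles = ["Genes", "Genes", "Cell biology", "부의 미래", None]
distinct, unique = distinct_and_unique(titles)
print(distinct, unique)
```

For this column, 7714 distinct and 6683 unique values out of 10000 non-missing rows give the 77.1% and 66.8% shown above.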
Sample
1st row | Bioinformatics : from nucleic acids and proteins to cell metabolism ; from nucleic acids and proteins to cell metabolism ; contributions to the conference on "Bioinformatics", October 9 to 11, 1995, B |
---|---|
2nd row | 장류의 과학과 건강기능성 |
3rd row | Ecologically based pest management : new solutions for a new century |
4th row | (앨빈 토플러)부의 미래 |
5th row | 進化の謎をゲノムで解く |
Value | Count | Frequency (%) |
6021 | 9.8% | |
of | 2150 | 3.5% |
and | 2079 | 3.4% |
the | 1247 | 2.0% |
in | 1058 | 1.7% |
a | 670 | 1.1% |
biology | 464 | 0.8% |
molecular | 436 | 0.7% |
for | 404 | 0.7% |
to | 357 | 0.6% |
Other values (14859) | 46733 |
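The word table above is consistent with lowercasing each title and splitting it on whitespace. A minimal sketch of that counting, as an assumption about the approach rather than the tool's actual tokenizer (which, for instance, also lists a blank token); the function name and sample titles are illustrative:

```python
from collections import Counter


def word_counts(texts):
    """Count lowercase, whitespace-separated tokens across all texts."""
    counter = Counter()
    for text in texts:
        counter.update(text.lower().split())
    return counter


# Illustrative data, not taken from the dataset.
sample = ["Molecular biology of the cell", "The biology of plants"]
counts = word_counts(sample)
print(counts.most_common(3))
```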
Most occurring characters
Value | Count | Frequency (%) |
(space) | 51654 | 13.7% |
e | 22698 | 6.0% |
o | 22440 | 6.0% |
i | 20207 | 5.4% |
a | 19898 | 5.3% |
n | 18785 | 5.0% |
t | 16029 | 4.3% |
r | 14617 | 3.9% |
s | 13945 | 3.7% |
c | 12833 | 3.4% |
Other values (1962) | 163745 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 229536 | 60.9% |
Other Letter | 66850 | 17.7% |
Space Separator | 51657 | 13.7% |
Uppercase Letter | 11870 | 3.1% |
Other Punctuation | 8463 | 2.2% |
Decimal Number | 4111 | 1.1% |
Math Symbol | 1076 | 0.3% |
Dash Punctuation | 1027 | 0.3% |
Open Punctuation | 988 | 0.3% |
Close Punctuation | 980 | 0.3% |
Other values (7) | 293 | 0.1% |
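The category names in this table are Unicode general categories. A minimal sketch of how such counts can be produced with Python's standard `unicodedata` module; the name mapping covers only the ten categories listed above, and the helper name and sample string are illustrative:

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general-category codes mapped to the names used in the report.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
    "Sm": "Math Symbol",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}


def category_counts(texts):
    """Count characters per Unicode general category across all texts."""
    counter = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not in the mapping.
            counter[CATEGORY_NAMES.get(code, code)] += 1
    return counter


cat_counts = category_counts(["Bio 2003: 생물"])
print(cat_counts)
```

Hangul syllables and CJK ideographs fall under "Other Letter", which is why that category ranks second in a mixed Korean, Japanese and English title column.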
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 1824 | 2.7% |
이 | 1301 | 1.9% |
학 | 1094 | 1.6% |
기 | 1059 | 1.6% |
한 | 1016 | 1.5% |
과 | 913 | 1.4% |
는 | 778 | 1.2% |
물 | 762 | 1.1% |
생 | 740 | 1.1% |
지 | 705 | 1.1% |
Other values (1832) | 56658 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 22698 | |
o | 22440 | |
i | 20207 | 8.8% |
a | 19898 | 8.7% |
n | 18785 | 8.2% |
t | 16029 | 7.0% |
r | 14617 | 6.4% |
s | 13945 | 6.1% |
c | 12833 | 5.6% |
l | 12674 | 5.5% |
Other values (18) | 55410 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 1174 | 9.9% |
P | 1030 | 8.7% |
C | 1004 | 8.5% |
M | 881 | 7.4% |
I | 846 | 7.1% |
B | 831 | 7.0% |
T | 794 | 6.7% |
S | 747 | 6.3% |
E | 550 | 4.6% |
N | 525 | 4.4% |
Other values (18) | 3488 |
Other Punctuation
Value | Count | Frequency (%) |
: | 4758 | |
, | 2077 | |
. | 564 | 6.7% |
· | 308 | 3.6% |
' | 215 | 2.5% |
/ | 212 | 2.5% |
& | 145 | 1.7% |
! | 74 | 0.9% |
, | 41 | 0.5% |
; | 33 | 0.4% |
Other values (8) | 36 | 0.4% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1145 | |
0 | 775 | |
2 | 768 | |
9 | 361 | 8.8% |
3 | 304 | 7.4% |
4 | 197 | 4.8% |
5 | 193 | 4.7% |
8 | 149 | 3.6% |
6 | 125 | 3.0% |
7 | 90 | 2.2% |
Other values (3) | 4 | 0.1% |
Math Symbol
Value | Count | Frequency (%) |
= | 999 | |
+ | 30 | 2.8% |
~ | 28 | 2.6% |
| | 6 | 0.6% |
| | 6 | 0.6% |
∼ | 3 | 0.3% |
< | 1 | 0.1% |
> | 1 | 0.1% |
~ | 1 | 0.1% |
+ | 1 | 0.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 39 | |
Ⅱ | 34 | |
Ⅲ | 29 | |
Ⅳ | 6 | 5.3% |
Ⅴ | 4 | 3.5% |
Ⅵ | 2 | 1.8% |
Open Punctuation
Value | Count | Frequency (%) |
( | 968 | |
[ | 16 | 1.6% |
『 | 2 | 0.2% |
《 | 1 | 0.1% |
「 | 1 | 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 961 | |
] | 15 | 1.5% |
』 | 2 | 0.2% |
》 | 1 | 0.1% |
」 | 1 | 0.1% |
Other Number
Value | Count | Frequency (%) |
¹ | 4 | |
³ | 3 | |
③ | 1 | 11.1% |
² | 1 | 11.1% |
Other Symbol
Value | Count | Frequency (%) |
▼ | 14 | |
™ | 4 | 21.1% |
℃ | 1 | 5.3% |
Space Separator
Value | Count | Frequency (%) |
51654 | ||
3 | < 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 969 | |
― | 58 | 5.6% |
Control
Value | Count | Frequency (%) |
125 | ||
1 | 0.8% |
Modifier Symbol
Value | Count | Frequency (%) |
˙ | 7 | |
´ | 1 | 12.5% |
Final Punctuation
Value | Count | Frequency (%) |
’ | 15 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 241515 | 64.1% |
Common | 68481 | 18.2% |
Hangul | 56510 | 15.0% |
Han | 6917 | 1.8% |
Katakana | 2094 | 0.6% |
Hiragana | 1329 | 0.4% |
Greek | 5 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
의 | 1824 | 3.2% |
이 | 1301 | 2.3% |
학 | 1094 | 1.9% |
기 | 1059 | 1.9% |
한 | 1016 | 1.8% |
과 | 913 | 1.6% |
는 | 778 | 1.4% |
물 | 762 | 1.3% |
생 | 740 | 1.3% |
지 | 705 | 1.2% |
Other values (993) | 46318 |
Han
Value | Count | Frequency (%) |
物 | 222 | 3.2% |
學 | 205 | 3.0% |
生 | 187 | 2.7% |
大 | 120 | 1.7% |
國 | 107 | 1.5% |
鑑 | 104 | 1.5% |
植 | 100 | 1.4% |
術 | 92 | 1.3% |
技 | 91 | 1.3% |
科 | 85 | 1.2% |
Other values (695) | 5604 |
Katakana
Value | Count | Frequency (%) |
イ | 168 | 8.0% |
ン | 121 | 5.8% |
ス | 107 | 5.1% |
ク | 107 | 5.1% |
オ | 91 | 4.3% |
ト | 75 | 3.6% |
バ | 74 | 3.5% |
テ | 72 | 3.4% |
ロ | 67 | 3.2% |
ル | 62 | 3.0% |
Other values (66) | 1150 |
Common
Value | Count | Frequency (%) |
51654 | ||
: | 4758 | 6.9% |
, | 2077 | 3.0% |
1 | 1145 | 1.7% |
= | 999 | 1.5% |
- | 969 | 1.4% |
( | 968 | 1.4% |
) | 961 | 1.4% |
0 | 775 | 1.1% |
2 | 768 | 1.1% |
Other values (58) | 3407 | 5.0% |
Latin
Value | Count | Frequency (%) |
e | 22698 | 9.4% |
o | 22440 | 9.3% |
i | 20207 | 8.4% |
a | 19898 | 8.2% |
n | 18785 | 7.8% |
t | 16029 | 6.6% |
r | 14617 | 6.1% |
s | 13945 | 5.8% |
c | 12833 | 5.3% |
l | 12674 | 5.2% |
Other values (50) | 67389 |
Hiragana
Value | Count | Frequency (%) |
の | 327 | |
と | 161 | 12.1% |
る | 79 | 5.9% |
か | 68 | 5.1% |
を | 54 | 4.1% |
ら | 54 | 4.1% |
で | 40 | 3.0% |
に | 38 | 2.9% |
た | 36 | 2.7% |
す | 31 | 2.3% |
Other values (48) | 441 |
Greek
Value | Count | Frequency (%) |
β | 4 | |
α | 1 | 20.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 309376 | 82.1% |
Hangul | 56402 | 15.0% |
CJK | 6831 | 1.8% |
Katakana | 2094 | 0.6% |
Hiragana | 1329 | 0.4% |
None | 408 | 0.1% |
Number Forms | 114 | < 0.1% |
Compat Jamo | 108 | < 0.1% |
CJK Compat Ideographs | 86 | < 0.1% |
Punctuation | 73 | < 0.1% |
Other values (5) | 30 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
51654 | ||
e | 22698 | 7.3% |
o | 22440 | 7.3% |
i | 20207 | 6.5% |
a | 19898 | 6.4% |
n | 18785 | 6.1% |
t | 16029 | 5.2% |
r | 14617 | 4.7% |
s | 13945 | 4.5% |
c | 12833 | 4.1% |
Other values (78) | 96270 |
Hangul
Value | Count | Frequency (%) |
의 | 1824 | 3.2% |
이 | 1301 | 2.3% |
학 | 1094 | 1.9% |
기 | 1059 | 1.9% |
한 | 1016 | 1.8% |
과 | 913 | 1.6% |
는 | 778 | 1.4% |
물 | 762 | 1.4% |
생 | 740 | 1.3% |
지 | 705 | 1.2% |
Other values (991) | 46210 |
Hiragana
Value | Count | Frequency (%) |
の | 327 | |
と | 161 | 12.1% |
る | 79 | 5.9% |
か | 68 | 5.1% |
を | 54 | 4.1% |
ら | 54 | 4.1% |
で | 40 | 3.0% |
に | 38 | 2.9% |
た | 36 | 2.7% |
す | 31 | 2.3% |
Other values (48) | 441 |
None
Value | Count | Frequency (%) |
· | 308 | |
, | 41 | 10.0% |
、 | 9 | 2.2% |
& | 6 | 1.5% |
| | 6 | 1.5% |
β | 4 | 1.0% |
¹ | 4 | 1.0% |
! | 3 | 0.7% |
3 | 0.7% | |
³ | 3 | 0.7% |
Other values (18) | 21 | 5.1% |
CJK
Value | Count | Frequency (%) |
物 | 222 | 3.2% |
學 | 205 | 3.0% |
生 | 187 | 2.7% |
大 | 120 | 1.8% |
國 | 107 | 1.6% |
鑑 | 104 | 1.5% |
植 | 100 | 1.5% |
術 | 92 | 1.3% |
技 | 91 | 1.3% |
科 | 85 | 1.2% |
Other values (675) | 5518 |
Katakana
Value | Count | Frequency (%) |
イ | 168 | 8.0% |
ン | 121 | 5.8% |
ス | 107 | 5.1% |
ク | 107 | 5.1% |
オ | 91 | 4.3% |
ト | 75 | 3.6% |
バ | 74 | 3.5% |
テ | 72 | 3.4% |
ロ | 67 | 3.2% |
ル | 62 | 3.0% |
Other values (66) | 1150 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 101 | |
ㅡ | 7 | 6.5% |
Punctuation
Value | Count | Frequency (%) |
― | 58 | |
’ | 15 | 20.5% |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 39 | |
Ⅱ | 34 | |
Ⅲ | 29 | |
Ⅳ | 6 | 5.3% |
Ⅴ | 4 | 3.5% |
Ⅵ | 2 | 1.8% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
歷 | 25 | |
年 | 25 | |
利 | 10 | 11.6% |
理 | 3 | 3.5% |
不 | 2 | 2.3% |
勞 | 2 | 2.3% |
臨 | 2 | 2.3% |
女 | 2 | 2.3% |
茶 | 2 | 2.3% |
裂 | 2 | 2.3% |
Other values (10) | 11 |
Geometric Shapes
Value | Count | Frequency (%) |
▼ | 14 |
Modifier Letters
Value | Count | Frequency (%) |
˙ | 7 |
Letterlike Symbols
Value | Count | Frequency (%) |
™ | 4 | |
℃ | 1 | 20.0% |
Math Operators
Value | Count | Frequency (%) |
∼ | 3 |
Enclosed Alphanum
Value | Count | Frequency (%) |
③ | 1 |
저자 (author)
Text
MISSING
 
Distinct | 6297 |
---|---|
Distinct (%) | 64.2% |
Missing | 194 |
Missing (%) | 1.9% |
Memory size | 156.2 KiB |
Length
Max length | 100 |
---|---|
Median length | 89 |
Mean length | 11.866204 |
Min length | 2 |
Characters and Unicode
Total characters | 116360 |
---|---|
Distinct characters | 1149 |
Distinct categories | 13 |
Distinct scripts | 6 |
Distinct blocks | 10 |
Unique
Unique | 4889 |
---|---|
Unique (%) | 49.9% |
Sample
1st row | Schomburg, Dietmar |
---|---|
2nd row | 박건영 |
3rd row | National Research Council (U.S.) |
4th row | 토플러, 앨빈 |
5th row | 長谷部, 光泰 |
Value | Count | Frequency (%) |
j | 438 | 2.1% |
m | 333 | 1.6% |
a | 314 | 1.5% |
r | 268 | 1.3% |
l | 212 | 1.0% |
e | 202 | 1.0% |
d | 196 | 0.9% |
h | 177 | 0.8% |
david | 171 | 0.8% |
w | 169 | 0.8% |
Other values (7747) | 18706 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 11387 | 9.8% |
e | 7440 | 6.4% |
a | 6463 | 5.6% |
n | 5851 | 5.0% |
, | 5813 | 5.0% |
r | 5439 | 4.7% |
o | 5334 | 4.6% |
i | 4910 | 4.2% |
l | 3606 | 3.1% |
t | 3220 | 2.8% |
Other values (1139) | 56897 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 61895 | |
Other Letter | 19876 | 17.1% |
Uppercase Letter | 15802 | 13.6% |
Space Separator | 11392 | 9.8% |
Other Punctuation | 6919 | 5.9% |
Dash Punctuation | 229 | 0.2% |
Open Punctuation | 66 | 0.1% |
Close Punctuation | 63 | 0.1% |
Control | 58 | < 0.1% |
Decimal Number | 42 | < 0.1% |
Other values (3) | 18 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 676 | 3.4% |
김 | 395 | 2.0% |
정 | 338 | 1.7% |
학 | 308 | 1.5% |
스 | 292 | 1.5% |
한 | 292 | 1.5% |
국 | 276 | 1.4% |
기 | 239 | 1.2% |
연 | 233 | 1.2% |
회 | 225 | 1.1% |
Other values (1049) | 16602 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 1188 | 7.5% |
M | 1179 | 7.5% |
J | 1176 | 7.4% |
A | 1050 | 6.6% |
R | 1048 | 6.6% |
C | 1005 | 6.4% |
D | 953 | 6.0% |
B | 944 | 6.0% |
H | 810 | 5.1% |
G | 783 | 5.0% |
Other values (17) | 5666 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 7440 | |
a | 6463 | |
n | 5851 | |
r | 5439 | 8.8% |
o | 5334 | 8.6% |
i | 4910 | 7.9% |
l | 3606 | 5.8% |
t | 3220 | 5.2% |
s | 2901 | 4.7% |
h | 2329 | 3.8% |
Other values (16) | 14402 |
Other Punctuation
Value | Count | Frequency (%) |
, | 5813 | |
. | 981 | 14.2% |
& | 55 | 0.8% |
' | 28 | 0.4% |
/ | 17 | 0.2% |
: | 11 | 0.2% |
· | 7 | 0.1% |
" | 5 | 0.1% |
! | 1 | < 0.1% |
! | 1 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
9 | 16 | |
1 | 9 | |
4 | 9 | |
2 | 2 | 4.8% |
6 | 2 | 4.8% |
0 | 2 | 4.8% |
5 | 1 | 2.4% |
3 | 1 | 2.4% |
Math Symbol
Value | Count | Frequency (%) |
| | 4 | |
< | 3 | |
> | 3 | |
+ | 1 | 9.1% |
Control
Value | Count | Frequency (%) |
54 | ||
3 | 5.2% | |
1 | 1.7% |
Modifier Symbol
Value | Count | Frequency (%) |
´ | 4 | |
¨ | 1 | 16.7% |
^ | 1 | 16.7% |
Space Separator
Value | Count | Frequency (%) |
11387 | ||
5 | < 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 225 | |
― | 4 | 1.7% |
Open Punctuation
Value | Count | Frequency (%) |
( | 65 | |
[ | 1 | 1.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 62 | |
] | 1 | 1.6% |
Other Symbol
Value | Count | Frequency (%) |
▼ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 77697 | |
Common | 18787 | 16.1% |
Hangul | 18054 | 15.5% |
Han | 1632 | 1.4% |
Katakana | 183 | 0.2% |
Hiragana | 7 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 676 | 3.7% |
김 | 395 | 2.2% |
정 | 338 | 1.9% |
학 | 308 | 1.7% |
스 | 292 | 1.6% |
한 | 292 | 1.6% |
국 | 276 | 1.5% |
기 | 239 | 1.3% |
연 | 233 | 1.3% |
회 | 225 | 1.2% |
Other values (600) | 14780 |
Han
Value | Count | Frequency (%) |
中 | 77 | 4.7% |
國 | 61 | 3.7% |
會 | 47 | 2.9% |
學 | 44 | 2.7% |
編 | 35 | 2.1% |
田 | 34 | 2.1% |
科 | 33 | 2.0% |
植 | 33 | 2.0% |
物 | 32 | 2.0% |
志 | 31 | 1.9% |
Other values (389) | 1205 |
Latin
Value | Count | Frequency (%) |
e | 7440 | 9.6% |
a | 6463 | 8.3% |
n | 5851 | 7.5% |
r | 5439 | 7.0% |
o | 5334 | 6.9% |
i | 4910 | 6.3% |
l | 3606 | 4.6% |
t | 3220 | 4.1% |
s | 2901 | 3.7% |
h | 2329 | 3.0% |
Other values (43) | 30204 |
Katakana
Value | Count | Frequency (%) |
シ | 18 | 9.8% |
イ | 16 | 8.7% |
ン | 16 | 8.7% |
ス | 12 | 6.6% |
ム | 11 | 6.0% |
エ | 10 | 5.5% |
オ | 9 | 4.9% |
バ | 9 | 4.9% |
ト | 6 | 3.3% |
リ | 6 | 3.3% |
Other values (34) | 70 |
Common
Value | Count | Frequency (%) |
11387 | ||
, | 5813 | |
. | 981 | 5.2% |
- | 225 | 1.2% |
( | 65 | 0.3% |
) | 62 | 0.3% |
& | 55 | 0.3% |
54 | 0.3% | |
' | 28 | 0.1% |
/ | 17 | 0.1% |
Other values (27) | 100 | 0.5% |
Hiragana
Value | Count | Frequency (%) |
い | 2 | |
と | 1 | |
な | 1 | |
ら | 1 | |
え | 1 | |
み | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 96460 | |
Hangul | 18052 | 15.5% |
CJK | 1618 | 1.4% |
Katakana | 183 | 0.2% |
None | 19 | < 0.1% |
CJK Compat Ideographs | 14 | < 0.1% |
Hiragana | 7 | < 0.1% |
Punctuation | 4 | < 0.1% |
Compat Jamo | 2 | < 0.1% |
Geometric Shapes | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
11387 | 11.8% | |
e | 7440 | 7.7% |
a | 6463 | 6.7% |
n | 5851 | 6.1% |
, | 5813 | 6.0% |
r | 5439 | 5.6% |
o | 5334 | 5.5% |
i | 4910 | 5.1% |
l | 3606 | 3.7% |
t | 3220 | 3.3% |
Other values (72) | 36997 |
Hangul
Value | Count | Frequency (%) |
이 | 676 | 3.7% |
김 | 395 | 2.2% |
정 | 338 | 1.9% |
학 | 308 | 1.7% |
스 | 292 | 1.6% |
한 | 292 | 1.6% |
국 | 276 | 1.5% |
기 | 239 | 1.3% |
연 | 233 | 1.3% |
회 | 225 | 1.2% |
Other values (598) | 14778 |
CJK
Value | Count | Frequency (%) |
中 | 77 | 4.8% |
國 | 61 | 3.8% |
會 | 47 | 2.9% |
學 | 44 | 2.7% |
編 | 35 | 2.2% |
田 | 34 | 2.1% |
科 | 33 | 2.0% |
植 | 33 | 2.0% |
物 | 32 | 2.0% |
志 | 31 | 1.9% |
Other values (382) | 1191 |
Katakana
Value | Count | Frequency (%) |
シ | 18 | 9.8% |
イ | 16 | 8.7% |
ン | 16 | 8.7% |
ス | 12 | 6.6% |
ム | 11 | 6.0% |
エ | 10 | 5.5% |
オ | 9 | 4.9% |
バ | 9 | 4.9% |
ト | 6 | 3.3% |
リ | 6 | 3.3% |
Other values (34) | 70 |
None
Value | Count | Frequency (%) |
· | 7 | |
5 | ||
´ | 4 | |
¨ | 1 | 5.3% |
Ø | 1 | 5.3% |
! | 1 | 5.3% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
利 | 5 | |
隆 | 4 | |
林 | 1 | 7.1% |
立 | 1 | 7.1% |
不 | 1 | 7.1% |
良 | 1 | 7.1% |
女 | 1 | 7.1% |
Punctuation
Value | Count | Frequency (%) |
― | 4 |
Hiragana
Value | Count | Frequency (%) |
い | 2 | |
と | 1 | |
な | 1 | |
ら | 1 | |
え | 1 | |
み | 1 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 1 | |
ㅡ | 1 |
Geometric Shapes
Value | Count | Frequency (%) |
▼ | 1 |
출판사 (publisher)
Text
Distinct | 2427 |
---|---|
Distinct (%) | 24.3% |
Missing | 10 |
Missing (%) | 0.1% |
Memory size | 156.2 KiB |
Length
Max length | 100 |
---|---|
Median length | 94 |
Mean length | 12.13003 |
Min length | 1 |
Characters and Unicode
Total characters | 121179 |
---|---|
Distinct characters | 846 |
Distinct categories | 12 |
Distinct scripts | 6 |
Distinct blocks | 9 |
Unique
Unique | 1421 |
---|---|
Unique (%) | 14.2% |
Sample
1st row | VCH |
---|---|
2nd row | 한국장류협동조합 |
3rd row | National Academy Press |
4th row | 청림출판 |
5th row | 學硏メディカル秀潤社 |
Value | Count | Frequency (%) |
press | 2160 | 11.7% |
academic | 875 | 4.7% |
university | 400 | 2.2% |
379 | 2.0% | |
wiley | 297 | 1.6% |
humana | 251 | 1.4% |
oxford | 251 | 1.4% |
of | 250 | 1.4% |
publishers | 203 | 1.1% |
springer | 198 | 1.1% |
Other values (2382) | 13253 |
Most occurring characters
Value | Count | Frequency (%) |
e | 9270 | 7.6% |
(space) | 8527 | 7.0% |
s | 7478 | 6.2% |
r | 7374 | 6.1% |
i | 6838 | 5.6% |
a | 5373 | 4.4% |
n | 4593 | 3.8% |
o | 4392 | 3.6% |
c | 4316 | 3.6% |
l | 3757 | 3.1% |
Other values (836) | 59261 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 71207 | |
Other Letter | 23940 | 19.8% |
Uppercase Letter | 14917 | 12.3% |
Space Separator | 8527 | 7.0% |
Other Punctuation | 1356 | 1.1% |
Dash Punctuation | 510 | 0.4% |
Control | 374 | 0.3% |
Close Punctuation | 119 | 0.1% |
Open Punctuation | 118 | 0.1% |
Decimal Number | 104 | 0.1% |
Other values (2) | 7 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 1166 | 4.9% |
스 | 796 | 3.3% |
이 | 682 | 2.8% |
학 | 511 | 2.1% |
국 | 477 | 2.0% |
한 | 460 | 1.9% |
원 | 456 | 1.9% |
판 | 419 | 1.8% |
출 | 418 | 1.7% |
문 | 384 | 1.6% |
Other values (757) | 18171 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 9270 | |
s | 7478 | |
r | 7374 | |
i | 6838 | |
a | 5373 | 7.5% |
n | 4593 | 6.5% |
o | 4392 | 6.2% |
c | 4316 | 6.1% |
l | 3757 | 5.3% |
t | 2859 | 4.0% |
Other values (16) | 14957 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 2952 | |
A | 1493 | |
S | 1429 | |
C | 1306 | 8.8% |
H | 927 | 6.2% |
B | 763 | 5.1% |
W | 713 | 4.8% |
I | 667 | 4.5% |
R | 575 | 3.9% |
L | 545 | 3.7% |
Other values (15) | 3547 |
Other Punctuation
Value | Count | Frequency (%) |
. | 422 | |
& | 353 | |
, | 344 | |
/ | 141 | 10.4% |
' | 39 | 2.9% |
: | 22 | 1.6% |
; | 16 | 1.2% |
* | 12 | 0.9% |
· | 4 | 0.3% |
@ | 2 | 0.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 49 | |
2 | 45 | |
9 | 5 | 4.8% |
8 | 2 | 1.9% |
3 | 1 | 1.0% |
0 | 1 | 1.0% |
5 | 1 | 1.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 497 | |
― | 13 | 2.5% |
Open Punctuation
Value | Count | Frequency (%) |
( | 106 | |
[ | 12 | 10.2% |
Close Punctuation
Value | Count | Frequency (%) |
) | 106 | |
] | 13 | 10.9% |
Space Separator
Value | Count | Frequency (%) |
8527 |
Control
Value | Count | Frequency (%) |
374 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 5 |
Math Symbol
Value | Count | Frequency (%) |
+ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 86124 | |
Hangul | 21477 | 17.7% |
Common | 11115 | 9.2% |
Han | 2119 | 1.7% |
Katakana | 339 | 0.3% |
Hiragana | 5 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
사 | 1166 | 5.4% |
스 | 796 | 3.7% |
이 | 682 | 3.2% |
학 | 511 | 2.4% |
국 | 477 | 2.2% |
한 | 460 | 2.1% |
원 | 456 | 2.1% |
판 | 419 | 2.0% |
출 | 418 | 1.9% |
문 | 384 | 1.8% |
Other values (469) | 15708 |
Han
Value | Count | Frequency (%) |
社 | 272 | 12.8% |
出 | 120 | 5.7% |
版 | 120 | 5.7% |
學 | 84 | 4.0% |
會 | 78 | 3.7% |
羊 | 72 | 3.4% |
土 | 72 | 3.4% |
書 | 50 | 2.4% |
日 | 40 | 1.9% |
業 | 32 | 1.5% |
Other values (225) | 1179 |
Latin
Value | Count | Frequency (%) |
e | 9270 | 10.8% |
s | 7478 | 8.7% |
r | 7374 | 8.6% |
i | 6838 | 7.9% |
a | 5373 | 6.2% |
n | 4593 | 5.3% |
o | 4392 | 5.1% |
c | 4316 | 5.0% |
l | 3757 | 4.4% |
P | 2952 | 3.4% |
Other values (41) | 29781 |
Katakana
Value | Count | Frequency (%) |
シ | 47 | 13.9% |
ン | 29 | 8.6% |
エ | 26 | 7.7% |
ム | 26 | 7.7% |
ス | 14 | 4.1% |
イ | 14 | 4.1% |
タ | 13 | 3.8% |
セ | 11 | 3.2% |
フ | 11 | 3.2% |
ラ | 11 | 3.2% |
Other values (38) | 137 |
Common
Value | Count | Frequency (%) |
8527 | ||
- | 497 | 4.5% |
. | 422 | 3.8% |
374 | 3.4% | |
& | 353 | 3.2% |
, | 344 | 3.1% |
/ | 141 | 1.3% |
( | 106 | 1.0% |
) | 106 | 1.0% |
1 | 49 | 0.4% |
Other values (18) | 196 | 1.8% |
Hiragana
Value | Count | Frequency (%) |
む | 1 | |
く | 1 | |
す | 1 | |
あ | 1 | |
と | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 97217 | |
Hangul | 21476 | 17.7% |
CJK | 2111 | 1.7% |
Katakana | 339 | 0.3% |
Punctuation | 18 | < 0.1% |
CJK Compat Ideographs | 8 | < 0.1% |
Hiragana | 5 | < 0.1% |
None | 4 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 9270 | 9.5% |
8527 | 8.8% | |
s | 7478 | 7.7% |
r | 7374 | 7.6% |
i | 6838 | 7.0% |
a | 5373 | 5.5% |
n | 4593 | 4.7% |
o | 4392 | 4.5% |
c | 4316 | 4.4% |
l | 3757 | 3.9% |
Other values (66) | 35299 |
Hangul
Value | Count | Frequency (%) |
사 | 1166 | 5.4% |
스 | 796 | 3.7% |
이 | 682 | 3.2% |
학 | 511 | 2.4% |
국 | 477 | 2.2% |
한 | 460 | 2.1% |
원 | 456 | 2.1% |
판 | 419 | 2.0% |
출 | 418 | 1.9% |
문 | 384 | 1.8% |
Other values (468) | 15707 |
CJK
Value | Count | Frequency (%) |
社 | 272 | 12.9% |
出 | 120 | 5.7% |
版 | 120 | 5.7% |
學 | 84 | 4.0% |
會 | 78 | 3.7% |
羊 | 72 | 3.4% |
土 | 72 | 3.4% |
書 | 50 | 2.4% |
日 | 40 | 1.9% |
業 | 32 | 1.5% |
Other values (219) | 1171 |
Katakana
Value | Count | Frequency (%) |
シ | 47 | 13.9% |
ン | 29 | 8.6% |
エ | 26 | 7.7% |
ム | 26 | 7.7% |
ス | 14 | 4.1% |
イ | 14 | 4.1% |
タ | 13 | 3.8% |
セ | 11 | 3.2% |
フ | 11 | 3.2% |
ラ | 11 | 3.2% |
Other values (38) | 137 |
Punctuation
Value | Count | Frequency (%) |
― | 13 | |
’ | 5 | 27.8% |
None
Value | Count | Frequency (%) |
· | 4 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
率 | 2 | |
勞 | 2 | |
林 | 1 | |
栗 | 1 | |
阮 | 1 | |
金 | 1 |
Hiragana
Value | Count | Frequency (%) |
む | 1 | |
く | 1 | |
す | 1 | |
あ | 1 | |
と | 1 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 1 |
출판년 (publication year)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 84 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2000.2735 |
Minimum | 1905 |
---|---|
Maximum | 2020 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1905 |
---|---|
5-th percentile | 1975 |
Q1 | 1994 |
median | 2003 |
Q3 | 2010 |
95-th percentile | 2017 |
Maximum | 2020 |
Range | 115 |
Interquartile range (IQR) | 16 |
Descriptive statistics
Standard deviation | 13.730177 |
---|---|
Coefficient of variation (CV) | 0.0068641496 |
Kurtosis | 3.9439247 |
Mean | 2000.2735 |
Median Absolute Deviation (MAD) | 8 |
Skewness | -1.5999817 |
Sum | 20002735 |
Variance | 188.51775 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2003 | 446 | 4.5% |
1994 | 411 | 4.1% |
2004 | 406 | 4.1% |
2005 | 390 | 3.9% |
2002 | 383 | 3.8% |
2007 | 342 | 3.4% |
1995 | 334 | 3.3% |
2011 | 330 | 3.3% |
2006 | 317 | 3.2% |
2013 | 310 | 3.1% |
Other values (74) | 6331 | 63.3% |
Value | Count | Frequency (%) |
1905 | 1 | < 0.1% |
1916 | 1 | < 0.1% |
1921 | 1 | < 0.1% |
1922 | 1 | < 0.1% |
1934 | 1 | < 0.1% |
1938 | 1 | < 0.1% |
1940 | 1 | < 0.1% |
1941 | 39 | 0.4% |
1943 | 13 | 0.1% |
1944 | 27 | 0.3% |
Value | Count | Frequency (%) |
2020 | 54 | 0.5% |
2019 | 147 | 1.5% |
2018 | 217 | 2.2% |
2017 | 187 | 1.9% |
2016 | 202 | 2.0% |
2015 | 273 | 2.7% |
2014 | 304 | 3.0% |
2013 | 310 | 3.1% |
2012 | 300 | 3.0% |
2011 | 330 | 3.3% |
분류 (classification)
Text
Distinct | 2969 |
---|---|
Distinct (%) | 29.7% |
Missing | 3 |
Missing (%) | < 0.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
qp601 | 336 | 3.2% |
qh506 | 322 | 3.1% |
qh301 | 207 | 2.0% |
md | 155 | 1.5% |
g154.7 | 138 | 1.3% |
qp551 | 124 | 1.2% |
qh324.2 | 84 | 0.8% |
tp248.2 | 81 | 0.8% |
qk355 | 76 | 0.7% |
tp248.3 | 73 | 0.7% |
Other values (2877) | 8773 |
Most occurring characters
Value | Count | Frequency (%) |
5 | 5668 | 8.6% |
. | 5561 | 8.4% |
Q | 5341 | 8.1% |
1 | 5149 | 7.8% |
6 | 4227 | 6.4% |
4 | 4181 | 6.3% |
2 | 4059 | 6.1% |
3 | 3868 | 5.8% |
9 | 2876 | 4.3% |
0 | 2871 | 4.3% |
Other values (35) | 22325 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 38515 | |
Uppercase Letter | 21496 | |
Other Punctuation | 5561 | 8.4% |
Space Separator | 372 | 0.6% |
Dash Punctuation | 176 | 0.3% |
Other Letter | 5 | < 0.1% |
Lowercase Letter | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
Q | 5341 | |
H | 2566 | |
P | 2478 | |
R | 1839 | 8.6% |
D | 1194 | 5.6% |
K | 986 | 4.6% |
T | 886 | 4.1% |
S | 859 | 4.0% |
B | 855 | 4.0% |
L | 753 | 3.5% |
Other values (16) | 3739 |
Decimal Number
Value | Count | Frequency (%) |
5 | 5668 | |
1 | 5149 | |
6 | 4227 | |
4 | 4181 | |
2 | 4059 | |
3 | 3868 | |
9 | 2876 | |
0 | 2871 | |
7 | 2836 | |
8 | 2780 |
Other Letter
Value | Count | Frequency (%) |
김 | 1 | |
홍 | 1 | |
피 | 1 | |
황 | 1 | |
박 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
. | 5561 |
Space Separator
Value | Count | Frequency (%) |
372 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 176 |
Lowercase Letter
Value | Count | Frequency (%) |
l | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 44624 | |
Latin | 21497 | |
Hangul | 5 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
Q | 5341 | |
H | 2566 | |
P | 2478 | |
R | 1839 | 8.6% |
D | 1194 | 5.6% |
K | 986 | 4.6% |
T | 886 | 4.1% |
S | 859 | 4.0% |
B | 855 | 4.0% |
L | 753 | 3.5% |
Other values (17) | 3740 |
Common
Value | Count | Frequency (%) |
5 | 5668 | |
. | 5561 | |
1 | 5149 | |
6 | 4227 | |
4 | 4181 | |
2 | 4059 | |
3 | 3868 | |
9 | 2876 | |
0 | 2871 | |
7 | 2836 | |
Other values (3) | 3328 |
Hangul
Value | Count | Frequency (%) |
김 | 1 | |
홍 | 1 | |
피 | 1 | |
황 | 1 | |
박 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 66121 | |
Hangul | 5 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 5668 | 8.6% |
. | 5561 | 8.4% |
Q | 5341 | 8.1% |
1 | 5149 | 7.8% |
6 | 4227 | 6.4% |
4 | 4181 | 6.3% |
2 | 4059 | 6.1% |
3 | 3868 | 5.8% |
9 | 2876 | 4.3% |
0 | 2871 | 4.3% |
Other values (30) | 22320 |
Hangul
Value | Count | Frequency (%) |
김 | 1 | |
홍 | 1 | |
피 | 1 | |
황 | 1 | |
박 | 1 |
도서 (book)
Text
MISSING
 
Distinct | 6733 |
---|---|
Distinct (%) | 68.6% |
Missing | 181 |
Missing (%) | 1.8% |
Memory size | 156.2 KiB |
Length
Max length | 19 |
---|---|
Median length | 16 |
Mean length | 8.7515022 |
Min length | 3 |
Characters and Unicode
Total characters | 85931 |
---|---|
Distinct characters | 496 |
Distinct categories | 7 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 5530 ? |
---|---|
Unique (%) | 56.3% |
Sample
1st row | .B55 1995 |
---|---|
2nd row | 박13 2009 |
3rd row | .E365 1996 |
4th row | 토798 2006 |
5th row | 장295 2015 |
Value | Count | Frequency (%) |
2 | 577 | 2.9% |
2003 | 406 | 2.0% |
2005 | 376 | 1.9% |
2004 | 374 | 1.9% |
1994 | 363 | 1.8% |
2002 | 357 | 1.8% |
2007 | 339 | 1.7% |
1995 | 318 | 1.6% |
3 | 313 | 1.6% |
2011 | 313 | 1.6% |
Other values (3972) | 16431 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 10350 | 12.0% |
0 | 9839 | |
2 | 8822 | |
9 | 8335 | |
1 | 8067 | |
. | 5281 | 6.1% |
4 | 4304 | 5.0% |
5 | 4197 | 4.9% |
6 | 4069 | 4.7% |
3 | 3817 | 4.4% |
Other values (486) | 18850 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 58845 | |
Space Separator | 10350 | 12.0% |
Uppercase Letter | 5289 | 6.2% |
Other Punctuation | 5281 | 6.1% |
Other Letter | 4578 | 5.3% |
Dash Punctuation | 1532 | 1.8% |
Lowercase Letter | 56 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 233 | 5.1% |
한 | 230 | 5.0% |
김 | 209 | 4.6% |
박 | 109 | 2.4% |
세 | 104 | 2.3% |
유 | 83 | 1.8% |
생 | 75 | 1.6% |
조 | 70 | 1.5% |
정 | 70 | 1.5% |
중 | 65 | 1.4% |
Other values (441) | 3330 |
Uppercase Letter
Value | Count | Frequency (%) |
M | 898 | |
A | 489 | 9.2% |
C | 461 | 8.7% |
B | 373 | 7.1% |
S | 366 | 6.9% |
P | 353 | 6.7% |
E | 235 | 4.4% |
I | 228 | 4.3% |
F | 218 | 4.1% |
H | 211 | 4.0% |
Other values (16) | 1457 |
Decimal Number
Value | Count | Frequency (%) |
0 | 9839 | |
2 | 8822 | |
9 | 8335 | |
1 | 8067 | |
4 | 4304 | |
5 | 4197 | |
6 | 4069 | |
3 | 3817 | 6.5% |
7 | 3786 | 6.4% |
8 | 3609 | 6.1% |
Lowercase Letter
Value | Count | Frequency (%) |
t | 47 | |
p | 4 | 7.1% |
u | 2 | 3.6% |
x | 1 | 1.8% |
s | 1 | 1.8% |
l | 1 | 1.8% |
Space Separator
Value | Count | Frequency (%) |
10350 |
Other Punctuation
Value | Count | Frequency (%) |
. | 5281 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1532 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 76008 | |
Latin | 5345 | 6.2% |
Hangul | 4578 | 5.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 233 | 5.1% |
한 | 230 | 5.0% |
김 | 209 | 4.6% |
박 | 109 | 2.4% |
세 | 104 | 2.3% |
유 | 83 | 1.8% |
생 | 75 | 1.6% |
조 | 70 | 1.5% |
정 | 70 | 1.5% |
중 | 65 | 1.4% |
Other values (441) | 3330 |
Latin
Value | Count | Frequency (%) |
M | 898 | |
A | 489 | 9.1% |
C | 461 | 8.6% |
B | 373 | 7.0% |
S | 366 | 6.8% |
P | 353 | 6.6% |
E | 235 | 4.4% |
I | 228 | 4.3% |
F | 218 | 4.1% |
H | 211 | 3.9% |
Other values (22) | 1513 |
Common
Value | Count | Frequency (%) |
10350 | ||
0 | 9839 | |
2 | 8822 | |
9 | 8335 | |
1 | 8067 | |
. | 5281 | |
4 | 4304 | |
5 | 4197 | |
6 | 4069 | 5.4% |
3 | 3817 | 5.0% |
Other values (3) | 8927 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 81353 | |
Hangul | 4543 | 5.3% |
Compat Jamo | 35 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
10350 | ||
0 | 9839 | |
2 | 8822 | |
9 | 8335 | |
1 | 8067 | |
. | 5281 | 6.5% |
4 | 4304 | 5.3% |
5 | 4197 | 5.2% |
6 | 4069 | 5.0% |
3 | 3817 | 4.7% |
Other values (35) | 14272 |
Hangul
Value | Count | Frequency (%) |
이 | 233 | 5.1% |
한 | 230 | 5.1% |
김 | 209 | 4.6% |
박 | 109 | 2.4% |
세 | 104 | 2.3% |
유 | 83 | 1.8% |
생 | 75 | 1.7% |
조 | 70 | 1.5% |
정 | 70 | 1.5% |
중 | 65 | 1.4% |
Other values (430) | 3295 |
Compat Jamo
Value | Count | Frequency (%) |
ㅎ | 7 | |
ㅇ | 6 | |
ㅅ | 5 | |
ㄱ | 4 | |
ㅈ | 4 | |
ㅁ | 2 | 5.7% |
ㅂ | 2 | 5.7% |
ㄴ | 2 | 5.7% |
ㅍ | 1 | 2.9% |
ㄷ | 1 | 2.9% |
권년차 (volume/year/issue)
Text
MISSING
 
Distinct | 633 |
---|---|
Distinct (%) | 20.2% |
Missing | 6862 |
Missing (%) | 68.6% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
v.1 | 450 | 14.3% |
v.2 | 436 | 13.9% |
v.3 | 180 | 5.7% |
v.4 | 108 | 3.4% |
v.5 | 86 | 2.7% |
v.6 | 62 | 2.0% |
v.7 | 59 | 1.9% |
v.8 | 45 | 1.4% |
v.10 | 36 | 1.1% |
v.9 | 35 | 1.1% |
Other values (620) | 1644 |
Most occurring characters
Value | Count | Frequency (%) |
. | 2906 | 24.0% |
v | 2890 | 23.9% |
1 | 1419 | 11.7% |
2 | 1288 | 10.6% |
3 | 623 | 5.2% |
0 | 537 | 4.4% |
4 | 420 | 3.5% |
5 | 367 | 3.0% |
6 | 358 | 3.0% |
9 | 330 | 2.7% |
Other values (25) | 956 | 7.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 5905 | 48.8% |
Lowercase Letter | 2931 | 24.2% |
Other Punctuation | 2924 | 24.2% |
Dash Punctuation | 215 | 1.8% |
Uppercase Letter | 65 | 0.5% |
Close Punctuation | 24 | 0.2% |
Open Punctuation | 24 | 0.2% |
Space Separator | 3 | < 0.1% |
Other Letter | 3 | < 0.1% |
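The category names in this table correspond to Unicode general categories (Decimal Number is `Nd`, Lowercase Letter is `Ll`, Other Punctuation is `Po`, Dash Punctuation is `Pd`, Other Letter is `Lo`). A minimal sketch of how such a tally can be produced with Python's standard `unicodedata` module; the helper name `category_counts` and the sample values are hypothetical, and this is not claimed to be the profiling tool's own implementation:

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters by Unicode general category code (e.g. 'Nd', 'Ll')."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Hypothetical sample in the style of the 권년차 column; not real rows.
sample = ["v.1", "v.2", "v.10", "1995-1996", "해설서"]
counts = category_counts(sample)

# Print categories from most to least frequent.
for code, n in counts.most_common():
    print(code, n)
```

Running the same function over every non-missing value of a column would give per-category totals comparable to the table above.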
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 1419 | 24.0% |
2 | 1288 | 21.8% |
3 | 623 | 10.6% |
0 | 537 | 9.1% |
4 | 420 | 7.1% |
5 | 367 | 6.2% |
6 | 358 | 6.1% |
9 | 330 | 5.6% |
7 | 285 | 4.8% |
8 | 278 | 4.7% |
Lowercase Letter
Value | Count | Frequency (%) |
v | 2890 | 98.6% |
p | 17 | 0.6% |
t | 15 | 0.5% |
a | 3 | 0.1% |
c | 2 | 0.1% |
s | 1 | < 0.1% |
u | 1 | < 0.1% |
e | 1 | < 0.1% |
b | 1 | < 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 22 | 33.8% |
A | 21 | 32.3% |
I | 17 | 26.2% |
P | 2 | 3.1% |
C | 2 | 3.1% |
F | 1 | 1.5% |
Other Punctuation
Value | Count | Frequency (%) |
. | 2906 | 99.4% |
/ | 17 | 0.6% |
, | 1 | < 0.1% |
Other Letter
Value | Count | Frequency (%) |
해 | 1 | 33.3% |
설 | 1 | 33.3% |
서 | 1 | 33.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 215 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 24 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 24 | 100.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 3 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 9095 | 75.2% |
Latin | 2996 | 24.8% |
Hangul | 3 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 2906 | 32.0% |
1 | 1419 | 15.6% |
2 | 1288 | 14.2% |
3 | 623 | 6.8% |
0 | 537 | 5.9% |
4 | 420 | 4.6% |
5 | 367 | 4.0% |
6 | 358 | 3.9% |
9 | 330 | 3.6% |
7 | 285 | 3.1% |
Other values (7) | 562 | 6.2% |
Latin
Value | Count | Frequency (%) |
v | 2890 | 96.5% |
B | 22 | 0.7% |
A | 21 | 0.7% |
p | 17 | 0.6% |
I | 17 | 0.6% |
t | 15 | 0.5% |
a | 3 | 0.1% |
P | 2 | 0.1% |
c | 2 | 0.1% |
C | 2 | 0.1% |
Other values (5) | 5 | 0.2% |
Hangul
Value | Count | Frequency (%) |
해 | 1 | 33.3% |
설 | 1 | 33.3% |
서 | 1 | 33.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 12091 | > 99.9% |
Hangul | 3 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 2906 | 24.0% |
v | 2890 | 23.9% |
1 | 1419 | 11.7% |
2 | 1288 | 10.7% |
3 | 623 | 5.2% |
0 | 537 | 4.4% |
4 | 420 | 3.5% |
5 | 367 | 3.0% |
6 | 358 | 3.0% |
9 | 330 | 2.7% |
Other values (22) | 953 | 7.9% |
Hangul
Value | Count | Frequency (%) |
해 | 1 | 33.3% |
설 | 1 | 33.3% |
서 | 1 | 33.3% |
복본
Categorical
IMBALANCE
 
Distinct | 14 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | 8813 |
---|---|
c.2 | 877 |
c.3 | 227 |
c.4 | 53 |
c.5 | 12 |
Other values (9) | 18 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8815 |
Min length | 3 |
Unique
Unique | 4 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 8813 | 88.1% |
c.2 | 877 | 8.8% |
c.3 | 227 | 2.3% |
c.4 | 53 | 0.5% |
c.5 | 12 | 0.1% |
v.1 | 4 | < 0.1% |
c.7 | 3 | < 0.1% |
c.6 | 3 | < 0.1% |
v.3 | 2 | < 0.1% |
v.2 | 2 | < 0.1% |
Other values (4) | 4 | < 0.1% |
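The 82.5% imbalance figure flagged for 복본 can be reproduced from the counts in this table. A minimal sketch, assuming the score is one minus the Shannon entropy of the class frequencies divided by its maximum (the log of the number of classes); that definition is an assumption here, although it reproduces the reported value. The four "Other values" are taken as one occurrence each, consistent with the Unique count of 4, and the function name `imbalance_score` is illustrative:

```python
import math

# Class counts from the Common Values table; the last four entries are the
# "Other values (4)", assumed to occur once each (Unique = 4).
counts = [8813, 877, 227, 53, 12, 4, 3, 3, 2, 2, 1, 1, 1, 1]


def imbalance_score(class_counts):
    """Return 1 - H/H_max: 0 for a uniform distribution, near 1 when one class dominates."""
    total = sum(class_counts)
    n_classes = len(class_counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum(
        (c / total) * math.log(c / total) for c in class_counts if c > 0
    )
    return 1 - entropy / math.log(n_classes)


score = imbalance_score(counts)
print(f"Imbalance: {score:.1%}")
```

Because the score is a ratio of entropies, the choice of logarithm base does not affect the result.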
Length
Value | Count | Frequency (%) |
na | 8813 | 88.1% |
c.2 | 877 | 8.8% |
c.3 | 227 | 2.3% |
c.4 | 53 | 0.5% |
c.5 | 12 | 0.1% |
v.1 | 4 | < 0.1% |
c.7 | 3 | < 0.1% |
c.6 | 3 | < 0.1% |
v.3 | 2 | < 0.1% |
v.2 | 2 | < 0.1% |
Other values (4) | 4 | < 0.1% |
Variable | 등록번호 | 출판년 | 복본 |
---|---|---|---|
등록번호 | 1.000 | 0.845 | 0.197 |
출판년 | 0.845 | 1.000 | 0.163 |
복본 | 0.197 | 0.163 | 1.000 |
Variable | 등록번호 | 출판년 | 복본 |
---|---|---|---|
등록번호 | 1.000 | 0.896 | 0.082 |
출판년 | 0.896 | 1.000 | 0.081 |
복본 | 0.082 | 0.081 | 1.000 |
Row index | 등록번호 | 서명 | 저자 | 출판사 | 출판년 | 분류 | 도서 | 권년차 | 복본 |
---|---|---|---|---|---|---|---|---|---|
1876 | 2583 | Bioinformatics : from nucleic acids and proteins to cell metabolism ; from nucleic acids and proteins to cell metabolism ; contributions to the conference on "Bioinformatics", October 9 to 11, 1995, B | Schomburg, Dietmar | VCH | 1995 | QP517.C45 | .B55 1995 | <NA> | <NA> |
10131 | 12979 | 장류의 과학과 건강기능성 | 박건영 | 한국장류협동조합 | 2009 | TX560.F47 | 박13 2009 | <NA> | <NA> |
5718 | 7763 | Ecologically based pest management : new solutions for a new century | National Research Council (U.S.) | National Academy Press | 1996 | SB950 | .E365 1996 | <NA> | <NA> |
8804 | 11551 | (앨빈 토플러)부의 미래 | 토플러, 앨빈 | 청림출판 | 2006 | HB3730 | 토798 2006 | <NA> | <NA> |
13675 | 16909 | 進化の謎をゲノムで解く | 長谷部, 光泰 | 學硏メディカル秀潤社 | 2015 | QH431 | 장295 2015 | <NA> | <NA> |
13589 | 16823 | 바이오센서 응용분야별 R&D현황 및 나노바이오융합 기술/시장분석 | R&D정보센터 | 지식산업정보원 | 2015 | MD 2015 -14 | <NA> | <NA> | <NA> |
1390 | 1919 | The model leader : a fully functioning person | Hitt, William D | Battelle Press | 1993 | HD57.7 | .H58 1993 | <NA> | <NA> |
9521 | 12299 | 커피 | 조윤정 | 대원사 | 2007 | TX415 | 조37 2007 | <NA> | <NA> |
13860 | 17122 | 웃으면서 죽음을 이야기하는 방법 | 최세희 | 다산북스 | 2016 | PR6052.A6657 | 웃68 2016 | <NA> | <NA> |
1202 | 1429 | EDI通信과 保安 | 임승택 | 컴퓨터월드 출판사업부 | 1994 | TK3226 | 임57 1994 | <NA> | <NA> |
Row index | 등록번호 | 서명 | 저자 | 출판사 | 출판년 | 분류 | 도서 | 권년차 | 복본 |
---|---|---|---|---|---|---|---|---|---|
14916 | 18215 | 바이오의약품 산업분석보고서 | 비피기술거래 | 비티타임즈 | 2018 | MD 2018 -10 | <NA> | <NA> | <NA> |
6806 | 8946 | Microbes : an invisible universe | Gest, Howard | ASM Press | 2003 | QR41.2 | .G468 2003 | <NA> | <NA> |
3701 | 5266 | Integrin protocols | Howlett, Anthony | Humana Press | 1999 | QH506 | .M48 1999 | v.129 | <NA> |
749 | 761 | Vitamins and hormones | <NA> | Academic Press | 1943 | QP801.V5 | .V5 | v.26 | <NA> |
647 | 653 | Progress in nucleic acid research and molecular biology | Davidson, J. N | Academic Press | 1963 | QP551 | .P695 | v.18 | <NA> |
188 | 192 | Aqueous two-phase systems | Walter, Harry | Academic Press | 1994 | QP601 | .M49 1994 | v.228 | <NA> |
6376 | 8493 | 정부지원제도총람 | 한국산업정보원 | 한국산업정보원 | 2003 | HD62.5 | 정46 | 2003 | <NA> |
2367 | 3337 | Nuclear magnetic resonance and nucleic acids | James, Thomas L | Academic Press | 1995 | QP601 | .M49 1995 | v.261 | <NA> |
6091 | 8171 | Marek's disease | Hirai, Kanji | Springer | 2001 | QR1 | .C9 2001 | v.255 | <NA> |
10445 | 13338 | 바우돌리노 : 움베르토 에코 장편소설 상-하 | 에코, 움베르토 | 열린책들 | 2002 | PQ4865.C65 | 바67 2002 | v.1 | <NA> |