Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 258 |
Missing cells (%) | 0.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 634.8 KiB |
Average record size in memory | 65.0 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 1 |
Text | 5 |
Dataset
Description | 도서명, 저자명, 출판사명, 형태사항, 주기사항,원문정보등 |
---|---|
Author | 충북대학교 |
URL | https://www.data.go.kr/data/3058186/fileData.do |
Reproduction
Analysis started | 2023-12-12 04:56:07.387462 |
---|---|
Analysis finished | 2023-12-12 04:56:10.843416 |
Duration | 3.46 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
서지번호
Real number (ℝ)
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1175787.5 |
Minimum | 44494 |
---|---|
Maximum | 3017993 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 44494 |
---|---|
5-th percentile | 93635.7 |
Q1 | 264490.25 |
median | 422836 |
Q3 | 2242929.8 |
95-th percentile | 2962100.6 |
Maximum | 3017993 |
Range | 2973499 |
Interquartile range (IQR) | 1978439.5 |
Descriptive statistics
Standard deviation | 1075023 |
---|---|
Coefficient of variation (CV) | 0.91430039 |
Kurtosis | -1.5860814 |
Mean | 1175787.5 |
Median Absolute Deviation (MAD) | 279787 |
Skewness | 0.44847822 |
Sum | 1.1757875 × 1010 |
Variance | 1.1556744 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
285701 | 1 | < 0.1% |
272390 | 1 | < 0.1% |
2081396 | 1 | < 0.1% |
2187130 | 1 | < 0.1% |
1933784 | 1 | < 0.1% |
2973452 | 1 | < 0.1% |
404881 | 1 | < 0.1% |
2771411 | 1 | < 0.1% |
358401 | 1 | < 0.1% |
389211 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
44494 | 1 | |
44503 | 1 | |
44517 | 1 | |
44631 | 1 | |
44634 | 1 | |
44645 | 1 | |
44757 | 1 | |
44803 | 1 | |
44839 | 1 | |
44951 | 1 |
Value | Count | Frequency (%) |
3017993 | 1 | |
3017983 | 1 | |
3017980 | 1 | |
3017709 | 1 | |
3015435 | 1 | |
3015243 | 1 | |
3015233 | 1 | |
3015226 | 1 | |
3014845 | 1 | |
3014828 | 1 |
유형
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
국내단행본 | |
---|---|
DVD | 375 |
고서 | 49 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.9103 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 국내단행본 |
---|---|
2nd row | 국내단행본 |
3rd row | 국내단행본 |
4th row | 국내단행본 |
5th row | 국내단행본 |
Common Values
Value | Count | Frequency (%) |
국내단행본 | 9576 | |
DVD | 375 | 3.8% |
고서 | 49 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
국내단행본 | 9576 | |
dvd | 375 | 3.8% |
고서 | 49 | 0.5% |
서명
Text
Distinct | 9728 |
---|---|
Distinct (%) | 97.3% |
Missing | 1 |
Missing (%) | < 0.1% |
Memory size | 156.2 KiB |
Length
Max length | 133 |
---|---|
Median length | 88 |
Mean length | 15.020002 |
Min length | 1 |
Characters and Unicode
Total characters | 150185 |
---|---|
Distinct characters | 2635 |
Distinct categories | 16 ? |
Distinct scripts | 7 ? |
Distinct blocks | 13 ? |
Unique
Unique | 9557 ? |
---|---|
Unique (%) | 95.6% |
Sample
1st row | TCP/IP 네트워킹 |
---|---|
2nd row | 건축설계이론 |
3rd row | (새로운)財務管理論 |
4th row | (단재)신채호 |
5th row | 軍改革 이렇게 해야 한다 |
Value | Count | Frequency (%) |
위한 | 325 | 1.1% |
연구 | 261 | 0.9% |
및 | 244 | 0.8% |
개발 | 118 | 0.4% |
관한 | 92 | 0.3% |
이야기 | 78 | 0.3% |
21세기 | 72 | 0.2% |
이해 | 69 | 0.2% |
67 | 0.2% | |
프로그래밍 | 58 | 0.2% |
Other values (18772) | 29041 |
Most occurring characters
Value | Count | Frequency (%) |
20438 | 13.6% | |
) | 5270 | 3.5% |
( | 5269 | 3.5% |
의 | 2607 | 1.7% |
기 | 1726 | 1.1% |
한 | 1605 | 1.1% |
사 | 1524 | 1.0% |
이 | 1492 | 1.0% |
0 | 1234 | 0.8% |
학 | 1181 | 0.8% |
Other values (2625) | 107839 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 98158 | |
Space Separator | 20438 | 13.6% |
Lowercase Letter | 9229 | 6.1% |
Uppercase Letter | 5958 | 4.0% |
Close Punctuation | 5326 | 3.5% |
Open Punctuation | 5325 | 3.5% |
Decimal Number | 4365 | 2.9% |
Other Punctuation | 1082 | 0.7% |
Dash Punctuation | 185 | 0.1% |
Math Symbol | 87 | 0.1% |
Other values (6) | 32 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 2607 | 2.7% |
기 | 1726 | 1.8% |
한 | 1605 | 1.6% |
사 | 1524 | 1.6% |
이 | 1492 | 1.5% |
학 | 1181 | 1.2% |
는 | 1107 | 1.1% |
과 | 1055 | 1.1% |
국 | 997 | 1.0% |
가 | 962 | 1.0% |
Other values (2488) | 83902 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 1028 | |
o | 842 | 9.1% |
i | 812 | 8.8% |
a | 807 | 8.7% |
n | 714 | 7.7% |
r | 684 | 7.4% |
t | 619 | 6.7% |
s | 555 | 6.0% |
l | 429 | 4.6% |
c | 373 | 4.0% |
Other values (38) | 2366 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 569 | 9.6% |
C | 505 | 8.5% |
A | 491 | 8.2% |
T | 441 | 7.4% |
I | 419 | 7.0% |
E | 414 | 6.9% |
P | 329 | 5.5% |
O | 312 | 5.2% |
M | 291 | 4.9% |
D | 273 | 4.6% |
Other values (20) | 1914 |
Other Punctuation
Value | Count | Frequency (%) |
. | 291 | |
, | 275 | |
· | 191 | |
/ | 81 | 7.5% |
! | 64 | 5.9% |
& | 49 | 4.5% |
' | 45 | 4.2% |
: | 24 | 2.2% |
" | 18 | 1.7% |
% | 15 | 1.4% |
Other values (7) | 29 | 2.7% |
Decimal Number
Value | Count | Frequency (%) |
0 | 1234 | |
1 | 895 | |
2 | 849 | |
3 | 280 | 6.4% |
5 | 247 | 5.7% |
9 | 219 | 5.0% |
4 | 174 | 4.0% |
8 | 166 | 3.8% |
6 | 156 | 3.6% |
7 | 145 | 3.3% |
Math Symbol
Value | Count | Frequency (%) |
+ | 56 | |
~ | 21 | 24.1% |
> | 4 | 4.6% |
= | 3 | 3.4% |
< | 2 | 2.3% |
+ | 1 | 1.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 6 | |
Ⅰ | 5 | |
Ⅲ | 4 | |
Ⅳ | 1 | 5.6% |
Ⅴ | 1 | 5.6% |
Ⅶ | 1 | 5.6% |
Close Punctuation
Value | Count | Frequency (%) |
) | 5270 | |
] | 30 | 0.6% |
』 | 13 | 0.2% |
」 | 12 | 0.2% |
] | 1 | < 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 5269 | |
[ | 30 | 0.6% |
『 | 13 | 0.2% |
「 | 12 | 0.2% |
[ | 1 | < 0.1% |
Other Symbol
Value | Count | Frequency (%) |
™ | 5 | |
® | 2 | 25.0% |
ⓝ | 1 | 12.5% |
Modifier Symbol
Value | Count | Frequency (%) |
´ | 1 | |
^ | 1 |
Space Separator
Value | Count | Frequency (%) |
20438 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 185 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 2 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 82217 | |
Common | 36822 | |
Han | 15871 | 10.6% |
Latin | 15149 | 10.1% |
Cyrillic | 56 | < 0.1% |
Katakana | 48 | < 0.1% |
Hiragana | 22 | < 0.1% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
學 | 465 | 2.9% |
國 | 372 | 2.3% |
法 | 298 | 1.9% |
集 | 239 | 1.5% |
論 | 237 | 1.5% |
新 | 232 | 1.5% |
經 | 213 | 1.3% |
敎 | 181 | 1.1% |
文 | 180 | 1.1% |
濟 | 137 | 0.9% |
Other values (1403) | 13317 |
Hangul
Value | Count | Frequency (%) |
의 | 2607 | 3.2% |
기 | 1726 | 2.1% |
한 | 1605 | 2.0% |
사 | 1524 | 1.9% |
이 | 1492 | 1.8% |
학 | 1181 | 1.4% |
는 | 1107 | 1.3% |
과 | 1055 | 1.3% |
국 | 997 | 1.2% |
가 | 962 | 1.2% |
Other values (1038) | 67961 |
Latin
Value | Count | Frequency (%) |
e | 1028 | 6.8% |
o | 842 | 5.6% |
i | 812 | 5.4% |
a | 807 | 5.3% |
n | 714 | 4.7% |
r | 684 | 4.5% |
t | 619 | 4.1% |
S | 569 | 3.8% |
s | 555 | 3.7% |
C | 505 | 3.3% |
Other values (48) | 8014 |
Common
Value | Count | Frequency (%) |
20438 | ||
) | 5270 | 14.3% |
( | 5269 | 14.3% |
0 | 1234 | 3.4% |
1 | 895 | 2.4% |
2 | 849 | 2.3% |
. | 291 | 0.8% |
3 | 280 | 0.8% |
, | 275 | 0.7% |
5 | 247 | 0.7% |
Other values (43) | 1774 | 4.8% |
Katakana
Value | Count | Frequency (%) |
ス | 6 | 12.5% |
シ | 5 | 10.4% |
ム | 4 | 8.3% |
テ | 4 | 8.3% |
リ | 3 | 6.2% |
ト | 2 | 4.2% |
ン | 2 | 4.2% |
オ | 1 | 2.1% |
ナ | 1 | 2.1% |
ア | 1 | 2.1% |
Other values (19) | 19 |
Cyrillic
Value | Count | Frequency (%) |
и | 7 | 12.5% |
я | 5 | 8.9% |
е | 5 | 8.9% |
л | 4 | 7.1% |
а | 4 | 7.1% |
р | 4 | 7.1% |
д | 3 | 5.4% |
ы | 2 | 3.6% |
т | 2 | 3.6% |
п | 2 | 3.6% |
Other values (16) | 18 |
Hiragana
Value | Count | Frequency (%) |
の | 13 | |
と | 3 | 13.6% |
に | 1 | 4.5% |
き | 1 | 4.5% |
る | 1 | 4.5% |
め | 1 | 4.5% |
た | 1 | 4.5% |
へ | 1 | 4.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 82182 | |
ASCII | 51680 | |
CJK | 15561 | 10.4% |
CJK Compat Ideographs | 310 | 0.2% |
None | 264 | 0.2% |
Cyrillic | 56 | < 0.1% |
Katakana | 48 | < 0.1% |
Compat Jamo | 35 | < 0.1% |
Hiragana | 22 | < 0.1% |
Number Forms | 18 | < 0.1% |
Other values (3) | 9 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
20438 | ||
) | 5270 | 10.2% |
( | 5269 | 10.2% |
0 | 1234 | 2.4% |
e | 1028 | 2.0% |
1 | 895 | 1.7% |
2 | 849 | 1.6% |
o | 842 | 1.6% |
i | 812 | 1.6% |
a | 807 | 1.6% |
Other values (75) | 14236 |
Hangul
Value | Count | Frequency (%) |
의 | 2607 | 3.2% |
기 | 1726 | 2.1% |
한 | 1605 | 2.0% |
사 | 1524 | 1.9% |
이 | 1492 | 1.8% |
학 | 1181 | 1.4% |
는 | 1107 | 1.3% |
과 | 1055 | 1.3% |
국 | 997 | 1.2% |
가 | 962 | 1.2% |
Other values (1036) | 67926 |
CJK
Value | Count | Frequency (%) |
學 | 465 | 3.0% |
國 | 372 | 2.4% |
法 | 298 | 1.9% |
集 | 239 | 1.5% |
論 | 237 | 1.5% |
新 | 232 | 1.5% |
經 | 213 | 1.4% |
敎 | 181 | 1.2% |
文 | 180 | 1.2% |
濟 | 137 | 0.9% |
Other values (1343) | 13007 |
None
Value | Count | Frequency (%) |
· | 191 | |
』 | 13 | 4.9% |
『 | 13 | 4.9% |
「 | 12 | 4.5% |
」 | 12 | 4.5% |
% | 7 | 2.7% |
& | 4 | 1.5% |
! | 3 | 1.1% |
® | 2 | 0.8% |
´ | 1 | 0.4% |
Other values (6) | 6 | 2.3% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
論 | 43 | |
理 | 41 | 13.2% |
金 | 29 | 9.4% |
李 | 29 | 9.4% |
倫 | 14 | 4.5% |
力 | 13 | 4.2% |
勞 | 10 | 3.2% |
例 | 10 | 3.2% |
歷 | 9 | 2.9% |
年 | 7 | 2.3% |
Other values (50) | 105 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 32 | |
ㅇ | 3 | 8.6% |
Hiragana
Value | Count | Frequency (%) |
の | 13 | |
と | 3 | 13.6% |
に | 1 | 4.5% |
き | 1 | 4.5% |
る | 1 | 4.5% |
め | 1 | 4.5% |
た | 1 | 4.5% |
へ | 1 | 4.5% |
Cyrillic
Value | Count | Frequency (%) |
и | 7 | 12.5% |
я | 5 | 8.9% |
е | 5 | 8.9% |
л | 4 | 7.1% |
а | 4 | 7.1% |
р | 4 | 7.1% |
д | 3 | 5.4% |
ы | 2 | 3.6% |
т | 2 | 3.6% |
п | 2 | 3.6% |
Other values (16) | 18 |
Katakana
Value | Count | Frequency (%) |
ス | 6 | 12.5% |
シ | 5 | 10.4% |
ム | 4 | 8.3% |
テ | 4 | 8.3% |
リ | 3 | 6.2% |
ト | 2 | 4.2% |
ン | 2 | 4.2% |
オ | 1 | 2.1% |
ナ | 1 | 2.1% |
ア | 1 | 2.1% |
Other values (19) | 19 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 6 | |
Ⅰ | 5 | |
Ⅲ | 4 | |
Ⅳ | 1 | 5.6% |
Ⅴ | 1 | 5.6% |
Ⅶ | 1 | 5.6% |
Letterlike Symbols
Value | Count | Frequency (%) |
™ | 5 |
Punctuation
Value | Count | Frequency (%) |
’ | 2 | |
‘ | 1 |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓝ | 1 |
저자
Text
Distinct | 8245 |
---|---|
Distinct (%) | 83.2% |
Missing | 93 |
Missing (%) | 0.9% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
한국 | 168 | 1.2% |
편집부 | 80 | 0.6% |
david | 45 | 0.3% |
j | 43 | 0.3% |
정보통신부 | 37 | 0.3% |
john | 35 | 0.3% |
michael | 35 | 0.3% |
m | 33 | 0.2% |
과학기술부 | 31 | 0.2% |
l | 27 | 0.2% |
Other values (9538) | 13057 |
Most occurring characters
Value | Count | Frequency (%) |
3686 | 5.5% | |
a | 2156 | 3.2% |
e | 1830 | 2.7% |
, | 1756 | 2.6% |
i | 1623 | 2.4% |
r | 1469 | 2.2% |
n | 1436 | 2.2% |
o | 1322 | 2.0% |
국 | 1114 | 1.7% |
김 | 1092 | 1.6% |
Other values (905) | 49157 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 37087 | |
Lowercase Letter | 17385 | |
Uppercase Letter | 4185 | 6.3% |
Space Separator | 3686 | 5.5% |
Other Punctuation | 2619 | 3.9% |
Decimal Number | 1299 | 1.9% |
Dash Punctuation | 279 | 0.4% |
Open Punctuation | 41 | 0.1% |
Close Punctuation | 40 | 0.1% |
Math Symbol | 14 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
국 | 1114 | 3.0% |
김 | 1092 | 2.9% |
이 | 1061 | 2.9% |
원 | 957 | 2.6% |
한 | 937 | 2.5% |
정 | 898 | 2.4% |
구 | 826 | 2.2% |
연 | 814 | 2.2% |
회 | 630 | 1.7% |
기 | 575 | 1.6% |
Other values (788) | 28183 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 2156 | |
e | 1830 | |
i | 1623 | 9.3% |
r | 1469 | 8.4% |
n | 1436 | 8.3% |
o | 1322 | 7.6% |
l | 925 | 5.3% |
s | 862 | 5.0% |
t | 825 | 4.7% |
h | 791 | 4.5% |
Other values (38) | 4146 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 387 | 9.2% |
M | 355 | 8.5% |
C | 261 | 6.2% |
J | 253 | 6.0% |
D | 240 | 5.7% |
R | 239 | 5.7% |
H | 238 | 5.7% |
K | 228 | 5.4% |
B | 213 | 5.1% |
A | 207 | 4.9% |
Other values (25) | 1564 |
Other Punctuation
Value | Count | Frequency (%) |
, | 1756 | |
. | 814 | |
& | 14 | 0.5% |
' | 12 | 0.5% |
· | 12 | 0.5% |
: | 6 | 0.2% |
# | 2 | 0.1% |
& | 1 | < 0.1% |
* | 1 | < 0.1% |
@ | 1 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 346 | |
9 | 289 | |
5 | 115 | 8.9% |
6 | 104 | 8.0% |
4 | 99 | 7.6% |
7 | 83 | 6.4% |
2 | 77 | 5.9% |
0 | 74 | 5.7% |
3 | 57 | 4.4% |
8 | 55 | 4.2% |
Math Symbol
Value | Count | Frequency (%) |
> | 5 | |
< | 4 | |
+ | 2 | 14.3% |
= | 2 | 14.3% |
~ | 1 | 7.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 278 | |
― | 1 | 0.4% |
Close Punctuation
Value | Count | Frequency (%) |
) | 37 | |
] | 3 | 7.5% |
Open Punctuation
Value | Count | Frequency (%) |
( | 37 | |
[ | 4 | 9.8% |
Modifier Symbol
Value | Count | Frequency (%) |
^ | 5 | |
¨ | 1 | 16.7% |
Space Separator
Value | Count | Frequency (%) |
3686 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 36772 | |
Latin | 21502 | |
Common | 7984 | 12.0% |
Han | 302 | 0.5% |
Cyrillic | 68 | 0.1% |
Katakana | 11 | < 0.1% |
Hiragana | 2 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
국 | 1114 | 3.0% |
김 | 1092 | 3.0% |
이 | 1061 | 2.9% |
원 | 957 | 2.6% |
한 | 937 | 2.5% |
정 | 898 | 2.4% |
구 | 826 | 2.2% |
연 | 814 | 2.2% |
회 | 630 | 1.7% |
기 | 575 | 1.6% |
Other values (569) | 27868 |
Han
Value | Count | Frequency (%) |
金 | 7 | 2.3% |
會 | 6 | 2.0% |
世 | 5 | 1.7% |
大 | 5 | 1.7% |
所 | 4 | 1.3% |
編 | 4 | 1.3% |
譜 | 4 | 1.3% |
一 | 4 | 1.3% |
家 | 3 | 1.0% |
西 | 3 | 1.0% |
Other values (197) | 257 |
Latin
Value | Count | Frequency (%) |
a | 2156 | 10.0% |
e | 1830 | 8.5% |
i | 1623 | 7.5% |
r | 1469 | 6.8% |
n | 1436 | 6.7% |
o | 1322 | 6.1% |
l | 925 | 4.3% |
s | 862 | 4.0% |
t | 825 | 3.8% |
h | 791 | 3.7% |
Other values (43) | 8263 |
Common
Value | Count | Frequency (%) |
3686 | ||
, | 1756 | |
. | 814 | 10.2% |
1 | 346 | 4.3% |
9 | 289 | 3.6% |
- | 278 | 3.5% |
5 | 115 | 1.4% |
6 | 104 | 1.3% |
4 | 99 | 1.2% |
7 | 83 | 1.0% |
Other values (24) | 414 | 5.2% |
Cyrillic
Value | Count | Frequency (%) |
а | 9 | 13.2% |
в | 6 | 8.8% |
о | 6 | 8.8% |
н | 5 | 7.4% |
и | 4 | 5.9% |
л | 4 | 5.9% |
е | 3 | 4.4% |
д | 3 | 4.4% |
й | 3 | 4.4% |
А | 2 | 2.9% |
Other values (20) | 23 |
Katakana
Value | Count | Frequency (%) |
シ | 2 | |
ム | 1 | |
イ | 1 | |
マ | 1 | |
コ | 1 | |
ト | 1 | |
テ | 1 | |
ス | 1 | |
ン | 1 | |
カ | 1 |
Hiragana
Value | Count | Frequency (%) |
か | 1 | |
ほ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 36767 | |
ASCII | 29470 | |
CJK | 297 | 0.4% |
Cyrillic | 68 | 0.1% |
None | 15 | < 0.1% |
Katakana | 11 | < 0.1% |
CJK Compat Ideographs | 5 | < 0.1% |
Compat Jamo | 5 | < 0.1% |
Hiragana | 2 | < 0.1% |
Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3686 | 12.5% | |
a | 2156 | 7.3% |
e | 1830 | 6.2% |
, | 1756 | 6.0% |
i | 1623 | 5.5% |
r | 1469 | 5.0% |
n | 1436 | 4.9% |
o | 1322 | 4.5% |
l | 925 | 3.1% |
s | 862 | 2.9% |
Other values (72) | 12405 |
Hangul
Value | Count | Frequency (%) |
국 | 1114 | 3.0% |
김 | 1092 | 3.0% |
이 | 1061 | 2.9% |
원 | 957 | 2.6% |
한 | 937 | 2.5% |
정 | 898 | 2.4% |
구 | 826 | 2.2% |
연 | 814 | 2.2% |
회 | 630 | 1.7% |
기 | 575 | 1.6% |
Other values (565) | 27863 |
None
Value | Count | Frequency (%) |
· | 12 | |
¨ | 1 | 6.7% |
& | 1 | 6.7% |
æ | 1 | 6.7% |
Cyrillic
Value | Count | Frequency (%) |
а | 9 | 13.2% |
в | 6 | 8.8% |
о | 6 | 8.8% |
н | 5 | 7.4% |
и | 4 | 5.9% |
л | 4 | 5.9% |
е | 3 | 4.4% |
д | 3 | 4.4% |
й | 3 | 4.4% |
А | 2 | 2.9% |
Other values (20) | 23 |
CJK
Value | Count | Frequency (%) |
金 | 7 | 2.4% |
會 | 6 | 2.0% |
世 | 5 | 1.7% |
大 | 5 | 1.7% |
所 | 4 | 1.3% |
編 | 4 | 1.3% |
譜 | 4 | 1.3% |
一 | 4 | 1.3% |
家 | 3 | 1.0% |
西 | 3 | 1.0% |
Other values (193) | 252 |
Katakana
Value | Count | Frequency (%) |
シ | 2 | |
ム | 1 | |
イ | 1 | |
マ | 1 | |
コ | 1 | |
ト | 1 | |
テ | 1 | |
ス | 1 | |
ン | 1 | |
カ | 1 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
李 | 2 | |
勞 | 1 | |
麟 | 1 | |
歷 | 1 |
Compat Jamo
Value | Count | Frequency (%) |
ㄱ | 2 | |
ㅋ | 1 | |
ㄴ | 1 | |
ㅅ | 1 |
Punctuation
Value | Count | Frequency (%) |
― | 1 |
Hiragana
Value | Count | Frequency (%) |
か | 1 | |
ほ | 1 |
출판사
Text
Distinct | 4311 |
---|---|
Distinct (%) | 43.2% |
Missing | 17 |
Missing (%) | 0.2% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
博英社 | 111 | 1.0% |
法文社 | 74 | 0.7% |
과학기술부 | 65 | 0.6% |
출판부 | 64 | 0.6% |
학지사 | 61 | 0.6% |
정보통신부 | 61 | 0.6% |
한국학술정보 | 51 | 0.5% |
교육과학사 | 43 | 0.4% |
螢雪出版社 | 41 | 0.4% |
산업자원부 | 40 | 0.4% |
Other values (4514) | 10390 |
Most occurring characters
Value | Count | Frequency (%) |
사 | 2034 | 3.9% |
社 | 1161 | 2.2% |
1020 | 1.9% | |
한 | 977 | 1.9% |
원 | 969 | 1.8% |
국 | 957 | 1.8% |
문 | 948 | 1.8% |
학 | 941 | 1.8% |
스 | 874 | 1.7% |
文 | 614 | 1.2% |
Other values (1355) | 42041 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 47630 | |
Lowercase Letter | 1943 | 3.7% |
Space Separator | 1020 | 1.9% |
Uppercase Letter | 1000 | 1.9% |
Open Punctuation | 280 | 0.5% |
Close Punctuation | 279 | 0.5% |
Other Punctuation | 198 | 0.4% |
Decimal Number | 172 | 0.3% |
Dash Punctuation | 12 | < 0.1% |
Math Symbol | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 2034 | 4.3% |
社 | 1161 | 2.4% |
한 | 977 | 2.1% |
원 | 969 | 2.0% |
국 | 957 | 2.0% |
문 | 948 | 2.0% |
학 | 941 | 2.0% |
스 | 874 | 1.8% |
文 | 614 | 1.3% |
연 | 599 | 1.3% |
Other values (1271) | 37556 |
Lowercase Letter
Value | Count | Frequency (%) |
o | 215 | |
e | 205 | |
i | 193 | |
a | 181 | 9.3% |
n | 160 | 8.2% |
s | 139 | 7.2% |
r | 112 | 5.8% |
l | 104 | 5.4% |
t | 101 | 5.2% |
m | 69 | 3.6% |
Other values (23) | 464 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 136 | |
M | 98 | 9.8% |
C | 91 | 9.1% |
S | 78 | 7.8% |
E | 74 | 7.4% |
K | 65 | 6.5% |
A | 46 | 4.6% |
P | 45 | 4.5% |
O | 39 | 3.9% |
D | 35 | 3.5% |
Other values (14) | 293 |
Decimal Number
Value | Count | Frequency (%) |
2 | 71 | |
1 | 67 | |
0 | 17 | 9.9% |
4 | 5 | 2.9% |
3 | 3 | 1.7% |
9 | 3 | 1.7% |
8 | 2 | 1.2% |
5 | 2 | 1.2% |
6 | 1 | 0.6% |
7 | 1 | 0.6% |
Other Punctuation
Value | Count | Frequency (%) |
· | 57 | |
. | 47 | |
& | 38 | |
, | 25 | |
: | 12 | 6.1% |
& | 10 | 5.1% |
/ | 5 | 2.5% |
' | 2 | 1.0% |
* | 1 | 0.5% |
# | 1 | 0.5% |
Open Punctuation
Value | Count | Frequency (%) |
[ | 217 | |
( | 63 | 22.5% |
Close Punctuation
Value | Count | Frequency (%) |
] | 216 | |
) | 63 | 22.6% |
Space Separator
Value | Count | Frequency (%) |
1020 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 12 |
Math Symbol
Value | Count | Frequency (%) |
+ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 38458 | |
Han | 9162 | 17.4% |
Latin | 2921 | 5.6% |
Common | 1963 | 3.7% |
Cyrillic | 22 | < 0.1% |
Katakana | 10 | < 0.1% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
社 | 1161 | 12.7% |
文 | 614 | 6.7% |
出 | 278 | 3.0% |
版 | 275 | 3.0% |
學 | 260 | 2.8% |
化 | 235 | 2.6% |
英 | 181 | 2.0% |
大 | 175 | 1.9% |
法 | 164 | 1.8% |
硏 | 149 | 1.6% |
Other values (651) | 5670 |
Hangul
Value | Count | Frequency (%) |
사 | 2034 | 5.3% |
한 | 977 | 2.5% |
원 | 969 | 2.5% |
국 | 957 | 2.5% |
문 | 948 | 2.5% |
학 | 941 | 2.4% |
스 | 874 | 2.3% |
연 | 599 | 1.6% |
판 | 598 | 1.6% |
구 | 582 | 1.5% |
Other values (600) | 28979 |
Latin
Value | Count | Frequency (%) |
o | 215 | 7.4% |
e | 205 | 7.0% |
i | 193 | 6.6% |
a | 181 | 6.2% |
n | 160 | 5.5% |
s | 139 | 4.8% |
B | 136 | 4.7% |
r | 112 | 3.8% |
l | 104 | 3.6% |
t | 101 | 3.5% |
Other values (38) | 1375 |
Common
Value | Count | Frequency (%) |
1020 | ||
[ | 217 | 11.1% |
] | 216 | 11.0% |
2 | 71 | 3.6% |
1 | 67 | 3.4% |
) | 63 | 3.2% |
( | 63 | 3.2% |
· | 57 | 2.9% |
. | 47 | 2.4% |
& | 38 | 1.9% |
Other values (17) | 104 | 5.3% |
Katakana
Value | Count | Frequency (%) |
ン | 1 | |
ナ | 1 | |
ロ | 1 | |
コ | 1 | |
ハ | 1 | |
リ | 1 | |
ム | 1 | |
グ | 1 | |
ネ | 1 | |
ア | 1 |
Cyrillic
Value | Count | Frequency (%) |
п | 4 | |
н | 4 | |
а | 2 | |
у | 2 | |
К | 2 | |
р | 2 | |
й | 2 | |
л | 2 | |
ы | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 38454 | |
CJK | 9096 | 17.3% |
ASCII | 4817 | 9.2% |
None | 67 | 0.1% |
CJK Compat Ideographs | 66 | 0.1% |
Cyrillic | 22 | < 0.1% |
Katakana | 10 | < 0.1% |
Compat Jamo | 4 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
사 | 2034 | 5.3% |
한 | 977 | 2.5% |
원 | 969 | 2.5% |
국 | 957 | 2.5% |
문 | 948 | 2.5% |
학 | 941 | 2.4% |
스 | 874 | 2.3% |
연 | 599 | 1.6% |
판 | 598 | 1.6% |
구 | 582 | 1.5% |
Other values (599) | 28975 |
CJK
Value | Count | Frequency (%) |
社 | 1161 | 12.8% |
文 | 614 | 6.8% |
出 | 278 | 3.1% |
版 | 275 | 3.0% |
學 | 260 | 2.9% |
化 | 235 | 2.6% |
英 | 181 | 2.0% |
大 | 175 | 1.9% |
法 | 164 | 1.8% |
硏 | 149 | 1.6% |
Other values (621) | 5604 |
ASCII
Value | Count | Frequency (%) |
1020 | ||
[ | 217 | 4.5% |
] | 216 | 4.5% |
o | 215 | 4.5% |
e | 205 | 4.3% |
i | 193 | 4.0% |
a | 181 | 3.8% |
n | 160 | 3.3% |
s | 139 | 2.9% |
B | 136 | 2.8% |
Other values (63) | 2135 |
None
Value | Count | Frequency (%) |
· | 57 | |
& | 10 | 14.9% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
良 | 8 | 12.1% |
李 | 7 | 10.6% |
論 | 6 | 9.1% |
嶺 | 5 | 7.6% |
金 | 4 | 6.1% |
理 | 4 | 6.1% |
女 | 4 | 6.1% |
率 | 3 | 4.5% |
梨 | 2 | 3.0% |
勞 | 2 | 3.0% |
Other values (20) | 21 |
Cyrillic
Value | Count | Frequency (%) |
п | 4 | |
н | 4 | |
а | 2 | |
у | 2 | |
К | 2 | |
р | 2 | |
й | 2 | |
л | 2 | |
ы | 2 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 4 |
Katakana
Value | Count | Frequency (%) |
ン | 1 | |
ナ | 1 | |
ロ | 1 | |
コ | 1 | |
ハ | 1 | |
リ | 1 | |
ム | 1 | |
グ | 1 | |
ネ | 1 | |
ア | 1 |
출판년도
Text
Distinct | 93 |
---|---|
Distinct (%) | 0.9% |
Missing | 38 |
Missing (%) | 0.4% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
2010 | 507 | 5.1% |
2009 | 463 | 4.6% |
2012 | 430 | 4.3% |
2014 | 405 | 4.1% |
2006 | 404 | 4.1% |
2007 | 401 | 4.0% |
2005 | 399 | 4.0% |
2011 | 382 | 3.8% |
2008 | 379 | 3.8% |
2004 | 367 | 3.7% |
Other values (82) | 5825 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 11218 | |
2 | 7672 | |
1 | 7260 | |
9 | 6017 | |
8 | 1970 | 4.9% |
7 | 1417 | 3.6% |
4 | 1126 | 2.8% |
6 | 1120 | 2.8% |
5 | 1074 | 2.7% |
3 | 947 | 2.4% |
Other values (4) | 7 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 39821 | |
Dash Punctuation | 3 | < 0.1% |
Uppercase Letter | 2 | < 0.1% |
Lowercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 11218 | |
2 | 7672 | |
1 | 7260 | |
9 | 6017 | |
8 | 1970 | 4.9% |
7 | 1417 | 3.6% |
4 | 1126 | 2.8% |
6 | 1120 | 2.8% |
5 | 1074 | 2.7% |
3 | 947 | 2.4% |
Lowercase Letter
Value | Count | Frequency (%) |
u | 1 | |
s | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 |
Uppercase Letter
Value | Count | Frequency (%) |
U | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 39824 | |
Latin | 4 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 11218 | |
2 | 7672 | |
1 | 7260 | |
9 | 6017 | |
8 | 1970 | 4.9% |
7 | 1417 | 3.6% |
4 | 1126 | 2.8% |
6 | 1120 | 2.8% |
5 | 1074 | 2.7% |
3 | 947 | 2.4% |
Latin
Value | Count | Frequency (%) |
U | 2 | |
u | 1 | |
s | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 39828 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 11218 | |
2 | 7672 | |
1 | 7260 | |
9 | 6017 | |
8 | 1970 | 4.9% |
7 | 1417 | 3.6% |
4 | 1126 | 2.8% |
6 | 1120 | 2.8% |
5 | 1074 | 2.7% |
3 | 947 | 2.4% |
Other values (4) | 7 | < 0.1% |
분류번호
Text
MISSING
 
Distinct | 3276 |
---|---|
Distinct (%) | 33.1% |
Missing | 109 |
Missing (%) | 1.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
688.2 | 262 | 2.6% |
813.6 | 137 | 1.4% |
325.1 | 85 | 0.9% |
814.6 | 73 | 0.7% |
811.6 | 65 | 0.7% |
325.04 | 58 | 0.6% |
28.64 | 56 | 0.6% |
4.76 | 53 | 0.5% |
818 | 48 | 0.5% |
320.1 | 48 | 0.5% |
Other values (3266) | 9007 |
Most occurring characters
Value | Count | Frequency (%) |
. | 8294 | |
3 | 6576 | |
1 | 6434 | |
5 | 5087 | |
2 | 4687 | |
0 | 4070 | |
6 | 3800 | |
8 | 3745 | |
7 | 3597 | |
9 | 3444 | |
Other values (3) | 3247 | 6.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 44685 | |
Other Punctuation | 8295 | 15.7% |
Space Separator | 1 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
3 | 6576 | |
1 | 6434 | |
5 | 5087 | |
2 | 4687 | |
0 | 4070 | |
6 | 3800 | |
8 | 3745 | |
7 | 3597 | |
9 | 3444 | |
4 | 3245 |
Other Punctuation
Value | Count | Frequency (%) |
. | 8294 | |
, | 1 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 52981 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 8294 | |
3 | 6576 | |
1 | 6434 | |
5 | 5087 | |
2 | 4687 | |
0 | 4070 | |
6 | 3800 | |
8 | 3745 | |
7 | 3597 | |
9 | 3444 | |
Other values (3) | 3247 | 6.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 52981 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 8294 | |
3 | 6576 | |
1 | 6434 | |
5 | 5087 | |
2 | 4687 | |
0 | 4070 | |
6 | 3800 | |
8 | 3745 | |
7 | 3597 | |
9 | 3444 | |
Other values (3) | 3247 | 6.1% |
서지번호 | 유형 | 출판년도 | |
---|---|---|---|
서지번호 | 1.000 | 0.191 | 0.939 |
유형 | 0.191 | 1.000 | 0.761 |
출판년도 | 0.939 | 0.761 | 1.000 |
서지번호 | 유형 | |
---|---|---|
서지번호 | 1.000 | 0.122 |
유형 | 0.122 | 1.000 |
서지번호 | 유형 | 서명 | 저자 | 출판사 | 출판년도 | 분류번호 | |
---|---|---|---|---|---|---|---|
65128 | 285701 | 국내단행본 | TCP/IP 네트워킹 | Martin, James | 이한 | 1998 | 5.4 |
73126 | 430753 | 국내단행본 | 건축설계이론 | 조영호 | 예문사 | 2007 | 542.1 |
27044 | 153979 | 국내단행본 | (새로운)財務管理論 | 노덕환 | 學文社 | 1991 | 325.8 |
19930 | 244004 | 국내단행본 | (단재)신채호 | 외솔회 | 정음문화사 | 1989 | 991.17 |
94723 | 212169 | 국내단행본 | 軍改革 이렇게 해야 한다 | 서효일 | 백암 | 1995 | 390 |
32256 | 2182127 | 국내단행본 | (실험과 도전,) 식민지의 심연 | 이상, 1910-1937 | 민음사 | 2010 | 810.906 |
18846 | 452080 | 국내단행본 | (내 손으로 받는) 우리 종자 | 안완식, 1942- | 들녘 | 2007 | 523.22 |
40338 | 44839 | 국내단행본 | (長篇小說)그 少年의 첫사랑 | Wouk, Herman | 乙酉文化社 | 1956 | 843 |
19507 | 2980699 | 국내단행본 | (누리과정과 연계한) 창의적 전통놀이 | 임혜수 | 창지사 | 2016 | 375.1 |
57443 | 348907 | 국내단행본 | After effects 5 | 이병현 | 사이버출판사 | 2001 | 4.76 |
서지번호 | 유형 | 서명 | 저자 | 출판사 | 출판년도 | 분류번호 | |
---|---|---|---|---|---|---|---|
19158 | 452841 | 국내단행본 | (노인복지를 위한) 노인영양관리 | 이병순 | 광문각 | 2007 | 594.1 |
77638 | 413527 | 국내단행본 | 계량정보분석을 위한 프로그래밍 활용사례연구 | 한국과학기술정보연구원 | 한국과학기술정보연구원 | 2005 | 3.56 |
32273 | 197330 | 국내단행본 | (實話集)狼虎血戰記 | 구소청 | 朝洋社 | 1953 | 813.7 |
17958 | 87986 | 국내단행본 | (김수용 운명소설)命 | 김수용 | 玄岩社 | 1989 | 813.6 |
8208 | 2205691 | 국내단행본 | (Again!)뒤집어본 영문법 | 오성호 | 김영사 | 2006 | 745 |
31180 | 166666 | 국내단행본 | (新制)作物生理學 | 박종성 | 鄕文社 | 1994 | 524 |
92427 | 2542252 | 국내단행본 | 국어과 교과서론 | 주세형, 1973- | 사회평론 | 2014 | 374.71 |
60432 | 437071 | 국내단행본 | GoF의 디자인 패턴 | Gamma, Erich | 피어슨에듀케이션코리아 | 2007 | 5.115 |
76341 | 151404 | 국내단행본 | 경제사강의 | 한경민 | 두리 | 1989 | 320.9 |
28574 | 249357 | 국내단행본 | (소설)마쓰시타 | Kosaka Jiro | 매일경제신문사 | 1995 | 813.6 |