Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 50 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 722.7 KiB |
Average record size in memory | 74.0 B |
Variable types
Numeric | 2 |
---|---|
Text | 5 |
Categorical | 1 |
Dataset
Description | 한국예술종합학교 교내도서관 도서정보 |
---|---|
Author | 문화체육관광부 한국예술종합학교 |
URL | https://www.data.go.kr/data/3069958/fileData.do |
No. is highly overall correlated with 제어번호 and 1 other fields | High correlation |
제어번호 is highly overall correlated with No. and 1 other fields | High correlation |
별치기호 is highly overall correlated with No. and 1 other fields | High correlation |
별치기호 is highly imbalanced (91.9%) | Imbalance |
No. has unique values | Unique |
제어번호 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 21:37:50.489216 |
---|---|
Analysis finished | 2023-12-12 21:37:55.249302 |
Duration | 4.76 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
No.
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19723.609 |
Minimum | 2 |
---|---|
Maximum | 39681 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 1935.95 |
Q1 | 10001.75 |
median | 19747.5 |
Q3 | 29490.5 |
95-th percentile | 37642.1 |
Maximum | 39681 |
Range | 39679 |
Interquartile range (IQR) | 19488.75 |
Descriptive statistics
Standard deviation | 11407.502 |
---|---|
Coefficient of variation (CV) | 0.57836785 |
Kurtosis | -1.1789247 |
Mean | 19723.609 |
Median Absolute Deviation (MAD) | 9745 |
Skewness | 0.011226358 |
Sum | 1.9723609 × 108 |
Variance | 1.3013109 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
16082 | 1 | < 0.1% |
13354 | 1 | < 0.1% |
12412 | 1 | < 0.1% |
2490 | 1 | < 0.1% |
26634 | 1 | < 0.1% |
18724 | 1 | < 0.1% |
4616 | 1 | < 0.1% |
11866 | 1 | < 0.1% |
21036 | 1 | < 0.1% |
24165 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
2 | 1 | |
3 | 1 | |
4 | 1 | |
6 | 1 | |
7 | 1 | |
9 | 1 | |
10 | 1 | |
16 | 1 | |
22 | 1 | |
24 | 1 |
Value | Count | Frequency (%) |
39681 | 1 | |
39675 | 1 | |
39671 | 1 | |
39667 | 1 | |
39660 | 1 | |
39655 | 1 | |
39648 | 1 | |
39646 | 1 | |
39641 | 1 | |
39639 | 1 |
제어번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 200006.2 |
Minimum | 154890 |
---|---|
Maximum | 241601 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 154890 |
---|---|
5-th percentile | 163799.9 |
Q1 | 182310.25 |
median | 198643.5 |
Q3 | 216506.75 |
95-th percentile | 237138.1 |
Maximum | 241601 |
Range | 86711 |
Interquartile range (IQR) | 34196.5 |
Descriptive statistics
Standard deviation | 23042.25 |
---|---|
Coefficient of variation (CV) | 0.11520768 |
Kurtosis | -1.0225168 |
Mean | 200006.2 |
Median Absolute Deviation (MAD) | 17076.5 |
Skewness | 0.069894885 |
Sum | 2.000062 × 109 |
Variance | 5.309453 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
178517 | 1 | < 0.1% |
184562 | 1 | < 0.1% |
189552 | 1 | < 0.1% |
164852 | 1 | < 0.1% |
212383 | 1 | < 0.1% |
186071 | 1 | < 0.1% |
172036 | 1 | < 0.1% |
188346 | 1 | < 0.1% |
198772 | 1 | < 0.1% |
215209 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
154890 | 1 | |
154891 | 1 | |
154893 | 1 | |
154897 | 1 | |
154902 | 1 | |
154953 | 1 | |
154961 | 1 | |
154962 | 1 | |
154964 | 1 | |
154972 | 1 |
Value | Count | Frequency (%) |
241601 | 1 | |
241598 | 1 | |
241595 | 1 | |
241592 | 1 | |
241587 | 1 | |
241570 | 1 | |
241568 | 1 | |
241564 | 1 | |
241562 | 1 | |
241546 | 1 |
서명
Text
Distinct | 9968 |
---|---|
Distinct (%) | 99.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 298 |
---|---|
Median length | 153 |
Mean length | 26.7457 |
Min length | 1 |
Characters and Unicode
Total characters | 267457 |
---|---|
Distinct characters | 2395 |
Distinct categories | 16 ? |
Distinct scripts | 7 ? |
Distinct blocks | 14 ? |
Unique
Unique | 9939 ? |
---|---|
Unique (%) | 99.4% |
Sample
1st row | (우리 인문학의 자긍심) 김수영을 위하여 |
---|---|
2nd row | Tomboy style |
3rd row | 나는 여기가 좋다: 한창훈 소설 |
4th row | 세렐렘 : 나더쉬 피테르 중편소설 |
5th row | The Tao of physics : an exploration of the parallels between modern physics and Eastern mysticism |
Value | Count | Frequency (%) |
4468 | 8.0% | |
the | 1446 | 2.6% |
of | 891 | 1.6% |
and | 750 | 1.3% |
in | 414 | 0.7% |
a | 331 | 0.6% |
art | 279 | 0.5% |
장편소설 | 233 | 0.4% |
to | 213 | 0.4% |
design | 202 | 0.4% |
Other values (23457) | 46727 |
Most occurring characters
Value | Count | Frequency (%) |
45954 | 17.2% | |
e | 12323 | 4.6% |
a | 9011 | 3.4% |
i | 8666 | 3.2% |
t | 8478 | 3.2% |
n | 8194 | 3.1% |
o | 7872 | 2.9% |
r | 7848 | 2.9% |
s | 6680 | 2.5% |
: | 5001 | 1.9% |
Other values (2385) | 147430 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 103787 | |
Other Letter | 93510 | |
Space Separator | 45954 | |
Other Punctuation | 8824 | 3.3% |
Uppercase Letter | 7322 | 2.7% |
Decimal Number | 5226 | 2.0% |
Open Punctuation | 987 | 0.4% |
Close Punctuation | 985 | 0.4% |
Dash Punctuation | 610 | 0.2% |
Math Symbol | 229 | 0.1% |
Other values (6) | 23 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 3580 | 3.8% |
이 | 2005 | 2.1% |
사 | 1452 | 1.6% |
한 | 1388 | 1.5% |
기 | 1386 | 1.5% |
는 | 1335 | 1.4% |
가 | 1212 | 1.3% |
리 | 1128 | 1.2% |
인 | 1076 | 1.2% |
지 | 1028 | 1.1% |
Other values (2228) | 77920 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 12323 | |
a | 9011 | 8.7% |
i | 8666 | 8.3% |
t | 8478 | 8.2% |
n | 8194 | 7.9% |
o | 7872 | 7.6% |
r | 7848 | 7.6% |
s | 6680 | 6.4% |
l | 4347 | 4.2% |
h | 4095 | 3.9% |
Other values (46) | 26273 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 774 | 10.6% |
A | 633 | 8.6% |
S | 539 | 7.4% |
C | 517 | 7.1% |
M | 440 | 6.0% |
D | 366 | 5.0% |
P | 365 | 5.0% |
B | 361 | 4.9% |
I | 334 | 4.6% |
F | 290 | 4.0% |
Other values (30) | 2703 |
Other Punctuation
Value | Count | Frequency (%) |
: | 5001 | |
, | 1698 | 19.2% |
? | 631 | 7.2% |
. | 536 | 6.1% |
' | 298 | 3.4% |
· | 234 | 2.7% |
& | 126 | 1.4% |
! | 119 | 1.3% |
/ | 65 | 0.7% |
; | 53 | 0.6% |
Other values (10) | 63 | 0.7% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1317 | |
0 | 1246 | |
2 | 717 | |
9 | 480 | 9.2% |
3 | 312 | 6.0% |
5 | 305 | 5.8% |
4 | 241 | 4.6% |
8 | 220 | 4.2% |
6 | 201 | 3.8% |
7 | 187 | 3.6% |
Math Symbol
Value | Count | Frequency (%) |
= | 132 | |
~ | 39 | 17.0% |
+ | 36 | 15.7% |
> | 7 | 3.1% |
< | 7 | 3.1% |
| | 4 | 1.7% |
≪ | 1 | 0.4% |
≫ | 1 | 0.4% |
+ | 1 | 0.4% |
| | 1 | 0.4% |
Close Punctuation
Value | Count | Frequency (%) |
) | 927 | |
] | 23 | 2.3% |
』 | 22 | 2.2% |
」 | 10 | 1.0% |
》 | 3 | 0.3% |
Open Punctuation
Value | Count | Frequency (%) |
( | 927 | |
[ | 25 | 2.5% |
『 | 22 | 2.2% |
「 | 10 | 1.0% |
《 | 3 | 0.3% |
Other Symbol
Value | Count | Frequency (%) |
★ | 3 | |
° | 2 | |
™ | 2 | |
│ | 1 | 12.5% |
Space Separator
Value | Count | Frequency (%) |
45954 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 610 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 5 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 4 |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 3 |
Currency Symbol
Value | Count | Frequency (%) |
$ | 2 |
Control
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 110745 | |
Hangul | 89250 | |
Common | 62838 | |
Han | 3495 | 1.3% |
Katakana | 470 | 0.2% |
Cyrillic | 364 | 0.1% |
Hiragana | 295 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
의 | 3580 | 4.0% |
이 | 2005 | 2.2% |
사 | 1452 | 1.6% |
한 | 1388 | 1.6% |
기 | 1386 | 1.6% |
는 | 1335 | 1.5% |
가 | 1212 | 1.4% |
리 | 1128 | 1.3% |
인 | 1076 | 1.2% |
지 | 1028 | 1.2% |
Other values (1156) | 73660 |
Han
Value | Count | Frequency (%) |
國 | 79 | 2.3% |
中 | 78 | 2.2% |
文 | 78 | 2.2% |
集 | 62 | 1.8% |
代 | 48 | 1.4% |
史 | 47 | 1.3% |
學 | 45 | 1.3% |
化 | 41 | 1.2% |
志 | 34 | 1.0% |
術 | 31 | 0.9% |
Other values (937) | 2952 |
Katakana
Value | Count | Frequency (%) |
ン | 35 | 7.4% |
イ | 29 | 6.2% |
ス | 28 | 6.0% |
ラ | 20 | 4.3% |
タ | 19 | 4.0% |
フ | 18 | 3.8% |
リ | 17 | 3.6% |
ィ | 14 | 3.0% |
デ | 14 | 3.0% |
ッ | 14 | 3.0% |
Other values (59) | 262 |
Common
Value | Count | Frequency (%) |
45954 | ||
: | 5001 | 8.0% |
, | 1698 | 2.7% |
1 | 1317 | 2.1% |
0 | 1246 | 2.0% |
) | 927 | 1.5% |
( | 927 | 1.5% |
2 | 717 | 1.1% |
? | 631 | 1.0% |
- | 610 | 1.0% |
Other values (51) | 3810 | 6.1% |
Hiragana
Value | Count | Frequency (%) |
の | 55 | |
と | 16 | 5.4% |
か | 15 | 5.1% |
を | 13 | 4.4% |
る | 13 | 4.4% |
た | 12 | 4.1% |
い | 12 | 4.1% |
し | 10 | 3.4% |
き | 9 | 3.1% |
に | 8 | 2.7% |
Other values (46) | 132 |
Latin
Value | Count | Frequency (%) |
e | 12323 | 11.1% |
a | 9011 | 8.1% |
i | 8666 | 7.8% |
t | 8478 | 7.7% |
n | 8194 | 7.4% |
o | 7872 | 7.1% |
r | 7848 | 7.1% |
s | 6680 | 6.0% |
l | 4347 | 3.9% |
h | 4095 | 3.7% |
Other values (43) | 33231 |
Cyrillic
Value | Count | Frequency (%) |
и | 37 | 10.2% |
а | 31 | 8.5% |
о | 28 | 7.7% |
с | 26 | 7.1% |
к | 24 | 6.6% |
р | 23 | 6.3% |
е | 22 | 6.0% |
в | 17 | 4.7% |
н | 15 | 4.1% |
л | 14 | 3.8% |
Other values (33) | 127 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 173238 | |
Hangul | 89238 | |
CJK | 3435 | 1.3% |
Katakana | 470 | 0.2% |
Cyrillic | 364 | 0.1% |
None | 322 | 0.1% |
Hiragana | 295 | 0.1% |
CJK Compat Ideographs | 60 | < 0.1% |
Punctuation | 15 | < 0.1% |
Compat Jamo | 12 | < 0.1% |
Other values (4) | 8 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
45954 | ||
e | 12323 | 7.1% |
a | 9011 | 5.2% |
i | 8666 | 5.0% |
t | 8478 | 4.9% |
n | 8194 | 4.7% |
o | 7872 | 4.5% |
r | 7848 | 4.5% |
s | 6680 | 3.9% |
: | 5001 | 2.9% |
Other values (81) | 53211 |
Hangul
Value | Count | Frequency (%) |
의 | 3580 | 4.0% |
이 | 2005 | 2.2% |
사 | 1452 | 1.6% |
한 | 1388 | 1.6% |
기 | 1386 | 1.6% |
는 | 1335 | 1.5% |
가 | 1212 | 1.4% |
리 | 1128 | 1.3% |
인 | 1076 | 1.2% |
지 | 1028 | 1.2% |
Other values (1152) | 73648 |
None
Value | Count | Frequency (%) |
· | 234 | |
』 | 22 | 6.8% |
『 | 22 | 6.8% |
」 | 10 | 3.1% |
「 | 10 | 3.1% |
& | 8 | 2.5% |
》 | 3 | 0.9% |
《 | 3 | 0.9% |
、 | 2 | 0.6% |
° | 2 | 0.6% |
Other values (5) | 6 | 1.9% |
CJK
Value | Count | Frequency (%) |
國 | 79 | 2.3% |
中 | 78 | 2.3% |
文 | 78 | 2.3% |
集 | 62 | 1.8% |
代 | 48 | 1.4% |
史 | 47 | 1.4% |
學 | 45 | 1.3% |
化 | 41 | 1.2% |
志 | 34 | 1.0% |
術 | 31 | 0.9% |
Other values (906) | 2892 |
Hiragana
Value | Count | Frequency (%) |
の | 55 | |
と | 16 | 5.4% |
か | 15 | 5.1% |
を | 13 | 4.4% |
る | 13 | 4.4% |
た | 12 | 4.1% |
い | 12 | 4.1% |
し | 10 | 3.4% |
き | 9 | 3.1% |
に | 8 | 2.7% |
Other values (46) | 132 |
Cyrillic
Value | Count | Frequency (%) |
и | 37 | 10.2% |
а | 31 | 8.5% |
о | 28 | 7.7% |
с | 26 | 7.1% |
к | 24 | 6.6% |
р | 23 | 6.3% |
е | 22 | 6.0% |
в | 17 | 4.7% |
н | 15 | 4.1% |
л | 14 | 3.8% |
Other values (33) | 127 |
Katakana
Value | Count | Frequency (%) |
ン | 35 | 7.4% |
イ | 29 | 6.2% |
ス | 28 | 6.0% |
ラ | 20 | 4.3% |
タ | 19 | 4.0% |
フ | 18 | 3.8% |
リ | 17 | 3.6% |
ィ | 14 | 3.0% |
デ | 14 | 3.0% |
ッ | 14 | 3.0% |
Other values (59) | 262 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 8 | |
ㄹ | 2 | 16.7% |
ㅎ | 1 | 8.3% |
ㅅ | 1 | 8.3% |
Punctuation
Value | Count | Frequency (%) |
… | 7 | |
’ | 5 | |
‘ | 3 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
論 | 7 | 11.7% |
李 | 6 | 10.0% |
理 | 5 | 8.3% |
禮 | 4 | 6.7% |
歷 | 4 | 6.7% |
老 | 3 | 5.0% |
女 | 3 | 5.0% |
茶 | 2 | 3.3% |
年 | 2 | 3.3% |
六 | 2 | 3.3% |
Other values (21) | 22 |
Misc Symbols
Value | Count | Frequency (%) |
★ | 3 |
Letterlike Symbols
Value | Count | Frequency (%) |
™ | 2 |
Math Operators
Value | Count | Frequency (%) |
≪ | 1 | |
≫ | 1 |
Box Drawing
Value | Count | Frequency (%) |
│ | 1 |
저자
Text
Distinct | 9281 |
---|---|
Distinct (%) | 93.1% |
Missing | 33 |
Missing (%) | 0.3% |
Memory size | 156.2 KiB |
Length
Max length | 148 |
---|---|
Median length | 78 |
Mean length | 14.780476 |
Min length | 1 |
Characters and Unicode
Total characters | 147317 |
---|---|
Distinct characters | 1729 |
Distinct categories | 15 ? |
Distinct scripts | 7 ? |
Distinct blocks | 12 ? |
Unique
Unique | 8815 ? |
---|---|
Unique (%) | 88.4% |
Sample
1st row | 강신주 지음 ; 김서연 만듦 |
---|---|
2nd row | Garrett Mettler, Lizzie |
3rd row | 한창훈 지음 |
4th row | 나더쉬 피테르 지음 ; 김보국 옮김 |
5th row | Capra, Fritjof |
Value | Count | Frequency (%) |
지음 | 4330 | 12.4% |
2575 | 7.3% | |
옮김 | 2193 | 6.3% |
저 | 371 | 1.1% |
편 | 308 | 0.9% |
공]지음 | 280 | 0.8% |
엮음 | 194 | 0.6% |
외 | 184 | 0.5% |
著 | 181 | 0.5% |
글 | 178 | 0.5% |
Other values (14375) | 24242 |
Most occurring characters
Value | Count | Frequency (%) |
25091 | 17.0% | |
지 | 5173 | 3.5% |
음 | 4962 | 3.4% |
e | 4217 | 2.9% |
a | 4156 | 2.8% |
, | 4070 | 2.8% |
김 | 3809 | 2.6% |
; | 3517 | 2.4% |
n | 3226 | 2.2% |
i | 3183 | 2.2% |
Other values (1719) | 85913 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 65914 | |
Lowercase Letter | 35543 | |
Space Separator | 25091 | 17.0% |
Other Punctuation | 9981 | 6.8% |
Uppercase Letter | 7852 | 5.3% |
Close Punctuation | 1344 | 0.9% |
Open Punctuation | 1344 | 0.9% |
Dash Punctuation | 148 | 0.1% |
Decimal Number | 70 | < 0.1% |
Math Symbol | 24 | < 0.1% |
Other values (5) | 6 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 5173 | 7.8% |
음 | 4962 | 7.5% |
김 | 3809 | 5.8% |
옮 | 2388 | 3.6% |
이 | 2013 | 3.1% |
스 | 994 | 1.5% |
정 | 840 | 1.3% |
공 | 736 | 1.1% |
리 | 704 | 1.1% |
영 | 693 | 1.1% |
Other values (1614) | 43602 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 4217 | |
a | 4156 | |
n | 3226 | |
i | 3183 | 9.0% |
r | 3173 | 8.9% |
o | 2511 | 7.1% |
l | 2229 | 6.3% |
t | 1769 | 5.0% |
s | 1682 | 4.7% |
h | 1363 | 3.8% |
Other values (27) | 8034 |
Uppercase Letter
Value | Count | Frequency (%) |
M | 700 | 8.9% |
S | 696 | 8.9% |
B | 525 | 6.7% |
A | 516 | 6.6% |
J | 507 | 6.5% |
C | 499 | 6.4% |
R | 411 | 5.2% |
D | 397 | 5.1% |
H | 383 | 4.9% |
G | 379 | 4.8% |
Other values (18) | 2839 |
Other Punctuation
Value | Count | Frequency (%) |
, | 4070 | |
; | 3517 | |
. | 1541 | 15.4% |
: | 322 | 3.2% |
? | 309 | 3.1% |
· | 176 | 1.8% |
' | 22 | 0.2% |
& | 11 | 0.1% |
/ | 10 | 0.1% |
" | 2 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 19 | |
2 | 12 | |
0 | 11 | |
9 | 7 | 10.0% |
3 | 5 | 7.1% |
5 | 5 | 7.1% |
7 | 4 | 5.7% |
8 | 3 | 4.3% |
4 | 3 | 4.3% |
6 | 1 | 1.4% |
Math Symbol
Value | Count | Frequency (%) |
< | 9 | |
> | 9 | |
+ | 5 | |
| | 1 | 4.2% |
Close Punctuation
Value | Count | Frequency (%) |
] | 1306 | |
) | 37 | 2.8% |
》 | 1 | 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
[ | 1306 | |
( | 37 | 2.8% |
《 | 1 | 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 143 | |
― | 5 | 3.4% |
Other Symbol
Value | Count | Frequency (%) |
│ | 1 | |
▲ | 1 |
Space Separator
Value | Count | Frequency (%) |
25091 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Initial Punctuation
Value | Count | Frequency (%) |
“ | 1 |
Final Punctuation
Value | Count | Frequency (%) |
” | 1 |
Control
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 63068 | |
Latin | 43379 | |
Common | 38008 | |
Han | 2557 | 1.7% |
Katakana | 250 | 0.2% |
Hiragana | 39 | < 0.1% |
Cyrillic | 16 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 5173 | 8.2% |
음 | 4962 | 7.9% |
김 | 3809 | 6.0% |
옮 | 2388 | 3.8% |
이 | 2013 | 3.2% |
스 | 994 | 1.6% |
정 | 840 | 1.3% |
공 | 736 | 1.2% |
리 | 704 | 1.1% |
영 | 693 | 1.1% |
Other values (819) | 40756 |
Han
Value | Count | Frequency (%) |
著 | 222 | 8.7% |
編 | 151 | 5.9% |
主 | 66 | 2.6% |
譯 | 61 | 2.4% |
文 | 40 | 1.6% |
撰 | 36 | 1.4% |
中 | 29 | 1.1% |
金 | 26 | 1.0% |
集 | 21 | 0.8% |
博 | 18 | 0.7% |
Other values (709) | 1887 |
Katakana
Value | Count | Frequency (%) |
ス | 16 | 6.4% |
イ | 13 | 5.2% |
タ | 12 | 4.8% |
ジ | 11 | 4.4% |
リ | 11 | 4.4% |
ッ | 11 | 4.4% |
フ | 11 | 4.4% |
ン | 10 | 4.0% |
ル | 10 | 4.0% |
ド | 10 | 4.0% |
Other values (47) | 135 |
Latin
Value | Count | Frequency (%) |
e | 4217 | 9.7% |
a | 4156 | 9.6% |
n | 3226 | 7.4% |
i | 3183 | 7.3% |
r | 3173 | 7.3% |
o | 2511 | 5.8% |
l | 2229 | 5.1% |
t | 1769 | 4.1% |
s | 1682 | 3.9% |
h | 1363 | 3.1% |
Other values (43) | 15870 |
Common
Value | Count | Frequency (%) |
25091 | ||
, | 4070 | 10.7% |
; | 3517 | 9.3% |
. | 1541 | 4.1% |
] | 1306 | 3.4% |
[ | 1306 | 3.4% |
: | 322 | 0.8% |
? | 309 | 0.8% |
· | 176 | 0.5% |
- | 143 | 0.4% |
Other values (30) | 227 | 0.6% |
Hiragana
Value | Count | Frequency (%) |
き | 6 | |
ま | 5 | |
の | 4 | |
た | 4 | |
か | 3 | 7.7% |
ひ | 2 | 5.1% |
ち | 2 | 5.1% |
こ | 2 | 5.1% |
ほ | 1 | 2.6% |
ゆ | 1 | 2.6% |
Other values (9) | 9 |
Cyrillic
Value | Count | Frequency (%) |
и | 4 | |
о | 2 | |
н | 1 | 6.2% |
З | 1 | 6.2% |
л | 1 | 6.2% |
т | 1 | 6.2% |
ц | 1 | 6.2% |
к | 1 | 6.2% |
Д | 1 | 6.2% |
а | 1 | 6.2% |
Other values (2) | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 81198 | |
Hangul | 63067 | |
CJK | 2525 | 1.7% |
Katakana | 250 | 0.2% |
None | 180 | 0.1% |
Hiragana | 39 | < 0.1% |
CJK Compat Ideographs | 32 | < 0.1% |
Cyrillic | 16 | < 0.1% |
Punctuation | 7 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Other values (2) | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
25091 | ||
e | 4217 | 5.2% |
a | 4156 | 5.1% |
, | 4070 | 5.0% |
; | 3517 | 4.3% |
n | 3226 | 4.0% |
i | 3183 | 3.9% |
r | 3173 | 3.9% |
o | 2511 | 3.1% |
l | 2229 | 2.7% |
Other values (73) | 25825 |
Hangul
Value | Count | Frequency (%) |
지 | 5173 | 8.2% |
음 | 4962 | 7.9% |
김 | 3809 | 6.0% |
옮 | 2388 | 3.8% |
이 | 2013 | 3.2% |
스 | 994 | 1.6% |
정 | 840 | 1.3% |
공 | 736 | 1.2% |
리 | 704 | 1.1% |
영 | 693 | 1.1% |
Other values (818) | 40755 |
CJK
Value | Count | Frequency (%) |
著 | 222 | 8.8% |
編 | 151 | 6.0% |
主 | 66 | 2.6% |
譯 | 61 | 2.4% |
文 | 40 | 1.6% |
撰 | 36 | 1.4% |
中 | 29 | 1.1% |
金 | 26 | 1.0% |
集 | 21 | 0.8% |
博 | 18 | 0.7% |
Other values (695) | 1855 |
None
Value | Count | Frequency (%) |
· | 176 | |
| | 1 | 0.6% |
ł | 1 | 0.6% |
《 | 1 | 0.6% |
》 | 1 | 0.6% |
Katakana
Value | Count | Frequency (%) |
ス | 16 | 6.4% |
イ | 13 | 5.2% |
タ | 12 | 4.8% |
ジ | 11 | 4.4% |
リ | 11 | 4.4% |
ッ | 11 | 4.4% |
フ | 11 | 4.4% |
ン | 10 | 4.0% |
ル | 10 | 4.0% |
ド | 10 | 4.0% |
Other values (47) | 135 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
李 | 16 | |
麗 | 2 | 6.2% |
盧 | 2 | 6.2% |
劉 | 2 | 6.2% |
良 | 1 | 3.1% |
龍 | 1 | 3.1% |
呂 | 1 | 3.1% |
林 | 1 | 3.1% |
沈 | 1 | 3.1% |
羅 | 1 | 3.1% |
Other values (4) | 4 | 12.5% |
Hiragana
Value | Count | Frequency (%) |
き | 6 | |
ま | 5 | |
の | 4 | |
た | 4 | |
か | 3 | 7.7% |
ひ | 2 | 5.1% |
ち | 2 | 5.1% |
こ | 2 | 5.1% |
ほ | 1 | 2.6% |
ゆ | 1 | 2.6% |
Other values (9) | 9 |
Punctuation
Value | Count | Frequency (%) |
― | 5 | |
“ | 1 | 14.3% |
” | 1 | 14.3% |
Cyrillic
Value | Count | Frequency (%) |
и | 4 | |
о | 2 | |
н | 1 | 6.2% |
З | 1 | 6.2% |
л | 1 | 6.2% |
т | 1 | 6.2% |
ц | 1 | 6.2% |
к | 1 | 6.2% |
Д | 1 | 6.2% |
а | 1 | 6.2% |
Other values (2) | 2 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 1 |
Box Drawing
Value | Count | Frequency (%) |
│ | 1 |
Geometric Shapes
Value | Count | Frequency (%) |
▲ | 1 |
출판사
Text
Distinct | 3917 |
---|---|
Distinct (%) | 39.2% |
Missing | 15 |
Missing (%) | 0.1% |
Memory size | 156.2 KiB |
Length
Max length | 132 |
---|---|
Median length | 107 |
Mean length | 9.615323 |
Min length | 1 |
Characters and Unicode
Total characters | 96009 |
---|---|
Distinct characters | 1013 |
Distinct categories | 13 ? |
Distinct scripts | 7 ? |
Distinct blocks | 10 ? |
Unique
Unique | 2621 ? |
---|---|
Unique (%) | 26.2% |
Sample
1st row | 천년의상상 |
---|---|
2nd row | Rizzoli International Publications |
3rd row | 문학동네 |
4th row | arte: 북이십일 아르테 |
5th row | Shambhala |
Value | Count | Frequency (%) |
press | 686 | 4.1% |
university | 436 | 2.6% |
395 | 2.4% | |
of | 213 | 1.3% |
books | 210 | 1.3% |
문학동네 | 158 | 0.9% |
art | 142 | 0.9% |
pub | 109 | 0.7% |
distributed | 96 | 0.6% |
publishers | 96 | 0.6% |
Other values (3986) | 14104 |
Most occurring characters
Value | Count | Frequency (%) |
6663 | 6.9% | |
e | 5051 | 5.3% |
i | 4398 | 4.6% |
s | 4333 | 4.5% |
r | 4180 | 4.4% |
n | 3390 | 3.5% |
a | 3278 | 3.4% |
o | 3269 | 3.4% |
t | 3172 | 3.3% |
l | 2204 | 2.3% |
Other values (1003) | 56071 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 44739 | |
Other Letter | 33081 | |
Uppercase Letter | 8769 | 9.1% |
Space Separator | 6663 | 6.9% |
Other Punctuation | 2229 | 2.3% |
Decimal Number | 236 | 0.2% |
Close Punctuation | 99 | 0.1% |
Open Punctuation | 96 | 0.1% |
Dash Punctuation | 87 | 0.1% |
Math Symbol | 6 | < 0.1% |
Other values (3) | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 1233 | 3.7% |
스 | 1031 | 3.1% |
문 | 949 | 2.9% |
학 | 807 | 2.4% |
북 | 773 | 2.3% |
이 | 646 | 2.0% |
한 | 553 | 1.7% |
지 | 484 | 1.5% |
화 | 457 | 1.4% |
미 | 420 | 1.3% |
Other values (909) | 25728 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 5051 | |
i | 4398 | |
s | 4333 | |
r | 4180 | |
n | 3390 | 7.6% |
a | 3278 | 7.3% |
o | 3269 | 7.3% |
t | 3172 | 7.1% |
l | 2204 | 4.9% |
u | 1822 | 4.1% |
Other values (25) | 9642 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 1588 | |
A | 664 | 7.6% |
C | 609 | 6.9% |
B | 566 | 6.5% |
U | 546 | 6.2% |
M | 511 | 5.8% |
S | 466 | 5.3% |
D | 434 | 4.9% |
H | 415 | 4.7% |
T | 319 | 3.6% |
Other values (19) | 2651 |
Other Punctuation
Value | Count | Frequency (%) |
: | 737 | |
. | 482 | |
, | 303 | |
? | 213 | 9.6% |
; | 197 | 8.8% |
& | 173 | 7.8% |
/ | 74 | 3.3% |
' | 31 | 1.4% |
& | 9 | 0.4% |
· | 9 | 0.4% |
Decimal Number
Value | Count | Frequency (%) |
1 | 91 | |
2 | 86 | |
0 | 26 | 11.0% |
4 | 9 | 3.8% |
3 | 6 | 2.5% |
6 | 5 | 2.1% |
5 | 5 | 2.1% |
9 | 4 | 1.7% |
8 | 4 | 1.7% |
Close Punctuation
Value | Count | Frequency (%) |
] | 89 | |
) | 10 | 10.1% |
Open Punctuation
Value | Count | Frequency (%) |
[ | 86 | |
( | 10 | 10.4% |
Space Separator
Value | Count | Frequency (%) |
6663 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 87 |
Math Symbol
Value | Count | Frequency (%) |
+ | 6 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 2 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 1 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 53495 | |
Hangul | 30601 | |
Common | 9419 | 9.8% |
Han | 2290 | 2.4% |
Katakana | 190 | 0.2% |
Cyrillic | 13 | < 0.1% |
Hiragana | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
사 | 1233 | 4.0% |
스 | 1031 | 3.4% |
문 | 949 | 3.1% |
학 | 807 | 2.6% |
북 | 773 | 2.5% |
이 | 646 | 2.1% |
한 | 553 | 1.8% |
지 | 484 | 1.6% |
화 | 457 | 1.5% |
미 | 420 | 1.4% |
Other values (575) | 23248 |
Han
Value | Count | Frequency (%) |
社 | 287 | 12.5% |
出 | 218 | 9.5% |
版 | 216 | 9.4% |
文 | 98 | 4.3% |
海 | 72 | 3.1% |
上 | 71 | 3.1% |
學 | 64 | 2.8% |
民 | 59 | 2.6% |
人 | 55 | 2.4% |
大 | 38 | 1.7% |
Other values (276) | 1112 |
Latin
Value | Count | Frequency (%) |
e | 5051 | 9.4% |
i | 4398 | 8.2% |
s | 4333 | 8.1% |
r | 4180 | 7.8% |
n | 3390 | 6.3% |
a | 3278 | 6.1% |
o | 3269 | 6.1% |
t | 3172 | 5.9% |
l | 2204 | 4.1% |
u | 1822 | 3.4% |
Other values (43) | 18398 |
Katakana
Value | Count | Frequency (%) |
ン | 15 | 7.9% |
イ | 14 | 7.4% |
ナ | 13 | 6.8% |
ク | 12 | 6.3% |
ル | 10 | 5.3% |
ッ | 10 | 5.3% |
ョ | 9 | 4.7% |
シ | 9 | 4.7% |
ス | 9 | 4.7% |
タ | 8 | 4.2% |
Other values (38) | 81 |
Common
Value | Count | Frequency (%) |
6663 | ||
: | 737 | 7.8% |
. | 482 | 5.1% |
, | 303 | 3.2% |
? | 213 | 2.3% |
; | 197 | 2.1% |
& | 173 | 1.8% |
1 | 91 | 1.0% |
] | 89 | 0.9% |
- | 87 | 0.9% |
Other values (19) | 384 | 4.1% |
Cyrillic
Value | Count | Frequency (%) |
а | 3 | |
о | 1 | 7.7% |
Н | 1 | 7.7% |
т | 1 | 7.7% |
и | 1 | 7.7% |
В | 1 | 7.7% |
в | 1 | 7.7% |
А | 1 | 7.7% |
г | 1 | 7.7% |
р | 1 | 7.7% |
Hiragana
Value | Count | Frequency (%) |
の | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 62892 | |
Hangul | 30596 | |
CJK | 2288 | 2.4% |
Katakana | 190 | 0.2% |
None | 21 | < 0.1% |
Cyrillic | 13 | < 0.1% |
Compat Jamo | 4 | < 0.1% |
Punctuation | 2 | < 0.1% |
CJK Compat Ideographs | 2 | < 0.1% |
Hiragana | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
6663 | 10.6% | |
e | 5051 | 8.0% |
i | 4398 | 7.0% |
s | 4333 | 6.9% |
r | 4180 | 6.6% |
n | 3390 | 5.4% |
a | 3278 | 5.2% |
o | 3269 | 5.2% |
t | 3172 | 5.0% |
l | 2204 | 3.5% |
Other values (68) | 22954 |
Hangul
Value | Count | Frequency (%) |
사 | 1233 | 4.0% |
스 | 1031 | 3.4% |
문 | 949 | 3.1% |
학 | 807 | 2.6% |
북 | 773 | 2.5% |
이 | 646 | 2.1% |
한 | 553 | 1.8% |
지 | 484 | 1.6% |
화 | 457 | 1.5% |
미 | 420 | 1.4% |
Other values (573) | 23243 |
CJK
Value | Count | Frequency (%) |
社 | 287 | 12.5% |
出 | 218 | 9.5% |
版 | 216 | 9.4% |
文 | 98 | 4.3% |
海 | 72 | 3.1% |
上 | 71 | 3.1% |
學 | 64 | 2.8% |
民 | 59 | 2.6% |
人 | 55 | 2.4% |
大 | 38 | 1.7% |
Other values (274) | 1110 |
Katakana
Value | Count | Frequency (%) |
ン | 15 | 7.9% |
イ | 14 | 7.4% |
ナ | 13 | 6.8% |
ク | 12 | 6.3% |
ル | 10 | 5.3% |
ッ | 10 | 5.3% |
ョ | 9 | 4.7% |
シ | 9 | 4.7% |
ス | 9 | 4.7% |
タ | 8 | 4.2% |
Other values (38) | 81 |
None
Value | Count | Frequency (%) |
& | 9 | |
· | 9 | |
ı | 2 | 9.5% |
㈜ | 1 | 4.8% |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 4 |
Cyrillic
Value | Count | Frequency (%) |
а | 3 | |
о | 1 | 7.7% |
Н | 1 | 7.7% |
т | 1 | 7.7% |
и | 1 | 7.7% |
В | 1 | 7.7% |
в | 1 | 7.7% |
А | 1 | 7.7% |
г | 1 | 7.7% |
р | 1 | 7.7% |
Punctuation
Value | Count | Frequency (%) |
’ | 2 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
六 | 1 | |
寧 | 1 |
Hiragana
Value | Count | Frequency (%) |
の | 1 |
출판년
Text
Distinct | 117 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
2012 | 1817 | |
2014 | 1280 | |
2013 | 1255 | |
2011 | 1179 | |
2010 | 984 | |
2015 | 692 | 6.9% |
2009 | 499 | 5.0% |
2008 | 258 | 2.6% |
2007 | 246 | 2.5% |
2006 | 203 | 2.0% |
Other values (101) | 1586 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 12380 | |
2 | 11313 | |
1 | 9472 | |
9 | 1781 | 4.4% |
4 | 1478 | 3.7% |
3 | 1469 | 3.6% |
5 | 939 | 2.3% |
8 | 546 | 1.4% |
6 | 429 | 1.1% |
7 | 416 | 1.0% |
Other values (4) | 74 | 0.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 40223 | |
Dash Punctuation | 67 | 0.2% |
Space Separator | 5 | < 0.1% |
Other Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 12380 | |
2 | 11313 | |
1 | 9472 | |
9 | 1781 | 4.4% |
4 | 1478 | 3.7% |
3 | 1469 | 3.7% |
5 | 939 | 2.3% |
8 | 546 | 1.4% |
6 | 429 | 1.1% |
7 | 416 | 1.0% |
Other Letter
Value | Count | Frequency (%) |
미 | 1 | |
상 | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 67 |
Space Separator
Value | Count | Frequency (%) |
5 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 40295 | |
Hangul | 2 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 12380 | |
2 | 11313 | |
1 | 9472 | |
9 | 1781 | 4.4% |
4 | 1478 | 3.7% |
3 | 1469 | 3.6% |
5 | 939 | 2.3% |
8 | 546 | 1.4% |
6 | 429 | 1.1% |
7 | 416 | 1.0% |
Other values (2) | 72 | 0.2% |
Hangul
Value | Count | Frequency (%) |
미 | 1 | |
상 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 40295 | |
Hangul | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 12380 | |
2 | 11313 | |
1 | 9472 | |
9 | 1781 | 4.4% |
4 | 1478 | 3.7% |
3 | 1469 | 3.6% |
5 | 939 | 2.3% |
8 | 546 | 1.4% |
6 | 429 | 1.1% |
7 | 416 | 1.0% |
Other values (2) | 72 | 0.2% |
Hangul
Value | Count | Frequency (%) |
미 | 1 | |
상 | 1 |
별치기호
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
100 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.97 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9900 | |
100 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9900 |
청구기호
Text
Distinct | 9694 |
---|---|
Distinct (%) | 97.0% |
Missing | 2 |
Missing (%) | < 0.1% |
Memory size | 156.2 KiB |
Length
Max length | 20 |
---|---|
Median length | 18 |
Mean length | 11.181736 |
Min length | 1 |
Characters and Unicode
Total characters | 111795 |
---|---|
Distinct characters | 603 |
Distinct categories | 9 ? |
Distinct scripts | 4 ? |
Distinct blocks | 4 ? |
Unique
Unique | 9575 ? |
---|---|
Unique (%) | 95.8% |
Sample
1st row | 811.609 강59ㄱ |
---|---|
2nd row | 746.92 G239t |
3rd row | 813.6 한811나 |
4th row | 839.61 피833ㅅ |
5th row | 530.01 C251t |
Value | Count | Frequency (%) |
813.6 | 208 | 1.0% |
이종호 | 195 | 1.0% |
843 | 179 | 0.9% |
818 | 147 | 0.7% |
833.6 | 121 | 0.6% |
709.2 | 93 | 0.5% |
811.6 | 93 | 0.5% |
814.6 | 91 | 0.4% |
658 | 73 | 0.4% |
812.6 | 69 | 0.3% |
Other values (12143) | 19043 |
Most occurring characters
Value | Count | Frequency (%) |
10320 | 9.2% | |
1 | 9908 | 8.9% |
3 | 8390 | 7.5% |
2 | 8134 | 7.3% |
. | 8116 | 7.3% |
9 | 7986 | 7.1% |
8 | 7710 | 6.9% |
7 | 6761 | 6.0% |
6 | 6611 | 5.9% |
5 | 5975 | 5.3% |
Other values (593) | 31884 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 72584 | |
Other Letter | 14712 | 13.2% |
Space Separator | 10320 | 9.2% |
Other Punctuation | 8239 | 7.4% |
Uppercase Letter | 2970 | 2.7% |
Lowercase Letter | 2960 | 2.6% |
Dash Punctuation | 8 | < 0.1% |
Open Punctuation | 1 | < 0.1% |
Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
ㅇ | 1141 | 7.8% |
이 | 813 | 5.5% |
ㅅ | 810 | 5.5% |
김 | 725 | 4.9% |
ㄱ | 696 | 4.7% |
ㅈ | 554 | 3.8% |
ㅎ | 504 | 3.4% |
ㅁ | 452 | 3.1% |
ㅂ | 406 | 2.8% |
ㄷ | 381 | 2.6% |
Other values (525) | 8230 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 286 | 9.6% |
S | 286 | 9.6% |
M | 244 | 8.2% |
C | 190 | 6.4% |
H | 171 | 5.8% |
G | 166 | 5.6% |
K | 164 | 5.5% |
P | 147 | 4.9% |
D | 146 | 4.9% |
L | 141 | 4.7% |
Other values (16) | 1029 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 255 | 8.6% |
c | 243 | 8.2% |
s | 230 | 7.8% |
p | 207 | 7.0% |
m | 192 | 6.5% |
d | 170 | 5.7% |
t | 163 | 5.5% |
b | 140 | 4.7% |
f | 137 | 4.6% |
r | 129 | 4.4% |
Other values (16) | 1094 |
Decimal Number
Value | Count | Frequency (%) |
1 | 9908 | |
3 | 8390 | |
2 | 8134 | |
9 | 7986 | |
8 | 7710 | |
7 | 6761 | |
6 | 6611 | |
5 | 5975 | |
4 | 5861 | |
0 | 5248 |
Other Punctuation
Value | Count | Frequency (%) |
. | 8116 | |
/ | 123 | 1.5% |
Space Separator
Value | Count | Frequency (%) |
10320 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 8 |
Open Punctuation
Value | Count | Frequency (%) |
[ | 1 |
Close Punctuation
Value | Count | Frequency (%) |
] | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 91153 | |
Hangul | 14711 | 13.2% |
Latin | 5930 | 5.3% |
Han | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
ㅇ | 1141 | 7.8% |
이 | 813 | 5.5% |
ㅅ | 810 | 5.5% |
김 | 725 | 4.9% |
ㄱ | 696 | 4.7% |
ㅈ | 554 | 3.8% |
ㅎ | 504 | 3.4% |
ㅁ | 452 | 3.1% |
ㅂ | 406 | 2.8% |
ㄷ | 381 | 2.6% |
Other values (524) | 8229 |
Latin
Value | Count | Frequency (%) |
B | 286 | 4.8% |
S | 286 | 4.8% |
a | 255 | 4.3% |
M | 244 | 4.1% |
c | 243 | 4.1% |
s | 230 | 3.9% |
p | 207 | 3.5% |
m | 192 | 3.2% |
C | 190 | 3.2% |
H | 171 | 2.9% |
Other values (42) | 3626 |
Common
Value | Count | Frequency (%) |
10320 | ||
1 | 9908 | |
3 | 8390 | |
2 | 8134 | |
. | 8116 | |
9 | 7986 | |
8 | 7710 | |
7 | 6761 | |
6 | 6611 | |
5 | 5975 | |
Other values (6) | 11242 |
Han
Value | Count | Frequency (%) |
外 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 97083 | |
Hangul | 8612 | 7.7% |
Compat Jamo | 6099 | 5.5% |
CJK | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
10320 | ||
1 | 9908 | |
3 | 8390 | |
2 | 8134 | |
. | 8116 | |
9 | 7986 | |
8 | 7710 | |
7 | 6761 | 7.0% |
6 | 6611 | 6.8% |
5 | 5975 | 6.2% |
Other values (58) | 17172 |
Compat Jamo
Value | Count | Frequency (%) |
ㅇ | 1141 | |
ㅅ | 810 | |
ㄱ | 696 | |
ㅈ | 554 | |
ㅎ | 504 | |
ㅁ | 452 | 7.4% |
ㅂ | 406 | 6.7% |
ㄷ | 381 | 6.2% |
ㄴ | 311 | 5.1% |
ㅍ | 211 | 3.5% |
Other values (9) | 633 |
Hangul
Value | Count | Frequency (%) |
이 | 813 | 9.4% |
김 | 725 | 8.4% |
호 | 221 | 2.6% |
박 | 213 | 2.5% |
정 | 205 | 2.4% |
종 | 200 | 2.3% |
한 | 193 | 2.2% |
조 | 152 | 1.8% |
최 | 143 | 1.7% |
오 | 120 | 1.4% |
Other values (505) | 5627 |
CJK
Value | Count | Frequency (%) |
外 | 1 |
No. | 제어번호 | |
---|---|---|
No. | 1.000 | 0.941 |
제어번호 | 0.941 | 1.000 |
No. | 제어번호 | 별치기호 | |
---|---|---|---|
No. | 1.000 | 0.955 | 1.000 |
제어번호 | 0.955 | 1.000 | 1.000 |
별치기호 | 1.000 | 1.000 | 1.000 |
No. | 제어번호 | 서명 | 저자 | 출판사 | 출판년 | 별치기호 | 청구기호 | |
---|---|---|---|---|---|---|---|---|
16081 | 16082 | 178517 | (우리 인문학의 자긍심) 김수영을 위하여 | 강신주 지음 ; 김서연 만듦 | 천년의상상 | 2012 | <NA> | 811.609 강59ㄱ |
14043 | 14044 | 181827 | Tomboy style | Garrett Mettler, Lizzie | Rizzoli International Publications | 2012 | <NA> | 746.92 G239t |
4496 | 4497 | 171989 | 나는 여기가 좋다: 한창훈 소설 | 한창훈 지음 | 문학동네 | 2009 | <NA> | 813.6 한811나 |
28365 | 28366 | 212919 | 세렐렘 : 나더쉬 피테르 중편소설 | 나더쉬 피테르 지음 ; 김보국 옮김 | arte: 북이십일 아르테 | 2014 | <NA> | 839.61 피833ㅅ |
18095 | 18096 | 192280 | The Tao of physics : an exploration of the parallels between modern physics and Eastern mysticism | Capra, Fritjof | Shambhala | 2010 | <NA> | 530.01 C251t |
25858 | 25859 | 203361 | Cultivate | 小嶋一浩; 赤松佳珠子 [共]著 | TOTO出版 | 2007 | <NA> | 720 코79ㅋ 이종호 |
33849 | 33850 | 228814 | Creativity : the magic synthesis | Arieti, Silvano | Basic Books | 1976 | <NA> | 153.35 A698c |
8569 | 8570 | 181750 | (창조가 쉬워지는) 모방의 힘 | 김남국 지음 | 위즈덤하우스 | 2012 | <NA> | 325.1 김211ㅁ |
22322 | 22323 | 209682 | 現代美國史 | A. 모로와 著; 申相楚 譯 | 三星文化財團 | 1975 | <NA> | 942.07 모235ㅎ |
22604 | 22605 | 203356 | 청주연초제조창의 문화적 재생 : 모더니즘 동시대성 | 도코모모코리아 편저 | 하나 | 2013 | <NA> | 542.1 도825 이종호 |
No. | 제어번호 | 서명 | 저자 | 출판사 | 출판년 | 별치기호 | 청구기호 | |
---|---|---|---|---|---|---|---|---|
25524 | 25525 | 212647 | (달라이 라마) 행복의 지혜 : 지치고, 상처받은 이들이 마음의 평화를 키우는 법 | 달라이 라마, 빅터 챈 [공]지음 ; 진우기 옮김 | 반니 | 2014 | <NA> | 229 달231ㅎ |
28482 | 28483 | 205415 | 학문의 구조사전 | Valis Deux 저; 오상혁 역 | 더난출판사 | 1996 | <NA> | 001 발239ㅎ 이종호 |
17728 | 17729 | 190628 | 내 얘기를 들어줄 단 한 사람이 있다면 | 조우성 지음 | 리더스북 | 2013 | <NA> | 818 조67내 |
33061 | 33062 | 218493 | 적멸을 위하여 : 조오현문학전집 | [지은이: 조오현] ; 권영민 엮음 | 문학사상 | 2012 | <NA> | 811.7 조65ㅈ |
14673 | 14674 | 187426 | (톨스토이 인생론) 자아의 발견 | 톨스토이 지음 ; 함현규 옮김 | 빛과향기 | 2012 | <NA> | 199.1 톨58ㅈ |
27544 | 27545 | 207653 | 월정사 | 한상길 지음 | 대한불교진흥원 | 2009 | <NA> | 226.911 한17 |
26343 | 26344 | 207323 | 박하사탕 : 한국현대미술 | 국립현대미술관 [편] | 국립현대미술관 | 2007 | <NA> | 606.3 국239박 |
17689 | 17690 | 191844 | 서양미술사 속에는 서양미술이 있다 | 박우찬 지음 | 재원 | 2012 | <NA> | 609.2 박67ㅅ/개정 |
7744 | 7745 | 180979 | 1984년 | 조지 오웰 지음 ; 권진아 옮김 | 을유문화사 | 2012 | <NA> | 843 오67ㅊ 권진아 |
36897 | 36898 | 239529 | 휴게소 | 글: 정미진 ; 그림: 구자선 | 엣눈북스 | 2016 | <NA> | 818 정39ㅎ |