Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 10000 |
Missing cells (%) | 14.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 673.8 KiB |
Average record size in memory | 69.0 B |
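The two derived figures in this table can be checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 7
n_observations = 10_000
missing_cells = 10_000
total_size_kib = 673.8

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Under these assumptions both values agree with the table (14.3% and 69.0 B). All 10000 missing cells come from the single fully empty column, CONTENTS_ENG.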
Variable types
Text | 2 |
---|---|
Unsupported | 1 |
Categorical | 2 |
Numeric | 2 |
Dataset
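Description | Table-of-contents information for research outputs of the Overseas Korean Studies Support Program of the Academy of Korean Studies (한국학중앙연구원) |
---|---|
Author | Academy of Korean Studies (한국학중앙연구원) |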
URL | https://www.data.go.kr/data/15049070/fileData.do |
Alerts
PAGE_NO_PAPER has constant value "" | Constant |
PAGE_NO_PDF has constant value "" | Constant |
LEVEL is highly overall correlated with CONTENTS_ORDER | High correlation |
CONTENTS_ORDER is highly overall correlated with LEVEL | High correlation |
CONTENTS_ENG has 10000 (100.0%) missing values | Missing |
CONTENTS_ENG is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-12 05:41:26.107300 |
---|---|
Analysis finished | 2023-12-12 05:41:28.445338 |
Duration | 2.34 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
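The reported duration follows from the two timestamps. A minimal check using only the standard library; the format string is an assumption about how the timestamps are written:

```python
from datetime import datetime

# Assumed timestamp layout: date, time, and microseconds.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 05:41:26.107300", FMT)
finished = datetime.strptime("2023-12-12 05:41:28.445338", FMT)

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```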
CATALOG_ID
Text
Distinct | 1980 |
---|---|
Distinct (%) | 19.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
09c02 | 78 | 0.8% |
09c05 | 72 | 0.7% |
06c17 | 63 | 0.6% |
07c06 | 62 | 0.6% |
10r41 | 48 | 0.5% |
08c09 | 48 | 0.5% |
06c15 | 44 | 0.4% |
07c09 | 42 | 0.4% |
07c15 | 42 | 0.4% |
09c12 | 40 | 0.4% |
Other values (1970) | 9461 | 94.6% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 32813 | 38.2% |
1 | 8998 | 10.5% |
_ | 7174 | 8.4% |
C | 6549 | 7.6% |
6 | 4837 | 5.6% |
7 | 4361 | 5.1% |
9 | 3945 | 4.6% |
2 | 3741 | 4.4% |
8 | 2724 | 3.2% |
5 | 2607 | 3.0% |
Other values (8) | 8151 | 9.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 68696 | 80.0% |
Uppercase Letter | 9998 | 11.6% |
Connector Punctuation | 7174 | 8.4% |
Lowercase Letter | 32 | < 0.1% |
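The category names in this table correspond to Unicode general categories. As a rough illustration (not necessarily how the profiler computes them), the standard `unicodedata` module can produce the same kind of tally. The three identifiers are taken from the sample rows of this report; the helper name and the code-to-label mapping are illustrative.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the labels used in the report.
CATEGORY_LABELS = {
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Pc": "Connector Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts

# A few CATALOG_ID values that appear in the sample rows.
sample_ids = ["07C06_0062", "06C10", "07P01_0005"]
counts = category_counts(sample_ids)
for label, count in counts.most_common():
    print(label, count)
```

On these three identifiers digits dominate, followed by uppercase letters and underscores, which mirrors the ordering in the full table.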
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 32813 | 47.8% |
1 | 8998 | 13.1% |
6 | 4837 | 7.0% |
7 | 4361 | 6.3% |
9 | 3945 | 5.7% |
2 | 3741 | 5.4% |
8 | 2724 | 4.0% |
5 | 2607 | 3.8% |
3 | 2425 | 3.5% |
4 | 2245 | 3.3% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 6549 | 65.5% |
R | 1813 | 18.1% |
P | 1589 | 15.9% |
S | 47 | 0.5% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 17 | 53.1% |
b | 13 | 40.6% |
t | 2 | 6.2% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 7174 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 75870 | 88.3% |
Latin | 10030 | 11.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 32813 | 43.2% |
1 | 8998 | 11.9% |
_ | 7174 | 9.5% |
6 | 4837 | 6.4% |
7 | 4361 | 5.7% |
9 | 3945 | 5.2% |
2 | 3741 | 4.9% |
8 | 2724 | 3.6% |
5 | 2607 | 3.4% |
3 | 2425 | 3.2% |
Latin
Value | Count | Frequency (%) |
C | 6549 | 65.3% |
R | 1813 | 18.1% |
P | 1589 | 15.8% |
S | 47 | 0.5% |
a | 17 | 0.2% |
b | 13 | 0.1% |
t | 2 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 85900 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 32813 | 38.2% |
1 | 8998 | 10.5% |
_ | 7174 | 8.4% |
C | 6549 | 7.6% |
6 | 4837 | 5.6% |
7 | 4361 | 5.1% |
9 | 3945 | 4.6% |
2 | 3741 | 4.4% |
8 | 2724 | 3.2% |
5 | 2607 | 3.0% |
Other values (8) | 8151 | 9.5% |
CONTENTS_ORI
Text
Distinct | 7878 |
---|---|
Distinct (%) | 78.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 213 |
---|---|
Median length | 146 |
Mean length | 29.8417 |
Min length | 1 |
Characters and Unicode
Total characters | 298417 |
---|---|
Distinct characters | 2981 |
Distinct categories | 20 |
Distinct scripts | 11 |
Distinct blocks | 22 |
Unique
Unique | 7029 |
---|---|
Unique (%) | 70.3% |
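"Distinct" and "Unique" measure different things here: 7878 different strings occur in the column, but only 7029 of them occur exactly once. The sketch below assumes those usual definitions (distinct = number of different values, unique = values with a count of one); the helper and the toy headings are hypothetical.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Toy table-of-contents headings; "Introduction" repeats across catalogs.
toy = ["Introduction", "Conclusion", "Introduction", "References", "Abstract"]
distinct, unique = distinct_and_unique(toy)
print(distinct, unique)
```

The gap between the two figures in the report (7878 versus 7029) would then correspond to headings that recur across entries.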
Sample
1st row | 4. 『朝鮮策略』의 原文 校勘 |
---|---|
2nd row | The LeftPeriphery Structure of Korean in Minimalist Syntax |
3rd row | 한국에서의 우즈베키스탄 노동자와 아내 |
4th row | 2-lc) Income Distribution and Poverty |
5th row | Ⅱ. Centгal Asia before 15th centuгy |
Value | Count | Frequency (%) |
the | 1587 | 3.2% |
of | 1326 | 2.7% |
and | 1114 | 2.3% |
in | 967 | 2.0% |
korean | 737 | 1.5% |
1 | 650 | 1.3% |
2 | 648 | 1.3% |
3 | 554 | 1.1% |
4 | 411 | 0.8% |
(value not rendered) | 411 | 0.8% |
Other values (14820) | 40831 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 39289 | 13.2% |
e | 14843 | 5.0% |
n | 12878 | 4.3% |
o | 12021 | 4.0% |
i | 11484 | 3.8% |
a | 11253 | 3.8% |
t | 10399 | 3.5% |
r | 8966 | 3.0% |
s | 7972 | 2.7% |
c | 4900 | 1.6% |
Other values (2971) | 164412 | 55.1% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 144266 | 48.3% |
Other Letter | 65641 | 22.0% |
Space Separator | 39293 | 13.2% |
Uppercase Letter | 29238 | 9.8% |
Other Punctuation | 7383 | 2.5% |
Decimal Number | 7293 | 2.4% |
Dash Punctuation | 1374 | 0.5% |
Close Punctuation | 1230 | 0.4% |
Open Punctuation | 1014 | 0.3% |
Final Punctuation | 658 | 0.2% |
Other values (10) | 1027 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 2644 | 4.0% |
한 | 1191 | 1.8% |
문 | 935 | 1.4% |
국 | 913 | 1.4% |
어 | 825 | 1.3% |
的 | 821 | 1.3% |
과 | 805 | 1.2% |
에 | 696 | 1.1% |
대 | 695 | 1.1% |
사 | 632 | 1.0% |
Other values (2600) | 55484 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 14843 | 10.3% |
n | 12878 | 8.9% |
o | 12021 | 8.3% |
i | 11484 | 8.0% |
a | 11253 | 7.8% |
t | 10399 | 7.2% |
r | 8966 | 6.2% |
s | 7972 | 5.5% |
c | 4900 | 3.4% |
l | 4851 | 3.4% |
Other values (126) | 44699 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 1870 | 6.4% |
I | 1785 | 6.1% |
T | 1758 | 6.0% |
S | 1737 | 5.9% |
A | 1624 | 5.6% |
K | 1570 | 5.4% |
N | 1397 | 4.8% |
R | 1330 | 4.5% |
E | 1254 | 4.3% |
P | 1127 | 3.9% |
Other values (112) | 13786 |
Other Punctuation
Value | Count | Frequency (%) |
. | 4416 | |
: | 945 | 12.8% |
, | 503 | 6.8% |
' | 462 | 6.3% |
、 | 459 | 6.2% |
? | 186 | 2.5% |
· | 99 | 1.3% |
/ | 92 | 1.2% |
, | 83 | 1.1% |
。 | 22 | 0.3% |
Other values (14) | 116 | 1.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1680 | |
2 | 1680 | |
3 | 1173 | |
4 | 776 | |
5 | 500 | 6.9% |
0 | 435 | 6.0% |
9 | 409 | 5.6% |
6 | 280 | 3.8% |
7 | 178 | 2.4% |
8 | 177 | 2.4% |
Other values (5) | 5 | 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 823 | |
』 | 148 | 12.0% |
》 | 117 | 9.5% |
」 | 67 | 5.4% |
〉 | 28 | 2.3% |
] | 27 | 2.2% |
) | 10 | 0.8% |
】 | 5 | 0.4% |
} | 3 | 0.2% |
〛 | 1 | 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 613 | |
『 | 149 | 14.7% |
《 | 115 | 11.3% |
「 | 65 | 6.4% |
[ | 26 | 2.6% |
〈 | 24 | 2.4% |
( | 10 | 1.0% |
【 | 5 | 0.5% |
{ | 3 | 0.3% |
⟪ | 3 | 0.3% |
Nonspacing Mark
Value | Count | Frequency (%) |
ี | 13 | |
ิ | 12 | |
ั | 9 | |
้ | 6 | |
ุ | 4 | 7.7% |
̣ | 2 | 3.8% |
ู | 2 | 3.8% |
่ | 1 | 1.9% |
ื | 1 | 1.9% |
์ | 1 | 1.9% |
Math Symbol
Value | Count | Frequency (%) |
< | 67 | |
> | 65 | |
~ | 25 | 11.6% |
≪ | 19 | 8.8% |
≫ | 19 | 8.8% |
+ | 10 | 4.6% |
= | 5 | 2.3% |
− | 3 | 1.4% |
∅ | 2 | 0.9% |
| | 1 | 0.5% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 48 | |
Ⅲ | 45 | |
Ⅰ | 44 | |
Ⅳ | 31 | |
Ⅴ | 15 | 8.0% |
Ⅵ | 5 | 2.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1343 | |
– | 23 | 1.7% |
- | 4 | 0.3% |
― | 2 | 0.1% |
‐ | 2 | 0.1% |
Initial Punctuation
Value | Count | Frequency (%) |
“ | 280 | |
‘ | 189 | |
« | 39 | 7.7% |
‛ | 1 | 0.2% |
Final Punctuation
Value | Count | Frequency (%) |
’ | 336 | |
” | 285 | |
» | 37 | 5.6% |
Other Symbol
Value | Count | Frequency (%) |
┌ | 6 | |
™ | 1 | 12.5% |
┐ | 1 | 12.5% |
Space Separator
Value | Count | Frequency (%) |
(space) | 39289 | > 99.9% |
(other space character) | 4 | < 0.1% |
Modifier Symbol
Value | Count | Frequency (%) |
` | 17 | |
˳ | 1 | 5.6% |
Other Number
Value | Count | Frequency (%) |
① | 3 | |
② | 1 | 25.0% |
Private Use
Value | Count | Frequency (%) |
| 1 | |
| 1 |
Modifier Letter
Value | Count | Frequency (%) |
ー | 20 |
Control
Value | Count | Frequency (%) |
10 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 152314 | 51.0% |
Common | 59030 | 19.8% |
Hangul | 41974 | 14.1% |
Han | 22111 | 7.4% |
Cyrillic | 21376 | 7.2% |
Hiragana | 1045 | 0.4% |
Thai | 384 | 0.1% |
Katakana | 176 | 0.1% |
Greek | 3 | < 0.1% |
Inherited | 2 | < 0.1% |
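The Cyrillic count is not only Russian-language headings. At least one sample row ("Ⅱ. Centгal Asia before 15th centuгy") appears to contain the Cyrillic letter г inside otherwise Latin words. Python's standard library has no script property, so the sketch below uses the first word of each character's Unicode name as a rough proxy. This is a heuristic for spotting such rows, not the profiler's method, and both helper names are hypothetical.

```python
import unicodedata

def letter_scripts(word):
    """Rough script labels (first word of the Unicode name) for letters in `word`."""
    return {
        unicodedata.name(ch, "UNKNOWN").split()[0]
        for ch in word
        if ch.isalpha()  # ignore digits, punctuation, and Roman-numeral symbols
    }

def mixed_script_words(text):
    """List whitespace-separated words whose letters span more than one script."""
    return [w for w in text.split() if len(letter_scripts(w)) > 1]

# The sample row, written with escapes so the Cyrillic letter is explicit.
sample = "\u2161. Cent\u0433al Asia before 15th centu\u0433y"
mixed = mixed_script_words(sample)
print(mixed)
```

Rows flagged this way are candidates for OCR or keyboard-layout clean-up before any text analysis.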
Most frequent character per script
Han
Value | Count | Frequency (%) |
的 | 821 | 3.7% |
朝 | 410 | 1.9% |
国 | 400 | 1.8% |
中 | 312 | 1.4% |
日 | 282 | 1.3% |
第 | 274 | 1.2% |
一 | 273 | 1.2% |
与 | 249 | 1.1% |
鲜 | 249 | 1.1% |
文 | 239 | 1.1% |
Other values (1801) | 18602 |
Hangul
Value | Count | Frequency (%) |
의 | 2644 | 6.3% |
한 | 1191 | 2.8% |
문 | 935 | 2.2% |
국 | 913 | 2.2% |
어 | 825 | 2.0% |
과 | 805 | 1.9% |
에 | 696 | 1.7% |
대 | 695 | 1.7% |
사 | 632 | 1.5% |
학 | 591 | 1.4% |
Other values (663) | 32047 |
Latin
Value | Count | Frequency (%) |
e | 14843 | 9.7% |
n | 12878 | 8.5% |
o | 12021 | 7.9% |
i | 11484 | 7.5% |
a | 11253 | 7.4% |
t | 10399 | 6.8% |
r | 8966 | 5.9% |
s | 7972 | 5.2% |
c | 4900 | 3.2% |
l | 4851 | 3.2% |
Other values (184) | 52747 |
Common
Value | Count | Frequency (%) |
(space) | 39289 | 66.6% |
. | 4416 | 7.5% |
1 | 1680 | 2.8% |
2 | 1680 | 2.8% |
- | 1343 | 2.3% |
3 | 1173 | 2.0% |
: | 945 | 1.6% |
) | 823 | 1.4% |
4 | 776 | 1.3% |
( | 613 | 1.0% |
Other values (84) | 6292 | 10.7% |
Cyrillic
Value | Count | Frequency (%) |
о | 1799 | 8.4% |
и | 1462 | 6.8% |
е | 1429 | 6.7% |
а | 1088 | 5.1% |
н | 1047 | 4.9% |
с | 946 | 4.4% |
р | 936 | 4.4% |
т | 789 | 3.7% |
к | 731 | 3.4% |
в | 687 | 3.2% |
Other values (58) | 10462 |
Hiragana
Value | Count | Frequency (%) |
の | 295 | |
と | 139 | |
に | 104 | 10.0% |
る | 45 | 4.3% |
お | 37 | 3.5% |
か | 30 | 2.9% |
て | 29 | 2.8% |
め | 29 | 2.8% |
を | 22 | 2.1% |
け | 22 | 2.1% |
Other values (37) | 293 |
Thai
Value | Count | Frequency (%) |
า | 40 | 10.4% |
ร | 28 | 7.3% |
ก | 26 | 6.8% |
ย | 19 | 4.9% |
เ | 18 | 4.7% |
บ | 18 | 4.7% |
น | 18 | 4.7% |
ท | 15 | 3.9% |
ี | 13 | 3.4% |
ิ | 12 | 3.1% |
Other values (35) | 177 |
Katakana
Value | Count | Frequency (%) |
ア | 43 | |
ジ | 24 | |
ダ | 10 | 5.7% |
ン | 8 | 4.5% |
リ | 7 | 4.0% |
ツ | 6 | 3.4% |
ム | 5 | 2.8% |
シ | 5 | 2.8% |
テ | 5 | 2.8% |
ス | 4 | 2.3% |
Other values (33) | 59 |
Greek
Value | Count | Frequency (%) |
γ | 1 | |
π | 1 | |
ω | 1 |
Unknown
Value | Count | Frequency (%) |
| 1 | |
| 1 |
Inherited
Value | Count | Frequency (%) |
̣ | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 206933 | 69.3% |
Hangul | 41916 | 14.0% |
CJK | 22055 | 7.4% |
Cyrillic | 21376 | 7.2% |
None | 2498 | 0.8% |
Punctuation | 1127 | 0.4% |
Hiragana | 1045 | 0.4% |
Latin Ext Additional | 509 | 0.2% |
Thai | 384 | 0.1% |
Katakana | 210 | 0.1% |
Other values (12) | 364 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 39289 | 19.0% |
e | 14843 | 7.2% |
n | 12878 | 6.2% |
o | 12021 | 5.8% |
i | 11484 | 5.5% |
a | 11253 | 5.4% |
t | 10399 | 5.0% |
r | 8966 | 4.3% |
s | 7972 | 3.9% |
c | 4900 | 2.4% |
Other values (81) | 72928 |
Hangul
Value | Count | Frequency (%) |
의 | 2644 | 6.3% |
한 | 1191 | 2.8% |
문 | 935 | 2.2% |
국 | 913 | 2.2% |
어 | 825 | 2.0% |
과 | 805 | 1.9% |
에 | 696 | 1.7% |
대 | 695 | 1.7% |
사 | 632 | 1.5% |
학 | 591 | 1.4% |
Other values (643) | 31989 |
Cyrillic
Value | Count | Frequency (%) |
о | 1799 | 8.4% |
и | 1462 | 6.8% |
е | 1429 | 6.7% |
а | 1088 | 5.1% |
н | 1047 | 4.9% |
с | 946 | 4.4% |
р | 936 | 4.4% |
т | 789 | 3.7% |
к | 731 | 3.4% |
в | 687 | 3.2% |
Other values (58) | 10462 |
CJK
Value | Count | Frequency (%) |
的 | 821 | 3.7% |
朝 | 410 | 1.9% |
国 | 400 | 1.8% |
中 | 312 | 1.4% |
日 | 282 | 1.3% |
第 | 274 | 1.2% |
一 | 273 | 1.2% |
与 | 249 | 1.1% |
鲜 | 249 | 1.1% |
文 | 239 | 1.1% |
Other values (1775) | 18546 |
None
Value | Count | Frequency (%) |
、 | 459 | |
ŏ | 160 | 6.4% |
『 | 149 | 6.0% |
』 | 148 | 5.9% |
é | 146 | 5.8% |
》 | 117 | 4.7% |
《 | 115 | 4.6% |
· | 99 | 4.0% |
, | 83 | 3.3% |
À | 79 | 3.2% |
Other values (92) | 943 |
Punctuation
Value | Count | Frequency (%) |
’ | 336 | |
” | 285 | |
“ | 280 | |
‘ | 189 | |
– | 23 | 2.0% |
• | 5 | 0.4% |
․ | 2 | 0.2% |
― | 2 | 0.2% |
… | 2 | 0.2% |
‐ | 2 | 0.2% |
Hiragana
Value | Count | Frequency (%) |
の | 295 | |
と | 139 | |
に | 104 | 10.0% |
る | 45 | 4.3% |
お | 37 | 3.5% |
か | 30 | 2.9% |
て | 29 | 2.8% |
め | 29 | 2.8% |
を | 22 | 2.1% |
け | 22 | 2.1% |
Other values (37) | 293 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 48 | |
Ⅲ | 45 | |
Ⅰ | 44 | |
Ⅳ | 31 | |
Ⅴ | 15 | 8.0% |
Ⅵ | 5 | 2.7% |
Katakana
Value | Count | Frequency (%) |
ア | 43 | |
ジ | 24 | 11.4% |
ー | 20 | 9.5% |
・ | 14 | 6.7% |
ダ | 10 | 4.8% |
ン | 8 | 3.8% |
リ | 7 | 3.3% |
ツ | 6 | 2.9% |
ム | 5 | 2.4% |
シ | 5 | 2.4% |
Other values (35) | 68 |
Latin Ext Additional
Value | Count | Frequency (%) |
Ệ | 41 | 8.1% |
Ố | 37 | 7.3% |
ố | 28 | 5.5% |
ệ | 19 | 3.7% |
ể | 18 | 3.5% |
Ủ | 17 | 3.3% |
ớ | 16 | 3.1% |
Ể | 15 | 2.9% |
Ọ | 14 | 2.8% |
Ả | 13 | 2.6% |
Other values (58) | 291 |
Thai
Value | Count | Frequency (%) |
า | 40 | 10.4% |
ร | 28 | 7.3% |
ก | 26 | 6.8% |
ย | 19 | 4.9% |
เ | 18 | 4.7% |
บ | 18 | 4.7% |
น | 18 | 4.7% |
ท | 15 | 3.9% |
ี | 13 | 3.4% |
ิ | 12 | 3.1% |
Other values (35) | 177 |
Math Operators
Value | Count | Frequency (%) |
≪ | 19 | |
≫ | 19 | |
− | 3 | 7.0% |
∅ | 2 | 4.7% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
李 | 11 | |
陸 | 6 | 10.7% |
類 | 5 | 8.9% |
樂 | 3 | 5.4% |
理 | 3 | 5.4% |
兩 | 2 | 3.6% |
論 | 2 | 3.6% |
金 | 2 | 3.6% |
栗 | 2 | 3.6% |
烈 | 2 | 3.6% |
Other values (16) | 18 |
Box Drawing
Value | Count | Frequency (%) |
┌ | 6 | |
┐ | 1 | 14.3% |
Compat Jamo
Value | Count | Frequency (%) |
ㄱ | 6 | 10.5% |
ㅋ | 5 | 8.8% |
ㅍ | 4 | 7.0% |
ㅅ | 4 | 7.0% |
ㅡ | 4 | 7.0% |
ㄴ | 4 | 7.0% |
ㅂ | 3 | 5.3% |
ㅈ | 3 | 5.3% |
ㅊ | 3 | 5.3% |
ㅇ | 3 | 5.3% |
Other values (9) | 18 |
Enclosed Alphanum
Value | Count | Frequency (%) |
① | 3 | |
② | 1 | 25.0% |
Diacriticals
Value | Count | Frequency (%) |
̣ | 2 |
IPA Ext
Value | Count | Frequency (%) |
ɨ | 2 |
PUA
Value | Count | Frequency (%) |
| 1 | |
| 1 |
Letterlike Symbols
Value | Count | Frequency (%) |
™ | 1 |
Jamo
Value | Count | Frequency (%) |
ᅳ | 1 |
Modifier Letters
Value | Count | Frequency (%) |
˳ | 1 |
CONTENTS_ENG
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
PAGE_NO_PAPER
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 10000 | 100.0% |
PAGE_NO_PDF
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 10000 | 100.0% |
LEVEL
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.1741 |
Minimum | 0 |
---|---|
Maximum | 5 |
Zeros | 28 |
Zeros (%) | 0.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 2 |
Q3 | 3 |
95-th percentile | 3 |
Maximum | 5 |
Range | 5 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.69212506 |
---|---|
Coefficient of variation (CV) | 0.31835015 |
Kurtosis | -0.1134089 |
Mean | 2.1741 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -0.024598174 |
Sum | 21741 |
Variance | 0.47903709 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
2 | 5405 | 54.0% |
3 | 2993 | 29.9% |
1 | 1451 | 14.5% |
4 | 114 | 1.1% |
0 | 28 | 0.3% |
5 | 9 | 0.1% |
Minimum values (ascending)
Value | Count | Frequency (%) |
0 | 28 | 0.3% |
1 | 1451 | 14.5% |
2 | 5405 | 54.0% |
3 | 2993 | 29.9% |
4 | 114 | 1.1% |
5 | 9 | 0.1% |
Maximum values (descending)
Value | Count | Frequency (%) |
5 | 9 | 0.1% |
4 | 114 | 1.1% |
3 | 2993 | 29.9% |
2 | 5405 | 54.0% |
1 | 1451 | 14.5% |
0 | 28 | 0.3% |
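Because LEVEL takes only six values, its descriptive statistics can be recomputed from the frequency table alone. The sketch below assumes the sample variance (divisor n − 1), which is what reproduces the reported figure; the variable names are illustrative.

```python
import math

# Value -> count, copied from the LEVEL frequency table.
level_counts = {0: 28, 1: 1451, 2: 5405, 3: 2993, 4: 114, 5: 9}

n = sum(level_counts.values())
total = sum(value * count for value, count in level_counts.items())
mean = total / n

# Assumption: sample variance with divisor n - 1.
variance = sum(
    count * (value - mean) ** 2 for value, count in level_counts.items()
) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(f"n={n} sum={total} mean={mean:.4f}")
print(f"variance={variance:.8f} std={std:.8f} cv={cv:.8f}")
```

With these inputs the sum, mean, variance, standard deviation, and coefficient of variation match the values reported under "Descriptive statistics".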
CONTENTS_ORDER
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 116 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9.8278 |
Minimum | 1 |
---|---|
Maximum | 119 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 10 |
95-th percentile | 37 |
Maximum | 119 |
Range | 118 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 13.645745 |
---|---|
Coefficient of variation (CV) | 1.3884842 |
Kurtosis | 15.965966 |
Mean | 9.8278 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 3.5319073 |
Sum | 98278 |
Variance | 186.20637 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
1 | 1347 | 13.5% |
2 | 1005 | 10.1% |
4 | 928 | 9.3% |
3 | 891 | 8.9% |
5 | 870 | 8.7% |
6 | 730 | 7.3% |
7 | 601 | 6.0% |
8 | 478 | 4.8% |
9 | 377 | 3.8% |
10 | 286 | 2.9% |
Other values (106) | 2487 | 24.9% |
Minimum values (ascending)
Value | Count | Frequency (%) |
1 | 1347 | 13.5% |
2 | 1005 | 10.1% |
3 | 891 | 8.9% |
4 | 928 | 9.3% |
5 | 870 | 8.7% |
6 | 730 | 7.3% |
7 | 601 | 6.0% |
8 | 478 | 4.8% |
9 | 377 | 3.8% |
10 | 286 | 2.9% |
Maximum values (descending)
Value | Count | Frequency (%) |
119 | 1 | < 0.1% |
117 | 1 | < 0.1% |
115 | 1 | < 0.1% |
114 | 2 | < 0.1% |
113 | 2 | < 0.1% |
112 | 2 | < 0.1% |
111 | 2 | < 0.1% |
110 | 1 | < 0.1% |
109 | 2 | < 0.1% |
108 | 3 | < 0.1% |
Variable | LEVEL | CONTENTS_ORDER |
---|---|---|
LEVEL | 1.000 | 0.306 |
CONTENTS_ORDER | 0.306 | 1.000 |
Variable | LEVEL | CONTENTS_ORDER |
---|---|---|
LEVEL | 1.000 | 0.651 |
CONTENTS_ORDER | 0.651 | 1.000 |
First rows
Index | CATALOG_ID | CONTENTS_ORI | CONTENTS_ENG | PAGE_NO_PAPER | PAGE_NO_PDF | LEVEL | CONTENTS_ORDER |
---|---|---|---|---|---|---|---|
6969 | 07C06_0062 | 4. 『朝鮮策略』의 原文 校勘 | <NA> | 0 | 0 | 2 | 5 |
10915 | 06C10 | The LeftPeriphery Structure of Korean in Minimalist Syntax | <NA> | 0 | 0 | 2 | 9 |
3790 | 07C15 | 한국에서의 우즈베키스탄 노동자와 아내 | <NA> | 0 | 0 | 3 | 60 |
15111 | 08C08_0009 | 2-lc) Income Distribution and Poverty | <NA> | 0 | 0 | 4 | 7 |
15834 | 07P01_0005 | Ⅱ. Centгal Asia before 15th centuгy | <NA> | 0 | 0 | 2 | 3 |
2477 | 07R38 | 第一节图们江流域的地理位置 | <NA> | 0 | 0 | 3 | 17 |
11995 | 06C04_0013 | 四、明朝在壬辰战争结束后对明鲜、鲜曰关系的反应 | <NA> | 0 | 0 | 2 | 5 |
11157 | 07C17_0005 | 4.3 AUN-Hankuk University of Foreign Studies | <NA> | 0 | 0 | 3 | 11 |
7831 | 10R33 | Trustworthiness | <NA> | 0 | 0 | 3 | 20 |
6031 | 07P01 | Феномен этнического предпринимательства как следствие миграционных процессов u адаптации национальных меньшинств в иноэтнuчной среде: на при мере корейской дuаспорыг.Новосuбuрска | <NA> | 0 | 0 | 3 | 38 |
Last rows
Index | CATALOG_ID | CONTENTS_ORI | CONTENTS_ENG | PAGE_NO_PAPER | PAGE_NO_PDF | LEVEL | CONTENTS_ORDER |
---|---|---|---|---|---|---|---|
15231 | 07C18_0009 | 2.2.2.2 각주를 통하여 | <NA> | 0 | 0 | 5 | 14 |
1413 | 06C06_0018 | 5)朝-清-日仲介貿易と制限的貿易課税政策 | <NA> | 0 | 0 | 3 | 9 |
6010 | 07P01 | 시베리아 지역 고려인 학자 롤-모델 연구를 통한 새로운 고려인 문제 연구방법 | <NA> | 0 | 0 | 3 | 17 |
3732 | 11R22 | 三、朝鲜族的民族通婚现状 | <NA> | 0 | 0 | 2 | 5 |
12314 | 08C12_0014 | From nominalizer to stance marker in the history of Okinawan | <NA> | 0 | 0 | 1 | 1 |
10029 | 07C07_0005 | 5. 사용하여도 무방하게 된 복수 어휘 | <NA> | 0 | 0 | 2 | 14 |
3040 | 06R25 | 第三节 隋丽关系演变及对东北亚政局的影响 | <NA> | 0 | 0 | 3 | 14 |
7335 | 09C01_0028 | Japan and Korea as a Source of Media and Cultural Capital | <NA> | 0 | 0 | 1 | 1 |
13577 | 08C19_0029 | A “stretch” | <NA> | 0 | 0 | 2 | 2 |
9621 | 07P03_0006 | La dialectique traditionnelle de « Won » et « Han » : le fondement psychologique des Coréens | <NA> | 0 | 0 | 3 | 6 |