Overview

Dataset statistics

Number of variables53
Number of observations2272
Missing cells37784
Missing cells (%)31.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory971.9 KiB
Average record size in memory438.1 B

Variable types

Text26
Categorical17
Numeric3
DateTime1
Boolean3
Unsupported3

Dataset

Description한국학중앙연구원 해외한국학지원사업 연구성과 상세정보
Author한국학중앙연구원
URLhttps://www.data.go.kr/data/15049068/fileData.do

Alerts

SUBTITLE_KOR has constant value ""Constant
PAPER_PAGE_START has constant value ""Constant
PAPER_PAGE_END has constant value ""Constant
PDF_PAGE_START has constant value ""Constant
PDF_PAGE_END has constant value ""Constant
PROJECTYEAR_END is highly imbalanced (98.7%)Imbalance
SERIAL_NUMBER is highly imbalanced (99.2%)Imbalance
VOLUME is highly imbalanced (99.2%)Imbalance
NUMBER is highly imbalanced (99.3%)Imbalance
SEARCH_YN is highly imbalanced (87.2%)Imbalance
IS_OPEN is highly imbalanced (99.0%)Imbalance
ERASE_YN is highly imbalanced (92.7%)Imbalance
REGISTER is highly imbalanced (69.8%)Imbalance
MODIFIER is highly imbalanced (78.2%)Imbalance
ERASER is highly imbalanced (95.3%)Imbalance
TITLE_ENG has 889 (39.1%) missing valuesMissing
TITLE_KOR has 164 (7.2%) missing valuesMissing
SUBTITLE_ENG has 2269 (99.9%) missing valuesMissing
SUBTITLE_KOR has 2271 (> 99.9%) missing valuesMissing
SUBTITLE_ORI has 2263 (99.6%) missing valuesMissing
AUTHOR_ORI has 104 (4.6%) missing valuesMissing
AUTHOR_KOR has 1599 (70.4%) missing valuesMissing
AUTHOR_ENG has 1214 (53.4%) missing valuesMissing
AUTHOR_ETC has 1736 (76.4%) missing valuesMissing
ORGANIZATION_ORI has 319 (14.0%) missing valuesMissing
ORGANIZATION_KOR has 550 (24.2%) missing valuesMissing
ORGANIZATION_ENG has 545 (24.0%) missing valuesMissing
ORGANIZATION_ETC has 2126 (93.6%) missing valuesMissing
NATION has 436 (19.2%) missing valuesMissing
PUBLISH_DATE has 2235 (98.4%) missing valuesMissing
PUBLISHER has 2236 (98.4%) missing valuesMissing
ISBN has 2249 (99.0%) missing valuesMissing
ISSN has 2259 (99.4%) missing valuesMissing
SORT_TITLE_ENG has 889 (39.1%) missing valuesMissing
SORT_TITLE_KOR has 166 (7.3%) missing valuesMissing
FILE_NAME has 2272 (100.0%) missing valuesMissing
ROOT_DIR has 2272 (100.0%) missing valuesMissing
SUB_DIR has 2272 (100.0%) missing valuesMissing
MODIFIED_DATE has 2193 (96.5%) missing valuesMissing
ERASE_DATE has 2252 (99.1%) missing valuesMissing
CATALOG_ID has unique valuesUnique
FILE_NAME is an unsupported type, check if it needs cleaning or further analysisUnsupported
ROOT_DIR is an unsupported type, check if it needs cleaning or further analysisUnsupported
SUB_DIR is an unsupported type, check if it needs cleaning or further analysisUnsupported
SUB_INDEX has 226 (9.9%) zerosZeros

Reproduction

Analysis started2023-12-12 15:43:02.858721
Analysis finished2023-12-12 15:43:07.664270
Duration4.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

CATALOG_ID
Text

UNIQUE 

Distinct2272
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
2023-12-13T00:43:07.866039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.4920775
Min length5

Characters and Unicode

Total characters21566
Distinct characters18
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2272 ?
Unique (%)100.0%

Sample

1st row05R42
2nd row05R42_0001
3rd row05R42_0002
4th row05R42_0003
5th row05R42_0004
ValueCountFrequency (%)
05r42 1
 
< 0.1%
09c02_0036 1
 
< 0.1%
09c02_0038 1
 
< 0.1%
09c02_0031 1
 
< 0.1%
09c02_0032 1
 
< 0.1%
09c02_0033 1
 
< 0.1%
09c02_0034 1
 
< 0.1%
09c02_0035 1
 
< 0.1%
09c02_0029 1
 
< 0.1%
09c02_0028 1
 
< 0.1%
Other values (2262) 2262
99.6%
2023-12-13T00:43:08.238598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8283
38.4%
1 2327
 
10.8%
_ 2040
 
9.5%
C 1678
 
7.8%
6 1109
 
5.1%
2 1021
 
4.7%
7 955
 
4.4%
9 930
 
4.3%
8 710
 
3.3%
5 709
 
3.3%
Other values (8) 1804
 
8.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 17248
80.0%
Uppercase Letter 2272
 
10.5%
Connector Punctuation 2040
 
9.5%
Lowercase Letter 6
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8283
48.0%
1 2327
 
13.5%
6 1109
 
6.4%
2 1021
 
5.9%
7 955
 
5.5%
9 930
 
5.4%
8 710
 
4.1%
5 709
 
4.1%
3 620
 
3.6%
4 584
 
3.4%
Uppercase Letter
ValueCountFrequency (%)
C 1678
73.9%
P 319
 
14.0%
R 273
 
12.0%
S 2
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
a 3
50.0%
b 2
33.3%
d 1
 
16.7%
Connector Punctuation
ValueCountFrequency (%)
_ 2040
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 19288
89.4%
Latin 2278
 
10.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8283
42.9%
1 2327
 
12.1%
_ 2040
 
10.6%
6 1109
 
5.7%
2 1021
 
5.3%
7 955
 
5.0%
9 930
 
4.8%
8 710
 
3.7%
5 709
 
3.7%
3 620
 
3.2%
Latin
ValueCountFrequency (%)
C 1678
73.7%
P 319
 
14.0%
R 273
 
12.0%
a 3
 
0.1%
b 2
 
0.1%
S 2
 
0.1%
d 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 21566
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8283
38.4%
1 2327
 
10.8%
_ 2040
 
9.5%
C 1678
 
7.8%
6 1109
 
5.1%
2 1021
 
4.7%
7 955
 
4.4%
9 930
 
4.3%
8 710
 
3.3%
5 709
 
3.3%
Other values (8) 1804
 
8.4%
Distinct99
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
2023-12-13T00:43:08.810718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.5915493
Min length1

Characters and Unicode

Total characters10432
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)0.2%

Sample

1st row0
2nd row05R42
3rd row05R42
4th row05R42
5th row05R42
ValueCountFrequency (%)
0 232
 
10.2%
09c02 111
 
4.9%
09c05 106
 
4.7%
06c17 104
 
4.6%
07c06 89
 
3.9%
08c09 70
 
3.1%
07c15 67
 
2.9%
06c10 61
 
2.7%
07c18 58
 
2.6%
09c15 57
 
2.5%
Other values (89) 1317
58.0%
2023-12-13T00:43:09.277552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3332
31.9%
C 1613
15.5%
1 1437
13.8%
6 771
 
7.4%
9 669
 
6.4%
7 658
 
6.3%
2 433
 
4.2%
8 424
 
4.1%
5 371
 
3.6%
P 301
 
2.9%
Other values (3) 423
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8392
80.4%
Uppercase Letter 2040
 
19.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3332
39.7%
1 1437
17.1%
6 771
 
9.2%
9 669
 
8.0%
7 658
 
7.8%
2 433
 
5.2%
8 424
 
5.1%
5 371
 
4.4%
4 175
 
2.1%
3 122
 
1.5%
Uppercase Letter
ValueCountFrequency (%)
C 1613
79.1%
P 301
 
14.8%
R 126
 
6.2%

Most occurring scripts

ValueCountFrequency (%)
Common 8392
80.4%
Latin 2040
 
19.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3332
39.7%
1 1437
17.1%
6 771
 
9.2%
9 669
 
8.0%
7 658
 
7.8%
2 433
 
5.2%
8 424
 
5.1%
5 371
 
4.4%
4 175
 
2.1%
3 122
 
1.5%
Latin
ValueCountFrequency (%)
C 1613
79.1%
P 301
 
14.8%
R 126
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10432
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3332
31.9%
C 1613
15.5%
1 1437
13.8%
6 771
 
7.4%
9 669
 
6.4%
7 658
 
6.3%
2 433
 
4.2%
8 424
 
4.1%
5 371
 
3.6%
P 301
 
2.9%
Other values (3) 423
 
4.1%

TITLE_ENG
Text

MISSING 

Distinct1373
Distinct (%)99.3%
Missing889
Missing (%)39.1%
Memory size17.9 KiB
2023-12-13T00:43:09.574730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length296
Median length132
Mean length75.751988
Min length10

Characters and Unicode

Total characters104765
Distinct characters260
Distinct categories14 ?
Distinct scripts6 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1363 ?
Unique (%)98.6%

Sample

1st rowKorean-English Dictionary in Hindi Pronunciation
2nd rowKorean-English Dictionary in Hindi Pronunciation (ㄱ)
3rd rowKorean-English Dictionary in Hindi Pronunciation (ㄴ)
4th rowKorean-English Dictionary in Hindi Pronunciation (ㄷ)
5th rowKorean-English Dictionary in Hindi Pronunciation (ㄹ,ㅁ)
ValueCountFrequency (%)
of 1113
 
7.2%
the 1069
 
6.9%
in 814
 
5.3%
and 808
 
5.2%
korean 596
 
3.9%
on 252
 
1.6%
a 251
 
1.6%
korea 224
 
1.5%
98
 
0.6%
south 92
 
0.6%
Other values (3997) 10126
65.6%
2023-12-13T00:43:10.045865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14088
13.4%
e 8819
 
8.4%
n 8043
 
7.7%
o 7415
 
7.1%
a 7262
 
6.9%
i 6890
 
6.6%
t 5885
 
5.6%
r 5103
 
4.9%
s 4442
 
4.2%
l 2698
 
2.6%
Other values (250) 34120
32.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 76166
72.7%
Space Separator 14088
 
13.4%
Uppercase Letter 11587
 
11.1%
Other Punctuation 1075
 
1.0%
Decimal Number 711
 
0.7%
Dash Punctuation 531
 
0.5%
Other Letter 142
 
0.1%
Final Punctuation 134
 
0.1%
Open Punctuation 117
 
0.1%
Close Punctuation 115
 
0.1%
Other values (4) 99
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
 
2.1%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
Other values (116) 121
85.2%
Lowercase Letter
ValueCountFrequency (%)
e 8819
11.6%
n 8043
10.6%
o 7415
9.7%
a 7262
9.5%
i 6890
9.0%
t 5885
 
7.7%
r 5103
 
6.7%
s 4442
 
5.8%
l 2698
 
3.5%
h 2641
 
3.5%
Other values (41) 16968
22.3%
Uppercase Letter
ValueCountFrequency (%)
K 1181
 
10.2%
C 1081
 
9.3%
S 1012
 
8.7%
T 872
 
7.5%
A 857
 
7.4%
P 711
 
6.1%
E 651
 
5.6%
I 583
 
5.0%
N 551
 
4.8%
R 521
 
4.5%
Other values (30) 3567
30.8%
Other Punctuation
ValueCountFrequency (%)
: 420
39.1%
' 228
21.2%
, 218
20.3%
. 111
 
10.3%
" 44
 
4.1%
/ 35
 
3.3%
! 5
 
0.5%
; 4
 
0.4%
· 4
 
0.4%
& 3
 
0.3%
Other values (2) 3
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 160
22.5%
0 147
20.7%
9 99
13.9%
2 91
12.8%
8 48
 
6.8%
5 41
 
5.8%
6 40
 
5.6%
7 34
 
4.8%
4 27
 
3.8%
3 24
 
3.4%
Open Punctuation
ValueCountFrequency (%)
( 111
94.9%
[ 4
 
3.4%
1
 
0.9%
1
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 109
94.8%
] 4
 
3.5%
1
 
0.9%
1
 
0.9%
Math Symbol
ValueCountFrequency (%)
< 8
42.1%
> 8
42.1%
~ 2
 
10.5%
+ 1
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 530
99.8%
1
 
0.2%
Final Punctuation
ValueCountFrequency (%)
79
59.0%
55
41.0%
Initial Punctuation
ValueCountFrequency (%)
55
80.9%
13
 
19.1%
Space Separator
ValueCountFrequency (%)
14088
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 10
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 87447
83.5%
Common 16870
 
16.1%
Cyrillic 305
 
0.3%
Hangul 72
 
0.1%
Han 70
 
0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
 
4.2%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
1
 
1.4%
1
 
1.4%
Other values (53) 53
73.6%
Han
ValueCountFrequency (%)
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
1
 
1.4%
1
 
1.4%
1
 
1.4%
Other values (53) 53
75.7%
Latin
ValueCountFrequency (%)
e 8819
 
10.1%
n 8043
 
9.2%
o 7415
 
8.5%
a 7262
 
8.3%
i 6890
 
7.9%
t 5885
 
6.7%
r 5103
 
5.8%
s 4442
 
5.1%
l 2698
 
3.1%
h 2641
 
3.0%
Other values (44) 28249
32.3%
Common
ValueCountFrequency (%)
14088
83.5%
- 530
 
3.1%
: 420
 
2.5%
' 228
 
1.4%
, 218
 
1.3%
1 160
 
0.9%
0 147
 
0.9%
. 111
 
0.7%
( 111
 
0.7%
) 109
 
0.6%
Other values (33) 748
 
4.4%
Cyrillic
ValueCountFrequency (%)
п 46
15.1%
а 41
13.4%
е 38
12.5%
о 36
11.8%
г 32
10.5%
К 22
 
7.2%
и 9
 
3.0%
с 7
 
2.3%
у 7
 
2.3%
А 7
 
2.3%
Other values (26) 60
19.7%
Greek
ValueCountFrequency (%)
γ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 104098
99.4%
Cyrillic 305
 
0.3%
Punctuation 203
 
0.2%
CJK 67
 
0.1%
Hangul 57
 
0.1%
None 15
 
< 0.1%
Compat Jamo 15
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%
Letterlike Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
14088
13.5%
e 8819
 
8.5%
n 8043
 
7.7%
o 7415
 
7.1%
a 7262
 
7.0%
i 6890
 
6.6%
t 5885
 
5.7%
r 5103
 
4.9%
s 4442
 
4.3%
l 2698
 
2.6%
Other values (72) 33453
32.1%
Punctuation
ValueCountFrequency (%)
79
38.9%
55
27.1%
55
27.1%
13
 
6.4%
1
 
0.5%
Cyrillic
ValueCountFrequency (%)
п 46
15.1%
а 41
13.4%
е 38
12.5%
о 36
11.8%
г 32
10.5%
К 22
 
7.2%
и 9
 
3.0%
с 7
 
2.3%
у 7
 
2.3%
А 7
 
2.3%
Other values (26) 60
19.7%
None
ValueCountFrequency (%)
· 4
26.7%
2
13.3%
2
13.3%
1
 
6.7%
1
 
6.7%
ß 1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
γ 1
 
6.7%
Hangul
ValueCountFrequency (%)
3
 
5.3%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
1
 
1.8%
1
 
1.8%
Other values (38) 38
66.7%
CJK
ValueCountFrequency (%)
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
1
 
1.5%
1
 
1.5%
Other values (50) 50
74.6%
Letterlike Symbols
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (5) 5
33.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

TITLE_KOR
Text

MISSING 

Distinct2065
Distinct (%)98.0%
Missing164
Missing (%)7.2%
Memory size17.9 KiB
2023-12-13T00:43:10.456802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length88
Median length60
Mean length27.387097
Min length1

Characters and Unicode

Total characters57732
Distinct characters1423
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2022 ?
Unique (%)95.9%

Sample

1st row힌디어로 발음하는 인도의 한영 대사전
2nd row힌디어로 발음하는 인도의 한영 대사전 - ㄱ(기역)
3rd row힌디어로 발음하는 인도의 한영 대사전 - ㄴ(니은)
4th row힌디어로 발음하는 인도의 한영 대사전 - ㄷ(디귿)
5th row힌디어로 발음하는 인도의 한영 대사전 - ㄹ, ㅁ(리을, 미음)
ValueCountFrequency (%)
대한 297
 
2.2%
237
 
1.7%
한국 223
 
1.6%
한국어 217
 
1.6%
연구 169
 
1.2%
중심으로 156
 
1.1%
한국의 89
 
0.6%
대하여 72
 
0.5%
중국 65
 
0.5%
63
 
0.5%
Other values (7018) 12172
88.5%
2023-12-13T00:43:11.074861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11693
 
20.3%
2468
 
4.3%
1780
 
3.1%
1378
 
2.4%
925
 
1.6%
901
 
1.6%
844
 
1.5%
780
 
1.4%
666
 
1.2%
537
 
0.9%
Other values (1413) 35760
61.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41813
72.4%
Space Separator 11693
 
20.3%
Other Punctuation 969
 
1.7%
Decimal Number 880
 
1.5%
Lowercase Letter 612
 
1.1%
Dash Punctuation 472
 
0.8%
Open Punctuation 396
 
0.7%
Close Punctuation 395
 
0.7%
Uppercase Letter 238
 
0.4%
Final Punctuation 95
 
0.2%
Other values (2) 169
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2468
 
5.9%
1780
 
4.3%
1378
 
3.3%
925
 
2.2%
901
 
2.2%
844
 
2.0%
780
 
1.9%
666
 
1.6%
537
 
1.3%
526
 
1.3%
Other values (1297) 31008
74.2%
Lowercase Letter
ValueCountFrequency (%)
a 63
 
10.3%
i 61
 
10.0%
n 53
 
8.7%
o 53
 
8.7%
e 52
 
8.5%
t 43
 
7.0%
l 30
 
4.9%
r 27
 
4.4%
s 21
 
3.4%
g 20
 
3.3%
Other values (30) 189
30.9%
Uppercase Letter
ValueCountFrequency (%)
A 24
 
10.1%
P 22
 
9.2%
C 17
 
7.1%
S 16
 
6.7%
I 15
 
6.3%
K 14
 
5.9%
E 13
 
5.5%
T 13
 
5.5%
L 11
 
4.6%
O 11
 
4.6%
Other values (16) 82
34.5%
Other Punctuation
ValueCountFrequency (%)
: 387
39.9%
, 189
19.5%
' 165
17.0%
· 97
 
10.0%
" 73
 
7.5%
. 25
 
2.6%
/ 22
 
2.3%
! 4
 
0.4%
2
 
0.2%
2
 
0.2%
Other values (2) 3
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 195
22.2%
0 183
20.8%
9 121
13.8%
2 118
13.4%
8 63
 
7.2%
5 48
 
5.5%
3 43
 
4.9%
6 41
 
4.7%
7 38
 
4.3%
4 30
 
3.4%
Open Punctuation
ValueCountFrequency (%)
( 239
60.4%
69
 
17.4%
47
 
11.9%
19
 
4.8%
15
 
3.8%
[ 6
 
1.5%
1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 238
60.3%
69
 
17.5%
47
 
11.9%
19
 
4.8%
15
 
3.8%
] 6
 
1.5%
1
 
0.3%
Math Symbol
ValueCountFrequency (%)
16
21.6%
16
21.6%
~ 15
20.3%
> 12
16.2%
< 12
16.2%
+ 2
 
2.7%
= 1
 
1.4%
Dash Punctuation
ValueCountFrequency (%)
- 470
99.6%
2
 
0.4%
Final Punctuation
ValueCountFrequency (%)
51
53.7%
44
46.3%
Initial Punctuation
ValueCountFrequency (%)
51
53.7%
44
46.3%
Space Separator
ValueCountFrequency (%)
11693
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40027
69.3%
Common 15069
 
26.1%
Han 1786
 
3.1%
Latin 813
 
1.4%
Cyrillic 37
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2468
 
6.2%
1780
 
4.4%
1378
 
3.4%
925
 
2.3%
901
 
2.3%
844
 
2.1%
780
 
1.9%
666
 
1.7%
537
 
1.3%
526
 
1.3%
Other values (702) 29222
73.0%
Han
ValueCountFrequency (%)
50
 
2.8%
37
 
2.1%
35
 
2.0%
33
 
1.8%
29
 
1.6%
29
 
1.6%
26
 
1.5%
25
 
1.4%
25
 
1.4%
20
 
1.1%
Other values (585) 1477
82.7%
Common
ValueCountFrequency (%)
11693
77.6%
- 470
 
3.1%
: 387
 
2.6%
( 239
 
1.6%
) 238
 
1.6%
1 195
 
1.3%
, 189
 
1.3%
0 183
 
1.2%
' 165
 
1.1%
9 121
 
0.8%
Other values (40) 1189
 
7.9%
Latin
ValueCountFrequency (%)
a 63
 
7.7%
i 61
 
7.5%
n 53
 
6.5%
o 53
 
6.5%
e 52
 
6.4%
t 43
 
5.3%
l 30
 
3.7%
r 27
 
3.3%
A 24
 
3.0%
P 22
 
2.7%
Other values (39) 385
47.4%
Cyrillic
ValueCountFrequency (%)
е 5
13.5%
а 4
10.8%
и 4
10.8%
о 3
8.1%
с 3
8.1%
н 3
8.1%
т 2
 
5.4%
р 2
 
5.4%
л 2
 
5.4%
в 2
 
5.4%
Other values (7) 7
18.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40005
69.3%
ASCII 15255
 
26.4%
CJK 1768
 
3.1%
None 401
 
0.7%
Punctuation 194
 
0.3%
Cyrillic 37
 
0.1%
Math Operators 32
 
0.1%
Compat Jamo 22
 
< 0.1%
CJK Compat Ideographs 18
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11693
76.7%
- 470
 
3.1%
: 387
 
2.5%
( 239
 
1.6%
) 238
 
1.6%
1 195
 
1.3%
, 189
 
1.2%
0 183
 
1.2%
' 165
 
1.1%
9 121
 
0.8%
Other values (69) 1375
 
9.0%
Hangul
ValueCountFrequency (%)
2468
 
6.2%
1780
 
4.4%
1378
 
3.4%
925
 
2.3%
901
 
2.3%
844
 
2.1%
780
 
1.9%
666
 
1.7%
537
 
1.3%
526
 
1.3%
Other values (687) 29200
73.0%
None
ValueCountFrequency (%)
· 97
24.2%
69
17.2%
69
17.2%
47
11.7%
47
11.7%
19
 
4.7%
19
 
4.7%
15
 
3.7%
15
 
3.7%
2
 
0.5%
Other values (2) 2
 
0.5%
Punctuation
ValueCountFrequency (%)
51
26.3%
51
26.3%
44
22.7%
44
22.7%
2
 
1.0%
2
 
1.0%
CJK
ValueCountFrequency (%)
50
 
2.8%
37
 
2.1%
35
 
2.0%
33
 
1.9%
29
 
1.6%
29
 
1.6%
26
 
1.5%
25
 
1.4%
25
 
1.4%
20
 
1.1%
Other values (570) 1459
82.5%
Math Operators
ValueCountFrequency (%)
16
50.0%
16
50.0%
Cyrillic
ValueCountFrequency (%)
е 5
13.5%
а 4
10.8%
и 4
10.8%
о 3
8.1%
с 3
8.1%
н 3
8.1%
т 2
 
5.4%
р 2
 
5.4%
л 2
 
5.4%
в 2
 
5.4%
Other values (7) 7
18.9%
CJK Compat Ideographs
ValueCountFrequency (%)
2
 
11.1%
2
 
11.1%
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (5) 5
27.8%
Compat Jamo
ValueCountFrequency (%)
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (5) 5
22.7%
Distinct2243
Distinct (%)98.8%
Missing2
Missing (%)0.1%
Memory size17.9 KiB
2023-12-13T00:43:11.445609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length209
Median length126
Mean length50.67489
Min length3

Characters and Unicode

Total characters115032
Distinct characters1636
Distinct categories14 ?
Distinct scripts8 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2216 ?
Unique (%)97.6%

Sample

1st row힌디어로 발음하는 인도의 한영 대사전
2nd row힌디어로 발음하는 인도의 한영 대사전 - ㄱ(기역)
3rd row힌디어로 발음하는 인도의 한영 대사전 - ㄴ(니은)
4th row힌디어로 발음하는 인도의 한영 대사전 - ㄷ(디귿)
5th row힌디어로 발음하는 인도의 한영 대사전 - ㄹ, ㅁ(리을, 미음)
ValueCountFrequency (%)
the 725
 
4.0%
of 652
 
3.6%
in 594
 
3.3%
and 549
 
3.0%
korean 452
 
2.5%
259
 
1.4%
korea 183
 
1.0%
a 178
 
1.0%
on 140
 
0.8%
중심으로 122
 
0.7%
Other values (7441) 14306
78.8%
2023-12-13T00:43:12.004807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15938
 
13.9%
e 6493
 
5.6%
n 5793
 
5.0%
a 5241
 
4.6%
o 5240
 
4.6%
i 4947
 
4.3%
t 4227
 
3.7%
r 3810
 
3.3%
s 3304
 
2.9%
l 2033
 
1.8%
Other values (1626) 58006
50.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 63475
55.2%
Other Letter 18684
 
16.2%
Space Separator 15938
 
13.9%
Uppercase Letter 12842
 
11.2%
Other Punctuation 1119
 
1.0%
Decimal Number 972
 
0.8%
Dash Punctuation 757
 
0.7%
Open Punctuation 369
 
0.3%
Close Punctuation 368
 
0.3%
Final Punctuation 248
 
0.2%
Other values (4) 260
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
821
 
4.4%
509
 
2.7%
426
 
2.3%
312
 
1.7%
307
 
1.6%
297
 
1.6%
288
 
1.5%
288
 
1.5%
261
 
1.4%
241
 
1.3%
Other values (1445) 14934
79.9%
Lowercase Letter
ValueCountFrequency (%)
e 6493
 
10.2%
n 5793
 
9.1%
a 5241
 
8.3%
o 5240
 
8.3%
i 4947
 
7.8%
t 4227
 
6.7%
r 3810
 
6.0%
s 3304
 
5.2%
l 2033
 
3.2%
h 1790
 
2.8%
Other values (51) 20597
32.4%
Uppercase Letter
ValueCountFrequency (%)
K 889
 
6.9%
C 756
 
5.9%
S 752
 
5.9%
T 734
 
5.7%
A 677
 
5.3%
N 572
 
4.5%
I 538
 
4.2%
E 511
 
4.0%
P 466
 
3.6%
R 412
 
3.2%
Other values (50) 6535
50.9%
Other Punctuation
ValueCountFrequency (%)
: 463
41.4%
, 272
24.3%
' 165
 
14.7%
. 68
 
6.1%
· 54
 
4.8%
/ 45
 
4.0%
22
 
2.0%
" 6
 
0.5%
! 5
 
0.4%
5
 
0.4%
Other values (6) 14
 
1.3%
Decimal Number
ValueCountFrequency (%)
1 224
23.0%
0 194
20.0%
9 140
14.4%
2 116
11.9%
8 68
 
7.0%
5 54
 
5.6%
6 51
 
5.2%
7 45
 
4.6%
3 44
 
4.5%
4 36
 
3.7%
Math Symbol
ValueCountFrequency (%)
17
21.2%
17
21.2%
< 16
20.0%
> 15
18.8%
~ 10
12.5%
+ 2
 
2.5%
× 1
 
1.2%
÷ 1
 
1.2%
= 1
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 201
54.5%
70
 
19.0%
53
 
14.4%
23
 
6.2%
12
 
3.3%
[ 9
 
2.4%
1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 200
54.3%
70
 
19.0%
53
 
14.4%
23
 
6.2%
12
 
3.3%
] 9
 
2.4%
1
 
0.3%
Dash Punctuation
ValueCountFrequency (%)
- 754
99.6%
2
 
0.3%
1
 
0.1%
Final Punctuation
ValueCountFrequency (%)
132
53.2%
116
46.8%
Initial Punctuation
ValueCountFrequency (%)
116
66.3%
59
33.7%
Modifier Symbol
ValueCountFrequency (%)
` 3
75.0%
¨ 1
 
25.0%
Space Separator
ValueCountFrequency (%)
15938
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 64186
55.8%
Common 20031
 
17.4%
Hangul 13754
 
12.0%
Cyrillic 12130
 
10.5%
Han 4557
 
4.0%
Hiragana 293
 
0.3%
Katakana 80
 
0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
195
 
4.3%
148
 
3.2%
147
 
3.2%
116
 
2.5%
88
 
1.9%
68
 
1.5%
56
 
1.2%
55
 
1.2%
48
 
1.1%
43
 
0.9%
Other values (846) 3593
78.8%
Hangul
ValueCountFrequency (%)
821
 
6.0%
509
 
3.7%
426
 
3.1%
312
 
2.3%
307
 
2.2%
297
 
2.2%
288
 
2.1%
288
 
2.1%
261
 
1.9%
241
 
1.8%
Other values (528) 10004
72.7%
Cyrillic
ValueCountFrequency (%)
о 994
 
8.2%
и 789
 
6.5%
е 780
 
6.4%
а 597
 
4.9%
н 591
 
4.9%
с 547
 
4.5%
р 539
 
4.4%
к 429
 
3.5%
т 426
 
3.5%
О 394
 
3.2%
Other values (55) 6044
49.8%
Common
ValueCountFrequency (%)
15938
79.6%
- 754
 
3.8%
: 463
 
2.3%
, 272
 
1.4%
1 224
 
1.1%
( 201
 
1.0%
) 200
 
1.0%
0 194
 
1.0%
' 165
 
0.8%
9 140
 
0.7%
Other values (50) 1480
 
7.4%
Latin
ValueCountFrequency (%)
e 6493
 
10.1%
n 5793
 
9.0%
a 5241
 
8.2%
o 5240
 
8.2%
i 4947
 
7.7%
t 4227
 
6.6%
r 3810
 
5.9%
s 3304
 
5.1%
l 2033
 
3.2%
h 1790
 
2.8%
Other values (45) 21308
33.2%
Hiragana
ValueCountFrequency (%)
82
28.0%
42
14.3%
33
11.3%
14
 
4.8%
14
 
4.8%
13
 
4.4%
11
 
3.8%
11
 
3.8%
10
 
3.4%
8
 
2.7%
Other values (24) 55
18.8%
Katakana
ValueCountFrequency (%)
22
27.5%
14
17.5%
6
 
7.5%
4
 
5.0%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (17) 20
25.0%
Greek
ValueCountFrequency (%)
γ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 83343
72.5%
Hangul 13737
 
11.9%
Cyrillic 12130
 
10.5%
CJK 4531
 
3.9%
Punctuation 427
 
0.4%
None 413
 
0.4%
Hiragana 293
 
0.3%
Katakana 80
 
0.1%
Math Operators 34
 
< 0.1%
CJK Compat Ideographs 26
 
< 0.1%
Other values (2) 18
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15938
19.1%
e 6493
 
7.8%
n 5793
 
7.0%
a 5241
 
6.3%
o 5240
 
6.3%
i 4947
 
5.9%
t 4227
 
5.1%
r 3810
 
4.6%
s 3304
 
4.0%
l 2033
 
2.4%
Other values (75) 26317
31.6%
Cyrillic
ValueCountFrequency (%)
о 994
 
8.2%
и 789
 
6.5%
е 780
 
6.4%
а 597
 
4.9%
н 591
 
4.9%
с 547
 
4.5%
р 539
 
4.4%
к 429
 
3.5%
т 426
 
3.5%
О 394
 
3.2%
Other values (55) 6044
49.8%
Hangul
ValueCountFrequency (%)
821
 
6.0%
509
 
3.7%
426
 
3.1%
312
 
2.3%
307
 
2.2%
297
 
2.2%
288
 
2.1%
288
 
2.1%
261
 
1.9%
241
 
1.8%
Other values (512) 9987
72.7%
CJK
ValueCountFrequency (%)
195
 
4.3%
148
 
3.3%
147
 
3.2%
116
 
2.6%
88
 
1.9%
68
 
1.5%
56
 
1.2%
55
 
1.2%
48
 
1.1%
43
 
0.9%
Other values (829) 3567
78.7%
Punctuation
ValueCountFrequency (%)
132
30.9%
116
27.2%
116
27.2%
59
13.8%
2
 
0.5%
2
 
0.5%
Hiragana
ValueCountFrequency (%)
82
28.0%
42
14.3%
33
11.3%
14
 
4.8%
14
 
4.8%
13
 
4.4%
11
 
3.8%
11
 
3.8%
10
 
3.4%
8
 
2.7%
Other values (24) 55
18.8%
None
ValueCountFrequency (%)
70
16.9%
70
16.9%
· 54
13.1%
53
12.8%
53
12.8%
23
 
5.6%
23
 
5.6%
22
 
5.3%
12
 
2.9%
12
 
2.9%
Other values (12) 21
 
5.1%
Katakana
ValueCountFrequency (%)
22
27.5%
14
17.5%
6
 
7.5%
4
 
5.0%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (17) 20
25.0%
Math Operators
ValueCountFrequency (%)
17
50.0%
17
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
4
15.4%
4
15.4%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (7) 7
26.9%
Compat Jamo
ValueCountFrequency (%)
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (6) 6
35.3%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

SUBTITLE_ENG
Text

MISSING 

Distinct3
Distinct (%)100.0%
Missing2269
Missing (%)99.9%
Memory size17.9 KiB
2023-12-13T00:43:12.202272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length108
Median length106
Mean length73
Min length5

Characters and Unicode

Total characters219
Distinct characters46
Distinct categories10 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)100.0%

Sample

1st row#NAME
2nd row Restudy on the Battle of Dingmao and The Negotiation Between Houjin and Chosn Dynasty Concerning the Battle
3rd rowThe Comparative Analysis on the Scholarly Journals of“Korean Studies”and “Contemporary Korea”( 1996 -2010)
ValueCountFrequency (%)
the 5
 
16.1%
on 2
 
6.5%
battle 2
 
6.5%
and 2
 
6.5%
name 1
 
3.2%
comparative 1
 
3.2%
1996 1
 
3.2%
korea” 1
 
3.2%
“contemporary 1
 
3.2%
studies”and 1
 
3.2%
Other values (14) 14
45.2%
2023-12-13T00:43:12.538748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29
 
13.2%
e 18
 
8.2%
n 18
 
8.2%
o 17
 
7.8%
a 16
 
7.3%
t 15
 
6.8%
r 8
 
3.7%
i 8
 
3.7%
s 7
 
3.2%
h 7
 
3.2%
Other values (36) 76
34.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 150
68.5%
Space Separator 29
 
13.2%
Uppercase Letter 24
 
11.0%
Decimal Number 8
 
3.7%
Final Punctuation 2
 
0.9%
Initial Punctuation 2
 
0.9%
Dash Punctuation 1
 
0.5%
Open Punctuation 1
 
0.5%
Other Punctuation 1
 
0.5%
Close Punctuation 1
 
0.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 18
12.0%
n 18
12.0%
o 17
11.3%
a 16
10.7%
t 15
10.0%
r 8
 
5.3%
i 8
 
5.3%
s 7
 
4.7%
h 7
 
4.7%
l 6
 
4.0%
Other values (11) 30
20.0%
Uppercase Letter
ValueCountFrequency (%)
C 4
16.7%
B 3
12.5%
K 2
8.3%
S 2
8.3%
A 2
8.3%
T 2
8.3%
N 2
8.3%
D 2
8.3%
J 1
 
4.2%
H 1
 
4.2%
Other values (3) 3
12.5%
Decimal Number
ValueCountFrequency (%)
0 2
25.0%
1 2
25.0%
9 2
25.0%
6 1
12.5%
2 1
12.5%
Space Separator
ValueCountFrequency (%)
29
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Other Punctuation
ValueCountFrequency (%)
# 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 174
79.5%
Common 45
 
20.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 18
 
10.3%
n 18
 
10.3%
o 17
 
9.8%
a 16
 
9.2%
t 15
 
8.6%
r 8
 
4.6%
i 8
 
4.6%
s 7
 
4.0%
h 7
 
4.0%
l 6
 
3.4%
Other values (24) 54
31.0%
Common
ValueCountFrequency (%)
29
64.4%
2
 
4.4%
0 2
 
4.4%
1 2
 
4.4%
9 2
 
4.4%
2
 
4.4%
6 1
 
2.2%
2 1
 
2.2%
1
 
2.2%
( 1
 
2.2%
Other values (2) 2
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 214
97.7%
Punctuation 4
 
1.8%
None 1
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
29
13.6%
e 18
 
8.4%
n 18
 
8.4%
o 17
 
7.9%
a 16
 
7.5%
t 15
 
7.0%
r 8
 
3.7%
i 8
 
3.7%
s 7
 
3.3%
h 7
 
3.3%
Other values (33) 71
33.2%
Punctuation
ValueCountFrequency (%)
2
50.0%
2
50.0%
None
ValueCountFrequency (%)
1
100.0%

SUBTITLE_KOR
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing2271
Missing (%)> 99.9%
Memory size17.9 KiB
2023-12-13T00:43:12.757943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length20
Mean length20
Min length20

Characters and Unicode

Total characters20
Distinct characters17
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row현대 한국의 정토사회와 인드라넷 운동
ValueCountFrequency (%)
현대 1
20.0%
한국의 1
20.0%
정토사회와 1
20.0%
인드라넷 1
20.0%
운동 1
20.0%
2023-12-13T00:43:13.123611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
20.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (7) 7
35.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16
80.0%
Space Separator 4
 
20.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (6) 6
37.5%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16
80.0%
Common 4
 
20.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (6) 6
37.5%
Common
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16
80.0%
ASCII 4
 
20.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4
100.0%
Hangul
ValueCountFrequency (%)
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (6) 6
37.5%

SUBTITLE_ORI
Text

MISSING 

Distinct9
Distinct (%)100.0%
Missing2263
Missing (%)99.6%
Memory size17.9 KiB
2023-12-13T00:43:13.465915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length89
Median length39
Mean length43.888889
Min length5

Characters and Unicode

Total characters395
Distinct characters132
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)100.0%

Sample

1st rowTHE JUNGTO SOCIETY AND THE INDRA'S NET COMMUNITY MOVEMENT IN CONTEMPORARY KOREA
2nd row성리학사상 한국성리학의 위상과 역할 - 주자성리학과 한국성리학 비교연구
3rd row “丁卯之役”及金判的再探
4th rowReading between the Lines: Automated Content Analysis of North Korean Nuclear Rhetooric
5th rowWomen’s Citizenship and Civil Society in South Korea: Gender Politics in Labor Movements
ValueCountFrequency (%)
the 3
 
5.3%
in 3
 
5.3%
society 2
 
3.5%
and 2
 
3.5%
korea 2
 
3.5%
2
 
3.5%
movements 1
 
1.8%
nuclear 1
 
1.8%
rhetooric 1
 
1.8%
women’s 1
 
1.8%
Other values (39) 39
68.4%
2023-12-13T00:43:13.897641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
51
 
12.9%
e 19
 
4.8%
o 14
 
3.5%
n 14
 
3.5%
i 14
 
3.5%
t 13
 
3.3%
N 11
 
2.8%
E 9
 
2.3%
a 8
 
2.0%
T 8
 
2.0%
Other values (122) 234
59.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 130
32.9%
Uppercase Letter 90
22.8%
Other Letter 87
22.0%
Space Separator 51
 
12.9%
Other Punctuation 9
 
2.3%
Dash Punctuation 8
 
2.0%
Decimal Number 8
 
2.0%
Close Punctuation 4
 
1.0%
Open Punctuation 4
 
1.0%
Final Punctuation 2
 
0.5%
Other values (2) 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
4.6%
4
 
4.6%
4
 
4.6%
3
 
3.4%
3
 
3.4%
3
 
3.4%
2
 
2.3%
2
 
2.3%
2
 
2.3%
2
 
2.3%
Other values (56) 58
66.7%
Lowercase Letter
ValueCountFrequency (%)
e 19
14.6%
o 14
10.8%
n 14
10.8%
i 14
10.8%
t 13
10.0%
a 8
 
6.2%
r 7
 
5.4%
s 7
 
5.4%
h 5
 
3.8%
l 4
 
3.1%
Other values (12) 25
19.2%
Uppercase Letter
ValueCountFrequency (%)
N 11
12.2%
E 9
10.0%
T 8
 
8.9%
M 7
 
7.8%
A 7
 
7.8%
O 7
 
7.8%
C 6
 
6.7%
R 6
 
6.7%
S 4
 
4.4%
I 4
 
4.4%
Other values (11) 21
23.3%
Other Punctuation
ValueCountFrequency (%)
\ 4
44.4%
: 2
22.2%
1
 
11.1%
' 1
 
11.1%
# 1
 
11.1%
Decimal Number
ValueCountFrequency (%)
1 2
25.0%
9 2
25.0%
0 2
25.0%
6 1
12.5%
2 1
12.5%
Close Punctuation
ValueCountFrequency (%)
2
50.0%
) 1
25.0%
1
25.0%
Open Punctuation
ValueCountFrequency (%)
2
50.0%
( 1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
50.0%
4
50.0%
Final Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
51
100.0%
Math Symbol
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 220
55.7%
Common 88
 
22.3%
Hangul 42
 
10.6%
Han 30
 
7.6%
Hiragana 15
 
3.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 19
 
8.6%
o 14
 
6.4%
n 14
 
6.4%
i 14
 
6.4%
t 13
 
5.9%
N 11
 
5.0%
E 9
 
4.1%
a 8
 
3.6%
T 8
 
3.6%
r 7
 
3.2%
Other values (33) 103
46.8%
Han
ValueCountFrequency (%)
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (19) 19
63.3%
Hangul
ValueCountFrequency (%)
4
 
9.5%
4
 
9.5%
4
 
9.5%
3
 
7.1%
3
 
7.1%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
Other values (16) 16
38.1%
Common
ValueCountFrequency (%)
51
58.0%
- 4
 
4.5%
\ 4
 
4.5%
4
 
4.5%
2
 
2.3%
1 2
 
2.3%
: 2
 
2.3%
9 2
 
2.3%
0 2
 
2.3%
2
 
2.3%
Other values (13) 13
 
14.8%
Hiragana
ValueCountFrequency (%)
3
20.0%
2
13.3%
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 293
74.2%
Hangul 42
 
10.6%
CJK 30
 
7.6%
Hiragana 15
 
3.8%
None 8
 
2.0%
Punctuation 7
 
1.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
51
 
17.4%
e 19
 
6.5%
o 14
 
4.8%
n 14
 
4.8%
i 14
 
4.8%
t 13
 
4.4%
N 11
 
3.8%
E 9
 
3.1%
a 8
 
2.7%
T 8
 
2.7%
Other values (46) 132
45.1%
Punctuation
ValueCountFrequency (%)
4
57.1%
1
 
14.3%
1
 
14.3%
1
 
14.3%
Hangul
ValueCountFrequency (%)
4
 
9.5%
4
 
9.5%
4
 
9.5%
3
 
7.1%
3
 
7.1%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
Other values (16) 16
38.1%
Hiragana
ValueCountFrequency (%)
3
20.0%
2
13.3%
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
None
ValueCountFrequency (%)
2
25.0%
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
CJK
ValueCountFrequency (%)
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (19) 19
63.3%

AUTHOR_ORI
Text

MISSING 

Distinct1743
Distinct (%)80.4%
Missing104
Missing (%)4.6%
Memory size17.9 KiB
2023-12-13T00:43:14.298886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length62
Mean length10.502306
Min length1

Characters and Unicode

Total characters22769
Distinct characters706
Distinct categories10 ?
Distinct scripts6 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1475 ?
Unique (%)68.0%

Sample

1st row김도영
2nd row김도영
3rd row김도영
4th row김도영
5th row김도영
ValueCountFrequency (%)
kim 114
 
2.8%
lee 53
 
1.3%
park 47
 
1.2%
김관웅 29
 
0.7%
ким 24
 
0.6%
shin 23
 
0.6%
а 20
 
0.5%
j 17
 
0.4%
yoon 15
 
0.4%
han 15
 
0.4%
Other values (2416) 3653
91.1%
2023-12-13T00:43:15.012291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1849
 
8.1%
n 1521
 
6.7%
a 1266
 
5.6%
o 1203
 
5.3%
e 1105
 
4.9%
i 985
 
4.3%
u 702
 
3.1%
h 618
 
2.7%
g 616
 
2.7%
r 590
 
2.6%
Other values (696) 12314
54.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 12739
55.9%
Uppercase Letter 4178
 
18.3%
Other Letter 2953
 
13.0%
Space Separator 1849
 
8.1%
Other Punctuation 696
 
3.1%
Dash Punctuation 308
 
1.4%
Close Punctuation 22
 
0.1%
Open Punctuation 22
 
0.1%
Decimal Number 1
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
175
 
5.9%
67
 
2.3%
66
 
2.2%
47
 
1.6%
43
 
1.5%
40
 
1.4%
36
 
1.2%
34
 
1.2%
33
 
1.1%
33
 
1.1%
Other values (566) 2379
80.6%
Lowercase Letter
ValueCountFrequency (%)
n 1521
11.9%
a 1266
 
9.9%
o 1203
 
9.4%
e 1105
 
8.7%
i 985
 
7.7%
u 702
 
5.5%
h 618
 
4.9%
g 616
 
4.8%
r 590
 
4.6%
m 388
 
3.0%
Other values (50) 3745
29.4%
Uppercase Letter
ValueCountFrequency (%)
S 428
 
10.2%
K 339
 
8.1%
H 266
 
6.4%
J 247
 
5.9%
M 207
 
5.0%
Y 183
 
4.4%
A 164
 
3.9%
C 157
 
3.8%
L 147
 
3.5%
P 144
 
3.4%
Other values (48) 1896
45.4%
Other Punctuation
ValueCountFrequency (%)
. 360
51.7%
; 286
41.1%
, 40
 
5.7%
' 10
 
1.4%
Close Punctuation
ValueCountFrequency (%)
) 18
81.8%
] 4
 
18.2%
Open Punctuation
ValueCountFrequency (%)
( 18
81.8%
[ 4
 
18.2%
Space Separator
ValueCountFrequency (%)
1849
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 308
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%
Math Symbol
ValueCountFrequency (%)
÷ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 14803
65.0%
Common 2899
 
12.7%
Cyrillic 2114
 
9.3%
Hangul 2031
 
8.9%
Han 897
 
3.9%
Katakana 25
 
0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
43
 
4.8%
28
 
3.1%
15
 
1.7%
14
 
1.6%
14
 
1.6%
12
 
1.3%
11
 
1.2%
11
 
1.2%
11
 
1.2%
10
 
1.1%
Other values (324) 728
81.2%
Hangul
ValueCountFrequency (%)
175
 
8.6%
67
 
3.3%
66
 
3.2%
47
 
2.3%
40
 
2.0%
36
 
1.8%
34
 
1.7%
33
 
1.6%
33
 
1.6%
28
 
1.4%
Other values (217) 1472
72.5%
Cyrillic
ValueCountFrequency (%)
а 241
 
11.4%
н 131
 
6.2%
А 127
 
6.0%
и 119
 
5.6%
о 83
 
3.9%
в 80
 
3.8%
е 80
 
3.8%
м 68
 
3.2%
М 66
 
3.1%
л 64
 
3.0%
Other values (52) 1055
49.9%
Latin
ValueCountFrequency (%)
n 1521
 
10.3%
a 1266
 
8.6%
o 1203
 
8.1%
e 1105
 
7.5%
i 985
 
6.7%
u 702
 
4.7%
h 618
 
4.2%
g 616
 
4.2%
r 590
 
4.0%
S 428
 
2.9%
Other values (46) 5769
39.0%
Katakana
ValueCountFrequency (%)
6
24.0%
3
12.0%
3
12.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (5) 5
20.0%
Common
ValueCountFrequency (%)
1849
63.8%
. 360
 
12.4%
- 308
 
10.6%
; 286
 
9.9%
, 40
 
1.4%
) 18
 
0.6%
( 18
 
0.6%
' 10
 
0.3%
] 4
 
0.1%
[ 4
 
0.1%
Other values (2) 2
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17696
77.7%
Cyrillic 2114
 
9.3%
Hangul 2031
 
8.9%
CJK 881
 
3.9%
Katakana 25
 
0.1%
CJK Compat Ideographs 16
 
0.1%
None 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1849
 
10.4%
n 1521
 
8.6%
a 1266
 
7.2%
o 1203
 
6.8%
e 1105
 
6.2%
i 985
 
5.6%
u 702
 
4.0%
h 618
 
3.5%
g 616
 
3.5%
r 590
 
3.3%
Other values (53) 7241
40.9%
Cyrillic
ValueCountFrequency (%)
а 241
 
11.4%
н 131
 
6.2%
А 127
 
6.0%
и 119
 
5.6%
о 83
 
3.9%
в 80
 
3.8%
е 80
 
3.8%
м 68
 
3.2%
М 66
 
3.1%
л 64
 
3.0%
Other values (52) 1055
49.9%
Hangul
ValueCountFrequency (%)
175
 
8.6%
67
 
3.3%
66
 
3.2%
47
 
2.3%
40
 
2.0%
36
 
1.8%
34
 
1.7%
33
 
1.6%
33
 
1.6%
28
 
1.4%
Other values (217) 1472
72.5%
CJK
ValueCountFrequency (%)
43
 
4.9%
28
 
3.2%
15
 
1.7%
14
 
1.6%
14
 
1.6%
12
 
1.4%
11
 
1.2%
11
 
1.2%
10
 
1.1%
10
 
1.1%
Other values (320) 713
80.9%
CJK Compat Ideographs
ValueCountFrequency (%)
11
68.8%
2
 
12.5%
2
 
12.5%
1
 
6.2%
Katakana
ValueCountFrequency (%)
6
24.0%
3
12.0%
3
12.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (5) 5
20.0%
None
ValueCountFrequency (%)
ð 2
33.3%
ı 1
16.7%
Þ 1
16.7%
÷ 1
16.7%
Ð 1
16.7%

AUTHOR_KOR
Text

MISSING 

Distinct472
Distinct (%)70.1%
Missing1599
Missing (%)70.4%
Memory size17.9 KiB
2023-12-13T00:43:15.414356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length3
Mean length3.2897474
Min length1

Characters and Unicode

Total characters2214
Distinct characters231
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique374 ?
Unique (%)55.6%

Sample

1st row김도영
2nd row김도영
3rd row김도영
4th row김도영
5th row김도영
ValueCountFrequency (%)
김관웅 30
 
4.3%
김도영 11
 
1.6%
김호웅 9
 
1.3%
8
 
1.1%
안평추 8
 
1.1%
정광 7
 
1.0%
오상순 6
 
0.9%
김춘선 6
 
0.9%
장광군 5
 
0.7%
장흥권 5
 
0.7%
Other values (480) 607
86.5%
2023-12-13T00:43:15.938558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
180
 
8.1%
69
 
3.1%
66
 
3.0%
50
 
2.3%
48
 
2.2%
; 44
 
2.0%
37
 
1.7%
35
 
1.6%
34
 
1.5%
31
 
1.4%
Other values (221) 1620
73.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2140
96.7%
Other Punctuation 45
 
2.0%
Space Separator 29
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
180
 
8.4%
69
 
3.2%
66
 
3.1%
50
 
2.3%
48
 
2.2%
37
 
1.7%
35
 
1.6%
34
 
1.6%
31
 
1.4%
29
 
1.4%
Other values (218) 1561
72.9%
Other Punctuation
ValueCountFrequency (%)
; 44
97.8%
, 1
 
2.2%
Space Separator
ValueCountFrequency (%)
29
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2140
96.7%
Common 74
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
180
 
8.4%
69
 
3.2%
66
 
3.1%
50
 
2.3%
48
 
2.2%
37
 
1.7%
35
 
1.6%
34
 
1.6%
31
 
1.4%
29
 
1.4%
Other values (218) 1561
72.9%
Common
ValueCountFrequency (%)
; 44
59.5%
29
39.2%
, 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2140
96.7%
ASCII 74
 
3.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
180
 
8.4%
69
 
3.2%
66
 
3.1%
50
 
2.3%
48
 
2.2%
37
 
1.7%
35
 
1.6%
34
 
1.6%
31
 
1.4%
29
 
1.4%
Other values (218) 1561
72.9%
ASCII
ValueCountFrequency (%)
; 44
59.5%
29
39.2%
, 1
 
1.4%

AUTHOR_ENG
Text

MISSING 

Distinct885
Distinct (%)83.6%
Missing1214
Missing (%)53.4%
Memory size17.9 KiB
2023-12-13T00:43:16.240487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length60
Mean length16.147448
Min length1

Characters and Unicode

Total characters17084
Distinct characters66
Distinct categories6 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique750 ?
Unique (%)70.9%

Sample

1st rowJungmin Seo;Dong-Hoon Seol
2nd rowMary Deborah Lee
3rd rowKyung-Tae Park
4th rowWhi Chang
5th rowSoo Hyun Jang
ValueCountFrequency (%)
kim 119
 
4.6%
lee 55
 
2.1%
park 48
 
1.9%
shin 22
 
0.9%
j 18
 
0.7%
han 16
 
0.6%
yoon 15
 
0.6%
s 14
 
0.5%
young 14
 
0.5%
suh 13
 
0.5%
Other values (1417) 2240
87.0%
2023-12-13T00:43:16.730727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
n 1544
 
9.0%
1519
 
8.9%
a 1278
 
7.5%
o 1199
 
7.0%
e 1116
 
6.5%
i 997
 
5.8%
u 712
 
4.2%
g 633
 
3.7%
h 604
 
3.5%
r 579
 
3.4%
Other values (56) 6903
40.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 11470
67.1%
Uppercase Letter 3375
 
19.8%
Space Separator 1519
 
8.9%
Other Punctuation 393
 
2.3%
Dash Punctuation 324
 
1.9%
Other Letter 3
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
S 413
 
12.2%
K 343
 
10.2%
H 278
 
8.2%
J 255
 
7.6%
M 206
 
6.1%
Y 189
 
5.6%
A 176
 
5.2%
C 164
 
4.9%
P 149
 
4.4%
L 148
 
4.4%
Other values (19) 1054
31.2%
Lowercase Letter
ValueCountFrequency (%)
n 1544
13.5%
a 1278
11.1%
o 1199
10.5%
e 1116
9.7%
i 997
8.7%
u 712
 
6.2%
g 633
 
5.5%
h 604
 
5.3%
r 579
 
5.0%
k 385
 
3.4%
Other values (18) 2423
21.1%
Other Punctuation
ValueCountFrequency (%)
; 204
51.9%
. 131
33.3%
, 49
 
12.5%
' 9
 
2.3%
Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Space Separator
ValueCountFrequency (%)
1519
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 324
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 14841
86.9%
Common 2236
 
13.1%
Cyrillic 4
 
< 0.1%
Hangul 3
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 1544
 
10.4%
a 1278
 
8.6%
o 1199
 
8.1%
e 1116
 
7.5%
i 997
 
6.7%
u 712
 
4.8%
g 633
 
4.3%
h 604
 
4.1%
r 579
 
3.9%
S 413
 
2.8%
Other values (43) 5766
38.9%
Common
ValueCountFrequency (%)
1519
67.9%
- 324
 
14.5%
; 204
 
9.1%
. 131
 
5.9%
, 49
 
2.2%
' 9
 
0.4%
Cyrillic
ValueCountFrequency (%)
о 1
25.0%
В 1
25.0%
Н 1
25.0%
А 1
25.0%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17076
> 99.9%
Cyrillic 4
 
< 0.1%
Hangul 3
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 1544
 
9.0%
1519
 
8.9%
a 1278
 
7.5%
o 1199
 
7.0%
e 1116
 
6.5%
i 997
 
5.8%
u 712
 
4.2%
g 633
 
3.7%
h 604
 
3.5%
r 579
 
3.4%
Other values (48) 6895
40.4%
Cyrillic
ValueCountFrequency (%)
о 1
25.0%
В 1
25.0%
Н 1
25.0%
А 1
25.0%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
None
ValueCountFrequency (%)
ı 1
100.0%

AUTHOR_ETC
Text

MISSING 

Distinct448
Distinct (%)83.6%
Missing1736
Missing (%)76.4%
Memory size17.9 KiB
2023-12-13T00:43:17.058398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length34
Mean length7.3246269
Min length1

Characters and Unicode

Total characters3926
Distinct characters450
Distinct categories9 ?
Distinct scripts5 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique385 ?
Unique (%)71.8%

Sample

1st row李奎泰
2nd row李奎泰
3rd row俊美
4th row邵雍
5th row斗音
ValueCountFrequency (%)
ким 24
 
2.8%
а 19
 
2.2%
пак 12
 
1.4%
с 12
 
1.4%
м 11
 
1.3%
10
 
1.1%
鄭光 7
 
0.8%
и 7
 
0.8%
тен 6
 
0.7%
л 6
 
0.7%
Other values (573) 756
86.9%
2023-12-13T00:43:17.634271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
337
 
8.6%
а 238
 
6.1%
. 233
 
5.9%
н 128
 
3.3%
А 124
 
3.2%
и 118
 
3.0%
о 81
 
2.1%
в 78
 
2.0%
е 76
 
1.9%
; 75
 
1.9%
Other values (440) 2438
62.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1478
37.6%
Uppercase Letter 923
23.5%
Other Letter 873
22.2%
Space Separator 337
 
8.6%
Other Punctuation 310
 
7.9%
Dash Punctuation 2
 
0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
4.6%
30
 
3.4%
15
 
1.7%
14
 
1.6%
13
 
1.5%
13
 
1.5%
12
 
1.4%
10
 
1.1%
10
 
1.1%
10
 
1.1%
Other values (330) 706
80.9%
Lowercase Letter
ValueCountFrequency (%)
а 238
16.1%
н 128
 
8.7%
и 118
 
8.0%
о 81
 
5.5%
в 78
 
5.3%
е 76
 
5.1%
м 68
 
4.6%
л 62
 
4.2%
р 59
 
4.0%
к 47
 
3.2%
Other values (41) 523
35.4%
Uppercase Letter
ValueCountFrequency (%)
А 124
 
13.4%
М 66
 
7.2%
К 63
 
6.8%
Н 60
 
6.5%
С 54
 
5.9%
В 53
 
5.7%
И 43
 
4.7%
Е 35
 
3.8%
Б 34
 
3.7%
Д 30
 
3.3%
Other values (40) 361
39.1%
Other Punctuation
ValueCountFrequency (%)
. 233
75.2%
; 75
 
24.2%
' 1
 
0.3%
, 1
 
0.3%
Space Separator
ValueCountFrequency (%)
337
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Cyrillic 2078
52.9%
Han 855
21.8%
Common 652
 
16.6%
Latin 323
 
8.2%
Katakana 18
 
0.5%

Most frequent character per script

Han
ValueCountFrequency (%)
40
 
4.7%
30
 
3.5%
15
 
1.8%
14
 
1.6%
13
 
1.5%
13
 
1.5%
12
 
1.4%
10
 
1.2%
10
 
1.2%
10
 
1.2%
Other values (316) 688
80.5%
Cyrillic
ValueCountFrequency (%)
а 238
 
11.5%
н 128
 
6.2%
А 124
 
6.0%
и 118
 
5.7%
о 81
 
3.9%
в 78
 
3.8%
е 76
 
3.7%
м 68
 
3.3%
М 66
 
3.2%
К 63
 
3.0%
Other values (52) 1038
50.0%
Latin
ValueCountFrequency (%)
n 31
 
9.6%
h 27
 
8.4%
T 26
 
8.0%
a 20
 
6.2%
e 18
 
5.6%
i 17
 
5.3%
g 16
 
5.0%
r 15
 
4.6%
N 12
 
3.7%
u 12
 
3.7%
Other values (29) 129
39.9%
Katakana
ValueCountFrequency (%)
3
16.7%
2
11.1%
2
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (4) 4
22.2%
Common
ValueCountFrequency (%)
337
51.7%
. 233
35.7%
; 75
 
11.5%
- 2
 
0.3%
' 1
 
0.2%
( 1
 
0.2%
, 1
 
0.2%
) 1
 
0.2%
3 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Cyrillic 2078
52.9%
ASCII 975
24.8%
CJK 842
21.4%
Katakana 18
 
0.5%
CJK Compat Ideographs 13
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
337
34.6%
. 233
23.9%
; 75
 
7.7%
n 31
 
3.2%
h 27
 
2.8%
T 26
 
2.7%
a 20
 
2.1%
e 18
 
1.8%
i 17
 
1.7%
g 16
 
1.6%
Other values (38) 175
17.9%
Cyrillic
ValueCountFrequency (%)
а 238
 
11.5%
н 128
 
6.2%
А 124
 
6.0%
и 118
 
5.7%
о 81
 
3.9%
в 78
 
3.8%
е 76
 
3.7%
м 68
 
3.3%
М 66
 
3.2%
К 63
 
3.0%
Other values (52) 1038
50.0%
CJK
ValueCountFrequency (%)
40
 
4.8%
30
 
3.6%
15
 
1.8%
14
 
1.7%
13
 
1.5%
13
 
1.5%
12
 
1.4%
10
 
1.2%
10
 
1.2%
10
 
1.2%
Other values (312) 675
80.2%
CJK Compat Ideographs
ValueCountFrequency (%)
8
61.5%
2
 
15.4%
2
 
15.4%
1
 
7.7%
Katakana
ValueCountFrequency (%)
3
16.7%
2
11.1%
2
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (4) 4
22.2%

ORGANIZATION_ORI
Text

MISSING 

Distinct882
Distinct (%)45.2%
Missing319
Missing (%)14.0%
Memory size17.9 KiB
2023-12-13T00:43:17.986124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length107
Median length81
Mean length17.97235
Min length1

Characters and Unicode

Total characters35100
Distinct characters509
Distinct categories11 ?
Distinct scripts7 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique622 ?
Unique (%)31.8%

Sample

1st rowUniversity of Delhi
2nd rowUniversity of Delhi
3rd rowUniversity of Delhi
4th rowUniversity of Delhi
5th rowUniversity of Delhi
ValueCountFrequency (%)
university 733
 
15.3%
of 405
 
8.4%
the 115
 
2.4%
national 82
 
1.7%
연변대학교 66
 
1.4%
studies 54
 
1.1%
연변대학 53
 
1.1%
중앙민족대학교 46
 
1.0%
им 46
 
1.0%
조선사회과학원 39
 
0.8%
Other values (1126) 3166
65.9%
2023-12-13T00:43:18.631386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2860
 
8.1%
i 2537
 
7.2%
n 2048
 
5.8%
e 1977
 
5.6%
t 1536
 
4.4%
o 1487
 
4.2%
a 1407
 
4.0%
r 1342
 
3.8%
s 1297
 
3.7%
y 1005
 
2.9%
Other values (499) 17604
50.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 22893
65.2%
Other Letter 5116
 
14.6%
Uppercase Letter 3723
 
10.6%
Space Separator 2860
 
8.1%
Other Punctuation 321
 
0.9%
Dash Punctuation 71
 
0.2%
Open Punctuation 46
 
0.1%
Close Punctuation 46
 
0.1%
Decimal Number 17
 
< 0.1%
Final Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
629
 
12.3%
580
 
11.3%
410
 
8.0%
272
 
5.3%
156
 
3.0%
131
 
2.6%
121
 
2.4%
114
 
2.2%
94
 
1.8%
88
 
1.7%
Other values (375) 2521
49.3%
Lowercase Letter
ValueCountFrequency (%)
i 2537
 
11.1%
n 2048
 
8.9%
e 1977
 
8.6%
t 1536
 
6.7%
o 1487
 
6.5%
a 1407
 
6.1%
r 1342
 
5.9%
s 1297
 
5.7%
y 1005
 
4.4%
v 852
 
3.7%
Other values (47) 7405
32.3%
Uppercase Letter
ValueCountFrequency (%)
U 836
22.5%
S 445
 
12.0%
C 213
 
5.7%
A 186
 
5.0%
T 173
 
4.6%
N 156
 
4.2%
H 133
 
3.6%
K 131
 
3.5%
I 118
 
3.2%
M 105
 
2.8%
Other values (38) 1227
33.0%
Other Punctuation
ValueCountFrequency (%)
, 115
35.8%
. 109
34.0%
; 81
25.2%
' 8
 
2.5%
& 4
 
1.2%
: 2
 
0.6%
/ 2
 
0.6%
Decimal Number
ValueCountFrequency (%)
2 9
52.9%
7 4
23.5%
3 3
 
17.6%
1 1
 
5.9%
Final Punctuation
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
Initial Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
2860
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 71
100.0%
Open Punctuation
ValueCountFrequency (%)
( 46
100.0%
Close Punctuation
ValueCountFrequency (%)
) 46
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 22429
63.9%
Cyrillic 4186
 
11.9%
Hangul 3966
 
11.3%
Common 3368
 
9.6%
Han 1130
 
3.2%
Katakana 20
 
0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
629
15.9%
580
 
14.6%
410
 
10.3%
156
 
3.9%
131
 
3.3%
121
 
3.1%
114
 
2.9%
94
 
2.4%
88
 
2.2%
82
 
2.1%
Other values (183) 1561
39.4%
Han
ValueCountFrequency (%)
272
24.1%
61
 
5.4%
51
 
4.5%
38
 
3.4%
36
 
3.2%
28
 
2.5%
24
 
2.1%
24
 
2.1%
21
 
1.9%
20
 
1.8%
Other values (171) 555
49.1%
Latin
ValueCountFrequency (%)
i 2537
 
11.3%
n 2048
 
9.1%
e 1977
 
8.8%
t 1536
 
6.8%
o 1487
 
6.6%
a 1407
 
6.3%
r 1342
 
6.0%
s 1297
 
5.8%
y 1005
 
4.5%
v 852
 
3.8%
Other values (42) 6941
30.9%
Cyrillic
ValueCountFrequency (%)
и 427
 
10.2%
а 391
 
9.3%
н 305
 
7.3%
т 296
 
7.1%
с 287
 
6.9%
е 266
 
6.4%
о 198
 
4.7%
р 171
 
4.1%
к 171
 
4.1%
в 142
 
3.4%
Other values (42) 1532
36.6%
Common
ValueCountFrequency (%)
2860
84.9%
, 115
 
3.4%
. 109
 
3.2%
; 81
 
2.4%
- 71
 
2.1%
( 46
 
1.4%
) 46
 
1.4%
2 9
 
0.3%
' 8
 
0.2%
4
 
0.1%
Other values (9) 19
 
0.6%
Katakana
ValueCountFrequency (%)
4
20.0%
4
20.0%
4
20.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Greek
ValueCountFrequency (%)
γ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25790
73.5%
Cyrillic 4186
 
11.9%
Hangul 3966
 
11.3%
CJK 1128
 
3.2%
Katakana 20
 
0.1%
Punctuation 7
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2860
 
11.1%
i 2537
 
9.8%
n 2048
 
7.9%
e 1977
 
7.7%
t 1536
 
6.0%
o 1487
 
5.8%
a 1407
 
5.5%
r 1342
 
5.2%
s 1297
 
5.0%
y 1005
 
3.9%
Other values (57) 8294
32.2%
Hangul
ValueCountFrequency (%)
629
15.9%
580
 
14.6%
410
 
10.3%
156
 
3.9%
131
 
3.3%
121
 
3.1%
114
 
2.9%
94
 
2.4%
88
 
2.2%
82
 
2.1%
Other values (183) 1561
39.4%
Cyrillic
ValueCountFrequency (%)
и 427
 
10.2%
а 391
 
9.3%
н 305
 
7.3%
т 296
 
7.1%
с 287
 
6.9%
е 266
 
6.4%
о 198
 
4.7%
р 171
 
4.1%
к 171
 
4.1%
в 142
 
3.4%
Other values (42) 1532
36.6%
CJK
ValueCountFrequency (%)
272
24.1%
61
 
5.4%
51
 
4.5%
38
 
3.4%
36
 
3.2%
28
 
2.5%
24
 
2.1%
24
 
2.1%
21
 
1.9%
20
 
1.8%
Other values (169) 553
49.0%
Punctuation
ValueCountFrequency (%)
4
57.1%
1
 
14.3%
1
 
14.3%
1
 
14.3%
Katakana
ValueCountFrequency (%)
4
20.0%
4
20.0%
4
20.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%
None
ValueCountFrequency (%)
γ 1
100.0%

ORGANIZATION_KOR
Text

MISSING 

Distinct495
Distinct (%)28.7%
Missing550
Missing (%)24.2%
Memory size17.9 KiB
2023-12-13T00:43:18.964849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length31
Mean length7.4947735
Min length1

Characters and Unicode

Total characters12906
Distinct characters381
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique255 ?
Unique (%)14.8%

Sample

1st row델리대학교
2nd row델리대학교
3rd row델리대학교
4th row델리대학교
5th row델리대학교
ValueCountFrequency (%)
연변대학교 137
 
7.4%
중앙민족대학교 71
 
3.8%
서울대학교 53
 
2.9%
조선사회과학원 39
 
2.1%
한국학중앙연구원 35
 
1.9%
푸단대학교 32
 
1.7%
고려대학교 31
 
1.7%
산둥대학교 26
 
1.4%
중앙대학교 25
 
1.4%
하와이대학교 25
 
1.4%
Other values (513) 1376
74.4%
2023-12-13T00:43:19.491882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1782
 
13.8%
1643
 
12.7%
1607
 
12.5%
277
 
2.1%
267
 
2.1%
240
 
1.9%
186
 
1.4%
173
 
1.3%
171
 
1.3%
169
 
1.3%
Other values (371) 6391
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12624
97.8%
Space Separator 128
 
1.0%
Other Punctuation 97
 
0.8%
Decimal Number 18
 
0.1%
Close Punctuation 12
 
0.1%
Dash Punctuation 12
 
0.1%
Open Punctuation 12
 
0.1%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1782
 
14.1%
1643
 
13.0%
1607
 
12.7%
277
 
2.2%
267
 
2.1%
240
 
1.9%
186
 
1.5%
173
 
1.4%
171
 
1.4%
169
 
1.3%
Other values (358) 6109
48.4%
Decimal Number
ValueCountFrequency (%)
2 9
50.0%
7 4
22.2%
3 3
 
16.7%
0 1
 
5.6%
1 1
 
5.6%
Other Punctuation
ValueCountFrequency (%)
; 86
88.7%
, 11
 
11.3%
Uppercase Letter
ValueCountFrequency (%)
M 2
66.7%
G 1
33.3%
Space Separator
ValueCountFrequency (%)
128
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12624
97.8%
Common 279
 
2.2%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1782
 
14.1%
1643
 
13.0%
1607
 
12.7%
277
 
2.2%
267
 
2.1%
240
 
1.9%
186
 
1.5%
173
 
1.4%
171
 
1.4%
169
 
1.3%
Other values (358) 6109
48.4%
Common
ValueCountFrequency (%)
128
45.9%
; 86
30.8%
) 12
 
4.3%
- 12
 
4.3%
( 12
 
4.3%
, 11
 
3.9%
2 9
 
3.2%
7 4
 
1.4%
3 3
 
1.1%
0 1
 
0.4%
Latin
ValueCountFrequency (%)
M 2
66.7%
G 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12624
97.8%
ASCII 282
 
2.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1782
 
14.1%
1643
 
13.0%
1607
 
12.7%
277
 
2.2%
267
 
2.1%
240
 
1.9%
186
 
1.5%
173
 
1.4%
171
 
1.4%
169
 
1.3%
Other values (358) 6109
48.4%
ASCII
ValueCountFrequency (%)
128
45.4%
; 86
30.5%
) 12
 
4.3%
- 12
 
4.3%
( 12
 
4.3%
, 11
 
3.9%
2 9
 
3.2%
7 4
 
1.4%
3 3
 
1.1%
M 2
 
0.7%
Other values (3) 3
 
1.1%

ORGANIZATION_ENG
Text

MISSING 

Distinct529
Distinct (%)30.6%
Missing545
Missing (%)24.0%
Memory size17.9 KiB
2023-12-13T00:43:19.940613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length122
Median length79
Mean length28
Min length6

Characters and Unicode

Total characters48356
Distinct characters67
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique288 ?
Unique (%)16.7%

Sample

1st rowUniversity of Delhi
2nd rowUniversity of Delhi
3rd rowUniversity of Delhi
4th rowUniversity of Delhi
5th rowUniversity of Delhi
ValueCountFrequency (%)
university 1495
24.4%
of 626
 
10.2%
the 184
 
3.0%
national 157
 
2.6%
yanbian 150
 
2.5%
studies 125
 
2.0%
for 87
 
1.4%
and 82
 
1.3%
central 75
 
1.2%
nationalities 73
 
1.2%
Other values (684) 3068
50.1%
2023-12-13T00:43:20.639899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 5253
 
10.9%
n 4411
 
9.1%
4395
 
9.1%
e 3864
 
8.0%
a 3073
 
6.4%
t 2984
 
6.2%
o 2648
 
5.5%
r 2573
 
5.3%
s 2556
 
5.3%
y 1978
 
4.1%
Other values (57) 14621
30.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 37912
78.4%
Uppercase Letter 5675
 
11.7%
Space Separator 4395
 
9.1%
Other Punctuation 206
 
0.4%
Dash Punctuation 92
 
0.2%
Close Punctuation 34
 
0.1%
Open Punctuation 34
 
0.1%
Final Punctuation 5
 
< 0.1%
Decimal Number 2
 
< 0.1%
Initial Punctuation 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
U 1608
28.3%
S 691
12.2%
C 389
 
6.9%
N 346
 
6.1%
T 303
 
5.3%
K 280
 
4.9%
A 258
 
4.5%
Y 227
 
4.0%
F 205
 
3.6%
H 166
 
2.9%
Other values (17) 1202
21.2%
Lowercase Letter
ValueCountFrequency (%)
i 5253
13.9%
n 4411
11.6%
e 3864
10.2%
a 3073
8.1%
t 2984
7.9%
o 2648
 
7.0%
r 2573
 
6.8%
s 2556
 
6.7%
y 1978
 
5.2%
v 1639
 
4.3%
Other values (16) 6933
18.3%
Other Punctuation
ValueCountFrequency (%)
, 84
40.8%
; 83
40.3%
' 19
 
9.2%
& 9
 
4.4%
. 9
 
4.4%
: 2
 
1.0%
Final Punctuation
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
Space Separator
ValueCountFrequency (%)
4395
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 92
100.0%
Close Punctuation
ValueCountFrequency (%)
) 34
100.0%
Open Punctuation
ValueCountFrequency (%)
( 34
100.0%
Decimal Number
ValueCountFrequency (%)
3 2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 43586
90.1%
Common 4769
 
9.9%
Cyrillic 1
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 5253
12.1%
n 4411
 
10.1%
e 3864
 
8.9%
a 3073
 
7.1%
t 2984
 
6.8%
o 2648
 
6.1%
r 2573
 
5.9%
s 2556
 
5.9%
y 1978
 
4.5%
v 1639
 
3.8%
Other values (42) 12607
28.9%
Common
ValueCountFrequency (%)
4395
92.2%
- 92
 
1.9%
, 84
 
1.8%
; 83
 
1.7%
) 34
 
0.7%
( 34
 
0.7%
' 19
 
0.4%
& 9
 
0.2%
. 9
 
0.2%
4
 
0.1%
Other values (4) 6
 
0.1%
Cyrillic
ValueCountFrequency (%)
К 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 48349
> 99.9%
Punctuation 6
 
< 0.1%
Cyrillic 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 5253
 
10.9%
n 4411
 
9.1%
4395
 
9.1%
e 3864
 
8.0%
a 3073
 
6.4%
t 2984
 
6.2%
o 2648
 
5.5%
r 2573
 
5.3%
s 2556
 
5.3%
y 1978
 
4.1%
Other values (53) 14614
30.2%
Punctuation
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%
Cyrillic
ValueCountFrequency (%)
К 1
100.0%

ORGANIZATION_ETC
Text

MISSING 

Distinct114
Distinct (%)78.1%
Missing2126
Missing (%)93.6%
Memory size17.9 KiB
2023-12-13T00:43:21.115100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length107
Median length58
Mean length34.219178
Min length1

Characters and Unicode

Total characters4996
Distinct characters147
Distinct categories9 ?
Distinct scripts5 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique97 ?
Unique (%)66.4%

Sample

1st row上海市案
2nd rowКазахстан
3rd rowКаз УМОиМЯ им. Абылай хана
4th rowКазУМОиМЯ имени Абылай хана
5th rowКазНУ им.
ValueCountFrequency (%)
им 44
 
7.1%
университет 36
 
5.8%
государственный 33
 
5.4%
институт 29
 
4.7%
и 22
 
3.6%
арабаева 16
 
2.6%
ташкентский 15
 
2.4%
казахский 11
 
1.8%
кгу 10
 
1.6%
педагогический 10
 
1.6%
Other values (189) 390
63.3%
2023-12-13T00:43:21.779630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
472
 
9.4%
и 414
 
8.3%
а 373
 
7.5%
н 297
 
5.9%
т 286
 
5.7%
с 278
 
5.6%
е 259
 
5.2%
о 192
 
3.8%
р 168
 
3.4%
к 165
 
3.3%
Other values (137) 2092
41.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3724
74.5%
Uppercase Letter 575
 
11.5%
Space Separator 472
 
9.4%
Other Punctuation 130
 
2.6%
Other Letter 79
 
1.6%
Dash Punctuation 6
 
0.1%
Open Punctuation 4
 
0.1%
Close Punctuation 4
 
0.1%
Decimal Number 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
и 414
 
11.1%
а 373
 
10.0%
н 297
 
8.0%
т 286
 
7.7%
с 278
 
7.5%
е 259
 
7.0%
о 192
 
5.2%
р 168
 
4.5%
к 165
 
4.4%
в 139
 
3.7%
Other values (41) 1153
31.0%
Other Letter
ValueCountFrequency (%)
6
 
7.6%
5
 
6.3%
3
 
3.8%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (40) 49
62.0%
Uppercase Letter
ValueCountFrequency (%)
Г 69
12.0%
У 69
12.0%
И 61
10.6%
К 60
10.4%
А 49
 
8.5%
Т 39
 
6.8%
Н 30
 
5.2%
М 24
 
4.2%
Р 23
 
4.0%
В 18
 
3.1%
Other values (26) 133
23.1%
Other Punctuation
ValueCountFrequency (%)
. 93
71.5%
, 34
 
26.2%
; 2
 
1.5%
& 1
 
0.8%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
472
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Cyrillic 4051
81.1%
Common 618
 
12.4%
Latin 248
 
5.0%
Han 76
 
1.5%
Katakana 3
 
0.1%

Most frequent character per script

Cyrillic
ValueCountFrequency (%)
и 414
 
10.2%
а 373
 
9.2%
н 297
 
7.3%
т 286
 
7.1%
с 278
 
6.9%
е 259
 
6.4%
о 192
 
4.7%
р 168
 
4.1%
к 165
 
4.1%
в 139
 
3.4%
Other values (42) 1480
36.5%
Han
ValueCountFrequency (%)
6
 
7.9%
5
 
6.6%
3
 
3.9%
3
 
3.9%
3
 
3.9%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (37) 46
60.5%
Latin
ValueCountFrequency (%)
i 31
 
12.5%
a 23
 
9.3%
c 20
 
8.1%
h 19
 
7.7%
n 18
 
7.3%
g 16
 
6.5%
N 12
 
4.8%
u 10
 
4.0%
t 10
 
4.0%
H 10
 
4.0%
Other values (25) 79
31.9%
Common
ValueCountFrequency (%)
472
76.4%
. 93
 
15.0%
, 34
 
5.5%
- 6
 
1.0%
( 4
 
0.6%
) 4
 
0.6%
; 2
 
0.3%
1 1
 
0.2%
2 1
 
0.2%
& 1
 
0.2%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Cyrillic 4051
81.1%
ASCII 866
 
17.3%
CJK 76
 
1.5%
Katakana 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
472
54.5%
. 93
 
10.7%
, 34
 
3.9%
i 31
 
3.6%
a 23
 
2.7%
c 20
 
2.3%
h 19
 
2.2%
n 18
 
2.1%
g 16
 
1.8%
N 12
 
1.4%
Other values (35) 128
 
14.8%
Cyrillic
ValueCountFrequency (%)
и 414
 
10.2%
а 373
 
9.2%
н 297
 
7.3%
т 286
 
7.1%
с 278
 
6.9%
е 259
 
6.4%
о 192
 
4.7%
р 168
 
4.1%
к 165
 
4.1%
в 139
 
3.4%
Other values (42) 1480
36.5%
CJK
ValueCountFrequency (%)
6
 
7.9%
5
 
6.6%
3
 
3.9%
3
 
3.9%
3
 
3.9%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (37) 46
60.5%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

NATION
Text

MISSING 

Distinct64
Distinct (%)3.5%
Missing436
Missing (%)19.2%
Memory size17.9 KiB
2023-12-13T00:43:22.023807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length9
Mean length9.1633987
Min length9

Characters and Unicode

Total characters16824
Distinct characters27
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)1.3%

Sample

1st rowNATION_IN
2nd rowNATION_IN
3rd rowNATION_IN
4th rowNATION_IN
5th rowNATION_IN
ValueCountFrequency (%)
nation_cn 508
27.7%
nation_kr 458
24.9%
nation_us 157
 
8.6%
nation_jp 112
 
6.1%
nation_au 75
 
4.1%
nation_uz 60
 
3.3%
nation_fr 42
 
2.3%
nation_kp 40
 
2.2%
nation_kz 38
 
2.1%
nation_ru 31
 
1.7%
Other values (54) 315
17.2%
2023-12-13T00:43:22.401396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
N 4318
25.7%
A 1973
11.7%
T 1905
11.3%
I 1894
11.3%
O 1883
11.2%
_ 1866
11.1%
K 581
 
3.5%
R 564
 
3.4%
C 541
 
3.2%
U 344
 
2.0%
Other values (17) 955
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 14928
88.7%
Connector Punctuation 1866
 
11.1%
Other Punctuation 30
 
0.2%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
N 4318
28.9%
A 1973
13.2%
T 1905
12.8%
I 1894
12.7%
O 1883
12.6%
K 581
 
3.9%
R 564
 
3.8%
C 541
 
3.6%
U 344
 
2.3%
S 170
 
1.1%
Other values (15) 755
 
5.1%
Connector Punctuation
ValueCountFrequency (%)
_ 1866
100.0%
Other Punctuation
ValueCountFrequency (%)
; 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 14928
88.7%
Common 1896
 
11.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 4318
28.9%
A 1973
13.2%
T 1905
12.8%
I 1894
12.7%
O 1883
12.6%
K 581
 
3.9%
R 564
 
3.8%
C 541
 
3.6%
U 344
 
2.3%
S 170
 
1.1%
Other values (15) 755
 
5.1%
Common
ValueCountFrequency (%)
_ 1866
98.4%
; 30
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 16824
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 4318
25.7%
A 1973
11.7%
T 1905
11.3%
I 1894
11.3%
O 1883
11.2%
_ 1866
11.1%
K 581
 
3.5%
R 564
 
3.4%
C 541
 
3.2%
U 344
 
2.0%
Other values (17) 955
 
5.7%

SUPPORT
Categorical

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
SUPPORT_C
1678 
SUPPORT_J
319 
SUPPORT_R
273 
SUPPORT_P
 
2

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSUPPORT_R
2nd rowSUPPORT_R
3rd rowSUPPORT_R
4th rowSUPPORT_R
5th rowSUPPORT_R

Common Values

ValueCountFrequency (%)
SUPPORT_C 1678
73.9%
SUPPORT_J 319
 
14.0%
SUPPORT_R 273
 
12.0%
SUPPORT_P 2
 
0.1%

Length

2023-12-13T00:43:22.532618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:22.634634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
support_c 1678
73.9%
support_j 319
 
14.0%
support_r 273
 
12.0%
support_p 2
 
0.1%

CATEGORY
Categorical

Distinct13
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
CATEGORY_H
629 
CATEGORY_B
350 
CATEGORY_G
284 
CATEGORY_E
276 
CATEGORY_C
200 
Other values (8)
533 

Length

Max length10
Median length10
Mean length9.9894366
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCATEGORY_H
2nd rowCATEGORY_H
3rd rowCATEGORY_H
4th rowCATEGORY_H
5th rowCATEGORY_H

Common Values

ValueCountFrequency (%)
CATEGORY_H 629
27.7%
CATEGORY_B 350
15.4%
CATEGORY_G 284
12.5%
CATEGORY_E 276
12.1%
CATEGORY_C 200
 
8.8%
CATEGORY_L 121
 
5.3%
CATEGORY_I 119
 
5.2%
CATEGORY_D 102
 
4.5%
CATEGORY_N 99
 
4.4%
CATEGORY_J 76
 
3.3%
Other values (3) 16
 
0.7%

Length

2023-12-13T00:43:22.753499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
category_h 629
27.7%
category_b 350
15.4%
category_g 284
12.5%
category_e 276
12.1%
category_c 200
 
8.8%
category_l 121
 
5.3%
category_i 119
 
5.2%
category_d 102
 
4.5%
category_n 99
 
4.4%
category_j 76
 
3.3%
Other values (3) 16
 
0.7%

LANGUAGE
Categorical

Distinct17
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
LANGUAGE_EN
949 
LANGUAGE_KO
726 
LANGUAGE_ZH
249 
LANGUAGE_RU
199 
LANGUAGE_JA
 
63
Other values (12)
 
86

Length

Max length11
Median length11
Mean length10.987676
Min length4

Unique

Unique4 ?
Unique (%)0.2%

Sample

1st rowLANGUAGE_HI
2nd rowLANGUAGE_HI
3rd rowLANGUAGE_HI
4th rowLANGUAGE_HI
5th rowLANGUAGE_HI

Common Values

ValueCountFrequency (%)
LANGUAGE_EN 949
41.8%
LANGUAGE_KO 726
32.0%
LANGUAGE_ZH 249
 
11.0%
LANGUAGE_RU 199
 
8.8%
LANGUAGE_JA 63
 
2.8%
LANGUAGE_FR 40
 
1.8%
LANGUAGE_VI 17
 
0.7%
LANGUAGE_HI 11
 
0.5%
<NA> 4
 
0.2%
LANGUAGE_TH 3
 
0.1%
Other values (7) 11
 
0.5%

Length

2023-12-13T00:43:22.892350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
language_en 949
41.8%
language_ko 726
32.0%
language_zh 249
 
11.0%
language_ru 199
 
8.8%
language_ja 63
 
2.8%
language_fr 40
 
1.8%
language_vi 17
 
0.7%
language_hi 11
 
0.5%
na 4
 
0.2%
language_th 3
 
0.1%
Other values (7) 11
 
0.5%

PAPER_PAGE_START
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
0
2272 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2272
100.0%

Length

2023-12-13T00:43:23.007895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:23.098996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2272
100.0%

PAPER_PAGE_END
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
0
2272 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2272
100.0%

Length

2023-12-13T00:43:23.567171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:23.680152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2272
100.0%

PDF_PAGE_START
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
0
2272 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2272
100.0%

Length

2023-12-13T00:43:23.794884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:23.900063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2272
100.0%

PDF_PAGE_END
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
0
2272 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2272
100.0%

Length

2023-12-13T00:43:23.982986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:24.067563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2272
100.0%

PAGES
Real number (ℝ)

Distinct202
Distinct (%)8.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.889965
Minimum1
Maximum3433
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size20.1 KiB
2023-12-13T00:43:24.166929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q15
median10
Q318
95-th percentile153.45
Maximum3433
Range3432
Interquartile range (IQR)13

Descriptive statistics

Standard deviation110.01015
Coefficient of variation (CV)3.4496792
Kurtosis416.05574
Mean31.889965
Median Absolute Deviation (MAD)6
Skewness15.616348
Sum72454
Variance12102.233
MonotonicityNot monotonic
2023-12-13T00:43:24.312161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 177
 
7.8%
10 131
 
5.8%
6 129
 
5.7%
7 124
 
5.5%
4 116
 
5.1%
8 112
 
4.9%
1 109
 
4.8%
12 105
 
4.6%
14 96
 
4.2%
5 95
 
4.2%
Other values (192) 1078
47.4%
ValueCountFrequency (%)
1 109
4.8%
2 177
7.8%
3 94
4.1%
4 116
5.1%
5 95
4.2%
6 129
5.7%
7 124
5.5%
8 112
4.9%
9 83
3.7%
10 131
5.8%
ValueCountFrequency (%)
3433 1
< 0.1%
1085 1
< 0.1%
1001 1
< 0.1%
885 1
< 0.1%
833 1
< 0.1%
740 1
< 0.1%
714 1
< 0.1%
633 1
< 0.1%
632 1
< 0.1%
568 1
< 0.1%

SUB_INDEX
Real number (ℝ)

ZEROS 

Distinct112
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.268046
Minimum0
Maximum111
Zeros226
Zeros (%)9.9%
Negative0
Negative (%)0.0%
Memory size20.1 KiB
2023-12-13T00:43:24.488743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q14
median14
Q333
95-th percentile74.45
Maximum111
Range111
Interquartile range (IQR)29

Descriptive statistics

Standard deviation23.518599
Coefficient of variation (CV)1.0561591
Kurtosis1.7493786
Mean22.268046
Median Absolute Deviation (MAD)12
Skewness1.4562855
Sum50593
Variance553.12451
MonotonicityNot monotonic
2023-12-13T00:43:24.655644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 226
 
9.9%
2 97
 
4.3%
1 96
 
4.2%
3 81
 
3.6%
4 75
 
3.3%
5 72
 
3.2%
6 68
 
3.0%
7 66
 
2.9%
8 64
 
2.8%
9 60
 
2.6%
Other values (102) 1367
60.2%
ValueCountFrequency (%)
0 226
9.9%
1 96
4.2%
2 97
4.3%
3 81
 
3.6%
4 75
 
3.3%
5 72
 
3.2%
6 68
 
3.0%
7 66
 
2.9%
8 64
 
2.8%
9 60
 
2.6%
ValueCountFrequency (%)
111 1
 
< 0.1%
110 1
 
< 0.1%
109 1
 
< 0.1%
108 1
 
< 0.1%
107 1
 
< 0.1%
106 2
0.1%
105 2
0.1%
104 3
0.1%
103 3
0.1%
102 3
0.1%

PROJECTYEAR_BEGIN
Real number (ℝ)

Distinct10
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2007.8464
Minimum2005
Maximum2014
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size20.1 KiB
2023-12-13T00:43:24.812340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2005
5-th percentile2006
Q12007
median2008
Q32009
95-th percentile2010
Maximum2014
Range9
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.5402125
Coefficient of variation (CV)0.0007670968
Kurtosis-0.31577274
Mean2007.8464
Median Absolute Deviation (MAD)1
Skewness0.47981784
Sum4561827
Variance2.3722547
MonotonicityNot monotonic
2023-12-13T00:43:24.968006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
2006 555
24.4%
2007 517
22.8%
2009 491
21.6%
2008 348
15.3%
2010 268
11.8%
2011 58
 
2.6%
2012 15
 
0.7%
2005 11
 
0.5%
2014 5
 
0.2%
2013 4
 
0.2%
ValueCountFrequency (%)
2005 11
 
0.5%
2006 555
24.4%
2007 517
22.8%
2008 348
15.3%
2009 491
21.6%
2010 268
11.8%
2011 58
 
2.6%
2012 15
 
0.7%
2013 4
 
0.2%
2014 5
 
0.2%
ValueCountFrequency (%)
2014 5
 
0.2%
2013 4
 
0.2%
2012 15
 
0.7%
2011 58
 
2.6%
2010 268
11.8%
2009 491
21.6%
2008 348
15.3%
2007 517
22.8%
2006 555
24.4%
2005 11
 
0.5%

PROJECTYEAR_END
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
<NA>
2267 
2007
 
2
2008
 
2
2009
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 2267
99.8%
2007 2
 
0.1%
2008 2
 
0.1%
2009 1
 
< 0.1%

Length

2023-12-13T00:43:25.147510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:25.320190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2267
99.8%
2007 2
 
0.1%
2008 2
 
0.1%
2009 1
 
< 0.1%

PUBLISH_DATE
Date

MISSING 

Distinct14
Distinct (%)37.8%
Missing2235
Missing (%)98.4%
Memory size17.9 KiB
Minimum2004-01-01 00:00:00
Maximum2014-01-01 00:00:00
2023-12-13T00:43:25.418475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:43:25.545357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)

PUBLISHER
Text

MISSING 

Distinct24
Distinct (%)66.7%
Missing2236
Missing (%)98.4%
Memory size17.9 KiB
2023-12-13T00:43:25.811645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length27
Mean length10.555556
Min length2

Characters and Unicode

Total characters380
Distinct characters106
Distinct categories6 ?
Distinct scripts5 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)50.0%

Sample

1st row山大出版社
2nd row민족출판사
3rd row민족출판사
4th row延大出版社
5th rowBRILL
ValueCountFrequency (%)
민족출판사 6
 
12.0%
karec 3
 
6.0%
brill 3
 
6.0%
世界出版公司北京公司 2
 
4.0%
научное 2
 
4.0%
издание 2
 
4.0%
routledge 2
 
4.0%
development 2
 
4.0%
길림성잡지사 1
 
2.0%
dtudes 1
 
2.0%
Other values (26) 26
52.0%
2023-12-13T00:43:26.190234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 20
 
5.3%
14
 
3.7%
n 12
 
3.2%
t 10
 
2.6%
о 9
 
2.4%
l 9
 
2.4%
E 8
 
2.1%
o 8
 
2.1%
8
 
2.1%
a 8
 
2.1%
Other values (96) 274
72.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 192
50.5%
Other Letter 98
25.8%
Uppercase Letter 74
 
19.5%
Space Separator 14
 
3.7%
Other Punctuation 1
 
0.3%
Dash Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
8.2%
7
 
7.1%
7
 
7.1%
6
 
6.1%
6
 
6.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
Other values (33) 44
44.9%
Lowercase Letter
ValueCountFrequency (%)
e 20
 
10.4%
n 12
 
6.2%
t 10
 
5.2%
о 9
 
4.7%
l 9
 
4.7%
o 8
 
4.2%
a 8
 
4.2%
s 8
 
4.2%
а 8
 
4.2%
е 7
 
3.6%
Other values (27) 93
48.4%
Uppercase Letter
ValueCountFrequency (%)
E 8
 
10.8%
R 8
 
10.8%
L 8
 
10.8%
I 6
 
8.1%
C 5
 
6.8%
A 4
 
5.4%
N 3
 
4.1%
S 3
 
4.1%
B 3
 
4.1%
P 3
 
4.1%
Other values (13) 23
31.1%
Space Separator
ValueCountFrequency (%)
14
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 188
49.5%
Cyrillic 78
20.5%
Hangul 55
 
14.5%
Han 43
 
11.3%
Common 16
 
4.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 20
 
10.6%
n 12
 
6.4%
t 10
 
5.3%
l 9
 
4.8%
E 8
 
4.3%
o 8
 
4.3%
a 8
 
4.3%
R 8
 
4.3%
L 8
 
4.3%
s 8
 
4.3%
Other values (29) 89
47.3%
Han
ValueCountFrequency (%)
4
 
9.3%
4
 
9.3%
4
 
9.3%
4
 
9.3%
3
 
7.0%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
Other values (13) 14
32.6%
Cyrillic
ValueCountFrequency (%)
о 9
11.5%
а 8
10.3%
е 7
 
9.0%
и 7
 
9.0%
н 7
 
9.0%
с 5
 
6.4%
т 5
 
6.4%
у 4
 
5.1%
д 4
 
5.1%
в 3
 
3.8%
Other values (11) 19
24.4%
Hangul
ValueCountFrequency (%)
8
14.5%
7
12.7%
7
12.7%
6
10.9%
6
10.9%
4
 
7.3%
2
 
3.6%
2
 
3.6%
2
 
3.6%
1
 
1.8%
Other values (10) 10
18.2%
Common
ValueCountFrequency (%)
14
87.5%
' 1
 
6.2%
- 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 204
53.7%
Cyrillic 78
 
20.5%
Hangul 55
 
14.5%
CJK 43
 
11.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 20
 
9.8%
14
 
6.9%
n 12
 
5.9%
t 10
 
4.9%
l 9
 
4.4%
E 8
 
3.9%
o 8
 
3.9%
a 8
 
3.9%
R 8
 
3.9%
L 8
 
3.9%
Other values (32) 99
48.5%
Cyrillic
ValueCountFrequency (%)
о 9
11.5%
а 8
10.3%
е 7
 
9.0%
и 7
 
9.0%
н 7
 
9.0%
с 5
 
6.4%
т 5
 
6.4%
у 4
 
5.1%
д 4
 
5.1%
в 3
 
3.8%
Other values (11) 19
24.4%
Hangul
ValueCountFrequency (%)
8
14.5%
7
12.7%
7
12.7%
6
10.9%
6
10.9%
4
 
7.3%
2
 
3.6%
2
 
3.6%
2
 
3.6%
1
 
1.8%
Other values (10) 10
18.2%
CJK
ValueCountFrequency (%)
4
 
9.3%
4
 
9.3%
4
 
9.3%
4
 
9.3%
3
 
7.0%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
Other values (13) 14
32.6%

SERIAL_NUMBER
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
<NA>
2269 
420
 
1
3
 
1
10
 
1

Length

Max length4
Median length4
Mean length3.9973592
Min length1

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 2269
99.9%
420 1
 
< 0.1%
3 1
 
< 0.1%
10 1
 
< 0.1%

Length

2023-12-13T00:43:26.328319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:26.428585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2269
99.9%
420 1
 
< 0.1%
3 1
 
< 0.1%
10 1
 
< 0.1%

VOLUME
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
<NA>
2269 
28
 
1
43
 
1
13
 
1

Length

Max length4
Median length4
Mean length3.9973592
Min length2

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 2269
99.9%
28 1
 
< 0.1%
43 1
 
< 0.1%
13 1
 
< 0.1%

Length

2023-12-13T00:43:26.538971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:26.636810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2269
99.9%
28 1
 
< 0.1%
43 1
 
< 0.1%
13 1
 
< 0.1%

NUMBER
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
<NA>
2270 
1
 
1
2
 
1

Length

Max length4
Median length4
Mean length3.9973592
Min length1

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 2270
99.9%
1 1
 
< 0.1%
2 1
 
< 0.1%

Length

2023-12-13T00:43:26.752583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:26.863296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2270
99.9%
1 1
 
< 0.1%
2 1
 
< 0.1%

ISBN
Text

MISSING 

Distinct21
Distinct (%)91.3%
Missing2249
Missing (%)99.0%
Memory size17.9 KiB
2023-12-13T00:43:27.040801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length16.391304
Min length13

Characters and Unicode

Total characters377
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)87.0%

Sample

1st row978-7-5607-3674-7
2nd row7-105-08137-6
3rd row978-7-105-08237-7
4th row978 -7 -5634 -3013-0
5th row978-90-04-16440-6
ValueCountFrequency (%)
5-628-01-982-8 3
 
11.5%
7-105-08137-6 1
 
3.8%
978-7-105-09795-1 1
 
3.8%
978-90-04-18535-7 1
 
3.8%
978-1-872588-19-3 1
 
3.8%
978-4-7722-3131-2 1
 
3.8%
978-7-5634-2430-6 1
 
3.8%
978-7-5062-9567-3 1
 
3.8%
0-7734-6311-9 1
 
3.8%
978-7-105-09982-5 1
 
3.8%
Other values (14) 14
53.8%
2023-12-13T00:43:27.426423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 90
23.9%
7 43
11.4%
8 40
10.6%
9 40
10.6%
0 33
 
8.8%
5 25
 
6.6%
2 25
 
6.6%
1 24
 
6.4%
6 22
 
5.8%
3 16
 
4.2%
Other values (2) 19
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 284
75.3%
Dash Punctuation 90
 
23.9%
Space Separator 3
 
0.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
7 43
15.1%
8 40
14.1%
9 40
14.1%
0 33
11.6%
5 25
8.8%
2 25
8.8%
1 24
8.5%
6 22
7.7%
3 16
 
5.6%
4 16
 
5.6%
Dash Punctuation
ValueCountFrequency (%)
- 90
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 377
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 90
23.9%
7 43
11.4%
8 40
10.6%
9 40
10.6%
0 33
 
8.8%
5 25
 
6.6%
2 25
 
6.6%
1 24
 
6.4%
6 22
 
5.8%
3 16
 
4.2%
Other values (2) 19
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 377
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 90
23.9%
7 43
11.4%
8 40
10.6%
9 40
10.6%
0 33
 
8.8%
5 25
 
6.6%
2 25
 
6.6%
1 24
 
6.4%
6 22
 
5.8%
3 16
 
4.2%
Other values (2) 19
 
5.0%

ISSN
Text

MISSING 

Distinct9
Distinct (%)69.2%
Missing2259
Missing (%)99.4%
Memory size17.9 KiB
2023-12-13T00:43:27.618623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length9
Mean length10.153846
Min length9

Characters and Unicode

Total characters132
Distinct characters15
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)53.8%

Sample

1st row1875-0273
2nd row1449-7395
3rd row1950-4462
4th row1449-7395
5th row1875-0273
ValueCountFrequency (%)
1875-0273 3
18.8%
1449-7395 3
18.8%
issn 3
18.8%
1950-4462 1
 
6.2%
1347-7307 1
 
6.2%
0965-1942 1
 
6.2%
1741-1912 1
 
6.2%
1696-2206 1
 
6.2%
1009-3311 1
 
6.2%
1674-0866 1
 
6.2%
2023-12-13T00:43:27.970296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 18
13.6%
7 14
10.6%
- 13
9.8%
4 12
9.1%
9 12
9.1%
0 10
7.6%
3 10
7.6%
5 8
6.1%
2 8
6.1%
6 8
6.1%
Other values (5) 19
14.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 104
78.8%
Dash Punctuation 13
 
9.8%
Uppercase Letter 12
 
9.1%
Space Separator 3
 
2.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 18
17.3%
7 14
13.5%
4 12
11.5%
9 12
11.5%
0 10
9.6%
3 10
9.6%
5 8
7.7%
2 8
7.7%
6 8
7.7%
8 4
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
S 6
50.0%
I 3
25.0%
N 3
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 120
90.9%
Latin 12
 
9.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 18
15.0%
7 14
11.7%
- 13
10.8%
4 12
10.0%
9 12
10.0%
0 10
8.3%
3 10
8.3%
5 8
6.7%
2 8
6.7%
6 8
6.7%
Other values (2) 7
 
5.8%
Latin
ValueCountFrequency (%)
S 6
50.0%
I 3
25.0%
N 3
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 132
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 18
13.6%
7 14
10.6%
- 13
9.8%
4 12
9.1%
9 12
9.1%
0 10
7.6%
3 10
7.6%
5 8
6.1%
2 8
6.1%
6 8
6.1%
Other values (5) 19
14.4%

SORT_TITLE_ENG
Text

MISSING 

Distinct1371
Distinct (%)99.1%
Missing889
Missing (%)39.1%
Memory size17.9 KiB
2023-12-13T00:43:28.296600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length292
Median length137
Mean length73.38684
Min length8

Characters and Unicode

Total characters101494
Distinct characters203
Distinct categories12 ?
Distinct scripts6 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1359 ?
Unique (%)98.3%

Sample

1st rowKOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION
2nd rowKOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㄱ
3rd rowKOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㄴ
4th rowKOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㄷ
5th rowKOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㄹㅁ
ValueCountFrequency (%)
of 1113
 
7.4%
the 825
 
5.5%
in 814
 
5.4%
and 808
 
5.4%
korean 599
 
4.0%
on 252
 
1.7%
korea 225
 
1.5%
a 151
 
1.0%
south 92
 
0.6%
for 92
 
0.6%
Other values (3901) 10014
66.8%
2023-12-13T00:43:28.943116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13722
13.5%
E 9225
 
9.1%
N 8579
 
8.5%
A 8004
 
7.9%
O 7788
 
7.7%
I 7473
 
7.4%
T 6512
 
6.4%
R 5624
 
5.5%
S 5454
 
5.4%
C 3479
 
3.4%
Other values (193) 25634
25.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 86887
85.6%
Space Separator 13722
 
13.5%
Decimal Number 711
 
0.7%
Other Letter 142
 
0.1%
Other Punctuation 12
 
< 0.1%
Final Punctuation 11
 
< 0.1%
Initial Punctuation 3
 
< 0.1%
Other Symbol 2
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
 
2.1%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
2
 
1.4%
Other values (116) 121
85.2%
Uppercase Letter
ValueCountFrequency (%)
E 9225
10.6%
N 8579
 
9.9%
A 8004
 
9.2%
O 7788
 
9.0%
I 7473
 
8.6%
T 6512
 
7.5%
R 5624
 
6.5%
S 5454
 
6.3%
C 3479
 
4.0%
L 3107
 
3.6%
Other values (43) 21642
24.9%
Decimal Number
ValueCountFrequency (%)
1 160
22.5%
0 147
20.7%
9 99
13.9%
2 91
12.8%
8 48
 
6.8%
5 41
 
5.8%
6 40
 
5.6%
7 34
 
4.8%
4 27
 
3.8%
3 24
 
3.4%
Other Punctuation
ValueCountFrequency (%)
! 5
41.7%
· 4
33.3%
2
 
16.7%
1
 
8.3%
Final Punctuation
ValueCountFrequency (%)
10
90.9%
1
 
9.1%
Initial Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
13722
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
1
100.0%
Close Punctuation
ValueCountFrequency (%)
1
100.0%
Lowercase Letter
ValueCountFrequency (%)
ß 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 86582
85.3%
Common 14464
 
14.3%
Cyrillic 305
 
0.3%
Hangul 72
 
0.1%
Han 70
 
0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
 
4.2%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
1
 
1.4%
1
 
1.4%
Other values (53) 53
73.6%
Han
ValueCountFrequency (%)
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
2
 
2.9%
1
 
1.4%
1
 
1.4%
1
 
1.4%
Other values (53) 53
75.7%
Latin
ValueCountFrequency (%)
E 9225
10.7%
N 8579
9.9%
A 8004
 
9.2%
O 7788
 
9.0%
I 7473
 
8.6%
T 6512
 
7.5%
R 5624
 
6.5%
S 5454
 
6.3%
C 3479
 
4.0%
L 3107
 
3.6%
Other values (18) 21337
24.6%
Cyrillic
ValueCountFrequency (%)
А 48
15.7%
П 46
15.1%
Е 39
12.8%
О 38
12.5%
Г 32
10.5%
К 25
8.2%
И 9
 
3.0%
Ь 8
 
2.6%
С 7
 
2.3%
У 7
 
2.3%
Other values (15) 46
15.1%
Common
ValueCountFrequency (%)
13722
94.9%
1 160
 
1.1%
0 147
 
1.0%
9 99
 
0.7%
2 91
 
0.6%
8 48
 
0.3%
5 41
 
0.3%
6 40
 
0.3%
7 34
 
0.2%
4 27
 
0.2%
Other values (13) 55
 
0.4%
Greek
ValueCountFrequency (%)
Γ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 101017
99.5%
Cyrillic 305
 
0.3%
CJK 67
 
0.1%
Hangul 57
 
0.1%
Punctuation 15
 
< 0.1%
Compat Jamo 15
 
< 0.1%
None 13
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%
Letterlike Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13722
13.6%
E 9225
 
9.1%
N 8579
 
8.5%
A 8004
 
7.9%
O 7788
 
7.7%
I 7473
 
7.4%
T 6512
 
6.4%
R 5624
 
5.6%
S 5454
 
5.4%
C 3479
 
3.4%
Other values (28) 25157
24.9%
Cyrillic
ValueCountFrequency (%)
А 48
15.7%
П 46
15.1%
Е 39
12.8%
О 38
12.5%
Г 32
10.5%
К 25
8.2%
И 9
 
3.0%
Ь 8
 
2.6%
С 7
 
2.3%
У 7
 
2.3%
Other values (15) 46
15.1%
Punctuation
ValueCountFrequency (%)
10
66.7%
2
 
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
None
ValueCountFrequency (%)
· 4
30.8%
2
15.4%
2
15.4%
1
 
7.7%
1
 
7.7%
ß 1
 
7.7%
1
 
7.7%
Γ 1
 
7.7%
Hangul
ValueCountFrequency (%)
3
 
5.3%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
1
 
1.8%
1
 
1.8%
Other values (38) 38
66.7%
CJK
ValueCountFrequency (%)
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
1
 
1.5%
1
 
1.5%
Other values (50) 50
74.6%
Letterlike Symbols
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (5) 5
33.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

SORT_TITLE_KOR
Text

MISSING 

Distinct2063
Distinct (%)98.0%
Missing166
Missing (%)7.3%
Memory size17.9 KiB
2023-12-13T00:43:29.400580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length57
Mean length26.273504
Min length3

Characters and Unicode

Total characters55332
Distinct characters1364
Distinct categories6 ?
Distinct scripts5 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2020 ?
Unique (%)95.9%

Sample

1st row힌디어로 발음하는 인도의 한영 대사전
2nd row힌디어로 발음하는 인도의 한영 대사전 ㄱ기역
3rd row힌디어로 발음하는 인도의 한영 대사전 ㄴ니은
4th row힌디어로 발음하는 인도의 한영 대사전 ㄷ디귿
5th row힌디어로 발음하는 인도의 한영 대사전 ㄹ ㅁ리을 미음
ValueCountFrequency (%)
대한 297
 
2.2%
한국 223
 
1.6%
한국어 217
 
1.6%
연구 170
 
1.3%
중심으로 156
 
1.2%
한국의 89
 
0.7%
대하여 72
 
0.5%
중국 65
 
0.5%
63
 
0.5%
분석 51
 
0.4%
Other values (6974) 12120
89.6%
2023-12-13T00:43:30.086418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11682
 
21.1%
2468
 
4.5%
1780
 
3.2%
1378
 
2.5%
925
 
1.7%
901
 
1.6%
844
 
1.5%
780
 
1.4%
666
 
1.2%
537
 
1.0%
Other values (1354) 33371
60.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41813
75.6%
Space Separator 11682
 
21.1%
Decimal Number 880
 
1.6%
Uppercase Letter 850
 
1.5%
Other Punctuation 105
 
0.2%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2468
 
5.9%
1780
 
4.3%
1378
 
3.3%
925
 
2.2%
901
 
2.2%
844
 
2.0%
780
 
1.9%
666
 
1.6%
537
 
1.3%
526
 
1.3%
Other values (1297) 31008
74.2%
Uppercase Letter
ValueCountFrequency (%)
A 87
 
10.2%
I 76
 
8.9%
E 65
 
7.6%
O 64
 
7.5%
N 63
 
7.4%
T 56
 
6.6%
L 41
 
4.8%
S 37
 
4.4%
C 36
 
4.2%
R 36
 
4.2%
Other values (31) 289
34.0%
Decimal Number
ValueCountFrequency (%)
1 195
22.2%
0 183
20.8%
9 121
13.8%
2 118
13.4%
8 63
 
7.2%
5 48
 
5.5%
3 43
 
4.9%
6 41
 
4.7%
7 38
 
4.3%
4 30
 
3.4%
Other Punctuation
ValueCountFrequency (%)
· 97
92.4%
! 4
 
3.8%
2
 
1.9%
2
 
1.9%
Space Separator
ValueCountFrequency (%)
11682
100.0%
Dash Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40027
72.3%
Common 12669
 
22.9%
Han 1786
 
3.2%
Latin 813
 
1.5%
Cyrillic 37
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2468
 
6.2%
1780
 
4.4%
1378
 
3.4%
925
 
2.3%
901
 
2.3%
844
 
2.1%
780
 
1.9%
666
 
1.7%
537
 
1.3%
526
 
1.3%
Other values (702) 29222
73.0%
Han
ValueCountFrequency (%)
50
 
2.8%
37
 
2.1%
35
 
2.0%
33
 
1.8%
29
 
1.6%
29
 
1.6%
26
 
1.5%
25
 
1.4%
25
 
1.4%
20
 
1.1%
Other values (585) 1477
82.7%
Latin
ValueCountFrequency (%)
A 87
 
10.7%
I 76
 
9.3%
E 65
 
8.0%
O 64
 
7.9%
N 63
 
7.7%
T 56
 
6.9%
L 41
 
5.0%
S 37
 
4.6%
C 36
 
4.4%
R 36
 
4.4%
Other values (15) 252
31.0%
Common
ValueCountFrequency (%)
11682
92.2%
1 195
 
1.5%
0 183
 
1.4%
9 121
 
1.0%
2 118
 
0.9%
· 97
 
0.8%
8 63
 
0.5%
5 48
 
0.4%
3 43
 
0.3%
6 41
 
0.3%
Other values (6) 78
 
0.6%
Cyrillic
ValueCountFrequency (%)
Е 5
13.5%
И 4
10.8%
А 4
10.8%
О 3
8.1%
Н 3
8.1%
С 3
8.1%
Т 2
 
5.4%
Р 2
 
5.4%
Л 2
 
5.4%
В 2
 
5.4%
Other values (6) 7
18.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40005
72.3%
ASCII 13379
 
24.2%
CJK 1768
 
3.2%
None 99
 
0.2%
Cyrillic 37
 
0.1%
Compat Jamo 22
 
< 0.1%
CJK Compat Ideographs 18
 
< 0.1%
Punctuation 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11682
87.3%
1 195
 
1.5%
0 183
 
1.4%
9 121
 
0.9%
2 118
 
0.9%
A 87
 
0.7%
I 76
 
0.6%
E 65
 
0.5%
O 64
 
0.5%
N 63
 
0.5%
Other values (27) 725
 
5.4%
Hangul
ValueCountFrequency (%)
2468
 
6.2%
1780
 
4.4%
1378
 
3.4%
925
 
2.3%
901
 
2.3%
844
 
2.1%
780
 
1.9%
666
 
1.7%
537
 
1.3%
526
 
1.3%
Other values (687) 29200
73.0%
None
ValueCountFrequency (%)
· 97
98.0%
2
 
2.0%
CJK
ValueCountFrequency (%)
50
 
2.8%
37
 
2.1%
35
 
2.0%
33
 
1.9%
29
 
1.6%
29
 
1.6%
26
 
1.5%
25
 
1.4%
25
 
1.4%
20
 
1.1%
Other values (570) 1459
82.5%
Cyrillic
ValueCountFrequency (%)
Е 5
13.5%
И 4
10.8%
А 4
10.8%
О 3
8.1%
Н 3
8.1%
С 3
8.1%
Т 2
 
5.4%
Р 2
 
5.4%
Л 2
 
5.4%
В 2
 
5.4%
Other values (6) 7
18.9%
CJK Compat Ideographs
ValueCountFrequency (%)
2
 
11.1%
2
 
11.1%
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (5) 5
27.8%
Punctuation
ValueCountFrequency (%)
2
50.0%
2
50.0%
Compat Jamo
ValueCountFrequency (%)
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (5) 5
22.7%
Distinct2242
Distinct (%)98.8%
Missing2
Missing (%)0.1%
Memory size17.9 KiB
2023-12-13T00:43:30.521990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length203
Median length130
Mean length48.989868
Min length3

Characters and Unicode

Total characters111207
Distinct characters1546
Distinct categories13 ?
Distinct scripts8 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2214 ?
Unique (%)97.5%

Sample

1st row힌디어로 발음하는 인도의 한영 대사전
2nd row힌디어로 발음하는 인도의 한영 대사전 ㄱ기역
3rd row힌디어로 발음하는 인도의 한영 대사전 ㄴ니은
4th row힌디어로 발음하는 인도의 한영 대사전 ㄷ디귿
5th row힌디어로 발음하는 인도의 한영 대사전 ㄹ ㅁ리을 미음
ValueCountFrequency (%)
of 652
 
3.7%
in 594
 
3.4%
the 561
 
3.2%
and 549
 
3.1%
korean 455
 
2.6%
korea 184
 
1.0%
on 140
 
0.8%
중심으로 122
 
0.7%
a 121
 
0.7%
в 112
 
0.6%
Other values (7339) 14178
80.2%
2023-12-13T00:43:31.353735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15690
 
14.1%
E 6839
 
6.1%
N 6354
 
5.7%
A 5850
 
5.3%
O 5564
 
5.0%
I 5485
 
4.9%
T 4796
 
4.3%
R 4222
 
3.8%
S 4056
 
3.6%
C 2546
 
2.3%
Other values (1536) 49805
44.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 75743
68.1%
Other Letter 18684
 
16.8%
Space Separator 15690
 
14.1%
Decimal Number 972
 
0.9%
Other Punctuation 90
 
0.1%
Final Punctuation 12
 
< 0.1%
Initial Punctuation 7
 
< 0.1%
Dash Punctuation 3
 
< 0.1%
Math Symbol 2
 
< 0.1%
Modifier Symbol 1
 
< 0.1%
Other values (3) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
821
 
4.4%
509
 
2.7%
426
 
2.3%
312
 
1.7%
307
 
1.6%
297
 
1.6%
288
 
1.5%
288
 
1.5%
261
 
1.4%
241
 
1.3%
Other values (1445) 14934
79.9%
Uppercase Letter
ValueCountFrequency (%)
E 6839
 
9.0%
N 6354
 
8.4%
A 5850
 
7.7%
O 5564
 
7.3%
I 5485
 
7.2%
T 4796
 
6.3%
R 4222
 
5.6%
S 4056
 
5.4%
C 2546
 
3.4%
L 2326
 
3.1%
Other values (52) 27705
36.6%
Decimal Number
ValueCountFrequency (%)
1 224
23.0%
0 194
20.0%
9 140
14.4%
2 116
11.9%
8 68
 
7.0%
5 54
 
5.6%
6 51
 
5.2%
7 45
 
4.6%
3 44
 
4.5%
4 36
 
3.7%
Other Punctuation
ValueCountFrequency (%)
· 54
60.0%
22
24.4%
5
 
5.6%
! 5
 
5.6%
2
 
2.2%
2
 
2.2%
Final Punctuation
ValueCountFrequency (%)
9
75.0%
3
 
25.0%
Initial Punctuation
ValueCountFrequency (%)
4
57.1%
3
42.9%
Dash Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Math Symbol
ValueCountFrequency (%)
÷ 1
50.0%
× 1
50.0%
Space Separator
ValueCountFrequency (%)
15690
100.0%
Modifier Symbol
ValueCountFrequency (%)
¨ 1
100.0%
Open Punctuation
ValueCountFrequency (%)
1
100.0%
Close Punctuation
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 63612
57.2%
Common 16780
 
15.1%
Hangul 13754
 
12.4%
Cyrillic 12130
 
10.9%
Han 4557
 
4.1%
Hiragana 293
 
0.3%
Katakana 80
 
0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
195
 
4.3%
148
 
3.2%
147
 
3.2%
116
 
2.5%
88
 
1.9%
68
 
1.5%
56
 
1.2%
55
 
1.2%
48
 
1.1%
43
 
0.9%
Other values (846) 3593
78.8%
Hangul
ValueCountFrequency (%)
821
 
6.0%
509
 
3.7%
426
 
3.1%
312
 
2.3%
307
 
2.2%
297
 
2.2%
288
 
2.1%
288
 
2.1%
261
 
1.9%
241
 
1.8%
Other values (528) 10004
72.7%
Hiragana
ValueCountFrequency (%)
82
28.0%
42
14.3%
33
11.3%
14
 
4.8%
14
 
4.8%
13
 
4.4%
11
 
3.8%
11
 
3.8%
10
 
3.4%
8
 
2.7%
Other values (24) 55
18.8%
Cyrillic
ValueCountFrequency (%)
О 1388
 
11.4%
И 1098
 
9.1%
Е 1074
 
8.9%
А 832
 
6.9%
Н 828
 
6.8%
Р 776
 
6.4%
С 775
 
6.4%
К 726
 
6.0%
Т 602
 
5.0%
В 504
 
4.2%
Other values (23) 3527
29.1%
Common
ValueCountFrequency (%)
15690
93.5%
1 224
 
1.3%
0 194
 
1.2%
9 140
 
0.8%
2 116
 
0.7%
8 68
 
0.4%
· 54
 
0.3%
5 54
 
0.3%
6 51
 
0.3%
7 45
 
0.3%
Other values (19) 144
 
0.9%
Latin
ValueCountFrequency (%)
E 6839
10.8%
N 6354
10.0%
A 5850
 
9.2%
O 5564
 
8.7%
I 5485
 
8.6%
T 4796
 
7.5%
R 4222
 
6.6%
S 4056
 
6.4%
C 2546
 
4.0%
L 2326
 
3.7%
Other values (18) 15574
24.5%
Katakana
ValueCountFrequency (%)
22
27.5%
14
17.5%
6
 
7.5%
4
 
5.0%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (17) 20
25.0%
Greek
ValueCountFrequency (%)
Γ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 80272
72.2%
Hangul 13737
 
12.4%
Cyrillic 12130
 
10.9%
CJK 4531
 
4.1%
Hiragana 293
 
0.3%
None 97
 
0.1%
Katakana 80
 
0.1%
CJK Compat Ideographs 26
 
< 0.1%
Punctuation 23
 
< 0.1%
Compat Jamo 17
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15690
19.5%
E 6839
 
8.5%
N 6354
 
7.9%
A 5850
 
7.3%
O 5564
 
6.9%
I 5485
 
6.8%
T 4796
 
6.0%
R 4222
 
5.3%
S 4056
 
5.1%
C 2546
 
3.2%
Other values (28) 18870
23.5%
Cyrillic
ValueCountFrequency (%)
О 1388
 
11.4%
И 1098
 
9.1%
Е 1074
 
8.9%
А 832
 
6.9%
Н 828
 
6.8%
Р 776
 
6.4%
С 775
 
6.4%
К 726
 
6.0%
Т 602
 
5.0%
В 504
 
4.2%
Other values (23) 3527
29.1%
Hangul
ValueCountFrequency (%)
821
 
6.0%
509
 
3.7%
426
 
3.1%
312
 
2.3%
307
 
2.2%
297
 
2.2%
288
 
2.1%
288
 
2.1%
261
 
1.9%
241
 
1.8%
Other values (512) 9987
72.7%
CJK
ValueCountFrequency (%)
195
 
4.3%
148
 
3.3%
147
 
3.2%
116
 
2.6%
88
 
1.9%
68
 
1.5%
56
 
1.2%
55
 
1.2%
48
 
1.1%
43
 
0.9%
Other values (829) 3567
78.7%
Hiragana
ValueCountFrequency (%)
82
28.0%
42
14.3%
33
11.3%
14
 
4.8%
14
 
4.8%
13
 
4.4%
11
 
3.8%
11
 
3.8%
10
 
3.4%
8
 
2.7%
Other values (24) 55
18.8%
None
ValueCountFrequency (%)
· 54
55.7%
22
22.7%
Ð 6
 
6.2%
5
 
5.2%
2
 
2.1%
¨ 1
 
1.0%
÷ 1
 
1.0%
Æ 1
 
1.0%
× 1
 
1.0%
1
 
1.0%
Other values (3) 3
 
3.1%
Katakana
ValueCountFrequency (%)
22
27.5%
14
17.5%
6
 
7.5%
4
 
5.0%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (17) 20
25.0%
Punctuation
ValueCountFrequency (%)
9
39.1%
4
17.4%
3
 
13.0%
3
 
13.0%
2
 
8.7%
2
 
8.7%
CJK Compat Ideographs
ValueCountFrequency (%)
4
15.4%
4
15.4%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (7) 7
26.9%
Compat Jamo
ValueCountFrequency (%)
2
 
11.8%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (6) 6
35.3%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

GANADA_TITLE_ENG
Categorical

Distinct32
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
<NA>
889 
C
162 
S
152 
P
129 
K
103 
Other values (27)
837 

Length

Max length4
Median length1
Mean length2.190581
Min length1

Unique

Unique4 ?
Unique (%)0.2%

Sample

1st rowK
2nd rowK
3rd rowK
4th rowK
5th rowK

Common Values

ValueCountFrequency (%)
<NA> 889
39.1%
C 162
 
7.1%
S 152
 
6.7%
P 129
 
5.7%
K 103
 
4.5%
R 79
 
3.5%
I 65
 
2.9%
T 65
 
2.9%
E 65
 
2.9%
A 58
 
2.6%
Other values (22) 505
22.2%

Length

2023-12-13T00:43:31.547952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 889
39.1%
c 162
 
7.1%
s 152
 
6.7%
p 129
 
5.7%
k 103
 
4.5%
r 79
 
3.5%
i 65
 
2.9%
t 65
 
2.9%
e 65
 
2.9%
a 58
 
2.6%
Other values (22) 505
22.2%

GANADA_TITLE_KOR
Categorical

Distinct34
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
538 
300 
275 
199 
<NA>
166 
Other values (29)
794 

Length

Max length4
Median length1
Mean length1.318662
Min length1

Unique

Unique10 ?
Unique (%)0.4%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
538
23.7%
300
13.2%
275
12.1%
199
 
8.8%
<NA> 166
 
7.3%
133
 
5.9%
114
 
5.0%
ETC 113
 
5.0%
97
 
4.3%
67
 
2.9%
Other values (24) 270
11.9%

Length

2023-12-13T00:43:31.737180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
538
23.7%
300
13.2%
275
12.1%
199
 
8.8%
na 166
 
7.3%
133
 
5.9%
114
 
5.0%
etc 113
 
5.0%
97
 
4.3%
67
 
2.9%
Other values (24) 270
11.9%

GANADA_TITLE_ORI
Categorical

Distinct44
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
ETC
589 
164 
138 
S
 
103
C
 
95
Other values (39)
1183 

Length

Max length3
Median length1
Mean length1.5184859
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
ETC 589
25.9%
164
 
7.2%
138
 
6.1%
S 103
 
4.5%
C 95
 
4.2%
P 88
 
3.9%
81
 
3.6%
K 77
 
3.4%
75
 
3.3%
R 61
 
2.7%
Other values (34) 801
35.3%

Length

2023-12-13T00:43:31.890386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
etc 589
25.9%
164
 
7.2%
138
 
6.1%
s 103
 
4.5%
c 95
 
4.2%
p 88
 
3.9%
81
 
3.6%
k 77
 
3.4%
75
 
3.3%
r 61
 
2.7%
Other values (34) 801
35.3%

SEARCH_YN
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
True
2232 
False
 
40
ValueCountFrequency (%)
True 2232
98.2%
False 40
 
1.8%
2023-12-13T00:43:32.018439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

IS_OPEN
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
True
2270 
False
 
2
ValueCountFrequency (%)
True 2270
99.9%
False 2
 
0.1%
2023-12-13T00:43:32.125998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

ERASE_YN
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
False
2252 
True
 
20
ValueCountFrequency (%)
False 2252
99.1%
True 20
 
0.9%
2023-12-13T00:43:32.230653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

FILE_NAME
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2272
Missing (%)100.0%
Memory size20.1 KiB

ROOT_DIR
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2272
Missing (%)100.0%
Memory size20.1 KiB

SUB_DIR
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2272
Missing (%)100.0%
Memory size20.1 KiB
Distinct2263
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
2023-12-13T00:43:32.638365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters15904
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2254 ?
Unique (%)99.2%

Sample

1st row49:10.4
2nd row49:10.6
3rd row49:10.8
4th row49:10.9
5th row49:11.1
ValueCountFrequency (%)
55:43.2 2
 
0.1%
57:02.1 2
 
0.1%
52:54.9 2
 
0.1%
55:51.5 2
 
0.1%
56:54.5 2
 
0.1%
50:58.7 2
 
0.1%
55:19.9 2
 
0.1%
52:56.7 2
 
0.1%
54:27.0 2
 
0.1%
50:45.7 1
 
< 0.1%
Other values (2253) 2253
99.2%
2023-12-13T00:43:33.270127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 2760
17.4%
: 2272
14.3%
. 2272
14.3%
4 1552
9.8%
0 1151
7.2%
3 1084
 
6.8%
1 1083
 
6.8%
2 1068
 
6.7%
6 714
 
4.5%
9 711
 
4.5%
Other values (2) 1237
7.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 11360
71.4%
Other Punctuation 4544
 
28.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 2760
24.3%
4 1552
13.7%
0 1151
10.1%
3 1084
 
9.5%
1 1083
 
9.5%
2 1068
 
9.4%
6 714
 
6.3%
9 711
 
6.3%
8 671
 
5.9%
7 566
 
5.0%
Other Punctuation
ValueCountFrequency (%)
: 2272
50.0%
. 2272
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 15904
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 2760
17.4%
: 2272
14.3%
. 2272
14.3%
4 1552
9.8%
0 1151
7.2%
3 1084
 
6.8%
1 1083
 
6.8%
2 1068
 
6.7%
6 714
 
4.5%
9 711
 
4.5%
Other values (2) 1237
7.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15904
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 2760
17.4%
: 2272
14.3%
. 2272
14.3%
4 1552
9.8%
0 1151
7.2%
3 1084
 
6.8%
1 1083
 
6.8%
2 1068
 
6.7%
6 714
 
4.5%
9 711
 
4.5%
Other values (2) 1237
7.8%

REGISTER
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
<NA>
2150 
super
 
122

Length

Max length5
Median length4
Mean length4.0536972
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 2150
94.6%
super 122
 
5.4%

Length

2023-12-13T00:43:33.494568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:33.633301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2150
94.6%
super 122
 
5.4%

MODIFIED_DATE
Text

MISSING 

Distinct79
Distinct (%)100.0%
Missing2193
Missing (%)96.5%
Memory size17.9 KiB
2023-12-13T00:43:33.873010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters553
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79 ?
Unique (%)100.0%

Sample

1st row15:43.3
2nd row24:25.9
3rd row14:25.3
4th row21:15.5
5th row13:39.7
ValueCountFrequency (%)
15:43.3 1
 
1.3%
16:38.3 1
 
1.3%
47:25.0 1
 
1.3%
33:21.4 1
 
1.3%
22:41.4 1
 
1.3%
25:25.4 1
 
1.3%
05:31.2 1
 
1.3%
07:26.0 1
 
1.3%
42:04.1 1
 
1.3%
14:43.4 1
 
1.3%
Other values (69) 69
87.3%
2023-12-13T00:43:34.231596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
: 79
14.3%
. 79
14.3%
2 61
11.0%
1 50
9.0%
3 50
9.0%
4 49
8.9%
5 46
8.3%
0 45
8.1%
6 30
 
5.4%
8 24
 
4.3%
Other values (2) 40
7.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 395
71.4%
Other Punctuation 158
 
28.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 61
15.4%
1 50
12.7%
3 50
12.7%
4 49
12.4%
5 46
11.6%
0 45
11.4%
6 30
7.6%
8 24
 
6.1%
7 23
 
5.8%
9 17
 
4.3%
Other Punctuation
ValueCountFrequency (%)
: 79
50.0%
. 79
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 553
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
: 79
14.3%
. 79
14.3%
2 61
11.0%
1 50
9.0%
3 50
9.0%
4 49
8.9%
5 46
8.3%
0 45
8.1%
6 30
 
5.4%
8 24
 
4.3%
Other values (2) 40
7.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 553
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
: 79
14.3%
. 79
14.3%
2 61
11.0%
1 50
9.0%
3 50
9.0%
4 49
8.9%
5 46
8.3%
0 45
8.1%
6 30
 
5.4%
8 24
 
4.3%
Other values (2) 40
7.2%

MODIFIER
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
<NA>
2193 
super
 
79

Length

Max length5
Median length4
Mean length4.0347711
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowsuper
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 2193
96.5%
super 79
 
3.5%

Length

2023-12-13T00:43:34.377167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:34.486706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2193
96.5%
super 79
 
3.5%

ERASE_DATE
Text

MISSING 

Distinct20
Distinct (%)100.0%
Missing2252
Missing (%)99.1%
Memory size17.9 KiB
2023-12-13T00:43:34.651042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters140
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)100.0%

Sample

1st row14:14.5
2nd row45:32.8
3rd row47:56.5
4th row55:26.8
5th row12:57.6
ValueCountFrequency (%)
14:14.5 1
 
5.0%
45:32.8 1
 
5.0%
00:22.6 1
 
5.0%
22:49.2 1
 
5.0%
16:58.4 1
 
5.0%
49:29.5 1
 
5.0%
02:08.0 1
 
5.0%
01:43.1 1
 
5.0%
42:53.4 1
 
5.0%
50:37.7 1
 
5.0%
Other values (10) 10
50.0%
2023-12-13T00:43:34.943671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
: 20
14.3%
. 20
14.3%
4 16
11.4%
2 16
11.4%
5 15
10.7%
0 10
7.1%
1 9
6.4%
3 9
6.4%
8 7
 
5.0%
7 7
 
5.0%
Other values (2) 11
7.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 100
71.4%
Other Punctuation 40
 
28.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 16
16.0%
2 16
16.0%
5 15
15.0%
0 10
10.0%
1 9
9.0%
3 9
9.0%
8 7
7.0%
7 7
7.0%
9 6
 
6.0%
6 5
 
5.0%
Other Punctuation
ValueCountFrequency (%)
: 20
50.0%
. 20
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 140
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
: 20
14.3%
. 20
14.3%
4 16
11.4%
2 16
11.4%
5 15
10.7%
0 10
7.1%
1 9
6.4%
3 9
6.4%
8 7
 
5.0%
7 7
 
5.0%
Other values (2) 11
7.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 140
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
: 20
14.3%
. 20
14.3%
4 16
11.4%
2 16
11.4%
5 15
10.7%
0 10
7.1%
1 9
6.4%
3 9
6.4%
8 7
 
5.0%
7 7
 
5.0%
Other values (2) 11
7.9%

ERASER
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size17.9 KiB
<NA>
2252 
super
 
19
test
 
1

Length

Max length5
Median length4
Mean length4.0083627
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 2252
99.1%
super 19
 
0.8%
test 1
 
< 0.1%

Length

2023-12-13T00:43:35.460415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:43:35.563211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2252
99.1%
super 19
 
0.8%
test 1
 
< 0.1%

Sample

CATALOG_IDPARENT_CATALOG_IDTITLE_ENGTITLE_KORTITLE_ORISUBTITLE_ENGSUBTITLE_KORSUBTITLE_ORIAUTHOR_ORIAUTHOR_KORAUTHOR_ENGAUTHOR_ETCORGANIZATION_ORIORGANIZATION_KORORGANIZATION_ENGORGANIZATION_ETCNATIONSUPPORTCATEGORYLANGUAGEPAPER_PAGE_STARTPAPER_PAGE_ENDPDF_PAGE_STARTPDF_PAGE_ENDPAGESSUB_INDEXPROJECTYEAR_BEGINPROJECTYEAR_ENDPUBLISH_DATEPUBLISHERSERIAL_NUMBERVOLUMENUMBERISBNISSNSORT_TITLE_ENGSORT_TITLE_KORSORT_TITLE_ORIGANADA_TITLE_ENGGANADA_TITLE_KORGANADA_TITLE_ORISEARCH_YNIS_OPENERASE_YNFILE_NAMEROOT_DIRSUB_DIRREGISTED_DATEREGISTERMODIFIED_DATEMODIFIERERASE_DATEERASER
005R420Korean-English Dictionary in Hindi Pronunciation힌디어로 발음하는 인도의 한영 대사전힌디어로 발음하는 인도의 한영 대사전<NA><NA><NA>김도영김도영<NA><NA>University of Delhi델리대학교University of Delhi<NA><NA>SUPPORT_RCATEGORY_HLANGUAGE_HI0000343302005<NA><NA><NA><NA><NA><NA><NA><NA>KOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION힌디어로 발음하는 인도의 한영 대사전힌디어로 발음하는 인도의 한영 대사전KYYN<NA><NA><NA>49:10.4<NA>15:43.3super<NA><NA>
105R42_000105R42Korean-English Dictionary in Hindi Pronunciation (ㄱ)힌디어로 발음하는 인도의 한영 대사전 - ㄱ(기역)힌디어로 발음하는 인도의 한영 대사전 - ㄱ(기역)<NA><NA><NA>김도영김도영<NA><NA>University of Delhi델리대학교University of Delhi<NA>NATION_INSUPPORT_RCATEGORY_HLANGUAGE_HI000044712005<NA><NA><NA><NA><NA><NA><NA><NA>KOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㄱ힌디어로 발음하는 인도의 한영 대사전 ㄱ기역힌디어로 발음하는 인도의 한영 대사전 ㄱ기역KYYN<NA><NA><NA>49:10.6<NA><NA><NA><NA><NA>
205R42_000205R42Korean-English Dictionary in Hindi Pronunciation (ㄴ)힌디어로 발음하는 인도의 한영 대사전 - ㄴ(니은)힌디어로 발음하는 인도의 한영 대사전 - ㄴ(니은)<NA><NA><NA>김도영김도영<NA><NA>University of Delhi델리대학교University of Delhi<NA>NATION_INSUPPORT_RCATEGORY_HLANGUAGE_HI000019722005<NA><NA><NA><NA><NA><NA><NA><NA>KOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㄴ힌디어로 발음하는 인도의 한영 대사전 ㄴ니은힌디어로 발음하는 인도의 한영 대사전 ㄴ니은KYYN<NA><NA><NA>49:10.8<NA><NA><NA><NA><NA>
305R42_000305R42Korean-English Dictionary in Hindi Pronunciation (ㄷ)힌디어로 발음하는 인도의 한영 대사전 - ㄷ(디귿)힌디어로 발음하는 인도의 한영 대사전 - ㄷ(디귿)<NA><NA><NA>김도영김도영<NA><NA>University of Delhi델리대학교University of Delhi<NA>NATION_INSUPPORT_RCATEGORY_HLANGUAGE_HI000032532005<NA><NA><NA><NA><NA><NA><NA><NA>KOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㄷ힌디어로 발음하는 인도의 한영 대사전 ㄷ디귿힌디어로 발음하는 인도의 한영 대사전 ㄷ디귿KYYN<NA><NA><NA>49:10.9<NA><NA><NA><NA><NA>
405R42_000405R42Korean-English Dictionary in Hindi Pronunciation (ㄹ,ㅁ)힌디어로 발음하는 인도의 한영 대사전 - ㄹ, ㅁ(리을, 미음)힌디어로 발음하는 인도의 한영 대사전 - ㄹ, ㅁ(리을, 미음)<NA><NA><NA>김도영김도영<NA><NA>University of Delhi델리대학교University of Delhi<NA>NATION_INSUPPORT_RCATEGORY_HLANGUAGE_HI000027942005<NA><NA><NA><NA><NA><NA><NA><NA>KOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㄹㅁ힌디어로 발음하는 인도의 한영 대사전 ㄹ ㅁ리을 미음힌디어로 발음하는 인도의 한영 대사전 ㄹ ㅁ리을 미음KYYN<NA><NA><NA>49:11.1<NA><NA><NA><NA><NA>
505R42_000505R42Korean-English Dictionary in Hindi Pronunciation (ㅂ)힌디어로 발음하는 인도의 한영 대사전 - ㅂ(비읍)힌디어로 발음하는 인도의 한영 대사전 - ㅂ(비읍)<NA><NA><NA>김도영김도영<NA><NA>University of Delhi델리대학교University of Delhi<NA>NATION_INSUPPORT_RCATEGORY_HLANGUAGE_HI000024352005<NA><NA><NA><NA><NA><NA><NA><NA>KOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㅂ힌디어로 발음하는 인도의 한영 대사전 ㅂ비읍힌디어로 발음하는 인도의 한영 대사전 ㅂ비읍KYYN<NA><NA><NA>49:11.2<NA><NA><NA><NA><NA>
605R42_000605R42Korean-English Dictionary in Hindi Pronunciation (ㅅ)힌디어로 발음하는 인도의 한영 대사전 - ㅅ(시옷)힌디어로 발음하는 인도의 한영 대사전 - ㅅ(시옷)<NA><NA><NA>김도영김도영<NA><NA>University of Delhi델리대학교University of Delhi<NA>NATION_INSUPPORT_RCATEGORY_HLANGUAGE_HI000051362005<NA><NA><NA><NA><NA><NA><NA><NA>KOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㅅ힌디어로 발음하는 인도의 한영 대사전 ㅅ시옷힌디어로 발음하는 인도의 한영 대사전 ㅅ시옷KYYN<NA><NA><NA>49:11.4<NA><NA><NA><NA><NA>
705R42_000705R42Korean-English Dictionary in Hindi Pronunciation (ㅇ)힌디어로 발음하는 인도의 한영 대사전 - ㅇ(이응)힌디어로 발음하는 인도의 한영 대사전 - ㅇ(이응)<NA><NA><NA>김도영김도영<NA><NA>University of Delhi델리대학교University of Delhi<NA>NATION_INSUPPORT_RCATEGORY_HLANGUAGE_HI000047772005<NA><NA><NA><NA><NA><NA><NA><NA>KOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㅇ힌디어로 발음하는 인도의 한영 대사전 ㅇ이응힌디어로 발음하는 인도의 한영 대사전 ㅇ이응KYYN<NA><NA><NA>49:11.5<NA><NA><NA><NA><NA>
805R42_000805R42Korean-English Dictionary in Hindi Pronunciation (ㅈ)힌디어로 발음하는 인도의 한영 대사전 - ㅈ(지읏)힌디어로 발음하는 인도의 한영 대사전 - ㅈ(지읏)<NA><NA><NA>김도영김도영<NA><NA>University of Delhi델리대학교University of Delhi<NA>NATION_INSUPPORT_RCATEGORY_HLANGUAGE_HI000043182005<NA><NA><NA><NA><NA><NA><NA><NA>KOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㅈ힌디어로 발음하는 인도의 한영 대사전 ㅈ지읏힌디어로 발음하는 인도의 한영 대사전 ㅈ지읏KYYN<NA><NA><NA>49:11.7<NA><NA><NA><NA><NA>
905R42_000905R42Korean-English Dictionary in Hindi Pronunciation (ㅊㅋㅌㅍㅎ)힌디어로 발음하는 인도의 한영 대사전 - ㅊㅋㅌㅍㅎ(치읓, 키읔, 티읕, 피읖, 히읗)힌디어로 발음하는 인도의 한영 대사전 - ㅊㅋㅌㅍㅎ(치읓, 키읔, 티읕, 피읖, 히읗)<NA><NA><NA>김도영김도영<NA><NA>University of Delhi델리대학교University of Delhi<NA>NATION_INSUPPORT_RCATEGORY_HLANGUAGE_HI000052192005<NA><NA><NA><NA><NA><NA><NA><NA>KOREANENGLISH DICTIONARY IN HINDI PRONUNCIATION ㅊㅋㅌㅍㅎ힌디어로 발음하는 인도의 한영 대사전 ㅊㅋㅌㅍㅎ치읓 키읔 티읕 피읖 히읗힌디어로 발음하는 인도의 한영 대사전 ㅊㅋㅌㅍㅎ치읓 키읔 티읕 피읖 히읗KYYN<NA><NA><NA>49:11.9<NA><NA><NA><NA><NA>
CATALOG_IDPARENT_CATALOG_IDTITLE_ENGTITLE_KORTITLE_ORISUBTITLE_ENGSUBTITLE_KORSUBTITLE_ORIAUTHOR_ORIAUTHOR_KORAUTHOR_ENGAUTHOR_ETCORGANIZATION_ORIORGANIZATION_KORORGANIZATION_ENGORGANIZATION_ETCNATIONSUPPORTCATEGORYLANGUAGEPAPER_PAGE_STARTPAPER_PAGE_ENDPDF_PAGE_STARTPDF_PAGE_ENDPAGESSUB_INDEXPROJECTYEAR_BEGINPROJECTYEAR_ENDPUBLISH_DATEPUBLISHERSERIAL_NUMBERVOLUMENUMBERISBNISSNSORT_TITLE_ENGSORT_TITLE_KORSORT_TITLE_ORIGANADA_TITLE_ENGGANADA_TITLE_KORGANADA_TITLE_ORISEARCH_YNIS_OPENERASE_YNFILE_NAMEROOT_DIRSUB_DIRREGISTED_DATEREGISTERMODIFIED_DATEMODIFIERERASE_DATEERASER
226212R840<NA><NA>ИЗУЧЕНИЕ КОРЕЙСКОГО ЯЗЫКА В УЗБЕКИСТАНЕ В СОВЕТСКИЙ И ПОСТСОВЕТСКИЙ ПЕРИОДЫ(опыт социологического исследования)<NA><NA><NA>Хан В. С.<NA>Valeriy Khan<NA>Research Center “Sharh va Tavsiya”<NA>Research Center “Sharh va Tavsiya”<NA><NA>SUPPORT_RCATEGORY_LLANGUAGE_RU00001702012<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>ИЗУЧЕНИЕ КОРЕЙСКОГО ЯЗЫКА В УЗБЕКИСТАНЕ В СОВЕТСКИЙ И ПОСТСОВЕТСКИЙ ПЕРИОДЫОПЫТ СОЦИОЛОГИЧЕСКОГО ИССЛЕДОВАНИЯ<NA><NA>ETCYYN<NA><NA><NA>06:01.5super20:12.3super<NA><NA>
226313C040Korean Trade and Investment in the Greater Mekong Sub-Region<NA>Korean Trade and Investment in the Greater Mekong Sub-Region<NA><NA><NA>John Walsh<NA>John Walsh<NA>Shinawatra University<NA>Shinawatra University<NA><NA>SUPPORT_CCATEGORY_DLANGUAGE_EN00001502013<NA><NA><NA><NA><NA><NA><NA><NA>KOREAN TRADE AND INVESTMENT IN THE GREATER MEKONG SUBREGION<NA>KOREAN TRADE AND INVESTMENT IN THE GREATER MEKONG SUBREGIONK<NA>KYYN<NA><NA><NA>41:30.3super42:24.3super<NA><NA>
226413C130Biography in East Asia, 1400~1900<NA>Biography in East Asia, 1400~1900<NA><NA><NA><NA><NA><NA><NA>King's University College at Western University<NA>King's University College at Western University<NA><NA>SUPPORT_CCATEGORY_BLANGUAGE_EN000035702013<NA><NA><NA><NA><NA><NA><NA><NA>BIOGRAPHY IN EAST ASIA 14001900<NA>BIOGRAPHY IN EAST ASIA 14001900B<NA>BYYN<NA><NA><NA>38:23.5super18:55.1super<NA><NA>
226513C230Becoming an Adult in East Asia: Comparative and Interdisciplinary Approaches<NA>Becoming an Adult in East Asia: Comparative and Interdisciplinary Approaches<NA><NA><NA><NA><NA><NA><NA>Department of Sociology, University of Pennsylvania<NA>Department of Sociology, University of Pennsylvania<NA><NA>SUPPORT_CCATEGORY_ELANGUAGE_EN0000702013<NA><NA><NA><NA><NA><NA><NA><NA>BECOMING AN ADULT IN EAST ASIA COMPARATIVE AND INTERDISCIPLINARY APPROACHES<NA>BECOMING AN ADULT IN EAST ASIA COMPARATIVE AND INTERDISCIPLINARY APPROACHESB<NA>BYYY<NA><NA><NA>31:55.6super32:52.5super35:57.9super
226613C250Religious markets in Korea in comparative perspective<NA>Religious markets in Korea in comparative perspective<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>SUPPORT_CCATEGORY_ELANGUAGE_EN00005302013<NA><NA><NA><NA><NA><NA><NA><NA>RELIGIOUS MARKETS IN KOREA IN COMPARATIVE PERSPECTIVE<NA>RELIGIOUS MARKETS IN KOREA IN COMPARATIVE PERSPECTIVER<NA>RYYN<NA><NA><NA>54:27.0super<NA><NA><NA><NA>
226714C010Korean Studies Association of Australasia Postgraduate Students' Workshop<NA>Korean Studies Association of Australasia Postgraduate Students' Workshop<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>SUPPORT_C<NA>LANGUAGE_EN00002902014<NA>2014<NA><NA><NA><NA><NA><NA>KOREAN STUDIES ASSOCIATION OF AUSTRALASIA POSTGRADUATE STUDENTS WORKSHOP<NA>KOREAN STUDIES ASSOCIATION OF AUSTRALASIA POSTGRADUATE STUDENTS WORKSHOPK<NA>KYYN<NA><NA><NA>10:05.6super18:14.8super<NA><NA>
226814C080Korea at the Crossroads: Geopolitics, Economics and Political System<NA>Korea at the Crossroads: Geopolitics, Economics and Political System<NA><NA><NA><NA><NA><NA><NA>Asia Center, Center etudes Asie<NA>Asia Center, Center etudes Asie<NA><NA>SUPPORT_CCATEGORY_CLANGUAGE_EN00005502014<NA><NA><NA><NA><NA><NA><NA><NA>KOREA AT THE CROSSROADS GEOPOLITICS ECONOMICS AND POLITICAL SYSTEM<NA>KOREA AT THE CROSSROADS GEOPOLITICS ECONOMICS AND POLITICAL SYSTEMK<NA>KYYN<NA><NA><NA>42:30.3super20:17.0super<NA><NA>
226914C110The Spirit of Korean Philosophy: 6 Great Debates and their Significance for Asian and Western Philosophies<NA>The Spirit of Korean Philosophy: 6 Great Debates and their Significance for Asian and Western Philosophies<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>SUPPORT_CCATEGORY_ELANGUAGE_EN000020602014<NA><NA><NA><NA><NA><NA><NA><NA>SPIRIT OF KOREAN PHILOSOPHY 6 GREAT DEBATES AND THEIR SIGNIFICANCE FOR ASIAN AND WESTERN PHILOSOPHIES<NA>SPIRIT OF KOREAN PHILOSOPHY 6 GREAT DEBATES AND THEIR SIGNIFICANCE FOR ASIAN AND WESTERN PHILOSOPHIESS<NA>SYYN<NA><NA><NA>23:39.8super<NA><NA><NA><NA>
227014C150<NA><NA>The Great Kanto Earthquake & the Massacre of Koreans: Reconstructing Disaster and Colonial Atrocity in History and Literature 90 Years Later<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>SUPPORT_CCATEGORY_GLANGUAGE_EN00008002014<NA>2014<NA><NA><NA><NA><NA><NA><NA><NA>GREAT KANTO EARTHQUAKE THE MASSACRE OF KOREANS RECONSTRUCTING DISASTER AND COLONIAL ATROCITY IN HISTORY AND LITERATURE 90 YEARS LATER<NA><NA>GYYN<NA><NA><NA>31:50.5super<NA><NA><NA><NA>
227114C180<NA><NA>Winning Central Europe: Spread and Reception of the Korean Wave in the Czech Republic<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>SUPPORT_CCATEGORY_NLANGUAGE_EN00008402014<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>WINNING CENTRAL EUROPE SPREAD AND RECEPTION OF THE KOREAN WAVE IN THE CZECH REPUBLIC<NA><NA>WYYN<NA><NA><NA>43:20.2super<NA><NA><NA><NA>