Overview

Dataset statistics

Number of variables22
Number of observations10000
Missing cells23716
Missing cells (%)10.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 MiB
Average record size in memory189.0 B

Variable types

Text14
Categorical4
Numeric4

Dataset

Description한국연구재단이 보유하고 있는 한국학술지인용색인(KCI)에서의 논문데이터를 나타내고 있습니다. 컬럼명은 논문명, 공동저자, 학술지명, ISSN, 발행년도 등으로 다양한 데이터를 가지고 있습니다
Author한국연구재단
URLhttps://www.data.go.kr/data/15083283/fileData.do

Alerts

발행년 has constant value ""Constant
데이터기준일 has constant value ""Constant
등재구분 is highly imbalanced (75.3%)Imbalance
논문명(외국어) has 8014 (80.1%) missing valuesMissing
공동저자 has 3198 (32.0%) missing valuesMissing
학술지명(외국어) has 1163 (11.6%) missing valuesMissing
발행기관명(영문) has 1349 (13.5%) missing valuesMissing
has 1164 (11.6%) missing valuesMissing
has 498 (5.0%) missing valuesMissing
키워드(국문) has 227 (2.3%) missing valuesMissing
키워드(외국어) has 7980 (79.8%) missing valuesMissing
논문명(국문) has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:30:50.993672
Analysis finished2023-12-12 18:30:58.396811
Duration7.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

논문명(국문)
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T03:30:58.841865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length309
Median length188
Mean length60.96
Min length1

Characters and Unicode

Total characters609600
Distinct characters2469
Distinct categories20 ?
Distinct scripts8 ?
Distinct blocks16 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowTripartite Motif Containing 3 inhibits the aggressive behaviors of papillary thyroid carcinoma and indicates lower recurrence risk
2nd row한국 전통 사각소반의 ‘상판과 변죽의 형태’를 응용한 트레이 디자인 연구
3rd row청소년들의 여가태도가 운동지속수행에 미치는 영향: 건강증진행위의 매개효과
4th rowThe Inhibitory Effect of Corni Fructus against Oxidative Stress-induced Cellular Damage in C2C12 Murine Myoblasts
5th rowWonhyo’s View of Human Beings and his Redemption of Mankind
ValueCountFrequency (%)
of 3019
 
2.9%
and 1641
 
1.5%
연구 1522
 
1.4%
in 1310
 
1.2%
중심으로 1223
 
1.2%
the 1206
 
1.1%
933
 
0.9%
907
 
0.9%
a 876
 
0.8%
for 830
 
0.8%
Other values (36628) 92417
87.3%
2023-12-13T03:30:59.531457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
95902
 
15.7%
e 31093
 
5.1%
i 26534
 
4.4%
o 23812
 
3.9%
n 23788
 
3.9%
t 23686
 
3.9%
a 23666
 
3.9%
r 19048
 
3.1%
s 16053
 
2.6%
l 12711
 
2.1%
Other values (2459) 313307
51.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 279119
45.8%
Other Letter 183785
30.1%
Space Separator 95904
 
15.7%
Uppercase Letter 32833
 
5.4%
Decimal Number 5096
 
0.8%
Dash Punctuation 4738
 
0.8%
Other Punctuation 3840
 
0.6%
Close Punctuation 1426
 
0.2%
Open Punctuation 1424
 
0.2%
Math Symbol 492
 
0.1%
Other values (10) 943
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7518
 
4.1%
3989
 
2.2%
3862
 
2.1%
2800
 
1.5%
2774
 
1.5%
2693
 
1.5%
2674
 
1.5%
2655
 
1.4%
2492
 
1.4%
2266
 
1.2%
Other values (2256) 150062
81.7%
Lowercase Letter
ValueCountFrequency (%)
e 31093
11.1%
i 26534
 
9.5%
o 23812
 
8.5%
n 23788
 
8.5%
t 23686
 
8.5%
a 23666
 
8.5%
r 19048
 
6.8%
s 16053
 
5.8%
l 12711
 
4.6%
c 12066
 
4.3%
Other values (58) 66662
23.9%
Uppercase Letter
ValueCountFrequency (%)
S 3293
 
10.0%
C 3224
 
9.8%
A 2861
 
8.7%
P 2423
 
7.4%
M 2019
 
6.1%
I 1962
 
6.0%
D 1831
 
5.6%
T 1762
 
5.4%
E 1699
 
5.2%
R 1635
 
5.0%
Other values (32) 10124
30.8%
Other Punctuation
ValueCountFrequency (%)
: 1834
47.8%
, 1183
30.8%
. 296
 
7.7%
/ 215
 
5.6%
· 177
 
4.6%
' 34
 
0.9%
* 16
 
0.4%
& 13
 
0.3%
" 12
 
0.3%
¡ 11
 
0.3%
Other values (11) 49
 
1.3%
Math Symbol
ValueCountFrequency (%)
> 144
29.3%
< 143
29.1%
~ 58
11.8%
+ 47
 
9.6%
31
 
6.3%
31
 
6.3%
= 8
 
1.6%
7
 
1.4%
7
 
1.4%
4
 
0.8%
Other values (6) 12
 
2.4%
Decimal Number
ValueCountFrequency (%)
1 1169
22.9%
2 1061
20.8%
0 769
15.1%
9 561
11.0%
3 452
 
8.9%
5 277
 
5.4%
4 245
 
4.8%
8 204
 
4.0%
6 192
 
3.8%
7 166
 
3.3%
Close Punctuation
ValueCountFrequency (%)
) 1037
72.7%
229
 
16.1%
107
 
7.5%
28
 
2.0%
] 15
 
1.1%
5
 
0.4%
} 4
 
0.3%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1036
72.8%
228
 
16.0%
107
 
7.5%
28
 
2.0%
[ 15
 
1.1%
5
 
0.4%
{ 4
 
0.3%
1
 
0.1%
Other Symbol
ValueCountFrequency (%)
8
50.0%
3
 
18.8%
1
 
6.2%
1
 
6.2%
® 1
 
6.2%
° 1
 
6.2%
1
 
6.2%
Letter Number
ValueCountFrequency (%)
7
50.0%
5
35.7%
1
 
7.1%
1
 
7.1%
Dash Punctuation
ValueCountFrequency (%)
- 4331
91.4%
264
 
5.6%
143
 
3.0%
Modifier Symbol
ValueCountFrequency (%)
^ 2
50.0%
` 1
25.0%
¸ 1
25.0%
Space Separator
ValueCountFrequency (%)
95902
> 99.9%
  2
 
< 0.1%
Final Punctuation
ValueCountFrequency (%)
416
90.6%
43
 
9.4%
Initial Punctuation
ValueCountFrequency (%)
321
87.9%
44
 
12.1%
Currency Symbol
ValueCountFrequency (%)
$ 24
92.3%
¤ 2
 
7.7%
Control
ValueCountFrequency (%)
 14
93.3%
 1
 
6.7%
Connector Punctuation
ValueCountFrequency (%)
_ 34
100.0%
Format
ValueCountFrequency (%)
­ 8
100.0%
Other Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 311671
51.1%
Hangul 179431
29.4%
Common 113849
 
18.7%
Han 3844
 
0.6%
Hiragana 409
 
0.1%
Cyrillic 226
 
< 0.1%
Katakana 101
 
< 0.1%
Greek 69
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
85
 
2.2%
78
 
2.0%
48
 
1.2%
43
 
1.1%
39
 
1.0%
39
 
1.0%
33
 
0.9%
29
 
0.8%
28
 
0.7%
27
 
0.7%
Other values (1117) 3395
88.3%
Hangul
ValueCountFrequency (%)
7518
 
4.2%
3989
 
2.2%
3862
 
2.2%
2800
 
1.6%
2774
 
1.5%
2693
 
1.5%
2674
 
1.5%
2655
 
1.5%
2492
 
1.4%
2266
 
1.3%
Other values (1036) 145708
81.2%
Common
ValueCountFrequency (%)
95902
84.2%
- 4331
 
3.8%
: 1834
 
1.6%
, 1183
 
1.0%
1 1169
 
1.0%
2 1061
 
0.9%
) 1037
 
0.9%
( 1036
 
0.9%
0 769
 
0.7%
9 561
 
0.5%
Other values (79) 4966
 
4.4%
Latin
ValueCountFrequency (%)
e 31093
 
10.0%
i 26534
 
8.5%
o 23812
 
7.6%
n 23788
 
7.6%
t 23686
 
7.6%
a 23666
 
7.6%
r 19048
 
6.1%
s 16053
 
5.2%
l 12711
 
4.1%
c 12066
 
3.9%
Other values (50) 99214
31.8%
Hiragana
ValueCountFrequency (%)
67
16.4%
52
 
12.7%
39
 
9.5%
21
 
5.1%
18
 
4.4%
15
 
3.7%
14
 
3.4%
13
 
3.2%
13
 
3.2%
13
 
3.2%
Other values (37) 144
35.2%
Katakana
ValueCountFrequency (%)
5
 
5.0%
5
 
5.0%
5
 
5.0%
4
 
4.0%
4
 
4.0%
4
 
4.0%
4
 
4.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
Other values (36) 61
60.4%
Cyrillic
ValueCountFrequency (%)
и 24
 
10.6%
о 19
 
8.4%
е 19
 
8.4%
с 14
 
6.2%
а 12
 
5.3%
к 11
 
4.9%
н 11
 
4.9%
т 11
 
4.9%
р 10
 
4.4%
л 9
 
4.0%
Other values (28) 86
38.1%
Greek
ValueCountFrequency (%)
α 19
27.5%
β 18
26.1%
κ 8
11.6%
γ 5
 
7.2%
φ 3
 
4.3%
δ 2
 
2.9%
η 2
 
2.9%
Ι 2
 
2.9%
ε 2
 
2.9%
Ω 2
 
2.9%
Other values (6) 6
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 423198
69.4%
Hangul 179396
29.4%
CJK 3814
 
0.6%
None 1201
 
0.2%
Punctuation 1089
 
0.2%
Hiragana 409
 
0.1%
Cyrillic 226
 
< 0.1%
Katakana 101
 
< 0.1%
Math Operators 73
 
< 0.1%
Compat Jamo 35
 
< 0.1%
Other values (6) 58
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
95902
22.7%
e 31093
 
7.3%
i 26534
 
6.3%
o 23812
 
5.6%
n 23788
 
5.6%
t 23686
 
5.6%
a 23666
 
5.6%
r 19048
 
4.5%
s 16053
 
3.8%
l 12711
 
3.0%
Other values (86) 126905
30.0%
Hangul
ValueCountFrequency (%)
7518
 
4.2%
3989
 
2.2%
3862
 
2.2%
2800
 
1.6%
2774
 
1.5%
2693
 
1.5%
2674
 
1.5%
2655
 
1.5%
2492
 
1.4%
2266
 
1.3%
Other values (1023) 145673
81.2%
Punctuation
ValueCountFrequency (%)
416
38.2%
321
29.5%
264
24.2%
44
 
4.0%
43
 
3.9%
1
 
0.1%
None
ValueCountFrequency (%)
229
19.1%
228
19.0%
· 177
14.7%
143
11.9%
107
8.9%
107
8.9%
28
 
2.3%
28
 
2.3%
α 19
 
1.6%
β 18
 
1.5%
Other values (38) 117
9.7%
CJK
ValueCountFrequency (%)
85
 
2.2%
78
 
2.0%
48
 
1.3%
43
 
1.1%
39
 
1.0%
39
 
1.0%
33
 
0.9%
29
 
0.8%
28
 
0.7%
27
 
0.7%
Other values (1101) 3365
88.2%
Hiragana
ValueCountFrequency (%)
67
16.4%
52
 
12.7%
39
 
9.5%
21
 
5.1%
18
 
4.4%
15
 
3.7%
14
 
3.4%
13
 
3.2%
13
 
3.2%
13
 
3.2%
Other values (37) 144
35.2%
Math Operators
ValueCountFrequency (%)
31
42.5%
31
42.5%
4
 
5.5%
4
 
5.5%
2
 
2.7%
1
 
1.4%
Cyrillic
ValueCountFrequency (%)
и 24
 
10.6%
о 19
 
8.4%
е 19
 
8.4%
с 14
 
6.2%
а 12
 
5.3%
к 11
 
4.9%
н 11
 
4.9%
т 11
 
4.9%
р 10
 
4.4%
л 9
 
4.0%
Other values (28) 86
38.1%
Compat Jamo
ValueCountFrequency (%)
19
54.3%
2
 
5.7%
2
 
5.7%
2
 
5.7%
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (3) 3
 
8.6%
Box Drawing
ValueCountFrequency (%)
8
100.0%
Number Forms
ValueCountFrequency (%)
7
50.0%
5
35.7%
1
 
7.1%
1
 
7.1%
CJK Compat Ideographs
ValueCountFrequency (%)
7
23.3%
5
16.7%
3
10.0%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (6) 6
20.0%
Katakana
ValueCountFrequency (%)
5
 
5.0%
5
 
5.0%
5
 
5.0%
4
 
4.0%
4
 
4.0%
4
 
4.0%
4
 
4.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
Other values (36) 61
60.4%
Letterlike Symbols
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

논문명(외국어)
Text

MISSING 

Distinct1953
Distinct (%)98.3%
Missing8014
Missing (%)80.1%
Memory size156.2 KiB
2023-12-13T03:30:59.952936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length1024
Median length164
Mean length95.6143
Min length1

Characters and Unicode

Total characters189890
Distinct characters773
Distinct categories17 ?
Distinct scripts8 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1952 ?
Unique (%)98.3%

Sample

1st rowRecommendations for Clinical Application of Pharmacogenetic Test Results Interpretation by Clinical Laboratories
2nd rowResearch Trend Analysis of 'Korean Association of Secretarial Science': Focused on the respective Career of Secretarial Majors and Secretaries
3rd row즉각적 조직검사 대신 Prostate-Specific Antigen 추적 관찰로 유의미한 전립선암을 구분하면서도 불필요한 검사를 피할 수 있는가
4th rowDarknet Traffic Detection and Classification using Gradient Boosting Techniques
5th rowThe Effects of Lifestyle and Self-rated Health on Mental Health of Breast Cancer Survivors: Using Propensity Score Matching Approach
ValueCountFrequency (%)
of 1893
 
6.9%
the 1363
 
5.0%
on 933
 
3.4%
and 931
 
3.4%
a 652
 
2.4%
in 648
 
2.4%
for 518
 
1.9%
study 433
 
1.6%
using 238
 
0.9%
215
 
0.8%
Other values (7425) 19637
71.5%
2023-12-13T03:31:00.581003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25553
 
13.5%
e 16193
 
8.5%
n 13026
 
6.9%
i 12451
 
6.6%
o 12413
 
6.5%
t 11864
 
6.2%
a 11150
 
5.9%
r 8637
 
4.5%
s 8113
 
4.3%
l 5599
 
2.9%
Other values (763) 64891
34.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 138318
72.8%
Space Separator 25553
 
13.5%
Uppercase Letter 18523
 
9.8%
Other Letter 4052
 
2.1%
Dash Punctuation 1170
 
0.6%
Other Punctuation 854
 
0.4%
Decimal Number 791
 
0.4%
Final Punctuation 244
 
0.1%
Close Punctuation 121
 
0.1%
Open Punctuation 121
 
0.1%
Other values (7) 143
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
143
 
3.5%
99
 
2.4%
82
 
2.0%
73
 
1.8%
58
 
1.4%
55
 
1.4%
53
 
1.3%
49
 
1.2%
49
 
1.2%
47
 
1.2%
Other values (608) 3344
82.5%
Lowercase Letter
ValueCountFrequency (%)
e 16193
11.7%
n 13026
 
9.4%
i 12451
 
9.0%
o 12413
 
9.0%
t 11864
 
8.6%
a 11150
 
8.1%
r 8637
 
6.2%
s 8113
 
5.9%
l 5599
 
4.0%
c 5292
 
3.8%
Other values (53) 33580
24.3%
Uppercase Letter
ValueCountFrequency (%)
S 2184
11.8%
A 1847
 
10.0%
C 1768
 
9.5%
P 1322
 
7.1%
M 1158
 
6.3%
E 1075
 
5.8%
T 1061
 
5.7%
D 1049
 
5.7%
R 932
 
5.0%
I 874
 
4.7%
Other values (25) 5253
28.4%
Other Punctuation
ValueCountFrequency (%)
: 363
42.5%
, 246
28.8%
' 92
 
10.8%
. 82
 
9.6%
/ 25
 
2.9%
& 18
 
2.1%
" 11
 
1.3%
¡ 4
 
0.5%
; 3
 
0.4%
* 3
 
0.4%
Other values (6) 7
 
0.8%
Decimal Number
ValueCountFrequency (%)
1 176
22.3%
2 172
21.7%
0 144
18.2%
9 91
11.5%
3 61
 
7.7%
5 33
 
4.2%
8 30
 
3.8%
6 29
 
3.7%
4 29
 
3.7%
7 26
 
3.3%
Math Symbol
ValueCountFrequency (%)
< 15
30.6%
> 15
30.6%
+ 10
20.4%
4
 
8.2%
3
 
6.1%
= 2
 
4.1%
Close Punctuation
ValueCountFrequency (%)
) 109
90.1%
5
 
4.1%
4
 
3.3%
] 2
 
1.7%
1
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 109
90.1%
5
 
4.1%
4
 
3.3%
[ 2
 
1.7%
1
 
0.8%
Dash Punctuation
ValueCountFrequency (%)
- 1137
97.2%
29
 
2.5%
4
 
0.3%
Letter Number
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Final Punctuation
ValueCountFrequency (%)
218
89.3%
26
 
10.7%
Initial Punctuation
ValueCountFrequency (%)
56
67.5%
27
32.5%
Space Separator
ValueCountFrequency (%)
25553
100.0%
Currency Symbol
ValueCountFrequency (%)
¤ 3
100.0%
Format
ValueCountFrequency (%)
­ 2
100.0%
Control
ValueCountFrequency (%)
 1
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 156344
82.3%
Common 28993
 
15.3%
Hangul 3767
 
2.0%
Cyrillic 494
 
0.3%
Han 210
 
0.1%
Hiragana 49
 
< 0.1%
Katakana 26
 
< 0.1%
Greek 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
143
 
3.8%
99
 
2.6%
82
 
2.2%
73
 
1.9%
58
 
1.5%
55
 
1.5%
53
 
1.4%
49
 
1.3%
49
 
1.3%
47
 
1.2%
Other values (422) 3059
81.2%
Han
ValueCountFrequency (%)
6
 
2.9%
6
 
2.9%
6
 
2.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
3
 
1.4%
3
 
1.4%
3
 
1.4%
3
 
1.4%
Other values (143) 168
80.0%
Latin
ValueCountFrequency (%)
e 16193
 
10.4%
n 13026
 
8.3%
i 12451
 
8.0%
o 12413
 
7.9%
t 11864
 
7.6%
a 11150
 
7.1%
r 8637
 
5.5%
s 8113
 
5.2%
l 5599
 
3.6%
c 5292
 
3.4%
Other values (46) 51606
33.0%
Common
ValueCountFrequency (%)
25553
88.1%
- 1137
 
3.9%
: 363
 
1.3%
, 246
 
0.8%
218
 
0.8%
1 176
 
0.6%
2 172
 
0.6%
0 144
 
0.5%
) 109
 
0.4%
( 109
 
0.4%
Other values (44) 766
 
2.6%
Cyrillic
ValueCountFrequency (%)
е 47
 
9.5%
о 47
 
9.5%
и 45
 
9.1%
а 41
 
8.3%
н 34
 
6.9%
с 33
 
6.7%
р 27
 
5.5%
к 25
 
5.1%
т 22
 
4.5%
в 21
 
4.3%
Other values (28) 152
30.8%
Katakana
ValueCountFrequency (%)
3
11.5%
3
11.5%
3
11.5%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (7) 7
26.9%
Hiragana
ValueCountFrequency (%)
11
22.4%
10
20.4%
5
10.2%
4
 
8.2%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
Other values (6) 7
14.3%
Greek
ValueCountFrequency (%)
η 1
14.3%
φ 1
14.3%
ν 1
14.3%
ι 1
14.3%
γ 1
14.3%
α 1
14.3%
ξ 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 184930
97.4%
Hangul 3767
 
2.0%
Cyrillic 494
 
0.3%
Punctuation 357
 
0.2%
CJK 210
 
0.1%
Hiragana 49
 
< 0.1%
None 46
 
< 0.1%
Katakana 26
 
< 0.1%
Math Operators 7
 
< 0.1%
Number Forms 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
25553
13.8%
e 16193
 
8.8%
n 13026
 
7.0%
i 12451
 
6.7%
o 12413
 
6.7%
t 11864
 
6.4%
a 11150
 
6.0%
r 8637
 
4.7%
s 8113
 
4.4%
l 5599
 
3.0%
Other values (74) 59931
32.4%
Punctuation
ValueCountFrequency (%)
218
61.1%
56
 
15.7%
29
 
8.1%
27
 
7.6%
26
 
7.3%
1
 
0.3%
Hangul
ValueCountFrequency (%)
143
 
3.8%
99
 
2.6%
82
 
2.2%
73
 
1.9%
58
 
1.5%
55
 
1.5%
53
 
1.4%
49
 
1.3%
49
 
1.3%
47
 
1.2%
Other values (422) 3059
81.2%
Cyrillic
ValueCountFrequency (%)
е 47
 
9.5%
о 47
 
9.5%
и 45
 
9.1%
а 41
 
8.3%
н 34
 
6.9%
с 33
 
6.7%
р 27
 
5.5%
к 25
 
5.1%
т 22
 
4.5%
в 21
 
4.3%
Other values (28) 152
30.8%
Hiragana
ValueCountFrequency (%)
11
22.4%
10
20.4%
5
10.2%
4
 
8.2%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
Other values (6) 7
14.3%
CJK
ValueCountFrequency (%)
6
 
2.9%
6
 
2.9%
6
 
2.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
3
 
1.4%
3
 
1.4%
3
 
1.4%
3
 
1.4%
Other values (143) 168
80.0%
None
ValueCountFrequency (%)
5
10.9%
5
10.9%
¡ 4
 
8.7%
4
 
8.7%
4
 
8.7%
4
 
8.7%
¤ 3
 
6.5%
­ 2
 
4.3%
đ 2
 
4.3%
η 1
 
2.2%
Other values (12) 12
26.1%
Math Operators
ValueCountFrequency (%)
4
57.1%
3
42.9%
Katakana
ValueCountFrequency (%)
3
11.5%
3
11.5%
3
11.5%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (7) 7
26.9%
Number Forms
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Distinct9999
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T03:31:01.030391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length1024
Median length207
Mean length105.1707
Min length18

Characters and Unicode

Total characters1051707
Distinct characters734
Distinct categories20 ?
Distinct scripts8 ?
Distinct blocks13 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9998 ?
Unique (%)> 99.9%

Sample

1st rowTripartite Motif Containing 3 inhibits the aggressive behaviors of papillary thyroid carcinoma and indicates lower recurrence risk
2nd rowA Study on Tray Design Applying the Top Plate and Rim Shape of Korean Traditional Rectangular Soban
3rd rowEffect of Adolescents' Leisure Attitude on Exercise Adherence: Mediating Effect of Health Promotion Behavior
4th rowThe Inhibitory Effect of Corni Fructus against Oxidative Stress-induced Cellular Damage in C2C12 Murine Myoblasts
5th rowWonhyo’s View of Human Beings and his Redemption of Mankind
ValueCountFrequency (%)
of 10961
 
7.4%
the 7458
 
5.1%
and 5550
 
3.8%
on 4949
 
3.4%
in 3982
 
2.7%
a 3279
 
2.2%
for 2207
 
1.5%
study 2036
 
1.4%
with 1085
 
0.7%
to 984
 
0.7%
Other values (22654) 104786
71.1%
2023-12-13T03:31:01.732582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
137334
13.1%
e 91679
 
8.7%
n 73926
 
7.0%
i 72552
 
6.9%
o 71661
 
6.8%
t 68577
 
6.5%
a 65277
 
6.2%
r 50303
 
4.8%
s 46361
 
4.4%
l 33141
 
3.2%
Other values (724) 340896
32.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 798003
75.9%
Space Separator 137336
 
13.1%
Uppercase Letter 95392
 
9.1%
Dash Punctuation 6658
 
0.6%
Decimal Number 4943
 
0.5%
Other Punctuation 4769
 
0.5%
Final Punctuation 1164
 
0.1%
Close Punctuation 908
 
0.1%
Open Punctuation 906
 
0.1%
Other Letter 877
 
0.1%
Other values (10) 751
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
1.0%
9
 
1.0%
7
 
0.8%
7
 
0.8%
7
 
0.8%
7
 
0.8%
6
 
0.7%
6
 
0.7%
6
 
0.7%
5
 
0.6%
Other values (563) 808
92.1%
Lowercase Letter
ValueCountFrequency (%)
e 91679
11.5%
n 73926
 
9.3%
i 72552
 
9.1%
o 71661
 
9.0%
t 68577
 
8.6%
a 65277
 
8.2%
r 50303
 
6.3%
s 46361
 
5.8%
l 33141
 
4.2%
c 32566
 
4.1%
Other values (35) 191960
24.1%
Uppercase Letter
ValueCountFrequency (%)
S 10689
 
11.2%
C 9464
 
9.9%
A 9181
 
9.6%
P 7052
 
7.4%
E 5927
 
6.2%
T 5710
 
6.0%
M 5603
 
5.9%
D 4990
 
5.2%
I 4722
 
5.0%
R 4588
 
4.8%
Other values (21) 27466
28.8%
Other Punctuation
ValueCountFrequency (%)
: 2018
42.3%
, 1444
30.3%
' 438
 
9.2%
. 430
 
9.0%
/ 236
 
4.9%
& 59
 
1.2%
" 54
 
1.1%
¡ 33
 
0.7%
; 13
 
0.3%
13
 
0.3%
Other values (11) 31
 
0.7%
Math Symbol
ValueCountFrequency (%)
> 74
28.5%
< 73
28.1%
+ 54
20.8%
~ 21
 
8.1%
9
 
3.5%
= 9
 
3.5%
8
 
3.1%
4
 
1.5%
× 2
 
0.8%
2
 
0.8%
Other values (3) 4
 
1.5%
Decimal Number
ValueCountFrequency (%)
1 1142
23.1%
2 997
20.2%
0 755
15.3%
9 568
11.5%
3 435
 
8.8%
5 266
 
5.4%
4 235
 
4.8%
8 197
 
4.0%
6 184
 
3.7%
7 164
 
3.3%
Open Punctuation
ValueCountFrequency (%)
( 818
90.3%
50
 
5.5%
17
 
1.9%
[ 13
 
1.4%
{ 4
 
0.4%
3
 
0.3%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 818
90.1%
51
 
5.6%
18
 
2.0%
] 13
 
1.4%
} 4
 
0.4%
3
 
0.3%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
4
40.0%
2
20.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
Dash Punctuation
ValueCountFrequency (%)
- 6506
97.7%
80
 
1.2%
72
 
1.1%
Other Symbol
ValueCountFrequency (%)
3
60.0%
® 1
 
20.0%
1
 
20.0%
Space Separator
ValueCountFrequency (%)
137334
> 99.9%
  2
 
< 0.1%
Final Punctuation
ValueCountFrequency (%)
1027
88.2%
137
 
11.8%
Initial Punctuation
ValueCountFrequency (%)
242
62.4%
146
37.6%
Currency Symbol
ValueCountFrequency (%)
$ 24
58.5%
¤ 17
41.5%
Control
ValueCountFrequency (%)
 13
86.7%
2
 
13.3%
Modifier Symbol
ValueCountFrequency (%)
` 6
75.0%
^ 2
 
25.0%
Format
ValueCountFrequency (%)
­ 13
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%
Other Number
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 893331
84.9%
Common 157425
 
15.0%
Han 673
 
0.1%
Hangul 188
 
< 0.1%
Greek 70
 
< 0.1%
Hiragana 15
 
< 0.1%
Cyrillic 4
 
< 0.1%
Katakana 1
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
9
 
1.3%
9
 
1.3%
7
 
1.0%
7
 
1.0%
7
 
1.0%
7
 
1.0%
6
 
0.9%
6
 
0.9%
5
 
0.7%
5
 
0.7%
Other values (423) 605
89.9%
Hangul
ValueCountFrequency (%)
6
 
3.2%
5
 
2.7%
4
 
2.1%
4
 
2.1%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
Other values (116) 151
80.3%
Common
ValueCountFrequency (%)
137334
87.2%
- 6506
 
4.1%
: 2018
 
1.3%
, 1444
 
0.9%
1 1142
 
0.7%
1027
 
0.7%
2 997
 
0.6%
( 818
 
0.5%
) 818
 
0.5%
0 755
 
0.5%
Other values (69) 4566
 
2.9%
Latin
ValueCountFrequency (%)
e 91679
 
10.3%
n 73926
 
8.3%
i 72552
 
8.1%
o 71661
 
8.0%
t 68577
 
7.7%
a 65277
 
7.3%
r 50303
 
5.6%
s 46361
 
5.2%
l 33141
 
3.7%
c 32566
 
3.6%
Other values (53) 287288
32.2%
Greek
ValueCountFrequency (%)
α 19
27.1%
β 19
27.1%
κ 8
11.4%
γ 5
 
7.1%
φ 3
 
4.3%
δ 2
 
2.9%
Ω 2
 
2.9%
Ι 2
 
2.9%
ε 2
 
2.9%
η 2
 
2.9%
Other values (6) 6
 
8.6%
Hiragana
ValueCountFrequency (%)
2
13.3%
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (3) 3
20.0%
Cyrillic
ValueCountFrequency (%)
ы 2
50.0%
т 1
25.0%
б 1
25.0%
Katakana
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1048766
99.7%
Punctuation 1636
 
0.2%
CJK 667
 
0.1%
None 385
 
< 0.1%
Hangul 186
 
< 0.1%
Math Operators 25
 
< 0.1%
Hiragana 15
 
< 0.1%
Number Forms 10
 
< 0.1%
CJK Compat Ideographs 6
 
< 0.1%
Letterlike Symbols 4
 
< 0.1%
Other values (3) 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
137334
13.1%
e 91679
 
8.7%
n 73926
 
7.0%
i 72552
 
6.9%
o 71661
 
6.8%
t 68577
 
6.5%
a 65277
 
6.2%
r 50303
 
4.8%
s 46361
 
4.4%
l 33141
 
3.2%
Other values (86) 337955
32.2%
Punctuation
ValueCountFrequency (%)
1027
62.8%
242
 
14.8%
146
 
8.9%
137
 
8.4%
80
 
4.9%
2
 
0.1%
1
 
0.1%
1
 
0.1%
None
ValueCountFrequency (%)
72
18.7%
51
13.2%
50
13.0%
¡ 33
8.6%
α 19
 
4.9%
β 19
 
4.9%
18
 
4.7%
17
 
4.4%
¤ 17
 
4.4%
­ 13
 
3.4%
Other values (30) 76
19.7%
Math Operators
ValueCountFrequency (%)
9
36.0%
8
32.0%
4
16.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
CJK
ValueCountFrequency (%)
9
 
1.3%
9
 
1.3%
7
 
1.0%
7
 
1.0%
7
 
1.0%
7
 
1.0%
6
 
0.9%
6
 
0.9%
5
 
0.7%
5
 
0.7%
Other values (417) 599
89.8%
Hangul
ValueCountFrequency (%)
6
 
3.2%
5
 
2.7%
4
 
2.2%
4
 
2.2%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
3
 
1.6%
Other values (115) 149
80.1%
Number Forms
ValueCountFrequency (%)
4
40.0%
2
20.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
Letterlike Symbols
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Hiragana
ValueCountFrequency (%)
2
13.3%
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (3) 3
20.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Cyrillic
ValueCountFrequency (%)
ы 2
50.0%
т 1
25.0%
б 1
25.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Katakana
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct8420
Distinct (%)84.2%
Missing3
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T03:31:02.216012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length70
Median length3
Mean length5.5925778
Min length1

Characters and Unicode

Total characters55909
Distinct characters551
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7376 ?
Unique (%)73.8%

Sample

1st rowSong Yubao
2nd row한동엽
3rd row김보람
4th row김성옥
5th row남동신
ValueCountFrequency (%)
kim 185
 
1.4%
lee 124
 
0.9%
park 67
 
0.5%
zhang 58
 
0.4%
li 54
 
0.4%
wang 52
 
0.4%
choi 46
 
0.3%
kang 42
 
0.3%
yang 40
 
0.3%
liu 37
 
0.3%
Other values (9069) 12641
94.7%
2023-12-13T03:31:02.936587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3393
 
6.1%
a 3301
 
5.9%
n 3079
 
5.5%
i 2283
 
4.1%
e 2044
 
3.7%
o 1932
 
3.5%
u 1629
 
2.9%
1487
 
2.7%
g 1433
 
2.6%
h 1314
 
2.4%
Other values (541) 34014
60.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 22914
41.0%
Other Letter 22263
39.8%
Uppercase Letter 6615
 
11.8%
Space Separator 3393
 
6.1%
Dash Punctuation 371
 
0.7%
Other Punctuation 322
 
0.6%
Open Punctuation 11
 
< 0.1%
Close Punctuation 11
 
< 0.1%
Decimal Number 5
 
< 0.1%
Final Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1487
 
6.7%
1173
 
5.3%
874
 
3.9%
664
 
3.0%
538
 
2.4%
527
 
2.4%
506
 
2.3%
439
 
2.0%
402
 
1.8%
400
 
1.8%
Other values (474) 15253
68.5%
Lowercase Letter
ValueCountFrequency (%)
a 3301
14.4%
n 3079
13.4%
i 2283
10.0%
e 2044
8.9%
o 1932
8.4%
u 1629
 
7.1%
g 1433
 
6.3%
h 1314
 
5.7%
r 843
 
3.7%
m 745
 
3.3%
Other values (17) 4311
18.8%
Uppercase Letter
ValueCountFrequency (%)
S 607
 
9.2%
H 559
 
8.5%
K 519
 
7.8%
J 498
 
7.5%
M 428
 
6.5%
Y 415
 
6.3%
A 404
 
6.1%
L 392
 
5.9%
C 303
 
4.6%
P 227
 
3.4%
Other values (16) 2263
34.2%
Other Punctuation
ValueCountFrequency (%)
. 294
91.3%
, 21
 
6.5%
* 6
 
1.9%
¡ 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 2
40.0%
8 1
20.0%
0 1
20.0%
2 1
20.0%
Space Separator
ValueCountFrequency (%)
3393
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 371
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 29529
52.8%
Hangul 22157
39.6%
Common 4117
 
7.4%
Han 106
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1487
 
6.7%
1173
 
5.3%
874
 
3.9%
664
 
3.0%
538
 
2.4%
527
 
2.4%
506
 
2.3%
439
 
2.0%
402
 
1.8%
400
 
1.8%
Other values (382) 15147
68.4%
Han
ValueCountFrequency (%)
5
 
4.7%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (82) 83
78.3%
Latin
ValueCountFrequency (%)
a 3301
 
11.2%
n 3079
 
10.4%
i 2283
 
7.7%
e 2044
 
6.9%
o 1932
 
6.5%
u 1629
 
5.5%
g 1433
 
4.9%
h 1314
 
4.4%
r 843
 
2.9%
m 745
 
2.5%
Other values (43) 10926
37.0%
Common
ValueCountFrequency (%)
3393
82.4%
- 371
 
9.0%
. 294
 
7.1%
, 21
 
0.5%
( 11
 
0.3%
) 11
 
0.3%
* 6
 
0.1%
3
 
0.1%
1 2
 
< 0.1%
1
 
< 0.1%
Other values (4) 4
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 33630
60.2%
Hangul 22157
39.6%
CJK 106
 
0.2%
None 13
 
< 0.1%
Punctuation 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3393
 
10.1%
a 3301
 
9.8%
n 3079
 
9.2%
i 2283
 
6.8%
e 2044
 
6.1%
o 1932
 
5.7%
u 1629
 
4.8%
g 1433
 
4.3%
h 1314
 
3.9%
r 843
 
2.5%
Other values (54) 12379
36.8%
Hangul
ValueCountFrequency (%)
1487
 
6.7%
1173
 
5.3%
874
 
3.9%
664
 
3.0%
538
 
2.4%
527
 
2.4%
506
 
2.3%
439
 
2.0%
402
 
1.8%
400
 
1.8%
Other values (382) 15147
68.4%
None
ValueCountFrequency (%)
ı 12
92.3%
¡ 1
 
7.7%
CJK
ValueCountFrequency (%)
5
 
4.7%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (82) 83
78.3%
Punctuation
ValueCountFrequency (%)
3
100.0%

공동저자
Text

MISSING 

Distinct6386
Distinct (%)93.9%
Missing3198
Missing (%)32.0%
Memory size156.2 KiB
2023-12-13T03:31:03.438716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length1024
Median length712
Mean length25.077918
Min length1

Characters and Unicode

Total characters170580
Distinct characters471
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6072 ?
Unique (%)89.3%

Sample

1st rowGao Zefeng;Yan Zhifeng;Zheng Caihong
2nd row이정교
3rd row윤주석
4th row정지숙;박철;Lee Hyesook;최성현;김기영;김혜영;최영현;황은주
5th rowRyu Vin;Choi Jungwon;Oh Yunhye;Yoon Jin Woong;Han Hyeree;Hong Hyeon;Son Hye Jung;Lee Ji Hyun;Park Subin
ValueCountFrequency (%)
kim 197
 
1.0%
lee 125
 
0.6%
park 73
 
0.4%
m 68
 
0.3%
young 67
 
0.3%
jin 63
 
0.3%
hyun 61
 
0.3%
wang 61
 
0.3%
jung 59
 
0.3%
a 56
 
0.3%
Other values (14534) 18686
95.7%
2023-12-13T03:31:04.095923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
; 13530
 
7.9%
12743
 
7.5%
n 12149
 
7.1%
a 11582
 
6.8%
i 8628
 
5.1%
o 7983
 
4.7%
e 7522
 
4.4%
u 6143
 
3.6%
g 5682
 
3.3%
h 4770
 
2.8%
Other values (461) 79848
46.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 84753
49.7%
Other Letter 31794
 
18.6%
Uppercase Letter 24706
 
14.5%
Other Punctuation 14841
 
8.7%
Space Separator 12743
 
7.5%
Dash Punctuation 1704
 
1.0%
Open Punctuation 14
 
< 0.1%
Close Punctuation 14
 
< 0.1%
Final Punctuation 7
 
< 0.1%
Decimal Number 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2224
 
7.0%
1662
 
5.2%
1180
 
3.7%
896
 
2.8%
851
 
2.7%
820
 
2.6%
749
 
2.4%
580
 
1.8%
568
 
1.8%
532
 
1.7%
Other values (391) 21732
68.4%
Lowercase Letter
ValueCountFrequency (%)
n 12149
14.3%
a 11582
13.7%
i 8628
10.2%
o 7983
9.4%
e 7522
8.9%
u 6143
7.2%
g 5682
 
6.7%
h 4770
 
5.6%
r 3020
 
3.6%
m 2636
 
3.1%
Other values (19) 14638
17.3%
Uppercase Letter
ValueCountFrequency (%)
S 2472
 
10.0%
H 2133
 
8.6%
J 2061
 
8.3%
K 2027
 
8.2%
Y 1682
 
6.8%
M 1536
 
6.2%
L 1528
 
6.2%
C 1272
 
5.1%
A 1126
 
4.6%
W 904
 
3.7%
Other values (16) 7965
32.2%
Other Punctuation
ValueCountFrequency (%)
; 13530
91.2%
. 1164
 
7.8%
, 135
 
0.9%
* 8
 
0.1%
/ 2
 
< 0.1%
' 1
 
< 0.1%
¡ 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
6 1
33.3%
Space Separator
ValueCountFrequency (%)
12743
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1704
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Final Punctuation
ValueCountFrequency (%)
7
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 109459
64.2%
Hangul 31770
 
18.6%
Common 29327
 
17.2%
Han 24
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2224
 
7.0%
1662
 
5.2%
1180
 
3.7%
896
 
2.8%
851
 
2.7%
820
 
2.6%
749
 
2.4%
580
 
1.8%
568
 
1.8%
532
 
1.7%
Other values (370) 21708
68.3%
Latin
ValueCountFrequency (%)
n 12149
 
11.1%
a 11582
 
10.6%
i 8628
 
7.9%
o 7983
 
7.3%
e 7522
 
6.9%
u 6143
 
5.6%
g 5682
 
5.2%
h 4770
 
4.4%
r 3020
 
2.8%
m 2636
 
2.4%
Other values (45) 39344
35.9%
Han
ValueCountFrequency (%)
3
 
12.5%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (11) 11
45.8%
Common
ValueCountFrequency (%)
; 13530
46.1%
12743
43.5%
- 1704
 
5.8%
. 1164
 
4.0%
, 135
 
0.5%
( 14
 
< 0.1%
) 14
 
< 0.1%
* 8
 
< 0.1%
7
 
< 0.1%
1 2
 
< 0.1%
Other values (5) 6
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 138751
81.3%
Hangul 31770
 
18.6%
None 28
 
< 0.1%
CJK 24
 
< 0.1%
Punctuation 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
; 13530
 
9.8%
12743
 
9.2%
n 12149
 
8.8%
a 11582
 
8.3%
i 8628
 
6.2%
o 7983
 
5.8%
e 7522
 
5.4%
u 6143
 
4.4%
g 5682
 
4.1%
h 4770
 
3.4%
Other values (55) 48019
34.6%
Hangul
ValueCountFrequency (%)
2224
 
7.0%
1662
 
5.2%
1180
 
3.7%
896
 
2.8%
851
 
2.7%
820
 
2.6%
749
 
2.4%
580
 
1.8%
568
 
1.8%
532
 
1.7%
Other values (370) 21708
68.3%
None
ValueCountFrequency (%)
ı 25
89.3%
ø 1
 
3.6%
ł 1
 
3.6%
¡ 1
 
3.6%
Punctuation
ValueCountFrequency (%)
7
100.0%
CJK
ValueCountFrequency (%)
3
 
12.5%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (11) 11
45.8%
Distinct357
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T03:31:04.352896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length50
Mean length16.1983
Min length3

Characters and Unicode

Total characters161983
Distinct characters341
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowGenes & Genomics
2nd row한국가구학회지
3rd row한국체육교육학회지
4th rowBiotechnology and Bioprocess Engineering
5th rowThe Review of Korean Studies
ValueCountFrequency (%)
journal 1513
 
6.7%
and 1318
 
5.8%
of 1259
 
5.6%
engineering 485
 
2.1%
481
 
2.1%
international 436
 
1.9%
systems 365
 
1.6%
an 357
 
1.6%
research 351
 
1.6%
science 340
 
1.5%
Other values (449) 15672
69.4%
2023-12-13T03:31:04.832847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12604
 
7.8%
n 10011
 
6.2%
o 8883
 
5.5%
e 8383
 
5.2%
a 7442
 
4.6%
r 6518
 
4.0%
i 5847
 
3.6%
t 5813
 
3.6%
l 5577
 
3.4%
4811
 
3.0%
Other values (331) 86094
53.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 82184
50.7%
Other Letter 51440
31.8%
Uppercase Letter 13464
 
8.3%
Space Separator 12604
 
7.8%
Other Punctuation 1977
 
1.2%
Open Punctuation 153
 
0.1%
Close Punctuation 153
 
0.1%
Decimal Number 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4811
 
9.4%
3023
 
5.9%
2878
 
5.6%
2580
 
5.0%
2566
 
5.0%
2476
 
4.8%
2177
 
4.2%
1458
 
2.8%
1094
 
2.1%
786
 
1.5%
Other values (274) 27591
53.6%
Lowercase Letter
ValueCountFrequency (%)
n 10011
12.2%
o 8883
10.8%
e 8383
10.2%
a 7442
9.1%
r 6518
7.9%
i 5847
 
7.1%
t 5813
 
7.1%
l 5577
 
6.8%
c 4411
 
5.4%
s 3340
 
4.1%
Other values (13) 15959
19.4%
Uppercase Letter
ValueCountFrequency (%)
S 1835
13.6%
J 1595
11.8%
A 1292
9.6%
I 1217
9.0%
C 1121
 
8.3%
E 953
 
7.1%
T 740
 
5.5%
P 718
 
5.3%
M 590
 
4.4%
R 555
 
4.1%
Other values (12) 2848
21.2%
Other Punctuation
ValueCountFrequency (%)
, 845
42.7%
& 481
24.3%
. 286
 
14.5%
144
 
7.3%
· 92
 
4.7%
: 92
 
4.7%
' 37
 
1.9%
Decimal Number
ValueCountFrequency (%)
2 4
50.0%
1 4
50.0%
Space Separator
ValueCountFrequency (%)
12604
100.0%
Open Punctuation
ValueCountFrequency (%)
( 153
100.0%
Close Punctuation
ValueCountFrequency (%)
) 153
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 95648
59.0%
Hangul 51046
31.5%
Common 14895
 
9.2%
Han 394
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4811
 
9.4%
3023
 
5.9%
2878
 
5.6%
2580
 
5.1%
2566
 
5.0%
2476
 
4.9%
2177
 
4.3%
1458
 
2.9%
1094
 
2.1%
786
 
1.5%
Other values (253) 27197
53.3%
Latin
ValueCountFrequency (%)
n 10011
 
10.5%
o 8883
 
9.3%
e 8383
 
8.8%
a 7442
 
7.8%
r 6518
 
6.8%
i 5847
 
6.1%
t 5813
 
6.1%
l 5577
 
5.8%
c 4411
 
4.6%
s 3340
 
3.5%
Other values (35) 29423
30.8%
Han
ValueCountFrequency (%)
62
15.7%
30
 
7.6%
30
 
7.6%
28
 
7.1%
28
 
7.1%
21
 
5.3%
20
 
5.1%
20
 
5.1%
20
 
5.1%
20
 
5.1%
Other values (11) 115
29.2%
Common
ValueCountFrequency (%)
12604
84.6%
, 845
 
5.7%
& 481
 
3.2%
. 286
 
1.9%
( 153
 
1.0%
) 153
 
1.0%
144
 
1.0%
· 92
 
0.6%
: 92
 
0.6%
' 37
 
0.2%
Other values (2) 8
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 110307
68.1%
Hangul 51046
31.5%
CJK 382
 
0.2%
None 236
 
0.1%
CJK Compat Ideographs 12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12604
 
11.4%
n 10011
 
9.1%
o 8883
 
8.1%
e 8383
 
7.6%
a 7442
 
6.7%
r 6518
 
5.9%
i 5847
 
5.3%
t 5813
 
5.3%
l 5577
 
5.1%
c 4411
 
4.0%
Other values (45) 34818
31.6%
Hangul
ValueCountFrequency (%)
4811
 
9.4%
3023
 
5.9%
2878
 
5.6%
2580
 
5.1%
2566
 
5.0%
2476
 
4.9%
2177
 
4.3%
1458
 
2.9%
1094
 
2.1%
786
 
1.5%
Other values (253) 27197
53.3%
None
ValueCountFrequency (%)
144
61.0%
· 92
39.0%
CJK
ValueCountFrequency (%)
62
16.2%
30
 
7.9%
30
 
7.9%
28
 
7.3%
28
 
7.3%
21
 
5.5%
20
 
5.2%
20
 
5.2%
20
 
5.2%
20
 
5.2%
Other values (10) 103
27.0%
CJK Compat Ideographs
ValueCountFrequency (%)
12
100.0%
Distinct331
Distinct (%)3.7%
Missing1163
Missing (%)11.6%
Memory size156.2 KiB
2023-12-13T03:31:05.165973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length91
Median length57
Mean length38.315831
Min length3

Characters and Unicode

Total characters338597
Distinct characters62
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowGenes and Genomics
2nd rowJournal of the Korea furniture Society
3rd rowKorean Society For The Study Of Physical Education
4th rowThe Review of Korean Studies
5th rowLaboratory Medicine Online
ValueCountFrequency (%)
of 6980
 
14.5%
journal 5930
 
12.3%
korean 3025
 
6.3%
and 2657
 
5.5%
the 2295
 
4.8%
society 1147
 
2.4%
773
 
1.6%
studies 757
 
1.6%
research 703
 
1.5%
science 613
 
1.3%
Other values (437) 23162
48.2%
2023-12-13T03:31:05.736126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39373
 
11.6%
o 30584
 
9.0%
n 26165
 
7.7%
e 25345
 
7.5%
a 24291
 
7.2%
r 20084
 
5.9%
i 17458
 
5.2%
t 15781
 
4.7%
l 13475
 
4.0%
u 11960
 
3.5%
Other values (52) 114081
33.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 244764
72.3%
Uppercase Letter 51681
 
15.3%
Space Separator 39373
 
11.6%
Other Punctuation 2181
 
0.6%
Dash Punctuation 356
 
0.1%
Open Punctuation 90
 
< 0.1%
Close Punctuation 90
 
< 0.1%
Decimal Number 62
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 30584
12.5%
n 26165
10.7%
e 25345
10.4%
a 24291
9.9%
r 20084
8.2%
i 17458
 
7.1%
t 15781
 
6.4%
l 13475
 
5.5%
u 11960
 
4.9%
c 10848
 
4.4%
Other values (15) 48773
19.9%
Uppercase Letter
ValueCountFrequency (%)
J 6170
11.9%
S 5679
11.0%
K 4144
 
8.0%
T 3737
 
7.2%
A 3712
 
7.2%
E 3362
 
6.5%
C 3326
 
6.4%
I 2956
 
5.7%
R 2647
 
5.1%
O 2334
 
4.5%
Other values (15) 13614
26.3%
Other Punctuation
ValueCountFrequency (%)
, 1117
51.2%
& 721
33.1%
: 162
 
7.4%
144
 
6.6%
' 37
 
1.7%
Decimal Number
ValueCountFrequency (%)
0 30
48.4%
1 28
45.2%
2 4
 
6.5%
Space Separator
ValueCountFrequency (%)
39373
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 356
100.0%
Open Punctuation
ValueCountFrequency (%)
( 90
100.0%
Close Punctuation
ValueCountFrequency (%)
) 90
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 296445
87.6%
Common 42152
 
12.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 30584
 
10.3%
n 26165
 
8.8%
e 25345
 
8.5%
a 24291
 
8.2%
r 20084
 
6.8%
i 17458
 
5.9%
t 15781
 
5.3%
l 13475
 
4.5%
u 11960
 
4.0%
c 10848
 
3.7%
Other values (40) 100454
33.9%
Common
ValueCountFrequency (%)
39373
93.4%
, 1117
 
2.6%
& 721
 
1.7%
- 356
 
0.8%
: 162
 
0.4%
144
 
0.3%
( 90
 
0.2%
) 90
 
0.2%
' 37
 
0.1%
0 30
 
0.1%
Other values (2) 32
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 338453
> 99.9%
None 144
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39373
 
11.6%
o 30584
 
9.0%
n 26165
 
7.7%
e 25345
 
7.5%
a 24291
 
7.2%
r 20084
 
5.9%
i 17458
 
5.2%
t 15781
 
4.7%
l 13475
 
4.0%
u 11960
 
3.5%
Other values (51) 113937
33.7%
None
ValueCountFrequency (%)
144
100.0%
Distinct352
Distinct (%)3.6%
Missing94
Missing (%)0.9%
Memory size156.2 KiB
2023-12-13T03:31:06.136363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.9490208
Min length6

Characters and Unicode

Total characters78743
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row19769571
2nd row12263109
3rd row12299685
4th row12268372
5th row12290076
ValueCountFrequency (%)
15986446 212
 
2.1%
27136434 210
 
2.1%
19750102 202
 
2.0%
3744884 187
 
1.9%
12290424 163
 
1.6%
19758359 152
 
1.5%
26358875 145
 
1.5%
12254568 144
 
1.5%
15988635 138
 
1.4%
15671739 137
 
1.4%
Other values (342) 8216
82.9%
2023-12-13T03:31:06.702574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 15090
19.2%
1 11685
14.8%
6 7420
9.4%
5 7413
9.4%
9 7327
9.3%
8 7174
9.1%
7 6402
8.1%
3 5907
 
7.5%
4 5418
 
6.9%
0 4354
 
5.5%
Other values (2) 553
 
0.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 78190
99.3%
Uppercase Letter 528
 
0.7%
Lowercase Letter 25
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 15090
19.3%
1 11685
14.9%
6 7420
9.5%
5 7413
9.5%
9 7327
9.4%
8 7174
9.2%
7 6402
8.2%
3 5907
 
7.6%
4 5418
 
6.9%
0 4354
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
X 528
100.0%
Lowercase Letter
ValueCountFrequency (%)
x 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 78190
99.3%
Latin 553
 
0.7%

Most frequent character per script

Common
ValueCountFrequency (%)
2 15090
19.3%
1 11685
14.9%
6 7420
9.5%
5 7413
9.5%
9 7327
9.4%
8 7174
9.2%
7 6402
8.2%
3 5907
 
7.6%
4 5418
 
6.9%
0 4354
 
5.6%
Latin
ValueCountFrequency (%)
X 528
95.5%
x 25
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 78743
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 15090
19.2%
1 11685
14.8%
6 7420
9.4%
5 7413
9.4%
9 7327
9.3%
8 7174
9.1%
7 6402
8.1%
3 5907
 
7.5%
4 5418
 
6.9%
0 4354
 
5.5%
Other values (2) 553
 
0.7%
Distinct320
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T03:31:07.034116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length7.749
Min length4

Characters and Unicode

Total characters77490
Distinct characters275
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한국유전학회
2nd row한국가구학회
3rd row한국체육교육학회
4th row한국생물공학회
5th row한국학중앙연구원
ValueCountFrequency (%)
한국물리학회 399
 
3.9%
대한전기학회 379
 
3.7%
국제구조공학회 357
 
3.5%
제어·로봇·시스템학회 312
 
3.1%
한국디지털정책학회 210
 
2.1%
한국식품과학회 173
 
1.7%
사)한국관광레저학회 163
 
1.6%
한국미생물·생명공학회 150
 
1.5%
대한산업경영학회 145
 
1.4%
한국정보과학회 141
 
1.4%
Other values (314) 7681
76.0%
2023-12-13T03:31:07.466052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10152
 
13.1%
9956
 
12.8%
8510
 
11.0%
6959
 
9.0%
2252
 
2.9%
1178
 
1.5%
1059
 
1.4%
1028
 
1.3%
1014
 
1.3%
951
 
1.2%
Other values (265) 34431
44.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 75828
97.9%
Other Punctuation 791
 
1.0%
Open Punctuation 318
 
0.4%
Close Punctuation 318
 
0.4%
Space Separator 115
 
0.1%
Uppercase Letter 112
 
0.1%
Decimal Number 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10152
 
13.4%
9956
 
13.1%
8510
 
11.2%
6959
 
9.2%
2252
 
3.0%
1178
 
1.6%
1059
 
1.4%
1028
 
1.4%
1014
 
1.3%
951
 
1.3%
Other values (250) 32769
43.2%
Uppercase Letter
ValueCountFrequency (%)
B 20
17.9%
M 20
17.9%
I 20
17.9%
R 11
9.8%
P 11
9.8%
N 10
8.9%
G 10
8.9%
O 10
8.9%
Other Punctuation
ValueCountFrequency (%)
· 774
97.9%
. 17
 
2.1%
Decimal Number
ValueCountFrequency (%)
2 4
50.0%
1 4
50.0%
Open Punctuation
ValueCountFrequency (%)
( 318
100.0%
Close Punctuation
ValueCountFrequency (%)
) 318
100.0%
Space Separator
ValueCountFrequency (%)
115
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 75828
97.9%
Common 1550
 
2.0%
Latin 112
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10152
 
13.4%
9956
 
13.1%
8510
 
11.2%
6959
 
9.2%
2252
 
3.0%
1178
 
1.6%
1059
 
1.4%
1028
 
1.4%
1014
 
1.3%
951
 
1.3%
Other values (250) 32769
43.2%
Latin
ValueCountFrequency (%)
B 20
17.9%
M 20
17.9%
I 20
17.9%
R 11
9.8%
P 11
9.8%
N 10
8.9%
G 10
8.9%
O 10
8.9%
Common
ValueCountFrequency (%)
· 774
49.9%
( 318
20.5%
) 318
20.5%
115
 
7.4%
. 17
 
1.1%
2 4
 
0.3%
1 4
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 75802
97.8%
ASCII 888
 
1.1%
None 774
 
1.0%
Compat Jamo 26
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10152
 
13.4%
9956
 
13.1%
8510
 
11.2%
6959
 
9.2%
2252
 
3.0%
1178
 
1.6%
1059
 
1.4%
1028
 
1.4%
1014
 
1.3%
951
 
1.3%
Other values (249) 32743
43.2%
None
ValueCountFrequency (%)
· 774
100.0%
ASCII
ValueCountFrequency (%)
( 318
35.8%
) 318
35.8%
115
 
13.0%
B 20
 
2.3%
M 20
 
2.3%
I 20
 
2.3%
. 17
 
1.9%
R 11
 
1.2%
P 11
 
1.2%
N 10
 
1.1%
Other values (4) 28
 
3.2%
Compat Jamo
ValueCountFrequency (%)
26
100.0%
Distinct283
Distinct (%)3.3%
Missing1349
Missing (%)13.5%
Memory size156.2 KiB
2023-12-13T03:31:07.784751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length98
Median length62
Mean length41.82372
Min length12

Characters and Unicode

Total characters361817
Distinct characters64
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowThe Genetics Society Of Korea
2nd rowKorea Furniture Society
3rd rowKorean Society For The Study Of Physical Education
4th rowThe Korean Society For Biotechnology And Bioengineering
5th rowThe Academy of Korean Studies
ValueCountFrequency (%)
korean 5341
 
10.6%
society 5126
 
10.2%
of 5015
 
9.9%
the 4578
 
9.1%
association 2309
 
4.6%
and 2269
 
4.5%
for 1627
 
3.2%
korea 1458
 
2.9%
science 613
 
1.2%
596
 
1.2%
Other values (396) 21543
42.7%
2023-12-13T03:31:08.237102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42323
 
11.7%
e 34302
 
9.5%
o 33182
 
9.2%
i 26151
 
7.2%
n 24326
 
6.7%
a 23489
 
6.5%
t 20655
 
5.7%
r 18541
 
5.1%
c 17727
 
4.9%
s 14162
 
3.9%
Other values (54) 106959
29.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 270521
74.8%
Uppercase Letter 47212
 
13.0%
Space Separator 42323
 
11.7%
Other Punctuation 1159
 
0.3%
Dash Punctuation 272
 
0.1%
Open Punctuation 142
 
< 0.1%
Close Punctuation 142
 
< 0.1%
Other Letter 28
 
< 0.1%
Decimal Number 18
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 34302
12.7%
o 33182
12.3%
i 26151
9.7%
n 24326
9.0%
a 23489
8.7%
t 20655
7.6%
r 18541
 
6.9%
c 17727
 
6.6%
s 14162
 
5.2%
y 9867
 
3.6%
Other values (16) 48119
17.8%
Uppercase Letter
ValueCountFrequency (%)
S 8175
17.3%
K 7073
15.0%
T 5556
11.8%
A 5376
11.4%
O 2930
 
6.2%
C 2156
 
4.6%
F 2043
 
4.3%
M 1834
 
3.9%
I 1814
 
3.8%
E 1694
 
3.6%
Other values (15) 8561
18.1%
Other Punctuation
ValueCountFrequency (%)
& 590
50.9%
, 520
44.9%
. 37
 
3.2%
/ 12
 
1.0%
Decimal Number
ValueCountFrequency (%)
0 10
55.6%
2 4
 
22.2%
1 4
 
22.2%
Other Letter
ValueCountFrequency (%)
14
50.0%
14
50.0%
Space Separator
ValueCountFrequency (%)
42323
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 272
100.0%
Open Punctuation
ValueCountFrequency (%)
( 142
100.0%
Close Punctuation
ValueCountFrequency (%)
) 142
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 317733
87.8%
Common 44056
 
12.2%
Hangul 28
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 34302
 
10.8%
o 33182
 
10.4%
i 26151
 
8.2%
n 24326
 
7.7%
a 23489
 
7.4%
t 20655
 
6.5%
r 18541
 
5.8%
c 17727
 
5.6%
s 14162
 
4.5%
y 9867
 
3.1%
Other values (41) 95331
30.0%
Common
ValueCountFrequency (%)
42323
96.1%
& 590
 
1.3%
, 520
 
1.2%
- 272
 
0.6%
( 142
 
0.3%
) 142
 
0.3%
. 37
 
0.1%
/ 12
 
< 0.1%
0 10
 
< 0.1%
2 4
 
< 0.1%
Hangul
ValueCountFrequency (%)
14
50.0%
14
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 361789
> 99.9%
Hangul 28
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
42323
 
11.7%
e 34302
 
9.5%
o 33182
 
9.2%
i 26151
 
7.2%
n 24326
 
6.7%
a 23489
 
6.5%
t 20655
 
5.7%
r 18541
 
5.1%
c 17727
 
4.9%
s 14162
 
3.9%
Other values (52) 106931
29.6%
Hangul
ValueCountFrequency (%)
14
50.0%
14
50.0%

등재구분
Categorical

IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
KCI등재
9248 
<NA>
 
357
KCI우수등재
 
288
KCI등재후보
 
107

Length

Max length7
Median length5
Mean length5.0433
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowKCI등재
2nd rowKCI등재
3rd rowKCI등재
4th rowKCI등재
5th rowKCI등재

Common Values

ValueCountFrequency (%)
KCI등재 9248
92.5%
<NA> 357
 
3.6%
KCI우수등재 288
 
2.9%
KCI등재후보 107
 
1.1%

Length

2023-12-13T03:31:08.650556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:31:08.748221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kci등재 9248
92.5%
na 357
 
3.6%
kci우수등재 288
 
2.9%
kci등재후보 107
 
1.1%
Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
6750 
SCIE:SCOPUS
1860 
SCOPUS
932 
SCI:SCIE
 
187
SCI:SCIE:SCOPUS
 
184
Other values (2)
 
87

Length

Max length15
Median length4
Mean length5.8145
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSCIE:SCOPUS
2nd row<NA>
3rd row<NA>
4th rowSCIE:SCOPUS
5th rowSCOPUS

Common Values

ValueCountFrequency (%)
<NA> 6750
67.5%
SCIE:SCOPUS 1860
 
18.6%
SCOPUS 932
 
9.3%
SCI:SCIE 187
 
1.9%
SCI:SCIE:SCOPUS 184
 
1.8%
SCIE:SSCI 69
 
0.7%
A&HCI:SCOPUS 18
 
0.2%

Length

2023-12-13T03:31:08.875145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:31:09.013569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 6750
67.5%
scie:scopus 1860
 
18.6%
scopus 932
 
9.3%
sci:scie 187
 
1.9%
sci:scie:scopus 184
 
1.8%
scie:ssci 69
 
0.7%
a&hci:scopus 18
 
0.2%

발행년
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 10000
100.0%

Length

2023-12-13T03:31:09.137680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:31:09.235856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 10000
100.0%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-02-22
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-02-22
2nd row2023-02-22
3rd row2023-02-22
4th row2023-02-22
5th row2023-02-22

Common Values

ValueCountFrequency (%)
2023-02-22 10000
100.0%

Length

2023-12-13T03:31:09.324790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:31:09.414200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-02-22 10000
100.0%


Real number (ℝ)

MISSING 

Distinct101
Distinct (%)1.1%
Missing1164
Missing (%)11.6%
Infinite0
Infinite (%)0.0%
Mean44.027614
Minimum1
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T03:31:09.509477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile17
Q124
median33
Q344
95-th percentile82
Maximum2022
Range2021
Interquartile range (IQR)20

Descriptive statistics

Standard deviation105.65424
Coefficient of variation (CV)2.3997267
Kurtosis330.92446
Mean44.027614
Median Absolute Deviation (MAD)10
Skewness17.838978
Sum389028
Variance11162.819
MonotonicityNot monotonic
2023-12-13T03:31:09.640688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20 692
 
6.9%
27 373
 
3.7%
34 367
 
3.7%
32 359
 
3.6%
28 349
 
3.5%
23 289
 
2.9%
24 283
 
2.8%
17 276
 
2.8%
33 274
 
2.7%
42 267
 
2.7%
Other values (91) 5307
53.1%
(Missing) 1164
 
11.6%
ValueCountFrequency (%)
1 8
 
0.1%
5 3
 
< 0.1%
8 22
 
0.2%
10 65
0.7%
11 94
0.9%
12 112
1.1%
13 19
 
0.2%
14 48
0.5%
15 28
 
0.3%
16 27
 
0.3%
ValueCountFrequency (%)
2022 24
0.2%
164 7
 
0.1%
163 6
 
0.1%
162 6
 
0.1%
161 8
 
0.1%
149 4
 
< 0.1%
148 5
 
0.1%
147 7
 
0.1%
146 7
 
0.1%
116 4
 
< 0.1%


Real number (ℝ)

MISSING 

Distinct117
Distinct (%)1.2%
Missing498
Missing (%)5.0%
Infinite0
Infinite (%)0.0%
Mean14.384972
Minimum1
Maximum256
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T03:31:09.777021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q36
95-th percentile90
Maximum256
Range255
Interquartile range (IQR)4

Descriptive statistics

Standard deviation31.81686
Coefficient of variation (CV)2.2118125
Kurtosis16.072989
Mean14.384972
Median Absolute Deviation (MAD)2
Skewness3.6213756
Sum136686
Variance1012.3126
MonotonicityNot monotonic
2023-12-13T03:31:09.898067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1606
16.1%
2 1537
15.4%
3 1437
14.4%
4 1372
13.7%
5 690
6.9%
6 622
 
6.2%
10 191
 
1.9%
11 190
 
1.9%
12 190
 
1.9%
9 164
 
1.6%
Other values (107) 1503
15.0%
(Missing) 498
 
5.0%
ValueCountFrequency (%)
1 1606
16.1%
2 1537
15.4%
3 1437
14.4%
4 1372
13.7%
5 690
6.9%
6 622
 
6.2%
7 154
 
1.5%
8 163
 
1.6%
9 164
 
1.6%
10 191
 
1.9%
ValueCountFrequency (%)
256 10
0.1%
255 12
0.1%
254 2
 
< 0.1%
253 10
0.1%
199 5
0.1%
198 6
0.1%
197 6
0.1%
196 3
 
< 0.1%
139 5
0.1%
138 8
0.1%

시작페이지
Real number (ℝ)

Distinct1446
Distinct (%)14.5%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean337.58956
Minimum1
Maximum4100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T03:31:10.015419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q166.5
median169
Q3389
95-th percentile1230.1
Maximum4100
Range4099
Interquartile range (IQR)322.5

Descriptive statistics

Standard deviation510.86794
Coefficient of variation (CV)1.5132812
Kurtosis16.323592
Mean337.58956
Median Absolute Deviation (MAD)128
Skewness3.6143009
Sum3375558
Variance260986.05
MonotonicityNot monotonic
2023-12-13T03:31:10.128979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 569
 
5.7%
5 108
 
1.1%
41 71
 
0.7%
63 71
 
0.7%
51 68
 
0.7%
67 65
 
0.7%
27 62
 
0.6%
43 62
 
0.6%
31 62
 
0.6%
39 60
 
0.6%
Other values (1436) 8801
88.0%
ValueCountFrequency (%)
1 569
5.7%
2 3
 
< 0.1%
3 30
 
0.3%
4 3
 
< 0.1%
5 108
 
1.1%
6 4
 
< 0.1%
7 40
 
0.4%
8 6
 
0.1%
9 42
 
0.4%
10 2
 
< 0.1%
ValueCountFrequency (%)
4100 1
< 0.1%
4076 1
< 0.1%
4026 1
< 0.1%
4005 1
< 0.1%
3996 1
< 0.1%
3951 1
< 0.1%
3940 1
< 0.1%
3932 1
< 0.1%
3923 1
< 0.1%
3915 1
< 0.1%

끝페이지
Real number (ℝ)

Distinct1513
Distinct (%)15.1%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean353.72567
Minimum2
Maximum4110
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T03:31:10.252656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile22
Q183
median187
Q3404
95-th percentile1238.1
Maximum4110
Range4108
Interquartile range (IQR)321

Descriptive statistics

Standard deviation508.93532
Coefficient of variation (CV)1.4387854
Kurtosis16.454252
Mean353.72567
Median Absolute Deviation (MAD)129
Skewness3.6291777
Sum3536903
Variance259015.16
MonotonicityNot monotonic
2023-12-13T03:31:10.423456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
32 46
 
0.5%
50 45
 
0.4%
8 45
 
0.4%
68 45
 
0.4%
56 43
 
0.4%
26 43
 
0.4%
112 42
 
0.4%
66 42
 
0.4%
42 41
 
0.4%
40 41
 
0.4%
Other values (1503) 9566
95.7%
ValueCountFrequency (%)
2 2
 
< 0.1%
3 2
 
< 0.1%
5 3
 
< 0.1%
6 10
 
0.1%
7 18
 
0.2%
8 45
0.4%
9 34
0.3%
10 31
0.3%
11 37
0.4%
12 38
0.4%
ValueCountFrequency (%)
4110 1
< 0.1%
4089 1
< 0.1%
4036 1
< 0.1%
4014 1
< 0.1%
4004 1
< 0.1%
3959 1
< 0.1%
3950 1
< 0.1%
3939 1
< 0.1%
3931 1
< 0.1%
3922 1
< 0.1%

키워드(국문)
Text

MISSING 

Distinct9540
Distinct (%)97.6%
Missing227
Missing (%)2.3%
Memory size156.2 KiB
2023-12-13T03:31:10.740213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length1024
Median length208
Mean length60.377571
Min length1

Characters and Unicode

Total characters590070
Distinct characters2262
Distinct categories20 ?
Distinct scripts8 ?
Distinct blocks13 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9531 ?
Unique (%)97.5%

Sample

1st rowPapillary thyroid carcinoma ¡¤ Tripartite Motif Containing 3 ¡¤ Proliferation ¡¤ Prognosis
2nd row여가태도, 운동지속수행, 건강증진행위
3rd rowCorni Fructus, DNA damage, apoptosis, reactive oxygen species
4th rowBodhisattva 菩薩, conduct of no-obstruction 無碍行, Gisinnon byeolgi 起信論別記, Gisinnon so/Commentary on the Awakening of Faith 大乘起信論疏, gha-pati 居士, Hwajaeng 和諍, Ilsim/One Mind 一心, Geumgang sammaegyong non 金剛三昧經論, Wonhyo/Weonhyo 元曉
5th rowDisruptive behavior disorders, Adolescents, Children, Screening tool, Psychometric properties.
ValueCountFrequency (%)
¡¤ 1993
 
2.4%
of 319
 
0.4%
· 317
 
0.4%
analysis 271
 
0.3%
control 262
 
0.3%
system 240
 
0.3%
and 214
 
0.3%
learning 192
 
0.2%
model 189
 
0.2%
분석 176
 
0.2%
Other values (33874) 80015
95.0%
2023-12-13T03:31:11.268940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
74876
 
12.7%
, 35254
 
6.0%
e 35017
 
5.9%
i 29856
 
5.1%
a 26608
 
4.5%
t 26277
 
4.5%
n 24615
 
4.2%
o 24012
 
4.1%
r 22449
 
3.8%
s 18219
 
3.1%
Other values (2252) 272887
46.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 310381
52.6%
Other Letter 125859
21.3%
Space Separator 74876
 
12.7%
Other Punctuation 39199
 
6.6%
Uppercase Letter 27902
 
4.7%
Dash Punctuation 2983
 
0.5%
Decimal Number 2547
 
0.4%
Currency Symbol 1993
 
0.3%
Close Punctuation 1945
 
0.3%
Open Punctuation 1942
 
0.3%
Other values (10) 443
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2120
 
1.7%
2108
 
1.7%
2048
 
1.6%
1951
 
1.6%
1951
 
1.6%
1832
 
1.5%
1830
 
1.5%
1751
 
1.4%
1673
 
1.3%
1661
 
1.3%
Other values (2089) 106934
85.0%
Lowercase Letter
ValueCountFrequency (%)
e 35017
11.3%
i 29856
9.6%
a 26608
 
8.6%
t 26277
 
8.5%
n 24615
 
7.9%
o 24012
 
7.7%
r 22449
 
7.2%
s 18219
 
5.9%
l 16302
 
5.3%
c 15394
 
5.0%
Other values (54) 71632
23.1%
Uppercase Letter
ValueCountFrequency (%)
S 2713
 
9.7%
C 2667
 
9.6%
A 2120
 
7.6%
P 2088
 
7.5%
M 1813
 
6.5%
D 1645
 
5.9%
T 1519
 
5.4%
I 1432
 
5.1%
R 1429
 
5.1%
E 1297
 
4.6%
Other values (20) 9179
32.9%
Other Punctuation
ValueCountFrequency (%)
, 35254
89.9%
¡ 2005
 
5.1%
. 736
 
1.9%
· 425
 
1.1%
/ 403
 
1.0%
: 123
 
0.3%
83
 
0.2%
69
 
0.2%
& 31
 
0.1%
' 26
 
0.1%
Other values (10) 44
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 670
26.3%
2 480
18.8%
9 327
12.8%
3 287
11.3%
0 238
 
9.3%
5 156
 
6.1%
4 145
 
5.7%
6 105
 
4.1%
8 78
 
3.1%
7 61
 
2.4%
Math Symbol
ValueCountFrequency (%)
< 80
40.2%
> 80
40.2%
+ 20
 
10.1%
~ 6
 
3.0%
5
 
2.5%
5
 
2.5%
2
 
1.0%
1
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 1756
90.4%
121
 
6.2%
45
 
2.3%
[ 12
 
0.6%
7
 
0.4%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1756
90.3%
124
 
6.4%
45
 
2.3%
] 12
 
0.6%
7
 
0.4%
1
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 2980
99.9%
2
 
0.1%
1
 
< 0.1%
Final Punctuation
ValueCountFrequency (%)
114
84.4%
21
 
15.6%
Initial Punctuation
ValueCountFrequency (%)
59
73.8%
21
 
26.2%
Control
ValueCountFrequency (%)
 7
87.5%
1
 
12.5%
Other Symbol
ValueCountFrequency (%)
4
80.0%
° 1
 
20.0%
Modifier Symbol
ValueCountFrequency (%)
` 4
80.0%
¨ 1
 
20.0%
Space Separator
ValueCountFrequency (%)
74876
100.0%
Currency Symbol
ValueCountFrequency (%)
¤ 1993
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%
Format
ValueCountFrequency (%)
­ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 337974
57.3%
Common 125927
 
21.3%
Hangul 122778
 
20.8%
Han 2891
 
0.5%
Cyrillic 239
 
< 0.1%
Katakana 104
 
< 0.1%
Hiragana 85
 
< 0.1%
Greek 72
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
43
 
1.5%
26
 
0.9%
24
 
0.8%
24
 
0.8%
21
 
0.7%
21
 
0.7%
19
 
0.7%
19
 
0.7%
18
 
0.6%
18
 
0.6%
Other values (1004) 2658
91.9%
Hangul
ValueCountFrequency (%)
2120
 
1.7%
2108
 
1.7%
2048
 
1.7%
1951
 
1.6%
1951
 
1.6%
1832
 
1.5%
1830
 
1.5%
1751
 
1.4%
1673
 
1.4%
1661
 
1.4%
Other values (1000) 103853
84.6%
Common
ValueCountFrequency (%)
74876
59.5%
, 35254
28.0%
- 2980
 
2.4%
¡ 2005
 
1.6%
¤ 1993
 
1.6%
( 1756
 
1.4%
) 1756
 
1.4%
. 736
 
0.6%
1 670
 
0.5%
2 480
 
0.4%
Other values (58) 3421
 
2.7%
Latin
ValueCountFrequency (%)
e 35017
 
10.4%
i 29856
 
8.8%
a 26608
 
7.9%
t 26277
 
7.8%
n 24615
 
7.3%
o 24012
 
7.1%
r 22449
 
6.6%
s 18219
 
5.4%
l 16302
 
4.8%
c 15394
 
4.6%
Other values (47) 99225
29.4%
Katakana
ValueCountFrequency (%)
7
 
6.7%
6
 
5.8%
6
 
5.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
Other values (31) 57
54.8%
Hiragana
ValueCountFrequency (%)
11
 
12.9%
7
 
8.2%
7
 
8.2%
4
 
4.7%
4
 
4.7%
4
 
4.7%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
Other values (23) 36
42.4%
Cyrillic
ValueCountFrequency (%)
н 27
 
11.3%
а 25
 
10.5%
и 22
 
9.2%
о 21
 
8.8%
с 16
 
6.7%
е 14
 
5.9%
р 13
 
5.4%
к 12
 
5.0%
л 11
 
4.6%
я 10
 
4.2%
Other values (17) 68
28.5%
Greek
ValueCountFrequency (%)
β 23
31.9%
α 21
29.2%
κ 8
 
11.1%
γ 4
 
5.6%
ε 3
 
4.2%
θ 3
 
4.2%
φ 3
 
4.2%
Δ 2
 
2.8%
η 2
 
2.8%
δ 1
 
1.4%
Other values (2) 2
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 458692
77.7%
Hangul 122757
 
20.8%
None 5045
 
0.9%
CJK 2869
 
0.5%
Cyrillic 239
 
< 0.1%
Punctuation 218
 
< 0.1%
Katakana 104
 
< 0.1%
Hiragana 85
 
< 0.1%
CJK Compat Ideographs 22
 
< 0.1%
Compat Jamo 21
 
< 0.1%
Other values (3) 18
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
74876
16.3%
, 35254
 
7.7%
e 35017
 
7.6%
i 29856
 
6.5%
a 26608
 
5.8%
t 26277
 
5.7%
n 24615
 
5.4%
o 24012
 
5.2%
r 22449
 
4.9%
s 18219
 
4.0%
Other values (77) 141509
30.9%
Hangul
ValueCountFrequency (%)
2120
 
1.7%
2108
 
1.7%
2048
 
1.7%
1951
 
1.6%
1951
 
1.6%
1832
 
1.5%
1830
 
1.5%
1751
 
1.4%
1673
 
1.4%
1661
 
1.4%
Other values (995) 103832
84.6%
None
ValueCountFrequency (%)
¡ 2005
39.7%
¤ 1993
39.5%
· 425
 
8.4%
124
 
2.5%
121
 
2.4%
83
 
1.6%
69
 
1.4%
45
 
0.9%
45
 
0.9%
β 23
 
0.5%
Other values (28) 112
 
2.2%
Punctuation
ValueCountFrequency (%)
114
52.3%
59
27.1%
21
 
9.6%
21
 
9.6%
2
 
0.9%
1
 
0.5%
CJK
ValueCountFrequency (%)
43
 
1.5%
26
 
0.9%
24
 
0.8%
24
 
0.8%
21
 
0.7%
21
 
0.7%
19
 
0.7%
19
 
0.7%
18
 
0.6%
18
 
0.6%
Other values (992) 2636
91.9%
Cyrillic
ValueCountFrequency (%)
н 27
 
11.3%
а 25
 
10.5%
и 22
 
9.2%
о 21
 
8.8%
с 16
 
6.7%
е 14
 
5.9%
р 13
 
5.4%
к 12
 
5.0%
л 11
 
4.6%
я 10
 
4.2%
Other values (17) 68
28.5%
Compat Jamo
ValueCountFrequency (%)
17
81.0%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Hiragana
ValueCountFrequency (%)
11
 
12.9%
7
 
8.2%
7
 
8.2%
4
 
4.7%
4
 
4.7%
4
 
4.7%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
Other values (23) 36
42.4%
Katakana
ValueCountFrequency (%)
7
 
6.7%
6
 
5.8%
6
 
5.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
4
 
3.8%
Other values (31) 57
54.8%
CJK Compat Ideographs
ValueCountFrequency (%)
6
27.3%
3
13.6%
2
 
9.1%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (2) 2
 
9.1%
Math Operators
ValueCountFrequency (%)
5
38.5%
5
38.5%
2
 
15.4%
1
 
7.7%
Letterlike Symbols
ValueCountFrequency (%)
4
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

키워드(외국어)
Text

MISSING 

Distinct1940
Distinct (%)96.0%
Missing7980
Missing (%)79.8%
Memory size156.2 KiB
2023-12-13T03:31:11.603798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length443
Median length182
Mean length87.337624
Min length1

Characters and Unicode

Total characters176422
Distinct characters1382
Distinct categories15 ?
Distinct scripts8 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1930 ?
Unique (%)95.5%

Sample

1st rowPharmacogenetics, Pharmacogenomic variants, Practice guideline
2nd rowSecretaries, Career of Secretarial Majors, Career of Secretaries, Research Trends
3rd row경어, 존경어, 겸양어, 정중어, 전단지
4th rowDarknet, Network Traffic Analysis, Gradient Boosting
5th rowBreast neoplasm, Cancer survivors, Mental health, Life style, Health status
ValueCountFrequency (%)
of 407
 
2.0%
the 192
 
0.9%
analysis 152
 
0.7%
and 128
 
0.6%
learning 120
 
0.6%
education 101
 
0.5%
system 93
 
0.5%
network 87
 
0.4%
data 78
 
0.4%
model 73
 
0.4%
Other values (8391) 19228
93.1%
2023-12-13T03:31:12.150528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18719
 
10.6%
e 14615
 
8.3%
i 12453
 
7.1%
a 11095
 
6.3%
n 11006
 
6.2%
t 10803
 
6.1%
o 9973
 
5.7%
r 8883
 
5.0%
, 7785
 
4.4%
s 7467
 
4.2%
Other values (1372) 63623
36.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 128542
72.9%
Space Separator 18719
 
10.6%
Uppercase Letter 11442
 
6.5%
Other Punctuation 8250
 
4.7%
Other Letter 6956
 
3.9%
Dash Punctuation 888
 
0.5%
Open Punctuation 570
 
0.3%
Close Punctuation 569
 
0.3%
Decimal Number 317
 
0.2%
Final Punctuation 96
 
0.1%
Other values (5) 73
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
105
 
1.5%
99
 
1.4%
90
 
1.3%
89
 
1.3%
85
 
1.2%
71
 
1.0%
70
 
1.0%
67
 
1.0%
63
 
0.9%
62
 
0.9%
Other values (1227) 6155
88.5%
Lowercase Letter
ValueCountFrequency (%)
e 14615
11.4%
i 12453
9.7%
a 11095
 
8.6%
n 11006
 
8.6%
t 10803
 
8.4%
o 9973
 
7.8%
r 8883
 
6.9%
s 7467
 
5.8%
l 6254
 
4.9%
c 5901
 
4.6%
Other values (54) 30092
23.4%
Uppercase Letter
ValueCountFrequency (%)
C 1141
 
10.0%
S 1114
 
9.7%
P 882
 
7.7%
A 839
 
7.3%
M 665
 
5.8%
D 660
 
5.8%
R 635
 
5.5%
T 594
 
5.2%
I 589
 
5.1%
E 571
 
5.0%
Other values (21) 3752
32.8%
Other Punctuation
ValueCountFrequency (%)
, 7785
94.4%
191
 
2.3%
. 151
 
1.8%
/ 56
 
0.7%
& 18
 
0.2%
' 18
 
0.2%
8
 
0.1%
" 6
 
0.1%
· 6
 
0.1%
: 4
 
< 0.1%
Other values (4) 7
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 88
27.8%
2 51
16.1%
9 51
16.1%
0 33
 
10.4%
3 26
 
8.2%
4 19
 
6.0%
8 17
 
5.4%
5 15
 
4.7%
6 12
 
3.8%
7 5
 
1.6%
Open Punctuation
ValueCountFrequency (%)
( 545
95.6%
10
 
1.8%
9
 
1.6%
[ 3
 
0.5%
2
 
0.4%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 544
95.6%
10
 
1.8%
9
 
1.6%
] 3
 
0.5%
2
 
0.4%
1
 
0.2%
Math Symbol
ValueCountFrequency (%)
+ 11
45.8%
< 6
25.0%
> 6
25.0%
~ 1
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 887
99.9%
1
 
0.1%
Final Punctuation
ValueCountFrequency (%)
82
85.4%
14
 
14.6%
Initial Punctuation
ValueCountFrequency (%)
32
69.6%
14
30.4%
Space Separator
ValueCountFrequency (%)
18719
100.0%
Control
ValueCountFrequency (%)
 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 139708
79.2%
Common 29481
 
16.7%
Hangul 5046
 
2.9%
Han 1707
 
1.0%
Cyrillic 270
 
0.2%
Katakana 143
 
0.1%
Hiragana 60
 
< 0.1%
Greek 7
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
31
 
1.8%
29
 
1.7%
28
 
1.6%
27
 
1.6%
27
 
1.6%
23
 
1.3%
23
 
1.3%
20
 
1.2%
18
 
1.1%
13
 
0.8%
Other values (641) 1468
86.0%
Hangul
ValueCountFrequency (%)
105
 
2.1%
99
 
2.0%
90
 
1.8%
89
 
1.8%
85
 
1.7%
71
 
1.4%
70
 
1.4%
67
 
1.3%
63
 
1.2%
62
 
1.2%
Other values (495) 4245
84.1%
Latin
ValueCountFrequency (%)
e 14615
 
10.5%
i 12453
 
8.9%
a 11095
 
7.9%
n 11006
 
7.9%
t 10803
 
7.7%
o 9973
 
7.1%
r 8883
 
6.4%
s 7467
 
5.3%
l 6254
 
4.5%
c 5901
 
4.2%
Other values (45) 41258
29.5%
Katakana
ValueCountFrequency (%)
17
 
11.9%
10
 
7.0%
7
 
4.9%
6
 
4.2%
5
 
3.5%
5
 
3.5%
4
 
2.8%
4
 
2.8%
4
 
2.8%
4
 
2.8%
Other values (40) 77
53.8%
Common
ValueCountFrequency (%)
18719
63.5%
, 7785
26.4%
- 887
 
3.0%
( 545
 
1.8%
) 544
 
1.8%
191
 
0.6%
. 151
 
0.5%
1 88
 
0.3%
82
 
0.3%
/ 56
 
0.2%
Other values (39) 433
 
1.5%
Cyrillic
ValueCountFrequency (%)
а 29
 
10.7%
о 24
 
8.9%
е 23
 
8.5%
н 21
 
7.8%
т 20
 
7.4%
с 19
 
7.0%
и 18
 
6.7%
л 17
 
6.3%
к 13
 
4.8%
в 11
 
4.1%
Other values (24) 75
27.8%
Hiragana
ValueCountFrequency (%)
17
28.3%
4
 
6.7%
3
 
5.0%
3
 
5.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
1
 
1.7%
Other values (21) 21
35.0%
Greek
ValueCountFrequency (%)
γ 1
14.3%
ι 1
14.3%
φ 1
14.3%
ν 1
14.3%
η 1
14.3%
α 1
14.3%
ξ 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 168786
95.7%
Hangul 5046
 
2.9%
CJK 1698
 
1.0%
Cyrillic 270
 
0.2%
None 266
 
0.2%
Punctuation 143
 
0.1%
Katakana 143
 
0.1%
Hiragana 60
 
< 0.1%
CJK Compat Ideographs 9
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18719
 
11.1%
e 14615
 
8.7%
i 12453
 
7.4%
a 11095
 
6.6%
n 11006
 
6.5%
t 10803
 
6.4%
o 9973
 
5.9%
r 8883
 
5.3%
, 7785
 
4.6%
s 7467
 
4.4%
Other values (71) 55987
33.2%
None
ValueCountFrequency (%)
191
71.8%
10
 
3.8%
10
 
3.8%
9
 
3.4%
9
 
3.4%
8
 
3.0%
· 6
 
2.3%
¡ 3
 
1.1%
2
 
0.8%
2
 
0.8%
Other values (14) 16
 
6.0%
Hangul
ValueCountFrequency (%)
105
 
2.1%
99
 
2.0%
90
 
1.8%
89
 
1.8%
85
 
1.7%
71
 
1.4%
70
 
1.4%
67
 
1.3%
63
 
1.2%
62
 
1.2%
Other values (495) 4245
84.1%
Punctuation
ValueCountFrequency (%)
82
57.3%
32
 
22.4%
14
 
9.8%
14
 
9.8%
1
 
0.7%
CJK
ValueCountFrequency (%)
31
 
1.8%
29
 
1.7%
28
 
1.6%
27
 
1.6%
27
 
1.6%
23
 
1.4%
23
 
1.4%
20
 
1.2%
18
 
1.1%
13
 
0.8%
Other values (635) 1459
85.9%
Cyrillic
ValueCountFrequency (%)
а 29
 
10.7%
о 24
 
8.9%
е 23
 
8.5%
н 21
 
7.8%
т 20
 
7.4%
с 19
 
7.0%
и 18
 
6.7%
л 17
 
6.3%
к 13
 
4.8%
в 11
 
4.1%
Other values (24) 75
27.8%
Hiragana
ValueCountFrequency (%)
17
28.3%
4
 
6.7%
3
 
5.0%
3
 
5.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
1
 
1.7%
Other values (21) 21
35.0%
Katakana
ValueCountFrequency (%)
17
 
11.9%
10
 
7.0%
7
 
4.9%
6
 
4.2%
5
 
3.5%
5
 
3.5%
4
 
2.8%
4
 
2.8%
4
 
2.8%
4
 
2.8%
Other values (40) 77
53.8%
CJK Compat Ideographs
ValueCountFrequency (%)
4
44.4%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct9878
Distinct (%)98.8%
Missing4
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T03:31:12.547955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length1024
Median length242
Mean length92.818627
Min length1

Characters and Unicode

Total characters927815
Distinct characters1426
Distinct categories20 ?
Distinct scripts7 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9869 ?
Unique (%)98.7%

Sample

1st rowPapillary thyroid carcinoma ¡¤ Tripartite Motif Containing 3 ¡¤ Proliferation ¡¤ Prognosis
2nd rowtradition, soban, reinterpretation, furniture, tray
3rd rowleisure attitudes, exercise adherence, health promotion behavior
4th rowCorni Fructus, DNA damage, apoptosis, reactive oxygen species
5th rowBodhisattva 菩薩, conduct of no-obstruction 無碍行, Gisinnon byeolgi 起信論別記, Gisinnon so/Commentary on the Awakening of Faith 大乘起信論疏, gha-pati 居士, Hwajaeng 和諍, Ilsim/One Mind 一心, Geumgang sammaegyong non 金剛三昧經論, Wonhyo/Weonhyo 元曉
ValueCountFrequency (%)
¡¤ 1990
 
1.9%
of 1787
 
1.7%
the 937
 
0.9%
and 696
 
0.7%
analysis 639
 
0.6%
education 502
 
0.5%
system 460
 
0.4%
learning 459
 
0.4%
model 377
 
0.4%
control 365
 
0.3%
Other values (24866) 98634
92.3%
2023-12-13T03:31:13.093803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
97052
 
10.5%
e 80735
 
8.7%
i 68291
 
7.4%
a 61267
 
6.6%
n 59328
 
6.4%
t 58830
 
6.3%
o 55687
 
6.0%
r 49151
 
5.3%
s 40880
 
4.4%
, 36459
 
3.9%
Other values (1416) 320135
34.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 707500
76.3%
Space Separator 97053
 
10.5%
Uppercase Letter 61977
 
6.7%
Other Punctuation 40537
 
4.4%
Dash Punctuation 5498
 
0.6%
Other Letter 5471
 
0.6%
Decimal Number 2603
 
0.3%
Close Punctuation 2268
 
0.2%
Open Punctuation 2267
 
0.2%
Currency Symbol 1999
 
0.2%
Other values (10) 642
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
78
 
1.4%
65
 
1.2%
63
 
1.2%
62
 
1.1%
58
 
1.1%
53
 
1.0%
51
 
0.9%
48
 
0.9%
46
 
0.8%
46
 
0.8%
Other values (1277) 4901
89.6%
Lowercase Letter
ValueCountFrequency (%)
e 80735
11.4%
i 68291
9.7%
a 61267
 
8.7%
n 59328
 
8.4%
t 58830
 
8.3%
o 55687
 
7.9%
r 49151
 
6.9%
s 40880
 
5.8%
l 35518
 
5.0%
c 33481
 
4.7%
Other values (31) 164332
23.2%
Uppercase Letter
ValueCountFrequency (%)
S 6472
 
10.4%
C 6124
 
9.9%
P 4626
 
7.5%
A 4334
 
7.0%
M 3801
 
6.1%
D 3511
 
5.7%
T 3192
 
5.2%
I 3111
 
5.0%
E 3050
 
4.9%
R 3027
 
4.9%
Other values (18) 20729
33.4%
Other Punctuation
ValueCountFrequency (%)
, 36459
89.9%
¡ 2027
 
5.0%
. 1021
 
2.5%
· 354
 
0.9%
/ 283
 
0.7%
' 128
 
0.3%
: 120
 
0.3%
& 68
 
0.2%
32
 
0.1%
" 14
 
< 0.1%
Other values (10) 31
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 685
26.3%
2 499
19.2%
9 353
13.6%
3 288
11.1%
0 249
 
9.6%
5 151
 
5.8%
4 133
 
5.1%
6 104
 
4.0%
8 78
 
3.0%
7 63
 
2.4%
Math Symbol
ValueCountFrequency (%)
> 36
31.3%
< 35
30.4%
+ 29
25.2%
= 4
 
3.5%
~ 3
 
2.6%
2
 
1.7%
2
 
1.7%
2
 
1.7%
1
 
0.9%
1
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 2186
96.4%
42
 
1.9%
] 29
 
1.3%
9
 
0.4%
2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 2183
96.3%
42
 
1.9%
[ 29
 
1.3%
10
 
0.4%
3
 
0.1%
Control
ValueCountFrequency (%)
 11
78.6%
2
 
14.3%
1
 
7.1%
Space Separator
ValueCountFrequency (%)
97052
> 99.9%
  1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 5489
99.8%
9
 
0.2%
Final Punctuation
ValueCountFrequency (%)
332
89.7%
38
 
10.3%
Initial Punctuation
ValueCountFrequency (%)
87
69.6%
38
30.4%
Modifier Symbol
ValueCountFrequency (%)
` 4
80.0%
¨ 1
 
20.0%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Currency Symbol
ValueCountFrequency (%)
¤ 1999
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Format
ValueCountFrequency (%)
­ 1
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 769400
82.9%
Common 152865
 
16.5%
Hangul 3492
 
0.4%
Han 1953
 
0.2%
Greek 80
 
< 0.1%
Hiragana 18
 
< 0.1%
Katakana 7
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
28
 
1.4%
19
 
1.0%
18
 
0.9%
16
 
0.8%
12
 
0.6%
12
 
0.6%
11
 
0.6%
11
 
0.6%
11
 
0.6%
11
 
0.6%
Other values (854) 1804
92.4%
Hangul
ValueCountFrequency (%)
78
 
2.2%
65
 
1.9%
63
 
1.8%
62
 
1.8%
58
 
1.7%
53
 
1.5%
51
 
1.5%
48
 
1.4%
46
 
1.3%
46
 
1.3%
Other values (390) 2922
83.7%
Common
ValueCountFrequency (%)
97052
63.5%
, 36459
 
23.9%
- 5489
 
3.6%
) 2186
 
1.4%
( 2183
 
1.4%
¡ 2027
 
1.3%
¤ 1999
 
1.3%
. 1021
 
0.7%
1 685
 
0.4%
2 499
 
0.3%
Other values (58) 3265
 
2.1%
Latin
ValueCountFrequency (%)
e 80735
 
10.5%
i 68291
 
8.9%
a 61267
 
8.0%
n 59328
 
7.7%
t 58830
 
7.6%
o 55687
 
7.2%
r 49151
 
6.4%
s 40880
 
5.3%
l 35518
 
4.6%
c 33481
 
4.4%
Other values (47) 226232
29.4%
Greek
ValueCountFrequency (%)
β 23
28.7%
α 22
27.5%
κ 8
 
10.0%
γ 6
 
7.5%
φ 4
 
5.0%
η 3
 
3.8%
θ 3
 
3.8%
ε 3
 
3.8%
Δ 2
 
2.5%
δ 1
 
1.2%
Other values (5) 5
 
6.2%
Hiragana
ValueCountFrequency (%)
2
 
11.1%
2
 
11.1%
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (5) 5
27.8%
Katakana
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 917198
98.9%
None 4635
 
0.5%
Hangul 3472
 
0.4%
CJK 1937
 
0.2%
Punctuation 499
 
0.1%
Compat Jamo 20
 
< 0.1%
Hiragana 18
 
< 0.1%
CJK Compat Ideographs 16
 
< 0.1%
Math Operators 7
 
< 0.1%
Katakana 7
 
< 0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
97052
 
10.6%
e 80735
 
8.8%
i 68291
 
7.4%
a 61267
 
6.7%
n 59328
 
6.5%
t 58830
 
6.4%
o 55687
 
6.1%
r 49151
 
5.4%
s 40880
 
4.5%
, 36459
 
4.0%
Other values (80) 309518
33.7%
None
ValueCountFrequency (%)
¡ 2027
43.7%
¤ 1999
43.1%
· 354
 
7.6%
42
 
0.9%
42
 
0.9%
32
 
0.7%
β 23
 
0.5%
α 22
 
0.5%
10
 
0.2%
9
 
0.2%
Other values (27) 75
 
1.6%
Punctuation
ValueCountFrequency (%)
332
66.5%
87
 
17.4%
38
 
7.6%
38
 
7.6%
3
 
0.6%
1
 
0.2%
Hangul
ValueCountFrequency (%)
78
 
2.2%
65
 
1.9%
63
 
1.8%
62
 
1.8%
58
 
1.7%
53
 
1.5%
51
 
1.5%
48
 
1.4%
46
 
1.3%
46
 
1.3%
Other values (385) 2902
83.6%
CJK
ValueCountFrequency (%)
28
 
1.4%
19
 
1.0%
18
 
0.9%
16
 
0.8%
12
 
0.6%
12
 
0.6%
11
 
0.6%
11
 
0.6%
11
 
0.6%
11
 
0.6%
Other values (843) 1788
92.3%
Compat Jamo
ValueCountFrequency (%)
16
80.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Letterlike Symbols
ValueCountFrequency (%)
4
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
4
25.0%
2
12.5%
2
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Hiragana
ValueCountFrequency (%)
2
 
11.1%
2
 
11.1%
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (5) 5
27.8%
Math Operators
ValueCountFrequency (%)
2
28.6%
2
28.6%
2
28.6%
1
14.3%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%
Katakana
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Distinct174
Distinct (%)1.7%
Missing20
Missing (%)0.2%
Memory size156.2 KiB
2023-12-13T03:31:13.382205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length30
Mean length12.555812
Min length8

Characters and Unicode

Total characters125307
Distinct characters174
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자연과학 > 생물학 > 유전학
2nd row예술체육학 > 디자인 > 환경디자인 > 생활/실내디자인
3rd row사회과학 > 교육학 > 교과교육학 > 체육교육학
4th row공학 > 생물공학
5th row인문학 > 한국어와문학
ValueCountFrequency (%)
12863
36.0%
사회과학 2262
 
6.3%
공학 2096
 
5.9%
인문학 1662
 
4.7%
의약학 1236
 
3.5%
자연과학 1121
 
3.1%
복합학 575
 
1.6%
농수해양학 523
 
1.5%
예술체육학 505
 
1.4%
물리학 452
 
1.3%
Other values (204) 12411
34.8%
2023-12-13T03:31:13.798233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25726
20.5%
21740
17.3%
> 12863
 
10.3%
5521
 
4.4%
4290
 
3.4%
3194
 
2.5%
2881
 
2.3%
2803
 
2.2%
2353
 
1.9%
1916
 
1.5%
Other values (164) 42020
33.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 86358
68.9%
Space Separator 25726
 
20.5%
Math Symbol 12863
 
10.3%
Other Punctuation 360
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21740
25.2%
5521
 
6.4%
4290
 
5.0%
3194
 
3.7%
2881
 
3.3%
2803
 
3.2%
2353
 
2.7%
1916
 
2.2%
1813
 
2.1%
1758
 
2.0%
Other values (161) 38089
44.1%
Space Separator
ValueCountFrequency (%)
25726
100.0%
Math Symbol
ValueCountFrequency (%)
> 12863
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 360
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 86358
68.9%
Common 38949
31.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21740
25.2%
5521
 
6.4%
4290
 
5.0%
3194
 
3.7%
2881
 
3.3%
2803
 
3.2%
2353
 
2.7%
1916
 
2.2%
1813
 
2.1%
1758
 
2.0%
Other values (161) 38089
44.1%
Common
ValueCountFrequency (%)
25726
66.1%
> 12863
33.0%
/ 360
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 86358
68.9%
ASCII 38949
31.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
25726
66.1%
> 12863
33.0%
/ 360
 
0.9%
Hangul
ValueCountFrequency (%)
21740
25.2%
5521
 
6.4%
4290
 
5.0%
3194
 
3.7%
2881
 
3.3%
2803
 
3.2%
2353
 
2.7%
1916
 
2.2%
1813
 
2.1%
1758
 
2.0%
Other values (161) 38089
44.1%

Sample

논문명(국문)논문명(외국어)논문명(영어)저자공동저자학술지명(국문)학술지명(외국어)국제표준연속간행물발행기관명(국문)발행기관명(영문)등재구분해외 등재 구분발행년데이터기준일시작페이지끝페이지키워드(국문)키워드(외국어)키워드(영문)주제분야
13185Tripartite Motif Containing 3 inhibits the aggressive behaviors of papillary thyroid carcinoma and indicates lower recurrence risk<NA>Tripartite Motif Containing 3 inhibits the aggressive behaviors of papillary thyroid carcinoma and indicates lower recurrence riskSong YubaoGao Zefeng;Yan Zhifeng;Zheng CaihongGenes & GenomicsGenes and Genomics19769571한국유전학회The Genetics Society Of KoreaKCI등재SCIE:SCOPUS20222023-02-22444455465Papillary thyroid carcinoma ¡¤ Tripartite Motif Containing 3 ¡¤ Proliferation ¡¤ Prognosis<NA>Papillary thyroid carcinoma ¡¤ Tripartite Motif Containing 3 ¡¤ Proliferation ¡¤ Prognosis자연과학 > 생물학 > 유전학
12087한국 전통 사각소반의 ‘상판과 변죽의 형태’를 응용한 트레이 디자인 연구<NA>A Study on Tray Design Applying the Top Plate and Rim Shape of Korean Traditional Rectangular Soban한동엽이정교한국가구학회지Journal of the Korea furniture Society12263109한국가구학회Korea Furniture SocietyKCI등재<NA>20222023-02-22334457465<NA><NA>tradition, soban, reinterpretation, furniture, tray예술체육학 > 디자인 > 환경디자인 > 생활/실내디자인
5737청소년들의 여가태도가 운동지속수행에 미치는 영향: 건강증진행위의 매개효과<NA>Effect of Adolescents' Leisure Attitude on Exercise Adherence: Mediating Effect of Health Promotion Behavior김보람윤주석한국체육교육학회지Korean Society For The Study Of Physical Education12299685한국체육교육학회Korean Society For The Study Of Physical EducationKCI등재<NA>20222023-02-2227489101여가태도, 운동지속수행, 건강증진행위<NA>leisure attitudes, exercise adherence, health promotion behavior사회과학 > 교육학 > 교과교육학 > 체육교육학
11273The Inhibitory Effect of Corni Fructus against Oxidative Stress-induced Cellular Damage in C2C12 Murine Myoblasts<NA>The Inhibitory Effect of Corni Fructus against Oxidative Stress-induced Cellular Damage in C2C12 Murine Myoblasts김성옥정지숙;박철;Lee Hyesook;최성현;김기영;김혜영;최영현;황은주Biotechnology and Bioprocess Engineering<NA>12268372한국생물공학회The Korean Society For Biotechnology And BioengineeringKCI등재SCIE:SCOPUS20222023-02-22273386397Corni Fructus, DNA damage, apoptosis, reactive oxygen species<NA>Corni Fructus, DNA damage, apoptosis, reactive oxygen species공학 > 생물공학
4898Wonhyo’s View of Human Beings and his Redemption of Mankind<NA>Wonhyo’s View of Human Beings and his Redemption of Mankind남동신<NA>The Review of Korean StudiesThe Review of Korean Studies12290076한국학중앙연구원The Academy of Korean StudiesKCI등재SCOPUS20222023-02-22251942Bodhisattva 菩薩, conduct of no-obstruction 無碍行, Gisinnon byeolgi 起信論別記, Gisinnon so/Commentary on the Awakening of Faith 大乘起信論疏, gha-pati 居士, Hwajaeng 和諍, Ilsim/One Mind 一心, Geumgang sammaegyong non 金剛三昧經論, Wonhyo/Weonhyo 元曉<NA>Bodhisattva 菩薩, conduct of no-obstruction 無碍行, Gisinnon byeolgi 起信論別記, Gisinnon so/Commentary on the Awakening of Faith 大乘起信論疏, gha-pati 居士, Hwajaeng 和諍, Ilsim/One Mind 一心, Geumgang sammaegyong non 金剛三昧經論, Wonhyo/Weonhyo 元曉인문학 > 한국어와문학
16052Reliability and Validity of the Korean Version of Disruptive Behavior Disorders Rating Scale, DSM-5 Version-Parent Form<NA>Reliability and Validity of the Korean Version of Disruptive Behavior Disorders Rating Scale, DSM-5 Version-Parent FormLee Eun SolRyu Vin;Choi Jungwon;Oh Yunhye;Yoon Jin Woong;Han Hyeree;Hong Hyeon;Son Hye Jung;Lee Ji Hyun;Park SubinPSYCHIATRY INVESTIGATION<NA>17383684대한신경정신의학회Korean Neuropsychiatric AssociationKCI등재SCIE:SSCI20222023-02-221911884897Disruptive behavior disorders, Adolescents, Children, Screening tool, Psychometric properties.<NA>Disruptive behavior disorders, Adolescents, Children, Screening tool, Psychometric properties.의약학 > 정신과학
9077약물유전자검사 결과 해석의 임상검사실 적용 권고안Recommendations for Clinical Application of Pharmacogenetic Test Results Interpretation by Clinical LaboratoriesRecommendations for Clinical Application of Pharmacogenetic Test Results Interpretation by Clinical Laboratories임정훈김지은;조선미Laboratory Medicine OnlineLaboratory Medicine Online<NA>대한진단검사의학회Korean Society For Laboratory MedicineKCI등재<NA>20222023-02-22124244261Pharmacogenetics, Pharmacogenomic variants, Practice guidelinePharmacogenetics, Pharmacogenomic variants, Practice guidelinePharmacogenetics, Pharmacogenomic variants, Practice guideline의약학 > 임상병리학 > 기타임상병리학
12400LMX가 구성원들의 직무만족과 혁신적 행동에 미치는 경로 탐색: 정보공유와 감정공유를 중심으로<NA>Path Exploration from LMX to Employee Job Satisfaction and Innovative Behavior: Focusing on Information Sharing and Emotion Sharing김정식<NA>대한경영학회지Korean Journal of Business Administration12262234대한경영학회The Korean Academic Association of Business AdministrationKCI등재<NA>20222023-02-22351019471964LMX, 정보공유, 감정공유, 직무만족, 혁신적 행동<NA>LMX, Information Sharing, Emotion Sharing, Job Satisfaction, Innovative Behavior사회과학 > 경영학
13568『비서·사무경영연구』의 연구동향 분석: 비서직의 진로 및 경력에 대한 연구를 중심으로Research Trend Analysis of 'Korean Association of Secretarial Science': Focused on the respective Career of Secretarial Majors and SecretariesResearch Trend Analysis of 'Korean Association of Secretarial Science': Focused on the respective Career of Secretarial Majors and Secretaries김민아윤홍인;백지연비서·사무경영연구Journal of Secretarial Studies26717379한국비서학회Korean Association Of Secretarial StudiesKCI등재<NA>20222023-02-2231275103비서직, 진로, 경력, 연구동향Secretaries, Career of Secretarial Majors, Career of Secretaries, Research TrendsSecretaries, Career of Secretarial Majors, Career of Secretaries, Research Trends사회과학 > 경영학
143보안 전문 인력 양성을 위한 정보보안 수업 개선 방안 특성화 과정을 중심으로<NA>Information Security Class Improvement Plan to Cultivate Security Professionals - Focusing on Specialization Course박중오<NA>산업융합연구Journal of Industrial Convergence26358875대한산업경영학회Dae Han Society of Industrial ManagementKCI등재<NA>20222023-02-222032331정보보안, 보안실습, 대학교육, 보안학과, 정보보호<NA>Information Security, Security Practice, College Education, Security Department, Information Protection복합학 > 학제간연구
논문명(국문)논문명(외국어)논문명(영어)저자공동저자학술지명(국문)학술지명(외국어)국제표준연속간행물발행기관명(국문)발행기관명(영문)등재구분해외 등재 구분발행년데이터기준일시작페이지끝페이지키워드(국문)키워드(외국어)키워드(영문)주제분야
13910피겨 스케이팅 선수의 근력 및 무산소성 파워가 경기력에 미치는 영향<NA>The Effects of Muscle Strength and Anaerobic Power on Performance in Figure Skaters안나영<NA>코칭능력개발지Journal of Coaching Development12296597한국코칭능력개발원Korea Coaching Development CenterKCI등재<NA>20222023-02-22244279286피겨 스케이팅, 근력, 무산소성 파워, 점프, 스핀, 경기력<NA>figure skating, muscle strength, anaerobic power, spin, jump, performance예술체육학 > 체육 > 기타체육
3655대순 신앙의 천계(天界) 관념 -무극도를 중심으로-The Concept of the Heavenly Realm in Daesoon Belief: Focused on the Comparative Analysis of the Heavenly Realm in DaoismThe Concept of the Heavenly Realm in Daesoon Belief: Focused on the Comparative Analysis of the Heavenly Realm in Daoism박상규<NA>종교연구Studies in Religion(The Journal of the Korean Association for the History of Religions)12263516한국종교학회Korean Association For The History Of ReligionsKCI등재<NA>20222023-02-22822173205구천, 구중천, 대순진리회, 도솔궁, 도솔천, 무극도, 삼십삼천, 삼십육천, 옥황상제the Ninth Heaven (Gucheon), the nine heavens, Daesoon Jinrihoe, Tushita Heaven Palace, Tushita Heaven, Mugeuk-do (Limitless Dao), the Heaven of Thirty-three Gods, the Thirty-six Heavens, the Jade Emperorthe Ninth Heaven (Gucheon), the nine heavens, Daesoon Jinrihoe, Tushita Heaven Palace, Tushita Heaven, Mugeuk-do (Limitless Dao), the Heaven of Thirty-three Gods, the Thirty-six Heavens, the Jade Emperor인문학 > 종교학
8224Finite-time Consensus of Networked Euler-Lagrange Systems via STA-based Output Feedback<NA>Finite-time Consensus of Networked Euler-Lagrange Systems via STA-based Output FeedbackYanyan FanZhenlin Jin;Baosu Guo;Xiaoyuan Luo;Xinping GuanInternational Journal of Control, Automation, and SystemsInternational Journal of Control, Automation, and Systems15986446제어·로봇·시스템학회Institute of Control, Robotics and SystemsKCI등재SCIE:SCOPUS20222023-02-2220929933005Distributed control, Euler-Lagrange systems, finite-time consensus, super-twisting observer.<NA>Distributed control, Euler-Lagrange systems, finite-time consensus, super-twisting observer.공학 > 제어계측공학
12490아버지의 애정-자율적 양육태도가 유아의 유치원 적응에 미치는 영향: 유아의 결과예측 사고와 긍정적 또래상호작용의 이중매개효과<NA>The Effect of Father's Affection-Autonomous Parenting Attitude on Infant's Adjustment to kindergarten: The Double Mediating Effect of Infant's Outcome Prediction Skill and Positive Peer Interaction윤주연<NA>한국영유아보육학The Korea Association of Child Care and Education12266795한국영유아보육학회The Korea Association Of Child And EducationKCI등재<NA>20222023-02-22<NA>1372563아버지의 애정-자율적 양육태도, 유아의 유치원 적응, 유아의 결과예측 사고, 유아의 긍정적 또래상호작용, 이중매개효과<NA>father's affection-autonomous parenting attitude, children's adjustment to kindergarten, children's outcome prediction skill, positive peer interaction, double mediation effect사회과학 > 사회복지학
8185Decentralized Fault Tolerant Control of Modular Manipulators System Based on Adaptive Dynamic Programming<NA>Decentralized Fault Tolerant Control of Modular Manipulators System Based on Adaptive Dynamic ProgrammingFan ZhouFujie Nie;Tianjiao An;Bing Ma;Yuanchun LiInternational Journal of Control, Automation, and SystemsInternational Journal of Control, Automation, and Systems15986446제어·로봇·시스템학회Institute of Control, Robotics and SystemsKCI등재SCIE:SCOPUS20222023-02-22201032523263Adaptive dynamic programming, decentralized control, fault tolerant control, modular manipulators.<NA>Adaptive dynamic programming, decentralized control, fault tolerant control, modular manipulators.공학 > 제어계측공학
8917A Qualitative Study of Design Students’ Color Tool Use<NA>A Qualitative Study of Design Students’ Color Tool Use원세화<NA>디자인학연구Archives of Design Research12268046한국디자인학회<NA>KCI등재SCOPUS20222023-02-223536979Color Tool, Color Data, Design Students, Qualitative Study, Colorful Design Trend<NA>Color Tool, Color Data, Design Students, Qualitative Study, Colorful Design Trend예술체육학 > 디자인
3207기술적 보호조치의 구분과 저작물 이용과의 관계에 관한 고찰 ―노래방 기술적 보호조치 무력화 장치 제조·판매 사건―<NA>A Study on the Differentiation between Access and Rights Control, and the Relationship between Technological Protections Measures and Exploitation of Copyrighted Works ―Case Study of the Korean Supreme Court Decision on the Circumvention of TPM―이대희<NA>경영법률Journal of Business Administration & Law12293261한국경영법률학회The Korean Academic Society Of Business Administration And LawKCI등재<NA>20222023-02-22322395432저작물, 저작권, 기술적 보호조치, 접근통제, 권리통제, 무력화, 복제, 배포, 공연, 저작물의 이용·향유<NA>copryrighted works, copyright, technological protection measure(TPM), access control, rights control, circumvention, reproduction, distribution, public performance, exploitation¡¤enjoyment of works사회과학 > 법학
1059Study on seismic performance of SRC special-shaped columns with different loading angles<NA>Study on seismic performance of SRC special-shaped columns with different loading anglesPengfei QuZuqiang Liu;Jian-yang XueSteel and Composite Structures, An International JournalSteel and Composite Structures, An International Journal12299367국제구조공학회International Association of Structural Engineering And Mechanics<NA>SCIE:SCOPUS20222023-02-22446789801cyclic loading, loading angle, seismic performance, special-shaped column, steel reinforced concrete<NA>cyclic loading, loading angle, seismic performance, special-shaped column, steel reinforced concrete공학 > 토목공학 > 구조공학
6305다목적실용위성 영상처리 및 활용KOMPSAT Image Processing and ApplicationKOMPSAT Image Processing and Application이광재김예슬;채성호;오관영;이선구대한원격탐사학회지Korean Journal of Remote Sensing12256161대한원격탐사학회The Korean Society Of Remote SensingKCI등재<NA>20222023-02-2238618711877KOMPSAT, SAR, Semantic segmentation, Fusion, Classification, Deep learning, Change detection<NA>KOMPSAT, SAR, Semantic segmentation, Fusion, Classification, Deep learning, Change detection자연과학 > 기타자연과학
11967지리적 환경이 가스터빈 고온부품에 미치는 영향 분석Effect of Geographical Environment on Gas Turbine Hot Gas PartsEffect of Geographical Environment on Gas Turbine Hot Gas Parts유원주윤동석비파괴검사학회지Journal of the Korean Society for Nondestructive Testing12257842한국비파괴검사학회The Korean Society For Nondestructive Testing, Inc.KCI등재<NA>20222023-02-22423244250가스터빈, 고온부품, 샘플링, SEM-EDS, 부식Gas Turbine, Hot Gas Part, Sampling, SEM-EDS, CorrosionGas Turbine, Hot Gas Part, Sampling, SEM-EDS, Corrosion공학 > 기계공학