Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells47
Missing cells (%)0.2%
Duplicate rows1274
Duplicate rows (%)12.7%
Total size in memory312.5 KiB
Average record size in memory32.0 B

Variable types

Text3

Dataset

Description우리 대학은 대한민국 농수산업의 특성화 대학으로서 국내외의 다양한 농수축산 자료목록을 공개하여 국민의 알 권리 충족에 기여하고자 함
URLhttps://www.data.go.kr/data/15116535/fileData.do

Alerts

Dataset has 1274 (12.7%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 22:54:34.125735
Analysis finished2023-12-12 22:54:36.511643
Duration2.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct6960
Distinct (%)69.7%
Missing14
Missing (%)0.1%
Memory size156.2 KiB
2023-12-13T07:54:36.724657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length199
Median length136
Mean length21.323052
Min length1

Characters and Unicode

Total characters212932
Distinct characters2123
Distinct categories17 ?
Distinct scripts6 ?
Distinct blocks14 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5695 ?
Unique (%)57.0%

Sample

1st row첨단유리온실용 알루미늄구조재의 규격표준화 및 적합소재 개발
2nd row(농업과학기술연구개발결과)영농활용자료
3rd row농어업인 삶의 질 향상 및 농어촌 지역개발 2020년 시행계획
4th row(플로리스트의 첫 걸음)화훼장식기사
5th row농산물 판매 홍보전략
ValueCountFrequency (%)
3170
 
7.6%
543
 
1.3%
위한 328
 
0.8%
of 291
 
0.7%
and 275
 
0.7%
the 211
 
0.5%
연구 205
 
0.5%
농산물 164
 
0.4%
관한 164
 
0.4%
방안 150
 
0.4%
Other values (14320) 36449
86.9%
2023-12-13T07:54:37.200488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32207
 
15.1%
3529
 
1.7%
e 3437
 
1.6%
2765
 
1.3%
n 2743
 
1.3%
o 2698
 
1.3%
: 2670
 
1.3%
a 2658
 
1.2%
2467
 
1.2%
i 2458
 
1.2%
Other values (2113) 155300
72.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 123918
58.2%
Space Separator 32214
 
15.1%
Lowercase Letter 30879
 
14.5%
Decimal Number 9263
 
4.4%
Other Punctuation 5856
 
2.8%
Uppercase Letter 4968
 
2.3%
Open Punctuation 2038
 
1.0%
Close Punctuation 2036
 
1.0%
Dash Punctuation 1068
 
0.5%
Math Symbol 507
 
0.2%
Other values (7) 185
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3529
 
2.8%
2765
 
2.2%
2467
 
2.0%
2020
 
1.6%
1904
 
1.5%
1865
 
1.5%
1699
 
1.4%
1590
 
1.3%
1481
 
1.2%
1422
 
1.1%
Other values (1993) 103176
83.3%
Uppercase Letter
ValueCountFrequency (%)
A 585
 
11.8%
R 395
 
8.0%
I 350
 
7.0%
T 337
 
6.8%
E 326
 
6.6%
P 308
 
6.2%
C 296
 
6.0%
S 272
 
5.5%
O 265
 
5.3%
F 229
 
4.6%
Other values (19) 1605
32.3%
Lowercase Letter
ValueCountFrequency (%)
e 3437
11.1%
n 2743
 
8.9%
o 2698
 
8.7%
a 2658
 
8.6%
i 2458
 
8.0%
r 2441
 
7.9%
t 2213
 
7.2%
s 1915
 
6.2%
l 1402
 
4.5%
c 1338
 
4.3%
Other values (17) 7576
24.5%
Other Punctuation
ValueCountFrequency (%)
: 2670
45.6%
. 1464
25.0%
· 618
 
10.6%
, 571
 
9.8%
' 138
 
2.4%
/ 134
 
2.3%
& 77
 
1.3%
? 66
 
1.1%
! 52
 
0.9%
; 19
 
0.3%
Other values (10) 47
 
0.8%
Decimal Number
ValueCountFrequency (%)
0 2404
26.0%
2 1998
21.6%
1 1851
20.0%
9 920
 
9.9%
8 461
 
5.0%
3 360
 
3.9%
7 358
 
3.9%
4 322
 
3.5%
5 314
 
3.4%
6 270
 
2.9%
Other values (4) 5
 
0.1%
Math Symbol
ValueCountFrequency (%)
= 408
80.5%
~ 74
 
14.6%
10
 
2.0%
9
 
1.8%
+ 5
 
1.0%
1
 
0.2%
Letter Number
ValueCountFrequency (%)
58
40.6%
48
33.6%
20
 
14.0%
10
 
7.0%
7
 
4.9%
Open Punctuation
ValueCountFrequency (%)
( 1933
94.8%
[ 95
 
4.7%
9
 
0.4%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 1932
94.9%
] 94
 
4.6%
9
 
0.4%
1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
32207
> 99.9%
  7
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 1067
99.9%
1
 
0.1%
Other Symbol
ValueCountFrequency (%)
1
50.0%
1
50.0%
Control
ValueCountFrequency (%)
 25
100.0%
Modifier Letter
ValueCountFrequency (%)
11
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 100126
47.0%
Common 53024
24.9%
Latin 35990
 
16.9%
Han 20201
 
9.5%
Hiragana 1911
 
0.9%
Katakana 1680
 
0.8%

Most frequent character per script

Han
ValueCountFrequency (%)
804
 
4.0%
788
 
3.9%
529
 
2.6%
367
 
1.8%
356
 
1.8%
303
 
1.5%
281
 
1.4%
267
 
1.3%
262
 
1.3%
248
 
1.2%
Other values (1029) 15996
79.2%
Hangul
ValueCountFrequency (%)
3529
 
3.5%
2765
 
2.8%
2467
 
2.5%
2020
 
2.0%
1904
 
1.9%
1865
 
1.9%
1699
 
1.7%
1590
 
1.6%
1481
 
1.5%
1422
 
1.4%
Other values (815) 79384
79.3%
Katakana
ValueCountFrequency (%)
122
 
7.3%
71
 
4.2%
69
 
4.1%
65
 
3.9%
62
 
3.7%
61
 
3.6%
58
 
3.5%
57
 
3.4%
51
 
3.0%
50
 
3.0%
Other values (67) 1014
60.4%
Hiragana
ValueCountFrequency (%)
556
29.1%
235
 
12.3%
85
 
4.4%
84
 
4.4%
62
 
3.2%
61
 
3.2%
58
 
3.0%
54
 
2.8%
52
 
2.7%
50
 
2.6%
Other values (52) 614
32.1%
Latin
ValueCountFrequency (%)
e 3437
 
9.5%
n 2743
 
7.6%
o 2698
 
7.5%
a 2658
 
7.4%
i 2458
 
6.8%
r 2441
 
6.8%
t 2213
 
6.1%
s 1915
 
5.3%
l 1402
 
3.9%
c 1338
 
3.7%
Other values (51) 12687
35.3%
Common
ValueCountFrequency (%)
32207
60.7%
: 2670
 
5.0%
0 2404
 
4.5%
2 1998
 
3.8%
( 1933
 
3.6%
) 1932
 
3.6%
1 1851
 
3.5%
. 1464
 
2.8%
- 1067
 
2.0%
9 920
 
1.7%
Other values (49) 4578
 
8.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 100027
47.0%
ASCII 88149
41.4%
CJK 19830
 
9.3%
Hiragana 1911
 
0.9%
Katakana 1705
 
0.8%
None 671
 
0.3%
CJK Compat Ideographs 370
 
0.2%
Number Forms 143
 
0.1%
Compat Jamo 99
 
< 0.1%
Math Operators 19
 
< 0.1%
Other values (4) 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
32207
36.5%
e 3437
 
3.9%
n 2743
 
3.1%
o 2698
 
3.1%
: 2670
 
3.0%
a 2658
 
3.0%
i 2458
 
2.8%
r 2441
 
2.8%
0 2404
 
2.7%
t 2213
 
2.5%
Other values (75) 32220
36.6%
Hangul
ValueCountFrequency (%)
3529
 
3.5%
2765
 
2.8%
2467
 
2.5%
2020
 
2.0%
1904
 
1.9%
1865
 
1.9%
1699
 
1.7%
1590
 
1.6%
1481
 
1.5%
1422
 
1.4%
Other values (814) 79285
79.3%
CJK
ValueCountFrequency (%)
804
 
4.1%
788
 
4.0%
529
 
2.7%
367
 
1.9%
356
 
1.8%
303
 
1.5%
281
 
1.4%
267
 
1.3%
262
 
1.3%
248
 
1.3%
Other values (989) 15625
78.8%
None
ValueCountFrequency (%)
· 618
92.1%
9
 
1.3%
9
 
1.3%
7
 
1.0%
  7
 
1.0%
3
 
0.4%
2
 
0.3%
2
 
0.3%
2
 
0.3%
ü 2
 
0.3%
Other values (10) 10
 
1.5%
Hiragana
ValueCountFrequency (%)
556
29.1%
235
 
12.3%
85
 
4.4%
84
 
4.4%
62
 
3.2%
61
 
3.2%
58
 
3.0%
54
 
2.8%
52
 
2.7%
50
 
2.6%
Other values (52) 614
32.1%
Katakana
ValueCountFrequency (%)
122
 
7.2%
71
 
4.2%
69
 
4.0%
65
 
3.8%
62
 
3.6%
61
 
3.6%
58
 
3.4%
57
 
3.3%
51
 
3.0%
50
 
2.9%
Other values (69) 1039
60.9%
Compat Jamo
ValueCountFrequency (%)
99
100.0%
Number Forms
ValueCountFrequency (%)
58
40.6%
48
33.6%
20
 
14.0%
10
 
7.0%
7
 
4.9%
CJK Compat Ideographs
ValueCountFrequency (%)
55
14.9%
48
13.0%
44
11.9%
40
10.8%
27
 
7.3%
24
 
6.5%
20
 
5.4%
19
 
5.1%
13
 
3.5%
6
 
1.6%
Other values (29) 74
20.0%
Math Operators
ValueCountFrequency (%)
10
52.6%
9
47.4%
Punctuation
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%
CJK Ext A
ValueCountFrequency (%)
1
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Distinct3069
Distinct (%)30.7%
Missing19
Missing (%)0.2%
Memory size156.2 KiB
2023-12-13T07:54:37.471257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length46
Mean length7.2270314
Min length2

Characters and Unicode

Total characters72133
Distinct characters963
Distinct categories13 ?
Distinct scripts7 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2124 ?
Unique (%)21.3%

Sample

1st row용역보고서
2nd row강원도농업기술원
3rd row농림축산식품부농어업인 삶의 질 향상 및 농어촌 지역개발 위원회
4th row한국화훼장식기사협회
5th row이수근 저
ValueCountFrequency (%)
농촌진흥청 992
 
7.6%
한국농촌경제연구원 789
 
6.0%
국립농업과학원 211
 
1.6%
국립원예특작과학원 191
 
1.5%
농림부 141
 
1.1%
120
 
0.9%
위원회 114
 
0.9%
도서 107
 
0.8%
1종 107
 
0.8%
대학 107
 
0.8%
Other values (3802) 10207
78.0%
2023-12-13T07:54:37.921288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4243
 
5.9%
3132
 
4.3%
2901
 
4.0%
2123
 
2.9%
2121
 
2.9%
1723
 
2.4%
1672
 
2.3%
1405
 
1.9%
1390
 
1.9%
1286
 
1.8%
Other values (953) 50137
69.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59360
82.3%
Lowercase Letter 6051
 
8.4%
Space Separator 3132
 
4.3%
Uppercase Letter 1921
 
2.7%
Other Punctuation 1428
 
2.0%
Decimal Number 135
 
0.2%
Dash Punctuation 38
 
0.1%
Open Punctuation 30
 
< 0.1%
Close Punctuation 30
 
< 0.1%
Math Symbol 4
 
< 0.1%
Other values (3) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4243
 
7.1%
2901
 
4.9%
2123
 
3.6%
2121
 
3.6%
1723
 
2.9%
1672
 
2.8%
1405
 
2.4%
1390
 
2.3%
1286
 
2.2%
1268
 
2.1%
Other values (870) 39228
66.1%
Lowercase Letter
ValueCountFrequency (%)
e 718
11.9%
a 645
10.7%
r 539
8.9%
n 535
8.8%
i 517
 
8.5%
o 430
 
7.1%
l 421
 
7.0%
t 408
 
6.7%
s 270
 
4.5%
h 246
 
4.1%
Other values (22) 1322
21.8%
Uppercase Letter
ValueCountFrequency (%)
A 162
 
8.4%
S 135
 
7.0%
C 131
 
6.8%
M 126
 
6.6%
R 124
 
6.5%
N 117
 
6.1%
I 102
 
5.3%
L 96
 
5.0%
T 95
 
4.9%
H 86
 
4.5%
Other values (17) 747
38.9%
Other Punctuation
ValueCountFrequency (%)
. 787
55.1%
, 620
43.4%
· 13
 
0.9%
& 4
 
0.3%
' 2
 
0.1%
/ 1
 
0.1%
? 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 112
83.0%
4 10
 
7.4%
0 5
 
3.7%
2 4
 
3.0%
5 3
 
2.2%
3 1
 
0.7%
Open Punctuation
ValueCountFrequency (%)
[ 24
80.0%
( 6
 
20.0%
Close Punctuation
ValueCountFrequency (%)
] 24
80.0%
) 6
 
20.0%
Math Symbol
ValueCountFrequency (%)
< 2
50.0%
> 2
50.0%
Space Separator
ValueCountFrequency (%)
3132
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%
Modifier Letter
ValueCountFrequency (%)
2
100.0%
Control
ValueCountFrequency (%)
 1
100.0%
Nonspacing Mark
ValueCountFrequency (%)
̈ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 58071
80.5%
Latin 7972
 
11.1%
Common 4800
 
6.7%
Han 1233
 
1.7%
Hiragana 32
 
< 0.1%
Katakana 24
 
< 0.1%
Inherited 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4243
 
7.3%
2901
 
5.0%
2123
 
3.7%
2121
 
3.7%
1723
 
3.0%
1672
 
2.9%
1405
 
2.4%
1390
 
2.4%
1286
 
2.2%
1268
 
2.2%
Other values (537) 37939
65.3%
Han
ValueCountFrequency (%)
108
 
8.8%
104
 
8.4%
103
 
8.4%
103
 
8.4%
101
 
8.2%
101
 
8.2%
32
 
2.6%
26
 
2.1%
23
 
1.9%
15
 
1.2%
Other values (281) 517
41.9%
Latin
ValueCountFrequency (%)
e 718
 
9.0%
a 645
 
8.1%
r 539
 
6.8%
n 535
 
6.7%
i 517
 
6.5%
o 430
 
5.4%
l 421
 
5.3%
t 408
 
5.1%
s 270
 
3.4%
h 246
 
3.1%
Other values (49) 3243
40.7%
Common
ValueCountFrequency (%)
3132
65.2%
. 787
 
16.4%
, 620
 
12.9%
1 112
 
2.3%
- 38
 
0.8%
[ 24
 
0.5%
] 24
 
0.5%
· 13
 
0.3%
4 10
 
0.2%
( 6
 
0.1%
Other values (13) 34
 
0.7%
Katakana
ValueCountFrequency (%)
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (12) 12
50.0%
Hiragana
ValueCountFrequency (%)
4
 
12.5%
3
 
9.4%
3
 
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
Other values (10) 10
31.2%
Inherited
ValueCountFrequency (%)
̈ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 58069
80.5%
ASCII 12748
 
17.7%
CJK 1228
 
1.7%
Hiragana 32
 
< 0.1%
Katakana 26
 
< 0.1%
None 22
 
< 0.1%
CJK Compat Ideographs 5
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Diacriticals 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4243
 
7.3%
2901
 
5.0%
2123
 
3.7%
2121
 
3.7%
1723
 
3.0%
1672
 
2.9%
1405
 
2.4%
1390
 
2.4%
1286
 
2.2%
1268
 
2.2%
Other values (535) 37937
65.3%
ASCII
ValueCountFrequency (%)
3132
24.6%
. 787
 
6.2%
e 718
 
5.6%
a 645
 
5.1%
, 620
 
4.9%
r 539
 
4.2%
n 535
 
4.2%
i 517
 
4.1%
o 430
 
3.4%
l 421
 
3.3%
Other values (63) 4404
34.5%
CJK
ValueCountFrequency (%)
108
 
8.8%
104
 
8.5%
103
 
8.4%
103
 
8.4%
101
 
8.2%
101
 
8.2%
32
 
2.6%
26
 
2.1%
23
 
1.9%
15
 
1.2%
Other values (276) 512
41.7%
None
ValueCountFrequency (%)
· 13
59.1%
á 3
 
13.6%
é 1
 
4.5%
í 1
 
4.5%
Ć 1
 
4.5%
ö 1
 
4.5%
ä 1
 
4.5%
ó 1
 
4.5%
Hiragana
ValueCountFrequency (%)
4
 
12.5%
3
 
9.4%
3
 
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
Other values (10) 10
31.2%
Katakana
ValueCountFrequency (%)
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (13) 13
50.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Diacriticals
ValueCountFrequency (%)
̈ 1
100.0%
Distinct1794
Distinct (%)18.0%
Missing14
Missing (%)0.1%
Memory size156.2 KiB
2023-12-13T07:54:38.175585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length73
Median length64
Mean length7.8872421
Min length2

Characters and Unicode

Total characters78762
Distinct characters909
Distinct categories11 ?
Distinct scripts6 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1028 ?
Unique (%)10.3%

Sample

1st row농림부
2nd row강원도농업기술원
3rd row농림축산식품부
4th row부민문화사
5th row[농업기반공사]
ValueCountFrequency (%)
농촌진흥청 2265
 
18.2%
한국농촌경제연구원 540
 
4.4%
先進文化社 344
 
2.8%
鄕文社 303
 
2.4%
韓國農村經濟硏究院 286
 
2.3%
農山漁村文化協會 203
 
1.6%
농림부 203
 
1.6%
국립원예특작과학원 187
 
1.5%
국립농업과학원 164
 
1.3%
농림축산식품부 155
 
1.2%
Other values (1814) 7761
62.5%
2023-12-13T07:54:38.626134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5014
 
6.4%
2978
 
3.8%
2539
 
3.2%
2476
 
3.1%
2407
 
3.1%
2386
 
3.0%
2346
 
3.0%
1867
 
2.4%
1329
 
1.7%
1222
 
1.6%
Other values (899) 54198
68.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 66630
84.6%
Lowercase Letter 6910
 
8.8%
Space Separator 2539
 
3.2%
Uppercase Letter 1824
 
2.3%
Other Punctuation 397
 
0.5%
Close Punctuation 175
 
0.2%
Open Punctuation 175
 
0.2%
Dash Punctuation 62
 
0.1%
Decimal Number 46
 
0.1%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5014
 
7.5%
2978
 
4.5%
2476
 
3.7%
2407
 
3.6%
2386
 
3.6%
2346
 
3.5%
1867
 
2.8%
1329
 
2.0%
1222
 
1.8%
1219
 
1.8%
Other values (824) 43386
65.1%
Lowercase Letter
ValueCountFrequency (%)
e 863
12.5%
i 679
9.8%
r 673
9.7%
s 599
 
8.7%
n 559
 
8.1%
a 478
 
6.9%
o 459
 
6.6%
t 411
 
5.9%
l 371
 
5.4%
c 243
 
3.5%
Other values (17) 1575
22.8%
Uppercase Letter
ValueCountFrequency (%)
A 209
 
11.5%
S 186
 
10.2%
C 163
 
8.9%
P 161
 
8.8%
R 112
 
6.1%
B 108
 
5.9%
I 106
 
5.8%
O 90
 
4.9%
N 84
 
4.6%
U 81
 
4.4%
Other values (14) 524
28.7%
Other Punctuation
ValueCountFrequency (%)
: 233
58.7%
. 63
 
15.9%
, 37
 
9.3%
& 36
 
9.1%
· 18
 
4.5%
' 4
 
1.0%
/ 4
 
1.0%
# 1
 
0.3%
? 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
4 13
28.3%
1 11
23.9%
2 10
21.7%
0 6
13.0%
8 3
 
6.5%
5 2
 
4.3%
3 1
 
2.2%
Close Punctuation
ValueCountFrequency (%)
] 120
68.6%
) 55
31.4%
Open Punctuation
ValueCountFrequency (%)
[ 120
68.6%
( 55
31.4%
Space Separator
ValueCountFrequency (%)
2539
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 62
100.0%
Math Symbol
ValueCountFrequency (%)
+ 3
100.0%
Control
ValueCountFrequency (%)
 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53130
67.5%
Han 13114
 
16.7%
Latin 8734
 
11.1%
Common 3398
 
4.3%
Katakana 347
 
0.4%
Hiragana 39
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5014
 
9.4%
2978
 
5.6%
2476
 
4.7%
2407
 
4.5%
2386
 
4.5%
2346
 
4.4%
1867
 
3.5%
1329
 
2.5%
1222
 
2.3%
1193
 
2.2%
Other values (440) 29912
56.3%
Han
ValueCountFrequency (%)
1219
 
9.3%
1135
 
8.7%
669
 
5.1%
644
 
4.9%
561
 
4.3%
388
 
3.0%
382
 
2.9%
368
 
2.8%
351
 
2.7%
342
 
2.6%
Other values (320) 7055
53.8%
Latin
ValueCountFrequency (%)
e 863
 
9.9%
i 679
 
7.8%
r 673
 
7.7%
s 599
 
6.9%
n 559
 
6.4%
a 478
 
5.5%
o 459
 
5.3%
t 411
 
4.7%
l 371
 
4.2%
c 243
 
2.8%
Other values (41) 3399
38.9%
Katakana
ValueCountFrequency (%)
32
 
9.2%
28
 
8.1%
26
 
7.5%
23
 
6.6%
23
 
6.6%
21
 
6.1%
21
 
6.1%
21
 
6.1%
21
 
6.1%
19
 
5.5%
Other values (31) 112
32.3%
Common
ValueCountFrequency (%)
2539
74.7%
: 233
 
6.9%
] 120
 
3.5%
[ 120
 
3.5%
. 63
 
1.9%
- 62
 
1.8%
( 55
 
1.6%
) 55
 
1.6%
, 37
 
1.1%
& 36
 
1.1%
Other values (14) 78
 
2.3%
Hiragana
ValueCountFrequency (%)
23
59.0%
2
 
5.1%
2
 
5.1%
2
 
5.1%
2
 
5.1%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
Other values (3) 3
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53130
67.5%
CJK 13090
 
16.6%
ASCII 12113
 
15.4%
Katakana 347
 
0.4%
Hiragana 39
 
< 0.1%
CJK Compat Ideographs 24
 
< 0.1%
None 19
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5014
 
9.4%
2978
 
5.6%
2476
 
4.7%
2407
 
4.5%
2386
 
4.5%
2346
 
4.4%
1867
 
3.5%
1329
 
2.5%
1222
 
2.3%
1193
 
2.2%
Other values (440) 29912
56.3%
ASCII
ValueCountFrequency (%)
2539
21.0%
e 863
 
7.1%
i 679
 
5.6%
r 673
 
5.6%
s 599
 
4.9%
n 559
 
4.6%
a 478
 
3.9%
o 459
 
3.8%
t 411
 
3.4%
l 371
 
3.1%
Other values (63) 4482
37.0%
CJK
ValueCountFrequency (%)
1219
 
9.3%
1135
 
8.7%
669
 
5.1%
644
 
4.9%
561
 
4.3%
388
 
3.0%
382
 
2.9%
368
 
2.8%
351
 
2.7%
342
 
2.6%
Other values (308) 7031
53.7%
Katakana
ValueCountFrequency (%)
32
 
9.2%
28
 
8.1%
26
 
7.5%
23
 
6.6%
23
 
6.6%
21
 
6.1%
21
 
6.1%
21
 
6.1%
21
 
6.1%
19
 
5.5%
Other values (31) 112
32.3%
Hiragana
ValueCountFrequency (%)
23
59.0%
2
 
5.1%
2
 
5.1%
2
 
5.1%
2
 
5.1%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
Other values (3) 3
 
7.7%
None
ValueCountFrequency (%)
· 18
94.7%
ä 1
 
5.3%
CJK Compat Ideographs
ValueCountFrequency (%)
5
20.8%
3
12.5%
3
12.5%
3
12.5%
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (2) 2
 
8.3%

Missing values

2023-12-13T07:54:36.257325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:54:36.341252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T07:54:36.441723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

도서명저자명출판사
378첨단유리온실용 알루미늄구조재의 규격표준화 및 적합소재 개발용역보고서농림부
1904(농업과학기술연구개발결과)영농활용자료강원도농업기술원강원도농업기술원
279농어업인 삶의 질 향상 및 농어촌 지역개발 2020년 시행계획농림축산식품부농어업인 삶의 질 향상 및 농어촌 지역개발 위원회농림축산식품부
11555(플로리스트의 첫 걸음)화훼장식기사한국화훼장식기사협회부민문화사
6936농산물 판매 홍보전략이수근 저[농업기반공사]
13853내 강아지 마음 상담소 : 강아지 마음에 대한 소소한 질문들강형욱혜다
16104(대전환 시대) 농정혁신의 길김동환학현사:YSW group
9911種子産業法에 의한 手數料 및 品種保護料농림수산식품부농림수산식품부
11651보존화 : 생화의 아름다움과 싱싱함을 그대로국립원예특작과학원농촌진흥청 국립원예특작과학원
2946강한농민 열린농업 : 일본 농민의 성공사례와 한국 농업의 진로가노오 요시카즈삼성경제연구소
도서명저자명출판사
6155국제화시대에 한국농업김종무成均館大學校出版部
7930(농업분야 관계자를 위한)농업경영과 생활보건이훈영HM연구소
9912農業技術大系 - 作物編. 1-7농문협農山漁村文化協會
5818UR 이후 농산물 무역정책의 방향한국농촌경제연구원한국농촌경제연구원
7375세계식량불안보고서국제연합식량농업기구국제연합식량농업기구(FAO) 한국협회
6724왜 최고가격 농산물인가. 1호-19호농촌진흥청농촌진흥청
4507작물별 시비처방기준농업과학기술원농업과학기술원
4540토양분류지침신영오한림저널사
1479품목별 경영평가.진단표(육계)농촌진흥청 편농촌진흥청
9095벼 GAP 생산가이드국립농업과학원농촌진흥청 국립농업과학원

Duplicate rows

Most frequently occurring

도서명저자명출판사# duplicates
768농축산물표준소득. 1977-2006농촌진흥청농촌진흥청34
335日本の食生活全集. 1-50농문협農山漁村文化協會27
620농림사업시행지침서1998-2008농림부농림부25
1259흙은 여자인가 남자인가? : 쉽게 풀어보는 흙의 과학과 관리및 시비기술이완주서원24
738농업전망한국농촌경제연구원농업관측본부한국농촌경제연구원23
770농협연감. 1984-2018농업협동조합중앙회농업협동조합중앙회21
765농촌진흥사업연보농촌진흥청농촌진흥청19
392畜産機械 및 施設박경규文運堂17
1117주요 농산물 유통실태. 2000-2017농수산물유통공사농수산물유통공사15
314家畜育種學오봉국한국방송대학교출판부14