Overview

Dataset statistics

Number of variables: 4
Number of observations: 10000
Missing cells: 8
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 390.6 KiB
Average record size in memory: 40.0 B
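
The two derived figures in this block can be checked from the raw counts. The sketch below assumes that the missing-cell percentage is missing cells divided by variables × observations, and that the average record size is total memory divided by the number of observations. The variable names are illustrative and are not taken from ydata-profiling.

```python
# Values copied from the "Dataset statistics" block.
n_variables = 4
n_observations = 10_000
missing_cells = 8
total_memory_kib = 390.6

# Share of missing cells across the whole table, in percent.
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Average bytes per row; 1 KiB = 1024 bytes.
avg_record_bytes = total_memory_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.2f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share is well below 0.1%, which is consistent with the report showing "< 0.1%".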

Variable types

Text: 4

Dataset

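Description: Korea National College of Agriculture and Fisheries (한국농수산대학) is a specialized university for the agriculture and fisheries industries of the Republic of Korea and an agency directly under the Ministry of Agriculture, Food and Rural Affairs. It holds a wide range of domestic and international materials on agriculture, fisheries, and livestock, and it publishes the list and details of its specialized books to help fulfill the public's right to know.
Author: 한국농수산대학 (Korea National College of Agriculture and Fisheries)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20181018000000000966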

Alerts

등록번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-11 03:38:40.143866
Analysis finished: 2023-12-11 03:38:43.169388
Duration: 3.03 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
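
A report of this kind is typically produced with a few lines of Python. The sketch below is a minimal example, not the exact invocation used for this report. The function name, the file paths, and the choice to read every column as text are assumptions; the report does not state how the data was loaded or which configuration was applied.

```python
def build_report(csv_path: str, html_path: str) -> None:
    """Profile a CSV file and write the result as an HTML report."""
    # Imported lazily so the sketch can be loaded without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: read every column as text, matching the four Text variables.
    df = pd.read_csv(csv_path, dtype=str)

    profile = ProfileReport(df, title="Profiling Report")
    profile.to_file(html_path)
```

Calling `build_report("books.csv", "report.html")` with a hypothetical input file would write the HTML report to the given path.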

Variables

등록번호
Text

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8
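
The length statistics are summary values over the character length of each entry. The sketch below computes them for the five sample registration numbers shown in this report, using only the standard library. It illustrates the calculation and is not ydata-profiling's own code.

```python
from statistics import mean, median

# The five sample registration numbers shown in this report.
sample_ids = ["EM009707", "EM007493", "EM013864", "EM001651", "EM010540"]

# Character length of every value.
lengths = [len(value) for value in sample_ids]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(length_stats)
```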

Characters and Unicode

Total characters: 80000
Distinct characters: 12
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
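
As a rough illustration of these character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for the five sample registration numbers shown in this report. Script and block names are not available in the standard library, so only categories are covered here.

```python
import unicodedata
from collections import Counter

# The five sample registration numbers shown in this report.
sample_ids = ["EM009707", "EM007493", "EM013864", "EM001651", "EM010540"]

# Count characters by Unicode general category:
# "Lu" is Uppercase Letter, "Nd" is Decimal Number.
category_counts = Counter(
    unicodedata.category(ch) for value in sample_ids for ch in value
)

for category, count in category_counts.most_common():
    print(category, count)
```

Each identifier contributes two uppercase letters and six digits, which matches the 25% / 75% split reported for the full column.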

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: EM009707
2nd row: EM007493
3rd row: EM013864
4th row: EM001651
5th row: EM010540
Value | Count | Frequency (%)
em009707 | 1 | < 0.1%
em010873 | 1 | < 0.1%
em007962 | 1 | < 0.1%
em001155 | 1 | < 0.1%
em009011 | 1 | < 0.1%
em012066 | 1 | < 0.1%
em013937 | 1 | < 0.1%
em004436 | 1 | < 0.1%
em004647 | 1 | < 0.1%
em005436 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%
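
The values in this table are lowercased, which suggests it counts words rather than raw cell values. The sketch below reproduces that kind of count under the assumption that words are obtained by lowercasing each entry and splitting on whitespace. The actual tokenization used by ydata-profiling may differ.

```python
from collections import Counter

# The five sample registration numbers shown in this report.
sample_ids = ["EM009707", "EM007493", "EM013864", "EM001651", "EM010540"]

# Assumption: lowercase each value and split on whitespace to get words.
word_counts = Counter(
    word for value in sample_ids for word in value.lower().split()
)

total_words = sum(word_counts.values())
for word, count in word_counts.most_common(3):
    print(word, count, f"{100 * count / total_words:.1f}%")
```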

Most occurring characters

ValueCountFrequency (%)
0 21529
26.9%
E 10000
12.5%
M 10000
12.5%
1 7344
 
9.2%
2 4459
 
5.6%
3 4377
 
5.5%
4 3767
 
4.7%
7 3763
 
4.7%
5 3752
 
4.7%
6 3679
 
4.6%
Other values (2) 7330
 
9.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 60000
75.0%
Uppercase Letter 20000
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 21529
35.9%
1 7344
 
12.2%
2 4459
 
7.4%
3 4377
 
7.3%
4 3767
 
6.3%
7 3763
 
6.3%
5 3752
 
6.3%
6 3679
 
6.1%
8 3671
 
6.1%
9 3659
 
6.1%
Uppercase Letter
ValueCountFrequency (%)
E 10000
50.0%
M 10000
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 60000
75.0%
Latin 20000
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 21529
35.9%
1 7344
 
12.2%
2 4459
 
7.4%
3 4377
 
7.3%
4 3767
 
6.3%
7 3763
 
6.3%
5 3752
 
6.3%
6 3679
 
6.1%
8 3671
 
6.1%
9 3659
 
6.1%
Latin
ValueCountFrequency (%)
E 10000
50.0%
M 10000
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 80000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 21529
26.9%
E 10000
12.5%
M 10000
12.5%
1 7344
 
9.2%
2 4459
 
5.6%
3 4377
 
5.5%
4 3767
 
4.7%
7 3763
 
4.7%
5 3752
 
4.7%
6 3679
 
4.6%
Other values (2) 7330
 
9.2%
도서명
Text

Distinct: 6642
Distinct (%): 66.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 191
Median length: 130
Mean length: 20.2071
Min length: 2

Characters and Unicode

Total characters: 202071
Distinct characters: 2091
Distinct categories: 15
Distinct scripts: 6
Distinct blocks: 13

Unique

Unique: 5378
Unique (%): 53.8%
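
The report distinguishes "Distinct" from "Unique". The sketch below assumes the usual reading: distinct counts how many different values occur, while unique counts the values that occur exactly once. The list of titles is a hypothetical sample built from titles shown in this report, with one of them repeated on purpose.

```python
from collections import Counter

# Hypothetical sample: "蠶種學" is repeated to stand in for a duplicated title.
titles = ["농촌지표. 2008", "蠶種學", "신 조경시공학", "蠶種學"]

counts = Counter(titles)

# Distinct: number of different values.
n_distinct = len(counts)

# Unique: number of values that appear exactly once.
n_unique = sum(1 for c in counts.values() if c == 1)

print(n_distinct, n_unique)
```

Under this reading the unique count can never exceed the distinct count, which agrees with the figures reported for this variable.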

Sample

1st row: 농촌지표. 2008
2nd row: 신개발 농기계 : 논 농사편
3rd row: 신 조경시공학
4th row: 蠶種學
5th row: 양봉 사계절 관리법
ValueCountFrequency (%)
3225
 
8.1%
525
 
1.3%
위한 343
 
0.9%
연구 244
 
0.6%
of 201
 
0.5%
관한 179
 
0.4%
농산물 167
 
0.4%
방안 163
 
0.4%
report 128
 
0.3%
보고서 123
 
0.3%
Other values (13547) 34511
86.7%

Most occurring characters

ValueCountFrequency (%)
30094
 
14.9%
3706
 
1.8%
2908
 
1.4%
0 2786
 
1.4%
: 2518
 
1.2%
2493
 
1.2%
2 2137
 
1.1%
2076
 
1.0%
1 2064
 
1.0%
e 2023
 
1.0%
Other values (2081) 149266
73.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 127091
62.9%
Space Separator 30100
 
14.9%
Lowercase Letter 18052
 
8.9%
Decimal Number 10351
 
5.1%
Other Punctuation 6088
 
3.0%
Uppercase Letter 4262
 
2.1%
Open Punctuation 2035
 
1.0%
Close Punctuation 2033
 
1.0%
Dash Punctuation 1195
 
0.6%
Math Symbol 688
 
0.3%
Other values (5) 176
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3706
 
2.9%
2908
 
2.3%
2493
 
2.0%
2076
 
1.6%
1961
 
1.5%
1930
 
1.5%
1922
 
1.5%
1654
 
1.3%
1606
 
1.3%
1433
 
1.1%
Other values (1960) 105402
82.9%
Uppercase Letter
ValueCountFrequency (%)
A 506
 
11.9%
R 417
 
9.8%
I 346
 
8.1%
E 298
 
7.0%
T 267
 
6.3%
O 256
 
6.0%
S 228
 
5.3%
C 212
 
5.0%
N 211
 
5.0%
P 205
 
4.8%
Other values (22) 1316
30.9%
Lowercase Letter
ValueCountFrequency (%)
e 2023
11.2%
o 1675
 
9.3%
r 1553
 
8.6%
n 1520
 
8.4%
a 1435
 
7.9%
i 1346
 
7.5%
t 1304
 
7.2%
s 1074
 
5.9%
l 884
 
4.9%
c 781
 
4.3%
Other values (16) 4457
24.7%
Other Punctuation
ValueCountFrequency (%)
: 2518
41.4%
. 1767
29.0%
· 714
 
11.7%
, 457
 
7.5%
? 191
 
3.1%
/ 136
 
2.2%
' 125
 
2.1%
& 82
 
1.3%
! 47
 
0.8%
; 14
 
0.2%
Other values (9) 37
 
0.6%
Decimal Number
ValueCountFrequency (%)
0 2786
26.9%
2 2137
20.6%
1 2064
19.9%
9 1081
 
10.4%
5 423
 
4.1%
3 380
 
3.7%
7 379
 
3.7%
6 373
 
3.6%
8 363
 
3.5%
4 358
 
3.5%
Other values (4) 7
 
0.1%
Letter Number
ValueCountFrequency (%)
64
38.1%
59
35.1%
22
 
13.1%
15
 
8.9%
6
 
3.6%
1
 
0.6%
1
 
0.6%
Math Symbol
ValueCountFrequency (%)
= 604
87.8%
~ 68
 
9.9%
12
 
1.7%
+ 3
 
0.4%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1930
94.8%
[ 92
 
4.5%
11
 
0.5%
2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1928
94.8%
] 92
 
4.5%
11
 
0.5%
2
 
0.1%
Other Symbol
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Space Separator
ValueCountFrequency (%)
30094
> 99.9%
  6
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 1194
99.9%
1
 
0.1%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 99575
49.3%
Common 52498
26.0%
Han 23193
 
11.5%
Latin 22482
 
11.1%
Hiragana 2286
 
1.1%
Katakana 2037
 
1.0%

Most frequent character per script

Han
ValueCountFrequency (%)
943
 
4.1%
930
 
4.0%
618
 
2.7%
441
 
1.9%
422
 
1.8%
365
 
1.6%
326
 
1.4%
308
 
1.3%
283
 
1.2%
281
 
1.2%
Other values (1009) 18276
78.8%
Hangul
ValueCountFrequency (%)
3706
 
3.7%
2908
 
2.9%
2493
 
2.5%
2076
 
2.1%
1961
 
2.0%
1930
 
1.9%
1922
 
1.9%
1654
 
1.7%
1606
 
1.6%
1433
 
1.4%
Other values (800) 77886
78.2%
Katakana
ValueCountFrequency (%)
153
 
7.5%
81
 
4.0%
80
 
3.9%
79
 
3.9%
78
 
3.8%
71
 
3.5%
70
 
3.4%
67
 
3.3%
65
 
3.2%
63
 
3.1%
Other values (66) 1230
60.4%
Latin
ValueCountFrequency (%)
e 2023
 
9.0%
o 1675
 
7.5%
r 1553
 
6.9%
n 1520
 
6.8%
a 1435
 
6.4%
i 1346
 
6.0%
t 1304
 
5.8%
s 1074
 
4.8%
l 884
 
3.9%
c 781
 
3.5%
Other values (55) 8887
39.5%
Hiragana
ValueCountFrequency (%)
630
27.6%
262
 
11.5%
98
 
4.3%
92
 
4.0%
81
 
3.5%
71
 
3.1%
71
 
3.1%
61
 
2.7%
60
 
2.6%
59
 
2.6%
Other values (55) 801
35.0%
Common
ValueCountFrequency (%)
30094
57.3%
0 2786
 
5.3%
: 2518
 
4.8%
2 2137
 
4.1%
1 2064
 
3.9%
( 1930
 
3.7%
) 1928
 
3.7%
. 1767
 
3.4%
- 1194
 
2.3%
9 1081
 
2.1%
Other values (46) 4999
 
9.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 99497
49.2%
ASCII 74014
36.6%
CJK 22786
 
11.3%
Hiragana 2286
 
1.1%
Katakana 2037
 
1.0%
None 777
 
0.4%
CJK Compat Ideographs 407
 
0.2%
Number Forms 168
 
0.1%
Compat Jamo 78
 
< 0.1%
Math Operators 12
 
< 0.1%
Other values (3) 9
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30094
40.7%
0 2786
 
3.8%
: 2518
 
3.4%
2 2137
 
2.9%
1 2064
 
2.8%
e 2023
 
2.7%
( 1930
 
2.6%
) 1928
 
2.6%
. 1767
 
2.4%
o 1675
 
2.3%
Other values (74) 25092
33.9%
Hangul
ValueCountFrequency (%)
3706
 
3.7%
2908
 
2.9%
2493
 
2.5%
2076
 
2.1%
1961
 
2.0%
1930
 
1.9%
1922
 
1.9%
1654
 
1.7%
1606
 
1.6%
1433
 
1.4%
Other values (799) 77808
78.2%
CJK
ValueCountFrequency (%)
943
 
4.1%
930
 
4.1%
618
 
2.7%
441
 
1.9%
422
 
1.9%
365
 
1.6%
326
 
1.4%
308
 
1.4%
283
 
1.2%
281
 
1.2%
Other values (966) 17869
78.4%
None
ValueCountFrequency (%)
· 714
91.9%
11
 
1.4%
11
 
1.4%
7
 
0.9%
  6
 
0.8%
4
 
0.5%
3
 
0.4%
2
 
0.3%
2
 
0.3%
2
 
0.3%
Other values (12) 15
 
1.9%
Hiragana
ValueCountFrequency (%)
630
27.6%
262
 
11.5%
98
 
4.3%
92
 
4.0%
81
 
3.5%
71
 
3.1%
71
 
3.1%
61
 
2.7%
60
 
2.6%
59
 
2.6%
Other values (55) 801
35.0%
Katakana
ValueCountFrequency (%)
153
 
7.5%
81
 
4.0%
80
 
3.9%
79
 
3.9%
78
 
3.8%
71
 
3.5%
70
 
3.4%
67
 
3.3%
65
 
3.2%
63
 
3.1%
Other values (66) 1230
60.4%
Compat Jamo
ValueCountFrequency (%)
78
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
64
15.7%
59
14.5%
39
9.6%
37
9.1%
30
 
7.4%
27
 
6.6%
26
 
6.4%
23
 
5.7%
12
 
2.9%
6
 
1.5%
Other values (33) 84
20.6%
Number Forms
ValueCountFrequency (%)
64
38.1%
59
35.1%
22
 
13.1%
15
 
8.9%
6
 
3.6%
1
 
0.6%
1
 
0.6%
Math Operators
ValueCountFrequency (%)
12
100.0%
Geometric Shapes
ValueCountFrequency (%)
2
66.7%
1
33.3%
Punctuation
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%
저자명
Text

Distinct: 2721
Distinct (%): 27.2%
Missing: 8
Missing (%): 0.1%
Memory size: 156.2 KiB

Length

Max length: 49
Median length: 46
Mean length: 6.7607086
Min length: 2

Characters and Unicode

Total characters: 67553
Distinct characters: 935
Distinct categories: 10
Distinct scripts: 6
Distinct blocks: 8

Unique

Unique: 1779
Unique (%): 17.8%

Sample

1st row: 농촌자원개발연구소
2nd row: 농촌진흥청
3rd row: 강태호 외
4th row: 정원복
5th row: 조도행
Value | Count | Frequency (%)
농촌진흥청 | 1036 | 8.1%
한국농촌경제연구원 | 865 | 6.8%
농림부 | 172 | 1.3%
국립원예특작과학원 | 161 | 1.3%
국립농업과학원 | 155 | 1.2%
위원회 | 131 | 1.0%
농업과학기술원 | 124 | 1.0%
대학교 | 123 | 1.0%
도서 | 123 | 1.0%
1종 | 123 | 1.0%
Other values (3192) | 9754 | 76.4%

Most occurring characters

ValueCountFrequency (%)
4323
 
6.4%
2853
 
4.2%
2801
 
4.1%
2263
 
3.3%
2059
 
3.0%
1851
 
2.7%
1829
 
2.7%
1486
 
2.2%
1374
 
2.0%
1359
 
2.0%
Other values (925) 45355
67.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60799
90.0%
Space Separator 2801
 
4.1%
Lowercase Letter 1850
 
2.7%
Other Punctuation 1081
 
1.6%
Uppercase Letter 759
 
1.1%
Decimal Number 144
 
0.2%
Close Punctuation 39
 
0.1%
Open Punctuation 39
 
0.1%
Math Symbol 21
 
< 0.1%
Dash Punctuation 20
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4323
 
7.1%
2853
 
4.7%
2263
 
3.7%
2059
 
3.4%
1851
 
3.0%
1829
 
3.0%
1486
 
2.4%
1374
 
2.3%
1359
 
2.2%
1334
 
2.2%
Other values (858) 40068
65.9%
Lowercase Letter
ValueCountFrequency (%)
t 224
12.1%
e 204
11.0%
i 181
9.8%
a 176
9.5%
n 166
9.0%
r 136
7.4%
o 133
7.2%
l 122
6.6%
c 106
 
5.7%
u 102
 
5.5%
Other values (14) 300
16.2%
Uppercase Letter
ValueCountFrequency (%)
A 88
11.6%
N 88
11.6%
I 84
11.1%
T 57
 
7.5%
R 55
 
7.2%
S 50
 
6.6%
E 41
 
5.4%
L 40
 
5.3%
H 38
 
5.0%
O 34
 
4.5%
Other values (14) 184
24.2%
Other Punctuation
ValueCountFrequency (%)
. 851
78.7%
, 188
 
17.4%
? 26
 
2.4%
· 12
 
1.1%
& 4
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 128
88.9%
4 10
 
6.9%
5 2
 
1.4%
0 2
 
1.4%
2 2
 
1.4%
Math Symbol
ValueCountFrequency (%)
| 17
81.0%
< 2
 
9.5%
> 2
 
9.5%
Close Punctuation
ValueCountFrequency (%)
] 32
82.1%
) 7
 
17.9%
Open Punctuation
ValueCountFrequency (%)
[ 32
82.1%
( 7
 
17.9%
Space Separator
ValueCountFrequency (%)
2801
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59144
87.6%
Common 4145
 
6.1%
Latin 2609
 
3.9%
Han 1581
 
2.3%
Katakana 47
 
0.1%
Hiragana 27
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4323
 
7.3%
2853
 
4.8%
2263
 
3.8%
2059
 
3.5%
1851
 
3.1%
1829
 
3.1%
1486
 
2.5%
1374
 
2.3%
1359
 
2.3%
1334
 
2.3%
Other values (526) 38413
64.9%
Han
ValueCountFrequency (%)
139
 
8.8%
135
 
8.5%
131
 
8.3%
131
 
8.3%
129
 
8.2%
129
 
8.2%
48
 
3.0%
37
 
2.3%
32
 
2.0%
21
 
1.3%
Other values (269) 649
41.0%
Latin
ValueCountFrequency (%)
t 224
 
8.6%
e 204
 
7.8%
i 181
 
6.9%
a 176
 
6.7%
n 166
 
6.4%
r 136
 
5.2%
o 133
 
5.1%
l 122
 
4.7%
c 106
 
4.1%
u 102
 
3.9%
Other values (38) 1059
40.6%
Katakana
ValueCountFrequency (%)
4
 
8.5%
3
 
6.4%
3
 
6.4%
3
 
6.4%
3
 
6.4%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
1
 
2.1%
Other values (22) 22
46.8%
Hiragana
ValueCountFrequency (%)
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Other values (11) 11
40.7%
Common
ValueCountFrequency (%)
2801
67.6%
. 851
 
20.5%
, 188
 
4.5%
1 128
 
3.1%
] 32
 
0.8%
[ 32
 
0.8%
? 26
 
0.6%
- 20
 
0.5%
| 17
 
0.4%
· 12
 
0.3%
Other values (9) 38
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59143
87.6%
ASCII 6742
 
10.0%
CJK 1572
 
2.3%
Katakana 47
 
0.1%
Hiragana 27
 
< 0.1%
None 12
 
< 0.1%
CJK Compat Ideographs 9
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4323
 
7.3%
2853
 
4.8%
2263
 
3.8%
2059
 
3.5%
1851
 
3.1%
1829
 
3.1%
1486
 
2.5%
1374
 
2.3%
1359
 
2.3%
1334
 
2.3%
Other values (525) 38412
64.9%
ASCII
ValueCountFrequency (%)
2801
41.5%
. 851
 
12.6%
t 224
 
3.3%
e 204
 
3.0%
, 188
 
2.8%
i 181
 
2.7%
a 176
 
2.6%
n 166
 
2.5%
r 136
 
2.0%
o 133
 
2.0%
Other values (56) 1682
24.9%
CJK
ValueCountFrequency (%)
139
 
8.8%
135
 
8.6%
131
 
8.3%
131
 
8.3%
129
 
8.2%
129
 
8.2%
48
 
3.1%
37
 
2.4%
32
 
2.0%
21
 
1.3%
Other values (263) 640
40.7%
None
ValueCountFrequency (%)
· 12
100.0%
Katakana
ValueCountFrequency (%)
4
 
8.5%
3
 
6.4%
3
 
6.4%
3
 
6.4%
3
 
6.4%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
1
 
2.1%
Other values (22) 22
46.8%
CJK Compat Ideographs
ValueCountFrequency (%)
3
33.3%
2
22.2%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
Hiragana
ValueCountFrequency (%)
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Other values (11) 11
40.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
출판사
Text

Distinct: 1531
Distinct (%): 15.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 71
Median length: 57
Mean length: 7.2347
Min length: 1

Characters and Unicode

Total characters: 72347
Distinct characters: 890
Distinct categories: 10
Distinct scripts: 6
Distinct blocks: 7

Unique

Unique: 808
Unique (%): 8.1%

Sample

1st row: 농촌진흥청 농업과학기술원
2nd row: 농촌진흥청
3rd row: 문운당
4th row: 東亞大學校出版部
5th row: 오성출판사
Value | Count | Frequency (%)
농촌진흥청 | 2301 | 19.4%
한국농촌경제연구원 | 549 | 4.6%
先進文化社 | 395 | 3.3%
韓國農村經濟硏究院 | 335 | 2.8%
鄕文社 | 331 | 2.8%
농림부 | 253 | 2.1%
農山漁村文化協會 | 230 | 1.9%
농업과학기술원 | 166 | 1.4%
농민신문사 | 148 | 1.2%
恒星社厚生閣 | 133 | 1.1%
Other values (1482) | 7019 | 59.2%

Most occurring characters

ValueCountFrequency (%)
5015
 
6.9%
3028
 
4.2%
2514
 
3.5%
2442
 
3.4%
2386
 
3.3%
2171
 
3.0%
1980
 
2.7%
1652
 
2.3%
1398
 
1.9%
1294
 
1.8%
Other values (880) 48467
67.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67248
93.0%
Space Separator 1980
 
2.7%
Lowercase Letter 1751
 
2.4%
Uppercase Letter 651
 
0.9%
Other Punctuation 230
 
0.3%
Close Punctuation 200
 
0.3%
Open Punctuation 200
 
0.3%
Dash Punctuation 55
 
0.1%
Decimal Number 31
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5015
 
7.5%
3028
 
4.5%
2514
 
3.7%
2442
 
3.6%
2386
 
3.5%
2171
 
3.2%
1652
 
2.5%
1398
 
2.1%
1294
 
1.9%
1273
 
1.9%
Other values (813) 44075
65.5%
Lowercase Letter
ValueCountFrequency (%)
e 197
11.3%
t 189
10.8%
o 162
9.3%
n 160
9.1%
i 159
9.1%
a 149
8.5%
r 117
 
6.7%
l 110
 
6.3%
u 82
 
4.7%
s 82
 
4.7%
Other values (14) 344
19.6%
Uppercase Letter
ValueCountFrequency (%)
A 99
15.2%
R 76
11.7%
D 54
 
8.3%
N 49
 
7.5%
I 48
 
7.4%
O 45
 
6.9%
S 44
 
6.8%
H 31
 
4.8%
C 26
 
4.0%
E 24
 
3.7%
Other values (13) 155
23.8%
Other Punctuation
ValueCountFrequency (%)
: 97
42.2%
? 42
18.3%
, 34
 
14.8%
. 32
 
13.9%
& 12
 
5.2%
· 12
 
5.2%
# 1
 
0.4%
Decimal Number
ValueCountFrequency (%)
4 11
35.5%
2 8
25.8%
1 6
19.4%
0 4
 
12.9%
8 1
 
3.2%
9 1
 
3.2%
Close Punctuation
ValueCountFrequency (%)
] 149
74.5%
) 51
 
25.5%
Open Punctuation
ValueCountFrequency (%)
[ 149
74.5%
( 51
 
25.5%
Space Separator
ValueCountFrequency (%)
1980
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 55
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51720
71.5%
Han 15096
 
20.9%
Common 2697
 
3.7%
Latin 2402
 
3.3%
Katakana 375
 
0.5%
Hiragana 57
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5015
 
9.7%
3028
 
5.9%
2514
 
4.9%
2442
 
4.7%
2386
 
4.6%
2171
 
4.2%
1652
 
3.2%
1273
 
2.5%
1245
 
2.4%
1206
 
2.3%
Other values (425) 28788
55.7%
Han
ValueCountFrequency (%)
1398
 
9.3%
1294
 
8.6%
785
 
5.2%
750
 
5.0%
658
 
4.4%
461
 
3.1%
448
 
3.0%
422
 
2.8%
404
 
2.7%
389
 
2.6%
Other values (315) 8087
53.6%
Latin
ValueCountFrequency (%)
e 197
 
8.2%
t 189
 
7.9%
o 162
 
6.7%
n 160
 
6.7%
i 159
 
6.6%
a 149
 
6.2%
r 117
 
4.9%
l 110
 
4.6%
A 99
 
4.1%
u 82
 
3.4%
Other values (37) 978
40.7%
Katakana
ValueCountFrequency (%)
37
 
9.9%
28
 
7.5%
26
 
6.9%
23
 
6.1%
22
 
5.9%
21
 
5.6%
19
 
5.1%
19
 
5.1%
19
 
5.1%
17
 
4.5%
Other values (35) 144
38.4%
Common
ValueCountFrequency (%)
1980
73.4%
] 149
 
5.5%
[ 149
 
5.5%
: 97
 
3.6%
- 55
 
2.0%
( 51
 
1.9%
) 51
 
1.9%
? 42
 
1.6%
, 34
 
1.3%
. 32
 
1.2%
Other values (10) 57
 
2.1%
Hiragana
ValueCountFrequency (%)
33
57.9%
3
 
5.3%
3
 
5.3%
2
 
3.5%
2
 
3.5%
2
 
3.5%
1
 
1.8%
1
 
1.8%
1
 
1.8%
1
 
1.8%
Other values (8) 8
 
14.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51720
71.5%
CJK 15066
 
20.8%
ASCII 5087
 
7.0%
Katakana 375
 
0.5%
Hiragana 57
 
0.1%
CJK Compat Ideographs 30
 
< 0.1%
None 12
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5015
 
9.7%
3028
 
5.9%
2514
 
4.9%
2442
 
4.7%
2386
 
4.6%
2171
 
4.2%
1652
 
3.2%
1273
 
2.5%
1245
 
2.4%
1206
 
2.3%
Other values (425) 28788
55.7%
ASCII
ValueCountFrequency (%)
1980
38.9%
e 197
 
3.9%
t 189
 
3.7%
o 162
 
3.2%
n 160
 
3.1%
i 159
 
3.1%
] 149
 
2.9%
[ 149
 
2.9%
a 149
 
2.9%
r 117
 
2.3%
Other values (56) 1676
32.9%
CJK
ValueCountFrequency (%)
1398
 
9.3%
1294
 
8.6%
785
 
5.2%
750
 
5.0%
658
 
4.4%
461
 
3.1%
448
 
3.0%
422
 
2.8%
404
 
2.7%
389
 
2.6%
Other values (301) 8057
53.5%
Katakana
ValueCountFrequency (%)
37
 
9.9%
28
 
7.5%
26
 
6.9%
23
 
6.1%
22
 
5.9%
21
 
5.6%
19
 
5.1%
19
 
5.1%
19
 
5.1%
17
 
4.5%
Other values (35) 144
38.4%
Hiragana
ValueCountFrequency (%)
33
57.9%
3
 
5.3%
3
 
5.3%
2
 
3.5%
2
 
3.5%
2
 
3.5%
1
 
1.8%
1
 
1.8%
1
 
1.8%
1
 
1.8%
Other values (8) 8
 
14.0%
None
ValueCountFrequency (%)
· 12
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
8
26.7%
4
13.3%
3
 
10.0%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (4) 4
13.3%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
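
The quantities behind both plots can be sketched in plain Python: a per-column count of missing entries, and a row-by-column grid marking which cells are present. The first two rows below come from the sample table in this report; the third row is hypothetical and is added only to show a missing 저자명 value.

```python
columns = ["등록번호", "도서명", "저자명", "출판사"]

rows = [
    ("EM009707", "농촌지표. 2008", "농촌자원개발연구소", "농촌진흥청 농업과학기술원"),
    ("EM007493", "신개발 농기계 : 논 농사편", "농촌진흥청", "농촌진흥청"),
    # Hypothetical row with a missing author, for illustration only.
    ("EM000000", "(hypothetical title)", None, "(hypothetical publisher)"),
]

# Nullity by column: how many entries are missing in each column.
missing_by_column = {
    name: sum(row[i] is None for row in rows) for i, name in enumerate(columns)
}

# Nullity matrix: True where a cell is present, False where it is missing.
nullity_matrix = [[value is not None for value in row] for row in rows]

print(missing_by_column)
for presence in nullity_matrix:
    print(presence)
```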

Sample

Row index | 등록번호 | 도서명 | 저자명 | 출판사
9706 | EM009707 | 농촌지표. 2008 | 농촌자원개발연구소 | 농촌진흥청 농업과학기술원
7492 | EM007493 | 신개발 농기계 : 논 농사편 | 농촌진흥청 | 농촌진흥청
13863 | EM013864 | 신 조경시공학 | 강태호 외 | 문운당
1650 | EM001651 | 蠶種學 | 정원복 | 東亞大學校出版部
10539 | EM010540 | 양봉 사계절 관리법 | 조도행 | 오성출판사
2044 | EM002045 | 畜産學 II | 정천용 | 한국방송대학교출판부
4076 | EM004077 | 1996年度 農村振興試驗硏究事業年報 | 농촌진흥청 | 農村振興廳
7806 | EM007807 | 식물검역연보. 1999-2014 | 국립식물검역소 | 농림축산검역본부 식물검역부
4568 | EM004569 | 농업과학기술 연구개발 1997년도 시험연구보고서 : 맥류편 | 작물시험장 | 농촌진흥청 작물시험장
11748 | EM011749 | (2010) 농림어업총조사보고서 . 1-2 = 2010 agriculture, forestry & fishery census report : forestry : 임업 | 통계청 | 통계청

Row index | 등록번호 | 도서명 | 저자명 | 출판사
11933 | EM011934 | 주요 농산물 유통실태. 2000-2014 | 농수산물유통공사 | 농수산물유통공사
4867 | EM004868 | 農業科學論文集 終刊 特輯號 | 농촌진흥청 | 農村振興廳
4094 | EM004095 | 식물유전자원 국제기술회의 결과보고서 | 농업과학기술원 | 농업과학기술원 유전자원과
8159 | EM008160 | 농산물 품질관리사 : 우리농산물지킴이. I | 부민문화사 자연과학부 | 부민문화사
5173 | EM005174 | 녹차산업의 발전 방향과 정책과제 | 한국농촌경제연구원 | 한국농촌경제연구원
13874 | EM013875 | 은수저 6 | 아라카와 히로무 | 학산문화사
7682 | EM007683 | 식량농업식물유전자원 국제조약 = INTERNATIONAL TREATY ON PLANT GENETIC RESOURCES FOR FOOD AND AGRICULTURE | 농업생명공학연구원 | 농촌진흥청 농업생명공학연구원
471 | EM000472 | 農業動力學 | 정창주 | 文運堂
3940 | EM003941 | 사회주의 농업의 변모 | 한국농촌경제연구원 | 韓國農村經濟硏究院
12619 | EM012620 | 개도국 농촌개발을 위한 협력모델과 전략수립 방안 | 한국농촌경제연구원 | 한국농촌경제연구원