Overview

Dataset statistics

Number of variables1
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1281
Duplicate rows (%)12.8%
Total size in memory156.2 KiB
Average record size in memory16.0 B

Variable types

Text1

Dataset

Description한국농수산대학은 대한민국 농수산업의 특성화 대학으로서 농림축산식품부 소속 직속기관이며 국내외의 다양한 농수축산업 자료를 보유하고 있는 바 이에 대한 전문도서 목록과 정보를 공개하여 국민의 알 권리 충족에 기여하고자 함.
Author한국농수산대학
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20181018000000000966

Alerts

Dataset has 1281 (12.8%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-11 03:38:47.482103
Analysis finished2023-12-11 03:38:48.707527
Duration1.23 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct6705
Distinct (%)67.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T12:38:48.906598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length229
Median length153
Mean length49.0443
Min length11

Characters and Unicode

Total characters490443
Distinct characters2288
Distinct categories16 ?
Distinct scripts6 ?
Distinct blocks13 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5424 ?
Unique (%)54.2%

Sample

1st row조림.임업경영 부민문화사 문제은행 부민문화사 ,,
2nd row그리는, 조경 : 드로잉으로 보는 조경 디자인 역사,이명준,한숲
3rd row家蠶解剖生理學.蠶病學 문재유 鄕文社 ,,
4th row'96 농업과학기술 연구개발결과 농촌지도사업 활용자료 Ⅲ : 축산 생활개선 농업경영 농촌진흥청 농촌진흥청 ,,
5th row쌀 협상 이후의 농지이용구조 변화 전망과 대책 한국농촌경제연구원 한국농촌경제연구원 ,,
ValueCountFrequency (%)
12897
 
17.5%
농촌진흥청 3292
 
4.5%
한국농촌경제연구원 1387
 
1.9%
562
 
0.8%
농림부 413
 
0.6%
先進文化社 407
 
0.6%
韓國農村經濟硏究院 346
 
0.5%
鄕文社 341
 
0.5%
연구 327
 
0.4%
위한 322
 
0.4%
Other values (17522) 53586
72.5%
2023-12-11T12:38:49.500945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
125054
25.5%
35331
 
7.2%
, 20067
 
4.1%
13318
 
2.7%
6047
 
1.2%
5830
 
1.2%
5234
 
1.1%
4837
 
1.0%
4636
 
0.9%
4618
 
0.9%
Other values (2278) 265471
54.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 258239
52.7%
Control 125055
25.5%
Space Separator 35339
 
7.2%
Other Punctuation 26888
 
5.5%
Lowercase Letter 21467
 
4.4%
Decimal Number 10881
 
2.2%
Uppercase Letter 5793
 
1.2%
Open Punctuation 2317
 
0.5%
Close Punctuation 2316
 
0.5%
Dash Punctuation 1279
 
0.3%
Other values (6) 869
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13318
 
5.2%
6047
 
2.3%
5830
 
2.3%
5234
 
2.0%
4837
 
1.9%
4636
 
1.8%
4618
 
1.8%
4363
 
1.7%
4270
 
1.7%
4091
 
1.6%
Other values (2154) 200995
77.8%
Uppercase Letter
ValueCountFrequency (%)
A 691
 
11.9%
R 537
 
9.3%
I 495
 
8.5%
T 365
 
6.3%
E 359
 
6.2%
O 354
 
6.1%
N 342
 
5.9%
S 337
 
5.8%
C 277
 
4.8%
F 249
 
4.3%
Other values (22) 1787
30.8%
Lowercase Letter
ValueCountFrequency (%)
e 2352
11.0%
o 2003
 
9.3%
n 1787
 
8.3%
r 1786
 
8.3%
a 1771
 
8.2%
t 1706
 
7.9%
i 1656
 
7.7%
s 1216
 
5.7%
l 1125
 
5.2%
c 929
 
4.3%
Other values (16) 5136
23.9%
Other Punctuation
ValueCountFrequency (%)
, 20067
74.6%
: 2698
 
10.0%
. 2613
 
9.7%
· 764
 
2.8%
? 224
 
0.8%
' 127
 
0.5%
/ 123
 
0.5%
& 108
 
0.4%
" 78
 
0.3%
! 50
 
0.2%
Other values (11) 36
 
0.1%
Decimal Number
ValueCountFrequency (%)
0 2878
26.4%
2 2274
20.9%
1 2270
20.9%
9 1157
10.6%
5 413
 
3.8%
7 390
 
3.6%
6 382
 
3.5%
8 379
 
3.5%
4 374
 
3.4%
3 359
 
3.3%
Other values (4) 5
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 605
86.4%
~ 73
 
10.4%
+ 8
 
1.1%
7
 
1.0%
> 3
 
0.4%
< 3
 
0.4%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
60
37.0%
60
37.0%
18
 
11.1%
14
 
8.6%
8
 
4.9%
1
 
0.6%
1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 2034
87.8%
[ 271
 
11.7%
12
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 2033
87.8%
] 271
 
11.7%
12
 
0.5%
Control
ValueCountFrequency (%)
125054
> 99.9%
 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
35331
> 99.9%
  8
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 1278
99.9%
1
 
0.1%
Other Symbol
ValueCountFrequency (%)
2
66.7%
1
33.3%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 214187
43.7%
Common 204782
41.8%
Han 39462
 
8.0%
Latin 27422
 
5.6%
Hiragana 2337
 
0.5%
Katakana 2253
 
0.5%

Most frequent character per script

Han
ValueCountFrequency (%)
1754
 
4.4%
1420
 
3.6%
1383
 
3.5%
1245
 
3.2%
850
 
2.2%
785
 
2.0%
658
 
1.7%
644
 
1.6%
642
 
1.6%
630
 
1.6%
Other values (1121) 29451
74.6%
Hangul
ValueCountFrequency (%)
13318
 
6.2%
6047
 
2.8%
5830
 
2.7%
5234
 
2.4%
4837
 
2.3%
4636
 
2.2%
4618
 
2.2%
4363
 
2.0%
4270
 
2.0%
4091
 
1.9%
Other values (879) 156943
73.3%
Katakana
ValueCountFrequency (%)
188
 
8.3%
92
 
4.1%
91
 
4.0%
82
 
3.6%
81
 
3.6%
77
 
3.4%
76
 
3.4%
71
 
3.2%
71
 
3.2%
63
 
2.8%
Other values (69) 1361
60.4%
Latin
ValueCountFrequency (%)
e 2352
 
8.6%
o 2003
 
7.3%
n 1787
 
6.5%
r 1786
 
6.5%
a 1771
 
6.5%
t 1706
 
6.2%
i 1656
 
6.0%
s 1216
 
4.4%
l 1125
 
4.1%
c 929
 
3.4%
Other values (55) 11091
40.4%
Hiragana
ValueCountFrequency (%)
679
29.1%
269
 
11.5%
98
 
4.2%
88
 
3.8%
71
 
3.0%
68
 
2.9%
67
 
2.9%
67
 
2.9%
66
 
2.8%
59
 
2.5%
Other values (55) 805
34.4%
Common
ValueCountFrequency (%)
125054
61.1%
35331
 
17.3%
, 20067
 
9.8%
0 2878
 
1.4%
: 2698
 
1.3%
. 2613
 
1.3%
2 2274
 
1.1%
1 2270
 
1.1%
( 2034
 
1.0%
) 2033
 
1.0%
Other values (49) 7530
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 231200
47.1%
Hangul 214085
43.7%
CJK 39008
 
8.0%
Hiragana 2337
 
0.5%
Katakana 2253
 
0.5%
None 827
 
0.2%
CJK Compat Ideographs 454
 
0.1%
Number Forms 162
 
< 0.1%
Compat Jamo 102
 
< 0.1%
Math Operators 7
 
< 0.1%
Other values (3) 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
125054
54.1%
35331
 
15.3%
, 20067
 
8.7%
0 2878
 
1.2%
: 2698
 
1.2%
. 2613
 
1.1%
e 2352
 
1.0%
2 2274
 
1.0%
1 2270
 
1.0%
( 2034
 
0.9%
Other values (79) 33629
 
14.5%
Hangul
ValueCountFrequency (%)
13318
 
6.2%
6047
 
2.8%
5830
 
2.7%
5234
 
2.4%
4837
 
2.3%
4636
 
2.2%
4618
 
2.2%
4363
 
2.0%
4270
 
2.0%
4091
 
1.9%
Other values (876) 156841
73.3%
CJK
ValueCountFrequency (%)
1754
 
4.5%
1420
 
3.6%
1383
 
3.5%
1245
 
3.2%
850
 
2.2%
785
 
2.0%
658
 
1.7%
644
 
1.7%
642
 
1.6%
630
 
1.6%
Other values (1072) 28997
74.3%
None
ValueCountFrequency (%)
· 764
92.4%
12
 
1.5%
12
 
1.5%
8
 
1.0%
  8
 
1.0%
3
 
0.4%
3
 
0.4%
2
 
0.2%
2
 
0.2%
2
 
0.2%
Other values (11) 11
 
1.3%
Hiragana
ValueCountFrequency (%)
679
29.1%
269
 
11.5%
98
 
4.2%
88
 
3.8%
71
 
3.0%
68
 
2.9%
67
 
2.9%
67
 
2.9%
66
 
2.8%
59
 
2.5%
Other values (55) 805
34.4%
Katakana
ValueCountFrequency (%)
188
 
8.3%
92
 
4.1%
91
 
4.0%
82
 
3.6%
81
 
3.6%
77
 
3.4%
76
 
3.4%
71
 
3.2%
71
 
3.2%
63
 
2.8%
Other values (69) 1361
60.4%
Compat Jamo
ValueCountFrequency (%)
100
98.0%
1
 
1.0%
1
 
1.0%
CJK Compat Ideographs
ValueCountFrequency (%)
68
15.0%
63
13.9%
49
10.8%
44
9.7%
38
 
8.4%
25
 
5.5%
22
 
4.8%
20
 
4.4%
16
 
3.5%
10
 
2.2%
Other values (39) 99
21.8%
Number Forms
ValueCountFrequency (%)
60
37.0%
60
37.0%
18
 
11.1%
14
 
8.6%
8
 
4.9%
1
 
0.6%
1
 
0.6%
Math Operators
ValueCountFrequency (%)
7
100.0%
Geometric Shapes
ValueCountFrequency (%)
2
100.0%
Punctuation
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%

Missing values

2023-12-11T12:38:48.578706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:38:48.665006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

도서명 저자명 출판사 ,,
1543조림.임업경영 부민문화사 문제은행 부민문화사 ,,
14202그리는, 조경 : 드로잉으로 보는 조경 디자인 역사,이명준,한숲
2199家蠶解剖生理學.蠶病學 문재유 鄕文社 ,,
4030'96 농업과학기술 연구개발결과 농촌지도사업 활용자료 Ⅲ : 축산 생활개선 농업경영 농촌진흥청 농촌진흥청 ,,
9274쌀 협상 이후의 농지이용구조 변화 전망과 대책 한국농촌경제연구원 한국농촌경제연구원 ,,
2055家畜營養學 한인규 한국방송대학교출판부 ,,
40371997年度(第39次) 中央農業産學協同審議會資料 중앙농업산학협동심의회 中央農業産學協同審議會 ,,
12663(농업인을 위한)개인보호구 및 보조장비 = Personal Protective Equipments and Ergonomic Tools for Farmers 국립농업과학원 농촌진흥청 국립농업과학원 ,,
9099쌈채소류 : 근대·쑥갓·신선초·청경채 원예연구소 농촌진흥청 작물과학원 ,,
7343농업인 전자상거래 운영실태 조사결과 농림부 농림부 한국농림수산정보센타 ,,
도서명 저자명 출판사 ,,
12513도시양봉 : 도심 속 양봉가의 즐거움 벤보우 스티브 들녘 ,,
13529식량작물과학 연구사업연보. 2008-2015 국립식량과학원 농촌진흥청 ,,
9166꽃나무 = Ornamental trees&shrubs 서정남 부민문화사 ,,
2208家畜生理學 손제영 先進文化社 ,,
9348中國의 WTO 加入과 韓·中 農業協力 한국농촌경제연구원 한국농촌경제연구원 ,,
13821(2021학년도) 장기현장 실습일지 콘테스트 수상작. [제7회],한국농수산대학교수부,한국농수산대학 교수부
3654농업구조정책의 목표와 지원시책 한국농촌경제연구원 한국농촌경제연구원 ,,
9648기능성축산물 및 축산식품의 제도설정 : 기능성 축산물연구회 창립 심포지엄 축산과학원 농촌진흥청 ,,
6921원예시험연구계획서 원예연구소 [농촌진흥청 원예연구소] ,,
1410最新造園槪論 성기택 先進文化社 ,,

Duplicate rows

Most frequently occurring

도서명 저자명 출판사 ,,# duplicates
331日本の食生活全集. 1-50 농문협 農山漁村文化協會 ,,36
779농축산물표준소득. 1977-2006 농촌진흥청 농촌진흥청 ,,36
622농림사업시행지침서 농림부 농림부 ,,34
1267흙은 여자인가 남자인가? = 쉽게 풀어보는 흙의 과학과 관리및 시비기술 이완주 도서출판 서원 ,,30
384畜産機械 및 施設 박경규 文運堂 ,,22
773농촌진흥사업연보 = RDA Annual Report. 1998-2016 농촌진흥청 농촌진흥청 ,,19
781농협연감. 1984-2016 농업협동조합중앙회 농업협동조합중앙회 ,,18
84(국가농업 R&D)시험연구사업보고서. 2009 농촌진흥청 농촌진흥청 ,,17
306家畜育種學 오봉국 한국방송대학교출판부 ,,17
550가축통계 농림부국립농산물품질관리원 [농림부 국립농산물품질관리원] ,,17