Overview

Dataset statistics

Number of variables5
Number of observations197
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.8 KiB
Average record size in memory40.7 B

Variable types

Categorical4
Text1

Dataset

Description2022년 경상남도농업기술원 연구개발사업 현황자료입니다.(농업연구, 연구개발사업, 경상남도농업기술원)
Author경상남도
URLhttps://www.data.go.kr/data/15072212/fileData.do

Alerts

부서 is highly overall correlated with 분야 and 1 other fieldsHigh correlation
분야 is highly overall correlated with 부서 and 1 other fieldsHigh correlation
연구실 is highly overall correlated with 부서 and 1 other fieldsHigh correlation
과제명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:06:32.561693
Analysis finished2023-12-12 06:06:33.162021
Duration0.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

부서
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
환경농업연구과
45 
작물연구과
38 
원예연구과
31 
사과이용연구소
18 
양파연구소
17 
Other values (4)
48 

Length

Max length7
Median length5
Mean length5.8527919
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row작물연구과
2nd row작물연구과
3rd row작물연구과
4th row작물연구과
5th row작물연구과

Common Values

ValueCountFrequency (%)
환경농업연구과 45
22.8%
작물연구과 38
19.3%
원예연구과 31
15.7%
사과이용연구소 18
 
9.1%
양파연구소 17
 
8.6%
단감연구소 14
 
7.1%
화훼연구소 13
 
6.6%
유용곤충연구소 11
 
5.6%
약용자원연구소 10
 
5.1%

Length

2023-12-12T15:06:33.236789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:06:33.405716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
환경농업연구과 45
22.8%
작물연구과 38
19.3%
원예연구과 31
15.7%
사과이용연구소 18
 
9.1%
양파연구소 17
 
8.6%
단감연구소 14
 
7.1%
화훼연구소 13
 
6.6%
유용곤충연구소 11
 
5.6%
약용자원연구소 10
 
5.1%

과제구분
Categorical

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
기관고유
110 
공동
87 

Length

Max length4
Median length4
Mean length3.1167513
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기관고유
2nd row기관고유
3rd row기관고유
4th row기관고유
5th row기관고유

Common Values

ValueCountFrequency (%)
기관고유 110
55.8%
공동 87
44.2%

Length

2023-12-12T15:06:33.570312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:06:33.706601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기관고유 110
55.8%
공동 87
44.2%

과제명
Text

UNIQUE 

Distinct197
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-12T15:06:34.033810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length35
Mean length22.588832
Min length8

Characters and Unicode

Total characters4450
Distinct characters349
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique197 ?
Unique (%)100.0%

Sample

1st row벼 우량 신품종 육성
2nd row벼 우량 계통 내병성 검정
3rd row생분해필름 피복 벼 직파재배 안정화 기술 개발
4th row벼와 밀 작부체계에 적합한 벼 적품종 선발
5th row경남지역 지대별 벼 장려품종 선발
ValueCountFrequency (%)
57
 
5.0%
개발 56
 
4.9%
신품종 36
 
3.1%
경남지역 29
 
2.5%
육성 24
 
2.1%
위한 23
 
2.0%
연구 19
 
1.7%
기술 16
 
1.4%
12
 
1.0%
구명 12
 
1.0%
Other values (563) 861
75.2%
2023-12-12T15:06:34.566904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
951
 
21.4%
82
 
1.8%
81
 
1.8%
80
 
1.8%
77
 
1.7%
71
 
1.6%
71
 
1.6%
71
 
1.6%
59
 
1.3%
57
 
1.3%
Other values (339) 2850
64.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3392
76.2%
Space Separator 951
 
21.4%
Decimal Number 32
 
0.7%
Close Punctuation 26
 
0.6%
Open Punctuation 26
 
0.6%
Other Punctuation 12
 
0.3%
Uppercase Letter 8
 
0.2%
Math Symbol 1
 
< 0.1%
Final Punctuation 1
 
< 0.1%
Initial Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
82
 
2.4%
81
 
2.4%
80
 
2.4%
77
 
2.3%
71
 
2.1%
71
 
2.1%
71
 
2.1%
59
 
1.7%
57
 
1.7%
55
 
1.6%
Other values (315) 2688
79.2%
Decimal Number
ValueCountFrequency (%)
1 6
18.8%
4 6
18.8%
2 5
15.6%
3 4
12.5%
5 3
9.4%
8 3
9.4%
6 2
 
6.2%
0 1
 
3.1%
7 1
 
3.1%
9 1
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
B 2
25.0%
D 2
25.0%
S 1
12.5%
O 1
12.5%
F 1
12.5%
C 1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 8
66.7%
· 4
33.3%
Space Separator
ValueCountFrequency (%)
951
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3392
76.2%
Common 1050
 
23.6%
Latin 8
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
82
 
2.4%
81
 
2.4%
80
 
2.4%
77
 
2.3%
71
 
2.1%
71
 
2.1%
71
 
2.1%
59
 
1.7%
57
 
1.7%
55
 
1.6%
Other values (315) 2688
79.2%
Common
ValueCountFrequency (%)
951
90.6%
) 26
 
2.5%
( 26
 
2.5%
, 8
 
0.8%
1 6
 
0.6%
4 6
 
0.6%
2 5
 
0.5%
3 4
 
0.4%
· 4
 
0.4%
5 3
 
0.3%
Other values (8) 11
 
1.0%
Latin
ValueCountFrequency (%)
B 2
25.0%
D 2
25.0%
S 1
12.5%
O 1
12.5%
F 1
12.5%
C 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3392
76.2%
ASCII 1052
 
23.6%
None 4
 
0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
951
90.4%
) 26
 
2.5%
( 26
 
2.5%
, 8
 
0.8%
1 6
 
0.6%
4 6
 
0.6%
2 5
 
0.5%
3 4
 
0.4%
5 3
 
0.3%
8 3
 
0.3%
Other values (11) 14
 
1.3%
Hangul
ValueCountFrequency (%)
82
 
2.4%
81
 
2.4%
80
 
2.4%
77
 
2.3%
71
 
2.1%
71
 
2.1%
71
 
2.1%
59
 
1.7%
57
 
1.7%
55
 
1.6%
Other values (315) 2688
79.2%
None
ValueCountFrequency (%)
· 4
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%

분야
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)11.7%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
채 소
32 
과 수
30 
사 과
18 
작물보호
17 
13 
Other values (18)
87 

Length

Max length5
Median length3
Mean length3.2081218
Min length1

Unique

Unique4 ?
Unique (%)2.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
채 소 32
16.2%
과 수 30
15.2%
사 과 18
9.1%
작물보호 17
8.6%
13
 
6.6%
화 훼 13
 
6.6%
인삼·특작 10
 
5.1%
농산가공 9
 
4.6%
버 섯 9
 
4.6%
농업환경 7
 
3.6%
Other values (13) 39
19.8%

Length

2023-12-12T15:06:34.703588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
48
14.8%
32
 
9.9%
32
 
9.9%
30
 
9.3%
18
 
5.6%
작물보호 17
 
5.2%
13
 
4.0%
13
 
4.0%
13
 
4.0%
13
 
4.0%
Other values (25) 95
29.3%

연구실
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
육 종
27 
재배이용
27 
전 작
20 
과 수
16 
답 작
13 
Other values (12)
94 

Length

Max length5
Median length3
Mean length3.5837563
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row답 작
2nd row답 작
3rd row답 작
4th row답 작
5th row답 작

Common Values

ValueCountFrequency (%)
육 종 27
13.7%
재배이용 27
13.7%
전 작 20
10.2%
과 수 16
 
8.1%
답 작 13
 
6.6%
재 배 11
 
5.6%
병 해 충 11
 
5.6%
농업환경 10
 
5.1%
농산가공 9
 
4.6%
생명공학 9
 
4.6%
Other values (7) 44
22.3%

Length

2023-12-12T15:06:34.843461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
33
 
10.5%
27
 
8.6%
재배이용 27
 
8.6%
27
 
8.6%
20
 
6.4%
16
 
5.1%
16
 
5.1%
13
 
4.2%
11
 
3.5%
11
 
3.5%
Other values (14) 112
35.8%

Correlations

2023-12-12T15:06:34.930014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부서과제구분분야연구실
부서1.0000.1630.9790.946
과제구분0.1631.0000.2290.302
분야0.9790.2291.0000.968
연구실0.9460.3020.9681.000
2023-12-12T15:06:35.018543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과제구분연구실부서분야
과제구분1.0000.2600.1590.187
연구실0.2601.0000.7550.745
부서0.1590.7551.0000.854
분야0.1870.7450.8541.000
2023-12-12T15:06:35.115354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부서과제구분분야연구실
부서1.0000.1590.8540.755
과제구분0.1591.0000.1870.260
분야0.8540.1871.0000.745
연구실0.7550.2600.7451.000

Missing values

2023-12-12T15:06:33.014429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:06:33.118037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

부서과제구분과제명분야연구실
0작물연구과기관고유벼 우량 신품종 육성답 작
1작물연구과기관고유벼 우량 계통 내병성 검정답 작
2작물연구과기관고유생분해필름 피복 벼 직파재배 안정화 기술 개발답 작
3작물연구과기관고유벼와 밀 작부체계에 적합한 벼 적품종 선발답 작
4작물연구과기관고유경남지역 지대별 벼 장려품종 선발답 작
5작물연구과기관고유드론 활용 벼 직파재배 개선 연구답 작
6작물연구과기관고유벼 스마트팜 물 관리 모듈 개선 연구답 작
7작물연구과기관고유벼 원원종 생산답 작
8작물연구과기관고유경남지역 적응 고구마 신품종 육성서 류전 작
9작물연구과기관고유경남지역 우수 토종자원 활용 밀 품종 개발맥 류전 작
부서과제구분과제명분야연구실
187유용곤충연구소기관고유유황굼벵이 생산기술 개발산업곤충산업곤충
188유용곤충연구소기관고유흰점박이꽃무지 분변토 이용 기술 개발산업곤충잠사양봉
189유용곤충연구소기관고유흰점박이꽃무지 인돌알칼로이드 분석방법 개발가 공산업곤충
190유용곤충연구소기관고유흰점박이꽃무지 장내미생물 이용기술 개발산업곤충산업곤충
191유용곤충연구소기관고유식용곤충을 활용한 여드름 피부치료효과 탐색 및 소재화 개발 연구산업곤충산업곤충
192유용곤충연구소공동경남지역 꿀벌 신품종 지역적응시험양 봉잠사양봉
193유용곤충연구소공동경남지역 고품질 다수성 잠상 신품종 지역적응연구잠 업잠사양봉
194유용곤충연구소공동식용곤충 소재 가공기술 개발가 공잠사양봉
195유용곤충연구소공동지역특산물 활용한 수확용 먹이에 따른 이취저감 기술 개발산업곤충잠사양봉
196유용곤충연구소공동지능형 곤충 스마트팜(누에, 쌍별귀뚜라미) 데이터 구축산업곤충잠사양봉