Overview

Dataset statistics

Number of variables5
Number of observations73
Missing cells73
Missing cells (%)20.0%
Duplicate rows2
Duplicate rows (%)2.7%
Total size in memory3.0 KiB
Average record size in memory41.8 B

Variable types

Text3
Categorical1
DateTime1

Dataset

Description강원도 평창군 종자생산업 현황 데이터로 종자생산업체명, 법인명, 종자업 종류, 취급 작물, 데이터 기준일자 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15113458/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 2 (2.7%) duplicate rowsDuplicates
업체명 has 26 (35.6%) missing valuesMissing
법인명 has 47 (64.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 19:13:39.245680
Analysis finished2023-12-12 19:13:40.283072
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

MISSING 

Distinct45
Distinct (%)95.7%
Missing26
Missing (%)35.6%
Memory size716.0 B
2023-12-13T04:13:40.501309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length9
Mean length6.5744681
Min length4

Characters and Unicode

Total characters309
Distinct characters112
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)91.5%

Sample

1st row㈜오리온
2nd row대관령원예협동조합
3rd row대관령하스카프농장
4th row오대산영농조합법인
5th row미탄농장
ValueCountFrequency (%)
청옥산 2
 
3.8%
가자팜평창 2
 
3.8%
육백마지기 2
 
3.8%
대관령 1
 
1.9%
이루농장 1
 
1.9%
㈜오리온 1
 
1.9%
백운산촌 1
 
1.9%
치유농장 1
 
1.9%
할미골산채원 1
 
1.9%
평창약초농원 1
 
1.9%
Other values (39) 39
75.0%
2023-12-13T04:13:40.934184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29
 
9.4%
16
 
5.2%
13
 
4.2%
12
 
3.9%
10
 
3.2%
9
 
2.9%
8
 
2.6%
8
 
2.6%
8
 
2.6%
7
 
2.3%
Other values (102) 189
61.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 299
96.8%
Other Symbol 5
 
1.6%
Space Separator 5
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
9.7%
16
 
5.4%
13
 
4.3%
12
 
4.0%
10
 
3.3%
9
 
3.0%
8
 
2.7%
8
 
2.7%
8
 
2.7%
7
 
2.3%
Other values (100) 179
59.9%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 304
98.4%
Common 5
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
9.5%
16
 
5.3%
13
 
4.3%
12
 
3.9%
10
 
3.3%
9
 
3.0%
8
 
2.6%
8
 
2.6%
8
 
2.6%
7
 
2.3%
Other values (101) 184
60.5%
Common
ValueCountFrequency (%)
5
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 299
96.8%
None 5
 
1.6%
ASCII 5
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
29
 
9.7%
16
 
5.4%
13
 
4.3%
12
 
4.0%
10
 
3.3%
9
 
3.0%
8
 
2.7%
8
 
2.7%
8
 
2.7%
7
 
2.3%
Other values (100) 179
59.9%
None
ValueCountFrequency (%)
5
100.0%
ASCII
ValueCountFrequency (%)
5
100.0%

법인명
Text

MISSING 

Distinct26
Distinct (%)100.0%
Missing47
Missing (%)64.4%
Memory size716.0 B
2023-12-13T04:13:41.187908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length9.2692308
Min length4

Characters and Unicode

Total characters241
Distinct characters82
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row㈜오리온
2nd row대관령원예협동조합
3rd row대관령하스카프농장
4th row오대산영농조합법인
5th row㈜후레쉬푸드
ValueCountFrequency (%)
농업회사법인 2
 
6.7%
주식회사 2
 
6.7%
강원감자영농조합법인 1
 
3.3%
㈜오리온 1
 
3.3%
㈜오대산씨감자 1
 
3.3%
㈜상산재농업회사법인 1
 
3.3%
평창산양삼특구영농조합법인 1
 
3.3%
봉평영농조합법인 1
 
3.3%
눈꽃마을 1
 
3.3%
대관령산양산삼영농조합법인 1
 
3.3%
Other values (18) 18
60.0%
2023-12-13T04:13:41.553885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
7.9%
14
 
5.8%
14
 
5.8%
12
 
5.0%
12
 
5.0%
12
 
5.0%
9
 
3.7%
8
 
3.3%
6
 
2.5%
6
 
2.5%
Other values (72) 129
53.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 229
95.0%
Other Symbol 8
 
3.3%
Space Separator 4
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
8.3%
14
 
6.1%
14
 
6.1%
12
 
5.2%
12
 
5.2%
12
 
5.2%
9
 
3.9%
6
 
2.6%
6
 
2.6%
6
 
2.6%
Other values (70) 119
52.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 237
98.3%
Common 4
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
8.0%
14
 
5.9%
14
 
5.9%
12
 
5.1%
12
 
5.1%
12
 
5.1%
9
 
3.8%
8
 
3.4%
6
 
2.5%
6
 
2.5%
Other values (71) 125
52.7%
Common
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 229
95.0%
None 8
 
3.3%
ASCII 4
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
19
 
8.3%
14
 
6.1%
14
 
6.1%
12
 
5.2%
12
 
5.2%
12
 
5.2%
9
 
3.9%
6
 
2.6%
6
 
2.6%
6
 
2.6%
Other values (70) 119
52.0%
None
ValueCountFrequency (%)
8
100.0%
ASCII
ValueCountFrequency (%)
4
100.0%

종자업 종류
Categorical

Distinct6
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size716.0 B
기타
51 
채소
식량작물
과수
 
3
버섯
 
3

Length

Max length4
Median length2
Mean length2.1643836
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식량작물
2nd row식량작물
3rd row과수
4th row채소
5th row채소

Common Values

ValueCountFrequency (%)
기타 51
69.9%
채소 8
 
11.0%
식량작물 6
 
8.2%
과수 3
 
4.1%
버섯 3
 
4.1%
화훼 2
 
2.7%

Length

2023-12-13T04:13:41.695317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:13:41.805125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 51
69.9%
채소 8
 
11.0%
식량작물 6
 
8.2%
과수 3
 
4.1%
버섯 3
 
4.1%
화훼 2
 
2.7%
Distinct43
Distinct (%)58.9%
Missing0
Missing (%)0.0%
Memory size716.0 B
2023-12-13T04:13:42.002672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length33
Mean length7.7260274
Min length2

Characters and Unicode

Total characters564
Distinct characters102
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)43.8%

Sample

1st row감자
2nd row감자
3rd row블루베리, 하스카프
4th row삼채육묘
5th row삼마늘, 곰취, 수리취 등
ValueCountFrequency (%)
산마늘 23
15.2%
산양삼 19
 
12.6%
17
 
11.3%
눈개승마 7
 
4.6%
곰취 7
 
4.6%
산채류 6
 
4.0%
더덕 5
 
3.3%
씨감자 4
 
2.6%
딸기 4
 
2.6%
엄나무 4
 
2.6%
Other values (48) 55
36.4%
2023-12-13T04:13:42.417679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
79
 
14.0%
, 62
 
11.0%
54
 
9.6%
34
 
6.0%
25
 
4.4%
23
 
4.1%
21
 
3.7%
17
 
3.0%
13
 
2.3%
11
 
2.0%
Other values (92) 225
39.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 417
73.9%
Space Separator 79
 
14.0%
Other Punctuation 62
 
11.0%
Open Punctuation 3
 
0.5%
Close Punctuation 3
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
12.9%
34
 
8.2%
25
 
6.0%
23
 
5.5%
21
 
5.0%
17
 
4.1%
13
 
3.1%
11
 
2.6%
11
 
2.6%
9
 
2.2%
Other values (88) 199
47.7%
Space Separator
ValueCountFrequency (%)
79
100.0%
Other Punctuation
ValueCountFrequency (%)
, 62
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 417
73.9%
Common 147
 
26.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
12.9%
34
 
8.2%
25
 
6.0%
23
 
5.5%
21
 
5.0%
17
 
4.1%
13
 
3.1%
11
 
2.6%
11
 
2.6%
9
 
2.2%
Other values (88) 199
47.7%
Common
ValueCountFrequency (%)
79
53.7%
, 62
42.2%
( 3
 
2.0%
) 3
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 417
73.9%
ASCII 147
 
26.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
79
53.7%
, 62
42.2%
( 3
 
2.0%
) 3
 
2.0%
Hangul
ValueCountFrequency (%)
54
 
12.9%
34
 
8.2%
25
 
6.0%
23
 
5.5%
21
 
5.0%
17
 
4.1%
13
 
3.1%
11
 
2.6%
11
 
2.6%
9
 
2.2%
Other values (88) 199
47.7%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size716.0 B
Minimum2023-04-19 00:00:00
Maximum2023-04-19 00:00:00
2023-12-13T04:13:42.528612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:13:42.622717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-13T04:13:42.693352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명법인명종자업 종류취급 작물
업체명1.0001.0000.0000.943
법인명1.0001.0001.0001.000
종자업 종류0.0001.0001.0001.000
취급 작물0.9431.0001.0001.000

Missing values

2023-12-13T04:13:39.979572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:13:40.094231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T04:13:40.223387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업체명법인명종자업 종류취급 작물데이터기준일자
0㈜오리온㈜오리온식량작물감자2023-04-19
1대관령원예협동조합대관령원예협동조합식량작물감자2023-04-19
2대관령하스카프농장대관령하스카프농장과수블루베리, 하스카프2023-04-19
3오대산영농조합법인오대산영농조합법인채소삼채육묘2023-04-19
4미탄농장<NA>채소삼마늘, 곰취, 수리취 등2023-04-19
5㈜후레쉬푸드㈜후레쉬푸드채소삼채2023-04-19
6오대산영농조합<NA>기타더덕2023-04-19
7지복농원지복영농조합법인기타산마늘, 눈개승마, 오미자 등2023-04-19
8신가네농장<NA>기타특용작물2023-04-19
9대관령육묘장<NA>채소배추, 양배추, 양상추 등2023-04-19
업체명법인명종자업 종류취급 작물데이터기준일자
63<NA>㈜상산재농업회사법인기타산양삼, 도라지, 더덕, 산마늘, 곰취, 두릅 등2023-04-19
64이루농장<NA>기타산마늘, 눈개승마, 누룩취 등2023-04-19
65평창햇살버섯<NA>버섯버섯2023-04-19
66<NA>농업회사법인㈜우리두기타산양삼2023-04-19
67<NA><NA>기타산양삼2023-04-19
68<NA><NA>채소딸기 등2023-04-19
69가자팜평창<NA>기타산마늘, 눈개승마, 곰취 등2023-04-19
70<NA><NA>기타엄나무, 메밀 등2023-04-19
71가자팜평창<NA>화훼수선화, 백합, 튤립 등2023-04-19
72대관령 옥토농원<NA>기타함박꽃나무, 마가목, 엄나무, 두메닥나무, 오얏나무 산작약,잔대,누룩취2023-04-19

Duplicate rows

Most frequently occurring

업체명법인명종자업 종류취급 작물데이터기준일자# duplicates
0<NA><NA>기타산양삼2023-04-194
1<NA><NA>기타산채류2023-04-192