Overview

Dataset statistics

Number of variables: 8
Number of observations: 153
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.7 KiB
Average record size in memory: 64.9 B
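
The average record size follows from the two figures above it: total size in memory divided by the number of observations. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes; `average_record_size_bytes` is an illustrative helper name, not part of the report or of ydata-profiling.

```python
def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Convert a total size in KiB to an average per-row size in bytes."""
    return total_kib * 1024 / n_rows

# Figures taken from the dataset statistics: 9.7 KiB across 153 observations.
avg = average_record_size_bytes(9.7, 153)
print(f"{avg:.1f} B")
```

Because the reported total is itself rounded to one decimal, the result only agrees with the reported 64.9 B to that precision.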

Variable types

Text: 2
Categorical: 6

Dataset

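Description: Agricultural product origin certification information managed by the National Agricultural Products Quality Management Service (certification number, initial certification date, certification validity period, certified company name, product name, origin, ratio, certifying body)
Author: National Agricultural Products Quality Management Service (국립농산물품질관리원)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220609000000002101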

Alerts

최초인증일자 is highly overall correlated with 인증유효기간 and 4 other fields (High correlation)
인증업체명 is highly overall correlated with 최초인증일자 and 4 other fields (High correlation)
인증유효기간 is highly overall correlated with 최초인증일자 and 4 other fields (High correlation)
원산지 is highly overall correlated with 최초인증일자 and 3 other fields (High correlation)
비율 is highly overall correlated with 최초인증일자 and 2 other fields (High correlation)
인증기관 is highly overall correlated with 최초인증일자 and 3 other fields (High correlation)
원산지 is highly imbalanced (76.3%) (Imbalance)
인증번호 has unique values (Unique)
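
The 76.3% imbalance figure for 원산지 can be reproduced from the value counts listed in that variable's Common Values table. The sketch below assumes the score is one minus the Shannon entropy of the counts divided by its maximum (log2 of the number of distinct values). The report does not state its formula, so that definition is an assumption, and `imbalance_score` is an illustrative name.

```python
import math

def imbalance_score(counts):
    """1 - H / H_max, where H is the base-2 Shannon entropy of the counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    if len(probs) < 2:
        return 0.0  # a single class is treated as "no imbalance" in this sketch
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(probs))

# Value counts of 원산지 as listed in the report (10 distinct values, 153 rows).
origin_counts = [137, 5, 2, 2, 2, 1, 1, 1, 1, 1]
score = imbalance_score(origin_counts)
print(f"{score:.1%}")
```

Under this definition a perfectly uniform column scores 0 and a column dominated by one value approaches 1, which is consistent with 국산 covering 137 of the 153 rows.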

Reproduction

Analysis started: 2024-03-23 07:48:42.775006
Analysis finished: 2024-03-23 07:48:44.000060
Duration: 1.23 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
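
The reported duration is simply the difference between the two timestamps. A small sketch using only the standard library; the format string is an assumption based on how the timestamps are printed in this report.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-03-23 07:48:42.775006", FMT)
finished = datetime.strptime("2024-03-23 07:48:44.000060", FMT)

# Elapsed wall-clock time between the start and the end of the analysis.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```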

Variables

인증번호
Text

UNIQUE 

Distinct: 153
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 13
Median length: 13
Mean length: 12.267974
Min length: 10

Characters and Unicode

Total characters: 1877
Distinct characters: 21
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
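
As an illustration of the character properties mentioned above, the sketch below counts Unicode general categories for the first sample value of 인증번호 using Python's standard `unicodedata` module. The short codes `Lo`, `Nd` and `Zs` correspond to the Other Letter, Decimal Number and Space Separator categories listed in this report. The standard library does not expose scripts or blocks, so only categories are shown, and `category_counts` is an illustrative name.

```python
import unicodedata
from collections import Counter

def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category (e.g. 'Lo', 'Nd', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)

# First sample value of 인증번호 from the report.
counts = category_counts("푸름 원산지 제1호")
print(dict(counts))
```

Summing such counts over all 153 values is one way the per-category totals could be obtained; how the report computes them internally is not stated here.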

Unique

Unique: 153
Unique (%): 100.0%

Sample

1st row: 푸름 원산지 제1호
2nd row: 푸름 원산지 제2호
3rd row: 푸름 원산지 제3호
4th row: 푸름 원산지 제4호
5th row: 푸름 원산지 제5호

Value | Count | Frequency (%)
원산지 | 153 | 33.3%
식품연 | 126 | 27.5%
푸름 | 27 | 5.9%
제190호 | 1 | 0.2%
제141호 | 1 | 0.2%
제135호 | 1 | 0.2%
제136호 | 1 | 0.2%
제152호 | 1 | 0.2%
제137호 | 1 | 0.2%
제138호 | 1 | 0.2%
Other values (146) | 146 | 31.8%

Most occurring characters

ValueCountFrequency (%)
306
16.3%
153
8.2%
153
8.2%
153
8.2%
153
8.2%
153
8.2%
126
 
6.7%
126
 
6.7%
126
 
6.7%
1 108
 
5.8%
Other values (11) 320
17.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1197 | 63.8%
Decimal Number | 374 | 19.9%
Space Separator | 306 | 16.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
153
12.8%
153
12.8%
153
12.8%
153
12.8%
153
12.8%
126
10.5%
126
10.5%
126
10.5%
27
 
2.3%
27
 
2.3%
Decimal Number
ValueCountFrequency (%)
1 108
28.9%
6 36
 
9.6%
7 35
 
9.4%
2 34
 
9.1%
3 34
 
9.1%
5 33
 
8.8%
4 31
 
8.3%
8 30
 
8.0%
9 17
 
4.5%
0 16
 
4.3%
Space Separator
ValueCountFrequency (%)
306
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1197 | 63.8%
Common | 680 | 36.2%

Most frequent character per script

Common
ValueCountFrequency (%)
306
45.0%
1 108
 
15.9%
6 36
 
5.3%
7 35
 
5.1%
2 34
 
5.0%
3 34
 
5.0%
5 33
 
4.9%
4 31
 
4.6%
8 30
 
4.4%
9 17
 
2.5%
Hangul
ValueCountFrequency (%)
153
12.8%
153
12.8%
153
12.8%
153
12.8%
153
12.8%
126
10.5%
126
10.5%
126
10.5%
27
 
2.3%
27
 
2.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1197 | 63.8%
ASCII | 680 | 36.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
306
45.0%
1 108
 
15.9%
6 36
 
5.3%
7 35
 
5.1%
2 34
 
5.0%
3 34
 
5.0%
5 33
 
4.9%
4 31
 
4.6%
8 30
 
4.4%
9 17
 
2.5%
Hangul
ValueCountFrequency (%)
153
12.8%
153
12.8%
153
12.8%
153
12.8%
153
12.8%
126
10.5%
126
10.5%
126
10.5%
27
 
2.3%
27
 
2.3%

최초인증일자
Categorical

HIGH CORRELATION 

Distinct: 29
Distinct (%): 19.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
2019-12-09 | 36
2018-12-18 | 18
2020-01-21 | 16
2021-02-08 | 11
2020-01-20 | 9
Other values (24) | 63

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 10
Unique (%): 6.5%

Sample

1st row: 2021-05-27
2nd row: 2021-07-12
3rd row: 2021-07-12
4th row: 2021-08-02
5th row: 2021-08-02

Common Values

Value | Count | Frequency (%)
2019-12-09 | 36 | 23.5%
2018-12-18 | 18 | 11.8%
2020-01-21 | 16 | 10.5%
2021-02-08 | 11 | 7.2%
2020-01-20 | 9 | 5.9%
2020-02-25 | 8 | 5.2%
2021-07-12 | 6 | 3.9%
2020-03-26 | 5 | 3.3%
2020-11-09 | 5 | 3.3%
2022-07-28 | 5 | 3.3%
Other values (19) | 34 | 22.2%

Length

Histogram of lengths of the category

인증유효기간
Categorical

HIGH CORRELATION 

Distinct: 29
Distinct (%): 19.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
2022-12-08 | 36
2024-12-17 | 18
2023-01-20 | 16
2024-02-07 | 11
2023-01-19 | 9
Other values (24) | 63

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 10
Unique (%): 6.5%

Sample

1st row: 2024-05-26
2nd row: 2024-07-11
3rd row: 2024-07-11
4th row: 2024-08-01
5th row: 2024-08-01

Common Values

Value | Count | Frequency (%)
2022-12-08 | 36 | 23.5%
2024-12-17 | 18 | 11.8%
2023-01-20 | 16 | 10.5%
2024-02-07 | 11 | 7.2%
2023-01-19 | 9 | 5.9%
2023-02-24 | 8 | 5.2%
2024-07-11 | 6 | 3.9%
2023-03-25 | 5 | 3.3%
2023-11-08 | 5 | 3.3%
2025-07-27 | 5 | 3.3%
Other values (19) | 34 | 22.2%

Length

Histogram of lengths of the category

인증업체명
Categorical

HIGH CORRELATION 

Distinct: 39
Distinct (%): 25.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
㈜농가식품 | 24
서안동농협 풍산김치공장 | 18
고삼농협 안성마춤 푸드센터 | 10
㈜송원식품 | 8
㈜한성식품 서산지사 | 7
Other values (34) | 86

Length

Max length: 21
Median length: 15
Mean length: 9.2941176
Min length: 3

Unique

Unique: 14
Unique (%): 9.2%

Sample

1st row: 농업회사법인 온샘㈜
2nd row: 태장고
3rd row: 태장고
4th row: 지보농협참기름가공공장
5th row: 지보농협참기름가공공장

Common Values

Value | Count | Frequency (%)
㈜농가식품 | 24 | 15.7%
서안동농협 풍산김치공장 | 18 | 11.8%
고삼농협 안성마춤 푸드센터 | 10 | 6.5%
㈜송원식품 | 8 | 5.2%
㈜한성식품 서산지사 | 7 | 4.6%
㈜한성식품 부천공장 | 7 | 4.6%
㈜효원 | 6 | 3.9%
안동제비원전통식품 | 5 | 3.3%
황금터영농조합법인 | 5 | 3.3%
농업회사법인(주)영양F&S | 5 | 3.3%
Other values (29) | 58 | 37.9%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
㈜농가식품 | 24 | 11.1%
풍산김치공장 | 18 | 8.3%
서안동농협 | 18 | 8.3%
㈜한성식품 | 15 | 6.9%
고삼농협 | 10 | 4.6%
안성마춤 | 10 | 4.6%
푸드센터 | 10 | 4.6%
㈜송원식품 | 8 | 3.7%
서산지사 | 7 | 3.2%
부천공장 | 7 | 3.2%
Other values (35) | 89 | 41.2%

제품명
Text

Distinct: 145
Distinct (%): 94.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 25
Median length: 18
Mean length: 8.1633987
Min length: 2

Characters and Unicode

Total characters: 1249
Distinct characters: 230
Distinct categories: 7
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 139
Unique (%): 90.8%

Sample

1st row: 안동참마분말
2nd row: 새뜸된장
3rd row: 새뜸고추장
4th row: 참기름
5th row: 들기름

Value | Count | Frequency (%)
포기김치 | 6 | 3.0%
애터미 | 5 | 2.5%
서분례명인 | 4 | 2.0%
고춧가루(고춧가루 | 4 | 2.0%
아주 | 3 | 1.5%
좋은 | 3 | 1.5%
이혜정의 | 3 | 1.5%
절임배추 | 2 | 1.0%
안동제비원 | 2 | 1.0%
구운요술콩 | 2 | 1.0%
Other values (156) | 165 | 82.9%

Most occurring characters

ValueCountFrequency (%)
57
 
4.6%
57
 
4.6%
49
 
3.9%
46
 
3.7%
36
 
2.9%
34
 
2.7%
34
 
2.7%
33
 
2.6%
31
 
2.5%
30
 
2.4%
Other values (220) 842
67.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1164 | 93.2%
Space Separator | 46 | 3.7%
Open Punctuation | 16 | 1.3%
Close Punctuation | 16 | 1.3%
Decimal Number | 4 | 0.3%
Lowercase Letter | 2 | 0.2%
Uppercase Letter | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
4.9%
57
 
4.9%
49
 
4.2%
36
 
3.1%
34
 
2.9%
34
 
2.9%
33
 
2.8%
31
 
2.7%
30
 
2.6%
29
 
2.5%
Other values (211) 774
66.5%
Decimal Number
ValueCountFrequency (%)
1 2
50.0%
0 1
25.0%
6 1
25.0%
Lowercase Letter
ValueCountFrequency (%)
h 1
50.0%
e 1
50.0%
Space Separator
ValueCountFrequency (%)
46
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Uppercase Letter
ValueCountFrequency (%)
T 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1163 | 93.1%
Common | 82 | 6.6%
Latin | 3 | 0.2%
Han | 1 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
4.9%
57
 
4.9%
49
 
4.2%
36
 
3.1%
34
 
2.9%
34
 
2.9%
33
 
2.8%
31
 
2.7%
30
 
2.6%
29
 
2.5%
Other values (210) 773
66.5%
Common
ValueCountFrequency (%)
46
56.1%
( 16
 
19.5%
) 16
 
19.5%
1 2
 
2.4%
0 1
 
1.2%
6 1
 
1.2%
Latin
ValueCountFrequency (%)
T 1
33.3%
h 1
33.3%
e 1
33.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1163 | 93.1%
ASCII | 85 | 6.8%
CJK | 1 | 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
57
 
4.9%
57
 
4.9%
49
 
4.2%
36
 
3.1%
34
 
2.9%
34
 
2.9%
33
 
2.8%
31
 
2.7%
30
 
2.6%
29
 
2.5%
Other values (210) 773
66.5%
ASCII
ValueCountFrequency (%)
46
54.1%
( 16
 
18.8%
) 16
 
18.8%
1 2
 
2.4%
T 1
 
1.2%
h 1
 
1.2%
e 1
 
1.2%
0 1
 
1.2%
6 1
 
1.2%
CJK
ValueCountFrequency (%)
1
100.0%

원산지
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 10
Distinct (%): 6.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
국산 | 137
건고추(한국산100%) | 5
고추(한국산100%) | 2
콩(한국산100%), 소금(한국산100%) | 2
들깨(국산) | 2
Other values (5) | 5

Length

Max length: 23
Median length: 2
Mean length: 2.9869281
Min length: 2

Unique

Unique: 5
Unique (%): 3.3%

Sample

1st row: 국산
2nd row: 국산
3rd row: 국산
4th row: 국산
5th row: 국산

Common Values

Value | Count | Frequency (%)
국산 | 137 | 89.5%
건고추(한국산100%) | 5 | 3.3%
고추(한국산100%) | 2 | 1.3%
콩(한국산100%), 소금(한국산100%) | 2 | 1.3%
들깨(국산) | 2 | 1.3%
고추(100%) | 1 | 0.7%
콩(한국산100%) | 1 | 0.7%
참깨(국산) | 1 | 0.7%
건고추 / 한국산 | 1 | 0.7%
건고추 / 한국산 | 1 | 0.7%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
국산 137
86.2%
건고추(한국산100 5
 
3.1%
콩(한국산100 3
 
1.9%
고추(한국산100 2
 
1.3%
소금(한국산100 2
 
1.3%
들깨(국산 2
 
1.3%
건고추 2
 
1.3%
2
 
1.3%
한국산 2
 
1.3%
고추(100 1
 
0.6%

비율
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
100% | 89
95% | 64

Length

Max length: 4
Median length: 4
Mean length: 3.5816993
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 100%
2nd row: 100%
3rd row: 100%
4th row: 100%
5th row: 100%

Common Values

ValueCountFrequency (%)
100% | 89 | 58.2%
95% | 64 | 41.8%
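
The mean lengths reported for the two-valued columns can be checked directly from their value counts, since the mean is just a count-weighted average of string lengths. A minimal sketch; `mean_length` is an illustrative helper, and the counts are copied from the Common Values tables of 비율 and 인증기관.

```python
def mean_length(value_counts: dict[str, int]) -> float:
    """Count-weighted mean string length over a value -> count mapping."""
    total = sum(value_counts.values())
    return sum(len(value) * n for value, n in value_counts.items()) / total

ratio_counts = {"100%": 89, "95%": 64}                       # 비율
certifier_counts = {"한국식품연구원": 126, "주식회사 푸름인증원": 27}   # 인증기관

print(round(mean_length(ratio_counts), 7))
print(round(mean_length(certifier_counts), 7))
```

Lengths here are counted in Unicode code points (Python's `len`), which matches the reported minimum and maximum lengths for both columns.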

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
100 | 89 | 58.2%
95 | 64 | 41.8%

인증기관
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
한국식품연구원 | 126
주식회사 푸름인증원 | 27

Length

Max length: 10
Median length: 7
Mean length: 7.5294118
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 주식회사 푸름인증원
2nd row: 주식회사 푸름인증원
3rd row: 주식회사 푸름인증원
4th row: 주식회사 푸름인증원
5th row: 주식회사 푸름인증원

Common Values

Value | Count | Frequency (%)
한국식품연구원 | 126 | 82.4%
주식회사 푸름인증원 | 27 | 17.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
한국식품연구원 | 126 | 70.0%
주식회사 | 27 | 15.0%
푸름인증원 | 27 | 15.0%
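
The percentages in this last table differ from the Common Values table because they appear to be computed over whitespace-separated tokens rather than over rows: 126 + 27 + 27 = 180 tokens in total. That reading is inferred from the numbers and is not stated in the report. The sketch below reproduces the figures under that assumption; `token_frequencies` is an illustrative name, and unlike the report's tokeniser (which also seems to strip punctuation in other columns) it only splits on whitespace.

```python
from collections import Counter

def token_frequencies(value_counts: dict[str, int]) -> dict[str, tuple[int, float]]:
    """Split each value on whitespace and return token -> (count, share in %)."""
    tokens = Counter()
    for value, n in value_counts.items():
        for token in value.split():
            tokens[token] += n
    total = sum(tokens.values())
    return {t: (c, round(100 * c / total, 1)) for t, c in tokens.items()}

# Value counts of 인증기관 from the Common Values table.
freqs = token_frequencies({"한국식품연구원": 126, "주식회사 푸름인증원": 27})
for token, (count, pct) in freqs.items():
    print(token, count, f"{pct}%")
```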

Correlations

Variable | 최초인증일자 | 인증유효기간 | 인증업체명 | 원산지 | 비율 | 인증기관
최초인증일자 | 1.000 | 1.000 | 0.998 | 0.930 | 0.797 | 0.980
인증유효기간 | 1.000 | 1.000 | 0.998 | 0.930 | 0.797 | 0.980
인증업체명 | 0.998 | 0.998 | 1.000 | 0.901 | 0.938 | 0.993
원산지 | 0.930 | 0.930 | 0.901 | 1.000 | 0.213 | 0.884
비율 | 0.797 | 0.797 | 0.938 | 0.213 | 1.000 | 0.546
인증기관 | 0.980 | 0.980 | 0.993 | 0.884 | 0.546 | 1.000

Variable | 최초인증일자 | 인증업체명 | 인증기관 | 인증유효기간 | 원산지 | 비율
최초인증일자 | 1.000 | 0.912 | 0.872 | 1.000 | 0.628 | 0.642
인증업체명 | 0.912 | 1.000 | 0.856 | 0.912 | 0.526 | 0.755
인증기관 | 0.872 | 0.856 | 1.000 | 0.872 | 0.699 | 0.368
인증유효기간 | 1.000 | 0.912 | 0.872 | 1.000 | 0.628 | 0.642
원산지 | 0.628 | 0.526 | 0.699 | 0.628 | 1.000 | 0.158
비율 | 0.642 | 0.755 | 0.368 | 0.642 | 0.158 | 1.000

Variable | 최초인증일자 | 인증유효기간 | 인증업체명 | 원산지 | 비율 | 인증기관
최초인증일자 | 1.000 | 1.000 | 0.912 | 0.628 | 0.642 | 0.872
인증유효기간 | 1.000 | 1.000 | 0.912 | 0.628 | 0.642 | 0.872
인증업체명 | 0.912 | 0.912 | 1.000 | 0.526 | 0.755 | 0.856
원산지 | 0.628 | 0.628 | 0.526 | 1.000 | 0.158 | 0.699
비율 | 0.642 | 0.642 | 0.755 | 0.158 | 1.000 | 0.368
인증기관 | 0.872 | 0.872 | 0.856 | 0.699 | 0.368 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 인증번호 | 최초인증일자 | 인증유효기간 | 인증업체명 | 제품명 | 원산지 | 비율 | 인증기관
0 | 푸름 원산지 제1호 | 2021-05-27 | 2024-05-26 | 농업회사법인 온샘㈜ | 안동참마분말 | 국산 | 100% | 주식회사 푸름인증원
1 | 푸름 원산지 제2호 | 2021-07-12 | 2024-07-11 | 태장고 | 새뜸된장 | 국산 | 100% | 주식회사 푸름인증원
2 | 푸름 원산지 제3호 | 2021-07-12 | 2024-07-11 | 태장고 | 새뜸고추장 | 국산 | 100% | 주식회사 푸름인증원
3 | 푸름 원산지 제4호 | 2021-08-02 | 2024-08-01 | 지보농협참기름가공공장 | 참기름 | 국산 | 100% | 주식회사 푸름인증원
4 | 푸름 원산지 제5호 | 2021-08-02 | 2024-08-01 | 지보농협참기름가공공장 | 들기름 | 국산 | 100% | 주식회사 푸름인증원
5 | 푸름 원산지 제6호 | 2021-08-02 | 2024-08-01 | 지보농협참기름가공공장 | 볶음참깨 | 국산 | 100% | 주식회사 푸름인증원
6 | 푸름 원산지 제7호 | 2021-08-25 | 2024-08-24 | 대풍년영농조합법인 | 친정엄마꾸러미 요리 앤 고춧가루 | 국산 | 100% | 주식회사 푸름인증원
7 | 푸름 원산지 제8호 | 2021-11-22 | 2024-11-21 | 농업회사법인 어울림(유) | 구운요술콩 백태 | 국산 | 100% | 주식회사 푸름인증원
8 | 푸름 원산지 제9호 | 2021-11-22 | 2024-11-21 | 농업회사법인 어울림(유) | 구운요술콩 서리태 | 국산 | 100% | 주식회사 푸름인증원
9 | 푸름 원산지 제10호 | 2021-11-22 | 2024-11-21 | 농업회사법인 어울림(유) | 위트밀 구운통곡물 시리얼 | 국산 | 100% | 주식회사 푸름인증원

Index | 인증번호 | 최초인증일자 | 인증유효기간 | 인증업체명 | 제품명 | 원산지 | 비율 | 인증기관
143 | 식품연 원산지 제183호 | 2021-03-02 | 2024-03-01 | 남영양농협가공사업소 | 햇살촌영양청결고춧가루 | 국산 | 100% | 한국식품연구원
144 | 식품연 원산지 제184호 | 2021-03-02 | 2024-03-01 | 남영양농협가공사업소 | 참고춧가루 | 국산 | 100% | 한국식품연구원
145 | 식품연 원산지 제185호 | 2021-03-09 | 2024-03-08 | 신태인농협청결고춧가루가공공장 | 단풍고춧가루 | 국산 | 100% | 한국식품연구원
146 | 식품연 원산지 제186호 | 2021-05-10 | 2024-05-09 | 안면도농협고추가공공장 | 안면도태양초고춧가루 | 국산 | 100% | 한국식품연구원
147 | 식품연 원산지 제187호 | 2021-06-14 | 2024-06-13 | 맑은샘자연교육농원 | 조금자채소잡곡 | 국산 | 100% | 한국식품연구원
148 | 식품연 원산지 제188호 | 2021-06-14 | 2024-06-13 | 맑은샘자연교육농원 | 조금자채소볼 | 국산 | 100% | 한국식품연구원
149 | 식품연 원산지 제189호 | 2021-07-12 | 2024-07-11 | 서일농원 | 서분례명인 청국장 | 국산 | 100% | 한국식품연구원
150 | 식품연 원산지 제190호 | 2021-07-12 | 2024-07-11 | 서일농원 | 서분례명인 매운청국장 | 국산 | 100% | 한국식품연구원
151 | 식품연 원산지 제191호 | 2021-07-12 | 2024-07-11 | 서일농원 | 서분례명인 마늘청국장 | 국산 | 100% | 한국식품연구원
152 | 식품연 원산지 제192호 | 2021-07-12 | 2024-07-11 | 서일농원 | 서분례명인 들깨청국장 | 국산 | 100% | 한국식품연구원
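
In the sample rows above, each 인증유효기간 falls exactly three years minus one day after the corresponding 최초인증일자, which would help explain the correlation of 1.000 reported between the two columns. This is an observation about the sample rows only, not a documented rule for the whole dataset. The sketch below checks it; `expected_expiry` and its three-year default are assumptions made for illustration.

```python
from datetime import date, timedelta

def expected_expiry(first_certified: date, years: int = 3) -> date:
    """Same calendar day `years` later, minus one day (assumed rule).

    Note: date.replace raises ValueError for 29 February in a non-leap
    target year; no such date occurs in the sample rows.
    """
    return first_certified.replace(year=first_certified.year + years) - timedelta(days=1)

# Distinct (최초인증일자, 인증유효기간) pairs copied from the sample rows.
pairs = [
    ("2021-05-27", "2024-05-26"),
    ("2021-07-12", "2024-07-11"),
    ("2021-08-02", "2024-08-01"),
    ("2021-08-25", "2024-08-24"),
    ("2021-11-22", "2024-11-21"),
    ("2021-03-02", "2024-03-01"),
    ("2021-03-09", "2024-03-08"),
    ("2021-05-10", "2024-05-09"),
    ("2021-06-14", "2024-06-13"),
]
matches = [
    expected_expiry(date.fromisoformat(start)) == date.fromisoformat(end)
    for start, end in pairs
]
print(all(matches))
```

If the pattern held for every row, one of the two date columns would carry no extra information and could be derived from the other. Confirming that would require the full 153 rows rather than this sample.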