Overview

Dataset statistics

Number of variables7
Number of observations164
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.1 KiB
Average record size in memory56.8 B

Variable types

Text2
Categorical5

Dataset

Description탁주, 약주, 청주, 과실주 등의 술 품질인증 관리 정보(인증번호, 인증기관, 인증업체, 품목명, 인증일자, 인증시작일, 인증종료일 등)
Author국립농산물품질관리원
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220204000000001687

Alerts

인증기관 has constant value ""Constant
인증시작일자 is highly overall correlated with 인증일자 and 1 other fieldsHigh correlation
인증종료일자 is highly overall correlated with 인증일자 and 1 other fieldsHigh correlation
인증일자 is highly overall correlated with 인증시작일자 and 1 other fieldsHigh correlation
인증번호 has unique valuesUnique

Reproduction

Analysis started2024-03-23 07:55:45.439719
Analysis finished2024-03-23 07:55:46.556300
Duration1.12 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인증번호
Text

UNIQUE 

Distinct164
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-23T07:55:46.781510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters1640
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique164 ?
Unique (%)100.0%

Sample

1st row국가지정-가-004
2nd row국가지정-가-005
3rd row국가지정-가-006
4th row국가지정-가-014
5th row국가지정-가-022
ValueCountFrequency (%)
국가지정-가-004 1
 
0.6%
국가지정-가-218 1
 
0.6%
국가지정-가-228 1
 
0.6%
국가지정-가-221 1
 
0.6%
국가지정-가-222 1
 
0.6%
국가지정-가-223 1
 
0.6%
국가지정-가-224 1
 
0.6%
국가지정-가-225 1
 
0.6%
국가지정-가-226 1
 
0.6%
국가지정-가-227 1
 
0.6%
Other values (154) 154
93.9%
2024-03-23T07:55:47.536667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
328
20.0%
- 328
20.0%
164
10.0%
164
10.0%
164
10.0%
2 113
 
6.9%
1 85
 
5.2%
0 66
 
4.0%
5 39
 
2.4%
7 38
 
2.3%
Other values (5) 151
9.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 820
50.0%
Decimal Number 492
30.0%
Dash Punctuation 328
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 113
23.0%
1 85
17.3%
0 66
13.4%
5 39
 
7.9%
7 38
 
7.7%
4 35
 
7.1%
3 33
 
6.7%
6 33
 
6.7%
8 28
 
5.7%
9 22
 
4.5%
Other Letter
ValueCountFrequency (%)
328
40.0%
164
20.0%
164
20.0%
164
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 328
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 820
50.0%
Common 820
50.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 328
40.0%
2 113
 
13.8%
1 85
 
10.4%
0 66
 
8.0%
5 39
 
4.8%
7 38
 
4.6%
4 35
 
4.3%
3 33
 
4.0%
6 33
 
4.0%
8 28
 
3.4%
Hangul
ValueCountFrequency (%)
328
40.0%
164
20.0%
164
20.0%
164
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 820
50.0%
ASCII 820
50.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
328
40.0%
164
20.0%
164
20.0%
164
20.0%
ASCII
ValueCountFrequency (%)
- 328
40.0%
2 113
 
13.8%
1 85
 
10.4%
0 66
 
8.0%
5 39
 
4.8%
7 38
 
4.6%
4 35
 
4.3%
3 33
 
4.0%
6 33
 
4.0%
8 28
 
3.4%

인증기관
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
한국식품연구원
164 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한국식품연구원
2nd row한국식품연구원
3rd row한국식품연구원
4th row한국식품연구원
5th row한국식품연구원

Common Values

ValueCountFrequency (%)
한국식품연구원 164
100.0%

Length

2024-03-23T07:55:48.041627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T07:55:48.408401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국식품연구원 164
100.0%
Distinct76
Distinct (%)46.3%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-23T07:55:48.993918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15.5
Mean length10.006098
Min length3

Characters and Unicode

Total characters1641
Distinct characters155
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)18.9%

Sample

1st row서울장수주식회사
2nd row서울장수주식회사
3rd row구암농산
4th row울산탁주
5th row한주양조
ValueCountFrequency (%)
농업회사법인 32
 
13.8%
주식회사 10
 
4.3%
국순당여주명주 7
 
3.0%
주)제이엘 7
 
3.0%
제주샘영농조합법인 6
 
2.6%
양주골 6
 
2.6%
이가전통주 6
 
2.6%
농업회사법인(주)죽향도가 6
 
2.6%
주)화요 5
 
2.2%
주)조은술세종 5
 
2.2%
Other values (77) 142
61.2%
2024-03-23T07:55:50.457567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
142
 
8.7%
93
 
5.7%
88
 
5.4%
88
 
5.4%
86
 
5.2%
83
 
5.1%
68
 
4.1%
65
 
4.0%
( 61
 
3.7%
) 61
 
3.7%
Other values (145) 806
49.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1436
87.5%
Space Separator 68
 
4.1%
Open Punctuation 61
 
3.7%
Close Punctuation 61
 
3.7%
Decimal Number 9
 
0.5%
Other Symbol 4
 
0.2%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
142
 
9.9%
93
 
6.5%
88
 
6.1%
88
 
6.1%
86
 
6.0%
83
 
5.8%
65
 
4.5%
60
 
4.2%
34
 
2.4%
31
 
2.2%
Other values (137) 666
46.4%
Decimal Number
ValueCountFrequency (%)
2 5
55.6%
1 4
44.4%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
68
100.0%
Open Punctuation
ValueCountFrequency (%)
( 61
100.0%
Close Punctuation
ValueCountFrequency (%)
) 61
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1440
87.8%
Common 199
 
12.1%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
142
 
9.9%
93
 
6.5%
88
 
6.1%
88
 
6.1%
86
 
6.0%
83
 
5.8%
65
 
4.5%
60
 
4.2%
34
 
2.4%
31
 
2.2%
Other values (138) 670
46.5%
Common
ValueCountFrequency (%)
68
34.2%
( 61
30.7%
) 61
30.7%
2 5
 
2.5%
1 4
 
2.0%
Latin
ValueCountFrequency (%)
L 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1436
87.5%
ASCII 201
 
12.2%
None 4
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
142
 
9.9%
93
 
6.5%
88
 
6.1%
88
 
6.1%
86
 
6.0%
83
 
5.8%
65
 
4.5%
60
 
4.2%
34
 
2.4%
31
 
2.2%
Other values (137) 666
46.4%
ASCII
ValueCountFrequency (%)
68
33.8%
( 61
30.3%
) 61
30.3%
2 5
 
2.5%
1 4
 
2.0%
L 1
 
0.5%
B 1
 
0.5%
None
ValueCountFrequency (%)
4
100.0%

품목명
Categorical

Distinct8
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
탁주
60 
증류식소주
36 
약주
28 
과실주
18 
일반증류주
10 
Other values (3)
12 

Length

Max length5
Median length2
Mean length3.0914634
Min length2

Unique

Unique1 ?
Unique (%)0.6%

Sample

1st row탁주
2nd row살균탁주
3rd row살균탁주
4th row탁주
5th row탁주

Common Values

ValueCountFrequency (%)
탁주 60
36.6%
증류식소주 36
22.0%
약주 28
17.1%
과실주 18
 
11.0%
일반증류주 10
 
6.1%
기타주류 6
 
3.7%
살균탁주 5
 
3.0%
리큐르 1
 
0.6%

Length

2024-03-23T07:55:50.930992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T07:55:51.337096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
탁주 60
36.6%
증류식소주 36
22.0%
약주 28
17.1%
과실주 18
 
11.0%
일반증류주 10
 
6.1%
기타주류 6
 
3.7%
살균탁주 5
 
3.0%
리큐르 1
 
0.6%

인증일자
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)14.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2022-10-05
34 
2022-06-15
27 
2021-07-14
12 
2020-08-11
11 
2021-08-22
Other values (19)
71 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique6 ?
Unique (%)3.7%

Sample

1st row2022-10-05
2nd row2022-10-05
3rd row2022-10-05
4th row2022-10-05
5th row2022-10-05

Common Values

ValueCountFrequency (%)
2022-10-05 34
20.7%
2022-06-15 27
16.5%
2021-07-14 12
 
7.3%
2020-08-11 11
 
6.7%
2021-08-22 9
 
5.5%
2017-08-11 8
 
4.9%
2020-08-14 8
 
4.9%
2017-09-14 7
 
4.3%
2021-09-27 7
 
4.3%
2021-12-21 6
 
3.7%
Other values (14) 35
21.3%

Length

2024-03-23T07:55:51.820510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2022-10-05 34
20.7%
2022-06-15 27
16.5%
2021-07-14 12
 
7.3%
2020-08-11 11
 
6.7%
2021-08-22 9
 
5.5%
2017-08-11 8
 
4.9%
2020-08-14 8
 
4.9%
2017-09-14 7
 
4.3%
2021-09-27 7
 
4.3%
2021-12-21 6
 
3.7%
Other values (14) 35
21.3%

인증시작일자
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2022-10-05
34 
2022-06-15
26 
2021-07-14
12 
2020-08-11
11 
2021-08-22
Other values (18)
72 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique5 ?
Unique (%)3.0%

Sample

1st row2022-10-05
2nd row2022-10-05
3rd row2022-10-05
4th row2022-10-05
5th row2022-10-05

Common Values

ValueCountFrequency (%)
2022-10-05 34
20.7%
2022-06-15 26
15.9%
2021-07-14 12
 
7.3%
2020-08-11 11
 
6.7%
2021-08-22 9
 
5.5%
2020-09-13 8
 
4.9%
2020-08-14 8
 
4.9%
2021-01-26 7
 
4.3%
2021-09-27 7
 
4.3%
2021-10-05 6
 
3.7%
Other values (13) 36
22.0%

Length

2024-03-23T07:55:52.217119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2022-10-05 34
20.7%
2022-06-15 26
15.9%
2021-07-14 12
 
7.3%
2020-08-11 11
 
6.7%
2021-08-22 9
 
5.5%
2020-09-13 8
 
4.9%
2020-08-14 8
 
4.9%
2021-01-26 7
 
4.3%
2021-09-27 7
 
4.3%
2021-10-05 6
 
3.7%
Other values (13) 36
22.0%

인증종료일자
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2025-10-04
34 
2025-06-14
26 
2024-07-13
11 
2023-08-10
11 
2024-08-21
Other values (18)
73 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique4 ?
Unique (%)2.4%

Sample

1st row2025-10-04
2nd row2025-10-04
3rd row2025-10-04
4th row2025-10-04
5th row2025-10-04

Common Values

ValueCountFrequency (%)
2025-10-04 34
20.7%
2025-06-14 26
15.9%
2024-07-13 11
 
6.7%
2023-08-10 11
 
6.7%
2024-08-21 9
 
5.5%
2023-09-12 8
 
4.9%
2023-08-13 8
 
4.9%
2024-01-25 7
 
4.3%
2024-09-26 7
 
4.3%
2024-10-04 6
 
3.7%
Other values (13) 37
22.6%

Length

2024-03-23T07:55:52.583425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2025-10-04 34
20.7%
2025-06-14 26
15.9%
2024-07-13 11
 
6.7%
2023-08-10 11
 
6.7%
2024-08-21 9
 
5.5%
2023-09-12 8
 
4.9%
2023-08-13 8
 
4.9%
2024-01-25 7
 
4.3%
2024-09-26 7
 
4.3%
2024-10-04 6
 
3.7%
Other values (13) 37
22.6%

Correlations

2024-03-23T07:55:52.830624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인증업체품목명인증일자인증시작일자인증종료일자
인증업체1.0000.9230.9870.9900.990
품목명0.9231.0000.7060.6680.679
인증일자0.9870.7061.0000.9970.996
인증시작일자0.9900.6680.9971.0001.000
인증종료일자0.9900.6790.9961.0001.000
2024-03-23T07:55:53.117880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목명인증시작일자인증종료일자인증일자
품목명1.0000.3240.3330.313
인증시작일자0.3241.0000.9730.942
인증종료일자0.3330.9731.0000.928
인증일자0.3130.9420.9281.000
2024-03-23T07:55:53.382794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목명인증일자인증시작일자인증종료일자
품목명1.0000.3130.3240.333
인증일자0.3131.0000.9420.928
인증시작일자0.3240.9421.0000.973
인증종료일자0.3330.9280.9731.000

Missing values

2024-03-23T07:55:46.258775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T07:55:46.475703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인증번호인증기관인증업체품목명인증일자인증시작일자인증종료일자
0국가지정-가-004한국식품연구원서울장수주식회사탁주2022-10-052022-10-052025-10-04
1국가지정-가-005한국식품연구원서울장수주식회사살균탁주2022-10-052022-10-052025-10-04
2국가지정-가-006한국식품연구원구암농산살균탁주2022-10-052022-10-052025-10-04
3국가지정-가-014한국식품연구원울산탁주탁주2022-10-052022-10-052025-10-04
4국가지정-가-022한국식품연구원한주양조탁주2022-10-052022-10-052025-10-04
5국가지정-가-027한국식품연구원배상면주가고창LB주식회사과실주2022-10-052022-10-052025-10-04
6국가지정-가-028한국식품연구원여수주조공사탁주2022-10-052022-10-052025-10-04
7국가지정-가-032한국식품연구원순천주조탁주2022-10-052022-10-052025-10-04
8국가지정-가-035한국식품연구원순천주조탁주2022-10-052022-10-052025-10-04
9국가지정-가-043한국식품연구원(주)제주막걸리탁주2022-10-052022-10-052025-10-04
인증번호인증기관인증업체품목명인증일자인증시작일자인증종료일자
154국가지정-가-270한국식품연구원대구탁주합동제1공장탁주2022-10-052022-10-052025-10-04
155국가지정-가-271한국식품연구원대구탁주합동제1공장탁주2022-10-052022-10-052025-10-04
156국가지정-가-272한국식품연구원주식회사 대구불로탁주탁주2022-10-052022-10-052025-10-04
157국가지정-가-273한국식품연구원주식회사 대구불로탁주탁주2022-10-052022-10-052025-10-04
158국가지정-가-274한국식품연구원농업회사법인 담을증류식소주2022-10-052022-10-052025-10-04
159국가지정-가-275한국식품연구원협동조합 모월증류식소주2022-10-052022-10-052025-10-04
160국가지정-가-276한국식품연구원농업회사법인 미담(유)탁주2022-10-252022-10-252025-10-24
161국가지정-가-277한국식품연구원농업회사법인 미담(유)약주2022-10-252022-10-252025-10-24
162국가지정-가-278한국식품연구원농업회사법인 (주)술아원약주2022-10-252022-10-252025-10-24
163국가지정-가-279한국식품연구원(주)국순당살균탁주2022-12-142022-12-142025-12-13