Overview

Dataset statistics

Number of variables7
Number of observations197
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.9 KiB
Average record size in memory56.7 B

Variable types

Text2
Categorical5

Dataset

Description탁주, 약주, 청주, 과실주 등의 술 품질인증 관리 정보(인증번호, 인증기관, 인증업체, 품목명, 인증일자, 인증시작일, 인증종료일 등)
Author국립농산물품질관리원
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220204000000001687

Alerts

인증기관 has constant value ""Constant
인증시작일자 is highly overall correlated with 인증일자 and 1 other fieldsHigh correlation
인증종료일자 is highly overall correlated with 인증일자 and 1 other fieldsHigh correlation
인증일자 is highly overall correlated with 인증시작일자 and 1 other fieldsHigh correlation
인증번호 has unique valuesUnique

Reproduction

Analysis started2024-03-23 07:56:04.430994
Analysis finished2024-03-23 07:56:06.137196
Duration1.71 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인증번호
Text

UNIQUE 

Distinct197
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-03-23T07:56:06.503670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters1970
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique197 ?
Unique (%)100.0%

Sample

1st row국가지정-가-004
2nd row국가지정-가-005
3rd row국가지정-가-006
4th row국가지정-가-014
5th row국가지정-가-022
ValueCountFrequency (%)
국가지정-가-004 1
 
0.5%
국가지정-가-225 1
 
0.5%
국가지정-가-256 1
 
0.5%
국가지정-가-257 1
 
0.5%
국가지정-가-258 1
 
0.5%
국가지정-가-259 1
 
0.5%
국가지정-가-260 1
 
0.5%
국가지정-가-261 1
 
0.5%
국가지정-가-262 1
 
0.5%
국가지정-가-263 1
 
0.5%
Other values (187) 187
94.9%
2024-03-23T07:56:07.378701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
394
20.0%
- 394
20.0%
197
10.0%
197
10.0%
197
10.0%
2 130
 
6.6%
1 95
 
4.8%
0 75
 
3.8%
3 62
 
3.1%
5 41
 
2.1%
Other values (5) 188
9.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 985
50.0%
Decimal Number 591
30.0%
Dash Punctuation 394
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 130
22.0%
1 95
16.1%
0 75
12.7%
3 62
10.5%
5 41
 
6.9%
7 40
 
6.8%
8 40
 
6.8%
4 39
 
6.6%
6 36
 
6.1%
9 33
 
5.6%
Other Letter
ValueCountFrequency (%)
394
40.0%
197
20.0%
197
20.0%
197
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 394
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 985
50.0%
Common 985
50.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 394
40.0%
2 130
 
13.2%
1 95
 
9.6%
0 75
 
7.6%
3 62
 
6.3%
5 41
 
4.2%
7 40
 
4.1%
8 40
 
4.1%
4 39
 
4.0%
6 36
 
3.7%
Hangul
ValueCountFrequency (%)
394
40.0%
197
20.0%
197
20.0%
197
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 985
50.0%
ASCII 985
50.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
394
40.0%
197
20.0%
197
20.0%
197
20.0%
ASCII
ValueCountFrequency (%)
- 394
40.0%
2 130
 
13.2%
1 95
 
9.6%
0 75
 
7.6%
3 62
 
6.3%
5 41
 
4.2%
7 40
 
4.1%
8 40
 
4.1%
4 39
 
4.0%
6 36
 
3.7%

인증기관
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
한국식품연구원
197 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한국식품연구원
2nd row한국식품연구원
3rd row한국식품연구원
4th row한국식품연구원
5th row한국식품연구원

Common Values

ValueCountFrequency (%)
한국식품연구원 197
100.0%

Length

2024-03-23T07:56:07.655746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T07:56:07.960078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국식품연구원 197
100.0%
Distinct93
Distinct (%)47.2%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-03-23T07:56:08.394017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length13
Mean length8.8324873
Min length2

Characters and Unicode

Total characters1740
Distinct characters169
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)19.3%

Sample

1st row서울장수주식회사
2nd row서울장수주식회사
3rd row구암농산
4th row울산탁주
5th row한주양조
ValueCountFrequency (%)
농업회사법인 32
 
11.9%
주식회사 10
 
3.7%
인산농장 8
 
3.0%
국순당여주명주 7
 
2.6%
양주골 6
 
2.2%
이가전통주 6
 
2.2%
제주샘영농조합법인 6
 
2.2%
농업회사법인(주)죽향도가 6
 
2.2%
모월 5
 
1.9%
서울장수주식회사 5
 
1.9%
Other values (93) 179
66.3%
2024-03-23T07:56:09.200366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
139
 
8.0%
94
 
5.4%
85
 
4.9%
82
 
4.7%
77
 
4.4%
74
 
4.3%
73
 
4.2%
64
 
3.7%
56
 
3.2%
) 54
 
3.1%
Other values (159) 942
54.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1544
88.7%
Space Separator 73
 
4.2%
Close Punctuation 54
 
3.1%
Open Punctuation 54
 
3.1%
Decimal Number 9
 
0.5%
Other Symbol 4
 
0.2%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
139
 
9.0%
94
 
6.1%
85
 
5.5%
82
 
5.3%
77
 
5.0%
74
 
4.8%
64
 
4.1%
56
 
3.6%
39
 
2.5%
35
 
2.3%
Other values (151) 799
51.7%
Decimal Number
ValueCountFrequency (%)
2 5
55.6%
1 4
44.4%
Uppercase Letter
ValueCountFrequency (%)
B 1
50.0%
L 1
50.0%
Space Separator
ValueCountFrequency (%)
73
100.0%
Close Punctuation
ValueCountFrequency (%)
) 54
100.0%
Open Punctuation
ValueCountFrequency (%)
( 54
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1548
89.0%
Common 190
 
10.9%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
139
 
9.0%
94
 
6.1%
85
 
5.5%
82
 
5.3%
77
 
5.0%
74
 
4.8%
64
 
4.1%
56
 
3.6%
39
 
2.5%
35
 
2.3%
Other values (152) 803
51.9%
Common
ValueCountFrequency (%)
73
38.4%
) 54
28.4%
( 54
28.4%
2 5
 
2.6%
1 4
 
2.1%
Latin
ValueCountFrequency (%)
B 1
50.0%
L 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1544
88.7%
ASCII 192
 
11.0%
None 4
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
139
 
9.0%
94
 
6.1%
85
 
5.5%
82
 
5.3%
77
 
5.0%
74
 
4.8%
64
 
4.1%
56
 
3.6%
39
 
2.5%
35
 
2.3%
Other values (151) 799
51.7%
ASCII
ValueCountFrequency (%)
73
38.0%
) 54
28.1%
( 54
28.1%
2 5
 
2.6%
1 4
 
2.1%
B 1
 
0.5%
L 1
 
0.5%
None
ValueCountFrequency (%)
4
100.0%

품목명
Categorical

Distinct8
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
탁주
67 
약주
45 
증류식소주
42 
과실주
19 
일반증류주
10 
Other values (3)
14 

Length

Max length5
Median length2
Mean length3.0253807
Min length2

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row탁주
2nd row살균탁주
3rd row살균탁주
4th row탁주
5th row탁주

Common Values

ValueCountFrequency (%)
탁주 67
34.0%
약주 45
22.8%
증류식소주 42
21.3%
과실주 19
 
9.6%
일반증류주 10
 
5.1%
기타주류 8
 
4.1%
살균탁주 5
 
2.5%
리큐르 1
 
0.5%

Length

2024-03-23T07:56:09.592151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T07:56:09.936101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
탁주 67
34.0%
약주 45
22.8%
증류식소주 42
21.3%
과실주 19
 
9.6%
일반증류주 10
 
5.1%
기타주류 8
 
4.1%
살균탁주 5
 
2.5%
리큐르 1
 
0.5%

인증일자
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)14.7%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2022-10-05
34 
2023-05-11
34 
2022-06-15
26 
2023-08-11
13 
2021-08-22
Other values (24)
81 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique7 ?
Unique (%)3.6%

Sample

1st row2022-10-05
2nd row2022-10-05
3rd row2022-10-05
4th row2022-10-05
5th row2022-10-05

Common Values

ValueCountFrequency (%)
2022-10-05 34
17.3%
2023-05-11 34
17.3%
2022-06-15 26
13.2%
2023-08-11 13
 
6.6%
2021-08-22 9
 
4.6%
2023-08-14 7
 
3.6%
2021-09-27 7
 
3.6%
2021-07-14 7
 
3.6%
2021-10-05 6
 
3.0%
2023-09-13 6
 
3.0%
Other values (19) 48
24.4%

Length

2024-03-23T07:56:10.308445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2022-10-05 34
17.3%
2023-05-11 34
17.3%
2022-06-15 26
13.2%
2023-08-11 13
 
6.6%
2021-08-22 9
 
4.6%
2023-08-14 7
 
3.6%
2021-09-27 7
 
3.6%
2021-07-14 7
 
3.6%
2023-09-13 6
 
3.0%
2021-10-05 6
 
3.0%
Other values (19) 48
24.4%

인증시작일자
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)14.2%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2022-10-05
34 
2023-05-11
34 
2022-06-15
25 
2023-08-11
13 
2021-08-22
Other values (23)
82 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique6 ?
Unique (%)3.0%

Sample

1st row2022-10-05
2nd row2022-10-05
3rd row2022-10-05
4th row2022-10-05
5th row2022-10-05

Common Values

ValueCountFrequency (%)
2022-10-05 34
17.3%
2023-05-11 34
17.3%
2022-06-15 25
12.7%
2023-08-11 13
 
6.6%
2021-08-22 9
 
4.6%
2021-09-27 7
 
3.6%
2021-07-14 7
 
3.6%
2023-08-14 7
 
3.6%
2021-10-05 6
 
3.0%
2023-09-13 6
 
3.0%
Other values (18) 49
24.9%

Length

2024-03-23T07:56:10.784224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2022-10-05 34
17.3%
2023-05-11 34
17.3%
2022-06-15 25
12.7%
2023-08-11 13
 
6.6%
2021-08-22 9
 
4.6%
2021-09-27 7
 
3.6%
2021-07-14 7
 
3.6%
2023-08-14 7
 
3.6%
2023-09-13 6
 
3.0%
2021-10-05 6
 
3.0%
Other values (18) 49
24.9%

인증종료일자
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)14.2%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2025-10-04
34 
2026-05-10
34 
2025-06-14
25 
2026-08-10
13 
2024-08-21
Other values (23)
82 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique5 ?
Unique (%)2.5%

Sample

1st row2025-10-04
2nd row2025-10-04
3rd row2025-10-04
4th row2025-10-04
5th row2025-10-04

Common Values

ValueCountFrequency (%)
2025-10-04 34
17.3%
2026-05-10 34
17.3%
2025-06-14 25
12.7%
2026-08-10 13
 
6.6%
2024-08-21 9
 
4.6%
2024-09-26 7
 
3.6%
2026-08-13 7
 
3.6%
2024-07-13 6
 
3.0%
2024-10-04 6
 
3.0%
2026-09-12 6
 
3.0%
Other values (18) 50
25.4%

Length

2024-03-23T07:56:11.107868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2025-10-04 34
17.3%
2026-05-10 34
17.3%
2025-06-14 25
12.7%
2026-08-10 13
 
6.6%
2024-08-21 9
 
4.6%
2024-09-26 7
 
3.6%
2026-08-13 7
 
3.6%
2026-09-12 6
 
3.0%
2024-10-04 6
 
3.0%
2024-07-13 6
 
3.0%
Other values (18) 50
25.4%

Correlations

2024-03-23T07:56:11.322089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인증업체품목명인증일자인증시작일자인증종료일자
인증업체1.0000.9240.9880.9920.992
품목명0.9241.0000.6440.6650.694
인증일자0.9880.6441.0000.9990.999
인증시작일자0.9920.6650.9991.0001.000
인증종료일자0.9920.6940.9991.0001.000
2024-03-23T07:56:11.595620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목명인증시작일자인증종료일자인증일자
품목명1.0000.3090.3320.300
인증시작일자0.3091.0000.9780.975
인증종료일자0.3320.9781.0000.964
인증일자0.3000.9750.9641.000
2024-03-23T07:56:11.841959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목명인증일자인증시작일자인증종료일자
품목명1.0000.3000.3090.332
인증일자0.3001.0000.9750.964
인증시작일자0.3090.9751.0000.978
인증종료일자0.3320.9640.9781.000

Missing values

2024-03-23T07:56:05.558183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T07:56:06.014816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인증번호인증기관인증업체품목명인증일자인증시작일자인증종료일자
0국가지정-가-004한국식품연구원서울장수주식회사탁주2022-10-052022-10-052025-10-04
1국가지정-가-005한국식품연구원서울장수주식회사살균탁주2022-10-052022-10-052025-10-04
2국가지정-가-006한국식품연구원구암농산살균탁주2022-10-052022-10-052025-10-04
3국가지정-가-014한국식품연구원울산탁주탁주2022-10-052022-10-052025-10-04
4국가지정-가-022한국식품연구원한주양조탁주2022-10-052022-10-052025-10-04
5국가지정-가-027한국식품연구원배상면주가고창LB주식회사과실주2022-10-052022-10-052025-10-04
6국가지정-가-028한국식품연구원여수주조공사탁주2022-10-052022-10-052025-10-04
7국가지정-가-032한국식품연구원순천주조탁주2022-10-052022-10-052025-10-04
8국가지정-가-035한국식품연구원순천주조탁주2022-10-052022-10-052025-10-04
9국가지정-가-043한국식품연구원(주)제주막걸리탁주2022-10-052022-10-052025-10-04
인증번호인증기관인증업체품목명인증일자인증시작일자인증종료일자
187국가지정-가-317한국식품연구원농업회사법인 (주)벗드림탁주2023-07-242023-07-242026-07-23
188국가지정-가-318한국식품연구원농업회사법인 (주)벗드림탁주2023-07-242023-07-242026-07-23
189국가지정-가-319한국식품연구원농업회사법인 (주)벗드림약주2023-07-242023-07-242026-07-23
190국가지정-가-320한국식품연구원한강주조약주2023-08-112023-08-112026-08-10
191국가지정-가-321한국식품연구원인산농장 죽림지점약주2023-08-112023-08-112026-08-10
192국가지정-가-322한국식품연구원인산농장 죽림지점약주2023-08-112023-08-112026-08-10
193국가지정-가-323한국식품연구원인산농장 죽림지점증류식소주2023-08-112023-08-112026-08-10
194국가지정-가-324한국식품연구원인산농장 죽림지점증류식소주2023-08-112023-08-112026-08-10
195국가지정-가-325한국식품연구원인산농장 죽림지점일반증류주2023-08-112023-08-112026-08-10
196국가지정-가-326한국식품연구원협동조합 모월증류식소주2023-11-062023-11-062026-11-05