Overview

Dataset statistics

Number of variables: 7
Number of observations: 165
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.2 KiB
Average record size in memory: 56.8 B
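
The table-level counts above can be recomputed from raw rows. The following is a minimal sketch in plain Python, not the profiler's own implementation: the `dataset_stats` helper is hypothetical, the three-row excerpt uses only three of the seven columns, and treating `None` or an empty string as "missing" is an assumption.

```python
from collections import Counter

def dataset_stats(rows, columns):
    """Return basic table-level statistics for a list of row tuples."""
    n_cells = len(rows) * len(columns)
    # Assumption: a cell is "missing" when it is None or an empty string.
    missing = sum(1 for row in rows for cell in row if cell is None or cell == "")
    # Every repeat of an already-seen row counts as one duplicate row.
    duplicates = sum(count - 1 for count in Counter(rows).values())
    return {
        "variables": len(columns),
        "observations": len(rows),
        "missing_cells": missing,
        "missing_cells_pct": 100.0 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicates,
    }

# Illustrative excerpt (three columns only).
columns = ["인증번호", "인증업체", "품목명"]
rows = [
    ("국가지정-가-004", "서울장수주식회사", "탁주"),
    ("국가지정-가-005", "서울장수주식회사", "살균탁주"),
    ("국가지정-가-006", "구암농산", "살균탁주"),
]
stats = dataset_stats(rows, columns)
print(stats)
```

Rows are compared as whole tuples, so two records count as duplicates only when every column matches.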

Variable types

Text: 2
Categorical: 5

Dataset

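Description: Quality certification management information for alcoholic beverages such as takju, yakju, cheongju, and fruit wine (certification number, certification body, certified company, product name, certification date, certification start date, certification end date, etc.)
Author: 국립농산물품질관리원 (National Agricultural Products Quality Management Service)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220204000000001687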

Alerts

인증기관 has constant value "한국식품연구원" (Constant)
인증시작일자 is highly overall correlated with 인증일자 and 1 other field (High correlation)
인증종료일자 is highly overall correlated with 인증일자 and 1 other field (High correlation)
인증일자 is highly overall correlated with 인증시작일자 and 1 other field (High correlation)
인증번호 has unique values (Unique)
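
The Constant and Unique alerts follow directly from distinct-value counts. The sketch below illustrates that check in plain Python. It is not the profiler's code: `column_alerts` is a hypothetical helper, and the rows are a three-column excerpt of the data.

```python
def column_alerts(columns, rows):
    """Flag columns whose values are all identical or all distinct."""
    alerts = {}
    for index, name in enumerate(columns):
        values = [row[index] for row in rows]
        distinct = len(set(values))
        if distinct == 1:
            alerts[name] = "Constant"   # a single value repeated in every row
        elif distinct == len(values):
            alerts[name] = "Unique"     # no value occurs twice
    return alerts

# Illustrative excerpt (three columns only).
columns = ["인증번호", "인증기관", "품목명"]
rows = [
    ("국가지정-가-004", "한국식품연구원", "탁주"),
    ("국가지정-가-005", "한국식품연구원", "살균탁주"),
    ("국가지정-가-006", "한국식품연구원", "살균탁주"),
]
alerts = column_alerts(columns, rows)
print(alerts)
```

A constant column such as 인증기관 carries no information for analysis and can usually be dropped, while a unique column such as 인증번호 is a candidate key.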

Reproduction

Analysis started: 2024-03-23 07:55:24.138385
Analysis finished: 2024-03-23 07:55:25.853716
Duration: 1.72 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

인증번호
Text

UNIQUE 

Distinct: 165
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 1650
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
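
As a minimal illustration of that idea, the sketch below counts the Unicode general categories in one 인증번호 value using Python's standard `unicodedata` module. The `category_counts` helper is hypothetical, and the profiler's own implementation may differ.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Lo', 'Nd', 'Pd')."""
    return Counter(unicodedata.category(ch) for ch in text)

counts = category_counts("국가지정-가-004")
print(dict(counts))
```

Here `Lo` is Other Letter (the Hangul syllables), `Nd` is Decimal Number, and `Pd` is Dash Punctuation. Because every value in this column has the same ten-character layout, the per-value proportions carry over to the whole column.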

Unique

Unique: 165
Unique (%): 100.0%

Sample

1st row: 국가지정-가-004
2nd row: 국가지정-가-005
3rd row: 국가지정-가-006
4th row: 국가지정-가-014
5th row: 국가지정-가-022

Value               Count  Frequency (%)
국가지정-가-004     1      0.6%
국가지정-가-179     1      0.6%
국가지정-가-210     1      0.6%
국가지정-가-212     1      0.6%
국가지정-가-213     1      0.6%
국가지정-가-214     1      0.6%
국가지정-가-215     1      0.6%
국가지정-가-216     1      0.6%
국가지정-가-217     1      0.6%
국가지정-가-218     1      0.6%
Other values (155)  155    93.9%

Most occurring characters

Value             Count  Frequency (%)
가                330    20.0%
-                 330    20.0%
국                165    10.0%
지                165    10.0%
정                165    10.0%
2                 104    6.3%
1                 93     5.6%
0                 70     4.2%
5                 39     2.4%
4                 37     2.2%
Other values (5)  152    9.2%

Most occurring categories

Value             Count  Frequency (%)
Other Letter      825    50.0%
Decimal Number    495    30.0%
Dash Punctuation  330    20.0%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
2      104    21.0%
1      93     18.8%
0      70     14.1%
5      39     7.9%
4      37     7.5%
3      36     7.3%
6      35     7.1%
7      29     5.9%
8      28     5.7%
9      24     4.8%
Other Letter
Value  Count  Frequency (%)
가     330    40.0%
국     165    20.0%
지     165    20.0%
정     165    20.0%
Dash Punctuation
Value  Count  Frequency (%)
-      330    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  825    50.0%
Common  825    50.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
-      330    40.0%
2      104    12.6%
1      93     11.3%
0      70     8.5%
5      39     4.7%
4      37     4.5%
3      36     4.4%
6      35     4.2%
7      29     3.5%
8      28     3.4%
Hangul
Value  Count  Frequency (%)
가     330    40.0%
국     165    20.0%
지     165    20.0%
정     165    20.0%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  825    50.0%
ASCII   825    50.0%

Most frequent character per block

Hangul
Value  Count  Frequency (%)
가     330    40.0%
국     165    20.0%
지     165    20.0%
정     165    20.0%
ASCII
Value  Count  Frequency (%)
-      330    40.0%
2      104    12.6%
1      93     11.3%
0      70     8.5%
5      39     4.7%
4      37     4.5%
3      36     4.4%
6      35     4.2%
7      29     3.5%
8      28     3.4%

인증기관
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
한국식품연구원: 165

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 한국식품연구원
2nd row: 한국식품연구원
3rd row: 한국식품연구원
4th row: 한국식품연구원
5th row: 한국식품연구원

Common Values

Value           Count  Frequency (%)
한국식품연구원  165    100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of common values]

인증업체
Text

Distinct: 79
Distinct (%): 47.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 19
Median length: 15
Mean length: 9.6424242
Min length: 2

Characters and Unicode

Total characters: 1591
Distinct characters: 162
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 35
Unique (%): 21.2%

Sample

1st row: 서울장수주식회사
2nd row: 서울장수주식회사
3rd row: 구암농산
4th row: 울산탁주
5th row: 한주양조

Value                     Count  Frequency (%)
농업회사법인              28     12.4%
주식회사                  8      3.5%
주)제이엘                 7      3.1%
국순당여주명주            7      3.1%
양주골                    6      2.7%
이가전통주                6      2.7%
제주샘영농조합법인        6      2.7%
농업회사법인(주)죽향도가  5      2.2%
주)화요                   5      2.2%
주)조은술세종             5      2.2%
Other values (80)         143    63.3%

Most occurring characters

Value               Count  Frequency (%)
[missing]           137    8.6%
[missing]           86     5.4%
[missing]           83     5.2%
[missing]           81     5.1%
[missing]           81     5.1%
[missing]           78     4.9%
[missing]           61     3.8%
[missing]           61     3.8%
[missing]           60     3.8%
(                   59     3.7%
Other values (152)  804    50.5%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       1395   87.7%
Space Separator    61     3.8%
Open Punctuation   59     3.7%
Close Punctuation  59     3.7%
Decimal Number     11     0.7%
Other Symbol       4      0.3%
Uppercase Letter   2      0.1%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
[missing]           137    9.8%
[missing]           86     6.2%
[missing]           83     5.9%
[missing]           81     5.8%
[missing]           81     5.8%
[missing]           78     5.6%
[missing]           61     4.4%
[missing]           60     4.3%
[missing]           38     2.7%
[missing]           31     2.2%
Other values (142)  659    47.2%
Decimal Number
Value  Count  Frequency (%)
2      6      54.5%
1      3      27.3%
9      1      9.1%
3      1      9.1%
Uppercase Letter
Value  Count  Frequency (%)
L      1      50.0%
B      1      50.0%
Space Separator
Value    Count  Frequency (%)
(space)  61     100.0%
Open Punctuation
Value  Count  Frequency (%)
(      59     100.0%
Close Punctuation
Value  Count  Frequency (%)
)      59     100.0%
Other Symbol
Value      Count  Frequency (%)
[missing]  4      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  1399   87.9%
Common  190    11.9%
Latin   2      0.1%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
[missing]           137    9.8%
[missing]           86     6.1%
[missing]           83     5.9%
[missing]           81     5.8%
[missing]           81     5.8%
[missing]           78     5.6%
[missing]           61     4.4%
[missing]           60     4.3%
[missing]           38     2.7%
[missing]           31     2.2%
Other values (143)  663    47.4%
Common
Value    Count  Frequency (%)
(space)  61     32.1%
(        59     31.1%
)        59     31.1%
2        6      3.2%
1        3      1.6%
9        1      0.5%
3        1      0.5%
Latin
Value  Count  Frequency (%)
L      1      50.0%
B      1      50.0%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  1395   87.7%
ASCII   192    12.1%
None    4      0.3%

Most frequent character per block

Hangul
Value               Count  Frequency (%)
[missing]           137    9.8%
[missing]           86     6.2%
[missing]           83     5.9%
[missing]           81     5.8%
[missing]           81     5.8%
[missing]           78     5.6%
[missing]           61     4.4%
[missing]           60     4.3%
[missing]           38     2.7%
[missing]           31     2.2%
Other values (142)  659    47.2%
ASCII
Value    Count  Frequency (%)
(space)  61     31.8%
(        59     30.7%
)        59     30.7%
2        6      3.1%
1        3      1.6%
L        1      0.5%
B        1      0.5%
9        1      0.5%
3        1      0.5%
None
Value      Count  Frequency (%)
[missing]  4      100.0%

품목명
Categorical

Distinct: 8
Distinct (%): 4.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
탁주: 62
증류식소주: 35
약주: 28
과실주: 19
일반증류주: 10
Other values (3): 11

Length

Max length: 5
Median length: 2
Mean length: 3.0606061
Min length: 2

Unique

Unique: 1
Unique (%): 0.6%

Sample

1st row: 탁주
2nd row: 살균탁주
3rd row: 살균탁주
4th row: 탁주
5th row: 탁주

Common Values

Value       Count  Frequency (%)
탁주        62     37.6%
증류식소주  35     21.2%
약주        28     17.0%
과실주      19     11.5%
일반증류주  10     6.1%
기타주류    6      3.6%
살균탁주    4      2.4%
리큐르      1      0.6%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of common values]

인증일자
Categorical

HIGH CORRELATION 

Distinct: 36
Distinct (%): 21.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
2022-06-15: 27
2021-07-14: 12
2020-08-11: 11
2017-08-11: 10
2021-08-22: 9
Other values (31): 96

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 8
Unique (%): 4.8%

Sample

1st row: 2020-03-18
2nd row: 2020-03-18
3rd row: 2020-03-18
4th row: 2020-04-18
5th row: 2020-05-23

Common Values

Value              Count  Frequency (%)
2022-06-15         27     16.4%
2021-07-14         12     7.3%
2020-08-11         11     6.7%
2017-08-11         10     6.1%
2021-08-22         9      5.5%
2020-08-14         8      4.8%
2017-09-14         7      4.2%
2021-09-27         7      4.2%
2021-12-21         6      3.6%
2021-10-05         6      3.6%
Other values (26)  62     37.6%

Length

[Figure: histogram of lengths of the category]

인증시작일자
Categorical

HIGH CORRELATION 

Distinct: 21
Distinct (%): 12.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
2019-10-05: 34
2022-06-15: 26
2021-07-14: 12
2020-08-11: 11
2020-09-13: 10
Other values (16): 72

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 3
Unique (%): 1.8%

Sample

1st row: 2019-10-05
2nd row: 2019-10-05
3rd row: 2019-10-05
4th row: 2019-10-05
5th row: 2019-10-05

Common Values

Value              Count  Frequency (%)
2019-10-05         34     20.6%
2022-06-15         26     15.8%
2021-07-14         12     7.3%
2020-08-11         11     6.7%
2020-09-13         10     6.1%
2021-08-22         9      5.5%
2020-08-14         8      4.8%
2019-12-26         7      4.2%
2021-01-26         7      4.2%
2021-09-27         7      4.2%
Other values (11)  34     20.6%

Length

[Figure: histogram of lengths of the category]

인증종료일자
Categorical

HIGH CORRELATION 

Distinct: 21
Distinct (%): 12.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
2022-10-04: 34
2025-06-14: 26
2023-08-10: 11
2024-07-13: 11
2023-09-12: 10
Other values (16): 73

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 2
Unique (%): 1.2%

Sample

1st row: 2022-10-04
2nd row: 2022-10-04
3rd row: 2022-10-04
4th row: 2022-10-04
5th row: 2022-10-04

Common Values

Value              Count  Frequency (%)
2022-10-04         34     20.6%
2025-06-14         26     15.8%
2023-08-10         11     6.7%
2024-07-13         11     6.7%
2023-09-12         10     6.1%
2024-08-21         9      5.5%
2023-08-13         8      4.8%
2022-12-25         7      4.2%
2024-01-25         7      4.2%
2024-09-26         7      4.2%
Other values (11)  35     21.2%

Length

[Figure: histogram of lengths of the category]

Correlations

[Figure: correlation heatmap]

              인증업체  품목명  인증일자  인증시작일자  인증종료일자
인증업체      1.000     0.908   0.993     0.986         0.986
품목명        0.908     1.000   0.697     0.604         0.617
인증일자      0.993     0.697   1.000     0.995         0.994
인증시작일자  0.986     0.604   0.995     1.000         1.000
인증종료일자  0.986     0.617   0.994     1.000         1.000

[Figure: correlation heatmap]

              품목명  인증시작일자  인증종료일자  인증일자
품목명        1.000   0.287         0.296         0.315
인증시작일자  0.287   1.000         0.971         0.870
인증종료일자  0.296   0.971         1.000         0.853
인증일자      0.315   0.870         0.853         1.000

[Figure: correlation heatmap]

              품목명  인증일자  인증시작일자  인증종료일자
품목명        1.000   0.315     0.287         0.296
인증일자      0.315   1.000     0.870         0.853
인증시작일자  0.287   0.870     1.000         0.971
인증종료일자  0.296   0.853     0.971         1.000
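
The report does not state which association statistic produced each matrix. As one common choice for pairs of categorical columns, the sketch below computes an uncorrected Cramér's V in plain Python. The `cramers_v` helper is hypothetical, the date pairs mirror values seen in the data, and the `item` list is an invented arrangement used only to show the zero-association case.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    x_totals = Counter(xs)
    y_totals = Counter(ys)
    observed = Counter(zip(xs, ys))
    chi2 = 0.0
    for x, x_count in x_totals.items():
        for y, y_count in y_totals.items():
            expected = x_count * y_count / n  # count expected under independence
            chi2 += (observed[(x, y)] - expected) ** 2 / expected
    k = min(len(x_totals), len(y_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0

start = ["2019-10-05", "2019-10-05", "2022-06-15", "2022-06-15"]
end = ["2022-10-04", "2022-10-04", "2025-06-14", "2025-06-14"]
item = ["탁주", "약주", "탁주", "약주"]  # hypothetical arrangement, not real rows

v_dates = cramers_v(start, end)  # each start date maps to exactly one end date
v_item = cramers_v(start, item)  # item is evenly spread across start dates
print(v_dates, v_item)
```

When one column fully determines the other, as the start and end dates do in this toy input, the statistic reaches 1. That is consistent with the 1.000 entries between 인증시작일자 and 인증종료일자 in the first matrix.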

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

     인증번호         인증기관        인증업체                             품목명    인증일자    인증시작일자  인증종료일자
0    국가지정-가-004  한국식품연구원  서울장수주식회사                     탁주      2020-03-18  2019-10-05    2022-10-04
1    국가지정-가-005  한국식품연구원  서울장수주식회사                     살균탁주  2020-03-18  2019-10-05    2022-10-04
2    국가지정-가-006  한국식품연구원  구암농산                             살균탁주  2020-03-18  2019-10-05    2022-10-04
3    국가지정-가-014  한국식품연구원  울산탁주                             탁주      2020-04-18  2019-10-05    2022-10-04
4    국가지정-가-022  한국식품연구원  한주양조                             탁주      2020-05-23  2019-10-05    2022-10-04
5    국가지정-가-027  한국식품연구원  배상면주가고창LB주식회사             과실주    2020-05-23  2019-10-05    2022-10-04
6    국가지정-가-028  한국식품연구원  여수주조공사                         탁주      2020-07-20  2019-10-05    2022-10-04
7    국가지정-가-032  한국식품연구원  순천주조                             탁주      2020-07-20  2019-10-05    2022-10-04
8    국가지정-가-035  한국식품연구원  순천주조                             탁주      2020-09-14  2019-10-05    2022-10-04
9    국가지정-가-043  한국식품연구원  (주)제주막걸리                       탁주      2020-12-09  2019-10-05    2022-10-04

Last rows

     인증번호         인증기관        인증업체                             품목명    인증일자    인증시작일자  인증종료일자
155  국가지정-가-260  한국식품연구원  양주골 이가전통주                    약주      2022-06-15  2022-06-15    2025-06-14
156  국가지정-가-261  한국식품연구원  농업회사법인(유)친구들의술지란지교  약주      2022-06-15  2022-06-15    2025-06-14
157  국가지정-가-262  한국식품연구원  국도 양조장                          탁주      2022-06-15  2022-06-15    2025-06-14
158  국가지정-가-263  한국식품연구원  국도 양조장                          탁주      2022-06-15  2022-06-15    2025-06-14
159  국가지정-가-264  한국식품연구원  국도 양조장                          탁주      2022-06-15  2022-06-15    2025-06-14
160  국가지정-가-265  한국식품연구원  농업회사법인(주)죽향도가             탁주      2022-06-15  2022-06-15    2025-06-14
161  국가지정-가-266  한국식품연구원  농업회사법인(주)죽향도가             탁주      2022-06-15  2022-06-15    2025-06-14
162  국가지정-가-267  한국식품연구원  농업회사법인(주)죽향도가             탁주      2022-06-15  2022-06-15    2025-06-14
163  국가지정-가-268  한국식품연구원  중원당                               탁주      2022-06-15  2022-06-15    2025-06-14
164  국가지정-가-269  한국식품연구원  농업회사법인 다도참주가(유)          탁주      2022-06-15  2022-06-15    2025-06-14
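
The three date columns are profiled as Categorical because they are stored as strings. The sketch below, which assumes ISO `YYYY-MM-DD` strings as in the sample rows, parses two start/end pairs from the sample and derives the certification validity period. The `validity_days` helper is hypothetical.

```python
from datetime import date

def validity_days(start_iso, end_iso):
    """Days between a certification start date and its end date."""
    return (date.fromisoformat(end_iso) - date.fromisoformat(start_iso)).days

# (인증시작일자, 인증종료일자) pairs taken from the sample rows.
pairs = [("2019-10-05", "2022-10-04"), ("2022-06-15", "2025-06-14")]
days = [validity_days(start, end) for start, end in pairs]
print(days)
```

Both pairs span 1095 days, that is, three years minus one day. A fixed validity period of this kind would explain why 인증시작일자 and 인증종료일자 are reported as perfectly correlated.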