Overview

Dataset statistics

Number of variables: 7
Number of observations: 160
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 5
Duplicate rows (%): 3.1%
Total size in memory: 8.9 KiB
Average record size in memory: 56.8 B
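The percentage rows follow from the counts in this table. As a minimal sketch (the helper name `percent` is hypothetical and not part of ydata-profiling), the arithmetic can be checked like this:

```python
# Counts copied from the "Dataset statistics" table.
n_variables = 7
n_observations = 160
n_missing_cells = 0
n_duplicate_rows = 5


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Every observation has one cell per variable.
total_cells = n_variables * n_observations

missing_pct = percent(n_missing_cells, total_cells)
duplicate_pct = percent(n_duplicate_rows, n_observations)

print(f"Missing cells (%): {missing_pct}%")
print(f"Duplicate rows (%): {duplicate_pct}%")
```

The duplicate-row percentage is taken relative to the number of observations, while the missing-cell percentage is relative to all cells.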

Variable types

Text: 2
Categorical: 5

Dataset

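Description: Quality certification management information for liquors such as takju, yakju, cheongju, and fruit wine (certification number, certifying body, certified company, product category, certification date, certification start date, certification end date, etc.)
Author: 국립농산물품질관리원 (National Agricultural Products Quality Management Service)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220204000000001687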

Alerts

인증기관 has constant value "한국식품연구원" [Constant]
Dataset has 5 (3.1%) duplicate rows [Duplicates]
인증시작일자 is highly correlated overall with 인증일자 and 1 other field [High correlation]
인증종료일자 is highly correlated overall with 인증일자 and 1 other field [High correlation]
인증일자 is highly correlated overall with 인증시작일자 and 1 other field [High correlation]

Reproduction

Analysis started: 2024-03-23 07:54:36.152122
Analysis finished: 2024-03-23 07:54:37.707897
Duration: 1.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
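The reported duration is the difference between the two timestamps. A short sketch using only the standard library, with the timestamps copied from this section:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2024-03-23 07:54:36.152122")
finished = datetime.fromisoformat("2024-03-23 07:54:37.707897")

# Subtracting two datetimes yields a timedelta.
duration_seconds = (finished - started).total_seconds()

print(f"Duration: {duration_seconds:.2f} seconds")
```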

Variables

인증번호
Text

Distinct: 155
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 1600
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
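As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for two of the sample values; the function name `category_counts` is made up for this example, and the report's own implementation may differ.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


# Two certification numbers taken from the sample rows of this variable.
sample = ["국가지정-가-001", "국가지정-가-002"]
counts = category_counts(sample)

# "Lo" = Other Letter, "Nd" = Decimal Number, "Pd" = Dash Punctuation
for category, count in counts.most_common():
    print(category, count)
```

Each value contributes five Hangul letters, three digits, and two hyphens, which matches the 50% / 30% / 20% split reported for this variable.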

Unique

Unique: 150
Unique (%): 93.8%
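The numbers here are consistent with "Distinct" counting different values and "Unique" counting values that occur exactly once. A minimal sketch on synthetic stand-in data (the `id-…` values are hypothetical, not taken from the dataset):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Synthetic stand-in: 155 different IDs, 5 of which appear a second time,
# giving 160 values in total.
values = [f"id-{i:03d}" for i in range(155)] + [f"id-{i:03d}" for i in range(5)]

distinct, unique = distinct_and_unique(values)
print(distinct, unique)
```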

Sample

1st row: 국가지정-가-001
2nd row: 국가지정-가-002
3rd row: 국가지정-가-004
4th row: 국가지정-가-005
5th row: 국가지정-가-006

Value | Count | Frequency (%)
국가지정-가-045 | 2 | 1.2%
국가지정-가-052 | 2 | 1.2%
국가지정-가-051 | 2 | 1.2%
국가지정-가-077 | 2 | 1.2%
국가지정-가-057 | 2 | 1.2%
국가지정-가-188 | 1 | 0.6%
국가지정-가-001 | 1 | 0.6%
국가지정-가-190 | 1 | 0.6%
국가지정-가-191 | 1 | 0.6%
국가지정-가-192 | 1 | 0.6%
Other values (145) | 145 | 90.6%

Most occurring characters

Value | Count | Frequency (%)
가 | 320 | 20.0%
- | 320 | 20.0%
국 | 160 | 10.0%
지 | 160 | 10.0%
정 | 160 | 10.0%
1 | 109 | 6.8%
0 | 81 | 5.1%
2 | 80 | 5.0%
3 | 36 | 2.2%
5 | 33 | 2.1%
Other values (5) | 141 | 8.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 800 | 50.0%
Decimal Number | 480 | 30.0%
Dash Punctuation | 320 | 20.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 109 | 22.7%
0 | 81 | 16.9%
2 | 80 | 16.7%
3 | 36 | 7.5%
5 | 33 | 6.9%
4 | 31 | 6.5%
7 | 31 | 6.5%
9 | 28 | 5.8%
6 | 26 | 5.4%
8 | 25 | 5.2%

Other Letter
Value | Count | Frequency (%)
가 | 320 | 40.0%
국 | 160 | 20.0%
지 | 160 | 20.0%
정 | 160 | 20.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 320 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 800 | 50.0%
Common | 800 | 50.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 320 | 40.0%
1 | 109 | 13.6%
0 | 81 | 10.1%
2 | 80 | 10.0%
3 | 36 | 4.5%
5 | 33 | 4.1%
4 | 31 | 3.9%
7 | 31 | 3.9%
9 | 28 | 3.5%
6 | 26 | 3.2%

Hangul
Value | Count | Frequency (%)
가 | 320 | 40.0%
국 | 160 | 20.0%
지 | 160 | 20.0%
정 | 160 | 20.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 800 | 50.0%
ASCII | 800 | 50.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
가 | 320 | 40.0%
국 | 160 | 20.0%
지 | 160 | 20.0%
정 | 160 | 20.0%

ASCII
Value | Count | Frequency (%)
- | 320 | 40.0%
1 | 109 | 13.6%
0 | 81 | 10.1%
2 | 80 | 10.0%
3 | 36 | 4.5%
5 | 33 | 4.1%
4 | 31 | 3.9%
7 | 31 | 3.9%
9 | 28 | 3.5%
6 | 26 | 3.2%

인증기관
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
한국식품연구원 | 160

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 한국식품연구원
2nd row: 한국식품연구원
3rd row: 한국식품연구원
4th row: 한국식품연구원
5th row: 한국식품연구원

Common Values

Value | Count | Frequency (%)
한국식품연구원 | 160 | 100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of common values (한국식품연구원: 160, 100.0%)

인증업체
Text

Distinct: 75
Distinct (%): 46.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 19
Median length: 14
Mean length: 9.425
Min length: 2

Characters and Unicode

Total characters: 1508
Distinct characters: 150
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 33
Unique (%): 20.6%

Sample

1st row: (주)국순당
2nd row: (주)국순당
3rd row: 서울장수주식회사
4th row: 서울장수주식회사
5th row: 구암농산

Value | Count | Frequency (%)
농업회사법인 | 25 | 12.0%
주식회사 | 8 | 3.8%
서울장수주식회사 | 7 | 3.3%
국순당여주명주 | 7 | 3.3%
주)국순당(횡성공장 | 7 | 3.3%
주)제이엘 | 7 | 3.3%
제주샘영농조합법인 | 6 | 2.9%
주)조은술세종 | 5 | 2.4%
주)화요 | 5 | 2.4%
안양주조2공장 | 4 | 1.9%
Other values (74) | 128 | 61.2%

Most occurring characters

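Value | Count | Frequency (%)
[missing] | 130 | 8.6%
[missing] | 76 | 5.0%
[missing] | 75 | 5.0%
[missing] | 72 | 4.8%
[missing] | 72 | 4.8%
[missing] | 70 | 4.6%
) | 67 | 4.4%
( | 67 | 4.4%
[missing] | 63 | 4.2%
[missing] | 50 | 3.3%
Other values (140) | 766 | 50.8%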

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1308 | 86.7%
Close Punctuation | 67 | 4.4%
Open Punctuation | 67 | 4.4%
Space Separator | 49 | 3.2%
Decimal Number | 11 | 0.7%
Other Symbol | 4 | 0.3%
Uppercase Letter | 2 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[missing] | 130 | 9.9%
[missing] | 76 | 5.8%
[missing] | 75 | 5.7%
[missing] | 72 | 5.5%
[missing] | 72 | 5.5%
[missing] | 70 | 5.4%
[missing] | 63 | 4.8%
[missing] | 50 | 3.8%
[missing] | 40 | 3.1%
[missing] | 29 | 2.2%
Other values (130) | 631 | 48.2%

Decimal Number
Value | Count | Frequency (%)
2 | 6 | 54.5%
1 | 3 | 27.3%
9 | 1 | 9.1%
3 | 1 | 9.1%

Uppercase Letter
Value | Count | Frequency (%)
L | 1 | 50.0%
B | 1 | 50.0%

Close Punctuation
Value | Count | Frequency (%)
) | 67 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 67 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 49 | 100.0%

Other Symbol
Value | Count | Frequency (%)
[missing] | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1312 | 87.0%
Common | 194 | 12.9%
Latin | 2 | 0.1%

Most frequent character per script

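Hangul
Value | Count | Frequency (%)
[missing] | 130 | 9.9%
[missing] | 76 | 5.8%
[missing] | 75 | 5.7%
[missing] | 72 | 5.5%
[missing] | 72 | 5.5%
[missing] | 70 | 5.3%
[missing] | 63 | 4.8%
[missing] | 50 | 3.8%
[missing] | 40 | 3.0%
[missing] | 29 | 2.2%
Other values (131) | 635 | 48.4%

Common
Value | Count | Frequency (%)
) | 67 | 34.5%
( | 67 | 34.5%
(space) | 49 | 25.3%
2 | 6 | 3.1%
1 | 3 | 1.5%
9 | 1 | 0.5%
3 | 1 | 0.5%

Latin
Value | Count | Frequency (%)
L | 1 | 50.0%
B | 1 | 50.0%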

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1308 | 86.7%
ASCII | 196 | 13.0%
None | 4 | 0.3%

Most frequent character per block

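Hangul
Value | Count | Frequency (%)
[missing] | 130 | 9.9%
[missing] | 76 | 5.8%
[missing] | 75 | 5.7%
[missing] | 72 | 5.5%
[missing] | 72 | 5.5%
[missing] | 70 | 5.4%
[missing] | 63 | 4.8%
[missing] | 50 | 3.8%
[missing] | 40 | 3.1%
[missing] | 29 | 2.2%
Other values (130) | 631 | 48.2%

ASCII
Value | Count | Frequency (%)
) | 67 | 34.2%
( | 67 | 34.2%
(space) | 49 | 25.0%
2 | 6 | 3.1%
1 | 3 | 1.5%
9 | 1 | 0.5%
3 | 1 | 0.5%
L | 1 | 0.5%
B | 1 | 0.5%

None
Value | Count | Frequency (%)
[missing] | 4 | 100.0%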

품목명
Categorical

Distinct: 8
Distinct (%): 5.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
탁주 | 60
증류식소주 | 30
약주 | 29
과실주 | 18
일반증류주 | 10
Other values (3) | 13

Length

Max length: 5
Median length: 2
Mean length: 3.01875
Min length: 2

Unique

Unique: 1
Unique (%): 0.6%

Sample

1st row: 탁주
2nd row: 살균탁주
3rd row: 탁주
4th row: 살균탁주
5th row: 살균탁주

Common Values

Value | Count | Frequency (%)
탁주 | 60 | 37.5%
증류식소주 | 30 | 18.8%
약주 | 29 | 18.1%
과실주 | 18 | 11.2%
일반증류주 | 10 | 6.2%
살균탁주 | 7 | 4.4%
기타주류 | 5 | 3.1%
리큐르 | 1 | 0.6%
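The frequency column is each count divided by the 160 observations. A small sketch that recomputes it from the counts listed for 품목명 (the variable names are illustrative only):

```python
# Counts copied from the "Common Values" table for 품목명.
counts = {
    "탁주": 60,
    "증류식소주": 30,
    "약주": 29,
    "과실주": 18,
    "일반증류주": 10,
    "살균탁주": 7,
    "기타주류": 5,
    "리큐르": 1,
}

total = sum(counts.values())

# Frequency in percent, rounded to one decimal place as in the report.
frequencies = {value: round(100 * n / total, 1) for value, n in counts.items()}

for value, pct in frequencies.items():
    print(f"{value}: {pct}%")
```

The counts sum to the number of observations, so no category is hidden behind an "Other values" row for this variable.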

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: plot of common values

인증일자
Categorical

HIGH CORRELATION 

Distinct: 34
Distinct (%): 21.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
2021-07-14 | 12
2019-10-05 | 12
2020-08-11 | 11
2017-08-11 | 10
2020-08-14 | 9
Other values (29) | 106

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 6
Unique (%): 3.8%

Sample

1st row: 2020-03-18
2nd row: 2020-03-18
3rd row: 2020-03-18
4th row: 2020-03-18
5th row: 2020-03-18

Common Values

Value | Count | Frequency (%)
2021-07-14 | 12 | 7.5%
2019-10-05 | 12 | 7.5%
2020-08-11 | 11 | 6.9%
2017-08-11 | 10 | 6.2%
2020-08-14 | 9 | 5.6%
2021-08-22 | 9 | 5.6%
2017-09-14 | 7 | 4.4%
2018-12-03 | 7 | 4.4%
2021-09-27 | 7 | 4.4%
2021-12-21 | 6 | 3.8%
Other values (24) | 70 | 43.8%

Length

Figure: Histogram of lengths of the category

인증시작일자
Categorical

HIGH CORRELATION 

Distinct: 18
Distinct (%): 11.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
2019-10-05 | 51
2021-07-14 | 12
2020-08-11 | 11
2020-09-13 | 10
2020-08-14 | 9
Other values (13) | 67

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 1
Unique (%): 0.6%

Sample

1st row: 2019-10-05
2nd row: 2019-10-05
3rd row: 2019-10-05
4th row: 2019-10-05
5th row: 2019-10-05

Common Values

Value | Count | Frequency (%)
2019-10-05 | 51 | 31.9%
2021-07-14 | 12 | 7.5%
2020-08-11 | 11 | 6.9%
2020-09-13 | 10 | 6.2%
2020-08-14 | 9 | 5.6%
2021-08-22 | 9 | 5.6%
2019-06-28 | 8 | 5.0%
2019-12-26 | 7 | 4.4%
2021-01-26 | 7 | 4.4%
2021-09-27 | 7 | 4.4%
Other values (8) | 29 | 18.1%

Length

Figure: Histogram of lengths of the category

인증종료일자
Categorical

HIGH CORRELATION 

Distinct: 20
Distinct (%): 12.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
2022-10-04 | 51
2023-08-10 | 11
2024-07-13 | 11
2023-09-12 | 10
2024-08-21 | 9
Other values (15) | 68

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 2
Unique (%): 1.2%

Sample

1st row: 2022-10-04
2nd row: 2022-10-04
3rd row: 2022-10-04
4th row: 2022-10-04
5th row: 2022-10-04

Common Values

Value | Count | Frequency (%)
2022-10-04 | 51 | 31.9%
2023-08-10 | 11 | 6.9%
2024-07-13 | 11 | 6.9%
2023-09-12 | 10 | 6.2%
2024-08-21 | 9 | 5.6%
2023-08-13 | 9 | 5.6%
2024-09-26 | 7 | 4.4%
2022-12-25 | 7 | 4.4%
2024-01-25 | 7 | 4.4%
2024-12-20 | 6 | 3.8%
Other values (10) | 32 | 20.0%

Length

Figure: Histogram of lengths of the category

Value | Count | Frequency (%)
2022-10-04 | 51 | 31.9%
2023-08-10 | 11 | 6.9%
2024-07-13 | 11 | 6.9%
2023-09-12 | 10 | 6.2%
2024-08-21 | 9 | 5.6%
2023-08-13 | 9 | 5.6%
2024-09-26 | 7 | 4.4%
2022-12-25 | 7 | 4.4%
2024-01-25 | 7 | 4.4%
2024-10-04 | 6 | 3.8%
Other values (10) | 32 | 20.0%

Correlations

 | 인증업체 | 품목명 | 인증일자 | 인증시작일자 | 인증종료일자
인증업체 | 1.000 | 0.899 | 0.993 | 0.982 | 0.985
품목명 | 0.899 | 1.000 | 0.718 | 0.640 | 0.681
인증일자 | 0.993 | 0.718 | 1.000 | 0.995 | 0.991
인증시작일자 | 0.982 | 0.640 | 0.995 | 1.000 | 1.000
인증종료일자 | 0.985 | 0.681 | 0.991 | 1.000 | 1.000

 | 품목명 | 인증시작일자 | 인증종료일자 | 인증일자
품목명 | 1.000 | 0.321 | 0.335 | 0.341
인증시작일자 | 0.321 | 1.000 | 0.993 | 0.874
인증종료일자 | 0.335 | 0.993 | 1.000 | 0.825
인증일자 | 0.341 | 0.874 | 0.825 | 1.000

 | 품목명 | 인증일자 | 인증시작일자 | 인증종료일자
품목명 | 1.000 | 0.341 | 0.321 | 0.335
인증일자 | 0.341 | 1.000 | 0.874 | 0.825
인증시작일자 | 0.321 | 0.874 | 1.000 | 0.993
인증종료일자 | 0.335 | 0.825 | 0.993 | 1.000
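
The report does not say which association measure produced these matrices. Cramér's V is one common choice for pairs of categorical variables, so the sketch below assumes it purely for illustration; it is the plain, uncorrected form, and the function name `cramers_v` and the toy data are hypothetical.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    joint = Counter(zip(xs, ys))
    # Degrees of freedom term: the smaller number of categories, minus one.
    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k == 0:
        return 0.0  # undefined for a constant column; reported as 0 here
    chi2 = 0.0
    for r, r_n in row_totals.items():
        for c, c_n in col_totals.items():
            expected = r_n * c_n / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * k))


# Toy data: each start date maps to exactly one end date, so V is 1.
starts = ["2019-10-05"] * 3 + ["2021-09-27"] * 3
ends = ["2022-10-04"] * 3 + ["2024-09-26"] * 3
v_dependent = cramers_v(starts, ends)

# Toy data: the two columns are independent, so V is 0.
v_independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])

print(v_dependent, v_independent)
```

A value of 1.000 between 인증시작일자 and 인증종료일자 would be expected under this measure if every start date pairs with a single end date.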

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix, a data-dense display which lets you quickly pick out patterns in data completion.

Sample

First rows

 | 인증번호 | 인증기관 | 인증업체 | 품목명 | 인증일자 | 인증시작일자 | 인증종료일자
0 | 국가지정-가-001 | 한국식품연구원 | (주)국순당 | 탁주 | 2020-03-18 | 2019-10-05 | 2022-10-04
1 | 국가지정-가-002 | 한국식품연구원 | (주)국순당 | 살균탁주 | 2020-03-18 | 2019-10-05 | 2022-10-04
2 | 국가지정-가-004 | 한국식품연구원 | 서울장수주식회사 | 탁주 | 2020-03-18 | 2019-10-05 | 2022-10-04
3 | 국가지정-가-005 | 한국식품연구원 | 서울장수주식회사 | 살균탁주 | 2020-03-18 | 2019-10-05 | 2022-10-04
4 | 국가지정-가-006 | 한국식품연구원 | 구암농산 | 살균탁주 | 2020-03-18 | 2019-10-05 | 2022-10-04
5 | 국가지정-가-014 | 한국식품연구원 | 울산탁주 | 탁주 | 2020-04-18 | 2019-10-05 | 2022-10-04
6 | 국가지정-가-022 | 한국식품연구원 | 한주양조 | 탁주 | 2020-05-23 | 2019-10-05 | 2022-10-04
7 | 국가지정-가-027 | 한국식품연구원 | 배상면주가고창LB주식회사 | 과실주 | 2020-05-23 | 2019-10-05 | 2022-10-04
8 | 국가지정-가-028 | 한국식품연구원 | 여수주조공사 | 탁주 | 2020-07-20 | 2019-10-05 | 2022-10-04
9 | 국가지정-가-032 | 한국식품연구원 | 순천주조 | 탁주 | 2020-07-20 | 2019-10-05 | 2022-10-04

Last rows

 | 인증번호 | 인증기관 | 인증업체 | 품목명 | 인증일자 | 인증시작일자 | 인증종료일자
150 | 국가지정-가-234 | 한국식품연구원 | 농업회사법인(주)시트러스 | 과실주 | 2021-09-27 | 2021-09-27 | 2024-09-26
151 | 국가지정-가-235 | 한국식품연구원 | 제주샘영농조합법인 | 일반증류주 | 2021-09-27 | 2021-09-27 | 2024-09-26
152 | 국가지정-가-236 | 한국식품연구원 | 영농조합법인 우도땅콩막걸리 낙화곡주 | 탁주 | 2021-09-27 | 2021-09-27 | 2024-09-26
153 | 국가지정-가-237 | 한국식품연구원 | 영농조합법인 우도땅콩막걸리 낙화곡주 | 기타주류 | 2021-09-27 | 2021-09-27 | 2024-09-26
154 | 국가지정-가-238 | 한국식품연구원 | 농업회사법인(주)광양주조공사 | 탁주 | 2021-09-27 | 2021-09-27 | 2024-09-26
155 | 국가지정-가-239 | 한국식품연구원 | 인천탁주제조제1공장 | 탁주 | 2021-12-05 | 2021-12-06 | 2024-12-05
156 | 국가지정-가-240 | 한국식품연구원 | 인천탁주제조제1공장 | 탁주 | 2021-12-06 | 2021-12-06 | 2024-12-05
157 | 국가지정-가-241 | 한국식품연구원 | 농업회사법인 다도참주가(유) | 탁주 | 2021-12-06 | 2021-12-06 | 2024-12-05
158 | 국가지정-가-242 | 한국식품연구원 | 농업회사법인 다도참주가(유) | 탁주 | 2021-12-06 | 2021-12-06 | 2024-12-05
159 | 국가지정-가-243 | 한국식품연구원 | 농업회사법인 국순당여주명주 주식회사 | 증류식소주 | 2021-12-06 | 2021-12-06 | 2024-12-05
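
In the sample rows shown here, each 인증종료일자 falls one day before the third anniversary of the 인증시작일자. That is an observation from these rows only, not a rule stated by the dataset, and the helper `expected_end` below is a hypothetical name used to check it:

```python
from datetime import date, timedelta


def expected_end(start: date, years: int = 3) -> date:
    """Return the day before the `years`-th anniversary of `start`.

    Assumption for illustration: a fixed three-year validity period.
    date.replace raises ValueError for a 29 February start when the
    target year is not a leap year; no such date appears in the sample.
    """
    return start.replace(year=start.year + years) - timedelta(days=1)


# (start, end) pairs copied from the sample rows.
pairs = [
    ("2019-10-05", "2022-10-04"),
    ("2021-09-27", "2024-09-26"),
    ("2021-12-06", "2024-12-05"),
]

matches = [
    expected_end(date.fromisoformat(start)) == date.fromisoformat(end)
    for start, end in pairs
]
print(matches)
```

Such a near-deterministic relationship between the two date columns is one plausible reason they are flagged as highly correlated.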

Duplicate rows

Most frequently occurring

 | 인증번호 | 인증기관 | 인증업체 | 품목명 | 인증일자 | 인증시작일자 | 인증종료일자 | # duplicates
0 | 국가지정-가-045 | 한국식품연구원 | 농업회사법인(주)영덕주조 | 탁주 | 2020-12-09 | 2019-10-05 | 2022-10-04 | 2
1 | 국가지정-가-051 | 한국식품연구원 | 장생도라지영농조합법인 | 약주 | 2021-02-10 | 2019-10-05 | 2022-10-04 | 2
2 | 국가지정-가-052 | 한국식품연구원 | 장생도라지영농조합법인 | 약주 | 2021-02-10 | 2019-10-05 | 2022-10-04 | 2
3 | 국가지정-가-057 | 한국식품연구원 | 은척양조장 | 탁주 | 2018-04-17 | 2019-10-05 | 2022-10-04 | 2
4 | 국가지정-가-077 | 한국식품연구원 | 서울장수주식회사 | 살균탁주 | 2019-06-12 | 2019-10-05 | 2022-10-04 | 2
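
A duplicate-row table like this can be produced by counting identical records. The sketch below uses only the standard library on a three-row excerpt; the function name `duplicate_rows` is hypothetical and this is not necessarily how the report computes it.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return (row, count) pairs for rows that occur more than once, most frequent first."""
    counts = Counter(tuple(row) for row in rows)
    return [(row, n) for row, n in counts.most_common() if n > 1]


# Excerpt built from rows shown in this report; the second record is repeated.
row_001 = ("국가지정-가-001", "한국식품연구원", "(주)국순당", "탁주",
           "2020-03-18", "2019-10-05", "2022-10-04")
row_045 = ("국가지정-가-045", "한국식품연구원", "농업회사법인(주)영덕주조", "탁주",
           "2020-12-09", "2019-10-05", "2022-10-04")
rows = [row_001, row_045, row_045]

duplicates = duplicate_rows(rows)
for row, n in duplicates:
    print(row[0], n)
```

Rows are compared across all seven columns, so two records count as duplicates only when every field matches.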