Overview

Dataset statistics

Number of variables11
Number of observations169
Missing cells507
Missing cells (%)27.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.1 KiB
Average record size in memory91.8 B

Variable types

Text3
Categorical5
Unsupported3

Dataset

Description탁주, 약주, 청주, 과실주 등의 술 품질인증 관리 정보(인증번호, 인증기관, 인증업체, 품목명, 인증일자, 인증시작일, 인증종료일 등)
Author국립농산물품질관리원
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220204000000001687

Alerts

인증기관 has constant value ""Constant
인증종료일 is highly overall correlated with 인증일자 and 1 other fieldsHigh correlation
인증시작일 is highly overall correlated with 인증일자 and 1 other fieldsHigh correlation
인증일자 is highly overall correlated with 인증시작일 and 1 other fieldsHigh correlation
Unnamed: 8 has 169 (100.0%) missing valuesMissing
Unnamed: 9 has 169 (100.0%) missing valuesMissing
Unnamed: 10 has 169 (100.0%) missing valuesMissing
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-23 07:55:12.009048
Analysis finished2024-03-23 07:55:14.525013
Duration2.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct138
Distinct (%)81.7%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-23T07:55:14.993521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.5976331
Min length8

Characters and Unicode

Total characters1622
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique107 ?
Unique (%)63.3%

Sample

1st row국가지정-가-35
2nd row국가지정-가-130
3rd row국가지정-가-131
4th row국가지정-가-100
5th row국가지정-가-92
ValueCountFrequency (%)
국가지정-가-35 2
 
1.2%
국가지정-가-19 2
 
1.2%
국가지정-가-4 2
 
1.2%
국가지정-가-116 2
 
1.2%
국가지정-가-130 2
 
1.2%
국가지정-가-125 2
 
1.2%
국가지정-가-115 2
 
1.2%
국가지정-가-123 2
 
1.2%
국가지정-가-5 2
 
1.2%
국가지정-가-120 2
 
1.2%
Other values (128) 149
88.2%
2024-03-23T07:55:16.187637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
338
20.8%
- 338
20.8%
169
10.4%
169
10.4%
169
10.4%
1 144
8.9%
2 45
 
2.8%
5 35
 
2.2%
9 34
 
2.1%
3 32
 
2.0%
Other values (5) 149
9.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 845
52.1%
Decimal Number 439
27.1%
Dash Punctuation 338
 
20.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 144
32.8%
2 45
 
10.3%
5 35
 
8.0%
9 34
 
7.7%
3 32
 
7.3%
7 32
 
7.3%
4 31
 
7.1%
0 30
 
6.8%
8 28
 
6.4%
6 28
 
6.4%
Other Letter
ValueCountFrequency (%)
338
40.0%
169
20.0%
169
20.0%
169
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 338
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 845
52.1%
Common 777
47.9%

Most frequent character per script

Common
ValueCountFrequency (%)
- 338
43.5%
1 144
18.5%
2 45
 
5.8%
5 35
 
4.5%
9 34
 
4.4%
3 32
 
4.1%
7 32
 
4.1%
4 31
 
4.0%
0 30
 
3.9%
8 28
 
3.6%
Hangul
ValueCountFrequency (%)
338
40.0%
169
20.0%
169
20.0%
169
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 845
52.1%
ASCII 777
47.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
338
40.0%
169
20.0%
169
20.0%
169
20.0%
ASCII
ValueCountFrequency (%)
- 338
43.5%
1 144
18.5%
2 45
 
5.8%
5 35
 
4.5%
9 34
 
4.4%
3 32
 
4.1%
7 32
 
4.1%
4 31
 
4.0%
0 30
 
3.9%
8 28
 
3.6%

인증기관
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
한국식품연구원
169 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한국식품연구원
2nd row한국식품연구원
3rd row한국식품연구원
4th row한국식품연구원
5th row한국식품연구원

Common Values

ValueCountFrequency (%)
한국식품연구원 169
100.0%

Length

2024-03-23T07:55:16.599501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T07:55:16.905609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국식품연구원 169
100.0%
Distinct68
Distinct (%)40.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-23T07:55:17.505405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length12
Mean length8.7633136
Min length2

Characters and Unicode

Total characters1481
Distinct characters138
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)11.8%

Sample

1st row순천주조
2nd row농업회사법인 (주)명가주조
3rd row농업회사법인 (주)토향
4th row한산소곡주
5th row제주샘영농조합법인
ValueCountFrequency (%)
농업회사법인 23
 
10.8%
주)국순당(횡성공장 9
 
4.2%
제주샘영농조합법인 9
 
4.2%
국순당여주명주 9
 
4.2%
주식회사 9
 
4.2%
서울장수주식회사 8
 
3.8%
주)조은술세종 6
 
2.8%
주)화요 5
 
2.3%
농업회사법인주식회사좋은술 5
 
2.3%
장희도가 4
 
1.9%
Other values (66) 126
59.2%
2024-03-23T07:55:18.355912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
145
 
9.8%
74
 
5.0%
65
 
4.4%
( 61
 
4.1%
) 61
 
4.1%
60
 
4.1%
60
 
4.1%
59
 
4.0%
55
 
3.7%
44
 
3.0%
Other values (128) 797
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1293
87.3%
Open Punctuation 61
 
4.1%
Close Punctuation 61
 
4.1%
Space Separator 44
 
3.0%
Decimal Number 14
 
0.9%
Other Symbol 4
 
0.3%
Uppercase Letter 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
145
 
11.2%
74
 
5.7%
65
 
5.0%
60
 
4.6%
60
 
4.6%
59
 
4.6%
55
 
4.3%
44
 
3.4%
33
 
2.6%
29
 
2.2%
Other values (118) 669
51.7%
Decimal Number
ValueCountFrequency (%)
2 8
57.1%
1 2
 
14.3%
9 2
 
14.3%
3 2
 
14.3%
Uppercase Letter
ValueCountFrequency (%)
L 2
50.0%
B 2
50.0%
Open Punctuation
ValueCountFrequency (%)
( 61
100.0%
Close Punctuation
ValueCountFrequency (%)
) 61
100.0%
Space Separator
ValueCountFrequency (%)
44
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1297
87.6%
Common 180
 
12.2%
Latin 4
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
145
 
11.2%
74
 
5.7%
65
 
5.0%
60
 
4.6%
60
 
4.6%
59
 
4.5%
55
 
4.2%
44
 
3.4%
33
 
2.5%
29
 
2.2%
Other values (119) 673
51.9%
Common
ValueCountFrequency (%)
( 61
33.9%
) 61
33.9%
44
24.4%
2 8
 
4.4%
1 2
 
1.1%
9 2
 
1.1%
3 2
 
1.1%
Latin
ValueCountFrequency (%)
L 2
50.0%
B 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1293
87.3%
ASCII 184
 
12.4%
None 4
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
145
 
11.2%
74
 
5.7%
65
 
5.0%
60
 
4.6%
60
 
4.6%
59
 
4.6%
55
 
4.3%
44
 
3.4%
33
 
2.6%
29
 
2.2%
Other values (118) 669
51.7%
ASCII
ValueCountFrequency (%)
( 61
33.2%
) 61
33.2%
44
23.9%
2 8
 
4.3%
1 2
 
1.1%
9 2
 
1.1%
3 2
 
1.1%
L 2
 
1.1%
B 2
 
1.1%
None
ValueCountFrequency (%)
4
100.0%
Distinct69
Distinct (%)40.8%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-23T07:55:18.839075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters507
Distinct characters97
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)12.4%

Sample

1st row조병준
2nd row이승일
3rd row정옥주
4th row우희열
5th row김숙희
ValueCountFrequency (%)
배중호 11
 
6.5%
김숙희 9
 
5.3%
박용구 9
 
5.3%
유재찬 7
 
4.1%
경기호 6
 
3.6%
이예령 5
 
3.0%
조태권 5
 
3.0%
조병준 4
 
2.4%
장정수 4
 
2.4%
채창윤 4
 
2.4%
Other values (59) 105
62.1%
2024-03-23T07:55:19.835801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35
 
6.9%
24
 
4.7%
20
 
3.9%
19
 
3.7%
16
 
3.2%
14
 
2.8%
13
 
2.6%
13
 
2.6%
12
 
2.4%
12
 
2.4%
Other values (87) 329
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 507
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
6.9%
24
 
4.7%
20
 
3.9%
19
 
3.7%
16
 
3.2%
14
 
2.8%
13
 
2.6%
13
 
2.6%
12
 
2.4%
12
 
2.4%
Other values (87) 329
64.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 507
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
6.9%
24
 
4.7%
20
 
3.9%
19
 
3.7%
16
 
3.2%
14
 
2.8%
13
 
2.6%
13
 
2.6%
12
 
2.4%
12
 
2.4%
Other values (87) 329
64.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 507
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
35
 
6.9%
24
 
4.7%
20
 
3.9%
19
 
3.7%
16
 
3.2%
14
 
2.8%
13
 
2.6%
13
 
2.6%
12
 
2.4%
12
 
2.4%
Other values (87) 329
64.9%

품목명
Categorical

Distinct7
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
탁주
65 
약주
32 
증류식소주
31 
과실주
17 
살균탁주
13 
Other values (2)
11 

Length

Max length5
Median length2
Mean length2.9763314
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row탁주
2nd row탁주
3rd row기타주류
4th row약주
5th row약주

Common Values

ValueCountFrequency (%)
탁주 65
38.5%
약주 32
18.9%
증류식소주 31
18.3%
과실주 17
 
10.1%
살균탁주 13
 
7.7%
일반증류주 7
 
4.1%
기타주류 4
 
2.4%

Length

2024-03-23T07:55:20.324454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T07:55:20.723991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
탁주 65
38.5%
약주 32
18.9%
증류식소주 31
18.3%
과실주 17
 
10.1%
살균탁주 13
 
7.7%
일반증류주 7
 
4.1%
기타주류 4
 
2.4%

인증일자
Categorical

HIGH CORRELATION 

Distinct37
Distinct (%)21.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2017-08-11
25 
2019-10-05
12 
2017-09-14
 
10
2018-08-22
 
9
2018-12-03
 
8
Other values (32)
105 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique6 ?
Unique (%)3.6%

Sample

1st row2020-09-14
2nd row2020-09-13
3rd row2020-09-13
4th row2020-08-11
5th row2020-08-11

Common Values

ValueCountFrequency (%)
2017-08-11 25
 
14.8%
2019-10-05 12
 
7.1%
2017-09-14 10
 
5.9%
2018-08-22 9
 
5.3%
2018-12-03 8
 
4.7%
2018-10-05 6
 
3.6%
2018-12-21 6
 
3.6%
2019-06-28 6
 
3.6%
2018-11-29 6
 
3.6%
2020-04-03 5
 
3.0%
Other values (27) 76
45.0%

Length

2024-03-23T07:55:21.105794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2017-08-11 25
 
14.8%
2019-10-05 12
 
7.1%
2017-09-14 10
 
5.9%
2018-08-22 9
 
5.3%
2018-12-03 8
 
4.7%
2018-10-05 6
 
3.6%
2018-12-21 6
 
3.6%
2019-06-28 6
 
3.6%
2018-11-29 6
 
3.6%
2017-03-18 5
 
3.0%
Other values (27) 76
45.0%

인증시작일
Categorical

HIGH CORRELATION 

Distinct39
Distinct (%)23.1%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2017-08-11
23 
2019-10-05
12 
2017-04-05
11 
2017-09-14
 
10
2018-12-03
 
8
Other values (34)
105 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique8 ?
Unique (%)4.7%

Sample

1st row2020-09-14
2nd row2020-09-13
3rd row2020-09-13
4th row2020-08-11
5th row2020-08-11

Common Values

ValueCountFrequency (%)
2017-08-11 23
 
13.6%
2019-10-05 12
 
7.1%
2017-04-05 11
 
6.5%
2017-09-14 10
 
5.9%
2018-12-03 8
 
4.7%
2019-05-22 7
 
4.1%
2019-06-28 6
 
3.6%
2020-03-18 5
 
3.0%
2019-07-10 5
 
3.0%
2020-04-03 5
 
3.0%
Other values (29) 77
45.6%

Length

2024-03-23T07:55:21.495328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2017-08-11 23
 
13.6%
2019-10-05 12
 
7.1%
2017-04-05 11
 
6.5%
2017-09-14 10
 
5.9%
2018-12-03 8
 
4.7%
2019-05-22 7
 
4.1%
2019-06-28 6
 
3.6%
2017-03-18 5
 
3.0%
2018-12-06 5
 
3.0%
2020-08-11 5
 
3.0%
Other values (29) 77
45.6%

인증종료일
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)24.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2020-09-12
20 
2022-12-25
14 
2021-01-25
 
9
2021-12-02
 
8
2021-12-04
 
7
Other values (37)
111 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique10 ?
Unique (%)5.9%

Sample

1st row2023-09-13
2nd row2023-09-12
3rd row2023-09-12
4th row2023-08-10
5th row2023-08-10

Common Values

ValueCountFrequency (%)
2020-09-12 20
 
11.8%
2022-12-25 14
 
8.3%
2021-01-25 9
 
5.3%
2021-12-02 8
 
4.7%
2021-12-04 7
 
4.1%
2022-05-21 7
 
4.1%
2021-12-05 6
 
3.6%
2023-08-10 5
 
3.0%
2023-03-30 5
 
3.0%
2020-08-10 5
 
3.0%
Other values (32) 83
49.1%

Length

2024-03-23T07:55:21.887731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2020-09-12 20
 
11.8%
2022-12-25 14
 
8.3%
2021-01-25 9
 
5.3%
2021-12-02 8
 
4.7%
2021-12-04 7
 
4.1%
2022-05-21 7
 
4.1%
2021-12-05 6
 
3.6%
2020-03-17 5
 
3.0%
2020-03-30 5
 
3.0%
2023-04-02 5
 
3.0%
Other values (32) 83
49.1%

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing169
Missing (%)100.0%
Memory size1.6 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing169
Missing (%)100.0%
Memory size1.6 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing169
Missing (%)100.0%
Memory size1.6 KiB

Correlations

2024-03-23T07:55:22.123237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인증업체대표자명품목명인증일자인증시작일인증종료일
인증업체1.0001.0000.9760.9760.9790.982
대표자명1.0001.0000.9730.9770.9800.983
품목명0.9760.9731.0000.6600.7230.739
인증일자0.9760.9770.6601.0000.9980.998
인증시작일0.9790.9800.7230.9981.0000.998
인증종료일0.9820.9830.7390.9980.9981.000
2024-03-23T07:55:22.396547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인증종료일품목명인증시작일인증일자
인증종료일1.0000.3350.9230.920
품목명0.3351.0000.3520.306
인증시작일0.9230.3521.0000.926
인증일자0.9200.3060.9261.000
2024-03-23T07:55:22.597821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목명인증일자인증시작일인증종료일
품목명1.0000.3060.3520.335
인증일자0.3061.0000.9260.920
인증시작일0.3520.9261.0000.923
인증종료일0.3350.9200.9231.000

Missing values

2024-03-23T07:55:13.702572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T07:55:14.386290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인증번호인증기관인증업체대표자명품목명인증일자인증시작일인증종료일Unnamed: 8Unnamed: 9Unnamed: 10
0국가지정-가-35한국식품연구원순천주조조병준탁주2020-09-142020-09-142023-09-13<NA><NA><NA>
1국가지정-가-130한국식품연구원농업회사법인 (주)명가주조이승일탁주2020-09-132020-09-132023-09-12<NA><NA><NA>
2국가지정-가-131한국식품연구원농업회사법인 (주)토향정옥주기타주류2020-09-132020-09-132023-09-12<NA><NA><NA>
3국가지정-가-100한국식품연구원한산소곡주우희열약주2020-08-112020-08-112023-08-10<NA><NA><NA>
4국가지정-가-92한국식품연구원제주샘영농조합법인김숙희약주2020-08-112020-08-112023-08-10<NA><NA><NA>
5국가지정-가-93한국식품연구원제주샘영농조합법인김숙희약주2020-08-112020-08-112023-08-10<NA><NA><NA>
6국가지정-가-94한국식품연구원제주샘영농조합법인김숙희증류식소주2020-08-112020-08-112023-08-10<NA><NA><NA>
7국가지정-가-95한국식품연구원제주샘영농조합법인김숙희증류식소주2020-08-112020-08-112023-08-10<NA><NA><NA>
8국가지정-가-28한국식품연구원여수주조공사임용택탁주2020-07-202020-07-202023-07-19<NA><NA><NA>
9국가지정-가-32한국식품연구원순천주조조병준탁주2020-07-202020-07-202023-07-19<NA><NA><NA>
인증번호인증기관인증업체대표자명품목명인증일자인증시작일인증종료일Unnamed: 8Unnamed: 9Unnamed: 10
159국가지정-가-122한국식품연구원농업회사법인 국순당여주명주 주식회사박용구증류식소주2016-12-262017-04-052020-03-30<NA><NA><NA>
160국가지정-가-123한국식품연구원농업회사법인 국순당여주명주 주식회사박용구증류식소주2016-12-262017-04-052020-03-30<NA><NA><NA>
161국가지정-가-124한국식품연구원농업회사법인 국순당여주명주 주식회사박용구증류식소주2016-12-262017-04-052020-03-30<NA><NA><NA>
162국가지정-가-125한국식품연구원우리술박성기탁주2016-12-262017-04-052020-04-02<NA><NA><NA>
163국가지정-가-118한국식품연구원장희도가장정수약주2016-10-052017-04-052020-04-02<NA><NA><NA>
164국가지정-가-119한국식품연구원장희도가장정수탁주2016-10-052017-04-052020-04-02<NA><NA><NA>
165국가지정-가-120한국식품연구원제주고소리술익는집김희숙증류식소주2016-10-052017-04-052020-03-29<NA><NA><NA>
166국가지정-가-113한국식품연구원삼봉표 아리랑막걸리김남두탁주2016-06-282017-04-052020-04-02<NA><NA><NA>
167국가지정-가-115한국식품연구원(주)당진면천주조박경하탁주2016-06-282017-04-052020-03-30<NA><NA><NA>
168국가지정-가-116한국식품연구원(주)당진면천주조박경하탁주2016-06-282017-04-052020-03-30<NA><NA><NA>