Overview

Dataset statistics

Number of variables: 5
Number of observations: 177
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.0 KiB
Average record size in memory: 40.7 B

Variable types

Text: 3
Categorical: 2

Dataset

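Description: A scheme under which, in order to improve the quality of Korean liquor (우리술), encourage high-quality production, and protect consumers, a government-designated certification body certifies the quality of liquors submitted by producers seeking certification, and the government guarantees the quality of the certified products.
Author: Ministry of Agriculture, Food and Rural Affairs (농림축산식품부)
URL: https://www.data.go.kr/data/15042291/fileData.do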

Alerts

인증번호 has unique values (Unique)
제품명 has unique values (Unique)

Reproduction

Analysis started: 2024-04-17 22:37:28.808694
Analysis finished: 2024-04-17 22:37:29.713968
Duration: 0.91 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
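
The reported duration follows directly from the two timestamps in this block. A minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2024-04-17 22:37:28.808694")
finished = datetime.fromisoformat("2024-04-17 22:37:29.713968")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```

Formatting to two decimals reproduces the rounded figure shown in the report.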

Variables

인증번호
Text

UNIQUE 

Distinct: 177
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10
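
Every value in this column is exactly 10 characters long, which matches the shape of the sample rows: the fixed prefix `국가지정-가-` followed by a three-digit serial number. The sketch below checks a value against that shape. The pattern is inferred from the sample rows only, and `parse_cert_number` is a hypothetical helper, not something the report or the dataset defines.

```python
import re

# Pattern inferred from the sample rows (an assumption, not an official spec):
# fixed prefix "국가지정-가-" followed by exactly three digits.
CERT_PATTERN = re.compile(r"국가지정-가-(\d{3})")

def parse_cert_number(value: str) -> int:
    """Return the numeric serial of a certification number, or raise ValueError."""
    match = CERT_PATTERN.fullmatch(value)
    if match is None:
        raise ValueError(f"unexpected certification number format: {value!r}")
    return int(match.group(1))

print(len("국가지정-가-269"), parse_cert_number("국가지정-가-269"))
```

A check like this is one way to flag rows that deviate from the expected format before they reach downstream joins.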

Characters and Unicode

Total characters: 1770
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
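
As an illustration of those character properties, the sketch below uses Python's standard `unicodedata` module to look up the general category of each character in the first sample value of this column. It covers categories only; the script and block properties are not exposed by the standard library and are left out.

```python
import unicodedata
from collections import Counter

value = "국가지정-가-269"  # first sample row of this column

# General category of each character: Lo = Other Letter,
# Nd = Decimal Number, Pd = Dash Punctuation.
categories = Counter(unicodedata.category(ch) for ch in value)
print(dict(categories))
```

Each value contributes 5 letters, 3 digits, and 2 dashes; over 177 rows of the same shape that gives the 885, 531, and 354 characters reported per category.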

Unique

Unique: 177
Unique (%): 100.0%

Sample

1st row: 국가지정-가-269
2nd row: 국가지정-가-268
3rd row: 국가지정-가-267
4th row: 국가지정-가-266
5th row: 국가지정-가-265

Value | Count | Frequency (%)
국가지정-가-269 | 1 | 0.6%
국가지정-가-176 | 1 | 0.6%
국가지정-가-131 | 1 | 0.6%
국가지정-가-143 | 1 | 0.6%
국가지정-가-140 | 1 | 0.6%
국가지정-가-139 | 1 | 0.6%
국가지정-가-137 | 1 | 0.6%
국가지정-가-136 | 1 | 0.6%
국가지정-가-135 | 1 | 0.6%
국가지정-가-134 | 1 | 0.6%
Other values (167) | 167 | 94.4%

Most occurring characters

Value | Count | Frequency (%)
가 | 354 | 20.0%
- | 354 | 20.0%
국 | 177 | 10.0%
지 | 177 | 10.0%
정 | 177 | 10.0%
2 | 106 | 6.0%
1 | 102 | 5.8%
0 | 78 | 4.4%
5 | 41 | 2.3%
4 | 39 | 2.2%
Other values (5) | 165 | 9.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 885 | 50.0%
Decimal Number | 531 | 30.0%
Dash Punctuation | 354 | 20.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 106 | 20.0%
1 | 102 | 19.2%
0 | 78 | 14.7%
5 | 41 | 7.7%
4 | 39 | 7.3%
6 | 38 | 7.2%
3 | 37 | 7.0%
9 | 31 | 5.8%
7 | 31 | 5.8%
8 | 28 | 5.3%

Other Letter
Value | Count | Frequency (%)
가 | 354 | 40.0%
국 | 177 | 20.0%
지 | 177 | 20.0%
정 | 177 | 20.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 354 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 885 | 50.0%
Common | 885 | 50.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 354 | 40.0%
2 | 106 | 12.0%
1 | 102 | 11.5%
0 | 78 | 8.8%
5 | 41 | 4.6%
4 | 39 | 4.4%
6 | 38 | 4.3%
3 | 37 | 4.2%
9 | 31 | 3.5%
7 | 31 | 3.5%

Hangul
Value | Count | Frequency (%)
가 | 354 | 40.0%
국 | 177 | 20.0%
지 | 177 | 20.0%
정 | 177 | 20.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 885 | 50.0%
ASCII | 885 | 50.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
가 | 354 | 40.0%
국 | 177 | 20.0%
지 | 177 | 20.0%
정 | 177 | 20.0%

ASCII
Value | Count | Frequency (%)
- | 354 | 40.0%
2 | 106 | 12.0%
1 | 102 | 11.5%
0 | 78 | 8.8%
5 | 41 | 4.6%
4 | 39 | 4.4%
6 | 38 | 4.3%
3 | 37 | 4.2%
9 | 31 | 3.5%
7 | 31 | 3.5%

인증업체
Text

Distinct: 82
Distinct (%): 46.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

Length

Max length: 19
Median length: 14
Mean length: 9.6892655
Min length: 2

Characters and Unicode

Total characters: 1715
Distinct characters: 164
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 37
Unique (%): 20.9%

Sample

1st row: 농업회사법인 다도참주가(유)
2nd row: 중원당
3rd row: 농업회사법인(주)죽향도가
4th row: 농업회사법인(주)죽향도가
5th row: 농업회사법인(주)죽향도가

Value | Count | Frequency (%)
농업회사법인 | 28 | 11.8%
주식회사 | 8 | 3.4%
주)국순당(횡성공장 | 7 | 3.0%
국순당여주명주 | 7 | 3.0%
주)제이엘 | 7 | 3.0%
서울장수주식회사 | 6 | 2.5%
제주샘영농조합법인 | 6 | 2.5%
농업회사법인(주)죽향도가 | 6 | 2.5%
이가전통주 | 6 | 2.5%
양주골 | 6 | 2.5%
Other values (82) | 150 | 63.3%

Most occurring characters

ValueCountFrequency (%)
149
 
8.7%
89
 
5.2%
85
 
5.0%
84
 
4.9%
82
 
4.8%
80
 
4.7%
( 77
 
4.5%
) 77
 
4.5%
62
 
3.6%
62
 
3.6%
Other values (154) 868
50.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1484 | 86.5%
Open Punctuation | 77 | 4.5%
Close Punctuation | 77 | 4.5%
Space Separator | 60 | 3.5%
Decimal Number | 11 | 0.6%
Other Symbol | 4 | 0.2%
Uppercase Letter | 2 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
149
 
10.0%
89
 
6.0%
85
 
5.7%
84
 
5.7%
82
 
5.5%
80
 
5.4%
62
 
4.2%
62
 
4.2%
39
 
2.6%
37
 
2.5%
Other values (144) 715
48.2%
Decimal Number
ValueCountFrequency (%)
2 6
54.5%
1 3
27.3%
9 1
 
9.1%
3 1
 
9.1%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
B 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 77
100.0%
Close Punctuation
ValueCountFrequency (%)
) 77
100.0%
Space Separator
ValueCountFrequency (%)
60
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1488 | 86.8%
Common | 225 | 13.1%
Latin | 2 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
149
 
10.0%
89
 
6.0%
85
 
5.7%
84
 
5.6%
82
 
5.5%
80
 
5.4%
62
 
4.2%
62
 
4.2%
39
 
2.6%
37
 
2.5%
Other values (145) 719
48.3%
Common
ValueCountFrequency (%)
( 77
34.2%
) 77
34.2%
60
26.7%
2 6
 
2.7%
1 3
 
1.3%
9 1
 
0.4%
3 1
 
0.4%
Latin
ValueCountFrequency (%)
L 1
50.0%
B 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1484 | 86.5%
ASCII | 227 | 13.2%
None | 4 | 0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
149
 
10.0%
89
 
6.0%
85
 
5.7%
84
 
5.7%
82
 
5.5%
80
 
5.4%
62
 
4.2%
62
 
4.2%
39
 
2.6%
37
 
2.5%
Other values (144) 715
48.2%
ASCII
ValueCountFrequency (%)
( 77
33.9%
) 77
33.9%
60
26.4%
2 6
 
2.6%
1 3
 
1.3%
L 1
 
0.4%
B 1
 
0.4%
9 1
 
0.4%
3 1
 
0.4%
None
ValueCountFrequency (%)
4
100.0%

품목명
Categorical

Distinct: 8
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

탁주: 67
증류식소주: 35
약주: 33
과실주: 19
일반증류주: 10
Other values (3): 13

Length

Max length: 5
Median length: 2
Mean length: 3.0112994
Min length: 2
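
These length statistics can be re-derived from the category counts that the report lists under Common Values for this variable. A minimal sketch, assuming the counts are copied correctly and that each Hangul syllable is a single precomposed code point (so `len` counts one per syllable):

```python
from statistics import mean, median

# Category counts as reported in the Common Values table for 품목명.
counts = {
    "탁주": 67, "증류식소주": 35, "약주": 33, "과실주": 19,
    "일반증류주": 10, "기타주류": 6, "살균탁주": 6, "리큐르": 1,
}

# One length per observation (177 in total).
lengths = [len(name) for name, n in counts.items() for _ in range(n)]

print(min(lengths), max(lengths), median(lengths), round(mean(lengths), 7))
```

Because 100 of the 177 observations (탁주 and 약주) have length 2, the median sits at 2 even though the mean is pulled above 3 by the five-character categories.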

Unique

Unique: 1
Unique (%): 0.6%

Sample

1st row: 탁주
2nd row: 탁주
3rd row: 탁주
4th row: 탁주
5th row: 탁주

Common Values

Value | Count | Frequency (%)
탁주 | 67 | 37.9%
증류식소주 | 35 | 19.8%
약주 | 33 | 18.6%
과실주 | 19 | 10.7%
일반증류주 | 10 | 5.6%
기타주류 | 6 | 3.4%
살균탁주 | 6 | 3.4%
리큐르 | 1 | 0.6%
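
The Distinct, Unique, and Frequency (%) figures for this variable are all simple functions of these counts: "distinct" is the number of different values, while "unique" counts only the values that occur exactly once. A short sketch of that arithmetic, with `pct` as a hypothetical helper name:

```python
# Category counts copied from the Common Values table for 품목명.
counts = {
    "탁주": 67, "증류식소주": 35, "약주": 33, "과실주": 19,
    "일반증류주": 10, "기타주류": 6, "살균탁주": 6, "리큐르": 1,
}

total = sum(counts.values())                         # number of observations
distinct = len(counts)                               # different values present
unique = sum(1 for n in counts.values() if n == 1)   # values seen exactly once

def pct(part: int, whole: int) -> float:
    """Percentage rounded to one decimal place, as displayed in the report."""
    return round(100 * part / whole, 1)

print(total, distinct, pct(distinct, total), unique, pct(unique, total))
print({name: pct(n, total) for name, n in counts.items()})
```

Only 리큐르 occurs once, which is why the variable has 8 distinct values but just 1 unique value.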

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values]

제품명
Text

UNIQUE 

Distinct: 177
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

Length

Max length: 31
Median length: 23
Mean length: 7.2824859
Min length: 2

Characters and Unicode

Total characters: 1289
Distinct characters: 248
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 177
Unique (%): 100.0%

Sample

1st row: 라봉
2nd row: 보은주
3rd row: 죽향도가 죽향
4th row: 프리미엄 대대포13
5th row: 대대포9
ValueCountFrequency (%)
막걸리 9
 
3.2%
7
 
2.5%
25 6
 
2.2%
국순당 5
 
1.8%
프리미엄 4
 
1.4%
오미로제 3
 
1.1%
˚ 3
 
1.1%
3
 
1.1%
풍정사계 3
 
1.1%
드라이 3
 
1.1%
Other values (203) 231
83.4%

Most occurring characters

ValueCountFrequency (%)
100
 
7.8%
61
 
4.7%
53
 
4.1%
39
 
3.0%
39
 
3.0%
5 32
 
2.5%
30
 
2.3%
2 26
 
2.0%
( 25
 
1.9%
) 25
 
1.9%
Other values (238) 859
66.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 950 | 73.7%
Decimal Number | 131 | 10.2%
Space Separator | 100 | 7.8%
Open Punctuation | 25 | 1.9%
Close Punctuation | 25 | 1.9%
Other Punctuation | 24 | 1.9%
Uppercase Letter | 16 | 1.2%
Other Symbol | 9 | 0.7%
Lowercase Letter | 6 | 0.5%
Modifier Symbol | 3 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
61
 
6.4%
53
 
5.6%
39
 
4.1%
39
 
4.1%
30
 
3.2%
18
 
1.9%
16
 
1.7%
16
 
1.7%
14
 
1.5%
13
 
1.4%
Other values (207) 651
68.5%
Decimal Number
ValueCountFrequency (%)
5 32
24.4%
2 26
19.8%
0 21
16.0%
1 16
12.2%
4 11
 
8.4%
3 11
 
8.4%
7 9
 
6.9%
9 3
 
2.3%
6 2
 
1.5%
Uppercase Letter
ValueCountFrequency (%)
M 5
31.2%
E 2
 
12.5%
I 2
 
12.5%
L 2
 
12.5%
P 1
 
6.2%
U 1
 
6.2%
R 1
 
6.2%
O 1
 
6.2%
N 1
 
6.2%
Lowercase Letter
ValueCountFrequency (%)
m 2
33.3%
e 1
16.7%
u 1
16.7%
l 1
16.7%
b 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 14
58.3%
% 8
33.3%
. 2
 
8.3%
Space Separator
ValueCountFrequency (%)
100
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Other Symbol
ValueCountFrequency (%)
9
100.0%
Modifier Symbol
ValueCountFrequency (%)
˚ 3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 939 | 72.8%
Common | 317 | 24.6%
Latin | 22 | 1.7%
Han | 11 | 0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
61
 
6.5%
53
 
5.6%
39
 
4.2%
39
 
4.2%
30
 
3.2%
18
 
1.9%
16
 
1.7%
16
 
1.7%
14
 
1.5%
13
 
1.4%
Other values (201) 640
68.2%
Common
ValueCountFrequency (%)
100
31.5%
5 32
 
10.1%
2 26
 
8.2%
( 25
 
7.9%
) 25
 
7.9%
0 21
 
6.6%
1 16
 
5.0%
, 14
 
4.4%
4 11
 
3.5%
3 11
 
3.5%
Other values (7) 36
 
11.4%
Latin
ValueCountFrequency (%)
M 5
22.7%
m 2
 
9.1%
E 2
 
9.1%
I 2
 
9.1%
L 2
 
9.1%
P 1
 
4.5%
e 1
 
4.5%
u 1
 
4.5%
l 1
 
4.5%
b 1
 
4.5%
Other values (4) 4
18.2%
Han
ValueCountFrequency (%)
5
45.5%
2
 
18.2%
1
 
9.1%
1
 
9.1%
1
 
9.1%
1
 
9.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 939 | 72.8%
ASCII | 327 | 25.4%
CJK | 11 | 0.9%
CJK Compat | 9 | 0.7%
Modifier Letters | 3 | 0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
100
30.6%
5 32
 
9.8%
2 26
 
8.0%
( 25
 
7.6%
) 25
 
7.6%
0 21
 
6.4%
1 16
 
4.9%
, 14
 
4.3%
4 11
 
3.4%
3 11
 
3.4%
Other values (19) 46
14.1%
Hangul
ValueCountFrequency (%)
61
 
6.5%
53
 
5.6%
39
 
4.2%
39
 
4.2%
30
 
3.2%
18
 
1.9%
16
 
1.7%
16
 
1.7%
14
 
1.5%
13
 
1.4%
Other values (201) 640
68.2%
CJK Compat
ValueCountFrequency (%)
9
100.0%
CJK
ValueCountFrequency (%)
5
45.5%
2
 
18.2%
1
 
9.1%
1
 
9.1%
1
 
9.1%
1
 
9.1%
Modifier Letters
ValueCountFrequency (%)
˚ 3
100.0%

인증유형
Categorical

Distinct: 2
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB
144 
33 

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
144
81.4%
33
 
18.6%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values]

Correlations

 | 인증업체 | 품목명 | 인증유형
인증업체 | 1.000 | 0.892 | 0.818
품목명 | 0.892 | 1.000 | 0.494
인증유형 | 0.818 | 0.494 | 1.000

 | 품목명 | 인증유형
품목명 | 1.000 | 0.365
인증유형 | 0.365 | 1.000

 | 품목명 | 인증유형
품목명 | 1.000 | 0.365
인증유형 | 0.365 | 1.000
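
The report does not say which association measure produced each matrix. For pairs of categorical variables a common choice is Cramér's V, so the sketch below shows the plain (uncorrected) form of that statistic as an assumption, not as the report's confirmed method; the figures in the matrices may come from a bias-corrected variant or a different measure. The contingency tables are hypothetical and are not taken from the dataset.

```python
import math

def cramers_v(table: list[list[int]]) -> float:
    """Uncorrected Cramér's V for a contingency table of observed counts.

    Assumes every row and column total is positive and the table is at least 2x2.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Hypothetical tables: perfect association, then no association.
print(cramers_v([[10, 0], [0, 10]]), cramers_v([[5, 5], [5, 5]]))
```

The statistic ranges from 0 (no association) to 1 (one variable fully determines the other), which is the scale on which the matrix entries can be read.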

Missing values

The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

 | 인증번호 | 인증업체 | 품목명 | 제품명 | 인증유형
0 | 국가지정-가-269 | 농업회사법인 다도참주가(유) | 탁주 | 라봉 |
1 | 국가지정-가-268 | 중원당 | 탁주 | 보은주 |
2 | 국가지정-가-267 | 농업회사법인(주)죽향도가 | 탁주 | 죽향도가 죽향 |
3 | 국가지정-가-266 | 농업회사법인(주)죽향도가 | 탁주 | 프리미엄 대대포13 |
4 | 국가지정-가-265 | 농업회사법인(주)죽향도가 | 탁주 | 대대포9 |
5 | 국가지정-가-264 | 국도 양조장 | 탁주 | 국도11.5 ˚ 막걸리 |
6 | 국가지정-가-263 | 국도 양조장 | 탁주 | 국도9 ˚ 막걸리 |
7 | 국가지정-가-262 | 국도 양조장 | 탁주 | 국도6 ˚ 막걸리 |
8 | 국가지정-가-261 | 농업회사법인(유)친구들의술지란지교 | 약주 | 지란지교(약주) |
9 | 국가지정-가-260 | 양주골 이가전통주 | 약주 | 酒(주)줌치 17 |

 | 인증번호 | 인증업체 | 품목명 | 제품명 | 인증유형
167 | 국가지정-가-032 | 순천주조 | 탁주 | 나누우리 |
168 | 국가지정-가-028 | 여수주조공사 | 탁주 | 여수생막걸리 |
169 | 국가지정-가-027 | 배상면주가고창LB주식회사 | 과실주 | 복분자음 |
170 | 국가지정-가-022 | 한주양조 | 탁주 | 안성마춤생막걸리길따라벗따라, 안성생막걸리길따라벗따라 |
171 | 국가지정-가-014 | 울산탁주 | 탁주 | 태화루 |
172 | 국가지정-가-006 | 구암농산 | 살균탁주 | 우리랑대추막걸리 |
173 | 국가지정-가-005 | 서울장수주식회사 | 살균탁주 | 월매 쌀막걸리 |
174 | 국가지정-가-004 | 서울장수주식회사 | 탁주 | 장수 생막걸리 |
175 | 국가지정-가-002 | (주)국순당 | 살균탁주 | 국순당 막걸리 쌀 |
176 | 국가지정-가-001 | (주)국순당 | 탁주 | 국순당막걸리우국생 |