Overview

Dataset statistics

Number of variables5
Number of observations45
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory42.8 B

Variable types

Categorical3
Text2

Dataset

Description공공데이터 제공 신청접수에 따른 커피식품제조가공업 데이터로 업종,업소명,소재지(도로명),식품의종료, 식품의유형 항목을 제공합니다.
Author공공데이터포털
URLhttps://www.data.go.kr/data/15114232/fileData.do

Alerts

업종 has constant value ""Constant
식품의종류 is highly overall correlated with 식품의유형High correlation
식품의유형 is highly overall correlated with 식품의종류High correlation
식품의종류 is highly imbalanced (70.6%)Imbalance
식품의유형 is highly imbalanced (70.6%)Imbalance
업소명 has unique valuesUnique
소재지(도로명) has unique valuesUnique

Reproduction

Analysis started2024-04-20 18:08:56.918115
Analysis finished2024-04-20 18:08:57.548729
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size488.0 B
식품제조가공업
45 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식품제조가공업
2nd row식품제조가공업
3rd row식품제조가공업
4th row식품제조가공업
5th row식품제조가공업

Common Values

ValueCountFrequency (%)
식품제조가공업 45
100.0%

Length

2024-04-21T03:08:57.651311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:08:57.817110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
식품제조가공업 45
100.0%

업소명
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size488.0 B
2024-04-21T03:08:58.528864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length12
Mean length7.6444444
Min length2

Characters and Unicode

Total characters344
Distinct characters146
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row주식회사우진식품
2nd row사랑의일터
3rd row(주)웰빙엘에스
4th row쉘리스커피
5th row커피커퍼
ValueCountFrequency (%)
주식회사 2
 
3.6%
주식회사베라커피아울렛 1
 
1.8%
커피 1
 
1.8%
블랙포레스트커피컴퍼니 1
 
1.8%
정커피 1
 
1.8%
강릉 1
 
1.8%
로스팅 1
 
1.8%
랩(lab 1
 
1.8%
두리기바이오텍 1
 
1.8%
게락로스터리 1
 
1.8%
Other values (44) 44
80.0%
2024-04-21T03:08:59.565043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23
 
6.7%
22
 
6.4%
13
 
3.8%
10
 
2.9%
10
 
2.9%
10
 
2.9%
( 9
 
2.6%
) 9
 
2.6%
8
 
2.3%
7
 
2.0%
Other values (136) 223
64.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 296
86.0%
Uppercase Letter 15
 
4.4%
Space Separator 10
 
2.9%
Open Punctuation 9
 
2.6%
Close Punctuation 9
 
2.6%
Lowercase Letter 5
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
7.8%
22
 
7.4%
13
 
4.4%
10
 
3.4%
10
 
3.4%
8
 
2.7%
7
 
2.4%
6
 
2.0%
6
 
2.0%
6
 
2.0%
Other values (116) 185
62.5%
Uppercase Letter
ValueCountFrequency (%)
F 2
13.3%
A 2
13.3%
E 2
13.3%
C 1
6.7%
G 1
6.7%
O 1
6.7%
B 1
6.7%
L 1
6.7%
R 1
6.7%
J 1
6.7%
Other values (2) 2
13.3%
Lowercase Letter
ValueCountFrequency (%)
l 1
20.0%
e 1
20.0%
i 1
20.0%
n 1
20.0%
a 1
20.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 296
86.0%
Common 28
 
8.1%
Latin 20
 
5.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
7.8%
22
 
7.4%
13
 
4.4%
10
 
3.4%
10
 
3.4%
8
 
2.7%
7
 
2.4%
6
 
2.0%
6
 
2.0%
6
 
2.0%
Other values (116) 185
62.5%
Latin
ValueCountFrequency (%)
F 2
 
10.0%
A 2
 
10.0%
E 2
 
10.0%
C 1
 
5.0%
G 1
 
5.0%
O 1
 
5.0%
B 1
 
5.0%
L 1
 
5.0%
R 1
 
5.0%
J 1
 
5.0%
Other values (7) 7
35.0%
Common
ValueCountFrequency (%)
10
35.7%
( 9
32.1%
) 9
32.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 296
86.0%
ASCII 48
 
14.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
23
 
7.8%
22
 
7.4%
13
 
4.4%
10
 
3.4%
10
 
3.4%
8
 
2.7%
7
 
2.4%
6
 
2.0%
6
 
2.0%
6
 
2.0%
Other values (116) 185
62.5%
ASCII
ValueCountFrequency (%)
10
20.8%
( 9
18.8%
) 9
18.8%
F 2
 
4.2%
A 2
 
4.2%
E 2
 
4.2%
C 1
 
2.1%
G 1
 
2.1%
O 1
 
2.1%
B 1
 
2.1%
Other values (10) 10
20.8%
Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size488.0 B
2024-04-21T03:09:00.563930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length29
Mean length25.088889
Min length19

Characters and Unicode

Total characters1129
Distinct characters105
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row강원도 강릉시 주문진읍 공시내길 73-3
2nd row강원도 강릉시 사천면 방동길 43-2
3rd row강원도 강릉시 과학단지로 24-19 (대전동)
4th row강원도 강릉시 사천면 진리해변길 95
5th row강원도 강릉시 해안로 341 (강문동)
ValueCountFrequency (%)
강원도 45
 
17.9%
강릉시 45
 
17.9%
1층 6
 
2.4%
포남동 6
 
2.4%
2층 5
 
2.0%
성산면 5
 
2.0%
대전동 5
 
2.0%
해안로 3
 
1.2%
교동 3
 
1.2%
율곡로 3
 
1.2%
Other values (108) 126
50.0%
2024-04-21T03:09:01.846100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
207
18.3%
101
 
8.9%
48
 
4.3%
48
 
4.3%
47
 
4.2%
46
 
4.1%
2 44
 
3.9%
1 42
 
3.7%
35
 
3.1%
30
 
2.7%
Other values (95) 481
42.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 637
56.4%
Space Separator 207
 
18.3%
Decimal Number 193
 
17.1%
Close Punctuation 29
 
2.6%
Open Punctuation 29
 
2.6%
Other Punctuation 20
 
1.8%
Dash Punctuation 13
 
1.2%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
101
15.9%
48
 
7.5%
48
 
7.5%
47
 
7.4%
46
 
7.2%
35
 
5.5%
30
 
4.7%
27
 
4.2%
14
 
2.2%
12
 
1.9%
Other values (79) 229
35.9%
Decimal Number
ValueCountFrequency (%)
2 44
22.8%
1 42
21.8%
4 27
14.0%
3 17
 
8.8%
9 15
 
7.8%
0 12
 
6.2%
7 11
 
5.7%
6 10
 
5.2%
5 9
 
4.7%
8 6
 
3.1%
Space Separator
ValueCountFrequency (%)
207
100.0%
Close Punctuation
ValueCountFrequency (%)
) 29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Other Punctuation
ValueCountFrequency (%)
, 20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Uppercase Letter
ValueCountFrequency (%)
F 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 637
56.4%
Common 491
43.5%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
101
15.9%
48
 
7.5%
48
 
7.5%
47
 
7.4%
46
 
7.2%
35
 
5.5%
30
 
4.7%
27
 
4.2%
14
 
2.2%
12
 
1.9%
Other values (79) 229
35.9%
Common
ValueCountFrequency (%)
207
42.2%
2 44
 
9.0%
1 42
 
8.6%
) 29
 
5.9%
( 29
 
5.9%
4 27
 
5.5%
, 20
 
4.1%
3 17
 
3.5%
9 15
 
3.1%
- 13
 
2.6%
Other values (5) 48
 
9.8%
Latin
ValueCountFrequency (%)
F 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 637
56.4%
ASCII 492
43.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
207
42.1%
2 44
 
8.9%
1 42
 
8.5%
) 29
 
5.9%
( 29
 
5.9%
4 27
 
5.5%
, 20
 
4.1%
3 17
 
3.5%
9 15
 
3.0%
- 13
 
2.6%
Other values (6) 49
 
10.0%
Hangul
ValueCountFrequency (%)
101
15.9%
48
 
7.5%
48
 
7.5%
47
 
7.4%
46
 
7.2%
35
 
5.5%
30
 
4.7%
27
 
4.2%
14
 
2.2%
12
 
1.9%
Other values (79) 229
35.9%

식품의종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size488.0 B
음료류
40 
음료류 + 수산가공식품류
 
1
음료류 + 농산가공식품류
 
1
특수용도식품 + 당류 + 음료류 + 특수영양식품 + 조미식품 + 농산가공식품류 + 기타식품류
 
1
커피원두 + 음료류
 
1

Length

Max length59
Median length5
Mean length7.2888889
Min length5

Unique

Unique5 ?
Unique (%)11.1%

Sample

1st row 음료류 + 수산가공식품류
2nd row 음료류 + 농산가공식품류
3rd row 특수용도식품 + 당류 + 음료류 + 특수영양식품 + 조미식품 + 농산가공식품류 + 기타식품류
4th row 음료류
5th row 음료류

Common Values

ValueCountFrequency (%)
음료류 40
88.9%
음료류 + 수산가공식품류 1
 
2.2%
음료류 + 농산가공식품류 1
 
2.2%
특수용도식품 + 당류 + 음료류 + 특수영양식품 + 조미식품 + 농산가공식품류 + 기타식품류 1
 
2.2%
커피원두 + 음료류 1
 
2.2%
음료류 + 조미식품 + 농산가공식품류 1
 
2.2%

Length

2024-04-21T03:09:02.280063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:09:02.475068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
음료류 45
67.2%
11
 
16.4%
농산가공식품류 3
 
4.5%
조미식품 2
 
3.0%
수산가공식품류 1
 
1.5%
특수용도식품 1
 
1.5%
당류 1
 
1.5%
특수영양식품 1
 
1.5%
기타식품류 1
 
1.5%
커피원두 1
 
1.5%

식품의유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size488.0 B
커피
40 
커피 + 조미건어포 + 건어포
 
1
침출차 + 커피
 
1
커피 + 곡류가공품 + 두류가공품 + 기타 농산가공품 + 기타가공품
 
1
액상차 + 커피
 
1

Length

Max length43
Median length4
Mean length6.1555556
Min length4

Unique

Unique5 ?
Unique (%)11.1%

Sample

1st row 커피 + 조미건어포 + 건어포
2nd row 침출차 + 커피
3rd row 커피 + 곡류가공품 + 두류가공품 + 기타 농산가공품 + 기타가공품
4th row 커피
5th row 커피

Common Values

ValueCountFrequency (%)
커피 40
88.9%
커피 + 조미건어포 + 건어포 1
 
2.2%
침출차 + 커피 1
 
2.2%
커피 + 곡류가공품 + 두류가공품 + 기타 농산가공품 + 기타가공품 1
 
2.2%
액상차 + 커피 1
 
2.2%
커피 + 복합조미식품 + 곡류가공품 + 두류가공품 1
 
2.2%

Length

2024-04-21T03:09:02.706225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:09:02.912639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
커피 45
66.2%
11
 
16.2%
곡류가공품 2
 
2.9%
두류가공품 2
 
2.9%
조미건어포 1
 
1.5%
건어포 1
 
1.5%
침출차 1
 
1.5%
기타 1
 
1.5%
농산가공품 1
 
1.5%
기타가공품 1
 
1.5%
Other values (2) 2
 
2.9%

Correlations

2024-04-21T03:09:03.064669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명소재지(도로명)식품의종류식품의유형
업소명1.0001.0001.0001.000
소재지(도로명)1.0001.0001.0001.000
식품의종류1.0001.0001.0000.994
식품의유형1.0001.0000.9941.000
2024-04-21T03:09:03.314642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
식품의유형식품의종류
식품의유형1.0000.880
식품의종류0.8801.000
2024-04-21T03:09:03.454817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
식품의종류식품의유형
식품의종류1.0000.880
식품의유형0.8801.000

Missing values

2024-04-21T03:08:57.318042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T03:08:57.485419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종업소명소재지(도로명)식품의종류식품의유형
0식품제조가공업주식회사우진식품강원도 강릉시 주문진읍 공시내길 73-3음료류 + 수산가공식품류커피 + 조미건어포 + 건어포
1식품제조가공업사랑의일터강원도 강릉시 사천면 방동길 43-2음료류 + 농산가공식품류침출차 + 커피
2식품제조가공업(주)웰빙엘에스강원도 강릉시 과학단지로 24-19 (대전동)특수용도식품 + 당류 + 음료류 + 특수영양식품 + 조미식품 + 농산가공식품류 + 기타식품류커피 + 곡류가공품 + 두류가공품 + 기타 농산가공품 + 기타가공품
3식품제조가공업쉘리스커피강원도 강릉시 사천면 진리해변길 95음료류커피
4식품제조가공업커피커퍼강원도 강릉시 해안로 341 (강문동)음료류커피
5식품제조가공업커피앤피플강원도 강릉시 성산면 삼왕길 193-22음료류커피
6식품제조가공업보헤미안커피점강원도 강릉시 연곡면 홍질목길 55-11음료류커피
7식품제조가공업크레마코스타강원도 강릉시 대송길46번길 14-2 (대전동)음료류커피
8식품제조가공업(합명회사)산토리니커피강원도 강릉시 경강로 2667 (견소동)음료류커피
9식품제조가공업동진교역강원도 강릉시 성산면 구산안길 28음료류커피
업종업소명소재지(도로명)식품의종류식품의유형
35식품제조가공업보사노바로스팅팩토리강원도 강릉시 경강로2660번길 7 (견소동)음료류커피
36식품제조가공업(주)에스티알강원도 강릉시 사임당로 641-22, 벤처제2공장동 F3-3호 (대전동)음료류커피
37식품제조가공업봉봉방앗간아르(R)강원도 강릉시 경강로2024번길 3, 1층 (명주동)음료류커피
38식품제조가공업노보발효커피강원도 강릉시 안현로 110, 1층 (안현동)음료류커피
39식품제조가공업슈테츠커피로스터스강원도 강릉시 성덕포남로174번길 14 (포남동)음료류커피
40식품제조가공업남산강원도 강릉시 강변로194번길 3, 1층 (내곡동)음료류커피
41식품제조가공업서플라이빈강원도 강릉시 강릉대로419번길 9, 101호 (포남동)음료류커피
42식품제조가공업상상로지스강원도 강릉시 연곡면 동덕2길 2, 1층음료류커피
43식품제조가공업주가커피(JUGA COFFEE)강원도 강릉시 임영로 227-2 (교동)음료류커피
44식품제조가공업강릉발효커피강원도 강릉시 남부로 55, 2층 (내곡동)음료류커피