Overview

Dataset statistics

Number of variables: 4
Number of observations: 31
Missing cells: 11
Missing cells (%): 8.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.1 KiB
Average record size in memory: 36.3 B
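
The derived figures in this block follow from the raw counts. The sketch below recomputes them using only the numbers reported here; the variable names are illustrative and not part of the report, and the total size is the rounded 1.1 KiB, so the results are approximate.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 4
n_observations = 31
missing_cells = 11
total_size_bytes = 1.1 * 1024  # 1.1 KiB as reported (already rounded)

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_bytes / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The missing-cell percentage is taken over all cells (variables times observations), not over rows, which is why 11 missing values give 8.9% here but 35.5% for the single column that contains them.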

Variable types

Text: 4

Dataset

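Description: 부산광역시연제구식품제조가공업체현황_20201027 (Status of food manufacturing and processing businesses in Yeonje-gu, Busan Metropolitan City, as of 2020-10-27)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15047915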

Alerts

소재지전화 has 11 (35.5%) missing values [Missing]
업소명 has unique values [Unique]
소재지(도로명) has unique values [Unique]

Reproduction

Analysis started: 2023-12-10 17:44:41.555681
Analysis finished: 2023-12-10 17:44:42.365883
Duration: 0.81 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업소명 (business name)
Text

UNIQUE

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 18
Median length: 14
Mean length: 5.8709677
Min length: 2

Characters and Unicode

Total characters: 182
Distinct characters: 101
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
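
As a rough illustration of how such character properties can be read, the following sketch counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical and this is not how the profiling tool itself is implemented. The input is the 4th sample row of this variable.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count Unicode general categories (two-letter codes) in a string."""
    return Counter(unicodedata.category(ch) for ch in text)


# "Lo" is "Other Letter" (Hangul syllables fall here),
# "Lu" is "Uppercase Letter".
counts = category_counts("해광ANS")
print(counts)
```

The two-letter codes correspond to the category names used in the tables of this report, for example `Lo` for Other Letter and `Lu` for Uppercase Letter.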

Unique

Unique: 31
Unique (%): 100.0%

Sample

1st row: 성원식품
2nd row: 우현식품
3rd row: 일품식품
4th row: 해광ANS
5th row: 성광식품
Value | Count | Frequency (%)
성원식품 | 1 | 3.0%
어나더미네스 | 1 | 3.0%
아임군 | 1 | 3.0%
창창유통 | 1 | 3.0%
인투인푸드 | 1 | 3.0%
스탠다드커피 | 1 | 3.0%
밥애반찬협동조합 | 1 | 3.0%
주)상하에프앤비 | 1 | 3.0%
ms(명성)바이오 | 1 | 3.0%
커피 | 1 | 3.0%
Other values (23) | 23 | 69.7%

Most occurring characters

ValueCountFrequency (%)
8
 
4.4%
7
 
3.8%
) 6
 
3.3%
( 6
 
3.3%
6
 
3.3%
6
 
3.3%
6
 
3.3%
5
 
2.7%
E 5
 
2.7%
O 4
 
2.2%
Other values (91) 123
67.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 140
76.9%
Uppercase Letter 27
 
14.8%
Close Punctuation 6
 
3.3%
Open Punctuation 6
 
3.3%
Space Separator 2
 
1.1%
Decimal Number 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
5.7%
7
 
5.0%
6
 
4.3%
6
 
4.3%
6
 
4.3%
5
 
3.6%
4
 
2.9%
4
 
2.9%
3
 
2.1%
3
 
2.1%
Other values (74) 88
62.9%
Uppercase Letter
ValueCountFrequency (%)
E 5
18.5%
O 4
14.8%
S 4
14.8%
M 2
 
7.4%
F 2
 
7.4%
C 2
 
7.4%
N 2
 
7.4%
P 1
 
3.7%
K 1
 
3.7%
G 1
 
3.7%
Other values (3) 3
11.1%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 140
76.9%
Latin 27
 
14.8%
Common 15
 
8.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
5.7%
7
 
5.0%
6
 
4.3%
6
 
4.3%
6
 
4.3%
5
 
3.6%
4
 
2.9%
4
 
2.9%
3
 
2.1%
3
 
2.1%
Other values (74) 88
62.9%
Latin
ValueCountFrequency (%)
E 5
18.5%
O 4
14.8%
S 4
14.8%
M 2
 
7.4%
F 2
 
7.4%
C 2
 
7.4%
N 2
 
7.4%
P 1
 
3.7%
K 1
 
3.7%
G 1
 
3.7%
Other values (3) 3
11.1%
Common
ValueCountFrequency (%)
) 6
40.0%
( 6
40.0%
2
 
13.3%
2 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 140
76.9%
ASCII 42
 
23.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8
 
5.7%
7
 
5.0%
6
 
4.3%
6
 
4.3%
6
 
4.3%
5
 
3.6%
4
 
2.9%
4
 
2.9%
3
 
2.1%
3
 
2.1%
Other values (74) 88
62.9%
ASCII
ValueCountFrequency (%)
) 6
14.3%
( 6
14.3%
E 5
11.9%
O 4
9.5%
S 4
9.5%
M 2
 
4.8%
F 2
 
4.8%
C 2
 
4.8%
2
 
4.8%
N 2
 
4.8%
Other values (7) 7
16.7%

소재지(도로명) (location, road-name address)
Text

UNIQUE

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 42
Median length: 34
Mean length: 29.709677
Min length: 23

Characters and Unicode

Total characters: 921
Distinct characters: 76
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 31
Unique (%): 100.0%

Sample

1st row: 부산광역시 연제구 쌍미천로16번길 17 (연산동,남정해바라기맨션 상가 1호)
2nd row: 부산광역시 연제구 중앙대로1150번길 50 (연산동)
3rd row: 부산광역시 연제구 월드컵대로46번길 33 (연산동)
4th row: 부산광역시 연제구 쌍미천로7번길 66 (연산동,1층)
5th row: 부산광역시 연제구 거제대로108번길 41 (거제동,지상1층)
Value | Count | Frequency (%)
부산광역시 | 31 | 17.2%
연제구 | 31 | 17.2%
연산동 | 16 | 8.9%
1층 | 10 | 5.6%
거제동 | 10 | 5.6%
2층 | 5 | 2.8%
거제시장로 | 2 | 1.1%
과정로 | 2 | 1.1%
47 | 2 | 1.1%
6 | 2 | 1.1%
Other values (67) | 69 | 38.3%

Most occurring characters

ValueCountFrequency (%)
149
 
16.2%
53
 
5.8%
51
 
5.5%
50
 
5.4%
1 43
 
4.7%
33
 
3.6%
) 32
 
3.5%
( 32
 
3.5%
31
 
3.4%
31
 
3.4%
Other values (66) 416
45.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 545
59.2%
Space Separator 149
 
16.2%
Decimal Number 136
 
14.8%
Close Punctuation 32
 
3.5%
Open Punctuation 32
 
3.5%
Other Punctuation 23
 
2.5%
Dash Punctuation 4
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
9.7%
51
 
9.4%
50
 
9.2%
33
 
6.1%
31
 
5.7%
31
 
5.7%
31
 
5.7%
31
 
5.7%
31
 
5.7%
31
 
5.7%
Other values (51) 172
31.6%
Decimal Number
ValueCountFrequency (%)
1 43
31.6%
2 21
15.4%
3 18
13.2%
4 12
 
8.8%
0 9
 
6.6%
5 8
 
5.9%
7 8
 
5.9%
6 8
 
5.9%
9 6
 
4.4%
8 3
 
2.2%
Space Separator
ValueCountFrequency (%)
149
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Other Punctuation
ValueCountFrequency (%)
, 23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 545
59.2%
Common 376
40.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
9.7%
51
 
9.4%
50
 
9.2%
33
 
6.1%
31
 
5.7%
31
 
5.7%
31
 
5.7%
31
 
5.7%
31
 
5.7%
31
 
5.7%
Other values (51) 172
31.6%
Common
ValueCountFrequency (%)
149
39.6%
1 43
 
11.4%
) 32
 
8.5%
( 32
 
8.5%
, 23
 
6.1%
2 21
 
5.6%
3 18
 
4.8%
4 12
 
3.2%
0 9
 
2.4%
5 8
 
2.1%
Other values (5) 29
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 545
59.2%
ASCII 376
40.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
149
39.6%
1 43
 
11.4%
) 32
 
8.5%
( 32
 
8.5%
, 23
 
6.1%
2 21
 
5.6%
3 18
 
4.8%
4 12
 
3.2%
0 9
 
2.4%
5 8
 
2.1%
Other values (5) 29
 
7.7%
Hangul
ValueCountFrequency (%)
53
 
9.7%
51
 
9.4%
50
 
9.2%
33
 
6.1%
31
 
5.7%
31
 
5.7%
31
 
5.7%
31
 
5.7%
31
 
5.7%
31
 
5.7%
Other values (51) 172
31.6%

소재지전화 (location phone number)
Text

MISSING

Distinct: 20
Distinct (%): 100.0%
Missing: 11
Missing (%): 35.5%
Memory size: 380.0 B

Length

Max length: 14
Median length: 14
Mean length: 13.95
Min length: 13

Characters and Unicode

Total characters: 279
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 20
Unique (%): 100.0%

Sample

1st row: 051- 852-1005
2nd row: 051 -927 -9292
3rd row: 051 -852 -0010
4th row: 051 -867 -8200
5th row: 051 -867 -3252
Value | Count | Frequency (%)
051 | 16 | 30.8%
070 | 3 | 5.8%
853 | 2 | 3.8%
9292 | 2 | 3.8%
867 | 2 | 3.8%
0772 | 1 | 1.9%
8802-3036 | 1 | 1.9%
8712-8483 | 1 | 1.9%
851 | 1 | 1.9%
8830 | 1 | 1.9%
Other values (22) | 22 | 42.3%

Most occurring characters

ValueCountFrequency (%)
0 41
14.7%
- 40
14.3%
5 36
12.9%
35
12.5%
1 25
9.0%
2 24
8.6%
7 21
7.5%
8 20
7.2%
3 12
 
4.3%
6 10
 
3.6%
Other values (2) 15
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 204
73.1%
Dash Punctuation 40
 
14.3%
Space Separator 35
 
12.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 41
20.1%
5 36
17.6%
1 25
12.3%
2 24
11.8%
7 21
10.3%
8 20
9.8%
3 12
 
5.9%
6 10
 
4.9%
9 9
 
4.4%
4 6
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%
Space Separator
ValueCountFrequency (%)
35
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 279
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 41
14.7%
- 40
14.3%
5 36
12.9%
35
12.5%
1 25
9.0%
2 24
8.6%
7 21
7.5%
8 20
7.2%
3 12
 
4.3%
6 10
 
3.6%
Other values (2) 15
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 279
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 41
14.7%
- 40
14.3%
5 36
12.9%
35
12.5%
1 25
9.0%
2 24
8.6%
7 21
7.5%
8 20
7.2%
3 12
 
4.3%
6 10
 
3.6%
Other values (2) 15
 
5.4%

식품의종류 (type of food)
Text

Distinct: 21
Distinct (%): 67.7%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 45
Median length: 15
Mean length: 9.6451613
Min length: 2

Characters and Unicode

Total characters: 299
Distinct characters: 42
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 16
Unique (%): 51.6%

Sample

1st row: 규격외일반가공식품, 두부류또는묵류, 기타식품류
2nd row: 조미식품
3rd row: 장류
4th row: 조미식품, 규격외일반가공식품
5th row: 빵또는떡류
Value | Count | Frequency (%)
조미식품 | 9 | 15.3%
음료류 | 8 | 13.6%
커피 | 8 | 13.6%
수산가공식품류 | 5 | 8.5%
규격외일반가공식품 | 5 | 8.5%
즉석식품류 | 3 | 5.1%
기타식품류 | 3 | 5.1%
장류 | 2 | 3.4%
또는 | 2 | 3.4%
빵또는떡류 | 2 | 3.4%
Other values (11) | 12 | 20.3%

Most occurring characters

ValueCountFrequency (%)
34
 
11.4%
28
 
9.4%
27
 
9.0%
27
 
9.0%
, 24
 
8.0%
11
 
3.7%
11
 
3.7%
10
 
3.3%
9
 
3.0%
9
 
3.0%
Other values (32) 109
36.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 247
82.6%
Space Separator 28
 
9.4%
Other Punctuation 24
 
8.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
13.8%
27
 
10.9%
27
 
10.9%
11
 
4.5%
11
 
4.5%
10
 
4.0%
9
 
3.6%
9
 
3.6%
9
 
3.6%
8
 
3.2%
Other values (30) 92
37.2%
Space Separator
ValueCountFrequency (%)
28
100.0%
Other Punctuation
ValueCountFrequency (%)
, 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 247
82.6%
Common 52
 
17.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
13.8%
27
 
10.9%
27
 
10.9%
11
 
4.5%
11
 
4.5%
10
 
4.0%
9
 
3.6%
9
 
3.6%
9
 
3.6%
8
 
3.2%
Other values (30) 92
37.2%
Common
ValueCountFrequency (%)
28
53.8%
, 24
46.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 247
82.6%
ASCII 52
 
17.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
34
 
13.8%
27
 
10.9%
27
 
10.9%
11
 
4.5%
11
 
4.5%
10
 
4.0%
9
 
3.6%
9
 
3.6%
9
 
3.6%
8
 
3.2%
Other values (30) 92
37.2%
ASCII
ValueCountFrequency (%)
28
53.8%
, 24
46.2%

Correlations

Variable | 업소명 | 소재지(도로명) | 소재지전화 | 식품의종류
업소명 | 1.000 | 1.000 | 1.000 | 1.000
소재지(도로명) | 1.000 | 1.000 | 1.000 | 1.000
소재지전화 | 1.000 | 1.000 | 1.000 | 1.000
식품의종류 | 1.000 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
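
Since the plots themselves are not reproduced here, the idea of a nullity matrix can be sketched in text form. The example below is only an illustration under stated assumptions: it covers the 소재지전화 column for rows 21 to 30, with presence flags read off the Sample section of this report, and the helper `nullity_cell` is hypothetical.

```python
# Presence of 소재지전화 for rows 21-30, as listed in the Sample section
# (True = value present, False = <NA>).
phone_present = {
    21: False, 22: False, 23: False, 24: True, 25: False,
    26: False, 27: True, 28: False, 29: False, 30: False,
}


def nullity_cell(present: bool) -> str:
    """Render one cell of a text nullity matrix: '#' if present, '.' if missing."""
    return "#" if present else "."


for idx, present in phone_present.items():
    print(f"{idx:>2} {nullity_cell(present)}")

# Count the missing entries in this slice of the column.
missing = sum(not p for p in phone_present.values())
print(f"missing in this slice: {missing} of {len(phone_present)}")
```

Read top to bottom, the column of `#` and `.` marks plays the role of one column of the nullity matrix: gaps show up as runs of `.`.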

Sample

First rows

Index | 업소명 | 소재지(도로명) | 소재지전화 | 식품의종류
0 | 성원식품 | 부산광역시 연제구 쌍미천로16번길 17 (연산동,남정해바라기맨션 상가 1호) | 051- 852-1005 | 규격외일반가공식품, 두부류또는묵류, 기타식품류
1 | 우현식품 | 부산광역시 연제구 중앙대로1150번길 50 (연산동) | 051 -927 -9292 | 조미식품
2 | 일품식품 | 부산광역시 연제구 월드컵대로46번길 33 (연산동) | 051 -852 -0010 | 장류
3 | 해광ANS | 부산광역시 연제구 쌍미천로7번길 66 (연산동,1층) | 051 -867 -8200 | 조미식품, 규격외일반가공식품
4 | 성광식품 | 부산광역시 연제구 거제대로108번길 41 (거제동,지상1층) | 051 -867 -3252 | 빵또는떡류
5 | 깜돌이식품 | 부산광역시 연제구 과정로251번길 6 (연산동,지상1층) | 051 -752 -3907 | 장류, 규격외일반가공식품, 빵또는떡류, 음료류
6 | 라파식품 | 부산광역시 연제구 거제시장로 21 (거제동,(3층)) | 051 -853 -2678 | 다류
7 | 커피긱스(COFFEE GEEKS) | 부산광역시 연제구 교대로24번길 7 (거제동) | 051 -506 -7581 | 음료류, 커피
8 | 루트커피로스터리 | 부산광역시 연제구 교대로 12, 1층 (거제동) | 070 -7352-1422 | 커피
9 | 민진로스팅 | 부산광역시 연제구 거제대로 296, 대승 아이티빌딩 1층 (거제동) | 070-7721-5955 | 음료류, 커피

Last rows

Index | 업소명 | 소재지(도로명) | 소재지전화 | 식품의종류
21 | 고담 | 부산광역시 연제구 거제천로 153, 1층 (거제동) | <NA> | 과자류, 빵류 또는 떡류
22 | 사이먼 커피 | 부산광역시 연제구 중앙대로1056번길 2, 1층 (연산동) | <NA> | 음료류
23 | MS(명성)바이오 | 부산광역시 연제구 거제시장로 43, 3층 303호 (거제동) | <NA> | 음료류, 조미식품
24 | (주)상하에프앤비 | 부산광역시 연제구 거제대로214번길 6, 지하1층 11호 (거제동, 경남) | 051 -853 -6431 | 수산가공식품류
25 | 밥애반찬협동조합 | 부산광역시 연제구 여고로 134-1, 한빛빌딩 2층 (거제동) | <NA> | 수산가공식품류
26 | 스탠다드커피 | 부산광역시 연제구 배산로 24, 1층 (연산동) | <NA> | 음료류
27 | 인투인푸드 | 부산광역시 연제구 토곡남로20번가길 1-3, 1층 (연산동) | 051 -527 -6860 | 즉석식품류, 조미식품
28 | 창창유통 | 부산광역시 연제구 과정로 242, 2층 (연산동) | <NA> | 조미식품
29 | 아임군 | 부산광역시 연제구 거제대로 128-7, 1층 (거제동) | <NA> | 수산가공식품류, 즉석식품류
30 | (주)동심컴퍼니(제2공장) | 부산광역시 연제구 연수로 193-1, 1층 (연산동) | <NA> | 수산가공식품류, 조미식품, 절임류 또는 조림류
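
The 소재지전화 values in this sample are spaced inconsistently around the hyphens. If they need to be compared or joined against other data, one possible cleanup is to strip all whitespace. This is a minimal sketch, not part of the report or of ydata-profiling; the function name `normalize_phone` is hypothetical, and the inputs are copied from the sample rows as displayed.

```python
import re
from typing import Optional


def normalize_phone(raw: Optional[str]) -> Optional[str]:
    """Remove all whitespace from a phone string; pass missing values through."""
    if raw is None:
        return None
    return re.sub(r"\s+", "", raw)


# Values as displayed in the sample rows; None stands in for <NA>.
samples = ["051- 852-1005", "051 -927 -9292", "070 -7352-1422", "070-7721-5955", None]
cleaned = [normalize_phone(s) for s in samples]
print(cleaned)
```

Removing whitespace only is a deliberately conservative choice: it leaves the hyphenated grouping untouched and does not attempt to validate area codes.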