Overview

Dataset statistics

Number of variables3
Number of observations201
Missing cells92
Missing cells (%)15.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.8 KiB
Average record size in memory24.7 B

Variable types

Text3

Dataset

Description부산광역시_기장군_식품제조가공업현황_20230908
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15047917

Alerts

소재지전화 has 92 (45.8%) missing valuesMissing

Reproduction

Analysis started2023-12-10 17:08:45.576401
Analysis finished2023-12-10 17:08:46.649948
Duration1.07 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct199
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-11T02:08:47.026603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length22
Mean length7.641791
Min length2

Characters and Unicode

Total characters1536
Distinct characters311
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique197 ?
Unique (%)98.0%

Sample

1st row해양식품
2nd row기장특산물영어조합
3rd row신앙촌소비조합주식회사
4th row오양식품
5th row염씨네식품
ValueCountFrequency (%)
주식회사 27
 
10.5%
주)지이스트냉동 2
 
0.8%
기장식품 2
 
0.8%
푸드 2
 
0.8%
농업회사법인 2
 
0.8%
신카스테라 1
 
0.4%
꼬마푸드 1
 
0.4%
김현모 1
 
0.4%
더소울푸드 1
 
0.4%
크레이지피넛 1
 
0.4%
Other values (217) 217
84.4%
2023-12-11T02:08:47.681614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
83
 
5.4%
65
 
4.2%
( 63
 
4.1%
) 63
 
4.1%
56
 
3.6%
43
 
2.8%
42
 
2.7%
41
 
2.7%
37
 
2.4%
36
 
2.3%
Other values (301) 1007
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1229
80.0%
Uppercase Letter 101
 
6.6%
Open Punctuation 63
 
4.1%
Close Punctuation 63
 
4.1%
Space Separator 56
 
3.6%
Other Punctuation 8
 
0.5%
Decimal Number 8
 
0.5%
Lowercase Letter 7
 
0.5%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
83
 
6.8%
65
 
5.3%
43
 
3.5%
42
 
3.4%
41
 
3.3%
37
 
3.0%
36
 
2.9%
34
 
2.8%
31
 
2.5%
27
 
2.2%
Other values (263) 790
64.3%
Uppercase Letter
ValueCountFrequency (%)
O 14
13.9%
E 10
 
9.9%
C 8
 
7.9%
F 8
 
7.9%
R 7
 
6.9%
S 7
 
6.9%
A 6
 
5.9%
N 6
 
5.9%
D 5
 
5.0%
I 5
 
5.0%
Other values (11) 25
24.8%
Lowercase Letter
ValueCountFrequency (%)
e 2
28.6%
s 1
14.3%
t 1
14.3%
n 1
14.3%
a 1
14.3%
d 1
14.3%
Decimal Number
ValueCountFrequency (%)
3 3
37.5%
9 2
25.0%
1 1
 
12.5%
6 1
 
12.5%
5 1
 
12.5%
Other Punctuation
ValueCountFrequency (%)
& 4
50.0%
. 4
50.0%
Open Punctuation
ValueCountFrequency (%)
( 63
100.0%
Close Punctuation
ValueCountFrequency (%)
) 63
100.0%
Space Separator
ValueCountFrequency (%)
56
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1229
80.0%
Common 199
 
13.0%
Latin 108
 
7.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
83
 
6.8%
65
 
5.3%
43
 
3.5%
42
 
3.4%
41
 
3.3%
37
 
3.0%
36
 
2.9%
34
 
2.8%
31
 
2.5%
27
 
2.2%
Other values (263) 790
64.3%
Latin
ValueCountFrequency (%)
O 14
13.0%
E 10
 
9.3%
C 8
 
7.4%
F 8
 
7.4%
R 7
 
6.5%
S 7
 
6.5%
A 6
 
5.6%
N 6
 
5.6%
D 5
 
4.6%
I 5
 
4.6%
Other values (17) 32
29.6%
Common
ValueCountFrequency (%)
( 63
31.7%
) 63
31.7%
56
28.1%
& 4
 
2.0%
. 4
 
2.0%
3 3
 
1.5%
9 2
 
1.0%
1 1
 
0.5%
6 1
 
0.5%
- 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1229
80.0%
ASCII 307
 
20.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
83
 
6.8%
65
 
5.3%
43
 
3.5%
42
 
3.4%
41
 
3.3%
37
 
3.0%
36
 
2.9%
34
 
2.8%
31
 
2.5%
27
 
2.2%
Other values (263) 790
64.3%
ASCII
ValueCountFrequency (%)
( 63
20.5%
) 63
20.5%
56
18.2%
O 14
 
4.6%
E 10
 
3.3%
C 8
 
2.6%
F 8
 
2.6%
R 7
 
2.3%
S 7
 
2.3%
A 6
 
2.0%
Other values (28) 65
21.2%
Distinct197
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-11T02:08:48.229350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length41
Mean length26.517413
Min length19

Characters and Unicode

Total characters5330
Distinct characters141
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique193 ?
Unique (%)96.0%

Sample

1st row부산광역시 기장군 장안읍 해맞이로 335
2nd row부산광역시 기장군 장안읍 오리길 96
3rd row부산광역시 기장군 기장읍 죽성로 197
4th row부산광역시 기장군 장안읍 반룡산단1로 20
5th row부산광역시 기장군 기장읍 대변3길 45
ValueCountFrequency (%)
부산광역시 201
17.1%
기장군 201
17.1%
기장읍 67
 
5.7%
1층 61
 
5.2%
장안읍 53
 
4.5%
정관읍 53
 
4.5%
일광읍 23
 
2.0%
대변로 9
 
0.8%
기장해안로 8
 
0.7%
2층 8
 
0.7%
Other values (300) 492
41.8%
2023-12-11T02:08:49.044017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
975
18.3%
345
 
6.5%
285
 
5.3%
266
 
5.0%
1 256
 
4.8%
231
 
4.3%
210
 
3.9%
201
 
3.8%
201
 
3.8%
201
 
3.8%
Other values (131) 2159
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3262
61.2%
Space Separator 975
 
18.3%
Decimal Number 866
 
16.2%
Other Punctuation 132
 
2.5%
Dash Punctuation 49
 
0.9%
Uppercase Letter 21
 
0.4%
Math Symbol 9
 
0.2%
Open Punctuation 8
 
0.2%
Close Punctuation 8
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
345
 
10.6%
285
 
8.7%
266
 
8.2%
231
 
7.1%
210
 
6.4%
201
 
6.2%
201
 
6.2%
201
 
6.2%
196
 
6.0%
139
 
4.3%
Other values (109) 987
30.3%
Decimal Number
ValueCountFrequency (%)
1 256
29.6%
2 113
13.0%
6 90
 
10.4%
3 80
 
9.2%
4 71
 
8.2%
5 64
 
7.4%
0 58
 
6.7%
7 48
 
5.5%
9 46
 
5.3%
8 40
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
A 9
42.9%
B 7
33.3%
C 2
 
9.5%
T 1
 
4.8%
S 1
 
4.8%
E 1
 
4.8%
Space Separator
ValueCountFrequency (%)
975
100.0%
Other Punctuation
ValueCountFrequency (%)
, 132
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 49
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3262
61.2%
Common 2047
38.4%
Latin 21
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
345
 
10.6%
285
 
8.7%
266
 
8.2%
231
 
7.1%
210
 
6.4%
201
 
6.2%
201
 
6.2%
201
 
6.2%
196
 
6.0%
139
 
4.3%
Other values (109) 987
30.3%
Common
ValueCountFrequency (%)
975
47.6%
1 256
 
12.5%
, 132
 
6.4%
2 113
 
5.5%
6 90
 
4.4%
3 80
 
3.9%
4 71
 
3.5%
5 64
 
3.1%
0 58
 
2.8%
- 49
 
2.4%
Other values (6) 159
 
7.8%
Latin
ValueCountFrequency (%)
A 9
42.9%
B 7
33.3%
C 2
 
9.5%
T 1
 
4.8%
S 1
 
4.8%
E 1
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3262
61.2%
ASCII 2068
38.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
975
47.1%
1 256
 
12.4%
, 132
 
6.4%
2 113
 
5.5%
6 90
 
4.4%
3 80
 
3.9%
4 71
 
3.4%
5 64
 
3.1%
0 58
 
2.8%
- 49
 
2.4%
Other values (12) 180
 
8.7%
Hangul
ValueCountFrequency (%)
345
 
10.6%
285
 
8.7%
266
 
8.2%
231
 
7.1%
210
 
6.4%
201
 
6.2%
201
 
6.2%
201
 
6.2%
196
 
6.0%
139
 
4.3%
Other values (109) 987
30.3%

소재지전화
Text

MISSING 

Distinct107
Distinct (%)98.2%
Missing92
Missing (%)45.8%
Memory size1.7 KiB
2023-12-11T02:08:49.416839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.045872
Min length12

Characters and Unicode

Total characters1313
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique105 ?
Unique (%)96.3%

Sample

1st row051-727-4879
2nd row051-727-7366
3rd row051-722-7091
4th row051-325-3318
5th row051-722-9321
ValueCountFrequency (%)
051-782-0961 2
 
1.8%
051-723-5570 2
 
1.8%
051-727-3083 1
 
0.9%
051-728-1469 1
 
0.9%
051-722-2000 1
 
0.9%
051-722-1760 1
 
0.9%
051-747-9405 1
 
0.9%
051-728-6543 1
 
0.9%
051-747-8511 1
 
0.9%
051-863-2040 1
 
0.9%
Other values (97) 97
89.0%
2023-12-11T02:08:49.985452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 218
16.6%
0 194
14.8%
1 178
13.6%
5 161
12.3%
7 149
11.3%
2 138
10.5%
3 67
 
5.1%
6 61
 
4.6%
8 58
 
4.4%
4 49
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1095
83.4%
Dash Punctuation 218
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 194
17.7%
1 178
16.3%
5 161
14.7%
7 149
13.6%
2 138
12.6%
3 67
 
6.1%
6 61
 
5.6%
8 58
 
5.3%
4 49
 
4.5%
9 40
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 218
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1313
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 218
16.6%
0 194
14.8%
1 178
13.6%
5 161
12.3%
7 149
11.3%
2 138
10.5%
3 67
 
5.1%
6 61
 
4.6%
8 58
 
4.4%
4 49
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1313
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 218
16.6%
0 194
14.8%
1 178
13.6%
5 161
12.3%
7 149
11.3%
2 138
10.5%
3 67
 
5.1%
6 61
 
4.6%
8 58
 
4.4%
4 49
 
3.7%

Missing values

2023-12-11T02:08:46.068732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:08:46.589588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명소재지(도로명)소재지전화
0해양식품부산광역시 기장군 장안읍 해맞이로 335051-727-4879
1기장특산물영어조합부산광역시 기장군 장안읍 오리길 96051-727-7366
2신앙촌소비조합주식회사부산광역시 기장군 기장읍 죽성로 197051-722-7091
3오양식품부산광역시 기장군 장안읍 반룡산단1로 20051-325-3318
4염씨네식품부산광역시 기장군 기장읍 대변3길 45051-722-9321
5신광식품부산광역시 기장군 장안읍 해맞이로 339051-727-2073
6기장바다부산광역시 기장군 일광읍 문오성길 695051-727-1441
7기장식품부산광역시 기장군 기장읍 기장해안로 593-6051-721-2151
8신영식품부산광역시 기장군 정관읍 산단7로 69051-782-0961
9구포종합식품부산광역시 기장군 정관읍 예림길 28-4051-727-7111
업소명소재지(도로명)소재지전화
191풀문하우스부산광역시 기장군 정관읍 정관1로 51, 102,103일부호<NA>
192오씨엔부산광역시 기장군 장안읍 장안산단4로 39, 2층<NA>
193(주)바이오포트코리아부산광역시 기장군 장안읍 반룡산단1로 36, A동 1~2층<NA>
194신카스테라부산광역시 기장군 기장읍 기장해안로 58<NA>
195KJ 두부연구소부산광역시 기장군 기장읍 기장해안로 160, 1층 일부호<NA>
196주식회사 모찌마찌부산광역시 기장군 일광읍 장곡길 42-8, 1동<NA>
197(주)순진종합식품 농업법인회사부산광역시 기장군 장안읍 기장대로 1626-4, A, B동<NA>
198도토리 로스터스(DOTTORI ROASTERS)부산광역시 기장군 장안읍 상장안1길 29-4, 2동 1층 일부호<NA>
199제이엔푸드부산광역시 기장군 정관읍 병산로 2, 1동 1층<NA>
200기원식품부산광역시 기장군 정관읍 달산1길 40, 101호<NA>