Overview

Dataset statistics

Number of variables: 5
Number of observations: 148
Missing cells: 61
Missing cells (%): 8.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.9 KiB
Average record size in memory: 40.9 B
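
The missing-cell percentage follows from the counts above: 5 variables times 148 observations gives 740 cells, of which 61 are missing. A minimal check in Python (the variable names are illustrative and not part of the report):

```python
# Figures copied from the dataset statistics above.
n_variables = 5
n_observations = 148
missing_cells = 61

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations

# Share of cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Rounded to one decimal place, the result agrees with the 8.2% shown in the report.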

Variable types

Categorical: 1
Text: 4

Dataset

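Description: 부산광역시_기장군_식품제조가공업현황_07/15/2021 (Busan Metropolitan City, Gijang-gun: status of food manufacturing and processing businesses, 07/15/2021)
Author: 부산광역시 기장군 (Gijang-gun, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15047917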

Alerts

업종 (business type) is highly imbalanced (78.7%) [Imbalance]
소재지전화번호 (location phone number) has 59 (39.9%) missing values [Missing]
식품의유형 (food type) has 2 (1.4%) missing values [Missing]
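
The report does not say how the 78.7% imbalance figure is derived. One formula that reproduces it from the class counts of 업종 (143 and 5) is 1 minus the Shannon entropy, normalised by log2 of the number of classes. The sketch below assumes that definition; `imbalance_score` is a hypothetical helper, not a ydata-profiling API.

```python
from math import log2


def imbalance_score(counts):
    """Return 1 - H / log2(k) for class counts (assumed definition).

    H is the Shannon entropy of the class distribution and k is the
    number of classes. 0 means perfectly balanced, 1 means one class only.
    """
    k = len(counts)
    if k < 2:
        # Assumption: a single class is reported as 0 rather than undefined.
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * log2(p) for p in probs)
    return 1 - entropy / log2(k)


# Class counts of 업종 taken from the report: 143 and 5.
score = imbalance_score([143, 5])
print(f"{100 * score:.1f}%")
```

A perfectly balanced split such as `[74, 74]` scores 0 under the same definition.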

Reproduction

Analysis started: 2023-12-10 17:08:58.777980
Analysis finished: 2023-12-10 17:09:00.090323
Duration: 1.31 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
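
The duration is simply the difference between the two timestamps. A short check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-10 17:08:58.777980")
finished = datetime.fromisoformat("2023-12-10 17:09:00.090323")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()

print(f"{duration:.2f} seconds")
```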

Variables

업종
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB
식품제조가공업: 143
식품첨가물제조업: 5

Length

Max length: 8
Median length: 7
Mean length: 7.0337838
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%
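
The summary figures for 업종 can be re-derived from the two class counts. In the sketch below, "distinct" is taken to mean the number of different values and "unique" the number of values that occur exactly once; that reading is an assumption, though it matches the numbers shown. The column is rebuilt from the counts rather than loaded from the source file.

```python
from collections import Counter
from statistics import mean, median

# Rebuild the column from the reported counts (143 and 5).
values = ["식품제조가공업"] * 143 + ["식품첨가물제조업"] * 5

counts = Counter(values)
distinct = len(counts)                                # different values
unique = sum(1 for c in counts.values() if c == 1)    # values seen once

# String lengths in code points.
lengths = [len(v) for v in values]

print(distinct, unique)
print(min(lengths), median(lengths), round(mean(lengths), 7), max(lengths))
```

Because both labels occur more than once, no value is unique even though two values are distinct.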

Sample

1st row: 식품제조가공업
2nd row: 식품제조가공업
3rd row: 식품제조가공업
4th row: 식품제조가공업
5th row: 식품제조가공업

Common Values

Value | Count | Frequency (%)
식품제조가공업 | 143 | 96.6%
식품첨가물제조업 | 5 | 3.4%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

업소명
Text

Distinct: 144
Distinct (%): 97.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 22
Median length: 14
Mean length: 7.0135135
Min length: 2

Characters and Unicode

Total characters: 1038
Distinct characters: 247
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
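
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one business name taken from the sample rows; the long category names are mapped by hand from the two-letter codes, and `category_counts` is a hypothetical helper. Script and block lookups are not available in `unicodedata`, so they are left out.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the labels used in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(text):
    """Count characters of `text` per Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


counts = category_counts("(주)엔씨시스템")
for name, count in counts.most_common():
    print(name, count)
```

Hangul syllables fall under Other Letter, which is why that category dominates the text variables in this report.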

Unique

Unique: 140
Unique (%): 94.6%

Sample

1st row: 해양식품
2nd row: 기장특산물영어조합
3rd row: 신앙촌소비조합주식회사
4th row: 염씨네식품
5th row: 신광식품

Value | Count | Frequency (%)
주)지이스트냉동 | 2 | 1.4%
주)엔제이에프앤비 | 2 | 1.4%
기장식품 | 2 | 1.4%
주)케이와이바이오 | 2 | 1.4%
티앤에스피플(주 | 1 | 0.7%
주)소풍메이드윤 | 1 | 0.7%
해양식품 | 1 | 0.7%
주)더젬코리아 | 1 | 0.7%
웨이브온커피 | 1 | 0.7%
수반식품 | 1 | 0.7%
Other values (134) | 134 | 90.5%

Most occurring characters

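Value | Count | Frequency (%)
[glyph missing] | 59 | 5.7%
( | 49 | 4.7%
) | 49 | 4.7%
[glyph missing] | 45 | 4.3%
[glyph missing] | 37 | 3.6%
[glyph missing] | 29 | 2.8%
[glyph missing] | 28 | 2.7%
[glyph missing] | 26 | 2.5%
[glyph missing] | 25 | 2.4%
[glyph missing] | 24 | 2.3%
Other values (237) | 667 | 64.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 881 | 84.9%
Open Punctuation | 49 | 4.7%
Close Punctuation | 49 | 4.7%
Uppercase Letter | 42 | 4.0%
Other Punctuation | 6 | 0.6%
Decimal Number | 6 | 0.6%
Lowercase Letter | 4 | 0.4%
Dash Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[glyph missing] | 59 | 6.7%
[glyph missing] | 45 | 5.1%
[glyph missing] | 37 | 4.2%
[glyph missing] | 29 | 3.3%
[glyph missing] | 28 | 3.2%
[glyph missing] | 26 | 3.0%
[glyph missing] | 25 | 2.8%
[glyph missing] | 24 | 2.7%
[glyph missing] | 21 | 2.4%
[glyph missing] | 20 | 2.3%
Other values (209) | 567 | 64.4%

Uppercase Letter
Value | Count | Frequency (%)
F | 5 | 11.9%
S | 5 | 11.9%
O | 5 | 11.9%
D | 4 | 9.5%
T | 4 | 9.5%
A | 3 | 7.1%
R | 2 | 4.8%
L | 2 | 4.8%
H | 2 | 4.8%
C | 2 | 4.8%
Other values (6) | 8 | 19.0%

Decimal Number
Value | Count | Frequency (%)
3 | 2 | 33.3%
9 | 2 | 33.3%
2 | 1 | 16.7%
1 | 1 | 16.7%

Lowercase Letter
Value | Count | Frequency (%)
o | 2 | 50.0%
d | 1 | 25.0%
f | 1 | 25.0%

Other Punctuation
Value | Count | Frequency (%)
& | 4 | 66.7%
. | 2 | 33.3%

Open Punctuation
Value | Count | Frequency (%)
( | 49 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 49 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 881 | 84.9%
Common | 111 | 10.7%
Latin | 46 | 4.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[glyph missing] | 59 | 6.7%
[glyph missing] | 45 | 5.1%
[glyph missing] | 37 | 4.2%
[glyph missing] | 29 | 3.3%
[glyph missing] | 28 | 3.2%
[glyph missing] | 26 | 3.0%
[glyph missing] | 25 | 2.8%
[glyph missing] | 24 | 2.7%
[glyph missing] | 21 | 2.4%
[glyph missing] | 20 | 2.3%
Other values (209) | 567 | 64.4%

Latin
Value | Count | Frequency (%)
F | 5 | 10.9%
S | 5 | 10.9%
O | 5 | 10.9%
D | 4 | 8.7%
T | 4 | 8.7%
A | 3 | 6.5%
o | 2 | 4.3%
R | 2 | 4.3%
L | 2 | 4.3%
H | 2 | 4.3%
Other values (9) | 12 | 26.1%

Common
Value | Count | Frequency (%)
( | 49 | 44.1%
) | 49 | 44.1%
& | 4 | 3.6%
3 | 2 | 1.8%
9 | 2 | 1.8%
. | 2 | 1.8%
2 | 1 | 0.9%
- | 1 | 0.9%
1 | 1 | 0.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 881 | 84.9%
ASCII | 157 | 15.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
[glyph missing] | 59 | 6.7%
[glyph missing] | 45 | 5.1%
[glyph missing] | 37 | 4.2%
[glyph missing] | 29 | 3.3%
[glyph missing] | 28 | 3.2%
[glyph missing] | 26 | 3.0%
[glyph missing] | 25 | 2.8%
[glyph missing] | 24 | 2.7%
[glyph missing] | 21 | 2.4%
[glyph missing] | 20 | 2.3%
Other values (209) | 567 | 64.4%

ASCII
Value | Count | Frequency (%)
( | 49 | 31.2%
) | 49 | 31.2%
F | 5 | 3.2%
S | 5 | 3.2%
O | 5 | 3.2%
D | 4 | 2.5%
& | 4 | 2.5%
T | 4 | 2.5%
A | 3 | 1.9%
o | 2 | 1.3%
Other values (18) | 27 | 17.2%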

소재지전화번호
Text

MISSING 

Distinct: 87
Distinct (%): 97.8%
Missing: 59
Missing (%): 39.9%
Memory size: 1.3 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 1068
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 85
Unique (%): 95.5%

Sample

1st row: 051-727-4879
2nd row: 051-727-7366
3rd row: 051-722-7091
4th row: 051-722-9321
5th row: 051-727-2073

Value | Count | Frequency (%)
051-782-0961 | 2 | 2.2%
051-723-5570 | 2 | 2.2%
052-911-7979 | 1 | 1.1%
051-727-6788 | 1 | 1.1%
051-514-6067 | 1 | 1.1%
051-722-2117 | 1 | 1.1%
051-780-7350 | 1 | 1.1%
051-727-6906 | 1 | 1.1%
051-524-0689 | 1 | 1.1%
051-723-7766 | 1 | 1.1%
Other values (77) | 77 | 86.5%
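
The statistics for 소재지전화번호 suggest a single fixed format: every value has length 12 and only 11 distinct characters occur (ten digits and the dash). A minimal sketch of a format check on the five sample rows, assuming the pattern is three digits, three digits, four digits; the regular expression is an inference from the sample, not something the report states.

```python
import re

# Assumed pattern: NNN-NNN-NNNN (12 characters in total).
PHONE = re.compile(r"\d{3}-\d{3}-\d{4}")

# The five sample rows shown in the report.
sample = [
    "051-727-4879",
    "051-727-7366",
    "051-722-7091",
    "051-722-9321",
    "051-727-2073",
]

all_match = all(PHONE.fullmatch(s) is not None for s in sample)

# Cross-check against the reported totals: 148 rows, 59 of them missing.
non_missing = 148 - 59
total_characters = non_missing * 12   # 12 characters per value
total_dashes = non_missing * 2        # two dashes per value

print(all_match, non_missing, total_characters, total_dashes)
```

The derived totals agree with the report: 1068 characters overall, of which 178 are dashes.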

Most occurring characters

Value | Count | Frequency (%)
- | 178 | 16.7%
0 | 151 | 14.1%
1 | 144 | 13.5%
5 | 129 | 12.1%
7 | 124 | 11.6%
2 | 121 | 11.3%
3 | 55 | 5.1%
6 | 51 | 4.8%
8 | 46 | 4.3%
9 | 35 | 3.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 890 | 83.3%
Dash Punctuation | 178 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 151 | 17.0%
1 | 144 | 16.2%
5 | 129 | 14.5%
7 | 124 | 13.9%
2 | 121 | 13.6%
3 | 55 | 6.2%
6 | 51 | 5.7%
8 | 46 | 5.2%
9 | 35 | 3.9%
4 | 34 | 3.8%

Dash Punctuation
Value | Count | Frequency (%)
- | 178 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1068 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 178 | 16.7%
0 | 151 | 14.1%
1 | 144 | 13.5%
5 | 129 | 12.1%
7 | 124 | 11.6%
2 | 121 | 11.3%
3 | 55 | 5.1%
6 | 51 | 4.8%
8 | 46 | 4.3%
9 | 35 | 3.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1068 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 178 | 16.7%
0 | 151 | 14.1%
1 | 144 | 13.5%
5 | 129 | 12.1%
7 | 124 | 11.6%
2 | 121 | 11.3%
3 | 55 | 5.1%
6 | 51 | 4.8%
8 | 46 | 4.3%
9 | 35 | 3.3%

소재지(도로명)
Text

Distinct: 145
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 44
Median length: 33
Mean length: 22.006757
Min length: 16

Characters and Unicode

Total characters: 3257
Distinct characters: 133
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 142
Unique (%): 95.9%

Sample

1st row: 부산광역시기장군장안읍해맞이로335
2nd row: 부산광역시기장군장안읍오리길96
3rd row: 부산광역시기장군기장읍죽성로197
4th row: 부산광역시기장군기장읍대변3길45
5th row: 부산광역시기장군장안읍해맞이로339

Value | Count | Frequency (%)
부산광역시기장군기장읍죽성로197 | 2 | 1.4%
부산광역시기장군일광면횡계길7,해양생물산업육성센터a동103호 | 2 | 1.4%
부산광역시기장군정관읍산단7로69 | 2 | 1.4%
부산광역시기장군정관읍예림1로56-1,1층 | 1 | 0.7%
부산광역시기장군기장읍차성로190번길11,1층 | 1 | 0.7%
부산광역시기장군기장읍대청로22번길30 | 1 | 0.7%
부산광역시기장군장안읍해맞이로335 | 1 | 0.7%
부산광역시기장군기장읍기장해안로108,에이원오션시티506호 | 1 | 0.7%
부산광역시기장군기장읍청강로85번길56 | 1 | 0.7%
부산광역시기장군기장읍당사로3길16,1층 | 1 | 0.7%
Other values (135) | 135 | 91.2%

Most occurring characters

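Value | Count | Frequency (%)
[glyph missing] | 245 | 7.5%
[glyph missing] | 221 | 6.8%
1 | 198 | 6.1%
[glyph missing] | 184 | 5.6%
[glyph missing] | 168 | 5.2%
[glyph missing] | 152 | 4.7%
[glyph missing] | 149 | 4.6%
[glyph missing] | 148 | 4.5%
[glyph missing] | 148 | 4.5%
[glyph missing] | 127 | 3.9%
Other values (123) | 1517 | 46.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2420 | 74.3%
Decimal Number | 653 | 20.0%
Other Punctuation | 103 | 3.2%
Dash Punctuation | 39 | 1.2%
Uppercase Letter | 14 | 0.4%
Open Punctuation | 10 | 0.3%
Close Punctuation | 10 | 0.3%
Math Symbol | 8 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[glyph missing] | 245 | 10.1%
[glyph missing] | 221 | 9.1%
[glyph missing] | 184 | 7.6%
[glyph missing] | 168 | 6.9%
[glyph missing] | 152 | 6.3%
[glyph missing] | 149 | 6.2%
[glyph missing] | 148 | 6.1%
[glyph missing] | 148 | 6.1%
[glyph missing] | 127 | 5.2%
[glyph missing] | 99 | 4.1%
Other values (105) | 779 | 32.2%

Decimal Number
Value | Count | Frequency (%)
1 | 198 | 30.3%
2 | 79 | 12.1%
6 | 76 | 11.6%
3 | 63 | 9.6%
5 | 56 | 8.6%
4 | 47 | 7.2%
0 | 45 | 6.9%
8 | 31 | 4.7%
9 | 29 | 4.4%
7 | 29 | 4.4%

Uppercase Letter
Value | Count | Frequency (%)
A | 9 | 64.3%
B | 4 | 28.6%
C | 1 | 7.1%

Other Punctuation
Value | Count | Frequency (%)
, | 103 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 39 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 10 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 10 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 8 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2420 | 74.3%
Common | 823 | 25.3%
Latin | 14 | 0.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[glyph missing] | 245 | 10.1%
[glyph missing] | 221 | 9.1%
[glyph missing] | 184 | 7.6%
[glyph missing] | 168 | 6.9%
[glyph missing] | 152 | 6.3%
[glyph missing] | 149 | 6.2%
[glyph missing] | 148 | 6.1%
[glyph missing] | 148 | 6.1%
[glyph missing] | 127 | 5.2%
[glyph missing] | 99 | 4.1%
Other values (105) | 779 | 32.2%

Common
Value | Count | Frequency (%)
1 | 198 | 24.1%
, | 103 | 12.5%
2 | 79 | 9.6%
6 | 76 | 9.2%
3 | 63 | 7.7%
5 | 56 | 6.8%
4 | 47 | 5.7%
0 | 45 | 5.5%
- | 39 | 4.7%
8 | 31 | 3.8%
Other values (5) | 86 | 10.4%

Latin
Value | Count | Frequency (%)
A | 9 | 64.3%
B | 4 | 28.6%
C | 1 | 7.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2420 | 74.3%
ASCII | 837 | 25.7%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
[glyph missing] | 245 | 10.1%
[glyph missing] | 221 | 9.1%
[glyph missing] | 184 | 7.6%
[glyph missing] | 168 | 6.9%
[glyph missing] | 152 | 6.3%
[glyph missing] | 149 | 6.2%
[glyph missing] | 148 | 6.1%
[glyph missing] | 148 | 6.1%
[glyph missing] | 127 | 5.2%
[glyph missing] | 99 | 4.1%
Other values (105) | 779 | 32.2%

ASCII
Value | Count | Frequency (%)
1 | 198 | 23.7%
, | 103 | 12.3%
2 | 79 | 9.4%
6 | 76 | 9.1%
3 | 63 | 7.5%
5 | 56 | 6.7%
4 | 47 | 5.6%
0 | 45 | 5.4%
- | 39 | 4.7%
8 | 31 | 3.7%
Other values (8) | 100 | 11.9%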

식품의유형
Text

MISSING 

Distinct: 90
Distinct (%): 61.6%
Missing: 2
Missing (%): 1.4%
Memory size: 1.3 KiB

Length

Max length: 83
Median length: 48
Mean length: 10.328767
Min length: 2

Characters and Unicode

Total characters: 1508
Distinct characters: 140
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 75
Unique (%): 51.4%

Sample

1st row: 젓갈,액젓
2nd row: 젓갈,액젓,조미액젓
3rd row: 두부,두류가공품
4th row: 젓갈,액젓
5th row: 젓갈,액젓

Value | Count | Frequency (%)
커피 | 13 | 8.9%
젓갈,액젓 | 12 | 8.2%
젓갈 | 10 | 6.8%
기타수산물가공품 | 9 | 6.2%
소스,절임식품 | 4 | 2.7%
소스 | 4 | 2.7%
기타가공품 | 3 | 2.1%
김치,김칫속,절임식품 | 2 | 1.4%
떡류 | 2 | 1.4%
식육함유가공품,즉석조리식품 | 2 | 1.4%
Other values (80) | 85 | 58.2%

Most occurring characters

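Value | Count | Frequency (%)
, | 177 | 11.7%
[glyph missing] | 122 | 8.1%
[glyph missing] | 90 | 6.0%
[glyph missing] | 89 | 5.9%
[glyph missing] | 55 | 3.6%
[glyph missing] | 54 | 3.6%
[glyph missing] | 51 | 3.4%
[glyph missing] | 43 | 2.9%
[glyph missing] | 39 | 2.6%
[glyph missing] | 38 | 2.5%
Other values (130) | 750 | 49.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1313 | 87.1%
Other Punctuation | 189 | 12.5%
Open Punctuation | 3 | 0.2%
Close Punctuation | 3 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[glyph missing] | 122 | 9.3%
[glyph missing] | 90 | 6.9%
[glyph missing] | 89 | 6.8%
[glyph missing] | 55 | 4.2%
[glyph missing] | 54 | 4.1%
[glyph missing] | 51 | 3.9%
[glyph missing] | 43 | 3.3%
[glyph missing] | 39 | 3.0%
[glyph missing] | 38 | 2.9%
[glyph missing] | 36 | 2.7%
Other values (126) | 696 | 53.0%

Other Punctuation
Value | Count | Frequency (%)
, | 177 | 93.7%
. | 12 | 6.3%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1313 | 87.1%
Common | 195 | 12.9%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[glyph missing] | 122 | 9.3%
[glyph missing] | 90 | 6.9%
[glyph missing] | 89 | 6.8%
[glyph missing] | 55 | 4.2%
[glyph missing] | 54 | 4.1%
[glyph missing] | 51 | 3.9%
[glyph missing] | 43 | 3.3%
[glyph missing] | 39 | 3.0%
[glyph missing] | 38 | 2.9%
[glyph missing] | 36 | 2.7%
Other values (126) | 696 | 53.0%

Common
Value | Count | Frequency (%)
, | 177 | 90.8%
. | 12 | 6.2%
( | 3 | 1.5%
) | 3 | 1.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1313 | 87.1%
ASCII | 195 | 12.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
, | 177 | 90.8%
. | 12 | 6.2%
( | 3 | 1.5%
) | 3 | 1.5%

Hangul
Value | Count | Frequency (%)
[glyph missing] | 122 | 9.3%
[glyph missing] | 90 | 6.9%
[glyph missing] | 89 | 6.8%
[glyph missing] | 55 | 4.2%
[glyph missing] | 54 | 4.1%
[glyph missing] | 51 | 3.9%
[glyph missing] | 43 | 3.3%
[glyph missing] | 39 | 3.0%
[glyph missing] | 38 | 2.9%
[glyph missing] | 36 | 2.7%
Other values (126) | 696 | 53.0%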

Correlations

[Figure: correlation heatmap]

Variable | 업종 | 소재지전화번호 | 식품의유형
업종 | 1.000 | 1.000 | 1.000
소재지전화번호 | 1.000 | 1.000 | 0.990
식품의유형 | 1.000 | 0.990 | 1.000
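
The report does not name the association measure behind this matrix. For pairs of categorical columns a common choice is Cramér's V, and the sketch below assumes that measure in its plain, uncorrected form; `cramers_v` is a hypothetical helper, not the report's own code. It shows why coefficients near 1.000 should be read with caution here: when almost every value in a column is distinct, V approaches 1 regardless of any real relationship.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equally long sequences of labels.

    Pairs in which either value is None are skipped.
    """
    pairs = [(x, y) for x, y in zip(xs, ys) if x is not None and y is not None]
    n = len(pairs)
    row_totals = Counter(x for x, _ in pairs)
    col_totals = Counter(y for _, y in pairs)
    observed = Counter(pairs)

    # Degrees of freedom used for normalisation: min(rows, cols) - 1.
    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k <= 0:
        return float("nan")

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            chi2 += (observed.get((r, c), 0) - expected) ** 2 / expected
    return sqrt(chi2 / (n * k))


# Two unrelated columns in which every value is distinct still give V = 1.
ids_a = [f"a{i}" for i in range(20)]
ids_b = [f"b{i}" for i in range(20)]
print(cramers_v(ids_a, ids_b))

# Two independent binary columns give V = 0.
print(cramers_v(["x", "x", "y", "y"], [1, 2, 1, 2]))
```

With 87 distinct phone numbers among 89 non-missing rows and 90 distinct food types among 146, the text columns in this dataset are close to that degenerate case.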

Missing values

[Figure: count] A simple visualization of nullity by column.
[Figure: matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
[Figure: heatmap] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
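
Nullity correlation can be illustrated with the 20 rows shown in the Sample section. The sketch below encodes, for each row, whether 소재지전화번호 and 식품의유형 are missing (1) or present (0), then computes the Pearson correlation of the two indicator columns. It covers only the visible rows, so the result is illustrative and is not the value plotted in the heatmap; `pearson` is a hypothetical helper.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equally long numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# 1 = <NA> in the Sample tables, 0 = value present.
# Rows 0-9 followed by rows 138-147.
phone_missing = [0, 0, 0, 0, 0, 1, 0, 0, 0, 0,
                 0, 1, 1, 1, 1, 0, 1, 1, 1, 1]
food_type_missing = [0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
                     0, 0, 0, 0, 1, 0, 0, 0, 0, 0]

nullity_corr = pearson(phone_missing, food_type_missing)
print(round(nullity_corr, 3))
```

A positive value means the two columns tend to be missing in the same rows; here the only row with a missing food type (row 142) also lacks a phone number.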

Sample

First rows

Index | 업종 | 업소명 | 소재지전화번호 | 소재지(도로명) | 식품의유형
0 | 식품제조가공업 | 해양식품 | 051-727-4879 | 부산광역시기장군장안읍해맞이로335 | 젓갈,액젓
1 | 식품제조가공업 | 기장특산물영어조합 | 051-727-7366 | 부산광역시기장군장안읍오리길96 | 젓갈,액젓,조미액젓
2 | 식품제조가공업 | 신앙촌소비조합주식회사 | 051-722-7091 | 부산광역시기장군기장읍죽성로197 | 두부,두류가공품
3 | 식품제조가공업 | 염씨네식품 | 051-722-9321 | 부산광역시기장군기장읍대변3길45 | 젓갈,액젓
4 | 식품제조가공업 | 신광식품 | 051-727-2073 | 부산광역시기장군장안읍해맞이로339 | 젓갈,액젓
5 | 식품제조가공업 | 기장바다 | <NA> | 부산광역시기장군일광면문오성길695 | 기타수산물가공품
6 | 식품제조가공업 | 기장식품 | 051-721-2151 | 부산광역시기장군기장읍기장해안로593-6 | 젓갈,액젓
7 | 식품제조가공업 | 신영식품 | 051-782-0961 | 부산광역시기장군정관읍산단7로69 | 조미김,기타수산물가공품
8 | 식품제조가공업 | 구포종합식품 | 051-727-7111 | 부산광역시기장군정관읍예림길28-4 | 생면,건면
9 | 식품제조가공업 | 미진자연식품 | 051-722-2301 | 부산광역시기장군기장읍청강로91번길19,2층 | 기타수산물가공품

Last rows

Index | 업종 | 업소명 | 소재지전화번호 | 소재지(도로명) | 식품의유형
138 | 식품제조가공업 | 재이에스푸드 | 051-723-7273 | 부산광역시기장군기장읍차성로413,1층 | 소스,절임식품
139 | 식품제조가공업 | 해림FNC | <NA> | 부산광역시기장군정관읍산단5로76-62,2층 | 빵류,액상차
140 | 식품제조가공업 | 다식푸드 | <NA> | 부산광역시기장군정관읍곰내길626-15,1층 | 식육함유가공품,즉석조리식품
141 | 식품제조가공업 | 동광푸드 | <NA> | 부산광역시기장군정관읍곰내길654-60,1층 | 소스,김치,조림류,식육함유가공품,기타수산물가공품
142 | 식품제조가공업 | 파로스로스터리 | <NA> | 부산광역시기장군정관읍곰내길640 | <NA>
143 | 식품첨가물제조업 | (주)엔씨시스템 | 051-527-5852 | 부산광역시기장군정관읍예림1로56-1,1층 | 몰포린지방산염,혼합제제,혼합제제
144 | 식품첨가물제조업 | 와이엠생활 | <NA> | 부산광역시기장군정관읍정관상곡2길25-21,2동 | 차아염소산나트륨
145 | 식품첨가물제조업 | (주)케이와이바이오 | <NA> | 부산광역시기장군기장읍청강로85번길56,1층 | 혼합제제
146 | 식품첨가물제조업 | (주)엔제이에프앤비 | <NA> | 부산광역시기장군일광면횡계길7,해양생물산업육성센터A동103호 | 혼합제제
147 | 식품첨가물제조업 | (주)플러크 | <NA> | 부산광역시기장군정관읍방곡5로32-5,1층 | 차아염소산수,차아염소산수