Overview

Dataset statistics

Number of variables4
Number of observations225
Missing cells51
Missing cells (%)5.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.2 KiB
Average record size in memory32.6 B

Variable types

Text4

Dataset

Description부산광역시_기장군_즉석식품제조가공업_20190516
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15047916

Alerts

소재지전화번호 has 51 (22.7%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:35:32.687896
Analysis finished2023-12-10 16:35:33.242107
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct223
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T01:35:33.501956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length17
Mean length6.3466667
Min length2

Characters and Unicode

Total characters1428
Distinct characters335
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique221 ?
Unique (%)98.2%

Sample

1st row경주상회
2nd row송정기름집
3rd row안동참기름
4th row기장상회
5th row일광참기름상회
ValueCountFrequency (%)
주식회사 5
 
2.0%
휴마트 2
 
0.8%
장안울산휴게소 2
 
0.8%
장안부산휴게소 2
 
0.8%
2
 
0.8%
정관점 2
 
0.8%
주)에이치앤디이 2
 
0.8%
롯데쇼핑(주)롯데마트동부산점 2
 
0.8%
채움푸드 1
 
0.4%
보배홈메이드 1
 
0.4%
Other values (234) 234
91.8%
2023-12-11T01:35:34.062068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30
 
2.1%
30
 
2.1%
29
 
2.0%
29
 
2.0%
28
 
2.0%
28
 
2.0%
28
 
2.0%
27
 
1.9%
25
 
1.8%
( 24
 
1.7%
Other values (325) 1150
80.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1315
92.1%
Space Separator 30
 
2.1%
Open Punctuation 24
 
1.7%
Close Punctuation 24
 
1.7%
Lowercase Letter 22
 
1.5%
Decimal Number 6
 
0.4%
Uppercase Letter 4
 
0.3%
Other Punctuation 2
 
0.1%
Letter Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
2.3%
29
 
2.2%
29
 
2.2%
28
 
2.1%
28
 
2.1%
28
 
2.1%
27
 
2.1%
25
 
1.9%
23
 
1.7%
23
 
1.7%
Other values (298) 1045
79.5%
Lowercase Letter
ValueCountFrequency (%)
e 5
22.7%
m 3
13.6%
c 2
 
9.1%
y 2
 
9.1%
o 2
 
9.1%
a 1
 
4.5%
k 1
 
4.5%
b 1
 
4.5%
i 1
 
4.5%
s 1
 
4.5%
Other values (3) 3
13.6%
Decimal Number
ValueCountFrequency (%)
1 2
33.3%
0 1
16.7%
4 1
16.7%
2 1
16.7%
3 1
16.7%
Uppercase Letter
ValueCountFrequency (%)
M 1
25.0%
L 1
25.0%
S 1
25.0%
R 1
25.0%
Space Separator
ValueCountFrequency (%)
30
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Other Punctuation
ValueCountFrequency (%)
& 2
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1315
92.1%
Common 86
 
6.0%
Latin 27
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
2.3%
29
 
2.2%
29
 
2.2%
28
 
2.1%
28
 
2.1%
28
 
2.1%
27
 
2.1%
25
 
1.9%
23
 
1.7%
23
 
1.7%
Other values (298) 1045
79.5%
Latin
ValueCountFrequency (%)
e 5
18.5%
m 3
 
11.1%
c 2
 
7.4%
y 2
 
7.4%
o 2
 
7.4%
M 1
 
3.7%
L 1
 
3.7%
S 1
 
3.7%
a 1
 
3.7%
k 1
 
3.7%
Other values (8) 8
29.6%
Common
ValueCountFrequency (%)
30
34.9%
( 24
27.9%
) 24
27.9%
& 2
 
2.3%
1 2
 
2.3%
0 1
 
1.2%
4 1
 
1.2%
2 1
 
1.2%
3 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1315
92.1%
ASCII 112
 
7.8%
Number Forms 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30
26.8%
( 24
21.4%
) 24
21.4%
e 5
 
4.5%
m 3
 
2.7%
c 2
 
1.8%
& 2
 
1.8%
y 2
 
1.8%
o 2
 
1.8%
1 2
 
1.8%
Other values (16) 16
14.3%
Hangul
ValueCountFrequency (%)
30
 
2.3%
29
 
2.2%
29
 
2.2%
28
 
2.1%
28
 
2.1%
28
 
2.1%
27
 
2.1%
25
 
1.9%
23
 
1.7%
23
 
1.7%
Other values (298) 1045
79.5%
Number Forms
ValueCountFrequency (%)
1
100.0%

소재지전화번호
Text

MISSING 

Distinct169
Distinct (%)97.1%
Missing51
Missing (%)22.7%
Memory size1.9 KiB
2023-12-11T01:35:34.363176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.982759
Min length9

Characters and Unicode

Total characters2085
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique165 ?
Unique (%)94.8%

Sample

1st row051-721-2179
2nd row051-515-1796
3rd row051-721-2223
4th row051-721-1612
5th row051-727-0548
ValueCountFrequency (%)
051-727-3714 3
 
1.7%
051-519-8224 2
 
1.1%
051-922-2500 2
 
1.1%
051-728-4160 2
 
1.1%
051-722-0038 1
 
0.6%
051-728-3639 1
 
0.6%
051-721-3688 1
 
0.6%
051-721-5687 1
 
0.6%
051-721-2131 1
 
0.6%
051-721-9701 1
 
0.6%
Other values (159) 159
91.4%
2023-12-11T01:35:34.817875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 347
16.6%
7 282
13.5%
2 278
13.3%
1 275
13.2%
0 264
12.7%
5 253
12.1%
8 105
 
5.0%
3 84
 
4.0%
4 72
 
3.5%
6 64
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1738
83.4%
Dash Punctuation 347
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
7 282
16.2%
2 278
16.0%
1 275
15.8%
0 264
15.2%
5 253
14.6%
8 105
 
6.0%
3 84
 
4.8%
4 72
 
4.1%
6 64
 
3.7%
9 61
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 347
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2085
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 347
16.6%
7 282
13.5%
2 278
13.3%
1 275
13.2%
0 264
12.7%
5 253
12.1%
8 105
 
5.0%
3 84
 
4.0%
4 72
 
3.5%
6 64
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2085
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 347
16.6%
7 282
13.5%
2 278
13.3%
1 275
13.2%
0 264
12.7%
5 253
12.1%
8 105
 
5.0%
3 84
 
4.0%
4 72
 
3.5%
6 64
 
3.1%
Distinct217
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T01:35:35.160986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length43
Mean length29.257778
Min length19

Characters and Unicode

Total characters6583
Distinct characters173
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique210 ?
Unique (%)93.3%

Sample

1st row부산광역시 기장군 기장읍 대라리 64-6
2nd row부산광역시 기장군 철마면 송정리 568
3rd row부산광역시 기장군 정관읍 정관1로 18, 123동 B-103호 (이지 더원1차 아파트)
4th row부산광역시 기장군 기장읍 읍내로104번길 19
5th row부산광역시 기장군 일광면 일광로 128
ValueCountFrequency (%)
부산광역시 225
 
16.1%
기장군 225
 
16.1%
기장읍 94
 
6.7%
정관읍 88
 
6.3%
1층 73
 
5.2%
장안읍 25
 
1.8%
기장해안로 18
 
1.3%
일광면 12
 
0.9%
정관6로 11
 
0.8%
정관로 10
 
0.7%
Other values (347) 615
44.1%
2023-12-11T01:35:35.714797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1176
 
17.9%
379
 
5.8%
349
 
5.3%
1 312
 
4.7%
248
 
3.8%
244
 
3.7%
237
 
3.6%
227
 
3.4%
225
 
3.4%
225
 
3.4%
Other values (163) 2961
45.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3972
60.3%
Space Separator 1176
 
17.9%
Decimal Number 1088
 
16.5%
Other Punctuation 135
 
2.1%
Close Punctuation 65
 
1.0%
Open Punctuation 65
 
1.0%
Dash Punctuation 46
 
0.7%
Uppercase Letter 31
 
0.5%
Lowercase Letter 4
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
379
 
9.5%
349
 
8.8%
248
 
6.2%
244
 
6.1%
237
 
6.0%
227
 
5.7%
225
 
5.7%
225
 
5.7%
221
 
5.6%
191
 
4.8%
Other values (136) 1426
35.9%
Decimal Number
ValueCountFrequency (%)
1 312
28.7%
2 126
11.6%
0 110
 
10.1%
3 110
 
10.1%
4 109
 
10.0%
5 96
 
8.8%
6 81
 
7.4%
7 55
 
5.1%
8 48
 
4.4%
9 41
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
B 16
51.6%
A 6
 
19.4%
D 3
 
9.7%
C 2
 
6.5%
L 1
 
3.2%
H 1
 
3.2%
K 1
 
3.2%
P 1
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
a 2
50.0%
l 1
25.0%
z 1
25.0%
Space Separator
ValueCountFrequency (%)
1176
100.0%
Other Punctuation
ValueCountFrequency (%)
, 135
100.0%
Close Punctuation
ValueCountFrequency (%)
) 65
100.0%
Open Punctuation
ValueCountFrequency (%)
( 65
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 46
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3972
60.3%
Common 2576
39.1%
Latin 35
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
379
 
9.5%
349
 
8.8%
248
 
6.2%
244
 
6.1%
237
 
6.0%
227
 
5.7%
225
 
5.7%
225
 
5.7%
221
 
5.6%
191
 
4.8%
Other values (136) 1426
35.9%
Common
ValueCountFrequency (%)
1176
45.7%
1 312
 
12.1%
, 135
 
5.2%
2 126
 
4.9%
0 110
 
4.3%
3 110
 
4.3%
4 109
 
4.2%
5 96
 
3.7%
6 81
 
3.1%
) 65
 
2.5%
Other values (6) 256
 
9.9%
Latin
ValueCountFrequency (%)
B 16
45.7%
A 6
 
17.1%
D 3
 
8.6%
C 2
 
5.7%
a 2
 
5.7%
L 1
 
2.9%
H 1
 
2.9%
K 1
 
2.9%
P 1
 
2.9%
l 1
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3972
60.3%
ASCII 2611
39.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1176
45.0%
1 312
 
11.9%
, 135
 
5.2%
2 126
 
4.8%
0 110
 
4.2%
3 110
 
4.2%
4 109
 
4.2%
5 96
 
3.7%
6 81
 
3.1%
) 65
 
2.5%
Other values (17) 291
 
11.1%
Hangul
ValueCountFrequency (%)
379
 
9.5%
349
 
8.8%
248
 
6.2%
244
 
6.1%
237
 
6.0%
227
 
5.7%
225
 
5.7%
225
 
5.7%
221
 
5.6%
191
 
4.8%
Other values (136) 1426
35.9%
Distinct139
Distinct (%)61.8%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T01:35:35.987335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length155
Median length72
Mean length25.24
Min length4

Characters and Unicode

Total characters5679
Distinct characters150
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique107 ?
Unique (%)47.6%

Sample

1st row 식용유지류(압착식으로 착유하는 전품목), 조미식품(고추가루또는실고추), 조미식품(천연향신료)
2nd row 식용유지류(압착식으로 착유하는 전품목)
3rd row 참기름, 들기름, 고춧가루
4th row 식용유지류(압착식으로 착유하는 전품목), 조미식품(고추가루또는실고추), 조미식품(천연향신료)
5th row 식용유지류(압착식으로 착유하는 전품목)
ValueCountFrequency (%)
즉석조리식품 46
 
6.1%
즉석섭취식품 44
 
5.8%
양념젓갈 31
 
4.1%
떡류 22
 
2.9%
기타김치 22
 
2.9%
농산물조림 20
 
2.6%
배추김치 20
 
2.6%
과자류(떡류 20
 
2.6%
액상차 19
 
2.5%
조림류 19
 
2.5%
Other values (106) 496
65.3%
2023-12-11T01:35:36.496010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1456
25.6%
, 476
 
8.4%
294
 
5.2%
251
 
4.4%
195
 
3.4%
148
 
2.6%
117
 
2.1%
101
 
1.8%
( 91
 
1.6%
) 91
 
1.6%
Other values (140) 2459
43.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3545
62.4%
Space Separator 1456
25.6%
Other Punctuation 496
 
8.7%
Open Punctuation 91
 
1.6%
Close Punctuation 91
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
294
 
8.3%
251
 
7.1%
195
 
5.5%
148
 
4.2%
117
 
3.3%
101
 
2.8%
90
 
2.5%
90
 
2.5%
74
 
2.1%
74
 
2.1%
Other values (134) 2111
59.5%
Other Punctuation
ValueCountFrequency (%)
, 476
96.0%
. 16
 
3.2%
· 4
 
0.8%
Space Separator
ValueCountFrequency (%)
1456
100.0%
Open Punctuation
ValueCountFrequency (%)
( 91
100.0%
Close Punctuation
ValueCountFrequency (%)
) 91
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3545
62.4%
Common 2134
37.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
294
 
8.3%
251
 
7.1%
195
 
5.5%
148
 
4.2%
117
 
3.3%
101
 
2.8%
90
 
2.5%
90
 
2.5%
74
 
2.1%
74
 
2.1%
Other values (134) 2111
59.5%
Common
ValueCountFrequency (%)
1456
68.2%
, 476
 
22.3%
( 91
 
4.3%
) 91
 
4.3%
. 16
 
0.7%
· 4
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3545
62.4%
ASCII 2130
37.5%
None 4
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1456
68.4%
, 476
 
22.3%
( 91
 
4.3%
) 91
 
4.3%
. 16
 
0.8%
Hangul
ValueCountFrequency (%)
294
 
8.3%
251
 
7.1%
195
 
5.5%
148
 
4.2%
117
 
3.3%
101
 
2.8%
90
 
2.5%
90
 
2.5%
74
 
2.1%
74
 
2.1%
Other values (134) 2111
59.5%
None
ValueCountFrequency (%)
· 4
100.0%

Missing values

2023-12-11T01:35:33.086137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:35:33.195367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명소재지전화번호소재지(도로명)식품의종류
0경주상회051-721-2179부산광역시 기장군 기장읍 대라리 64-6식용유지류(압착식으로 착유하는 전품목), 조미식품(고추가루또는실고추), 조미식품(천연향신료)
1송정기름집<NA>부산광역시 기장군 철마면 송정리 568식용유지류(압착식으로 착유하는 전품목)
2안동참기름051-515-1796부산광역시 기장군 정관읍 정관1로 18, 123동 B-103호 (이지 더원1차 아파트)참기름, 들기름, 고춧가루
3기장상회051-721-2223부산광역시 기장군 기장읍 읍내로104번길 19식용유지류(압착식으로 착유하는 전품목), 조미식품(고추가루또는실고추), 조미식품(천연향신료)
4일광참기름상회<NA>부산광역시 기장군 일광면 일광로 128식용유지류(압착식으로 착유하는 전품목)
5안동기름집<NA>부산광역시 기장군 일광면 기장해안로 1291식용유지류(압착식으로 착유하는 전품목)
6하서떡방앗간051-721-1612부산광역시 기장군 기장읍 차성남로65번길 4과자류(떡류)
7칠암제분업051-727-0548부산광역시 기장군 일광면 일광로 646-1과자류(떡류)
8송정떡방앗간051-508-4422부산광역시 기장군 철마면 여락송정로 334-16, 1층과자류(떡류)
9풍년상회051-721-2022부산광역시 기장군 기장읍 차성로287번길 10, 1층식용유지류(압착식으로 착유하는 전품목)
업소명소재지전화번호소재지(도로명)식품의종류
215주식회사 다정한진<NA>부산광역시 기장군 정관읍 모전1길 52, 1층과자, 액상차, 당절임
216염씨네젓갈건어물051-722-9321부산광역시 기장군 기장읍 기장해안로 609젓갈, 양념젓갈, 조미건어포
217두부본가어묵(기장점)<NA>부산광역시 기장군 일광면 기장대로 673, 메가마트 1층어묵
218에코해마루051-720-0000부산광역시 기장군 정관읍 모전로 23액상차
219엘림집반찬051-727-5643부산광역시 기장군 정관읍 구연2로 10, 1층김치, 절임식품, 조림류, 식육함유가공품, 양념젓갈, 즉석섭취식품, 즉석조리식품
220풀리페의맛있는반찬<NA>부산광역시 기장군 정관읍 정관5로 114, 1층빙과, 혼합음료, 김치, 절임식품, 조림류, 식육함유가공품, 즉석섭취식품, 즉석조리식품
221대궐푸드(주)051-727-6788부산광역시 기장군 정관읍 용수공단2길 30, 1층만두
222기장곰장어051-721-2934부산광역시 기장군 기장읍 기장해안로 70, 1층소스
223빅세일마트051-722-0207부산광역시 기장군 기장읍 차성동로 174, 빅세일마트 1층식육함유가공품
224해논산업051-727-3036부산광역시 기장군 정관읍 정관6로 58, 금샘광장빌딩 201~203호빵류