Overview

Dataset statistics

Number of variables4
Number of observations857
Missing cells385
Missing cells (%)11.2%
Duplicate rows4
Duplicate rows (%)0.5%
Total size in memory26.9 KiB
Average record size in memory32.2 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시_사상구_식품위생업소관리현황_20230619
Author부산광역시 사상구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15025663

Alerts

Dataset has 4 (0.5%) duplicate rowsDuplicates
소재지전화 has 385 (44.9%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:44:19.808997
Analysis finished2023-12-10 16:44:20.485762
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct11
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
즉석판매제조가공업
332 
식품자동판매기영업
235 
집단급식소 식품판매업
79 
식품소분업
71 
유통전문판매업
42 
Other values (6)
98 

Length

Max length11
Median length9
Mean length8.5775963
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row즉석판매제조가공업
2nd row즉석판매제조가공업
3rd row즉석판매제조가공업
4th row즉석판매제조가공업
5th row즉석판매제조가공업

Common Values

ValueCountFrequency (%)
즉석판매제조가공업 332
38.7%
식품자동판매기영업 235
27.4%
집단급식소 식품판매업 79
 
9.2%
식품소분업 71
 
8.3%
유통전문판매업 42
 
4.9%
용기.포장지제조업 34
 
4.0%
기타식품판매업 31
 
3.6%
식품운반업 15
 
1.8%
식용얼음판매업 12
 
1.4%
식품첨가물제조업 4
 
0.5%

Length

2023-12-11T01:44:20.917180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
즉석판매제조가공업 332
35.5%
식품자동판매기영업 235
25.1%
집단급식소 79
 
8.4%
식품판매업 79
 
8.4%
식품소분업 71
 
7.6%
유통전문판매업 42
 
4.5%
용기.포장지제조업 34
 
3.6%
기타식품판매업 31
 
3.3%
식품운반업 15
 
1.6%
식용얼음판매업 12
 
1.3%
Other values (2) 6
 
0.6%
Distinct785
Distinct (%)91.6%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2023-12-11T01:44:21.419716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length30
Mean length6.5052509
Min length2

Characters and Unicode

Total characters5575
Distinct characters478
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique726 ?
Unique (%)84.7%

Sample

1st row제일떡방앗간
2nd row청도상회
3rd row청도떡방앗간
4th row밀양상회
5th row진주상회
ValueCountFrequency (%)
주식회사 19
 
1.8%
씨유 16
 
1.5%
이마트24 14
 
1.3%
세븐일레븐 11
 
1.0%
지에스(gs)25 6
 
0.6%
진주상회 5
 
0.5%
항도청과 5
 
0.5%
담꾹 4
 
0.4%
사상점 4
 
0.4%
밀양상회 4
 
0.4%
Other values (871) 994
91.9%
2023-12-11T01:44:22.061854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
241
 
4.3%
150
 
2.7%
142
 
2.5%
129
 
2.3%
) 129
 
2.3%
( 126
 
2.3%
126
 
2.3%
97
 
1.7%
96
 
1.7%
94
 
1.7%
Other values (468) 4245
76.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4734
84.9%
Space Separator 241
 
4.3%
Uppercase Letter 141
 
2.5%
Close Punctuation 129
 
2.3%
Open Punctuation 126
 
2.3%
Decimal Number 117
 
2.1%
Lowercase Letter 70
 
1.3%
Other Punctuation 14
 
0.3%
Dash Punctuation 1
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
150
 
3.2%
142
 
3.0%
129
 
2.7%
126
 
2.7%
97
 
2.0%
96
 
2.0%
94
 
2.0%
87
 
1.8%
82
 
1.7%
74
 
1.6%
Other values (412) 3657
77.2%
Uppercase Letter
ValueCountFrequency (%)
S 23
16.3%
C 19
13.5%
G 16
11.3%
U 13
9.2%
F 11
 
7.8%
E 6
 
4.3%
H 6
 
4.3%
R 5
 
3.5%
O 5
 
3.5%
D 5
 
3.5%
Other values (11) 32
22.7%
Lowercase Letter
ValueCountFrequency (%)
o 14
20.0%
e 9
12.9%
a 6
8.6%
t 6
8.6%
l 5
 
7.1%
d 5
 
7.1%
u 4
 
5.7%
f 3
 
4.3%
n 3
 
4.3%
h 2
 
2.9%
Other values (8) 13
18.6%
Decimal Number
ValueCountFrequency (%)
2 42
35.9%
4 28
23.9%
5 18
15.4%
1 11
 
9.4%
3 9
 
7.7%
0 7
 
6.0%
7 2
 
1.7%
Other Punctuation
ValueCountFrequency (%)
. 8
57.1%
, 2
 
14.3%
& 2
 
14.3%
· 2
 
14.3%
Space Separator
ValueCountFrequency (%)
241
100.0%
Close Punctuation
ValueCountFrequency (%)
) 129
100.0%
Open Punctuation
ValueCountFrequency (%)
( 126
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4733
84.9%
Common 630
 
11.3%
Latin 211
 
3.8%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
150
 
3.2%
142
 
3.0%
129
 
2.7%
126
 
2.7%
97
 
2.0%
96
 
2.0%
94
 
2.0%
87
 
1.8%
82
 
1.7%
74
 
1.6%
Other values (411) 3656
77.2%
Latin
ValueCountFrequency (%)
S 23
 
10.9%
C 19
 
9.0%
G 16
 
7.6%
o 14
 
6.6%
U 13
 
6.2%
F 11
 
5.2%
e 9
 
4.3%
E 6
 
2.8%
H 6
 
2.8%
a 6
 
2.8%
Other values (29) 88
41.7%
Common
ValueCountFrequency (%)
241
38.3%
) 129
20.5%
( 126
20.0%
2 42
 
6.7%
4 28
 
4.4%
5 18
 
2.9%
1 11
 
1.7%
3 9
 
1.4%
. 8
 
1.3%
0 7
 
1.1%
Other values (7) 11
 
1.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4733
84.9%
ASCII 838
 
15.0%
None 2
 
< 0.1%
Geometric Shapes 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
241
28.8%
) 129
15.4%
( 126
15.0%
2 42
 
5.0%
4 28
 
3.3%
S 23
 
2.7%
C 19
 
2.3%
5 18
 
2.1%
G 16
 
1.9%
o 14
 
1.7%
Other values (44) 182
21.7%
Hangul
ValueCountFrequency (%)
150
 
3.2%
142
 
3.0%
129
 
2.7%
126
 
2.7%
97
 
2.0%
96
 
2.0%
94
 
2.0%
87
 
1.8%
82
 
1.7%
74
 
1.6%
Other values (411) 3656
77.2%
None
ValueCountFrequency (%)
· 2
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct780
Distinct (%)91.0%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2023-12-11T01:44:22.517779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length52
Mean length31.750292
Min length21

Characters and Unicode

Total characters27210
Distinct characters258
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique739 ?
Unique (%)86.2%

Sample

1st row부산광역시 사상구 새벽로168번길 24 (감전동)
2nd row부산광역시 사상구 사상로238번길 47 (괘법동)
3rd row부산광역시 사상구 새벽시장로57번길 8 (감전동)
4th row부산광역시 사상구 낙동대로772번길 9 (엄궁동)
5th row부산광역시 사상구 모덕로67번길 144 (모라동)
ValueCountFrequency (%)
부산광역시 857
 
16.2%
사상구 857
 
16.2%
1층 267
 
5.1%
엄궁동 162
 
3.1%
감전동 149
 
2.8%
괘법동 118
 
2.2%
모라동 106
 
2.0%
주례동 99
 
1.9%
학장동 81
 
1.5%
덕포동 75
 
1.4%
Other values (872) 2514
47.6%
2023-12-11T01:44:23.271383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4429
 
16.3%
1142
 
4.2%
1 1113
 
4.1%
1091
 
4.0%
1031
 
3.8%
1015
 
3.7%
1002
 
3.7%
908
 
3.3%
890
 
3.3%
( 866
 
3.2%
Other values (248) 13723
50.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15959
58.7%
Space Separator 4429
 
16.3%
Decimal Number 4275
 
15.7%
Open Punctuation 867
 
3.2%
Close Punctuation 867
 
3.2%
Other Punctuation 654
 
2.4%
Dash Punctuation 104
 
0.4%
Uppercase Letter 42
 
0.2%
Math Symbol 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1142
 
7.2%
1091
 
6.8%
1031
 
6.5%
1015
 
6.4%
1002
 
6.3%
908
 
5.7%
890
 
5.6%
860
 
5.4%
860
 
5.4%
858
 
5.4%
Other values (217) 6302
39.5%
Decimal Number
ValueCountFrequency (%)
1 1113
26.0%
2 604
14.1%
3 446
10.4%
0 416
 
9.7%
4 337
 
7.9%
5 318
 
7.4%
7 306
 
7.2%
6 280
 
6.5%
9 253
 
5.9%
8 202
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
B 12
28.6%
A 12
28.6%
G 3
 
7.1%
D 3
 
7.1%
P 3
 
7.1%
T 3
 
7.1%
S 2
 
4.8%
C 2
 
4.8%
U 1
 
2.4%
E 1
 
2.4%
Other Punctuation
ValueCountFrequency (%)
, 651
99.5%
. 1
 
0.2%
* 1
 
0.2%
/ 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 866
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 866
99.9%
] 1
 
0.1%
Space Separator
ValueCountFrequency (%)
4429
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 104
100.0%
Math Symbol
ValueCountFrequency (%)
~ 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15959
58.7%
Common 11209
41.2%
Latin 42
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1142
 
7.2%
1091
 
6.8%
1031
 
6.5%
1015
 
6.4%
1002
 
6.3%
908
 
5.7%
890
 
5.6%
860
 
5.4%
860
 
5.4%
858
 
5.4%
Other values (217) 6302
39.5%
Common
ValueCountFrequency (%)
4429
39.5%
1 1113
 
9.9%
( 866
 
7.7%
) 866
 
7.7%
, 651
 
5.8%
2 604
 
5.4%
3 446
 
4.0%
0 416
 
3.7%
4 337
 
3.0%
5 318
 
2.8%
Other values (11) 1163
 
10.4%
Latin
ValueCountFrequency (%)
B 12
28.6%
A 12
28.6%
G 3
 
7.1%
D 3
 
7.1%
P 3
 
7.1%
T 3
 
7.1%
S 2
 
4.8%
C 2
 
4.8%
U 1
 
2.4%
E 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15959
58.7%
ASCII 11251
41.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4429
39.4%
1 1113
 
9.9%
( 866
 
7.7%
) 866
 
7.7%
, 651
 
5.8%
2 604
 
5.4%
3 446
 
4.0%
0 416
 
3.7%
4 337
 
3.0%
5 318
 
2.8%
Other values (21) 1205
 
10.7%
Hangul
ValueCountFrequency (%)
1142
 
7.2%
1091
 
6.8%
1031
 
6.5%
1015
 
6.4%
1002
 
6.3%
908
 
5.7%
890
 
5.6%
860
 
5.4%
860
 
5.4%
858
 
5.4%
Other values (217) 6302
39.5%

소재지전화
Text

MISSING 

Distinct430
Distinct (%)91.1%
Missing385
Missing (%)44.9%
Memory size6.8 KiB
2023-12-11T01:44:23.558161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.002119
Min length12

Characters and Unicode

Total characters5665
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique393 ?
Unique (%)83.3%

Sample

1st row051-328-6060
2nd row051-302-9382
3rd row051-317-1353
4th row051-303-2127
5th row051-312-8375
ValueCountFrequency (%)
051-329-2500 5
 
1.1%
051-327-1800 3
 
0.6%
051-325-8585 3
 
0.6%
051-329-1234 2
 
0.4%
051-321-0533 2
 
0.4%
051-324-4804 2
 
0.4%
051-301-4141 2
 
0.4%
051-313-1110 2
 
0.4%
051-304-8861 2
 
0.4%
051-301-3269 2
 
0.4%
Other values (420) 447
94.7%
2023-12-11T01:44:24.002630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 944
16.7%
1 900
15.9%
0 833
14.7%
5 745
13.2%
3 706
12.5%
2 425
7.5%
7 251
 
4.4%
9 234
 
4.1%
6 216
 
3.8%
4 210
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4721
83.3%
Dash Punctuation 944
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 900
19.1%
0 833
17.6%
5 745
15.8%
3 706
15.0%
2 425
9.0%
7 251
 
5.3%
9 234
 
5.0%
6 216
 
4.6%
4 210
 
4.4%
8 201
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 944
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5665
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 944
16.7%
1 900
15.9%
0 833
14.7%
5 745
13.2%
3 706
12.5%
2 425
7.5%
7 251
 
4.4%
9 234
 
4.1%
6 216
 
3.8%
4 210
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5665
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 944
16.7%
1 900
15.9%
0 833
14.7%
5 745
13.2%
3 706
12.5%
2 425
7.5%
7 251
 
4.4%
9 234
 
4.1%
6 216
 
3.8%
4 210
 
3.7%

Missing values

2023-12-11T01:44:20.307444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:44:20.436524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(도로명)소재지전화
0즉석판매제조가공업제일떡방앗간부산광역시 사상구 새벽로168번길 24 (감전동)<NA>
1즉석판매제조가공업청도상회부산광역시 사상구 사상로238번길 47 (괘법동)051-328-6060
2즉석판매제조가공업청도떡방앗간부산광역시 사상구 새벽시장로57번길 8 (감전동)<NA>
3즉석판매제조가공업밀양상회부산광역시 사상구 낙동대로772번길 9 (엄궁동)<NA>
4즉석판매제조가공업진주상회부산광역시 사상구 모덕로67번길 144 (모라동)051-302-9382
5즉석판매제조가공업밀양상회부산광역시 사상구 사상로309번길 67 (삼락동)<NA>
6즉석판매제조가공업감전시장 방앗간부산광역시 사상구 새벽시장로103번길 38, 1층 (감전동)051-317-1353
7즉석판매제조가공업유진 상회부산광역시 사상구 낙동대로1530번길 16 (삼락동)<NA>
8즉석판매제조가공업대광상회부산광역시 사상구 사상로285번길 8-9 (덕포동)051-303-2127
9즉석판매제조가공업경주상회부산광역시 사상구 사상로238번길 39, 1층 (괘법동)051-312-8375
업종명업소명소재지(도로명)소재지전화
847집단급식소 식품판매업맑은나라부산광역시 사상구 농산물시장로25번길 70, 청과물도매시장 1층 바-1,2,3호 (엄궁동)<NA>
848집단급식소 식품판매업성진FS부산광역시 사상구 농산물시장로25번길 70, 청과물도매시장 1층 라-56,57,58,59호 (엄궁동)<NA>
849집단급식소 식품판매업진주유통부산광역시 사상구 농산물시장로25번길 70, 청과물도매시장 1층 가-66호 (엄궁동)051-322-7598
850집단급식소 식품판매업예찬푸드부산광역시 사상구 새벽시장로 78-5, 1층 (감전동)<NA>
851집단급식소 식품판매업(주)남영축산부산광역시 사상구 사상로551번길 8, 1층 (모라동)051-303-3838
852집단급식소 식품판매업해미루부산광역시 사상구 농산물시장로 33, 항도청과(주) 2층 1-8호 (엄궁동)<NA>
853집단급식소 식품판매업더베스트부산광역시 사상구 엄궁로 142, 분산상가동 101호 (엄궁동, 코오롱아파트)<NA>
854집단급식소 식품판매업이레축산부산광역시 사상구 새벽시장로103번길 24, 1층 (감전동)051-321-3123
855집단급식소 식품판매업주식회사 스마트에프엔씨부산광역시 사상구 강변대로532번길 17-26, 철강판매단지 상가6동 1층 47호 (엄궁동)<NA>
856집단급식소 식품판매업성원푸드부산광역시 사상구 엄궁로 142, 분산상가동 101-2호 (엄궁동, 코오롱아파트)<NA>

Duplicate rows

Most frequently occurring

업종명업소명소재지(도로명)소재지전화# duplicates
0식품운반업개별화물부산광역시 사상구 낙동대로1016번길 17, A동 2층 (감전동)<NA>2
1식품운반업개인용달부산광역시 사상구 낙동대로1016번길 17, 2층 (감전동)<NA>2
2식품자동판매기영업밀양국밥집부산광역시 사상구 낙동대로 745 (엄궁동)051-324-09852
3식품자동판매기영업한송현부산광역시 사상구 새벽로 131 (감전동)051-322-30212