Overview

Dataset statistics

Number of variables4
Number of observations195
Missing cells128
Missing cells (%)16.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.2 KiB
Average record size in memory32.7 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시_동구_건강기능식품판매업현황_20220113
Author부산광역시 동구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15028638

Alerts

업종명 has constant value ""Constant
소재지전화 has 128 (65.6%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:35:17.080720
Analysis finished2023-12-10 16:35:17.661024
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
건강기능식품일반판매업
195 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건강기능식품일반판매업
2nd row건강기능식품일반판매업
3rd row건강기능식품일반판매업
4th row건강기능식품일반판매업
5th row건강기능식품일반판매업

Common Values

ValueCountFrequency (%)
건강기능식품일반판매업 195
100.0%

Length

2023-12-11T01:35:17.741166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:35:17.873071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건강기능식품일반판매업 195
100.0%
Distinct193
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-11T01:35:18.151574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length7.2717949
Min length2

Characters and Unicode

Total characters1418
Distinct characters326
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique191 ?
Unique (%)97.9%

Sample

1st row인제건강생활동부산지점
2nd row초량지사
3rd row파스퇴르 진구유통
4th row세븐일레븐 부산국제여객터미널2호점
5th row포라이프
ValueCountFrequency (%)
주식회사 7
 
2.7%
세븐일레븐 5
 
2.0%
애터미 4
 
1.6%
유니베라 2
 
0.8%
파트너스 2
 
0.8%
수정점 2
 
0.8%
부산역점 2
 
0.8%
중동구가정대리점 1
 
0.4%
동구녹즙 1
 
0.4%
초량뷰티센터 1
 
0.4%
Other values (229) 229
89.5%
2023-12-11T01:35:18.713230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
61
 
4.3%
47
 
3.3%
39
 
2.8%
35
 
2.5%
33
 
2.3%
31
 
2.2%
31
 
2.2%
) 30
 
2.1%
( 30
 
2.1%
26
 
1.8%
Other values (316) 1055
74.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1248
88.0%
Space Separator 61
 
4.3%
Close Punctuation 30
 
2.1%
Open Punctuation 30
 
2.1%
Uppercase Letter 23
 
1.6%
Lowercase Letter 14
 
1.0%
Decimal Number 10
 
0.7%
Other Punctuation 1
 
0.1%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
3.8%
39
 
3.1%
35
 
2.8%
33
 
2.6%
31
 
2.5%
31
 
2.5%
26
 
2.1%
22
 
1.8%
21
 
1.7%
17
 
1.4%
Other values (278) 946
75.8%
Uppercase Letter
ValueCountFrequency (%)
G 3
13.0%
S 2
 
8.7%
O 2
 
8.7%
I 2
 
8.7%
B 2
 
8.7%
N 2
 
8.7%
A 2
 
8.7%
C 1
 
4.3%
K 1
 
4.3%
J 1
 
4.3%
Other values (5) 5
21.7%
Lowercase Letter
ValueCountFrequency (%)
i 2
14.3%
n 2
14.3%
o 1
7.1%
l 1
7.1%
y 1
7.1%
j 1
7.1%
u 1
7.1%
k 1
7.1%
c 1
7.1%
p 1
7.1%
Other values (2) 2
14.3%
Decimal Number
ValueCountFrequency (%)
5 3
30.0%
2 3
30.0%
1 1
 
10.0%
3 1
 
10.0%
6 1
 
10.0%
4 1
 
10.0%
Space Separator
ValueCountFrequency (%)
61
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1248
88.0%
Common 133
 
9.4%
Latin 37
 
2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
3.8%
39
 
3.1%
35
 
2.8%
33
 
2.6%
31
 
2.5%
31
 
2.5%
26
 
2.1%
22
 
1.8%
21
 
1.7%
17
 
1.4%
Other values (278) 946
75.8%
Latin
ValueCountFrequency (%)
G 3
 
8.1%
S 2
 
5.4%
O 2
 
5.4%
I 2
 
5.4%
i 2
 
5.4%
B 2
 
5.4%
n 2
 
5.4%
N 2
 
5.4%
A 2
 
5.4%
o 1
 
2.7%
Other values (17) 17
45.9%
Common
ValueCountFrequency (%)
61
45.9%
) 30
22.6%
( 30
22.6%
5 3
 
2.3%
2 3
 
2.3%
1 1
 
0.8%
& 1
 
0.8%
= 1
 
0.8%
3 1
 
0.8%
6 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1248
88.0%
ASCII 170
 
12.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
61
35.9%
) 30
17.6%
( 30
17.6%
5 3
 
1.8%
G 3
 
1.8%
2 3
 
1.8%
S 2
 
1.2%
O 2
 
1.2%
I 2
 
1.2%
i 2
 
1.2%
Other values (28) 32
18.8%
Hangul
ValueCountFrequency (%)
47
 
3.8%
39
 
3.1%
35
 
2.8%
33
 
2.6%
31
 
2.5%
31
 
2.5%
26
 
2.1%
22
 
1.8%
21
 
1.7%
17
 
1.4%
Other values (278) 946
75.8%
Distinct191
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-11T01:35:19.172701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length45
Mean length33.148718
Min length21

Characters and Unicode

Total characters6464
Distinct characters185
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique189 ?
Unique (%)96.9%

Sample

1st row부산광역시 동구 중앙대로349번길 38 (수정동)
2nd row부산광역시 동구 중앙대로 266, 7층 (초량동, 부경빌딩)
3rd row부산광역시 동구 홍곡로 20 (초량동)
4th row부산광역시 동구 충장대로 206, 3층 (초량동)
5th row부산광역시 동구 망양로 494-10, 301호 (초량동, 대륙아파트)
ValueCountFrequency (%)
부산광역시 195
 
15.0%
동구 195
 
15.0%
초량동 76
 
5.9%
범일동 64
 
4.9%
중앙대로 40
 
3.1%
범일로 27
 
2.1%
수정동 24
 
1.8%
1층 19
 
1.5%
2층 17
 
1.3%
좌천동 14
 
1.1%
Other values (366) 627
48.3%
2023-12-11T01:35:19.797736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1103
 
17.1%
420
 
6.5%
1 239
 
3.7%
215
 
3.3%
214
 
3.3%
, 210
 
3.2%
) 204
 
3.2%
( 204
 
3.2%
203
 
3.1%
202
 
3.1%
Other values (175) 3250
50.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3584
55.4%
Decimal Number 1110
 
17.2%
Space Separator 1103
 
17.1%
Other Punctuation 213
 
3.3%
Close Punctuation 204
 
3.2%
Open Punctuation 204
 
3.2%
Dash Punctuation 38
 
0.6%
Uppercase Letter 6
 
0.1%
Lowercase Letter 1
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
420
 
11.7%
215
 
6.0%
214
 
6.0%
203
 
5.7%
202
 
5.6%
195
 
5.4%
195
 
5.4%
194
 
5.4%
117
 
3.3%
117
 
3.3%
Other values (149) 1512
42.2%
Decimal Number
ValueCountFrequency (%)
1 239
21.5%
2 184
16.6%
0 156
14.1%
3 122
11.0%
5 85
 
7.7%
6 82
 
7.4%
4 78
 
7.0%
7 70
 
6.3%
9 52
 
4.7%
8 42
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
A 1
16.7%
C 1
16.7%
M 1
16.7%
Y 1
16.7%
G 1
16.7%
T 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 210
98.6%
/ 1
 
0.5%
. 1
 
0.5%
· 1
 
0.5%
Space Separator
ValueCountFrequency (%)
1103
100.0%
Close Punctuation
ValueCountFrequency (%)
) 204
100.0%
Open Punctuation
ValueCountFrequency (%)
( 204
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3584
55.4%
Common 2872
44.4%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
420
 
11.7%
215
 
6.0%
214
 
6.0%
203
 
5.7%
202
 
5.6%
195
 
5.4%
195
 
5.4%
194
 
5.4%
117
 
3.3%
117
 
3.3%
Other values (149) 1512
42.2%
Common
ValueCountFrequency (%)
1103
38.4%
1 239
 
8.3%
, 210
 
7.3%
) 204
 
7.1%
( 204
 
7.1%
2 184
 
6.4%
0 156
 
5.4%
3 122
 
4.2%
5 85
 
3.0%
6 82
 
2.9%
Other values (8) 283
 
9.9%
Latin
ValueCountFrequency (%)
b 1
12.5%
A 1
12.5%
C 1
12.5%
M 1
12.5%
Y 1
12.5%
1
12.5%
G 1
12.5%
T 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3584
55.4%
ASCII 2878
44.5%
Number Forms 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1103
38.3%
1 239
 
8.3%
, 210
 
7.3%
) 204
 
7.1%
( 204
 
7.1%
2 184
 
6.4%
0 156
 
5.4%
3 122
 
4.2%
5 85
 
3.0%
6 82
 
2.8%
Other values (14) 289
 
10.0%
Hangul
ValueCountFrequency (%)
420
 
11.7%
215
 
6.0%
214
 
6.0%
203
 
5.7%
202
 
5.6%
195
 
5.4%
195
 
5.4%
194
 
5.4%
117
 
3.3%
117
 
3.3%
Other values (149) 1512
42.2%
Number Forms
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
· 1
100.0%

소재지전화
Text

MISSING 

Distinct67
Distinct (%)100.0%
Missing128
Missing (%)65.6%
Memory size1.7 KiB
2023-12-11T01:35:20.146777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters804
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)100.0%

Sample

1st row051-328-0529
2nd row051-343-2141
3rd row051-441-1667
4th row051-441-2211
5th row051-441-2304
ValueCountFrequency (%)
051-441-5485 1
 
1.5%
051-638-6278 1
 
1.5%
051-637-7131 1
 
1.5%
051-635-9197 1
 
1.5%
051-634-0462 1
 
1.5%
051-633-5500 1
 
1.5%
051-633-4285 1
 
1.5%
051-633-1600 1
 
1.5%
051-632-6100 1
 
1.5%
051-328-0529 1
 
1.5%
Other values (57) 57
85.1%
2023-12-11T01:35:20.700941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 134
16.7%
1 110
13.7%
0 109
13.6%
5 109
13.6%
6 84
10.4%
4 81
10.1%
2 43
 
5.3%
3 42
 
5.2%
7 41
 
5.1%
9 28
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 670
83.3%
Dash Punctuation 134
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 110
16.4%
0 109
16.3%
5 109
16.3%
6 84
12.5%
4 81
12.1%
2 43
 
6.4%
3 42
 
6.3%
7 41
 
6.1%
9 28
 
4.2%
8 23
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 134
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 804
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 134
16.7%
1 110
13.7%
0 109
13.6%
5 109
13.6%
6 84
10.4%
4 81
10.1%
2 43
 
5.3%
3 42
 
5.2%
7 41
 
5.1%
9 28
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 804
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 134
16.7%
1 110
13.7%
0 109
13.6%
5 109
13.6%
6 84
10.4%
4 81
10.1%
2 43
 
5.3%
3 42
 
5.2%
7 41
 
5.1%
9 28
 
3.5%

Missing values

2023-12-11T01:35:17.486467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:35:17.616456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(도로명)소재지전화
0건강기능식품일반판매업인제건강생활동부산지점부산광역시 동구 중앙대로349번길 38 (수정동)<NA>
1건강기능식품일반판매업초량지사부산광역시 동구 중앙대로 266, 7층 (초량동, 부경빌딩)<NA>
2건강기능식품일반판매업파스퇴르 진구유통부산광역시 동구 홍곡로 20 (초량동)<NA>
3건강기능식품일반판매업세븐일레븐 부산국제여객터미널2호점부산광역시 동구 충장대로 206, 3층 (초량동)<NA>
4건강기능식품일반판매업포라이프부산광역시 동구 망양로 494-10, 301호 (초량동, 대륙아파트)<NA>
5건강기능식품일반판매업부산스마일진센타부산광역시 동구 범일로 78, 3층 (범일동)<NA>
6건강기능식품일반판매업엠티글로벌부산광역시 동구 중앙대로 270, 강남빌딩 704호 (초량동)<NA>
7건강기능식품일반판매업라운드부산광역시 동구 중앙대로 270, 강남빌딩 10층 1049호 (초량동)<NA>
8건강기능식품일반판매업파인헬스부산광역시 동구 자성공원로 13, 6층 602호 (범일동)<NA>
9건강기능식품일반판매업보아비다부산광역시 동구 중앙대로 지하 200, 가30호 (초량동)<NA>
업종명업소명소재지(도로명)소재지전화
185건강기능식품일반판매업아리따움 수정점부산광역시 동구 중앙대로371번길 45 (수정동,(1층))<NA>
186건강기능식품일반판매업일신기독병원부산광역시 동구 정공단로 34 (좌천동)<NA>
187건강기능식품일반판매업광덕허브부산광역시 동구 중앙대로 509-1 (범일동,(1층))<NA>
188건강기능식품일반판매업메디프렌드부산광역시 동구 범일로 119 (범일동,,9)<NA>
189건강기능식품일반판매업유니베라부산광역시 동구 조방로 17 (범일동)<NA>
190건강기능식품일반판매업건강생활(주)항도지점부산광역시 동구 범일로 113 (범일동,동성빌딩 8층)<NA>
191건강기능식품일반판매업남경향문외과부산광역시 동구 범일로 114 (범일동)<NA>
192건강기능식품일반판매업GNC 현대부산점부산광역시 동구 범일로 125, 지하2층 (범일동, 현대백화점)<NA>
193건강기능식품일반판매업(주)대동농원부산광역시 동구 범일로 125, 지하2층 (범일동)<NA>
194건강기능식품일반판매업(주)현대백화점부산광역시 동구 범일로 125 (범일동,현대백화점)<NA>