Overview

Dataset statistics

Number of variables: 4
Number of observations: 234
Missing cells: 165
Missing cells (%): 17.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.4 KiB
Average record size in memory: 32.6 B
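
The missing-cell percentage can be recomputed from the counts above. The following is a minimal sketch, assuming the percentage is taken over all cells (variables × observations); the variable names are illustrative and do not come from the report.

```python
# Counts copied from the dataset statistics above.
n_variables = 4
n_observations = 234
missing_cells = 165

# Assumption: the denominator is every cell in the table (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Under that assumption the result rounds to the 17.6% shown in the report.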

Variable types

Categorical: 1
Text: 3

Dataset

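Description: 부산광역시_동구_건강기능식품판매업현황_20230117 (Busan Metropolitan City, Dong-gu: status of health functional food retailers, 2023-01-17)
Author: 부산광역시 동구 (Dong-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15028638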

Alerts

업종명 has constant value "건강기능식품일반판매업" (Constant)
소재지전화 has 165 (70.5%) missing values (Missing)

Reproduction

Analysis started: 2023-12-10 16:35:12.238263
Analysis finished: 2023-12-10 16:35:12.658195
Duration: 0.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
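
The reported duration is the difference between the two timestamps. A small sketch of that subtraction follows; the format string is an assumption based on how the timestamps are printed here.

```python
from datetime import datetime

# Assumed layout of the timestamps printed in the Reproduction section.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-10 16:35:12.238263", FMT)
finished = datetime.strptime("2023-12-10 16:35:12.658195", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```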

Variables

업종명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB
건강기능식품일반판매업
234 

Length

Max length: 11
Median length: 11
Mean length: 11
Min length: 11

Unique

Unique: 0
Unique (%): 0.0%
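
The Distinct and Unique figures measure different things. The sketch below assumes, as the numbers in this report suggest, that Distinct counts the different values present and Unique counts the values that occur exactly once. The helper name `distinct_and_unique` is hypothetical.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique) counts for a sequence of values."""
    counts = Counter(values)
    distinct = len(counts)                               # different values present
    unique = sum(1 for n in counts.values() if n == 1)   # values seen exactly once
    return distinct, unique

# A constant column like 업종명: one value repeated 234 times.
constant_column = ["건강기능식품일반판매업"] * 234
print(distinct_and_unique(constant_column))

# A toy column where one value repeats.
print(distinct_and_unique(["a", "b", "b", "c"]))
```

A constant column therefore has one distinct value and no unique values, which matches the figures shown for 업종명.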

Sample

1st row: 건강기능식품일반판매업
2nd row: 건강기능식품일반판매업
3rd row: 건강기능식품일반판매업
4th row: 건강기능식품일반판매업
5th row: 건강기능식품일반판매업

Common Values

Value | Count | Frequency (%)
건강기능식품일반판매업 | 234 | 100.0%

Length

Histogram of lengths of the category


업소명
Text

Distinct: 232
Distinct (%): 99.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

Length

Max length: 18
Median length: 15
Mean length: 6.9358974
Min length: 2

Characters and Unicode

Total characters: 1623
Distinct characters: 355
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
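
As an illustration of those character properties, the sketch below counts the Unicode general category of each character in one of the sampled values, using Python's standard `unicodedata` module. The `CATEGORY_NAMES` mapping and the `category_counts` helper are hypothetical names introduced for this example, and the mapping covers only a few categories. Script and block lookups are not available in the standard library, so they are left out.

```python
import unicodedata
from collections import Counter

# Readable names for a few general-category codes (partial, for illustration).
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
}

def category_counts(text):
    """Count the characters of `text` by Unicode general category."""
    codes = Counter(unicodedata.category(ch) for ch in text)
    # Fall back to the raw two-letter code when no readable name is listed.
    return {CATEGORY_NAMES.get(code, code): n for code, n in codes.items()}

print(category_counts("(주)현대백화점"))
```

Hangul syllables fall under Other Letter, which is why that category dominates the text columns in this report.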

Unique

Unique: 230
Unique (%): 98.3%

Sample

1st row: 메디프렌드
2nd row: 일신기독병원
3rd row: (주)현대백화점
4th row: 건강생활(주)항도지점
5th row: 건강생활 초량지점

Value | Count | Frequency (%)
주식회사 | 9 | 3.0%
세븐일레븐 | 5 | 1.7%
애터미 | 4 | 1.3%
수정점 | 3 | 1.0%
부산역점 | 2 | 0.7%
현대헬쓰케어 | 2 | 0.7%
홍헤원 | 1 | 0.3%
파트너스 | 1 | 0.3%
초량뷰티센터 | 1 | 0.3%
제이엠(jm | 1 | 0.3%
Other values (271) | 271 | 90.3%

Most occurring characters

ValueCountFrequency (%)
66
 
4.1%
51
 
3.1%
42
 
2.6%
38
 
2.3%
38
 
2.3%
35
 
2.2%
31
 
1.9%
31
 
1.9%
( 30
 
1.8%
) 30
 
1.8%
Other values (345) 1231
75.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1436 | 88.5%
Space Separator | 66 | 4.1%
Open Punctuation | 30 | 1.8%
Close Punctuation | 30 | 1.8%
Uppercase Letter | 27 | 1.7%
Decimal Number | 16 | 1.0%
Lowercase Letter | 16 | 1.0%
Math Symbol | 1 | 0.1%
Other Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
3.6%
42
 
2.9%
38
 
2.6%
38
 
2.6%
35
 
2.4%
31
 
2.2%
31
 
2.2%
24
 
1.7%
22
 
1.5%
22
 
1.5%
Other values (306) 1102
76.7%
Uppercase Letter
ValueCountFrequency (%)
N 4
14.8%
D 3
11.1%
G 3
11.1%
I 2
 
7.4%
L 2
 
7.4%
S 2
 
7.4%
E 2
 
7.4%
J 1
 
3.7%
M 1
 
3.7%
A 1
 
3.7%
Other values (6) 6
22.2%
Lowercase Letter
ValueCountFrequency (%)
i 3
18.8%
k 2
12.5%
y 2
12.5%
a 2
12.5%
w 1
 
6.2%
e 1
 
6.2%
b 1
 
6.2%
t 1
 
6.2%
h 1
 
6.2%
p 1
 
6.2%
Decimal Number
ValueCountFrequency (%)
0 4
25.0%
2 3
18.8%
5 3
18.8%
3 2
12.5%
1 2
12.5%
6 1
 
6.2%
7 1
 
6.2%
Space Separator
ValueCountFrequency (%)
66
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1436 | 88.5%
Common | 144 | 8.9%
Latin | 43 | 2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
3.6%
42
 
2.9%
38
 
2.6%
38
 
2.6%
35
 
2.4%
31
 
2.2%
31
 
2.2%
24
 
1.7%
22
 
1.5%
22
 
1.5%
Other values (306) 1102
76.7%
Latin
ValueCountFrequency (%)
N 4
 
9.3%
D 3
 
7.0%
G 3
 
7.0%
i 3
 
7.0%
I 2
 
4.7%
L 2
 
4.7%
k 2
 
4.7%
y 2
 
4.7%
a 2
 
4.7%
S 2
 
4.7%
Other values (17) 18
41.9%
Common
ValueCountFrequency (%)
66
45.8%
( 30
20.8%
) 30
20.8%
0 4
 
2.8%
2 3
 
2.1%
5 3
 
2.1%
3 2
 
1.4%
1 2
 
1.4%
6 1
 
0.7%
7 1
 
0.7%
Other values (2) 2
 
1.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1436 | 88.5%
ASCII | 187 | 11.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
66
35.3%
( 30
16.0%
) 30
16.0%
0 4
 
2.1%
N 4
 
2.1%
2 3
 
1.6%
D 3
 
1.6%
G 3
 
1.6%
5 3
 
1.6%
i 3
 
1.6%
Other values (29) 38
20.3%
Hangul
ValueCountFrequency (%)
51
 
3.6%
42
 
2.9%
38
 
2.6%
38
 
2.6%
35
 
2.4%
31
 
2.2%
31
 
2.2%
24
 
1.7%
22
 
1.5%
22
 
1.5%
Other values (306) 1102
76.7%

소재지(도로명)
Text

Distinct: 230
Distinct (%): 98.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

Length

Max length: 53
Median length: 45
Mean length: 33.782051
Min length: 21

Characters and Unicode

Total characters: 7905
Distinct characters: 198
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 227
Unique (%): 97.0%

Sample

1st row: 부산광역시 동구 범일로 119 (범일동,,9)
2nd row: 부산광역시 동구 정공단로 34 (좌천동)
3rd row: 부산광역시 동구 범일로 125 (범일동,현대백화점)
4th row: 부산광역시 동구 범일로 113, 3층 (범일동)
5th row: 부산광역시 동구 중앙대로349번길 38, 6층 (수정동)

Value | Count | Frequency (%)
부산광역시 | 233 | 14.7%
동구 | 233 | 14.7%
초량동 | 93 | 5.9%
범일동 | 73 | 4.6%
중앙대로 | 45 | 2.8%
수정동 | 35 | 2.2%
1층 | 29 | 1.8%
범일로 | 27 | 1.7%
2층 | 21 | 1.3%
좌천동 | 20 | 1.3%
Other values (427) | 774 | 48.9%

Most occurring characters

ValueCountFrequency (%)
1349
 
17.1%
509
 
6.4%
1 317
 
4.0%
, 269
 
3.4%
257
 
3.3%
254
 
3.2%
242
 
3.1%
) 242
 
3.1%
( 242
 
3.1%
239
 
3.0%
Other values (188) 3985
50.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4314 | 54.6%
Decimal Number | 1407 | 17.8%
Space Separator | 1349 | 17.1%
Other Punctuation | 273 | 3.5%
Close Punctuation | 242 | 3.1%
Open Punctuation | 242 | 3.1%
Dash Punctuation | 52 | 0.7%
Uppercase Letter | 21 | 0.3%
Lowercase Letter | 3 | < 0.1%
Letter Number | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
509
 
11.8%
257
 
6.0%
254
 
5.9%
242
 
5.6%
239
 
5.5%
233
 
5.4%
233
 
5.4%
231
 
5.4%
140
 
3.2%
131
 
3.0%
Other values (158) 1845
42.8%
Decimal Number
ValueCountFrequency (%)
1 317
22.5%
2 217
15.4%
0 204
14.5%
3 148
10.5%
6 104
 
7.4%
5 99
 
7.0%
4 99
 
7.0%
7 92
 
6.5%
9 67
 
4.8%
8 60
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
A 6
28.6%
Y 3
14.3%
C 3
14.3%
M 2
 
9.5%
D 2
 
9.5%
B 2
 
9.5%
G 1
 
4.8%
T 1
 
4.8%
W 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 269
98.5%
· 2
 
0.7%
: 1
 
0.4%
/ 1
 
0.4%
Lowercase Letter
ValueCountFrequency (%)
e 2
66.7%
b 1
33.3%
Space Separator
ValueCountFrequency (%)
1349
100.0%
Close Punctuation
ValueCountFrequency (%)
) 242
100.0%
Open Punctuation
ValueCountFrequency (%)
( 242
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4314 | 54.6%
Common | 3565 | 45.1%
Latin | 26 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
509
 
11.8%
257
 
6.0%
254
 
5.9%
242
 
5.6%
239
 
5.5%
233
 
5.4%
233
 
5.4%
231
 
5.4%
140
 
3.2%
131
 
3.0%
Other values (158) 1845
42.8%
Common
ValueCountFrequency (%)
1349
37.8%
1 317
 
8.9%
, 269
 
7.5%
) 242
 
6.8%
( 242
 
6.8%
2 217
 
6.1%
0 204
 
5.7%
3 148
 
4.2%
6 104
 
2.9%
5 99
 
2.8%
Other values (8) 374
 
10.5%
Latin
ValueCountFrequency (%)
A 6
23.1%
Y 3
11.5%
C 3
11.5%
M 2
 
7.7%
2
 
7.7%
D 2
 
7.7%
B 2
 
7.7%
e 2
 
7.7%
b 1
 
3.8%
G 1
 
3.8%
Other values (2) 2
 
7.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4314 | 54.6%
ASCII | 3587 | 45.4%
Number Forms | 2 | < 0.1%
None | 2 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1349
37.6%
1 317
 
8.8%
, 269
 
7.5%
) 242
 
6.7%
( 242
 
6.7%
2 217
 
6.0%
0 204
 
5.7%
3 148
 
4.1%
6 104
 
2.9%
5 99
 
2.8%
Other values (18) 396
 
11.0%
Hangul
ValueCountFrequency (%)
509
 
11.8%
257
 
6.0%
254
 
5.9%
242
 
5.6%
239
 
5.5%
233
 
5.4%
233
 
5.4%
231
 
5.4%
140
 
3.2%
131
 
3.0%
Other values (158) 1845
42.8%
Number Forms
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
· 2
100.0%

소재지전화
Text

MISSING 

Distinct: 69
Distinct (%): 100.0%
Missing: 165
Missing (%): 70.5%
Memory size: 2.0 KiB

Length

Max length: 12
Median length: 12
Mean length: 11.956522
Min length: 11

Characters and Unicode

Total characters: 825
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 69
Unique (%): 100.0%

Sample

1st row: 051-466-8452
2nd row: 051-468-2300
3rd row: 051-441-2211
4th row: 051-441-1195
5th row: 051-469-6366
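
The sampled values all look like hyphen-separated phone numbers. A minimal sketch of a format check follows. The pattern (a 2-3 digit area code, a 3-4 digit exchange and a 4-digit line number) is an assumption inferred from these five samples and the reported lengths of 11 to 12 characters. It is not a rule stated by the dataset, and `is_well_formed` is a hypothetical helper.

```python
import re

# Assumed layout: area code, exchange, line number, separated by hyphens.
PHONE_RE = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")

def is_well_formed(phone):
    """Return True if `phone` matches the assumed hyphenated layout."""
    return PHONE_RE.fullmatch(phone) is not None

# The five sample rows shown above.
samples = [
    "051-466-8452",
    "051-468-2300",
    "051-441-2211",
    "051-441-1195",
    "051-469-6366",
]

for phone in samples:
    print(phone, is_well_formed(phone))
```

Each sample holds ten digits and two hyphens, which agrees with the character statistics for this column: Decimal Number and Dash Punctuation are the only two categories present.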

Value | Count | Frequency (%)
051-644-6300 | 1 | 1.4%
051-462-7737 | 1 | 1.4%
051-441-5071 | 1 | 1.4%
051-714-0671 | 1 | 1.4%
051-604-0915 | 1 | 1.4%
051-442-5290 | 1 | 1.4%
051-646-5654 | 1 | 1.4%
051-759-6319 | 1 | 1.4%
051-643-5252 | 1 | 1.4%
051-635-9197 | 1 | 1.4%
Other values (59) | 59 | 85.5%

Most occurring characters

Value | Count | Frequency (%)
- | 138 | 16.7%
1 | 115 | 13.9%
5 | 109 | 13.2%
0 | 106 | 12.8%
4 | 90 | 10.9%
6 | 80 | 9.7%
2 | 55 | 6.7%
7 | 45 | 5.5%
3 | 42 | 5.1%
9 | 24 | 2.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 687 | 83.3%
Dash Punctuation | 138 | 16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 115
16.7%
5 109
15.9%
0 106
15.4%
4 90
13.1%
6 80
11.6%
2 55
8.0%
7 45
 
6.6%
3 42
 
6.1%
9 24
 
3.5%
8 21
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 138
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 825
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 138
16.7%
1 115
13.9%
5 109
13.2%
0 106
12.8%
4 90
10.9%
6 80
9.7%
2 55
 
6.7%
7 45
 
5.5%
3 42
 
5.1%
9 24
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 825
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 138
16.7%
1 115
13.9%
5 109
13.2%
0 106
12.8%
4 90
10.9%
6 80
9.7%
2 55
 
6.7%
7 45
 
5.5%
3 42
 
5.1%
9 24
 
2.9%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

index | 업종명 | 업소명 | 소재지(도로명) | 소재지전화
0 | 건강기능식품일반판매업 | 메디프렌드 | 부산광역시 동구 범일로 119 (범일동,,9) | <NA>
1 | 건강기능식품일반판매업 | 일신기독병원 | 부산광역시 동구 정공단로 34 (좌천동) | <NA>
2 | 건강기능식품일반판매업 | (주)현대백화점 | 부산광역시 동구 범일로 125 (범일동,현대백화점) | <NA>
3 | 건강기능식품일반판매업 | 건강생활(주)항도지점 | 부산광역시 동구 범일로 113, 3층 (범일동) | <NA>
4 | 건강기능식품일반판매업 | 건강생활 초량지점 | 부산광역시 동구 중앙대로349번길 38, 6층 (수정동) | 051-466-8452
5 | 건강기능식품일반판매업 | 유니베라 부산동부대리점 | 부산광역시 동구 중앙대로 383 (수정동,메디컬센터 9층) | 051-468-2300
6 | 건강기능식품일반판매업 | (주)건강생활 서부산지점 | 부산광역시 동구 조방로 39, 썬오피스텔 1301호 (범일동) | <NA>
7 | 건강기능식품일반판매업 | 에이스패밀리 | 부산광역시 동구 중앙대로226번길 13-7, 5층 (초량동, 서강종합빌딩) | <NA>
8 | 건강기능식품일반판매업 | 앨트웰(주)부산지사 | 부산광역시 동구 중앙대로 262, 1층 (초량동) | 051-441-2211
9 | 건강기능식품일반판매업 | 생그린 | 부산광역시 동구 고관로 67-2 (수정동,337-9(3층)) | 051-441-1195

index | 업종명 | 업소명 | 소재지(도로명) | 소재지전화
224 | 건강기능식품일반판매업 | 주식회사 치엘로 | 부산광역시 동구 중앙대로196번길 6-7, 403-A11호 (초량동) | <NA>
225 | 건강기능식품일반판매업 | 요야마켓 | 부산광역시 동구 진성로71번길 28, 8층 802호 (수정동, 수정 하늘꽃) | <NA>
226 | 건강기능식품일반판매업 | 석인강식 | 부산광역시 동구 성남이로60번길 24, 4층 (범일동) | <NA>
227 | 건강기능식품일반판매업 | 어썸리본 | 부산광역시 동구 조방로 48, 1층 1066호 (범일동) | <NA>
228 | 건강기능식품일반판매업 | 샤월마켓 | 부산광역시 동구 중앙대로221번길 43, 초량골든힐오피스텔 601호 (초량동) | <NA>
229 | 건강기능식품일반판매업 | 무창 | 부산광역시 동구 범일로90번길 17, 5층 528호 (범일동, 범일동 삼정그린코아 더시티) | <NA>
230 | 건강기능식품일반판매업 | 유니온엠시스 | 부산광역시 동구 중앙대로226번길 7-3, 거영빌딩 701호 (초량동) | <NA>
231 | 건강기능식품일반판매업 | 호니파파컴퍼니 | 부산광역시 동구 수정로 23, 2층 (수정동) | <NA>
232 | 건강기능식품일반판매업 | 3070몰 | 부산광역시 동구 중앙대로196번길 6-7, 801호 (초량동) | <NA>
233 | 건강기능식품일반판매업 | 인디언리프 | 부산광역시 동구 홍곡로 50, 104동 1220호 (초량동, e편한세상 부산항) | <NA>