Overview

Dataset statistics

Number of variables: 4
Number of observations: 357
Missing cells: 241
Missing cells (%): 16.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.3 KiB
Average record size in memory: 32.4 B
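
The missing-cell percentage follows from the counts in this table. A minimal sketch, using only the figures reported here (the variable names are illustrative and not part of the report):

```python
# Figures copied from the dataset statistics.
n_variables = 4
n_observations = 357
missing_cells = 241

total_cells = n_variables * n_observations        # every column/row intersection
missing_pct = 100 * missing_cells / total_cells   # share of cells that are empty

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

All 241 missing cells come from a single column, so the per-column missing rate is much higher than this dataset-wide figure.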

Variable types

Categorical: 1
Text: 3

Dataset

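Description: Provides information on health functional food sales businesses in Sasang-gu, Busan Metropolitan City (business type, business name, address, and address phone number).
URL: https://www.data.go.kr/data/15025661/fileData.do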

Alerts

업종명 (business type) is highly imbalanced (84.5%) [Imbalance]
소재지전화 (address phone number) has 241 (67.5%) missing values [Missing]
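
One way to arrive at an imbalance figure like the one in the first alert is a normalized-entropy score: one minus the Shannon entropy of the class shares divided by the maximum possible entropy. The sketch below assumes that definition (the report itself does not state its formula) and uses the two class counts, 349 and 8, that the report lists for 업종명. The function name is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # with a single class there is nothing to balance
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Class counts reported for 업종명.
score = imbalance_score([349, 8])
print(f"{score:.1%}")
```

A perfectly balanced column scores 0 and a column dominated by one class approaches 1, so a score of this size means that one category accounts for nearly all rows.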

Reproduction

Analysis started: 2023-12-13 00:36:53.135092
Analysis finished: 2023-12-13 00:36:53.571765
Duration: 0.44 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

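업종명 (business type)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

건강기능식품일반판매업 (general health functional food sales business): 349
건강기능식품유통전문판매업 (health functional food distribution-specialized sales business): 8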

Length

Max length: 13
Median length: 11
Mean length: 11.044818
Min length: 11
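
These length statistics can be reproduced from the two category labels and their counts. A minimal sketch, assuming lengths are measured in Unicode code points (which is what Python's `len` returns for these strings):

```python
from statistics import mean, median

# Category labels and row counts as reported for 업종명.
label_counts = {
    "건강기능식품일반판매업": 349,
    "건강기능식품유통전문판매업": 8,
}

# One length entry per row in the column.
lengths = [len(label) for label, n in label_counts.items() for _ in range(n)]

max_len = max(lengths)
median_len = median(lengths)
mean_len = round(mean(lengths), 6)
min_len = min(lengths)

print(max_len, median_len, mean_len, min_len)
```

Because 349 of the 357 rows carry the shorter label, the median equals the minimum and the mean sits only slightly above it.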

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 건강기능식품일반판매업
2nd row: 건강기능식품일반판매업
3rd row: 건강기능식품일반판매업
4th row: 건강기능식품일반판매업
5th row: 건강기능식품일반판매업

Common Values

Value | Count | Frequency (%)
건강기능식품일반판매업 | 349 | 97.8%
건강기능식품유통전문판매업 | 8 | 2.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 업종명]

업소명 (business name)
Text

Distinct: 352
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

Length

Max length: 22
Median length: 18
Mean length: 7.1820728
Min length: 2

Characters and Unicode

Total characters: 2564
Distinct characters: 410
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
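
As an illustration of these character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts characters per category for two business names from this report. The mapping to long category names and the helper function are illustrative, and scripts and blocks are left out because the standard library does not provide them.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Nd": "Decimal Number", "Zs": "Space Separator",
    "Ps": "Open Punctuation", "Pe": "Close Punctuation",
    "Pc": "Connector Punctuation", "Pd": "Dash Punctuation",
    "Po": "Other Punctuation", "Sm": "Math Symbol",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Two business names taken from the sample rows of 업소명.
sample = ["(주)이마트사상점", "지에스리테일(주례점)"]
counts = category_counts(sample)
print(counts.most_common())
```

Hangul syllables fall under "Other Letter", which is why that category dominates the text columns in this dataset.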

Unique

Unique: 347
Unique (%): 97.2%

Sample

1st row: (주)이마트사상점
2nd row: 정관장홍삼모라점
3rd row: 정관장홍삼 괘법 전시판매장
4th row: 유니베라
5th row: 지에스리테일(주례점)

Value | Count | Frequency (%)
세븐일레븐 | 7 | 1.5%
주식회사 | 6 | 1.3%
씨제이올리브영(주 | 3 | 0.7%
애터미 | 3 | 0.7%
에치와이 | 3 | 0.7%
유니베라 | 3 | 0.7%
한국암웨이 | 3 | 0.7%
엄궁점 | 3 | 0.7%
3h | 3 | 0.7%
아리따움 | 2 | 0.4%
Other values (407) | 418 | 92.1%

Most occurring characters

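Value | Count | Frequency (%)
(space) | 97 | 3.8%
(not shown) | 82 | 3.2%
(not shown) | 74 | 2.9%
(not shown) | 72 | 2.8%
) | 60 | 2.3%
( | 59 | 2.3%
(not shown) | 58 | 2.3%
(not shown) | 48 | 1.9%
(not shown) | 39 | 1.5%
(not shown) | 36 | 1.4%
Other values (400) | 1939 | 75.6%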

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2209 | 86.2%
Space Separator | 97 | 3.8%
Uppercase Letter | 81 | 3.2%
Close Punctuation | 60 | 2.3%
Open Punctuation | 59 | 2.3%
Lowercase Letter | 36 | 1.4%
Decimal Number | 17 | 0.7%
Connector Punctuation | 2 | 0.1%
Dash Punctuation | 2 | 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 82 | 3.7%
(not shown) | 74 | 3.3%
(not shown) | 72 | 3.3%
(not shown) | 58 | 2.6%
(not shown) | 48 | 2.2%
(not shown) | 39 | 1.8%
(not shown) | 36 | 1.6%
(not shown) | 35 | 1.6%
(not shown) | 35 | 1.6%
(not shown) | 35 | 1.6%
Other values (349) | 1695 | 76.7%

Uppercase Letter
Value | Count | Frequency (%)
S | 9 | 11.1%
O | 9 | 11.1%
M | 6 | 7.4%
A | 5 | 6.2%
U | 5 | 6.2%
H | 5 | 6.2%
G | 5 | 6.2%
L | 5 | 6.2%
D | 4 | 4.9%
N | 4 | 4.9%
Other values (11) | 24 | 29.6%

Lowercase Letter
Value | Count | Frequency (%)
l | 8 | 22.2%
a | 4 | 11.1%
e | 4 | 11.1%
n | 3 | 8.3%
h | 3 | 8.3%
g | 2 | 5.6%
y | 2 | 5.6%
t | 2 | 5.6%
v | 1 | 2.8%
r | 1 | 2.8%
Other values (6) | 6 | 16.7%

Decimal Number
Value | Count | Frequency (%)
2 | 6 | 35.3%
3 | 4 | 23.5%
5 | 2 | 11.8%
8 | 1 | 5.9%
1 | 1 | 5.9%
9 | 1 | 5.9%
4 | 1 | 5.9%
7 | 1 | 5.9%

Space Separator
Value | Count | Frequency (%)
(space) | 97 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 60 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 59 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 2 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
· | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2209 | 86.2%
Common | 238 | 9.3%
Latin | 117 | 4.6%

Most frequent character per script

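Hangul
Value | Count | Frequency (%)
(not shown) | 82 | 3.7%
(not shown) | 74 | 3.3%
(not shown) | 72 | 3.3%
(not shown) | 58 | 2.6%
(not shown) | 48 | 2.2%
(not shown) | 39 | 1.8%
(not shown) | 36 | 1.6%
(not shown) | 35 | 1.6%
(not shown) | 35 | 1.6%
(not shown) | 35 | 1.6%
Other values (349) | 1695 | 76.7%

Latin
Value | Count | Frequency (%)
S | 9 | 7.7%
O | 9 | 7.7%
l | 8 | 6.8%
M | 6 | 5.1%
A | 5 | 4.3%
U | 5 | 4.3%
H | 5 | 4.3%
G | 5 | 4.3%
L | 5 | 4.3%
D | 4 | 3.4%
Other values (27) | 56 | 47.9%

Common
Value | Count | Frequency (%)
(space) | 97 | 40.8%
) | 60 | 25.2%
( | 59 | 24.8%
2 | 6 | 2.5%
3 | 4 | 1.7%
_ | 2 | 0.8%
- | 2 | 0.8%
5 | 2 | 0.8%
8 | 1 | 0.4%
1 | 1 | 0.4%
Other values (4) | 4 | 1.7%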

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2209 | 86.2%
ASCII | 354 | 13.8%
None | 1 | < 0.1%

Most frequent character per block

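ASCII
Value | Count | Frequency (%)
(space) | 97 | 27.4%
) | 60 | 16.9%
( | 59 | 16.7%
S | 9 | 2.5%
O | 9 | 2.5%
l | 8 | 2.3%
2 | 6 | 1.7%
M | 6 | 1.7%
A | 5 | 1.4%
U | 5 | 1.4%
Other values (40) | 90 | 25.4%

Hangul
Value | Count | Frequency (%)
(not shown) | 82 | 3.7%
(not shown) | 74 | 3.3%
(not shown) | 72 | 3.3%
(not shown) | 58 | 2.6%
(not shown) | 48 | 2.2%
(not shown) | 39 | 1.8%
(not shown) | 36 | 1.6%
(not shown) | 35 | 1.6%
(not shown) | 35 | 1.6%
(not shown) | 35 | 1.6%
Other values (349) | 1695 | 76.7%

None
Value | Count | Frequency (%)
· | 1 | 100.0%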

소재지(도로명) (road-name address)
Text

Distinct: 350
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

Length

Max length: 56
Median length: 48
Mean length: 35.128852
Min length: 21

Characters and Unicode

Total characters: 12541
Distinct characters: 226
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 343
Unique (%): 96.1%

Sample

1st row: 부산광역시 사상구 광장로 17 (괘법동)
2nd row: 부산광역시 사상구 백양대로 916, 1동 127호 (모라동, 우성아파트상가)
3rd row: 부산광역시 사상구 사상로 221, 1층 (괘법동)
4th row: 부산광역시 사상구 낙동대로 790, 9층 901호 (엄궁동)
5th row: 부산광역시 사상구 가야대로284번길 12 (주례동)
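
In these sample addresses the neighbourhood (dong) appears as the first item of a parenthesised group. The sketch below pulls it out with a regular expression. The pattern is an assumption inferred from the rows shown in this report, not a general rule for Korean addresses, and the function name is hypothetical.

```python
import re
from collections import Counter

# Assumption: the dong is the first item inside a parenthesised group and
# ends in "동". Groups such as "(주)" do not match and are skipped.
DONG_RE = re.compile(r"\(([^,()]+동)[,)]")

def extract_dong(address):
    """Return the first parenthesised token ending in 동, or None."""
    match = DONG_RE.search(address)
    return match.group(1) if match else None

# Addresses copied from this report.
addresses = [
    "부산광역시 사상구 광장로 17 (괘법동)",
    "부산광역시 사상구 백양대로 916, 1동 127호 (모라동, 우성아파트상가)",
    "부산광역시 사상구 장인로 70 (주)켐코리아 3층 (학장동)",
]
dong_counts = Counter(extract_dong(a) for a in addresses)
print(dong_counts)
```

Grouping rows by the extracted value gives a per-neighbourhood count of businesses, which is easier to read than the raw word frequencies.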

Value | Count | Frequency (%)
부산광역시 | 357 | 14.8%
사상구 | 357 | 14.8%
주례동 | 85 | 3.5%
1층 | 66 | 2.7%
괘법동 | 58 | 2.4%
모라동 | 52 | 2.2%
엄궁동 | 45 | 1.9%
2층 | 42 | 1.7%
감전동 | 38 | 1.6%
백양대로 | 33 | 1.4%
Other values (591) | 1272 | 52.9%

Most occurring characters

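Value | Count | Frequency (%)
(space) | 2048 | 16.3%
(not shown) | 550 | 4.4%
1 | 514 | 4.1%
(not shown) | 447 | 3.6%
(not shown) | 421 | 3.4%
, | 412 | 3.3%
(not shown) | 388 | 3.1%
(not shown) | 380 | 3.0%
(not shown) | 374 | 3.0%
2 | 369 | 2.9%
Other values (216) | 6638 | 52.9%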

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6982 | 55.7%
Decimal Number | 2276 | 18.1%
Space Separator | 2048 | 16.3%
Other Punctuation | 415 | 3.3%
Open Punctuation | 359 | 2.9%
Close Punctuation | 359 | 2.9%
Dash Punctuation | 67 | 0.5%
Uppercase Letter | 27 | 0.2%
Lowercase Letter | 6 | < 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 550 | 7.9%
(not shown) | 447 | 6.4%
(not shown) | 421 | 6.0%
(not shown) | 388 | 5.6%
(not shown) | 380 | 5.4%
(not shown) | 374 | 5.4%
(not shown) | 366 | 5.2%
(not shown) | 362 | 5.2%
(not shown) | 361 | 5.2%
(not shown) | 360 | 5.2%
Other values (184) | 2973 | 42.6%

Decimal Number
Value | Count | Frequency (%)
1 | 514 | 22.6%
2 | 369 | 16.2%
0 | 342 | 15.0%
3 | 260 | 11.4%
4 | 198 | 8.7%
5 | 141 | 6.2%
7 | 123 | 5.4%
6 | 119 | 5.2%
8 | 114 | 5.0%
9 | 96 | 4.2%

Uppercase Letter
Value | Count | Frequency (%)
A | 12 | 44.4%
B | 8 | 29.6%
G | 2 | 7.4%
R | 1 | 3.7%
K | 1 | 3.7%
C | 1 | 3.7%
S | 1 | 3.7%
M | 1 | 3.7%

Lowercase Letter
Value | Count | Frequency (%)
e | 1 | 16.7%
r | 1 | 16.7%
a | 1 | 16.7%
u | 1 | 16.7%
q | 1 | 16.7%
s | 1 | 16.7%

Other Punctuation
Value | Count | Frequency (%)
, | 412 | 99.3%
. | 2 | 0.5%
@ | 1 | 0.2%

Space Separator
Value | Count | Frequency (%)
(space) | 2048 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 359 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 359 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 67 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6982 | 55.7%
Common | 5526 | 44.1%
Latin | 33 | 0.3%

Most frequent character per script

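Hangul
Value | Count | Frequency (%)
(not shown) | 550 | 7.9%
(not shown) | 447 | 6.4%
(not shown) | 421 | 6.0%
(not shown) | 388 | 5.6%
(not shown) | 380 | 5.4%
(not shown) | 374 | 5.4%
(not shown) | 366 | 5.2%
(not shown) | 362 | 5.2%
(not shown) | 361 | 5.2%
(not shown) | 360 | 5.2%
Other values (184) | 2973 | 42.6%

Common
Value | Count | Frequency (%)
(space) | 2048 | 37.1%
1 | 514 | 9.3%
, | 412 | 7.5%
2 | 369 | 6.7%
( | 359 | 6.5%
) | 359 | 6.5%
0 | 342 | 6.2%
3 | 260 | 4.7%
4 | 198 | 3.6%
5 | 141 | 2.6%
Other values (8) | 524 | 9.5%

Latin
Value | Count | Frequency (%)
A | 12 | 36.4%
B | 8 | 24.2%
G | 2 | 6.1%
R | 1 | 3.0%
K | 1 | 3.0%
C | 1 | 3.0%
S | 1 | 3.0%
e | 1 | 3.0%
r | 1 | 3.0%
a | 1 | 3.0%
Other values (4) | 4 | 12.1%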

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6982 | 55.7%
ASCII | 5559 | 44.3%

Most frequent character per block

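ASCII
Value | Count | Frequency (%)
(space) | 2048 | 36.8%
1 | 514 | 9.2%
, | 412 | 7.4%
2 | 369 | 6.6%
( | 359 | 6.5%
) | 359 | 6.5%
0 | 342 | 6.2%
3 | 260 | 4.7%
4 | 198 | 3.6%
5 | 141 | 2.5%
Other values (22) | 557 | 10.0%

Hangul
Value | Count | Frequency (%)
(not shown) | 550 | 7.9%
(not shown) | 447 | 6.4%
(not shown) | 421 | 6.0%
(not shown) | 388 | 5.6%
(not shown) | 380 | 5.4%
(not shown) | 374 | 5.4%
(not shown) | 366 | 5.2%
(not shown) | 362 | 5.2%
(not shown) | 361 | 5.2%
(not shown) | 360 | 5.2%
Other values (184) | 2973 | 42.6%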

소재지전화 (address phone number)
Text

MISSING

Distinct: 114
Distinct (%): 98.3%
Missing: 241
Missing (%): 67.5%
Memory size: 2.9 KiB
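
The percentages for this column use two different denominators. The figures are consistent with the missing share being taken over all rows and the distinct and unique shares over non-missing rows only. A minimal sketch under that reading, with illustrative variable names:

```python
# Figures reported for 소재지전화.
n_rows = 357
n_missing = 241
n_distinct = 114
n_unique = 112   # values that occur exactly once

n_present = n_rows - n_missing                 # rows that carry a phone number

missing_pct = 100 * n_missing / n_rows         # relative to all rows
distinct_pct = 100 * n_distinct / n_present    # relative to non-missing rows
unique_pct = 100 * n_unique / n_present

print(f"present={n_present}, missing={missing_pct:.1f}%, "
      f"distinct={distinct_pct:.1f}%, unique={unique_pct:.1f}%")
```

The gap between 114 distinct and 112 unique values corresponds to the two phone numbers that each appear twice in the value counts.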

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 1392
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 112
Unique (%): 96.6%

Sample

1st row: 051-329-1234
2nd row: 051-327-8811
3rd row: 051-327-3016
4th row: 051-319-9168
5th row: 051-301-2313
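
Every non-missing value is 12 characters long and consists only of digits and hyphens, which suggests a single `051-XXX-XXXX` shape. The sketch below checks values against that shape. The pattern is inferred from the rows and statistics shown here and is an assumption, and the function name is hypothetical.

```python
import re

# Assumed shape: area code 051, then a 3-digit and a 4-digit group,
# separated by hyphens (12 characters in total).
PHONE_RE = re.compile(r"051-[0-9]{3}-[0-9]{4}")

def is_expected_phone(value):
    """True when value is a string that matches the assumed shape in full."""
    return isinstance(value, str) and PHONE_RE.fullmatch(value) is not None

# Two values from the report, a missing entry, and an unhyphenated variant.
samples = ["051-329-1234", "051-327-8811", None, "0513291234"]
results = [is_expected_phone(v) for v in samples]
print(results)
```

A check like this separates missing entries from malformed ones, which matters here because about two thirds of the column is empty.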

Value | Count | Frequency (%)
051-323-7577 | 2 | 1.7%
051-995-6135 | 2 | 1.7%
051-503-9824 | 1 | 0.9%
051-313-3321 | 1 | 0.9%
051-311-5132 | 1 | 0.9%
051-323-4277 | 1 | 0.9%
051-301-4333 | 1 | 0.9%
051-517-0325 | 1 | 0.9%
051-302-9949 | 1 | 0.9%
051-322-5870 | 1 | 0.9%
Other values (104) | 104 | 89.7%

Most occurring characters

Value | Count | Frequency (%)
1 | 245 | 17.6%
- | 232 | 16.7%
0 | 199 | 14.3%
5 | 188 | 13.5%
3 | 168 | 12.1%
2 | 97 | 7.0%
7 | 69 | 5.0%
8 | 52 | 3.7%
9 | 49 | 3.5%
4 | 49 | 3.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1160 | 83.3%
Dash Punctuation | 232 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 245 | 21.1%
0 | 199 | 17.2%
5 | 188 | 16.2%
3 | 168 | 14.5%
2 | 97 | 8.4%
7 | 69 | 5.9%
8 | 52 | 4.5%
9 | 49 | 4.2%
4 | 49 | 4.2%
6 | 44 | 3.8%

Dash Punctuation
Value | Count | Frequency (%)
- | 232 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1392 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
1 | 245 | 17.6%
- | 232 | 16.7%
0 | 199 | 14.3%
5 | 188 | 13.5%
3 | 168 | 12.1%
2 | 97 | 7.0%
7 | 69 | 5.0%
8 | 52 | 3.7%
9 | 49 | 3.5%
4 | 49 | 3.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1392 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 245 | 17.6%
- | 232 | 16.7%
0 | 199 | 14.3%
5 | 188 | 13.5%
3 | 168 | 12.1%
2 | 97 | 7.0%
7 | 69 | 5.0%
8 | 52 | 3.7%
9 | 49 | 3.5%
4 | 49 | 3.5%

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

Index | 업종명 | 업소명 | 소재지(도로명) | 소재지전화
0 | 건강기능식품일반판매업 | (주)이마트사상점 | 부산광역시 사상구 광장로 17 (괘법동) | 051-329-1234
1 | 건강기능식품일반판매업 | 정관장홍삼모라점 | 부산광역시 사상구 백양대로 916, 1동 127호 (모라동, 우성아파트상가) | <NA>
2 | 건강기능식품일반판매업 | 정관장홍삼 괘법 전시판매장 | 부산광역시 사상구 사상로 221, 1층 (괘법동) | <NA>
3 | 건강기능식품일반판매업 | 유니베라 | 부산광역시 사상구 낙동대로 790, 9층 901호 (엄궁동) | <NA>
4 | 건강기능식품일반판매업 | 지에스리테일(주례점) | 부산광역시 사상구 가야대로284번길 12 (주례동) | 051-327-8811
5 | 건강기능식품일반판매업 | 쥬단학주례점 | 부산광역시 사상구 가야대로318번길 14 (주례동) | 051-327-3016
6 | 건강기능식품일반판매업 | (주)홈플러스서부산점 | 부산광역시 사상구 광장로 7 (괘법동) | 051-319-9168
7 | 건강기능식품일반판매업 | 유니베라 | 부산광역시 사상구 사상로 512 (모라동, 모라 리버빌 상가 A-203) | 051-301-2313
8 | 건강기능식품일반판매업 | 유 성 | 부산광역시 사상구 광장로56번길 60-14 (괘법동) | <NA>
9 | 건강기능식품일반판매업 | 암웨이 | 부산광역시 사상구 백양대로 883, 105동 902호 (모라동, 동원아파트) | <NA>

Index | 업종명 | 업소명 | 소재지(도로명) | 소재지전화
347 | 건강기능식품일반판매업 | 서진헤어아트 | 부산광역시 사상구 백양대로 1000-5, 1층 (모라동) | 051-312-0987
348 | 건강기능식품일반판매업 | 쓰리원컴퍼니 | 부산광역시 사상구 사상로146번길 5, 1층 (감전동) | 051-851-4054
349 | 건강기능식품유통전문판매업 | 허브플렛폼 | 부산광역시 사상구 학감대로222번길 83 (주례동) | <NA>
350 | 건강기능식품유통전문판매업 | (주)프라임오라 | 부산광역시 사상구 주례로 101, 1동 302호 (주례동) | 051-714-3712
351 | 건강기능식품유통전문판매업 | (주)내츄럴바이오 | 부산광역시 사상구 가야대로 105, 2층 (감전동) | 051-515-9659
352 | 건강기능식품유통전문판매업 | (주)영운코리아 | 부산광역시 사상구 장인로 70 (주)켐코리아 3층 (학장동) | <NA>
353 | 건강기능식품유통전문판매업 | 넷닥터 | 부산광역시 사상구 새벽로215번길 7, 2층 (괘법동) | <NA>
354 | 건강기능식품유통전문판매업 | 주식회사 투에이치바이오 | 부산광역시 사상구 광장로 62, 406호 (괘법동) | <NA>
355 | 건강기능식품유통전문판매업 | 미에르(주) | 부산광역시 사상구 학감대로 252, 7층 (감전동) | <NA>
356 | 건강기능식품유통전문판매업 | 미트리(주) | 부산광역시 사상구 학감대로 252, 7층 (감전동) | 051-995-6135