Overview

Dataset statistics

Number of variables4
Number of observations462
Missing cells311
Missing cells (%)16.8%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory14.6 KiB
Average record size in memory32.3 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시남구건강기능식품판매업소현황_20230510
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3045987

Alerts

업종명 has constant value ""Constant
Dataset has 1 (0.2%) duplicate rowsDuplicates
소재지전화 has 311 (67.3%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:37:06.346124
Analysis finished2023-12-10 16:37:07.083673
Duration0.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
건강기능식품일반판매업
462 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건강기능식품일반판매업
2nd row건강기능식품일반판매업
3rd row건강기능식품일반판매업
4th row건강기능식품일반판매업
5th row건강기능식품일반판매업

Common Values

ValueCountFrequency (%)
건강기능식품일반판매업 462
100.0%

Length

2023-12-11T01:37:07.190669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:37:07.316217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건강기능식품일반판매업 462
100.0%
Distinct459
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-11T01:37:07.700189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length6.969697
Min length1

Characters and Unicode

Total characters3220
Distinct characters448
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique456 ?
Unique (%)98.7%

Sample

1st row(주)이마트문현점
2nd row정관장홍삼용호점
3rd row마임부산남부지사
4th row(주)서원유통탑마트감만점
5th row남부산농업협동조합 석포지점
ValueCountFrequency (%)
대연점 8
 
1.3%
주식회사 8
 
1.3%
인셀덤 6
 
1.0%
세븐일레븐 6
 
1.0%
gs25 5
 
0.8%
용호점 5
 
0.8%
헬스케어 4
 
0.7%
문현점 3
 
0.5%
시너지 3
 
0.5%
홈플러스 3
 
0.5%
Other values (539) 562
91.7%
2023-12-11T01:37:08.331497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
153
 
4.8%
99
 
3.1%
90
 
2.8%
85
 
2.6%
( 61
 
1.9%
) 61
 
1.9%
54
 
1.7%
52
 
1.6%
49
 
1.5%
46
 
1.4%
Other values (438) 2470
76.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2712
84.2%
Space Separator 153
 
4.8%
Uppercase Letter 105
 
3.3%
Lowercase Letter 85
 
2.6%
Open Punctuation 61
 
1.9%
Close Punctuation 61
 
1.9%
Decimal Number 35
 
1.1%
Other Punctuation 8
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
99
 
3.7%
90
 
3.3%
85
 
3.1%
54
 
2.0%
52
 
1.9%
49
 
1.8%
46
 
1.7%
45
 
1.7%
44
 
1.6%
39
 
1.4%
Other values (381) 2109
77.8%
Uppercase Letter
ValueCountFrequency (%)
S 13
 
12.4%
G 10
 
9.5%
R 7
 
6.7%
N 7
 
6.7%
U 6
 
5.7%
O 6
 
5.7%
B 6
 
5.7%
T 5
 
4.8%
E 5
 
4.8%
K 5
 
4.8%
Other values (13) 35
33.3%
Lowercase Letter
ValueCountFrequency (%)
e 14
16.5%
i 6
 
7.1%
o 6
 
7.1%
a 6
 
7.1%
c 6
 
7.1%
t 6
 
7.1%
n 5
 
5.9%
y 5
 
5.9%
b 4
 
4.7%
d 4
 
4.7%
Other values (10) 23
27.1%
Decimal Number
ValueCountFrequency (%)
2 10
28.6%
5 9
25.7%
1 4
 
11.4%
3 3
 
8.6%
4 3
 
8.6%
6 2
 
5.7%
8 2
 
5.7%
9 2
 
5.7%
Other Punctuation
ValueCountFrequency (%)
' 3
37.5%
, 3
37.5%
& 2
25.0%
Space Separator
ValueCountFrequency (%)
153
100.0%
Open Punctuation
ValueCountFrequency (%)
( 61
100.0%
Close Punctuation
ValueCountFrequency (%)
) 61
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2712
84.2%
Common 318
 
9.9%
Latin 190
 
5.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
99
 
3.7%
90
 
3.3%
85
 
3.1%
54
 
2.0%
52
 
1.9%
49
 
1.8%
46
 
1.7%
45
 
1.7%
44
 
1.6%
39
 
1.4%
Other values (381) 2109
77.8%
Latin
ValueCountFrequency (%)
e 14
 
7.4%
S 13
 
6.8%
G 10
 
5.3%
R 7
 
3.7%
N 7
 
3.7%
i 6
 
3.2%
o 6
 
3.2%
U 6
 
3.2%
O 6
 
3.2%
a 6
 
3.2%
Other values (33) 109
57.4%
Common
ValueCountFrequency (%)
153
48.1%
( 61
 
19.2%
) 61
 
19.2%
2 10
 
3.1%
5 9
 
2.8%
1 4
 
1.3%
' 3
 
0.9%
3 3
 
0.9%
4 3
 
0.9%
, 3
 
0.9%
Other values (4) 8
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2712
84.2%
ASCII 508
 
15.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
153
30.1%
( 61
 
12.0%
) 61
 
12.0%
e 14
 
2.8%
S 13
 
2.6%
G 10
 
2.0%
2 10
 
2.0%
5 9
 
1.8%
R 7
 
1.4%
N 7
 
1.4%
Other values (47) 163
32.1%
Hangul
ValueCountFrequency (%)
99
 
3.7%
90
 
3.3%
85
 
3.1%
54
 
2.0%
52
 
1.9%
49
 
1.8%
46
 
1.7%
45
 
1.7%
44
 
1.6%
39
 
1.4%
Other values (381) 2109
77.8%
Distinct442
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-11T01:37:08.728811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length45
Mean length36.261905
Min length9

Characters and Unicode

Total characters16753
Distinct characters274
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique433 ?
Unique (%)93.7%

Sample

1st row부산광역시 남구 전포대로91번길 47 (문현동)
2nd row부산광역시 남구 용호로 100-1 (용호동)
3rd row부산광역시 남구 용호로 158 (용호동)
4th row부산광역시 남구 홍곡로 3 (감만동)
5th row부산광역시 남구 석포로 131 (대연동)
ValueCountFrequency (%)
부산광역시 459
 
14.0%
남구 459
 
14.0%
대연동 207
 
6.3%
용호동 125
 
3.8%
수영로 89
 
2.7%
문현동 73
 
2.2%
1층 51
 
1.6%
2층 48
 
1.5%
분포로 43
 
1.3%
엘지메트로시티 27
 
0.8%
Other values (745) 1709
51.9%
2023-12-11T01:37:09.340929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2828
 
16.9%
1 876
 
5.2%
686
 
4.1%
, 612
 
3.7%
502
 
3.0%
2 497
 
3.0%
494
 
2.9%
491
 
2.9%
489
 
2.9%
( 481
 
2.9%
Other values (264) 8797
52.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8977
53.6%
Decimal Number 3197
 
19.1%
Space Separator 2828
 
16.9%
Other Punctuation 612
 
3.7%
Open Punctuation 481
 
2.9%
Close Punctuation 481
 
2.9%
Uppercase Letter 95
 
0.6%
Dash Punctuation 62
 
0.4%
Lowercase Letter 13
 
0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
686
 
7.6%
502
 
5.6%
494
 
5.5%
491
 
5.5%
489
 
5.4%
473
 
5.3%
464
 
5.2%
462
 
5.1%
459
 
5.1%
446
 
5.0%
Other values (222) 4011
44.7%
Uppercase Letter
ValueCountFrequency (%)
C 14
14.7%
A 14
14.7%
E 9
9.5%
I 9
9.5%
S 7
7.4%
F 7
7.4%
B 6
 
6.3%
G 5
 
5.3%
L 4
 
4.2%
K 4
 
4.2%
Other values (8) 16
16.8%
Decimal Number
ValueCountFrequency (%)
1 876
27.4%
2 497
15.5%
0 470
14.7%
3 349
 
10.9%
5 211
 
6.6%
4 207
 
6.5%
6 178
 
5.6%
9 150
 
4.7%
8 136
 
4.3%
7 123
 
3.8%
Lowercase Letter
ValueCountFrequency (%)
l 4
30.8%
i 3
23.1%
e 2
15.4%
s 2
15.4%
v 1
 
7.7%
w 1
 
7.7%
Math Symbol
ValueCountFrequency (%)
> 3
50.0%
< 3
50.0%
Space Separator
ValueCountFrequency (%)
2828
100.0%
Other Punctuation
ValueCountFrequency (%)
, 612
100.0%
Open Punctuation
ValueCountFrequency (%)
( 481
100.0%
Close Punctuation
ValueCountFrequency (%)
) 481
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 62
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8977
53.6%
Common 7667
45.8%
Latin 109
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
686
 
7.6%
502
 
5.6%
494
 
5.5%
491
 
5.5%
489
 
5.4%
473
 
5.3%
464
 
5.2%
462
 
5.1%
459
 
5.1%
446
 
5.0%
Other values (222) 4011
44.7%
Latin
ValueCountFrequency (%)
C 14
12.8%
A 14
12.8%
E 9
 
8.3%
I 9
 
8.3%
S 7
 
6.4%
F 7
 
6.4%
B 6
 
5.5%
G 5
 
4.6%
l 4
 
3.7%
L 4
 
3.7%
Other values (15) 30
27.5%
Common
ValueCountFrequency (%)
2828
36.9%
1 876
 
11.4%
, 612
 
8.0%
2 497
 
6.5%
( 481
 
6.3%
) 481
 
6.3%
0 470
 
6.1%
3 349
 
4.6%
5 211
 
2.8%
4 207
 
2.7%
Other values (7) 655
 
8.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8977
53.6%
ASCII 7775
46.4%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2828
36.4%
1 876
 
11.3%
, 612
 
7.9%
2 497
 
6.4%
( 481
 
6.2%
) 481
 
6.2%
0 470
 
6.0%
3 349
 
4.5%
5 211
 
2.7%
4 207
 
2.7%
Other values (31) 763
 
9.8%
Hangul
ValueCountFrequency (%)
686
 
7.6%
502
 
5.6%
494
 
5.5%
491
 
5.5%
489
 
5.4%
473
 
5.3%
464
 
5.2%
462
 
5.1%
459
 
5.1%
446
 
5.0%
Other values (222) 4011
44.7%
Number Forms
ValueCountFrequency (%)
1
100.0%

소재지전화
Text

MISSING 

Distinct145
Distinct (%)96.0%
Missing311
Missing (%)67.3%
Memory size3.7 KiB
2023-12-11T01:37:09.690015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.960265
Min length12

Characters and Unicode

Total characters2108
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique140 ?
Unique (%)92.7%

Sample

1st row 051- 609-1234
2nd row 051- 612-2304
3rd row 051- 627-2265
4th row 051- 624-2975
5th row 051- 627-6001
ValueCountFrequency (%)
051 133
35.1%
611 9
 
2.4%
070 9
 
2.4%
621 6
 
1.6%
622 5
 
1.3%
633 5
 
1.3%
625 5
 
1.3%
612 5
 
1.3%
627 4
 
1.1%
634 3
 
0.8%
Other values (179) 195
51.5%
2023-12-11T01:37:10.243819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 302
14.3%
283
13.4%
0 256
12.1%
1 256
12.1%
5 237
11.2%
6 190
9.0%
2 155
7.4%
7 116
 
5.5%
3 103
 
4.9%
4 79
 
3.7%
Other values (2) 131
6.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1523
72.2%
Dash Punctuation 302
 
14.3%
Space Separator 283
 
13.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 256
16.8%
1 256
16.8%
5 237
15.6%
6 190
12.5%
2 155
10.2%
7 116
7.6%
3 103
6.8%
4 79
 
5.2%
8 71
 
4.7%
9 60
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 302
100.0%
Space Separator
ValueCountFrequency (%)
283
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2108
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 302
14.3%
283
13.4%
0 256
12.1%
1 256
12.1%
5 237
11.2%
6 190
9.0%
2 155
7.4%
7 116
 
5.5%
3 103
 
4.9%
4 79
 
3.7%
Other values (2) 131
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2108
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 302
14.3%
283
13.4%
0 256
12.1%
1 256
12.1%
5 237
11.2%
6 190
9.0%
2 155
7.4%
7 116
 
5.5%
3 103
 
4.9%
4 79
 
3.7%
Other values (2) 131
6.2%

Missing values

2023-12-11T01:37:06.893225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:37:07.025453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(도로명)소재지전화
0건강기능식품일반판매업(주)이마트문현점부산광역시 남구 전포대로91번길 47 (문현동)051- 609-1234
1건강기능식품일반판매업정관장홍삼용호점부산광역시 남구 용호로 100-1 (용호동)051- 612-2304
2건강기능식품일반판매업마임부산남부지사부산광역시 남구 용호로 158 (용호동)051- 627-2265
3건강기능식품일반판매업(주)서원유통탑마트감만점부산광역시 남구 홍곡로 3 (감만동)<NA>
4건강기능식품일반판매업남부산농업협동조합 석포지점부산광역시 남구 석포로 131 (대연동)051- 624-2975
5건강기능식품일반판매업남부산농협부산광역시 남구 수영로 251 (대연동)051- 627-6001
6건강기능식품일반판매업남부산농업협동조합 감만지점부산광역시 남구 우암로 58, 1층 (감만동)051- 643-3100
7건강기능식품일반판매업유니베라 남구대리점부산광역시 남구 수영로 237 (대연동,대연메디타워3층)051- 611-8777
8건강기능식품일반판매업(주)이마트문현점부산광역시 남구 전포대로91번길 47 (문현동)051- 609-1234
9건강기능식품일반판매업민현주 여성의원부산광역시 남구 수영로 27 (문현동,금호빌딩 201호)051- 638-8275
업종명업소명소재지(도로명)소재지전화
452건강기능식품일반판매업에스에이치 컴퍼니부산광역시 남구 수영로325번길 61, 102동 1503호 (대연동, 대연 롯데캐슬)<NA>
453건강기능식품일반판매업오키오키몰부산광역시 남구 천제등로28번길 50, 101동 1202호 (대연동, 대연동 현대아파트)<NA>
454건강기능식품일반판매업진영상사부산광역시 남구 수영로 135, 126동 102호 (대연동, 대연롯데캐슬레전드)<NA>
455건강기능식품일반판매업브로커머스부산광역시 남구 동명로170번길 93, 1동 1103호 (용호동, 용호동일타운)<NA>
456건강기능식품일반판매업약손건강부산광역시 남구 분포로 61, 빌리브센트로 A동 2층 A215호 (용호동)051- 621-7501
457건강기능식품일반판매업헬시토푸부산광역시 남구 전포대로 90, 602호 (문현동)<NA>
458건강기능식품일반판매업똥글뱅이부산광역시 남구 수영로 298, 산암빌딩 10층 1001호 (대연동)<NA>
459건강기능식품일반판매업늘봄부산광역시 남구 수영로208번길 15, 201호 (대연동)<NA>
460건강기능식품일반판매업점핑다이어트부산광역시 남구 지게골로 46, 4층 (문현동)<NA>
461건강기능식품일반판매업푸른부산광역시 남구 수영로 233, 민석빌딩 9층 46호 (대연동)<NA>

Duplicate rows

Most frequently occurring

업종명업소명소재지(도로명)소재지전화# duplicates
0건강기능식품일반판매업(주)이마트문현점부산광역시 남구 전포대로91번길 47 (문현동)051- 609-12342