Overview

Dataset statistics

Number of variables: 3
Number of observations: 504
Missing cells: 286
Missing cells (%): 18.9%
Duplicate rows: 1
Duplicate rows (%): 0.2%
Total size in memory: 11.9 KiB
Average record size in memory: 24.3 B
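
The percentages in this table follow from the raw counts. The Python sketch below reproduces that arithmetic. It assumes that missing cells are measured against all cells (variables × observations) and duplicate rows against the number of observations. The variable names are illustrative and are not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 3
n_observations = 504
missing_cells = 286
duplicate_rows = 1

# Assumption: the missing-cell share is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: the duplicate share is taken over the number of rows.
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Total cells: {total_cells}")            # Total cells: 1512
print(f"Missing cells: {missing_pct:.1f}%")     # Missing cells: 18.9%
print(f"Duplicate rows: {duplicate_pct:.1f}%")  # Duplicate rows: 0.2%
```

Both results match the reported 18.9% and 0.2%.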

Variable types

Text: 3

Dataset

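Description: Busan Metropolitan City, Sasang-gu, status of medical device retailers, 2023-07-06 (부산광역시_사상구_의료기기판매업소현황_20230706)
Author: Sasang-gu, Busan Metropolitan City (부산광역시 사상구)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3078763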

Alerts

Dataset has 1 (0.2%) duplicate rows (Duplicates)
영업소전화번호 has 286 (56.7%) missing values (Missing)

Reproduction

Analysis started: 2023-12-10 16:16:39.357572
Analysis finished: 2023-12-10 16:16:39.843795
Duration: 0.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

영업소명 (business name)
Text

Distinct: 499
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 4.1 KiB

Length

Max length: 22
Median length: 17
Mean length: 7.8571429
Min length: 2

Characters and Unicode

Total characters: 3960
Distinct characters: 381
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
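
As an illustration of how such character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not necessarily how the report itself was computed. The function name `category_counts` is hypothetical, and the input is the five sample values listed for this variable. `unicodedata.category` returns two-letter codes, so the report's "Other Letter" corresponds to `Lo` and "Space Separator" to `Zs`.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count the Unicode general category of every character in the given strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

# The five sample values shown for this variable.
sample = ["대성종합상사", "힐스템 모라점", "우신메디칼", "우성메디칼", "의족보조기센터"]
counts = category_counts(sample)
print(counts["Lo"], counts["Zs"])  # 29 1
```

The standard library exposes categories but not scripts or blocks, so those breakdowns would need a separate Unicode data source.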

Unique

Unique: 494
Unique (%): 98.0%
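
The report lists "Distinct" and "Unique" separately. The sketch below assumes that "Distinct" counts the different values present and "Unique" counts the values that occur exactly once, a reading that is consistent with the figures reported here. The function name and the input values are made up for illustration.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique) counts, ignoring missing values (None)."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)                              # different values seen
    unique = sum(1 for n in counts.values() if n == 1)  # values seen exactly once
    return distinct, unique

# Made-up values, not taken from the dataset.
names = ["A상사", "B의료기", "B의료기", "C메디칼", None]
print(distinct_and_unique(names))  # (3, 2)
```

Under that reading, 499 distinct and 494 unique values in 504 rows mean that 5 values repeat and together account for the remaining 10 rows.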

Sample

1st row: 대성종합상사
2nd row: 힐스템 모라점
3rd row: 우신메디칼
4th row: 우성메디칼
5th row: 의족보조기센터

Value | Count | Frequency (%)
세븐일레븐 | 23 | 3.4%
씨유 | 22 | 3.3%
gs25 | 19 | 2.8%
주식회사 | 19 | 2.8%
이마트24 | 6 | 0.9%
cu | 5 | 0.7%
지에스25 | 5 | 0.7%
주)아성다이소 | 4 | 0.6%
korea | 3 | 0.4%
메디칼 | 3 | 0.4%
Other values (543) | 565 | 83.8%

Most occurring characters

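Value | Count | Frequency (%)
(Hangul character, not preserved) | 176 | 4.4%
(space) | 170 | 4.3%
(Hangul character, not preserved) | 103 | 2.6%
(Hangul character, not preserved) | 100 | 2.5%
(Hangul character, not preserved) | 95 | 2.4%
(Hangul character, not preserved) | 91 | 2.3%
(Hangul character, not preserved) | 84 | 2.1%
(Hangul character, not preserved) | 82 | 2.1%
) | 75 | 1.9%
( | 74 | 1.9%
Other values (371) | 2910 | 73.5%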

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3263 | 82.4%
Uppercase Letter | 207 | 5.2%
Space Separator | 170 | 4.3%
Decimal Number | 120 | 3.0%
Close Punctuation | 75 | 1.9%
Open Punctuation | 74 | 1.9%
Lowercase Letter | 30 | 0.8%
Other Punctuation | 11 | 0.3%
Other Symbol | 8 | 0.2%
Connector Punctuation | 2 | 0.1%

Most frequent character per category

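Other Letter
Value | Count | Frequency (%)
(Hangul character, not preserved) | 176 | 5.4%
(Hangul character, not preserved) | 103 | 3.2%
(Hangul character, not preserved) | 100 | 3.1%
(Hangul character, not preserved) | 95 | 2.9%
(Hangul character, not preserved) | 91 | 2.8%
(Hangul character, not preserved) | 84 | 2.6%
(Hangul character, not preserved) | 82 | 2.5%
(Hangul character, not preserved) | 65 | 2.0%
(Hangul character, not preserved) | 64 | 2.0%
(Hangul character, not preserved) | 61 | 1.9%
Other values (320) | 2342 | 71.8%

Uppercase Letter
Value | Count | Frequency (%)
S | 50 | 24.2%
G | 46 | 22.2%
C | 25 | 12.1%
U | 22 | 10.6%
A | 8 | 3.9%
B | 7 | 3.4%
E | 5 | 2.4%
K | 5 | 2.4%
R | 5 | 2.4%
M | 5 | 2.4%
Other values (12) | 29 | 14.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 6 | 20.0%
a | 5 | 16.7%
l | 3 | 10.0%
r | 3 | 10.0%
y | 2 | 6.7%
c | 2 | 6.7%
d | 2 | 6.7%
i | 1 | 3.3%
o | 1 | 3.3%
h | 1 | 3.3%
Other values (4) | 4 | 13.3%

Decimal Number
Value | Count | Frequency (%)
2 | 57 | 47.5%
5 | 46 | 38.3%
4 | 7 | 5.8%
1 | 4 | 3.3%
3 | 4 | 3.3%
0 | 2 | 1.7%

Other Punctuation
Value | Count | Frequency (%)
. | 6 | 54.5%
· | 2 | 18.2%
& | 2 | 18.2%
, | 1 | 9.1%

Space Separator
Value | Count | Frequency (%)
(space) | 170 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 75 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 74 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(symbol, not preserved) | 8 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 2 | 100.0%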

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3271 | 82.6%
Common | 452 | 11.4%
Latin | 237 | 6.0%

Most frequent character per script

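Hangul
Value | Count | Frequency (%)
(Hangul character, not preserved) | 176 | 5.4%
(Hangul character, not preserved) | 103 | 3.1%
(Hangul character, not preserved) | 100 | 3.1%
(Hangul character, not preserved) | 95 | 2.9%
(Hangul character, not preserved) | 91 | 2.8%
(Hangul character, not preserved) | 84 | 2.6%
(Hangul character, not preserved) | 82 | 2.5%
(Hangul character, not preserved) | 65 | 2.0%
(Hangul character, not preserved) | 64 | 2.0%
(Hangul character, not preserved) | 61 | 1.9%
Other values (321) | 2350 | 71.8%

Latin
Value | Count | Frequency (%)
S | 50 | 21.1%
G | 46 | 19.4%
C | 25 | 10.5%
U | 22 | 9.3%
A | 8 | 3.4%
B | 7 | 3.0%
e | 6 | 2.5%
E | 5 | 2.1%
K | 5 | 2.1%
R | 5 | 2.1%
Other values (26) | 58 | 24.5%

Common
Value | Count | Frequency (%)
(space) | 170 | 37.6%
) | 75 | 16.6%
( | 74 | 16.4%
2 | 57 | 12.6%
5 | 46 | 10.2%
4 | 7 | 1.5%
. | 6 | 1.3%
1 | 4 | 0.9%
3 | 4 | 0.9%
_ | 2 | 0.4%
Other values (4) | 7 | 1.5%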

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3263 | 82.4%
ASCII | 687 | 17.3%
None | 10 | 0.3%

Most frequent character per block

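Hangul
Value | Count | Frequency (%)
(Hangul character, not preserved) | 176 | 5.4%
(Hangul character, not preserved) | 103 | 3.2%
(Hangul character, not preserved) | 100 | 3.1%
(Hangul character, not preserved) | 95 | 2.9%
(Hangul character, not preserved) | 91 | 2.8%
(Hangul character, not preserved) | 84 | 2.6%
(Hangul character, not preserved) | 82 | 2.5%
(Hangul character, not preserved) | 65 | 2.0%
(Hangul character, not preserved) | 64 | 2.0%
(Hangul character, not preserved) | 61 | 1.9%
Other values (320) | 2342 | 71.8%

ASCII
Value | Count | Frequency (%)
(space) | 170 | 24.7%
) | 75 | 10.9%
( | 74 | 10.8%
2 | 57 | 8.3%
S | 50 | 7.3%
5 | 46 | 6.7%
G | 46 | 6.7%
C | 25 | 3.6%
U | 22 | 3.2%
A | 8 | 1.2%
Other values (39) | 114 | 16.6%

None
Value | Count | Frequency (%)
(symbol, not preserved) | 8 | 80.0%
· | 2 | 20.0%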

영업소소재지(도로명) (business address, road-name format)
Text

Distinct: 486
Distinct (%): 96.4%
Missing: 0
Missing (%): 0.0%
Memory size: 4.1 KiB

Length

Max length: 61
Median length: 47
Mean length: 32.789683
Min length: 21

Characters and Unicode

Total characters: 16526
Distinct characters: 240
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 470
Unique (%): 93.3%

Sample

1st row: 부산광역시 사상구 괘감로 38, 한림도매상가 101호 (감전동)
2nd row: 부산광역시 사상구 백양대로 937, 3층 (모라동)
3rd row: 부산광역시 사상구 괘감로 37, 산업용품유통상가 21동 228호 (괘법동)
4th row: 부산광역시 사상구 새벽로 131, 3동 229호 (감전동)
5th row: 부산광역시 사상구 백양대로768번길 5, 1층 (덕포동)

Value | Count | Frequency (%)
부산광역시 | 504 | 15.5%
사상구 | 503 | 15.4%
1층 | 134 | 4.1%
괘법동 | 123 | 3.8%
감전동 | 111 | 3.4%
주례동 | 103 | 3.2%
모라동 | 55 | 1.7%
사상로 | 42 | 1.3%
새벽로 | 41 | 1.3%
학장동 | 36 | 1.1%
Other values (675) | 1609 | 49.3%

Most occurring characters

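Value | Count | Frequency (%)
(space) | 2758 | 16.7%
(Hangul character, not preserved) | 704 | 4.3%
(Hangul character, not preserved) | 679 | 4.1%
1 | 654 | 4.0%
(Hangul character, not preserved) | 599 | 3.6%
(Hangul character, not preserved) | 590 | 3.6%
(Hangul character, not preserved) | 560 | 3.4%
(Hangul character, not preserved) | 554 | 3.4%
(Hangul character, not preserved) | 539 | 3.3%
(Hangul character, not preserved) | 511 | 3.1%
Other values (230) | 8378 | 50.7%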

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9561 | 57.9%
Space Separator | 2758 | 16.7%
Decimal Number | 2631 | 15.9%
Close Punctuation | 504 | 3.0%
Open Punctuation | 504 | 3.0%
Other Punctuation | 472 | 2.9%
Dash Punctuation | 49 | 0.3%
Uppercase Letter | 37 | 0.2%
Lowercase Letter | 7 | < 0.1%
Math Symbol | 3 | < 0.1%

Most frequent character per category

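Other Letter
Value | Count | Frequency (%)
(Hangul character, not preserved) | 704 | 7.4%
(Hangul character, not preserved) | 679 | 7.1%
(Hangul character, not preserved) | 599 | 6.3%
(Hangul character, not preserved) | 590 | 6.2%
(Hangul character, not preserved) | 560 | 5.9%
(Hangul character, not preserved) | 554 | 5.8%
(Hangul character, not preserved) | 539 | 5.6%
(Hangul character, not preserved) | 511 | 5.3%
(Hangul character, not preserved) | 507 | 5.3%
(Hangul character, not preserved) | 505 | 5.3%
Other values (198) | 3813 | 39.9%

Decimal Number
Value | Count | Frequency (%)
1 | 654 | 24.9%
2 | 388 | 14.7%
3 | 368 | 14.0%
0 | 268 | 10.2%
4 | 196 | 7.4%
7 | 184 | 7.0%
6 | 154 | 5.9%
5 | 150 | 5.7%
8 | 137 | 5.2%
9 | 132 | 5.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 11 | 29.7%
A | 7 | 18.9%
G | 6 | 16.2%
D | 3 | 8.1%
E | 3 | 8.1%
S | 3 | 8.1%
L | 2 | 5.4%
C | 2 | 5.4%

Lowercase Letter
Value | Count | Frequency (%)
e | 2 | 28.6%
s | 1 | 14.3%
k | 1 | 14.3%
r | 1 | 14.3%
t | 1 | 14.3%
n | 1 | 14.3%

Other Punctuation
Value | Count | Frequency (%)
, | 458 | 97.0%
· | 13 | 2.8%
@ | 1 | 0.2%

Space Separator
Value | Count | Frequency (%)
(space) | 2758 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 504 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 504 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 49 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 3 | 100.0%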

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9561 | 57.9%
Common | 6921 | 41.9%
Latin | 44 | 0.3%

Most frequent character per script

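Hangul
Value | Count | Frequency (%)
(Hangul character, not preserved) | 704 | 7.4%
(Hangul character, not preserved) | 679 | 7.1%
(Hangul character, not preserved) | 599 | 6.3%
(Hangul character, not preserved) | 590 | 6.2%
(Hangul character, not preserved) | 560 | 5.9%
(Hangul character, not preserved) | 554 | 5.8%
(Hangul character, not preserved) | 539 | 5.6%
(Hangul character, not preserved) | 511 | 5.3%
(Hangul character, not preserved) | 507 | 5.3%
(Hangul character, not preserved) | 505 | 5.3%
Other values (198) | 3813 | 39.9%

Common
Value | Count | Frequency (%)
(space) | 2758 | 39.8%
1 | 654 | 9.4%
) | 504 | 7.3%
( | 504 | 7.3%
, | 458 | 6.6%
2 | 388 | 5.6%
3 | 368 | 5.3%
0 | 268 | 3.9%
4 | 196 | 2.8%
7 | 184 | 2.7%
Other values (8) | 639 | 9.2%

Latin
Value | Count | Frequency (%)
B | 11 | 25.0%
A | 7 | 15.9%
G | 6 | 13.6%
D | 3 | 6.8%
E | 3 | 6.8%
S | 3 | 6.8%
L | 2 | 4.5%
C | 2 | 4.5%
e | 2 | 4.5%
s | 1 | 2.3%
Other values (4) | 4 | 9.1%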

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9561 | 57.9%
ASCII | 6952 | 42.1%
None | 13 | 0.1%

Most frequent character per block

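ASCII
Value | Count | Frequency (%)
(space) | 2758 | 39.7%
1 | 654 | 9.4%
) | 504 | 7.2%
( | 504 | 7.2%
, | 458 | 6.6%
2 | 388 | 5.6%
3 | 368 | 5.3%
0 | 268 | 3.9%
4 | 196 | 2.8%
7 | 184 | 2.6%
Other values (21) | 670 | 9.6%

Hangul
Value | Count | Frequency (%)
(Hangul character, not preserved) | 704 | 7.4%
(Hangul character, not preserved) | 679 | 7.1%
(Hangul character, not preserved) | 599 | 6.3%
(Hangul character, not preserved) | 590 | 6.2%
(Hangul character, not preserved) | 560 | 5.9%
(Hangul character, not preserved) | 554 | 5.8%
(Hangul character, not preserved) | 539 | 5.6%
(Hangul character, not preserved) | 511 | 5.3%
(Hangul character, not preserved) | 507 | 5.3%
(Hangul character, not preserved) | 505 | 5.3%
Other values (198) | 3813 | 39.9%

None
Value | Count | Frequency (%)
· | 13 | 100.0%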

영업소전화번호 (business phone number)
Text

MISSING

Distinct: 214
Distinct (%): 98.2%
Missing: 286
Missing (%): 56.7%
Memory size: 4.1 KiB

Length

Max length: 14
Median length: 12
Mean length: 12.022936
Min length: 9

Characters and Unicode

Total characters: 2621
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 210
Unique (%): 96.3%

Sample

1st row: 051-315-8201
2nd row: 051-322-2859
3rd row: 051-303-2926
4th row: 070-8850-2893
5th row: 051-322-2012

Value | Count | Frequency (%)
051-314-0514 | 2 | 0.9%
051-866-6257 | 2 | 0.9%
051-892-2004 | 2 | 0.9%
051-332-5243 | 2 | 0.9%
051-326-1842 | 1 | 0.5%
070-7769-5754 | 1 | 0.5%
051-265-6924 | 1 | 0.5%
051-315-8201 | 1 | 0.5%
051-302-9908 | 1 | 0.5%
051-316-1303 | 1 | 0.5%
Other values (204) | 204 | 93.6%

Most occurring characters

Value | Count | Frequency (%)
- | 434 | 16.6%
1 | 399 | 15.2%
0 | 381 | 14.5%
5 | 333 | 12.7%
3 | 287 | 11.0%
2 | 224 | 8.5%
9 | 124 | 4.7%
7 | 124 | 4.7%
6 | 116 | 4.4%
8 | 105 | 4.0%
Other values (2) | 94 | 3.6%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2186 | 83.4%
Dash Punctuation | 434 | 16.6%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 399 | 18.3%
0 | 381 | 17.4%
5 | 333 | 15.2%
3 | 287 | 13.1%
2 | 224 | 10.2%
9 | 124 | 5.7%
7 | 124 | 5.7%
6 | 116 | 5.3%
8 | 105 | 4.8%
4 | 93 | 4.3%

Dash Punctuation
Value | Count | Frequency (%)
- | 434 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2621 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 434 | 16.6%
1 | 399 | 15.2%
0 | 381 | 14.5%
5 | 333 | 12.7%
3 | 287 | 11.0%
2 | 224 | 8.5%
9 | 124 | 4.7%
7 | 124 | 4.7%
6 | 116 | 4.4%
8 | 105 | 4.0%
Other values (2) | 94 | 3.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2621 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 434 | 16.6%
1 | 399 | 15.2%
0 | 381 | 14.5%
5 | 333 | 12.7%
3 | 287 | 11.0%
2 | 224 | 8.5%
9 | 124 | 4.7%
7 | 124 | 4.7%
6 | 116 | 4.4%
8 | 105 | 4.0%
Other values (2) | 94 | 3.6%

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
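
The two displays described above can be approximated without a plotting library. The sketch below uses only standard Python and treats `None` as the stand-in for `<NA>`. The function names are hypothetical, and the data are the first three rows of the Sample table.

```python
columns = ["영업소명", "영업소소재지(도로명)", "영업소전화번호"]

# First three rows of the Sample table; None stands in for <NA>.
rows = [
    ("대성종합상사", "부산광역시 사상구 괘감로 38, 한림도매상가 101호 (감전동)", "051-315-8201"),
    ("힐스템 모라점", "부산광역시 사상구 백양대로 937, 3층 (모라동)", None),
    ("우신메디칼", "부산광역시 사상구 괘감로 37, 산업용품유통상가 21동 228호 (괘법동)", None),
]

def missing_by_column(rows, columns):
    """Count the missing cells in each column."""
    return {name: sum(row[i] is None for row in rows) for i, name in enumerate(columns)}

def nullity_matrix(rows):
    """One list per record, one flag per column: True where a value is present."""
    return [[cell is not None for cell in row] for row in rows]

print(missing_by_column(rows, columns))
for flags in nullity_matrix(rows):
    # Render present cells as '#' and missing cells as '.'.
    print("".join("#" if present else "." for present in flags))
```

In these three rows only the phone-number column has gaps, which matches the single missing-value alert in the report.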

Sample

First 10 rows

Row | 영업소명 | 영업소소재지(도로명) | 영업소전화번호
0 | 대성종합상사 | 부산광역시 사상구 괘감로 38, 한림도매상가 101호 (감전동) | 051-315-8201
1 | 힐스템 모라점 | 부산광역시 사상구 백양대로 937, 3층 (모라동) | <NA>
2 | 우신메디칼 | 부산광역시 사상구 괘감로 37, 산업용품유통상가 21동 228호 (괘법동) | <NA>
3 | 우성메디칼 | 부산광역시 사상구 새벽로 131, 3동 229호 (감전동) | <NA>
4 | 의족보조기센터 | 부산광역시 사상구 백양대로768번길 5, 1층 (덕포동) | <NA>
5 | 스피드 온 | 부산광역시 사상구 가야대로230번길 19, 4층 (주례동) | <NA>
6 | 나라편의점 | 부산광역시 사상구 낙동대로 1026, 1층 (감전동) | <NA>
7 | 인피니트 | 부산광역시 사상구 새벽로 131, 부산산업용재유통상가 5동 331·332호 (감전동) | <NA>
8 | 씨유 엄궁플렉스점 | 부산광역시 사상구 낙동대로 746-1, 1층 (엄궁동) | <NA>
9 | 지에스25 사상스타힐스점 | 부산광역시 사상구 사상로223번길 23, 304동 107·108호 (괘법동, 센트럴 스타힐스) | <NA>

Last 10 rows

Row | 영업소명 | 영업소소재지(도로명) | 영업소전화번호
494 | 해피랜드 | 부산광역시 사상구 광장로 7 (괘법동, 르네시떼 지하1층) | 051-319-8000
495 | 굿셀의료기 | 부산광역시 사상구 사상로 465-1 (모라동) | 051-304-3439
496 | 영림의료기 | 부산광역시 사상구 동주로 24-3 (주례동, 2층,3층) | 051-314-0514
497 | 원메딕스 | 부산광역시 사상구 사상로 290 (덕포동) | 051-305-7111
498 | 박포전자 | 부산광역시 사상구 광장로 7 (괘법동, 르네시떼 5층 5033호) | 051-319-5033
499 | 미소의료기 | 부산광역시 사상구 사상로148번길 25 (괘법동) | <NA>
500 | 홈플러스(주)서부산점 | 부산광역시 사상구 광장로 7 (괘법동, 르네시떼 지하1층) | 051-319-9135
501 | 선영메디칼모라점 | 부산광역시 사상구 사상로 476 (모라동, 명진빌딩 4층) | 051-301-8275
502 | 명진의료기상사 | 부산광역시 사상구 백양대로 430 (주례동) | 051-324-8996
503 | 한국보장구상사 | 부산광역시 사상구 사상로 78 (감전동) | 051-328-4534

Duplicate rows

Most frequently occurring

Row | 영업소명 | 영업소소재지(도로명) | 영업소전화번호 | # duplicates
0 | 휴바디앤스킨 | 부산광역시 사상구 사상로 247-1 (괘법동) | <NA> | 2
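
A duplicate-row table like this one can be derived by counting identical records. The sketch below is one way to do that with standard Python. It assumes two rows are duplicates when every field matches and treats two missing phone numbers (`None`) as equal. The function name is hypothetical, and the row order is illustrative rather than taken from the dataset.

```python
from collections import Counter

def duplicated_rows(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}

# None stands in for <NA>. The first record is the duplicated row reported above.
rows = [
    ("휴바디앤스킨", "부산광역시 사상구 사상로 247-1 (괘법동)", None),
    ("미소의료기", "부산광역시 사상구 사상로148번길 25 (괘법동)", None),
    ("휴바디앤스킨", "부산광역시 사상구 사상로 247-1 (괘법동)", None),
]

for row, n in duplicated_rows(rows).items():
    print(row[0], n)  # 휴바디앤스킨 2
```

A row that occurs twice contributes one redundant copy, which is consistent with the single duplicate row counted in the overview.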