Overview

Dataset statistics

Number of variables: 3
Number of observations: 586
Missing cells: 40
Missing cells (%): 2.3%
Duplicate rows: 3
Duplicate rows (%): 0.5%
Total size in memory: 13.9 KiB
Average record size in memory: 24.2 B
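The two percentages above follow directly from the counts. A minimal sketch that recomputes them, assuming "Missing cells (%)" is taken over all cells (variables times observations) and "Duplicate rows (%)" over all rows; the variable names are illustrative only:

```python
# Recompute the overview percentages from the raw counts in the report.
n_variables = 3
n_observations = 586
missing_cells = 40
duplicate_rows = 3

# Every row has one cell per variable.
total_cells = n_variables * n_observations

missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```

Both values round to the figures reported in the overview.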

Variable types

Text: 3

Dataset

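Description: 부산광역시연제구_의료기기업소현황_20230918 (Yeonje-gu, Busan Metropolitan City: medical device business listing, 2023-09-18)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15048094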

Alerts

Duplicates: Dataset has 3 (0.5%) duplicate rows
Missing: 영업소우편번호(도로명) (business postal code, road-name address) has 38 (6.5%) missing values

Reproduction

Analysis started: 2023-12-10 16:37:46.873225
Analysis finished: 2023-12-10 16:37:47.846120
Duration: 0.97 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

영업소명 (business name)

Distinct: 576
Distinct (%): 98.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.7 KiB

Length

Max length: 32
Median length: 17
Mean length: 7.4982935
Min length: 2

Characters and Unicode

Total characters: 4394
Distinct characters: 409
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
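As an illustration of how such character properties can be tallied, here is a minimal sketch using Python's standard `unicodedata` module. It is not necessarily what the profiling tool does internally, and the function name `category_counts` is hypothetical. The module returns two-letter codes (`Lo` = Other Letter, `Lu` = Uppercase Letter, `Zs` = Space Separator, `Ps` = Open Punctuation, `Pe` = Close Punctuation) rather than the long names used in this report.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over all characters of a text column."""
    counts = Counter()
    for value in values:
        if value is None:  # skip missing cells
            continue
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

# Two sample values taken from the report, plus one missing cell.
sample = ["(주)아성다이소 부산교대점", "JW메디칼", None]
counts = category_counts(sample)
print(counts.most_common())
```

Hangul syllables fall under `Lo`, which is why "Other Letter" dominates the category tables in this report.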

Unique

Unique: 567
Unique (%): 96.8%

Sample

1st row: (주)아성다이소 부산교대점
2nd row: JW메디칼
3rd row: 에스원메드
4th row: 조앤
5th row: 도토리컴퍼니
Value | Count | Frequency (%)
주식회사 | 24 | 3.3%
세븐일레븐 | 18 | 2.5%
씨유 | 7 | 1.0%
지에스25 | 7 | 1.0%
주)아성다이소 | 6 | 0.8%
연산점 | 5 | 0.7%
이마트24 | 4 | 0.6%
라라샵 | 4 | 0.6%
해피랜드 | 4 | 0.6%
메디칼 | 3 | 0.4%
Other values (621) | 640 | 88.6%

Most occurring characters

ValueCountFrequency (%)
168
 
3.8%
161
 
3.7%
136
 
3.1%
135
 
3.1%
131
 
3.0%
129
 
2.9%
120
 
2.7%
114
 
2.6%
108
 
2.5%
) 91
 
2.1%
Other values (399) 3101
70.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3739 | 85.1%
Uppercase Letter | 176 | 4.0%
Space Separator | 136 | 3.1%
Decimal Number | 117 | 2.7%
Close Punctuation | 91 | 2.1%
Open Punctuation | 90 | 2.0%
Lowercase Letter | 37 | 0.8%
Other Punctuation | 4 | 0.1%
Dash Punctuation | 2 | < 0.1%
Other Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
168
 
4.5%
161
 
4.3%
135
 
3.6%
131
 
3.5%
129
 
3.5%
120
 
3.2%
114
 
3.0%
108
 
2.9%
66
 
1.8%
63
 
1.7%
Other values (345) 2544
68.0%
Uppercase Letter
Value | Count | Frequency (%)
S | 44 | 25.0%
G | 35 | 19.9%
C | 19 | 10.8%
U | 16 | 9.1%
M | 6 | 3.4%
H | 6 | 3.4%
E | 6 | 3.4%
K | 5 | 2.8%
P | 5 | 2.8%
N | 4 | 2.3%
Other values (12) | 30 | 17.0%
Lowercase Letter
Value | Count | Frequency (%)
s | 9 | 24.3%
g | 7 | 18.9%
e | 4 | 10.8%
h | 3 | 8.1%
m | 2 | 5.4%
c | 2 | 5.4%
r | 1 | 2.7%
a | 1 | 2.7%
b | 1 | 2.7%
u | 1 | 2.7%
Other values (6) | 6 | 16.2%
Decimal Number
Value | Count | Frequency (%)
2 | 55 | 47.0%
5 | 47 | 40.2%
4 | 5 | 4.3%
3 | 3 | 2.6%
0 | 2 | 1.7%
1 | 2 | 1.7%
6 | 1 | 0.9%
7 | 1 | 0.9%
8 | 1 | 0.9%
Other Punctuation
Value | Count | Frequency (%)
. | 3 | 75.0%
& | 1 | 25.0%
Space Separator
ValueCountFrequency (%)
136
100.0%
Close Punctuation
ValueCountFrequency (%)
) 91
100.0%
Open Punctuation
ValueCountFrequency (%)
( 90
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3741 | 85.1%
Common | 440 | 10.0%
Latin | 213 | 4.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
168
 
4.5%
161
 
4.3%
135
 
3.6%
131
 
3.5%
129
 
3.4%
120
 
3.2%
114
 
3.0%
108
 
2.9%
66
 
1.8%
63
 
1.7%
Other values (346) 2546
68.1%
Latin
ValueCountFrequency (%)
S 44
20.7%
G 35
16.4%
C 19
 
8.9%
U 16
 
7.5%
s 9
 
4.2%
g 7
 
3.3%
M 6
 
2.8%
H 6
 
2.8%
E 6
 
2.8%
K 5
 
2.3%
Other values (28) 60
28.2%
Common
ValueCountFrequency (%)
136
30.9%
) 91
20.7%
( 90
20.5%
2 55
12.5%
5 47
 
10.7%
4 5
 
1.1%
. 3
 
0.7%
3 3
 
0.7%
- 2
 
0.5%
0 2
 
0.5%
Other values (5) 6
 
1.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3739 | 85.1%
ASCII | 653 | 14.9%
None | 2 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
168
 
4.5%
161
 
4.3%
135
 
3.6%
131
 
3.5%
129
 
3.5%
120
 
3.2%
114
 
3.0%
108
 
2.9%
66
 
1.8%
63
 
1.7%
Other values (345) 2544
68.0%
ASCII
ValueCountFrequency (%)
136
20.8%
) 91
13.9%
( 90
13.8%
2 55
8.4%
5 47
 
7.2%
S 44
 
6.7%
G 35
 
5.4%
C 19
 
2.9%
U 16
 
2.5%
s 9
 
1.4%
Other values (43) 111
17.0%
None
ValueCountFrequency (%)
2
100.0%
영업소소재지(도로명) (business location, road-name address)

Distinct: 564
Distinct (%): 96.6%
Missing: 2
Missing (%): 0.3%
Memory size: 4.7 KiB

Length

Max length: 63
Median length: 50
Mean length: 32.818493
Min length: 16

Characters and Unicode

Total characters: 19166
Distinct characters: 258
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 544
Unique (%): 93.2%

Sample

1st row: 부산광역시 연제구 중앙대로 1178, 1~3층 (거제동)
2nd row: 부산광역시 연제구 중앙천로19번길 50, 306호 (연산동)
3rd row: 부산광역시 연제구 월드컵대로 160, 4층 일부(B11)호 (연산동)
4th row: 부산광역시 연제구 중앙대로 1078, 뉴그랜드오피스텔 1202호 (연산동)
5th row: 부산광역시 연제구 중앙대로1124번길 20, 서진빌딩 2층 일부호 (연산동)
Value | Count | Frequency (%)
부산광역시 | 584 | 15.7%
연제구 | 583 | 15.7%
연산동 | 417 | 11.2%
거제동 | 137 | 3.7%
1층 | 115 | 3.1%
월드컵대로 | 59 | 1.6%
중앙대로 | 48 | 1.3%
2층 | 48 | 1.3%
과정로 | 36 | 1.0%
3층 | 29 | 0.8%
Other values (732) | 1668 | 44.8%

Most occurring characters

ValueCountFrequency (%)
3141
 
16.4%
1117
 
5.8%
1076
 
5.6%
1 848
 
4.4%
830
 
4.3%
694
 
3.6%
644
 
3.4%
634
 
3.3%
598
 
3.1%
) 594
 
3.1%
Other values (248) 8990
46.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 11172 | 58.3%
Space Separator | 3141 | 16.4%
Decimal Number | 3005 | 15.7%
Close Punctuation | 594 | 3.1%
Open Punctuation | 594 | 3.1%
Other Punctuation | 531 | 2.8%
Uppercase Letter | 80 | 0.4%
Dash Punctuation | 47 | 0.2%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1117
 
10.0%
1076
 
9.6%
830
 
7.4%
694
 
6.2%
644
 
5.8%
634
 
5.7%
598
 
5.4%
585
 
5.2%
585
 
5.2%
583
 
5.2%
Other values (212) 3826
34.2%
Uppercase Letter
Value | Count | Frequency (%)
B | 11 | 13.8%
K | 8 | 10.0%
C | 7 | 8.8%
W | 7 | 8.8%
I | 7 | 8.8%
S | 7 | 8.8%
V | 6 | 7.5%
E | 6 | 7.5%
H | 5 | 6.2%
J | 4 | 5.0%
Other values (8) | 12 | 15.0%
Decimal Number
Value | Count | Frequency (%)
1 | 848 | 28.2%
2 | 456 | 15.2%
0 | 374 | 12.4%
3 | 329 | 10.9%
4 | 244 | 8.1%
5 | 197 | 6.6%
6 | 161 | 5.4%
8 | 144 | 4.8%
7 | 138 | 4.6%
9 | 114 | 3.8%
Other Punctuation
Value | Count | Frequency (%)
, | 529 | 99.6%
? | 1 | 0.2%
& | 1 | 0.2%
Space Separator
ValueCountFrequency (%)
3141
100.0%
Close Punctuation
ValueCountFrequency (%)
) 594
100.0%
Open Punctuation
ValueCountFrequency (%)
( 594
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 47
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 11172 | 58.3%
Common | 7914 | 41.3%
Latin | 80 | 0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1117
 
10.0%
1076
 
9.6%
830
 
7.4%
694
 
6.2%
644
 
5.8%
634
 
5.7%
598
 
5.4%
585
 
5.2%
585
 
5.2%
583
 
5.2%
Other values (212) 3826
34.2%
Common
ValueCountFrequency (%)
3141
39.7%
1 848
 
10.7%
) 594
 
7.5%
( 594
 
7.5%
, 529
 
6.7%
2 456
 
5.8%
0 374
 
4.7%
3 329
 
4.2%
4 244
 
3.1%
5 197
 
2.5%
Other values (8) 608
 
7.7%
Latin
ValueCountFrequency (%)
B 11
13.8%
K 8
10.0%
C 7
8.8%
W 7
8.8%
I 7
8.8%
S 7
8.8%
V 6
7.5%
E 6
7.5%
H 5
6.2%
J 4
 
5.0%
Other values (8) 12
15.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 11172 | 58.3%
ASCII | 7994 | 41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3141
39.3%
1 848
 
10.6%
) 594
 
7.4%
( 594
 
7.4%
, 529
 
6.6%
2 456
 
5.7%
0 374
 
4.7%
3 329
 
4.1%
4 244
 
3.1%
5 197
 
2.5%
Other values (26) 688
 
8.6%
Hangul
ValueCountFrequency (%)
1117
 
10.0%
1076
 
9.6%
830
 
7.4%
694
 
6.2%
644
 
5.8%
634
 
5.7%
598
 
5.4%
585
 
5.2%
585
 
5.2%
583
 
5.2%
Other values (212) 3826
34.2%
영업소우편번호(도로명) (business postal code, road-name address)

Distinct: 101
Distinct (%): 18.4%
Missing: 38
Missing (%): 6.5%
Memory size: 4.7 KiB

Length

Max length: 7
Median length: 6
Mean length: 6.0054745
Min length: 6

Characters and Unicode

Total characters: 3291
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 19
Unique (%): 3.5%

Sample

1st row: '47515
2nd row: '47601
3rd row: '47524
4th row: '47596
5th row: '47520
Value | Count | Frequency (%)
47524 | 26 | 4.7%
47596 | 24 | 4.4%
47540 | 20 | 3.6%
47541 | 19 | 3.5%
47542 | 18 | 3.3%
47520 | 17 | 3.1%
47565 | 17 | 3.1%
47558 | 12 | 2.2%
47583 | 12 | 2.2%
47564 | 12 | 2.2%
Other values (91) | 371 | 67.7%

Most occurring characters

Value | Count | Frequency (%)
4 | 711 | 21.6%
7 | 607 | 18.4%
5 | 582 | 17.7%
' | 545 | 16.6%
6 | 177 | 5.4%
0 | 139 | 4.2%
1 | 136 | 4.1%
2 | 132 | 4.0%
9 | 114 | 3.5%
8 | 87 | 2.6%
Other values (2) | 61 | 1.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2743 | 83.3%
Other Punctuation | 545 | 16.6%
Dash Punctuation | 3 | 0.1%
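The category counts hint at two value shapes in this column: 545 values with a leading apostrophe (6 characters each, such as `'47515`) and 3 values containing a dash (7 characters each). That split is consistent with the totals reported for this variable, since 545 × 6 + 3 × 7 = 3291 characters and 545 + 3 = 548 non-missing values. The sketch below shows one way such a column might be normalized. It rests on two assumptions not stated in the report: that the apostrophe is a text-forcing prefix left over from a spreadsheet export, and that the dashed values follow an older `NNN-NNN` pattern. The function name `normalize_postcode` and the example value `611-080` are hypothetical.

```python
import re

FIVE_DIGIT = re.compile(r"\d{5}")
OLD_SIX_DIGIT = re.compile(r"\d{3}-\d{3}")

def normalize_postcode(raw):
    """Return (cleaned_value, kind) for one raw postal-code cell."""
    if raw is None:
        return None, "missing"
    # Drop surrounding whitespace and any leading apostrophe (assumed export artifact).
    cleaned = raw.strip().lstrip("'")
    if FIVE_DIGIT.fullmatch(cleaned):
        return cleaned, "five_digit"
    if OLD_SIX_DIGIT.fullmatch(cleaned):
        return cleaned, "old_six_digit"
    return cleaned, "unrecognized"

# "'47515" appears in the report; "611-080" is a made-up value in the assumed old format.
for raw in ["'47515", "611-080", None]:
    print(raw, "->", normalize_postcode(raw))
```

Classifying rather than silently rewriting keeps the three dashed values visible for manual review.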

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
4 | 711 | 25.9%
7 | 607 | 22.1%
5 | 582 | 21.2%
6 | 177 | 6.5%
0 | 139 | 5.1%
1 | 136 | 5.0%
2 | 132 | 4.8%
9 | 114 | 4.2%
8 | 87 | 3.2%
3 | 58 | 2.1%
Other Punctuation
ValueCountFrequency (%)
' 545
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3291
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 711
21.6%
7 607
18.4%
5 582
17.7%
' 545
16.6%
6 177
 
5.4%
0 139
 
4.2%
1 136
 
4.1%
2 132
 
4.0%
9 114
 
3.5%
8 87
 
2.6%
Other values (2) 61
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3291
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 711
21.6%
7 607
18.4%
5 582
17.7%
' 545
16.6%
6 177
 
5.4%
0 139
 
4.2%
1 136
 
4.1%
2 132
 
4.0%
9 114
 
3.5%
8 87
 
2.6%
Other values (2) 61
 
1.9%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
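Nullity correlation can be read as the Pearson correlation between per-column "value is present" indicators. A minimal pure-Python sketch of that idea follows, using toy rows rather than the real dataset; the function name `nullity_correlation` and the column keys `address` and `postcode` are illustrative assumptions.

```python
from math import sqrt

def nullity_correlation(rows, col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    a = [0.0 if row[col_a] is None else 1.0 for row in rows]
    b = [0.0 if row[col_b] is None else 1.0 for row in rows]
    n = len(rows)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # Undefined when a column is always present or always missing.
        return None
    return cov / sqrt(var_a * var_b)

# Toy rows: the postcode is missing whenever the address is, plus once on its own.
rows = [
    {"address": "x", "postcode": "1"},
    {"address": None, "postcode": None},
    {"address": "y", "postcode": None},
    {"address": "z", "postcode": "2"},
]
r = nullity_correlation(rows, "address", "postcode")
print(round(r, 3))
```

A column with no missing values, such as 영업소명 in this dataset, has a constant indicator, so its nullity correlation with any other column is undefined.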

Sample

First rows

Index | 영업소명 | 영업소소재지(도로명) | 영업소우편번호(도로명)
0 | (주)아성다이소 부산교대점 | 부산광역시 연제구 중앙대로 1178, 1~3층 (거제동) | '47515
1 | JW메디칼 | 부산광역시 연제구 중앙천로19번길 50, 306호 (연산동) | '47601
2 | 에스원메드 | 부산광역시 연제구 월드컵대로 160, 4층 일부(B11)호 (연산동) | '47524
3 | 조앤 | 부산광역시 연제구 중앙대로 1078, 뉴그랜드오피스텔 1202호 (연산동) | '47596
4 | 도토리컴퍼니 | 부산광역시 연제구 중앙대로1124번길 20, 서진빌딩 2층 일부호 (연산동) | '47520
5 | 지에스25 연제미라주점 | 부산광역시 연제구 반송로 80, 107동 108, 109호 (연산동, 연산동 일동 미라주 더 스타) | '47552
6 | 시그니아 독일보청기 연산센터 | 부산광역시 연제구 반송로 8, 2층 일부호 (연산동) | '47549
7 | 세븐일레븐 부산연제연산점 | 부산광역시 연제구 월드컵대로145번길 74, 1층 (연산동) | '47541
8 | 케이엘컴퍼니 | 부산광역시 연제구 중앙대로 1078, 뉴그랜드오피스텔 11층 1102호 (연산동) | '47596
9 | 씨유연제거성점 | 부산광역시 연제구 아시아드대로 82, 1층 (거제동) | '47508

Last rows

Index | 영업소명 | 영업소소재지(도로명) | 영업소우편번호(도로명)
576 | 미주덴탈 | 부산광역시 연제구 법원남로15번길 29, 2층 (거제동) | '47511
577 | 다산메디칼주식회사 | 부산광역시 연제구 연안로13번길 97 (연산동) | '47565
578 | (주)이마트연제점 | 부산광역시 연제구 연수로 89 (연산동) | '47604
579 | 엘림메디칼 | 부산광역시 연제구 세병로 26 (연산동) | '47518
580 | 동양치과재료상사 | 부산광역시 연제구 반송로 112 (연산동) | '47559
581 | 피앤디동아메디컬주식회사 | 부산광역시 연제구 과정로 262, 에이스리버팰리스 203호 (연산동) | '47564
582 | 한솔메디칼 | <NA> | <NA>
583 | 주식회사 신승시스템 | 부산광역시 연제구 중앙대로1219번길 15 (거제동) | '47505
584 | 한솔메디칼 | 부산광역시 연제구 연안로13번길 97 (연산동) | '47565
585 | 가톨릭덴탈 | 부산광역시 연제구 쌍미천로16번길 4, 1층 (연산동) | '47594

Duplicate rows

Most frequently occurring

Index | 영업소명 | 영업소소재지(도로명) | 영업소우편번호(도로명) | # duplicates
0 | GS25연산신촌중앙점 | 부산광역시 연제구 신촌로 30(연산동) | '47541 | 2
1 | 웰빙라이프 | 부산광역시 연제구 고분로 136, 라동 105호 (연산동, 진일아파트) | '47583 | 2
2 | 이지라이프 | 부산광역시 연제구 연수로148번길 45 (연산동) | '47610 | 2
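A duplicate row here means a row whose three fields all match another row exactly. A minimal sketch of how such groups could be found with the standard library, using one duplicated record and one unique record taken from this report; the function name `duplicate_groups` is hypothetical.

```python
from collections import Counter

def duplicate_groups(rows):
    """Map each fully identical row (as a tuple) to its count, keeping only repeats."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}

rows = [
    ("GS25연산신촌중앙점", "부산광역시 연제구 신촌로 30(연산동)", "'47541"),
    ("GS25연산신촌중앙점", "부산광역시 연제구 신촌로 30(연산동)", "'47541"),
    ("미주덴탈", "부산광역시 연제구 법원남로15번길 29, 2층 (거제동)", "'47511"),
]
groups = duplicate_groups(rows)
for row, n in groups.items():
    print(n, row)
```

Exact matching is strict: two records that differ only in spacing, such as `30(연산동)` versus `30 (연산동)`, would not be grouped together.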