Overview

Dataset statistics

Number of variables: 3
Number of observations: 875
Missing cells: 522
Missing cells (%): 19.9%
Duplicate rows: 1
Duplicate rows (%): 0.1%
Total size in memory: 20.6 KiB
Average record size in memory: 24.2 B
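
The two percentages follow directly from the counts. A minimal sketch of the arithmetic, assuming the missing-cell rate is taken over every cell (variables × observations) and the duplicate rate over all rows; the variable names are illustrative only:

```python
# Counts taken from the dataset statistics.
n_variables = 3
n_observations = 875
missing_cells = 522
duplicate_rows = 1

# Assumption: the missing-cell rate is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(total_cells)              # 2625
print(f"{missing_pct:.1f}%")    # 19.9%
print(f"{duplicate_pct:.1f}%")  # 0.1%
```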

Variable types

Text: 3

Dataset

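Description: Data on medical device rental and sales businesses located in Dongnae-gu, Busan Metropolitan City, providing fields such as business name, address, and telephone number.
Author: Dongnae-gu, Busan Metropolitan City (부산광역시 동래구)
URL: https://www.data.go.kr/data/15026178/fileData.do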

Alerts

Duplicates: Dataset has 1 (0.1%) duplicate rows
Missing: 영업소전화번호 (business telephone number) has 522 (59.7%) missing values

Reproduction

Analysis started: 2023-12-12 03:16:10.890516
Analysis finished: 2023-12-12 03:16:11.437730
Duration: 0.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

영업소명 (business name)
Text

Distinct: 864
Distinct (%): 98.7%
Missing: 0
Missing (%): 0.0%
Memory size: 7.0 KiB

Length

Max length: 47
Median length: 38
Mean length: 7.8937143
Min length: 2
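
The mean length can be reproduced from the total character count reported for this variable (6907) divided by the number of observations. The sketch below does that, and also checks the reported median against the total. The helper `min_total_for_median` is a hypothetical name introduced here. The check suggests that a median of 38 cannot be a per-value median given the reported total, so the tool may derive that figure differently; this is an inference from the numbers, not something the report states.

```python
total_characters = 6907
n_observations = 875

mean_length = total_characters / n_observations
print(round(mean_length, 7))  # 7.8937143

def min_total_for_median(median, minimum, n):
    """Smallest possible sum of n values with the given median and minimum."""
    # The middle value and everything above it must be at least the median.
    at_or_above_median = n - n // 2
    return at_or_above_median * median + (n - at_or_above_median) * minimum

# Lower bound on the total if 38 really were the per-value median length.
lower_bound = min_total_for_median(median=38, minimum=2, n=n_observations)
print(lower_bound)                     # 17518
print(lower_bound > total_characters)  # True
```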

Characters and Unicode

Total characters: 6907
Distinct characters: 501
Distinct categories: 12
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
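
As a rough illustration of how such properties can be read per character, the sketch below uses Python's standard `unicodedata` module to count general categories in one business name from the sample table. The mapping from two-letter category codes to the long names used in this report is written out by hand and covers only the codes needed here; it is not taken from the profiling tool.

```python
import unicodedata
from collections import Counter

# Hand-written mapping for the categories that occur in the example string.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
}

def category_counts(text):
    """Count Unicode general categories, keyed by their long names."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)

counts = category_counts("GS25 부산명장역점")
for name, n in counts.most_common():
    print(name, n)
```

For this string the count is 6 Other Letter (the Hangul syllables), 2 Uppercase Letter, 2 Decimal Number and 1 Space Separator.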

Unique

Unique: 854
Unique (%): 97.6%
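
The Distinct and Unique figures measure different things. A minimal sketch, assuming "distinct" counts the different values present and "unique" counts the values that occur exactly once; `distinct_and_unique` is a hypothetical helper name:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique) under the assumed definitions."""
    counts = Counter(values)
    distinct = len(counts)                              # different values
    unique = sum(1 for n in counts.values() if n == 1)  # values seen exactly once
    return distinct, unique

# Illustrative values taken from the report's sample and duplicate-row tables.
names = ["바디닥터", "바디닥터", "조이풀뮤직", "96분식"]
print(distinct_and_unique(names))  # (3, 2)

# The same reading applied to the figures reported for this variable.
n_rows, distinct, unique = 875, 864, 854
repeated_values = distinct - unique  # values that occur more than once
rows_with_repeats = n_rows - unique  # rows holding those repeated values
print(repeated_values, rows_with_repeats)  # 10 21
```

Under that reading, 10 business names occur more than once and together account for 21 of the 875 rows.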

Sample

1st row: 기노뷰티살롱
2nd row: 96분식
3rd row: 조이풀뮤직
4th row: 주식회사 리온
5th row: 몽인연기학원
Value               Count   Frequency (%)
gs25                   47    4.0%
주식회사               39    3.3%
세븐일레븐             31    2.6%
씨유                   17    1.4%
cu                     11    0.9%
메디칼                 10    0.9%
이마트24                9    0.8%
동래점                  8    0.7%
부산동래점              5    0.4%
사직점                  5    0.4%
Other values (921)    993   84.5%
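
The table above counts words rather than whole values, and the lowercase entries ("gs25", "cu") suggest the text is lowercased first. A minimal sketch under that assumption, splitting on whitespace; `word_frequencies` is a hypothetical helper, not the profiling tool's own function:

```python
from collections import Counter

def word_frequencies(values):
    """Count lowercased, whitespace-separated words across all values."""
    words = Counter()
    for value in values:
        words.update(value.lower().split())
    return words

# Three business names from the report's sample table.
sample = ["주식회사 리온", "세븐일레븐 부산온천삼익점", "GS25 부산명장역점"]
freq = word_frequencies(sample)
print(freq["gs25"])  # 1

# The percentages in the table are shares of all words, not of rows.
top_ten = [47, 39, 31, 17, 11, 10, 9, 8, 5, 5]
total_words = sum(top_ten) + 993
print(total_words)                       # 1175
print(f"{100 * 47 / total_words:.1f}%")  # 4.0%
```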

Most occurring characters

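Value               Count   Frequency (%)
(space)               730   10.6%
(not preserved)       212    3.1%
(not preserved)       206    3.0%
(not preserved)       203    2.9%
(not preserved)       192    2.8%
(not preserved)       155    2.2%
(not preserved)       142    2.1%
)                     122    1.8%
(                     122    1.8%
(not preserved)       117    1.7%
Other values (491)   4706   68.1%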

Most occurring categories

Value               Count   Frequency (%)
Other Letter         5406   78.3%
Space Separator       730   10.6%
Uppercase Letter      293    4.2%
Decimal Number        166    2.4%
Close Punctuation     122    1.8%
Open Punctuation      122    1.8%
Lowercase Letter       43    0.6%
Other Symbol           13    0.2%
Other Punctuation       8    0.1%
Modifier Symbol         2   < 0.1%
Other values (2)        2   < 0.1%

Most frequent character per category

Other Letter
Value               Count   Frequency (%)
(not preserved)       212    3.9%
(not preserved)       206    3.8%
(not preserved)       203    3.8%
(not preserved)       192    3.6%
(not preserved)       155    2.9%
(not preserved)       142    2.6%
(not preserved)       117    2.2%
(not preserved)       111    2.1%
(not preserved)       102    1.9%
(not preserved)       101    1.9%
Other values (432)   3865   71.5%

Uppercase Letter
Value               Count   Frequency (%)
S                      71   24.2%
G                      60   20.5%
C                      28    9.6%
U                      18    6.1%
M                      17    5.8%
K                      17    5.8%
H                      15    5.1%
B                      10    3.4%
D                       8    2.7%
J                       7    2.4%
Other values (12)      42   14.3%

Lowercase Letter
Value               Count   Frequency (%)
e                       9   20.9%
l                       4    9.3%
o                       3    7.0%
a                       3    7.0%
u                       3    7.0%
i                       3    7.0%
s                       2    4.7%
h                       2    4.7%
g                       2    4.7%
c                       2    4.7%
Other values (8)       10   23.3%

Decimal Number
Value               Count   Frequency (%)
2                      70   42.2%
5                      61   36.7%
4                      12    7.2%
1                       8    4.8%
3                       6    3.6%
0                       6    3.6%
6                       2    1.2%
9                       1    0.6%

Other Punctuation
Value               Count   Frequency (%)
&                       4   50.0%
.                       2   25.0%
/                       1   12.5%
,                       1   12.5%

Space Separator
Value               Count   Frequency (%)
(space)               730   100.0%

Close Punctuation
Value               Count   Frequency (%)
)                     122   100.0%

Open Punctuation
Value               Count   Frequency (%)
(                     122   100.0%

Other Symbol
Value               Count   Frequency (%)
(not preserved)        13   100.0%

Modifier Symbol
Value               Count   Frequency (%)
`                       2   100.0%

Final Punctuation
Value               Count   Frequency (%)
(not preserved)         1   100.0%

Dash Punctuation
Value               Count   Frequency (%)
-                       1   100.0%

Most occurring scripts

Value               Count   Frequency (%)
Hangul               5416   78.4%
Common               1152   16.7%
Latin                 336    4.9%
Han                     3   < 0.1%

Most frequent character per script

Hangul
Value               Count   Frequency (%)
(not preserved)       212    3.9%
(not preserved)       206    3.8%
(not preserved)       203    3.7%
(not preserved)       192    3.5%
(not preserved)       155    2.9%
(not preserved)       142    2.6%
(not preserved)       117    2.2%
(not preserved)       111    2.0%
(not preserved)       102    1.9%
(not preserved)       101    1.9%
Other values (430)   3875   71.5%

Latin
Value               Count   Frequency (%)
S                      71   21.1%
G                      60   17.9%
C                      28    8.3%
U                      18    5.4%
M                      17    5.1%
K                      17    5.1%
H                      15    4.5%
B                      10    3.0%
e                       9    2.7%
D                       8    2.4%
Other values (30)      83   24.7%

Common
Value               Count   Frequency (%)
(space)               730   63.4%
)                     122   10.6%
(                     122   10.6%
2                      70    6.1%
5                      61    5.3%
4                      12    1.0%
1                       8    0.7%
3                       6    0.5%
0                       6    0.5%
&                       4    0.3%
Other values (8)       11    1.0%

Han
Value               Count   Frequency (%)
(not preserved)         1   33.3%
(not preserved)         1   33.3%
(not preserved)         1   33.3%

Most occurring blocks

Value               Count   Frequency (%)
Hangul               5403   78.2%
ASCII                1487   21.5%
None                   13    0.2%
CJK                     3   < 0.1%
Punctuation             1   < 0.1%
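
Python's standard library does not expose Unicode block names, so the sketch below approximates the block labels above with a small hand-written table of code point ranges. The ranges and their pairing with the report's labels are assumptions made for illustration; the profiling tool's own block table is not shown in this report. Characters outside the listed ranges fall back to "None" in this sketch.

```python
from collections import Counter

# Simplified block table: (first code point, last code point, label).
BLOCKS = [
    (0x0000, 0x007F, "ASCII"),        # Basic Latin
    (0x2000, 0x206F, "Punctuation"),  # General Punctuation
    (0x4E00, 0x9FFF, "CJK"),          # CJK Unified Ideographs
    (0xAC00, 0xD7AF, "Hangul"),       # Hangul Syllables
]

def block_of(ch):
    """Return the label of the first range containing ch, else "None"."""
    code = ord(ch)
    for first, last, label in BLOCKS:
        if first <= code <= last:
            return label
    return "None"

counts = Counter(block_of(ch) for ch in "GS25 부산명장역점")
print(counts["ASCII"], counts["Hangul"])  # 5 6
print(block_of("\u00b7"))                 # None (middle dot, outside the listed ranges)
```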

Most frequent character per block

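ASCII
Value               Count   Frequency (%)
(space)               730   49.1%
)                     122    8.2%
(                     122    8.2%
S                      71    4.8%
2                      70    4.7%
5                      61    4.1%
G                      60    4.0%
C                      28    1.9%
U                      18    1.2%
M                      17    1.1%
Other values (47)     188   12.6%

Hangul
Value               Count   Frequency (%)
(not preserved)       212    3.9%
(not preserved)       206    3.8%
(not preserved)       203    3.8%
(not preserved)       192    3.6%
(not preserved)       155    2.9%
(not preserved)       142    2.6%
(not preserved)       117    2.2%
(not preserved)       111    2.1%
(not preserved)       102    1.9%
(not preserved)       101    1.9%
Other values (429)   3862   71.5%

None
Value               Count   Frequency (%)
(not preserved)        13   100.0%

CJK
Value               Count   Frequency (%)
(not preserved)         1   33.3%
(not preserved)         1   33.3%
(not preserved)         1   33.3%

Punctuation
Value               Count   Frequency (%)
(not preserved)         1   100.0%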

영업소소재지(도로명) (business address, road name)
Text

Distinct: 850
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 7.0 KiB

Length

Max length: 63
Median length: 52
Mean length: 32.675429
Min length: 17

Characters and Unicode

Total characters: 28591
Distinct characters: 289
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 830
Unique (%): 94.9%

Sample

1st row: 부산광역시 동래구 온천장로107번길 5, 4층 (온천동)
2nd row: 부산광역시 동래구 명안로85번길 5, 1층 (명장동)
3rd row: 부산광역시 동래구 명장로 65, 상가1동 202호 (명장동, e편한세상 동래명장)
4th row: 부산광역시 동래구 명장로20번길 98, 삼성타운 201,206호 (명장동)
5th row: 부산광역시 동래구 명륜로 78-1, 7층 (수안동)
Value                Count   Frequency (%)
부산광역시             875   15.9%
동래구                 875   15.9%
온천동                 252    4.6%
1층                    165    3.0%
안락동                 142    2.6%
사직동                 123    2.2%
2층                    119    2.2%
충렬대로                66    1.2%
명륜동                  63    1.1%
수안동                  59    1.1%
Other values (1019)   2772   50.3%

Most occurring characters

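Value               Count   Frequency (%)
(space)              4639   16.2%
(not preserved)      1929    6.7%
1                    1182    4.1%
(not preserved)       939    3.3%
(not preserved)       938    3.3%
(not preserved)       924    3.2%
(not preserved)       889    3.1%
(not preserved)       883    3.1%
(not preserved)       881    3.1%
(not preserved)       875    3.1%
Other values (279)  14512   50.8%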

Most occurring categories

Value               Count   Frequency (%)
Other Letter        16356   57.2%
Decimal Number       4846   16.9%
Space Separator      4639   16.2%
Other Punctuation     844    3.0%
Close Punctuation     804    2.8%
Open Punctuation      804    2.8%
Dash Punctuation      167    0.6%
Uppercase Letter      118    0.4%
Lowercase Letter        8   < 0.1%
Math Symbol             5   < 0.1%

Most frequent character per category

Other Letter
Value               Count   Frequency (%)
(not preserved)      1929   11.8%
(not preserved)       939    5.7%
(not preserved)       938    5.7%
(not preserved)       924    5.6%
(not preserved)       889    5.4%
(not preserved)       883    5.4%
(not preserved)       881    5.4%
(not preserved)       875    5.3%
(not preserved)       803    4.9%
(not preserved)       497    3.0%
Other values (238)   6798   41.6%

Uppercase Letter
Value               Count   Frequency (%)
S                      22   18.6%
K                      22   18.6%
B                      18   15.3%
A                       9    7.6%
H                       8    6.8%
U                       8    6.8%
Y                       7    5.9%
I                       4    3.4%
W                       3    2.5%
V                       3    2.5%
Other values (7)       14   11.9%

Decimal Number
Value               Count   Frequency (%)
1                    1182   24.4%
2                     811   16.7%
3                     577   11.9%
0                     473    9.8%
4                     452    9.3%
5                     351    7.2%
7                     284    5.9%
6                     251    5.2%
8                     238    4.9%
9                     227    4.7%

Lowercase Letter
Value               Count   Frequency (%)
e                       4   50.0%
o                       1   12.5%
l                       1   12.5%
i                       1   12.5%
v                       1   12.5%

Other Punctuation
Value               Count   Frequency (%)
,                     841   99.6%
.                       1    0.1%
/                       1    0.1%
·                       1    0.1%

Space Separator
Value               Count   Frequency (%)
(space)              4639   100.0%

Close Punctuation
Value               Count   Frequency (%)
)                     804   100.0%

Open Punctuation
Value               Count   Frequency (%)
(                     804   100.0%

Dash Punctuation
Value               Count   Frequency (%)
-                     167   100.0%

Math Symbol
Value               Count   Frequency (%)
~                       5   100.0%

Most occurring scripts

Value               Count   Frequency (%)
Hangul              16356   57.2%
Common              12109   42.4%
Latin                 126    0.4%

Most frequent character per script

Hangul
Value               Count   Frequency (%)
(not preserved)      1929   11.8%
(not preserved)       939    5.7%
(not preserved)       938    5.7%
(not preserved)       924    5.6%
(not preserved)       889    5.4%
(not preserved)       883    5.4%
(not preserved)       881    5.4%
(not preserved)       875    5.3%
(not preserved)       803    4.9%
(not preserved)       497    3.0%
Other values (238)   6798   41.6%

Latin
Value               Count   Frequency (%)
S                      22   17.5%
K                      22   17.5%
B                      18   14.3%
A                       9    7.1%
H                       8    6.3%
U                       8    6.3%
Y                       7    5.6%
e                       4    3.2%
I                       4    3.2%
W                       3    2.4%
Other values (12)      21   16.7%

Common
Value               Count   Frequency (%)
(space)              4639   38.3%
1                    1182    9.8%
,                     841    6.9%
2                     811    6.7%
)                     804    6.6%
(                     804    6.6%
3                     577    4.8%
0                     473    3.9%
4                     452    3.7%
5                     351    2.9%
Other values (9)     1175    9.7%

Most occurring blocks

Value               Count   Frequency (%)
Hangul              16356   57.2%
ASCII               12234   42.8%
None                    1   < 0.1%

Most frequent character per block

ASCII
Value               Count   Frequency (%)
(space)              4639   37.9%
1                    1182    9.7%
,                     841    6.9%
2                     811    6.6%
)                     804    6.6%
(                     804    6.6%
3                     577    4.7%
0                     473    3.9%
4                     452    3.7%
5                     351    2.9%
Other values (30)    1300   10.6%

Hangul
Value               Count   Frequency (%)
(not preserved)      1929   11.8%
(not preserved)       939    5.7%
(not preserved)       938    5.7%
(not preserved)       924    5.6%
(not preserved)       889    5.4%
(not preserved)       883    5.4%
(not preserved)       881    5.4%
(not preserved)       875    5.3%
(not preserved)       803    4.9%
(not preserved)       497    3.0%
Other values (238)   6798   41.6%

None
Value               Count   Frequency (%)
·                       1   100.0%

영업소전화번호 (business telephone number)
Text

MISSING 

Distinct: 340
Distinct (%): 96.3%
Missing: 522
Missing (%): 59.7%
Memory size: 7.0 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.042493
Min length: 12

Characters and Unicode

Total characters: 4251
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
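
For this variable the length and character figures can be cross-checked against the missing count. The sketch below assumes that lengths are computed over non-missing values only. The regular expression is an illustrative guess at the number format based on the sample rows, not a rule stated by the report, and `looks_like_phone` is a hypothetical helper.

```python
import re

n_observations = 875
n_missing = 522
total_characters = 4251

# Assumption: length statistics cover only the non-missing values.
n_present = n_observations - n_missing
mean_length = total_characters / n_present
print(n_present)              # 353
print(round(mean_length, 6))  # 12.042493

# Lengths range from 12 to 13, so every character beyond 12 per value
# belongs to a 13-character number.
n_long = total_characters - 12 * n_present
print(n_long)  # 15

# Illustrative pattern: area or service prefix, then two dash-separated groups.
PHONE_PATTERN = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")

def looks_like_phone(value):
    """Return True if the whole value matches the illustrative pattern."""
    return PHONE_PATTERN.fullmatch(value) is not None

print(looks_like_phone("051-710-2286"))   # True
print(looks_like_phone("070-7715-3100"))  # True
print(looks_like_phone("<NA>"))           # False
```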

Unique

Unique: 327
Unique (%): 92.6%

Sample

1st row: 051-710-2286
2nd row: 051-558-5865
3rd row: 051-717-3625
4th row: 051-351-4249
5th row: 051-503-7418
Value               Count   Frequency (%)
051-556-3333            2    0.6%
051-507-5107            2    0.6%
051-853-3517            2    0.6%
051-462-2430            2    0.6%
051-525-6780            2    0.6%
070-7715-3100           2    0.6%
051-556-3434            2    0.6%
051-466-9753            2    0.6%
051-521-3049            2    0.6%
051-583-9671            2    0.6%
Other values (330)    333   94.3%

Most occurring characters

Value               Count   Frequency (%)
5                     896   21.1%
-                     706   16.6%
0                     635   14.9%
1                     568   13.4%
2                     254    6.0%
3                     237    5.6%
8                     215    5.1%
7                     215    5.1%
6                     196    4.6%
4                     185    4.4%

Most occurring categories

Value               Count   Frequency (%)
Decimal Number       3545   83.4%
Dash Punctuation      706   16.6%

Most frequent character per category

Decimal Number
Value               Count   Frequency (%)
5                     896   25.3%
0                     635   17.9%
1                     568   16.0%
2                     254    7.2%
3                     237    6.7%
8                     215    6.1%
7                     215    6.1%
6                     196    5.5%
4                     185    5.2%
9                     144    4.1%

Dash Punctuation
Value               Count   Frequency (%)
-                     706   100.0%

Most occurring scripts

Value               Count   Frequency (%)
Common               4251   100.0%

Most frequent character per script

Common
Value               Count   Frequency (%)
5                     896   21.1%
-                     706   16.6%
0                     635   14.9%
1                     568   13.4%
2                     254    6.0%
3                     237    5.6%
8                     215    5.1%
7                     215    5.1%
6                     196    4.6%
4                     185    4.4%

Most occurring blocks

Value               Count   Frequency (%)
ASCII                4251   100.0%

Most frequent character per block

ASCII
Value               Count   Frequency (%)
5                     896   21.1%
-                     706   16.6%
0                     635   14.9%
1                     568   13.4%
2                     254    6.0%
3                     237    5.6%
8                     215    5.1%
7                     215    5.1%
6                     196    4.6%
4                     185    4.4%

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
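
To make the idea concrete, the sketch below builds a small nullity matrix (True where a cell is missing) from two rows of the sample table and counts missing cells per column. It is a plain-Python illustration, not the code that produced the figures, and it represents `<NA>` as `None`.

```python
columns = ["영업소명", "영업소소재지(도로명)", "영업소전화번호"]

# Two rows from the report's sample table; <NA> is represented as None.
rows = [
    ("기노뷰티살롱", "부산광역시 동래구 온천장로107번길 5, 4층 (온천동)", None),
    ("독일보청기", "부산광역시 동래구 충렬대로 229, 2층 (수안동)", "051-555-5777"),
]

# Nullity matrix: one boolean per cell, True where the value is missing.
nullity = [[value is None for value in row] for row in rows]

# Missing count per column, which is what a nullity bar chart would plot.
missing_per_column = {
    name: sum(flags[i] for flags in nullity)
    for i, name in enumerate(columns)
}
print(missing_per_column["영업소전화번호"])  # 1
```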

Sample

     영업소명  영업소소재지(도로명)  영업소전화번호
0    기노뷰티살롱  부산광역시 동래구 온천장로107번길 5, 4층 (온천동)  <NA>
1    96분식  부산광역시 동래구 명안로85번길 5, 1층 (명장동)  <NA>
2    조이풀뮤직  부산광역시 동래구 명장로 65, 상가1동 202호 (명장동, e편한세상 동래명장)  <NA>
3    주식회사 리온  부산광역시 동래구 명장로20번길 98, 삼성타운 201,206호 (명장동)  <NA>
4    몽인연기학원  부산광역시 동래구 명륜로 78-1, 7층 (수안동)  <NA>
5    (주)넥스트캐즘  부산광역시 동래구 충렬대로202번가길 3, 1층 107호 (수안동)  <NA>
6    모던파마  부산광역시 동래구 금강로 21, 온천빌딩 412호 (온천동)  <NA>
7    폼메디칼  부산광역시 동래구 반송로 352, 훈창빌딩 지하층 105호 (명장동)  <NA>
8    세븐일레븐 부산온천삼익점  부산광역시 동래구 금강로 28 (온천동, 온천삼익아파트)  <NA>
9    GS25 부산명장역점  부산광역시 동래구 명안로85번길 45, 1층 (명장동)  <NA>

     영업소명  영업소소재지(도로명)  영업소전화번호
865  동원치과재료상사  부산광역시 동래구 미남로 54 (사직동)  051-502-6543
866  동래의료기  부산광역시 동래구 안연로109번길 24 (안락동)  051-531-2661
867  유니온메디칼  부산광역시 동래구 아시아드대로154번길 23 (사직동)  051-504-0885
868  동해의료기  부산광역시 동래구 사직3동 157-9  051-503-1516
869  신광의료기기  부산광역시 동래구 사직3동 143-45 1층  051-503-9389
870  ㈜진욱상사  부산광역시 동래구 사직3동 129-9  051-526-3527
871  부산동래의료기상사  부산광역시 동래구 충렬대로181번길 21, 명륜빌딩 403호 (명륜동)  051-557-1648
872  현대치과기재상사  부산광역시 동래구 충렬대로410번길 21, 1층 46호 (안락동, 안락시장상가아파트)  051-554-5599
873  수석콘택트랜즈  부산광역시 동래구 온천3동 1249-6 5층  051-557-2951
874  독일보청기  부산광역시 동래구 충렬대로 229, 2층 (수안동)  051-555-5777

Duplicate rows

Most frequently occurring

     영업소명  영업소소재지(도로명)  영업소전화번호  # duplicates
0    바디닥터  부산광역시 동래구 충렬대로237번길 4, 2층 (수안동)  <NA>  2
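
A row counts as a duplicate when all of its columns match another row. The sketch below shows one way to find such rows with the standard library. It assumes, as the table above suggests, that "# duplicates" is the total number of occurrences of the row; `duplicated_rows` is a hypothetical helper and the second record is included only as a non-duplicate for contrast.

```python
from collections import Counter

def duplicated_rows(rows):
    """Return each fully identical row that occurs more than once, with its count."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}

# <NA> is represented as None; rows are tuples so they can be counted.
body_doctor = ("바디닥터", "부산광역시 동래구 충렬대로237번길 4, 2층 (수안동)", None)
rows = [
    body_doctor,
    ("독일보청기", "부산광역시 동래구 충렬대로 229, 2층 (수안동)", "051-555-5777"),
    body_doctor,
]

duplicates = duplicated_rows(rows)
print(duplicates[body_doctor])  # 2

# One extra copy out of 875 rows gives the rate shown in the overview.
print(f"{100 * 1 / 875:.1f}%")  # 0.1%
```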