Overview

Dataset statistics

Number of variables: 3
Number of observations: 2838
Missing cells: 1241
Missing cells (%): 14.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 66.6 KiB
Average record size in memory: 24.0 B
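
The missing-cell percentage can be checked from the counts above. This is a minimal sketch that assumes the percentage is the number of missing cells divided by the total cell count (variables times observations), which is consistent with the reported 14.6%.

```python
# Counts copied from the dataset statistics above.
n_variables = 3
n_observations = 2838
missing_cells = 1241

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = round(100 * missing_cells / total_cells, 1)

print(f"{missing_cells} of {total_cells} cells missing ({missing_pct}%)")
```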

Variable types

Text: 3

Dataset

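Description: Data on the status of medical device sales businesses in Gyeongsangnam-do, providing information on the city/county, registration date, business name, and location.
Author: 경상남도 (Gyeongsangnam-do)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15017515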

Alerts

연락처 (contact number) has 1239 (43.7%) missing values [Missing]

Reproduction

Analysis started: 2023-12-10 23:39:34.324926
Analysis finished: 2023-12-10 23:39:35.169390
Duration: 0.84 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
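
A report like this one could be regenerated along the following lines. This is a sketch under stated assumptions, not the exact procedure used here: the helper name `build_report`, the CSV path, the report title, and the default encoding are all hypothetical, and the original run's `config.json` is not reproduced. Only `pandas.read_csv`, `ProfileReport`, and `ProfileReport.to_file` are library calls.

```python
def build_report(csv_path: str, html_path: str = "report.html",
                 encoding: str = "utf-8") -> None:
    """Profile a CSV export of the dataset and write an HTML report.

    Hypothetical helper: the file names and the encoding default are
    assumptions, not taken from the original report.
    """
    # Imported lazily so this module loads even where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # dtype=str keeps values such as 055-297-8933 as text, matching the
    # three Text variables (업체명, 소재지, 연락처) shown in this report.
    df = pd.read_csv(csv_path, dtype=str, encoding=encoding)

    report = ProfileReport(df, title="Medical device sales businesses (경상남도)")
    report.to_file(html_path)
```

If the export is not UTF-8 (Korean public data files are sometimes distributed as CP949), a different `encoding` value may be needed.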

Variables

업체명
Text

Distinct: 2677
Distinct (%): 94.3%
Missing: 0
Missing (%): 0.0%
Memory size: 22.3 KiB

Length

Max length: 27
Median length: 22
Mean length: 8.3689218
Min length: 2
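
As an illustration of how such length statistics are derived, the sketch below computes them for the five sample values listed for this variable. It assumes length means the number of Unicode code points (Python's `len`); the figures it produces describe only these five values, not the full column.

```python
from statistics import mean, median

# The five sample values shown for this variable.
sample_names = ["홍익상사", "텃밭의료기상사", "서해통상", "성준메딕스", "파티마의료기"]

# len() counts code points; each precomposed Hangul syllable counts as one.
lengths = [len(name) for name in sample_names]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(length_stats)
```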

Characters and Unicode

Total characters: 23751
Distinct characters: 620
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
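
The category counts in this section can be illustrated with Python's standard `unicodedata` module. This is a minimal sketch over three business names taken from the sample rows of this report, not the profiler's own implementation. The mapping from two-letter codes to the labels used here (`Lo` = Other Letter, `Lu` = Uppercase Letter, `So` = Other Symbol) follows the Unicode general category names.

```python
import unicodedata
from collections import Counter

# Three names from the sample rows: Hangul only, Latin + Hangul, symbol + Hangul.
names = ["홍익상사", "OK메디칼", "㈜에스원"]

# Count the general category of every character across all names.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

for code, count in category_counts.most_common():
    print(code, count)
```

The `㈜` sign falls in Other Symbol, which is consistent with the 49 Other Symbol characters reported for this variable.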

Unique

Unique: 2586
Unique (%): 91.1%

Sample

1st row: 홍익상사
2nd row: 텃밭의료기상사
3rd row: 서해통상
4th row: 성준메딕스
5th row: 파티마의료기
Value | Count | Frequency (%)
gs25 | 121 | 3.2%
세븐일레븐 | 107 | 2.8%
주식회사 | 43 | 1.1%
씨유 | 41 | 1.1%
주)코리아세븐 | 29 | 0.8%
주)아성다이소 | 22 | 0.6%
의료기 | 14 | 0.4%
스튜디오 | 14 | 0.4%
타파웨어 | 13 | 0.3%
롯데하이마트(주 | 12 | 0.3%
Other values (2813) | 3394 | 89.1%

Most occurring characters

ValueCountFrequency (%)
1124
 
4.7%
973
 
4.1%
560
 
2.4%
535
 
2.3%
527
 
2.2%
512
 
2.2%
) 430
 
1.8%
( 422
 
1.8%
395
 
1.7%
394
 
1.7%
Other values (610) 17879
75.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 19892 | 83.8%
Uppercase Letter | 1044 | 4.4%
Space Separator | 973 | 4.1%
Decimal Number | 752 | 3.2%
Close Punctuation | 430 | 1.8%
Open Punctuation | 422 | 1.8%
Lowercase Letter | 127 | 0.5%
Other Punctuation | 55 | 0.2%
Other Symbol | 49 | 0.2%
Dash Punctuation | 6 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1124
 
5.7%
560
 
2.8%
535
 
2.7%
527
 
2.6%
512
 
2.6%
395
 
2.0%
394
 
2.0%
383
 
1.9%
352
 
1.8%
302
 
1.5%
Other values (543) 14808
74.4%
Uppercase Letter
ValueCountFrequency (%)
S 357
34.2%
G 341
32.7%
C 56
 
5.4%
B 33
 
3.2%
U 31
 
3.0%
M 29
 
2.8%
H 28
 
2.7%
O 20
 
1.9%
D 18
 
1.7%
K 16
 
1.5%
Other values (15) 115
 
11.0%
Lowercase Letter
ValueCountFrequency (%)
e 21
16.5%
a 13
10.2%
l 10
7.9%
c 10
7.9%
s 10
7.9%
t 9
 
7.1%
i 8
 
6.3%
d 8
 
6.3%
h 7
 
5.5%
n 7
 
5.5%
Other values (11) 24
18.9%
Decimal Number
ValueCountFrequency (%)
2 350
46.5%
5 345
45.9%
3 17
 
2.3%
1 16
 
2.1%
0 11
 
1.5%
4 6
 
0.8%
6 3
 
0.4%
8 2
 
0.3%
9 1
 
0.1%
7 1
 
0.1%
Other Punctuation
ValueCountFrequency (%)
. 27
49.1%
& 21
38.2%
/ 3
 
5.5%
, 2
 
3.6%
· 2
 
3.6%
Space Separator
ValueCountFrequency (%)
973
100.0%
Close Punctuation
ValueCountFrequency (%)
) 430
100.0%
Open Punctuation
ValueCountFrequency (%)
( 422
100.0%
Other Symbol
ValueCountFrequency (%)
49
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 19941 | 84.0%
Common | 2639 | 11.1%
Latin | 1171 | 4.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1124
 
5.6%
560
 
2.8%
535
 
2.7%
527
 
2.6%
512
 
2.6%
395
 
2.0%
394
 
2.0%
383
 
1.9%
352
 
1.8%
302
 
1.5%
Other values (544) 14857
74.5%
Latin
ValueCountFrequency (%)
S 357
30.5%
G 341
29.1%
C 56
 
4.8%
B 33
 
2.8%
U 31
 
2.6%
M 29
 
2.5%
H 28
 
2.4%
e 21
 
1.8%
O 20
 
1.7%
D 18
 
1.5%
Other values (36) 237
20.2%
Common
ValueCountFrequency (%)
973
36.9%
) 430
16.3%
( 422
16.0%
2 350
 
13.3%
5 345
 
13.1%
. 27
 
1.0%
& 21
 
0.8%
3 17
 
0.6%
1 16
 
0.6%
0 11
 
0.4%
Other values (10) 27
 
1.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 19892 | 83.8%
ASCII | 3808 | 16.0%
None | 51 | 0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1124
 
5.7%
560
 
2.8%
535
 
2.7%
527
 
2.6%
512
 
2.6%
395
 
2.0%
394
 
2.0%
383
 
1.9%
352
 
1.8%
302
 
1.5%
Other values (543) 14808
74.4%
ASCII
ValueCountFrequency (%)
973
25.6%
) 430
11.3%
( 422
11.1%
S 357
 
9.4%
2 350
 
9.2%
5 345
 
9.1%
G 341
 
9.0%
C 56
 
1.5%
B 33
 
0.9%
U 31
 
0.8%
Other values (55) 470
12.3%
None
ValueCountFrequency (%)
49
96.1%
· 2
 
3.9%

소재지
Text

Distinct: 2676
Distinct (%): 94.4%
Missing: 2
Missing (%): 0.1%
Memory size: 22.3 KiB

Length

Max length: 54
Median length: 45
Mean length: 24.541961
Min length: 11

Characters and Unicode

Total characters: 69601
Distinct characters: 460
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2557
Unique (%): 90.2%

Sample

1st row: 창원시 의창구 도계로107번길 6 (도계동)
2nd row: 창원시 의창구 의창대로211번길 2 (서상동)
3rd row: 창원시 의창구 의안로27번길 2 (중동)
4th row: 창원시 의창구 사화로 382 (팔용동, 2층)
5th row: 창원시 의창구 우곡로217번길 10 (명서동)
Value | Count | Frequency (%)
창원시 | 1204 | 8.2%
경상남도 | 452 | 3.1%
진주시 | 446 | 3.0%
김해시 | 419 | 2.8%
의창구 | 341 | 2.3%
마산회원구 | 254 | 1.7%
마산합포구 | 247 | 1.7%
양산시 | 240 | 1.6%
성산구 | 235 | 1.6%
1층 | 213 | 1.4%
Other values (3161) | 10696 | 72.5%

Most occurring characters

ValueCountFrequency (%)
11917
 
17.1%
1 2905
 
4.2%
2881
 
4.1%
2711
 
3.9%
2448
 
3.5%
) 2386
 
3.4%
( 2384
 
3.4%
1792
 
2.6%
2 1778
 
2.6%
1767
 
2.5%
Other values (450) 36632
52.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 39168 | 56.3%
Space Separator | 11917 | 17.1%
Decimal Number | 11731 | 16.9%
Close Punctuation | 2386 | 3.4%
Open Punctuation | 2384 | 3.4%
Other Punctuation | 1390 | 2.0%
Dash Punctuation | 518 | 0.7%
Uppercase Letter | 78 | 0.1%
Lowercase Letter | 27 | < 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2881
 
7.4%
2711
 
6.9%
2448
 
6.2%
1792
 
4.6%
1767
 
4.5%
1330
 
3.4%
1274
 
3.3%
1163
 
3.0%
915
 
2.3%
891
 
2.3%
Other values (396) 21996
56.2%
Uppercase Letter
ValueCountFrequency (%)
B 18
23.1%
A 13
16.7%
C 9
11.5%
S 4
 
5.1%
W 3
 
3.8%
L 3
 
3.8%
Y 3
 
3.8%
D 3
 
3.8%
N 2
 
2.6%
F 2
 
2.6%
Other values (12) 18
23.1%
Lowercase Letter
ValueCountFrequency (%)
i 4
14.8%
a 4
14.8%
y 3
11.1%
t 3
11.1%
m 2
7.4%
c 2
7.4%
u 2
7.4%
e 2
7.4%
l 2
7.4%
r 1
 
3.7%
Other values (2) 2
7.4%
Decimal Number
ValueCountFrequency (%)
1 2905
24.8%
2 1778
15.2%
3 1248
10.6%
0 1040
 
8.9%
5 998
 
8.5%
4 977
 
8.3%
6 782
 
6.7%
7 712
 
6.1%
9 655
 
5.6%
8 636
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 1314
94.5%
· 63
 
4.5%
. 9
 
0.6%
& 3
 
0.2%
; 1
 
0.1%
Space Separator
ValueCountFrequency (%)
11917
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2386
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2384
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 518
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 39168 | 56.3%
Common | 30328 | 43.6%
Latin | 105 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2881
 
7.4%
2711
 
6.9%
2448
 
6.2%
1792
 
4.6%
1767
 
4.5%
1330
 
3.4%
1274
 
3.3%
1163
 
3.0%
915
 
2.3%
891
 
2.3%
Other values (396) 21996
56.2%
Latin
ValueCountFrequency (%)
B 18
17.1%
A 13
 
12.4%
C 9
 
8.6%
i 4
 
3.8%
S 4
 
3.8%
a 4
 
3.8%
W 3
 
2.9%
y 3
 
2.9%
L 3
 
2.9%
t 3
 
2.9%
Other values (24) 41
39.0%
Common
ValueCountFrequency (%)
11917
39.3%
1 2905
 
9.6%
) 2386
 
7.9%
( 2384
 
7.9%
2 1778
 
5.9%
, 1314
 
4.3%
3 1248
 
4.1%
0 1040
 
3.4%
5 998
 
3.3%
4 977
 
3.2%
Other values (10) 3381
 
11.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 39168 | 56.3%
ASCII | 30370 | 43.6%
None | 63 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11917
39.2%
1 2905
 
9.6%
) 2386
 
7.9%
( 2384
 
7.8%
2 1778
 
5.9%
, 1314
 
4.3%
3 1248
 
4.1%
0 1040
 
3.4%
5 998
 
3.3%
4 977
 
3.2%
Other values (43) 3423
 
11.3%
Hangul
ValueCountFrequency (%)
2881
 
7.4%
2711
 
6.9%
2448
 
6.2%
1792
 
4.6%
1767
 
4.5%
1330
 
3.4%
1274
 
3.3%
1163
 
3.0%
915
 
2.3%
891
 
2.3%
Other values (396) 21996
56.2%
None
ValueCountFrequency (%)
· 63
100.0%

연락처
Text

MISSING 

Distinct: 1553
Distinct (%): 97.1%
Missing: 1239
Missing (%): 43.7%
Memory size: 22.3 KiB

Length

Max length: 14
Median length: 12
Mean length: 12.029393
Min length: 11

Characters and Unicode

Total characters: 19235
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1511
Unique (%): 94.5%

Sample

1st row: 055-297-8933
2nd row: 055-277-5675
3rd row: 055-270-8124
4th row: 055-251-2286
5th row: 055-294-0264
Value | Count | Frequency (%)
055-240-6689 | 3 | 0.2%
055-237-5533 | 3 | 0.2%
055-753-8217 | 3 | 0.2%
055-221-2283 | 3 | 0.2%
055-573-2588 | 2 | 0.1%
055-962-6077 | 2 | 0.1%
055-962-3061 | 2 | 0.1%
055-882-5431 | 2 | 0.1%
055-264-7370 | 2 | 0.1%
055-275-7702 | 2 | 0.1%
Other values (1543) | 1575 | 98.5%

Most occurring characters

ValueCountFrequency (%)
5 4365
22.7%
- 3198
16.6%
0 2581
13.4%
2 1651
 
8.6%
3 1372
 
7.1%
7 1219
 
6.3%
6 1108
 
5.8%
4 1083
 
5.6%
8 1066
 
5.5%
1 886
 
4.6%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 16037 | 83.4%
Dash Punctuation | 3198 | 16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 4365
27.2%
0 2581
16.1%
2 1651
 
10.3%
3 1372
 
8.6%
7 1219
 
7.6%
6 1108
 
6.9%
4 1083
 
6.8%
8 1066
 
6.6%
1 886
 
5.5%
9 706
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 3198
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 19235 | 100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 4365
22.7%
- 3198
16.6%
0 2581
13.4%
2 1651
 
8.6%
3 1372
 
7.1%
7 1219
 
6.3%
6 1108
 
5.8%
4 1083
 
5.6%
8 1066
 
5.5%
1 886
 
4.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 19235 | 100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 4365
22.7%
- 3198
16.6%
0 2581
13.4%
2 1651
 
8.6%
3 1372
 
7.1%
7 1219
 
6.3%
6 1108
 
5.8%
4 1083
 
5.6%
8 1066
 
5.5%
1 886
 
4.6%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
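
The per-column nullity summarized by these plots can be sketched with plain Python. The six records below are copied from the sample rows of this report, with `None` standing in for `<NA>`. The counts describe only these six rows, not the full 2838 observations.

```python
# Six records from the report's sample rows; None marks a missing value.
rows = [
    {"업체명": "홍익상사", "소재지": "창원시 의창구 도계로107번길 6 (도계동)", "연락처": None},
    {"업체명": "텃밭의료기상사", "소재지": "창원시 의창구 의창대로211번길 2 (서상동)", "연락처": None},
    {"업체명": "서해통상", "소재지": "창원시 의창구 의안로27번길 2 (중동)", "연락처": None},
    {"업체명": "성준메딕스", "소재지": "창원시 의창구 사화로 382 (팔용동, 2층)", "연락처": "055-297-8933"},
    {"업체명": "김정문알로에합천영업소", "소재지": "합천군 합천읍 동서로 67-1", "연락처": "055-933-0393"},
    {"업체명": "금강복지용구사업소", "소재지": "합천군 합천읍 동서로 42-3", "연락처": None},
]

columns = ["업체명", "소재지", "연락처"]

# Missing count and share per column.
missing = {col: sum(row[col] is None for row in rows) for col in columns}
missing_share = {col: missing[col] / len(rows) for col in columns}

for col in columns:
    print(f"{col}: {missing[col]} missing ({missing_share[col]:.1%})")
```

In this subset only 연락처 has gaps, which mirrors the full dataset, where that column accounts for 1239 of the 1241 missing cells.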

Sample

Index | 업체명 | 소재지 | 연락처
0 | 홍익상사 | 창원시 의창구 도계로107번길 6 (도계동) | <NA>
1 | 텃밭의료기상사 | 창원시 의창구 의창대로211번길 2 (서상동) | <NA>
2 | 서해통상 | 창원시 의창구 의안로27번길 2 (중동) | <NA>
3 | 성준메딕스 | 창원시 의창구 사화로 382 (팔용동, 2층) | 055-297-8933
4 | 파티마의료기 | 창원시 의창구 우곡로217번길 10 (명서동) | <NA>
5 | OK메디칼 | 창원시 의창구 도계로4번길 46 (도계동) | <NA>
6 | 가람메디칼 | 창원시 의창구 의안로62번길 9-2 (소답동) | <NA>
7 | 메디테크 | 창원시 의창구 의창대로282번길 289-22 (소답동) | <NA>
8 | 한빛건강의료기 | 창원시 성산구 중앙대로 85 (중앙동) | <NA>
9 | 제니선터치포인트 | 창원시 의창구 원이대로 581 (용호동) | <NA>

Index | 업체명 | 소재지 | 연락처
2828 | ㈜에스원 | 합천군 합천읍 동서로 113, 범한빌딩 1층 | 055-944-0187
2829 | 금강의료기 | 합천군 합천읍 동서로 42-3 | 055-931-5554
2830 | 미건원적외선 | 합천군 합천읍 서산길 41 | 055-931-9144
2831 | 삼성의료기 | 합천군 합천읍 중앙로2길 21-1 | 055-932-5394
2832 | 합천복지용구센터 | 합천군 합천읍 중앙로 17 (고품아파트) | 055-933-7766
2833 | 김정문알로에합천영업소 | 합천군 합천읍 동서로 67-1 | 055-933-0393
2834 | 금강복지용구사업소 | 합천군 합천읍 동서로 42-3 | <NA>
2835 | 봄의료기 | 합천군 대병면 서부로 2555 | 055-932-5875
2836 | 굿모닝보청기, 굿모닝의료기 | 합천군 합천읍 동서로 87 | 055-931-7774
2837 | 신명의료기 | 합천군 합천읍 옥산로 44, 합천시장 45호 | <NA>