Overview

Dataset statistics

Number of variables4
Number of observations72
Missing cells15
Missing cells (%)5.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory33.8 B

Variable types

Categorical1
Text3

Dataset

Description대구광역시 수성구 의약품 공급업체 현황 입니다. (업종, 업소명, 소재지, 연락처)This is the current status of pharmaceutical suppliers in Suseong-gu, Daegu Metropolitan City. (Industry, business name, location, contact number)
Author대구광역시 수성구
URLhttps://www.data.go.kr/data/15077888/fileData.do

Alerts

업종 is highly imbalanced (73.9%)Imbalance
연락처 has 15 (20.8%) missing valuesMissing
업소명 has unique valuesUnique

Reproduction

Analysis started2024-03-23 06:27:07.102913
Analysis finished2024-03-23 06:27:08.635507
Duration1.53 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

IMBALANCE 

Distinct3
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size708.0 B
일반종합도매
67 
한약도매
 
4
시약도매
 
1

Length

Max length6
Median length6
Mean length5.8611111
Min length4

Unique

Unique1 ?
Unique (%)1.4%

Sample

1st row일반종합도매
2nd row일반종합도매
3rd row일반종합도매
4th row일반종합도매
5th row일반종합도매

Common Values

ValueCountFrequency (%)
일반종합도매 67
93.1%
한약도매 4
 
5.6%
시약도매 1
 
1.4%

Length

2024-03-23T06:27:09.062085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T06:27:09.517964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반종합도매 67
93.1%
한약도매 4
 
5.6%
시약도매 1
 
1.4%

업소명
Text

UNIQUE 

Distinct72
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size708.0 B
2024-03-23T06:27:10.077384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length5.7083333
Min length2

Characters and Unicode

Total characters411
Distinct characters124
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)100.0%

Sample

1st row스카이약품
2nd row주식회사 송빈팜
3rd row시현메디팜
4th row백균약품
5th row다원약품
ValueCountFrequency (%)
주식회사 8
 
10.0%
동성약품 1
 
1.2%
초이스팜 1
 
1.2%
세일약품 1
 
1.2%
원화약품 1
 
1.2%
태양메디 1
 
1.2%
아이엠메드 1
 
1.2%
메디백스 1
 
1.2%
제이어스팜 1
 
1.2%
더조은메디컬 1
 
1.2%
Other values (63) 63
78.8%
2024-03-23T06:27:11.504136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25
 
6.1%
24
 
5.8%
22
 
5.4%
20
 
4.9%
17
 
4.1%
16
 
3.9%
16
 
3.9%
14
 
3.4%
13
 
3.2%
10
 
2.4%
Other values (114) 234
56.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 371
90.3%
Uppercase Letter 14
 
3.4%
Space Separator 8
 
1.9%
Close Punctuation 8
 
1.9%
Open Punctuation 8
 
1.9%
Other Symbol 1
 
0.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
6.7%
24
 
6.5%
22
 
5.9%
20
 
5.4%
17
 
4.6%
16
 
4.3%
16
 
4.3%
14
 
3.8%
13
 
3.5%
10
 
2.7%
Other values (101) 194
52.3%
Uppercase Letter
ValueCountFrequency (%)
R 3
21.4%
H 2
14.3%
P 2
14.3%
M 2
14.3%
A 2
14.3%
G 1
 
7.1%
C 1
 
7.1%
K 1
 
7.1%
Space Separator
ValueCountFrequency (%)
8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 372
90.5%
Common 25
 
6.1%
Latin 14
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
6.7%
24
 
6.5%
22
 
5.9%
20
 
5.4%
17
 
4.6%
16
 
4.3%
16
 
4.3%
14
 
3.8%
13
 
3.5%
10
 
2.7%
Other values (102) 195
52.4%
Latin
ValueCountFrequency (%)
R 3
21.4%
H 2
14.3%
P 2
14.3%
M 2
14.3%
A 2
14.3%
G 1
 
7.1%
C 1
 
7.1%
K 1
 
7.1%
Common
ValueCountFrequency (%)
8
32.0%
) 8
32.0%
( 8
32.0%
& 1
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 371
90.3%
ASCII 39
 
9.5%
None 1
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
25
 
6.7%
24
 
6.5%
22
 
5.9%
20
 
5.4%
17
 
4.6%
16
 
4.3%
16
 
4.3%
14
 
3.8%
13
 
3.5%
10
 
2.7%
Other values (101) 194
52.3%
ASCII
ValueCountFrequency (%)
8
20.5%
) 8
20.5%
( 8
20.5%
R 3
 
7.7%
H 2
 
5.1%
P 2
 
5.1%
M 2
 
5.1%
A 2
 
5.1%
G 1
 
2.6%
C 1
 
2.6%
Other values (2) 2
 
5.1%
None
ValueCountFrequency (%)
1
100.0%
Distinct71
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size708.0 B
2024-03-23T06:27:12.382904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length36
Mean length30.083333
Min length22

Characters and Unicode

Total characters2166
Distinct characters91
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)97.2%

Sample

1st row대구광역시 수성구 무학로11길 42-18, 3층 302호 (상동)
2nd row대구광역시 수성구 용학로42길 9, 에이동 2층 203호 (지산동)
3rd row대구광역시 수성구 범어천로 54, 3층 3B-40호 (범어동)
4th row대구광역시 수성구 알파시티1로4길 8, 1103호 (대흥동)
5th row대구광역시 수성구 달구벌대로 2319-8, 7층 707호 (수성동4가)
ValueCountFrequency (%)
대구광역시 72
 
16.5%
수성구 72
 
16.5%
2층 19
 
4.3%
만촌동 17
 
3.9%
3층 14
 
3.2%
상동 7
 
1.6%
황금동 7
 
1.6%
1층 7
 
1.6%
지산동 6
 
1.4%
수성로 5
 
1.1%
Other values (150) 211
48.3%
2024-03-23T06:27:14.003944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
365
 
16.9%
151
 
7.0%
95
 
4.4%
93
 
4.3%
2 84
 
3.9%
83
 
3.8%
80
 
3.7%
1 78
 
3.6%
77
 
3.6%
( 73
 
3.4%
Other values (81) 987
45.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1189
54.9%
Decimal Number 385
 
17.8%
Space Separator 365
 
16.9%
Open Punctuation 73
 
3.4%
Close Punctuation 73
 
3.4%
Other Punctuation 63
 
2.9%
Dash Punctuation 17
 
0.8%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
151
12.7%
95
 
8.0%
93
 
7.8%
83
 
7.0%
80
 
6.7%
77
 
6.5%
73
 
6.1%
72
 
6.1%
72
 
6.1%
55
 
4.6%
Other values (65) 338
28.4%
Decimal Number
ValueCountFrequency (%)
2 84
21.8%
1 78
20.3%
3 45
11.7%
0 44
11.4%
4 32
 
8.3%
9 24
 
6.2%
5 24
 
6.2%
8 22
 
5.7%
6 18
 
4.7%
7 14
 
3.6%
Space Separator
ValueCountFrequency (%)
365
100.0%
Open Punctuation
ValueCountFrequency (%)
( 73
100.0%
Close Punctuation
ValueCountFrequency (%)
) 73
100.0%
Other Punctuation
ValueCountFrequency (%)
, 63
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1189
54.9%
Common 976
45.1%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
151
12.7%
95
 
8.0%
93
 
7.8%
83
 
7.0%
80
 
6.7%
77
 
6.5%
73
 
6.1%
72
 
6.1%
72
 
6.1%
55
 
4.6%
Other values (65) 338
28.4%
Common
ValueCountFrequency (%)
365
37.4%
2 84
 
8.6%
1 78
 
8.0%
( 73
 
7.5%
) 73
 
7.5%
, 63
 
6.5%
3 45
 
4.6%
0 44
 
4.5%
4 32
 
3.3%
9 24
 
2.5%
Other values (5) 95
 
9.7%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1189
54.9%
ASCII 977
45.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
365
37.4%
2 84
 
8.6%
1 78
 
8.0%
( 73
 
7.5%
) 73
 
7.5%
, 63
 
6.4%
3 45
 
4.6%
0 44
 
4.5%
4 32
 
3.3%
9 24
 
2.5%
Other values (6) 96
 
9.8%
Hangul
ValueCountFrequency (%)
151
12.7%
95
 
8.0%
93
 
7.8%
83
 
7.0%
80
 
6.7%
77
 
6.5%
73
 
6.1%
72
 
6.1%
72
 
6.1%
55
 
4.6%
Other values (65) 338
28.4%

연락처
Text

MISSING 

Distinct56
Distinct (%)98.2%
Missing15
Missing (%)20.8%
Memory size708.0 B
2024-03-23T06:27:15.343977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.035088
Min length12

Characters and Unicode

Total characters686
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)96.5%

Sample

1st row053-767-7582
2nd row053-793-2580
3rd row053-745-5701
4th row053-248-0100
5th row053-710-9004
ValueCountFrequency (%)
053-766-6120 2
 
3.5%
053-426-1087 1
 
1.8%
053-751-3707 1
 
1.8%
053-286-7778 1
 
1.8%
053-252-5833 1
 
1.8%
053-782-2177 1
 
1.8%
053-759-3525 1
 
1.8%
053-764-8634 1
 
1.8%
000-000-0000 1
 
1.8%
070-4065-6651 1
 
1.8%
Other values (46) 46
80.7%
2024-03-23T06:27:16.825814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 114
16.6%
0 111
16.2%
5 97
14.1%
3 90
13.1%
7 73
10.6%
6 50
7.3%
8 36
 
5.2%
2 35
 
5.1%
1 34
 
5.0%
4 24
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 572
83.4%
Dash Punctuation 114
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 111
19.4%
5 97
17.0%
3 90
15.7%
7 73
12.8%
6 50
8.7%
8 36
 
6.3%
2 35
 
6.1%
1 34
 
5.9%
4 24
 
4.2%
9 22
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 114
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 686
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 114
16.6%
0 111
16.2%
5 97
14.1%
3 90
13.1%
7 73
10.6%
6 50
7.3%
8 36
 
5.2%
2 35
 
5.1%
1 34
 
5.0%
4 24
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 686
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 114
16.6%
0 111
16.2%
5 97
14.1%
3 90
13.1%
7 73
10.6%
6 50
7.3%
8 36
 
5.2%
2 35
 
5.1%
1 34
 
5.0%
4 24
 
3.5%

Correlations

2024-03-23T06:27:17.192313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종업소명소재지연락처
업종1.0001.0001.0001.000
업소명1.0001.0001.0001.000
소재지1.0001.0001.0001.000
연락처1.0001.0001.0001.000

Missing values

2024-03-23T06:27:08.087189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T06:27:08.481083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종업소명소재지연락처
0일반종합도매스카이약품대구광역시 수성구 무학로11길 42-18, 3층 302호 (상동)053-767-7582
1일반종합도매주식회사 송빈팜대구광역시 수성구 용학로42길 9, 에이동 2층 203호 (지산동)<NA>
2일반종합도매시현메디팜대구광역시 수성구 범어천로 54, 3층 3B-40호 (범어동)<NA>
3일반종합도매백균약품대구광역시 수성구 알파시티1로4길 8, 1103호 (대흥동)053-793-2580
4일반종합도매다원약품대구광역시 수성구 달구벌대로 2319-8, 7층 707호 (수성동4가)053-745-5701
5일반종합도매유마힐메디대구광역시 수성구 국채보상로 985, 2층 203호 (만촌동)053-248-0100
6일반종합도매이오약품 주식회사대구광역시 수성구 무열로 35, 4층 (만촌동)053-710-9004
7일반종합도매자이언트파마대구광역시 수성구 무학로31길 92, 2층 (지산동)<NA>
8일반종합도매케이제이팜대구광역시 수성구 동대구로14길 81, 102호 (지산동)053-214-8013
9일반종합도매더케어팜대구광역시 수성구 범어로34길 7-20, 2층 (범어동)<NA>
업종업소명소재지연락처
62일반종합도매상록팜대구광역시 수성구 수성로 355, 4층 (수성동1가)053-653-3838
63일반종합도매다정팜대구광역시 수성구 동대구로20길 91, 1층 (지산동)053-765-9191
64일반종합도매수림약품대구광역시 수성구 희망로 199, 5층 (황금동)053-766-6120
65일반종합도매대호약품대구광역시 수성구 화랑로 60 (만촌동, (지상6층, 지상7층))053-768-8901
66한약도매현대약업사대구광역시 수성구 들안로19길 58, 지상1층 (상동)053-763-9466
67한약도매금포나라약업사대구광역시 수성구 수성로25길 74 (상동)053-766-0776
68한약도매세종허브대구광역시 수성구 수성로25길 56, 1층 (상동)053-768-6663
69일반종합도매주식회사동국헬스케어대구광역시 수성구 청호로69길 49 (황금동)053-751-0805
70일반종합도매경일약품㈜대구광역시 수성구 들안로 142(황금동)053-766-3400
71일반종합도매경안약품대구광역시 수성구 무열로 65 (만촌동)053-755-7991