Overview

Dataset statistics

Number of variables4
Number of observations77
Missing cells8
Missing cells (%)2.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory33.7 B

Variable types

Categorical1
Text3

Dataset

Description경기도 고양시(덕양구, 일산서구, 일산동구) 의약품도매업소 현황으로 업종, 업소명, 소재지, 전화번호 항목을 제공합니다.
Author경기도 고양시
URLhttps://www.data.go.kr/data/3078256/fileData.do

Alerts

영업종별 is highly imbalanced (61.1%)Imbalance
영업소전화번호 has 8 (10.4%) missing valuesMissing
영업소명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:09:29.530996
Analysis finished2023-12-12 13:09:30.109649
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

영업종별
Categorical

IMBALANCE 

Distinct6
Distinct (%)7.8%
Missing0
Missing (%)0.0%
Memory size748.0 B
일반종합도매
64 
한약도매
 
5
원료의약품도매
 
3
의료용 고압가스
 
3
수입의약품도매
 
1

Length

Max length8
Median length6
Mean length5.9480519
Min length2

Unique

Unique2 ?
Unique (%)2.6%

Sample

1st row일반종합도매
2nd row일반종합도매
3rd row일반종합도매
4th row일반종합도매
5th row일반종합도매

Common Values

ValueCountFrequency (%)
일반종합도매 64
83.1%
한약도매 5
 
6.5%
원료의약품도매 3
 
3.9%
의료용 고압가스 3
 
3.9%
수입의약품도매 1
 
1.3%
기타 1
 
1.3%

Length

2023-12-12T22:09:30.178630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:09:30.293016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반종합도매 64
80.0%
한약도매 5
 
6.2%
원료의약품도매 3
 
3.8%
의료용 3
 
3.8%
고압가스 3
 
3.8%
수입의약품도매 1
 
1.2%
기타 1
 
1.2%

영업소명
Text

UNIQUE 

Distinct77
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size748.0 B
2023-12-12T22:09:30.526720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length9
Mean length7.1688312
Min length3

Characters and Unicode

Total characters552
Distinct characters131
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)100.0%

Sample

1st row(주)메디업파트너스
2nd row지에프텍
3rd row(주)비즈메디코리아
4th row(주)에스에이치이앤티
5th row유진팜
ValueCountFrequency (%)
주식회사 12
 
13.3%
솔과향약업사 1
 
1.1%
주)휴인메디컬 1
 
1.1%
주)대경약품 1
 
1.1%
드림팜 1
 
1.1%
지엘메디칼 1
 
1.1%
제이와이팜텍 1
 
1.1%
청아약품 1
 
1.1%
주)케이앤에스팜 1
 
1.1%
태종약품주식회사 1
 
1.1%
Other values (69) 69
76.7%
2023-12-12T22:09:30.936059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
 
9.8%
( 36
 
6.5%
) 36
 
6.5%
22
 
4.0%
20
 
3.6%
19
 
3.4%
18
 
3.3%
18
 
3.3%
17
 
3.1%
17
 
3.1%
Other values (121) 295
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 466
84.4%
Open Punctuation 36
 
6.5%
Close Punctuation 36
 
6.5%
Space Separator 13
 
2.4%
Other Symbol 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
11.6%
22
 
4.7%
20
 
4.3%
19
 
4.1%
18
 
3.9%
18
 
3.9%
17
 
3.6%
17
 
3.6%
16
 
3.4%
15
 
3.2%
Other values (117) 250
53.6%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Space Separator
ValueCountFrequency (%)
13
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 467
84.6%
Common 85
 
15.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
11.6%
22
 
4.7%
20
 
4.3%
19
 
4.1%
18
 
3.9%
18
 
3.9%
17
 
3.6%
17
 
3.6%
16
 
3.4%
15
 
3.2%
Other values (118) 251
53.7%
Common
ValueCountFrequency (%)
( 36
42.4%
) 36
42.4%
13
 
15.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 466
84.4%
ASCII 85
 
15.4%
None 1
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
 
11.6%
22
 
4.7%
20
 
4.3%
19
 
4.1%
18
 
3.9%
18
 
3.9%
17
 
3.6%
17
 
3.6%
16
 
3.4%
15
 
3.2%
Other values (117) 250
53.6%
ASCII
ValueCountFrequency (%)
( 36
42.4%
) 36
42.4%
13
 
15.3%
None
ValueCountFrequency (%)
1
100.0%
Distinct76
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size748.0 B
2023-12-12T22:09:31.301234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length43
Mean length37.194805
Min length25

Characters and Unicode

Total characters2864
Distinct characters171
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)97.4%

Sample

1st row경기도 고양시 일산동구 숲속마을로 14-15, 일산 드림월드 5층 503(일부)호 (풍동)
2nd row경기도 고양시 덕양구 고골길116번길 39-52 (관산동)
3rd row경기도 고양시 덕양구 행주로83번길 34-8, 2층 (행주내동)
4th row경기도 고양시 덕양구 대덕로200번길 124-2, 1층 일부호 (현천동)
5th row경기도 고양시 덕양구 통일로 343 (신원동)
ValueCountFrequency (%)
경기도 77
 
13.0%
고양시 77
 
13.0%
덕양구 45
 
7.6%
일산동구 26
 
4.4%
2층 11
 
1.9%
행신동 9
 
1.5%
일부호 9
 
1.5%
토당동 9
 
1.5%
1층 9
 
1.5%
3층 7
 
1.2%
Other values (237) 314
53.0%
2023-12-12T22:09:31.824411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
516
 
18.0%
126
 
4.4%
113
 
3.9%
1 108
 
3.8%
, 90
 
3.1%
84
 
2.9%
( 83
 
2.9%
) 83
 
2.9%
81
 
2.8%
79
 
2.8%
Other values (161) 1501
52.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1613
56.3%
Space Separator 516
 
18.0%
Decimal Number 439
 
15.3%
Other Punctuation 91
 
3.2%
Open Punctuation 83
 
2.9%
Close Punctuation 83
 
2.9%
Dash Punctuation 31
 
1.1%
Uppercase Letter 7
 
0.2%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
126
 
7.8%
113
 
7.0%
84
 
5.2%
81
 
5.0%
79
 
4.9%
79
 
4.9%
77
 
4.8%
77
 
4.8%
75
 
4.6%
61
 
3.8%
Other values (141) 761
47.2%
Decimal Number
ValueCountFrequency (%)
1 108
24.6%
2 73
16.6%
0 59
13.4%
3 48
10.9%
5 36
 
8.2%
4 33
 
7.5%
8 27
 
6.2%
6 27
 
6.2%
9 20
 
4.6%
7 8
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
I 4
57.1%
B 2
28.6%
A 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
, 90
98.9%
& 1
 
1.1%
Space Separator
ValueCountFrequency (%)
516
100.0%
Open Punctuation
ValueCountFrequency (%)
( 83
100.0%
Close Punctuation
ValueCountFrequency (%)
) 83
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1613
56.3%
Common 1243
43.4%
Latin 8
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
126
 
7.8%
113
 
7.0%
84
 
5.2%
81
 
5.0%
79
 
4.9%
79
 
4.9%
77
 
4.8%
77
 
4.8%
75
 
4.6%
61
 
3.8%
Other values (141) 761
47.2%
Common
ValueCountFrequency (%)
516
41.5%
1 108
 
8.7%
, 90
 
7.2%
( 83
 
6.7%
) 83
 
6.7%
2 73
 
5.9%
0 59
 
4.7%
3 48
 
3.9%
5 36
 
2.9%
4 33
 
2.7%
Other values (6) 114
 
9.2%
Latin
ValueCountFrequency (%)
I 4
50.0%
B 2
25.0%
A 1
 
12.5%
1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1613
56.3%
ASCII 1250
43.6%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
516
41.3%
1 108
 
8.6%
, 90
 
7.2%
( 83
 
6.6%
) 83
 
6.6%
2 73
 
5.8%
0 59
 
4.7%
3 48
 
3.8%
5 36
 
2.9%
4 33
 
2.6%
Other values (9) 121
 
9.7%
Hangul
ValueCountFrequency (%)
126
 
7.8%
113
 
7.0%
84
 
5.2%
81
 
5.0%
79
 
4.9%
79
 
4.9%
77
 
4.8%
77
 
4.8%
75
 
4.6%
61
 
3.8%
Other values (141) 761
47.2%
Number Forms
ValueCountFrequency (%)
1
100.0%

영업소전화번호
Text

MISSING 

Distinct68
Distinct (%)98.6%
Missing8
Missing (%)10.4%
Memory size748.0 B
2023-12-12T22:09:32.122031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.072464
Min length9

Characters and Unicode

Total characters833
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)97.1%

Sample

1st row02-2243-5855
2nd row02-2662-7417
3rd row02-2664-4671
4th row02-3141-6605
5th row02-371-3751
ValueCountFrequency (%)
031-922-5181 2
 
2.9%
031-975-8933 1
 
1.4%
031-971-9025 1
 
1.4%
031-973-3365 1
 
1.4%
031-973-9994 1
 
1.4%
031-974-5556 1
 
1.4%
031-975-3156 1
 
1.4%
031-979-4315 1
 
1.4%
031-976-7361 1
 
1.4%
031-966-1999 1
 
1.4%
Other values (58) 58
84.1%
2023-12-12T22:09:32.613554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 137
16.4%
0 120
14.4%
1 105
12.6%
3 93
11.2%
9 87
10.4%
7 73
8.8%
6 55
6.6%
2 47
 
5.6%
8 46
 
5.5%
5 36
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 696
83.6%
Dash Punctuation 137
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 120
17.2%
1 105
15.1%
3 93
13.4%
9 87
12.5%
7 73
10.5%
6 55
7.9%
2 47
 
6.8%
8 46
 
6.6%
5 36
 
5.2%
4 34
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 137
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 833
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 137
16.4%
0 120
14.4%
1 105
12.6%
3 93
11.2%
9 87
10.4%
7 73
8.8%
6 55
6.6%
2 47
 
5.6%
8 46
 
5.5%
5 36
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 833
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 137
16.4%
0 120
14.4%
1 105
12.6%
3 93
11.2%
9 87
10.4%
7 73
8.8%
6 55
6.6%
2 47
 
5.6%
8 46
 
5.5%
5 36
 
4.3%

Correlations

2023-12-12T22:09:32.732185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업종별영업소명영업소소재지(도로명)영업소전화번호
영업종별1.0001.0001.0001.000
영업소명1.0001.0001.0001.000
영업소소재지(도로명)1.0001.0001.0000.998
영업소전화번호1.0001.0000.9981.000

Missing values

2023-12-12T22:09:29.991041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:09:30.076927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

영업종별영업소명영업소소재지(도로명)영업소전화번호
0일반종합도매(주)메디업파트너스경기도 고양시 일산동구 숲속마을로 14-15, 일산 드림월드 5층 503(일부)호 (풍동)02-2243-5855
1일반종합도매지에프텍경기도 고양시 덕양구 고골길116번길 39-52 (관산동)02-2662-7417
2일반종합도매(주)비즈메디코리아경기도 고양시 덕양구 행주로83번길 34-8, 2층 (행주내동)02-2664-4671
3일반종합도매(주)에스에이치이앤티경기도 고양시 덕양구 대덕로200번길 124-2, 1층 일부호 (현천동)02-3141-6605
4일반종합도매유진팜경기도 고양시 덕양구 통일로 343 (신원동)02-371-3751
5원료의약품도매(주)디엠경기도 고양시 덕양구 삼송로136번길 81, 1층 (삼송동)02-381-8217
6일반종합도매(주)한결메디칼경기도 고양시 덕양구 신원로2길 38-9, 1층 (신원동)02-381-8941
7수입의약품도매주식회사 메디엘경기도 고양시 덕양구 으뜸로 124, 드림코어테라스 1303호 (덕은동)02-443-9588
8일반종합도매엠브이엠앤엘경기도 고양시 덕양구 화랑로 59, 3층 (화전동)02-6406-0820
9일반종합도매주식회사 파마메디스경기도 고양시 덕양구 권율대로 890, 중앙프라자 5층 501호 (신원동)02-6956-5640
영업종별영업소명영업소소재지(도로명)영업소전화번호
67일반종합도매(주)바로팜경기도 고양시 일산동구 숲속마을로 22, 진넥스 블루오션 9층 (풍동)070-8871-0749
68일반종합도매서울메디케어경기도 고양시 덕양구 용두로 29-2, 401호 (용두동)1661-6919
69일반종합도매삼송바이오경기도 고양시 일산동구 중앙로 1192, 601(일부)호 (마두동)<NA>
70일반종합도매주식회사 미르파마경기도 고양시 덕양구 화랑로 33-1, 가동 일부호 (화전동)<NA>
71일반종합도매㈜베네메디경기도 고양시 일산동구 백마로 195, 엠시티타워&엠시티오피스텔 2-2112호 (장항동)<NA>
72일반종합도매아이진(주)경기도 고양시 덕양구 의장로 122-1, 삼일빌딩 2층 (도내동)<NA>
73일반종합도매(주)도원약품경기도 고양시 덕양구 호수로 100, 2층 (토당동)<NA>
74의료용 고압가스미소테크경기도 고양시 일산서구 대화로37번길 148 (법곳동)<NA>
75일반종합도매메디그린경기도 고양시 일산동구 중앙로 1080, 207(일부)호 (백석동, 남정골드프라자)<NA>
76일반종합도매(주)신텍스팜경기도 고양시 덕양구 은빛로 53, 203-1호 (화정동, 코스미온빌)<NA>