Overview

Dataset statistics

Number of variables4
Number of observations93
Missing cells12
Missing cells (%)3.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.0 KiB
Average record size in memory33.4 B

Variable types

Text3
Categorical1

Dataset

Description경기도 용인시 의약품 공급업체 현황입니다. 업체명, 업체주소, 연락처 등의 데이터를 제공합니다. ※ 데이터기준일자 : 2023-05-11
URLhttps://www.data.go.kr/data/15041748/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연락처 has 12 (12.9%) missing valuesMissing
업체명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:38:20.832474
Analysis finished2023-12-12 09:38:21.691805
Duration0.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

UNIQUE 

Distinct93
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size876.0 B
2023-12-12T18:38:21.918675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length7.655914
Min length3

Characters and Unicode

Total characters712
Distinct characters151
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)100.0%

Sample

1st row(주)다올 로지스틱
2nd row(주)다함메디
3rd row(주)레포스
4th row(주)메디웰에코
5th row(주)비아다빈치
ValueCountFrequency (%)
주식회사 19
 
16.1%
용인지점 2
 
1.7%
주)다올 1
 
0.8%
다모더랩 1
 
0.8%
메디진 1
 
0.8%
더맥팜 1
 
0.8%
다산메디칼 1
 
0.8%
제이엔코 1
 
0.8%
제이스팜 1
 
0.8%
정마 1
 
0.8%
Other values (89) 89
75.4%
2023-12-12T18:38:22.372341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
65
 
9.1%
( 44
 
6.2%
) 44
 
6.2%
25
 
3.5%
25
 
3.5%
24
 
3.4%
24
 
3.4%
22
 
3.1%
22
 
3.1%
22
 
3.1%
Other values (141) 395
55.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 586
82.3%
Open Punctuation 44
 
6.2%
Close Punctuation 44
 
6.2%
Space Separator 25
 
3.5%
Uppercase Letter 8
 
1.1%
Other Symbol 5
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
65
 
11.1%
25
 
4.3%
24
 
4.1%
24
 
4.1%
22
 
3.8%
22
 
3.8%
22
 
3.8%
21
 
3.6%
21
 
3.6%
15
 
2.6%
Other values (131) 325
55.5%
Uppercase Letter
ValueCountFrequency (%)
S 2
25.0%
M 2
25.0%
P 1
12.5%
C 1
12.5%
B 1
12.5%
K 1
12.5%
Open Punctuation
ValueCountFrequency (%)
( 44
100.0%
Close Punctuation
ValueCountFrequency (%)
) 44
100.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 591
83.0%
Common 113
 
15.9%
Latin 8
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
65
 
11.0%
25
 
4.2%
24
 
4.1%
24
 
4.1%
22
 
3.7%
22
 
3.7%
22
 
3.7%
21
 
3.6%
21
 
3.6%
15
 
2.5%
Other values (132) 330
55.8%
Latin
ValueCountFrequency (%)
S 2
25.0%
M 2
25.0%
P 1
12.5%
C 1
12.5%
B 1
12.5%
K 1
12.5%
Common
ValueCountFrequency (%)
( 44
38.9%
) 44
38.9%
25
22.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 586
82.3%
ASCII 121
 
17.0%
None 5
 
0.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
65
 
11.1%
25
 
4.3%
24
 
4.1%
24
 
4.1%
22
 
3.8%
22
 
3.8%
22
 
3.8%
21
 
3.6%
21
 
3.6%
15
 
2.6%
Other values (131) 325
55.5%
ASCII
ValueCountFrequency (%)
( 44
36.4%
) 44
36.4%
25
20.7%
S 2
 
1.7%
M 2
 
1.7%
P 1
 
0.8%
C 1
 
0.8%
B 1
 
0.8%
K 1
 
0.8%
None
ValueCountFrequency (%)
5
100.0%
Distinct91
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size876.0 B
2023-12-12T18:38:22.671836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length44
Mean length35.376344
Min length23

Characters and Unicode

Total characters3290
Distinct characters169
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)95.7%

Sample

1st row경기도 용인시 처인구 남사읍 천덕산로 116-4, 2층
2nd row경기도 용인시 처인구 포곡읍 마성로 386, 102,103호
3rd row경기도 용인시 처인구 모현읍 능곡로 151, 2층
4th row경기도 용인시 처인구 모현읍 오포로25번길 17, A동
5th row경기도 용인시 처인구 남사읍 천덕산로 116-2, 2층,3층
ValueCountFrequency (%)
경기도 93
 
13.6%
용인시 93
 
13.6%
기흥구 50
 
7.3%
처인구 27
 
4.0%
보라동 21
 
3.1%
한보라1로10번길 21
 
3.1%
15 21
 
3.1%
수지구 16
 
2.3%
2층 9
 
1.3%
영덕동 8
 
1.2%
Other values (224) 324
47.4%
2023-12-12T18:38:23.116643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
590
 
17.9%
1 162
 
4.9%
147
 
4.5%
125
 
3.8%
101
 
3.1%
100
 
3.0%
96
 
2.9%
95
 
2.9%
95
 
2.9%
94
 
2.9%
Other values (159) 1685
51.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1837
55.8%
Space Separator 590
 
17.9%
Decimal Number 579
 
17.6%
Other Punctuation 87
 
2.6%
Close Punctuation 74
 
2.2%
Open Punctuation 74
 
2.2%
Dash Punctuation 28
 
0.9%
Uppercase Letter 19
 
0.6%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
147
 
8.0%
125
 
6.8%
101
 
5.5%
100
 
5.4%
96
 
5.2%
95
 
5.2%
95
 
5.2%
94
 
5.1%
88
 
4.8%
59
 
3.2%
Other values (130) 837
45.6%
Uppercase Letter
ValueCountFrequency (%)
A 6
31.6%
B 3
15.8%
U 2
 
10.5%
T 1
 
5.3%
O 1
 
5.3%
W 1
 
5.3%
E 1
 
5.3%
R 1
 
5.3%
H 1
 
5.3%
S 1
 
5.3%
Decimal Number
ValueCountFrequency (%)
1 162
28.0%
0 90
15.5%
2 85
14.7%
3 58
 
10.0%
4 47
 
8.1%
5 43
 
7.4%
8 30
 
5.2%
7 23
 
4.0%
6 23
 
4.0%
9 18
 
3.1%
Other Punctuation
ValueCountFrequency (%)
, 86
98.9%
/ 1
 
1.1%
Lowercase Letter
ValueCountFrequency (%)
u 1
50.0%
b 1
50.0%
Space Separator
ValueCountFrequency (%)
590
100.0%
Close Punctuation
ValueCountFrequency (%)
) 74
100.0%
Open Punctuation
ValueCountFrequency (%)
( 74
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1837
55.8%
Common 1432
43.5%
Latin 21
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
147
 
8.0%
125
 
6.8%
101
 
5.5%
100
 
5.4%
96
 
5.2%
95
 
5.2%
95
 
5.2%
94
 
5.1%
88
 
4.8%
59
 
3.2%
Other values (130) 837
45.6%
Common
ValueCountFrequency (%)
590
41.2%
1 162
 
11.3%
0 90
 
6.3%
, 86
 
6.0%
2 85
 
5.9%
) 74
 
5.2%
( 74
 
5.2%
3 58
 
4.1%
4 47
 
3.3%
5 43
 
3.0%
Other values (6) 123
 
8.6%
Latin
ValueCountFrequency (%)
A 6
28.6%
B 3
14.3%
U 2
 
9.5%
T 1
 
4.8%
O 1
 
4.8%
W 1
 
4.8%
E 1
 
4.8%
R 1
 
4.8%
u 1
 
4.8%
b 1
 
4.8%
Other values (3) 3
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1837
55.8%
ASCII 1453
44.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
590
40.6%
1 162
 
11.1%
0 90
 
6.2%
, 86
 
5.9%
2 85
 
5.8%
) 74
 
5.1%
( 74
 
5.1%
3 58
 
4.0%
4 47
 
3.2%
5 43
 
3.0%
Other values (19) 144
 
9.9%
Hangul
ValueCountFrequency (%)
147
 
8.0%
125
 
6.8%
101
 
5.5%
100
 
5.4%
96
 
5.2%
95
 
5.2%
95
 
5.2%
94
 
5.1%
88
 
4.8%
59
 
3.2%
Other values (130) 837
45.6%

연락처
Text

MISSING 

Distinct80
Distinct (%)98.8%
Missing12
Missing (%)12.9%
Memory size876.0 B
2023-12-12T18:38:23.363649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.061728
Min length9

Characters and Unicode

Total characters977
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79 ?
Unique (%)97.5%

Sample

1st row031-334-1300
2nd row031-705-0041
3rd row070-4012-0372
4th row031-321-9073
5th row031-337-2300
ValueCountFrequency (%)
031-548-2053 2
 
2.5%
031-321-6635 1
 
1.2%
031-334-1300 1
 
1.2%
031-284-2430 1
 
1.2%
031-284-0099 1
 
1.2%
031-278-7776 1
 
1.2%
070-7095-9501 1
 
1.2%
031-278-6361 1
 
1.2%
031-693-7642 1
 
1.2%
031-284-0788 1
 
1.2%
Other values (70) 70
86.4%
2023-12-12T18:38:23.741209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 161
16.5%
- 159
16.3%
3 145
14.8%
1 119
12.2%
2 101
10.3%
7 63
 
6.4%
8 57
 
5.8%
5 49
 
5.0%
6 43
 
4.4%
4 42
 
4.3%
Other values (2) 38
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 815
83.4%
Dash Punctuation 159
 
16.3%
Math Symbol 3
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 161
19.8%
3 145
17.8%
1 119
14.6%
2 101
12.4%
7 63
 
7.7%
8 57
 
7.0%
5 49
 
6.0%
6 43
 
5.3%
4 42
 
5.2%
9 35
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 159
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 977
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 161
16.5%
- 159
16.3%
3 145
14.8%
1 119
12.2%
2 101
10.3%
7 63
 
6.4%
8 57
 
5.8%
5 49
 
5.0%
6 43
 
4.4%
4 42
 
4.3%
Other values (2) 38
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 977
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 161
16.5%
- 159
16.3%
3 145
14.8%
1 119
12.2%
2 101
10.3%
7 63
 
6.4%
8 57
 
5.8%
5 49
 
5.0%
6 43
 
4.4%
4 42
 
4.3%
Other values (2) 38
 
3.9%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size876.0 B
2023-05-11
93 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-05-11
2nd row2023-05-11
3rd row2023-05-11
4th row2023-05-11
5th row2023-05-11

Common Values

ValueCountFrequency (%)
2023-05-11 93
100.0%

Length

2023-12-12T18:38:23.873136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:38:23.960567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-05-11 93
100.0%

Correlations

2023-12-12T18:38:24.016823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명업체주소연락처
업체명1.0001.0001.000
업체주소1.0001.0000.996
연락처1.0000.9961.000

Missing values

2023-12-12T18:38:21.543034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:38:21.653015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명업체주소연락처데이터기준일자
0(주)다올 로지스틱경기도 용인시 처인구 남사읍 천덕산로 116-4, 2층031-334-13002023-05-11
1(주)다함메디경기도 용인시 처인구 포곡읍 마성로 386, 102,103호031-705-00412023-05-11
2(주)레포스경기도 용인시 처인구 모현읍 능곡로 151, 2층070-4012-03722023-05-11
3(주)메디웰에코경기도 용인시 처인구 모현읍 오포로25번길 17, A동031-321-90732023-05-11
4(주)비아다빈치경기도 용인시 처인구 남사읍 천덕산로 116-2, 2층,3층031-337-23002023-05-11
5(주)비앤에프인코리아경기도 용인시 처인구 포곡읍 마성로 386, 104,105호031-526-70512023-05-11
6(주)에이치디비네트웍스경기도 용인시 처인구 포곡읍 마성로 386, 2층 201호031-526-70612023-05-11
7(주)유니온가스경기도 용인시 처인구 남사읍 원암로 370-1031-322-12852023-05-11
8(주)인투바이오경기도 용인시 처인구 역북동 386번지 18호031-548-20532023-05-11
9(주)인투팜경기도 용인시 처인구 금학로265번길 5-6, 1층 (역북동)031-548-20532023-05-11
업체명업체주소연락처데이터기준일자
83㈜지바이오텍경기도 용인시 수지구 용구대로2772번길 5, 3층 (죽전동)031-272-36962023-05-11
84누리팜텍경기도 용인시 수지구 현암로 119, 죽전메디뷰 602호 (죽전동)031-897-16182023-05-11
85메디안경기도 용인시 수지구 진산로 28, 성원상떼빌아파트 상가1동 203호 (상현동)<NA>2023-05-11
86브이에이피 리서치경기도 용인시 수지구 동천로113번길 5, 2층 (동천동)031-263-90672023-05-11
87씨디씨코리아주식회사경기도 용인시 수지구 동천로 17, 신영프라자 403호 (동천동)070-4517-00302023-05-11
88알엠에스코리아(주)경기도 용인시 수지구 풍덕천로181번길 4-19 (풍덕천동)031-261-63832023-05-11
89제이에스파마 주식회사경기도 용인시 수지구 정든로6번길 2, 그린피아 503호 (죽전동)<NA>2023-05-11
90주식회사 안연케어 용인지점경기도 용인시 수지구 문인로 30-1, 그랜드백화점물류센타 (동천동)<NA>2023-05-11
91지에이치약품경기도 용인시 수지구 광교중앙로 304, 801호 (상현동)<NA>2023-05-11
92하나메디팜(주)경기도 용인시 수지구 문인로 30-1, 102호 (동천동, 그랜드백화점물류센타)031-263-09412023-05-11