Overview

Dataset statistics

Number of variables: 5
Number of observations: 44
Missing cells: 7
Missing cells (%): 3.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.8 KiB
Average record size in memory: 43.0 B
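
As a quick sanity check, the percentage and memory figures above can be re-derived from the counts. This is a minimal sketch that assumes the missing-cell percentage is taken over all cells (variables times observations) and that the total size is the average record size times the number of rows; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" block above.
n_variables = 5
n_observations = 44
missing_cells = 7
avg_record_bytes = 43.0

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size = average record size * number of rows.
total_kib = avg_record_bytes * n_observations / 1024

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"total size: {total_kib:.1f} KiB")
```

Under these assumptions the results round to the 3.2% and 1.8 KiB reported above.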

Variable types

Text: 4
Categorical: 1

Dataset

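Description: Status data on pharmaceutical businesses (pharmaceutical wholesalers and herbal medicine wholesalers) in Yeonje-gu, Busan Metropolitan City, as of 2023-10-23. Provides business names and locations.
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: https://www.data.go.kr/data/15025419/fileData.do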

Alerts

영업소우편번호(도로명) has 1 (2.3%) missing values (Missing)
영업소전화번호 has 6 (13.6%) missing values (Missing)
영업소명 has unique values (Unique)

Reproduction

Analysis started: 2023-12-13 00:01:54.209843
Analysis finished: 2023-12-13 00:01:54.667928
Duration: 0.46 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
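
The reported duration follows directly from the two timestamps. A minimal sketch using only the Python standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2023-12-13 00:01:54.209843")
finished = datetime.fromisoformat("2023-12-13 00:01:54.667928")

# Subtracting two datetimes yields a timedelta.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```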

Variables

영업소명
Text

UNIQUE 

Distinct: 44
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 12
Median length: 9
Mean length: 5.8409091
Min length: 3

Characters and Unicode

Total characters: 257
Distinct characters: 84
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
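
To illustrate how such per-category counts can be obtained, here is a minimal sketch using Python's standard `unicodedata` module. It is not necessarily how the report itself computes them; the `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative names, and the input is the five business names listed in this variable's Sample block.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter general-category codes seen in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Zs": "Space Separator",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not in the mapping.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The five sample business names from this variable.
names = ["제이에스메디칼", "하나파마", "(주)나이스팜", "(주)효산메디팜", "(주)제이에스약품"]
counts = category_counts(names)
for name, n in counts.most_common():
    print(name, n)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables for this variable.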

Unique

Unique: 44
Unique (%): 100.0%

Sample

1st row: 제이에스메디칼
2nd row: 하나파마
3rd row: (주)나이스팜
4th row: (주)효산메디팜
5th row: (주)제이에스약품

Value | Count | Frequency (%)
주식회사 | 2 | 4.3%
제이에스메디칼 | 1 | 2.2%
엠씨메디 | 1 | 2.2%
경남종합가스상사 | 1 | 2.2%
주)두산약품 | 1 | 2.2%
주)새생명약품 | 1 | 2.2%
보민약품 | 1 | 2.2%
에이치에스학삼 | 1 | 2.2%
제이케이팜스 | 1 | 2.2%
스마트팜 | 1 | 2.2%
Other values (35) | 35 | 76.1%

Most occurring characters

ValueCountFrequency (%)
17
 
6.6%
15
 
5.8%
14
 
5.4%
14
 
5.4%
13
 
5.1%
( 12
 
4.7%
) 12
 
4.7%
10
 
3.9%
10
 
3.9%
9
 
3.5%
Other values (74) 131
51.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 227 | 88.3%
Open Punctuation | 12 | 4.7%
Close Punctuation | 12 | 4.7%
Uppercase Letter | 4 | 1.6%
Space Separator | 2 | 0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
7.5%
15
 
6.6%
14
 
6.2%
14
 
6.2%
13
 
5.7%
10
 
4.4%
10
 
4.4%
9
 
4.0%
7
 
3.1%
7
 
3.1%
Other values (67) 111
48.9%
Uppercase Letter
ValueCountFrequency (%)
V 1
25.0%
S 1
25.0%
U 1
25.0%
B 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 227 | 88.3%
Common | 26 | 10.1%
Latin | 4 | 1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
7.5%
15
 
6.6%
14
 
6.2%
14
 
6.2%
13
 
5.7%
10
 
4.4%
10
 
4.4%
9
 
4.0%
7
 
3.1%
7
 
3.1%
Other values (67) 111
48.9%
Latin
ValueCountFrequency (%)
V 1
25.0%
S 1
25.0%
U 1
25.0%
B 1
25.0%
Common
ValueCountFrequency (%)
( 12
46.2%
) 12
46.2%
2
 
7.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 227 | 88.3%
ASCII | 30 | 11.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
17
 
7.5%
15
 
6.6%
14
 
6.2%
14
 
6.2%
13
 
5.7%
10
 
4.4%
10
 
4.4%
9
 
4.0%
7
 
3.1%
7
 
3.1%
Other values (67) 111
48.9%
ASCII
ValueCountFrequency (%)
( 12
40.0%
) 12
40.0%
2
 
6.7%
V 1
 
3.3%
S 1
 
3.3%
U 1
 
3.3%
B 1
 
3.3%

영업소소재지(도로명)
Text

Distinct: 42
Distinct (%): 95.5%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 51
Median length: 40
Mean length: 32.340909
Min length: 22

Characters and Unicode

Total characters: 1423
Distinct characters: 87
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 40
Unique (%): 90.9%

Sample

1st row: 부산광역시 연제구 중앙대로1235번길 41, 양지빌딩 610호 (거제동)
2nd row: 부산광역시 연제구 과정로 74, 상가동 2층 210호 (연산동, 선경아파트)
3rd row: 부산광역시 연제구 연미로13번길 33, 3층 일부호 (연산동)
4th row: 부산광역시 연제구 월드컵대로243번길 19, 상가23동 304호 (거제동, 거제동원타워)
5th row: 부산광역시 연제구 세병로 35-6, 1층 101호 (연산동)

Value | Count | Frequency (%)
부산광역시 | 44 | 16.1%
연제구 | 44 | 16.1%
연산동 | 34 | 12.4%
세병로 | 11 | 4.0%
거제동 | 8 | 2.9%
39 | 7 | 2.6%
2층 | 6 | 2.2%
일부호 | 5 | 1.8%
1층 | 5 | 1.8%
3층 | 4 | 1.5%
Other values (86) | 106 | 38.7%

Most occurring characters

ValueCountFrequency (%)
230
 
16.2%
82
 
5.8%
79
 
5.6%
59
 
4.1%
1 53
 
3.7%
49
 
3.4%
48
 
3.4%
) 44
 
3.1%
3 44
 
3.1%
44
 
3.1%
Other values (77) 691
48.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 806 | 56.6%
Decimal Number | 248 | 17.4%
Space Separator | 230 | 16.2%
Close Punctuation | 44 | 3.1%
Other Punctuation | 44 | 3.1%
Open Punctuation | 44 | 3.1%
Dash Punctuation | 5 | 0.4%
Uppercase Letter | 2 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
82
 
10.2%
79
 
9.8%
59
 
7.3%
49
 
6.1%
48
 
6.0%
44
 
5.5%
44
 
5.5%
44
 
5.5%
44
 
5.5%
44
 
5.5%
Other values (60) 269
33.4%
Decimal Number
ValueCountFrequency (%)
1 53
21.4%
3 44
17.7%
2 39
15.7%
0 25
10.1%
5 24
9.7%
6 18
 
7.3%
4 18
 
7.3%
9 12
 
4.8%
7 10
 
4.0%
8 5
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
T 1
50.0%
C 1
50.0%
Space Separator
ValueCountFrequency (%)
230
100.0%
Close Punctuation
ValueCountFrequency (%)
) 44
100.0%
Other Punctuation
ValueCountFrequency (%)
, 44
100.0%
Open Punctuation
ValueCountFrequency (%)
( 44
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 806 | 56.6%
Common | 615 | 43.2%
Latin | 2 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
82
 
10.2%
79
 
9.8%
59
 
7.3%
49
 
6.1%
48
 
6.0%
44
 
5.5%
44
 
5.5%
44
 
5.5%
44
 
5.5%
44
 
5.5%
Other values (60) 269
33.4%
Common
ValueCountFrequency (%)
230
37.4%
1 53
 
8.6%
) 44
 
7.2%
3 44
 
7.2%
, 44
 
7.2%
( 44
 
7.2%
2 39
 
6.3%
0 25
 
4.1%
5 24
 
3.9%
6 18
 
2.9%
Other values (5) 50
 
8.1%
Latin
ValueCountFrequency (%)
T 1
50.0%
C 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 806 | 56.6%
ASCII | 617 | 43.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
230
37.3%
1 53
 
8.6%
) 44
 
7.1%
3 44
 
7.1%
, 44
 
7.1%
( 44
 
7.1%
2 39
 
6.3%
0 25
 
4.1%
5 24
 
3.9%
6 18
 
2.9%
Other values (7) 52
 
8.4%
Hangul
ValueCountFrequency (%)
82
 
10.2%
79
 
9.8%
59
 
7.3%
49
 
6.1%
48
 
6.0%
44
 
5.5%
44
 
5.5%
44
 
5.5%
44
 
5.5%
44
 
5.5%
Other values (60) 269
33.4%

영업소우편번호(도로명)
Text

MISSING

Distinct: 26
Distinct (%): 60.5%
Missing: 1
Missing (%): 2.3%
Memory size: 484.0 B

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Characters and Unicode

Total characters: 258
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 20
Unique (%): 46.5%

Sample

1st row: '47505
2nd row: '47575
3rd row: '47614
4th row: '47525
5th row: '47519

Value | Count | Frequency (%)
47519 | 12 | 27.9%
47518 | 3 | 7.0%
47505 | 2 | 4.7%
47614 | 2 | 4.7%
47524 | 2 | 4.7%
47583 | 2 | 4.7%
47568 | 1 | 2.3%
47540 | 1 | 2.3%
47513 | 1 | 2.3%
47585 | 1 | 2.3%
Other values (16) | 16 | 37.2%

Most occurring characters

ValueCountFrequency (%)
4 51
19.8%
7 49
19.0%
5 48
18.6%
' 43
16.7%
1 20
 
7.8%
9 15
 
5.8%
8 9
 
3.5%
0 8
 
3.1%
6 6
 
2.3%
3 5
 
1.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 215 | 83.3%
Other Punctuation | 43 | 16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 51
23.7%
7 49
22.8%
5 48
22.3%
1 20
 
9.3%
9 15
 
7.0%
8 9
 
4.2%
0 8
 
3.7%
6 6
 
2.8%
3 5
 
2.3%
2 4
 
1.9%
Other Punctuation
ValueCountFrequency (%)
' 43
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 258
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 51
19.8%
7 49
19.0%
5 48
18.6%
' 43
16.7%
1 20
 
7.8%
9 15
 
5.8%
8 9
 
3.5%
0 8
 
3.1%
6 6
 
2.3%
3 5
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 258
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 51
19.8%
7 49
19.0%
5 48
18.6%
' 43
16.7%
1 20
 
7.8%
9 15
 
5.8%
8 9
 
3.5%
0 8
 
3.1%
6 6
 
2.3%
3 5
 
1.9%
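
Every non-missing postal code in this variable carries a leading apostrophe, which accounts for the constant length of 6 and the 43 occurrences of `'` in the character tables. The apostrophe is presumably a spreadsheet text-format guard rather than part of the code. A minimal cleanup sketch under that assumption; `clean_postal_code` is a hypothetical helper, not something the report provides.

```python
import re

# Korean postal codes are five digits.
POSTAL_RE = re.compile(r"\d{5}")


def clean_postal_code(raw):
    """Strip a leading apostrophe; return None for missing or malformed input."""
    if raw is None:
        return None
    code = raw.strip().lstrip("'")
    return code if POSTAL_RE.fullmatch(code) else None


# The five sample values from this variable, plus one missing entry.
samples = ["'47505", "'47575", "'47614", "'47525", "'47519", None]
cleaned = [clean_postal_code(s) for s in samples]
print(cleaned)
```

Returning `None` for anything that is not exactly five digits keeps malformed entries visible as missing instead of silently passing them through.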

영업소전화번호
Text

MISSING 

Distinct: 37
Distinct (%): 97.4%
Missing: 6
Missing (%): 13.6%
Memory size: 484.0 B
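
The distinct percentages in this report are consistent with dividing by the number of non-missing values rather than by all 44 rows. A quick check under that assumption (the helper name is illustrative):

```python
N_ROWS = 44  # number of observations in the dataset


def pct_of_non_missing(count, missing, n_rows=N_ROWS):
    """Express count as a percentage of the non-missing rows."""
    return 100 * count / (n_rows - missing)


# (distinct, missing) pairs copied from the per-variable summaries.
postal = pct_of_non_missing(26, 1)  # 영업소우편번호(도로명)
phone = pct_of_non_missing(37, 6)   # 영업소전화번호
print(f"{postal:.1f}% {phone:.1f}%")
```

Both values round to the reported 60.5% and 97.4%, whereas dividing by 44 would give lower figures.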

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 456
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 36
Unique (%): 94.7%

Sample

1st row: 051-900-9911
2nd row: 051-864-0890
3rd row: 051-501-2599
4th row: 051-852-6991
5th row: 051-526-1510

Value | Count | Frequency (%)
051-804-1202 | 2 | 5.3%
051-503-6207 | 1 | 2.6%
051-312-0214 | 1 | 2.6%
051-865-9633 | 1 | 2.6%
051-631-4500 | 1 | 2.6%
051-851-6006 | 1 | 2.6%
051-507-2235 | 1 | 2.6%
051-759-8992 | 1 | 2.6%
051-507-2080 | 1 | 2.6%
051-851-6494 | 1 | 2.6%
Other values (27) | 27 | 71.1%

Most occurring characters

ValueCountFrequency (%)
5 79
17.3%
1 76
16.7%
- 76
16.7%
0 74
16.2%
2 27
 
5.9%
7 24
 
5.3%
6 23
 
5.0%
8 22
 
4.8%
9 22
 
4.8%
4 18
 
3.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 380 | 83.3%
Dash Punctuation | 76 | 16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 79
20.8%
1 76
20.0%
0 74
19.5%
2 27
 
7.1%
7 24
 
6.3%
6 23
 
6.1%
8 22
 
5.8%
9 22
 
5.8%
4 18
 
4.7%
3 15
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 76
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 456
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 79
17.3%
1 76
16.7%
- 76
16.7%
0 74
16.2%
2 27
 
5.9%
7 24
 
5.3%
6 23
 
5.0%
8 22
 
4.8%
9 22
 
4.8%
4 18
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 456
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 79
17.3%
1 76
16.7%
- 76
16.7%
0 74
16.2%
2 27
 
5.9%
7 24
 
5.3%
6 23
 
5.0%
8 22
 
4.8%
9 22
 
4.8%
4 18
 
3.9%
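
Every phone number shown in this report has the form `051-NNN-NNNN`, which matches the constant length of 12 and the 76 dashes across 38 values. A minimal validation sketch, assuming that pattern holds for the whole column; `is_busan_landline` is a hypothetical helper name.

```python
import re

# Assumed pattern: area code 051, three digits, four digits.
PHONE_RE = re.compile(r"051-\d{3}-\d{4}")


def is_busan_landline(value):
    """Return True when value matches the 051-NNN-NNNN pattern."""
    return value is not None and PHONE_RE.fullmatch(value) is not None


# Sample values from this variable, plus a missing and a malformed entry.
samples = ["051-900-9911", "051-864-0890", "051-501-2599", None, "0519009911"]
flags = [is_busan_landline(s) for s in samples]
print(flags)
```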

기타유의사항
Categorical

Distinct: 2
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 14
Median length: 4
Mean length: 5.3636364
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전화번호 개인정보로 미제공
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 38 | 86.4%
전화번호 개인정보로 미제공 | 6 | 13.6%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 38 | 67.9%
전화번호 | 6 | 10.7%
개인정보로 | 6 | 10.7%
미제공 | 6 | 10.7%

Correlations

Variable | 영업소명 | 영업소소재지(도로명) | 영업소우편번호(도로명) | 영업소전화번호
영업소명 | 1.000 | 1.000 | 1.000 | 1.000
영업소소재지(도로명) | 1.000 | 1.000 | 1.000 | 1.000
영업소우편번호(도로명) | 1.000 | 1.000 | 1.000 | 1.000
영업소전화번호 | 1.000 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
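
To make the idea of nullity correlation concrete, the sketch below computes the Pearson correlation between the missing-value indicators of two columns. It uses only the 20 rows listed in the Sample section of this report (indices 0 to 9 and 34 to 43), so the result is illustrative and not the value for the full dataset; the `pearson` helper is written out by hand and is not part of the report.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Row indices shown in the Sample section, and where each column is <NA>.
row_ids = list(range(0, 10)) + list(range(34, 44))
phone_missing_rows = {0, 7}   # 영업소전화번호
postal_missing_rows = {41}    # 영업소우편번호(도로명)

# 1 marks a missing value, 0 a present one.
phone_null = [1 if i in phone_missing_rows else 0 for i in row_ids]
postal_null = [1 if i in postal_missing_rows else 0 for i in row_ids]

r = pearson(phone_null, postal_null)
print(round(r, 3))
```

A value near zero means that a missing phone number says little about whether the postal code is missing; values near 1 or -1 would indicate that the two columns tend to be missing together or alternately.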

Sample

First rows

Index | 영업소명 | 영업소소재지(도로명) | 영업소우편번호(도로명) | 영업소전화번호 | 기타유의사항
0 | 제이에스메디칼 | 부산광역시 연제구 중앙대로1235번길 41, 양지빌딩 610호 (거제동) | '47505 | <NA> | 전화번호 개인정보로 미제공
1 | 하나파마 | 부산광역시 연제구 과정로 74, 상가동 2층 210호 (연산동, 선경아파트) | '47575 | 051-900-9911 | <NA>
2 | (주)나이스팜 | 부산광역시 연제구 연미로13번길 33, 3층 일부호 (연산동) | '47614 | 051-864-0890 | <NA>
3 | (주)효산메디팜 | 부산광역시 연제구 월드컵대로243번길 19, 상가23동 304호 (거제동, 거제동원타워) | '47525 | 051-501-2599 | <NA>
4 | (주)제이에스약품 | 부산광역시 연제구 세병로 35-6, 1층 101호 (연산동) | '47519 | 051-852-6991 | <NA>
5 | (주)서유메딕스 | 부산광역시 연제구 거제천로 258, 월드빌스포츠센터 304호 (연산동) | '47518 | 051-526-1510 | <NA>
6 | 더메디 | 부산광역시 연제구 연안로13번길 75, 2층 (연산동) | '47565 | 051-915-5247 | <NA>
7 | 수현팜 | 부산광역시 연제구 세병로 35-6, 2층 일부호 (연산동) | '47519 | <NA> | 전화번호 개인정보로 미제공
8 | 오에스팜 | 부산광역시 연제구 세병로 39, 3층 일부호 (연산동) | '47519 | 051-865-2255 | <NA>
9 | (주)케이플러스팜 | 부산광역시 연제구 세병로 39, 304호 (연산동) | '47519 | 051-714-1610 | <NA>

Last rows

Index | 영업소명 | 영업소소재지(도로명) | 영업소우편번호(도로명) | 영업소전화번호 | 기타유의사항
34 | 동아제약(주)부산지점 | 부산광역시 연제구 거제천로270번길 16 (연산동) | '47518 | 051-804-1202 | <NA>
35 | (주)정진약품 | 부산광역시 연제구 과정로265번길 53 (연산동) | '47557 | 051-851-6494 | <NA>
36 | 누리한약 | 부산광역시 연제구 중앙천로 17, 지하1층 (연산동) | '47604 | 051-531-8900 | <NA>
37 | UV메디칼 | 부산광역시 연제구 해맞이로61번길 10, 2층 (거제동) | '47534 | 051-505-0355 | <NA>
38 | 제일팜 | 부산광역시 연제구 토곡로 44, 1층 (연산동) | '47585 | 051-759-9871 | <NA>
39 | (주)대산팜 | 부산광역시 연제구 세병로 39, 302,303호 (연산동) | '47519 | 051-507-9525 | <NA>
40 | 제이에스팜 | 부산광역시 연제구 중앙대로1226번길 18, 지하1층 (거제동) | '47513 | 051-506-5241 | <NA>
41 | 삼보약품 | 부산광역시 연제구 중앙대로1219번길 15 (거제동,6층) | <NA> | 051-507-9131 | <NA>
42 | 우정약품주식회사 | 부산광역시 연제구 세병로 39 (연산동) | '47519 | 051-867-6101 | <NA>
43 | 서경한방약업사 | 부산광역시 연제구 중앙대로 1067, 1층 (연산동) | '47541 | 051-863-7777 | <NA>