Overview

Dataset statistics

Number of variables: 6
Number of observations: 104
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.2 KiB
Average record size in memory: 51.3 B

Variable types

Categorical: 1
Text: 3
Numeric: 2

Dataset

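Description: Status data on pharmaceutical wholesalers within Dong-gu, Daegu Metropolitan City, including fields such as business type (영업종), business name (영업소명), road-name address (도로명주소), and telephone number (전화번호).
URL: https://www.data.go.kr/data/15077924/fileData.do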

Alerts

영업종 is highly imbalanced (58.3%). Alert type: Imbalance
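
The 58.3% figure is consistent with an imbalance score defined as one minus the normalized Shannon entropy of the category counts. The sketch below recomputes it from the four 영업종 counts reported in this document. The function name `imbalance_score` is illustrative, and the formula is an assumption about how the profiler derives the alert, not something the report itself states.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform distribution, approaching 1 as one class dominates."""
    k = len(counts)
    if k <= 1:
        # Undefined with fewer than two classes; treated as balanced here (assumption).
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)  # Shannon entropy in bits
    return 1.0 - entropy / math.log2(k)

# Counts of the four 영업종 categories as listed in this report.
counts = [87, 11, 5, 1]
score = imbalance_score(counts)
print(f"{score:.1%}")
```

A perfectly balanced column (for example four categories with 26 rows each) would score 0 under this definition.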

Reproduction

Analysis started: 2023-12-12 13:04:36.534685
Analysis finished: 2023-12-12 13:04:37.407753
Duration: 0.87 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
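
The reported duration follows directly from the two timestamps. A minimal check using only the standard library (variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction panel of this report.
started = datetime.fromisoformat("2023-12-12 13:04:36.534685")
finished = datetime.fromisoformat("2023-12-12 13:04:37.407753")

elapsed = (finished - started).total_seconds()
print(f"{elapsed:.2f} seconds")  # prints "0.87 seconds"
```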

Variables

영업종
Categorical

Alert: Imbalance

Distinct: 4
Distinct (%): 3.8%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B

Value | Count
일반종합도매 | 87
한약도매 | 11
의료용 고압가스 | 5
수입의약품도매 | 1

Length

Max length: 8
Median length: 6
Mean length: 5.8942308
Min length: 4
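
The length statistics above can be reproduced from the category counts alone. The sketch below assumes lengths are counted in Unicode code points (so each precomposed Hangul syllable counts as one character), which matches the reported values. The dictionary is rebuilt from the counts shown in this report, not from the raw file.

```python
from statistics import mean, median

# Category counts as reported for 영업종.
counts = {"일반종합도매": 87, "한약도매": 11, "의료용 고압가스": 5, "수입의약품도매": 1}

# One length per row; len() counts code points, and the space in "의료용 고압가스" counts too.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 7),
    "min": min(lengths),
}
print(stats)
```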

Unique

Unique: 1
Unique (%): 1.0%
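
"Distinct" and "Unique" measure different things: distinct counts the different values present, while unique counts the values that occur exactly once. A small sketch with the 영업종 counts from this report (the column is rebuilt from those counts, and the helper `pct` is illustrative):

```python
from collections import Counter

# Rebuild the column from the reported category counts (104 rows in total).
rows = ["일반종합도매"] * 87 + ["한약도매"] * 11 + ["의료용 고압가스"] * 5 + ["수입의약품도매"]
freq = Counter(rows)

n_distinct = len(freq)                                  # different values present
n_unique = sum(1 for c in freq.values() if c == 1)      # values seen exactly once

def pct(n):
    """Share of all rows, as a percentage rounded to one decimal."""
    return round(100 * n / len(rows), 1)

print(n_distinct, pct(n_distinct), n_unique, pct(n_unique))
```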

Sample

1st row: 한약도매
2nd row: 일반종합도매
3rd row: 일반종합도매
4th row: 일반종합도매
5th row: 일반종합도매

Common Values

Value | Count | Frequency (%)
일반종합도매 | 87 | 83.7%
한약도매 | 11 | 10.6%
의료용 고압가스 | 5 | 4.8%
수입의약품도매 | 1 | 1.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
일반종합도매 | 87 | 79.8%
한약도매 | 11 | 10.1%
의료용 | 5 | 4.6%
고압가스 | 5 | 4.6%
수입의약품도매 | 1 | 0.9%
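
The percentages in the table directly above are taken over 109 tokens, not 104 rows: "의료용 고압가스" is split into two words, each counted 5 times. The sketch below reproduces those figures under the assumption that the profiler splits values on whitespace; the counts come from this report.

```python
from collections import Counter

# Category counts as reported for 영업종.
counts = {"일반종합도매": 87, "한약도매": 11, "의료용 고압가스": 5, "수입의약품도매": 1}

# Split each value on whitespace and weight every word by its row count.
tokens = Counter()
for value, n in counts.items():
    for word in value.split():
        tokens[word] += n

total = sum(tokens.values())
for word, n in tokens.most_common():
    print(word, n, f"{n / total:.1%}")
```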

영업소명
Text

Distinct: 103
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B

Length

Max length: 12
Median length: 10.5
Mean length: 5.6057692
Min length: 2

Characters and Unicode

Total characters: 583
Distinct characters: 143
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
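
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories for the five sample business names shown in this report; the two-letter codes map to the long names used in the tables (`Lo` = Other Letter, `Ps` = Open Punctuation, `Pe` = Close Punctuation, `Zs` = Space Separator, `So` = Other Symbol). Script and block lookups are not part of the standard library, so they are left out here.

```python
import unicodedata
from collections import Counter

LONG_NAMES = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "So": "Other Symbol",
}

# Sample values of 영업소명 as listed in this report.
names = ["글로벌바이오넷", "골드팜", "노벨팜", "(주)제이엠팜", "주식회사 에스씨바이오"]

# General category of every character across all names.
category_counts = Counter(unicodedata.category(ch) for name in names for ch in name)

for code, n in category_counts.most_common():
    print(LONG_NAMES.get(code, code), n)

# The enclosed form "㈜" (U+3231) is a symbol, not a letter.
print(unicodedata.category("㈜"))
```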

Unique

Unique: 102
Unique (%): 98.1%

Sample

1st row: 글로벌바이오넷
2nd row: 골드팜
3rd row: 노벨팜
4th row: (주)제이엠팜
5th row: 주식회사 에스씨바이오
Value | Count | Frequency (%)
주식회사 | 7 | 6.2%
경북종합가스 | 2 | 1.8%
윤일약품 | 1 | 0.9%
주)남산메디칼 | 1 | 0.9%
신우약품 | 1 | 0.9%
유창약품 | 1 | 0.9%
보람메딕스 | 1 | 0.9%
주)명성팜 | 1 | 0.9%
메디칼팜 | 1 | 0.9%
주)프라임덴탈 | 1 | 0.9%
Other values (95) | 95 | 84.8%

Most occurring characters

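Value | Count | Frequency (%)
(not preserved) | 36 | 6.2%
(not preserved) | 34 | 5.8%
(not preserved) | 32 | 5.5%
(not preserved) | 27 | 4.6%
(not preserved) | 24 | 4.1%
) | 23 | 3.9%
( | 22 | 3.8%
(not preserved) | 17 | 2.9%
(not preserved) | 17 | 2.9%
(not preserved) | 16 | 2.7%
Other values (133) | 335 | 57.5%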

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 524 | 89.9%
Close Punctuation | 23 | 3.9%
Open Punctuation | 22 | 3.8%
Space Separator | 8 | 1.4%
Other Symbol | 6 | 1.0%

Most frequent character per category

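Other Letter
Value | Count | Frequency (%)
(not preserved) | 36 | 6.9%
(not preserved) | 34 | 6.5%
(not preserved) | 32 | 6.1%
(not preserved) | 27 | 5.2%
(not preserved) | 24 | 4.6%
(not preserved) | 17 | 3.2%
(not preserved) | 17 | 3.2%
(not preserved) | 16 | 3.1%
(not preserved) | 15 | 2.9%
(not preserved) | 13 | 2.5%
Other values (129) | 293 | 55.9%

Close Punctuation
Value | Count | Frequency (%)
) | 23 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 22 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 8 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not preserved) | 6 | 100.0%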

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 530 | 90.9%
Common | 53 | 9.1%

Most frequent character per script

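Hangul
Value | Count | Frequency (%)
(not preserved) | 36 | 6.8%
(not preserved) | 34 | 6.4%
(not preserved) | 32 | 6.0%
(not preserved) | 27 | 5.1%
(not preserved) | 24 | 4.5%
(not preserved) | 17 | 3.2%
(not preserved) | 17 | 3.2%
(not preserved) | 16 | 3.0%
(not preserved) | 15 | 2.8%
(not preserved) | 13 | 2.5%
Other values (130) | 299 | 56.4%

Common
Value | Count | Frequency (%)
) | 23 | 43.4%
( | 22 | 41.5%
(space) | 8 | 15.1%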

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 524 | 89.9%
ASCII | 53 | 9.1%
None | 6 | 1.0%

Most frequent character per block

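Hangul
Value | Count | Frequency (%)
(not preserved) | 36 | 6.9%
(not preserved) | 34 | 6.5%
(not preserved) | 32 | 6.1%
(not preserved) | 27 | 5.2%
(not preserved) | 24 | 4.6%
(not preserved) | 17 | 3.2%
(not preserved) | 17 | 3.2%
(not preserved) | 16 | 3.1%
(not preserved) | 15 | 2.9%
(not preserved) | 13 | 2.5%
Other values (129) | 293 | 55.9%

ASCII
Value | Count | Frequency (%)
) | 23 | 43.4%
( | 22 | 41.5%
(space) | 8 | 15.1%

None
Value | Count | Frequency (%)
(not preserved) | 6 | 100.0%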

도로명주소
Text

Distinct: 103
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B

Length

Max length: 45
Median length: 38
Mean length: 29.711538
Min length: 21

Characters and Unicode

Total characters: 3090
Distinct characters: 133
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 102
Unique (%): 98.1%

Sample

1st row: 대구광역시 동구 동내로 76, 한국메디벤처센터 1층 116호 (동내동)
2nd row: 대구광역시 동구 아양로15길 42, 3층 301호 (신암동)
3rd row: 대구광역시 동구 아양로 18-1, 2층 (신암동)
4th row: 대구광역시 동구 동부로22길 48, 유성푸르나임 3층 309호 (신천동)
5th row: 대구광역시 동구 동부로26길 49-1, 2층 (신천동)
Value | Count | Frequency (%)
대구광역시 | 104 | 15.9%
동구 | 104 | 15.9%
2층 | 24 | 3.7%
신천동 | 19 | 2.9%
3층 | 17 | 2.6%
1층 | 14 | 2.1%
율하동 | 14 | 2.1%
4층 | 11 | 1.7%
각산동 | 10 | 1.5%
신암동 | 9 | 1.4%
Other values (210) | 330 | 50.3%

Most occurring characters

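Value | Count | Frequency (%)
(space) | 552 | 17.9%
(not preserved) | 263 | 8.5%
(not preserved) | 210 | 6.8%
(not preserved) | 111 | 3.6%
(not preserved) | 109 | 3.5%
(not preserved) | 105 | 3.4%
(not preserved) | 105 | 3.4%
(not preserved) | 104 | 3.4%
) | 104 | 3.4%
( | 104 | 3.4%
Other values (123) | 1323 | 42.8%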

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1681 | 54.4%
Space Separator | 552 | 17.9%
Decimal Number | 522 | 16.9%
Close Punctuation | 104 | 3.4%
Open Punctuation | 104 | 3.4%
Other Punctuation | 102 | 3.3%
Dash Punctuation | 21 | 0.7%
Uppercase Letter | 4 | 0.1%

Most frequent character per category

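Other Letter
Value | Count | Frequency (%)
(not preserved) | 263 | 15.6%
(not preserved) | 210 | 12.5%
(not preserved) | 111 | 6.6%
(not preserved) | 109 | 6.5%
(not preserved) | 105 | 6.2%
(not preserved) | 105 | 6.2%
(not preserved) | 104 | 6.2%
(not preserved) | 80 | 4.8%
(not preserved) | 50 | 3.0%
(not preserved) | 48 | 2.9%
Other values (104) | 496 | 29.5%

Decimal Number
Value | Count | Frequency (%)
2 | 99 | 19.0%
1 | 91 | 17.4%
3 | 71 | 13.6%
4 | 64 | 12.3%
0 | 53 | 10.2%
6 | 37 | 7.1%
5 | 34 | 6.5%
7 | 30 | 5.7%
8 | 29 | 5.6%
9 | 14 | 2.7%

Uppercase Letter
Value | Count | Frequency (%)
D | 1 | 25.0%
A | 1 | 25.0%
L | 1 | 25.0%
H | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 552 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 104 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 104 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 102 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 21 | 100.0%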

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1681 | 54.4%
Common | 1405 | 45.5%
Latin | 4 | 0.1%

Most frequent character per script

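Hangul
Value | Count | Frequency (%)
(not preserved) | 263 | 15.6%
(not preserved) | 210 | 12.5%
(not preserved) | 111 | 6.6%
(not preserved) | 109 | 6.5%
(not preserved) | 105 | 6.2%
(not preserved) | 105 | 6.2%
(not preserved) | 104 | 6.2%
(not preserved) | 80 | 4.8%
(not preserved) | 50 | 3.0%
(not preserved) | 48 | 2.9%
Other values (104) | 496 | 29.5%

Common
Value | Count | Frequency (%)
(space) | 552 | 39.3%
) | 104 | 7.4%
( | 104 | 7.4%
, | 102 | 7.3%
2 | 99 | 7.0%
1 | 91 | 6.5%
3 | 71 | 5.1%
4 | 64 | 4.6%
0 | 53 | 3.8%
6 | 37 | 2.6%
Other values (5) | 128 | 9.1%

Latin
Value | Count | Frequency (%)
D | 1 | 25.0%
A | 1 | 25.0%
L | 1 | 25.0%
H | 1 | 25.0%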

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1681 | 54.4%
ASCII | 1409 | 45.6%

Most frequent character per block

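ASCII
Value | Count | Frequency (%)
(space) | 552 | 39.2%
) | 104 | 7.4%
( | 104 | 7.4%
, | 102 | 7.2%
2 | 99 | 7.0%
1 | 91 | 6.5%
3 | 71 | 5.0%
4 | 64 | 4.5%
0 | 53 | 3.8%
6 | 37 | 2.6%
Other values (9) | 132 | 9.4%

Hangul
Value | Count | Frequency (%)
(not preserved) | 263 | 15.6%
(not preserved) | 210 | 12.5%
(not preserved) | 111 | 6.6%
(not preserved) | 109 | 6.5%
(not preserved) | 105 | 6.2%
(not preserved) | 105 | 6.2%
(not preserved) | 104 | 6.2%
(not preserved) | 80 | 4.8%
(not preserved) | 50 | 3.0%
(not preserved) | 48 | 2.9%
Other values (104) | 496 | 29.5%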

전화번호
Text

Distinct: 87
Distinct (%): 83.7%
Missing: 0
Missing (%): 0.0%
Memory size: 964.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.009615
Min length: 12

Characters and Unicode

Total characters: 1249
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 86
Unique (%): 82.7%

Sample

1st row: 053-965-5556
2nd row: 053-000-0000
3rd row: 053-955-0045
4th row: 053-000-0000
5th row: 053-741-4214
Value | Count | Frequency (%)
053-000-0000 | 18 | 17.3%
053-965-4631 | 1 | 1.0%
053-943-0915 | 1 | 1.0%
053-961-0080 | 1 | 1.0%
053-327-1940 | 1 | 1.0%
053-741-0900 | 1 | 1.0%
053-962-8090 | 1 | 1.0%
053-755-8901 | 1 | 1.0%
053-213-1700 | 1 | 1.0%
053-965-9802 | 1 | 1.0%
Other values (77) | 77 | 74.0%
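
The value 053-000-0000 accounts for 18 of the 104 rows (17.3%), which suggests a placeholder and not a real subscriber number. That reading is an assumption, as the report does not say so. The sketch below flags such values with a regular expression, using the ten phone numbers from the first sample rows of this report; the pattern and the name `PLACEHOLDER` are illustrative.

```python
import re

# Phone numbers from the first ten sample rows of this report.
phones = [
    "053-965-5556", "053-000-0000", "053-955-0045", "053-000-0000", "053-741-4214",
    "053-000-0000", "053-962-1232", "053-255-8110", "053-000-0000", "053-000-0000",
]

# Assumption: an area code followed by all-zero groups marks a missing number.
PLACEHOLDER = re.compile(r"\d{2,3}-0+-0+")

flagged = [p for p in phones if PLACEHOLDER.fullmatch(p)]
print(f"{len(flagged)} of {len(phones)} sample rows look like placeholders")
```

Treating these rows as missing before further analysis would change the "Missing" figures for this column, so the choice is worth recording explicitly.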

Most occurring characters

Value | Count | Frequency (%)
0 | 291 | 23.3%
- | 208 | 16.7%
5 | 189 | 15.1%
3 | 144 | 11.5%
9 | 78 | 6.2%
6 | 63 | 5.0%
1 | 59 | 4.7%
4 | 57 | 4.6%
2 | 55 | 4.4%
7 | 53 | 4.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1041 | 83.3%
Dash Punctuation | 208 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 291 | 28.0%
5 | 189 | 18.2%
3 | 144 | 13.8%
9 | 78 | 7.5%
6 | 63 | 6.1%
1 | 59 | 5.7%
4 | 57 | 5.5%
2 | 55 | 5.3%
7 | 53 | 5.1%
8 | 52 | 5.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 208 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1249 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 291 | 23.3%
- | 208 | 16.7%
5 | 189 | 15.1%
3 | 144 | 11.5%
9 | 78 | 6.2%
6 | 63 | 5.0%
1 | 59 | 4.7%
4 | 57 | 4.6%
2 | 55 | 4.4%
7 | 53 | 4.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1249 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 291 | 23.3%
- | 208 | 16.7%
5 | 189 | 15.1%
3 | 144 | 11.5%
9 | 78 | 6.2%
6 | 63 | 5.0%
1 | 59 | 4.7%
4 | 57 | 4.6%
2 | 55 | 4.4%
7 | 53 | 4.2%

위도
Real number (ℝ)

Distinct: 88
Distinct (%): 84.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 35.879588
Minimum: 35.862188
Maximum: 35.937132
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 35.862188
5-th percentile: 35.864312
Q1: 35.871749
Median: 35.876494
Q3: 35.881962
95-th percentile: 35.916084
Maximum: 35.937132
Range: 0.07494412
Interquartile range (IQR): 0.010213282

Descriptive statistics

Standard deviation: 0.015554827
Coefficient of variation (CV): 0.00043352858
Kurtosis: 6.1475405
Mean: 35.879588
Median Absolute Deviation (MAD): 0.00511365
Skewness: 2.4010453
Sum: 3731.4772
Variance: 0.00024195265
Monotonicity: Not monotonic
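
Several of the figures for 위도 are derived from others by standard definitions: range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, and variance = standard deviation squared. The sketch below recomputes them from the rounded values printed in this report, so the last digits can differ slightly from the reported ones; the variable names are illustrative.

```python
# Values copied from the 위도 summary in this report (already rounded there).
minimum, maximum = 35.862188, 35.937132
q1, q3 = 35.871749, 35.881962
mean, std = 35.879588, 0.015554827

derived = {
    "range": maximum - minimum,   # reported: 0.07494412
    "iqr": q3 - q1,               # reported: 0.010213282
    "cv": std / mean,             # reported: 0.00043352858
    "variance": std ** 2,         # reported: 0.00024195265
}

for name, value in derived.items():
    print(f"{name}: {value:.8g}")
```

The very small CV reflects that all points lie within one district, so latitude varies by well under a tenth of a degree.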
Figure: Histogram with fixed size bins (bins=50)
Most frequent values
Value | Count | Frequency (%)
35.93713197 | 4 | 3.8%
35.87832123 | 3 | 2.9%
35.88375842 | 2 | 1.9%
35.87413559 | 2 | 1.9%
35.87244762 | 2 | 1.9%
35.87229477 | 2 | 1.9%
35.86302221 | 2 | 1.9%
35.86869749 | 2 | 1.9%
35.88507221 | 2 | 1.9%
35.87371652 | 2 | 1.9%
Other values (78) | 81 | 77.9%

Smallest 10 values
Value | Count | Frequency (%)
35.86218785 | 1 | 1.0%
35.86302221 | 2 | 1.9%
35.86358508 | 1 | 1.0%
35.86429705 | 2 | 1.9%
35.86439497 | 2 | 1.9%
35.86593987 | 1 | 1.0%
35.86621757 | 1 | 1.0%
35.86656739 | 1 | 1.0%
35.86742137 | 1 | 1.0%
35.86750736 | 1 | 1.0%

Largest 10 values
Value | Count | Frequency (%)
35.93713197 | 4 | 3.8%
35.92103187 | 1 | 1.0%
35.91608434 | 2 | 1.9%
35.91263236 | 1 | 1.0%
35.89706819 | 1 | 1.0%
35.8902797 | 1 | 1.0%
35.8898534 | 1 | 1.0%
35.88837966 | 1 | 1.0%
35.88668057 | 1 | 1.0%
35.88664227 | 1 | 1.0%

경도
Real number (ℝ)

Distinct: 88
Distinct (%): 84.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 128.67164
Minimum: 128.61305
Maximum: 128.74937
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 128.61305
5-th percentile: 128.62275
Q1: 128.63371
Median: 128.66747
Q3: 128.70715
95-th percentile: 128.73206
Maximum: 128.74937
Range: 0.1363185
Interquartile range (IQR): 0.0734472

Descriptive statistics

Standard deviation: 0.039957368
Coefficient of variation (CV): 0.0003105375
Kurtosis: -1.5019794
Mean: 128.67164
Median Absolute Deviation (MAD): 0.036704
Skewness: 0.15052912
Sum: 13381.85
Variance: 0.0015965913
Monotonicity: Not monotonic
Figure: Histogram with fixed size bins (bins=50)
Most frequent values
Value | Count | Frequency (%)
128.6352591 | 4 | 3.8%
128.7137401 | 3 | 2.9%
128.6365262 | 2 | 1.9%
128.6342121 | 2 | 1.9%
128.6261146 | 2 | 1.9%
128.7146937 | 2 | 1.9%
128.7021536 | 2 | 1.9%
128.6964899 | 2 | 1.9%
128.6232127 | 2 | 1.9%
128.6264901 | 2 | 1.9%
Other values (78) | 81 | 77.9%

Smallest 10 values
Value | Count | Frequency (%)
128.6130497 | 1 | 1.0%
128.616293 | 1 | 1.0%
128.6164854 | 1 | 1.0%
128.6193771 | 1 | 1.0%
128.6219126 | 1 | 1.0%
128.6226696 | 1 | 1.0%
128.6232127 | 2 | 1.9%
128.6233197 | 1 | 1.0%
128.6245698 | 1 | 1.0%
128.6258282 | 1 | 1.0%

Largest 10 values
Value | Count | Frequency (%)
128.7493682 | 1 | 1.0%
128.7487933 | 1 | 1.0%
128.7360179 | 1 | 1.0%
128.7333135 | 2 | 1.9%
128.7320633 | 1 | 1.0%
128.7320094 | 1 | 1.0%
128.7319164 | 1 | 1.0%
128.7257211 | 1 | 1.0%
128.7249452 | 1 | 1.0%
128.7185013 | 1 | 1.0%

Interactions


Correlations

 | 영업종 | 전화번호 | 위도 | 경도
영업종 | 1.000 | 0.000 | 0.168 | 0.522
전화번호 | 0.000 | 1.000 | 0.873 | 0.000
위도 | 0.168 | 0.873 | 1.000 | 0.414
경도 | 0.522 | 0.000 | 0.414 | 1.000

 | 위도 | 경도 | 영업종
위도 | 1.000 | -0.303 | 0.070
경도 | -0.303 | 1.000 | 0.328
영업종 | 0.070 | 0.328 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 영업종 | 영업소명 | 도로명주소 | 전화번호 | 위도 | 경도
0 | 한약도매 | 글로벌바이오넷 | 대구광역시 동구 동내로 76, 한국메디벤처센터 1층 116호 (동내동) | 053-965-5556 | 35.877835 | 128.736018
1 | 일반종합도매 | 골드팜 | 대구광역시 동구 아양로15길 42, 3층 301호 (신암동) | 053-000-0000 | 35.885072 | 128.623213
2 | 일반종합도매 | 노벨팜 | 대구광역시 동구 아양로 18-1, 2층 (신암동) | 053-955-0045 | 35.881133 | 128.616293
3 | 일반종합도매 | (주)제이엠팜 | 대구광역시 동구 동부로22길 48, 유성푸르나임 3층 309호 (신천동) | 053-000-0000 | 35.872448 | 128.626115
4 | 일반종합도매 | 주식회사 에스씨바이오 | 대구광역시 동구 동부로26길 49-1, 2층 (신천동) | 053-741-4214 | 35.871346 | 128.628265
5 | 일반종합도매 | 주식회사효빈약품 | 대구광역시 동구 신암남로 173, 4층 401호 (신암동) | 053-000-0000 | 35.88471 | 128.632192
6 | 일반종합도매 | 주식회사 마이팜 | 대구광역시 동구 율하동로10길 11, 2층 202호 (율하동) | 053-962-1232 | 35.863022 | 128.702154
7 | 일반종합도매 | (주)신텍스헬스케어 | 대구광역시 동구 동부로 35, 6층 (신천동) | 053-255-8110 | 35.874535 | 128.616485
8 | 일반종합도매 | 성모약품 | 대구광역시 동구 이노밸리로26길 18, 4층 401호 (각산동) | 053-000-0000 | 35.877985 | 128.712641
9 | 일반종합도매 | 해동약품 | 대구광역시 동구 아양로 75-3, 2, 3층 (신암동) | 053-000-0000 | 35.883123 | 128.621913

Index | 영업종 | 영업소명 | 도로명주소 | 전화번호 | 위도 | 경도
94 | 한약도매 | 보경약업사 | 대구광역시 동구 평화로 82 (신암동) | 053-957-8434 | 35.885021 | 128.62267
95 | 한약도매 | 원산약업사 | 대구광역시 동구 효목로19길 39 (효목동) | 053-955-9490 | 35.882542 | 128.641534
96 | 한약도매 | 풍산약업사 | 대구광역시 동구 효동로 118 (효목동) | 053-741-9926 | 35.886642 | 128.641667
97 | 일반종합도매 | 아이팜코리아(주) | 대구광역시 동구 동화천로 467-2, 3층 (지묘동) | 053-755-4444 | 35.937132 | 128.635259
98 | 한약도매 | 보건당약업사 | 대구광역시 동구 경안로101길 42 (서호동) | 053-964-4636 | 35.867507 | 128.708859
99 | 일반종합도매 | ㈜효성약품 | 대구광역시 동구 화랑로 205 (효목동) | 053-955-8900 | 35.877922 | 128.647543
100 | 한약도매 | 동대구약업사 | 대구광역시 동구 동대구로85길 64, 2층 (신천동) | 053-754-7766 | 35.875971 | 128.62457
101 | 일반종합도매 | ㈜부림약품 | 대구광역시 동구 반야월로 239, 3층 (동호동) | 053-757-4500 | 35.872295 | 128.714694
102 | 일반종합도매 | (주)신라약품 | 대구광역시 동구 동부로26길 33, 1,4,5층 (신천동) | 053-754-4444 | 35.872834 | 128.628011
103 | 일반종합도매 | 한솔메디팜 | 대구광역시 동구 반야월로24길 45(율하동) | 053-767-8722 | 35.871117 | 128.700442