Overview

Dataset statistics

Number of variables5
Number of observations2251
Missing cells1190
Missing cells (%)10.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory90.3 KiB
Average record size in memory41.1 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description2023.1.1.기준 대전광역시 약국, 안전상비의약품판매업소, 의약품도매상 현황으로 업체명, 주소, 연락처 정보 포함
URLhttps://www.data.go.kr/data/15077468/fileData.do

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
전화번호 has 1190 (52.9%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:13:08.253813
Analysis finished2023-12-12 07:13:09.207364
Duration0.95 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct2251
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1126
Minimum1
Maximum2251
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size19.9 KiB
2023-12-12T16:13:09.569577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile113.5
Q1563.5
median1126
Q31688.5
95-th percentile2138.5
Maximum2251
Range2250
Interquartile range (IQR)1125

Descriptive statistics

Standard deviation649.95205
Coefficient of variation (CV)0.57722207
Kurtosis-1.2
Mean1126
Median Absolute Deviation (MAD)563
Skewness0
Sum2534626
Variance422437.67
MonotonicityStrictly increasing
2023-12-12T16:13:09.718690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1505 1
 
< 0.1%
1499 1
 
< 0.1%
1500 1
 
< 0.1%
1501 1
 
< 0.1%
1502 1
 
< 0.1%
1503 1
 
< 0.1%
1504 1
 
< 0.1%
1506 1
 
< 0.1%
1497 1
 
< 0.1%
Other values (2241) 2241
99.6%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2251 1
< 0.1%
2250 1
< 0.1%
2249 1
< 0.1%
2248 1
< 0.1%
2247 1
< 0.1%
2246 1
< 0.1%
2245 1
< 0.1%
2244 1
< 0.1%
2243 1
< 0.1%
2242 1
< 0.1%

업종
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size17.7 KiB
안전상비의약품판매업
1271 
약국
774 
의약품도매상
206 

Length

Max length10
Median length10
Mean length6.883163
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row약국
2nd row약국
3rd row약국
4th row약국
5th row약국

Common Values

ValueCountFrequency (%)
안전상비의약품판매업 1271
56.5%
약국 774
34.4%
의약품도매상 206
 
9.2%

Length

2023-12-12T16:13:09.919361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:13:10.073357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
안전상비의약품판매업 1271
56.5%
약국 774
34.4%
의약품도매상 206
 
9.2%
Distinct2127
Distinct (%)94.5%
Missing0
Missing (%)0.0%
Memory size17.7 KiB
2023-12-12T16:13:10.361968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length8.413594
Min length2

Characters and Unicode

Total characters18939
Distinct characters471
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2036 ?
Unique (%)90.4%

Sample

1st row메디컬대동약국
2nd row장원약국
3rd row대성당약국
4th row성남약국
5th row영성일약국
ValueCountFrequency (%)
씨유 122
 
4.4%
세븐일레븐 109
 
3.9%
지에스25 74
 
2.7%
주)코리아세븐 42
 
1.5%
주식회사 36
 
1.3%
gs25 33
 
1.2%
미니스톱 24
 
0.9%
지에스(gs)25 21
 
0.8%
이마트24 14
 
0.5%
지에스25(gs25 8
 
0.3%
Other values (2134) 2303
82.7%
2023-12-12T16:13:10.848985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1180
 
6.2%
893
 
4.7%
831
 
4.4%
784
 
4.1%
731
 
3.9%
588
 
3.1%
536
 
2.8%
535
 
2.8%
2 511
 
2.7%
487
 
2.6%
Other values (461) 11863
62.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16273
85.9%
Decimal Number 1067
 
5.6%
Uppercase Letter 560
 
3.0%
Space Separator 536
 
2.8%
Close Punctuation 241
 
1.3%
Open Punctuation 235
 
1.2%
Other Symbol 15
 
0.1%
Lowercase Letter 11
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1180
 
7.3%
893
 
5.5%
831
 
5.1%
784
 
4.8%
731
 
4.5%
588
 
3.6%
535
 
3.3%
487
 
3.0%
387
 
2.4%
371
 
2.3%
Other values (423) 9486
58.3%
Uppercase Letter
ValueCountFrequency (%)
S 235
42.0%
G 231
41.2%
C 33
 
5.9%
U 25
 
4.5%
R 10
 
1.8%
I 7
 
1.2%
K 7
 
1.2%
Q 2
 
0.4%
D 2
 
0.4%
B 2
 
0.4%
Other values (6) 6
 
1.1%
Decimal Number
ValueCountFrequency (%)
2 511
47.9%
5 458
42.9%
4 54
 
5.1%
1 16
 
1.5%
3 11
 
1.0%
6 9
 
0.8%
7 2
 
0.2%
8 2
 
0.2%
0 2
 
0.2%
9 2
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
s 3
27.3%
g 2
18.2%
e 2
18.2%
y 1
 
9.1%
k 1
 
9.1%
u 1
 
9.1%
a 1
 
9.1%
Space Separator
ValueCountFrequency (%)
536
100.0%
Close Punctuation
ValueCountFrequency (%)
) 241
100.0%
Open Punctuation
ValueCountFrequency (%)
( 235
100.0%
Other Symbol
ValueCountFrequency (%)
15
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16288
86.0%
Common 2080
 
11.0%
Latin 571
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1180
 
7.2%
893
 
5.5%
831
 
5.1%
784
 
4.8%
731
 
4.5%
588
 
3.6%
535
 
3.3%
487
 
3.0%
387
 
2.4%
371
 
2.3%
Other values (424) 9501
58.3%
Latin
ValueCountFrequency (%)
S 235
41.2%
G 231
40.5%
C 33
 
5.8%
U 25
 
4.4%
R 10
 
1.8%
I 7
 
1.2%
K 7
 
1.2%
s 3
 
0.5%
Q 2
 
0.4%
g 2
 
0.4%
Other values (13) 16
 
2.8%
Common
ValueCountFrequency (%)
536
25.8%
2 511
24.6%
5 458
22.0%
) 241
11.6%
( 235
11.3%
4 54
 
2.6%
1 16
 
0.8%
3 11
 
0.5%
6 9
 
0.4%
7 2
 
0.1%
Other values (4) 7
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16273
85.9%
ASCII 2651
 
14.0%
None 15
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1180
 
7.3%
893
 
5.5%
831
 
5.1%
784
 
4.8%
731
 
4.5%
588
 
3.6%
535
 
3.3%
487
 
3.0%
387
 
2.4%
371
 
2.3%
Other values (423) 9486
58.3%
ASCII
ValueCountFrequency (%)
536
20.2%
2 511
19.3%
5 458
17.3%
) 241
9.1%
S 235
8.9%
( 235
8.9%
G 231
8.7%
4 54
 
2.0%
C 33
 
1.2%
U 25
 
0.9%
Other values (27) 92
 
3.5%
None
ValueCountFrequency (%)
15
100.0%
Distinct2215
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size17.7 KiB
2023-12-12T16:13:11.261046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length55
Mean length29.475789
Min length19

Characters and Unicode

Total characters66350
Distinct characters414
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2179 ?
Unique (%)96.8%

Sample

1st row대전광역시 동구 계족로 171, 1층 102호 (대동)
2nd row대전광역시 동구 계족로 254 (소제동)
3rd row대전광역시 동구 계족로 324 (성남동)
4th row대전광역시 동구 계족로 362, 성남약국 1층 (성남동)
5th row대전광역시 동구 계족로 37, 1층 (효동)
ValueCountFrequency (%)
대전광역시 2251
 
16.9%
서구 732
 
5.5%
1층 660
 
4.9%
유성구 517
 
3.9%
중구 371
 
2.8%
동구 349
 
2.6%
대덕구 282
 
2.1%
둔산동 146
 
1.1%
101호 103
 
0.8%
봉명동 95
 
0.7%
Other values (2364) 7851
58.8%
2023-12-12T16:13:11.802602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11118
 
16.8%
3311
 
5.0%
1 3261
 
4.9%
3010
 
4.5%
2457
 
3.7%
2339
 
3.5%
2284
 
3.4%
( 2266
 
3.4%
) 2266
 
3.4%
2258
 
3.4%
Other values (404) 31780
47.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37766
56.9%
Space Separator 11118
 
16.8%
Decimal Number 10947
 
16.5%
Open Punctuation 2266
 
3.4%
Close Punctuation 2266
 
3.4%
Other Punctuation 1639
 
2.5%
Dash Punctuation 256
 
0.4%
Uppercase Letter 70
 
0.1%
Math Symbol 13
 
< 0.1%
Lowercase Letter 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3311
 
8.8%
3010
 
8.0%
2457
 
6.5%
2339
 
6.2%
2284
 
6.0%
2258
 
6.0%
2256
 
6.0%
2238
 
5.9%
921
 
2.4%
906
 
2.4%
Other values (358) 15786
41.8%
Uppercase Letter
ValueCountFrequency (%)
A 8
11.4%
K 8
11.4%
B 7
 
10.0%
N 7
 
10.0%
L 5
 
7.1%
C 4
 
5.7%
S 4
 
5.7%
J 3
 
4.3%
H 3
 
4.3%
F 3
 
4.3%
Other values (11) 18
25.7%
Decimal Number
ValueCountFrequency (%)
1 3261
29.8%
2 1285
 
11.7%
0 1137
 
10.4%
3 1021
 
9.3%
4 883
 
8.1%
5 847
 
7.7%
6 752
 
6.9%
7 670
 
6.1%
8 624
 
5.7%
9 467
 
4.3%
Lowercase Letter
ValueCountFrequency (%)
e 4
50.0%
s 1
 
12.5%
k 1
 
12.5%
j 1
 
12.5%
c 1
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 1635
99.8%
. 2
 
0.1%
& 1
 
0.1%
@ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
11118
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2266
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2266
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 256
100.0%
Math Symbol
ValueCountFrequency (%)
~ 13
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37763
56.9%
Common 28505
43.0%
Latin 79
 
0.1%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3311
 
8.8%
3010
 
8.0%
2457
 
6.5%
2339
 
6.2%
2284
 
6.0%
2258
 
6.0%
2256
 
6.0%
2238
 
5.9%
921
 
2.4%
906
 
2.4%
Other values (357) 15783
41.8%
Latin
ValueCountFrequency (%)
A 8
 
10.1%
K 8
 
10.1%
B 7
 
8.9%
N 7
 
8.9%
L 5
 
6.3%
C 4
 
5.1%
S 4
 
5.1%
e 4
 
5.1%
J 3
 
3.8%
H 3
 
3.8%
Other values (17) 26
32.9%
Common
ValueCountFrequency (%)
11118
39.0%
1 3261
 
11.4%
( 2266
 
7.9%
) 2266
 
7.9%
, 1635
 
5.7%
2 1285
 
4.5%
0 1137
 
4.0%
3 1021
 
3.6%
4 883
 
3.1%
5 847
 
3.0%
Other values (9) 2786
 
9.8%
Han
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37763
56.9%
ASCII 28583
43.1%
CJK 3
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11118
38.9%
1 3261
 
11.4%
( 2266
 
7.9%
) 2266
 
7.9%
, 1635
 
5.7%
2 1285
 
4.5%
0 1137
 
4.0%
3 1021
 
3.6%
4 883
 
3.1%
5 847
 
3.0%
Other values (35) 2864
 
10.0%
Hangul
ValueCountFrequency (%)
3311
 
8.8%
3010
 
8.0%
2457
 
6.5%
2339
 
6.2%
2284
 
6.0%
2258
 
6.0%
2256
 
6.0%
2238
 
5.9%
921
 
2.4%
906
 
2.4%
Other values (357) 15783
41.8%
CJK
ValueCountFrequency (%)
3
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct1047
Distinct (%)98.7%
Missing1190
Missing (%)52.9%
Memory size17.7 KiB
2023-12-12T16:13:12.071029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.016965
Min length9

Characters and Unicode

Total characters12750
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1033 ?
Unique (%)97.4%

Sample

1st row042-271-8837
2nd row042-634-8502
3rd row042-626-6388
4th row042-672-2957
5th row042-271-2774
ValueCountFrequency (%)
042-719-1400 2
 
0.2%
02-3284-8124 2
 
0.2%
042-930-5782 2
 
0.2%
042-622-9999 2
 
0.2%
042-671-7888 2
 
0.2%
042-933-0710 2
 
0.2%
042-826-5017 2
 
0.2%
02-3284-8112 2
 
0.2%
042-622-4346 2
 
0.2%
042-636-9849 2
 
0.2%
Other values (1037) 1041
98.1%
2023-12-12T16:13:12.521594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 2213
17.4%
- 2121
16.6%
4 1697
13.3%
0 1634
12.8%
5 958
7.5%
8 901
7.1%
3 810
 
6.4%
7 724
 
5.7%
6 704
 
5.5%
1 568
 
4.5%
Other values (2) 420
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10628
83.4%
Dash Punctuation 2121
 
16.6%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 2213
20.8%
4 1697
16.0%
0 1634
15.4%
5 958
9.0%
8 901
8.5%
3 810
 
7.6%
7 724
 
6.8%
6 704
 
6.6%
1 568
 
5.3%
9 419
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 2121
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12750
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 2213
17.4%
- 2121
16.6%
4 1697
13.3%
0 1634
12.8%
5 958
7.5%
8 901
7.1%
3 810
 
6.4%
7 724
 
5.7%
6 704
 
5.5%
1 568
 
4.5%
Other values (2) 420
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12750
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 2213
17.4%
- 2121
16.6%
4 1697
13.3%
0 1634
12.8%
5 958
7.5%
8 901
7.1%
3 810
 
6.4%
7 724
 
5.7%
6 704
 
5.5%
1 568
 
4.5%
Other values (2) 420
 
3.3%

Interactions

2023-12-12T16:13:08.851250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:13:12.643226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.886
업종0.8861.000
2023-12-12T16:13:12.741528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.826
업종0.8261.000

Missing values

2023-12-12T16:13:09.021769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:13:09.154873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종업체명소재지전화번호
01약국메디컬대동약국대전광역시 동구 계족로 171, 1층 102호 (대동)042-271-8837
12약국장원약국대전광역시 동구 계족로 254 (소제동)042-634-8502
23약국대성당약국대전광역시 동구 계족로 324 (성남동)042-626-6388
34약국성남약국대전광역시 동구 계족로 362, 성남약국 1층 (성남동)042-672-2957
45약국영성일약국대전광역시 동구 계족로 37, 1층 (효동)042-271-2774
56약국한솔약국대전광역시 동구 계족로 393-1 (성남동)042-635-7388
67약국삼성약국대전광역시 동구 계족로 412 (용전동)042-631-6654
78약국용전약국대전광역시 동구 계족로 416, 1층 (용전동)042-624-8981
89약국새소망약국대전광역시 동구 계족로 425, 1층 (용전동)042-625-7927
910약국유산균약국대전광역시 동구 계족로 476 (용전동)042-584-2220
연번업종업체명소재지전화번호
22412242안전상비의약품판매업씨유대전중리한밭점대전광역시 대덕구 한밭대로 1146 (중리동)<NA>
22422243안전상비의약품판매업세븐일레븐대전중리한밭점대전광역시 대덕구 한밭대로 1158, 한밭캠퍼스빌 106호 (중리동)<NA>
22432244안전상비의약품판매업이마트24R대전오정점대전광역시 대덕구 한밭대로1003번길 12, 1층 (오정동)<NA>
22442245안전상비의약품판매업씨유 오정본점대전광역시 대덕구 한밭대로1006번길 4 (오정동)<NA>
22452246안전상비의약품판매업씨유대전중리하하호호점대전광역시 대덕구 한밭대로1129번길 43 (중리동)042-639-6355
22462247안전상비의약품판매업GS25뉴중리사랑점대전광역시 대덕구 한밭대로1149번길 24 (중리동)<NA>
22472248안전상비의약품판매업세븐일레븐대전중리2호점대전광역시 대덕구 한밭대로1149번길 35 (중리동)<NA>
22482249안전상비의약품판매업세븐일레븐대전광역시 대덕구 홍도로119번길 6 (중리동)<NA>
22492250안전상비의약품판매업지에스25 한남대북문점대전광역시 대덕구 홍도로129번길 68 (중리동)<NA>
22502251안전상비의약품판매업(주)코리아세븐대전한남대원룸점대전광역시 대덕구 홍도로73번길 52(오정동)<NA>