Overview

Dataset statistics

Number of variables5
Number of observations5140
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory205.9 KiB
Average record size in memory41.0 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description인천광역시 군구별 의약품 판매업소 현황(안전상비의약품 판매업소, 의료기기판매업소 등)에 대한 데이터로 구성된 정보입니다.
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15064907&srcSe=7661IVAWM27C61E190

Alerts

연번 is highly overall correlated with 군구명 and 1 other fieldsHigh correlation
군구명 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
업종별 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-28 06:03:16.795627
Analysis finished2024-01-28 06:03:17.740132
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct5140
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2570.5
Minimum1
Maximum5140
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size45.3 KiB
2024-01-28T15:03:17.808023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile257.95
Q11285.75
median2570.5
Q33855.25
95-th percentile4883.05
Maximum5140
Range5139
Interquartile range (IQR)2569.5

Descriptive statistics

Standard deviation1483.9345
Coefficient of variation (CV)0.57729411
Kurtosis-1.2
Mean2570.5
Median Absolute Deviation (MAD)1285
Skewness0
Sum13212370
Variance2202061.7
MonotonicityStrictly increasing
2024-01-28T15:03:17.950839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
3376 1
 
< 0.1%
3434 1
 
< 0.1%
3433 1
 
< 0.1%
3432 1
 
< 0.1%
3431 1
 
< 0.1%
3430 1
 
< 0.1%
3429 1
 
< 0.1%
3428 1
 
< 0.1%
3427 1
 
< 0.1%
Other values (5130) 5130
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
5140 1
< 0.1%
5139 1
< 0.1%
5138 1
< 0.1%
5137 1
< 0.1%
5136 1
< 0.1%
5135 1
< 0.1%
5134 1
< 0.1%
5133 1
< 0.1%
5132 1
< 0.1%
5131 1
< 0.1%

군구명
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size40.3 KiB
부평구
1656 
서구
748 
남동구
718 
미추홀구
622 
연수구
537 
Other values (12)
859 

Length

Max length6
Median length3
Mean length2.9200389
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row강화군
2nd row강화군
3rd row강화군
4th row강화군
5th row강화군

Common Values

ValueCountFrequency (%)
부평구 1656
32.2%
서구 748
14.6%
남동구 718
14.0%
미추홀구 622
 
12.1%
연수구 537
 
10.4%
계양구 398
 
7.7%
중구 260
 
5.1%
동구 77
 
1.5%
강화군 55
 
1.1%
서구 19
 
0.4%
Other values (7) 50
 
1.0%

Length

2024-01-28T15:03:18.097715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부평구 1656
32.2%
서구 771
15.0%
남동구 718
14.0%
미추홀구 622
 
12.1%
연수구 558
 
10.9%
계양구 398
 
7.7%
중구 269
 
5.2%
동구 77
 
1.5%
강화군 55
 
1.1%
옹진군 16
 
0.3%

업종별
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size40.3 KiB
안전상비의약품 판매업소
1560 
약국
1294 
판매업
993 
안전상비의약품판매
425 
안전상비의약품 판매업
378 
Other values (10)
490 

Length

Max length12
Median length11
Mean length6.9597276
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row약국
2nd row약국
3rd row약국
4th row약국
5th row약국

Common Values

ValueCountFrequency (%)
안전상비의약품 판매업소 1560
30.4%
약국 1294
25.2%
판매업 993
19.3%
안전상비의약품판매 425
 
8.3%
안전상비의약품 판매업 378
 
7.4%
안전상비의약품 238
 
4.6%
의약품도매상 91
 
1.8%
일반종합도매 69
 
1.3%
안전상비의약품 판매 37
 
0.7%
안전상비의약품판매업 28
 
0.5%
Other values (5) 27
 
0.5%

Length

2024-01-28T15:03:18.210965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
안전상비의약품 2213
31.1%
판매업소 1560
21.9%
판매업 1371
19.3%
약국 1295
18.2%
안전상비의약품판매 425
 
6.0%
의약품도매상 91
 
1.3%
일반종합도매 69
 
1.0%
판매 37
 
0.5%
안전상비의약품판매업 28
 
0.4%
안전상비의약품판매업소 12
 
0.2%
Other values (4) 15
 
0.2%
Distinct4626
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Memory size40.3 KiB
2024-01-28T15:03:18.702900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length18
Mean length8.6330739
Min length1

Characters and Unicode

Total characters44374
Distinct characters652
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4234 ?
Unique (%)82.4%

Sample

1st row교동약국
2nd row강화건강약국
3rd row서울약국
4th row세광약국
5th row강화정문약국
ValueCountFrequency (%)
씨유 563
 
7.9%
세븐일레븐 327
 
4.6%
지에스25 280
 
3.9%
gs25 199
 
2.8%
주)코리아세븐 149
 
2.1%
이마트24 90
 
1.3%
지에스(gs)25 75
 
1.1%
주식회사 32
 
0.4%
cu 22
 
0.3%
미니스톱 18
 
0.3%
Other values (4591) 5384
75.4%
2024-01-28T15:03:19.102995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2802
 
6.3%
2017
 
4.5%
1344
 
3.0%
1322
 
3.0%
1268
 
2.9%
2 1175
 
2.6%
1113
 
2.5%
5 982
 
2.2%
941
 
2.1%
937
 
2.1%
Other values (642) 30473
68.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37058
83.5%
Decimal Number 2532
 
5.7%
Space Separator 2017
 
4.5%
Uppercase Letter 1453
 
3.3%
Close Punctuation 585
 
1.3%
Open Punctuation 584
 
1.3%
Lowercase Letter 102
 
0.2%
Other Symbol 27
 
0.1%
Other Punctuation 9
 
< 0.1%
Dash Punctuation 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2802
 
7.6%
1344
 
3.6%
1322
 
3.6%
1268
 
3.4%
1113
 
3.0%
941
 
2.5%
937
 
2.5%
816
 
2.2%
765
 
2.1%
722
 
1.9%
Other values (581) 25028
67.5%
Uppercase Letter
ValueCountFrequency (%)
S 512
35.2%
G 488
33.6%
C 144
 
9.9%
U 138
 
9.5%
R 41
 
2.8%
K 29
 
2.0%
B 10
 
0.7%
H 9
 
0.6%
A 9
 
0.6%
J 9
 
0.6%
Other values (13) 64
 
4.4%
Lowercase Letter
ValueCountFrequency (%)
s 16
15.7%
l 16
15.7%
e 10
9.8%
a 8
 
7.8%
o 8
 
7.8%
g 7
 
6.9%
m 4
 
3.9%
u 4
 
3.9%
y 4
 
3.9%
t 3
 
2.9%
Other values (10) 22
21.6%
Decimal Number
ValueCountFrequency (%)
2 1175
46.4%
5 982
38.8%
4 190
 
7.5%
1 69
 
2.7%
3 54
 
2.1%
6 38
 
1.5%
0 11
 
0.4%
8 7
 
0.3%
7 4
 
0.2%
9 2
 
0.1%
Other Punctuation
ValueCountFrequency (%)
& 4
44.4%
. 4
44.4%
? 1
 
11.1%
Space Separator
ValueCountFrequency (%)
2017
100.0%
Close Punctuation
ValueCountFrequency (%)
) 585
100.0%
Open Punctuation
ValueCountFrequency (%)
( 584
100.0%
Other Symbol
ValueCountFrequency (%)
27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37085
83.6%
Common 5734
 
12.9%
Latin 1555
 
3.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2802
 
7.6%
1344
 
3.6%
1322
 
3.6%
1268
 
3.4%
1113
 
3.0%
941
 
2.5%
937
 
2.5%
816
 
2.2%
765
 
2.1%
722
 
1.9%
Other values (582) 25055
67.6%
Latin
ValueCountFrequency (%)
S 512
32.9%
G 488
31.4%
C 144
 
9.3%
U 138
 
8.9%
R 41
 
2.6%
K 29
 
1.9%
s 16
 
1.0%
l 16
 
1.0%
e 10
 
0.6%
B 10
 
0.6%
Other values (33) 151
 
9.7%
Common
ValueCountFrequency (%)
2017
35.2%
2 1175
20.5%
5 982
17.1%
) 585
 
10.2%
( 584
 
10.2%
4 190
 
3.3%
1 69
 
1.2%
3 54
 
0.9%
6 38
 
0.7%
0 11
 
0.2%
Other values (7) 29
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37058
83.5%
ASCII 7289
 
16.4%
None 27
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2802
 
7.6%
1344
 
3.6%
1322
 
3.6%
1268
 
3.4%
1113
 
3.0%
941
 
2.5%
937
 
2.5%
816
 
2.2%
765
 
2.1%
722
 
1.9%
Other values (581) 25028
67.5%
ASCII
ValueCountFrequency (%)
2017
27.7%
2 1175
16.1%
5 982
13.5%
) 585
 
8.0%
( 584
 
8.0%
S 512
 
7.0%
G 488
 
6.7%
4 190
 
2.6%
C 144
 
2.0%
U 138
 
1.9%
Other values (50) 474
 
6.5%
None
ValueCountFrequency (%)
27
100.0%
Distinct5012
Distinct (%)97.5%
Missing1
Missing (%)< 0.1%
Memory size40.3 KiB
2024-01-28T15:03:19.380303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length73
Median length57
Mean length34.246351
Min length9

Characters and Unicode

Total characters175992
Distinct characters572
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4889 ?
Unique (%)95.1%

Sample

1st row인천광역시 강화군 교도면 대룡리 479-9
2nd row인천광역시 강화군 강화읍 강화대로312번길 12
3rd row인천광역시 강화군 강화읍 강화대로404번길 4, 서울약국
4th row인천광역시 강화군 선원면 중앙로 259, 세광약국 1층
5th row인천광역시 강화군 강화읍 충렬사로 25
ValueCountFrequency (%)
인천광역시 4857
 
14.4%
부평구 1655
 
4.9%
1층 1624
 
4.8%
서구 769
 
2.3%
남동구 717
 
2.1%
부평동 656
 
1.9%
미추홀구 622
 
1.8%
연수구 558
 
1.7%
계양구 398
 
1.2%
101호 301
 
0.9%
Other values (5453) 21655
64.0%
2024-01-28T15:03:19.827949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28777
 
16.4%
1 9454
 
5.4%
6848
 
3.9%
, 5881
 
3.3%
5420
 
3.1%
5370
 
3.1%
5263
 
3.0%
5177
 
2.9%
5146
 
2.9%
) 5075
 
2.9%
Other values (562) 93581
53.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 99438
56.5%
Decimal Number 30242
 
17.2%
Space Separator 28777
 
16.4%
Other Punctuation 5923
 
3.4%
Close Punctuation 5075
 
2.9%
Open Punctuation 5075
 
2.9%
Uppercase Letter 676
 
0.4%
Dash Punctuation 636
 
0.4%
Lowercase Letter 90
 
0.1%
Math Symbol 53
 
< 0.1%
Other values (2) 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6848
 
6.9%
5420
 
5.5%
5370
 
5.4%
5263
 
5.3%
5177
 
5.2%
5146
 
5.2%
4929
 
5.0%
4909
 
4.9%
3663
 
3.7%
3188
 
3.2%
Other values (496) 49525
49.8%
Uppercase Letter
ValueCountFrequency (%)
B 144
21.3%
A 129
19.1%
S 48
 
7.1%
C 45
 
6.7%
L 33
 
4.9%
K 32
 
4.7%
E 31
 
4.6%
I 31
 
4.6%
W 21
 
3.1%
V 20
 
3.0%
Other values (15) 142
21.0%
Lowercase Letter
ValueCountFrequency (%)
e 30
33.3%
a 11
 
12.2%
s 11
 
12.2%
r 11
 
12.2%
d 9
 
10.0%
y 5
 
5.6%
k 5
 
5.6%
t 3
 
3.3%
h 1
 
1.1%
o 1
 
1.1%
Other values (3) 3
 
3.3%
Decimal Number
ValueCountFrequency (%)
1 9454
31.3%
0 3959
13.1%
2 3676
 
12.2%
3 2843
 
9.4%
4 2254
 
7.5%
5 1974
 
6.5%
6 1785
 
5.9%
7 1590
 
5.3%
8 1479
 
4.9%
9 1228
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 5881
99.3%
. 19
 
0.3%
' 9
 
0.2%
@ 7
 
0.1%
& 2
 
< 0.1%
? 2
 
< 0.1%
/ 1
 
< 0.1%
1
 
< 0.1%
· 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
28777
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5075
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5075
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 636
100.0%
Math Symbol
ValueCountFrequency (%)
~ 53
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 99438
56.5%
Common 75781
43.1%
Latin 772
 
0.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6848
 
6.9%
5420
 
5.5%
5370
 
5.4%
5263
 
5.3%
5177
 
5.2%
5146
 
5.2%
4929
 
5.0%
4909
 
4.9%
3663
 
3.7%
3188
 
3.2%
Other values (496) 49525
49.8%
Latin
ValueCountFrequency (%)
B 144
18.7%
A 129
16.7%
S 48
 
6.2%
C 45
 
5.8%
L 33
 
4.3%
K 32
 
4.1%
E 31
 
4.0%
I 31
 
4.0%
e 30
 
3.9%
W 21
 
2.7%
Other values (31) 228
29.5%
Common
ValueCountFrequency (%)
28777
38.0%
1 9454
 
12.5%
, 5881
 
7.8%
) 5075
 
6.7%
( 5075
 
6.7%
0 3959
 
5.2%
2 3676
 
4.9%
3 2843
 
3.8%
4 2254
 
3.0%
5 1974
 
2.6%
Other values (14) 6813
 
9.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 99437
56.5%
ASCII 76545
43.5%
Number Forms 6
 
< 0.1%
None 3
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28777
37.6%
1 9454
 
12.4%
, 5881
 
7.7%
) 5075
 
6.6%
( 5075
 
6.6%
0 3959
 
5.2%
2 3676
 
4.8%
3 2843
 
3.7%
4 2254
 
2.9%
5 1974
 
2.6%
Other values (50) 7577
 
9.9%
Hangul
ValueCountFrequency (%)
6848
 
6.9%
5420
 
5.5%
5370
 
5.4%
5263
 
5.3%
5177
 
5.2%
5146
 
5.2%
4929
 
5.0%
4909
 
4.9%
3663
 
3.7%
3188
 
3.2%
Other values (495) 49524
49.8%
Number Forms
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%
None
ValueCountFrequency (%)
1
33.3%
1
33.3%
· 1
33.3%
CJK
ValueCountFrequency (%)
1
100.0%

Interactions

2024-01-28T15:03:17.505176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T15:03:19.914028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번군구명업종별
연번1.0000.9250.856
군구명0.9251.0000.881
업종별0.8560.8811.000
2024-01-28T15:03:19.997362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종별군구명
업종별1.0000.540
군구명0.5401.000
2024-01-28T15:03:20.074072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번군구명업종별
연번1.0000.7060.530
군구명0.7061.0000.540
업종별0.5300.5401.000

Missing values

2024-01-28T15:03:17.604025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T15:03:17.699142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번군구명업종별시설명소재지
01강화군약국교동약국인천광역시 강화군 교도면 대룡리 479-9
12강화군약국강화건강약국인천광역시 강화군 강화읍 강화대로312번길 12
23강화군약국서울약국인천광역시 강화군 강화읍 강화대로404번길 4, 서울약국
34강화군약국세광약국인천광역시 강화군 선원면 중앙로 259, 세광약국 1층
45강화군약국강화정문약국인천광역시 강화군 강화읍 충렬사로 25
56강화군약국강화종로약국인천광역시 강화군 강화읍 강화대로 387, 이레빌딩
67강화군약국메디팜 조은약국인천광역시 강화군 강화읍 중앙로 45, 정우빌딩
78강화군약국바다약국인천광역시 강화군 내가면 중앙로 1314-1, 1층
89강화군약국은화약국인천광역시 강화군 강화읍 강화대로 404
910강화군약국큰샘 온누리약국인천광역시 강화군 강화읍 중앙로 9
연번군구명업종별시설명소재지
51305131서구안전상비의약품 판매업지에스25 청라마루점서구 중봉대로 610, 115호
51315132서구안전상비의약품 판매업지에스25 청라대장점서구 청라한내로 72번길 7-15, 1층 109호
51325133서구안전상비의약품 판매업지에스 25 청라썬앤빌점서구 미래로11, 상가1층 108,124호
51335134서구안전상비의약품 판매업씨유 청라엔파트점서구 솔빛로 13, 상가 101, 102,103호
51345135서구안전상비의약품 판매업㈜코리아세븐 청라리치아노서구 청라에메랄드로 102번길 10, 101,102호(연희동, 청라리치아노)
51355136서구안전상비의약품 판매업씨유 청라진성점서구 청라커낼로260번길 11 1층 116호
51365137서구안전상비의약품 판매업씨유 청라그린코어점서구 청라한내로72번길 7 106호
51375138서구안전상비의약품 판매업㈜코리아세븐 청라딜라이트점서구 청라에메랄드로78 청라딜라이트타워 1층 101,102호
51385139서구안전상비의약품 판매업지에스25 청라한양점서구 비즈니스로41 상가동 104호 105호
51395140서구안전상비의약품 판매업씨유 청라시그니처점서구 청라한내로72번길 13, B동 1층 101호