Overview

Dataset statistics

Number of variables5
Number of observations1208
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory48.5 KiB
Average record size in memory41.1 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description인천광역시 약국 현황(상비의약품,약국,약방,의료기기,한약방 등) 데이터이며 군구명, 업종별, 시설명, 소재지, 연락처 등의 항목으로 구성되어 있습니다.
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15095641&srcSe=7661IVAWM27C61E190

Alerts

연번 is highly overall correlated with 군구명High correlation
군구명 is highly overall correlated with 연번High correlation
업종별 is highly imbalanced (80.2%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-28 11:57:24.601587
Analysis finished2024-01-28 11:57:25.203010
Duration0.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1208
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean604.5
Minimum1
Maximum1208
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.7 KiB
2024-01-28T20:57:25.263885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile61.35
Q1302.75
median604.5
Q3906.25
95-th percentile1147.65
Maximum1208
Range1207
Interquartile range (IQR)603.5

Descriptive statistics

Standard deviation348.86387
Coefficient of variation (CV)0.57711145
Kurtosis-1.2
Mean604.5
Median Absolute Deviation (MAD)302
Skewness0
Sum730236
Variance121706
MonotonicityStrictly increasing
2024-01-28T20:57:25.381858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
813 1
 
0.1%
811 1
 
0.1%
810 1
 
0.1%
809 1
 
0.1%
808 1
 
0.1%
807 1
 
0.1%
806 1
 
0.1%
805 1
 
0.1%
804 1
 
0.1%
Other values (1198) 1198
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1208 1
0.1%
1207 1
0.1%
1206 1
0.1%
1205 1
0.1%
1204 1
0.1%
1203 1
0.1%
1202 1
0.1%
1201 1
0.1%
1200 1
0.1%
1199 1
0.1%

군구명
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size9.6 KiB
부평구
248 
남동구
235 
미추홀구
189 
서구
187 
연수구
124 
Other values (5)
225 

Length

Max length4
Median length3
Mean length2.9395695
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강화군
2nd row강화군
3rd row강화군
4th row강화군
5th row강화군

Common Values

ValueCountFrequency (%)
부평구 248
20.5%
남동구 235
19.5%
미추홀구 189
15.6%
서구 187
15.5%
연수구 124
10.3%
계양구 123
10.2%
중구 40
 
3.3%
동구 35
 
2.9%
강화군 24
 
2.0%
옹진군 3
 
0.2%

Length

2024-01-28T20:57:25.500440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T20:57:25.607813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부평구 248
20.5%
남동구 235
19.5%
미추홀구 189
15.6%
서구 187
15.5%
연수구 124
10.3%
계양구 123
10.2%
중구 40
 
3.3%
동구 35
 
2.9%
강화군 24
 
2.0%
옹진군 3
 
0.2%

업종별
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.6 KiB
약국
1171 
한약국
 
37

Length

Max length3
Median length2
Mean length2.0306291
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row약국
2nd row약국
3rd row약국
4th row약국
5th row약국

Common Values

ValueCountFrequency (%)
약국 1171
96.9%
한약국 37
 
3.1%

Length

2024-01-28T20:57:25.736828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T20:57:25.813457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
약국 1171
96.9%
한약국 37
 
3.1%
Distinct949
Distinct (%)78.6%
Missing0
Missing (%)0.0%
Memory size9.6 KiB
2024-01-28T20:57:26.051858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length5.25
Min length3

Characters and Unicode

Total characters6342
Distinct characters356
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique797 ?
Unique (%)66.0%

Sample

1st row강화건강약국
2nd row서울약국
3rd row세광약국
4th row강화정문약국
5th row강화종로약국
ValueCountFrequency (%)
중앙약국 8
 
0.6%
열린약국 6
 
0.5%
희망약국 6
 
0.5%
조은약국 6
 
0.5%
약국 6
 
0.5%
한솔약국 6
 
0.5%
하늘약국 5
 
0.4%
수약국 5
 
0.4%
참사랑약국 5
 
0.4%
푸른약국 5
 
0.4%
Other values (948) 1175
95.3%
2024-01-28T20:57:26.405888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1210
 
19.1%
1209
 
19.1%
152
 
2.4%
105
 
1.7%
105
 
1.7%
88
 
1.4%
71
 
1.1%
67
 
1.1%
60
 
0.9%
54
 
0.9%
Other values (346) 3221
50.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6219
98.1%
Decimal Number 65
 
1.0%
Space Separator 25
 
0.4%
Lowercase Letter 15
 
0.2%
Uppercase Letter 10
 
0.2%
Open Punctuation 3
 
< 0.1%
Close Punctuation 3
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1210
 
19.5%
1209
 
19.4%
152
 
2.4%
105
 
1.7%
105
 
1.7%
88
 
1.4%
71
 
1.1%
67
 
1.1%
60
 
1.0%
54
 
0.9%
Other values (315) 3098
49.8%
Lowercase Letter
ValueCountFrequency (%)
e 3
20.0%
l 2
13.3%
o 2
13.3%
w 1
 
6.7%
n 1
 
6.7%
r 1
 
6.7%
t 1
 
6.7%
s 1
 
6.7%
g 1
 
6.7%
a 1
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
S 2
20.0%
K 1
10.0%
P 1
10.0%
I 1
10.0%
V 1
10.0%
D 1
10.0%
H 1
10.0%
W 1
10.0%
B 1
10.0%
Decimal Number
ValueCountFrequency (%)
5 15
23.1%
3 15
23.1%
6 15
23.1%
1 8
12.3%
2 6
 
9.2%
0 4
 
6.2%
4 2
 
3.1%
Space Separator
ValueCountFrequency (%)
25
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6219
98.1%
Common 98
 
1.5%
Latin 25
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1210
 
19.5%
1209
 
19.4%
152
 
2.4%
105
 
1.7%
105
 
1.7%
88
 
1.4%
71
 
1.1%
67
 
1.1%
60
 
1.0%
54
 
0.9%
Other values (315) 3098
49.8%
Latin
ValueCountFrequency (%)
e 3
 
12.0%
l 2
 
8.0%
o 2
 
8.0%
S 2
 
8.0%
K 1
 
4.0%
P 1
 
4.0%
I 1
 
4.0%
V 1
 
4.0%
D 1
 
4.0%
w 1
 
4.0%
Other values (10) 10
40.0%
Common
ValueCountFrequency (%)
25
25.5%
5 15
15.3%
3 15
15.3%
6 15
15.3%
1 8
 
8.2%
2 6
 
6.1%
0 4
 
4.1%
( 3
 
3.1%
) 3
 
3.1%
4 2
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6219
98.1%
ASCII 123
 
1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1210
 
19.5%
1209
 
19.4%
152
 
2.4%
105
 
1.7%
105
 
1.7%
88
 
1.4%
71
 
1.1%
67
 
1.1%
60
 
1.0%
54
 
0.9%
Other values (315) 3098
49.8%
ASCII
ValueCountFrequency (%)
25
20.3%
5 15
12.2%
3 15
12.2%
6 15
12.2%
1 8
 
6.5%
2 6
 
4.9%
0 4
 
3.3%
( 3
 
2.4%
) 3
 
2.4%
e 3
 
2.4%
Other values (21) 26
21.1%
Distinct1193
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size9.6 KiB
2024-01-28T20:57:26.739001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length49
Mean length31.607616
Min length19

Characters and Unicode

Total characters38182
Distinct characters432
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1178 ?
Unique (%)97.5%

Sample

1st row인천광역시 강화군 강화읍 강화대로312번길 12
2nd row인천광역시 강화군 강화읍 강화대로404번길 4, 서울약국
3rd row인천광역시 강화군 선원면 중앙로 259, 세광약국 1층
4th row인천광역시 강화군 강화읍 충렬사로 25
5th row인천광역시 강화군 강화읍 강화대로 387, 이레빌딩
ValueCountFrequency (%)
인천광역시 1208
 
16.0%
1층 280
 
3.7%
부평구 248
 
3.3%
남동구 235
 
3.1%
미추홀구 189
 
2.5%
서구 187
 
2.5%
연수구 124
 
1.6%
계양구 123
 
1.6%
부평동 106
 
1.4%
주안동 78
 
1.0%
Other values (1735) 4770
63.2%
2024-01-28T20:57:27.187844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6352
 
16.6%
1 1711
 
4.5%
1606
 
4.2%
1366
 
3.6%
1305
 
3.4%
1268
 
3.3%
1252
 
3.3%
1221
 
3.2%
1218
 
3.2%
1218
 
3.2%
Other values (422) 19665
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22153
58.0%
Space Separator 6352
 
16.6%
Decimal Number 5904
 
15.5%
Close Punctuation 1190
 
3.1%
Open Punctuation 1190
 
3.1%
Other Punctuation 1135
 
3.0%
Dash Punctuation 126
 
0.3%
Uppercase Letter 98
 
0.3%
Lowercase Letter 22
 
0.1%
Math Symbol 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1606
 
7.2%
1366
 
6.2%
1305
 
5.9%
1268
 
5.7%
1252
 
5.7%
1221
 
5.5%
1218
 
5.5%
1218
 
5.5%
577
 
2.6%
546
 
2.5%
Other values (371) 10576
47.7%
Uppercase Letter
ValueCountFrequency (%)
A 18
18.4%
B 12
12.2%
S 8
 
8.2%
E 8
 
8.2%
C 6
 
6.1%
I 6
 
6.1%
K 6
 
6.1%
W 4
 
4.1%
V 4
 
4.1%
Y 3
 
3.1%
Other values (12) 23
23.5%
Decimal Number
ValueCountFrequency (%)
1 1711
29.0%
0 725
12.3%
2 694
11.8%
3 581
 
9.8%
4 478
 
8.1%
5 385
 
6.5%
6 364
 
6.2%
8 359
 
6.1%
7 354
 
6.0%
9 253
 
4.3%
Lowercase Letter
ValueCountFrequency (%)
e 6
27.3%
s 3
13.6%
r 3
13.6%
d 3
13.6%
a 3
13.6%
y 1
 
4.5%
t 1
 
4.5%
i 1
 
4.5%
c 1
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 1129
99.5%
' 3
 
0.3%
/ 1
 
0.1%
@ 1
 
0.1%
· 1
 
0.1%
Space Separator
ValueCountFrequency (%)
6352
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1190
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1190
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 126
100.0%
Math Symbol
ValueCountFrequency (%)
~ 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22153
58.0%
Common 15909
41.7%
Latin 120
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1606
 
7.2%
1366
 
6.2%
1305
 
5.9%
1268
 
5.7%
1252
 
5.7%
1221
 
5.5%
1218
 
5.5%
1218
 
5.5%
577
 
2.6%
546
 
2.5%
Other values (371) 10576
47.7%
Latin
ValueCountFrequency (%)
A 18
15.0%
B 12
 
10.0%
S 8
 
6.7%
E 8
 
6.7%
C 6
 
5.0%
I 6
 
5.0%
e 6
 
5.0%
K 6
 
5.0%
W 4
 
3.3%
V 4
 
3.3%
Other values (21) 42
35.0%
Common
ValueCountFrequency (%)
6352
39.9%
1 1711
 
10.8%
) 1190
 
7.5%
( 1190
 
7.5%
, 1129
 
7.1%
0 725
 
4.6%
2 694
 
4.4%
3 581
 
3.7%
4 478
 
3.0%
5 385
 
2.4%
Other values (10) 1474
 
9.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22153
58.0%
ASCII 16028
42.0%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6352
39.6%
1 1711
 
10.7%
) 1190
 
7.4%
( 1190
 
7.4%
, 1129
 
7.0%
0 725
 
4.5%
2 694
 
4.3%
3 581
 
3.6%
4 478
 
3.0%
5 385
 
2.4%
Other values (40) 1593
 
9.9%
Hangul
ValueCountFrequency (%)
1606
 
7.2%
1366
 
6.2%
1305
 
5.9%
1268
 
5.7%
1252
 
5.7%
1221
 
5.5%
1218
 
5.5%
1218
 
5.5%
577
 
2.6%
546
 
2.5%
Other values (371) 10576
47.7%
None
ValueCountFrequency (%)
· 1
100.0%

Interactions

2024-01-28T20:57:24.995113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T20:57:27.269470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번군구명업종별
연번1.0000.9620.186
군구명0.9621.0000.000
업종별0.1860.0001.000
2024-01-28T20:57:27.340921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종별군구명
업종별1.0000.000
군구명0.0001.000
2024-01-28T20:57:27.409853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번군구명업종별
연번1.0000.6640.142
군구명0.6641.0000.000
업종별0.1420.0001.000

Missing values

2024-01-28T20:57:25.092430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T20:57:25.173227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번군구명업종별시설명소재지
01강화군약국강화건강약국인천광역시 강화군 강화읍 강화대로312번길 12
12강화군약국서울약국인천광역시 강화군 강화읍 강화대로404번길 4, 서울약국
23강화군약국세광약국인천광역시 강화군 선원면 중앙로 259, 세광약국 1층
34강화군약국강화정문약국인천광역시 강화군 강화읍 충렬사로 25
45강화군약국강화종로약국인천광역시 강화군 강화읍 강화대로 387, 이레빌딩
56강화군약국메디팜 조은약국인천광역시 강화군 강화읍 중앙로 45, 정우빌딩
67강화군약국바다약국인천광역시 강화군 내가면 중앙로 1314-1, 1층
78강화군약국은화약국인천광역시 강화군 강화읍 강화대로 404
89강화군한약국모던 한방약국인천광역시 강화군 강화읍 중앙로 18
910강화군약국큰샘 온누리약국인천광역시 강화군 강화읍 중앙로 9
연번군구명업종별시설명소재지
11981199서구약국호수약국인천광역시 서구 크리스탈로 78, 105호 (경서동, 엘림존)
11991200서구약국휴베이스퍼스트약국인천광역시 서구 이음5로 80, 108호 (원당동)
12001201서구약국희망약국인천광역시 서구 가정로 140, 103호 (가좌동)
12011202서구한약국경희기경한약국인천광역시 서구 고산후로 148, 1층 일부호 (원당동)
12021203서구한약국다혜원한약국인천광역시 서구 가정로 437, 301동 B118호 (가정동, 루원시티 SK Leaders' VIEW)
12031204서구한약국루원시티약국인천광역시 서구 염곡로 468, 드림타워 106호 (가정동)
12041205서구한약국블루약국인천광역시 서구 보듬로 146, 메트로밸리 112호 (오류동)
12051206서구한약국삼육오사랑약국인천광역시 서구 칠천왕로33번길 14, 1층 일부호 (석남동)
12061207서구한약국원광한약국인천광역시 서구 탁옥로98번길 1, 106호 (심곡동)
12071208서구한약국한마음한약국인천광역시 서구 심곡로 79, 3층 (심곡동, 우정상가)