Overview

Dataset statistics

Number of variables5
Number of observations139
Missing cells65
Missing cells (%)9.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.7 KiB
Average record size in memory41.9 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description인천광역시 서구 관내 위치한 동물약국(상호명, 소재지, 전화번호)에 대한 현황데이터로 이루어져 있는 파일입니다.
URLhttps://www.data.go.kr/data/15039513/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 65 (46.8%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:18:30.594413
Analysis finished2023-12-12 09:18:31.213323
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct139
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean70
Minimum1
Maximum139
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-12T18:18:31.314909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.9
Q135.5
median70
Q3104.5
95-th percentile132.1
Maximum139
Range138
Interquartile range (IQR)69

Descriptive statistics

Standard deviation40.269923
Coefficient of variation (CV)0.57528461
Kurtosis-1.2
Mean70
Median Absolute Deviation (MAD)35
Skewness0
Sum9730
Variance1621.6667
MonotonicityStrictly increasing
2023-12-12T18:18:31.491564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
97 1
 
0.7%
91 1
 
0.7%
92 1
 
0.7%
93 1
 
0.7%
94 1
 
0.7%
95 1
 
0.7%
96 1
 
0.7%
98 1
 
0.7%
89 1
 
0.7%
Other values (129) 129
92.8%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
139 1
0.7%
138 1
0.7%
137 1
0.7%
136 1
0.7%
135 1
0.7%
134 1
0.7%
133 1
0.7%
132 1
0.7%
131 1
0.7%
130 1
0.7%
Distinct132
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T18:18:31.782499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length9
Mean length5.6258993
Min length3

Characters and Unicode

Total characters782
Distinct characters162
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique126 ?
Unique (%)90.6%

Sample

1st row검단프라자약국
2nd row동원약국
3rd row마전메디칼약국
4th row메디팜성모약국
5th row사랑의약국
ValueCountFrequency (%)
약국 9
 
5.9%
청라바다약국 3
 
2.0%
올리브약국 2
 
1.3%
블루팜약국 2
 
1.3%
성모사랑약국 2
 
1.3%
성누가약국 2
 
1.3%
해미약국 2
 
1.3%
온누리 2
 
1.3%
이야기약국 1
 
0.7%
새오개약국 1
 
0.7%
Other values (127) 127
83.0%
2023-12-12T18:18:32.177056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
139
 
17.8%
139
 
17.8%
23
 
2.9%
18
 
2.3%
18
 
2.3%
16
 
2.0%
14
 
1.8%
13
 
1.7%
12
 
1.5%
12
 
1.5%
Other values (152) 378
48.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 756
96.7%
Space Separator 14
 
1.8%
Decimal Number 10
 
1.3%
Uppercase Letter 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
139
 
18.4%
139
 
18.4%
23
 
3.0%
18
 
2.4%
18
 
2.4%
16
 
2.1%
13
 
1.7%
12
 
1.6%
12
 
1.6%
10
 
1.3%
Other values (146) 356
47.1%
Decimal Number
ValueCountFrequency (%)
3 4
40.0%
5 3
30.0%
6 3
30.0%
Uppercase Letter
ValueCountFrequency (%)
K 1
50.0%
S 1
50.0%
Space Separator
ValueCountFrequency (%)
14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 756
96.7%
Common 24
 
3.1%
Latin 2
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
139
 
18.4%
139
 
18.4%
23
 
3.0%
18
 
2.4%
18
 
2.4%
16
 
2.1%
13
 
1.7%
12
 
1.6%
12
 
1.6%
10
 
1.3%
Other values (146) 356
47.1%
Common
ValueCountFrequency (%)
14
58.3%
3 4
 
16.7%
5 3
 
12.5%
6 3
 
12.5%
Latin
ValueCountFrequency (%)
K 1
50.0%
S 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 756
96.7%
ASCII 26
 
3.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
139
 
18.4%
139
 
18.4%
23
 
3.0%
18
 
2.4%
18
 
2.4%
16
 
2.1%
13
 
1.7%
12
 
1.6%
12
 
1.6%
10
 
1.3%
Other values (146) 356
47.1%
ASCII
ValueCountFrequency (%)
14
53.8%
3 4
 
15.4%
5 3
 
11.5%
6 3
 
11.5%
K 1
 
3.8%
S 1
 
3.8%
Distinct135
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T18:18:32.482158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length38
Mean length31.194245
Min length18

Characters and Unicode

Total characters4336
Distinct characters214
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique131 ?
Unique (%)94.2%

Sample

1st row인천광역시 서구 마전동 697-6
2nd row인천광역시 서구 석남동 556-53
3rd row인천광역시 서구 왕길동 638-1
4th row인천광역시 서구 가좌동 261-20
5th row인천광역시 서구 원적로 96 (가좌동)
ValueCountFrequency (%)
인천광역시 139
 
15.9%
서구 139
 
15.9%
1층 22
 
2.5%
청라동 21
 
2.4%
가좌동 15
 
1.7%
가정로 15
 
1.7%
당하동 14
 
1.6%
가정동 13
 
1.5%
석남동 11
 
1.3%
마전동 10
 
1.1%
Other values (296) 476
54.4%
2023-12-12T18:18:32.957680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
736
 
17.0%
1 202
 
4.7%
155
 
3.6%
144
 
3.3%
143
 
3.3%
143
 
3.3%
142
 
3.3%
141
 
3.3%
139
 
3.2%
139
 
3.2%
Other values (204) 2252
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2448
56.5%
Space Separator 736
 
17.0%
Decimal Number 719
 
16.6%
Close Punctuation 135
 
3.1%
Open Punctuation 135
 
3.1%
Other Punctuation 117
 
2.7%
Uppercase Letter 21
 
0.5%
Dash Punctuation 17
 
0.4%
Lowercase Letter 6
 
0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
155
 
6.3%
144
 
5.9%
143
 
5.8%
143
 
5.8%
142
 
5.8%
141
 
5.8%
139
 
5.7%
139
 
5.7%
139
 
5.7%
74
 
3.0%
Other values (171) 1089
44.5%
Uppercase Letter
ValueCountFrequency (%)
B 4
19.0%
S 4
19.0%
M 2
9.5%
E 2
9.5%
K 2
9.5%
A 2
9.5%
L 1
 
4.8%
I 1
 
4.8%
V 1
 
4.8%
W 1
 
4.8%
Decimal Number
ValueCountFrequency (%)
1 202
28.1%
0 112
15.6%
2 69
 
9.6%
3 65
 
9.0%
6 58
 
8.1%
5 55
 
7.6%
7 40
 
5.6%
8 40
 
5.6%
4 39
 
5.4%
9 39
 
5.4%
Lowercase Letter
ValueCountFrequency (%)
e 2
33.3%
s 1
16.7%
a 1
16.7%
d 1
16.7%
r 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 116
99.1%
' 1
 
0.9%
Space Separator
ValueCountFrequency (%)
736
100.0%
Close Punctuation
ValueCountFrequency (%)
) 135
100.0%
Open Punctuation
ValueCountFrequency (%)
( 135
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2448
56.5%
Common 1861
42.9%
Latin 27
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
155
 
6.3%
144
 
5.9%
143
 
5.8%
143
 
5.8%
142
 
5.8%
141
 
5.8%
139
 
5.7%
139
 
5.7%
139
 
5.7%
74
 
3.0%
Other values (171) 1089
44.5%
Common
ValueCountFrequency (%)
736
39.5%
1 202
 
10.9%
) 135
 
7.3%
( 135
 
7.3%
, 116
 
6.2%
0 112
 
6.0%
2 69
 
3.7%
3 65
 
3.5%
6 58
 
3.1%
5 55
 
3.0%
Other values (7) 178
 
9.6%
Latin
ValueCountFrequency (%)
B 4
14.8%
S 4
14.8%
M 2
 
7.4%
E 2
 
7.4%
K 2
 
7.4%
e 2
 
7.4%
A 2
 
7.4%
s 1
 
3.7%
L 1
 
3.7%
a 1
 
3.7%
Other values (6) 6
22.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2448
56.5%
ASCII 1888
43.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
736
39.0%
1 202
 
10.7%
) 135
 
7.2%
( 135
 
7.2%
, 116
 
6.1%
0 112
 
5.9%
2 69
 
3.7%
3 65
 
3.4%
6 58
 
3.1%
5 55
 
2.9%
Other values (23) 205
 
10.9%
Hangul
ValueCountFrequency (%)
155
 
6.3%
144
 
5.9%
143
 
5.8%
143
 
5.8%
142
 
5.8%
141
 
5.8%
139
 
5.7%
139
 
5.7%
139
 
5.7%
74
 
3.0%
Other values (171) 1089
44.5%

전화번호
Text

MISSING 

Distinct71
Distinct (%)95.9%
Missing65
Missing (%)46.8%
Memory size1.2 KiB
2023-12-12T18:18:33.247036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.067568
Min length12

Characters and Unicode

Total characters893
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)91.9%

Sample

1st row032-565-1654
2nd row032-573-6012
3rd row032-561-0396
4th row032-576-2028
5th row032-577-4241
ValueCountFrequency (%)
032-568-5538 2
 
2.7%
032-562-4001 2
 
2.7%
032-565-5250 2
 
2.7%
070-7670-1009 1
 
1.4%
032-565-1654 1
 
1.4%
032-569-2773 1
 
1.4%
032-569-1057 1
 
1.4%
032-581-9916 1
 
1.4%
032-567-0167 1
 
1.4%
032-566-7809 1
 
1.4%
Other values (61) 61
82.4%
2023-12-12T18:18:33.702034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 148
16.6%
0 124
13.9%
5 112
12.5%
2 105
11.8%
3 102
11.4%
6 83
9.3%
7 69
7.7%
1 48
 
5.4%
8 46
 
5.2%
4 28
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 745
83.4%
Dash Punctuation 148
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 124
16.6%
5 112
15.0%
2 105
14.1%
3 102
13.7%
6 83
11.1%
7 69
9.3%
1 48
 
6.4%
8 46
 
6.2%
4 28
 
3.8%
9 28
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 148
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 893
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 148
16.6%
0 124
13.9%
5 112
12.5%
2 105
11.8%
3 102
11.4%
6 83
9.3%
7 69
7.7%
1 48
 
5.4%
8 46
 
5.2%
4 28
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 893
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 148
16.6%
0 124
13.9%
5 112
12.5%
2 105
11.8%
3 102
11.4%
6 83
9.3%
7 69
7.7%
1 48
 
5.4%
8 46
 
5.2%
4 28
 
3.1%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-07-31
139 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-31
2nd row2023-07-31
3rd row2023-07-31
4th row2023-07-31
5th row2023-07-31

Common Values

ValueCountFrequency (%)
2023-07-31 139
100.0%

Length

2023-12-12T18:18:33.861011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:18:33.996648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-31 139
100.0%

Interactions

2023-12-12T18:18:30.905241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:18:34.097561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번전화번호
연번1.0000.725
전화번호0.7251.000

Missing values

2023-12-12T18:18:31.036643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:18:31.156400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호명소재지전화번호데이터기준일자
01검단프라자약국인천광역시 서구 마전동 697-6032-565-16542023-07-31
12동원약국인천광역시 서구 석남동 556-53032-573-60122023-07-31
23마전메디칼약국인천광역시 서구 왕길동 638-1032-561-03962023-07-31
34메디팜성모약국인천광역시 서구 가좌동 261-20032-576-20282023-07-31
45사랑의약국인천광역시 서구 원적로 96 (가좌동)032-577-42412023-07-31
56복지약국인천광역시 서구 탁옥로 37, 107호 (심곡동,우리타워)032-568-55382023-07-31
67을지약국인천광역시 서구 장고개로 279, 101호 (가좌동)032-576-11192023-07-31
78온누리 이화약국인천광역시 서구 거북로 115 (석남동)032-571-25692023-07-31
89정다운온누리약국인천광역시 서구 가정로 398 (가정동)<NA>2023-07-31
910명성약국인천광역시 서구 신진말로 37-1 (가좌동)<NA>2023-07-31
연번상호명소재지전화번호데이터기준일자
129130건강희망약국인천광역시 서구 완정로 153, 이레메디칼센타 101~102호 (왕길동)<NA>2023-07-31
130131보령검단약국인천광역시 서구 원당대로 1039, 태경타워 1층 105, 106호 (원당동)<NA>2023-07-31
131132청라바다약국인천광역시 서구 중봉대로612번길 10-16, 마르씨엘 105,106호 (청라동)<NA>2023-07-31
132133365검단우리약국인천광역시 서구 발산로5번길 12, 인천검단 엔젤리움1차 윈팰리스 107호 (원당동)<NA>2023-07-31
133134옵티마정다운약국인천광역시 서구 가정로 204 (석남동)<NA>2023-07-31
134135맑은약국인천광역시 서구 명가골로 37 (석남동)032-581-05802023-07-31
135136소소약국인천광역시 서구 장고개로 285, 104호 (가좌동)032-572-05252023-07-31
136137시티약국인천광역시 서구 발산로 6, 아인시티 주차타워 113호 (원당동)<NA>2023-07-31
137138검단센트럴약국인천광역시 서구 이음3로 149, 위너스 프라자 104호 (당하동)<NA>2023-07-31
138139365아라약국인천광역시 서구 이음5로 30, 연세프라자9 106호 (원당동)<NA>2023-07-31