Overview

Dataset statistics

Number of variables: 5
Number of observations: 158
Missing cells: 116
Missing cells (%): 14.7%
Duplicate rows: 1
Duplicate rows (%): 0.6%
Total size in memory: 6.5 KiB
Average record size in memory: 41.8 B
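
The missing-cell percentage follows from the other figures in this table. A small arithmetic check, assuming the percentage is taken over all cells (variables times observations) and that the 116 missing cells are the 29 missing values reported for each of four columns in the Alerts section:

```python
# Figures copied from the dataset statistics.
n_variables = 5
n_observations = 158
missing_cells = 116

total_cells = n_variables * n_observations        # every cell in the table
missing_pct = 100 * missing_cells / total_cells   # share of empty cells

# Four columns each report 29 missing values (see Alerts).
missing_per_column = 29
columns_with_missing = 4

print(total_cells)
print(round(missing_pct, 1))
print(missing_per_column * columns_with_missing == missing_cells)
```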

Variable types

Numeric: 1
Text: 3
Categorical: 1

Dataset

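Description: Data on the medical institutions in Namdong-gu, Incheon Metropolitan City, that are commissioned to provide national immunizations for infants and young children. It provides the fields 번호 (number), 병의원명 (hospital/clinic name), 주소 (address), 전화번호 (phone number), and 데이터기준일 (data reference date).
Author: Namdong-gu, Incheon Metropolitan City (인천광역시 남동구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15075445&srcSe=7661IVAWM27C61E190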

Alerts

Dataset has 1 (0.6%) duplicate rows [Duplicates]
번호 is highly overall correlated with 데이터 기준일 [High correlation]
데이터 기준일 is highly overall correlated with 번호 [High correlation]
번호 has 29 (18.4%) missing values [Missing]
병원(의원)명 has 29 (18.4%) missing values [Missing]
주소 has 29 (18.4%) missing values [Missing]
전화번호 has 29 (18.4%) missing values [Missing]

Reproduction

Analysis started: 2024-01-28 06:57:57.806852
Analysis finished: 2024-01-28 06:57:58.567526
Duration: 0.76 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
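
A report of this kind is typically produced by passing a pandas DataFrame to ydata-profiling. The following is only a sketch of that workflow: `build_report`, the file paths, the report title, and the default encoding are hypothetical and are not taken from the configuration used for this report.

```python
def build_report(csv_path, html_path, encoding="utf-8"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so the helper can be defined without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; Korean public datasets are sometimes
    # distributed as cp949 rather than UTF-8.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Profiling report")
    report.to_file(html_path)
    return report
```

Calling `build_report("data.csv", "report.html")` with a real file path would write the HTML report to disk.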

Variables

번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 129
Distinct (%): 100.0%
Missing: 29
Missing (%): 18.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 65
Minimum: 1
Maximum: 129
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 7.4
Q1: 33
Median: 65
Q3: 97
95-th percentile: 122.6
Maximum: 129
Range: 128
Interquartile range (IQR): 64

Descriptive statistics

Standard deviation: 37.383151
Coefficient of variation (CV): 0.5751254
Kurtosis: -1.2
Mean: 65
Median Absolute Deviation (MAD): 32
Skewness: 0
Sum: 8385
Variance: 1397.5
Monotonicity: Strictly increasing
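
These figures are what one would expect if 번호 is simply the sequence 1 to 129. A quick check with the standard library, under that assumption; `quantile` is a hypothetical helper that uses linear interpolation, which happens to match the percentiles reported here.

```python
import statistics

# Assumption: 번호 takes every integer from 1 to 129 exactly once.
values = list(range(1, 130))

def quantile(sorted_values, q):
    """Quantile with linear interpolation between neighbouring ranks."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    step = sorted_values[upper] - sorted_values[lower]
    return sorted_values[lower] + (pos - lower) * step

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = statistics.stdev(values)
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])

print(sum(values), mean, variance)
print(round(std, 6), round(std / mean, 7))
print(round(quantile(values, 0.05), 1), round(quantile(values, 0.95), 1))
print(quantile(values, 0.25), quantile(values, 0.75), mad)
```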
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
98 | 1 | 0.6%
96 | 1 | 0.6%
95 | 1 | 0.6%
94 | 1 | 0.6%
93 | 1 | 0.6%
92 | 1 | 0.6%
91 | 1 | 0.6%
90 | 1 | 0.6%
89 | 1 | 0.6%
88 | 1 | 0.6%
Other values (119) | 119 | 75.3%
(Missing) | 29 | 18.4%

Value | Count | Frequency (%)
1 | 1 | 0.6%
2 | 1 | 0.6%
3 | 1 | 0.6%
4 | 1 | 0.6%
5 | 1 | 0.6%
6 | 1 | 0.6%
7 | 1 | 0.6%
8 | 1 | 0.6%
9 | 1 | 0.6%
10 | 1 | 0.6%

Value | Count | Frequency (%)
129 | 1 | 0.6%
128 | 1 | 0.6%
127 | 1 | 0.6%
126 | 1 | 0.6%
125 | 1 | 0.6%
124 | 1 | 0.6%
123 | 1 | 0.6%
122 | 1 | 0.6%
121 | 1 | 0.6%
120 | 1 | 0.6%

병원(의원)명
Text

MISSING 

Distinct: 129
Distinct (%): 100.0%
Missing: 29
Missing (%): 18.4%
Memory size: 1.4 KiB

Length

Max length: 18
Median length: 13
Mean length: 8.7054264
Min length: 3

Characters and Unicode

Total characters: 1123
Distinct characters: 174
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
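
The category statistics in this section can be approximated with Python's standard `unicodedata` module. This is a minimal sketch, assuming the report groups characters by Unicode general category (the two-letter codes such as `Lo` and `Nd` correspond to the names Other Letter and Decimal Number); `category_counts` is a hypothetical helper, not a ydata-profiling function.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# A value taken from the sample rows of this variable.
name = "153신경외과의원"
counts = category_counts(name)

print(counts["Nd"])  # decimal digits
print(counts["Lo"])  # "Other Letter", which covers Hangul syllables
```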

Unique

Unique: 129
Unique (%): 100.0%

Sample

1st row: 153신경외과의원
2nd row: 21세기미소내과의원
3rd row: 간석메디정형외과의원
4th row: 고은소아청소년과의원
5th row: 꿈나무들소아청소년과의원
Value | Count | Frequency (%)
미듬메디컬의원 | 1 | 0.8%
인천아시아드병원 | 1 | 0.8%
인천속내과의원 | 1 | 0.8%
인사랑내과의원 | 1 | 0.8%
인구보건복지협회인천지회가족보건의원 | 1 | 0.8%
이화웰소아청소년과의원 | 1 | 0.8%
이형원소아청소년과의원 | 1 | 0.8%
이한진가정의학과의원 | 1 | 0.8%
이진호내과의원 | 1 | 0.8%
이이비인후과의원 | 1 | 0.8%
Other values (121) | 121 | 92.4%

Most occurring characters

ValueCountFrequency (%)
131
 
11.7%
130
 
11.6%
96
 
8.5%
63
 
5.6%
40
 
3.6%
38
 
3.4%
36
 
3.2%
30
 
2.7%
30
 
2.7%
29
 
2.6%
Other values (164) 500
44.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1110 | 98.8%
Decimal Number | 8 | 0.7%
Space Separator | 2 | 0.2%
Close Punctuation | 1 | 0.1%
Lowercase Letter | 1 | 0.1%
Open Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
131
 
11.8%
130
 
11.7%
96
 
8.6%
63
 
5.7%
40
 
3.6%
38
 
3.4%
36
 
3.2%
30
 
2.7%
30
 
2.7%
29
 
2.6%
Other values (155) 487
43.9%
Decimal Number
Value | Count | Frequency (%)
1 | 2 | 25.0%
3 | 2 | 25.0%
5 | 2 | 25.0%
2 | 1 | 12.5%
6 | 1 | 12.5%

Space Separator
Value | Count | Frequency (%)
(space) | 2 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
i | 1 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1110 | 98.8%
Common | 12 | 1.1%
Latin | 1 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
131
 
11.8%
130
 
11.7%
96
 
8.6%
63
 
5.7%
40
 
3.6%
38
 
3.4%
36
 
3.2%
30
 
2.7%
30
 
2.7%
29
 
2.6%
Other values (155) 487
43.9%
Common
Value | Count | Frequency (%)
1 | 2 | 16.7%
3 | 2 | 16.7%
5 | 2 | 16.7%
(space) | 2 | 16.7%
2 | 1 | 8.3%
) | 1 | 8.3%
( | 1 | 8.3%
6 | 1 | 8.3%

Latin
Value | Count | Frequency (%)
i | 1 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1110 | 98.8%
ASCII | 13 | 1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
131
 
11.8%
130
 
11.7%
96
 
8.6%
63
 
5.7%
40
 
3.6%
38
 
3.4%
36
 
3.2%
30
 
2.7%
30
 
2.7%
29
 
2.6%
Other values (155) 487
43.9%
ASCII
Value | Count | Frequency (%)
1 | 2 | 15.4%
3 | 2 | 15.4%
5 | 2 | 15.4%
(space) | 2 | 15.4%
2 | 1 | 7.7%
) | 1 | 7.7%
i | 1 | 7.7%
( | 1 | 7.7%
6 | 1 | 7.7%

주소
Text

MISSING 

Distinct: 129
Distinct (%): 100.0%
Missing: 29
Missing (%): 18.4%
Memory size: 1.4 KiB

Length

Max length: 59
Median length: 46
Mean length: 36.697674
Min length: 24

Characters and Unicode

Total characters: 4734
Distinct characters: 204
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 129
Unique (%): 100.0%

Sample

1st row: 인천광역시 남동구 호구포로 830, (구월동) 5층
2nd row: 인천광역시 남동구 백범로 322, (간석동, 진메디칼센터) 2층
3rd row: 인천광역시 남동구 백범로 276, (간석동) 자하1,지상1~3층
4th row: 인천광역시 남동구 구월남로 280, (구월동, 알파프라자) 3층
5th row: 인천광역시 남동구 구월로 212, (구월동, 힐캐슬프라자) 2층 206호
Value | Count | Frequency (%)
인천광역시 | 129 | 14.6%
남동구 | 129 | 14.6%
구월동 | 34 | 3.9%
논현동 | 27 | 3.1%
만수동 | 27 | 3.1%
2층 | 26 | 2.9%
간석동 | 24 | 2.7%
3층 | 19 | 2.2%
호구포로 | 14 | 1.6%
서창동 | 12 | 1.4%
Other values (297) | 441 | 50.0%
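
The word counts for this variable suggest that addresses are split into tokens and that surrounding punctuation is dropped, since 구월동 is counted without its parentheses. The exact tokenizer used by the report is not shown, so the rule below (split on whitespace, then trim parentheses and commas) is an assumption, and `tokenize` is a hypothetical helper.

```python
from collections import Counter

# Two addresses from the sample rows of this variable.
addresses = [
    "인천광역시 남동구 호구포로 830, (구월동) 5층",
    "인천광역시 남동구 구월남로 280, (구월동, 알파프라자) 3층",
]

def tokenize(text):
    """Split on whitespace and trim surrounding punctuation (assumed rule)."""
    tokens = (tok.strip("(),") for tok in text.split())
    return [tok for tok in tokens if tok]

word_counts = Counter(tok for addr in addresses for tok in tokenize(addr))
print(word_counts.most_common(3))
```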

Most occurring characters

ValueCountFrequency (%)
770
 
16.3%
274
 
5.8%
, 232
 
4.9%
193
 
4.1%
157
 
3.3%
2 150
 
3.2%
147
 
3.1%
134
 
2.8%
131
 
2.8%
131
 
2.8%
Other values (194) 2415
51.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2639 | 55.7%
Decimal Number | 806 | 17.0%
Space Separator | 770 | 16.3%
Other Punctuation | 233 | 4.9%
Open Punctuation | 126 | 2.7%
Close Punctuation | 126 | 2.7%
Dash Punctuation | 15 | 0.3%
Math Symbol | 10 | 0.2%
Uppercase Letter | 9 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
274
 
10.4%
193
 
7.3%
157
 
5.9%
147
 
5.6%
134
 
5.1%
131
 
5.0%
131
 
5.0%
130
 
4.9%
128
 
4.9%
100
 
3.8%
Other values (170) 1114
42.2%
Decimal Number
Value | Count | Frequency (%)
2 | 150 | 18.6%
3 | 124 | 15.4%
0 | 120 | 14.9%
1 | 107 | 13.3%
4 | 82 | 10.2%
7 | 62 | 7.7%
5 | 52 | 6.5%
8 | 43 | 5.3%
6 | 41 | 5.1%
9 | 25 | 3.1%

Uppercase Letter
Value | Count | Frequency (%)
A | 3 | 33.3%
V | 1 | 11.1%
C | 1 | 11.1%
G | 1 | 11.1%
L | 1 | 11.1%
J | 1 | 11.1%
S | 1 | 11.1%

Other Punctuation
Value | Count | Frequency (%)
, | 232 | 99.6%
: | 1 | 0.4%

Space Separator
Value | Count | Frequency (%)
(space) | 770 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 126 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 126 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 15 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 10 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2639 | 55.7%
Common | 2086 | 44.1%
Latin | 9 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
274
 
10.4%
193
 
7.3%
157
 
5.9%
147
 
5.6%
134
 
5.1%
131
 
5.0%
131
 
5.0%
130
 
4.9%
128
 
4.9%
100
 
3.8%
Other values (170) 1114
42.2%
Common
Value | Count | Frequency (%)
(space) | 770 | 36.9%
, | 232 | 11.1%
2 | 150 | 7.2%
( | 126 | 6.0%
) | 126 | 6.0%
3 | 124 | 5.9%
0 | 120 | 5.8%
1 | 107 | 5.1%
4 | 82 | 3.9%
7 | 62 | 3.0%
Other values (7) | 187 | 9.0%

Latin
Value | Count | Frequency (%)
A | 3 | 33.3%
V | 1 | 11.1%
C | 1 | 11.1%
G | 1 | 11.1%
L | 1 | 11.1%
J | 1 | 11.1%
S | 1 | 11.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2639 | 55.7%
ASCII | 2095 | 44.3%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 770 | 36.8%
, | 232 | 11.1%
2 | 150 | 7.2%
( | 126 | 6.0%
) | 126 | 6.0%
3 | 124 | 5.9%
0 | 120 | 5.7%
1 | 107 | 5.1%
4 | 82 | 3.9%
7 | 62 | 3.0%
Other values (14) | 196 | 9.4%

Hangul
ValueCountFrequency (%)
274
 
10.4%
193
 
7.3%
157
 
5.9%
147
 
5.6%
134
 
5.1%
131
 
5.0%
131
 
5.0%
130
 
4.9%
128
 
4.9%
100
 
3.8%
Other values (170) 1114
42.2%

전화번호
Text

MISSING 

Distinct: 129
Distinct (%): 100.0%
Missing: 29
Missing (%): 18.4%
Memory size: 1.4 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.015504
Min length: 12

Characters and Unicode

Total characters: 1550
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
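
The length and character totals for this variable pin down how many phone numbers are longer than the usual 12 characters. The sketch below does that arithmetic and then shows one way such values could be validated; the pattern `PHONE_RE` (area code 032, then 3 or 4 digits, then 4 digits) is an assumption based on the sample rows, not a rule stated by the dataset.

```python
import re

# Figures from the Length and Characters tables for this variable.
n_values = 129
total_chars = 1550
min_len, max_len = 12, 13

# With lengths limited to 12 or 13, every character above the minimum
# total corresponds to one 13-character value.
n_long = total_chars - n_values * min_len
print(n_long)

# Hypothetical validation pattern inferred from the sample rows.
PHONE_RE = re.compile(r"^032-\d{3,4}-\d{4}$")
print(bool(PHONE_RE.match("032-217-1100")))
```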

Unique

Unique: 129
Unique (%): 100.0%

Sample

1st row: 032-217-1100
2nd row: 032-431-7715
3rd row: 032-429-3600
4th row: 032-472-5292
5th row: 032-473-7585
Value | Count | Frequency (%)
032-471-3900 | 1 | 0.8%
032-437-7585 | 1 | 0.8%
032-437-5002 | 1 | 0.8%
032-431-0119 | 1 | 0.8%
032-765-8275 | 1 | 0.8%
032-451-4000 | 1 | 0.8%
032-472-2123 | 1 | 0.8%
032-442-3477 | 1 | 0.8%
032-461-5525 | 1 | 0.8%
032-431-8575 | 1 | 0.8%
Other values (119) | 119 | 92.2%

Most occurring characters

Value | Count | Frequency (%)
- | 258 | 16.6%
3 | 223 | 14.4%
2 | 215 | 13.9%
0 | 214 | 13.8%
4 | 166 | 10.7%
7 | 106 | 6.8%
5 | 106 | 6.8%
6 | 88 | 5.7%
1 | 75 | 4.8%
8 | 53 | 3.4%
3.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1292 | 83.4%
Dash Punctuation | 258 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 223 | 17.3%
2 | 215 | 16.6%
0 | 214 | 16.6%
4 | 166 | 12.8%
7 | 106 | 8.2%
5 | 106 | 8.2%
6 | 88 | 6.8%
1 | 75 | 5.8%
8 | 53 | 4.1%
9 | 46 | 3.6%

Dash Punctuation
Value | Count | Frequency (%)
- | 258 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1550 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 258 | 16.6%
3 | 223 | 14.4%
2 | 215 | 13.9%
0 | 214 | 13.8%
4 | 166 | 10.7%
7 | 106 | 6.8%
5 | 106 | 6.8%
6 | 88 | 5.7%
1 | 75 | 4.8%
8 | 53 | 3.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1550 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 258 | 16.6%
3 | 223 | 14.4%
2 | 215 | 13.9%
0 | 214 | 13.8%
4 | 166 | 10.7%
7 | 106 | 6.8%
5 | 106 | 6.8%
6 | 88 | 5.7%
1 | 75 | 4.8%
8 | 53 | 3.4%

데이터 기준일
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
2022-12-12 | 129
<NA> | 29

Length

Max length: 10
Median length: 10
Mean length: 8.8987342
Min length: 4
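
The minimum and mean lengths are consistent with the placeholder `<NA>` being stored as a 4-character string in 29 rows, which would also explain why this variable reports 0 missing values. A quick check of that reading, using only the counts shown for this variable:

```python
# Counts from the frequency table for this variable.
counts = {"2022-12-12": 129, "<NA>": 29}

total_chars = sum(len(value) * n for value, n in counts.items())
total_rows = sum(counts.values())
mean_length = total_chars / total_rows

print(total_rows)
print(round(mean_length, 7))
```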

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022-12-12
2nd row: 2022-12-12
3rd row: 2022-12-12
4th row: 2022-12-12
5th row: 2022-12-12

Common Values

Value | Count | Frequency (%)
2022-12-12 | 129 | 81.6%
<NA> | 29 | 18.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2022-12-12 | 129 | 81.6%
na | 29 | 18.4%

Interactions

[Figure: interactions plot]

Correlations

Variable | 번호
번호 | 1.000

Variable | 번호 | 데이터 기준일
번호 | 1.000 | 1.000
데이터 기준일 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
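
Nullity correlation can be illustrated by correlating per-column missingness indicators. The sketch below assumes that the same 29 rows are empty in both 번호 and 주소, which is what the fully empty rows in the Duplicate rows section suggest; `pearson` is a hypothetical helper, and the row order is an assumption.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Nullity indicators (1 = missing): 129 populated rows followed by 29 empty
# rows, matching the missing counts reported for 번호 and 주소.
null_number = [0] * 129 + [1] * 29
null_address = [0] * 129 + [1] * 29

print(round(pearson(null_number, null_address), 3))
```

Columns that are always missing together have a nullity correlation of 1; a column with no missing values has a constant indicator, for which the correlation is undefined.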

Sample

Index | 번호 | 병원(의원)명 | 주소 | 전화번호 | 데이터 기준일
0 | 1 | 153신경외과의원 | 인천광역시 남동구 호구포로 830, (구월동) 5층 | 032-217-1100 | 2022-12-12
1 | 2 | 21세기미소내과의원 | 인천광역시 남동구 백범로 322, (간석동, 진메디칼센터) 2층 | 032-431-7715 | 2022-12-12
2 | 3 | 간석메디정형외과의원 | 인천광역시 남동구 백범로 276, (간석동) 자하1,지상1~3층 | 032-429-3600 | 2022-12-12
3 | 4 | 고은소아청소년과의원 | 인천광역시 남동구 구월남로 280, (구월동, 알파프라자) 3층 | 032-472-5292 | 2022-12-12
4 | 5 | 꿈나무들소아청소년과의원 | 인천광역시 남동구 구월로 212, (구월동, 힐캐슬프라자) 2층 206호 | 032-473-7585 | 2022-12-12
5 | 6 | 나은요양병원 | 인천광역시 남동구 소래역남로16번길 20, (논현동) 둘리프라자 | 032-710-6001 | 2022-12-12
6 | 7 | 남동성심의원 | 인천광역시 남동구 장승로 36, (만수동) 지하1~지상2층 전체호 | 032-465-4494 | 2022-12-12
7 | 8 | 남동우리메디칼의원 | 인천광역시 남동구 남동대로 705, (구월동) 1층일부,2층,3층,5층 | 032-433-7171 | 2022-12-12
8 | 9 | 남촌가정의원 | 인천광역시 남동구 호구포로535번길 48-2, (남촌동, 부경빌딩) 3층 | 032-467-3235 | 2022-12-12
9 | 10 | 노상수소아청소년과의원 | 인천광역시 남동구 남동대로 890, (간석동, 탑메디칼) 201호 노상수소아과 | 032-435-5577 | 2022-12-12

Index | 번호 | 병원(의원)명 | 주소 | 전화번호 | 데이터 기준일
148 | <NA> | <NA> | <NA> | <NA> | <NA>
149 | <NA> | <NA> | <NA> | <NA> | <NA>
150 | <NA> | <NA> | <NA> | <NA> | <NA>
151 | <NA> | <NA> | <NA> | <NA> | <NA>
152 | <NA> | <NA> | <NA> | <NA> | <NA>
153 | <NA> | <NA> | <NA> | <NA> | <NA>
154 | <NA> | <NA> | <NA> | <NA> | <NA>
155 | <NA> | <NA> | <NA> | <NA> | <NA>
156 | <NA> | <NA> | <NA> | <NA> | <NA>
157 | <NA> | <NA> | <NA> | <NA> | <NA>

Duplicate rows

Most frequently occurring

Index | 번호 | 병원(의원)명 | 주소 | 전화번호 | 데이터 기준일 | # duplicates
0 | <NA> | <NA> | <NA> | <NA> | <NA> | 29
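
The overview reports 1 duplicate row while this table reports 29 duplicates. One reading, inferred from the numbers rather than stated by the report, is that the table lists each distinct duplicated row once together with how often it occurs. A toy illustration with three empty rows instead of 29; the three-column rows are a simplification of the real five-column records.

```python
from collections import Counter

NA = "<NA>"

# Toy data: two populated records from the Sample section plus three
# fully empty rows standing in for the 29 empty rows of the dataset.
rows = [
    (1, "153신경외과의원", "032-217-1100"),
    (2, "21세기미소내과의원", "032-431-7715"),
    (NA, NA, NA),
    (NA, NA, NA),
    (NA, NA, NA),
]

row_counts = Counter(rows)
duplicate_groups = {row: n for row, n in row_counts.items() if n > 1}

print(len(duplicate_groups))            # distinct rows occurring more than once
print(list(duplicate_groups.values()))  # occurrences of each duplicated row
```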