Overview

Dataset statistics

Number of variables4
Number of observations169
Missing cells417
Missing cells (%)61.7%
Duplicate rows1
Duplicate rows (%)0.6%
Total size in memory5.4 KiB
Average record size in memory32.8 B

Variable types

Text3
Categorical1

Dataset

Description인천광역시 서구 소아과의원의 현황에 대한 데이터입니다. 이 데이터는 의원명, 소재지, 전화번호 등에 대한 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15086609/fileData.do

Alerts

Dataset has 1 (0.6%) duplicate rowsDuplicates
의원명 has 139 (82.2%) missing valuesMissing
소재지 has 139 (82.2%) missing valuesMissing
전화번호 has 139 (82.2%) missing valuesMissing

Reproduction

Analysis started2023-12-12 10:28:01.768615
Analysis finished2023-12-12 10:28:02.310132
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

의원명
Text

MISSING 

Distinct30
Distinct (%)100.0%
Missing139
Missing (%)82.2%
Memory size1.4 KiB
2023-12-12T19:28:02.467298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length11.133333
Min length7

Characters and Unicode

Total characters334
Distinct characters77
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row튼튼나무소아청소년과의원
2nd row루원하다소아치과의원
3rd row꿈꾸는소아청소년과의원
4th row삼성퍼스트소아청소년과의원
5th row연세기린소아청소년과의원
ValueCountFrequency (%)
루원하다소아치과의원 1
 
3.3%
꿈꾸는소아청소년과의원 1
 
3.3%
임진영소아청소년과의원 1
 
3.3%
유앤박소아과의원 1
 
3.3%
가좌연세소아과의원 1
 
3.3%
이승철소아과의원 1
 
3.3%
이동희소아청소년과의원 1
 
3.3%
엄마와아이들소아청소년과산부인과의원 1
 
3.3%
우리소아과의원 1
 
3.3%
연세두리소아청소년과의원 1
 
3.3%
Other values (20) 20
66.7%
2023-12-12T19:28:02.909379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
56
16.8%
34
 
10.2%
32
 
9.6%
31
 
9.3%
30
 
9.0%
27
 
8.1%
25
 
7.5%
7
 
2.1%
4
 
1.2%
3
 
0.9%
Other values (67) 85
25.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 328
98.2%
Decimal Number 3
 
0.9%
Open Punctuation 1
 
0.3%
Lowercase Letter 1
 
0.3%
Close Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
56
17.1%
34
10.4%
32
9.8%
31
 
9.5%
30
 
9.1%
27
 
8.2%
25
 
7.6%
7
 
2.1%
4
 
1.2%
3
 
0.9%
Other values (61) 79
24.1%
Decimal Number
ValueCountFrequency (%)
5 1
33.3%
3 1
33.3%
6 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
i 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 328
98.2%
Common 5
 
1.5%
Latin 1
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
56
17.1%
34
10.4%
32
9.8%
31
 
9.5%
30
 
9.1%
27
 
8.2%
25
 
7.6%
7
 
2.1%
4
 
1.2%
3
 
0.9%
Other values (61) 79
24.1%
Common
ValueCountFrequency (%)
5 1
20.0%
( 1
20.0%
) 1
20.0%
3 1
20.0%
6 1
20.0%
Latin
ValueCountFrequency (%)
i 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 328
98.2%
ASCII 6
 
1.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
56
17.1%
34
10.4%
32
9.8%
31
 
9.5%
30
 
9.1%
27
 
8.2%
25
 
7.6%
7
 
2.1%
4
 
1.2%
3
 
0.9%
Other values (61) 79
24.1%
ASCII
ValueCountFrequency (%)
5 1
16.7%
( 1
16.7%
i 1
16.7%
) 1
16.7%
3 1
16.7%
6 1
16.7%

소재지
Text

MISSING 

Distinct30
Distinct (%)100.0%
Missing139
Missing (%)82.2%
Memory size1.4 KiB
2023-12-12T19:28:03.257890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length39
Mean length34.266667
Min length21

Characters and Unicode

Total characters1028
Distinct characters130
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row인천광역시 서구 이음1로 383, 세중시그니쳐 6~7층 607~611,701~711호 (원당동)
2nd row인천광역시 서구 염곡로464번길 15, 304~306호 (가정동)
3rd row인천광역시 서구 이음대로 378, 로뎀타워 507, 508호 (원당동)
4th row인천광역시 서구 이음5로 80, 검단퍼스트 603호 (원당동)
5th row인천광역시 서구 이음5로 60, JS프라자 309~311호 (원당동)
ValueCountFrequency (%)
인천광역시 30
 
15.2%
서구 30
 
15.2%
원당동 5
 
2.5%
가정동 4
 
2.0%
2층 4
 
2.0%
201호 3
 
1.5%
당하동 3
 
1.5%
202호 3
 
1.5%
청라동 3
 
1.5%
신현동 2
 
1.0%
Other values (97) 111
56.1%
2023-12-12T19:28:03.802829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
168
 
16.3%
, 41
 
4.0%
36
 
3.5%
0 33
 
3.2%
32
 
3.1%
31
 
3.0%
31
 
3.0%
) 30
 
2.9%
30
 
2.9%
30
 
2.9%
Other values (120) 566
55.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 562
54.7%
Decimal Number 190
 
18.5%
Space Separator 168
 
16.3%
Other Punctuation 41
 
4.0%
Close Punctuation 30
 
2.9%
Open Punctuation 30
 
2.9%
Math Symbol 5
 
0.5%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
6.4%
32
 
5.7%
31
 
5.5%
31
 
5.5%
30
 
5.3%
30
 
5.3%
30
 
5.3%
30
 
5.3%
30
 
5.3%
21
 
3.7%
Other values (103) 261
46.4%
Decimal Number
ValueCountFrequency (%)
0 33
17.4%
1 28
14.7%
2 26
13.7%
3 25
13.2%
7 16
8.4%
5 16
8.4%
8 14
7.4%
4 14
7.4%
6 11
 
5.8%
9 7
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
J 1
50.0%
S 1
50.0%
Space Separator
ValueCountFrequency (%)
168
100.0%
Other Punctuation
ValueCountFrequency (%)
, 41
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 562
54.7%
Common 464
45.1%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
6.4%
32
 
5.7%
31
 
5.5%
31
 
5.5%
30
 
5.3%
30
 
5.3%
30
 
5.3%
30
 
5.3%
30
 
5.3%
21
 
3.7%
Other values (103) 261
46.4%
Common
ValueCountFrequency (%)
168
36.2%
, 41
 
8.8%
0 33
 
7.1%
) 30
 
6.5%
( 30
 
6.5%
1 28
 
6.0%
2 26
 
5.6%
3 25
 
5.4%
7 16
 
3.4%
5 16
 
3.4%
Other values (5) 51
 
11.0%
Latin
ValueCountFrequency (%)
J 1
50.0%
S 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 562
54.7%
ASCII 466
45.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
168
36.1%
, 41
 
8.8%
0 33
 
7.1%
) 30
 
6.4%
( 30
 
6.4%
1 28
 
6.0%
2 26
 
5.6%
3 25
 
5.4%
7 16
 
3.4%
5 16
 
3.4%
Other values (7) 53
 
11.4%
Hangul
ValueCountFrequency (%)
36
 
6.4%
32
 
5.7%
31
 
5.5%
31
 
5.5%
30
 
5.3%
30
 
5.3%
30
 
5.3%
30
 
5.3%
30
 
5.3%
21
 
3.7%
Other values (103) 261
46.4%

전화번호
Text

MISSING 

Distinct30
Distinct (%)100.0%
Missing139
Missing (%)82.2%
Memory size1.4 KiB
2023-12-12T19:28:04.084660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters360
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row032-719-7559
2nd row032-567-2272
3rd row032-564-1675
4th row032-563-3715
5th row032-719-4620
ValueCountFrequency (%)
032-567-2272 1
 
3.3%
032-564-1675 1
 
3.3%
032-576-0870 1
 
3.3%
032-573-3297 1
 
3.3%
032-575-5275 1
 
3.3%
032-583-7523 1
 
3.3%
032-565-7575 1
 
3.3%
032-572-1771 1
 
3.3%
032-565-0075 1
 
3.3%
032-562-6069 1
 
3.3%
Other values (20) 20
66.7%
2023-12-12T19:28:04.467841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 60
16.7%
5 56
15.6%
2 47
13.1%
0 41
11.4%
7 40
11.1%
3 39
10.8%
6 27
7.5%
9 15
 
4.2%
1 14
 
3.9%
8 14
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 300
83.3%
Dash Punctuation 60
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 56
18.7%
2 47
15.7%
0 41
13.7%
7 40
13.3%
3 39
13.0%
6 27
9.0%
9 15
 
5.0%
1 14
 
4.7%
8 14
 
4.7%
4 7
 
2.3%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 360
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 60
16.7%
5 56
15.6%
2 47
13.1%
0 41
11.4%
7 40
11.1%
3 39
10.8%
6 27
7.5%
9 15
 
4.2%
1 14
 
3.9%
8 14
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 360
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 60
16.7%
5 56
15.6%
2 47
13.1%
0 41
11.4%
7 40
11.1%
3 39
10.8%
6 27
7.5%
9 15
 
4.2%
1 14
 
3.9%
8 14
 
3.9%
Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
<NA>
139 
2023-08-01
30 

Length

Max length10
Median length4
Mean length5.0650888
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-01
2nd row2023-08-01
3rd row2023-08-01
4th row2023-08-01
5th row2023-08-01

Common Values

ValueCountFrequency (%)
<NA> 139
82.2%
2023-08-01 30
 
17.8%

Length

2023-12-12T19:28:04.628482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:28:04.751097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 139
82.2%
2023-08-01 30
 
17.8%

Correlations

2023-12-12T19:28:04.833609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
의원명소재지전화번호
의원명1.0001.0001.000
소재지1.0001.0001.000
전화번호1.0001.0001.000

Missing values

2023-12-12T19:28:02.022697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:28:02.137636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T19:28:02.239635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

의원명소재지전화번호데이터기준일자
0튼튼나무소아청소년과의원인천광역시 서구 이음1로 383, 세중시그니쳐 6~7층 607~611,701~711호 (원당동)032-719-75592023-08-01
1루원하다소아치과의원인천광역시 서구 염곡로464번길 15, 304~306호 (가정동)032-567-22722023-08-01
2꿈꾸는소아청소년과의원인천광역시 서구 이음대로 378, 로뎀타워 507, 508호 (원당동)032-564-16752023-08-01
3삼성퍼스트소아청소년과의원인천광역시 서구 이음5로 80, 검단퍼스트 603호 (원당동)032-563-37152023-08-01
4연세기린소아청소년과의원인천광역시 서구 이음5로 60, JS프라자 309~311호 (원당동)032-719-46202023-08-01
5청라미소소아청소년과의원인천광역시 서구 청라커낼로288번길 10, 더스페이스타워 505호 (청라동)032-563-72272023-08-01
6킹소아청소년과의원인천광역시 서구 완정로 172, 402호 (마전동)032-568-51192023-08-01
7아이사랑소아청소년과의원인천광역시 서구 봉오재3로 40, 202호 (가정동)032-561-02792023-08-01
8사랑소아청소년과의원인천광역시 서구 새오개로111번안길 31, 201호 (신현동)032-572-82772023-08-01
9미래안소아청소년과의원인천광역시 서구 검단로 480, 510호 (왕길동, 검단리치웰프라자)032-568-52552023-08-01
의원명소재지전화번호데이터기준일자
159<NA><NA><NA><NA>
160<NA><NA><NA><NA>
161<NA><NA><NA><NA>
162<NA><NA><NA><NA>
163<NA><NA><NA><NA>
164<NA><NA><NA><NA>
165<NA><NA><NA><NA>
166<NA><NA><NA><NA>
167<NA><NA><NA><NA>
168<NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

의원명소재지전화번호데이터기준일자# duplicates
0<NA><NA><NA><NA>139