Overview

Dataset statistics

Number of variables4
Number of observations170
Missing cells402
Missing cells (%)59.1%
Duplicate rows1
Duplicate rows (%)0.6%
Total size in memory5.4 KiB
Average record size in memory32.8 B

Variable types

Text3
Categorical1

Dataset

Description인천광역시 서구 외과의원의 현황에 대한 데이터입니다. 이 데이터는 의원명, 소재지, 전화번호 등에 대한 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15086611/fileData.do

Alerts

Dataset has 1 (0.6%) duplicate rowsDuplicates
의원명 has 134 (78.8%) missing valuesMissing
소재지 has 134 (78.8%) missing valuesMissing
전화번호 has 134 (78.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 07:36:01.265918
Analysis finished2023-12-12 07:36:01.789929
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

의원명
Text

MISSING 

Distinct36
Distinct (%)100.0%
Missing134
Missing (%)78.8%
Memory size1.5 KiB
2023-12-12T16:36:01.952074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length8.8611111
Min length6

Characters and Unicode

Total characters319
Distinct characters71
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row서울정석정형외과의원
2nd row삼성플러스유외과의원
3rd row위풍당당정형외과의원
4th row성모퍼스트정형외과의원
5th row검단정형외과의원
ValueCountFrequency (%)
삼성플러스유외과의원 1
 
2.8%
위풍당당정형외과의원 1
 
2.8%
가좌정형외과의원 1
 
2.8%
연세항외과의원 1
 
2.8%
원당연세정형외과의원 1
 
2.8%
정정형외과의원 1
 
2.8%
이정형외과의원 1
 
2.8%
정재운정형외과의원 1
 
2.8%
엄기영정형외과의원 1
 
2.8%
가좌신경외과의원 1
 
2.8%
Other values (26) 26
72.2%
2023-12-12T16:36:02.369047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
37
11.6%
36
11.3%
36
11.3%
36
11.3%
32
 
10.0%
29
 
9.1%
7
 
2.2%
6
 
1.9%
6
 
1.9%
5
 
1.6%
Other values (61) 89
27.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 319
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
11.6%
36
11.3%
36
11.3%
36
11.3%
32
 
10.0%
29
 
9.1%
7
 
2.2%
6
 
1.9%
6
 
1.9%
5
 
1.6%
Other values (61) 89
27.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 319
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
11.6%
36
11.3%
36
11.3%
36
11.3%
32
 
10.0%
29
 
9.1%
7
 
2.2%
6
 
1.9%
6
 
1.9%
5
 
1.6%
Other values (61) 89
27.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 319
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
37
11.6%
36
11.3%
36
11.3%
36
11.3%
32
 
10.0%
29
 
9.1%
7
 
2.2%
6
 
1.9%
6
 
1.9%
5
 
1.6%
Other values (61) 89
27.9%

소재지
Text

MISSING 

Distinct36
Distinct (%)100.0%
Missing134
Missing (%)78.8%
Memory size1.5 KiB
2023-12-12T16:36:02.693835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length40
Mean length33.944444
Min length21

Characters and Unicode

Total characters1222
Distinct characters124
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row인천광역시 서구 이음대로 388, ABM타워 406~409호 (원당동)
2nd row인천광역시 서구 이음대로 388, ABM타워 6층 603~607호 (원당동)
3rd row인천광역시 서구 검단로 480, 검단리치웰프라자 301~303호 (왕길동)
4th row인천광역시 서구 원적로 120, 인성프라자 202~203호 (가좌동)
5th row인천광역시 서구 이음대로 378, 로뎀타워 705~709호 (원당동)
ValueCountFrequency (%)
인천광역시 36
 
14.9%
서구 36
 
14.9%
가정로 7
 
2.9%
청라동 6
 
2.5%
석남동 5
 
2.1%
가좌동 5
 
2.1%
원적로 4
 
1.7%
왕길동 4
 
1.7%
청라라임로 3
 
1.2%
이음대로 3
 
1.2%
Other values (113) 132
54.8%
2023-12-12T16:36:03.156270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
205
 
16.8%
, 47
 
3.8%
40
 
3.3%
0 39
 
3.2%
39
 
3.2%
3 38
 
3.1%
37
 
3.0%
) 36
 
2.9%
36
 
2.9%
36
 
2.9%
Other values (114) 669
54.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 649
53.1%
Decimal Number 228
 
18.7%
Space Separator 205
 
16.8%
Other Punctuation 47
 
3.8%
Close Punctuation 36
 
2.9%
Open Punctuation 36
 
2.9%
Math Symbol 13
 
1.1%
Uppercase Letter 6
 
0.5%
Dash Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
6.2%
39
 
6.0%
37
 
5.7%
36
 
5.5%
36
 
5.5%
36
 
5.5%
36
 
5.5%
36
 
5.5%
36
 
5.5%
23
 
3.5%
Other values (95) 294
45.3%
Decimal Number
ValueCountFrequency (%)
0 39
17.1%
3 38
16.7%
1 25
11.0%
4 22
9.6%
6 22
9.6%
2 20
8.8%
5 19
8.3%
8 18
7.9%
7 18
7.9%
9 7
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
B 2
33.3%
M 2
33.3%
A 2
33.3%
Space Separator
ValueCountFrequency (%)
205
100.0%
Other Punctuation
ValueCountFrequency (%)
, 47
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Math Symbol
ValueCountFrequency (%)
~ 13
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 649
53.1%
Common 567
46.4%
Latin 6
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
 
6.2%
39
 
6.0%
37
 
5.7%
36
 
5.5%
36
 
5.5%
36
 
5.5%
36
 
5.5%
36
 
5.5%
36
 
5.5%
23
 
3.5%
Other values (95) 294
45.3%
Common
ValueCountFrequency (%)
205
36.2%
, 47
 
8.3%
0 39
 
6.9%
3 38
 
6.7%
) 36
 
6.3%
( 36
 
6.3%
1 25
 
4.4%
4 22
 
3.9%
6 22
 
3.9%
2 20
 
3.5%
Other values (6) 77
 
13.6%
Latin
ValueCountFrequency (%)
B 2
33.3%
M 2
33.3%
A 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 649
53.1%
ASCII 573
46.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
205
35.8%
, 47
 
8.2%
0 39
 
6.8%
3 38
 
6.6%
) 36
 
6.3%
( 36
 
6.3%
1 25
 
4.4%
4 22
 
3.8%
6 22
 
3.8%
2 20
 
3.5%
Other values (9) 83
14.5%
Hangul
ValueCountFrequency (%)
40
 
6.2%
39
 
6.0%
37
 
5.7%
36
 
5.5%
36
 
5.5%
36
 
5.5%
36
 
5.5%
36
 
5.5%
36
 
5.5%
23
 
3.5%
Other values (95) 294
45.3%

전화번호
Text

MISSING 

Distinct36
Distinct (%)100.0%
Missing134
Missing (%)78.8%
Memory size1.5 KiB
2023-12-12T16:36:03.415144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.055556
Min length12

Characters and Unicode

Total characters434
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row032-287-1200
2nd row032-265-6075
3rd row032-567-7575
4th row032-571-1100
5th row032-561-6122
ValueCountFrequency (%)
032-265-6075 1
 
2.8%
032-567-7575 1
 
2.8%
032-577-1500 1
 
2.8%
032-568-2259 1
 
2.8%
032-569-2400 1
 
2.8%
032-567-7577 1
 
2.8%
032-561-3382 1
 
2.8%
032-584-0300 1
 
2.8%
032-571-0053 1
 
2.8%
032-577-4215 1
 
2.8%
Other values (26) 26
72.2%
2023-12-12T16:36:03.774744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 72
16.6%
0 64
14.7%
5 60
13.8%
2 58
13.4%
3 48
11.1%
7 43
9.9%
1 27
 
6.2%
6 25
 
5.8%
8 20
 
4.6%
9 9
 
2.1%
Other values (2) 8
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 361
83.2%
Dash Punctuation 72
 
16.6%
Math Symbol 1
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 64
17.7%
5 60
16.6%
2 58
16.1%
3 48
13.3%
7 43
11.9%
1 27
7.5%
6 25
 
6.9%
8 20
 
5.5%
9 9
 
2.5%
4 7
 
1.9%
Dash Punctuation
ValueCountFrequency (%)
- 72
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 434
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 72
16.6%
0 64
14.7%
5 60
13.8%
2 58
13.4%
3 48
11.1%
7 43
9.9%
1 27
 
6.2%
6 25
 
5.8%
8 20
 
4.6%
9 9
 
2.1%
Other values (2) 8
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 434
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 72
16.6%
0 64
14.7%
5 60
13.8%
2 58
13.4%
3 48
11.1%
7 43
9.9%
1 27
 
6.2%
6 25
 
5.8%
8 20
 
4.6%
9 9
 
2.1%
Other values (2) 8
 
1.8%
Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
<NA>
134 
2023-08-01
36 

Length

Max length10
Median length4
Mean length5.2705882
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-01
2nd row2023-08-01
3rd row2023-08-01
4th row2023-08-01
5th row2023-08-01

Common Values

ValueCountFrequency (%)
<NA> 134
78.8%
2023-08-01 36
 
21.2%

Length

2023-12-12T16:36:03.942730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:36:04.072274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 134
78.8%
2023-08-01 36
 
21.2%

Correlations

2023-12-12T16:36:04.517944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
의원명소재지전화번호
의원명1.0001.0001.000
소재지1.0001.0001.000
전화번호1.0001.0001.000

Missing values

2023-12-12T16:36:01.514783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:36:01.624131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T16:36:01.731797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

의원명소재지전화번호데이터기준일자
0서울정석정형외과의원인천광역시 서구 이음대로 388, ABM타워 406~409호 (원당동)032-287-12002023-08-01
1삼성플러스유외과의원인천광역시 서구 이음대로 388, ABM타워 6층 603~607호 (원당동)032-265-60752023-08-01
2위풍당당정형외과의원인천광역시 서구 검단로 480, 검단리치웰프라자 301~303호 (왕길동)032-567-75752023-08-01
3성모퍼스트정형외과의원인천광역시 서구 원적로 120, 인성프라자 202~203호 (가좌동)032-571-11002023-08-01
4검단정형외과의원인천광역시 서구 이음대로 378, 로뎀타워 705~709호 (원당동)032-561-61222023-08-01
5가정신현탑본정형외과의원인천광역시 서구 가정로 369, 서경 플러스 존 3층 304호 (신현동)032-719-75572023-08-01
6서울정형외과의원인천광역시 서구 염곡로 468, 드림타워 601호,7층,8층 (가정동)032-561-11402023-08-01
7소유외과의원인천광역시 서구 중봉대로 610, 603,604호 (청라동)032-292-10002023-08-01
8연세유쾌항외과의원인천광역시 서구 청라루비로 93, 루비타워 203,204호 (청라동)032-721-55852023-08-01
9나눔성형외과의원인천광역시 서구 검단로 480, 검단리치웰프라자 308~310호 (왕길동)032-567-77882023-08-01
의원명소재지전화번호데이터기준일자
160<NA><NA><NA><NA>
161<NA><NA><NA><NA>
162<NA><NA><NA><NA>
163<NA><NA><NA><NA>
164<NA><NA><NA><NA>
165<NA><NA><NA><NA>
166<NA><NA><NA><NA>
167<NA><NA><NA><NA>
168<NA><NA><NA><NA>
169<NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

의원명소재지전화번호데이터기준일자# duplicates
0<NA><NA><NA><NA>134