Overview

Dataset statistics

Number of variables13
Number of observations232
Missing cells53
Missing cells (%)1.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.7 KiB
Average record size in memory104.6 B

Variable types

Unsupported3
Text6
Categorical4

Dataset

Description공공의룍기관의 현황으로 일반, 인력, 전문의, 시설(병상), 장비 현황을 포함. 공공의료기관은 공공보건의료기관 중 공공보건기관(보건소, 보건의료원, 보건지소, 보건진료소)을 제외한 의료기관으로 정의함.
Author국립중앙의료원
URLhttps://www.data.go.kr/data/15096111/fileData.do

Alerts

Unnamed: 2 is highly overall correlated with Unnamed: 5 and 1 other fieldsHigh correlation
Unnamed: 3 is highly overall correlated with Unnamed: 5High correlation
Unnamed: 5 is highly overall correlated with Unnamed: 2 and 1 other fieldsHigh correlation
Unnamed: 6 is highly overall correlated with Unnamed: 2High correlation
Unnamed: 6 is highly imbalanced (51.7%)Imbalance
Unnamed: 8 has 10 (4.3%) missing valuesMissing
Unnamed: 10 has 11 (4.7%) missing valuesMissing
Unnamed: 11 has 11 (4.7%) missing valuesMissing
Unnamed: 12 has 11 (4.7%) missing valuesMissing
공공의료기관 현황 (2021.12.31 기준) is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-21 15:22:36.680526
Analysis finished2024-04-21 15:22:38.710323
Duration2.03 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공공의료기관 현황 (2021.12.31 기준)
Unsupported

REJECTED  UNSUPPORTED 

Missing2
Missing (%)0.9%
Memory size1.9 KiB
Distinct230
Distinct (%)100.0%
Missing2
Missing (%)0.9%
Memory size1.9 KiB
2024-04-22T00:22:39.315396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length9.9695652
Min length4

Characters and Unicode

Total characters2293
Distinct characters171
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique230 ?
Unique (%)100.0%

Sample

1st row의료기관명
2nd row서울대학교병원
3rd row서울적십자병원
4th row서울특별시 동부병원
5th row경찰병원
ValueCountFrequency (%)
근로복지공단 13
 
4.0%
서울특별시 12
 
3.7%
한국보훈복지의료공단 7
 
2.2%
경기도의료원 6
 
1.9%
강원도 6
 
1.9%
전라북도 5
 
1.6%
경상남도립 4
 
1.2%
노인전문병원 3
 
0.9%
충청남도 3
 
0.9%
노인요양병원 3
 
0.9%
Other values (248) 260
80.7%
2024-04-22T00:22:40.269094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
258
 
11.3%
184
 
8.0%
95
 
4.1%
80
 
3.5%
70
 
3.1%
67
 
2.9%
65
 
2.8%
60
 
2.6%
56
 
2.4%
54
 
2.4%
Other values (161) 1304
56.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2187
95.4%
Space Separator 95
 
4.1%
Decimal Number 9
 
0.4%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
258
 
11.8%
184
 
8.4%
80
 
3.7%
70
 
3.2%
67
 
3.1%
65
 
3.0%
60
 
2.7%
56
 
2.6%
54
 
2.5%
54
 
2.5%
Other values (154) 1239
56.7%
Decimal Number
ValueCountFrequency (%)
2 4
44.4%
1 3
33.3%
4 1
 
11.1%
3 1
 
11.1%
Space Separator
ValueCountFrequency (%)
95
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2187
95.4%
Common 106
 
4.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
258
 
11.8%
184
 
8.4%
80
 
3.7%
70
 
3.2%
67
 
3.1%
65
 
3.0%
60
 
2.7%
56
 
2.6%
54
 
2.5%
54
 
2.5%
Other values (154) 1239
56.7%
Common
ValueCountFrequency (%)
95
89.6%
2 4
 
3.8%
1 3
 
2.8%
( 1
 
0.9%
) 1
 
0.9%
4 1
 
0.9%
3 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2187
95.4%
ASCII 106
 
4.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
258
 
11.8%
184
 
8.4%
80
 
3.7%
70
 
3.2%
67
 
3.1%
65
 
3.0%
60
 
2.7%
56
 
2.6%
54
 
2.5%
54
 
2.5%
Other values (154) 1239
56.7%
ASCII
ValueCountFrequency (%)
95
89.6%
2 4
 
3.8%
1 3
 
2.8%
( 1
 
0.9%
) 1
 
0.9%
4 1
 
0.9%
3 1
 
0.9%

Unnamed: 2
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
요양병원
88 
종합병원
60 
병원
48 
상급종합병원
12 
치과병원
 
8
Other values (6)
16 

Length

Max length6
Median length4
Mean length3.6293103
Min length2

Unique

Unique2 ?
Unique (%)0.9%

Sample

1st row<NA>
2nd row요양종별
3rd row<NA>
4th row상급종합병원
5th row종합병원

Common Values

ValueCountFrequency (%)
요양병원 88
37.9%
종합병원 60
25.9%
병원 48
20.7%
상급종합병원 12
 
5.2%
치과병원 8
 
3.4%
의원 6
 
2.6%
정신병원 4
 
1.7%
<NA> 2
 
0.9%
한의원 2
 
0.9%
요양종별 1
 
0.4%

Length

2024-04-22T00:22:40.514938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
요양병원 88
37.9%
종합병원 60
25.9%
병원 48
20.7%
상급종합병원 12
 
5.2%
치과병원 8
 
3.4%
의원 6
 
2.6%
정신병원 4
 
1.7%
na 2
 
0.9%
한의원 2
 
0.9%
요양종별 1
 
0.4%

Unnamed: 3
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
교육부
25 
경상북도
22 
서울특별시
17 
경기도
17 
전라남도
16 
Other values (24)
135 

Length

Max length9
Median length7
Mean length4.1853448
Min length3

Unique

Unique5 ?
Unique (%)2.2%

Sample

1st row<NA>
2nd row관계 행정기관
3rd row<NA>
4th row교육부
5th row대한적십자사

Common Values

ValueCountFrequency (%)
교육부 25
 
10.8%
경상북도 22
 
9.5%
서울특별시 17
 
7.3%
경기도 17
 
7.3%
전라남도 16
 
6.9%
국방부 15
 
6.5%
고용노동부 13
 
5.6%
경상남도 10
 
4.3%
전라북도 10
 
4.3%
보건복지부 10
 
4.3%
Other values (19) 77
33.2%

Length

2024-04-22T00:22:40.746690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교육부 25
 
10.7%
경상북도 22
 
9.4%
서울특별시 17
 
7.3%
경기도 17
 
7.3%
전라남도 16
 
6.9%
국방부 15
 
6.4%
고용노동부 13
 
5.6%
경상남도 10
 
4.3%
전라북도 10
 
4.3%
보건복지부 10
 
4.3%
Other values (20) 78
33.5%
Distinct71
Distinct (%)30.9%
Missing2
Missing (%)0.9%
Memory size1.9 KiB
2024-04-22T00:22:41.400136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length74
Median length49
Mean length24.204348
Min length1

Characters and Unicode

Total characters5567
Distinct characters117
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)19.6%

Sample

1st row근거 법령
2nd row「서울대학교병원 설치법」
3rd row「대한적십자사조직법」
4th row 「의료법」 제33조제2항제2호 및 「정신건강증진 및 정신질환자 복지서비스 지원에 관한 법률」 제21조제1항, 「지방자치법」 제13조
5th row「경찰청과 그 소속기관 직제」 제2조(소속기관)
ValueCountFrequency (%)
109
 
11.5%
관한 68
 
7.2%
법률」 68
 
7.2%
설립 54
 
5.7%
「지방자치법」 47
 
5.0%
운영에 41
 
4.3%
「지방의료원의 40
 
4.2%
「의료법」 40
 
4.2%
설치법」 25
 
2.6%
제33조제2항 24
 
2.5%
Other values (98) 432
45.6%
2024-04-22T00:22:42.349509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
742
 
13.3%
321
 
5.8%
321
 
5.8%
318
 
5.7%
309
 
5.6%
252
 
4.5%
213
 
3.8%
3 202
 
3.6%
172
 
3.1%
1 158
 
2.8%
Other values (107) 2559
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3470
62.3%
Space Separator 742
 
13.3%
Decimal Number 592
 
10.6%
Open Punctuation 346
 
6.2%
Close Punctuation 346
 
6.2%
Other Punctuation 55
 
1.0%
Dash Punctuation 16
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
318
 
9.2%
309
 
8.9%
252
 
7.3%
213
 
6.1%
172
 
5.0%
139
 
4.0%
119
 
3.4%
110
 
3.2%
109
 
3.1%
105
 
3.0%
Other values (89) 1624
46.8%
Decimal Number
ValueCountFrequency (%)
3 202
34.1%
1 158
26.7%
2 94
15.9%
6 50
 
8.4%
4 36
 
6.1%
0 20
 
3.4%
5 15
 
2.5%
7 9
 
1.5%
8 6
 
1.0%
9 2
 
0.3%
Open Punctuation
ValueCountFrequency (%)
321
92.8%
( 25
 
7.2%
Close Punctuation
ValueCountFrequency (%)
321
92.8%
) 25
 
7.2%
Other Punctuation
ValueCountFrequency (%)
, 54
98.2%
· 1
 
1.8%
Space Separator
ValueCountFrequency (%)
742
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3470
62.3%
Common 2097
37.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
318
 
9.2%
309
 
8.9%
252
 
7.3%
213
 
6.1%
172
 
5.0%
139
 
4.0%
119
 
3.4%
110
 
3.2%
109
 
3.1%
105
 
3.0%
Other values (89) 1624
46.8%
Common
ValueCountFrequency (%)
742
35.4%
321
15.3%
321
15.3%
3 202
 
9.6%
1 158
 
7.5%
2 94
 
4.5%
, 54
 
2.6%
6 50
 
2.4%
4 36
 
1.7%
) 25
 
1.2%
Other values (8) 94
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3470
62.3%
ASCII 1454
26.1%
None 643
 
11.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
742
51.0%
3 202
 
13.9%
1 158
 
10.9%
2 94
 
6.5%
, 54
 
3.7%
6 50
 
3.4%
4 36
 
2.5%
) 25
 
1.7%
( 25
 
1.7%
0 20
 
1.4%
Other values (5) 48
 
3.3%
None
ValueCountFrequency (%)
321
49.9%
321
49.9%
· 1
 
0.2%
Hangul
ValueCountFrequency (%)
318
 
9.2%
309
 
8.9%
252
 
7.3%
213
 
6.1%
172
 
5.0%
139
 
4.0%
119
 
3.4%
110
 
3.2%
109
 
3.1%
105
 
3.0%
Other values (89) 1624
46.8%

Unnamed: 5
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
특수법인
92 
시도립
47 
시군구립
36 
국립
26 
군립
 
7
Other values (10)
24 

Length

Max length9
Median length4
Mean length3.5387931
Min length2

Unique

Unique6 ?
Unique (%)2.6%

Sample

1st row<NA>
2nd row설립형태
3rd row<NA>
4th row특수법인
5th row특수법인

Common Values

ValueCountFrequency (%)
특수법인 92
39.7%
시도립 47
20.3%
시군구립 36
 
15.5%
국립 26
 
11.2%
군립 7
 
3.0%
도립 6
 
2.6%
공립(시군구립) 5
 
2.2%
시립 5
 
2.2%
<NA> 2
 
0.9%
설립형태 1
 
0.4%
Other values (5) 5
 
2.2%

Length

2024-04-22T00:22:42.585913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
특수법인 92
39.5%
시도립 47
20.2%
시군구립 36
 
15.5%
국립 27
 
11.6%
군립 7
 
3.0%
도립 6
 
2.6%
공립(시군구립 5
 
2.1%
시립 5
 
2.1%
na 2
 
0.9%
설립형태 1
 
0.4%
Other values (5) 5
 
2.1%

Unnamed: 6
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct9
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
N
154 
지역응급의료기관
44 
지역응급의료센터
20 
권역응급의료센터
 
6
권역응급의료센터/권역외상센터
 
3
Other values (4)
 
5

Length

Max length15
Median length1
Mean length3.4224138
Min length1

Unique

Unique3 ?
Unique (%)1.3%

Sample

1st row<NA>
2nd row응급지정
3rd row<NA>
4th row권역응급의료센터
5th row지역응급의료기관

Common Values

ValueCountFrequency (%)
N 154
66.4%
지역응급의료기관 44
 
19.0%
지역응급의료센터 20
 
8.6%
권역응급의료센터 6
 
2.6%
권역응급의료센터/권역외상센터 3
 
1.3%
<NA> 2
 
0.9%
응급지정 1
 
0.4%
중앙응급의료센터 1
 
0.4%
지역응급의료센터/권역외상센터 1
 
0.4%

Length

2024-04-22T00:22:42.817201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T00:22:43.021009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
n 154
66.4%
지역응급의료기관 44
 
19.0%
지역응급의료센터 20
 
8.6%
권역응급의료센터 6
 
2.6%
권역응급의료센터/권역외상센터 3
 
1.3%
na 2
 
0.9%
응급지정 1
 
0.4%
중앙응급의료센터 1
 
0.4%
지역응급의료센터/권역외상센터 1
 
0.4%

Unnamed: 7
Unsupported

REJECTED  UNSUPPORTED 

Missing2
Missing (%)0.9%
Memory size1.9 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10
Missing (%)4.3%
Memory size1.9 KiB
Distinct222
Distinct (%)96.5%
Missing2
Missing (%)0.9%
Memory size1.9 KiB
2024-04-22T00:22:44.487326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length30
Mean length21.030435
Min length2

Characters and Unicode

Total characters4837
Distinct characters268
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique214 ?
Unique (%)93.0%

Sample

1st row주소
2nd row서울특별시 종로구 대학로101 서울대학교병원
3rd row서울특별시 종로구 새문안로 9
4th row서울특별시 동대문구 무학로 124
5th row서울특별시 송파구 송이로 123
ValueCountFrequency (%)
경기도 31
 
2.9%
경상북도 26
 
2.4%
서울특별시 23
 
2.2%
전라남도 22
 
2.1%
강원도 18
 
1.7%
경상남도 18
 
1.7%
충청북도 11
 
1.0%
충청남도 10
 
0.9%
전라북도 10
 
0.9%
대구광역시 9
 
0.8%
Other values (678) 890
83.3%
2024-04-22T00:22:46.020565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
844
 
17.4%
200
 
4.1%
188
 
3.9%
164
 
3.4%
1 160
 
3.3%
117
 
2.4%
2 99
 
2.0%
94
 
1.9%
89
 
1.8%
89
 
1.8%
Other values (258) 2793
57.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3044
62.9%
Space Separator 844
 
17.4%
Decimal Number 777
 
16.1%
Close Punctuation 60
 
1.2%
Open Punctuation 60
 
1.2%
Dash Punctuation 42
 
0.9%
Other Punctuation 8
 
0.2%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
200
 
6.6%
188
 
6.2%
164
 
5.4%
117
 
3.8%
94
 
3.1%
89
 
2.9%
89
 
2.9%
64
 
2.1%
62
 
2.0%
60
 
2.0%
Other values (241) 1917
63.0%
Decimal Number
ValueCountFrequency (%)
1 160
20.6%
2 99
12.7%
3 88
11.3%
0 76
9.8%
5 74
9.5%
4 71
9.1%
7 67
8.6%
6 52
 
6.7%
9 46
 
5.9%
8 44
 
5.7%
Other Punctuation
ValueCountFrequency (%)
, 7
87.5%
. 1
 
12.5%
Space Separator
ValueCountFrequency (%)
844
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3044
62.9%
Common 1793
37.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
200
 
6.6%
188
 
6.2%
164
 
5.4%
117
 
3.8%
94
 
3.1%
89
 
2.9%
89
 
2.9%
64
 
2.1%
62
 
2.0%
60
 
2.0%
Other values (241) 1917
63.0%
Common
ValueCountFrequency (%)
844
47.1%
1 160
 
8.9%
2 99
 
5.5%
3 88
 
4.9%
0 76
 
4.2%
5 74
 
4.1%
4 71
 
4.0%
7 67
 
3.7%
) 60
 
3.3%
( 60
 
3.3%
Other values (7) 194
 
10.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3044
62.9%
ASCII 1793
37.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
844
47.1%
1 160
 
8.9%
2 99
 
5.5%
3 88
 
4.9%
0 76
 
4.2%
5 74
 
4.1%
4 71
 
4.0%
7 67
 
3.7%
) 60
 
3.3%
( 60
 
3.3%
Other values (7) 194
 
10.8%
Hangul
ValueCountFrequency (%)
200
 
6.6%
188
 
6.2%
164
 
5.4%
117
 
3.8%
94
 
3.1%
89
 
2.9%
89
 
2.9%
64
 
2.1%
62
 
2.0%
60
 
2.0%
Other values (241) 1917
63.0%

Unnamed: 10
Text

MISSING 

Distinct200
Distinct (%)90.5%
Missing11
Missing (%)4.7%
Memory size1.9 KiB
2024-04-22T00:22:46.597539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length31
Mean length19.429864
Min length2

Characters and Unicode

Total characters4294
Distinct characters62
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique195 ?
Unique (%)88.2%

Sample

1st row홈페이지
2nd rowwww.snuh.org
3rd rowwww.rch.or.kr/seoul
4th rowwww.dbhosp.go.kr
5th rowwww.nph.go.kr
ValueCountFrequency (%)
없음 18
 
8.1%
http://www.icmc.or.kr 2
 
0.9%
www.pmc.or.kr 2
 
0.9%
http://www.jejumed.com 2
 
0.9%
www.umc.or.kr 2
 
0.9%
http://www.sshp.co.kr 1
 
0.5%
http://jsmc.or.kr 1
 
0.5%
http://www.kgh.co.kr 1
 
0.5%
www.hp8119.co.kr 1
 
0.5%
www.gcmh.or.kr 1
 
0.5%
Other values (190) 190
86.0%
2024-04-22T00:22:47.409992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 552
12.9%
w 538
12.5%
r 307
 
7.1%
o 293
 
6.8%
/ 288
 
6.7%
t 247
 
5.8%
h 242
 
5.6%
c 198
 
4.6%
k 193
 
4.5%
p 158
 
3.7%
Other values (52) 1278
29.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3246
75.6%
Other Punctuation 948
 
22.1%
Other Letter 58
 
1.4%
Decimal Number 21
 
0.5%
Connector Punctuation 10
 
0.2%
Uppercase Letter 6
 
0.1%
Space Separator 3
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 538
16.6%
r 307
 
9.5%
o 293
 
9.0%
t 247
 
7.6%
h 242
 
7.5%
c 198
 
6.1%
k 193
 
5.9%
p 158
 
4.9%
n 144
 
4.4%
m 129
 
4.0%
Other values (14) 797
24.6%
Other Letter
ValueCountFrequency (%)
18
31.0%
18
31.0%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
Other values (12) 12
20.7%
Decimal Number
ValueCountFrequency (%)
0 5
23.8%
3 4
19.0%
8 3
14.3%
1 3
14.3%
9 3
14.3%
4 2
 
9.5%
6 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
. 552
58.2%
/ 288
30.4%
: 106
 
11.2%
; 1
 
0.1%
, 1
 
0.1%
Connector Punctuation
ValueCountFrequency (%)
_ 10
100.0%
Uppercase Letter
ValueCountFrequency (%)
W 6
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3252
75.7%
Common 984
 
22.9%
Hangul 58
 
1.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 538
16.5%
r 307
 
9.4%
o 293
 
9.0%
t 247
 
7.6%
h 242
 
7.4%
c 198
 
6.1%
k 193
 
5.9%
p 158
 
4.9%
n 144
 
4.4%
m 129
 
4.0%
Other values (15) 803
24.7%
Hangul
ValueCountFrequency (%)
18
31.0%
18
31.0%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
Other values (12) 12
20.7%
Common
ValueCountFrequency (%)
. 552
56.1%
/ 288
29.3%
: 106
 
10.8%
_ 10
 
1.0%
0 5
 
0.5%
3 4
 
0.4%
8 3
 
0.3%
1 3
 
0.3%
9 3
 
0.3%
3
 
0.3%
Other values (5) 7
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4236
98.6%
Hangul 58
 
1.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 552
13.0%
w 538
12.7%
r 307
 
7.2%
o 293
 
6.9%
/ 288
 
6.8%
t 247
 
5.8%
h 242
 
5.7%
c 198
 
4.7%
k 193
 
4.6%
p 158
 
3.7%
Other values (30) 1220
28.8%
Hangul
ValueCountFrequency (%)
18
31.0%
18
31.0%
2
 
3.4%
2
 
3.4%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
1
 
1.7%
Other values (12) 12
20.7%

Unnamed: 11
Text

MISSING 

Distinct216
Distinct (%)97.7%
Missing11
Missing (%)4.7%
Memory size1.9 KiB
2024-04-22T00:22:48.312148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.669683
Min length4

Characters and Unicode

Total characters2579
Distinct characters18
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique211 ?
Unique (%)95.5%

Sample

1st row대표전화
2nd row1588-5700
3rd row02-2002-8000
4th row02-920-9114
5th row02-3400-1114
ValueCountFrequency (%)
054-785-7000 2
 
0.9%
032-899-4000 2
 
0.9%
064-720-2222 2
 
0.9%
1577-7877 2
 
0.9%
054-247-0551 2
 
0.9%
062-613-9000 1
 
0.5%
061)542-3004 1
 
0.5%
061)453-7007 1
 
0.5%
061)353-7500 1
 
0.5%
061)860-7777 1
 
0.5%
Other values (206) 206
93.2%
2024-04-22T00:22:49.411377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 577
22.4%
- 396
15.4%
1 291
11.3%
5 218
 
8.5%
3 203
 
7.9%
2 185
 
7.2%
4 175
 
6.8%
6 161
 
6.2%
7 135
 
5.2%
8 111
 
4.3%
Other values (8) 127
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2154
83.5%
Dash Punctuation 396
 
15.4%
Close Punctuation 22
 
0.9%
Other Letter 4
 
0.2%
Math Symbol 2
 
0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 577
26.8%
1 291
13.5%
5 218
 
10.1%
3 203
 
9.4%
2 185
 
8.6%
4 175
 
8.1%
6 161
 
7.5%
7 135
 
6.3%
8 111
 
5.2%
9 98
 
4.5%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 396
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2575
99.8%
Hangul 4
 
0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 577
22.4%
- 396
15.4%
1 291
11.3%
5 218
 
8.5%
3 203
 
7.9%
2 185
 
7.2%
4 175
 
6.8%
6 161
 
6.3%
7 135
 
5.2%
8 111
 
4.3%
Other values (4) 123
 
4.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2575
99.8%
Hangul 4
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 577
22.4%
- 396
15.4%
1 291
11.3%
5 218
 
8.5%
3 203
 
7.9%
2 185
 
7.2%
4 175
 
6.8%
6 161
 
6.3%
7 135
 
5.2%
8 111
 
4.3%
Other values (4) 123
 
4.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 12
Text

MISSING 

Distinct217
Distinct (%)98.2%
Missing11
Missing (%)4.7%
Memory size1.9 KiB
2024-04-22T00:22:50.160635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.723982
Min length1

Characters and Unicode

Total characters2591
Distinct characters18
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique214 ?
Unique (%)96.8%

Sample

1st rowFAX
2nd row-
3rd row02-738-5664
4th row02-920-9219
5th row02-3400-1573
ValueCountFrequency (%)
3
 
1.4%
054)247-0559 2
 
0.9%
064-724-2103 2
 
0.9%
064-757-8276 1
 
0.5%
061)752-7683 1
 
0.5%
061)434-4736 1
 
0.5%
061)363-6020 1
 
0.5%
061-840-0517 1
 
0.5%
061)324-1763 1
 
0.5%
062-573-2206 1
 
0.5%
Other values (207) 207
93.7%
2024-04-22T00:22:51.108888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 439
16.9%
- 395
15.2%
3 273
10.5%
5 247
9.5%
2 226
8.7%
1 198
7.6%
4 184
7.1%
6 172
 
6.6%
9 147
 
5.7%
7 144
 
5.6%
Other values (8) 166
 
6.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2150
83.0%
Dash Punctuation 395
 
15.2%
Close Punctuation 40
 
1.5%
Uppercase Letter 3
 
0.1%
Other Letter 2
 
0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 439
20.4%
3 273
12.7%
5 247
11.5%
2 226
10.5%
1 198
9.2%
4 184
8.6%
6 172
 
8.0%
9 147
 
6.8%
7 144
 
6.7%
8 120
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
F 1
33.3%
A 1
33.3%
X 1
33.3%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 395
100.0%
Close Punctuation
ValueCountFrequency (%)
) 40
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2586
99.8%
Latin 3
 
0.1%
Hangul 2
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 439
17.0%
- 395
15.3%
3 273
10.6%
5 247
9.6%
2 226
8.7%
1 198
7.7%
4 184
7.1%
6 172
 
6.7%
9 147
 
5.7%
7 144
 
5.6%
Other values (3) 161
 
6.2%
Latin
ValueCountFrequency (%)
F 1
33.3%
A 1
33.3%
X 1
33.3%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2589
99.9%
Hangul 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 439
17.0%
- 395
15.3%
3 273
10.5%
5 247
9.5%
2 226
8.7%
1 198
7.6%
4 184
7.1%
6 172
 
6.6%
9 147
 
5.7%
7 144
 
5.6%
Other values (6) 164
 
6.3%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Correlations

2024-04-22T00:22:51.269507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6
Unnamed: 21.0000.8230.8850.8340.830
Unnamed: 30.8231.0000.9900.9300.741
Unnamed: 40.8850.9901.0000.9850.865
Unnamed: 50.8340.9300.9851.0000.717
Unnamed: 60.8300.7410.8650.7171.000
2024-04-22T00:22:51.460670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 6Unnamed: 3Unnamed: 2Unnamed: 5
Unnamed: 61.0000.3780.5910.412
Unnamed: 30.3781.0000.4510.529
Unnamed: 20.5910.4511.0000.529
Unnamed: 50.4120.5290.5291.000
2024-04-22T00:22:51.734007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 2Unnamed: 3Unnamed: 5Unnamed: 6
Unnamed: 21.0000.4510.5290.591
Unnamed: 30.4511.0000.5290.378
Unnamed: 50.5290.5291.0000.412
Unnamed: 60.5910.3780.4121.000

Missing values

2024-04-22T00:22:37.818215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-22T00:22:38.127871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-22T00:22:38.435888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

공공의료기관 현황 (2021.12.31 기준)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12
0NaN<NA><NA><NA><NA><NA><NA>NaNNaN<NA><NA><NA><NA>
1연번의료기관명요양종별관계 행정기관근거 법령설립형태응급지정허가병상수소재지<NA>홈페이지대표전화FAX
2NaN<NA><NA><NA><NA><NA><NA>NaN우편번호주소<NA><NA><NA>
31서울대학교병원상급종합병원교육부「서울대학교병원 설치법」특수법인권역응급의료센터179303080서울특별시 종로구 대학로101 서울대학교병원www.snuh.org1588-5700-
42서울적십자병원종합병원대한적십자사「대한적십자사조직법」특수법인지역응급의료기관29203181서울특별시 종로구 새문안로 9www.rch.or.kr/seoul02-2002-800002-738-5664
53서울특별시 동부병원종합병원서울특별시「의료법」 제33조제2항제2호 및 「정신건강증진 및 정신질환자 복지서비스 지원에 관한 법률」 제21조제1항, 「지방자치법」 제13조시도립지역응급의료기관20102584서울특별시 동대문구 무학로 124www.dbhosp.go.kr02-920-911402-920-9219
64경찰병원종합병원경찰청「경찰청과 그 소속기관 직제」 제2조(소속기관)국립지역응급의료기관37905715서울특별시 송파구 송이로 123www.nph.go.kr02-3400-111402-3400-1573
75한국보훈복지의료공단 중앙보훈병원종합병원국가보훈처「한국보훈복지의료공단법」 제7조특수법인지역응급의료센터99505368서울특별시 강동구 진황도로 61길 53http://seoul.bohun.or.kr02-2225-111402-484-9649
86서울특별시 보라매병원종합병원서울특별시「의료법」 제33조제2항제2호 및 「정신건강증진 및 정신질환자 복지서비스 지원에 관한 법률」 제21조제1항, 「지방자치법」 제13조시도립지역응급의료센터76507061서울특별시 동작구 보라매로5길 20http://www.brmh.org02-870-211402-831-0206
97한국원자력의학원 원자력병원종합병원과학기술정보통신부「방사선 및 방사성동위원소 이용진흥법」 제13조의2특수법인지역응급의료기관47801812서울시 노원구 노원로 75www.kirams.re.kr02-970-211402-978-2005
공공의료기관 현황 (2021.12.31 기준)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12
222220국립부곡병원병원보건복지부「보건복지부와 그 소속기관 직제」 제2조(소속기관)국립N40050365경상남도 창녕군 부곡면 부곡로 145www.bgnmh.go.kr055-536-6440055-536-6444
223221부산대학교치과병원치과병원교육부「국립대학치과병원 설치법」특수법인N3650612경상남도 양산시 물금읍 금오로 20http://www.pnudh.co.kr055-360-5114055-360-5029
224222부산대학교한방병원한방병원교육부「국립대학병원 설치법」특수법인N11550612경남 양산시 물금읍 금오로 20http://www.pnukh.or.kr055-360-5555055-360-5509
225223제주특별자치도 서귀포의료원종합병원제주특별자치도「지방의료원의 설립 및 운영에 관한 법률」특수법인지역응급의료센터29163585제주특별자치도 서귀포시 장수로 47(동홍동)http://jjsmc.or.kr064)730-3100064)760-0009
226224제주대학교병원종합병원교육부「국립대학병원 설치법」특수법인지역응급의료센터65863241제주특별자치도 아란13길 15(아라일동)http://www.jejunuh.co.kr064-717-1114064-757-8276
227225제주특별자치도 제주의료원병원제주특별자치도「지방의료원의 설립 및 운영에 관한 법률」특수법인N20063243제주특별자치도 제주시 산천단남길 10http://www.jejumed.com/064-720-2222064-724-2103
228226제주권역재활병원병원제주특별자치도「장애인복지법」제18조 및 「의료법」제33조시도립N13263590제주특별자치도 서귀포시 동문로 1http://www.jrh.or.kr064)730-9000064)762-0027
229227제주의료원 부속요양병원요양병원제주특별자치도「지방의료원의 설립 및 운영에 관한 법률」특수법인N14863243제주특별자치도 제주시 산천단남길 10http://www.jejumed.com/064-720-2222064-724-2103
230228성남시노인보건센터의원의원서울특별시-공립(시군구립)N0NaN경기도 성남시 중원구 금상로 137, 1층 (상대원동)<NA><NA><NA>
231229경기도립 안성휴게소의원의원경기도-시도립N0NaN경기도 안성시 원곡면 경부고속도로 372<NA><NA><NA>