Overview

Dataset statistics

Number of variables: 3
Number of observations: 76
Missing cells: 2
Missing cells (%): 0.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.9 KiB
Average record size in memory: 25.7 B

Variable types

Text: 3

Alerts

병원급 의료기관 has 1 (1.3%) missing value (Missing)
Unnamed: 1 has 1 (1.3%) missing value (Missing)

Reproduction

Analysis started: 2024-03-14 00:33:08.308273
Analysis finished: 2024-03-14 00:33:08.670016
Duration: 0.36 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

병원급 의료기관 (hospital-level medical institutions)
Text

MISSING

Distinct: 75
Distinct (%): 100.0%
Missing: 1
Missing (%): 1.3%
Memory size: 740.0 B

Length

Max length: 17
Median length: 14
Mean length: 7.9066667
Min length: 3

Characters and Unicode

Total characters: 593
Distinct characters: 122
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
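
As a rough illustration of how such per-character properties can be obtained, the sketch below counts Unicode general categories for one address value from this dataset using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the profiler may compute these figures differently. The standard library exposes general categories only, not scripts or blocks.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count Unicode general-category codes (e.g. 'Lo', 'Zs', 'Nd') in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# One address value taken from the "Unnamed: 1" column of this report.
sample = "전주시 완산구 백제대로 74 (삼천동1가)"
counts = category_counts(sample)

# Print categories from most to least frequent.
for code, n in counts.most_common():
    print(code, n)
```

In the report's terminology, `Lo` corresponds to Other Letter, `Zs` to Space Separator, `Nd` to Decimal Number, and `Ps`/`Pe` to Open/Close Punctuation.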

Unique

Unique: 75
Unique (%): 100.0%

Sample

1st row: 병원명 (hospital name)
2nd row: 다사랑병원
3rd row: 다솔아동병원
4th row: 다은병원
5th row: 대자인병원

Value | Count | Frequency (%)
의료법인 | 3 | 3.4%
희망병원 | 2 | 2.3%
백제병원 | 1 | 1.1%
남원병원 | 1 | 1.1%
정읍박병원 | 1 | 1.1%
북면필병원 | 1 | 1.1%
의료법인한국필의료재단 | 1 | 1.1%
전라병원 | 1 | 1.1%
의료법인평화의료재단 | 1 | 1.1%
참조은병원 | 1 | 1.1%
Other values (74) | 74 | 85.1%

Most occurring characters

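Value | Count | Frequency (%)
(Hangul character lost in extraction) | 78 | 13.2%
(Hangul character lost in extraction) | 75 | 12.6%
(non-ASCII space) | 70 | 11.8%
(Hangul character lost in extraction) | 24 | 4.0%
(Hangul character lost in extraction) | 21 | 3.5%
(Hangul character lost in extraction) | 15 | 2.5%
(Hangul character lost in extraction) | 13 | 2.2%
(character lost in extraction) | 12 | 2.0%
(character lost in extraction) | 12 | 2.0%
(character lost in extraction) | 12 | 2.0%
Other values (112) | 261 | 44.0%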

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 507 | 85.5%
Space Separator | 82 | 13.8%
Decimal Number | 4 | 0.7%

Most frequent character per category

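Other Letter
Value | Count | Frequency (%)
(Hangul character lost in extraction) | 78 | 15.4%
(Hangul character lost in extraction) | 75 | 14.8%
(Hangul character lost in extraction) | 24 | 4.7%
(Hangul character lost in extraction) | 21 | 4.1%
(Hangul character lost in extraction) | 15 | 3.0%
(Hangul character lost in extraction) | 13 | 2.6%
(Hangul character lost in extraction) | 12 | 2.4%
(Hangul character lost in extraction) | 12 | 2.4%
(Hangul character lost in extraction) | 11 | 2.2%
(Hangul character lost in extraction) | 9 | 1.8%
Other values (108) | 237 | 46.7%

Space Separator
Value | Count | Frequency (%)
(non-ASCII space) | 70 | 85.4%
(ASCII space) | 12 | 14.6%

Decimal Number
Value | Count | Frequency (%)
1 | 2 | 50.0%
2 | 2 | 50.0%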

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 507 | 85.5%
Common | 86 | 14.5%

Most frequent character per script

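Hangul
Value | Count | Frequency (%)
(Hangul character lost in extraction) | 78 | 15.4%
(Hangul character lost in extraction) | 75 | 14.8%
(Hangul character lost in extraction) | 24 | 4.7%
(Hangul character lost in extraction) | 21 | 4.1%
(Hangul character lost in extraction) | 15 | 3.0%
(Hangul character lost in extraction) | 13 | 2.6%
(Hangul character lost in extraction) | 12 | 2.4%
(Hangul character lost in extraction) | 12 | 2.4%
(Hangul character lost in extraction) | 11 | 2.2%
(Hangul character lost in extraction) | 9 | 1.8%
Other values (108) | 237 | 46.7%

Common
Value | Count | Frequency (%)
(non-ASCII space) | 70 | 81.4%
(ASCII space) | 12 | 14.0%
1 | 2 | 2.3%
2 | 2 | 2.3%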

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 507 | 85.5%
None | 70 | 11.8%
ASCII | 16 | 2.7%

Most frequent character per block

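Hangul
Value | Count | Frequency (%)
(Hangul character lost in extraction) | 78 | 15.4%
(Hangul character lost in extraction) | 75 | 14.8%
(Hangul character lost in extraction) | 24 | 4.7%
(Hangul character lost in extraction) | 21 | 4.1%
(Hangul character lost in extraction) | 15 | 3.0%
(Hangul character lost in extraction) | 13 | 2.6%
(Hangul character lost in extraction) | 12 | 2.4%
(Hangul character lost in extraction) | 12 | 2.4%
(Hangul character lost in extraction) | 11 | 2.2%
(Hangul character lost in extraction) | 9 | 1.8%
Other values (108) | 237 | 46.7%

None
Value | Count | Frequency (%)
(non-ASCII space) | 70 | 100.0%

ASCII
Value | Count | Frequency (%)
(ASCII space) | 12 | 75.0%
1 | 2 | 12.5%
2 | 2 | 12.5%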

Unnamed: 1
Text

MISSING 

Distinct: 73
Distinct (%): 97.3%
Missing: 1
Missing (%): 1.3%
Memory size: 740.0 B

Length

Max length: 39
Median length: 25
Mean length: 21.773333
Min length: 2

Characters and Unicode

Total characters: 1633
Distinct characters: 142
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 71
Unique (%): 94.7%

Sample

1st row: 주소 (address)
2nd row: 전주시 완산구 백제대로 74 (삼천동1가)
3rd row: 전주시 완산구 우전로 250 (효자동2가)
4th row: 전주시 완산구 세내로 277 (효자동3가)
5th row: 전주시 덕진구 견훤로 390 (우아동3가)

Value | Count | Frequency (%)
전주시 | 36 | 10.3%
완산구 | 25 | 7.1%
덕진구 | 11 | 3.1%
김제시 | 8 | 2.3%
부안읍 | 7 | 2.0%
군산시 | 7 | 2.0%
백제대로 | 7 | 2.0%
익산시 | 6 | 1.7%
정읍시 | 6 | 1.7%
금산면 | 4 | 1.1%
Other values (179) | 234 | 66.7%

Most occurring characters

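Value | Count | Frequency (%)
(ASCII space) | 276 | 16.9%
( | 70 | 4.3%
) | 70 | 4.3%
(Hangul character lost in extraction) | 65 | 4.0%
(Hangul character lost in extraction) | 62 | 3.8%
(Hangul character lost in extraction) | 60 | 3.7%
1 | 57 | 3.5%
(Hangul character lost in extraction) | 54 | 3.3%
(Hangul character lost in extraction) | 43 | 2.6%
(Hangul character lost in extraction) | 41 | 2.5%
Other values (132) | 835 | 51.1%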

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 901 | 55.2%
Space Separator | 312 | 19.1%
Decimal Number | 265 | 16.2%
Open Punctuation | 70 | 4.3%
Close Punctuation | 70 | 4.3%
Dash Punctuation | 13 | 0.8%
Other Punctuation | 2 | 0.1%

Most frequent character per category

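Other Letter
Value | Count | Frequency (%)
(Hangul character lost in extraction) | 65 | 7.2%
(Hangul character lost in extraction) | 62 | 6.9%
(Hangul character lost in extraction) | 60 | 6.7%
(Hangul character lost in extraction) | 54 | 6.0%
(Hangul character lost in extraction) | 43 | 4.8%
(Hangul character lost in extraction) | 41 | 4.6%
(Hangul character lost in extraction) | 37 | 4.1%
(Hangul character lost in extraction) | 34 | 3.8%
(Hangul character lost in extraction) | 30 | 3.3%
(Hangul character lost in extraction) | 22 | 2.4%
Other values (116) | 453 | 50.3%

Decimal Number
Value | Count | Frequency (%)
1 | 57 | 21.5%
4 | 33 | 12.5%
2 | 32 | 12.1%
3 | 32 | 12.1%
5 | 23 | 8.7%
0 | 22 | 8.3%
7 | 20 | 7.5%
9 | 16 | 6.0%
6 | 16 | 6.0%
8 | 14 | 5.3%

Space Separator
Value | Count | Frequency (%)
(ASCII space) | 276 | 88.5%
(non-ASCII space) | 36 | 11.5%

Open Punctuation
Value | Count | Frequency (%)
( | 70 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 70 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 13 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 2 | 100.0%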

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 901 | 55.2%
Common | 732 | 44.8%

Most frequent character per script

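Hangul
Value | Count | Frequency (%)
(Hangul character lost in extraction) | 65 | 7.2%
(Hangul character lost in extraction) | 62 | 6.9%
(Hangul character lost in extraction) | 60 | 6.7%
(Hangul character lost in extraction) | 54 | 6.0%
(Hangul character lost in extraction) | 43 | 4.8%
(Hangul character lost in extraction) | 41 | 4.6%
(Hangul character lost in extraction) | 37 | 4.1%
(Hangul character lost in extraction) | 34 | 3.8%
(Hangul character lost in extraction) | 30 | 3.3%
(Hangul character lost in extraction) | 22 | 2.4%
Other values (116) | 453 | 50.3%

Common
Value | Count | Frequency (%)
(ASCII space) | 276 | 37.7%
( | 70 | 9.6%
) | 70 | 9.6%
1 | 57 | 7.8%
(non-ASCII space) | 36 | 4.9%
4 | 33 | 4.5%
2 | 32 | 4.4%
3 | 32 | 4.4%
5 | 23 | 3.1%
0 | 22 | 3.0%
Other values (6) | 81 | 11.1%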

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 901 | 55.2%
ASCII | 696 | 42.6%
None | 36 | 2.2%

Most frequent character per block

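ASCII
Value | Count | Frequency (%)
(ASCII space) | 276 | 39.7%
( | 70 | 10.1%
) | 70 | 10.1%
1 | 57 | 8.2%
4 | 33 | 4.7%
2 | 32 | 4.6%
3 | 32 | 4.6%
5 | 23 | 3.3%
0 | 22 | 3.2%
7 | 20 | 2.9%
Other values (5) | 61 | 8.8%

Hangul
Value | Count | Frequency (%)
(Hangul character lost in extraction) | 65 | 7.2%
(Hangul character lost in extraction) | 62 | 6.9%
(Hangul character lost in extraction) | 60 | 6.7%
(Hangul character lost in extraction) | 54 | 6.0%
(Hangul character lost in extraction) | 43 | 4.8%
(Hangul character lost in extraction) | 41 | 4.6%
(Hangul character lost in extraction) | 37 | 4.1%
(Hangul character lost in extraction) | 34 | 3.8%
(Hangul character lost in extraction) | 30 | 3.3%
(Hangul character lost in extraction) | 22 | 2.4%
Other values (116) | 453 | 50.3%

None
Value | Count | Frequency (%)
(non-ASCII space) | 36 | 100.0%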

Unnamed: 2
Text

Distinct: 75
Distinct (%): 98.7%
Missing: 0
Missing (%): 0.0%
Memory size: 740.0 B

Length

Max length: 14
Median length: 13
Mean length: 12.828947
Min length: 4

Characters and Unicode

Total characters: 975
Distinct characters: 19
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 74
Unique (%): 97.4%

Sample

1st row: 2015. 2
2nd row: 전화번호 (phone number)
3rd row: 063-228-5540
4th row: 063-280-0800
5th row: 063-239-0114

Value | Count | Frequency (%)
063-545-8383 | 2 | 2.6%
063-840-5000 | 1 | 1.3%
063-856-5522 | 1 | 1.3%
063-538-0321 | 1 | 1.3%
063-530-7100 | 1 | 1.3%
063-538-9730 | 1 | 1.3%
063-571-0845 | 1 | 1.3%
063-861-2700 | 1 | 1.3%
063-840-2305 | 1 | 1.3%
063-530-3130 | 1 | 1.3%
Other values (66) | 66 | 85.7%

Most occurring characters

Value | Count | Frequency (%)
0 | 218 | 22.4%
- | 149 | 15.3%
3 | 120 | 12.3%
6 | 99 | 10.2%
(non-ASCII space) | 72 | 7.4%
2 | 69 | 7.1%
5 | 63 | 6.5%
1 | 49 | 5.0%
8 | 46 | 4.7%
4 | 37 | 3.8%
Other values (9) | 53 | 5.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 747 | 76.6%
Dash Punctuation | 149 | 15.3%
Space Separator | 73 | 7.5%
Other Letter | 4 | 0.4%
Other Punctuation | 1 | 0.1%
Math Symbol | 1 | 0.1%

Most frequent character per category

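Decimal Number
Value | Count | Frequency (%)
0 | 218 | 29.2%
3 | 120 | 16.1%
6 | 99 | 13.3%
2 | 69 | 9.2%
5 | 63 | 8.4%
1 | 49 | 6.6%
8 | 46 | 6.2%
4 | 37 | 5.0%
7 | 25 | 3.3%
9 | 21 | 2.8%

Other Letter
Value | Count | Frequency (%)
(Hangul character lost in extraction) | 1 | 25.0%
(Hangul character lost in extraction) | 1 | 25.0%
(Hangul character lost in extraction) | 1 | 25.0%
(Hangul character lost in extraction) | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(non-ASCII space) | 72 | 98.6%
(ASCII space) | 1 | 1.4%

Dash Punctuation
Value | Count | Frequency (%)
- | 149 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%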

Most occurring scripts

Value | Count | Frequency (%)
Common | 971 | 99.6%
Hangul | 4 | 0.4%

Most frequent character per script

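Common
Value | Count | Frequency (%)
0 | 218 | 22.5%
- | 149 | 15.3%
3 | 120 | 12.4%
6 | 99 | 10.2%
(non-ASCII space) | 72 | 7.4%
2 | 69 | 7.1%
5 | 63 | 6.5%
1 | 49 | 5.0%
8 | 46 | 4.7%
4 | 37 | 3.8%
Other values (5) | 49 | 5.0%

Hangul
Value | Count | Frequency (%)
(Hangul character lost in extraction) | 1 | 25.0%
(Hangul character lost in extraction) | 1 | 25.0%
(Hangul character lost in extraction) | 1 | 25.0%
(Hangul character lost in extraction) | 1 | 25.0%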

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 899 | 92.2%
None | 72 | 7.4%
Hangul | 4 | 0.4%

Most frequent character per block

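ASCII
Value | Count | Frequency (%)
0 | 218 | 24.2%
- | 149 | 16.6%
3 | 120 | 13.3%
6 | 99 | 11.0%
2 | 69 | 7.7%
5 | 63 | 7.0%
1 | 49 | 5.5%
8 | 46 | 5.1%
4 | 37 | 4.1%
7 | 25 | 2.8%
Other values (4) | 24 | 2.7%

None
Value | Count | Frequency (%)
(non-ASCII space) | 72 | 100.0%

Hangul
Value | Count | Frequency (%)
(Hangul character lost in extraction) | 1 | 25.0%
(Hangul character lost in extraction) | 1 | 25.0%
(Hangul character lost in extraction) | 1 | 25.0%
(Hangul character lost in extraction) | 1 | 25.0%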

Correlations

 | 병원급 의료기관 | Unnamed: 1 | Unnamed: 2
병원급 의료기관 | 1.000 | 1.000 | 1.000
Unnamed: 1 | 1.000 | 1.000 | 1.000
Unnamed: 2 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
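
A minimal sketch of the idea, assuming nullity correlation is the Pearson correlation between per-column missingness indicators (1 where a cell is missing, 0 otherwise). The helpers `pearson` and `nullity` are hypothetical names, the toy columns reuse only the first four rows of the sample, and the profiler's own computation may differ in detail.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return None  # undefined when a column is never (or always) missing
    return cov / sqrt(vx * vy)


def nullity(column):
    """Map a column to 1 (missing) / 0 (present) indicators."""
    return [1 if value is None else 0 for value in column]


# Toy columns mirroring the first four sample rows; None marks a missing cell.
names = [None, "병원명", "다사랑병원", "다솔아동병원"]
addresses = [None, "주소", "전주시 완산구 백제대로 74 (삼천동1가)",
             "전주시 완산구 우전로 250 (효자동2가)"]
phones = ["2015. 2", "전화번호", "063-228-5540", "063-280-0800"]

# Both columns are missing in exactly the same row, so they correlate perfectly.
r_names_addresses = pearson(nullity(names), nullity(addresses))
# The phone column has no missing cells, so its nullity correlation is undefined.
r_names_phones = pearson(nullity(names), nullity(phones))
```

A column with no missing cells has zero variance in its indicator, so it cannot be correlated with anything; the sketch returns `None` in that case rather than dividing by zero.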

Sample

First rows

 | 병원급 의료기관 | Unnamed: 1 | Unnamed: 2
0 | <NA> | <NA> | 2015. 2
1 | 병원명 | 주소 | 전화번호
2 | 다사랑병원 | 전주시 완산구 백제대로 74 (삼천동1가) | 063-228-5540
3 | 다솔아동병원 | 전주시 완산구 우전로 250 (효자동2가) | 063-280-0800
4 | 다은병원 | 전주시 완산구 세내로 277 (효자동3가) | 063-239-0114
5 | 대자인병원 | 전주시 덕진구 견훤로 390 (우아동3가) | 063-240-2000
6 | 드림솔병원 | 전주시 완산구 천잠로 507 (효자동3가) | 063-250-8000
7 | 미르아동병원 | 전주시 완산구 백제대로 100 (효자동1가) | 063-229-0114
8 | 미르피아여성병원 | 전주시 완산구 쑥고개로 343 (효자동2가) | 063-211-1004
9 | 백제병원 | 전주시 덕진구 백제대로 700 (인후동2가) | 063-240-7000

Last rows

 | 병원급 의료기관 | Unnamed: 1 | Unnamed: 2
66 | 희망병원 | 김제시 금구면 낙산1길 74-1 (금구면) | 063-540-8855
67 | 고려병원 | 완주군 삼례읍 동학로 21 고려병원 | 063-290-0114
68 | 전라북도마음사랑병원 | 완주군 소양면 소양로 465-23 (소양면) | 063-240-2100
69 | 한마음화산병원 | 완주군 화산면 운제로 100 (화산면) | 063-260-1300
70 | 의료법인이루의료재단 임실병원 | 임실군 임실읍 운수로 15 (임실읍) | 063-640-8888
71 | 의료법인희망의료재단 희망병원 | 순창군 순창읍 장류로 347 (순창읍) | 063-652-2612
72 | 부안21세기병원 | 부안군 부안읍 석정로 171 | 063-583-3366
73 | 하나성심병원 | 부안군 부안읍 낭주길 3 (부안읍) | 063-582-7119
74 | 부안드림병원 | 부안군 부안읍 번영로 189 (부안읍) | 063-580-6700
75 | 의료법인 혜성병원 | 부안군 부안읍 부령로 33 (부안읍) | 063-583-5001