Overview

Dataset statistics

Number of variables: 3
Number of observations: 77
Missing cells: 5
Missing cells (%): 2.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.9 KiB
Average record size in memory: 25.7 B
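
As a quick check on the figures above, the missing-cell percentage can be reproduced from the other statistics. The sketch below assumes the percentage is the number of missing cells divided by the total number of cells (variables × observations), which matches the reported value; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" and "Alerts" sections of this report.
n_variables = 3
n_observations = 77
missing_per_column = {"병원급 의료기관": 2, "Unnamed: 1": 2, "Unnamed: 2": 1}

# Assumption: "Missing cells (%)" = missing cells / (variables * observations).
total_cells = n_variables * n_observations
missing_cells = sum(missing_per_column.values())
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)           # 5
print(f"{missing_pct:.1f}%")   # 2.2%
```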

Variable types

Text: 3

Alerts

병원급 의료기관 has 2 (2.6%) missing values (Missing)
Unnamed: 1 has 2 (2.6%) missing values (Missing)
Unnamed: 2 has 1 (1.3%) missing values (Missing)

Reproduction

Analysis started: 2024-03-14 00:33:05.614935
Analysis finished: 2024-03-14 00:33:06.024340
Duration: 0.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

병원급 의료기관
Text

MISSING

Distinct: 75
Distinct (%): 100.0%
Missing: 2
Missing (%): 2.6%
Memory size: 748.0 B

Length

Max length: 17
Median length: 14
Mean length: 7.9066667
Min length: 3

Characters and Unicode

Total characters: 593
Distinct characters: 122
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
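
To illustrate the kind of per-character analysis described above, the following minimal sketch counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the three input strings are taken from the sample rows of this variable. The two-letter codes map to the report's labels as `Lo` = Other Letter, `Po` = Other Punctuation, `Zs` = Space Separator and `Nd` = Decimal Number. Script and block lookups are not available in the standard library, so they are left out.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# First three sample rows of the first column, copied from this report.
sample = ["병원명", "?다사랑병원?", "?다솔아동병원?"]
counts = category_counts(sample)

print(counts["Lo"])  # 14 Hangul syllables (Other Letter)
print(counts["Po"])  # 4 question marks (Other Punctuation)
```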

Unique

Unique: 75
Unique (%): 100.0%

Sample

1st row: 병원명
2nd row: ?다사랑병원?
3rd row: ?다솔아동병원?
4th row: ?다은병원?
5th row: ?대자인병원?
Value | Count | Frequency (%)
의료법인 | 3 | 3.4%
희망병원 | 2 | 2.3%
백제병원 | 1 | 1.1%
남원병원 | 1 | 1.1%
정읍박병원 | 1 | 1.1%
북면필병원 | 1 | 1.1%
의료법인한국필의료재단 | 1 | 1.1%
전라병원 | 1 | 1.1%
의료법인평화의료재단 | 1 | 1.1%
참조은병원 | 1 | 1.1%
Other values (74) | 74 | 85.1%

Most occurring characters

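Value | Count | Frequency (%)
(not preserved) | 78 | 13.2%
(not preserved) | 75 | 12.6%
? | 70 | 11.8%
(not preserved) | 24 | 4.0%
(not preserved) | 21 | 3.5%
(not preserved) | 15 | 2.5%
(not preserved) | 13 | 2.2%
(not preserved) | 12 | 2.0%
(not preserved) | 12 | 2.0%
(not preserved) | 12 | 2.0%
Other values (112) | 261 | 44.0%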

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 507 | 85.5%
Other Punctuation | 70 | 11.8%
Space Separator | 12 | 2.0%
Decimal Number | 4 | 0.7%

Most frequent character per category

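Other Letter
Value | Count | Frequency (%)
(not preserved) | 78 | 15.4%
(not preserved) | 75 | 14.8%
(not preserved) | 24 | 4.7%
(not preserved) | 21 | 4.1%
(not preserved) | 15 | 3.0%
(not preserved) | 13 | 2.6%
(not preserved) | 12 | 2.4%
(not preserved) | 12 | 2.4%
(not preserved) | 11 | 2.2%
(not preserved) | 9 | 1.8%
Other values (108) | 237 | 46.7%

Decimal Number
Value | Count | Frequency (%)
1 | 2 | 50.0%
2 | 2 | 50.0%

Other Punctuation
Value | Count | Frequency (%)
? | 70 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 12 | 100.0%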

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 507 | 85.5%
Common | 86 | 14.5%

Most frequent character per script

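Hangul
Value | Count | Frequency (%)
(not preserved) | 78 | 15.4%
(not preserved) | 75 | 14.8%
(not preserved) | 24 | 4.7%
(not preserved) | 21 | 4.1%
(not preserved) | 15 | 3.0%
(not preserved) | 13 | 2.6%
(not preserved) | 12 | 2.4%
(not preserved) | 12 | 2.4%
(not preserved) | 11 | 2.2%
(not preserved) | 9 | 1.8%
Other values (108) | 237 | 46.7%

Common
Value | Count | Frequency (%)
? | 70 | 81.4%
(space) | 12 | 14.0%
1 | 2 | 2.3%
2 | 2 | 2.3%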

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 507 | 85.5%
ASCII | 86 | 14.5%

Most frequent character per block

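Hangul
Value | Count | Frequency (%)
(not preserved) | 78 | 15.4%
(not preserved) | 75 | 14.8%
(not preserved) | 24 | 4.7%
(not preserved) | 21 | 4.1%
(not preserved) | 15 | 3.0%
(not preserved) | 13 | 2.6%
(not preserved) | 12 | 2.4%
(not preserved) | 12 | 2.4%
(not preserved) | 11 | 2.2%
(not preserved) | 9 | 1.8%
Other values (108) | 237 | 46.7%

ASCII
Value | Count | Frequency (%)
? | 70 | 81.4%
(space) | 12 | 14.0%
1 | 2 | 2.3%
2 | 2 | 2.3%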

Unnamed: 1
Text

MISSING 

Distinct: 73
Distinct (%): 97.3%
Missing: 2
Missing (%): 2.6%
Memory size: 748.0 B

Length

Max length: 39
Median length: 25
Mean length: 21.773333
Min length: 2

Characters and Unicode

Total characters: 1633
Distinct characters: 142
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 71
Unique (%): 94.7%
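
The Distinct and Unique figures for this column differ (73 versus 71), so a short sketch of the distinction may help. It assumes the usual reading of these two report fields: "distinct" counts the different non-missing values, while "unique" counts only the values that occur exactly once. The helper `distinct_and_unique` and the toy column are hypothetical. Under that reading, the 75 non-missing values here consist of 71 values seen once plus 2 values that together account for the remaining 4 occurrences.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique) for a column, ignoring missing entries (None)."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)                                # different values
    unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once
    return distinct, unique


# Hypothetical toy column: "b" is repeated, None is a missing cell.
toy_column = ["a", "b", "b", "c", None]
distinct, unique = distinct_and_unique(toy_column)
print(distinct, unique)  # 3 2

# Consistency check against the figures reported for this column.
non_missing, reported_distinct, reported_unique = 75, 73, 71
repeated_values = reported_distinct - reported_unique    # 2 values occur more than once
repeated_occurrences = non_missing - reported_unique     # they cover 4 rows in total
```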

Sample

1st row: 주소
2nd row: 전주시 완산구 백제대로 74 (삼천동1가)?
3rd row: 전주시 완산구 우전로 250 (효자동2가)?
4th row: 전주시 완산구 세내로 277 (효자동3가)?
5th row: 전주시 덕진구 견훤로 390 (우아동3가)?
Value | Count | Frequency (%)
전주시 | 36 | 10.3%
완산구 | 25 | 7.1%
덕진구 | 11 | 3.1%
김제시 | 8 | 2.3%
부안읍 | 7 | 2.0%
군산시 | 7 | 2.0%
백제대로 | 7 | 2.0%
익산시 | 6 | 1.7%
정읍시 | 6 | 1.7%
부안군 | 4 | 1.1%
Other values (179) | 234 | 66.7%

Most occurring characters

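Value | Count | Frequency (%)
(space) | 276 | 16.9%
) | 70 | 4.3%
( | 70 | 4.3%
(not preserved) | 65 | 4.0%
(not preserved) | 62 | 3.8%
(not preserved) | 60 | 3.7%
1 | 57 | 3.5%
(not preserved) | 54 | 3.3%
(not preserved) | 43 | 2.6%
(not preserved) | 41 | 2.5%
Other values (132) | 835 | 51.1%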

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 901 | 55.2%
Space Separator | 276 | 16.9%
Decimal Number | 265 | 16.2%
Close Punctuation | 70 | 4.3%
Open Punctuation | 70 | 4.3%
Other Punctuation | 38 | 2.3%
Dash Punctuation | 13 | 0.8%

Most frequent character per category

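Other Letter
Value | Count | Frequency (%)
(not preserved) | 65 | 7.2%
(not preserved) | 62 | 6.9%
(not preserved) | 60 | 6.7%
(not preserved) | 54 | 6.0%
(not preserved) | 43 | 4.8%
(not preserved) | 41 | 4.6%
(not preserved) | 37 | 4.1%
(not preserved) | 34 | 3.8%
(not preserved) | 30 | 3.3%
(not preserved) | 22 | 2.4%
Other values (116) | 453 | 50.3%

Decimal Number
Value | Count | Frequency (%)
1 | 57 | 21.5%
4 | 33 | 12.5%
3 | 32 | 12.1%
2 | 32 | 12.1%
5 | 23 | 8.7%
0 | 22 | 8.3%
7 | 20 | 7.5%
6 | 16 | 6.0%
9 | 16 | 6.0%
8 | 14 | 5.3%

Other Punctuation
Value | Count | Frequency (%)
? | 36 | 94.7%
, | 2 | 5.3%

Space Separator
Value | Count | Frequency (%)
(space) | 276 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 70 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 70 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 13 | 100.0%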

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 901 | 55.2%
Common | 732 | 44.8%

Most frequent character per script

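Hangul
Value | Count | Frequency (%)
(not preserved) | 65 | 7.2%
(not preserved) | 62 | 6.9%
(not preserved) | 60 | 6.7%
(not preserved) | 54 | 6.0%
(not preserved) | 43 | 4.8%
(not preserved) | 41 | 4.6%
(not preserved) | 37 | 4.1%
(not preserved) | 34 | 3.8%
(not preserved) | 30 | 3.3%
(not preserved) | 22 | 2.4%
Other values (116) | 453 | 50.3%

Common
Value | Count | Frequency (%)
(space) | 276 | 37.7%
) | 70 | 9.6%
( | 70 | 9.6%
1 | 57 | 7.8%
? | 36 | 4.9%
4 | 33 | 4.5%
3 | 32 | 4.4%
2 | 32 | 4.4%
5 | 23 | 3.1%
0 | 22 | 3.0%
Other values (6) | 81 | 11.1%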

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 901 | 55.2%
ASCII | 732 | 44.8%

Most frequent character per block

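ASCII
Value | Count | Frequency (%)
(space) | 276 | 37.7%
) | 70 | 9.6%
( | 70 | 9.6%
1 | 57 | 7.8%
? | 36 | 4.9%
4 | 33 | 4.5%
3 | 32 | 4.4%
2 | 32 | 4.4%
5 | 23 | 3.1%
0 | 22 | 3.0%
Other values (6) | 81 | 11.1%

Hangul
Value | Count | Frequency (%)
(not preserved) | 65 | 7.2%
(not preserved) | 62 | 6.9%
(not preserved) | 60 | 6.7%
(not preserved) | 54 | 6.0%
(not preserved) | 43 | 4.8%
(not preserved) | 41 | 4.6%
(not preserved) | 37 | 4.1%
(not preserved) | 34 | 3.8%
(not preserved) | 30 | 3.3%
(not preserved) | 22 | 2.4%
Other values (116) | 453 | 50.3%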

Unnamed: 2
Text

MISSING 

Distinct: 75
Distinct (%): 98.7%
Missing: 1
Missing (%): 1.3%
Memory size: 748.0 B

Length

Max length: 14
Median length: 13
Mean length: 12.828947
Min length: 4

Characters and Unicode

Total characters: 975
Distinct characters: 19
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 74
Unique (%): 97.4%

Sample

1st row: 2015. 2
2nd row: 전화번호
3rd row: ?063-228-5540?
4th row: ?063-280-0800?
5th row: ?063-239-0114?
Value | Count | Frequency (%)
063-545-8383 | 2 | 2.6%
063-530-3130 | 1 | 1.3%
063-538-0321 | 1 | 1.3%
063-530-7100 | 1 | 1.3%
063-538-9730 | 1 | 1.3%
063-571-0845 | 1 | 1.3%
063-861-2700 | 1 | 1.3%
063-840-2305 | 1 | 1.3%
063-840-5000 | 1 | 1.3%
063-859-7700 | 1 | 1.3%
Other values (66) | 66 | 85.7%

Most occurring characters

Value | Count | Frequency (%)
0 | 218 | 22.4%
- | 149 | 15.3%
3 | 120 | 12.3%
6 | 99 | 10.2%
? | 72 | 7.4%
2 | 69 | 7.1%
5 | 63 | 6.5%
1 | 49 | 5.0%
8 | 46 | 4.7%
4 | 37 | 3.8%
Other values (9) | 53 | 5.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 747 | 76.6%
Dash Punctuation | 149 | 15.3%
Other Punctuation | 73 | 7.5%
Other Letter | 4 | 0.4%
Space Separator | 1 | 0.1%
Math Symbol | 1 | 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 218 | 29.2%
3 | 120 | 16.1%
6 | 99 | 13.3%
2 | 69 | 9.2%
5 | 63 | 8.4%
1 | 49 | 6.6%
8 | 46 | 6.2%
4 | 37 | 5.0%
7 | 25 | 3.3%
9 | 21 | 2.8%

Other Letter
Value | Count | Frequency (%)
전 | 1 | 25.0%
화 | 1 | 25.0%
번 | 1 | 25.0%
호 | 1 | 25.0%

Other Punctuation
Value | Count | Frequency (%)
? | 72 | 98.6%
. | 1 | 1.4%

Dash Punctuation
Value | Count | Frequency (%)
- | 149 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 971 | 99.6%
Hangul | 4 | 0.4%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 218 | 22.5%
- | 149 | 15.3%
3 | 120 | 12.4%
6 | 99 | 10.2%
? | 72 | 7.4%
2 | 69 | 7.1%
5 | 63 | 6.5%
1 | 49 | 5.0%
8 | 46 | 4.7%
4 | 37 | 3.8%
Other values (5) | 49 | 5.0%

Hangul
Value | Count | Frequency (%)
전 | 1 | 25.0%
화 | 1 | 25.0%
번 | 1 | 25.0%
호 | 1 | 25.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 971 | 99.6%
Hangul | 4 | 0.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 218 | 22.5%
- | 149 | 15.3%
3 | 120 | 12.4%
6 | 99 | 10.2%
? | 72 | 7.4%
2 | 69 | 7.1%
5 | 63 | 6.5%
1 | 49 | 5.0%
8 | 46 | 4.7%
4 | 37 | 3.8%
Other values (5) | 49 | 5.0%

Hangul
Value | Count | Frequency (%)
전 | 1 | 25.0%
화 | 1 | 25.0%
번 | 1 | 25.0%
호 | 1 | 25.0%

Correlations

Variable | 병원급 의료기관 | Unnamed: 1 | Unnamed: 2
병원급 의료기관 | 1.000 | 1.000 | 1.000
Unnamed: 1 | 1.000 | 1.000 | 1.000
Unnamed: 2 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
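
As a rough illustration of nullity correlation, the sketch below computes the Pearson correlation between 0/1 missing-value indicators for two columns. It assumes that this is the quantity the heatmap displays; the `pearson` helper is written out by hand for the example. The missing row positions (rows 0 and 76) are read off the Sample section of this report and agree with the per-column missing counts in the Alerts section.

```python
import math


def pearson(x, y):
    """Plain Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


n_rows = 77
# Row indices with a missing value, taken from the first and last sample rows.
missing_rows = {
    "병원급 의료기관": {0, 76},
    "Unnamed: 1": {0, 76},
    "Unnamed: 2": {76},
}
# 1 where the cell is missing, 0 where it is present.
indicators = {
    name: [1 if i in rows else 0 for i in range(n_rows)]
    for name, rows in missing_rows.items()
}

r_same = pearson(indicators["병원급 의료기관"], indicators["Unnamed: 1"])
r_diff = pearson(indicators["병원급 의료기관"], indicators["Unnamed: 2"])
print(round(r_same, 3), round(r_diff, 3))
```

The first two columns are missing in exactly the same rows, so their nullity correlation is 1; the third column shares only one of those rows, so its correlation with the others is lower.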

Sample

First rows

Index | 병원급 의료기관 | Unnamed: 1 | Unnamed: 2
0 | <NA> | <NA> | 2015. 2
1 | 병원명 | 주소 | 전화번호
2 | ?다사랑병원? | 전주시 완산구 백제대로 74 (삼천동1가)? | ?063-228-5540?
3 | ?다솔아동병원? | 전주시 완산구 우전로 250 (효자동2가)? | ?063-280-0800?
4 | ?다은병원? | 전주시 완산구 세내로 277 (효자동3가)? | ?063-239-0114?
5 | ?대자인병원? | 전주시 덕진구 견훤로 390 (우아동3가)? | ?063-240-2000?
6 | ?드림솔병원? | 전주시 완산구 천잠로 507 (효자동3가)? | ?063-250-8000?
7 | ?미르아동병원? | 전주시 완산구 백제대로 100 (효자동1가)? | ?063-229-0114?
8 | ?미르피아여성병원? | 전주시 완산구 쑥고개로 343 (효자동2가)? | ?063-211-1004?
9 | ?백제병원? | 전주시 덕진구 백제대로 700 (인후동2가)? | ?063-240-7000?

Last rows

Index | 병원급 의료기관 | Unnamed: 1 | Unnamed: 2
67 | 고려병원 | 완주군 삼례읍 동학로 21 고려병원 | 063-290-0114
68 | 전라북도마음사랑병원 | 완주군 소양면 소양로 465-23 (소양면) | 063-240-2100
69 | 한마음화산병원 | 완주군 화산면 운제로 100 (화산면) | 063-260-1300
70 | 의료법인이루의료재단 임실병원 | 임실군 임실읍 운수로 15 (임실읍) | 063-640-8888
71 | 의료법인희망의료재단 희망병원 | 순창군 순창읍 장류로 347 (순창읍) | 063-652-2612
72 | 부안21세기병원 | 부안군 부안읍 석정로 171 | 063-583-3366
73 | 하나성심병원 | 부안군 부안읍 낭주길 3 (부안읍) | 063-582-7119
74 | 부안드림병원 | 부안군 부안읍 번영로 189 (부안읍) | 063-580-6700
75 | 의료법인 혜성병원 | 부안군 부안읍 부령로 33 (부안읍) | 063-583-5001
76 | <NA> | <NA> | <NA>
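
The sample rows suggest that the real column header sits in row 1 (병원명 "hospital name", 주소 "address", 전화번호 "phone number"), that row 0 holds only a date-like note, and that many values are wrapped in stray `?` characters. The following minimal sketch shows one possible cleanup before re-profiling. It assumes the `?` characters are encoding debris rather than part of the data, and the helper `clean_rows` is hypothetical; the input rows are copied from the sample above.

```python
def clean_rows(rows):
    """Promote row 1 to the header, drop empty rows, strip stray '?' marks.

    `rows` is a list of 3-item lists where None stands for a missing cell.
    Assumption: leading/trailing '?' characters are artifacts, not data.
    """
    header = rows[1]
    body = [r for r in rows[2:] if any(v is not None for v in r)]
    cleaned = [
        [v.strip("?").strip() if v is not None else None for v in r]
        for r in body
    ]
    return header, cleaned


rows = [
    [None, None, "2015. 2"],
    ["병원명", "주소", "전화번호"],
    ["?다사랑병원?", "전주시 완산구 백제대로 74 (삼천동1가)?", "?063-228-5540?"],
    ["고려병원", "완주군 삼례읍 동학로 21 고려병원", "063-290-0114"],
    [None, None, None],
]
header, cleaned = clean_rows(rows)
print(header)
print(cleaned[0])
```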