Overview

Dataset statistics

Number of variables8
Number of observations105
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.7 KiB
Average record size in memory65.3 B

Variable types

Text3
Boolean4
Categorical1

Dataset

Description경상남도 양산시에서 지정된 코로나19 예방접종이 가능한 병원 목록(의료기관명,주소지,전화번호,백신종류,접종요일)
Author경상남도 양산시
URLhttps://www.data.go.kr/data/15103421/fileData.do

Alerts

화이자 is highly overall correlated with 접종요일High correlation
모더나 is highly overall correlated with 접종요일High correlation
접종요일 is highly overall correlated with 화이자 and 1 other fieldsHigh correlation
화이자 is highly imbalanced (72.4%)Imbalance
노바백스 is highly imbalanced (81.3%)Imbalance
화이자(소아용) is highly imbalanced (54.6%)Imbalance
의료기관명 has unique valuesUnique
주소지 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:32:43.533334
Analysis finished2023-12-12 05:32:44.628070
Duration1.09 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

의료기관명
Text

UNIQUE 

Distinct105
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size972.0 B
2023-12-12T14:32:44.924973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length14
Mean length7.9333333
Min length3

Characters and Unicode

Total characters833
Distinct characters170
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique105 ?
Unique (%)100.0%

Sample

1st row가람의원
2nd row강경순의원
3rd row강소아청소년과의원
4th row고운누리의원
5th row길이비인후과의원
ValueCountFrequency (%)
의원 2
 
1.7%
가람의원 1
 
0.9%
유창훈내과의원 1
 
0.9%
이무열내과의원 1
 
0.9%
아이병원 1
 
0.9%
이레 1
 
0.9%
이동일소아청소년과의원 1
 
0.9%
이동완내과의원 1
 
0.9%
이내과의원 1
 
0.9%
홍익요양병원 1
 
0.9%
Other values (104) 104
90.4%
2023-12-12T14:32:45.839339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
107
 
12.8%
107
 
12.8%
73
 
8.8%
35
 
4.2%
32
 
3.8%
25
 
3.0%
25
 
3.0%
22
 
2.6%
19
 
2.3%
17
 
2.0%
Other values (160) 371
44.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 823
98.8%
Space Separator 10
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
107
 
13.0%
107
 
13.0%
73
 
8.9%
35
 
4.3%
32
 
3.9%
25
 
3.0%
25
 
3.0%
22
 
2.7%
19
 
2.3%
17
 
2.1%
Other values (159) 361
43.9%
Space Separator
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 823
98.8%
Common 10
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
107
 
13.0%
107
 
13.0%
73
 
8.9%
35
 
4.3%
32
 
3.9%
25
 
3.0%
25
 
3.0%
22
 
2.7%
19
 
2.3%
17
 
2.1%
Other values (159) 361
43.9%
Common
ValueCountFrequency (%)
10
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 823
98.8%
ASCII 10
 
1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
107
 
13.0%
107
 
13.0%
73
 
8.9%
35
 
4.3%
32
 
3.9%
25
 
3.0%
25
 
3.0%
22
 
2.7%
19
 
2.3%
17
 
2.1%
Other values (159) 361
43.9%
ASCII
ValueCountFrequency (%)
10
100.0%

주소지
Text

UNIQUE 

Distinct105
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size972.0 B
2023-12-12T14:32:46.177795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length39
Mean length30.838095
Min length20

Characters and Unicode

Total characters3238
Distinct characters169
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique105 ?
Unique (%)100.0%

Sample

1st row경상남도 양산시 서일동로 39, (중부동) 자연빌딩 2층
2nd row경상남도 양산시 신기서길 23, (신기동, 주공아파트상가) 202,203호
3rd row경상남도 양산시 동면 금오13길 20, 센텀빌딩 501,502호
4th row경상남도 양산시 삼호1길 34, (삼호동, 롯데마트) 2층
5th row경상남도 양산시 동면 금오13길 4, 네오스퀘어 4층 402,403호
ValueCountFrequency (%)
경상남도 105
 
15.3%
양산시 105
 
15.3%
물금읍 31
 
4.5%
중부동 20
 
2.9%
2층 19
 
2.8%
덕계동 14
 
2.0%
덕계로 12
 
1.7%
3층 11
 
1.6%
삼호동 10
 
1.5%
4층 8
 
1.2%
Other values (222) 352
51.2%
2023-12-12T14:32:46.753292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
608
 
18.8%
, 139
 
4.3%
133
 
4.1%
125
 
3.9%
111
 
3.4%
111
 
3.4%
110
 
3.4%
106
 
3.3%
105
 
3.2%
1 98
 
3.0%
Other values (159) 1592
49.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1758
54.3%
Space Separator 608
 
18.8%
Decimal Number 556
 
17.2%
Other Punctuation 144
 
4.4%
Open Punctuation 73
 
2.3%
Close Punctuation 73
 
2.3%
Dash Punctuation 8
 
0.2%
Uppercase Letter 7
 
0.2%
Math Symbol 6
 
0.2%
Lowercase Letter 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
133
 
7.6%
125
 
7.1%
111
 
6.3%
111
 
6.3%
110
 
6.3%
106
 
6.0%
105
 
6.0%
80
 
4.6%
78
 
4.4%
59
 
3.4%
Other values (131) 740
42.1%
Decimal Number
ValueCountFrequency (%)
1 98
17.6%
2 88
15.8%
0 77
13.8%
3 74
13.3%
4 60
10.8%
5 47
8.5%
6 44
7.9%
8 29
 
5.2%
7 24
 
4.3%
9 15
 
2.7%
Uppercase Letter
ValueCountFrequency (%)
Y 2
28.6%
B 2
28.6%
E 1
14.3%
S 1
14.3%
C 1
14.3%
Other Punctuation
ValueCountFrequency (%)
, 139
96.5%
· 3
 
2.1%
& 1
 
0.7%
. 1
 
0.7%
Lowercase Letter
ValueCountFrequency (%)
a 2
40.0%
p 1
20.0%
l 1
20.0%
z 1
20.0%
Space Separator
ValueCountFrequency (%)
608
100.0%
Open Punctuation
ValueCountFrequency (%)
( 73
100.0%
Close Punctuation
ValueCountFrequency (%)
) 73
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1758
54.3%
Common 1468
45.3%
Latin 12
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
133
 
7.6%
125
 
7.1%
111
 
6.3%
111
 
6.3%
110
 
6.3%
106
 
6.0%
105
 
6.0%
80
 
4.6%
78
 
4.4%
59
 
3.4%
Other values (131) 740
42.1%
Common
ValueCountFrequency (%)
608
41.4%
, 139
 
9.5%
1 98
 
6.7%
2 88
 
6.0%
0 77
 
5.2%
3 74
 
5.0%
( 73
 
5.0%
) 73
 
5.0%
4 60
 
4.1%
5 47
 
3.2%
Other values (9) 131
 
8.9%
Latin
ValueCountFrequency (%)
a 2
16.7%
Y 2
16.7%
B 2
16.7%
E 1
8.3%
p 1
8.3%
l 1
8.3%
z 1
8.3%
S 1
8.3%
C 1
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1758
54.3%
ASCII 1477
45.6%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
608
41.2%
, 139
 
9.4%
1 98
 
6.6%
2 88
 
6.0%
0 77
 
5.2%
3 74
 
5.0%
( 73
 
4.9%
) 73
 
4.9%
4 60
 
4.1%
5 47
 
3.2%
Other values (17) 140
 
9.5%
Hangul
ValueCountFrequency (%)
133
 
7.6%
125
 
7.1%
111
 
6.3%
111
 
6.3%
110
 
6.3%
106
 
6.0%
105
 
6.0%
80
 
4.6%
78
 
4.4%
59
 
3.4%
Other values (131) 740
42.1%
None
ValueCountFrequency (%)
· 3
100.0%

전화번호
Text

UNIQUE 

Distinct105
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size972.0 B
2023-12-12T14:32:47.075438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1260
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique105 ?
Unique (%)100.0%

Sample

1st row055-367-3922
2nd row055-365-7582
3rd row055-365-7585
4th row055-386-7582
5th row055-364-5550
ValueCountFrequency (%)
055-367-3922 1
 
1.0%
055-785-1234 1
 
1.0%
055-386-0888 1
 
1.0%
055-381-1090 1
 
1.0%
055-781-1175 1
 
1.0%
055-367-8275 1
 
1.0%
055-387-8575 1
 
1.0%
055-383-4182 1
 
1.0%
055-367-7565 1
 
1.0%
055-785-1777 1
 
1.0%
Other values (95) 95
90.5%
2023-12-12T14:32:47.574589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 298
23.7%
- 210
16.7%
0 170
13.5%
3 122
9.7%
8 109
 
8.7%
7 83
 
6.6%
6 81
 
6.4%
2 63
 
5.0%
1 63
 
5.0%
9 33
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1050
83.3%
Dash Punctuation 210
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 298
28.4%
0 170
16.2%
3 122
11.6%
8 109
 
10.4%
7 83
 
7.9%
6 81
 
7.7%
2 63
 
6.0%
1 63
 
6.0%
9 33
 
3.1%
4 28
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 210
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1260
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 298
23.7%
- 210
16.7%
0 170
13.5%
3 122
9.7%
8 109
 
8.7%
7 83
 
6.6%
6 81
 
6.4%
2 63
 
5.0%
1 63
 
5.0%
9 33
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1260
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 298
23.7%
- 210
16.7%
0 170
13.5%
3 122
9.7%
8 109
 
8.7%
7 83
 
6.6%
6 81
 
6.4%
2 63
 
5.0%
1 63
 
5.0%
9 33
 
2.6%

화이자
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size237.0 B
True
100 
False
 
5
ValueCountFrequency (%)
True 100
95.2%
False 5
 
4.8%
2023-12-12T14:32:47.735954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

모더나
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size237.0 B
True
80 
False
25 
ValueCountFrequency (%)
True 80
76.2%
False 25
 
23.8%
2023-12-12T14:32:47.845124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

노바백스
Boolean

IMBALANCE 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size237.0 B
True
102 
False
 
3
ValueCountFrequency (%)
True 102
97.1%
False 3
 
2.9%
2023-12-12T14:32:47.972400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

화이자(소아용)
Boolean

IMBALANCE 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size237.0 B
False
95 
True
10 
ValueCountFrequency (%)
False 95
90.5%
True 10
 
9.5%
2023-12-12T14:32:48.084573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

접종요일
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)22.9%
Missing0
Missing (%)0.0%
Memory size972.0 B
월금토
20 
월수금
17 
화수금
화목금
수금토
Other values (19)
44 

Length

Max length5
Median length3
Mean length2.8571429
Min length1

Unique

Unique7 ?
Unique (%)6.7%

Sample

1st row화목금
2nd row월금토
3rd row월금토
4th row목금토
5th row수금토

Common Values

ValueCountFrequency (%)
월금토 20
19.0%
월수금 17
16.2%
화수금 9
8.6%
화목금 8
 
7.6%
수금토 7
 
6.7%
월목금 7
 
6.7%
수목금 6
 
5.7%
화목토 5
 
4.8%
3
 
2.9%
화금토 2
 
1.9%
Other values (14) 21
20.0%

Length

2023-12-12T14:32:48.224033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
월금토 20
19.0%
월수금 17
16.2%
화수금 9
8.6%
화목금 8
 
7.6%
수금토 7
 
6.7%
월목금 7
 
6.7%
수목금 6
 
5.7%
화목토 5
 
4.8%
3
 
2.9%
월~금 2
 
1.9%
Other values (14) 21
20.0%

Correlations

2023-12-12T14:32:48.355651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
화이자모더나노바백스화이자(소아용)접종요일
화이자1.0000.5020.0000.0001.000
모더나0.5021.0000.0630.0000.748
노바백스0.0000.0631.0000.0000.000
화이자(소아용)0.0000.0000.0001.0000.273
접종요일1.0000.7480.0000.2731.000
2023-12-12T14:32:48.473892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
화이자모더나접종요일화이자(소아용)노바백스
화이자1.0000.3350.8870.0000.000
모더나0.3351.0000.5400.0000.039
접종요일0.8870.5401.0000.1870.000
화이자(소아용)0.0000.0000.1871.0000.000
노바백스0.0000.0390.0000.0001.000
2023-12-12T14:32:48.860245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
화이자모더나노바백스화이자(소아용)접종요일
화이자1.0000.3350.0000.0000.887
모더나0.3351.0000.0390.0000.540
노바백스0.0000.0391.0000.0000.000
화이자(소아용)0.0000.0000.0001.0000.187
접종요일0.8870.5400.0000.1871.000

Missing values

2023-12-12T14:32:44.260497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:32:44.520513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

의료기관명주소지전화번호화이자모더나노바백스화이자(소아용)접종요일
0가람의원경상남도 양산시 서일동로 39, (중부동) 자연빌딩 2층055-367-3922YNYN화목금
1강경순의원경상남도 양산시 신기서길 23, (신기동, 주공아파트상가) 202,203호055-365-7582YYYN월금토
2강소아청소년과의원경상남도 양산시 동면 금오13길 20, 센텀빌딩 501,502호055-365-7585YYYY월금토
3고운누리의원경상남도 양산시 삼호1길 34, (삼호동, 롯데마트) 2층055-386-7582YYYN목금토
4길이비인후과의원경상남도 양산시 동면 금오13길 4, 네오스퀘어 4층 402,403호055-364-5550YYYN수금토
5김덕한의원경상남도 양산시 물금읍 황산로 597 551-2055-387-0863YNYN월수금
6김동훈의원경상남도 양산시 상북면 반회서6길 18 석계리 238-12 2층055-374-1563YYYN월수금
7김영록이비인후과의원경상남도 양산시 양산역1길 24, (중부동, 영동프라자) 403호055-388-7543YYYN화목금
8김지웅내과의원경상남도 양산시 북정로 59-1, (북정동) 메디앤타임빌딩 2층055-785-2222YYYN화수금
9김혜린의 아름다운 내과 의원경상남도 양산시 덕계로 23, (덕계동)055-365-6688YYYN화목토
의료기관명주소지전화번호화이자모더나노바백스화이자(소아용)접종요일
95푸른내과의원경상남도 양산시 양산역로 85, (중부동) 시티타워 2층 203호055-381-7582YYYN수금토
96하은소아청소년과의원경상남도 양산시 물금읍 새실로 38 402호055-381-6003YYYN목금토
97하하정형외과의원경상남도 양산시 물금읍 증산역로 143, 지오프라자 3층 303055-372-0017YYYN월목금
98한사랑이비인후과의원경상남도 양산시 덕계로 117, (덕계동) 2층055-388-3966YYYN화목금
99행복한가정의원경상남도 양산시 상북면 반회서4길 12-23, 2층055-374-7582YYYN화금토
100현대의료소비자생활협동조합현대메디컬의원경상남도 양산시 서일동2길 37-1, (중부동)055-384-8880YNYN월수금
101홈즈가정의학과의원경상남도 양산시 물금읍 새실로 38, 301호055-387-1233YYYN월수금
102홍내과의원경상남도 양산시 양산역6길 9, (중부동, BYC빌딩) 3층055-912-1004YNYN
103황외과의원경상남도 양산시 덕계로 41, (덕계동)055-366-7582YYYN월수금
104훈의원경상남도 양산시 덕계로 72, (덕계동) 2~3층055-364-8561YYYN월수금