Overview

Dataset statistics

Number of variables6
Number of observations39
Missing cells19
Missing cells (%)8.1%
Duplicate rows1
Duplicate rows (%)2.6%
Total size in memory2.0 KiB
Average record size in memory51.4 B

Variable types

Categorical1
Text5

Dataset

Description산청군의 의료기관 정보(의료기관 구분, 업소명, 소재지, 대표자, 전화번호, 팩스번호)에 대한 공공데이터 자료입니다.
Author경상남도 산청군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3072700

Alerts

Dataset has 1 (2.6%) duplicate rowsDuplicates
업소명 has 3 (7.7%) missing valuesMissing
소재지 has 3 (7.7%) missing valuesMissing
대표자 has 3 (7.7%) missing valuesMissing
전화번호 has 3 (7.7%) missing valuesMissing
팩스번호 has 7 (17.9%) missing valuesMissing

Reproduction

Analysis started2023-12-11 00:19:48.581632
Analysis finished2023-12-11 00:19:49.174033
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct5
Distinct (%)12.8%
Missing0
Missing (%)0.0%
Memory size444.0 B
한의원
15 
의원
14 
치과의원
<NA>
요양병원
 
1

Length

Max length4
Median length3
Mean length2.8974359
Min length2

Unique

Unique1 ?
Unique (%)2.6%

Sample

1st row요양병원
2nd row의원
3rd row의원
4th row의원
5th row의원

Common Values

ValueCountFrequency (%)
한의원 15
38.5%
의원 14
35.9%
치과의원 6
 
15.4%
<NA> 3
 
7.7%
요양병원 1
 
2.6%

Length

2023-12-11T09:19:49.248954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:19:49.369598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한의원 15
38.5%
의원 14
35.9%
치과의원 6
 
15.4%
na 3
 
7.7%
요양병원 1
 
2.6%

업소명
Text

MISSING 

Distinct36
Distinct (%)100.0%
Missing3
Missing (%)7.7%
Memory size444.0 B
2023-12-11T09:19:49.569401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length5.7777778
Min length3

Characters and Unicode

Total characters208
Distinct characters71
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row산청요양병원
2nd row제생의원
3rd row서울의원
4th row청송의원
5th row한마음의원
ValueCountFrequency (%)
제생의원 1
 
2.8%
서울의원 1
 
2.8%
산청매화한의원 1
 
2.8%
경희우리한의원 1
 
2.8%
한보당한의원 1
 
2.8%
덕산한의원 1
 
2.8%
원광한의원 1
 
2.8%
청담한의원 1
 
2.8%
본디올한의원 1
 
2.8%
생초한의원 1
 
2.8%
Other values (26) 26
72.2%
2023-12-11T09:19:49.950278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
18.3%
38
18.3%
19
 
9.1%
9
 
4.3%
7
 
3.4%
6
 
2.9%
6
 
2.9%
4
 
1.9%
3
 
1.4%
3
 
1.4%
Other values (61) 75
36.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 207
99.5%
Space Separator 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
18.4%
38
18.4%
19
 
9.2%
9
 
4.3%
7
 
3.4%
6
 
2.9%
6
 
2.9%
4
 
1.9%
3
 
1.4%
3
 
1.4%
Other values (60) 74
35.7%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 207
99.5%
Common 1
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
18.4%
38
18.4%
19
 
9.2%
9
 
4.3%
7
 
3.4%
6
 
2.9%
6
 
2.9%
4
 
1.9%
3
 
1.4%
3
 
1.4%
Other values (60) 74
35.7%
Common
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 207
99.5%
ASCII 1
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
38
18.4%
38
18.4%
19
 
9.2%
9
 
4.3%
7
 
3.4%
6
 
2.9%
6
 
2.9%
4
 
1.9%
3
 
1.4%
3
 
1.4%
Other values (60) 74
35.7%
ASCII
ValueCountFrequency (%)
1
100.0%

소재지
Text

MISSING 

Distinct34
Distinct (%)94.4%
Missing3
Missing (%)7.7%
Memory size444.0 B
2023-12-11T09:19:50.184108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length26
Mean length22.027778
Min length19

Characters and Unicode

Total characters793
Distinct characters65
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)88.9%

Sample

1st row경상남도 산청군 금서면 친환경로 2605번길69
2nd row경상남도 산청군 산청읍 웅석봉로 10
3rd row경상남도 산청군 산청읍 중앙로 19
4th row경상남도 산청군 산청읍 웅석봉로 18
5th row경상남도 산청군 산청읍 덕계로 21
ValueCountFrequency (%)
경상남도 36
19.7%
산청군 36
19.7%
산청읍 12
 
6.6%
신안면 8
 
4.4%
금서면 5
 
2.7%
시천면 5
 
2.7%
원지로 4
 
2.2%
17 4
 
2.2%
지리산대로 4
 
2.2%
중앙로 4
 
2.2%
Other values (49) 65
35.5%
2023-12-11T09:19:50.788754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
147
18.5%
55
 
6.9%
49
 
6.2%
40
 
5.0%
37
 
4.7%
36
 
4.5%
36
 
4.5%
36
 
4.5%
35
 
4.4%
24
 
3.0%
Other values (55) 298
37.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 519
65.4%
Space Separator 147
 
18.5%
Decimal Number 116
 
14.6%
Dash Punctuation 6
 
0.8%
Close Punctuation 2
 
0.3%
Open Punctuation 2
 
0.3%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
55
 
10.6%
49
 
9.4%
40
 
7.7%
37
 
7.1%
36
 
6.9%
36
 
6.9%
36
 
6.9%
35
 
6.7%
24
 
4.6%
12
 
2.3%
Other values (40) 159
30.6%
Decimal Number
ValueCountFrequency (%)
1 20
17.2%
2 17
14.7%
9 14
12.1%
5 12
10.3%
7 12
10.3%
4 11
9.5%
0 10
8.6%
3 8
 
6.9%
8 6
 
5.2%
6 6
 
5.2%
Space Separator
ValueCountFrequency (%)
147
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 519
65.4%
Common 274
34.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
55
 
10.6%
49
 
9.4%
40
 
7.7%
37
 
7.1%
36
 
6.9%
36
 
6.9%
36
 
6.9%
35
 
6.7%
24
 
4.6%
12
 
2.3%
Other values (40) 159
30.6%
Common
ValueCountFrequency (%)
147
53.6%
1 20
 
7.3%
2 17
 
6.2%
9 14
 
5.1%
5 12
 
4.4%
7 12
 
4.4%
4 11
 
4.0%
0 10
 
3.6%
3 8
 
2.9%
- 6
 
2.2%
Other values (5) 17
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 519
65.4%
ASCII 274
34.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
147
53.6%
1 20
 
7.3%
2 17
 
6.2%
9 14
 
5.1%
5 12
 
4.4%
7 12
 
4.4%
4 11
 
4.0%
0 10
 
3.6%
3 8
 
2.9%
- 6
 
2.2%
Other values (5) 17
 
6.2%
Hangul
ValueCountFrequency (%)
55
 
10.6%
49
 
9.4%
40
 
7.7%
37
 
7.1%
36
 
6.9%
36
 
6.9%
36
 
6.9%
35
 
6.7%
24
 
4.6%
12
 
2.3%
Other values (40) 159
30.6%

대표자
Text

MISSING 

Distinct36
Distinct (%)100.0%
Missing3
Missing (%)7.7%
Memory size444.0 B
2023-12-11T09:19:50.995081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters108
Distinct characters59
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row정영숙
2nd row정종태
3rd row한태영
4th row이남규
5th row오창석
ValueCountFrequency (%)
정종태 1
 
2.8%
한태영 1
 
2.8%
서보견 1
 
2.8%
김현정 1
 
2.8%
김동수 1
 
2.8%
전동민 1
 
2.8%
장재호 1
 
2.8%
김명철 1
 
2.8%
허태강 1
 
2.8%
이영민 1
 
2.8%
Other values (26) 26
72.2%
2023-12-11T09:19:51.333488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7
 
6.5%
6
 
5.6%
6
 
5.6%
5
 
4.6%
5
 
4.6%
5
 
4.6%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
Other values (49) 63
58.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 108
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
6.5%
6
 
5.6%
6
 
5.6%
5
 
4.6%
5
 
4.6%
5
 
4.6%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
Other values (49) 63
58.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 108
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
6.5%
6
 
5.6%
6
 
5.6%
5
 
4.6%
5
 
4.6%
5
 
4.6%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
Other values (49) 63
58.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 108
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7
 
6.5%
6
 
5.6%
6
 
5.6%
5
 
4.6%
5
 
4.6%
5
 
4.6%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
Other values (49) 63
58.3%

전화번호
Text

MISSING 

Distinct34
Distinct (%)94.4%
Missing3
Missing (%)7.7%
Memory size444.0 B
2023-12-11T09:19:51.540625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters432
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)88.9%

Sample

1st row055-983-1000
2nd row055-973-2079
3rd row055-973-3828
4th row055-973-5422
5th row055-974-0975
ValueCountFrequency (%)
055-974-0417 2
 
5.6%
055-972-1319 2
 
5.6%
055-973-1391 1
 
2.8%
055-974-0676 1
 
2.8%
055-972-0302 1
 
2.8%
055-973-1077 1
 
2.8%
055-972-9799 1
 
2.8%
055-974-5200 1
 
2.8%
055-973-9782 1
 
2.8%
055-974-1047 1
 
2.8%
Other values (24) 24
66.7%
2023-12-11T09:19:51.851395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 83
19.2%
- 72
16.7%
0 60
13.9%
7 54
12.5%
9 52
12.0%
2 26
 
6.0%
3 26
 
6.0%
1 22
 
5.1%
4 16
 
3.7%
8 12
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 360
83.3%
Dash Punctuation 72
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 83
23.1%
0 60
16.7%
7 54
15.0%
9 52
14.4%
2 26
 
7.2%
3 26
 
7.2%
1 22
 
6.1%
4 16
 
4.4%
8 12
 
3.3%
6 9
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 72
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 432
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 83
19.2%
- 72
16.7%
0 60
13.9%
7 54
12.5%
9 52
12.0%
2 26
 
6.0%
3 26
 
6.0%
1 22
 
5.1%
4 16
 
3.7%
8 12
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 432
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 83
19.2%
- 72
16.7%
0 60
13.9%
7 54
12.5%
9 52
12.0%
2 26
 
6.0%
3 26
 
6.0%
1 22
 
5.1%
4 16
 
3.7%
8 12
 
2.8%

팩스번호
Text

MISSING 

Distinct32
Distinct (%)100.0%
Missing7
Missing (%)17.9%
Memory size444.0 B
2023-12-11T09:19:52.139380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.1875
Min length12

Characters and Unicode

Total characters390
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)100.0%

Sample

1st row055-983-1005
2nd row055-972-8078
3rd row055-973-3920
4th row055-973-8926
5th row0504-401-5118
ValueCountFrequency (%)
055-973-6496 1
 
3.1%
055-973-3920 1
 
3.1%
055-973-9112 1
 
3.1%
055-974-2777 1
 
3.1%
055-973-4957 1
 
3.1%
055-972-2627 1
 
3.1%
055-973-5735 1
 
3.1%
055-974-1075 1
 
3.1%
055-974-1026 1
 
3.1%
055-974-1049 1
 
3.1%
Other values (22) 22
68.8%
2023-12-11T09:19:52.484857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 77
19.7%
- 64
16.4%
0 52
13.3%
7 48
12.3%
9 40
10.3%
3 26
 
6.7%
4 21
 
5.4%
2 21
 
5.4%
1 19
 
4.9%
8 14
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 326
83.6%
Dash Punctuation 64
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 77
23.6%
0 52
16.0%
7 48
14.7%
9 40
12.3%
3 26
 
8.0%
4 21
 
6.4%
2 21
 
6.4%
1 19
 
5.8%
8 14
 
4.3%
6 8
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 64
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 390
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 77
19.7%
- 64
16.4%
0 52
13.3%
7 48
12.3%
9 40
10.3%
3 26
 
6.7%
4 21
 
5.4%
2 21
 
5.4%
1 19
 
4.9%
8 14
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 390
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 77
19.7%
- 64
16.4%
0 52
13.3%
7 48
12.3%
9 40
10.3%
3 26
 
6.7%
4 21
 
5.4%
2 21
 
5.4%
1 19
 
4.9%
8 14
 
3.6%

Correlations

2023-12-11T09:19:52.588626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분업소명소재지대표자전화번호팩스번호
구분1.0001.0000.9101.0001.0001.000
업소명1.0001.0001.0001.0001.0001.000
소재지0.9101.0001.0001.0000.9971.000
대표자1.0001.0001.0001.0001.0001.000
전화번호1.0001.0000.9971.0001.0001.000
팩스번호1.0001.0001.0001.0001.0001.000

Missing values

2023-12-11T09:19:48.896379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:19:48.991304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T09:19:49.092295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분업소명소재지대표자전화번호팩스번호
0요양병원산청요양병원경상남도 산청군 금서면 친환경로 2605번길69정영숙055-983-1000055-983-1005
1의원제생의원경상남도 산청군 산청읍 웅석봉로 10정종태055-973-2079055-972-8078
2의원서울의원경상남도 산청군 산청읍 중앙로 19한태영055-973-3828055-973-3920
3의원청송의원경상남도 산청군 산청읍 웅석봉로 18이남규055-973-5422055-973-8926
4의원한마음의원경상남도 산청군 산청읍 덕계로 21오창석055-974-09750504-401-5118
5의원윤의원경상남도 산청군 시천면 남명로 205윤기완055-973-6096<NA>
6의원덕산가정의학과의원경상남도 산청군 시천면 남명로 199나병화055-972-9318055-973-1856
7의원신세계의원경상남도 산청군 단성면 목화로 968번길 17이주봉055-972-0012055-974-0016
8의원황도영의원경상남도 산청군 신안면 원지로 17황도영055-972-7535055-972-7525
9의원밝은서울안과의원경상남도 산청군 산청읍 웅석봉로 6김휘언055-973-9937055-973-9938
구분업소명소재지대표자전화번호팩스번호
29한의원풀향기한의원경상남도 산청군 시천면 덕산대포로 20최정선055-974-5200055-974-5201
30한의원동의보감한의원경상남도 산청군 금서면 동의보감로555번길 45-27김종권055-972-9799055-972-9711
31한의원원지한의원경상남도 산청군 신안면 지리산대로 3482윤우선055-973-1077<NA>
32한의원삼성당한의원경상남도 산청군 단성면 목화로 1002안형진055-972-0302055-972-3804
33한의원창덕궁한의원경상남도 산청군 금서면 동의보감로479번길 43이동현055-972-13190303-3445-5801
34한의원다함한의원경상남도 산청군 단성면 지리산대로 2919번길7-17김주안055-974-0676<NA>
35한의원산청한방가족한의원경상남도 산청군 금서면 동의보감로479번길 53조정현055-972-13190303-3444-3451
36<NA><NA><NA><NA><NA><NA>
37<NA><NA><NA><NA><NA><NA>
38<NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

구분업소명소재지대표자전화번호팩스번호# duplicates
0<NA><NA><NA><NA><NA><NA>3