Overview

Dataset statistics

Number of variables10
Number of observations149
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.8 KiB
Average record size in memory80.9 B

Variable types

Categorical8
Text2

Dataset

Description국립암센터 진료현황에 대한 외래 일정표 입니다. 진료센터, 진료과, 연락처, 의사명, 전문분야등의 정보를 확인할 수 있습니다.
Author국립암센터
URLhttps://www.data.go.kr/data/3074206/fileData.do

Alerts

연락처 is highly overall correlated with 진료센터 and 1 other fieldsHigh correlation
진료센터 is highly overall correlated with 진료과 and 1 other fieldsHigh correlation
진료과 is highly overall correlated with 진료센터 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-13 00:35:13.414944
Analysis finished2023-12-13 00:35:14.422171
Duration1.01 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

진료센터
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
지원진료센터
31 
특수암센터
15 
대장암센터
13 
암예방검진센터
13 
폐암센터
12 
Other values (9)
65 

Length

Max length8
Median length7
Mean length5.4228188
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row위암센터
2nd row위암센터
3rd row위암센터
4th row위암센터
5th row위암센터

Common Values

ValueCountFrequency (%)
지원진료센터 31
20.8%
특수암센터 15
10.1%
대장암센터 13
8.7%
암예방검진센터 13
8.7%
폐암센터 12
 
8.1%
위암센터 11
 
7.4%
간암센터 11
 
7.4%
갑상선암센터 9
 
6.0%
유방암센터 8
 
5.4%
자궁암센터 7
 
4.7%
Other values (4) 19
12.8%

Length

2023-12-13T09:35:14.476285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
지원진료센터 31
20.8%
특수암센터 15
10.1%
대장암센터 13
8.7%
암예방검진센터 13
8.7%
폐암센터 12
 
8.1%
위암센터 11
 
7.4%
간암센터 11
 
7.4%
갑상선암센터 9
 
6.0%
유방암센터 8
 
5.4%
자궁암센터 7
 
4.7%
Other values (4) 19
12.8%

진료과
Categorical

HIGH CORRELATION 

Distinct41
Distinct (%)27.5%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
외과
30 
소화기내과
12 
혈액종양내과
12 
방사선종양학과
 
7
비뇨의학과
 
5
Other values (36)
83 

Length

Max length10
Median length7
Mean length5.1543624
Min length2

Unique

Unique14 ?
Unique (%)9.4%

Sample

1st row혈액종양내과
2nd row외과
3rd row외과
4th row혈액종양내과
5th row소화기내과

Common Values

ValueCountFrequency (%)
외과 30
20.1%
소화기내과 12
 
8.1%
혈액종양내과 12
 
8.1%
방사선종양학과 7
 
4.7%
비뇨의학과 5
 
3.4%
부인과 5
 
3.4%
가정의학클리닉 5
 
3.4%
금연클리닉 5
 
3.4%
뇌척수종양클리닉 4
 
2.7%
소화기클리닉 4
 
2.7%
Other values (31) 60
40.3%

Length

2023-12-13T09:35:14.587192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
외과 30
20.1%
혈액종양내과 12
 
8.1%
소화기내과 12
 
8.1%
방사선종양학과 7
 
4.7%
비뇨의학과 5
 
3.4%
부인과 5
 
3.4%
가정의학클리닉 5
 
3.4%
금연클리닉 5
 
3.4%
뇌척수종양클리닉 4
 
2.7%
소화기클리닉 4
 
2.7%
Other values (31) 60
40.3%

연락처
Categorical

HIGH CORRELATION 

Distinct36
Distinct (%)24.2%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
031-920-1212
13 
031-920-1168
11 
031-920-1130
11 
031-920-1230
 
7
031-920-0130
 
7
Other values (31)
100 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique6 ?
Unique (%)4.0%

Sample

1st row031-920-1125
2nd row031-920-1120
3rd row031-920-1120
4th row031-920-1125
5th row031-920-1121

Common Values

ValueCountFrequency (%)
031-920-1212 13
 
8.7%
031-920-1168 11
 
7.4%
031-920-1130 11
 
7.4%
031-920-1230 7
 
4.7%
031-920-0130 7
 
4.7%
031-920-0841 6
 
4.0%
031-920-1274 6
 
4.0%
031-920-1255 5
 
3.4%
031-920-1120 5
 
3.4%
031-920-1220 5
 
3.4%
Other values (26) 73
49.0%

Length

2023-12-13T09:35:14.692355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
031-920-1212 13
 
8.7%
031-920-1130 11
 
7.4%
031-920-1168 11
 
7.4%
031-920-1230 7
 
4.7%
031-920-0130 7
 
4.7%
031-920-0841 6
 
4.0%
031-920-1274 6
 
4.0%
031-920-1250 5
 
3.4%
031-920-1147 5
 
3.4%
031-920-1041 5
 
3.4%
Other values (26) 73
49.0%
Distinct129
Distinct (%)86.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-13T09:35:14.951225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.0805369
Min length2

Characters and Unicode

Total characters459
Distinct characters109
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)74.5%

Sample

1st row박영이
2nd row김영우
3rd row류근원
4th row김학균
5th row최일주
ValueCountFrequency (%)
류준선 3
 
1.9%
유창환 3
 
1.9%
3
 
1.9%
문성진 2
 
1.3%
서홍관 2
 
1.3%
장미소 2
 
1.3%
박현진 2
 
1.3%
조현정 2
 
1.3%
2
 
1.3%
장윤정 2
 
1.3%
Other values (122) 131
85.1%
2023-12-13T09:35:15.336222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29
 
6.3%
22
 
4.8%
17
 
3.7%
17
 
3.7%
16
 
3.5%
15
 
3.3%
12
 
2.6%
12
 
2.6%
11
 
2.4%
9
 
2.0%
Other values (99) 299
65.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 443
96.5%
Space Separator 16
 
3.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
6.5%
22
 
5.0%
17
 
3.8%
17
 
3.8%
15
 
3.4%
12
 
2.7%
12
 
2.7%
11
 
2.5%
9
 
2.0%
9
 
2.0%
Other values (98) 290
65.5%
Space Separator
ValueCountFrequency (%)
16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 443
96.5%
Common 16
 
3.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
6.5%
22
 
5.0%
17
 
3.8%
17
 
3.8%
15
 
3.4%
12
 
2.7%
12
 
2.7%
11
 
2.5%
9
 
2.0%
9
 
2.0%
Other values (98) 290
65.5%
Common
ValueCountFrequency (%)
16
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 443
96.5%
ASCII 16
 
3.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
29
 
6.5%
22
 
5.0%
17
 
3.8%
17
 
3.8%
15
 
3.4%
12
 
2.7%
12
 
2.7%
11
 
2.5%
9
 
2.0%
9
 
2.0%
Other values (98) 290
65.5%
ASCII
ValueCountFrequency (%)
16
100.0%


Categorical

Distinct8
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
없음
74 
오후
36 
오전
27 
종일
장기연수(~18.12.26.)
 
1
Other values (3)
 
3

Length

Max length16
Median length2
Mean length2.3758389
Min length2

Unique

Unique4 ?
Unique (%)2.7%

Sample

1st row오후
2nd row없음
3rd row없음
4th row오전
5th row없음

Common Values

ValueCountFrequency (%)
없음 74
49.7%
오후 36
24.2%
오전 27
 
18.1%
종일 8
 
5.4%
장기연수(~18.12.26.) 1
 
0.7%
장기연수(~19.04.30.) 1
 
0.7%
장기연수(~19.06.30.) 1
 
0.7%
단기연수(~18.12.31.) 1
 
0.7%

Length

2023-12-13T09:35:15.454454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T09:35:15.567358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
없음 74
49.7%
오후 36
24.2%
오전 27
 
18.1%
종일 8
 
5.4%
장기연수(~18.12.26 1
 
0.7%
장기연수(~19.04.30 1
 
0.7%
장기연수(~19.06.30 1
 
0.7%
단기연수(~18.12.31 1
 
0.7%


Categorical

Distinct4
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
없음
81 
오후
31 
오전
23 
종일
14 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row없음
2nd row종일
3rd row없음
4th row오후
5th row오전

Common Values

ValueCountFrequency (%)
없음 81
54.4%
오후 31
 
20.8%
오전 23
 
15.4%
종일 14
 
9.4%

Length

2023-12-13T09:35:15.690798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T09:35:15.774628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
없음 81
54.4%
오후 31
 
20.8%
오전 23
 
15.4%
종일 14
 
9.4%


Categorical

Distinct4
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
없음
85 
오후
26 
종일
20 
오전
18 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종일
2nd row없음
3rd row종일
4th row없음
5th row없음

Common Values

ValueCountFrequency (%)
없음 85
57.0%
오후 26
 
17.4%
종일 20
 
13.4%
오전 18
 
12.1%

Length

2023-12-13T09:35:15.883759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T09:35:15.968791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
없음 85
57.0%
오후 26
 
17.4%
종일 20
 
13.4%
오전 18
 
12.1%


Categorical

Distinct4
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
없음
77 
오후
31 
오전
26 
종일
15 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row오후
2nd row없음
3rd row없음
4th row없음
5th row없음

Common Values

ValueCountFrequency (%)
없음 77
51.7%
오후 31
20.8%
오전 26
 
17.4%
종일 15
 
10.1%

Length

2023-12-13T09:35:16.055987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T09:35:16.130746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
없음 77
51.7%
오후 31
20.8%
오전 26
 
17.4%
종일 15
 
10.1%


Categorical

Distinct4
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
없음
92 
오전
35 
오후
16 
종일
 
6

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row없음
2nd row없음
3rd row오전
4th row종일
5th row오전

Common Values

ValueCountFrequency (%)
없음 92
61.7%
오전 35
 
23.5%
오후 16
 
10.7%
종일 6
 
4.0%

Length

2023-12-13T09:35:16.237577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T09:35:16.311998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
없음 92
61.7%
오전 35
 
23.5%
오후 16
 
10.7%
종일 6
 
4.0%
Distinct93
Distinct (%)62.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-13T09:35:16.542798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length40
Mean length18.657718
Min length4

Characters and Unicode

Total characters2780
Distinct characters204
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)43.6%

Sample

1st row위암의 항암화학요법
2nd row위암의 수술적 치료
3rd row위암의 수술적 치료
4th row위암의 항암화학요법
5th row위암의 진단 및 내시경치료
ValueCountFrequency (%)
65
 
9.9%
치료 64
 
9.7%
진단 39
 
5.9%
수술적 35
 
5.3%
항암화학요법 14
 
2.1%
위암의 10
 
1.5%
대장암의 10
 
1.5%
진료 9
 
1.4%
수술 9
 
1.4%
암환자의 8
 
1.2%
Other values (221) 396
60.1%
2023-12-13T09:35:16.899841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
518
 
18.6%
170
 
6.1%
, 135
 
4.9%
115
 
4.1%
105
 
3.8%
88
 
3.2%
65
 
2.3%
62
 
2.2%
56
 
2.0%
51
 
1.8%
Other values (194) 1415
50.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2052
73.8%
Space Separator 518
 
18.6%
Other Punctuation 152
 
5.5%
Decimal Number 26
 
0.9%
Close Punctuation 15
 
0.5%
Open Punctuation 15
 
0.5%
Dash Punctuation 1
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
170
 
8.3%
115
 
5.6%
105
 
5.1%
88
 
4.3%
65
 
3.2%
62
 
3.0%
56
 
2.7%
51
 
2.5%
44
 
2.1%
43
 
2.1%
Other values (179) 1253
61.1%
Decimal Number
ValueCountFrequency (%)
1 11
42.3%
0 6
23.1%
7 3
 
11.5%
2 2
 
7.7%
8 2
 
7.7%
3 1
 
3.8%
9 1
 
3.8%
Other Punctuation
ValueCountFrequency (%)
, 135
88.8%
. 14
 
9.2%
· 3
 
2.0%
Space Separator
ValueCountFrequency (%)
518
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Math Symbol
ValueCountFrequency (%)
> 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2052
73.8%
Common 728
 
26.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
170
 
8.3%
115
 
5.6%
105
 
5.1%
88
 
4.3%
65
 
3.2%
62
 
3.0%
56
 
2.7%
51
 
2.5%
44
 
2.1%
43
 
2.1%
Other values (179) 1253
61.1%
Common
ValueCountFrequency (%)
518
71.2%
, 135
 
18.5%
) 15
 
2.1%
( 15
 
2.1%
. 14
 
1.9%
1 11
 
1.5%
0 6
 
0.8%
7 3
 
0.4%
· 3
 
0.4%
2 2
 
0.3%
Other values (5) 6
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2052
73.8%
ASCII 725
 
26.1%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
518
71.4%
, 135
 
18.6%
) 15
 
2.1%
( 15
 
2.1%
. 14
 
1.9%
1 11
 
1.5%
0 6
 
0.8%
7 3
 
0.4%
2 2
 
0.3%
8 2
 
0.3%
Other values (4) 4
 
0.6%
Hangul
ValueCountFrequency (%)
170
 
8.3%
115
 
5.6%
105
 
5.1%
88
 
4.3%
65
 
3.2%
62
 
3.0%
56
 
2.7%
51
 
2.5%
44
 
2.1%
43
 
2.1%
Other values (179) 1253
61.1%
None
ValueCountFrequency (%)
· 3
100.0%

Correlations

2023-12-13T09:35:16.981203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진료센터진료과연락처전문분야
진료센터1.0000.9720.9990.0000.0000.0000.0000.1270.999
진료과0.9721.0000.9890.0000.0000.0000.0000.0001.000
연락처0.9990.9891.0000.3080.0000.0000.0000.3060.998
0.0000.0000.3081.0000.3190.1510.3300.0000.000
0.0000.0000.0000.3191.0000.4560.3010.4770.000
0.0000.0000.0000.1510.4561.0000.2540.3480.000
0.0000.0000.0000.3300.3010.2541.0000.4720.000
0.1270.0000.3060.0000.4770.3480.4721.0000.000
전문분야0.9991.0000.9980.0000.0000.0000.0000.0001.000
2023-12-13T09:35:17.071282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연락처진료과진료센터
1.0000.0000.0000.0000.1210.2010.1440.191
연락처0.0001.0000.7330.9000.0000.1260.1010.000
진료과0.0000.7331.0000.6850.0000.0000.0000.000
진료센터0.0000.9000.6851.0000.0000.0650.0000.000
0.1210.0000.0000.0001.0000.1990.1490.101
0.2010.1260.0000.0650.1991.0000.0000.141
0.1440.1010.0000.0000.1490.0001.0000.065
0.1910.0000.0000.0000.1010.1410.0651.000
2023-12-13T09:35:17.158435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진료센터진료과연락처
진료센터1.0000.6850.9000.0000.0000.0000.0000.065
진료과0.6851.0000.7330.0000.0000.0000.0000.000
연락처0.9000.7331.0000.1010.0000.0000.0000.126
0.0000.0000.1011.0000.1440.0650.1490.000
0.0000.0000.0000.1441.0000.1910.1210.201
0.0000.0000.0000.0650.1911.0000.1010.141
0.0000.0000.0000.1490.1210.1011.0000.199
0.0650.0000.1260.0000.2010.1410.1991.000

Missing values

2023-12-13T09:35:14.056674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T09:35:14.381497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

진료센터진료과연락처의사명전문분야
0위암센터혈액종양내과031-920-1125박영이오후없음종일오후없음위암의 항암화학요법
1위암센터외과031-920-1120김영우없음종일없음없음없음위암의 수술적 치료
2위암센터외과031-920-1120류근원없음없음종일없음오전위암의 수술적 치료
3위암센터혈액종양내과031-920-1125김학균오전오후없음없음종일위암의 항암화학요법
4위암센터소화기내과031-920-1121최일주없음오전없음없음오전위암의 진단 및 내시경치료
5위암센터소화기내과031-920-1121김찬규오후없음오전없음없음위암의 진단 및 내시경치료
6위암센터소화기내과031-920-1121이종열오전없음없음오후없음위암과 식도암의 진단 및 내시경치료
7위암센터외과031-920-1120윤홍만장기연수(~18.12.26.)없음없음없음없음위암의 수술적 치료
8위암센터외과031-920-1120엄방울종일없음없음종일없음위암의 수술적 치료
9위암센터소화기내과031-920-1121김영일없음없음없음오전오후위암의 내시경적 치료
진료센터진료과연락처의사명전문분야
139암예방검진센터임신준비클리닉031-920-1212정연경오후오후오후오전오전임신준비관련상담
140암예방검진센터유전상담클리닉031-920-1212공선영없음오후없음오후없음유전상담진료
141암예방검진센터유전다학제클리닉031-920-1212공선영없음오후없음오후없음유전다학제통합진료(유전상담, 부인과, 유방외과, 가정의학과)
142양성자치료센터방사선종양학과031-920-0130조관호없음오전없음종일없음뇌척수종양, 두경부종양, 남자비뇨기암
143양성자치료센터방사선종양학과031-920-0130김주영오후오후없음오전없음부인암, 소아암
144양성자치료센터방사선종양학과031-920-0130김대용오후종일없음오전없음대장암, 위암, 유방암
145양성자치료센터방사선종양학과031-920-0130김태현오후없음종일없음오전간암, 췌담도암, 갑상선암, 유방암
146양성자치료센터방사선종양학과031-920-0130문성호없음오후종일오후없음두경부종양, 폐암, 안암
147양성자치료센터방사선종양학과031-920-0130김연주없음없음종일오후없음자궁암, 유방암
148양성자치료센터방사선종양학과031-920-0130서양권오후오전없음오전오전폐암, 골연부종양, 혈액종양