Overview

Dataset statistics

Number of variables7
Number of observations4545
Missing cells80
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory257.6 KiB
Average record size in memory58.0 B

Variable types

Categorical5
Numeric2

Dataset

Description한국보훈복지의료공단 광주보훈병원에서 개방하는 영상의학과 진료과별 검사건수 데이터로 진료과별 국사비별 월별검사건수의 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15067162/fileData.do

Alerts

진료과순번 is highly overall correlated with 진료과명High correlation
검사실명 is highly overall correlated with 검사종류High correlation
검사종류 is highly overall correlated with 검사실명High correlation
진료과명 is highly overall correlated with 진료과순번High correlation
진료과순번 has 80 (1.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 12:12:36.234340
Analysis finished2023-12-12 12:12:37.594892
Duration1.36 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

국사비
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size35.6 KiB
사비
2309 
국비
2236 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국비
2nd row국비
3rd row국비
4th row국비
5th row국비

Common Values

ValueCountFrequency (%)
사비 2309
50.8%
국비 2236
49.2%

Length

2023-12-12T21:12:37.664016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:12:37.786228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사비 2309
50.8%
국비 2236
49.2%

검사실명
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size35.6 KiB
일반검사
1811 
CT검사
1073 
MRI검사
665 
초음파검사
571 
골밀도
235 
Other values (3)
190 

Length

Max length5
Median length4
Mean length4.1590759
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCT검사
2nd rowCT검사
3rd rowCT검사
4th rowCT검사
5th rowMRI검사

Common Values

ValueCountFrequency (%)
일반검사 1811
39.8%
CT검사 1073
23.6%
MRI검사 665
 
14.6%
초음파검사 571
 
12.6%
골밀도 235
 
5.2%
기타 142
 
3.1%
유방검사 42
 
0.9%
ANGIO 6
 
0.1%

Length

2023-12-12T21:12:37.913635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:12:38.062589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반검사 1811
39.8%
ct검사 1073
23.6%
mri검사 665
 
14.6%
초음파검사 571
 
12.6%
골밀도 235
 
5.2%
기타 142
 
3.1%
유방검사 42
 
0.9%
angio 6
 
0.1%

검사종류
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size35.6 KiB
Chest
720 
Head & Neck
573 
Abdomen
465 
Spine
459 
Abdomen & Pelvis
426 
Other values (23)
1902 

Length

Max length28
Median length25
Mean length8.1359736
Min length3

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st rowAbdomen & Pelvis
2nd rowChest
3rd rowExtremity
4th rowHead & Neck
5th rowAbdomen & Pelvis

Common Values

ValueCountFrequency (%)
Chest 720
15.8%
Head & Neck 573
12.6%
Abdomen 465
10.2%
Spine 459
10.1%
Abdomen & Pelvis 426
9.4%
Others 410
9.0%
골밀도 235
 
5.2%
Upper. Ext. 188
 
4.1%
Skull 186
 
4.1%
Lower. Ext. 185
 
4.1%
Other values (18) 698
15.4%

Length

2023-12-12T21:12:38.197837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1048
14.5%
abdomen 892
12.4%
chest 720
10.0%
head 573
7.9%
neck 573
7.9%
ext 516
 
7.1%
spine 459
 
6.4%
pelvis 427
 
5.9%
others 423
 
5.9%
lower 262
 
3.6%
Other values (20) 1325
18.4%

진료과순번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct28
Distinct (%)0.6%
Missing80
Missing (%)1.8%
Infinite0
Infinite (%)0.0%
Mean15.152072
Minimum1
Maximum92
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size40.1 KiB
2023-12-12T21:12:38.319786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q17
median12
Q324
95-th percentile27.6
Maximum92
Range91
Interquartile range (IQR)17

Descriptive statistics

Standard deviation12.76981
Coefficient of variation (CV)0.84277649
Kurtosis15.566373
Mean15.152072
Median Absolute Deviation (MAD)8
Skewness2.9960807
Sum67654
Variance163.06804
MonotonicityIncreasing
2023-12-12T21:12:38.451005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
12 518
11.4%
25 412
 
9.1%
26 400
 
8.8%
13 372
 
8.2%
1 354
 
7.8%
10 336
 
7.4%
7 276
 
6.1%
24 270
 
5.9%
3 174
 
3.8%
2 168
 
3.7%
Other values (18) 1185
26.1%
ValueCountFrequency (%)
1 354
7.8%
2 168
3.7%
3 174
3.8%
4 30
 
0.7%
5 122
 
2.7%
6 165
3.6%
7 276
6.1%
8 133
 
2.9%
10 336
7.4%
11 17
 
0.4%
ValueCountFrequency (%)
92 17
 
0.4%
91 47
 
1.0%
40 11
 
0.2%
37 20
 
0.4%
32 7
 
0.2%
29 88
 
1.9%
28 34
 
0.7%
26 400
8.8%
25 412
9.1%
24 270
5.9%

진료과명
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size35.6 KiB
외과
518 
재활의학과
412 
응급실
400 
정형외과
372 
일반내과
354 
Other values (24)
2489 

Length

Max length7
Median length6
Mean length4.1412541
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row일반내과
2nd row일반내과
3rd row일반내과
4th row일반내과
5th row일반내과

Common Values

ValueCountFrequency (%)
외과 518
11.4%
재활의학과 412
 
9.1%
응급실 400
 
8.8%
정형외과 372
 
8.2%
일반내과 354
 
7.8%
신경과 336
 
7.4%
혈액종양내과 276
 
6.1%
비뇨의학과 270
 
5.9%
순환기내과 174
 
3.8%
소화기내과 168
 
3.7%
Other values (19) 1265
27.8%

Length

2023-12-12T21:12:38.599940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
외과 518
11.4%
재활의학과 412
 
9.1%
응급실 400
 
8.8%
정형외과 372
 
8.2%
일반내과 354
 
7.8%
신경과 336
 
7.4%
혈액종양내과 276
 
6.1%
비뇨의학과 270
 
5.9%
순환기내과 174
 
3.8%
소화기내과 168
 
3.7%
Other values (19) 1265
27.8%

접수월
Categorical

Distinct12
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size35.6 KiB
1월
429 
2월
413 
8월
398 
5월
387 
7월
384 
Other values (7)
2534 

Length

Max length3
Median length2
Mean length2.2356436
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1월
2nd row1월
3rd row1월
4th row1월
5th row1월

Common Values

ValueCountFrequency (%)
1월 429
9.4%
2월 413
9.1%
8월 398
8.8%
5월 387
8.5%
7월 384
8.4%
4월 373
8.2%
3월 371
8.2%
9월 363
8.0%
12월 362
8.0%
6월 356
7.8%
Other values (2) 709
15.6%

Length

2023-12-12T21:12:38.737581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1월 429
9.4%
2월 413
9.1%
8월 398
8.8%
5월 387
8.5%
7월 384
8.4%
4월 373
8.2%
3월 371
8.2%
9월 363
8.0%
12월 362
8.0%
6월 356
7.8%
Other values (2) 709
15.6%

횟수
Real number (ℝ)

Distinct374
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44.015842
Minimum1
Maximum1301
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size40.1 KiB
2023-12-12T21:12:38.890951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median7
Q331
95-th percentile251
Maximum1301
Range1300
Interquartile range (IQR)29

Descriptive statistics

Standard deviation104.15891
Coefficient of variation (CV)2.3663959
Kurtosis36.280882
Mean44.015842
Median Absolute Deviation (MAD)6
Skewness5.0615958
Sum200052
Variance10849.078
MonotonicityNot monotonic
2023-12-12T21:12:39.043297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 706
 
15.5%
2 476
 
10.5%
4 304
 
6.7%
3 302
 
6.6%
6 217
 
4.8%
5 156
 
3.4%
8 135
 
3.0%
7 118
 
2.6%
10 97
 
2.1%
9 85
 
1.9%
Other values (364) 1949
42.9%
ValueCountFrequency (%)
1 706
15.5%
2 476
10.5%
3 302
6.6%
4 304
6.7%
5 156
 
3.4%
6 217
 
4.8%
7 118
 
2.6%
8 135
 
3.0%
9 85
 
1.9%
10 97
 
2.1%
ValueCountFrequency (%)
1301 1
< 0.1%
1211 1
< 0.1%
1157 1
< 0.1%
1133 1
< 0.1%
1083 1
< 0.1%
1066 1
< 0.1%
1045 1
< 0.1%
1027 1
< 0.1%
1024 1
< 0.1%
1022 1
< 0.1%

Interactions

2023-12-12T21:12:37.132784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:12:36.884084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:12:37.261977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:12:37.008493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:12:39.178776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국사비검사실명검사종류진료과순번진료과명접수월횟수
국사비1.0000.1270.1510.1810.2860.0000.047
검사실명0.1271.0000.9750.5710.7120.0400.247
검사종류0.1510.9751.0000.6940.7160.0000.324
진료과순번0.1810.5710.6941.0001.0000.1620.144
진료과명0.2860.7120.7161.0001.0000.1910.334
접수월0.0000.0400.0000.1620.1911.0000.000
횟수0.0470.2470.3240.1440.3340.0001.000
2023-12-12T21:12:39.653995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진료과명국사비검사실명검사종류접수월
진료과명1.0000.2440.3760.2390.064
국사비0.2441.0000.0950.1190.000
검사실명0.3760.0951.0000.8450.017
검사종류0.2390.1190.8451.0000.000
접수월0.0640.0000.0170.0001.000
2023-12-12T21:12:39.791864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진료과순번횟수국사비검사실명검사종류진료과명접수월
진료과순번1.0000.1600.1300.3620.3950.9980.064
횟수0.1601.0000.0360.1200.1220.1240.000
국사비0.1300.0361.0000.0950.1190.2440.000
검사실명0.3620.1200.0951.0000.8450.3760.017
검사종류0.3950.1220.1190.8451.0000.2390.000
진료과명0.9980.1240.2440.3760.2391.0000.064
접수월0.0640.0000.0000.0170.0000.0641.000

Missing values

2023-12-12T21:12:37.409646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:12:37.550328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

국사비검사실명검사종류진료과순번진료과명접수월횟수
0국비CT검사Abdomen & Pelvis1일반내과1월15
1국비CT검사Chest1일반내과1월28
2국비CT검사Extremity1일반내과1월1
3국비CT검사Head & Neck1일반내과1월8
4국비MRI검사Abdomen & Pelvis1일반내과1월2
5국비MRI검사Head & Neck1일반내과1월33
6국비MRI검사Lower Ext.1일반내과1월3
7국비MRI검사Spine1일반내과1월8
8국비골밀도골밀도1일반내과1월43
9국비기타소화기계1일반내과1월3
국사비검사실명검사종류진료과순번진료과명접수월횟수
4535사비일반검사Abdomen<NA>재활센터3월2
4536사비일반검사Chest<NA>재활센터3월20
4537사비일반검사Lower. Ext.<NA>재활센터3월6
4538사비일반검사Others<NA>재활센터3월1
4539사비일반검사Spine<NA>재활센터3월4
4540사비MRI검사Spine<NA>재활센터7월1
4541사비MRI검사Upper Ext.<NA>재활센터7월1
4542사비일반검사Chest<NA>재활센터7월7
4543사비일반검사Lower. Ext.<NA>재활센터7월5
4544사비일반검사Spine<NA>재활센터7월14