Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows19
Duplicate rows (%)0.2%
Total size in memory634.8 KiB
Average record size in memory65.0 B

Variable types

Categorical3
Text1
DateTime1
Boolean1
Numeric1

Dataset

Description한국보훈복지의료공단 광주보훈병원의 검사항목 및 예약기준 공공데이터로, 검사실명, 검사코드명, 요일구분, 예약시간 등이 포함되어 있습니다.
Author한국보훈복지의료공단
URLhttps://www.data.go.kr/data/15102385/fileData.do

Alerts

검사구분 has constant value ""Constant
Dataset has 19 (0.2%) duplicate rowsDuplicates
예약가능인원 is highly overall correlated with 검사실명High correlation
검사실명 is highly overall correlated with 예약가능인원 and 1 other fieldsHigh correlation
예약가능여부 is highly overall correlated with 검사실명High correlation
예약가능여부 is highly imbalanced (99.2%)Imbalance

Reproduction

Analysis started2023-12-12 01:37:10.498714
Analysis finished2023-12-12 01:37:11.643585
Duration1.14 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

검사실명
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
내시경실
3131 
뇌혈류 & 근전도 & 뇌파
1090 
임상심리
1062 
산부인과
753 
정신과
692 
Other values (14)
3272 

Length

Max length14
Median length4
Mean length5.1922
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row흉부외과
2nd row뇌혈류 & 근전도 & 뇌파
3rd row비뇨기과
4th row폐기능검사실
5th row산부인과

Common Values

ValueCountFrequency (%)
내시경실 3131
31.3%
뇌혈류 & 근전도 & 뇌파 1090
 
10.9%
임상심리 1062
 
10.6%
산부인과 753
 
7.5%
정신과 692
 
6.9%
비뇨기과 660
 
6.6%
폐기능검사실 532
 
5.3%
심전도실 345
 
3.5%
재활의학과 339
 
3.4%
신경외과 313
 
3.1%
Other values (9) 1083
 
10.8%

Length

2023-12-12T10:37:11.738276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
내시경실 3131
21.8%
2180
15.2%
뇌혈류 1090
 
7.6%
근전도 1090
 
7.6%
뇌파 1090
 
7.6%
임상심리 1062
 
7.4%
산부인과 753
 
5.2%
정신과 692
 
4.8%
비뇨기과 660
 
4.6%
폐기능검사실 532
 
3.7%
Other values (12) 2080
14.5%
Distinct664
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T10:37:12.034805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length42
Mean length22.7706
Min length4

Characters and Unicode

Total characters227706
Distinct characters391
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row(CS용)유도초음파-천자(흉막천자)
2nd row신경학적 검사-단순검사(7개평가영역중 4개이상 평가영역시행시)
3rd row요류속도측정
4th rowPFT/폐탄성검사
5th rowWet Smear (vaginal discharge)
ValueCountFrequency (%)
531
 
2.3%
종양 331
 
1.4%
세척료 302
 
1.3%
검사 293
 
1.2%
test 252
 
1.1%
점막하 237
 
1.0%
of 234
 
1.0%
박리 233
 
1.0%
상부 233
 
1.0%
소화관 233
 
1.0%
Other values (978) 20577
87.7%
2023-12-12T10:37:12.675903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13784
 
6.1%
) 5857
 
2.6%
( 5806
 
2.5%
5575
 
2.4%
5528
 
2.4%
e 5488
 
2.4%
o 5100
 
2.2%
4939
 
2.2%
/ 4939
 
2.2%
t 4244
 
1.9%
Other values (381) 166446
73.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 117128
51.4%
Lowercase Letter 46378
 
20.4%
Uppercase Letter 25309
 
11.1%
Space Separator 13784
 
6.1%
Close Punctuation 6403
 
2.8%
Open Punctuation 6352
 
2.8%
Other Punctuation 5787
 
2.5%
Dash Punctuation 3697
 
1.6%
Decimal Number 1659
 
0.7%
Math Symbol 1041
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5575
 
4.8%
5528
 
4.7%
4939
 
4.2%
2881
 
2.5%
2530
 
2.2%
2351
 
2.0%
2342
 
2.0%
2299
 
2.0%
2287
 
2.0%
2021
 
1.7%
Other values (306) 84375
72.0%
Lowercase Letter
ValueCountFrequency (%)
e 5488
11.8%
o 5100
11.0%
t 4244
9.2%
i 3833
 
8.3%
r 3530
 
7.6%
a 3283
 
7.1%
n 3221
 
6.9%
s 2720
 
5.9%
l 2504
 
5.4%
c 2237
 
4.8%
Other values (14) 10218
22.0%
Uppercase Letter
ValueCountFrequency (%)
E 3389
13.4%
S 2617
 
10.3%
N 2331
 
9.2%
P 2245
 
8.9%
R 1643
 
6.5%
T 1380
 
5.5%
C 1326
 
5.2%
M 1268
 
5.0%
D 1243
 
4.9%
B 1229
 
4.9%
Other values (14) 6638
26.2%
Decimal Number
ValueCountFrequency (%)
1 626
37.7%
2 325
19.6%
6 157
 
9.5%
3 104
 
6.3%
7 98
 
5.9%
0 97
 
5.8%
8 81
 
4.9%
5 81
 
4.9%
4 52
 
3.1%
9 38
 
2.3%
Other Punctuation
ValueCountFrequency (%)
/ 4939
85.3%
, 725
 
12.5%
· 57
 
1.0%
. 44
 
0.8%
% 22
 
0.4%
Letter Number
ValueCountFrequency (%)
50
29.8%
41
24.4%
37
22.0%
27
16.1%
13
 
7.7%
Close Punctuation
ValueCountFrequency (%)
) 5857
91.5%
] 546
 
8.5%
Open Punctuation
ValueCountFrequency (%)
( 5806
91.4%
[ 546
 
8.6%
Space Separator
ValueCountFrequency (%)
13784
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3697
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1041
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 117128
51.4%
Latin 71855
31.6%
Common 38723
 
17.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5575
 
4.8%
5528
 
4.7%
4939
 
4.2%
2881
 
2.5%
2530
 
2.2%
2351
 
2.0%
2342
 
2.0%
2299
 
2.0%
2287
 
2.0%
2021
 
1.7%
Other values (306) 84375
72.0%
Latin
ValueCountFrequency (%)
e 5488
 
7.6%
o 5100
 
7.1%
t 4244
 
5.9%
i 3833
 
5.3%
r 3530
 
4.9%
E 3389
 
4.7%
a 3283
 
4.6%
n 3221
 
4.5%
s 2720
 
3.8%
S 2617
 
3.6%
Other values (43) 34430
47.9%
Common
ValueCountFrequency (%)
13784
35.6%
) 5857
15.1%
( 5806
15.0%
/ 4939
 
12.8%
- 3697
 
9.5%
+ 1041
 
2.7%
, 725
 
1.9%
1 626
 
1.6%
[ 546
 
1.4%
] 546
 
1.4%
Other values (12) 1156
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 117128
51.4%
ASCII 110353
48.5%
Number Forms 168
 
0.1%
None 57
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13784
 
12.5%
) 5857
 
5.3%
( 5806
 
5.3%
e 5488
 
5.0%
o 5100
 
4.6%
/ 4939
 
4.5%
t 4244
 
3.8%
i 3833
 
3.5%
- 3697
 
3.4%
r 3530
 
3.2%
Other values (59) 54075
49.0%
Hangul
ValueCountFrequency (%)
5575
 
4.8%
5528
 
4.7%
4939
 
4.2%
2881
 
2.5%
2530
 
2.2%
2351
 
2.0%
2342
 
2.0%
2299
 
2.0%
2287
 
2.0%
2021
 
1.7%
Other values (306) 84375
72.0%
None
ValueCountFrequency (%)
· 57
100.0%
Number Forms
ValueCountFrequency (%)
50
29.8%
41
24.4%
37
22.0%
27
16.1%
13
 
7.7%

요일구분
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2031 
1936 
1930 
1905 
1827 
Other values (2)
371 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2031
20.3%
1936
19.4%
1930
19.3%
1905
19.1%
1827
18.3%
186
 
1.9%
185
 
1.8%

Length

2023-12-12T10:37:12.865615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:37:13.047181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2031
20.3%
1936
19.4%
1930
19.3%
1905
19.1%
1827
18.3%
186
 
1.9%
185
 
1.8%
Distinct51
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-12-12 00:00:00
Maximum2023-12-12 23:59:00
2023-12-12T10:37:13.242875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:37:13.438666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

검사구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
전체
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전체
2nd row전체
3rd row전체
4th row전체
5th row전체

Common Values

ValueCountFrequency (%)
전체 10000
100.0%

Length

2023-12-12T10:37:13.632341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:37:13.767804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전체 10000
100.0%

예약가능여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
9993 
True
 
7
ValueCountFrequency (%)
False 9993
99.9%
True 7
 
0.1%
2023-12-12T10:37:13.900547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

예약가능인원
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.7041
Minimum0
Maximum12
Zeros7
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T10:37:14.035890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q32
95-th percentile10
Maximum12
Range12
Interquartile range (IQR)1

Descriptive statistics

Standard deviation3.2192203
Coefficient of variation (CV)1.190496
Kurtosis1.1311744
Mean2.7041
Median Absolute Deviation (MAD)0
Skewness1.657342
Sum27041
Variance10.36338
MonotonicityNot monotonic
2023-12-12T10:37:14.204024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1 7077
70.8%
10 1171
 
11.7%
6 853
 
8.5%
2 696
 
7.0%
12 128
 
1.3%
3 66
 
0.7%
0 7
 
0.1%
5 2
 
< 0.1%
ValueCountFrequency (%)
0 7
 
0.1%
1 7077
70.8%
2 696
 
7.0%
3 66
 
0.7%
5 2
 
< 0.1%
6 853
 
8.5%
10 1171
 
11.7%
12 128
 
1.3%
ValueCountFrequency (%)
12 128
 
1.3%
10 1171
 
11.7%
6 853
 
8.5%
5 2
 
< 0.1%
3 66
 
0.7%
2 696
 
7.0%
1 7077
70.8%
0 7
 
0.1%

Interactions

2023-12-12T10:37:11.216696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:37:14.326514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
검사실명요일구분예약시간예약가능여부예약가능인원
검사실명1.0000.4390.7860.7710.943
요일구분0.4391.0000.1500.0220.596
예약시간0.7860.1501.0001.0000.461
예약가능여부0.7710.0221.0001.0000.000
예약가능인원0.9430.5960.4610.0001.000
2023-12-12T10:37:14.499365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예약가능여부요일구분검사실명
예약가능여부1.0000.0230.706
요일구분0.0231.0000.211
검사실명0.7060.2111.000
2023-12-12T10:37:14.657557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예약가능인원검사실명요일구분예약가능여부
예약가능인원1.0000.7500.2670.000
검사실명0.7501.0000.2110.706
요일구분0.2670.2111.0000.023
예약가능여부0.0000.7060.0231.000

Missing values

2023-12-12T10:37:11.402299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:37:11.568555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

검사실명검사코드명요일구분예약시간검사구분예약가능여부예약가능인원
43366흉부외과(CS용)유도초음파-천자(흉막천자)13:30전체N1
19187뇌혈류 & 근전도 & 뇌파신경학적 검사-단순검사(7개평가영역중 4개이상 평가영역시행시)16:00전체N10
21276비뇨기과요류속도측정15:30전체N1
41096폐기능검사실PFT/폐탄성검사16:10전체N2
23005산부인과Wet Smear (vaginal discharge)12:00전체N1
38153정신과NP/경계력검사[청각]14:00전체N1
14175내시경실풍선소장내시경하 스텐트삽입술-경항문14:00전체N1
32893임상심리기호잇기 검사08:30전체N6
31704임상심리NE/CDR2치매척도검사15:30전체N12
11997내시경실식도(일괄전액본인부담)-점막하 박리 절제술(내시경적 상부 소화관 종양 수술)12:00전체N1
검사실명검사코드명요일구분예약시간검사구분예약가능여부예약가능인원
7855내시경실결장경하종양수술/Polypectomy/1개이상초과1개당14:00전체N1
15136뇌혈류 & 근전도 & 뇌파(NE)중증근무력증 약물검사Pharmacologic Test Of Myasthenia Gravis14:30전체N10
15237뇌혈류 & 근전도 & 뇌파Bubble test-미세기포를 이용한 우좌 단락 검사17:00전체N10
18060뇌혈류 & 근전도 & 뇌파NE/신경전도검사/감각(상지,편측)11:30전체N10
32938임상심리기호잇기 검사15:00전체N6
18963뇌혈류 & 근전도 & 뇌파바이오피드백16:00전체N10
21749비뇨기과전기적유도사정술16:30전체N1
12229내시경실에스상결장경하 종양수술-Polypectomy17:00전체N1
29711심전도실기관지유발시험/특이적(항원별)15:20전체N1
34274임상심리손가락두드리기 검사09:00전체N6

Duplicate rows

Most frequently occurring

검사실명검사코드명요일구분예약시간검사구분예약가능여부예약가능인원# duplicates
2내시경실결장경하종양수술/점막절제술및점막하종양절제Muco14:30전체N13
0내시경실결장경하종양수술/점막절제술및점막하종양절제Muco09:00전체N12
1내시경실결장경하종양수술/점막절제술및점막하종양절제Muco10:30전체N12
3내시경실결장경하종양수술/점막절제술및점막하종양절제Muco17:00전체N12
4내시경실결장경하종양수술/점막절제술및점막하종양절제Muco11:00전체N12
5내시경실결장경하종양수술/점막절제술및점막하종양절제Muco11:30전체N12
6내시경실결장경하종양수술/점막절제술및점막하종양절제Muco14:30전체N12
7내시경실결장경하종양수술/점막절제술및점막하종양절제Muco13:30전체N12
8내시경실결장경하종양수술/점막절제술및점막하종양절제Muco08:30전체N12
9내시경실결장경하종양수술/점막절제술및점막하종양절제Muco09:00전체N12