Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows50
Duplicate rows (%)0.5%
Total size in memory634.8 KiB
Average record size in memory65.0 B

Variable types

Categorical3
Text1
DateTime1
Boolean1
Numeric1

Dataset

Description대구보훈병원에서 개방하는 검사항목 및 예약기준 데이터로 검사실명, 검사코드명, 요일구분, 예약시간, 검사구분, 예약가능여부, 예약가능인원이 포함된 데이터입니다.
URLhttps://www.data.go.kr/data/15116645/fileData.do

Alerts

검사구분 has constant value ""Constant
예약가능여부 has constant value ""Constant
Dataset has 50 (0.5%) duplicate rowsDuplicates
예약가능인원 is highly overall correlated with 검사실명High correlation
검사실명 is highly overall correlated with 예약가능인원High correlation

Reproduction

Analysis started2023-12-12 05:31:56.764086
Analysis finished2023-12-12 05:31:57.664726
Duration0.9 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

검사실명
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
내시경실
1893 
임상심리실3
1636 
임상심리실1
1404 
임상심리실2
1324 
이비인후과
598 
Other values (14)
3145 

Length

Max length7
Median length6
Mean length4.942
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row임상심리실3
2nd row근전도실
3rd row흉부외과
4th row흉부외과
5th row임상심리실3

Common Values

ValueCountFrequency (%)
내시경실 1893
18.9%
임상심리실3 1636
16.4%
임상심리실1 1404
14.0%
임상심리실2 1324
13.2%
이비인후과 598
 
6.0%
근전도실 565
 
5.7%
안과 551
 
5.5%
생리기능검사실 381
 
3.8%
산부인과 327
 
3.3%
외과 285
 
2.9%
Other values (9) 1036
10.4%

Length

2023-12-12T14:31:57.728364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
내시경실 1893
18.9%
임상심리실3 1636
16.4%
임상심리실1 1404
14.0%
임상심리실2 1324
13.2%
이비인후과 598
 
6.0%
근전도실 565
 
5.7%
안과 551
 
5.5%
생리기능검사실 381
 
3.8%
산부인과 327
 
3.3%
외과 285
 
2.9%
Other values (9) 1036
10.4%
Distinct426
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T14:31:57.929180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length37
Mean length17.0517
Min length4

Characters and Unicode

Total characters170517
Distinct characters387
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row해밀턴 불안검사(HAS)
2nd rowRM/순목반사검사
3rd row외과/US Dopple(low leg artery)
4th rowCS/흉강경(수술용)
5th row신경인지기능검사 개별검사 유형1 (3 ~5개 검사)
ValueCountFrequency (%)
검사 459
 
2.4%
개별검사 403
 
2.1%
신경인지기능검사 403
 
2.1%
351
 
1.8%
점막하 205
 
1.1%
상부소화관 188
 
1.0%
평가 175
 
0.9%
행동 175
 
0.9%
증상 175
 
0.9%
초음파 151
 
0.8%
Other values (592) 16370
85.9%
2023-12-12T14:31:58.348903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9451
 
5.5%
6678
 
3.9%
6647
 
3.9%
) 5661
 
3.3%
( 5613
 
3.3%
3691
 
2.2%
3222
 
1.9%
/ 3164
 
1.9%
o 2950
 
1.7%
S 2729
 
1.6%
Other values (377) 120711
70.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 99300
58.2%
Lowercase Letter 23464
 
13.8%
Uppercase Letter 18328
 
10.7%
Space Separator 9451
 
5.5%
Close Punctuation 5900
 
3.5%
Open Punctuation 5852
 
3.4%
Other Punctuation 3772
 
2.2%
Dash Punctuation 2189
 
1.3%
Decimal Number 2134
 
1.3%
Math Symbol 92
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6678
 
6.7%
6647
 
6.7%
3691
 
3.7%
3222
 
3.2%
2250
 
2.3%
2161
 
2.2%
2007
 
2.0%
1836
 
1.8%
1652
 
1.7%
1598
 
1.6%
Other values (309) 67558
68.0%
Lowercase Letter
ValueCountFrequency (%)
o 2950
12.6%
e 2712
11.6%
l 1776
 
7.6%
t 1750
 
7.5%
i 1555
 
6.6%
r 1552
 
6.6%
p 1461
 
6.2%
n 1377
 
5.9%
a 1281
 
5.5%
s 1258
 
5.4%
Other values (14) 5792
24.7%
Uppercase Letter
ValueCountFrequency (%)
S 2729
14.9%
M 1490
 
8.1%
A 1376
 
7.5%
P 1362
 
7.4%
C 1350
 
7.4%
R 1277
 
7.0%
E 1262
 
6.9%
D 1236
 
6.7%
I 880
 
4.8%
N 873
 
4.8%
Other values (13) 4493
24.5%
Decimal Number
ValueCountFrequency (%)
2 580
27.2%
1 516
24.2%
3 263
12.3%
5 243
11.4%
7 135
 
6.3%
4 127
 
6.0%
6 101
 
4.7%
9 81
 
3.8%
8 80
 
3.7%
0 8
 
0.4%
Other Punctuation
ValueCountFrequency (%)
/ 3164
83.9%
, 555
 
14.7%
: 53
 
1.4%
Close Punctuation
ValueCountFrequency (%)
) 5661
95.9%
] 239
 
4.1%
Open Punctuation
ValueCountFrequency (%)
( 5613
95.9%
[ 239
 
4.1%
Space Separator
ValueCountFrequency (%)
9451
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2189
100.0%
Math Symbol
ValueCountFrequency (%)
~ 92
100.0%
Letter Number
ValueCountFrequency (%)
35
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 99300
58.2%
Latin 41827
24.5%
Common 29390
 
17.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6678
 
6.7%
6647
 
6.7%
3691
 
3.7%
3222
 
3.2%
2250
 
2.3%
2161
 
2.2%
2007
 
2.0%
1836
 
1.8%
1652
 
1.7%
1598
 
1.6%
Other values (309) 67558
68.0%
Latin
ValueCountFrequency (%)
o 2950
 
7.1%
S 2729
 
6.5%
e 2712
 
6.5%
l 1776
 
4.2%
t 1750
 
4.2%
i 1555
 
3.7%
r 1552
 
3.7%
M 1490
 
3.6%
p 1461
 
3.5%
n 1377
 
3.3%
Other values (38) 22475
53.7%
Common
ValueCountFrequency (%)
9451
32.2%
) 5661
19.3%
( 5613
19.1%
/ 3164
 
10.8%
- 2189
 
7.4%
2 580
 
2.0%
, 555
 
1.9%
1 516
 
1.8%
3 263
 
0.9%
5 243
 
0.8%
Other values (10) 1155
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 99300
58.2%
ASCII 71182
41.7%
Number Forms 35
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9451
 
13.3%
) 5661
 
8.0%
( 5613
 
7.9%
/ 3164
 
4.4%
o 2950
 
4.1%
S 2729
 
3.8%
e 2712
 
3.8%
- 2189
 
3.1%
l 1776
 
2.5%
t 1750
 
2.5%
Other values (57) 33187
46.6%
Hangul
ValueCountFrequency (%)
6678
 
6.7%
6647
 
6.7%
3691
 
3.7%
3222
 
3.2%
2250
 
2.3%
2161
 
2.2%
2007
 
2.0%
1836
 
1.8%
1652
 
1.7%
1598
 
1.6%
Other values (309) 67558
68.0%
Number Forms
ValueCountFrequency (%)
35
100.0%

요일구분
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1996 
1986 
1968 
1949 
1944 
Other values (2)
 
157

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
1996
20.0%
1986
19.9%
1968
19.7%
1949
19.5%
1944
19.4%
85
 
0.9%
72
 
0.7%

Length

2023-12-12T14:31:58.526902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:31:58.659726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1996
20.0%
1986
19.9%
1968
19.7%
1949
19.5%
1944
19.4%
85
 
0.9%
72
 
0.7%
Distinct49
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-12-12 00:00:00
Maximum2023-12-12 23:59:00
2023-12-12T14:31:58.859111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:31:59.046526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)

검사구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
전체
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전체
2nd row전체
3rd row전체
4th row전체
5th row전체

Common Values

ValueCountFrequency (%)
전체 10000
100.0%

Length

2023-12-12T14:31:59.251753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:31:59.366151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전체 10000
100.0%

예약가능여부
Boolean

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
10000 
ValueCountFrequency (%)
False 10000
100.0%
2023-12-12T14:31:59.470074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

예약가능인원
Real number (ℝ)

HIGH CORRELATION 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.9485
Minimum1
Maximum15
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T14:31:59.574978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q310
95-th percentile10
Maximum15
Range14
Interquartile range (IQR)9

Descriptive statistics

Standard deviation4.351062
Coefficient of variation (CV)0.87926888
Kurtosis-1.2274064
Mean4.9485
Median Absolute Deviation (MAD)1
Skewness0.55316023
Sum49485
Variance18.931741
MonotonicityNot monotonic
2023-12-12T14:31:59.716309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 4014
40.1%
10 3231
32.3%
2 1153
 
11.5%
5 598
 
6.0%
3 406
 
4.1%
15 310
 
3.1%
7 267
 
2.7%
4 15
 
0.1%
11 4
 
< 0.1%
12 2
 
< 0.1%
ValueCountFrequency (%)
1 4014
40.1%
2 1153
 
11.5%
3 406
 
4.1%
4 15
 
0.1%
5 598
 
6.0%
7 267
 
2.7%
10 3231
32.3%
11 4
 
< 0.1%
12 2
 
< 0.1%
15 310
 
3.1%
ValueCountFrequency (%)
15 310
 
3.1%
12 2
 
< 0.1%
11 4
 
< 0.1%
10 3231
32.3%
7 267
 
2.7%
5 598
 
6.0%
4 15
 
0.1%
3 406
 
4.1%
2 1153
 
11.5%
1 4014
40.1%

Interactions

2023-12-12T14:31:57.361624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:31:59.829418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
검사실명요일구분예약시간예약가능인원
검사실명1.0000.2640.7370.922
요일구분0.2641.0000.1520.136
예약시간0.7370.1521.0000.789
예약가능인원0.9220.1360.7891.000
2023-12-12T14:31:59.957499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
요일구분검사실명
요일구분1.0000.120
검사실명0.1201.000
2023-12-12T14:32:00.055078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예약가능인원검사실명요일구분
예약가능인원1.0000.7400.048
검사실명0.7401.0000.120
요일구분0.0480.1201.000

Missing values

2023-12-12T14:31:57.496375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:31:57.608998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

검사실명검사코드명요일구분예약시간검사구분예약가능여부예약가능인원
48284임상심리실3해밀턴 불안검사(HAS)11:30전체N1
2784근전도실RM/순목반사검사10:30전체N1
49849흉부외과외과/US Dopple(low leg artery)17:00전체N2
49421흉부외과CS/흉강경(수술용)11:30전체N2
44993임상심리실3신경인지기능검사 개별검사 유형1 (3 ~5개 검사)15:30전체N10
30763임상심리실1신경인지기능검사 개별검사 유형1 (6-8개 검사)09:00전체N10
30296임상심리실1성인 진단적 계산력 검사08:00전체N10
46104임상심리실3영심리도식질문지(YSQ)09:30전체N10
45659임상심리실3신경인지기능검사 개별검사 유형617:00전체N15
33328임상심리실1캘리포니아 언어학습검사09:00전체N10
검사실명검사코드명요일구분예약시간검사구분예약가능여부예약가능인원
11940내시경실상부소화관 내시경검사/수면내시경09:20전체N2
5979내시경실S상결장검사(내과용)14:40전체N1
4916내시경실(위-비급여)내시경적 상부소화관 종양수술-점막절제술 및 점막하 종양절제술10:50전체N1
31901임상심리실1우울척도/BECK우울평가08:00전체N10
46359임상심리실3우울척도/BECK우울평가09:30전체N10
42518임상심리실3다면적 인성검사-2(MMPI-2)09:00전체N10
15325산부인과질확대경검사(단순)15:50전체N1
12283내시경실역행성담췌관내시경수술/담(췌)관배액술EndoscopicBil10:10전체N2
17332생리기능검사실천식유발검사/아리돌사용09:00전체N1
14416산부인과Culdoscopy11:40전체N1

Duplicate rows

Most frequently occurring

검사실명검사코드명요일구분예약시간검사구분예약가능여부예약가능인원# duplicates
0내시경실S상결장검사(내과용)10:30전체N22
1내시경실S상결장검사(내과용)10:40전체N12
2내시경실S상결장검사(내과용)13:30전체N22
3내시경실S상결장검사(내과용)14:00전체N12
4내시경실S상결장검사(내과용)14:00전체N12
5내시경실S상결장검사(내과용)14:40전체N12
6내시경실S상결장검사(내과용)14:50전체N12
7내시경실S상결장검사(내과용)10:10전체N22
8내시경실S상결장검사(내과용)10:50전체N12
9내시경실고형암검진 위 내시경(수면)09:40전체N22