Overview

Dataset statistics

Number of variables: 10
Number of observations: 543
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 45.2 KiB
Average record size in memory: 85.2 B
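The average record size appears to be the total memory footprint divided by the number of observations. The check below is a minimal sketch, assuming 1 KiB = 1024 B and using the rounded total reported above, so the result is only approximate. The variable names are illustrative.

```python
# Figures copied from the dataset statistics above.
total_kib = 45.2   # total size in memory, KiB (rounded in the report)
n_rows = 543       # number of observations

# Assumption: 1 KiB = 1024 bytes.
avg_bytes = total_kib * 1024 / n_rows
print(f"{avg_bytes:.1f} B per record")
```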

Variable types

DateTime: 1
Categorical: 4
Numeric: 3
Unsupported: 2

Dataset

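Description: The Korea Road Traffic Authority (도로교통공단) runs special traffic safety education for people who have received administrative dispositions on their driver's licenses for traffic violations, accidents, drunk driving, reckless driving, retaliatory driving, and similar offenses. This dataset contains class schedules and reservation information for children's school bus education.
Author: 도로교통공단 (Korea Road Traffic Authority)
URL: https://www.data.go.kr/data/15087810/fileData.do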

Alerts

시간표구분 has constant value "3" (Constant)
교육반코드 has constant value "통학버스교육" (Constant)
강의실번호 is highly overall correlated with 지부코드 (High correlation)
지부코드 is highly overall correlated with 강의실번호 (High correlation)
순번 is highly imbalanced (86.1%) (Imbalance)
강의시작시간 is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)
강의종료시간 is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)
강의실정원 has 20 (3.7%) zeros (Zeros)
예약정원 has 79 (14.5%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-12 16:40:26.182932
Analysis finished: 2023-12-12 16:40:27.379205
Duration: 1.2 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
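A report like this one could probably be regenerated along the following lines. This is only a sketch under stated assumptions: the CSV path, the `cp949` encoding, the report title, and the helper name `build_report` are hypothetical, and the actual settings used for this run are those in the downloadable config.json, not the ones shown here.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "cp949") -> None:
    """Profile the school bus education schedule CSV and write an HTML report.

    Assumptions: the file is a CSV with a 교육일자 date column, and cp949 is
    only a guess at the encoding; adjust both to match the real file.
    """
    # Imported lazily so that defining this helper does not require the packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding, parse_dates=["교육일자"])
    profile = ProfileReport(
        df,
        title="Profiling Report",  # hypothetical title
        dataset={
            # Short paraphrase of the dataset description, for illustration.
            "description": "Children's school bus education schedules and reservations",
            "author": "도로교통공단",
            "url": "https://www.data.go.kr/data/15087810/fileData.do",
        },
    )
    profile.to_file(html_path)
```

Calling `build_report("schedule.csv", "report.html")` with a real file path would write the HTML report; both file names here are placeholders.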

Variables

교육일자
DateTime

Distinct: 234
Distinct (%): 43.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB
Minimum: 2020-01-02 00:00:00
Maximum: 2020-12-31 00:00:00
Histogram with fixed size bins (bins=50)

지부코드
Categorical

HIGH CORRELATION 

Distinct: 22
Distinct (%): 4.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB

Value | Count
양재교육장 | 60
인천지부 | 53
부산지부 | 50
울산경남지부 | 37
울산지소 | 27
Other values (17) | 316

Length

Max length: 8
Median length: 6
Mean length: 4.7569061
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 양재교육장
2nd row: 울산경남지부
3rd row: 양재교육장
4th row: 춘천교육장
5th row: 충주교육장

Common Values

Value | Count | Frequency (%)
양재교육장 | 60 | 11.0%
인천지부 | 53 | 9.8%
부산지부 | 50 | 9.2%
울산경남지부 | 37 | 6.8%
울산지소 | 27 | 5.0%
강북교육장 | 27 | 5.0%
의정부교육장 | 26 | 4.8%
충북지부 | 25 | 4.6%
경기지부 | 25 | 4.6%
대구지부 | 25 | 4.6%
Other values (12) | 188 | 34.6%

Length

Histogram of lengths of the category

시간표구분
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB

Value | Count
3 | 543

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3
2nd row: 3
3rd row: 3
4th row: 3
5th row: 3

Common Values

Value | Count | Frequency (%)
3 | 543 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the most common values; image not included]

교육반코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB

Value | Count
통학버스교육 | 543

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 통학버스교육
2nd row: 통학버스교육
3rd row: 통학버스교육
4th row: 통학버스교육
5th row: 통학버스교육

Common Values

Value | Count | Frequency (%)
통학버스교육 | 543 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the most common values; image not included]

강의실번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 6
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.8563536
Minimum: 1
Maximum: 6
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.9 KiB

Quantile statistics

Minimum: 1
5th percentile: 1
Q1: 1
Median: 1
Q3: 2
95th percentile: 5
Maximum: 6
Range: 5
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.2528901
Coefficient of variation (CV): 0.67491995
Kurtosis: 1.6038636
Mean: 1.8563536
Median Absolute Deviation (MAD): 0
Skewness: 1.5444526
Sum: 1008
Variance: 1.5697335
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
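
Common values

Value | Count | Frequency (%)
1 | 309 | 56.9%
2 | 107 | 19.7%
3 | 73 | 13.4%
5 | 36 | 6.6%
4 | 11 | 2.0%
6 | 7 | 1.3%

Minimum values

Value | Count | Frequency (%)
1 | 309 | 56.9%
2 | 107 | 19.7%
3 | 73 | 13.4%
4 | 11 | 2.0%
5 | 36 | 6.6%
6 | 7 | 1.3%

Maximum values

Value | Count | Frequency (%)
6 | 7 | 1.3%
5 | 36 | 6.6%
4 | 11 | 2.0%
3 | 73 | 13.4%
2 | 107 | 19.7%
1 | 309 | 56.9%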
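
The descriptive statistics reported for 강의실번호 can be cross-checked against its frequency table. The sketch below assumes the report uses the sample variance (dividing by n - 1), which is an assumption, although it does reproduce the reported figures to the digits shown. Only the Python standard library is used.

```python
from math import sqrt

# Value -> count, copied from the 강의실번호 frequency table.
counts = {1: 309, 2: 107, 3: 73, 4: 11, 5: 36, 6: 7}

n = sum(counts.values())                        # number of observations
total = sum(v * c for v, c in counts.items())   # the "Sum" statistic
mean = total / n

# Assumption: sample variance with an n - 1 denominator.
variance = sum(c * (v - mean) ** 2 for v, c in counts.items()) / (n - 1)
std = sqrt(variance)
cv = std / mean                                 # coefficient of variation

print(n, total, round(mean, 7), round(variance, 7), round(std, 7), round(cv, 7))
```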

강의시작시간
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB

순번
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB

Value | Count
1 | 527
2 | 10
3 | 6

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 527 | 97.1%
2 | 10 | 1.8%
3 | 6 | 1.1%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the most common values; image not included]
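
The 86.1% imbalance figure reported for 순번 is consistent with a score of one minus the normalized Shannon entropy of the class counts. That definition is an assumption here, not something stated in the report, but the sketch below reproduces the reported value from the counts above.

```python
from math import log2

# Class counts for 순번, copied from its frequency table.
counts = [527, 10, 6]
n = sum(counts)

# Shannon entropy in bits of the observed class distribution.
entropy = -sum((c / n) * log2(c / n) for c in counts)

# Assumed definition: 1 - entropy / maximum possible entropy for k classes.
imbalance = 1 - entropy / log2(len(counts))

print(f"{imbalance:.1%}")
```

A score of 0% would mean the three values are equally frequent, and a score near 100% means one value dominates, as it does here with 527 of 543 rows.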

강의종료시간
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB
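
Both 강의시작시간 and 강의종료시간 are flagged as unsupported, which suggests they hold time-of-day values that the profiler could not type. One possible cleaning step, assuming the values are `HH:MM:SS` strings as they appear in the sample rows, is to parse them and derive a class duration. The helper name `class_duration` is hypothetical.

```python
from datetime import datetime, timedelta

TIME_FORMAT = "%H:%M:%S"  # assumed format, based on the sample rows


def class_duration(start: str, end: str) -> timedelta:
    """Return the time between a start and end time on the same day."""
    return datetime.strptime(end, TIME_FORMAT) - datetime.strptime(start, TIME_FORMAT)


# (start, end) pairs taken from rows in the Sample section.
rows = [("14:00:00", "17:00:00"), ("18:30:00", "21:30:00"), ("10:30:00", "13:30:00")]
durations = [class_duration(start, end) for start, end in rows]
print(durations)
```

With the columns converted this way they could be profiled as numeric durations or as times rather than being rejected.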

강의실정원
Real number (ℝ)

ZEROS 

Distinct: 51
Distinct (%): 9.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 78.01105
Minimum: 0
Maximum: 200
Zeros: 20
Zeros (%): 3.7%
Negative: 0
Negative (%): 0.0%
Memory size: 4.9 KiB

Quantile statistics

Minimum: 0
5th percentile: 20
Q1: 50
Median: 90
Q3: 100
95th percentile: 120
Maximum: 200
Range: 200
Interquartile range (IQR): 50

Descriptive statistics

Standard deviation: 36.457728
Coefficient of variation (CV): 0.46734056
Kurtosis: 0.17641698
Mean: 78.01105
Median Absolute Deviation (MAD): 29
Skewness: 0.047670799
Sum: 42360
Variance: 1329.1659
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
100 | 186 | 34.3%
50 | 69 | 12.7%
120 | 37 | 6.8%
60 | 33 | 6.1%
70 | 31 | 5.7%
30 | 25 | 4.6%
0 | 20 | 3.7%
80 | 20 | 3.7%
40 | 12 | 2.2%
90 | 11 | 2.0%
Other values (41) | 99 | 18.2%

Minimum values

Value | Count | Frequency (%)
0 | 20 | 3.7%
13 | 1 | 0.2%
15 | 3 | 0.6%
19 | 1 | 0.2%
20 | 10 | 1.8%
24 | 1 | 0.2%
25 | 5 | 0.9%
26 | 1 | 0.2%
30 | 25 | 4.6%
31 | 1 | 0.2%

Maximum values

Value | Count | Frequency (%)
200 | 4 | 0.7%
180 | 2 | 0.4%
165 | 1 | 0.2%
160 | 1 | 0.2%
158 | 1 | 0.2%
155 | 1 | 0.2%
151 | 1 | 0.2%
150 | 7 | 1.3%
140 | 3 | 0.6%
139 | 1 | 0.2%

예약정원
Real number (ℝ)

ZEROS 

Distinct: 58
Distinct (%): 10.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 49.60221
Minimum: 0
Maximum: 200
Zeros: 79
Zeros (%): 14.5%
Negative: 0
Negative (%): 0.0%
Memory size: 4.9 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 30
Median: 50
Q3: 70
95th percentile: 100
Maximum: 200
Range: 200
Interquartile range (IQR): 40

Descriptive statistics

Standard deviation: 34.462346
Coefficient of variation (CV): 0.6947744
Kurtosis: 1.5471542
Mean: 49.60221
Median Absolute Deviation (MAD): 20
Skewness: 0.80865172
Sum: 26934
Variance: 1187.6533
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
50 | 137 | 25.2%
0 | 79 | 14.5%
30 | 45 | 8.3%
100 | 40 | 7.4%
70 | 34 | 6.3%
60 | 32 | 5.9%
80 | 22 | 4.1%
40 | 21 | 3.9%
20 | 19 | 3.5%
90 | 10 | 1.8%
Other values (48) | 104 | 19.2%

Minimum values

Value | Count | Frequency (%)
0 | 79 | 14.5%
1 | 4 | 0.7%
2 | 1 | 0.2%
5 | 4 | 0.7%
13 | 1 | 0.2%
14 | 1 | 0.2%
15 | 3 | 0.6%
19 | 1 | 0.2%
20 | 19 | 3.5%
24 | 2 | 0.4%

Maximum values

Value | Count | Frequency (%)
200 | 2 | 0.4%
170 | 1 | 0.2%
165 | 1 | 0.2%
160 | 1 | 0.2%
155 | 1 | 0.2%
151 | 1 | 0.2%
150 | 3 | 0.6%
140 | 1 | 0.2%
139 | 1 | 0.2%
138 | 3 | 0.6%

Interactions

[Figures: nine interaction plots; images not included]

Correlations

Correlation matrix 1 of 3 (heatmap image not included)

 | 지부코드 | 강의실번호 | 순번 | 강의실정원 | 예약정원
지부코드 | 1.000 | 0.868 | 0.506 | 0.762 | 0.707
강의실번호 | 0.868 | 1.000 | 0.000 | 0.415 | 0.366
순번 | 0.506 | 0.000 | 1.000 | 0.229 | 0.198
강의실정원 | 0.762 | 0.415 | 0.229 | 1.000 | 0.958
예약정원 | 0.707 | 0.366 | 0.198 | 0.958 | 1.000

Correlation matrix 2 of 3 (heatmap image not included)

 | 순번 | 지부코드
순번 | 1.000 | 0.302
지부코드 | 0.302 | 1.000

Correlation matrix 3 of 3 (heatmap image not included)

 | 강의실번호 | 강의실정원 | 예약정원 | 지부코드 | 순번
강의실번호 | 1.000 | 0.047 | -0.021 | 0.621 | 0.000
강의실정원 | 0.047 | 1.000 | 0.338 | 0.395 | 0.140
예약정원 | -0.021 | 0.338 | 1.000 | 0.336 | 0.124
지부코드 | 0.621 | 0.395 | 0.336 | 1.000 | 0.302
순번 | 0.000 | 0.140 | 0.124 | 0.302 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

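First rows

Index | 교육일자 | 지부코드 | 시간표구분 | 교육반코드 | 강의실번호 | 강의시작시간 | 순번 | 강의종료시간 | 강의실정원 | 예약정원
0 | 2020-01-02 | 양재교육장 | 3 | 통학버스교육 | 1 | 14:00:00 | 1 | 17:00:00 | 120 | 100
1 | 2020-01-02 | 울산경남지부 | 3 | 통학버스교육 | 1 | 10:00:00 | 1 | 13:00:00 | 150 | 150
2 | 2020-01-02 | 양재교육장 | 3 | 통학버스교육 | 2 | 10:00:00 | 1 | 13:00:00 | 120 | 100
3 | 2020-01-03 | 춘천교육장 | 3 | 통학버스교육 | 2 | 10:00:00 | 1 | 13:00:00 | 70 | 70
4 | 2020-01-06 | 충주교육장 | 3 | 통학버스교육 | 1 | 10:00:00 | 1 | 13:00:00 | 0 | 0
5 | 2020-01-06 | 충북지부 | 3 | 통학버스교육 | 1 | 10:00:00 | 1 | 13:00:00 | 90 | 90
6 | 2020-01-07 | 부산지부 | 3 | 통학버스교육 | 5 | 10:00:00 | 1 | 13:00:00 | 120 | 100
7 | 2020-01-07 | 울산지소 | 3 | 통학버스교육 | 1 | 10:00:00 | 1 | 13:00:00 | 138 | 138
8 | 2020-01-07 | 인천지부 | 3 | 통학버스교육 | 2 | 10:00:00 | 1 | 13:00:00 | 100 | 100
9 | 2020-01-08 | 제주지부 | 3 | 통학버스교육 | 1 | 18:30:00 | 1 | 21:30:00 | 100 | 70

Last rows

Index | 교육일자 | 지부코드 | 시간표구분 | 교육반코드 | 강의실번호 | 강의시작시간 | 순번 | 강의종료시간 | 강의실정원 | 예약정원
533 | 2020-12-24 | 울산경남지부 | 3 | 통학버스교육 | 1 | 18:30:00 | 1 | 21:30:00 | 61 | 61
534 | 2020-12-24 | 양재교육장 | 3 | 통학버스교육 | 3 | 10:00:00 | 1 | 13:00:00 | 100 | 48
535 | 2020-12-24 | 예산교육장 | 3 | 통학버스교육 | 1 | 10:00:00 | 1 | 13:00:00 | 100 | 62
536 | 2020-12-24 | 경기지부 | 3 | 통학버스교육 | 4 | 10:30:00 | 1 | 13:30:00 | 50 | 50
537 | 2020-12-26 | 대구지부 | 3 | 통학버스교육 | 2 | 10:00:00 | 1 | 13:00:00 | 100 | 100
538 | 2020-12-26 | 대구지부 | 3 | 통학버스교육 | 1 | 10:00:00 | 1 | 13:00:00 | 100 | 100
539 | 2020-12-28 | 의정부교육장 | 3 | 통학버스교육 | 2 | 10:30:00 | 1 | 13:30:00 | 40 | 40
540 | 2020-12-29 | 인천지부 | 3 | 통학버스교육 | 1 | 10:00:00 | 1 | 13:00:00 | 50 | 50
541 | 2020-12-31 | 양재교육장 | 3 | 통학버스교육 | 3 | 10:00:00 | 1 | 13:00:00 | 100 | 30
542 | 2020-12-31 | 양재교육장 | 3 | 통학버스교육 | 6 | 10:00:00 | 1 | 13:00:00 | 100 | 50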