Overview

Dataset statistics

Number of variables8
Number of observations45
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.1 KiB
Average record size in memory69.9 B

Variable types

Numeric2
Categorical5
Text1

Dataset

Description상주교통안전체험교육센터 교통안전체험교육 프로그램 과정명, 교육시간, 정원 및 교육 수수료 등 관련 전반적인 내용을 제공합니다.
URLhttps://www.data.go.kr/data/15046005/fileData.do

Alerts

정원 has constant value ""Constant
교육일 is highly overall correlated with 교육비 and 2 other fieldsHigh correlation
숙박여부 is highly overall correlated with 교육비 and 2 other fieldsHigh correlation
교육시간 is highly overall correlated with 교육비 and 2 other fieldsHigh correlation
번호 is highly overall correlated with 업종High correlation
교육비 is highly overall correlated with 교육일 and 2 other fieldsHigh correlation
업종 is highly overall correlated with 번호High correlation
번호 has unique valuesUnique
교육과정명 has unique valuesUnique
교육비 has 1 (2.2%) zerosZeros

Reproduction

Analysis started2023-12-12 09:00:39.090489
Analysis finished2023-12-12 09:00:39.986947
Duration0.9 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23
Minimum1
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-12T18:00:40.073526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.2
Q112
median23
Q334
95-th percentile42.8
Maximum45
Range44
Interquartile range (IQR)22

Descriptive statistics

Standard deviation13.133926
Coefficient of variation (CV)0.57104024
Kurtosis-1.2
Mean23
Median Absolute Deviation (MAD)11
Skewness0
Sum1035
Variance172.5
MonotonicityStrictly increasing
2023-12-12T18:00:40.242328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
1 1
 
2.2%
35 1
 
2.2%
26 1
 
2.2%
27 1
 
2.2%
28 1
 
2.2%
29 1
 
2.2%
30 1
 
2.2%
31 1
 
2.2%
32 1
 
2.2%
33 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
1 1
2.2%
2 1
2.2%
3 1
2.2%
4 1
2.2%
5 1
2.2%
6 1
2.2%
7 1
2.2%
8 1
2.2%
9 1
2.2%
10 1
2.2%
ValueCountFrequency (%)
45 1
2.2%
44 1
2.2%
43 1
2.2%
42 1
2.2%
41 1
2.2%
40 1
2.2%
39 1
2.2%
38 1
2.2%
37 1
2.2%
36 1
2.2%

업종
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)8.9%
Missing0
Missing (%)0.0%
Memory size492.0 B
기타
13 
버스
12 
택시
11 
화물

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타
2nd row기타
3rd row기타
4th row기타
5th row기타

Common Values

ValueCountFrequency (%)
기타 13
28.9%
버스 12
26.7%
택시 11
24.4%
화물 9
20.0%

Length

2023-12-12T18:00:40.392408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:00:40.521642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 13
28.9%
버스 12
26.7%
택시 11
24.4%
화물 9
20.0%

교육과정명
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-12T18:00:40.751208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length20
Mean length15.866667
Min length6

Characters and Unicode

Total characters714
Distinct characters64
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row일반기본 교육과정(신규교육)
2nd row일반기본 교육과정(재교육)
3rd row일반기본 교육과정(야간교육)
4th row일반기본 교육과정(고령자교육)
5th row일반심화 교육과정(신규교육)
ValueCountFrequency (%)
교육과정 10
 
8.2%
교육과정(재교육 8
 
6.6%
교육과정(신규교육 8
 
6.6%
경제운전 6
 
4.9%
교육과정(연비절감 6
 
4.9%
일반기본 4
 
3.3%
택시기본 4
 
3.3%
교육과정(야간교육 4
 
3.3%
교육과정(고령자교육 4
 
3.3%
화물 4
 
3.3%
Other values (30) 64
52.5%
2023-12-12T18:00:41.224844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
77
 
10.8%
71
 
9.9%
71
 
9.9%
50
 
7.0%
44
 
6.2%
( 34
 
4.8%
) 34
 
4.8%
22
 
3.1%
19
 
2.7%
19
 
2.7%
Other values (54) 273
38.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 566
79.3%
Space Separator 77
 
10.8%
Open Punctuation 34
 
4.8%
Close Punctuation 34
 
4.8%
Decimal Number 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
71
 
12.5%
71
 
12.5%
50
 
8.8%
44
 
7.8%
22
 
3.9%
19
 
3.4%
19
 
3.4%
12
 
2.1%
12
 
2.1%
12
 
2.1%
Other values (48) 234
41.3%
Decimal Number
ValueCountFrequency (%)
1 1
33.3%
0 1
33.3%
5 1
33.3%
Space Separator
ValueCountFrequency (%)
77
100.0%
Open Punctuation
ValueCountFrequency (%)
( 34
100.0%
Close Punctuation
ValueCountFrequency (%)
) 34
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 566
79.3%
Common 148
 
20.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
71
 
12.5%
71
 
12.5%
50
 
8.8%
44
 
7.8%
22
 
3.9%
19
 
3.4%
19
 
3.4%
12
 
2.1%
12
 
2.1%
12
 
2.1%
Other values (48) 234
41.3%
Common
ValueCountFrequency (%)
77
52.0%
( 34
23.0%
) 34
23.0%
1 1
 
0.7%
0 1
 
0.7%
5 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 566
79.3%
ASCII 148
 
20.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
77
52.0%
( 34
23.0%
) 34
23.0%
1 1
 
0.7%
0 1
 
0.7%
5 1
 
0.7%
Hangul
ValueCountFrequency (%)
71
 
12.5%
71
 
12.5%
50
 
8.8%
44
 
7.8%
22
 
3.9%
19
 
3.4%
19
 
3.4%
12
 
2.1%
12
 
2.1%
12
 
2.1%
Other values (48) 234
41.3%

교육일
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Memory size492.0 B
1일
29 
2일
12 
5일
 
2
3일
 
1
10일
 
1

Length

Max length3
Median length2
Mean length2.0222222
Min length2

Unique

Unique2 ?
Unique (%)4.4%

Sample

1st row1일
2nd row1일
3rd row1일
4th row1일
5th row2일

Common Values

ValueCountFrequency (%)
1일 29
64.4%
2일 12
26.7%
5일 2
 
4.4%
3일 1
 
2.2%
10일 1
 
2.2%

Length

2023-12-12T18:00:41.414748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:00:41.553278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1일 29
64.4%
2일 12
26.7%
5일 2
 
4.4%
3일 1
 
2.2%
10일 1
 
2.2%

교육시간
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)15.6%
Missing0
Missing (%)0.0%
Memory size492.0 B
8시간
27 
16시간
12 
40시간
 
2
24시간
 
1
80시간
 
1
Other values (2)
 
2

Length

Max length4
Median length3
Mean length3.3555556
Min length3

Unique

Unique4 ?
Unique (%)8.9%

Sample

1st row8시간
2nd row8시간
3rd row8시간
4th row8시간
5th row16시간

Common Values

ValueCountFrequency (%)
8시간 27
60.0%
16시간 12
26.7%
40시간 2
 
4.4%
24시간 1
 
2.2%
80시간 1
 
2.2%
4시간 1
 
2.2%
3시간 1
 
2.2%

Length

2023-12-12T18:00:41.696355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:00:41.830742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
8시간 27
60.0%
16시간 12
26.7%
40시간 2
 
4.4%
24시간 1
 
2.2%
80시간 1
 
2.2%
4시간 1
 
2.2%
3시간 1
 
2.2%

정원
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
30
45 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row30
2nd row30
3rd row30
4th row30
5th row30

Common Values

ValueCountFrequency (%)
30 45
100.0%

Length

2023-12-12T18:00:41.963391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:00:42.120744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
30 45
100.0%

교육비
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct13
Distinct (%)28.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean154311.11
Minimum0
Maximum864000
Zeros1
Zeros (%)2.2%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-12T18:00:42.216341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile91000
Q192000
median96000
Q3182000
95-th percentile422400
Maximum864000
Range864000
Interquartile range (IQR)90000

Descriptive statistics

Standard deviation142891.38
Coefficient of variation (CV)0.92599541
Kurtosis14.359265
Mean154311.11
Median Absolute Deviation (MAD)5000
Skewness3.4688374
Sum6944000
Variance2.0417946 × 1010
MonotonicityNot monotonic
2023-12-12T18:00:42.661914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
92000 11
24.4%
96000 8
17.8%
91000 8
17.8%
182000 5
11.1%
192000 4
 
8.9%
189000 2
 
4.4%
288000 1
 
2.2%
456000 1
 
2.2%
864000 1
 
2.2%
46000 1
 
2.2%
Other values (3) 3
 
6.7%
ValueCountFrequency (%)
0 1
 
2.2%
46000 1
 
2.2%
91000 8
17.8%
92000 11
24.4%
96000 8
17.8%
182000 5
11.1%
189000 2
 
4.4%
192000 4
 
8.9%
206000 1
 
2.2%
288000 1
 
2.2%
ValueCountFrequency (%)
864000 1
 
2.2%
520000 1
 
2.2%
456000 1
 
2.2%
288000 1
 
2.2%
206000 1
 
2.2%
192000 4
 
8.9%
189000 2
 
4.4%
182000 5
11.1%
96000 8
17.8%
92000 11
24.4%

숙박여부
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size492.0 B
비합숙
29 
합숙
16 

Length

Max length3
Median length3
Mean length2.6444444
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row비합숙
2nd row비합숙
3rd row비합숙
4th row비합숙
5th row합숙

Common Values

ValueCountFrequency (%)
비합숙 29
64.4%
합숙 16
35.6%

Length

2023-12-12T18:00:42.799490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:00:42.944256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
비합숙 29
64.4%
합숙 16
35.6%

Interactions

2023-12-12T18:00:39.609562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:00:39.451159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:00:39.685723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:00:39.535782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:00:43.048947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업종교육과정명교육일교육시간교육비숙박여부
번호1.0000.9561.0000.2350.0000.1570.000
업종0.9561.0001.0000.0000.0000.0000.219
교육과정명1.0001.0001.0001.0001.0001.0001.000
교육일0.2350.0001.0001.0001.0001.0001.000
교육시간0.0000.0001.0001.0001.0000.9961.000
교육비0.1570.0001.0001.0000.9961.0001.000
숙박여부0.0000.2191.0001.0001.0001.0001.000
2023-12-12T18:00:43.175164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육일숙박여부업종교육시간
교육일1.0000.9640.0000.975
숙박여부0.9641.0000.1370.940
업종0.0000.1371.0000.000
교육시간0.9750.9400.0001.000
2023-12-12T18:00:43.286139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호교육비업종교육일교육시간숙박여부
번호1.000-0.3150.8190.0930.0770.000
교육비-0.3151.0000.0000.9750.8980.940
업종0.8190.0001.0000.0000.0000.137
교육일0.0930.9750.0001.0000.9750.964
교육시간0.0770.8980.0000.9751.0000.940
숙박여부0.0000.9400.1370.9640.9401.000

Missing values

2023-12-12T18:00:39.795609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:00:39.928416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호업종교육과정명교육일교육시간정원교육비숙박여부
01기타일반기본 교육과정(신규교육)1일8시간3092000비합숙
12기타일반기본 교육과정(재교육)1일8시간3092000비합숙
23기타일반기본 교육과정(야간교육)1일8시간3092000비합숙
34기타일반기본 교육과정(고령자교육)1일8시간3092000비합숙
45기타일반심화 교육과정(신규교육)2일16시간30182000합숙
56기타일반심화 교육과정(재교육)2일16시간30182000합숙
67버스버스 법정심화 교육과정2일16시간30192000합숙
78버스버스 법정기본 교육과정1일8시간3096000비합숙
89버스버스기본 교육과정(신규교육)1일8시간3096000비합숙
910버스버스기본 교육과정(재교육)1일8시간3096000비합숙
번호업종교육과정명교육일교육시간정원교육비숙박여부
3536화물화물심화 교육과정(신규교육)2일16시간30189000합숙
3637화물화물심화 교육과정(재교육)2일16시간30189000합숙
3738화물운수종사자격 취득 교육과정(화물)2일16시간30192000합숙
3839기타승용 경제운전 교육과정(연비절감)1일8시간3092000비합숙
3940기타승합 경제운전 교육과정(연비절감)1일8시간3096000비합숙
4041기타화물 경제운전 교육과정(연비절감)1일8시간3091000비합숙
4142기타승용 경제운전 교육과정(연비절감 및 안전체험)1일8시간3092000비합숙
4243기타승합 경제운전 교육과정(연비절감 및 안전체험)1일8시간3096000비합숙
4344기타화물 경제운전 교육과정(연비절감 및 안전체험)1일8시간3091000비합숙
4445기타교육센터견학1일3시간300비합숙