Overview

Dataset statistics

Number of variables: 13
Number of observations: 46
Missing cells: 97
Missing cells (%): 16.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.0 KiB
Average record size in memory: 110.9 B
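
The missing-cell figures can be cross-checked against the per-column missing counts listed in the Alerts section. The following is a minimal Python sketch of that arithmetic; the variable names are illustrative and not part of the report.

```python
# Per-column missing counts as listed in the report's Alerts section.
missing_by_column = {
    "적용일": 28,
    "만료일": 27,
    "기준제작일자시작일": 5,
    "기준제작일자종료일": 37,
}
n_variables = 13
n_observations = 46

# Total cells = variables x observations; missing share is relative to that.
total_cells = n_variables * n_observations
missing_cells = sum(missing_by_column.values())
missing_pct = 100 * missing_cells / total_cells

print(missing_cells, total_cells, f"{missing_pct:.1f}%")
```

The four date columns account for all 97 missing cells, which is 16.2% of the 598 cells in the table.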

Variable types

DateTime: 5
Categorical: 6
Numeric: 2

Dataset

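Description: Vehicle inspection data managed by the Korea Transportation Safety Authority (KOTSA) under the Motor Vehicle Management Act and the Rules on the Implementation of Comprehensive Vehicle Inspection.
Author: Korea Transportation Safety Authority (한국교통안전공단)
URL: https://www.data.go.kr/data/15088047/fileData.do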

Alerts

적재량최대 has constant value "999999" [Constant]
안내 is highly overall correlated with 차종승용 and 1 other field [High correlation]
적재량최소 is highly overall correlated with 총중량최소 and 1 other field [High correlation]
연료 is highly overall correlated with 총중량최소 and 3 other fields [High correlation]
총중량최소 is highly overall correlated with 연료 and 1 other field [High correlation]
총중량최대 is highly overall correlated with 연료 [High correlation]
차종승용 is highly overall correlated with 안내 [High correlation]
적용일 has 28 (60.9%) missing values [Missing]
만료일 has 27 (58.7%) missing values [Missing]
기준제작일자시작일 has 5 (10.9%) missing values [Missing]
기준제작일자종료일 has 37 (80.4%) missing values [Missing]
총중량최소 has 21 (45.7%) zeros [Zeros]

Reproduction

Analysis started: 2023-12-12 20:59:58.208509
Analysis finished: 2023-12-12 20:59:59.403745
Duration: 1.2 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

적용일 (effective date)
Date

MISSING

Distinct: 3
Distinct (%): 16.7%
Missing: 28
Missing (%): 60.9%
Memory size: 500.0 B
Minimum: 2017-01-11 00:00:00
Maximum: 2021-07-20 00:00:00
Histogram with fixed size bins (bins=3)

만료일 (expiry date)
Date

MISSING

Distinct: 3
Distinct (%): 15.8%
Missing: 27
Missing (%): 58.7%
Memory size: 500.0 B
Minimum: 2017-01-10 00:00:00
Maximum: 2021-07-19 00:00:00
Histogram with fixed size bins (bins=3)

기준제작일자시작일 (reference manufacture date, start)
Date

MISSING

Distinct: 7
Distinct (%): 17.1%
Missing: 5
Missing (%): 10.9%
Memory size: 500.0 B
Minimum: 1996-01-01 00:00:00
Maximum: 2013-08-16 00:00:00
Histogram with fixed size bins (bins=7)

기준제작일자종료일 (reference manufacture date, end)
Date

MISSING

Distinct: 2
Distinct (%): 22.2%
Missing: 37
Missing (%): 80.4%
Memory size: 500.0 B
Minimum: 2006-02-09 00:00:00
Maximum: 2006-02-10 00:00:00
Histogram with fixed size bins (bins=2)

차종승용 (vehicle type)
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): 10.9%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 4
Median length: 2
Mean length: 2.2608696
Min length: 2

Unique

Unique: 1
Unique (%): 2.2%

Sample

1st row: 승용
2nd row: <NA>
3rd row: 승합
4th row: 승합
5th row: 승합

Common Values

Value: Count (Frequency %)
화물 [freight]: 19 (41.3%)
특수 [special-purpose]: 11 (23.9%)
승합 [van/bus]: 9 (19.6%)
<NA>: 6 (13.0%)
승용 [passenger car]: 1 (2.2%)

Length

Histogram of lengths of the category

Common Values (Plot)

Value: Count (Frequency %)
화물: 19 (41.3%)
특수: 11 (23.9%)
승합: 9 (19.6%)
na: 6 (13.0%)
승용: 1 (2.2%)
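
The length statistics for 차종승용 can be reproduced from its value counts. The sketch below (names are illustrative) expands the counts into one string length per row. The reported mean length of 2.2608696 only comes out if `<NA>` is measured as a four-character string. That suggests, though the report does not state it, that these entries are stored as literal text rather than as true nulls, which would also explain the Missing count of 0.

```python
from statistics import mean, median

# Value counts for 차종승용 as reported under Common Values.
value_counts = {"화물": 19, "특수": 11, "승합": 9, "<NA>": 6, "승용": 1}

# Expand to one length per row; "<NA>" is measured as a 4-character string.
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 7),
    "min": min(lengths),
}
print(length_stats)
```

The same check works for 용도 and 연료 by swapping in their value counts.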

용도 (usage)
Categorical

Distinct: 3
Distinct (%): 6.5%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 4
Median length: 3
Mean length: 2.826087
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 자가용
4th row: 관용
5th row: 자가용

Common Values

Value: Count (Frequency %)
자가용 [private use]: 18 (39.1%)
관용 [government use]: 18 (39.1%)
<NA>: 10 (21.7%)

Length

Histogram of lengths of the category

Common Values (Plot)

Value: Count (Frequency %)
자가용: 18 (39.1%)
관용: 18 (39.1%)
na: 10 (21.7%)

연료 (fuel)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 4
Median length: 4
Mean length: 3.6956522
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전기
2nd row: 전기
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value: Count (Frequency %)
<NA>: 39 (84.8%)
전기 [electric]: 7 (15.2%)

Length

Histogram of lengths of the category

Common Values (Plot)

Value: Count (Frequency %)
na: 39 (84.8%)
전기: 7 (15.2%)

총중량최소 (minimum gross weight)
Real number (ℝ)

HIGH CORRELATION  ZEROS

Distinct: 6
Distinct (%): 13.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5725.413
Minimum: 0
Maximum: 16000
Zeros: 21
Zeros (%): 45.7%
Negative: 0
Negative (%): 0.0%
Memory size: 546.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 3501
Q3: 14500
95-th percentile: 16000
Maximum: 16000
Range: 16000
Interquartile range (IQR): 14500

Descriptive statistics

Standard deviation: 6798.7189
Coefficient of variation (CV): 1.1874635
Kurtosis: -1.3312322
Mean: 5725.413
Median Absolute Deviation (MAD): 3501
Skewness: 0.68266838
Sum: 263369
Variance: 46222578
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=6)

Common values

Value: Count (Frequency %)
0: 21 (45.7%)
16000: 12 (26.1%)
3501: 6 (13.0%)
10000: 4 (8.7%)
4501: 2 (4.3%)
1361: 1 (2.2%)

Minimum values (ascending)

Value: Count (Frequency %)
0: 21 (45.7%)
1361: 1 (2.2%)
3501: 6 (13.0%)
4501: 2 (4.3%)
10000: 4 (8.7%)
16000: 12 (26.1%)

Maximum values (descending)

Value: Count (Frequency %)
16000: 12 (26.1%)
10000: 4 (8.7%)
4501: 2 (4.3%)
3501: 6 (13.0%)
1361: 1 (2.2%)
0: 21 (45.7%)
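
Because 총중량최소 has only six distinct values, most of its summary statistics can be rebuilt from the value counts alone. The sketch below uses Python's `statistics` module and assumes sample variance (n - 1 denominator) and linearly interpolated quartiles; those assumptions are not stated in the report, but they reproduce the reported figures. Variable names are illustrative.

```python
from statistics import mean, median, quantiles, stdev, variance

# Value counts for 총중량최소 as reported in the frequency table.
value_counts = {0: 21, 16000: 12, 3501: 6, 10000: 4, 4501: 2, 1361: 1}
values = sorted(v for v, count in value_counts.items() for _ in range(count))

# Quartiles with linear interpolation over the inclusive range of the data.
q1, q2, q3 = quantiles(values, n=4, method="inclusive")

med = median(values)
mad = median(abs(v - med) for v in values)  # median absolute deviation
std = stdev(values)                         # sample standard deviation
cv = std / mean(values)                     # coefficient of variation

summary = {
    "sum": sum(values),
    "mean": round(mean(values), 3),
    "variance": round(variance(values)),
    "std": round(std, 4),
    "cv": round(cv, 7),
    "q1": q1,
    "median": med,
    "q3": q3,
    "iqr": q3 - q1,
    "mad": mad,
}
print(summary)
```

Skewness and kurtosis are left out here because they depend on a bias-correction convention that the report does not spell out.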

총중량최대 (maximum gross weight)
Real number (ℝ)

HIGH CORRELATION

Distinct: 6
Distinct (%): 13.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 741269.61
Minimum: 1360
Maximum: 999999
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 546.0 B

Quantile statistics

Minimum: 1360
5-th percentile: 1361
Q1: 261999
Median: 999999
Q3: 999999
95-th percentile: 999999
Maximum: 999999
Range: 998639
Interquartile range (IQR): 738000

Descriptive statistics

Standard deviation: 440330.55
Coefficient of variation (CV): 0.59402213
Kurtosis: -0.76617883
Mean: 741269.61
Median Absolute Deviation (MAD): 0
Skewness: -1.1264126
Sum: 34098402
Variance: 1.9389099 × 10^11
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=6)

Common values

Value: Count (Frequency %)
999999: 34 (73.9%)
15999: 4 (8.7%)
9999: 2 (4.3%)
4500: 2 (4.3%)
1360: 2 (4.3%)
1361: 2 (4.3%)

Minimum values (ascending)

Value: Count (Frequency %)
1360: 2 (4.3%)
1361: 2 (4.3%)
4500: 2 (4.3%)
9999: 2 (4.3%)
15999: 4 (8.7%)
999999: 34 (73.9%)

Maximum values (descending)

Value: Count (Frequency %)
999999: 34 (73.9%)
15999: 4 (8.7%)
9999: 2 (4.3%)
4500: 2 (4.3%)
1361: 2 (4.3%)
1360: 2 (4.3%)
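
The Q1 of 261999 for 총중량최대 is not a value that occurs in the data. It is what linear interpolation between the two neighbouring ranks (15999 and 999999) would give. The sketch below implements such a percentile by hand, assuming the position is `p / 100 * (n - 1)`; the helper name `percentile` is hypothetical and the interpolation rule is an assumption that happens to reproduce the reported quartiles.

```python
def percentile(sorted_values, p):
    """Percentile with linear interpolation between the two closest ranks."""
    position = (p / 100) * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


# Value counts for 총중량최대 as reported in the frequency table.
value_counts = {999999: 34, 15999: 4, 9999: 2, 4500: 2, 1360: 2, 1361: 2}
values = sorted(v for v, count in value_counts.items() for _ in range(count))

q1 = percentile(values, 25)   # falls between 15999 and 999999
q3 = percentile(values, 75)
iqr = q3 - q1

print(q1, q3, iqr)
```

With 34 of the 46 rows at 999999, every quantile from the median upward collapses onto that value, which is also why the median absolute deviation is 0.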

적재량최소 (minimum payload)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 4
Median length: 1
Mean length: 1.7826087
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value: Count (Frequency %)
0: 34 (73.9%)
8000: 12 (26.1%)

Length

Histogram of lengths of the category

Common Values (Plot)

Value: Count (Frequency %)
0: 34 (73.9%)
8000: 12 (26.1%)

적재량최대 (maximum payload)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 999999
2nd row: 999999
3rd row: 999999
4th row: 999999
5th row: 999999

Common Values

Value: Count (Frequency %)
999999: 46 (100.0%)

Length

Histogram of lengths of the category

Common Values (Plot)

Value: Count (Frequency %)
999999: 46 (100.0%)

안내 (guidance message)
Categorical

HIGH CORRELATION

Distinct: 8
Distinct (%): 17.4%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B

Length

Max length: 54
Median length: 42
Mean length: 23.021739
Min length: 12

Unique

Unique: 3
Unique (%): 6.5%

Sample

1st row: 제작일자 오류입니다. 최고속도제한장치 설치대상
2nd row: 속도제한장치 설치 대상(초소형전기차 80km/h, 저속전기차 60km/h)
3rd row: 속도제한장치 설치 대상
4th row: 속도제한장치 설치 대상 (110km/h)
5th row: 속도제한장치 설치 대상 (110km/h)

Common Values

Value: Count (Frequency %)
속도제한장치 설치 대상 (90km/h) [Subject to speed limiter installation (90 km/h)]: 20 (43.5%)
속도제한장치 설치/미설치 대상 [Subject to speed limiter installation/non-installation]: 8 (17.4%)
속도제한장치 설치 대상 (110km/h) [Subject to speed limiter installation (110 km/h)]: 7 (15.2%)
제작일자 오류입니다. 최고속도제한장치 설치대상 [Manufacture date error. Subject to maximum-speed limiter installation]: 4 (8.7%)
속도제한장치 설치 대상(초소형전기차 80km/h, 저속전기차 60km/h) [Subject to speed limiter installation (micro electric vehicles 80 km/h, low-speed electric vehicles 60 km/h)]: 4 (8.7%)
속도제한장치 설치 대상 [Subject to speed limiter installation]: 1 (2.2%)
최고속도제한장치 설치 대상 여부 확인 필요 [Need to verify whether subject to maximum-speed limiter installation]: 1 (2.2%)
제작일자 오류입니다. 속도제한장치 설치 대상(초소형전기차 80km/h, 저속전기차 60km/h) [Manufacture date error. Subject to speed limiter installation (micro electric vehicles 80 km/h, low-speed electric vehicles 60 km/h)]: 1 (2.2%)

Length

Histogram of lengths of the category

Common Values (Plot)

Value: Count (Frequency %)
속도제한장치: 41 (21.7%)
대상: 37 (19.6%)
설치: 34 (18.0%)
90km/h: 20 (10.6%)
설치/미설치: 8 (4.2%)
110km/h: 7 (3.7%)
대상(초소형전기차: 5 (2.6%)
60km/h: 5 (2.6%)
저속전기차: 5 (2.6%)
80km/h: 5 (2.6%)
Other values (7): 22 (11.6%)

등록일 (registration date)
Date

Distinct: 4
Distinct (%): 8.7%
Missing: 0
Missing (%): 0.0%
Memory size: 500.0 B
Minimum: 2016-09-27 00:00:00
Maximum: 2020-04-23 00:00:00
Histogram with fixed size bins (bins=4)

Interactions

[Figure: interaction plots, 4 panels]

Correlations

Variable | 적용일 | 만료일 | 기준제작일자시작일 | 기준제작일자종료일 | 차종승용 | 용도 | 총중량최소 | 총중량최대 | 적재량최소 | 안내 | 등록일
적용일 | 1.000 | 0.000 | 0.714 | NaN | NaN | NaN | 0.000 | 0.885 | 0.000 | 0.837 | 0.771
만료일 | 0.000 | 1.000 | 0.667 | 0.000 | 0.951 | 0.000 | 0.000 | NaN | 0.157 | 0.686 | 0.295
기준제작일자시작일 | 0.714 | 0.667 | 1.000 | NaN | 0.416 | 0.000 | 0.689 | 0.888 | 0.220 | 0.797 | 0.744
기준제작일자종료일 | NaN | 0.000 | NaN | 1.000 | 0.000 | 0.000 | 0.000 | NaN | 0.000 | 0.000 | 0.704
차종승용 | NaN | 0.951 | 0.416 | 0.000 | 1.000 | 0.000 | 0.656 | 0.321 | 0.411 | 0.687 | 0.120
용도 | NaN | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000
총중량최소 | 0.000 | 0.000 | 0.689 | 0.000 | 0.656 | 0.000 | 1.000 | 0.797 | 0.793 | 0.740 | 0.000
총중량최대 | 0.885 | NaN | 0.888 | NaN | 0.321 | 0.000 | 0.797 | 1.000 | 0.351 | 0.541 | 0.109
적재량최소 | 0.000 | 0.157 | 0.220 | 0.000 | 0.411 | 0.000 | 0.793 | 0.351 | 1.000 | 0.412 | 0.000
안내 | 0.837 | 0.686 | 0.797 | 0.000 | 0.687 | 0.000 | 0.740 | 0.541 | 0.412 | 1.000 | 0.902
등록일 | 0.771 | 0.295 | 0.744 | 0.704 | 0.120 | 0.000 | 0.000 | 0.109 | 0.000 | 0.902 | 1.000

Variable | 안내 | 적재량최소 | 용도 | 연료 | 차종승용
안내 | 1.000 | 0.282 | 0.000 | 1.000 | 0.611
적재량최소 | 0.282 | 1.000 | 0.000 | 1.000 | 0.265
용도 | 0.000 | 0.000 | 1.000 | NaN | 0.000
연료 | 1.000 | 1.000 | NaN | 1.000 | NaN
차종승용 | 0.611 | 0.265 | 0.000 | NaN | 1.000

Variable | 총중량최소 | 총중량최대 | 차종승용 | 용도 | 연료 | 적재량최소 | 안내
총중량최소 | 1.000 | 0.261 | 0.306 | 0.000 | 1.000 | 0.571 | 0.385
총중량최대 | 0.261 | 1.000 | 0.204 | 0.000 | 1.000 | 0.259 | 0.459
차종승용 | 0.306 | 0.204 | 1.000 | 0.000 | NaN | 0.265 | 0.611
용도 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000
연료 | 1.000 | 1.000 | NaN | 0.000 | 1.000 | 1.000 | 1.000
적재량최소 | 0.571 | 0.259 | 0.265 | 0.000 | 1.000 | 1.000 | 0.282
안내 | 0.385 | 0.459 | 0.611 | 0.000 | 1.000 | 0.282 | 1.000
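
The High correlation alerts near the top of the report appear to line up with the last (seven-variable) matrix. The sketch below scans that matrix for off-diagonal entries at or above a cutoff. The cutoff of 0.5 is an assumption (it is not stated in this report), and the function name `high_correlations` is illustrative; with that cutoff the flagged variables and partner counts agree with the alerts.

```python
import math

names = ["총중량최소", "총중량최대", "차종승용", "용도", "연료", "적재량최소", "안내"]
nan = math.nan

# Values transcribed from the seven-variable correlation matrix in the report.
matrix = [
    [1.000, 0.261, 0.306, 0.000, 1.000, 0.571, 0.385],
    [0.261, 1.000, 0.204, 0.000, 1.000, 0.259, 0.459],
    [0.306, 0.204, 1.000, 0.000, nan, 0.265, 0.611],
    [0.000, 0.000, 0.000, 1.000, 0.000, 0.000, 0.000],
    [1.000, 1.000, nan, 0.000, 1.000, 1.000, 1.000],
    [0.571, 0.259, 0.265, 0.000, 1.000, 1.000, 0.282],
    [0.385, 0.459, 0.611, 0.000, 1.000, 0.282, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not taken from the report


def high_correlations(names, matrix, threshold):
    """Map each variable to the other variables it is strongly correlated with."""
    flagged = {}
    for i, row in enumerate(matrix):
        partners = [
            names[j]
            for j, value in enumerate(row)
            if j != i and not math.isnan(value) and abs(value) >= threshold
        ]
        if partners:
            flagged[names[i]] = partners
    return flagged


flagged = high_correlations(names, matrix, THRESHOLD)
for variable, partners in flagged.items():
    print(variable, "->", ", ".join(partners))
```

NaN entries are skipped rather than treated as zero, so a pair such as 차종승용 and 연료 is simply not evaluated.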

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
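
As an illustration of what nullity correlation measures, the sketch below computes a Pearson correlation between presence flags (1 = value present, 0 = `<NA>`) for 적용일 and 만료일. It uses only the 20 rows listed in the Sample section (rows 0-9 and 36-45), not all 46 observations, so the result is not expected to match the heatmap. The helper `pearson` is illustrative.

```python
import math

# Presence flags for the 20 rows in the Sample section: rows 0-9, then rows 36-45.
applied_present = [0] * 10 + [1] * 10                          # 적용일
expired_present = [1, 1, 1, 1, 0, 0, 0, 0, 0, 1] + [0] * 7 + [1, 0, 0]  # 만료일


def pearson(xs, ys):
    """Plain Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


nullity_corr = pearson(applied_present, expired_present)
print(round(nullity_corr, 3))
```

A negative value means that, in these rows, one date tends to be filled in when the other is missing.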

Sample

First rows

Row | 적용일 | 만료일 | 기준제작일자시작일 | 기준제작일자종료일 | 차종승용 | 용도 | 연료 | 총중량최소 | 총중량최대 | 적재량최소 | 적재량최대 | 안내 | 등록일
0 | <NA> | 2021-07-19 | <NA> | <NA> | 승용 | <NA> | 전기 | 0 | 999999 | 0 | 999999 | 제작일자 오류입니다. 최고속도제한장치 설치대상 | 2016-09-27
1 | <NA> | 2017-01-10 | 2010-03-29 | <NA> | <NA> | <NA> | 전기 | 0 | 999999 | 0 | 999999 | 속도제한장치 설치 대상(초소형전기차 80km/h, 저속전기차 60km/h) | 2016-09-27
2 | <NA> | 2021-07-19 | 1996-01-01 | 2006-02-10 | 승합 | 자가용 | <NA> | 10000 | 999999 | 0 | 999999 | 속도제한장치 설치 대상 | 2016-09-27
3 | <NA> | 2017-04-18 | 2006-02-10 | <NA> | 승합 | 관용 | <NA> | 10000 | 999999 | 0 | 999999 | 속도제한장치 설치 대상 (110km/h) | 2016-09-27
4 | <NA> | <NA> | 2006-02-11 | <NA> | 승합 | 자가용 | <NA> | 10000 | 999999 | 0 | 999999 | 속도제한장치 설치 대상 (110km/h) | 2016-09-27
5 | <NA> | <NA> | 2012-08-16 | <NA> | 승합 | 관용 | <NA> | 4501 | 9999 | 0 | 999999 | 속도제한장치 설치 대상 (110km/h) | 2016-09-27
6 | <NA> | <NA> | 2012-08-16 | <NA> | 승합 | 자가용 | <NA> | 4501 | 9999 | 0 | 999999 | 속도제한장치 설치 대상 (110km/h) | 2016-09-27
7 | <NA> | <NA> | 2013-08-16 | <NA> | 승합 | 관용 | <NA> | 0 | 4500 | 0 | 999999 | 속도제한장치 설치 대상 (110km/h) | 2016-09-27
8 | <NA> | <NA> | 2013-08-16 | <NA> | 승합 | 자가용 | <NA> | 0 | 4500 | 0 | 999999 | 속도제한장치 설치 대상 (110km/h) | 2016-09-27
9 | <NA> | 2017-04-18 | 1996-01-01 | 2006-02-09 | 화물 | 관용 | <NA> | 16000 | 999999 | 0 | 999999 | 속도제한장치 설치/미설치 대상 | 2016-09-27

Last rows

Row | 적용일 | 만료일 | 기준제작일자시작일 | 기준제작일자종료일 | 차종승용 | 용도 | 연료 | 총중량최소 | 총중량최대 | 적재량최소 | 적재량최대 | 안내 | 등록일
36 | 2017-04-19 | <NA> | 2006-02-11 | <NA> | 화물 | 관용 | <NA> | 16000 | 999999 | 0 | 999999 | 속도제한장치 설치 대상 (90km/h) | 2017-04-18
37 | 2017-04-19 | <NA> | 2006-02-11 | <NA> | 화물 | 자가용 | <NA> | 0 | 999999 | 8000 | 999999 | 속도제한장치 설치 대상 (90km/h) | 2017-04-18
38 | 2017-04-19 | <NA> | 2006-02-11 | <NA> | 화물 | 자가용 | <NA> | 16000 | 999999 | 0 | 999999 | 속도제한장치 설치 대상 (90km/h) | 2017-04-18
39 | 2017-04-19 | <NA> | 2006-02-11 | <NA> | 특수 | 관용 | <NA> | 0 | 999999 | 8000 | 999999 | 속도제한장치 설치 대상 (90km/h) | 2017-04-18
40 | 2017-04-19 | <NA> | 2006-02-11 | <NA> | 특수 | 관용 | <NA> | 16000 | 999999 | 0 | 999999 | 속도제한장치 설치 대상 (90km/h) | 2017-04-18
41 | 2017-04-19 | <NA> | 2006-02-11 | <NA> | 특수 | 자가용 | <NA> | 0 | 999999 | 8000 | 999999 | 속도제한장치 설치 대상 (90km/h) | 2017-04-18
42 | 2017-04-19 | <NA> | 2006-02-11 | <NA> | 특수 | 자가용 | <NA> | 16000 | 999999 | 0 | 999999 | 속도제한장치 설치 대상 (90km/h) | 2017-04-18
43 | 2017-04-19 | 2021-07-19 | 2010-03-30 | <NA> | <NA> | <NA> | 전기 | 1361 | 999999 | 0 | 999999 | 최고속도제한장치 설치 대상 여부 확인 필요 | 2020-04-23
44 | 2021-07-20 | <NA> | <NA> | <NA> | <NA> | <NA> | 전기 | 0 | 1361 | 0 | 999999 | 제작일자 오류입니다. 속도제한장치 설치 대상(초소형전기차 80km/h, 저속전기차 60km/h) | 2016-09-27
45 | 2021-07-20 | <NA> | 2010-03-30 | <NA> | <NA> | <NA> | 전기 | 0 | 1361 | 0 | 999999 | 속도제한장치 설치 대상(초소형전기차 80km/h, 저속전기차 60km/h) | 2017-04-18