Overview

Dataset statistics

Number of variables5
Number of observations300
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.8 KiB
Average record size in memory40.4 B

Variable types

Text1
Categorical3
Boolean1

Dataset

Description자동차관리법 및 자동차종합검사 시행등에 관한 규칙에 따라 한국교통안전공단(KOTSA)에서 관리하는 자동차검사 자료입니다.
Author한국교통안전공단
URLhttps://www.data.go.kr/data/15088045/fileData.do

Alerts

사용여부 has constant value ""Constant
적용법령 is highly overall correlated with 적용일자 and 1 other fieldsHigh correlation
적용일자 is highly overall correlated with 적용법령 and 1 other fieldsHigh correlation
등록일시 is highly overall correlated with 적용법령 and 1 other fieldsHigh correlation
적용법령 is highly imbalanced (51.2%)Imbalance

Reproduction

Analysis started2023-12-12 19:54:10.343541
Analysis finished2023-12-12 19:54:10.745329
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct252
Distinct (%)84.0%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-13T04:54:11.076333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length13.903333
Min length7

Characters and Unicode

Total characters4171
Distinct characters163
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique215 ?
Unique (%)71.7%

Sample

1st row경기도 용인시 처인구 남사읍 봉무리
2nd row경기도 용인시 처인구 남사읍 봉무리
3rd row경기도 용인시 처인구 남사읍 북리
4th row경기도 용인시 처인구 남사읍 북리
5th row경기도 용인시 처인구 남사읍 북리
ValueCountFrequency (%)
인천광역시 143
 
14.2%
광주광역시 72
 
7.1%
중구 52
 
5.2%
동구 41
 
4.1%
서구 39
 
3.9%
경기도 37
 
3.7%
문무대왕면 32
 
3.2%
경주시 32
 
3.2%
경상북도 32
 
3.2%
용인시 32
 
3.2%
Other values (251) 496
49.2%
2023-12-13T04:54:11.607555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
708
17.0%
304
 
7.3%
291
 
7.0%
286
 
6.9%
269
 
6.4%
215
 
5.2%
209
 
5.0%
149
 
3.6%
119
 
2.9%
110
 
2.6%
Other values (153) 1511
36.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3423
82.1%
Space Separator 708
 
17.0%
Decimal Number 40
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
304
 
8.9%
291
 
8.5%
286
 
8.4%
269
 
7.9%
215
 
6.3%
209
 
6.1%
149
 
4.4%
119
 
3.5%
110
 
3.2%
101
 
3.0%
Other values (145) 1370
40.0%
Decimal Number
ValueCountFrequency (%)
2 10
25.0%
1 10
25.0%
3 10
25.0%
4 5
12.5%
5 3
 
7.5%
6 1
 
2.5%
7 1
 
2.5%
Space Separator
ValueCountFrequency (%)
708
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3423
82.1%
Common 748
 
17.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
304
 
8.9%
291
 
8.5%
286
 
8.4%
269
 
7.9%
215
 
6.3%
209
 
6.1%
149
 
4.4%
119
 
3.5%
110
 
3.2%
101
 
3.0%
Other values (145) 1370
40.0%
Common
ValueCountFrequency (%)
708
94.7%
2 10
 
1.3%
1 10
 
1.3%
3 10
 
1.3%
4 5
 
0.7%
5 3
 
0.4%
6 1
 
0.1%
7 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3423
82.1%
ASCII 748
 
17.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
708
94.7%
2 10
 
1.3%
1 10
 
1.3%
3 10
 
1.3%
4 5
 
0.7%
5 3
 
0.4%
6 1
 
0.1%
7 1
 
0.1%
Hangul
ValueCountFrequency (%)
304
 
8.9%
291
 
8.5%
286
 
8.4%
269
 
7.9%
215
 
6.3%
209
 
6.1%
149
 
4.4%
119
 
3.5%
110
 
3.2%
101
 
3.0%
Other values (145) 1370
40.0%

적용법령
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
대기환경보전법
251 
대기관리권역법
37 
수도권특별법
 
12

Length

Max length7
Median length7
Mean length6.96
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대기관리권역법
2nd row수도권특별법
3rd row대기환경보전법
4th row대기관리권역법
5th row수도권특별법

Common Values

ValueCountFrequency (%)
대기환경보전법 251
83.7%
대기관리권역법 37
 
12.3%
수도권특별법 12
 
4.0%

Length

2023-12-13T04:54:11.756030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:54:11.876526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대기환경보전법 251
83.7%
대기관리권역법 37
 
12.3%
수도권특별법 12
 
4.0%

적용일자
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2003-03-01
135 
2006-07-15
71 
2020-07-03
32 
2020-04-03
22 
2006-01-01
 
12
Other values (6)
28 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique3 ?
Unique (%)1.0%

Sample

1st row2020-04-03
2nd row2006-01-01
3rd row2006-05-03
4th row2020-04-03
5th row2006-01-01

Common Values

ValueCountFrequency (%)
2003-03-01 135
45.0%
2006-07-15 71
23.7%
2020-07-03 32
 
10.7%
2020-04-03 22
 
7.3%
2006-01-01 12
 
4.0%
2006-05-03 10
 
3.3%
2008-01-01 8
 
2.7%
2018-07-01 7
 
2.3%
2003-04-01 1
 
0.3%
2018-07-02 1
 
0.3%

Length

2023-12-13T04:54:12.017156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2003-03-01 135
45.0%
2006-07-15 71
23.7%
2020-07-03 32
 
10.7%
2020-04-03 22
 
7.3%
2006-01-01 12
 
4.0%
2006-05-03 10
 
3.3%
2008-01-01 8
 
2.7%
2018-07-01 7
 
2.3%
2003-04-01 1
 
0.3%
2018-07-02 1
 
0.3%

사용여부
Boolean

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size432.0 B
True
300 
ValueCountFrequency (%)
True 300
100.0%
2023-12-13T04:54:12.177754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

등록일시
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2020-04-02
217 
2021-02-22
32 
2021-04-02
32 
2021-07-02
 
12
2021-07-07
 
7

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-02-22
2nd row2021-02-22
3rd row2021-02-22
4th row2021-02-22
5th row2021-02-22

Common Values

ValueCountFrequency (%)
2020-04-02 217
72.3%
2021-02-22 32
 
10.7%
2021-04-02 32
 
10.7%
2021-07-02 12
 
4.0%
2021-07-07 7
 
2.3%

Length

2023-12-13T04:54:12.300700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:54:12.457127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-04-02 217
72.3%
2021-02-22 32
 
10.7%
2021-04-02 32
 
10.7%
2021-07-02 12
 
4.0%
2021-07-07 7
 
2.3%

Correlations

2023-12-13T04:54:12.577984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
적용법령적용일자등록일시
적용법령1.0000.9460.618
적용일자0.9461.0000.913
등록일시0.6180.9131.000
2023-12-13T04:54:12.681505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
적용법령등록일시적용일자
적용법령1.0000.5730.913
등록일시0.5731.0000.788
적용일자0.9130.7881.000
2023-12-13T04:54:13.154769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
적용법령적용일자등록일시
적용법령1.0000.9130.573
적용일자0.9131.0000.788
등록일시0.5730.7881.000

Missing values

2023-12-13T04:54:10.599581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:54:10.700148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

법정동적용법령적용일자사용여부등록일시
0경기도 용인시 처인구 남사읍 봉무리대기관리권역법2020-04-03Y2021-02-22
1경기도 용인시 처인구 남사읍 봉무리수도권특별법2006-01-01Y2021-02-22
2경기도 용인시 처인구 남사읍 북리대기환경보전법2006-05-03Y2021-02-22
3경기도 용인시 처인구 남사읍 북리대기관리권역법2020-04-03Y2021-02-22
4경기도 용인시 처인구 남사읍 북리수도권특별법2006-01-01Y2021-02-22
5경기도 용인시 처인구 남사읍 통삼리대기환경보전법2006-05-03Y2021-02-22
6경기도 용인시 처인구 남사읍 통삼리대기관리권역법2020-04-03Y2021-02-22
7경기도 용인시 처인구 남사읍 통삼리수도권특별법2006-01-01Y2021-02-22
8경기도 용인시 처인구 남사읍 봉명리대기환경보전법2006-05-03Y2021-02-22
9경기도 용인시 처인구 남사읍 봉명리대기관리권역법2020-04-03Y2021-02-22
법정동적용법령적용일자사용여부등록일시
290광주광역시 남구 임암동대기환경보전법2006-07-15Y2020-04-02
291광주광역시 남구 송하동대기환경보전법2006-07-15Y2020-04-02
292광주광역시 남구 양림동대기환경보전법2006-07-15Y2020-04-02
293광주광역시 남구 방림동대기환경보전법2006-07-15Y2020-04-02
294광주광역시 남구 봉선동대기환경보전법2006-07-15Y2020-04-02
295광주광역시 남구 구소동대기환경보전법2006-07-15Y2020-04-02
296광주광역시 남구 양촌동대기환경보전법2006-07-15Y2020-04-02
297광주광역시 남구 도금동대기환경보전법2006-07-15Y2020-04-02
298광주광역시 남구 승촌동대기환경보전법2006-07-15Y2020-04-02
299광주광역시 남구 지석동대기환경보전법2006-07-15Y2020-04-02