Overview

Dataset statistics

Number of variables: 4
Number of observations: 201
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 0.5%
Total size in memory: 6.6 KiB
Average record size in memory: 33.7 B

Variable types

Categorical: 3
Text: 1

Dataset

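Description: Number of electric vehicles subject to the deployment evaluation test, by year. Exact yearly figures for the number of electric vehicles subject to the EV deployment eligibility evaluation test are not held, so the attached data instead cover the vehicles (general passenger cars) confirmed as eligible for subsidies each year after the evaluation test.
Author: 한국환경공단 (Korea Environment Corporation)
URL: https://www.data.go.kr/data/15123033/fileData.do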

Alerts

Dataset has 1 (0.5%) duplicate rows [Duplicates]
차종 is highly overall correlated with 제작사 [High correlation]
제작사 is highly overall correlated with 차종 [High correlation]
차종 is highly imbalanced (58.2%) [Imbalance]
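
The 58.2% imbalance figure for 차종 is consistent with a score of one minus the normalized Shannon entropy of the class counts. The sketch below is an illustration under that assumption, not the profiler's own code; the function name `imbalance_score` is hypothetical, and the counts (184 and 17) are the ones listed for 차종 in this report.

```python
import math


def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts.

    Assumed definition: 0.0 means perfectly balanced classes, values near 1.0
    mean one class dominates.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    # Entropy in bits; empty classes contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


# Counts of 차종 from this report: 일반승용 = 184, 초소형 = 17.
score = imbalance_score([184, 17])
print(f"{score:.1%}")  # prints 58.2%
```

A perfectly balanced two-class column would score 0.0 under this definition.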

Reproduction

Analysis started: 2023-12-12 13:02:00.292642
Analysis finished: 2023-12-12 13:02:00.649411
Duration: 0.36 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연도
Categorical

Distinct: 3
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB
Value counts: 2022 (90), 2021 (73), 2020 (38)

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%
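
The report lists "Distinct" and "Unique" separately. A reading consistent with the numbers is that distinct counts the different values present, while unique counts the values that occur exactly once. The sketch below rebuilds the 연도 column from the counts in this report to show the difference; the interpretation itself is an assumption.

```python
from collections import Counter

# Rebuild the 연도 column from the counts listed in this report.
years = ["2022"] * 90 + ["2021"] * 73 + ["2020"] * 38

counts = Counter(years)
n_distinct = len(counts)                              # different values present
n_unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

print(n_distinct, f"{n_distinct / len(years):.1%}")
print(n_unique, f"{n_unique / len(years):.1%}")
```

Every year appears many times, so the column has three distinct values but no unique ones.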

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value  Count  Frequency (%)
2022   90     44.8%
2021   73     36.3%
2020   38     18.9%
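
The frequency column is each count divided by the 201 observations. A minimal sketch of that arithmetic, using the 연도 counts from the table above:

```python
# Counts of 연도 as listed in this report.
counts = {"2022": 90, "2021": 73, "2020": 38}
total = sum(counts.values())

# Share of all observations, in percent, rounded to one decimal place.
frequencies = {value: round(100 * n / total, 1) for value, n in counts.items()}

for value, pct in frequencies.items():
    print(value, counts[value], f"{pct}%")
```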

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

차종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB
Value counts: 일반승용 (184), 초소형 (17)

Length

Max length: 4
Median length: 4
Mean length: 3.9154229
Min length: 3
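
The length statistics follow directly from the value counts: 184 values of length 4 and 17 of length 3. The sketch below reproduces them, assuming length is measured in Unicode code points of the NFC-normalized string.

```python
import statistics
import unicodedata

# Counts of 차종 as listed in this report.
value_counts = {"일반승용": 184, "초소형": 17}

# One length per observation (assumption: length = number of code points).
lengths = []
for value, n in value_counts.items():
    lengths.extend([len(unicodedata.normalize("NFC", value))] * n)

mean_length = sum(lengths) / len(lengths)
median_length = statistics.median(lengths)

print(max(lengths), median_length, round(mean_length, 7), min(lengths))
```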

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반승용
2nd row: 일반승용
3rd row: 일반승용
4th row: 일반승용
5th row: 일반승용

Common Values

Value     Count  Frequency (%)
일반승용  184    91.5%
초소형    17     8.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

제작사
Categorical

HIGH CORRELATION 

Distinct: 19
Distinct (%): 9.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB
Value counts: 현대자동차 (56), 기아 (31), 테슬라코리아 (26), 스텔란티스코리아 (20), 르노코리아자동차 (12), Other values (14) (56)

Length

Max length: 11
Median length: 8
Mean length: 5.4875622
Min length: 2

Unique

Unique: 3
Unique (%): 1.5%

Sample

1st row: 현대자동차
2nd row: 현대자동차
3rd row: 현대자동차
4th row: 현대자동차
5th row: 현대자동차

Common Values

Value                 Count  Frequency (%)
현대자동차            56     27.9%
기아                  31     15.4%
테슬라코리아          26     12.9%
스텔란티스코리아      20     10.0%
르노코리아자동차      12     6.0%
BMW                   11     5.5%
메르세데스벤츠코리아  8      4.0%
한국지엠              7      3.5%
쎄보모빌리티          7      3.5%
아우디폭스바겐코리아  4      2.0%
Other values (9)      19     9.5%

Length

[Figure: Histogram of lengths of the category]

차량명
Text

Distinct: 138
Distinct (%): 68.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 37
Median length: 25
Mean length: 16.0199
Min length: 3

Characters and Unicode

Total characters: 3220
Distinct characters: 138
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
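
As a small illustration of such character properties, the sketch below tallies the Unicode general category of each character in the first sample value of 차량명, using Python's standard `unicodedata` module. The two-letter codes correspond to the category names used in this report (`Lo` = Other Letter, `Lu` = Uppercase Letter, `Nd` = Decimal Number, `Zs` = Space Separator). The standard library does not expose the script or block properties, so only categories are shown.

```python
import unicodedata
from collections import Counter

# First sample value of 차량명 from this report, normalized to NFC so that
# each Hangul syllable is a single code point.
name = unicodedata.normalize("NFC", "아이오닉5 2WD 롱레인지 20인치")

# General category of every character in the string.
category_counts = Counter(unicodedata.category(ch) for ch in name)

for category, n in category_counts.most_common():
    print(category, n)
```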

Unique

Unique: 83
Unique (%): 41.3%

Sample

1st row: 아이오닉5 2WD 롱레인지 20인치
2nd row: 아이오닉5 2WD 롱레인지 19인치
3rd row: 아이오닉5 2WD 롱레인지 19인치 빌트인 캠 미적용
4th row: 아이오닉5 AWD 롱레인지 20인치
5th row: 아이오닉5 AWD 롱레인지 19인치

Words

Value               Count  Frequency (%)
롱레인지            28     4.6%
model               26     4.2%
2wd                 26     4.2%
19인치              26     4.2%
아이오닉5           21     3.4%
awd                 20     3.3%
스탠다드            17     2.8%
20인치              16     2.6%
peugeot             14     2.3%
ev6                 13     2.1%
Other values (149)  408    66.3%
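
The word-frequency table above lists lowercased tokens such as `2wd` and `awd`, which suggests the values were lowercased and split on whitespace before counting. The sketch below applies that assumed tokenization to the five sample rows of 차량명; the exact preprocessing used by the profiler is not stated in the report.

```python
from collections import Counter

# The five sample rows of 차량명 shown in this report.
sample_names = [
    "아이오닉5 2WD 롱레인지 20인치",
    "아이오닉5 2WD 롱레인지 19인치",
    "아이오닉5 2WD 롱레인지 19인치 빌트인 캠 미적용",
    "아이오닉5 AWD 롱레인지 20인치",
    "아이오닉5 AWD 롱레인지 19인치",
]

# Assumed tokenization: lowercase, then split on whitespace.
word_counts = Counter(
    word for name in sample_names for word in name.lower().split()
)

for word, n in word_counts.most_common(5):
    print(word, n)
```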

Most occurring characters

Value                          Count  Frequency (%)
(space)                        414    12.9%
e                              131    4.1%
2                              100    3.1%
E                              95     3.0%
0                              90     2.8%
o                              86     2.7%
(Hangul character, not shown)  84     2.6%
D                              74     2.3%
W                              65     2.0%
V                              62     1.9%
Other values (128)             2019   62.7%

Most occurring categories

Value              Count  Frequency (%)
Uppercase Letter   824    25.6%
Other Letter       715    22.2%
Lowercase Letter   645    20.0%
Decimal Number     423    13.1%
Space Separator    414    12.9%
Close Punctuation  60     1.9%
Open Punctuation   60     1.9%
Dash Punctuation   40     1.2%
Other Punctuation  37     1.1%
Math Symbol        2      0.1%

Most frequent character per category

Other Letter

Value              Count  Frequency (%)
(not shown)        84     11.7%
(not shown)        51     7.1%
(not shown)        38     5.3%
(not shown)        34     4.8%
(not shown)        33     4.6%
(not shown)        33     4.6%
(not shown)        32     4.5%
(not shown)        30     4.2%
(not shown)        30     4.2%
(not shown)        28     3.9%
Other values (63)  322    45.0%

Uppercase Letter

Value              Count  Frequency (%)
E                  95     11.5%
D                  74     9.0%
W                  65     7.9%
V                  62     7.5%
P                  58     7.0%
S                  57     6.9%
M                  54     6.6%
A                  46     5.6%
C                  42     5.1%
T                  40     4.9%
Other values (16)  231    28.0%

Lowercase Letter

Value              Count  Frequency (%)
e                  131    20.3%
o                  86     13.3%
n                  46     7.1%
l                  45     7.0%
r                  45     7.0%
g                  41     6.4%
i                  39     6.0%
t                  38     5.9%
a                  37     5.7%
d                  35     5.4%
Other values (12)  102    15.8%

Decimal Number

Value  Count  Frequency (%)
2      100    23.6%
0      90     21.3%
1      54     12.8%
3      40     9.5%
5      34     8.0%
6      31     7.3%
9      29     6.9%
4      22     5.2%
8      19     4.5%
7      4      0.9%

Other Punctuation

Value  Count  Frequency (%)
,      31     83.8%
.      6      16.2%

Space Separator

Value    Count  Frequency (%)
(space)  414    100.0%

Close Punctuation

Value  Count  Frequency (%)
)      60     100.0%

Open Punctuation

Value  Count  Frequency (%)
(      60     100.0%

Dash Punctuation

Value  Count  Frequency (%)
-      40     100.0%

Math Symbol

Value  Count  Frequency (%)
+      2      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   1469   45.6%
Common  1036   32.2%
Hangul  715    22.2%

Most frequent character per script

Hangul

Value              Count  Frequency (%)
(not shown)        84     11.7%
(not shown)        51     7.1%
(not shown)        38     5.3%
(not shown)        34     4.8%
(not shown)        33     4.6%
(not shown)        33     4.6%
(not shown)        32     4.5%
(not shown)        30     4.2%
(not shown)        30     4.2%
(not shown)        28     3.9%
Other values (63)  322    45.0%

Latin

Value              Count  Frequency (%)
e                  131    8.9%
E                  95     6.5%
o                  86     5.9%
D                  74     5.0%
W                  65     4.4%
V                  62     4.2%
P                  58     3.9%
S                  57     3.9%
M                  54     3.7%
A                  46     3.1%
Other values (38)  741    50.4%

Common

Value             Count  Frequency (%)
(space)           414    40.0%
2                 100    9.7%
0                 90     8.7%
)                 60     5.8%
(                 60     5.8%
1                 54     5.2%
-                 40     3.9%
3                 40     3.9%
5                 34     3.3%
6                 31     3.0%
Other values (7)  113    10.9%

Most occurring blocks

Value   Count  Frequency (%)
ASCII   2505   77.8%
Hangul  715    22.2%

Most frequent character per block

ASCII

Value              Count  Frequency (%)
(space)            414    16.5%
e                  131    5.2%
2                  100    4.0%
E                  95     3.8%
0                  90     3.6%
o                  86     3.4%
D                  74     3.0%
W                  65     2.6%
V                  62     2.5%
)                  60     2.4%
Other values (55)  1328   53.0%

Hangul

Value              Count  Frequency (%)
(not shown)        84     11.7%
(not shown)        51     7.1%
(not shown)        38     5.3%
(not shown)        34     4.8%
(not shown)        33     4.6%
(not shown)        33     4.6%
(not shown)        32     4.5%
(not shown)        30     4.2%
(not shown)        30     4.2%
(not shown)        28     3.9%
Other values (63)  322    45.0%

Correlations

[Figure: correlation heatmap]

        연도   차종   제작사
연도    1.000  0.048  0.000
차종    0.048  1.000  0.931
제작사  0.000  0.931  1.000

[Figure: correlation heatmap]

        차종   연도   제작사
차종    1.000  0.079  0.862
연도    0.079  1.000  0.000
제작사  0.862  0.000  1.000

[Figure: correlation heatmap]

        연도   차종   제작사
연도    1.000  0.079  0.000
차종    0.079  1.000  0.862
제작사  0.000  0.862  1.000
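
All three variables in these matrices are categorical. One common measure of association between two categorical variables is Cramér's V, derived from the chi-squared statistic of their contingency table. Whether the matrices above use this exact measure, with or without bias correction, is an assumption; the report does not say. The sketch below computes the uncorrected form on two hypothetical 2x2 tables that are not taken from the dataset.

```python
import math


def cramers_v(table):
    """Cramér's V (no bias correction) for a contingency table given as rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 and n > 0 else 0.0


# Hypothetical tables for illustration only.
perfect = cramers_v([[10, 0], [0, 10]])    # one variable determines the other
independent = cramers_v([[5, 5], [5, 5]])  # no association

print(perfect, independent)
```

Values near 1, such as the 0.931 and 0.862 reported between 차종 and 제작사, indicate that knowing one variable largely determines the other.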

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

     연도  차종      제작사      차량명
0    2022  일반승용  현대자동차  아이오닉5 2WD 롱레인지 20인치
1    2022  일반승용  현대자동차  아이오닉5 2WD 롱레인지 19인치
2    2022  일반승용  현대자동차  아이오닉5 2WD 롱레인지 19인치 빌트인 캠 미적용
3    2022  일반승용  현대자동차  아이오닉5 AWD 롱레인지 20인치
4    2022  일반승용  현대자동차  아이오닉5 AWD 롱레인지 19인치
5    2022  일반승용  현대자동차  아이오닉5 2WD 스탠다드 19인치
6    2022  일반승용  현대자동차  G80 Electrified
7    2022  일반승용  현대자동차  아이오닉5 AWD 스탠다드 19인치
8    2022  일반승용  현대자동차  GV60 스탠다드 2WD 19인치
9    2022  일반승용  현대자동차  GV60 스탠다드 AWD 19인치

     연도  차종      제작사                차량명
191  2020  일반승용  스텔란티스코리아      Peugeot e-2008 SUV
192  2020  일반승용  스텔란티스코리아      Peugeot e-208
193  2020  일반승용  아우디폭스바겐코리아  e-tron 55 quattro
194  2020  일반승용  스마트솔루션즈        SMART EV Z
195  2020  초소형    르노코리아자동차      TWIZY
196  2020  초소형    르노코리아자동차      TWIZY(K1J05)
197  2020  초소형    대창모터스            Danigo
198  2020  초소형    마이브                마이브 M1
199  2020  초소형    쎄보모빌리티          CEVO-C
200  2020  초소형    쎄보모빌리티          CEVO-C SE

Duplicate rows

Most frequently occurring

   연도  차종      제작사      차량명                        # duplicates
0  2021  일반승용  현대자동차  아이오닉5 AWD 롱레인지 20인치  2
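
A duplicate row is one whose values match another row in every column. The sketch below finds such rows in a small excerpt built from rows shown in this report, with the duplicated row entered twice. Treating "1 duplicate row" as one group of identical rows is an assumption about how the profiler counts.

```python
from collections import Counter

# Excerpt only: the duplicated row from the table above (entered twice)
# plus two other rows from the sample tables.
rows = [
    ("2021", "일반승용", "현대자동차", "아이오닉5 AWD 롱레인지 20인치"),
    ("2021", "일반승용", "현대자동차", "아이오닉5 AWD 롱레인지 20인치"),
    ("2022", "일반승용", "현대자동차", "G80 Electrified"),
    ("2020", "초소형", "르노코리아자동차", "TWIZY"),
]

row_counts = Counter(rows)
# Keep only rows that occur more than once, with their occurrence counts.
duplicated = {row: n for row, n in row_counts.items() if n > 1}

for row, n in duplicated.items():
    print(row, n)

# Share of the full dataset: one duplicated group out of 201 observations.
print(f"{len(duplicated) / 201:.1%}")
```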