Overview

Dataset statistics

Number of variables12
Number of observations70
Missing cells210
Missing cells (%)25.0%
Duplicate rows35
Duplicate rows (%)50.0%
Total size in memory7.0 KiB
Average record size in memory102.9 B

Variable types

Categorical8
DateTime1
Unsupported3

Dataset

DescriptionSample
Author(주)넥스트이지
URLhttps://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=NXERNTCARGRAD0000000

Alerts

차량변속기명 has constant value ""Constant
Dataset has 35 (50.0%) duplicate rowsDuplicates
차량명 is highly overall correlated with 차량유형명 and 3 other fieldsHigh correlation
차량유형명 is highly overall correlated with 차량명 and 2 other fieldsHigh correlation
차량연료명 is highly overall correlated with 차량명 and 2 other fieldsHigh correlation
차량인원갯수 is highly overall correlated with 차량명 and 1 other fieldsHigh correlation
차량제조사명 is highly overall correlated with 차량명 and 1 other fieldsHigh correlation
차량인원갯수 is highly imbalanced (57.4%)Imbalance
이용자연령대코드 has 70 (100.0%) missing valuesMissing
이용자성별코드 has 70 (100.0%) missing valuesMissing
이용자광역시도코드 has 70 (100.0%) missing valuesMissing
이용자연령대코드 is an unsupported type, check if it needs cleaning or further analysisUnsupported
이용자성별코드 is an unsupported type, check if it needs cleaning or further analysisUnsupported
이용자광역시도코드 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 06:57:20.744332
Analysis finished2023-12-10 06:57:21.420861
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

차량명
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Memory size692.0 B
코나 일렉트릭 EV
12 
아반떼 AD
LF쏘나타
그랜저IG
올 뉴 카니발(9인승)
Other values (16)
34 

Length

Max length13
Median length11
Mean length7.2571429
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowLF쏘나타
2nd row코나 일렉트릭 EV
3rd row아반떼 AD
4th row올 뉴 카니발(9인승)
5th row더 뉴 K5 2세대

Common Values

ValueCountFrequency (%)
코나 일렉트릭 EV 12
17.1%
아반떼 AD 8
 
11.4%
LF쏘나타 8
 
11.4%
그랜저IG 4
 
5.7%
올 뉴 카니발(9인승) 4
 
5.7%
엑센트 4
 
5.7%
쏘렌토 더 마스터 2
 
2.9%
더 뉴 K5 2세대 2
 
2.9%
K5 2세대 2
 
2.9%
K3 2
 
2.9%
Other values (11) 22
31.4%

Length

2023-12-10T15:57:21.490005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
14
 
9.2%
코나 12
 
7.9%
12
 
7.9%
ev 12
 
7.9%
일렉트릭 12
 
7.9%
아반떼 10
 
6.6%
ad 10
 
6.6%
lf쏘나타 8
 
5.3%
6
 
3.9%
k5 6
 
3.9%
Other values (21) 50
32.9%

차량유형명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size692.0 B
중형
32 
SUV/승합
24 
소형
고급
수입
 
2

Length

Max length6
Median length2
Mean length3.3714286
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중형
2nd rowSUV/승합
3rd row중형
4th rowSUV/승합
5th row중형

Common Values

ValueCountFrequency (%)
중형 32
45.7%
SUV/승합 24
34.3%
소형 8
 
11.4%
고급 4
 
5.7%
수입 2
 
2.9%

Length

2023-12-10T15:57:21.632186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:57:21.746285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중형 32
45.7%
suv/승합 24
34.3%
소형 8
 
11.4%
고급 4
 
5.7%
수입 2
 
2.9%
Distinct35
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size692.0 B
Minimum2019-11-25 12:26:33
Maximum2019-12-23 23:32:04
2023-12-10T15:57:21.863907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:57:21.998518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)

평가점수값
Categorical

Distinct5
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size692.0 B
5
32 
4
18 
1
2
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row4
3rd row5
4th row5
5th row5

Common Values

ValueCountFrequency (%)
5 32
45.7%
4 18
25.7%
1 8
 
11.4%
2 6
 
8.6%
3 6
 
8.6%

Length

2023-12-10T15:57:22.144897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:57:22.257187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 32
45.7%
4 18
25.7%
1 8
 
11.4%
2 6
 
8.6%
3 6
 
8.6%

차량연료명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size692.0 B
휘발유
24 
LPG
20 
전기
16 
경유
<NA>
 
2

Length

Max length4
Median length3
Mean length2.6857143
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowLPG
2nd row전기
3rd row휘발유
4th row경유
5th rowLPG

Common Values

ValueCountFrequency (%)
휘발유 24
34.3%
LPG 20
28.6%
전기 16
22.9%
경유 8
 
11.4%
<NA> 2
 
2.9%

Length

2023-12-10T15:57:22.382188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:57:22.519893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
휘발유 24
34.3%
lpg 20
28.6%
전기 16
22.9%
경유 8
 
11.4%
na 2
 
2.9%

차량변속기명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size692.0 B
오토
70 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row오토
2nd row오토
3rd row오토
4th row오토
5th row오토

Common Values

ValueCountFrequency (%)
오토 70
100.0%

Length

2023-12-10T15:57:22.637682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:57:22.736792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
오토 70
100.0%

차량인원갯수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size692.0 B
5
58 
9
 
4
4
 
4
11
 
2
7
 
2

Length

Max length2
Median length1
Mean length1.0285714
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5
2nd row5
3rd row5
4th row9
5th row5

Common Values

ValueCountFrequency (%)
5 58
82.9%
9 4
 
5.7%
4 4
 
5.7%
11 2
 
2.9%
7 2
 
2.9%

Length

2023-12-10T15:57:22.847048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:57:22.968095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 58
82.9%
9 4
 
5.7%
4 4
 
5.7%
11 2
 
2.9%
7 2
 
2.9%

차량제조사명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size692.0 B
현대
44 
기아
20 
르노삼성
 
2
쉐보레
 
2
BMW
 
2

Length

Max length4
Median length2
Mean length2.1142857
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row현대
2nd row현대
3rd row현대
4th row기아
5th row기아

Common Values

ValueCountFrequency (%)
현대 44
62.9%
기아 20
28.6%
르노삼성 2
 
2.9%
쉐보레 2
 
2.9%
BMW 2
 
2.9%

Length

2023-12-10T15:57:23.355156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:57:23.472268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
현대 44
62.9%
기아 20
28.6%
르노삼성 2
 
2.9%
쉐보레 2
 
2.9%
bmw 2
 
2.9%

이용자연령대코드
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing70
Missing (%)100.0%
Memory size762.0 B

이용자성별코드
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing70
Missing (%)100.0%
Memory size762.0 B

이용자광역시도코드
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing70
Missing (%)100.0%
Memory size762.0 B

접속장치명
Categorical

Distinct2
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size692.0 B
PC
36 
MOBILE
34 

Length

Max length6
Median length2
Mean length3.9428571
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPC
2nd rowPC
3rd rowPC
4th rowMOBILE
5th rowMOBILE

Common Values

ValueCountFrequency (%)
PC 36
51.4%
MOBILE 34
48.6%

Length

2023-12-10T15:57:23.611357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:57:23.744117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pc 36
51.4%
mobile 34
48.6%

Correlations

2023-12-10T15:57:23.817122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차량명차량유형명등록일시평가점수값차량연료명차량인원갯수차량제조사명접속장치명
차량명1.0001.0001.0000.8170.9990.9791.0000.635
차량유형명1.0001.0001.0000.7690.7090.7590.8820.079
등록일시1.0001.0001.0001.0001.0001.0001.0001.000
평가점수값0.8170.7691.0001.0000.3360.4670.7380.000
차량연료명0.9990.7091.0000.3361.0000.8640.3340.335
차량인원갯수0.9790.7591.0000.4670.8641.0000.7430.202
차량제조사명1.0000.8821.0000.7380.3340.7431.0000.259
접속장치명0.6350.0791.0000.0000.3350.2020.2591.000
2023-12-10T15:57:23.936171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차량명차량인원갯수접속장치명차량연료명평가점수값차량제조사명차량유형명
차량명1.0000.7970.4770.8380.4940.8680.868
차량인원갯수0.7971.0000.2400.5240.1870.3650.379
접속장치명0.4770.2401.0000.2190.0000.3080.090
차량연료명0.8380.5240.2191.0000.2760.2750.641
평가점수값0.4940.1870.0000.2761.0000.3610.389
차량제조사명0.8680.3650.3080.2750.3611.0000.535
차량유형명0.8680.3790.0900.6410.3890.5351.000
2023-12-10T15:57:24.049330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차량명차량유형명평가점수값차량연료명차량인원갯수차량제조사명접속장치명
차량명1.0000.8680.4940.8380.7970.8680.477
차량유형명0.8681.0000.3890.6410.3790.5350.090
평가점수값0.4940.3891.0000.2760.1870.3610.000
차량연료명0.8380.6410.2761.0000.5240.2750.219
차량인원갯수0.7970.3790.1870.5241.0000.3650.240
차량제조사명0.8680.5350.3610.2750.3651.0000.308
접속장치명0.4770.0900.0000.2190.2400.3081.000

Missing values

2023-12-10T15:57:21.178698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:57:21.353665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

차량명차량유형명등록일시평가점수값차량연료명차량변속기명차량인원갯수차량제조사명이용자연령대코드이용자성별코드이용자광역시도코드접속장치명
0LF쏘나타중형2019-12-17 10:15:232LPG오토5현대<NA><NA><NA>PC
1코나 일렉트릭 EVSUV/승합2019-12-17 10:26:344전기오토5현대<NA><NA><NA>PC
2아반떼 AD중형2019-12-17 14:52:035휘발유오토5현대<NA><NA><NA>PC
3올 뉴 카니발(9인승)SUV/승합2019-12-17 18:08:195경유오토9기아<NA><NA><NA>MOBILE
4더 뉴 K5 2세대중형2019-12-17 18:28:355LPG오토5기아<NA><NA><NA>MOBILE
5아반떼 AD중형2019-12-17 18:29:585LPG오토5현대<NA><NA><NA>MOBILE
6K5 2세대중형2019-12-13 09:22:254LPG오토5기아<NA><NA><NA>MOBILE
7K3중형2019-12-13 09:39:255휘발유오토5기아<NA><NA><NA>PC
8더 뉴 K5중형2019-12-13 21:52:205LPG오토5기아<NA><NA><NA>MOBILE
9SM6중형2019-12-14 12:22:425LPG오토5르노삼성<NA><NA><NA>MOBILE
차량명차량유형명등록일시평가점수값차량연료명차량변속기명차량인원갯수차량제조사명이용자연령대코드이용자성별코드이용자광역시도코드접속장치명
60엑센트소형2019-11-30 04:21:145휘발유오토5현대<NA><NA><NA>PC
61아반떼 AD중형2019-11-30 08:30:165휘발유오토5현대<NA><NA><NA>PC
62올 뉴 카니발(11인승)SUV/승합2019-11-25 12:26:335<NA>오토11기아<NA><NA><NA>MOBILE
63그랜저IG고급2019-12-08 07:18:171휘발유오토5현대<NA><NA><NA>MOBILE
64LF쏘나타중형2019-12-17 10:15:232LPG오토5현대<NA><NA><NA>PC
65코나 일렉트릭 EVSUV/승합2019-12-17 10:26:344전기오토5현대<NA><NA><NA>PC
66아반떼 AD중형2019-12-17 14:52:035휘발유오토5현대<NA><NA><NA>PC
67올 뉴 카니발(9인승)SUV/승합2019-12-17 18:08:195경유오토9기아<NA><NA><NA>MOBILE
68더 뉴 K5 2세대중형2019-12-17 18:28:355LPG오토5기아<NA><NA><NA>MOBILE
69아반떼 AD중형2019-12-17 18:29:585LPG오토5현대<NA><NA><NA>MOBILE

Duplicate rows

Most frequently occurring

차량명차량유형명등록일시평가점수값차량연료명차량변속기명차량인원갯수차량제조사명접속장치명# duplicates
02017 i30중형2019-12-15 19:42:405휘발유오토5현대MOBILE2
1BMW i3수입2019-12-10 14:41:532전기오토4BMWPC2
2K3중형2019-12-13 09:39:255휘발유오토5기아PC2
3K5 2세대중형2019-12-13 09:22:254LPG오토5기아MOBILE2
4LF쏘나타중형2019-12-01 21:30:185LPG오토5현대PC2
5LF쏘나타중형2019-12-02 07:39:564LPG오토5현대MOBILE2
6LF쏘나타중형2019-12-06 09:19:125LPG오토5현대PC2
7LF쏘나타중형2019-12-17 10:15:232LPG오토5현대PC2
8SM6중형2019-12-14 12:22:425LPG오토5르노삼성MOBILE2
9그랜저IG고급2019-12-07 15:23:155휘발유오토5현대PC2