Overview

Dataset statistics

Number of variables3
Number of observations26
Missing cells0
Missing cells (%)0.0%
Duplicate rows3
Duplicate rows (%)11.5%
Total size in memory782.0 B
Average record size in memory30.1 B

Variable types

Categorical1
Numeric1
DateTime1

Dataset

Description서울주택도시공사의 차량 운영현황입니다. 공사가 보유하고 있는 차량의 차량 종류와 차량별 배기량(cc), 등록일자 등을 파악할 수 있는 공공데이터 파일입니다
Author서울주택도시공사
URLhttps://www.data.go.kr/data/15066031/fileData.do

Alerts

Dataset has 3 (11.5%) duplicate rowsDuplicates
배기량(CC) is highly overall correlated with 차종High correlation
차종 is highly overall correlated with 배기량(CC)High correlation

Reproduction

Analysis started2023-12-12 15:56:21.544262
Analysis finished2023-12-12 15:56:21.906527
Duration0.36 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

차종
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)38.5%
Missing0
Missing (%)0.0%
Memory size340.0 B
아이오닉 일렉트릭
10 
코나 일렉트릭
카니발
봉고Ⅲ
 
1
쏘나타 하이브리드
 
1
Other values (5)

Length

Max length10
Median length9
Mean length6.7692308
Min length2

Unique

Unique7 ?
Unique (%)26.9%

Sample

1st row봉고Ⅲ
2nd row쏘나타 하이브리드
3rd rowk5
4th row카니발
5th row그랜드 스타렉스

Common Values

ValueCountFrequency (%)
아이오닉 일렉트릭 10
38.5%
코나 일렉트릭 5
19.2%
카니발 4
 
15.4%
봉고Ⅲ 1
 
3.8%
쏘나타 하이브리드 1
 
3.8%
k5 1
 
3.8%
그랜드 스타렉스 1
 
3.8%
그랜저IG하이브리드 1
 
3.8%
레스타 1
 
3.8%
유니버스 1
 
3.8%

Length

2023-12-13T00:56:21.998345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:56:22.152878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일렉트릭 15
34.9%
아이오닉 10
23.3%
코나 5
 
11.6%
카니발 4
 
9.3%
봉고ⅲ 1
 
2.3%
쏘나타 1
 
2.3%
하이브리드 1
 
2.3%
k5 1
 
2.3%
그랜드 1
 
2.3%
스타렉스 1
 
2.3%
Other values (3) 3
 
7.0%

배기량(CC)
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)34.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1477.3462
Minimum78
Maximum12742
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2023-12-13T00:56:22.315443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum78
5-th percentile78
Q178
median180
Q32199
95-th percentile3484.75
Maximum12742
Range12664
Interquartile range (IQR)2121

Descriptive statistics

Standard deviation2575.7009
Coefficient of variation (CV)1.7434647
Kurtosis15.312484
Mean1477.3462
Median Absolute Deviation (MAD)102
Skewness3.5714376
Sum38411
Variance6634235.1
MonotonicityNot monotonic
2023-12-13T00:56:22.471008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
78 10
38.5%
180 5
19.2%
2199 4
 
15.4%
1999 2
 
7.7%
2539 1
 
3.8%
2497 1
 
3.8%
2359 1
 
3.8%
3800 1
 
3.8%
12742 1
 
3.8%
ValueCountFrequency (%)
78 10
38.5%
180 5
19.2%
1999 2
 
7.7%
2199 4
 
15.4%
2359 1
 
3.8%
2497 1
 
3.8%
2539 1
 
3.8%
3800 1
 
3.8%
12742 1
 
3.8%
ValueCountFrequency (%)
12742 1
 
3.8%
3800 1
 
3.8%
2539 1
 
3.8%
2497 1
 
3.8%
2359 1
 
3.8%
2199 4
 
15.4%
1999 2
 
7.7%
180 5
19.2%
78 10
38.5%
Distinct14
Distinct (%)53.8%
Missing0
Missing (%)0.0%
Memory size340.0 B
Minimum2011-03-11 00:00:00
Maximum2021-04-06 00:00:00
2023-12-13T00:56:22.629957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:56:22.760124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)

Interactions

2023-12-13T00:56:21.651675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:56:22.862259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차종배기량(CC)등록일자
차종1.0001.0001.000
배기량(CC)1.0001.0001.000
등록일자1.0001.0001.000
2023-12-13T00:56:22.977857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
배기량(CC)차종
배기량(CC)1.0000.853
차종0.8531.000

Missing values

2023-12-13T00:56:21.775423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:56:21.862432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

차종배기량(CC)등록일자
0봉고Ⅲ25392011-03-11
1쏘나타 하이브리드19992013-08-27
2k519992014-01-07
3카니발21992015-01-29
4그랜드 스타렉스24972015-03-27
5카니발21992017-03-08
6아이오닉 일렉트릭782017-09-14
7그랜저IG하이브리드23592018-02-09
8아이오닉 일렉트릭782018-11-08
9아이오닉 일렉트릭782018-11-08
차종배기량(CC)등록일자
16아이오닉 일렉트릭782018-11-08
17레스타38002019-04-22
18카니발21992020-04-24
19카니발21992020-04-24
20코나 일렉트릭1802020-06-23
21코나 일렉트릭1802020-06-23
22코나 일렉트릭1802020-06-23
23코나 일렉트릭1802020-06-23
24코나 일렉트릭1802020-12-09
25유니버스127422021-04-06

Duplicate rows

Most frequently occurring

차종배기량(CC)등록일자# duplicates
0아이오닉 일렉트릭782018-11-089
2코나 일렉트릭1802020-06-234
1카니발21992020-04-242