Overview

Dataset statistics

Number of variables9
Number of observations108
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.0 KiB
Average record size in memory76.2 B

Variable types

Numeric2
Text1
Categorical6

Dataset

Description화성도시공사의 유종별 버스 차종 현황입니다. 버스 차종별 차량명, 연식, 유종, 배기량, 승차인원 등을 확인할 수 있습니다.
Author화성도시공사
URLhttps://www.data.go.kr/data/15126971/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
승차인원 is highly overall correlated with 연번 and 5 other fieldsHigh correlation
연식 is highly overall correlated with 연번 and 5 other fieldsHigh correlation
차랑명 is highly overall correlated with 연번 and 5 other fieldsHigh correlation
유종 is highly overall correlated with 연번 and 4 other fieldsHigh correlation
구분 is highly overall correlated with 연번 and 4 other fieldsHigh correlation
연번 is highly overall correlated with 차랑명 and 4 other fieldsHigh correlation
배기량 is highly overall correlated with 차랑명 and 4 other fieldsHigh correlation
연번 has unique valuesUnique
차량번호 has unique valuesUnique

Reproduction

Analysis started2024-03-14 12:03:18.633990
Analysis finished2024-03-14 12:03:20.804296
Duration2.17 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct108
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54.5
Minimum1
Maximum108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-03-14T21:03:21.024274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.35
Q127.75
median54.5
Q381.25
95-th percentile102.65
Maximum108
Range107
Interquartile range (IQR)53.5

Descriptive statistics

Standard deviation31.32092
Coefficient of variation (CV)0.57469577
Kurtosis-1.2
Mean54.5
Median Absolute Deviation (MAD)27
Skewness0
Sum5886
Variance981
MonotonicityStrictly increasing
2024-03-14T21:03:21.479815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
70 1
 
0.9%
81 1
 
0.9%
80 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
77 1
 
0.9%
76 1
 
0.9%
75 1
 
0.9%
74 1
 
0.9%
Other values (98) 98
90.7%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
108 1
0.9%
107 1
0.9%
106 1
0.9%
105 1
0.9%
104 1
0.9%
103 1
0.9%
102 1
0.9%
101 1
0.9%
100 1
0.9%
99 1
0.9%

차량번호
Text

UNIQUE 

Distinct108
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size992.0 B
2024-03-14T21:03:22.586957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length9
Min length9

Characters and Unicode

Total characters972
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)100.0%

Sample

1st row경기76아4701
2nd row경기76아4702
3rd row경기76아4703
4th row경기76아1454
5th row경기76아4700
ValueCountFrequency (%)
경기76아4701 1
 
0.9%
경기76아4711 1
 
0.9%
경기76아4715 1
 
0.9%
경기76아4736 1
 
0.9%
경기76아4735 1
 
0.9%
경기76아4722 1
 
0.9%
경기76아4721 1
 
0.9%
경기76아4720 1
 
0.9%
경기76아4719 1
 
0.9%
경기76아4718 1
 
0.9%
Other values (98) 98
90.7%
2024-03-14T21:03:24.106298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7 165
17.0%
6 118
12.1%
108
11.1%
108
11.1%
108
11.1%
4 98
10.1%
1 77
7.9%
5 54
 
5.6%
0 52
 
5.3%
2 33
 
3.4%
Other values (3) 51
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 648
66.7%
Other Letter 324
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
7 165
25.5%
6 118
18.2%
4 98
15.1%
1 77
11.9%
5 54
 
8.3%
0 52
 
8.0%
2 33
 
5.1%
3 24
 
3.7%
8 16
 
2.5%
9 11
 
1.7%
Other Letter
ValueCountFrequency (%)
108
33.3%
108
33.3%
108
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common 648
66.7%
Hangul 324
33.3%

Most frequent character per script

Common
ValueCountFrequency (%)
7 165
25.5%
6 118
18.2%
4 98
15.1%
1 77
11.9%
5 54
 
8.3%
0 52
 
8.0%
2 33
 
5.1%
3 24
 
3.7%
8 16
 
2.5%
9 11
 
1.7%
Hangul
ValueCountFrequency (%)
108
33.3%
108
33.3%
108
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 648
66.7%
Hangul 324
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7 165
25.5%
6 118
18.2%
4 98
15.1%
1 77
11.9%
5 54
 
8.3%
0 52
 
8.0%
2 33
 
5.1%
3 24
 
3.7%
8 16
 
2.5%
9 11
 
1.7%
Hangul
ValueCountFrequency (%)
108
33.3%
108
33.3%
108
33.3%

차랑명
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size992.0 B
뉴카운티
38 
현대그린시티
36 
브이버스60
20 
BS090
브이버스90N

Length

Max length7
Median length6
Mean length5.2407407
Min length3

Unique

Unique1 ?
Unique (%)0.9%

Sample

1st row현대그린시티
2nd row현대그린시티
3rd row현대그린시티
4th row현대그린시티
5th row현대그린시티

Common Values

ValueCountFrequency (%)
뉴카운티 38
35.2%
현대그린시티 36
33.3%
브이버스60 20
18.5%
BS090 8
 
7.4%
브이버스90N 5
 
4.6%
쏠라티 1
 
0.9%

Length

2024-03-14T21:03:24.558535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T21:03:24.838543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
뉴카운티 38
35.2%
현대그린시티 36
33.3%
브이버스60 20
18.5%
bs090 8
 
7.4%
브이버스90n 5
 
4.6%
쏠라티 1
 
0.9%

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size992.0 B
중형승합
67 
대형승합
41 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대형승합
2nd row대형승합
3rd row대형승합
4th row대형승합
5th row대형승합

Common Values

ValueCountFrequency (%)
중형승합 67
62.0%
대형승합 41
38.0%

Length

2024-03-14T21:03:25.042346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T21:03:25.205394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중형승합 67
62.0%
대형승합 41
38.0%

연식
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size992.0 B
2020
56 
2019
27 
2021
20 
2022
 
5

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2019
5th row2020

Common Values

ValueCountFrequency (%)
2020 56
51.9%
2019 27
25.0%
2021 20
 
18.5%
2022 5
 
4.6%

Length

2024-03-14T21:03:25.381335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T21:03:25.552683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 56
51.9%
2019 27
25.0%
2021 20
 
18.5%
2022 5
 
4.6%

유종
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size992.0 B
경유
83 
전기
25 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경유
2nd row경유
3rd row경유
4th row경유
5th row경유

Common Values

ValueCountFrequency (%)
경유 83
76.9%
전기 25
 
23.1%

Length

2024-03-14T21:03:25.767578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T21:03:25.996841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경유 83
76.9%
전기 25
 
23.1%

배기량
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4017.3611
Minimum288
Maximum6299
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-03-14T21:03:26.140860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum288
5-th percentile288
Q13933
median3933
Q36299
95-th percentile6299
Maximum6299
Range6011
Interquartile range (IQR)2366

Descriptive statistics

Standard deviation2286.9537
Coefficient of variation (CV)0.56926765
Kurtosis-1.0072812
Mean4017.3611
Median Absolute Deviation (MAD)2366
Skewness-0.62621551
Sum433875
Variance5230157.3
MonotonicityNot monotonic
2024-03-14T21:03:26.322029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
3933 38
35.2%
6299 36
33.3%
288 20
18.5%
5890 8
 
7.4%
456 5
 
4.6%
2497 1
 
0.9%
ValueCountFrequency (%)
288 20
18.5%
456 5
 
4.6%
2497 1
 
0.9%
3933 38
35.2%
5890 8
 
7.4%
6299 36
33.3%
ValueCountFrequency (%)
6299 36
33.3%
5890 8
 
7.4%
3933 38
35.2%
2497 1
 
0.9%
456 5
 
4.6%
288 20
18.5%

승차인원
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size992.0 B
15인승
58 
25인승
44 
45인승
 
5
14인승
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)0.9%

Sample

1st row25인승
2nd row25인승
3rd row25인승
4th row25인승
5th row25인승

Common Values

ValueCountFrequency (%)
15인승 58
53.7%
25인승 44
40.7%
45인승 5
 
4.6%
14인승 1
 
0.9%

Length

2024-03-14T21:03:26.525206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T21:03:26.699780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
15인승 58
53.7%
25인승 44
40.7%
45인승 5
 
4.6%
14인승 1
 
0.9%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size992.0 B
2023-12-31
108 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-12-31
2nd row2023-12-31
3rd row2023-12-31
4th row2023-12-31
5th row2023-12-31

Common Values

ValueCountFrequency (%)
2023-12-31 108
100.0%

Length

2024-03-14T21:03:26.884741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T21:03:27.131593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-12-31 108
100.0%

Interactions

2024-03-14T21:03:19.602723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T21:03:19.113976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T21:03:19.844130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T21:03:19.352968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T21:03:27.236480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번차랑명구분연식유종배기량승차인원
연번1.0000.8970.9980.8940.9740.8090.826
차랑명0.8971.0001.0000.9821.0001.0001.000
구분0.9981.0001.0000.7490.2390.9650.975
연식0.8940.9820.7491.0001.0000.7810.952
유종0.9741.0000.2391.0001.0001.0000.754
배기량0.8091.0000.9650.7811.0001.0001.000
승차인원0.8261.0000.9750.9520.7541.0001.000
2024-03-14T21:03:27.632912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
승차인원연식차랑명유종구분
승차인원1.0000.7060.9900.5410.849
연식0.7061.0000.9010.9910.537
차랑명0.9900.9011.0000.9810.981
유종0.5410.9910.9811.0000.153
구분0.8490.5370.9810.1531.000
2024-03-14T21:03:27.906601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번배기량차랑명구분연식유종승차인원
연번1.000-0.4560.7360.9280.7480.8270.641
배기량-0.4561.0000.9900.7500.6790.9910.838
차랑명0.7360.9901.0000.9810.9010.9810.990
구분0.9280.7500.9811.0000.5370.1530.849
연식0.7480.6790.9010.5371.0000.9910.706
유종0.8270.9910.9810.1530.9911.0000.541
승차인원0.6410.8380.9900.8490.7060.5411.000

Missing values

2024-03-14T21:03:20.192621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T21:03:20.635643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번차량번호차랑명구분연식유종배기량승차인원데이터기준일자
01경기76아4701현대그린시티대형승합2020경유629925인승2023-12-31
12경기76아4702현대그린시티대형승합2020경유629925인승2023-12-31
23경기76아4703현대그린시티대형승합2020경유629925인승2023-12-31
34경기76아1454현대그린시티대형승합2019경유629925인승2023-12-31
45경기76아4700현대그린시티대형승합2020경유629925인승2023-12-31
56경기76아4725현대그린시티대형승합2020경유629925인승2023-12-31
67경기76아4723현대그린시티대형승합2020경유629925인승2023-12-31
78경기76아4724현대그린시티대형승합2020경유629925인승2023-12-31
89경기76아1559현대그린시티대형승합2020경유629925인승2023-12-31
910경기76아1416현대그린시티대형승합2019경유629925인승2023-12-31
연번차량번호차랑명구분연식유종배기량승차인원데이터기준일자
9899경기76아4743뉴카운티중형승합2020경유393315인승2023-12-31
99100경기76아4729뉴카운티중형승합2020경유393315인승2023-12-31
100101경기76아1409BS090중형승합2019경유589025인승2023-12-31
101102경기76아1410BS090중형승합2019경유589025인승2023-12-31
102103경기76아1411BS090중형승합2019경유589025인승2023-12-31
103104경기76아1412BS090중형승합2019경유589025인승2023-12-31
104105경기76아1413BS090중형승합2019경유589025인승2023-12-31
105106경기76아1414BS090중형승합2019경유589025인승2023-12-31
106107경기76아1415BS090중형승합2019경유589025인승2023-12-31
107108경기76아1408BS090중형승합2019경유589025인승2023-12-31