Overview

Dataset statistics

Number of variables5
Number of observations84
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.6 KiB
Average record size in memory43.6 B

Variable types

Numeric2
Categorical3

Dataset

Description매년 한국철도공사에서 발행하는 철도통계연보에 수록된 일반열차 여객 수송실적으로 구분,표종류,기관차종,단위,인원 항목을 제공합니다.
URLhttps://www.data.go.kr/data/3050961/fileData.do

Alerts

단위 has constant value ""Constant
인원 has unique valuesUnique
인원 has 1 (1.2%) zerosZeros

Reproduction

Analysis started2023-12-12 19:01:41.999662
Analysis finished2023-12-12 19:01:43.132227
Duration1.13 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct7
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019
Minimum2016
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size888.0 B
2023-12-13T04:01:43.208943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2016
5-th percentile2016
Q12017
median2019
Q32021
95-th percentile2022
Maximum2022
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.0120121
Coefficient of variation (CV)0.00099653894
Kurtosis-1.2527477
Mean2019
Median Absolute Deviation (MAD)2
Skewness0
Sum169596
Variance4.0481928
MonotonicityIncreasing
2023-12-13T04:01:43.369681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
2016 12
14.3%
2017 12
14.3%
2018 12
14.3%
2019 12
14.3%
2020 12
14.3%
2021 12
14.3%
2022 12
14.3%
ValueCountFrequency (%)
2016 12
14.3%
2017 12
14.3%
2018 12
14.3%
2019 12
14.3%
2020 12
14.3%
2021 12
14.3%
2022 12
14.3%
ValueCountFrequency (%)
2022 12
14.3%
2021 12
14.3%
2020 12
14.3%
2019 12
14.3%
2018 12
14.3%
2017 12
14.3%
2016 12
14.3%

구분
Categorical

Distinct3
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size804.0 B
정기
28 
비정기
28 
기타
28 

Length

Max length5
Median length4
Mean length3.6666667
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정기
2nd row정기
3rd row정기
4th row정기
5th row 비정기

Common Values

ValueCountFrequency (%)
정기 28
33.3%
비정기 28
33.3%
기타 28
33.3%

Length

2023-12-13T04:01:43.525922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:01:43.689527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정기 28
33.3%
비정기 28
33.3%
기타 28
33.3%

열차종
Categorical

Distinct5
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Memory size804.0 B
새마을
21 
무궁화
21 
통근열차
21 
KTX
14 
고속철도

Length

Max length6
Median length5
Mean length5.3333333
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row KTX
2nd row 새마을
3rd row 무궁화
4th row 통근열차
5th row KTX

Common Values

ValueCountFrequency (%)
새마을 21
25.0%
무궁화 21
25.0%
통근열차 21
25.0%
KTX 14
16.7%
고속철도 7
 
8.3%

Length

2023-12-13T04:01:43.819596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:01:43.959691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
새마을 21
25.0%
무궁화 21
25.0%
통근열차 21
25.0%
ktx 14
16.7%
고속철도 7
 
8.3%

단위
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size804.0 B
84 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
84
100.0%

Length

2023-12-13T04:01:44.125699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:01:44.265815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
84
100.0%

인원
Real number (ℝ)

UNIQUE  ZEROS 

Distinct84
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9974511.8
Minimum0
Maximum66569254
Zeros1
Zeros (%)1.2%
Negative0
Negative (%)0.0%
Memory size888.0 B
2023-12-13T04:01:44.405578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile19.2
Q147584.75
median495731
Q36454385
95-th percentile55799695
Maximum66569254
Range66569254
Interquartile range (IQR)6406800.2

Descriptive statistics

Standard deviation18938696
Coefficient of variation (CV)1.898709
Kurtosis2.4881182
Mean9974511.8
Median Absolute Deviation (MAD)494881
Skewness1.9854086
Sum8.3785899 × 108
Variance3.5867419 × 1014
MonotonicityNot monotonic
2023-12-13T04:01:44.590593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3106160 1
 
1.2%
30218912 1
 
1.2%
4737546 1
 
1.2%
424514 1
 
1.2%
3426750 1
 
1.2%
18 1
 
1.2%
67704 1
 
1.2%
20031 1
 
1.2%
278026 1
 
1.2%
97430 1
 
1.2%
Other values (74) 74
88.1%
ValueCountFrequency (%)
0 1
1.2%
2 1
1.2%
4 1
1.2%
16 1
1.2%
18 1
1.2%
26 1
1.2%
96 1
1.2%
1604 1
1.2%
7900 1
1.2%
12219 1
1.2%
ValueCountFrequency (%)
66569254 1
1.2%
66467183 1
1.2%
63111757 1
1.2%
60822555 1
1.2%
56143419 1
1.2%
53851928 1
1.2%
50997104 1
1.2%
50764047 1
1.2%
50763062 1
1.2%
46931814 1
1.2%

Interactions

2023-12-13T04:01:42.398657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:01:42.173055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:01:42.815419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:01:42.296699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:01:44.702381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도구분열차종인원
연도1.0000.0000.0000.000
구분0.0001.0000.3530.628
열차종0.0000.3531.0000.514
인원0.0000.6280.5141.000
2023-12-13T04:01:44.821703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분열차종
구분1.0000.281
열차종0.2811.000
2023-12-13T04:01:44.940397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도인원구분열차종
연도1.000-0.0620.0000.000
인원-0.0621.0000.4840.339
구분0.0000.4841.0000.281
열차종0.0000.3390.2811.000

Missing values

2023-12-13T04:01:42.959638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:01:43.079892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도구분열차종단위인원
02016정기KTX3106160
12016정기새마을359248
22016정기무궁화6642374
32016정기통근열차17366
42016비정기KTX60822555
52016비정기새마을9370394
62016비정기무궁화53851928
72016비정기통근열차472604
82016기타고속철도688413
92016기타새마을51775
연도구분열차종단위인원
742022정기무궁화5121520
752022정기통근열차22620
762022비정기KTX66467183
772022비정기새마을9750233
782022비정기무궁화34938782
792022비정기통근열차175957
802022기타고속철도294640
812022기타새마을16265
822022기타무궁화47821
832022기타통근열차26