Overview

Dataset statistics

Number of variables: 4
Number of observations: 429
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 14.0 KiB
Average record size in memory: 33.3 B

Variable types

Categorical: 3
Numeric: 1

Dataset

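Description: Passenger transport figures for the Seoul metropolitan area electric railway, as recorded in the Railway Statistical Yearbook (철도통계연보) published annually by Korea Railroad Corporation (한국철도공사). The dataset provides four fields: 승차노선 (boarding line), 표종 (ticket type), 인원 (number of passengers), and 단위 (unit).
URL: https://www.data.go.kr/data/3050644/fileData.do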

Alerts

단위 has constant value "" (Constant)
인원 has 146 (34.0%) zeros (Zeros)
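
The two alerts above can be illustrated with a minimal sketch. It is not how ydata-profiling implements its checks: the helper name `column_alerts` and the 5% zero-share threshold are assumptions made for illustration only. The input values are the ten 인원 figures listed for 경부선 in the sample rows of this report.

```python
def column_alerts(values, zero_share_threshold=0.05):
    """Return simple alert labels for one column (hypothetical helper)."""
    alerts = []
    # A column is constant when it holds exactly one distinct value.
    if len(set(values)) == 1:
        alerts.append("Constant")
    # Flag numeric columns whose share of zeros reaches the threshold.
    numeric = [v for v in values if isinstance(v, (int, float))]
    if numeric:
        zero_share = sum(1 for v in numeric if v == 0) / len(numeric)
        if zero_share >= zero_share_threshold:
            alerts.append("Zeros")
    return alerts


# The ten 인원 values shown for 경부선 in the report's sample rows.
passengers = [1235529, 984382, 195836, 0, 22668706,
              377359, 128983605, 1569292, 4202597, 570]
# 단위 shows the same (blank) value in every row.
unit = [""] * len(passengers)

print(column_alerts(passengers))
print(column_alerts(unit))
# Share of zeros over the full column, from the counts in the alert: 146 of 429.
print(f"{146 / 429:.1%}")
```

On the full column the zero share is 146 / 429, which rounds to the 34.0% shown in the alert.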

Reproduction

Analysis started: 2023-12-12 17:41:22.909253
Analysis finished: 2023-12-12 17:41:23.263812
Duration: 0.35 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
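
A report of this kind could be regenerated roughly as sketched below. This is an assumption-laden sketch, not the exact procedure used for this report: the function name `build_profile`, the report title, the local CSV file name in the usage comment, and the `cp949` encoding (common for CSV files from data.go.kr, but not confirmed for this one) are all hypothetical. It relies on `pandas.read_csv` and on `ProfileReport` and `to_file` from ydata-profiling.

```python
def build_profile(csv_path, output_html="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (sketch)."""
    # Imported inside the function so that defining it does not require
    # the third-party packages to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it if the file fails to decode.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Metropolitan rail passenger transport")
    report.to_file(output_html)
    return report


# Usage (hypothetical file name for the CSV downloaded from the URL above):
# build_profile("metropolitan_rail_passengers.csv")
```

Timings and memory figures will differ from run to run, so only the dataset statistics should be expected to match.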

Variables

승차노선
Categorical

Distinct: 33
Distinct (%): 7.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.5 KiB

Value              Count
경부선             13
중앙선             13
경인선             13
수인선             13
장항선             13
Other values (28)  364

Length

Max length: 7
Median length: 6
Mean length: 4.0909091
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경부선
2nd row: 경부선
3rd row: 경부선
4th row: 경부선
5th row: 경부선

Common Values

Value              Count  Frequency (%)
경부선             13     3.0%
중앙선             13     3.0%
경인선             13     3.0%
수인선             13     3.0%
장항선             13     3.0%
경의선             13     3.0%
경원선             13     3.0%
경춘선             13     3.0%
안산선             13     3.0%
과천선             13     3.0%
Other values (23)  299    69.7%

Length

Histogram of lengths of the category
Value              Count  Frequency (%)
경부선             13     3.0%
지하3호선          13     3.0%
용인경전철         13     3.0%
신분당선           13     3.0%
공항철도           13     3.0%
9호선              13     3.0%
김포도시철도       13     3.0%
신림선             13     3.0%
우이신설경전철     13     3.0%
인천2호선          13     3.0%
Other values (23)  299    69.7%

표종
Categorical

Distinct: 13
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.5 KiB

Value             Count
1회권무임         33
1회권어른         33
1회권어린이       33
1회권청소년       33
RF무임            33
Other values (8)  264

Length

Max length: 6
Median length: 5
Mean length: 4.3076923
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1회권무임
2nd row: 1회권어른
3rd row: 1회권어린이
4th row: 1회권청소년
5th row: RF무임

Common Values

Value             Count  Frequency (%)
1회권무임         33     7.7%
1회권어른         33     7.7%
1회권어린이       33     7.7%
1회권청소년       33     7.7%
RF무임            33     7.7%
RF어린이          33     7.7%
RF일반            33     7.7%
RF정기권          33     7.7%
RF청소년          33     7.7%
단체권            33     7.7%
Other values (3)  99     23.1%

Length

Histogram of lengths of the category
Value             Count  Frequency (%)
1회권무임         33     7.7%
1회권어른         33     7.7%
1회권어린이       33     7.7%
1회권청소년       33     7.7%
rf무임            33     7.7%
rf어린이          33     7.7%
rf일반            33     7.7%
rf정기권          33     7.7%
rf청소년          33     7.7%
단체권            33     7.7%
Other values (3)  99     23.1%

인원
Real number (ℝ)

ZEROS 

Distinct: 284
Distinct (%): 66.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2356071
Minimum: 0
Maximum: 1.289836 × 10^8
Zeros: 146
Zeros (%): 34.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.9 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 31258
Q3: 343016
95-th percentile: 11611795
Maximum: 1.289836 × 10^8
Range: 1.289836 × 10^8
Interquartile range (IQR): 343016

Descriptive statistics

Standard deviation: 10150581
Coefficient of variation (CV): 4.3082663
Kurtosis: 80.80222
Mean: 2356071
Median Absolute Deviation (MAD): 31258
Skewness: 8.0973083
Sum: 1.0107545 × 10^9
Variance: 1.030343 × 10^14
Monotonicity: Not monotonic
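
Several of the figures above are tied together by simple identities (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean, IQR = Q3 − Q1, range = maximum − minimum). The sketch below recomputes them from the values printed in this report as a consistency check. It does not touch the underlying data, and the relative tolerance of 1e-6 is an assumption chosen to absorb the report's rounding.

```python
import math

# Figures copied from the report for the 인원 column.
n_rows = 429
reported = {
    "sum": 1.0107545e9,
    "mean": 2356071,
    "std": 10150581,
    "variance": 1.030343e14,
    "cv": 4.3082663,
    "q1": 0,
    "q3": 343016,
    "iqr": 343016,
    "min": 0,
    "max": 1.289836e8,
    "range": 1.289836e8,
}

# Each derived statistic recomputed from the other reported figures.
derived = {
    "mean": reported["sum"] / n_rows,
    "variance": reported["std"] ** 2,
    "cv": reported["std"] / reported["mean"],
    "iqr": reported["q3"] - reported["q1"],
    "range": reported["max"] - reported["min"],
}

for name, value in derived.items():
    # The report rounds its figures, so compare with a relative tolerance.
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.7g} reported={reported[name]:.7g} match={ok}")
```

A CV above 4 together with a skewness of about 8 indicates a heavily right-skewed column, in which a handful of very large counts dominate the mean.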
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
0                   146    34.0%
1235529             1      0.2%
122372              1      0.2%
955983              1      0.2%
941108              1      0.2%
31190227            1      0.2%
87785               1      0.2%
6840816             1      0.2%
29780               1      0.2%
189442              1      0.2%
Other values (274)  274    63.9%
Minimum 10 values

Value  Count  Frequency (%)
0      146    34.0%
29     1      0.2%
89     1      0.2%
126    1      0.2%
220    1      0.2%
273    1      0.2%
344    1      0.2%
390    1      0.2%
528    1      0.2%
570    1      0.2%

Maximum 10 values

Value      Count  Frequency (%)
128983605  1      0.2%
97578200   1      0.2%
66523337   1      0.2%
56256378   1      0.2%
53985896   1      0.2%
31425695   1      0.2%
31190227   1      0.2%
29173439   1      0.2%
26296569   1      0.2%
25973440   1      0.2%

단위
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.5 KiB

Value  Count
""     429

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

Value  Count  Frequency (%)
""     429    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
""     429    100.0%

Interactions

[Figure: interactions plot]

Correlations

          승차노선  표종      인원
승차노선  1.000     0.000     0.000
표종      0.000     1.000     0.439
인원      0.000     0.439     1.000

          표종      승차노선
표종      1.000     0.000
승차노선  0.000     1.000

          인원      승차노선  표종
인원      1.000     0.000     0.220
승차노선  0.000     1.000     0.000
표종      0.220     0.000     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

     승차노선        표종          인원       단위
0    경부선          1회권무임     1235529
1    경부선          1회권어른     984382
2    경부선          1회권어린이   195836
3    경부선          1회권청소년   0
4    경부선          RF무임        22668706
5    경부선          RF어린이      377359
6    경부선          RF일반        128983605
7    경부선          RF정기권      1569292
8    경부선          RF청소년      4202597
9    경부선          단체권        570

Last rows

     승차노선        표종          인원       단위
419  의정부경전철    1회권청소년   0
420  의정부경전철    RF무임        183907
421  의정부경전철    RF어린이      6889
422  의정부경전철    RF일반        1569351
423  의정부경전철    RF정기권      57285
424  의정부경전철    RF청소년      82145
425  의정부경전철    단체권        0
426  의정부경전철    어른          0
427  의정부경전철    어린이        0
428  의정부경전철    정액권        0