Overview

Dataset statistics

Number of variables: 6
Number of observations: 48
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.6 KiB
Average record size in memory: 55.8 B

Variable types

Numeric: 5
Categorical: 1

Dataset

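Description: 부산교통공사_열차운행실적_20211231 (Busan Transportation Corporation, train operation results, 20211231)
Author: 부산교통공사 (Busan Transportation Corporation)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3057438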

Alerts

계획(횟수) is highly overall correlated with 실적(횟수) and 3 other fields (High correlation)
실적(횟수) is highly overall correlated with 계획(횟수) and 3 other fields (High correlation)
계획(운행거리Km) is highly overall correlated with 계획(횟수) and 3 other fields (High correlation)
실적(운행거리Km) is highly overall correlated with 계획(횟수) and 3 other fields (High correlation)
호선별 is highly overall correlated with 계획(횟수) and 3 other fields (High correlation)

Reproduction

Analysis started: 2023-12-10 16:15:19.185519
Analysis finished: 2023-12-10 16:15:22.471999
Duration: 3.29 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
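
A report of this kind is typically produced by loading the data into a pandas DataFrame and passing it to ydata-profiling's `ProfileReport`. The following is a minimal sketch, not the configuration actually used for this report (that is recorded in config.json). The function name `build_profile`, the CSV path, the output file name, and the `cp949` encoding are assumptions made for illustration.

```python
from pathlib import Path


def build_profile(csv_path: str, html_path: str = "report.html") -> Path:
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported lazily so this module loads even where the heavy
    # dependencies (pandas, ydata-profiling) are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source CSV is encoded as cp949; adjust if it is UTF-8.
    df = pd.read_csv(csv_path, encoding="cp949")

    report = ProfileReport(
        df,
        title="부산교통공사_열차운행실적_20211231",
        # Metadata shown in the report's "Dataset" section.
        dataset={
            "description": "부산교통공사_열차운행실적_20211231",
            "url": "http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3057438",
        },
    )
    report.to_file(html_path)
    return Path(html_path)
```

Calling `build_profile("train_operations.csv")` (a hypothetical file name) would write `report.html` to the working directory.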

Variables

월별 (month)
Real number (ℝ)

Distinct: 12
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.5
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 564.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3.75
Median: 6.5
Q3: 9.25
95-th percentile: 12
Maximum: 12
Range: 11
Interquartile range (IQR): 5.5
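
The quantiles above can be checked by hand, because 월별 contains each month 1 to 12 exactly four times (once per line). The sketch below uses only the Python standard library and assumes that the report interpolates linearly between order statistics, which is what `method="inclusive"` does; the variable names are illustrative.

```python
import statistics

# Each month 1..12 appears once per line (4 lines): 48 observations.
months = sorted(m for m in range(1, 13) for _ in range(4))

# "inclusive" interpolates linearly between neighbouring order statistics.
q1, median, q3 = statistics.quantiles(months, n=4, method="inclusive")

# n=20 yields the 5%, 10%, ..., 95% cut points.
cuts = statistics.quantiles(months, n=20, method="inclusive")
p5, p95 = cuts[0], cuts[-1]

print(q1, median, q3)   # 3.75 6.5 9.25
print(q3 - q1)          # 5.5 (interquartile range)
print(p5, p95)          # 1.0 12.0
```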

Descriptive statistics

Standard deviation: 3.4885832
Coefficient of variation (CV): 0.53670511
Kurtosis: -1.2175129
Mean: 6.5
Median Absolute Deviation (MAD): 3
Skewness: 0
Sum: 312
Variance: 12.170213
Monotonicity: Increasing
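
The descriptive statistics for 월별 can be reproduced the same way. This is a sketch under two assumptions that are consistent with the printed values but not stated in the report: the variance uses the sample (n - 1) denominator, and the kurtosis is the bias-adjusted excess kurtosis. The helper name `sample_excess_kurtosis` is illustrative.

```python
import math
import statistics

months = [m for m in range(1, 13) for _ in range(4)]
n = len(months)

mean = statistics.fmean(months)
variance = statistics.variance(months)   # sample variance, divides by n - 1
std = math.sqrt(variance)
cv = std / mean                          # coefficient of variation
median = statistics.median(months)
mad = statistics.median(abs(x - median) for x in months)


def sample_excess_kurtosis(values):
    """Bias-adjusted excess kurtosis (the usual G2 estimator)."""
    k = len(values)
    mu = statistics.fmean(values)
    m2 = sum((x - mu) ** 2 for x in values) / k
    m4 = sum((x - mu) ** 4 for x in values) / k
    g2 = m4 / m2 ** 2 - 3
    return (k - 1) / ((k - 2) * (k - 3)) * ((k + 1) * g2 + 6)


kurtosis = sample_excess_kurtosis(months)
print(sum(months), mean, mad)            # 312 6.5 3.0
print(round(variance, 6), round(std, 7), round(cv, 8))
print(round(kurtosis, 7))
```

The third central moment is zero because the values are symmetric around 6.5, which matches the reported skewness of 0.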
Histogram with fixed size bins (bins=12)

Common values
Value | Count | Frequency (%)
1 | 4 | 8.3%
2 | 4 | 8.3%
3 | 4 | 8.3%
4 | 4 | 8.3%
5 | 4 | 8.3%
6 | 4 | 8.3%
7 | 4 | 8.3%
8 | 4 | 8.3%
9 | 4 | 8.3%
10 | 4 | 8.3%
Other values (2) | 8 | 16.7%

Minimum 10 values
Value | Count | Frequency (%)
1 | 4 | 8.3%
2 | 4 | 8.3%
3 | 4 | 8.3%
4 | 4 | 8.3%
5 | 4 | 8.3%
6 | 4 | 8.3%
7 | 4 | 8.3%
8 | 4 | 8.3%
9 | 4 | 8.3%
10 | 4 | 8.3%

Maximum 10 values
Value | Count | Frequency (%)
12 | 4 | 8.3%
11 | 4 | 8.3%
10 | 4 | 8.3%
9 | 4 | 8.3%
8 | 4 | 8.3%
7 | 4 | 8.3%
6 | 4 | 8.3%
5 | 4 | 8.3%
4 | 4 | 8.3%
3 | 4 | 8.3%

호선별 (subway line)
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B

1호선: 12
2호선: 12
3호선: 12
4호선: 12

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1호선
2nd row: 2호선
3rd row: 3호선
4th row: 4호선
5th row: 1호선

Common Values

Value | Count | Frequency (%)
1호선 | 12 | 25.0%
2호선 | 12 | 25.0%
3호선 | 12 | 25.0%
4호선 | 12 | 25.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

계획(횟수) (planned number of train runs)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 40
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9949.2708
Minimum: 8348
Maximum: 11144
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 564.0 B

Quantile statistics

Minimum: 8348
5-th percentile: 8971.7
Q1: 9373.5
Median: 9653
Q3: 10692
95-th percentile: 11097.8
Maximum: 11144
Range: 2796
Interquartile range (IQR): 1318.5

Descriptive statistics

Standard deviation: 775.53638
Coefficient of variation (CV): 0.077949067
Kurtosis: -1.2792745
Mean: 9949.2708
Median Absolute Deviation (MAD): 642
Skewness: 0.017209883
Sum: 477565
Variance: 601456.67
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=40)

Common values
Value | Count | Frequency (%)
9116 | 3 | 6.2%
9376 | 3 | 6.2%
10400 | 3 | 6.2%
10812 | 3 | 6.2%
10967 | 1 | 2.1%
9382 | 1 | 2.1%
10486 | 1 | 2.1%
9450 | 1 | 2.1%
9188 | 1 | 2.1%
10578 | 1 | 2.1%
Other values (30) | 30 | 62.5%

Minimum 10 values
Value | Count | Frequency (%)
8348 | 1 | 2.1%
8582 | 1 | 2.1%
8894 | 1 | 2.1%
9116 | 3 | 6.2%
9142 | 1 | 2.1%
9184 | 1 | 2.1%
9188 | 1 | 2.1%
9228 | 1 | 2.1%
9316 | 1 | 2.1%
9366 | 1 | 2.1%

Maximum 10 values
Value | Count | Frequency (%)
11144 | 1 | 2.1%
11129 | 1 | 2.1%
11123 | 1 | 2.1%
11051 | 1 | 2.1%
10967 | 1 | 2.1%
10934 | 1 | 2.1%
10922 | 1 | 2.1%
10812 | 3 | 6.2%
10708 | 1 | 2.1%
10704 | 1 | 2.1%

실적(횟수) (actual number of train runs)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 40
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9948.9583
Minimum: 8348
Maximum: 11144
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 564.0 B

Quantile statistics

Minimum: 8348
5-th percentile: 8971.7
Q1: 9373.5
Median: 9653
Q3: 10692
95-th percentile: 11097.8
Maximum: 11144
Range: 2796
Interquartile range (IQR): 1318.5

Descriptive statistics

Standard deviation: 775.13406
Coefficient of variation (CV): 0.077911077
Kurtosis: -1.279003
Mean: 9948.9583
Median Absolute Deviation (MAD): 642
Skewness: 0.016450093
Sum: 477550
Variance: 600832.81
Monotonicity: Not monotonic
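
The Sum and Mean rows of 계획(횟수) and 실적(횟수) can be compared directly to see how closely actual runs tracked the plan. The short sketch below uses only figures printed in this report; the variable names are illustrative.

```python
# Totals copied from the "Sum" rows of 계획(횟수) and 실적(횟수).
planned_trips = 477_565
actual_trips = 477_550
observations = 48  # "Number of observations" in the overview

shortfall = planned_trips - actual_trips
fulfilment = actual_trips / planned_trips

print(shortfall)                  # 15
print(shortfall / observations)   # 0.3125
print(f"{fulfilment:.5%}")
```

The per-observation shortfall of 0.3125 equals the gap between the two reported means (9949.2708 and 9948.9583), so the Sum and Mean rows are mutually consistent.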
Histogram with fixed size bins (bins=40)

Common values
Value | Count | Frequency (%)
9116 | 3 | 6.2%
9376 | 3 | 6.2%
10400 | 3 | 6.2%
10812 | 3 | 6.2%
10967 | 1 | 2.1%
9382 | 1 | 2.1%
10486 | 1 | 2.1%
9450 | 1 | 2.1%
9188 | 1 | 2.1%
10578 | 1 | 2.1%
Other values (30) | 30 | 62.5%

Minimum 10 values
Value | Count | Frequency (%)
8348 | 1 | 2.1%
8582 | 1 | 2.1%
8894 | 1 | 2.1%
9116 | 3 | 6.2%
9142 | 1 | 2.1%
9184 | 1 | 2.1%
9188 | 1 | 2.1%
9228 | 1 | 2.1%
9316 | 1 | 2.1%
9366 | 1 | 2.1%

Maximum 10 values
Value | Count | Frequency (%)
11144 | 1 | 2.1%
11129 | 1 | 2.1%
11123 | 1 | 2.1%
11051 | 1 | 2.1%
10967 | 1 | 2.1%
10922 | 1 | 2.1%
10919 | 1 | 2.1%
10812 | 3 | 6.2%
10708 | 1 | 2.1%
10704 | 1 | 2.1%

계획(운행거리Km) (planned operating distance, km)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 40
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 285355.84
Minimum: 100176
Maximum: 456143.8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 564.0 B

Quantile statistics

Minimum: 100176
5-th percentile: 109392
Q1: 144682.65
Median: 278608.6
Q3: 428888.8
95-th percentile: 453833.76
Maximum: 456143.8
Range: 355967.8
Interquartile range (IQR): 284206.15

Descriptive statistics

Standard deviation: 149225.86
Coefficient of variation (CV): 0.52294656
Kurtosis: -1.9659764
Mean: 285355.84
Median Absolute Deviation (MAD): 150439.8
Skewness: -0.042359462
Sum: 13697080
Variance: 2.2268356 × 10^10
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=40)

Common values
Value | Count | Frequency (%)
109392.0 | 3 | 6.2%
169705.6 | 3 | 6.2%
442876.4 | 3 | 6.2%
416525.6 | 3 | 6.2%
422499.0 | 1 | 2.1%
112584.0 | 1 | 2.1%
446549.4 | 1 | 2.1%
171045.0 | 1 | 2.1%
110256.0 | 1 | 2.1%
407539.4 | 1 | 2.1%
Other values (30) | 30 | 62.5%

Minimum 10 values
Value | Count | Frequency (%)
100176.0 | 1 | 2.1%
106728.0 | 1 | 2.1%
109392.0 | 3 | 6.2%
110208.0 | 1 | 2.1%
110256.0 | 1 | 2.1%
110736.0 | 1 | 2.1%
111792.0 | 1 | 2.1%
112392.0 | 1 | 2.1%
112584.0 | 1 | 2.1%
112728.0 | 1 | 2.1%

Maximum 10 values
Value | Count | Frequency (%)
456143.8 | 1 | 2.1%
455934.2 | 1 | 2.1%
454929.4 | 1 | 2.1%
451799.0 | 1 | 2.1%
448205.4 | 1 | 2.1%
446549.4 | 1 | 2.1%
445499.8 | 1 | 2.1%
442876.4 | 3 | 6.2%
431733.2 | 1 | 2.1%
429367.6 | 1 | 2.1%

실적(운행거리Km) (actual operating distance, km)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 40
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 285347.59
Minimum: 100176
Maximum: 456143.8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 564.0 B

Quantile statistics

Minimum: 100176
5-th percentile: 109392
Q1: 144682.65
Median: 278608.6
Q3: 428888.8
95-th percentile: 453833.76
Maximum: 456143.8
Range: 355967.8
Interquartile range (IQR): 284206.15

Descriptive statistics

Standard deviation: 149218.2
Coefficient of variation (CV): 0.52293485
Kurtosis: -1.9659254
Mean: 285347.59
Median Absolute Deviation (MAD): 150439.8
Skewness: -0.042338763
Sum: 13696684
Variance: 2.2266071 × 10^10
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=40)

Common values
Value | Count | Frequency (%)
109392.0 | 3 | 6.2%
169705.6 | 3 | 6.2%
442876.4 | 3 | 6.2%
416525.6 | 3 | 6.2%
422499.0 | 1 | 2.1%
112584.0 | 1 | 2.1%
446549.4 | 1 | 2.1%
171045.0 | 1 | 2.1%
110256.0 | 1 | 2.1%
407539.4 | 1 | 2.1%
Other values (30) | 30 | 62.5%

Minimum 10 values
Value | Count | Frequency (%)
100176.0 | 1 | 2.1%
106728.0 | 1 | 2.1%
109392.0 | 3 | 6.2%
110208.0 | 1 | 2.1%
110256.0 | 1 | 2.1%
110736.0 | 1 | 2.1%
111792.0 | 1 | 2.1%
112392.0 | 1 | 2.1%
112584.0 | 1 | 2.1%
112728.0 | 1 | 2.1%

Maximum 10 values
Value | Count | Frequency (%)
456143.8 | 1 | 2.1%
455934.2 | 1 | 2.1%
454929.4 | 1 | 2.1%
451799.0 | 1 | 2.1%
448205.4 | 1 | 2.1%
446549.4 | 1 | 2.1%
445499.8 | 1 | 2.1%
442876.4 | 3 | 6.2%
431733.2 | 1 | 2.1%
429367.6 | 1 | 2.1%

Interactions

[Figures: 25 pairwise interaction plots of the five numeric variables]

Correlations

[Figure: correlation heatmap]

Variable | 월별 | 호선별 | 계획(횟수) | 실적(횟수) | 계획(운행거리Km) | 실적(운행거리Km)
월별 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000
호선별 | 0.000 | 1.000 | 0.862 | 0.862 | 0.932 | 0.932
계획(횟수) | 0.000 | 0.862 | 1.000 | 1.000 | 0.893 | 0.893
실적(횟수) | 0.000 | 0.862 | 1.000 | 1.000 | 0.893 | 0.893
계획(운행거리Km) | 0.000 | 0.932 | 0.893 | 0.893 | 1.000 | 1.000
실적(운행거리Km) | 0.000 | 0.932 | 0.893 | 0.893 | 1.000 | 1.000
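
The five "High correlation" alerts can be related to the table above by counting, for each variable, how many other variables exceed a cut-off. The sketch below is illustrative only: the threshold of 0.5 is an assumption (the value actually used is recorded in the report's config.json), and the helper `highly_correlated` is a hypothetical name.

```python
names = ["월별", "호선별", "계획(횟수)", "실적(횟수)",
         "계획(운행거리Km)", "실적(운행거리Km)"]

# Values copied from the first correlation table in this report.
corr = [
    [1.000, 0.000, 0.000, 0.000, 0.000, 0.000],
    [0.000, 1.000, 0.862, 0.862, 0.932, 0.932],
    [0.000, 0.862, 1.000, 1.000, 0.893, 0.893],
    [0.000, 0.862, 1.000, 1.000, 0.893, 0.893],
    [0.000, 0.932, 0.893, 0.893, 1.000, 1.000],
    [0.000, 0.932, 0.893, 0.893, 1.000, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not taken from the report


def highly_correlated(i):
    """Names of the variables whose correlation with variable i meets the threshold."""
    return [names[j] for j, value in enumerate(corr[i])
            if j != i and abs(value) >= THRESHOLD]


for i, name in enumerate(names):
    print(name, len(highly_correlated(i)))
```

Under this assumption every variable except 월별 has four highly correlated partners, which agrees with the "and 3 other fields" wording of each alert.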
[Figure: correlation heatmap]

Variable | 월별 | 계획(횟수) | 실적(횟수) | 계획(운행거리Km) | 실적(운행거리Km) | 호선별
월별 | 1.000 | 0.037 | 0.038 | 0.022 | 0.023 | 0.000
계획(횟수) | 0.037 | 1.000 | 1.000 | 0.848 | 0.848 | 0.650
실적(횟수) | 0.038 | 1.000 | 1.000 | 0.848 | 0.848 | 0.650
계획(운행거리Km) | 0.022 | 0.848 | 0.848 | 1.000 | 1.000 | 0.808
실적(운행거리Km) | 0.023 | 0.848 | 0.848 | 1.000 | 1.000 | 0.808
호선별 | 0.000 | 0.650 | 0.650 | 0.808 | 0.808 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

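First rows
Index | 월별 | 호선별 | 계획(횟수) | 실적(횟수) | 계획(운행거리Km) | 실적(운행거리Km)
0 | 1 | 1호선 | 10967 | 10967 | 422499.0 | 422499.0
1 | 1 | 2호선 | 10518 | 10518 | 448205.4 | 448205.4
2 | 1 | 3호선 | 9484 | 9484 | 171660.4 | 171660.4
3 | 1 | 4호선 | 9228 | 9228 | 110736.0 | 110736.0
4 | 2 | 1호선 | 9923 | 9923 | 382371.2 | 382371.2
5 | 2 | 2호선 | 9500 | 9500 | 404890.4 | 404890.4
6 | 2 | 3호선 | 8582 | 8582 | 155334.2 | 155334.2
7 | 2 | 4호선 | 8348 | 8348 | 100176.0 | 100176.0
8 | 3 | 1호선 | 11144 | 11144 | 429367.6 | 429367.6
9 | 3 | 2호선 | 10708 | 10708 | 456143.8 | 456143.8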
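
Last rows
Index | 월별 | 호선별 | 계획(횟수) | 실적(횟수) | 계획(운행거리Km) | 실적(운행거리Km)
38 | 10 | 3호선 | 9438 | 9438 | 170827.8 | 170827.8
39 | 10 | 4호선 | 9184 | 9184 | 110208.0 | 110208.0
40 | 11 | 1호선 | 10812 | 10812 | 416525.6 | 416525.6
41 | 11 | 2호선 | 10400 | 10400 | 442876.4 | 442876.4
42 | 11 | 3호선 | 9376 | 9376 | 169705.6 | 169705.6
43 | 11 | 4호선 | 9116 | 9116 | 109392.0 | 109392.0
44 | 12 | 1호선 | 11123 | 11123 | 428452.8 | 428452.8
45 | 12 | 2호선 | 10688 | 10688 | 454929.4 | 454929.4
46 | 12 | 3호선 | 9638 | 9638 | 174447.8 | 174447.8
47 | 12 | 4호선 | 9366 | 9366 | 112392.0 | 112392.0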
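
One way to see why operating distance is so strongly associated with 호선별 is to divide distance by the number of runs in the sample rows. The sketch below uses the first ten sample rows only, so it illustrates the pattern rather than analysing the full dataset; the variable names are illustrative.

```python
from collections import defaultdict

# (line, actual runs, actual distance in km), copied from the first ten sample rows.
rows = [
    ("1호선", 10967, 422499.0), ("2호선", 10518, 448205.4),
    ("3호선", 9484, 171660.4), ("4호선", 9228, 110736.0),
    ("1호선", 9923, 382371.2), ("2호선", 9500, 404890.4),
    ("3호선", 8582, 155334.2), ("4호선", 8348, 100176.0),
    ("1호선", 11144, 429367.6), ("2호선", 10708, 456143.8),
]

# Collect the distance covered per run, grouped by line.
km_per_run = defaultdict(list)
for line, runs, km in rows:
    km_per_run[line].append(km / runs)

for line, ratios in sorted(km_per_run.items()):
    print(line, [round(r, 2) for r in ratios])
```

In these rows the distance per run is about 12.0 km for 4호선 and about 18.1 km for 3호선, while 1호선 and 2호선 are several times higher, so knowing the line largely determines the distance.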