Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 26 |
Missing cells | 49 |
Missing cells (%) | 20.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.2 KiB |
Average record size in memory | 86.1 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 4 |
Dataset
Description | 매년 한국철도공사에서 발행하는 철도통계연보에 수록된 화차 하중별 보유현황으로 하중,종류,대수,단위 항목을 지원합니다. |
---|---|
Author | 한국철도공사 |
URL | https://www.data.go.kr/data/15053621/fileData.do |
적재하중 is highly overall correlated with 소 화 물 and 2 other fields | High correlation |
보유량수 is highly overall correlated with 무 개 차 and 4 other fields | High correlation |
무 개 차 is highly overall correlated with 보유량수 and 3 other fields | High correlation |
평 판 차 is highly overall correlated with 보유량수 and 3 other fields | High correlation |
소 화 물 is highly overall correlated with 적재하중 and 4 other fields | High correlation |
유 개 차 is highly overall correlated with 적재하중 and 4 other fields | High correlation |
차 장 차 is highly overall correlated with 적재하중 and 1 other fields | High correlation |
유 개 차 is highly imbalanced (55.4%) | Imbalance |
유 조 차 is highly imbalanced (76.5%) | Imbalance |
차 장 차 is highly imbalanced (70.5%) | Imbalance |
침 식 차 is highly imbalanced (76.5%) | Imbalance |
무 개 차 has 19 (73.1%) missing values | Missing |
평 판 차 has 19 (73.1%) missing values | Missing |
소 화 물 has 11 (42.3%) missing values | Missing |
적재하중 has unique values | Unique |
적재하중 has 1 (3.8%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 09:05:23.270368 |
---|---|
Analysis finished | 2023-12-12 09:05:26.700345 |
Duration | 3.43 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
적재하중
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
  ZEROS
 
Distinct | 26 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 54.623077 |
Minimum | 0 |
---|---|
Maximum | 165 |
Zeros | 1 |
Zeros (%) | 3.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 366.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 16.425 |
Q1 | 47.625 |
median | 53.4 |
Q3 | 59.125 |
95-th percentile | 97.5 |
Maximum | 165 |
Range | 165 |
Interquartile range (IQR) | 11.5 |
Descriptive statistics
Standard deviation | 30.663676 |
---|---|
Coefficient of variation (CV) | 0.56136853 |
Kurtosis | 6.2863737 |
Mean | 54.623077 |
Median Absolute Deviation (MAD) | 6.25 |
Skewness | 1.7653374 |
Sum | 1420.2 |
Variance | 940.26105 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
0.0 | 1 | 3.8% |
54.0 | 1 | 3.8% |
165.0 | 1 | 3.8% |
100.0 | 1 | 3.8% |
90.0 | 1 | 3.8% |
70.0 | 1 | 3.8% |
62.6 | 1 | 3.8% |
61.0 | 1 | 3.8% |
60.0 | 1 | 3.8% |
56.5 | 1 | 3.8% |
Other values (16) | 16 |
Value | Count | Frequency (%) |
0.0 | 1 | |
15.0 | 1 | |
20.7 | 1 | |
25.0 | 1 | |
28.3 | 1 | |
40.0 | 1 | |
47.5 | 1 | |
48.0 | 1 | |
50.0 | 1 | |
51.0 | 1 |
Value | Count | Frequency (%) |
165.0 | 1 | |
100.0 | 1 | |
90.0 | 1 | |
70.0 | 1 | |
62.6 | 1 | |
61.0 | 1 | |
60.0 | 1 | |
56.5 | 1 | |
55.8 | 1 | |
55.0 | 1 |
보유량수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 22 |
---|---|
Distinct (%) | 84.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 350.92308 |
Minimum | 1 |
---|---|
Maximum | 2742 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 366.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 11.5 |
median | 50 |
Q3 | 281.25 |
95-th percentile | 2062.5 |
Maximum | 2742 |
Range | 2741 |
Interquartile range (IQR) | 269.75 |
Descriptive statistics
Standard deviation | 700.72739 |
---|---|
Coefficient of variation (CV) | 1.9968119 |
Kurtosis | 7.1343829 |
Mean | 350.92308 |
Median Absolute Deviation (MAD) | 49 |
Skewness | 2.7519748 |
Sum | 9124 |
Variance | 491018.87 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3 | 11.5% |
13 | 2 | 7.7% |
3 | 2 | 7.7% |
222 | 1 | 3.8% |
122 | 1 | 3.8% |
255 | 1 | 3.8% |
20 | 1 | 3.8% |
390 | 1 | 3.8% |
115 | 1 | 3.8% |
21 | 1 | 3.8% |
Other values (12) | 12 |
Value | Count | Frequency (%) |
1 | 3 | |
3 | 2 | |
7 | 1 | 3.8% |
11 | 1 | 3.8% |
13 | 2 | |
20 | 1 | 3.8% |
21 | 1 | 3.8% |
35 | 1 | 3.8% |
38 | 1 | 3.8% |
62 | 1 | 3.8% |
Value | Count | Frequency (%) |
2742 | 1 | |
2371 | 1 | |
1137 | 1 | |
591 | 1 | |
543 | 1 | |
390 | 1 | |
290 | 1 | |
255 | 1 | |
222 | 1 | |
122 | 1 |
유 개 차
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 23.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 340.0 B |
<NA> | |
---|---|
11 | 1 |
516 | 1 |
243 | 1 |
114 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.7307692 |
Min length | 2 |
Unique
Unique | 5 ? |
---|---|
Unique (%) | 19.2% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | 11 |
Common Values
Value | Count | Frequency (%) |
<NA> | 21 | |
11 | 1 | 3.8% |
516 | 1 | 3.8% |
243 | 1 | 3.8% |
114 | 1 | 3.8% |
30 | 1 | 3.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 21 | |
11 | 1 | 3.8% |
516 | 1 | 3.8% |
243 | 1 | 3.8% |
114 | 1 | 3.8% |
30 | 1 | 3.8% |
무 개 차
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 7 |
---|---|
Distinct (%) | 100.0% |
Missing | 19 |
Missing (%) | 73.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 364.14286 |
Minimum | 7 |
---|---|
Maximum | 2126 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 366.0 B |
Quantile statistics
Minimum | 7 |
---|---|
5-th percentile | 10.9 |
Q1 | 21.5 |
median | 35 |
Q3 | 169 |
95-th percentile | 1578.2 |
Maximum | 2126 |
Range | 2119 |
Interquartile range (IQR) | 147.5 |
Descriptive statistics
Standard deviation | 783.72219 |
---|---|
Coefficient of variation (CV) | 2.1522383 |
Kurtosis | 6.6017941 |
Mean | 364.14286 |
Median Absolute Deviation (MAD) | 15 |
Skewness | 2.5556465 |
Sum | 2549 |
Variance | 614220.48 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
38 | 1 | 3.8% |
35 | 1 | 3.8% |
7 | 1 | 3.8% |
300 | 1 | 3.8% |
23 | 1 | 3.8% |
2126 | 1 | 3.8% |
20 | 1 | 3.8% |
(Missing) | 19 |
Value | Count | Frequency (%) |
7 | 1 | |
20 | 1 | |
23 | 1 | |
35 | 1 | |
38 | 1 | |
300 | 1 | |
2126 | 1 |
Value | Count | Frequency (%) |
2126 | 1 | |
300 | 1 | |
38 | 1 | |
35 | 1 | |
23 | 1 | |
20 | 1 | |
7 | 1 |
평 판 차
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 7 |
---|---|
Distinct (%) | 100.0% |
Missing | 19 |
Missing (%) | 73.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 403.28571 |
Minimum | 3 |
---|---|
Maximum | 1023 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 366.0 B |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 29.1 |
Q1 | 102.5 |
median | 290 |
Q3 | 651 |
95-th percentile | 1009.2 |
Maximum | 1023 |
Range | 1020 |
Interquartile range (IQR) | 548.5 |
Descriptive statistics
Standard deviation | 423.01328 |
---|---|
Coefficient of variation (CV) | 1.0489171 |
Kurtosis | -1.0532547 |
Mean | 403.28571 |
Median Absolute Deviation (MAD) | 200 |
Skewness | 0.94344714 |
Sum | 2823 |
Variance | 178940.24 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 1 | 3.8% |
977 | 1 | 3.8% |
325 | 1 | 3.8% |
90 | 1 | 3.8% |
290 | 1 | 3.8% |
1023 | 1 | 3.8% |
115 | 1 | 3.8% |
(Missing) | 19 |
Value | Count | Frequency (%) |
3 | 1 | |
90 | 1 | |
115 | 1 | |
290 | 1 | |
325 | 1 | |
977 | 1 | |
1023 | 1 |
Value | Count | Frequency (%) |
1023 | 1 | |
977 | 1 | |
325 | 1 | |
290 | 1 | |
115 | 1 | |
90 | 1 | |
3 | 1 |
소 화 물
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 13 |
---|---|
Distinct (%) | 86.7% |
Missing | 11 |
Missing (%) | 42.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 184.2 |
Minimum | 1 |
---|---|
Maximum | 1094 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 366.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 8 |
median | 102 |
Q3 | 188.5 |
95-th percentile | 679.6 |
Maximum | 1094 |
Range | 1093 |
Interquartile range (IQR) | 180.5 |
Descriptive statistics
Standard deviation | 294.0292 |
---|---|
Coefficient of variation (CV) | 1.5962497 |
Kurtosis | 6.5801088 |
Mean | 184.2 |
Median Absolute Deviation (MAD) | 99 |
Skewness | 2.4484042 |
Sum | 2763 |
Variance | 86453.171 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3 | 11.5% |
27 | 1 | 3.8% |
1094 | 1 | 3.8% |
117 | 1 | 3.8% |
502 | 1 | 3.8% |
102 | 1 | 3.8% |
114 | 1 | 3.8% |
21 | 1 | 3.8% |
13 | 1 | 3.8% |
390 | 1 | 3.8% |
Other values (3) | 3 | 11.5% |
(Missing) | 11 |
Value | Count | Frequency (%) |
1 | 3 | |
3 | 1 | 3.8% |
13 | 1 | 3.8% |
21 | 1 | 3.8% |
27 | 1 | 3.8% |
102 | 1 | 3.8% |
114 | 1 | 3.8% |
117 | 1 | 3.8% |
122 | 1 | 3.8% |
255 | 1 | 3.8% |
Value | Count | Frequency (%) |
1094 | 1 | |
502 | 1 | |
390 | 1 | |
255 | 1 | |
122 | 1 | |
117 | 1 | |
114 | 1 | |
102 | 1 | |
27 | 1 | |
21 | 1 |
유 조 차
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 7.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 340.0 B |
<NA> | |
---|---|
16 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9230769 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.8% |
Sample
1st row | <NA> |
---|---|
2nd row | 16 |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 25 | |
16 | 1 | 3.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 25 | |
16 | 1 | 3.8% |
차 장 차
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 11.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 340.0 B |
<NA> | |
---|---|
13 | 1 |
22 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8461538 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 7.7% |
Sample
1st row | 13 |
---|---|
2nd row | 22 |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 24 | |
13 | 1 | 3.8% |
22 | 1 | 3.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 24 | |
13 | 1 | 3.8% |
22 | 1 | 3.8% |
침 식 차
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 7.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 340.0 B |
<NA> | |
---|---|
24 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9230769 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 3.8% |
Sample
1st row | <NA> |
---|---|
2nd row | 24 |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 25 | |
24 | 1 | 3.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 25 | |
24 | 1 | 3.8% |
적재하중 | 보유량수 | 유 개 차 | 무 개 차 | 평 판 차 | 소 화 물 | 차 장 차 | |
---|---|---|---|---|---|---|---|
적재하중 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | NaN |
보유량수 | 0.000 | 1.000 | 1.000 | 1.000 | 0.936 | 0.827 | NaN |
유 개 차 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | NaN |
무 개 차 | 0.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 | NaN |
평 판 차 | 0.000 | 0.936 | 0.000 | 0.000 | 1.000 | 1.000 | NaN |
소 화 물 | 0.000 | 0.827 | 1.000 | 0.000 | 1.000 | 1.000 | NaN |
차 장 차 | NaN | NaN | NaN | NaN | NaN | NaN | 1.000 |
침 식 차 | 유 조 차 | 차 장 차 | 유 개 차 | |
---|---|---|---|---|
침 식 차 | 1.000 | NaN | NaN | NaN |
유 조 차 | NaN | 1.000 | NaN | NaN |
차 장 차 | NaN | NaN | 1.000 | NaN |
유 개 차 | NaN | NaN | NaN | 1.000 |
적재하중 | 보유량수 | 무 개 차 | 평 판 차 | 소 화 물 | 유 개 차 | 유 조 차 | 차 장 차 | 침 식 차 | |
---|---|---|---|---|---|---|---|---|---|
적재하중 | 1.000 | -0.189 | -0.036 | 0.250 | -0.559 | 1.000 | NaN | 1.000 | NaN |
보유량수 | -0.189 | 1.000 | 0.893 | 0.929 | 0.874 | 1.000 | NaN | 1.000 | NaN |
무 개 차 | -0.036 | 0.893 | 1.000 | 1.000 | -1.000 | 1.000 | 0.000 | 0.000 | 0.000 |
평 판 차 | 0.250 | 0.929 | 1.000 | 1.000 | 0.500 | 1.000 | 0.000 | 0.000 | 0.000 |
소 화 물 | -0.559 | 0.874 | -1.000 | 0.500 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 |
유 개 차 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 |
유 조 차 | NaN | NaN | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | NaN | NaN |
차 장 차 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | NaN | 1.000 | NaN |
침 식 차 | NaN | NaN | 0.000 | 0.000 | 0.000 | 0.000 | NaN | NaN | 1.000 |
적재하중 | 보유량수 | 유 개 차 | 무 개 차 | 평 판 차 | 소 화 물 | 유 조 차 | 차 장 차 | 침 식 차 | |
---|---|---|---|---|---|---|---|---|---|
0 | 0.0 | 13 | <NA> | <NA> | <NA> | <NA> | <NA> | 13 | <NA> |
1 | 15.0 | 62 | <NA> | <NA> | <NA> | <NA> | 16 | 22 | 24 |
2 | 20.7 | 38 | <NA> | 38 | <NA> | <NA> | <NA> | <NA> | <NA> |
3 | 25.0 | 35 | <NA> | 35 | <NA> | <NA> | <NA> | <NA> | <NA> |
4 | 28.3 | 11 | 11 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
5 | 40.0 | 3 | <NA> | <NA> | 3 | <NA> | <NA> | <NA> | <NA> |
6 | 47.5 | 7 | <NA> | 7 | <NA> | <NA> | <NA> | <NA> | <NA> |
7 | 48.0 | 543 | 516 | <NA> | <NA> | 27 | <NA> | <NA> | <NA> |
8 | 50.0 | 2371 | <NA> | 300 | 977 | 1094 | <NA> | <NA> | <NA> |
9 | 51.0 | 591 | 243 | 23 | 325 | <NA> | <NA> | <NA> | <NA> |
적재하중 | 보유량수 | 유 개 차 | 무 개 차 | 평 판 차 | 소 화 물 | 유 조 차 | 차 장 차 | 침 식 차 | |
---|---|---|---|---|---|---|---|---|---|
16 | 55.0 | 115 | <NA> | <NA> | 115 | <NA> | <NA> | <NA> | <NA> |
17 | 55.8 | 13 | <NA> | <NA> | <NA> | 13 | <NA> | <NA> | <NA> |
18 | 56.5 | 390 | <NA> | <NA> | <NA> | 390 | <NA> | <NA> | <NA> |
19 | 60.0 | 3 | <NA> | <NA> | <NA> | 3 | <NA> | <NA> | <NA> |
20 | 61.0 | 20 | <NA> | 20 | <NA> | <NA> | <NA> | <NA> | <NA> |
21 | 62.6 | 255 | <NA> | <NA> | <NA> | 255 | <NA> | <NA> | <NA> |
22 | 70.0 | 122 | <NA> | <NA> | <NA> | 122 | <NA> | <NA> | <NA> |
23 | 90.0 | 1 | <NA> | <NA> | <NA> | 1 | <NA> | <NA> | <NA> |
24 | 100.0 | 1 | <NA> | <NA> | <NA> | 1 | <NA> | <NA> | <NA> |
25 | 165.0 | 1 | <NA> | <NA> | <NA> | 1 | <NA> | <NA> | <NA> |