Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 89 |
Missing cells | 115 |
Missing cells (%) | 14.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 7.2 KiB |
Average record size in memory | 82.5 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 3 |
Unsupported | 1 |
Dataset
Description | 경기도_BMS 예비차 상태 정보 |
---|---|
Author | 경기도 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=GTBX9KXJQ425RIVG609U33362683&infSeq=1 |
업체아이디 is highly overall correlated with 시외무정차수 | High correlation |
예비차총수 is highly overall correlated with 일반수 and 4 other fields | High correlation |
일반수 is highly overall correlated with 예비차총수 | High correlation |
무정차수 is highly overall correlated with 예비차총수 and 1 other fields | High correlation |
갱신일자 is highly overall correlated with 예비차총수 and 1 other fields | High correlation |
좌석수 is highly overall correlated with 예비차총수 | High correlation |
급행차수 is highly overall correlated with 무정차수 | High correlation |
시외무정차수 is highly overall correlated with 업체아이디 and 2 other fields | High correlation |
좌석수 is highly imbalanced (53.0%) | Imbalance |
급행차수 is highly imbalanced (60.3%) | Imbalance |
시외무정차수 is highly imbalanced (63.3%) | Imbalance |
일반수 has 13 (14.6%) missing values | Missing |
무정차수 has 13 (14.6%) missing values | Missing |
갱신아이디 has 89 (100.0%) missing values | Missing |
업체아이디 has unique values | Unique |
갱신아이디 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
예비차총수 has 25 (28.1%) zeros | Zeros |
일반수 has 35 (39.3%) zeros | Zeros |
무정차수 has 43 (48.3%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 22:30:56.708998 |
---|---|
Analysis finished | 2023-12-10 22:30:59.365647 |
Duration | 2.66 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
업체아이디
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 89 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4112749.4 |
Minimum | 4100200 |
---|---|
Maximum | 4155200 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 933.0 B |
Quantile statistics
Minimum | 4100200 |
---|---|
5-th percentile | 4100640 |
Q1 | 4102700 |
median | 4106300 |
Q3 | 4110700 |
95-th percentile | 4151640 |
Maximum | 4155200 |
Range | 55000 |
Interquartile range (IQR) | 8000 |
Descriptive statistics
Standard deviation | 17329.737 |
---|---|
Coefficient of variation (CV) | 0.0042136624 |
Kurtosis | 1.4466162 |
Mean | 4112749.4 |
Median Absolute Deviation (MAD) | 4100 |
Skewness | 1.7730721 |
Sum | 3.660347 × 108 |
Variance | 3.003198 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4111700 | 1 | 1.1% |
4103900 | 1 | 1.1% |
4108900 | 1 | 1.1% |
4105900 | 1 | 1.1% |
4106100 | 1 | 1.1% |
4110600 | 1 | 1.1% |
4103600 | 1 | 1.1% |
4107400 | 1 | 1.1% |
4103500 | 1 | 1.1% |
4111300 | 1 | 1.1% |
Other values (79) | 79 |
Value | Count | Frequency (%) |
4100200 | 1 | |
4100300 | 1 | |
4100400 | 1 | |
4100500 | 1 | |
4100600 | 1 | |
4100700 | 1 | |
4100800 | 1 | |
4100900 | 1 | |
4101100 | 1 | |
4101200 | 1 |
Value | Count | Frequency (%) |
4155200 | 1 | |
4155100 | 1 | |
4155000 | 1 | |
4153600 | 1 | |
4151800 | 1 | |
4151400 | 1 | |
4151100 | 1 | |
4150700 | 1 | |
4150600 | 1 | |
4150500 | 1 |
예비차총수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 22 |
---|---|
Distinct (%) | 24.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.2134831 |
Minimum | 0 |
---|---|
Maximum | 36 |
Zeros | 25 |
Zeros (%) | 28.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 933.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 2 |
Q3 | 7 |
95-th percentile | 23.2 |
Maximum | 36 |
Range | 36 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 7.5309542 |
---|---|
Coefficient of variation (CV) | 1.4445149 |
Kurtosis | 4.3877762 |
Mean | 5.2134831 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 2.1265539 |
Sum | 464 |
Variance | 56.715271 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 25 | |
1 | 12 | |
2 | 9 | 10.1% |
3 | 9 | 10.1% |
4 | 6 | 6.7% |
7 | 4 | 4.5% |
5 | 3 | 3.4% |
6 | 2 | 2.2% |
15 | 2 | 2.2% |
8 | 2 | 2.2% |
Other values (12) | 15 |
Value | Count | Frequency (%) |
0 | 25 | |
1 | 12 | |
2 | 9 | 10.1% |
3 | 9 | 10.1% |
4 | 6 | 6.7% |
5 | 3 | 3.4% |
6 | 2 | 2.2% |
7 | 4 | 4.5% |
8 | 2 | 2.2% |
9 | 2 | 2.2% |
Value | Count | Frequency (%) |
36 | 1 | |
28 | 2 | |
27 | 1 | |
24 | 1 | |
22 | 1 | |
19 | 1 | |
17 | 1 | |
15 | 2 | |
14 | 1 | |
13 | 2 |
일반수
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 15 |
---|---|
Distinct (%) | 19.7% |
Missing | 13 |
Missing (%) | 14.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.9736842 |
Minimum | 0 |
---|---|
Maximum | 27 |
Zeros | 35 |
Zeros (%) | 39.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 933.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 4 |
95-th percentile | 13 |
Maximum | 27 |
Range | 27 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 4.8825504 |
---|---|
Coefficient of variation (CV) | 1.6419196 |
Kurtosis | 7.7040092 |
Mean | 2.9736842 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.5102497 |
Sum | 226 |
Variance | 23.839298 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 35 | |
1 | 10 | 11.2% |
3 | 6 | 6.7% |
4 | 5 | 5.6% |
5 | 4 | 4.5% |
2 | 4 | 4.5% |
7 | 2 | 2.2% |
12 | 2 | 2.2% |
13 | 2 | 2.2% |
9 | 1 | 1.1% |
Other values (5) | 5 | 5.6% |
(Missing) | 13 | 14.6% |
Value | Count | Frequency (%) |
0 | 35 | |
1 | 10 | 11.2% |
2 | 4 | 4.5% |
3 | 6 | 6.7% |
4 | 5 | 5.6% |
5 | 4 | 4.5% |
7 | 2 | 2.2% |
8 | 1 | 1.1% |
9 | 1 | 1.1% |
11 | 1 | 1.1% |
Value | Count | Frequency (%) |
27 | 1 | 1.1% |
16 | 1 | 1.1% |
15 | 1 | 1.1% |
13 | 2 | |
12 | 2 | |
11 | 1 | 1.1% |
9 | 1 | 1.1% |
8 | 1 | 1.1% |
7 | 2 | |
5 | 4 |
좌석수
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 844.0 B |
0 | |
---|---|
<NA> | |
1 | 4 |
4 | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.4382022 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 1.1% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 71 | |
<NA> | 13 | 14.6% |
1 | 4 | 4.5% |
4 | 1 | 1.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 71 | |
na | 13 | 14.6% |
1 | 4 | 4.5% |
4 | 1 | 1.1% |
무정차수
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 15 |
---|---|
Distinct (%) | 19.7% |
Missing | 13 |
Missing (%) | 14.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.6447368 |
Minimum | 0 |
---|---|
Maximum | 27 |
Zeros | 43 |
Zeros (%) | 48.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 933.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 3 |
95-th percentile | 12 |
Maximum | 27 |
Range | 27 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 5.0377348 |
---|---|
Coefficient of variation (CV) | 1.9048151 |
Kurtosis | 8.6925659 |
Mean | 2.6447368 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.7795179 |
Sum | 201 |
Variance | 25.378772 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 43 | |
3 | 7 | 7.9% |
1 | 6 | 6.7% |
2 | 4 | 4.5% |
5 | 3 | 3.4% |
11 | 2 | 2.2% |
4 | 2 | 2.2% |
8 | 2 | 2.2% |
20 | 1 | 1.1% |
9 | 1 | 1.1% |
Other values (5) | 5 | 5.6% |
(Missing) | 13 | 14.6% |
Value | Count | Frequency (%) |
0 | 43 | |
1 | 6 | 6.7% |
2 | 4 | 4.5% |
3 | 7 | 7.9% |
4 | 2 | 2.2% |
5 | 3 | 3.4% |
7 | 1 | 1.1% |
8 | 2 | 2.2% |
9 | 1 | 1.1% |
10 | 1 | 1.1% |
Value | Count | Frequency (%) |
27 | 1 | 1.1% |
20 | 1 | 1.1% |
17 | 1 | 1.1% |
15 | 1 | 1.1% |
11 | 2 | |
10 | 1 | 1.1% |
9 | 1 | 1.1% |
8 | 2 | |
7 | 1 | 1.1% |
5 | 3 |
급행차수
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 844.0 B |
0 | |
---|---|
<NA> | |
1 | 2 |
13 | 1 |
2 | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.4494382 |
Min length | 1 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 2.2% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 72 | |
<NA> | 13 | 14.6% |
1 | 2 | 2.2% |
13 | 1 | 1.1% |
2 | 1 | 1.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 72 | |
na | 13 | 14.6% |
1 | 2 | 2.2% |
13 | 1 | 1.1% |
2 | 1 | 1.1% |
시외무정차수
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 844.0 B |
<NA> | |
---|---|
0 | 7 |
3 | 2 |
1 | 2 |
2 | 2 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.5617978 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 76 | |
0 | 7 | 7.9% |
3 | 2 | 2.2% |
1 | 2 | 2.2% |
2 | 2 | 2.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 76 | |
0 | 7 | 7.9% |
3 | 2 | 2.2% |
1 | 2 | 2.2% |
2 | 2 | 2.2% |
갱신일자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 40 |
---|---|
Distinct (%) | 44.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0223948 × 1013 |
Minimum | 2.0220701 × 1013 |
---|---|
Maximum | 2.023051 × 1013 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 933.0 B |
Quantile statistics
Minimum | 2.0220701 × 1013 |
---|---|
5-th percentile | 2.0220701 × 1013 |
Q1 | 2.0221209 × 1013 |
median | 2.0221209 × 1013 |
Q3 | 2.0230112 × 1013 |
95-th percentile | 2.023051 × 1013 |
Maximum | 2.023051 × 1013 |
Range | 9.8090255 × 109 |
Interquartile range (IQR) | 8.9030281 × 109 |
Descriptive statistics
Standard deviation | 4.2623893 × 109 |
---|---|
Coefficient of variation (CV) | 0.00021075951 |
Kurtosis | -1.2668479 |
Mean | 2.0223948 × 1013 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.86847063 |
Sum | 1.7999314 × 1015 |
Variance | 1.8167963 × 1019 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20221209125050 | 16 | |
20221209125048 | 15 | |
20221209125049 | 15 | |
20220701090116 | 3 | 3.4% |
20221209125047 | 3 | 3.4% |
20220701090115 | 3 | 3.4% |
20220701085634 | 1 | 1.1% |
20230510110659 | 1 | 1.1% |
20230510110339 | 1 | 1.1% |
20230510110406 | 1 | 1.1% |
Other values (30) | 30 |
Value | Count | Frequency (%) |
20220701085634 | 1 | 1.1% |
20220701090115 | 3 | 3.4% |
20220701090116 | 3 | 3.4% |
20221209125047 | 3 | 3.4% |
20221209125048 | 15 | |
20221209125049 | 15 | |
20221209125050 | 16 | |
20221216094014 | 1 | 1.1% |
20221223135829 | 1 | 1.1% |
20221223140025 | 1 | 1.1% |
Value | Count | Frequency (%) |
20230510111110 | 1 | |
20230510111045 | 1 | |
20230510110919 | 1 | |
20230510110659 | 1 | |
20230510110613 | 1 | |
20230510110534 | 1 | |
20230510110502 | 1 | |
20230510110432 | 1 | |
20230510110406 | 1 | |
20230510110339 | 1 |
갱신아이디
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 89 |
---|---|
Missing (%) | 100.0% |
Memory size | 933.0 B |
업체아이디 | 예비차총수 | 일반수 | 좌석수 | 무정차수 | 급행차수 | 시외무정차수 | 갱신일자 | |
---|---|---|---|---|---|---|---|---|
업체아이디 | 1.000 | 0.110 | 0.000 | 0.000 | 0.000 | 0.000 | NaN | 0.125 |
예비차총수 | 0.110 | 1.000 | 0.801 | 0.873 | 0.931 | 0.529 | NaN | 0.524 |
일반수 | 0.000 | 0.801 | 1.000 | 0.528 | 0.364 | 0.336 | NaN | 0.306 |
좌석수 | 0.000 | 0.873 | 0.528 | 1.000 | 0.807 | 0.303 | NaN | 0.065 |
무정차수 | 0.000 | 0.931 | 0.364 | 0.807 | 1.000 | 0.693 | NaN | 0.536 |
급행차수 | 0.000 | 0.529 | 0.336 | 0.303 | 0.693 | 1.000 | NaN | 0.354 |
시외무정차수 | NaN | NaN | NaN | NaN | NaN | NaN | 1.000 | 0.904 |
갱신일자 | 0.125 | 0.524 | 0.306 | 0.065 | 0.536 | 0.354 | 0.904 | 1.000 |
급행차수 | 좌석수 | 시외무정차수 | |
---|---|---|---|
급행차수 | 1.000 | 0.288 | NaN |
좌석수 | 0.288 | 1.000 | NaN |
시외무정차수 | NaN | NaN | 1.000 |
업체아이디 | 예비차총수 | 일반수 | 무정차수 | 갱신일자 | 좌석수 | 급행차수 | 시외무정차수 | |
---|---|---|---|---|---|---|---|---|
업체아이디 | 1.000 | -0.436 | -0.167 | -0.214 | -0.237 | 0.000 | 0.000 | 1.000 |
예비차총수 | -0.436 | 1.000 | 0.679 | 0.651 | 0.596 | 0.571 | 0.353 | 1.000 |
일반수 | -0.167 | 0.679 | 1.000 | 0.048 | 0.306 | 0.403 | 0.230 | 0.000 |
무정차수 | -0.214 | 0.651 | 0.048 | 1.000 | 0.443 | 0.492 | 0.507 | 0.000 |
갱신일자 | -0.237 | 0.596 | 0.306 | 0.443 | 1.000 | 0.110 | 0.234 | 0.643 |
좌석수 | 0.000 | 0.571 | 0.403 | 0.492 | 0.110 | 1.000 | 0.288 | 0.000 |
급행차수 | 0.000 | 0.353 | 0.230 | 0.507 | 0.234 | 0.288 | 1.000 | 0.000 |
시외무정차수 | 1.000 | 1.000 | 0.000 | 0.000 | 0.643 | 0.000 | 0.000 | 1.000 |
업체아이디 | 예비차총수 | 일반수 | 좌석수 | 무정차수 | 급행차수 | 시외무정차수 | 갱신일자 | 갱신아이디 | |
---|---|---|---|---|---|---|---|---|---|
0 | 4111700 | 1 | 0 | 0 | 1 | 0 | <NA> | 20220701085634 | <NA> |
1 | 4111100 | 7 | 7 | 0 | 0 | 0 | <NA> | 20221209125050 | <NA> |
2 | 4104400 | 5 | 5 | 0 | 0 | 0 | <NA> | 20221209125050 | <NA> |
3 | 4111500 | 0 | 0 | 0 | 0 | 0 | <NA> | 20221209125048 | <NA> |
4 | 4108500 | 0 | 0 | 0 | 0 | 0 | <NA> | 20230206135049 | <NA> |
5 | 4108700 | 2 | 2 | 0 | 0 | 0 | <NA> | 20230127113402 | <NA> |
6 | 4110700 | 0 | 0 | 0 | 0 | 0 | <NA> | 20221209125047 | <NA> |
7 | 4105700 | 12 | 12 | 0 | 0 | 0 | <NA> | 20230510110534 | <NA> |
8 | 4151400 | 0 | 0 | 0 | 0 | 0 | <NA> | 20221209125047 | <NA> |
9 | 4105000 | 5 | 0 | 0 | 5 | 0 | <NA> | 20230510110613 | <NA> |
업체아이디 | 예비차총수 | 일반수 | 좌석수 | 무정차수 | 급행차수 | 시외무정차수 | 갱신일자 | 갱신아이디 | |
---|---|---|---|---|---|---|---|---|---|
79 | 4150600 | 0 | <NA> | <NA> | <NA> | <NA> | 0 | 20220701090115 | <NA> |
80 | 4150100 | 0 | <NA> | <NA> | <NA> | <NA> | 0 | 20220701090115 | <NA> |
81 | 4151100 | 1 | <NA> | <NA> | <NA> | <NA> | 1 | 20221209125050 | <NA> |
82 | 4150300 | 0 | <NA> | <NA> | <NA> | <NA> | 0 | 20230206140738 | <NA> |
83 | 4155200 | 2 | <NA> | <NA> | <NA> | <NA> | 2 | 20230510111045 | <NA> |
84 | 4150700 | 0 | <NA> | <NA> | <NA> | <NA> | 0 | 20220701090116 | <NA> |
85 | 4150500 | 0 | <NA> | <NA> | <NA> | <NA> | 0 | 20220701090116 | <NA> |
86 | 4153600 | 2 | <NA> | <NA> | <NA> | <NA> | 2 | 20230510111110 | <NA> |
87 | 4155000 | 1 | <NA> | <NA> | <NA> | <NA> | 1 | 20221209125050 | <NA> |
88 | 4151800 | 0 | <NA> | <NA> | <NA> | <NA> | 0 | 20220701090116 | <NA> |