Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 6645 |
Missing cells | 887 |
Missing cells (%) | 2.7% |
Duplicate rows | 2 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 285.7 KiB |
Average record size in memory | 44.0 B |
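The derived figures in this table follow from the raw counts. The sketch below recomputes them as a sanity check. It assumes "Missing cells (%)" is taken over all cells (variables × observations) and that KiB means 1024 bytes; the variable names are illustrative and are not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 5
n_observations = 6645
missing_cells = 887
duplicate_rows = 2
total_size_kib = 285.7

# Assumption: the missing-cell percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Duplicate rows as a share of all observations.
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.2f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The results round to 2.7%, under 0.1%, and 44.0 B, matching the reported values.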
Variable types
Categorical | 1 |
---|---|
Numeric | 3 |
DateTime | 1 |
Dataset
Description | Naju City bus times and route IDs, as provided on the Bitgaram Information Portal (빛가람정보포탈) |
---|---|
Author | KEPCO KDN Co., Ltd. (한전KDN(주)) |
URL | https://www.data.go.kr/data/15038342/fileData.do |
Alerts
Dataset has 2 (< 0.1%) duplicate rows | Duplicates |
시간 (time) has 887 (13.3%) missing values | Missing |
시간순서 (time sequence) has 207 (3.1%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 00:46:33.077391 |
---|---|
Analysis finished | 2023-12-12 00:46:35.201831 |
Duration | 2.12 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
평일주말구분(0:평일, 1:주말) [weekday/weekend flag; 0: weekday, 1: weekend]
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 52.0 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 4832 | 72.7% |
1 | 1813 | 27.3% |
정류장ID (stop ID)
Real number (ℝ)
Distinct | 229 |
---|---|
Distinct (%) | 3.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 610013.4 |
Minimum | 47 |
Maximum | 7021070 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 58.5 KiB |
Quantile statistics
Minimum | 47 |
---|---|
5-th percentile | 1250 |
Q1 | 1510 |
median | 2010 |
Q3 | 2330 |
95-th percentile | 7011050 |
Maximum | 7021070 |
Range | 7021023 |
Interquartile range (IQR) | 820 |
Descriptive statistics
Standard deviation | 1972578.8 |
---|---|
Coefficient of variation (CV) | 3.2336648 |
Kurtosis | 6.6372933 |
Mean | 610013.4 |
Median Absolute Deviation (MAD) | 500 |
Skewness | 2.9385842 |
Sum | 4.053539 × 10⁹ |
Variance | 3.8910673 × 10¹² |
Monotonicity | Increasing |
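Several of the 정류장ID statistics are simple functions of one another, so they can be cross-checked from the table alone. The following is a minimal sketch using only the reported (rounded) figures. The variable names are illustrative, and small deviations from the table are expected because the inputs are rounded.

```python
# Reported figures for 정류장ID, copied from the tables above.
n = 6645
mean = 610013.4
std = 1972578.8
q1, q3 = 1510, 2330
minimum, maximum = 47, 7021070

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
total = mean * n                 # sum recovered from mean and count

print(f"CV       = {cv:.4f}")
print(f"Variance = {variance:.4e}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")
print(f"Sum      = {total:.4e}")
```

The large gap between the mean (about 610,000) and the median (2010), together with a CV above 3, reflects the small group of seven-digit IDs at the top of the range rather than a meaningful numeric spread, since the column is an identifier.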
Common Values
Value | Count | Frequency (%) |
1870 | 217 | 3.3% |
2020 | 210 | 3.2% |
2010 | 210 | 3.2% |
1300 | 210 | 3.2% |
1290 | 210 | 3.2% |
1860 | 193 | 2.9% |
2170 | 171 | 2.6% |
2070 | 168 | 2.5% |
2160 | 168 | 2.5% |
1580 | 168 | 2.5% |
Other values (219) | 4720 | 71.0% |
Minimum 10 values
Value | Count | Frequency (%) |
47 | 25 | 0.4% |
48 | 24 | 0.4% |
811 | 25 | 0.4% |
817 | 25 | 0.4% |
1010 | 1 | < 0.1% |
1020 | 2 | < 0.1% |
1040 | 44 | 0.7% |
1050 | 24 | 0.4% |
1080 | 47 | 0.7% |
1090 | 4 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
7021070 | 21 | 0.3% |
7021060 | 21 | 0.3% |
7021050 | 21 | 0.3% |
7021040 | 21 | 0.3% |
7021030 | 21 | 0.3% |
7021020 | 21 | 0.3% |
7021010 | 21 | 0.3% |
7021000 | 21 | 0.3% |
7011110 | 27 | 0.4% |
7011100 | 27 | 0.4% |
시간순서 (time sequence)
Real number (ℝ)
ZEROS
Distinct | 42 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.12611 |
Minimum | 0 |
Maximum | 41 |
Zeros | 207 |
Zeros (%) | 3.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 58.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 4 |
median | 12 |
Q3 | 20 |
95-th percentile | 30 |
Maximum | 41 |
Range | 41 |
Interquartile range (IQR) | 16 |
Descriptive statistics
Standard deviation | 9.8741503 |
---|---|
Coefficient of variation (CV) | 0.7522526 |
Kurtosis | -0.45844639 |
Mean | 13.12611 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 0.52840591 |
Sum | 87223 |
Variance | 97.498844 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
1 | 888 | 13.4% |
11 | 208 | 3.1% |
20 | 208 | 3.1% |
18 | 208 | 3.1% |
17 | 208 | 3.1% |
16 | 208 | 3.1% |
15 | 208 | 3.1% |
14 | 208 | 3.1% |
13 | 208 | 3.1% |
12 | 208 | 3.1% |
Other values (32) | 3885 | 58.5% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 207 | 3.1% |
1 | 888 | 13.4% |
2 | 208 | 3.1% |
3 | 208 | 3.1% |
4 | 208 | 3.1% |
5 | 208 | 3.1% |
6 | 208 | 3.1% |
7 | 208 | 3.1% |
8 | 208 | 3.1% |
9 | 208 | 3.1% |
Maximum 10 values
Value | Count | Frequency (%) |
41 | 28 | 0.4% |
40 | 28 | 0.4% |
39 | 28 | 0.4% |
38 | 28 | 0.4% |
37 | 28 | 0.4% |
36 | 28 | 0.4% |
35 | 28 | 0.4% |
34 | 28 | 0.4% |
33 | 28 | 0.4% |
32 | 28 | 0.4% |
노선ID (route ID)
Real number (ℝ)
Distinct | 9 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.5504891 |
Minimum | 1 |
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 58.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 5 |
Q3 | 7 |
95-th percentile | 8 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.5216197 |
---|---|
Coefficient of variation (CV) | 0.55414256 |
Kurtosis | -1.4232131 |
Mean | 4.5504891 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.11387087 |
Sum | 30238 |
Variance | 6.3585658 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
7 | 1338 | 20.1% |
1 | 1328 | 20.0% |
6 | 1320 | 19.9% |
3 | 1209 | 18.2% |
8 | 486 | 7.3% |
2 | 398 | 6.0% |
5 | 209 | 3.1% |
4 | 189 | 2.8% |
9 | 168 | 2.5% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1328 | 20.0% |
2 | 398 | 6.0% |
3 | 1209 | 18.2% |
4 | 189 | 2.8% |
5 | 209 | 3.1% |
6 | 1320 | 19.9% |
7 | 1338 | 20.1% |
8 | 486 | 7.3% |
9 | 168 | 2.5% |
Maximum 10 values
Value | Count | Frequency (%) |
9 | 168 | 2.5% |
8 | 486 | 7.3% |
7 | 1338 | 20.1% |
6 | 1320 | 19.9% |
5 | 209 | 3.1% |
4 | 189 | 2.8% |
3 | 1209 | 18.2% |
2 | 398 | 6.0% |
1 | 1328 | 20.0% |
시간 (time)
Date
MISSING
Distinct | 1014 |
---|---|
Distinct (%) | 17.6% |
Missing | 887 |
Missing (%) | 13.3% |
Memory size | 52.0 KiB |
Minimum | 2023-12-12 05:33:00 |
---|---|
Maximum | 2023-12-12 23:48:00 |
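The minimum and maximum above carry the date 2023-12-12, the same day the analysis ran, while the sample rows near the end of the report show time-only values such as 07:55 and `<NA>`. One plausible reading, stated here as an assumption and not confirmed by the report, is that the time-only strings were parsed as datetimes and picked up the analysis date. The sketch below illustrates that behavior with the standard library; `parse_time_cell` and `ANALYSIS_DATE` are hypothetical names.

```python
from datetime import date, datetime
from typing import Optional

# Assumption: the date attached to each time is the day the analysis ran.
ANALYSIS_DATE = date(2023, 12, 12)


def parse_time_cell(cell: str, on: date = ANALYSIS_DATE) -> Optional[datetime]:
    """Parse an 'HH:MM' cell into a datetime on `on`; return None if missing."""
    cell = cell.strip()
    if cell in ("", "<NA>"):
        return None
    parsed = datetime.strptime(cell, "%H:%M")
    return datetime.combine(on, parsed.time())


# A few cells taken from the sample rows of the report.
cells = ["<NA>", "07:55", "08:50", "21:30"]
parsed = [parse_time_cell(c) for c in cells]
present = [p for p in parsed if p is not None]

print("missing:", parsed.count(None))
print("min:", min(present))
print("max:", max(present))
```

If this reading is right, only the time-of-day part of the reported minimum and maximum is meaningful for the bus schedule.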
 | 평일주말구분(0:평일, 1:주말) | 정류장ID | 시간순서 | 노선ID |
---|---|---|---|---|
평일주말구분(0:평일, 1:주말) | 1.000 | 0.291 | 0.318 | 0.498 |
정류장ID | 0.291 | 1.000 | 0.083 | 0.783 |
시간순서 | 0.318 | 0.083 | 1.000 | 0.367 |
노선ID | 0.498 | 0.783 | 0.367 | 1.000 |
 | 정류장ID | 시간순서 | 노선ID | 평일주말구분(0:평일, 1:주말) |
---|---|---|---|---|
정류장ID | 1.000 | -0.127 | 0.200 | 0.188 |
시간순서 | -0.127 | 1.000 | -0.052 | 0.244 |
노선ID | 0.200 | -0.052 | 1.000 | 0.499 |
평일주말구분(0:평일, 1:주말) | 0.188 | 0.244 | 0.499 | 1.000 |
First rows
 | 평일주말구분(0:평일, 1:주말) | 정류장ID | 시간순서 | 노선ID | 시간 |
---|---|---|---|---|---|
0 | 0 | 47 | 0 | 2 | <NA> |
1 | 0 | 47 | 1 | 2 | 07:55 |
2 | 0 | 47 | 2 | 2 | 08:50 |
3 | 0 | 47 | 3 | 2 | 09:30 |
4 | 0 | 47 | 4 | 2 | 10:10 |
5 | 0 | 47 | 5 | 2 | 10:25 |
6 | 0 | 47 | 6 | 2 | 11:35 |
7 | 0 | 47 | 7 | 2 | 12:15 |
8 | 0 | 47 | 8 | 2 | 13:00 |
9 | 0 | 47 | 9 | 2 | 13:15 |
Last rows
 | 평일주말구분(0:평일, 1:주말) | 정류장ID | 시간순서 | 노선ID | 시간 |
---|---|---|---|---|---|
6635 | 0 | 7021070 | 11 | 9 | 15:05 |
6636 | 0 | 7021070 | 12 | 9 | 15:45 |
6637 | 0 | 7021070 | 13 | 9 | 16:25 |
6638 | 0 | 7021070 | 14 | 9 | 17:05 |
6639 | 0 | 7021070 | 15 | 9 | 18:35 |
6640 | 0 | 7021070 | 16 | 9 | 19:15 |
6641 | 0 | 7021070 | 17 | 9 | 19:40 |
6642 | 0 | 7021070 | 18 | 9 | 20:20 |
6643 | 0 | 7021070 | 19 | 9 | 20:45 |
6644 | 0 | 7021070 | 20 | 9 | 21:30 |
Most frequently occurring
 | 평일주말구분(0:평일, 1:주말) | 정류장ID | 시간순서 | 노선ID | 시간 | # duplicates |
---|---|---|---|---|---|---|
0 | 0 | 4670 | 1 | 1 | <NA> | 2 |
1 | 0 | 4680 | 1 | 1 | <NA> | 2 |
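A duplicate-row table like the one above can be reproduced by counting identical records. The following is a minimal standard-library sketch, not the profiler's own method: the toy `rows` list is assembled from rows shown in this report (with `<NA>` represented as `None`), and the two-copies-each layout is assumed from the "# duplicates" column.

```python
from collections import Counter

# Columns: 평일주말구분, 정류장ID, 시간순서, 노선ID, 시간
rows = [
    (0, 4670, 1, 1, None),
    (0, 4670, 1, 1, None),
    (0, 4680, 1, 1, None),
    (0, 4680, 1, 1, None),
    (0, 47, 1, 2, "07:55"),  # a unique row from the sample, for contrast
]

# Count each distinct record, then keep those seen more than once.
counts = Counter(rows)
duplicates = {row: n for row, n in counts.items() if n > 1}

for row, n in duplicates.items():
    print(row, "occurs", n, "times")
```

Both duplicated records have 시간순서 = 1 and a missing 시간, so they may be placeholder entries in the source file and worth inspecting before deduplication.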