Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.3 KiB
Average record size in memory44.3 B

Variable types

Categorical2
Numeric3

Alerts

평균소요시간(초) is highly overall correlated with 도착시간대High correlation
도착시간대 is highly overall correlated with 평균소요시간(초)High correlation

Reproduction

Analysis started2023-12-10 10:31:45.599021
Analysis finished2023-12-10 10:31:47.998139
Duration2.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

출발시간대
Categorical

Distinct21
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2019-11-30 18:00
12 
2019-11-30 16:20
11 
2019-11-30 17:20
10 
2019-11-30 16:40
2019-11-30 15:40
Other values (16)
51 

Length

Max length16
Median length16
Mean length16
Min length16

Unique

Unique5 ?
Unique (%)5.0%

Sample

1st row2019-11-30 06:40
2nd row2019-11-30 07:40
3rd row2019-11-30 07:40
4th row2019-11-30 10:20
5th row2019-11-30 10:40

Common Values

ValueCountFrequency (%)
2019-11-30 18:00 12
12.0%
2019-11-30 16:20 11
11.0%
2019-11-30 17:20 10
10.0%
2019-11-30 16:40 9
9.0%
2019-11-30 15:40 7
 
7.0%
2019-11-30 17:40 7
 
7.0%
2019-11-30 14:20 7
 
7.0%
2019-11-30 14:40 6
 
6.0%
2019-11-30 15:00 5
 
5.0%
2019-11-30 17:00 5
 
5.0%
Other values (11) 21
21.0%

Length

2023-12-10T19:31:48.127466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2019-11-30 100
50.0%
18:00 12
 
6.0%
16:20 11
 
5.5%
17:20 10
 
5.0%
16:40 9
 
4.5%
15:40 7
 
3.5%
17:40 7
 
3.5%
14:20 7
 
3.5%
14:40 6
 
3.0%
15:00 5
 
2.5%
Other values (12) 26
 
13.0%

도착시간대
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2019-12-01 00:00
28 
2019-12-01 00:20
15 
2019-12-01 00:40
12 
2019-12-01 01:00
2019-12-01 01:40
Other values (20)
30 

Length

Max length16
Median length16
Mean length16
Min length16

Unique

Unique12 ?
Unique (%)12.0%

Sample

1st row2019-12-01 00:20
2nd row2019-12-01 00:00
3rd row2019-12-01 00:40
4th row2019-12-01 06:00
5th row2019-12-01 00:20

Common Values

ValueCountFrequency (%)
2019-12-01 00:00 28
28.0%
2019-12-01 00:20 15
15.0%
2019-12-01 00:40 12
12.0%
2019-12-01 01:00 9
 
9.0%
2019-12-01 01:40 6
 
6.0%
2019-12-01 02:20 3
 
3.0%
2019-12-01 14:40 3
 
3.0%
2019-12-01 01:20 2
 
2.0%
2019-12-01 09:20 2
 
2.0%
2019-12-01 13:20 2
 
2.0%
Other values (15) 18
18.0%

Length

2023-12-10T19:31:48.351317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2019-12-01 100
50.0%
00:00 28
 
14.0%
00:20 15
 
7.5%
00:40 12
 
6.0%
01:00 9
 
4.5%
01:40 6
 
3.0%
02:20 3
 
1.5%
14:40 3
 
1.5%
12:00 2
 
1.0%
03:40 2
 
1.0%
Other values (16) 20
 
10.0%

출발행정동코드
Real number (ℝ)

Distinct78
Distinct (%)78.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1991428.1
Minimum1101060
Maximum3901058
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:31:48.655681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1101060
5-th percentile1102055
Q11113075
median1122062.5
Q33122529.8
95-th percentile3709231.9
Maximum3901058
Range2799998
Interquartile range (IQR)2009454.8

Descriptive statistics

Standard deviation1056838.9
Coefficient of variation (CV)0.53069397
Kurtosis-1.599932
Mean1991428.1
Median Absolute Deviation (MAD)20007.5
Skewness0.48440405
Sum1.9914281 × 108
Variance1.1169084 × 1012
MonotonicityNot monotonic
2023-12-10T19:31:48.925987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1119071 4
 
4.0%
1102055 4
 
4.0%
1103073 4
 
4.0%
1113075 4
 
4.0%
3102358 2
 
2.0%
2304065 2
 
2.0%
1123064 2
 
2.0%
1106082 2
 
2.0%
1117061 2
 
2.0%
1116069 2
 
2.0%
Other values (68) 72
72.0%
ValueCountFrequency (%)
1101060 1
 
1.0%
1101064 1
 
1.0%
1102052 2
2.0%
1102055 4
4.0%
1102071 1
 
1.0%
1103052 1
 
1.0%
1103065 1
 
1.0%
1103073 4
4.0%
1104068 1
 
1.0%
1104073 1
 
1.0%
ValueCountFrequency (%)
3901058 1
1.0%
3811457 1
1.0%
3807058 1
1.0%
3736012 1
1.0%
3732011 1
1.0%
3708033 1
1.0%
3701267 1
1.0%
3501174 1
1.0%
3408038 1
1.0%
3408012 1
1.0%

도착행정동코드
Real number (ℝ)

Distinct77
Distinct (%)77.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1507116.4
Minimum1101061
Maximum3901065
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:31:49.219174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1101061
5-th percentile1101073
Q11106071.8
median1116066
Q31123059.5
95-th percentile3401155.2
Maximum3901065
Range2800004
Interquartile range (IQR)16987.75

Descriptive statistics

Standard deviation838549.03
Coefficient of variation (CV)0.556393
Kurtosis1.4227574
Mean1507116.4
Median Absolute Deviation (MAD)9002
Skewness1.7719254
Sum1.5071164 × 108
Variance7.0316447 × 1011
MonotonicityNot monotonic
2023-12-10T19:31:49.494247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1121069 5
 
5.0%
1113075 4
 
4.0%
1114066 4
 
4.0%
1118052 3
 
3.0%
1103065 3
 
3.0%
1105053 3
 
3.0%
3901065 2
 
2.0%
3109178 2
 
2.0%
1105062 2
 
2.0%
1103074 2
 
2.0%
Other values (67) 70
70.0%
ValueCountFrequency (%)
1101061 1
 
1.0%
1101063 1
 
1.0%
1101070 1
 
1.0%
1101071 1
 
1.0%
1101073 2
2.0%
1103065 3
3.0%
1103072 1
 
1.0%
1103073 1
 
1.0%
1103074 2
2.0%
1104071 1
 
1.0%
ValueCountFrequency (%)
3901065 2
2.0%
3803076 1
1.0%
3404055 1
1.0%
3401159 1
1.0%
3401155 1
1.0%
3235011 1
1.0%
3203061 1
1.0%
3109178 2
2.0%
3107037 1
1.0%
3107011 1
1.0%

평균소요시간(초)
Real number (ℝ)

HIGH CORRELATION 

Distinct92
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean654.6
Minimum337
Maximum1416
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:31:49.819178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum337
5-th percentile385.75
Q1464.25
median554.5
Q3725.5
95-th percentile1290.7
Maximum1416
Range1079
Interquartile range (IQR)261.25

Descriptive statistics

Standard deviation285.78773
Coefficient of variation (CV)0.43658376
Kurtosis0.39824529
Mean654.6
Median Absolute Deviation (MAD)98.5
Skewness1.2910644
Sum65460
Variance81674.626
MonotonicityNot monotonic
2023-12-10T19:31:50.092378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
586 2
 
2.0%
403 2
 
2.0%
579 2
 
2.0%
523 2
 
2.0%
498 2
 
2.0%
501 2
 
2.0%
480 2
 
2.0%
595 2
 
2.0%
466 1
 
1.0%
423 1
 
1.0%
Other values (82) 82
82.0%
ValueCountFrequency (%)
337 1
1.0%
344 1
1.0%
350 1
1.0%
354 1
1.0%
381 1
1.0%
386 1
1.0%
391 1
1.0%
403 2
2.0%
404 1
1.0%
405 1
1.0%
ValueCountFrequency (%)
1416 1
1.0%
1375 1
1.0%
1343 1
1.0%
1306 1
1.0%
1304 1
1.0%
1290 1
1.0%
1254 1
1.0%
1235 1
1.0%
1171 1
1.0%
1170 1
1.0%

Interactions

2023-12-10T19:31:47.074683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:31:45.979859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:31:46.540059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:31:47.320415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:31:46.156023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:31:46.730490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:31:47.532602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:31:46.316637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:31:46.895398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:31:50.259103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출발시간대도착시간대출발행정동코드도착행정동코드평균소요시간(초)
출발시간대1.0000.3130.6650.5490.789
도착시간대0.3131.0000.5580.6300.917
출발행정동코드0.6650.5581.0000.0000.586
도착행정동코드0.5490.6300.0001.0000.425
평균소요시간(초)0.7890.9170.5860.4251.000
2023-12-10T19:31:50.434762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도착시간대출발시간대
도착시간대1.0000.067
출발시간대0.0671.000
2023-12-10T19:31:50.587700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출발행정동코드도착행정동코드평균소요시간(초)출발시간대도착시간대
출발행정동코드1.000-0.1990.3180.3190.228
도착행정동코드-0.1991.0000.0830.2420.303
평균소요시간(초)0.3180.0831.0000.4140.579
출발시간대0.3190.2420.4141.0000.067
도착시간대0.2280.3030.5790.0671.000

Missing values

2023-12-10T19:31:47.724794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:31:47.920251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

출발시간대도착시간대출발행정동코드도착행정동코드평균소요시간(초)
02019-11-30 06:402019-12-01 00:20111606939010651069
12019-11-30 07:402019-12-01 00:0023010641101063988
22019-11-30 07:402019-12-01 00:40323803111230531008
32019-11-30 10:202019-12-01 06:00333501111230651171
42019-11-30 10:402019-12-01 00:2021310111114066816
52019-11-30 11:002019-12-01 09:20333701111180511343
62019-11-30 13:002019-12-01 01:2011030733107037723
72019-11-30 13:002019-12-01 08:40370803311210691167
82019-11-30 13:402019-12-01 00:2029010111103065626
92019-11-30 13:402019-12-01 00:4011020552901011653
출발시간대도착시간대출발행정동코드도착행정동코드평균소요시간(초)
902019-11-30 18:002019-12-01 00:4011060821106072412
912019-11-30 18:002019-12-01 00:4031180531124064403
922019-11-30 18:002019-12-01 02:2031240361120068498
932019-11-30 18:002019-12-01 02:4031023581122058509
942019-11-30 18:002019-12-01 03:4032020681114059588
952019-11-30 18:002019-12-01 10:40111307531070111000
962019-11-30 18:002019-12-01 12:00373601211190711080
972019-11-30 18:002019-12-01 13:20240205611150531170
982019-11-30 18:202019-12-01 00:0011080641121066354
992019-11-30 18:202019-12-01 00:0011130751114066337