Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 161 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 2 |
Duplicate rows (%) | 1.2% |
Total size in memory | 6.7 KiB |
Average record size in memory | 42.8 B |
Variable types
Categorical | 3 |
---|---|
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | 모토브 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=MTVTAXI0000000000003 |
Dataset has 2 (1.2%) duplicate rows | Duplicates |
위도 is highly overall correlated with 택시ID | High correlation |
경도 is highly overall correlated with 택시ID | High correlation |
택시ID is highly overall correlated with 위도 and 2 other fields | High correlation |
승객탑승여부 is highly overall correlated with 택시ID | High correlation |
Reproduction
Analysis started | 2023-12-10 06:24:08.319538 |
---|---|
Analysis finished | 2023-12-10 06:24:10.067615 |
Duration | 1.75 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
택시ID
Categorical
HIGH CORRELATION
 
Distinct | 34 |
---|---|
Distinct (%) | 21.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
T_92408066 | 5 |
---|---|
T_94605377 | 5 |
T_70612477 | 5 |
T_45227948 | 5 |
T_45154705 | 5 |
Other values (29) |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.6% |
Sample
1st row | T_92408066 |
---|---|
2nd row | T_15668530 |
3rd row | T_70612477 |
4th row | T_45227948 |
5th row | T_45154705 |
Common Values
Value | Count | Frequency (%) |
T_92408066 | 5 | 3.1% |
T_94605377 | 5 | 3.1% |
T_70612477 | 5 | 3.1% |
T_45227948 | 5 | 3.1% |
T_45154705 | 5 | 3.1% |
T_90650218 | 5 | 3.1% |
T_91089680 | 5 | 3.1% |
T_66950294 | 5 | 3.1% |
T_17133403 | 5 | 3.1% |
T_93872940 | 5 | 3.1% |
Other values (24) | 111 |
Length
Value | Count | Frequency (%) |
t_92408066 | 5 | 3.1% |
t_95484301 | 5 | 3.1% |
t_43836318 | 5 | 3.1% |
t_15888261 | 5 | 3.1% |
t_45520923 | 5 | 3.1% |
t_15668530 | 5 | 3.1% |
t_67170025 | 5 | 3.1% |
t_19257470 | 5 | 3.1% |
t_42591176 | 5 | 3.1% |
t_69001117 | 5 | 3.1% |
Other values (24) | 111 |
위도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 112 |
---|---|
Distinct (%) | 69.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.488582 |
Minimum | 37.388767 |
---|---|
Maximum | 37.73002 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 37.388767 |
---|---|
5-th percentile | 37.42045 |
Q1 | 37.45636 |
median | 37.468243 |
Q3 | 37.508747 |
95-th percentile | 37.56846 |
Maximum | 37.73002 |
Range | 0.341253 |
Interquartile range (IQR) | 0.052387 |
Descriptive statistics
Standard deviation | 0.058850422 |
---|---|
Coefficient of variation (CV) | 0.0015698226 |
Kurtosis | 6.7164068 |
Mean | 37.488582 |
Median Absolute Deviation (MAD) | 0.025592 |
Skewness | 2.0539725 |
Sum | 6035.6617 |
Variance | 0.0034633722 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.484592 | 5 | 3.1% |
37.54526 | 5 | 3.1% |
37.460526 | 5 | 3.1% |
37.49383 | 4 | 2.5% |
37.56846 | 4 | 2.5% |
37.501568 | 4 | 2.5% |
37.458023 | 4 | 2.5% |
37.542427 | 3 | 1.9% |
37.45625 | 3 | 1.9% |
37.462593 | 3 | 1.9% |
Other values (102) | 121 |
Value | Count | Frequency (%) |
37.388767 | 1 | 0.6% |
37.388905 | 1 | 0.6% |
37.38904 | 1 | 0.6% |
37.38918 | 1 | 0.6% |
37.420433 | 2 | |
37.420437 | 1 | 0.6% |
37.420444 | 1 | 0.6% |
37.42045 | 1 | 0.6% |
37.423233 | 3 | |
37.423237 | 1 | 0.6% |
Value | Count | Frequency (%) |
37.73002 | 1 | 0.6% |
37.730015 | 1 | 0.6% |
37.730007 | 1 | 0.6% |
37.730003 | 1 | 0.6% |
37.729992 | 1 | 0.6% |
37.56846 | 4 | |
37.568455 | 1 | 0.6% |
37.54526 | 5 | |
37.542427 | 3 | |
37.542423 | 2 | 1.2% |
경도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 98 |
---|---|
Distinct (%) | 60.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 126.70777 |
Minimum | 126.63418 |
---|---|
Maximum | 126.96877 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 126.63418 |
---|---|
5-th percentile | 126.63836 |
Q1 | 126.67371 |
median | 126.70456 |
Q3 | 126.72907 |
95-th percentile | 126.76492 |
Maximum | 126.96877 |
Range | 0.334585 |
Interquartile range (IQR) | 0.05536 |
Descriptive statistics
Standard deviation | 0.058437438 |
---|---|
Coefficient of variation (CV) | 0.00046119853 |
Kurtosis | 10.096486 |
Mean | 126.70777 |
Median Absolute Deviation (MAD) | 0.0264 |
Skewness | 2.6334851 |
Sum | 20399.951 |
Variance | 0.0034149342 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
126.76492 | 5 | 3.1% |
126.67371 | 5 | 3.1% |
126.66354 | 5 | 3.1% |
126.65718 | 5 | 3.1% |
126.73673 | 5 | 3.1% |
126.63836 | 5 | 3.1% |
126.63418 | 4 | 2.5% |
126.69036 | 4 | 2.5% |
126.646324 | 4 | 2.5% |
126.71691 | 4 | 2.5% |
Other values (88) | 115 |
Value | Count | Frequency (%) |
126.63418 | 4 | |
126.63768 | 1 | 0.6% |
126.63778 | 1 | 0.6% |
126.63787 | 1 | 0.6% |
126.637955 | 1 | 0.6% |
126.63836 | 5 | |
126.64631 | 1 | 0.6% |
126.646324 | 4 | |
126.65718 | 5 | |
126.66354 | 5 |
Value | Count | Frequency (%) |
126.968765 | 1 | 0.6% |
126.968506 | 1 | 0.6% |
126.96826 | 1 | 0.6% |
126.968 | 1 | 0.6% |
126.96776 | 1 | 0.6% |
126.76492 | 5 | |
126.75928 | 1 | 0.6% |
126.75912 | 1 | 0.6% |
126.758965 | 1 | 0.6% |
126.758835 | 1 | 0.6% |
승객탑승여부
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
미탑승 | |
---|---|
탑승 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.5403727 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 탑승 |
---|---|
2nd row | 탑승 |
3rd row | 미탑승 |
4th row | 탑승 |
5th row | 탑승 |
Common Values
Value | Count | Frequency (%) |
미탑승 | 87 | |
탑승 | 74 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
미탑승 | 87 | |
탑승 | 74 |
측정 시간
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 3.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
2020-06-08 00:00:01 | |
---|---|
2020-06-08 00:00:03 | |
2020-06-08 00:00:00 | |
2020-06-08 00:00:02 | |
2020-06-08 00:00:04 |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 19 |
Min length | 19 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-06-08 00:00:00 |
---|---|
2nd row | 2020-06-08 00:00:00 |
3rd row | 2020-06-08 00:00:00 |
4th row | 2020-06-08 00:00:00 |
5th row | 2020-06-08 00:00:00 |
Common Values
Value | Count | Frequency (%) |
2020-06-08 00:00:01 | 34 | |
2020-06-08 00:00:03 | 34 | |
2020-06-08 00:00:00 | 33 | |
2020-06-08 00:00:02 | 32 | |
2020-06-08 00:00:04 | 28 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-06-08 | 161 | |
00:00:01 | 34 | 10.6% |
00:00:03 | 34 | 10.6% |
00:00:00 | 33 | 10.2% |
00:00:02 | 32 | 9.9% |
00:00:04 | 28 | 8.7% |
택시ID | 위도 | 경도 | 승객탑승여부 | 측정 시간 | |
---|---|---|---|---|---|
택시ID | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
위도 | 1.000 | 1.000 | 0.598 | 0.322 | 0.000 |
경도 | 1.000 | 0.598 | 1.000 | 0.355 | 0.000 |
승객탑승여부 | 1.000 | 0.322 | 0.355 | 1.000 | 0.000 |
측정 시간 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
택시ID | 측정 시간 | 승객탑승여부 | |
---|---|---|---|
택시ID | 1.000 | 0.000 | 0.894 |
측정 시간 | 0.000 | 1.000 | 0.000 |
승객탑승여부 | 0.894 | 0.000 | 1.000 |
위도 | 경도 | 택시ID | 승객탑승여부 | 측정 시간 | |
---|---|---|---|---|---|
위도 | 1.000 | 0.456 | 0.908 | 0.339 | 0.000 |
경도 | 0.456 | 1.000 | 0.902 | 0.428 | 0.000 |
택시ID | 0.908 | 0.902 | 1.000 | 0.894 | 0.000 |
승객탑승여부 | 0.339 | 0.428 | 0.894 | 1.000 | 0.000 |
측정 시간 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
택시ID | 위도 | 경도 | 승객탑승여부 | 측정 시간 | |
---|---|---|---|---|---|
0 | T_92408066 | 37.448227 | 126.73584 | 탑승 | 2020-06-08 00:00:00 |
1 | T_15668530 | 37.50838 | 126.968765 | 탑승 | 2020-06-08 00:00:00 |
2 | T_70612477 | 37.457962 | 126.6897 | 미탑승 | 2020-06-08 00:00:00 |
3 | T_45227948 | 37.46821 | 126.69036 | 탑승 | 2020-06-08 00:00:00 |
4 | T_45154705 | 37.504585 | 126.73096 | 탑승 | 2020-06-08 00:00:00 |
5 | T_90650218 | 37.475273 | 126.69245 | 미탑승 | 2020-06-08 00:00:00 |
6 | T_91089680 | 37.462593 | 126.70907 | 탑승 | 2020-06-08 00:00:00 |
7 | T_66950294 | 37.542427 | 126.73673 | 미탑승 | 2020-06-08 00:00:00 |
8 | T_17133403 | 37.521465 | 126.669044 | 미탑승 | 2020-06-08 00:00:00 |
9 | T_42298201 | 37.456356 | 126.71468 | 미탑승 | 2020-06-08 00:00:00 |
택시ID | 위도 | 경도 | 승객탑승여부 | 측정 시간 | |
---|---|---|---|---|---|
151 | T_69587066 | 37.73002 | 126.75872 | 미탑승 | 2020-06-08 00:00:04 |
152 | T_94605377 | 37.534145 | 126.75168 | 미탑승 | 2020-06-08 00:00:04 |
153 | T_45520923 | 37.52068 | 126.704575 | 미탑승 | 2020-06-08 00:00:04 |
154 | T_15888261 | 37.42045 | 126.72907 | 탑승 | 2020-06-08 00:00:04 |
155 | T_69001117 | 37.462425 | 126.68045 | 탑승 | 2020-06-08 00:00:04 |
156 | T_95484301 | 37.53736 | 126.72796 | 탑승 | 2020-06-08 00:00:04 |
157 | T_92920772 | 37.501564 | 126.74687 | 탑승 | 2020-06-08 00:00:04 |
158 | T_43836318 | 37.50567 | 126.71688 | 탑승 | 2020-06-08 00:00:04 |
159 | T_43177125 | 37.49383 | 126.68102 | 미탑승 | 2020-06-08 00:00:04 |
160 | T_93872940 | 37.491005 | 126.71512 | 탑승 | 2020-06-08 00:00:04 |
Most frequently occurring
택시ID | 위도 | 경도 | 승객탑승여부 | 측정 시간 | # duplicates | |
---|---|---|---|---|---|---|
0 | T_42884150 | 37.460526 | 126.63836 | 탑승 | 2020-06-08 00:00:01 | 2 |
1 | T_42884150 | 37.460526 | 126.63836 | 탑승 | 2020-06-08 00:00:03 | 2 |