Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 2000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 101.7 KiB |
Average record size in memory | 52.1 B |
Variable types
Numeric | 4 |
---|---|
Text | 1 |
Categorical | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | (주)모토브 / 신재훈 |
URL | https://www.bigdata-transportation.kr/frn/prdt/detail?prdtId=PRDTNUM_000000020252 |
register_at has constant value "" | Constant |
light_sensor_value_id has unique values | Unique |
Reproduction
Analysis started | 2024-04-22 00:31:16.959637 |
---|---|
Analysis finished | 2024-04-22 00:31:18.768365 |
Duration | 1.81 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
light_sensor_value_id
Real number (ℝ)
UNIQUE
 
Distinct | 2000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1000.5 |
Minimum | 1 |
---|---|
Maximum | 2000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.7 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 100.95 |
Q1 | 500.75 |
median | 1000.5 |
Q3 | 1500.25 |
95-th percentile | 1900.05 |
Maximum | 2000 |
Range | 1999 |
Interquartile range (IQR) | 999.5 |
Descriptive statistics
Standard deviation | 577.49459 |
---|---|
Coefficient of variation (CV) | 0.57720599 |
Kurtosis | -1.2 |
Mean | 1000.5 |
Median Absolute Deviation (MAD) | 500 |
Skewness | 0 |
Sum | 2001000 |
Variance | 333500 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.1% |
1331 | 1 | 0.1% |
1344 | 1 | 0.1% |
1343 | 1 | 0.1% |
1342 | 1 | 0.1% |
1341 | 1 | 0.1% |
1340 | 1 | 0.1% |
1339 | 1 | 0.1% |
1338 | 1 | 0.1% |
1337 | 1 | 0.1% |
Other values (1990) | 1990 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
2000 | 1 | |
1999 | 1 | |
1998 | 1 | |
1997 | 1 | |
1996 | 1 | |
1995 | 1 | |
1994 | 1 | |
1993 | 1 | |
1992 | 1 | |
1991 | 1 |
taxi_id
Text
Distinct | 152 |
---|---|
Distinct (%) | 7.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.8 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 20000 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | T_96289981 |
---|---|
2nd row | T_73322493 |
3rd row | T_45081461 |
4th row | T_44934973 |
5th row | T_47791477 |
Value | Count | Frequency (%) |
t_41712252 | 14 | 0.7% |
t_95044839 | 14 | 0.7% |
t_72223838 | 14 | 0.7% |
t_94898351 | 14 | 0.7% |
t_49402838 | 14 | 0.7% |
t_47644990 | 14 | 0.7% |
t_94605377 | 14 | 0.7% |
t_19037739 | 14 | 0.7% |
t_97974586 | 14 | 0.7% |
t_95704032 | 14 | 0.7% |
Other values (142) | 1860 |
Most occurring characters
Value | Count | Frequency (%) |
4 | 2111 | |
T | 2000 | |
_ | 2000 | |
9 | 1800 | |
7 | 1700 | |
8 | 1695 | |
3 | 1652 | |
6 | 1582 | |
2 | 1522 | |
1 | 1471 | |
Other values (2) | 2467 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 16000 | |
Uppercase Letter | 2000 | 10.0% |
Connector Punctuation | 2000 | 10.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
4 | 2111 | |
9 | 1800 | |
7 | 1700 | |
8 | 1695 | |
3 | 1652 | |
6 | 1582 | |
2 | 1522 | |
1 | 1471 | |
5 | 1247 | |
0 | 1220 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 2000 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 18000 | |
Latin | 2000 | 10.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
4 | 2111 | |
_ | 2000 | |
9 | 1800 | |
7 | 1700 | |
8 | 1695 | |
3 | 1652 | |
6 | 1582 | |
2 | 1522 | |
1 | 1471 | |
5 | 1247 |
Latin
Value | Count | Frequency (%) |
T | 2000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 20000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
4 | 2111 | |
T | 2000 | |
_ | 2000 | |
9 | 1800 | |
7 | 1700 | |
8 | 1695 | |
3 | 1652 | |
6 | 1582 | |
2 | 1522 | |
1 | 1471 | |
Other values (2) | 2467 |
latitude
Real number (ℝ)
Distinct | 1332 |
---|---|
Distinct (%) | 66.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.497015 |
Minimum | 36.941452 |
---|---|
Maximum | 37.778934 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.7 KiB |
Quantile statistics
Minimum | 36.941452 |
---|---|
5-th percentile | 37.406297 |
Q1 | 37.460313 |
median | 37.497256 |
Q3 | 37.530572 |
95-th percentile | 37.60284 |
Maximum | 37.778934 |
Range | 0.837482 |
Interquartile range (IQR) | 0.07025875 |
Descriptive statistics
Standard deviation | 0.076530507 |
---|---|
Coefficient of variation (CV) | 0.002040976 |
Kurtosis | 17.339603 |
Mean | 37.497015 |
Median Absolute Deviation (MAD) | 0.0349585 |
Skewness | -2.0596037 |
Sum | 74994.03 |
Variance | 0.0058569185 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.479614 | 15 | 0.8% |
37.473686 | 14 | 0.7% |
37.5649 | 14 | 0.7% |
37.60681 | 14 | 0.7% |
37.41697 | 14 | 0.7% |
37.523636 | 14 | 0.7% |
37.443947 | 13 | 0.7% |
37.43951 | 13 | 0.7% |
37.48688 | 13 | 0.7% |
37.46929 | 13 | 0.7% |
Other values (1322) | 1863 |
Value | Count | Frequency (%) |
36.941452 | 1 | |
36.941525 | 1 | |
36.941605 | 1 | |
36.941692 | 1 | |
36.941776 | 1 | |
36.94187 | 1 | |
36.941967 | 1 | |
36.94207 | 1 | |
36.942173 | 1 | |
36.942276 | 1 |
Value | Count | Frequency (%) |
37.778934 | 1 | |
37.778854 | 1 | |
37.778793 | 1 | |
37.77873 | 1 | |
37.778675 | 1 | |
37.778606 | 1 | |
37.77855 | 1 | |
37.778515 | 1 | |
37.7785 | 1 | |
37.778496 | 2 |
longitude
Real number (ℝ)
Distinct | 1025 |
---|---|
Distinct (%) | 51.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 126.82073 |
Minimum | 126.48848 |
---|---|
Maximum | 127.46536 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.7 KiB |
Quantile statistics
Minimum | 126.48848 |
---|---|
5-th percentile | 126.63768 |
Q1 | 126.69009 |
median | 126.72744 |
Q3 | 126.95759 |
95-th percentile | 127.12796 |
Maximum | 127.46536 |
Range | 0.97688 |
Interquartile range (IQR) | 0.26749625 |
Descriptive statistics
Standard deviation | 0.17486108 |
---|---|
Coefficient of variation (CV) | 0.0013788052 |
Kurtosis | -0.14400696 |
Mean | 126.82073 |
Median Absolute Deviation (MAD) | 0.0739175 |
Skewness | 0.77904006 |
Sum | 253641.45 |
Variance | 0.030576398 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
126.69009 | 17 | 0.9% |
126.74964 | 14 | 0.7% |
126.65602 | 14 | 0.7% |
127.12796 | 14 | 0.7% |
126.71236 | 14 | 0.7% |
126.659004 | 14 | 0.7% |
127.06097 | 14 | 0.7% |
126.67689 | 14 | 0.7% |
126.65866 | 14 | 0.7% |
126.70156 | 14 | 0.7% |
Other values (1015) | 1857 |
Value | Count | Frequency (%) |
126.48848 | 1 | |
126.488625 | 1 | |
126.48877 | 1 | |
126.48893 | 1 | |
126.48908 | 1 | |
126.48923 | 1 | |
126.48937 | 1 | |
126.4895 | 1 | |
126.48965 | 1 | |
126.48978 | 1 |
Value | Count | Frequency (%) |
127.46536 | 1 | |
127.46499 | 1 | |
127.464615 | 1 | |
127.46424 | 1 | |
127.46387 | 1 | |
127.46349 | 1 | |
127.46312 | 1 | |
127.462746 | 1 | |
127.46239 | 1 | |
127.46201 | 1 |
light_level
Real number (ℝ)
Distinct | 53 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.8255 |
Minimum | 1 |
---|---|
Maximum | 234 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.7 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 6 |
median | 10 |
Q3 | 17 |
95-th percentile | 33 |
Maximum | 234 |
Range | 233 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 14.921105 |
---|---|
Coefficient of variation (CV) | 1.0792452 |
Kurtosis | 42.231743 |
Mean | 13.8255 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 5.0371618 |
Sum | 27651 |
Variance | 222.63937 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 145 | 7.2% |
7 | 141 | 7.0% |
6 | 133 | 6.7% |
10 | 115 | 5.8% |
3 | 103 | 5.1% |
8 | 100 | 5.0% |
9 | 100 | 5.0% |
4 | 95 | 4.8% |
2 | 93 | 4.7% |
13 | 84 | 4.2% |
Other values (43) | 891 |
Value | Count | Frequency (%) |
1 | 25 | 1.2% |
2 | 93 | |
3 | 103 | |
4 | 95 | |
5 | 145 | |
6 | 133 | |
7 | 141 | |
8 | 100 | |
9 | 100 | |
10 | 115 |
Value | Count | Frequency (%) |
234 | 1 | 0.1% |
122 | 2 | 0.1% |
116 | 2 | 0.1% |
115 | 2 | 0.1% |
114 | 11 | |
93 | 5 | |
61 | 2 | 0.1% |
60 | 12 | |
57 | 2 | 0.1% |
50 | 1 | 0.1% |
register_at
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.8 KiB |
2020-09-10 22:00 |
---|
Length
Max length | 16 |
---|---|
Median length | 16 |
Mean length | 16 |
Min length | 16 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-09-10 22:00 |
---|---|
2nd row | 2020-09-10 22:00 |
3rd row | 2020-09-10 22:00 |
4th row | 2020-09-10 22:00 |
5th row | 2020-09-10 22:00 |
Common Values
Value | Count | Frequency (%) |
2020-09-10 22:00 | 2000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-09-10 | 2000 | |
22:00 | 2000 |
light_sensor_value_id | latitude | longitude | light_level | |
---|---|---|---|---|
light_sensor_value_id | 1.000 | 0.000 | 0.000 | 0.000 |
latitude | 0.000 | 1.000 | 0.686 | 0.211 |
longitude | 0.000 | 0.686 | 1.000 | 0.260 |
light_level | 0.000 | 0.211 | 0.260 | 1.000 |
light_sensor_value_id | latitude | longitude | light_level | |
---|---|---|---|---|
light_sensor_value_id | 1.000 | -0.002 | -0.004 | -0.010 |
latitude | -0.002 | 1.000 | 0.206 | -0.013 |
longitude | -0.004 | 0.206 | 1.000 | -0.093 |
light_level | -0.010 | -0.013 | -0.093 | 1.000 |
light_sensor_value_id | taxi_id | latitude | longitude | light_level | register_at | |
---|---|---|---|---|---|---|
0 | 1 | T_96289981 | 37.610664 | 126.724655 | 8 | 2020-09-10 22:00 |
1 | 2 | T_73322493 | 37.621788 | 127.08755 | 13 | 2020-09-10 22:00 |
2 | 3 | T_45081461 | 37.554714 | 126.673065 | 2 | 2020-09-10 22:00 |
3 | 4 | T_44934973 | 37.52825 | 126.67643 | 10 | 2020-09-10 22:00 |
4 | 5 | T_47791477 | 37.502796 | 127.04183 | 25 | 2020-09-10 22:00 |
5 | 6 | T_43689831 | 37.52204 | 126.79642 | 3 | 2020-09-10 22:00 |
6 | 7 | T_97388636 | 37.527405 | 126.90563 | 10 | 2020-09-10 22:00 |
7 | 8 | T_92701041 | 37.462093 | 126.63755 | 6 | 2020-09-10 22:00 |
8 | 9 | T_18378546 | 37.50068 | 126.730644 | 60 | 2020-09-10 22:00 |
9 | 10 | T_73102763 | 37.55852 | 126.859764 | 18 | 2020-09-10 22:00 |
light_sensor_value_id | taxi_id | latitude | longitude | light_level | register_at | |
---|---|---|---|---|---|---|
1990 | 1991 | T_94825108 | 37.379314 | 126.65602 | 4 | 2020-09-10 22:00 |
1991 | 1992 | T_91602386 | 37.465008 | 126.70485 | 9 | 2020-09-10 22:00 |
1992 | 1993 | T_47425259 | 37.490837 | 127.055374 | 15 | 2020-09-10 22:00 |
1993 | 1994 | T_66437588 | 37.388325 | 126.76614 | 2 | 2020-09-10 22:00 |
1994 | 1995 | T_20136394 | 37.56815 | 126.66369 | 2 | 2020-09-10 22:00 |
1995 | 1996 | T_48304183 | 37.53475 | 127.136215 | 16 | 2020-09-10 22:00 |
1996 | 1997 | T_47059040 | 37.511803 | 126.888214 | 6 | 2020-09-10 22:00 |
1997 | 1998 | T_49402838 | 37.421562 | 127.15898 | 2 | 2020-09-10 22:00 |
1998 | 1999 | T_44788486 | 37.524105 | 126.76774 | 6 | 2020-09-10 22:00 |
1999 | 2000 | T_41712252 | 37.595737 | 126.83291 | 4 | 2020-09-10 22:00 |