Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory53.3 B

Variable types

Numeric4
Categorical2

Alerts

순번 is highly overall correlated with 측정시각High correlation
위도값 is highly overall correlated with 택시아이디High correlation
경도값 is highly overall correlated with 택시아이디High correlation
온도감지기값 is highly overall correlated with 택시아이디High correlation
택시아이디 is highly overall correlated with 위도값 and 2 other fieldsHigh correlation
측정시각 is highly overall correlated with 순번High correlation
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 06:28:17.324458
Analysis finished2023-12-10 06:28:20.375394
Duration3.05 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:28:20.507684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T15:28:20.747876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

택시아이디
Categorical

HIGH CORRELATION 

Distinct33
Distinct (%)33.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
T_70978696
 
4
T_95630788
 
3
T_45740654
 
3
T_45227948
 
3
T_18671521
 
3
Other values (28)
84 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowT_69001117
2nd rowT_17133403
3rd rowT_45740654
4th rowT_45227948
5th rowT_18671521

Common Values

ValueCountFrequency (%)
T_70978696 4
 
4.0%
T_95630788 3
 
3.0%
T_45740654 3
 
3.0%
T_45227948 3
 
3.0%
T_18671521 3
 
3.0%
T_69587066 3
 
3.0%
T_70832208 3
 
3.0%
T_92627797 3
 
3.0%
T_43616587 3
 
3.0%
T_70612477 3
 
3.0%
Other values (23) 69
69.0%

Length

2023-12-10T15:28:20.968126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
t_70978696 4
 
4.0%
t_69001117 3
 
3.0%
t_99585947 3
 
3.0%
t_66437588 3
 
3.0%
t_17719353 3
 
3.0%
t_92408066 3
 
3.0%
t_70685721 3
 
3.0%
t_68415167 3
 
3.0%
t_95484301 3
 
3.0%
t_19330713 3
 
3.0%
Other values (23) 69
69.0%

위도값
Real number (ℝ)

HIGH CORRELATION 

Distinct71
Distinct (%)71.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.496931
Minimum37.400574
Maximum37.666523
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:28:21.168800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.400574
5-th percentile37.404026
Q137.45186
median37.492458
Q337.528256
95-th percentile37.602406
Maximum37.666523
Range0.265949
Interquartile range (IQR)0.0763965

Descriptive statistics

Standard deviation0.059345692
Coefficient of variation (CV)0.0015826813
Kurtosis0.5514198
Mean37.496931
Median Absolute Deviation (MAD)0.04054
Skewness0.70860496
Sum3749.6931
Variance0.0035219111
MonotonicityNot monotonic
2023-12-10T15:28:21.756609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.570866 4
 
4.0%
37.443077 3
 
3.0%
37.49017 3
 
3.0%
37.438824 3
 
3.0%
37.526722 3
 
3.0%
37.404026 3
 
3.0%
37.45186 3
 
3.0%
37.424 3
 
3.0%
37.499813 3
 
3.0%
37.666523 3
 
3.0%
Other values (61) 69
69.0%
ValueCountFrequency (%)
37.400574 1
 
1.0%
37.40072 1
 
1.0%
37.400864 1
 
1.0%
37.404026 3
3.0%
37.422703 1
 
1.0%
37.422707 2
2.0%
37.424 3
3.0%
37.438667 1
 
1.0%
37.438705 1
 
1.0%
37.43874 1
 
1.0%
ValueCountFrequency (%)
37.666523 3
3.0%
37.602413 1
 
1.0%
37.60241 1
 
1.0%
37.602406 1
 
1.0%
37.602287 2
2.0%
37.602283 1
 
1.0%
37.570866 4
4.0%
37.553173 1
 
1.0%
37.553093 1
 
1.0%
37.553013 1
 
1.0%

경도값
Real number (ℝ)

HIGH CORRELATION 

Distinct62
Distinct (%)62.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.7258
Minimum126.60767
Maximum127.01224
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:28:21.998466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.60767
5-th percentile126.61886
Q1126.67575
median126.70962
Q3126.73504
95-th percentile127.00959
Maximum127.01224
Range0.404574
Interquartile range (IQR)0.0592815

Descriptive statistics

Standard deviation0.086133757
Coefficient of variation (CV)0.00067968602
Kurtosis5.239267
Mean126.7258
Median Absolute Deviation (MAD)0.02664
Skewness2.2218239
Sum12672.58
Variance0.007419024
MonotonicityNot monotonic
2023-12-10T15:28:22.253060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.728775 3
 
3.0%
126.74704 3
 
3.0%
126.67446 3
 
3.0%
126.71003 3
 
3.0%
126.718864 3
 
3.0%
126.70348 3
 
3.0%
126.70272 3
 
3.0%
126.741684 3
 
3.0%
126.70241 3
 
3.0%
126.73626 3
 
3.0%
Other values (52) 70
70.0%
ValueCountFrequency (%)
126.607666 1
 
1.0%
126.60794 1
 
1.0%
126.60822 1
 
1.0%
126.61877 1
 
1.0%
126.61881 1
 
1.0%
126.618866 1
 
1.0%
126.65049 1
 
1.0%
126.650635 1
 
1.0%
126.6507 1
 
1.0%
126.65853 3
3.0%
ValueCountFrequency (%)
127.01224 1
1.0%
127.01221 1
1.0%
127.01218 1
1.0%
127.009605 2
2.0%
127.00959 1
1.0%
126.88309 1
1.0%
126.88297 1
1.0%
126.88286 1
1.0%
126.76439 1
1.0%
126.764336 1
1.0%

온도감지기값
Real number (ℝ)

HIGH CORRELATION 

Distinct54
Distinct (%)54.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.20380716
Minimum-2.859541
Maximum4.339666
Zeros0
Zeros (%)0.0%
Negative34
Negative (%)34.0%
Memory size1.0 KiB
2023-12-10T15:28:22.502343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-2.859541
5-th percentile-2.2113186
Q1-0.48294825
median0.272755
Q30.876249
95-th percentile2.937743
Maximum4.339666
Range7.199207
Interquartile range (IQR)1.3591972

Descriptive statistics

Standard deviation1.3893824
Coefficient of variation (CV)6.8171424
Kurtosis1.5124991
Mean0.20380716
Median Absolute Deviation (MAD)0.7423515
Skewness0.34401905
Sum20.380716
Variance1.9303836
MonotonicityNot monotonic
2023-12-10T15:28:22.744035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.448997 3
 
3.0%
0.181964 3
 
3.0%
-0.923552 3
 
3.0%
0.670634 3
 
3.0%
0.686656 3
 
3.0%
1.332876 3
 
3.0%
1.324865 3
 
3.0%
-1.454948 3
 
3.0%
-0.82208 3
 
3.0%
0.035096 3
 
3.0%
Other values (44) 70
70.0%
ValueCountFrequency (%)
-2.859541 2
2.0%
-2.816815 2
2.0%
-2.224002 1
 
1.0%
-2.210651 2
2.0%
-1.71664 2
2.0%
-1.673915 1
 
1.0%
-1.454948 3
3.0%
-1.252003 2
2.0%
-1.225299 1
 
1.0%
-0.923552 3
3.0%
ValueCountFrequency (%)
4.339666 1
 
1.0%
4.326314 2
2.0%
2.951095 1
 
1.0%
2.937743 2
2.0%
1.466392 1
 
1.0%
1.458381 2
2.0%
1.45037 2
2.0%
1.445029 1
 
1.0%
1.332876 3
3.0%
1.324865 3
3.0%

측정시각
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2021-01-01 00:00:00
33 
2021-01-01 00:00:01
33 
2021-01-01 00:00:02
33 
2021-01-01 00:00:03
 
1

Length

Max length19
Median length19
Mean length19
Min length19

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row2021-01-01 00:00:00
2nd row2021-01-01 00:00:00
3rd row2021-01-01 00:00:00
4th row2021-01-01 00:00:00
5th row2021-01-01 00:00:00

Common Values

ValueCountFrequency (%)
2021-01-01 00:00:00 33
33.0%
2021-01-01 00:00:01 33
33.0%
2021-01-01 00:00:02 33
33.0%
2021-01-01 00:00:03 1
 
1.0%

Length

2023-12-10T15:28:22.958105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:28:23.119695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-01-01 100
50.0%
00:00:00 33
 
16.5%
00:00:01 33
 
16.5%
00:00:02 33
 
16.5%
00:00:03 1
 
0.5%

Interactions

2023-12-10T15:28:19.461476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:17.771054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:18.289595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:18.872063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:19.581796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:17.904594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:18.422993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:19.013583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:19.712562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:18.024590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:18.554581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:19.164097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:19.881327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:18.162957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:18.720961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:28:19.319823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:28:23.233567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번택시아이디위도값경도값온도감지기값측정시각
순번1.0000.0000.0000.0000.1470.881
택시아이디0.0001.0001.0001.0000.9920.000
위도값0.0001.0001.0000.7010.8600.046
경도값0.0001.0000.7011.0000.6510.000
온도감지기값0.1470.9920.8600.6511.0000.000
측정시각0.8810.0000.0460.0000.0001.000
2023-12-10T15:28:23.399645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
택시아이디측정시각
택시아이디1.0000.000
측정시각0.0001.000
2023-12-10T15:28:23.549807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번위도값경도값온도감지기값택시아이디측정시각
순번1.0000.0740.018-0.0630.0000.725
위도값0.0741.0000.120-0.3990.8580.000
경도값0.0180.1201.0000.0790.8440.000
온도감지기값-0.063-0.3990.0791.0000.8010.000
택시아이디0.0000.8580.8440.8011.0000.000
측정시각0.7250.0000.0000.0000.0001.000

Missing values

2023-12-10T15:28:20.135404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:28:20.313208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번택시아이디위도값경도값온도감지기값측정시각
01T_6900111737.478817126.6188660.4489972021-01-01 00:00:00
12T_1713340337.424126.74704-0.2025642021-01-01 00:00:00
23T_4574065437.602287126.658530.0270852021-01-01 00:00:00
34T_4522794837.438824126.709620.9483482021-01-01 00:00:00
45T_1867152137.48517126.725271.1860082021-01-01 00:00:00
56T_6958706637.60241126.6507-2.2240022021-01-01 00:00:00
67T_7083220837.45186126.67498-0.4909592021-01-01 00:00:00
78T_9262779737.43874126.7033540.1979862021-01-01 00:00:00
89T_4361658737.48851126.70654-0.4722672021-01-01 00:00:00
910T_7097869637.570866126.73626-2.8595412021-01-01 00:00:00
순번택시아이디위도값경도값온도감지기값측정시각
9091T_6958706637.602406126.65049-2.2106512021-01-01 00:00:02
9192T_4574065437.602287126.658530.0404362021-01-01 00:00:02
9293T_9108968037.499813126.70272-0.9235522021-01-01 00:00:02
9394T_9387294037.493458126.68659-0.822082021-01-01 00:00:02
9495T_4537443637.518387126.710030.8762492021-01-01 00:00:02
9596T_1969693237.505493126.6722950.6439312021-01-01 00:00:02
9697T_1962368837.666523126.73462-1.6739152021-01-01 00:00:02
9798T_4229820137.44233126.674460.6866562021-01-01 00:00:02
9899T_4185873937.508797126.703480.1819642021-01-01 00:00:02
99100T_7097869637.570866126.73626-2.8168152021-01-01 00:00:03