Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory53.3 B

Variable types

Numeric4
Categorical1
DateTime1

Alerts

위도값 is highly overall correlated with 택시아이디High correlation
경도값 is highly overall correlated with 택시아이디High correlation
이산화탄소값 is highly overall correlated with 택시아이디High correlation
택시아이디 is highly overall correlated with 위도값 and 2 other fieldsHigh correlation
순번 has unique valuesUnique
이산화탄소값 has 6 (6.0%) zerosZeros

Reproduction

Analysis started2023-12-10 06:22:41.589006
Analysis finished2023-12-10 06:22:44.595616
Duration3.01 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:22:44.745401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T15:22:45.034374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

택시아이디
Categorical

HIGH CORRELATION 

Distinct46
Distinct (%)46.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
T_47132284
 
5
T_24823989
 
3
T_98047829
 
3
T_73468981
 
3
T_49476082
 
3
Other values (41)
83 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowT_99659190
2nd rowT_47278771
3rd rowT_48743645
4th rowT_96070250
5th rowT_97974586

Common Values

ValueCountFrequency (%)
T_47132284 5
 
5.0%
T_24823989 3
 
3.0%
T_98047829 3
 
3.0%
T_73468981 3
 
3.0%
T_49476082 3
 
3.0%
T_49988787 3
 
3.0%
T_48816888 2
 
2.0%
T_96070250 2
 
2.0%
T_97974586 2
 
2.0%
T_72223838 2
 
2.0%
Other values (36) 72
72.0%

Length

2023-12-10T15:22:45.303009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
t_47132284 5
 
5.0%
t_98047829 3
 
3.0%
t_73468981 3
 
3.0%
t_49476082 3
 
3.0%
t_49988787 3
 
3.0%
t_24823989 3
 
3.0%
t_72883031 2
 
2.0%
t_72297082 2
 
2.0%
t_21747755 2
 
2.0%
t_22773166 2
 
2.0%
Other values (36) 72
72.0%

위도값
Real number (ℝ)

HIGH CORRELATION 

Distinct90
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.526705
Minimum37.430637
Maximum37.652752
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:22:45.526207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.430637
5-th percentile37.466428
Q137.489065
median37.518619
Q337.56588
95-th percentile37.605349
Maximum37.652752
Range0.222115
Interquartile range (IQR)0.076815

Descriptive statistics

Standard deviation0.050030662
Coefficient of variation (CV)0.0013332016
Kurtosis-0.58461713
Mean37.526705
Median Absolute Deviation (MAD)0.0380435
Skewness0.26499014
Sum3752.6705
Variance0.0025030671
MonotonicityNot monotonic
2023-12-10T15:22:45.780009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.475964 3
 
3.0%
37.434925 3
 
3.0%
37.52096 2
 
2.0%
37.565876 2
 
2.0%
37.594215 2
 
2.0%
37.4938 2
 
2.0%
37.56588 2
 
2.0%
37.55336 2
 
2.0%
37.490456 1
 
1.0%
37.4947 1
 
1.0%
Other values (80) 80
80.0%
ValueCountFrequency (%)
37.430637 1
 
1.0%
37.43064 1
 
1.0%
37.434925 3
3.0%
37.468086 1
 
1.0%
37.4681 1
 
1.0%
37.468117 1
 
1.0%
37.469578 1
 
1.0%
37.469692 1
 
1.0%
37.471317 1
 
1.0%
37.471348 1
 
1.0%
ValueCountFrequency (%)
37.652752 1
1.0%
37.652744 1
1.0%
37.61152 1
1.0%
37.6114 1
1.0%
37.605473 1
1.0%
37.605343 1
1.0%
37.599495 1
1.0%
37.59949 1
1.0%
37.599487 1
1.0%
37.594215 2
2.0%

경도값
Real number (ℝ)

HIGH CORRELATION 

Distinct74
Distinct (%)74.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.00868
Minimum126.83659
Maximum127.25926
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:22:46.036576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.83659
5-th percentile126.89151
Q1126.97607
median127.00986
Q3127.04314
95-th percentile127.10415
Maximum127.25926
Range0.422674
Interquartile range (IQR)0.067061

Descriptive statistics

Standard deviation0.071681869
Coefficient of variation (CV)0.00056438558
Kurtosis2.1094746
Mean127.00868
Median Absolute Deviation (MAD)0.033781
Skewness0.57470914
Sum12700.868
Variance0.0051382903
MonotonicityNot monotonic
2023-12-10T15:22:46.260759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.98782 5
 
5.0%
126.98166 3
 
3.0%
127.0619 3
 
3.0%
126.99694 3
 
3.0%
127.05532 2
 
2.0%
126.9207 2
 
2.0%
127.0862 2
 
2.0%
127.01416 2
 
2.0%
127.07274 2
 
2.0%
126.905 2
 
2.0%
Other values (64) 74
74.0%
ValueCountFrequency (%)
126.836586 2
2.0%
126.88755 1
1.0%
126.887596 1
1.0%
126.89137 1
1.0%
126.89152 1
1.0%
126.89854 2
2.0%
126.905 2
2.0%
126.9207 2
2.0%
126.94429 2
2.0%
126.94505 1
1.0%
ValueCountFrequency (%)
127.25926 1
1.0%
127.25925 1
1.0%
127.15487 1
1.0%
127.15483 1
1.0%
127.10418 1
1.0%
127.10415 1
1.0%
127.10293 1
1.0%
127.1028 1
1.0%
127.0897 2
2.0%
127.08865 1
1.0%

이산화탄소값
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct20
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1196.23
Minimum0
Maximum14680
Zeros6
Zeros (%)6.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:22:46.440146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1400
median400
Q31531
95-th percentile3294
Maximum14680
Range14680
Interquartile range (IQR)1131

Descriptive statistics

Standard deviation2140.1929
Coefficient of variation (CV)1.7891149
Kurtosis30.869643
Mean1196.23
Median Absolute Deviation (MAD)0.5
Skewness5.2172892
Sum119623
Variance4580425.6
MonotonicityNot monotonic
2023-12-10T15:22:46.963440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
400 50
50.0%
0 6
 
6.0%
1531 5
 
5.0%
401 5
 
5.0%
1537 3
 
3.0%
1968 3
 
3.0%
1610 3
 
3.0%
14680 2
 
2.0%
956 2
 
2.0%
1523 2
 
2.0%
Other values (10) 19
 
19.0%
ValueCountFrequency (%)
0 6
 
6.0%
400 50
50.0%
401 5
 
5.0%
480 2
 
2.0%
574 2
 
2.0%
956 2
 
2.0%
1270 2
 
2.0%
1495 2
 
2.0%
1523 2
 
2.0%
1531 5
 
5.0%
ValueCountFrequency (%)
14680 2
2.0%
4228 2
2.0%
3294 2
2.0%
3156 2
2.0%
2584 1
 
1.0%
2402 2
2.0%
1968 3
3.0%
1959 2
2.0%
1610 3
3.0%
1537 3
3.0%
Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2021-01-01 00:00:00
Maximum2021-01-01 00:00:02
2023-12-10T15:22:47.149675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:47.409030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=3)

Interactions

2023-12-10T15:22:43.716973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:41.900880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:42.535488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:43.145927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:43.859463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:42.065823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:42.688411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:43.264158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:44.001837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:42.205929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:42.875362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:43.411988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:44.149196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:42.363069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:43.011312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:43.541988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:22:47.572987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번택시아이디위도값경도값이산화탄소값측정시각
순번1.0000.0000.0000.0000.3480.892
택시아이디0.0001.0001.0001.0000.9990.000
위도값0.0001.0001.0000.6320.4510.000
경도값0.0001.0000.6321.0000.4590.000
이산화탄소값0.3480.9990.4510.4591.0000.078
측정시각0.8920.0000.0000.0000.0781.000
2023-12-10T15:22:47.758447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번위도값경도값이산화탄소값택시아이디
순번1.000-0.0080.0410.1400.000
위도값-0.0081.0000.045-0.0560.775
경도값0.0410.0451.0000.0020.770
이산화탄소값0.140-0.0560.0021.0000.743
택시아이디0.0000.7750.7700.7431.000

Missing values

2023-12-10T15:22:44.361678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:22:44.525884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번택시아이디위도값경도값이산화탄소값측정시각
01T_9965919037.52096127.0179724022021-01-01 00:00:00
12T_4727877137.6114127.072744002021-01-01 00:00:00
23T_4874364537.54574126.83658602021-01-01 00:00:00
34T_9607025037.55994127.154834002021-01-01 00:00:00
45T_9797458637.546444126.8875964002021-01-01 00:00:00
56T_7222383837.487724126.982474002021-01-01 00:00:00
67T_4998878737.475964126.9816619682021-01-01 00:00:00
78T_9921972837.536297126.9631654802021-01-01 00:00:00
89T_2108856237.489605126.891524002021-01-01 00:00:00
910T_2475074637.57153127.0094432942021-01-01 00:00:00
순번택시아이디위도값경도값이산화탄소값측정시각
9091T_2482398937.59949127.061915372021-01-01 00:00:01
9192T_2372533437.57569126.979645742021-01-01 00:00:01
9293T_4713228437.56588126.9878215312021-01-01 00:00:01
9394T_7332249337.570114127.0102714952021-01-01 00:00:01
9495T_7346898137.500324126.985344012021-01-01 00:00:02
9596T_4947608237.468117127.0418616102021-01-01 00:00:02
9697T_4713228437.565884126.9878215312021-01-01 00:00:02
9798T_9804782937.434925126.996944002021-01-01 00:00:02
9899T_4998878737.475964126.9816619682021-01-01 00:00:02
99100T_2482398937.599495127.061915372021-01-01 00:00:02