Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory53.3 B

Variable types

Numeric4
Categorical2

Alerts

순번 is highly overall correlated with 측정시각High correlation
위도값 is highly overall correlated with 택시아이디High correlation
경도값 is highly overall correlated with 택시아이디High correlation
총휘발성유기화합물값 is highly overall correlated with 택시아이디High correlation
택시아이디 is highly overall correlated with 위도값 and 2 other fieldsHigh correlation
측정시각 is highly overall correlated with 순번High correlation
순번 has unique valuesUnique
총휘발성유기화합물값 has 61 (61.0%) zerosZeros

Reproduction

Analysis started2023-12-10 06:13:20.486504
Analysis finished2023-12-10 06:13:23.458844
Duration2.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:13:23.557063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T15:13:23.760872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

택시아이디
Categorical

HIGH CORRELATION 

Distinct46
Distinct (%)46.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
T_47132284
 
5
T_24823989
 
3
T_98047829
 
3
T_73468981
 
3
T_49476082
 
3
Other values (41)
83 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowT_99659190
2nd rowT_47278771
3rd rowT_48743645
4th rowT_96070250
5th rowT_97974586

Common Values

ValueCountFrequency (%)
T_47132284 5
 
5.0%
T_24823989 3
 
3.0%
T_98047829 3
 
3.0%
T_73468981 3
 
3.0%
T_49476082 3
 
3.0%
T_49988787 3
 
3.0%
T_48816888 2
 
2.0%
T_96070250 2
 
2.0%
T_97974586 2
 
2.0%
T_72223838 2
 
2.0%
Other values (36) 72
72.0%

Length

2023-12-10T15:13:23.915751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
t_47132284 5
 
5.0%
t_98047829 3
 
3.0%
t_73468981 3
 
3.0%
t_49476082 3
 
3.0%
t_49988787 3
 
3.0%
t_24823989 3
 
3.0%
t_72883031 2
 
2.0%
t_72297082 2
 
2.0%
t_21747755 2
 
2.0%
t_22773166 2
 
2.0%
Other values (36) 72
72.0%

위도값
Real number (ℝ)

HIGH CORRELATION 

Distinct90
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.526705
Minimum37.430637
Maximum37.652752
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:13:24.064369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.430637
5-th percentile37.466428
Q137.489065
median37.518619
Q337.56588
95-th percentile37.605349
Maximum37.652752
Range0.222115
Interquartile range (IQR)0.076815

Descriptive statistics

Standard deviation0.050030662
Coefficient of variation (CV)0.0013332016
Kurtosis-0.58461713
Mean37.526705
Median Absolute Deviation (MAD)0.0380435
Skewness0.26499014
Sum3752.6705
Variance0.0025030671
MonotonicityNot monotonic
2023-12-10T15:13:24.266049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.475964 3
 
3.0%
37.434925 3
 
3.0%
37.52096 2
 
2.0%
37.565876 2
 
2.0%
37.594215 2
 
2.0%
37.4938 2
 
2.0%
37.56588 2
 
2.0%
37.55336 2
 
2.0%
37.490456 1
 
1.0%
37.4947 1
 
1.0%
Other values (80) 80
80.0%
ValueCountFrequency (%)
37.430637 1
 
1.0%
37.43064 1
 
1.0%
37.434925 3
3.0%
37.468086 1
 
1.0%
37.4681 1
 
1.0%
37.468117 1
 
1.0%
37.469578 1
 
1.0%
37.469692 1
 
1.0%
37.471317 1
 
1.0%
37.471348 1
 
1.0%
ValueCountFrequency (%)
37.652752 1
1.0%
37.652744 1
1.0%
37.61152 1
1.0%
37.6114 1
1.0%
37.605473 1
1.0%
37.605343 1
1.0%
37.599495 1
1.0%
37.59949 1
1.0%
37.599487 1
1.0%
37.594215 2
2.0%

경도값
Real number (ℝ)

HIGH CORRELATION 

Distinct74
Distinct (%)74.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.00868
Minimum126.83659
Maximum127.25926
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:13:24.831616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.83659
5-th percentile126.89151
Q1126.97607
median127.00986
Q3127.04314
95-th percentile127.10415
Maximum127.25926
Range0.422674
Interquartile range (IQR)0.067061

Descriptive statistics

Standard deviation0.071681869
Coefficient of variation (CV)0.00056438558
Kurtosis2.1094746
Mean127.00868
Median Absolute Deviation (MAD)0.033781
Skewness0.57470914
Sum12700.868
Variance0.0051382903
MonotonicityNot monotonic
2023-12-10T15:13:25.092097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.98782 5
 
5.0%
126.98166 3
 
3.0%
127.0619 3
 
3.0%
126.99694 3
 
3.0%
127.05532 2
 
2.0%
126.9207 2
 
2.0%
127.0862 2
 
2.0%
127.01416 2
 
2.0%
127.07274 2
 
2.0%
126.905 2
 
2.0%
Other values (64) 74
74.0%
ValueCountFrequency (%)
126.836586 2
2.0%
126.88755 1
1.0%
126.887596 1
1.0%
126.89137 1
1.0%
126.89152 1
1.0%
126.89854 2
2.0%
126.905 2
2.0%
126.9207 2
2.0%
126.94429 2
2.0%
126.94505 1
1.0%
ValueCountFrequency (%)
127.25926 1
1.0%
127.25925 1
1.0%
127.15487 1
1.0%
127.15483 1
1.0%
127.10418 1
1.0%
127.10415 1
1.0%
127.10293 1
1.0%
127.1028 1
1.0%
127.0897 2
2.0%
127.08865 1
1.0%

총휘발성유기화합물값
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct18
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5269
Minimum0
Maximum63620
Zeros61
Zeros (%)61.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:13:25.325097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q33860
95-th percentile31924
Maximum63620
Range63620
Interquartile range (IQR)3860

Descriptive statistics

Standard deviation12666.508
Coefficient of variation (CV)2.4039682
Kurtosis9.8012427
Mean5269
Median Absolute Deviation (MAD)0
Skewness3.1271688
Sum526900
Variance1.6044043 × 108
MonotonicityNot monotonic
2023-12-10T15:13:25.572228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
0 61
61.0%
3860 5
 
5.0%
3940 3
 
3.0%
13840 3
 
3.0%
5080 3
 
3.0%
30972 2
 
2.0%
1680 2
 
2.0%
3740 2
 
2.0%
44024 2
 
2.0%
2640 2
 
2.0%
Other values (8) 15
 
15.0%
ValueCountFrequency (%)
0 61
61.0%
240 2
 
2.0%
520 2
 
2.0%
1680 2
 
2.0%
2640 2
 
2.0%
3320 2
 
2.0%
3740 2
 
2.0%
3860 5
 
5.0%
3940 3
 
3.0%
4084 2
 
2.0%
ValueCountFrequency (%)
63620 2
2.0%
44024 2
2.0%
39220 1
 
1.0%
31540 2
2.0%
30972 2
2.0%
13840 3
3.0%
13520 2
2.0%
5080 3
3.0%
4084 2
2.0%
3940 3
3.0%

측정시각
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2021-01-01 00:00:00
47 
2021-01-01 00:00:01
47 
2021-01-01 00:00:02

Length

Max length19
Median length19
Mean length19
Min length19

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-01-01 00:00:00
2nd row2021-01-01 00:00:00
3rd row2021-01-01 00:00:00
4th row2021-01-01 00:00:00
5th row2021-01-01 00:00:00

Common Values

ValueCountFrequency (%)
2021-01-01 00:00:00 47
47.0%
2021-01-01 00:00:01 47
47.0%
2021-01-01 00:00:02 6
 
6.0%

Length

2023-12-10T15:13:25.760443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:13:25.916495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-01-01 100
50.0%
00:00:00 47
23.5%
00:00:01 47
23.5%
00:00:02 6
 
3.0%

Interactions

2023-12-10T15:13:22.792262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:21.522363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:21.953322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:22.358516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:22.919088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:21.632069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:22.065030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:22.479152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:23.012932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:21.744290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:22.156771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:22.586419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:23.123485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:21.844019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:22.250137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:13:22.683592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:13:26.070809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번택시아이디위도값경도값총휘발성유기화합물값측정시각
순번1.0000.0000.0000.0000.0000.892
택시아이디0.0001.0001.0001.0000.9950.000
위도값0.0001.0001.0000.6320.6370.000
경도값0.0001.0000.6321.0000.7310.000
총휘발성유기화합물값0.0000.9950.6370.7311.0000.000
측정시각0.8920.0000.0000.0000.0001.000
2023-12-10T15:13:26.298114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
택시아이디측정시각
택시아이디1.0000.000
측정시각0.0001.000
2023-12-10T15:13:26.446504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번위도값경도값총휘발성유기화합물값택시아이디측정시각
순번1.000-0.0080.0410.1190.0000.807
위도값-0.0081.0000.045-0.0610.7750.000
경도값0.0410.0451.0000.0530.7700.000
총휘발성유기화합물값0.119-0.0610.0531.0000.7240.000
택시아이디0.0000.7750.7700.7241.0000.000
측정시각0.8070.0000.0000.0000.0001.000

Missing values

2023-12-10T15:13:23.265960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:13:23.404116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번택시아이디위도값경도값총휘발성유기화합물값측정시각
01T_9965919037.52096127.01797315402021-01-01 00:00:00
12T_4727877137.6114127.0727402021-01-01 00:00:00
23T_4874364537.54574126.83658602021-01-01 00:00:00
34T_9607025037.55994127.1548302021-01-01 00:00:00
45T_9797458637.546444126.88759602021-01-01 00:00:00
56T_7222383837.487724126.9824702021-01-01 00:00:00
67T_4998878737.475964126.98166138402021-01-01 00:00:00
78T_9921972837.536297126.9631652402021-01-01 00:00:00
89T_2108856237.489605126.8915202021-01-01 00:00:00
910T_2475074637.57153127.0094440842021-01-01 00:00:00
순번택시아이디위도값경도값총휘발성유기화합물값측정시각
9091T_2482398937.59949127.061939402021-01-01 00:00:01
9192T_2372533437.57569126.979645202021-01-01 00:00:01
9293T_4713228437.56588126.9878238602021-01-01 00:00:01
9394T_7332249337.570114127.0102733202021-01-01 00:00:01
9495T_7346898137.500324126.9853402021-01-01 00:00:02
9596T_4947608237.468117127.0418650802021-01-01 00:00:02
9697T_4713228437.565884126.9878238602021-01-01 00:00:02
9798T_9804782937.434925126.9969402021-01-01 00:00:02
9899T_4998878737.475964126.98166138402021-01-01 00:00:02
99100T_2482398937.599495127.061939402021-01-01 00:00:02