Overview

Dataset statistics

Number of variables10
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.8 KiB
Average record size in memory90.3 B

Variable types

Numeric1
Categorical8
DateTime1

Alerts

노드 아이디 has constant value ""Constant
co 측정값 has constant value ""Constant
온도 has constant value ""Constant
co2 측정값 is highly overall correlated with PM1.0 측정값 and 3 other fieldsHigh correlation
PM1.0 측정값 is highly overall correlated with co2 측정값 and 3 other fieldsHigh correlation
PM2.5 측정값 is highly overall correlated with co2 측정값 and 3 other fieldsHigh correlation
PM10 측정값 is highly overall correlated with co2 측정값 and 3 other fieldsHigh correlation
습도 is highly overall correlated with co2 측정값 and 3 other fieldsHigh correlation
co2 측정값 is highly imbalanced (85.9%)Imbalance
PM1.0 측정값 is highly imbalanced (85.9%)Imbalance
PM2.5 측정값 is highly imbalanced (85.9%)Imbalance
PM10 측정값 is highly imbalanced (85.9%)Imbalance
습도 is highly imbalanced (85.9%)Imbalance
시퀀스 has unique valuesUnique

Reproduction

Analysis started2023-12-10 13:31:45.224617
Analysis finished2023-12-10 13:31:46.156178
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시퀀스
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2758664
Minimum2750167
Maximum2767182
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:31:46.294420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2750167
5-th percentile2751019.5
Q12754394
median2758621
Q32762930.5
95-th percentile2766329.6
Maximum2767182
Range17015
Interquartile range (IQR)8536.5

Descriptive statistics

Standard deviation4988.4934
Coefficient of variation (CV)0.0018083005
Kurtosis-1.2030928
Mean2758664
Median Absolute Deviation (MAD)4311.5
Skewness0.0059271423
Sum2.758664 × 108
Variance24885067
MonotonicityStrictly increasing
2023-12-10T22:31:46.617999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2750167 1
 
1.0%
2761166 1
 
1.0%
2762888 1
 
1.0%
2762717 1
 
1.0%
2762544 1
 
1.0%
2762374 1
 
1.0%
2762197 1
 
1.0%
2762030 1
 
1.0%
2761860 1
 
1.0%
2761692 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
2750167 1
1.0%
2750340 1
1.0%
2750512 1
1.0%
2750687 1
1.0%
2750857 1
1.0%
2751028 1
1.0%
2751202 1
1.0%
2751374 1
1.0%
2751544 1
1.0%
2751721 1
1.0%
ValueCountFrequency (%)
2767182 1
1.0%
2767014 1
1.0%
2766838 1
1.0%
2766665 1
1.0%
2766493 1
1.0%
2766321 1
1.0%
2766150 1
1.0%
2765978 1
1.0%
2765807 1
1.0%
2765632 1
1.0%

노드 아이디
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
201101
100 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row201101
2nd row201101
3rd row201101
4th row201101
5th row201101

Common Values

ValueCountFrequency (%)
201101 100
100.0%

Length

2023-12-10T22:31:46.848826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:31:46.981194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
201101 100
100.0%

co 측정값
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0.3000000119
100 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.3000000119
2nd row0.3000000119
3rd row0.3000000119
4th row0.3000000119
5th row0.3000000119

Common Values

ValueCountFrequency (%)
0.3000000119 100
100.0%

Length

2023-12-10T22:31:47.145295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:31:47.285984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0.3000000119 100
100.0%

co2 측정값
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
432
98 
413
 
2

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row432
2nd row432
3rd row432
4th row432
5th row432

Common Values

ValueCountFrequency (%)
432 98
98.0%
413 2
 
2.0%

Length

2023-12-10T22:31:47.430453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:31:47.581676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
432 98
98.0%
413 2
 
2.0%

PM1.0 측정값
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
7
98 
5
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row7
2nd row7
3rd row7
4th row7
5th row7

Common Values

ValueCountFrequency (%)
7 98
98.0%
5 2
 
2.0%

Length

2023-12-10T22:31:47.729630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:31:47.864468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
7 98
98.0%
5 2
 
2.0%

PM2.5 측정값
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
10
98 
6
 
2

Length

Max length2
Median length2
Mean length1.98
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row10
2nd row10
3rd row10
4th row10
5th row10

Common Values

ValueCountFrequency (%)
10 98
98.0%
6 2
 
2.0%

Length

2023-12-10T22:31:48.367120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:31:48.513852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
10 98
98.0%
6 2
 
2.0%

PM10 측정값
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
12
98 
6
 
2

Length

Max length2
Median length2
Mean length1.98
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row12
2nd row12
3rd row12
4th row12
5th row12

Common Values

ValueCountFrequency (%)
12 98
98.0%
6 2
 
2.0%

Length

2023-12-10T22:31:48.665654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:31:48.834333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
12 98
98.0%
6 2
 
2.0%

온도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
30.600000381
100 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row30.600000381
2nd row30.600000381
3rd row30.600000381
4th row30.600000381
5th row30.600000381

Common Values

ValueCountFrequency (%)
30.600000381 100
100.0%

Length

2023-12-10T22:31:49.011601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:31:49.166357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
30.600000381 100
100.0%

습도
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
67.800003052
98 
67.300003052
 
2

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row67.800003052
2nd row67.800003052
3rd row67.800003052
4th row67.800003052
5th row67.800003052

Common Values

ValueCountFrequency (%)
67.800003052 98
98.0%
67.300003052 2
 
2.0%

Length

2023-12-10T22:31:49.327989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:31:49.495242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
67.800003052 98
98.0%
67.300003052 2
 
2.0%
Distinct40
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2022-07-01 09:34:42
Maximum2022-07-01 09:38:01
2023-12-10T22:31:49.652320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:31:49.871011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)

Interactions

2023-12-10T22:31:45.691846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:31:50.046067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시퀀스co2 측정값PM1.0 측정값PM2.5 측정값PM10 측정값습도기록 시간
시퀀스1.0000.4160.4160.4160.4160.4160.617
co2 측정값0.4161.0000.9190.9190.9190.9191.000
PM1.0 측정값0.4160.9191.0000.9190.9190.9191.000
PM2.5 측정값0.4160.9190.9191.0000.9190.9191.000
PM10 측정값0.4160.9190.9190.9191.0000.9191.000
습도0.4160.9190.9190.9190.9191.0001.000
기록 시간0.6171.0001.0001.0001.0001.0001.000
2023-12-10T22:31:50.234249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
co2 측정값PM1.0 측정값PM10 측정값습도PM2.5 측정값
co2 측정값1.0000.7420.7420.7420.742
PM1.0 측정값0.7421.0000.7420.7420.742
PM10 측정값0.7420.7421.0000.7420.742
습도0.7420.7420.7421.0000.742
PM2.5 측정값0.7420.7420.7420.7421.000
2023-12-10T22:31:50.401018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시퀀스co2 측정값PM1.0 측정값PM2.5 측정값PM10 측정값습도
시퀀스1.0000.3060.3060.3060.3060.306
co2 측정값0.3061.0000.7420.7420.7420.742
PM1.0 측정값0.3060.7421.0000.7420.7420.742
PM2.5 측정값0.3060.7420.7421.0000.7420.742
PM10 측정값0.3060.7420.7420.7421.0000.742
습도0.3060.7420.7420.7420.7421.000

Missing values

2023-12-10T22:31:45.891099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:31:46.091474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시퀀스노드 아이디co 측정값co2 측정값PM1.0 측정값PM2.5 측정값PM10 측정값온도습도기록 시간
027501672011010.34327101230.667.8000032022-07-01 09:34:42
127503402011010.34327101230.667.8000032022-07-01 09:34:44
227505122011010.34327101230.667.8000032022-07-01 09:34:46
327506872011010.34327101230.667.8000032022-07-01 09:34:48
427508572011010.34327101230.667.8000032022-07-01 09:34:50
527510282011010.34327101230.667.8000032022-07-01 09:34:52
627512022011010.34327101230.667.8000032022-07-01 09:34:54
727513742011010.34327101230.667.8000032022-07-01 09:34:56
827515442011010.34327101230.667.8000032022-07-01 09:34:58
927517212011010.34327101230.667.8000032022-07-01 09:35:00
시퀀스노드 아이디co 측정값co2 측정값PM1.0 측정값PM2.5 측정값PM10 측정값온도습도기록 시간
9027656322011010.34327101230.667.8000032022-07-01 09:35:57
9127658072011010.34327101230.667.8000032022-07-01 09:35:57
9227659782011010.34327101230.667.8000032022-07-01 09:35:57
9327661502011010.34327101230.667.8000032022-07-01 09:35:57
9427663212011010.34327101230.667.8000032022-07-01 09:35:57
9527664932011010.34327101230.667.8000032022-07-01 09:35:57
9627666652011010.34327101230.667.8000032022-07-01 09:35:57
9727668382011010.34327101230.667.8000032022-07-01 09:35:57
9827670142011010.341356630.667.3000032022-07-01 09:37:59
9927671822011010.341356630.667.3000032022-07-01 09:38:01