Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.9 KiB
Average record size in memory70.3 B

Variable types

Categorical4
Numeric4

Alerts

지점 has constant value ""Constant
주소 has constant value ""Constant
측정시간 is highly overall correlated with 온도(℃)High correlation
온도(℃) is highly overall correlated with 측정시간High correlation
측정일 is highly imbalanced (75.8%)Imbalance
측정시간 has 2 (2.0%) zerosZeros
온도(℃) has 2 (2.0%) zerosZeros

Reproduction

Analysis started2023-12-10 12:42:03.208022
Analysis finished2023-12-10 12:42:05.063379
Duration1.86 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지점
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
A-0010-0083E-6
100 

Length

Max length14
Median length14
Mean length14
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowA-0010-0083E-6
2nd rowA-0010-0083E-6
3rd rowA-0010-0083E-6
4th rowA-0010-0083E-6
5th rowA-0010-0083E-6

Common Values

ValueCountFrequency (%)
A-0010-0083E-6 100
100.0%

Length

2023-12-10T21:42:05.139843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:42:05.243313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
a-0010-0083e-6 100
100.0%

측정일
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
20200101
96 
20200102
 
4

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20200101
2nd row20200101
3rd row20200101
4th row20200101
5th row20200101

Common Values

ValueCountFrequency (%)
20200101 96
96.0%
20200102 4
 
4.0%

Length

2023-12-10T21:42:05.371124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:42:05.481879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20200101 96
96.0%
20200102 4
 
4.0%

측정시간
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct96
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1146.75
Minimum0
Maximum2345
Zeros2
Zeros (%)2.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:42:05.628361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile97.25
Q1541.25
median1122.5
Q31733.75
95-th percentile2230.75
Maximum2345
Range2345
Interquartile range (IQR)1192.5

Descriptive statistics

Standard deviation700.21836
Coefficient of variation (CV)0.61061117
Kurtosis-1.1835169
Mean1146.75
Median Absolute Deviation (MAD)600
Skewness0.026925197
Sum114675
Variance490305.74
MonotonicityNot monotonic
2023-12-10T21:42:05.800436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2
 
2.0%
1000 2
 
2.0%
1015 2
 
2.0%
100 2
 
2.0%
2315 1
 
1.0%
400 1
 
1.0%
345 1
 
1.0%
330 1
 
1.0%
315 1
 
1.0%
300 1
 
1.0%
Other values (86) 86
86.0%
ValueCountFrequency (%)
0 2
2.0%
15 1
1.0%
30 1
1.0%
45 1
1.0%
100 2
2.0%
115 1
1.0%
130 1
1.0%
145 1
1.0%
200 1
1.0%
215 1
1.0%
ValueCountFrequency (%)
2345 1
1.0%
2330 1
1.0%
2315 1
1.0%
2300 1
1.0%
2245 1
1.0%
2230 1
1.0%
2215 1
1.0%
2200 1
1.0%
2145 1
1.0%
2130 1
1.0%

온도(℃)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct59
Distinct (%)59.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.297
Minimum-2.8
Maximum7.4
Zeros2
Zeros (%)2.0%
Negative30
Negative (%)30.0%
Memory size1.0 KiB
2023-12-10T21:42:05.985033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-2.8
5-th percentile-2.3
Q1-0.2
median2.25
Q35.225
95-th percentile7.2
Maximum7.4
Range10.2
Interquartile range (IQR)5.425

Descriptive statistics

Standard deviation3.1626755
Coefficient of variation (CV)1.3768722
Kurtosis-1.2692901
Mean2.297
Median Absolute Deviation (MAD)2.65
Skewness0.17527085
Sum229.7
Variance10.002516
MonotonicityNot monotonic
2023-12-10T21:42:06.182113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-0.2 4
 
4.0%
-2.4 3
 
3.0%
6.5 3
 
3.0%
7.2 3
 
3.0%
7.3 3
 
3.0%
2.3 3
 
3.0%
1.3 3
 
3.0%
0.4 3
 
3.0%
2.6 3
 
3.0%
-1.7 3
 
3.0%
Other values (49) 69
69.0%
ValueCountFrequency (%)
-2.8 1
 
1.0%
-2.4 3
3.0%
-2.3 2
2.0%
-2.1 2
2.0%
-2.0 2
2.0%
-1.7 3
3.0%
-1.6 1
 
1.0%
-1.4 2
2.0%
-1.1 1
 
1.0%
-1.0 1
 
1.0%
ValueCountFrequency (%)
7.4 1
 
1.0%
7.3 3
3.0%
7.2 3
3.0%
7.1 2
2.0%
7.0 1
 
1.0%
6.9 1
 
1.0%
6.8 2
2.0%
6.6 1
 
1.0%
6.5 3
3.0%
6.4 1
 
1.0%

습도(%)
Real number (ℝ)

Distinct89
Distinct (%)89.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44.044
Minimum19.2
Maximum64.8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:42:06.350545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19.2
5-th percentile21.8
Q137.25
median44.45
Q352.425
95-th percentile60.715
Maximum64.8
Range45.6
Interquartile range (IQR)15.175

Descriptive statistics

Standard deviation11.370708
Coefficient of variation (CV)0.25816701
Kurtosis-0.46505057
Mean44.044
Median Absolute Deviation (MAD)7.4
Skewness-0.38415344
Sum4404.4
Variance129.29299
MonotonicityNot monotonic
2023-12-10T21:42:06.787636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.4 2
 
2.0%
38.9 2
 
2.0%
44.3 2
 
2.0%
37.1 2
 
2.0%
47.8 2
 
2.0%
37.0 2
 
2.0%
21.8 2
 
2.0%
44.6 2
 
2.0%
50.1 2
 
2.0%
51.6 2
 
2.0%
Other values (79) 80
80.0%
ValueCountFrequency (%)
19.2 1
1.0%
20.0 1
1.0%
20.2 1
1.0%
20.4 1
1.0%
21.8 2
2.0%
23.6 1
1.0%
24.0 1
1.0%
24.4 1
1.0%
25.2 1
1.0%
26.9 1
1.0%
ValueCountFrequency (%)
64.8 1
1.0%
64.6 1
1.0%
63.9 1
1.0%
63.1 1
1.0%
61.0 1
1.0%
60.7 1
1.0%
60.4 1
1.0%
59.8 1
1.0%
59.7 1
1.0%
57.5 1
1.0%

풍향
Categorical

Distinct6
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
남서
32 
26 
22 
북서
11 
남동

Length

Max length2
Median length1
Mean length1.49
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row남서
3rd row남서
4th row북서
5th row

Common Values

ValueCountFrequency (%)
남서 32
32.0%
26
26.0%
22
22.0%
북서 11
 
11.0%
남동 6
 
6.0%
3
 
3.0%

Length

2023-12-10T21:42:06.976424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:42:07.091801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남서 32
32.0%
26
26.0%
22
22.0%
북서 11
 
11.0%
남동 6
 
6.0%
3
 
3.0%

풍속(m/s)
Real number (ℝ)

Distinct35
Distinct (%)35.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.137
Minimum0.1
Maximum5.1
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:42:07.198208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.1
5-th percentile0.5
Q11.4
median2.15
Q32.625
95-th percentile4.2
Maximum5.1
Range5
Interquartile range (IQR)1.225

Descriptive statistics

Standard deviation0.97893521
Coefficient of variation (CV)0.45808854
Kurtosis0.44787689
Mean2.137
Median Absolute Deviation (MAD)0.65
Skewness0.50569627
Sum213.7
Variance0.95831414
MonotonicityNot monotonic
2023-12-10T21:42:07.318408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
2.3 7
 
7.0%
2.5 6
 
6.0%
2.2 6
 
6.0%
1.4 6
 
6.0%
2.0 5
 
5.0%
3.1 5
 
5.0%
1.2 4
 
4.0%
2.7 4
 
4.0%
1.7 3
 
3.0%
2.4 3
 
3.0%
Other values (25) 51
51.0%
ValueCountFrequency (%)
0.1 1
 
1.0%
0.3 1
 
1.0%
0.4 2
 
2.0%
0.5 2
 
2.0%
0.9 3
3.0%
1.0 2
 
2.0%
1.1 3
3.0%
1.2 4
4.0%
1.3 3
3.0%
1.4 6
6.0%
ValueCountFrequency (%)
5.1 1
 
1.0%
4.6 1
 
1.0%
4.3 2
 
2.0%
4.2 2
 
2.0%
3.8 1
 
1.0%
3.6 1
 
1.0%
3.4 2
 
2.0%
3.3 1
 
1.0%
3.2 1
 
1.0%
3.1 5
5.0%

주소
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
경남 양산시 동면
100 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경남 양산시 동면
2nd row경남 양산시 동면
3rd row경남 양산시 동면
4th row경남 양산시 동면
5th row경남 양산시 동면

Common Values

ValueCountFrequency (%)
경남 양산시 동면 100
100.0%

Length

2023-12-10T21:42:07.435882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:42:07.520606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경남 100
33.3%
양산시 100
33.3%
동면 100
33.3%

Interactions

2023-12-10T21:42:04.503071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:03.480318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:03.841914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:04.157942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:04.574743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:03.569574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:03.915682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:04.240136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:04.655085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:03.662125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:03.996073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:04.326896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:04.737724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:03.751910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:04.079689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:42:04.415110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:42:07.580618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정일측정시간온도(℃)습도(%)풍향풍속(m/s)
측정일1.0000.2840.2970.0320.0000.000
측정시간0.2841.0000.9340.8920.6210.704
온도(℃)0.2970.9341.0000.9230.4630.459
습도(%)0.0320.8920.9231.0000.3900.531
풍향0.0000.6210.4630.3901.0000.459
풍속(m/s)0.0000.7040.4590.5310.4591.000
2023-12-10T21:42:07.705135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
풍향측정일
풍향1.0000.000
측정일0.0001.000
2023-12-10T21:42:07.825506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정시간온도(℃)습도(%)풍속(m/s)측정일풍향
측정시간1.0000.6210.438-0.1500.2070.378
온도(℃)0.6211.000-0.0570.1640.3050.265
습도(%)0.438-0.0571.000-0.4040.0000.210
풍속(m/s)-0.1500.164-0.4041.0000.0000.217
측정일0.2070.3050.0000.0001.0000.000
풍향0.3780.2650.2100.2170.0001.000

Missing values

2023-12-10T21:42:04.868219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:42:05.007640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지점측정일측정시간온도(℃)습도(%)풍향풍속(m/s)주소
0A-0010-0083E-6202001010-2.427.11.2경남 양산시 동면
1A-0010-0083E-620200101100-2.024.0남서1.6경남 양산시 동면
2A-0010-0083E-62020010110002.846.6남서1.1경남 양산시 동면
3A-0010-0083E-62020010110153.347.0북서0.9경남 양산시 동면
4A-0010-0083E-62020010110303.545.62.3경남 양산시 동면
5A-0010-0083E-62020010110454.343.81.5경남 양산시 동면
6A-0010-0083E-62020010111004.642.91.9경남 양산시 동면
7A-0010-0083E-62020010111154.841.6남서2.3경남 양산시 동면
8A-0010-0083E-62020010111304.840.4남서0.9경남 양산시 동면
9A-0010-0083E-62020010111455.638.91.5경남 양산시 동면
지점측정일측정시간온도(℃)습도(%)풍향풍속(m/s)주소
90A-0010-0083E-620200101830-0.559.8남서1.4경남 양산시 동면
91A-0010-0083E-620200101845-0.260.4남서2.3경남 양산시 동면
92A-0010-0083E-6202001019000.655.9남서2.3경남 양산시 동면
93A-0010-0083E-6202001019151.352.90.4경남 양산시 동면
94A-0010-0083E-6202001019301.950.12.2경남 양산시 동면
95A-0010-0083E-6202001019452.548.01.1경남 양산시 동면
96A-0010-0083E-62020010202.353.42.0경남 양산시 동면
97A-0010-0083E-6202001021002.254.0남서2.6경남 양산시 동면
98A-0010-0083E-62020010210004.750.3남서2.3경남 양산시 동면
99A-0010-0083E-62020010210155.250.12.8경남 양산시 동면