Overview

Dataset statistics

Number of variables: 8
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.9 KiB
Average record size in memory: 70.3 B

Variable types

Categorical: 4
Numeric: 4

Alerts

지점 has constant value "A-0010-0083E-6" (Constant)
주소 has constant value "경남 양산시 동면" (Constant)
측정시간 is highly overall correlated with 습도(%) (High correlation)
온도(℃) is highly overall correlated with 습도(%) and 1 other field (High correlation)
습도(%) is highly overall correlated with 측정시간 and 2 other fields (High correlation)
풍속(m/s) is highly overall correlated with 온도(℃) and 1 other field (High correlation)
측정일 is highly imbalanced (75.8%) (Imbalance)
측정시간 has 2 (2.0%) zeros (Zeros)
풍속(m/s) has 3 (3.0%) zeros (Zeros)
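
The 75.8% imbalance figure for 측정일 can be reproduced from its two class counts (96 and 4). The sketch below assumes the score is one minus the Shannon entropy of the class shares, normalised by the maximum entropy for that number of classes. The function name `imbalance_score` is made up for illustration and is not taken from the report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform split, near 1 when one class dominates.

    Assumption: entropy-based definition; `counts` holds one count per class.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # A single class has no entropy to normalise; treated as 0 in this sketch.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Class counts for 측정일 as listed in the report: 20200601 -> 96, 20200602 -> 4.
score = imbalance_score([96, 4])
print(f"{score:.1%}")  # prints 75.8%
```

A perfectly balanced column (for example 50 and 50) would score 0 under this definition.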

Reproduction

Analysis started: 2023-12-10 12:41:35.146600
Analysis finished: 2023-12-10 12:41:37.061387
Duration: 1.91 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
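
The reported duration follows directly from the two timestamps. A minimal check using only the standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-10 12:41:35.146600")
finished = datetime.fromisoformat("2023-12-10 12:41:37.061387")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # prints 1.91 seconds
```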

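Variables

Column names and category labels are Korean and are kept as they appear in the data. English glosses: 지점 (station), 측정일 (measurement date), 측정시간 (measurement time), 온도(℃) (temperature), 습도(%) (humidity), 풍향 (wind direction), 풍속(m/s) (wind speed), 주소 (address); 남서 (southwest), 북서 (northwest), 북동 (northeast), 남동 (southeast).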

지점
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 14
Median length: 14
Mean length: 14
Min length: 14

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: A-0010-0083E-6
2nd row: A-0010-0083E-6
3rd row: A-0010-0083E-6
4th row: A-0010-0083E-6
5th row: A-0010-0083E-6

Common Values

Value | Count | Frequency (%)
A-0010-0083E-6 | 100 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]
Value | Count | Frequency (%)
a-0010-0083e-6 | 100 | 100.0%

측정일
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20200601
2nd row: 20200601
3rd row: 20200601
4th row: 20200601
5th row: 20200601

Common Values

Value | Count | Frequency (%)
20200601 | 96 | 96.0%
20200602 | 4 | 4.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

측정시간
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 96
Distinct (%): 96.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1146.75
Minimum: 0
Maximum: 2345
Zeros: 2
Zeros (%): 2.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 97.25
Q1: 541.25
median: 1122.5
Q3: 1733.75
95-th percentile: 2230.75
Maximum: 2345
Range: 2345
Interquartile range (IQR): 1192.5

Descriptive statistics

Standard deviation: 700.21836
Coefficient of variation (CV): 0.61061117
Kurtosis: -1.1835169
Mean: 1146.75
Median Absolute Deviation (MAD): 600
Skewness: 0.026925197
Sum: 114675
Variance: 490305.74
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=50)]
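
Several of the figures reported for 측정시간 are derived from others, so they can be cross-checked without the raw data. The sketch below uses the standard definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, sum = mean × n). The variable names are illustrative, and small differences are expected because the report rounds its values.

```python
import math

# Values copied from the report for 측정시간.
n = 100
mean = 1146.75
std = 700.21836
q1, q3 = 541.25, 1733.75
minimum, maximum = 0, 2345

cv = std / mean              # coefficient of variation
variance = std ** 2          # variance from the (rounded) standard deviation
iqr = q3 - q1                # interquartile range
value_range = maximum - minimum
total = mean * n             # sum of all observations

# Compare with the reported figures, allowing for rounding in the report.
assert math.isclose(cv, 0.61061117, rel_tol=1e-6)
assert math.isclose(variance, 490305.74, rel_tol=1e-6)
assert iqr == 1192.5
assert value_range == 2345
assert math.isclose(total, 114675)
```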
Common Values

Value | Count | Frequency (%)
0 | 2 | 2.0%
1000 | 2 | 2.0%
1015 | 2 | 2.0%
100 | 2 | 2.0%
2315 | 1 | 1.0%
400 | 1 | 1.0%
345 | 1 | 1.0%
330 | 1 | 1.0%
315 | 1 | 1.0%
300 | 1 | 1.0%
Other values (86) | 86 | 86.0%

Minimum 10 values

Value | Count | Frequency (%)
0 | 2 | 2.0%
15 | 1 | 1.0%
30 | 1 | 1.0%
45 | 1 | 1.0%
100 | 2 | 2.0%
115 | 1 | 1.0%
130 | 1 | 1.0%
145 | 1 | 1.0%
200 | 1 | 1.0%
215 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
2345 | 1 | 1.0%
2330 | 1 | 1.0%
2315 | 1 | 1.0%
2300 | 1 | 1.0%
2245 | 1 | 1.0%
2230 | 1 | 1.0%
2215 | 1 | 1.0%
2200 | 1 | 1.0%
2145 | 1 | 1.0%
2130 | 1 | 1.0%
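
The values listed for 측정시간 (0, 15, 30, 45, 100, 115, ..., 2345) look like clock times written as HHMM integers at 15-minute steps. This is a reading of the values, not something the report states. If it holds, the two zeros are midnight readings, and statistics such as the mean are computed on the encoded integers rather than on elapsed time. A minimal sketch of a conversion to minutes since midnight, with the helper name `hhmm_to_minutes` chosen here for illustration:

```python
def hhmm_to_minutes(value):
    """Convert an HHMM-encoded integer (e.g. 2345) to minutes since midnight.

    Assumption: the column stores clock times as integers, so 100 means 01:00.
    """
    hours, minutes = divmod(int(value), 100)
    if not (0 <= hours <= 23 and 0 <= minutes <= 59):
        raise ValueError(f"not a valid HHMM value: {value!r}")
    return hours * 60 + minutes


# Values taken from the extreme-value tables above.
for raw in (0, 45, 100, 2345):
    print(raw, "->", hhmm_to_minutes(raw))
```

On this scale consecutive readings are evenly spaced (45 and 100 become 45 and 60), which the raw integers are not.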

온도(℃)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 63
Distinct (%): 63.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 23.809
Minimum: 18
Maximum: 31.2
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 18
5-th percentile: 18.695
Q1: 19.5
median: 23.1
Q3: 27.95
95-th percentile: 30.6
Maximum: 31.2
Range: 13.2
Interquartile range (IQR): 8.45

Descriptive statistics

Standard deviation: 4.445656
Coefficient of variation (CV): 0.18672166
Kurtosis: -1.548012
Mean: 23.809
Median Absolute Deviation (MAD): 3.75
Skewness: 0.25709361
Sum: 2380.9
Variance: 19.763858
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=50)]
Common Values

Value | Count | Frequency (%)
19.8 | 7 | 7.0%
19.4 | 5 | 5.0%
19.5 | 4 | 4.0%
18.7 | 4 | 4.0%
30.4 | 3 | 3.0%
19.2 | 3 | 3.0%
18.6 | 3 | 3.0%
19.3 | 2 | 2.0%
20.1 | 2 | 2.0%
24.3 | 2 | 2.0%
Other values (53) | 65 | 65.0%
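
The "Other values" row is a remainder: it covers every distinct value outside the ten listed and every observation those values account for. A small sketch of that arithmetic for 온도(℃), using the counts above together with the reported 63 distinct values and 100 observations (the function name is illustrative):

```python
def other_values_row(top_counts, n_rows, n_distinct):
    """Return (distinct values, observations, share) not covered by the listed top values."""
    other_distinct = n_distinct - len(top_counts)
    other_count = n_rows - sum(top_counts)
    return other_distinct, other_count, other_count / n_rows


# Counts of the ten most common temperatures, in the order listed.
top10 = [7, 5, 4, 4, 3, 3, 3, 2, 2, 2]
distinct, count, share = other_values_row(top10, n_rows=100, n_distinct=63)
print(f"Other values ({distinct}) {count} {share:.1%}")  # prints Other values (53) 65 65.0%
```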

Minimum 10 values

Value | Count | Frequency (%)
18.0 | 1 | 1.0%
18.1 | 1 | 1.0%
18.6 | 3 | 3.0%
18.7 | 4 | 4.0%
18.8 | 1 | 1.0%
18.9 | 1 | 1.0%
19.0 | 1 | 1.0%
19.1 | 1 | 1.0%
19.2 | 3 | 3.0%
19.3 | 2 | 2.0%

Maximum 10 values

Value | Count | Frequency (%)
31.2 | 1 | 1.0%
30.8 | 1 | 1.0%
30.7 | 2 | 2.0%
30.6 | 2 | 2.0%
30.4 | 3 | 3.0%
30.2 | 1 | 1.0%
30.1 | 1 | 1.0%
30.0 | 1 | 1.0%
29.9 | 1 | 1.0%
29.8 | 2 | 2.0%

습도(%)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 89
Distinct (%): 89.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 59.424
Minimum: 28.8
Maximum: 92.6
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 28.8
5-th percentile: 30.185
Q1: 35.825
median: 56
Q3: 87.475
95-th percentile: 91.205
Maximum: 92.6
Range: 63.8
Interquartile range (IQR): 51.65

Descriptive statistics

Standard deviation: 23.502186
Coefficient of variation (CV): 0.3954999
Kurtosis: -1.5685648
Mean: 59.424
Median Absolute Deviation (MAD): 23.1
Skewness: 0.16030044
Sum: 5942.4
Variance: 552.35275
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=50)]
Common Values

Value | Count | Frequency (%)
30.5 | 3 | 3.0%
88.4 | 2 | 2.0%
56.0 | 2 | 2.0%
91.3 | 2 | 2.0%
29.9 | 2 | 2.0%
91.2 | 2 | 2.0%
30.8 | 2 | 2.0%
33.0 | 2 | 2.0%
38.6 | 2 | 2.0%
88.7 | 2 | 2.0%
Other values (79) | 79 | 79.0%

Minimum 10 values

Value | Count | Frequency (%)
28.8 | 1 | 1.0%
29.5 | 1 | 1.0%
29.8 | 1 | 1.0%
29.9 | 2 | 2.0%
30.2 | 1 | 1.0%
30.4 | 1 | 1.0%
30.5 | 3 | 3.0%
30.8 | 2 | 2.0%
31.0 | 1 | 1.0%
31.2 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
92.6 | 1 | 1.0%
92.2 | 1 | 1.0%
92.1 | 1 | 1.0%
91.3 | 2 | 2.0%
91.2 | 2 | 2.0%
91.1 | 1 | 1.0%
90.7 | 1 | 1.0%
90.5 | 1 | 1.0%
90.3 | 1 | 1.0%
90.2 | 1 | 1.0%

풍향
Categorical

Distinct: 8
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 2
Median length: 1
Mean length: 1.43
Min length: 1
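
The mean and median lengths can be recovered from the category counts alone. In this copy of the report four category labels are missing from the text; the sketch below assumes each of them is a single character, which is consistent with the reported minimum and median length of 1. The variable names are illustrative.

```python
import statistics

# Two-character labels and their counts, as listed under Common Values.
two_char_counts = {"남서": 26, "북서": 11, "북동": 4, "남동": 2}
# Counts of the four categories whose labels are missing in this copy.
# Assumption: each of those labels is one character long.
one_char_counts = [37, 17, 2, 1]

# One length per observation.
lengths = []
for label, count in two_char_counts.items():
    lengths.extend([len(label)] * count)
lengths.extend([1] * sum(one_char_counts))

mean_length = sum(lengths) / len(lengths)
median_length = statistics.median(lengths)
print(len(lengths), mean_length, median_length)
```

With 57 one-character and 43 two-character observations, the mean is (57 + 2 × 43) / 100 = 1.43 and the median is 1, matching the report.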

Unique

Unique: 1
Unique (%): 1.0%

Sample

1st row: [illegible]
2nd row: 남서
3rd row: 남서
4th row: [illegible]
5th row: [illegible]

Common Values

Value | Count | Frequency (%)
[illegible] | 37 | 37.0%
남서 | 26 | 26.0%
[illegible] | 17 | 17.0%
북서 | 11 | 11.0%
북동 | 4 | 4.0%
[illegible] | 2 | 2.0%
남동 | 2 | 2.0%
[illegible] | 1 | 1.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

풍속(m/s)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 42
Distinct (%): 42.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.121
Minimum: 0
Maximum: 5.5
Zeros: 3
Zeros (%): 3.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0.4
Q1: 1.275
median: 1.9
Q3: 2.8
95-th percentile: 4.71
Maximum: 5.5
Range: 5.5
Interquartile range (IQR): 1.525

Descriptive statistics

Standard deviation: 1.2443241
Coefficient of variation (CV): 0.58666859
Kurtosis: 0.18549084
Mean: 2.121
Median Absolute Deviation (MAD): 0.75
Skewness: 0.68877208
Sum: 212.1
Variance: 1.5483424
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=42)]
Common Values

Value | Count | Frequency (%)
1.8 | 6 | 6.0%
2.0 | 4 | 4.0%
1.7 | 4 | 4.0%
1.1 | 4 | 4.0%
1.3 | 4 | 4.0%
0.8 | 4 | 4.0%
1.9 | 4 | 4.0%
0.7 | 4 | 4.0%
2.8 | 4 | 4.0%
2.6 | 4 | 4.0%
Other values (32) | 58 | 58.0%

Minimum 10 values

Value | Count | Frequency (%)
0.0 | 3 | 3.0%
0.4 | 4 | 4.0%
0.5 | 1 | 1.0%
0.6 | 1 | 1.0%
0.7 | 4 | 4.0%
0.8 | 4 | 4.0%
0.9 | 1 | 1.0%
1.1 | 4 | 4.0%
1.2 | 3 | 3.0%
1.3 | 4 | 4.0%

Maximum 10 values

Value | Count | Frequency (%)
5.5 | 1 | 1.0%
5.4 | 1 | 1.0%
5.0 | 1 | 1.0%
4.9 | 2 | 2.0%
4.7 | 1 | 1.0%
4.2 | 2 | 2.0%
4.1 | 1 | 1.0%
4.0 | 1 | 1.0%
3.9 | 1 | 1.0%
3.8 | 1 | 1.0%

주소
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경남 양산시 동면
2nd row: 경남 양산시 동면
3rd row: 경남 양산시 동면
4th row: 경남 양산시 동면
5th row: 경남 양산시 동면

Common Values

Value | Count | Frequency (%)
경남 양산시 동면 | 100 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]
Value | Count | Frequency (%)
경남 | 100 | 33.3%
양산시 | 100 | 33.3%
동면 | 100 | 33.3%

Interactions

[Figures: 16 pairwise interaction plots]

Correlations

[Figure: correlation heatmap]
Variable | 측정일 | 측정시간 | 온도(℃) | 습도(%) | 풍향 | 풍속(m/s)
측정일 | 1.000 | 0.284 | 0.325 | 0.172 | 0.000 | 0.000
측정시간 | 0.284 | 1.000 | 0.912 | 0.916 | 0.620 | 0.735
온도(℃) | 0.325 | 0.912 | 1.000 | 0.919 | 0.570 | 0.749
습도(%) | 0.172 | 0.916 | 0.919 | 1.000 | 0.514 | 0.621
풍향 | 0.000 | 0.620 | 0.570 | 0.514 | 1.000 | 0.387
풍속(m/s) | 0.000 | 0.735 | 0.749 | 0.621 | 0.387 | 1.000

[Figure: correlation heatmap]
Variable | 풍향 | 측정일
풍향 | 1.000 | 0.000
측정일 | 0.000 | 1.000

[Figure: correlation heatmap]
Variable | 측정시간 | 온도(℃) | 습도(%) | 풍속(m/s) | 측정일 | 풍향
측정시간 | 1.000 | 0.449 | -0.729 | 0.109 | 0.207 | 0.350
온도(℃) | 0.449 | 1.000 | -0.833 | 0.697 | 0.237 | 0.311
습도(%) | -0.729 | -0.833 | 1.000 | -0.579 | 0.123 | 0.271
풍속(m/s) | 0.109 | 0.697 | -0.579 | 1.000 | 0.000 | 0.185
측정일 | 0.207 | 0.237 | 0.123 | 0.000 | 1.000 | 0.000
풍향 | 0.350 | 0.311 | 0.271 | 0.185 | 0.000 | 1.000
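
The High correlation alerts in the Overview line up with the last matrix in this section. The sketch below flags every pair whose absolute coefficient reaches a cutoff. The cutoff of 0.5 is an assumption chosen because it reproduces the four alerts; the report itself does not state it. The function name is illustrative.

```python
# Variables and coefficients copied from the last correlation matrix above.
names = ["측정시간", "온도(℃)", "습도(%)", "풍속(m/s)", "측정일", "풍향"]
matrix = [
    [1.000, 0.449, -0.729, 0.109, 0.207, 0.350],
    [0.449, 1.000, -0.833, 0.697, 0.237, 0.311],
    [-0.729, -0.833, 1.000, -0.579, 0.123, 0.271],
    [0.109, 0.697, -0.579, 1.000, 0.000, 0.185],
    [0.207, 0.237, 0.123, 0.000, 1.000, 0.000],
    [0.350, 0.311, 0.271, 0.185, 0.000, 1.000],
]
THRESHOLD = 0.5  # assumed cutoff, not stated in the report


def high_correlation_partners(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    partners = {}
    for i, name in enumerate(names):
        hits = [
            names[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        if hits:
            partners[name] = hits
    return partners


partners = high_correlation_partners(names, matrix, THRESHOLD)
for name, hits in partners.items():
    print(name, "->", ", ".join(hits))
```

Under this cutoff 측정시간 pairs only with 습도(%), 온도(℃) with two variables, 습도(%) with three, and 풍속(m/s) with two, which is the pattern the alerts describe. 측정일 and 풍향 have no partner at or above the cutoff.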

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

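First rows

Row | 지점 | 측정일 | 측정시간 | 온도(℃) | 습도(%) | 풍향 | 풍속(m/s) | 주소
0 | A-0010-0083E-6 | 20200601 | 0 | 20.0 | 85.9 | [illegible] | 2.1 | 경남 양산시 동면
1 | A-0010-0083E-6 | 20200601 | 100 | 19.8 | 88.0 | 남서 | 1.8 | 경남 양산시 동면
2 | A-0010-0083E-6 | 20200601 | 1000 | 24.9 | 64.4 | 남서 | 1.5 | 경남 양산시 동면
3 | A-0010-0083E-6 | 20200601 | 1015 | 25.6 | 61.4 | [illegible] | 1.7 | 경남 양산시 동면
4 | A-0010-0083E-6 | 20200601 | 1030 | 25.6 | 60.0 | [illegible] | 1.1 | 경남 양산시 동면
5 | A-0010-0083E-6 | 20200601 | 1045 | 26.3 | 57.2 | [illegible] | 2.4 | 경남 양산시 동면
6 | A-0010-0083E-6 | 20200601 | 1100 | 26.6 | 55.2 | [illegible] | 1.3 | 경남 양산시 동면
7 | A-0010-0083E-6 | 20200601 | 1115 | 27.9 | 52.5 | 북서 | 0.8 | 경남 양산시 동면
8 | A-0010-0083E-6 | 20200601 | 1130 | 27.8 | 51.9 | 북서 | 1.9 | 경남 양산시 동면
9 | A-0010-0083E-6 | 20200601 | 1145 | 27.6 | 52.9 | 북서 | 2.3 | 경남 양산시 동면

Last rows

Row | 지점 | 측정일 | 측정시간 | 온도(℃) | 습도(%) | 풍향 | 풍속(m/s) | 주소
90 | A-0010-0083E-6 | 20200601 | 830 | 22.7 | 71.0 | 북서 | 2.3 | 경남 양산시 동면
91 | A-0010-0083E-6 | 20200601 | 845 | 22.6 | 69.2 | [illegible] | 0.4 | 경남 양산시 동면
92 | A-0010-0083E-6 | 20200601 | 900 | 22.8 | 70.1 | [illegible] | 1.4 | 경남 양산시 동면
93 | A-0010-0083E-6 | 20200601 | 915 | 23.3 | 69.4 | 북서 | 1.3 | 경남 양산시 동면
94 | A-0010-0083E-6 | 20200601 | 930 | 24.3 | 66.7 | [illegible] | 0.4 | 경남 양산시 동면
95 | A-0010-0083E-6 | 20200601 | 945 | 24.4 | 64.8 | 남서 | 1.1 | 경남 양산시 동면
96 | A-0010-0083E-6 | 20200602 | 0 | 18.6 | 56.0 | 남서 | 1.1 | 경남 양산시 동면
97 | A-0010-0083E-6 | 20200602 | 100 | 18.6 | 56.5 | [illegible] | 1.8 | 경남 양산시 동면
98 | A-0010-0083E-6 | 20200602 | 1000 | 25.2 | 30.8 | 남서 | 2.4 | 경남 양산시 동면
99 | A-0010-0083E-6 | 20200602 | 1015 | 25.5 | 33.6 | [illegible] | 2.7 | 경남 양산시 동면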