Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.9 KiB
Average record size in memory70.3 B

Variable types

Categorical4
Numeric4

Alerts

지점 has constant value ""Constant
주소 has constant value ""Constant
측정시간 is highly overall correlated with 온도(℃) and 1 other fieldsHigh correlation
온도(℃) is highly overall correlated with 측정시간 and 2 other fieldsHigh correlation
습도(%) is highly overall correlated with 온도(℃) and 1 other fieldsHigh correlation
풍속(m/s) is highly overall correlated with 온도(℃) and 1 other fieldsHigh correlation
풍향 is highly overall correlated with 측정시간High correlation
측정일 is highly imbalanced (75.8%)Imbalance
측정시간 has 2 (2.0%) zerosZeros

Reproduction

Analysis started2023-12-10 12:41:40.458645
Analysis finished2023-12-10 12:41:42.345712
Duration1.89 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지점
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
A-0010-0083E-6
100 

Length

Max length14
Median length14
Mean length14
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowA-0010-0083E-6
2nd rowA-0010-0083E-6
3rd rowA-0010-0083E-6
4th rowA-0010-0083E-6
5th rowA-0010-0083E-6

Common Values

ValueCountFrequency (%)
A-0010-0083E-6 100
100.0%

Length

2023-12-10T21:41:42.404333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:41:42.487020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
a-0010-0083e-6 100
100.0%

측정일
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
20200501
96 
20200502
 
4

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20200501
2nd row20200501
3rd row20200501
4th row20200501
5th row20200501

Common Values

ValueCountFrequency (%)
20200501 96
96.0%
20200502 4
 
4.0%

Length

2023-12-10T21:41:42.569960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:41:42.655325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20200501 96
96.0%
20200502 4
 
4.0%

측정시간
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct96
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1146.75
Minimum0
Maximum2345
Zeros2
Zeros (%)2.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:41:42.795882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile97.25
Q1541.25
median1122.5
Q31733.75
95-th percentile2230.75
Maximum2345
Range2345
Interquartile range (IQR)1192.5

Descriptive statistics

Standard deviation700.21836
Coefficient of variation (CV)0.61061117
Kurtosis-1.1835169
Mean1146.75
Median Absolute Deviation (MAD)600
Skewness0.026925197
Sum114675
Variance490305.74
MonotonicityNot monotonic
2023-12-10T21:41:43.097667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2
 
2.0%
1000 2
 
2.0%
1015 2
 
2.0%
100 2
 
2.0%
2315 1
 
1.0%
400 1
 
1.0%
345 1
 
1.0%
330 1
 
1.0%
315 1
 
1.0%
300 1
 
1.0%
Other values (86) 86
86.0%
ValueCountFrequency (%)
0 2
2.0%
15 1
1.0%
30 1
1.0%
45 1
1.0%
100 2
2.0%
115 1
1.0%
130 1
1.0%
145 1
1.0%
200 1
1.0%
215 1
1.0%
ValueCountFrequency (%)
2345 1
1.0%
2330 1
1.0%
2315 1
1.0%
2300 1
1.0%
2245 1
1.0%
2230 1
1.0%
2215 1
1.0%
2200 1
1.0%
2145 1
1.0%
2130 1
1.0%

온도(℃)
Real number (ℝ)

HIGH CORRELATION 

Distinct58
Distinct (%)58.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.564
Minimum17
Maximum25.9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:41:43.226495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum17
5-th percentile17.1
Q117.775
median19.8
Q323.35
95-th percentile25.105
Maximum25.9
Range8.9
Interquartile range (IQR)5.575

Descriptive statistics

Standard deviation2.9282159
Coefficient of variation (CV)0.14239525
Kurtosis-1.4368298
Mean20.564
Median Absolute Deviation (MAD)2.5
Skewness0.31817104
Sum2056.4
Variance8.5744485
MonotonicityNot monotonic
2023-12-10T21:41:43.353070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
17.1 7
 
7.0%
17.2 6
 
6.0%
17.3 4
 
4.0%
17.9 4
 
4.0%
24.7 3
 
3.0%
23.2 3
 
3.0%
17.5 3
 
3.0%
18.2 3
 
3.0%
18.6 2
 
2.0%
18.9 2
 
2.0%
Other values (48) 63
63.0%
ValueCountFrequency (%)
17.0 1
 
1.0%
17.1 7
7.0%
17.2 6
6.0%
17.3 4
4.0%
17.5 3
3.0%
17.6 2
 
2.0%
17.7 2
 
2.0%
17.8 2
 
2.0%
17.9 4
4.0%
18.2 3
3.0%
ValueCountFrequency (%)
25.9 1
 
1.0%
25.6 1
 
1.0%
25.4 1
 
1.0%
25.3 1
 
1.0%
25.2 1
 
1.0%
25.1 1
 
1.0%
25.0 1
 
1.0%
24.9 1
 
1.0%
24.8 2
2.0%
24.7 3
3.0%

습도(%)
Real number (ℝ)

HIGH CORRELATION 

Distinct82
Distinct (%)82.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74.907
Minimum55.9
Maximum91
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:41:43.504277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum55.9
5-th percentile58
Q163.3
median77.85
Q385.075
95-th percentile86.9
Maximum91
Range35.1
Interquartile range (IQR)21.775

Descriptive statistics

Standard deviation10.976156
Coefficient of variation (CV)0.14653045
Kurtosis-1.4836583
Mean74.907
Median Absolute Deviation (MAD)8.5
Skewness-0.34967147
Sum7490.7
Variance120.47601
MonotonicityNot monotonic
2023-12-10T21:41:43.688183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
86.9 3
 
3.0%
86.6 3
 
3.0%
58.0 3
 
3.0%
86.1 2
 
2.0%
86.4 2
 
2.0%
86.3 2
 
2.0%
87.0 2
 
2.0%
84.4 2
 
2.0%
85.0 2
 
2.0%
86.5 2
 
2.0%
Other values (72) 77
77.0%
ValueCountFrequency (%)
55.9 1
 
1.0%
56.7 1
 
1.0%
56.9 1
 
1.0%
57.3 1
 
1.0%
58.0 3
3.0%
58.8 1
 
1.0%
58.9 2
2.0%
59.1 1
 
1.0%
59.2 1
 
1.0%
59.3 1
 
1.0%
ValueCountFrequency (%)
91.0 1
 
1.0%
87.1 1
 
1.0%
87.0 2
2.0%
86.9 3
3.0%
86.7 1
 
1.0%
86.6 3
3.0%
86.5 2
2.0%
86.4 2
2.0%
86.3 2
2.0%
86.2 1
 
1.0%

풍향
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
남서
74 
26 

Length

Max length2
Median length2
Mean length1.74
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남서
2nd row
3rd row남서
4th row남서
5th row남서

Common Values

ValueCountFrequency (%)
남서 74
74.0%
26
 
26.0%

Length

2023-12-10T21:41:43.835758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:41:43.938684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남서 74
74.0%
26
 
26.0%

풍속(m/s)
Real number (ℝ)

HIGH CORRELATION 

Distinct45
Distinct (%)45.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.326
Minimum2
Maximum7.2
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:41:44.099672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2.69
Q13.4
median4.1
Q35.1
95-th percentile6.6
Maximum7.2
Range5.2
Interquartile range (IQR)1.7

Descriptive statistics

Standard deviation1.1996313
Coefficient of variation (CV)0.27730727
Kurtosis-0.43782238
Mean4.326
Median Absolute Deviation (MAD)0.8
Skewness0.48889148
Sum432.6
Variance1.4391152
MonotonicityNot monotonic
2023-12-10T21:41:44.243592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
3.4 6
 
6.0%
4.0 6
 
6.0%
4.1 4
 
4.0%
3.2 4
 
4.0%
3.5 4
 
4.0%
2.9 4
 
4.0%
3.7 4
 
4.0%
3.8 4
 
4.0%
4.3 3
 
3.0%
3.3 3
 
3.0%
Other values (35) 58
58.0%
ValueCountFrequency (%)
2.0 1
 
1.0%
2.3 1
 
1.0%
2.4 2
2.0%
2.5 1
 
1.0%
2.7 2
2.0%
2.9 4
4.0%
3.0 1
 
1.0%
3.1 2
2.0%
3.2 4
4.0%
3.3 3
3.0%
ValueCountFrequency (%)
7.2 1
1.0%
7.1 1
1.0%
6.8 1
1.0%
6.7 1
1.0%
6.6 2
2.0%
6.5 1
1.0%
6.4 1
1.0%
6.3 1
1.0%
6.2 2
2.0%
6.0 1
1.0%

주소
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
경남 양산시 동면
100 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경남 양산시 동면
2nd row경남 양산시 동면
3rd row경남 양산시 동면
4th row경남 양산시 동면
5th row경남 양산시 동면

Common Values

ValueCountFrequency (%)
경남 양산시 동면 100
100.0%

Length

2023-12-10T21:41:44.372486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:41:44.460448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경남 100
33.3%
양산시 100
33.3%
동면 100
33.3%

Interactions

2023-12-10T21:41:41.814958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:40.705252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:41.058850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:41.465777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:41.890318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:40.783883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:41.150275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:41.549137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:41.964224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:40.884108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:41.241870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:41.629858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:42.042226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:40.980077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:41.350592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:41.725890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:41:44.516434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정일측정시간온도(℃)습도(%)풍향풍속(m/s)
측정일1.0000.2840.1700.6290.0000.000
측정시간0.2841.0000.9340.8940.6800.589
온도(℃)0.1700.9341.0000.9400.6050.623
습도(%)0.6290.8940.9401.0000.6530.658
풍향0.0000.6800.6050.6531.0000.284
풍속(m/s)0.0000.5890.6230.6580.2841.000
2023-12-10T21:41:44.640997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
풍향측정일
풍향1.0000.000
측정일0.0001.000
2023-12-10T21:41:44.746127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정시간온도(℃)습도(%)풍속(m/s)측정일풍향
측정시간1.0000.534-0.4430.0510.2070.507
온도(℃)0.5341.000-0.9810.6710.1210.448
습도(%)-0.443-0.9811.000-0.7270.4670.485
풍속(m/s)0.0510.671-0.7271.0000.0000.233
측정일0.2070.1210.4670.0001.0000.000
풍향0.5070.4480.4850.2330.0001.000

Missing values

2023-12-10T21:41:42.156115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:41:42.287954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지점측정일측정시간온도(℃)습도(%)풍향풍속(m/s)주소
0A-0010-0083E-620200501017.883.5남서3.4경남 양산시 동면
1A-0010-0083E-62020050110017.884.73.4경남 양산시 동면
2A-0010-0083E-620200501100022.666.8남서6.2경남 양산시 동면
3A-0010-0083E-620200501101523.264.0남서4.0경남 양산시 동면
4A-0010-0083E-620200501103023.563.3남서5.5경남 양산시 동면
5A-0010-0083E-620200501104523.961.7남서5.2경남 양산시 동면
6A-0010-0083E-620200501110023.263.3남서5.0경남 양산시 동면
7A-0010-0083E-620200501111523.961.6남서4.8경남 양산시 동면
8A-0010-0083E-620200501113024.061.6남서4.9경남 양산시 동면
9A-0010-0083E-620200501114523.263.0남서6.8경남 양산시 동면
지점측정일측정시간온도(℃)습도(%)풍향풍속(m/s)주소
90A-0010-0083E-62020050183019.676.14.7경남 양산시 동면
91A-0010-0083E-62020050184520.175.0남서4.5경남 양산시 동면
92A-0010-0083E-62020050190020.673.3남서4.3경남 양산시 동면
93A-0010-0083E-62020050191521.171.35.2경남 양산시 동면
94A-0010-0083E-62020050193021.271.0남서5.2경남 양산시 동면
95A-0010-0083E-62020050194522.168.3남서4.1경남 양산시 동면
96A-0010-0083E-620200502018.286.4남서2.5경남 양산시 동면
97A-0010-0083E-62020050210017.591.0남서2.9경남 양산시 동면
98A-0010-0083E-620200502100022.670.4남서3.3경남 양산시 동면
99A-0010-0083E-620200502101522.969.6남서4.0경남 양산시 동면