Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.9 KiB
Average record size in memory70.3 B

Variable types

Categorical4
Numeric4

Alerts

지점 has constant value ""Constant
주소 has constant value ""Constant
온도(℃) is highly overall correlated with 습도(%)High correlation
습도(%) is highly overall correlated with 온도(℃)High correlation
측정일 is highly imbalanced (75.8%)Imbalance
측정시간 has 2 (2.0%) zerosZeros
풍속(m/s) has 7 (7.0%) zerosZeros

Reproduction

Analysis started2023-12-10 12:41:46.207492
Analysis finished2023-12-10 12:41:48.628914
Duration2.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지점
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
A-0010-0083E-6
100 

Length

Max length14
Median length14
Mean length14
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowA-0010-0083E-6
2nd rowA-0010-0083E-6
3rd rowA-0010-0083E-6
4th rowA-0010-0083E-6
5th rowA-0010-0083E-6

Common Values

ValueCountFrequency (%)
A-0010-0083E-6 100
100.0%

Length

2023-12-10T21:41:48.715972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:41:48.818514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
a-0010-0083e-6 100
100.0%

측정일
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
20200401
96 
20200402
 
4

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20200401
2nd row20200401
3rd row20200401
4th row20200401
5th row20200401

Common Values

ValueCountFrequency (%)
20200401 96
96.0%
20200402 4
 
4.0%

Length

2023-12-10T21:41:48.959733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:41:49.069539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20200401 96
96.0%
20200402 4
 
4.0%

측정시간
Real number (ℝ)

ZEROS 

Distinct96
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1146.75
Minimum0
Maximum2345
Zeros2
Zeros (%)2.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:41:49.187065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile97.25
Q1541.25
median1122.5
Q31733.75
95-th percentile2230.75
Maximum2345
Range2345
Interquartile range (IQR)1192.5

Descriptive statistics

Standard deviation700.21836
Coefficient of variation (CV)0.61061117
Kurtosis-1.1835169
Mean1146.75
Median Absolute Deviation (MAD)600
Skewness0.026925197
Sum114675
Variance490305.74
MonotonicityNot monotonic
2023-12-10T21:41:49.313760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2
 
2.0%
1000 2
 
2.0%
1015 2
 
2.0%
100 2
 
2.0%
2315 1
 
1.0%
400 1
 
1.0%
345 1
 
1.0%
330 1
 
1.0%
315 1
 
1.0%
300 1
 
1.0%
Other values (86) 86
86.0%
ValueCountFrequency (%)
0 2
2.0%
15 1
1.0%
30 1
1.0%
45 1
1.0%
100 2
2.0%
115 1
1.0%
130 1
1.0%
145 1
1.0%
200 1
1.0%
215 1
1.0%
ValueCountFrequency (%)
2345 1
1.0%
2330 1
1.0%
2315 1
1.0%
2300 1
1.0%
2245 1
1.0%
2230 1
1.0%
2215 1
1.0%
2200 1
1.0%
2145 1
1.0%
2130 1
1.0%

온도(℃)
Real number (ℝ)

HIGH CORRELATION 

Distinct62
Distinct (%)62.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.005
Minimum8.6
Maximum19.4
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:41:49.484383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8.6
5-th percentile9.795
Q110.2
median11.55
Q315.725
95-th percentile18.905
Maximum19.4
Range10.8
Interquartile range (IQR)5.525

Descriptive statistics

Standard deviation3.1847609
Coefficient of variation (CV)0.24488742
Kurtosis-1.0157971
Mean13.005
Median Absolute Deviation (MAD)1.75
Skewness0.6294371
Sum1300.5
Variance10.142702
MonotonicityNot monotonic
2023-12-10T21:41:49.643133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9.8 7
 
7.0%
10.7 5
 
5.0%
9.9 5
 
5.0%
10.1 4
 
4.0%
10.9 3
 
3.0%
11.1 3
 
3.0%
10.3 3
 
3.0%
10.0 3
 
3.0%
10.2 2
 
2.0%
15.8 2
 
2.0%
Other values (52) 63
63.0%
ValueCountFrequency (%)
8.6 1
 
1.0%
9.5 2
 
2.0%
9.6 1
 
1.0%
9.7 1
 
1.0%
9.8 7
7.0%
9.9 5
5.0%
10.0 3
3.0%
10.1 4
4.0%
10.2 2
 
2.0%
10.3 3
3.0%
ValueCountFrequency (%)
19.4 1
1.0%
19.3 1
1.0%
19.2 1
1.0%
19.1 1
1.0%
19.0 1
1.0%
18.9 2
2.0%
18.7 1
1.0%
18.6 1
1.0%
18.1 1
1.0%
18.0 2
2.0%

습도(%)
Real number (ℝ)

HIGH CORRELATION 

Distinct90
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean62.251
Minimum29.3
Maximum94.3
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:41:49.793588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum29.3
5-th percentile31.4
Q142.95
median67.25
Q375.525
95-th percentile91.615
Maximum94.3
Range65
Interquartile range (IQR)32.575

Descriptive statistics

Standard deviation19.996022
Coefficient of variation (CV)0.32121608
Kurtosis-1.1915904
Mean62.251
Median Absolute Deviation (MAD)16.55
Skewness-0.15400737
Sum6225.1
Variance399.84091
MonotonicityNot monotonic
2023-12-10T21:41:49.945153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
90.0 2
 
2.0%
88.6 2
 
2.0%
70.6 2
 
2.0%
43.3 2
 
2.0%
31.4 2
 
2.0%
71.5 2
 
2.0%
36.6 2
 
2.0%
85.6 2
 
2.0%
30.5 2
 
2.0%
32.8 2
 
2.0%
Other values (80) 80
80.0%
ValueCountFrequency (%)
29.3 1
1.0%
30.3 1
1.0%
30.5 2
2.0%
31.4 2
2.0%
31.6 1
1.0%
32.0 1
1.0%
32.3 1
1.0%
32.7 1
1.0%
32.8 2
2.0%
33.3 1
1.0%
ValueCountFrequency (%)
94.3 1
1.0%
92.9 1
1.0%
92.5 1
1.0%
92.2 1
1.0%
91.9 1
1.0%
91.6 1
1.0%
91.1 1
1.0%
90.2 1
1.0%
90.0 2
2.0%
88.8 1
1.0%

풍향
Categorical

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
북동
32 
21 
남서
14 
10 
남동
Other values (3)
15 

Length

Max length2
Median length2
Mean length1.58
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row북동
3rd row북동
4th row
5th row북동

Common Values

ValueCountFrequency (%)
북동 32
32.0%
21
21.0%
남서 14
14.0%
10
 
10.0%
남동 8
 
8.0%
6
 
6.0%
5
 
5.0%
북서 4
 
4.0%

Length

2023-12-10T21:41:50.105554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:41:50.255624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
북동 32
32.0%
21
21.0%
남서 14
14.0%
10
 
10.0%
남동 8
 
8.0%
6
 
6.0%
5
 
5.0%
북서 4
 
4.0%

풍속(m/s)
Real number (ℝ)

ZEROS 

Distinct42
Distinct (%)42.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.959
Minimum0
Maximum6.6
Zeros7
Zeros (%)7.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:41:50.409661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10.975
median1.45
Q32.6
95-th percentile5.1
Maximum6.6
Range6.6
Interquartile range (IQR)1.625

Descriptive statistics

Standard deviation1.4852198
Coefficient of variation (CV)0.758152
Kurtosis0.66912874
Mean1.959
Median Absolute Deviation (MAD)0.75
Skewness1.0317458
Sum195.9
Variance2.2058778
MonotonicityNot monotonic
2023-12-10T21:41:50.541507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
0.0 7
 
7.0%
1.4 7
 
7.0%
1.3 6
 
6.0%
1.1 5
 
5.0%
1.9 4
 
4.0%
1.2 4
 
4.0%
0.9 4
 
4.0%
2.2 4
 
4.0%
0.7 4
 
4.0%
1.8 4
 
4.0%
Other values (32) 51
51.0%
ValueCountFrequency (%)
0.0 7
7.0%
0.1 1
 
1.0%
0.2 1
 
1.0%
0.3 2
 
2.0%
0.4 1
 
1.0%
0.5 2
 
2.0%
0.6 2
 
2.0%
0.7 4
4.0%
0.8 1
 
1.0%
0.9 4
4.0%
ValueCountFrequency (%)
6.6 1
1.0%
5.8 2
2.0%
5.3 1
1.0%
5.1 2
2.0%
5.0 1
1.0%
4.7 1
1.0%
4.3 1
1.0%
4.2 1
1.0%
4.1 2
2.0%
4.0 1
1.0%

주소
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
경남 양산시 동면
100 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경남 양산시 동면
2nd row경남 양산시 동면
3rd row경남 양산시 동면
4th row경남 양산시 동면
5th row경남 양산시 동면

Common Values

ValueCountFrequency (%)
경남 양산시 동면 100
100.0%

Length

2023-12-10T21:41:50.681899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:41:50.769139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경남 100
33.3%
양산시 100
33.3%
동면 100
33.3%

Interactions

2023-12-10T21:41:47.736176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:46.513136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:46.905875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:47.271802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:47.854822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:46.612734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:47.010408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:47.389124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:47.965690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:46.699329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:47.090504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:47.519258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:48.089906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:46.797038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:47.177085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:41:47.622166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:41:50.838845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정일측정시간온도(℃)습도(%)풍향풍속(m/s)
측정일1.0000.2840.6120.0000.0000.249
측정시간0.2841.0000.8870.9390.6090.548
온도(℃)0.6120.8871.0000.8750.3300.475
습도(%)0.0000.9390.8751.0000.4960.651
풍향0.0000.6090.3300.4961.0000.232
풍속(m/s)0.2490.5480.4750.6510.2321.000
2023-12-10T21:41:50.981060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
풍향측정일
풍향1.0000.000
측정일0.0001.000
2023-12-10T21:41:51.077549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정시간온도(℃)습도(%)풍속(m/s)측정일풍향
측정시간1.0000.427-0.3140.2960.2070.342
온도(℃)0.4271.000-0.9120.4900.4540.159
습도(%)-0.314-0.9121.000-0.4690.0000.259
풍속(m/s)0.2960.490-0.4691.0000.1800.107
측정일0.2070.4540.0000.1801.0000.000
풍향0.3420.1590.2590.1070.0001.000

Missing values

2023-12-10T21:41:48.427563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:41:48.579005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지점측정일측정시간온도(℃)습도(%)풍향풍속(m/s)주소
0A-0010-0083E-620200401010.662.80.0경남 양산시 동면
1A-0010-0083E-62020040110010.773.1북동1.1경남 양산시 동면
2A-0010-0083E-620200401100014.541.9북동3.1경남 양산시 동면
3A-0010-0083E-620200401101514.940.04.1경남 양산시 동면
4A-0010-0083E-620200401103015.039.7북동2.6경남 양산시 동면
5A-0010-0083E-620200401104515.443.31.3경남 양산시 동면
6A-0010-0083E-620200401110015.940.82.4경남 양산시 동면
7A-0010-0083E-620200401111515.840.31.1경남 양산시 동면
8A-0010-0083E-620200401113015.738.71.8경남 양산시 동면
9A-0010-0083E-620200401114516.336.6북동1.8경남 양산시 동면
지점측정일측정시간온도(℃)습도(%)풍향풍속(m/s)주소
90A-0010-0083E-62020040183012.163.52.2경남 양산시 동면
91A-0010-0083E-62020040184512.558.61.4경남 양산시 동면
92A-0010-0083E-62020040190013.058.4북동2.6경남 양산시 동면
93A-0010-0083E-62020040191513.260.21.2경남 양산시 동면
94A-0010-0083E-62020040193013.262.0남동1.1경남 양산시 동면
95A-0010-0083E-62020040194514.146.3북동4.7경남 양산시 동면
96A-0010-0083E-62020040209.670.03.5경남 양산시 동면
97A-0010-0083E-6202004021008.668.4북동4.2경남 양산시 동면
98A-0010-0083E-620200402100013.333.31.2경남 양산시 동면
99A-0010-0083E-620200402101514.132.7북동1.8경남 양산시 동면