Overview

Dataset statistics

Number of variables: 8
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.9 KiB
Average record size in memory: 70.3 B
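
The two memory figures above can be cross-checked against each other. The sketch below is only an arithmetic check on the reported (rounded) values, assuming 1 KiB = 1024 B; it does not reproduce how the profiler measures memory.

```python
# Values copied from the Dataset statistics above.
n_observations = 100
avg_record_bytes = 70.3  # reported average record size, in bytes

# Total size implied by the per-record average, converted to KiB.
total_kib = n_observations * avg_record_bytes / 1024
print(f"{total_kib:.1f} KiB")
```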

Variable types

Categorical: 4
Numeric: 4

Alerts

지점 has constant value "" (Constant)
주소 has constant value "" (Constant)
측정시간 is highly overall correlated with 온도(℃) and 1 other field (High correlation)
온도(℃) is highly overall correlated with 측정시간 and 2 other fields (High correlation)
습도(%) is highly overall correlated with 측정시간 and 2 other fields (High correlation)
풍속(m/s) is highly overall correlated with 온도(℃) and 1 other field (High correlation)
측정일 is highly imbalanced (75.8%) (Imbalance)
측정시간 has 2 (2.0%) zeros (Zeros)
온도(℃) has 4 (4.0%) zeros (Zeros)
풍속(m/s) has 2 (2.0%) zeros (Zeros)
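
The 75.8% imbalance figure for 측정일 is consistent with an entropy-based score, 1 - H / log2(k), where H is the Shannon entropy of the class shares and k is the number of classes. The sketch below is an assumption about how such a score can be computed, not a copy of the profiler's code; the function name `imbalance_score` is hypothetical, and the counts 96 and 4 are the two 측정일 value counts reported later in this document.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (in bits) of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is scored as 0 in this sketch
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

score = imbalance_score([96, 4])  # counts for 20200201 and 20200202
print(f"{score:.1%}")
```

A perfectly balanced column scores 0 under this definition, and the score approaches 1 as one class dominates.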

Reproduction

Analysis started: 2023-12-10 12:41:57.387463
Analysis finished: 2023-12-10 12:41:59.267132
Duration: 1.88 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
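
The reported duration follows directly from the two timestamps. A minimal check using only the Python standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section above.
started = datetime.fromisoformat("2023-12-10 12:41:57.387463")
finished = datetime.fromisoformat("2023-12-10 12:41:59.267132")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```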

Variables

지점
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
A-0010-0083E-6: 100

Length

Max length: 14
Median length: 14
Mean length: 14
Min length: 14

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: A-0010-0083E-6
2nd row: A-0010-0083E-6
3rd row: A-0010-0083E-6
4th row: A-0010-0083E-6
5th row: A-0010-0083E-6

Common Values

Value | Count | Frequency (%)
A-0010-0083E-6 | 100 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
a-0010-0083e-6 | 100 | 100.0%

측정일
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
20200201: 96
20200202: 4

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20200201
2nd row: 20200201
3rd row: 20200201
4th row: 20200201
5th row: 20200201

Common Values

Value | Count | Frequency (%)
20200201 | 96 | 96.0%
20200202 | 4 | 4.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
20200201 | 96 | 96.0%
20200202 | 4 | 4.0%

측정시간
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 96
Distinct (%): 96.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1146.75
Minimum: 0
Maximum: 2345
Zeros: 2
Zeros (%): 2.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 97.25
Q1: 541.25
median: 1122.5
Q3: 1733.75
95-th percentile: 2230.75
Maximum: 2345
Range: 2345
Interquartile range (IQR): 1192.5

Descriptive statistics

Standard deviation: 700.21836
Coefficient of variation (CV): 0.61061117
Kurtosis: -1.1835169
Mean: 1146.75
Median Absolute Deviation (MAD): 600
Skewness: 0.026925197
Sum: 114675
Variance: 490305.74
Monotonicity: Not monotonic
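
Several of the figures above are derived from others: CV is the standard deviation divided by the mean, variance is the squared standard deviation, IQR is Q3 minus Q1, and range is maximum minus minimum. The sketch below recomputes them from the reported values for 측정시간. Because the inputs are already rounded, small differences in the last digits are expected.

```python
# Values copied from the 측정시간 statistics above.
mean, std = 1146.75, 700.21836
q1, q3 = 541.25, 1733.75
minimum, maximum = 0, 2345

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(cv, variance, iqr, value_range)
```

The same relations hold for the other numeric variables in this report.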
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
0 | 2 | 2.0%
1000 | 2 | 2.0%
1015 | 2 | 2.0%
100 | 2 | 2.0%
2315 | 1 | 1.0%
400 | 1 | 1.0%
345 | 1 | 1.0%
330 | 1 | 1.0%
315 | 1 | 1.0%
300 | 1 | 1.0%
Other values (86) | 86 | 86.0%

Value | Count | Frequency (%)
0 | 2 | 2.0%
15 | 1 | 1.0%
30 | 1 | 1.0%
45 | 1 | 1.0%
100 | 2 | 2.0%
115 | 1 | 1.0%
130 | 1 | 1.0%
145 | 1 | 1.0%
200 | 1 | 1.0%
215 | 1 | 1.0%

Value | Count | Frequency (%)
2345 | 1 | 1.0%
2330 | 1 | 1.0%
2315 | 1 | 1.0%
2300 | 1 | 1.0%
2245 | 1 | 1.0%
2230 | 1 | 1.0%
2215 | 1 | 1.0%
2200 | 1 | 1.0%
2145 | 1 | 1.0%
2130 | 1 | 1.0%
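
The listed values (0, 15, 30, 45, 100, 115, ..., 2345) look like clock times written as HHMM integers in 15-minute steps. This is an inference from the values, not something the report states. Under that assumption, the two zeros flagged in the alerts would be midnight readings rather than missing data, and the mean of the raw numbers is not a meaningful average time. The hypothetical helper below converts such a value to minutes since midnight.

```python
def hhmm_to_minutes(value: int) -> int:
    """Convert an HHMM-encoded integer (e.g. 1015 for 10:15) to minutes since midnight."""
    hours, minutes = divmod(int(value), 100)
    if not (0 <= hours <= 23 and 0 <= minutes <= 59):
        raise ValueError(f"not a valid HHMM value: {value}")
    return hours * 60 + minutes

# Values taken from the tables above.
for raw in (0, 45, 100, 1015, 2345):
    print(raw, hhmm_to_minutes(raw))
```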

온도(℃)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 61
Distinct (%): 61.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.052
Minimum: -0.7
Maximum: 10.4
Zeros: 4
Zeros (%): 4.0%
Negative: 12
Negative (%): 12.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: -0.7
5-th percentile: -0.3
Q1: 0.3
median: 3.3
Q3: 8.05
95-th percentile: 9.9
Maximum: 10.4
Range: 11.1
Interquartile range (IQR): 7.75

Descriptive statistics

Standard deviation: 3.7583975
Coefficient of variation (CV): 0.92754133
Kurtosis: -1.3919119
Mean: 4.052
Median Absolute Deviation (MAD): 3.2
Skewness: 0.35991487
Sum: 405.2
Variance: 14.125552
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
0.1 | 6 | 6.0%
0.3 | 6 | 6.0%
-0.2 | 4 | 4.0%
0.0 | 4 | 4.0%
2.6 | 3 | 3.0%
9.5 | 3 | 3.0%
9.7 | 3 | 3.0%
9.9 | 3 | 3.0%
-0.3 | 3 | 3.0%
10.1 | 2 | 2.0%
Other values (51) | 63 | 63.0%

Value | Count | Frequency (%)
-0.7 | 1 | 1.0%
-0.5 | 2 | 2.0%
-0.4 | 1 | 1.0%
-0.3 | 3 | 3.0%
-0.2 | 4 | 4.0%
-0.1 | 1 | 1.0%
0.0 | 4 | 4.0%
0.1 | 6 | 6.0%
0.3 | 6 | 6.0%
0.5 | 1 | 1.0%

Value | Count | Frequency (%)
10.4 | 1 | 1.0%
10.2 | 1 | 1.0%
10.1 | 2 | 2.0%
9.9 | 3 | 3.0%
9.7 | 3 | 3.0%
9.6 | 1 | 1.0%
9.5 | 3 | 3.0%
9.4 | 1 | 1.0%
9.3 | 1 | 1.0%
9.1 | 2 | 2.0%

습도(%)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 92
Distinct (%): 92.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 61.901
Minimum: 34.3
Maximum: 85.1
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 34.3
5-th percentile: 35.68
Q1: 42.675
median: 65.4
Q3: 78.85
95-th percentile: 82.925
Maximum: 85.1
Range: 50.8
Interquartile range (IQR): 36.175

Descriptive statistics

Standard deviation: 17.569932
Coefficient of variation (CV): 0.28383923
Kurtosis: -1.4540403
Mean: 61.901
Median Absolute Deviation (MAD): 14.95
Skewness: -0.28487553
Sum: 6190.1
Variance: 308.70252
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
78.8 | 3 | 3.0%
82.3 | 3 | 3.0%
38.2 | 2 | 2.0%
37.7 | 2 | 2.0%
75.5 | 2 | 2.0%
82.2 | 2 | 2.0%
63.7 | 1 | 1.0%
67.3 | 1 | 1.0%
79.3 | 1 | 1.0%
79.6 | 1 | 1.0%
Other values (82) | 82 | 82.0%

Value | Count | Frequency (%)
34.3 | 1 | 1.0%
34.8 | 1 | 1.0%
35.0 | 1 | 1.0%
35.2 | 1 | 1.0%
35.3 | 1 | 1.0%
35.7 | 1 | 1.0%
35.9 | 1 | 1.0%
36.0 | 1 | 1.0%
36.6 | 1 | 1.0%
36.8 | 1 | 1.0%

Value | Count | Frequency (%)
85.1 | 1 | 1.0%
84.9 | 1 | 1.0%
84.8 | 1 | 1.0%
83.7 | 1 | 1.0%
83.4 | 1 | 1.0%
82.9 | 1 | 1.0%
82.6 | 1 | 1.0%
82.4 | 1 | 1.0%
82.3 | 3 | 3.0%
82.2 | 2 | 2.0%

풍향
Categorical

Distinct: 8
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
남서: 26
북서: 17
13
13
10
Other values (3): 21

Length

Max length: 2
Median length: 2
Mean length: 1.54
Min length: 1
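
The mean length of 1.54 can be reproduced from the value counts listed for this variable. The sketch below assumes that the rows whose labels are not shown in this document all have length 1, which is consistent with the reported minimum length of 1 and maximum length of 2.

```python
# Counts of the two-character values, taken from this variable's Common Values list.
two_char_counts = {"남서": 26, "북서": 17, "북동": 7, "남동": 4}
n_rows = 100

n_two = sum(two_char_counts.values())
n_one = n_rows - n_two  # assumption: every remaining row holds a one-character value

mean_length = (2 * n_two + 1 * n_one) / n_rows
print(mean_length)
```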

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row:
2nd row:
3rd row: 남서
4th row:
5th row: 남서

Common Values

Value | Count | Frequency (%)
남서 | 26 | 26.0%
북서 | 17 | 17.0%
 | 13 | 13.0%
 | 13 | 13.0%
 | 10 | 10.0%
 | 10 | 10.0%
북동 | 7 | 7.0%
남동 | 4 | 4.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
남서 | 26 | 26.0%
북서 | 17 | 17.0%
 | 13 | 13.0%
 | 13 | 13.0%
 | 10 | 10.0%
 | 10 | 10.0%
북동 | 7 | 7.0%
남동 | 4 | 4.0%

풍속(m/s)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 39
Distinct (%): 39.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.799
Minimum: 0
Maximum: 5.1
Zeros: 2
Zeros (%): 2.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0.195
Q1: 0.8
median: 1.5
Q3: 2.5
95-th percentile: 3.91
Maximum: 5.1
Range: 5.1
Interquartile range (IQR): 1.7

Descriptive statistics

Standard deviation: 1.2366863
Coefficient of variation (CV): 0.68742983
Kurtosis: -0.1136332
Mean: 1.799
Median Absolute Deviation (MAD): 0.8
Skewness: 0.73841983
Sum: 179.9
Variance: 1.5293929
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=39)

Value | Count | Frequency (%)
1.1 | 7 | 7.0%
1.7 | 6 | 6.0%
2.0 | 6 | 6.0%
0.6 | 5 | 5.0%
2.2 | 5 | 5.0%
0.7 | 5 | 5.0%
1.3 | 4 | 4.0%
1.5 | 4 | 4.0%
1.4 | 4 | 4.0%
0.2 | 3 | 3.0%
Other values (29) | 51 | 51.0%

Value | Count | Frequency (%)
0.0 | 2 | 2.0%
0.1 | 3 | 3.0%
0.2 | 3 | 3.0%
0.3 | 2 | 2.0%
0.4 | 1 | 1.0%
0.5 | 2 | 2.0%
0.6 | 5 | 5.0%
0.7 | 5 | 5.0%
0.8 | 3 | 3.0%
0.9 | 2 | 2.0%

Value | Count | Frequency (%)
5.1 | 1 | 1.0%
4.9 | 2 | 2.0%
4.6 | 1 | 1.0%
4.1 | 1 | 1.0%
3.9 | 2 | 2.0%
3.8 | 2 | 2.0%
3.7 | 2 | 2.0%
3.6 | 1 | 1.0%
3.5 | 1 | 1.0%
3.3 | 1 | 1.0%

주소
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
경남 양산시 동면: 100

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경남 양산시 동면
2nd row: 경남 양산시 동면
3rd row: 경남 양산시 동면
4th row: 경남 양산시 동면
5th row: 경남 양산시 동면

Common Values

Value | Count | Frequency (%)
경남 양산시 동면 | 100 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
경남 | 100 | 33.3%
양산시 | 100 | 33.3%
동면 | 100 | 33.3%

Interactions

[16 interaction plots; images not included]

Correlations

 | 측정일 | 측정시간 | 온도(℃) | 습도(%) | 풍향 | 풍속(m/s)
측정일 | 1.000 | 0.284 | 0.340 | 0.567 | 0.128 | 0.000
측정시간 | 0.284 | 1.000 | 0.918 | 0.935 | 0.744 | 0.746
온도(℃) | 0.340 | 0.918 | 1.000 | 0.959 | 0.669 | 0.733
습도(%) | 0.567 | 0.935 | 0.959 | 1.000 | 0.650 | 0.686
풍향 | 0.128 | 0.744 | 0.669 | 0.650 | 1.000 | 0.622
풍속(m/s) | 0.000 | 0.746 | 0.733 | 0.686 | 0.622 | 1.000

 | 풍향 | 측정일
풍향 | 1.000 | 0.089
측정일 | 0.089 | 1.000

 | 측정시간 | 온도(℃) | 습도(%) | 풍속(m/s) | 측정일 | 풍향
측정시간 | 1.000 | 0.585 | -0.589 | 0.333 | 0.207 | 0.472
온도(℃) | 0.585 | 1.000 | -0.984 | 0.669 | 0.248 | 0.394
습도(%) | -0.589 | -0.984 | 1.000 | -0.652 | 0.419 | 0.377
풍속(m/s) | 0.333 | 0.669 | -0.652 | 1.000 | 0.000 | 0.353
측정일 | 0.207 | 0.248 | 0.419 | 0.000 | 1.000 | 0.089
풍향 | 0.472 | 0.394 | 0.377 | 0.353 | 0.089 | 1.000
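
The High correlation alerts in the Overview are consistent with flagging every pair in the last matrix above whose absolute coefficient exceeds 0.5. The threshold of 0.5 and the helper name `highly_correlated` are assumptions made for this sketch; the report itself does not state the rule.

```python
THRESHOLD = 0.5  # assumption: chosen because it reproduces the alerts in this report

names = ["측정시간", "온도(℃)", "습도(%)", "풍속(m/s)", "측정일", "풍향"]
# Coefficients copied from the last correlation matrix above, in the same row order.
matrix = [
    [1.000, 0.585, -0.589, 0.333, 0.207, 0.472],
    [0.585, 1.000, -0.984, 0.669, 0.248, 0.394],
    [-0.589, -0.984, 1.000, -0.652, 0.419, 0.377],
    [0.333, 0.669, -0.652, 1.000, 0.000, 0.353],
    [0.207, 0.248, 0.419, 0.000, 1.000, 0.089],
    [0.472, 0.394, 0.377, 0.353, 0.089, 1.000],
]

def highly_correlated(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| above the threshold."""
    result = {}
    for i, name in enumerate(names):
        partners = [
            names[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) > threshold
        ]
        if partners:
            result[name] = partners
    return result

alerts = highly_correlated(names, matrix, THRESHOLD)
for name, partners in alerts.items():
    print(name, "->", ", ".join(partners))
```

With these inputs, 측정시간 and 풍속(m/s) each have two flagged partners and 온도(℃) and 습도(%) each have three, matching the "and 1 other" and "and 2 other" wording of the alerts. 측정일 and 풍향 have none.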

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

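Index | 지점 | 측정일 | 측정시간 | 온도(℃) | 습도(%) | 풍향 | 풍속(m/s) | 주소
0 | A-0010-0083E-6 | 20200201 | 0 | 1.3 | 74.0 |  | 0.9 | 경남 양산시 동면
1 | A-0010-0083E-6 | 20200201 | 100 | 0.3 | 77.9 |  | 0.1 | 경남 양산시 동면
2 | A-0010-0083E-6 | 20200201 | 1000 | 4.1 | 67.1 | 남서 | 2.3 | 경남 양산시 동면
3 | A-0010-0083E-6 | 20200201 | 1015 | 4.9 | 58.0 |  | 3.0 | 경남 양산시 동면
4 | A-0010-0083E-6 | 20200201 | 1030 | 5.5 | 51.4 | 남서 | 3.8 | 경남 양산시 동면
5 | A-0010-0083E-6 | 20200201 | 1045 | 6.4 | 49.8 | 남서 | 3.7 | 경남 양산시 동면
6 | A-0010-0083E-6 | 20200201 | 1100 | 6.8 | 46.9 |  | 3.2 | 경남 양산시 동면
7 | A-0010-0083E-6 | 20200201 | 1115 | 7.4 | 42.7 | 북서 | 2.8 | 경남 양산시 동면
8 | A-0010-0083E-6 | 20200201 | 1130 | 8.2 | 38.2 | 북서 | 2.2 | 경남 양산시 동면
9 | A-0010-0083E-6 | 20200201 | 1145 | 8.3 | 37.7 | 북서 | 3.2 | 경남 양산시 동면

Index | 지점 | 측정일 | 측정시간 | 온도(℃) | 습도(%) | 풍향 | 풍속(m/s) | 주소
90 | A-0010-0083E-6 | 20200201 | 830 | 0.3 | 82.2 | 남서 | 2.0 | 경남 양산시 동면
91 | A-0010-0083E-6 | 20200201 | 845 | 0.8 | 77.6 | 남서 | 2.2 | 경남 양산시 동면
92 | A-0010-0083E-6 | 20200201 | 900 | 1.5 | 75.5 |  | 1.7 | 경남 양산시 동면
93 | A-0010-0083E-6 | 20200201 | 915 | 2.0 | 75.5 | 남서 | 2.0 | 경남 양산시 동면
94 | A-0010-0083E-6 | 20200201 | 930 | 2.8 | 72.9 | 남서 | 2.0 | 경남 양산시 동면
95 | A-0010-0083E-6 | 20200201 | 945 | 3.5 | 69.5 | 남서 | 1.9 | 경남 양산시 동면
96 | A-0010-0083E-6 | 20200202 | 0 | 1.9 | 70.5 | 남서 | 0.2 | 경남 양산시 동면
97 | A-0010-0083E-6 | 20200202 | 100 | 1.9 | 72.3 | 남서 | 1.4 | 경남 양산시 동면
98 | A-0010-0083E-6 | 20200202 | 1000 | 5.3 | 57.8 |  | 1.4 | 경남 양산시 동면
99 | A-0010-0083E-6 | 20200202 | 1015 | 6.1 | 53.9 |  | 1.3 | 경남 양산시 동면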