Dataset statistics
Number of variables | 14 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 11.8 KiB |
Average record size in memory | 121.3 B |
Variable types
DateTime | 1 |
---|---|
Numeric | 5 |
Categorical | 3 |
Boolean | 5 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 한국기상산업기술원 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=12dd5eb0-a066-11ee-b38e-6783ddae4cb6 |
연월일 has constant value "" | Constant |
위도 has constant value "" | Constant |
경도 has constant value "" | Constant |
강수유무 has constant value "" | Constant |
기압QC has constant value "" | Constant |
기온QC has constant value "" | Constant |
습도QC has constant value "" | Constant |
PM2.5QC has constant value "" | Constant |
시 is highly overall correlated with 기압 | High correlation |
기압 is highly overall correlated with 시 | High correlation |
습도 is highly imbalanced (89.8%) | Imbalance |
시 has 11 (11.0%) zeros | Zeros |
분 has 2 (2.0%) zeros | Zeros |
Reproduction
Analysis started | 2024-03-13 11:43:20.966356 |
---|---|
Analysis finished | 2024-03-13 11:43:24.919843 |
Duration | 3.95 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연월일
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2023-12-08 00:00:00 |
---|---|
Maximum | 2023-12-08 00:00:00 |
시
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 9 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.82 |
Minimum | 0 |
---|---|
Maximum | 8 |
Zeros | 11 |
Zeros (%) | 11.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2 |
median | 4 |
Q3 | 6 |
95-th percentile | 8 |
Maximum | 8 |
Range | 8 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.4959766 |
---|---|
Coefficient of variation (CV) | 0.65339701 |
Kurtosis | -1.1922646 |
Mean | 3.82 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.04615684 |
Sum | 382 |
Variance | 6.229899 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 12 | |
2 | 12 | |
3 | 12 | |
5 | 12 | |
7 | 12 | |
0 | 11 | |
4 | 11 | |
6 | 11 | |
8 | 7 |
Value | Count | Frequency (%) |
0 | 11 | |
1 | 12 | |
2 | 12 | |
3 | 12 | |
4 | 11 | |
5 | 12 | |
6 | 11 | |
7 | 12 | |
8 | 7 |
Value | Count | Frequency (%) |
8 | 7 | |
7 | 12 | |
6 | 11 | |
5 | 12 | |
4 | 11 | |
3 | 12 | |
2 | 12 | |
1 | 12 | |
0 | 11 |
분
Real number (ℝ)
ZEROS
 
Distinct | 60 |
---|---|
Distinct (%) | 60.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.16 |
Minimum | 0 |
---|---|
Maximum | 59 |
Zeros | 2 |
Zeros (%) | 2.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2 |
Q1 | 13.75 |
median | 27.5 |
Q3 | 43 |
95-th percentile | 55 |
Maximum | 59 |
Range | 59 |
Interquartile range (IQR) | 29.25 |
Descriptive statistics
Standard deviation | 17.155098 |
---|---|
Coefficient of variation (CV) | 0.60920091 |
Kurtosis | -1.1695705 |
Mean | 28.16 |
Median Absolute Deviation (MAD) | 14.5 |
Skewness | 0.082753204 |
Sum | 2816 |
Variance | 294.29737 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
55 | 4 | 4.0% |
50 | 4 | 4.0% |
2 | 3 | 3.0% |
7 | 3 | 3.0% |
38 | 3 | 3.0% |
43 | 3 | 3.0% |
27 | 2 | 2.0% |
31 | 2 | 2.0% |
17 | 2 | 2.0% |
22 | 2 | 2.0% |
Other values (50) | 72 |
Value | Count | Frequency (%) |
0 | 2 | |
1 | 1 | 1.0% |
2 | 3 | |
3 | 2 | |
4 | 1 | 1.0% |
5 | 2 | |
6 | 1 | 1.0% |
7 | 3 | |
8 | 2 | |
9 | 1 | 1.0% |
Value | Count | Frequency (%) |
59 | 1 | 1.0% |
58 | 1 | 1.0% |
57 | 1 | 1.0% |
56 | 1 | 1.0% |
55 | 4 | |
54 | 1 | 1.0% |
53 | 1 | 1.0% |
52 | 1 | 1.0% |
51 | 1 | 1.0% |
50 | 4 |
위도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
37.466038 |
---|
Length
Max length | 9 |
---|---|
Median length | 9 |
Mean length | 9 |
Min length | 9 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 37.466038 |
---|---|
2nd row | 37.466038 |
3rd row | 37.466038 |
4th row | 37.466038 |
5th row | 37.466038 |
Common Values
Value | Count | Frequency (%) |
37.466038 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
37.466038 | 100 |
경도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
127.119338 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 127.119338 |
---|---|
2nd row | 127.119338 |
3rd row | 127.119338 |
4th row | 127.119338 |
5th row | 127.119338 |
Common Values
Value | Count | Frequency (%) |
127.119338 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
127.119338 | 100 |
기압
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 16.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1012.968 |
Minimum | 1012.1 |
---|---|
Maximum | 1013.7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1012.1 |
---|---|
5-th percentile | 1012.2 |
Q1 | 1012.4 |
median | 1013 |
Q3 | 1013.5 |
95-th percentile | 1013.7 |
Maximum | 1013.7 |
Range | 1.6 |
Interquartile range (IQR) | 1.1 |
Descriptive statistics
Standard deviation | 0.53783704 |
---|---|
Coefficient of variation (CV) | 0.00053095166 |
Kurtosis | -1.5587623 |
Mean | 1012.968 |
Median Absolute Deviation (MAD) | 0.5 |
Skewness | -0.049522857 |
Sum | 101296.8 |
Variance | 0.28926869 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1013.6 | 13 | |
1013.0 | 13 | |
1013.5 | 13 | |
1012.4 | 12 | |
1012.2 | 9 | |
1012.5 | 8 | |
1013.7 | 8 | |
1012.3 | 5 | 5.0% |
1013.4 | 4 | 4.0% |
1013.1 | 4 | 4.0% |
Other values (6) | 11 |
Value | Count | Frequency (%) |
1012.1 | 1 | 1.0% |
1012.2 | 9 | |
1012.3 | 5 | 5.0% |
1012.4 | 12 | |
1012.5 | 8 | |
1012.6 | 4 | 4.0% |
1012.7 | 1 | 1.0% |
1012.8 | 2 | 2.0% |
1012.9 | 2 | 2.0% |
1013.0 | 13 |
Value | Count | Frequency (%) |
1013.7 | 8 | |
1013.6 | 13 | |
1013.5 | 13 | |
1013.4 | 4 | 4.0% |
1013.2 | 1 | 1.0% |
1013.1 | 4 | 4.0% |
1013.0 | 13 | |
1012.9 | 2 | 2.0% |
1012.8 | 2 | 2.0% |
1012.7 | 1 | 1.0% |
기온
Real number (ℝ)
Distinct | 32 |
---|---|
Distinct (%) | 32.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.536 |
Minimum | 5.8 |
---|---|
Maximum | 8.9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 5.8 |
---|---|
5-th percentile | 6.195 |
Q1 | 7.075 |
median | 7.6 |
Q3 | 8.2 |
95-th percentile | 8.7 |
Maximum | 8.9 |
Range | 3.1 |
Interquartile range (IQR) | 1.125 |
Descriptive statistics
Standard deviation | 0.8133383 |
---|---|
Coefficient of variation (CV) | 0.10792706 |
Kurtosis | -0.81955729 |
Mean | 7.536 |
Median Absolute Deviation (MAD) | 0.6 |
Skewness | -0.30558363 |
Sum | 753.6 |
Variance | 0.66151919 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7.3 | 9 | 9.0% |
7.7 | 7 | 7.0% |
7.8 | 6 | 6.0% |
8.5 | 5 | 5.0% |
8.4 | 5 | 5.0% |
6.5 | 5 | 5.0% |
8.7 | 4 | 4.0% |
7.9 | 4 | 4.0% |
7.6 | 4 | 4.0% |
7.5 | 4 | 4.0% |
Other values (22) | 47 |
Value | Count | Frequency (%) |
5.8 | 1 | 1.0% |
5.9 | 2 | 2.0% |
6.0 | 1 | 1.0% |
6.1 | 1 | 1.0% |
6.2 | 3 | |
6.3 | 2 | 2.0% |
6.4 | 3 | |
6.5 | 5 | |
6.6 | 2 | 2.0% |
6.7 | 1 | 1.0% |
Value | Count | Frequency (%) |
8.9 | 1 | 1.0% |
8.8 | 3 | |
8.7 | 4 | |
8.6 | 3 | |
8.5 | 5 | |
8.4 | 5 | |
8.3 | 3 | |
8.2 | 3 | |
8.1 | 2 | 2.0% |
8.0 | 3 |
습도
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
100.0 | |
---|---|
99.9 | 1 |
99.1 | 1 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.98 |
Min length | 4 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 100.0 |
---|---|
2nd row | 100.0 |
3rd row | 100.0 |
4th row | 100.0 |
5th row | 100.0 |
Common Values
Value | Count | Frequency (%) |
100.0 | 98 | |
99.9 | 1 | 1.0% |
99.1 | 1 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
100.0 | 98 | |
99.9 | 1 | 1.0% |
99.1 | 1 | 1.0% |
강수유무
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 232.0 B |
False |
---|
Value | Count | Frequency (%) |
False | 100 |
PM2.5
Real number (ℝ)
Distinct | 48 |
---|---|
Distinct (%) | 48.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.06971 |
Minimum | 4.804 |
---|---|
Maximum | 10.365 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 4.804 |
---|---|
5-th percentile | 5.13185 |
Q1 | 6.19425 |
median | 7.128 |
Q3 | 7.792 |
95-th percentile | 8.871 |
Maximum | 10.365 |
Range | 5.561 |
Interquartile range (IQR) | 1.59775 |
Descriptive statistics
Standard deviation | 1.1747754 |
---|---|
Coefficient of variation (CV) | 0.16617023 |
Kurtosis | -0.31369696 |
Mean | 7.06971 |
Median Absolute Deviation (MAD) | 0.664 |
Skewness | -0.0034780692 |
Sum | 706.971 |
Variance | 1.3800972 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7.626 | 7 | 7.0% |
6.713 | 6 | 6.0% |
7.294 | 4 | 4.0% |
5.219 | 4 | 4.0% |
7.709 | 4 | 4.0% |
6.132 | 4 | 4.0% |
7.128 | 4 | 4.0% |
7.377 | 3 | 3.0% |
7.792 | 3 | 3.0% |
7.045 | 3 | 3.0% |
Other values (38) | 58 |
Value | Count | Frequency (%) |
4.804 | 1 | 1.0% |
4.887 | 2 | |
5.053 | 2 | |
5.136 | 1 | 1.0% |
5.219 | 4 | |
5.302 | 1 | 1.0% |
5.385 | 3 | |
5.468 | 1 | 1.0% |
5.634 | 1 | 1.0% |
5.717 | 1 | 1.0% |
Value | Count | Frequency (%) |
10.365 | 1 | 1.0% |
9.452 | 1 | 1.0% |
9.203 | 2 | |
8.871 | 3 | |
8.705 | 2 | |
8.539 | 1 | 1.0% |
8.456 | 2 | |
8.373 | 2 | |
8.29 | 1 | 1.0% |
8.207 | 2 |
기압QC
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 232.0 B |
True |
---|
Value | Count | Frequency (%) |
True | 100 |
기온QC
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 232.0 B |
True |
---|
Value | Count | Frequency (%) |
True | 100 |
습도QC
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 232.0 B |
True |
---|
Value | Count | Frequency (%) |
True | 100 |
PM2.5QC
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 232.0 B |
True |
---|
Value | Count | Frequency (%) |
True | 100 |
시 | 분 | 기압 | 기온 | 습도 | PM2.5 | |
---|---|---|---|---|---|---|
시 | 1.000 | 0.000 | 0.963 | 0.774 | 0.487 | 0.605 |
분 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
기압 | 0.963 | 0.000 | 1.000 | 0.764 | 0.540 | 0.482 |
기온 | 0.774 | 0.000 | 0.764 | 1.000 | 0.000 | 0.524 |
습도 | 0.487 | 0.000 | 0.540 | 0.000 | 1.000 | 0.000 |
PM2.5 | 0.605 | 0.000 | 0.482 | 0.524 | 0.000 | 1.000 |
시 | 분 | 기압 | 기온 | PM2.5 | 습도 | |
---|---|---|---|---|---|---|
시 | 1.000 | -0.084 | -0.902 | -0.076 | 0.224 | 0.236 |
분 | -0.084 | 1.000 | 0.013 | -0.129 | -0.158 | 0.000 |
기압 | -0.902 | 0.013 | 1.000 | 0.030 | -0.219 | 0.273 |
기온 | -0.076 | -0.129 | 0.030 | 1.000 | 0.473 | 0.000 |
PM2.5 | 0.224 | -0.158 | -0.219 | 0.473 | 1.000 | 0.000 |
습도 | 0.236 | 0.000 | 0.273 | 0.000 | 0.000 | 1.000 |
연월일 | 시 | 분 | 위도 | 경도 | 기압 | 기온 | 습도 | 강수유무 | PM2.5 | 기압QC | 기온QC | 습도QC | PM2.5QC | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2023-12-08 | 0 | 8 | 37.466038 | 127.119338 | 1013.6 | 8.8 | 100.0 | N | 7.626 | Y | Y | Y | Y |
1 | 2023-12-08 | 0 | 13 | 37.466038 | 127.119338 | 1013.6 | 8.7 | 100.0 | N | 6.879 | Y | Y | Y | Y |
2 | 2023-12-08 | 0 | 18 | 37.466038 | 127.119338 | 1013.6 | 8.7 | 100.0 | N | 7.626 | Y | Y | Y | Y |
3 | 2023-12-08 | 0 | 23 | 37.466038 | 127.119338 | 1013.6 | 8.7 | 100.0 | N | 7.128 | Y | Y | Y | Y |
4 | 2023-12-08 | 0 | 28 | 37.466038 | 127.119338 | 1013.5 | 8.6 | 100.0 | N | 7.211 | Y | Y | Y | Y |
5 | 2023-12-08 | 0 | 33 | 37.466038 | 127.119338 | 1013.6 | 8.6 | 100.0 | N | 9.203 | Y | Y | Y | Y |
6 | 2023-12-08 | 0 | 38 | 37.466038 | 127.119338 | 1013.7 | 8.5 | 100.0 | N | 7.377 | Y | Y | Y | Y |
7 | 2023-12-08 | 0 | 43 | 37.466038 | 127.119338 | 1013.7 | 8.4 | 100.0 | N | 7.294 | Y | Y | Y | Y |
8 | 2023-12-08 | 0 | 50 | 37.466038 | 127.119338 | 1013.7 | 8.4 | 100.0 | N | 7.294 | Y | Y | Y | Y |
9 | 2023-12-08 | 0 | 55 | 37.466038 | 127.119338 | 1013.7 | 8.3 | 100.0 | N | 8.373 | Y | Y | Y | Y |
연월일 | 시 | 분 | 위도 | 경도 | 기압 | 기온 | 습도 | 강수유무 | PM2.5 | 기압QC | 기온QC | 습도QC | PM2.5QC | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
90 | 2023-12-08 | 7 | 50 | 37.466038 | 127.119338 | 1012.4 | 7.8 | 100.0 | N | 7.543 | Y | Y | Y | Y |
91 | 2023-12-08 | 7 | 55 | 37.466038 | 127.119338 | 1012.4 | 7.8 | 100.0 | N | 7.875 | Y | Y | Y | Y |
92 | 2023-12-08 | 8 | 2 | 37.466038 | 127.119338 | 1012.5 | 8.1 | 100.0 | N | 8.705 | Y | Y | Y | Y |
93 | 2023-12-08 | 8 | 7 | 37.466038 | 127.119338 | 1012.5 | 8.1 | 100.0 | N | 9.203 | Y | Y | Y | Y |
94 | 2023-12-08 | 8 | 12 | 37.466038 | 127.119338 | 1012.5 | 8.2 | 100.0 | N | 8.456 | Y | Y | Y | Y |
95 | 2023-12-08 | 8 | 17 | 37.466038 | 127.119338 | 1012.5 | 8.4 | 100.0 | N | 7.294 | Y | Y | Y | Y |
96 | 2023-12-08 | 8 | 22 | 37.466038 | 127.119338 | 1012.5 | 8.5 | 100.0 | N | 7.958 | Y | Y | Y | Y |
97 | 2023-12-08 | 8 | 27 | 37.466038 | 127.119338 | 1012.5 | 8.8 | 99.9 | N | 7.898 | Y | Y | Y | Y |
98 | 2023-12-08 | 8 | 32 | 37.466038 | 127.119338 | 1012.6 | 8.9 | 99.1 | N | 7.003 | Y | Y | Y | Y |
99 | 2023-12-08 | 0 | 3 | 37.466038 | 127.119338 | 1013.6 | 8.8 | 100.0 | N | 7.128 | Y | Y | Y | Y |