Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 7.9 KiB |
Average record size in memory | 81.3 B |
Variable types
Categorical | 7 |
---|---|
Numeric | 2 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 한국기상산업기술원 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=dd2e6e20-a066-11ee-a443-a7e161ec5b2c |
예보발표일자 has constant value "" | Constant |
경도 격자점 has constant value "" | Constant |
위도 격자점 has constant value "" | Constant |
초미세먼지등급 has constant value "" | Constant |
예보시각 is highly overall correlated with 기온 | High correlation |
기온 is highly overall correlated with 예보시각 and 1 other fields | High correlation |
예보일 is highly overall correlated with 기온 | High correlation |
예보시각 has 4 (4.0%) zeros | Zeros |
Reproduction
Analysis started | 2024-03-13 11:43:14.747151 |
---|---|
Analysis finished | 2024-03-13 11:43:17.238582 |
Duration | 2.49 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
예보발표일자
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
20231208 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20231208 |
---|---|
2nd row | 20231208 |
3rd row | 20231208 |
4th row | 20231208 |
5th row | 20231208 |
Common Values
Value | Count | Frequency (%) |
20231208 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20231208 | 100 |
예보발표시각
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 | |
---|---|
100 |
Length
Max length | 3 |
---|---|
Median length | 1 |
Mean length | 1.72 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 64 | |
100 | 36 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 64 | |
100 | 36 |
경도 격자점
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
60 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 60 |
---|---|
2nd row | 60 |
3rd row | 60 |
4th row | 60 |
5th row | 60 |
Common Values
Value | Count | Frequency (%) |
60 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
60 | 100 |
위도 격자점
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
127 |
---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 127 |
---|---|
2nd row | 127 |
3rd row | 127 |
4th row | 127 |
5th row | 127 |
Common Values
Value | Count | Frequency (%) |
127 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
127 | 100 |
예보일
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
20231208 | |
---|---|
20231209 | |
20231210 |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20231208 |
---|---|
2nd row | 20231208 |
3rd row | 20231208 |
4th row | 20231208 |
5th row | 20231208 |
Common Values
Value | Count | Frequency (%) |
20231208 | 47 | |
20231209 | 37 | |
20231210 | 16 | 16.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20231208 | 47 | |
20231209 | 37 | |
20231210 | 16 | 16.0% |
예보시각
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 24 |
---|---|
Distinct (%) | 24.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1026 |
Minimum | 0 |
---|---|
Maximum | 2300 |
Zeros | 4 |
Zeros (%) | 4.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 100 |
Q1 | 500 |
median | 1000 |
Q3 | 1500 |
95-th percentile | 2200 |
Maximum | 2300 |
Range | 2300 |
Interquartile range (IQR) | 1000 |
Descriptive statistics
Standard deviation | 661.74242 |
---|---|
Coefficient of variation (CV) | 0.64497312 |
Kurtosis | -0.98961273 |
Mean | 1026 |
Median Absolute Deviation (MAD) | 500 |
Skewness | 0.26921315 |
Sum | 102600 |
Variance | 437903.03 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1200 | 5 | 5.0% |
200 | 5 | 5.0% |
300 | 5 | 5.0% |
400 | 5 | 5.0% |
500 | 5 | 5.0% |
600 | 5 | 5.0% |
700 | 5 | 5.0% |
800 | 5 | 5.0% |
900 | 5 | 5.0% |
1000 | 5 | 5.0% |
Other values (14) | 50 |
Value | Count | Frequency (%) |
0 | 4 | |
100 | 5 | |
200 | 5 | |
300 | 5 | |
400 | 5 | |
500 | 5 | |
600 | 5 | |
700 | 5 | |
800 | 5 | |
900 | 5 |
Value | Count | Frequency (%) |
2300 | 3 | |
2200 | 3 | |
2100 | 3 | |
2000 | 3 | |
1900 | 3 | |
1800 | 3 | |
1700 | 3 | |
1600 | 3 | |
1500 | 4 | |
1400 | 4 |
날씨
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
맑음 | |
---|---|
구름조금 | |
흐림 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.6 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 맑음 |
---|---|
2nd row | 맑음 |
3rd row | 맑음 |
4th row | 맑음 |
5th row | 맑음 |
Common Values
Value | Count | Frequency (%) |
맑음 | 58 | |
구름조금 | 30 | |
흐림 | 12 | 12.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
맑음 | 58 | |
구름조금 | 30 | |
흐림 | 12 | 12.0% |
기온
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 66 |
---|---|
Distinct (%) | 66.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.233576 |
Minimum | 2.7748 |
---|---|
Maximum | 16.0366 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 2.7748 |
---|---|
5-th percentile | 4.27676 |
Q1 | 6.7246 |
median | 11.0454 |
Q3 | 12.60265 |
95-th percentile | 15.959 |
Maximum | 16.0366 |
Range | 13.2618 |
Interquartile range (IQR) | 5.87805 |
Descriptive statistics
Standard deviation | 3.5337463 |
---|---|
Coefficient of variation (CV) | 0.34530904 |
Kurtosis | -0.82280712 |
Mean | 10.233576 |
Median Absolute Deviation (MAD) | 2.1815 |
Skewness | -0.35188018 |
Sum | 1023.3576 |
Variance | 12.487363 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10.6988 | 2 | 2.0% |
13.7244 | 2 | 2.0% |
12.619 | 2 | 2.0% |
12.5532 | 2 | 2.0% |
11.7304 | 2 | 2.0% |
12.554 | 2 | 2.0% |
11.6698 | 2 | 2.0% |
11.5528 | 2 | 2.0% |
11.4208 | 2 | 2.0% |
10.6584 | 2 | 2.0% |
Other values (56) | 80 |
Value | Count | Frequency (%) |
2.7748 | 1 | |
2.797 | 1 | |
3.5038 | 1 | |
3.5362 | 1 | |
3.9758 | 1 | |
4.2926 | 1 | |
4.3168 | 1 | |
4.37 | 1 | |
4.698 | 1 | |
4.956 | 1 |
Value | Count | Frequency (%) |
16.0366 | 2 | |
16.0364 | 2 | |
15.959 | 2 | |
15.139 | 2 | |
15.0168 | 2 | |
14.7776 | 2 | |
13.8874 | 1 | |
13.7244 | 2 | |
13.7104 | 1 | |
13.251 | 2 |
초미세먼지등급
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 100 |
예보발표시각 | 예보일 | 예보시각 | 날씨 | 기온 | |
---|---|---|---|---|---|
예보발표시각 | 1.000 | 0.199 | 0.000 | 0.125 | 0.000 |
예보일 | 0.199 | 1.000 | 0.000 | 0.636 | 0.768 |
예보시각 | 0.000 | 0.000 | 1.000 | 0.565 | 0.807 |
날씨 | 0.125 | 0.636 | 0.565 | 1.000 | 0.576 |
기온 | 0.000 | 0.768 | 0.807 | 0.576 | 1.000 |
예보발표시각 | 예보일 | 날씨 | |
---|---|---|---|
예보발표시각 | 1.000 | 0.324 | 0.205 |
예보일 | 0.324 | 1.000 | 0.300 |
날씨 | 0.205 | 0.300 | 1.000 |
예보시각 | 기온 | 예보발표시각 | 예보일 | 날씨 | |
---|---|---|---|---|---|
예보시각 | 1.000 | 0.600 | 0.000 | 0.000 | 0.392 |
기온 | 0.600 | 1.000 | 0.000 | 0.622 | 0.403 |
예보발표시각 | 0.000 | 0.000 | 1.000 | 0.324 | 0.205 |
예보일 | 0.000 | 0.622 | 0.324 | 1.000 | 0.300 |
날씨 | 0.392 | 0.403 | 0.205 | 0.300 | 1.000 |
예보발표일자 | 예보발표시각 | 경도 격자점 | 위도 격자점 | 예보일 | 예보시각 | 날씨 | 기온 | 초미세먼지등급 | |
---|---|---|---|---|---|---|---|---|---|
0 | 20231208 | 0 | 60 | 127 | 20231208 | 0 | 맑음 | 4.956 | 2 |
1 | 20231208 | 0 | 60 | 127 | 20231208 | 100 | 맑음 | 5.7396 | 2 |
2 | 20231208 | 0 | 60 | 127 | 20231208 | 200 | 맑음 | 5.7248 | 2 |
3 | 20231208 | 0 | 60 | 127 | 20231208 | 300 | 맑음 | 6.5446 | 2 |
4 | 20231208 | 0 | 60 | 127 | 20231208 | 400 | 맑음 | 6.556 | 2 |
5 | 20231208 | 0 | 60 | 127 | 20231208 | 500 | 맑음 | 6.611 | 2 |
6 | 20231208 | 0 | 60 | 127 | 20231208 | 600 | 맑음 | 6.6736 | 2 |
7 | 20231208 | 0 | 60 | 127 | 20231208 | 700 | 맑음 | 6.7246 | 2 |
8 | 20231208 | 0 | 60 | 127 | 20231208 | 800 | 구름조금 | 7.6074 | 2 |
9 | 20231208 | 0 | 60 | 127 | 20231208 | 900 | 구름조금 | 8.888 | 2 |
예보발표일자 | 예보발표시각 | 경도 격자점 | 위도 격자점 | 예보일 | 예보시각 | 날씨 | 기온 | 초미세먼지등급 | |
---|---|---|---|---|---|---|---|---|---|
90 | 20231208 | 100 | 60 | 127 | 20231209 | 300 | 흐림 | 11.4208 | 2 |
91 | 20231208 | 100 | 60 | 127 | 20231209 | 400 | 흐림 | 10.6584 | 2 |
92 | 20231208 | 100 | 60 | 127 | 20231209 | 500 | 구름조금 | 10.68 | 2 |
93 | 20231208 | 100 | 60 | 127 | 20231209 | 600 | 구름조금 | 10.6762 | 2 |
94 | 20231208 | 100 | 60 | 127 | 20231209 | 700 | 구름조금 | 10.6742 | 2 |
95 | 20231208 | 100 | 60 | 127 | 20231209 | 800 | 맑음 | 9.8672 | 2 |
96 | 20231208 | 100 | 60 | 127 | 20231209 | 900 | 맑음 | 10.6988 | 2 |
97 | 20231208 | 100 | 60 | 127 | 20231209 | 1000 | 맑음 | 11.5488 | 2 |
98 | 20231208 | 100 | 60 | 127 | 20231209 | 1100 | 맑음 | 11.6424 | 2 |
99 | 20231208 | 100 | 60 | 127 | 20231209 | 1200 | 맑음 | 12.557 | 2 |