Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 487 |
Missing cells (%) | 0.8% |
Duplicate rows | 32 |
Duplicate rows (%) | 0.3% |
Total size in memory | 585.9 KiB |
Average record size in memory | 60.0 B |
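The percentage and per-record figures above can be rechecked from the raw counts. The sketch below is a minimal consistency check, assuming the missing-cell share is taken over all cells (observations × variables), the duplicate share over all rows, and 1 KiB = 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Counts copied from the dataset statistics table above.
n_variables = 6
n_observations = 10_000
missing_cells = 487
duplicate_rows = 32
total_size_kib = 585.9

# Assumption: missing-cell share is computed over every cell (rows x columns).
missing_pct = 100 * missing_cells / (n_observations * n_variables)
# Assumption: duplicate share is computed over the number of rows.
duplicate_pct = 100 * duplicate_rows / n_observations
# Assumption: 1 KiB = 1024 bytes, spread evenly over all rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the three values round to the 0.8%, 0.3% and 60.0 B reported in the table.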
Variable types
DateTime | 1 |
---|---|
Numeric | 4 |
Categorical | 1 |
Dataset
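Description | Drought information for rice paddies in 2022 and the changed reservoir storage rate (저수율), reported by city and county (시군). Drought information is given relative to the reference date: (N: current) (1: 1 month) (2: 2 months) (3: 3 months) |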
---|---|
URL | https://www.data.go.kr/data/15117185/fileData.do |
Alerts
Dataset has 32 (0.3%) duplicate rows | Duplicates |
저수율 is highly overall correlated with 평년 and 1 other field | High correlation |
평년 is highly overall correlated with 저수율 | High correlation |
평년대비 is highly overall correlated with 저수율 and 1 other field | High correlation |
가뭄단계 is highly overall correlated with 평년대비 | High correlation |
가뭄단계 is highly imbalanced (85.3%) | Imbalance |
표준코드 has 487 (4.9%) missing values | Missing |
저수율 has 710 (7.1%) zeros | Zeros |
평년 has 710 (7.1%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 10:19:09.821970 |
---|---|
Analysis finished | 2023-12-12 10:19:12.693153 |
Duration | 2.87 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
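The reported duration follows directly from the two timestamps. A minimal sketch using only the standard library, with the timestamps copied from the table above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2023-12-12 10:19:09.821970")
finished = datetime.fromisoformat("2023-12-12 10:19:12.693153")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```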
기준일자
Date
Distinct | 365 |
---|---|
Distinct (%) | 3.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2022-01-01 00:00:00 |
---|---|
Maximum | 2022-12-31 00:00:00 |
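The distinct count of 365 can be compared with the number of calendar days between the minimum and maximum. The sketch below assumes every 기준일자 value is a midnight timestamp, as the minimum and maximum are; under that assumption, a match means each day of 2022 appears at least once.

```python
from datetime import date

# Minimum and maximum dates copied from the table above.
first_day = date.fromisoformat("2022-01-01")
last_day = date.fromisoformat("2022-12-31")

# Inclusive number of calendar days in the covered range.
calendar_days = (last_day - first_day).days + 1
distinct_reported = 365

print(calendar_days, calendar_days == distinct_reported)
```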
표준코드
Real number (ℝ)
MISSING
Distinct | 159 |
---|---|
Distinct (%) | 1.7% |
Missing | 487 |
Missing (%) | 4.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 44510.158 |
Minimum | 26710 |
---|---|
Maximum | 50130 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 26710 |
---|---|
5-th percentile | 41170 |
Q1 | 42170 |
median | 44825 |
Q3 | 47170 |
95-th percentile | 48840 |
Maximum | 50130 |
Range | 23420 |
Interquartile range (IQR) | 5000 |
Descriptive statistics
Standard deviation | 3780.6259 |
---|---|
Coefficient of variation (CV) | 0.084938496 |
Kurtosis | 7.0310518 |
Mean | 44510.158 |
Median Absolute Deviation (MAD) | 2425 |
Skewness | -2.0598151 |
Sum | 4.2342513 × 10⁸ |
Variance | 14293132 |
Monotonicity | Not monotonic |
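Several of the 표준코드 statistics above are derived from others, so they can be cross-checked. The sketch below is a consistency check on the rounded values printed in the report, not a recomputation from the data. It assumes the sum and mean are taken over non-missing values only (10000 − 487 rows).

```python
# Values copied from the 표준코드 tables above (already rounded by the report).
minimum, maximum = 26710, 50130
q1, q3 = 42170, 47170
mean = 44510.158
std = 3780.6259
non_missing = 10_000 - 487  # assumption: statistics ignore missing values

value_range = maximum - minimum   # reported: 23420
iqr = q3 - q1                     # reported: 5000
cv = std / mean                   # reported: 0.084938496
variance = std ** 2               # reported: 14293132
total = mean * non_missing        # reported: 4.2342513 x 10^8

print(value_range, iqr)
print(f"CV ~ {cv:.6f}, variance ~ {variance:.0f}, sum ~ {total:.4e}")
```

Small differences in the last digits are expected because the inputs are themselves rounded.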
Common values
Value | Count | Frequency (%) |
---|---|---|
42170 | 79 | 0.8% |
42820 | 75 | 0.8% |
46890 | 75 | 0.8% |
46830 | 73 | 0.7% |
44250 | 73 | 0.7% |
50110 | 72 | 0.7% |
48860 | 72 | 0.7% |
44825 | 71 | 0.7% |
44760 | 71 | 0.7% |
42800 | 70 | 0.7% |
Other values (149) | 8782 | 87.8% |
(Missing) | 487 | 4.9% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
26710 | 57 | 0.6% |
27710 | 59 | 0.6% |
28710 | 56 | 0.6% |
28720 | 54 | 0.5% |
31710 | 53 | 0.5% |
41110 | 49 | 0.5% |
41130 | 65 | 0.7% |
41150 | 60 | 0.6% |
41170 | 54 | 0.5% |
41190 | 58 | 0.6% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
50130 | 68 | 0.7% |
50110 | 72 | 0.7% |
48890 | 46 | 0.5% |
48880 | 63 | 0.6% |
48870 | 54 | 0.5% |
48860 | 72 | 0.7% |
48850 | 63 | 0.6% |
48840 | 62 | 0.6% |
48820 | 58 | 0.6% |
48740 | 49 | 0.5% |
저수율
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 80 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 71.5093 |
Minimum | 0 |
---|---|
Maximum | 110 |
Zeros | 710 |
Zeros (%) | 7.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 60 |
median | 79 |
Q3 | 90 |
95-th percentile | 99 |
Maximum | 110 |
Range | 110 |
Interquartile range (IQR) | 30 |
Descriptive statistics
Standard deviation | 25.61492 |
---|---|
Coefficient of variation (CV) | 0.35820404 |
Kurtosis | 1.7193784 |
Mean | 71.5093 |
Median Absolute Deviation (MAD) | 14 |
Skewness | -1.4385966 |
Sum | 715093 |
Variance | 656.12413 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 710 | 7.1% |
100 | 343 | 3.4% |
85 | 277 | 2.8% |
82 | 271 | 2.7% |
94 | 262 | 2.6% |
93 | 261 | 2.6% |
95 | 260 | 2.6% |
88 | 254 | 2.5% |
96 | 247 | 2.5% |
87 | 245 | 2.5% |
Other values (70) | 6870 | 68.7% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 710 | 7.1% |
17 | 1 | < 0.1% |
28 | 1 | < 0.1% |
29 | 1 | < 0.1% |
30 | 7 | 0.1% |
31 | 6 | 0.1% |
32 | 6 | 0.1% |
33 | 9 | 0.1% |
34 | 13 | 0.1% |
35 | 14 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
110 | 2 | < 0.1% |
109 | 13 | 0.1% |
108 | 6 | 0.1% |
107 | 4 | < 0.1% |
106 | 4 | < 0.1% |
100 | 343 | 3.4% |
99 | 178 | 1.8% |
98 | 201 | 2.0% |
97 | 198 | 2.0% |
96 | 247 | 2.5% |
평년
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 65 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 68.9144 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 710 |
Zeros (%) | 7.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 64 |
median | 74 |
Q3 | 81 |
95-th percentile | 91 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 17 |
Descriptive statistics
Standard deviation | 21.764279 |
---|---|
Coefficient of variation (CV) | 0.31581613 |
Kurtosis | 4.3712949 |
Mean | 68.9144 |
Median Absolute Deviation (MAD) | 8 |
Skewness | -2.0939301 |
Sum | 689144 |
Variance | 473.68384 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 710 | 7.1% |
75 | 378 | 3.8% |
77 | 372 | 3.7% |
80 | 350 | 3.5% |
71 | 341 | 3.4% |
78 | 332 | 3.3% |
79 | 309 | 3.1% |
83 | 307 | 3.1% |
73 | 304 | 3.0% |
74 | 294 | 2.9% |
Other values (55) | 6303 | 63.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 710 | 7.1% |
31 | 1 | < 0.1% |
37 | 3 | < 0.1% |
38 | 10 | 0.1% |
39 | 12 | 0.1% |
40 | 8 | 0.1% |
42 | 2 | < 0.1% |
43 | 2 | < 0.1% |
44 | 3 | < 0.1% |
45 | 9 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
100 | 71 | 0.7% |
99 | 6 | 0.1% |
98 | 15 | 0.1% |
97 | 35 | 0.4% |
96 | 63 | 0.6% |
95 | 42 | 0.4% |
94 | 86 | 0.9% |
93 | 68 | 0.7% |
92 | 76 | 0.8% |
91 | 80 | 0.8% |
평년대비
Real number (ℝ)
HIGH CORRELATION
Distinct | 146 |
---|---|
Distinct (%) | 1.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 104.1749 |
Minimum | 34 |
---|---|
Maximum | 283 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 34 |
---|---|
5-th percentile | 71 |
Q1 | 95 |
median | 104 |
Q3 | 116 |
95-th percentile | 134 |
Maximum | 283 |
Range | 249 |
Interquartile range (IQR) | 21 |
Descriptive statistics
Standard deviation | 19.404205 |
---|---|
Coefficient of variation (CV) | 0.18626564 |
Kurtosis | 2.0803914 |
Mean | 104.1749 |
Median Absolute Deviation (MAD) | 11 |
Skewness | 0.23289693 |
Sum | 1041749 |
Variance | 376.52316 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
100 | 1049 | 10.5% |
105 | 265 | 2.6% |
107 | 253 | 2.5% |
106 | 251 | 2.5% |
104 | 233 | 2.3% |
109 | 227 | 2.3% |
103 | 217 | 2.2% |
102 | 206 | 2.1% |
101 | 202 | 2.0% |
110 | 201 | 2.0% |
Other values (136) | 6896 | 69.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
34 | 1 | < 0.1% |
39 | 2 | < 0.1% |
42 | 1 | < 0.1% |
43 | 2 | < 0.1% |
44 | 6 | 0.1% |
45 | 2 | < 0.1% |
46 | 6 | 0.1% |
47 | 4 | < 0.1% |
48 | 6 | 0.1% |
49 | 7 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
283 | 1 | < 0.1% |
211 | 1 | < 0.1% |
198 | 1 | < 0.1% |
195 | 2 | < 0.1% |
190 | 1 | < 0.1% |
183 | 1 | < 0.1% |
182 | 5 | 0.1% |
181 | 1 | < 0.1% |
180 | 5 | 0.1% |
179 | 7 | 0.1% |
가뭄단계
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
정상 | 9513 |
---|---|
관심 | 326 |
주의 | 115 |
경계 | 43 |
심각 | 3 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 정상 |
---|---|
2nd row | 정상 |
3rd row | 정상 |
4th row | 정상 |
5th row | 정상 |
Common Values
Value | Count | Frequency (%) |
정상 | 9513 | 95.1% |
관심 | 326 | 3.3% |
주의 | 115 | 1.1% |
경계 | 43 | 0.4% |
심각 | 3 | < 0.1% |
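The 85.3% imbalance figure in the alerts is consistent with a normalized-entropy score, 1 − H / log(k), where H is the Shannon entropy of the class shares and k the number of classes. Whether this is exactly how ydata-profiling computes the score is an assumption here. The sketch only shows that this formula reproduces the reported value from the counts above.

```python
import math

# Class counts copied from the Common Values table above.
counts = {"정상": 9513, "관심": 326, "주의": 115, "경계": 43, "심각": 3}

total = sum(counts.values())
shares = [count / total for count in counts.values()]

# Shannon entropy of the class distribution (natural log; the base cancels out).
entropy = -sum(p * math.log(p) for p in shares)

# Assumed score: 0 for a perfectly balanced column, 1 for a single-class column.
imbalance = 1 - entropy / math.log(len(counts))

print(f"{imbalance:.1%}")
```

A score this close to 1 reflects that about 95% of rows fall in the 정상 class, so the column carries little information for most records.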
Variable | 표준코드 | 저수율 | 평년 | 평년대비 | 가뭄단계 |
---|---|---|---|---|---|
표준코드 | 1.000 | 0.466 | 0.413 | 0.290 | 0.153 |
저수율 | 0.466 | 1.000 | 0.726 | 0.637 | 0.838 |
평년 | 0.413 | 0.726 | 1.000 | 0.441 | 0.128 |
평년대비 | 0.290 | 0.637 | 0.441 | 1.000 | 0.718 |
가뭄단계 | 0.153 | 0.838 | 0.128 | 0.718 | 1.000 |
Variable | 표준코드 | 저수율 | 평년 | 평년대비 | 가뭄단계 |
---|---|---|---|---|---|
표준코드 | 1.000 | -0.257 | -0.207 | -0.154 | 0.103 |
저수율 | -0.257 | 1.000 | 0.691 | 0.694 | 0.500 |
평년 | -0.207 | 0.691 | 1.000 | 0.040 | 0.078 |
평년대비 | -0.154 | 0.694 | 0.040 | 1.000 | 0.521 |
가뭄단계 | 0.103 | 0.500 | 0.078 | 0.521 | 1.000 |
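The high-correlation alerts listed near the top of the report line up with the entries of the matrix directly above whose absolute value exceeds 0.5. The sketch below makes that comparison explicit. The 0.5 cutoff and the strict inequality are assumptions inferred from the numbers, not settings confirmed by the report.

```python
names = ["표준코드", "저수율", "평년", "평년대비", "가뭄단계"]

# Values copied from the second correlation matrix above.
matrix = [
    [1.000, -0.257, -0.207, -0.154, 0.103],
    [-0.257, 1.000, 0.691, 0.694, 0.500],
    [-0.207, 0.691, 1.000, 0.040, 0.078],
    [-0.154, 0.694, 0.040, 1.000, 0.521],
    [0.103, 0.500, 0.078, 0.521, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, applied as a strict inequality

# For each variable, list the other variables whose |correlation| exceeds the cutoff.
high = {
    name: [
        other
        for j, other in enumerate(names)
        if i != j and abs(matrix[i][j]) > THRESHOLD
    ]
    for i, name in enumerate(names)
}

for name, partners in high.items():
    print(name, partners)
```

With these assumptions, 저수율 pairs with 평년 and 평년대비, 평년 with 저수율, 평년대비 with 저수율 and 가뭄단계, and 가뭄단계 with 평년대비, which matches the alerts. The 저수율 and 가뭄단계 entry of exactly 0.500 falls just short of the strict cutoff.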
Index | 기준일자 | 표준코드 | 저수율 | 평년 | 평년대비 | 가뭄단계 |
---|---|---|---|---|---|---|
32202 | 2022-03-24 | 47850 | 93 | 84 | 111 | 정상 |
39674 | 2022-09-12 | 46170 | 43 | 56 | 76 | 정상 |
39437 | 2022-01-18 | 46170 | 68 | 58 | 118 | 정상 |
273 | 2022-10-01 | 42150 | 88 | 76 | 116 | 정상 |
35785 | 2022-01-16 | <NA> | 76 | 77 | 99 | 정상 |
42690 | 2022-12-17 | 46870 | 46 | 64 | 73 | 정상 |
6233 | 2022-01-29 | 42730 | 87 | 88 | 100 | 정상 |
24098 | 2022-01-09 | 48890 | 81 | 74 | 111 | 정상 |
40010 | 2022-08-14 | 46710 | 40 | 60 | 67 | 관심 |
43865 | 2022-03-07 | 46800 | 71 | 72 | 98 | 정상 |
Index | 기준일자 | 표준코드 | 저수율 | 평년 | 평년대비 | 가뭄단계 |
---|---|---|---|---|---|---|
19548 | 2022-07-23 | 48840 | 47 | 83 | 57 | 주의 |
21023 | 2022-08-07 | 48330 | 81 | 85 | 96 | 정상 |
1186 | 2022-04-02 | 42230 | 100 | 99 | 102 | 정상 |
12557 | 2022-05-28 | 41170 | 0 | 0 | 100 | 정상 |
2209 | 2022-01-20 | 42830 | 97 | 90 | 108 | 정상 |
52364 | 2022-06-19 | 44710 | 67 | 60 | 113 | 정상 |
33539 | 2022-11-21 | 27710 | 82 | 71 | 117 | 정상 |
52738 | 2022-06-28 | 44230 | 44 | 50 | 87 | 정상 |
27286 | 2022-10-04 | 47920 | 78 | 49 | 162 | 정상 |
27983 | 2022-09-01 | 47840 | 42 | 68 | 61 | 관심 |
Most frequently occurring
Index | 기준일자 | 표준코드 | 저수율 | 평년 | 평년대비 | 가뭄단계 | # duplicates |
---|---|---|---|---|---|---|---|
16 | 2022-06-11 | <NA> | 0 | 0 | 100 | 정상 | 3 |
17 | 2022-06-13 | <NA> | 0 | 0 | 100 | 정상 | 3 |
22 | 2022-08-24 | <NA> | 0 | 0 | 100 | 정상 | 3 |
0 | 2022-01-11 | <NA> | 0 | 0 | 100 | 정상 | 2 |
1 | 2022-01-12 | <NA> | 0 | 0 | 100 | 정상 | 2 |
2 | 2022-01-23 | <NA> | 0 | 0 | 100 | 정상 | 2 |
3 | 2022-02-05 | <NA> | 0 | 0 | 100 | 정상 | 2 |
4 | 2022-02-19 | <NA> | 0 | 0 | 100 | 정상 | 2 |
5 | 2022-02-28 | <NA> | 0 | 0 | 100 | 정상 | 2 |
6 | 2022-03-03 | <NA> | 0 | 0 | 100 | 정상 | 2 |
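A table like the one above can be built by grouping fully identical rows and counting each group. The sketch below shows the idea on a toy list, assuming "# duplicates" is the number of occurrences of an identical row. The toy rows expand two entries of the table by their counts and add one unique sample row. `None` stands in for `<NA>`, and the names `rows` and `repeated` are illustrative.

```python
from collections import Counter

# Toy rows: (기준일자, 표준코드, 저수율, 평년, 평년대비, 가뭄단계).
rows = (
    [("2022-06-11", None, 0, 0, 100, "정상")] * 3
    + [("2022-01-11", None, 0, 0, 100, "정상")] * 2
    + [("2022-03-24", 47850, 93, 84, 111, "정상")]
)

# Count how often each identical row occurs, then keep only repeated rows.
occurrences = Counter(rows)
repeated = {row: n for row, n in occurrences.items() if n > 1}

# Most frequent groups first, as in the table above.
for row, n in sorted(repeated.items(), key=lambda item: -item[1]):
    print(row[0], n)
```

All of the most frequent duplicates shown above have a missing 표준코드 and zero 저수율 and 평년, so they may be worth inspecting as placeholder records before deduplicating.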