Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 403 |
Missing cells | 4 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 32.4 KiB |
Average record size in memory | 82.3 B |
Variable types
DateTime | 2 |
---|---|
Categorical | 5 |
Numeric | 2 |
Text | 1 |
Dataset
Description | 미세먼지 경보 조회 서비스 |
---|---|
Author | 경기도 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=T0VV0B225CV46E4HSTEE30126364&infSeq=1 |
발령농도(μg/m³) is highly overall correlated with 해제농도(μg/m³) and 2 other fields | High correlation |
해제농도(μg/m³) is highly overall correlated with 발령농도(μg/m³) and 2 other fields | High correlation |
항목명 is highly overall correlated with 발령농도(μg/m³) and 1 other fields | High correlation |
경보단계명 is highly overall correlated with 발령농도(μg/m³) and 1 other fields | High correlation |
경보단계명 is highly imbalanced (54.9%) | Imbalance |
Reproduction
Analysis started | 2024-05-10 21:32:01.110328 |
---|---|
Analysis finished | 2024-05-10 21:32:03.819682 |
Duration | 2.71 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
기준일자
Date
Distinct | 136 |
---|---|
Distinct (%) | 33.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Minimum | 2018-03-12 00:00:00 |
---|---|
Maximum | 2024-04-18 00:00:00 |
권역명
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
중부권 | |
---|---|
남부권 | |
북부권 | |
동부권 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 북부권 |
---|---|
2nd row | 북부권 |
3rd row | 중부권 |
4th row | 동부권 |
5th row | 중부권 |
Common Values
Value | Count | Frequency (%) |
중부권 | 112 | |
남부권 | 112 | |
북부권 | 93 | |
동부권 | 86 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
중부권 | 112 | |
남부권 | 112 | |
북부권 | 93 | |
동부권 | 86 |
항목명
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
PM25 | |
---|---|
PM10 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | PM10 |
---|---|
2nd row | PM10 |
3rd row | PM10 |
4th row | PM10 |
5th row | PM10 |
Common Values
Value | Count | Frequency (%) |
PM25 | 209 | |
PM10 | 194 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
pm25 | 209 | |
pm10 | 194 |
경보단계명
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
주의보 | |
---|---|
경보 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.9057072 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 주의보 |
---|---|
2nd row | 주의보 |
3rd row | 주의보 |
4th row | 주의보 |
5th row | 주의보 |
Common Values
Value | Count | Frequency (%) |
주의보 | 365 | |
경보 | 38 | 9.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
주의보 | 365 | |
경보 | 38 | 9.4% |
발령일자
Date
Distinct | 135 |
---|---|
Distinct (%) | 33.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Minimum | 2018-03-12 00:00:00 |
---|---|
Maximum | 2024-04-18 00:00:00 |
발령시간
Categorical
Distinct | 24 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
11:00 | |
---|---|
10:00 | |
12:00 | |
13:00 | |
20:00 | 25 |
Other values (19) |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 12:00 |
---|---|
2nd row | 14:00 |
3rd row | 14:00 |
4th row | 15:00 |
5th row | 15:00 |
Common Values
Value | Count | Frequency (%) |
11:00 | 35 | 8.7% |
10:00 | 33 | 8.2% |
12:00 | 29 | 7.2% |
13:00 | 28 | 6.9% |
20:00 | 25 | 6.2% |
14:00 | 25 | 6.2% |
15:00 | 23 | 5.7% |
22:00 | 21 | 5.2% |
09:00 | 20 | 5.0% |
16:00 | 18 | 4.5% |
Other values (14) | 146 |
Length
Value | Count | Frequency (%) |
11:00 | 35 | 8.7% |
10:00 | 33 | 8.2% |
12:00 | 29 | 7.2% |
13:00 | 28 | 6.9% |
20:00 | 25 | 6.2% |
14:00 | 25 | 6.2% |
15:00 | 23 | 5.7% |
22:00 | 21 | 5.2% |
09:00 | 20 | 5.0% |
16:00 | 18 | 4.5% |
Other values (14) | 146 |
발령농도(μg/m³)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 154 |
---|---|
Distinct (%) | 38.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 150.19107 |
Minimum | 64 |
---|---|
Maximum | 564 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.7 KiB |
Quantile statistics
Minimum | 64 |
---|---|
5-th percentile | 75 |
Q1 | 81 |
median | 123 |
Q3 | 174 |
95-th percentile | 358.9 |
Maximum | 564 |
Range | 500 |
Interquartile range (IQR) | 93 |
Descriptive statistics
Standard deviation | 94.332089 |
---|---|
Coefficient of variation (CV) | 0.62808056 |
Kurtosis | 3.408162 |
Mean | 150.19107 |
Median Absolute Deviation (MAD) | 44 |
Skewness | 1.8002489 |
Sum | 60527 |
Variance | 8898.543 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
81 | 21 | 5.2% |
77 | 20 | 5.0% |
79 | 19 | 4.7% |
75 | 13 | 3.2% |
78 | 12 | 3.0% |
84 | 12 | 3.0% |
76 | 12 | 3.0% |
80 | 11 | 2.7% |
85 | 9 | 2.2% |
83 | 9 | 2.2% |
Other values (144) | 265 |
Value | Count | Frequency (%) |
64 | 2 | 0.5% |
70 | 3 | 0.7% |
71 | 2 | 0.5% |
72 | 1 | 0.2% |
73 | 1 | 0.2% |
74 | 2 | 0.5% |
75 | 13 | |
76 | 12 | |
77 | 20 | |
78 | 12 |
Value | Count | Frequency (%) |
564 | 1 | |
559 | 1 | |
533 | 1 | |
524 | 1 | |
505 | 1 | |
463 | 1 | |
440 | 1 | |
434 | 1 | |
430 | 1 | |
396 | 1 |
해제일자
Text
Distinct | 133 |
---|---|
Distinct (%) | 33.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.9205955 |
Min length | 2 |
Characters and Unicode
Total characters | 3998 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 40 ? |
---|---|
Unique (%) | 9.9% |
Sample
1st row | 2024-04-18 |
---|---|
2nd row | 2024-04-18 |
3rd row | 2024-04-18 |
4th row | 2024-04-18 |
5th row | 2024-04-17 |
Value | Count | Frequency (%) |
2024-03-29 | 12 | 3.0% |
2021-05-09 | 12 | 3.0% |
2018-11-28 | 10 | 2.5% |
2022-12-13 | 10 | 2.5% |
2023-04-07 | 9 | 2.2% |
2019-03-07 | 9 | 2.2% |
2023-11-23 | 8 | 2.0% |
2020-02-22 | 8 | 2.0% |
2021-03-29 | 8 | 2.0% |
2019-12-11 | 8 | 2.0% |
Other values (123) | 309 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 992 | |
0 | 916 | |
- | 806 | |
1 | 518 | |
3 | 261 | 6.5% |
4 | 118 | 3.0% |
9 | 115 | 2.9% |
8 | 97 | 2.4% |
5 | 72 | 1.8% |
7 | 71 | 1.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 3192 | |
Dash Punctuation | 806 | 20.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 992 | |
0 | 916 | |
1 | 518 | |
3 | 261 | 8.2% |
4 | 118 | 3.7% |
9 | 115 | 3.6% |
8 | 97 | 3.0% |
5 | 72 | 2.3% |
7 | 71 | 2.2% |
6 | 32 | 1.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 806 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 3998 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 992 | |
0 | 916 | |
- | 806 | |
1 | 518 | |
3 | 261 | 6.5% |
4 | 118 | 3.0% |
9 | 115 | 2.9% |
8 | 97 | 2.4% |
5 | 72 | 1.8% |
7 | 71 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3998 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 992 | |
0 | 916 | |
- | 806 | |
1 | 518 | |
3 | 261 | 6.5% |
4 | 118 | 3.0% |
9 | 115 | 2.9% |
8 | 97 | 2.4% |
5 | 72 | 1.8% |
7 | 71 | 1.8% |
해제시간
Categorical
Distinct | 25 |
---|---|
Distinct (%) | 6.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
16:00 | |
---|---|
14:00 | |
17:00 | |
15:00 | |
13:00 | 23 |
Other values (20) |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.9801489 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 16:00 |
---|---|
2nd row | 06:00 |
3rd row | 18:00 |
4th row | 19:00 |
5th row | 02:00 |
Common Values
Value | Count | Frequency (%) |
16:00 | 43 | 10.7% |
14:00 | 35 | 8.7% |
17:00 | 29 | 7.2% |
15:00 | 25 | 6.2% |
13:00 | 23 | 5.7% |
12:00 | 23 | 5.7% |
04:00 | 22 | 5.5% |
01:00 | 21 | 5.2% |
19:00 | 18 | 4.5% |
02:00 | 18 | 4.5% |
Other values (15) | 146 |
Length
Value | Count | Frequency (%) |
16:00 | 43 | 10.7% |
14:00 | 35 | 8.7% |
17:00 | 29 | 7.2% |
15:00 | 25 | 6.2% |
13:00 | 23 | 5.7% |
12:00 | 23 | 5.7% |
04:00 | 22 | 5.5% |
01:00 | 21 | 5.2% |
19:00 | 18 | 4.5% |
02:00 | 18 | 4.5% |
Other values (15) | 146 |
해제농도(μg/m³)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 112 |
---|---|
Distinct (%) | 28.1% |
Missing | 4 |
Missing (%) | 1.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 86.511278 |
Minimum | 11 |
---|---|
Maximum | 564 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.7 KiB |
Quantile statistics
Minimum | 11 |
---|---|
5-th percentile | 25 |
Q1 | 32 |
median | 72 |
Q3 | 96 |
95-th percentile | 326.9 |
Maximum | 564 |
Range | 553 |
Interquartile range (IQR) | 64 |
Descriptive statistics
Standard deviation | 91.952163 |
---|---|
Coefficient of variation (CV) | 1.0628922 |
Kurtosis | 9.2663766 |
Mean | 86.511278 |
Median Absolute Deviation (MAD) | 39 |
Skewness | 2.9088235 |
Sum | 34518 |
Variance | 8455.2002 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
33 | 36 | 8.9% |
32 | 27 | 6.7% |
34 | 27 | 6.7% |
31 | 18 | 4.5% |
99 | 15 | 3.7% |
30 | 15 | 3.7% |
29 | 13 | 3.2% |
93 | 11 | 2.7% |
96 | 11 | 2.7% |
98 | 10 | 2.5% |
Other values (102) | 216 |
Value | Count | Frequency (%) |
11 | 1 | 0.2% |
16 | 1 | 0.2% |
20 | 2 | 0.5% |
21 | 1 | 0.2% |
22 | 2 | 0.5% |
23 | 3 | 0.7% |
24 | 6 | |
25 | 6 | |
26 | 5 | |
27 | 8 |
Value | Count | Frequency (%) |
564 | 1 | |
559 | 1 | |
533 | 1 | |
524 | 1 | |
505 | 1 | |
440 | 1 | |
434 | 1 | |
430 | 1 | |
396 | 1 | |
392 | 1 |
권역명 | 항목명 | 경보단계명 | 발령시간 | 발령농도(μg/m³) | 해제시간 | 해제농도(μg/m³) | |
---|---|---|---|---|---|---|---|
권역명 | 1.000 | 0.143 | 0.000 | 0.000 | 0.015 | 0.000 | 0.000 |
항목명 | 0.143 | 1.000 | 0.203 | 0.346 | 0.992 | 0.244 | 0.986 |
경보단계명 | 0.000 | 0.203 | 1.000 | 0.247 | 0.827 | 0.074 | 0.755 |
발령시간 | 0.000 | 0.346 | 0.247 | 1.000 | 0.396 | 0.382 | 0.424 |
발령농도(μg/m³) | 0.015 | 0.992 | 0.827 | 0.396 | 1.000 | 0.239 | 0.774 |
해제시간 | 0.000 | 0.244 | 0.074 | 0.382 | 0.239 | 1.000 | 0.232 |
해제농도(μg/m³) | 0.000 | 0.986 | 0.755 | 0.424 | 0.774 | 0.232 | 1.000 |
권역명 | 발령시간 | 경보단계명 | 항목명 | 해제시간 | |
---|---|---|---|---|---|
권역명 | 1.000 | 0.000 | 0.000 | 0.094 | 0.000 |
발령시간 | 0.000 | 1.000 | 0.190 | 0.267 | 0.105 |
경보단계명 | 0.000 | 0.190 | 1.000 | 0.130 | 0.061 |
항목명 | 0.094 | 0.267 | 0.130 | 1.000 | 0.204 |
해제시간 | 0.000 | 0.105 | 0.061 | 0.204 | 1.000 |
발령농도(μg/m³) | 해제농도(μg/m³) | 권역명 | 항목명 | 경보단계명 | 발령시간 | 해제시간 | |
---|---|---|---|---|---|---|---|
발령농도(μg/m³) | 1.000 | 0.784 | 0.005 | 0.911 | 0.653 | 0.153 | 0.084 |
해제농도(μg/m³) | 0.784 | 1.000 | 0.000 | 0.887 | 0.576 | 0.159 | 0.081 |
권역명 | 0.005 | 0.000 | 1.000 | 0.094 | 0.000 | 0.000 | 0.000 |
항목명 | 0.911 | 0.887 | 0.094 | 1.000 | 0.130 | 0.267 | 0.204 |
경보단계명 | 0.653 | 0.576 | 0.000 | 0.130 | 1.000 | 0.190 | 0.061 |
발령시간 | 0.153 | 0.159 | 0.000 | 0.267 | 0.190 | 1.000 | 0.105 |
해제시간 | 0.084 | 0.081 | 0.000 | 0.204 | 0.061 | 0.105 | 1.000 |
기준일자 | 권역명 | 항목명 | 경보단계명 | 발령일자 | 발령시간 | 발령농도(μg/m³) | 해제일자 | 해제시간 | 해제농도(μg/m³) | |
---|---|---|---|---|---|---|---|---|---|---|
0 | 2024-04-18 | 북부권 | PM10 | 주의보 | 2024-04-18 | 12:00 | 156 | 2024-04-18 | 16:00 | 93 |
1 | 2024-04-17 | 북부권 | PM10 | 주의보 | 2024-04-17 | 14:00 | 221 | 2024-04-18 | 06:00 | 99 |
2 | 2024-04-17 | 중부권 | PM10 | 주의보 | 2024-04-17 | 14:00 | 199 | 2024-04-18 | 18:00 | 80 |
3 | 2024-04-17 | 동부권 | PM10 | 주의보 | 2024-04-17 | 15:00 | 223 | 2024-04-18 | 19:00 | 84 |
4 | 2024-04-16 | 중부권 | PM10 | 주의보 | 2024-04-16 | 15:00 | 160 | 2024-04-17 | 02:00 | 96 |
5 | 2024-04-16 | 동부권 | PM10 | 주의보 | 2024-04-16 | 16:00 | 190 | 2024-04-17 | 04:00 | 98 |
6 | 2024-04-16 | 남부권 | PM10 | 주의보 | 2024-04-16 | 15:00 | 183 | 2024-04-18 | 19:00 | 99 |
7 | 2024-03-29 | 중부권 | PM10 | 주의보 | 2024-03-29 | 02:00 | 339 | 2024-03-29 | 03:00 | 346 |
8 | 2024-03-29 | 북부권 | PM10 | 경보 | 2024-03-29 | 04:00 | 362 | 2024-03-29 | 14:00 | 117 |
9 | 2024-03-29 | 동부권 | PM10 | 주의보 | 2024-03-29 | 02:00 | 274 | 2024-03-29 | 04:00 | 370 |
기준일자 | 권역명 | 항목명 | 경보단계명 | 발령일자 | 발령시간 | 발령농도(μg/m³) | 해제일자 | 해제시간 | 해제농도(μg/m³) | |
---|---|---|---|---|---|---|---|---|---|---|
393 | 2018-03-25 | 중부권 | PM10 | 주의보 | 2018-03-25 | 13:00 | 172 | 2018-03-26 | 00:00 | 93 |
394 | 2018-03-25 | 남부권 | PM10 | 주의보 | 2018-03-25 | 05:00 | 154 | 2018-03-25 | 18:00 | 97 |
395 | 2018-03-24 | 남부권 | PM25 | 주의보 | 2018-03-24 | 10:00 | 95 | 2018-03-26 | 16:00 | 47 |
396 | 2018-03-24 | 북부권 | PM25 | 주의보 | 2018-03-24 | 10:00 | 97 | 2018-03-26 | 18:00 | 44 |
397 | 2018-03-24 | 동부권 | PM25 | 주의보 | 2018-03-24 | 10:00 | 91 | 2018-03-26 | 16:00 | 48 |
398 | 2018-03-24 | 중부권 | PM25 | 주의보 | 2018-03-24 | 21:00 | 97 | 2018-03-26 | 16:00 | 45 |
399 | 2018-03-12 | 남부권 | PM25 | 주의보 | 2018-03-12 | 20:00 | 97 | 2018-03-13 | 13:00 | 35 |
400 | 2018-03-12 | 동부권 | PM25 | 주의보 | 2018-03-12 | 19:00 | 94 | 2018-03-13 | 13:00 | 41 |
401 | 2018-03-12 | 중부권 | PM10 | 주의보 | 2018-03-12 | 16:00 | 151 | 2018-03-13 | 03:00 | 98 |
402 | 2018-03-12 | 중부권 | PM25 | 주의보 | 2018-03-12 | 15:00 | 94 | 2018-03-13 | 08:00 | 48 |