Overview

Dataset statistics

Number of variables27
Number of observations10000
Missing cells7465
Missing cells (%)2.8%
Duplicate rows34
Duplicate rows (%)0.3%
Total size in memory2.3 MiB
Average record size in memory243.0 B

Variable types

Categorical16
Numeric11

Dataset

Description대기오염측정망(측정소, 망이름, 아황산가스농도, 일산화탄소농도, 오존농도, 이상화질소농도, 미세먼지농도 등) 정보를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=112

Alerts

Dataset has 34 (0.3%) duplicate rowsDuplicates
망이름 is highly imbalanced (52.1%)Imbalance
아황산가스지수 is highly imbalanced (67.7%)Imbalance
일산화탄소지수 is highly imbalanced (78.6%)Imbalance
이산화탄소지수 is highly imbalanced (74.6%)Imbalance
CO상태정보 is highly imbalanced (81.2%)Imbalance
미세먼지(PM2.5)상태정보 is highly imbalanced (75.6%)Imbalance
미세먼지(PM10)상태정보 is highly imbalanced (78.0%)Imbalance
이산화질소상태정보 is highly imbalanced (82.3%)Imbalance
오존상태정보 is highly imbalanced (81.8%)Imbalance
아황산가스상태정보 is highly imbalanced (70.0%)Imbalance
아황산가스농도 has 1130 (11.3%) missing valuesMissing
일산화탄소농도 has 616 (6.2%) missing valuesMissing
오존농도 has 600 (6.0%) missing valuesMissing
이산화질소농도 has 584 (5.8%) missing valuesMissing
미세먼지(PM10)농도 has 750 (7.5%) missing valuesMissing
미세먼지(PM25)농도 has 853 (8.5%) missing valuesMissing
통합대기환경수치 has 1112 (11.1%) missing valuesMissing
미세먼지(PM10)24시간예측이동농도 has 837 (8.4%) missing valuesMissing
미세먼지(PM2.5)24시간예측이동농도 has 983 (9.8%) missing valuesMissing
시간코드 has 409 (4.1%) zerosZeros

Reproduction

Analysis started2024-03-13 11:47:34.518660
Analysis finished2024-03-13 11:47:35.425506
Duration0.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

측정소코드
Categorical

Distinct46
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
충남_백석동
 
259
충남_금산읍
 
247
충남_홍성읍
 
243
충남_이원면
 
237
충남_서면
 
236
Other values (41)
8778 

Length

Max length8
Median length6
Mean length6.0632
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충남_독곶리
2nd row충남_주교면
3rd row충남_신방동
4th row충남_청양읍
5th row충남_대산항

Common Values

ValueCountFrequency (%)
충남_백석동 259
 
2.6%
충남_금산읍 247
 
2.5%
충남_홍성읍 243
 
2.4%
충남_이원면 237
 
2.4%
충남_서면 236
 
2.4%
충남_성황동 236
 
2.4%
충남_외연도 235
 
2.4%
충남_송산면 234
 
2.3%
충남_독곶리 230
 
2.3%
충남_연무읍 229
 
2.3%
Other values (36) 7614
76.1%

Length

2024-03-13T20:47:35.856936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
충남_백석동 259
 
2.6%
충남_금산읍 247
 
2.5%
충남_홍성읍 243
 
2.4%
충남_이원면 237
 
2.4%
충남_서면 236
 
2.4%
충남_성황동 236
 
2.4%
충남_외연도 235
 
2.4%
충남_송산면 234
 
2.3%
충남_독곶리 230
 
2.3%
충남_연무읍 229
 
2.3%
Other values (36) 7614
76.1%

기준일자
Real number (ℝ)

Distinct99
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20210683
Minimum20210226
Maximum20211111
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T20:47:36.048609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20210226
5-th percentile20210311
Q120210505
median20210709
Q320210906
95-th percentile20211020
Maximum20211111
Range885
Interquartile range (IQR)401

Descriptive statistics

Standard deviation239.4254
Coefficient of variation (CV)1.1846478 × 10-5
Kurtosis-1.1298295
Mean20210683
Median Absolute Deviation (MAD)198
Skewness-0.072500827
Sum2.0210683 × 1011
Variance57324.524
MonotonicityNot monotonic
2024-03-13T20:47:36.238189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20210528 158
 
1.6%
20210709 130
 
1.3%
20210802 130
 
1.3%
20210906 127
 
1.3%
20210624 126
 
1.3%
20210517 126
 
1.3%
20210825 124
 
1.2%
20210531 124
 
1.2%
20210829 124
 
1.2%
20210630 124
 
1.2%
Other values (89) 8707
87.1%
ValueCountFrequency (%)
20210226 97
1.0%
20210227 87
0.9%
20210301 112
1.1%
20210302 109
1.1%
20210308 92
0.9%
20210311 84
0.8%
20210312 105
1.1%
20210314 103
1.0%
20210319 92
0.9%
20210320 90
0.9%
ValueCountFrequency (%)
20211111 28
 
0.3%
20211110 32
 
0.3%
20211109 53
0.5%
20211107 44
0.4%
20211103 56
0.6%
20211101 58
0.6%
20211030 99
1.0%
20211028 65
0.7%
20211026 48
0.5%
20211020 108
1.1%

시간코드
Real number (ℝ)

ZEROS 

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.4846
Minimum0
Maximum23
Zeros409
Zeros (%)4.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T20:47:36.393936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q16
median11
Q317
95-th percentile22
Maximum23
Range23
Interquartile range (IQR)11

Descriptive statistics

Standard deviation6.8859207
Coefficient of variation (CV)0.59957863
Kurtosis-1.182628
Mean11.4846
Median Absolute Deviation (MAD)6
Skewness0.0048597699
Sum114846
Variance47.415904
MonotonicityNot monotonic
2024-03-13T20:47:36.564681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
8 463
 
4.6%
16 460
 
4.6%
10 445
 
4.5%
15 442
 
4.4%
23 431
 
4.3%
1 430
 
4.3%
12 430
 
4.3%
5 425
 
4.2%
14 422
 
4.2%
3 420
 
4.2%
Other values (14) 5632
56.3%
ValueCountFrequency (%)
0 409
4.1%
1 430
4.3%
2 400
4.0%
3 420
4.2%
4 383
3.8%
5 425
4.2%
6 414
4.1%
7 406
4.1%
8 463
4.6%
9 412
4.1%
ValueCountFrequency (%)
23 431
4.3%
22 403
4.0%
21 398
4.0%
20 409
4.1%
19 389
3.9%
18 410
4.1%
17 401
4.0%
16 460
4.6%
15 442
4.4%
14 422
4.2%

망이름
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
도시대기
8010 
항만
 
667
국가배경농도
 
454
도로변대기
 
435
교외대기
 
434

Length

Max length6
Median length4
Mean length4.0009
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row도시대기
2nd row도시대기
3rd row도시대기
4th row도시대기
5th row항만

Common Values

ValueCountFrequency (%)
도시대기 8010
80.1%
항만 667
 
6.7%
국가배경농도 454
 
4.5%
도로변대기 435
 
4.3%
교외대기 434
 
4.3%

Length

2024-03-13T20:47:36.759261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:36.916295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
도시대기 8010
80.1%
항만 667
 
6.7%
국가배경농도 454
 
4.5%
도로변대기 435
 
4.3%
교외대기 434
 
4.3%

아황산가스농도
Real number (ℝ)

MISSING 

Distinct23
Distinct (%)0.3%
Missing1130
Missing (%)11.3%
Infinite0
Infinite (%)0.0%
Mean0.0034349493
Minimum0
Maximum0.022
Zeros10
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T20:47:37.049504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.001
Q10.003
median0.003
Q30.004
95-th percentile0.005
Maximum0.022
Range0.022
Interquartile range (IQR)0.001

Descriptive statistics

Standard deviation0.0014627315
Coefficient of variation (CV)0.42583788
Kurtosis22.190059
Mean0.0034349493
Median Absolute Deviation (MAD)0.001
Skewness2.7825602
Sum30.468
Variance2.1395835 × 10-6
MonotonicityNot monotonic
2024-03-13T20:47:37.316224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
0.003 3499
35.0%
0.004 2820
28.2%
0.002 805
 
8.1%
0.001 667
 
6.7%
0.005 654
 
6.5%
0.006 207
 
2.1%
0.007 87
 
0.9%
0.008 37
 
0.4%
0.01 20
 
0.2%
0.009 20
 
0.2%
Other values (13) 54
 
0.5%
(Missing) 1130
 
11.3%
ValueCountFrequency (%)
0.0 10
 
0.1%
0.001 667
 
6.7%
0.002 805
 
8.1%
0.003 3499
35.0%
0.004 2820
28.2%
0.005 654
 
6.5%
0.006 207
 
2.1%
0.007 87
 
0.9%
0.008 37
 
0.4%
0.009 20
 
0.2%
ValueCountFrequency (%)
0.022 1
 
< 0.1%
0.021 1
 
< 0.1%
0.02 1
 
< 0.1%
0.019 1
 
< 0.1%
0.018 2
 
< 0.1%
0.017 4
< 0.1%
0.016 6
0.1%
0.015 2
 
< 0.1%
0.014 2
 
< 0.1%
0.013 5
0.1%

일산화탄소농도
Real number (ℝ)

MISSING 

Distinct21
Distinct (%)0.2%
Missing616
Missing (%)6.2%
Infinite0
Infinite (%)0.0%
Mean0.37535166
Minimum0
Maximum5.6
Zeros19
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T20:47:37.526849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.2
Q10.3
median0.4
Q30.4
95-th percentile0.6
Maximum5.6
Range5.6
Interquartile range (IQR)0.1

Descriptive statistics

Standard deviation0.158677
Coefficient of variation (CV)0.42274224
Kurtosis148.60479
Mean0.37535166
Median Absolute Deviation (MAD)0.1
Skewness5.9977945
Sum3522.3
Variance0.025178391
MonotonicityNot monotonic
2024-03-13T20:47:37.773753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
0.3 2930
29.3%
0.4 2645
26.5%
0.5 1393
13.9%
0.2 1351
13.5%
0.6 543
 
5.4%
0.7 191
 
1.9%
0.1 168
 
1.7%
0.8 79
 
0.8%
0.9 27
 
0.3%
0.0 19
 
0.2%
Other values (11) 38
 
0.4%
(Missing) 616
 
6.2%
ValueCountFrequency (%)
0.0 19
 
0.2%
0.1 168
 
1.7%
0.2 1351
13.5%
0.3 2930
29.3%
0.4 2645
26.5%
0.5 1393
13.9%
0.6 543
 
5.4%
0.7 191
 
1.9%
0.8 79
 
0.8%
0.9 27
 
0.3%
ValueCountFrequency (%)
5.6 1
 
< 0.1%
3.2 1
 
< 0.1%
3.0 1
 
< 0.1%
2.1 1
 
< 0.1%
1.8 2
 
< 0.1%
1.5 4
< 0.1%
1.4 2
 
< 0.1%
1.3 4
< 0.1%
1.2 4
< 0.1%
1.1 5
0.1%

오존농도
Real number (ℝ)

MISSING 

Distinct122
Distinct (%)1.3%
Missing600
Missing (%)6.0%
Infinite0
Infinite (%)0.0%
Mean0.039505957
Minimum0
Maximum0.144
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T20:47:38.028964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.009
Q10.025
median0.039
Q30.052
95-th percentile0.072
Maximum0.144
Range0.144
Interquartile range (IQR)0.027

Descriptive statistics

Standard deviation0.019215112
Coefficient of variation (CV)0.48638517
Kurtosis0.26145177
Mean0.039505957
Median Absolute Deviation (MAD)0.013
Skewness0.41590138
Sum371.356
Variance0.00036922052
MonotonicityNot monotonic
2024-03-13T20:47:38.267808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.032 219
 
2.2%
0.04 193
 
1.9%
0.041 193
 
1.9%
0.045 192
 
1.9%
0.033 187
 
1.9%
0.034 186
 
1.9%
0.037 186
 
1.9%
0.046 186
 
1.9%
0.048 185
 
1.8%
0.036 180
 
1.8%
Other values (112) 7493
74.9%
(Missing) 600
 
6.0%
ValueCountFrequency (%)
0.0 1
 
< 0.1%
0.002 15
 
0.1%
0.003 52
0.5%
0.004 58
0.6%
0.005 64
0.6%
0.006 62
0.6%
0.007 85
0.9%
0.008 84
0.8%
0.009 69
0.7%
0.01 69
0.7%
ValueCountFrequency (%)
0.144 1
< 0.1%
0.136 1
< 0.1%
0.129 1
< 0.1%
0.126 1
< 0.1%
0.123 1
< 0.1%
0.122 2
< 0.1%
0.118 1
< 0.1%
0.116 2
< 0.1%
0.115 1
< 0.1%
0.113 1
< 0.1%

이산화질소농도
Real number (ℝ)

MISSING 

Distinct68
Distinct (%)0.7%
Missing584
Missing (%)5.8%
Infinite0
Infinite (%)0.0%
Mean0.010670667
Minimum0
Maximum0.14
Zeros3
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T20:47:38.461253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.003
Q10.006
median0.008
Q30.013
95-th percentile0.026
Maximum0.14
Range0.14
Interquartile range (IQR)0.007

Descriptive statistics

Standard deviation0.007777985
Coefficient of variation (CV)0.72891274
Kurtosis16.087616
Mean0.010670667
Median Absolute Deviation (MAD)0.003
Skewness2.7377119
Sum100.475
Variance6.0497051 × 10-5
MonotonicityNot monotonic
2024-03-13T20:47:38.662308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.005 942
 
9.4%
0.006 905
 
9.0%
0.007 882
 
8.8%
0.008 742
 
7.4%
0.004 735
 
7.3%
0.009 695
 
7.0%
0.01 578
 
5.8%
0.011 458
 
4.6%
0.003 387
 
3.9%
0.012 382
 
3.8%
Other values (58) 2710
27.1%
(Missing) 584
 
5.8%
ValueCountFrequency (%)
0.0 3
 
< 0.1%
0.001 14
 
0.1%
0.002 117
 
1.2%
0.003 387
3.9%
0.004 735
7.3%
0.005 942
9.4%
0.006 905
9.0%
0.007 882
8.8%
0.008 742
7.4%
0.009 695
7.0%
ValueCountFrequency (%)
0.14 1
< 0.1%
0.083 1
< 0.1%
0.079 1
< 0.1%
0.077 1
< 0.1%
0.074 1
< 0.1%
0.07 1
< 0.1%
0.065 1
< 0.1%
0.062 1
< 0.1%
0.061 2
< 0.1%
0.059 2
< 0.1%

미세먼지(PM10)농도
Real number (ℝ)

MISSING 

Distinct217
Distinct (%)2.3%
Missing750
Missing (%)7.5%
Infinite0
Infinite (%)0.0%
Mean34.232324
Minimum1
Maximum365
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T20:47:38.897352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9
Q117
median27
Q342
95-th percentile85
Maximum365
Range364
Interquartile range (IQR)25

Descriptive statistics

Standard deviation28.82256
Coefficient of variation (CV)0.84196911
Kurtosis16.916043
Mean34.232324
Median Absolute Deviation (MAD)11
Skewness3.2383293
Sum316649
Variance830.73994
MonotonicityNot monotonic
2024-03-13T20:47:39.082421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20 285
 
2.9%
18 283
 
2.8%
27 276
 
2.8%
15 262
 
2.6%
17 261
 
2.6%
16 258
 
2.6%
22 257
 
2.6%
21 252
 
2.5%
19 251
 
2.5%
23 236
 
2.4%
Other values (207) 6629
66.3%
(Missing) 750
 
7.5%
ValueCountFrequency (%)
1 3
 
< 0.1%
2 21
 
0.2%
3 27
 
0.3%
4 39
 
0.4%
5 54
 
0.5%
6 76
0.8%
7 107
1.1%
8 116
1.2%
9 141
1.4%
10 170
1.7%
ValueCountFrequency (%)
365 1
< 0.1%
340 1
< 0.1%
327 1
< 0.1%
299 1
< 0.1%
276 1
< 0.1%
273 1
< 0.1%
262 1
< 0.1%
248 1
< 0.1%
243 1
< 0.1%
238 2
< 0.1%

미세먼지(PM25)농도
Real number (ℝ)

MISSING 

Distinct123
Distinct (%)1.3%
Missing853
Missing (%)8.5%
Infinite0
Infinite (%)0.0%
Mean17.368973
Minimum0
Maximum136
Zeros14
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T20:47:39.257696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q18
median14
Q322
95-th percentile46
Maximum136
Range136
Interquartile range (IQR)14

Descriptive statistics

Standard deviation15.138712
Coefficient of variation (CV)0.87159508
Kurtosis11.432953
Mean17.368973
Median Absolute Deviation (MAD)6
Skewness2.699299
Sum158874
Variance229.18059
MonotonicityNot monotonic
2024-03-13T20:47:39.458465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6 465
 
4.7%
8 452
 
4.5%
10 448
 
4.5%
9 401
 
4.0%
14 400
 
4.0%
12 398
 
4.0%
11 395
 
4.0%
13 389
 
3.9%
7 375
 
3.8%
15 344
 
3.4%
Other values (113) 5080
50.8%
(Missing) 853
 
8.5%
ValueCountFrequency (%)
0 14
 
0.1%
1 182
 
1.8%
2 211
2.1%
3 247
2.5%
4 263
2.6%
5 327
3.3%
6 465
4.7%
7 375
3.8%
8 452
4.5%
9 401
4.0%
ValueCountFrequency (%)
136 2
< 0.1%
132 1
< 0.1%
131 2
< 0.1%
130 1
< 0.1%
128 2
< 0.1%
127 1
< 0.1%
126 1
< 0.1%
124 1
< 0.1%
123 2
< 0.1%
121 1
< 0.1%

통합대기환경수치
Real number (ℝ)

MISSING 

Distinct226
Distinct (%)2.5%
Missing1112
Missing (%)11.1%
Infinite0
Infinite (%)0.0%
Mean73.208483
Minimum3
Maximum330
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T20:47:39.649135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile37
Q153
median64
Q378
95-th percentile140
Maximum330
Range327
Interquartile range (IQR)25

Descriptive statistics

Standard deviation42.427198
Coefficient of variation (CV)0.57953937
Kurtosis14.568013
Mean73.208483
Median Absolute Deviation (MAD)12
Skewness3.4675729
Sum650677
Variance1800.0671
MonotonicityNot monotonic
2024-03-13T20:47:39.874893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
58 264
 
2.6%
53 258
 
2.6%
59 255
 
2.5%
63 251
 
2.5%
56 247
 
2.5%
51 237
 
2.4%
68 231
 
2.3%
66 229
 
2.3%
64 216
 
2.2%
61 216
 
2.2%
Other values (216) 6484
64.8%
(Missing) 1112
 
11.1%
ValueCountFrequency (%)
3 1
 
< 0.1%
7 3
 
< 0.1%
10 2
 
< 0.1%
13 3
 
< 0.1%
17 5
 
0.1%
18 1
 
< 0.1%
20 16
0.2%
22 7
 
0.1%
23 26
0.3%
25 23
0.2%
ValueCountFrequency (%)
330 1
 
< 0.1%
328 1
 
< 0.1%
327 1
 
< 0.1%
326 2
< 0.1%
322 1
 
< 0.1%
320 3
< 0.1%
319 2
< 0.1%
318 1
 
< 0.1%
317 2
< 0.1%
316 4
< 0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2
6380 
1
1676 
<NA>
1112 
3
662 
4
 
170

Length

Max length4
Median length1
Mean length1.3336
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row2
3rd row2
4th row1
5th row2

Common Values

ValueCountFrequency (%)
2 6380
63.8%
1 1676
 
16.8%
<NA> 1112
 
11.1%
3 662
 
6.6%
4 170
 
1.7%

Length

2024-03-13T20:47:40.122039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:40.395272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 6380
63.8%
1 1676
 
16.8%
na 1112
 
11.1%
3 662
 
6.6%
4 170
 
1.7%

아황산가스지수
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
8868 
<NA>
1130 
2
 
2

Length

Max length4
Median length1
Mean length1.339
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 8868
88.7%
<NA> 1130
 
11.3%
2 2
 
< 0.1%

Length

2024-03-13T20:47:40.626856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:40.833489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 8868
88.7%
na 1130
 
11.3%
2 2
 
< 0.1%

일산화탄소지수
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
9380 
<NA>
 
616
2
 
4

Length

Max length4
Median length1
Mean length1.1848
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 9380
93.8%
<NA> 616
 
6.2%
2 4
 
< 0.1%

Length

2024-03-13T20:47:41.063314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:41.211212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 9380
93.8%
na 616
 
6.2%
2 4
 
< 0.1%

오존지수
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2
6155 
1
3163 
<NA>
 
600
3
 
82

Length

Max length4
Median length1
Mean length1.18
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row1
4th row1
5th row2

Common Values

ValueCountFrequency (%)
2 6155
61.6%
1 3163
31.6%
<NA> 600
 
6.0%
3 82
 
0.8%

Length

2024-03-13T20:47:41.379246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:41.539436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 6155
61.6%
1 3163
31.6%
na 600
 
6.0%
3 82
 
0.8%

이산화탄소지수
Categorical

IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
9137 
<NA>
 
584
2
 
269
3
 
10

Length

Max length4
Median length1
Mean length1.1752
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 9137
91.4%
<NA> 584
 
5.8%
2 269
 
2.7%
3 10
 
0.1%

Length

2024-03-13T20:47:41.702027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:41.848918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 9137
91.4%
na 584
 
5.8%
2 269
 
2.7%
3 10
 
0.1%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
5348 
2
3413 
<NA>
837 
3
 
336
4
 
66

Length

Max length4
Median length1
Mean length1.2511
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row1
4th row1
5th row2

Common Values

ValueCountFrequency (%)
1 5348
53.5%
2 3413
34.1%
<NA> 837
 
8.4%
3 336
 
3.4%
4 66
 
0.7%

Length

2024-03-13T20:47:41.980855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:42.160170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 5348
53.5%
2 3413
34.1%
na 837
 
8.4%
3 336
 
3.4%
4 66
 
0.7%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
5166 
2
3223 
<NA>
983 
3
571 
4
 
57

Length

Max length4
Median length1
Mean length1.2949
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row2
3rd row2
4th row1
5th row2

Common Values

ValueCountFrequency (%)
1 5166
51.7%
2 3223
32.2%
<NA> 983
 
9.8%
3 571
 
5.7%
4 57
 
0.6%

Length

2024-03-13T20:47:42.334417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:42.498801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 5166
51.7%
2 3223
32.2%
na 983
 
9.8%
3 571
 
5.7%
4 57
 
0.6%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
5481 
2
3236 
<NA>
750 
3
 
429
4
 
104

Length

Max length4
Median length1
Mean length1.225
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row1
4th row1
5th row2

Common Values

ValueCountFrequency (%)
1 5481
54.8%
2 3236
32.4%
<NA> 750
 
7.5%
3 429
 
4.3%
4 104
 
1.0%

Length

2024-03-13T20:47:42.710749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:42.855317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 5481
54.8%
2 3236
32.4%
na 750
 
7.5%
3 429
 
4.3%
4 104
 
1.0%
Distinct182
Distinct (%)2.0%
Missing837
Missing (%)8.4%
Infinite0
Infinite (%)0.0%
Mean33.673688
Minimum4
Maximum277
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T20:47:43.006236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile11
Q118
median27
Q341
95-th percentile76
Maximum277
Range273
Interquartile range (IQR)23

Descriptive statistics

Standard deviation24.696301
Coefficient of variation (CV)0.73340054
Kurtosis12.188582
Mean33.673688
Median Absolute Deviation (MAD)11
Skewness2.7649156
Sum308552
Variance609.90726
MonotonicityNot monotonic
2024-03-13T20:47:43.188627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15 308
 
3.1%
16 297
 
3.0%
19 293
 
2.9%
20 281
 
2.8%
21 278
 
2.8%
18 272
 
2.7%
24 267
 
2.7%
25 265
 
2.6%
17 257
 
2.6%
22 255
 
2.5%
Other values (172) 6390
63.9%
(Missing) 837
 
8.4%
ValueCountFrequency (%)
4 2
 
< 0.1%
5 19
 
0.2%
6 37
 
0.4%
7 54
 
0.5%
8 75
 
0.8%
9 97
1.0%
10 142
1.4%
11 175
1.8%
12 191
1.9%
13 224
2.2%
ValueCountFrequency (%)
277 1
< 0.1%
241 1
< 0.1%
220 1
< 0.1%
218 1
< 0.1%
217 1
< 0.1%
216 1
< 0.1%
215 2
< 0.1%
214 1
< 0.1%
213 1
< 0.1%
210 1
< 0.1%
Distinct105
Distinct (%)1.2%
Missing983
Missing (%)9.8%
Infinite0
Infinite (%)0.0%
Mean16.821892
Minimum1
Maximum120
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T20:47:43.750735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q19
median14
Q321
95-th percentile40
Maximum120
Range119
Interquartile range (IQR)12

Descriptive statistics

Standard deviation12.491772
Coefficient of variation (CV)0.74259018
Kurtosis10.187306
Mean16.821892
Median Absolute Deviation (MAD)6
Skewness2.5324878
Sum151683
Variance156.04436
MonotonicityNot monotonic
2024-03-13T20:47:43.922893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8 479
 
4.8%
9 468
 
4.7%
13 462
 
4.6%
7 456
 
4.6%
11 452
 
4.5%
12 447
 
4.5%
10 429
 
4.3%
6 392
 
3.9%
16 385
 
3.9%
14 383
 
3.8%
Other values (95) 4664
46.6%
(Missing) 983
 
9.8%
ValueCountFrequency (%)
1 5
 
0.1%
2 61
 
0.6%
3 145
 
1.5%
4 242
2.4%
5 363
3.6%
6 392
3.9%
7 456
4.6%
8 479
4.8%
9 468
4.7%
10 429
4.3%
ValueCountFrequency (%)
120 1
 
< 0.1%
111 1
 
< 0.1%
108 3
< 0.1%
107 1
 
< 0.1%
106 1
 
< 0.1%
104 2
< 0.1%
102 1
 
< 0.1%
101 3
< 0.1%
100 2
< 0.1%
99 2
< 0.1%

CO상태정보
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9384 
통신장애
 
329
점검및교정
 
157
장비점검
 
79
자료이상
 
51

Length

Max length5
Median length4
Mean length4.0157
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9384
93.8%
통신장애 329
 
3.3%
점검및교정 157
 
1.6%
장비점검 79
 
0.8%
자료이상 51
 
0.5%

Length

2024-03-13T20:47:44.096350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:44.218292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9384
93.8%
통신장애 329
 
3.3%
점검및교정 157
 
1.6%
장비점검 79
 
0.8%
자료이상 51
 
0.5%

미세먼지(PM2.5)상태정보
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9147 
자료이상
 
381
점검및교정
 
292
통신장애
 
112
장비점검
 
68

Length

Max length5
Median length4
Mean length4.0292
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자료이상
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9147
91.5%
자료이상 381
 
3.8%
점검및교정 292
 
2.9%
통신장애 112
 
1.1%
장비점검 68
 
0.7%

Length

2024-03-13T20:47:44.362784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:44.490845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9147
91.5%
자료이상 381
 
3.8%
점검및교정 292
 
2.9%
통신장애 112
 
1.1%
장비점검 68
 
0.7%

미세먼지(PM10)상태정보
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9250 
통신장애
 
329
점검및교정
 
284
자료이상
 
71
장비점검
 
66

Length

Max length5
Median length4
Mean length4.0284
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9250
92.5%
통신장애 329
 
3.3%
점검및교정 284
 
2.8%
자료이상 71
 
0.7%
장비점검 66
 
0.7%

Length

2024-03-13T20:47:44.634228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:44.790891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9250
92.5%
통신장애 329
 
3.3%
점검및교정 284
 
2.8%
자료이상 71
 
0.7%
장비점검 66
 
0.7%

이산화질소상태정보
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9416 
통신장애
 
329
점검및교정
 
157
장비점검
 
74
자료이상
 
24

Length

Max length5
Median length4
Mean length4.0157
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9416
94.2%
통신장애 329
 
3.3%
점검및교정 157
 
1.6%
장비점검 74
 
0.7%
자료이상 24
 
0.2%

Length

2024-03-13T20:47:44.975026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:45.089632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9416
94.2%
통신장애 329
 
3.3%
점검및교정 157
 
1.6%
장비점검 74
 
0.7%
자료이상 24
 
0.2%

오존상태정보
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9400 
통신장애
 
329
점검및교정
 
154
장비점검
 
94
자료이상
 
23

Length

Max length5
Median length4
Mean length4.0154
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9400
94.0%
통신장애 329
 
3.3%
점검및교정 154
 
1.5%
장비점검 94
 
0.9%
자료이상 23
 
0.2%

Length

2024-03-13T20:47:45.222079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:45.334181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9400
94.0%
통신장애 329
 
3.3%
점검및교정 154
 
1.5%
장비점검 94
 
0.9%
자료이상 23
 
0.2%

아황산가스상태정보
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8870 
자료이상
 
570
통신장애
 
329
점검및교정
 
157
장비점검
 
74

Length

Max length5
Median length4
Mean length4.0157
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8870
88.7%
자료이상 570
 
5.7%
통신장애 329
 
3.3%
점검및교정 157
 
1.6%
장비점검 74
 
0.7%

Length

2024-03-13T20:47:45.451953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T20:47:45.574652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8870
88.7%
자료이상 570
 
5.7%
통신장애 329
 
3.3%
점검및교정 157
 
1.6%
장비점검 74
 
0.7%

Sample

측정소코드기준일자시간코드망이름아황산가스농도일산화탄소농도오존농도이산화질소농도미세먼지(PM10)농도미세먼지(PM25)농도통합대기환경수치통합대기환경지수아황산가스지수일산화탄소지수오존지수이산화탄소지수미세먼지(PM10)24시간등급미세먼지(PM2.5)24시간등급미세먼지(PM10)1시간등급미세먼지(PM10)24시간예측이동농도미세먼지(PM2.5)24시간예측이동농도CO상태정보미세먼지(PM2.5)상태정보미세먼지(PM10)상태정보이산화질소상태정보오존상태정보아황산가스상태정보
59462충남_독곶리2021080710도시대기0.0040.20.0360.01715<NA><NA><NA>11211<NA>113<NA><NA>자료이상<NA><NA><NA><NA>
42337충남_주교면2021062416도시대기0.0040.60.0890.016422299211212223621<NA><NA><NA><NA><NA><NA>
53812충남_신방동202107235도시대기0.0040.20.0160.01271751211111212516<NA><NA><NA><NA><NA><NA>
51588충남_청양읍202107164도시대기0.0030.20.0070.0061672711111111166<NA><NA><NA><NA><NA><NA>
95632충남_대산항202111012항만0.0090.40.0580.012332695211212224533<NA><NA><NA><NA><NA><NA>
85395충남_동문동2021100712도시대기0.0030.30.0380.0069<NA><NA><NA>1121<NA><NA>1<NA><NA><NA>자료이상<NA><NA><NA><NA>
84562충남_성성동2021100217도로변대기0.0030.30.0620.012302077211211212919<NA><NA><NA><NA><NA><NA>
2410충남_격렬비열도2021030110국가배경농도<NA><NA><NA><NA><NA>4692<NA><NA><NA><NA><NA>2<NA><NA>23통신장애<NA>통신장애통신장애통신장애통신장애
14412충남_백석동2021040723도시대기0.0050.40.0350.023411165211212224517<NA><NA><NA><NA><NA><NA>
78충남_백석동202102262도시대기0.0040.60.0030.0427855139311122327546<NA><NA><NA><NA><NA><NA>
측정소코드기준일자시간코드망이름아황산가스농도일산화탄소농도오존농도이산화질소농도미세먼지(PM10)농도미세먼지(PM25)농도통합대기환경수치통합대기환경지수아황산가스지수일산화탄소지수오존지수이산화탄소지수미세먼지(PM10)24시간등급미세먼지(PM2.5)24시간등급미세먼지(PM10)1시간등급미세먼지(PM10)24시간예측이동농도미세먼지(PM2.5)24시간예측이동농도CO상태정보미세먼지(PM2.5)상태정보미세먼지(PM10)상태정보이산화질소상태정보오존상태정보아황산가스상태정보
26237충남_공주202105115도시대기0.0040.20.0290.0071611<NA><NA>11111<NA>114<NA><NA><NA><NA><NA><NA><NA>
28275충남_금산읍2021051717도시대기<NA>0.30.0260.009105431<NA>111111116<NA><NA><NA><NA><NA>자료이상
88069충남_평택당진항2021101122항만0.0010.20.0210.014221345111111112713<NA><NA><NA><NA><NA><NA>
48370충남_엄사면202107095도시대기0.0030.50.0260.003924311111111105<NA><NA><NA><NA><NA><NA>
55160충남_예산군2021072610도시대기0.0050.30.040.005141158211211111812<NA><NA><NA><NA><NA><NA>
51904충남_예산군2021071611도시대기0.0060.40.0450.007321963211211122515<NA><NA><NA><NA><NA><NA>
34197충남_인주면2021052820도시대기0.0040.40.0750.006522788211212224423<NA><NA><NA><NA><NA><NA>
17713충남_금산읍202104162도시대기<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>자료이상자료이상자료이상자료이상자료이상자료이상
89027충남_공주2021101419도시대기0.0020.30.0540.00613117021121111168<NA><NA><NA><NA><NA><NA>
47961충남_서면2021070720도시대기0.0030.20.0240.015634011111111158<NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

측정소코드기준일자시간코드망이름아황산가스농도일산화탄소농도오존농도이산화질소농도미세먼지(PM10)농도미세먼지(PM25)농도통합대기환경수치통합대기환경지수아황산가스지수일산화탄소지수오존지수이산화탄소지수미세먼지(PM10)24시간등급미세먼지(PM2.5)24시간등급미세먼지(PM10)1시간등급미세먼지(PM10)24시간예측이동농도미세먼지(PM2.5)24시간예측이동농도CO상태정보미세먼지(PM2.5)상태정보미세먼지(PM10)상태정보이산화질소상태정보오존상태정보아황산가스상태정보# duplicates
16충남_백석동2021092716도시대기<NA><NA><NA><NA>22<NA><NA><NA><NA><NA><NA><NA>1<NA>117<NA>점검및교정점검및교정<NA>점검및교정점검및교정점검및교정4
33충남_홍성읍2021052823도시대기0.0030.40.0660.005533080211212224827<NA><NA><NA><NA><NA><NA>4
1충남_공주202105281도시대기0.0030.50.0440.005693979211212225927<NA><NA><NA><NA><NA><NA>3
3충남_금산읍202105281도시대기0.0030.30.0290.005462377211112225226<NA><NA><NA><NA><NA><NA>3
8충남_대산리2021052823도시대기0.0030.40.0650.012521779211212123715<NA><NA><NA><NA><NA><NA>3
11충남_독곶리2021052823도시대기0.0040.50.0660.00931980211212123415<NA><NA><NA><NA><NA><NA>3
18충남_서면202105281도시대기0.0030.50.0440.008651192211212227217<NA><NA><NA><NA><NA><NA>3
22충남_성성동2021052823도로변대기0.0040.50.0680.013523482211212224626<NA><NA><NA><NA><NA><NA>3
25충남_엄사면2021061616도시대기<NA>0.40.0360.008126552<NA>121111115<NA><NA><NA><NA><NA>자료이상3
27충남_이원면2021052823도시대기0.0030.40.0540.006361270211212224116<NA><NA><NA><NA><NA><NA>3