Overview

Dataset statistics

Number of variables13
Number of observations10000
Missing cells3523
Missing cells (%)2.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 MiB
Average record size in memory122.0 B

Variable types

Numeric9
Categorical3
Text1

Dataset

Description경기도 대기환경정보 월평균자료
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=F32ZDZ5AAZWMDY363OMC33625778&infSeq=1

Alerts

측정망구분 is highly overall correlated with 측정망코드High correlation
측정망코드 is highly overall correlated with 측정망구분High correlation
도시코드 is highly overall correlated with 측정소코드 and 1 other fieldsHigh correlation
측정소코드 is highly overall correlated with 도시코드 and 1 other fieldsHigh correlation
측정날짜 is highly overall correlated with SO2(ppm)High correlation
SO2(ppm) is highly overall correlated with 측정날짜High correlation
NO2(ppm) is highly overall correlated with CO(ppm) and 3 other fieldsHigh correlation
CO(ppm) is highly overall correlated with NO2(ppm) and 2 other fieldsHigh correlation
O3(ppm) is highly overall correlated with NO2(ppm) and 1 other fieldsHigh correlation
PM10(㎍/㎥) is highly overall correlated with NO2(ppm) and 1 other fieldsHigh correlation
PM25(㎍/㎥) is highly overall correlated with NO2(ppm) and 2 other fieldsHigh correlation
도시명 is highly overall correlated with 도시코드 and 1 other fieldsHigh correlation
측정망코드 is highly imbalanced (57.0%)Imbalance
측정망구분 is highly imbalanced (57.0%)Imbalance
SO2(ppm) has 202 (2.0%) missing valuesMissing
NO2(ppm) has 246 (2.5%) missing valuesMissing
CO(ppm) has 279 (2.8%) missing valuesMissing
O3(ppm) has 422 (4.2%) missing valuesMissing
PM10(㎍/㎥) has 373 (3.7%) missing valuesMissing
PM25(㎍/㎥) has 2001 (20.0%) missing valuesMissing

Reproduction

Analysis started2023-12-10 22:32:07.756063
Analysis finished2023-12-10 22:32:19.080125
Duration11.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

도시코드
Real number (ℝ)

HIGH CORRELATION 

Distinct31
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.947
Minimum1
Maximum31
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:32:19.147240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q15
median12
Q320
95-th percentile28
Maximum31
Range30
Interquartile range (IQR)15

Descriptive statistics

Standard deviation8.7592045
Coefficient of variation (CV)0.67654318
Kurtosis-1.1177058
Mean12.947
Median Absolute Deviation (MAD)7
Skewness0.29892949
Sum129470
Variance76.723663
MonotonicityNot monotonic
2023-12-11T07:32:19.255987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
9 820
 
8.2%
2 813
 
8.1%
1 773
 
7.7%
19 568
 
5.7%
24 541
 
5.4%
5 513
 
5.1%
10 455
 
4.5%
15 446
 
4.5%
7 437
 
4.4%
13 430
 
4.3%
Other values (21) 4204
42.0%
ValueCountFrequency (%)
1 773
7.7%
2 813
8.1%
3 241
 
2.4%
4 410
4.1%
5 513
5.1%
6 206
 
2.1%
7 437
4.4%
8 104
 
1.0%
9 820
8.2%
10 455
4.5%
ValueCountFrequency (%)
31 140
 
1.4%
30 143
 
1.4%
29 117
 
1.2%
28 177
 
1.8%
27 172
 
1.7%
26 171
 
1.7%
25 178
 
1.8%
24 541
5.4%
23 394
3.9%
22 187
 
1.9%

도시명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
안산시
820 
성남시
813 
수원시
773 
용인시
 
568
화성시
 
541
Other values (26)
6485 

Length

Max length4
Median length3
Mean length3.0775
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의왕시
2nd row안양시
3rd row파주시
4th row남양주시
5th row고양시

Common Values

ValueCountFrequency (%)
안산시 820
 
8.2%
성남시 813
 
8.1%
수원시 773
 
7.7%
용인시 568
 
5.7%
화성시 541
 
5.4%
부천시 513
 
5.1%
고양시 455
 
4.5%
시흥시 446
 
4.5%
평택시 437
 
4.4%
남양주시 430
 
4.3%
Other values (21) 4204
42.0%

Length

2023-12-11T07:32:19.566565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
안산시 820
 
8.2%
성남시 813
 
8.1%
수원시 773
 
7.7%
용인시 568
 
5.7%
화성시 541
 
5.4%
부천시 513
 
5.1%
고양시 455
 
4.5%
시흥시 446
 
4.5%
평택시 437
 
4.4%
남양주시 430
 
4.3%
Other values (21) 4204
42.0%

측정소코드
Real number (ℝ)

HIGH CORRELATION 

Distinct122
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean167223.72
Minimum131111
Maximum831155
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:32:19.664094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum131111
5-th percentile131115
Q1131191
median131343
Q3131501
95-th percentile831151
Maximum831155
Range700044
Interquartile range (IQR)310

Descriptive statistics

Standard deviation154396.7
Coefficient of variation (CV)0.92329427
Kurtosis14.555089
Mean167223.72
Median Absolute Deviation (MAD)158
Skewness4.068434
Sum1.6722372 × 109
Variance2.3838341 × 1010
MonotonicityNot monotonic
2023-12-11T07:32:19.789072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
831155 104
 
1.0%
131553 104
 
1.0%
131211 104
 
1.0%
131128 104
 
1.0%
131581 104
 
1.0%
131471 104
 
1.0%
131232 104
 
1.0%
131129 104
 
1.0%
131561 104
 
1.0%
131116 104
 
1.0%
Other values (112) 8960
89.6%
ValueCountFrequency (%)
131111 102
1.0%
131112 102
1.0%
131113 104
1.0%
131114 102
1.0%
131115 102
1.0%
131116 104
1.0%
131117 103
1.0%
131118 35
 
0.4%
131119 19
 
0.2%
131120 100
1.0%
ValueCountFrequency (%)
831155 104
1.0%
831154 102
1.0%
831153 101
1.0%
831152 103
1.0%
831151 103
1.0%
131622 37
 
0.4%
131621 103
1.0%
131612 40
 
0.4%
131611 103
1.0%
131602 14
 
0.1%
Distinct122
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:32:20.101358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.0991
Min length2

Characters and Unicode

Total characters30991
Distinct characters127
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고천동
2nd row부림동
3rd row운정
4th row금곡동
5th row행신동
ValueCountFrequency (%)
송내대로 104
 
1.0%
당동 104
 
1.0%
광교동 104
 
1.0%
과천동 104
 
1.0%
경수대로 104
 
1.0%
백석읍 104
 
1.0%
상대원동 104
 
1.0%
사우동 104
 
1.0%
봉산동 104
 
1.0%
운중동 104
 
1.0%
Other values (112) 8960
89.6%
2023-12-11T07:32:20.631875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6680
 
21.6%
1204
 
3.9%
1068
 
3.4%
880
 
2.8%
795
 
2.6%
771
 
2.5%
693
 
2.2%
612
 
2.0%
517
 
1.7%
514
 
1.7%
Other values (117) 17257
55.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30402
98.1%
Decimal Number 589
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6680
22.0%
1204
 
4.0%
1068
 
3.5%
880
 
2.9%
795
 
2.6%
771
 
2.5%
693
 
2.3%
612
 
2.0%
517
 
1.7%
514
 
1.7%
Other values (113) 16668
54.8%
Decimal Number
ValueCountFrequency (%)
2 202
34.3%
1 147
25.0%
3 137
23.3%
8 103
17.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 30402
98.1%
Common 589
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6680
22.0%
1204
 
4.0%
1068
 
3.5%
880
 
2.9%
795
 
2.6%
771
 
2.5%
693
 
2.3%
612
 
2.0%
517
 
1.7%
514
 
1.7%
Other values (113) 16668
54.8%
Common
ValueCountFrequency (%)
2 202
34.3%
1 147
25.0%
3 137
23.3%
8 103
17.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30402
98.1%
ASCII 589
 
1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6680
22.0%
1204
 
4.0%
1068
 
3.5%
880
 
2.9%
795
 
2.6%
771
 
2.5%
693
 
2.3%
612
 
2.0%
517
 
1.7%
514
 
1.7%
Other values (113) 16668
54.8%
ASCII
ValueCountFrequency (%)
2 202
34.3%
1 147
25.0%
3 137
23.3%
8 103
17.5%

측정망코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
9120 
2
 
880

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 9120
91.2%
2 880
 
8.8%

Length

2023-12-11T07:32:20.787297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:32:20.923076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 9120
91.2%
2 880
 
8.8%

측정망구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
도시대기
9120 
도로변
 
880

Length

Max length4
Median length4
Mean length3.912
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row도시대기
2nd row도시대기
3rd row도시대기
4th row도시대기
5th row도시대기

Common Values

ValueCountFrequency (%)
도시대기 9120
91.2%
도로변 880
 
8.8%

Length

2023-12-11T07:32:21.071618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:32:21.187959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
도시대기 9120
91.2%
도로변 880
 
8.8%

측정날짜
Real number (ℝ)

HIGH CORRELATION 

Distinct104
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean201934.05
Minimum201501
Maximum202308
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:32:21.299469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum201501
5-th percentile201507
Q1201708
median202001
Q3202112
95-th percentile202304
Maximum202308
Range807
Interquartile range (IQR)404

Descriptive statistics

Standard deviation250.0281
Coefficient of variation (CV)0.0012381672
Kurtosis-1.1514134
Mean201934.05
Median Absolute Deviation (MAD)201
Skewness-0.22016933
Sum2.0193405 × 109
Variance62514.053
MonotonicityNot monotonic
2023-12-11T07:32:21.449986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
202306 121
 
1.2%
202210 121
 
1.2%
202211 121
 
1.2%
202308 120
 
1.2%
202201 120
 
1.2%
202307 120
 
1.2%
202212 120
 
1.2%
202303 120
 
1.2%
202202 120
 
1.2%
202304 120
 
1.2%
Other values (94) 8797
88.0%
ValueCountFrequency (%)
201501 77
0.8%
201502 75
0.8%
201503 78
0.8%
201504 78
0.8%
201505 79
0.8%
201506 78
0.8%
201507 79
0.8%
201508 78
0.8%
201509 79
0.8%
201510 79
0.8%
ValueCountFrequency (%)
202308 120
1.2%
202307 120
1.2%
202306 121
1.2%
202305 118
1.2%
202304 120
1.2%
202303 120
1.2%
202302 119
1.2%
202301 118
1.2%
202212 120
1.2%
202211 121
1.2%

SO2(ppm)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct51
Distinct (%)0.5%
Missing202
Missing (%)2.0%
Infinite0
Infinite (%)0.0%
Mean0.0035000816
Minimum0.0008
Maximum0.015
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:32:21.595851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.0008
5-th percentile0.002
Q10.0022
median0.003
Q30.004
95-th percentile0.006
Maximum0.015
Range0.0142
Interquartile range (IQR)0.0018

Descriptive statistics

Standard deviation0.0015130388
Coefficient of variation (CV)0.43228672
Kurtosis4.1088134
Mean0.0035000816
Median Absolute Deviation (MAD)0.001
Skewness1.3976902
Sum34.2938
Variance2.2892865 × 10-6
MonotonicityNot monotonic
2023-12-11T07:32:21.769780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.003 2709
27.1%
0.004 2080
20.8%
0.002 1939
19.4%
0.005 1146
11.5%
0.006 508
 
5.1%
0.007 219
 
2.2%
0.001 177
 
1.8%
0.008 76
 
0.8%
0.0025 62
 
0.6%
0.0024 53
 
0.5%
Other values (41) 829
 
8.3%
(Missing) 202
 
2.0%
ValueCountFrequency (%)
0.0008 2
 
< 0.1%
0.0009 8
 
0.1%
0.001 177
1.8%
0.0011 19
 
0.2%
0.0012 25
 
0.2%
0.0013 29
 
0.3%
0.0014 33
 
0.3%
0.0015 30
 
0.3%
0.0016 30
 
0.3%
0.0017 45
 
0.4%
ValueCountFrequency (%)
0.015 2
 
< 0.1%
0.014 3
 
< 0.1%
0.013 4
 
< 0.1%
0.012 7
 
0.1%
0.011 14
 
0.1%
0.01 19
 
0.2%
0.009 37
 
0.4%
0.008 76
 
0.8%
0.007 219
2.2%
0.006 508
5.1%

NO2(ppm)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct337
Distinct (%)3.5%
Missing246
Missing (%)2.5%
Infinite0
Infinite (%)0.0%
Mean0.023250851
Minimum0.0017
Maximum0.067
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:32:21.928292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.0017
5-th percentile0.008
Q10.015
median0.023
Q30.0302
95-th percentile0.041
Maximum0.067
Range0.0653
Interquartile range (IQR)0.0152

Descriptive statistics

Standard deviation0.010353247
Coefficient of variation (CV)0.44528465
Kurtosis-0.32714194
Mean0.023250851
Median Absolute Deviation (MAD)0.008
Skewness0.41187943
Sum226.7888
Variance0.00010718972
MonotonicityNot monotonic
2023-12-11T07:32:22.092857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.017 315
 
3.1%
0.023 307
 
3.1%
0.019 306
 
3.1%
0.028 305
 
3.0%
0.022 303
 
3.0%
0.016 297
 
3.0%
0.015 294
 
2.9%
0.013 293
 
2.9%
0.029 290
 
2.9%
0.024 289
 
2.9%
Other values (327) 6755
67.5%
ValueCountFrequency (%)
0.0017 1
 
< 0.1%
0.0027 1
 
< 0.1%
0.0029 1
 
< 0.1%
0.003 3
 
< 0.1%
0.0033 1
 
< 0.1%
0.0035 2
 
< 0.1%
0.0037 1
 
< 0.1%
0.0039 2
 
< 0.1%
0.004 19
0.2%
0.0041 2
 
< 0.1%
ValueCountFrequency (%)
0.067 1
 
< 0.1%
0.066 1
 
< 0.1%
0.063 1
 
< 0.1%
0.062 1
 
< 0.1%
0.06 2
 
< 0.1%
0.059 6
0.1%
0.058 3
< 0.1%
0.057 2
 
< 0.1%
0.056 3
< 0.1%
0.055 6
0.1%

CO(ppm)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct57
Distinct (%)0.6%
Missing279
Missing (%)2.8%
Infinite0
Infinite (%)0.0%
Mean0.47611563
Minimum0.1
Maximum1.1
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:32:22.274403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.1
5-th percentile0.3
Q10.4
median0.5
Q30.6
95-th percentile0.7
Maximum1.1
Range1
Interquartile range (IQR)0.2

Descriptive statistics

Standard deviation0.1434032
Coefficient of variation (CV)0.30119406
Kurtosis0.24663967
Mean0.47611563
Median Absolute Deviation (MAD)0.1
Skewness0.54898019
Sum4628.32
Variance0.020564478
MonotonicityNot monotonic
2023-12-11T07:32:22.415827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.4 2361
23.6%
0.5 2335
23.4%
0.6 1490
14.9%
0.3 1357
13.6%
0.7 717
 
7.2%
0.8 296
 
3.0%
0.2 214
 
2.1%
0.9 84
 
0.8%
0.33 34
 
0.3%
0.36 34
 
0.3%
Other values (47) 799
 
8.0%
(Missing) 279
 
2.8%
ValueCountFrequency (%)
0.1 1
 
< 0.1%
0.19 2
 
< 0.1%
0.2 214
2.1%
0.21 2
 
< 0.1%
0.22 7
 
0.1%
0.23 14
 
0.1%
0.24 17
 
0.2%
0.25 28
 
0.3%
0.26 27
 
0.3%
0.27 31
 
0.3%
ValueCountFrequency (%)
1.1 5
 
0.1%
1.0 17
 
0.2%
0.9 84
 
0.8%
0.8 296
3.0%
0.73 1
 
< 0.1%
0.7 717
7.2%
0.68 2
 
< 0.1%
0.67 2
 
< 0.1%
0.66 6
 
0.1%
0.65 3
 
< 0.1%

O3(ppm)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct386
Distinct (%)4.0%
Missing422
Missing (%)4.2%
Infinite0
Infinite (%)0.0%
Mean0.02716351
Minimum0.005
Maximum0.063
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:32:22.554625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.005
5-th percentile0.011
Q10.018
median0.027
Q30.035
95-th percentile0.046
Maximum0.063
Range0.058
Interquartile range (IQR)0.017

Descriptive statistics

Standard deviation0.010998488
Coefficient of variation (CV)0.40489937
Kurtosis-0.75792883
Mean0.02716351
Median Absolute Deviation (MAD)0.009
Skewness0.26190028
Sum260.1721
Variance0.00012096674
MonotonicityNot monotonic
2023-12-11T07:32:22.693905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.015 306
 
3.1%
0.016 289
 
2.9%
0.029 287
 
2.9%
0.017 286
 
2.9%
0.014 284
 
2.8%
0.018 275
 
2.8%
0.028 274
 
2.7%
0.031 272
 
2.7%
0.022 271
 
2.7%
0.026 269
 
2.7%
Other values (376) 6765
67.7%
(Missing) 422
 
4.2%
ValueCountFrequency (%)
0.005 4
 
< 0.1%
0.006 11
 
0.1%
0.007 36
 
0.4%
0.008 54
 
0.5%
0.0086 1
 
< 0.1%
0.009 107
1.1%
0.01 159
1.6%
0.0107 1
 
< 0.1%
0.011 204
2.0%
0.0112 1
 
< 0.1%
ValueCountFrequency (%)
0.063 1
 
< 0.1%
0.061 1
 
< 0.1%
0.06 3
< 0.1%
0.058 5
0.1%
0.057 4
< 0.1%
0.0567 1
 
< 0.1%
0.056 5
0.1%
0.0558 1
 
< 0.1%
0.0557 1
 
< 0.1%
0.0556 1
 
< 0.1%

PM10(㎍/㎥)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct101
Distinct (%)1.0%
Missing373
Missing (%)3.7%
Infinite0
Infinite (%)0.0%
Mean44.672276
Minimum6
Maximum115
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:32:22.900523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile21
Q132
median44
Q354
95-th percentile75
Maximum115
Range109
Interquartile range (IQR)22

Descriptive statistics

Standard deviation16.230033
Coefficient of variation (CV)0.36331333
Kurtosis-0.010024726
Mean44.672276
Median Absolute Deviation (MAD)11
Skewness0.50812712
Sum430060
Variance263.41399
MonotonicityNot monotonic
2023-12-11T07:32:23.055463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
46 271
 
2.7%
44 267
 
2.7%
43 264
 
2.6%
47 261
 
2.6%
49 261
 
2.6%
41 260
 
2.6%
45 255
 
2.5%
42 244
 
2.4%
39 234
 
2.3%
40 225
 
2.2%
Other values (91) 7085
70.9%
(Missing) 373
 
3.7%
ValueCountFrequency (%)
6 1
 
< 0.1%
8 1
 
< 0.1%
10 1
 
< 0.1%
11 3
 
< 0.1%
12 4
 
< 0.1%
13 9
 
0.1%
14 17
 
0.2%
15 17
 
0.2%
16 33
0.3%
17 53
0.5%
ValueCountFrequency (%)
115 1
 
< 0.1%
111 1
 
< 0.1%
109 1
 
< 0.1%
108 1
 
< 0.1%
106 1
 
< 0.1%
103 2
< 0.1%
102 2
< 0.1%
101 2
< 0.1%
100 3
< 0.1%
99 1
 
< 0.1%

PM25(㎍/㎥)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct55
Distinct (%)0.7%
Missing2001
Missing (%)20.0%
Infinite0
Infinite (%)0.0%
Mean22.810226
Minimum3
Maximum62
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:32:23.221384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile10
Q116
median22
Q328
95-th percentile39
Maximum62
Range59
Interquartile range (IQR)12

Descriptive statistics

Standard deviation8.9743315
Coefficient of variation (CV)0.39343457
Kurtosis0.12020145
Mean22.810226
Median Absolute Deviation (MAD)6
Skewness0.56941642
Sum182459
Variance80.538625
MonotonicityNot monotonic
2023-12-11T07:32:23.386090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
23 341
 
3.4%
17 340
 
3.4%
18 339
 
3.4%
16 338
 
3.4%
20 329
 
3.3%
19 325
 
3.2%
24 322
 
3.2%
25 317
 
3.2%
22 316
 
3.2%
26 311
 
3.1%
Other values (45) 4721
47.2%
(Missing) 2001
20.0%
ValueCountFrequency (%)
3 3
 
< 0.1%
4 4
 
< 0.1%
5 15
 
0.1%
6 28
 
0.3%
7 43
 
0.4%
8 81
 
0.8%
9 126
1.3%
10 184
1.8%
11 219
2.2%
12 270
2.7%
ValueCountFrequency (%)
62 1
 
< 0.1%
57 1
 
< 0.1%
56 3
 
< 0.1%
54 7
 
0.1%
53 7
 
0.1%
52 8
 
0.1%
51 9
0.1%
50 10
0.1%
49 20
0.2%
48 16
0.2%

Interactions

2023-12-11T07:32:17.983896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:10.595385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:11.647212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:12.676865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:13.575061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:14.649527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.450617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:16.291239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:17.166546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:18.066486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:10.699656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:11.758923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:12.788661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:13.663457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:14.733449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.543627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:16.380372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:17.279602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:18.158975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:10.809519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:11.860579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:12.909194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:13.769104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:14.831570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.635172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:16.491541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:17.372703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:18.248591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:10.942298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:11.977206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:13.015118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:13.866697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:14.930241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.722886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:16.588076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:17.462216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:18.335572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:11.068715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:12.080089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:13.107906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:13.955600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.025609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.812373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:16.694169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:17.549907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:18.414611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:11.198627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:12.187183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:13.213457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:14.060053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.127121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.916276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:16.775758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:17.633686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:18.489959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:11.315180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:12.311699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:13.306258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:14.147417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.206805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.996585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:16.855561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:17.710916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:18.567027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:11.431752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:12.436497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:13.401706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:14.234757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.286099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:16.081651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:16.953406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:17.796199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:18.639967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:11.539971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:12.536717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:13.489991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:14.568958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:15.369185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:16.183129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:17.062565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:32:17.905335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:32:23.515096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도시코드도시명측정소코드측정망코드측정망구분측정날짜SO2(ppm)NO2(ppm)CO(ppm)O3(ppm)PM10(㎍/㎥)PM25(㎍/㎥)
도시코드1.0001.0000.8180.2490.2490.1360.3150.5080.3370.2120.1570.147
도시명1.0001.0001.0000.3880.3880.0930.4020.5240.3920.2250.1960.190
측정소코드0.8181.0001.0000.1440.1440.0400.2130.1790.0970.0580.0950.080
측정망코드0.2490.3880.1441.0001.0000.0000.0600.4920.2780.2050.0740.060
측정망구분0.2490.3880.1441.0001.0000.0000.0600.4920.2780.2050.0740.060
측정날짜0.1360.0930.0400.0000.0001.0000.6200.4380.4840.3240.5220.542
SO2(ppm)0.3150.4020.2130.0600.0600.6201.0000.4900.4630.2740.5350.373
NO2(ppm)0.5080.5240.1790.4920.4920.4380.4901.0000.7370.6400.6030.577
CO(ppm)0.3370.3920.0970.2780.2780.4840.4630.7371.0000.6260.4970.533
O3(ppm)0.2120.2250.0580.2050.2050.3240.2740.6400.6261.0000.4230.489
PM10(㎍/㎥)0.1570.1960.0950.0740.0740.5220.5350.6030.4970.4231.0000.838
PM25(㎍/㎥)0.1470.1900.0800.0600.0600.5420.3730.5770.5330.4890.8381.000
2023-12-11T07:32:23.688398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도시명측정망구분측정망코드
도시명1.0000.3300.330
측정망구분0.3301.0000.999
측정망코드0.3300.9991.000
2023-12-11T07:32:23.799067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도시코드측정소코드측정날짜SO2(ppm)NO2(ppm)CO(ppm)O3(ppm)PM10(㎍/㎥)PM25(㎍/㎥)도시명측정망코드측정망구분
도시코드1.0000.7390.111-0.114-0.409-0.2500.126-0.0420.0050.9990.1840.184
측정소코드0.7391.0000.088-0.008-0.350-0.2530.1030.0020.0270.9990.0930.093
측정날짜0.1110.0881.000-0.678-0.417-0.3700.205-0.331-0.2760.0330.0000.000
SO2(ppm)-0.114-0.008-0.6781.0000.4830.409-0.2520.4970.4710.1530.0460.046
NO2(ppm)-0.409-0.350-0.4170.4831.0000.727-0.6320.5700.6080.2130.3780.378
CO(ppm)-0.250-0.253-0.3700.4090.7271.000-0.5920.4790.5490.1490.2190.219
O3(ppm)0.1260.1030.205-0.252-0.632-0.5921.000-0.217-0.3540.0810.1570.157
PM10(㎍/㎥)-0.0420.002-0.3310.4970.5700.479-0.2171.0000.8530.0700.0570.057
PM25(㎍/㎥)0.0050.027-0.2760.4710.6080.549-0.3540.8531.0000.0680.0460.046
도시명0.9990.9990.0330.1530.2130.1490.0810.0700.0681.0000.3300.330
측정망코드0.1840.0930.0000.0460.3780.2190.1570.0570.0460.3301.0000.999
측정망구분0.1840.0930.0000.0460.3780.2190.1570.0570.0460.3300.9991.000

Missing values

2023-12-11T07:32:18.752996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:32:18.893833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:32:19.011386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

도시코드도시명측정소코드측정소명측정망코드측정망구분측정날짜SO2(ppm)NO2(ppm)CO(ppm)O3(ppm)PM10(㎍/㎥)PM25(㎍/㎥)
778117의왕시131223고천동1도시대기2019110.0030.0310.50.0144021
62114안양시131142부림동1도시대기2020110.0030.0420.60.0154229
874220파주시131372운정1도시대기2022070.0020.0090.40.0352617
206013남양주시131241금곡동1도시대기2016020.0040.0310.50.01744<NA>
47910고양시131381행신동1도시대기2020100.0040.0210.40.033617
741619용인시131413기흥1도시대기2022070.0020.0120.40.0382314
118912구리시131212동구동1도시대기2019120.0030.0280.50.0115133
162225광주시131394오포1동1도시대기2022110.0020.0250.60.0154423
820919용인시131416이동읍1도시대기2022090.0020.0110.40.0292210
36082성남시131124수내동1도시대기2016110.0040.0380.60.0125122
도시코드도시명측정소코드측정소명측정망코드측정망구분측정날짜SO2(ppm)NO2(ppm)CO(ppm)O3(ppm)PM10(㎍/㎥)PM25(㎍/㎥)
89997평택시131343평택항1도시대기2018040.0070.0250.50.0327437
641926양주시131562고읍1도시대기2021020.0040.0210.50.0254827
57499안산시131198중앙대로2도로변2015050.0060.0370.5<NA>48<NA>
31962성남시131120대왕판교로2도로변2017030.0060.0580.8<NA>5832
493315시흥시131232시화산단1도시대기2020020.0040.0330.50.0245124
923918하남시131542미사1도시대기2020120.0030.0320.60.0144625
848921이천시131442창전동1도시대기2018070.0030.0120.30.0312816
31102성남시131121단대동1도시대기2018090.0030.0170.40.032110
789717의왕시131222부곡3동1도시대기2020110.0030.0350.50.0174026
51139안산시131196대부동1도시대기2020030.0030.0120.40.044321