Overview

Dataset statistics

Number of variables7
Number of observations65
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.8 KiB
Average record size in memory60.0 B

Variable types

DateTime1
Numeric2
Categorical4

Dataset

Description인천광역시 부평구 대기오염측정망 공개 데이터입니다. (월별,미세먼지,초미세먼지,아황산가스,이산화질소,오존,일산화탄소) ex) 2021-01-17,19,13,0.024,0.018,0.5,0.004
URLhttps://www.data.go.kr/data/3045130/fileData.do

Alerts

미세먼지(마이크로그램당 세제곱미터) is highly overall correlated with 초미세먼지(마이크로그램당 세제곱미터)High correlation
초미세먼지(마이크로그램당 세제곱미터) is highly overall correlated with 미세먼지(마이크로그램당 세제곱미터)High correlation
아황산가스 is highly overall correlated with 이산화질소 and 1 other fieldsHigh correlation
이산화질소 is highly overall correlated with 아황산가스 and 1 other fieldsHigh correlation
일산화탄소 is highly overall correlated with 아황산가스 and 1 other fieldsHigh correlation
년월별 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:07:27.883003
Analysis finished2023-12-12 21:07:28.700668
Duration0.82 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년월별
Date

UNIQUE 

Distinct65
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size652.0 B
Minimum2020-01-17 00:00:00
Maximum2023-05-19 00:00:00
2023-12-13T06:07:28.762731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:07:28.888439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

미세먼지(마이크로그램당 세제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct36
Distinct (%)55.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39.307692
Minimum15
Maximum74
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size717.0 B
2023-12-13T06:07:29.014640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile21.2
Q130
median40
Q345
95-th percentile64
Maximum74
Range59
Interquartile range (IQR)15

Descriptive statistics

Standard deviation12.72046
Coefficient of variation (CV)0.32361248
Kurtosis0.13631122
Mean39.307692
Median Absolute Deviation (MAD)9
Skewness0.49450127
Sum2555
Variance161.8101
MonotonicityNot monotonic
2023-12-13T06:07:29.163079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
40 7
 
10.8%
41 4
 
6.2%
42 4
 
6.2%
30 3
 
4.6%
22 3
 
4.6%
44 2
 
3.1%
28 2
 
3.1%
64 2
 
3.1%
43 2
 
3.1%
48 2
 
3.1%
Other values (26) 34
52.3%
ValueCountFrequency (%)
15 1
 
1.5%
20 1
 
1.5%
21 2
3.1%
22 3
4.6%
23 1
 
1.5%
25 2
3.1%
26 2
3.1%
27 1
 
1.5%
28 2
3.1%
29 1
 
1.5%
ValueCountFrequency (%)
74 1
1.5%
69 1
1.5%
65 1
1.5%
64 2
3.1%
57 1
1.5%
54 1
1.5%
53 2
3.1%
52 1
1.5%
51 1
1.5%
50 2
3.1%

초미세먼지(마이크로그램당 세제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct27
Distinct (%)41.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.384615
Minimum8
Maximum38
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size717.0 B
2023-12-13T06:07:29.271263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8
5-th percentile11.2
Q117
median22
Q327
95-th percentile36.8
Maximum38
Range30
Interquartile range (IQR)10

Descriptive statistics

Standard deviation7.4071172
Coefficient of variation (CV)0.33090214
Kurtosis-0.46670807
Mean22.384615
Median Absolute Deviation (MAD)5
Skewness0.2207035
Sum1455
Variance54.865385
MonotonicityNot monotonic
2023-12-13T06:07:29.380213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
27 7
 
10.8%
22 6
 
9.2%
17 5
 
7.7%
25 4
 
6.2%
24 4
 
6.2%
13 4
 
6.2%
31 4
 
6.2%
18 3
 
4.6%
19 3
 
4.6%
23 2
 
3.1%
Other values (17) 23
35.4%
ValueCountFrequency (%)
8 1
 
1.5%
9 1
 
1.5%
10 1
 
1.5%
11 1
 
1.5%
12 1
 
1.5%
13 4
6.2%
14 2
 
3.1%
15 1
 
1.5%
16 2
 
3.1%
17 5
7.7%
ValueCountFrequency (%)
38 2
 
3.1%
37 2
 
3.1%
36 1
 
1.5%
32 2
 
3.1%
31 4
6.2%
29 1
 
1.5%
28 1
 
1.5%
27 7
10.8%
26 1
 
1.5%
25 4
6.2%

아황산가스
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)15.4%
Missing0
Missing (%)0.0%
Memory size652.0 B
4
22 
3
19 
5
13 
8
6
 
2
Other values (5)

Length

Max length5
Median length1
Mean length1.1692308
Min length1

Unique

Unique5 ?
Unique (%)7.7%

Sample

1st row7ppb
2nd row8
3rd row8
4th row8
5th row8

Common Values

ValueCountFrequency (%)
4 22
33.8%
3 19
29.2%
5 13
20.0%
8 4
 
6.2%
6 2
 
3.1%
7ppb 1
 
1.5%
7 1
 
1.5%
0.004 1
 
1.5%
3.9 1
 
1.5%
3.3 1
 
1.5%

Length

2023-12-13T06:07:29.517661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:07:29.630236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4 22
33.8%
3 19
29.2%
5 13
20.0%
8 4
 
6.2%
6 2
 
3.1%
7ppb 1
 
1.5%
7 1
 
1.5%
0.004 1
 
1.5%
3.9 1
 
1.5%
3.3 1
 
1.5%

이산화질소
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)41.5%
Missing0
Missing (%)0.0%
Memory size652.0 B
33
29
14
 
4
32
 
4
23
 
4
Other values (22)
40 

Length

Max length5
Median length2
Mean length2.1846154
Min length2

Unique

Unique9 ?
Unique (%)13.8%

Sample

1st row26ppb
2nd row28
3rd row29
4th row25
5th row23

Common Values

ValueCountFrequency (%)
33 7
 
10.8%
29 6
 
9.2%
14 4
 
6.2%
32 4
 
6.2%
23 4
 
6.2%
18 3
 
4.6%
34 3
 
4.6%
31 3
 
4.6%
36 3
 
4.6%
24 3
 
4.6%
Other values (17) 25
38.5%

Length

2023-12-13T06:07:29.755706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
33 7
 
10.8%
29 6
 
9.2%
14 4
 
6.2%
32 4
 
6.2%
23 4
 
6.2%
18 3
 
4.6%
34 3
 
4.6%
31 3
 
4.6%
36 3
 
4.6%
24 3
 
4.6%
Other values (17) 25
38.5%

오 존
Categorical

Distinct32
Distinct (%)49.2%
Missing0
Missing (%)0.0%
Memory size652.0 B
35
 
4
22
 
4
34
 
4
42
 
3
26
 
3
Other values (27)
47 

Length

Max length5
Median length2
Mean length2.1846154
Min length2

Unique

Unique12 ?
Unique (%)18.5%

Sample

1st row16ppb
2nd row20
3rd row26
4th row35
5th row37

Common Values

ValueCountFrequency (%)
35 4
 
6.2%
22 4
 
6.2%
34 4
 
6.2%
42 3
 
4.6%
26 3
 
4.6%
33 3
 
4.6%
24 3
 
4.6%
17 3
 
4.6%
16 3
 
4.6%
20 3
 
4.6%
Other values (22) 32
49.2%

Length

2023-12-13T06:07:29.868677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
35 4
 
6.2%
34 4
 
6.2%
22 4
 
6.2%
42 3
 
4.6%
26 3
 
4.6%
33 3
 
4.6%
24 3
 
4.6%
17 3
 
4.6%
16 3
 
4.6%
20 3
 
4.6%
Other values (22) 32
49.2%

일산화탄소
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)13.8%
Missing0
Missing (%)0.0%
Memory size652.0 B
0.6
19 
0.5
17 
0.4
15 
0.7
0.8
Other values (4)

Length

Max length6
Median length3
Mean length3.0923077
Min length3

Unique

Unique2 ?
Unique (%)3.1%

Sample

1st row0.9ppb
2nd row0.8
3rd row0.6
4th row0.6
5th row0.6

Common Values

ValueCountFrequency (%)
0.6 19
29.2%
0.5 17
26.2%
0.4 15
23.1%
0.7 6
 
9.2%
0.8 2
 
3.1%
0.3 2
 
3.1%
0.58 2
 
3.1%
0.9ppb 1
 
1.5%
0.46 1
 
1.5%

Length

2023-12-13T06:07:29.986420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:07:30.104779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0.6 19
29.2%
0.5 17
26.2%
0.4 15
23.1%
0.7 6
 
9.2%
0.8 2
 
3.1%
0.3 2
 
3.1%
0.58 2
 
3.1%
0.9ppb 1
 
1.5%
0.46 1
 
1.5%

Interactions

2023-12-13T06:07:28.339895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:07:28.200300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:07:28.419261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:07:28.270538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:07:30.203125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년월별미세먼지(마이크로그램당 세제곱미터)초미세먼지(마이크로그램당 세제곱미터)아황산가스이산화질소오 존일산화탄소
년월별1.0001.0001.0001.0001.0001.0001.000
미세먼지(마이크로그램당 세제곱미터)1.0001.0000.8980.6560.8160.6490.680
초미세먼지(마이크로그램당 세제곱미터)1.0000.8981.0000.1160.0000.0000.586
아황산가스1.0000.6560.1161.0000.9260.9170.815
이산화질소1.0000.8160.0000.9261.0000.8770.960
오 존1.0000.6490.0000.9170.8771.0000.847
일산화탄소1.0000.6800.5860.8150.9600.8471.000
2023-12-13T06:07:30.302538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아황산가스이산화질소오 존일산화탄소
아황산가스1.0000.5580.4910.544
이산화질소0.5581.0000.3550.583
오 존0.4910.3551.0000.390
일산화탄소0.5440.5830.3901.000
2023-12-13T06:07:30.386793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
미세먼지(마이크로그램당 세제곱미터)초미세먼지(마이크로그램당 세제곱미터)아황산가스이산화질소오 존일산화탄소
미세먼지(마이크로그램당 세제곱미터)1.0000.8830.2490.3780.2080.386
초미세먼지(마이크로그램당 세제곱미터)0.8831.0000.0000.0000.0000.358
아황산가스0.2490.0001.0000.5580.4910.544
이산화질소0.3780.0000.5581.0000.3550.583
오 존0.2080.0000.4910.3551.0000.390
일산화탄소0.3860.3580.5440.5830.3901.000

Missing values

2023-12-13T06:07:28.526266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:07:28.654825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년월별미세먼지(마이크로그램당 세제곱미터)초미세먼지(마이크로그램당 세제곱미터)아황산가스이산화질소오 존일산화탄소
02020-01-1745257ppb26ppb16ppb0.9ppb
12020-02-174022828200.8
22020-03-175337829260.6
32020-04-175032825350.6
42020-05-175329823370.6
52020-06-174025718340.6
62020-07-174023422280.5
72020-08-172515423290.4
82020-09-173117429340.5
92020-10-172613429240.4
년월별미세먼지(마이크로그램당 세제곱미터)초미세먼지(마이크로그램당 세제곱미터)아황산가스이산화질소오 존일산화탄소
552022-08-192114314330.4
562022-09-192617318350.4
572022-10-193019323260.5
582022-11-194127332200.6
592022-12-193722329180.6
602023-01-195131432170.7
612023-02-194836329250.6
622023-03-1969373.929.337.50.58
632023-04-196527319.741.30.58
642023-05-1942243.319.242.50.46