Overview

Dataset statistics

Number of variables: 5
Number of observations: 37
Missing cells: 29
Missing cells (%): 15.7%
Duplicate rows: 4
Duplicate rows (%): 10.8%
Total size in memory: 1.6 KiB
Average record size in memory: 43.6 B
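
The percentages above follow directly from the counts. A minimal sketch of the arithmetic, assuming the missing-cell percentage is taken over all cells (variables times observations) and the duplicate percentage over observations; the variable names are illustrative only:

```python
# Counts copied from the Dataset statistics above.
n_variables = 5
n_observations = 37
missing_cells = 29
duplicate_rows = 4

# Assumption: missing cells are reported relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: duplicate rows are reported relative to the number of rows.
duplicate_pct = 100 * duplicate_rows / n_observations

# All 29 missing cells sit in a single column, so its own rate is far higher.
column_missing_pct = 100 * missing_cells / n_observations

print(f"missing cells: {missing_pct:.1f}%")
print(f"duplicate rows: {duplicate_pct:.1f}%")
print(f"missing in one column: {column_missing_pct:.1f}%")
```

The last figure matches the 78.4% reported for the first column in the Alerts section.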

Variable types

Text: 1
Categorical: 1
Unsupported: 3

Dataset

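Description: Provides monthly concentrations of fine dust, ultrafine dust, and ozone measured at the five stations of the Gimhae City air quality monitoring network (urban air: 동상동, 삼방동, 장유동, 진영읍; roadside air: 김해대로).
Author: Gimhae-si, Gyeongsangnam-do (경상남도 김해시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15105060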

Alerts

Dataset has 4 (10.8%) duplicate rows [Duplicates]
(초)미세먼지 및 오존 농도 현황 has 29 (78.4%) missing values [Missing]
Unnamed: 2 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 3 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 4 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
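
These alerts are consistent with a spreadsheet-style layout in which the real header and a units row sit inside the data, and the month label is written only on the first row of each block. The sketch below shows one way such rows could be cleaned. It is an assumption about the cause, not something the report states; the rows are taken from the Sample section (with the fused numeric columns split as ozone, PM10, PM2.5), and `clean` is a hypothetical helper.

```python
# Rows as read from the Sample section; None stands for <NA>.
raw_rows = [
    ("구분", "측정소명", "오존(O3)", "미세먼지(PM10)", "초미세먼지(PM2.5)"),
    (None, None, "(0.06ppm/8시간)", "(50㎍/㎥/1년)", "(15㎍/㎥/1년)"),
    ("2022년 1월", "동상동", 0.024, 38, 22),
    (None, "삼방동", 0.025, 29, 15),
    (None, "장유동", 0.022, 31, 21),
    (None, "진영읍", 0.019, 36, 24),
    (None, "김해대로", 0.019, 32, 21),
    ("2022년 2월", "동상동", 0.035, 35, 20),
]


def clean(rows):
    """Promote row 0 to the header, drop the units row, forward-fill the month."""
    header, _units, *data = rows
    cleaned = []
    current_month = None
    for month, station, o3, pm10, pm25 in data:
        if month is not None:
            current_month = month  # start of a new monthly block
        cleaned.append(
            {
                header[0]: current_month,
                header[1]: station,
                header[2]: float(o3),
                header[3]: int(pm10),
                header[4]: int(pm25),
            }
        )
    return cleaned


records = clean(raw_rows)
for record in records:
    print(record)
```

After a pass like this, every row carries its month, the measurement columns hold only numbers, and rows that differ only by month no longer look like duplicates.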

Reproduction

Analysis started: 2023-12-11 00:54:17.271911
Analysis finished: 2023-12-11 00:54:17.481372
Duration: 0.21 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

(초)미세먼지 및 오존 농도 현황
Text

Distinct: 8
Distinct (%): 100.0%
Missing: 29
Missing (%): 78.4%
Memory size: 428.0 B

Length

Max length: 8
Median length: 8
Mean length: 7.25
Min length: 2

Characters and Unicode

Total characters: 58
Distinct characters: 13
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
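
As an illustration of how such character properties can be read, the sketch below recomputes the character totals and general-category counts with Python's standard `unicodedata` module. The eight values are rebuilt from the Sample and value-count listings in this section; this is not necessarily how ydata-profiling computes them, and `unicodedata` exposes categories but not scripts or blocks.

```python
import unicodedata
from collections import Counter

# The eight distinct non-missing values of this column: the label "구분"
# plus one entry per month from January to July 2022.
values = ["구분"] + [f"2022년 {month}월" for month in range(1, 8)]

text = "".join(values)
total_characters = len(text)
distinct_characters = len(set(text))

# General category of every character: Nd = Decimal Number,
# Lo = Other Letter, Zs = Space Separator.
categories = Counter(unicodedata.category(ch) for ch in text)

print(total_characters, distinct_characters)
print(dict(categories))
```

The totals agree with the figures reported here: 58 characters, 13 of them distinct, in three categories.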

Unique

Unique: 8
Unique (%): 100.0%

Sample

1st row: 구분
2nd row: 2022년 1월
3rd row: 2022년 2월
4th row: 2022년 3월
5th row: 2022년 4월

Value      Count   Frequency (%)
2022년     7       46.7%
구분       1       6.7%
1월        1       6.7%
2월        1       6.7%
3월        1       6.7%
4월        1       6.7%
5월        1       6.7%
6월        1       6.7%
7월        1       6.7%

Most occurring characters

Value              Count   Frequency (%)
2                  22      37.9%
0                  7       12.1%
(space)            7       12.1%
년                 7       12.1%
월                 7       12.1%
구                 1       1.7%
분                 1       1.7%
1                  1       1.7%
3                  1       1.7%
4                  1       1.7%
Other values (3)   3       5.2%

Most occurring categories

Value             Count   Frequency (%)
Decimal Number    35      60.3%
Other Letter      16      27.6%
Space Separator   7       12.1%

Most frequent character per category

Decimal Number
Value     Count   Frequency (%)
2         22      62.9%
0         7       20.0%
1         1       2.9%
3         1       2.9%
4         1       2.9%
5         1       2.9%
6         1       2.9%
7         1       2.9%

Other Letter
Value     Count   Frequency (%)
년        7       43.8%
월        7       43.8%
구        1       6.2%
분        1       6.2%

Space Separator
Value     Count   Frequency (%)
(space)   7       100.0%

Most occurring scripts

Value    Count   Frequency (%)
Common   42      72.4%
Hangul   16      27.6%

Most frequent character per script

Common
Value     Count   Frequency (%)
2         22      52.4%
0         7       16.7%
(space)   7       16.7%
1         1       2.4%
3         1       2.4%
4         1       2.4%
5         1       2.4%
6         1       2.4%
7         1       2.4%

Hangul
Value     Count   Frequency (%)
년        7       43.8%
월        7       43.8%
구        1       6.2%
분        1       6.2%

Most occurring blocks

Value    Count   Frequency (%)
ASCII    42      72.4%
Hangul   16      27.6%

Most frequent character per block

ASCII
Value     Count   Frequency (%)
2         22      52.4%
0         7       16.7%
(space)   7       16.7%
1         1       2.4%
3         1       2.4%
4         1       2.4%
5         1       2.4%
6         1       2.4%
7         1       2.4%

Hangul
Value     Count   Frequency (%)
년        7       43.8%
월        7       43.8%
구        1       6.2%
분        1       6.2%

Unnamed: 1
Categorical

Distinct: 7
Distinct (%): 18.9%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Length

Max length: 4
Median length: 3
Mean length: 3.2432432
Min length: 3

Unique

Unique: 2
Unique (%): 5.4%

Sample

1st row: 측정소명
2nd row: <NA>
3rd row: 동상동
4th row: 삼방동
5th row: 장유동

Common Values

Value      Count   Frequency (%)
동상동     7       18.9%
삼방동     7       18.9%
장유동     7       18.9%
진영읍     7       18.9%
김해대로   7       18.9%
측정소명   1       2.7%
<NA>       1       2.7%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: bar chart of value counts (동상동, 삼방동, 장유동, 진영읍, 김해대로: 7 each, 18.9%; 측정소명, <NA>: 1 each, 2.7%).

Unnamed: 2
Unsupported

Status: rejected (unsupported)

Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Unnamed: 3
Unsupported

Status: rejected (unsupported)

Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Unnamed: 4
Unsupported

Status: rejected (unsupported)

Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Correlations

Figure: correlation heatmap.

                                (초)미세먼지 및 오존 농도 현황   Unnamed: 1
(초)미세먼지 및 오존 농도 현황   1.000                           1.000
Unnamed: 1                      1.000                           1.000

Missing values

Figure: a simple visualization of nullity by column.
Figure: nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.

Sample

     (초)미세먼지 및 오존 농도 현황   Unnamed: 1   Unnamed: 2        Unnamed: 3       Unnamed: 4
0    구분                            측정소명     오존(O3)          미세먼지(PM10)   초미세먼지(PM2.5)
1    <NA>                            <NA>         (0.06ppm/8시간)   (50㎍/㎥/1년)    (15㎍/㎥/1년)
2    2022년 1월                      동상동       0.024             38               22
3    <NA>                            삼방동       0.025             29               15
4    <NA>                            장유동       0.022             31               21
5    <NA>                            진영읍       0.019             36               24
6    <NA>                            김해대로     0.019             32               21
7    2022년 2월                      동상동       0.035             35               20
8    <NA>                            삼방동       0.035             27               14
9    <NA>                            장유동       0.035             31               19

     (초)미세먼지 및 오존 농도 현황   Unnamed: 1   Unnamed: 2        Unnamed: 3       Unnamed: 4
27   2022년 6월                      동상동       0.034             17               9
28   <NA>                            삼방동       0.038             13               7
29   <NA>                            장유동       0.031             18               11
30   <NA>                            진영읍       0.037             21               13
31   <NA>                            김해대로     0.032             20               11
32   2022년 7월                      동상동       0.034             21               14
33   <NA>                            삼방동       0.039             16               8
34   <NA>                            장유동       0.033             23               15
35   <NA>                            진영읍       0.035             27               16
36   <NA>                            김해대로     0.029             25               16

Duplicate rows

Most frequently occurring

    (초)미세먼지 및 오존 농도 현황   Unnamed: 1   # duplicates
0   <NA>                            김해대로     7
1   <NA>                            삼방동       7
2   <NA>                            장유동       7
3   <NA>                            진영읍       7
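
The table lists four duplicated combinations, each occurring seven times. One reading, which is an assumption rather than something the report states, is that duplicates are detected on the two supported columns only, so every station row that lacks a month label collides with the same station in the other months. A minimal sketch that rebuilds the 37 (month, station) pairs from the layout seen in the Sample section and counts the repeats:

```python
from collections import Counter

stations = ["동상동", "삼방동", "장유동", "진영읍", "김해대로"]

# Rows 0 and 1: the embedded header row and the units row (None stands for <NA>).
pairs = [("구분", "측정소명"), (None, None)]

# Seven monthly blocks; only the first station row in a block carries the month.
for month in range(1, 8):
    for position, station in enumerate(stations):
        label = f"2022년 {month}월" if position == 0 else None
        pairs.append((label, station))

counts = Counter(pairs)
duplicated = {pair: n for pair, n in counts.items() if n > 1}

for (label, station), n in sorted(duplicated.items(), key=lambda item: item[0][1]):
    print(label, station, n)
```

Under this reading the four duplicated combinations account for the "4 (10.8%)" in the Alerts section, since 4 out of 37 rows is 10.8%.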