Overview

Dataset statistics

Number of variables4
Number of observations28
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.0 KiB
Average record size in memory36.7 B

Variable types

Categorical2
Text2

Dataset

Description한국남동발전 환경화학 시스템 내 환경성과평가 지표 정보입니다. 지표그룹에 따른 지표내역, 지표내용 등의 데이터를 포함하고 있습니다.
Author공공데이터포털
URLhttps://www.data.go.kr/data/15093043/fileData.do

Alerts

지표그룹 is highly overall correlated with 단위High correlation
단위 is highly overall correlated with 지표그룹High correlation
지표내역 has unique valuesUnique

Reproduction

Analysis started2024-04-18 04:49:17.895517
Analysis finished2024-04-18 04:49:19.991597
Duration2.1 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지표그룹
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size356.0 B
OPI
10 
MPI
CPI
ECI

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowECI
2nd rowECI
3rd rowECI
4th rowMPI
5th rowMPI

Common Values

ValueCountFrequency (%)
OPI 10
35.7%
MPI 8
28.6%
CPI 7
25.0%
ECI 3
 
10.7%

Length

2024-04-18T13:49:20.055440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T13:49:20.150069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
opi 10
35.7%
mpi 8
28.6%
cpi 7
25.0%
eci 3
 
10.7%

지표내역
Text

UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-04-18T13:49:20.335027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length13
Mean length10.107143
Min length5

Characters and Unicode

Total characters283
Distinct characters93
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)100.0%

Sample

1st row주변지역의 SOx 농도
2nd row주변지역의 NOx 농도
3rd row주변지역의 먼지 농도
4th row환경관련 사고 건수
5th row배출물질 초과건수
ValueCountFrequency (%)
배출량 4
 
5.9%
환경관련 4
 
5.9%
주변지역의 3
 
4.4%
농도 3
 
4.4%
건수 3
 
4.4%
석탄회 2
 
2.9%
기후변화관련 2
 
2.9%
재활용율 2
 
2.9%
폐기물 2
 
2.9%
사용량 2
 
2.9%
Other values (41) 41
60.3%
2024-04-18T13:49:20.680500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41
 
14.5%
12
 
4.2%
11
 
3.9%
9
 
3.2%
8
 
2.8%
8
 
2.8%
7
 
2.5%
7
 
2.5%
6
 
2.1%
6
 
2.1%
Other values (83) 168
59.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 232
82.0%
Space Separator 41
 
14.5%
Uppercase Letter 6
 
2.1%
Lowercase Letter 2
 
0.7%
Decimal Number 1
 
0.4%
Other Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
5.2%
11
 
4.7%
9
 
3.9%
8
 
3.4%
8
 
3.4%
7
 
3.0%
7
 
3.0%
6
 
2.6%
6
 
2.6%
6
 
2.6%
Other values (75) 152
65.5%
Uppercase Letter
ValueCountFrequency (%)
O 3
50.0%
S 1
 
16.7%
C 1
 
16.7%
N 1
 
16.7%
Space Separator
ValueCountFrequency (%)
41
100.0%
Lowercase Letter
ValueCountFrequency (%)
x 2
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 232
82.0%
Common 43
 
15.2%
Latin 8
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
5.2%
11
 
4.7%
9
 
3.9%
8
 
3.4%
8
 
3.4%
7
 
3.0%
7
 
3.0%
6
 
2.6%
6
 
2.6%
6
 
2.6%
Other values (75) 152
65.5%
Latin
ValueCountFrequency (%)
O 3
37.5%
x 2
25.0%
S 1
 
12.5%
C 1
 
12.5%
N 1
 
12.5%
Common
ValueCountFrequency (%)
41
95.3%
2 1
 
2.3%
/ 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 232
82.0%
ASCII 51
 
18.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
41
80.4%
O 3
 
5.9%
x 2
 
3.9%
S 1
 
2.0%
2 1
 
2.0%
C 1
 
2.0%
/ 1
 
2.0%
N 1
 
2.0%
Hangul
ValueCountFrequency (%)
12
 
5.2%
11
 
4.7%
9
 
3.9%
8
 
3.4%
8
 
3.4%
7
 
3.0%
7
 
3.0%
6
 
2.6%
6
 
2.6%
6
 
2.6%
Other values (75) 152
65.5%

단위
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)39.3%
Missing0
Missing (%)0.0%
Memory size356.0 B
건수
톤/MWH
%
PPB
Other values (6)

Length

Max length7
Median length6
Mean length3.2857143
Min length1

Unique

Unique4 ?
Unique (%)14.3%

Sample

1st rowPPB
2nd rowPPB
3rd row㎍/㎥
4th row건수
5th row건수

Common Values

ValueCountFrequency (%)
건수 8
28.6%
톤/MWH 5
17.9%
% 3
 
10.7%
PPB 2
 
7.1%
2
 
7.1%
TOE/MWH 2
 
7.1%
KG/MWH 2
 
7.1%
㎍/㎥ 1
 
3.6%
1
 
3.6%
G/MWH 1
 
3.6%

Length

2024-04-18T13:49:20.841342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
건수 8
28.6%
톤/mwh 5
17.9%
3
 
10.7%
ppb 2
 
7.1%
2
 
7.1%
toe/mwh 2
 
7.1%
kg/mwh 2
 
7.1%
㎍/㎥ 1
 
3.6%
1
 
3.6%
g/mwh 1
 
3.6%
Distinct24
Distinct (%)85.7%
Missing0
Missing (%)0.0%
Memory size356.0 B
2024-04-18T13:49:21.017937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length17.5
Mean length12
Min length5

Characters and Unicode

Total characters336
Distinct characters95
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)75.0%

Sample

1st row직접 입력 지표 : 평균
2nd row직접 입력 지표 : 평균
3rd row직접 입력 지표 : 평균
4th row직접 입력 지표 : 계
5th row대기환경 배출기준 초과 건수
ValueCountFrequency (%)
8
 
8.6%
직접 5
 
5.4%
지표 5
 
5.4%
입력 5
 
5.4%
배출량 4
 
4.3%
환경관련 4
 
4.3%
석탄회 4
 
4.3%
사용량 3
 
3.2%
건수 3
 
3.2%
평균 3
 
3.2%
Other values (42) 49
52.7%
2024-04-18T13:49:21.348296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
65
 
19.3%
11
 
3.3%
10
 
3.0%
8
 
2.4%
8
 
2.4%
: 8
 
2.4%
8
 
2.4%
7
 
2.1%
7
 
2.1%
7
 
2.1%
Other values (85) 197
58.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 248
73.8%
Space Separator 65
 
19.3%
Other Punctuation 11
 
3.3%
Decimal Number 5
 
1.5%
Uppercase Letter 5
 
1.5%
Close Punctuation 1
 
0.3%
Open Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
4.4%
10
 
4.0%
8
 
3.2%
8
 
3.2%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
7
 
2.8%
6
 
2.4%
Other values (73) 169
68.1%
Uppercase Letter
ValueCountFrequency (%)
O 2
40.0%
C 1
20.0%
I 1
20.0%
S 1
20.0%
Decimal Number
ValueCountFrequency (%)
0 3
60.0%
2 1
 
20.0%
5 1
 
20.0%
Other Punctuation
ValueCountFrequency (%)
: 8
72.7%
/ 3
 
27.3%
Space Separator
ValueCountFrequency (%)
65
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 248
73.8%
Common 83
 
24.7%
Latin 5
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
4.4%
10
 
4.0%
8
 
3.2%
8
 
3.2%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
7
 
2.8%
6
 
2.4%
Other values (73) 169
68.1%
Common
ValueCountFrequency (%)
65
78.3%
: 8
 
9.6%
/ 3
 
3.6%
0 3
 
3.6%
2 1
 
1.2%
) 1
 
1.2%
( 1
 
1.2%
5 1
 
1.2%
Latin
ValueCountFrequency (%)
O 2
40.0%
C 1
20.0%
I 1
20.0%
S 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 248
73.8%
ASCII 88
 
26.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
65
73.9%
: 8
 
9.1%
/ 3
 
3.4%
0 3
 
3.4%
O 2
 
2.3%
C 1
 
1.1%
2 1
 
1.1%
) 1
 
1.1%
( 1
 
1.1%
5 1
 
1.1%
Other values (2) 2
 
2.3%
Hangul
ValueCountFrequency (%)
11
 
4.4%
10
 
4.0%
8
 
3.2%
8
 
3.2%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
7
 
2.8%
6
 
2.4%
Other values (73) 169
68.1%

Correlations

2024-04-18T13:49:21.438144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지표그룹지표내역단위지표내용
지표그룹1.0001.0000.8460.991
지표내역1.0001.0001.0001.000
단위0.8461.0001.0000.856
지표내용0.9911.0000.8561.000
2024-04-18T13:49:21.521858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지표그룹단위
지표그룹1.0000.584
단위0.5841.000
2024-04-18T13:49:21.596628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지표그룹단위
지표그룹1.0000.584
단위0.5841.000

Missing values

2024-04-18T13:49:19.958153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지표그룹지표내역단위지표내용
0ECI주변지역의 SOx 농도PPB직접 입력 지표 : 평균
1ECI주변지역의 NOx 농도PPB직접 입력 지표 : 평균
2ECI주변지역의 먼지 농도㎍/㎥직접 입력 지표 : 평균
3MPI환경관련 사고 건수건수직접 입력 지표 : 계
4MPI배출물질 초과건수건수대기환경 배출기준 초과 건수
5MPI환경관련 민원 발생 건수건수환경관련 민원 발생 건수
6MPI환경보전활동 등 환경행사 참가건수건수환경행사 관련(행사분류 0500)
7MPI내/외부 환경감사에 의한 부적합 건수건수내/외부 ISO에 의한 부적합 건수
8MPI환경관련 교육횟수환경관련 교육건수
9MPI환경관련 투자실적환경관련 투자실적
지표그룹지표내역단위지표내용
18OPI폐기물 배출량톤/MWH폐기물 배출량 : 남부/서부 석탄회 포함
19OPI폐기물 재활용율%폐기물 재활용율 : 남부/서부는 석탄회 포함
20OPI폐수 재이용량톤/MWH폐수 재이용량
21CPI에너지 사용량TOE/MWH에너지 사용량
22CPICO2 원단위 배출량톤/MWHCO2 원단위 배출량
23CPI온실가스 감축량톤_CO2온실가스 감축량
24CPI신재생에너지 발전비율%신재생에너지 발전비율
25CPI감축사업 시행건수건수감축사업 시행건수
26CPI기후변화관련 행사 및 활동건수건수기후변화 관련 행사 및 활동건수
27CPI기후변화관련 교육건수건수환경관련 교육건수 : 기후 관련