Overview

Dataset statistics

Number of variables9
Number of observations27
Missing cells8
Missing cells (%)3.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory76.9 B

Variable types

Categorical7
Text2

Dataset

Description본 데이터는 한국환경산업기술원 환경책임투자종합플랫폼(https://www.gmi.go.kr)에서 제공 및 실시하는 환경정보공개제도 관련하여, 2021년 1월 기준의 분야별 환경정보 등록기업(공개항목, 제조, 공공행정, 교육서비스, 보건, 기타 서비스 등)의 정보공개 대상 세부항목을 정리한 내용입니다.
Author한국환경산업기술원
URLhttps://www.data.go.kr/data/15089175/fileData.do

Alerts

보건 is highly overall correlated with 제조 and 4 other fieldsHigh correlation
기타 산업 is highly overall correlated with 제조 and 3 other fieldsHigh correlation
교육서비스 is highly overall correlated with 제조 and 4 other fieldsHigh correlation
기타 서비스 is highly overall correlated with 제조 and 4 other fieldsHigh correlation
제조 is highly overall correlated with 교육서비스 and 3 other fieldsHigh correlation
공공행정 is highly overall correlated with 교육서비스 and 2 other fieldsHigh correlation
주요내용 has 8 (29.6%) missing valuesMissing
공개항목 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:58:47.530645
Analysis finished2023-12-12 07:58:48.390330
Duration0.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct6
Distinct (%)22.2%
Missing0
Missing (%)0.0%
Memory size348.0 B
온실가스/환경오염
녹색제품서비스
자원/에너지
사회윤리적 책임
기업개요

Length

Max length9
Median length8
Mean length7.4444444
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기업개요
2nd row기업개요
3rd row녹색경영 시스템
4th row녹색경영 시스템
5th row자원/에너지

Common Values

ValueCountFrequency (%)
온실가스/환경오염 9
33.3%
녹색제품서비스 6
22.2%
자원/에너지 5
18.5%
사회윤리적 책임 3
 
11.1%
기업개요 2
 
7.4%
녹색경영 시스템 2
 
7.4%

Length

2023-12-12T16:58:48.467666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:58:48.587434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
온실가스/환경오염 9
28.1%
녹색제품서비스 6
18.8%
자원/에너지 5
15.6%
사회윤리적 3
 
9.4%
책임 3
 
9.4%
기업개요 2
 
6.2%
녹색경영 2
 
6.2%
시스템 2
 
6.2%

공개항목
Text

UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-12T16:58:48.863693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length19
Mean length15.37037
Min length4

Characters and Unicode

Total characters415
Distinct characters129
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row사업현황
2nd row환경관련 수상 및 협약 현황
3rd row비전, 전략, 방침, 목표
4th row전담조직, 교육훈련, 내부심사 등
5th row저감투자 및 기술도입
ValueCountFrequency (%)
11
 
12.0%
현황 9
 
9.8%
기술도입 5
 
5.4%
배출량·원단위 4
 
4.3%
저감투자 3
 
3.3%
사용량·원단위 2
 
2.2%
투자 2
 
2.2%
온실가스 2
 
2.2%
녹색구매 1
 
1.1%
지침 1
 
1.1%
Other values (52) 52
56.5%
2023-12-12T16:58:49.323215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
65
 
15.7%
· 13
 
3.1%
11
 
2.7%
11
 
2.7%
11
 
2.7%
10
 
2.4%
10
 
2.4%
10
 
2.4%
10
 
2.4%
9
 
2.2%
Other values (119) 255
61.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 322
77.6%
Space Separator 65
 
15.7%
Other Punctuation 18
 
4.3%
Lowercase Letter 3
 
0.7%
Close Punctuation 2
 
0.5%
Open Punctuation 2
 
0.5%
Uppercase Letter 1
 
0.2%
Letter Number 1
 
0.2%
Decimal Number 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
3.4%
11
 
3.4%
11
 
3.4%
10
 
3.1%
10
 
3.1%
10
 
3.1%
10
 
3.1%
9
 
2.8%
9
 
2.8%
8
 
2.5%
Other values (108) 223
69.3%
Lowercase Letter
ValueCountFrequency (%)
y 1
33.3%
e 1
33.3%
p 1
33.3%
Other Punctuation
ValueCountFrequency (%)
· 13
72.2%
, 5
 
27.8%
Space Separator
ValueCountFrequency (%)
65
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
T 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 322
77.6%
Common 88
 
21.2%
Latin 5
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
3.4%
11
 
3.4%
11
 
3.4%
10
 
3.1%
10
 
3.1%
10
 
3.1%
10
 
3.1%
9
 
2.8%
9
 
2.8%
8
 
2.5%
Other values (108) 223
69.3%
Common
ValueCountFrequency (%)
65
73.9%
· 13
 
14.8%
, 5
 
5.7%
) 2
 
2.3%
( 2
 
2.3%
3 1
 
1.1%
Latin
ValueCountFrequency (%)
T 1
20.0%
y 1
20.0%
e 1
20.0%
1
20.0%
p 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 322
77.6%
ASCII 79
 
19.0%
None 13
 
3.1%
Number Forms 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
65
82.3%
, 5
 
6.3%
) 2
 
2.5%
( 2
 
2.5%
T 1
 
1.3%
y 1
 
1.3%
e 1
 
1.3%
3 1
 
1.3%
p 1
 
1.3%
None
ValueCountFrequency (%)
· 13
100.0%
Hangul
ValueCountFrequency (%)
11
 
3.4%
11
 
3.4%
11
 
3.4%
10
 
3.1%
10
 
3.1%
10
 
3.1%
10
 
3.1%
9
 
2.8%
9
 
2.8%
8
 
2.5%
Other values (108) 223
69.3%
Number Forms
ValueCountFrequency (%)
1
100.0%

제조
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)7.4%
Missing0
Missing (%)0.0%
Memory size348.0 B
자율
14 
의무
13 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의무
2nd row자율
3rd row자율
4th row의무
5th row의무

Common Values

ValueCountFrequency (%)
자율 14
51.9%
의무 13
48.1%

Length

2023-12-12T16:58:49.497454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:58:49.614806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자율 14
51.9%
의무 13
48.1%

공공행정
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Memory size348.0 B
자율
11 
의무
해당없음

Length

Max length4
Median length2
Mean length2.5925926
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의무
2nd row자율
3rd row의무
4th row의무
5th row자율

Common Values

ValueCountFrequency (%)
자율 11
40.7%
의무 8
29.6%
해당없음 8
29.6%

Length

2023-12-12T16:58:49.765962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:58:49.902523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자율 11
40.7%
의무 8
29.6%
해당없음 8
29.6%

교육서비스
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Memory size348.0 B
자율
13 
해당없음
의무

Length

Max length4
Median length2
Mean length2.5925926
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의무
2nd row자율
3rd row자율
4th row의무
5th row자율

Common Values

ValueCountFrequency (%)
자율 13
48.1%
해당없음 8
29.6%
의무 6
22.2%

Length

2023-12-12T16:58:50.052957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:58:50.174714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자율 13
48.1%
해당없음 8
29.6%
의무 6
22.2%

보건
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Memory size348.0 B
자율
12 
해당없음
의무

Length

Max length4
Median length2
Mean length2.5925926
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의무
2nd row자율
3rd row자율
4th row의무
5th row자율

Common Values

ValueCountFrequency (%)
자율 12
44.4%
해당없음 8
29.6%
의무 7
25.9%

Length

2023-12-12T16:58:50.288494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:58:50.411768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자율 12
44.4%
해당없음 8
29.6%
의무 7
25.9%

기타 서비스
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Memory size348.0 B
자율
13 
해당없음
의무

Length

Max length4
Median length2
Mean length2.5925926
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의무
2nd row자율
3rd row자율
4th row의무
5th row자율

Common Values

ValueCountFrequency (%)
자율 13
48.1%
해당없음 8
29.6%
의무 6
22.2%

Length

2023-12-12T16:58:50.571305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:58:50.711442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자율 13
48.1%
해당없음 8
29.6%
의무 6
22.2%

기타 산업
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)7.4%
Missing0
Missing (%)0.0%
Memory size348.0 B
자율
14 
의무
13 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의무
2nd row자율
3rd row자율
4th row의무
5th row의무

Common Values

ValueCountFrequency (%)
자율 14
51.9%
의무 13
48.1%

Length

2023-12-12T16:58:50.848852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:58:50.965963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자율 14
51.9%
의무 13
48.1%

주요내용
Text

MISSING 

Distinct19
Distinct (%)100.0%
Missing8
Missing (%)29.6%
Memory size348.0 B
2023-12-12T16:58:51.222541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length34
Mean length32.210526
Min length8

Characters and Unicode

Total characters612
Distinct characters160
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)100.0%

Sample

1st row매출액, 종업원수, 소속 사업장 현황 등
2nd row환경관련 수상, 인증, 협약 등
3rd row녹색경영 비전·전략, 목표 및 계획
4th row녹색경영 관련 전담 조직, 교육훈련, 내부심사, 환경안전사고 관련 대응체계 및 훈련 등
5th row용수(상수, 지하수, 하천수 등) 사용량 및 재활용 실적
ValueCountFrequency (%)
14
 
9.3%
8
 
5.3%
실적 6
 
4.0%
현황 4
 
2.6%
발생량 3
 
2.0%
에너지 3
 
2.0%
관련 3
 
2.0%
배출량 2
 
1.3%
운영 2
 
1.3%
등의 2
 
1.3%
Other values (94) 104
68.9%
2023-12-12T16:58:51.704873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
132
 
21.6%
, 26
 
4.2%
14
 
2.3%
13
 
2.1%
12
 
2.0%
11
 
1.8%
11
 
1.8%
9
 
1.5%
9
 
1.5%
9
 
1.5%
Other values (150) 366
59.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 414
67.6%
Space Separator 132
 
21.6%
Other Punctuation 33
 
5.4%
Uppercase Letter 23
 
3.8%
Close Punctuation 3
 
0.5%
Open Punctuation 3
 
0.5%
Lowercase Letter 2
 
0.3%
Dash Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
3.4%
13
 
3.1%
12
 
2.9%
11
 
2.7%
11
 
2.7%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
8
 
1.9%
Other values (131) 309
74.6%
Uppercase Letter
ValueCountFrequency (%)
O 5
21.7%
T 3
13.0%
S 3
13.0%
D 2
 
8.7%
P 2
 
8.7%
N 2
 
8.7%
R 1
 
4.3%
G 1
 
4.3%
E 1
 
4.3%
M 1
 
4.3%
Other values (2) 2
 
8.7%
Other Punctuation
ValueCountFrequency (%)
, 26
78.8%
· 7
 
21.2%
Space Separator
ValueCountFrequency (%)
132
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Lowercase Letter
ValueCountFrequency (%)
x 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 414
67.6%
Common 173
28.3%
Latin 25
 
4.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
3.4%
13
 
3.1%
12
 
2.9%
11
 
2.7%
11
 
2.7%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
8
 
1.9%
Other values (131) 309
74.6%
Latin
ValueCountFrequency (%)
O 5
20.0%
T 3
12.0%
S 3
12.0%
D 2
 
8.0%
P 2
 
8.0%
x 2
 
8.0%
N 2
 
8.0%
R 1
 
4.0%
G 1
 
4.0%
E 1
 
4.0%
Other values (3) 3
12.0%
Common
ValueCountFrequency (%)
132
76.3%
, 26
 
15.0%
· 7
 
4.0%
) 3
 
1.7%
( 3
 
1.7%
- 2
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 414
67.6%
ASCII 191
31.2%
None 7
 
1.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
132
69.1%
, 26
 
13.6%
O 5
 
2.6%
T 3
 
1.6%
) 3
 
1.6%
S 3
 
1.6%
( 3
 
1.6%
D 2
 
1.0%
P 2
 
1.0%
x 2
 
1.0%
Other values (8) 10
 
5.2%
Hangul
ValueCountFrequency (%)
14
 
3.4%
13
 
3.1%
12
 
2.9%
11
 
2.7%
11
 
2.7%
9
 
2.2%
9
 
2.2%
9
 
2.2%
9
 
2.2%
8
 
1.9%
Other values (131) 309
74.6%
None
ValueCountFrequency (%)
· 7
100.0%

Correlations

2023-12-12T16:58:51.824507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분공개항목제조공공행정교육서비스보건기타 서비스기타 산업주요내용
구분1.0001.0000.5950.7670.6220.5270.6220.5951.000
공개항목1.0001.0001.0001.0001.0001.0001.0001.0001.000
제조0.5951.0001.0000.1740.3210.3620.3210.9931.000
공공행정0.7671.0000.1741.0000.9930.9830.9930.1741.000
교육서비스0.6221.0000.3210.9931.0000.9981.0000.3211.000
보건0.5271.0000.3620.9830.9981.0000.9980.3621.000
기타 서비스0.6221.0000.3210.9931.0000.9981.0000.3211.000
기타 산업0.5951.0000.9930.1740.3210.3620.3211.0001.000
주요내용1.0001.0001.0001.0001.0001.0001.0001.0001.000
2023-12-12T16:58:51.954883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
보건기타 산업구분교육서비스기타 서비스제조공공행정
보건1.0000.5630.2230.9420.9420.5630.840
기타 산업0.5631.0000.3880.5040.5040.9230.276
구분0.2230.3881.0000.2870.2870.3880.409
교육서비스0.9420.5040.2871.0001.0000.5040.896
기타 서비스0.9420.5040.2871.0001.0000.5040.896
제조0.5630.9230.3880.5040.5041.0000.276
공공행정0.8400.2760.4090.8960.8960.2761.000
2023-12-12T16:58:52.071677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분제조공공행정교육서비스보건기타 서비스기타 산업
구분1.0000.3880.4090.2870.2230.2870.388
제조0.3881.0000.2760.5040.5630.5040.923
공공행정0.4090.2761.0000.8960.8400.8960.276
교육서비스0.2870.5040.8961.0000.9421.0000.504
보건0.2230.5630.8400.9421.0000.9420.563
기타 서비스0.2870.5040.8961.0000.9421.0000.504
기타 산업0.3880.9230.2760.5040.5630.5041.000

Missing values

2023-12-12T16:58:48.190284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:58:48.327601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분공개항목제조공공행정교육서비스보건기타 서비스기타 산업주요내용
0기업개요사업현황의무의무의무의무의무의무매출액, 종업원수, 소속 사업장 현황 등
1기업개요환경관련 수상 및 협약 현황자율자율자율자율자율자율환경관련 수상, 인증, 협약 등
2녹색경영 시스템비전, 전략, 방침, 목표자율의무자율자율자율자율녹색경영 비전·전략, 목표 및 계획
3녹색경영 시스템전담조직, 교육훈련, 내부심사 등의무의무의무의무의무의무녹색경영 관련 전담 조직, 교육훈련, 내부심사, 환경안전사고 관련 대응체계 및 훈련 등
4자원/에너지저감투자 및 기술도입의무자율자율자율자율의무<NA>
5자원/에너지원부자재 사용량·원단위의무해당없음해당없음해당없음해당없음의무<NA>
6자원/에너지용수 사용량·원단위·재활용량의무의무의무의무의무의무용수(상수, 지하수, 하천수 등) 사용량 및 재활용 실적
7자원/에너지에너지 사용량·원단위의무의무의무의무의무의무온실가스목표관리제도 기준으로 에너지원별 사용량을 입력하고 에너지 총량(TOE단위) 공개
8자원/에너지신재생에너지 투자 및 기술도입자율자율자율자율자율자율신재생 에너지 개발·적용 및 도입, 환경친화적 에너지 개발 등의 투자 및 기술도입 실적
9온실가스/환경오염온실가스 저감투자 및 기술도입자율자율자율자율자율자율생산공정에서 온실가스 배출 저감을 위한 설비 교체, 고효율 설비 및 공정 개선 등 실적
구분공개항목제조공공행정교육서비스보건기타 서비스기타 산업주요내용
17온실가스/환경오염토양·소음진동·악취 관리 현황자율해당없음해당없음해당없음해당없음자율<NA>
18녹색제품서비스녹색제품·서비스개발 투자 및 기술도입자율해당없음해당없음해당없음해당없음자율<NA>
19녹색제품서비스친환경설계(에코디자인) 현황자율해당없음해당없음해당없음해당없음자율<NA>
20녹색제품서비스제 3자 인증 및 TypeⅡ인증 제품 현황자율해당없음해당없음해당없음해당없음자율<NA>
21녹색제품서비스녹색구매 지침 운영 현황자율의무자율자율자율자율녹색제품(환경마크, GR마크 등) 구매관련 지침 및 운영 실적
22녹색제품서비스협력업체 환경정보관리 및 환경성 평가자율해당없음해당없음해당없음해당없음자율<NA>
23녹색제품서비스환경기술 및 교육지원 현황자율해당없음해당없음해당없음해당없음자율<NA>
24사회윤리적 책임국내외 환경법규 위반 현황의무의무의무의무의무의무환경사고 발생, 민원발생 및 대응, 배출부과금 체납, 환경관련법규 위반 등 실적
25사회윤리적 책임환경(지속가능)보고서 발간 현황자율자율자율자율자율자율기업의 환경관리 활동 및 성과 등이 포함된 환경보고서 또는 지속가능보고서
26사회윤리적 책임이해관계자 환경정보 요청 대응현황자율자율자율자율자율자율환경관련 이해관계자 요청 및 대응 현황