Overview

Dataset statistics

Number of variables5
Number of observations215
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.7 KiB
Average record size in memory41.6 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description환경정보공개시스템 공개세목리스트(20년 기준 / 공개항목, 항목구분(수상현황, 인증현황, 매출액 등), 세목, 세목 리스트 등)에 대한 분류 기준 제공
Author한국환경산업기술원
URLhttps://www.data.go.kr/data/15072065/fileData.do

Alerts

번호 is highly overall correlated with 공개항목High correlation
공개항목 is highly overall correlated with 번호High correlation
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:10:17.740660
Analysis finished2023-12-12 22:10:18.388132
Duration0.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct215
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean108
Minimum1
Maximum215
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-13T07:10:18.449680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile11.7
Q154.5
median108
Q3161.5
95-th percentile204.3
Maximum215
Range214
Interquartile range (IQR)107

Descriptive statistics

Standard deviation62.209324
Coefficient of variation (CV)0.57601226
Kurtosis-1.2
Mean108
Median Absolute Deviation (MAD)54
Skewness0
Sum23220
Variance3870
MonotonicityStrictly increasing
2023-12-13T07:10:18.567758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
149 1
 
0.5%
138 1
 
0.5%
139 1
 
0.5%
140 1
 
0.5%
141 1
 
0.5%
142 1
 
0.5%
143 1
 
0.5%
144 1
 
0.5%
145 1
 
0.5%
Other values (205) 205
95.3%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
215 1
0.5%
214 1
0.5%
213 1
0.5%
212 1
0.5%
211 1
0.5%
210 1
0.5%
209 1
0.5%
208 1
0.5%
207 1
0.5%
206 1
0.5%

공개항목
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)12.6%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
녹색경영 전담조직 및 업무 · 역할 · 권한 교육훈련, 환경 · 안전사고 대응체계, 내부심사 실시 및 조치
26 
에너지 사용량
22 
토양 · 소음진동 · 악취 관리 현황
18 
환경오염물질 · 제품 · 서비스와 관련된 환경법규 위반 현황
16 
원부자재 · 용수 · 에너지 절감 투자 및 기술 도입
15 
Other values (22)
118 

Length

Max length59
Median length30
Mean length25.753488
Min length7

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row업종, 생산제품, 매출액, 생산량, 종업원수
2nd row업종, 생산제품, 매출액, 생산량, 종업원수
3rd row업종, 생산제품, 매출액, 생산량, 종업원수
4th row업종, 생산제품, 매출액, 생산량, 종업원수
5th row업종, 생산제품, 매출액, 생산량, 종업원수

Common Values

ValueCountFrequency (%)
녹색경영 전담조직 및 업무 · 역할 · 권한 교육훈련, 환경 · 안전사고 대응체계, 내부심사 실시 및 조치 26
 
12.1%
에너지 사용량 22
 
10.2%
토양 · 소음진동 · 악취 관리 현황 18
 
8.4%
환경오염물질 · 제품 · 서비스와 관련된 환경법규 위반 현황 16
 
7.4%
원부자재 · 용수 · 에너지 절감 투자 및 기술 도입 15
 
7.0%
대기 · 수질오염 · 화학물질관리시설 및 모니터링 시스템 현황 15
 
7.0%
녹색기업, 환경경영대상, 폐기물감축 자발적 협약, 녹색구매 자발적협약 등 9
 
4.2%
온실가스 관리수준(인벤토리, 목표-계획 8
 
3.7%
업종, 생산제품, 매출액, 생산량, 종업원수 7
 
3.3%
폐기물 발생량 · 재활용량 7
 
3.3%
Other values (17) 72
33.5%

Length

2023-12-13T07:10:18.677497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
· 225
 
14.9%
114
 
7.6%
현황 66
 
4.4%
에너지 42
 
2.8%
투자 34
 
2.3%
사용량 32
 
2.1%
녹색경영 28
 
1.9%
내부심사 26
 
1.7%
전담조직 26
 
1.7%
실시 26
 
1.7%
Other values (85) 887
58.9%
Distinct60
Distinct (%)27.9%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T07:10:18.877363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length21
Mean length14.609302
Min length4

Characters and Unicode

Total characters3141
Distinct characters159
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)7.0%

Sample

1st row업종, 생산제품, 매출액, 생산량, 종업원수
2nd row업종, 생산제품, 매출액, 생산량, 종업원수
3rd row업종, 생산제품, 매출액, 생산량, 종업원수
4th row업종, 생산제품, 매출액, 생산량, 종업원수
5th row업종, 생산제품, 매출액, 생산량, 종업원수
ValueCountFrequency (%)
현황 104
 
12.4%
55
 
6.6%
실적 46
 
5.5%
투자 34
 
4.1%
기술 34
 
4.1%
도입 34
 
4.1%
사용 25
 
3.0%
에너지원별 21
 
2.5%
관리시설 21
 
2.5%
관련 16
 
1.9%
Other values (92) 448
53.5%
2023-12-13T07:10:19.245985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
623
 
19.8%
118
 
3.8%
118
 
3.8%
71
 
2.3%
59
 
1.9%
56
 
1.8%
56
 
1.8%
55
 
1.8%
53
 
1.7%
52
 
1.7%
Other values (149) 1880
59.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2393
76.2%
Space Separator 623
 
19.8%
Other Punctuation 47
 
1.5%
Close Punctuation 21
 
0.7%
Open Punctuation 21
 
0.7%
Lowercase Letter 12
 
0.4%
Decimal Number 8
 
0.3%
Uppercase Letter 8
 
0.3%
Dash Punctuation 7
 
0.2%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
118
 
4.9%
118
 
4.9%
71
 
3.0%
59
 
2.5%
56
 
2.3%
56
 
2.3%
55
 
2.3%
53
 
2.2%
52
 
2.2%
51
 
2.1%
Other values (130) 1704
71.2%
Decimal Number
ValueCountFrequency (%)
7 4
50.0%
2 1
 
12.5%
1 1
 
12.5%
0 1
 
12.5%
3 1
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 29
61.7%
· 13
27.7%
% 4
 
8.5%
/ 1
 
2.1%
Lowercase Letter
ValueCountFrequency (%)
e 4
33.3%
y 4
33.3%
p 4
33.3%
Uppercase Letter
ValueCountFrequency (%)
I 4
50.0%
T 4
50.0%
Space Separator
ValueCountFrequency (%)
623
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2393
76.2%
Common 728
 
23.2%
Latin 20
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
118
 
4.9%
118
 
4.9%
71
 
3.0%
59
 
2.5%
56
 
2.3%
56
 
2.3%
55
 
2.3%
53
 
2.2%
52
 
2.2%
51
 
2.1%
Other values (130) 1704
71.2%
Common
ValueCountFrequency (%)
623
85.6%
, 29
 
4.0%
) 21
 
2.9%
( 21
 
2.9%
· 13
 
1.8%
- 7
 
1.0%
% 4
 
0.5%
7 4
 
0.5%
+ 1
 
0.1%
/ 1
 
0.1%
Other values (4) 4
 
0.5%
Latin
ValueCountFrequency (%)
I 4
20.0%
e 4
20.0%
T 4
20.0%
y 4
20.0%
p 4
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2393
76.2%
ASCII 735
 
23.4%
None 13
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
623
84.8%
, 29
 
3.9%
) 21
 
2.9%
( 21
 
2.9%
- 7
 
1.0%
% 4
 
0.5%
7 4
 
0.5%
I 4
 
0.5%
e 4
 
0.5%
T 4
 
0.5%
Other values (8) 14
 
1.9%
Hangul
ValueCountFrequency (%)
118
 
4.9%
118
 
4.9%
71
 
3.0%
59
 
2.5%
56
 
2.3%
56
 
2.3%
55
 
2.3%
53
 
2.2%
52
 
2.2%
51
 
2.1%
Other values (130) 1704
71.2%
None
ValueCountFrequency (%)
· 13
100.0%
Distinct99
Distinct (%)46.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T07:10:19.531161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length15
Mean length7.0604651
Min length2

Characters and Unicode

Total characters1518
Distinct characters168
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)27.4%

Sample

1st row생산제품
2nd row생산량
3rd row단위
4th row매출액
5th row단위
ValueCountFrequency (%)
34
 
8.3%
현황 21
 
5.1%
투자 19
 
4.6%
모니터링 12
 
2.9%
기술도입 10
 
2.4%
대응 10
 
2.4%
기술 9
 
2.2%
도입 9
 
2.2%
용수 8
 
2.0%
조직별 8
 
2.0%
Other values (112) 269
65.8%
2023-12-13T07:10:19.952738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
194
 
12.8%
39
 
2.6%
34
 
2.2%
33
 
2.2%
30
 
2.0%
29
 
1.9%
29
 
1.9%
29
 
1.9%
28
 
1.8%
27
 
1.8%
Other values (158) 1046
68.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1231
81.1%
Space Separator 194
 
12.8%
Uppercase Letter 36
 
2.4%
Close Punctuation 20
 
1.3%
Open Punctuation 20
 
1.3%
Other Punctuation 6
 
0.4%
Dash Punctuation 5
 
0.3%
Decimal Number 4
 
0.3%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
3.2%
34
 
2.8%
33
 
2.7%
30
 
2.4%
29
 
2.4%
29
 
2.4%
29
 
2.4%
28
 
2.3%
27
 
2.2%
26
 
2.1%
Other values (138) 927
75.3%
Uppercase Letter
ValueCountFrequency (%)
B 5
13.9%
P 5
13.9%
G 4
11.1%
L 4
11.1%
O 4
11.1%
S 3
8.3%
N 3
8.3%
D 2
 
5.6%
T 2
 
5.6%
C 2
 
5.6%
Other values (2) 2
 
5.6%
Other Punctuation
ValueCountFrequency (%)
. 4
66.7%
/ 2
33.3%
Space Separator
ValueCountFrequency (%)
194
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Decimal Number
ValueCountFrequency (%)
3 4
100.0%
Lowercase Letter
ValueCountFrequency (%)
x 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1231
81.1%
Common 249
 
16.4%
Latin 38
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
3.2%
34
 
2.8%
33
 
2.7%
30
 
2.4%
29
 
2.4%
29
 
2.4%
29
 
2.4%
28
 
2.3%
27
 
2.2%
26
 
2.1%
Other values (138) 927
75.3%
Latin
ValueCountFrequency (%)
B 5
13.2%
P 5
13.2%
G 4
10.5%
L 4
10.5%
O 4
10.5%
S 3
7.9%
N 3
7.9%
x 2
 
5.3%
D 2
 
5.3%
T 2
 
5.3%
Other values (3) 4
10.5%
Common
ValueCountFrequency (%)
194
77.9%
) 20
 
8.0%
( 20
 
8.0%
- 5
 
2.0%
. 4
 
1.6%
3 4
 
1.6%
/ 2
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1231
81.1%
ASCII 287
 
18.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
194
67.6%
) 20
 
7.0%
( 20
 
7.0%
B 5
 
1.7%
- 5
 
1.7%
P 5
 
1.7%
G 4
 
1.4%
L 4
 
1.4%
O 4
 
1.4%
. 4
 
1.4%
Other values (10) 22
 
7.7%
Hangul
ValueCountFrequency (%)
39
 
3.2%
34
 
2.8%
33
 
2.7%
30
 
2.4%
29
 
2.4%
29
 
2.4%
29
 
2.4%
28
 
2.3%
27
 
2.2%
26
 
2.1%
Other values (138) 927
75.3%
Distinct122
Distinct (%)56.7%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T07:10:20.234654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length16
Mean length5.2325581
Min length2

Characters and Unicode

Total characters1125
Distinct characters163
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)41.9%

Sample

1st row생산제품
2nd row생산량
3rd row단위
4th row매출액
5th row단위
ValueCountFrequency (%)
내용 17
 
6.1%
모니터링 12
 
4.3%
투자비 8
 
2.9%
효과(절감량 7
 
2.5%
방법 7
 
2.5%
사업기간 6
 
2.2%
기간 6
 
2.2%
총량 6
 
2.2%
종류 6
 
2.2%
조치사항 5
 
1.8%
Other values (131) 198
71.2%
2023-12-13T07:10:20.689489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
63
 
5.6%
56
 
5.0%
41
 
3.6%
( 31
 
2.8%
) 31
 
2.8%
28
 
2.5%
28
 
2.5%
25
 
2.2%
20
 
1.8%
19
 
1.7%
Other values (153) 783
69.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 949
84.4%
Space Separator 63
 
5.6%
Uppercase Letter 36
 
3.2%
Open Punctuation 31
 
2.8%
Close Punctuation 31
 
2.8%
Other Punctuation 8
 
0.7%
Dash Punctuation 5
 
0.4%
Lowercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
56
 
5.9%
41
 
4.3%
28
 
3.0%
28
 
3.0%
25
 
2.6%
20
 
2.1%
19
 
2.0%
18
 
1.9%
18
 
1.9%
17
 
1.8%
Other values (134) 679
71.5%
Uppercase Letter
ValueCountFrequency (%)
B 5
13.9%
P 5
13.9%
L 4
11.1%
G 4
11.1%
O 4
11.1%
S 3
8.3%
N 3
8.3%
D 2
 
5.6%
T 2
 
5.6%
C 2
 
5.6%
Other values (2) 2
 
5.6%
Other Punctuation
ValueCountFrequency (%)
/ 4
50.0%
. 4
50.0%
Space Separator
ValueCountFrequency (%)
63
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Lowercase Letter
ValueCountFrequency (%)
x 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 949
84.4%
Common 138
 
12.3%
Latin 38
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
56
 
5.9%
41
 
4.3%
28
 
3.0%
28
 
3.0%
25
 
2.6%
20
 
2.1%
19
 
2.0%
18
 
1.9%
18
 
1.9%
17
 
1.8%
Other values (134) 679
71.5%
Latin
ValueCountFrequency (%)
B 5
13.2%
P 5
13.2%
L 4
10.5%
G 4
10.5%
O 4
10.5%
S 3
7.9%
N 3
7.9%
x 2
 
5.3%
D 2
 
5.3%
T 2
 
5.3%
Other values (3) 4
10.5%
Common
ValueCountFrequency (%)
63
45.7%
( 31
22.5%
) 31
22.5%
- 5
 
3.6%
/ 4
 
2.9%
. 4
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 949
84.4%
ASCII 176
 
15.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
63
35.8%
( 31
17.6%
) 31
17.6%
- 5
 
2.8%
B 5
 
2.8%
P 5
 
2.8%
/ 4
 
2.3%
L 4
 
2.3%
G 4
 
2.3%
O 4
 
2.3%
Other values (9) 20
 
11.4%
Hangul
ValueCountFrequency (%)
56
 
5.9%
41
 
4.3%
28
 
3.0%
28
 
3.0%
25
 
2.6%
20
 
2.1%
19
 
2.0%
18
 
1.9%
18
 
1.9%
17
 
1.8%
Other values (134) 679
71.5%

Interactions

2023-12-13T07:10:18.206319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:10:20.798496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호공개항목항목구분세목(명)
번호1.0000.9870.9960.996
공개항목0.9871.0000.9980.997
항목구분0.9960.9981.0000.999
세목(명)0.9960.9970.9991.000
2023-12-13T07:10:20.893381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호공개항목
번호1.0000.881
공개항목0.8811.000

Missing values

2023-12-13T07:10:18.285830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:10:18.359179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호공개항목항목구분세목(명)세목리스트
01업종, 생산제품, 매출액, 생산량, 종업원수업종, 생산제품, 매출액, 생산량, 종업원수생산제품생산제품
12업종, 생산제품, 매출액, 생산량, 종업원수업종, 생산제품, 매출액, 생산량, 종업원수생산량생산량
23업종, 생산제품, 매출액, 생산량, 종업원수업종, 생산제품, 매출액, 생산량, 종업원수단위단위
34업종, 생산제품, 매출액, 생산량, 종업원수업종, 생산제품, 매출액, 생산량, 종업원수매출액매출액
45업종, 생산제품, 매출액, 생산량, 종업원수업종, 생산제품, 매출액, 생산량, 종업원수단위단위
56업종, 생산제품, 매출액, 생산량, 종업원수업종, 생산제품, 매출액, 생산량, 종업원수국내 종업원수국내 종업원수
67업종, 생산제품, 매출액, 생산량, 종업원수본사/사업장리스트본사사업장 현황 리스트본사사업장 현황 리스트
78녹색기업, 환경경영대상, 폐기물감축 자발적 협약, 녹색구매 자발적협약 등수상현황수상내용
89녹색기업, 환경경영대상, 폐기물감축 자발적 협약, 녹색구매 자발적협약 등수상현황수상주관
910녹색기업, 환경경영대상, 폐기물감축 자발적 협약, 녹색구매 자발적협약 등수상현황수상일시
번호공개항목항목구분세목(명)세목리스트
205206환경오염물질 · 제품 · 서비스와 관련된 환경법규 위반 현황환경관련법규 위반사례환경관련법규 위반사례행정처분사항
206207환경오염물질 · 제품 · 서비스와 관련된 환경법규 위반 현황환경관련법규 위반사례환경관련법규 위반사례조치사항
207208환경오염물질 · 제품 · 서비스와 관련된 환경법규 위반 현황환경관련법규 위반사례환경관련법규 위반사례처분일자
208209환경(지속가능) 보고서 발간 현황환경보고서 발간 현황첨부문서로 대체첨부문서로 대체
209210환경(지속가능) 보고서 발간 현황지속가능 보고서 발간 현황첨부문서로 대체첨부문서로 대체
210211이해관계자 환경정보 요청 대응현황이해관계자 환경정보 요청 대응현황대응 현황요청자
211212이해관계자 환경정보 요청 대응현황이해관계자 환경정보 요청 대응현황대응 현황요청일
212213이해관계자 환경정보 요청 대응현황이해관계자 환경정보 요청 대응현황대응 현황요청내용
213214이해관계자 환경정보 요청 대응현황이해관계자 환경정보 요청 대응현황대응 현황조치결과
214215이해관계자 환경정보 요청 대응현황이해관계자 환경정보 요청 대응현황대응 현황조치일