Overview

Dataset statistics

Number of variables: 5
Number of observations: 72
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.9 KiB
Average record size in memory: 41.8 B

Variable types

Text: 2
Categorical: 3

Dataset

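Description: Provides information on the calculation methods used in the national greenhouse gas inventory, including the emission calculation formulas for each emission activity (energy, industrial processes, agriculture, LULUCF, waste, etc.).
Author: 환경부 온실가스종합정보센터 (Greenhouse Gas Inventory and Research Center, Ministry of Environment)
URL: https://www.data.go.kr/data/15039852/fileData.do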

Alerts

산정식방법론 is highly imbalanced (61.9%) [Imbalance]
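The 61.9% figure is consistent with an imbalance score defined as one minus the normalised Shannon entropy of the class counts. The sketch below assumes that definition (the report itself does not state it) and applies it to the 산정식방법론 counts listed under Common Values (62, 6, 3, 1). The function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits of the
    class distribution and k is the number of classes (assumed definition)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Class counts of 산정식방법론 as reported in its Common Values table.
counts = [62, 6, 3, 1]
score = imbalance_score(counts)
print(f"{score:.1%}")  # prints 61.9%
```

A perfectly balanced column scores 0 under this definition, and a column dominated by one class approaches 1.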

Reproduction

Analysis started: 2023-12-12 10:31:43.803954
Analysis finished: 2023-12-12 10:31:44.469904
Duration: 0.67 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

산정식명 (formula name)
Text

Distinct: 70
Distinct (%): 97.2%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 20
Median length: 14
Mean length: 10.972222
Min length: 4

Characters and Unicode

Total characters: 790
Distinct characters: 160
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
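As an illustration of those character properties, the general category of each code point can be read with Python's standard `unicodedata` module. The sketch below counts categories for the first sample row of this variable; the `CATEGORY_NAMES` mapping and the `category_counts` helper are hypothetical names introduced here, and scripts and blocks are not covered because the standard library does not expose them.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Pc": "Connector Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# First sample row of the variable.
sample = ["에너지 N2O 산정식"]
result = category_counts(sample)
for name, n in result.most_common():
    print(name, n)
```

Hangul syllables fall under Other Letter, which is why that category dominates the tables for this column.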

Unique

Unique: 68
Unique (%): 94.4%
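Distinct counts the different values in a column, while Unique counts the values that occur exactly once. With 72 rows, 70 distinct values and 68 unique values, two values must each appear twice (68 + 2 × 2 = 72). A minimal sketch of the two counts on hypothetical data, with `distinct_and_unique` as an illustrative helper name:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

# Hypothetical column: "a" and "b" repeat, "c" and "d" occur once.
column = ["a", "a", "b", "b", "c", "d"]
print(distinct_and_unique(column))  # (4, 2)
```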

Sample

1st row: 에너지 N2O 산정식
2nd row: 천연가스 탈루 산정식
3rd row: 석유 탈루 산정식
4th row: 에너지 CH4 산정식
5th row: 에너지 CO2 산정식
Value | Count | Frequency (%)
산정식 | 37 | 20.8%
n2o | 12 | 6.7%
생산 | 10 | 5.6%
ch4 | 7 | 3.9%
농경지토양 | 5 | 2.8%
co2 | 4 | 2.2%
소각 | 4 | 2.2%
벼재배 | 3 | 1.7%
사용 | 3 | 1.7%
에너지 | 3 | 1.7%
Other values (75) | 90 | 50.6%
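The lowercase tokens in this table (`n2o`, `ch4`, `co2`) suggest that values are lowercased and split on whitespace before counting. The sketch below applies that assumed tokenisation to the five sample rows; it is not the profiler's actual tokenizer, and `word_counts` is a hypothetical helper.

```python
from collections import Counter

def word_counts(values):
    """Lowercase each value, split on whitespace, and count the tokens."""
    counts = Counter()
    for value in values:
        counts.update(value.lower().split())
    return counts

# The five sample rows shown for this variable.
sample_rows = [
    "에너지 N2O 산정식",
    "천연가스 탈루 산정식",
    "석유 탈루 산정식",
    "에너지 CH4 산정식",
    "에너지 CO2 산정식",
]
counts = word_counts(sample_rows)
print(counts.most_common(3))
```

On these five rows, 산정식 is already the most frequent token, in line with its position at the top of the full table.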

Most occurring characters

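Value | Count | Frequency (%)
(space) | 109 | 13.8%
[Hangul character lost] | 58 | 7.3%
[Hangul character lost] | 40 | 5.1%
[Hangul character lost] | 39 | 4.9%
[Hangul character lost] | 25 | 3.2%
O | 19 | 2.4%
2 | 19 | 2.4%
[Hangul character lost] | 16 | 2.0%
[Hangul character lost] | 15 | 1.9%
C | 14 | 1.8%
Other values (150) | 436 | 55.2%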

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 542 | 68.6%
Space Separator | 109 | 13.8%
Uppercase Letter | 69 | 8.7%
Lowercase Letter | 34 | 4.3%
Decimal Number | 28 | 3.5%
Dash Punctuation | 5 | 0.6%
Other Punctuation | 2 | 0.3%
Connector Punctuation | 1 | 0.1%

Most frequent character per category

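Other Letter
Value | Count | Frequency (%)
[Hangul character lost] | 58 | 10.7%
[Hangul character lost] | 40 | 7.4%
[Hangul character lost] | 39 | 7.2%
[Hangul character lost] | 25 | 4.6%
[Hangul character lost] | 16 | 3.0%
[Hangul character lost] | 15 | 2.8%
[Hangul character lost] | 11 | 2.0%
[Hangul character lost] | 10 | 1.8%
[Hangul character lost] | 10 | 1.8%
[Hangul character lost] | 9 | 1.7%
Other values (128) | 309 | 57.0%

Uppercase Letter
Value | Count | Frequency (%)
O | 19 | 27.5%
C | 14 | 20.3%
N | 14 | 20.3%
H | 10 | 14.5%
B | 4 | 5.8%
I | 3 | 4.3%
T | 2 | 2.9%
P | 1 | 1.4%
W | 1 | 1.4%
F | 1 | 1.4%

Lowercase Letter
Value | Count | Frequency (%)
n | 10 | 29.4%
o | 7 | 20.6%
i | 5 | 14.7%
g | 4 | 11.8%
e | 4 | 11.8%
c | 4 | 11.8%

Decimal Number
Value | Count | Frequency (%)
2 | 19 | 67.9%
4 | 9 | 32.1%

Space Separator
Value | Count | Frequency (%)
(space) | 109 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 5 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 2 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 1 | 100.0%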

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 542 | 68.6%
Common | 145 | 18.4%
Latin | 103 | 13.0%

Most frequent character per script

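Hangul
Value | Count | Frequency (%)
[Hangul character lost] | 58 | 10.7%
[Hangul character lost] | 40 | 7.4%
[Hangul character lost] | 39 | 7.2%
[Hangul character lost] | 25 | 4.6%
[Hangul character lost] | 16 | 3.0%
[Hangul character lost] | 15 | 2.8%
[Hangul character lost] | 11 | 2.0%
[Hangul character lost] | 10 | 1.8%
[Hangul character lost] | 10 | 1.8%
[Hangul character lost] | 9 | 1.7%
Other values (128) | 309 | 57.0%

Latin
Value | Count | Frequency (%)
O | 19 | 18.4%
C | 14 | 13.6%
N | 14 | 13.6%
n | 10 | 9.7%
H | 10 | 9.7%
o | 7 | 6.8%
i | 5 | 4.9%
B | 4 | 3.9%
g | 4 | 3.9%
e | 4 | 3.9%
Other values (6) | 12 | 11.7%

Common
Value | Count | Frequency (%)
(space) | 109 | 75.2%
2 | 19 | 13.1%
4 | 9 | 6.2%
- | 5 | 3.4%
/ | 2 | 1.4%
_ | 1 | 0.7%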

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 542 | 68.6%
ASCII | 248 | 31.4%

Most frequent character per block

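ASCII
Value | Count | Frequency (%)
(space) | 109 | 44.0%
O | 19 | 7.7%
2 | 19 | 7.7%
C | 14 | 5.6%
N | 14 | 5.6%
n | 10 | 4.0%
H | 10 | 4.0%
4 | 9 | 3.6%
o | 7 | 2.8%
- | 5 | 2.0%
Other values (12) | 32 | 12.9%

Hangul
Value | Count | Frequency (%)
[Hangul character lost] | 58 | 10.7%
[Hangul character lost] | 40 | 7.4%
[Hangul character lost] | 39 | 7.2%
[Hangul character lost] | 25 | 4.6%
[Hangul character lost] | 16 | 3.0%
[Hangul character lost] | 15 | 2.8%
[Hangul character lost] | 11 | 2.0%
[Hangul character lost] | 10 | 1.8%
[Hangul character lost] | 10 | 1.8%
[Hangul character lost] | 9 | 1.7%
Other values (128) | 309 | 57.0%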

분야별 (sector)
Categorical

Distinct: 7
Distinct (%): 9.7%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B
산업공정: 26
농업: 15
폐기물: 12
LULUCF: 9
에너지: 8
Other values (2): 2

Length

Max length: 6
Median length: 5
Mean length: 3.5694444
Min length: 2

Unique

Unique: 2
Unique (%): 2.8%

Sample

1st row: 에너지
2nd row: 에너지
3rd row: 에너지
4th row: 에너지
5th row: 에너지

Common Values

Value | Count | Frequency (%)
산업공정 | 26 | 36.1%
농업 | 15 | 20.8%
폐기물 | 12 | 16.7%
LULUCF | 9 | 12.5%
에너지 | 8 | 11.1%
에너지 | 1 | 1.4%
산업공정 | 1 | 1.4%
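에너지 and 산업공정 each appear twice in this table, which suggests that some values differ only by an invisible character. The mean length points the same way: 72 × 3.5694444 ≈ 257 characters in total, two more than the 255 expected from the visible labels. The sketch below assumes the extra character is a trailing space (the report does not show what it is) and uses hypothetical data that mirrors the counts above.

```python
from collections import Counter

def normalise(value):
    """Trim both ends and collapse internal runs of whitespace."""
    return " ".join(value.split())

# Hypothetical raw column mirroring the reported counts; the trailing
# space on two entries is an assumption, not something the report shows.
raw = (["산업공정"] * 26 + ["농업"] * 15 + ["폐기물"] * 12 + ["LULUCF"] * 9
       + ["에너지"] * 8 + ["에너지 "] + ["산업공정 "])

before = Counter(raw)
after = Counter(normalise(v) for v in raw)
print(len(before), len(after))
```

After normalisation the seven distinct values collapse to five categories, with 산업공정 at 27 and 에너지 at 9.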

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
산업공정 | 27 | 37.5%
농업 | 15 | 20.8%
폐기물 | 12 | 16.7%
lulucf | 9 | 12.5%
에너지 | 9 | 12.5%

배출가스 (emitted gas)
Categorical

Distinct: 6
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B
CO2: 25
N2O: 18
CH4: 17
HFCs: 5
PFCs: 5

Length

Max length: 4
Median length: 3
Mean length: 3.1388889
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: N2O
2nd row: CH4
3rd row: CH4
4th row: CH4
5th row: CO2

Common Values

Value | Count | Frequency (%)
CO2 | 25 | 34.7%
N2O | 18 | 25.0%
CH4 | 17 | 23.6%
HFCs | 5 | 6.9%
PFCs | 5 | 6.9%
SF6 | 2 | 2.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
co2 | 25 | 34.7%
n2o | 18 | 25.0%
ch4 | 17 | 23.6%
hfcs | 5 | 6.9%
pfcs | 5 | 6.9%
sf6 | 2 | 2.8%

산정식방법론 (calculation methodology)
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B
IPCC Default: 62
Reference Approach: 6
IPCC Tier 2: 3
IPCC Default: 1

Length

Max length: 18
Median length: 12
Mean length: 12.472222
Min length: 11

Unique

Unique: 1
Unique (%): 1.4%

Sample

1st row: IPCC Default
2nd row: IPCC Default
3rd row: IPCC Default
4th row: IPCC Default
5th row: IPCC Default

Common Values

Value | Count | Frequency (%)
IPCC Default | 62 | 86.1%
Reference Approach | 6 | 8.3%
IPCC Tier 2 | 3 | 4.2%
IPCC Default | 1 | 1.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
ipcc | 66 | 44.9%
default | 63 | 42.9%
reference | 6 | 4.1%
approach | 6 | 4.1%
tier | 3 | 2.0%
2 | 3 | 2.0%

산정식 (calculation formula)
Text

Distinct: 65
Distinct (%): 90.3%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 170
Median length: 44
Mean length: 30.708333
Min length: 4

Characters and Unicode

Total characters: 2211
Distinct characters: 185
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 61
Unique (%): 84.7%

Sample

1st row: 활동자료*순발열량/총발열량*N2O EF*산화계수/1000000*41.868
2nd row: 활동자료*NG단위환산계수*천연가스탈루배출계수
3rd row: 활동자료*석유탈루배출계수*0.158987
4th row: 활동자료*순발열량/총발열량*CH4 EF*산화계수/1000000*41.868
5th row: 활동자료*순발열량/총발열량*CO2 EF*산화계수/1000*41.868
Value | Count | Frequency (%)
배출계수*부생가스 | 7 | 6.0%
gwp+활동자료*부생가스 | 7 | 6.0%
c3f8 | 4 | 3.4%
c2f6 | 4 | 3.4%
cf4 | 4 | 3.4%
활동자료*연간배출률 | 4 | 3.4%
gwp)/1000 | 3 | 2.6%
활동자료*벼재배배출계수*물관리보정계수*볏짚시용보정계수*작기전물관리보정계수*벼재배일수/1000 | 3 | 2.6%
chf3 | 2 | 1.7%
활동자료 | 2 | 1.7%
Other values (74) | 77 | 65.8%

Most occurring characters

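Value | Count | Frequency (%)
0 | 195 | 8.8%
* | 164 | 7.4%
1 | 82 | 3.7%
/ | 74 | 3.3%
[Hangul character lost] | 74 | 3.3%
4 | 66 | 3.0%
[Hangul character lost] | 63 | 2.8%
[Hangul character lost] | 60 | 2.7%
[Hangul character lost] | 56 | 2.5%
[Hangul character lost] | 56 | 2.5%
Other values (175) | 1321 | 59.7%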

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1161 | 52.5%
Decimal Number | 429 | 19.4%
Other Punctuation | 244 | 11.0%
Uppercase Letter | 177 | 8.0%
Space Separator | 46 | 2.1%
Connector Punctuation | 41 | 1.9%
Lowercase Letter | 33 | 1.5%
Dash Punctuation | 23 | 1.0%
Close Punctuation | 22 | 1.0%
Open Punctuation | 21 | 0.9%

Most frequent character per category

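Other Letter
Value | Count | Frequency (%)
[Hangul character lost] | 74 | 6.4%
[Hangul character lost] | 63 | 5.4%
[Hangul character lost] | 60 | 5.2%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 53 | 4.6%
[Hangul character lost] | 47 | 4.0%
[Hangul character lost] | 32 | 2.8%
Other values (125) | 608 | 52.4%

Uppercase Letter
Value | Count | Frequency (%)
C | 36 | 20.3%
F | 30 | 16.9%
O | 16 | 9.0%
H | 15 | 8.5%
P | 13 | 7.3%
G | 13 | 7.3%
W | 13 | 7.3%
N | 12 | 6.8%
E | 9 | 5.1%
S | 4 | 2.3%
Other values (7) | 16 | 9.0%

Lowercase Letter
Value | Count | Frequency (%)
c | 6 | 18.2%
i | 4 | 12.1%
f | 4 | 12.1%
e | 3 | 9.1%
u | 3 | 9.1%
r | 3 | 9.1%
s | 3 | 9.1%
m | 2 | 6.1%
d | 2 | 6.1%
a | 1 | 3.0%
Other values (2) | 2 | 6.1%

Decimal Number
Value | Count | Frequency (%)
0 | 195 | 45.5%
1 | 82 | 19.1%
4 | 66 | 15.4%
2 | 43 | 10.0%
8 | 22 | 5.1%
6 | 10 | 2.3%
3 | 7 | 1.6%
5 | 2 | 0.5%
9 | 1 | 0.2%
7 | 1 | 0.2%

Other Punctuation
Value | Count | Frequency (%)
* | 164 | 67.2%
/ | 74 | 30.3%
. | 5 | 2.0%
: | 1 | 0.4%

Math Symbol
Value | Count | Frequency (%)
+ | 12 | 85.7%
~ | 2 | 14.3%

Space Separator
Value | Count | Frequency (%)
(space) | 46 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 41 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 23 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 22 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 21 | 100.0%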

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1161 | 52.5%
Common | 840 | 38.0%
Latin | 210 | 9.5%

Most frequent character per script

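Hangul
Value | Count | Frequency (%)
[Hangul character lost] | 74 | 6.4%
[Hangul character lost] | 63 | 5.4%
[Hangul character lost] | 60 | 5.2%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 53 | 4.6%
[Hangul character lost] | 47 | 4.0%
[Hangul character lost] | 32 | 2.8%
Other values (125) | 608 | 52.4%

Latin
Value | Count | Frequency (%)
C | 36 | 17.1%
F | 30 | 14.3%
O | 16 | 7.6%
H | 15 | 7.1%
P | 13 | 6.2%
G | 13 | 6.2%
W | 13 | 6.2%
N | 12 | 5.7%
E | 9 | 4.3%
c | 6 | 2.9%
Other values (19) | 47 | 22.4%

Common
Value | Count | Frequency (%)
0 | 195 | 23.2%
* | 164 | 19.5%
1 | 82 | 9.8%
/ | 74 | 8.8%
4 | 66 | 7.9%
(space) | 46 | 5.5%
2 | 43 | 5.1%
_ | 41 | 4.9%
- | 23 | 2.7%
8 | 22 | 2.6%
Other values (11) | 84 | 10.0%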

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1161 | 52.5%
ASCII | 1050 | 47.5%

Most frequent character per block

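ASCII
Value | Count | Frequency (%)
0 | 195 | 18.6%
* | 164 | 15.6%
1 | 82 | 7.8%
/ | 74 | 7.0%
4 | 66 | 6.3%
(space) | 46 | 4.4%
2 | 43 | 4.1%
_ | 41 | 3.9%
C | 36 | 3.4%
F | 30 | 2.9%
Other values (40) | 273 | 26.0%

Hangul
Value | Count | Frequency (%)
[Hangul character lost] | 74 | 6.4%
[Hangul character lost] | 63 | 5.4%
[Hangul character lost] | 60 | 5.2%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 56 | 4.8%
[Hangul character lost] | 53 | 4.6%
[Hangul character lost] | 47 | 4.0%
[Hangul character lost] | 32 | 2.8%
Other values (125) | 608 | 52.4%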

Correlations

Variable | 산정식명 | 분야별 | 배출가스 | 산정식방법론 | 산정식
산정식명 | 1.000 | 1.000 | 0.977 | 1.000 | 0.978
분야별 | 1.000 | 1.000 | 0.470 | 0.527 | 0.993
배출가스 | 0.977 | 0.470 | 1.000 | 0.000 | 0.805
산정식방법론 | 1.000 | 0.527 | 0.000 | 1.000 | 0.000
산정식 | 0.978 | 0.993 | 0.805 | 0.000 | 1.000

Variable | 산정식방법론 | 분야별 | 배출가스
산정식방법론 | 1.000 | 0.381 | 0.000
분야별 | 0.381 | 1.000 | 0.299
배출가스 | 0.000 | 0.299 | 1.000

Variable | 분야별 | 배출가스 | 산정식방법론
분야별 | 1.000 | 0.299 | 0.381
배출가스 | 0.299 | 1.000 | 0.000
산정식방법론 | 0.381 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 산정식명 | 분야별 | 배출가스 | 산정식방법론 | 산정식
0 | 에너지 N2O 산정식 | 에너지 | N2O | IPCC Default | 활동자료*순발열량/총발열량*N2O EF*산화계수/1000000*41.868
1 | 천연가스 탈루 산정식 | 에너지 | CH4 | IPCC Default | 활동자료*NG단위환산계수*천연가스탈루배출계수
2 | 석유 탈루 산정식 | 에너지 | CH4 | IPCC Default | 활동자료*석유탈루배출계수*0.158987
3 | 에너지 CH4 산정식 | 에너지 | CH4 | IPCC Default | 활동자료*순발열량/총발열량*CH4 EF*산화계수/1000000*41.868
4 | 에너지 CO2 산정식 | 에너지 | CO2 | IPCC Default | 활동자료*순발열량/총발열량*CO2 EF*산화계수/1000*41.868
5 | 탈루 촉매재생 산정식 | 에너지 | CO2 | IPCC Default | 활동자료
6 | 민간항공기 CO2 산정식 | 에너지 | CO2 | IPCC Tier 2 | 항공_LTO_CO2배출량+항공_cruise_CO2배출량
7 | 민간항공기 CH4 산정식 | 에너지 | CH4 | IPCC Tier 2 | 항공_LTO_CH4배출량+항공_cruise_CH4배출량
8 | 민간항공기 N2O 산정식 | 에너지 | N2O | IPCC Tier 2 | 항공_LTO_N2O배출량+항공_cruise_N2O배출량
9 | 시멘트 생산 | 산업공정 | CO2 | Reference Approach | 활동자료*클링커배출계수*CKD보정계수/1000

Last rows

Index | 산정식명 | 분야별 | 배출가스 | 산정식방법론 | 산정식
62 | 소각 non-BIogenic CO2 | 폐기물 | CO2 | IPCC Default | 활동자료*소각_dm*소각_cf*소각_fcf*소각_OF*년일수*44/12/1000
63 | 소각 non-BIogenic CH4 | 폐기물 | CH4 | IPCC Default | 활동자료*CH4배출계수*년일수/1000000000
64 | 소각 non-BIogenic N2O | 폐기물 | N2O | IPCC Default | 활동자료*N20배출계수*년일수/1000000000
65 | 소각 Biogenic CO2 | 폐기물 | CO2 | IPCC Default | 활동자료*소각_dm*소각_cf*소각_OF*소각_바이오함량*년일수*44/12/1000
66 | 고형폐기물 생물학적처리 | 폐기물 | N2O | IPCC Default | 활동자료*N20배출계수/1000
67 | 폐수처리 CH4 산정식 | 폐기물 | CH4 | IPCC Default | BOD부하량/1000
68 | 공공하수처리 CH4산정식 | 폐기물 | CH4 | IPCC Default | (활동자료*CH4배출계수*인구율)-공공하수회수량
69 | 미처리/미차집 CH4산정식 | 폐기물 | CH4 | IPCC Default | 활동자료*CH4배출계수
70 | 고도처리 N2O 산정식 | 폐기물 | N2O | IPCC Default | (활동자료*N20배출계수*단백질비율/1000000)
71 | 분뇨 N2O 산정식 | 폐기물 | N2O | IPCC Default | (분뇨질소부하량)*N20배출계수*44/28
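
The 산정식 column stores each formula as plain arithmetic over named factors. The sketch below evaluates two of the formulas from the sample rows (index 0 and index 71). The function names, English parameter names, and all input values are hypothetical; only the structure of each expression and its constants (41.868, 44/28) come from the dataset.

```python
def energy_n2o(activity, net_cv, gross_cv, n2o_ef, oxidation_factor):
    """Row 0: 활동자료*순발열량/총발열량*N2O EF*산화계수/1000000*41.868"""
    return (activity * net_cv / gross_cv * n2o_ef * oxidation_factor
            / 1_000_000 * 41.868)

def nightsoil_n2o(nitrogen_load, n2o_ef):
    """Row 71: (분뇨질소부하량)*N20배출계수*44/28"""
    # 44/28 is the molar-mass ratio that converts N2O-N to N2O.
    return nitrogen_load * n2o_ef * 44 / 28

# Hypothetical inputs chosen only to make the arithmetic easy to follow.
e0 = energy_n2o(activity=1_000_000, net_cv=1, gross_cv=1,
                n2o_ef=1, oxidation_factor=1)
e71 = nightsoil_n2o(nitrogen_load=28, n2o_ef=1.0)
print(e0, e71)
```

Keeping one function per formula makes the units of each factor explicit at the call site, which the flat string representation in the dataset does not.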