Overview

Dataset statistics

Number of variables17
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.6 KiB
Average record size in memory139.3 B

Variable types

Numeric1
Categorical12
Text4

Alerts

대상연도 has constant value ""Constant
온실가스 배출량 단위 has constant value ""Constant
에너지 사용량 단위 has constant value ""Constant
온실가스 배출량 원단위 단위 is highly overall correlated with 지정구분 and 4 other fieldsHigh correlation
에너지 사용량 원단위 값 is highly overall correlated with 지정구분 and 4 other fieldsHigh correlation
매출액(원) is highly overall correlated with 지정구분 and 4 other fieldsHigh correlation
지정구분 is highly overall correlated with 온실가스 배출량 원단위 값 and 4 other fieldsHigh correlation
에너지 사용량 원단위 단위 is highly overall correlated with 지정구분 and 4 other fieldsHigh correlation
온실가스 배출량 원단위 값 is highly overall correlated with 지정구분 and 4 other fieldsHigh correlation
소관부처 is highly overall correlated with 지정업종High correlation
지정업종 is highly overall correlated with 소관부처High correlation
온실가스 배출량 원단위 값 is highly imbalanced (66.3%)Imbalance
에너지 사용량 원단위 값 is highly imbalanced (66.2%)Imbalance
매출액(원) is highly imbalanced (66.3%)Imbalance
연번 has unique valuesUnique
법인명 has unique valuesUnique
온실가스 배출량 has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:40:53.145315
Analysis finished2023-12-10 10:40:57.427877
Duration4.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean51.55
Minimum1
Maximum102
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:40:57.650162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q126.75
median51.5
Q377.25
95-th percentile97.05
Maximum102
Range101
Interquartile range (IQR)50.5

Descriptive statistics

Standard deviation29.654705
Coefficient of variation (CV)0.575261
Kurtosis-1.1918638
Mean51.55
Median Absolute Deviation (MAD)25.5
Skewness-0.0011413959
Sum5155
Variance879.40152
MonotonicityStrictly increasing
2023-12-10T19:40:57.897307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
66 1
 
1.0%
77 1
 
1.0%
76 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
102 1
1.0%
101 1
1.0%
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%

소관부처
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
산업통상자원부
37 
환경부
32 
국토교통부
21 
농림축산식품부
해양수산부
 
2

Length

Max length7
Median length5
Mean length5.26
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row환경부
2nd row국토교통부
3rd row환경부
4th row환경부
5th row농림축산식품부

Common Values

ValueCountFrequency (%)
산업통상자원부 37
37.0%
환경부 32
32.0%
국토교통부 21
21.0%
농림축산식품부 8
 
8.0%
해양수산부 2
 
2.0%

Length

2023-12-10T19:40:58.146642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:40:58.382746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
산업통상자원부 37
37.0%
환경부 32
32.0%
국토교통부 21
21.0%
농림축산식품부 8
 
8.0%
해양수산부 2
 
2.0%

법인명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:40:58.985159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length8.05
Min length5

Characters and Unicode

Total characters805
Distinct characters185
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row(유)에스케이씨에보닉페록사이드코리아
2nd row(유)호남고속
3rd row(주)HJ매그놀리아용평호텔앤리조트
4th row(주)HM금속
5th row(주)MH에탄올
ValueCountFrequency (%)
주)동서기공 2
 
1.7%
주)동남 2
 
1.7%
주)샤니 2
 
1.7%
주)성호금속 2
 
1.7%
주)서진캠 2
 
1.7%
주)미래엔인천에너지 1
 
0.8%
주)빙그레 1
 
0.8%
주)비엠금속 1
 
0.8%
제2공장 1
 
0.8%
주)비에이치 1
 
0.8%
Other values (103) 103
87.3%
2023-12-10T19:41:00.037455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 100
 
12.4%
100
 
12.4%
) 100
 
12.4%
19
 
2.4%
18
 
2.2%
17
 
2.1%
15
 
1.9%
14
 
1.7%
12
 
1.5%
12
 
1.5%
Other values (175) 398
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 567
70.4%
Open Punctuation 100
 
12.4%
Close Punctuation 100
 
12.4%
Space Separator 18
 
2.2%
Uppercase Letter 17
 
2.1%
Decimal Number 2
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
100
 
17.6%
19
 
3.4%
17
 
3.0%
15
 
2.6%
14
 
2.5%
12
 
2.1%
12
 
2.1%
11
 
1.9%
11
 
1.9%
10
 
1.8%
Other values (160) 346
61.0%
Uppercase Letter
ValueCountFrequency (%)
M 4
23.5%
H 3
17.6%
C 2
11.8%
S 2
11.8%
A 1
 
5.9%
P 1
 
5.9%
I 1
 
5.9%
J 1
 
5.9%
B 1
 
5.9%
F 1
 
5.9%
Open Punctuation
ValueCountFrequency (%)
( 100
100.0%
Close Punctuation
ValueCountFrequency (%)
) 100
100.0%
Space Separator
ValueCountFrequency (%)
18
100.0%
Decimal Number
ValueCountFrequency (%)
2 2
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 567
70.4%
Common 221
 
27.5%
Latin 17
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
100
 
17.6%
19
 
3.4%
17
 
3.0%
15
 
2.6%
14
 
2.5%
12
 
2.1%
12
 
2.1%
11
 
1.9%
11
 
1.9%
10
 
1.8%
Other values (160) 346
61.0%
Latin
ValueCountFrequency (%)
M 4
23.5%
H 3
17.6%
C 2
11.8%
S 2
11.8%
A 1
 
5.9%
P 1
 
5.9%
I 1
 
5.9%
J 1
 
5.9%
B 1
 
5.9%
F 1
 
5.9%
Common
ValueCountFrequency (%)
( 100
45.2%
) 100
45.2%
18
 
8.1%
2 2
 
0.9%
& 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 567
70.4%
ASCII 238
29.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 100
42.0%
) 100
42.0%
18
 
7.6%
M 4
 
1.7%
H 3
 
1.3%
2 2
 
0.8%
C 2
 
0.8%
S 2
 
0.8%
A 1
 
0.4%
P 1
 
0.4%
Other values (5) 5
 
2.1%
Hangul
ValueCountFrequency (%)
100
 
17.6%
19
 
3.4%
17
 
3.0%
15
 
2.6%
14
 
2.5%
12
 
2.1%
12
 
2.1%
11
 
1.9%
11
 
1.9%
10
 
1.8%
Other values (160) 346
61.0%

주소
Text

Distinct98
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:41:00.685308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length30
Mean length23.85
Min length15

Characters and Unicode

Total characters2385
Distinct characters221
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique96 ?
Unique (%)96.0%

Sample

1st row울산광역시 남구 상개로 99(상개동)
2nd row전라북도 전주시 덕진구 신복천변로 28 (팔복동1가)
3rd row강원도 평창군 대관령면 올림픽로 715 용평리조트
4th row경상남도 함안군 군북면 함안산단로 170
5th row경상남도 창원시 마산회원구 내서읍 광려천남로 25
ValueCountFrequency (%)
서울특별시 20
 
3.9%
경기도 17
 
3.3%
경상북도 11
 
2.1%
충청남도 9
 
1.7%
경상남도 7
 
1.4%
전라북도 7
 
1.4%
인천광역시 6
 
1.2%
포항시 6
 
1.2%
충청북도 5
 
1.0%
강원도 5
 
1.0%
Other values (349) 424
82.0%
2023-12-10T19:41:01.472174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
433
 
18.2%
90
 
3.8%
84
 
3.5%
66
 
2.8%
1 63
 
2.6%
56
 
2.3%
2 51
 
2.1%
49
 
2.1%
48
 
2.0%
4 44
 
1.8%
Other values (211) 1401
58.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1498
62.8%
Space Separator 433
 
18.2%
Decimal Number 360
 
15.1%
Open Punctuation 36
 
1.5%
Close Punctuation 36
 
1.5%
Dash Punctuation 12
 
0.5%
Other Punctuation 6
 
0.3%
Uppercase Letter 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
90
 
6.0%
84
 
5.6%
66
 
4.4%
56
 
3.7%
49
 
3.3%
48
 
3.2%
39
 
2.6%
39
 
2.6%
37
 
2.5%
36
 
2.4%
Other values (192) 954
63.7%
Decimal Number
ValueCountFrequency (%)
1 63
17.5%
2 51
14.2%
4 44
12.2%
5 40
11.1%
3 37
10.3%
7 34
9.4%
6 31
8.6%
8 21
 
5.8%
0 20
 
5.6%
9 19
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
A 1
25.0%
S 1
25.0%
H 1
25.0%
E 1
25.0%
Space Separator
ValueCountFrequency (%)
433
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1498
62.8%
Common 883
37.0%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
90
 
6.0%
84
 
5.6%
66
 
4.4%
56
 
3.7%
49
 
3.3%
48
 
3.2%
39
 
2.6%
39
 
2.6%
37
 
2.5%
36
 
2.4%
Other values (192) 954
63.7%
Common
ValueCountFrequency (%)
433
49.0%
1 63
 
7.1%
2 51
 
5.8%
4 44
 
5.0%
5 40
 
4.5%
3 37
 
4.2%
( 36
 
4.1%
) 36
 
4.1%
7 34
 
3.9%
6 31
 
3.5%
Other values (5) 78
 
8.8%
Latin
ValueCountFrequency (%)
A 1
25.0%
S 1
25.0%
H 1
25.0%
E 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1498
62.8%
ASCII 887
37.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
433
48.8%
1 63
 
7.1%
2 51
 
5.7%
4 44
 
5.0%
5 40
 
4.5%
3 37
 
4.2%
( 36
 
4.1%
) 36
 
4.1%
7 34
 
3.8%
6 31
 
3.5%
Other values (9) 82
 
9.2%
Hangul
ValueCountFrequency (%)
90
 
6.0%
84
 
5.6%
66
 
4.4%
56
 
3.7%
49
 
3.3%
48
 
3.2%
39
 
2.6%
39
 
2.6%
37
 
2.5%
36
 
2.4%
Other values (192) 954
63.7%

대상연도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2019
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019 100
100.0%

Length

2023-12-10T19:41:01.732922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:01.906670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 100
100.0%

지정구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
사업장
82 
업체
18 

Length

Max length3
Median length3
Mean length2.82
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장
2nd row사업장
3rd row사업장
4th row사업장
5th row사업장

Common Values

ValueCountFrequency (%)
사업장 82
82.0%
업체 18
 
18.0%

Length

2023-12-10T19:41:02.086482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:02.282255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장 82
82.0%
업체 18
 
18.0%

지정업종
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)27.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
교통(여객)
12 
철강
식료품 제조업
비철금속
자동차
Other values (22)
59 

Length

Max length14
Median length7
Mean length4.24
Min length2

Unique

Unique8 ?
Unique (%)8.0%

Sample

1st row석유화학
2nd row교통(여객)
3rd row건물
4th row산업
5th row음료제조업

Common Values

ValueCountFrequency (%)
교통(여객) 12
 
12.0%
철강 9
 
9.0%
식료품 제조업 7
 
7.0%
비철금속 7
 
7.0%
자동차 6
 
6.0%
석유화학 6
 
6.0%
건물 6
 
6.0%
섬유 5
 
5.0%
시멘트 4
 
4.0%
반도체.디스플레이.전기전자 4
 
4.0%
Other values (17) 34
34.0%

Length

2023-12-10T19:41:02.505092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교통(여객 12
 
10.6%
철강 9
 
8.0%
식료품 7
 
6.2%
제조업 7
 
6.2%
비철금속 7
 
6.2%
자동차 6
 
5.3%
석유화학 6
 
5.3%
건물 6
 
5.3%
유리 5
 
4.4%
섬유 5
 
4.4%
Other values (18) 43
38.1%
Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:41:03.047303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length6
Mean length6.11
Min length5

Characters and Unicode

Total characters611
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row53,909
2nd row24,809
3rd row32,955
4th row37,628
5th row10,062
ValueCountFrequency (%)
53,909 1
 
1.0%
42,110 1
 
1.0%
17,323 1
 
1.0%
58,209 1
 
1.0%
18,293 1
 
1.0%
15,845 1
 
1.0%
12,872 1
 
1.0%
13,408 1
 
1.0%
85,293 1
 
1.0%
18,387 1
 
1.0%
Other values (90) 90
90.0%
2023-12-10T19:41:03.858633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 101
16.5%
1 83
13.6%
2 69
11.3%
3 62
10.1%
7 50
8.2%
9 45
7.4%
6 44
7.2%
4 44
7.2%
8 43
7.0%
5 39
 
6.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 510
83.5%
Other Punctuation 101
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 83
16.3%
2 69
13.5%
3 62
12.2%
7 50
9.8%
9 45
8.8%
6 44
8.6%
4 44
8.6%
8 43
8.4%
5 39
7.6%
0 31
 
6.1%
Other Punctuation
ValueCountFrequency (%)
, 101
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 611
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
, 101
16.5%
1 83
13.6%
2 69
11.3%
3 62
10.1%
7 50
8.2%
9 45
7.4%
6 44
7.2%
4 44
7.2%
8 43
7.0%
5 39
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 611
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 101
16.5%
1 83
13.6%
2 69
11.3%
3 62
10.1%
7 50
8.2%
9 45
7.4%
6 44
7.2%
4 44
7.2%
8 43
7.0%
5 39
 
6.4%

온실가스 배출량 단위
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
tCO₂eq
100 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowtCO₂eq
2nd rowtCO₂eq
3rd rowtCO₂eq
4th rowtCO₂eq
5th rowtCO₂eq

Common Values

ValueCountFrequency (%)
tCO₂eq 100
100.0%

Length

2023-12-10T19:41:04.106657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:04.663187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
tco₂eq 100
100.0%

온실가스 배출량 원단위 값
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct19
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
82 
118.21
 
1
55.04
 
1
190.31
 
1
13.54
 
1
Other values (14)
14 

Length

Max length6
Median length1
Mean length1.66
Min length1

Unique

Unique18 ?
Unique (%)18.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 82
82.0%
118.21 1
 
1.0%
55.04 1
 
1.0%
190.31 1
 
1.0%
13.54 1
 
1.0%
0.52 1
 
1.0%
6.41 1
 
1.0%
0.82 1
 
1.0%
57.05 1
 
1.0%
7.26 1
 
1.0%
Other values (9) 9
 
9.0%

Length

2023-12-10T19:41:04.837076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
82
82.0%
48.27 1
 
1.0%
18.68 1
 
1.0%
6.79 1
 
1.0%
43.08 1
 
1.0%
3.12 1
 
1.0%
5.3 1
 
1.0%
4.29 1
 
1.0%
814.11 1
 
1.0%
7.26 1
 
1.0%
Other values (9) 9
 
9.0%

온실가스 배출량 원단위 단위
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
82 
tCO2eq/억원
18 

Length

Max length9
Median length1
Mean length2.44
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 82
82.0%
tCO2eq/억원 18
 
18.0%

Length

2023-12-10T19:41:05.117784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:05.323052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
82
82.0%
tco2eq/억원 18
 
18.0%
Distinct97
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:41:05.789435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.41
Min length3

Characters and Unicode

Total characters341
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)94.0%

Sample

1st row1,124
2nd row384
3rd row653
4th row654
5th row293
ValueCountFrequency (%)
322 2
 
2.0%
342 2
 
2.0%
406 2
 
2.0%
363 1
 
1.0%
1,124 1
 
1.0%
1,156 1
 
1.0%
335 1
 
1.0%
325 1
 
1.0%
259 1
 
1.0%
273 1
 
1.0%
Other values (87) 87
87.0%
2023-12-10T19:41:06.653355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 58
17.0%
5 42
12.3%
2 39
11.4%
6 38
11.1%
4 35
10.3%
1 33
9.7%
7 26
7.6%
, 20
 
5.9%
9 19
 
5.6%
8 16
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 321
94.1%
Other Punctuation 20
 
5.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 58
18.1%
5 42
13.1%
2 39
12.1%
6 38
11.8%
4 35
10.9%
1 33
10.3%
7 26
8.1%
9 19
 
5.9%
8 16
 
5.0%
0 15
 
4.7%
Other Punctuation
ValueCountFrequency (%)
, 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 341
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 58
17.0%
5 42
12.3%
2 39
11.4%
6 38
11.1%
4 35
10.3%
1 33
9.7%
7 26
7.6%
, 20
 
5.9%
9 19
 
5.6%
8 16
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 341
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 58
17.0%
5 42
12.3%
2 39
11.4%
6 38
11.1%
4 35
10.3%
1 33
9.7%
7 26
7.6%
, 20
 
5.9%
9 19
 
5.6%
8 16
 
4.7%

에너지 사용량 단위
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
TJ
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTJ
2nd rowTJ
3rd rowTJ
4th rowTJ
5th rowTJ

Common Values

ValueCountFrequency (%)
TJ 100
100.0%

Length

2023-12-10T19:41:06.997424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:07.170240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
tj 100
100.0%

에너지 사용량 원단위 값
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct18
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
82 
0.13
 
2
0.84
 
1
0.34
 
1
0.27
 
1
Other values (13)
13 

Length

Max length4
Median length1
Mean length1.52
Min length1

Unique

Unique16 ?
Unique (%)16.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 82
82.0%
0.13 2
 
2.0%
0.84 1
 
1.0%
0.34 1
 
1.0%
0.27 1
 
1.0%
0.01 1
 
1.0%
0.02 1
 
1.0%
0.88 1
 
1.0%
1.29 1
 
1.0%
0.1 1
 
1.0%
Other values (8) 8
 
8.0%

Length

2023-12-10T19:41:07.400927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
82
82.0%
0.13 2
 
2.0%
0.69 1
 
1.0%
0.32 1
 
1.0%
0.85 1
 
1.0%
0.06 1
 
1.0%
0.11 1
 
1.0%
0.09 1
 
1.0%
1.73 1
 
1.0%
0.1 1
 
1.0%
Other values (8) 8
 
8.0%

에너지 사용량 원단위 단위
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
82 
TJ/억원
18 

Length

Max length5
Median length1
Mean length1.72
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 82
82.0%
TJ/억원 18
 
18.0%

Length

2023-12-10T19:41:07.684028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:07.891094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
82
82.0%
tj/억원 18
 
18.0%
Distinct12
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
한국표준협회
37 
(주)한국경영인증원
17 
(재)한국품질재단
13 
이큐에이㈜
한국가스안전공사
Other values (7)
20 

Length

Max length14
Median length13
Mean length7.92
Min length5

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row(재)한국품질재단
2nd row(주)한국경영인증원
3rd row(재)한국품질재단
4th row한국표준협회
5th row(재)한국화학융합시험연구원

Common Values

ValueCountFrequency (%)
한국표준협회 37
37.0%
(주)한국경영인증원 17
17.0%
(재)한국품질재단 13
 
13.0%
이큐에이㈜ 8
 
8.0%
한국가스안전공사 5
 
5.0%
산림조합중앙회 5
 
5.0%
㈜한국품질보증원 4
 
4.0%
(주)비에스아이그룹코리아 4
 
4.0%
한국생산성본부인증원(주) 3
 
3.0%
대일이엔씨기술(주) 2
 
2.0%
Other values (2) 2
 
2.0%

Length

2023-12-10T19:41:08.110976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한국표준협회 37
37.0%
주)한국경영인증원 17
17.0%
재)한국품질재단 13
 
13.0%
이큐에이㈜ 8
 
8.0%
한국가스안전공사 5
 
5.0%
산림조합중앙회 5
 
5.0%
㈜한국품질보증원 4
 
4.0%
주)비에스아이그룹코리아 4
 
4.0%
한국생산성본부인증원(주 3
 
3.0%
대일이엔씨기술(주 2
 
2.0%
Other values (2) 2
 
2.0%

매출액(원)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct19
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
82 
391,970,769,000
 
1
302,392,320,220
 
1
118,015,120,445
 
1
253,552,382,000
 
1
Other values (14)
14 

Length

Max length18
Median length1
Mean length3.65
Min length1

Unique

Unique18 ?
Unique (%)18.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 82
82.0%
391,970,769,000 1
 
1.0%
302,392,320,220 1
 
1.0%
118,015,120,445 1
 
1.0%
253,552,382,000 1
 
1.0%
21,195,065,000,000 1
 
1.0%
1,300,794,480,252 1
 
1.0%
8,091,938,554,000 1
 
1.0%
283,636,549,385 1
 
1.0%
577,078,497,000 1
 
1.0%
Other values (9) 9
 
9.0%

Length

2023-12-10T19:41:08.398631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
82
82.0%
140,415,225,000 1
 
1.0%
715,060,913,000 1
 
1.0%
857,206,874,000 1
 
1.0%
40,172,952,000 1
 
1.0%
3,323,374,073,000 1
 
1.0%
1,588,036,127,000 1
 
1.0%
1,709,299,492,000 1
 
1.0%
31,080,717,000 1
 
1.0%
577,078,497,000 1
 
1.0%
Other values (9) 9
 
9.0%

Interactions

2023-12-10T19:40:56.467226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:41:08.596022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소관부처법인명주소지정구분지정업종온실가스 배출량온실가스 배출량 원단위 값온실가스 배출량 원단위 단위에너지 사용량에너지 사용량 원단위 값에너지 사용량 원단위 단위검증수행기관매출액(원)
연번1.0000.3521.0000.9760.0000.3801.0000.1390.0000.7790.0000.0000.3730.139
소관부처0.3521.0001.0001.0000.1330.9771.0000.2220.1330.9450.2980.1330.5500.222
법인명1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
주소0.9761.0001.0001.0001.0001.0001.0000.0001.0000.9920.0001.0001.0000.000
지정구분0.0000.1331.0001.0001.0000.0001.0001.0000.9990.4781.0000.9990.3081.000
지정업종0.3800.9771.0001.0000.0001.0001.0000.5020.0000.0000.6350.0000.7680.502
온실가스 배출량1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
온실가스 배출량 원단위 값0.1390.2221.0000.0001.0000.5021.0001.0001.0000.9391.0001.0000.2851.000
온실가스 배출량 원단위 단위0.0000.1331.0001.0000.9990.0001.0001.0001.0000.4781.0000.9990.3081.000
에너지 사용량0.7790.9451.0000.9920.4780.0001.0000.9390.4781.0000.9210.4780.9620.939
에너지 사용량 원단위 값0.0000.2981.0000.0001.0000.6351.0001.0001.0000.9211.0001.0000.3481.000
에너지 사용량 원단위 단위0.0000.1331.0001.0000.9990.0001.0001.0000.9990.4781.0001.0000.3081.000
검증수행기관0.3730.5501.0001.0000.3080.7681.0000.2850.3080.9620.3480.3081.0000.285
매출액(원)0.1390.2221.0000.0001.0000.5021.0001.0001.0000.9391.0001.0000.2851.000
2023-12-10T19:41:08.910354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
온실가스 배출량 원단위 단위에너지 사용량 원단위 값매출액(원)지정구분에너지 사용량 원단위 단위소관부처온실가스 배출량 원단위 값검증수행기관지정업종
온실가스 배출량 원단위 단위1.0000.9150.9090.9660.9660.1590.9090.2240.000
에너지 사용량 원단위 값0.9151.0000.9940.9150.9150.1390.9940.1120.198
매출액(원)0.9090.9941.0000.9090.9090.0941.0000.0900.140
지정구분0.9660.9150.9091.0000.9660.1590.9090.2240.000
에너지 사용량 원단위 단위0.9660.9150.9090.9661.0000.1590.9090.2240.000
소관부처0.1590.1390.0940.1590.1591.0000.0940.3290.796
온실가스 배출량 원단위 값0.9090.9941.0000.9090.9090.0941.0000.0900.140
검증수행기관0.2240.1120.0900.2240.2240.3290.0901.0000.339
지정업종0.0000.1980.1400.0000.0000.7960.1400.3391.000
2023-12-10T19:41:09.150725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소관부처지정구분지정업종온실가스 배출량 원단위 값온실가스 배출량 원단위 단위에너지 사용량 원단위 값에너지 사용량 원단위 단위검증수행기관매출액(원)
연번1.0000.1470.0000.1210.0220.0000.0000.0000.1610.022
소관부처0.1471.0000.1590.7960.0940.1590.1390.1590.3290.094
지정구분0.0000.1591.0000.0000.9090.9660.9150.9660.2240.909
지정업종0.1210.7960.0001.0000.1400.0000.1980.0000.3390.140
온실가스 배출량 원단위 값0.0220.0940.9090.1401.0000.9090.9940.9090.0901.000
온실가스 배출량 원단위 단위0.0000.1590.9660.0000.9091.0000.9150.9660.2240.909
에너지 사용량 원단위 값0.0000.1390.9150.1980.9940.9151.0000.9150.1120.994
에너지 사용량 원단위 단위0.0000.1590.9660.0000.9090.9660.9151.0000.2240.909
검증수행기관0.1610.3290.2240.3390.0900.2240.1120.2241.0000.090
매출액(원)0.0220.0940.9090.1401.0000.9090.9940.9090.0901.000

Missing values

2023-12-10T19:40:56.709175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:40:57.182316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번소관부처법인명주소대상연도지정구분지정업종온실가스 배출량온실가스 배출량 단위온실가스 배출량 원단위 값온실가스 배출량 원단위 단위에너지 사용량에너지 사용량 단위에너지 사용량 원단위 값에너지 사용량 원단위 단위검증수행기관매출액(원)
01환경부(유)에스케이씨에보닉페록사이드코리아울산광역시 남구 상개로 99(상개동)2019사업장석유화학53,909tCO₂eq--1,124TJ--(재)한국품질재단-
12국토교통부(유)호남고속전라북도 전주시 덕진구 신복천변로 28 (팔복동1가)2019사업장교통(여객)24,809tCO₂eq--384TJ--(주)한국경영인증원-
23환경부(주)HJ매그놀리아용평호텔앤리조트강원도 평창군 대관령면 올림픽로 715 용평리조트2019사업장건물32,955tCO₂eq--653TJ--(재)한국품질재단-
34환경부(주)HM금속경상남도 함안군 군북면 함안산단로 1702019사업장산업37,628tCO₂eq--654TJ--한국표준협회-
45농림축산식품부(주)MH에탄올경상남도 창원시 마산회원구 내서읍 광려천남로 252019사업장음료제조업10,062tCO₂eq--293TJ--(재)한국화학융합시험연구원-
56환경부(주)MSC경상남도 양산시 소주회야로 45-732019사업장음식료품27,775tCO₂eq--555TJ--한국가스안전공사-
67환경부(주)SIMPAC인천광역시 부평구 부평북로 1412019업체산업463,352tCO₂eq118.21tCO2eq/억원5,067TJ1.29TJ/억원한국표준협회391,970,769,000
78환경부(주)강원랜드강원도 정선군 사북읍 하이원길 2652019사업장건물83,321tCO₂eq--1,686TJ--한국표준협회-
89환경부(주)건화경상남도 거제시 연초면 연하해안로841-542019사업장조선30,632tCO₂eq--534TJ--한국표준협회-
910국토교통부(주)경기고속경기도 광주시 광주대로 171 (송정동)2019업체교통(여객)166,441tCO₂eq55.04tCO2eq/억원2,550TJ0.84TJ/억원산림조합중앙회302,392,320,220
연번소관부처법인명주소대상연도지정구분지정업종온실가스 배출량온실가스 배출량 단위온실가스 배출량 원단위 값온실가스 배출량 원단위 단위에너지 사용량에너지 사용량 단위에너지 사용량 원단위 값에너지 사용량 원단위 단위검증수행기관매출액(원)
9093산업통상자원부(주)성신미네필드강원도 정선군 남면 칠현로 5042019사업장시멘트16,962tCO₂eq--327TJ--한국표준협회-
9194산업통상자원부(주)성호금속 경주2공장경상북도 경주시 천북면 천북산단로2길 702019사업장비철금속29,580tCO₂eq--528TJ--이큐에이㈜-
9295산업통상자원부(주)성호금속 영천공장경상북도 영천시 언하공단2길 252019사업장철강18,847tCO₂eq--342TJ--이큐에이㈜-
9396산업통상자원부(주)성호기업경상북도 경주시 천북면 천북산단로1길 74-512019사업장철강13,897tCO₂eq--275TJ--이큐에이㈜-
9497환경부(주)세아베스틸서울특별시 마포구 양화로 45(서교동)2019업체철강1,221,403tCO₂eq70.29tCO2eq/억원19,115TJ1.1TJ/억원한국표준협회1,737,589,434,000
9598환경부(주)세아씨엠전라북도 군산시 자유로 241(소룡동)2019사업장철강64,376tCO₂eq--1,295TJ--한국표준협회-
9699국토교통부(주)세아엘앤에스경상북도 포항시 남구 철강로 348 (호동)2019사업장교통(화물)16,597tCO₂eq--243TJ--한국표준협회-
97100환경부(주)세아제강서울특별시 마포구 양화로 45, 25,26,27층(서교동, 세아타워)2019사업장산업58,506tCO₂eq--1,186TJ--한국표준협회-
98101환경부(주)세아특수강경상북도 포항시 남구 괴동로 402019사업장철강26,816tCO₂eq--526TJ--한국표준협회-
99102농림축산식품부(주)세우경기도 시흥시 공단1대로379번안길 29 시화공단 4나 204호2019사업장식료품 제조업17,215tCO₂eq--345TJ--(재)한국품질재단-