Overview

Dataset statistics

Number of variables: 17
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 13.6 KiB
Average record size in memory: 139.3 B

Variable types

Numeric: 1
Categorical: 12
Text: 4

Alerts

대상연도 has constant value "2018" (Constant)
온실가스 배출량 단위 has constant value "tCO₂eq" (Constant)
에너지 사용량 단위 has constant value "TJ" (Constant)
온실가스 배출량 원단위 단위 is highly overall correlated with 지정구분 and 4 other fields (High correlation)
에너지 사용량 원단위 값 is highly overall correlated with 지정구분 and 4 other fields (High correlation)
매출액(원) is highly overall correlated with 지정구분 and 4 other fields (High correlation)
지정구분 is highly overall correlated with 온실가스 배출량 원단위 값 and 4 other fields (High correlation)
에너지 사용량 원단위 단위 is highly overall correlated with 지정구분 and 4 other fields (High correlation)
온실가스 배출량 원단위 값 is highly overall correlated with 지정구분 and 4 other fields (High correlation)
소관부처 is highly overall correlated with 지정업종 (High correlation)
지정업종 is highly overall correlated with 소관부처 (High correlation)
온실가스 배출량 원단위 값 is highly imbalanced (63.9%) (Imbalance)
에너지 사용량 원단위 값 is highly imbalanced (63.6%) (Imbalance)
매출액(원) is highly imbalanced (63.9%) (Imbalance)
연번 has unique values (Unique)
법인명 has unique values (Unique)
온실가스 배출량 has unique values (Unique)
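
The imbalance percentages in the alerts above are consistent with one minus the normalized Shannon entropy of the value counts. The sketch below is an illustration under that assumption, not the profiling library's own code; the function name `imbalance_score` is hypothetical, and the counts are copied from the Common Values tables in this report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits) of
    the class frequencies and k is the number of distinct classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is treated as score 0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# 온실가스 배출량 원단위 값 and 매출액(원): "-" appears 80 times,
# and 20 other values appear once each (21 distinct values).
ghg_intensity = imbalance_score([80] + [1] * 20)

# 에너지 사용량 원단위 값: "-" x80, 0.13 x3, 0.01 x2, and 15 values seen once
# (18 distinct values).
energy_intensity = imbalance_score([80, 3, 2] + [1] * 15)

print(f"{ghg_intensity:.1%}")     # 63.9%
print(f"{energy_intensity:.1%}")  # 63.6%
```

Both results match the alert text, which suggests the imbalance is driven almost entirely by the "-" placeholder that fills 80 of the 100 rows.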

Reproduction

Analysis started: 2023-12-10 10:41:11.513142
Analysis finished: 2023-12-10 10:41:16.136719
Duration: 4.62 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
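
A report of this kind can be regenerated roughly as follows. This is a minimal sketch, assuming the data is available as a CSV file; the function name `build_report` and the file names are hypothetical, and the settings stored in config.json are not reproduced here.

```python
def build_report(csv_path, html_path, title="Profiling report"):
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported lazily so this module loads even when the optional
    # packages (pandas, ydata-profiling) are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title=title)
    profile.to_file(html_path)
    return profile


# Example call (hypothetical file names):
# build_report("emissions_2018.csv", "report.html")
```

The timestamps and duration in a regenerated report will differ from the ones listed above.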

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
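
The figures above are what one would expect if 연번 is simply the sequence 1 to 100. A short check, assuming exactly that, using only the standard library (the `percentile` helper is an illustrative linear-interpolation implementation, not the report's own code):

```python
import statistics

values = list(range(1, 101))  # assumption: 연번 is exactly 1, 2, ..., 100

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)
cv = std_dev / mean                     # coefficient of variation


def percentile(sorted_values, q):
    """Percentile by linear interpolation between neighbours, q in [0, 1]."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


p5 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)
median = statistics.median(values)
# Median absolute deviation: median of |x - median|.
mad = statistics.median(abs(v - median) for v in values)

print(sum(values), mean, round(variance, 5), round(std_dev, 6), round(cv, 8))
print(p5, q1, median, q3, p95, q3 - q1, mad)
```

Under this assumption the sum, mean, variance, standard deviation, quantiles, IQR, and MAD all agree with the values reported for 연번.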
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

소관부처
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
산업통상자원부 62
국토교통부 25
농림축산식품부 8
환경부 3
해양수산부 2

Length

Max length7
Median length7
Mean length6.34
Min length3
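
The length statistics follow directly from the category counts. A small check, using the counts from the Common Values table of this variable (lengths are counted in Unicode code points, assuming precomposed Hangul syllables):

```python
# Category counts for 소관부처, copied from the Common Values table.
counts = {
    "산업통상자원부": 62,
    "국토교통부": 25,
    "농림축산식품부": 8,
    "환경부": 3,
    "해양수산부": 2,
}

total = sum(counts.values())
# Weighted mean of the string lengths.
mean_length = sum(len(name) * n for name, n in counts.items()) / total
max_length = max(len(name) for name in counts)
min_length = min(len(name) for name in counts)

print(total, round(mean_length, 2), max_length, min_length)  # 100 6.34 7 3
```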

Unique

Unique0
Unique (%)0.0%

Sample

1st row산업통상자원부
2nd row국토교통부
3rd row국토교통부
4th row산업통상자원부
5th row농림축산식품부

Common Values

ValueCountFrequency (%)
산업통상자원부 62
62.0%
국토교통부 25
25.0%
농림축산식품부 8
 
8.0%
환경부 3
 
3.0%
해양수산부 2
 
2.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
산업통상자원부 62
62.0%
국토교통부 25
25.0%
농림축산식품부 8
 
8.0%
환경부 3
 
3.0%
해양수산부 2
 
2.0%

법인명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B

Length

Max length19
Median length16
Mean length8.22
Min length5

Characters and Unicode

Total characters822
Distinct characters188
Distinct categories7
Distinct scripts3
Distinct blocks2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
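
As an illustration of those character properties, the general category of each character can be read with Python's standard `unicodedata` module. This is a sketch, not the report's implementation; the helper `category_counts` and the name mapping are written for this example, and the standard library does not expose script or block names, so only categories are shown.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Zs": "Space Separator",
}


def category_counts(text):
    """Count characters in text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "(주)HM금속"  # 4th sample row of 법인명
counts = category_counts(sample)
for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

For this sample the Hangul syllables fall under Other Letter, the Latin capitals under Uppercase Letter, and the parentheses under Open and Close Punctuation, which is the same breakdown the tables below report for the whole column.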

Unique

Unique100
Unique (%)100.0%

Sample

1st row(유)에스케이씨에보닉페록사이드코리아
2nd row(유)호남고속
3rd row(주)HJ매그놀리아용평호텔앤리조트
4th row(주)HM금속
5th row(주)MH에탄올
ValueCountFrequency (%)
주)성호금속 2
 
1.7%
주)동남 2
 
1.7%
주)샤니 2
 
1.7%
주)서진캠 2
 
1.7%
주)동서기공 2
 
1.7%
주)부산롯데호텔 1
 
0.8%
주)벽산 1
 
0.8%
주)브이샘 1
 
0.8%
주)백광소재 1
 
0.8%
주)로옴코리아 1
 
0.8%
Other values (105) 105
87.5%

Most occurring characters

ValueCountFrequency (%)
101
 
12.3%
( 100
 
12.2%
) 100
 
12.2%
20
 
2.4%
20
 
2.4%
19
 
2.3%
16
 
1.9%
15
 
1.8%
12
 
1.5%
12
 
1.5%
Other values (178) 407
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 583
70.9%
Open Punctuation 100
 
12.2%
Close Punctuation 100
 
12.2%
Space Separator 20
 
2.4%
Uppercase Letter 17
 
2.1%
Decimal Number 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
101
 
17.3%
20
 
3.4%
19
 
3.3%
16
 
2.7%
15
 
2.6%
12
 
2.1%
12
 
2.1%
11
 
1.9%
11
 
1.9%
10
 
1.7%
Other values (163) 356
61.1%
Uppercase Letter
ValueCountFrequency (%)
M 4
23.5%
H 3
17.6%
C 2
11.8%
S 2
11.8%
J 1
 
5.9%
B 1
 
5.9%
I 1
 
5.9%
P 1
 
5.9%
A 1
 
5.9%
F 1
 
5.9%
Open Punctuation
ValueCountFrequency (%)
( 100
100.0%
Close Punctuation
ValueCountFrequency (%)
) 100
100.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 583
70.9%
Common 222
 
27.0%
Latin 17
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
101
 
17.3%
20
 
3.4%
19
 
3.3%
16
 
2.7%
15
 
2.6%
12
 
2.1%
12
 
2.1%
11
 
1.9%
11
 
1.9%
10
 
1.7%
Other values (163) 356
61.1%
Latin
ValueCountFrequency (%)
M 4
23.5%
H 3
17.6%
C 2
11.8%
S 2
11.8%
J 1
 
5.9%
B 1
 
5.9%
I 1
 
5.9%
P 1
 
5.9%
A 1
 
5.9%
F 1
 
5.9%
Common
ValueCountFrequency (%)
( 100
45.0%
) 100
45.0%
20
 
9.0%
2 1
 
0.5%
& 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 583
70.9%
ASCII 239
29.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
101
 
17.3%
20
 
3.4%
19
 
3.3%
16
 
2.7%
15
 
2.6%
12
 
2.1%
12
 
2.1%
11
 
1.9%
11
 
1.9%
10
 
1.7%
Other values (163) 356
61.1%
ASCII
ValueCountFrequency (%)
( 100
41.8%
) 100
41.8%
20
 
8.4%
M 4
 
1.7%
H 3
 
1.3%
C 2
 
0.8%
S 2
 
0.8%
2 1
 
0.4%
J 1
 
0.4%
B 1
 
0.4%
Other values (5) 5
 
2.1%

주소
Text

Distinct98
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size932.0 B

Length

Max length36
Median length29
Mean length23.56
Min length1

Characters and Unicode

Total characters2356
Distinct characters225
Distinct categories8
Distinct scripts3
Distinct blocks2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique96
Unique (%)96.0%

Sample

1st row울산광역시 남구 상개로 99(상개동)
2nd row전라북도 전주시 덕진구 신복천변로 28 (팔복동1가)
3rd row강원도 평창군 대관령면 올림픽로 715 용평리조트
4th row경상남도 함안군 군북면 함안산단로 170
5th row경상남도 창원시 마산회원구 내서읍 광려천남로 25
ValueCountFrequency (%)
경기도 18
 
3.5%
서울특별시 18
 
3.5%
경상북도 10
 
1.9%
전라북도 8
 
1.6%
충청남도 7
 
1.4%
경상남도 7
 
1.4%
강원도 5
 
1.0%
포항시 5
 
1.0%
중구 5
 
1.0%
충청북도 5
 
1.0%
Other values (346) 426
82.9%

Most occurring characters

ValueCountFrequency (%)
436
 
18.5%
86
 
3.7%
84
 
3.6%
66
 
2.8%
1 62
 
2.6%
55
 
2.3%
51
 
2.2%
2 50
 
2.1%
42
 
1.8%
4 40
 
1.7%
Other values (215) 1384
58.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1478
62.7%
Space Separator 436
 
18.5%
Decimal Number 352
 
14.9%
Close Punctuation 32
 
1.4%
Open Punctuation 32
 
1.4%
Dash Punctuation 15
 
0.6%
Uppercase Letter 8
 
0.3%
Other Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
86
 
5.8%
84
 
5.7%
66
 
4.5%
55
 
3.7%
51
 
3.5%
42
 
2.8%
39
 
2.6%
39
 
2.6%
37
 
2.5%
36
 
2.4%
Other values (193) 943
63.8%
Decimal Number
ValueCountFrequency (%)
1 62
17.6%
2 50
14.2%
4 40
11.4%
7 36
10.2%
5 36
10.2%
3 35
9.9%
6 33
9.4%
8 23
 
6.5%
0 19
 
5.4%
9 18
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
A 2
25.0%
S 1
12.5%
E 1
12.5%
H 1
12.5%
I 1
12.5%
D 1
12.5%
K 1
12.5%
Space Separator
ValueCountFrequency (%)
436
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1478
62.7%
Common 870
36.9%
Latin 8
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
86
 
5.8%
84
 
5.7%
66
 
4.5%
55
 
3.7%
51
 
3.5%
42
 
2.8%
39
 
2.6%
39
 
2.6%
37
 
2.5%
36
 
2.4%
Other values (193) 943
63.8%
Common
ValueCountFrequency (%)
436
50.1%
1 62
 
7.1%
2 50
 
5.7%
4 40
 
4.6%
7 36
 
4.1%
5 36
 
4.1%
3 35
 
4.0%
6 33
 
3.8%
) 32
 
3.7%
( 32
 
3.7%
Other values (5) 78
 
9.0%
Latin
ValueCountFrequency (%)
A 2
25.0%
S 1
12.5%
E 1
12.5%
H 1
12.5%
I 1
12.5%
D 1
12.5%
K 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1478
62.7%
ASCII 878
37.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
436
49.7%
1 62
 
7.1%
2 50
 
5.7%
4 40
 
4.6%
7 36
 
4.1%
5 36
 
4.1%
3 35
 
4.0%
6 33
 
3.8%
) 32
 
3.6%
( 32
 
3.6%
Other values (12) 86
 
9.8%
Hangul
ValueCountFrequency (%)
86
 
5.8%
84
 
5.7%
66
 
4.5%
55
 
3.7%
51
 
3.5%
42
 
2.8%
39
 
2.6%
39
 
2.6%
37
 
2.5%
36
 
2.4%
Other values (193) 943
63.8%

대상연도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2018
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2018 100
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
2018 100
100.0%

지정구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
사업장
80 
업체
20 

Length

Max length3
Median length3
Mean length2.8
Min length2

Unique

Unique0
Unique (%)0.0%

Sample

1st row사업장
2nd row사업장
3rd row사업장
4th row사업장
5th row사업장

Common Values

ValueCountFrequency (%)
사업장 80
80.0%
업체 20
 
20.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
사업장 80
80.0%
업체 20
 
20.0%

지정업종
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)28.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
교통(여객) 14
철강 10
건물 7
비철금속 7
자동차 6
Other values (23) 56

Length

Max length14
Median length10
Mean length4.16
Min length2

Unique

Unique8
Unique (%)8.0%

Sample

1st row석유화학
2nd row교통(여객)
3rd row건물
4th row철강
5th row음료제조업

Common Values

ValueCountFrequency (%)
교통(여객) 14
14.0%
철강 10
 
10.0%
건물 7
 
7.0%
비철금속 7
 
7.0%
자동차 6
 
6.0%
식료품 제조업 5
 
5.0%
석유화학 5
 
5.0%
시멘트 5
 
5.0%
섬유 5
 
5.0%
제지 4
 
4.0%
Other values (18) 32
32.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
교통(여객 14
 
12.5%
철강 10
 
8.9%
건물 7
 
6.2%
비철금속 7
 
6.2%
자동차 6
 
5.4%
식료품 5
 
4.5%
제조업 5
 
4.5%
석유화학 5
 
4.5%
시멘트 5
 
4.5%
섬유 5
 
4.5%
Other values (20) 43
38.4%

온실가스 배출량
Text

UNIQUE

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B

Length

Max length9
Median length6
Mean length6.11
Min length5

Characters and Unicode

Total characters611
Distinct characters11
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100
Unique (%)100.0%

Sample

1st row58,372
2nd row25,429
3rd row32,776
4th row34,262
5th row23,436
ValueCountFrequency (%)
58,372 1
 
1.0%
110,202 1
 
1.0%
56,972 1
 
1.0%
19,681 1
 
1.0%
14,646 1
 
1.0%
14,174 1
 
1.0%
76,375 1
 
1.0%
591,629 1
 
1.0%
21,315 1
 
1.0%
60,124 1
 
1.0%
Other values (90) 90
90.0%

Most occurring characters

ValueCountFrequency (%)
, 101
16.5%
1 85
13.9%
2 73
11.9%
6 49
8.0%
3 48
7.9%
4 47
7.7%
5 46
7.5%
7 43
7.0%
8 41
6.7%
9 40
 
6.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 510
83.5%
Other Punctuation 101
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 85
16.7%
2 73
14.3%
6 49
9.6%
3 48
9.4%
4 47
9.2%
5 46
9.0%
7 43
8.4%
8 41
8.0%
9 40
7.8%
0 38
7.5%
Other Punctuation
ValueCountFrequency (%)
, 101
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 611
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
, 101
16.5%
1 85
13.9%
2 73
11.9%
6 49
8.0%
3 48
7.9%
4 47
7.7%
5 46
7.5%
7 43
7.0%
8 41
6.7%
9 40
 
6.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 611
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 101
16.5%
1 85
13.9%
2 73
11.9%
6 49
8.0%
3 48
7.9%
4 47
7.7%
5 46
7.5%
7 43
7.0%
8 41
6.7%
9 40
 
6.5%

온실가스 배출량 단위
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
tCO₂eq
100 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0
Unique (%)0.0%

Sample

1st rowtCO₂eq
2nd rowtCO₂eq
3rd rowtCO₂eq
4th rowtCO₂eq
5th rowtCO₂eq

Common Values

ValueCountFrequency (%)
tCO₂eq 100
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
tco₂eq 100
100.0%

온실가스 배출량 원단위 값
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct21
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
80 
170.54
 
1
61.43
 
1
219.17
 
1
18.95
 
1
Other values (16)
16 

Length

Max length6
Median length1
Mean length1.73
Min length1

Unique

Unique20
Unique (%)20.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 80
80.0%
170.54 1
 
1.0%
61.43 1
 
1.0%
219.17 1
 
1.0%
18.95 1
 
1.0%
0.62 1
 
1.0%
6.86 1
 
1.0%
0.69 1
 
1.0%
62.92 1
 
1.0%
9.08 1
 
1.0%
Other values (11) 11
 
11.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
80
80.0%
52.09 1
 
1.0%
18.71 1
 
1.0%
6.8 1
 
1.0%
357.99 1
 
1.0%
56.38 1
 
1.0%
3.55 1
 
1.0%
4.21 1
 
1.0%
4.32 1
 
1.0%
791.18 1
 
1.0%
Other values (11) 11
 
11.0%

온실가스 배출량 원단위 단위
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
80 
tCO2eq/억원
20 

Length

Max length9
Median length1
Mean length2.6
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 80
80.0%
tCO2eq/억원 20
 
20.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
80
80.0%
tco2eq/억원 20
 
20.0%

에너지 사용량
Text

Distinct92
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size932.0 B

Length

Max length6
Median length3
Mean length3.42
Min length2

Characters and Unicode

Total characters342
Distinct characters11
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique87
Unique (%)87.0%

Sample

1st row1,191
2nd row393
3rd row649
4th row600
5th row292
ValueCountFrequency (%)
357 4
 
4.0%
360 3
 
3.0%
406 2
 
2.0%
393 2
 
2.0%
301 2
 
2.0%
2,256 1
 
1.0%
288 1
 
1.0%
1,398 1
 
1.0%
2,472 1
 
1.0%
394 1
 
1.0%
Other values (82) 82
82.0%

Most occurring characters

ValueCountFrequency (%)
2 51
14.9%
3 48
14.0%
1 44
12.9%
5 31
9.1%
4 31
9.1%
7 30
8.8%
6 24
7.0%
9 23
6.7%
0 22
6.4%
, 21
6.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 321
93.9%
Other Punctuation 21
 
6.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 51
15.9%
3 48
15.0%
1 44
13.7%
5 31
9.7%
4 31
9.7%
7 30
9.3%
6 24
7.5%
9 23
7.2%
0 22
6.9%
8 17
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 342
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 51
14.9%
3 48
14.0%
1 44
12.9%
5 31
9.1%
4 31
9.1%
7 30
8.8%
6 24
7.0%
9 23
6.7%
0 22
6.4%
, 21
6.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 342
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 51
14.9%
3 48
14.0%
1 44
12.9%
5 31
9.1%
4 31
9.1%
7 30
8.8%
6 24
7.0%
9 23
6.7%
0 22
6.4%
, 21
6.1%

에너지 사용량 단위
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
TJ
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0
Unique (%)0.0%

Sample

1st rowTJ
2nd rowTJ
3rd rowTJ
4th rowTJ
5th rowTJ

Common Values

ValueCountFrequency (%)
TJ 100
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
tj 100
100.0%

에너지 사용량 원단위 값
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct18
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
80 
0.13
 
3
0.01
 
2
0.74
 
1
0.94
 
1
Other values (13)
13 

Length

Max length4
Median length1
Mean length1.58
Min length1

Unique

Unique15
Unique (%)15.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 80
80.0%
0.13 3
 
3.0%
0.01 2
 
2.0%
0.74 1
 
1.0%
0.94 1
 
1.0%
0.4 1
 
1.0%
0.38 1
 
1.0%
0.14 1
 
1.0%
0.97 1
 
1.0%
1.86 1
 
1.0%
Other values (8) 8
 
8.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
80
80.0%
0.13 3
 
3.0%
0.01 2
 
2.0%
0.09 1
 
1.0%
0.84 1
 
1.0%
0.31 1
 
1.0%
1.5 1
 
1.0%
1.11 1
 
1.0%
0.07 1
 
1.0%
0.08 1
 
1.0%
Other values (8) 8
 
8.0%

에너지 사용량 원단위 단위
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
80 
TJ/억원
20 

Length

Max length5
Median length1
Mean length1.8
Min length1

Unique

Unique0
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 80
80.0%
TJ/억원 20
 
20.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
80
80.0%
tj/억원 20
 
20.0%

검증수행기관
Categorical

Distinct14
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
한국표준협회 34
(주)한국경영인증원 16
(재)한국품질재단 13
이큐에이㈜ 10
산림조합중앙회 6
Other values (9) 21

Length

Max length14
Median length13
Mean length7.93
Min length5

Unique

Unique3
Unique (%)3.0%

Sample

1st row(재)한국품질재단
2nd row(주)한국경영인증원
3rd row(재)한국품질재단
4th row한국표준협회
5th row(재)한국화학융합시험연구원

Common Values

ValueCountFrequency (%)
한국표준협회 34
34.0%
(주)한국경영인증원 16
16.0%
(재)한국품질재단 13
 
13.0%
이큐에이㈜ 10
 
10.0%
산림조합중앙회 6
 
6.0%
㈜한국품질보증원 4
 
4.0%
(주)비에스아이그룹코리아 4
 
4.0%
한국생산성본부인증원(주) 3
 
3.0%
한국가스안전공사 3
 
3.0%
한국산업기술시험원 2
 
2.0%
Other values (4) 5
 
5.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
한국표준협회 34
34.0%
주)한국경영인증원 16
16.0%
재)한국품질재단 13
 
13.0%
이큐에이㈜ 10
 
10.0%
산림조합중앙회 6
 
6.0%
㈜한국품질보증원 4
 
4.0%
주)비에스아이그룹코리아 4
 
4.0%
한국생산성본부인증원(주 3
 
3.0%
한국가스안전공사 3
 
3.0%
한국산업기술시험원 2
 
2.0%
Other values (4) 5
 
5.0%

매출액(원)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct21
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
80 
268,863,854,000
 
1
280,130,669,128
 
1
100,474,019,631
 
1
279,085,217,000
 
1
Other values (16)
16 

Length

Max length18
Median length1
Mean length3.94
Min length1

Unique

Unique20
Unique (%)20.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 80
80.0%
268,863,854,000 1
 
1.0%
280,130,669,128 1
 
1.0%
100,474,019,631 1
 
1.0%
279,085,217,000 1
 
1.0%
17,567,958,000,000 1
 
1.0%
1,196,669,138,021 1
 
1.0%
10,204,674,621,000 1
 
1.0%
249,552,728,975 1
 
1.0%
474,418,754,000 1
 
1.0%
Other values (11) 11
 
11.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
80
80.0%
132,213,672,000 1
 
1.0%
755,002,726,000 1
 
1.0%
838,159,210,000 1
 
1.0%
165,264,530,601 1
 
1.0%
41,561,118,000 1
 
1.0%
3,108,352,389,000 1
 
1.0%
2,096,642,934,000 1
 
1.0%
1,632,631,810,000 1
 
1.0%
27,027,310,000 1
 
1.0%
Other values (11) 11
 
11.0%
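
Columns such as 매출액(원), 온실가스 배출량, and the 원단위 값 fields are typed as Categorical or Text rather than numeric, most likely because the values carry thousands separators and use "-" as a placeholder. The sketch below shows one way to convert such strings before profiling; `parse_number` is a hypothetical helper, and treating "-" as missing is an assumption about the data.

```python
def parse_number(text):
    """Convert a string like '58,372' or '170.54' to a float.

    Returns None for the '-' placeholder or an empty string
    (assumption: '-' means "not reported").
    """
    cleaned = text.strip().replace(",", "")
    if cleaned in ("", "-"):
        return None
    return float(cleaned)


# Values copied from the tables in this report.
for raw in ["268,863,854,000", "58,372", "170.54", "-"]:
    print(repr(raw), "->", parse_number(raw))
```

After such a conversion the 80 placeholder rows would show up as missing cells, and the remaining values could be profiled as real numbers.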

Interactions


Correlations

Variable | 연번 | 소관부처 | 법인명 | 주소 | 지정구분 | 지정업종 | 온실가스 배출량 | 온실가스 배출량 원단위 값 | 온실가스 배출량 원단위 단위 | 에너지 사용량 | 에너지 사용량 원단위 값 | 에너지 사용량 원단위 단위 | 검증수행기관 | 매출액(원)
연번 | 1.000 | 0.295 | 1.000 | 0.870 | 0.000 | 0.304 | 1.000 | 0.123 | 0.000 | 0.763 | 0.214 | 0.000 | 0.330 | 0.123
소관부처 | 0.295 | 1.000 | 1.000 | 1.000 | 0.061 | 1.000 | 1.000 | 0.404 | 0.061 | 0.927 | 0.396 | 0.061 | 0.570 | 0.404
법인명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
주소 | 0.870 | 1.000 | 1.000 | 1.000 | 1.000 | 0.994 | 1.000 | 1.000 | 1.000 | 0.989 | 1.000 | 1.000 | 0.993 | 1.000
지정구분 | 0.000 | 0.061 | 1.000 | 1.000 | 1.000 | 0.308 | 1.000 | 1.000 | 0.999 | 1.000 | 1.000 | 0.999 | 0.000 | 1.000
지정업종 | 0.304 | 1.000 | 1.000 | 0.994 | 0.308 | 1.000 | 1.000 | 0.724 | 0.308 | 0.988 | 0.693 | 0.308 | 0.797 | 0.724
온실가스 배출량 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
온실가스 배출량 원단위 값 | 0.123 | 0.404 | 1.000 | 1.000 | 1.000 | 0.724 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000
온실가스 배출량 원단위 단위 | 0.000 | 0.061 | 1.000 | 1.000 | 0.999 | 0.308 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 0.000 | 1.000
에너지 사용량 | 0.763 | 0.927 | 1.000 | 0.989 | 1.000 | 0.988 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.804 | 1.000
에너지 사용량 원단위 값 | 0.214 | 0.396 | 1.000 | 1.000 | 1.000 | 0.693 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000
에너지 사용량 원단위 단위 | 0.000 | 0.061 | 1.000 | 1.000 | 0.999 | 0.308 | 1.000 | 1.000 | 0.999 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000
검증수행기관 | 0.330 | 0.570 | 1.000 | 0.993 | 0.000 | 0.797 | 1.000 | 0.000 | 0.000 | 0.804 | 0.000 | 0.000 | 1.000 | 0.000
매출액(원) | 0.123 | 0.404 | 1.000 | 1.000 | 1.000 | 0.724 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000

Variable | 온실가스 배출량 원단위 단위 | 에너지 사용량 원단위 값 | 매출액(원) | 지정구분 | 에너지 사용량 원단위 단위 | 소관부처 | 온실가스 배출량 원단위 값 | 검증수행기관 | 지정업종
온실가스 배출량 원단위 단위 | 1.000 | 0.915 | 0.898 | 0.968 | 0.968 | 0.070 | 0.898 | 0.000 | 0.203
에너지 사용량 원단위 값 | 0.915 | 1.000 | 0.982 | 0.915 | 0.915 | 0.195 | 0.982 | 0.000 | 0.240
매출액(원) | 0.898 | 0.982 | 1.000 | 0.898 | 0.898 | 0.187 | 1.000 | 0.000 | 0.247
지정구분 | 0.968 | 0.915 | 0.898 | 1.000 | 0.968 | 0.070 | 0.898 | 0.000 | 0.203
에너지 사용량 원단위 단위 | 0.968 | 0.915 | 0.898 | 0.968 | 1.000 | 0.070 | 0.898 | 0.000 | 0.203
소관부처 | 0.070 | 0.195 | 0.187 | 0.070 | 0.070 | 1.000 | 0.187 | 0.323 | 0.871
온실가스 배출량 원단위 값 | 0.898 | 0.982 | 1.000 | 0.898 | 0.898 | 0.187 | 1.000 | 0.000 | 0.247
검증수행기관 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.323 | 0.000 | 1.000 | 0.304
지정업종 | 0.203 | 0.240 | 0.247 | 0.203 | 0.203 | 0.871 | 0.247 | 0.304 | 1.000

Variable | 연번 | 소관부처 | 지정구분 | 지정업종 | 온실가스 배출량 원단위 값 | 온실가스 배출량 원단위 단위 | 에너지 사용량 원단위 값 | 에너지 사용량 원단위 단위 | 검증수행기관 | 매출액(원)
연번 | 1.000 | 0.120 | 0.000 | 0.086 | 0.000 | 0.000 | 0.067 | 0.000 | 0.131 | 0.000
소관부처 | 0.120 | 1.000 | 0.070 | 0.871 | 0.187 | 0.070 | 0.195 | 0.070 | 0.323 | 0.187
지정구분 | 0.000 | 0.070 | 1.000 | 0.203 | 0.898 | 0.968 | 0.915 | 0.968 | 0.000 | 0.898
지정업종 | 0.086 | 0.871 | 0.203 | 1.000 | 0.247 | 0.203 | 0.240 | 0.203 | 0.304 | 0.247
온실가스 배출량 원단위 값 | 0.000 | 0.187 | 0.898 | 0.247 | 1.000 | 0.898 | 0.982 | 0.898 | 0.000 | 1.000
온실가스 배출량 원단위 단위 | 0.000 | 0.070 | 0.968 | 0.203 | 0.898 | 1.000 | 0.915 | 0.968 | 0.000 | 0.898
에너지 사용량 원단위 값 | 0.067 | 0.195 | 0.915 | 0.240 | 0.982 | 0.915 | 1.000 | 0.915 | 0.000 | 0.982
에너지 사용량 원단위 단위 | 0.000 | 0.070 | 0.968 | 0.203 | 0.898 | 0.968 | 0.915 | 1.000 | 0.000 | 0.898
검증수행기관 | 0.131 | 0.323 | 0.000 | 0.304 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000
매출액(원) | 0.000 | 0.187 | 0.898 | 0.247 | 1.000 | 0.898 | 0.982 | 0.898 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

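# | 연번 | 소관부처 | 법인명 | 주소 | 대상연도 | 지정구분 | 지정업종 | 온실가스 배출량 | 온실가스 배출량 단위 | 온실가스 배출량 원단위 값 | 온실가스 배출량 원단위 단위 | 에너지 사용량 | 에너지 사용량 단위 | 에너지 사용량 원단위 값 | 에너지 사용량 원단위 단위 | 검증수행기관 | 매출액(원)
0 | 1 | 산업통상자원부 | (유)에스케이씨에보닉페록사이드코리아 | 울산광역시 남구 상개로 99(상개동) | 2018 | 사업장 | 석유화학 | 58,372 | tCO₂eq | - | - | 1,191 | TJ | - | - | (재)한국품질재단 | -
1 | 2 | 국토교통부 | (유)호남고속 | 전라북도 전주시 덕진구 신복천변로 28 (팔복동1가) | 2018 | 사업장 | 교통(여객) | 25,429 | tCO₂eq | - | - | 393 | TJ | - | - | (주)한국경영인증원 | -
2 | 3 | 국토교통부 | (주)HJ매그놀리아용평호텔앤리조트 | 강원도 평창군 대관령면 올림픽로 715 용평리조트 | 2018 | 사업장 | 건물 | 32,776 | tCO₂eq | - | - | 649 | TJ | - | - | (재)한국품질재단 | -
3 | 4 | 산업통상자원부 | (주)HM금속 | 경상남도 함안군 군북면 함안산단로 170 | 2018 | 사업장 | 철강 | 34,262 | tCO₂eq | - | - | 600 | TJ | - | - | 한국표준협회 | -
4 | 5 | 농림축산식품부 | (주)MH에탄올 | 경상남도 창원시 마산회원구 내서읍 광려천남로 25 | 2018 | 사업장 | 음료제조업 | 23,436 | tCO₂eq | - | - | 292 | TJ | - | - | (재)한국화학융합시험연구원 | -
5 | 6 | 농림축산식품부 | (주)MSC | 경상남도 양산시 소주회야로 45-73 | 2018 | 사업장 | 음식료품 | 26,317 | tCO₂eq | - | - | 525 | TJ | - | - | 한국산업기술시험원 | -
6 | 7 | 산업통상자원부 | (주)SIMPAC | 인천광역시 부평구 부평북로 141 | 2018 | 업체 | 산업 | 458,521 | tCO₂eq | 170.54 | tCO2eq/억원 | 4,998 | TJ | 1.86 | TJ/억원 | 한국표준협회 | 268,863,854,000
7 | 8 | 국토교통부 | (주)강원랜드 | 강원도 정선군 사북읍 하이원길 265 | 2018 | 사업장 | 건물 | 76,114 | tCO₂eq | - | - | 1,534 | TJ | - | - | 한국표준협회 | -
8 | 9 | 산업통상자원부 | (주)건화 | 경상남도 거제시 연초면 연하해안로841-54 | 2018 | 사업장 | 조선 | 22,213 | tCO₂eq | - | - | 410 | TJ | - | - | 한국표준협회 | -
9 | 10 | 국토교통부 | (주)경기고속 | 경기도 광주시 광주대로 171 | 2018 | 업체 | 교통(여객) | 172,083 | tCO₂eq | 61.43 | tCO2eq/억원 | 2,644 | TJ | 0.94 | TJ/억원 | 산림조합중앙회 | 280,130,669,128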
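
# | 연번 | 소관부처 | 법인명 | 주소 | 대상연도 | 지정구분 | 지정업종 | 온실가스 배출량 | 온실가스 배출량 단위 | 온실가스 배출량 원단위 값 | 온실가스 배출량 원단위 단위 | 에너지 사용량 | 에너지 사용량 단위 | 에너지 사용량 원단위 값 | 에너지 사용량 원단위 단위 | 검증수행기관 | 매출액(원)
90 | 91 | 산업통상자원부 | (주)선일다이파스 | 충청북도 진천군 광혜원면 광혜원산단2길 14 | 2018 | 사업장 | 기계 | 18,329 | tCO₂eq | - | - | 373 | TJ | - | - | (주)한국경영인증원 | -
91 | 92 | 국토교통부 | (주)선진운수 | 서울특별시 은평구 서오릉로 207 (구산동) | 2018 | 사업장 | 교통(여객) | 27,881 | tCO₂eq | - | - | 521 | TJ | - | - | (재)한국품질재단 | -
92 | 93 | 산업통상자원부 | (주)성광벤드 | 부산광역시 강서구 녹산산단262로 26 (송정동) | 2018 | 사업장 | 철강 | 12,781 | tCO₂eq | - | - | 246 | TJ | - | - | 한국표준협회 | -
93 | 94 | 산업통상자원부 | (주)성신미네필드 | 강원도 정선군 남면 칠현로 504 | 2018 | 사업장 | 시멘트 | 16,890 | tCO₂eq | - | - | 327 | TJ | - | - | 한국표준협회 | -
94 | 95 | 산업통상자원부 | (주)성호금속 경주2공장 | 경상북도 경주시 천북면 천북산단로2길 70 | 2018 | 사업장 | 비철금속 | 28,315 | tCO₂eq | - | - | 505 | TJ | - | - | 이큐에이㈜ | -
95 | 96 | 산업통상자원부 | (주)성호금속 영천공장 | 경상북도 영천시 언하공단2길 25 | 2018 | 사업장 | 철강 | 19,676 | tCO₂eq | - | - | 357 | TJ | - | - | 이큐에이㈜ | -
96 | 97 | 산업통상자원부 | (주)성호기업 | 경상북도 경주시 천북면 천북산단로1길 74-51 | 2018 | 사업장 | 철강 | 15,285 | tCO₂eq | - | - | 301 | TJ | - | - | 이큐에이㈜ | -
97 | 98 | 산업통상자원부 | (주)세아베스틸 | 서울특별시 마포구 양화로 45(서교동) | 2018 | 업체 | 철강 | 1,421,461 | tCO₂eq | 69.6 | tCO2eq/억원 | 22,316 | TJ | 1.09 | TJ/억원 | 한국표준협회 | 2,042,371,454,000
98 | 99 | 산업통상자원부 | (주)세아씨엠 | 전라북도 군산시 자유로 241(소룡동) | 2018 | 사업장 | 철강 | 66,216 | tCO₂eq | - | - | 1,338 | TJ | - | - | 한국표준협회 | -
99 | 100 | 국토교통부 | (주)세아엘앤에스 | 경상북도 포항시 남구 철강로 348 (호동) | 2018 | 사업장 | 교통(화물) | 18,520 | tCO₂eq | - | - | 270 | TJ | - | - | 한국표준협회 | -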
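
In the sample rows, the three 업체 entries are the only ones with revenue and intensity values, and the intensities appear to be the reported amount divided by revenue in units of 억원 (10^8 won). The check below is a sketch under that assumption; the `intensity` helper is hypothetical, and the figures are copied from the sample rows above.

```python
EOK_WON = 100_000_000  # 1 억원 = 10^8 won

# (법인명, emissions in tCO2eq, energy in TJ, revenue in won,
#  reported GHG intensity, reported energy intensity)
rows = [
    ("(주)SIMPAC", 458_521, 4_998, 268_863_854_000, 170.54, 1.86),
    ("(주)경기고속", 172_083, 2_644, 280_130_669_128, 61.43, 0.94),
    ("(주)세아베스틸", 1_421_461, 22_316, 2_042_371_454_000, 69.6, 1.09),
]


def intensity(amount, revenue_won):
    """Amount per 억원 of revenue."""
    return amount / (revenue_won / EOK_WON)


checks = []
for name, ghg, energy, revenue, ghg_reported, energy_reported in rows:
    ghg_calc = intensity(ghg, revenue)
    energy_calc = intensity(energy, revenue)
    # Reported values are rounded to two decimals, so allow half a unit
    # in the last place.
    ok = (abs(ghg_calc - ghg_reported) <= 0.005
          and abs(energy_calc - energy_reported) <= 0.005)
    checks.append(ok)
    print(name, round(ghg_calc, 2), round(energy_calc, 2), ok)
```

If the relationship holds for the remaining 업체 rows too, it would explain why 매출액(원) and the two 원단위 값 columns are flagged as highly correlated with 지정구분: all of them are populated only for 업체 rows.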