Overview

Dataset statistics

Number of variables17
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.6 KiB
Average record size in memory139.3 B

Variable types

Numeric1
Categorical12
Text4

Alerts

대상연도 has constant value ""Constant
온실가스 배출량 단위 has constant value ""Constant
에너지 사용량 단위 has constant value ""Constant
온실가스 배출량 원단위 단위 is highly overall correlated with 지정구분 and 4 other fieldsHigh correlation
에너지 사용량 원단위 값 is highly overall correlated with 지정구분 and 4 other fieldsHigh correlation
매출액(원) is highly overall correlated with 지정구분 and 4 other fieldsHigh correlation
지정구분 is highly overall correlated with 온실가스 배출량 원단위 값 and 4 other fieldsHigh correlation
에너지 사용량 원단위 단위 is highly overall correlated with 지정구분 and 4 other fieldsHigh correlation
온실가스 배출량 원단위 값 is highly overall correlated with 지정구분 and 4 other fieldsHigh correlation
소관부처 is highly overall correlated with 지정업종High correlation
지정업종 is highly overall correlated with 소관부처High correlation
온실가스 배출량 원단위 값 is highly imbalanced (65.1%)Imbalance
에너지 사용량 원단위 값 is highly imbalanced (64.6%)Imbalance
매출액(원) is highly imbalanced (62.7%)Imbalance
연번 has unique valuesUnique
법인명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:41:30.726539
Analysis finished2023-12-10 10:41:35.684301
Duration4.96 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:41:35.824713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T19:41:36.106094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

소관부처
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
산업통상자원부
64 
국토교통부
22 
농림축산식품부
환경부
 
3
해양수산부
 
3

Length

Max length7
Median length7
Mean length6.38
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row산업통상자원부
2nd row국토교통부
3rd row산업통상자원부
4th row농림축산식품부
5th row농림축산식품부

Common Values

ValueCountFrequency (%)
산업통상자원부 64
64.0%
국토교통부 22
 
22.0%
농림축산식품부 8
 
8.0%
환경부 3
 
3.0%
해양수산부 3
 
3.0%

Length

2023-12-10T19:41:36.404999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:36.627016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
산업통상자원부 64
64.0%
국토교통부 22
 
22.0%
농림축산식품부 8
 
8.0%
환경부 3
 
3.0%
해양수산부 3
 
3.0%

법인명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:41:37.189306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length7.97
Min length5

Characters and Unicode

Total characters797
Distinct characters187
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row(유)에스케이씨에보닉페록사이드코리아
2nd row(유)호남고속
3rd row(주)HM금속
4th row(주)MH에탄올
5th row(주)MSC
ValueCountFrequency (%)
주)성호금속 2
 
1.7%
주)샤니 2
 
1.7%
주)동서기공 2
 
1.7%
주)동남 2
 
1.7%
주)무주덕유산리조트 1
 
0.8%
주)삼천리 1
 
0.8%
주)삼원강재 1
 
0.8%
주)삼동 1
 
0.8%
주)빙그레 1
 
0.8%
주)비엠금속 1
 
0.8%
Other values (104) 104
88.1%
2023-12-10T19:41:38.131319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
101
 
12.7%
( 100
 
12.5%
) 100
 
12.5%
18
 
2.3%
15
 
1.9%
15
 
1.9%
15
 
1.9%
12
 
1.5%
12
 
1.5%
11
 
1.4%
Other values (177) 398
49.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 554
69.5%
Open Punctuation 100
 
12.5%
Close Punctuation 100
 
12.5%
Uppercase Letter 23
 
2.9%
Space Separator 18
 
2.3%
Decimal Number 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
101
 
18.2%
15
 
2.7%
15
 
2.7%
15
 
2.7%
12
 
2.2%
12
 
2.2%
11
 
2.0%
11
 
2.0%
10
 
1.8%
10
 
1.8%
Other values (160) 342
61.7%
Uppercase Letter
ValueCountFrequency (%)
M 5
21.7%
S 3
13.0%
P 3
13.0%
H 2
 
8.7%
C 2
 
8.7%
A 2
 
8.7%
L 1
 
4.3%
T 1
 
4.3%
E 1
 
4.3%
I 1
 
4.3%
Other values (2) 2
 
8.7%
Open Punctuation
ValueCountFrequency (%)
( 100
100.0%
Close Punctuation
ValueCountFrequency (%)
) 100
100.0%
Space Separator
ValueCountFrequency (%)
18
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 554
69.5%
Common 220
 
27.6%
Latin 23
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
101
 
18.2%
15
 
2.7%
15
 
2.7%
15
 
2.7%
12
 
2.2%
12
 
2.2%
11
 
2.0%
11
 
2.0%
10
 
1.8%
10
 
1.8%
Other values (160) 342
61.7%
Latin
ValueCountFrequency (%)
M 5
21.7%
S 3
13.0%
P 3
13.0%
H 2
 
8.7%
C 2
 
8.7%
A 2
 
8.7%
L 1
 
4.3%
T 1
 
4.3%
E 1
 
4.3%
I 1
 
4.3%
Other values (2) 2
 
8.7%
Common
ValueCountFrequency (%)
( 100
45.5%
) 100
45.5%
18
 
8.2%
2 1
 
0.5%
& 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 554
69.5%
ASCII 243
30.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
101
 
18.2%
15
 
2.7%
15
 
2.7%
15
 
2.7%
12
 
2.2%
12
 
2.2%
11
 
2.0%
11
 
2.0%
10
 
1.8%
10
 
1.8%
Other values (160) 342
61.7%
ASCII
ValueCountFrequency (%)
( 100
41.2%
) 100
41.2%
18
 
7.4%
M 5
 
2.1%
S 3
 
1.2%
P 3
 
1.2%
H 2
 
0.8%
C 2
 
0.8%
A 2
 
0.8%
2 1
 
0.4%
Other values (7) 7
 
2.9%

주소
Text

Distinct98
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:41:38.711310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length28
Mean length23.85
Min length15

Characters and Unicode

Total characters2385
Distinct characters221
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique96 ?
Unique (%)96.0%

Sample

1st row울산광역시 남구 상개로 99(상개동)
2nd row전라북도 전주시 덕진구 신복천변로 28 (팔복동1가)
3rd row경상남도 함안군 군북면 함안산단로 170
4th row경상남도 창원시 마산회원구 내서읍 광려천남로 25
5th row경상남도 양산시 소주회야로 45-73
ValueCountFrequency (%)
서울특별시 19
 
3.7%
경기도 17
 
3.3%
경상북도 13
 
2.5%
전라북도 8
 
1.6%
남구 7
 
1.4%
포항시 7
 
1.4%
경상남도 7
 
1.4%
중구 7
 
1.4%
충청남도 7
 
1.4%
충청북도 6
 
1.2%
Other values (342) 417
81.0%
2023-12-10T19:41:39.683748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
443
 
18.6%
88
 
3.7%
87
 
3.6%
69
 
2.9%
1 60
 
2.5%
56
 
2.3%
52
 
2.2%
2 48
 
2.0%
45
 
1.9%
43
 
1.8%
Other values (211) 1394
58.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1500
62.9%
Space Separator 443
 
18.6%
Decimal Number 356
 
14.9%
Open Punctuation 34
 
1.4%
Close Punctuation 34
 
1.4%
Dash Punctuation 15
 
0.6%
Uppercase Letter 2
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
88
 
5.9%
87
 
5.8%
69
 
4.6%
56
 
3.7%
52
 
3.5%
45
 
3.0%
43
 
2.9%
40
 
2.7%
36
 
2.4%
35
 
2.3%
Other values (194) 949
63.3%
Decimal Number
ValueCountFrequency (%)
1 60
16.9%
2 48
13.5%
3 39
11.0%
5 39
11.0%
4 39
11.0%
7 39
11.0%
6 34
9.6%
8 22
 
6.2%
0 19
 
5.3%
9 17
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
D 1
50.0%
K 1
50.0%
Space Separator
ValueCountFrequency (%)
443
100.0%
Open Punctuation
ValueCountFrequency (%)
( 34
100.0%
Close Punctuation
ValueCountFrequency (%)
) 34
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1500
62.9%
Common 883
37.0%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
88
 
5.9%
87
 
5.8%
69
 
4.6%
56
 
3.7%
52
 
3.5%
45
 
3.0%
43
 
2.9%
40
 
2.7%
36
 
2.4%
35
 
2.3%
Other values (194) 949
63.3%
Common
ValueCountFrequency (%)
443
50.2%
1 60
 
6.8%
2 48
 
5.4%
3 39
 
4.4%
5 39
 
4.4%
4 39
 
4.4%
7 39
 
4.4%
( 34
 
3.9%
) 34
 
3.9%
6 34
 
3.9%
Other values (5) 74
 
8.4%
Latin
ValueCountFrequency (%)
D 1
50.0%
K 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1500
62.9%
ASCII 885
37.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
443
50.1%
1 60
 
6.8%
2 48
 
5.4%
3 39
 
4.4%
5 39
 
4.4%
4 39
 
4.4%
7 39
 
4.4%
( 34
 
3.8%
) 34
 
3.8%
6 34
 
3.8%
Other values (7) 76
 
8.6%
Hangul
ValueCountFrequency (%)
88
 
5.9%
87
 
5.8%
69
 
4.6%
56
 
3.7%
52
 
3.5%
45
 
3.0%
43
 
2.9%
40
 
2.7%
36
 
2.4%
35
 
2.3%
Other values (194) 949
63.3%

대상연도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2017
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2017 100
100.0%

Length

2023-12-10T19:41:39.957476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:40.128355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 100
100.0%

지정구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
사업장
81 
업체
19 

Length

Max length3
Median length3
Mean length2.81
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장
2nd row사업장
3rd row사업장
4th row사업장
5th row사업장

Common Values

ValueCountFrequency (%)
사업장 81
81.0%
업체 19
 
19.0%

Length

2023-12-10T19:41:40.333221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:40.540291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장 81
81.0%
업체 19
 
19.0%

지정업종
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)28.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
철강
12 
교통(여객)
11 
석유화학
건물
비철금속
Other values (23)
56 

Length

Max length14
Median length10
Mean length4.29
Min length2

Unique

Unique7 ?
Unique (%)7.0%

Sample

1st row석유화학
2nd row교통(여객)
3rd row철강
4th row음료제조업
5th row음식료품

Common Values

ValueCountFrequency (%)
철강 12
 
12.0%
교통(여객) 11
 
11.0%
석유화학 7
 
7.0%
건물 7
 
7.0%
비철금속 7
 
7.0%
자동차 5
 
5.0%
반도체.디스플레이.전기전자 5
 
5.0%
식료품 제조업 5
 
5.0%
섬유 4
 
4.0%
제지 4
 
4.0%
Other values (18) 33
33.0%

Length

2023-12-10T19:41:40.865674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
철강 12
 
10.9%
교통(여객 11
 
10.0%
석유화학 7
 
6.4%
건물 7
 
6.4%
비철금속 7
 
6.4%
자동차 5
 
4.5%
반도체.디스플레이.전기전자 5
 
4.5%
식료품 5
 
4.5%
제조업 5
 
4.5%
제지 4
 
3.6%
Other values (20) 42
38.2%
Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:41:41.546111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length6
Mean length6.11
Min length5

Characters and Unicode

Total characters611
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)98.0%

Sample

1st row52,368
2nd row24,726
3rd row30,616
4th row23,319
5th row25,355
ValueCountFrequency (%)
24,314 2
 
2.0%
28,647 1
 
1.0%
36,978 1
 
1.0%
30,582 1
 
1.0%
44,686 1
 
1.0%
57,264 1
 
1.0%
23,053 1
 
1.0%
15,231 1
 
1.0%
74,991 1
 
1.0%
537,928 1
 
1.0%
Other values (89) 89
89.0%
2023-12-10T19:41:42.447549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 101
16.5%
1 69
11.3%
2 63
10.3%
3 61
10.0%
6 54
8.8%
5 53
8.7%
4 50
8.2%
8 45
7.4%
7 42
6.9%
9 41
6.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 510
83.5%
Other Punctuation 101
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 69
13.5%
2 63
12.4%
3 61
12.0%
6 54
10.6%
5 53
10.4%
4 50
9.8%
8 45
8.8%
7 42
8.2%
9 41
8.0%
0 32
6.3%
Other Punctuation
ValueCountFrequency (%)
, 101
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 611
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
, 101
16.5%
1 69
11.3%
2 63
10.3%
3 61
10.0%
6 54
8.8%
5 53
8.7%
4 50
8.2%
8 45
7.4%
7 42
6.9%
9 41
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 611
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 101
16.5%
1 69
11.3%
2 63
10.3%
3 61
10.0%
6 54
8.8%
5 53
8.7%
4 50
8.2%
8 45
7.4%
7 42
6.9%
9 41
6.7%

온실가스 배출량 단위
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
tCO₂eq
100 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowtCO₂eq
2nd rowtCO₂eq
3rd rowtCO₂eq
4th rowtCO₂eq
5th rowtCO₂eq

Common Values

ValueCountFrequency (%)
tCO₂eq 100
100.0%

Length

2023-12-10T19:41:42.756610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:42.939474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
tco₂eq 100
100.0%

온실가스 배출량 원단위 값
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct20
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
81 
177.87
 
1
62.19
 
1
228.69
 
1
20.66
 
1
Other values (15)
15 

Length

Max length6
Median length1
Mean length1.73
Min length1

Unique

Unique19 ?
Unique (%)19.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 81
81.0%
177.87 1
 
1.0%
62.19 1
 
1.0%
228.69 1
 
1.0%
20.66 1
 
1.0%
0.68 1
 
1.0%
6.92 1
 
1.0%
0.62 1
 
1.0%
63.59 1
 
1.0%
10.19 1
 
1.0%
Other values (10) 10
 
10.0%

Length

2023-12-10T19:41:43.147045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
81
81.0%
177.87 1
 
1.0%
72.38 1
 
1.0%
419.89 1
 
1.0%
7.17 1
 
1.0%
329.13 1
 
1.0%
3.63 1
 
1.0%
105.1 1
 
1.0%
4.19 1
 
1.0%
48.09 1
 
1.0%
Other values (10) 10
 
10.0%

온실가스 배출량 원단위 단위
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
81 
tCO2eq/억원
19 

Length

Max length9
Median length1
Mean length2.52
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 81
81.0%
tCO2eq/억원 19
 
19.0%

Length

2023-12-10T19:41:43.446147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:43.638162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
81
81.0%
tco2eq/억원 19
 
19.0%
Distinct97
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:41:44.168296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.45
Min length2

Characters and Unicode

Total characters345
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)94.0%

Sample

1st row1,123
2nd row381
3rd row534
4th row287
5th row506
ValueCountFrequency (%)
364 2
 
2.0%
406 2
 
2.0%
417 2
 
2.0%
308 1
 
1.0%
2,308 1
 
1.0%
422 1
 
1.0%
679 1
 
1.0%
614 1
 
1.0%
912 1
 
1.0%
1,130 1
 
1.0%
Other values (87) 87
87.0%
2023-12-10T19:41:45.090767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 48
13.9%
1 41
11.9%
2 41
11.9%
4 40
11.6%
7 32
9.3%
0 30
8.7%
5 26
7.5%
6 24
7.0%
, 23
6.7%
8 22
6.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 322
93.3%
Other Punctuation 23
 
6.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 48
14.9%
1 41
12.7%
2 41
12.7%
4 40
12.4%
7 32
9.9%
0 30
9.3%
5 26
8.1%
6 24
7.5%
8 22
6.8%
9 18
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 345
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 48
13.9%
1 41
11.9%
2 41
11.9%
4 40
11.6%
7 32
9.3%
0 30
8.7%
5 26
7.5%
6 24
7.0%
, 23
6.7%
8 22
6.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 345
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 48
13.9%
1 41
11.9%
2 41
11.9%
4 40
11.6%
7 32
9.3%
0 30
8.7%
5 26
7.5%
6 24
7.0%
, 23
6.7%
8 22
6.4%

에너지 사용량 단위
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
TJ
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTJ
2nd rowTJ
3rd rowTJ
4th rowTJ
5th rowTJ

Common Values

ValueCountFrequency (%)
TJ 100
100.0%

Length

2023-12-10T19:41:45.536153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:45.754397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
tj 100
100.0%

에너지 사용량 원단위 값
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct17
Distinct (%)17.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
81 
0.14
 
2
0.15
 
2
0.01
 
2
0.09
 
1
Other values (12)
12 

Length

Max length4
Median length1
Mean length1.56
Min length1

Unique

Unique13 ?
Unique (%)13.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 81
81.0%
0.14 2
 
2.0%
0.15 2
 
2.0%
0.01 2
 
2.0%
0.09 1
 
1.0%
1.14 1
 
1.0%
4.78 1
 
1.0%
1.41 1
 
1.0%
0.07 1
 
1.0%
2.06 1
 
1.0%
Other values (7) 7
 
7.0%

Length

2023-12-10T19:41:46.023666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
81
81.0%
0.15 2
 
2.0%
0.01 2
 
2.0%
0.14 2
 
2.0%
1.93 1
 
1.0%
0.42 1
 
1.0%
0.41 1
 
1.0%
0.96 1
 
1.0%
0.98 1
 
1.0%
0.69 1
 
1.0%
Other values (7) 7
 
7.0%

에너지 사용량 원단위 단위
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
81 
TJ/억원
19 

Length

Max length5
Median length1
Mean length1.76
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 81
81.0%
TJ/억원 19
 
19.0%

Length

2023-12-10T19:41:46.295737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:41:46.571399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
81
81.0%
tj/억원 19
 
19.0%
Distinct16
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
한국표준협회
35 
(주)한국경영인증원
13 
(재)한국품질재단
11 
이큐에이㈜
11 
㈜한국품질보증원
Other values (11)
25 

Length

Max length19
Median length18
Mean length8.15
Min length5

Unique

Unique4 ?
Unique (%)4.0%

Sample

1st row(재)한국품질재단
2nd row(주)한국경영인증원
3rd row한국표준협회
4th row(재)한국화학융합시험연구원
5th row한국산업기술시험원

Common Values

ValueCountFrequency (%)
한국표준협회 35
35.0%
(주)한국경영인증원 13
 
13.0%
(재)한국품질재단 11
 
11.0%
이큐에이㈜ 11
 
11.0%
㈜한국품질보증원 5
 
5.0%
산림조합중앙회 4
 
4.0%
(주)비에스아이그룹코리아 4
 
4.0%
한국산업기술시험원 3
 
3.0%
한국생산성본부인증원(주) 3
 
3.0%
한국가스안전공사 3
 
3.0%
Other values (6) 8
 
8.0%

Length

2023-12-10T19:41:47.308231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한국표준협회 35
35.0%
주)한국경영인증원 13
 
13.0%
재)한국품질재단 11
 
11.0%
이큐에이㈜ 11
 
11.0%
㈜한국품질보증원 5
 
5.0%
산림조합중앙회 4
 
4.0%
주)비에스아이그룹코리아 4
 
4.0%
한국산업기술시험원 3
 
3.0%
한국생산성본부인증원(주 3
 
3.0%
한국가스안전공사 3
 
3.0%
Other values (6) 8
 
8.0%

매출액(원)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct22
Distinct (%)22.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-
79 
229,245,382,265
 
1
286,641,971,000
 
1
95,256,928,000
 
1
288,103,985,000
 
1
Other values (17)
17 

Length

Max length18
Median length1
Mean length4.06
Min length1

Unique

Unique21 ?
Unique (%)21.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 79
79.0%
229,245,382,265 1
 
1.0%
286,641,971,000 1
 
1.0%
95,256,928,000 1
 
1.0%
288,103,985,000 1
 
1.0%
16,236,933,000,000 1
 
1.0%
1,159,461,915,100 1
 
1.0%
11,332,055,896,000 1
 
1.0%
250,704,777,253 1
 
1.0%
413,905,870,000 1
 
1.0%
Other values (12) 12
 
12.0%

Length

2023-12-10T19:41:47.688592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
79
79.0%
229,245,382,265 1
 
1.0%
1,927,463,423,000 1
 
1.0%
30,577,002,375 1
 
1.0%
798,578,906,000 1
 
1.0%
163,437,678,000 1
 
1.0%
39,003,625,000 1
 
1.0%
3,095,560,472,000 1
 
1.0%
439,005,019,516 1
 
1.0%
1,569,204,857,000 1
 
1.0%
Other values (12) 12
 
12.0%

Interactions

2023-12-10T19:41:34.757345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:41:47.930768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소관부처법인명주소지정구분지정업종온실가스 배출량온실가스 배출량 원단위 값온실가스 배출량 원단위 단위에너지 사용량에너지 사용량 원단위 값에너지 사용량 원단위 단위검증수행기관매출액(원)
연번1.0000.0001.0000.9740.0000.4360.9400.1220.0000.9390.2590.0000.3590.129
소관부처0.0001.0001.0001.0000.1911.0001.0000.0000.1910.0000.2360.1910.5060.443
법인명1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
주소0.9741.0001.0001.0001.0000.9970.9970.0001.0000.9920.0001.0001.0000.000
지정구분0.0000.1911.0001.0001.0000.3041.0001.0000.9991.0001.0000.9990.0001.000
지정업종0.4361.0001.0000.9970.3041.0000.9970.5470.3040.9810.5660.3040.7490.680
온실가스 배출량0.9401.0001.0000.9971.0000.9971.0001.0001.0000.9971.0001.0000.9941.000
온실가스 배출량 원단위 값0.1220.0001.0000.0001.0000.5471.0001.0001.0001.0001.0001.0000.0001.000
온실가스 배출량 원단위 단위0.0000.1911.0001.0000.9990.3041.0001.0001.0001.0001.0000.9990.0001.000
에너지 사용량0.9390.0001.0000.9921.0000.9810.9971.0001.0001.0001.0001.0000.0001.000
에너지 사용량 원단위 값0.2590.2361.0000.0001.0000.5661.0001.0001.0001.0001.0001.0000.0001.000
에너지 사용량 원단위 단위0.0000.1911.0001.0000.9990.3041.0001.0000.9991.0001.0001.0000.0001.000
검증수행기관0.3590.5061.0001.0000.0000.7490.9940.0000.0000.0000.0000.0001.0000.000
매출액(원)0.1290.4431.0000.0001.0000.6801.0001.0001.0001.0001.0001.0000.0001.000
2023-12-10T19:41:48.305527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
온실가스 배출량 원단위 단위에너지 사용량 원단위 값매출액(원)지정구분에너지 사용량 원단위 단위소관부처온실가스 배출량 원단위 값검증수행기관지정업종
온실가스 배출량 원단위 단위1.0000.9200.8920.9670.9670.2290.9040.0000.200
에너지 사용량 원단위 값0.9201.0000.9690.9200.9200.1080.9820.0000.172
매출액(원)0.8920.9691.0000.8920.8920.2100.9870.0000.220
지정구분0.9670.9200.8921.0000.9670.2290.9040.0000.200
에너지 사용량 원단위 단위0.9670.9200.8920.9671.0000.2290.9040.0000.200
소관부처0.2290.1080.2100.2290.2291.0000.0000.2670.871
온실가스 배출량 원단위 값0.9040.9820.9870.9040.9040.0001.0000.0000.154
검증수행기관0.0000.0000.0000.0000.0000.2670.0001.0000.289
지정업종0.2000.1720.2200.2000.2000.8710.1540.2891.000
2023-12-10T19:41:48.687045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소관부처지정구분지정업종온실가스 배출량 원단위 값온실가스 배출량 원단위 단위에너지 사용량 원단위 값에너지 사용량 원단위 단위검증수행기관매출액(원)
연번1.0000.0000.0000.1440.0000.0000.0900.0000.1380.000
소관부처0.0001.0000.2290.8710.0000.2290.1080.2290.2670.210
지정구분0.0000.2291.0000.2000.9040.9670.9200.9670.0000.892
지정업종0.1440.8710.2001.0000.1540.2000.1720.2000.2890.220
온실가스 배출량 원단위 값0.0000.0000.9040.1541.0000.9040.9820.9040.0000.987
온실가스 배출량 원단위 단위0.0000.2290.9670.2000.9041.0000.9200.9670.0000.892
에너지 사용량 원단위 값0.0900.1080.9200.1720.9820.9201.0000.9200.0000.969
에너지 사용량 원단위 단위0.0000.2290.9670.2000.9040.9670.9201.0000.0000.892
검증수행기관0.1380.2670.0000.2890.0000.0000.0000.0001.0000.000
매출액(원)0.0000.2100.8920.2200.9870.8920.9690.8920.0001.000

Missing values

2023-12-10T19:41:35.051203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:41:35.526931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번소관부처법인명주소대상연도지정구분지정업종온실가스 배출량온실가스 배출량 단위온실가스 배출량 원단위 값온실가스 배출량 원단위 단위에너지 사용량에너지 사용량 단위에너지 사용량 원단위 값에너지 사용량 원단위 단위검증수행기관매출액(원)
01산업통상자원부(유)에스케이씨에보닉페록사이드코리아울산광역시 남구 상개로 99(상개동)2017사업장석유화학52,368tCO₂eq--1,123TJ--(재)한국품질재단-
12국토교통부(유)호남고속전라북도 전주시 덕진구 신복천변로 28 (팔복동1가)2017사업장교통(여객)24,726tCO₂eq--381TJ--(주)한국경영인증원-
23산업통상자원부(주)HM금속경상남도 함안군 군북면 함안산단로 1702017사업장철강30,616tCO₂eq--534TJ--한국표준협회-
34농림축산식품부(주)MH에탄올경상남도 창원시 마산회원구 내서읍 광려천남로 252017사업장음료제조업23,319tCO₂eq--287TJ--(재)한국화학융합시험연구원-
45농림축산식품부(주)MSC경상남도 양산시 소주회야로 45-732017사업장음식료품25,355tCO₂eq--506TJ--한국산업기술시험원-
56산업통상자원부(주)SIMPAC METAL경상북도 포항시 남구 괴동로 1532017업체철강407,757tCO₂eq177.87tCO2eq/억원4,430TJ1.93TJ/억원한국표준협회229,245,382,265
67산업통상자원부(주)SPP조선경남 사천시 사남면 해안산업로 5372017사업장조선3,396tCO₂eq--56TJ--㈜디엔브이지엘비즈니스어슈어런스코리아-
78국토교통부(주)강원랜드강원도 정선군 사북읍 하이원길 2652017사업장건물71,158tCO₂eq--1,438TJ--한국표준협회-
89산업통상자원부(주)건화경상남도 거제시 연초면 연하해안로841-542017사업장조선25,198tCO₂eq--437TJ--한국표준협회-
910국토교통부(주)경기고속경기도 광주시 광주대로 1712017업체교통(여객)178,260tCO₂eq62.19tCO2eq/억원2,738TJ0.96TJ/억원산림조합중앙회286,641,971,000
연번소관부처법인명주소대상연도지정구분지정업종온실가스 배출량온실가스 배출량 단위온실가스 배출량 원단위 값온실가스 배출량 원단위 단위에너지 사용량에너지 사용량 단위에너지 사용량 원단위 값에너지 사용량 원단위 단위검증수행기관매출액(원)
9091산업통상자원부(주)세아베스틸서울특별시 마포구 양화로 45(서교동)2017업체철강1,395,133tCO₂eq72.38tCO2eq/억원22,013TJ1.14TJ/억원한국표준협회1,927,463,423,000
9192산업통상자원부(주)세아씨엠전라북도 군산시 자유로 241(소룡동)2017사업장철강64,654tCO₂eq--1,322TJ--한국표준협회-
9293국토교통부(주)세아엘앤에스경상북도 포항시 남구 철강로 348 (호동)2017사업장교통(화물)18,675tCO₂eq--272TJ--한국표준협회-
9394산업통상자원부(주)세아제강서울특별시 마포구 양화로 45 세아타워2017사업장철강79,202tCO₂eq--1,611TJ--한국표준협회-
9495산업통상자원부(주)세아특수강경상북도 포항시 남구 괴동로 402017사업장철강26,853tCO₂eq--535TJ--한국표준협회-
9596해양수산부(주)세주부산광역시 중구 해관로 65 (중앙동4가)2017사업장교통(해운)30,544tCO₂eq--417TJ--㈜디엔브이지엘비즈니스어슈어런스코리아-
9697산업통상자원부(주)스타플렉스충청북도 음성군 삼성면 대성로 4172017사업장석유화학15,797tCO₂eq--321TJ--이큐에이㈜-
9798산업통상자원부(주)신동강원도 정선군 남면 곰골길 159-132017사업장시멘트78,689tCO₂eq--253TJ--한국표준협회-
9899산업통상자원부(주)신성이엔지충청북도 증평군 증평읍 증평산단로 142017사업장반도체23,655tCO₂eq--452TJ--한국표준협회-
99100국토교통부(주)신세계서울특별시 중구 소공로 63(충무로1가)2017업체건물165,810tCO₂eq9.96tCO2eq/억원3,382TJ0.2TJ/억원㈜한국품질보증원1,665,520,576,000