Overview

Dataset statistics

Number of variables: 6
Number of observations: 127
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.2 KiB
Average record size in memory: 50.0 B

Variable types

Categorical: 5
Numeric: 1

Dataset

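Description: Customs-clearance volumes of fuel for the power plants of Korea Western Power (한국서부발전). The data provided include the quantity in tons (수량(톤)) by year-month (년월), the fuel name (연료명), and country information (국가코드, 국가명), among other fields.
URL: https://www.data.go.kr/data/15106314/fileData.do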

Alerts

단위 has constant value "MT" (Constant)
연료명 has constant value "유연탄" (Constant)
국가코드 is highly overall correlated with 국가명 (High correlation)
국가명 is highly overall correlated with 국가코드 (High correlation)
수량(톤) has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 21:52:24.743737
Analysis finished: 2023-12-12 21:52:25.210443
Duration: 0.47 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

년월
Categorical

Distinct: 30
Distinct (%): 23.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
2022-01 | 6
2022-07 | 6
2021-03 | 6
2022-06 | 5
2023-01 | 5
Other values (25) | 99

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-01
2nd row: 2021-01
3rd row: 2021-01
4th row: 2021-02
5th row: 2021-02

Common Values

Value | Count | Frequency (%)
2022-01 | 6 | 4.7%
2022-07 | 6 | 4.7%
2021-03 | 6 | 4.7%
2022-06 | 5 | 3.9%
2023-01 | 5 | 3.9%
2022-05 | 5 | 3.9%
2023-06 | 5 | 3.9%
2022-08 | 5 | 3.9%
2022-12 | 5 | 3.9%
2023-05 | 5 | 3.9%
Other values (20) | 74 | 58.3%
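
The Frequency (%) column follows from each count divided by the 127 observations reported in the Overview. The short sketch below re-derives a few of the percentages from the table; the function name frequency_pct is illustrative and not part of ydata-profiling.

```python
# Counts copied from the 년월 "Common Values" table above.
N_OBSERVATIONS = 127  # "Number of observations" in the Overview

counts = {
    "2022-01": 6,
    "2022-06": 5,
    "Other values (20)": 74,
}


def frequency_pct(count: int, total: int = N_OBSERVATIONS) -> float:
    """Share of rows holding a value, in percent, rounded to one decimal."""
    return round(100 * count / total, 1)


if __name__ == "__main__":
    for value, count in counts.items():
        print(f"{value}: {frequency_pct(count)}%")
    # The same arithmetic gives "Distinct (%)": 30 distinct values out of 127 rows.
    print(f"Distinct (%): {frequency_pct(30)}%")
```

The results agree with the 4.7%, 3.9%, 58.3% and 23.6% figures shown in the report.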

Length

[Figure: histogram of lengths of the category]

수량(톤)
Real number (ℝ)

UNIQUE 

Distinct: 127
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 229978.25
Minimum: 12923
Maximum: 827861
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 12923
5-th percentile: 74979.9
Q1: 110331
median: 167120
Q3: 281880
95-th percentile: 563663.1
Maximum: 827861
Range: 814938
Interquartile range (IQR): 171549
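
Range and IQR are simple differences of the quantiles listed above. As a quick consistency check, the sketch below recomputes both from the reported minimum, maximum, Q1 and Q3; the variable names are illustrative.

```python
# Quantile values copied from the "Quantile statistics" list for 수량(톤).
minimum = 12923
q1 = 110331
q3 = 281880
maximum = 827861

# Range is the spread between the extremes.
value_range = maximum - minimum

# The interquartile range is the spread of the middle 50% of the values.
iqr = q3 - q1

if __name__ == "__main__":
    print("Range:", value_range)
    print("IQR:", iqr)
```

Both results match the report: 814938 for the range and 171549 for the IQR.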

Descriptive statistics

Standard deviation: 166285.14
Coefficient of variation (CV): 0.72304723
Kurtosis: 2.6531313
Mean: 229978.25
Median Absolute Deviation (MAD): 85947
Skewness: 1.6145534
Sum: 29207238
Variance: 2.7650747 × 10^10
Monotonicity: Not monotonic
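
Several of these figures are tied together by definition: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The sketch below checks those relations using the rounded values printed in the report, so small rounding differences are expected.

```python
# Values copied from the report for 수량(톤); all are rounded as displayed.
n_observations = 127
total = 29207238
mean = 229978.25
std_dev = 166285.14

# Mean recomputed from the sum and the row count.
mean_from_sum = total / n_observations

# Coefficient of variation: standard deviation relative to the mean.
cv = std_dev / mean

# Variance: square of the standard deviation.
variance = std_dev ** 2

if __name__ == "__main__":
    print(f"mean from sum: {mean_from_sum:.2f}")
    print(f"CV:            {cv:.6f}")
    print(f"variance:      {variance:.4e}")
```

Within rounding, the results agree with the reported mean (229978.25), CV (0.72304723) and variance (2.7650747 × 10^10).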
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
77211.0 | 1 | 0.8%
160987.0 | 1 | 0.8%
279307.0 | 1 | 0.8%
596048.0 | 1 | 0.8%
160315.0 | 1 | 0.8%
159901.0 | 1 | 0.8%
164880.0 | 1 | 0.8%
135375.0 | 1 | 0.8%
202480.0 | 1 | 0.8%
244632.0 | 1 | 0.8%
Other values (117) | 117 | 92.1%

Minimum 10 values

Value | Count | Frequency (%)
12923.0 | 1 | 0.8%
44000.0 | 1 | 0.8%
68025.0 | 1 | 0.8%
72380.0 | 1 | 0.8%
73505.0 | 1 | 0.8%
74000.0 | 1 | 0.8%
74949.0 | 1 | 0.8%
75052.0 | 1 | 0.8%
75121.0 | 1 | 0.8%
75363.0 | 1 | 0.8%

Maximum 10 values

Value | Count | Frequency (%)
827861.0 | 1 | 0.8%
824556.0 | 1 | 0.8%
788341.0 | 1 | 0.8%
708123.0 | 1 | 0.8%
602306.0 | 1 | 0.8%
596048.0 | 1 | 0.8%
563901.0 | 1 | 0.8%
563108.0 | 1 | 0.8%
549155.0 | 1 | 0.8%
510564.0 | 1 | 0.8%

단위
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
MT | 127

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: MT
2nd row: MT
3rd row: MT
4th row: MT
5th row: MT

Common Values

Value | Count | Frequency (%)
MT | 127 | 100.0%

Length

[Figure: histogram of lengths of the category]

[Figure: Common Values (Plot)]

연료명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
유연탄 | 127

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유연탄
2nd row: 유연탄
3rd row: 유연탄
4th row: 유연탄
5th row: 유연탄

Common Values

Value | Count | Frequency (%)
유연탄 | 127 | 100.0%

Length

[Figure: histogram of lengths of the category]

[Figure: Common Values (Plot)]

국가코드
Categorical

HIGH CORRELATION 

Distinct: 10
Distinct (%): 7.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
AU | 30
RU | 27
ID | 25
US | 16
ZA | 16
Other values (5) | 13

Length

Max length: 2
Median length: 2
Mean length: 1.984252
Min length: 1

Unique

Unique: 3
Unique (%): 2.4%

Sample

1st row: US
2nd row: ID
3rd row: AU
4th row: RU
5th row: ID

Common Values

Value | Count | Frequency (%)
AU | 30 | 23.6%
RU | 27 | 21.3%
ID | 25 | 19.7%
US | 16 | 12.6%
ZA | 16 | 12.6%
CO | 8 | 6.3%
C | 2 | 1.6%
DE | 1 | 0.8%
CH | 1 | 0.8%
PH | 1 | 0.8%
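
The length statistics for 국가코드 (min length 1, mean length 1.984252) indicate that not every code has two characters. The sketch below, using only the counts from the table above, recomputes the mean length and lists the codes that are not two characters long. Treating two characters as the expected format is an assumption based on the other values in the table.

```python
# Counts copied from the 국가코드 "Common Values" table above.
code_counts = {
    "AU": 30, "RU": 27, "ID": 25, "US": 16, "ZA": 16,
    "CO": 8, "C": 2, "DE": 1, "CH": 1, "PH": 1,
}

EXPECTED_LENGTH = 2  # assumption: codes are meant to be two letters

total_rows = sum(code_counts.values())

# Mean string length, weighted by how often each code occurs.
mean_length = sum(len(code) * n for code, n in code_counts.items()) / total_rows

# Codes whose length differs from the expected format.
suspicious = [code for code in code_counts if len(code) != EXPECTED_LENGTH]

if __name__ == "__main__":
    print("rows:", total_rows)
    print("mean length:", round(mean_length, 6))
    print("codes to review:", suspicious)
```

The only code flagged is "C" (2 rows). Together with the 8 rows for "CO", this adds up to the 10 rows reported for 콜롬비아 in 국가명, which suggests, but does not prove, that "C" is a truncated "CO".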

Length

[Figure: histogram of lengths of the category]

[Figure: Common Values (Plot)]

국가명
Categorical

HIGH CORRELATION 

Distinct: 9
Distinct (%): 7.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
호주 | 30
러시아 | 27
인니 | 25
미국 | 16
남아공 | 16
Other values (4) | 13

Length

Max length: 4
Median length: 2
Mean length: 2.511811
Min length: 2

Unique

Unique: 3
Unique (%): 2.4%

Sample

1st row: 미국
2nd row: 인니
3rd row: 호주
4th row: 러시아
5th row: 인니

Common Values

Value | Count | Frequency (%)
호주 | 30 | 23.6%
러시아 | 27 | 21.3%
인니 | 25 | 19.7%
미국 | 16 | 12.6%
남아공 | 16 | 12.6%
콜롬비아 | 10 | 7.9%
덴마크 | 1 | 0.8%
중국 | 1 | 0.8%
필리핀 | 1 | 0.8%

Length

[Figure: histogram of lengths of the category]

[Figure: Common Values (Plot)]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

Variable | 년월 | 수량(톤) | 국가코드 | 국가명
년월 | 1.000 | 0.460 | 0.000 | 0.000
수량(톤) | 0.460 | 1.000 | 0.074 | 0.210
국가코드 | 0.000 | 0.074 | 1.000 | 1.000
국가명 | 0.000 | 0.210 | 1.000 | 1.000

[Figure: correlation heatmap]

Variable | 국가코드 | 국가명 | 년월
국가코드 | 1.000 | 0.996 | 0.000
국가명 | 0.996 | 1.000 | 0.000
년월 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 수량(톤) | 년월 | 국가코드 | 국가명
수량(톤) | 1.000 | 0.148 | 0.000 | 0.088
년월 | 0.148 | 1.000 | 0.000 | 0.000
국가코드 | 0.000 | 0.000 | 1.000 | 0.996
국가명 | 0.088 | 0.000 | 0.996 | 1.000
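
The text of this report does not name the association measures behind these matrices. One common measure for two categorical columns is Cramér's V, so the sketch below is an illustration under that assumption and not necessarily the method the report used. It computes the uncorrected Cramér's V for 국가코드 and 국가명 on the 20 rows shown in the Sample section of this report. The function name cramers_v is illustrative.

```python
from collections import Counter
from math import sqrt


def cramers_v(pairs):
    """Uncorrected Cramér's V for a list of (x, y) category pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    row_totals = Counter(x for x, _ in pairs)
    col_totals = Counter(y for _, y in pairs)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals))
    if k < 2:
        return 0.0  # association is undefined with a single category
    return sqrt(chi2 / (n * (k - 1)))


# (국가코드, 국가명) pairs from the first and last rows in the Sample section.
sample_pairs = [
    ("US", "미국"), ("ID", "인니"), ("AU", "호주"), ("RU", "러시아"),
    ("ID", "인니"), ("AU", "호주"), ("DE", "덴마크"), ("RU", "러시아"),
    ("US", "미국"), ("ID", "인니"),
    ("RU", "러시아"), ("US", "미국"), ("ID", "인니"), ("CO", "콜롬비아"),
    ("AU", "호주"), ("RU", "러시아"), ("US", "미국"), ("ID", "인니"),
    ("CO", "콜롬비아"), ("AU", "호주"),
]

if __name__ == "__main__":
    print(f"Cramér's V on the sample rows: {cramers_v(sample_pairs):.3f}")
```

On these 20 rows every code maps to exactly one name, so the measure reaches its maximum of 1. A value slightly below 1 on the full data, such as the 0.996 above, would be consistent with 10 distinct codes mapping onto 9 distinct names.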

Missing values

[Figure: nullity bar chart] A simple visualization of nullity by column.
[Figure: nullity matrix] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

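First rows

Index | 년월 | 수량(톤) | 단위 | 연료명 | 국가코드 | 국가명
0 | 2021-01 | 77211.0 | MT | 유연탄 | US | 미국
1 | 2021-01 | 160987.0 | MT | 유연탄 | ID | 인니
2 | 2021-01 | 462461.0 | MT | 유연탄 | AU | 호주
3 | 2021-02 | 282058.0 | MT | 유연탄 | RU | 러시아
4 | 2021-02 | 144561.0 | MT | 유연탄 | ID | 인니
5 | 2021-02 | 708123.0 | MT | 유연탄 | AU | 호주
6 | 2021-03 | 152288.686 | MT | 유연탄 | DE | 덴마크
7 | 2021-03 | 44000.0 | MT | 유연탄 | RU | 러시아
8 | 2021-03 | 151640.0 | MT | 유연탄 | US | 미국
9 | 2021-03 | 194232.0 | MT | 유연탄 | ID | 인니

Last rows

Index | 년월 | 수량(톤) | 단위 | 연료명 | 국가코드 | 국가명
117 | 2023-05 | 312400.0 | MT | 유연탄 | RU | 러시아
118 | 2023-05 | 79671.0 | MT | 유연탄 | US | 미국
119 | 2023-05 | 110065.0 | MT | 유연탄 | ID | 인니
120 | 2023-05 | 80173.0 | MT | 유연탄 | CO | 콜롬비아
121 | 2023-05 | 110037.0 | MT | 유연탄 | AU | 호주
122 | 2023-06 | 371891.0 | MT | 유연탄 | RU | 러시아
123 | 2023-06 | 74949.0 | MT | 유연탄 | US | 미국
124 | 2023-06 | 313123.0 | MT | 유연탄 | ID | 인니
125 | 2023-06 | 142657.0 | MT | 유연탄 | CO | 콜롬비아
126 | 2023-06 | 111365.0 | MT | 유연탄 | AU | 호주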