Overview

Dataset statistics

Number of variables: 3
Number of observations: 29
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 857.0 B
Average record size in memory: 29.6 B

Variable types

DateTime: 1
Categorical: 1
Numeric: 1

Dataset

Description: Unused forest biomass data from Korea Western Power (한국서부발전). The data give the unused forest biomass (미이용산림바이오, in tonnes) for each plant site (사업소).
URL: https://www.data.go.kr/data/15106333/fileData.do

Alerts

사업소 (plant site) has constant value "태안" (Constant)
연월 (year-month) has unique values (Unique)
미이용산림바이오(톤) (unused forest biomass, tonnes) has 12 (41.4%) zeros (Zeros)
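
The three alerts can be checked by hand from the column contents. The sketch below is a plain-Python approximation, not ydata-profiling's own implementation: the helper name `column_alerts` is hypothetical, and the 미이용산림바이오(톤) values are reassembled from the value counts in this report, so the order of rows 10 to 18 is an assumption.

```python
from datetime import date


def column_alerts(values):
    """Return simple alert labels for one column (hypothetical helper)."""
    alerts = []
    distinct = set(values)
    if len(distinct) == 1:
        alerts.append("Constant")
    elif len(distinct) == len(values):
        alerts.append("Unique")
    zeros = sum(1 for v in values if v == 0)
    if zeros:
        share = 100 * zeros / len(values)
        alerts.append(f"Zeros: {zeros} ({share:.1f}%)")
    return alerts


# 29 monthly timestamps from 2020-08 to 2022-12, as reported for 연월.
months = [date(2020 + (7 + i) // 12, (7 + i) % 12 + 1, 1) for i in range(29)]

# 사업소 holds the same site name in every row.
sites = ["태안"] * 29

# First and last ten rows come from the Sample section; the middle rows
# are filled from the value counts, in an ASSUMED order.
tonnes = (
    [704, 2894, 1886, 2507, 4064, 2803, 3359, 3654, 2081, 3055]
    + [350, 1312, 2032, 2280, 2323, 2390, 4683, 0, 0]
    + [0] * 10
)

print(column_alerts(months))
print(column_alerts(sites))
print(column_alerts(tonnes))
```

Note that this helper flags any column containing a zero, whereas a profiling tool would typically apply a threshold before raising a Zeros alert.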

Reproduction

Analysis started: 2023-12-12 10:12:18.528128
Analysis finished: 2023-12-12 10:12:18.834659
Duration: 0.31 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
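
As a quick consistency check, the reported duration follows from the two timestamps. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-12 10:12:18.528128")
finished = datetime.fromisoformat("2023-12-12 10:12:18.834659")

# Elapsed time in seconds, rounded to two decimals for display.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```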

Variables

연월
Date

UNIQUE 

Distinct: 29
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B
Minimum: 2020-08-01 00:00:00
Maximum: 2022-12-01 00:00:00
Histogram with fixed size bins (bins=29)

사업소
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B
태안: 29

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 태안
2nd row: 태안
3rd row: 태안
4th row: 태안
5th row: 태안

Common Values

Value  Count  Frequency (%)
태안   29     100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
태안   29     100.0%

미이용산림바이오(톤)
Real number (ℝ)

ZEROS 

Distinct: 18
Distinct (%): 62.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1461.2759
Minimum: 0
Maximum: 4683
Zeros: 12
Zeros (%): 41.4%
Negative: 0
Negative (%): 0.0%
Memory size: 393.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 1312
Q3: 2507
95-th percentile: 3900
Maximum: 4683
Range: 4683
Interquartile range (IQR): 2507
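
The quantiles above are consistent with linear interpolation between closest ranks. The sketch below shows that calculation in plain Python; the `percentile` helper is hypothetical, and the input values are reassembled from the value counts in this report (12 zeros plus 17 distinct non-zero values), not read from the original file.

```python
def percentile(sorted_values, p):
    """Percentile p (0-100) by linear interpolation between closest ranks."""
    position = (len(sorted_values) - 1) * p / 100
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


# All 29 observations of 미이용산림바이오(톤), sorted ascending.
values = sorted(
    [0] * 12
    + [350, 704, 1312, 1886, 2032, 2081, 2280, 2323, 2390,
       2507, 2803, 2894, 3055, 3359, 3654, 4064, 4683]
)

q1 = percentile(values, 25)
median = percentile(values, 50)
q3 = percentile(values, 75)
p95 = percentile(values, 95)
iqr = q3 - q1
value_range = values[-1] - values[0]

print(q1, median, q3, round(p95, 1), iqr, value_range)
```

Because more than a quarter of the observations are zero, Q1 is 0 and the IQR equals Q3.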

Descriptive statistics

Standard deviation: 1507.9514
Coefficient of variation (CV): 1.0319416
Kurtosis: -1.0956079
Mean: 1461.2759
Median Absolute Deviation (MAD): 1312
Skewness: 0.47245973
Sum: 42377
Variance: 2273917.3
Monotonicity: Not monotonic
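
Most of these figures can be recomputed with Python's standard `statistics` module. This is a sketch under the assumption that the variance and standard deviation are sample statistics (n - 1 denominator), which matches the reported numbers; the values are reassembled from the value counts in this report. Skewness and kurtosis are omitted here.

```python
import statistics

# All 29 observations of 미이용산림바이오(톤); order does not matter here.
values = (
    [0] * 12
    + [350, 704, 1312, 1886, 2032, 2081, 2280, 2323, 2390,
       2507, 2803, 2894, 3055, 3359, 3654, 4064, 4683]
)

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1)
std_dev = statistics.stdev(values)       # sample standard deviation
cv = std_dev / mean                      # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median distance from the median.
mad = statistics.median(abs(v - median) for v in values)

print(f"sum={total} mean={mean:.4f} variance={variance:.1f}")
print(f"std={std_dev:.4f} cv={cv:.7f} mad={mad}")
```

A CV above 1 reflects how strongly the 12 zero months pull the mean down relative to the spread of the non-zero months.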
Histogram with fixed size bins (bins=18)
Common values

Value             Count  Frequency (%)
0                 12     41.4%
704               1      3.4%
350               1      3.4%
1312              1      3.4%
2280              1      3.4%
2032              1      3.4%
2390              1      3.4%
4683              1      3.4%
2323              1      3.4%
3055              1      3.4%
Other values (8)  8      27.6%

Minimum 10 values

Value  Count  Frequency (%)
0      12     41.4%
350    1      3.4%
704    1      3.4%
1312   1      3.4%
1886   1      3.4%
2032   1      3.4%
2081   1      3.4%
2280   1      3.4%
2323   1      3.4%
2390   1      3.4%

Maximum 10 values

Value  Count  Frequency (%)
4683   1      3.4%
4064   1      3.4%
3654   1      3.4%
3359   1      3.4%
3055   1      3.4%
2894   1      3.4%
2803   1      3.4%
2507   1      3.4%
2390   1      3.4%
2323   1      3.4%

Interactions


Correlations

                      연월    미이용산림바이오(톤)
연월                  1.000   1.000
미이용산림바이오(톤)  1.000   1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    연월     사업소  미이용산림바이오(톤)
0   2020-08  태안    704
1   2020-09  태안    2894
2   2020-10  태안    1886
3   2020-11  태안    2507
4   2020-12  태안    4064
5   2021-01  태안    2803
6   2021-02  태안    3359
7   2021-03  태안    3654
8   2021-04  태안    2081
9   2021-05  태안    3055

Last rows

    연월     사업소  미이용산림바이오(톤)
19  2022-03  태안    0
20  2022-04  태안    0
21  2022-05  태안    0
22  2022-06  태안    0
23  2022-07  태안    0
24  2022-08  태안    0
25  2022-09  태안    0
26  2022-10  태안    0
27  2022-11  태안    0
28  2022-12  태안    0