Overview

Dataset statistics

Number of variables3
Number of observations74
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory26.8 B

Variable types

Categorical2
Numeric1

Dataset

Description한국중부발전(주)의 화학물질 사용 정보이며, 목록명은 "사업소", "화학물질명", "사용량(kg)"으로 이루어져 있습니다.
Author한국중부발전(주)
URLhttps://www.data.go.kr/data/15119619/fileData.do

Alerts

사용량(kg) has 2 (2.7%) zerosZeros

Reproduction

Analysis started2024-04-21 02:27:24.598138
Analysis finished2024-04-21 02:27:25.856995
Duration1.26 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업소
Categorical

Distinct7
Distinct (%)9.5%
Missing0
Missing (%)0.0%
Memory size724.0 B
제주
12 
세종
12 
서울
11 
보령
10 
인천
10 
Other values (2)
19 

Length

Max length3
Median length2
Mean length2.2567568
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보령
2nd row보령
3rd row보령
4th row보령
5th row보령

Common Values

ValueCountFrequency (%)
제주 12
16.2%
세종 12
16.2%
서울 11
14.9%
보령 10
13.5%
인천 10
13.5%
신서천 10
13.5%
신보령 9
12.2%

Length

2024-04-21T11:27:25.919140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:27:26.019447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제주 12
16.2%
세종 12
16.2%
서울 11
14.9%
보령 10
13.5%
인천 10
13.5%
신서천 10
13.5%
신보령 9
12.2%

화학물질명
Categorical

Distinct33
Distinct (%)44.6%
Missing0
Missing (%)0.0%
Memory size724.0 B
염산(9%)
가성소다(4%)
암모니아수(9%)
스케일억제제
응집보조제
Other values (28)
43 

Length

Max length14
Median length10
Mean length6.7972973
Min length3

Unique

Unique21 ?
Unique (%)28.4%

Sample

1st row염산(9%)
2nd row가성소다(4%)
3rd row응집보조제
4th rowPAC
5th row암모니아수(9%)

Common Values

ValueCountFrequency (%)
염산(9%) 7
 
9.5%
가성소다(4%) 7
 
9.5%
암모니아수(9%) 7
 
9.5%
스케일억제제 5
 
6.8%
응집보조제 5
 
6.8%
PAC 5
 
6.8%
차아염소산나트륨 4
 
5.4%
중아황산나트륨 4
 
5.4%
카보하이드라자이드(10%) 3
 
4.1%
소포제 2
 
2.7%
Other values (23) 25
33.8%

Length

2024-04-21T11:27:26.142051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
염산(9 7
 
9.5%
암모니아수(9 7
 
9.5%
가성소다(4 7
 
9.5%
스케일억제제 5
 
6.8%
응집보조제 5
 
6.8%
pac 5
 
6.8%
차아염소산나트륨 4
 
5.4%
중아황산나트륨 4
 
5.4%
카보하이드라자이드(10 3
 
4.1%
무수암모니아 2
 
2.7%
Other values (23) 25
33.8%

사용량(kg)
Real number (ℝ)

ZEROS 

Distinct70
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean303664.43
Minimum0
Maximum3886200
Zeros2
Zeros (%)2.7%
Negative0
Negative (%)0.0%
Memory size798.0 B
2024-04-21T11:27:26.289655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile106
Q1810
median6031
Q363797.25
95-th percentile1986664.6
Maximum3886200
Range3886200
Interquartile range (IQR)62987.25

Descriptive statistics

Standard deviation758004.5
Coefficient of variation (CV)2.4961912
Kurtosis9.9420683
Mean303664.43
Median Absolute Deviation (MAD)5901
Skewness3.1086353
Sum22471168
Variance5.7457082 × 1011
MonotonicityNot monotonic
2024-04-21T11:27:26.441604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
800 2
 
2.7%
0 2
 
2.7%
840 2
 
2.7%
1000 2
 
2.7%
88042 1
 
1.4%
1147733 1
 
1.4%
283 1
 
1.4%
23 1
 
1.4%
2040 1
 
1.4%
3060 1
 
1.4%
Other values (60) 60
81.1%
ValueCountFrequency (%)
0 2
2.7%
23 1
1.4%
80 1
1.4%
120 1
1.4%
140 1
1.4%
200 1
1.4%
240 1
1.4%
260 1
1.4%
283 1
1.4%
320 1
1.4%
ValueCountFrequency (%)
3886200 1
1.4%
3355408 1
1.4%
2190960 1
1.4%
2026249 1
1.4%
1965350 1
1.4%
1914360 1
1.4%
1300360 1
1.4%
1257106 1
1.4%
1147733 1
1.4%
765888 1
1.4%

Interactions

2024-04-21T11:27:25.473737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T11:27:26.541684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업소화학물질명사용량(kg)
사업소1.0000.0000.000
화학물질명0.0001.0000.284
사용량(kg)0.0000.2841.000
2024-04-21T11:27:26.632559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업소화학물질명
사업소1.0000.000
화학물질명0.0001.000
2024-04-21T11:27:26.717070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용량(kg)사업소화학물질명
사용량(kg)1.0000.0000.039
사업소0.0001.0000.000
화학물질명0.0390.0001.000

Missing values

2024-04-21T11:27:25.764797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T11:27:25.824934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업소화학물질명사용량(kg)
0보령염산(9%)650095
1보령가성소다(4%)1257106
2보령응집보조제1172
3보령PAC182625
4보령암모니아수(9%)65986
5보령무수암모니아3886200
6보령차아염소산나트륨8171
7보령카보하이드라자이드(10%)2400
8보령소포제14805
9보령스케일억제제3200
사업소화학물질명사용량(kg)
64세종유화방지제345
65신보령염산(9%)192802
66신보령가성소다(4%)2190960
67신보령중아황산나트륨260
68신보령무수암모니아3355408
69신보령암모니아수(9%)18657
70신보령응집보조제1370
71신보령PAC92285
72신보령차아염소산나트륨140
73신보령스케일억제제7328