Overview

Dataset statistics

Number of variables: 5
Number of observations: 35
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 KiB
Average record size in memory: 47.8 B

Variable types

Numeric: 4
Categorical: 1

Dataset

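Description: Green product purchase records of Korea East-West Power (한국동서발전). Each record consists of fields such as 연도 (year), 구분 (category), 총구매(천원) (total purchases, thousand KRW), 녹색제품(천원) (green products, thousand KRW), and 구매비율(퍼센트) (purchase ratio, percent).
URL: https://www.data.go.kr/data/15048355/fileData.do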

Alerts

총구매(천원) is highly overall correlated with 녹색제품(천원) [High correlation]
녹색제품(천원) is highly overall correlated with 총구매(천원) and 1 other field [High correlation]
구매비율(퍼센트) is highly overall correlated with 녹색제품(천원) [High correlation]
녹색제품(천원) has unique values [Unique]
총구매(천원) has 5 (14.3%) zeros [Zeros]
구매비율(퍼센트) has 5 (14.3%) zeros [Zeros]
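
The percentages in the Zeros alerts follow from the zero counts and the 35 observations reported under Dataset statistics. A minimal sketch of that arithmetic; the helper name `share_percent` is hypothetical and not part of ydata-profiling:

```python
def share_percent(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to one decimal place."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * count / total, 1)


N_OBSERVATIONS = 35  # "Number of observations" in the overview

# Zero counts as listed in the alerts.
zeros = {"총구매(천원)": 5, "구매비율(퍼센트)": 5}

for column, count in zeros.items():
    print(f"{column}: {count} zeros = {share_percent(count, N_OBSERVATIONS)}%")
```

The same helper reproduces the Distinct (%) figures of the variables, for example 6 distinct values out of 35 observations.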

Reproduction

Analysis started: 2023-12-12 02:48:19.716776
Analysis finished: 2023-12-12 02:48:21.584358
Duration: 1.87 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
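
The reported duration is the difference between the two analysis timestamps. A short sketch that recomputes it with the standard library; the format string is an assumption about how the timestamps are written, based on the two values shown here:

```python
from datetime import datetime

# Assumed timestamp layout: date, time, and microseconds.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 02:48:19.716776", FMT)
finished = datetime.strptime("2023-12-12 02:48:21.584358", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```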

Variables

연도
Real number (ℝ)

Distinct: 6
Distinct (%): 17.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2019.4286
Minimum: 2017
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 2017
5-th percentile: 2017
Q1: 2018
Median: 2019
Q3: 2021
95-th percentile: 2022
Maximum: 2022
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.7026919
Coefficient of variation (CV): 0.00084315529
Kurtosis: -1.2392851
Mean: 2019.4286
Median Absolute Deviation (MAD): 1
Skewness: 0.034820473
Sum: 70680
Variance: 2.8991597
Monotonicity: Increasing
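
Because 연도 takes only six values, its summary statistics can be recomputed from the value counts alone. The sketch below does this with Python's `statistics` module. It assumes the report uses the sample variance (n - 1 denominator) and linearly interpolated quartiles, which is what the reported figures are consistent with:

```python
import statistics

# Year -> count, copied from the value table of 연도.
year_counts = {2017: 6, 2018: 6, 2019: 6, 2020: 6, 2021: 6, 2022: 5}

# Expand the frequency table back into the 35 individual observations.
years = [year for year, count in year_counts.items() for _ in range(count)]

total = sum(years)
mean = statistics.mean(years)
variance = statistics.variance(years)  # sample variance, n - 1 denominator
std_dev = statistics.stdev(years)
cv = std_dev / mean  # coefficient of variation

# "inclusive" interpolates between order statistics at position p * (n - 1).
q1, median, q3 = statistics.quantiles(years, n=4, method="inclusive")

print(f"sum={total} mean={mean:.4f} variance={variance:.7f} std={std_dev:.7f}")
print(f"cv={cv:.11f} Q1={q1} median={median} Q3={q3} IQR={q3 - q1}")
```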
[Figure: Histogram with fixed size bins (bins=6)]
Common Values

Value | Count | Frequency (%)
2017 | 6 | 17.1%
2018 | 6 | 17.1%
2019 | 6 | 17.1%
2020 | 6 | 17.1%
2021 | 6 | 17.1%
2022 | 5 | 14.3%

Minimum values

Value | Count | Frequency (%)
2017 | 6 | 17.1%
2018 | 6 | 17.1%
2019 | 6 | 17.1%
2020 | 6 | 17.1%
2021 | 6 | 17.1%
2022 | 5 | 14.3%

Maximum values

Value | Count | Frequency (%)
2022 | 5 | 14.3%
2021 | 6 | 17.1%
2020 | 6 | 17.1%
2019 | 6 | 17.1%
2018 | 6 | 17.1%
2017 | 6 | 17.1%

구분
Categorical

Distinct: 6
Distinct (%): 17.1%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 본사
2nd row: 당진
3rd row: 울산
4th row: 호남
5th row: 동해

Common Values

Value | Count | Frequency (%)
본사 | 6 | 17.1%
당진 | 6 | 17.1%
울산 | 6 | 17.1%
동해 | 6 | 17.1%
일산 | 6 | 17.1%
호남 | 5 | 14.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

총구매(천원)
Real number (ℝ)

Alerts: High correlation, Zeros

Distinct: 31
Distinct (%): 88.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 459686.14
Minimum: 0
Maximum: 3865099
Zeros: 5
Zeros (%): 14.3%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 42529.5
Median: 233188
Q3: 443614
95-th percentile: 1529051.5
Maximum: 3865099
Range: 3865099
Interquartile range (IQR): 401084.5

Descriptive statistics

Standard deviation: 746424.38
Coefficient of variation (CV): 1.6237696
Kurtosis: 12.434866
Mean: 459686.14
Median Absolute Deviation (MAD): 204040
Skewness: 3.180889
Sum: 16089015
Variance: 5.5714935 × 10^11
Monotonicity: Not monotonic
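
Several of these figures are tied together by simple identities: the mean is the sum divided by the 35 observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. The sketch below checks the reported values for 총구매(천원) against those identities. The relative tolerance is an assumption, chosen to absorb the rounding to about eight significant digits:

```python
import math

N = 35  # number of observations

# Values as reported for 총구매(천원).
reported = {
    "sum": 16089015,
    "mean": 459686.14,
    "std": 746424.38,
    "cv": 1.6237696,
    "variance": 5.5714935e11,
}

# Recompute each derived figure from the others.
mean_from_sum = reported["sum"] / N
cv_from_parts = reported["std"] / reported["mean"]
variance_from_std = reported["std"] ** 2

REL_TOL = 1e-6  # assumed tolerance for rounded report values

checks = {
    "mean = sum / n": math.isclose(mean_from_sum, reported["mean"], rel_tol=REL_TOL),
    "cv = std / mean": math.isclose(cv_from_parts, reported["cv"], rel_tol=REL_TOL),
    "variance = std ** 2": math.isclose(variance_from_std, reported["variance"], rel_tol=REL_TOL),
}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```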
[Figure: Histogram with fixed size bins (bins=31)]
Common Values

Value | Count | Frequency (%)
0 | 5 | 14.3%
412847 | 1 | 2.9%
1212265 | 1 | 2.9%
57738 | 1 | 2.9%
251668 | 1 | 2.9%
102821 | 1 | 2.9%
222180 | 1 | 2.9%
1368053 | 1 | 2.9%
1519825 | 1 | 2.9%
74489 | 1 | 2.9%
Other values (21) | 21 | 60.0%

Minimum values

Value | Count | Frequency (%)
0 | 5 | 14.3%
12519 | 1 | 2.9%
19270 | 1 | 2.9%
29130 | 1 | 2.9%
34010 | 1 | 2.9%
51049 | 1 | 2.9%
57738 | 1 | 2.9%
74489 | 1 | 2.9%
94172 | 1 | 2.9%
102821 | 1 | 2.9%

Maximum values

Value | Count | Frequency (%)
3865099 | 1 | 2.9%
1550580 | 1 | 2.9%
1519825 | 1 | 2.9%
1368053 | 1 | 2.9%
1236155 | 1 | 2.9%
1212265 | 1 | 2.9%
534651 | 1 | 2.9%
473184 | 1 | 2.9%
450000 | 1 | 2.9%
437228 | 1 | 2.9%

녹색제품(천원)
Real number (ℝ)

Alerts: High correlation, Unique

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 498960.34
Minimum: 12519
Maximum: 3751594
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 12519
5-th percentile: 26172
Q1: 98496.5
Median: 258088
Q3: 492246
95-th percentile: 1325157.8
Maximum: 3751594
Range: 3739075
Interquartile range (IQR): 393749.5

Descriptive statistics

Standard deviation: 704617.42
Coefficient of variation (CV): 1.4121712
Kurtosis: 13.010871
Mean: 498960.34
Median Absolute Deviation (MAD): 183599
Skewness: 3.2014994
Sum: 17463612
Variance: 4.9648571 × 10^11
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=35)]
Common Values

Value | Count | Frequency (%)
372388 | 1 | 2.9%
1165459 | 1 | 2.9%
135538 | 1 | 2.9%
233188 | 1 | 2.9%
33841 | 1 | 2.9%
74489 | 1 | 2.9%
1198704 | 1 | 2.9%
1277045 | 1 | 2.9%
219610 | 1 | 2.9%
102821 | 1 | 2.9%
Other values (25) | 25 | 71.4%

Minimum values

Value | Count | Frequency (%)
12519 | 1 | 2.9%
19270 | 1 | 2.9%
29130 | 1 | 2.9%
33841 | 1 | 2.9%
51049 | 1 | 2.9%
57738 | 1 | 2.9%
64467 | 1 | 2.9%
74489 | 1 | 2.9%
94172 | 1 | 2.9%
102821 | 1 | 2.9%

Maximum values

Value | Count | Frequency (%)
3751594 | 1 | 2.9%
1387140 | 1 | 2.9%
1298594 | 1 | 2.9%
1277045 | 1 | 2.9%
1198704 | 1 | 2.9%
1179241 | 1 | 2.9%
1165459 | 1 | 2.9%
534651 | 1 | 2.9%
534492 | 1 | 2.9%
450000 | 1 | 2.9%

구매비율(퍼센트)
Real number (ℝ)

Alerts: High correlation, Zeros

Distinct: 18
Distinct (%): 51.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 83.188571
Minimum: 0
Maximum: 100
Zeros: 5
Zeros (%): 14.3%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 91.75
Median: 98.5
Q3: 100
95-th percentile: 100
Maximum: 100
Range: 100
Interquartile range (IQR): 8.25

Descriptive statistics

Standard deviation: 34.732099
Coefficient of variation (CV): 0.41751046
Kurtosis: 2.5291021
Mean: 83.188571
Median Absolute Deviation (MAD): 1.5
Skewness: -2.068594
Sum: 2911.6
Variance: 1206.3187
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=18)]
Common Values

Value | Count | Frequency (%)
100.0 | 13 | 37.1%
0.0 | 5 | 14.3%
99.5 | 2 | 5.7%
90.2 | 1 | 2.9%
97.1 | 1 | 2.9%
96.3 | 1 | 2.9%
98.8 | 1 | 2.9%
93.3 | 1 | 2.9%
78.9 | 1 | 2.9%
95.4 | 1 | 2.9%
Other values (8) | 8 | 22.9%

Minimum values

Value | Count | Frequency (%)
0.0 | 5 | 14.3%
78.9 | 1 | 2.9%
89.5 | 1 | 2.9%
90.1 | 1 | 2.9%
90.2 | 1 | 2.9%
93.3 | 1 | 2.9%
94.3 | 1 | 2.9%
95.4 | 1 | 2.9%
96.1 | 1 | 2.9%
96.3 | 1 | 2.9%

Maximum values

Value | Count | Frequency (%)
100.0 | 13 | 37.1%
99.8 | 1 | 2.9%
99.5 | 2 | 5.7%
98.8 | 1 | 2.9%
98.5 | 1 | 2.9%
97.8 | 1 | 2.9%
97.1 | 1 | 2.9%
96.5 | 1 | 2.9%
96.3 | 1 | 2.9%
96.1 | 1 | 2.9%

Interactions

[Figures: 16 pairwise interaction plots of the four numeric variables]

Correlations

[Figure: correlation heatmap]

Variable | 연도 | 구분 | 총구매(천원) | 녹색제품(천원) | 구매비율(퍼센트)
연도 | 1.000 | 0.000 | 0.566 | 0.142 | 0.642
구분 | 0.000 | 1.000 | 0.456 | 0.684 | 0.000
총구매(천원) | 0.566 | 0.456 | 1.000 | 0.887 | 0.668
녹색제품(천원) | 0.142 | 0.684 | 0.887 | 1.000 | 0.297
구매비율(퍼센트) | 0.642 | 0.000 | 0.668 | 0.297 | 1.000

[Figure: correlation heatmap]

Variable | 연도 | 총구매(천원) | 녹색제품(천원) | 구매비율(퍼센트) | 구분
연도 | 1.000 | -0.346 | 0.067 | -0.405 | 0.000
총구매(천원) | -0.346 | 1.000 | 0.704 | -0.052 | 0.318
녹색제품(천원) | 0.067 | 0.704 | 1.000 | -0.526 | 0.497
구매비율(퍼센트) | -0.405 | -0.052 | -0.526 | 1.000 | 0.000
구분 | 0.000 | 0.318 | 0.497 | 0.000 | 1.000
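
The three High correlation alerts are consistent with flagging every pair whose absolute coefficient in the second matrix is at least 0.5. The sketch below reproduces that reading. The 0.5 cut-off is an assumption inferred from the numbers, not a setting stated in this report, and `high_pairs` is a hypothetical helper:

```python
from itertools import combinations

# Second correlation matrix, in the column order shown in the report.
columns = ["연도", "총구매(천원)", "녹색제품(천원)", "구매비율(퍼센트)", "구분"]
matrix = [
    [1.000, -0.346, 0.067, -0.405, 0.000],
    [-0.346, 1.000, 0.704, -0.052, 0.318],
    [0.067, 0.704, 1.000, -0.526, 0.497],
    [-0.405, -0.052, -0.526, 1.000, 0.000],
    [0.000, 0.318, 0.497, 0.000, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, inferred from which pairs are flagged


def high_pairs(names, values, threshold):
    """Return (name_a, name_b, coefficient) for pairs with |coefficient| >= threshold."""
    pairs = []
    for i, j in combinations(range(len(names)), 2):
        if abs(values[i][j]) >= threshold:
            pairs.append((names[i], names[j], values[i][j]))
    return pairs


flagged = high_pairs(columns, matrix, THRESHOLD)
for a, b, r in flagged:
    print(f"{a} <-> {b}: {r:+.3f}")
```

Under this assumption only two pairs pass the cut-off, both involving 녹색제품(천원), which matches the alert that it is correlated with 총구매(천원) and one other field.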

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

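First rows

Row | 연도 | 구분 | 총구매(천원) | 녹색제품(천원) | 구매비율(퍼센트)
0 | 2017 | 본사 | 412847 | 372388 | 90.2
1 | 2017 | 당진 | 1212265 | 1165459 | 96.1
2 | 2017 | 울산 | 170720 | 168199 | 98.5
3 | 2017 | 호남 | 241799 | 240701 | 99.5
4 | 2017 | 동해 | 473184 | 426523 | 90.1
5 | 2017 | 일산 | 29130 | 29130 | 100.0
6 | 2018 | 본사 | 534651 | 534651 | 100.0
7 | 2018 | 당진 | 437228 | 436474 | 99.8
8 | 2018 | 울산 | 94172 | 94172 | 100.0
9 | 2018 | 호남 | 450000 | 450000 | 100.0

Last rows

Row | 연도 | 구분 | 총구매(천원) | 녹색제품(천원) | 구매비율(퍼센트)
25 | 2021 | 당진 | 1368053 | 1277045 | 93.3
26 | 2021 | 울산 | 222180 | 219610 | 98.8
27 | 2021 | 호남 | 102821 | 102821 | 100.0
28 | 2021 | 동해 | 251668 | 242461 | 96.3
29 | 2021 | 일산 | 57738 | 57738 | 100.0
30 | 2022 | 본사 | 0 | 534492 | 0.0
31 | 2022 | 당진 | 0 | 1298594 | 0.0
32 | 2022 | 울산 | 0 | 132392 | 0.0
33 | 2022 | 동해 | 0 | 281243 | 0.0
34 | 2022 | 일산 | 0 | 64467 | 0.0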
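
In the sample rows, 구매비율(퍼센트) appears to equal 녹색제품(천원) divided by 총구매(천원), times 100, rounded to one decimal place. That formula is an assumption inferred from the column names and the values, not something the report states. The sketch below recomputes the ratio for a few sample rows and marks the 2022 rows, where the total is 0 while the green product amount is not, so the ratio cannot be derived from the two amounts. `recompute_ratio` is a hypothetical helper:

```python
# (연도, 구분, 총구매(천원), 녹색제품(천원), 구매비율(퍼센트)) copied from the sample rows.
rows = [
    (2017, "본사", 412847, 372388, 90.2),
    (2017, "당진", 1212265, 1165459, 96.1),
    (2021, "당진", 1368053, 1277045, 93.3),
    (2022, "본사", 0, 534492, 0.0),
    (2022, "일산", 0, 64467, 0.0),
]


def recompute_ratio(total, green):
    """Green share of total purchases in percent, or None when total is zero."""
    if total == 0:
        return None
    return round(100 * green / total, 1)


for year, site, total, green, reported in rows:
    ratio = recompute_ratio(total, green)
    if ratio is None:
        status = "total is 0, ratio cannot be recomputed"
    elif ratio == reported:
        status = "matches"
    else:
        status = f"differs (recomputed {ratio})"
    print(year, site, reported, status)
```

The five rows with a total of 0 are the same five observations counted in the Zeros alerts for 총구매(천원) and 구매비율(퍼센트).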