Overview

Dataset statistics

Number of variables: 5
Number of observations: 124
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.5 KiB
Average record size in memory: 45.1 B

Variable types

Text: 1
Categorical: 2
Numeric: 2

Dataset

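Description: Information on the green product purchasing record of KEPCO KDN (한전KDN). It provides the 2022 procurement purchase results by green product category and the amounts for newly added results.
URL: https://www.data.go.kr/data/15062373/fileData.do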

Alerts

신규실적 추가_녹색구매금액 is highly overall correlated with 녹색구매금액 총합계 and 2 other fields [High correlation]
녹색구매금액 총합계 is highly overall correlated with 신규실적 추가_녹색구매금액 and 2 other fields [High correlation]
조달구매실적_녹색구매금액 is highly overall correlated with 신규실적 추가_녹색구매금액 and 2 other fields [High correlation]
녹색장터구매실적 is highly overall correlated with 신규실적 추가_녹색구매금액 and 2 other fields [High correlation]
조달구매실적_녹색구매금액 is highly imbalanced (88.1%) [Imbalance]
녹색장터구매실적 is highly imbalanced (89.9%) [Imbalance]
분류명 has unique values [Unique]
신규실적 추가_녹색구매금액 has 116 (93.5%) zeros [Zeros]
녹색구매금액 총합계 has 113 (91.1%) zeros [Zeros]
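
The imbalance percentages in the alerts above can be approximated with a short sketch. It assumes the score is one minus the Shannon entropy of the class frequencies, normalized by the logarithm of the number of classes. That formula is an assumption rather than something stated in this report, although it reproduces both reported figures from the category counts listed under each variable. The function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k) for a list of class counts.

    H is the Shannon entropy of the class frequencies and k is the number
    of classes. 0 means perfectly balanced; values near 1 mean one class
    dominates. (Assumed definition, not taken from the report.)
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Counts for 조달구매실적_녹색구매금액: value 0 occurs 122 times, 16350000 twice.
print(f"{imbalance_score([122, 2]):.1%}")        # 88.1%
# Counts for 녹색장터구매실적: value 0 occurs 121 times, three values once each.
print(f"{imbalance_score([121, 1, 1, 1]):.1%}")  # 89.9%
```

Both columns are dominated by the value 0, which is why the score is close to 1 in each case.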

Reproduction

Analysis started: 2023-12-12 20:23:35.682686
Analysis finished: 2023-12-12 20:23:36.694040
Duration: 1.01 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

분류명
Text

UNIQUE 

Distinct: 124
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 15
Median length: 12
Mean length: 5.9596774
Min length: 2

Characters and Unicode

Total characters: 739
Distinct characters: 205
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
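
As a rough illustration of the character properties mentioned above, the following sketch counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` and the choice of four category names from this dataset as input are assumptions for illustration. The report may derive its counts differently, and the full column is not reproduced here.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over all characters in `values`."""
    counts = Counter()
    for value in values:
        # Normalize to NFC so each Hangul syllable is a single code point.
        text = unicodedata.normalize("NFC", value)
        counts.update(unicodedata.category(ch) for ch in text)
    return counts


# Four category names that appear in this dataset's sample rows.
sample = ["합계", "OA칸막이", "보관용 가구", "책상(탁자)"]
counts = category_counts(sample)

# Two-letter codes map to the names used in the report:
# Lo = Other Letter, Lu = Uppercase Letter, Zs = Space Separator,
# Ps = Open Punctuation, Pe = Close Punctuation, Po = Other Punctuation.
for code, n in counts.most_common():
    print(code, n)
```

Hangul syllables fall under `Lo` (Other Letter), which is consistent with that category dominating the counts reported for this variable.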

Unique

Unique: 124
Unique (%): 100.0%

Sample

1st row: 합계 (total)
2nd row: OA칸막이 (OA partition)
3rd row: 보관용 가구 (storage furniture)
4th row: 의자 (chair)
5th row: 주방가구 (kitchen furniture)

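Value | Count | Frequency (%)
(value not recoverable) | 18 | 8.8%
기타 (other) | 13 | 6.4%
건설용 (for construction) | 2 | 1.0%
필기구 (writing instruments) | 2 | 1.0%
산업용 (industrial) | 2 | 1.0%
절수형 (water-saving) | 2 | 1.0%
모니터 (monitor) | 2 | 1.0%
수도꼭지 (faucet) | 2 | 1.0%
의복 (clothing) | 2 | 1.0%
서비스 (service) | 2 | 1.0%
Other values (153) | 157 | 77.0%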

Most occurring characters

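Value | Count | Frequency (%)
(space) | 80 | 10.8%
(not recoverable) | 41 | 5.5%
(not recoverable) | 24 | 3.2%
(not recoverable) | 19 | 2.6%
(not recoverable) | 18 | 2.4%
(not recoverable) | 15 | 2.0%
(not recoverable) | 13 | 1.8%
(not recoverable) | 13 | 1.8%
(not recoverable) | 13 | 1.8%
(not recoverable) | 13 | 1.8%
Other values (195) | 490 | 66.3%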

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 648 | 87.7%
Space Separator | 80 | 10.8%
Other Punctuation | 5 | 0.7%
Close Punctuation | 2 | 0.3%
Open Punctuation | 2 | 0.3%
Uppercase Letter | 2 | 0.3%

Most frequent character per category

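Other Letter
Value | Count | Frequency (%)
(not recoverable) | 41 | 6.3%
(not recoverable) | 24 | 3.7%
(not recoverable) | 19 | 2.9%
(not recoverable) | 18 | 2.8%
(not recoverable) | 15 | 2.3%
(not recoverable) | 13 | 2.0%
(not recoverable) | 13 | 2.0%
(not recoverable) | 13 | 2.0%
(not recoverable) | 13 | 2.0%
(not recoverable) | 10 | 1.5%
Other values (187) | 469 | 72.4%

Other Punctuation
Value | Count | Frequency (%)
/ | 3 | 60.0%
· | 1 | 20.0%
, | 1 | 20.0%

Uppercase Letter
Value | Count | Frequency (%)
A | 1 | 50.0%
O | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 80 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 2 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 2 | 100.0%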

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 648 | 87.7%
Common | 89 | 12.0%
Latin | 2 | 0.3%

Most frequent character per script

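Hangul
Value | Count | Frequency (%)
(not recoverable) | 41 | 6.3%
(not recoverable) | 24 | 3.7%
(not recoverable) | 19 | 2.9%
(not recoverable) | 18 | 2.8%
(not recoverable) | 15 | 2.3%
(not recoverable) | 13 | 2.0%
(not recoverable) | 13 | 2.0%
(not recoverable) | 13 | 2.0%
(not recoverable) | 13 | 2.0%
(not recoverable) | 10 | 1.5%
Other values (187) | 469 | 72.4%

Common
Value | Count | Frequency (%)
(space) | 80 | 89.9%
/ | 3 | 3.4%
) | 2 | 2.2%
( | 2 | 2.2%
· | 1 | 1.1%
, | 1 | 1.1%

Latin
Value | Count | Frequency (%)
A | 1 | 50.0%
O | 1 | 50.0%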

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 648 | 87.7%
ASCII | 90 | 12.2%
None | 1 | 0.1%

Most frequent character per block

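ASCII
Value | Count | Frequency (%)
(space) | 80 | 88.9%
/ | 3 | 3.3%
) | 2 | 2.2%
( | 2 | 2.2%
, | 1 | 1.1%
A | 1 | 1.1%
O | 1 | 1.1%

Hangul
Value | Count | Frequency (%)
(not recoverable) | 41 | 6.3%
(not recoverable) | 24 | 3.7%
(not recoverable) | 19 | 2.9%
(not recoverable) | 18 | 2.8%
(not recoverable) | 15 | 2.3%
(not recoverable) | 13 | 2.0%
(not recoverable) | 13 | 2.0%
(not recoverable) | 13 | 2.0%
(not recoverable) | 13 | 2.0%
(not recoverable) | 10 | 1.5%
Other values (187) | 469 | 72.4%

None
Value | Count | Frequency (%)
· | 1 | 100.0%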

조달구매실적_녹색구매금액
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
Category counts: 0 (122), 16350000 (2)

Length

Max length: 8
Median length: 1
Mean length: 1.1129032
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 16350000
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 122 | 98.4%
16350000 | 2 | 1.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; 0: 122 (98.4%), 16350000: 2 (1.6%)]

신규실적 추가_녹색구매금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 9
Distinct (%): 7.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19637744
Minimum: 0
Maximum: 1.2175401 × 10^9
Zeros: 116
Zeros (%): 93.5%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 0
95-th percentile: 1.2049179 × 10^8
Maximum: 1.2175401 × 10^9
Range: 1.2175401 × 10^9
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.1717205 × 10^8
Coefficient of variation (CV): 5.9666756
Kurtosis: 90.654979
Mean: 19637744
Median Absolute Deviation (MAD): 0
Skewness: 9.056679
Sum: 2.4350802 × 10^9
Variance: 1.3729288 × 10^16
Monotonicity: Not monotonic
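
The statistics above can be cross-checked from the value counts listed for this variable (116 zeros plus eight distinct non-zero amounts). The sketch below uses only Python's standard `statistics` module. It assumes the reported standard deviation is the sample standard deviation (n − 1 denominator) and that the 95th percentile uses linear interpolation; neither convention is stated in the report, though both agree with the reported figures. The variable names are illustrative.

```python
import statistics

# Non-zero values of 신규실적 추가_녹색구매금액 as listed in the report;
# the remaining 116 of the 124 observations are zero.
nonzero = [1217540110, 38038000, 183484400, 135042455,
           297149405, 251651378, 170500682, 141673790]
values = sorted(nonzero + [0] * 116)

total = sum(values)
mean = total / len(values)
std = statistics.stdev(values)          # sample standard deviation (n - 1)
variance = statistics.variance(values)  # sample variance (n - 1)
cv = std / mean                         # coefficient of variation
# Last of the 19 cut points for n=20 is the 95th percentile.
p95 = statistics.quantiles(values, n=20, method="inclusive")[-1]

# The largest value is compared with the sum of all other values.
largest = max(values)
rest = total - largest

print(f"sum={total} mean={mean:.0f} std={std:.4e} cv={cv:.4f} p95={p95:.4e}")
print("largest equals sum of the rest:", largest == rest)
```

The largest value (1217540110) equals the sum of the remaining values, which suggests that the first row (합계, "total") is an aggregate of the other rows. If so, the reported sum, mean, and skewness are inflated by that row, and it may be worth excluding before drawing conclusions from these statistics.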
[Figure: Histogram with fixed size bins (bins=9)]

Common values

Value | Count | Frequency (%)
0 | 116 | 93.5%
1217540110 | 1 | 0.8%
38038000 | 1 | 0.8%
183484400 | 1 | 0.8%
135042455 | 1 | 0.8%
297149405 | 1 | 0.8%
251651378 | 1 | 0.8%
170500682 | 1 | 0.8%
141673790 | 1 | 0.8%

Smallest values

Value | Count | Frequency (%)
0 | 116 | 93.5%
38038000 | 1 | 0.8%
135042455 | 1 | 0.8%
141673790 | 1 | 0.8%
170500682 | 1 | 0.8%
183484400 | 1 | 0.8%
251651378 | 1 | 0.8%
297149405 | 1 | 0.8%
1217540110 | 1 | 0.8%

Largest values

Value | Count | Frequency (%)
1217540110 | 1 | 0.8%
297149405 | 1 | 0.8%
251651378 | 1 | 0.8%
183484400 | 1 | 0.8%
170500682 | 1 | 0.8%
141673790 | 1 | 0.8%
135042455 | 1 | 0.8%
38038000 | 1 | 0.8%
0 | 116 | 93.5%

녹색장터구매실적
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
Category counts: 0 (121), 70900 (1), 17900 (1), 53000 (1)

Length

Max length: 5
Median length: 1
Mean length: 1.0967742
Min length: 1

Unique

Unique: 3
Unique (%): 2.4%

Sample

1st row: 70900
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 121 | 97.6%
70900 | 1 | 0.8%
17900 | 1 | 0.8%
53000 | 1 | 0.8%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; 0: 121 (97.6%), 70900: 1 (0.8%), 17900: 1 (0.8%), 53000: 1 (0.8%)]

녹색구매금액 총합계
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 12
Distinct (%): 9.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19902597
Minimum: 0
Maximum: 1.233961 × 10^9
Zeros: 113
Zeros (%): 91.1%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 0
95-th percentile: 1.2049179 × 10^8
Maximum: 1.233961 × 10^9
Range: 1.233961 × 10^9
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.1852505 × 10^8
Coefficient of variation (CV): 5.9552555
Kurtosis: 91.349019
Mean: 19902597
Median Absolute Deviation (MAD): 0
Skewness: 9.0989284
Sum: 2.467922 × 10^9
Variance: 1.4048187 × 10^16
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=12)]

Common values

Value | Count | Frequency (%)
0 | 113 | 91.1%
1233961010 | 1 | 0.8%
16350000 | 1 | 0.8%
38038000 | 1 | 0.8%
183484400 | 1 | 0.8%
135042455 | 1 | 0.8%
297149405 | 1 | 0.8%
251651378 | 1 | 0.8%
170500682 | 1 | 0.8%
17900 | 1 | 0.8%
Other values (2) | 2 | 1.6%

Smallest values

Value | Count | Frequency (%)
0 | 113 | 91.1%
17900 | 1 | 0.8%
53000 | 1 | 0.8%
16350000 | 1 | 0.8%
38038000 | 1 | 0.8%
135042455 | 1 | 0.8%
141673790 | 1 | 0.8%
170500682 | 1 | 0.8%
183484400 | 1 | 0.8%
251651378 | 1 | 0.8%

Largest values

Value | Count | Frequency (%)
1233961010 | 1 | 0.8%
297149405 | 1 | 0.8%
251651378 | 1 | 0.8%
183484400 | 1 | 0.8%
170500682 | 1 | 0.8%
141673790 | 1 | 0.8%
135042455 | 1 | 0.8%
38038000 | 1 | 0.8%
16350000 | 1 | 0.8%
53000 | 1 | 0.8%

Interactions

[Figure: interaction plots, four panels]

Correlations

[Figure: correlation plot]
Variable | 조달구매실적_녹색구매금액 | 신규실적 추가_녹색구매금액 | 녹색장터구매실적 | 녹색구매금액 총합계
조달구매실적_녹색구매금액 | 1.000 | 0.887 | 0.887 | 0.887
신규실적 추가_녹색구매금액 | 0.887 | 1.000 | 0.887 | 1.000
녹색장터구매실적 | 0.887 | 0.887 | 1.000 | 0.887
녹색구매금액 총합계 | 0.887 | 1.000 | 0.887 | 1.000

[Figure: correlation plot]
Variable | 녹색장터구매실적 | 조달구매실적_녹색구매금액
녹색장터구매실적 | 1.000 | 0.690
조달구매실적_녹색구매금액 | 0.690 | 1.000

[Figure: correlation plot]
Variable | 신규실적 추가_녹색구매금액 | 녹색구매금액 총합계 | 조달구매실적_녹색구매금액 | 녹색장터구매실적
신규실적 추가_녹색구매금액 | 1.000 | 0.863 | 0.690 | 0.563
녹색구매금액 총합계 | 0.863 | 1.000 | 0.690 | 0.563
조달구매실적_녹색구매금액 | 0.690 | 0.690 | 1.000 | 0.690
녹색장터구매실적 | 0.563 | 0.563 | 0.690 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

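First rows

Index | 분류명 | 조달구매실적_녹색구매금액 | 신규실적 추가_녹색구매금액 | 녹색장터구매실적 | 녹색구매금액 총합계
0 | 합계 | 16350000 | 1217540110 | 70900 | 1233961010
1 | OA칸막이 | 0 | 0 | 0 | 0
2 | 보관용 가구 | 0 | 0 | 0 | 0
3 | 의자 | 0 | 0 | 0 | 0
4 | 주방가구 | 0 | 0 | 0 | 0
5 | 책상(탁자) | 0 | 0 | 0 | 0
6 | 침대 및 침대매트릭스 | 0 | 0 | 0 | 0
7 | 기타 가구 및 부속품 | 0 | 0 | 0 | 0
8 | 공기청정기 | 0 | 0 | 0 | 0
9 | 냉장고 | 0 | 0 | 0 | 0

Last rows

Index | 분류명 | 조달구매실적_녹색구매금액 | 신규실적 추가_녹색구매금액 | 녹색장터구매실적 | 녹색구매금액 총합계
114 | 기타 도로용품 | 0 | 0 | 0 | 0
115 | 식음료품 | 0 | 0 | 0 | 0
116 | 기타 서비스 | 0 | 0 | 0 | 0
117 | 운송 서비스 | 0 | 0 | 0 | 0
118 | 에너지 | 0 | 0 | 0 | 0
119 | 사료 | 0 | 0 | 0 | 0
120 | 육묘상자 | 0 | 0 | 0 | 0
121 | 인공어초 및 부자 | 0 | 0 | 0 | 0
122 | 토양개량제 | 0 | 0 | 0 | 0
123 | 기타 | 0 | 0 | 0 | 0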
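
In the first sample row (합계), the last column appears to be the sum of the three preceding amount columns. The sketch below checks that relationship for one row. The column names are copied from the report, while the helper `components_match_total` and the assumption that 녹색구매금액 총합계 is defined as this sum are illustrative and not confirmed by the source.

```python
# Amount columns assumed to add up to the total column.
COMPONENT_COLUMNS = [
    "조달구매실적_녹색구매금액",
    "신규실적 추가_녹색구매금액",
    "녹색장터구매실적",
]
TOTAL_COLUMN = "녹색구매금액 총합계"


def components_match_total(row):
    """Return True if the component amounts in `row` sum to its total."""
    return sum(int(row[col]) for col in COMPONENT_COLUMNS) == int(row[TOTAL_COLUMN])


# Row 0 (합계) as shown in the sample. Values are given as strings because
# two of these columns were typed as Categorical in the report.
total_row = {
    "분류명": "합계",
    "조달구매실적_녹색구매금액": "16350000",
    "신규실적 추가_녹색구매금액": "1217540110",
    "녹색장터구매실적": "70900",
    "녹색구매금액 총합계": "1233961010",
}

print(components_match_total(total_row))  # True
```

A check like this could be run over every row after loading the file, since two of the amount columns were inferred as Categorical and would need converting to integers first.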