Overview

Dataset statistics

Number of variables6
Number of observations137
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.8 KiB
Average record size in memory51.0 B

Variable types

Categorical5
Numeric1

Dataset

Description한국동서발전의 지역농산물 구매현황 정보를 제공합니다. 연도, 지역, 사업소, 품목, 구매 금액 등의 항목을 나타냅니다.
URLhttps://www.data.go.kr/data/15087284/fileData.do

Alerts

사업소 is highly overall correlated with 지역High correlation
지역 is highly overall correlated with 사업소High correlation

Reproduction

Analysis started2023-12-11 23:03:09.650650
Analysis finished2023-12-11 23:03:10.153852
Duration0.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Categorical

Distinct4
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2022
46 
2020
34 
2021
31 
2019
26 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2022 46
33.6%
2020 34
24.8%
2021 31
22.6%
2019 26
19.0%

Length

2023-12-12T08:03:10.213480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:03:10.311831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 46
33.6%
2020 34
24.8%
2021 31
22.6%
2019 26
19.0%

지역
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
강원동해
25 
충남당진
24 
경기일산
23 
울산중구
22 
울산남구
21 
Other values (2)
22 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산중구
2nd row울산중구
3rd row울산중구
4th row울산중구
5th row울산남구

Common Values

ValueCountFrequency (%)
강원동해 25
18.2%
충남당진 24
17.5%
경기일산 23
16.8%
울산중구 22
16.1%
울산남구 21
15.3%
전남여수 14
10.2%
충북음성 8
 
5.8%

Length

2023-12-12T08:03:10.436597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:03:10.540213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강원동해 25
18.2%
충남당진 24
17.5%
경기일산 23
16.8%
울산중구 22
16.1%
울산남구 21
15.3%
전남여수 14
10.2%
충북음성 8
 
5.8%

사업소
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
동해바이오발전본부
25 
당진발전본부
24 
일산발전본부
23 
본사
22 
울산발전본부
21 
Other values (2)
22 

Length

Max length9
Median length6
Mean length5.9635036
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row본사
2nd row본사
3rd row본사
4th row본사
5th row울산발전본부

Common Values

ValueCountFrequency (%)
동해바이오발전본부 25
18.2%
당진발전본부 24
17.5%
일산발전본부 23
16.8%
본사 22
16.1%
울산발전본부 21
15.3%
호남발전본부 14
10.2%
음성그린에너지 8
 
5.8%

Length

2023-12-12T08:03:10.682016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:03:10.838558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동해바이오발전본부 25
18.2%
당진발전본부 24
17.5%
일산발전본부 23
16.8%
본사 22
16.1%
울산발전본부 21
15.3%
호남발전본부 14
10.2%
음성그린에너지 8
 
5.8%

품목
Categorical

Distinct6
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
양곡
33 
과일
29 
가공
28 
채소
22 
화훼
14 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양곡
2nd row과일
3rd row채소
4th row가공
5th row과일

Common Values

ValueCountFrequency (%)
양곡 33
24.1%
과일 29
21.2%
가공 28
20.4%
채소 22
16.1%
화훼 14
10.2%
축산 11
 
8.0%

Length

2023-12-12T08:03:10.988422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:03:11.116766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양곡 33
24.1%
과일 29
21.2%
가공 28
20.4%
채소 22
16.1%
화훼 14
10.2%
축산 11
 
8.0%

구매금액
Real number (ℝ)

Distinct130
Distinct (%)94.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10274486
Minimum240000
Maximum57071000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T08:03:11.260235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum240000
5-th percentile643200
Q12165030
median5600000
Q314664942
95-th percentile30622966
Maximum57071000
Range56831000
Interquartile range (IQR)12499912

Descriptive statistics

Standard deviation11313920
Coefficient of variation (CV)1.1011666
Kurtosis3.2144132
Mean10274486
Median Absolute Deviation (MAD)4380000
Skewness1.7408678
Sum1.4076045 × 109
Variance1.280048 × 1014
MonotonicityNot monotonic
2023-12-12T08:03:11.392801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2660000 2
 
1.5%
3000000 2
 
1.5%
5435000 2
 
1.5%
526000 2
 
1.5%
400000 2
 
1.5%
1615000 2
 
1.5%
3259000 2
 
1.5%
942000 1
 
0.7%
27902000 1
 
0.7%
7269900 1
 
0.7%
Other values (120) 120
87.6%
ValueCountFrequency (%)
240000 1
0.7%
400000 2
1.5%
429000 1
0.7%
526000 2
1.5%
600000 1
0.7%
654000 1
0.7%
690000 1
0.7%
692000 1
0.7%
700000 1
0.7%
833000 1
0.7%
ValueCountFrequency (%)
57071000 1
0.7%
50500000 1
0.7%
50336117 1
0.7%
41125740 1
0.7%
37637280 1
0.7%
32066050 1
0.7%
31114830 1
0.7%
30500000 1
0.7%
30244127 1
0.7%
28576640 1
0.7%

비고
Categorical

Distinct2
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
지역농산물구매
83 
농산물구매
54 

Length

Max length7
Median length7
Mean length6.2116788
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지역농산물구매
2nd row지역농산물구매
3rd row지역농산물구매
4th row지역농산물구매
5th row지역농산물구매

Common Values

ValueCountFrequency (%)
지역농산물구매 83
60.6%
농산물구매 54
39.4%

Length

2023-12-12T08:03:11.551991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:03:11.692003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지역농산물구매 83
60.6%
농산물구매 54
39.4%

Interactions

2023-12-12T08:03:09.922378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:03:11.756109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도지역사업소품목구매금액비고
연도1.0000.0210.0210.0000.0870.217
지역0.0211.0001.0000.0000.3480.319
사업소0.0211.0001.0000.0000.3480.319
품목0.0000.0000.0001.0000.1940.152
구매금액0.0870.3480.3480.1941.0000.102
비고0.2170.3190.3190.1520.1021.000
2023-12-12T08:03:11.852879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목연도사업소지역비고
품목1.0000.0000.0000.0000.107
연도0.0001.0000.0000.0000.142
사업소0.0000.0001.0001.0000.335
지역0.0000.0001.0001.0000.335
비고0.1070.1420.3350.3351.000
2023-12-12T08:03:11.964999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구매금액연도지역사업소품목비고
구매금액1.0000.0400.1790.1790.1010.050
연도0.0401.0000.0000.0000.0000.142
지역0.1790.0001.0001.0000.0000.335
사업소0.1790.0001.0001.0000.0000.335
품목0.1010.0000.0000.0001.0000.107
비고0.0500.1420.3350.3350.1071.000

Missing values

2023-12-12T08:03:10.020072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:03:10.116254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도지역사업소품목구매금액비고
02019울산중구본사양곡8110000지역농산물구매
12019울산중구본사과일24400000지역농산물구매
22019울산중구본사채소5500000지역농산물구매
32019울산중구본사가공5600000지역농산물구매
42019울산남구울산발전본부과일1500000지역농산물구매
52019충남당진당진발전본부양곡57071000지역농산물구매
62019충남당진당진발전본부과일6737000지역농산물구매
72019충남당진당진발전본부채소27511750지역농산물구매
82019충남당진당진발전본부가공18367000지역농산물구매
92019강원동해동해바이오발전본부양곡2069000농산물구매
연도지역사업소품목구매금액비고
1272022경기일산일산발전본부양곡6469500지역농산물구매
1282022경기일산일산발전본부가공3000000지역농산물구매
1292022경기일산일산발전본부화훼920000지역농산물구매
1302022전남여수호남발전본부과일2639000농산물구매
1312022전남여수호남발전본부양곡5370000지역농산물구매
1322022충북음성음성그린에너지가공4700000농산물구매
1332022충북음성음성그린에너지양곡8465000지역농산물구매
1342022충북음성음성그린에너지채소3000000지역농산물구매
1352022충북음성음성그린에너지가공2150000지역농산물구매
1362022충북음성음성그린에너지축산429000지역농산물구매