Overview

Dataset statistics

Number of variables: 6
Number of observations: 343
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 17.2 KiB
Average record size in memory: 51.4 B
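
The average record size appears to be the total in-memory size divided by the number of observations. The sketch below checks that reading against the figures above; the division rule is an assumption about how the report derives the value, and the variable names are illustrative. Because 17.2 KiB is itself rounded, the check works with the range of totals that would round to 17.2.

```python
KIB = 1024  # bytes per KiB

n_rows = 343                 # Number of observations
reported_total_kib = 17.2    # Total size in memory (rounded in the report)
reported_avg_bytes = 51.4    # Average record size in memory

# Point estimate from the rounded total.
avg_estimate = reported_total_kib * KIB / n_rows

# Any total in [17.15, 17.25) KiB would be displayed as 17.2 KiB.
avg_low = 17.15 * KIB / n_rows
avg_high = 17.25 * KIB / n_rows

print(f"estimate: {avg_estimate:.2f} B")
print(f"plausible range: {avg_low:.2f} to {avg_high:.2f} B")
print("reported value inside range:", avg_low <= reported_avg_bytes <= avg_high)
```

The reported 51.4 B falls inside the plausible range, so the two figures are consistent under this assumption.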

Variable types

Categorical: 4
Text: 1
Numeric: 1

Dataset

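Description: Data on the supply prices of government-supplied seed from the Korea Seed and Variety Service (국립종자원). It provides fields such as 년산 (production year), 작물명 (crop name), 품종명 (variety name), 공급단위 (supply unit), 소독구분 (disinfection status: 소독 = disinfected, 미소독 = not disinfected), and 공급가격 (supply price).
URL: https://www.data.go.kr/data/15066253/fileData.do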

Alerts

공급단위 is highly overall correlated with 공급가격 and 1 other field (High correlation)
작물명 is highly overall correlated with 공급가격 and 1 other field (High correlation)
공급가격 is highly overall correlated with 년산 and 3 other fields (High correlation)
년산 is highly overall correlated with 공급가격 (High correlation)
소독구분 is highly overall correlated with 공급가격 (High correlation)

Reproduction

Analysis started: 2023-12-12 16:09:53.707390
Analysis finished: 2023-12-12 16:09:54.489198
Duration: 0.78 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
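
The reported duration follows from the two timestamps. A minimal sketch using only the Python standard library, with the timestamps copied from the lines above:

```python
from datetime import datetime

started = datetime.fromisoformat("2023-12-12 16:09:53.707390")
finished = datetime.fromisoformat("2023-12-12 16:09:54.489198")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()

print(f"Duration: {duration:.2f} seconds")
```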

Variables

년산
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Value | Count
2020 | 147
2021 | 104
2022 | 92

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2020
3rd row: 2020
4th row: 2020
5th row: 2020

Common Values

Value | Count | Frequency (%)
2020 | 147 | 42.9%
2021 | 104 | 30.3%
2022 | 92 | 26.8%
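
Each frequency is the value's count divided by the 343 observations. The sketch below recomputes the percentages for 년산 from the counts listed above; the dictionary name is illustrative.

```python
# Counts for 년산 as listed in the Common Values table.
counts = {"2020": 147, "2021": 104, "2022": 92}

total = sum(counts.values())

# Frequency in percent, rounded to one decimal as in the report.
frequency = {value: round(100 * n / total, 1) for value, n in counts.items()}

for value, pct in frequency.items():
    print(f"{value}: {counts[value]} ({pct}%)")
```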

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

작물명
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 3.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Value | Count
(not shown) | 132
(not shown) | 62
보리 | 56
(not shown) | 24
보리(비축) | 22
Other values (8) | 47

Length

Max length: 6
Median length: 1
Mean length: 2.0087464
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: (not shown)
2nd row: (not shown)
3rd row: (not shown)
4th row: (not shown)
5th row: (not shown)

Common Values

Value | Count | Frequency (%)
(not shown) | 132 | 38.5%
(not shown) | 62 | 18.1%
보리 | 56 | 16.3%
(not shown) | 24 | 7.0%
보리(비축) | 22 | 6.4%
벼(비축) | 15 | 4.4%
보리(춘파) | 12 | 3.5%
밀(비축) | 6 | 1.7%
(not shown) | 4 | 1.2%
호밀 | 3 | 0.9%
Other values (3) | 7 | 2.0%

Length

[Figure: histogram of lengths of the category]

품종명
Text

Distinct: 77
Distinct (%): 22.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 10
Median length: 3
Mean length: 3.8104956
Min length: 2

Characters and Unicode

Total characters: 1307
Distinct characters: 85
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5
Unique (%): 1.5%

Sample

1st row: 금강밀
2nd row: 금강밀
3rd row: 백강밀
4th row: 백강밀
5th row: 새금강밀

Value | Count | Frequency (%)
큰알보리1호 | 11 | 3.2%
흰찰쌀보리 | 11 | 3.2%
새쌀보리 | 10 | 2.9%
새찰쌀보리 | 10 | 2.9%
백강밀 | 9 | 2.6%
선풍콩 | 8 | 2.3%
곡우 | 8 | 2.3%
조경밀 | 8 | 2.3%
영양보리 | 8 | 2.3%
누리찰쌀보리 | 8 | 2.3%
Other values (67) | 252 | 73.5%

Most occurring characters

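Value | Count | Frequency (%)
(not shown) | 115 | 8.8%
(not shown) | 97 | 7.4%
(not shown) | 91 | 7.0%
(not shown) | 64 | 4.9%
(not shown) | 58 | 4.4%
(not shown) | 54 | 4.1%
(not shown) | 46 | 3.5%
(not shown) | 30 | 2.3%
(not shown) | 29 | 2.2%
(not shown) | 28 | 2.1%
Other values (75) | 695 | 53.2%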

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1264 | 96.7%
Decimal Number | 19 | 1.5%
Close Punctuation | 12 | 0.9%
Open Punctuation | 12 | 0.9%
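
The category names above correspond to Unicode general categories, which Python's standard `unicodedata` module exposes as two-letter codes. The sketch below classifies the characters of one variety name from this report (큰알보리1호) plus the two parenthesis characters, and checks that the category counts add up to the reported total. The code-to-name mapping and the helper name are illustrative and cover only the four categories in this table.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

def category_counts(text):
    """Count characters of `text` by Unicode general category name."""
    return Counter(
        CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
        for ch in text
    )

# A variety name taken from the report, followed by "(" and ")".
example = category_counts("큰알보리1호" + "()")
print(dict(example))

# The four category counts should add up to the reported total characters.
reported = {"Other Letter": 1264, "Decimal Number": 19,
            "Close Punctuation": 12, "Open Punctuation": 12}
total_characters = sum(reported.values())
print(total_characters)
```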

Most frequent character per category

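Other Letter

Value | Count | Frequency (%)
(not shown) | 115 | 9.1%
(not shown) | 97 | 7.7%
(not shown) | 91 | 7.2%
(not shown) | 64 | 5.1%
(not shown) | 58 | 4.6%
(not shown) | 54 | 4.3%
(not shown) | 46 | 3.6%
(not shown) | 30 | 2.4%
(not shown) | 29 | 2.3%
(not shown) | 28 | 2.2%
Other values (72) | 652 | 51.6%

Decimal Number

Value | Count | Frequency (%)
1 | 19 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 12 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 12 | 100.0%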

Most occurring scripts

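Value | Count | Frequency (%)
Hangul | 1264 | 96.7%
Common | 43 | 3.3%

Most frequent character per script

Hangul

Value | Count | Frequency (%)
(not shown) | 115 | 9.1%
(not shown) | 97 | 7.7%
(not shown) | 91 | 7.2%
(not shown) | 64 | 5.1%
(not shown) | 58 | 4.6%
(not shown) | 54 | 4.3%
(not shown) | 46 | 3.6%
(not shown) | 30 | 2.4%
(not shown) | 29 | 2.3%
(not shown) | 28 | 2.2%
Other values (72) | 652 | 51.6%

Common

Value | Count | Frequency (%)
1 | 19 | 44.2%
) | 12 | 27.9%
( | 12 | 27.9%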

Most occurring blocks

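Value | Count | Frequency (%)
Hangul | 1264 | 96.7%
ASCII | 43 | 3.3%

Most frequent character per block

Hangul

Value | Count | Frequency (%)
(not shown) | 115 | 9.1%
(not shown) | 97 | 7.7%
(not shown) | 91 | 7.2%
(not shown) | 64 | 5.1%
(not shown) | 58 | 4.6%
(not shown) | 54 | 4.3%
(not shown) | 46 | 3.6%
(not shown) | 30 | 2.4%
(not shown) | 29 | 2.3%
(not shown) | 28 | 2.2%
Other values (72) | 652 | 51.6%

ASCII

Value | Count | Frequency (%)
1 | 19 | 44.2%
) | 12 | 27.9%
( | 12 | 27.9%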

공급단위
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Value | Count
20 | 275
5 | 68

Length

Max length: 2
Median length: 2
Mean length: 1.8017493
Min length: 1
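
The length statistics treat each category value as a string, so they can be recovered from the value counts alone. A small sketch for 공급단위, using the counts reported in this section (275 rows of "20" and 68 rows of "5"); the variable names are illustrative.

```python
from statistics import median

# Value counts for 공급단위, with values kept as strings.
value_counts = {"20": 275, "5": 68}

# One string length per row.
lengths = [len(value) for value, n in value_counts.items() for _ in range(n)]

mean_length = sum(lengths) / len(lengths)
median_length = median(lengths)

print("max:", max(lengths), "min:", min(lengths))
print("median:", median_length)
print("mean:", round(mean_length, 7))
```

The mean works out to 618 / 343, which matches the reported 1.8017493.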

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20
2nd row: 20
3rd row: 20
4th row: 20
5th row: 20

Common Values

Value | Count | Frequency (%)
20 | 275 | 80.2%
5 | 68 | 19.8%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

소독구분
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Value | Count
미소독 | 215
소독 | 128

Length

Max length: 3
Median length: 3
Mean length: 2.6268222
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 소독
2nd row: 미소독
3rd row: 미소독
4th row: 소독
5th row: 미소독

Common Values

Value | Count | Frequency (%)
미소독 | 215 | 62.7%
소독 | 128 | 37.3%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

공급가격
Real number (ℝ)

HIGH CORRELATION 

Distinct: 38
Distinct (%): 11.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 36110.058
Minimum: 19500
Maximum: 52930
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.1 KiB

Quantile statistics

Minimum: 19500
5-th percentile: 20840
Q1: 26830
median: 29410
Q3: 49020
95-th percentile: 51150
Maximum: 52930
Range: 33430
Interquartile range (IQR): 22190
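
The last two rows are derived from the others: the range is maximum minus minimum, and the IQR is Q3 minus Q1. A short check with the values from this table:

```python
# Quantile statistics for 공급가격, copied from the table above.
minimum, q1, q3, maximum = 19500, 26830, 49020, 52930

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)

print("Range:", value_range)
print("IQR:", iqr)
```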

Descriptive statistics

Standard deviation: 11735.503
Coefficient of variation (CV): 0.32499264
Kurtosis: -1.7263021
Mean: 36110.058
Median Absolute Deviation (MAD): 9230
Skewness: 0.09095758
Sum: 12385750
Variance: 1.3772203 × 10^8
Monotonicity: Not monotonic
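
Several of these figures are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. The sketch below recomputes them from the reported values. Because the inputs are already rounded, the comparisons use small tolerances rather than exact equality.

```python
import math

n = 343                 # Number of observations
total = 12385750        # Sum
std = 11735.503         # Standard deviation (as reported, rounded)

mean = total / n        # Mean
cv = std / mean         # Coefficient of variation
variance = std ** 2     # Variance

print(f"mean     = {mean:.3f}")
print(f"cv       = {cv:.8f}")
print(f"variance = {variance:.7e}")

# Agreement with the reported figures, within rounding error.
print(math.isclose(cv, 0.32499264, abs_tol=1e-7))
print(math.isclose(variance, 1.3772203e8, rel_tol=1e-7))
```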
[Figure: histogram with fixed size bins (bins=38)]

Common Values

Value | Count | Frequency (%)
49170 | 34 | 9.9%
51150 | 34 | 9.9%
44880 | 34 | 9.9%
49020 | 32 | 9.3%
28080 | 20 | 5.8%
27800 | 18 | 5.2%
29410 | 14 | 4.1%
21280 | 11 | 3.2%
22620 | 11 | 3.2%
22640 | 11 | 3.2%
Other values (28) | 124 | 36.2%

Extreme values (minimum 10)

Value | Count | Frequency (%)
19500 | 7 | 2.0%
20180 | 8 | 2.3%
20840 | 7 | 2.0%
21280 | 11 | 3.2%
21430 | 7 | 2.0%
22620 | 11 | 3.2%
22640 | 11 | 3.2%
23890 | 8 | 2.3%
25060 | 5 | 1.5%
25970 | 5 | 1.5%

Extreme values (maximum 10)

Value | Count | Frequency (%)
52930 | 4 | 1.2%
51150 | 34 | 9.9%
50950 | 4 | 1.2%
50820 | 3 | 0.9%
49170 | 34 | 9.9%
49020 | 32 | 9.3%
48240 | 1 | 0.3%
46460 | 2 | 0.6%
45030 | 1 | 0.3%
44880 | 34 | 9.9%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

 | 년산 | 작물명 | 품종명 | 공급단위 | 소독구분 | 공급가격
년산 | 1.000 | 0.215 | 0.000 | 0.035 | 0.104 | 0.714
작물명 | 0.215 | 1.000 | 0.949 | 1.000 | 0.191 | 0.833
품종명 | 0.000 | 0.949 | 1.000 | 1.000 | 0.000 | 0.881
공급단위 | 0.035 | 1.000 | 1.000 | 1.000 | 0.146 | 0.879
소독구분 | 0.104 | 0.191 | 0.000 | 0.146 | 1.000 | 0.735
공급가격 | 0.714 | 0.833 | 0.881 | 0.879 | 0.735 | 1.000

 | 공급단위 | 소독구분 | 년산 | 작물명
공급단위 | 1.000 | 0.093 | 0.058 | 0.984
소독구분 | 0.093 | 1.000 | 0.172 | 0.174
년산 | 0.058 | 0.172 | 1.000 | 0.121
작물명 | 0.984 | 0.174 | 0.121 | 1.000

 | 공급가격 | 년산 | 작물명 | 공급단위 | 소독구분
공급가격 | 1.000 | 0.575 | 0.542 | 0.707 | 0.563
년산 | 0.575 | 1.000 | 0.121 | 0.058 | 0.172
작물명 | 0.542 | 0.121 | 1.000 | 0.984 | 0.174
공급단위 | 0.707 | 0.058 | 0.984 | 1.000 | 0.093
소독구분 | 0.563 | 0.172 | 0.174 | 0.093 | 1.000
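
The High correlation alerts near the top of the report can be read off the last matrix above (the one that lists 공급가격 first). The sketch below flags every off-diagonal entry at or above a threshold. The 0.5 cut-off is an assumption, since the report does not state which threshold it used, and the function name is illustrative.

```python
# Last correlation matrix from the report, in the order shown.
names = ["공급가격", "년산", "작물명", "공급단위", "소독구분"]
matrix = [
    [1.000, 0.575, 0.542, 0.707, 0.563],
    [0.575, 1.000, 0.121, 0.058, 0.172],
    [0.542, 0.121, 1.000, 0.984, 0.174],
    [0.707, 0.058, 0.984, 1.000, 0.093],
    [0.563, 0.172, 0.174, 0.093, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off; not stated in the report

def high_correlation_partners(names, matrix, threshold):
    """Map each variable to the other variables it correlates with at or above threshold."""
    partners = {}
    for i, name in enumerate(names):
        hits = [names[j] for j, value in enumerate(matrix[i])
                if j != i and value >= threshold]
        if hits:
            partners[name] = hits
    return partners

partners = high_correlation_partners(names, matrix, THRESHOLD)
for name, hits in partners.items():
    print(f"{name} is highly correlated with: {', '.join(hits)}")
```

Under this assumption, 공급가격 is flagged with all four other variables, 공급단위 and 작물명 with two each, and 년산 and 소독구분 with 공급가격 only, which agrees with the five alerts listed in the report.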

Missing values

[Figure: a simple visualization of nullity by column.]
[Figure: nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

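First rows

Row | 년산 | 작물명 | 품종명 | 공급단위 | 소독구분 | 공급가격
0 | 2020 | (not shown) | 금강밀 | 20 | 소독 | 27170
1 | 2020 | (not shown) | 금강밀 | 20 | 미소독 | 25970
2 | 2020 | (not shown) | 백강밀 | 20 | 미소독 | 25970
3 | 2020 | (not shown) | 백강밀 | 20 | 소독 | 27170
4 | 2020 | (not shown) | 새금강밀 | 20 | 미소독 | 25970
5 | 2020 | (not shown) | 새금강밀 | 20 | 소독 | 27170
6 | 2020 | (not shown) | 조경밀 | 20 | 소독 | 27170
7 | 2020 | (not shown) | 조경밀 | 20 | 미소독 | 25970
8 | 2020 | 밀(비축) | 조경밀 | 20 | 소독 | 27170
9 | 2020 | 밀(비축) | 조경밀 | 20 | 미소독 | 25970

Last rows

Row | 년산 | 작물명 | 품종명 | 공급단위 | 소독구분 | 공급가격
333 | 2022 | (not shown) | 태광콩 | 5 | 미소독 | 27800
334 | 2022 | (not shown) | 태광콩 | 5 | 소독 | 27800
335 | 2022 | (not shown) | 풍산나물콩 | 5 | 미소독 | 29700
336 | 2022 | (not shown) | 풍산나물콩 | 5 | 소독 | 29700
337 | 2022 | 콩(비축) | 선풍콩 | 5 | 미소독 | 27800
338 | 2022 | 콩(비축) | 선풍콩 | 5 | 소독 | 27800
339 | 2022 | (not shown) | 아라리팥 | 5 | 미소독 | 41880
340 | 2022 | (not shown) | 아라리팥 | 5 | 소독 | 41880
341 | 2022 | 호밀 | 곡우 | 20 | 미소독 | 41590
342 | 2022 | 호밀(비축) | 곡우 | 20 | 미소독 | 41590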