Overview

Dataset statistics

Number of variables: 11
Number of observations: 2424
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 217.9 KiB
Average record size in memory: 92.1 B
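
The average record size can be checked against the total size and the number of observations. The snippet below is a minimal sketch, assuming 1 KiB = 1024 bytes and that the report divides total memory by the row count; the variable names are illustrative only.

```python
# Figures copied from the dataset statistics above.
total_kib = 217.9        # total size in memory, in KiB
n_observations = 2424    # number of rows

# Assumption: 1 KiB = 1024 bytes and average = total bytes / rows.
average_record_bytes = total_kib * 1024 / n_observations

print(f"{average_record_bytes:.1f} B")
```

The result rounds to the reported 92.1 B.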

Variable types

Categorical: 7
DateTime: 2
Numeric: 2

Dataset

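Description: Status of the cleaning (정선) storage bins for government-supplied seed. The dataset provides the production year (년산), branch name (지원명), department name (부서명), date (날짜), room number (방번호), crop name (작물명), variety name (품종명), category (구분), storage bin quantity (저장빈량), moisture (수분), and related information.
URL: https://www.data.go.kr/data/15066324/fileData.do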

Alerts

데이터추출일 has constant value "" (Constant)
부서명 is highly overall correlated with 지원명 and 1 other field (High correlation)
지원명 is highly overall correlated with 부서명 and 1 other field (High correlation)
작물명 is highly overall correlated with 품종명 and 1 other field (High correlation)
품종명 is highly overall correlated with 지원명 and 3 other fields (High correlation)
구분 is highly overall correlated with 수분 (High correlation)
수분 is highly overall correlated with 작물명 and 2 other fields (High correlation)
작물명 is highly imbalanced (72.5%) (Imbalance)
방번호 has 51 (2.1%) zeros (Zeros)
저장빈량 has 47 (1.9%) zeros (Zeros)
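
The 72.5% imbalance figure for 작물명 can be reproduced from the category counts listed in that variable's Common Values table. The sketch below assumes the score is one minus the Shannon entropy normalised by the maximum entropy for the number of classes; that formula is an assumption, not something the report states, and `imbalance_score` is a hypothetical helper name.

```python
import math

# Counts per 작물명 category, copied from its Common Values table.
counts = [2121, 161, 121, 16, 4, 1]


def imbalance_score(counts):
    """Return 1 - H / log2(k).

    0 means a perfectly uniform column; values near 1 mean one class dominates.
    Assumption: this mirrors the formula behind the report's imbalance alert.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # convention chosen here for single-class columns
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


score = imbalance_score(counts)
print(f"{score:.1%}")
```

Under this assumption the score comes out at about 0.7246, which matches the 72.5% shown in the alert.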

Reproduction

Analysis started: 2023-12-12 18:57:39.777114
Analysis finished: 2023-12-12 18:57:41.806101
Duration: 2.03 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
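
The reported duration is the difference between the two timestamps. A minimal check using only the Python standard library (the format string is chosen here to match how the timestamps are printed):

```python
from datetime import datetime

# Timestamp layout as printed in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 18:57:39.777114", FMT)
finished = datetime.strptime("2023-12-12 18:57:41.806101", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```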

Variables

년산
Categorical

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 19.1 KiB

Value  Count
2021   855
2020   808
2022   761

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2020
3rd row: 2020
4th row: 2020
5th row: 2020

Common Values

Value  Count  Frequency (%)
2021   855    35.3%
2020   808    33.3%
2022   761    31.4%
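
The frequency and distinct percentages follow directly from the counts. A small sketch, using the 년산 counts above (variable names are illustrative):

```python
# Counts per 년산 value, copied from the Common Values table.
counts = {"2021": 855, "2020": 808, "2022": 761}

n_observations = sum(counts.values())

# Share of rows per value, in percent, rounded as in the report.
frequency_pct = {
    value: round(100 * count / n_observations, 1)
    for value, count in counts.items()
}

# Share of distinct values relative to the number of rows.
distinct_pct = round(100 * len(counts) / n_observations, 1)

print(n_observations, frequency_pct, distinct_pct)
```

The counts sum to the 2424 observations of the dataset, and the rounded shares agree with the table.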

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

지원명
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 19.1 KiB

Value             Count
충남지원          562
경북지원          479
전북지원          448
전남지원          369
강원지원          179
Other values (3)  387

Length

Max length: 7
Median length: 4
Mean length: 4.164604
Min length: 4
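
The length statistics for 지원명 can be recomputed from the per-value counts in its Common Values table. The sketch below assumes that length means the number of Unicode code points after NFC normalisation, so each Hangul syllable counts as one character; this is an assumption about how the report measures length.

```python
import unicodedata
from statistics import median

# Counts per 지원명 value, copied from the Common Values table.
counts = {
    "충남지원": 562, "경북지원": 479, "전북지원": 448, "전남지원": 369,
    "강원지원": 179, "경남지원": 163, "경기종자관리소": 133, "충북지원": 91,
}

# Expand to one length per row; NFC keeps each syllable as a single code point.
lengths = []
for value, count in counts.items():
    n_chars = len(unicodedata.normalize("NFC", value))
    lengths.extend([n_chars] * count)

mean_length = sum(lengths) / len(lengths)
print(max(lengths), median(lengths), round(mean_length, 6), min(lengths))
```

Only 경기종자관리소 (7 characters, 133 rows) differs from the 4-character branch names, which is why the mean sits just above 4.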

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기종자관리소
2nd row: 경기종자관리소
3rd row: 경기종자관리소
4th row: 경기종자관리소
5th row: 경기종자관리소

Common Values

Value           Count  Frequency (%)
충남지원        562    23.2%
경북지원        479    19.8%
전북지원        448    18.5%
전남지원        369    15.2%
강원지원        179    7.4%
경남지원        163    6.7%
경기종자관리소  133    5.5%
충북지원        91     3.8%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

부서명
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 19.1 KiB

Value  Count
<NA>   1607
정읍   234
함평   226
익산   214
영암   143

Length

Max length: 4
Median length: 4
Mean length: 3.3259076
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value  Count  Frequency (%)
<NA>   1607   66.3%
정읍   234    9.7%
함평   226    9.3%
익산   214    8.8%
영암   143    5.9%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

날짜
Date

Distinct: 371
Distinct (%): 15.3%
Missing: 0
Missing (%): 0.0%
Memory size: 19.1 KiB
Minimum: 2020-06-23 00:00:00
Maximum: 2023-07-03 00:00:00

[Figure: histogram with fixed size bins (bins=50)]

방번호
Real number (ℝ)

ZEROS 

Distinct: 49
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.712046
Minimum: 0
Maximum: 48
Zeros: 51
Zeros (%): 2.1%
Negative: 0
Negative (%): 0.0%
Memory size: 21.4 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 11
Median: 19
Q3: 28
95-th percentile: 38
Maximum: 48
Range: 48
Interquartile range (IQR): 17

Descriptive statistics

Standard deviation: 11.267963
Coefficient of variation (CV): 0.57162827
Kurtosis: -0.6900837
Mean: 19.712046
Median Absolute Deviation (MAD): 8
Skewness: 0.22659282
Sum: 47782
Variance: 126.96699
Monotonicity: Not monotonic
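
Several of the 방번호 statistics are derived from one another, so they can be cross-checked with plain arithmetic. The sketch below uses only figures printed in this section plus the dataset's 2424 observations; the variable names are illustrative.

```python
# Figures copied from the 방번호 statistics above.
n_observations = 2424
total = 47782            # Sum
std = 11.267963          # Standard deviation
q1, q3 = 11, 28          # Quartiles
minimum, maximum = 0, 48

mean = total / n_observations      # Mean = Sum / n
cv = std / mean                    # Coefficient of variation = std / mean
variance = std ** 2                # Variance = std squared
iqr = q3 - q1                      # Interquartile range
value_range = maximum - minimum    # Range

print(round(mean, 6), round(cv, 6), round(variance, 4), iqr, value_range)
```

Each derived value agrees with the reported one up to the rounding of the inputs.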
[Figure: histogram with fixed size bins (bins=49)]

Most frequent values

Value              Count  Frequency (%)
26                 105    4.3%
18                 93     3.8%
14                 88     3.6%
13                 85     3.5%
19                 84     3.5%
10                 83     3.4%
11                 82     3.4%
28                 81     3.3%
20                 80     3.3%
21                 79     3.3%
Other values (39)  1564   64.5%

Smallest values

Value  Count  Frequency (%)
0      51     2.1%
1      22     0.9%
2      62     2.6%
3      64     2.6%
4      51     2.1%
5      49     2.0%
6      64     2.6%
7      37     1.5%
8      34     1.4%
9      42     1.7%

Largest values

Value  Count  Frequency (%)
48     7      0.3%
47     11     0.5%
46     14     0.6%
45     11     0.5%
44     14     0.6%
43     8      0.3%
42     4      0.2%
41     10     0.4%
40     15     0.6%
39     22     0.9%

작물명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 6
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 19.1 KiB

Value        Count
(not shown)  2121
보리         161
(not shown)  121
호밀         16
보리(비축)   4

Length

Max length: 6
Median length: 1
Mean length: 1.0829208
Min length: 1

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: (not shown)
2nd row: (not shown)
3rd row: (not shown)
4th row: (not shown)
5th row: (not shown)

Common Values

Value        Count  Frequency (%)
(not shown)  2121   87.5%
보리         161    6.6%
(not shown)  121    5.0%
호밀         16     0.7%
보리(비축)   4      0.2%
밀(비축)     1      < 0.1%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

품종명
Categorical

HIGH CORRELATION 

Distinct: 47
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 19.1 KiB

Value              Count
삼광벼             435
일품벼             321
신동진벼           253
친들벼             175
새청무             139
Other values (42)  1101

Length

Max length: 6
Median length: 3
Mean length: 3.4005776
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 고시히카리
2nd row: 고시히카리
3rd row: 고시히카리
4th row: 고시히카리
5th row: 고시히카리

Common Values

Value              Count  Frequency (%)
삼광벼             435    17.9%
일품벼             321    13.2%
신동진벼           253    10.4%
친들벼             175    7.2%
새청무             139    5.7%
오대벼             125    5.2%
추청벼             102    4.2%
동진찰벼           99     4.1%
새일미벼           92     3.8%
흰찰쌀보리         70     2.9%
Other values (37)  613    25.3%

Length

[Figure: histogram of lengths of the category]

구분
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 19.1 KiB

Value     Count
건조입고  1548
정밀불출  539
출고      337

Length

Max length: 4
Median length: 4
Mean length: 3.7219472
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 건조입고
2nd row: 건조입고
3rd row: 건조입고
4th row: 건조입고
5th row: 건조입고

Common Values

Value     Count  Frequency (%)
건조입고  1548   63.9%
정밀불출  539    22.2%
출고      337    13.9%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

저장빈량
Real number (ℝ)

ZEROS 

Distinct: 1932
Distinct (%): 79.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 49434.441
Minimum: -731
Maximum: 250000
Zeros: 47
Zeros (%): 1.9%
Negative: 8
Negative (%): 0.3%
Memory size: 21.4 KiB

Quantile statistics

Minimum: -731
5-th percentile: 3000
Q1: 20456.5
Median: 43330
Q3: 77347.25
95-th percentile: 105466.35
Maximum: 250000
Range: 250731
Interquartile range (IQR): 56890.75

Descriptive statistics

Standard deviation: 34459.653
Coefficient of variation (CV): 0.69707784
Kurtosis: -0.57656362
Mean: 49434.441
Median Absolute Deviation (MAD): 27211.5
Skewness: 0.44895794
Sum: 1.1982908 × 10^8
Variance: 1.1874677 × 10^9
Monotonicity: Not monotonic
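
The sum and the zero and negative shares of 저장빈량 can be sanity-checked from the other figures in this section. A minimal sketch, using the dataset's 2424 observations; because the mean is printed with limited precision, the reconstructed sum is only approximate.

```python
# Figures copied from the 저장빈량 statistics above.
n_observations = 2424
mean = 49434.441
zeros = 47
negatives = 8

# Sum is approximately mean * n (the printed mean is rounded).
approx_sum = mean * n_observations

# Shares of zero and negative quantities, in percent.
zeros_pct = round(100 * zeros / n_observations, 1)
negatives_pct = round(100 * negatives / n_observations, 1)

print(f"{approx_sum:.7e}", zeros_pct, negatives_pct)
```

The eight negative quantities are the only values below zero, so any downstream use of this column may want to treat them separately from the 47 zero entries.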
[Figure: histogram with fixed size bins (bins=50)]

Most frequent values

Value                Count  Frequency (%)
100000               113    4.7%
0                    47     1.9%
107000               22     0.9%
4000                 17     0.7%
105000               14     0.6%
103000               13     0.5%
120000               11     0.5%
10000                10     0.4%
3000                 10     0.4%
5000                 7      0.3%
Other values (1922)  2160   89.1%

Smallest values

Value  Count  Frequency (%)
-731   1      < 0.1%
-434   1      < 0.1%
-317   1      < 0.1%
-302   1      < 0.1%
-248   1      < 0.1%
-236   1      < 0.1%
-230   1      < 0.1%
-210   1      < 0.1%
0      47     1.9%
88     1      < 0.1%

Largest values

Value   Count  Frequency (%)
250000  1      < 0.1%
166049  1      < 0.1%
140000  4      0.2%
139957  1      < 0.1%
139569  1      < 0.1%
137136  1      < 0.1%
135601  1      < 0.1%
135525  1      < 0.1%
134126  1      < 0.1%
129029  1      < 0.1%

수분
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 19.1 KiB

Value  Count
15     1470
0      887
12     67

Length

Max length: 2
Median length: 2
Mean length: 1.6340759
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 15
2nd row: 15
3rd row: 15
4th row: 15
5th row: 15

Common Values

Value  Count  Frequency (%)
15     1470   60.6%
0      887    36.6%
12     67     2.8%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

데이터추출일
Date

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 19.1 KiB
Minimum: 2023-07-24 00:00:00
Maximum: 2023-07-24 00:00:00

[Figure: histogram with fixed size bins (bins=1)]

Interactions

[Figures: four interaction plots]

Correlations

[Figure: correlation heatmap]

          년산   지원명  부서명  방번호  작물명  품종명  구분   저장빈량  수분
년산      1.000  0.227   0.000   0.141   0.232   0.399   0.197  0.081     0.250
지원명    0.227  1.000   1.000   0.430   0.261   0.956   0.565  0.400     0.419
부서명    0.000  1.000   1.000   0.455   0.254   0.860   0.274  0.202     0.200
방번호    0.141  0.430   0.455   1.000   0.199   0.751   0.168  0.115     0.126
작물명    0.232  0.261   0.254   0.199   1.000   0.957   0.523  0.185     0.845
품종명    0.399  0.956   0.860   0.751   0.957   1.000   0.677  0.402     0.807
구분      0.197  0.565   0.274   0.168   0.523   0.677   1.000  0.492     0.940
저장빈량  0.081  0.400   0.202   0.115   0.185   0.402   0.492  1.000     0.195
수분      0.250  0.419   0.200   0.126   0.845   0.807   0.940  0.195     1.000

[Figure: correlation heatmap]

        년산   수분   작물명  부서명  품종명  지원명  구분
년산    1.000  0.081  0.098   0.000   0.212   0.147   0.061
수분    0.081  1.000  0.536   0.190   0.582   0.294   0.700
작물명  0.098  0.536  1.000   0.210   0.787   0.148   0.252
부서명  0.000  0.190  0.210   1.000   0.649   0.999   0.182
품종명  0.212  0.582  0.787   0.649   1.000   0.759   0.435
지원명  0.147  0.294  0.148   0.999   0.759   1.000   0.430
구분    0.061  0.700  0.252   0.182   0.435   0.430   1.000

[Figure: correlation heatmap]

          방번호  저장빈량  년산   지원명  부서명  작물명  품종명  구분   수분
방번호    1.000   0.106     0.085  0.223   0.308   0.109   0.369   0.101  0.074
저장빈량  0.106   1.000     0.051  0.143   0.131   0.104   0.162   0.359  0.125
년산      0.085   0.051     1.000  0.147   0.000   0.098   0.212   0.061  0.081
지원명    0.223   0.143     0.147  1.000   0.999   0.148   0.759   0.430  0.294
부서명    0.308   0.131     0.000  0.999   1.000   0.210   0.649   0.182  0.190
작물명    0.109   0.104     0.098  0.148   0.210   1.000   0.787   0.252  0.536
품종명    0.369   0.162     0.212  0.759   0.649   0.787   1.000   0.435  0.582
구분      0.101   0.359     0.061  0.430   0.182   0.252   0.435   1.000  0.700
수분      0.074   0.125     0.081  0.294   0.190   0.536   0.582   0.700  1.000
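
The High correlation entries in the Alerts section can be related to the last correlation matrix above. The sketch below lists, for each variable, the other variables whose coefficient reaches a threshold. The threshold of 0.5 is an assumption made here, not a value stated in the report; with it, the resulting partner counts line up with the alerts.

```python
# Variable order and values copied from the last correlation matrix above.
names = ["방번호", "저장빈량", "년산", "지원명", "부서명", "작물명", "품종명", "구분", "수분"]
matrix = [
    [1.000, 0.106, 0.085, 0.223, 0.308, 0.109, 0.369, 0.101, 0.074],
    [0.106, 1.000, 0.051, 0.143, 0.131, 0.104, 0.162, 0.359, 0.125],
    [0.085, 0.051, 1.000, 0.147, 0.000, 0.098, 0.212, 0.061, 0.081],
    [0.223, 0.143, 0.147, 1.000, 0.999, 0.148, 0.759, 0.430, 0.294],
    [0.308, 0.131, 0.000, 0.999, 1.000, 0.210, 0.649, 0.182, 0.190],
    [0.109, 0.104, 0.098, 0.148, 0.210, 1.000, 0.787, 0.252, 0.536],
    [0.369, 0.162, 0.212, 0.759, 0.649, 0.787, 1.000, 0.435, 0.582],
    [0.101, 0.359, 0.061, 0.430, 0.182, 0.252, 0.435, 1.000, 0.700],
    [0.074, 0.125, 0.081, 0.294, 0.190, 0.536, 0.582, 0.700, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off for a "high" correlation

# For each variable, collect the other variables at or above the threshold.
partners = {}
for i, name in enumerate(names):
    high = [names[j] for j, value in enumerate(matrix[i])
            if j != i and value >= THRESHOLD]
    if high:
        partners[name] = high

for name, high in partners.items():
    print(f"{name}: {', '.join(high)}")
```

For example, 품종명 ends up with four partners (지원명, 부서명, 작물명, 수분), consistent with the alert "품종명 is highly overall correlated with 지원명 and 3 other fields", while 방번호, 저장빈량 and 년산 have none.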

Missing values

[Figure: nullity count] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index  년산  지원명          부서명  날짜        방번호  작물명       품종명      구분      저장빈량  수분  데이터추출일
0      2020  경기종자관리소  <NA>    2020-12-10  16      (not shown)  고시히카리  건조입고  95595     15    2023-07-24
1      2020  경기종자관리소  <NA>    2020-12-11  16      (not shown)  고시히카리  건조입고  4405      15    2023-07-24
2      2020  경기종자관리소  <NA>    2020-12-11  15      (not shown)  고시히카리  건조입고  100000    15    2023-07-24
3      2020  경기종자관리소  <NA>    2020-12-11  14      (not shown)  고시히카리  건조입고  40000     15    2023-07-24
4      2020  경기종자관리소  <NA>    2020-12-11  13      (not shown)  고시히카리  건조입고  54018     15    2023-07-24
5      2020  경기종자관리소  <NA>    2020-12-14  0       (not shown)  고시히카리  건조입고  0         15    2023-07-24
6      2020  경기종자관리소  <NA>    2020-12-02  19      (not shown)  대안벼      건조입고  79691     15    2023-07-24
7      2020  경기종자관리소  <NA>    2020-12-04  18      (not shown)  대안벼      건조입고  58037     15    2023-07-24
8      2020  경기종자관리소  <NA>    2020-12-03  19      (not shown)  대안벼      건조입고  10000     15    2023-07-24
9      2020  경기종자관리소  <NA>    2020-12-03  11      (not shown)  대안벼      건조입고  93000     15    2023-07-24

Last rows

Index  년산  지원명    부서명  날짜        방번호  작물명       품종명  구분      저장빈량  수분  데이터추출일
2414   2022  강원지원  <NA>    2022-11-30  2       (not shown)  삼광벼  건조입고  40000     15    2023-07-24
2415   2022  강원지원  <NA>    2022-11-30  1       (not shown)  삼광벼  건조입고  76078     15    2023-07-24
2416   2022  강원지원  <NA>    2022-12-02  1       (not shown)  삼광벼  건조입고  -210      15    2023-07-24
2417   2022  강원지원  <NA>    2023-02-22  3       (not shown)  삼광벼  정밀불출  22310     0     2023-07-24
2418   2022  강원지원  <NA>    2023-03-08  2       (not shown)  삼광벼  정밀불출  2804      0     2023-07-24
2419   2022  강원지원  <NA>    2023-03-08  1       (not shown)  삼광벼  정밀불출  18106     0     2023-07-24
2420   2022  강원지원  <NA>    2023-03-09  1       (not shown)  삼광벼  정밀불출  23310     0     2023-07-24
2421   2022  강원지원  <NA>    2023-03-02  2       (not shown)  삼광벼  정밀불출  21107     0     2023-07-24
2422   2022  강원지원  <NA>    2023-03-06  2       (not shown)  삼광벼  정밀불출  22304     0     2023-07-24
2423   2022  강원지원  <NA>    2023-03-07  2       (not shown)  삼광벼  정밀불출  28407     0     2023-07-24