Overview

Dataset statistics

Number of variables9
Number of observations786
Missing cells28
Missing cells (%)0.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory56.9 KiB
Average record size in memory74.2 B

Variable types

Categorical3
Text2
DateTime3
Numeric1

Dataset

Description정부보급종 파종 실적으로 년산,지원명,기관명,작물명,품종명,시작일,종료일,파종이앙실적(a) 등의 정보가 제공됩니다.
URLhttps://www.data.go.kr/data/15066283/fileData.do

Alerts

데이터추출일 has constant value ""Constant
시작일 has 14 (1.8%) missing valuesMissing
종료일 has 14 (1.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 20:39:39.551401
Analysis finished2023-12-12 20:39:40.626993
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년산
Categorical

Distinct3
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
2022
265 
2021
261 
2020
260 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2022 265
33.7%
2021 261
33.2%
2020 260
33.1%

Length

2023-12-13T05:39:40.699035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:39:41.135437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 265
33.7%
2021 261
33.2%
2020 260
33.1%

지원명
Categorical

Distinct8
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
전북지원
161 
전남지원
138 
충남지원
119 
경북지원
100 
경남지원
80 
Other values (3)
188 

Length

Max length7
Median length4
Mean length4.2938931
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기종자관리소
2nd row경기종자관리소
3rd row경기종자관리소
4th row경기종자관리소
5th row경기종자관리소

Common Values

ValueCountFrequency (%)
전북지원 161
20.5%
전남지원 138
17.6%
충남지원 119
15.1%
경북지원 100
12.7%
경남지원 80
10.2%
경기종자관리소 77
9.8%
충북지원 56
 
7.1%
강원지원 55
 
7.0%

Length

2023-12-13T05:39:41.278263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:39:41.389857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전북지원 161
20.5%
전남지원 138
17.6%
충남지원 119
15.1%
경북지원 100
12.7%
경남지원 80
10.2%
경기종자관리소 77
9.8%
충북지원 56
 
7.1%
강원지원 55
 
7.0%
Distinct247
Distinct (%)31.4%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
2023-12-13T05:39:41.725453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length2
Mean length2.5559796
Min length2

Characters and Unicode

Total characters2009
Distinct characters174
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)3.8%

Sample

1st row적성
2nd row포승
3rd row양교단지
4th row신흥단지
5th row중대단지
ValueCountFrequency (%)
공음 14
 
1.7%
대죽 12
 
1.5%
남산 9
 
1.1%
왕태 9
 
1.1%
두곡 8
 
1.0%
고수 7
 
0.9%
황룡위탁영농 7
 
0.9%
좌운 7
 
0.9%
신전 6
 
0.7%
6
 
0.7%
Other values (248) 735
89.6%
2023-12-13T05:39:42.202260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
109
 
5.4%
97
 
4.8%
70
 
3.5%
68
 
3.4%
54
 
2.7%
48
 
2.4%
48
 
2.4%
42
 
2.1%
38
 
1.9%
35
 
1.7%
Other values (164) 1400
69.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1894
94.3%
Space Separator 68
 
3.4%
Decimal Number 18
 
0.9%
Open Punctuation 15
 
0.7%
Close Punctuation 14
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
109
 
5.8%
97
 
5.1%
70
 
3.7%
54
 
2.9%
48
 
2.5%
48
 
2.5%
42
 
2.2%
38
 
2.0%
35
 
1.8%
33
 
1.7%
Other values (159) 1320
69.7%
Decimal Number
ValueCountFrequency (%)
1 9
50.0%
2 9
50.0%
Space Separator
ValueCountFrequency (%)
68
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1894
94.3%
Common 115
 
5.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
109
 
5.8%
97
 
5.1%
70
 
3.7%
54
 
2.9%
48
 
2.5%
48
 
2.5%
42
 
2.2%
38
 
2.0%
35
 
1.8%
33
 
1.7%
Other values (159) 1320
69.7%
Common
ValueCountFrequency (%)
68
59.1%
( 15
 
13.0%
) 14
 
12.2%
1 9
 
7.8%
2 9
 
7.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1894
94.3%
ASCII 115
 
5.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
109
 
5.8%
97
 
5.1%
70
 
3.7%
54
 
2.9%
48
 
2.5%
48
 
2.5%
42
 
2.2%
38
 
2.0%
35
 
1.8%
33
 
1.7%
Other values (159) 1320
69.7%
ASCII
ValueCountFrequency (%)
68
59.1%
( 15
 
13.0%
) 14
 
12.2%
1 9
 
7.8%
2 9
 
7.8%

작물명
Categorical

Distinct6
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
472 
171 
보리
77 
51 
 
9

Length

Max length2
Median length1
Mean length1.105598
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
472
60.1%
171
 
21.8%
보리 77
 
9.8%
51
 
6.5%
9
 
1.1%
호밀 6
 
0.8%

Length

2023-12-13T05:39:42.344997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:39:42.452489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
472
60.1%
171
 
21.8%
보리 77
 
9.8%
51
 
6.5%
9
 
1.1%
호밀 6
 
0.8%
Distinct58
Distinct (%)7.4%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
2023-12-13T05:39:42.649582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.4923664
Min length2

Characters and Unicode

Total characters2745
Distinct characters74
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)0.5%

Sample

1st row대원콩
2nd row대원콩
3rd row추청벼
4th row추청벼
5th row추청벼
ValueCountFrequency (%)
대원콩 97
 
12.3%
삼광벼 72
 
9.2%
신동진벼 69
 
8.8%
새청무 44
 
5.6%
추청벼 37
 
4.7%
일품벼 37
 
4.7%
흰찰쌀보리 29
 
3.7%
새일미벼 27
 
3.4%
오대벼 24
 
3.1%
영호진미 23
 
2.9%
Other values (48) 327
41.6%
2023-12-13T05:39:43.067901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
352
 
12.8%
171
 
6.2%
144
 
5.2%
116
 
4.2%
108
 
3.9%
100
 
3.6%
97
 
3.5%
87
 
3.2%
84
 
3.1%
84
 
3.1%
Other values (64) 1402
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2734
99.6%
Decimal Number 11
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
352
 
12.9%
171
 
6.3%
144
 
5.3%
116
 
4.2%
108
 
4.0%
100
 
3.7%
97
 
3.5%
87
 
3.2%
84
 
3.1%
84
 
3.1%
Other values (63) 1391
50.9%
Decimal Number
ValueCountFrequency (%)
1 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2734
99.6%
Common 11
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
352
 
12.9%
171
 
6.3%
144
 
5.3%
116
 
4.2%
108
 
4.0%
100
 
3.7%
97
 
3.5%
87
 
3.2%
84
 
3.1%
84
 
3.1%
Other values (63) 1391
50.9%
Common
ValueCountFrequency (%)
1 11
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2734
99.6%
ASCII 11
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
352
 
12.9%
171
 
6.3%
144
 
5.3%
116
 
4.2%
108
 
4.0%
100
 
3.7%
97
 
3.5%
87
 
3.2%
84
 
3.1%
84
 
3.1%
Other values (63) 1391
50.9%
ASCII
ValueCountFrequency (%)
1 11
100.0%

시작일
Date

MISSING 

Distinct201
Distinct (%)26.0%
Missing14
Missing (%)1.8%
Memory size6.3 KiB
Minimum2019-10-15 00:00:00
Maximum2022-06-20 00:00:00
2023-12-13T05:39:43.256194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:39:43.382180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

종료일
Date

MISSING 

Distinct208
Distinct (%)26.9%
Missing14
Missing (%)1.8%
Memory size6.3 KiB
Minimum2019-10-18 00:00:00
Maximum2022-07-08 00:00:00
2023-12-13T05:39:43.539714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:39:43.695911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

파종이앙실적(a)
Real number (ℝ)

Distinct363
Distinct (%)46.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2317.3728
Minimum18
Maximum8050
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.0 KiB
2023-12-13T05:39:43.827491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum18
5-th percentile777
Q11510
median2168.5
Q33000
95-th percentile4395
Maximum8050
Range8032
Interquartile range (IQR)1490

Descriptive statistics

Standard deviation1142.1798
Coefficient of variation (CV)0.49287702
Kurtosis1.4168904
Mean2317.3728
Median Absolute Deviation (MAD)718.5
Skewness0.85347326
Sum1821455
Variance1304574.6
MonotonicityNot monotonic
2023-12-13T05:39:43.974896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2000 28
 
3.6%
1000 25
 
3.2%
2500 20
 
2.5%
2200 18
 
2.3%
1600 16
 
2.0%
2600 13
 
1.7%
1500 13
 
1.7%
2100 11
 
1.4%
3000 11
 
1.4%
1700 10
 
1.3%
Other values (353) 621
79.0%
ValueCountFrequency (%)
18 1
0.1%
21 1
0.1%
120 1
0.1%
140 1
0.1%
161 1
0.1%
184 1
0.1%
199 1
0.1%
200 1
0.1%
272 1
0.1%
303 1
0.1%
ValueCountFrequency (%)
8050 1
0.1%
6977 1
0.1%
6600 1
0.1%
6460 1
0.1%
6160 1
0.1%
6099 1
0.1%
6015 2
0.3%
5861 1
0.1%
5737 1
0.1%
5510 1
0.1%

데이터추출일
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
Minimum2023-07-24 00:00:00
Maximum2023-07-24 00:00:00
2023-12-13T05:39:44.121732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:39:44.291248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T05:39:40.074381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:39:44.405423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년산지원명작물명품종명파종이앙실적(a)
년산1.0000.0000.0970.0000.000
지원명0.0001.0000.2630.9310.342
작물명0.0970.2631.0001.0000.390
품종명0.0000.9311.0001.0000.578
파종이앙실적(a)0.0000.3420.3900.5781.000
2023-12-13T05:39:44.510162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지원명작물명년산
지원명1.0000.1490.000
작물명0.1491.0000.040
년산0.0000.0401.000
2023-12-13T05:39:44.605480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
파종이앙실적(a)년산지원명작물명
파종이앙실적(a)1.0000.0000.1700.217
년산0.0001.0000.0000.040
지원명0.1700.0001.0000.149
작물명0.2170.0400.1491.000

Missing values

2023-12-13T05:39:40.252589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:39:40.403684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T05:39:40.556610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

년산지원명기관명작물명품종명시작일종료일파종이앙실적(a)데이터추출일
02020경기종자관리소적성대원콩2020-05-252020-06-061992023-07-24
12020경기종자관리소포승대원콩2020-05-252020-06-062722023-07-24
22020경기종자관리소양교단지추청벼2020-05-162020-05-2830402023-07-24
32020경기종자관리소신흥단지추청벼2020-05-132020-05-3025392023-07-24
42020경기종자관리소중대단지추청벼2020-05-072020-05-2524902023-07-24
52020경기종자관리소중대단지참드림2020-05-072020-05-2511602023-07-24
62020경기종자관리소원창단지추청벼2020-05-082020-05-2346502023-07-24
72020경기종자관리소창신단지추청벼2020-05-062020-05-2540302023-07-24
82020경기종자관리소안화단지추청벼2020-05-182020-06-0229202023-07-24
92020경기종자관리소동고단지고시히카리2020-05-082020-05-2044002023-07-24
년산지원명기관명작물명품종명시작일종료일파종이앙실적(a)데이터추출일
7762022강원지원화지오대벼2022-04-292022-05-1032002023-07-24
7772022강원지원좌운대원콩2022-06-022022-06-2019002023-07-24
7782022강원지원내포오대벼2022-04-292022-05-0934002023-07-24
7792022강원지원주천청아콩2022-06-142022-06-2516002023-07-24
7802022강원지원자등대원콩2022-06-032022-06-2017002023-07-24
7812022강원지원잠곡대원콩2022-06-092022-06-2319002023-07-24
7822022강원지원용사대원콩2022-06-102022-06-2323002023-07-24
7832022강원지원사천오륜벼2022-05-162022-05-2010002023-07-24
7842022강원지원공순원대원콩2022-06-092022-06-2116002023-07-24
7852022강원지원동면아라리팥2022-06-142022-07-0726002023-07-24