Overview

Dataset statistics

Number of variables: 8
Number of observations: 289
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 19.3 KiB
Average record size in memory: 68.5 B

Variable types

Categorical: 4
Text: 1
Numeric: 3

Dataset

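Description: GAP-certified farm information for Jeju Special Self-Governing Province (제주특별자치도). It provides the producer, item, number of farms, cultivation area, and related details for GAP-certified farms in the province.
Author: 제주특별자치도 (Jeju Special Self-Governing Province)
URL: https://www.data.go.kr/data/15096835/fileData.do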

Alerts

데이터 기준일 has constant value "2021-11-24" (Constant)
농가 수 is highly overall correlated with 재배면적 and 1 other field (High correlation)
재배면적 is highly overall correlated with 농가 수 and 1 other field (High correlation)
생산계획량(톤) is highly overall correlated with 농가 수 and 1 other field (High correlation)

Reproduction

Analysis started: 2023-12-12 09:37:51.672625
Analysis finished: 2023-12-12 09:37:53.507013
Duration: 1.83 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

지정 연도 (designation year)
Categorical

Distinct: 3
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

2020 | 139
2021 | 102
2019 | 48

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021
2nd row: 2021
3rd row: 2021
4th row: 2021
5th row: 2021

Common Values

Value | Count | Frequency (%)
2020 | 139 | 48.1%
2021 | 102 | 35.3%
2019 | 48 | 16.6%
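
Each Frequency (%) figure is the value's count divided by the number of observations (289). The sketch below redoes that arithmetic from the counts reported for 지정 연도; the helper name `frequency_table` is illustrative and is not part of ydata-profiling.

```python
def frequency_table(counts):
    """Return {value: share of the total in percent, rounded to one decimal}."""
    total = sum(counts.values())
    return {value: round(100 * n / total, 1) for value, n in counts.items()}


# Counts for 지정 연도 as reported in the Common Values table.
year_counts = {"2020": 139, "2021": 102, "2019": 48}
year_freq = frequency_table(year_counts)
print(year_freq)
```

The three counts add up to 289, so the percentages cover every row of the dataset.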

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed under Common Values]

생산자 (producer)
Text

Distinct: 288
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 22
Median length: 3
Mean length: 4.9723183
Min length: 2

Characters and Unicode

Total characters: 1437
Distinct characters: 250
Distinct categories: 7
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
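
As a rough illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The report's labels correspond to the two-letter codes `Lo` (Other Letter), `Lu` (Uppercase Letter), `Ll` (Lowercase Letter), `Nd` (Decimal Number), `Zs` (Space Separator), `Ps` (Open Punctuation) and `Pe` (Close Punctuation). The helper `category_counts` is a hypothetical name, and this is a sketch rather than the profiler's own implementation.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# A producer name that appears in the report's sample rows.
name = "중문농협GAP감귤공선회5"
counts = category_counts(name)
print(counts)
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the category table for this column.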

Unique

Unique: 287
Unique (%): 99.3%
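
In this report, Distinct counts the different values in a column, while Unique counts the values that occur exactly once. With 289 rows, 288 distinct values and 287 unique values, exactly one producer name appears twice. A minimal sketch of the two counts on made-up data (the function name and the toy list are illustrative only):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Hypothetical values, not taken from the dataset.
toy = ["a", "b", "b", "c"]
result = distinct_and_unique(toy)
print(result)
```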

Sample

1st row: 에코밸리농업주식회사
2nd row: 유계준
3rd row: 대영농장
4th row: 아람농원
5th row: 이가농원

Value | Count | Frequency (%)
서귀포농협 | 4 | 1.3%
공선회 | 2 | 0.6%
남원농협 | 2 | 0.6%
강재현 | 2 | 0.6%
농업회사법인 | 2 | 0.6%
박형찬 | 2 | 0.6%
감귤gap | 2 | 0.6%
이정오 | 1 | 0.3%
박정식 | 1 | 0.3%
김대호 | 1 | 0.3%
Other values (292) | 292 | 93.9%

Most occurring characters

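Value | Count | Frequency (%)
(not preserved) | 82 | 5.7%
(space) | 51 | 3.5%
(not preserved) | 39 | 2.7%
(not preserved) | 33 | 2.3%
(not preserved) | 33 | 2.3%
(not preserved) | 32 | 2.2%
(not preserved) | 31 | 2.2%
(not preserved) | 29 | 2.0%
(not preserved) | 29 | 2.0%
(not preserved) | 28 | 1.9%
Other values (240) | 1050 | 73.1%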

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1320 | 91.9%
Space Separator | 51 | 3.5%
Uppercase Letter | 45 | 3.1%
Decimal Number | 12 | 0.8%
Close Punctuation | 4 | 0.3%
Open Punctuation | 4 | 0.3%
Lowercase Letter | 1 | 0.1%

Most frequent character per category

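Other Letter
Value | Count | Frequency (%)
(not preserved) | 82 | 6.2%
(not preserved) | 39 | 3.0%
(not preserved) | 33 | 2.5%
(not preserved) | 33 | 2.5%
(not preserved) | 32 | 2.4%
(not preserved) | 31 | 2.3%
(not preserved) | 29 | 2.2%
(not preserved) | 29 | 2.2%
(not preserved) | 28 | 2.1%
(not preserved) | 25 | 1.9%
Other values (223) | 959 | 72.7%

Uppercase Letter
Value | Count | Frequency (%)
P | 14 | 31.1%
A | 13 | 28.9%
G | 13 | 28.9%
S | 1 | 2.2%
U | 1 | 2.2%
I | 1 | 2.2%
V | 1 | 2.2%
N | 1 | 2.2%

Decimal Number
Value | Count | Frequency (%)
2 | 5 | 41.7%
1 | 2 | 16.7%
3 | 2 | 16.7%
4 | 2 | 16.7%
5 | 1 | 8.3%

Space Separator
Value | Count | Frequency (%)
(space) | 51 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 4 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 4 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
s | 1 | 100.0%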

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1318 | 91.7%
Common | 71 | 4.9%
Latin | 46 | 3.2%
Han | 2 | 0.1%

Most frequent character per script

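Hangul
Value | Count | Frequency (%)
(not preserved) | 82 | 6.2%
(not preserved) | 39 | 3.0%
(not preserved) | 33 | 2.5%
(not preserved) | 33 | 2.5%
(not preserved) | 32 | 2.4%
(not preserved) | 31 | 2.4%
(not preserved) | 29 | 2.2%
(not preserved) | 29 | 2.2%
(not preserved) | 28 | 2.1%
(not preserved) | 25 | 1.9%
Other values (221) | 957 | 72.6%

Latin
Value | Count | Frequency (%)
P | 14 | 30.4%
A | 13 | 28.3%
G | 13 | 28.3%
s | 1 | 2.2%
S | 1 | 2.2%
U | 1 | 2.2%
I | 1 | 2.2%
V | 1 | 2.2%
N | 1 | 2.2%

Common
Value | Count | Frequency (%)
(space) | 51 | 71.8%
2 | 5 | 7.0%
) | 4 | 5.6%
( | 4 | 5.6%
1 | 2 | 2.8%
3 | 2 | 2.8%
4 | 2 | 2.8%
5 | 1 | 1.4%

Han
Value | Count | Frequency (%)
(not preserved) | 1 | 50.0%
(not preserved) | 1 | 50.0%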

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1318 | 91.7%
ASCII | 117 | 8.1%
CJK | 2 | 0.1%

Most frequent character per block

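Hangul
Value | Count | Frequency (%)
(not preserved) | 82 | 6.2%
(not preserved) | 39 | 3.0%
(not preserved) | 33 | 2.5%
(not preserved) | 33 | 2.5%
(not preserved) | 32 | 2.4%
(not preserved) | 31 | 2.4%
(not preserved) | 29 | 2.2%
(not preserved) | 29 | 2.2%
(not preserved) | 28 | 2.1%
(not preserved) | 25 | 1.9%
Other values (221) | 957 | 72.6%

ASCII
Value | Count | Frequency (%)
(space) | 51 | 43.6%
P | 14 | 12.0%
A | 13 | 11.1%
G | 13 | 11.1%
2 | 5 | 4.3%
) | 4 | 3.4%
( | 4 | 3.4%
1 | 2 | 1.7%
3 | 2 | 1.7%
4 | 2 | 1.7%
Other values (7) | 7 | 6.0%

CJK
Value | Count | Frequency (%)
(not preserved) | 1 | 50.0%
(not preserved) | 1 | 50.0%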

품목 (item)
Categorical

Distinct: 37
Distinct (%): 12.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

감귤 | 132
딸기 | 26
한라봉 | 17
레드향 | 16
천혜향 | 12
Other values (32) | 86

Length

Max length: 12
Median length: 2
Mean length: 2.83391
Min length: 1

Unique

Unique: 15
Unique (%): 5.2%

Sample

1st row: 당근
2nd row: 천혜향
3rd row: 감귤
4th row: 감귤
5th row: 참다래(키위)

Common Values

Value | Count | Frequency (%)
감귤 | 132 | 45.7%
딸기 | 26 | 9.0%
한라봉 | 17 | 5.9%
레드향 | 16 | 5.5%
천혜향 | 12 | 4.2%
블루베리 | 12 | 4.2%
만감 | 9 | 3.1%
일반양배추 | 7 | 2.4%
하우스감귤 | 5 | 1.7%
브로코리(녹색꽃양배추) | 5 | 1.7%
Other values (27) | 48 | 16.6%

Length

[Figure: Histogram of lengths of the category]

농가 수 (number of farms)
Real number (ℝ)

HIGH CORRELATION

Distinct: 35
Distinct (%): 12.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.892734
Minimum: 1
Maximum: 704
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 1
95-th percentile: 103.2
Maximum: 704
Range: 703
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 79.393484
Coefficient of variation (CV): 3.9910796
Kurtosis: 41.170196
Mean: 19.892734
Median Absolute Deviation (MAD): 0
Skewness: 6.0867736
Sum: 5749
Variance: 6303.3253
Monotonicity: Not monotonic
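
Several of these figures follow from one another: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the standard deviation squared. The sketch below re-derives them from the values reported for 농가 수; it only checks the arithmetic and does not recompute anything from the raw data.

```python
# Values reported for 농가 수 in this report.
n = 289            # Number of observations
total = 5749       # Sum
std = 79.393484    # Standard deviation

mean = total / n       # reported as Mean
cv = std / mean        # reported as Coefficient of variation (CV)
variance = std ** 2    # reported as Variance

print(round(mean, 6), round(cv, 4), round(variance, 2))
```

A CV near 4 alongside a median of 1 reflects a column in which most rows are single farms and a few cooperatives account for hundreds of farms each.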

[Figure: Histogram with fixed size bins (bins=35)]

Common Values

Value | Count | Frequency (%)
1 | 249 | 86.2%
35 | 2 | 0.7%
185 | 2 | 0.7%
5 | 2 | 0.7%
30 | 2 | 0.7%
46 | 2 | 0.7%
7 | 2 | 0.7%
704 | 1 | 0.3%
140 | 1 | 0.3%
11 | 1 | 0.3%
Other values (25) | 25 | 8.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 249 | 86.2%
5 | 2 | 0.7%
7 | 2 | 0.7%
11 | 1 | 0.3%
19 | 1 | 0.3%
23 | 1 | 0.3%
26 | 1 | 0.3%
30 | 2 | 0.7%
35 | 2 | 0.7%
39 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
704 | 1 | 0.3%
603 | 1 | 0.3%
552 | 1 | 0.3%
516 | 1 | 0.3%
328 | 1 | 0.3%
297 | 1 | 0.3%
233 | 1 | 0.3%
185 | 2 | 0.7%
176 | 1 | 0.3%
145 | 1 | 0.3%

재배면적 (cultivation area)
Real number (ℝ)

HIGH CORRELATION

Distinct: 276
Distinct (%): 95.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 196286.56
Minimum: 428
Maximum: 7310861
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.7 KiB

Quantile statistics

Minimum: 428
5-th percentile: 2168.8
Q1: 4290
Median: 8590
Q3: 22047
95-th percentile: 996805.38
Maximum: 7310861
Range: 7310433
Interquartile range (IQR): 17757
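
Range and IQR are simple differences of the quantiles listed above. The sketch below reproduces both for 재배면적 and, as an illustrative extra that the report itself does not compute, adds Tukey's conventional upper fence (Q3 + 1.5 × IQR) to show how far the upper tail extends.

```python
# Quantile statistics reported for 재배면적.
minimum, q1, q3, maximum = 428, 4290, 22047, 7310861
p95 = 996805.38

value_range = maximum - minimum   # reported as Range
iqr = q3 - q1                     # reported as Interquartile range (IQR)

# Tukey's upper fence: an assumption added for illustration only,
# not a statistic produced by the report.
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr, upper_fence, p95 > upper_fence)
```

The 95th percentile lies far above the fence, which matches the strong positive skewness reported for this column.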

Descriptive statistics

Standard deviation: 787394.87
Coefficient of variation (CV): 4.0114559
Kurtosis: 42.365546
Mean: 196286.56
Median Absolute Deviation (MAD): 5254
Skewness: 6.1664255
Sum: 56726816
Variance: 6.1999069 × 10^11
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
3960.0 | 7 | 2.4%
6600.0 | 2 | 0.7%
2310.0 | 2 | 0.7%
9900.0 | 2 | 0.7%
4290.0 | 2 | 0.7%
3795.0 | 2 | 0.7%
2970.0 | 2 | 0.7%
5280.0 | 2 | 0.7%
3306.0 | 1 | 0.3%
4455.0 | 1 | 0.3%
Other values (266) | 266 | 92.0%

Minimum 10 values

Value | Count | Frequency (%)
428.0 | 1 | 0.3%
660.0 | 1 | 0.3%
694.0 | 1 | 0.3%
826.0 | 1 | 0.3%
1170.0 | 1 | 0.3%
1354.0 | 1 | 0.3%
1442.0 | 1 | 0.3%
1495.0 | 1 | 0.3%
1652.0 | 1 | 0.3%
1692.0 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
7310861.0 | 1 | 0.3%
5498549.5 | 1 | 0.3%
5393331.15 | 1 | 0.3%
5018631.2 | 1 | 0.3%
3888958.0 | 1 | 0.3%
3153749.02 | 1 | 0.3%
2178616.0 | 1 | 0.3%
1754826.0 | 1 | 0.3%
1603120.7 | 1 | 0.3%
1529042.7 | 1 | 0.3%

생산계획량(톤) (planned production, tonnes)
Real number (ℝ)

HIGH CORRELATION

Distinct: 162
Distinct (%): 56.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 982.83168
Minimum: 0.5
Maximum: 40926.72
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.7 KiB

Quantile statistics

Minimum: 0.5
5-th percentile: 3
Q1: 10
Median: 24.8
Q3: 63
95-th percentile: 4720.1479
Maximum: 40926.72
Range: 40926.22
Interquartile range (IQR): 53

Descriptive statistics

Standard deviation: 4217.007
Coefficient of variation (CV): 4.2906706
Kurtosis: 55.656667
Mean: 982.83168
Median Absolute Deviation (MAD): 16.8
Skewness: 6.878427
Sum: 284038.36
Variance: 17783148
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value | Count | Frequency (%)
10.0 | 15 | 5.2%
15.0 | 14 | 4.8%
20.0 | 11 | 3.8%
12.0 | 11 | 3.8%
3.0 | 8 | 2.8%
25.0 | 8 | 2.8%
5.0 | 7 | 2.4%
2.0 | 7 | 2.4%
30.0 | 7 | 2.4%
40.0 | 6 | 2.1%
Other values (152) | 195 | 67.5%

Minimum 10 values

Value | Count | Frequency (%)
0.5 | 1 | 0.3%
0.9 | 1 | 0.3%
0.99 | 1 | 0.3%
1.0 | 1 | 0.3%
1.2 | 1 | 0.3%
1.8 | 1 | 0.3%
2.0 | 7 | 2.4%
2.82 | 1 | 0.3%
3.0 | 8 | 2.8%
3.5 | 5 | 1.7%

Maximum 10 values

Value | Count | Frequency (%)
40926.72 | 1 | 0.3%
40020.0 | 1 | 0.3%
19880.262 | 1 | 0.3%
16921.89744 | 1 | 0.3%
15858.73 | 1 | 0.3%
15352.16 | 1 | 0.3%
14318.57 | 1 | 0.3%
12231.23 | 1 | 0.3%
12152.29 | 1 | 0.3%
10995.0 | 1 | 0.3%

소재지 (location)
Categorical

Distinct: 2
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

서귀포시 | 167
제주시 | 122

Length

Max length: 4
Median length: 4
Mean length: 3.5778547
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 제주시
2nd row: 서귀포시
3rd row: 제주시
4th row: 서귀포시
5th row: 서귀포시

Common Values

Value | Count | Frequency (%)
서귀포시 | 167 | 57.8%
제주시 | 122 | 42.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed under Common Values]

데이터 기준일 (data reference date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

2021-11-24 | 289

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-11-24
2nd row: 2021-11-24
3rd row: 2021-11-24
4th row: 2021-11-24
5th row: 2021-11-24

Common Values

Value | Count | Frequency (%)
2021-11-24 | 289 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed under Common Values]

Interactions

[Figures: nine interaction plots]

Correlations

[Figure: correlation heatmap]

Variable | 지정 연도 | 품목 | 농가 수 | 재배면적 | 생산계획량(톤) | 소재지
지정 연도 | 1.000 | 0.581 | 0.000 | 0.000 | 0.000 | 0.000
품목 | 0.581 | 1.000 | 0.000 | 0.000 | 0.000 | 0.408
농가 수 | 0.000 | 0.000 | 1.000 | 0.983 | 0.875 | 0.000
재배면적 | 0.000 | 0.000 | 0.983 | 1.000 | 0.915 | 0.114
생산계획량(톤) | 0.000 | 0.000 | 0.875 | 0.915 | 1.000 | 0.098
소재지 | 0.000 | 0.408 | 0.000 | 0.114 | 0.098 | 1.000
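
A matrix like this can be scanned for strongly associated pairs by walking its upper triangle. The sketch below does so with the coefficients shown above. The cutoff of 0.8 is an assumption chosen for illustration and is not taken from the report's configuration, and `strong_pairs` is a hypothetical helper name.

```python
names = ["지정 연도", "품목", "농가 수", "재배면적", "생산계획량(톤)", "소재지"]
matrix = [
    [1.000, 0.581, 0.000, 0.000, 0.000, 0.000],
    [0.581, 1.000, 0.000, 0.000, 0.000, 0.408],
    [0.000, 0.000, 1.000, 0.983, 0.875, 0.000],
    [0.000, 0.000, 0.983, 1.000, 0.915, 0.114],
    [0.000, 0.000, 0.875, 0.915, 1.000, 0.098],
    [0.000, 0.408, 0.000, 0.114, 0.098, 1.000],
]


def strong_pairs(names, matrix, threshold):
    """Return (name_a, name_b, coefficient) for each pair at or above threshold."""
    pairs = []
    for i in range(len(names)):
        # Only the upper triangle is visited because the matrix is symmetric.
        for j in range(i + 1, len(names)):
            if abs(matrix[i][j]) >= threshold:
                pairs.append((names[i], names[j], matrix[i][j]))
    return pairs


THRESHOLD = 0.8  # illustrative cutoff, not the report's setting
pairs = strong_pairs(names, matrix, THRESHOLD)
for a, b, value in pairs:
    print(a, b, value)
```

With this cutoff the only pairs returned are the three combinations of 농가 수, 재배면적 and 생산계획량(톤), the same variables named in the High correlation alerts.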

[Figure: correlation heatmap]

Variable | 지정 연도 | 소재지 | 품목
지정 연도 | 1.000 | 0.000 | 0.333
소재지 | 0.000 | 1.000 | 0.322
품목 | 0.333 | 0.322 | 1.000

[Figure: correlation heatmap]

Variable | 농가 수 | 재배면적 | 생산계획량(톤) | 지정 연도 | 품목 | 소재지
농가 수 | 1.000 | 0.591 | 0.588 | 0.000 | 0.000 | 0.000
재배면적 | 0.591 | 1.000 | 0.830 | 0.000 | 0.000 | 0.083
생산계획량(톤) | 0.588 | 0.830 | 1.000 | 0.000 | 0.000 | 0.069
지정 연도 | 0.000 | 0.000 | 0.000 | 1.000 | 0.333 | 0.000
품목 | 0.000 | 0.000 | 0.000 | 0.333 | 1.000 | 0.322
소재지 | 0.000 | 0.083 | 0.069 | 0.000 | 0.322 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly pick out patterns in data completion.]

Sample

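Row | 지정 연도 | 생산자 | 품목 | 농가 수 | 재배면적 | 생산계획량(톤) | 소재지 | 데이터 기준일
0 | 2021 | 에코밸리농업주식회사 | 당근 | 1 | 24580.0 | 95.0 | 제주시 | 2021-11-24
1 | 2021 | 유계준 | 천혜향 | 1 | 1652.0 | 7.5 | 서귀포시 | 2021-11-24
2 | 2021 | 대영농장 | 감귤 | 1 | 88166.0 | 104.8 | 제주시 | 2021-11-24
3 | 2021 | 아람농원 | 감귤 | 1 | 11578.0 | 40.0 | 서귀포시 | 2021-11-24
4 | 2021 | 이가농원 | 참다래(키위) | 1 | 2970.0 | 10.0 | 서귀포시 | 2021-11-24
5 | 2021 | 중문농협GAP감귤공선회5 | 감귤 | 99 | 688262.5 | 2357.0 | 서귀포시 | 2021-11-24
6 | 2021 | 중문농협GAP감귤공선회4 | 감귤 | 77 | 662331.0 | 2912.8 | 서귀포시 | 2021-11-24
7 | 2021 | 중문농협GAP감귤공선회3 | 감귤 | 89 | 935888.0 | 2898.99 | 서귀포시 | 2021-11-24
8 | 2021 | 종남농장 | 한라봉 | 1 | 8915.0 | 32.5 | 서귀포시 | 2021-11-24
9 | 2021 | 아띠4s감귤농장 | 감귤 | 1 | 3140.0 | 9.0 | 서귀포시 | 2021-11-24

Row | 지정 연도 | 생산자 | 품목 | 농가 수 | 재배면적 | 생산계획량(톤) | 소재지 | 데이터 기준일
279 | 2019 | 들녘 | 딸기 | 1 | 6022.0 | 11.0 | 서귀포시 | 2021-11-24
280 | 2019 | 양옥림 | 감귤 | 1 | 1709.0 | 1.0 | 제주시 | 2021-11-24
281 | 2019 | 제주거북농산영농조합법인 |  | 1 | 1754826.0 | 10995.0 | 서귀포시 | 2021-11-24
282 | 2019 | 현창호 | 레드향 | 1 | 4191.0 | 10.0 | 서귀포시 | 2021-11-24
283 | 2019 | 말선명품제주농원 | 감귤 | 1 | 4628.0 | 23.0 | 서귀포시 | 2021-11-24
284 | 2019 | 송석희 | 딸기 | 1 | 2310.0 | 8.0 | 서귀포시 | 2021-11-24
285 | 2019 | 철이네열린농장 | 감귤 | 1 | 20127.0 | 80.0 | 제주시 | 2021-11-24
286 | 2019 | 김원식 | 딸기 | 1 | 2607.0 | 7.5 | 제주시 | 2021-11-24
287 | 2019 | 남원농협레드향공선회 | 레드향 | 30 | 123961.1 | 559.32 | 서귀포시 | 2021-11-24
288 | 2019 | 송창섭 | 양배추 | 1 | 35388.0 | 310.0 | 서귀포시 | 2021-11-24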