Overview

Dataset statistics

Number of variables10
Number of observations6361
Missing cells203
Missing cells (%)0.3%
Duplicate rows305
Duplicate rows (%)4.8%
Total size in memory528.1 KiB
Average record size in memory85.0 B

Variable types

Categorical6
Numeric3
Text1

Dataset

Description전라북도 임실군에서 소득이 관리되는 농업인(농가)정보 데이터 입니다. 데이터 세부내역에는 관리연도, 시군명, 읍면동명, 농가형태구분, 농업인연령, 농가상태, 기타소득, 주작목, 논면적, 밭면적을 포함하여 제공하고 있습니다.
Author전라북도 임실군
URLhttps://www.data.go.kr/data/15090027/fileData.do

Alerts

관리연도 has constant value ""Constant
시군명 has constant value ""Constant
기타소득(원) has constant value ""Constant
Dataset has 305 (4.8%) duplicate rowsDuplicates
농가형태구분 is highly imbalanced (79.5%)Imbalance
농가상태 is highly imbalanced (81.9%)Imbalance
농업인연령 has 203 (3.2%) missing valuesMissing
논면적 is highly skewed (γ1 = 79.47207918)Skewed
논면적 has 2684 (42.2%) zerosZeros
밭면적 has 2800 (44.0%) zerosZeros

Reproduction

Analysis started2023-12-12 14:21:35.178689
Analysis finished2023-12-12 14:21:36.814674
Duration1.64 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관리연도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size49.8 KiB
2023
6361 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 6361
100.0%

Length

2023-12-12T23:21:36.872465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:21:36.951502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 6361
100.0%

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size49.8 KiB
임실군
6361 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row임실군
2nd row임실군
3rd row임실군
4th row임실군
5th row임실군

Common Values

ValueCountFrequency (%)
임실군 6361
100.0%

Length

2023-12-12T23:21:37.026360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:21:37.101551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
임실군 6361
100.0%

읍면동명
Categorical

Distinct12
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size49.8 KiB
오수면
877 
관촌면
739 
임실읍
720 
삼계면
607 
운암면
475 
Other values (7)
2943 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row운암면
2nd row신평면
3rd row관촌면
4th row신덕면
5th row신덕면

Common Values

ValueCountFrequency (%)
오수면 877
13.8%
관촌면 739
11.6%
임실읍 720
11.3%
삼계면 607
9.5%
운암면 475
7.5%
강진면 457
7.2%
신덕면 451
7.1%
성수면 417
6.6%
덕치면 414
6.5%
청웅면 412
6.5%
Other values (2) 792
12.5%

Length

2023-12-12T23:21:37.191200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
오수면 877
13.8%
관촌면 739
11.6%
임실읍 720
11.3%
삼계면 607
9.5%
운암면 475
7.5%
강진면 457
7.2%
신덕면 451
7.1%
성수면 417
6.6%
덕치면 414
6.5%
청웅면 412
6.5%
Other values (2) 792
12.5%

농가형태구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size49.8 KiB
개인
6157 
단체
 
204

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row단체
2nd row단체
3rd row개인
4th row개인
5th row개인

Common Values

ValueCountFrequency (%)
개인 6157
96.8%
단체 204
 
3.2%

Length

2023-12-12T23:21:37.287516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:21:37.375260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 6157
96.8%
단체 204
 
3.2%

농업인연령
Real number (ℝ)

MISSING 

Distinct74
Distinct (%)1.2%
Missing203
Missing (%)3.2%
Infinite0
Infinite (%)0.0%
Mean68.190971
Minimum22
Maximum101
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size56.0 KiB
2023-12-12T23:21:37.498178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum22
5-th percentile47
Q160
median69
Q378
95-th percentile86
Maximum101
Range79
Interquartile range (IQR)18

Descriptive statistics

Standard deviation12.345846
Coefficient of variation (CV)0.1810481
Kurtosis-0.22105672
Mean68.190971
Median Absolute Deviation (MAD)9
Skewness-0.43222425
Sum419920
Variance152.41992
MonotonicityNot monotonic
2023-12-12T23:21:37.641278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
78 281
 
4.4%
73 221
 
3.5%
70 189
 
3.0%
79 185
 
2.9%
81 180
 
2.8%
72 177
 
2.8%
65 175
 
2.8%
61 172
 
2.7%
71 170
 
2.7%
82 167
 
2.6%
Other values (64) 4241
66.7%
(Missing) 203
 
3.2%
ValueCountFrequency (%)
22 1
 
< 0.1%
24 1
 
< 0.1%
25 5
0.1%
28 3
 
< 0.1%
29 3
 
< 0.1%
30 2
 
< 0.1%
31 6
0.1%
32 5
0.1%
33 10
0.2%
34 8
0.1%
ValueCountFrequency (%)
101 1
 
< 0.1%
97 1
 
< 0.1%
96 2
 
< 0.1%
95 2
 
< 0.1%
94 9
 
0.1%
93 8
 
0.1%
92 15
 
0.2%
91 23
0.4%
90 28
0.4%
89 42
0.7%

농가상태
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size49.8 KiB
관리농가
6074 
비관리농가
 
257
달성농가
 
30

Length

Max length5
Median length4
Mean length4.0404025
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row관리농가
2nd row관리농가
3rd row관리농가
4th row관리농가
5th row관리농가

Common Values

ValueCountFrequency (%)
관리농가 6074
95.5%
비관리농가 257
 
4.0%
달성농가 30
 
0.5%

Length

2023-12-12T23:21:37.805474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:21:37.955523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
관리농가 6074
95.5%
비관리농가 257
 
4.0%
달성농가 30
 
0.5%

기타소득(원)
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size49.8 KiB
0
6361 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 6361
100.0%

Length

2023-12-12T23:21:38.070551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:21:38.256840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 6361
100.0%
Distinct143
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size49.8 KiB
2023-12-12T23:21:38.496089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length1
Mean length2.3013677
Min length1

Characters and Unicode

Total characters14639
Distinct characters172
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)0.5%

Sample

1st row
2nd row고구마
3rd row일반미(쌀 일반)
4th row노지고추
5th row번식우
ValueCountFrequency (%)
3536
50.9%
일반미(쌀 576
 
8.3%
일반 576
 
8.3%
253
 
3.6%
노지고추 234
 
3.4%
고추 175
 
2.5%
번식우 139
 
2.0%
복숭아 107
 
1.5%
잡곡 104
 
1.5%
들깨 85
 
1.2%
Other values (136) 1156
 
16.7%
2023-12-12T23:21:38.897700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4113
28.1%
1178
 
8.0%
1173
 
8.0%
( 623
 
4.3%
) 623
 
4.3%
596
 
4.1%
580
 
4.0%
524
 
3.6%
476
 
3.3%
256
 
1.7%
Other values (162) 4497
30.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12813
87.5%
Open Punctuation 623
 
4.3%
Close Punctuation 623
 
4.3%
Space Separator 580
 
4.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4113
32.1%
1178
 
9.2%
1173
 
9.2%
596
 
4.7%
524
 
4.1%
476
 
3.7%
256
 
2.0%
254
 
2.0%
238
 
1.9%
226
 
1.8%
Other values (159) 3779
29.5%
Open Punctuation
ValueCountFrequency (%)
( 623
100.0%
Close Punctuation
ValueCountFrequency (%)
) 623
100.0%
Space Separator
ValueCountFrequency (%)
580
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12813
87.5%
Common 1826
 
12.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4113
32.1%
1178
 
9.2%
1173
 
9.2%
596
 
4.7%
524
 
4.1%
476
 
3.7%
256
 
2.0%
254
 
2.0%
238
 
1.9%
226
 
1.8%
Other values (159) 3779
29.5%
Common
ValueCountFrequency (%)
( 623
34.1%
) 623
34.1%
580
31.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12813
87.5%
ASCII 1826
 
12.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4113
32.1%
1178
 
9.2%
1173
 
9.2%
596
 
4.7%
524
 
4.1%
476
 
3.7%
256
 
2.0%
254
 
2.0%
238
 
1.9%
226
 
1.8%
Other values (159) 3779
29.5%
ASCII
ValueCountFrequency (%)
( 623
34.1%
) 623
34.1%
580
31.8%

논면적
Real number (ℝ)

SKEWED  ZEROS 

Distinct3445
Distinct (%)54.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13426.7
Minimum0
Maximum39121084
Zeros2684
Zeros (%)42.2%
Negative0
Negative (%)0.0%
Memory size56.0 KiB
2023-12-12T23:21:39.050143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2043
Q38148
95-th percentile31937
Maximum39121084
Range39121084
Interquartile range (IQR)8148

Descriptive statistics

Standard deviation491012.83
Coefficient of variation (CV)36.569883
Kurtosis6330.2909
Mean13426.7
Median Absolute Deviation (MAD)2043
Skewness79.472079
Sum85407236
Variance2.410936 × 1011
MonotonicityNot monotonic
2023-12-12T23:21:39.532587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 2684
42.2%
1613.0 5
 
0.1%
2000.0 4
 
0.1%
4035.0 4
 
0.1%
1269.0 4
 
0.1%
3891.0 3
 
< 0.1%
3506.0 3
 
< 0.1%
6807.0 3
 
< 0.1%
2214.0 3
 
< 0.1%
2003.0 3
 
< 0.1%
Other values (3435) 3645
57.3%
ValueCountFrequency (%)
0.0 2684
42.2%
4.0 1
 
< 0.1%
53.0 1
 
< 0.1%
66.0 1
 
< 0.1%
69.0 1
 
< 0.1%
73.0 1
 
< 0.1%
82.0 1
 
< 0.1%
101.0 1
 
< 0.1%
106.0 1
 
< 0.1%
109.0 1
 
< 0.1%
ValueCountFrequency (%)
39121084.0 1
< 0.1%
1573923.0 1
< 0.1%
397700.0 1
< 0.1%
171025.1 1
< 0.1%
157123.0 1
< 0.1%
134648.7 1
< 0.1%
133822.2 1
< 0.1%
133441.5 1
< 0.1%
132493.9 1
< 0.1%
118409.9 1
< 0.1%

밭면적
Real number (ℝ)

ZEROS 

Distinct2816
Distinct (%)44.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2708.2879
Minimum0
Maximum200178
Zeros2800
Zeros (%)44.0%
Negative0
Negative (%)0.0%
Memory size56.0 KiB
2023-12-12T23:21:39.699156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median680
Q33056
95-th percentile11203
Maximum200178
Range200178
Interquartile range (IQR)3056

Descriptive statistics

Standard deviation6432.098
Coefficient of variation (CV)2.3749684
Kurtosis294.70723
Mean2708.2879
Median Absolute Deviation (MAD)680
Skewness12.171401
Sum17227419
Variance41371885
MonotonicityNot monotonic
2023-12-12T23:21:39.869061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 2800
44.0%
1000.0 13
 
0.2%
1200.0 9
 
0.1%
1320.0 7
 
0.1%
1650.0 7
 
0.1%
1100.0 6
 
0.1%
1365.0 5
 
0.1%
1094.0 5
 
0.1%
10.0 5
 
0.1%
1557.0 5
 
0.1%
Other values (2806) 3499
55.0%
ValueCountFrequency (%)
0.0 2800
44.0%
1.0 1
 
< 0.1%
2.0 2
 
< 0.1%
3.0 1
 
< 0.1%
5.0 1
 
< 0.1%
10.0 5
 
0.1%
11.0 1
 
< 0.1%
20.0 4
 
0.1%
21.0 1
 
< 0.1%
23.0 1
 
< 0.1%
ValueCountFrequency (%)
200178.0 1
< 0.1%
196400.2 1
< 0.1%
100389.7 1
< 0.1%
83758.0 1
< 0.1%
75223.0 1
< 0.1%
68284.0 1
< 0.1%
66934.0 1
< 0.1%
60949.0 1
< 0.1%
60930.4 1
< 0.1%
58457.7 1
< 0.1%

Interactions

2023-12-12T23:21:36.343720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:21:35.726982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:21:36.046314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:21:36.438214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:21:35.848584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:21:36.161825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:21:36.526250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:21:35.948151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:21:36.253228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:21:39.987824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
읍면동명농가형태구분농업인연령농가상태논면적밭면적
읍면동명1.0000.1190.0950.4760.0330.060
농가형태구분0.1191.0000.0000.2160.0000.000
농업인연령0.0950.0001.0000.0530.0000.000
농가상태0.4760.2160.0531.0000.0000.000
논면적0.0330.0000.0000.0001.0000.000
밭면적0.0600.0000.0000.0000.0001.000
2023-12-12T23:21:40.113909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
농가상태농가형태구분읍면동명
농가상태1.0000.3540.245
농가형태구분0.3541.0000.092
읍면동명0.2450.0921.000
2023-12-12T23:21:40.239106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
농업인연령논면적밭면적읍면동명농가형태구분농가상태
농업인연령1.0000.1050.1360.0400.0000.031
논면적0.1051.0000.4550.0260.0000.000
밭면적0.1360.4551.0000.0290.0000.000
읍면동명0.0400.0260.0291.0000.0920.245
농가형태구분0.0000.0000.0000.0921.0000.354
농가상태0.0310.0000.0000.2450.3541.000

Missing values

2023-12-12T23:21:36.648782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:21:36.761907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관리연도시군명읍면동명농가형태구분농업인연령농가상태기타소득(원)주작목논면적밭면적
02023임실군운암면단체<NA>관리농가00.00.0
12023임실군신평면단체<NA>관리농가0고구마0.00.0
22023임실군관촌면개인80관리농가0일반미(쌀 일반)13210.86340.0
32023임실군신덕면개인51관리농가0노지고추25284.514032.6
42023임실군신덕면개인81관리농가0번식우3352.08799.0
52023임실군임실읍개인76관리농가0번식우11971.93282.0
62023임실군관촌면개인73관리농가0청단풍0.00.0
72023임실군임실읍개인52관리농가0시설고추3856.01709.0
82023임실군임실읍개인65관리농가0젖소0.01265.0
92023임실군임실읍개인78관리농가0일반미(쌀 일반)2833.01230.0
관리연도시군명읍면동명농가형태구분농업인연령농가상태기타소득(원)주작목논면적밭면적
63512023임실군임실읍개인73관리농가03732.01242.0
63522023임실군임실읍개인67관리농가0번식우13306.40.0
63532023임실군삼계면개인78관리농가00.00.0
63542023임실군덕치면단체<NA>비관리농가00.00.0
63552023임실군강진면단체<NA>관리농가00.00.0
63562023임실군강진면단체<NA>관리농가0땅두릅0.00.0
63572023임실군강진면단체<NA>관리농가0복숭아0.00.0
63582023임실군덕치면단체<NA>관리농가0오디0.00.0
63592023임실군덕치면단체<NA>관리농가0(원목)표고버섯0.00.0
63602023임실군오수면개인86관리농가00.00.0

Duplicate rows

Most frequently occurring

관리연도시군명읍면동명농가형태구분농업인연령농가상태기타소득(원)주작목논면적밭면적# duplicates
2222023임실군운암면개인61관리농가00.00.015
2202023임실군운암면개인59관리농가00.00.012
2212023임실군운암면개인60관리농가00.00.012
2602023임실군운암면단체<NA>관리농가00.00.012
1312023임실군성수면단체<NA>비관리농가00.00.010
1472023임실군신덕면개인60관리농가00.00.010
2342023임실군운암면개인69관리농가00.00.010
2402023임실군운암면개인73관리농가00.00.010
2252023임실군운암면개인63관리농가00.00.09
1712023임실군신평면단체<NA>관리농가00.00.08