Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 576.2 KiB
Average record size in memory: 59.0 B
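
The two memory figures above are related by a simple division. The sketch below assumes that 1 KiB is 1024 bytes and that the average record size is the total size divided by the number of observations. The function name is illustrative and is not part of ydata-profiling.

```python
# Values copied from the dataset statistics above.
TOTAL_SIZE_KIB = 576.2
N_OBSERVATIONS = 10_000
BYTES_PER_KIB = 1024  # assumption: binary kibibytes


def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Return the average number of bytes held in memory per row."""
    return total_kib * BYTES_PER_KIB / n_rows


avg_bytes = average_record_size_bytes(TOTAL_SIZE_KIB, N_OBSERVATIONS)
print(f"{avg_bytes:.1f} B")
```

Under these assumptions the result rounds to the 59.0 B reported above.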

Variable types

Numeric: 2
Categorical: 2
Text: 2

Dataset

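Description: Spatial information from a survey of cultivation status. The data are built from drone field measurements to lay the foundation for smart agricultural administration and to prepare the basis for accumulating and disseminating agricultural data.
Author: 경상북도 (Gyeongsangbuk-do)
URL: https://www.data.go.kr/data/15098002/fileData.do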

Alerts

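지번코드 (lot number code) has constant value "4720000000000000000" [Constant]
재배작물 (cultivated crop) is highly imbalanced (53.5%) [Imbalance]
팜맵관리번호 (farm map management number) has unique values [Unique]
면적(제곱미터) (area in square meters) has unique values [Unique]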

Reproduction

Analysis started: 2023-12-12 03:35:24.743536
Analysis finished: 2023-12-12 03:35:26.262278
Duration: 1.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

팜맵관리번호
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5390670.3
Minimum: 4875951
Maximum: 13088379
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4875951
5th percentile: 4881144.7
Q1: 4903012.5
Median: 4929500
Q3: 4957429.2
95th percentile: 13083224
Maximum: 13088379
Range: 8212428
Interquartile range (IQR): 54416.75

Descriptive statistics

Standard deviation: 1887820
Coefficient of variation (CV): 0.35020134
Kurtosis: 12.637598
Mean: 5390670.3
Median Absolute Deviation (MAD): 27077.5
Skewness: 3.8249042
Sum: 5.3906703 × 10^10
Variance: 3.5638642 × 10^12
Monotonicity: Not monotonic
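
Several of the statistics above can be derived from others, which gives a quick consistency check. The sketch below recomputes the range, IQR, coefficient of variation, and variance from the rounded values printed in this report. It assumes the usual textbook definitions, and small differences from the reported figures are expected because the inputs are already rounded.

```python
# Reported values for 팜맵관리번호, copied from the tables above.
minimum, maximum = 4875951, 13088379
q1, q3 = 4903012.5, 4957429.2
mean, std = 5390670.3, 1887820

value_range = maximum - minimum  # distance between the extremes
iqr = q3 - q1                    # spread of the middle 50% of values
cv = std / mean                  # dispersion relative to the mean
variance = std ** 2              # variance is the squared standard deviation

print(f"range    = {value_range}")
print(f"IQR      = {iqr:.2f}")
print(f"CV       = {cv:.8f}")
print(f"variance = {variance:.7e}")
```

The mean (5390670.3) lies far above the median (4929500), and the 95th percentile (13083224) sits close to the maximum, which is consistent with the strong positive skewness reported for this identifier column.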
Histogram with fixed size bins (bins=50)
Common Values

Value | Count | Frequency (%)
4967794 | 1 | < 0.1%
4933242 | 1 | < 0.1%
4942125 | 1 | < 0.1%
13084135 | 1 | < 0.1%
4926628 | 1 | < 0.1%
4917836 | 1 | < 0.1%
4902868 | 1 | < 0.1%
4898721 | 1 | < 0.1%
4881709 | 1 | < 0.1%
4968490 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
4875951 | 1 | < 0.1%
4875954 | 1 | < 0.1%
4875955 | 1 | < 0.1%
4875956 | 1 | < 0.1%
4875957 | 1 | < 0.1%
4875977 | 1 | < 0.1%
4876000 | 1 | < 0.1%
4876003 | 1 | < 0.1%
4876056 | 1 | < 0.1%
4876070 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
13088379 | 1 | < 0.1%
13088349 | 1 | < 0.1%
13088347 | 1 | < 0.1%
13088342 | 1 | < 0.1%
13088327 | 1 | < 0.1%
13088323 | 1 | < 0.1%
13088308 | 1 | < 0.1%
13088292 | 1 | < 0.1%
13088286 | 1 | < 0.1%
13088276 | 1 | < 0.1%

면적(제곱미터)
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1281.4706
Minimum: 6.0080547
Maximum: 30096.45
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 6.0080547
5th percentile: 92.408005
Q1: 428.813
Median: 965.74505
Q3: 1772.5745
95th percentile: 3398.7946
Maximum: 30096.45
Range: 30090.442
Interquartile range (IQR): 1343.7615

Descriptive statistics

Standard deviation: 1362.2872
Coefficient of variation (CV): 1.0630655
Kurtosis: 48.130905
Mean: 1281.4706
Median Absolute Deviation (MAD): 624.09946
Skewness: 4.6308392
Sum: 12814706
Variance: 1855826.4
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common Values

Value | Count | Frequency (%)
623.4074881 | 1 | < 0.1%
2002.201641 | 1 | < 0.1%
1484.61627 | 1 | < 0.1%
49.08761251 | 1 | < 0.1%
2023.32045 | 1 | < 0.1%
1198.460914 | 1 | < 0.1%
262.1918685 | 1 | < 0.1%
1110.050629 | 1 | < 0.1%
2401.158853 | 1 | < 0.1%
523.1161889 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
6.0080547 | 1 | < 0.1%
7.063423461 | 1 | < 0.1%
7.757123757 | 1 | < 0.1%
8.62630187 | 1 | < 0.1%
8.667545502 | 1 | < 0.1%
8.70693469 | 1 | < 0.1%
9.696999259 | 1 | < 0.1%
10.28222071 | 1 | < 0.1%
10.39487165 | 1 | < 0.1%
10.45116736 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
30096.45026 | 1 | < 0.1%
20499.78917 | 1 | < 0.1%
20428.20003 | 1 | < 0.1%
19788.9679 | 1 | < 0.1%
17195.05687 | 1 | < 0.1%
16909.47297 | 1 | < 0.1%
15973.8511 | 1 | < 0.1%
15972.12996 | 1 | < 0.1%
15793.65504 | 1 | < 0.1%
13696.97955 | 1 | < 0.1%

지번코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
4720000000000000000 | 10000

Length

Max length: 19
Median length: 19
Mean length: 19
Min length: 19

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 4720000000000000000
2nd row: 4720000000000000000
3rd row: 4720000000000000000
4th row: 4720000000000000000
5th row: 4720000000000000000

Common Values

Value | Count | Frequency (%)
4720000000000000000 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)


동리
Text

Distinct: 208
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 16
Median length: 16
Mean length: 15.5648
Min length: 12

Characters and Unicode

Total characters: 155648
Distinct characters: 132
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경상북도 영천시 괴연동
2nd row: 경상북도 영천시 금호읍 냉천리
3rd row: 경상북도 영천시 대창면 구지리
4th row: 경상북도 영천시 고경면 부리
5th row: 경상북도 영천시 화산면 당곡리

Value | Count | Frequency (%)
경상북도 | 10000 | 25.8%
영천시 | 10000 | 25.8%
고경면 | 1170 | 3.0%
금호읍 | 1058 | 2.7%
북안면 | 943 | 2.4%
임고면 | 935 | 2.4%
청통면 | 802 | 2.1%
화산면 | 762 | 2.0%
신녕면 | 732 | 1.9%
화남면 | 731 | 1.9%
Other values (204) | 11595 | 29.9%
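
The word counts above are consistent with splitting each address on whitespace and counting the resulting tokens: 경상북도 appears once in each of the 10000 rows, and 10000 out of the 38728 words listed is 25.8%. The sketch below applies that idea to the five sample rows only. Whitespace tokenization is an assumption about how the table was produced, and `word_frequencies` is a hypothetical helper name.

```python
from collections import Counter

# The five sample rows of 동리 shown above.
sample_rows = [
    "경상북도 영천시 괴연동",
    "경상북도 영천시 금호읍 냉천리",
    "경상북도 영천시 대창면 구지리",
    "경상북도 영천시 고경면 부리",
    "경상북도 영천시 화산면 당곡리",
]


def word_frequencies(rows):
    """Split rows on whitespace and return (word, count, percent) tuples."""
    counts = Counter(word for row in rows for word in row.split())
    total = sum(counts.values())
    return [(word, n, 100 * n / total) for word, n in counts.most_common()]


table = word_frequencies(sample_rows)
for word, n, pct in table:
    print(f"{word} | {n} | {pct:.1f}%")
```

On this small sample the province and city names each account for 5 of 19 words, a share close to the 25.8% reported for the full column.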

Most occurring characters

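Value | Count | Frequency (%)
(space) | 30000 | 19.3%
(not captured) | 11501 | 7.4%
(not captured) | 11170 | 7.2%
(not captured) | 11023 | 7.1%
(not captured) | 10403 | 6.7%
(not captured) | 10297 | 6.6%
(not captured) | 10000 | 6.4%
(not captured) | 10000 | 6.4%
(not captured) | 8876 | 5.7%
(not captured) | 7670 | 4.9%
Other values (122) | 34708 | 22.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 125648 | 80.7%
Space Separator | 30000 | 19.3%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 11501 | 9.2%
(not captured) | 11170 | 8.9%
(not captured) | 11023 | 8.8%
(not captured) | 10403 | 8.3%
(not captured) | 10297 | 8.2%
(not captured) | 10000 | 8.0%
(not captured) | 10000 | 8.0%
(not captured) | 8876 | 7.1%
(not captured) | 7670 | 6.1%
(not captured) | 2377 | 1.9%
Other values (121) | 32331 | 25.7%

Space Separator
Value | Count | Frequency (%)
(space) | 30000 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 125648 | 80.7%
Common | 30000 | 19.3%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not captured) | 11501 | 9.2%
(not captured) | 11170 | 8.9%
(not captured) | 11023 | 8.8%
(not captured) | 10403 | 8.3%
(not captured) | 10297 | 8.2%
(not captured) | 10000 | 8.0%
(not captured) | 10000 | 8.0%
(not captured) | 8876 | 7.1%
(not captured) | 7670 | 6.1%
(not captured) | 2377 | 1.9%
Other values (121) | 32331 | 25.7%

Common
Value | Count | Frequency (%)
(space) | 30000 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 125648 | 80.7%
ASCII | 30000 | 19.3%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 30000 | 100.0%

Hangul
Value | Count | Frequency (%)
(not captured) | 11501 | 9.2%
(not captured) | 11170 | 8.9%
(not captured) | 11023 | 8.8%
(not captured) | 10403 | 8.3%
(not captured) | 10297 | 8.2%
(not captured) | 10000 | 8.0%
(not captured) | 10000 | 8.0%
(not captured) | 8876 | 7.1%
(not captured) | 7670 | 6.1%
(not captured) | 2377 | 1.9%
Other values (121) | 32331 | 25.7%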

지번
Text

Distinct: 6715
Distinct (%): 67.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 9
Median length: 8
Mean length: 5.024
Min length: 2

Characters and Unicode

Total characters: 50240
Distinct characters: 34
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
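
As a small illustration of those character properties, the sketch below uses Python's standard `unicodedata` module to tally the Unicode general category of every character in the five sample rows of 지번. The mapping from two-letter codes to the category names used in this report follows the Unicode Standard. The helper name `category_counts` is illustrative only.

```python
import unicodedata
from collections import Counter

# The five sample rows of 지번 listed in this report.
sample_rows = ["621답", "423-3답", "81전", "42답", "423 답"]

# Two-letter general category codes and the names used in this report.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Lo": "Other Letter",
    "Pd": "Dash Punctuation",
    "Zs": "Space Separator",
}


def category_counts(rows):
    """Count characters in all rows by Unicode general category code."""
    return Counter(unicodedata.category(ch) for row in rows for ch in row)


counts = category_counts(sample_rows)
for code, n in counts.most_common():
    print(f"{CATEGORY_NAMES.get(code, code)} | {n}")
```

The same four categories appear here as in the full column: digits dominate, followed by the Hangul land-category letter, the dash in sub-lot numbers, and an occasional space.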

Unique

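Unique: 5012
Unique (%): 50.1%

Sample

1st row: 621답
2nd row: 423-3답
3rd row: 81전
4th row: 42답
5th row: 423 답

Value | Count | Frequency (%)
(not captured) | 469 | 4.2%
(not captured) | 321 | 2.8%
(not captured) | 265 | 2.3%
(not captured) | 72 | 0.6%
(not captured) | 64 | 0.6%
(not captured) | 25 | 0.2%
(not captured) | 17 | 0.2%
121답 | 12 | 0.1%
(not captured) | 11 | 0.1%
427답 | 10 | 0.1%
Other values (6651) | 10016 | 88.8%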

Most occurring characters

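Value | Count | Frequency (%)
1 | 6475 | 12.9%
(not captured) | 5864 | 11.7%
2 | 4378 | 8.7%
- | 4221 | 8.4%
3 | 3850 | 7.7%
4 | 3419 | 6.8%
5 | 3145 | 6.3%
6 | 2937 | 5.8%
7 | 2725 | 5.4%
8 | 2637 | 5.2%
Other values (24) | 10589 | 21.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 34370 | 68.4%
Other Letter | 10367 | 20.6%
Dash Punctuation | 4221 | 8.4%
Space Separator | 1282 | 2.6%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 5864 | 56.6%
(not captured) | 2298 | 22.2%
(not captured) | 627 | 6.0%
(not captured) | 447 | 4.3%
(not captured) | 398 | 3.8%
(not captured) | 367 | 3.5%
(not captured) | 113 | 1.1%
(not captured) | 54 | 0.5%
(not captured) | 54 | 0.5%
(not captured) | 44 | 0.4%
Other values (12) | 101 | 1.0%

Decimal Number
Value | Count | Frequency (%)
1 | 6475 | 18.8%
2 | 4378 | 12.7%
3 | 3850 | 11.2%
4 | 3419 | 9.9%
5 | 3145 | 9.2%
6 | 2937 | 8.5%
7 | 2725 | 7.9%
8 | 2637 | 7.7%
9 | 2442 | 7.1%
0 | 2362 | 6.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 4221 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1282 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 39873 | 79.4%
Hangul | 10367 | 20.6%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not captured) | 5864 | 56.6%
(not captured) | 2298 | 22.2%
(not captured) | 627 | 6.0%
(not captured) | 447 | 4.3%
(not captured) | 398 | 3.8%
(not captured) | 367 | 3.5%
(not captured) | 113 | 1.1%
(not captured) | 54 | 0.5%
(not captured) | 54 | 0.5%
(not captured) | 44 | 0.4%
Other values (12) | 101 | 1.0%

Common
Value | Count | Frequency (%)
1 | 6475 | 16.2%
2 | 4378 | 11.0%
- | 4221 | 10.6%
3 | 3850 | 9.7%
4 | 3419 | 8.6%
5 | 3145 | 7.9%
6 | 2937 | 7.4%
7 | 2725 | 6.8%
8 | 2637 | 6.6%
9 | 2442 | 6.1%
Other values (2) | 3644 | 9.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 39873 | 79.4%
Hangul | 10367 | 20.6%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 6475 | 16.2%
2 | 4378 | 11.0%
- | 4221 | 10.6%
3 | 3850 | 9.7%
4 | 3419 | 8.6%
5 | 3145 | 7.9%
6 | 2937 | 7.4%
7 | 2725 | 6.8%
8 | 2637 | 6.6%
9 | 2442 | 6.1%
Other values (2) | 3644 | 9.1%

Hangul
Value | Count | Frequency (%)
(not captured) | 5864 | 56.6%
(not captured) | 2298 | 22.2%
(not captured) | 627 | 6.0%
(not captured) | 447 | 4.3%
(not captured) | 398 | 3.8%
(not captured) | 367 | 3.5%
(not captured) | 113 | 1.1%
(not captured) | 54 | 0.5%
(not captured) | 54 | 0.5%
(not captured) | 44 | 0.4%
Other values (12) | 101 | 1.0%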

재배작물
Categorical

IMBALANCE 

Distinct: 14
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
비경지 | 6520
논_벼 | 1237
휴경지 | 1134
시설 | 325
과수_기타 | 314
Other values (9) | 470

Length

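Max length: 5
Median length: 3
Mean length: 3.0178
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 비경지
2nd row: 비경지
3rd row: 비경지
4th row: 논_벼
5th row: 비경지

Common Values

Value | Count | Frequency (%)
비경지 (non-cultivated land) | 6520 | 65.2%
논_벼 (paddy rice) | 1237 | 12.4%
휴경지 (fallow land) | 1134 | 11.3%
시설 (facility) | 325 | 3.2%
과수_기타 (other fruit trees) | 314 | 3.1%
마늘 (garlic) | 161 | 1.6%
기타경작지 (other cultivated land) | 103 | 1.0%
포도 (grape) | 87 | 0.9%
사과 (apple) | 58 | 0.6%
복숭아 (peach) | 39 | 0.4%
Other values (4) | 22 | 0.2%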
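
One way to read the 53.5% imbalance figure for this variable is as one minus the normalized Shannon entropy of the class counts: 0 for perfectly balanced classes and 1 when a single class holds every row. The sketch below assumes that definition rather than quoting the profiler's source. The report does not list the counts of the four smallest categories, only their total of 22, so the split 6, 6, 5, 5 is a placeholder assumption. `imbalance_score` is an illustrative name.

```python
import math

# Counts of the ten categories listed under Common Values.
counts = [6520, 1237, 1134, 325, 314, 161, 103, 87, 58, 39]
# ASSUMPTION: the remaining four categories total 22 rows; their
# individual counts are not shown, so an even-ish split is used here.
counts += [6, 6, 5, 5]


def imbalance_score(class_counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the
    class proportions and k is the number of classes."""
    k = len(class_counts)
    if k < 2:
        return 0.0
    total = sum(class_counts)
    entropy = -sum(
        (c / total) * math.log(c / total) for c in class_counts if c > 0
    )
    return 1 - entropy / math.log(k)


score = imbalance_score(counts)
print(f"imbalance = {score:.1%}")
```

With these inputs the score lands close to the 53.5% shown in the alert, and other ways of splitting the 22 remaining rows shift it only slightly.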

Length

Histogram of lengths of the category

Interactions


Correlations

Variable | 팜맵관리번호 | 면적(제곱미터) | 재배작물
팜맵관리번호 | 1.000 | 0.066 | 0.084
면적(제곱미터) | 0.066 | 1.000 | 0.157
재배작물 | 0.084 | 0.157 | 1.000

Variable | 팜맵관리번호 | 면적(제곱미터) | 재배작물
팜맵관리번호 | 1.000 | -0.065 | 0.065
면적(제곱미터) | -0.065 | 1.000 | 0.069
재배작물 | 0.065 | 0.069 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 팜맵관리번호 | 면적(제곱미터) | 지번코드 | 동리 | 지번 | 재배작물
6199 | 4967794 | 623.407488 | 4720000000000000000 | 경상북도 영천시 괴연동 | 621답 | 비경지
15239 | 4889070 | 726.867367 | 4720000000000000000 | 경상북도 영천시 금호읍 냉천리 | 423-3답 | 비경지
95358 | 4974896 | 4081.909105 | 4720000000000000000 | 경상북도 영천시 대창면 구지리 | 81전 | 비경지
72685 | 4931477 | 286.797835 | 4720000000000000000 | 경상북도 영천시 고경면 부리 | 42답 | 논_벼
43532 | 4879221 | 715.994196 | 4720000000000000000 | 경상북도 영천시 화산면 당곡리 | 423 답 | 비경지
8874 | 4928306 | 1984.179122 | 4720000000000000000 | 경상북도 영천시 오미동 | 1423답 | 비경지
79133 | 4940409 | 119.978917 | 4720000000000000000 | 경상북도 영천시 고경면 오룡리 | 200대 | 휴경지
49784 | 4923430 | 2831.852131 | 4720000000000000000 | 경상북도 영천시 화북면 오산리 | 1345답 | 비경지
75215 | 4934697 | 61.180098 | 4720000000000000000 | 경상북도 영천시 고경면 해선리 | 281대 | 휴경지
33 | 4923708 | 1139.274587 | 4720000000000000000 | 경상북도 영천시 조교동 | 48-1 과 | 비경지

Index | 팜맵관리번호 | 면적(제곱미터) | 지번코드 | 동리 | 지번 | 재배작물
24634 | 4884804 | 946.981996 | 4720000000000000000 | 경상북도 영천시 청통면 신덕리 | 315답 | 논_벼
11395 | 4925027 | 320.828399 | 4720000000000000000 | 경상북도 영천시 언하동 | 624-3답 | 비경지
2114 | 4928198 | 1095.062506 | 4720000000000000000 | 경상북도 영천시 화룡동 | 677-2답 | 비경지
11498 | 4925528 | 565.600128 | 4720000000000000000 | 경상북도 영천시 언하동 | 733-80 천 | 비경지
76239 | 4935043 | 9444.337567 | 4720000000000000000 | 경상북도 영천시 고경면 동도리 | 566-4과 | 비경지
9182 | 4899856 | 2101.621736 | 4720000000000000000 | 경상북도 영천시 오수동 | 52-1답 | 비경지
40204 | 4900605 | 1849.412565 | 4720000000000000000 | 경상북도 영천시 화산면 효정리 | 1144답 | 비경지
64858 | 13085665 | 1800.520636 | 4720000000000000000 | 경상북도 영천시 임고면 효리 | 1144-5과 | 과수_기타
77156 | 4931510 | 111.63655 | 4720000000000000000 | 경상북도 영천시 고경면 삼귀리 | 산13-1임 | 휴경지
6607 | 13086837 | 7477.03927 | 4720000000000000000 | 경상북도 영천시 대전동 | 709과 | 비경지