Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory576.2 KiB
Average record size in memory59.0 B

Variable types

Numeric2
Categorical2
Text2

Dataset

Description드론 실측에 기반한 과학적 자료 구축으로 스마트 농정 실현 토대 마련, 농업 데이터 축적, 확산을 위한 기반 조정, 재배 현황 조사를 통한 공간정보입니다.
Author경상북도
URLhttps://www.data.go.kr/data/15098004/fileData.do

Alerts

재배작물 is highly imbalanced (64.1%)Imbalance
팜맵관리번호 has unique valuesUnique
면적(제곱미터) has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:24:15.836369
Analysis finished2023-12-12 21:24:16.838502
Duration1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

팜맵관리번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6313852.1
Minimum5147854
Maximum13105251
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:24:16.911733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5147854
5-th percentile5154687.7
Q15179222.8
median5228776
Q35264946.8
95-th percentile13096747
Maximum13105251
Range7957397
Interquartile range (IQR)85724

Descriptive statistics

Standard deviation2714930.7
Coefficient of variation (CV)0.42999593
Kurtosis2.2617125
Mean6313852.1
Median Absolute Deviation (MAD)42918.5
Skewness2.0630921
Sum6.3138521 × 1010
Variance7.3708487 × 1012
MonotonicityNot monotonic
2023-12-13T06:24:17.087184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5182625 1
 
< 0.1%
5178624 1
 
< 0.1%
5258102 1
 
< 0.1%
5246417 1
 
< 0.1%
5243839 1
 
< 0.1%
5271950 1
 
< 0.1%
5152168 1
 
< 0.1%
5272962 1
 
< 0.1%
5180251 1
 
< 0.1%
5226505 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
5147854 1
< 0.1%
5147855 1
< 0.1%
5147878 1
< 0.1%
5147882 1
< 0.1%
5147888 1
< 0.1%
5147890 1
< 0.1%
5147929 1
< 0.1%
5147963 1
< 0.1%
5147969 1
< 0.1%
5147974 1
< 0.1%
ValueCountFrequency (%)
13105251 1
< 0.1%
13105246 1
< 0.1%
13105235 1
< 0.1%
13105234 1
< 0.1%
13105229 1
< 0.1%
13105228 1
< 0.1%
13105225 1
< 0.1%
13105219 1
< 0.1%
13105218 1
< 0.1%
13105200 1
< 0.1%

면적(제곱미터)
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1525.6041
Minimum8.3826507
Maximum23431.406
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:24:17.258026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8.3826507
5-th percentile108.90561
Q1460.81789
median1100.6638
Q32034.8819
95-th percentile4293.6582
Maximum23431.406
Range23423.023
Interquartile range (IQR)1574.064

Descriptive statistics

Standard deviation1606.3737
Coefficient of variation (CV)1.0529427
Kurtosis22.245298
Mean1525.6041
Median Absolute Deviation (MAD)740.19265
Skewness3.354467
Sum15256041
Variance2580436.4
MonotonicityNot monotonic
2023-12-13T06:24:17.491152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
140.1271318 1
 
< 0.1%
2081.886055 1
 
< 0.1%
55.53722162 1
 
< 0.1%
997.0183836 1
 
< 0.1%
3266.445254 1
 
< 0.1%
89.09078693 1
 
< 0.1%
805.89566 1
 
< 0.1%
1522.992277 1
 
< 0.1%
250.6026313 1
 
< 0.1%
1714.711068 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
8.3826507 1
< 0.1%
8.740151864 1
< 0.1%
10.6889044 1
< 0.1%
12.59152682 1
< 0.1%
13.16830416 1
< 0.1%
13.54120702 1
< 0.1%
14.22646897 1
< 0.1%
14.94905083 1
< 0.1%
15.68122005 1
< 0.1%
16.04426024 1
< 0.1%
ValueCountFrequency (%)
23431.40607 1
< 0.1%
21969.10792 1
< 0.1%
19922.19847 1
< 0.1%
18991.53118 1
< 0.1%
18939.01574 1
< 0.1%
18661.64383 1
< 0.1%
17022.68398 1
< 0.1%
16627.66885 1
< 0.1%
16053.34437 1
< 0.1%
15972.74989 1
< 0.1%

지번코드
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
4773040000000000000
7357 
4773030000000000000
2643 

Length

Max length19
Median length19
Mean length19
Min length19

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4773040000000000000
2nd row4773040000000000000
3rd row4773040000000000000
4th row4773040000000000000
5th row4773030000000000000

Common Values

ValueCountFrequency (%)
4773040000000000000 7357
73.6%
4773030000000000000 2643
 
26.4%

Length

2023-12-13T06:24:17.618123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:24:17.734014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4773040000000000000 7357
73.6%
4773030000000000000 2643
 
26.4%

동리
Text

Distinct145
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:24:18.095651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length15.9654
Min length15

Characters and Unicode

Total characters159654
Distinct characters130
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상북도 의성군 비안면 장춘리
2nd row경상북도 의성군 단밀면 용곡리
3rd row경상북도 의성군 금성면 대리리
4th row경상북도 의성군 단밀면 속암리
5th row경상북도 의성군 점곡면 동변리
ValueCountFrequency (%)
경상북도 10000
25.0%
의성군 10000
25.0%
금성면 1161
 
2.9%
비안면 990
 
2.5%
봉양면 917
 
2.3%
안계면 857
 
2.1%
단밀면 701
 
1.8%
의성읍 684
 
1.7%
단북면 668
 
1.7%
구천면 629
 
1.6%
Other values (150) 13393
33.5%
2023-12-13T06:24:18.647967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30000
18.8%
12161
 
7.6%
10684
 
6.7%
10668
 
6.7%
10517
 
6.6%
10388
 
6.5%
10112
 
6.3%
10085
 
6.3%
10000
 
6.3%
9316
 
5.8%
Other values (120) 35723
22.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 129654
81.2%
Space Separator 30000
 
18.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12161
 
9.4%
10684
 
8.2%
10668
 
8.2%
10517
 
8.1%
10388
 
8.0%
10112
 
7.8%
10085
 
7.8%
10000
 
7.7%
9316
 
7.2%
2183
 
1.7%
Other values (119) 33540
25.9%
Space Separator
ValueCountFrequency (%)
30000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 129654
81.2%
Common 30000
 
18.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12161
 
9.4%
10684
 
8.2%
10668
 
8.2%
10517
 
8.1%
10388
 
8.0%
10112
 
7.8%
10085
 
7.8%
10000
 
7.7%
9316
 
7.2%
2183
 
1.7%
Other values (119) 33540
25.9%
Common
ValueCountFrequency (%)
30000
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 129654
81.2%
ASCII 30000
 
18.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30000
100.0%
Hangul
ValueCountFrequency (%)
12161
 
9.4%
10684
 
8.2%
10668
 
8.2%
10517
 
8.1%
10388
 
8.0%
10112
 
7.8%
10085
 
7.8%
10000
 
7.7%
9316
 
7.2%
2183
 
1.7%
Other values (119) 33540
25.9%

지번
Text

Distinct7113
Distinct (%)71.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:24:19.039069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length5.1672
Min length2

Characters and Unicode

Total characters51672
Distinct characters33
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5312 ?
Unique (%)53.1%

Sample

1st row산19임
2nd row1016-1전
3rd row575 답
4th row908 답
5th row623-13전
ValueCountFrequency (%)
976
 
8.1%
428
 
3.6%
195
 
1.6%
164
 
1.4%
57
 
0.5%
53
 
0.4%
32
 
0.3%
18
 
0.2%
14
 
0.1%
12
 
0.1%
Other values (6995) 10047
83.8%
2023-12-13T06:24:19.569185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 6646
12.9%
5174
10.0%
2 4422
 
8.6%
- 4371
 
8.5%
3 3597
 
7.0%
4 3352
 
6.5%
5 3240
 
6.3%
6 2989
 
5.8%
7 2812
 
5.4%
2747
 
5.3%
Other values (23) 12322
23.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 34513
66.8%
Other Letter 10791
 
20.9%
Dash Punctuation 4371
 
8.5%
Space Separator 1996
 
3.9%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5174
47.9%
2747
25.5%
945
 
8.8%
792
 
7.3%
381
 
3.5%
309
 
2.9%
138
 
1.3%
80
 
0.7%
57
 
0.5%
56
 
0.5%
Other values (10) 112
 
1.0%
Decimal Number
ValueCountFrequency (%)
1 6646
19.3%
2 4422
12.8%
3 3597
10.4%
4 3352
9.7%
5 3240
9.4%
6 2989
8.7%
7 2812
8.1%
8 2613
 
7.6%
9 2443
 
7.1%
0 2399
 
7.0%
Dash Punctuation
ValueCountFrequency (%)
- 4371
100.0%
Space Separator
ValueCountFrequency (%)
1996
100.0%
Modifier Symbol
ValueCountFrequency (%)
´ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40881
79.1%
Hangul 10791
 
20.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5174
47.9%
2747
25.5%
945
 
8.8%
792
 
7.3%
381
 
3.5%
309
 
2.9%
138
 
1.3%
80
 
0.7%
57
 
0.5%
56
 
0.5%
Other values (10) 112
 
1.0%
Common
ValueCountFrequency (%)
1 6646
16.3%
2 4422
10.8%
- 4371
10.7%
3 3597
8.8%
4 3352
8.2%
5 3240
7.9%
6 2989
7.3%
7 2812
6.9%
8 2613
 
6.4%
9 2443
 
6.0%
Other values (3) 4396
10.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40880
79.1%
Hangul 10791
 
20.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 6646
16.3%
2 4422
10.8%
- 4371
10.7%
3 3597
8.8%
4 3352
8.2%
5 3240
7.9%
6 2989
7.3%
7 2812
6.9%
8 2613
 
6.4%
9 2443
 
6.0%
Other values (2) 4395
10.8%
Hangul
ValueCountFrequency (%)
5174
47.9%
2747
25.5%
945
 
8.8%
792
 
7.3%
381
 
3.5%
309
 
2.9%
138
 
1.3%
80
 
0.7%
57
 
0.5%
56
 
0.5%
Other values (10) 112
 
1.0%
None
ValueCountFrequency (%)
´ 1
100.0%

재배작물
Categorical

IMBALANCE 

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
비경지
7321 
논_벼
1135 
휴경지
971 
시설
 
318
마늘
 
82
Other values (9)
 
173

Length

Max length5
Median length3
Mean length2.9734
Min length1

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row시설
2nd row비경지
3rd row논_벼
4th row비경지
5th row사과

Common Values

ValueCountFrequency (%)
비경지 7321
73.2%
논_벼 1135
 
11.3%
휴경지 971
 
9.7%
시설 318
 
3.2%
마늘 82
 
0.8%
과수_기타 73
 
0.7%
사과 49
 
0.5%
기타경작지 24
 
0.2%
복숭아 18
 
0.2%
묘목 3
 
< 0.1%
Other values (4) 6
 
0.1%

Length

2023-12-13T06:24:20.027478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
비경지 7321
73.2%
논_벼 1135
 
11.3%
휴경지 971
 
9.7%
시설 318
 
3.2%
마늘 82
 
0.8%
과수_기타 73
 
0.7%
사과 49
 
0.5%
기타경작지 24
 
0.2%
복숭아 18
 
0.2%
묘목 3
 
< 0.1%
Other values (4) 6
 
0.1%

Interactions

2023-12-13T06:24:16.455550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:24:16.270179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:24:16.575435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:24:16.371472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:24:20.130260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
팜맵관리번호면적(제곱미터)지번코드재배작물
팜맵관리번호1.0000.1170.0840.137
면적(제곱미터)0.1171.0000.0520.157
지번코드0.0840.0521.0000.263
재배작물0.1370.1570.2631.000
2023-12-13T06:24:20.230523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
재배작물지번코드
재배작물1.0000.206
지번코드0.2061.000
2023-12-13T06:24:20.329964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
팜맵관리번호면적(제곱미터)지번코드재배작물
팜맵관리번호1.0000.0820.0540.107
면적(제곱미터)0.0821.0000.0400.064
지번코드0.0540.0401.0000.206
재배작물0.1070.0640.2061.000

Missing values

2023-12-13T06:24:16.674983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:24:16.777222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

팜맵관리번호면적(제곱미터)지번코드동리지번재배작물
651095182625140.1271324773040000000000000경상북도 의성군 비안면 장춘리산19임시설
7237052629062166.8559144773040000000000000경상북도 의성군 단밀면 용곡리1016-1전비경지
3624851662571201.5726264773040000000000000경상북도 의성군 금성면 대리리575 답논_벼
7211452601631750.2338034773040000000000000경상북도 의성군 단밀면 속암리908 답비경지
1399252273951673.7170134773030000000000000경상북도 의성군 점곡면 동변리623-13전사과
8850052424053725.4392524773040000000000000경상북도 의성군 안계면 위양리823-1답논_벼
912015247253857.921634773040000000000000경상북도 의성군 안계면 도덕리320답논_벼
1233652286493110.0993444773030000000000000경상북도 의성군 점곡면 서변리49-2전비경지
664275267543832.8524444773040000000000000경상북도 의성군 구천면 모흥리162답논_벼
240065152643840.2314394773030000000000000경상북도 의성군 사곡면 화전리953-1답비경지
팜맵관리번호면적(제곱미터)지번코드동리지번재배작물
7416152656511300.6909994773040000000000000경상북도 의성군 단밀면 위중리1481-2 답비경지
4557051614303181.6220714773040000000000000경상북도 의성군 금성면 수정리160답논_벼
9695352906002915.007944773040000000000000경상북도 의성군 다인면 가원리625-5답논_벼
2897551567931342.2008144773040000000000000경상북도 의성군 춘산면 빙계리1017-60과과수_기타
8177913102754494.9762764773040000000000000경상북도 의성군 단북면 연제리15 도비경지
9770130949651400.5858314773030000000000000경상북도 의성군 단촌면 병방리808전비경지
2157551525752183.4279714773030000000000000경상북도 의성군 사곡면 음지리662답논_벼
391513094140570.2754634773030000000000000경상북도 의성군 의성읍 팔성리666전과수_기타
4059651744382380.2307524773040000000000000경상북도 의성군 금성면 하리산115임비경지
9529952847287058.1982844773040000000000000경상북도 의성군 다인면 산내리149-10 과비경지