Overview

Dataset statistics

Number of variables5
Number of observations546
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory22.0 KiB
Average record size in memory41.2 B

Variable types

Categorical1
Text1
Numeric1
DateTime2

Dataset

Description경상남도 거창군 농지전용현황에 대한 데이터로 구분(농지전용협의, 농지전용신고, 농지전용용도변경승인, 농지전용타용도일시사용협의), 지번주소, 전용면적, 전용일자를 제공합니다.
Author경상남도 거창군
URLhttps://www.data.go.kr/data/15122915/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 1 (0.2%) duplicate rowsDuplicates
구분 is highly imbalanced (87.9%)Imbalance

Reproduction

Analysis started2023-12-12 10:08:30.284626
Analysis finished2023-12-12 10:08:30.875763
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
농지전용협의
537 
농지타용도일시사용협의
 
9

Length

Max length11
Median length6
Mean length6.0824176
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row농지전용협의
2nd row농지전용협의
3rd row농지전용협의
4th row농지전용협의
5th row농지전용협의

Common Values

ValueCountFrequency (%)
농지전용협의 537
98.4%
농지타용도일시사용협의 9
 
1.6%

Length

2023-12-12T19:08:30.963524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:08:31.080842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농지전용협의 537
98.4%
농지타용도일시사용협의 9
 
1.6%
Distinct450
Distinct (%)82.4%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-12-12T19:08:31.374935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length26
Mean length25.992674
Min length25

Characters and Unicode

Total characters14192
Distinct characters97
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique399 ?
Unique (%)73.1%

Sample

1st row경상남도 거창군 거창읍 장팔리 0458-0000
2nd row경상남도 거창군 고제면 봉산리 1421-0016
3rd row경상남도 거창군 거창읍 서변리 0208-0001
4th row경상남도 거창군 신원면 덕산리 0721-0003
5th row경상남도 거창군 신원면 덕산리 0721-0001
ValueCountFrequency (%)
경상남도 546
20.0%
거창군 546
20.0%
고제면 85
 
3.1%
개명리 79
 
2.9%
거창읍 71
 
2.6%
남상면 69
 
2.5%
마리면 62
 
2.3%
웅양면 56
 
2.1%
가조면 47
 
1.7%
대동리 47
 
1.7%
Other values (506) 1122
41.1%
2023-12-12T19:08:31.851092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2258
15.9%
2184
15.4%
667
 
4.7%
664
 
4.7%
618
 
4.4%
617
 
4.3%
608
 
4.3%
562
 
4.0%
- 546
 
3.8%
546
 
3.8%
Other values (87) 4922
34.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7094
50.0%
Decimal Number 4368
30.8%
Space Separator 2184
 
15.4%
Dash Punctuation 546
 
3.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
667
 
9.4%
664
 
9.4%
618
 
8.7%
617
 
8.7%
608
 
8.6%
562
 
7.9%
546
 
7.7%
546
 
7.7%
475
 
6.7%
100
 
1.4%
Other values (75) 1691
23.8%
Decimal Number
ValueCountFrequency (%)
0 2258
51.7%
1 526
 
12.0%
2 305
 
7.0%
3 237
 
5.4%
6 218
 
5.0%
5 196
 
4.5%
9 172
 
3.9%
7 166
 
3.8%
4 153
 
3.5%
8 137
 
3.1%
Space Separator
ValueCountFrequency (%)
2184
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 546
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7098
50.0%
Hangul 7094
50.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
667
 
9.4%
664
 
9.4%
618
 
8.7%
617
 
8.7%
608
 
8.6%
562
 
7.9%
546
 
7.7%
546
 
7.7%
475
 
6.7%
100
 
1.4%
Other values (75) 1691
23.8%
Common
ValueCountFrequency (%)
0 2258
31.8%
2184
30.8%
- 546
 
7.7%
1 526
 
7.4%
2 305
 
4.3%
3 237
 
3.3%
6 218
 
3.1%
5 196
 
2.8%
9 172
 
2.4%
7 166
 
2.3%
Other values (2) 290
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7098
50.0%
Hangul 7094
50.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2258
31.8%
2184
30.8%
- 546
 
7.7%
1 526
 
7.4%
2 305
 
4.3%
3 237
 
3.3%
6 218
 
3.1%
5 196
 
2.8%
9 172
 
2.4%
7 166
 
2.3%
Other values (2) 290
 
4.1%
Hangul
ValueCountFrequency (%)
667
 
9.4%
664
 
9.4%
618
 
8.7%
617
 
8.7%
608
 
8.6%
562
 
7.9%
546
 
7.7%
546
 
7.7%
475
 
6.7%
100
 
1.4%
Other values (75) 1691
23.8%
Distinct399
Distinct (%)73.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean736.08059
Minimum1
Maximum12549
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.9 KiB
2023-12-12T19:08:32.055736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6
Q167.25
median293
Q3952
95-th percentile2693.25
Maximum12549
Range12548
Interquartile range (IQR)884.75

Descriptive statistics

Standard deviation1225.2613
Coefficient of variation (CV)1.6645749
Kurtosis26.787635
Mean736.08059
Median Absolute Deviation (MAD)273.5
Skewness4.3088854
Sum401900
Variance1501265.2
MonotonicityNot monotonic
2023-12-12T19:08:32.218235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 7
 
1.3%
6 6
 
1.1%
3 5
 
0.9%
5 5
 
0.9%
28 5
 
0.9%
8 4
 
0.7%
25 4
 
0.7%
99 4
 
0.7%
53 4
 
0.7%
51 4
 
0.7%
Other values (389) 498
91.2%
ValueCountFrequency (%)
1 4
0.7%
2 2
 
0.4%
3 5
0.9%
4 7
1.3%
5 5
0.9%
6 6
1.1%
7 3
0.5%
8 4
0.7%
9 1
 
0.2%
10 3
0.5%
ValueCountFrequency (%)
12549 1
0.2%
9500 1
0.2%
7649 1
0.2%
7074 1
0.2%
6679 1
0.2%
6675 1
0.2%
6663 1
0.2%
6446 1
0.2%
5518 1
0.2%
4816 1
0.2%
Distinct75
Distinct (%)13.7%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
Minimum2023-01-02 00:00:00
Maximum2023-08-31 00:00:00
2023-12-12T19:08:32.414828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:08:32.590113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
Minimum2023-09-11 00:00:00
Maximum2023-09-11 00:00:00
2023-12-12T19:08:32.702225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:08:32.810994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T19:08:30.496626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:08:32.901657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분전용면적(제곱미터)전용일자
구분1.0000.0670.684
전용면적(제곱미터)0.0671.0000.000
전용일자0.6840.0001.000
2023-12-12T19:08:33.032975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전용면적(제곱미터)구분
전용면적(제곱미터)1.0000.067
구분0.0671.000

Missing values

2023-12-12T19:08:30.702351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:08:30.831956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분지번주소전용면적(제곱미터)전용일자데이터기준일자
0농지전용협의경상남도 거창군 거창읍 장팔리 0458-00006602023-08-312023-09-11
1농지전용협의경상남도 거창군 고제면 봉산리 1421-00166592023-08-312023-09-11
2농지전용협의경상남도 거창군 거창읍 서변리 0208-00016102023-08-282023-09-11
3농지전용협의경상남도 거창군 신원면 덕산리 0721-00033042023-08-232023-09-11
4농지전용협의경상남도 거창군 신원면 덕산리 0721-0001602023-08-232023-09-11
5농지전용협의경상남도 거창군 신원면 덕산리 0720-0001922023-08-232023-09-11
6농지전용협의경상남도 거창군 신원면 덕산리 0719-00046202023-08-232023-09-11
7농지전용협의경상남도 거창군 신원면 덕산리 0719-00035242023-08-232023-09-11
8농지전용협의경상남도 거창군 신원면 덕산리 0718-000252023-08-232023-09-11
9농지전용협의경상남도 거창군 신원면 덕산리 0716-00031112023-08-232023-09-11
구분지번주소전용면적(제곱미터)전용일자데이터기준일자
536농지전용협의경상남도 거창군 웅양면 신촌리 0209-000518092023-01-022023-09-11
537농지타용도일시사용협의경상남도 거창군 거창읍 송정리 0612-000313152023-05-162023-09-11
538농지타용도일시사용협의경상남도 거창군 거창읍 송정리 0612-000218932023-05-162023-09-11
539농지타용도일시사용협의경상남도 거창군 남하면 둔마리 1269-000314282023-04-052023-09-11
540농지타용도일시사용협의경상남도 거창군 남하면 둔마리 1269-00024982023-04-052023-09-11
541농지타용도일시사용협의경상남도 거창군 남하면 둔마리 1269-00012982023-04-052023-09-11
542농지타용도일시사용협의경상남도 거창군 웅양면 노현리 0226-00015702023-03-222023-09-11
543농지타용도일시사용협의경상남도 거창군 남상면 대산리 1252-00008302023-03-132023-09-11
544농지타용도일시사용협의경상남도 거창군 마리면 말흘리 0777-000462023-02-082023-09-11
545농지타용도일시사용협의경상남도 거창군 마리면 말흘리 0777-000113692023-02-082023-09-11

Duplicate rows

Most frequently occurring

구분지번주소전용면적(제곱미터)전용일자데이터기준일자# duplicates
0농지전용협의경상남도 거창군 가조면 동례리 1633-0001562023-04-052023-09-113