Overview

Dataset statistics

Number of variables5
Number of observations4914
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory201.7 KiB
Average record size in memory42.0 B

Variable types

Text2
Numeric2
Categorical1

Dataset

Description전북특별자치도 장수군 농지(소재지, 번지, 호, 지목, 면적, 번지, 지목, 면적)에 대한 데이터를 제공하고자 합니다
Author전북특별자치도 장수군
URLhttps://www.data.go.kr/data/15112841/fileData.do

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
has 923 (18.8%) zerosZeros

Reproduction

Analysis started2024-04-06 08:18:43.220540
Analysis finished2024-04-06 08:18:44.974819
Duration1.75 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct73
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size38.5 KiB
2024-04-06T17:18:45.324901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length19
Mean length19
Min length19

Characters and Unicode

Total characters93366
Distinct characters90
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전북특별자치도 장수군 장수읍 장수리
2nd row전북특별자치도 장수군 장수읍 장수리
3rd row전북특별자치도 장수군 장수읍 장수리
4th row전북특별자치도 장수군 장수읍 장수리
5th row전북특별자치도 장수군 장수읍 장수리
ValueCountFrequency (%)
전북특별자치도 4914
25.0%
장수군 4914
25.0%
장수읍 989
 
5.0%
천천면 925
 
4.7%
장계면 813
 
4.1%
산서면 606
 
3.1%
계북면 602
 
3.1%
계남면 529
 
2.7%
번암면 450
 
2.3%
춘송리 368
 
1.9%
Other values (70) 4546
23.1%
2024-04-06T17:18:45.984246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14742
15.8%
7132
 
7.6%
6081
 
6.5%
5516
 
5.9%
5273
 
5.6%
4914
 
5.3%
4914
 
5.3%
4914
 
5.3%
4914
 
5.3%
4914
 
5.3%
Other values (80) 30052
32.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 78624
84.2%
Space Separator 14742
 
15.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7132
 
9.1%
6081
 
7.7%
5516
 
7.0%
5273
 
6.7%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
Other values (79) 25138
32.0%
Space Separator
ValueCountFrequency (%)
14742
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 78624
84.2%
Common 14742
 
15.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7132
 
9.1%
6081
 
7.7%
5516
 
7.0%
5273
 
6.7%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
Other values (79) 25138
32.0%
Common
ValueCountFrequency (%)
14742
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 78624
84.2%
ASCII 14742
 
15.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
14742
100.0%
Hangul
ValueCountFrequency (%)
7132
 
9.1%
6081
 
7.7%
5516
 
7.0%
5273
 
6.7%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
4914
 
6.2%
Other values (79) 25138
32.0%

번지
Real number (ℝ)

Distinct1494
Distinct (%)30.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean665.78673
Minimum1
Maximum2473
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size43.3 KiB
2024-04-06T17:18:46.368162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile55.65
Q1296
median564
Q3931.75
95-th percentile1743.35
Maximum2473
Range2472
Interquartile range (IQR)635.75

Descriptive statistics

Standard deviation486.81199
Coefficient of variation (CV)0.73118307
Kurtosis0.18799403
Mean665.78673
Median Absolute Deviation (MAD)315
Skewness0.90648467
Sum3271676
Variance236985.91
MonotonicityNot monotonic
2024-04-06T17:18:46.772410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
364 54
 
1.1%
311 46
 
0.9%
292 42
 
0.9%
1833 40
 
0.8%
21 36
 
0.7%
1803 33
 
0.7%
225 25
 
0.5%
365 24
 
0.5%
803 23
 
0.5%
1454 22
 
0.4%
Other values (1484) 4569
93.0%
ValueCountFrequency (%)
1 11
0.2%
2 9
0.2%
3 6
0.1%
4 4
 
0.1%
5 3
 
0.1%
7 1
 
< 0.1%
8 5
0.1%
10 3
 
0.1%
12 1
 
< 0.1%
13 1
 
< 0.1%
ValueCountFrequency (%)
2473 1
< 0.1%
2472 1
< 0.1%
2394 1
< 0.1%
2319 1
< 0.1%
2318 2
< 0.1%
2118 2
< 0.1%
2099 1
< 0.1%
2083 1
< 0.1%
2056 1
< 0.1%
1973 1
< 0.1%


Real number (ℝ)

ZEROS 

Distinct125
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.8876679
Minimum0
Maximum195
Zeros923
Zeros (%)18.8%
Negative0
Negative (%)0.0%
Memory size43.3 KiB
2024-04-06T17:18:47.231011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q36
95-th percentile40
Maximum195
Range195
Interquartile range (IQR)5

Descriptive statistics

Standard deviation17.582836
Coefficient of variation (CV)2.2291552
Kurtosis18.895321
Mean7.8876679
Median Absolute Deviation (MAD)2
Skewness4.0866183
Sum38760
Variance309.15611
MonotonicityNot monotonic
2024-04-06T17:18:47.562174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1177
24.0%
0 923
18.8%
2 490
10.0%
3 458
 
9.3%
4 345
 
7.0%
5 227
 
4.6%
6 177
 
3.6%
7 129
 
2.6%
8 104
 
2.1%
9 75
 
1.5%
Other values (115) 809
16.5%
ValueCountFrequency (%)
0 923
18.8%
1 1177
24.0%
2 490
10.0%
3 458
 
9.3%
4 345
 
7.0%
5 227
 
4.6%
6 177
 
3.6%
7 129
 
2.6%
8 104
 
2.1%
9 75
 
1.5%
ValueCountFrequency (%)
195 1
< 0.1%
133 1
< 0.1%
131 1
< 0.1%
130 1
< 0.1%
129 1
< 0.1%
127 1
< 0.1%
124 1
< 0.1%
123 1
< 0.1%
122 1
< 0.1%
121 1
< 0.1%

지목
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size38.5 KiB
2726 
2141 
과수원
 
47

Length

Max length3
Median length1
Mean length1.019129
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
2726
55.5%
2141
43.6%
과수원 47
 
1.0%

Length

2024-04-06T17:18:47.813682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:18:48.003320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2726
55.5%
2141
43.6%
과수원 47
 
1.0%
Distinct1214
Distinct (%)24.7%
Missing0
Missing (%)0.0%
Memory size38.5 KiB
2024-04-06T17:18:48.667099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length2.6821327
Min length1

Characters and Unicode

Total characters13180
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique601 ?
Unique (%)12.2%

Sample

1st row5413
2nd row116
3rd row387
4th row80
5th row100
ValueCountFrequency (%)
33 41
 
0.8%
36 36
 
0.7%
8 35
 
0.7%
7 35
 
0.7%
13 35
 
0.7%
26 34
 
0.7%
4 33
 
0.7%
23 33
 
0.7%
40 32
 
0.7%
3 31
 
0.6%
Other values (1203) 4567
93.0%
2024-04-06T17:18:49.701868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 2344
17.8%
2 1749
13.3%
3 1519
11.5%
4 1231
9.3%
6 1165
8.8%
5 1116
8.5%
8 1065
8.1%
9 1031
7.8%
7 1010
7.7%
0 944
7.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 13174
> 99.9%
Space Separator 6
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2344
17.8%
2 1749
13.3%
3 1519
11.5%
4 1231
9.3%
6 1165
8.8%
5 1116
8.5%
8 1065
8.1%
9 1031
7.8%
7 1010
7.7%
0 944
7.2%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 13180
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 2344
17.8%
2 1749
13.3%
3 1519
11.5%
4 1231
9.3%
6 1165
8.8%
5 1116
8.5%
8 1065
8.1%
9 1031
7.8%
7 1010
7.7%
0 944
7.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13180
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 2344
17.8%
2 1749
13.3%
3 1519
11.5%
4 1231
9.3%
6 1165
8.8%
5 1116
8.5%
8 1065
8.1%
9 1031
7.8%
7 1010
7.7%
0 944
7.2%

Interactions

2024-04-06T17:18:44.194898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:18:43.700229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:18:44.382385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:18:43.953849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:18:49.878192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지번지지목
소재지1.0000.8080.5490.597
번지0.8081.0000.2220.279
0.5490.2221.0000.284
지목0.5970.2790.2841.000
2024-04-06T17:18:50.056973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번지지목
번지1.000-0.1700.174
-0.1701.0000.188
지목0.1740.1881.000

Missing values

2024-04-06T17:18:44.651832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:18:44.891945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

소재지번지지목면적
0전북특별자치도 장수군 장수읍 장수리5415413
1전북특별자치도 장수군 장수읍 장수리551116
2전북특별자치도 장수군 장수읍 장수리704387
3전북특별자치도 장수군 장수읍 장수리70680
4전북특별자치도 장수군 장수읍 장수리766100
5전북특별자치도 장수군 장수읍 장수리76777
6전북특별자치도 장수군 장수읍 장수리76996
7전북특별자치도 장수군 장수읍 장수리7610171
8전북특별자치도 장수군 장수읍 장수리780149
9전북특별자치도 장수군 장수읍 장수리881175
소재지번지지목면적
4904전북특별자치도 장수군 계북면 양악리7493581
4905전북특별자치도 장수군 계북면 양악리7493867
4906전북특별자치도 장수군 계북면 양악리778895
4907전북특별자치도 장수군 계북면 양악리7781389
4908전북특별자치도 장수군 계북면 양악리7922255
4909전북특별자치도 장수군 계북면 양악리7923174
4910전북특별자치도 장수군 계북면 양악리8010496
4911전북특별자치도 장수군 계북면 양악리8653145
4912전북특별자치도 장수군 계북면 양악리8770377
4913전북특별자치도 장수군 계북면 양악리11342189

Duplicate rows

Most frequently occurring

소재지번지지목면적# duplicates
0전북특별자치도 장수군 계남면 화음리1480118652