Overview

Dataset statistics

Number of variables: 8
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 36
Duplicate rows (%): 0.4%
Total size in memory: 722.7 KiB
Average record size in memory: 74.0 B
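
The derived figures in this table follow from the raw counts. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes; the variable names are illustrative and not part of the report:

```python
# Figures copied from the dataset statistics above.
n_rows = 10_000
n_duplicate_rows = 36
total_size_kib = 722.7

# Share of duplicate rows, as a percentage of all observations.
duplicate_pct = 100 * n_duplicate_rows / n_rows

# Average record size in bytes (assumption: 1 KiB = 1024 B).
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Rounded to one decimal place, both results agree with the values shown in the table.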

Variable types

Categorical: 3
Text: 1
Unsupported: 2
Numeric: 2

Dataset

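Description: This public dataset provides the status of individual officially assessed land prices (개별공시지가) in Wanju-gun. It covers the land prices determined each year, as of January, by parcel and by eup/myeon.
Author: 전라북도 완주군 (Wanju-gun, Jeollabuk-do)
URL: https://www.data.go.kr/data/15013365/fileData.do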

Alerts

Dataset has 36 (0.4%) duplicate rows [Duplicates]
구분 is highly overall correlated with 공부지목 [High correlation]
공부지목 is highly overall correlated with 구분 [High correlation]
구분 is highly imbalanced (69.1%) [Imbalance]
총면적 is highly skewed (γ1 = 69.71543676) [Skewed]
본번 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
부번 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
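
The 69.1% imbalance figure for 구분 can be reproduced from its category counts (9013, 945 and 42) with a normalized-entropy score, 1 - H / log2(k). The sketch below assumes that definition; the function name `imbalance_score` is hypothetical and this is not necessarily the profiler's exact implementation.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform distribution, 1 when one class holds every row."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy in bits; empty classes contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Category counts of 구분 as listed in its variable section.
gubun_counts = [9013, 945, 42]
score = imbalance_score(gubun_counts)
print(f"{score:.1%}")
```

A score near 1 means a single category dominates, which matches the 90.1% share of the most frequent value.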

Reproduction

Analysis started: 2024-03-14 23:22:48.675517
Analysis finished: 2024-03-14 23:22:51.224247
Duration: 2.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

법정동
Categorical

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

봉동읍  2354
삼례읍  1878
용진읍  1763
이서면  1681
소양면  1428

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%
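
The report separates Distinct (how many different values occur) from Unique (how many values occur exactly once). A minimal sketch of that distinction, assuming this reading of the two terms; the helper name and the toy list are made up for illustration and are not taken from the dataset:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Toy input, not real rows from the dataset.
sample = ["봉동읍", "삼례읍", "봉동읍", "용진읍", "삼례읍", "이서면"]
print(distinct_and_unique(sample))
```

With six distinct values that each occur hundreds of times, a column like 법정동 has Distinct 6 and Unique 0.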

Sample

1st row소양면
2nd row이서면
3rd row봉동읍
4th row봉동읍
5th row용진읍

Common Values

Value   Count  Frequency (%)
봉동읍  2354   23.5%
삼례읍  1878   18.8%
용진읍  1763   17.6%
이서면  1681   16.8%
소양면  1428   14.3%
상관면  896    9.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]


Text

Distinct: 52
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.9441
Min length: 2

Characters and Unicode

Total characters: 29441
Distinct characters: 64
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
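
As a rough illustration of such properties, Python's standard `unicodedata` module exposes the general category and name of each code point. The sketch below counts categories over the five sample values shown for this variable; `char_categories` is a hypothetical helper, and script and block lookups are left out because the standard library does not provide them.

```python
import unicodedata
from collections import Counter

def char_categories(values):
    """Count Unicode general categories over every character of the given strings."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)

# The five sample values listed for this variable.
sample = ["명덕리", "상개리", "성덕리", "고천리", "상삼리"]
print(char_categories(sample))
print(unicodedata.name("리"))
```

The category code `Lo` is what the report labels "Other Letter".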

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row명덕리
2nd row상개리
3rd row성덕리
4th row고천리
5th row상삼리

Value              Count  Frequency (%)
삼례리             442    4.4%
운곡리             355    3.5%
은교리             313    3.1%
은하리             307    3.1%
용암리             301    3.0%
신지리             300    3.0%
명덕리             297    3.0%
하리               284    2.8%
신리               275    2.8%
간중리             251    2.5%
Other values (42)  6875   68.8%

Most occurring characters

Value              Count  Frequency (%)
                   10000  34.0%
                   1183   4.0%
                   968    3.3%
                   773    2.6%
                   756    2.6%
                   686    2.3%
                   658    2.2%
                   657    2.2%
                   636    2.2%
                   620    2.1%
Other values (54)  12504  42.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29441
100.0%

Most frequent character per category

Other Letter: same table as under "Most occurring characters", since all 29441 characters belong to this category.

Most occurring scripts

ValueCountFrequency (%)
Hangul 29441
100.0%

Most frequent character per script

Hangul: same table as under "Most occurring characters", since all 29441 characters belong to this script.

Most occurring blocks

ValueCountFrequency (%)
Hangul 29441
100.0%

Most frequent character per block

Hangul: same table as under "Most occurring characters", since all 29441 characters belong to this block.

구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

일반  9013
      945
블럭  42

Length

Max length: 2
Median length: 2
Mean length: 1.9055
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row일반
2nd row
3rd row일반
4th row일반
5th row일반

Common Values

Value  Count  Frequency (%)
일반   9013   90.1%
       945    9.4%
블럭   42     0.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

본번
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

부번
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

결정지가
Real number (ℝ)

Distinct: 2206
Distinct (%): 22.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 76288.898
Minimum: 346
Maximum: 2969000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 346
5-th percentile: 6798
Q1: 30700
median: 43900
Q3: 77300
95-th percentile: 220820
Maximum: 2969000
Range: 2968654
Interquartile range (IQR): 46600

Descriptive statistics

Standard deviation: 128194.41
Coefficient of variation (CV): 1.680381
Kurtosis: 143.60456
Mean: 76288.898
Median Absolute Deviation (MAD): 18400
Skewness: 9.3115507
Sum: 7.6288898 × 10^8
Variance: 1.6433807 × 10^10
Monotonicity: Not monotonic
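
Several of the 결정지가 figures are simple functions of the others. A short sketch that rederives them from the reported values, assuming the usual definitions (range = max - min, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared); the variable names are illustrative:

```python
# Values copied from the 결정지가 statistics above.
minimum, maximum = 346, 2_969_000
q1, q3 = 30_700, 77_300
mean, std = 76_288.898, 128_194.41
n_rows = 10_000

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)
total = mean * n_rows            # Sum
variance = std ** 2              # Variance

print(value_range, iqr, round(cv, 6), f"{total:.7e}", f"{variance:.7e}")
```

Each derived value agrees with the corresponding entry in the report up to rounding.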
[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value                Count  Frequency (%)
35000                57     0.6%
34000                56     0.6%
31000                54     0.5%
39400                47     0.5%
25000                43     0.4%
41400                43     0.4%
42000                42     0.4%
33500                42     0.4%
41000                42     0.4%
34500                39     0.4%
Other values (2196)  9535   95.3%

Minimum 10 values

Value  Count  Frequency (%)
346    1      < 0.1%
494    1      < 0.1%
509    2      < 0.1%
513    1      < 0.1%
514    1      < 0.1%
518    2      < 0.1%
520    3      < 0.1%
577    1      < 0.1%
589    1      < 0.1%
612    1      < 0.1%

Maximum 10 values

Value    Count  Frequency (%)
2969000  1      < 0.1%
2939000  2      < 0.1%
2655000  2      < 0.1%
1801000  1      < 0.1%
1745000  2      < 0.1%
1715000  1      < 0.1%
1675000  1      < 0.1%
1668000  3      < 0.1%
1485000  1      < 0.1%
1445000  1      < 0.1%

공부지목
Categorical

HIGH CORRELATION 

Distinct: 25
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

                   3232
                   2595
                   2111
임야               1284
잡종지             211
Other values (20)  567

Length

Max length: 5
Median length: 1
Mean length: 1.2826
Min length: 1

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row
2nd row임야
3rd row
4th row
5th row

Common Values

Value              Count  Frequency (%)
                   3232   32.3%
                   2595   25.9%
                   2111   21.1%
임야               1284   12.8%
잡종지             211    2.1%
도로               97     1.0%
하천               96     1.0%
공장용지           89     0.9%
창고용지           72     0.7%
구거               38     0.4%
Other values (15)  175    1.8%

Length

[Figure: Histogram of lengths of the category]

총면적
Real number (ℝ)

SKEWED 

Distinct: 3198
Distinct (%): 32.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2580.1228
Minimum: 1
Maximum: 2329427
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 28
Q1: 232
median: 575
Q3: 1525.5
95-th percentile: 5468.55
Maximum: 2329427
Range: 2329426
Interquartile range (IQR): 1293.5

Descriptive statistics

Standard deviation: 26539.945
Coefficient of variation (CV): 10.286311
Kurtosis: 5941.7693
Mean: 2580.1228
Median Absolute Deviation (MAD): 443
Skewness: 69.715437
Sum: 25801228
Variance: 7.043687 × 10^8
Monotonicity: Not monotonic
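
The skewness reported here (γ1) measures how asymmetric the distribution is; a long right tail, such as the single 2329427 value against a median of 575, pushes it far above zero. The sketch below uses the plain moment-based definition m3 / m2^1.5 on made-up toy data. It is an assumption that this is close to the profiler's estimator, which may apply a bias correction, so values on small samples can differ.

```python
def skewness(values):
    """Moment-based sample skewness: third central moment over variance^1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    return m3 / m2 ** 1.5

# Toy data, not taken from the dataset.
symmetric = [1, 2, 3, 4, 5]
long_tail = [1, 2, 3, 4, 100]

print(skewness(symmetric))
print(skewness(long_tail))
```

The symmetric list gives zero, while one large outlier is enough to make the skewness clearly positive.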
[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value                Count  Frequency (%)
198.0                42     0.4%
4000.0               41     0.4%
496.0                40     0.4%
13.0                 39     0.4%
10.0                 39     0.4%
298.0                38     0.4%
3.0                  33     0.3%
1.0                  31     0.3%
992.0                31     0.3%
66.0                 31     0.3%
Other values (3188)  9635   96.4%

Minimum 10 values

Value  Count  Frequency (%)
1.0    31     0.3%
2.0    18     0.2%
3.0    33     0.3%
4.0    18     0.2%
5.0    17     0.2%
6.0    9      0.1%
7.0    28     0.3%
8.0    24     0.2%
9.0    16     0.2%
10.0   39     0.4%

Maximum 10 values

Value      Count  Frequency (%)
2329427.0  1      < 0.1%
488926.0   1      < 0.1%
447742.0   1      < 0.1%
441422.0   1      < 0.1%
309864.0   1      < 0.1%
237025.0   1      < 0.1%
217388.0   1      < 0.1%
215119.6   1      < 0.1%
205362.0   1      < 0.1%
197950.0   1      < 0.1%

Interactions

[Figures: 4 interaction plots]

Correlations

[Figure: correlation heatmap]

           법정동  (unnamed)  구분   결정지가  공부지목  총면적
법정동     1.000   0.999      0.340  0.115     0.334     0.044
(unnamed)  0.999   1.000      0.413  0.460     0.482     0.000
구분       0.340   0.413      1.000  0.090     0.799     0.054
결정지가   0.115   0.460      0.090  1.000     0.222     0.000
공부지목   0.334   0.482      0.799  0.222     1.000     0.000
총면적     0.044   0.000      0.054  0.000     0.000     1.000

[Figure: correlation heatmap]

          구분   공부지목  법정동
구분      1.000  0.602     0.150
공부지목  0.602  1.000     0.156
법정동    0.150  0.156     1.000

[Figure: correlation heatmap]

          결정지가  총면적  법정동  구분   공부지목
결정지가  1.000     -0.318  0.057   0.039  0.086
총면적    -0.318    1.000   0.028   0.051  0.000
법정동    0.057     0.028   1.000   0.150  0.156
구분      0.039     0.051   0.150   1.000  0.602
공부지목  0.086     0.000   0.156   0.602  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

법정동구분본번부번결정지가공부지목총면적
89379소양면명덕리일반892162320059.0
74573이서면상개리79318200임야565.0
35889봉동읍성덕리일반12322394002457.0
37808봉동읍고천리일반154073100192.0
41919용진읍상삼리일반11523200106.0
48588용진읍상운리일반86982460051.0
64365상관면죽림리일반61303365001709.0
15697삼례읍수계리일반12182552000468.0
17957삼례읍하리일반106523320043.0
73460이서면상개리일반391039700258.0
법정동구분본번부번결정지가공부지목총면적
62746상관면신리5921170임야98182.0
77219이서면은교리일반32084250076.0
59257용진읍신지리일반11404138800364.0
58988용진읍신지리일반105661078008.0
7745삼례읍해전리일반2333050000145.0
71465이서면이성리일반344247900구거99.0
56682용진읍운곡리2014500임야8033.0
29244봉동읍율소리일반4731522002116.0
45480용진읍구억리일반4180139100301.0
1429삼례읍삼례리일반91623191000395.0

Duplicate rows

Most frequently occurring

법정동구분결정지가공부지목총면적# duplicates
15삼례읍삼례리일반395004000.04
21삼례읍신금리일반410004000.04
1봉동읍구미리일반313004000.03
12봉동읍율소리일반310004000.03
16삼례읍삼례리일반575004000.03
0봉동읍고천리일반394001653.02
2봉동읍구암리일반32700198.02
3봉동읍구암리일반360004010.02
4봉동읍구암리일반393001049.02
5봉동읍구암리일반50600잡종지1525.02
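
A table like this one can be built by grouping identical rows and keeping the groups that occur more than once. The sketch below assumes that "# duplicates" is the number of times each row occurs; `duplicate_groups` is a hypothetical helper and the rows are placeholder data, not records from the dataset.

```python
from collections import Counter

def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}

# Placeholder rows for illustration only.
rows = [
    ("a", 100, 4000.0),
    ("b", 200, 198.0),
    ("a", 100, 4000.0),
    ("c", 300, 10.0),
    ("a", 100, 4000.0),
    ("b", 200, 198.0),
]

# Print the groups, most frequent first.
for row, n in sorted(duplicate_groups(rows).items(), key=lambda item: -item[1]):
    print(row, n)
```

Rows are converted to tuples so they can be used as dictionary keys; columns with unsupported types would need to be dropped or normalized first.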