Overview

Dataset statistics

Number of variables12
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.0 MiB
Average record size in memory110.0 B

Variable types

Numeric2
Categorical8
Text1
Boolean1

Dataset

Description부산광역시_연제구_개별공시지가정보_20200316
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15039887

Alerts

기준년도 has constant value ""Constant
기준월 has constant value ""Constant
공시일자 has constant value ""Constant
데이터기준일자 has constant value ""Constant
특수지구분명 is highly overall correlated with 특수지구분코드High correlation
특수지구분코드 is highly overall correlated with 특수지구분명High correlation
고유번호 is highly overall correlated with 법정동코드High correlation
법정동코드 is highly overall correlated with 고유번호High correlation
특수지구분코드 is highly imbalanced (85.2%)Imbalance
특수지구분명 is highly imbalanced (85.2%)Imbalance
표준지여부 is highly imbalanced (82.1%)Imbalance
고유번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:11:17.442609
Analysis finished2023-12-10 16:11:18.865777
Duration1.42 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

고유번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.6470102 × 1018
Minimum2.6470101 × 1018
Maximum2.6470102 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:11:18.935006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.6470101 × 1018
5-th percentile2.6470101 × 1018
Q12.6470101 × 1018
median2.6470102 × 1018
Q32.6470102 × 1018
95-th percentile2.6470102 × 1018
Maximum2.6470102 × 1018
Range1.0010219 × 1011
Interquartile range (IQR)1.0000307 × 1011

Descriptive statistics

Standard deviation4.6604822 × 1010
Coefficient of variation (CV)1.760659 × 10-8
Kurtosis-1.3953368
Mean2.6470102 × 1018
Median Absolute Deviation (MAD)12230144
Skewness-0.7777801
Sum-9.7606346 × 1017
Variance2.1720094 × 1021
MonotonicityNot monotonic
2023-12-11T01:11:19.069336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2647010100110920016 1
 
< 0.1%
2647010200105860004 1
 
< 0.1%
2647010200103050028 1
 
< 0.1%
2647010100103810015 1
 
< 0.1%
2647010200118760237 1
 
< 0.1%
2647010200103740050 1
 
< 0.1%
2647010200107300002 1
 
< 0.1%
2647010200106530006 1
 
< 0.1%
2647010100100360016 1
 
< 0.1%
2647010200106390013 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
2647010100100010001 1
< 0.1%
2647010100100010005 1
< 0.1%
2647010100100010007 1
< 0.1%
2647010100100010008 1
< 0.1%
2647010100100010011 1
< 0.1%
2647010100100010016 1
< 0.1%
2647010100100010018 1
< 0.1%
2647010100100010020 1
< 0.1%
2647010100100010022 1
< 0.1%
2647010100100010023 1
< 0.1%
ValueCountFrequency (%)
2647010200202200002 1
< 0.1%
2647010200202200000 1
< 0.1%
2647010200201880007 1
< 0.1%
2647010200201870005 1
< 0.1%
2647010200201860034 1
< 0.1%
2647010200201860018 1
< 0.1%
2647010200201860012 1
< 0.1%
2647010200201860009 1
< 0.1%
2647010200201840000 1
< 0.1%
2647010200201810042 1
< 0.1%

법정동코드
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2647010200
6812 
2647010100
3188 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2647010100
2nd row2647010200
3rd row2647010200
4th row2647010200
5th row2647010100

Common Values

ValueCountFrequency (%)
2647010200 6812
68.1%
2647010100 3188
31.9%

Length

2023-12-11T01:11:19.177934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:11:19.281628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2647010200 6812
68.1%
2647010100 3188
31.9%

법정동명
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
거제동
7188 
연산동
2812 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row거제동
2nd row거제동
3rd row거제동
4th row거제동
5th row거제동

Common Values

ValueCountFrequency (%)
거제동 7188
71.9%
연산동 2812
 
28.1%

Length

2023-12-11T01:11:19.409918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:11:19.491570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
거제동 7188
71.9%
연산동 2812
 
28.1%

특수지구분코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
9789 
2
 
211

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 9789
97.9%
2 211
 
2.1%

Length

2023-12-11T01:11:19.576263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:11:19.652003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 9789
97.9%
2 211
 
2.1%

특수지구분명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반
9789 
 
211

Length

Max length2
Median length2
Mean length1.9789
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 9789
97.9%
211
 
2.1%

Length

2023-12-11T01:11:19.740873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:11:19.825755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 9789
97.9%
211
 
2.1%

지번
Text

Distinct9785
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:11:20.081162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length9
Min length9

Characters and Unicode

Total characters90000
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9570 ?
Unique (%)95.7%

Sample

1st row1092-0016
2nd row1833-0003
3rd row1127-0028
4th row0640-0029
5th row0733-0068
ValueCountFrequency (%)
0794-0009 2
 
< 0.1%
0339-0001 2
 
< 0.1%
0463-0007 2
 
< 0.1%
1255-0001 2
 
< 0.1%
0649-0020 2
 
< 0.1%
0489-0008 2
 
< 0.1%
0057-0002 2
 
< 0.1%
1233-0001 2
 
< 0.1%
0794-0013 2
 
< 0.1%
0303-0006 2
 
< 0.1%
Other values (9775) 9980
99.8%
2023-12-11T01:11:20.457134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 32772
36.4%
1 10342
 
11.5%
- 10000
 
11.1%
2 6511
 
7.2%
3 5329
 
5.9%
4 4882
 
5.4%
6 4837
 
5.4%
8 4193
 
4.7%
7 4158
 
4.6%
5 3749
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 80000
88.9%
Dash Punctuation 10000
 
11.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 32772
41.0%
1 10342
 
12.9%
2 6511
 
8.1%
3 5329
 
6.7%
4 4882
 
6.1%
6 4837
 
6.0%
8 4193
 
5.2%
7 4158
 
5.2%
5 3749
 
4.7%
9 3227
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 10000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 90000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 32772
36.4%
1 10342
 
11.5%
- 10000
 
11.1%
2 6511
 
7.2%
3 5329
 
5.9%
4 4882
 
5.4%
6 4837
 
5.4%
8 4193
 
4.7%
7 4158
 
4.6%
5 3749
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 32772
36.4%
1 10342
 
11.5%
- 10000
 
11.1%
2 6511
 
7.2%
3 5329
 
5.9%
4 4882
 
5.4%
6 4837
 
5.4%
8 4193
 
4.7%
7 4158
 
4.6%
5 3749
 
4.2%

기준년도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2019
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019 10000
100.0%

Length

2023-12-11T01:11:20.585797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:11:20.673731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 10000
100.0%

기준월
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 10000
100.0%

Length

2023-12-11T01:11:20.761506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:11:20.839813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 10000
100.0%

공시지가
Real number (ℝ)

Distinct2390
Distinct (%)23.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1341670.6
Minimum1280
Maximum13210000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:11:21.177593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1280
5-th percentile356400
Q1842800
median1130000
Q31618000
95-th percentile3038100
Maximum13210000
Range13208720
Interquartile range (IQR)775200

Descriptive statistics

Standard deviation948908.1
Coefficient of variation (CV)0.70725861
Kurtosis25.903776
Mean1341670.6
Median Absolute Deviation (MAD)370000
Skewness3.4854611
Sum1.3416706 × 1010
Variance9.0042658 × 1011
MonotonicityNot monotonic
2023-12-11T01:11:21.312637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1050000 261
 
2.6%
1250000 177
 
1.8%
2080000 143
 
1.4%
1500000 127
 
1.3%
1650000 100
 
1.0%
594000 62
 
0.6%
623700 53
 
0.5%
356400 52
 
0.5%
3350000 51
 
0.5%
1380000 49
 
0.5%
Other values (2380) 8925
89.2%
ValueCountFrequency (%)
1280 2
 
< 0.1%
1610 2
 
< 0.1%
1660 1
 
< 0.1%
1680 4
< 0.1%
3000 5
0.1%
3750 1
 
< 0.1%
3990 1
 
< 0.1%
4050 3
< 0.1%
4170 1
 
< 0.1%
4240 1
 
< 0.1%
ValueCountFrequency (%)
13210000 4
< 0.1%
11910000 3
< 0.1%
11800000 2
< 0.1%
11400000 1
 
< 0.1%
11000000 1
 
< 0.1%
9270000 1
 
< 0.1%
9084000 1
 
< 0.1%
8600000 1
 
< 0.1%
8320000 1
 
< 0.1%
8137000 1
 
< 0.1%

공시일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2019-01-01
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019-01-01
2nd row2019-01-01
3rd row2019-01-01
4th row2019-01-01
5th row2019-01-01

Common Values

ValueCountFrequency (%)
2019-01-01 10000
100.0%

Length

2023-12-11T01:11:21.419058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:11:21.492795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-01-01 10000
100.0%

표준지여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
9731 
True
 
269
ValueCountFrequency (%)
False 9731
97.3%
True 269
 
2.7%
2023-12-11T01:11:21.551843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2020-03-16
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-03-16
2nd row2020-03-16
3rd row2020-03-16
4th row2020-03-16
5th row2020-03-16

Common Values

ValueCountFrequency (%)
2020-03-16 10000
100.0%

Length

2023-12-11T01:11:21.625232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:11:21.701302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-03-16 10000
100.0%

Interactions

2023-12-11T01:11:18.368629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:11:18.178359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:11:18.464265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:11:18.276078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:11:21.746564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
고유번호법정동코드법정동명특수지구분코드특수지구분명공시지가표준지여부
고유번호1.0001.0000.6210.0490.0490.0880.022
법정동코드1.0001.0000.6220.0470.0470.0880.017
법정동명0.6210.6221.0000.0000.0000.1450.000
특수지구분코드0.0490.0470.0001.0001.0000.1200.000
특수지구분명0.0490.0470.0001.0001.0000.1200.000
공시지가0.0880.0880.1450.1200.1201.0000.101
표준지여부0.0220.0170.0000.0000.0000.1011.000
2023-12-11T01:11:21.833610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정동코드표준지여부법정동명특수지구분명특수지구분코드
법정동코드1.0000.0110.4280.0300.030
표준지여부0.0111.0000.0000.0000.000
법정동명0.4280.0001.0000.0000.000
특수지구분명0.0300.0000.0001.0000.998
특수지구분코드0.0300.0000.0000.9981.000
2023-12-11T01:11:21.916318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
고유번호공시지가법정동코드법정동명특수지구분코드특수지구분명표준지여부
고유번호1.000-0.0621.0000.4280.0300.0300.011
공시지가-0.0621.0000.0670.1110.0930.0930.077
법정동코드1.0000.0671.0000.4280.0300.0300.011
법정동명0.4280.1110.4281.0000.0000.0000.000
특수지구분코드0.0300.0930.0300.0001.0000.9980.000
특수지구분명0.0300.0930.0300.0000.9981.0000.000
표준지여부0.0110.0770.0110.0000.0000.0001.000

Missing values

2023-12-11T01:11:18.608776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:11:18.797202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

고유번호법정동코드법정동명특수지구분코드특수지구분명지번기준년도기준월공시지가공시일자표준지여부데이터기준일자
694726470101001109200162647010100거제동1일반1092-00162019110620002019-01-01N2020-03-16
2316626470102001183300032647010200거제동1일반1833-00032019120490002019-01-01N2020-03-16
1885626470102001112700282647010200거제동1일반1127-00282019123800002019-01-01N2020-03-16
1422926470102001064000292647010200거제동1일반0640-00292019114500002019-01-01N2020-03-16
444126470101001073300682647010100거제동1일반0733-0068201916105002019-01-01N2020-03-16
2182726470102001176000052647010200거제동1일반1760-0005201919593002019-01-01N2020-03-16
1810126470102001096500032647010200거제동1일반0965-00032019113000002019-01-01N2020-03-16
285526470101001057200202647010100거제동1일반0572-00202019114550002019-01-01N2020-03-16
1469626470102001065500362647010200거제동1일반0655-00362019114200002019-01-01N2020-03-16
1416726470102001063800312647010200거제동1일반0638-00312019117600002019-01-01Y2020-03-16
고유번호법정동코드법정동명특수지구분코드특수지구분명지번기준년도기준월공시지가공시일자표준지여부데이터기준일자
803326470101001143800002647010100거제동1일반1438-0000201916336002019-01-01N2020-03-16
852226470101002010000062647010100거제동20100-0006201913411002019-01-01N2020-03-16
900326470102001005700962647010200연산동1일반0057-0096201917260002019-01-01N2020-03-16
1940126470102001123200202647010200거제동1일반1232-0020201915049002019-01-01N2020-03-16
439926470101001073300172647010100거제동1일반0733-00172019111120002019-01-01N2020-03-16
1484226470102001065800252647010200연산동1일반0658-0025201919900002019-01-01N2020-03-16
464226470101001075100242647010100거제동1일반0751-00242019115000002019-01-01N2020-03-16
2441826470102001199600452647010200연산동1일반1996-00452019115000002019-01-01N2020-03-16
535326470101001080200192647010100거제동1일반0802-0019201913861002019-01-01N2020-03-16
2346126470102001187600702647010200연산동1일반1876-0070201919256002019-01-01N2020-03-16