Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 585.9 KiB
Average record size in memory: 60.0 B
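
The average record size follows from the two figures before it: total memory divided by the number of observations. A minimal sketch of that arithmetic, assuming KiB means 1024 bytes (the variable names are illustrative and not part of the report):

```python
# Values copied from the dataset statistics in this report.
total_size_kib = 585.9      # "Total size in memory"
n_observations = 10_000     # "Number of observations"

# Assumption: 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")  # prints "60.0 B"
```

The result agrees with the reported 60.0 B to the displayed precision.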

Variable types

Numeric: 4
Categorical: 2

Dataset

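Description: 부산광역시_연제구_개별공시지가정보_20220914 (Busan Metropolitan City, Yeonje-gu, individual officially assessed land price information, 2022-09-14)
Author: Yeonje-gu, Busan Metropolitan City (부산광역시 연제구)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15039887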

Alerts

연번 is highly overall correlated with 본번 and 1 other field (High correlation)
본번 is highly overall correlated with 연번 (High correlation)
법정동 is highly overall correlated with 연번 (High correlation)
구분 is highly imbalanced (85.5%) (Imbalance)
연번 has unique values (Unique)
부번 has 263 (2.6%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-10 16:11:24.630266
Analysis finished: 2023-12-10 16:11:26.485104
Duration: 1.85 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
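
The reported duration is the difference between the start and finish timestamps. A minimal sketch of that check using only the standard library (the format string is an assumption about how the timestamps are written, based on how they appear here):

```python
from datetime import datetime

# Timestamp layout assumed from the two values shown in this report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-10 16:11:24.630266", FMT)
finished = datetime.strptime("2023-12-10 16:11:26.485104", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()

print(f"{duration_s:.2f} seconds")  # prints "1.85 seconds"
```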

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12875.855
Minimum: 2
Maximum: 25796
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2
5-th percentile: 1262.9
Q1: 6389.25
Median: 12844
Q3: 19382.75
95-th percentile: 24517.1
Maximum: 25796
Range: 25794
Interquartile range (IQR): 12993.5

Descriptive statistics

Standard deviation: 7485.2175
Coefficient of variation (CV): 0.5813375
Kurtosis: -1.2148473
Mean: 12875.855
Median Absolute Deviation (MAD): 6504.5
Skewness: 0.0085431518
Sum: 1.2875855 × 10^8
Variance: 56028481
Monotonicity: Not monotonic
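
Several of the figures for 연번 are derived from others: the coefficient of variation is the standard deviation divided by the mean, the variance is the standard deviation squared, the IQR is Q3 minus Q1, and the sum is the mean times the number of observations. A minimal sketch that recomputes them from the rounded values shown in this report (so agreement is only expected up to rounding; the variable names are illustrative):

```python
# Rounded inputs copied from the 연번 statistics in this report.
n = 10_000
mean = 12875.855
std = 7485.2175
q1, q3 = 6389.25, 19382.75
minimum, maximum = 2, 25796

cv = std / mean                    # coefficient of variation
variance = std ** 2                # variance from the standard deviation
iqr = q3 - q1                      # interquartile range
value_range = maximum - minimum    # range
total = mean * n                   # sum of all values

print(f"CV       = {cv:.7f}")
print(f"Variance = {variance:.0f}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")
print(f"Sum      = {total:.4e}")
```

The same relations can be used to sanity-check the other numeric variables in the report.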
Histogram with fixed size bins (bins=50)

Common values
Value                Count  Frequency (%)
4857                     1  < 0.1%
8907                     1  < 0.1%
22166                    1  < 0.1%
24967                    1  < 0.1%
25148                    1  < 0.1%
19999                    1  < 0.1%
6470                     1  < 0.1%
24240                    1  < 0.1%
12653                    1  < 0.1%
19694                    1  < 0.1%
Other values (9990)   9990  99.9%

Minimum 10 values
Value                Count  Frequency (%)
2                        1  < 0.1%
3                        1  < 0.1%
4                        1  < 0.1%
6                        1  < 0.1%
7                        1  < 0.1%
10                       1  < 0.1%
11                       1  < 0.1%
15                       1  < 0.1%
16                       1  < 0.1%
24                       1  < 0.1%

Maximum 10 values
Value                Count  Frequency (%)
25796                    1  < 0.1%
25794                    1  < 0.1%
25791                    1  < 0.1%
25781                    1  < 0.1%
25778                    1  < 0.1%
25774                    1  < 0.1%
25773                    1  < 0.1%
25771                    1  < 0.1%
25766                    1  < 0.1%
25762                    1  < 0.1%

법정동 (legal dong, the statutory neighborhood)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
연산동 (Yeonsan-dong): 6693
거제동 (Geoje-dong): 3307

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 거제동
2nd row: 연산동
3rd row: 연산동
4th row: 연산동
5th row: 연산동

Common Values

Value    Count  Frequency (%)
연산동   6693   66.9%
거제동   3307   33.1%

Length

Histogram of lengths of the category

Common Values (Plot)

구분 (category)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
일반 (general): 9794
(blank, 1 character): 206

Length

Max length: 2
Median length: 2
Mean length: 1.9794
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반
2nd row: 일반
3rd row: 일반
4th row: 일반
5th row: 일반

Common Values

Value     Count  Frequency (%)
일반      9794   97.9%
(blank)   206    2.1%
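
The 85.5% imbalance figure reported for 구분 can be reproduced from the two counts above. A minimal sketch, assuming the score is one minus the Shannon entropy of the class distribution divided by its maximum possible value, log2 of the number of classes. The report does not state its formula, so this definition is an assumption that happens to match the reported figure, and `imbalance_score` is a hypothetical helper name:

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class distribution and k is the number of classes.
    0 means perfectly balanced, 1 means a single class dominates entirely."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

# Counts copied from this report.
score_gubun = imbalance_score([9794, 206])     # 구분: 일반 vs. blank
score_dong = imbalance_score([6693, 3307])     # 법정동: 연산동 vs. 거제동

print(f"구분:   {score_gubun:.1%}")   # prints "구분:   85.5%"
print(f"법정동: {score_dong:.1%}")
```

Under this definition 법정동, with a roughly two-to-one split, scores far lower than 구분, which is consistent with only 구분 carrying the Imbalance alert.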

Length

Histogram of lengths of the category

Common Values (Plot)

본번 (main lot number)
Real number (ℝ)

HIGH CORRELATION

Distinct: 1290
Distinct (%): 12.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 951.0632
Minimum: 1
Maximum: 2380
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 70
Q1: 463
Median: 776
Q3: 1422.5
95-th percentile: 2079
Maximum: 2380
Range: 2379
Interquartile range (IQR): 959.5

Descriptive statistics

Standard deviation: 628.06291
Coefficient of variation (CV): 0.66037978
Kurtosis: -0.88985463
Mean: 951.0632
Median Absolute Deviation (MAD): 431
Skewness: 0.50378694
Sum: 9510632
Variance: 394463.02
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values
Value                Count  Frequency (%)
1811                   296  3.0%
2022                   208  2.1%
676                    139  1.4%
643                     85  0.9%
1876                    83  0.8%
1941                    78  0.8%
2063                    72  0.7%
649                     72  0.7%
766                     70  0.7%
802                     68  0.7%
Other values (1280)   8829  88.3%

Minimum 10 values
Value                Count  Frequency (%)
1                       42  0.4%
2                       23  0.2%
3                        3  < 0.1%
4                        3  < 0.1%
10                      14  0.1%
11                       7  0.1%
13                       1  < 0.1%
14                       4  < 0.1%
15                      29  0.3%
16                       1  < 0.1%

Maximum 10 values
Value                Count  Frequency (%)
2380                     1  < 0.1%
2379                     1  < 0.1%
2376                     1  < 0.1%
2374                     1  < 0.1%
2373                     1  < 0.1%
2372                     1  < 0.1%
2370                     1  < 0.1%
2368                     1  < 0.1%
2366                     1  < 0.1%
2365                     1  < 0.1%

부번 (sub lot number)
Real number (ℝ)

ZEROS

Distinct: 578
Distinct (%): 5.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 51.4265
Minimum: 0
Maximum: 896
Zeros: 263
Zeros (%): 2.6%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 7
Median: 20
Q3: 44
95-th percentile: 230.05
Maximum: 896
Range: 896
Interquartile range (IQR): 37

Descriptive statistics

Standard deviation: 104.00716
Coefficient of variation (CV): 2.0224429
Kurtosis: 21.158423
Mean: 51.4265
Median Absolute Deviation (MAD): 15
Skewness: 4.2894892
Sum: 514265
Variance: 10817.49
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values
Value                Count  Frequency (%)
1                      450  4.5%
2                      381  3.8%
3                      315  3.1%
5                      306  3.1%
6                      290  2.9%
4                      289  2.9%
7                      275  2.8%
0                      263  2.6%
9                      261  2.6%
10                     246  2.5%
Other values (568)    6924  69.2%

Minimum 10 values
Value                Count  Frequency (%)
0                      263  2.6%
1                      450  4.5%
2                      381  3.8%
3                      315  3.1%
4                      289  2.9%
5                      306  3.1%
6                      290  2.9%
7                      275  2.8%
8                      240  2.4%
9                      261  2.6%

Maximum 10 values
Value                Count  Frequency (%)
896                      1  < 0.1%
891                      1  < 0.1%
889                      1  < 0.1%
870                      1  < 0.1%
868                      1  < 0.1%
860                      1  < 0.1%
859                      1  < 0.1%
858                      1  < 0.1%
848                      1  < 0.1%
844                      1  < 0.1%

결정지가 (assessed land price)
Real number (ℝ)

Distinct: 2561
Distinct (%): 25.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1772735.6
Minimum: 1580
Maximum: 18590000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1580
5-th percentile: 437200
Q1: 1067000
Median: 1525000
Q3: 2046250
95-th percentile: 3935050
Maximum: 18590000
Range: 18588420
Interquartile range (IQR): 979250

Descriptive statistics

Standard deviation: 1269576.8
Coefficient of variation (CV): 0.71616815
Kurtosis: 27.940658
Mean: 1772735.6
Median Absolute Deviation (MAD): 494000
Skewness: 3.6247608
Sum: 1.7727356 × 10^10
Variance: 1.6118252 × 10^12
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values
Value                Count  Frequency (%)
2019000                712  7.1%
1525000                252  2.5%
777100                  73  0.7%
750700                  65  0.7%
587400                  48  0.5%
2351000                 44  0.4%
2753000                 43  0.4%
1961000                 42  0.4%
862900                  39  0.4%
1316000                 36  0.4%
Other values (2551)   8646  86.5%

Minimum 10 values
Value                Count  Frequency (%)
1580                     3  < 0.1%
1940                     3  < 0.1%
2090                     1  < 0.1%
2100                     1  < 0.1%
2150                     1  < 0.1%
3000                     1  < 0.1%
3300                    13  0.1%
3500                     1  < 0.1%
4550                     1  < 0.1%
4800                     1  < 0.1%

Maximum 10 values
Value                Count  Frequency (%)
18590000                 3  < 0.1%
16900000                 2  < 0.1%
16390000                 1  < 0.1%
15850000                 1  < 0.1%
15700000                 2  < 0.1%
14850000                 1  < 0.1%
11250000                 4  < 0.1%
10950000                 1  < 0.1%
10750000                 1  < 0.1%
10730000                 1  < 0.1%

Interactions

(16 interaction plots; images not preserved in this text version)

Correlations

            연번    법정동  구분    본번    부번    결정지가
연번        1.000   0.997   0.374   0.958   0.479   0.391
법정동      0.997   1.000   0.091   0.601   0.120   0.216
구분        0.374   0.091   1.000   0.542   0.057   0.099
본번        0.958   0.601   0.542   1.000   0.500   0.373
부번        0.479   0.120   0.057   0.500   1.000   0.159
결정지가    0.391   0.216   0.099   0.373   0.159   1.000
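
The High correlation alerts name 연번 together with 본번 and 법정동. Scanning the matrix above for coefficients over a cutoff yields the same pairs. A minimal sketch, where the 0.9 cutoff is an assumption picked because it reproduces those alerts (the report does not state the threshold it used) and `high_pairs` is a hypothetical helper:

```python
# First correlation matrix, copied from this report.
columns = ["연번", "법정동", "구분", "본번", "부번", "결정지가"]
matrix = [
    [1.000, 0.997, 0.374, 0.958, 0.479, 0.391],
    [0.997, 1.000, 0.091, 0.601, 0.120, 0.216],
    [0.374, 0.091, 1.000, 0.542, 0.057, 0.099],
    [0.958, 0.601, 0.542, 1.000, 0.500, 0.373],
    [0.479, 0.120, 0.057, 0.500, 1.000, 0.159],
    [0.391, 0.216, 0.099, 0.373, 0.159, 1.000],
]

THRESHOLD = 0.9  # assumed cutoff, not taken from the report

def high_pairs(columns, matrix, threshold):
    """List each unordered column pair whose |coefficient| >= threshold."""
    pairs = []
    for i in range(len(columns)):
        for j in range(i + 1, len(columns)):   # upper triangle only
            if abs(matrix[i][j]) >= threshold:
                pairs.append((columns[i], columns[j], matrix[i][j]))
    return pairs

for a, b, r in high_pairs(columns, matrix, THRESHOLD):
    print(f"{a} ~ {b}: {r:.3f}")
```

With this cutoff only 연번 ~ 법정동 (0.997) and 연번 ~ 본번 (0.958) are listed; the next largest coefficient, 0.601 for 법정동 ~ 본번, falls well below it.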

          법정동  구분
법정동    1.000   0.058
구분      0.058   1.000

            연번    본번    부번    결정지가  법정동  구분
연번        1.000   0.695   0.073   -0.105    0.955   0.287
본번        0.695   1.000   0.065   -0.052    0.465   0.418
부번        0.073   0.065   1.000   -0.152    0.092   0.044
결정지가    -0.105  -0.052  -0.152  1.000     0.166   0.077
법정동      0.955   0.465   0.092   0.166     1.000   0.058
구분        0.287   0.418   0.044   0.077     0.058   1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

(index)  연번   법정동  구분  본번/부번/결정지가 (digits run together)
4856     4857   거제동  일반  79062019000
11710    11711  연산동  일반  412211657000
9776     9777   연산동  일반  31772695000
25306    25307  연산동  일반  22020748000
13481    13482  연산동  일반  615511245000
11508    11509  연산동  일반  405123921000
20235    20236  연산동  일반  158444694600
20990    20991  연산동  일반  18112141992000
15537    15538  연산동  일반  70873335000
16476    16477  연산동  일반  794111265000

(index)  연번   법정동  구분  본번/부번/결정지가 (digits run together)
18775    18776  연산동  일반  128973503000
4615     4616   거제동  일반  76925419100
23828    23829  연산동  일반  20224731525000
21642    21643  연산동  일반  181688783200
4547     4548   거제동  일반  766126725900
15300    15301  연산동  일반  683121415000
9303     9304   연산동  일반  29811462000
17743    17744  연산동  일반  11208767200
18275    18276  연산동  일반  120821499000
22300    22301  연산동  일반  18761221148000