Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 10000 |
Missing cells | 10000 |
Missing cells (%) | 11.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 839.8 KiB |
Average record size in memory | 86.0 B |
Variable types
Numeric | 5 |
---|---|
Boolean | 1 |
Categorical | 2 |
Unsupported | 1 |
Dataset
Description | 부산광역시_연제구_개별공시지가정보_20200916 |
---|---|
Author | 부산광역시 연제구 |
URL | http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15039887 |
No is highly overall correlated with 행정동 and 2 other fields | High correlation |
행정동 is highly overall correlated with No and 1 other fields | High correlation |
본번 is highly overall correlated with No | High correlation |
법정동 is highly overall correlated with No and 1 other fields | High correlation |
표준지여부 is highly imbalanced (81.8%) | Imbalance |
구분 is highly imbalanced (85.0%) | Imbalance |
Unnamed: 8 has 10000 (100.0%) missing values | Missing |
No has unique values | Unique |
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
부번 has 241 (2.4%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 16:11:30.490056 |
---|---|
Analysis finished | 2023-12-10 16:11:33.697699 |
Duration | 3.21 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
No
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13180.83 |
Minimum | 1 |
---|---|
Maximum | 26405 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1302.9 |
Q1 | 6537.25 |
median | 13188.5 |
Q3 | 19860.75 |
95-th percentile | 25118.05 |
Maximum | 26405 |
Range | 26404 |
Interquartile range (IQR) | 13323.5 |
Descriptive statistics
Standard deviation | 7653.0807 |
---|---|
Coefficient of variation (CV) | 0.58062207 |
Kurtosis | -1.2067778 |
Mean | 13180.83 |
Median Absolute Deviation (MAD) | 6662 |
Skewness | 0.0036694457 |
Sum | 1.318083 × 108 |
Variance | 58569644 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10176 | 1 | < 0.1% |
12889 | 1 | < 0.1% |
21360 | 1 | < 0.1% |
3771 | 1 | < 0.1% |
3700 | 1 | < 0.1% |
16968 | 1 | < 0.1% |
4598 | 1 | < 0.1% |
22032 | 1 | < 0.1% |
7386 | 1 | < 0.1% |
22422 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
4 | 1 | |
6 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 | |
14 | 1 | |
18 | 1 | |
21 | 1 |
Value | Count | Frequency (%) |
26405 | 1 | |
26401 | 1 | |
26399 | 1 | |
26396 | 1 | |
26393 | 1 | |
26392 | 1 | |
26391 | 1 | |
26389 | 1 | |
26388 | 1 | |
26384 | 1 |
표준지여부
Boolean
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 87.9 KiB |
False | |
---|---|
True | 276 |
Value | Count | Frequency (%) |
False | 9724 | |
True | 276 | 2.8% |
법정동
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
연산동 | |
---|---|
거제동 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 연산동 |
---|---|
2nd row | 연산동 |
3rd row | 거제동 |
4th row | 연산동 |
5th row | 거제동 |
Common Values
Value | Count | Frequency (%) |
연산동 | 6812 | |
거제동 | 3188 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
연산동 | 6812 | |
거제동 | 3188 |
행정동
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 12 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 668.357 |
Minimum | 610 |
---|---|
Maximum | 730 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 610 |
---|---|
5-th percentile | 610 |
Q1 | 630 |
median | 670 |
Q3 | 700 |
95-th percentile | 730 |
Maximum | 730 |
Range | 120 |
Interquartile range (IQR) | 70 |
Descriptive statistics
Standard deviation | 37.864415 |
---|---|
Coefficient of variation (CV) | 0.056652979 |
Kurtosis | -1.1491839 |
Mean | 668.357 |
Median Absolute Deviation (MAD) | 30 |
Skewness | 0.06785284 |
Sum | 6683570 |
Variance | 1433.7139 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
700 | 1148 | |
620 | 1051 | |
730 | 1001 | |
660 | 929 | |
680 | 888 | |
670 | 862 | |
610 | 814 | |
630 | 742 | |
720 | 706 | |
690 | 706 | |
Other values (2) | 1153 |
Value | Count | Frequency (%) |
610 | 814 | |
620 | 1051 | |
630 | 742 | |
640 | 581 | |
650 | 572 | |
660 | 929 | |
670 | 862 | |
680 | 888 | |
690 | 706 | |
700 | 1148 |
Value | Count | Frequency (%) |
730 | 1001 | |
720 | 706 | |
700 | 1148 | |
690 | 706 | |
680 | 888 | |
670 | 862 | |
660 | 929 | |
650 | 572 | |
640 | 581 | |
630 | 742 |
구분
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
일반 | |
---|---|
산 | 216 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.9784 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 일반 |
---|---|
2nd row | 일반 |
3rd row | 일반 |
4th row | 일반 |
5th row | 일반 |
Common Values
Value | Count | Frequency (%) |
일반 | 9784 | |
산 | 216 | 2.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
일반 | 9784 | |
산 | 216 | 2.2% |
본번
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 1310 |
---|---|
Distinct (%) | 13.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 950.7118 |
Minimum | 1 |
---|---|
Maximum | 2360 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 69 |
Q1 | 467.75 |
median | 785 |
Q3 | 1429 |
95-th percentile | 2063 |
Maximum | 2360 |
Range | 2359 |
Interquartile range (IQR) | 961.25 |
Descriptive statistics
Standard deviation | 623.99352 |
---|---|
Coefficient of variation (CV) | 0.65634351 |
Kurtosis | -0.90212155 |
Mean | 950.7118 |
Median Absolute Deviation (MAD) | 428 |
Skewness | 0.48947771 |
Sum | 9507118 |
Variance | 389367.92 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1811 | 308 | 3.1% |
2022 | 195 | 1.9% |
676 | 145 | 1.5% |
643 | 88 | 0.9% |
1941 | 80 | 0.8% |
1876 | 79 | 0.8% |
649 | 62 | 0.6% |
766 | 60 | 0.6% |
1824 | 59 | 0.6% |
815 | 59 | 0.6% |
Other values (1300) | 8865 |
Value | Count | Frequency (%) |
1 | 37 | |
2 | 30 | |
3 | 3 | < 0.1% |
4 | 2 | < 0.1% |
5 | 1 | < 0.1% |
6 | 1 | < 0.1% |
7 | 2 | < 0.1% |
9 | 1 | < 0.1% |
10 | 15 | |
11 | 8 | 0.1% |
Value | Count | Frequency (%) |
2360 | 1 | |
2359 | 1 | |
2356 | 1 | |
2355 | 1 | |
2351 | 1 | |
2350 | 1 | |
2342 | 1 | |
2338 | 1 | |
2336 | 1 | |
2334 | 1 |
부번
Real number (ℝ)
ZEROS
 
Distinct | 559 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 49.8932 |
Minimum | 0 |
---|---|
Maximum | 900 |
Zeros | 241 |
Zeros (%) | 2.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 7 |
median | 20 |
Q3 | 44 |
95-th percentile | 213 |
Maximum | 900 |
Range | 900 |
Interquartile range (IQR) | 37 |
Descriptive statistics
Standard deviation | 100.96593 |
---|---|
Coefficient of variation (CV) | 2.023641 |
Kurtosis | 22.86708 |
Mean | 49.8932 |
Median Absolute Deviation (MAD) | 15 |
Skewness | 4.4414324 |
Sum | 498932 |
Variance | 10194.118 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 433 | 4.3% |
2 | 404 | 4.0% |
3 | 352 | 3.5% |
6 | 307 | 3.1% |
4 | 305 | 3.0% |
5 | 277 | 2.8% |
7 | 257 | 2.6% |
0 | 241 | 2.4% |
9 | 232 | 2.3% |
8 | 225 | 2.2% |
Other values (549) | 6967 |
Value | Count | Frequency (%) |
0 | 241 | |
1 | 433 | |
2 | 404 | |
3 | 352 | |
4 | 305 | |
5 | 277 | |
6 | 307 | |
7 | 257 | |
8 | 225 | |
9 | 232 |
Value | Count | Frequency (%) |
900 | 1 | |
897 | 1 | |
896 | 1 | |
871 | 1 | |
870 | 1 | |
867 | 1 | |
864 | 1 | |
859 | 1 | |
856 | 1 | |
855 | 1 |
결정지가
Real number (ℝ)
Distinct | 2474 |
---|---|
Distinct (%) | 24.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1403703.9 |
Minimum | 1350 |
---|---|
Maximum | 14700000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1350 |
---|---|
5-th percentile | 360000 |
Q1 | 861000 |
median | 1202000 |
Q3 | 1654250 |
95-th percentile | 3227000 |
Maximum | 14700000 |
Range | 14698650 |
Interquartile range (IQR) | 793250 |
Descriptive statistics
Standard deviation | 1020326 |
---|---|
Coefficient of variation (CV) | 0.72688123 |
Kurtosis | 26.644404 |
Mean | 1403703.9 |
Median Absolute Deviation (MAD) | 383000 |
Skewness | 3.5302093 |
Sum | 1.4037039 × 1010 |
Variance | 1.0410652 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1240000 | 270 | 2.7% |
1400000 | 190 | 1.9% |
402600 | 83 | 0.8% |
653400 | 79 | 0.8% |
633600 | 72 | 0.7% |
504900 | 65 | 0.7% |
1600000 | 64 | 0.6% |
990000 | 56 | 0.6% |
3461000 | 51 | 0.5% |
534600 | 43 | 0.4% |
Other values (2464) | 9027 |
Value | Count | Frequency (%) |
1350 | 3 | < 0.1% |
1650 | 2 | < 0.1% |
1700 | 1 | < 0.1% |
1710 | 1 | < 0.1% |
1730 | 2 | < 0.1% |
1750 | 2 | < 0.1% |
3000 | 11 | |
3500 | 1 | < 0.1% |
3950 | 2 | < 0.1% |
3960 | 1 | < 0.1% |
Value | Count | Frequency (%) |
14700000 | 1 | < 0.1% |
14200000 | 3 | |
12920000 | 2 | |
12830000 | 1 | < 0.1% |
12800000 | 1 | < 0.1% |
12300000 | 1 | < 0.1% |
11930000 | 1 | < 0.1% |
11500000 | 1 | < 0.1% |
10580000 | 1 | < 0.1% |
9720000 | 1 | < 0.1% |
Unnamed: 8
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 10000 |
---|---|
Missing (%) | 100.0% |
Memory size | 166.0 KiB |
No | 표준지여부 | 법정동 | 행정동 | 구분 | 본번 | 부번 | 결정지가 | |
---|---|---|---|---|---|---|---|---|
No | 1.000 | 0.068 | 0.999 | 0.860 | 0.399 | 0.955 | 0.474 | 0.372 |
표준지여부 | 0.068 | 1.000 | 0.023 | 0.034 | 0.004 | 0.051 | 0.000 | 0.145 |
법정동 | 0.999 | 0.023 | 1.000 | 1.000 | 0.027 | 0.559 | 0.101 | 0.089 |
행정동 | 0.860 | 0.034 | 1.000 | 1.000 | 0.122 | 0.744 | 0.339 | 0.282 |
구분 | 0.399 | 0.004 | 0.027 | 0.122 | 1.000 | 0.557 | 0.058 | 0.122 |
본번 | 0.955 | 0.051 | 0.559 | 0.744 | 0.557 | 1.000 | 0.500 | 0.327 |
부번 | 0.474 | 0.000 | 0.101 | 0.339 | 0.058 | 0.500 | 1.000 | 0.180 |
결정지가 | 0.372 | 0.145 | 0.089 | 0.282 | 0.122 | 0.327 | 0.180 | 1.000 |
법정동 | 구분 | 표준지여부 | |
---|---|---|---|
법정동 | 1.000 | 0.017 | 0.015 |
구분 | 0.017 | 1.000 | 0.003 |
표준지여부 | 0.015 | 0.003 | 1.000 |
No | 행정동 | 본번 | 부번 | 결정지가 | 표준지여부 | 법정동 | 구분 | |
---|---|---|---|---|---|---|---|---|
No | 1.000 | 0.585 | 0.687 | 0.053 | -0.069 | 0.052 | 0.970 | 0.306 |
행정동 | 0.585 | 1.000 | 0.085 | 0.030 | -0.023 | 0.027 | 1.000 | 0.080 |
본번 | 0.687 | 0.085 | 1.000 | 0.047 | -0.059 | 0.039 | 0.432 | 0.430 |
부번 | 0.053 | 0.030 | 0.047 | 1.000 | -0.161 | 0.000 | 0.078 | 0.044 |
결정지가 | -0.069 | -0.023 | -0.059 | -0.161 | 1.000 | 0.111 | 0.068 | 0.095 |
표준지여부 | 0.052 | 0.027 | 0.039 | 0.000 | 0.111 | 1.000 | 0.015 | 0.003 |
법정동 | 0.970 | 1.000 | 0.432 | 0.078 | 0.068 | 0.015 | 1.000 | 0.017 |
구분 | 0.306 | 0.080 | 0.430 | 0.044 | 0.095 | 0.003 | 0.017 | 1.000 |
No | 표준지여부 | 법정동 | 행정동 | 구분 | 본번 | 부번 | 결정지가 | Unnamed: 8 | |
---|---|---|---|---|---|---|---|---|---|
10175 | 10176 | N | 연산동 | 720 | 일반 | 339 | 48 | 916500 | <NA> |
19830 | 19831 | N | 연산동 | 690 | 일반 | 1371 | 11 | 1940000 | <NA> |
4285 | 4286 | N | 거제동 | 630 | 일반 | 747 | 33 | 1907000 | <NA> |
10966 | 10967 | N | 연산동 | 720 | 일반 | 378 | 26 | 1019000 | <NA> |
2882 | 2883 | N | 거제동 | 630 | 일반 | 615 | 5 | 1029000 | <NA> |
6430 | 6431 | N | 거제동 | 620 | 일반 | 1013 | 3 | 1193000 | <NA> |
25268 | 25269 | N | 연산동 | 700 | 일반 | 2129 | 18 | 1294000 | <NA> |
18334 | 18335 | N | 연산동 | 680 | 일반 | 1136 | 11 | 1769000 | <NA> |
3272 | 3273 | N | 거제동 | 640 | 일반 | 649 | 119 | 1322000 | <NA> |
26212 | 26213 | N | 연산동 | 680 | 산 | 134 | 40 | 873800 | <NA> |
No | 표준지여부 | 법정동 | 행정동 | 구분 | 본번 | 부번 | 결정지가 | Unnamed: 8 | |
---|---|---|---|---|---|---|---|---|---|
7432 | 7433 | N | 거제동 | 620 | 일반 | 1312 | 4 | 889000 | <NA> |
20049 | 20050 | N | 연산동 | 660 | 일반 | 1461 | 1 | 3461000 | <NA> |
239 | 240 | N | 거제동 | 610 | 일반 | 18 | 33 | 633600 | <NA> |
1898 | 1899 | N | 거제동 | 610 | 일반 | 386 | 33 | 3524000 | <NA> |
23788 | 23789 | N | 연산동 | 700 | 일반 | 2018 | 27 | 1138000 | <NA> |
16863 | 16864 | N | 연산동 | 660 | 일반 | 822 | 100 | 965000 | <NA> |
24463 | 24464 | N | 연산동 | 660 | 일반 | 2027 | 10 | 1400000 | <NA> |
23808 | 23809 | N | 연산동 | 700 | 일반 | 2018 | 50 | 911800 | <NA> |
24654 | 24655 | N | 연산동 | 660 | 일반 | 2063 | 43 | 1710000 | <NA> |
25601 | 25602 | N | 연산동 | 700 | 일반 | 2139 | 22 | 1860000 | <NA> |