Dataset statistics
Number of variables | 18 |
---|---|
Number of observations | 200 |
Missing cells | 200 |
Missing cells (%) | 5.6% |
Duplicate rows | 4 |
Duplicate rows (%) | 2.0% |
Total size in memory | 29.6 KiB |
Average record size in memory | 151.7 B |
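The percentages in the table above follow directly from the counts. A minimal Python sketch of that arithmetic, assuming the missing-cell share is taken over all cells (observations × variables); the variable names are illustrative only:

```python
# Counts taken from the Dataset statistics table above.
n_variables = 18
n_observations = 200
missing_cells = 200
duplicate_rows = 4

# Assumption: the missing-cell share is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = round(100 * missing_cells / total_cells, 1)
duplicate_pct = round(100 * duplicate_rows / n_observations, 1)

print(f"Missing cells: {missing_pct}%")
print(f"Duplicate rows: {duplicate_pct}%")
```

Under that assumption the two results agree with the reported 5.6% and 2.0%.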
Variable types
Text | 6 |
---|---|
Categorical | 5 |
Numeric | 6 |
Unsupported | 1 |
Dataset
Description | Sample |
---|---|
Author | 오픈메이트 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=OPMAPTDEAL0000000019 |
Dataset has 4 (2.0%) duplicate rows | Duplicates |
BUILD_STDYM is highly overall correlated with HOUS_ID and 7 other fields | High correlation |
FLOOR is highly overall correlated with HOUS_ID and 4 other fields | High correlation |
CONT_CLSS is highly overall correlated with HOUS_ID and 5 other fields | High correlation |
BLD_CLSS is highly overall correlated with BLD_CD and 4 other fields | High correlation |
HOUS_ID is highly overall correlated with BLD_CD and 4 other fields | High correlation |
BLD_CD is highly overall correlated with HOUS_ID and 4 other fields | High correlation |
Y_AXIS is highly overall correlated with BUILD_STDYM and 2 other fields | High correlation |
BLK_CD is highly overall correlated with HOUS_ID and 2 other fields | High correlation |
CONT_DATE is highly overall correlated with BUILD_STDYM and 2 other fields | High correlation |
AMOUNT is highly overall correlated with BUILD_STDYM and 2 other fields | High correlation |
CONT_CLSS is highly imbalanced (95.5%) | Imbalance |
DEPOSIT is highly imbalanced (95.5%) | Imbalance |
RENT_AMOUNT has 200 (100.0%) missing values | Missing |
RENT_AMOUNT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
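The 95.5% imbalance figure for CONT_CLSS and DEPOSIT can be reproduced from their 199-to-1 class split. The sketch below assumes the score is one minus the Shannon entropy normalized by log2 of the class count; that definition is an assumption that happens to match the reported value, and `imbalance_score` is a hypothetical helper name:

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the base-2 Shannon entropy of the class counts."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no defined balance; treated as 0 here
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Class counts for CONT_CLSS (and DEPOSIT) as listed in the report: 199 vs 1.
score = imbalance_score([199, 1])
print(f"{score:.1%}")
```

A perfectly balanced column scores 0 under this definition, and a column dominated by one value approaches 1.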
Reproduction
Analysis started | 2023-12-10 06:34:45.250860 |
---|---|
Analysis finished | 2023-12-10 06:34:59.978712 |
Duration | 14.73 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
ADDRESS
Text
Distinct | 67 |
---|---|
Distinct (%) | 33.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Length
Max length | 38 |
---|---|
Median length | 35 |
Mean length | 26.67 |
Min length | 20 |
Characters and Unicode
Total characters | 5334 |
---|---|
Distinct characters | 159 |
Distinct categories | 8 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 44 |
---|---|
Unique (%) | 22.0% |
Sample
1st row | 대구광역시 동구 입석동 893-29 센트로파크A동 |
---|---|
2nd row | 인천광역시 서구 왕길동 660-2 타워팰리스2 |
3rd row | 인천광역시 서구 왕길동 660-2 타워팰리스2 |
4th row | 인천광역시 서구 왕길동 660-2 타워팰리스2 |
5th row | 인천광역시 서구 왕길동 660-2 타워팰리스2 |
Value | Count | Frequency (%) |
서울특별시 | 170 | |
동대문구 | 167 | |
장안동 | 110 | 11.3% |
청량리동 | 43 | 4.4% |
453-8 | 35 | 3.6% |
서도휴빌3차 | 35 | 3.6% |
409-1 | 26 | 2.7% |
홀가하우스 | 26 | 2.7% |
235-1 | 13 | 1.3% |
미주 | 13 | 1.3% |
Other values (143) | 339 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 777 | 14.6% |
동 | 379 | 7.1% |
서 | 212 | 4.0% |
구 | 201 | 3.8% |
시 | 200 | 3.7% |
대 | 177 | 3.3% |
특 | 172 | 3.2% |
별 | 172 | 3.2% |
- | 171 | 3.2% |
울 | 171 | 3.2% |
Other values (149) | 2702 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3354 | |
Decimal Number | 949 | 17.8% |
Space Separator | 777 | 14.6% |
Dash Punctuation | 171 | 3.2% |
Close Punctuation | 37 | 0.7% |
Open Punctuation | 37 | 0.7% |
Uppercase Letter | 8 | 0.1% |
Other Punctuation | 1 | < 0.1% |
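The category names in this table correspond to Unicode general categories (for example "Other Letter" is `Lo` and "Decimal Number" is `Nd`). A minimal sketch, using only the Python standard library, of how such counts could be obtained for one value; the mapping to long names is written out by hand here and is not taken from the profiling tool:

```python
import unicodedata
from collections import Counter

# First sample row of the ADDRESS column, as shown in the report.
sample = "대구광역시 동구 입석동 893-29 센트로파크A동"

# unicodedata.category returns two-letter codes such as "Lo" or "Nd".
category_counts = Counter(unicodedata.category(ch) for ch in sample)

# Hand-written mapping from codes to the long names used in the table above.
long_names = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Lu": "Uppercase Letter",
}

for code, count in category_counts.most_common():
    print(long_names.get(code, code), count)
```

Summing such counters over all 200 rows would give a table of the same shape as the one above.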
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 379 | 11.3% |
서 | 212 | 6.3% |
구 | 201 | 6.0% |
시 | 200 | 6.0% |
대 | 177 | 5.3% |
특 | 172 | 5.1% |
별 | 172 | 5.1% |
울 | 171 | 5.1% |
문 | 167 | 5.0% |
장 | 139 | 4.1% |
Other values (130) | 1364 |
Decimal Number
Value | Count | Frequency (%) |
4 | 156 | |
5 | 137 | |
3 | 129 | |
1 | 109 | |
6 | 96 | |
2 | 94 | |
0 | 79 | |
8 | 60 | 6.3% |
9 | 58 | 6.1% |
7 | 31 | 3.3% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 3 | |
A | 2 | |
B | 2 | |
J | 1 | 12.5% |
Space Separator
Value | Count | Frequency (%) |
(space) | 777 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 171 |
Close Punctuation
Value | Count | Frequency (%) |
) | 37 |
Open Punctuation
Value | Count | Frequency (%) |
( | 37 |
Other Punctuation
Value | Count | Frequency (%) |
. | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 3354 | |
Common | 1972 | |
Latin | 8 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 379 | 11.3% |
서 | 212 | 6.3% |
구 | 201 | 6.0% |
시 | 200 | 6.0% |
대 | 177 | 5.3% |
특 | 172 | 5.1% |
별 | 172 | 5.1% |
울 | 171 | 5.1% |
문 | 167 | 5.0% |
장 | 139 | 4.1% |
Other values (130) | 1364 |
Common
Value | Count | Frequency (%) |
(space) | 777 | 39.4% |
- | 171 | 8.7% |
4 | 156 | 7.9% |
5 | 137 | 6.9% |
3 | 129 | 6.5% |
1 | 109 | 5.5% |
6 | 96 | 4.9% |
2 | 94 | 4.8% |
0 | 79 | 4.0% |
8 | 60 | 3.0% |
Other values (5) | 164 | 8.3% |
Latin
Value | Count | Frequency (%) |
S | 3 | |
A | 2 | |
B | 2 | |
J | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 3354 | |
ASCII | 1980 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 777 | 39.2% |
- | 171 | 8.6% |
4 | 156 | 7.9% |
5 | 137 | 6.9% |
3 | 129 | 6.5% |
1 | 109 | 5.5% |
6 | 96 | 4.8% |
2 | 94 | 4.7% |
0 | 79 | 4.0% |
8 | 60 | 3.0% |
Other values (9) | 172 | 8.7% |
Hangul
Value | Count | Frequency (%) |
동 | 379 | 11.3% |
서 | 212 | 6.3% |
구 | 201 | 6.0% |
시 | 200 | 6.0% |
대 | 177 | 5.3% |
특 | 172 | 5.1% |
별 | 172 | 5.1% |
울 | 171 | 5.1% |
문 | 167 | 5.0% |
장 | 139 | 4.1% |
Other values (130) | 1364 |
APT_NM
Text
Distinct | 67 |
---|---|
Distinct (%) | 33.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Value | Count | Frequency (%) |
서도휴빌3차 | 35 | |
홀가하우스 | 26 | 12.7% |
미주 | 13 | 6.3% |
장안푸르미에 | 11 | 5.4% |
괴정엔스타 | 11 | 5.4% |
제일풍경채에듀파크2단지 | 10 | 4.9% |
한신 | 10 | 4.9% |
신부파스칼텔 | 5 | 2.4% |
타워팰리스2 | 4 | 2.0% |
신부파스카(563 | 4 | 2.0% |
Other values (60) | 76 |
Most occurring characters
Value | Count | Frequency (%) |
스 | 69 | 5.5% |
빌 | 59 | 4.7% |
3 | 48 | 3.8% |
차 | 43 | 3.4% |
( | 37 | 2.9% |
) | 36 | 2.8% |
서 | 35 | 2.8% |
도 | 35 | 2.8% |
휴 | 35 | 2.8% |
하 | 30 | 2.4% |
Other values (124) | 839 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 964 | |
Decimal Number | 190 | 15.0% |
Open Punctuation | 37 | 2.9% |
Close Punctuation | 36 | 2.8% |
Dash Punctuation | 25 | 2.0% |
Uppercase Letter | 8 | 0.6% |
Space Separator | 5 | 0.4% |
Other Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
스 | 69 | 7.2% |
빌 | 59 | 6.1% |
차 | 43 | 4.5% |
서 | 35 | 3.6% |
도 | 35 | 3.6% |
휴 | 35 | 3.6% |
하 | 30 | 3.1% |
가 | 29 | 3.0% |
우 | 28 | 2.9% |
파 | 26 | 2.7% |
Other values (105) | 575 |
Decimal Number
Value | Count | Frequency (%) |
3 | 48 | |
2 | 29 | |
4 | 22 | |
0 | 20 | |
1 | 19 | 10.0% |
5 | 18 | 9.5% |
6 | 15 | 7.9% |
8 | 8 | 4.2% |
9 | 7 | 3.7% |
7 | 4 | 2.1% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 3 | |
A | 2 | |
B | 2 | |
J | 1 | 12.5% |
Open Punctuation
Value | Count | Frequency (%) |
( | 37 |
Close Punctuation
Value | Count | Frequency (%) |
) | 36 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 25 |
Space Separator
Value | Count | Frequency (%) |
(space) | 5 |
Other Punctuation
Value | Count | Frequency (%) |
. | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 964 | |
Common | 294 | 23.2% |
Latin | 8 | 0.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
스 | 69 | 7.2% |
빌 | 59 | 6.1% |
차 | 43 | 4.5% |
서 | 35 | 3.6% |
도 | 35 | 3.6% |
휴 | 35 | 3.6% |
하 | 30 | 3.1% |
가 | 29 | 3.0% |
우 | 28 | 2.9% |
파 | 26 | 2.7% |
Other values (105) | 575 |
Common
Value | Count | Frequency (%) |
3 | 48 | |
( | 37 | |
) | 36 | |
2 | 29 | |
- | 25 | |
4 | 22 | |
0 | 20 | |
1 | 19 | 6.5% |
5 | 18 | 6.1% |
6 | 15 | 5.1% |
Other values (5) | 25 |
Latin
Value | Count | Frequency (%) |
S | 3 | |
A | 2 | |
B | 2 | |
J | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 964 | |
ASCII | 302 | 23.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
스 | 69 | 7.2% |
빌 | 59 | 6.1% |
차 | 43 | 4.5% |
서 | 35 | 3.6% |
도 | 35 | 3.6% |
휴 | 35 | 3.6% |
하 | 30 | 3.1% |
가 | 29 | 3.0% |
우 | 28 | 2.9% |
파 | 26 | 2.7% |
Other values (105) | 575 |
ASCII
Value | Count | Frequency (%) |
3 | 48 | |
( | 37 | |
) | 36 | |
2 | 29 | |
- | 25 | |
4 | 22 | |
0 | 20 | |
1 | 19 | 6.3% |
5 | 18 | 6.0% |
6 | 15 | 5.0% |
Other values (9) | 33 |
BUILD_STDYM
Categorical
HIGH CORRELATION
Distinct | 28 |
---|---|
Distinct (%) | 14.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
2019 | 61 |
---|---|
2015 | 27 |
2013 | 13 |
1978 | 13 |
2011 | 12 |
Other values (23) | 74 |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 4.01 |
Min length | 4 |
Unique
Unique | 11 |
---|---|
Unique (%) | 5.5% |
Sample
1st row | 2015 |
---|---|
2nd row | 2015 |
3rd row | 2015 |
4th row | 2015 |
5th row | 2015 |
Common Values
Value | Count | Frequency (%) |
2019 | 61 | 30.5% |
2015 | 27 | 13.5% |
2013 | 13 | 6.5% |
1978 | 13 | 6.5% |
2011 | 12 | 6.0% |
2005 | 11 | 5.5% |
1997 | 10 | 5.0% |
2003 | 8 | 4.0% |
2002 | 6 | 3.0% |
2014 | 5 | 2.5% |
Other values (18) | 34 | 17.0% |
Length
Value | Count | Frequency (%) |
2019 | 61 | 30.5% |
2015 | 27 | 13.5% |
2013 | 13 | 6.5% |
1978 | 13 | 6.5% |
2011 | 12 | 6.0% |
2005 | 11 | 5.5% |
1997 | 10 | 5.0% |
2003 | 8 | 4.0% |
2002 | 6 | 3.0% |
2014 | 5 | 2.5% |
Other values (18) | 34 | 17.0% |
HOUS_ID
Real number (ℝ)
HIGH CORRELATION
Distinct | 67 |
---|---|
Distinct (%) | 33.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.3747056 × 10^18 |
Minimum | 2014 |
---|---|
Maximum | 3.6110107 × 10^18 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 2014 |
---|---|
5-th percentile | 1.1230106 × 10^18 |
Q1 | 1.1230106 × 10^18 |
median | 1.1230106 × 10^18 |
Q3 | 1.1230107 × 10^18 |
95-th percentile | 2.915511 × 10^18 |
Maximum | 3.6110107 × 10^18 |
Range | 3.6110107 × 10^18 |
Interquartile range (IQR) | 9.999804 × 10^10 |
Descriptive statistics
Standard deviation | 6.272076 × 10^17 |
---|---|
Coefficient of variation (CV) | 0.45624867 |
Kurtosis | 2.592567 |
Mean | 1.3747056 × 10^18 |
Median Absolute Deviation (MAD) | 569984 |
Skewness | 1.9870784 |
Sum | -1.7600424 × 10^18 |
Variance | 3.9338937 × 10^35 |
Monotonicity | Not monotonic |
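The Sum reported for HOUS_ID is negative (about -1.76 × 10^18) even though the column has no negative values. One plausible explanation, not stated in the report, is signed 64-bit integer overflow: 200 values averaging about 1.37 × 10^18 add up to far more than the int64 maximum. The sketch below simulates that wraparound in plain Python; `wrap_int64` is a hypothetical helper, and the total is approximate because the reported mean is rounded:

```python
# Figures from the HOUS_ID statistics; the mean is rounded, so the total is approximate.
n_observations = 200
approx_mean = 13_747_056 * 10**11  # 1.3747056 × 10^18 written as an exact integer

def wrap_int64(value):
    """Reduce an integer to the signed 64-bit range, as two's-complement overflow would."""
    return (value + 2**63) % 2**64 - 2**63

true_total = approx_mean * n_observations   # Python integers do not overflow
wrapped_total = wrap_int64(true_total)      # what a 64-bit accumulator would hold

print(f"true total    ~ {true_total:.7e}")
print(f"wrapped total ~ {wrapped_total:.7e}")
```

If this is what happened, the Sum field for HOUS_ID (an identifier, not a quantity) should simply be disregarded.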
Value | Count | Frequency (%) |
1123010600004530008 | 35 | 17.5% |
1123010600004090001 | 26 | 13.0% |
1123010700002350001 | 13 | 6.5% |
2638010100012750000 | 11 | 5.5% |
1123010600004660005 | 11 | 5.5% |
2915511000005620000 | 10 | 5.0% |
1123010700000600000 | 10 | 5.0% |
1123010600004310005 | 5 | 2.5% |
1123010600005630000 | 4 | 2.0% |
2826012000006600002 | 4 | 2.0% |
Other values (57) | 71 | 35.5% |
Value | Count | Frequency (%) |
2014 | 1 | 0.5% |
1123010600003940007 | 1 | 0.5% |
1123010600003970002 | 1 | 0.5% |
1123010600004000001 | 1 | 0.5% |
1123010600004050016 | 1 | 0.5% |
1123010600004060001 | 2 | 1.0% |
1123010600004060002 | 1 | 0.5% |
1123010600004090001 | 26 | 13.0% |
1123010600004100013 | 1 | 0.5% |
1123010600004160002 | 2 | 1.0% |
Value | Count | Frequency (%) |
3611010700007130000 | 1 | 0.5% |
3611010600009470000 | 1 | 0.5% |
2915511000005620000 | 10 | 5.0% |
2826012000006600002 | 4 | 2.0% |
2826010300005250010 | 2 | 1.0% |
2714010900008930029 | 1 | 0.5% |
2638010100012750000 | 11 | 5.5% |
1153010900000970009 | 2 | 1.0% |
1150010500007930006 | 1 | 0.5% |
1123010800000650000 | 1 | 0.5% |
BLD_CD
Real number (ℝ)
HIGH CORRELATION
Distinct | 63 |
---|---|
Distinct (%) | 31.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.3790157 × 10^24 |
Minimum | 1.1230106 × 10^18 |
---|---|
Maximum | 4.473033 × 10^24 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 1.1230106 × 10^18 |
---|---|
5-th percentile | 1.1230106 × 10^24 |
Q1 | 1.1230106 × 10^24 |
median | 1.1230106 × 10^24 |
Q3 | 1.1230107 × 10^24 |
95-th percentile | 2.915511 × 10^24 |
Maximum | 4.473033 × 10^24 |
Range | 4.4730319 × 10^24 |
Interquartile range (IQR) | 9.999804 × 10^16 |
Descriptive statistics
Standard deviation | 6.4535191 × 10^23 |
---|---|
Coefficient of variation (CV) | 0.4679801 |
Kurtosis | 3.9304275 |
Mean | 1.3790157 × 10^24 |
Median Absolute Deviation (MAD) | 5.7002269 × 10^11 |
Skewness | 2.1543943 |
Sum | 2.7580314 × 10^26 |
Variance | 4.1647909 × 10^47 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1.12301060010453e+24 | 35 | 17.5% |
1.12301060010409e+24 | 26 | 13.0% |
1.12301070010235e+24 | 13 | 6.5% |
1.12301060010466e+24 | 12 | 6.0% |
2.63801010011016e+24 | 11 | 5.5% |
1.1230107001006e+24 | 10 | 5.0% |
2.91551100010562e+24 | 10 | 5.0% |
1.12301060010431e+24 | 7 | 3.5% |
1.12301060010563e+24 | 4 | 2.0% |
2.8260120001066e+24 | 4 | 2.0% |
Other values (53) | 68 | 34.0% |
Value | Count | Frequency (%) |
1.12301060000394e+18 | 1 | 0.5% |
1.12301060010192e+24 | 1 | 0.5% |
1.1230106001034e+24 | 1 | 0.5% |
1.12301060010394e+24 | 1 | 0.5% |
1.12301060010397e+24 | 1 | 0.5% |
1.123010600104e+24 | 1 | 0.5% |
1.12301060010405e+24 | 1 | 0.5% |
1.12301060010406e+24 | 3 | 1.5% |
1.12301060010409e+24 | 26 | 13.0% |
1.1230106001041e+24 | 1 | 0.5% |
Value | Count | Frequency (%) |
4.4730330331054305e+24 | 1 | 0.5% |
3.61101070010488e+24 | 1 | 0.5% |
2.91551100010562e+24 | 10 | 5.0% |
2.8260120001066e+24 | 4 | 2.0% |
2.82601030010525e+24 | 2 | 1.0% |
2.71401090010893e+24 | 1 | 0.5% |
2.63801010011016e+24 | 11 | 5.5% |
1.15301090010097e+24 | 2 | 1.0% |
1.15001050010793e+24 | 1 | 0.5% |
1.12301080010065e+24 | 1 | 0.5% |
HOUS_ADDR
Text
Distinct | 67 |
---|---|
Distinct (%) | 33.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Length
Max length | 40 |
---|---|
Median length | 37 |
Mean length | 28.745 |
Min length | 23 |
Characters and Unicode
Total characters | 5749 |
---|---|
Distinct characters | 160 |
Distinct categories | 8 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 44 |
---|---|
Unique (%) | 22.0% |
Sample
1st row | 대구광역시 동구 입석동 893-29번지 센트로파크A동 |
---|---|
2nd row | 인천광역시 서구 왕길동 660-2번지 타워팰리스2 |
3rd row | 인천광역시 서구 왕길동 660-2번지 타워팰리스2 |
4th row | 인천광역시 서구 왕길동 660-2번지 타워팰리스2 |
5th row | 인천광역시 서구 왕길동 660-2번지 타워팰리스2 |
Value | Count | Frequency (%) |
서울특별시 | 169 | |
동대문구 | 166 | |
장안동 | 120 | 12.0% |
청량리동 | 43 | 4.3% |
453-8번지 | 35 | 3.5% |
서도휴빌3차 | 35 | 3.5% |
409-1번지 | 26 | 2.6% |
홀가하우스 | 26 | 2.6% |
235-1번지 | 13 | 1.3% |
미주 | 13 | 1.3% |
Other values (151) | 353 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 799 | 13.9% |
동 | 376 | 6.5% |
지 | 212 | 3.7% |
서 | 211 | 3.7% |
구 | 200 | 3.5% |
시 | 199 | 3.5% |
번 | 199 | 3.5% |
대 | 176 | 3.1% |
특 | 171 | 3.0% |
별 | 171 | 3.0% |
Other values (150) | 3035 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3733 | |
Decimal Number | 968 | 16.8% |
Space Separator | 799 | 13.9% |
Dash Punctuation | 168 | 2.9% |
Close Punctuation | 36 | 0.6% |
Open Punctuation | 36 | 0.6% |
Uppercase Letter | 8 | 0.1% |
Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 376 | 10.1% |
지 | 212 | 5.7% |
서 | 211 | 5.7% |
구 | 200 | 5.4% |
시 | 199 | 5.3% |
번 | 199 | 5.3% |
대 | 176 | 4.7% |
특 | 171 | 4.6% |
별 | 171 | 4.6% |
울 | 170 | 4.6% |
Other values (131) | 1648 |
Decimal Number
Value | Count | Frequency (%) |
4 | 153 | |
5 | 139 | |
3 | 130 | |
1 | 114 | |
6 | 99 | |
2 | 93 | |
0 | 88 | |
8 | 60 | 6.2% |
9 | 58 | 6.0% |
7 | 34 | 3.5% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 3 | |
A | 2 | |
B | 2 | |
J | 1 | 12.5% |
Space Separator
Value | Count | Frequency (%) |
(space) | 799 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 168 |
Close Punctuation
Value | Count | Frequency (%) |
) | 36 |
Open Punctuation
Value | Count | Frequency (%) |
( | 36 |
Other Punctuation
Value | Count | Frequency (%) |
. | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 3733 | |
Common | 2008 | |
Latin | 8 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 376 | 10.1% |
지 | 212 | 5.7% |
서 | 211 | 5.7% |
구 | 200 | 5.4% |
시 | 199 | 5.3% |
번 | 199 | 5.3% |
대 | 176 | 4.7% |
특 | 171 | 4.6% |
별 | 171 | 4.6% |
울 | 170 | 4.6% |
Other values (131) | 1648 |
Common
Value | Count | Frequency (%) |
(space) | 799 | 39.8% |
- | 168 | 8.4% |
4 | 153 | 7.6% |
5 | 139 | 6.9% |
3 | 130 | 6.5% |
1 | 114 | 5.7% |
6 | 99 | 4.9% |
2 | 93 | 4.6% |
0 | 88 | 4.4% |
8 | 60 | 3.0% |
Other values (5) | 165 | 8.2% |
Latin
Value | Count | Frequency (%) |
S | 3 | |
A | 2 | |
B | 2 | |
J | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 3733 | |
ASCII | 2016 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 799 | 39.6% |
- | 168 | 8.3% |
4 | 153 | 7.6% |
5 | 139 | 6.9% |
3 | 130 | 6.4% |
1 | 114 | 5.7% |
6 | 99 | 4.9% |
2 | 93 | 4.6% |
0 | 88 | 4.4% |
8 | 60 | 3.0% |
Other values (9) | 173 | 8.6% |
Hangul
Value | Count | Frequency (%) |
동 | 376 | 10.1% |
지 | 212 | 5.7% |
서 | 211 | 5.7% |
구 | 200 | 5.4% |
시 | 199 | 5.3% |
번 | 199 | 5.3% |
대 | 176 | 4.7% |
특 | 171 | 4.6% |
별 | 171 | 4.6% |
울 | 170 | 4.6% |
Other values (131) | 1648 |
ROAD_ADDR
Text
Distinct | 67 |
---|---|
Distinct (%) | 33.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Length
Max length | 42 |
---|---|
Median length | 34 |
Mean length | 27.085 |
Min length | 21 |
Characters and Unicode
Total characters | 5417 |
---|---|
Distinct characters | 164 |
Distinct categories | 8 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 44 |
---|---|
Unique (%) | 22.0% |
Sample
1st row | 대구광역시 동구 동촌역사로3길 28 센트로파크A동 |
---|---|
2nd row | 인천광역시 서구 완정로117번길 61 타워팰리스2 |
3rd row | 인천광역시 서구 완정로117번길 61 타워팰리스2 |
4th row | 인천광역시 서구 완정로117번길 61 타워팰리스2 |
5th row | 인천광역시 서구 완정로117번길 61 타워팰리스2 |
Value | Count | Frequency (%) |
서울특별시 | 170 | 16.9% |
동대문구 | 167 | 16.7% |
17 | 36 | 3.6% |
천호대로93길 | 35 | 3.5% |
서도휴빌3차 | 35 | 3.5% |
한천로6길 | 29 | 2.9% |
26 | 28 | 2.8% |
홀가하우스 | 26 | 2.6% |
약령시로 | 13 | 1.3% |
147 | 13 | 1.3% |
Other values (178) | 451 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 803 | 14.8% |
대 | 260 | 4.8% |
시 | 216 | 4.0% |
서 | 213 | 3.9% |
구 | 201 | 3.7% |
로 | 200 | 3.7% |
동 | 192 | 3.5% |
별 | 172 | 3.2% |
특 | 172 | 3.2% |
울 | 171 | 3.2% |
Other values (154) | 2817 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3576 | |
Decimal Number | 910 | 16.8% |
Space Separator | 803 | 14.8% |
Dash Punctuation | 45 | 0.8% |
Close Punctuation | 37 | 0.7% |
Open Punctuation | 37 | 0.7% |
Uppercase Letter | 8 | 0.1% |
Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 260 | 7.3% |
시 | 216 | 6.0% |
서 | 213 | 6.0% |
구 | 201 | 5.6% |
로 | 200 | 5.6% |
동 | 192 | 5.4% |
별 | 172 | 4.8% |
특 | 172 | 4.8% |
울 | 171 | 4.8% |
문 | 168 | 4.7% |
Other values (135) | 1611 |
Decimal Number
Value | Count | Frequency (%) |
3 | 160 | |
1 | 145 | |
2 | 127 | |
6 | 106 | |
7 | 88 | |
9 | 78 | |
4 | 69 | |
0 | 49 | 5.4% |
5 | 46 | 5.1% |
8 | 42 | 4.6% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 3 | |
A | 2 | |
B | 2 | |
J | 1 | 12.5% |
Space Separator
Value | Count | Frequency (%) |
(space) | 803 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 45 |
Close Punctuation
Value | Count | Frequency (%) |
) | 37 |
Open Punctuation
Value | Count | Frequency (%) |
( | 37 |
Other Punctuation
Value | Count | Frequency (%) |
. | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 3576 | |
Common | 1833 | |
Latin | 8 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 260 | 7.3% |
시 | 216 | 6.0% |
서 | 213 | 6.0% |
구 | 201 | 5.6% |
로 | 200 | 5.6% |
동 | 192 | 5.4% |
별 | 172 | 4.8% |
특 | 172 | 4.8% |
울 | 171 | 4.8% |
문 | 168 | 4.7% |
Other values (135) | 1611 |
Common
Value | Count | Frequency (%) |
(space) | 803 | 43.8% |
3 | 160 | 8.7% |
1 | 145 | 7.9% |
2 | 127 | 6.9% |
6 | 106 | 5.8% |
7 | 88 | 4.8% |
9 | 78 | 4.3% |
4 | 69 | 3.8% |
0 | 49 | 2.7% |
5 | 46 | 2.5% |
Other values (5) | 162 | 8.8% |
Latin
Value | Count | Frequency (%) |
S | 3 | |
A | 2 | |
B | 2 | |
J | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 3576 | |
ASCII | 1841 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 803 | 43.6% |
3 | 160 | 8.7% |
1 | 145 | 7.9% |
2 | 127 | 6.9% |
6 | 106 | 5.8% |
7 | 88 | 4.8% |
9 | 78 | 4.2% |
4 | 69 | 3.7% |
0 | 49 | 2.7% |
5 | 46 | 2.5% |
Other values (9) | 170 | 9.2% |
Hangul
Value | Count | Frequency (%) |
대 | 260 | 7.3% |
시 | 216 | 6.0% |
서 | 213 | 6.0% |
구 | 201 | 5.6% |
로 | 200 | 5.6% |
동 | 192 | 5.4% |
별 | 172 | 4.8% |
특 | 172 | 4.8% |
울 | 171 | 4.8% |
문 | 168 | 4.7% |
Other values (135) | 1611 |
X_AXIS
Text
Distinct | 66 |
---|---|
Distinct (%) | 33.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Value | Count | Frequency (%) |
317983 | 35 | |
317220 | 26 | 12.7% |
315994 | 13 | 6.4% |
318025 | 11 | 5.4% |
490886 | 11 | 5.4% |
298686 | 10 | 4.9% |
316242 | 10 | 4.9% |
317613 | 5 | 2.5% |
318032 | 4 | 2.0% |
281589 | 4 | 2.0% |
Other values (60) | 75 |
Most occurring characters
Value | Count | Frequency (%) |
3 | 235 | |
1 | 206 | |
8 | 130 | |
7 | 120 | |
2 | 120 | |
9 | 109 | |
6 | 85 | 6.9% |
0 | 76 | 6.2% |
5 | 60 | 4.9% |
4 | 57 | 4.7% |
Other values (22) | 26 | 2.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1198 | |
Other Letter | 20 | 1.6% |
Space Separator | 4 | 0.3% |
Open Punctuation | 1 | 0.1% |
Close Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 2 | 10.0% |
특 | 1 | 5.0% |
별 | 1 | 5.0% |
다 | 1 | 5.0% |
가 | 1 | 5.0% |
스 | 1 | 5.0% |
리 | 1 | 5.0% |
팰 | 1 | 5.0% |
광 | 1 | 5.0% |
길 | 1 | 5.0% |
Other values (9) | 9 |
Decimal Number
Value | Count | Frequency (%) |
3 | 235 | |
1 | 206 | |
8 | 130 | |
7 | 120 | |
2 | 120 | |
9 | 109 | |
6 | 85 | 7.1% |
0 | 76 | 6.3% |
5 | 60 | 5.0% |
4 | 57 | 4.8% |
Space Separator
Value | Count | Frequency (%) |
(space) | 4 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1204 | |
Hangul | 20 | 1.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 2 | 10.0% |
특 | 1 | 5.0% |
별 | 1 | 5.0% |
다 | 1 | 5.0% |
가 | 1 | 5.0% |
스 | 1 | 5.0% |
리 | 1 | 5.0% |
팰 | 1 | 5.0% |
광 | 1 | 5.0% |
길 | 1 | 5.0% |
Other values (9) | 9 |
Common
Value | Count | Frequency (%) |
3 | 235 | |
1 | 206 | |
8 | 130 | |
7 | 120 | |
2 | 120 | |
9 | 109 | |
6 | 85 | 7.1% |
0 | 76 | 6.3% |
5 | 60 | 5.0% |
4 | 57 | 4.7% |
Other values (3) | 6 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1204 | |
Hangul | 20 | 1.6% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3 | 235 | |
1 | 206 | |
8 | 130 | |
7 | 120 | |
2 | 120 | |
9 | 109 | |
6 | 85 | 7.1% |
0 | 76 | 6.3% |
5 | 60 | 5.0% |
4 | 57 | 4.7% |
Other values (3) | 6 | 0.5% |
Hangul
Value | Count | Frequency (%) |
동 | 2 | 10.0% |
특 | 1 | 5.0% |
별 | 1 | 5.0% |
다 | 1 | 5.0% |
가 | 1 | 5.0% |
스 | 1 | 5.0% |
리 | 1 | 5.0% |
팰 | 1 | 5.0% |
광 | 1 | 5.0% |
길 | 1 | 5.0% |
Other values (9) | 9 |
Y_AXIS
Real number (ℝ)
HIGH CORRELATION
Distinct | 66 |
---|---|
Distinct (%) | 33.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 520294.59 |
Minimum | 278376 |
---|---|
Maximum | 556276 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 278376 |
---|---|
5-th percentile | 278376 |
Q1 | 551533 |
median | 551666 |
Q3 | 553566.5 |
95-th percentile | 554438.35 |
Maximum | 556276 |
Range | 277900 |
Interquartile range (IQR) | 2033.5 |
Descriptive statistics
Standard deviation | 86492.844 |
---|---|
Coefficient of variation (CV) | 0.16623822 |
Kurtosis | 3.8325839 |
Mean | 520294.59 |
Median Absolute Deviation (MAD) | 213 |
Skewness | -2.3879549 |
Sum | 1.0405892 × 10^8 |
Variance | 7.4810121 × 10^9 |
Monotonicity | Not monotonic |
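Several of the descriptive statistics above are simple functions of one another, which gives a quick consistency check on the extracted numbers. A minimal sketch using the Y_AXIS figures; the variable names are illustrative:

```python
# Values copied from the Y_AXIS statistics above.
mean = 520294.59
std = 86492.844
q1, q3 = 551533, 553566.5
minimum, maximum = 278376, 556276

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV={cv:.8f}  variance={variance:.7e}  IQR={iqr}  range={value_range}")
```

The results agree with the reported CV, variance, IQR, and range up to the rounding of the inputs.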
Value | Count | Frequency (%) |
551533 | 35 | 17.5% |
551666 | 26 | 13.0% |
553952 | 13 | 6.5% |
551418 | 11 | 5.5% |
278376 | 11 | 5.5% |
278667 | 10 | 5.0% |
554387 | 10 | 5.0% |
551658 | 5 | 2.5% |
556276 | 4 | 2.0% |
554055 | 4 | 2.0% |
Other values (56) | 71 | 35.5% |
Value | Count | Frequency (%) |
278376 | 11 | 5.5% |
278667 | 10 | 5.0% |
317720 | 1 | 0.5% |
365766 | 1 | 0.5% |
431210 | 1 | 0.5% |
431625 | 1 | 0.5% |
544772 | 2 | 1.0% |
551403 | 1 | 0.5% |
551418 | 11 | 5.5% |
551480 | 2 | 1.0% |
Value | Count | Frequency (%) |
556276 | 4 | 2.0% |
555191 | 1 | 0.5% |
554980 | 1 | 0.5% |
554772 | 1 | 0.5% |
554722 | 1 | 0.5% |
554563 | 1 | 0.5% |
554464 | 1 | 0.5% |
554437 | 2 | 1.0% |
554410 | 1 | 0.5% |
554387 | 10 | 5.0% |
BLK_CD
Real number (ℝ)
HIGH CORRELATION
Distinct | 62 |
---|---|
Distinct (%) | 31.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 306560.5 |
Minimum | 17938 |
---|---|
Maximum | 552189 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 17938 |
---|---|
5-th percentile | 76013.95 |
Q1 | 168932.5 |
median | 346436 |
Q3 | 363173 |
95-th percentile | 516861 |
Maximum | 552189 |
Range | 534251 |
Interquartile range (IQR) | 194240.5 |
Descriptive statistics
Standard deviation | 145174.85 |
---|---|
Coefficient of variation (CV) | 0.47356018 |
Kurtosis | -0.75520399 |
Mean | 306560.5 |
Median Absolute Deviation (MAD) | 41311 |
Skewness | -0.69759706 |
Sum | 61312100 |
Variance | 2.1075736 × 10^10 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
79595 | 35 | 17.5% |
344817 | 26 | 13.0% |
363173 | 13 | 6.5% |
349020 | 12 | 6.0% |
486814 | 11 | 5.5% |
413133 | 10 | 5.0% |
516861 | 10 | 5.0% |
344807 | 7 | 3.5% |
412321 | 4 | 2.0% |
361986 | 4 | 2.0% |
Other values (52) | 68 | 34.0% |
Value | Count | Frequency (%) |
17938 | 2 | 1.0% |
35276 | 1 | 0.5% |
39606 | 1 | 0.5% |
39711 | 1 | 0.5% |
40607 | 2 | 1.0% |
50834 | 1 | 0.5% |
70256 | 2 | 1.0% |
76317 | 1 | 0.5% |
79595 | 35 | 17.5% |
79597 | 2 | 1.0% |
Value | Count | Frequency (%) |
552189 | 1 | 0.5% |
516861 | 10 | 5.0% |
501325 | 2 | 1.0% |
486814 | 11 | 5.5% |
449885 | 1 | 0.5% |
449433 | 1 | 0.5% |
414605 | 1 | 0.5% |
413825 | 1 | 0.5% |
413766 | 1 | 0.5% |
413221 | 1 | 0.5% |
FLOOR
Categorical
HIGH CORRELATION
Distinct | 25 |
---|---|
Distinct (%) | 12.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
4 | 34 |
---|---|
5 | 33 |
3 | 30 |
2 | 28 |
7 | 12 |
Other values (20) | 63 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.155 |
Min length | 1 |
Unique
Unique | 11 |
---|---|
Unique (%) | 5.5% |
Sample
1st row | 7 |
---|---|
2nd row | 7 |
3rd row | 7 |
4th row | 7 |
5th row | 6 |
Common Values
Value | Count | Frequency (%) |
4 | 34 | 17.0% |
5 | 33 | 16.5% |
3 | 30 | 15.0% |
2 | 28 | 14.0% |
7 | 12 | 6.0% |
1 | 10 | 5.0% |
6 | 10 | 5.0% |
8 | 6 | 3.0% |
9 | 6 | 3.0% |
10 | 6 | 3.0% |
Other values (15) | 25 | 12.5% |
Length
Value | Count | Frequency (%) |
4 | 34 | 17.0% |
5 | 33 | 16.5% |
3 | 30 | 15.0% |
2 | 28 | 14.0% |
7 | 12 | 6.0% |
1 | 11 | 5.5% |
6 | 10 | 5.0% |
8 | 6 | 3.0% |
9 | 6 | 3.0% |
10 | 6 | 3.0% |
Other values (14) | 24 | 12.0% |
CONT_CLSS
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
매매 | 199 |
---|---|
2 | 1 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.995 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.5% |
Sample
1st row | 매매 |
---|---|
2nd row | 매매 |
3rd row | 매매 |
4th row | 매매 |
5th row | 매매 |
Common Values
Value | Count | Frequency (%) |
매매 | 199 | 99.5% |
2 | 1 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
매매 | 199 | 99.5% |
2 | 1 | 0.5% |
AREA
Text
Distinct | 113 |
---|---|
Distinct (%) | 56.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Value | Count | Frequency (%) |
30.04 | 26 | 13.0% |
84.9184 | 10 | 5.0% |
12.9463 | 8 | 4.0% |
59.97 | 5 | 2.5% |
56.15 | 5 | 2.5% |
84.9226 | 5 | 2.5% |
56.41 | 5 | 2.5% |
84.99 | 4 | 2.0% |
84.92 | 4 | 2.0% |
101.62 | 3 | 1.5% |
Other values (103) | 125 |
Most occurring characters
Value | Count | Frequency (%) |
. | 197 | |
4 | 144 | |
3 | 96 | |
5 | 96 | |
9 | 89 | |
1 | 87 | |
2 | 85 | |
8 | 82 | |
0 | 75 | 7.1% |
6 | 66 | 6.2% |
Other values (2) | 40 | 3.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 858 | |
Other Punctuation | 197 | 18.6% |
Other Letter | 2 | 0.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
4 | 144 | |
3 | 96 | |
5 | 96 | |
9 | 89 | |
1 | 87 | |
2 | 85 | |
8 | 82 | |
0 | 75 | |
6 | 66 | |
7 | 38 | 4.4% |
Other Punctuation
Value | Count | Frequency (%) |
. | 197 |
Other Letter
Value | Count | Frequency (%) |
매 | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1055 | |
Hangul | 2 | 0.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 197 | |
4 | 144 | |
3 | 96 | |
5 | 96 | |
9 | 89 | |
1 | 87 | |
2 | 85 | |
8 | 82 | |
0 | 75 | 7.1% |
6 | 66 | 6.3% |
Hangul
Value | Count | Frequency (%) |
매 | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1055 | |
Hangul | 2 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 197 | |
4 | 144 | |
3 | 96 | |
5 | 96 | |
9 | 89 | |
1 | 87 | |
2 | 85 | |
8 | 82 | |
0 | 75 | 7.1% |
6 | 66 | 6.3% |
Hangul
Value | Count | Frequency (%) |
매 | 2 |
CONT_DATE
Real number (ℝ)
HIGH CORRELATION
Distinct | 105 |
---|---|
Distinct (%) | 52.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20089441 |
Minimum | 51.04 |
---|---|
Maximum | 20190629 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 51.04 |
---|---|
5-th percentile | 20190118 |
Q1 | 20190314 |
median | 20190419 |
Q3 | 20190513 |
95-th percentile | 20190617 |
Maximum | 20190629 |
Range | 20190578 |
Interquartile range (IQR) | 199.25 |
Descriptive statistics
Standard deviation | 1427672.8 |
---|---|
Coefficient of variation (CV) | 0.071065828 |
Kurtosis | 200 |
Mean | 20089441 |
Median Absolute Deviation (MAD) | 101.5 |
Skewness | -14.142135 |
Sum | 4.0178883 × 10^9 |
Variance | 2.0382496 × 10^12 |
Monotonicity | Not monotonic |
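Most CONT_DATE values look like dates encoded as YYYYMMDD (for example 20190419), while the minimum of 51.04 does not, which suggests at least one malformed record. A minimal sketch of a validity check, assuming the YYYYMMDD interpretation is correct; `parse_yyyymmdd` is a hypothetical helper, not part of any profiling tool:

```python
from datetime import date, datetime

def parse_yyyymmdd(value):
    """Return a date if value looks like YYYYMMDD (optionally with a trailing '.0'), else None."""
    text = str(value)
    if text.endswith(".0"):
        text = text[:-2]  # values are shown as floats in the report, e.g. 20190419.0
    if len(text) != 8 or not text.isdigit():
        return None
    try:
        return datetime.strptime(text, "%Y%m%d").date()
    except ValueError:
        return None  # eight digits, but not a real calendar date

for raw in (20190419.0, 20190629.0, 51.04):
    print(raw, "->", parse_yyyymmdd(raw))
```

Rows for which the helper returns `None` are candidates for manual inspection before any date arithmetic is done on the column.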
Value | Count | Frequency (%) |
20190419.0 | 37 | 18.5% |
20190323.0 | 6 | 3.0% |
20190613.0 | 6 | 3.0% |
20190612.0 | 5 | 2.5% |
20190316.0 | 5 | 2.5% |
20190228.0 | 4 | 2.0% |
20190422.0 | 4 | 2.0% |
20190325.0 | 4 | 2.0% |
20190130.0 | 3 | 1.5% |
20190614.0 | 3 | 1.5% |
Other values (95) | 123 | 61.5% |
Value | Count | Frequency (%) |
51.04 | 1 | 0.5% |
20190101.0 | 1 | 0.5% |
20190103.0 | 1 | 0.5% |
20190107.0 | 1 | 0.5% |
20190110.0 | 1 | 0.5% |
20190112.0 | 1 | 0.5% |
20190114.0 | 1 | 0.5% |
20190115.0 | 1 | 0.5% |
20190117.0 | 1 | 0.5% |
20190118.0 | 2 | 1.0% |
Value | Count | Frequency (%) |
20190629.0 | 2 | 1.0% |
20190628.0 | 1 | 0.5% |
20190626.0 | 1 | 0.5% |
20190625.0 | 1 | 0.5% |
20190624.0 | 1 | 0.5% |
20190622.0 | 1 | 0.5% |
20190620.0 | 1 | 0.5% |
20190619.0 | 1 | 0.5% |
20190618.0 | 1 | 0.5% |
20190617.0 | 1 | 0.5% |
AMOUNT
Real number (ℝ)
HIGH CORRELATION
Distinct | 121 |
---|---|
Distinct (%) | 60.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 135284.71 |
Minimum | 8000 |
---|---|
Maximum | 20190323 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 8000 |
---|---|
5-th percentile | 11895 |
Q1 | 25112.5 |
median | 29425 |
Q3 | 40200 |
95-th percentile | 83025 |
Maximum | 20190323 |
Range | 20182323 |
Interquartile range (IQR) | 15087.5 |
Descriptive statistics
Standard deviation | 1425361.8 |
---|---|
Coefficient of variation (CV) | 10.536015 |
Kurtosis | 199.9258 |
Mean | 135284.71 |
Median Absolute Deviation (MAD) | 7150 |
Skewness | 14.138224 |
Sum | 27056943 |
Variance | 2.0316561 × 10^12 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
29000 | 17 | 8.5% |
28000 | 11 | 5.5% |
29500 | 5 | 2.5% |
31000 | 4 | 2.0% |
51000 | 4 | 2.0% |
83000 | 4 | 2.0% |
12000 | 4 | 2.0% |
29450 | 3 | 1.5% |
50000 | 3 | 1.5% |
55000 | 3 | 1.5% |
Other values (111) | 142 | 71.0% |
Value | Count | Frequency (%) |
8000 | 1 | 0.5% |
8400 | 1 | 0.5% |
8500 | 1 | 0.5% |
8600 | 1 | 0.5% |
8700 | 1 | 0.5% |
10600 | 2 | 1.0% |
11000 | 1 | 0.5% |
11800 | 2 | 1.0% |
11900 | 2 | 1.0% |
11950 | 1 | 0.5% |
Value | Count | Frequency (%) |
20190323 | 1 | 0.5% |
115000 | 1 | 0.5% |
110000 | 1 | 0.5% |
89500 | 1 | 0.5% |
89000 | 1 | 0.5% |
88000 | 1 | 0.5% |
87000 | 1 | 0.5% |
85000 | 1 | 0.5% |
84500 | 1 | 0.5% |
83500 | 1 | 0.5% |
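The largest AMOUNT value (20190323) is far above the next one (115000) and has the same shape as the CONT_DATE values, which may point to a misaligned record. As one possible way to screen for such values, the sketch below applies the conventional 1.5 × IQR fence to the quartiles reported for AMOUNT. The rule is a common heuristic chosen here for illustration; it is not something the report itself applies.

```python
q1, q3 = 25112.5, 40200.0          # quartiles reported for AMOUNT
iqr = q3 - q1                      # 15087.5, matching the report
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The five largest AMOUNT values from the table above.
largest = [20190323, 115000, 110000, 89500, 89000]
flagged = [v for v in largest if v > upper_fence]

# The ratio to the reported median (29425) shows how far each flagged value
# sits from the bulk of the data.
median = 29425
ratios = {v: v / median for v in flagged}

print("upper fence:", upper_fence)
for value, ratio in ratios.items():
    print(f"{value}: {ratio:.1f}x median")
```

All five values exceed the fence, so the fence alone does not separate plausible high prices from the suspect entry; the ratio to the median makes the difference in scale visible.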
DEPOSIT
Categorical
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
<NA> | 199 |
---|---|
29000 | 1 |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 4.005 |
Min length | 4 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.5% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 199 | 99.5% |
29000 | 1 | 0.5% |
RENT_AMOUNT
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 200 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.9 KiB |
BLD_CLSS
Categorical
HIGH CORRELATION
Distinct | 4 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
연립/다세대 | 102 |
---|---|
아파트 | 84 |
기타 | 13 |
<NA> | 1 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 4.47 |
Min length | 2 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.5% |
Sample
1st row | 아파트 |
---|---|
2nd row | 연립/다세대 |
3rd row | 연립/다세대 |
4th row | 연립/다세대 |
5th row | 연립/다세대 |
Common Values
Value | Count | Frequency (%) |
연립/다세대 | 102 | 51.0% |
아파트 | 84 | 42.0% |
기타 | 13 | 6.5% |
<NA> | 1 | 0.5% |
Variable | ADDRESS | APT_NM | BUILD_STDYM | HOUS_ID | BLD_CD | HOUS_ADDR | ROAD_ADDR | X_AXIS | Y_AXIS | BLK_CD | FLOOR | CONT_CLSS | CONT_DATE | AMOUNT | BLD_CLSS |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
ADDRESS | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.476 | 1.000 | 1.000 | 1.000 | 1.000 |
APT_NM | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.476 | 1.000 | 1.000 | 1.000 | 1.000 |
BUILD_STDYM | 1.000 | 1.000 | 1.000 | 0.757 | 0.822 | 1.000 | 1.000 | 0.999 | 0.856 | 0.889 | 0.801 | 1.000 | 1.000 | 1.000 | 0.969 |
HOUS_ID | 1.000 | 1.000 | 0.757 | 1.000 | 0.888 | 1.000 | 1.000 | 1.000 | 0.987 | 0.861 | 0.800 | NaN | NaN | NaN | 0.248 |
BLD_CD | 1.000 | 1.000 | 0.822 | 0.888 | 1.000 | 1.000 | 1.000 | 1.000 | 0.960 | 0.931 | 0.749 | NaN | NaN | NaN | 0.341 |
HOUS_ADDR | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.476 | 1.000 | 1.000 | 1.000 | 1.000 |
ROAD_ADDR | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.476 | 1.000 | 1.000 | 1.000 | 1.000 |
X_AXIS | 1.000 | 1.000 | 0.999 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.489 | 1.000 | 1.000 | 1.000 | 0.999 |
Y_AXIS | 1.000 | 1.000 | 0.856 | 0.987 | 0.960 | 1.000 | 1.000 | 1.000 | 1.000 | 0.986 | 0.816 | 1.000 | 1.000 | 1.000 | 0.238 |
BLK_CD | 1.000 | 1.000 | 0.889 | 0.861 | 0.931 | 1.000 | 1.000 | 1.000 | 0.986 | 1.000 | 0.514 | 0.218 | 0.218 | 0.217 | 0.563 |
FLOOR | 0.476 | 0.476 | 0.801 | 0.800 | 0.749 | 0.476 | 0.476 | 0.489 | 0.816 | 0.514 | 1.000 | 1.000 | 1.000 | 1.000 | 0.619 |
CONT_CLSS | 1.000 | 1.000 | 1.000 | NaN | NaN | 1.000 | 1.000 | 1.000 | 1.000 | 0.218 | 1.000 | 1.000 | 0.700 | 0.700 | NaN |
CONT_DATE | 1.000 | 1.000 | 1.000 | NaN | NaN | 1.000 | 1.000 | 1.000 | 1.000 | 0.218 | 1.000 | 0.700 | 1.000 | 0.700 | NaN |
AMOUNT | 1.000 | 1.000 | 1.000 | NaN | NaN | 1.000 | 1.000 | 1.000 | 1.000 | 0.217 | 1.000 | 0.700 | 0.700 | 1.000 | NaN |
BLD_CLSS | 1.000 | 1.000 | 0.969 | 0.248 | 0.341 | 1.000 | 1.000 | 0.999 | 0.238 | 0.563 | 0.619 | NaN | NaN | NaN | 1.000 |
Variable | BUILD_STDYM | FLOOR | CONT_CLSS | DEPOSIT | BLD_CLSS |
---|---|---|---|---|---|
BUILD_STDYM | 1.000 | 0.317 | 0.932 | NaN | 0.769 |
FLOOR | 0.317 | 1.000 | 0.940 | NaN | 0.340 |
CONT_CLSS | 0.932 | 0.940 | 1.000 | NaN | 1.000 |
DEPOSIT | NaN | NaN | NaN | 1.000 | NaN |
BLD_CLSS | 0.769 | 0.340 | 1.000 | NaN | 1.000 |
Variable | HOUS_ID | BLD_CD | Y_AXIS | BLK_CD | CONT_DATE | AMOUNT | BUILD_STDYM | FLOOR | CONT_CLSS | DEPOSIT | BLD_CLSS |
---|---|---|---|---|---|---|---|---|---|---|---|
HOUS_ID | 1.000 | 0.979 | 0.026 | 0.638 | -0.064 | 0.237 | 0.624 | 0.629 | 0.992 | NaN | 0.236 |
BLD_CD | 0.979 | 1.000 | 0.004 | 0.651 | -0.067 | 0.226 | 0.879 | 0.094 | 0.820 | NaN | 0.824 |
Y_AXIS | 0.026 | 0.004 | 1.000 | 0.124 | -0.075 | 0.304 | 0.551 | 0.604 | 0.992 | NaN | 0.298 |
BLK_CD | 0.638 | 0.651 | 0.124 | 1.000 | -0.033 | 0.244 | 0.546 | 0.194 | 0.165 | NaN | 0.381 |
CONT_DATE | -0.064 | -0.067 | -0.075 | -0.033 | 1.000 | 0.034 | 0.932 | 0.940 | 0.494 | NaN | 1.000 |
AMOUNT | 0.237 | 0.226 | 0.304 | 0.244 | 0.034 | 1.000 | 0.932 | 0.940 | 0.494 | NaN | 1.000 |
BUILD_STDYM | 0.624 | 0.879 | 0.551 | 0.546 | 0.932 | 0.932 | 1.000 | 0.317 | 0.932 | NaN | 0.769 |
FLOOR | 0.629 | 0.094 | 0.604 | 0.194 | 0.940 | 0.940 | 0.317 | 1.000 | 0.940 | NaN | 0.340 |
CONT_CLSS | 0.992 | 0.820 | 0.992 | 0.165 | 0.494 | 0.494 | 0.932 | 0.940 | 1.000 | NaN | 1.000 |
DEPOSIT | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.000 | 0.000 |
BLD_CLSS | 0.236 | 0.824 | 0.298 | 0.381 | 1.000 | 1.000 | 0.769 | 0.340 | 1.000 | 0.000 | 1.000 |
Row | ADDRESS | APT_NM | BUILD_STDYM | HOUS_ID | BLD_CD | HOUS_ADDR | ROAD_ADDR | X_AXIS | Y_AXIS | BLK_CD | FLOOR | CONT_CLSS | AREA | CONT_DATE | AMOUNT | DEPOSIT | RENT_AMOUNT | BLD_CLSS |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 대구광역시 동구 입석동 893-29 센트로파크A동 | 센트로파크A동 | 2015 | 2714010900008930029 | 2714010900108930029000001 | 대구광역시 동구 입석동 893-29번지 센트로파크A동 | 대구광역시 동구 동촌역사로3길 28 센트로파크A동 | 458963 | 365766 | 255470 | 7 | 매매 | 83.6835 | 20190325.0 | 23000 | <NA> | <NA> | 아파트 |
1 | 인천광역시 서구 왕길동 660-2 타워팰리스2 | 타워팰리스2 | 2015 | 2826012000006600002 | 2826012000106600002000001 | 인천광역시 서구 왕길동 660-2번지 타워팰리스2 | 인천광역시 서구 완정로117번길 61 타워팰리스2 | 281589 | 556276 | 412321 | 7 | 매매 | 48.5523 | 20190524.0 | 13300 | <NA> | <NA> | 연립/다세대 |
2 | 인천광역시 서구 왕길동 660-2 타워팰리스2 | 타워팰리스2 | 2015 | 2826012000006600002 | 2826012000106600002000001 | 인천광역시 서구 왕길동 660-2번지 타워팰리스2 | 인천광역시 서구 완정로117번길 61 타워팰리스2 | 281589 | 556276 | 412321 | 7 | 매매 | 49.8441 | 20190418.0 | 14500 | <NA> | <NA> | 연립/다세대 |
3 | 인천광역시 서구 왕길동 660-2 타워팰리스2 | 타워팰리스2 | 2015 | 2826012000006600002 | 2826012000106600002000001 | 인천광역시 서구 왕길동 660-2번지 타워팰리스2 | 인천광역시 서구 완정로117번길 61 타워팰리스2 | 281589 | 556276 | 412321 | 7 | 매매 | 49.2521 | 20190130.0 | 13500 | <NA> | <NA> | 연립/다세대 |
4 | 인천광역시 서구 왕길동 660-2 타워팰리스2 | 타워팰리스2 | 2015 | 2826012000006600002 | 2826012000106600002000001 | 인천광역시 서구 왕길동 660-2번지 타워팰리스2 | 인천광역시 서구 완정로117번길 61 타워팰리스2 | 281589 | 556276 | 412321 | 6 | 매매 | 49.2521 | 20190126.0 | 12000 | <NA> | <NA> | 연립/다세대 |
5 | 인천광역시 서구 검암동 525-10 베르데힐(525-10) | 베르데힐(525-10) | 2015 | 2826010300005250010 | 2826010300105250010000001 | 인천광역시 서구 검암동 525-10번지 베르데힐(525-10) | 인천광역시 서구 허암길 5-3 베르데힐(525-10) | 283401 | 552062 | 501325 | 4 | 매매 | 55.43 | 20190302.0 | 18500 | <NA> | <NA> | 연립/다세대 |
6 | 인천광역시 서구 검암동 525-10 베르데힐(525-10) | 베르데힐(525-10) | 2015 | 2826010300005250010 | 2826010300105250010000001 | 인천광역시 서구 검암동 525-10번지 베르데힐(525-10) | 인천광역시 서구 허암길 5-3 베르데힐(525-10) | 283401 | 552062 | 501325 | 3 | 매매 | 56 | 20190130.0 | 16000 | <NA> | <NA> | 연립/다세대 |
7 | 광주광역시 남구 행암동 562 제일풍경채에듀파크2단지 | 제일풍경채에듀파크2단지 | 2015 | 2915511000005620000 | 2915511000105620000000001 | 광주광역시 남구 행암동 562번지 제일풍경채에듀파크2단지 | 광주광역시 남구 효우2로 46 제일풍경채에듀파크2단지 | 298686 | 278667 | 516861 | 12 | 매매 | 84.9184 | 20190518.0 | 35900 | <NA> | <NA> | 아파트 |
8 | 광주광역시 남구 행암동 562 제일풍경채에듀파크2단지 | 제일풍경채에듀파크2단지 | 2015 | 2915511000005620000 | 2915511000105620000000001 | 광주광역시 남구 행암동 562번지 제일풍경채에듀파크2단지 | 광주광역시 남구 효우2로 46 제일풍경채에듀파크2단지 | 298686 | 278667 | 516861 | 6 | 매매 | 84.9184 | 20190419.0 | 35500 | <NA> | <NA> | 아파트 |
9 | 광주광역시 남구 행암동 562 제일풍경채에듀파크2단지 | 제일풍경채에듀파크2단지 | 2015 | 2915511000005620000 | 2915511000105620000000001 | 광주광역시 남구 행암동 562번지 제일풍경채에듀파크2단지 | 광주광역시 남구 효우2로 46 제일풍경채에듀파크2단지 | 298686 | 278667 | 516861 | 6 | 매매 | 84.9184 | 20190618.0 | 41900 | <NA> | <NA> | 아파트 |
Row | ADDRESS | APT_NM | BUILD_STDYM | HOUS_ID | BLD_CD | HOUS_ADDR | ROAD_ADDR | X_AXIS | Y_AXIS | BLK_CD | FLOOR | CONT_CLSS | AREA | CONT_DATE | AMOUNT | DEPOSIT | RENT_AMOUNT | BLD_CLSS |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
190 | 서울특별시 동대문구 청량리동 829 (829-0) | (829-0) | 1980 | 1123010700008290000 | 1123010700108290000012176 | 서울특별시 동대문구 청량리동 829번지 (829-0) | 서울특별시 동대문구 제기로31길 32-13 (829-0) | 316058 | 554464 | 363112 | 1 | 매매 | 53.95 | 20190323.0 | 49000 | <NA> | <NA> | 연립/다세대 |
191 | 서울특별시 동대문구 청량리동 834 (834-0) | (834-0) | 1980 | 1123010700008340000 | 1123010700108340000012236 | 서울특별시 동대문구 청량리동 834번지 (834-0) | 서울특별시 동대문구 제기로31길 32-3 (834-0) | 316026 | 554437 | 363112 | 1 | 매매 | 86.18 | 20190125.0 | 41200 | <NA> | <NA> | 연립/다세대 |
192 | 서울특별시 동대문구 청량리동 834 (834-0) | (834-0) | 1980 | 1123010700008340000 | 1123010700108340000012236 | 서울특별시 동대문구 청량리동 834번지 (834-0) | 서울특별시 동대문구 제기로31길 32-3 (834-0) | 316026 | 554437 | 363112 | 2 | 매매 | 86.18 | 20190119.0 | 43292 | <NA> | <NA> | 연립/다세대 |
193 | 서울특별시 동대문구 청량리동 868 (868-0) | (868-0) | 1979 | 1123010700008680000 | 1123010700108680000012291 | 서울특별시 동대문구 청량리동 868번지 (868-0) | 서울특별시 동대문구 홍릉로24길 50-4 (868-0) | 315918 | 554410 | 363102 | 2 | 매매 | 54.98 | 20190228.0 | 47500 | <NA> | <NA> | 연립/다세대 |
194 | 서울특별시 동대문구 청량리동 905 (905-0) | (905-0) | 1981 | 1123010700009050000 | 1123010700109050000012634 | 서울특별시 동대문구 청량리동 905번지 (905-0) | 서울특별시 동대문구 제기로29길 20-3 (905-0) | 315980 | 554309 | 362686 | 1 | 매매 | 32.83 | 20190406.0 | 29000 | <NA> | <NA> | 연립/다세대 |
195 | 서울특별시 동대문구 청량리동 926 (926-0) | (926-0) | 1981 | 1123010700009260000 | 1123010700109260000013130 | 서울특별시 동대문구 청량리동 926번지 (926-0) | 서울특별시 동대문구 홍릉로22길 38 (926-0) | 315860 | 554364 | 362674 | 1 | 매매 | 51.21 | 20190521.0 | 36500 | <NA> | <NA> | 연립/다세대 |
196 | 서울특별시 동대문구 청량리동 949 상그레빌 | 상그레빌 | 2004 | 1123010700009490000 | 1123010700109490000000001 | 서울특별시 동대문구 청량리동 949번지 상그레빌 | 서울특별시 동대문구 회기로5길 100 상그레빌 | 315504 | 555191 | 412693 | 6 | 매매 | 80.16 | 20190612.0 | 51500 | <NA> | <NA> | 아파트 |
197 | 서울특별시 동대문구 회기동 54-47 탑스빌 | 탑스빌 | 2002 | 1123010800000540047 | 1123010800100540047011492 | 서울특별시 동대문구 회기동 54-47번지 탑스빌 | 서울특별시 동대문구 회기로23다길 18 탑스빌 | 316797 | 554980 | 362091 | 2 | 매매 | 58.33 | 20190509.0 | 22000 | <NA> | <NA> | 연립/다세대 |
198 | 서울특별시 동대문구 회기동 60-218 동일아트맨션19차나동 | 동일아트맨션19차나동 | 2003 | 1123010800000600218 | 1123010800100600218011504 | 서울특별시 동대문구 회기동 60-218번지 동일아트맨션19차나동 | 서울특별시 동대문구 회기로 108-10 동일아트맨션19차나동 | 316116 | 554772 | 362116 | 5 | 매매 | 52.53 | 20190123.0 | 20500 | <NA> | <NA> | 연립/다세대 |
199 | 서울특별시 동대문구 회기동 65 신현대 | 신현대 | 1989 | 1123010800000650000 | 1123010800100650000010212 | 서울특별시 동대문구 회기동 65번지 신현대 | 서울특별시 동대문구 이문로1길 21 신현대 | 316455 | 554563 | 362185 | 11 | 매매 | 84.96 | 20190331.0 | 55000 | <NA> | <NA> | 아파트 |
Most frequently occurring
Row | ADDRESS | APT_NM | BUILD_STDYM | HOUS_ID | BLD_CD | HOUS_ADDR | ROAD_ADDR | X_AXIS | Y_AXIS | BLK_CD | FLOOR | CONT_CLSS | AREA | CONT_DATE | AMOUNT | DEPOSIT | BLD_CLSS | # duplicates |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2 | 서울특별시 동대문구 장안동 409-1 홀가하우스 | 홀가하우스 | 2019 | 1123010600004090001 | 1123010600104090001018025 | 서울특별시 동대문구 장안동 409-1번지 홀가하우스 | 서울특별시 동대문구 한천로6길 26 홀가하우스 | 317220 | 551666 | 344817 | 4 | 매매 | 30.04 | 20190316.0 | 29000 | <NA> | 연립/다세대 | 3 |
0 | 서울특별시 동대문구 장안동 409-1 홀가하우스 | 홀가하우스 | 2019 | 1123010600004090001 | 1123010600104090001018025 | 서울특별시 동대문구 장안동 409-1번지 홀가하우스 | 서울특별시 동대문구 한천로6길 26 홀가하우스 | 317220 | 551666 | 344817 | 3 | 매매 | 30.04 | 20190430.0 | 28000 | <NA> | 연립/다세대 | 2 |
1 | 서울특별시 동대문구 장안동 409-1 홀가하우스 | 홀가하우스 | 2019 | 1123010600004090001 | 1123010600104090001018025 | 서울특별시 동대문구 장안동 409-1번지 홀가하우스 | 서울특별시 동대문구 한천로6길 26 홀가하우스 | 317220 | 551666 | 344817 | 3 | 매매 | 30.04 | 20190517.0 | 28000 | <NA> | 연립/다세대 | 2 |
3 | 서울특별시 동대문구 장안동 409-1 홀가하우스 | 홀가하우스 | 2019 | 1123010600004090001 | 1123010600104090001018025 | 서울특별시 동대문구 장안동 409-1번지 홀가하우스 | 서울특별시 동대문구 한천로6길 26 홀가하우스 | 317220 | 551666 | 344817 | 5 | 매매 | 30.04 | 20190325.0 | 29000 | <NA> | 연립/다세대 | 2 |
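The "# duplicates" column above counts how many times each fully identical row occurs. A minimal sketch of that kind of count, using Python's `collections.Counter`, is shown below. The records are hypothetical, abbreviated tuples of (APT_NM, FLOOR, CONT_DATE, AMOUNT) built from the rows listed in this report; the real comparison covers every column, so this illustrates the idea only.

```python
from collections import Counter

# Hypothetical, abbreviated records: (APT_NM, FLOOR, CONT_DATE, AMOUNT).
rows = (
    [("홀가하우스", 4, 20190316.0, 29000)] * 3
    + [("홀가하우스", 3, 20190430.0, 28000)] * 2
    + [("홀가하우스", 3, 20190517.0, 28000)] * 2
    + [("홀가하우스", 5, 20190325.0, 29000)] * 2
    + [("상그레빌", 6, 20190612.0, 51500)]  # appears once, so not a duplicate
)

counts = Counter(rows)

# Keep only the records that occur more than once.
duplicates = {row: c for row, c in counts.items() if c > 1}

# The single most frequent record and its count.
most_frequent = counts.most_common(1)[0]

print("distinct duplicated records:", len(duplicates))
print("most frequent:", most_frequent)
```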