Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.1 MiB |
Average record size in memory | 114.0 B |
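The two memory figures can be cross-checked against each other: the average record size multiplied by the number of observations should reproduce the total, up to rounding. A minimal sketch using only the values in the table above (the variable names are illustrative):

```python
# Values copied from the Dataset statistics table.
n_observations = 10_000
avg_record_bytes = 114.0

# 1 MiB = 1024 * 1024 bytes.
total_mib = n_observations * avg_record_bytes / (1024 * 1024)

# Rounded to one decimal place for comparison with the reported total size.
total_mib_rounded = round(total_mib, 1)
```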
Variable types
Numeric | 2 |
---|---|
Categorical | 3 |
Text | 8 |
Dataset
Description | Sample |
---|---|
Author | 리파인 |
URL | https://www.bigdata-realestate.kr/rebpp/usr/prd/prdInfoDetail.do?req_productId=183 |
Alerts
SIDO_NM is highly overall correlated with APT_POTVALE_RLT_RT_CMPR_INFO_NO and 1 other field | High correlation |
TNSHP_NM is highly overall correlated with APT_POTVALE_RLT_RT_CMPR_INFO_NO and 2 other fields | High correlation |
APT_POTVALE_RLT_RT_CMPR_INFO_NO is highly overall correlated with SIDO_NM and 1 other field | High correlation |
EMD_ACCTO_POTVALE_RLT_RT is highly overall correlated with TNSHP_NM | High correlation |
TNSHP_NM is highly imbalanced (62.7%) | Imbalance |
APT_POTVALE_RLT_RT_CMPR_INFO_NO has unique values | Unique |
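The imbalance percentage reported for TNSHP_NM is a single score summarising how unevenly the categories are distributed. One common way to define such a score is one minus the normalised Shannon entropy. The sketch below assumes that definition (this report does not confirm it) and applies it to SIDO_NM, whose three category counts are fully listed in the report. The function name `imbalance_score` is hypothetical.

```python
from math import log2

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform distribution, 1 when one class holds everything."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy in bits; empty classes contribute nothing.
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1 - entropy / log2(k)

# SIDO_NM counts as listed in its Common Values table.
sido_counts = [6738, 1731, 1531]
sido_score = imbalance_score(sido_counts)  # roughly 0.22 under this assumed definition

# A perfectly balanced variable scores (numerically) zero.
uniform_score = imbalance_score([100, 100, 100])
```

Under this assumed definition SIDO_NM scores far below the 62.7% reported for TNSHP_NM, which is consistent with only TNSHP_NM being flagged.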
Reproduction
Analysis started | 2023-12-11 22:32:15.280230 |
---|---|
Analysis finished | 2023-12-11 22:32:18.348337 |
Duration | 3.07 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
APT_POTVALE_RLT_RT_CMPR_INFO_NO
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11748.064 |
Minimum | 1 |
---|---|
Maximum | 23550 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1139.85 |
Q1 | 5852.75 |
median | 11701 |
Q3 | 17694.5 |
95-th percentile | 22386.15 |
Maximum | 23550 |
Range | 23549 |
Interquartile range (IQR) | 11841.75 |
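Fractional quartiles such as Q1 = 5852.75 on integer data suggest that the percentiles are linearly interpolated between neighbouring order statistics. The sketch below shows that interpolation on toy data (the ten smallest identifiers listed in this report) using Python's standard library. That the profiler uses exactly this method is an assumption.

```python
from statistics import quantiles

# Toy data: the ten smallest identifiers from the report, used only for illustration.
values = [1, 4, 8, 10, 11, 12, 13, 15, 17, 20]

# method="inclusive" interpolates linearly between neighbouring order statistics.
q1, median, q3 = quantiles(values, n=4, method="inclusive")

# Range and interquartile range follow directly from the quantiles.
iqr = q3 - q1
value_range = max(values) - min(values)
```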
Descriptive statistics
Standard deviation | 6813.232 |
---|---|
Coefficient of variation (CV) | 0.57994511 |
Kurtosis | -1.2086111 |
Mean | 11748.064 |
Median Absolute Deviation (MAD) | 5919 |
Skewness | 0.0073313905 |
Sum | 1.1748064 × 10^8 |
Variance | 46420130 |
Monotonicity | Not monotonic |
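Several of the descriptive statistics above are derived from others, so they can be cross-checked with simple arithmetic. A small sketch using the figures reported for APT_POTVALE_RLT_RT_CMPR_INFO_NO (the inputs are rounded as printed, so the comparisons are approximate):

```python
from math import isclose

# Figures copied from the report.
mean = 11748.064
std = 6813.232
minimum, maximum = 1, 23550
q1, q3 = 5852.75, 17694.5
n = 10_000

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range
total = mean * n                 # sum of all values

# Compare with the reported values, allowing for rounding of the inputs.
cv_matches = isclose(cv, 0.57994511, rel_tol=1e-5)
variance_matches = isclose(variance, 46420130, rel_tol=1e-5)
```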
Value | Count | Frequency (%) |
619 | 1 | < 0.1% |
14376 | 1 | < 0.1% |
6749 | 1 | < 0.1% |
17485 | 1 | < 0.1% |
7556 | 1 | < 0.1% |
19924 | 1 | < 0.1% |
6698 | 1 | < 0.1% |
12875 | 1 | < 0.1% |
20898 | 1 | < 0.1% |
12144 | 1 | < 0.1% |
Other values (9990) | 9990 | 99.9% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
4 | 1 | < 0.1% |
8 | 1 | < 0.1% |
10 | 1 | < 0.1% |
11 | 1 | < 0.1% |
12 | 1 | < 0.1% |
13 | 1 | < 0.1% |
15 | 1 | < 0.1% |
17 | 1 | < 0.1% |
20 | 1 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
23550 | 1 | < 0.1% |
23549 | 1 | < 0.1% |
23548 | 1 | < 0.1% |
23543 | 1 | < 0.1% |
23542 | 1 | < 0.1% |
23532 | 1 | < 0.1% |
23529 | 1 | < 0.1% |
23526 | 1 | < 0.1% |
23520 | 1 | < 0.1% |
23517 | 1 | < 0.1% |
SIDO_NM
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
경기도 | 6738 |
---|---|
인천광역시 | 1731 |
서울특별시 | 1531 |
Length
Max length | 5 |
---|---|
Median length | 3 |
Mean length | 3.6524 |
Min length | 3 |
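The length statistics can be reproduced from the category counts alone, since every row holds one of three names. A short sketch using the SIDO_NM counts from the Common Values table for this variable:

```python
from statistics import mean, median

# Category counts for SIDO_NM, copied from its Common Values table.
counts = {"경기도": 6738, "인천광역시": 1731, "서울특별시": 1531}

# Expand to one string length per row, then summarise.
lengths = [len(name) for name, c in counts.items() for _ in range(c)]

mean_length = mean(lengths)
median_length = median(lengths)
min_length, max_length = min(lengths), max(lengths)
```

The results agree with the table: the mean is (3 × 6738 + 5 × 1731 + 5 × 1531) / 10000 = 3.6524, and the median is 3 because more than half of the rows hold the three-character name.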
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울특별시 |
---|---|
2nd row | 경기도 |
3rd row | 경기도 |
4th row | 인천광역시 |
5th row | 경기도 |
Common Values
Value | Count | Frequency (%) |
경기도 | 6738 | 67.4% |
인천광역시 | 1731 | 17.3% |
서울특별시 | 1531 | 15.3% |
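The Frequency (%) column is each count divided by the number of observations. The sketch below recomputes it for SIDO_NM. The rule that shares below 0.1% are shown as "< 0.1%" is inferred from how the report displays single-occurrence values, and the helper name `format_frequency` is hypothetical.

```python
def format_frequency(count, total):
    """Format count/total as a one-decimal percentage; tiny non-zero shares become '< 0.1%'."""
    pct = 100 * count / total
    if 0 < pct < 0.1:
        return "< 0.1%"
    return f"{pct:.1f}%"

n_rows = 10_000
sido = {"경기도": 6738, "인천광역시": 1731, "서울특별시": 1531}

# Recompute the frequency column from the counts.
frequencies = {name: format_frequency(c, n_rows) for name, c in sido.items()}

# A value that occurs once in 10,000 rows falls under the display threshold.
rare = format_frequency(1, n_rows)
```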
SIGNGU_NM
Text
Distinct | 64 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
평택시 | 681 | 6.8% |
수원시 | 564 | 5.6% |
화성시 | 544 | 5.4% |
고양시 | 481 | 4.8% |
용인시 | 467 | 4.7% |
서구 | 374 | 3.7% |
시흥시 | 358 | 3.6% |
남동구 | 350 | 3.5% |
부천시 | 319 | 3.2% |
파주시 | 295 | 2.9% |
Other values (54) | 5567 | 55.7% |
Most occurring characters
Value | Count | Frequency (%) |
시 | 7022 | 23.1% |
구 | 3362 | 11.0% |
양 | 1404 | 4.6% |
성 | 1094 | 3.6% |
평 | 1071 | 3.5% |
남 | 1029 | 3.4% |
주 | 984 | 3.2% |
부 | 832 | 2.7% |
천 | 789 | 2.6% |
수 | 780 | 2.6% |
Other values (55) | 12066 | 39.6% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30433 | 100.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 7022 | 23.1% |
구 | 3362 | 11.0% |
양 | 1404 | 4.6% |
성 | 1094 | 3.6% |
평 | 1071 | 3.5% |
남 | 1029 | 3.4% |
주 | 984 | 3.2% |
부 | 832 | 2.7% |
천 | 789 | 2.6% |
수 | 780 | 2.6% |
Other values (55) | 12066 | 39.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 30433 | 100.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 7022 | 23.1% |
구 | 3362 | 11.0% |
양 | 1404 | 4.6% |
성 | 1094 | 3.6% |
평 | 1071 | 3.5% |
남 | 1029 | 3.4% |
주 | 984 | 3.2% |
부 | 832 | 2.7% |
천 | 789 | 2.6% |
수 | 780 | 2.6% |
Other values (55) | 12066 | 39.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 30433 | 100.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 7022 | 23.1% |
구 | 3362 | 11.0% |
양 | 1404 | 4.6% |
성 | 1094 | 3.6% |
평 | 1071 | 3.5% |
남 | 1029 | 3.4% |
주 | 984 | 3.2% |
부 | 832 | 2.7% |
천 | 789 | 2.6% |
수 | 780 | 2.6% |
Other values (55) | 12066 | 39.6% |
TNSHP_NM
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 18 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | 7920 |
---|---|
영통구 | 240 |
일산서구 | 178 |
덕양구 | 175 |
권선구 | 173 |
Other values (13) | 1314 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8226 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | 동안구 |
4th row | <NA> |
5th row | 권선구 |
Common Values
Value | Count | Frequency (%) |
<NA> | 7920 | 79.2% |
영통구 | 240 | 2.4% |
일산서구 | 178 | 1.8% |
덕양구 | 175 | 1.8% |
권선구 | 173 | 1.7% |
기흥구 | 166 | 1.7% |
수지구 | 160 | 1.6% |
처인구 | 141 | 1.4% |
일산동구 | 128 | 1.3% |
단원구 | 123 | 1.2% |
Other values (8) | 596 | 6.0% |
EMD_NM
Text
Distinct | 678 |
---|---|
Distinct (%) | 6.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
공도읍 | 124 | 1.2% |
송도동 | 107 | 1.1% |
영통동 | 102 | 1.0% |
만수동 | 97 | 1.0% |
배곧동 | 84 | 0.8% |
구월동 | 83 | 0.8% |
논현동 | 83 | 0.8% |
중산동 | 79 | 0.8% |
안양동 | 75 | 0.8% |
청라동 | 74 | 0.7% |
Other values (668) | 9092 | 90.9% |
Most occurring characters
Value | Count | Frequency (%) |
동 | 8962 | 29.8% |
읍 | 1007 | 3.3% |
산 | 780 | 2.6% |
정 | 568 | 1.9% |
신 | 443 | 1.5% |
곡 | 396 | 1.3% |
현 | 385 | 1.3% |
도 | 376 | 1.2% |
면 | 334 | 1.1% |
가 | 313 | 1.0% |
Other values (241) | 16557 | 55.0% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30015 | 99.6% |
Decimal Number | 106 | 0.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 8962 | 29.9% |
읍 | 1007 | 3.4% |
산 | 780 | 2.6% |
정 | 568 | 1.9% |
신 | 443 | 1.5% |
곡 | 396 | 1.3% |
현 | 385 | 1.3% |
도 | 376 | 1.3% |
면 | 334 | 1.1% |
가 | 313 | 1.0% |
Other values (233) | 16451 | 54.8% |
Decimal Number
Value | Count | Frequency (%) |
2 | 25 | 23.6% |
1 | 24 | 22.6% |
3 | 20 | 18.9% |
7 | 18 | 17.0% |
5 | 7 | 6.6% |
4 | 7 | 6.6% |
6 | 3 | 2.8% |
8 | 2 | 1.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 30015 | 99.6% |
Common | 106 | 0.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 8962 | 29.9% |
읍 | 1007 | 3.4% |
산 | 780 | 2.6% |
정 | 568 | 1.9% |
신 | 443 | 1.5% |
곡 | 396 | 1.3% |
현 | 385 | 1.3% |
도 | 376 | 1.3% |
면 | 334 | 1.1% |
가 | 313 | 1.0% |
Other values (233) | 16451 | 54.8% |
Common
Value | Count | Frequency (%) |
2 | 25 | 23.6% |
1 | 24 | 22.6% |
3 | 20 | 18.9% |
7 | 18 | 17.0% |
5 | 7 | 6.6% |
4 | 7 | 6.6% |
6 | 3 | 2.8% |
8 | 2 | 1.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 30015 | 99.6% |
ASCII | 106 | 0.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 8962 | 29.9% |
읍 | 1007 | 3.4% |
산 | 780 | 2.6% |
정 | 568 | 1.9% |
신 | 443 | 1.5% |
곡 | 396 | 1.3% |
현 | 385 | 1.3% |
도 | 376 | 1.3% |
면 | 334 | 1.1% |
가 | 313 | 1.0% |
Other values (233) | 16451 | 54.8% |
ASCII
Value | Count | Frequency (%) |
2 | 25 | 23.6% |
1 | 24 | 22.6% |
3 | 20 | 18.9% |
7 | 18 | 17.0% |
5 | 7 | 6.6% |
4 | 7 | 6.6% |
6 | 3 | 2.8% |
8 | 2 | 1.9% |
HONO_NM
Text
Distinct | 3037 |
---|---|
Distinct (%) | 30.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
752 | 39 | 0.4% |
955 | 29 | 0.3% |
717 | 28 | 0.3% |
1149 | 27 | 0.3% |
176 | 25 | 0.2% |
36-1 | 25 | 0.2% |
693 | 25 | 0.2% |
1142 | 24 | 0.2% |
23 | 23 | 0.2% |
736 | 23 | 0.2% |
Other values (3027) | 9732 | 97.3% |
Most occurring characters
Value | Count | Frequency (%) |
(control character) | 10000 | 20.1% |
1 | 7425 | 14.9% |
- | 3949 | 7.9% |
2 | 3678 | 7.4% |
3 | 3522 | 7.1% |
5 | 3464 | 7.0% |
6 | 3248 | 6.5% |
7 | 3066 | 6.2% |
4 | 2964 | 6.0% |
0 | 2909 | 5.8% |
Other values (2) | 5502 | 11.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 35778 | 71.9% |
Control | 10000 | 20.1% |
Dash Punctuation | 3949 | 7.9% |
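The category names in this table are Unicode general categories. The Control count of 10000 equals the number of rows, which suggests that each value carries exactly one control character. Which character it is cannot be recovered from this report. The sketch below shows how such per-category counts can be produced with Python's `unicodedata` module. The sample strings are hypothetical, and the trailing tab is only a stand-in for the unknown control character.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter Unicode general-category codes used below.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Cc": "Control",
    "Lo": "Other Letter",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Hypothetical lot numbers; the trailing tab stands in for the unknown control character.
sample = ["752\t", "36-1\t", "1149\t"]
result = category_counts(sample)
```

If the control character is an import artifact, stripping whitespace from the column before profiling would remove it, but that is a guess about the source data.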
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 7425 | 20.8% |
2 | 3678 | 10.3% |
3 | 3522 | 9.8% |
5 | 3464 | 9.7% |
6 | 3248 | 9.1% |
7 | 3066 | 8.6% |
4 | 2964 | 8.3% |
0 | 2909 | 8.1% |
8 | 2804 | 7.8% |
9 | 2698 | 7.5% |
Control
Value | Count | Frequency (%) |
(control character) | 10000 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3949 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 49727 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
(control character) | 10000 | 20.1% |
1 | 7425 | 14.9% |
- | 3949 | 7.9% |
2 | 3678 | 7.4% |
3 | 3522 | 7.1% |
5 | 3464 | 7.0% |
6 | 3248 | 6.5% |
7 | 3066 | 6.2% |
4 | 2964 | 6.0% |
0 | 2909 | 5.8% |
Other values (2) | 5502 | 11.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 49727 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(control character) | 10000 | 20.1% |
1 | 7425 | 14.9% |
- | 3949 | 7.9% |
2 | 3678 | 7.4% |
3 | 3522 | 7.1% |
5 | 3464 | 7.0% |
6 | 3248 | 6.5% |
7 | 3066 | 6.2% |
4 | 2964 | 6.0% |
0 | 2909 | 5.8% |
Other values (2) | 5502 | 11.1% |
APT_NM
Text
Distinct | 3919 |
---|---|
Distinct (%) | 39.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
현대 | 60 | 0.6% |
벽산 | 38 | 0.4% |
주은풍림 | 37 | 0.4% |
동남 | 35 | 0.3% |
동탄역 | 34 | 0.3% |
옥정센트럴파크푸르지오 | 29 | 0.3% |
주은청설 | 27 | 0.3% |
산내마을9단지힐스테이트운정 | 27 | 0.3% |
삼성래미안 | 26 | 0.3% |
한신 | 24 | 0.2% |
Other values (3960) | 10002 | 96.7% |
Most occurring characters
Value | Count | Frequency (%) |
지 | 2042 | 2.8% |
1 | 1635 | 2.3% |
트 | 1608 | 2.2% |
마 | 1591 | 2.2% |
스 | 1560 | 2.1% |
이 | 1497 | 2.1% |
아 | 1449 | 2.0% |
단 | 1445 | 2.0% |
을 | 1437 | 2.0% |
동 | 1201 | 1.7% |
Other values (566) | 57182 | 78.7% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 62803 | 86.4% |
Decimal Number | 5292 | 7.3% |
Open Punctuation | 1190 | 1.6% |
Close Punctuation | 1190 | 1.6% |
Uppercase Letter | 890 | 1.2% |
Dash Punctuation | 355 | 0.5% |
Space Separator | 344 | 0.5% |
Lowercase Letter | 330 | 0.5% |
Other Punctuation | 178 | 0.2% |
Math Symbol | 49 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 2042 | 3.3% |
트 | 1608 | 2.6% |
마 | 1591 | 2.5% |
스 | 1560 | 2.5% |
이 | 1497 | 2.4% |
아 | 1449 | 2.3% |
단 | 1445 | 2.3% |
을 | 1437 | 2.3% |
동 | 1201 | 1.9% |
한 | 1121 | 1.8% |
Other values (500) | 47852 | 76.2% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 148 | 16.6% |
I | 88 | 9.9% |
K | 82 | 9.2% |
C | 81 | 9.1% |
E | 58 | 6.5% |
V | 53 | 6.0% |
L | 46 | 5.2% |
W | 45 | 5.1% |
B | 40 | 4.5% |
D | 38 | 4.3% |
Other values (15) | 211 | 23.7% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 207 | 62.7% |
k | 18 | 5.5% |
a | 17 | 5.2% |
r | 15 | 4.5% |
h | 10 | 3.0% |
i | 9 | 2.7% |
y | 9 | 2.7% |
w | 6 | 1.8% |
u | 6 | 1.8% |
t | 6 | 1.8% |
Other values (9) | 27 | 8.2% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1635 | 30.9% |
2 | 1149 | 21.7% |
3 | 544 | 10.3% |
5 | 400 | 7.6% |
0 | 341 | 6.4% |
4 | 340 | 6.4% |
6 | 280 | 5.3% |
7 | 215 | 4.1% |
9 | 206 | 3.9% |
8 | 182 | 3.4% |
Other Punctuation
Value | Count | Frequency (%) |
, | 133 | 74.7% |
. | 38 | 21.3% |
' | 4 | 2.2% |
& | 3 | 1.7% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 16 | 61.5% |
Ⅲ | 5 | 19.2% |
Ⅰ | 5 | 19.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1190 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1190 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 355 | 100.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 344 | 100.0% |
Math Symbol
Value | Count | Frequency (%) |
~ | 49 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 62802 | 86.4% |
Common | 8598 | 11.8% |
Latin | 1246 | 1.7% |
Han | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 2042 | 3.3% |
트 | 1608 | 2.6% |
마 | 1591 | 2.5% |
스 | 1560 | 2.5% |
이 | 1497 | 2.4% |
아 | 1449 | 2.3% |
단 | 1445 | 2.3% |
을 | 1437 | 2.3% |
동 | 1201 | 1.9% |
한 | 1121 | 1.8% |
Other values (499) | 47851 | 76.2% |
Latin
Value | Count | Frequency (%) |
e | 207 | 16.6% |
S | 148 | 11.9% |
I | 88 | 7.1% |
K | 82 | 6.6% |
C | 81 | 6.5% |
E | 58 | 4.7% |
V | 53 | 4.3% |
L | 46 | 3.7% |
W | 45 | 3.6% |
B | 40 | 3.2% |
Other values (37) | 398 | 31.9% |
Common
Value | Count | Frequency (%) |
1 | 1635 | 19.0% |
( | 1190 | 13.8% |
) | 1190 | 13.8% |
2 | 1149 | 13.4% |
3 | 544 | 6.3% |
5 | 400 | 4.7% |
- | 355 | 4.1% |
(space) | 344 | 4.0% |
0 | 341 | 4.0% |
4 | 340 | 4.0% |
Other values (9) | 1110 | 12.9% |
Han
Value | Count | Frequency (%) |
家 | 1 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 62802 | 86.4% |
ASCII | 9818 | 13.5% |
Number Forms | 26 | < 0.1% |
CJK | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
지 | 2042 | 3.3% |
트 | 1608 | 2.6% |
마 | 1591 | 2.5% |
스 | 1560 | 2.5% |
이 | 1497 | 2.4% |
아 | 1449 | 2.3% |
단 | 1445 | 2.3% |
을 | 1437 | 2.3% |
동 | 1201 | 1.9% |
한 | 1121 | 1.8% |
Other values (499) | 47851 | 76.2% |
ASCII
Value | Count | Frequency (%) |
1 | 1635 | 16.7% |
( | 1190 | 12.1% |
) | 1190 | 12.1% |
2 | 1149 | 11.7% |
3 | 544 | 5.5% |
5 | 400 | 4.1% |
- | 355 | 3.6% |
(space) | 344 | 3.5% |
0 | 341 | 3.5% |
4 | 340 | 3.5% |
Other values (53) | 2330 | 23.7% |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 16 | 61.5% |
Ⅲ | 5 | 19.2% |
Ⅰ | 5 | 19.2% |
CJK
Value | Count | Frequency (%) |
家 | 1 | 100.0% |
SMOEU
Real number (ℝ)
Distinct | 2596 |
---|---|
Distinct (%) | 26.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 71.386963 |
Minimum | 11.72 |
---|---|
Maximum | 270.25 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 11.72 |
---|---|
5-th percentile | 24.4095 |
Q1 | 59.1275 |
median | 72.6 |
Q3 | 84.94 |
95-th percentile | 120.82 |
Maximum | 270.25 |
Range | 258.53 |
Interquartile range (IQR) | 25.8125 |
Descriptive statistics
Standard deviation | 27.487324 |
---|---|
Coefficient of variation (CV) | 0.38504684 |
Kurtosis | 3.5277313 |
Mean | 71.386963 |
Median Absolute Deviation (MAD) | 12.64 |
Skewness | 0.85249405 |
Sum | 713869.63 |
Variance | 755.553 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
84.99 | 382 | 3.8% |
84.98 | 295 | 2.9% |
59.99 | 212 | 2.1% |
84.97 | 212 | 2.1% |
84.96 | 208 | 2.1% |
59.97 | 140 | 1.4% |
59.98 | 133 | 1.3% |
84.94 | 130 | 1.3% |
59.94 | 125 | 1.2% |
59.96 | 123 | 1.2% |
Other values (2586) | 8040 | 80.4% |
Minimum 10 values
Value | Count | Frequency (%) |
11.72 | 2 | < 0.1% |
11.74 | 1 | < 0.1% |
11.96 | 1 | < 0.1% |
12.02 | 1 | < 0.1% |
12.03 | 2 | < 0.1% |
12.04 | 2 | < 0.1% |
12.1 | 1 | < 0.1% |
12.11 | 1 | < 0.1% |
12.16 | 1 | < 0.1% |
12.19 | 4 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
270.25 | 2 | < 0.1% |
258.28 | 1 | < 0.1% |
244.55 | 1 | < 0.1% |
244.22 | 1 | < 0.1% |
244.07 | 1 | < 0.1% |
242.34 | 1 | < 0.1% |
240.98 | 1 | < 0.1% |
239.19 | 1 | < 0.1% |
235.31 | 1 | < 0.1% |
226.45 | 1 | < 0.1% |
SAPR
Text
Distinct | 1085 |
---|---|
Distinct (%) | 10.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
45,000 | 165 | 1.7% |
30,000 | 164 | 1.6% |
50,000 | 158 | 1.6% |
40,000 | 157 | 1.6% |
60,000 | 142 | 1.4% |
35,000 | 141 | 1.4% |
28,000 | 109 | 1.1% |
25,000 | 108 | 1.1% |
31,000 | 94 | 0.9% |
55,000 | 94 | 0.9% |
Other values (1075) | 8668 | 86.7% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 27437 | 45.6% |
, | 10000 | 16.6% |
5 | 4312 | 7.2% |
1 | 3093 | 5.1% |
2 | 2971 | 4.9% |
3 | 2916 | 4.8% |
4 | 2501 | 4.2% |
8 | 1845 | 3.1% |
7 | 1820 | 3.0% |
6 | 1780 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 50165 | 83.4% |
Other Punctuation | 10000 | 16.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 27437 | 54.7% |
5 | 4312 | 8.6% |
1 | 3093 | 6.2% |
2 | 2971 | 5.9% |
3 | 2916 | 5.8% |
4 | 2501 | 5.0% |
8 | 1845 | 3.7% |
7 | 1820 | 3.6% |
6 | 1780 | 3.5% |
9 | 1490 | 3.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 60165 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 27437 | 45.6% |
, | 10000 | 16.6% |
5 | 4312 | 7.2% |
1 | 3093 | 5.1% |
2 | 2971 | 4.9% |
3 | 2916 | 4.8% |
4 | 2501 | 4.2% |
8 | 1845 | 3.1% |
7 | 1820 | 3.0% |
6 | 1780 | 3.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 60165 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 27437 | 45.6% |
, | 10000 | 16.6% |
5 | 4312 | 7.2% |
1 | 3093 | 5.1% |
2 | 2971 | 4.9% |
3 | 2916 | 4.8% |
4 | 2501 | 4.2% |
8 | 1845 | 3.1% |
7 | 1820 | 3.0% |
6 | 1780 | 3.0% |
MOLIT_POTVALE_AMT
Text
Distinct | 1696 |
---|---|
Distinct (%) | 17.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
13,500 | 37 | 0.4% |
23,200 | 37 | 0.4% |
18,600 | 32 | 0.3% |
10,500 | 32 | 0.3% |
29,800 | 31 | 0.3% |
28,800 | 30 | 0.3% |
22,000 | 30 | 0.3% |
40,500 | 30 | 0.3% |
19,300 | 30 | 0.3% |
10,700 | 29 | 0.3% |
Other values (1686) | 9682 | 96.8% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 20938 | 35.5% |
, | 10000 | 16.9% |
1 | 4384 | 7.4% |
2 | 3963 | 6.7% |
3 | 3666 | 6.2% |
4 | 3307 | 5.6% |
5 | 2992 | 5.1% |
6 | 2549 | 4.3% |
9 | 2437 | 4.1% |
8 | 2436 | 4.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 49025 | 83.1% |
Other Punctuation | 10000 | 16.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 20938 | 42.7% |
1 | 4384 | 8.9% |
2 | 3963 | 8.1% |
3 | 3666 | 7.5% |
4 | 3307 | 6.7% |
5 | 2992 | 6.1% |
6 | 2549 | 5.2% |
9 | 2437 | 5.0% |
8 | 2436 | 5.0% |
7 | 2353 | 4.8% |
Other Punctuation
Value | Count | Frequency (%) |
, | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 59025 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 20938 | 35.5% |
, | 10000 | 16.9% |
1 | 4384 | 7.4% |
2 | 3963 | 6.7% |
3 | 3666 | 6.2% |
4 | 3307 | 5.6% |
5 | 2992 | 5.1% |
6 | 2549 | 4.3% |
9 | 2437 | 4.1% |
8 | 2436 | 4.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 59025 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 20938 | 35.5% |
, | 10000 | 16.9% |
1 | 4384 | 7.4% |
2 | 3963 | 6.7% |
3 | 3666 | 6.2% |
4 | 3307 | 5.6% |
5 | 2992 | 5.1% |
6 | 2549 | 4.3% |
9 | 2437 | 4.1% |
8 | 2436 | 4.1% |
POTVALE_RLT_RT
Text
Distinct | 99 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
72 | 295 | 2.9% |
67 | 293 | 2.9% |
68 | 284 | 2.8% |
69 | 283 | 2.8% |
71 | 281 | 2.8% |
76 | 280 | 2.8% |
66 | 272 | 2.7% |
64 | 267 | 2.7% |
70 | 264 | 2.6% |
65 | 263 | 2.6% |
Other values (89) | 7218 | 72.2% |
Most occurring characters
Value | Count | Frequency (%) |
% | 10000 | 32.9% |
6 | 3579 | 11.8% |
7 | 3568 | 11.7% |
8 | 3066 | 10.1% |
9 | 2208 | 7.3% |
5 | 2061 | 6.8% |
1 | 1434 | 4.7% |
0 | 1361 | 4.5% |
4 | 1130 | 3.7% |
3 | 999 | 3.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 20396 | 67.1% |
Other Punctuation | 10000 | 32.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
6 | 3579 | 17.5% |
7 | 3568 | 17.5% |
8 | 3066 | 15.0% |
9 | 2208 | 10.8% |
5 | 2061 | 10.1% |
1 | 1434 | 7.0% |
0 | 1361 | 6.7% |
4 | 1130 | 5.5% |
3 | 999 | 4.9% |
2 | 990 | 4.9% |
Other Punctuation
Value | Count | Frequency (%) |
% | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 30396 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
% | 10000 | 32.9% |
6 | 3579 | 11.8% |
7 | 3568 | 11.7% |
8 | 3066 | 10.1% |
9 | 2208 | 7.3% |
5 | 2061 | 6.8% |
1 | 1434 | 4.7% |
0 | 1361 | 4.5% |
4 | 1130 | 3.7% |
3 | 999 | 3.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 30396 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
% | 10000 | 32.9% |
6 | 3579 | 11.8% |
7 | 3568 | 11.7% |
8 | 3066 | 10.1% |
9 | 2208 | 7.3% |
5 | 2061 | 6.8% |
1 | 1434 | 4.7% |
0 | 1361 | 4.5% |
4 | 1130 | 3.7% |
3 | 999 | 3.3% |
APT_ACCTO_POTVALE_RLT_RT
Text
Distinct | 83 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
81 | 403 | 4.0% |
71 | 386 | 3.9% |
69 | 362 | 3.6% |
79 | 348 | 3.5% |
70 | 343 | 3.4% |
75 | 335 | 3.4% |
73 | 330 | 3.3% |
76 | 320 | 3.2% |
74 | 316 | 3.2% |
72 | 315 | 3.1% |
Other values (73) | 6542 | 65.4% |
Most occurring characters
Value | Count | Frequency (%) |
% | 10000 | 33.3% |
7 | 4163 | 13.8% |
6 | 3518 | 11.7% |
8 | 3412 | 11.4% |
9 | 2022 | 6.7% |
5 | 1664 | 5.5% |
1 | 1285 | 4.3% |
4 | 1036 | 3.4% |
3 | 1035 | 3.4% |
0 | 1027 | 3.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 20058 | 66.7% |
Other Punctuation | 10000 | 33.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
7 | 4163 | 20.8% |
6 | 3518 | 17.5% |
8 | 3412 | 17.0% |
9 | 2022 | 10.1% |
5 | 1664 | 8.3% |
1 | 1285 | 6.4% |
4 | 1036 | 5.2% |
3 | 1035 | 5.2% |
0 | 1027 | 5.1% |
2 | 896 | 4.5% |
Other Punctuation
Value | Count | Frequency (%) |
% | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 30058 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
% | 10000 | 33.3% |
7 | 4163 | 13.8% |
6 | 3518 | 11.7% |
8 | 3412 | 11.4% |
9 | 2022 | 6.7% |
5 | 1664 | 5.5% |
1 | 1285 | 4.3% |
4 | 1036 | 3.4% |
3 | 1035 | 3.4% |
0 | 1027 | 3.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 30058 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
% | 10000 | 33.3% |
7 | 4163 | 13.8% |
6 | 3518 | 11.7% |
8 | 3412 | 11.4% |
9 | 2022 | 6.7% |
5 | 1664 | 5.5% |
1 | 1285 | 4.3% |
4 | 1036 | 3.4% |
3 | 1035 | 3.4% |
0 | 1027 | 3.4% |
EMD_ACCTO_POTVALE_RLT_RT
Categorical
HIGH CORRELATION
 
Distinct | 48 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
77% | 654 |
---|---|
72% | 588 |
78% | 515 |
81% | 478 |
70% | 435 |
Other values (43) | 7330 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 79% |
---|---|
2nd row | 61% |
3rd row | 81% |
4th row | 71% |
5th row | 85% |
Common Values
Value | Count | Frequency (%) |
77% | 654 | 6.5% |
72% | 588 | 5.9% |
78% | 515 | 5.1% |
81% | 478 | 4.8% |
70% | 435 | 4.3% |
71% | 431 | 4.3% |
73% | 431 | 4.3% |
82% | 416 | 4.2% |
76% | 416 | 4.2% |
74% | 402 | 4.0% |
Other values (38) | 5234 | 52.3% |
Correlations
Variable | APT_POTVALE_RLT_RT_CMPR_INFO_NO | SIDO_NM | SIGNGU_NM | TNSHP_NM | SMOEU | POTVALE_RLT_RT | APT_ACCTO_POTVALE_RLT_RT | EMD_ACCTO_POTVALE_RLT_RT |
---|---|---|---|---|---|---|---|---|
APT_POTVALE_RLT_RT_CMPR_INFO_NO | 1.000 | 0.917 | 0.996 | 0.960 | 0.266 | 0.302 | 0.501 | 0.744 |
SIDO_NM | 0.917 | 1.000 | 0.999 | NaN | 0.247 | 0.179 | 0.323 | 0.571 |
SIGNGU_NM | 0.996 | 0.999 | 1.000 | 1.000 | 0.445 | 0.487 | 0.735 | 0.923 |
TNSHP_NM | 0.960 | NaN | 1.000 | 1.000 | 0.335 | 0.411 | 0.715 | 0.909 |
SMOEU | 0.266 | 0.247 | 0.445 | 0.335 | 1.000 | 0.312 | 0.463 | 0.334 |
POTVALE_RLT_RT | 0.302 | 0.179 | 0.487 | 0.411 | 0.312 | 1.000 | 0.968 | 0.648 |
APT_ACCTO_POTVALE_RLT_RT | 0.501 | 0.323 | 0.735 | 0.715 | 0.463 | 0.968 | 1.000 | 0.842 |
EMD_ACCTO_POTVALE_RLT_RT | 0.744 | 0.571 | 0.923 | 0.909 | 0.334 | 0.648 | 0.842 | 1.000 |
Variable | EMD_ACCTO_POTVALE_RLT_RT | SIDO_NM | TNSHP_NM |
---|---|---|---|
EMD_ACCTO_POTVALE_RLT_RT | 1.000 | 0.323 | 0.521 |
SIDO_NM | 0.323 | 1.000 | 1.000 |
TNSHP_NM | 0.521 | 1.000 | 1.000 |
Variable | APT_POTVALE_RLT_RT_CMPR_INFO_NO | SMOEU | SIDO_NM | TNSHP_NM | EMD_ACCTO_POTVALE_RLT_RT |
---|---|---|---|---|---|
APT_POTVALE_RLT_RT_CMPR_INFO_NO | 1.000 | 0.031 | 0.883 | 0.882 | 0.360 |
SMOEU | 0.031 | 1.000 | 0.152 | 0.137 | 0.120 |
SIDO_NM | 0.883 | 0.152 | 1.000 | 1.000 | 0.323 |
TNSHP_NM | 0.882 | 0.137 | 1.000 | 1.000 | 0.521 |
EMD_ACCTO_POTVALE_RLT_RT | 0.360 | 0.120 | 0.323 | 0.521 | 1.000 |
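For pairs of categorical variables such as SIDO_NM and TNSHP_NM, a widely used association measure is Cramér's V, which is 0 under independence and 1 when one variable fully determines the other. The sketch below implements the plain (uncorrected) statistic on toy labels. Whether the matrices above were computed with this exact statistic or a bias-corrected variant is an assumption, and the function name `cramers_v` is illustrative.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equally long sequences of category labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, n_r in row_totals.items():
        for c, n_c in col_totals.items():
            expected = n_r * n_c / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return float("nan")  # undefined when either variable is constant
    return sqrt(chi2 / (n * k))

# Toy labels: a perfectly dependent pair and an independent pair.
perfect = cramers_v(["A", "A", "B", "B"], ["x", "x", "y", "y"])
independent = cramers_v(["A", "A", "B", "B"], ["x", "y", "x", "y"])
```

A value of 1.000 between a province and a district column would be expected under this measure if each district belongs to exactly one province.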
Sample
Index | APT_POTVALE_RLT_RT_CMPR_INFO_NO | SIDO_NM | SIGNGU_NM | TNSHP_NM | EMD_NM | HONO_NM | APT_NM | SMOEU | SAPR | MOLIT_POTVALE_AMT | POTVALE_RLT_RT | APT_ACCTO_POTVALE_RLT_RT | EMD_ACCTO_POTVALE_RLT_RT |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
618 | 619 | 서울특별시 | 강서구 | <NA> | 마곡동 | 748 | 마곡13단지힐스테이트마스터 | 84.98 | 113,000 | 104,900 | 93% | 89% | 79% |
21919 | 21920 | 경기도 | 포천시 | <NA> | 신북면 | 101-6 | 신포천 | 50.67 | 6,500 | 3,500 | 54% | 59% | 61% |
15591 | 15592 | 경기도 | 안양시 | 동안구 | 관양동 | 1589 | 한가람(세경) | 49.68 | 47,000 | 45,300 | 96% | 86% | 81% |
3680 | 3681 | 인천광역시 | 계양구 | <NA> | 계산동 | 62-1 | 현대 | 84.95 | 37,800 | 30,300 | 80% | 75% | 71% |
12408 | 12409 | 경기도 | 수원시 | 권선구 | 호매실동 | 1408 | 엘에이치호매실스타힐스 | 59.98 | 38,500 | 34,200 | 89% | 84% | 85% |
9396 | 9397 | 경기도 | 군포시 | <NA> | 당동 | 954 | 무지개마을대림 | 84.87 | 55,000 | 42,300 | 77% | 79% | 70% |
21925 | 21926 | 경기도 | 포천시 | <NA> | 신북면 | 101-6 | 신포천 | 44.22 | 5,850 | 3,280 | 56% | 59% | 61% |
9430 | 9431 | 경기도 | 군포시 | <NA> | 대야미동 | 652-7 | 신안실크밸리(28-7) | 59.92 | 37,000 | 23,900 | 65% | 65% | 73% |
20539 | 20540 | 경기도 | 평택시 | <NA> | 서정동 | 787-2 | 대옥3 | 52.23 | 13,700 | 5,860 | 43% | 44% | 57% |
21320 | 21321 | 경기도 | 평택시 | <NA> | 청북읍 | 1104 | 부영사랑으로2단지 | 59.96 | 15,900 | 13,500 | 85% | 74% | 71% |
Index | APT_POTVALE_RLT_RT_CMPR_INFO_NO | SIDO_NM | SIGNGU_NM | TNSHP_NM | EMD_NM | HONO_NM | APT_NM | SMOEU | SAPR | MOLIT_POTVALE_AMT | POTVALE_RLT_RT | APT_ACCTO_POTVALE_RLT_RT | EMD_ACCTO_POTVALE_RLT_RT |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
898 | 899 | 서울특별시 | 광진구 | <NA> | 화양동 | 110-37 | 화양타워 | 105.04 | 74,000 | 41,600 | 56% | 56% | 67% |
1224 | 1225 | 서울특별시 | 노원구 | <NA> | 상계동 | 1320 | 미라보(성림) | 60.0 | 42,500 | 31,200 | 73% | 73% | 77% |
7484 | 7485 | 인천광역시 | 중구 | <NA> | 인현동 | 3-2 | 뉴코아 | 60.0 | 13,000 | 8,440 | 65% | 59% | 59% |
16346 | 16347 | 경기도 | 양주시 | <NA> | 옥정동 | 1051 | e편한세상옥정에듀써밋 | 74.98 | 38,800 | 36,000 | 93% | 91% | 91% |
10998 | 10999 | 경기도 | 부천시 | <NA> | 고강동 | 367-5 | 건일 | 45.0 | 17,500 | 11,100 | 63% | 59% | 63% |
22687 | 22688 | 경기도 | 화성시 | <NA> | 병점동 | 859 | 병점역에듀포레 | 75.99 | 31,500 | 25,600 | 81% | 82% | 80% |
19946 | 19947 | 경기도 | 파주시 | <NA> | 문산읍 | 1352 | 양우내안애3단지 | 59.93 | 19,300 | 11,500 | 60% | 59% | 58% |
6458 | 6459 | 인천광역시 | 서구 | <NA> | 석남동 | 559 | 경인 | 44.82 | 13,300 | 7,960 | 60% | 60% | 70% |
11203 | 11204 | 경기도 | 부천시 | <NA> | 상동 | 413 | 상동스카이뷰자이 | 84.96 | 78,000 | 59,800 | 77% | 73% | 80% |
21930 | 21931 | 경기도 | 포천시 | <NA> | 신북면 | 101-3 | 후레쉬빌 | 49.97 | 8,450 | 5,080 | 60% | 64% | 61% |
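SAPR, MOLIT_POTVALE_AMT and the three rate columns are profiled as Text because their values contain thousands separators or a percent sign. In the sample rows above, POTVALE_RLT_RT appears to equal MOLIT_POTVALE_AMT divided by SAPR, rounded to a whole percent. The sketch below parses a few rows and checks that relationship. The relationship and the rounding rule are inferred from the samples, not documented in this report, and the helper names are hypothetical.

```python
def parse_amount(text):
    """Convert a string such as '113,000' to the integer 113000."""
    return int(text.replace(",", ""))

def parse_percent(text):
    """Convert a string such as '93%' to the integer 93."""
    return int(text.rstrip("%"))

# (SAPR, MOLIT_POTVALE_AMT, POTVALE_RLT_RT) copied from four sample rows.
rows = [
    ("113,000", "104,900", "93%"),
    ("6,500", "3,500", "54%"),
    ("47,000", "45,300", "96%"),
    ("37,000", "23,900", "65%"),
]

# Recompute the rate as MOLIT_POTVALE_AMT / SAPR, in whole percent.
recomputed = [round(100 * parse_amount(m) / parse_amount(s)) for s, m, _ in rows]
reported = [parse_percent(p) for _, _, p in rows]
```

Converting these columns to numeric types before profiling would let the report compute quantiles and correlations for them directly.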