Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 153 |
Missing cells (%) | 0.2% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 683.6 KiB |
Average record size in memory | 70.0 B |
Variable types
Text | 1 |
---|---|
Numeric | 6 |
Dataset
Description | Sample |
---|---|
Author | 스페이스워크 |
URL | https://www.bigdata-realestate.kr/rebpp/usr/prd/prdInfoDetail.do?req_productId=65 |
CRS_AREA_DIMS is highly overall correlated with BULD_CNT_TOT | High correlation |
LA is highly overall correlated with LO | High correlation |
LO is highly overall correlated with LA | High correlation |
BULD_CNT_TOT is highly overall correlated with CRS_AREA_DIMS and 2 other fields | High correlation |
SPANUAT_BULD_CNT_TOT is highly overall correlated with BULD_CNT_TOT | High correlation |
AREA_ISE_NMHSH is highly overall correlated with BULD_CNT_TOT | High correlation |
CRS_AREA_DIMS is highly skewed (γ1 = 41.16282147) | Skewed |
CRS_AREA_CD has unique values | Unique |
CRS_AREA_DIMS has unique values | Unique |
LA has unique values | Unique |
LO has unique values | Unique |
BULD_CNT_TOT has 2090 (20.9%) zeros | Zeros |
SPANUAT_BULD_CNT_TOT has 2880 (28.8%) zeros | Zeros |
AREA_ISE_NMHSH has 5546 (55.5%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-11 22:32:34.997656 |
---|---|
Analysis finished | 2023-12-11 22:32:41.209428 |
Duration | 6.21 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
CRS_AREA_CD
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 20 |
---|---|
Median length | 20 |
Mean length | 20 |
Min length | 20 |
Characters and Unicode
Total characters | 200000 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | G1168010100107650000 |
---|---|
2nd row | G2641010300103780001 |
3rd row | G1120011100111050000 |
4th row | G1126010200101050033 |
5th row | G1171010700100290000 |
Value | Count | Frequency (%) |
g1168010100107650000 | 1 | < 0.1% |
g2629010900104860028 | 1 | < 0.1% |
g1114016200103690118 | 1 | < 0.1% |
g1159010200101860006 | 1 | < 0.1% |
g2644010400118750004 | 1 | < 0.1% |
g1150010400114740000 | 1 | < 0.1% |
g2641011000103021527 | 1 | < 0.1% |
g1174011000105750005 | 1 | < 0.1% |
g1130510100103190005 | 1 | < 0.1% |
g1168010100107490005 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 79367 | |
1 | 48204 | |
2 | 14210 | 7.1% |
G | 10000 | 5.0% |
6 | 9373 | 4.7% |
3 | 9361 | 4.7% |
4 | 8325 | 4.2% |
5 | 7311 | 3.7% |
7 | 4928 | 2.5% |
8 | 4601 | 2.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 190000 | |
Uppercase Letter | 10000 | 5.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 79367 | |
1 | 48204 | |
2 | 14210 | 7.5% |
6 | 9373 | 4.9% |
3 | 9361 | 4.9% |
4 | 8325 | 4.4% |
5 | 7311 | 3.8% |
7 | 4928 | 2.6% |
8 | 4601 | 2.4% |
9 | 4320 | 2.3% |
Uppercase Letter
Value | Count | Frequency (%) |
G | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 190000 | |
Latin | 10000 | 5.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 79367 | |
1 | 48204 | |
2 | 14210 | 7.5% |
6 | 9373 | 4.9% |
3 | 9361 | 4.9% |
4 | 8325 | 4.4% |
5 | 7311 | 3.8% |
7 | 4928 | 2.6% |
8 | 4601 | 2.4% |
9 | 4320 | 2.3% |
Latin
Value | Count | Frequency (%) |
G | 10000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 200000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 79367 | |
1 | 48204 | |
2 | 14210 | 7.1% |
G | 10000 | 5.0% |
6 | 9373 | 4.7% |
3 | 9361 | 4.7% |
4 | 8325 | 4.2% |
5 | 7311 | 3.7% |
7 | 4928 | 2.5% |
8 | 4601 | 2.3% |
CRS_AREA_DIMS
Real number (ℝ)
HIGH CORRELATION
  SKEWED
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5568.5241 |
Minimum | 0.22903138 |
---|---|
Maximum | 1730382.1 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0.22903138 |
---|---|
5-th percentile | 30.435615 |
Q1 | 649.15909 |
median | 1772.9828 |
Q3 | 4019.1591 |
95-th percentile | 18522.654 |
Maximum | 1730382.1 |
Range | 1730381.9 |
Interquartile range (IQR) | 3370 |
Descriptive statistics
Standard deviation | 29545.893 |
---|---|
Coefficient of variation (CV) | 5.305875 |
Kurtosis | 2269.0518 |
Mean | 5568.5241 |
Median Absolute Deviation (MAD) | 1384.7974 |
Skewness | 41.162821 |
Sum | 55685241 |
Variance | 8.7295977 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
16308.737853343637 | 1 | < 0.1% |
2854.6844998963143 | 1 | < 0.1% |
470.0022844113294 | 1 | < 0.1% |
77292.31035351618 | 1 | < 0.1% |
1042.6228563213613 | 1 | < 0.1% |
38.70780647479718 | 1 | < 0.1% |
1471.87841063393 | 1 | < 0.1% |
210.60597259329097 | 1 | < 0.1% |
2328.1762080354706 | 1 | < 0.1% |
5331.607092959873 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
0.2290313755413465 | 1 | |
0.2361778057972345 | 1 | |
0.2380810857910331 | 1 | |
0.3684492278014744 | 1 | |
0.3695113233139952 | 1 | |
0.4713636498889594 | 1 | |
0.6053744259714501 | 1 | |
0.6379450571384185 | 1 | |
0.6781792542362659 | 1 | |
0.7059860263085205 | 1 |
Value | Count | Frequency (%) |
1730382.144342454 | 1 | |
1700191.9751874544 | 1 | |
546695.7861630644 | 1 | |
404193.3930710859 | 1 | |
369108.92236382246 | 1 | |
368497.34197423374 | 1 | |
309581.8082353687 | 1 | |
277825.4899256274 | 1 | |
271614.41915045073 | 1 | |
262757.7141918418 | 1 |
LA
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 36.751547 |
Minimum | 34.380887 |
---|---|
Maximum | 37.690874 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 34.380887 |
---|---|
5-th percentile | 35.093583 |
Q1 | 35.203235 |
median | 37.506302 |
Q3 | 37.57025 |
95-th percentile | 37.633305 |
Maximum | 37.690874 |
Range | 3.3099874 |
Interquartile range (IQR) | 2.3670146 |
Descriptive statistics
Standard deviation | 1.1315413 |
---|---|
Coefficient of variation (CV) | 0.030788943 |
Kurtosis | -1.4954333 |
Mean | 36.751547 |
Median Absolute Deviation (MAD) | 0.085460709 |
Skewness | -0.70205035 |
Sum | 367515.47 |
Variance | 1.2803857 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
37.49674618975183 | 1 | < 0.1% |
37.49569687831988 | 1 | < 0.1% |
35.09888873527085 | 1 | < 0.1% |
37.56566291588756 | 1 | < 0.1% |
35.215831587397645 | 1 | < 0.1% |
37.56406512201457 | 1 | < 0.1% |
37.62166193725834 | 1 | < 0.1% |
35.14347972594845 | 1 | < 0.1% |
37.4731003481389 | 1 | < 0.1% |
37.67492386394811 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
34.38088684534138 | 1 | |
34.83616169228189 | 1 | |
34.98017905576831 | 1 | |
34.98090288619798 | 1 | |
34.98149466279844 | 1 | |
35.00478319677566 | 1 | |
35.00565118965179 | 1 | |
35.00956831066665 | 1 | |
35.011535783807346 | 1 | |
35.01176942785997 | 1 |
Value | Count | Frequency (%) |
37.69087425039329 | 1 | |
37.69013726449748 | 1 | |
37.68983247483298 | 1 | |
37.68852251434603 | 1 | |
37.688511145071 | 1 | |
37.687831489778944 | 1 | |
37.68660126990064 | 1 | |
37.68627425272608 | 1 | |
37.68604284707776 | 1 | |
37.685684887543815 | 1 |
LO
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 127.67087 |
Minimum | 126.19799 |
---|---|
Maximum | 129.25289 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 126.19799 |
---|---|
5-th percentile | 126.85971 |
Q1 | 126.95731 |
median | 127.05167 |
Q3 | 128.99998 |
95-th percentile | 129.09981 |
Maximum | 129.25289 |
Range | 3.054903 |
Interquartile range (IQR) | 2.0426706 |
Descriptive statistics
Standard deviation | 0.96602536 |
---|---|
Coefficient of variation (CV) | 0.0075665292 |
Kurtosis | -1.484753 |
Mean | 127.67087 |
Median Absolute Deviation (MAD) | 0.13011604 |
Skewness | 0.69498995 |
Sum | 1276708.7 |
Variance | 0.93320501 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
127.05168685930576 | 1 | < 0.1% |
127.03572530231024 | 1 | < 0.1% |
128.91091256206224 | 1 | < 0.1% |
126.85568561820742 | 1 | < 0.1% |
129.09713199034235 | 1 | < 0.1% |
127.1734811968142 | 1 | < 0.1% |
127.02502041531186 | 1 | < 0.1% |
129.0677887695764 | 1 | < 0.1% |
126.93465336793268 | 1 | < 0.1% |
127.04318850714762 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
126.1979900933286 | 1 | |
126.50857736942498 | 1 | |
126.75662862116948 | 1 | |
126.76734177921698 | 1 | |
126.76802879186206 | 1 | |
126.77461537517144 | 1 | |
126.77703896931688 | 1 | |
126.77719766855438 | 1 | |
126.7794565169882 | 1 | |
126.77955976740569 | 1 |
Value | Count | Frequency (%) |
129.2528930627633 | 1 | |
129.2283402455202 | 1 | |
129.20515853973768 | 1 | |
129.20488805629304 | 1 | |
129.20450926077208 | 1 | |
129.20441092685377 | 1 | |
129.20427552977932 | 1 | |
129.2041576475478 | 1 | |
129.2039889867936 | 1 | |
129.20266775705323 | 1 |
BULD_CNT_TOT
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 97 |
---|---|
Distinct (%) | 1.0% |
Missing | 51 |
Missing (%) | 0.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9.2362046 |
Minimum | 0 |
---|---|
Maximum | 234 |
Zeros | 2090 |
Zeros (%) | 20.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 6 |
Q3 | 13 |
95-th percentile | 30 |
Maximum | 234 |
Range | 234 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 11.623789 |
---|---|
Coefficient of variation (CV) | 1.2585028 |
Kurtosis | 31.870581 |
Mean | 9.2362046 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 3.7476646 |
Sum | 91891 |
Variance | 135.11248 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2090 | |
1 | 623 | 6.2% |
6 | 517 | 5.2% |
4 | 495 | 5.0% |
8 | 479 | 4.8% |
2 | 472 | 4.7% |
7 | 470 | 4.7% |
3 | 458 | 4.6% |
5 | 439 | 4.4% |
10 | 376 | 3.8% |
Other values (87) | 3530 |
Value | Count | Frequency (%) |
0 | 2090 | |
1 | 623 | 6.2% |
2 | 472 | 4.7% |
3 | 458 | 4.6% |
4 | 495 | 5.0% |
5 | 439 | 4.4% |
6 | 517 | 5.2% |
7 | 470 | 4.7% |
8 | 479 | 4.8% |
9 | 334 | 3.3% |
Value | Count | Frequency (%) |
234 | 1 | < 0.1% |
158 | 1 | < 0.1% |
139 | 2 | |
126 | 1 | < 0.1% |
122 | 1 | < 0.1% |
119 | 3 | |
112 | 1 | < 0.1% |
109 | 1 | < 0.1% |
107 | 1 | < 0.1% |
103 | 1 | < 0.1% |
SPANUAT_BULD_CNT_TOT
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 80 |
---|---|
Distinct (%) | 0.8% |
Missing | 51 |
Missing (%) | 0.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.2538949 |
Minimum | 0 |
---|---|
Maximum | 140 |
Zeros | 2880 |
Zeros (%) | 28.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 4 |
Q3 | 9 |
95-th percentile | 22 |
Maximum | 140 |
Range | 140 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 8.9127402 |
---|---|
Coefficient of variation (CV) | 1.4251503 |
Kurtosis | 24.822146 |
Mean | 6.2538949 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 3.6286321 |
Sum | 62220 |
Variance | 79.436938 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2880 | |
1 | 788 | 7.9% |
2 | 656 | 6.6% |
3 | 646 | 6.5% |
4 | 595 | 5.9% |
5 | 546 | 5.5% |
6 | 489 | 4.9% |
7 | 440 | 4.4% |
8 | 390 | 3.9% |
9 | 326 | 3.3% |
Other values (70) | 2193 |
Value | Count | Frequency (%) |
0 | 2880 | |
1 | 788 | 7.9% |
2 | 656 | 6.6% |
3 | 646 | 6.5% |
4 | 595 | 5.9% |
5 | 546 | 5.5% |
6 | 489 | 4.9% |
7 | 440 | 4.4% |
8 | 390 | 3.9% |
9 | 326 | 3.3% |
Value | Count | Frequency (%) |
140 | 1 | |
123 | 1 | |
117 | 1 | |
112 | 1 | |
95 | 1 | |
92 | 2 | |
91 | 1 | |
86 | 1 | |
82 | 1 | |
75 | 1 |
AREA_ISE_NMHSH
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 488 |
---|---|
Distinct (%) | 4.9% |
Missing | 51 |
Missing (%) | 0.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.965122 |
Minimum | 0 |
---|---|
Maximum | 5656 |
Zeros | 5546 |
Zeros (%) | 55.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 23 |
95-th percentile | 142 |
Maximum | 5656 |
Range | 5656 |
Interquartile range (IQR) | 23 |
Descriptive statistics
Standard deviation | 190.82527 |
---|---|
Coefficient of variation (CV) | 4.6582376 |
Kurtosis | 282.39003 |
Mean | 40.965122 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 13.957549 |
Sum | 407562 |
Variance | 36414.285 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5546 | |
8 | 186 | 1.9% |
16 | 140 | 1.4% |
1 | 139 | 1.4% |
12 | 112 | 1.1% |
10 | 108 | 1.1% |
6 | 106 | 1.1% |
4 | 94 | 0.9% |
18 | 92 | 0.9% |
9 | 89 | 0.9% |
Other values (478) | 3337 |
Value | Count | Frequency (%) |
0 | 5546 | |
1 | 139 | 1.4% |
2 | 79 | 0.8% |
3 | 74 | 0.7% |
4 | 94 | 0.9% |
5 | 45 | 0.4% |
6 | 106 | 1.1% |
7 | 88 | 0.9% |
8 | 186 | 1.9% |
9 | 89 | 0.9% |
Value | Count | Frequency (%) |
5656 | 1 | |
5385 | 1 | |
4586 | 1 | |
4492 | 1 | |
4098 | 1 | |
4066 | 1 | |
2777 | 1 | |
2727 | 1 | |
2694 | 1 | |
2594 | 1 |
CRS_AREA_DIMS | LA | LO | BULD_CNT_TOT | SPANUAT_BULD_CNT_TOT | AREA_ISE_NMHSH | |
---|---|---|---|---|---|---|
CRS_AREA_DIMS | 1.000 | 0.066 | 0.010 | 0.399 | 0.248 | 0.368 |
LA | 0.066 | 1.000 | 0.883 | 0.000 | 0.000 | 0.000 |
LO | 0.010 | 0.883 | 1.000 | 0.017 | 0.060 | 0.000 |
BULD_CNT_TOT | 0.399 | 0.000 | 0.017 | 1.000 | 0.860 | 0.518 |
SPANUAT_BULD_CNT_TOT | 0.248 | 0.000 | 0.060 | 0.860 | 1.000 | 0.198 |
AREA_ISE_NMHSH | 0.368 | 0.000 | 0.000 | 0.518 | 0.198 | 1.000 |
CRS_AREA_DIMS | LA | LO | BULD_CNT_TOT | SPANUAT_BULD_CNT_TOT | AREA_ISE_NMHSH | |
---|---|---|---|---|---|---|
CRS_AREA_DIMS | 1.000 | -0.006 | -0.055 | 0.615 | 0.437 | 0.490 |
LA | -0.006 | 1.000 | -0.584 | 0.100 | 0.116 | 0.153 |
LO | -0.055 | -0.584 | 1.000 | -0.126 | -0.111 | -0.184 |
BULD_CNT_TOT | 0.615 | 0.100 | -0.126 | 1.000 | 0.892 | 0.592 |
SPANUAT_BULD_CNT_TOT | 0.437 | 0.116 | -0.111 | 0.892 | 1.000 | 0.425 |
AREA_ISE_NMHSH | 0.490 | 0.153 | -0.184 | 0.592 | 0.425 | 1.000 |
CRS_AREA_CD | CRS_AREA_DIMS | LA | LO | BULD_CNT_TOT | SPANUAT_BULD_CNT_TOT | AREA_ISE_NMHSH | |
---|---|---|---|---|---|---|---|
51728 | G1168010100107650000 | 16308.737853 | 37.496746 | 127.051687 | 20 | 4 | 225 |
78793 | G2641010300103780001 | 56.148851 | 35.280733 | 129.085052 | 0 | 0 | 0 |
8127 | G1120011100111050000 | 4294.891812 | 37.551302 | 127.020881 | 31 | 26 | 23 |
13464 | G1126010200101050033 | 887.232934 | 37.594129 | 127.089179 | 7 | 7 | 0 |
53791 | G1171010700100290000 | 3722.735622 | 37.500723 | 127.123337 | 14 | 6 | 57 |
26756 | G1138010400102850220 | 1299.286824 | 37.626522 | 126.912543 | 5 | 5 | 20 |
6981 | G1120010700101280172 | 221.395839 | 37.557096 | 127.037785 | 0 | 0 | 0 |
63582 | G2620010200101700002 | 2387.638327 | 35.090825 | 129.041131 | 26 | 25 | 0 |
32740 | G1144012500102000384 | 1.988755 | 37.56838 | 126.908075 | 0 | 0 | 0 |
8185 | G1120011500108120000 | 1544.411628 | 37.533721 | 127.055246 | 0 | 0 | 0 |
CRS_AREA_CD | CRS_AREA_DIMS | LA | LO | BULD_CNT_TOT | SPANUAT_BULD_CNT_TOT | AREA_ISE_NMHSH | |
---|---|---|---|---|---|---|---|
4825 | G1117013000102100001 | 5398.931544 | 37.540324 | 126.992929 | 21 | 17 | 6 |
86353 | G2644010400134550001 | 2202.620387 | 35.095786 | 128.923399 | 7 | 0 | 5 |
64213 | G2626010700105330001 | 3933.801775 | 35.205791 | 129.079069 | 28 | 14 | 8 |
22231 | G1132010800105560076 | 1495.058966 | 37.686043 | 127.042909 | 8 | 5 | 18 |
65809 | G2623010400111940094 | 3924.139441 | 35.149714 | 129.050561 | 19 | 17 | 0 |
48026 | G1162010100106610020 | 3210.665488 | 37.488654 | 126.933328 | 11 | 6 | 62 |
64537 | G2620010700101330005 | 26.388322 | 35.084487 | 129.038778 | 0 | 0 | 0 |
43742 | G1156013300109060008 | 1528.035884 | 37.497152 | 126.905289 | 11 | 8 | 0 |
36317 | G1150010300108110001 | 5528.693993 | 37.533782 | 126.859958 | 21 | 11 | 107 |
35076 | G1150010900108290001 | 2130.301228 | 37.57757 | 126.812164 | 0 | 0 | 0 |