Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 507.8 KiB |
Average record size in memory | 52.0 B |
Variable types
Categorical | 2 |
---|---|
Numeric | 3 |
Dataset
Description | Sample |
---|---|
Author | 오아시스비즈니스 |
URL | https://www.bigdata-realestate.kr/rebpp/usr/prd/prdInfoDetail.do?req_productId=75 |
data_strd_ym has constant value "202307" | Constant |
pnu is highly overall correlated with legaldong_cd | High correlation |
legaldong_cd is highly overall correlated with pnu | High correlation |
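The perfect correlation between pnu and legaldong_cd looks structural rather than statistical: in the sample rows of this report, legaldong_cd equals the first 8 digits of the 19-digit pnu. Below is a minimal Python sketch of that check, using pairs copied from the sample rows. The helper name `legaldong_prefix` is hypothetical, and the prefix relationship is an observation from these rows, not a documented rule of the dataset.

```python
# (pnu, legaldong_cd) pairs copied from the sample rows of this report.
SAMPLE_ROWS = [
    (1121510500105530228, 11215105),
    (1132010700106500065, 11320107),
    (1111011100101560001, 11110111),
    (1135010500102920001, 11350105),
    (1126010100101920125, 11260101),
]


def legaldong_prefix(pnu: int, width: int = 8) -> int:
    """Return the leading `width` digits of a pnu (hypothetical helper)."""
    return int(str(pnu)[:width])


# Collect every pair where the prefix does not match the reported code.
mismatches = [(p, c) for p, c in SAMPLE_ROWS if legaldong_prefix(p) != c]
print(mismatches)  # prints []
```

If the relationship holds for the full dataset, one of the two columns is redundant for modelling purposes, which is what the High correlation alert is flagging.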
Reproduction
Analysis started | 2023-12-11 22:32:01.189494 |
---|---|
Analysis finished | 2023-12-11 22:32:04.097199 |
Duration | 2.91 seconds |
Software version | ydata-profiling v4.5.1 |
data_strd_ym
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
202307 | 10000 |
---|---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 202307 |
---|---|
2nd row | 202307 |
3rd row | 202307 |
4th row | 202307 |
5th row | 202307 |
Common Values
Value | Count | Frequency (%) |
202307 | 10000 | 100.0% |
pnu
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9189 |
---|---|
Distinct (%) | 91.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1299028 × 10^18 |
Minimum | 1.1110101 × 10^18 |
---|---|
Maximum | 1.1500103 × 10^18 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.1110101 × 10^18 |
---|---|
5-th percentile | 1.1110163 × 10^18 |
Q1 | 1.1215101 × 10^18 |
median | 1.1290136 × 10^18 |
Q3 | 1.1410112 × 10^18 |
95-th percentile | 1.1470103 × 10^18 |
Maximum | 1.1500103 × 10^18 |
Range | 3.90002 × 10^16 |
Interquartile range (IQR) | 1.95011 × 10^16 |
Descriptive statistics
Standard deviation | 1.1637862 × 10^16 |
---|---|
Coefficient of variation (CV) | 0.01029988 |
Kurtosis | -1.1591914 |
Mean | 1.1299028 × 10^18 |
Median Absolute Deviation (MAD) | 9.0021 × 10^15 |
Skewness | 0.057839912 |
Sum | -8.8265033 × 10^18 |
Variance | 1.3543984 × 10^32 |
Monotonicity | Not monotonic |
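The negative Sum reported for pnu cannot be a true total of 10000 positive values. It is consistent with 64-bit signed integer wraparound: Mean × count is about 1.13 × 10^22, far beyond the int64 maximum of roughly 9.22 × 10^18. The sketch below emulates that wraparound in plain Python using the rounded mean from this report. That the column was stored as int64 is an assumption, and `wrap_int64` is a hypothetical helper.

```python
INT64_MOD = 2**64
INT64_MAX = 2**63 - 1


def wrap_int64(value: int) -> int:
    """Reduce an exact integer to what a signed 64-bit integer would hold."""
    value %= INT64_MOD
    return value - INT64_MOD if value > INT64_MAX else value


# Rounded figures from the report: Mean is 1.1299028 x 10^18 over 10000 rows.
mean_pnu = 11299028 * 10**11
n_rows = 10_000

true_sum = mean_pnu * n_rows        # exact Python int, about 1.13e22
wrapped_sum = wrap_int64(true_sum)  # what an overflowing int64 sum would show

print(f"{true_sum:.4e}")
print(f"{wrapped_sum:.4e}")
```

The wrapped value lands within the rounding error of the reported Sum. Since pnu is an identifier rather than a quantity, its sum, mean, and variance carry no meaning, and reading the column as a string would avoid the overflow altogether.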
Value | Count | Frequency (%) |
1123010200107900000 | 5 | 0.1% |
1150010200107170000 | 5 | 0.1% |
1126010600106480000 | 5 | 0.1% |
1117012900103010053 | 4 | < 0.1% |
1135010500112620000 | 4 | < 0.1% |
1114010900100010000 | 4 | < 0.1% |
1126010600104780022 | 4 | < 0.1% |
1120011400106560003 | 4 | < 0.1% |
1147010200109070014 | 4 | < 0.1% |
1144012700116050000 | 4 | < 0.1% |
Other values (9179) | 9957 | 99.6% |
Value | Count | Frequency (%) |
1111010100100660000 | 1 | < 0.1% |
1111010100101310000 | 1 | < 0.1% |
1111010200100360000 | 1 | < 0.1% |
1111010200100570000 | 1 | < 0.1% |
1111010400100530000 | 1 | < 0.1% |
1111010400101630001 | 1 | < 0.1% |
1111010400101640007 | 1 | < 0.1% |
1111010500100980006 | 1 | < 0.1% |
1111010500101550000 | 1 | < 0.1% |
1111010500101580007 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1150010300109060014 | 1 | < 0.1% |
1150010300109050010 | 2 | < 0.1% |
1150010300109050005 | 1 | < 0.1% |
1150010300109050004 | 1 | < 0.1% |
1150010300109040012 | 1 | < 0.1% |
1150010300109040011 | 1 | < 0.1% |
1150010300109040001 | 1 | < 0.1% |
1150010300109020002 | 1 | < 0.1% |
1150010300109010015 | 1 | < 0.1% |
1150010300108990027 | 2 | < 0.1% |
legaldong_cd
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 329 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11299028 |
Minimum | 11110101 |
---|---|
Maximum | 11500103 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 11110101 |
---|---|
5-th percentile | 11110163 |
Q1 | 11215101 |
median | 11290136 |
Q3 | 11410112 |
95-th percentile | 11470103 |
Maximum | 11500103 |
Range | 390002 |
Interquartile range (IQR) | 195011 |
Descriptive statistics
Standard deviation | 116378.62 |
---|---|
Coefficient of variation (CV) | 0.01029988 |
Kurtosis | -1.1591914 |
Mean | 11299028 |
Median Absolute Deviation (MAD) | 90021 |
Skewness | 0.057839912 |
Sum | 1.1299028 × 10^11 |
Variance | 1.3543984 × 10^10 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11260101 | 259 | 2.6% |
11500103 | 251 | 2.5% |
11470101 | 248 | 2.5% |
11305103 | 240 | 2.4% |
11440120 | 228 | 2.3% |
11470102 | 226 | 2.3% |
11350105 | 225 | 2.2% |
11470103 | 216 | 2.2% |
11305101 | 204 | 2.0% |
11230106 | 197 | 2.0% |
Other values (319) | 7706 | 77.1% |
Value | Count | Frequency (%) |
11110101 | 2 | < 0.1% |
11110102 | 2 | < 0.1% |
11110104 | 3 | < 0.1% |
11110105 | 3 | < 0.1% |
11110106 | 9 | 0.1% |
11110107 | 9 | 0.1% |
11110108 | 16 | 0.2% |
11110109 | 2 | < 0.1% |
11110110 | 11 | 0.1% |
11110111 | 6 | 0.1% |
Value | Count | Frequency (%) |
11500103 | 251 | 2.5% |
11500102 | 101 | 1.0% |
11500101 | 45 | 0.4% |
11470103 | 216 | 2.2% |
11470102 | 226 | 2.3% |
11470101 | 248 | 2.5% |
11440127 | 52 | 0.5% |
11440126 | 12 | 0.1% |
11440125 | 81 | 0.8% |
11440124 | 96 | 1.0% |
induty_cd
Categorical
Distinct | 42 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
A01 | 1656 |
---|---|
A03 | 1304 |
C01 | 816 |
B02 | 710 |
C05 | 465 |
Other values (37) | 5049 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | A01 |
---|---|
2nd row | B10 |
3rd row | C03 |
4th row | A12 |
5th row | A03 |
Common Values
Value | Count | Frequency (%) |
A01 | 1656 | 16.6% |
A03 | 1304 | 13.0% |
C01 | 816 | 8.2% |
B02 | 710 | 7.1% |
C05 | 465 | 4.7% |
B01 | 402 | 4.0% |
C07 | 353 | 3.5% |
C03 | 347 | 3.5% |
C06 | 346 | 3.5% |
B03 | 321 | 3.2% |
Other values (32) | 3280 | 32.8% |
Length
Value | Count | Frequency (%) |
a01 | 1656 | 16.6% |
a03 | 1304 | 13.0% |
c01 | 816 | 8.2% |
b02 | 710 | 7.1% |
c05 | 465 | 4.7% |
b01 | 402 | 4.0% |
c07 | 353 | 3.5% |
c03 | 347 | 3.5% |
c06 | 346 | 3.5% |
b03 | 321 | 3.2% |
Other values (32) | 3280 | 32.8% |
gtfc_scor
Real number (ℝ)
Distinct | 4001 |
---|---|
Distinct (%) | 40.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 48.94822 |
Minimum | 6.01 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 6.01 |
---|---|
5-th percentile | 30.258 |
Q1 | 42.09 |
median | 48.33 |
Q3 | 53.03 |
95-th percentile | 73.5625 |
Maximum | 100 |
Range | 93.99 |
Interquartile range (IQR) | 10.94 |
Descriptive statistics
Standard deviation | 12.824083 |
---|---|
Coefficient of variation (CV) | 0.26199284 |
Kurtosis | 2.5645892 |
Mean | 48.94822 |
Median Absolute Deviation (MAD) | 5.61 |
Skewness | 0.97889925 |
Sum | 489482.2 |
Variance | 164.45711 |
Monotonicity | Not monotonic |
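Several of the gtfc_scor figures are derived from others in the same tables, so they can be cross-checked by hand: Range = Maximum − Minimum, IQR = Q3 − Q1, CV = Standard deviation / Mean, and Variance = Standard deviation squared. A minimal Python sketch of that arithmetic, with every input copied from the tables above (the variable names are illustrative only):

```python
# Figures copied from the gtfc_scor tables in this report.
minimum, maximum = 6.01, 100.0
q1, q3 = 42.09, 53.03
mean, std = 48.94822, 12.824083

value_range = maximum - minimum  # reported Range: 93.99
iqr = q3 - q1                    # reported IQR: 10.94
cv = std / mean                  # reported CV: 0.26199284
variance = std ** 2              # reported Variance: 164.45711

print(round(value_range, 2), round(iqr, 2))
print(round(cv, 6), round(variance, 3))
```

All four recomputed values agree with the reported ones up to the rounding of the inputs.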
Value | Count | Frequency (%) |
49.79 | 455 | 4.5% |
49.78 | 330 | 3.3% |
49.8 | 28 | 0.3% |
47.84 | 12 | 0.1% |
47.82 | 11 | 0.1% |
48.69 | 11 | 0.1% |
47.48 | 11 | 0.1% |
49.81 | 10 | 0.1% |
48.31 | 10 | 0.1% |
44.8 | 10 | 0.1% |
Other values (3991) | 9112 | 91.1% |
Value | Count | Frequency (%) |
6.01 | 1 | < 0.1% |
7.93 | 1 | < 0.1% |
9.49 | 1 | < 0.1% |
9.87 | 1 | < 0.1% |
10.94 | 1 | < 0.1% |
11.72 | 1 | < 0.1% |
11.89 | 1 | < 0.1% |
12.02 | 1 | < 0.1% |
12.35 | 1 | < 0.1% |
12.36 | 1 | < 0.1% |
Value | Count | Frequency (%) |
100.0 | 9 | 0.1% |
99.99 | 4 | < 0.1% |
99.98 | 1 | < 0.1% |
99.96 | 1 | < 0.1% |
99.95 | 2 | < 0.1% |
99.92 | 1 | < 0.1% |
99.84 | 1 | < 0.1% |
99.83 | 1 | < 0.1% |
99.75 | 1 | < 0.1% |
99.73 | 1 | < 0.1% |
Variable | pnu | legaldong_cd | induty_cd | gtfc_scor |
---|---|---|---|---|
pnu | 1.000 | 1.000 | 0.207 | 0.082 |
legaldong_cd | 1.000 | 1.000 | 0.207 | 0.082 |
induty_cd | 0.207 | 0.207 | 1.000 | 0.477 |
gtfc_scor | 0.082 | 0.082 | 0.477 | 1.000 |
Variable | pnu | legaldong_cd | gtfc_scor | induty_cd |
---|---|---|---|---|
pnu | 1.000 | 1.000 | 0.046 | 0.073 |
legaldong_cd | 1.000 | 1.000 | 0.046 | 0.073 |
gtfc_scor | 0.046 | 0.046 | 1.000 | 0.184 |
induty_cd | 0.073 | 0.073 | 0.184 | 1.000 |
Index | data_strd_ym | pnu | legaldong_cd | induty_cd | gtfc_scor |
---|---|---|---|---|---|
28762 | 202307 | 1121510500105530228 | 11215105 | A01 | 46.5 |
60422 | 202307 | 1132010700106500065 | 11320107 | B10 | 48.75 |
501 | 202307 | 1111011100101560001 | 11110111 | C03 | 36.67 |
70598 | 202307 | 1138010700100850005 | 11380107 | A12 | 54.64 |
58067 | 202307 | 1132010600102640064 | 11320106 | A03 | 48.69 |
30537 | 202307 | 1121510700100340005 | 11215107 | C05 | 39.87 |
33539 | 202307 | 1123010400102950009 | 11230104 | C01 | 49.78 |
33436 | 202307 | 1123010400101410063 | 11230104 | C01 | 49.78 |
17211 | 202307 | 1117013000101360002 | 11170130 | A12 | 45.07 |
80610 | 202307 | 1144010800103370012 | 11440108 | B03 | 46.0 |
Index | data_strd_ym | pnu | legaldong_cd | induty_cd | gtfc_scor |
---|---|---|---|---|---|
64017 | 202307 | 1135010500102920001 | 11350105 | B19 | 28.39 |
42493 | 202307 | 1126010300103080010 | 11260103 | A01 | 55.92 |
17691 | 202307 | 1117013100102570010 | 11170131 | C05 | 40.51 |
10410 | 202307 | 1114015100100190029 | 11140151 | C01 | 49.79 |
37802 | 202307 | 1123011000100780042 | 11230110 | A03 | 32.22 |
65626 | 202307 | 1135010500110020000 | 11350105 | B15 | 58.67 |
24497 | 202307 | 1121510100100970005 | 11215101 | B02 | 50.26 |
42679 | 202307 | 1126010300103230108 | 11260103 | B02 | 33.23 |
65992 | 202307 | 1135010500111320001 | 11350105 | B02 | 49.39 |
39684 | 202307 | 1126010100101920125 | 11260101 | C05 | 52.29 |