Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 507.8 KiB |
Average record size in memory | 52.0 B |
Variable types
Categorical | 2 |
---|---|
Numeric | 3 |
Dataset
Description | Sample |
---|---|
Author | 오아시스비즈니스 |
URL | https://www.bigdata-realestate.kr/rebpp/usr/prd/prdInfoDetail.do?req_productId=66 |
data_strd_ym has constant value "" | Constant |
pnu is highly overall correlated with legaldong_cd | High correlation |
legaldong_cd is highly overall correlated with pnu | High correlation |
pul_party_sopsrt_dims is highly skewed (γ1 = 30.25402225) | Skewed |
Reproduction
Analysis started | 2023-12-11 22:31:59.971019 |
---|---|
Analysis finished | 2023-12-11 22:32:03.094336 |
Duration | 3.12 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
data_strd_ym
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
202306 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 202306 |
---|---|
2nd row | 202306 |
3rd row | 202306 |
4th row | 202306 |
5th row | 202306 |
Common Values
Value | Count | Frequency (%) |
202306 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
202306 | 10000 |
pnu
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8820 |
---|---|
Distinct (%) | 88.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1248855 × 1018 |
Minimum | 1.1110101 × 1018 |
---|---|
Maximum | 1.1410111 × 1018 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.1110101 × 1018 |
---|---|
5-th percentile | 1.1110157 × 1018 |
Q1 | 1.1200103 × 1018 |
median | 1.123011 × 1018 |
Q3 | 1.1320105 × 1018 |
95-th percentile | 1.1380107 × 1018 |
Maximum | 1.1410111 × 1018 |
Range | 3.0001 × 1016 |
Interquartile range (IQR) | 1.20002 × 1016 |
Descriptive statistics
Standard deviation | 8.4598481 × 1015 |
---|---|
Coefficient of variation (CV) | 0.00752063 |
Kurtosis | -1.0354692 |
Mean | 1.1248855 × 1018 |
Median Absolute Deviation (MAD) | 7.4991 × 1015 |
Skewness | -0.00078400995 |
Sum | -3.6584034 × 1018 |
Variance | 7.1569029 × 1031 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1123010400105910053 | 6 | 0.1% |
1120011300105600000 | 6 | 0.1% |
1111010700100800000 | 6 | 0.1% |
1129013800103160003 | 6 | 0.1% |
1132010700101340036 | 6 | 0.1% |
1129013600102220000 | 5 | 0.1% |
1121510300105460011 | 5 | 0.1% |
1120011500102800021 | 5 | 0.1% |
1123010400100100000 | 5 | 0.1% |
1111017700102330000 | 5 | 0.1% |
Other values (8810) | 9945 |
Value | Count | Frequency (%) |
1111010100100010000 | 1 | |
1111010100100500031 | 1 | |
1111010100100660000 | 1 | |
1111010100101110001 | 1 | |
1111010100101310000 | 1 | |
1111010200100010028 | 1 | |
1111010200100570000 | 1 | |
1111010300100130008 | 1 | |
1111010400100580000 | 1 | |
1111010400100700001 | 1 |
Value | Count | Frequency (%) |
1141011100102660006 | 1 | |
1141011100102500002 | 1 | |
1141011100102460003 | 1 | |
1141011100102450011 | 1 | |
1141011100102170002 | 1 | |
1141011100102140001 | 1 | |
1141011100102110000 | 1 | |
1141011100101790008 | 1 | |
1141011100101760001 | 1 | |
1141011100101750001 | 1 |
legaldong_cd
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 298 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11248855 |
Minimum | 11110101 |
---|---|
Maximum | 11410111 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 11110101 |
---|---|
5-th percentile | 11110157 |
Q1 | 11200103 |
median | 11230110 |
Q3 | 11320105 |
95-th percentile | 11380107 |
Maximum | 11410111 |
Range | 300010 |
Interquartile range (IQR) | 120002 |
Descriptive statistics
Standard deviation | 84598.481 |
---|---|
Coefficient of variation (CV) | 0.00752063 |
Kurtosis | -1.0354692 |
Mean | 11248855 |
Median Absolute Deviation (MAD) | 74991 |
Skewness | -0.00078400981 |
Sum | 1.1248855 × 1011 |
Variance | 7.1569029 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11260101 | 325 | 3.2% |
11305103 | 323 | 3.2% |
11350105 | 319 | 3.2% |
11305101 | 273 | 2.7% |
11215101 | 260 | 2.6% |
11230106 | 218 | 2.2% |
11320107 | 217 | 2.2% |
11380107 | 210 | 2.1% |
11215105 | 204 | 2.0% |
11350103 | 204 | 2.0% |
Other values (288) | 7447 |
Value | Count | Frequency (%) |
11110101 | 5 | 0.1% |
11110102 | 2 | < 0.1% |
11110103 | 1 | < 0.1% |
11110104 | 3 | < 0.1% |
11110105 | 4 | < 0.1% |
11110106 | 8 | |
11110107 | 16 | |
11110108 | 13 | |
11110109 | 7 | |
11110110 | 9 |
Value | Count | Frequency (%) |
11410111 | 34 | |
11410110 | 41 | |
11410109 | 2 | < 0.1% |
11410108 | 12 | 0.1% |
11410107 | 3 | < 0.1% |
11410106 | 4 | < 0.1% |
11410105 | 12 | 0.1% |
11410104 | 9 | 0.1% |
11410103 | 1 | < 0.1% |
11410102 | 18 |
induty_cd
Categorical
Distinct | 42 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
A01 | |
---|---|
A03 | |
C01 | |
B02 | |
C05 | 508 |
Other values (37) |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | A02 |
---|---|
2nd row | C07 |
3rd row | B22 |
4th row | B10 |
5th row | C05 |
Common Values
Value | Count | Frequency (%) |
A01 | 1644 | |
A03 | 1236 | 12.4% |
C01 | 747 | 7.5% |
B02 | 676 | 6.8% |
C05 | 508 | 5.1% |
C03 | 384 | 3.8% |
B01 | 363 | 3.6% |
C06 | 353 | 3.5% |
B05 | 350 | 3.5% |
C07 | 325 | 3.2% |
Other values (32) | 3414 |
Length
Value | Count | Frequency (%) |
a01 | 1644 | |
a03 | 1236 | 12.4% |
c01 | 747 | 7.5% |
b02 | 676 | 6.8% |
c05 | 508 | 5.1% |
c03 | 384 | 3.8% |
b01 | 363 | 3.6% |
c06 | 353 | 3.5% |
b05 | 350 | 3.5% |
c07 | 325 | 3.2% |
Other values (32) | 3414 |
pul_party_sopsrt_dims
Real number (ℝ)
SKEWED
 
Distinct | 512 |
---|---|
Distinct (%) | 5.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.528906 |
Minimum | 0.01 |
---|---|
Maximum | 957.12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0.01 |
---|---|
5-th percentile | 0.02 |
Q1 | 0.06 |
median | 0.14 |
Q3 | 0.34 |
95-th percentile | 1.71 |
Maximum | 957.12 |
Range | 957.11 |
Interquartile range (IQR) | 0.28 |
Descriptive statistics
Standard deviation | 20.089753 |
---|---|
Coefficient of variation (CV) | 13.139953 |
Kurtosis | 1144.6127 |
Mean | 1.528906 |
Median Absolute Deviation (MAD) | 0.1 |
Skewness | 30.254022 |
Sum | 15289.06 |
Variance | 403.59819 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.02 | 590 | 5.9% |
0.03 | 504 | 5.0% |
0.04 | 484 | 4.8% |
0.06 | 443 | 4.4% |
0.01 | 436 | 4.4% |
0.05 | 421 | 4.2% |
0.07 | 393 | 3.9% |
0.08 | 383 | 3.8% |
0.09 | 298 | 3.0% |
0.12 | 271 | 2.7% |
Other values (502) | 5777 |
Value | Count | Frequency (%) |
0.01 | 436 | |
0.02 | 590 | |
0.03 | 504 | |
0.04 | 484 | |
0.05 | 421 | |
0.06 | 443 | |
0.07 | 393 | |
0.08 | 383 | |
0.09 | 298 | |
0.1 | 266 |
Value | Count | Frequency (%) |
957.12 | 1 | < 0.1% |
849.17 | 1 | < 0.1% |
742.28 | 1 | < 0.1% |
586.33 | 1 | < 0.1% |
384.92 | 1 | < 0.1% |
319.18 | 1 | < 0.1% |
304.63 | 1 | < 0.1% |
292.96 | 4 | |
270.22 | 1 | < 0.1% |
245.29 | 1 | < 0.1% |
pnu | legaldong_cd | induty_cd | pul_party_sopsrt_dims | |
---|---|---|---|---|
pnu | 1.000 | 0.999 | 0.196 | 0.065 |
legaldong_cd | 0.999 | 1.000 | 0.194 | 0.067 |
induty_cd | 0.196 | 0.194 | 1.000 | 0.000 |
pul_party_sopsrt_dims | 0.065 | 0.067 | 0.000 | 1.000 |
pnu | legaldong_cd | pul_party_sopsrt_dims | induty_cd | |
---|---|---|---|---|
pnu | 1.000 | 1.000 | 0.089 | 0.069 |
legaldong_cd | 1.000 | 1.000 | 0.090 | 0.068 |
pul_party_sopsrt_dims | 0.089 | 0.090 | 1.000 | 0.000 |
induty_cd | 0.069 | 0.068 | 0.000 | 1.000 |
data_strd_ym | pnu | legaldong_cd | induty_cd | pul_party_sopsrt_dims | |
---|---|---|---|---|---|
80673 | 202306 | 1138010600100150120 | 11380106 | A02 | 0.04 |
38196 | 202306 | 1123010300109920029 | 11230103 | C07 | 0.33 |
36189 | 202306 | 1121510900101250054 | 11215109 | B22 | 0.25 |
26177 | 202306 | 1120011500102350001 | 11200115 | B10 | 0.65 |
61592 | 202306 | 1130510100108670099 | 11305101 | C05 | 0.43 |
74903 | 202306 | 1135010500106690000 | 11350105 | A03 | 0.16 |
70645 | 202306 | 1132010800106360001 | 11320108 | A03 | 1.52 |
1317 | 202306 | 1111012200100260000 | 11110122 | A01 | 0.01 |
24576 | 202306 | 1120011200111140000 | 11200112 | A02 | 0.49 |
39290 | 202306 | 1123010500100010049 | 11230105 | A01 | 0.65 |
data_strd_ym | pnu | legaldong_cd | induty_cd | pul_party_sopsrt_dims | |
---|---|---|---|---|---|
56549 | 202306 | 1129013500100030251 | 11290135 | C07 | 0.02 |
184 | 202306 | 1111010600100910050 | 11110106 | A01 | 0.01 |
74077 | 202306 | 1135010500103660003 | 11350105 | B02 | 0.92 |
16416 | 202306 | 1117010200100440002 | 11170102 | C03 | 0.04 |
62095 | 202306 | 1130510200104150038 | 11305102 | A14 | 0.18 |
78296 | 202306 | 1138010300102970028 | 11380103 | C03 | 0.35 |
23062 | 202306 | 1120010700100030027 | 11200107 | B01 | 0.43 |
73127 | 202306 | 1135010400101680001 | 11350104 | A13 | 0.38 |
46452 | 202306 | 1126010100104970007 | 11260101 | B18 | 0.62 |
52308 | 202306 | 1129010300106440000 | 11290103 | A01 | 0.07 |