Overview

Dataset statistics

Number of variables6
Number of observations24
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory59.5 B

Variable types

Categorical1
Numeric5

Alerts

월별 is highly overall correlated with 년도별High correlation
총 분양가구수 is highly overall correlated with 미분양가구수 and 3 other fieldsHigh correlation
미분양가구수 is highly overall correlated with 총 분양가구수 and 3 other fieldsHigh correlation
분양률 is highly overall correlated with 총 분양가구수 and 3 other fieldsHigh correlation
미분양률 is highly overall correlated with 총 분양가구수 and 3 other fieldsHigh correlation
년도별 is highly overall correlated with 월별 and 4 other fieldsHigh correlation
년도별 is highly imbalanced (68.6%)Imbalance
미분양가구수 has unique valuesUnique
분양률 has unique valuesUnique
미분양률 has unique valuesUnique

Reproduction

Analysis started2024-01-09 21:10:46.385792
Analysis finished2024-01-09 21:10:48.860943
Duration2.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년도별
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size324.0 B
<NA>
22 
2020
 
1
2021
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique2 ?
Unique (%)8.3%

Sample

1st row2020
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 22
91.7%
2020 1
 
4.2%
2021 1
 
4.2%

Length

2024-01-10T06:10:48.927152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:10:49.012806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 22
91.7%
2020 1
 
4.2%
2021 1
 
4.2%

월별
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.5
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2024-01-10T06:10:49.091295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.15
Q13.75
median6.5
Q39.25
95-th percentile11.85
Maximum12
Range11
Interquartile range (IQR)5.5

Descriptive statistics

Standard deviation3.5262987
Coefficient of variation (CV)0.54250749
Kurtosis-1.2156934
Mean6.5
Median Absolute Deviation (MAD)3
Skewness0
Sum156
Variance12.434783
MonotonicityNot monotonic
2024-01-10T06:10:49.184213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
1 2
8.3%
2 2
8.3%
3 2
8.3%
4 2
8.3%
5 2
8.3%
6 2
8.3%
7 2
8.3%
8 2
8.3%
9 2
8.3%
10 2
8.3%
Other values (2) 4
16.7%
ValueCountFrequency (%)
1 2
8.3%
2 2
8.3%
3 2
8.3%
4 2
8.3%
5 2
8.3%
6 2
8.3%
7 2
8.3%
8 2
8.3%
9 2
8.3%
10 2
8.3%
ValueCountFrequency (%)
12 2
8.3%
11 2
8.3%
10 2
8.3%
9 2
8.3%
8 2
8.3%
7 2
8.3%
6 2
8.3%
5 2
8.3%
4 2
8.3%
3 2
8.3%

총 분양가구수
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)45.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4769.7083
Minimum2910
Maximum6821
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2024-01-10T06:10:49.279043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2910
5-th percentile3083
Q13754
median5351
Q35358
95-th percentile6821
Maximum6821
Range3911
Interquartile range (IQR)1604

Descriptive statistics

Standard deviation1185.2571
Coefficient of variation (CV)0.24849676
Kurtosis-0.74039522
Mean4769.7083
Median Absolute Deviation (MAD)1064
Skewness0.092719462
Sum114473
Variance1404834.3
MonotonicityNot monotonic
2024-01-10T06:10:49.382905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
5358 7
29.2%
3083 3
12.5%
3754 3
12.5%
6821 3
12.5%
5351 2
 
8.3%
2910 1
 
4.2%
4031 1
 
4.2%
4370 1
 
4.2%
4415 1
 
4.2%
4204 1
 
4.2%
ValueCountFrequency (%)
2910 1
 
4.2%
3083 3
12.5%
3754 3
12.5%
4031 1
 
4.2%
4204 1
 
4.2%
4370 1
 
4.2%
4415 1
 
4.2%
5351 2
 
8.3%
5358 7
29.2%
5361 1
 
4.2%
ValueCountFrequency (%)
6821 3
12.5%
5361 1
 
4.2%
5358 7
29.2%
5351 2
 
8.3%
4415 1
 
4.2%
4370 1
 
4.2%
4204 1
 
4.2%
4031 1
 
4.2%
3754 3
12.5%
3083 3
12.5%

미분양가구수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean776.54167
Minimum58
Maximum1796
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2024-01-10T06:10:49.502964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum58
5-th percentile71.1
Q1298.25
median824.5
Q31132
95-th percentile1646.8
Maximum1796
Range1738
Interquartile range (IQR)833.75

Descriptive statistics

Standard deviation536.72696
Coefficient of variation (CV)0.69117599
Kurtosis-1.1501669
Mean776.54167
Median Absolute Deviation (MAD)493.5
Skewness0.23807233
Sum18637
Variance288075.82
MonotonicityNot monotonic
2024-01-10T06:10:49.608841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
1385 1
 
4.2%
552 1
 
4.2%
153 1
 
4.2%
58 1
 
4.2%
63 1
 
4.2%
117 1
 
4.2%
211 1
 
4.2%
272 1
 
4.2%
307 1
 
4.2%
337 1
 
4.2%
Other values (14) 14
58.3%
ValueCountFrequency (%)
58 1
4.2%
63 1
4.2%
117 1
4.2%
153 1
4.2%
211 1
4.2%
272 1
4.2%
307 1
4.2%
337 1
4.2%
410 1
4.2%
469 1
4.2%
ValueCountFrequency (%)
1796 1
4.2%
1693 1
4.2%
1385 1
4.2%
1382 1
4.2%
1324 1
4.2%
1192 1
4.2%
1112 1
4.2%
1062 1
4.2%
1042 1
4.2%
1033 1
4.2%

분양률
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.844443
Minimum0.8503152
Maximum47.842302
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2024-01-10T06:10:49.713634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.8503152
5-th percentile1.1124398
Q15.5664427
median16.34274
Q330.355565
95-th percentile47.179297
Maximum47.842302
Range46.991986
Interquartile range (IQR)24.789122

Descriptive statistics

Standard deviation16.494289
Coefficient of variation (CV)0.83117926
Kurtosis-1.2432972
Mean19.844443
Median Absolute Deviation (MAD)11.254843
Skewness0.48700793
Sum476.26662
Variance272.06157
MonotonicityNot monotonic
2024-01-10T06:10:49.823410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
47.59450172 1
 
4.2%
10.30235162 1
 
4.2%
2.243072863 1
 
4.2%
0.850315203 1
 
4.2%
0.923618238 1
 
4.2%
2.182428651 1
 
4.2%
3.938036581 1
 
4.2%
5.07652109 1
 
4.2%
5.729749907 1
 
4.2%
6.289660321 1
 
4.2%
Other values (14) 14
58.3%
ValueCountFrequency (%)
0.850315203 1
4.2%
0.923618238 1
4.2%
2.182428651 1
4.2%
2.243072863 1
4.2%
3.938036581 1
4.2%
5.07652109 1
4.2%
5.729749907 1
4.2%
6.289660321 1
4.2%
7.652108996 1
4.2%
8.753266144 1
4.2%
ValueCountFrequency (%)
47.84230155 1
4.2%
47.59450172 1
4.2%
44.82646773 1
4.2%
42.94518326 1
4.2%
40.27117031 1
4.2%
38.66363931 1
4.2%
27.5862069 1
4.2%
27.11774108 1
4.2%
25.09323388 1
4.2%
24.3020595 1
4.2%

미분양률
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean80.155557
Minimum52.157698
Maximum99.149685
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size348.0 B
2024-01-10T06:10:49.938512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum52.157698
5-th percentile52.820703
Q169.644435
median83.65726
Q394.433557
95-th percentile98.88756
Maximum99.149685
Range46.991986
Interquartile range (IQR)24.789122

Descriptive statistics

Standard deviation16.494289
Coefficient of variation (CV)0.20577848
Kurtosis-1.2432972
Mean80.155557
Median Absolute Deviation (MAD)11.254843
Skewness-0.48700793
Sum1923.7334
Variance272.06157
MonotonicityNot monotonic
2024-01-10T06:10:50.053197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
52.40549828 1
 
4.2%
89.69764838 1
 
4.2%
97.75692714 1
 
4.2%
99.1496848 1
 
4.2%
99.07638176 1
 
4.2%
97.81757135 1
 
4.2%
96.06196342 1
 
4.2%
94.92347891 1
 
4.2%
94.27025009 1
 
4.2%
93.71033968 1
 
4.2%
Other values (14) 14
58.3%
ValueCountFrequency (%)
52.15769845 1
4.2%
52.40549828 1
4.2%
55.17353227 1
4.2%
57.05481674 1
4.2%
59.72882969 1
4.2%
61.33636069 1
4.2%
72.4137931 1
4.2%
72.88225892 1
4.2%
74.90676612 1
4.2%
75.6979405 1
4.2%
ValueCountFrequency (%)
99.1496848 1
4.2%
99.07638176 1
4.2%
97.81757135 1
4.2%
97.75692714 1
4.2%
96.06196342 1
4.2%
94.92347891 1
4.2%
94.27025009 1
4.2%
93.71033968 1
4.2%
92.347891 1
4.2%
91.24673386 1
4.2%

Interactions

2024-01-10T06:10:48.188884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:46.568574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:46.957755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.378531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.797273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:48.268325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:46.634883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.045817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.456529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.877237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:48.373380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:46.705293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.133309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.552342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.961437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:48.467981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:46.785367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.215410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.637712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:48.041144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:48.559715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:46.868763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.297646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:47.713404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:10:48.113724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:10:50.128323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도별월별총 분양가구수미분양가구수분양률미분양률
년도별1.000NaNNaN0.0000.0000.000
월별NaN1.0000.0000.0000.2020.202
총 분양가구수NaN0.0001.0000.7340.8320.832
미분양가구수0.0000.0000.7341.0000.8900.890
분양률0.0000.2020.8320.8901.0001.000
미분양률0.0000.2020.8320.8901.0001.000
2024-01-10T06:10:50.231084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
월별총 분양가구수미분양가구수분양률미분양률년도별
월별1.0000.451-0.337-0.4270.4271.000
총 분양가구수0.4511.000-0.909-0.9560.9561.000
미분양가구수-0.337-0.9091.0000.981-0.9811.000
분양률-0.427-0.9560.9811.000-1.0001.000
미분양률0.4270.956-0.981-1.0001.0001.000
년도별1.0001.0001.0001.0001.0001.000

Missing values

2024-01-10T06:10:48.689852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:10:48.808849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년도별월별총 분양가구수미분양가구수분양률미분양률
0202012910138547.59450252.405498
1<NA>23083138244.82646855.173532
2<NA>33083132442.94518357.054817
3<NA>43083119238.66363961.336361
4<NA>53754179647.84230252.157698
5<NA>63754101827.11774172.882259
6<NA>7375494225.09323474.906766
7<NA>84031111227.58620772.413793
8<NA>94370106224.30205975.697941
9<NA>104415103323.39750876.602492
년도별월별총 분양가구수미분양가구수분양률미분양률
14<NA>353584698.75326691.246734
15<NA>453584107.65210992.347891
16<NA>553583376.2896693.71034
17<NA>653583075.7297594.27025
18<NA>753582725.07652194.923479
19<NA>853582113.93803796.061963
20<NA>953611172.18242997.817571
21<NA>106821630.92361899.076382
22<NA>116821580.85031599.149685
23<NA>1268211532.24307397.756927