Overview

Dataset statistics

Number of variables23
Number of observations10000
Missing cells7025
Missing cells (%)3.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 MiB
Average record size in memory210.0 B

Variable types

Numeric15
Categorical8

Dataset

Description1. 영구임대아파트에 거주중인 입주자의 퇴거 여부를 예측하기 위한 기계학습용 과거이력 데이터
Author대구도시공사
URLhttps://www.data.go.kr/data/15094266/fileData.do

Alerts

퇴거여부 is highly imbalanced (72.6%)Imbalance
퇴거연도 has 7025 (70.2%) missing valuesMissing

Reproduction

Analysis started2023-12-12 22:29:05.147004
Analysis finished2023-12-12 22:29:05.466202
Duration0.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

Distinct6101
Distinct (%)61.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6429.8336
Minimum1
Maximum12882
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:05.529493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile656.85
Q13140.75
median6415.5
Q39701.25
95-th percentile12273
Maximum12882
Range12881
Interquartile range (IQR)6560.5

Descriptive statistics

Standard deviation3762.1145
Coefficient of variation (CV)0.58510294
Kurtosis-1.2261469
Mean6429.8336
Median Absolute Deviation (MAD)3278.5
Skewness0.0163903
Sum64298336
Variance14153506
MonotonicityNot monotonic
2023-12-13T07:29:05.665879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7085 6
 
0.1%
5332 6
 
0.1%
6345 6
 
0.1%
6103 6
 
0.1%
8668 6
 
0.1%
5898 6
 
0.1%
8860 5
 
0.1%
2274 5
 
0.1%
9885 5
 
0.1%
3576 5
 
0.1%
Other values (6091) 9944
99.4%
ValueCountFrequency (%)
1 1
 
< 0.1%
3 1
 
< 0.1%
4 1
 
< 0.1%
6 3
< 0.1%
7 1
 
< 0.1%
8 1
 
< 0.1%
10 1
 
< 0.1%
15 1
 
< 0.1%
16 1
 
< 0.1%
17 1
 
< 0.1%
ValueCountFrequency (%)
12882 3
< 0.1%
12880 3
< 0.1%
12879 1
 
< 0.1%
12877 3
< 0.1%
12875 2
< 0.1%
12873 1
 
< 0.1%
12872 1
 
< 0.1%
12871 2
< 0.1%
12865 1
 
< 0.1%
12861 4
< 0.1%

계약구분
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
유효
7025 
해지
2975 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유효
2nd row유효
3rd row유효
4th row해지
5th row유효

Common Values

ValueCountFrequency (%)
유효 7025
70.2%
해지 2975
29.8%

Length

2023-12-13T07:29:05.771547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:29:05.855368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유효 7025
70.2%
해지 2975
29.8%

재계약횟수
Real number (ℝ)

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.3706
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:05.927366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q15
median8
Q310
95-th percentile10
Maximum12
Range11
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.7600756
Coefficient of variation (CV)0.37447096
Kurtosis-0.75335857
Mean7.3706
Median Absolute Deviation (MAD)2
Skewness-0.67327149
Sum73706
Variance7.6180174
MonotonicityNot monotonic
2023-12-13T07:29:06.017590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
10 3109
31.1%
9 1237
 
12.4%
8 1028
 
10.3%
6 883
 
8.8%
7 826
 
8.3%
5 679
 
6.8%
4 656
 
6.6%
3 621
 
6.2%
2 423
 
4.2%
11 288
 
2.9%
Other values (2) 250
 
2.5%
ValueCountFrequency (%)
1 247
 
2.5%
2 423
 
4.2%
3 621
 
6.2%
4 656
 
6.6%
5 679
 
6.8%
6 883
 
8.8%
7 826
 
8.3%
8 1028
 
10.3%
9 1237
 
12.4%
10 3109
31.1%
ValueCountFrequency (%)
12 3
 
< 0.1%
11 288
 
2.9%
10 3109
31.1%
9 1237
 
12.4%
8 1028
 
10.3%
7 826
 
8.3%
6 883
 
8.8%
5 679
 
6.8%
4 656
 
6.6%
3 621
 
6.2%

거주개월
Real number (ℝ)

Distinct293
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean174.2105
Minimum1
Maximum323
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:06.123071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile49
Q1126
median197
Q3222
95-th percentile243
Maximum323
Range322
Interquartile range (IQR)96

Descriptive statistics

Standard deviation64.897846
Coefficient of variation (CV)0.37252545
Kurtosis-0.52669462
Mean174.2105
Median Absolute Deviation (MAD)37
Skewness-0.69217517
Sum1742105
Variance4211.7304
MonotonicityNot monotonic
2023-12-13T07:29:06.243323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
222 1593
 
15.9%
234 1056
 
10.6%
246 132
 
1.3%
198 128
 
1.3%
114 96
 
1.0%
162 92
 
0.9%
186 87
 
0.9%
210 83
 
0.8%
126 81
 
0.8%
230 77
 
0.8%
Other values (283) 6575
65.8%
ValueCountFrequency (%)
1 6
0.1%
2 3
 
< 0.1%
4 3
 
< 0.1%
5 1
 
< 0.1%
6 3
 
< 0.1%
7 3
 
< 0.1%
8 7
0.1%
9 6
0.1%
10 3
 
< 0.1%
11 11
0.1%
ValueCountFrequency (%)
323 1
 
< 0.1%
322 1
 
< 0.1%
318 1
 
< 0.1%
316 1
 
< 0.1%
315 3
 
< 0.1%
314 7
 
0.1%
313 5
 
0.1%
312 9
0.1%
311 3
 
< 0.1%
310 22
0.2%

아파트 이름
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
비둘기아파트
4278 
용지아파트
3775 
지산5단지아파트
1600 
까치아파트
 
215
강남아파트
 
132

Length

Max length8
Median length6
Mean length5.9078
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row비둘기아파트
2nd row비둘기아파트
3rd row비둘기아파트
4th row비둘기아파트
5th row용지아파트

Common Values

ValueCountFrequency (%)
비둘기아파트 4278
42.8%
용지아파트 3775
37.8%
지산5단지아파트 1600
 
16.0%
까치아파트 215
 
2.1%
강남아파트 132
 
1.3%

Length

2023-12-13T07:29:06.357481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:29:06.441221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
비둘기아파트 4278
42.8%
용지아파트 3775
37.8%
지산5단지아파트 1600
 
16.0%
까치아파트 215
 
2.1%
강남아파트 132
 
1.3%

아파트 ID
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
4278 
2
3775 
3
1600 
4
 
215
5
 
132

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row2

Common Values

ValueCountFrequency (%)
1 4278
42.8%
2 3775
37.8%
3 1600
 
16.0%
4 215
 
2.1%
5 132
 
1.3%

Length

2023-12-13T07:29:06.527693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:29:06.625595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 4278
42.8%
2 3775
37.8%
3 1600
 
16.0%
4 215
 
2.1%
5 132
 
1.3%

아파트 평점
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
5
4278 
7
3907 
8
1600 
10
 
215

Length

Max length2
Median length1
Mean length1.0215
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5
2nd row5
3rd row5
4th row5
5th row7

Common Values

ValueCountFrequency (%)
5 4278
42.8%
7 3907
39.1%
8 1600
 
16.0%
10 215
 
2.1%

Length

2023-12-13T07:29:06.733111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:29:06.824941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 4278
42.8%
7 3907
39.1%
8 1600
 
16.0%
10 215
 
2.1%

호실고유번호
Real number (ℝ)

Distinct4795
Distinct (%)47.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42982.568
Minimum1
Maximum86891
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:06.950205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4152.35
Q121043.75
median42638
Q365105.25
95-th percentile82462
Maximum86891
Range86890
Interquartile range (IQR)44061.5

Descriptive statistics

Standard deviation25264.606
Coefficient of variation (CV)0.58778726
Kurtosis-1.2179958
Mean42982.568
Median Absolute Deviation (MAD)21998
Skewness0.030670757
Sum4.2982568 × 108
Variance6.3830029 × 108
MonotonicityNot monotonic
2023-12-13T07:29:07.053406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35355 9
 
0.1%
46894 9
 
0.1%
41364 8
 
0.1%
40930 8
 
0.1%
36926 8
 
0.1%
32314 8
 
0.1%
41060 8
 
0.1%
34455 7
 
0.1%
40729 7
 
0.1%
16382 7
 
0.1%
Other values (4785) 9921
99.2%
ValueCountFrequency (%)
1 2
< 0.1%
18 2
< 0.1%
45 1
< 0.1%
69 1
< 0.1%
93 1
< 0.1%
106 1
< 0.1%
132 1
< 0.1%
146 1
< 0.1%
172 1
< 0.1%
185 2
< 0.1%
ValueCountFrequency (%)
86891 3
< 0.1%
86865 3
< 0.1%
86852 1
 
< 0.1%
86826 3
< 0.1%
86812 4
< 0.1%
86799 2
< 0.1%
86762 1
 
< 0.1%
86736 4
< 0.1%
86723 2
< 0.1%
86710 2
< 0.1%


Real number (ℝ)

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.9024
Minimum1
Maximum15
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:07.143850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median8
Q312
95-th percentile15
Maximum15
Range14
Interquartile range (IQR)8

Descriptive statistics

Standard deviation4.2764826
Coefficient of variation (CV)0.5411625
Kurtosis-1.2039996
Mean7.9024
Median Absolute Deviation (MAD)4
Skewness0.025321741
Sum79024
Variance18.288303
MonotonicityNot monotonic
2023-12-13T07:29:07.226705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
4 722
 
7.2%
6 705
 
7.0%
10 704
 
7.0%
3 691
 
6.9%
13 684
 
6.8%
7 679
 
6.8%
2 668
 
6.7%
8 667
 
6.7%
12 660
 
6.6%
5 656
 
6.6%
Other values (5) 3164
31.6%
ValueCountFrequency (%)
1 652
6.5%
2 668
6.7%
3 691
6.9%
4 722
7.2%
5 656
6.6%
6 705
7.0%
7 679
6.8%
8 667
6.7%
9 641
6.4%
10 704
7.0%
ValueCountFrequency (%)
15 578
5.8%
14 654
6.5%
13 684
6.8%
12 660
6.6%
11 639
6.4%
10 704
7.0%
9 641
6.4%
8 667
6.7%
7 679
6.8%
6 705
7.0%

평형대
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
12
7348 
15
1401 
19
1251 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row12
2nd row15
3rd row12
4th row19
5th row12

Common Values

ValueCountFrequency (%)
12 7348
73.5%
15 1401
 
14.0%
19 1251
 
12.5%

Length

2023-12-13T07:29:07.327871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:29:07.405771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
12 7348
73.5%
15 1401
 
14.0%
19 1251
 
12.5%

계약자고유번호
Real number (ℝ)

Distinct6077
Distinct (%)60.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43865.371
Minimum14
Maximum86868
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:07.499270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum14
5-th percentile4240
Q122024
median44181
Q365962.75
95-th percentile82463
Maximum86868
Range86854
Interquartile range (IQR)43938.75

Descriptive statistics

Standard deviation25252.829
Coefficient of variation (CV)0.5756894
Kurtosis-1.220282
Mean43865.371
Median Absolute Deviation (MAD)21945
Skewness-0.031010876
Sum4.3865371 × 108
Variance6.3770539 × 108
MonotonicityNot monotonic
2023-12-13T07:29:07.607062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
30873 6
 
0.1%
38075 6
 
0.1%
18252 6
 
0.1%
51666 6
 
0.1%
71445 6
 
0.1%
35687 6
 
0.1%
22620 5
 
0.1%
17899 5
 
0.1%
6433 5
 
0.1%
11599 5
 
0.1%
Other values (6067) 9944
99.4%
ValueCountFrequency (%)
14 2
< 0.1%
27 1
 
< 0.1%
40 1
 
< 0.1%
66 2
< 0.1%
79 1
 
< 0.1%
84 1
 
< 0.1%
111 3
< 0.1%
124 1
 
< 0.1%
128 1
 
< 0.1%
136 3
< 0.1%
ValueCountFrequency (%)
86868 2
< 0.1%
86867 1
< 0.1%
86864 1
< 0.1%
86857 1
< 0.1%
86849 1
< 0.1%
86834 1
< 0.1%
86806 1
< 0.1%
86793 1
< 0.1%
86766 1
< 0.1%
86742 1
< 0.1%

계약서고유번호
Real number (ℝ)

Distinct6101
Distinct (%)61.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43831.52
Minimum14
Maximum86901
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:07.710698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum14
5-th percentile4255.35
Q121975
median44100
Q366099.25
95-th percentile82590.35
Maximum86901
Range86887
Interquartile range (IQR)44124.25

Descriptive statistics

Standard deviation25280.413
Coefficient of variation (CV)0.57676332
Kurtosis-1.2220458
Mean43831.52
Median Absolute Deviation (MAD)22048.5
Skewness-0.025868654
Sum4.383152 × 108
Variance6.3909929 × 108
MonotonicityNot monotonic
2023-12-13T07:29:07.811043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18684 6
 
0.1%
73283 6
 
0.1%
36163 6
 
0.1%
31300 6
 
0.1%
38619 6
 
0.1%
53629 6
 
0.1%
25005 5
 
0.1%
74536 5
 
0.1%
22217 5
 
0.1%
57914 5
 
0.1%
Other values (6091) 9944
99.4%
ValueCountFrequency (%)
14 2
< 0.1%
27 1
 
< 0.1%
40 1
 
< 0.1%
66 2
< 0.1%
79 1
 
< 0.1%
84 1
 
< 0.1%
111 3
< 0.1%
124 1
 
< 0.1%
128 1
 
< 0.1%
136 3
< 0.1%
ValueCountFrequency (%)
86901 1
< 0.1%
86892 1
< 0.1%
86884 1
< 0.1%
86867 1
< 0.1%
86861 1
< 0.1%
86839 1
< 0.1%
86835 1
< 0.1%
86827 1
< 0.1%
86823 1
< 0.1%
86809 1
< 0.1%

입주연도
Real number (ℝ)

Distinct27
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2005.232
Minimum1994
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:07.913499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1994
5-th percentile2001
Q12002
median2003
Q32008
95-th percentile2016
Maximum2020
Range26
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.8271426
Coefficient of variation (CV)0.0024072739
Kurtosis0.50133757
Mean2005.232
Median Absolute Deviation (MAD)1
Skewness1.1071016
Sum20052320
Variance23.301306
MonotonicityNot monotonic
2023-12-13T07:29:08.053852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
2003 2521
25.2%
2002 2259
22.6%
2001 708
 
7.1%
2005 511
 
5.1%
2004 426
 
4.3%
2014 368
 
3.7%
2007 334
 
3.3%
2010 320
 
3.2%
2011 306
 
3.1%
2009 304
 
3.0%
Other values (17) 1943
19.4%
ValueCountFrequency (%)
1994 3
 
< 0.1%
1995 73
 
0.7%
1996 43
 
0.4%
1997 12
 
0.1%
1998 48
 
0.5%
1999 65
 
0.7%
2000 147
 
1.5%
2001 708
 
7.1%
2002 2259
22.6%
2003 2521
25.2%
ValueCountFrequency (%)
2020 41
 
0.4%
2019 68
 
0.7%
2018 85
 
0.9%
2017 125
 
1.2%
2016 195
1.9%
2015 184
1.8%
2014 368
3.7%
2013 113
 
1.1%
2012 187
1.9%
2011 306
3.1%

퇴거연도
Real number (ℝ)

MISSING 

Distinct13
Distinct (%)0.4%
Missing7025
Missing (%)70.2%
Infinite0
Infinite (%)0.0%
Mean2015.9234
Minimum2008
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:08.167838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2008
5-th percentile2010
Q12014
median2016
Q32019
95-th percentile2020
Maximum2020
Range12
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.1315329
Coefficient of variation (CV)0.0015533988
Kurtosis-0.57346655
Mean2015.9234
Median Absolute Deviation (MAD)2
Skewness-0.58884941
Sum5997372
Variance9.8064984
MonotonicityNot monotonic
2023-12-13T07:29:08.277117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
2018 390
 
3.9%
2019 381
 
3.8%
2020 364
 
3.6%
2016 347
 
3.5%
2017 334
 
3.3%
2015 255
 
2.5%
2014 218
 
2.2%
2013 184
 
1.8%
2012 155
 
1.6%
2011 136
 
1.4%
Other values (3) 211
 
2.1%
(Missing) 7025
70.2%
ValueCountFrequency (%)
2008 30
 
0.3%
2009 72
 
0.7%
2010 109
 
1.1%
2011 136
 
1.4%
2012 155
1.6%
2013 184
1.8%
2014 218
2.2%
2015 255
2.5%
2016 347
3.5%
2017 334
3.3%
ValueCountFrequency (%)
2020 364
3.6%
2019 381
3.8%
2018 390
3.9%
2017 334
3.3%
2016 347
3.5%
2015 255
2.5%
2014 218
2.2%
2013 184
1.8%
2012 155
 
1.6%
2011 136
 
1.4%

거주연도
Real number (ℝ)

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2013.9782
Minimum2008
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:08.389322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2008
5-th percentile2008
Q12011
median2014
Q32017
95-th percentile2020
Maximum2020
Range12
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.7458414
Coefficient of variation (CV)0.0018599215
Kurtosis-1.2180487
Mean2013.9782
Median Absolute Deviation (MAD)3
Skewness0.019491852
Sum20139782
Variance14.031328
MonotonicityNot monotonic
2023-12-13T07:29:08.519338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
2011 814
 
8.1%
2016 811
 
8.1%
2020 788
 
7.9%
2012 787
 
7.9%
2010 784
 
7.8%
2009 775
 
7.8%
2019 765
 
7.6%
2014 759
 
7.6%
2008 756
 
7.6%
2015 750
 
7.5%
Other values (3) 2211
22.1%
ValueCountFrequency (%)
2008 756
7.6%
2009 775
7.8%
2010 784
7.8%
2011 814
8.1%
2012 787
7.9%
2013 740
7.4%
2014 759
7.6%
2015 750
7.5%
2016 811
8.1%
2017 724
7.2%
ValueCountFrequency (%)
2020 788
7.9%
2019 765
7.6%
2018 747
7.5%
2017 724
7.2%
2016 811
8.1%
2015 750
7.5%
2014 759
7.6%
2013 740
7.4%
2012 787
7.9%
2011 814
8.1%

월세(원)
Real number (ℝ)

Distinct682
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean57996.675
Minimum31300
Maximum270480
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:08.642593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum31300
5-th percentile35700
Q140300
median43600
Q362500
95-th percentile123800
Maximum270480
Range239180
Interquartile range (IQR)22200

Descriptive statistics

Standard deviation31398.275
Coefficient of variation (CV)0.54138061
Kurtosis8.0139681
Mean57996.675
Median Absolute Deviation (MAD)6300
Skewness2.5933557
Sum5.7996675 × 108
Variance9.8585168 × 108
MonotonicityNot monotonic
2023-12-13T07:29:08.806701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
39200 612
 
6.1%
42400 608
 
6.1%
40400 544
 
5.4%
43600 482
 
4.8%
40300 352
 
3.5%
41500 346
 
3.5%
38100 207
 
2.1%
41200 142
 
1.4%
37500 134
 
1.3%
34700 131
 
1.3%
Other values (672) 6442
64.4%
ValueCountFrequency (%)
31300 1
 
< 0.1%
32200 15
 
0.1%
32800 3
 
< 0.1%
33000 29
 
0.3%
33200 19
 
0.2%
33700 68
0.7%
34000 28
 
0.3%
34400 2
 
< 0.1%
34600 1
 
< 0.1%
34700 131
1.3%
ValueCountFrequency (%)
270480 2
 
< 0.1%
266640 5
0.1%
262800 2
 
< 0.1%
259320 4
< 0.1%
247900 1
 
< 0.1%
244440 1
 
< 0.1%
235920 1
 
< 0.1%
230520 2
 
< 0.1%
227260 1
 
< 0.1%
225400 2
 
< 0.1%

보증금(원)
Real number (ℝ)

Distinct463
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3412421.4
Minimum1520000
Maximum18536400
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:09.247654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1520000
5-th percentile1842000
Q11954000
median2144000
Q33869500
95-th percentile8549000
Maximum18536400
Range17016400
Interquartile range (IQR)1915500

Descriptive statistics

Standard deviation2442706.1
Coefficient of variation (CV)0.71582779
Kurtosis6.0764724
Mean3412421.4
Median Absolute Deviation (MAD)299000
Skewness2.3288739
Sum3.4124214 × 1010
Variance5.9668131 × 1012
MonotonicityNot monotonic
2023-12-13T07:29:09.382995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2052000 789
 
7.9%
2062000 728
 
7.3%
1964000 670
 
6.7%
1907000 665
 
6.7%
1954000 598
 
6.0%
1897000 496
 
5.0%
1777000 206
 
2.1%
1725000 203
 
2.0%
1959000 192
 
1.9%
1852000 149
 
1.5%
Other values (453) 5304
53.0%
ValueCountFrequency (%)
1520000 1
 
< 0.1%
1595000 1
 
< 0.1%
1670000 4
 
< 0.1%
1675000 22
 
0.2%
1680000 7
 
0.1%
1725000 203
2.0%
1754000 4
 
< 0.1%
1764000 7
 
0.1%
1777000 206
2.1%
1842000 46
 
0.5%
ValueCountFrequency (%)
18536400 4
 
< 0.1%
17815200 12
0.1%
16330800 1
 
< 0.1%
15576000 1
 
< 0.1%
15511200 8
0.1%
15447600 1
 
< 0.1%
15447000 5
0.1%
14846000 12
0.1%
14773000 2
 
< 0.1%
14734800 8
0.1%

대표나이
Real number (ℝ)

Distinct84
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean66.4926
Minimum22
Maximum121
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:09.520966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum22
5-th percentile45
Q159
median66
Q376
95-th percentile88
Maximum121
Range99
Interquartile range (IQR)17

Descriptive statistics

Standard deviation12.888629
Coefficient of variation (CV)0.19383554
Kurtosis0.14283045
Mean66.4926
Median Absolute Deviation (MAD)8
Skewness-0.14924064
Sum664926
Variance166.11676
MonotonicityNot monotonic
2023-12-13T07:29:09.669729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
64 407
 
4.1%
66 391
 
3.9%
69 361
 
3.6%
59 359
 
3.6%
65 357
 
3.6%
61 333
 
3.3%
60 326
 
3.3%
63 321
 
3.2%
62 319
 
3.2%
67 294
 
2.9%
Other values (74) 6532
65.3%
ValueCountFrequency (%)
22 2
 
< 0.1%
23 3
 
< 0.1%
24 1
 
< 0.1%
25 9
0.1%
26 6
 
0.1%
27 10
0.1%
28 13
0.1%
29 7
 
0.1%
30 13
0.1%
31 18
0.2%
ValueCountFrequency (%)
121 1
 
< 0.1%
105 3
 
< 0.1%
104 3
 
< 0.1%
103 4
 
< 0.1%
101 4
 
< 0.1%
100 5
 
0.1%
99 5
 
0.1%
98 10
0.1%
97 12
0.1%
96 20
0.2%

나이
Real number (ℝ)

Distinct79
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59.4708
Minimum20
Maximum112
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:09.828630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile37
Q151
median59
Q369
95-th percentile81
Maximum112
Range92
Interquartile range (IQR)18

Descriptive statistics

Standard deviation13.252762
Coefficient of variation (CV)0.22284486
Kurtosis-0.043015641
Mean59.4708
Median Absolute Deviation (MAD)9
Skewness-0.1111745
Sum594708
Variance175.63571
MonotonicityNot monotonic
2023-12-13T07:29:09.996248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
58 340
 
3.4%
57 330
 
3.3%
54 326
 
3.3%
56 312
 
3.1%
53 299
 
3.0%
61 298
 
3.0%
63 292
 
2.9%
60 291
 
2.9%
55 289
 
2.9%
59 288
 
2.9%
Other values (69) 6935
69.3%
ValueCountFrequency (%)
20 10
 
0.1%
21 9
 
0.1%
22 14
0.1%
23 18
0.2%
24 25
0.2%
25 20
0.2%
26 19
0.2%
27 25
0.2%
28 21
0.2%
29 29
0.3%
ValueCountFrequency (%)
112 1
 
< 0.1%
99 1
 
< 0.1%
97 2
 
< 0.1%
95 2
 
< 0.1%
94 6
 
0.1%
93 7
 
0.1%
92 11
 
0.1%
91 19
0.2%
90 19
0.2%
89 32
0.3%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
5917 
4083 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
5917
59.2%
4083
40.8%

Length

2023-12-13T07:29:10.136363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:29:10.273440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5917
59.2%
4083
40.8%

결혼여부
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
미혼
8533 
기혼
1467 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미혼
2nd row미혼
3rd row미혼
4th row미혼
5th row미혼

Common Values

ValueCountFrequency (%)
미혼 8533
85.3%
기혼 1467
 
14.7%

Length

2023-12-13T07:29:10.379324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:29:10.472260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미혼 8533
85.3%
기혼 1467
 
14.7%

거주자 수
Real number (ℝ)

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.6551
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:29:10.565765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile3
Maximum9
Range8
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.91532932
Coefficient of variation (CV)0.55303566
Kurtosis3.6445508
Mean1.6551
Median Absolute Deviation (MAD)0
Skewness1.6671595
Sum16551
Variance0.83782777
MonotonicityNot monotonic
2023-12-13T07:29:10.683177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
1 5645
56.5%
2 2792
27.9%
3 1092
 
10.9%
4 354
 
3.5%
5 88
 
0.9%
6 19
 
0.2%
7 5
 
0.1%
8 4
 
< 0.1%
9 1
 
< 0.1%
ValueCountFrequency (%)
1 5645
56.5%
2 2792
27.9%
3 1092
 
10.9%
4 354
 
3.5%
5 88
 
0.9%
6 19
 
0.2%
7 5
 
0.1%
8 4
 
< 0.1%
9 1
 
< 0.1%
ValueCountFrequency (%)
9 1
 
< 0.1%
8 4
 
< 0.1%
7 5
 
0.1%
6 19
 
0.2%
5 88
 
0.9%
4 354
 
3.5%
3 1092
 
10.9%
2 2792
27.9%
1 5645
56.5%

퇴거여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
미퇴거
9530 
퇴거
 
470

Length

Max length3
Median length3
Mean length2.953
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미퇴거
2nd row미퇴거
3rd row미퇴거
4th row미퇴거
5th row미퇴거

Common Values

ValueCountFrequency (%)
미퇴거 9530
95.3%
퇴거 470
 
4.7%

Length

2023-12-13T07:29:10.810042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:29:10.906793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미퇴거 9530
95.3%
퇴거 470
 
4.7%

Sample

순번계약구분재계약횟수거주개월아파트 이름아파트 ID아파트 평점호실고유번호평형대계약자고유번호계약서고유번호입주연도퇴거연도거주연도월세(원)보증금(원)대표나이나이성별결혼여부거주자 수퇴거여부
297844445유효10234비둘기아파트1529775151254298562402002<NA>20183750020620009289미혼1미퇴거
175572588유효9234비둘기아파트1517271101558132600632002<NA>20094870023720007260미혼1미퇴거
301484499유효10234비둘기아파트153026211254421563632002<NA>20153470019640008882미혼1미퇴거
90191311해지5112비둘기아파트155829141961147631022006201520148230059370007265미혼4미퇴거
602299013유효10222용지아파트276021931217584180162003<NA>20194240020520008179미혼1미퇴거
162412416해지5109비둘기아파트151525731954504779422008201720166240031120006257미혼3미퇴거
7385011064유효488지산5단지아파트387384141212644124652014<NA>20194930031030005351미혼1미퇴거
106951559유효10230비둘기아파트15827861265012669262002<NA>20113920019070005141미혼1미퇴거
184132710유효5120비둘기아파트1518403151971350815662011<NA>20205860032680005352기혼5미퇴거
337835065유효9241비둘기아파트153754621952825547742001<NA>20085360029430006956미혼1미퇴거
순번계약구분재계약횟수거주개월아파트 이름아파트 ID아파트 평점호실고유번호평형대계약자고유번호계약서고유번호입주연도퇴거연도거주연도월세(원)보증금(원)대표나이나이성별결혼여부거주자 수퇴거여부
6917710266해지6135용지아파트276917721240905414502003201420095720043200007159미혼1미퇴거
8237312258유효9212지산5단지아파트3882372219874587182003<NA>20105290027710007362미혼1미퇴거
527117931유효490용지아파트275270661246765475092014<NA>20144150019540006659미혼1미퇴거
234913504해지8194비둘기아파트152348651256586585012004202020144840030390007063미혼1미퇴거
90261313유효10234비둘기아파트155842141961155631102002<NA>20086240030310008067미혼2미퇴거
477967197유효11222용지아파트274779231228229286302003<NA>20134040019540008375미혼2미퇴거
5450801유효116비둘기아파트152255101286604867402020<NA>20204240020620006665미혼1미퇴거
219083265유효10234비둘기아파트1521907141249852518022002<NA>20105920041530006756미혼1미퇴거
530697973유효10222용지아파트275305891226270266592003<NA>2020174000116244007170미혼2미퇴거
8650712831해지8197지산5단지아파트38864971215840383762003201920105980027710008675미혼1미퇴거