Overview

Dataset statistics

Number of variables11
Number of observations2424
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory220.3 KiB
Average record size in memory93.1 B

Variable types

Numeric3
DateTime3
Categorical5

Dataset

Description대구도시개발공사 전세임대 우대금리 데이터 입니다. 메타데이터기반 공공데이터 개방자료이기 때문에 가공되지 않은 원본 테이블의 데이터가 등록되었습니다.
URLhttps://www.data.go.kr/data/15120621/fileData.do

Alerts

자격구분 is highly overall correlated with 자격_우대금리High correlation
자격_우대금리 is highly overall correlated with 자녀수 and 1 other fieldsHigh correlation
자녀수 is highly overall correlated with 자격_우대금리 and 1 other fieldsHigh correlation
자녀_우대금리 is highly overall correlated with 자녀수High correlation
등록자번호 is highly overall correlated with 수정자번호High correlation
수정자번호 is highly overall correlated with 등록자번호High correlation
자격구분 is highly imbalanced (74.1%)Imbalance
계약횟수 has 643 (26.5%) zerosZeros
자녀수 has 1507 (62.2%) zerosZeros

Reproduction

Analysis started2023-12-12 13:15:46.199477
Analysis finished2023-12-12 13:15:47.960826
Duration1.76 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

계약자번호
Real number (ℝ)

Distinct1101
Distinct (%)45.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47310105
Minimum12015001
Maximum82023005
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.4 KiB
2023-12-12T22:15:48.030684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12015001
5-th percentile12019004
Q122021003
median52014102
Q362017013
95-th percentile82015012
Maximum82023005
Range70008004
Interquartile range (IQR)39996010

Descriptive statistics

Standard deviation21538909
Coefficient of variation (CV)0.45527078
Kurtosis-0.96657954
Mean47310105
Median Absolute Deviation (MAD)10008900
Skewness-0.12945045
Sum1.1467969 × 1011
Variance4.6392458 × 1014
MonotonicityNot monotonic
2023-12-12T22:15:48.181010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
52019013 6
 
0.2%
52013071 5
 
0.2%
22016008 5
 
0.2%
52013090 5
 
0.2%
52013084 5
 
0.2%
12016004 5
 
0.2%
12017011 5
 
0.2%
22018006 4
 
0.2%
22018017 4
 
0.2%
52015062 4
 
0.2%
Other values (1091) 2376
98.0%
ValueCountFrequency (%)
12015001 3
0.1%
12016001 2
 
0.1%
12016002 4
0.2%
12016004 5
0.2%
12016007 3
0.1%
12016008 3
0.1%
12016009 2
 
0.1%
12016010 3
0.1%
12016011 4
0.2%
12017001 3
0.1%
ValueCountFrequency (%)
82023005 1
< 0.1%
82023004 1
< 0.1%
82023003 1
< 0.1%
82023002 1
< 0.1%
82023001 1
< 0.1%
82022003 1
< 0.1%
82022002 1
< 0.1%
82022001 1
< 0.1%
82021003 1
< 0.1%
82021002 1
< 0.1%

계약횟수
Real number (ℝ)

ZEROS 

Distinct8
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.730198
Minimum0
Maximum7
Zeros643
Zeros (%)26.5%
Negative0
Negative (%)0.0%
Memory size21.4 KiB
2023-12-12T22:15:48.279701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q33
95-th percentile4
Maximum7
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.4286655
Coefficient of variation (CV)0.82572372
Kurtosis-0.67039323
Mean1.730198
Median Absolute Deviation (MAD)1
Skewness0.42077724
Sum4194
Variance2.0410852
MonotonicityNot monotonic
2023-12-12T22:15:48.389476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0 643
26.5%
2 555
22.9%
1 488
20.1%
3 435
17.9%
4 232
 
9.6%
5 65
 
2.7%
6 4
 
0.2%
7 2
 
0.1%
ValueCountFrequency (%)
0 643
26.5%
1 488
20.1%
2 555
22.9%
3 435
17.9%
4 232
 
9.6%
5 65
 
2.7%
6 4
 
0.2%
7 2
 
0.1%
ValueCountFrequency (%)
7 2
 
0.1%
6 4
 
0.2%
5 65
 
2.7%
4 232
 
9.6%
3 435
17.9%
2 555
22.9%
1 488
20.1%
0 643
26.5%
Distinct754
Distinct (%)31.1%
Missing0
Missing (%)0.0%
Memory size19.1 KiB
Minimum2020-01-15 00:00:00
Maximum2023-12-24 00:00:00
2023-12-12T22:15:48.536301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:15:48.675459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

자격구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct7
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size19.1 KiB
수급자
2013 
한부모가정
389 
주거지원시급가구
 
8
장애인(1순위)
 
7
소득50%이하
 
4
Other values (2)
 
3

Length

Max length8
Median length3
Mean length3.3622112
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row수급자
2nd row한부모가정
3rd row수급자
4th row수급자
5th row수급자

Common Values

ValueCountFrequency (%)
수급자 2013
83.0%
한부모가정 389
 
16.0%
주거지원시급가구 8
 
0.3%
장애인(1순위) 7
 
0.3%
소득50%이하 4
 
0.2%
장애인(일반) 2
 
0.1%
일반탈락 1
 
< 0.1%

Length

2023-12-12T22:15:48.795871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:15:49.264941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수급자 2013
83.0%
한부모가정 389
 
16.0%
주거지원시급가구 8
 
0.3%
장애인(1순위 7
 
0.3%
소득50%이하 4
 
0.2%
장애인(일반 2
 
0.1%
일반탈락 1
 
< 0.1%

자녀수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct6
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.55610561
Minimum0
Maximum5
Zeros1507
Zeros (%)62.2%
Negative0
Negative (%)0.0%
Memory size21.4 KiB
2023-12-12T22:15:49.369185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile2
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.83475121
Coefficient of variation (CV)1.501066
Kurtosis2.3112304
Mean0.55610561
Median Absolute Deviation (MAD)0
Skewness1.56681
Sum1348
Variance0.69680959
MonotonicityNot monotonic
2023-12-12T22:15:49.505960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 1507
62.2%
1 579
 
23.9%
2 264
 
10.9%
3 57
 
2.4%
4 15
 
0.6%
5 2
 
0.1%
ValueCountFrequency (%)
0 1507
62.2%
1 579
 
23.9%
2 264
 
10.9%
3 57
 
2.4%
4 15
 
0.6%
5 2
 
0.1%
ValueCountFrequency (%)
5 2
 
0.1%
4 15
 
0.6%
3 57
 
2.4%
2 264
 
10.9%
1 579
 
23.9%
0 1507
62.2%

자격_우대금리
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size19.1 KiB
0.2
2013 
0.0
411 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.2
2nd row0.0
3rd row0.2
4th row0.2
5th row0.2

Common Values

ValueCountFrequency (%)
0.2 2013
83.0%
0.0 411
 
17.0%

Length

2023-12-12T22:15:49.709925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:15:49.868927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0.2 2013
83.0%
0.0 411
 
17.0%

자녀_우대금리
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size19.1 KiB
0.0
1507 
0.2
579 
0.3
264 
0.5
 
74

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row0.2
3rd row0.0
4th row0.3
5th row0.2

Common Values

ValueCountFrequency (%)
0.0 1507
62.2%
0.2 579
 
23.9%
0.3 264
 
10.9%
0.5 74
 
3.1%

Length

2023-12-12T22:15:50.041490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:15:50.214981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0.0 1507
62.2%
0.2 579
 
23.9%
0.3 264
 
10.9%
0.5 74
 
3.1%

등록자번호
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size19.1 KiB
19920113
951 
20200305
667 
20090218
173 
99999992
151 
19920107
117 
Other values (11)
365 

Length

Max length8
Median length8
Mean length7.9962871
Min length1

Unique

Unique4 ?
Unique (%)0.2%

Sample

1st row19920113
2nd row19920113
3rd row19920113
4th row19920113
5th row19920113

Common Values

ValueCountFrequency (%)
19920113 951
39.2%
20200305 667
27.5%
20090218 173
 
7.1%
99999992 151
 
6.2%
19920107 117
 
4.8%
20180271 109
 
4.5%
20190293 103
 
4.2%
20200306 64
 
2.6%
20150230 41
 
1.7%
20050190 39
 
1.6%
Other values (6) 9
 
0.4%

Length

2023-12-12T22:15:50.399503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
19920113 951
39.2%
20200305 667
27.5%
20090218 173
 
7.1%
99999992 151
 
6.2%
19920107 117
 
4.8%
20180271 109
 
4.5%
20190293 103
 
4.2%
20200306 64
 
2.6%
20150230 41
 
1.7%
20050190 39
 
1.6%
Other values (6) 9
 
0.4%
Distinct214
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size19.1 KiB
Minimum2020-01-15 13:12:44
Maximum2023-08-23 15:17:25
2023-12-12T22:15:50.528890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:15:50.698942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

수정자번호
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size19.1 KiB
19920113
954 
20200305
626 
20090218
186 
19920107
163 
20180271
163 
Other values (10)
332 

Length

Max length8
Median length8
Mean length7.9962871
Min length1

Unique

Unique4 ?
Unique (%)0.2%

Sample

1st row19920113
2nd row19920113
3rd row19920113
4th row19920113
5th row19920113

Common Values

ValueCountFrequency (%)
19920113 954
39.4%
20200305 626
25.8%
20090218 186
 
7.7%
19920107 163
 
6.7%
20180271 163
 
6.7%
99999992 140
 
5.8%
20190293 104
 
4.3%
20150230 43
 
1.8%
20050190 25
 
1.0%
20200306 13
 
0.5%
Other values (5) 7
 
0.3%

Length

2023-12-12T22:15:50.868644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
19920113 954
39.4%
20200305 626
25.8%
20090218 186
 
7.7%
19920107 163
 
6.7%
20180271 163
 
6.7%
99999992 140
 
5.8%
20190293 104
 
4.3%
20150230 43
 
1.8%
20050190 25
 
1.0%
20200306 13
 
0.5%
Other values (5) 7
 
0.3%
Distinct237
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size19.1 KiB
Minimum2020-01-15 13:12:44
Maximum2023-08-23 15:17:25
2023-12-12T22:15:51.003947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:15:51.168243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T22:15:47.431484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:15:46.802731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:15:47.123654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:15:47.542530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:15:46.895464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:15:47.229262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:15:47.640588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:15:46.996572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:15:47.331546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:15:51.287226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약자번호계약횟수자격구분자녀수자격_우대금리자녀_우대금리등록자번호수정자번호
계약자번호1.0000.5630.0610.1610.0780.2160.2060.149
계약횟수0.5631.0000.0000.0000.1120.0280.4460.358
자격구분0.0610.0001.0000.3821.0000.4240.0910.065
자녀수0.1610.0000.3821.0000.6831.0000.1780.100
자격_우대금리0.0780.1121.0000.6831.0000.7020.1710.143
자녀_우대금리0.2160.0280.4241.0000.7021.0000.2290.136
등록자번호0.2060.4460.0910.1780.1710.2291.0000.973
수정자번호0.1490.3580.0650.1000.1430.1360.9731.000
2023-12-12T22:15:51.422829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수정자번호자녀_우대금리자격구분자격_우대금리등록자번호
수정자번호1.0000.0770.0290.1300.821
자녀_우대금리0.0771.0000.3050.4980.109
자격구분0.0290.3051.0000.9990.042
자격_우대금리0.1300.4980.9991.0000.134
등록자번호0.8210.1090.0420.1341.000
2023-12-12T22:15:51.535350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약자번호계약횟수자녀수자격구분자격_우대금리자녀_우대금리등록자번호수정자번호
계약자번호1.0000.172-0.0070.0330.0600.0980.0730.064
계약횟수0.1721.000-0.0640.0000.0840.0130.1710.161
자녀수-0.007-0.0641.0000.2390.5001.0000.0860.046
자격구분0.0330.0000.2391.0000.9990.3050.0420.029
자격_우대금리0.0600.0840.5000.9991.0000.4980.1340.130
자녀_우대금리0.0980.0131.0000.3050.4981.0000.1090.077
등록자번호0.0730.1710.0860.0420.1340.1091.0000.821
수정자번호0.0640.1610.0460.0290.1300.0770.8211.000

Missing values

2023-12-12T22:15:47.762066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:15:47.900565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

계약자번호계약횟수적용일자자격구분자녀수자격_우대금리자녀_우대금리등록자번호등록일시수정자번호수정일시
01201901202020-01-15수급자00.20.0199201132020-01-15 13:12:44199201132020-01-15 13:12:44
17201901802020-01-15한부모가정10.00.2199201132020-01-15 13:12:44199201132020-01-15 13:12:44
22201903002020-01-15수급자00.20.0199201132020-01-15 13:12:44199201132020-01-15 13:12:44
35201702612020-01-15수급자20.20.3199201132020-01-15 13:12:44199201132020-01-15 13:12:44
45201503532020-01-15수급자10.20.2199201132020-01-15 13:12:44199201132020-01-15 13:12:44
52201601922020-01-15수급자00.20.0199201132020-01-15 13:12:44199201132020-01-15 13:12:44
68201800402020-01-15수급자00.20.0199201132020-01-15 13:12:44199201132020-01-15 13:12:44
75201605112020-01-15수급자00.20.0199201132020-01-15 13:12:44199201132020-01-15 13:12:44
85201410522020-01-15수급자00.20.0199201132020-01-15 13:12:44199201132020-01-15 13:12:44
91201801502020-01-15한부모가정10.00.2199201132020-01-15 13:12:44199201132020-01-15 13:12:44
계약자번호계약횟수적용일자자격구분자녀수자격_우대금리자녀_우대금리등록자번호등록일시수정자번호수정일시
24141201500142023-10-24수급자10.20.2202003052023-06-09 13:21:36201802712023-08-23 15:17:25
24156201900522023-06-13수급자00.20.0202003052023-06-09 13:21:36202003052023-06-09 13:25:20
24162201902922023-08-09한부모가정20.00.3202003052023-06-09 13:21:36201802712023-08-08 13:44:04
24171202100812023-11-09수급자00.20.0202003052023-06-09 13:21:36201802712023-08-23 15:17:25
24185201901322023-08-06수급자10.20.2202003052023-06-09 13:21:36201802712023-07-20 13:29:12
24191201900922023-06-20수급자00.20.0202003052023-06-09 13:21:36202003052023-06-14 10:25:56
24204201700432023-08-04수급자00.20.0202003052023-06-09 13:21:36201802712023-07-20 13:29:12
24218201301652023-08-27수급자00.20.0202003052023-06-09 13:21:36201802712023-08-23 15:17:25
24225201604942023-06-21수급자00.20.0202003052023-06-09 13:21:36202003052023-06-14 10:25:56
24234201901022023-08-09수급자00.20.0202003052023-06-09 13:21:36201802712023-08-08 13:44:04