Overview

Dataset statistics

Number of variables10
Number of observations392
Missing cells391
Missing cells (%)10.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory33.1 KiB
Average record size in memory86.3 B

Variable types

Numeric6
DateTime3
Categorical1

Dataset

Description대구도시개발공사 전세임대 공유자 데이터 입니다. 메타데이터기반 공공데이터 개방자료이기 때문에 가공되지 않은 원본 테이블의 데이터가 등록되었습니다.
URLhttps://www.data.go.kr/data/15120617/fileData.do

Alerts

해제일자 has constant value ""Constant
전세고객번호_임대인 is highly overall correlated with 일련번호 and 1 other fieldsHigh correlation
일련번호 is highly overall correlated with 전세고객번호_임대인 and 1 other fieldsHigh correlation
전세고객번호 is highly overall correlated with 전세고객번호_임대인 and 1 other fieldsHigh correlation
등록자번호 is highly overall correlated with 지원번호 and 1 other fieldsHigh correlation
지원번호 is highly overall correlated with 등록자번호High correlation
수정자번호 is highly overall correlated with 등록자번호High correlation
해제일자 has 391 (99.7%) missing valuesMissing
지원번호 has 119 (30.4%) zerosZeros

Reproduction

Analysis started2023-12-12 21:13:22.361061
Analysis finished2023-12-12 21:13:26.638899
Duration4.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

계약자번호
Real number (ℝ)

Distinct196
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44772113
Minimum12016002
Maximum82020002
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-13T06:13:26.708013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12016002
5-th percentile12017555
Q122019011
median52014007
Q352018011
95-th percentile82014470
Maximum82020002
Range70004000
Interquartile range (IQR)29999000

Descriptive statistics

Standard deviation22378006
Coefficient of variation (CV)0.49982018
Kurtosis-1.0766632
Mean44772113
Median Absolute Deviation (MAD)20004996
Skewness0.043907588
Sum1.7550668 × 1010
Variance5.0077515 × 1014
MonotonicityNot monotonic
2023-12-13T06:13:26.841522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12017004 8
 
2.0%
52015104 7
 
1.8%
22015001 6
 
1.5%
22016028 5
 
1.3%
52014108 5
 
1.3%
52014100 5
 
1.3%
52013012 5
 
1.3%
52014071 4
 
1.0%
52017029 4
 
1.0%
22016026 4
 
1.0%
Other values (186) 339
86.5%
ValueCountFrequency (%)
12016002 1
 
0.3%
12016008 4
1.0%
12017003 1
 
0.3%
12017004 8
2.0%
12017008 4
1.0%
12017009 2
 
0.5%
12018001 1
 
0.3%
12018004 3
 
0.8%
12018005 1
 
0.3%
12018015 3
 
0.8%
ValueCountFrequency (%)
82020002 1
 
0.3%
82017002 1
 
0.3%
82016013 2
0.5%
82015028 2
0.5%
82015026 3
0.8%
82015025 4
1.0%
82015024 2
0.5%
82015004 4
1.0%
82015001 1
 
0.3%
82014035 1
 
0.3%

전세고객번호_임대인
Real number (ℝ)

HIGH CORRELATION 

Distinct215
Distinct (%)54.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean669124.89
Minimum661371
Maximum678514
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-13T06:13:27.210329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum661371
5-th percentile661509
Q1666197.5
median667825.5
Q3672382
95-th percentile677411
Maximum678514
Range17143
Interquartile range (IQR)6184.5

Descriptive statistics

Standard deviation4932.776
Coefficient of variation (CV)0.007371981
Kurtosis-0.75215935
Mean669124.89
Median Absolute Deviation (MAD)2527
Skewness0.34216843
Sum2.6229696 × 108
Variance24332279
MonotonicityNot monotonic
2023-12-13T06:13:27.314492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
661838 6
 
1.5%
666158 6
 
1.5%
665418 4
 
1.0%
666145 4
 
1.0%
668196 4
 
1.0%
661611 4
 
1.0%
666612 4
 
1.0%
667786 4
 
1.0%
665404 4
 
1.0%
666261 4
 
1.0%
Other values (205) 348
88.8%
ValueCountFrequency (%)
661371 1
0.3%
661375 1
0.3%
661381 1
0.3%
661390 1
0.3%
661400 1
0.3%
661428 1
0.3%
661430 1
0.3%
661443 2
0.5%
661447 1
0.3%
661464 1
0.3%
ValueCountFrequency (%)
678514 1
0.3%
678501 1
0.3%
678497 1
0.3%
678495 1
0.3%
678480 1
0.3%
678477 1
0.3%
678462 2
0.5%
678460 1
0.3%
678424 1
0.3%
678415 1
0.3%

일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct196
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean76.183673
Minimum1
Maximum198
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-13T06:13:27.431234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q128
median70
Q3117
95-th percentile178.45
Maximum198
Range197
Interquartile range (IQR)89

Descriptive statistics

Standard deviation55.769043
Coefficient of variation (CV)0.73203405
Kurtosis-0.85794046
Mean76.183673
Median Absolute Deviation (MAD)44.5
Skewness0.38564494
Sum29864
Variance3110.1861
MonotonicityNot monotonic
2023-12-13T06:13:27.552056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 42
 
10.7%
5 4
 
1.0%
55 4
 
1.0%
50 4
 
1.0%
41 4
 
1.0%
22 4
 
1.0%
24 4
 
1.0%
21 4
 
1.0%
3 4
 
1.0%
58 4
 
1.0%
Other values (186) 314
80.1%
ValueCountFrequency (%)
1 42
10.7%
2 3
 
0.8%
3 4
 
1.0%
4 2
 
0.5%
5 4
 
1.0%
6 1
 
0.3%
7 3
 
0.8%
8 1
 
0.3%
9 1
 
0.3%
10 1
 
0.3%
ValueCountFrequency (%)
198 1
0.3%
197 1
0.3%
196 1
0.3%
195 1
0.3%
194 1
0.3%
193 1
0.3%
192 1
0.3%
191 1
0.3%
190 1
0.3%
189 1
0.3%

전세고객번호
Real number (ℝ)

HIGH CORRELATION 

Distinct221
Distinct (%)56.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean669667.68
Minimum661561
Maximum678515
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-13T06:13:27.662030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum661561
5-th percentile661949
Q1666315
median668190
Q3672453
95-th percentile677418.05
Maximum678515
Range16954
Interquartile range (IQR)6138

Descriptive statistics

Standard deviation4871.1627
Coefficient of variation (CV)0.0072739999
Kurtosis-0.89946809
Mean669667.68
Median Absolute Deviation (MAD)2423
Skewness0.27244056
Sum2.6250973 × 108
Variance23728226
MonotonicityNot monotonic
2023-12-13T06:13:27.784139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
665452 4
 
1.0%
661561 4
 
1.0%
668197 4
 
1.0%
661959 4
 
1.0%
667787 4
 
1.0%
666666 4
 
1.0%
665443 4
 
1.0%
666264 4
 
1.0%
666206 4
 
1.0%
666148 4
 
1.0%
Other values (211) 352
89.8%
ValueCountFrequency (%)
661561 4
1.0%
661939 1
 
0.3%
661940 1
 
0.3%
661941 1
 
0.3%
661942 1
 
0.3%
661943 2
0.5%
661944 1
 
0.3%
661945 1
 
0.3%
661946 1
 
0.3%
661947 1
 
0.3%
ValueCountFrequency (%)
678515 1
0.3%
678502 1
0.3%
678498 1
0.3%
678496 1
0.3%
678481 1
0.3%
678478 1
0.3%
678464 1
0.3%
678463 1
0.3%
678461 1
0.3%
678425 1
0.3%

해제일자
Date

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing391
Missing (%)99.7%
Memory size3.2 KiB
Minimum2014-07-30 00:00:00
Maximum2014-07-30 00:00:00
2023-12-13T06:13:27.875394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:27.948041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

등록자번호
Real number (ℝ)

HIGH CORRELATION 

Distinct19
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20927577
Minimum19880040
Maximum99999992
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-13T06:13:28.042369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19880040
5-th percentile19920107
Q120080211
median20150230
Q320192796
95-th percentile20200305
Maximum99999992
Range80119952
Interquartile range (IQR)112585

Descriptive statistics

Standard deviation8039417.1
Coefficient of variation (CV)0.38415422
Kurtosis94.195573
Mean20927577
Median Absolute Deviation (MAD)50075
Skewness9.7826804
Sum8.2036102 × 109
Variance6.4632227 × 1013
MonotonicityNot monotonic
2023-12-13T06:13:28.139352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
20200305 85
21.7%
20050190 40
10.2%
19920113 34
 
8.7%
20150230 34
 
8.7%
20090219 33
 
8.4%
20190293 32
 
8.2%
20090218 26
 
6.6%
19920107 17
 
4.3%
20139023 16
 
4.1%
20139024 11
 
2.8%
Other values (9) 64
16.3%
ValueCountFrequency (%)
19880040 7
 
1.8%
19920107 17
4.3%
19920113 34
8.7%
20050190 40
10.2%
20090218 26
6.6%
20090219 33
8.4%
20139023 16
 
4.1%
20139024 11
 
2.8%
20150230 34
8.7%
20150237 6
 
1.5%
ValueCountFrequency (%)
99999992 4
 
1.0%
20200306 9
 
2.3%
20200305 85
21.7%
20190293 32
 
8.2%
20180271 7
 
1.8%
20179076 10
 
2.6%
20179075 6
 
1.5%
20159051 9
 
2.3%
20159042 6
 
1.5%
20150237 6
 
1.5%
Distinct359
Distinct (%)91.6%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
Minimum2013-08-30 14:08:59
Maximum2023-08-18 11:37:21
2023-12-13T06:13:28.248135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:28.354348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

수정자번호
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
20200305
99 
20050190
40 
19920113
32 
20090219
31 
20150230
31 
Other values (15)
159 

Length

Max length8
Median length8
Mean length7.755102
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20159051
2nd row20179075
3rd row20150237
4th row20200305
5th row20179076

Common Values

ValueCountFrequency (%)
20200305 99
25.3%
20050190 40
10.2%
19920113 32
 
8.2%
20090219 31
 
7.9%
20150230 31
 
7.9%
20190293 29
 
7.4%
20090218 24
 
6.1%
자료이관 24
 
6.1%
19920107 17
 
4.3%
20200306 11
 
2.8%
Other values (10) 54
13.8%

Length

2023-12-13T06:13:28.465348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
20200305 99
25.3%
20050190 40
10.2%
19920113 32
 
8.2%
20090219 31
 
7.9%
20150230 31
 
7.9%
20190293 29
 
7.4%
20090218 24
 
6.1%
자료이관 24
 
6.1%
19920107 17
 
4.3%
20200306 11
 
2.8%
Other values (10) 54
13.8%
Distinct328
Distinct (%)83.7%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
Minimum2013-08-30 14:10:30
Maximum2023-08-18 11:37:21
2023-12-13T06:13:28.593817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:28.696171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

지원번호
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct7
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.5255102
Minimum0
Maximum6
Zeros119
Zeros (%)30.4%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-13T06:13:28.791035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32.25
95-th percentile4
Maximum6
Range6
Interquartile range (IQR)2.25

Descriptive statistics

Standard deviation1.4139829
Coefficient of variation (CV)0.92689178
Kurtosis-0.42004978
Mean1.5255102
Median Absolute Deviation (MAD)1
Skewness0.68440724
Sum598
Variance1.9993476
MonotonicityNot monotonic
2023-12-13T06:13:28.887288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
0 119
30.4%
1 102
26.0%
2 73
18.6%
3 54
13.8%
4 33
 
8.4%
5 10
 
2.6%
6 1
 
0.3%
ValueCountFrequency (%)
0 119
30.4%
1 102
26.0%
2 73
18.6%
3 54
13.8%
4 33
 
8.4%
5 10
 
2.6%
6 1
 
0.3%
ValueCountFrequency (%)
6 1
 
0.3%
5 10
 
2.6%
4 33
 
8.4%
3 54
13.8%
2 73
18.6%
1 102
26.0%
0 119
30.4%

Interactions

2023-12-13T06:13:25.817962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:22.646148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:23.178472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:23.708485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:24.417351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:25.126474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:25.918096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:22.730269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:23.272420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:23.801768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:24.549126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:25.242264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:26.032109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:22.826221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:23.370637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:23.896967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:24.672447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:25.372217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:26.144125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:22.917541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:23.459455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:24.047536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:24.790921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:25.486448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:26.222100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:22.992810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:23.540996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:24.178605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:24.887471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:25.591395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:26.306939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:23.076343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:23.627378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:24.301274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:25.011497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:13:25.714847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:13:28.959297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약자번호전세고객번호_임대인일련번호전세고객번호등록자번호수정자번호지원번호
계약자번호1.0000.4810.4510.4470.1240.4520.332
전세고객번호_임대인0.4811.0000.8840.9950.1470.7980.348
일련번호0.4510.8841.0000.9210.1900.8450.321
전세고객번호0.4470.9950.9211.0000.1460.8160.337
등록자번호0.1240.1470.1900.1461.0001.0000.000
수정자번호0.4520.7980.8450.8161.0001.0000.637
지원번호0.3320.3480.3210.3370.0000.6371.000
2023-12-13T06:13:29.048743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약자번호전세고객번호_임대인일련번호전세고객번호등록자번호지원번호수정자번호
계약자번호1.000-0.183-0.195-0.1950.0160.1520.195
전세고객번호_임대인-0.1831.0000.9030.903-0.196-0.2530.468
일련번호-0.1950.9031.0000.999-0.176-0.1760.437
전세고객번호-0.1950.9030.9991.000-0.175-0.1750.488
등록자번호0.016-0.196-0.176-0.1751.0000.5550.977
지원번호0.152-0.253-0.176-0.1750.5551.0000.333
수정자번호0.1950.4680.4370.4880.9770.3331.000

Missing values

2023-12-13T06:13:26.436221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:13:26.587409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

계약자번호전세고객번호_임대인일련번호전세고객번호해제일자등록자번호등록일시수정자번호수정일시지원번호
0520141006654185665452<NA>201590512016-11-11 13:27:23201590512016-11-11 13:27:231
18201500466614521666148<NA>201790752017-04-25 10:48:42201790752017-04-25 10:48:421
28201403166550213665504<NA>201502372016-12-16 13:59:57201502372016-12-16 13:59:571
35201301266628825666289<NA>201790762017-07-04 11:08:06202003052021-07-13 00:00:002
45201602466629126666293<NA>201790762017-07-07 16:31:00201790762017-07-07 16:31:001
55201702066629627666299<NA>201790752017-07-14 15:59:21201790752017-07-14 15:59:210
65201302466175330666341<NA>201502372017-08-14 09:50:23201502372017-08-14 09:50:232
71201800466775151667752<NA>199201132018-06-01 16:44:02199201132018-06-01 16:44:020
87201800966777853667779<NA>200902192018-06-14 15:29:27200902192018-06-14 15:29:270
92201802066778454667785<NA>199201132018-06-20 14:22:21199201132018-06-20 14:22:210
계약자번호전세고객번호_임대인일련번호전세고객번호해제일자등록자번호등록일시수정자번호수정일시지원번호
38252022008677307165677308<NA>200501902022-09-21 13:51:29200501902022-09-21 13:51:290
3831201900266968181669682<NA>202003052023-02-06 14:57:35202003052023-02-06 14:57:352
38422021001674968119674969<NA>202003052023-02-09 13:25:40202003052023-02-09 13:25:401
38552015020672448115672449<NA>202003052023-03-23 11:20:08202003052023-03-23 11:20:084
38612021004674980120674981<NA>202003052023-04-20 15:34:41202003052023-04-20 15:34:411
38712017004674991143676147<NA>202003052023-05-02 11:43:36202003052023-05-02 11:43:364
3887201900466965977669660<NA>202003052023-05-16 16:39:03202003052023-05-16 16:39:032
38932021005676111131676112<NA>202003052023-06-22 16:49:44202003052023-06-22 16:49:441
39032021005678411187678412<NA>202003052023-06-22 16:59:21202003052023-06-22 16:59:211
39112023006678424189678425<NA>198800402023-06-28 17:54:25198800402023-06-28 17:54:250