Overview

Dataset statistics

Number of variables8
Number of observations352
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows2
Duplicate rows (%)0.6%
Total size in memory23.2 KiB
Average record size in memory67.4 B

Variable types

Numeric3
Categorical5

Dataset

Description철거정보(매수완료번호,토지고유코드,철거순번,계약일자,철거준공일,공사명,폐기물착공일,폐기물준공일 등)
URLhttps://www.data.go.kr/data/15069239/fileData.do

Alerts

Dataset has 2 (0.6%) duplicate rowsDuplicates
철거준공일 is highly overall correlated with 철거순번 and 4 other fieldsHigh correlation
폐기물준공일 is highly overall correlated with 철거순번 and 4 other fieldsHigh correlation
폐기물착공일 is highly overall correlated with 철거순번 and 4 other fieldsHigh correlation
계약일자 is highly overall correlated with 철거순번 and 4 other fieldsHigh correlation
매수완료번호 is highly overall correlated with 철거순번High correlation
철거순번 is highly overall correlated with 매수완료번호 and 5 other fieldsHigh correlation
공사명 is highly overall correlated with 철거순번 and 4 other fieldsHigh correlation
공사명 is highly imbalanced (52.1%)Imbalance

Reproduction

Analysis started2023-12-12 08:01:59.938899
Analysis finished2023-12-12 08:02:02.542674
Duration2.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

매수완료번호
Real number (ℝ)

HIGH CORRELATION 

Distinct301
Distinct (%)85.8%
Missing1
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean4810.3447
Minimum2
Maximum1325000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T17:02:02.627701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile97
Q1399.5
median937
Q31390
95-th percentile1759.5
Maximum1325000
Range1324998
Interquartile range (IQR)990.5

Descriptive statistics

Standard deviation70714.598
Coefficient of variation (CV)14.700526
Kurtosis350.06505
Mean4810.3447
Median Absolute Deviation (MAD)468
Skewness18.698486
Sum1688431
Variance5.0005543 × 109
MonotonicityNot monotonic
2023-12-12T17:02:02.851426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1434 6
 
1.7%
736 6
 
1.7%
314 3
 
0.9%
219 3
 
0.9%
181 3
 
0.9%
1418 3
 
0.9%
322 3
 
0.9%
237 3
 
0.9%
196 2
 
0.6%
35 2
 
0.6%
Other values (291) 317
90.1%
ValueCountFrequency (%)
2 1
0.3%
12 1
0.3%
13 1
0.3%
24 1
0.3%
26 1
0.3%
28 1
0.3%
35 2
0.6%
36 1
0.3%
53 1
0.3%
57 1
0.3%
ValueCountFrequency (%)
1325000 1
0.3%
48000 1
0.3%
1815 1
0.3%
1810 1
0.3%
1802 1
0.3%
1798 1
0.3%
1791 2
0.6%
1790 2
0.6%
1789 1
0.3%
1788 2
0.6%

토지고유코드
Real number (ℝ)

Distinct330
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.3242583 × 1018
Minimum3.0110121 × 1018
Maximum4.574034 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T17:02:03.022713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.0110121 × 1018
5-th percentile3.0110128 × 1018
Q14.373031 × 1018
median4.373038 × 1018
Q34.572025 × 1018
95-th percentile4.573035 × 1018
Maximum4.574034 × 1018
Range1.5630219 × 1018
Interquartile range (IQR)1.98994 × 1017

Descriptive statistics

Standard deviation3.9608624 × 1017
Coefficient of variation (CV)0.091596343
Kurtosis6.8112351
Mean4.3242583 × 1018
Median Absolute Deviation (MAD)1.0019995 × 1015
Skewness-2.8502878
Sum-8.9408535 × 1018
Variance1.5688431 × 1035
MonotonicityNot monotonic
2023-12-12T17:02:03.198545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4572025033102030001 3
 
0.9%
4574034028106160003 2
 
0.6%
4572034024107790000 2
 
0.6%
4373033030102250034 2
 
0.6%
4372038527102670001 2
 
0.6%
4373033022103410002 2
 
0.6%
4373033027101470003 2
 
0.6%
4374040025102720001 2
 
0.6%
4471032033106910001 2
 
0.6%
4373038025105000002 2
 
0.6%
Other values (320) 331
94.0%
ValueCountFrequency (%)
3011012100102000007 1
0.3%
3011012100103390003 1
0.3%
3011012100103830002 1
0.3%
3011012300101240001 1
0.3%
3011012300102010001 1
0.3%
3011012500104900000 1
0.3%
3011012500105520003 1
0.3%
3011012600102530002 1
0.3%
3011012600103400000 1
0.3%
3011012700100100000 1
0.3%
ValueCountFrequency (%)
4574034028106160004 1
0.3%
4574034028106160003 2
0.6%
4574034028102200001 1
0.3%
4574034028101640003 1
0.3%
4574034028101640001 1
0.3%
4574034027114460082 1
0.3%
4574034026102970000 1
0.3%
4574033522106280005 1
0.3%
4574033521100010007 1
0.3%
4574025030120210000 1
0.3%

철거순번
Real number (ℝ)

HIGH CORRELATION 

Distinct350
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean180.94318
Minimum1
Maximum365
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T17:02:03.387742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile17.55
Q189.75
median179.5
Q3271.25
95-th percentile347.45
Maximum365
Range364
Interquartile range (IQR)181.5

Descriptive statistics

Standard deviation106.20049
Coefficient of variation (CV)0.58692727
Kurtosis-1.1943174
Mean180.94318
Median Absolute Deviation (MAD)91
Skewness0.025168454
Sum63692
Variance11278.544
MonotonicityNot monotonic
2023-12-12T17:02:03.590964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 2
 
0.6%
296 2
 
0.6%
9 1
 
0.3%
231 1
 
0.3%
12 1
 
0.3%
192 1
 
0.3%
19 1
 
0.3%
230 1
 
0.3%
48 1
 
0.3%
15 1
 
0.3%
Other values (340) 340
96.6%
ValueCountFrequency (%)
1 2
0.6%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
365 1
0.3%
364 1
0.3%
363 1
0.3%
362 1
0.3%
361 1
0.3%
360 1
0.3%
359 1
0.3%
358 1
0.3%
357 1
0.3%
356 1
0.3%

계약일자
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2006-05-18
37 
2011-06-17
32 
2016-11-16
28 
2013-06-04
27 
2012-06-05
27 
Other values (25)
201 

Length

Max length10
Median length10
Mean length9.9659091
Min length4

Unique

Unique7 ?
Unique (%)2.0%

Sample

1st row2011-06-17
2nd row2011-06-17
3rd row2011-06-17
4th row2008-04-11
5th row2011-06-17

Common Values

ValueCountFrequency (%)
2006-05-18 37
 
10.5%
2011-06-17 32
 
9.1%
2016-11-16 28
 
8.0%
2013-06-04 27
 
7.7%
2012-06-05 27
 
7.7%
2014-07-07 24
 
6.8%
2006-05-17 21
 
6.0%
2015-08-26 21
 
6.0%
2009-12-04 15
 
4.3%
2009-02-09 15
 
4.3%
Other values (20) 105
29.8%

Length

2023-12-12T17:02:03.759944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2006-05-18 37
 
10.5%
2011-06-17 32
 
9.1%
2016-11-16 28
 
8.0%
2013-06-04 27
 
7.7%
2012-06-05 27
 
7.7%
2014-07-07 24
 
6.8%
2006-05-17 21
 
6.0%
2015-08-26 21
 
6.0%
2009-12-04 15
 
4.3%
2009-02-09 15
 
4.3%
Other values (20) 105
29.8%

철거준공일
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2006-12-31
37 
2011-12-16
32 
2016-12-30
28 
2013-10-31
27 
2012-10-25
27 
Other values (26)
201 

Length

Max length10
Median length10
Mean length9.9659091
Min length4

Unique

Unique8 ?
Unique (%)2.3%

Sample

1st row2011-12-16
2nd row2011-12-16
3rd row2011-12-16
4th row2008-07-04
5th row2011-12-16

Common Values

ValueCountFrequency (%)
2006-12-31 37
 
10.5%
2011-12-16 32
 
9.1%
2016-12-30 28
 
8.0%
2013-10-31 27
 
7.7%
2012-10-25 27
 
7.7%
2014-11-03 24
 
6.8%
2006-12-30 21
 
6.0%
2015-11-21 21
 
6.0%
2009-03-10 15
 
4.3%
2010-01-02 15
 
4.3%
Other values (21) 105
29.8%

Length

2023-12-12T17:02:03.919098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2006-12-31 37
 
10.5%
2011-12-16 32
 
9.1%
2016-12-30 28
 
8.0%
2013-10-31 27
 
7.7%
2012-10-25 27
 
7.7%
2014-11-03 24
 
6.8%
2006-12-30 21
 
6.0%
2015-11-21 21
 
6.0%
2009-03-10 15
 
4.3%
2010-01-02 15
 
4.3%
Other values (21) 105
29.8%

공사명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
금강수계 매수토지내 지장물 철거공사
275 
2016년 철거 및 폐기물 처리
34 
2015년 철거 및 폐기물 처리
 
22
<NA>
 
20
충청환경산업(주)
 
1

Length

Max length19
Median length19
Mean length17.801136
Min length4

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row금강수계 매수토지내 지장물 철거공사
2nd row금강수계 매수토지내 지장물 철거공사
3rd row금강수계 매수토지내 지장물 철거공사
4th row금강수계 매수토지내 지장물 철거공사
5th row금강수계 매수토지내 지장물 철거공사

Common Values

ValueCountFrequency (%)
금강수계 매수토지내 지장물 철거공사 275
78.1%
2016년 철거 및 폐기물 처리 34
 
9.7%
2015년 철거 및 폐기물 처리 22
 
6.2%
<NA> 20
 
5.7%
충청환경산업(주) 1
 
0.3%

Length

2023-12-12T17:02:04.072304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:02:04.207692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
금강수계 275
19.6%
매수토지내 275
19.6%
지장물 275
19.6%
철거공사 275
19.6%
철거 56
 
4.0%
56
 
4.0%
폐기물 56
 
4.0%
처리 56
 
4.0%
2016년 34
 
2.4%
2015년 22
 
1.6%
Other values (2) 21
 
1.5%

폐기물착공일
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2006-05-18
42 
2011-06-17
31 
2016-11-16
28 
2012-06-05
27 
2013-06-04
27 
Other values (25)
197 

Length

Max length10
Median length10
Mean length9.6590909
Min length4

Unique

Unique7 ?
Unique (%)2.0%

Sample

1st row2011-06-17
2nd row2011-06-17
3rd row2011-06-17
4th row2008-04-11
5th row2011-06-17

Common Values

ValueCountFrequency (%)
2006-05-18 42
11.9%
2011-06-17 31
 
8.8%
2016-11-16 28
 
8.0%
2012-06-05 27
 
7.7%
2013-06-04 27
 
7.7%
2014-07-07 23
 
6.5%
2015-08-26 21
 
6.0%
<NA> 20
 
5.7%
2009-12-04 15
 
4.3%
2009-02-09 15
 
4.3%
Other values (20) 103
29.3%

Length

2023-12-12T17:02:04.369629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2006-05-18 42
11.9%
2011-06-17 31
 
8.8%
2016-11-16 28
 
8.0%
2012-06-05 27
 
7.7%
2013-06-04 27
 
7.7%
2014-07-07 23
 
6.5%
2015-08-26 21
 
6.0%
na 20
 
5.7%
2009-12-04 15
 
4.3%
2009-02-09 15
 
4.3%
Other values (20) 103
29.3%

폐기물준공일
Categorical

HIGH CORRELATION 

Distinct35
Distinct (%)9.9%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2006-12-31
45 
2011-12-16
31 
2016-12-30
28 
2013-10-31
27 
2012-10-25
27 
Other values (30)
194 

Length

Max length10
Median length10
Mean length9.6590909
Min length4

Unique

Unique13 ?
Unique (%)3.7%

Sample

1st row2011-12-16
2nd row2011-12-16
3rd row2011-12-16
4th row2008-07-04
5th row2011-12-16

Common Values

ValueCountFrequency (%)
2006-12-31 45
12.8%
2011-12-16 31
 
8.8%
2016-12-30 28
 
8.0%
2013-10-31 27
 
7.7%
2012-10-25 27
 
7.7%
2014-11-03 22
 
6.2%
2015-11-21 21
 
6.0%
<NA> 20
 
5.7%
2010-01-02 15
 
4.3%
2009-03-10 15
 
4.3%
Other values (25) 101
28.7%

Length

2023-12-12T17:02:04.528451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2006-12-31 45
12.8%
2011-12-16 31
 
8.8%
2016-12-30 28
 
8.0%
2013-10-31 27
 
7.7%
2012-10-25 27
 
7.7%
2014-11-03 22
 
6.2%
2015-11-21 21
 
6.0%
na 20
 
5.7%
2010-01-02 15
 
4.3%
2009-03-10 15
 
4.3%
Other values (25) 101
28.7%

Interactions

2023-12-12T17:02:01.498461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:02:00.616070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:02:01.060963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:02:01.639136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:02:00.767255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:02:01.193437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:02:01.792491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:02:00.908190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:02:01.310046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:02:04.644114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
매수완료번호토지고유코드철거순번계약일자철거준공일공사명폐기물착공일폐기물준공일
매수완료번호1.0000.0000.0160.0000.0000.0000.0000.000
토지고유코드0.0001.0000.4730.6350.6770.1340.6330.679
철거순번0.0160.4731.0000.9830.9950.8490.9810.981
계약일자0.0000.6350.9831.0001.0000.9450.9990.999
철거준공일0.0000.6770.9951.0001.0000.9440.9980.999
공사명0.0000.1340.8490.9450.9441.0000.9450.948
폐기물착공일0.0000.6330.9810.9990.9980.9451.0000.997
폐기물준공일0.0000.6790.9810.9990.9990.9480.9971.000
2023-12-12T17:02:04.806490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
철거준공일폐기물준공일폐기물착공일계약일자공사명
철거준공일1.0000.9670.9540.9980.784
폐기물준공일0.9671.0000.9310.9650.770
폐기물착공일0.9540.9311.0000.9730.780
계약일자0.9980.9650.9731.0000.786
공사명0.7840.7700.7800.7861.000
2023-12-12T17:02:04.942446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
매수완료번호토지고유코드철거순번계약일자철거준공일공사명폐기물착공일폐기물준공일
매수완료번호1.0000.1550.5160.0000.0000.0000.0000.000
토지고유코드0.1551.0000.1750.3950.3970.1270.3920.425
철거순번0.5160.1751.0000.8560.8540.6900.8410.833
계약일자0.0000.3950.8561.0000.9980.7860.9730.965
철거준공일0.0000.3970.8540.9981.0000.7840.9540.967
공사명0.0000.1270.6900.7860.7841.0000.7800.770
폐기물착공일0.0000.3920.8410.9730.9540.7801.0000.931
폐기물준공일0.0000.4250.8330.9650.9670.7700.9311.000

Missing values

2023-12-12T17:02:02.311706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:02:02.485678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

매수완료번호토지고유코드철거순번계약일자철거준공일공사명폐기물착공일폐기물준공일
0101143730360331003600001162011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
1101343740390271004600011312011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
2104743730310251061200041352011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
310545730350251009200022732008-04-112008-07-04금강수계 매수토지내 지장물 철거공사2008-04-112008-07-04
4105245740250221032700001422011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
5105543730310251061000221362011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
6105743730330221034300021372011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
7105945730350251030500001412011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
8106043730250391025100001382011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
9106345720330231209800001082012-06-052012-10-25금강수계 매수토지내 지장물 철거공사2012-06-052012-10-25
매수완료번호토지고유코드철거순번계약일자철거준공일공사명폐기물착공일폐기물준공일
34293945720340251077800521462011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
34394545740340281016400011432011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
34495301101260010340000022006-05-172006-12-30금강수계 매수토지내 지장물 철거공사2006-05-182006-12-31
34595343740400211040700051132011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-162011-12-16
34695744710320331034700021272011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
34796543740400251026900011142011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
3489943720370391032600011872006-05-182006-12-31금강수계 매수토지내 지장물 철거공사2006-05-182006-12-31
349994372037039103260001222006-05-182006-12-31금강수계 매수토지내 지장물 철거공사2006-05-182006-12-31
35099345720310211029300091452011-06-172011-12-16금강수계 매수토지내 지장물 철거공사2011-06-172011-12-16
351<NA>4373036024100470001252006-05-182006-12-31<NA><NA><NA>

Duplicate rows

Most frequently occurring

매수완료번호토지고유코드철거순번계약일자철거준공일공사명폐기물착공일폐기물준공일# duplicates
0260301101280010020000612006-05-172006-12-30금강수계 매수토지내 지장물 철거공사2006-05-182006-12-312
1127743740400251027200012962013-06-042013-10-31금강수계 매수토지내 지장물 철거공사2013-06-042013-10-312