Overview

Dataset statistics

Number of variables8
Number of observations1000
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows207
Duplicate rows (%)20.7%
Total size in memory65.6 KiB
Average record size in memory67.1 B

Variable types

Categorical3
Numeric3
Text1
DateTime1

Dataset

Description한국주택금융공사 주택연금부 업무 관련 공개 공공데이터이며 을구순위와 등기목적, 접수일자, 금액 관련 정보가 포함되어있습니다. (해당 부서의 업무와 관련된 데이터베이스에서 공개 가능한 원천 데이터)
Author한국주택금융공사
URLhttps://www.data.go.kr/data/15072790/fileData.do

Alerts

Dataset has 207 (20.7%) duplicate rowsDuplicates
등기목적코드 is highly overall correlated with 등기사유내용High correlation
등기사유내용 is highly overall correlated with 등기목적코드High correlation
구분 is highly imbalanced (65.3%)Imbalance
등기목적코드 is highly imbalanced (56.5%)Imbalance
등기사유내용 is highly imbalanced (65.1%)Imbalance

Reproduction

Analysis started2023-12-12 12:56:51.914077
Analysis finished2023-12-12 12:56:53.676019
Duration1.76 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
전유
897 
토지
 
84
건물
 
19

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전유
2nd row전유
3rd row전유
4th row전유
5th row전유

Common Values

ValueCountFrequency (%)
전유 897
89.7%
토지 84
 
8.4%
건물 19
 
1.9%

Length

2023-12-12T21:56:53.748409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:56:53.905747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전유 897
89.7%
토지 84
 
8.4%
건물 19
 
1.9%

등록번호
Real number (ℝ)

Distinct11
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.244329 × 1013
Minimum1.2012 × 1013
Maximum1.3562 × 1013
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-12T21:56:54.004036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.2012 × 1013
5-th percentile1.2012 × 1013
Q11.2422 × 1013
median1.2462 × 1013
Q31.2472 × 1013
95-th percentile1.2492 × 1013
Maximum1.3562 × 1013
Range1.55 × 1012
Interquartile range (IQR)5 × 1010

Descriptive statistics

Standard deviation1.8432474 × 1011
Coefficient of variation (CV)0.014813184
Kurtosis17.519467
Mean1.244329 × 1013
Median Absolute Deviation (MAD)3 × 1010
Skewness2.4448618
Sum1.244329 × 1016
Variance3.3975612 × 1022
MonotonicityNot monotonic
2023-12-12T21:56:54.149596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
12422000000000 276
27.6%
12472000000000 194
19.4%
12462000000000 166
16.6%
12482000000000 113
11.3%
12412000000000 72
 
7.2%
12492000000000 69
 
6.9%
12012000000000 68
 
6.8%
12452000000000 20
 
2.0%
13452000000000 14
 
1.4%
13012000000000 4
 
0.4%
ValueCountFrequency (%)
12012000000000 68
 
6.8%
12412000000000 72
 
7.2%
12422000000000 276
27.6%
12452000000000 20
 
2.0%
12462000000000 166
16.6%
12472000000000 194
19.4%
12482000000000 113
11.3%
12492000000000 69
 
6.9%
13012000000000 4
 
0.4%
13452000000000 14
 
1.4%
ValueCountFrequency (%)
13562000000000 4
 
0.4%
13452000000000 14
 
1.4%
13012000000000 4
 
0.4%
12492000000000 69
 
6.9%
12482000000000 113
11.3%
12472000000000 194
19.4%
12462000000000 166
16.6%
12452000000000 20
 
2.0%
12422000000000 276
27.6%
12412000000000 72
 
7.2%

을구순위
Real number (ℝ)

Distinct56
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.018
Minimum1
Maximum71
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-12T21:56:54.312569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q39
95-th percentile20
Maximum71
Range70
Interquartile range (IQR)6

Descriptive statistics

Standard deviation9.5956111
Coefficient of variation (CV)1.1967587
Kurtosis19.239386
Mean8.018
Median Absolute Deviation (MAD)2
Skewness4.0249271
Sum8018
Variance92.075752
MonotonicityNot monotonic
2023-12-12T21:56:54.474558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3 141
14.1%
5 108
10.8%
4 107
10.7%
2 91
9.1%
7 75
 
7.5%
6 72
 
7.2%
1 60
 
6.0%
8 57
 
5.7%
9 49
 
4.9%
10 47
 
4.7%
Other values (46) 193
19.3%
ValueCountFrequency (%)
1 60
6.0%
2 91
9.1%
3 141
14.1%
4 107
10.7%
5 108
10.8%
6 72
7.2%
7 75
7.5%
8 57
5.7%
9 49
 
4.9%
10 47
 
4.7%
ValueCountFrequency (%)
71 1
0.1%
70 2
0.2%
69 1
0.1%
68 1
0.1%
67 2
0.2%
66 1
0.1%
65 1
0.1%
64 1
0.1%
61 1
0.1%
58 1
0.1%
Distinct108
Distinct (%)10.8%
Missing1
Missing (%)0.1%
Memory size7.9 KiB
2023-12-12T21:56:54.750158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length73
Mean length10.92993
Min length6

Characters and Unicode

Total characters10919
Distinct characters59
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique69 ?
Unique (%)6.9%

Sample

1st row근저당권설정
2nd row근저당권설정
3rd row근저당권설정
4th row5번근저당권설정등기말소
5th row4번근저당권설정등기말소
ValueCountFrequency (%)
근저당권설정 409
33.6%
1번근저당권설정등기말소 134
 
11.0%
등기말소 97
 
8.0%
3번근저당권설정등기말소 54
 
4.4%
2번근저당권설정등기말소 51
 
4.2%
1번근저당권설정 48
 
3.9%
2번근저당권설정 47
 
3.9%
5번근저당권설정등기말소 35
 
2.9%
4번근저당권설정등기말소 28
 
2.3%
6번근저당권설정등기말소 26
 
2.1%
Other values (79) 289
23.7%
2023-12-12T21:56:55.144408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1117
10.2%
1101
10.1%
1099
10.1%
1099
10.1%
1097
10.0%
1096
10.0%
711
 
6.5%
566
 
5.2%
566
 
5.2%
564
 
5.2%
Other values (49) 1903
17.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9743
89.2%
Decimal Number 828
 
7.6%
Space Separator 219
 
2.0%
Other Punctuation 122
 
1.1%
Dash Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1117
11.5%
1101
11.3%
1099
11.3%
1099
11.3%
1097
11.3%
1096
11.2%
711
7.3%
566
5.8%
566
5.8%
564
5.8%
Other values (36) 727
7.5%
Decimal Number
ValueCountFrequency (%)
1 292
35.3%
2 129
15.6%
3 95
 
11.5%
5 69
 
8.3%
4 66
 
8.0%
6 50
 
6.0%
8 41
 
5.0%
7 36
 
4.3%
9 32
 
3.9%
0 18
 
2.2%
Space Separator
ValueCountFrequency (%)
219
100.0%
Other Punctuation
ValueCountFrequency (%)
, 122
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9743
89.2%
Common 1176
 
10.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1117
11.5%
1101
11.3%
1099
11.3%
1099
11.3%
1097
11.3%
1096
11.2%
711
7.3%
566
5.8%
566
5.8%
564
5.8%
Other values (36) 727
7.5%
Common
ValueCountFrequency (%)
1 292
24.8%
219
18.6%
2 129
11.0%
, 122
10.4%
3 95
 
8.1%
5 69
 
5.9%
4 66
 
5.6%
6 50
 
4.3%
8 41
 
3.5%
7 36
 
3.1%
Other values (3) 57
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9743
89.2%
ASCII 1176
 
10.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1117
11.5%
1101
11.3%
1099
11.3%
1099
11.3%
1097
11.3%
1096
11.2%
711
7.3%
566
5.8%
566
5.8%
564
5.8%
Other values (36) 727
7.5%
ASCII
ValueCountFrequency (%)
1 292
24.8%
219
18.6%
2 129
11.0%
, 122
10.4%
3 95
 
8.1%
5 69
 
5.9%
4 66
 
5.6%
6 50
 
4.3%
8 41
 
3.5%
7 36
 
3.1%
Other values (3) 57
 
4.8%

등기목적코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct13
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
F3ZZGZ
452 
F3ZZBZ
420 
ZZZZGZ
97 
F3ZZEZ
 
11
D1ZZGZ
 
6
Other values (8)
 
14

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique5 ?
Unique (%)0.5%

Sample

1st rowF3ZZBZ
2nd rowF3ZZBZ
3rd rowF3ZZBZ
4th rowF3ZZGZ
5th rowF3ZZGZ

Common Values

ValueCountFrequency (%)
F3ZZGZ 452
45.2%
F3ZZBZ 420
42.0%
ZZZZGZ 97
 
9.7%
F3ZZEZ 11
 
1.1%
D1ZZGZ 6
 
0.6%
G1ZZGZ 5
 
0.5%
B1ZZGZ 2
 
0.2%
P2ZZZZ 2
 
0.2%
11ZZBZ 1
 
0.1%
C2ZZEZ 1
 
0.1%
Other values (3) 3
 
0.3%

Length

2023-12-12T21:56:55.292333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
f3zzgz 452
45.2%
f3zzbz 420
42.0%
zzzzgz 97
 
9.7%
f3zzez 11
 
1.1%
d1zzgz 6
 
0.6%
g1zzgz 5
 
0.5%
b1zzgz 2
 
0.2%
p2zzzz 2
 
0.2%
11zzbz 1
 
0.1%
c2zzez 1
 
0.1%
Other values (3) 3
 
0.3%
Distinct514
Distinct (%)51.4%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Minimum1990-02-08 00:00:00
Maximum2020-06-01 00:00:00
2023-12-12T21:56:55.478054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:56:55.663270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

접수번호
Real number (ℝ)

Distinct779
Distinct (%)77.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean87022.895
Minimum-55308
Maximum516283
Zeros0
Zeros (%)0.0%
Negative1
Negative (%)0.1%
Memory size8.9 KiB
2023-12-12T21:56:55.810543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-55308
5-th percentile6706
Q134287.5
median61441.5
Q387895
95-th percentile411451.1
Maximum516283
Range571591
Interquartile range (IQR)53607.5

Descriptive statistics

Standard deviation103883.51
Coefficient of variation (CV)1.1937492
Kurtosis6.8873672
Mean87022.895
Median Absolute Deviation (MAD)26490
Skewness2.7143961
Sum87022895
Variance1.0791783 × 1010
MonotonicityNot monotonic
2023-12-12T21:56:55.997177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
97232 4
 
0.4%
46135 4
 
0.4%
99013 4
 
0.4%
6479 4
 
0.4%
6706 4
 
0.4%
54461 4
 
0.4%
52687 3
 
0.3%
84271 2
 
0.2%
69243 2
 
0.2%
85808 2
 
0.2%
Other values (769) 967
96.7%
ValueCountFrequency (%)
-55308 1
0.1%
126 1
0.1%
127 1
0.1%
235 1
0.1%
261 1
0.1%
535 1
0.1%
621 1
0.1%
1050 2
0.2%
1105 1
0.1%
1117 1
0.1%
ValueCountFrequency (%)
516283 1
0.1%
490709 1
0.1%
477775 2
0.2%
477746 2
0.2%
477327 1
0.1%
475717 2
0.2%
475641 2
0.2%
475483 2
0.2%
475474 2
0.2%
473510 2
0.2%

등기사유내용
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct13
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
해지
543 
설정계약
420 
일부포기
 
10
변경계약
 
6
임의경매로 인한 매각
 
5
Other values (8)
 
16

Length

Max length11
Median length2
Mean length2.949
Min length2

Unique

Unique4 ?
Unique (%)0.4%

Sample

1st row설정계약
2nd row설정계약
3rd row설정계약
4th row해지
5th row해지

Common Values

ValueCountFrequency (%)
해지 543
54.3%
설정계약 420
42.0%
일부포기 10
 
1.0%
변경계약 6
 
0.6%
임의경매로 인한 매각 5
 
0.5%
계약인수 4
 
0.4%
<NA> 4
 
0.4%
매각 2
 
0.2%
상호변경 2
 
0.2%
공매 1
 
0.1%
Other values (3) 3
 
0.3%

Length

2023-12-12T21:56:56.205016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
해지 543
53.7%
설정계약 420
41.5%
일부포기 10
 
1.0%
매각 7
 
0.7%
변경계약 6
 
0.6%
임의경매로 5
 
0.5%
인한 5
 
0.5%
계약인수 4
 
0.4%
na 4
 
0.4%
상호변경 2
 
0.2%
Other values (5) 5
 
0.5%

Interactions

2023-12-12T21:56:53.068262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:56:52.311667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:56:52.697605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:56:53.209329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:56:52.456045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:56:52.827470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:56:53.341298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:56:52.576290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:56:52.942508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:56:56.314959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분등록번호을구순위등기목적코드접수번호등기사유내용
구분1.0000.1240.6400.4180.1880.494
등록번호0.1241.0000.0000.0000.2080.000
을구순위0.6400.0001.0000.0000.2820.000
등기목적코드0.4180.0000.0001.0000.3640.876
접수번호0.1880.2080.2820.3641.0000.400
등기사유내용0.4940.0000.0000.8760.4001.000
2023-12-12T21:56:56.440525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등기목적코드구분등기사유내용
등기목적코드1.0000.2600.622
구분0.2601.0000.256
등기사유내용0.6220.2561.000
2023-12-12T21:56:56.547425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록번호을구순위접수번호구분등기목적코드등기사유내용
등록번호1.0000.1040.0160.1300.0000.000
을구순위0.1041.000-0.0380.4720.0000.000
접수번호0.016-0.0381.0000.1130.1810.180
구분0.1300.4720.1131.0000.2600.256
등기목적코드0.0000.0000.1810.2601.0000.622
등기사유내용0.0000.0000.1800.2560.6221.000

Missing values

2023-12-12T21:56:53.477822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:56:53.613548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분등록번호을구순위등기목적등기목적코드접수일자접수번호등기사유내용
0전유120120000000001근저당권설정F3ZZBZ2019-03-1184271설정계약
1전유124720000000008근저당권설정F3ZZBZ2019-03-0882905설정계약
2전유124720000000008근저당권설정F3ZZBZ2019-03-0882905설정계약
3전유1247200000000075번근저당권설정등기말소F3ZZGZ2018-12-26469691해지
4전유1247200000000064번근저당권설정등기말소F3ZZGZ2015-07-3167845해지
5전유1247200000000031번근저당권설정, 2번근저당권설정 등기말소ZZZZGZ2005-11-0469901해지
6전유1248200000000010근저당권설정F3ZZBZ2019-03-1591895설정계약
7전유1248200000000010근저당권설정F3ZZBZ2019-03-1591895설정계약
8전유1248200000000097번근저당권설정, 8번근저당권설정 등기말소ZZZZGZ2012-09-2845280해지
9전유1248200000000064번근저당권설정등기말소F3ZZGZ2011-05-2525765해지
구분등록번호을구순위등기목적등기목적코드접수일자접수번호등기사유내용
990토지124720000000002616번근저당권설정등기말소F3ZZGZ2000-02-147128해지
991토지124720000000002417번근저당권설정등기말소F3ZZGZ1999-12-1179039해지
992토지12472000000000223번근저당권설정등기말소F3ZZGZ1999-08-1654592해지
993토지124720000000002015번근저당권설정등기말소F3ZZGZ1999-04-2726292해지
994토지12472000000000213번근저당권변경F3ZZEZ2008-06-2455812변경계약
995토지12472000000000113번근저당권변경F3ZZEZ2007-05-1636061변경계약
996토지124720000000001322번박백화지분근저당권설정F3ZZBZ1991-07-0379012설정계약
997토지124720000000001322번박백화지분근저당권설정F3ZZBZ1991-07-0379012설정계약
998토지124720000000001111번근저당권변경F3ZZEZ1990-04-1434339면책적 채무인수
999토지124720000000001117번이상철지분근저당권설정F3ZZBZ1990-02-089487추가설정계약

Duplicate rows

Most frequently occurring

구분등록번호을구순위등기목적등기목적코드접수일자접수번호등기사유내용# duplicates
0건물120120000000003근저당권설정F3ZZBZ2019-03-2097232설정계약2
1건물124120000000008근저당권설정F3ZZBZ2019-02-0846135설정계약2
2건물124520000000001근저당권설정F3ZZBZ2019-03-116479설정계약2
3건물124520000000001근저당권설정F3ZZBZ2019-03-126706설정계약2
4건물124620000000001근저당권설정F3ZZBZ2018-12-31477746설정계약2
5건물124620000000001근저당권설정F3ZZBZ2019-02-1554461설정계약2
6건물124620000000003근저당권설정F3ZZBZ2019-03-2199013설정계약2
7전유120120000000001근저당권설정F3ZZBZ2019-01-099206설정계약2
8전유120120000000001근저당권설정F3ZZBZ2019-01-2126144설정계약2
9전유120120000000001근저당권설정F3ZZBZ2019-01-2228551설정계약2