Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.1 KiB
Average record size in memory52.3 B

Variable types

Numeric2
Categorical3
DateTime1

Dataset

Description한국주택금융공사 채권관리부 업무 관련 공개 데이터 (해당 부서의 업무와 관련된 데이터베이스에서 공개 가능한 원천 데이터)
Author한국주택금융공사
URLhttps://www.data.go.kr/data/15072836/fileData.do

Alerts

최초재산조사년도 is highly overall correlated with 등록부점코드High correlation
등록부점코드 is highly overall correlated with 최초재산조사년도High correlation
등록자사번 is highly overall correlated with 비고High correlation
비고 is highly overall correlated with 등록자사번High correlation
재산번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:32:18.954234
Analysis finished2023-12-12 21:32:19.879978
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

재산번호
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean921822.88
Minimum921773
Maximum921873
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-13T06:32:19.967618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum921773
5-th percentile921777.95
Q1921797.75
median921822.5
Q3921848.25
95-th percentile921868.05
Maximum921873
Range100
Interquartile range (IQR)50.5

Descriptive statistics

Standard deviation29.422825
Coefficient of variation (CV)3.1918089 × 10-5
Kurtosis-1.2125745
Mean921822.88
Median Absolute Deviation (MAD)25.5
Skewness0.011785926
Sum92182288
Variance865.70263
MonotonicityStrictly decreasing
2023-12-13T06:32:20.096405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
921873 1
 
1.0%
921808 1
 
1.0%
921798 1
 
1.0%
921799 1
 
1.0%
921800 1
 
1.0%
921801 1
 
1.0%
921802 1
 
1.0%
921803 1
 
1.0%
921804 1
 
1.0%
921805 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
921773 1
1.0%
921774 1
1.0%
921775 1
1.0%
921776 1
1.0%
921777 1
1.0%
921778 1
1.0%
921779 1
1.0%
921780 1
1.0%
921781 1
1.0%
921782 1
1.0%
ValueCountFrequency (%)
921873 1
1.0%
921872 1
1.0%
921871 1
1.0%
921870 1
1.0%
921869 1
1.0%
921868 1
1.0%
921867 1
1.0%
921866 1
1.0%
921865 1
1.0%
921864 1
1.0%

최초재산조사년도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2020
88 
2019
12 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 88
88.0%
2019 12
 
12.0%

Length

2023-12-13T06:32:20.256373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:32:20.349841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 88
88.0%
2019 12
 
12.0%

비고
Categorical

HIGH CORRELATION 

Distinct18
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
<NA>
32 
보증목적물
29 
현주소지
10 
주민등록초본
행자부 주민정보
Other values (13)
15 

Length

Max length42
Median length21
Mean length5.71
Min length4

Unique

Unique12 ?
Unique (%)12.0%

Sample

1st row현주소지
2nd row보증목적물
3rd row<NA>
4th row주민등록초본
5th row보증목적물

Common Values

ValueCountFrequency (%)
<NA> 32
32.0%
보증목적물 29
29.0%
현주소지 10
 
10.0%
주민등록초본 8
 
8.0%
행자부 주민정보 6
 
6.0%
현주소지 3
 
3.0%
주민초본 1
 
1.0%
보증목적물 임차보증금 1
 
1.0%
행자부 주민정보 1
 
1.0%
18.07.09 전입 1
 
1.0%
Other values (8) 8
 
8.0%

Length

2023-12-13T06:32:20.491374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 32
26.4%
보증목적물 32
26.4%
현주소지 14
11.6%
주민등록초본 9
 
7.4%
행자부 7
 
5.8%
주민정보 7
 
5.8%
임차보증금 2
 
1.7%
강성구 1
 
0.8%
변경된 1
 
0.8%
소유재산 1
 
0.8%
Other values (15) 15
12.4%
Distinct94
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2020-01-06 15:55:00
Maximum2020-01-08 13:33:00
2023-12-13T06:32:20.628281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:32:21.087304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

등록자사번
Real number (ℝ)

HIGH CORRELATION 

Distinct30
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3227.58
Minimum1339
Maximum53353
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-13T06:32:21.265001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1339
5-th percentile1476
Q11611.75
median1676
Q31799
95-th percentile1935.05
Maximum53353
Range52014
Interquartile range (IQR)187.25

Descriptive statistics

Standard deviation8860.8036
Coefficient of variation (CV)2.7453397
Kurtosis29.880231
Mean3227.58
Median Absolute Deviation (MAD)119
Skewness5.5922686
Sum322758
Variance78513840
MonotonicityNot monotonic
2023-12-13T06:32:21.402666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1616 15
15.0%
1476 11
 
11.0%
1676 9
 
9.0%
1637 8
 
8.0%
1795 6
 
6.0%
1592 5
 
5.0%
1799 4
 
4.0%
1867 3
 
3.0%
1907 3
 
3.0%
53353 3
 
3.0%
Other values (20) 33
33.0%
ValueCountFrequency (%)
1339 1
 
1.0%
1386 2
 
2.0%
1460 1
 
1.0%
1476 11
11.0%
1488 2
 
2.0%
1523 1
 
1.0%
1592 5
 
5.0%
1595 1
 
1.0%
1599 1
 
1.0%
1616 15
15.0%
ValueCountFrequency (%)
53353 3
3.0%
1936 2
2.0%
1935 2
2.0%
1907 3
3.0%
1905 1
 
1.0%
1891 3
3.0%
1877 2
2.0%
1867 3
3.0%
1853 2
2.0%
1800 1
 
1.0%

등록부점코드
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
ACS
23 
TAD
11 
TLB
11 
TQA
TOA
Other values (15)
37 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique6 ?
Unique (%)6.0%

Sample

1st rowTLA
2nd rowTLA
3rd rowQAD
4th rowTQA
5th rowTQA

Common Values

ValueCountFrequency (%)
ACS 23
23.0%
TAD 11
11.0%
TLB 11
11.0%
TQA 9
 
9.0%
TOA 9
 
9.0%
TPA 6
 
6.0%
THB 5
 
5.0%
TLA 4
 
4.0%
TBB 4
 
4.0%
TNA 3
 
3.0%
Other values (10) 15
15.0%

Length

2023-12-13T06:32:21.529852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
acs 23
23.0%
tad 11
11.0%
tlb 11
11.0%
tqa 9
 
9.0%
toa 9
 
9.0%
tpa 6
 
6.0%
thb 5
 
5.0%
tla 4
 
4.0%
tbb 4
 
4.0%
qad 3
 
3.0%
Other values (10) 15
15.0%

Interactions

2023-12-13T06:32:19.526345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:32:19.360794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:32:19.606523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:32:19.450128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:32:21.634472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
재산번호최초재산조사년도비고등록일시등록자사번등록부점코드
재산번호1.0000.0000.4041.0000.5860.732
최초재산조사년도0.0001.0000.2661.0000.0001.000
비고0.4040.2661.0000.000NaN0.875
등록일시1.0001.0000.0001.0001.0000.987
등록자사번0.5860.000NaN1.0001.0000.000
등록부점코드0.7321.0000.8750.9870.0001.000
2023-12-13T06:32:21.782189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비고최초재산조사년도등록부점코드
비고1.0000.2020.491
최초재산조사년도0.2021.0000.904
등록부점코드0.4910.9041.000
2023-12-13T06:32:21.883314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
재산번호등록자사번최초재산조사년도비고등록부점코드
재산번호1.000-0.1250.0000.1470.306
등록자사번-0.1251.0000.0001.0000.000
최초재산조사년도0.0000.0001.0000.2020.904
비고0.1471.0000.2021.0000.491
등록부점코드0.3060.0000.9040.4911.000

Missing values

2023-12-13T06:32:19.732219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:32:19.840185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

재산번호최초재산조사년도비고등록일시등록자사번등록부점코드
09218732019현주소지2020-01-08 13:331799TLA
19218722019보증목적물2020-01-08 13:321799TLA
29218712020<NA>2020-01-08 13:201794QAD
39218702020주민등록초본2020-01-08 13:201795TQA
49218692020보증목적물2020-01-08 13:191795TQA
59218682020<NA>2020-01-08 13:131637TAD
69218672020보증목적물 임차보증금2020-01-08 12:521800TPB
79218662019<NA>2020-01-08 12:381592THB
89218652020<NA>2020-01-08 12:211616ACS
99218642020<NA>2020-01-08 12:201616ACS
재산번호최초재산조사년도비고등록일시등록자사번등록부점코드
909217822020현주소지2020-01-06 17:271690TAD
919217812019보증목적물2020-01-06 17:021891TNA
929217802020임차보증금2020-01-06 16:471907TQA
939217792020무허가주택2020-01-06 16:451907TQA
949217782020보증목적물2020-01-06 16:401907TQA
959217772020보증목적물2020-01-06 16:191339THO
969217762020<NA>2020-01-06 16:131460TMA
979217752020현주소지2020-01-06 15:591867TPA
989217742020변경된 목적물2020-01-06 15:571867TPA
999217732020보증목적물2020-01-06 15:551867TPA