Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells61
Missing cells (%)12.2%
Duplicate rows2
Duplicate rows (%)2.0%
Total size in memory4.2 KiB
Average record size in memory43.3 B

Variable types

Numeric2
Text1
DateTime1
Categorical1

Dataset

Description한국주택금융공사 채권관리부 업무 관련 공개 데이터로 잡수품의번호, 등록일시, 등록자사번, 등록부점코드 등의 항목을 제공합니다.
Author한국주택금융공사
URLhttps://www.data.go.kr/data/15040245/fileData.do

Alerts

Dataset has 2 (2.0%) duplicate rowsDuplicates
접수품의번호 is highly overall correlated with 등록자사번High correlation
등록자사번 is highly overall correlated with 접수품의번호High correlation
비고 has 61 (61.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 09:48:46.448242
Analysis finished2023-12-12 09:48:47.471768
Duration1.02 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

접수품의번호
Real number (ℝ)

HIGH CORRELATION 

Distinct92
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0089008 × 1010
Minimum2.0050434 × 1010
Maximum2.0180406 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T18:48:47.568865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0050434 × 1010
5-th percentile2.0059903 × 1010
Q12.0060418 × 1010
median2.0080401 × 1010
Q32.0092902 × 1010
95-th percentile2.0170404 × 1010
Maximum2.0180406 × 1010
Range1.2997222 × 108
Interquartile range (IQR)32484016

Descriptive statistics

Standard deviation35643870
Coefficient of variation (CV)0.0017742972
Kurtosis0.89207229
Mean2.0089008 × 1010
Median Absolute Deviation (MAD)19982866
Skewness1.3817406
Sum2.0089008 × 1012
Variance1.2704855 × 1015
MonotonicityNot monotonic
2023-12-12T18:48:47.720277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20160403929 3
 
3.0%
20060415888 2
 
2.0%
20080405220 2
 
2.0%
20080405227 2
 
2.0%
20090402006 2
 
2.0%
20060404533 2
 
2.0%
20090402051 2
 
2.0%
20050449697 1
 
1.0%
20060409928 1
 
1.0%
20060402617 1
 
1.0%
Other values (82) 82
82.0%
ValueCountFrequency (%)
20050434162 1
1.0%
20050441392 1
1.0%
20050443932 1
1.0%
20050444094 1
1.0%
20050449697 1
1.0%
20060400311 1
1.0%
20060402617 1
1.0%
20060402881 1
1.0%
20060404533 2
2.0%
20060409435 1
1.0%
ValueCountFrequency (%)
20180406381 1
 
1.0%
20180405880 1
 
1.0%
20180403827 1
 
1.0%
20180402151 1
 
1.0%
20170407130 1
 
1.0%
20170403990 1
 
1.0%
20170403902 1
 
1.0%
20170403293 1
 
1.0%
20160403929 3
3.0%
20150407703 1
 
1.0%

비고
Text

MISSING 

Distinct32
Distinct (%)82.1%
Missing61
Missing (%)61.0%
Memory size932.0 B
2023-12-12T18:48:47.963236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length25
Mean length12.25641
Min length3

Characters and Unicode

Total characters478
Distinct characters113
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)66.7%

Sample

1st row선순위조세채권 다수 및 경합채권 1.379.132.895원
2nd row소유자 : 김종운
3rd row1번째 목적물
4th row2번째 목적물
5th row3번째 목적물
ValueCountFrequency (%)
목적물 6
 
6.1%
공동담보 5
 
5.1%
토지와 4
 
4.0%
본건 4
 
4.0%
배당종결 4
 
4.0%
경매종결 3
 
3.0%
감정가는 3
 
3.0%
3
 
3.0%
완료 2
 
2.0%
경매배당 2
 
2.0%
Other values (55) 63
63.6%
2023-12-12T18:48:48.665523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
60
 
12.6%
. 17
 
3.6%
12
 
2.5%
12
 
2.5%
11
 
2.3%
11
 
2.3%
11
 
2.3%
11
 
2.3%
10
 
2.1%
9
 
1.9%
Other values (103) 314
65.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 321
67.2%
Space Separator 60
 
12.6%
Decimal Number 59
 
12.3%
Other Punctuation 26
 
5.4%
Close Punctuation 5
 
1.0%
Open Punctuation 5
 
1.0%
Dash Punctuation 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
3.7%
12
 
3.7%
11
 
3.4%
11
 
3.4%
11
 
3.4%
11
 
3.4%
10
 
3.1%
9
 
2.8%
8
 
2.5%
8
 
2.5%
Other values (85) 218
67.9%
Decimal Number
ValueCountFrequency (%)
0 9
15.3%
2 7
11.9%
1 6
10.2%
5 6
10.2%
9 6
10.2%
7 6
10.2%
3 6
10.2%
6 5
8.5%
4 5
8.5%
8 3
 
5.1%
Other Punctuation
ValueCountFrequency (%)
. 17
65.4%
/ 4
 
15.4%
: 3
 
11.5%
' 2
 
7.7%
Space Separator
ValueCountFrequency (%)
60
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 321
67.2%
Common 157
32.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
3.7%
12
 
3.7%
11
 
3.4%
11
 
3.4%
11
 
3.4%
11
 
3.4%
10
 
3.1%
9
 
2.8%
8
 
2.5%
8
 
2.5%
Other values (85) 218
67.9%
Common
ValueCountFrequency (%)
60
38.2%
. 17
 
10.8%
0 9
 
5.7%
2 7
 
4.5%
1 6
 
3.8%
5 6
 
3.8%
9 6
 
3.8%
7 6
 
3.8%
3 6
 
3.8%
) 5
 
3.2%
Other values (8) 29
18.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 321
67.2%
ASCII 157
32.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
60
38.2%
. 17
 
10.8%
0 9
 
5.7%
2 7
 
4.5%
1 6
 
3.8%
5 6
 
3.8%
9 6
 
3.8%
7 6
 
3.8%
3 6
 
3.8%
) 5
 
3.2%
Other values (8) 29
18.5%
Hangul
ValueCountFrequency (%)
12
 
3.7%
12
 
3.7%
11
 
3.4%
11
 
3.4%
11
 
3.4%
11
 
3.4%
10
 
3.1%
9
 
2.8%
8
 
2.5%
8
 
2.5%
Other values (85) 218
67.9%
Distinct88
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2007-02-01 11:27:00
Maximum2018-11-12 16:47:00
2023-12-12T18:48:48.798722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:48:48.961002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

등록자사번
Real number (ℝ)

HIGH CORRELATION 

Distinct48
Distinct (%)48.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1257.92
Minimum1086
Maximum1867
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T18:48:49.109332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1086
5-th percentile1115.9
Q11153.5
median1187
Q31330
95-th percentile1630
Maximum1867
Range781
Interquartile range (IQR)176.5

Descriptive statistics

Standard deviation161.92371
Coefficient of variation (CV)0.12872337
Kurtosis2.373321
Mean1257.92
Median Absolute Deviation (MAD)66
Skewness1.6000468
Sum125792
Variance26219.286
MonotonicityNot monotonic
2023-12-12T18:48:49.282754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
1157 11
 
11.0%
1185 8
 
8.0%
1214 5
 
5.0%
1131 5
 
5.0%
1187 5
 
5.0%
1121 5
 
5.0%
1327 4
 
4.0%
1280 4
 
4.0%
1630 3
 
3.0%
1118 3
 
3.0%
Other values (38) 47
47.0%
ValueCountFrequency (%)
1086 1
 
1.0%
1088 1
 
1.0%
1091 2
 
2.0%
1095 1
 
1.0%
1117 1
 
1.0%
1118 3
3.0%
1121 5
5.0%
1124 1
 
1.0%
1127 1
 
1.0%
1131 5
5.0%
ValueCountFrequency (%)
1867 1
 
1.0%
1722 1
 
1.0%
1706 1
 
1.0%
1688 1
 
1.0%
1630 3
3.0%
1592 1
 
1.0%
1491 1
 
1.0%
1455 1
 
1.0%
1438 1
 
1.0%
1422 1
 
1.0%
Distinct16
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울남부지사
21 
대구지사
12 
대전지사
12 
서울중부지사
10 
광주지사
Other values (11)
36 

Length

Max length6
Median length4
Mean length4.9
Min length4

Unique

Unique4 ?
Unique (%)4.0%

Sample

1st row울산지사
2nd row대구지사
3rd row대구지사
4th row충남지사
5th row인천지사

Common Values

ValueCountFrequency (%)
서울남부지사 21
21.0%
대구지사 12
12.0%
대전지사 12
12.0%
서울중부지사 10
10.0%
광주지사 9
9.0%
경기남부지사 8
 
8.0%
제주지사 8
 
8.0%
인천지사 6
 
6.0%
강원서부지사 3
 
3.0%
충북지사 3
 
3.0%
Other values (6) 8
 
8.0%

Length

2023-12-12T18:48:49.472179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울남부지사 21
21.0%
대구지사 12
12.0%
대전지사 12
12.0%
서울중부지사 10
10.0%
광주지사 9
9.0%
경기남부지사 8
 
8.0%
제주지사 8
 
8.0%
인천지사 6
 
6.0%
강원서부지사 3
 
3.0%
충북지사 3
 
3.0%
Other values (6) 8
 
8.0%

Interactions

2023-12-12T18:48:47.053105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:48:46.702601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:48:47.179098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:48:46.810745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:48:49.576301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
접수품의번호비고등록일시등록자사번등록부점코드
접수품의번호1.0000.9591.0000.8100.712
비고0.9591.0000.9680.9740.978
등록일시1.0000.9681.0001.0001.000
등록자사번0.8100.9741.0001.0000.794
등록부점코드0.7120.9781.0000.7941.000
2023-12-12T18:48:49.683264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
접수품의번호등록자사번등록부점코드
접수품의번호1.0000.5620.363
등록자사번0.5621.0000.450
등록부점코드0.3630.4501.000

Missing values

2023-12-12T18:48:47.326213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:48:47.425916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

접수품의번호비고등록일시등록자사번등록부점코드
020180406381<NA>2018-11-12 16:471722울산지사
120180405880<NA>2018-10-15 16:301867대구지사
220180403827<NA>2018-07-24 14:211187대구지사
320180402151<NA>2018-04-25 16:581688충남지사
420170407130<NA>2018-01-19 10:101188인천지사
520170403990선순위조세채권 다수 및 경합채권 1.379.132.895원2017-08-11 10:231455서울남부지사
620170403902소유자 : 김종운2017-07-14 11:391592서울남부지사
720170403293<NA>2017-06-07 21:441706인천지사
8201604039291번째 목적물2016-07-07 13:351630경기남부지사
9201604039292번째 목적물2016-07-07 13:351630경기남부지사
접수품의번호비고등록일시등록자사번등록부점코드
902006040453305.4.15. 배당종결2007-05-25 10:221141충북지사
9120060404533임대아파트/ 기심사서 습용2007-05-25 10:221141충북지사
9220060420385<NA>2007-05-14 15:201118대전지사
9320050444094목적물2007-05-11 18:451127인천지사
942006041160306.2.10. 배당종결2007-05-03 10:301141충북지사
9520070404148가계일반자금대출과 공동담보2007-04-30 17:221091대구지사
9620060409929<NA>2007-04-13 13:481131대전지사
9720060409930<NA>2007-04-13 13:481131대전지사
9820060423115임대차 존재2007-02-05 17:421118대전지사
9920070401964<NA>2007-02-01 11:271185제주지사

Duplicate rows

Most frequently occurring

접수품의번호비고등록일시등록자사번등록부점코드# duplicates
020060415888<NA>2008-07-14 16:271399광주지사2
120090402051담보처분(최종배당일 : '09.5.6)2010-06-08 20:431296강원서부지사2