Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 1000 |
Missing cells | 889 |
Missing cells (%) | 11.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 66.5 KiB |
Average record size in memory | 68.1 B |
Variable types
Text | 1 |
---|---|
Numeric | 4 |
Categorical | 2 |
DateTime | 1 |
Dataset
Description | 한국주택금융공사 채권관리부 시효내역 업무 관련 공개 공공데이터 (해당 부서의 업무와 관련된 데이터베이스에서 공개 가능한 원천 데이터) 입니다. |
---|---|
Author | 한국주택금융공사 |
URL | https://www.data.go.kr/data/15072960/fileData.do |
채무관계자고객번호 is highly overall correlated with 주채무자고객번호 | High correlation |
주채무자고객번호 is highly overall correlated with 채무관계자고객번호 | High correlation |
변경자사번 is highly overall correlated with 변경부점코드 | High correlation |
등록자사번 is highly overall correlated with 등록부점코드 | High correlation |
변경부점코드 is highly overall correlated with 변경자사번 and 1 other fields | High correlation |
등록부점코드 is highly overall correlated with 등록자사번 and 1 other fields | High correlation |
변경부점코드 is highly imbalanced (75.0%) | Imbalance |
변경자사번 has 838 (83.8%) missing values | Missing |
등록자사번 has 51 (5.1%) missing values | Missing |
Reproduction
Analysis started | 2023-12-12 18:46:02.048880 |
---|---|
Analysis finished | 2023-12-12 18:46:06.114879 |
Duration | 4.07 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
보증번호
Text
Distinct | 822 |
---|---|
Distinct (%) | 82.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
Length
Max length | 13 |
---|---|
Median length | 13 |
Mean length | 13 |
Min length | 13 |
Characters and Unicode
Total characters | 13000 |
---|---|
Distinct characters | 27 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 717 ? |
---|---|
Unique (%) | 71.7% |
Sample
1st row | TOA2000000763 |
---|---|
2nd row | TOA2010011407 |
3rd row | THA2010055042 |
4th row | QAC2001080430 |
5th row | QAC2001108266 |
Value | Count | Frequency (%) |
tab2012032155 | 12 | 1.2% |
thb2012041934 | 9 | 0.9% |
tho2007003647 | 9 | 0.9% |
toa2011004107 | 9 | 0.9% |
taa2012075024 | 8 | 0.8% |
tla2012012148 | 7 | 0.7% |
tma2014007207 | 5 | 0.5% |
thb2013032610 | 5 | 0.5% |
qad2014014490 | 5 | 0.5% |
qad2006059244 | 5 | 0.5% |
Other values (812) | 926 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 2896 | |
2 | 1681 | |
1 | 1525 | |
A | 980 | 7.5% |
T | 822 | 6.3% |
4 | 638 | 4.9% |
3 | 633 | 4.9% |
7 | 558 | 4.3% |
6 | 546 | 4.2% |
8 | 543 | 4.2% |
Other values (17) | 2178 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 10000 | |
Uppercase Letter | 3000 | 23.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 980 | |
T | 822 | |
Q | 220 | 7.3% |
H | 205 | 6.8% |
B | 183 | 6.1% |
D | 176 | 5.9% |
O | 138 | 4.6% |
C | 90 | 3.0% |
P | 65 | 2.2% |
N | 32 | 1.1% |
Other values (7) | 89 | 3.0% |
Decimal Number
Value | Count | Frequency (%) |
0 | 2896 | |
2 | 1681 | |
1 | 1525 | |
4 | 638 | 6.4% |
3 | 633 | 6.3% |
7 | 558 | 5.6% |
6 | 546 | 5.5% |
8 | 543 | 5.4% |
9 | 490 | 4.9% |
5 | 490 | 4.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 10000 | |
Latin | 3000 | 23.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 980 | |
T | 822 | |
Q | 220 | 7.3% |
H | 205 | 6.8% |
B | 183 | 6.1% |
D | 176 | 5.9% |
O | 138 | 4.6% |
C | 90 | 3.0% |
P | 65 | 2.2% |
N | 32 | 1.1% |
Other values (7) | 89 | 3.0% |
Common
Value | Count | Frequency (%) |
0 | 2896 | |
2 | 1681 | |
1 | 1525 | |
4 | 638 | 6.4% |
3 | 633 | 6.3% |
7 | 558 | 5.6% |
6 | 546 | 5.5% |
8 | 543 | 5.4% |
9 | 490 | 4.9% |
5 | 490 | 4.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 13000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 2896 | |
2 | 1681 | |
1 | 1525 | |
A | 980 | 7.5% |
T | 822 | 6.3% |
4 | 638 | 4.9% |
3 | 633 | 4.9% |
7 | 558 | 4.3% |
6 | 546 | 4.2% |
8 | 543 | 4.2% |
Other values (17) | 2178 |
채무관계자고객번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 901 |
---|---|
Distinct (%) | 90.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 85690709 |
Minimum | 7010718 |
---|---|
Maximum | 1.4530602 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 7010718 |
---|---|
5-th percentile | 21854280 |
Q1 | 62378803 |
median | 89057387 |
Q3 | 1.1325799 × 108 |
95-th percentile | 1.4501209 × 108 |
Maximum | 1.4530602 × 108 |
Range | 1.3829531 × 108 |
Interquartile range (IQR) | 50879184 |
Descriptive statistics
Standard deviation | 36331009 |
---|---|
Coefficient of variation (CV) | 0.42397839 |
Kurtosis | -0.76694522 |
Mean | 85690709 |
Median Absolute Deviation (MAD) | 25157986 |
Skewness | -0.33064852 |
Sum | 8.5690709 × 1010 |
Variance | 1.3199422 × 1015 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
18880092 | 4 | 0.4% |
76668569 | 3 | 0.3% |
69153285 | 3 | 0.3% |
23783870 | 3 | 0.3% |
64109425 | 2 | 0.2% |
108367194 | 2 | 0.2% |
69584508 | 2 | 0.2% |
92799522 | 2 | 0.2% |
15380605 | 2 | 0.2% |
97549544 | 2 | 0.2% |
Other values (891) | 975 |
Value | Count | Frequency (%) |
7010718 | 1 | |
7900417 | 1 | |
7901513 | 2 | |
9005970 | 1 | |
9370728 | 1 | |
9515965 | 1 | |
10199464 | 1 | |
11006255 | 1 | |
11338936 | 1 | |
11706458 | 1 |
Value | Count | Frequency (%) |
145306024 | 1 | |
145279533 | 2 | |
145279371 | 2 | |
145279216 | 2 | |
145278301 | 2 | |
145278259 | 2 | |
145271023 | 2 | |
145191352 | 1 | |
145172083 | 1 | |
145172070 | 1 |
주채무자고객번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 798 |
---|---|
Distinct (%) | 79.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 78303049 |
Minimum | 7900417 |
---|---|
Maximum | 1.3810488 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 7900417 |
---|---|
5-th percentile | 22712866 |
Q1 | 56645238 |
median | 84097795 |
Q3 | 1.0166293 × 108 |
95-th percentile | 1.2229328 × 108 |
Maximum | 1.3810488 × 108 |
Range | 1.3020446 × 108 |
Interquartile range (IQR) | 45017688 |
Descriptive statistics
Standard deviation | 31872776 |
---|---|
Coefficient of variation (CV) | 0.40704387 |
Kurtosis | -0.89355179 |
Mean | 78303049 |
Median Absolute Deviation (MAD) | 23879610 |
Skewness | -0.39820605 |
Sum | 7.8303049 × 1010 |
Variance | 1.0158739 × 1015 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
87194051 | 12 | 1.2% |
45612356 | 9 | 0.9% |
62949865 | 9 | 0.9% |
81984665 | 9 | 0.9% |
57348248 | 8 | 0.8% |
89860464 | 7 | 0.7% |
77924969 | 5 | 0.5% |
93819113 | 5 | 0.5% |
96504917 | 5 | 0.5% |
61500399 | 5 | 0.5% |
Other values (788) | 926 |
Value | Count | Frequency (%) |
7900417 | 1 | |
7901513 | 2 | |
9005970 | 1 | |
9515965 | 1 | |
10199464 | 1 | |
11006255 | 1 | |
11338936 | 1 | |
13136631 | 1 | |
14929656 | 1 | |
14939361 | 1 |
Value | Count | Frequency (%) |
138104880 | 1 | |
131765404 | 1 | |
130954009 | 1 | |
130715235 | 1 | |
129984093 | 1 | |
129722161 | 1 | |
129424993 | 1 | |
129304796 | 1 | |
128782319 | 1 | |
128660460 | 1 |
변경자사번
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 54 |
---|---|
Distinct (%) | 33.3% |
Missing | 838 |
Missing (%) | 83.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1523.8889 |
Minimum | 1007 |
---|---|
Maximum | 8889 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 1007 |
---|---|
5-th percentile | 1122.05 |
Q1 | 1221 |
median | 1503.5 |
Q3 | 1711 |
95-th percentile | 1921 |
Maximum | 8889 |
Range | 7882 |
Interquartile range (IQR) | 490 |
Descriptive statistics
Standard deviation | 646.81839 |
---|---|
Coefficient of variation (CV) | 0.42445246 |
Kurtosis | 105.38463 |
Mean | 1523.8889 |
Median Absolute Deviation (MAD) | 280.5 |
Skewness | 9.2703925 |
Sum | 246870 |
Variance | 418374.02 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1253 | 26 | 2.6% |
1221 | 17 | 1.7% |
1520 | 13 | 1.3% |
1890 | 13 | 1.3% |
1603 | 9 | 0.9% |
1842 | 5 | 0.5% |
1201 | 4 | 0.4% |
1921 | 4 | 0.4% |
1339 | 3 | 0.3% |
1166 | 3 | 0.3% |
Other values (44) | 65 | 6.5% |
(Missing) | 838 |
Value | Count | Frequency (%) |
1007 | 1 | |
1032 | 2 | |
1037 | 1 | |
1086 | 1 | |
1103 | 1 | |
1108 | 1 | |
1121 | 2 | |
1142 | 2 | |
1149 | 1 | |
1157 | 1 |
Value | Count | Frequency (%) |
8889 | 1 | 0.1% |
1978 | 3 | 0.3% |
1958 | 2 | 0.2% |
1935 | 1 | 0.1% |
1934 | 1 | 0.1% |
1921 | 4 | 0.4% |
1890 | 13 | |
1883 | 1 | 0.1% |
1872 | 2 | 0.2% |
1869 | 1 | 0.1% |
변경부점코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 24 |
---|---|
Distinct (%) | 2.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
<NA> | |
---|---|
ACS | 72 |
TOA | 15 |
QAD | 14 |
TAC | 8 |
Other values (19) | 53 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.838 |
Min length | 3 |
Unique
Unique | 9 ? |
---|---|
Unique (%) | 0.9% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | ACS |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 838 | |
ACS | 72 | 7.2% |
TOA | 15 | 1.5% |
QAD | 14 | 1.4% |
TAC | 8 | 0.8% |
TBA | 8 | 0.8% |
THA | 7 | 0.7% |
TAB | 5 | 0.5% |
TMA | 5 | 0.5% |
TAA | 5 | 0.5% |
Other values (14) | 23 | 2.3% |
Length
Value | Count | Frequency (%) |
na | 838 | |
acs | 72 | 7.2% |
toa | 15 | 1.5% |
qad | 14 | 1.4% |
tac | 8 | 0.8% |
tba | 8 | 0.8% |
tha | 7 | 0.7% |
tab | 5 | 0.5% |
tma | 5 | 0.5% |
taa | 5 | 0.5% |
Other values (14) | 23 | 2.3% |
등록일시
Date
Distinct | 622 |
---|---|
Distinct (%) | 62.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
Minimum | 2010-07-21 13:47:00 |
---|---|
Maximum | 2020-10-27 18:06:00 |
등록자사번
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 145 |
---|---|
Distinct (%) | 15.3% |
Missing | 51 |
Missing (%) | 5.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2030.5553 |
Minimum | 1088 |
---|---|
Maximum | 52042 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 1088 |
---|---|
5-th percentile | 1253 |
Q1 | 1544 |
median | 1648 |
Q3 | 1867 |
95-th percentile | 1973 |
Maximum | 52042 |
Range | 50954 |
Interquartile range (IQR) | 323 |
Descriptive statistics
Standard deviation | 3704.0655 |
---|---|
Coefficient of variation (CV) | 1.8241638 |
Kurtosis | 164.89458 |
Mean | 2030.5553 |
Median Absolute Deviation (MAD) | 150 |
Skewness | 12.614862 |
Sum | 1926997 |
Variance | 13720102 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1690 | 50 | 5.0% |
1520 | 50 | 5.0% |
1605 | 48 | 4.8% |
1603 | 47 | 4.7% |
1872 | 41 | 4.1% |
1890 | 41 | 4.1% |
1253 | 40 | 4.0% |
1842 | 31 | 3.1% |
1867 | 31 | 3.1% |
1590 | 28 | 2.8% |
Other values (135) | 542 | |
(Missing) | 51 | 5.1% |
Value | Count | Frequency (%) |
1088 | 1 | |
1121 | 1 | |
1127 | 1 | |
1144 | 1 | |
1148 | 1 | |
1156 | 1 | |
1158 | 2 | |
1163 | 1 | |
1170 | 2 | |
1184 | 1 |
Value | Count | Frequency (%) |
52042 | 1 | 0.1% |
51646 | 1 | 0.1% |
51641 | 1 | 0.1% |
51010 | 1 | 0.1% |
50711 | 1 | 0.1% |
8889 | 13 | |
7403 | 1 | 0.1% |
5003 | 1 | 0.1% |
2002 | 1 | 0.1% |
1995 | 1 | 0.1% |
등록부점코드
Categorical
HIGH CORRELATION
 
Distinct | 28 |
---|---|
Distinct (%) | 2.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
ACS | |
---|---|
QAD | |
TAC | |
TAA | |
TOA | |
Other values (23) |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | TOA |
---|---|
2nd row | TOA |
3rd row | TOA |
4th row | ACS |
5th row | ACS |
Common Values
Value | Count | Frequency (%) |
ACS | 170 | |
QAD | 122 | |
TAC | 81 | 8.1% |
TAA | 78 | 7.8% |
TOA | 65 | 6.5% |
THA | 52 | 5.2% |
TAD | 51 | 5.1% |
TAB | 45 | 4.5% |
TPA | 44 | 4.4% |
TBA | 41 | 4.1% |
Other values (18) | 251 |
Length
Value | Count | Frequency (%) |
acs | 170 | |
qad | 122 | |
tac | 81 | 8.1% |
taa | 78 | 7.8% |
toa | 65 | 6.5% |
tha | 52 | 5.2% |
tad | 51 | 5.1% |
tab | 45 | 4.5% |
tpa | 44 | 4.4% |
tba | 41 | 4.1% |
Other values (18) | 251 |
채무관계자고객번호 | 주채무자고객번호 | 변경자사번 | 변경부점코드 | 등록자사번 | 등록부점코드 | |
---|---|---|---|---|---|---|
채무관계자고객번호 | 1.000 | 0.972 | 0.000 | 0.601 | 0.115 | 0.488 |
주채무자고객번호 | 0.972 | 1.000 | 0.000 | 0.532 | 0.112 | 0.470 |
변경자사번 | 0.000 | 0.000 | 1.000 | 0.907 | 0.000 | 0.551 |
변경부점코드 | 0.601 | 0.532 | 0.907 | 1.000 | 0.000 | 0.980 |
등록자사번 | 0.115 | 0.112 | 0.000 | 0.000 | 1.000 | 0.855 |
등록부점코드 | 0.488 | 0.470 | 0.551 | 0.980 | 0.855 | 1.000 |
변경부점코드 | 등록부점코드 | |
---|---|---|
변경부점코드 | 1.000 | 0.797 |
등록부점코드 | 0.797 | 1.000 |
채무관계자고객번호 | 주채무자고객번호 | 변경자사번 | 등록자사번 | 변경부점코드 | 등록부점코드 | |
---|---|---|---|---|---|---|
채무관계자고객번호 | 1.000 | 0.748 | 0.004 | 0.054 | 0.256 | 0.195 |
주채무자고객번호 | 0.748 | 1.000 | 0.042 | 0.175 | 0.224 | 0.186 |
변경자사번 | 0.004 | 0.042 | 1.000 | 0.139 | 0.715 | 0.307 |
등록자사번 | 0.054 | 0.175 | 0.139 | 1.000 | 0.000 | 0.667 |
변경부점코드 | 0.256 | 0.224 | 0.715 | 0.000 | 1.000 | 0.797 |
등록부점코드 | 0.195 | 0.186 | 0.307 | 0.667 | 0.797 | 1.000 |
보증번호 | 채무관계자고객번호 | 주채무자고객번호 | 변경자사번 | 변경부점코드 | 등록일시 | 등록자사번 | 등록부점코드 | |
---|---|---|---|---|---|---|---|---|
0 | TOA2000000763 | 21694709 | 21694709 | <NA> | <NA> | 2020-10-27 18:06 | 1520 | TOA |
1 | TOA2010011407 | 79181700 | 79181700 | <NA> | <NA> | 2020-10-27 18:06 | 1520 | TOA |
2 | THA2010055042 | 79974632 | 79974632 | <NA> | <NA> | 2020-10-27 18:06 | 1520 | TOA |
3 | QAC2001080430 | 32282058 | 32282003 | 1424 | ACS | 2012-10-17 12:36 | 51010 | ACS |
4 | QAC2001108266 | 33354215 | 33354215 | <NA> | <NA> | 2020-10-27 17:42 | 1935 | ACS |
5 | TQH2000000326 | 22686855 | 22686855 | <NA> | <NA> | 2020-10-27 17:42 | 1935 | ACS |
6 | TOA2013021265 | 95805251 | 95805251 | <NA> | <NA> | 2020-10-27 16:55 | 1520 | TOA |
7 | TOA2013013420 | 94256733 | 94256733 | <NA> | <NA> | 2020-10-27 16:55 | 1520 | TOA |
8 | TOA2009001097 | 73056556 | 73056200 | <NA> | <NA> | 2020-10-27 16:53 | 1520 | TOA |
9 | TOA2009001097 | 73056200 | 73056200 | <NA> | <NA> | 2020-10-27 16:53 | 1520 | TOA |
보증번호 | 채무관계자고객번호 | 주채무자고객번호 | 변경자사번 | 변경부점코드 | 등록일시 | 등록자사번 | 등록부점코드 | |
---|---|---|---|---|---|---|---|---|
990 | THA2011064740 | 84737372 | 84737372 | 1842 | THA | 2015-11-10 11:42 | 1402 | THA |
991 | QAD2010046192 | 37239789 | 37239789 | <NA> | <NA> | 2020-10-07 16:14 | 1872 | TAC |
992 | QAD2010046192 | 37239789 | 37239789 | 1872 | TAC | 2015-10-27 13:26 | 1428 | TAC |
993 | TNA2015018437 | 109004359 | 109004359 | <NA> | <NA> | 2020-10-07 15:57 | 1915 | TNA |
994 | TNA2015005266 | 95465565 | 95465565 | <NA> | <NA> | 2020-10-07 15:57 | 1915 | TNA |
995 | TNA2018009705 | 122477248 | 122477248 | <NA> | <NA> | 2020-10-07 15:57 | 1915 | TNA |
996 | TNA2015011935 | 97966431 | 97966431 | <NA> | <NA> | 2020-10-07 15:57 | 1915 | TNA |
997 | TNA2014014896 | 99214741 | 99214741 | <NA> | <NA> | 2020-10-07 15:57 | 1915 | TNA |
998 | TNA2018017254 | 124730938 | 124730938 | <NA> | <NA> | 2020-10-07 15:57 | 1915 | TNA |
999 | TNA2011009119 | 83993573 | 83993573 | <NA> | <NA> | 2020-10-07 15:57 | 1915 | TNA |