Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 1000 |
Missing cells | 133 |
Missing cells (%) | 1.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 57.7 KiB |
Average record size in memory | 59.1 B |
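The percentage and per-record figures above follow from the raw counts. The sketch below is a plain arithmetic check, not part of the profiling tool; the per-column missing counts are taken from the "Missing" alerts in this report, and the variable names are illustrative only.

```python
# Figures copied from this report; the variable names are hypothetical.
n_variables = 7
n_observations = 1000
missing_per_column = {"본부수관여부": 42, "변경사번": 36, "등록사번": 55}
total_size_kib = 57.7

total_cells = n_variables * n_observations          # 7000 cells in total
missing_cells = sum(missing_per_column.values())    # should match "Missing cells"
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(missing_cells)
print(f"{missing_pct:.1f}%")
print(f"{avg_record_bytes:.1f} B")
```

The missing-cell percentage is relative to all 7 × 1000 cells, which is why it is lower than any single column's missing rate.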
Variable types
Text | 1 |
---|---|
Boolean | 1 |
Numeric | 3 |
Categorical | 2 |
Dataset
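Description | Public data related to the work of the Claims Management Department (채권관리부) of the Korea Housing Finance Corporation (한국주택금융공사): publicly releasable source data from the department's operational database. The dataset contains the columns 보증번호 (guarantee number), 본부수관여부 (whether the case was taken over by headquarters), 주채무자고객번호 (primary debtor customer number), 변경사번 (employee number of the last modifier), 변경부점코드 (branch code of the last modification), 등록사번 (employee number of the registrant), and 등록부점코드 (branch code of registration). |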
---|---|
Author | Korea Housing Finance Corporation (한국주택금융공사) |
URL | https://www.data.go.kr/data/15072970/fileData.do |
등록부점코드 is highly overall correlated with 등록사번 and 1 other fields | High correlation |
변경부점코드 is highly overall correlated with 등록사번 and 1 other fields | High correlation |
주채무자고객번호 is highly overall correlated with 본부수관여부 | High correlation |
변경사번 is highly overall correlated with 등록사번 | High correlation |
등록사번 is highly overall correlated with 변경사번 and 2 other fields | High correlation |
본부수관여부 is highly overall correlated with 주채무자고객번호 | High correlation |
본부수관여부 is highly imbalanced (89.0%) | Imbalance |
본부수관여부 has 42 (4.2%) missing values | Missing |
변경사번 has 36 (3.6%) missing values | Missing |
등록사번 has 55 (5.5%) missing values | Missing |
보증번호 has unique values | Unique |
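The 89.0% imbalance figure for 본부수관여부 can be reproduced from its value counts (944 False, 14 True, missing values excluded). The sketch below assumes the score is defined as one minus the normalized Shannon entropy of the class shares; that definition reproduces the reported number, but the report itself does not state the formula, and `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Non-missing value counts of 본부수관여부 from this report: False, True
score = imbalance_score([944, 14])
print(f"{score:.1%}")
```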
Reproduction
Analysis started | 2023-12-12 23:37:04.808224 |
---|---|
Analysis finished | 2023-12-12 23:37:06.556225 |
Duration | 1.75 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
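The reported duration is the difference between the two timestamps above. A minimal check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2023-12-12 23:37:04.808224")
finished = datetime.fromisoformat("2023-12-12 23:37:06.556225")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```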
보증번호
Text
UNIQUE
 
Distinct | 1000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
Length
Max length | 13 |
---|---|
Median length | 13 |
Mean length | 13 |
Min length | 13 |
Characters and Unicode
Total characters | 13000 |
---|---|
Distinct characters | 25 |
Distinct categories | 2 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 1000 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | TQH2000000140 |
---|---|
2nd row | TAA2014130650 |
3rd row | TAC2018087561 |
4th row | TAC2014068194 |
5th row | TAC2014055963 |
Value | Count | Frequency (%) |
tqh2000000140 | 1 | 0.1% |
tla2017007324 | 1 | 0.1% |
taa2011145898 | 1 | 0.1% |
qad2003111447 | 1 | 0.1% |
tpa2016004380 | 1 | 0.1% |
tpa2016003042 | 1 | 0.1% |
taa2015098181 | 1 | 0.1% |
taa2017054958 | 1 | 0.1% |
tac2012017816 | 1 | 0.1% |
tha2011074772 | 1 | 0.1% |
Other values (990) | 990 | 99.0% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 2747 | 21.1% |
2 | 1683 | 12.9% |
1 | 1563 | 12.0% |
A | 994 | 7.6% |
T | 850 | 6.5% |
8 | 643 | 4.9% |
6 | 632 | 4.9% |
3 | 597 | 4.6% |
4 | 591 | 4.5% |
7 | 552 | 4.2% |
Other values (15) | 2148 | 16.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 10000 | 76.9% |
Uppercase Letter | 3000 | 23.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 994 | 33.1% |
T | 850 | 28.3% |
Q | 213 | 7.1% |
B | 200 | 6.7% |
D | 195 | 6.5% |
H | 186 | 6.2% |
O | 109 | 3.6% |
P | 83 | 2.8% |
C | 65 | 2.2% |
L | 34 | 1.1% |
Other values (5) | 71 | 2.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 2747 | 27.5% |
2 | 1683 | 16.8% |
1 | 1563 | 15.6% |
8 | 643 | 6.4% |
6 | 632 | 6.3% |
3 | 597 | 6.0% |
4 | 591 | 5.9% |
7 | 552 | 5.5% |
5 | 515 | 5.1% |
9 | 477 | 4.8% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 10000 | 76.9% |
Latin | 3000 | 23.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 994 | 33.1% |
T | 850 | 28.3% |
Q | 213 | 7.1% |
B | 200 | 6.7% |
D | 195 | 6.5% |
H | 186 | 6.2% |
O | 109 | 3.6% |
P | 83 | 2.8% |
C | 65 | 2.2% |
L | 34 | 1.1% |
Other values (5) | 71 | 2.4% |
Common
Value | Count | Frequency (%) |
0 | 2747 | 27.5% |
2 | 1683 | 16.8% |
1 | 1563 | 15.6% |
8 | 643 | 6.4% |
6 | 632 | 6.3% |
3 | 597 | 6.0% |
4 | 591 | 5.9% |
7 | 552 | 5.5% |
5 | 515 | 5.1% |
9 | 477 | 4.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 13000 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 2747 | 21.1% |
2 | 1683 | 12.9% |
1 | 1563 | 12.0% |
A | 994 | 7.6% |
T | 850 | 6.5% |
8 | 643 | 4.9% |
6 | 632 | 4.9% |
3 | 597 | 4.6% |
4 | 591 | 4.5% |
7 | 552 | 4.2% |
Other values (15) | 2148 | 16.5% |
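The character statistics imply a fixed layout for 보증번호: 13 characters, of which 3 are uppercase Latin letters and 10 are decimal digits (3000 and 10000 characters over 1000 rows). The sketch below checks that layout on the five sample values listed above, using Unicode general categories from the standard library. It only covers those five rows, so it illustrates the pattern rather than validating the full column.

```python
import unicodedata
from collections import Counter

# The five sample values shown in the "Sample" table of this report.
samples = [
    "TQH2000000140",
    "TAA2014130650",
    "TAC2018087561",
    "TAC2014068194",
    "TAC2014055963",
]

# "Lu" = Uppercase Letter, "Nd" = Decimal Number
category_counts = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)
lengths = {len(value) for value in samples}

print(dict(category_counts))
print(lengths)
```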
본부수관여부
Boolean
HIGH CORRELATION
  IMBALANCE
  MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 0.2% |
Missing | 42 |
Missing (%) | 4.2% |
Memory size | 2.1 KiB |
False | 944 |
---|---|
True | 14 |
(Missing) | 42 |
Value | Count | Frequency (%) |
False | 944 | 94.4% |
True | 14 | 1.4% |
(Missing) | 42 | 4.2% |
주채무자고객번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 974 |
---|---|
Distinct (%) | 97.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 88416508 |
Minimum | 2510268 |
---|---|
Maximum | 1.3988455 × 10^8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 2510268 |
---|---|
5-th percentile | 31414687 |
Q1 | 71068820 |
median | 95491068 |
Q3 | 1.1451872 × 10^8 |
95-th percentile | 1.2480046 × 10^8 |
Maximum | 1.3988455 × 10^8 |
Range | 1.3737428 × 10^8 |
Interquartile range (IQR) | 43449902 |
Descriptive statistics
Standard deviation | 30845874 |
---|---|
Coefficient of variation (CV) | 0.34887007 |
Kurtosis | -0.51301306 |
Mean | 88416508 |
Median Absolute Deviation (MAD) | 20727141 |
Skewness | -0.70103416 |
Sum | 8.8416508 × 10^10 |
Variance | 9.5146792 × 10^14 |
Monotonicity | Not monotonic |
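Several of the descriptive statistics for 주채무자고객번호 are derived from others in the same tables. The sketch below recomputes them from the reported mean, standard deviation, minimum and the largest value in the extreme-values table. Because the inputs are already rounded in the report, small differences in the last digits are expected.

```python
# Inputs copied from this report (rounded as displayed there).
mean = 88416508
std = 30845874
minimum = 2510268
maximum = 139884549   # largest entry in the maximum-values table
n = 1000              # no missing values in this column

cv = std / mean                  # coefficient of variation
value_range = maximum - minimum  # range
total = mean * n                 # sum
variance = std ** 2              # variance

print(f"CV       = {cv:.6f}")
print(f"Range    = {value_range:.7e}")
print(f"Sum      = {total:.7e}")
print(f"Variance = {variance:.7e}")
```

Since this column holds customer identifiers rather than measurements, these statistics mainly describe how the identifiers are spread, not any quantity with physical meaning.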
Value | Count | Frequency (%) |
42841285 | 3 | 0.3% |
97772337 | 2 | 0.2% |
110741025 | 2 | 0.2% |
75201697 | 2 | 0.2% |
85816289 | 2 | 0.2% |
123320734 | 2 | 0.2% |
97692462 | 2 | 0.2% |
108367194 | 2 | 0.2% |
95441563 | 2 | 0.2% |
78272555 | 2 | 0.2% |
Other values (964) | 979 | 97.9% |
Value | Count | Frequency (%) |
2510268 | 1 | 0.1% |
6246347 | 1 | 0.1% |
7850583 | 1 | 0.1% |
9368743 | 1 | 0.1% |
11338936 | 1 | 0.1% |
12493634 | 1 | 0.1% |
13136631 | 1 | 0.1% |
14712588 | 1 | 0.1% |
14929656 | 1 | 0.1% |
15226565 | 1 | 0.1% |
Value | Count | Frequency (%) |
139884549 | 1 | 0.1% |
138104880 | 1 | 0.1% |
137655932 | 1 | 0.1% |
136387201 | 1 | 0.1% |
136202904 | 1 | 0.1% |
136104060 | 1 | 0.1% |
131765404 | 1 | 0.1% |
130954009 | 1 | 0.1% |
130928776 | 1 | 0.1% |
130715235 | 1 | 0.1% |
변경사번
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 104 |
---|---|
Distinct (%) | 10.8% |
Missing | 36 |
Missing (%) | 3.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2229.5612 |
Minimum | 1159 |
---|---|
Maximum | 61794 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 1159 |
---|---|
5-th percentile | 1487 |
Q1 | 1605 |
median | 1690 |
Q3 | 1872 |
95-th percentile | 2000.95 |
Maximum | 61794 |
Range | 60635 |
Interquartile range (IQR) | 267 |
Descriptive statistics
Standard deviation | 4277.34 |
---|---|
Coefficient of variation (CV) | 1.9184672 |
Kurtosis | 144.88309 |
Mean | 2229.5612 |
Median Absolute Deviation (MAD) | 146 |
Skewness | 11.844126 |
Sum | 2149297 |
Variance | 18295637 |
Monotonicity | Not monotonic |
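For columns with missing values, the figures above are consistent with using the non-missing count as the denominator, which is why Distinct (%) reads 10.8% rather than 10.4%. The sketch below recomputes the distinct percentage and the mean for 변경사번 from the reported counts; the variable names are illustrative.

```python
# Figures copied from the 변경사번 tables in this report.
n_rows = 1000
n_missing = 36
n_distinct = 104
column_sum = 2149297

n_present = n_rows - n_missing               # 964 non-missing values
distinct_pct = 100 * n_distinct / n_present  # relative to non-missing rows
mean = column_sum / n_present

print(f"{distinct_pct:.1f}%")
print(f"{mean:.4f}")
```

The mean is well above the median of 1690 because a handful of values above 50000 appear in the extreme-values table; the median and MAD describe the bulk of the column more faithfully.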
Value | Count | Frequency (%) |
1690 | 98 | 9.8% |
1487 | 74 | 7.4% |
1605 | 73 | 7.3% |
1696 | 54 | 5.4% |
1590 | 34 | 3.4% |
1592 | 32 | 3.2% |
1842 | 29 | 2.9% |
1867 | 26 | 2.6% |
1872 | 26 | 2.6% |
1883 | 25 | 2.5% |
Other values (94) | 493 | 49.3% |
(Missing) | 36 | 3.6% |
Value | Count | Frequency (%) |
1159 | 1 | 0.1% |
1179 | 1 | 0.1% |
1248 | 1 | 0.1% |
1253 | 2 | 0.2% |
1360 | 1 | 0.1% |
1406 | 15 | 1.5% |
1438 | 1 | 0.1% |
1455 | 1 | 0.1% |
1459 | 1 | 0.1% |
1461 | 2 | 0.2% |
Value | Count | Frequency (%) |
61794 | 1 | 0.1% |
53655 | 1 | 0.1% |
53569 | 2 | 0.2% |
53567 | 1 | 0.1% |
53566 | 1 | 0.1% |
6201 | 1 | 0.1% |
6153 | 1 | 0.1% |
6151 | 3 | 0.3% |
6070 | 2 | 0.2% |
6066 | 3 | 0.3% |
변경부점코드
Categorical
HIGH CORRELATION
 
Distinct | 28 |
---|---|
Distinct (%) | 2.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
QAD | 118 |
---|---|
TAA | 115 |
TAD | 106 |
TAC | 67 |
THA | 60 |
Other values (23) | 534 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.036 |
Min length | 3 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | ACS |
---|---|
2nd row | TAA |
3rd row | TAC |
4th row | TAC |
5th row | TAC |
Common Values
Value | Count | Frequency (%) |
QAD | 118 | 11.8% |
TAA | 115 | 11.5% |
TAD | 106 | 10.6% |
TAC | 67 | 6.7% |
THA | 60 | 6.0% |
TAB | 58 | 5.8% |
TPA | 56 | 5.6% |
THO | 50 | 5.0% |
TQA | 45 | 4.5% |
TBA | 39 | 3.9% |
Other values (18) | 286 | 28.6% |
Length
Value | Count | Frequency (%) |
qad | 118 | 11.8% |
taa | 115 | 11.5% |
tad | 106 | 10.6% |
tac | 67 | 6.7% |
tha | 60 | 6.0% |
tab | 58 | 5.8% |
tpa | 56 | 5.6% |
tho | 50 | 5.0% |
tqa | 45 | 4.5% |
tba | 39 | 3.9% |
Other values (18) | 286 | 28.6% |
등록사번
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 114 |
---|---|
Distinct (%) | 12.1% |
Missing | 55 |
Missing (%) | 5.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1676.9365 |
Minimum | 1020 |
---|---|
Maximum | 2002 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 1020 |
---|---|
5-th percentile | 1365.2 |
Q1 | 1590 |
median | 1690 |
Q3 | 1843 |
95-th percentile | 1937 |
Maximum | 2002 |
Range | 982 |
Interquartile range (IQR) | 253 |
Descriptive statistics
Standard deviation | 183.70802 |
---|---|
Coefficient of variation (CV) | 0.10954978 |
Kurtosis | 0.41946445 |
Mean | 1676.9365 |
Median Absolute Deviation (MAD) | 133 |
Skewness | -0.49830097 |
Sum | 1584705 |
Variance | 33748.638 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1690 | 105 | 10.5% |
1605 | 84 | 8.4% |
1487 | 74 | 7.4% |
1696 | 57 | 5.7% |
1592 | 33 | 3.3% |
1590 | 33 | 3.3% |
1867 | 29 | 2.9% |
1872 | 27 | 2.7% |
1842 | 27 | 2.7% |
1883 | 24 | 2.4% |
Other values (104) | 452 | 45.2% |
(Missing) | 55 | 5.5% |
Value | Count | Frequency (%) |
1020 | 1 | 0.1% |
1088 | 1 | 0.1% |
1096 | 1 | 0.1% |
1098 | 1 | 0.1% |
1129 | 1 | 0.1% |
1130 | 1 | 0.1% |
1133 | 1 | 0.1% |
1137 | 1 | 0.1% |
1141 | 1 | 0.1% |
1145 | 1 | 0.1% |
Value | Count | Frequency (%) |
2002 | 1 | 0.1% |
1995 | 7 | 0.7% |
1987 | 5 | 0.5% |
1980 | 4 | 0.4% |
1978 | 21 | 2.1% |
1973 | 5 | 0.5% |
1958 | 4 | 0.4% |
1937 | 10 | 1.0% |
1934 | 10 | 1.0% |
1929 | 1 | 0.1% |
등록부점코드
Categorical
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
TAA | 129 |
---|---|
QAD | 129 |
TAD | 105 |
THO | 76 |
TAC | 66 |
Other values (20) | 495 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | TBA |
---|---|
2nd row | TAA |
3rd row | TAC |
4th row | TAC |
5th row | TAC |
Common Values
Value | Count | Frequency (%) |
TAA | 129 | 12.9% |
QAD | 129 | 12.9% |
TAD | 105 | 10.5% |
THO | 76 | 7.6% |
TAC | 66 | 6.6% |
THA | 66 | 6.6% |
TAB | 58 | 5.8% |
TPA | 57 | 5.7% |
TBA | 50 | 5.0% |
TQA | 47 | 4.7% |
Other values (15) | 217 | 21.7% |
Length
Value | Count | Frequency (%) |
taa | 129 | 12.9% |
qad | 129 | 12.9% |
tad | 105 | 10.5% |
tho | 76 | 7.6% |
tac | 66 | 6.6% |
tha | 66 | 6.6% |
tab | 58 | 5.8% |
tpa | 57 | 5.7% |
tba | 50 | 5.0% |
tqa | 47 | 4.7% |
Other values (15) | 217 | 21.7% |
Variable | 본부수관여부 | 주채무자고객번호 | 변경사번 | 변경부점코드 | 등록사번 | 등록부점코드 |
---|---|---|---|---|---|---|
본부수관여부 | 1.000 | 0.746 | 0.000 | 0.391 | 0.184 | 0.183 |
주채무자고객번호 | 0.746 | 1.000 | 0.138 | 0.392 | 0.446 | 0.289 |
변경사번 | 0.000 | 0.138 | 1.000 | 0.356 | 0.278 | 0.000 |
변경부점코드 | 0.391 | 0.392 | 0.356 | 1.000 | 0.872 | 0.998 |
등록사번 | 0.184 | 0.446 | 0.278 | 0.872 | 1.000 | 0.855 |
등록부점코드 | 0.183 | 0.289 | 0.000 | 0.998 | 0.855 | 1.000 |
Variable | 본부수관여부 | 등록부점코드 | 변경부점코드 |
---|---|---|---|
본부수관여부 | 1.000 | 0.156 | 0.307 |
등록부점코드 | 0.156 | 1.000 | 0.958 |
변경부점코드 | 0.307 | 0.958 | 1.000 |
Variable | 주채무자고객번호 | 변경사번 | 등록사번 | 본부수관여부 | 변경부점코드 | 등록부점코드 |
---|---|---|---|---|---|---|
주채무자고객번호 | 1.000 | 0.069 | 0.250 | 0.583 | 0.150 | 0.105 |
변경사번 | 0.069 | 1.000 | 0.752 | 0.000 | 0.170 | 0.000 |
등록사번 | 0.250 | 0.752 | 1.000 | 0.141 | 0.546 | 0.504 |
본부수관여부 | 0.583 | 0.000 | 0.141 | 1.000 | 0.307 | 0.156 |
변경부점코드 | 0.150 | 0.170 | 0.546 | 0.307 | 1.000 | 0.958 |
등록부점코드 | 0.105 | 0.000 | 0.504 | 0.156 | 0.958 | 1.000 |
Index | 보증번호 | 본부수관여부 | 주채무자고객번호 | 변경사번 | 변경부점코드 | 등록사번 | 등록부점코드 |
---|---|---|---|---|---|---|---|
0 | TQH2000000140 | Y | 22391645 | 1890 | ACS | <NA> | TBA |
1 | TAA2014130650 | N | 99749119 | 1883 | TAA | 1605 | TAA |
2 | TAC2018087561 | N | 124058610 | 1590 | TAC | 1590 | TAC |
3 | TAC2014068194 | N | 97554474 | 1590 | TAC | 1590 | TAC |
4 | TAC2014055963 | N | 97554474 | 1590 | TAC | 1590 | TAC |
5 | TAC2016051061 | N | 110402283 | 1590 | TAC | 1590 | TAC |
6 | TLB2018010688 | N | 121545090 | 1476 | TLB | 1476 | TLB |
7 | THO2017055226 | N | 119431462 | 1614 | THO | 1614 | THO |
8 | TOA2017021994 | N | 118421318 | 1934 | TOA | 1934 | TOA |
9 | TJA2015000856 | N | 100785538 | 1530 | TJB | 1530 | TJB |
Index | 보증번호 | 본부수관여부 | 주채무자고객번호 | 변경사번 | 변경부점코드 | 등록사번 | 등록부점코드 |
---|---|---|---|---|---|---|---|
990 | TPA2012017978 | N | 87508773 | 1521 | TPB | 1521 | TPB |
991 | TLB2016003060 | N | 87825948 | 6054 | TLB | 1476 | TLB |
992 | TLB2016011220 | N | 110434523 | 1926 | TLB | 1926 | TLB |
993 | QAD2011041447 | N | 82435553 | 1686 | QAD | 1686 | QAD |
994 | TOA2012023195 | N | 89820381 | 1934 | TOA | 1520 | TOA |
995 | TQA2018019079 | N | 70411657 | 1679 | TQA | 1679 | TQA |
996 | TPA2015045490 | N | 51611008 | 1668 | TPA | 1668 | TPA |
997 | TMA2019036614 | N | 61787002 | 6151 | TMA | 1606 | TMA |
998 | TMA2013008997 | N | 90641465 | 1606 | TMA | 1606 | TMA |
999 | TRA2019006076 | N | 49563854 | 1726 | TRA | 1726 | TRA |
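The very high correlation reported between 변경부점코드 and 등록부점코드 is visible in the sample rows: the two codes are identical in almost every record. The sketch below counts matches over the 20 rows shown above (first 10 and last 10). It is only an illustration on those displayed rows, not a computation of the report's correlation coefficients, and 20 rows are not a representative sample of the full 1000.

```python
# (index, 변경부점코드, 등록부점코드) for the 20 sample rows shown in this report.
rows = [
    (0, "ACS", "TBA"), (1, "TAA", "TAA"), (2, "TAC", "TAC"), (3, "TAC", "TAC"),
    (4, "TAC", "TAC"), (5, "TAC", "TAC"), (6, "TLB", "TLB"), (7, "THO", "THO"),
    (8, "TOA", "TOA"), (9, "TJB", "TJB"),
    (990, "TPB", "TPB"), (991, "TLB", "TLB"), (992, "TLB", "TLB"),
    (993, "QAD", "QAD"), (994, "TOA", "TOA"), (995, "TQA", "TQA"),
    (996, "TPA", "TPA"), (997, "TMA", "TMA"), (998, "TMA", "TMA"),
    (999, "TRA", "TRA"),
]

# Rows where the modifying branch differs from the registering branch.
mismatches = [(idx, changed, registered)
              for idx, changed, registered in rows
              if changed != registered]
matches = len(rows) - len(mismatches)

print(f"{matches} of {len(rows)} sample rows have identical branch codes")
print(mismatches)
```

When two columns agree this often, one of them carries little additional information, which is worth keeping in mind before using both as features.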