Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 536 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 18.4 KiB |
Average record size in memory | 35.2 B |
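The average record size follows from the two figures above it. A minimal sketch of that arithmetic, assuming the report uses binary units (1 KiB = 1024 bytes); the variable names are illustrative only:

```python
# Figures copied from the "Dataset statistics" table.
total_size_kib = 18.4   # Total size in memory, in KiB
n_observations = 536    # Number of observations

# Assumption: 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")  # prints "35.2 B per record"
```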
Variable types
Text | 1 |
---|---|
Numeric | 1 |
Categorical | 2 |
Dataset
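Description | Data on collateral detail batches provided by the Bond Management Department of the Korea Housing Finance Corporation, with fields including guarantee number, performance claim date, processing sequence, and collateral detail sequence. |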
---|---|
Author | Korea Housing Finance Corporation (한국주택금융공사) |
URL | https://www.data.go.kr/data/15073033/fileData.do |
Alerts
PROCESS_SEQ has constant value "1" | Constant |
---|---|
SCRTY_CONT_SEQ is highly imbalanced (96.5%) | Imbalance |
Reproduction
Analysis started | 2023-12-12 23:46:27.250337 |
---|---|
Analysis finished | 2023-12-12 23:46:27.597342 |
Duration | 0.35 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
GUARNT_NO
Text
Distinct | 421 |
---|---|
Distinct (%) | 78.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.3 KiB |
Length
Max length | 13 |
---|---|
Median length | 13 |
Mean length | 13 |
Min length | 13 |
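Every GUARNT_NO value is exactly 13 characters long, and the character totals in this report (1608 uppercase letters and 5360 digits over 536 rows) are consistent with three letters followed by ten digits per value. The sketch below checks a value against that shape. The pattern is an inference from the profile, not a format documented by the publisher, and `is_valid_guarnt_no` is a hypothetical helper name.

```python
import re

# Assumed pattern (inferred from the report, not an official specification):
# 3 uppercase ASCII letters followed by 10 digits, 13 characters in total.
GUARNT_NO_PATTERN = re.compile(r"[A-Z]{3}[0-9]{10}")


def is_valid_guarnt_no(value: str) -> bool:
    """Return True if the whole string matches the assumed 3+10 layout."""
    return GUARNT_NO_PATTERN.fullmatch(value) is not None


# Two values from the report's sample rows, plus two malformed variants.
for candidate in ["THO2014029259", "TBA2018000923", "tho2014029259", "THO201402925"]:
    print(candidate, is_valid_guarnt_no(candidate))
```

A check like this is useful mainly as a guard when loading new extracts of the file; rows that fail it would need manual inspection rather than automatic rejection.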
Characters and Unicode
Total characters | 6968 |
---|---|
Distinct characters | 24 |
Distinct categories | 2 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 341 |
---|---|
Unique (%) | 63.6% |
Sample
1st row | THO2014029259 |
---|---|
2nd row | TBA2018000923 |
3rd row | TPB2017003479 |
4th row | TAB2013027419 |
5th row | TBA2018000923 |
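"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts values that occur exactly once. A small sketch using only the five sample rows above (so the numbers it produces describe the sample, not the full 536-row column):

```python
from collections import Counter

# The five sample rows listed in this report.
sample_rows = [
    "THO2014029259",
    "TBA2018000923",
    "TPB2017003479",
    "TAB2013027419",
    "TBA2018000923",
]

counts = Counter(sample_rows)

# Distinct: number of different values.
n_distinct = len(counts)
# Unique: number of values that appear exactly once.
n_unique = sum(1 for c in counts.values() if c == 1)

print(n_distinct, n_unique)  # prints "4 3"
```

Applied to the full column, the same two definitions give the reported 421 distinct and 341 unique values, meaning 80 guarantee numbers appear more than once.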
Value | Count | Frequency (%) |
tha2015051284 | 7 | 1.3% |
tlb2017009751 | 5 | 0.9% |
tho2016057593 | 4 | 0.7% |
tpa2014031195 | 4 | 0.7% |
tlb2015009657 | 4 | 0.7% |
tac2015032789 | 4 | 0.7% |
tab2014005247 | 4 | 0.7% |
tho2014040037 | 3 | 0.6% |
tlb2015020348 | 3 | 0.6% |
tho2014056885 | 3 | 0.6% |
Other values (411) | 495 | 92.4% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 1343 | 19.3% |
2 | 947 | 13.6% |
1 | 832 | 11.9% |
A | 454 | 6.5% |
5 | 424 | 6.1% |
T | 421 | 6.0% |
3 | 413 | 5.9% |
4 | 383 | 5.5% |
8 | 278 | 4.0% |
7 | 259 | 3.7% |
Other values (14) | 1214 | 17.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 5360 | 76.9% |
Uppercase Letter | 1608 | 23.1% |
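The category names in this table correspond to Unicode general categories (`Nd` for Decimal Number, `Lu` for Uppercase Letter). As a rough illustration, the standard-library `unicodedata` module can tally them. The sketch below runs on the five sample rows only, so its counts are for the sample rather than the whole column.

```python
import unicodedata
from collections import Counter

# The five sample rows listed in this report.
sample_rows = [
    "THO2014029259",
    "TBA2018000923",
    "TPB2017003479",
    "TAB2013027419",
    "TBA2018000923",
]

# Tally the Unicode general category of every character.
category_counts = Counter(
    unicodedata.category(ch) for row in sample_rows for ch in row
)

for category, count in category_counts.most_common():
    share = count / sum(category_counts.values())
    print(f"{category}: {count} ({share:.1%})")
```

The sample gives 50 digits and 15 letters, the same 10:3 ratio as the 5360 and 1608 reported for the full column.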
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 454 | 28.2% |
T | 421 | 26.2% |
H | 181 | 11.3% |
Q | 125 | 7.8% |
D | 122 | 7.6% |
B | 117 | 7.3% |
O | 117 | 7.3% |
C | 36 | 2.2% |
L | 16 | 1.0% |
P | 12 | 0.7% |
Other values (4) | 7 | 0.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 1343 | 25.1% |
2 | 947 | 17.7% |
1 | 832 | 15.5% |
5 | 424 | 7.9% |
3 | 413 | 7.7% |
4 | 383 | 7.1% |
8 | 278 | 5.2% |
7 | 259 | 4.8% |
9 | 247 | 4.6% |
6 | 234 | 4.4% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 5360 | 76.9% |
Latin | 1608 | 23.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 454 | 28.2% |
T | 421 | 26.2% |
H | 181 | 11.3% |
Q | 125 | 7.8% |
D | 122 | 7.6% |
B | 117 | 7.3% |
O | 117 | 7.3% |
C | 36 | 2.2% |
L | 16 | 1.0% |
P | 12 | 0.7% |
Other values (4) | 7 | 0.4% |
Common
Value | Count | Frequency (%) |
0 | 1343 | 25.1% |
2 | 947 | 17.7% |
1 | 832 | 15.5% |
5 | 424 | 7.9% |
3 | 413 | 7.7% |
4 | 383 | 7.1% |
8 | 278 | 5.2% |
7 | 259 | 4.8% |
9 | 247 | 4.6% |
6 | 234 | 4.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 6968 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 1343 | 19.3% |
2 | 947 | 13.6% |
1 | 832 | 11.9% |
A | 454 | 6.5% |
5 | 424 | 6.1% |
T | 421 | 6.0% |
3 | 413 | 5.9% |
4 | 383 | 5.5% |
8 | 278 | 4.0% |
7 | 259 | 3.7% |
Other values (14) | 1214 | 17.4% |
DISCHRG_DEMND_DY
Real number (ℝ)
Distinct | 387 |
---|---|
Distinct (%) | 72.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20176599 |
Minimum | 20090731 |
---|---|
Maximum | 20201023 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.8 KiB |
Quantile statistics
Minimum | 20090731 |
---|---|
5-th percentile | 20150614 |
Q1 | 20160904 |
median | 20180461 |
Q3 | 20190916 |
95-th percentile | 20200824 |
Maximum | 20201023 |
Range | 110292 |
Interquartile range (IQR) | 30012.25 |
Descriptive statistics
Standard deviation | 21747.966 |
---|---|
Coefficient of variation (CV) | 0.0010778807 |
Kurtosis | 4.0413468 |
Mean | 20176599 |
Median Absolute Deviation (MAD) | 10666.5 |
Skewness | -1.6048367 |
Sum | 1.0814657 × 10^10 |
Variance | 4.7297401 × 10^8 |
Monotonicity | Not monotonic |
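Several of the descriptive statistics above are derived from one another. The sketch below recomputes three of them from the reported figures as a consistency check. Small differences are expected because the inputs are already rounded in the report.

```python
# Figures copied from the tables for DISCHRG_DEMND_DY.
mean = 20176599
std = 21747.966
minimum = 20090731
maximum = 20201023

# Coefficient of variation = standard deviation / mean.
cv = std / mean
# Variance = standard deviation squared.
variance = std ** 2
# Range = maximum - minimum.
value_range = maximum - minimum

print(f"CV       ~ {cv:.10f}")
print(f"Variance ~ {variance:.4e}")
print(f"Range    = {value_range}")
```

Because the column stores dates as YYYYMMDD integers, these statistics describe the integers rather than elapsed time, so a range of 110292 is not a number of days.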
Common Values
Value | Count | Frequency (%) |
20190607 | 6 | 1.1% |
20160518 | 5 | 0.9% |
20160715 | 4 | 0.7% |
20180315 | 4 | 0.7% |
20190830 | 4 | 0.7% |
20200306 | 4 | 0.7% |
20200617 | 4 | 0.7% |
20170628 | 4 | 0.7% |
20181129 | 3 | 0.6% |
20160127 | 3 | 0.6% |
Other values (377) | 495 | 92.4% |
Minimum 10 values
Value | Count | Frequency (%) |
20090731 | 2 | 0.4% |
20090806 | 1 | 0.2% |
20090819 | 1 | 0.2% |
20090921 | 1 | 0.2% |
20091209 | 2 | 0.4% |
20091210 | 1 | 0.2% |
20091214 | 1 | 0.2% |
20100120 | 1 | 0.2% |
20100310 | 2 | 0.4% |
20100325 | 1 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
20201023 | 1 | 0.2% |
20201021 | 2 | 0.4% |
20201019 | 1 | 0.2% |
20201016 | 3 | 0.6% |
20201015 | 3 | 0.6% |
20201014 | 2 | 0.4% |
20201008 | 2 | 0.4% |
20200929 | 2 | 0.4% |
20200928 | 1 | 0.2% |
20200925 | 1 | 0.2% |
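DISCHRG_DEMND_DY was profiled as a real number, but the dataset description identifies it as a claim date, and its values read as YYYYMMDD. A minimal sketch of parsing it as a date instead, assuming that encoding holds for every row; `parse_yyyymmdd` is a hypothetical helper name:

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Parse an integer such as 20201023 as a calendar date (assumed YYYYMMDD)."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# Minimum and maximum reported for the column.
earliest = parse_yyyymmdd(20090731)
latest = parse_yyyymmdd(20201023)
span_days = (latest - earliest).days
print(earliest, latest, span_days)

# The reported median (20180461) is an interpolated integer, not a real date.
try:
    parse_yyyymmdd(20180461)
    median_is_date = True
except ValueError:
    median_is_date = False
print("median parses as a date:", median_is_date)
```

Converting the column before profiling would give quantiles and ranges expressed in calendar terms rather than as integer arithmetic on date codes.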
PROCESS_SEQ
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.3 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 536 | 100.0% |
SCRTY_CONT_SEQ
Categorical
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.3 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 534 | 99.6% |
2 | 2 | 0.4% |
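The 96.5% imbalance figure reported for SCRTY_CONT_SEQ can be reproduced from these two counts with a normalised-entropy score: one minus the Shannon entropy of the class distribution divided by its maximum possible value. The sketch below is an assumption about how the figure arises; it matches the reported number, but it is not taken from the profiling tool's source.

```python
import math

# Value counts from the table above.
counts = {"1": 534, "2": 2}

total = sum(counts.values())
probabilities = [c / total for c in counts.values()]

# Shannon entropy in bits, and its maximum for this number of classes.
entropy = -sum(p * math.log2(p) for p in probabilities)
max_entropy = math.log2(len(counts))

# Assumed score: 0 for a perfectly balanced column, 1 for a single class.
imbalance = 1 - entropy / max_entropy
print(f"{imbalance:.1%}")  # prints "96.5%"
```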
Correlations
Variable | DISCHRG_DEMND_DY | SCRTY_CONT_SEQ |
---|---|---|
DISCHRG_DEMND_DY | 1.000 | 0.000 |
SCRTY_CONT_SEQ | 0.000 | 1.000 |
Variable | DISCHRG_DEMND_DY | SCRTY_CONT_SEQ |
---|---|---|
DISCHRG_DEMND_DY | 1.000 | 0.140 |
SCRTY_CONT_SEQ | 0.140 | 1.000 |
First rows
Index | GUARNT_NO | DISCHRG_DEMND_DY | PROCESS_SEQ | SCRTY_CONT_SEQ |
---|---|---|---|---|
0 | THO2014029259 | 20201023 | 1 | 1 |
1 | TBA2018000923 | 20201021 | 1 | 1 |
2 | TPB2017003479 | 20201021 | 1 | 1 |
3 | TAB2013027419 | 20201019 | 1 | 1 |
4 | TBA2018000923 | 20201016 | 1 | 1 |
5 | THO2012000842 | 20201016 | 1 | 1 |
6 | THB2015040480 | 20201016 | 1 | 1 |
7 | TAD2015007294 | 20201015 | 1 | 1 |
8 | TAC2015043842 | 20201015 | 1 | 1 |
9 | THA2013030069 | 20201015 | 1 | 1 |
Last rows
Index | GUARNT_NO | DISCHRG_DEMND_DY | PROCESS_SEQ | SCRTY_CONT_SEQ |
---|---|---|---|---|
526 | TLA2002008445 | 20100326 | 1 | 1 |
527 | THA2002023086 | 20090921 | 1 | 1 |
528 | THA2002020834 | 20090819 | 1 | 1 |
529 | THA2002067127 | 20090806 | 1 | 1 |
530 | THO2002116849 | 20091214 | 1 | 1 |
531 | THO2003039955 | 20100607 | 1 | 1 |
532 | THA2002024282 | 20101013 | 1 | 1 |
533 | QAC2002064084 | 20090731 | 1 | 2 |
534 | QAC2002064084 | 20090731 | 1 | 1 |
535 | TLA2002008445 | 20100325 | 1 | 1 |