Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 1000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 156 |
Duplicate rows (%) | 15.6% |
Total size in memory | 25.5 KiB |
Average record size in memory | 26.1 B |
Variable types
Text | 1 |
---|---|
Numeric | 1 |
Categorical | 1 |
Dataset
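Description | Public data on the cost-detail work of the Claims Management Department of the Korea Housing Finance Corporation (source data from the department's work-related database that can be made public). |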
---|---|
Author | Korea Housing Finance Corporation (한국주택금융공사) |
URL | https://www.data.go.kr/data/15072952/fileData.do |
PROCESS_SEQ has constant value "1" | Constant |
Dataset has 156 (15.6%) duplicate rows | Duplicates |
Reproduction
Analysis started | 2023-12-12 19:48:37.519245 |
---|---|
Analysis finished | 2023-12-12 19:48:37.933352 |
Duration | 0.41 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
GUARNT_NO
Text
Distinct | 699 |
---|---|
Distinct (%) | 69.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
Length
Max length | 13 |
---|---|
Median length | 13 |
Mean length | 13 |
Min length | 13 |
Characters and Unicode
Total characters | 13000 |
---|---|
Distinct characters | 25 |
Distinct categories | 2 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 500 |
---|---|
Unique (%) | 50.0% |
Sample
1st row | TLA2012005060 |
---|---|
2nd row | TAD2018047084 |
3rd row | TAC2016050579 |
4th row | THB2018049406 |
5th row | TAC2018087561 |
Value | Count | Frequency (%) |
tma2011019772 | 10 | 1.0% |
taa2010075463 | 10 | 1.0% |
tha2018025254 | 6 | 0.6% |
tba2015031242 | 6 | 0.6% |
tha2016068933 | 6 | 0.6% |
taa2011077993 | 6 | 0.6% |
qad2013031910 | 5 | 0.5% |
tba2019002283 | 5 | 0.5% |
tab2013036041 | 5 | 0.5% |
tba2013034710 | 4 | 0.4% |
Other values (689) | 937 | 93.7% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 2621 | 20.2% |
1 | 1673 | 12.9% |
2 | 1571 | 12.1% |
A | 991 | 7.6% |
T | 894 | 6.9% |
3 | 626 | 4.8% |
5 | 618 | 4.8% |
7 | 611 | 4.7% |
4 | 608 | 4.7% |
6 | 597 | 4.6% |
Other values (15) | 2190 | 16.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 10000 | 76.9% |
Uppercase Letter | 3000 | 23.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 991 | 33.0% |
T | 894 | 29.8% |
B | 261 | 8.7% |
H | 180 | 6.0% |
Q | 166 | 5.5% |
D | 145 | 4.8% |
P | 107 | 3.6% |
O | 71 | 2.4% |
C | 68 | 2.3% |
L | 42 | 1.4% |
Other values (5) | 75 | 2.5% |
Decimal Number
Value | Count | Frequency (%) |
0 | 2621 | 26.2% |
1 | 1673 | 16.7% |
2 | 1571 | 15.7% |
3 | 626 | 6.3% |
5 | 618 | 6.2% |
7 | 611 | 6.1% |
4 | 608 | 6.1% |
6 | 597 | 6.0% |
8 | 590 | 5.9% |
9 | 485 | 4.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 10000 | 76.9% |
Latin | 3000 | 23.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 991 | 33.0% |
T | 894 | 29.8% |
B | 261 | 8.7% |
H | 180 | 6.0% |
Q | 166 | 5.5% |
D | 145 | 4.8% |
P | 107 | 3.6% |
O | 71 | 2.4% |
C | 68 | 2.3% |
L | 42 | 1.4% |
Other values (5) | 75 | 2.5% |
Common
Value | Count | Frequency (%) |
0 | 2621 | 26.2% |
1 | 1673 | 16.7% |
2 | 1571 | 15.7% |
3 | 626 | 6.3% |
5 | 618 | 6.2% |
7 | 611 | 6.1% |
4 | 608 | 6.1% |
6 | 597 | 6.0% |
8 | 590 | 5.9% |
9 | 485 | 4.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 13000 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 2621 | 20.2% |
1 | 1673 | 12.9% |
2 | 1571 | 12.1% |
A | 991 | 7.6% |
T | 894 | 6.9% |
3 | 626 | 4.8% |
5 | 618 | 4.8% |
7 | 611 | 4.7% |
4 | 608 | 4.7% |
6 | 597 | 4.6% |
Other values (15) | 2190 | 16.8% |
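The length and character statistics above (every value is 13 characters long, with 3,000 uppercase letters and 10,000 digits across 1,000 rows) are consistent with a fixed layout of three letters followed by ten digits, as seen in the sample rows. The following is a minimal sketch for checking that layout. The pattern is an assumption inferred from this report, not a published specification, and `is_valid_guarnt_no` is a hypothetical helper name.

```python
import re

# Assumed layout, inferred from the sample rows: 3 uppercase letters + 10 digits.
GUARNT_NO_PATTERN = re.compile(r"[A-Z]{3}[0-9]{10}")


def is_valid_guarnt_no(value: str) -> bool:
    """Return True if value matches the assumed 13-character layout."""
    return GUARNT_NO_PATTERN.fullmatch(value) is not None


# The first three sample rows from the report.
samples = ["TLA2012005060", "TAD2018047084", "TAC2016050579"]
print(all(is_valid_guarnt_no(s) for s in samples))
```

Using `fullmatch` rather than `match` rejects values with trailing characters, which matters if the column is later joined against other tables.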
DISCHRG_DEMND_DY
Real number (ℝ)
Distinct | 212 |
---|---|
Distinct (%) | 21.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20199891 |
Minimum | 20191206 |
---|---|
Maximum | 20201023 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 20191206 |
---|---|
5-th percentile | 20191220 |
Q1 | 20200310 |
median | 20200601 |
Q3 | 20200810 |
95-th percentile | 20201007 |
Maximum | 20201023 |
Range | 9817 |
Interquartile range (IQR) | 500 |
Descriptive statistics
Standard deviation | 2466.2944 |
---|---|
Coefficient of variation (CV) | 0.00012209444 |
Kurtosis | 8.4016826 |
Mean | 20199891 |
Median Absolute Deviation (MAD) | 225 |
Skewness | -3.1990739 |
Sum | 2.0199891 × 10¹⁰ |
Variance | 6082608.2 |
Monotonicity | Decreasing |
Value | Count | Frequency (%) |
20200623 | 16 | 1.6% |
20200514 | 13 | 1.3% |
20200303 | 12 | 1.2% |
20200723 | 11 | 1.1% |
20200513 | 11 | 1.1% |
20200615 | 10 | 1.0% |
20200921 | 10 | 1.0% |
20200618 | 10 | 1.0% |
20200428 | 10 | 1.0% |
20200727 | 10 | 1.0% |
Other values (202) | 887 | 88.7% |
Value | Count | Frequency (%) |
20191206 | 6 | 0.6% |
20191209 | 4 | 0.4% |
20191210 | 5 | 0.5% |
20191211 | 1 | 0.1% |
20191212 | 2 | 0.2% |
20191213 | 5 | 0.5% |
20191216 | 3 | 0.3% |
20191217 | 5 | 0.5% |
20191218 | 9 | 0.9% |
20191219 | 6 | 0.6% |
Value | Count | Frequency (%) |
20201023 | 4 | 0.4% |
20201022 | 6 | 0.6% |
20201020 | 6 | 0.6% |
20201019 | 5 | 0.5% |
20201016 | 5 | 0.5% |
20201015 | 8 | 0.8% |
20201013 | 1 | 0.1% |
20201012 | 6 | 0.6% |
20201008 | 4 | 0.4% |
20201007 | 9 | 0.9% |
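DISCHRG_DEMND_DY was profiled as a real number, but its values look like calendar dates written as YYYYMMDD integers. That reading is an assumption based on the values shown, not something the report states. Under that assumption, integer statistics such as the range of 9817 do not measure elapsed time. A minimal sketch of converting the minimum and maximum before computing the span follows; `parse_yyyymmdd` is a hypothetical helper name.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Interpret an integer such as 20201023 as the date 2020-10-23 (assumed encoding)."""
    return datetime.strptime(str(value), "%Y%m%d").date()


earliest = parse_yyyymmdd(20191206)  # reported minimum
latest = parse_yyyymmdd(20201023)    # reported maximum

# Elapsed calendar days between the two dates.
span_days = (latest - earliest).days
print(span_days)  # prints 322
```

The same conversion would be needed before computing a meaningful mean, standard deviation, or interquartile range for this column.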
PROCESS_SEQ
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 1000 | 100.0% |
 | GUARNT_NO | DISCHRG_DEMND_DY | PROCESS_SEQ |
---|---|---|---|
0 | TLA2012005060 | 20201023 | 1 |
1 | TAD2018047084 | 20201023 | 1 |
2 | TAC2016050579 | 20201023 | 1 |
3 | THB2018049406 | 20201023 | 1 |
4 | TAC2018087561 | 20201022 | 1 |
5 | QAD2013087981 | 20201022 | 1 |
6 | QAD2015086581 | 20201022 | 1 |
7 | TAA2014062865 | 20201022 | 1 |
8 | TAA2018091318 | 20201022 | 1 |
9 | TAA2014037479 | 20201022 | 1 |
 | GUARNT_NO | DISCHRG_DEMND_DY | PROCESS_SEQ |
---|---|---|---|
990 | TQA2018025519 | 20191209 | 1 |
991 | TQA2018025519 | 20191209 | 1 |
992 | TAB2015053078 | 20191209 | 1 |
993 | TAB2015053078 | 20191209 | 1 |
994 | TPA2015018732 | 20191206 | 1 |
995 | THB2014000983 | 20191206 | 1 |
996 | THB2014000983 | 20191206 | 1 |
997 | THO2017029139 | 20191206 | 1 |
998 | THO2017029139 | 20191206 | 1 |
999 | TAC2015060728 | 20191206 | 1 |
Most frequently occurring
 | GUARNT_NO | DISCHRG_DEMND_DY | PROCESS_SEQ | # duplicates |
---|---|---|---|---|
79 | TBA2019002283 | 20200901 | 1 | 5 |
22 | TAA2010075463 | 20200401 | 1 | 4 |
23 | TAA2010075463 | 20200402 | 1 | 4 |
36 | TAA2015001184 | 20200911 | 1 | 4 |
116 | TMA2011019772 | 20200513 | 1 | 4 |
11 | QAD2015045026 | 20191226 | 1 | 3 |
43 | TAB2011080660 | 20200727 | 1 | 3 |
47 | TAB2013036041 | 20200520 | 1 | 3 |
55 | TAB2015049582 | 20200626 | 1 | 3 |
57 | TAB2016005155 | 20200923 | 1 | 3 |
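The duplicate-row table above can be approximated by grouping identical rows and keeping those that occur more than once. The sketch below uses only the last ten rows shown in this report as input. It does not claim to reproduce how the profiling tool itself counts duplicates, and the variable names are illustrative.

```python
from collections import Counter

# Rows 990-999 as listed in the report: (GUARNT_NO, DISCHRG_DEMND_DY, PROCESS_SEQ).
rows = [
    ("TQA2018025519", 20191209, "1"),
    ("TQA2018025519", 20191209, "1"),
    ("TAB2015053078", 20191209, "1"),
    ("TAB2015053078", 20191209, "1"),
    ("TPA2015018732", 20191206, "1"),
    ("THB2014000983", 20191206, "1"),
    ("THB2014000983", 20191206, "1"),
    ("THO2017029139", 20191206, "1"),
    ("THO2017029139", 20191206, "1"),
    ("TAC2015060728", 20191206, "1"),
]

# Count how often each complete row occurs.
counts = Counter(rows)

# Keep only the rows that appear more than once.
repeated = {row: n for row, n in counts.items() if n > 1}

for row, n in sorted(repeated.items(), key=lambda item: -item[1]):
    print(row, n)
```

Because PROCESS_SEQ is constant, a row is effectively identified by the pair of GUARNT_NO and DISCHRG_DEMND_DY.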