Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 1000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 42.1 KiB |
Average record size in memory | 43.1 B |
Variable types
Text | 1 |
---|---|
Categorical | 1 |
Numeric | 2 |
Boolean | 1 |
Dataset
Description | 한국주택금융공사 주택연금부 업무 관련 공개 공공데이터 보증번호 신용정보 여부가 포함되어있습니다. (해당 부서의 업무와 관련된 데이터베이스에서 공개 가능한 원천 데이터) |
---|---|
Author | 한국주택금융공사 |
URL | https://www.data.go.kr/data/15072830/fileData.do |
신용정보여부 is highly imbalanced (60.5%) | Imbalance |
Reproduction
Analysis started | 2023-12-12 00:53:28.709180 |
---|---|
Analysis finished | 2023-12-12 00:53:29.910238 |
Duration | 1.2 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
보증번호
Text
Distinct | 706 |
---|---|
Distinct (%) | 70.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 14 |
Min length | 14 |
Characters and Unicode
Total characters | 14000 |
---|---|
Distinct characters | 23 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 412 ? |
---|---|
Unique (%) | 41.2% |
Sample
1st row | RTHA2011000308 |
---|---|
2nd row | RTHA2011000308 |
3rd row | RQAD2011000516 |
4th row | RQAD2011000516 |
5th row | RQAD2011000515 |
Value | Count | Frequency (%) |
rtha2011000308 | 2 | 0.2% |
rtab2011000329 | 2 | 0.2% |
rtna2011000038 | 2 | 0.2% |
rtba2011000076 | 2 | 0.2% |
rqad2011000382 | 2 | 0.2% |
rqad2011000459 | 2 | 0.2% |
rtaa2011000347 | 2 | 0.2% |
rtab2011000324 | 2 | 0.2% |
rtaa2011000290 | 2 | 0.2% |
rtab2011000245 | 2 | 0.2% |
Other values (696) | 980 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 4382 | |
1 | 2461 | |
2 | 1376 | 9.8% |
A | 1019 | 7.3% |
R | 1002 | 7.2% |
T | 806 | 5.8% |
3 | 390 | 2.8% |
4 | 334 | 2.4% |
H | 265 | 1.9% |
B | 262 | 1.9% |
Other values (13) | 1703 | 12.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 10000 | |
Uppercase Letter | 4000 | 28.6% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 1019 | |
R | 1002 | |
T | 806 | |
H | 265 | 6.6% |
B | 262 | 6.6% |
Q | 211 | 5.3% |
D | 194 | 4.9% |
O | 109 | 2.7% |
P | 45 | 1.1% |
M | 38 | 0.9% |
Other values (3) | 49 | 1.2% |
Decimal Number
Value | Count | Frequency (%) |
0 | 4382 | |
1 | 2461 | |
2 | 1376 | 13.8% |
3 | 390 | 3.9% |
4 | 334 | 3.3% |
5 | 227 | 2.3% |
7 | 217 | 2.2% |
8 | 216 | 2.2% |
9 | 209 | 2.1% |
6 | 188 | 1.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 10000 | |
Latin | 4000 | 28.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 1019 | |
R | 1002 | |
T | 806 | |
H | 265 | 6.6% |
B | 262 | 6.6% |
Q | 211 | 5.3% |
D | 194 | 4.9% |
O | 109 | 2.7% |
P | 45 | 1.1% |
M | 38 | 0.9% |
Other values (3) | 49 | 1.2% |
Common
Value | Count | Frequency (%) |
0 | 4382 | |
1 | 2461 | |
2 | 1376 | 13.8% |
3 | 390 | 3.9% |
4 | 334 | 3.3% |
5 | 227 | 2.3% |
7 | 217 | 2.2% |
8 | 216 | 2.2% |
9 | 209 | 2.1% |
6 | 188 | 1.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 14000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 4382 | |
1 | 2461 | |
2 | 1376 | 9.8% |
A | 1019 | 7.3% |
R | 1002 | 7.2% |
T | 806 | 5.8% |
3 | 390 | 2.8% |
4 | 334 | 2.4% |
H | 265 | 1.9% |
B | 262 | 1.9% |
Other values (13) | 1703 | 12.2% |
회차
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.9 KiB |
1 | |
---|---|
2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 1 |
3rd row | 2 |
4th row | 1 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
1 | 624 | |
2 | 376 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 624 | |
2 | 376 |
고객번호
Real number (ℝ)
Distinct | 967 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 78469957 |
Minimum | 7297641 |
---|---|
Maximum | 84308662 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 7297641 |
---|---|
5-th percentile | 35800774 |
Q1 | 82859389 |
median | 83534622 |
Q3 | 84074168 |
95-th percentile | 84245389 |
Maximum | 84308662 |
Range | 77011021 |
Interquartile range (IQR) | 1214779.8 |
Descriptive statistics
Standard deviation | 15934879 |
---|---|
Coefficient of variation (CV) | 0.20306981 |
Kurtosis | 10.43606 |
Mean | 78469957 |
Median Absolute Deviation (MAD) | 575552 |
Skewness | -3.3731329 |
Sum | 7.8469957 × 1010 |
Variance | 2.5392037 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
83944661 | 2 | 0.2% |
84153435 | 2 | 0.2% |
84077508 | 2 | 0.2% |
83393762 | 2 | 0.2% |
17564102 | 2 | 0.2% |
84069396 | 2 | 0.2% |
84147399 | 2 | 0.2% |
84147289 | 2 | 0.2% |
84069189 | 2 | 0.2% |
83944768 | 2 | 0.2% |
Other values (957) | 980 |
Value | Count | Frequency (%) |
7297641 | 1 | |
7703614 | 1 | |
8571162 | 1 | |
8613884 | 1 | |
8695161 | 1 | |
8724922 | 1 | |
9021491 | 1 | |
9031083 | 1 | |
9263602 | 1 | |
9440218 | 1 |
Value | Count | Frequency (%) |
84308662 | 1 | |
84308565 | 1 | |
84307155 | 1 | |
84307126 | 1 | |
84304996 | 1 | |
84304938 | 2 | |
84302105 | 1 | |
84302082 | 1 | |
84295322 | 1 | |
84295241 | 1 |
신용정보여부
Boolean
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
True | |
---|---|
False | 78 |
Value | Count | Frequency (%) |
True | 922 | |
False | 78 | 7.8% |
등록사번
Real number (ℝ)
Distinct | 56 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5432.264 |
Minimum | 1050 |
---|---|
Maximum | 7481 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 1050 |
---|---|
5-th percentile | 1173 |
Q1 | 1431 |
median | 7313 |
Q3 | 7394 |
95-th percentile | 7471 |
Maximum | 7481 |
Range | 6431 |
Interquartile range (IQR) | 5963 |
Descriptive statistics
Standard deviation | 2848.3652 |
---|---|
Coefficient of variation (CV) | 0.5243422 |
Kurtosis | -1.399501 |
Mean | 5432.264 |
Median Absolute Deviation (MAD) | 144 |
Skewness | -0.77387095 |
Sum | 5432264 |
Variance | 8113184.6 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1198 | 74 | 7.4% |
7383 | 57 | 5.7% |
7313 | 56 | 5.6% |
7394 | 54 | 5.4% |
7309 | 53 | 5.3% |
7300 | 50 | 5.0% |
7382 | 47 | 4.7% |
7471 | 45 | 4.5% |
7457 | 44 | 4.4% |
7350 | 41 | 4.1% |
Other values (46) | 479 |
Value | Count | Frequency (%) |
1050 | 3 | 0.3% |
1104 | 12 | |
1130 | 1 | 0.1% |
1133 | 6 | |
1141 | 9 | |
1143 | 1 | 0.1% |
1147 | 4 | 0.4% |
1149 | 1 | 0.1% |
1152 | 2 | 0.2% |
1170 | 2 | 0.2% |
Value | Count | Frequency (%) |
7481 | 3 | 0.3% |
7476 | 27 | |
7471 | 45 | |
7468 | 3 | 0.3% |
7462 | 38 | |
7461 | 30 | |
7457 | 44 | |
7444 | 35 | |
7394 | 54 | |
7390 | 15 | 1.5% |
회차 | 고객번호 | 신용정보여부 | 등록사번 | |
---|---|---|---|---|
회차 | 1.000 | 0.039 | 0.056 | 0.000 |
고객번호 | 0.039 | 1.000 | 0.000 | 0.103 |
신용정보여부 | 0.056 | 0.000 | 1.000 | 0.386 |
등록사번 | 0.000 | 0.103 | 0.386 | 1.000 |
신용정보여부 | 회차 | |
---|---|---|
신용정보여부 | 1.000 | 0.035 |
회차 | 0.035 | 1.000 |
고객번호 | 등록사번 | 회차 | 신용정보여부 | |
---|---|---|---|---|
고객번호 | 1.000 | 0.031 | 0.000 | 0.000 |
등록사번 | 0.031 | 1.000 | 0.000 | 0.250 |
회차 | 0.000 | 0.000 | 1.000 | 0.035 |
신용정보여부 | 0.000 | 0.250 | 0.035 | 1.000 |
보증번호 | 회차 | 고객번호 | 신용정보여부 | 등록사번 | |
---|---|---|---|---|---|
0 | RTHA2011000308 | 2 | 28830676 | Y | 7394 |
1 | RTHA2011000308 | 1 | 73411667 | Y | 7394 |
2 | RQAD2011000516 | 2 | 84308662 | Y | 7350 |
3 | RQAD2011000516 | 1 | 84308565 | Y | 7350 |
4 | RQAD2011000515 | 2 | 84307155 | Y | 7309 |
5 | RQAD2011000515 | 1 | 84307126 | Y | 7309 |
6 | RTHB2011000179 | 2 | 84304996 | Y | 7462 |
7 | RTHB2011000179 | 1 | 84304938 | Y | 7462 |
8 | RTHA2011000307 | 1 | 84304938 | Y | 7394 |
9 | RTAA2011000422 | 1 | 8724922 | N | 1173 |
보증번호 | 회차 | 고객번호 | 신용정보여부 | 등록사번 | |
---|---|---|---|---|---|
990 | RQAD2011000434 | 2 | 83713135 | Y | 7309 |
991 | RTHB2011000144 | 2 | 83706867 | Y | 7462 |
992 | RTBA2011000142 | 1 | 83710222 | Y | 7471 |
993 | RTPA2011000110 | 2 | 83384357 | Y | 7457 |
994 | RTQA2011000038 | 2 | 79647448 | Y | 1133 |
995 | RTHB2011000140 | 2 | 83658326 | Y | 7462 |
996 | RTHA2011000250 | 1 | 83661339 | Y | 7394 |
997 | RTOA2011000059 | 2 | 83622132 | Y | 7299 |
998 | RTMA2011000096 | 2 | 83618489 | Y | 7444 |
999 | RTHA2011000246 | 1 | 83617260 | Y | 7394 |