Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 500 |
Missing cells | 376 |
Missing cells (%) | 9.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 32.8 KiB |
Average record size in memory | 67.3 B |
Variable types
Text | 3 |
---|---|
Numeric | 3 |
Categorical | 1 |
Boolean | 1 |
Dataset
Description | 해당 파일 데이터는 신용보증기금의 공통RM요구사항업무대행상세에 대한 정보를 확인하실 수 있는 자료이니 데이터 활용에 참고하여 주시기 바랍니다. |
---|---|
Author | 신용보증기금 |
URL | https://www.data.go.kr/data/15093195/fileData.do |
삭제여부 has constant value "" | Constant |
최종수정수 is highly overall correlated with 전자결재상태코드 | High correlation |
전자결재상태코드 is highly overall correlated with 최종수정수 | High correlation |
전자결재상태코드 is highly imbalanced (69.0%) | Imbalance |
원장번호 has 350 (70.0%) missing values | Missing |
현재책임자직원번호 has 13 (2.6%) missing values | Missing |
현재담당자직원번호 has 13 (2.6%) missing values | Missing |
요구사항ID has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 14:15:37.575411 |
---|---|
Analysis finished | 2023-12-12 14:15:39.299594 |
Duration | 1.72 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
요구사항ID
Text
UNIQUE
 
Distinct | 500 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 5000 |
---|---|
Distinct characters | 62 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 500 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 9dnOayu7wf |
---|---|
2nd row | 9dnSYcYcBs |
3rd row | 9dnSXv3Udz |
4th row | 9dmH2zYSOu |
5th row | 9dnObjpe1F |
Value | Count | Frequency (%) |
9dnoayu7wf | 1 | 0.2% |
9dmgdt7cr4 | 1 | 0.2% |
9dmetaiwib | 1 | 0.2% |
9dmgdp3a45 | 1 | 0.2% |
9dmgdhoujr | 1 | 0.2% |
9dmgqickof | 1 | 0.2% |
9dmgremdqn | 1 | 0.2% |
9dmgucbkc3 | 1 | 0.2% |
9dmguu0nxt | 1 | 0.2% |
9dmgszts0h | 1 | 0.2% |
Other values (490) | 490 |
Most occurring characters
Value | Count | Frequency (%) |
d | 565 | 11.3% |
9 | 548 | 11.0% |
m | 381 | 7.6% |
n | 208 | 4.2% |
l | 92 | 1.8% |
J | 76 | 1.5% |
v | 74 | 1.5% |
1 | 72 | 1.4% |
G | 72 | 1.4% |
H | 71 | 1.4% |
Other values (52) | 2841 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 2508 | |
Uppercase Letter | 1446 | |
Decimal Number | 1046 |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
d | 565 | |
m | 381 | |
n | 208 | 8.3% |
l | 92 | 3.7% |
v | 74 | 3.0% |
z | 71 | 2.8% |
p | 70 | 2.8% |
u | 68 | 2.7% |
o | 68 | 2.7% |
k | 68 | 2.7% |
Other values (16) | 843 |
Uppercase Letter
Value | Count | Frequency (%) |
J | 76 | 5.3% |
G | 72 | 5.0% |
H | 71 | 4.9% |
Z | 67 | 4.6% |
P | 67 | 4.6% |
A | 67 | 4.6% |
X | 64 | 4.4% |
D | 64 | 4.4% |
S | 64 | 4.4% |
M | 60 | 4.1% |
Other values (16) | 774 |
Decimal Number
Value | Count | Frequency (%) |
9 | 548 | |
1 | 72 | 6.9% |
3 | 62 | 5.9% |
0 | 57 | 5.4% |
5 | 56 | 5.4% |
6 | 53 | 5.1% |
2 | 53 | 5.1% |
8 | 52 | 5.0% |
4 | 51 | 4.9% |
7 | 42 | 4.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 3954 | |
Common | 1046 | 20.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
d | 565 | 14.3% |
m | 381 | 9.6% |
n | 208 | 5.3% |
l | 92 | 2.3% |
J | 76 | 1.9% |
v | 74 | 1.9% |
G | 72 | 1.8% |
H | 71 | 1.8% |
z | 71 | 1.8% |
p | 70 | 1.8% |
Other values (42) | 2274 |
Common
Value | Count | Frequency (%) |
9 | 548 | |
1 | 72 | 6.9% |
3 | 62 | 5.9% |
0 | 57 | 5.4% |
5 | 56 | 5.4% |
6 | 53 | 5.1% |
2 | 53 | 5.1% |
8 | 52 | 5.0% |
4 | 51 | 4.9% |
7 | 42 | 4.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 5000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
d | 565 | 11.3% |
9 | 548 | 11.0% |
m | 381 | 7.6% |
n | 208 | 4.2% |
l | 92 | 1.8% |
J | 76 | 1.5% |
v | 74 | 1.5% |
1 | 72 | 1.4% |
G | 72 | 1.4% |
H | 71 | 1.4% |
Other values (52) | 2841 |
원장번호
Text
MISSING
 
Distinct | 145 |
---|---|
Distinct (%) | 96.7% |
Missing | 350 |
Missing (%) | 70.0% |
Memory size | 4.0 KiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Characters and Unicode
Total characters | 1800 |
---|---|
Distinct characters | 35 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 140 ? |
---|---|
Unique (%) | 93.3% |
Sample
1st row | QAC202107107 |
---|---|
2nd row | THZ202102554 |
3rd row | TBC202000054 |
4th row | TAO202100393 |
5th row | TIG201600818 |
Value | Count | Frequency (%) |
tif202102026 | 2 | 1.3% |
tpq202101569 | 2 | 1.3% |
jac201800020 | 2 | 1.3% |
tao202100349 | 2 | 1.3% |
tal202003546 | 2 | 1.3% |
tib202101836 | 1 | 0.7% |
ndn202100768 | 1 | 0.7% |
qac201601543 | 1 | 0.7% |
tib202101837 | 1 | 0.7% |
tqa20213683 | 1 | 0.7% |
Other values (134) | 134 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 432 | |
2 | 302 | |
1 | 203 | |
T | 134 | 7.4% |
5 | 92 | 5.1% |
3 | 64 | 3.6% |
A | 52 | 2.9% |
7 | 51 | 2.8% |
6 | 51 | 2.8% |
4 | 50 | 2.8% |
Other values (25) | 369 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1334 | |
Uppercase Letter | 447 | 24.8% |
Space Separator | 19 | 1.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
T | 134 | |
A | 52 | 11.6% |
H | 29 | 6.5% |
I | 24 | 5.4% |
B | 24 | 5.4% |
N | 23 | 5.1% |
Q | 19 | 4.3% |
L | 18 | 4.0% |
P | 17 | 3.8% |
J | 16 | 3.6% |
Other values (14) | 91 |
Decimal Number
Value | Count | Frequency (%) |
0 | 432 | |
2 | 302 | |
1 | 203 | |
5 | 92 | 6.9% |
3 | 64 | 4.8% |
7 | 51 | 3.8% |
6 | 51 | 3.8% |
4 | 50 | 3.7% |
9 | 46 | 3.4% |
8 | 43 | 3.2% |
Space Separator
Value | Count | Frequency (%) |
19 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1353 | |
Latin | 447 | 24.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
T | 134 | |
A | 52 | 11.6% |
H | 29 | 6.5% |
I | 24 | 5.4% |
B | 24 | 5.4% |
N | 23 | 5.1% |
Q | 19 | 4.3% |
L | 18 | 4.0% |
P | 17 | 3.8% |
J | 16 | 3.6% |
Other values (14) | 91 |
Common
Value | Count | Frequency (%) |
0 | 432 | |
2 | 302 | |
1 | 203 | |
5 | 92 | 6.8% |
3 | 64 | 4.7% |
7 | 51 | 3.8% |
6 | 51 | 3.8% |
4 | 50 | 3.7% |
9 | 46 | 3.4% |
8 | 43 | 3.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1800 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 432 | |
2 | 302 | |
1 | 203 | |
T | 134 | 7.4% |
5 | 92 | 5.1% |
3 | 64 | 3.6% |
A | 52 | 2.9% |
7 | 51 | 2.8% |
6 | 51 | 2.8% |
4 | 50 | 2.8% |
Other values (25) | 369 |
현재책임자직원번호
Real number (ℝ)
MISSING
 
Distinct | 250 |
---|---|
Distinct (%) | 51.3% |
Missing | 13 |
Missing (%) | 2.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3932.4928 |
Minimum | 2522 |
---|---|
Maximum | 4647 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 2522 |
---|---|
5-th percentile | 3173 |
Q1 | 3650 |
median | 4005 |
Q3 | 4231 |
95-th percentile | 4521 |
Maximum | 4647 |
Range | 2125 |
Interquartile range (IQR) | 581 |
Descriptive statistics
Standard deviation | 416.17292 |
---|---|
Coefficient of variation (CV) | 0.10582929 |
Kurtosis | 0.14722348 |
Mean | 3932.4928 |
Median Absolute Deviation (MAD) | 276 |
Skewness | -0.63473776 |
Sum | 1915124 |
Variance | 173199.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4261 | 23 | 4.6% |
4517 | 14 | 2.8% |
4554 | 8 | 1.6% |
4033 | 7 | 1.4% |
4521 | 7 | 1.4% |
4213 | 6 | 1.2% |
3984 | 6 | 1.2% |
4391 | 5 | 1.0% |
3650 | 5 | 1.0% |
3979 | 4 | 0.8% |
Other values (240) | 402 | |
(Missing) | 13 | 2.6% |
Value | Count | Frequency (%) |
2522 | 1 | 0.2% |
2689 | 4 | |
2710 | 3 | |
2895 | 2 | |
2981 | 1 | 0.2% |
3055 | 3 | |
3059 | 1 | 0.2% |
3060 | 1 | 0.2% |
3082 | 1 | 0.2% |
3094 | 1 | 0.2% |
Value | Count | Frequency (%) |
4647 | 3 | 0.6% |
4596 | 1 | 0.2% |
4577 | 1 | 0.2% |
4563 | 1 | 0.2% |
4558 | 2 | 0.4% |
4554 | 8 | |
4536 | 1 | 0.2% |
4524 | 2 | 0.4% |
4521 | 7 | |
4517 | 14 |
현재담당자직원번호
Text
MISSING
 
Distinct | 334 |
---|---|
Distinct (%) | 68.6% |
Missing | 13 |
Missing (%) | 2.6% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
5736 | 20 | 4.1% |
5797 | 12 | 2.5% |
5340 | 5 | 1.0% |
5636 | 5 | 1.0% |
6045 | 5 | 1.0% |
4334 | 4 | 0.8% |
4767 | 4 | 0.8% |
5554 | 4 | 0.8% |
5236 | 4 | 0.8% |
5609 | 4 | 0.8% |
Other values (324) | 420 |
Most occurring characters
Value | Count | Frequency (%) |
5 | 405 | |
4 | 251 | |
6 | 232 | |
0 | 180 | |
3 | 177 | |
7 | 154 | 7.9% |
1 | 146 | 7.5% |
9 | 143 | 7.3% |
8 | 138 | 7.1% |
2 | 123 | 6.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1949 | |
Uppercase Letter | 3 | 0.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 405 | |
4 | 251 | |
6 | 232 | |
0 | 180 | |
3 | 177 | |
7 | 154 | 7.9% |
1 | 146 | 7.5% |
9 | 143 | 7.3% |
8 | 138 | 7.1% |
2 | 123 | 6.3% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1949 | |
Latin | 3 | 0.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
5 | 405 | |
4 | 251 | |
6 | 232 | |
0 | 180 | |
3 | 177 | |
7 | 154 | 7.9% |
1 | 146 | 7.5% |
9 | 143 | 7.3% |
8 | 138 | 7.1% |
2 | 123 | 6.3% |
Latin
Value | Count | Frequency (%) |
A | 3 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1952 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
5 | 405 | |
4 | 251 | |
6 | 232 | |
0 | 180 | |
3 | 177 | |
7 | 154 | 7.9% |
1 | 146 | 7.5% |
9 | 143 | 7.3% |
8 | 138 | 7.1% |
2 | 123 | 6.3% |
전자결재상태코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
13 | |
---|---|
11 | 4 |
12 | 2 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.884 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 11 |
---|---|
2nd row | |
3rd row | |
4th row | 13 |
5th row | 13 |
Common Values
Value | Count | Frequency (%) |
13 | 436 | |
58 | 11.6% | |
11 | 4 | 0.8% |
12 | 2 | 0.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
13 | 436 | |
11 | 4 | 0.9% |
12 | 2 | 0.5% |
삭제여부
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 632.0 B |
False |
---|
Value | Count | Frequency (%) |
False | 500 |
최종수정수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 14 |
---|---|
Distinct (%) | 2.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.384 |
Minimum | 1 |
---|---|
Maximum | 18 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 6 |
median | 6 |
Q3 | 7 |
95-th percentile | 11 |
Maximum | 18 |
Range | 17 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 2.5926797 |
---|---|
Coefficient of variation (CV) | 0.40612151 |
Kurtosis | 2.3462113 |
Mean | 6.384 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.021985457 |
Sum | 3192 |
Variance | 6.721988 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 214 | |
7 | 106 | |
1 | 58 | 11.6% |
8 | 47 | 9.4% |
9 | 26 | 5.2% |
10 | 14 | 2.8% |
11 | 11 | 2.2% |
12 | 7 | 1.4% |
5 | 5 | 1.0% |
14 | 4 | 0.8% |
Other values (4) | 8 | 1.6% |
Value | Count | Frequency (%) |
1 | 58 | 11.6% |
2 | 3 | 0.6% |
4 | 1 | 0.2% |
5 | 5 | 1.0% |
6 | 214 | |
7 | 106 | |
8 | 47 | 9.4% |
9 | 26 | 5.2% |
10 | 14 | 2.8% |
11 | 11 | 2.2% |
Value | Count | Frequency (%) |
18 | 2 | 0.4% |
14 | 4 | 0.8% |
13 | 2 | 0.4% |
12 | 7 | 1.4% |
11 | 11 | 2.2% |
10 | 14 | 2.8% |
9 | 26 | 5.2% |
8 | 47 | 9.4% |
7 | 106 | |
6 | 214 |
처리직원번호
Real number (ℝ)
Distinct | 71 |
---|---|
Distinct (%) | 14.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5278.712 |
Minimum | 2522 |
---|---|
Maximum | 6156 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 2522 |
---|---|
5-th percentile | 4496 |
Q1 | 4496 |
median | 5544 |
Q3 | 5803 |
95-th percentile | 6023 |
Maximum | 6156 |
Range | 3634 |
Interquartile range (IQR) | 1307 |
Descriptive statistics
Standard deviation | 606.90309 |
---|---|
Coefficient of variation (CV) | 0.11497181 |
Kurtosis | -0.54211291 |
Mean | 5278.712 |
Median Absolute Deviation (MAD) | 279 |
Skewness | -0.62450281 |
Sum | 2639356 |
Variance | 368331.36 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4496 | 146 | |
5573 | 52 | 10.4% |
5823 | 46 | 9.2% |
5544 | 36 | 7.2% |
5803 | 35 | 7.0% |
5804 | 28 | 5.6% |
5354 | 22 | 4.4% |
6105 | 15 | 3.0% |
5423 | 12 | 2.4% |
5741 | 9 | 1.8% |
Other values (61) | 99 |
Value | Count | Frequency (%) |
2522 | 1 | 0.2% |
3368 | 1 | 0.2% |
4040 | 1 | 0.2% |
4168 | 6 | 1.2% |
4334 | 2 | 0.4% |
4496 | 146 | |
4596 | 1 | 0.2% |
4761 | 1 | 0.2% |
4793 | 1 | 0.2% |
4835 | 2 | 0.4% |
Value | Count | Frequency (%) |
6156 | 1 | 0.2% |
6105 | 15 | |
6099 | 1 | 0.2% |
6096 | 1 | 0.2% |
6078 | 1 | 0.2% |
6067 | 1 | 0.2% |
6045 | 3 | 0.6% |
6035 | 1 | 0.2% |
6023 | 6 | 1.2% |
6000 | 6 | 1.2% |
현재책임자직원번호 | 전자결재상태코드 | 최종수정수 | 처리직원번호 | |
---|---|---|---|---|
현재책임자직원번호 | 1.000 | 0.000 | 0.000 | 0.248 |
전자결재상태코드 | 0.000 | 1.000 | 0.825 | 0.329 |
최종수정수 | 0.000 | 0.825 | 1.000 | 0.356 |
처리직원번호 | 0.248 | 0.329 | 0.356 | 1.000 |
현재책임자직원번호 | 최종수정수 | 처리직원번호 | 전자결재상태코드 | |
---|---|---|---|---|
현재책임자직원번호 | 1.000 | -0.022 | 0.090 | 0.000 |
최종수정수 | -0.022 | 1.000 | 0.082 | 0.695 |
처리직원번호 | 0.090 | 0.082 | 1.000 | 0.236 |
전자결재상태코드 | 0.000 | 0.695 | 0.236 | 1.000 |
요구사항ID | 원장번호 | 현재책임자직원번호 | 현재담당자직원번호 | 전자결재상태코드 | 삭제여부 | 최종수정수 | 처리직원번호 | |
---|---|---|---|---|---|---|---|---|
0 | 9dnOayu7wf | <NA> | 4197 | 5489 | 11 | N | 7 | 5423 |
1 | 9dnSYcYcBs | <NA> | 4517 | 5797 | N | 1 | 5797 | |
2 | 9dnSXv3Udz | <NA> | 3642 | 4366 | N | 1 | 5901 | |
3 | 9dmH2zYSOu | <NA> | 4391 | 5241 | 13 | N | 8 | 5573 |
4 | 9dnObjpe1F | <NA> | 3570 | 6067 | 13 | N | 9 | 5573 |
5 | 9dnLzEL1AA | QAC202107107 | 3981 | 4799 | 13 | N | 6 | 4496 |
6 | 9dnOv8nYGJ | <NA> | 4517 | 5797 | N | 1 | 5797 | |
7 | 9dnOf1QkCG | THZ202102554 | 4554 | 6045 | 13 | N | 6 | 4496 |
8 | 9dnJSAkYTg | <NA> | 3581 | 5926 | 13 | N | 9 | 5544 |
9 | 9dnOboJYJp | <NA> | 4521 | 5243 | 13 | N | 6 | 4496 |
요구사항ID | 원장번호 | 현재책임자직원번호 | 현재담당자직원번호 | 전자결재상태코드 | 삭제여부 | 최종수정수 | 처리직원번호 | |
---|---|---|---|---|---|---|---|---|
490 | 9dl2HI1dRj | <NA> | 3553 | 4522 | 13 | N | 6 | 4496 |
491 | 9dl9bO11x1 | TOI201600515 | 4213 | 5636 | N | 1 | 5636 | |
492 | 9dl2ztoOLx | TBI202102278 | 4475 | 5411 | 13 | N | 8 | 5823 |
493 | 9dl1dQQ6Vh | TAV201500813 | 3418 | 6012 | 13 | N | 6 | 5823 |
494 | 9dl2P5bsbX | JPA202100053 | 4020 | 4020 | 13 | N | 7 | 5573 |
495 | 9dl1u0vVnJ | TOA201500599 | 4497 | 6109 | 12 | N | 4 | 5823 |
496 | 9dl2KXvBpv | <NA> | 4124 | 5445 | N | 1 | 5445 | |
497 | 9dl1oTQrCp | TIG202103577 | 3982 | 4710 | 13 | N | 7 | 5573 |
498 | 9dlTCiFEDl | <NA> | 4261 | 4767 | 13 | N | 8 | 5544 |
499 | 9dl1n3j1mw | <NA> | 3149 | 4794 | 13 | N | 6 | 5823 |