Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 408 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 43 |
Duplicate rows (%) | 10.5% |
Total size in memory | 23.6 KiB |
Average record size in memory | 59.3 B |
Variable types
Categorical | 4 |
---|---|
Boolean | 1 |
Numeric | 2 |
Dataset
Description | 해당 파일 데이터는 신용보증기금의 품질감리정보에 대해 확인하실 수 있는 자료이니 데이터 활용에 참고하여 주시기 바랍니다. |
---|---|
Author | 신용보증기금 |
URL | https://www.data.go.kr/data/15093007/fileData.do |
Dataset has 43 (10.5%) duplicate rows | Duplicates |
삭제여부 is highly overall correlated with 최종수정수 and 4 other fields | High correlation |
최초처리시각 is highly overall correlated with 처리직원번호 and 3 other fields | High correlation |
감리전자결재상태코드 is highly overall correlated with 처리직원번호 and 3 other fields | High correlation |
최초처리직원번호 is highly overall correlated with 처리직원번호 and 3 other fields | High correlation |
최종수정수 is highly overall correlated with 삭제여부 | High correlation |
처리직원번호 is highly overall correlated with 감리전자결재상태코드 and 3 other fields | High correlation |
감리전자결재상태코드 is highly imbalanced (88.2%) | Imbalance |
삭제여부 is highly imbalanced (84.7%) | Imbalance |
최초처리시각 is highly imbalanced (86.9%) | Imbalance |
최초처리직원번호 is highly imbalanced (86.4%) | Imbalance |
Reproduction
Analysis started | 2023-12-12 23:09:56.138770 |
---|---|
Analysis finished | 2023-12-12 23:09:57.136327 |
Duration | 1 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
감리구분코드
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
2 | |
---|---|
1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 275 | |
1 | 133 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 275 | |
1 | 133 |
감리전자결재상태코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
23 | |
---|---|
13 | 10 |
12 | 2 |
1 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.997549 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 23 |
---|---|
2nd row | 23 |
3rd row | 23 |
4th row | 23 |
5th row | 23 |
Common Values
Value | Count | Frequency (%) |
23 | 395 | |
13 | 10 | 2.5% |
12 | 2 | 0.5% |
1 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
23 | 395 | |
13 | 10 | 2.5% |
12 | 2 | 0.5% |
삭제여부
Boolean
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 540.0 B |
False | |
---|---|
True | 9 |
Value | Count | Frequency (%) |
False | 399 | |
True | 9 | 2.2% |
최종수정수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 6.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.5980392 |
Minimum | 1 |
---|---|
Maximum | 39 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.7 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 4 |
Q1 | 5 |
median | 6 |
Q3 | 8.25 |
95-th percentile | 17 |
Maximum | 39 |
Range | 38 |
Interquartile range (IQR) | 3.25 |
Descriptive statistics
Standard deviation | 4.4853237 |
---|---|
Coefficient of variation (CV) | 0.59032648 |
Kurtosis | 8.7535103 |
Mean | 7.5980392 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.4292669 |
Sum | 3100 |
Variance | 20.118129 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 156 | |
6 | 58 | 14.2% |
7 | 35 | 8.6% |
8 | 32 | 7.8% |
9 | 19 | 4.7% |
10 | 13 | 3.2% |
11 | 12 | 2.9% |
13 | 12 | 2.9% |
4 | 12 | 2.9% |
2 | 10 | 2.5% |
Other values (15) | 49 | 12.0% |
Value | Count | Frequency (%) |
1 | 3 | 0.7% |
2 | 10 | 2.5% |
4 | 12 | 2.9% |
5 | 156 | |
6 | 58 | 14.2% |
7 | 35 | 8.6% |
8 | 32 | 7.8% |
9 | 19 | 4.7% |
10 | 13 | 3.2% |
11 | 12 | 2.9% |
Value | Count | Frequency (%) |
39 | 1 | 0.2% |
30 | 1 | 0.2% |
28 | 1 | 0.2% |
24 | 1 | 0.2% |
22 | 1 | 0.2% |
21 | 1 | 0.2% |
20 | 4 | |
19 | 4 | |
18 | 6 | |
17 | 6 |
처리직원번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 3.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3869.277 |
Minimum | 2969 |
---|---|
Maximum | 5470 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.7 KiB |
Quantile statistics
Minimum | 2969 |
---|---|
5-th percentile | 3559 |
Q1 | 3559 |
median | 3559 |
Q3 | 4138 |
95-th percentile | 4597 |
Maximum | 5470 |
Range | 2501 |
Interquartile range (IQR) | 579 |
Descriptive statistics
Standard deviation | 436.73655 |
---|---|
Coefficient of variation (CV) | 0.11287291 |
Kurtosis | -0.29197203 |
Mean | 3869.277 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.96243005 |
Sum | 1578665 |
Variance | 190738.81 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3559 | 249 | |
4509 | 72 | 17.6% |
4042 | 50 | 12.3% |
4597 | 15 | 3.7% |
4875 | 5 | 1.2% |
4129 | 3 | 0.7% |
3513 | 3 | 0.7% |
4293 | 2 | 0.5% |
5470 | 2 | 0.5% |
4964 | 1 | 0.2% |
Other values (6) | 6 | 1.5% |
Value | Count | Frequency (%) |
2969 | 1 | 0.2% |
3513 | 3 | 0.7% |
3559 | 249 | |
4042 | 50 | 12.3% |
4129 | 3 | 0.7% |
4165 | 1 | 0.2% |
4172 | 1 | 0.2% |
4293 | 2 | 0.5% |
4436 | 1 | 0.2% |
4509 | 72 | 17.6% |
Value | Count | Frequency (%) |
5470 | 2 | 0.5% |
4964 | 1 | 0.2% |
4875 | 5 | 1.2% |
4632 | 1 | 0.2% |
4606 | 1 | 0.2% |
4597 | 15 | 3.7% |
4509 | 72 | |
4436 | 1 | 0.2% |
4293 | 2 | 0.5% |
4172 | 1 | 0.2% |
최초처리시각
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 26 |
---|---|
Distinct (%) | 6.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
0001-01-01 00:00:00.000000 | |
---|---|
32:00.2 | 1 |
07:27.9 | 1 |
14:53.8 | 1 |
38:57.6 | 1 |
Other values (21) | 21 |
Length
Max length | 26 |
---|---|
Median length | 26 |
Mean length | 24.835784 |
Min length | 7 |
Unique
Unique | 25 ? |
---|---|
Unique (%) | 6.1% |
Sample
1st row | 32:00.2 |
---|---|
2nd row | 19:09.9 |
3rd row | 07:27.9 |
4th row | 14:53.8 |
5th row | 38:57.6 |
Common Values
Value | Count | Frequency (%) |
0001-01-01 00:00:00.000000 | 383 | |
32:00.2 | 1 | 0.2% |
07:27.9 | 1 | 0.2% |
14:53.8 | 1 | 0.2% |
38:57.6 | 1 | 0.2% |
06:29.2 | 1 | 0.2% |
34:41.9 | 1 | 0.2% |
44:37.0 | 1 | 0.2% |
04:26.4 | 1 | 0.2% |
14:38.6 | 1 | 0.2% |
Other values (16) | 16 | 3.9% |
Length
Value | Count | Frequency (%) |
0001-01-01 | 383 | |
00:00:00.000000 | 383 | |
12:26.9 | 1 | 0.1% |
19:09.9 | 1 | 0.1% |
23:02.9 | 1 | 0.1% |
17:12.7 | 1 | 0.1% |
54:24.4 | 1 | 0.1% |
39:24.9 | 1 | 0.1% |
40:53.3 | 1 | 0.1% |
00:12.1 | 1 | 0.1% |
Other values (17) | 17 | 2.1% |
최초처리직원번호
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 17 |
---|---|
Distinct (%) | 4.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
BATCH | |
---|---|
4875 | 6 |
5470 | 3 |
4168 | 2 |
4444 | 2 |
Other values (12) | 12 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.9387255 |
Min length | 4 |
Unique
Unique | 12 ? |
---|---|
Unique (%) | 2.9% |
Sample
1st row | 6105 |
---|---|
2nd row | 5921 |
3rd row | 5873 |
4th row | 4432 |
5th row | 5107 |
Common Values
Value | Count | Frequency (%) |
BATCH | 383 | |
4875 | 6 | 1.5% |
5470 | 3 | 0.7% |
4168 | 2 | 0.5% |
4444 | 2 | 0.5% |
5921 | 1 | 0.2% |
5873 | 1 | 0.2% |
4432 | 1 | 0.2% |
5107 | 1 | 0.2% |
5573 | 1 | 0.2% |
Other values (7) | 7 | 1.7% |
Length
Value | Count | Frequency (%) |
batch | 383 | |
4875 | 6 | 1.5% |
5470 | 3 | 0.7% |
4168 | 2 | 0.5% |
4444 | 2 | 0.5% |
4964 | 1 | 0.2% |
6105 | 1 | 0.2% |
4064 | 1 | 0.2% |
4436 | 1 | 0.2% |
5314 | 1 | 0.2% |
Other values (7) | 7 | 1.7% |
감리구분코드 | 감리전자결재상태코드 | 삭제여부 | 최종수정수 | 처리직원번호 | 최초처리시각 | 최초처리직원번호 | |
---|---|---|---|---|---|---|---|
감리구분코드 | 1.000 | 0.000 | 0.020 | 0.134 | 0.334 | 0.000 | 0.000 |
감리전자결재상태코드 | 0.000 | 1.000 | 0.968 | 0.554 | 0.744 | 0.987 | 0.970 |
삭제여부 | 0.020 | 0.968 | 1.000 | 0.573 | 0.970 | 1.000 | 0.929 |
최종수정수 | 0.134 | 0.554 | 0.573 | 1.000 | 0.491 | 0.683 | 0.583 |
처리직원번호 | 0.334 | 0.744 | 0.970 | 0.491 | 1.000 | 0.899 | 0.849 |
최초처리시각 | 0.000 | 0.987 | 1.000 | 0.683 | 0.899 | 1.000 | 1.000 |
최초처리직원번호 | 0.000 | 0.970 | 0.929 | 0.583 | 0.849 | 1.000 | 1.000 |
삭제여부 | 최초처리시각 | 감리전자결재상태코드 | 감리구분코드 | 최초처리직원번호 | |
---|---|---|---|---|---|
삭제여부 | 1.000 | 0.970 | 0.836 | 0.012 | 0.890 |
최초처리시각 | 0.970 | 1.000 | 0.918 | 0.000 | 0.988 |
감리전자결재상태코드 | 0.836 | 0.918 | 1.000 | 0.000 | 0.902 |
감리구분코드 | 0.012 | 0.000 | 0.000 | 1.000 | 0.000 |
최초처리직원번호 | 0.890 | 0.988 | 0.902 | 0.000 | 1.000 |
최종수정수 | 처리직원번호 | 감리구분코드 | 감리전자결재상태코드 | 삭제여부 | 최초처리시각 | 최초처리직원번호 | |
---|---|---|---|---|---|---|---|
최종수정수 | 1.000 | 0.003 | 0.130 | 0.387 | 0.572 | 0.320 | 0.268 |
처리직원번호 | 0.003 | 1.000 | 0.244 | 0.577 | 0.839 | 0.597 | 0.544 |
감리구분코드 | 0.130 | 0.244 | 1.000 | 0.000 | 0.012 | 0.000 | 0.000 |
감리전자결재상태코드 | 0.387 | 0.577 | 0.000 | 1.000 | 0.836 | 0.918 | 0.902 |
삭제여부 | 0.572 | 0.839 | 0.012 | 0.836 | 1.000 | 0.970 | 0.890 |
최초처리시각 | 0.320 | 0.597 | 0.000 | 0.918 | 0.970 | 1.000 | 0.988 |
최초처리직원번호 | 0.268 | 0.544 | 0.000 | 0.902 | 0.890 | 0.988 | 1.000 |
감리구분코드 | 감리전자결재상태코드 | 삭제여부 | 최종수정수 | 처리직원번호 | 최초처리시각 | 최초처리직원번호 | |
---|---|---|---|---|---|---|---|
0 | 2 | 23 | N | 18 | 4597 | 32:00.2 | 6105 |
1 | 2 | 23 | N | 12 | 4597 | 19:09.9 | 5921 |
2 | 2 | 23 | N | 10 | 4597 | 07:27.9 | 5873 |
3 | 2 | 23 | N | 13 | 4597 | 14:53.8 | 4432 |
4 | 2 | 23 | N | 8 | 4597 | 38:57.6 | 5107 |
5 | 2 | 23 | N | 13 | 4597 | 06:29.2 | 5573 |
6 | 2 | 23 | N | 18 | 4597 | 34:41.9 | 5470 |
7 | 2 | 23 | N | 18 | 4597 | 44:37.0 | 4444 |
8 | 2 | 12 | N | 1 | 4964 | 04:26.4 | 4964 |
9 | 2 | 23 | N | 9 | 4597 | 14:38.6 | 4875 |
감리구분코드 | 감리전자결재상태코드 | 삭제여부 | 최종수정수 | 처리직원번호 | 최초처리시각 | 최초처리직원번호 | |
---|---|---|---|---|---|---|---|
398 | 1 | 23 | N | 5 | 3559 | 0001-01-01 00:00:00.000000 | BATCH |
399 | 2 | 23 | N | 8 | 3559 | 0001-01-01 00:00:00.000000 | BATCH |
400 | 1 | 23 | N | 5 | 3559 | 0001-01-01 00:00:00.000000 | BATCH |
401 | 2 | 23 | N | 5 | 4509 | 0001-01-01 00:00:00.000000 | BATCH |
402 | 2 | 23 | N | 5 | 3559 | 0001-01-01 00:00:00.000000 | BATCH |
403 | 1 | 23 | N | 5 | 4509 | 0001-01-01 00:00:00.000000 | BATCH |
404 | 2 | 23 | N | 6 | 3559 | 0001-01-01 00:00:00.000000 | BATCH |
405 | 1 | 23 | N | 7 | 3559 | 0001-01-01 00:00:00.000000 | BATCH |
406 | 2 | 23 | N | 15 | 3559 | 0001-01-01 00:00:00.000000 | BATCH |
407 | 2 | 23 | N | 5 | 3559 | 0001-01-01 00:00:00.000000 | BATCH |
Most frequently occurring
감리구분코드 | 감리전자결재상태코드 | 삭제여부 | 최종수정수 | 처리직원번호 | 최초처리시각 | 최초처리직원번호 | # duplicates | |
---|---|---|---|---|---|---|---|---|
17 | 2 | 23 | N | 5 | 3559 | 0001-01-01 00:00:00.000000 | BATCH | 60 |
0 | 1 | 23 | N | 5 | 3559 | 0001-01-01 00:00:00.000000 | BATCH | 42 |
19 | 2 | 23 | N | 5 | 4509 | 0001-01-01 00:00:00.000000 | BATCH | 24 |
2 | 1 | 23 | N | 6 | 3559 | 0001-01-01 00:00:00.000000 | BATCH | 19 |
20 | 2 | 23 | N | 6 | 3559 | 0001-01-01 00:00:00.000000 | BATCH | 19 |
18 | 2 | 23 | N | 5 | 4042 | 0001-01-01 00:00:00.000000 | BATCH | 17 |
23 | 2 | 23 | N | 7 | 3559 | 0001-01-01 00:00:00.000000 | BATCH | 16 |
1 | 1 | 23 | N | 5 | 4509 | 0001-01-01 00:00:00.000000 | BATCH | 13 |
25 | 2 | 23 | N | 8 | 3559 | 0001-01-01 00:00:00.000000 | BATCH | 13 |
6 | 1 | 23 | N | 8 | 3559 | 0001-01-01 00:00:00.000000 | BATCH | 9 |