Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 500 |
Missing cells | 2000 |
Missing cells (%) | 50.0% |
Duplicate rows | 16 |
Duplicate rows (%) | 3.2% |
Total size in memory | 33.8 KiB |
Average record size in memory | 69.3 B |
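The percentages in the table above follow directly from the reported counts. A minimal sketch of that arithmetic, using only figures from this report (the variable names are illustrative and not part of the report):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 8
n_observations = 500
missing_cells = 2000
duplicate_rows = 16

total_cells = n_variables * n_observations               # all cells in the table
missing_pct = 100 * missing_cells / total_cells          # share of empty cells
duplicate_pct = 100 * duplicate_rows / n_observations    # share of duplicate rows

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```

The 2000 missing cells correspond to the four columns that are empty in all 500 rows.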
Variable types
Categorical | 2 |
---|---|
Unsupported | 4 |
Boolean | 1 |
Numeric | 1 |
Dataset
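Description | This file provides information on the system management common code master (시스템관리공통코드마스터) of the Korea Credit Guarantee Fund (신용보증기금); please refer to it when using the data. |
---|---|
Author | 신용보증기금 (Korea Credit Guarantee Fund) |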
URL | https://www.data.go.kr/data/15093315/fileData.do |
코드유형구분코드 has constant value "C" | Constant |
Dataset has 16 (3.2%) duplicate rows | Duplicates |
최종수정수 is highly overall correlated with 최초처리직원번호 | High correlation |
최초처리직원번호 is highly overall correlated with 최종수정수 | High correlation |
삭제여부 is highly imbalanced (74.9%) | Imbalance |
목록코드테이블명 has 500 (100.0%) missing values | Missing |
목록코드테이블논리명 has 500 (100.0%) missing values | Missing |
목록코드컬럼명 has 500 (100.0%) missing values | Missing |
목록코드컬럼논리명 has 500 (100.0%) missing values | Missing |
목록코드테이블명 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
목록코드테이블논리명 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
목록코드컬럼명 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
목록코드컬럼논리명 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
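The 74.9% imbalance figure for 삭제여부 is consistent with a score of one minus the normalized Shannon entropy of the class counts. The sketch below assumes that definition (it is inferred from the reported numbers, not stated in this report) and uses the counts 479 and 21 from the 삭제여부 value table; `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the base-2 Shannon entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is scored as 0 in this sketch
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Class counts for 삭제여부: 479 False, 21 True.
score = imbalance_score([479, 21])
print(f"{score:.1%}")
```

A perfectly balanced variable scores 0 under this definition, and the score approaches 1 as one class dominates.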
Reproduction
Analysis started | 2023-12-12 14:39:54.057272 |
---|---|
Analysis finished | 2023-12-12 14:39:54.604388 |
Duration | 0.55 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
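For readers who want to regenerate a report like this one, a minimal sketch with ydata-profiling follows. The CSV file name, the `cp949` encoding, and the report title are assumptions, since the report does not record them, and `build_report` is a hypothetical helper. The imports sit inside the function so the snippet can be loaded even where the libraries are not installed.

```python
def build_report(csv_path, output_html="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (path and encoding are assumptions)."""
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(output_html)
    return report

# Example call (hypothetical file name):
# build_report("system_common_code_master.csv")
```

Timings and memory figures will differ between runs and machines, so only the statistics themselves should be expected to match.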
코드유형구분코드
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
C | 500 |
---|---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | C |
---|---|
2nd row | C |
3rd row | C |
4th row | C |
5th row | C |
Common Values
Value | Count | Frequency (%) |
C | 500 | 100.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
c | 500 |
목록코드테이블명
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 500 |
---|---|
Missing (%) | 100.0% |
Memory size | 4.5 KiB |
목록코드테이블논리명
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 500 |
---|---|
Missing (%) | 100.0% |
Memory size | 4.5 KiB |
목록코드컬럼명
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 500 |
---|---|
Missing (%) | 100.0% |
Memory size | 4.5 KiB |
목록코드컬럼논리명
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 500 |
---|---|
Missing (%) | 100.0% |
Memory size | 4.5 KiB |
삭제여부
Boolean
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 632.0 B |
False | 479 |
---|---|
True | 21 |
Value | Count | Frequency (%) |
False | 479 | 95.8% |
True | 21 | 4.2% |
최종수정수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 13 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.736 |
Minimum | 1 |
---|---|
Maximum | 36 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 2 |
95-th percentile | 2 |
Maximum | 36 |
Range | 35 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 3.2260065 |
---|---|
Coefficient of variation (CV) | 1.8582987 |
Kurtosis | 76.5679 |
Mean | 1.736 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 8.5576571 |
Sum | 868 |
Variance | 10.407118 |
Monotonicity | Not monotonic |
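Several of the descriptive statistics above can be re-derived from the value counts listed for 최종수정수 in this report. The sketch below rebuilds the 500 values from those counts (the three values grouped under "Other values" are 10, 33 and 36, as the sorted value listings show) and uses the sample variance with an n - 1 denominator, which is an assumption that agrees with the reported figure.

```python
import statistics

# Value -> count, taken from the 최종수정수 frequency tables.
counts = {1: 335, 2: 144, 3: 6, 4: 4, 5: 3, 7: 1, 10: 1,
          24: 1, 25: 1, 30: 1, 31: 1, 33: 1, 36: 1}
values = [v for v, c in counts.items() for _ in range(c)]

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation
median = statistics.median(values)

print(len(values), total, mean)
print(round(variance, 6), round(std, 7), round(cv, 7), median)
```

Skewness and kurtosis are left out because their values depend on which bias correction is applied, and the report does not say which one was used.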
Value | Count | Frequency (%) |
1 | 335 | 67.0% |
2 | 144 | 28.8% |
3 | 6 | 1.2% |
4 | 4 | 0.8% |
5 | 3 | 0.6% |
7 | 1 | 0.2% |
24 | 1 | 0.2% |
25 | 1 | 0.2% |
31 | 1 | 0.2% |
30 | 1 | 0.2% |
Other values (3) | 3 | 0.6% |
Value | Count | Frequency (%) |
1 | 335 | 67.0% |
2 | 144 | 28.8% |
3 | 6 | 1.2% |
4 | 4 | 0.8% |
5 | 3 | 0.6% |
7 | 1 | 0.2% |
10 | 1 | 0.2% |
24 | 1 | 0.2% |
25 | 1 | 0.2% |
30 | 1 | 0.2% |
Value | Count | Frequency (%) |
36 | 1 | 0.2% |
33 | 1 | 0.2% |
31 | 1 | 0.2% |
30 | 1 | 0.2% |
25 | 1 | 0.2% |
24 | 1 | 0.2% |
10 | 1 | 0.2% |
7 | 1 | 0.2% |
5 | 3 | 0.6% |
4 | 4 | 0.8% |
최초처리직원번호
Categorical
HIGH CORRELATION
 
Distinct | 21 |
---|---|
Distinct (%) | 4.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
5099 | 251 |
---|---|
5220 | 128 |
6105 | 45 |
BATCH | 25 |
5803 | 9 |
Other values (16) | 42 |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 4.06 |
Min length | 4 |
Unique
Unique | 4 |
---|---|
Unique (%) | 0.8% |
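In this report, "Distinct" counts the different values present in a column, while "Unique" counts the values that occur exactly once. A small sketch of the difference, using only the ten first-row values of 최초처리직원번호 shown in the row listing of this report rather than the full column (so the results differ from the 21 and 4 reported for all 500 rows):

```python
from collections import Counter

# 최초처리직원번호 values from the first ten rows shown in the report.
sample = ["BATCH", "5423", "BATCH", "4444", "4444",
          "4444", "4444", "4444", "4444", "6009"]

freq = Counter(sample)
distinct = len(freq)                                # different values present
unique = sum(1 for n in freq.values() if n == 1)    # values seen exactly once

print(distinct, unique)
```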
Sample
1st row | BATCH |
---|---|
2nd row | 5423 |
3rd row | BATCH |
4th row | 4444 |
5th row | 4444 |
Common Values
Value | Count | Frequency (%) |
5099 | 251 | 50.2% |
5220 | 128 | 25.6% |
6105 | 45 | 9.0% |
BATCH | 25 | 5.0% |
5803 | 9 | 1.8% |
4444 | 6 | 1.2% |
5823 | 6 | 1.2% |
5423 | 4 | 0.8% |
6009 | 3 | 0.6% |
5222 | 3 | 0.6% |
Other values (11) | 20 | 4.0% |
Length
Value | Count | Frequency (%) |
5099 | 251 | 50.2% |
5220 | 128 | 25.6% |
6105 | 45 | 9.0% |
batch | 25 | 5.0% |
5803 | 9 | 1.8% |
4444 | 6 | 1.2% |
5823 | 6 | 1.2% |
5423 | 4 | 0.8% |
4509 | 3 | 0.6% |
exc41 | 3 | 0.6% |
Other values (11) | 20 | 4.0% |
Variable | 삭제여부 | 최종수정수 | 최초처리직원번호 |
---|---|---|---|
삭제여부 | 1.000 | 0.000 | 0.069 |
최종수정수 | 0.000 | 1.000 | 0.929 |
최초처리직원번호 | 0.069 | 0.929 | 1.000 |
Variable | 삭제여부 | 최초처리직원번호 |
---|---|---|
삭제여부 | 1.000 | 0.058 |
최초처리직원번호 | 0.058 | 1.000 |
Variable | 최종수정수 | 삭제여부 | 최초처리직원번호 |
---|---|---|---|
최종수정수 | 1.000 | 0.000 | 0.731 |
삭제여부 | 0.000 | 1.000 | 0.058 |
최초처리직원번호 | 0.731 | 0.058 | 1.000 |
Row index | 코드유형구분코드 | 목록코드테이블명 | 목록코드테이블논리명 | 목록코드컬럼명 | 목록코드컬럼논리명 | 삭제여부 | 최종수정수 | 최초처리직원번호 |
---|---|---|---|---|---|---|---|---|
0 | C | <NA> | <NA> | <NA> | <NA> | N | 2 | BATCH |
1 | C | <NA> | <NA> | <NA> | <NA> | N | 4 | 5423 |
2 | C | <NA> | <NA> | <NA> | <NA> | N | 2 | BATCH |
3 | C | <NA> | <NA> | <NA> | <NA> | N | 2 | 4444 |
4 | C | <NA> | <NA> | <NA> | <NA> | N | 2 | 4444 |
5 | C | <NA> | <NA> | <NA> | <NA> | N | 2 | 4444 |
6 | C | <NA> | <NA> | <NA> | <NA> | N | 2 | 4444 |
7 | C | <NA> | <NA> | <NA> | <NA> | N | 2 | 4444 |
8 | C | <NA> | <NA> | <NA> | <NA> | N | 2 | 4444 |
9 | C | <NA> | <NA> | <NA> | <NA> | N | 1 | 6009 |
Row index | 코드유형구분코드 | 목록코드테이블명 | 목록코드테이블논리명 | 목록코드컬럼명 | 목록코드컬럼논리명 | 삭제여부 | 최종수정수 | 최초처리직원번호 |
---|---|---|---|---|---|---|---|---|
490 | C | <NA> | <NA> | <NA> | <NA> | Y | 1 | 5099 |
491 | C | <NA> | <NA> | <NA> | <NA> | Y | 1 | 5099 |
492 | C | <NA> | <NA> | <NA> | <NA> | Y | 1 | 5099 |
493 | C | <NA> | <NA> | <NA> | <NA> | Y | 1 | 5099 |
494 | C | <NA> | <NA> | <NA> | <NA> | Y | 1 | 5099 |
495 | C | <NA> | <NA> | <NA> | <NA> | Y | 1 | 5099 |
496 | C | <NA> | <NA> | <NA> | <NA> | Y | 1 | 5099 |
497 | C | <NA> | <NA> | <NA> | <NA> | Y | 1 | 5099 |
498 | C | <NA> | <NA> | <NA> | <NA> | Y | 1 | 5099 |
499 | C | <NA> | <NA> | <NA> | <NA> | Y | 1 | 5099 |
Most frequently occurring
Index | 코드유형구분코드 | 삭제여부 | 최종수정수 | 최초처리직원번호 | # duplicates |
---|---|---|---|---|---|
1 | C | N | 1 | 5099 | 129 |
2 | C | N | 1 | 5220 | 126 |
10 | C | N | 2 | 5099 | 100 |
8 | C | N | 1 | 6105 | 39 |
13 | C | N | 2 | BATCH | 24 |
15 | C | Y | 1 | 5099 | 21 |
11 | C | N | 2 | 5803 | 7 |
9 | C | N | 2 | 4444 | 6 |
4 | C | N | 1 | 5823 | 5 |
12 | C | N | 2 | 6105 | 4 |