Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 500 |
Missing cells | 479 |
Missing cells (%) | 10.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 37.2 KiB |
Average record size in memory | 76.3 B |
Variable types
Categorical | 6 |
---|---|
Text | 2 |
Numeric | 1 |
Dataset
Description | 해당 파일 데이터는 신용보증기금의 보증상담기업 신용정보 상세정보를 확인하실 수 있는 자료이니 데이터 활용에 참고하여 주시기 바랍니다. |
---|---|
Author | 신용보증기금 |
URL | https://www.data.go.kr/data/15093227/fileData.do |
이력일련번호 has constant value "" | Constant |
최종수정수 has constant value "" | Constant |
상담기업개요ID is highly overall correlated with 처리직원번호 | High correlation |
처리직원번호 is highly overall correlated with 상담기업개요ID | High correlation |
여부항목여부 is highly imbalanced (78.3%) | Imbalance |
일자항목일자 is highly imbalanced (89.4%) | Imbalance |
문자항목명 has 479 (95.8%) missing values | Missing |
숫자항목값 is highly skewed (γ1 = 20.6393162) | Skewed |
숫자항목값 has 493 (98.6%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 03:21:08.658730 |
---|---|
Analysis finished | 2023-12-12 03:21:10.082794 |
Duration | 1.42 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
상담기업개요ID
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
9bLqoOA34p | |
---|---|
9bLqlSJQvi | |
9bLqkoCoJ6 | |
9bLqjgGybW | |
9bLqhQ42Cv |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 9bLqoOA34p |
---|---|
2nd row | 9bLqoOA34p |
3rd row | 9bLqoOA34p |
4th row | 9bLqoOA34p |
5th row | 9bLqoOA34p |
Common Values
Value | Count | Frequency (%) |
9bLqoOA34p | 113 | |
9bLqlSJQvi | 113 | |
9bLqkoCoJ6 | 113 | |
9bLqjgGybW | 113 | |
9bLqhQ42Cv | 48 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
9blqooa34p | 113 | |
9blqlsjqvi | 113 | |
9blqkocoj6 | 113 | |
9blqjggybw | 113 | |
9blqhq42cv | 48 |
상담신용정보항목코드
Text
Distinct | 113 |
---|---|
Distinct (%) | 22.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
ca001yn | 5 | 1.0% |
cd005ch | 5 | 1.0% |
cb008dc | 5 | 1.0% |
cb005ch | 5 | 1.0% |
cb009ch | 5 | 1.0% |
cc001yn | 5 | 1.0% |
cc002ch | 5 | 1.0% |
cc004ch | 5 | 1.0% |
cc005ch | 5 | 1.0% |
cc006ch | 5 | 1.0% |
Other values (103) | 450 |
Most occurring characters
Value | Count | Frequency (%) |
C | 953 | |
0 | 838 | |
H | 288 | 8.2% |
D | 265 | 7.6% |
1 | 242 | 6.9% |
A | 95 | 2.7% |
T | 90 | 2.6% |
F | 84 | 2.4% |
E | 84 | 2.4% |
2 | 79 | 2.3% |
Other values (11) | 482 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 2000 | |
Decimal Number | 1500 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
C | 953 | |
H | 288 | 14.4% |
D | 265 | 13.2% |
A | 95 | 4.8% |
T | 90 | 4.5% |
F | 84 | 4.2% |
E | 84 | 4.2% |
B | 45 | 2.2% |
N | 32 | 1.6% |
Y | 32 | 1.6% |
Decimal Number
Value | Count | Frequency (%) |
0 | 838 | |
1 | 242 | 16.1% |
2 | 79 | 5.3% |
3 | 58 | 3.9% |
5 | 54 | 3.6% |
4 | 54 | 3.6% |
6 | 48 | 3.2% |
7 | 48 | 3.2% |
9 | 40 | 2.7% |
8 | 39 | 2.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2000 | |
Common | 1500 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
C | 953 | |
H | 288 | 14.4% |
D | 265 | 13.2% |
A | 95 | 4.8% |
T | 90 | 4.5% |
F | 84 | 4.2% |
E | 84 | 4.2% |
B | 45 | 2.2% |
N | 32 | 1.6% |
Y | 32 | 1.6% |
Common
Value | Count | Frequency (%) |
0 | 838 | |
1 | 242 | 16.1% |
2 | 79 | 5.3% |
3 | 58 | 3.9% |
5 | 54 | 3.6% |
4 | 54 | 3.6% |
6 | 48 | 3.2% |
7 | 48 | 3.2% |
9 | 40 | 2.7% |
8 | 39 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3500 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
C | 953 | |
0 | 838 | |
H | 288 | 8.2% |
D | 265 | 7.6% |
1 | 242 | 6.9% |
A | 95 | 2.7% |
T | 90 | 2.6% |
F | 84 | 2.4% |
E | 84 | 2.4% |
2 | 79 | 2.3% |
Other values (11) | 482 |
이력일련번호
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 500 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 500 |
여부항목여부
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
N | 24 |
---|---|
Y | 4 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | N |
---|---|
2nd row | |
3rd row | |
4th row | |
5th row |
Common Values
Value | Count | Frequency (%) |
472 | ||
N | 24 | 4.8% |
Y | 4 | 0.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
n | 24 | |
y | 4 | 14.3% |
문자항목명
Text
MISSING
 
Distinct | 18 |
---|---|
Distinct (%) | 85.7% |
Missing | 479 |
Missing (%) | 95.8% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
6.60e+12 | 2 | 9.5% |
cbr-2 | 2 | 9.5% |
2 | 2 | 9.5% |
92 | 1 | 4.8% |
4 | 1 | 4.8% |
2010-10-15-19.20.22.129735 | 1 | 4.8% |
thy | 1 | 4.8% |
박홍준 | 1 | 4.8% |
1 | 1 | 4.8% |
51 | 1 | 4.8% |
Other values (8) | 8 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 13 | 11.5% |
1 | 10 | 8.8% |
9 | 9 | 8.0% |
- | 7 | 6.2% |
0 | 6 | 5.3% |
B | 6 | 5.3% |
. | 5 | 4.4% |
C | 5 | 4.4% |
R | 4 | 3.5% |
6 | 4 | 3.5% |
Other values (31) | 44 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 50 | |
Uppercase Letter | 31 | |
Lowercase Letter | 10 | 8.8% |
Other Letter | 8 | 7.1% |
Dash Punctuation | 7 | 6.2% |
Other Punctuation | 5 | 4.4% |
Math Symbol | 2 | 1.8% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
B | 6 | |
C | 5 | |
R | 4 | |
E | 4 | |
H | 2 | 6.5% |
Y | 2 | 6.5% |
T | 2 | 6.5% |
S | 1 | 3.2% |
U | 1 | 3.2% |
M | 1 | 3.2% |
Other values (3) | 3 |
Decimal Number
Value | Count | Frequency (%) |
2 | 13 | |
1 | 10 | |
9 | 9 | |
0 | 6 | |
6 | 4 | 8.0% |
5 | 3 | 6.0% |
3 | 2 | 4.0% |
4 | 2 | 4.0% |
7 | 1 | 2.0% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 3 | |
o | 1 | 10.0% |
j | 1 | 10.0% |
s | 1 | 10.0% |
p | 1 | 10.0% |
g | 1 | 10.0% |
q | 1 | 10.0% |
l | 1 | 10.0% |
Other Letter
Value | Count | Frequency (%) |
명 | 1 | |
임 | 1 | |
준 | 1 | |
권 | 1 | |
박 | 1 | |
홍 | 1 | |
감 | 1 | |
사 | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 7 |
Other Punctuation
Value | Count | Frequency (%) |
. | 5 |
Math Symbol
Value | Count | Frequency (%) |
+ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 64 | |
Latin | 41 | |
Hangul | 8 | 7.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
B | 6 | |
C | 5 | |
R | 4 | 9.8% |
E | 4 | 9.8% |
b | 3 | 7.3% |
H | 2 | 4.9% |
Y | 2 | 4.9% |
T | 2 | 4.9% |
o | 1 | 2.4% |
S | 1 | 2.4% |
Other values (11) | 11 |
Common
Value | Count | Frequency (%) |
2 | 13 | |
1 | 10 | |
9 | 9 | |
- | 7 | |
0 | 6 | |
. | 5 | 7.8% |
6 | 4 | 6.2% |
5 | 3 | 4.7% |
+ | 2 | 3.1% |
3 | 2 | 3.1% |
Other values (2) | 3 | 4.7% |
Hangul
Value | Count | Frequency (%) |
명 | 1 | |
임 | 1 | |
준 | 1 | |
권 | 1 | |
박 | 1 | |
홍 | 1 | |
감 | 1 | |
사 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 105 | |
Hangul | 8 | 7.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 13 | 12.4% |
1 | 10 | 9.5% |
9 | 9 | 8.6% |
- | 7 | 6.7% |
0 | 6 | 5.7% |
B | 6 | 5.7% |
. | 5 | 4.8% |
C | 5 | 4.8% |
R | 4 | 3.8% |
6 | 4 | 3.8% |
Other values (23) | 36 |
Hangul
Value | Count | Frequency (%) |
명 | 1 | |
임 | 1 | |
준 | 1 | |
권 | 1 | |
박 | 1 | |
홍 | 1 | |
감 | 1 | |
사 | 1 |
숫자항목값
Real number (ℝ)
SKEWED
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 724613.87 |
Minimum | 0 |
---|---|
Maximum | 2.88 × 108 |
Zeros | 493 |
Zeros (%) | 98.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 2.88 × 108 |
Range | 2.88 × 108 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 13295093 |
---|---|
Coefficient of variation (CV) | 18.347831 |
Kurtosis | 441.01461 |
Mean | 724613.87 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 20.639316 |
Sum | 3.6230693 × 108 |
Variance | 1.7675949 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 493 | |
1 | 3 | 0.6% |
74306903 | 1 | 0.2% |
288000000 | 1 | 0.2% |
26 | 1 | 0.2% |
2 | 1 | 0.2% |
Value | Count | Frequency (%) |
0 | 493 | |
1 | 3 | 0.6% |
2 | 1 | 0.2% |
26 | 1 | 0.2% |
74306903 | 1 | 0.2% |
288000000 | 1 | 0.2% |
Value | Count | Frequency (%) |
288000000 | 1 | 0.2% |
74306903 | 1 | 0.2% |
26 | 1 | 0.2% |
2 | 1 | 0.2% |
1 | 3 | 0.6% |
0 | 493 |
일자항목일자
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
0001-01-01 00:00:00.000000 | |
---|---|
00:00.0 | 7 |
Length
Max length | 26 |
---|---|
Median length | 26 |
Mean length | 25.734 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0001-01-01 00:00:00.000000 |
---|---|
2nd row | 00:00.0 |
3rd row | 0001-01-01 00:00:00.000000 |
4th row | 0001-01-01 00:00:00.000000 |
5th row | 0001-01-01 00:00:00.000000 |
Common Values
Value | Count | Frequency (%) |
0001-01-01 00:00:00.000000 | 493 | |
00:00.0 | 7 | 1.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0001-01-01 | 493 | |
00:00:00.000000 | 493 | |
00:00.0 | 7 | 0.7% |
최종수정수
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 500 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 500 |
처리직원번호
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
94211 | |
---|---|
3491 | |
3471 | |
3452 |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 4.322 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 3491 |
---|---|
2nd row | 3491 |
3rd row | 3491 |
4th row | 3491 |
5th row | 3491 |
Common Values
Value | Count | Frequency (%) |
94211 | 161 | |
3491 | 113 | |
3471 | 113 | |
3452 | 113 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
94211 | 161 | |
3491 | 113 | |
3471 | 113 | |
3452 | 113 |
상담기업개요ID | 여부항목여부 | 문자항목명 | 숫자항목값 | 일자항목일자 | 처리직원번호 | |
---|---|---|---|---|---|---|
상담기업개요ID | 1.000 | 0.032 | 0.780 | 0.000 | 0.000 | 1.000 |
여부항목여부 | 0.032 | 1.000 | NaN | 0.000 | 0.000 | 0.053 |
문자항목명 | 0.780 | NaN | 1.000 | NaN | NaN | 0.880 |
숫자항목값 | 0.000 | 0.000 | NaN | 1.000 | 0.000 | 0.031 |
일자항목일자 | 0.000 | 0.000 | NaN | 0.000 | 1.000 | 0.013 |
처리직원번호 | 1.000 | 0.053 | 0.880 | 0.031 | 0.013 | 1.000 |
여부항목여부 | 상담기업개요ID | 처리직원번호 | 일자항목일자 | |
---|---|---|---|---|
여부항목여부 | 1.000 | 0.023 | 0.050 | 0.000 |
상담기업개요ID | 0.023 | 1.000 | 0.999 | 0.000 |
처리직원번호 | 0.050 | 0.999 | 1.000 | 0.008 |
일자항목일자 | 0.000 | 0.000 | 0.008 | 1.000 |
숫자항목값 | 상담기업개요ID | 여부항목여부 | 일자항목일자 | 처리직원번호 | |
---|---|---|---|---|---|
숫자항목값 | 1.000 | 0.000 | 0.000 | 0.000 | 0.029 |
상담기업개요ID | 0.000 | 1.000 | 0.023 | 0.000 | 0.999 |
여부항목여부 | 0.000 | 0.023 | 1.000 | 0.000 | 0.050 |
일자항목일자 | 0.000 | 0.000 | 0.000 | 1.000 | 0.008 |
처리직원번호 | 0.029 | 0.999 | 0.050 | 0.008 | 1.000 |
상담기업개요ID | 상담신용정보항목코드 | 이력일련번호 | 여부항목여부 | 문자항목명 | 숫자항목값 | 일자항목일자 | 최종수정수 | 처리직원번호 | |
---|---|---|---|---|---|---|---|---|---|
0 | 9bLqoOA34p | CA001YN | 1 | N | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 3491 |
1 | 9bLqoOA34p | CH007DT | 1 | <NA> | 0 | 00:00.0 | 1 | 3491 | |
2 | 9bLqoOA34p | CH006DC | 1 | <NA> | 74306903 | 0001-01-01 00:00:00.000000 | 1 | 3491 | |
3 | 9bLqoOA34p | CH005DC | 1 | <NA> | 1 | 0001-01-01 00:00:00.000000 | 1 | 3491 | |
4 | 9bLqoOA34p | CH004DT | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 3491 | |
5 | 9bLqoOA34p | CH003DC | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 3491 | |
6 | 9bLqoOA34p | CH002DC | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 3491 | |
7 | 9bLqoOA34p | CH001DC | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 3491 | |
8 | 9bLqoOA34p | CG008DC | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 3491 | |
9 | 9bLqoOA34p | CG007DC | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 3491 |
상담기업개요ID | 상담신용정보항목코드 | 이력일련번호 | 여부항목여부 | 문자항목명 | 숫자항목값 | 일자항목일자 | 최종수정수 | 처리직원번호 | |
---|---|---|---|---|---|---|---|---|---|
490 | 9bLqhQ42Cv | CA012CH | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 94211 | |
491 | 9bLqhQ42Cv | CA011DT | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 94211 | |
492 | 9bLqhQ42Cv | CA010CH | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 94211 | |
493 | 9bLqhQ42Cv | CA009DT | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 94211 | |
494 | 9bLqhQ42Cv | CA007DC | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 94211 | |
495 | 9bLqhQ42Cv | CA006CH | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 94211 | |
496 | 9bLqhQ42Cv | CA005DT | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 94211 | |
497 | 9bLqhQ42Cv | CA004CH | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 94211 | |
498 | 9bLqhQ42Cv | CA003DT | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 94211 | |
499 | 9bLqhQ42Cv | CA002CH | 1 | <NA> | 0 | 0001-01-01 00:00:00.000000 | 1 | 94211 |