Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 500 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 26.0 KiB |
Average record size in memory | 53.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 2 |
Boolean | 1 |
Dataset
Description | This file contains information on the common electronic document form programs of the Korea Credit Guarantee Fund (신용보증기금). Please refer to it when using the data. |
---|---|
Author | 신용보증기금 |
URL | https://www.data.go.kr/data/15093169/fileData.do |
컬럼상위레벨값 has constant value "0" | Constant |
전자문서서식프로그램 is highly overall correlated with 최종수정수 and 2 other fields | High correlation |
최종수정수 is highly overall correlated with 전자문서서식프로그램 and 1 other field | High correlation |
처리직원번호 is highly overall correlated with 전자문서서식프로그램 and 1 other field | High correlation |
컬럼레벨값 is highly overall correlated with 전자문서서식프로그램 | High correlation |
컬럼레벨값 is highly imbalanced (63.4%) | Imbalance |
삭제여부 is highly imbalanced (59.1%) | Imbalance |
전자문서서식프로그램 has unique values | Unique |
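The two imbalance percentages above can be checked by hand. The sketch below assumes the score is one minus the Shannon entropy of the class shares, normalized by the log of the number of classes. That formula is an assumption about how the profiler computes it, but it reproduces both reported figures from the counts listed in the per-variable sections (465/35 for 컬럼레벨값, 459/41 for 삭제여부). The function name `imbalance_score` is illustrative only.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (bits) of the class shares.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # Assumption: a single-class column is treated as not imbalanced here.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


level_score = imbalance_score([465, 35])    # 컬럼레벨값: counts for "1" and "0"
deleted_score = imbalance_score([459, 41])  # 삭제여부: counts for False and True

print(f"컬럼레벨값: {level_score:.1%}")
print(f"삭제여부: {deleted_score:.1%}")
```

Because the score depends only on class shares, a 93/7 split and a 92/8 split land close together even though the columns have different types.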
Reproduction
Analysis started | 2023-12-12 12:53:55.910586 |
---|---|
Analysis finished | 2023-12-12 12:53:57.181419 |
Duration | 1.27 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
전자문서서식프로그램
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 500 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 651.222 |
Minimum | 379 |
---|---|
Maximum | 908 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 379 |
---|---|
5-th percentile | 425.95 |
Q1 | 525.75 |
median | 650.5 |
Q3 | 775.25 |
95-th percentile | 883.05 |
Maximum | 908 |
Range | 529 |
Interquartile range (IQR) | 249.5 |
Descriptive statistics
Standard deviation | 145.83184 |
---|---|
Coefficient of variation (CV) | 0.22393568 |
Kurtosis | -1.1715046 |
Mean | 651.222 |
Median Absolute Deviation (MAD) | 125 |
Skewness | 0.021354357 |
Sum | 325611 |
Variance | 21266.927 |
Monotonicity | Not monotonic |
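Several of the figures above are derived from others, so they can be cross-checked with simple arithmetic. The sketch below copies the reported values for 전자문서서식프로그램 and recomputes the sum, range, IQR, coefficient of variation, and variance. The variable names are illustrative, and small differences in the last digits are expected because the inputs are already rounded.

```python
# Values copied from the report for 전자문서서식프로그램.
n = 500
mean, std = 651.222, 145.83184
minimum, maximum = 379, 908
q1, q3 = 525.75, 775.25

derived = {
    "sum": mean * n,             # reported: 325611
    "range": maximum - minimum,  # reported: 529
    "iqr": q3 - q1,              # reported: 249.5
    "cv": std / mean,            # reported: 0.22393568
    "variance": std ** 2,        # reported: 21266.927
}

for name, value in derived.items():
    print(f"{name}: {value:.8g}")
```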
Value | Count | Frequency (%) |
867 | 1 | 0.2% |
572 | 1 | 0.2% |
559 | 1 | 0.2% |
560 | 1 | 0.2% |
561 | 1 | 0.2% |
562 | 1 | 0.2% |
553 | 1 | 0.2% |
564 | 1 | 0.2% |
565 | 1 | 0.2% |
566 | 1 | 0.2% |
Other values (490) | 490 | 98.0% |
Value | Count | Frequency (%) |
379 | 1 | 0.2% |
393 | 1 | 0.2% |
403 | 1 | 0.2% |
404 | 1 | 0.2% |
405 | 1 | 0.2% |
406 | 1 | 0.2% |
407 | 1 | 0.2% |
408 | 1 | 0.2% |
409 | 1 | 0.2% |
410 | 1 | 0.2% |
Value | Count | Frequency (%) |
908 | 1 | 0.2% |
907 | 1 | 0.2% |
906 | 1 | 0.2% |
905 | 1 | 0.2% |
904 | 1 | 0.2% |
903 | 1 | 0.2% |
902 | 1 | 0.2% |
901 | 1 | 0.2% |
900 | 1 | 0.2% |
899 | 1 | 0.2% |
컬럼레벨값
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
1 | 465 |
---|---|
0 | 35 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 465 | 93.0% |
0 | 35 | 7.0% |
컬럼상위레벨값
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
0 | 500 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 500 | 100.0% |
삭제여부
Boolean
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 632.0 B |
False | 459 |
---|---|
True | 41 |
Value | Count | Frequency (%) |
False | 459 | 91.8% |
True | 41 | 8.2% |
최종수정수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 13 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.788 |
Minimum | 1 |
---|---|
Maximum | 13 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 2 |
Q3 | 8 |
95-th percentile | 12 |
Maximum | 13 |
Range | 12 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 4.1022815 |
---|---|
Coefficient of variation (CV) | 0.85678394 |
Kurtosis | -0.93677587 |
Mean | 4.788 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.84152022 |
Sum | 2394 |
Variance | 16.828713 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 175 | 35.0% |
1 | 89 | 17.8% |
12 | 86 | 17.2% |
8 | 49 | 9.8% |
3 | 32 | 6.4% |
4 | 31 | 6.2% |
10 | 15 | 3.0% |
6 | 12 | 2.4% |
5 | 5 | 1.0% |
13 | 2 | 0.4% |
Other values (3) | 4 | 0.8% |
Value | Count | Frequency (%) |
1 | 89 | 17.8% |
2 | 175 | 35.0% |
3 | 32 | 6.4% |
4 | 31 | 6.2% |
5 | 5 | 1.0% |
6 | 12 | 2.4% |
7 | 1 | 0.2% |
8 | 49 | 9.8% |
9 | 1 | 0.2% |
10 | 15 | 3.0% |
Value | Count | Frequency (%) |
13 | 2 | 0.4% |
12 | 86 | 17.2% |
11 | 2 | 0.4% |
10 | 15 | 3.0% |
9 | 1 | 0.2% |
8 | 49 | 9.8% |
7 | 1 | 0.2% |
6 | 12 | 2.4% |
5 | 5 | 1.0% |
4 | 31 | 6.2% |
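Because 최종수정수 has only 13 distinct values, its summary statistics can be rebuilt from the frequency tables alone. The sketch below combines the common-values table with the extreme-value tables (which supply the counts for 7, 9, and 11) and recomputes the sum, mean, median, and variance. Using the sample variance (divisor n - 1) is an assumption about the profiler's convention. The names `counts`, `values`, and `summary` are illustrative.

```python
import statistics

# value -> count for 최종수정수, taken from the frequency tables above.
counts = {1: 89, 2: 175, 3: 32, 4: 31, 5: 5, 6: 12, 7: 1,
          8: 49, 9: 1, 10: 15, 11: 2, 12: 86, 13: 2}

# Expand the table back into one entry per observation, in ascending order.
values = [value for value, count in sorted(counts.items()) for _ in range(count)]

summary = {
    "n": len(values),
    "sum": sum(values),
    "mean": statistics.fmean(values),
    "median": statistics.median(values),
    "variance": statistics.variance(values),  # sample variance, divisor n - 1
}

# Share of each value in percent, rounded the way the report displays it.
shares = {value: round(100 * count / len(values), 1) for value, count in counts.items()}

for name, value in summary.items():
    print(f"{name}: {value}")
```

The same expansion works for 처리직원번호, which has only six distinct values.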
처리직원번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5002.588 |
Minimum | 4917 |
---|---|
Maximum | 5536 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 4917 |
---|---|
5-th percentile | 4917 |
Q1 | 4917 |
median | 4917 |
Q3 | 4917 |
95-th percentile | 5536 |
Maximum | 5536 |
Range | 619 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 183.91641 |
---|---|
Coefficient of variation (CV) | 0.036764252 |
Kurtosis | 3.4295915 |
Mean | 5002.588 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.1714267 |
Sum | 2501294 |
Variance | 33825.245 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4917 | 385 | 77.0% |
5536 | 44 | 8.8% |
5093 | 41 | 8.2% |
5222 | 18 | 3.6% |
5176 | 11 | 2.2% |
4920 | 1 | 0.2% |
Value | Count | Frequency (%) |
4917 | 385 | 77.0% |
4920 | 1 | 0.2% |
5093 | 41 | 8.2% |
5176 | 11 | 2.2% |
5222 | 18 | 3.6% |
5536 | 44 | 8.8% |
Value | Count | Frequency (%) |
5536 | 44 | 8.8% |
5222 | 18 | 3.6% |
5176 | 11 | 2.2% |
5093 | 41 | 8.2% |
4920 | 1 | 0.2% |
4917 | 385 | 77.0% |
 | 전자문서서식프로그램 | 컬럼레벨값 | 삭제여부 | 최종수정수 | 처리직원번호 |
---|---|---|---|---|---|
전자문서서식프로그램 | 1.000 | 0.697 | 0.465 | 0.897 | 0.607 |
컬럼레벨값 | 0.697 | 1.000 | 0.080 | 0.204 | 0.635 |
삭제여부 | 0.465 | 0.080 | 1.000 | 0.449 | 0.000 |
최종수정수 | 0.897 | 0.204 | 0.449 | 1.000 | 0.125 |
처리직원번호 | 0.607 | 0.635 | 0.000 | 0.125 | 1.000 |
 | 삭제여부 | 컬럼레벨값 |
---|---|---|
삭제여부 | 1.000 | 0.051 |
컬럼레벨값 | 0.051 | 1.000 |
 | 전자문서서식프로그램 | 최종수정수 | 처리직원번호 | 컬럼레벨값 | 삭제여부 |
---|---|---|---|---|---|
전자문서서식프로그램 | 1.000 | -0.910 | 0.702 | 0.539 | 0.345 |
최종수정수 | -0.910 | 1.000 | -0.599 | 0.155 | 0.345 |
처리직원번호 | 0.702 | -0.599 | 1.000 | 0.055 | 0.105 |
컬럼레벨값 | 0.539 | 0.155 | 0.055 | 1.000 | 0.051 |
삭제여부 | 0.345 | 0.345 | 0.105 | 0.051 | 1.000 |
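The high-correlation alerts in the overview can be traced back to the matrix directly above. The sketch below flags every pair whose absolute coefficient is at least 0.5. The 0.5 cutoff is an assumption, not something the report states, but with it the flagged partners match the alert text for each variable. The helper `strongly_related` is illustrative.

```python
THRESHOLD = 0.5  # assumed cutoff; the report does not state the value used

names = ["전자문서서식프로그램", "최종수정수", "처리직원번호", "컬럼레벨값", "삭제여부"]
matrix = [
    [1.000, -0.910, 0.702, 0.539, 0.345],
    [-0.910, 1.000, -0.599, 0.155, 0.345],
    [0.702, -0.599, 1.000, 0.055, 0.105],
    [0.539, 0.155, 0.055, 1.000, 0.051],
    [0.345, 0.345, 0.105, 0.051, 1.000],
]


def strongly_related(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    related = {}
    for i, name in enumerate(names):
        related[name] = [
            names[j]
            for j, coefficient in enumerate(matrix[i])
            if j != i and abs(coefficient) >= threshold
        ]
    return related


flagged = strongly_related(names, matrix, THRESHOLD)
for name, partners in flagged.items():
    print(name, "->", ", ".join(partners) if partners else "(none)")
```

Taking the absolute value matters here: the strongest relationship in the matrix, between 전자문서서식프로그램 and 최종수정수, is negative (-0.910).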
 | 전자문서서식프로그램 | 컬럼레벨값 | 컬럼상위레벨값 | 삭제여부 | 최종수정수 | 처리직원번호 |
---|---|---|---|---|---|---|
0 | 867 | 1 | 0 | N | 3 | 5536 |
1 | 812 | 1 | 0 | N | 2 | 5536 |
2 | 811 | 1 | 0 | N | 2 | 5536 |
3 | 908 | 1 | 0 | N | 2 | 5536 |
4 | 907 | 1 | 0 | N | 1 | 5536 |
5 | 906 | 1 | 0 | N | 3 | 5536 |
6 | 903 | 1 | 0 | N | 3 | 5536 |
7 | 905 | 1 | 0 | N | 3 | 5536 |
8 | 904 | 1 | 0 | N | 3 | 5536 |
9 | 902 | 1 | 0 | N | 1 | 5536 |
 | 전자문서서식프로그램 | 컬럼레벨값 | 컬럼상위레벨값 | 삭제여부 | 최종수정수 | 처리직원번호 |
---|---|---|---|---|---|---|
490 | 412 | 1 | 0 | N | 12 | 4917 |
491 | 411 | 1 | 0 | N | 12 | 4917 |
492 | 410 | 1 | 0 | N | 12 | 4917 |
493 | 409 | 1 | 0 | N | 12 | 4917 |
494 | 408 | 1 | 0 | N | 12 | 4917 |
495 | 407 | 1 | 0 | N | 12 | 4917 |
496 | 406 | 1 | 0 | N | 12 | 4917 |
497 | 405 | 1 | 0 | N | 12 | 4917 |
498 | 404 | 1 | 0 | N | 12 | 4917 |
499 | 393 | 1 | 0 | N | 12 | 4917 |