Overview

Dataset statistics

Number of variables12
Number of observations500
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory49.4 KiB
Average record size in memory101.3 B

Variable types

Text1
Categorical10
Boolean1

Dataset

Description해당 파일 데이터는 신용보증기금의 제1차 비용 집계 내역을 확인하실 수 있는 자료이오니 활용에 참고하여 주시기 바랍니다.
Author신용보증기금
URLhttps://www.data.go.kr/data/15092680/fileData.do

Alerts

회계년월 has constant value ""Constant
회계구분코드 has constant value ""Constant
소관내부거래금액 has constant value ""Constant
회계내부거래금액 has constant value ""Constant
삭제여부 has constant value ""Constant
최종수정수 has constant value ""Constant
처리직원번호 has constant value ""Constant
최초처리직원번호 has constant value ""Constant
국가단위사업코드 is highly overall correlated with 원가분류코드 and 1 other fieldsHigh correlation
원가분류코드 is highly overall correlated with 프로그램코드 and 1 other fieldsHigh correlation
프로그램코드 is highly overall correlated with 원가분류코드 and 1 other fieldsHigh correlation
원가분류코드 is highly imbalanced (65.4%)Imbalance
프로그램코드 is highly imbalanced (63.1%)Imbalance
국가단위사업코드 is highly imbalanced (63.1%)Imbalance
제1차비용집계내역ID has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:38:41.255375
Analysis finished2023-12-12 07:38:41.882401
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct500
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-12T16:38:42.379262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters5000
Distinct characters62
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique500 ?
Unique (%)100.0%

Sample

1st row9dnSMTbQ96
2nd row9dnSMTbQ4X
3rd row9dnSMTbQZc
4th row9dnSMTbQJp
5th row9dnSMTbQEw
ValueCountFrequency (%)
9dnsmtaygn 2
 
0.4%
9dnsmtbcj4 1
 
0.2%
9dnsmtbaph 1
 
0.2%
9dnsmtba0v 1
 
0.2%
9dnsmtba6j 1
 
0.2%
9dnsmtbbdc 1
 
0.2%
9dnsmtbbk7 1
 
0.2%
9dnsmtbbsp 1
 
0.2%
9dnsmtbbak 1
 
0.2%
9dnsmtbbfb 1
 
0.2%
Other values (489) 489
97.8%
2023-12-12T16:38:42.822098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 531
10.6%
d 526
10.5%
S 524
10.5%
M 524
10.5%
n 522
10.4%
T 522
10.4%
b 370
 
7.4%
a 171
 
3.4%
J 33
 
0.7%
E 33
 
0.7%
Other values (52) 1244
24.9%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 2126
42.5%
Lowercase Letter 2121
42.4%
Decimal Number 753
 
15.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
d 526
24.8%
n 522
24.6%
b 370
17.4%
a 171
 
8.1%
z 31
 
1.5%
v 30
 
1.4%
i 30
 
1.4%
c 27
 
1.3%
h 27
 
1.3%
m 27
 
1.3%
Other values (16) 360
17.0%
Uppercase Letter
ValueCountFrequency (%)
S 524
24.6%
M 524
24.6%
T 522
24.6%
J 33
 
1.6%
E 33
 
1.6%
P 30
 
1.4%
C 29
 
1.4%
V 28
 
1.3%
Z 28
 
1.3%
I 27
 
1.3%
Other values (16) 348
16.4%
Decimal Number
ValueCountFrequency (%)
9 531
70.5%
4 32
 
4.2%
0 29
 
3.9%
2 27
 
3.6%
3 24
 
3.2%
8 23
 
3.1%
1 23
 
3.1%
7 22
 
2.9%
5 21
 
2.8%
6 21
 
2.8%

Most occurring scripts

ValueCountFrequency (%)
Latin 4247
84.9%
Common 753
 
15.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
d 526
12.4%
S 524
12.3%
M 524
12.3%
n 522
12.3%
T 522
12.3%
b 370
 
8.7%
a 171
 
4.0%
J 33
 
0.8%
E 33
 
0.8%
z 31
 
0.7%
Other values (42) 991
23.3%
Common
ValueCountFrequency (%)
9 531
70.5%
4 32
 
4.2%
0 29
 
3.9%
2 27
 
3.6%
3 24
 
3.2%
8 23
 
3.1%
1 23
 
3.1%
7 22
 
2.9%
5 21
 
2.8%
6 21
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 531
10.6%
d 526
10.5%
S 524
10.5%
M 524
10.5%
n 522
10.4%
T 522
10.4%
b 370
 
7.4%
a 171
 
3.4%
J 33
 
0.7%
E 33
 
0.7%
Other values (52) 1244
24.9%

회계년월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
202109
500 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row202109
2nd row202109
3rd row202109
4th row202109
5th row202109

Common Values

ValueCountFrequency (%)
202109 500
100.0%

Length

2023-12-12T16:38:43.011681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:38:43.127085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
202109 500
100.0%

회계구분코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
G
500 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowG
2nd rowG
3rd rowG
4th rowG
5th rowG

Common Values

ValueCountFrequency (%)
G 500
100.0%

Length

2023-12-12T16:38:43.243567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:38:43.379303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
g 500
100.0%

원가분류코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
15
439 
14
 
35
11
 
21
12
 
5

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row14
2nd row14
3rd row14
4th row12
5th row15

Common Values

ValueCountFrequency (%)
15 439
87.8%
14 35
 
7.0%
11 21
 
4.2%
12 5
 
1.0%

Length

2023-12-12T16:38:43.482158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:38:43.610789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
15 439
87.8%
14 35
 
7.0%
11 21
 
4.2%
12 5
 
1.0%

프로그램코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
3300
439 
2400
56 
 
5

Length

Max length4
Median length4
Mean length3.97
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2400
2nd row2400
3rd row2400
4th row
5th row3300

Common Values

ValueCountFrequency (%)
3300 439
87.8%
2400 56
 
11.2%
5
 
1.0%

Length

2023-12-12T16:38:43.751566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:38:43.877458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3300 439
88.7%
2400 56
 
11.3%

국가단위사업코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
3378
439 
2431
56 
 
5

Length

Max length4
Median length4
Mean length3.97
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2431
2nd row2431
3rd row2431
4th row
5th row3378

Common Values

ValueCountFrequency (%)
3378 439
87.8%
2431 56
 
11.2%
5
 
1.0%

Length

2023-12-12T16:38:44.015450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:38:44.144892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3378 439
88.7%
2431 56
 
11.3%

소관내부거래금액
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
0
500 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 500
100.0%

Length

2023-12-12T16:38:44.267610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:38:44.380682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 500
100.0%

회계내부거래금액
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
0
500 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 500
100.0%

Length

2023-12-12T16:38:44.480028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:38:44.583721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 500
100.0%

삭제여부
Boolean

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size632.0 B
False
500 
ValueCountFrequency (%)
False 500
100.0%
2023-12-12T16:38:44.659658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

최종수정수
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
1
500 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 500
100.0%

Length

2023-12-12T16:38:44.764358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:38:44.864116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 500
100.0%

처리직원번호
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
AND07
500 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAND07
2nd rowAND07
3rd rowAND07
4th rowAND07
5th rowAND07

Common Values

ValueCountFrequency (%)
AND07 500
100.0%

Length

2023-12-12T16:38:44.966775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:38:45.055556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
and07 500
100.0%

최초처리직원번호
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
AND07
500 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAND07
2nd rowAND07
3rd rowAND07
4th rowAND07
5th rowAND07

Common Values

ValueCountFrequency (%)
AND07 500
100.0%

Length

2023-12-12T16:38:45.157326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:38:45.256563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
and07 500
100.0%

Correlations

2023-12-12T16:38:45.312030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
원가분류코드프로그램코드국가단위사업코드
원가분류코드1.0001.0001.000
프로그램코드1.0001.0001.000
국가단위사업코드1.0001.0001.000
2023-12-12T16:38:45.426439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국가단위사업코드원가분류코드프로그램코드
국가단위사업코드1.0000.9991.000
원가분류코드0.9991.0000.999
프로그램코드1.0000.9991.000
2023-12-12T16:38:45.534705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
원가분류코드프로그램코드국가단위사업코드
원가분류코드1.0000.9990.999
프로그램코드0.9991.0001.000
국가단위사업코드0.9991.0001.000

Missing values

2023-12-12T16:38:41.581444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:38:41.782926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

제1차비용집계내역ID회계년월회계구분코드원가분류코드프로그램코드국가단위사업코드소관내부거래금액회계내부거래금액삭제여부최종수정수처리직원번호최초처리직원번호
09dnSMTbQ96202109G142400243100N1AND07AND07
19dnSMTbQ4X202109G142400243100N1AND07AND07
29dnSMTbQZc202109G142400243100N1AND07AND07
39dnSMTbQJp202109G1200N1AND07AND07
49dnSMTbQEw202109G153300337800N1AND07AND07
59dnSMTbQzG202109G153300337800N1AND07AND07
69dnSMTbQt3202109G153300337800N1AND07AND07
79dnSMTbQoY202109G153300337800N1AND07AND07
89dnSMTbQi5202109G153300337800N1AND07AND07
99dnSMTbQcx202109G153300337800N1AND07AND07
제1차비용집계내역ID회계년월회계구분코드원가분류코드프로그램코드국가단위사업코드소관내부거래금액회계내부거래금액삭제여부최종수정수처리직원번호최초처리직원번호
4909dnSMTaSxp202109G153300337800N1AND07AND07
4919dnSMTaSsi202109G153300337800N1AND07AND07
4929dnSMTaSm3202109G153300337800N1AND07AND07
4939dnSMTaShC202109G153300337800N1AND07AND07
4949dnSMTaR8H202109G153300337800N1AND07AND07
4959dnSMTaRUp202109G153300337800N1AND07AND07
4969dnSMTaRO8202109G153300337800N1AND07AND07
4979dnSMTaRI2202109G153300337800N1AND07AND07
4989dnSMTaRDm202109G153300337800N1AND07AND07
4999dnSMTaRyx202109G153300337800N1AND07AND07