Overview

Dataset statistics

Number of variables15
Number of observations500
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory61.7 KiB
Average record size in memory126.3 B

Variable types

Text1
Categorical11
Boolean1
DateTime2

Dataset

Description해당 파일 데이터는 신용보증기금 회계 비용 집계 내역에 관련된 정보를 확인하실 수 있는 자료이니 활용에 참고하시기 바랍니다.
Author신용보증기금
URLhttps://www.data.go.kr/data/15092678/fileData.do

Alerts

회계년월 has constant value ""Constant
회계구분코드 has constant value ""Constant
소관내부거래금액 has constant value ""Constant
회계내부거래금액 has constant value ""Constant
삭제여부 has constant value ""Constant
최종수정수 has constant value ""Constant
처리직원번호 has constant value ""Constant
최초처리직원번호 has constant value ""Constant
프로그램코드 is highly overall correlated with 원가분류코드 and 2 other fieldsHigh correlation
원가분류코드 is highly overall correlated with 프로그램코드 and 2 other fieldsHigh correlation
국가단위사업코드 is highly overall correlated with 원가분류코드 and 2 other fieldsHigh correlation
귀속유형구분코드 is highly overall correlated with 원가분류코드 and 2 other fieldsHigh correlation
원가분류코드 is highly imbalanced (63.0%)Imbalance
프로그램코드 is highly imbalanced (59.6%)Imbalance
국가단위사업코드 is highly imbalanced (59.6%)Imbalance
귀속유형구분코드 is highly imbalanced (60.2%)Imbalance
구분회계비용집계내역ID has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:57:09.399184
Analysis finished2023-12-12 17:57:10.316398
Duration0.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct500
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-13T02:57:10.595940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters5000
Distinct characters62
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique500 ?
Unique (%)100.0%

Sample

1st row9dnSMTxneV
2nd row9dnSMTxm8X
3rd row9dnSMTxm28
4th row9dnSMTxmW6
5th row9dnSMTxmRw
ValueCountFrequency (%)
9dnsmtwxjt 2
 
0.4%
9dnsmtxgkq 2
 
0.4%
9dnsmtxnev 1
 
0.2%
9dnsmtwkji 1
 
0.2%
9dnsmtwgup 1
 
0.2%
9dnsmtwgz9 1
 
0.2%
9dnsmtwg8g 1
 
0.2%
9dnsmtwhed 1
 
0.2%
9dnsmtwhke 1
 
0.2%
9dnsmtwhqg 1
 
0.2%
Other values (488) 488
97.6%
2023-12-13T02:57:11.121734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
n 535
10.7%
T 530
10.6%
9 528
10.6%
M 522
10.4%
d 517
10.3%
S 516
10.3%
w 417
 
8.3%
x 116
 
2.3%
O 34
 
0.7%
Z 34
 
0.7%
Other values (52) 1251
25.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2132
42.6%
Uppercase Letter 2129
42.6%
Decimal Number 739
 
14.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 535
25.1%
d 517
24.2%
w 417
19.6%
x 116
 
5.4%
m 33
 
1.5%
v 31
 
1.5%
s 30
 
1.4%
e 29
 
1.4%
f 28
 
1.3%
l 28
 
1.3%
Other values (16) 368
17.3%
Uppercase Letter
ValueCountFrequency (%)
T 530
24.9%
M 522
24.5%
S 516
24.2%
O 34
 
1.6%
Z 34
 
1.6%
G 30
 
1.4%
P 30
 
1.4%
V 30
 
1.4%
H 30
 
1.4%
A 29
 
1.4%
Other values (16) 344
16.2%
Decimal Number
ValueCountFrequency (%)
9 528
71.4%
8 30
 
4.1%
6 28
 
3.8%
1 25
 
3.4%
0 24
 
3.2%
3 23
 
3.1%
2 23
 
3.1%
5 21
 
2.8%
7 20
 
2.7%
4 17
 
2.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 4261
85.2%
Common 739
 
14.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 535
12.6%
T 530
12.4%
M 522
12.3%
d 517
12.1%
S 516
12.1%
w 417
9.8%
x 116
 
2.7%
O 34
 
0.8%
Z 34
 
0.8%
m 33
 
0.8%
Other values (42) 1007
23.6%
Common
ValueCountFrequency (%)
9 528
71.4%
8 30
 
4.1%
6 28
 
3.8%
1 25
 
3.4%
0 24
 
3.2%
3 23
 
3.1%
2 23
 
3.1%
5 21
 
2.8%
7 20
 
2.7%
4 17
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 535
10.7%
T 530
10.6%
9 528
10.6%
M 522
10.4%
d 517
10.3%
S 516
10.3%
w 417
 
8.3%
x 116
 
2.3%
O 34
 
0.7%
Z 34
 
0.7%
Other values (52) 1251
25.0%

회계년월
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
202109
500 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row202109
2nd row202109
3rd row202109
4th row202109
5th row202109

Common Values

ValueCountFrequency (%)
202109 500
100.0%

Length

2023-12-13T02:57:11.331945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:57:11.460574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
202109 500
100.0%

회계구분코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
G
500 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowG
2nd rowG
3rd rowG
4th rowG
5th rowG

Common Values

ValueCountFrequency (%)
G 500
100.0%

Length

2023-12-13T02:57:11.584852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:57:11.711217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
g 500
100.0%

원가분류코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
15
436 
14
 
33
11
 
19
12
 
12

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row14
2nd row14
3rd row14
4th row14
5th row15

Common Values

ValueCountFrequency (%)
15 436
87.2%
14 33
 
6.6%
11 19
 
3.8%
12 12
 
2.4%

Length

2023-12-13T02:57:11.839430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:57:11.968324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
15 436
87.2%
14 33
 
6.6%
11 19
 
3.8%
12 12
 
2.4%

프로그램코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
3300
436 
2400
52 
 
12

Length

Max length4
Median length4
Mean length3.928
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2400
2nd row2400
3rd row2400
4th row2400
5th row3300

Common Values

ValueCountFrequency (%)
3300 436
87.2%
2400 52
 
10.4%
12
 
2.4%

Length

2023-12-13T02:57:12.149798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:57:12.302361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3300 436
89.3%
2400 52
 
10.7%

국가단위사업코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
3378
436 
2431
52 
 
12

Length

Max length4
Median length4
Mean length3.928
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2431
2nd row2431
3rd row2431
4th row2431
5th row3378

Common Values

ValueCountFrequency (%)
3378 436
87.2%
2431 52
 
10.4%
12
 
2.4%

Length

2023-12-13T02:57:12.448135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:57:12.571856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3378 436
89.3%
2431 52
 
10.7%

귀속유형구분코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
51
424 
1
75 
52
 
1

Length

Max length2
Median length2
Mean length1.85
Min length1

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row51

Common Values

ValueCountFrequency (%)
51 424
84.8%
1 75
 
15.0%
52 1
 
0.2%

Length

2023-12-13T02:57:12.714375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:57:12.848224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
51 424
84.8%
1 75
 
15.0%
52 1
 
0.2%

소관내부거래금액
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
0
500 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 500
100.0%

Length

2023-12-13T02:57:12.991892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:57:13.083898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 500
100.0%

회계내부거래금액
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
0
500 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 500
100.0%

Length

2023-12-13T02:57:13.193067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:57:13.307230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 500
100.0%

삭제여부
Boolean

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size632.0 B
False
500 
ValueCountFrequency (%)
False 500
100.0%
2023-12-13T02:57:13.395348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

최종수정수
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
1
500 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 500
100.0%

Length

2023-12-13T02:57:13.504769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:57:13.623744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 500
100.0%
Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
Minimum2023-12-13 06:40:18
Maximum2023-12-13 06:40:30
2023-12-13T02:57:13.710908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:57:13.863955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=3)

처리직원번호
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
AND07
500 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAND07
2nd rowAND07
3rd rowAND07
4th rowAND07
5th rowAND07

Common Values

ValueCountFrequency (%)
AND07 500
100.0%

Length

2023-12-13T02:57:14.007854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:57:14.112844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
and07 500
100.0%
Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
Minimum2023-12-13 06:40:18
Maximum2023-12-13 06:40:30
2023-12-13T02:57:14.202339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:57:14.310095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=3)

최초처리직원번호
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
AND07
500 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAND07
2nd rowAND07
3rd rowAND07
4th rowAND07
5th rowAND07

Common Values

ValueCountFrequency (%)
AND07 500
100.0%

Length

2023-12-13T02:57:14.444739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:57:14.542649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
and07 500
100.0%

Correlations

2023-12-13T02:57:14.604679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
원가분류코드프로그램코드국가단위사업코드귀속유형구분코드처리시각최초처리시각
원가분류코드1.0001.0001.0000.5920.3770.377
프로그램코드1.0001.0001.0000.8950.7120.712
국가단위사업코드1.0001.0001.0000.8950.7120.712
귀속유형구분코드0.5920.8950.8951.0000.7560.756
처리시각0.3770.7120.7120.7561.0001.000
최초처리시각0.3770.7120.7120.7561.0001.000
2023-12-13T02:57:14.747619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
프로그램코드귀속유형구분코드원가분류코드국가단위사업코드
프로그램코드1.0000.6060.9991.000
귀속유형구분코드0.6061.0000.6040.606
원가분류코드0.9990.6041.0000.999
국가단위사업코드1.0000.6060.9991.000
2023-12-13T02:57:14.853944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
원가분류코드프로그램코드국가단위사업코드귀속유형구분코드
원가분류코드1.0000.9990.9990.604
프로그램코드0.9991.0001.0000.606
국가단위사업코드0.9991.0001.0000.606
귀속유형구분코드0.6040.6060.6061.000

Missing values

2023-12-13T02:57:09.931124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:57:10.205169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분회계비용집계내역ID회계년월회계구분코드원가분류코드프로그램코드국가단위사업코드귀속유형구분코드소관내부거래금액회계내부거래금액삭제여부최종수정수처리시각처리직원번호최초처리시각최초처리직원번호
09dnSMTxneV202109G1424002431100N106:40.5AND0706:40.5AND07
19dnSMTxm8X202109G1424002431100N106:40.5AND0706:40.5AND07
29dnSMTxm28202109G1424002431100N106:40.5AND0706:40.5AND07
39dnSMTxmW6202109G1424002431100N106:40.5AND0706:40.5AND07
49dnSMTxmRw202109G15330033785100N106:40.5AND0706:40.5AND07
59dnSMTxmMD202109G15330033785100N106:40.5AND0706:40.5AND07
69dnSMTxmCj202109G15330033785100N106:40.5AND0706:40.5AND07
79dnSMTxmsF202109G15330033785100N106:40.5AND0706:40.5AND07
89dnSMTxmkc202109G15330033785100N106:40.5AND0706:40.5AND07
99dnSMTxmfc202109G15330033785100N106:40.5AND0706:40.5AND07
구분회계비용집계내역ID회계년월회계구분코드원가분류코드프로그램코드국가단위사업코드귀속유형구분코드소관내부거래금액회계내부거래금액삭제여부최종수정수처리시각처리직원번호최초처리시각최초처리직원번호
4909dnSMTwlB4202109G1424002431100N106:40.3AND0706:40.3AND07
4919dnSMTwlum202109G1424002431100N106:40.3AND0706:40.3AND07
4929dnSMTwlnz202109G1424002431100N106:40.3AND0706:40.3AND07
4939dnSMTwlgn202109G1424002431100N106:40.3AND0706:40.3AND07
4949dnSMTwk83202109G1424002431100N106:40.3AND0706:40.3AND07
4959dnSMTwk1Q202109G1424002431100N106:40.3AND0706:40.3AND07
4969dnSMTwkTR202109G1424002431100N106:40.3AND0706:40.3AND07
4979dnSMTwkNZ202109G1424002431100N106:40.3AND0706:40.3AND07
4989dnSMTwkIo202109G1424002431100N106:40.3AND0706:40.3AND07
4999dnSMTwkCj202109G1424002431100N106:40.3AND0706:40.3AND07