Overview

Dataset statistics

Number of variables: 10
Number of observations: 470
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 36.8 KiB
Average record size in memory: 80.3 B

Variable types

Text: 1
Categorical: 8
DateTime: 1

Dataset

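Description: Project/task agreement information management table from the research management area of the Korea Institute of Machinery and Materials (한국기계연구원). It manages the project number, participation type, participating institution, agreement date, deposit account bank, deposit account bank branch name, and related fields.
URL: https://www.data.go.kr/data/15078100/fileData.do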

Alerts

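협약구분 (agreement type) has constant value "단년도" (Constant)
입금계좌은행 (deposit account bank) has constant value "기타" (Constant)
입금계좌은행지점명 (deposit account bank branch name) has constant value "기타" (Constant)
작성일 (creation date) has constant value "2023-07-28" (Constant)
참여기관 (participating institution) is highly overall correlated with 지급계좌은행 (payment account bank) (High correlation)
지급계좌은행 (payment account bank) is highly overall correlated with 참여기관 (participating institution) (High correlation)
지급계좌은행지점명 (payment account bank branch name) is highly imbalanced (91.2%) (Imbalance)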
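
The 91.2% imbalance figure is consistent with a normalized-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the category counts and k is the number of distinct categories. The sketch below is an assumption about how such a score can be computed, not a copy of the profiling tool's code. The counts come from the 지급계좌은행지점명 section of this report (11 distinct values; the eleventh value's count of 1 is inferred from the total of 470), and `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform distribution, close to 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # Assumption: a single class is treated as "constant", not "imbalanced".
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Counts from the report: 기타 = 455, then 3, 3, 2 and seven values that occur once.
branch_counts = [455, 3, 3, 2, 1, 1, 1, 1, 1, 1, 1]
score = imbalance_score(branch_counts)
print(f"{score:.1%}")
```

Under these assumptions the score rounds to 91.2%, which matches the alert.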

Reproduction

Analysis started: 2023-12-12 20:35:55.217724
Analysis finished: 2023-12-12 20:35:56.373408
Duration: 1.16 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
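
The reported duration can be cross-checked against the two timestamps. This is a small arithmetic sketch using only the values printed in this section; the variable names are illustrative.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction section.
started = datetime.strptime("2023-12-12 20:35:55.217724", FMT)
finished = datetime.strptime("2023-12-12 20:35:56.373408", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} s")
```

The difference is 1.155684 seconds, which rounds to the 1.16 seconds shown.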

Variables

사업_과제번호 (project/task number)
Text

Distinct: 241
Distinct (%): 51.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Characters and Unicode

Total characters: 2820
Distinct characters: 31
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
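
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It uses only the five sample values listed for this variable, so the totals differ from the full-column figures in this report.

```python
import unicodedata
from collections import Counter

# First five sample values of 사업_과제번호 from this report.
samples = ["SC1330", "SC1300", "NK237A", "NK231A", "SC1300"]

# unicodedata.category returns the two-letter general category,
# e.g. "Lu" (Uppercase Letter) or "Nd" (Decimal Number).
category_counts = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)
print(category_counts)
```

Only the two categories reported for the full column, Uppercase Letter and Decimal Number, appear in these samples.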

Unique

Unique: 141
Unique (%): 30.0%
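
The gap between Distinct (241) and Unique (141) follows from how the two terms are usually read in this kind of report: distinct counts the different values, while unique counts the values that occur exactly once. The sketch below shows the difference on the five sample values of this variable; it assumes those definitions and is not taken from the profiling tool's code.

```python
from collections import Counter

# First five sample values of 사업_과제번호 from this report.
samples = ["SC1330", "SC1300", "NK237A", "NK231A", "SC1300"]

counts = Counter(samples)
distinct = len(counts)                               # different values
unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once
print(distinct, unique)
```

Here SC1300 appears twice, so it counts as distinct but not as unique.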

Sample

1st row: SC1330
2nd row: SC1300
3rd row: NK237A
4th row: NK231A
5th row: SC1300
Value               Count  Frequency (%)
nk217g              7      1.5%
nk238f              7      1.5%
nk224g              7      1.5%
nk210h              7      1.5%
nk226f              7      1.5%
nk232f              7      1.5%
nk230f              5      1.1%
nk236f              5      1.1%
nk213g              5      1.1%
nk217b              5      1.1%
Other values (231)  408    86.8%

Most occurring characters

Value              Count  Frequency (%)
2                  625    22.2%
K                  461    16.3%
N                  456    16.2%
3                  215    7.6%
1                  202    7.2%
0                  97     3.4%
4                  83     2.9%
F                  68     2.4%
C                  67     2.4%
B                  65     2.3%
Other values (21)  481    17.1%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    1428   50.6%
Uppercase Letter  1392   49.4%

Most frequent character per category

Uppercase Letter
Value              Count  Frequency (%)
K                  461    33.1%
N                  456    32.8%
F                  68     4.9%
C                  67     4.8%
B                  65     4.7%
E                  51     3.7%
D                  50     3.6%
A                  49     3.5%
G                  42     3.0%
H                  20     1.4%
Other values (11)  63     4.5%

Decimal Number
Value  Count  Frequency (%)
2      625    43.8%
3      215    15.1%
1      202    14.1%
0      97     6.8%
4      83     5.8%
6      57     4.0%
7      55     3.9%
8      37     2.6%
9      30     2.1%
5      27     1.9%

Most occurring scripts

Value   Count  Frequency (%)
Common  1428   50.6%
Latin   1392   49.4%

Most frequent character per script

Latin
Value              Count  Frequency (%)
K                  461    33.1%
N                  456    32.8%
F                  68     4.9%
C                  67     4.8%
B                  65     4.7%
E                  51     3.7%
D                  50     3.6%
A                  49     3.5%
G                  42     3.0%
H                  20     1.4%
Other values (11)  63     4.5%

Common
Value  Count  Frequency (%)
2      625    43.8%
3      215    15.1%
1      202    14.1%
0      97     6.8%
4      83     5.8%
6      57     4.0%
7      55     3.9%
8      37     2.6%
9      30     2.1%
5      27     1.9%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  2820   100.0%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
2                  625    22.2%
K                  461    16.3%
N                  456    16.2%
3                  215    7.6%
1                  202    7.2%
0                  97     3.4%
4                  83     2.9%
F                  68     2.4%
C                  67     2.4%
B                  65     2.3%
Other values (21)  481    17.1%

참여형태 (participation type)
Categorical

Distinct: 7
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top values: 총괄 (230), 위탁 (191), 위탁2 (25), 관리기관 (11), 위탁3 (10), other values (2): 3

Length

Max length: 4
Median length: 2
Mean length: 2.1234043
Min length: 2
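
These length statistics can be recomputed from the value counts that this report lists for 참여형태. The sketch below assumes that length means the number of characters in each value, as given by Python's `len`; the variable names are illustrative.

```python
from statistics import median

# Value counts for 참여형태 as listed in this report (470 rows in total).
value_counts = {
    "총괄": 230, "위탁": 191, "위탁2": 25, "관리기관": 11,
    "위탁3": 10, "공동": 2, "위탁4": 1,
}

# One length entry per row: repeat each value's character length by its count.
lengths = [len(value) for value, n in value_counts.items() for _ in range(n)]

mean_length = sum(lengths) / len(lengths)
print(max(lengths), median(lengths), round(mean_length, 7), min(lengths))
```

The mean works out to 998 / 470, which rounds to the 2.1234043 shown.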

Unique

Unique: 1
Unique (%): 0.2%

Sample

1st row: 위탁
2nd row: 위탁
3rd row: 위탁
4th row: 위탁
5th row: 위탁

Common Values

Value     Count  Frequency (%)
총괄      230    48.9%
위탁      191    40.6%
위탁2     25     5.3%
관리기관  11     2.3%
위탁3     10     2.1%
공동      2      0.4%
위탁4     1      0.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

참여기관 (participating institution)
Categorical

HIGH CORRELATION

Distinct: 47
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top values: *국기* (193), *가과* (40), *국과* (28), *남대* (24), *울대* (16), other values (42): 169

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 11
Unique (%): 2.3%

Sample

1st row: *북대*
2nd row: *북대*
3rd row: *국기*
4th row: *국기*
5th row: *서대*

Common Values

Value              Count  Frequency (%)
*국기*             193    41.1%
*가과*             40     8.5%
*국과*             28     6.0%
*남대*             24     5.1%
*울대*             16     3.4%
*북대*             12     2.6%
*주)*              12     2.6%
*양대*             11     2.3%
*세대*             11     2.3%
*ni*               10     2.1%
Other values (37)  113    24.0%

Length

[Figure: Histogram of lengths of the category]

Value              Count  Frequency (%)
국기               193    41.1%
가과               40     8.5%
국과               28     6.0%
남대               24     5.1%
울대               16     3.4%
북대               12     2.6%
(blank)            12     2.6%
양대               11     2.3%
세대               11     2.3%
ni                 10     2.1%
Other values (37)  113    24.0%

협약구분 (agreement type)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top values: 단년도 (470)

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 단년도
2nd row: 단년도
3rd row: 단년도
4th row: 단년도
5th row: 단년도

Common Values

Value   Count  Frequency (%)
단년도  470    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

협약일자 (agreement date)
DateTime

Distinct: 76
Distinct (%): 16.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Minimum: 2018-01-02 00:00:00
Maximum: 2022-05-11 00:00:00
[Figure: Histogram with fixed size bins (bins=50)]

입금계좌은행 (deposit account bank)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top values: 기타 (470)

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 기타
2nd row: 기타
3rd row: 기타
4th row: 기타
5th row: 기타

Common Values

Value  Count  Frequency (%)
기타   470    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

입금계좌은행지점명 (deposit account bank branch name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top values: 기타 (470)

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 기타
2nd row: 기타
3rd row: 기타
4th row: 기타
5th row: 기타

Common Values

Value  Count  Frequency (%)
기타   470    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

지급계좌은행 (payment account bank)
Categorical

HIGH CORRELATION

Distinct: 14
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top values: 기타 (262), 우리은행 (63), KEB하나은행 (43), 농협은행 (40), 신한은행 (25), other values (9): 37

Length

Max length: 7
Median length: 2
Mean length: 3.1659574
Min length: 2

Unique

Unique: 2
Unique (%): 0.4%

Sample

1st row: 농협은행
2nd row: 농협은행
3rd row: 신한은행
4th row: 신한은행
5th row: 기업은행

Common Values

Value             Count  Frequency (%)
기타              262    55.7%
우리은행          63     13.4%
KEB하나은행       43     9.1%
농협은행          40     8.5%
신한은행          25     5.3%
대구은행          9      1.9%
기업은행          6      1.3%
국민은행          6      1.3%
경남은행          6      1.3%
수협중앙회        3      0.6%
Other values (4)  7      1.5%

Length

[Figure: Histogram of lengths of the category]

Value             Count  Frequency (%)
기타              262    55.7%
우리은행          63     13.4%
keb하나은행       43     9.1%
농협은행          40     8.5%
신한은행          25     5.3%
대구은행          9      1.9%
기업은행          6      1.3%
국민은행          6      1.3%
경남은행          6      1.3%
수협중앙회        3      0.6%
Other values (4)  7      1.5%

지급계좌은행지점명 (payment account bank branch name)
Categorical

IMBALANCE

Distinct: 11
Distinct (%): 2.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top values: 기타 (455), Bank of America (3), Norddeutsche Landesbank Hannover (3), First Tennessee Bank (2), (0660)충남대 (1), other values (6): 6

Length

Max length: 56
Median length: 2
Mean length: 2.5404255
Min length: 2

Unique

Unique: 7
Unique (%): 1.5%

Sample

1st row: 기타
2nd row: 기타
3rd row: 기타
4th row: 기타
5th row: 기타

Common Values

Value                                                     Count  Frequency (%)
기타                                                      455    96.8%
Bank of America                                           3      0.6%
Norddeutsche Landesbank Hannover                          3      0.6%
First Tennessee Bank                                      2      0.4%
(0660)충남대                                              1      0.2%
부산대학교지점                                            1      0.2%
울산영업부                                                1      0.2%
해외은행                                                  1      0.2%
Johns Hopkins University Central Lockbox Bank of America  1      0.2%
First Horizon Bank                                        1      0.2%

Length

[Figure: Histogram of lengths of the category]

Value              Count  Frequency (%)
기타               455    91.9%
bank               7      1.4%
of                 4      0.8%
america            4      0.8%
norddeutsche       3      0.6%
landesbank         3      0.6%
hannover           3      0.6%
first              3      0.6%
tennessee          2      0.4%
hopkins            1      0.2%
Other values (10)  10     2.0%

작성일 (creation date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Top values: 2023-07-28 (470)

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-07-28
2nd row: 2023-07-28
3rd row: 2023-07-28
4th row: 2023-07-28
5th row: 2023-07-28

Common Values

Value       Count  Frequency (%)
2023-07-28  470    100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Correlations

Variable            참여형태  참여기관  협약일자  지급계좌은행  지급계좌은행지점명
참여형태            1.000     0.830     0.896     0.754         0.673
참여기관            0.830     1.000     0.902     0.985         0.810
협약일자            0.896     0.902     1.000     0.726         0.969
지급계좌은행        0.754     0.985     0.726     1.000         0.000
지급계좌은행지점명  0.673     0.810     0.969     0.000         1.000

Variable            지급계좌은행  참여기관  참여형태  지급계좌은행지점명
지급계좌은행        1.000         0.816     0.380     0.000
참여기관            0.816         1.000     0.495     0.404
참여형태            0.380         0.495     1.000     0.414
지급계좌은행지점명  0.000         0.404     0.414     1.000

Variable            참여형태  참여기관  지급계좌은행  지급계좌은행지점명
참여형태            1.000     0.495     0.380         0.414
참여기관            0.495     1.000     0.816         0.404
지급계좌은행        0.380     0.816     1.000         0.000
지급계좌은행지점명  0.414     0.404     0.000         1.000
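
This extract does not state which association measure the matrices use. Cramér's V is one common choice for pairs of categorical variables, so the sketch below computes it in plain Python as an illustration. It is an uncorrected version (no bias correction), `cramers_v` is a hypothetical helper name, and the input is only the first ten sample rows of 참여기관 and 지급계좌은행 from this report, so the result is not expected to match the matrix values.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    pair_counts = Counter(zip(xs, ys))
    x_counts, y_counts = Counter(xs), Counter(ys)
    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, cx in x_counts.items():
        for y, cy in y_counts.items():
            expected = cx * cy / n
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_counts), len(y_counts)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return math.sqrt(chi2 / (n * k))

# 참여기관 and 지급계좌은행 for sample rows 0 to 9 of this report.
institution = ["*북대*", "*북대*", "*국기*", "*국기*", "*서대*"] + ["*가과*"] * 5
bank = ["농협은행", "농협은행", "신한은행", "신한은행", "기업은행"] + ["기타"] * 5
v = cramers_v(institution, bank)
print(round(v, 3))
```

In these ten rows each institution maps to exactly one bank, so the statistic reaches its maximum of 1; the full column shows a weaker but still high association.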

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  사업_과제번호  참여형태  참여기관  협약구분  협약일자    입금계좌은행  입금계좌은행지점명  지급계좌은행  지급계좌은행지점명  작성일
0      SC1330         위탁      *북대*    단년도    2019-02-22  기타          기타                농협은행      기타                2023-07-28
1      SC1300         위탁      *북대*    단년도    2018-03-15  기타          기타                농협은행      기타                2023-07-28
2      NK237A         위탁      *국기*    단년도    2022-01-01  기타          기타                신한은행      기타                2023-07-28
3      NK231A         위탁      *국기*    단년도    2021-01-01  기타          기타                신한은행      기타                2023-07-28
4      SC1300         위탁      *서대*    단년도    2018-02-12  기타          기타                기업은행      기타                2023-07-28
5      NK222H         총괄      *가과*    단년도    2019-05-01  기타          기타                기타          기타                2023-07-28
6      NK215I         관리기관  *가과*    단년도    2018-10-01  기타          기타                기타          기타                2023-07-28
7      NK221I         관리기관  *가과*    단년도    2019-01-01  기타          기타                기타          기타                2023-07-28
8      NK221J         총괄      *가과*    단년도    2019-01-01  기타          기타                기타          기타                2023-07-28
9      NK223A         총괄      *가과*    단년도    2019-01-01  기타          기타                기타          기타                2023-07-28

Last rows

Index  사업_과제번호  참여형태  참여기관  협약구분  협약일자    입금계좌은행  입금계좌은행지점명  지급계좌은행  지급계좌은행지점명  작성일
460    NK221M         총괄      *주)*     단년도    2019-01-01  기타          기타                기타          기타  2023-07-28
461    NK226D         위탁2     *스홉*    단년도    2020-04-01  기타          기타                기타          Johns Hopkins University Central Lockbox Bank of America  2023-07-28
462    NK217F         위탁2     *스홉*    단년도    2019-05-21  기타          기타                기타          Bank of America  2023-07-28
463    NK236A         공동      *ni*      단년도    2022-05-11  기타          기타                기타          First Horizon Bank  2023-07-28
464    NK230A         위탁3     *ni*      단년도    2021-04-29  기타          기타                기타          First Tennessee Bank  2023-07-28
465    NK224A         위탁3     *ni*      단년도    2020-04-06  기타          기타                기타          First Tennessee Bank  2023-07-28
466    NK217B         위탁3     *ni*      단년도    2019-08-01  기타          기타                기타          외국은행  2023-07-28
467    NK236J         위탁      *국재*    단년도    2022-01-01  기타          기타                우리은행      기타  2023-07-28
468    NK238D         위탁      *국보*    단년도    2022-01-01  기타          기타                국민은행      기타  2023-07-28
469    NK238B         위탁      *상국*    단년도    2022-01-01  기타          기타                농협은행      기타  2023-07-28