Overview

Dataset statistics

Number of variables12
Number of observations200
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.2 KiB
Average record size in memory103.7 B

Variable types

Text1
Categorical9
Boolean1
Numeric1

Dataset

Description한국기계연구원의 연구관리 분야에서 사업/과제계획서참여기관을 관리하는 테이블 정보(과제번호, 참여기관, 참여형태, 발주자여부 및 금액 등을 관리)
URLhttps://www.data.go.kr/data/15078052/fileData.do

Alerts

참여기관부담금액_현물 has constant value ""Constant
참여기관연구비_정부 has constant value ""Constant
참여기관연구비_현금 has constant value ""Constant
참여기관연구비_현물 has constant value ""Constant
참여기관연구비_상대국_현금 has constant value ""Constant
참여기관연구비_상대국현물 has constant value ""Constant
작성일 has constant value ""Constant
참여기관 is highly overall correlated with 참여형태 and 1 other fieldsHigh correlation
참여형태 is highly overall correlated with 참여기관 and 1 other fieldsHigh correlation
발주자여부 is highly overall correlated with 참여기관 and 1 other fieldsHigh correlation
참여기관 is highly imbalanced (62.5%)Imbalance
참여형태 is highly imbalanced (69.9%)Imbalance
발주자여부 is highly imbalanced (63.4%)Imbalance

Reproduction

Analysis started2023-12-12 04:37:59.295153
Analysis finished2023-12-12 04:37:59.887009
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct190
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-12T13:38:00.254059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters1200
Distinct characters31
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique186 ?
Unique (%)93.0%

Sample

1st rowNK215I
2nd rowNK221I
3rd rowNK221J
4th rowNK223A
5th rowNK215K
ValueCountFrequency (%)
nk218b 4
 
2.0%
nk219f 4
 
2.0%
nk224c 3
 
1.5%
nk237b 3
 
1.5%
nk226a 1
 
0.5%
nk213a 1
 
0.5%
nk224a 1
 
0.5%
nk217c 1
 
0.5%
nk215i 1
 
0.5%
nk214o 1
 
0.5%
Other values (180) 180
90.0%
2023-12-12T13:38:00.834477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 268
22.3%
K 193
16.1%
N 190
15.8%
1 112
9.3%
3 60
 
5.0%
4 57
 
4.8%
C 37
 
3.1%
0 34
 
2.8%
B 27
 
2.2%
D 24
 
2.0%
Other values (21) 198
16.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 612
51.0%
Uppercase Letter 588
49.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
K 193
32.8%
N 190
32.3%
C 37
 
6.3%
B 27
 
4.6%
D 24
 
4.1%
E 22
 
3.7%
A 21
 
3.6%
F 19
 
3.2%
S 13
 
2.2%
G 11
 
1.9%
Other values (11) 31
 
5.3%
Decimal Number
ValueCountFrequency (%)
2 268
43.8%
1 112
18.3%
3 60
 
9.8%
4 57
 
9.3%
0 34
 
5.6%
7 19
 
3.1%
5 18
 
2.9%
6 15
 
2.5%
8 15
 
2.5%
9 14
 
2.3%

Most occurring scripts

ValueCountFrequency (%)
Common 612
51.0%
Latin 588
49.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
K 193
32.8%
N 190
32.3%
C 37
 
6.3%
B 27
 
4.6%
D 24
 
4.1%
E 22
 
3.7%
A 21
 
3.6%
F 19
 
3.2%
S 13
 
2.2%
G 11
 
1.9%
Other values (11) 31
 
5.3%
Common
ValueCountFrequency (%)
2 268
43.8%
1 112
18.3%
3 60
 
9.8%
4 57
 
9.3%
0 34
 
5.6%
7 19
 
3.1%
5 18
 
2.9%
6 15
 
2.5%
8 15
 
2.5%
9 14
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1200
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 268
22.3%
K 193
16.1%
N 190
15.8%
1 112
9.3%
3 60
 
5.0%
4 57
 
4.8%
C 37
 
3.1%
0 34
 
2.8%
B 27
 
2.2%
D 24
 
2.0%
Other values (21) 198
16.5%

참여기관
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct17
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
*국기*
144 
*가과*
33 
*주)*
 
7
*식회*
 
2
*북대*
 
2
Other values (12)
 
12

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique12 ?
Unique (%)6.0%

Sample

1st row*가과*
2nd row*가과*
3rd row*가과*
4th row*가과*
5th row*가과*

Common Values

ValueCountFrequency (%)
*국기* 144
72.0%
*가과* 33
 
16.5%
*주)* 7
 
3.5%
*식회* 2
 
1.0%
*북대* 2
 
1.0%
*울과* 1
 
0.5%
*양대* 1
 
0.5%
*국항* 1
 
0.5%
*하대* 1
 
0.5%
*원대* 1
 
0.5%
Other values (7) 7
 
3.5%

Length

2023-12-12T13:38:01.032961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
국기 144
72.0%
가과 33
 
16.5%
7
 
3.5%
식회 2
 
1.0%
북대 2
 
1.0%
학기 1
 
0.5%
이오 1
 
0.5%
구경 1
 
0.5%
국과 1
 
0.5%
크테 1
 
0.5%
Other values (7) 7
 
3.5%

참여형태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
총괄
176 
관리기관
 
10
위탁
 
6
위탁2
 
3
참여
 
3

Length

Max length4
Median length2
Mean length2.125
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row관리기관
2nd row관리기관
3rd row총괄
4th row총괄
5th row총괄

Common Values

ValueCountFrequency (%)
총괄 176
88.0%
관리기관 10
 
5.0%
위탁 6
 
3.0%
위탁2 3
 
1.5%
참여 3
 
1.5%
위탁3 2
 
1.0%

Length

2023-12-12T13:38:01.223399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:38:01.413368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
총괄 176
88.0%
관리기관 10
 
5.0%
위탁 6
 
3.0%
위탁2 3
 
1.5%
참여 3
 
1.5%
위탁3 2
 
1.0%

발주자여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
True
186 
False
 
14
ValueCountFrequency (%)
True 186
93.0%
False 14
 
7.0%
2023-12-12T13:38:01.557783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct61
Distinct (%)30.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.1739369 × 109
Minimum17000000
Maximum2.1287965 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-12T13:38:01.738562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum17000000
5-th percentile38000000
Q12.019335 × 108
median4.29015 × 109
Q31.064191 × 1010
95-th percentile1.8736698 × 1010
Maximum2.1287965 × 1010
Range2.1270965 × 1010
Interquartile range (IQR)1.0439976 × 1010

Descriptive statistics

Standard deviation6.4220057 × 109
Coefficient of variation (CV)1.04018
Kurtosis-0.60898842
Mean6.1739369 × 109
Median Absolute Deviation (MAD)4.22115 × 109
Skewness0.7894307
Sum1.2347874 × 1012
Variance4.1242157 × 1019
MonotonicityNot monotonic
2023-12-12T13:38:01.974237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
220000000 11
 
5.5%
40000000 11
 
5.5%
20000000 7
 
3.5%
16687113000 6
 
3.0%
9491773000 6
 
3.0%
15676876000 6
 
3.0%
4191054000 6
 
3.0%
16288399000 6
 
3.0%
6509752500 6
 
3.0%
2649630000 6
 
3.0%
Other values (51) 129
64.5%
ValueCountFrequency (%)
17000000 2
 
1.0%
20000000 7
3.5%
38000000 2
 
1.0%
40000000 11
5.5%
42000000 6
3.0%
45000000 1
 
0.5%
45100000 1
 
0.5%
50000000 1
 
0.5%
55000000 1
 
0.5%
60000000 3
 
1.5%
ValueCountFrequency (%)
21287965000 6
3.0%
18736698000 6
3.0%
16687113000 6
3.0%
16288399000 6
3.0%
15676876000 6
3.0%
14384564000 6
3.0%
13504131000 6
3.0%
10779373000 6
3.0%
10641910000 6
3.0%
10071551242 5
2.5%

참여기관부담금액_현물
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
0
200 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 200
100.0%

Length

2023-12-12T13:38:02.168797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:38:02.321473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 200
100.0%

참여기관연구비_정부
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
0
200 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 200
100.0%

Length

2023-12-12T13:38:02.777045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:38:02.895772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 200
100.0%

참여기관연구비_현금
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
0
200 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 200
100.0%

Length

2023-12-12T13:38:02.996618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:38:03.085851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 200
100.0%

참여기관연구비_현물
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
0
200 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 200
100.0%

Length

2023-12-12T13:38:03.174161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:38:03.266583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 200
100.0%
Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
0
200 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 200
100.0%

Length

2023-12-12T13:38:03.382517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:38:03.474930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 200
100.0%
Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
0
200 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 200
100.0%

Length

2023-12-12T13:38:03.579565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:38:03.698280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 200
100.0%

작성일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-07-28
200 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-28
2nd row2023-07-28
3rd row2023-07-28
4th row2023-07-28
5th row2023-07-28

Common Values

ValueCountFrequency (%)
2023-07-28 200
100.0%

Length

2023-12-12T13:38:03.819124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:38:03.910954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-28 200
100.0%

Interactions

2023-12-12T13:37:59.521494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:38:03.977966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
참여기관참여형태발주자여부참여기관부담금액_현금
참여기관1.0000.9530.9510.000
참여형태0.9531.0001.0000.061
발주자여부0.9511.0001.0000.382
참여기관부담금액_현금0.0000.0610.3821.000
2023-12-12T13:38:04.099574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
참여기관참여형태발주자여부
참여기관1.0000.8110.905
참여형태0.8111.0000.990
발주자여부0.9050.9901.000
2023-12-12T13:38:04.205770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
참여기관부담금액_현금참여기관참여형태발주자여부
참여기관부담금액_현금1.0000.0000.0000.279
참여기관0.0001.0000.8110.905
참여형태0.0000.8111.0000.990
발주자여부0.2790.9050.9901.000

Missing values

2023-12-12T13:37:59.646257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:37:59.821127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업_과제번호참여기관참여형태발주자여부참여기관부담금액_현금참여기관부담금액_현물참여기관연구비_정부참여기관연구비_현금참여기관연구비_현물참여기관연구비_상대국_현금참여기관연구비_상대국현물작성일
0NK215I*가과*관리기관Y170000000000002023-07-28
1NK221I*가과*관리기관Y1742570000000002023-07-28
2NK221J*가과*총괄Y2140930000000002023-07-28
3NK223A*가과*총괄Y13110000000000002023-07-28
4NK215K*가과*총괄Y380000000000002023-07-28
5NK222K*가과*총괄Y420000000000002023-07-28
6NK221F*가과*총괄Y2025780000000002023-07-28
7NK222C*가과*관리기관Y420000000000002023-07-28
8NK215H*가과*관리기관Y170000000000002023-07-28
9NK221D*가과*총괄Y2166890000000002023-07-28
사업_과제번호참여기관참여형태발주자여부참여기관부담금액_현금참여기관부담금액_현물참여기관연구비_정부참여기관연구비_현금참여기관연구비_현물참여기관연구비_상대국_현금참여기관연구비_상대국현물작성일
190NK224C*국과*위탁2N400000000000002023-07-28
191NK219F*구경*위탁N500000000000002023-07-28
192NK227M*식회*참여N200000000000002023-07-28
193NK214H*주)*총괄Y2200000000000002023-07-28
194NK214P*주)*총괄Y2200000000000002023-07-28
195NK219F*칸소*위탁2N550000000000002023-07-28
196NK221K*주)*총괄Y2027270000000002023-07-28
197NK214U*식회*총괄Y1068610000000002023-07-28
198NK221M*주)*총괄Y2157380000000002023-07-28
199NK227K*주)*참여N200000000000002023-07-28