Overview

Dataset statistics

Number of variables9
Number of observations227
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.8 KiB
Average record size in memory75.6 B

Variable types

Text1
Numeric2
Categorical4
DateTime2

Dataset

Description한국기계연구원의 연구관리 분야에서 사업/과제계획서연구비지급일정 테이블 정보(과제번호, 참여기관, 연구비지급차수, 연구비지급구분, 연구비지급예정일자, 연구비지급예정금액 등을 관리)
URLhttps://www.data.go.kr/data/15078084/fileData.do

Alerts

연구비지급차수 has constant value ""Constant
연구비지급구분 has constant value ""Constant
작성일 has constant value ""Constant
참여형태 is highly imbalanced (63.4%)Imbalance

Reproduction

Analysis started2023-12-12 20:20:20.412197
Analysis finished2023-12-12 20:20:22.271380
Duration1.86 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct100
Distinct (%)44.1%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-13T05:20:22.582924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters1362
Distinct characters23
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)17.6%

Sample

1st rowNK213F
2nd rowNK213F
3rd rowNK213G
4th rowNK213G
5th rowNK213F
ValueCountFrequency (%)
nk217g 6
 
2.6%
nk224g 6
 
2.6%
nk238f 6
 
2.6%
nk232f 6
 
2.6%
nk226f 6
 
2.6%
nk210h 5
 
2.2%
nk236c 4
 
1.8%
nk230a 4
 
1.8%
nk213g 4
 
1.8%
nk226b 4
 
1.8%
Other values (90) 176
77.5%
2023-12-13T05:20:23.135573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 299
22.0%
K 221
16.2%
N 220
16.2%
3 117
 
8.6%
1 91
 
6.7%
0 49
 
3.6%
F 46
 
3.4%
6 39
 
2.9%
B 39
 
2.9%
C 29
 
2.1%
Other values (13) 212
15.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 688
50.5%
Uppercase Letter 674
49.5%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
K 221
32.8%
N 220
32.6%
F 46
 
6.8%
B 39
 
5.8%
C 29
 
4.3%
G 25
 
3.7%
E 25
 
3.7%
A 24
 
3.6%
D 23
 
3.4%
H 7
 
1.0%
Other values (3) 15
 
2.2%
Decimal Number
ValueCountFrequency (%)
2 299
43.5%
3 117
 
17.0%
1 91
 
13.2%
0 49
 
7.1%
6 39
 
5.7%
7 28
 
4.1%
4 23
 
3.3%
8 22
 
3.2%
9 11
 
1.6%
5 9
 
1.3%

Most occurring scripts

ValueCountFrequency (%)
Common 688
50.5%
Latin 674
49.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
K 221
32.8%
N 220
32.6%
F 46
 
6.8%
B 39
 
5.8%
C 29
 
4.3%
G 25
 
3.7%
E 25
 
3.7%
A 24
 
3.6%
D 23
 
3.4%
H 7
 
1.0%
Other values (3) 15
 
2.2%
Common
ValueCountFrequency (%)
2 299
43.5%
3 117
 
17.0%
1 91
 
13.2%
0 49
 
7.1%
6 39
 
5.7%
7 28
 
4.1%
4 23
 
3.3%
8 22
 
3.2%
9 11
 
1.6%
5 9
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1362
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 299
22.0%
K 221
16.2%
N 220
16.2%
3 117
 
8.6%
1 91
 
6.7%
0 49
 
3.6%
F 46
 
3.4%
6 39
 
2.9%
B 39
 
2.9%
C 29
 
2.1%
Other values (13) 212
15.6%

순번
Real number (ℝ)

Distinct6
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0660793
Minimum1
Maximum6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-13T05:20:23.306464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile4
Maximum6
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.2337597
Coefficient of variation (CV)0.59715019
Kurtosis0.98424607
Mean2.0660793
Median Absolute Deviation (MAD)1
Skewness1.1855407
Sum469
Variance1.5221629
MonotonicityNot monotonic
2023-12-13T05:20:23.452782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 99
43.6%
2 59
26.0%
3 40
17.6%
4 18
 
7.9%
5 6
 
2.6%
6 5
 
2.2%
ValueCountFrequency (%)
1 99
43.6%
2 59
26.0%
3 40
17.6%
4 18
 
7.9%
5 6
 
2.6%
6 5
 
2.2%
ValueCountFrequency (%)
6 5
 
2.2%
5 6
 
2.6%
4 18
 
7.9%
3 40
17.6%
2 59
26.0%
1 99
43.6%

참여기관
Categorical

Distinct43
Distinct (%)18.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
*국과*
28 
*남대*
24 
*울대*
16 
*북대*
 
12
*양대*
 
11
Other values (38)
136 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique8 ?
Unique (%)3.5%

Sample

1st row*양대*
2nd row*코공*
3rd row*국대*
4th row*산과*
5th row*오공*

Common Values

ValueCountFrequency (%)
*국과* 28
 
12.3%
*남대* 24
 
10.6%
*울대* 16
 
7.0%
*북대* 12
 
5.3%
*양대* 11
 
4.8%
*세대* 11
 
4.8%
*ni* 9
 
4.0%
*려대* 9
 
4.0%
*울과* 8
 
3.5%
*주대* 7
 
3.1%
Other values (33) 92
40.5%

Length

2023-12-13T05:20:23.596657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
국과 28
 
12.3%
남대 24
 
10.6%
울대 16
 
7.0%
북대 12
 
5.3%
양대 11
 
4.8%
세대 11
 
4.8%
ni 9
 
4.0%
려대 9
 
4.0%
울과 8
 
3.5%
주대 7
 
3.1%
Other values (33) 92
40.5%

참여형태
Categorical

IMBALANCE 

Distinct5
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
위탁
190 
위탁2
24 
위탁3
 
10
공동
 
2
위탁4
 
1

Length

Max length3
Median length2
Mean length2.154185
Min length2

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row위탁
2nd row위탁
3rd row위탁
4th row위탁
5th row위탁

Common Values

ValueCountFrequency (%)
위탁 190
83.7%
위탁2 24
 
10.6%
위탁3 10
 
4.4%
공동 2
 
0.9%
위탁4 1
 
0.4%

Length

2023-12-13T05:20:23.741213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:20:23.872355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
위탁 190
83.7%
위탁2 24
 
10.6%
위탁3 10
 
4.4%
공동 2
 
0.9%
위탁4 1
 
0.4%

연구비지급차수
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
1
227 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 227
100.0%

Length

2023-12-13T05:20:24.055883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:20:24.200061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 227
100.0%

연구비지급구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
위탁연구비
227 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row위탁연구비
2nd row위탁연구비
3rd row위탁연구비
4th row위탁연구비
5th row위탁연구비

Common Values

ValueCountFrequency (%)
위탁연구비 227
100.0%

Length

2023-12-13T05:20:24.331235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:20:24.442665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
위탁연구비 227
100.0%
Distinct80
Distinct (%)35.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
Minimum2018-02-20 00:00:00
Maximum2022-06-20 00:00:00
2023-12-13T05:20:24.570262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:20:24.803458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct27
Distinct (%)11.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean53713254
Minimum48000
Maximum1.5 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-13T05:20:24.999329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum48000
5-th percentile30000000
Q140000000
median50000000
Q360000000
95-th percentile90000000
Maximum1.5 × 108
Range1.49952 × 108
Interquartile range (IQR)20000000

Descriptive statistics

Standard deviation21451817
Coefficient of variation (CV)0.3993766
Kurtosis4.2003591
Mean53713254
Median Absolute Deviation (MAD)10000000
Skewness1.5220436
Sum1.2192909 × 1010
Variance4.6018044 × 1014
MonotonicityNot monotonic
2023-12-13T05:20:25.182221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
50000000 67
29.5%
40000000 43
18.9%
30000000 27
11.9%
60000000 22
 
9.7%
80000000 18
 
7.9%
55000000 7
 
3.1%
90000000 7
 
3.1%
70000000 6
 
2.6%
100000000 5
 
2.2%
44000000 4
 
1.8%
Other values (17) 21
 
9.3%
ValueCountFrequency (%)
48000 1
 
0.4%
92000 1
 
0.4%
20000000 1
 
0.4%
30000000 27
11.9%
33000000 2
 
0.9%
40000000 43
18.9%
43000000 1
 
0.4%
44000000 4
 
1.8%
45000000 1
 
0.4%
48400000 1
 
0.4%
ValueCountFrequency (%)
150000000 2
 
0.9%
140000000 1
 
0.4%
128000000 1
 
0.4%
100000000 5
 
2.2%
96076520 1
 
0.4%
95000000 1
 
0.4%
90000000 7
 
3.1%
85000000 1
 
0.4%
80000000 18
7.9%
70000000 6
 
2.6%

작성일
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
Minimum2023-07-28 00:00:00
Maximum2023-07-28 00:00:00
2023-12-13T05:20:25.321670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:20:25.451864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T05:20:21.621758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:20:21.349669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:20:21.753721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:20:21.489447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:20:25.552582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과제번호순번참여기관참여형태연구비지급예정일자연구비지급예정금액
과제번호1.0000.0000.0000.0000.9120.668
순번0.0001.0000.3920.0000.6100.248
참여기관0.0000.3921.0000.5540.9160.794
참여형태0.0000.0000.5541.0000.7880.000
연구비지급예정일자0.9120.6100.9160.7881.0000.000
연구비지급예정금액0.6680.2480.7940.0000.0001.000
2023-12-13T05:20:25.705753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
참여기관참여형태
참여기관1.0000.266
참여형태0.2661.000
2023-12-13T05:20:25.820084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번연구비지급예정금액참여기관참여형태
순번1.0000.1800.1610.000
연구비지급예정금액0.1801.0000.4220.000
참여기관0.1610.4221.0000.266
참여형태0.0000.0000.2661.000

Missing values

2023-12-13T05:20:21.963352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:20:22.193365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

과제번호순번참여기관참여형태연구비지급차수연구비지급구분연구비지급예정일자연구비지급예정금액작성일
0NK213F2*양대*위탁1위탁연구비2018-02-27500000002023-07-28
1NK213F4*코공*위탁1위탁연구비2018-04-17450000002023-07-28
2NK213G1*국대*위탁1위탁연구비2018-02-20500000002023-07-28
3NK213G3*산과*위탁1위탁연구비2018-03-27400000002023-07-28
4NK213F1*오공*위탁1위탁연구비2018-02-23500000002023-07-28
5NK210E1*국해*위탁1위탁연구비2018-03-21400000002023-07-28
6NK231C2*천대*위탁1위탁연구비2021-04-30500000002023-07-28
7NK225C1*국과*위탁21위탁연구비2020-04-06500000002023-07-28
8SC12801*국과*위탁1위탁연구비2018-02-211000000002023-07-28
9SC13201*국과*위탁1위탁연구비2019-02-20950000002023-07-28
과제번호순번참여기관참여형태연구비지급차수연구비지급구분연구비지급예정일자연구비지급예정금액작성일
217NK232C1*산과*위탁1위탁연구비2021-05-04400000002023-07-28
218NK238C1*산과*위탁1위탁연구비2022-04-29400000002023-07-28
219NK236A4*ni*공동1위탁연구비2022-06-20600000002023-07-28
220NK231B3*국항*위탁1위탁연구비2021-05-04600000002023-07-28
221NK237B3*국항*위탁1위탁연구비2022-05-20600000002023-07-28
222NK230F3*구경*위탁1위탁연구비2021-04-30500000002023-07-28
223NK236F1*구경*위탁1위탁연구비2022-04-27500000002023-07-28
224NK237G2*북대*위탁1위탁연구비2022-05-12500000002023-07-28
225NK231G1*등기*위탁1위탁연구비2021-04-291000000002023-07-28
226NK237F1*등기*위탁1위탁연구비2022-04-281000000002023-07-28