Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 1371 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 112.6 KiB |
Average record size in memory | 84.1 B |
Variable types
Text | 2 |
---|---|
Categorical | 1 |
DateTime | 3 |
Numeric | 4 |
Dataset
Description | 한국기계연구원의 연구관리 분야에서 과제계획서참여연구원을 관리하는 정보(과제번호, 참여자, 참여형태, 참여시작일, 참여종료일, 참여율 등을 관리) |
---|---|
URL | https://www.data.go.kr/data/15078050/fileData.do |
작성일 has constant value "" | Constant |
참여개월 is highly overall correlated with 참여일수 | High correlation |
참여일수 is highly overall correlated with 참여개월 | High correlation |
참여율 is highly overall correlated with 참여연구원인건비 | High correlation |
참여연구원인건비 is highly overall correlated with 참여율 | High correlation |
참여형태 is highly imbalanced (64.2%) | Imbalance |
참여율 has 21 (1.5%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 15:55:56.720699 |
---|---|
Analysis finished | 2023-12-12 15:55:59.984914 |
Duration | 3.26 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
사업_과제번호
Text
Distinct | 91 |
---|---|
Distinct (%) | 6.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.8 KiB |
Value | Count | Frequency (%) |
nk237b | 43 | 3.1% |
nk231b | 41 | 3.0% |
nk232d | 39 | 2.8% |
nk236i | 37 | 2.7% |
nk237a | 35 | 2.6% |
nk240a | 34 | 2.5% |
nk234a | 33 | 2.4% |
nk232f | 33 | 2.4% |
nk230c | 32 | 2.3% |
nk238f | 31 | 2.3% |
Other values (81) | 1013 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 1487 | |
K | 1343 | |
N | 1340 | |
3 | 1292 | |
0 | 335 | 4.1% |
A | 255 | 3.1% |
6 | 239 | 2.9% |
4 | 210 | 2.6% |
1 | 201 | 2.4% |
B | 194 | 2.4% |
Other values (14) | 1330 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 4151 | |
Uppercase Letter | 4075 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
K | 1343 | |
N | 1340 | |
A | 255 | 6.3% |
B | 194 | 4.8% |
C | 180 | 4.4% |
F | 178 | 4.4% |
D | 161 | 4.0% |
E | 139 | 3.4% |
G | 93 | 2.3% |
I | 62 | 1.5% |
Other values (5) | 130 | 3.2% |
Decimal Number
Value | Count | Frequency (%) |
2 | 1487 | |
3 | 1292 | |
0 | 335 | 8.1% |
6 | 239 | 5.8% |
4 | 210 | 5.1% |
1 | 201 | 4.8% |
7 | 188 | 4.5% |
8 | 143 | 3.4% |
9 | 56 | 1.3% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4151 | |
Latin | 4075 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
K | 1343 | |
N | 1340 | |
A | 255 | 6.3% |
B | 194 | 4.8% |
C | 180 | 4.4% |
F | 178 | 4.4% |
D | 161 | 4.0% |
E | 139 | 3.4% |
G | 93 | 2.3% |
I | 62 | 1.5% |
Other values (5) | 130 | 3.2% |
Common
Value | Count | Frequency (%) |
2 | 1487 | |
3 | 1292 | |
0 | 335 | 8.1% |
6 | 239 | 5.8% |
4 | 210 | 5.1% |
1 | 201 | 4.8% |
7 | 188 | 4.5% |
8 | 143 | 3.4% |
9 | 56 | 1.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8226 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 1487 | |
K | 1343 | |
N | 1340 | |
3 | 1292 | |
0 | 335 | 4.1% |
A | 255 | 3.1% |
6 | 239 | 2.9% |
4 | 210 | 2.6% |
1 | 201 | 2.4% |
B | 194 | 2.4% |
Other values (14) | 1330 |
참여자명
Text
Distinct | 114 |
---|---|
Distinct (%) | 8.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.8 KiB |
Value | Count | Frequency (%) |
정 | 58 | 4.2% |
성 | 55 | 4.0% |
준 | 54 | 3.9% |
현 | 51 | 3.7% |
영 | 49 | 3.6% |
상 | 49 | 3.6% |
재 | 46 | 3.4% |
동 | 45 | 3.3% |
민 | 41 | 3.0% |
승 | 39 | 2.8% |
Other values (104) | 886 |
Most occurring characters
Value | Count | Frequency (%) |
* | 2742 | |
정 | 58 | 1.4% |
성 | 55 | 1.3% |
준 | 54 | 1.3% |
현 | 51 | 1.2% |
영 | 49 | 1.2% |
상 | 49 | 1.2% |
재 | 46 | 1.1% |
동 | 45 | 1.1% |
민 | 41 | 1.0% |
Other values (105) | 923 | 22.4% |
Most occurring categories
Value | Count | Frequency (%) |
Other Punctuation | 2742 | |
Other Letter | 1369 | |
Space Separator | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
정 | 58 | 4.2% |
성 | 55 | 4.0% |
준 | 54 | 3.9% |
현 | 51 | 3.7% |
영 | 49 | 3.6% |
상 | 49 | 3.6% |
재 | 46 | 3.4% |
동 | 45 | 3.3% |
민 | 41 | 3.0% |
승 | 39 | 2.8% |
Other values (103) | 882 |
Other Punctuation
Value | Count | Frequency (%) |
* | 2742 |
Space Separator
Value | Count | Frequency (%) |
2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2744 | |
Hangul | 1369 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
정 | 58 | 4.2% |
성 | 55 | 4.0% |
준 | 54 | 3.9% |
현 | 51 | 3.7% |
영 | 49 | 3.6% |
상 | 49 | 3.6% |
재 | 46 | 3.4% |
동 | 45 | 3.3% |
민 | 41 | 3.0% |
승 | 39 | 2.8% |
Other values (103) | 882 |
Common
Value | Count | Frequency (%) |
* | 2742 | |
2 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2744 | |
Hangul | 1369 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
* | 2742 | |
2 | 0.1% |
Hangul
Value | Count | Frequency (%) |
정 | 58 | 4.2% |
성 | 55 | 4.0% |
준 | 54 | 3.9% |
현 | 51 | 3.7% |
영 | 49 | 3.6% |
상 | 49 | 3.6% |
재 | 46 | 3.4% |
동 | 45 | 3.3% |
민 | 41 | 3.0% |
승 | 39 | 2.8% |
Other values (103) | 882 |
참여형태
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.8 KiB |
연구원 | |
---|---|
책임자 | 93 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 책임자 |
---|---|
2nd row | 연구원 |
3rd row | 연구원 |
4th row | 연구원 |
5th row | 연구원 |
Common Values
Value | Count | Frequency (%) |
연구원 | 1278 | |
책임자 | 93 | 6.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
연구원 | 1278 | |
책임자 | 93 | 6.8% |
참여시작일
Date
Distinct | 48 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.8 KiB |
Minimum | 2021-01-01 00:00:00 |
---|---|
Maximum | 2022-12-01 00:00:00 |
참여종료일
Date
Distinct | 44 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.8 KiB |
Minimum | 2021-01-27 00:00:00 |
---|---|
Maximum | 2022-12-31 00:00:00 |
참여개월
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 12 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11.126185 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 12.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 4 |
Q1 | 12 |
median | 12 |
Q3 | 12 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2.4278148 |
---|---|
Coefficient of variation (CV) | 0.21820729 |
Kurtosis | 6.8421714 |
Mean | 11.126185 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -2.8277215 |
Sum | 15254 |
Variance | 5.8942846 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12 | 1176 | |
2 | 27 | 2.0% |
10 | 24 | 1.8% |
4 | 23 | 1.7% |
6 | 22 | 1.6% |
9 | 22 | 1.6% |
3 | 17 | 1.2% |
7 | 17 | 1.2% |
8 | 13 | 0.9% |
5 | 13 | 0.9% |
Other values (2) | 17 | 1.2% |
Value | Count | Frequency (%) |
1 | 10 | 0.7% |
2 | 27 | |
3 | 17 | |
4 | 23 | |
5 | 13 | |
6 | 22 | |
7 | 17 | |
8 | 13 | |
9 | 22 | |
10 | 24 |
Value | Count | Frequency (%) |
12 | 1176 | |
11 | 7 | 0.5% |
10 | 24 | 1.8% |
9 | 22 | 1.6% |
8 | 13 | 0.9% |
7 | 17 | 1.2% |
6 | 22 | 1.6% |
5 | 13 | 0.9% |
4 | 23 | 1.7% |
3 | 17 | 1.2% |
참여일수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 77 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 338.48505 |
Minimum | 17 |
---|---|
Maximum | 365 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 12.2 KiB |
Quantile statistics
Minimum | 17 |
---|---|
5-th percentile | 122 |
Q1 | 365 |
median | 365 |
Q3 | 365 |
95-th percentile | 365 |
Maximum | 365 |
Range | 348 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 73.652728 |
---|---|
Coefficient of variation (CV) | 0.21759522 |
Kurtosis | 6.9100609 |
Mean | 338.48505 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -2.8349037 |
Sum | 464063 |
Variance | 5424.7244 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
365 | 1173 | |
59 | 23 | 1.7% |
275 | 14 | 1.0% |
306 | 13 | 0.9% |
184 | 12 | 0.9% |
122 | 9 | 0.7% |
243 | 8 | 0.6% |
107 | 6 | 0.4% |
187 | 5 | 0.4% |
91 | 5 | 0.4% |
Other values (67) | 103 | 7.5% |
Value | Count | Frequency (%) |
17 | 1 | 0.1% |
25 | 1 | 0.1% |
27 | 1 | 0.1% |
29 | 1 | 0.1% |
31 | 2 | 0.1% |
32 | 4 | 0.3% |
53 | 1 | 0.1% |
59 | 23 | |
73 | 2 | 0.1% |
76 | 1 | 0.1% |
Value | Count | Frequency (%) |
365 | 1173 | |
364 | 1 | 0.1% |
362 | 2 | 0.1% |
342 | 1 | 0.1% |
334 | 2 | 0.1% |
329 | 1 | 0.1% |
321 | 3 | 0.2% |
319 | 1 | 0.1% |
314 | 1 | 0.1% |
312 | 1 | 0.1% |
참여율
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 522 |
---|---|
Distinct (%) | 38.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.827206 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 21 |
Zeros (%) | 1.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 12.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1.7 |
Q1 | 22.35 |
median | 44.1 |
Q3 | 52.05 |
95-th percentile | 99.35 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 29.7 |
Descriptive statistics
Standard deviation | 24.956924 |
---|---|
Coefficient of variation (CV) | 0.61128169 |
Kurtosis | 0.058112502 |
Mean | 40.827206 |
Median Absolute Deviation (MAD) | 15.1 |
Skewness | 0.46911993 |
Sum | 55974.1 |
Variance | 622.84804 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
48.0 | 78 | 5.7% |
100.0 | 66 | 4.8% |
10.0 | 32 | 2.3% |
24.0 | 29 | 2.1% |
20.0 | 26 | 1.9% |
0.0 | 21 | 1.5% |
51.0 | 19 | 1.4% |
19.6 | 13 | 0.9% |
50.7 | 13 | 0.9% |
2.0 | 10 | 0.7% |
Other values (512) | 1064 |
Value | Count | Frequency (%) |
0.0 | 21 | |
0.5 | 2 | 0.1% |
0.6 | 4 | 0.3% |
0.7 | 1 | 0.1% |
0.8 | 2 | 0.1% |
0.9 | 3 | 0.2% |
1.0 | 5 | 0.4% |
1.1 | 6 | 0.4% |
1.3 | 3 | 0.2% |
1.4 | 9 |
Value | Count | Frequency (%) |
100.0 | 66 | |
99.9 | 1 | 0.1% |
99.7 | 1 | 0.1% |
99.6 | 1 | 0.1% |
99.1 | 2 | 0.1% |
98.8 | 1 | 0.1% |
98.1 | 1 | 0.1% |
97.8 | 1 | 0.1% |
96.9 | 1 | 0.1% |
96.5 | 2 | 0.1% |
참여연구원인건비
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 1258 |
---|---|
Distinct (%) | 91.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 29630409 |
Minimum | 57446 |
---|---|
Maximum | 1.18655 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 12.2 KiB |
Quantile statistics
Minimum | 57446 |
---|---|
5-th percentile | 1659500 |
Q1 | 11445828 |
median | 27892000 |
Q3 | 44440000 |
95-th percentile | 63297152 |
Maximum | 1.18655 × 108 |
Range | 1.1859755 × 108 |
Interquartile range (IQR) | 32994172 |
Descriptive statistics
Standard deviation | 20958774 |
---|---|
Coefficient of variation (CV) | 0.70734003 |
Kurtosis | 0.016081396 |
Mean | 29630409 |
Median Absolute Deviation (MAD) | 16500344 |
Skewness | 0.57968134 |
Sum | 4.0623291 × 1010 |
Variance | 4.3927023 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2000000 | 19 | 1.4% |
3000000 | 16 | 1.2% |
1000000 | 12 | 0.9% |
1400000 | 8 | 0.6% |
6000000 | 6 | 0.4% |
1300000 | 5 | 0.4% |
600000 | 5 | 0.4% |
1500000 | 5 | 0.4% |
550000 | 4 | 0.3% |
2500000 | 4 | 0.3% |
Other values (1248) | 1287 |
Value | Count | Frequency (%) |
57446 | 1 | 0.1% |
300000 | 1 | 0.1% |
400000 | 1 | 0.1% |
550000 | 4 | |
599206 | 1 | 0.1% |
600000 | 5 | |
800000 | 1 | 0.1% |
860000 | 1 | 0.1% |
900000 | 3 | |
960000 | 1 | 0.1% |
Value | Count | Frequency (%) |
118655000 | 1 | |
111390228 | 1 | |
100522000 | 1 | |
99535113 | 1 | |
97337000 | 1 | |
96135407 | 1 | |
93790625 | 1 | |
93590000 | 1 | |
92611267 | 1 | |
90490000 | 1 |
작성일
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 10.8 KiB |
Minimum | 2023-07-28 00:00:00 |
---|---|
Maximum | 2023-07-28 00:00:00 |
사업_과제번호 | 참여형태 | 참여시작일 | 참여종료일 | 참여개월 | 참여일수 | 참여율 | 참여연구원인건비 | |
---|---|---|---|---|---|---|---|---|
사업_과제번호 | 1.000 | 0.182 | 0.662 | 0.478 | 0.585 | 0.589 | 0.780 | 0.513 |
참여형태 | 0.182 | 1.000 | 0.000 | 0.000 | 0.084 | 0.091 | 0.212 | 0.323 |
참여시작일 | 0.662 | 0.000 | 1.000 | 0.000 | 0.950 | 0.940 | 0.591 | 0.000 |
참여종료일 | 0.478 | 0.000 | 0.000 | 1.000 | 0.912 | 0.911 | 0.333 | 0.000 |
참여개월 | 0.585 | 0.084 | 0.950 | 0.912 | 1.000 | 0.995 | 0.445 | 0.379 |
참여일수 | 0.589 | 0.091 | 0.940 | 0.911 | 0.995 | 1.000 | 0.431 | 0.367 |
참여율 | 0.780 | 0.212 | 0.591 | 0.333 | 0.445 | 0.431 | 1.000 | 0.803 |
참여연구원인건비 | 0.513 | 0.323 | 0.000 | 0.000 | 0.379 | 0.367 | 0.803 | 1.000 |
참여개월 | 참여일수 | 참여율 | 참여연구원인건비 | 참여형태 | |
---|---|---|---|---|---|
참여개월 | 1.000 | 0.994 | -0.111 | 0.363 | 0.064 |
참여일수 | 0.994 | 1.000 | -0.111 | 0.361 | 0.069 |
참여율 | -0.111 | -0.111 | 1.000 | 0.640 | 0.178 |
참여연구원인건비 | 0.363 | 0.361 | 0.640 | 1.000 | 0.247 |
참여형태 | 0.064 | 0.069 | 0.178 | 0.247 | 1.000 |
사업_과제번호 | 참여자명 | 참여형태 | 참여시작일 | 참여종료일 | 참여개월 | 참여일수 | 참여율 | 참여연구원인건비 | 작성일 | |
---|---|---|---|---|---|---|---|---|---|---|
0 | NK237C | *민* | 책임자 | 2022-01-01 | 2022-12-31 | 12 | 365 | 43.5 | 41572220 | 2023-07-28 |
1 | NK237C | *한* | 연구원 | 2022-01-01 | 2022-12-31 | 12 | 365 | 35.2 | 38266465 | 2023-07-28 |
2 | NK237C | *영* | 연구원 | 2022-01-01 | 2022-12-31 | 12 | 365 | 44.7 | 52254718 | 2023-07-28 |
3 | NK232C | *치* | 연구원 | 2021-01-01 | 2021-12-31 | 12 | 365 | 48.0 | 56710000 | 2023-07-28 |
4 | NK232C | *재* | 연구원 | 2021-01-01 | 2021-12-31 | 12 | 365 | 47.7 | 51518000 | 2023-07-28 |
5 | NK232C | *태* | 연구원 | 2021-01-01 | 2021-12-31 | 12 | 365 | 48.0 | 46801000 | 2023-07-28 |
6 | NK232C | *상* | 연구원 | 2021-01-01 | 2021-12-31 | 12 | 365 | 48.0 | 42109000 | 2023-07-28 |
7 | NK232C | *상* | 연구원 | 2021-01-01 | 2021-12-31 | 12 | 365 | 48.0 | 35402000 | 2023-07-28 |
8 | NK232C | *승* | 연구원 | 2021-01-01 | 2021-12-31 | 12 | 365 | 49.6 | 37199540 | 2023-07-28 |
9 | NK232C | *선* | 연구원 | 2021-01-01 | 2021-12-31 | 12 | 365 | 48.0 | 29069000 | 2023-07-28 |
사업_과제번호 | 참여자명 | 참여형태 | 참여시작일 | 참여종료일 | 참여개월 | 참여일수 | 참여율 | 참여연구원인건비 | 작성일 | |
---|---|---|---|---|---|---|---|---|---|---|
1361 | NK236G | *수* | 연구원 | 2022-01-01 | 2022-12-31 | 12 | 365 | 22.0 | 17048197 | 2023-07-28 |
1362 | NK236G | *준* | 연구원 | 2022-09-14 | 2022-12-31 | 4 | 109 | 49.2 | 10230605 | 2023-07-28 |
1363 | NK236G | *봉* | 연구원 | 2022-01-01 | 2022-02-22 | 2 | 53 | 92.5 | 2979063 | 2023-07-28 |
1364 | NK236G | *명* | 연구원 | 2022-01-01 | 2022-12-31 | 12 | 365 | 18.2 | 4993163 | 2023-07-28 |
1365 | NK236G | *종* | 연구원 | 2022-01-01 | 2022-12-31 | 12 | 365 | 100.0 | 7870630 | 2023-07-28 |
1366 | NK236C | *명* | 연구원 | 2022-01-01 | 2022-12-31 | 12 | 365 | 100.0 | 19049125 | 2023-07-28 |
1367 | NK236C | *구* | 연구원 | 2022-01-01 | 2022-12-31 | 12 | 365 | 100.0 | 26700875 | 2023-07-28 |
1368 | NK236C | *혁* | 연구원 | 2022-01-01 | 2022-12-31 | 12 | 365 | 35.6 | 8000000 | 2023-07-28 |
1369 | NK236C | *규* | 연구원 | 2022-01-01 | 2022-12-31 | 12 | 365 | 56.1 | 17000000 | 2023-07-28 |
1370 | NK236C | *재* | 연구원 | 2022-09-01 | 2022-12-31 | 4 | 122 | 0.0 | 6250000 | 2023-07-28 |