Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 573 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 42.1 KiB |
Average record size in memory | 75.2 B |
Variable types
Categorical | 2 |
---|---|
Text | 1 |
DateTime | 4 |
Numeric | 2 |
Dataset
Description | 한국기계연구원의 연구관리 분야에서 사업/과제계획서참여연구원월별참여율을 관리하는 테이블 정보(과제번호, 참여자, 참여월, 참여시작일, 참여종료일, 참여율 등을 관리) |
---|---|
URL | https://www.data.go.kr/data/15078048/fileData.do |
참여연구원참여적용년월일 has constant value "" | Constant |
작성일 has constant value "" | Constant |
참여율 is highly overall correlated with 참여연구원인건비 | High correlation |
참여연구원인건비 is highly overall correlated with 참여율 | High correlation |
참여일수 is highly imbalanced (97.7%) | Imbalance |
Reproduction
Analysis started | 2023-12-12 06:40:15.581021 |
---|---|
Analysis finished | 2023-12-12 06:40:16.720993 |
Duration | 1.14 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
사업_과제번호
Categorical
Distinct | 48 |
---|---|
Distinct (%) | 8.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.6 KiB |
NK236I | 34 |
---|---|
NK240A | 31 |
NK237B | 29 |
NK237A | 26 |
NK236C | 25 |
Other values (43) |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | NK240K |
---|---|
2nd row | NK240B |
3rd row | NK240B |
4th row | NK240K |
5th row | NK240B |
Common Values
Value | Count | Frequency (%) |
NK236I | 34 | 5.9% |
NK240A | 31 | 5.4% |
NK237B | 29 | 5.1% |
NK237A | 26 | 4.5% |
NK236C | 25 | 4.4% |
NK237G | 24 | 4.2% |
NK238B | 23 | 4.0% |
NK238F | 22 | 3.8% |
NK237C | 21 | 3.7% |
NK236F | 21 | 3.7% |
Other values (38) | 317 |
Length
Value | Count | Frequency (%) |
nk236i | 34 | 5.9% |
nk240a | 31 | 5.4% |
nk237b | 29 | 5.1% |
nk237a | 26 | 4.5% |
nk236c | 25 | 4.4% |
nk237g | 24 | 4.2% |
nk238b | 23 | 4.0% |
nk238f | 22 | 3.8% |
nk237c | 21 | 3.7% |
nk236f | 21 | 3.7% |
Other values (38) | 317 |
참여자명
Text
Distinct | 92 |
---|---|
Distinct (%) | 16.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.6 KiB |
Value | Count | Frequency (%) |
정 | 26 | 4.5% |
준 | 25 | 4.4% |
영 | 23 | 4.0% |
상 | 22 | 3.8% |
현 | 22 | 3.8% |
성 | 22 | 3.8% |
동 | 20 | 3.5% |
재 | 20 | 3.5% |
민 | 17 | 3.0% |
승 | 15 | 2.6% |
Other values (82) | 361 |
Most occurring characters
Value | Count | Frequency (%) |
* | 1146 | |
정 | 26 | 1.5% |
준 | 25 | 1.5% |
영 | 23 | 1.3% |
성 | 22 | 1.3% |
상 | 22 | 1.3% |
현 | 22 | 1.3% |
동 | 20 | 1.2% |
재 | 20 | 1.2% |
민 | 17 | 1.0% |
Other values (83) | 376 | 21.9% |
Most occurring categories
Value | Count | Frequency (%) |
Other Punctuation | 1146 | |
Other Letter | 573 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
정 | 26 | 4.5% |
준 | 25 | 4.4% |
영 | 23 | 4.0% |
성 | 22 | 3.8% |
상 | 22 | 3.8% |
현 | 22 | 3.8% |
동 | 20 | 3.5% |
재 | 20 | 3.5% |
민 | 17 | 3.0% |
승 | 15 | 2.6% |
Other values (82) | 361 |
Other Punctuation
Value | Count | Frequency (%) |
* | 1146 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1146 | |
Hangul | 573 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
정 | 26 | 4.5% |
준 | 25 | 4.4% |
영 | 23 | 4.0% |
성 | 22 | 3.8% |
상 | 22 | 3.8% |
현 | 22 | 3.8% |
동 | 20 | 3.5% |
재 | 20 | 3.5% |
민 | 17 | 3.0% |
승 | 15 | 2.6% |
Other values (82) | 361 |
Common
Value | Count | Frequency (%) |
* | 1146 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1146 | |
Hangul | 573 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
* | 1146 |
Hangul
Value | Count | Frequency (%) |
정 | 26 | 4.5% |
준 | 25 | 4.4% |
영 | 23 | 4.0% |
성 | 22 | 3.8% |
상 | 22 | 3.8% |
현 | 22 | 3.8% |
동 | 20 | 3.5% |
재 | 20 | 3.5% |
민 | 17 | 3.0% |
승 | 15 | 2.6% |
Other values (82) | 361 |
참여연구원참여적용년월일
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.6 KiB |
Minimum | 2022-01-28 00:00:00 |
---|---|
Maximum | 2022-01-28 00:00:00 |
참여시작일
Date
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.6 KiB |
Minimum | 2022-01-01 00:00:00 |
---|---|
Maximum | 2022-01-24 00:00:00 |
참여종료일
Date
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.6 KiB |
Minimum | 2022-01-25 00:00:00 |
---|---|
Maximum | 2022-01-31 00:00:00 |
참여일수
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.6 KiB |
31 | |
---|---|
8 | 1 |
25 | 1 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.9982548 |
Min length | 1 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 0.3% |
Sample
1st row | 31 |
---|---|
2nd row | 31 |
3rd row | 31 |
4th row | 31 |
5th row | 31 |
Common Values
Value | Count | Frequency (%) |
31 | 571 | |
8 | 1 | 0.2% |
25 | 1 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
31 | 571 | |
8 | 1 | 0.2% |
25 | 1 | 0.2% |
참여율
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 264 |
---|---|
Distinct (%) | 46.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.639616 |
Minimum | 0.5 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.2 KiB |
Quantile statistics
Minimum | 0.5 |
---|---|
5-th percentile | 1.66 |
Q1 | 20.9 |
median | 49.4 |
Q3 | 52.6 |
95-th percentile | 63.5 |
Maximum | 100 |
Range | 99.5 |
Interquartile range (IQR) | 31.7 |
Descriptive statistics
Standard deviation | 21.61813 |
---|---|
Coefficient of variation (CV) | 0.54536679 |
Kurtosis | -0.20571018 |
Mean | 39.639616 |
Median Absolute Deviation (MAD) | 8.4 |
Skewness | -0.18699418 |
Sum | 22713.5 |
Variance | 467.34355 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10.0 | 13 | 2.3% |
51.8 | 11 | 1.9% |
20.1 | 9 | 1.6% |
52.6 | 9 | 1.6% |
52.0 | 8 | 1.4% |
100.0 | 8 | 1.4% |
52.2 | 8 | 1.4% |
50.4 | 7 | 1.2% |
52.3 | 7 | 1.2% |
51.7 | 7 | 1.2% |
Other values (254) | 486 |
Value | Count | Frequency (%) |
0.5 | 1 | 0.2% |
0.6 | 3 | |
0.9 | 3 | |
1.0 | 2 | 0.3% |
1.1 | 6 | |
1.3 | 2 | 0.3% |
1.4 | 6 | |
1.5 | 1 | 0.2% |
1.6 | 5 | |
1.7 | 3 |
Value | Count | Frequency (%) |
100.0 | 8 | |
99.0 | 1 | 0.2% |
98.3 | 1 | 0.2% |
85.5 | 1 | 0.2% |
82.4 | 1 | 0.2% |
82.0 | 1 | 0.2% |
81.5 | 1 | 0.2% |
81.4 | 1 | 0.2% |
80.3 | 1 | 0.2% |
80.1 | 1 | 0.2% |
참여연구원인건비
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 537 |
---|---|
Distinct (%) | 93.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3020252.4 |
Minimum | 46712 |
---|---|
Maximum | 10077548 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.2 KiB |
Quantile statistics
Minimum | 46712 |
---|---|
5-th percentile | 132085.4 |
Q1 | 1692855 |
median | 3260945 |
Q3 | 4260504 |
95-th percentile | 5352281.4 |
Maximum | 10077548 |
Range | 10030836 |
Interquartile range (IQR) | 2567649 |
Descriptive statistics
Standard deviation | 1743239 |
---|---|
Coefficient of variation (CV) | 0.57718321 |
Kurtosis | -0.18733202 |
Mean | 3020252.4 |
Median Absolute Deviation (MAD) | 1267178 |
Skewness | 0.031285158 |
Sum | 1.7306046 × 109 |
Variance | 3.0388821 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
169863 | 15 | 2.6% |
84932 | 6 | 1.0% |
46712 | 4 | 0.7% |
254795 | 3 | 0.5% |
76438 | 3 | 0.5% |
3212449 | 3 | 0.5% |
3903877 | 2 | 0.3% |
622973 | 2 | 0.3% |
1167044 | 2 | 0.3% |
2509896 | 2 | 0.3% |
Other values (527) | 531 |
Value | Count | Frequency (%) |
46712 | 4 | |
67945 | 1 | 0.2% |
73041 | 1 | 0.2% |
76438 | 3 | |
81534 | 1 | 0.2% |
84932 | 6 | |
92151 | 1 | 0.2% |
93425 | 1 | 0.2% |
103871 | 1 | 0.2% |
104466 | 1 | 0.2% |
Value | Count | Frequency (%) |
10077548 | 1 | |
8070022 | 1 | |
7978466 | 1 | |
7685452 | 1 | |
7596189 | 1 | |
7571219 | 1 | |
7549732 | 1 | |
7213742 | 1 | |
7125074 | 1 | |
6887436 | 1 |
작성일
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.6 KiB |
Minimum | 2023-07-28 00:00:00 |
---|---|
Maximum | 2023-07-28 00:00:00 |
사업_과제번호 | 참여자명 | 참여시작일 | 참여종료일 | 참여일수 | 참여율 | 참여연구원인건비 | |
---|---|---|---|---|---|---|---|
사업_과제번호 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.807 | 0.709 |
참여자명 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.476 | 0.486 |
참여시작일 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 | 0.139 | 0.000 |
참여종료일 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 |
참여일수 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 |
참여율 | 0.807 | 0.476 | 0.139 | 0.000 | 0.000 | 1.000 | 0.837 |
참여연구원인건비 | 0.709 | 0.486 | 0.000 | 0.000 | 0.000 | 0.837 | 1.000 |
참여일수 | 사업_과제번호 | |
---|---|---|
참여일수 | 1.000 | 0.000 |
사업_과제번호 | 0.000 | 1.000 |
참여율 | 참여연구원인건비 | 사업_과제번호 | 참여일수 | |
---|---|---|---|---|
참여율 | 1.000 | 0.843 | 0.414 | 0.000 |
참여연구원인건비 | 0.843 | 1.000 | 0.330 | 0.000 |
사업_과제번호 | 0.414 | 0.330 | 1.000 | 0.000 |
참여일수 | 0.000 | 0.000 | 0.000 | 1.000 |
사업_과제번호 | 참여자명 | 참여연구원참여적용년월일 | 참여시작일 | 참여종료일 | 참여일수 | 참여율 | 참여연구원인건비 | 작성일 | |
---|---|---|---|---|---|---|---|---|---|
0 | NK240K | *정* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 0.6 | 46712 | 2023-07-28 |
1 | NK240B | *종* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 55.5 | 5060134 | 2023-07-28 |
2 | NK240B | *경* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 31.2 | 2305721 | 2023-07-28 |
3 | NK240K | *동* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 0.5 | 46712 | 2023-07-28 |
4 | NK240B | *동* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 31.9 | 2497751 | 2023-07-28 |
5 | NK237B | *명* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 21.6 | 1683767 | 2023-07-28 |
6 | NK237B | *유* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 35.5 | 3011162 | 2023-07-28 |
7 | NK237B | *용* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 52.2 | 5299301 | 2023-07-28 |
8 | NK237B | *필* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 51.4 | 5033975 | 2023-07-28 |
9 | NK239A | *준* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 30.2 | 2819726 | 2023-07-28 |
사업_과제번호 | 참여자명 | 참여연구원참여적용년월일 | 참여시작일 | 참여종료일 | 참여일수 | 참여율 | 참여연구원인건비 | 작성일 | |
---|---|---|---|---|---|---|---|---|---|
563 | NK240A | *영* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 34.7 | 2746430 | 2023-07-28 |
564 | NK240A | *현* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 53.6 | 3956874 | 2023-07-28 |
565 | NK240A | *정* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 34.4 | 2845460 | 2023-07-28 |
566 | NK240C | *형* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 1.4 | 131644 | 2023-07-28 |
567 | NK240C | *재* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 1.6 | 144553 | 2023-07-28 |
568 | NK240C | *기* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 1.4 | 140901 | 2023-07-28 |
569 | NK240C | *기* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 1.4 | 131729 | 2023-07-28 |
570 | NK240C | *순* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 1.3 | 104466 | 2023-07-28 |
571 | NK240C | *준* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 1.4 | 103871 | 2023-07-28 |
572 | NK240C | *학* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | 1.4 | 92151 | 2023-07-28 |