Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 634 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 45.9 KiB |
Average record size in memory | 74.2 B |
Variable types
Categorical | 6 |
---|---|
Text | 1 |
Boolean | 1 |
Numeric | 1 |
Dataset
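Description | Information used in the research-management domain of the Korea Institute of Machinery and Materials (한국기계연구원) to manage the monthly participation of researchers listed in project/task plans (project number, participant, participation year-month, participation start date, participation end date, participating researcher labor cost, etc.) |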
---|---|
URL | https://www.data.go.kr/data/15078045/fileData.do |
Reproduction
Analysis started | 2023-12-13 00:49:03.498992 |
---|---|
Analysis finished | 2023-12-13 00:49:03.982262 |
Duration | 0.48 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
사업_과제번호
Categorical
Distinct | 48 |
---|---|
Distinct (%) | 7.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.1 KiB |
NK237B | 36 |
---|---|
NK236I | 36 |
NK240A | 33 |
NK237A | 29 |
NK236C | 29 |
Other values (43) | 471 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | NK237C |
---|---|
2nd row | NK237C |
3rd row | NK237C |
4th row | NK237C |
5th row | NK237C |
Common Values
Value | Count | Frequency (%) |
NK237B | 36 | 5.7% |
NK236I | 36 | 5.7% |
NK240A | 33 | 5.2% |
NK237A | 29 | 4.6% |
NK236C | 29 | 4.6% |
NK238F | 27 | 4.3% |
NK238D | 26 | 4.1% |
NK237G | 25 | 3.9% |
NK238B | 24 | 3.8% |
NK236F | 23 | 3.6% |
Other values (38) | 346 | 54.6% |
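The frequency column and the "Other values" row can be re-derived from the counts in the table above. The following is a minimal sketch, assuming the ten counts are copied by hand from that table and that the denominator is the 634 observations listed under the dataset statistics. The variable names are illustrative and do not come from the report.

```python
# Counts copied from the Common Values table for 사업_과제번호.
top_counts = {
    "NK237B": 36, "NK236I": 36, "NK240A": 33, "NK237A": 29, "NK236C": 29,
    "NK238F": 27, "NK238D": 26, "NK237G": 25, "NK238B": 24, "NK236F": 23,
}
n_observations = 634  # "Number of observations" in the dataset statistics
n_distinct = 48       # "Distinct" for this variable

# Frequency (%) is count / total observations, shown to one decimal place.
frequencies = {code: round(100 * count / n_observations, 1)
               for code, count in top_counts.items()}

# Values not listed individually are pooled into the "Other values" row.
other_count = n_observations - sum(top_counts.values())
other_distinct = n_distinct - len(top_counts)

print(frequencies["NK237B"], other_distinct, other_count)
```

Under these assumptions the pooled row comes out as 38 distinct values covering 346 observations, which agrees with the table.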
참여자명
Text
Distinct | 105 |
---|---|
Distinct (%) | 16.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.1 KiB |
Value | Count | Frequency (%) |
정 | 29 | 4.6% |
준 | 26 | 4.1% |
성 | 26 | 4.1% |
현 | 24 | 3.8% |
상 | 23 | 3.6% |
영 | 23 | 3.6% |
재 | 23 | 3.6% |
동 | 22 | 3.5% |
민 | 19 | 3.0% |
지 | 17 | 2.7% |
Other values (95) | 403 |
Most occurring characters
Value | Count | Frequency (%) |
* | 1268 | 66.7% |
정 | 29 | 1.5% |
성 | 26 | 1.4% |
준 | 26 | 1.4% |
현 | 24 | 1.3% |
상 | 23 | 1.2% |
영 | 23 | 1.2% |
재 | 23 | 1.2% |
동 | 22 | 1.2% |
민 | 19 | 1.0% |
Other values (96) | 419 | 22.0% |
Most occurring categories
Value | Count | Frequency (%) |
Other Punctuation | 1268 | 66.7% |
Other Letter | 633 | 33.3% |
Space Separator | 1 | 0.1% |
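The category labels in this table correspond to Unicode general categories. The sketch below shows one way such counts could be obtained with Python's standard `unicodedata` module. It is an illustration only: the three names are taken from the sample rows of this report, and the `LABELS` mapping is a hypothetical helper, not something the profiling tool exposes.

```python
import unicodedata
from collections import Counter

# Three masked names taken from the sample rows of this report.
names = ["*현*", "*해*", "*은*"]

# Hypothetical mapping from Unicode general-category codes to the
# labels used in the table above.
LABELS = {
    "Po": "Other Punctuation",
    "Lo": "Other Letter",
    "Zs": "Space Separator",
}

# Classify every character of every name and tally the categories.
category_counts = Counter(
    LABELS.get(unicodedata.category(ch), unicodedata.category(ch))
    for name in names
    for ch in name
)
print(category_counts)
```

Each masked name contributes two `*` characters, which would be consistent with the 1268 punctuation characters reported for 634 rows. The single space separator suggests that one name holds a space where the others hold a Hangul syllable, though the report does not say so explicitly.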
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
정 | 29 | 4.6% |
성 | 26 | 4.1% |
준 | 26 | 4.1% |
현 | 24 | 3.8% |
상 | 23 | 3.6% |
영 | 23 | 3.6% |
재 | 23 | 3.6% |
동 | 22 | 3.5% |
민 | 19 | 3.0% |
지 | 17 | 2.7% |
Other values (94) | 401 |
Other Punctuation
Value | Count | Frequency (%) |
* | 1268 |
Space Separator
Value | Count | Frequency (%) |
(space) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1269 | 66.7% |
Hangul | 633 | 33.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
정 | 29 | 4.6% |
성 | 26 | 4.1% |
준 | 26 | 4.1% |
현 | 24 | 3.8% |
상 | 23 | 3.6% |
영 | 23 | 3.6% |
재 | 23 | 3.6% |
동 | 22 | 3.5% |
민 | 19 | 3.0% |
지 | 17 | 2.7% |
Other values (94) | 401 |
Common
Value | Count | Frequency (%) |
* | 1268 | 99.9% |
(space) | 1 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1269 | 66.7% |
Hangul | 633 | 33.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
* | 1268 | 99.9% |
(space) | 1 | 0.1% |
Hangul
Value | Count | Frequency (%) |
정 | 29 | 4.6% |
성 | 26 | 4.1% |
준 | 26 | 4.1% |
현 | 24 | 3.8% |
상 | 23 | 3.6% |
영 | 23 | 3.6% |
재 | 23 | 3.6% |
동 | 22 | 3.5% |
민 | 19 | 3.0% |
지 | 17 | 2.7% |
Other values (94) | 401 |
참여연구원참여적용년월일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.1 KiB |
2022-01-28 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2022-01-28 |
---|---|
2nd row | 2022-01-28 |
3rd row | 2022-01-28 |
4th row | 2022-01-28 |
5th row | 2022-01-28 |
Common Values
Value | Count | Frequency (%) |
2022-01-28 | 634 | 100.0% |
참여시작일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.1 KiB |
2022-01-01 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2022-01-01 |
---|---|
2nd row | 2022-01-01 |
3rd row | 2022-01-01 |
4th row | 2022-01-01 |
5th row | 2022-01-01 |
Common Values
Value | Count | Frequency (%) |
2022-01-01 | 634 | 100.0% |
참여종료일
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.1 KiB |
2022-01-31 | 633 |
---|---|
2022-01-25 | 1 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 2022-01-31 |
---|---|
2nd row | 2022-01-31 |
3rd row | 2022-01-31 |
4th row | 2022-01-31 |
5th row | 2022-01-31 |
Common Values
Value | Count | Frequency (%) |
2022-01-31 | 633 | 99.8% |
2022-01-25 | 1 | 0.2% |
참여일수
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.1 KiB |
31 | 633 |
---|---|
25 | 1 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 31 |
---|---|
2nd row | 31 |
3rd row | 31 |
4th row | 31 |
5th row | 31 |
Common Values
Value | Count | Frequency (%) |
31 | 633 | 99.8% |
25 | 1 | 0.2% |
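The two 참여일수 values (31 and 25) line up with the two 참여종료일 values (2022-01-31 and 2022-01-25) when days are counted inclusively from the single 참여시작일 of 2022-01-01. The report does not state how 참여일수 is computed, so the sketch below is an assumption about the rule, and `inclusive_days` is a hypothetical helper name.

```python
from datetime import date


def inclusive_days(start: date, end: date) -> int:
    """Number of calendar days from start to end, counting both endpoints."""
    return (end - start).days + 1


# The only 참여시작일 value in this report.
start = date(2022, 1, 1)

# The two 참여종료일 values in this report.
full_month = inclusive_days(start, date(2022, 1, 31))
early_exit = inclusive_days(start, date(2022, 1, 25))
print(full_month, early_exit)
```

If this assumption holds, 참여일수 carries no information beyond the start and end dates, which would fit the correlation reported between 참여종료일 and 참여일수.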
집행여부
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 766.0 B |
True |
---|
Value | Count | Frequency (%) |
True | 634 | 100.0% |
인건비배분금액
Real number (ℝ)
Distinct | 593 |
---|---|
Distinct (%) | 93.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2933307.8 |
Minimum | 10000 |
Maximum | 10077548 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.7 KiB |
Quantile statistics
Minimum | 10000 |
---|---|
5-th percentile | 139147.15 |
Q1 | 1634333.2 |
median | 3141149.5 |
Q3 | 4170540 |
95-th percentile | 5356451.8 |
Maximum | 10077548 |
Range | 10067548 |
Interquartile range (IQR) | 2536206.8 |
Descriptive statistics
Standard deviation | 1737364 |
---|---|
Coefficient of variation (CV) | 0.59228832 |
Kurtosis | -0.22241845 |
Mean | 2933307.8 |
Median Absolute Deviation (MAD) | 1274567 |
Skewness | 0.1364348 |
Sum | 1.8597172 × 10^9 |
Variance | 3.0184335 × 10^12 |
Monotonicity | Not monotonic |
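Several of the figures above are derived from others in the same tables, so they can be cross-checked. The sketch below is a rough consistency check, assuming the values are copied by hand from this report. Because the reported mean and standard deviation are rounded, the recomputed sum, variance, and coefficient of variation are expected to match only to about seven significant digits.

```python
# Values copied from the 인건비배분금액 statistics in this report.
n = 634
mean, std = 2933307.8, 1737364
minimum, maximum = 10000, 10077548
q1, q3 = 1634333.2, 4170540

value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # Interquartile range = Q3 - Q1
cv = std / mean                  # Coefficient of variation = std / mean
total = mean * n                 # Sum = mean * number of observations
variance = std ** 2              # Variance = std squared

print(value_range, round(iqr, 1), round(cv, 4))
print(f"{total:.4e}", f"{variance:.4e}")
```

A median slightly above the mean, together with a skewness close to zero, is consistent with a fairly symmetric distribution, although these summary figures alone cannot confirm its shape.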
Common Values
Value | Count | Frequency (%) |
169863 | 15 | 2.4% |
84932 | 6 | 0.9% |
2000000 | 6 | 0.9% |
46712 | 4 | 0.6% |
3212449 | 3 | 0.5% |
254795 | 3 | 0.5% |
76438 | 3 | 0.5% |
1167044 | 2 | 0.3% |
110411 | 2 | 0.3% |
127397 | 2 | 0.3% |
Other values (583) | 588 | 92.7% |
Minimum 10 values
Value | Count | Frequency (%) |
10000 | 1 | 0.2% |
46712 | 4 | 0.6% |
67945 | 1 | 0.2% |
73041 | 1 | 0.2% |
76438 | 3 | 0.5% |
81534 | 1 | 0.2% |
84932 | 6 | 0.9% |
92151 | 1 | 0.2% |
93425 | 1 | 0.2% |
103871 | 1 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
10077548 | 1 | 0.2% |
8070022 | 1 | 0.2% |
7978466 | 1 | 0.2% |
7685452 | 1 | 0.2% |
7596189 | 1 | 0.2% |
7571219 | 1 | 0.2% |
7549732 | 1 | 0.2% |
7213742 | 1 | 0.2% |
7125074 | 1 | 0.2% |
6887436 | 1 | 0.2% |
작성일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.1 KiB |
2023-07-28 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2023-07-28 |
---|---|
2nd row | 2023-07-28 |
3rd row | 2023-07-28 |
4th row | 2023-07-28 |
5th row | 2023-07-28 |
Common Values
Value | Count | Frequency (%) |
2023-07-28 | 634 | 100.0% |
Correlations
Variable | 사업_과제번호 | 참여종료일 | 참여일수 | 인건비배분금액 |
---|---|---|---|---|
사업_과제번호 | 1.000 | 0.000 | 0.000 | 0.692 |
참여종료일 | 0.000 | 1.000 | 0.705 | 0.000 |
참여일수 | 0.000 | 0.705 | 1.000 | 0.000 |
인건비배분금액 | 0.692 | 0.000 | 0.000 | 1.000 |
Variable | 사업_과제번호 | 참여일수 | 참여종료일 |
---|---|---|---|
사업_과제번호 | 1.000 | 0.000 | 0.000 |
참여일수 | 0.000 | 1.000 | 0.498 |
참여종료일 | 0.000 | 0.498 | 1.000 |
Variable | 인건비배분금액 | 사업_과제번호 | 참여종료일 | 참여일수 |
---|---|---|---|---|
인건비배분금액 | 1.000 | 0.304 | 0.000 | 0.000 |
사업_과제번호 | 0.304 | 1.000 | 0.000 | 0.000 |
참여종료일 | 0.000 | 0.000 | 1.000 | 0.498 |
참여일수 | 0.000 | 0.000 | 0.498 | 1.000 |
First rows
Index | 사업_과제번호 | 참여자명 | 참여연구원참여적용년월일 | 참여시작일 | 참여종료일 | 참여일수 | 집행여부 | 인건비배분금액 | 작성일 |
---|---|---|---|---|---|---|---|---|---|
0 | NK237C | *현* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 4216340 | 2023-07-28 |
1 | NK237C | *해* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 2651493 | 2023-07-28 |
2 | NK237C | *은* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 1357290 | 2023-07-28 |
3 | NK237C | *연* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 1760800 | 2023-07-28 |
4 | NK237C | *상* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 1785855 | 2023-07-28 |
5 | NK237C | *정* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 3658849 | 2023-07-28 |
6 | NK237C | *경* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 3209222 | 2023-07-28 |
7 | NK237C | *제* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 4499756 | 2023-07-28 |
8 | NK237C | *원* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 3065263 | 2023-07-28 |
9 | NK237C | *영* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 5167488 | 2023-07-28 |
Last rows
Index | 사업_과제번호 | 참여자명 | 참여연구원참여적용년월일 | 참여시작일 | 참여종료일 | 참여일수 | 집행여부 | 인건비배분금액 | 작성일 |
---|---|---|---|---|---|---|---|---|---|
624 | NK240A | *용* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 6505707 | 2023-07-28 |
625 | NK240A | *건* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 2943132 | 2023-07-28 |
626 | NK239I | *준* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 671299 | 2023-07-28 |
627 | NK239I | *평* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 707055 | 2023-07-28 |
628 | NK239I | *혁* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 647178 | 2023-07-28 |
629 | NK239I | *윤* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 856110 | 2023-07-28 |
630 | NK239B | *민* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 659493 | 2023-07-28 |
631 | NK239B | *기* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 736356 | 2023-07-28 |
632 | NK239B | *장* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 757674 | 2023-07-28 |
633 | NK239B | *현* | 2022-01-28 | 2022-01-01 | 2022-01-31 | 31 | Y | 904351 | 2023-07-28 |