Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 2043 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 167.7 KiB |
Average record size in memory | 84.1 B |
Variable types
Text | 3 |
---|---|
Numeric | 3 |
Categorical | 2 |
DateTime | 2 |
Dataset
Description | 한국기계연구원에서 발표한 논문 과제정보(사업과제신청번호,사업_과제번호,성과활동관련과제구분,작성자,작성일,과제기여율 등) |
---|---|
URL | https://www.data.go.kr/data/15049190/fileData.do |
성과활동관련과제구분 has constant value "" | Constant |
수정자 has constant value "" | Constant |
과제기여율 is highly overall correlated with 오더번호 | High correlation |
오더번호 is highly overall correlated with 과제기여율 | High correlation |
Reproduction
Analysis started | 2023-12-12 11:15:30.909874 |
---|---|
Analysis finished | 2023-12-12 11:15:33.392758 |
Duration | 2.48 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
성과활동신청번호
Text
Distinct | 1340 |
---|---|
Distinct (%) | 65.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.1 KiB |
Value | Count | Frequency (%) |
t20210427 | 6 | 0.3% |
t20220435 | 5 | 0.2% |
t20200310 | 5 | 0.2% |
t20220408 | 5 | 0.2% |
t20220419 | 5 | 0.2% |
t20220434 | 5 | 0.2% |
t20210377 | 4 | 0.2% |
t20210642 | 4 | 0.2% |
t20180300 | 4 | 0.2% |
t20210287 | 4 | 0.2% |
Other values (1330) | 1996 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 5174 | |
2 | 4542 | |
T | 2043 | 11.1% |
1 | 1963 | 10.7% |
8 | 778 | 4.2% |
9 | 746 | 4.1% |
3 | 736 | 4.0% |
4 | 693 | 3.8% |
5 | 623 | 3.4% |
6 | 572 | 3.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 16344 | |
Uppercase Letter | 2043 | 11.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 5174 | |
2 | 4542 | |
1 | 1963 | 12.0% |
8 | 778 | 4.8% |
9 | 746 | 4.6% |
3 | 736 | 4.5% |
4 | 693 | 4.2% |
5 | 623 | 3.8% |
6 | 572 | 3.5% |
7 | 517 | 3.2% |
Uppercase Letter
Value | Count | Frequency (%) |
T | 2043 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 16344 | |
Latin | 2043 | 11.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 5174 | |
2 | 4542 | |
1 | 1963 | 12.0% |
8 | 778 | 4.8% |
9 | 746 | 4.6% |
3 | 736 | 4.5% |
4 | 693 | 4.2% |
5 | 623 | 3.8% |
6 | 572 | 3.5% |
7 | 517 | 3.2% |
Latin
Value | Count | Frequency (%) |
T | 2043 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 18387 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 5174 | |
2 | 4542 | |
T | 2043 | 11.1% |
1 | 1963 | 10.7% |
8 | 778 | 4.2% |
9 | 746 | 4.1% |
3 | 736 | 4.0% |
4 | 693 | 3.8% |
5 | 623 | 3.4% |
6 | 572 | 3.1% |
사업_과제번호
Text
Distinct | 833 |
---|---|
Distinct (%) | 40.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.1 KiB |
Value | Count | Frequency (%) |
nk226d | 26 | 1.3% |
nk211b | 17 | 0.8% |
nm9730 | 14 | 0.7% |
nk232a | 13 | 0.6% |
nk213e | 13 | 0.6% |
nk230c | 13 | 0.6% |
nk231f | 13 | 0.6% |
nm9440 | 12 | 0.6% |
nk220d | 12 | 0.6% |
nk238d | 11 | 0.5% |
Other values (823) | 1899 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 2015 | |
2 | 1359 | |
N | 1350 | |
1 | 840 | 6.9% |
K | 715 | 5.8% |
3 | 696 | 5.7% |
M | 550 | 4.5% |
9 | 516 | 4.2% |
7 | 484 | 3.9% |
6 | 435 | 3.5% |
Other values (19) | 3298 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 7441 | |
Uppercase Letter | 4817 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
N | 1350 | |
K | 715 | |
M | 550 | |
B | 363 | 7.5% |
E | 363 | 7.5% |
T | 256 | 5.3% |
D | 246 | 5.1% |
O | 233 | 4.8% |
C | 179 | 3.7% |
G | 174 | 3.6% |
Other values (9) | 388 | 8.1% |
Decimal Number
Value | Count | Frequency (%) |
0 | 2015 | |
2 | 1359 | |
1 | 840 | |
3 | 696 | 9.4% |
9 | 516 | 6.9% |
7 | 484 | 6.5% |
6 | 435 | 5.8% |
4 | 380 | 5.1% |
8 | 377 | 5.1% |
5 | 339 | 4.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 7441 | |
Latin | 4817 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
N | 1350 | |
K | 715 | |
M | 550 | |
B | 363 | 7.5% |
E | 363 | 7.5% |
T | 256 | 5.3% |
D | 246 | 5.1% |
O | 233 | 4.8% |
C | 179 | 3.7% |
G | 174 | 3.6% |
Other values (9) | 388 | 8.1% |
Common
Value | Count | Frequency (%) |
0 | 2015 | |
2 | 1359 | |
1 | 840 | |
3 | 696 | 9.4% |
9 | 516 | 6.9% |
7 | 484 | 6.5% |
6 | 435 | 5.8% |
4 | 380 | 5.1% |
8 | 377 | 5.1% |
5 | 339 | 4.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 12258 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 2015 | |
2 | 1359 | |
N | 1350 | |
1 | 840 | 6.9% |
K | 715 | 5.8% |
3 | 696 | 5.7% |
M | 550 | 4.5% |
9 | 516 | 4.2% |
7 | 484 | 3.9% |
6 | 435 | 3.5% |
Other values (19) | 3298 |
사업_과제신청순번
Real number (ℝ)
Distinct | 10 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.7963779 |
Minimum | 1 |
---|---|
Maximum | 10 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 18.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 3 |
Q3 | 4 |
95-th percentile | 5 |
Maximum | 10 |
Range | 9 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.4734498 |
---|---|
Coefficient of variation (CV) | 0.52691371 |
Kurtosis | 0.93232891 |
Mean | 2.7963779 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.79244933 |
Sum | 5713 |
Variance | 2.1710545 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 526 | |
1 | 468 | |
2 | 462 | |
4 | 312 | |
5 | 203 | 9.9% |
6 | 50 | 2.4% |
8 | 8 | 0.4% |
7 | 6 | 0.3% |
9 | 6 | 0.3% |
10 | 2 | 0.1% |
Value | Count | Frequency (%) |
1 | 468 | |
2 | 462 | |
3 | 526 | |
4 | 312 | |
5 | 203 | 9.9% |
6 | 50 | 2.4% |
7 | 6 | 0.3% |
8 | 8 | 0.4% |
9 | 6 | 0.3% |
10 | 2 | 0.1% |
Value | Count | Frequency (%) |
10 | 2 | 0.1% |
9 | 6 | 0.3% |
8 | 8 | 0.4% |
7 | 6 | 0.3% |
6 | 50 | 2.4% |
5 | 203 | 9.9% |
4 | 312 | |
3 | 526 | |
2 | 462 | |
1 | 468 |
성과활동관련과제구분
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.1 KiB |
논문 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 논문 |
---|---|
2nd row | 논문 |
3rd row | 논문 |
4th row | 논문 |
5th row | 논문 |
Common Values
Value | Count | Frequency (%) |
논문 | 2043 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
논문 | 2043 |
작성자
Text
Distinct | 309 |
---|---|
Distinct (%) | 15.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.1 KiB |
Value | Count | Frequency (%) |
0795 | 59 | 2.9% |
0934 | 39 | 1.9% |
0869 | 35 | 1.7% |
0612 | 35 | 1.7% |
1066 | 34 | 1.7% |
0868 | 33 | 1.6% |
0994 | 30 | 1.5% |
1043 | 30 | 1.5% |
0872 | 29 | 1.4% |
1079 | 28 | 1.4% |
Other values (299) | 1691 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 2098 | |
1 | 1286 | |
8 | 899 | |
9 | 892 | |
6 | 659 | 8.1% |
7 | 505 | 6.2% |
4 | 491 | 6.0% |
3 | 488 | 6.0% |
2 | 411 | 5.0% |
5 | 356 | 4.4% |
Other values (3) | 87 | 1.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 8085 | |
Uppercase Letter | 87 | 1.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 2098 | |
1 | 1286 | |
8 | 899 | |
9 | 892 | |
6 | 659 | 8.2% |
7 | 505 | 6.2% |
4 | 491 | 6.1% |
3 | 488 | 6.0% |
2 | 411 | 5.1% |
5 | 356 | 4.4% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 52 | |
H | 28 | |
U | 7 | 8.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 8085 | |
Latin | 87 | 1.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 2098 | |
1 | 1286 | |
8 | 899 | |
9 | 892 | |
6 | 659 | 8.2% |
7 | 505 | 6.2% |
4 | 491 | 6.1% |
3 | 488 | 6.0% |
2 | 411 | 5.1% |
5 | 356 | 4.4% |
Latin
Value | Count | Frequency (%) |
M | 52 | |
H | 28 | |
U | 7 | 8.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8172 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 2098 | |
1 | 1286 | |
8 | 899 | |
9 | 892 | |
6 | 659 | 8.1% |
7 | 505 | 6.2% |
4 | 491 | 6.0% |
3 | 488 | 6.0% |
2 | 411 | 5.0% |
5 | 356 | 4.4% |
Other values (3) | 87 | 1.1% |
작성일
Date
Distinct | 710 |
---|---|
Distinct (%) | 34.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.1 KiB |
Minimum | 2018-01-03 00:00:00 |
---|---|
Maximum | 2023-02-14 00:00:00 |
수정자
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.1 KiB |
9999 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 9999 |
---|---|
2nd row | 9999 |
3rd row | 9999 |
4th row | 9999 |
5th row | 9999 |
Common Values
Value | Count | Frequency (%) |
9999 | 2043 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
9999 | 2043 |
수정일
Date
Distinct | 710 |
---|---|
Distinct (%) | 34.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.1 KiB |
Minimum | 2018-01-03 00:00:00 |
---|---|
Maximum | 2023-02-14 00:00:00 |
과제기여율
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 29 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 62.100343 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 3 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 18.1 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 10 |
Q1 | 40 |
median | 50 |
Q3 | 100 |
95-th percentile | 100 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 60 |
Descriptive statistics
Standard deviation | 31.965291 |
---|---|
Coefficient of variation (CV) | 0.51473615 |
Kurtosis | -1.2936384 |
Mean | 62.100343 |
Median Absolute Deviation (MAD) | 30 |
Skewness | -0.0774988 |
Sum | 126871 |
Variance | 1021.7798 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
100 | 708 | |
50 | 526 | |
30 | 167 | 8.2% |
20 | 106 | 5.2% |
40 | 106 | 5.2% |
70 | 82 | 4.0% |
10 | 71 | 3.5% |
60 | 58 | 2.8% |
80 | 50 | 2.4% |
1 | 45 | 2.2% |
Other values (19) | 124 | 6.1% |
Value | Count | Frequency (%) |
0 | 3 | 0.1% |
1 | 45 | |
2 | 4 | 0.2% |
3 | 1 | < 0.1% |
5 | 24 | 1.2% |
10 | 71 | |
15 | 4 | 0.2% |
20 | 106 | |
25 | 29 | 1.4% |
26 | 1 | < 0.1% |
Value | Count | Frequency (%) |
100 | 708 | |
95 | 4 | 0.2% |
90 | 19 | 0.9% |
85 | 1 | < 0.1% |
80 | 50 | 2.4% |
70 | 82 | 4.0% |
67 | 1 | < 0.1% |
60 | 58 | 2.8% |
51 | 2 | 0.1% |
50 | 526 |
오더번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.4395497 |
Minimum | 1 |
---|---|
Maximum | 6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 18.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 2 |
95-th percentile | 3 |
Maximum | 6 |
Range | 5 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.69744335 |
---|---|
Coefficient of variation (CV) | 0.48448717 |
Kurtosis | 3.9729702 |
Mean | 1.4395497 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.8071709 |
Sum | 2941 |
Variance | 0.48642722 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 1339 | |
2 | 552 | |
3 | 118 | 5.8% |
4 | 27 | 1.3% |
5 | 6 | 0.3% |
6 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1 | 1339 | |
2 | 552 | |
3 | 118 | 5.8% |
4 | 27 | 1.3% |
5 | 6 | 0.3% |
6 | 1 | < 0.1% |
Value | Count | Frequency (%) |
6 | 1 | < 0.1% |
5 | 6 | 0.3% |
4 | 27 | 1.3% |
3 | 118 | 5.8% |
2 | 552 | |
1 | 1339 |
사업_과제신청순번 | 과제기여율 | 오더번호 | |
---|---|---|---|
사업_과제신청순번 | 1.000 | 0.000 | 0.016 |
과제기여율 | 0.000 | 1.000 | 0.583 |
오더번호 | 0.016 | 0.583 | 1.000 |
사업_과제신청순번 | 과제기여율 | 오더번호 | |
---|---|---|---|
사업_과제신청순번 | 1.000 | 0.061 | -0.083 |
과제기여율 | 0.061 | 1.000 | -0.656 |
오더번호 | -0.083 | -0.656 | 1.000 |
성과활동신청번호 | 사업_과제번호 | 사업_과제신청순번 | 성과활동관련과제구분 | 작성자 | 작성일 | 수정자 | 수정일 | 과제기여율 | 오더번호 | |
---|---|---|---|---|---|---|---|---|---|---|
0 | T20180177 | MO8580 | 3 | 논문 | 0813 | 2018-01-17 | 9999 | 2018-01-17 | 100 | 1 |
1 | T20180186 | NK213D | 1 | 논문 | 0908 | 2018-01-22 | 9999 | 2018-01-22 | 100 | 1 |
2 | T20180210 | MO5590 | 5 | 논문 | 0985 | 2018-02-19 | 9999 | 2018-02-19 | 100 | 1 |
3 | T20180726 | NK211B | 1 | 논문 | 0973 | 2018-09-14 | 9999 | 2018-09-14 | 50 | 1 |
4 | T20180726 | OD1760 | 5 | 논문 | 0973 | 2018-09-14 | 9999 | 2018-09-14 | 50 | 2 |
5 | T20180982 | OD1790 | 3 | 논문 | 0736 | 2018-11-16 | 9999 | 2018-11-16 | 100 | 1 |
6 | T20180958 | NE6300 | 1 | 논문 | 1037 | 2018-11-13 | 9999 | 2018-11-13 | 100 | 1 |
7 | T20181141 | NK212D | 1 | 논문 | 1004 | 2018-12-20 | 9999 | 2018-12-20 | 50 | 1 |
8 | T20181141 | NK215C | 1 | 논문 | 1004 | 2018-12-20 | 9999 | 2018-12-20 | 50 | 2 |
9 | T20190030 | MO9060 | 4 | 논문 | 0667 | 2019-01-08 | 9999 | 2019-01-08 | 100 | 1 |
성과활동신청번호 | 사업_과제번호 | 사업_과제신청순번 | 성과활동관련과제구분 | 작성자 | 작성일 | 수정자 | 수정일 | 과제기여율 | 오더번호 | |
---|---|---|---|---|---|---|---|---|---|---|
2033 | T20230055 | NB1730 | 1 | 논문 | 0621 | 2023-01-05 | 9999 | 2023-01-05 | 100 | 1 |
2034 | T20230057 | NE7970 | 2 | 논문 | 0727 | 2023-01-05 | 9999 | 2023-01-05 | 100 | 1 |
2035 | T20230059 | NK237A | 5 | 논문 | 0837 | 2023-01-05 | 9999 | 2023-01-05 | 100 | 1 |
2036 | T20230060 | NK237A | 5 | 논문 | 0727 | 2023-01-05 | 9999 | 2023-01-05 | 50 | 1 |
2037 | T20230060 | AI3860 | 1 | 논문 | 0727 | 2023-01-05 | 9999 | 2023-01-05 | 50 | 2 |
2038 | T20230065 | GG3230 | 2 | 논문 | 0736 | 2023-01-05 | 9999 | 2023-01-05 | 100 | 1 |
2039 | T20230114 | NE7930 | 3 | 논문 | 0947 | 2023-01-06 | 9999 | 2023-01-06 | 100 | 1 |
2040 | T20230152 | NE8080 | 5 | 논문 | 0951 | 2023-02-14 | 9999 | 2023-02-14 | 100 | 1 |
2041 | T20230126 | NK213F | 4 | 논문 | 0021 | 2023-01-16 | 9999 | 2023-01-16 | 40 | 1 |
2042 | T20230126 | MT2130 | 2 | 논문 | 0021 | 2023-01-16 | 9999 | 2023-01-16 | 60 | 2 |