Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 502 |
Missing cells | 4008 |
Missing cells (%) | 88.7% |
Duplicate rows | 1 |
Duplicate rows (%) | 0.2% |
Total size in memory | 35.9 KiB |
Average record size in memory | 73.3 B |
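The missing-cell figures above can be cross-checked against the per-variable counts reported later in this report (501 missing values in each of 8 variables, in a table of 502 rows by 9 variables). The sketch below only redoes that arithmetic; the variable names are illustrative and are not part of the profiling tool.

```python
# Cross-check of the "Missing cells" figures using counts taken from this report.
n_rows = 502            # Number of observations
n_cols = 9              # Number of variables
missing_per_col = 501   # missing values reported for each affected variable
affected_cols = 8       # every variable except 번호

missing_cells = missing_per_col * affected_cols
missing_pct = 100 * missing_cells / (n_rows * n_cols)

print(missing_cells)           # 4008
print(round(missing_pct, 1))   # 88.7
```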
Variable types
Categorical | 1 |
---|---|
Text | 7 |
DateTime | 1 |
Dataset
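Description | Papers from agriculture, forestry, and food (forest production) R&D projects that ended in 2018 (project number, program name, principal investigator, paper title, academic year, authors, journal name) |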
---|---|
Author | 농림식품기술기획평가원 |
URL | https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20191014000000001329 |
분류 has constant value "" | Constant |
과제번호 has constant value "" | Constant |
과제명 has constant value "" | Constant |
연구책임자 has constant value "" | Constant |
논문명 has constant value "" | Constant |
학술지 게재일자 has constant value "" | Constant |
저자 has constant value "" | Constant |
학술지명 has constant value "" | Constant |
Dataset has 1 (0.2%) duplicate rows | Duplicates |
번호 is highly imbalanced (97.9%) | Imbalance |
분류 has 501 (99.8%) missing values | Missing |
과제번호 has 501 (99.8%) missing values | Missing |
과제명 has 501 (99.8%) missing values | Missing |
연구책임자 has 501 (99.8%) missing values | Missing |
논문명 has 501 (99.8%) missing values | Missing |
학술지 게재일자 has 501 (99.8%) missing values | Missing |
저자 has 501 (99.8%) missing values | Missing |
학술지명 has 501 (99.8%) missing values | Missing |
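The alerts above suggest that the source file has one populated record followed by 501 empty rows. One possible cleanup, shown here as a minimal sketch, is to drop rows in which every field is blank before profiling. The CSV excerpt and the helper `drop_empty_rows` are hypothetical and abbreviated to three columns; the real file layout is not shown in this report.

```python
import csv
import io

# Hypothetical CSV excerpt: one populated record followed by empty rows.
raw = "번호,분류,과제번호\n1,임산 공학,118040-3\n,,\n,,\n"

def drop_empty_rows(rows):
    """Keep only rows in which at least one field has non-blank text."""
    return [
        row for row in rows
        if any((value or "").strip() for value in row.values())
    ]

rows = list(csv.DictReader(io.StringIO(raw)))
kept = drop_empty_rows(rows)
print(len(rows), len(kept))  # 3 1
```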
Reproduction
Analysis started | 2023-12-11 03:31:07.360400 |
---|---|
Analysis finished | 2023-12-11 03:31:08.082304 |
Duration | 0.72 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
번호
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.1 KiB |
<NA> | |
---|---|
1 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9940239 |
Min length | 1 |
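The length statistics are consistent with 번호 holding the literal text `<NA>` (4 characters) in 501 cells and `1` in one cell, which would also explain why this variable reports 0 missing values. That reading is an assumption inferred from the numbers, checked in the sketch below.

```python
# Assumption: 501 cells hold the literal text "<NA>" and one cell holds "1".
values = ["1"] + ["<NA>"] * 501
lengths = [len(v) for v in values]
mean_length = sum(lengths) / len(lengths)

print(max(lengths), min(lengths))   # 4 1
print(round(mean_length, 7))        # 3.9940239
```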
Unique
Unique | 1 |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 1 |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 501 | |
1 | 1 | 0.2% |
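The 97.9% imbalance reported for 번호 can be reproduced from the two counts above with an entropy-based score, 1 - H / log2(k), where H is the Shannon entropy of the class frequencies and k is the number of classes. Whether this is exactly the formula the profiling tool uses is an assumption; the sketch only shows that it matches the reported figure. The function name `imbalance_score` is illustrative.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (bits) of k class counts."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

score = imbalance_score([501, 1])
print(f"{score:.1%}")  # 97.9%
```

A perfectly balanced variable scores 0 under this definition, and a variable dominated by one value approaches 1.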
분류
Text
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 501 |
Missing (%) | 99.8% |
Memory size | 4.1 KiB |
Value | Count | Frequency (%) |
임산 | 1 | |
공학 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
임 | 1 | |
산 | 1 | |
(space) | 1 | |
공 | 1 | |
학 | 1 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 4 | |
Space Separator | 1 | 20.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
임 | 1 | |
산 | 1 | |
공 | 1 | |
학 | 1 |
Space Separator
Value | Count | Frequency (%) |
(space) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 4 | |
Common | 1 | 20.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
임 | 1 | |
산 | 1 | |
공 | 1 | |
학 | 1 |
Common
Value | Count | Frequency (%) |
(space) | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 4 | |
ASCII | 1 | 20.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
임 | 1 | |
산 | 1 | |
공 | 1 | |
학 | 1 |
ASCII
Value | Count | Frequency (%) |
(space) | 1 |
과제번호
Text
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 501 |
Missing (%) | 99.8% |
Memory size | 4.1 KiB |
Value | Count | Frequency (%) |
118040-3 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 2 | |
0 | 2 | |
8 | 1 | |
4 | 1 | |
- | 1 | |
3 | 1 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 7 | |
Dash Punctuation | 1 | 12.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 2 | |
0 | 2 | |
8 | 1 | |
4 | 1 | |
3 | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 8 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 2 | |
0 | 2 | |
8 | 1 | |
4 | 1 | |
- | 1 | |
3 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 2 | |
0 | 2 | |
8 | 1 | |
4 | 1 | |
- | 1 | |
3 | 1 |
과제명
Text
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 501 |
Missing (%) | 99.8% |
Memory size | 4.1 KiB |
Value | Count | Frequency (%) |
농업부산물로 | 1 | |
제조된 | 1 | |
화학펄프 | 1 | |
및 | 1 | |
나노셀룰로오스를 | 1 | |
활용한 | 1 | |
친환경 | 1 | |
고강도 | 1 | |
과일봉지 | 1 | |
원지 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 10 | 20.4% |
지 | 2 | 4.1% |
로 | 2 | 4.1% |
농 | 1 | 2.0% |
고 | 1 | 2.0% |
활 | 1 | 2.0% |
용 | 1 | 2.0% |
한 | 1 | 2.0% |
친 | 1 | 2.0% |
환 | 1 | 2.0% |
Other values (28) | 28 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 39 | |
Space Separator | 10 | 20.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 2 | 5.1% |
로 | 2 | 5.1% |
농 | 1 | 2.6% |
고 | 1 | 2.6% |
활 | 1 | 2.6% |
용 | 1 | 2.6% |
한 | 1 | 2.6% |
친 | 1 | 2.6% |
환 | 1 | 2.6% |
경 | 1 | 2.6% |
Other values (27) | 27 |
Space Separator
Value | Count | Frequency (%) |
(space) | 10 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 39 | |
Common | 10 | 20.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 2 | 5.1% |
로 | 2 | 5.1% |
농 | 1 | 2.6% |
고 | 1 | 2.6% |
활 | 1 | 2.6% |
용 | 1 | 2.6% |
한 | 1 | 2.6% |
친 | 1 | 2.6% |
환 | 1 | 2.6% |
경 | 1 | 2.6% |
Other values (27) | 27 |
Common
Value | Count | Frequency (%) |
(space) | 10 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 39 | |
ASCII | 10 | 20.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 10 |
Hangul
Value | Count | Frequency (%) |
지 | 2 | 5.1% |
로 | 2 | 5.1% |
농 | 1 | 2.6% |
고 | 1 | 2.6% |
활 | 1 | 2.6% |
용 | 1 | 2.6% |
한 | 1 | 2.6% |
친 | 1 | 2.6% |
환 | 1 | 2.6% |
경 | 1 | 2.6% |
Other values (27) | 27 |
연구책임자
Text
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 501 |
Missing (%) | 99.8% |
Memory size | 4.1 KiB |
Value | Count | Frequency (%) |
이지영 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
이 | 1 | |
지 | 1 | |
영 | 1 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 1 | |
지 | 1 | |
영 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 3 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 1 | |
지 | 1 | |
영 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 3 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
이 | 1 | |
지 | 1 | |
영 | 1 |
논문명
Text
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 501 |
Missing (%) | 99.8% |
Memory size | 4.1 KiB |
Value | Count | Frequency (%) |
지력증강제 | 1 | |
투입에 | 1 | |
따른 | 1 | |
농업부산물 | 1 | |
유기충전제 | 1 | |
적용 | 1 | |
판지의 | 1 | |
물성 | 1 | |
평가 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 8 | |
지 | 2 | 5.4% |
물 | 2 | 5.4% |
제 | 2 | 5.4% |
평 | 1 | 2.7% |
성 | 1 | 2.7% |
의 | 1 | 2.7% |
판 | 1 | 2.7% |
용 | 1 | 2.7% |
적 | 1 | 2.7% |
Other values (17) | 17 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 29 | |
Space Separator | 8 | 21.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 2 | 6.9% |
물 | 2 | 6.9% |
제 | 2 | 6.9% |
평 | 1 | 3.4% |
성 | 1 | 3.4% |
의 | 1 | 3.4% |
판 | 1 | 3.4% |
용 | 1 | 3.4% |
적 | 1 | 3.4% |
전 | 1 | 3.4% |
Other values (16) | 16 |
Space Separator
Value | Count | Frequency (%) |
(space) | 8 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 29 | |
Common | 8 | 21.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 2 | 6.9% |
물 | 2 | 6.9% |
제 | 2 | 6.9% |
평 | 1 | 3.4% |
성 | 1 | 3.4% |
의 | 1 | 3.4% |
판 | 1 | 3.4% |
용 | 1 | 3.4% |
적 | 1 | 3.4% |
전 | 1 | 3.4% |
Other values (16) | 16 |
Common
Value | Count | Frequency (%) |
(space) | 8 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 29 | |
ASCII | 8 | 21.6% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 8 |
Hangul
Value | Count | Frequency (%) |
지 | 2 | 6.9% |
물 | 2 | 6.9% |
제 | 2 | 6.9% |
평 | 1 | 3.4% |
성 | 1 | 3.4% |
의 | 1 | 3.4% |
판 | 1 | 3.4% |
용 | 1 | 3.4% |
적 | 1 | 3.4% |
전 | 1 | 3.4% |
Other values (16) | 16 |
학술지 게재일자
Date
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 501 |
Missing (%) | 99.8% |
Memory size | 4.1 KiB |
Minimum | 2031-08-18 00:00:00 |
---|---|
Maximum | 2031-08-18 00:00:00 |
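The sample row in this report shows the raw value `18/08/31` for 학술지 게재일자, and the dataset covers projects that ended in 2018, so the 2031-08-18 above looks like a day-first reading of a year-first date. This is an inference from the report, not something the source states. The sketch below shows how the two readings diverge and how an explicit format string avoids the ambiguity.

```python
from datetime import datetime

raw = "18/08/31"  # value shown in the sample row for 학술지 게재일자

day_first = datetime.strptime(raw, "%d/%m/%y")   # day/month/year reading
year_first = datetime.strptime(raw, "%y/%m/%d")  # year/month/day reading

print(day_first.date())   # 2031-08-18
print(year_first.date())  # 2018-08-31
```

Passing an explicit format when loading the column, rather than relying on automatic date inference, would likely give 2018-08-31 here.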
저자
Text
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 501 |
Missing (%) | 99.8% |
Memory size | 4.1 KiB |
Value | Count | Frequency (%) |
주저자 | 1 | |
(empty) | 1 | |
이지영 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 2 | |
주 | 1 | |
저 | 1 | |
자 | 1 | |
: | 1 | |
이 | 1 | |
지 | 1 | |
영 | 1 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 6 | |
Space Separator | 2 | 22.2% |
Other Punctuation | 1 | 11.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 1 | |
저 | 1 | |
자 | 1 | |
이 | 1 | |
지 | 1 | |
영 | 1 |
Space Separator
Value | Count | Frequency (%) |
(space) | 2 |
Other Punctuation
Value | Count | Frequency (%) |
: | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 6 | |
Common | 3 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 1 | |
저 | 1 | |
자 | 1 | |
이 | 1 | |
지 | 1 | |
영 | 1 |
Common
Value | Count | Frequency (%) |
(space) | 2 | |
: | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 6 | |
ASCII | 3 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 2 | |
: | 1 |
Hangul
Value | Count | Frequency (%) |
주 | 1 | |
저 | 1 | |
자 | 1 | |
이 | 1 | |
지 | 1 | |
영 | 1 |
학술지명
Text
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 501 |
Missing (%) | 99.8% |
Memory size | 4.1 KiB |
Length
Max length | 79 |
---|---|
Median length | 79 |
Mean length | 79 |
Min length | 79 |
Characters and Unicode
Total characters | 79 |
---|---|
Distinct characters | 30 |
Distinct categories | 5 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 1 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 펄프 종이기술 = Journal of Korea Technical Association of the Pulp and Paper Industry |
---|
Value | Count | Frequency (%) |
of | 2 | |
펄프 | 1 | 7.1% |
종이기술 | 1 | 7.1% |
(empty) | 1 | 7.1% |
journal | 1 | 7.1% |
korea | 1 | 7.1% |
technical | 1 | 7.1% |
association | 1 | 7.1% |
the | 1 | 7.1% |
pulp | 1 | 7.1% |
Other values (3) | 3 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 13 | |
o | 6 | 7.6% |
a | 6 | 7.6% |
n | 5 | 6.3% |
e | 4 | 5.1% |
r | 4 | 5.1% |
i | 3 | 3.8% |
l | 3 | 3.8% |
t | 3 | 3.8% |
s | 3 | 3.8% |
Other values (20) | 29 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 52 | |
Space Separator | 13 | 16.5% |
Uppercase Letter | 7 | 8.9% |
Other Letter | 6 | 7.6% |
Math Symbol | 1 | 1.3% |
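The category counts above correspond to Unicode general categories, which Python exposes through the standard `unicodedata` module as two-letter codes (`Ll` lowercase letter, `Zs` space separator, `Lu` uppercase letter, `Lo` other letter, `Sm` math symbol). A minimal sketch, assuming the journal name is exactly the string shown in the sample row with single spaces:

```python
import unicodedata
from collections import Counter

title = "펄프 종이기술 = Journal of Korea Technical Association of the Pulp and Paper Industry"

# Tally the Unicode general category of every character in the string.
category_counts = Counter(unicodedata.category(ch) for ch in title)

print(len(title))  # 79
print(category_counts.most_common())
# [('Ll', 52), ('Zs', 13), ('Lu', 7), ('Lo', 6), ('Sm', 1)]
```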
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 6 | |
a | 6 | |
n | 5 | |
e | 4 | 7.7% |
r | 4 | 7.7% |
i | 3 | 5.8% |
l | 3 | 5.8% |
t | 3 | 5.8% |
s | 3 | 5.8% |
c | 3 | 5.8% |
Other values (6) | 12 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 2 | |
A | 1 | |
I | 1 | |
T | 1 | |
K | 1 | |
J | 1 |
Other Letter
Value | Count | Frequency (%) |
펄 | 1 | |
프 | 1 | |
술 | 1 | |
기 | 1 | |
이 | 1 | |
종 | 1 |
Space Separator
Value | Count | Frequency (%) |
(space) | 13 |
Math Symbol
Value | Count | Frequency (%) |
= | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 59 | |
Common | 14 | 17.7% |
Hangul | 6 | 7.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 6 | 10.2% |
a | 6 | 10.2% |
n | 5 | 8.5% |
e | 4 | 6.8% |
r | 4 | 6.8% |
i | 3 | 5.1% |
l | 3 | 5.1% |
t | 3 | 5.1% |
s | 3 | 5.1% |
c | 3 | 5.1% |
Other values (12) | 19 |
Hangul
Value | Count | Frequency (%) |
펄 | 1 | |
프 | 1 | |
술 | 1 | |
기 | 1 | |
이 | 1 | |
종 | 1 |
Common
Value | Count | Frequency (%) |
(space) | 13 | |
= | 1 | 7.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 73 | |
Hangul | 6 | 7.6% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 13 | |
o | 6 | 8.2% |
a | 6 | 8.2% |
n | 5 | 6.8% |
e | 4 | 5.5% |
r | 4 | 5.5% |
i | 3 | 4.1% |
l | 3 | 4.1% |
t | 3 | 4.1% |
s | 3 | 4.1% |
Other values (14) | 23 |
Hangul
Value | Count | Frequency (%) |
펄 | 1 | |
프 | 1 | |
술 | 1 | |
기 | 1 | |
이 | 1 | |
종 | 1 |
First rows
(index) | 번호 | 분류 | 과제번호 | 과제명 | 연구책임자 | 논문명 | 학술지 게재일자 | 저자 | 학술지명 |
---|---|---|---|---|---|---|---|---|---|
0 | 1 | 임산 공학 | 118040-3 | 농업부산물로 제조된 화학펄프 및 나노셀룰로오스를 활용한 친환경 고강도 과일봉지 원지 개발 | 이지영 | 지력증강제 투입에 따른 농업부산물 유기충전제 적용 판지의 물성 평가 | 18/08/31 | 주저자 : 이지영 | 펄프 종이기술 = Journal of Korea Technical Association of the Pulp and Paper Industry |
1 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
3 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
4 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
5 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
6 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
7 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
8 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Last rows
(index) | 번호 | 분류 | 과제번호 | 과제명 | 연구책임자 | 논문명 | 학술지 게재일자 | 저자 | 학술지명 |
---|---|---|---|---|---|---|---|---|---|
492 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
493 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
494 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
495 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
496 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
497 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
498 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
499 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
500 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
501 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
(index) | 번호 | 분류 | 과제번호 | 과제명 | 연구책임자 | 논문명 | 학술지 게재일자 | 저자 | 학술지명 | # duplicates |
---|---|---|---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 501 |
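The table above is consistent with the "Duplicate rows | 1 (0.2%)" statistic counting distinct row patterns that occur more than once, rather than the number of repeated rows. That interpretation is an assumption. The sketch below reproduces both figures with a stand-in dataset abbreviated to three fields per row.

```python
from collections import Counter

NA = None
populated = (1, "임산 공학", "118040-3")   # abbreviated to three fields
rows = [populated] + [(NA, NA, NA)] * 501

# Count how often each full row occurs, then keep patterns seen more than once.
counts = Counter(rows)
duplicated = {row: n for row, n in counts.items() if n > 1}

print(len(duplicated))                        # 1
print(duplicated[(NA, NA, NA)])               # 501
print(f"{len(duplicated) / len(rows):.1%}")   # 0.2%
```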