Overview

Dataset statistics

Number of variables: 5
Number of observations: 1552
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 62.3 KiB
Average record size in memory: 41.1 B
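
The average record size follows from the total size and the number of observations. A minimal Python sketch of that arithmetic, assuming the reported 62.3 KiB is a rounded figure, so the result agrees only to the displayed precision:

```python
# Values copied from the dataset statistics in this report.
total_size_kib = 62.3      # total size in memory, KiB (rounded in the report)
n_observations = 1552      # number of rows
n_variables = 5            # number of columns

total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations
n_cells = n_observations * n_variables   # cells checked for missing values

print(f"average record size: {avg_record_bytes:.1f} B")
print(f"total cells: {n_cells}")
```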

Variable types

Numeric: 1
Categorical: 4

Dataset

Description: R&D paper information on agriculture, forestry, fisheries and food production systems (농림수산식품 생산시스템R&D 논문 정보)
Author: 농림식품기술기획평가원
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220211000000001840

Alerts

논문명 has a high cardinality: 1517 distinct values
저자 has a high cardinality: 1109 distinct values
게재지명 has a high cardinality: 552 distinct values
순번 has unique values
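
Alerts of this kind can be derived from distinct-value counts. The sketch below is a simplified illustration, not the profiler's actual logic: the function name `cardinality_alerts` and the `threshold` of 50 are assumptions, and the data is a toy example rather than the real dataset.

```python
def cardinality_alerts(columns, threshold=50):
    """Return alert strings for columns given as {name: list_of_values}.

    `threshold` is an assumed cut-off for "high cardinality"; it is not
    taken from this report.
    """
    alerts = []
    for name, values in columns.items():
        n_distinct = len(set(values))
        if n_distinct == len(values):
            # Every value occurs exactly once.
            alerts.append(f"{name} has unique values")
        elif n_distinct > threshold:
            alerts.append(f"{name} has a high cardinality: {n_distinct} distinct values")
    return alerts

# Toy data, not the real dataset.
toy = {
    "id": list(range(100)),
    "title": [f"paper {i % 60}" for i in range(100)],
    "month": ["2012-12", "2011-12"] * 50,
}
alerts = cardinality_alerts(toy)
print(alerts)
```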

Reproduction

Analysis started: 2022-08-12 14:51:41.580481
Analysis finished: 2022-08-12 14:51:42.441844
Duration: 0.86 seconds
Software version: pandas-profiling v3.2.0
Download configuration: config.json

Variables

순번
Real number (ℝ≥0)

UNIQUE

Distinct: 1552
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 776.5
Minimum: 1
Maximum: 1552
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 13.8 KiB

Quantile statistics

Minimum: 1
5th percentile: 78.55
Q1: 388.75
Median: 776.5
Q3: 1164.25
95th percentile: 1474.45
Maximum: 1552
Range: 1551
Interquartile range (IQR): 775.5
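
The quantile figures can be reproduced with linear interpolation between neighbouring ranks. The sketch below assumes that 순번 holds the integers 1 to 1552 without gaps (consistent with the minimum, maximum, sum and distinct count reported here) and that linear interpolation is the method in use; `quantile_linear` is a hypothetical helper name.

```python
def quantile_linear(sorted_values, q):
    """Quantile with linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_values) - 1)          # fractional zero-based position
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

values = list(range(1, 1553))  # assumed contents of 순번: 1, 2, ..., 1552

q05 = quantile_linear(values, 0.05)
q1 = quantile_linear(values, 0.25)
median = quantile_linear(values, 0.50)
q3 = quantile_linear(values, 0.75)
q95 = quantile_linear(values, 0.95)
iqr = q3 - q1
value_range = values[-1] - values[0]

print(round(q05, 2), q1, median, q3, round(q95, 2), iqr, value_range)
```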

Descriptive statistics

Standard deviation: 448.1681232
Coefficient of variation (CV): 0.577164357
Kurtosis: -1.2
Mean: 776.5
Median Absolute Deviation (MAD): 388
Skewness: 0
Sum: 1205128
Variance: 200854.6667
Monotonicity: Strictly increasing
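
Most of these descriptive statistics can be checked with the Python standard library. The sketch assumes that 순번 is the sequence 1, 2, ..., 1552 and that the variance and standard deviation use the sample (n - 1) denominator, which matches the figures reported here.

```python
import statistics

values = list(range(1, 1553))  # assumed contents of 순번: 1, 2, ..., 1552

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)       # sample standard deviation
cv = std_dev / mean                      # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the distances from the median.
mad = statistics.median([abs(v - median) for v in values])
total = sum(values)
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, round(variance, 4), round(std_dev, 7), round(cv, 9), mad, total, strictly_increasing)
```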
Histogram with fixed size bins (bins=50)
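Common values

Value | Count | Frequency (%)
1 | 1 | 0.1%
48 | 1 | 0.1%
23 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
Other values (1542) | 1542 | 99.4%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
1552 | 1 | 0.1%
1551 | 1 | 0.1%
1550 | 1 | 0.1%
1549 | 1 | 0.1%
1548 | 1 | 0.1%
1547 | 1 | 0.1%
1546 | 1 | 0.1%
1545 | 1 | 0.1%
1544 | 1 | 0.1%
1543 | 1 | 0.1%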

발표년월
Categorical

Distinct: 49
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Memory size: 12.2 KiB

2012-12 | 435
2011-12 | 356
2010-12 | 202
2009-12 | 148
2010-06 | 30
Other values (44) | 381

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 3
Unique (%): 0.2%
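
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. A small sketch on toy data (not the real column) makes the difference concrete; `distinct_and_unique` is a hypothetical helper name.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of distinct values, number of values occurring exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique

# Toy column, not the real data.
months = ["2012-12", "2012-12", "2011-12", "2010-06", "2010-06", "2009-03"]
n_distinct, n_unique = distinct_and_unique(months)
print(n_distinct, n_unique)
```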

Sample

1st row: 2012-12
2nd row: 2012-12
3rd row: 2012-12
4th row: 2012-12
5th row: 2012-12

Common Values

Value | Count | Frequency (%)
2012-12 | 435 | 28.0%
2011-12 | 356 | 22.9%
2010-12 | 202 | 13.0%
2009-12 | 148 | 9.5%
2010-06 | 30 | 1.9%
2011-09 | 25 | 1.6%
2010-03 | 22 | 1.4%
2012-06 | 20 | 1.3%
2010-10 | 17 | 1.1%
2011-06 | 17 | 1.1%
Other values (39) | 280 | 18.0%
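
The frequency column and the "Other values" row follow directly from the counts. The sketch below recomputes them from the figures in this table; the only inputs are numbers already shown in this report.

```python
# Counts copied from the Common Values table for 발표년월.
top_counts = {
    "2012-12": 435, "2011-12": 356, "2010-12": 202, "2009-12": 148,
    "2010-06": 30, "2011-09": 25, "2010-03": 22, "2012-06": 20,
    "2010-10": 17, "2011-06": 17,
}
n_observations = 1552
n_distinct = 49

# Everything not listed individually is folded into "Other values".
other_count = n_observations - sum(top_counts.values())
other_distinct = n_distinct - len(top_counts)

def pct(count):
    """Share of all observations, as a percentage rounded to one decimal."""
    return round(100 * count / n_observations, 1)

for value, count in top_counts.items():
    print(f"{value} | {count} | {pct(count)}%")
print(f"Other values ({other_distinct}) | {other_count} | {pct(other_count)}%")
```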

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
2012-12 | 435 | 28.0%
2011-12 | 356 | 22.9%
2010-12 | 202 | 13.0%
2009-12 | 148 | 9.5%
2010-06 | 30 | 1.9%
2011-09 | 25 | 1.6%
2010-03 | 22 | 1.4%
2012-06 | 20 | 1.3%
2010-10 | 17 | 1.1%
2011-06 | 17 | 1.1%
Other values (39) | 280 | 18.0%

논문명
Categorical

HIGH CARDINALITY

Distinct: 1517
Distinct (%): 97.7%
Missing: 0
Missing (%): 0.0%
Memory size: 12.2 KiB

억새(Miscanthus sinensis) 종자로부터 캘러스 유도를 통한 식물체 재생에 영향을 미치는 요인 | 3
Development of a rapid detection method to detect tdh gene in Vibrio parahaemolyticus using 2-step ultrarapid real-time polymerase chain reaction | 2
Epistatic Relationships among Genes Related to Endosperm Starch Synthesis in Rice. | 2
A Compound Isolated from Rumex japonicus Induces Early Growth Response Gene-I Expression | 2
Overexpression of FTL1/DDF1, an AP2 transcription factor, enhances tolerance to cold, drought, and heat stresses in Arabidopsis thaliana | 2
Other values (1512) | 1541

Length

Max length: 234
Median length: 152
Mean length: 78.17719072
Min length: 6

Unique

Unique: 1483
Unique (%): 95.6%

Sample

1st row: 한발시 가축분뇨 액비와 경운이 사료작물 수량의 수량과 뚝새풀 발생에 끼치는 영향
2nd row: Isolation and inheritance of microsatellite loci for the oily bittering (Acheilognathus koreensis): applications for analysis of genetic diversity of wild populations
3rd row: Burkholderia denitrificans sp nov., Isolated from the Soil of Dokdo Island, Korea
4th row: Vortex-in-cell method combined with a boundary element method for incompressible viscous flow analysis
5th row: 벼 초다수 내랭성 신품종 `다산1호`

Common Values

Value | Count | Frequency (%)
억새(Miscanthus sinensis) 종자로부터 캘러스 유도를 통한 식물체 재생에 영향을 미치는 요인 | 3 | 0.2%
Development of a rapid detection method to detect tdh gene in Vibrio parahaemolyticus using 2-step ultrarapid real-time polymerase chain reaction | 2 | 0.1%
Epistatic Relationships among Genes Related to Endosperm Starch Synthesis in Rice. | 2 | 0.1%
A Compound Isolated from Rumex japonicus Induces Early Growth Response Gene-I Expression | 2 | 0.1%
Overexpression of FTL1/DDF1, an AP2 transcription factor, enhances tolerance to cold, drought, and heat stresses in Arabidopsis thaliana | 2 | 0.1%
착색단고추 재배 온실의 피복재 종류에 따른 내부 온습도 변화 | 2 | 0.1%
Proteomic identification of an embryo-specific 1Cys-Prx promoter and analysis of its activity in transgenic rice | 2 | 0.1%
Monocistronic approach using synthetic biology for micro-bial system re-engineering | 2 | 0.1%
Detection and Quantification of Major Royal Jelly Protein 1 in Honeybees by ELISA using a Monoclonal Antibody | 2 | 0.1%
Expression of the Human Tissue-Plasminogen Activator in Hairy Roots of Oriental Melon (Cucumis melo) | 2 | 0.1%
Other values (1507) | 1531 | 98.6%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
of | 1003 | 5.5%
and | 526 | 2.9%
in | 469 | 2.6%
the | 344 | 1.9%
a | 192 | 1.0%
(blank) | 182 | 1.0%
from | 165 | 0.9%
for | 132 | 0.7%
on | 96 | 0.5%
to | 92 | 0.5%
Other values (6649) | 15129 | 82.5%

저자
Categorical

HIGH CARDINALITY

Distinct: 1109
Distinct (%): 71.5%
Missing: 0
Missing (%): 0.0%
Memory size: 12.2 KiB

윤용철 | 8
김현 | 7
유미선 | 7
김우진 | 7
김호철 | 7
Other values (1104) | 1516

Length

Max length: 58
Median length: 3
Mean length: 6.491623711
Min length: 2

Unique

Unique: 839
Unique (%): 54.1%

Sample

1st row: 조광민
2nd row: 김우진
3rd row: Lee, Chang-Muk
4th row: Yoo-ChulKim
5th row: 이점호

Common Values

Value | Count | Frequency (%)
윤용철 | 8 | 0.5%
김현 | 7 | 0.5%
유미선 | 7 | 0.5%
김우진 | 7 | 0.5%
김호철 | 7 | 0.5%
이수영 | 6 | 0.4%
An, Hye Suck | 6 | 0.4%
김용균 | 6 | 0.4%
석순자 | 6 | 0.4%
최정섭 | 6 | 0.4%
Other values (1099) | 1486 | 95.7%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
kim | 58 | 2.7%
lee | 39 | 1.8%
park | 23 | 1.1%
hye | 14 | 0.7%
young | 14 | 0.7%
jung | 12 | 0.6%
kang | 11 | 0.5%
hwang | 10 | 0.5%
hyun | 10 | 0.5%
hee | 9 | 0.4%
Other values (1213) | 1923 | 90.6%

게재지명
Categorical

HIGH CARDINALITY

Distinct: 552
Distinct (%): 35.6%
Missing: 0
Missing (%): 0.0%
Memory size: 12.2 KiB

한국국제농업개발학회지 | 43
한국육종학회지 | 42
생물환경조절학회지 | 28
바이오시스템공학 | 27
한국토양비료학회지 | 27
Other values (547) | 1385

Length

Max length: 127
Median length: 65
Mean length: 20.77061856
Min length: 1

Unique

Unique: 327
Unique (%): 21.1%

Sample

1st row: 한국국제농업개발학회지
2nd row: ANIMAL CELLS AND SYSTEMS
3rd row: JOURNAL OF MICROBIOLOGY
4th row: International Jounal for Numerical Methods in Fluids
5th row: 한국육종학회지

Common Values

Value | Count | Frequency (%)
한국국제농업개발학회지 | 43 | 2.8%
한국육종학회지 | 42 | 2.7%
생물환경조절학회지 | 28 | 1.8%
바이오시스템공학 | 27 | 1.7%
한국토양비료학회지 | 27 | 1.7%
식물병연구 | 27 | 1.7%
한국수산과학회지 | 26 | 1.7%
한국동물자원과학회지 | 22 | 1.4%
KOREAN JOURNAL OF HORTICULTURAL SCIENCE & TECHNOLOGY | 21 | 1.4%
한국양봉학회지 | 20 | 1.3%
Other values (542) | 1269 | 81.8%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
of | 375 | 9.0%
journal | 367 | 8.8%
and | 195 | 4.7%
science | 114 | 2.7%
plant | 108 | 2.6%
korean | 102 | 2.5%
(blank) | 101 | 2.4%
biotechnology | 81 | 1.9%
the | 60 | 1.4%
chemistry | 55 | 1.3%
Other values (522) | 2600 | 62.5%

Interactions


Correlations


Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for points lying on a straight line the steepness of the line does not affect r; only the sign of the slope does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
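
As an illustration of this definition, a minimal pure-Python sketch follows. The function name `pearson_r` and the sample data are assumptions made for the example; the common 1/(n - 1) factors of the covariance and the standard deviations cancel, so they are omitted.

```python
import math

def pearson_r(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of products of deviations; the shared 1/(n - 1) factor cancels.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]

r = pearson_r(x, y)
# Shifting and rescaling either variable by a positive factor leaves r unchanged.
r_rescaled = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
# A perfectly decreasing linear relationship gives -1.
r_negative = pearson_r(x, [-2 * a for a in x])
print(round(r, 4), round(r_rescaled, 4), round(r_negative, 4))
```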

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
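
A minimal sketch of this calculation, assuming tied values receive the average of their ranks (a common convention, not stated in this report). The helper names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance over the product of standard deviations (shared factors cancel)."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]  # monotonic but nonlinear in x

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, round(r, 4))
```

On this data ρ reaches its maximum while r stays below 1, which is the behaviour described for nonlinear monotonic relationships.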

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
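
The pair-counting definition can be sketched directly. This is the plain variant in which tied pairs count as neither concordant nor discordant; statistical libraries often report a tie-adjusted variant instead, so results can differ when ties are present. The function name `kendall_tau_a` is an assumption made for the example.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs; tied pairs count as neither."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # one swapped pair out of six

tau = kendall_tau_a(x, y)
print(round(tau, 4))
```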

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently across categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient for a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.

Sample

First rows

Index | 순번 | 발표년월 | 논문명 | 저자 | 게재지명
0 | 1 | 2012-12 | 한발시 가축분뇨 액비와 경운이 사료작물 수량의 수량과 뚝새풀 발생에 끼치는 영향 | 조광민 | 한국국제농업개발학회지
1 | 2 | 2012-12 | Isolation and inheritance of microsatellite loci for the oily bittering (Acheilognathus koreensis): applications for analysis of genetic diversity of wild populations | 김우진 | ANIMAL CELLS AND SYSTEMS
2 | 3 | 2012-12 | Burkholderia denitrificans sp nov., Isolated from the Soil of Dokdo Island, Korea | Lee, Chang-Muk | JOURNAL OF MICROBIOLOGY
3 | 4 | 2012-12 | Vortex-in-cell method combined with a boundary element method for incompressible viscous flow analysis | Yoo-ChulKim | International Jounal for Numerical Methods in Fluids
4 | 5 | 2012-12 | 벼 초다수 내랭성 신품종 `다산1호` | 이점호 | 한국육종학회지
5 | 6 | 2012-12 | 소규모 작업장 작업자들의 인간공학적 평가 및 정량적 부하 평가 | 구혜란 | 한국농촌지도학회지
6 | 7 | 2012-12 | 천연물 유래 d-Limonene의 가시박 방제효과 | 최정섭 | 한국잡초학회지
7 | 8 | 2012-12 | 억새(Miscanthus sinensis) 종자로부터 캘러스 유도를 통한 식물체 재생에 영향을 미치는 요인 | 권영주 | 한국육종학회지
8 | 9 | 2012-12 | 큰느타리버섯 갓우수 품종 육종 | 임착한 | 한국균학회지
9 | 10 | 2012-12 | Prediction of Cobb-angle for monitoring system in idiopathic scoliosis using multiple regression analysis | 문정환 | 바이오시스템공학

Last rows

Index | 순번 | 발표년월 | 논문명 | 저자 | 게재지명
1542 | 1543 | 2013-04 | Reduced activity of ATP synthase in mitochondria causes cytoplasmic male sterility in chili pepper | JinjieLi | Planta
1543 | 1544 | 2013-04 | 태양에너지 이용에 관한 실험적 검토 | 윤용철 | 경상대학교 농업생명과학연구
1544 | 1545 | 2013-03 | The rice RING finger E3 ligase, OsHCI1, drives nuclear export of multiple substrate proteins and its heterogeneous overexpression enhances acquired thermotolerance | SungDonLim | Journal of Experimental Botany
1545 | 1546 | 2013-03 | Complete mitochondrial genome sequence and identification of a candidate gene responsible for cytoplasmic male sterility in radish (Raphanus sativus L.) containing DCGMS cytoplasm | JeeYoungPark | Theoretical and Applied Genetics
1546 | 1547 | 2013-03 | 동절기 온실의 열 손실에 관한 실태조사 | 윤용철 | 한국생물환경조절학회지
1547 | 1548 | 2013-01 | Characterization of Cellulolytic and Xylanolytic Enzymes of Bacillus licheniformis JK7 Isolated from the Rumen of a Native Korean Goat | J.K.Seo | Asian-Australasian Journal of Animal Sciences
1548 | 1549 | 2013-01 | 고들빼기 부위별 메탄올 추출물의 폴리페놀 함량 및 항산화성 연구 | 천상욱 | 한작지
1549 | 1550 | 2013-01 | The use of a frequency domain reflectometry sensor to establish a non -drainage hydroponic system with a coconut coir substrate for tomato cultivation | 최은영 | Journal of Plant Nutrition
1550 | 1551 | 2012-06 | 억새(Miscanthus sinensis) 종자로부터 캘러스 유도를 통한 식물체 재생에 영향을 미치는 요인 | 권영주 | 한국육종학회지
1551 | 1552 | 2012-06 | 억새(Miscanthus sinensis) 종자로부터 캘러스 유도를 통한 식물체 재생에 영향을 미치는 요인 | 권영주 | 한국육종학회지