Overview

Dataset statistics

Number of variables: 5
Number of observations: 1640
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 65.8 KiB
Average record size in memory: 41.1 B

Variable types

Numeric: 1
Categorical: 4

Dataset

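Description: Paper information from bio R&D in agriculture, forestry, fisheries, and food (농림수산식품 바이오R&D 논문 정보)
Author: Korea Institute of Planning and Evaluation for Technology in Food, Agriculture and Forestry (농림식품기술기획평가원)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220211000000001830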

Alerts

발표년월 (publication year-month) has a high cardinality: 51 distinct values
논문명 (paper title) has a high cardinality: 1593 distinct values
저자 (author) has a high cardinality: 1228 distinct values
게재지명 (journal name) has a high cardinality: 792 distinct values
순번 (serial number) has unique values

Reproduction

Analysis started: 2022-08-12 14:53:58.809621
Analysis finished: 2022-08-12 14:53:59.524696
Duration: 0.72 seconds
Software version: pandas-profiling v3.2.0
Download configuration: config.json

Variables

순번 (serial number)
Real number (ℝ≥0)

UNIQUE

Distinct: 1640
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 820.5
Minimum: 1
Maximum: 1640
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 14.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 82.95
Q1: 410.75
Median: 820.5
Q3: 1230.25
95-th percentile: 1558.05
Maximum: 1640
Range: 1639
Interquartile range (IQR): 819.5

Descriptive statistics

Standard deviation: 473.5715363
Coefficient of variation (CV): 0.5771743282
Kurtosis: -1.2
Mean: 820.5
Median Absolute Deviation (MAD): 410
Skewness: 0
Sum: 1345620
Variance: 224270
Monotonicity: Strictly increasing
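
The figures above are what a plain row counter would produce. The following is a minimal sketch, assuming the column holds exactly the integers 1 through 1640 (consistent with the reported minimum, maximum, distinct count, and strictly increasing order). It uses only the Python standard library, not pandas-profiling itself; the names `values` and `summarize` are illustrative.

```python
import statistics

# Assumption: the column is exactly the integers 1..1640.
values = list(range(1, 1641))


def summarize(xs):
    """Recompute a few of the reported statistics with the standard library."""
    # "inclusive" interpolates linearly between the minimum and the maximum.
    q1, median, q3 = statistics.quantiles(xs, n=4, method="inclusive")
    mean = statistics.mean(xs)
    std = statistics.stdev(xs)  # sample standard deviation (n - 1 denominator)
    # Median absolute deviation, measured around the median.
    mad = statistics.median(abs(x - median) for x in xs)
    return {
        "sum": sum(xs),
        "mean": mean,
        "variance": statistics.variance(xs),  # sample variance
        "std": std,
        "cv": std / mean,
        "q1": q1,
        "median": median,
        "q3": q3,
        "iqr": q3 - q1,
        "mad": mad,
    }


stats = summarize(values)
for name, value in stats.items():
    print(f"{name}: {value}")
```

For n consecutive integers starting at 1, the sample variance is n(n + 1)/12, which for n = 1640 gives 224270 and matches the reported value.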
Histogram with fixed size bins (bins=50)
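Common values

Value | Count | Frequency (%)
1 | 1 | 0.1%
13 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%
11 | 1 | 0.1%
Other values (1630) | 1630 | 99.4%

Smallest 10 values

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Largest 10 values

Value | Count | Frequency (%)
1640 | 1 | 0.1%
1639 | 1 | 0.1%
1638 | 1 | 0.1%
1637 | 1 | 0.1%
1636 | 1 | 0.1%
1635 | 1 | 0.1%
1634 | 1 | 0.1%
1633 | 1 | 0.1%
1632 | 1 | 0.1%
1631 | 1 | 0.1%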

발표년월 (publication year-month)
Categorical

HIGH CARDINALITY

Distinct: 51
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.9 KiB

2012-12 | 591
2011-12 | 341
2010-12 | 60
2009-12 | 56
2012-09 | 28
Other values (46) | 564

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 2
Unique (%): 0.1%
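
The gap between 51 distinct values and only 2 unique values follows from how the two counts are usually defined: Distinct counts how many different values occur, while Unique counts values that occur exactly once. The sketch below illustrates the difference on a made-up sample; `distinct_and_unique` and `sample` are hypothetical names, and the data is not taken from the dataset.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique


# Made-up sample in the same YYYY-MM shape as the column.
sample = ["2012-12", "2012-12", "2011-12", "2010-03", "2009-07"]
n_distinct, n_unique = distinct_and_unique(sample)
print(n_distinct, n_unique)
```

Here "2012-12" is distinct but not unique, because it appears twice.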

Sample

1st row: 2012-12
2nd row: 2012-12
3rd row: 2012-12
4th row: 2012-12
5th row: 2012-12

Common Values

Value | Count | Frequency (%)
2012-12 | 591 | 36.0%
2011-12 | 341 | 20.8%
2010-12 | 60 | 3.7%
2009-12 | 56 | 3.4%
2012-09 | 28 | 1.7%
2012-10 | 25 | 1.5%
2011-09 | 25 | 1.5%
2012-08 | 21 | 1.3%
2011-06 | 21 | 1.3%
2012-03 | 21 | 1.3%
Other values (41) | 451 | 27.5%

Length

Histogram of lengths of the category

논문명 (paper title)
Categorical

HIGH CARDINALITY

Distinct: 1593
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.9 KiB

Antioxidant effects of ovotransferrin and its hydrolysates | 4
Antioxidant, antimicrobial, andcytotoxic activities of ovotransferrin from egg white | 3
Caffeinated coffee, decaffeinated coffee, and the phenolic phytochemical chlorogenic acid up-regulate NQO1 expression and prevent H2O2-induced apoptosis in primary cortical neurons | 3
LC?MS-based chemotaxonomic classification of wild-type Lespedeza sp. and its correlation with genotype | 3
Crude extract of Ceriporia lacerata has a protective effect on dexamethasone-induced cytotoxicity in INS-1 cells via the modulation of PI3K/PKB activity | 3
Other values (1588) | 1624

Length

Max length: 288
Median length: 153
Mean length: 97.72378049
Min length: 11

Unique

Unique: 1553
Unique (%): 94.7%

Sample

1st row: Adenovirus Expressing Human Interferon Inhibits Replication of Foot and Mouth Disease Virus and Reduces Fatal Rate in Mice
2nd row: Zearalenone Exposure Affects the Immune Related Parameters in Lymphoid rgans and Serum of Rats Vaccinated with Porcine Parvovirus Vaccine
3rd row: Sex Identification of newly hatched chicks by fluorescence in situ hybridization using a W-specific DNA probe in feather follicle cells
4th row: Cytotoxic Effect of Zostera asiatica on Growth of Human Cancer Cells
5th row: Aromatic Hydroxyl Group Plays a Critical Role in Antibacterial Activity of the Curcumin Analogues

Common Values

Value | Count | Frequency (%)
Antioxidant effects of ovotransferrin and its hydrolysates | 4 | 0.2%
Antioxidant, antimicrobial, andcytotoxic activities of ovotransferrin from egg white | 3 | 0.2%
Caffeinated coffee, decaffeinated coffee, and the phenolic phytochemical chlorogenic acid up-regulate NQO1 expression and prevent H2O2-induced apoptosis in primary cortical neurons | 3 | 0.2%
LC?MS-based chemotaxonomic classification of wild-type Lespedeza sp. and its correlation with genotype | 3 | 0.2%
Crude extract of Ceriporia lacerata has a protective effect on dexamethasone-induced cytotoxicity in INS-1 cells via the modulation of PI3K/PKB activity | 3 | 0.2%
Hyperglycemic effect of submerged culture extract of Ceriporia lacerata in streptozotocin-induced diabetic rats | 3 | 0.2%
Quantitative Analysis of Tetracycline-Inducible Expression of the Green Fluorescent Protein Gene in Transgenic Chickens | 2 | 0.1%
Effect of Green Tea Extract/Poly-γ-Glutamic Acid Complex in Obese Type 2 Diabetic Mice. | 2 | 0.1%
Metabolomics-Based Optimal Koji Fermentation for Tyrosinase Inhibition Supplemented with Astragalus Radix | 2 | 0.1%
Protective effects of recombinant Brucella abortus Omp28 against infection with a virulent strain of Brucella abortus 544 in mice | 2 | 0.1%
Other values (1583) | 1613 | 98.4%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
of | 1401 | 6.5%
and | 767 | 3.5%
in | 734 | 3.4%
the | 443 | 2.0%
a | 321 | 1.5%
from | 275 | 1.3%
by | 197 | 0.9%
on | 176 | 0.8%
for | 160 | 0.7%
with | 141 | 0.6%
Other values (6511) | 17103 | 78.8%
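
The table above counts individual words across all titles rather than whole titles. A rough sketch of that kind of count follows. It assumes lowercasing and splitting on whitespace; the tokenizer the report actually used is not shown in the source, so punctuation handling may differ. The function name `word_frequencies` is illustrative, and the two input titles are taken from the tables above.

```python
from collections import Counter


def word_frequencies(texts):
    """Count lowercase, whitespace-separated words across all texts."""
    counts = Counter()
    for text in texts:
        counts.update(text.lower().split())
    return counts


titles = [
    "Cytotoxic Effect of Zostera asiatica on Growth of Human Cancer Cells",
    "Antioxidant effects of ovotransferrin and its hydrolysates",
]
counts = word_frequencies(titles)
total = sum(counts.values())
for word, count in counts.most_common(3):
    print(f"{word} | {count} | {100 * count / total:.1f}%")
```

Short function words dominate such a count, which is why "of", "and", and "in" lead the table.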

저자 (author)
Categorical

HIGH CARDINALITY

Distinct: 1228
Distinct (%): 74.9%
Missing: 0
Missing (%): 0.0%
Memory size: 12.9 KiB

한상미 | 13
윤형주 | 10
송재경 | 10
권해용 | 7
양인 | 7
Other values (1223) | 1593

Length

Max length: 38
Median length: 3
Mean length: 7.441463415
Min length: 1

Unique

Unique: 981
Unique (%): 59.8%

Sample

1st row: 초가기
2nd row: 최병국
3rd row: 손시환
4th row: Y. Seo
5th row: Kim, Mi Kyoung

Common Values

Value | Count | Frequency (%)
한상미 | 13 | 0.8%
윤형주 | 10 | 0.6%
송재경 | 10 | 0.6%
권해용 | 7 | 0.4%
양인 | 7 | 0.4%
수구나 | 7 | 0.4%
박승원 | 7 | 0.4%
정종화 | 6 | 0.4%
PilJoonSeo | 6 | 0.4%
김창국 | 6 | 0.4%
Other values (1218) | 1561 | 95.2%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
kim | 83 | 3.6%
park | 42 | 1.8%
lee | 39 | 1.7%
young | 19 | 0.8%
jung | 19 | 0.8%
jin | 15 | 0.6%
s | 14 | 0.6%
j | 13 | 0.6%
한상미 | 13 | 0.6%
seo | 12 | 0.5%
Other values (1334) | 2065 | 88.5%

게재지명 (journal name)
Categorical

HIGH CARDINALITY

Distinct: 792
Distinct (%): 48.3%
Missing: 0
Missing (%): 0.0%
Memory size: 12.9 KiB

한국동물번식학회지(Reproductive & Developmental Biology) | 35
PLOS ONE | 29
International Journal of Industrial Entomology | 29
한국양봉학회지 | 17
International Journal of Systematic and Evolutionary Microbiology | 16
Other values (787) | 1514

Length

Max length: 133
Median length: 57
Mean length: 26.81402439
Min length: 4

Unique

Unique: 518
Unique (%): 31.6%

Sample

1st row: Journal of Bacteriology and Virology
2nd row: Toxicological research
3rd row: The journal of poultry science
4th row: KSBB
5th row: NATURAL PRODUCT COMMUNICATIONS

Common Values

Value | Count | Frequency (%)
한국동물번식학회지(Reproductive & Developmental Biology) | 35 | 2.1%
PLOS ONE | 29 | 1.8%
International Journal of Industrial Entomology | 29 | 1.8%
한국양봉학회지 | 17 | 1.0%
International Journal of Systematic and Evolutionary Microbiology | 16 | 1.0%
ASIAN-AUSTRALASIAN JOURNAL OF ANIMAL SCIENCES | 15 | 0.9%
MOLECULES AND CELLS | 15 | 0.9%
한국잠사곤충학회지 | 15 | 0.9%
JOURNAL OF MICROBIOLOGY AND BIOTECHNOLOGY | 14 | 0.9%
생명과학회지 | 14 | 0.9%
Other values (782) | 1441 | 87.9%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
of | 546 | 9.9%
journal | 492 | 8.9%
and | 332 | 6.0%
(blank) | 136 | 2.5%
plant | 118 | 2.1%
science | 113 | 2.0%
international | 106 | 1.9%
research | 106 | 1.9%
biotechnology | 104 | 1.9%
microbiology | 96 | 1.7%
Other values (661) | 3389 | 61.2%

Interactions


Correlations


Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and positive rescaling of the two variables, so for a linear relationship the angle of the line to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
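
The calculation described above can be sketched in plain Python. This is an illustrative implementation, not the code the report ran; `pearson_r` is a hypothetical name. The 1/(n - 1) factors in the covariance and in the standard deviations cancel, so the sketch works directly with sums of deviations.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of products of deviations; the common 1/(n - 1) factor cancels.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / math.sqrt(var_x * var_y)


xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]
r = pearson_r(xs, ys)
# Shifting and positively rescaling xs leaves r unchanged.
r_shifted = pearson_r([3 * x + 5 for x in xs], ys)
print(r, r_shifted)
```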

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
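
A minimal sketch of this rank-based calculation follows. It assumes tied values receive the average of the ranks they span, which is a common convention but is not stated in the source; `average_ranks` and `spearman_rho` are illustrative names.

```python
import math


def average_ranks(values):
    """Rank values from 1, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Pearson correlation computed on the rank variables of xs and ys."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


# A nonlinear but strictly increasing relationship: y = x cubed.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
rho = spearman_rho(xs, ys)
print(rho)
```

Because only the ordering matters, the cubic relationship yields ρ of 1 even though it is not linear.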

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
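
The pair-counting definition above corresponds to the variant often called tau-a. The sketch below implements exactly that definition; libraries commonly apply a tie-corrected variant (tau-b) instead, and the source does not state which one produced the plot. The name `kendall_tau_a` is illustrative.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = pairs = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        pairs += 1
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1  # both variables order the pair the same way
        elif sign < 0:
            discordant += 1  # the variables order the pair in opposite ways
        # sign == 0 means a tie in x or y; it counts toward neither.
    if pairs == 0:
        raise ValueError("need at least two observations")
    return (concordant - discordant) / pairs


xs = [1, 2, 3, 4]
ys = [1, 3, 2, 4]
tau = kendall_tau_a(xs, ys)
print(tau)
```

In this example 5 of the 6 pairs are concordant and 1 is discordant, so τ is (5 - 1)/6.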

Phik (φk)

Phik (φk) is a practical correlation coefficient that works consistently across categorical, ordinal, and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient for a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

(index) | 순번 | 발표년월 | 논문명 | 저자 | 게재지명
0 | 1 | 2012-12 | Adenovirus Expressing Human Interferon Inhibits Replication of Foot and Mouth Disease Virus and Reduces Fatal Rate in Mice | 초가기 | Journal of Bacteriology and Virology
1 | 2 | 2012-12 | Zearalenone Exposure Affects the Immune Related Parameters in Lymphoid rgans and Serum of Rats Vaccinated with Porcine Parvovirus Vaccine | 최병국 | Toxicological research
2 | 3 | 2012-12 | Sex Identification of newly hatched chicks by fluorescence in situ hybridization using a W-specific DNA probe in feather follicle cells | 손시환 | The journal of poultry science
3 | 4 | 2012-12 | Cytotoxic Effect of Zostera asiatica on Growth of Human Cancer Cells | Y. Seo | KSBB
4 | 5 | 2012-12 | Aromatic Hydroxyl Group Plays a Critical Role in Antibacterial Activity of the Curcumin Analogues | Kim, Mi Kyoung | NATURAL PRODUCT COMMUNICATIONS
5 | 6 | 2012-12 | Porcine LMNA Is a Positional Candidate Gene Associated with Growth and Fat Deposition | 최봉환 | ASIAN-AUSTRALASIAN JOURNAL OF ANIMAL SCIENCES
6 | 7 | 2012-12 | 정제봉독을 함유한 봉독화장품의 여드름 개선 효과 | 한상미 | 한국미용학회지
7 | 8 | 2012-12 | QTL analyses of heterosis for grain yield and yield-related traits in indica-japonica crosses of rice (Oryza sativa L.) | 추상호 | GENES and GENOMICS
8 | 9 | 2012-12 | Post-transcriptional regulation of Gcn5, a putative regulator of Hox in mouse embryonic fibroblast cells | 이유라 | Journal of experimental and biomedical science(한국의생명과학회지)
9 | 10 | 2012-12 | Complete Genome Analysis of Porcine Enterovirus B Isolated in Korea | 문형준 | Journal of Virology

Last rows

(index) | 순번 | 발표년월 | 논문명 | 저자 | 게재지명
1630 | 1631 | 2012-07 | Mucilaginibacter jinjuensis sp. nov., with xylan degrading activity | 하지칸 | International Journal of Systematic & Evolutionary Microbiology
1631 | 1632 | 2012-07 | LC?MS-based chemotaxonomic classification of wild-type Lespedeza sp. and its correlation with genotype | Young Mi Kim | Plant Cell Reports
1632 | 1633 | 2012-05 | Metabolomics-Based Optimal Koji Fermentation for Tyrosinase Inhibition Supplemented with Astragalus Radix | Ah Jin KIM | Bioscience, Biotechnology, and Biochemistry
1633 | 1634 | 2012-04 | Caffeinated coffee, decaffeinated coffee, and the phenolic phytochemical chlorogenic acid up-regulate NQO1 expression and prevent H2O2-induced apoptosis in primary cortical neurons | 김지영 | Neurochemistry International
1634 | 1635 | 2012-04 | Caffeinated coffee, decaffeinated coffee, and the phenolic phytochemical chlorogenic acid up-regulate NQO1 expression and prevent H2O2-induced apoptosis in primary cortical neurons | 김지영 | Neurochemistry International
1635 | 1636 | 2012-03 | Effects of 12-week oral supplementation of Ecklonia cava polyphenils on anthropometric and blood lipid parameters in overweight korean individuals | Hyeon¡©Cheol Shin | PHYTOTHERAPY RESEARCH
1636 | 1637 | 2012-03 | Extract from Dioscorea batatas Ameliorates Insulin Resistance in Mice Fed a High-Fat Diet | Soyoung Kim | JOURNAL OF MEDICINAL FOOD
1637 | 1638 | 2012-02 | Chitinophaga oryziterrae sp. nov., isolated from the rhizosphere soil of rice (Oryza sativa L.) | 정유진 | International Journal of Systematic & Evolutionary Microbiology
1638 | 1639 | 2012-01 | Diversity and Characterization of Endophytic Bacteria Associated with Tidal Flat PlantsAntagonistic to Oomycete Plant Pathogens | 페미다 | The Plant Pathology Journal
1639 | 1640 | 2012-01 | Anti-hyperlipidemic Effect of Polyphenol Extract (SeapolynolTM) and Dieckol | Yung Choon Yoo | Prev Nutr Food Sci