Overview

Dataset statistics

Number of variables3
Number of observations578
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.2 KiB
Average record size in memory25.2 B

Variable types

Numeric1
Categorical1
Text1

Dataset

Description국립생태원 연구과제관리정보를 나타낸 자료로써 동식물, 생태, 자연 등에 관련한 연구개발성과_논문 데이터 입니다.
Author국립생태원
URLhttps://www.data.go.kr/data/15088002/fileData.do

Alerts

분야 has constant value ""Constant
일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:47:17.387775
Analysis finished2023-12-12 09:47:17.937801
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

UNIQUE 

Distinct578
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean289.5
Minimum1
Maximum578
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.2 KiB
2023-12-12T18:47:18.032478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile29.85
Q1145.25
median289.5
Q3433.75
95-th percentile549.15
Maximum578
Range577
Interquartile range (IQR)288.5

Descriptive statistics

Standard deviation166.9985
Coefficient of variation (CV)0.57685148
Kurtosis-1.2
Mean289.5
Median Absolute Deviation (MAD)144.5
Skewness0
Sum167331
Variance27888.5
MonotonicityStrictly increasing
2023-12-12T18:47:18.206760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
435 1
 
0.2%
383 1
 
0.2%
384 1
 
0.2%
385 1
 
0.2%
386 1
 
0.2%
387 1
 
0.2%
388 1
 
0.2%
389 1
 
0.2%
390 1
 
0.2%
Other values (568) 568
98.3%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
578 1
0.2%
577 1
0.2%
576 1
0.2%
575 1
0.2%
574 1
0.2%
573 1
0.2%
572 1
0.2%
571 1
0.2%
570 1
0.2%
569 1
0.2%

분야
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.6 KiB
국내외 학술지 게재(논문)
578 

Length

Max length14
Median length14
Mean length14
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내외 학술지 게재(논문)
2nd row국내외 학술지 게재(논문)
3rd row국내외 학술지 게재(논문)
4th row국내외 학술지 게재(논문)
5th row국내외 학술지 게재(논문)

Common Values

ValueCountFrequency (%)
국내외 학술지 게재(논문) 578
100.0%

Length

2023-12-12T18:47:18.388085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:47:18.523836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외 578
33.3%
학술지 578
33.3%
게재(논문 578
33.3%
Distinct519
Distinct (%)89.8%
Missing0
Missing (%)0.0%
Memory size4.6 KiB
2023-12-12T18:47:18.855452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length222
Median length135
Mean length80.980969
Min length7

Characters and Unicode

Total characters46807
Distinct characters480
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique474 ?
Unique (%)82.0%

Sample

1st rowMapping Interests by Stakeholders’ Subjectivities toward Ecotourism Resources: The Case of Seocheon-Gun, Korea
2nd row멸종위기 개리 (Anser cygnoides)의 월동기 서식지 이용과 먹이원
3rd rowSequencing and analyzing complete mitochondrial genome of Anser cygnoides (Anserini: Anserinae)
4th rowDifferential predation drives the geographical divergence in multiple traits in aposematic frogs
5th rowApplying the Concept of Perceived Restoration to the Case of Cheonggyecheon Stream Park in Seoul, Korea
ValueCountFrequency (%)
of 373
 
5.3%
in 233
 
3.3%
and 190
 
2.7%
the 171
 
2.4%
korea 113
 
1.6%
a 73
 
1.0%
on 63
 
0.9%
연구 51
 
0.7%
south 49
 
0.7%
to 46
 
0.7%
Other values (2678) 5619
80.5%
2023-12-12T18:47:19.408841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6459
 
13.8%
e 3333
 
7.1%
i 3124
 
6.7%
a 2892
 
6.2%
n 2780
 
5.9%
o 2774
 
5.9%
t 2465
 
5.3%
r 2010
 
4.3%
s 2005
 
4.3%
l 1362
 
2.9%
Other values (470) 17603
37.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 31322
66.9%
Space Separator 6459
 
13.8%
Other Letter 5582
 
11.9%
Uppercase Letter 2503
 
5.3%
Other Punctuation 259
 
0.6%
Dash Punctuation 188
 
0.4%
Decimal Number 172
 
0.4%
Open Punctuation 150
 
0.3%
Close Punctuation 149
 
0.3%
Final Punctuation 15
 
< 0.1%
Other values (3) 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
178
 
3.2%
149
 
2.7%
133
 
2.4%
102
 
1.8%
100
 
1.8%
94
 
1.7%
92
 
1.6%
85
 
1.5%
83
 
1.5%
81
 
1.5%
Other values (388) 4485
80.3%
Lowercase Letter
ValueCountFrequency (%)
e 3333
10.6%
i 3124
10.0%
a 2892
 
9.2%
n 2780
 
8.9%
o 2774
 
8.9%
t 2465
 
7.9%
r 2010
 
6.4%
s 2005
 
6.4%
l 1362
 
4.3%
c 1156
 
3.7%
Other values (19) 7421
23.7%
Uppercase Letter
ValueCountFrequency (%)
A 266
 
10.6%
C 243
 
9.7%
S 223
 
8.9%
P 179
 
7.2%
K 149
 
6.0%
M 142
 
5.7%
R 134
 
5.4%
D 131
 
5.2%
E 128
 
5.1%
T 125
 
5.0%
Other values (16) 783
31.3%
Decimal Number
ValueCountFrequency (%)
1 50
29.1%
2 25
14.5%
3 17
 
9.9%
5 15
 
8.7%
7 15
 
8.7%
0 14
 
8.1%
8 14
 
8.1%
6 9
 
5.2%
4 7
 
4.1%
9 6
 
3.5%
Other Punctuation
ValueCountFrequency (%)
: 108
41.7%
, 108
41.7%
. 30
 
11.6%
· 4
 
1.5%
& 4
 
1.5%
/ 3
 
1.2%
' 1
 
0.4%
; 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
~ 4
80.0%
+ 1
 
20.0%
Space Separator
ValueCountFrequency (%)
6459
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 188
100.0%
Open Punctuation
ValueCountFrequency (%)
( 150
100.0%
Close Punctuation
ValueCountFrequency (%)
) 149
100.0%
Final Punctuation
ValueCountFrequency (%)
15
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 33823
72.3%
Common 7398
 
15.8%
Hangul 5582
 
11.9%
Greek 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
178
 
3.2%
149
 
2.7%
133
 
2.4%
102
 
1.8%
100
 
1.8%
94
 
1.7%
92
 
1.6%
85
 
1.5%
83
 
1.5%
81
 
1.5%
Other values (388) 4485
80.3%
Latin
ValueCountFrequency (%)
e 3333
 
9.9%
i 3124
 
9.2%
a 2892
 
8.6%
n 2780
 
8.2%
o 2774
 
8.2%
t 2465
 
7.3%
r 2010
 
5.9%
s 2005
 
5.9%
l 1362
 
4.0%
c 1156
 
3.4%
Other values (43) 9922
29.3%
Common
ValueCountFrequency (%)
6459
87.3%
- 188
 
2.5%
( 150
 
2.0%
) 149
 
2.0%
: 108
 
1.5%
, 108
 
1.5%
1 50
 
0.7%
. 30
 
0.4%
2 25
 
0.3%
3 17
 
0.2%
Other values (16) 114
 
1.5%
Greek
ValueCountFrequency (%)
α 2
50.0%
β 1
25.0%
κ 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 41199
88.0%
Hangul 5582
 
11.9%
Punctuation 16
 
< 0.1%
None 8
 
< 0.1%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6459
15.7%
e 3333
 
8.1%
i 3124
 
7.6%
a 2892
 
7.0%
n 2780
 
6.7%
o 2774
 
6.7%
t 2465
 
6.0%
r 2010
 
4.9%
s 2005
 
4.9%
l 1362
 
3.3%
Other values (65) 11995
29.1%
Hangul
ValueCountFrequency (%)
178
 
3.2%
149
 
2.7%
133
 
2.4%
102
 
1.8%
100
 
1.8%
94
 
1.7%
92
 
1.6%
85
 
1.5%
83
 
1.5%
81
 
1.5%
Other values (388) 4485
80.3%
Punctuation
ValueCountFrequency (%)
15
93.8%
1
 
6.2%
None
ValueCountFrequency (%)
· 4
50.0%
α 2
25.0%
β 1
 
12.5%
κ 1
 
12.5%
Number Forms
ValueCountFrequency (%)
2
100.0%

Interactions

2023-12-12T18:47:17.656059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T18:47:17.827840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:47:17.905208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일련번호분야논문명
01국내외 학술지 게재(논문)Mapping Interests by Stakeholders’ Subjectivities toward Ecotourism Resources: The Case of Seocheon-Gun, Korea
12국내외 학술지 게재(논문)멸종위기 개리 (Anser cygnoides)의 월동기 서식지 이용과 먹이원
23국내외 학술지 게재(논문)Sequencing and analyzing complete mitochondrial genome of Anser cygnoides (Anserini: Anserinae)
34국내외 학술지 게재(논문)Differential predation drives the geographical divergence in multiple traits in aposematic frogs
45국내외 학술지 게재(논문)Applying the Concept of Perceived Restoration to the Case of Cheonggyecheon Stream Park in Seoul, Korea
56국내외 학술지 게재(논문)Meta-corridor solutions for climate-vulnerable plant species groups in South Korea
67국내외 학술지 게재(논문)Performing Ecosystem Services at Mud Flats in Seocheon, Korea: Using Q Methodology for Cooperative Decision Making
78국내외 학술지 게재(논문)Stakeholders' views on reducing financial support in government-led ecotourism areas
89국내외 학술지 게재(논문)Government-led Ecotourism and Resident-led Ecotourism
910국내외 학술지 게재(논문)건강환경 조성을 위한 주의회복이론 관점의 치유환경 고찰
일련번호분야논문명
568569국내외 학술지 게재(논문)영광 불갑천 하구역의 유역환경 특성과 지속가능관리 방안
569570국내외 학술지 게재(논문)Distribution of Fish Species in Wetland Protected Areas in South Korea
570571국내외 학술지 게재(논문)다중채널 자동챔버시스템에 의한 삼림토양의 이산화탄소 유출량의 연속측정
571572국내외 학술지 게재(논문)Spatial Movement Patterns and Local Co-Occurrence of Nutria Individuals in Association with Habitats Using Geo-Self-Organizing Map
572573국내외 학술지 게재(논문)Diagnostic Evaluation and Preparation of the Reference Information for River Restoration in South Korea
573574국내외 학술지 게재(논문)Phenological Changes of Mongolian Oak Depending on the Micro-Climate Changes Due to Urbanization
574575국내외 학술지 게재(논문)지역의 미래 환경 비전 공간화를 위한 토지이용계획 시나리오 체계 연구
575576국내외 학술지 게재(논문)Evaluation of morphological, physiological, and biochemical traits for assessing drought resistance in eleven tree species
576577국내외 학술지 게재(논문)Analysis of Arthropod Communities in Sunflower-cultivated Fields to Develop Risk Assessment Guidelines for LMO Used for Environmental Remediation
577578국내외 학술지 게재(논문)Characteristics of Vascular Plants in Yongyangbo Wetlands