Overview

Dataset statistics

Number of variables4
Number of observations1413
Missing cells5
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory47.0 KiB
Average record size in memory34.1 B

Variable types

Numeric2
Text2

Dataset

Description국립생태원 연구과제관리정보를 나타낸 자료로써 동식물, 생태, 자연 등에 관련한 연구개발과제_기본 데이터 입니다.
Author국립생태원
URLhttps://www.data.go.kr/data/15087999/fileData.do

Alerts

일련번호 is highly overall correlated with 수행연도High correlation
수행연도 is highly overall correlated with 일련번호High correlation
일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:48:38.912703
Analysis finished2023-12-12 14:48:40.161637
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1413
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean707
Minimum1
Maximum1413
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.5 KiB
2023-12-12T23:48:40.262320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile71.6
Q1354
median707
Q31060
95-th percentile1342.4
Maximum1413
Range1412
Interquartile range (IQR)706

Descriptive statistics

Standard deviation408.04228
Coefficient of variation (CV)0.57714608
Kurtosis-1.2
Mean707
Median Absolute Deviation (MAD)353
Skewness0
Sum998991
Variance166498.5
MonotonicityStrictly increasing
2023-12-12T23:48:40.408649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
885 1
 
0.1%
949 1
 
0.1%
948 1
 
0.1%
947 1
 
0.1%
946 1
 
0.1%
945 1
 
0.1%
944 1
 
0.1%
943 1
 
0.1%
942 1
 
0.1%
Other values (1403) 1403
99.3%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1413 1
0.1%
1412 1
0.1%
1411 1
0.1%
1410 1
0.1%
1409 1
0.1%
1408 1
0.1%
1407 1
0.1%
1406 1
0.1%
1405 1
0.1%
1404 1
0.1%

수행연도
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2018.2781
Minimum2014
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.5 KiB
2023-12-12T23:48:40.576202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2014
5-th percentile2015
Q12017
median2018
Q32020
95-th percentile2021
Maximum2021
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7639959
Coefficient of variation (CV)0.00087401031
Kurtosis-0.49499582
Mean2018.2781
Median Absolute Deviation (MAD)1
Skewness-0.22371989
Sum2851827
Variance3.1116815
MonotonicityNot monotonic
2023-12-12T23:48:40.745018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
2017 374
26.5%
2019 252
17.8%
2018 227
16.1%
2020 215
15.2%
2021 181
12.8%
2016 75
 
5.3%
2015 48
 
3.4%
2014 41
 
2.9%
ValueCountFrequency (%)
2014 41
 
2.9%
2015 48
 
3.4%
2016 75
 
5.3%
2017 374
26.5%
2018 227
16.1%
2019 252
17.8%
2020 215
15.2%
2021 181
12.8%
ValueCountFrequency (%)
2021 181
12.8%
2020 215
15.2%
2019 252
17.8%
2018 227
16.1%
2017 374
26.5%
2016 75
 
5.3%
2015 48
 
3.4%
2014 41
 
2.9%
Distinct429
Distinct (%)30.4%
Missing0
Missing (%)0.0%
Memory size11.2 KiB
2023-12-12T23:48:41.034021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length97
Median length48
Mean length23.676575
Min length5

Characters and Unicode

Total characters33455
Distinct characters403
Distinct categories15 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique191 ?
Unique (%)13.5%

Sample

1st row제4차 전국자연환경조사
2nd row합성 Cry 살충 단백질을 이용한 토양미생물 군집 변화 (I)
3rd row유전자재조합생물체(LMO)의 동시검출을 위한 Multiplex-PCR법 개발 (II)
4th rowLMO가 국내 곤충상에 미치는 영향 세부평가 및 평가기준 개발연구(III)
5th rowLM작물과 국내근연종간의 유전자이동성 평가기법 개발(III)
ValueCountFrequency (%)
연구 510
 
6.7%
393
 
5.1%
구축 179
 
2.3%
생태계 169
 
2.2%
평가 130
 
1.7%
생태계서비스 102
 
1.3%
기후변화 100
 
1.3%
기반 94
 
1.2%
정밀조사 92
 
1.2%
위한 80
 
1.0%
Other values (817) 5790
75.8%
2023-12-12T23:48:41.586705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6283
 
18.8%
1252
 
3.7%
1040
 
3.1%
932
 
2.8%
885
 
2.6%
709
 
2.1%
588
 
1.8%
526
 
1.6%
488
 
1.5%
398
 
1.2%
Other values (393) 20354
60.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24089
72.0%
Space Separator 6283
 
18.8%
Decimal Number 1142
 
3.4%
Uppercase Letter 979
 
2.9%
Close Punctuation 232
 
0.7%
Open Punctuation 232
 
0.7%
Lowercase Letter 218
 
0.7%
Other Punctuation 171
 
0.5%
Dash Punctuation 63
 
0.2%
Modifier Symbol 21
 
0.1%
Other values (5) 25
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1252
 
5.2%
1040
 
4.3%
932
 
3.9%
885
 
3.7%
709
 
2.9%
588
 
2.4%
526
 
2.2%
488
 
2.0%
398
 
1.7%
394
 
1.6%
Other values (330) 16877
70.1%
Uppercase Letter
ValueCountFrequency (%)
M 174
17.8%
I 157
16.0%
L 135
13.8%
O 134
13.7%
B 81
8.3%
D 70
7.2%
S 56
 
5.7%
E 51
 
5.2%
Z 35
 
3.6%
G 29
 
3.0%
Other values (7) 57
 
5.8%
Lowercase Letter
ValueCountFrequency (%)
c 36
16.5%
o 30
13.8%
a 30
13.8%
n 24
11.0%
k 24
11.0%
i 14
 
6.4%
s 12
 
5.5%
l 10
 
4.6%
p 8
 
3.7%
t 8
 
3.7%
Other values (6) 22
10.1%
Decimal Number
ValueCountFrequency (%)
2 352
30.8%
1 275
24.1%
0 256
22.4%
8 61
 
5.3%
9 58
 
5.1%
4 56
 
4.9%
3 30
 
2.6%
7 24
 
2.1%
5 19
 
1.7%
6 11
 
1.0%
Other Punctuation
ValueCountFrequency (%)
: 57
33.3%
· 39
22.8%
, 35
20.5%
' 24
14.0%
. 13
 
7.6%
/ 3
 
1.8%
Letter Number
ValueCountFrequency (%)
6
40.0%
6
40.0%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Space Separator
ValueCountFrequency (%)
6283
100.0%
Close Punctuation
ValueCountFrequency (%)
) 232
100.0%
Open Punctuation
ValueCountFrequency (%)
( 232
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 63
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 21
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24089
72.0%
Common 8154
 
24.4%
Latin 1212
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1252
 
5.2%
1040
 
4.3%
932
 
3.9%
885
 
3.7%
709
 
2.9%
588
 
2.4%
526
 
2.2%
488
 
2.0%
398
 
1.7%
394
 
1.6%
Other values (330) 16877
70.1%
Latin
ValueCountFrequency (%)
M 174
14.4%
I 157
13.0%
L 135
11.1%
O 134
11.1%
B 81
 
6.7%
D 70
 
5.8%
S 56
 
4.6%
E 51
 
4.2%
c 36
 
3.0%
Z 35
 
2.9%
Other values (28) 283
23.3%
Common
ValueCountFrequency (%)
6283
77.1%
2 352
 
4.3%
1 275
 
3.4%
0 256
 
3.1%
) 232
 
2.8%
( 232
 
2.8%
- 63
 
0.8%
8 61
 
0.7%
9 58
 
0.7%
: 57
 
0.7%
Other values (15) 285
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24088
72.0%
ASCII 9307
 
27.8%
None 39
 
0.1%
Number Forms 15
 
< 0.1%
Punctuation 5
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6283
67.5%
2 352
 
3.8%
1 275
 
3.0%
0 256
 
2.8%
) 232
 
2.5%
( 232
 
2.5%
M 174
 
1.9%
I 157
 
1.7%
L 135
 
1.5%
O 134
 
1.4%
Other values (45) 1077
 
11.6%
Hangul
ValueCountFrequency (%)
1252
 
5.2%
1040
 
4.3%
932
 
3.9%
885
 
3.7%
709
 
2.9%
588
 
2.4%
526
 
2.2%
488
 
2.0%
398
 
1.7%
394
 
1.6%
Other values (329) 16876
70.1%
None
ValueCountFrequency (%)
· 39
100.0%
Number Forms
ValueCountFrequency (%)
6
40.0%
6
40.0%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Punctuation
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct65
Distinct (%)4.6%
Missing5
Missing (%)0.4%
Memory size11.2 KiB
2023-12-12T23:48:41.879308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length7
Mean length6.6292614
Min length2

Characters and Unicode

Total characters9334
Distinct characters111
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.7%

Sample

1st row자연환경조사부
2nd row위해생물연구부
3rd row위해생물연구부
4th row위해생물연구부
5th row위해생물연구부
ValueCountFrequency (%)
생태보전연구실 289
20.2%
생태기반연구실 158
 
11.1%
융합연구실 132
 
9.2%
국제협력팀 63
 
4.4%
lmo연구팀 52
 
3.6%
생태조사연구실 50
 
3.5%
생태평가연구실 50
 
3.5%
환경영향평가팀 43
 
3.0%
습지연구팀 35
 
2.5%
특정보호지역조사팀 32
 
2.2%
Other values (60) 524
36.7%
2023-12-12T23:48:42.301724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1155
 
12.4%
1098
 
11.8%
817
 
8.8%
767
 
8.2%
696
 
7.5%
592
 
6.3%
361
 
3.9%
305
 
3.3%
248
 
2.7%
158
 
1.7%
Other values (101) 3137
33.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9102
97.5%
Uppercase Letter 183
 
2.0%
Space Separator 20
 
0.2%
Decimal Number 12
 
0.1%
Open Punctuation 7
 
0.1%
Close Punctuation 7
 
0.1%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1155
 
12.7%
1098
 
12.1%
817
 
9.0%
767
 
8.4%
696
 
7.6%
592
 
6.5%
361
 
4.0%
305
 
3.4%
248
 
2.7%
158
 
1.7%
Other values (84) 2905
31.9%
Uppercase Letter
ValueCountFrequency (%)
O 52
28.4%
M 52
28.4%
L 52
28.4%
F 10
 
5.5%
T 9
 
4.9%
D 3
 
1.6%
R 3
 
1.6%
A 1
 
0.5%
S 1
 
0.5%
Decimal Number
ValueCountFrequency (%)
3 7
58.3%
1 3
25.0%
4 1
 
8.3%
2 1
 
8.3%
Space Separator
ValueCountFrequency (%)
20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Other Punctuation
ValueCountFrequency (%)
& 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9102
97.5%
Latin 183
 
2.0%
Common 49
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1155
 
12.7%
1098
 
12.1%
817
 
9.0%
767
 
8.4%
696
 
7.6%
592
 
6.5%
361
 
4.0%
305
 
3.4%
248
 
2.7%
158
 
1.7%
Other values (84) 2905
31.9%
Latin
ValueCountFrequency (%)
O 52
28.4%
M 52
28.4%
L 52
28.4%
F 10
 
5.5%
T 9
 
4.9%
D 3
 
1.6%
R 3
 
1.6%
A 1
 
0.5%
S 1
 
0.5%
Common
ValueCountFrequency (%)
20
40.8%
( 7
 
14.3%
) 7
 
14.3%
3 7
 
14.3%
1 3
 
6.1%
& 3
 
6.1%
4 1
 
2.0%
2 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9102
97.5%
ASCII 232
 
2.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1155
 
12.7%
1098
 
12.1%
817
 
9.0%
767
 
8.4%
696
 
7.6%
592
 
6.5%
361
 
4.0%
305
 
3.4%
248
 
2.7%
158
 
1.7%
Other values (84) 2905
31.9%
ASCII
ValueCountFrequency (%)
O 52
22.4%
M 52
22.4%
L 52
22.4%
20
 
8.6%
F 10
 
4.3%
T 9
 
3.9%
( 7
 
3.0%
) 7
 
3.0%
3 7
 
3.0%
D 3
 
1.3%
Other values (7) 13
 
5.6%

Interactions

2023-12-12T23:48:39.631033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:48:39.340189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:48:39.806199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:48:39.486366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:48:42.426249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호수행연도주관연구기관부서
일련번호1.0000.8440.775
수행연도0.8441.0000.868
주관연구기관부서0.7750.8681.000
2023-12-12T23:48:42.521245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호수행연도
일련번호1.0000.900
수행연도0.9001.000

Missing values

2023-12-12T23:48:39.993747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:48:40.111250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일련번호수행연도과제명주관연구기관부서
012014제4차 전국자연환경조사자연환경조사부
122014합성 Cry 살충 단백질을 이용한 토양미생물 군집 변화 (I)위해생물연구부
232014유전자재조합생물체(LMO)의 동시검출을 위한 Multiplex-PCR법 개발 (II)위해생물연구부
342014LMO가 국내 곤충상에 미치는 영향 세부평가 및 평가기준 개발연구(III)위해생물연구부
452014LM작물과 국내근연종간의 유전자이동성 평가기법 개발(III)위해생물연구부
562014LMO 자연환경 모니터링 및 사후관리 연구(VI)위해생물연구부
672014외래생물 정밀조사위해생물연구부
782014생태계교란종 모니터링위해생물연구부
892014국가장기생태연구생태평가부
9102014생태정보시스템 구축 및 활용생태정보연구부
일련번호수행연도과제명주관연구기관부서
1403140420202020년 환경부 소관 LMO 위해성평가 체계 구축 및 평가기관 운영LMO연구팀
140414052021지역의 생태가치 평가 및 인식 증진방안 연구생태계서비스팀
1405140620202020년 유전자 기반 LMO의 안전성 평가기반 구축LMO연구팀
1406140720202020년 LMO 자연환경 모니터링 및 사후관리 연구LMO연구팀
140714082020도시생태현황지도 용역단가 산정기준 연구생태자연도연구팀
140814092020람사르습지도시 운영관리 평가체계 구축습지협력팀
140914102021생태계의 기후변화 리스크에 대응한 적응역량 강화 연구기후변화연구팀
141014112021핵심 생태자산과 생태계서비스 가치평가 및 보전방안생태계서비스팀
141114122021멸종위기 야생생물 인공증식 제도 개선방안 마련 연구복원평가분석팀
141214132021국가 보호지역 관리효과성평가(MEE) 지침 마련습지센터(실)