Overview

Dataset statistics

Number of variables9
Number of observations49
Missing cells47
Missing cells (%)10.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.7 KiB
Average record size in memory76.7 B

Variable types

Numeric1
Text3
Categorical5

Dataset

Description세계자연보전연맹(IUCN) 적색목록(멸종위기에 처한)에 대한 초본, 목본 등 49종 목록을 제공하고자 합니다.
Author산림청
URLhttps://www.data.go.kr/data/15090611/fileData.do

Alerts

천연기념물 has constant value ""Constant
멸종위기식물(2017년12월) is highly overall correlated with 순번 and 4 other fieldsHigh correlation
평가시 세부항목 is highly overall correlated with 멸종위기수준 and 1 other fieldsHigh correlation
종류 is highly overall correlated with 멸종위기식물(2017년12월)High correlation
등록년도 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
멸종위기수준 is highly overall correlated with 순번 and 2 other fieldsHigh correlation
순번 is highly overall correlated with 멸종위기수준 and 2 other fieldsHigh correlation
천연기념물 has 47 (95.9%) missing valuesMissing
순번 has unique valuesUnique
학명 has unique valuesUnique
국내명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:24:22.958184
Analysis finished2023-12-12 09:24:23.705922
Duration0.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25
Minimum1
Maximum49
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-12T18:24:23.783479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.4
Q113
median25
Q337
95-th percentile46.6
Maximum49
Range48
Interquartile range (IQR)24

Descriptive statistics

Standard deviation14.28869
Coefficient of variation (CV)0.57154761
Kurtosis-1.2
Mean25
Median Absolute Deviation (MAD)12
Skewness0
Sum1225
Variance204.16667
MonotonicityStrictly increasing
2023-12-12T18:24:23.936543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
1 1
 
2.0%
38 1
 
2.0%
28 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%
40 1
2.0%

학명
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-12T18:24:24.228467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length34
Mean length29.367347
Min length18

Characters and Unicode

Total characters1439
Distinct characters53
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st rowBupleurum latissimum Nakai
2nd rowGlochidion chodoense J.S.Lee & Im
3rd rowMankyua chejuense B.Y.Sun & M.H.Kim & C.H.Kim
4th rowAnemone maxima Nakai
5th rowPentactina rupicola Nakai
ValueCountFrequency (%)
nakai 30
 
16.9%
7
 
4.0%
h.lev 6
 
3.4%
koreana 3
 
1.7%
anemone 2
 
1.1%
iris 2
 
1.1%
koraiensis 2
 
1.1%
coreanum 2
 
1.1%
saniculifolia 2
 
1.1%
subsessilis 2
 
1.1%
Other values (110) 119
67.2%
2023-12-12T18:24:24.719744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 180
 
12.5%
i 151
 
10.5%
132
 
9.2%
e 98
 
6.8%
s 81
 
5.6%
o 68
 
4.7%
n 65
 
4.5%
r 55
 
3.8%
m 53
 
3.7%
u 51
 
3.5%
Other values (43) 505
35.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1093
76.0%
Uppercase Letter 144
 
10.0%
Space Separator 132
 
9.2%
Other Punctuation 44
 
3.1%
Close Punctuation 13
 
0.9%
Open Punctuation 13
 
0.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 180
16.5%
i 151
13.8%
e 98
 
9.0%
s 81
 
7.4%
o 68
 
6.2%
n 65
 
5.9%
r 55
 
5.0%
m 53
 
4.8%
u 51
 
4.7%
l 44
 
4.0%
Other values (16) 247
22.6%
Uppercase Letter
ValueCountFrequency (%)
N 31
21.5%
H 15
10.4%
L 12
 
8.3%
S 10
 
6.9%
C 9
 
6.2%
B 8
 
5.6%
K 8
 
5.6%
A 7
 
4.9%
M 7
 
4.9%
P 5
 
3.5%
Other values (12) 32
22.2%
Other Punctuation
ValueCountFrequency (%)
. 37
84.1%
& 7
 
15.9%
Space Separator
ValueCountFrequency (%)
132
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1237
86.0%
Common 202
 
14.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 180
14.6%
i 151
 
12.2%
e 98
 
7.9%
s 81
 
6.5%
o 68
 
5.5%
n 65
 
5.3%
r 55
 
4.4%
m 53
 
4.3%
u 51
 
4.1%
l 44
 
3.6%
Other values (38) 391
31.6%
Common
ValueCountFrequency (%)
132
65.3%
. 37
 
18.3%
) 13
 
6.4%
( 13
 
6.4%
& 7
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1439
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 180
 
12.5%
i 151
 
10.5%
132
 
9.2%
e 98
 
6.8%
s 81
 
5.6%
o 68
 
4.7%
n 65
 
4.5%
r 55
 
3.8%
m 53
 
3.7%
u 51
 
3.5%
Other values (43) 505
35.1%

국내명
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-12T18:24:25.038610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.2653061
Min length2

Characters and Unicode

Total characters209
Distinct characters118
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st row섬시호
2nd row조도만두나무
3rd row제주고사리삼
4th row섬노루귀
5th row금강인가목
ValueCountFrequency (%)
섬시호 1
 
2.0%
세뿔투구꽃 1
 
2.0%
참배암차즈기 1
 
2.0%
개회나무 1
 
2.0%
백부자 1
 
2.0%
홀아비바람꽃 1
 
2.0%
솜다리 1
 
2.0%
광릉골무꽃 1
 
2.0%
매미꽃 1
 
2.0%
벌개미취 1
 
2.0%
Other values (39) 39
79.6%
2023-12-12T18:24:25.480806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12
 
5.7%
11
 
5.3%
11
 
5.3%
8
 
3.8%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
3
 
1.4%
3
 
1.4%
Other values (108) 141
67.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 209
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
5.7%
11
 
5.3%
11
 
5.3%
8
 
3.8%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
3
 
1.4%
3
 
1.4%
Other values (108) 141
67.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 209
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
5.7%
11
 
5.3%
11
 
5.3%
8
 
3.8%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
3
 
1.4%
3
 
1.4%
Other values (108) 141
67.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 209
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12
 
5.7%
11
 
5.3%
11
 
5.3%
8
 
3.8%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
3
 
1.4%
3
 
1.4%
Other values (108) 141
67.5%

멸종위기수준
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)12.2%
Missing0
Missing (%)0.0%
Memory size524.0 B
LC
19 
EN
15 
CR
VU
DD

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)2.0%

Sample

1st rowCR
2nd rowCR
3rd rowCR
4th rowCR
5th rowCR

Common Values

ValueCountFrequency (%)
LC 19
38.8%
EN 15
30.6%
CR 7
 
14.3%
VU 5
 
10.2%
DD 2
 
4.1%
NT 1
 
2.0%

Length

2023-12-12T18:24:25.636443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:24:25.751461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
lc 19
38.8%
en 15
30.6%
cr 7
 
14.3%
vu 5
 
10.2%
dd 2
 
4.1%
nt 1
 
2.0%

평가시 세부항목
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size524.0 B
<NA>
22 
B2ab(iii)
B1ab(iii)
B1ab(iii)+2ab(iii)
B2ab(iii,v)
Other values (9)
11 

Length

Max length22
Median length21
Mean length8.4897959
Min length2

Unique

Unique8 ?
Unique (%)16.3%

Sample

1st rowB1ab(iii)
2nd rowB1ab(iii,v)+2ab(iii,v)
3rd rowB1ab(iii)+2ab(iii)
4th rowB1ab(iii)
5th rowB1ab(iii)

Common Values

ValueCountFrequency (%)
<NA> 22
44.9%
B2ab(iii) 5
 
10.2%
B1ab(iii) 4
 
8.2%
B1ab(iii)+2ab(iii) 4
 
8.2%
B2ab(iii,v) 3
 
6.1%
D2 3
 
6.1%
B1ab(iii,v)+2ab(iii,v) 1
 
2.0%
B2ab(iii,v); C2a(i) 1
 
2.0%
B2ab(i,iii,iv) 1
 
2.0%
B2ab(ii,iii,v) 1
 
2.0%
Other values (4) 4
 
8.2%

Length

2023-12-12T18:24:25.930866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 22
43.1%
b2ab(iii 5
 
9.8%
b1ab(iii 4
 
7.8%
b1ab(iii)+2ab(iii 4
 
7.8%
b2ab(iii,v 4
 
7.8%
d2 3
 
5.9%
b1ab(iii,v)+2ab(iii,v 1
 
2.0%
c2a(i 1
 
2.0%
b2ab(i,iii,iv 1
 
2.0%
b2ab(ii,iii,v 1
 
2.0%
Other values (5) 5
 
9.8%

등록년도
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
2016
33 
2017
10 
2018

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2016
2nd row2016
3rd row2016
4th row2016
5th row2016

Common Values

ValueCountFrequency (%)
2016 33
67.3%
2017 10
 
20.4%
2018 6
 
12.2%

Length

2023-12-12T18:24:26.098970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:24:26.199734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2016 33
67.3%
2017 10
 
20.4%
2018 6
 
12.2%

종류
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
초본
29 
목본
20 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row초본
2nd row목본
3rd row초본
4th row초본
5th row목본

Common Values

ValueCountFrequency (%)
초본 29
59.2%
목본 20
40.8%

Length

2023-12-12T18:24:26.319769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:24:26.430902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
초본 29
59.2%
목본 20
40.8%

천연기념물
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)50.0%
Missing47
Missing (%)95.9%
Memory size524.0 B
2023-12-12T18:24:26.490869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters2
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
ValueCountFrequency (%)
2
100.0%
2023-12-12T18:24:26.690002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
100.0%

Most occurring categories

ValueCountFrequency (%)
Other Symbol 2
100.0%

Most frequent character per category

Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Geometric Shapes 2
100.0%

Most frequent character per block

Geometric Shapes
ValueCountFrequency (%)
2
100.0%

멸종위기식물(2017년12월)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
<NA>
43 
2급

Length

Max length4
Median length4
Mean length3.755102
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2급
2nd row<NA>
3rd row2급
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 43
87.8%
2급 6
 
12.2%

Length

2023-12-12T18:24:26.840846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:24:26.940115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 43
87.8%
2급 6
 
12.2%

Interactions

2023-12-12T18:24:23.372413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:24:26.999046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번학명국내명멸종위기수준평가시 세부항목등록년도종류
순번1.0001.0001.0000.8780.6850.9250.285
학명1.0001.0001.0001.0001.0001.0001.000
국내명1.0001.0001.0001.0001.0001.0001.000
멸종위기수준0.8781.0001.0001.0000.9130.7930.000
평가시 세부항목0.6851.0001.0000.9131.0000.5550.000
등록년도0.9251.0001.0000.7930.5551.0000.000
종류0.2851.0001.0000.0000.0000.0001.000
2023-12-12T18:24:27.116224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
멸종위기식물(2017년12월)평가시 세부항목종류등록년도멸종위기수준
멸종위기식물(2017년12월)1.0001.0001.0001.0001.000
평가시 세부항목1.0001.0000.0000.3730.639
종류1.0000.0001.0000.0000.000
등록년도1.0000.3730.0001.0000.456
멸종위기수준1.0000.6390.0000.4561.000
2023-12-12T18:24:27.227370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번멸종위기수준평가시 세부항목등록년도종류멸종위기식물(2017년12월)
순번1.0000.7160.4230.8260.0761.000
멸종위기수준0.7161.0000.6390.4560.0001.000
평가시 세부항목0.4230.6391.0000.3730.0001.000
등록년도0.8260.4560.3731.0000.0001.000
종류0.0760.0000.0000.0001.0001.000
멸종위기식물(2017년12월)1.0001.0001.0001.0001.0001.000

Missing values

2023-12-12T18:24:23.499607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:24:23.648357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번학명국내명멸종위기수준평가시 세부항목등록년도종류천연기념물멸종위기식물(2017년12월)
01Bupleurum latissimum Nakai섬시호CRB1ab(iii)2016초본<NA>2급
12Glochidion chodoense J.S.Lee & Im조도만두나무CRB1ab(iii,v)+2ab(iii,v)2016목본<NA><NA>
23Mankyua chejuense B.Y.Sun & M.H.Kim & C.H.Kim제주고사리삼CRB1ab(iii)+2ab(iii)2016초본<NA>2급
34Anemone maxima Nakai섬노루귀CRB1ab(iii)2016초본<NA><NA>
45Pentactina rupicola Nakai금강인가목CRB1ab(iii)2016목본<NA><NA>
56Bupleurum euphorbioides Nakai등대시호ENB2ab(iii)2016초본<NA><NA>
67Gymnospermium microrrhynchum (S.Moore) Takht.한계령풀ENB2ab(iii,v)2016초본<NA><NA>
78Hanabusaya asiatica (Nakai) Nakai금강초롱꽃ENB2ab(iii,v); C2a(i)2016초본<NA><NA>
89Sophora koreensis Nakai개느삼ENB1ab(iii)+2ab(iii)2016목본<NA>
910Corylopsis coreana Uyeki히어리ENB2ab(iii)2016목본<NA><NA>
순번학명국내명멸종위기수준평가시 세부항목등록년도종류천연기념물멸종위기식물(2017년12월)
3940Lonicera subsessilis Rehder청괴불나무LC<NA>2017목본<NA><NA>
4041Lespedeza maritima Nakai해변싸리LC<NA>2017목본<NA><NA>
4142Clematis trichotoma Nakai할미밀망LC<NA>2017목본<NA><NA>
4243Patrinia saniculifolia Hemsley금마타리LC<NA>2017초본<NA><NA>
4344Codonopsis minima Nakai애기더덕CRB1ab(ⅲ,ⅴ)+2ab(ⅲ,ⅴ): D2018초본<NA><NA>
4445Rhamnus taquetii (H.Lev. & Vaniot) H.Lev.좀갈매나무CRB1ab(iii)2018목본<NA><NA>
4546Ajuga spectabilis Nakai자란초LC<NA>2018초본<NA><NA>
4647Dystaenia takesimana (Nakai) Kitag섬바디LC<NA>2018초본<NA><NA>
4748Clematis brachyura Maxim.외대으아리LC<NA>2018목본<NA><NA>
4849Prunus takesimensis Nakai섬벚나무LC<NA>2018목본<NA><NA>