Overview

Dataset statistics

Number of variables7
Number of observations267
Missing cells221
Missing cells (%)11.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.7 KiB
Average record size in memory56.5 B

Variable types

Categorical4
Text2
Boolean1

Dataset

Description야생생물 보호 및 관리에 관한 법률에 따라 효과적인 보호를 위하여 야생생물을 대상으로 환경부가 지정 보호하는 생물로 한국의 멸종위기종과 관련하여 분류군, 국명, 학명, 등급 정보 관련 정보를 제공하고 있습니다.
Author환경부 국립생물자원관
URLhttps://www.data.go.kr/data/3071040/fileData.do

Alerts

고유종 has constant value ""Constant
고유종 has 221 (82.8%) missing valuesMissing
국명 has unique valuesUnique
학명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:35:12.358261
Analysis finished2023-12-12 09:35:13.104431
Duration0.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

분류군
Categorical

Distinct9
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
식물
88 
조류
63 
무척추동물
32 
어류
27 
곤충류
26 
Other values (4)
31 

Length

Max length5
Median length2
Mean length2.6367041
Min length2

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row포유류
2nd row포유류
3rd row포유류
4th row포유류
5th row포유류

Common Values

ValueCountFrequency (%)
식물 88
33.0%
조류 63
23.6%
무척추동물 32
 
12.0%
어류 27
 
10.1%
곤충류 26
 
9.7%
포유류 20
 
7.5%
양서파충류 8
 
3.0%
해조류 2
 
0.7%
고등균류 1
 
0.4%

Length

2023-12-12T18:35:13.208655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:35:13.482904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
식물 88
33.0%
조류 63
23.6%
무척추동물 32
 
12.0%
어류 27
 
10.1%
곤충류 26
 
9.7%
포유류 20
 
7.5%
양서파충류 8
 
3.0%
해조류 2
 
0.7%
고등균류 1
 
0.4%

등급
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
II
207 
I
60 

Length

Max length2
Median length2
Mean length1.7752809
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
II 207
77.5%
I 60
 
22.5%

Length

2023-12-12T18:35:13.724517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:35:13.881887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ii 207
77.5%
i 60
 
22.5%

국명
Text

UNIQUE 

Distinct267
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T18:35:14.226875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length8
Mean length4.5355805
Min length1

Characters and Unicode

Total characters1211
Distinct characters313
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique267 ?
Unique (%)100.0%

Sample

1st row늑대
2nd row대륙사슴
3rd row반달가슴곰
4th row붉은박쥐
5th row사향노루
ValueCountFrequency (%)
늑대 1
 
0.4%
죽백란 1
 
0.4%
나도승마 1
 
0.4%
착생깃산호 1
 
0.4%
참달팽이 1
 
0.4%
측맵시산호 1
 
0.4%
칼세오리옆새우 1
 
0.4%
해송 1
 
0.4%
흰발농게 1
 
0.4%
흰수지맨드라미 1
 
0.4%
Other values (257) 257
96.3%
2023-12-12T18:35:14.802181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
57
 
4.7%
25
 
2.1%
25
 
2.1%
25
 
2.1%
24
 
2.0%
24
 
2.0%
22
 
1.8%
18
 
1.5%
18
 
1.5%
18
 
1.5%
Other values (303) 955
78.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1211
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
4.7%
25
 
2.1%
25
 
2.1%
25
 
2.1%
24
 
2.0%
24
 
2.0%
22
 
1.8%
18
 
1.5%
18
 
1.5%
18
 
1.5%
Other values (303) 955
78.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1211
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
4.7%
25
 
2.1%
25
 
2.1%
25
 
2.1%
24
 
2.0%
24
 
2.0%
22
 
1.8%
18
 
1.5%
18
 
1.5%
18
 
1.5%
Other values (303) 955
78.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1211
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
57
 
4.7%
25
 
2.1%
25
 
2.1%
25
 
2.1%
24
 
2.0%
24
 
2.0%
22
 
1.8%
18
 
1.5%
18
 
1.5%
18
 
1.5%
Other values (303) 955
78.9%

학명
Text

UNIQUE 

Distinct267
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T18:35:15.266441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length31
Mean length19.749064
Min length10

Characters and Unicode

Total characters5273
Distinct characters51
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique267 ?
Unique (%)100.0%

Sample

1st rowCanis lupus coreanus
2nd rowCervus nippon hortulorum
3rd rowUrsus thibetanus ussuricus
4th rowMyotis rufoniger
5th rowMoschus moschiferus
ValueCountFrequency (%)
var 10
 
1.8%
grus 5
 
0.9%
dendronephthya 5
 
0.9%
japonica 5
 
0.9%
japonicus 5
 
0.9%
accipiter 4
 
0.7%
chinensis 4
 
0.7%
cygnus 4
 
0.7%
iris 4
 
0.7%
aquila 3
 
0.5%
Other values (463) 519
91.4%
2023-12-12T18:35:15.871991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 561
 
10.6%
i 471
 
8.9%
s 394
 
7.5%
377
 
7.1%
e 340
 
6.4%
r 328
 
6.2%
o 324
 
6.1%
u 320
 
6.1%
n 297
 
5.6%
l 240
 
4.6%
Other values (41) 1621
30.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4617
87.6%
Space Separator 377
 
7.1%
Uppercase Letter 267
 
5.1%
Other Punctuation 11
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 561
12.2%
i 471
10.2%
s 394
 
8.5%
e 340
 
7.4%
r 328
 
7.1%
o 324
 
7.0%
u 320
 
6.9%
n 297
 
6.4%
l 240
 
5.2%
t 223
 
4.8%
Other values (16) 1119
24.2%
Uppercase Letter
ValueCountFrequency (%)
C 43
16.1%
P 29
10.9%
A 25
9.4%
L 20
 
7.5%
G 17
 
6.4%
M 17
 
6.4%
E 16
 
6.0%
D 16
 
6.0%
S 13
 
4.9%
H 11
 
4.1%
Other values (12) 60
22.5%
Space Separator
ValueCountFrequency (%)
377
100.0%
Other Punctuation
ValueCountFrequency (%)
. 11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4884
92.6%
Common 389
 
7.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 561
11.5%
i 471
 
9.6%
s 394
 
8.1%
e 340
 
7.0%
r 328
 
6.7%
o 324
 
6.6%
u 320
 
6.6%
n 297
 
6.1%
l 240
 
4.9%
t 223
 
4.6%
Other values (38) 1386
28.4%
Common
ValueCountFrequency (%)
377
96.9%
. 11
 
2.8%
- 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5273
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 561
 
10.6%
i 471
 
8.9%
s 394
 
7.5%
377
 
7.1%
e 340
 
6.4%
r 328
 
6.2%
o 324
 
6.1%
u 320
 
6.1%
n 297
 
5.6%
l 240
 
4.6%
Other values (41) 1621
30.7%

고유종
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)2.2%
Missing221
Missing (%)82.8%
Memory size666.0 B
True
46 
(Missing)
221 
ValueCountFrequency (%)
True 46
 
17.2%
(Missing) 221
82.8%
2023-12-12T18:35:16.020449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct8
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
VU
99 
EN
78 
CR
34 
<NA>
30 
RE
 
9
Other values (3)
17 

Length

Max length4
Median length2
Mean length2.2247191
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowRE
2nd rowRE
3rd rowEN
4th rowVU
5th rowCR

Common Values

ValueCountFrequency (%)
VU 99
37.1%
EN 78
29.2%
CR 34
 
12.7%
<NA> 30
 
11.2%
RE 9
 
3.4%
LC 8
 
3.0%
NE 5
 
1.9%
NT 4
 
1.5%

Length

2023-12-12T18:35:16.150084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:35:16.342953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
vu 99
37.1%
en 78
29.2%
cr 34
 
12.7%
na 30
 
11.2%
re 9
 
3.4%
lc 8
 
3.0%
ne 5
 
1.9%
nt 4
 
1.5%
Distinct7
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
<NA>
151 
LC
63 
VU
24 
EN
16 
NT
 
7
Other values (2)
 
6

Length

Max length4
Median length4
Mean length3.1310861
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowLC
2nd rowLC
3rd rowVU
4th rowLC
5th rowVU

Common Values

ValueCountFrequency (%)
<NA> 151
56.6%
LC 63
23.6%
VU 24
 
9.0%
EN 16
 
6.0%
NT 7
 
2.6%
CR 3
 
1.1%
DD 3
 
1.1%

Length

2023-12-12T18:35:16.581170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:35:16.739162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 151
56.6%
lc 63
23.6%
vu 24
 
9.0%
en 16
 
6.0%
nt 7
 
2.6%
cr 3
 
1.1%
dd 3
 
1.1%

Correlations

2023-12-12T18:35:16.849375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분류군등급국가적색목록세계자연보전연맹
분류군1.0000.2860.5110.576
등급0.2861.0000.3460.118
국가적색목록0.5110.3461.0000.395
세계자연보전연맹0.5760.1180.3951.000
2023-12-12T18:35:16.977878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세계자연보전연맹등급국가적색목록분류군
세계자연보전연맹1.0000.0810.1510.388
등급0.0811.0000.3660.281
국가적색목록0.1510.3661.0000.199
분류군0.3880.2810.1991.000
2023-12-12T18:35:17.126561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분류군등급국가적색목록세계자연보전연맹
분류군1.0000.2810.1990.388
등급0.2811.0000.3660.081
국가적색목록0.1990.3661.0000.151
세계자연보전연맹0.3880.0810.1511.000

Missing values

2023-12-12T18:35:12.895038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:35:13.055316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

분류군등급국명학명고유종국가적색목록세계자연보전연맹
0포유류I늑대Canis lupus coreanus<NA>RELC
1포유류I대륙사슴Cervus nippon hortulorum<NA>RELC
2포유류I반달가슴곰Ursus thibetanus ussuricus<NA>ENVU
3포유류I붉은박쥐Myotis rufoniger<NA>VULC
4포유류I사향노루Moschus moschiferus<NA>CRVU
5포유류I산양Naemorhedus caudatus<NA>VUVU
6포유류I수달Lutra lutra<NA>VUNT
7포유류I스라소니Lynx lynx<NA>RELC
8포유류I여우Vulpes vulpes peculiosa<NA>ENLC
9포유류I작은관코박쥐Murina ussuriensis<NA>ENLC
분류군등급국명학명고유종국가적색목록세계자연보전연맹
257식물II피뿌리풀Stellera chamaejasme<NA>NE<NA>
258식물II한라송이풀Pedicularis hallaisanensisYEN<NA>
259식물II한라옥잠난초Liparis auriculata<NA>EN<NA>
260식물II해오라비난초Habenaria radiata<NA>EN<NA>
261식물II혹난초Bulbophyllum inconspicuum<NA>VU<NA>
262식물II홍월귤Arctous alpinus var. japonicus<NA>VU<NA>
263식물II황근Hibiscus hamabo<NA>VU<NA>
264해조류II그물공말Dictyosphaeria cavernosa<NA><NA><NA>
265해조류II삼나무말Coccophora langsdorfii<NA><NA><NA>
266고등균류II화경버섯Lampteromyces japonicus<NA><NA><NA>