Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 701 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 3 |
Duplicate rows (%) | 0.4% |
Total size in memory | 38.5 KiB |
Average record size in memory | 56.2 B |
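The derived figures in this table follow from the raw counts. A minimal sketch of the arithmetic, assuming percentages are rounded to one decimal place and that 1 KiB is 1024 bytes (the variable names are illustrative):

```python
# Raw figures copied from the "Dataset statistics" table.
n_rows = 701
n_duplicate_rows = 3
total_size_kib = 38.5

# Share of duplicate rows, formatted to one decimal place.
duplicate_pct = f"{n_duplicate_rows / n_rows:.1%}"

# Average record size in bytes, assuming 1 KiB = 1024 B.
avg_record_bytes = round(total_size_kib * 1024 / n_rows, 1)

print(duplicate_pct, avg_record_bytes)
```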
Variable types
Text | 2 |
---|---|
Categorical | 5 |
Dataset
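Description | Data on rare, endangered, endemic, and naturalized plants of the Baekdudaegan region, including Korean plant names, scientific names, and classification information. |
---|---|
Author | Korea Forest Service (산림청) |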
URL | https://www.data.go.kr/data/15093672/fileData.do |
Dataset has 3 (0.4%) duplicate rows | Duplicates |
1급멸종위기식물 분류 is highly overall correlated with 희귀식물 분류 | High correlation |
희귀식물 분류 is highly overall correlated with 1급멸종위기식물 분류 and 2 other fields | High correlation |
특산식물 분류 is highly overall correlated with 희귀식물 분류 and 1 other field | High correlation |
2급멸종위기식물 분류 is highly overall correlated with 희귀식물 분류 and 1 other field | High correlation |
1급멸종위기식물 분류 is highly imbalanced (91.0%) | Imbalance |
2급멸종위기식물 분류 is highly imbalanced (57.4%) | Imbalance |
귀화식물 분류 is highly imbalanced (53.6%) | Imbalance |
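The imbalance percentages in these alerts are consistent with one minus the normalized Shannon entropy of the class counts. The sketch below is a reconstruction under that assumption, not ydata-profiling's own code; the function name `imbalance_score` is hypothetical, and the counts are taken from the Common Values tables in this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single-class column is not scored
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Class counts per column: ("<NA>" count, label count).
class_counts = {
    "1급멸종위기식물 분류": [693, 8],
    "2급멸종위기식물 분류": [640, 61],
    "귀화식물 분류": [632, 69],
}
scores = {name: imbalance_score(c) for name, c in class_counts.items()}
for name, score in scores.items():
    print(f"{name}: {score:.1%}")
```

Under this assumption the three scores round to the 91.0%, 57.4%, and 53.6% shown in the alerts.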
Reproduction
Analysis started | 2023-12-12 22:00:08.612759 |
---|---|
Analysis finished | 2023-12-12 22:00:09.167959 |
Duration | 0.56 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
식물국문명
Text
Distinct | 671 |
---|---|
Distinct (%) | 95.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
Value | Count | Frequency (%) |
흰바디나물 | 3 | 0.4% |
섬초롱꽃 | 3 | 0.4% |
흰바디 | 3 | 0.4% |
섬쥐똥나무 | 2 | 0.3% |
섬백리향 | 2 | 0.3% |
흰솔나리 | 2 | 0.3% |
고산구슬봉이 | 2 | 0.3% |
털긴잎갈퀴 | 2 | 0.3% |
흰등괴불 | 2 | 0.3% |
기생꽃 | 2 | 0.3% |
Other values (661) | 680 |
Most occurring characters
Value | Count | Frequency (%) |
나 | 140 | 4.5% |
리 | 121 | 3.8% |
무 | 103 | 3.3% |
꽃 | 71 | 2.3% |
이 | 68 | 2.2% |
풀 | 68 | 2.2% |
초 | 60 | 1.9% |
개 | 55 | 1.7% |
섬 | 46 | 1.5% |
산 | 45 | 1.4% |
Other values (371) | 2366 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3135 | 99.7% |
Other Punctuation | 4 | 0.1% |
Space Separator | 2 | 0.1% |
Close Punctuation | 1 | < 0.1% |
Open Punctuation | 1 | < 0.1% |
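The category names in this table correspond to Unicode general categories. A minimal sketch using Python's standard `unicodedata` module, where the two-letter codes `Lo`, `Po`, `Zs`, `Ps`, and `Pe` stand for Other Letter, Other Punctuation, Space Separator, Open Punctuation, and Close Punctuation. The helper name `category_counts` and the three-name sample are illustrative, not the profiler's internals:

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in values:
        for ch in text:
            counts[unicodedata.category(ch)] += 1
    return counts

# Three plant names from the report; Hangul syllables fall in category "Lo".
names = ["흰바디나물", "섬초롱꽃", "기생꽃"]
counts = category_counts(names)
print(counts)
```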
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
나 | 140 | 4.5% |
리 | 121 | 3.9% |
무 | 103 | 3.3% |
꽃 | 71 | 2.3% |
이 | 68 | 2.2% |
풀 | 68 | 2.2% |
초 | 60 | 1.9% |
개 | 55 | 1.8% |
섬 | 46 | 1.5% |
산 | 45 | 1.4% |
Other values (367) | 2358 |
Other Punctuation
Value | Count | Frequency (%) |
' | 4 | 100.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 2 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 3135 | 99.7% |
Common | 8 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
나 | 140 | 4.5% |
리 | 121 | 3.9% |
무 | 103 | 3.3% |
꽃 | 71 | 2.3% |
이 | 68 | 2.2% |
풀 | 68 | 2.2% |
초 | 60 | 1.9% |
개 | 55 | 1.8% |
섬 | 46 | 1.5% |
산 | 45 | 1.4% |
Other values (367) | 2358 |
Common
Value | Count | Frequency (%) |
' | 4 | 50.0% |
(space) | 2 | 25.0% |
) | 1 | 12.5% |
( | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 3135 | 99.7% |
ASCII | 8 | 0.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
나 | 140 | 4.5% |
리 | 121 | 3.9% |
무 | 103 | 3.3% |
꽃 | 71 | 2.3% |
이 | 68 | 2.2% |
풀 | 68 | 2.2% |
초 | 60 | 1.9% |
개 | 55 | 1.8% |
섬 | 46 | 1.5% |
산 | 45 | 1.4% |
Other values (367) | 2358 |
ASCII
Value | Count | Frequency (%) |
' | 4 | 50.0% |
(space) | 2 | 25.0% |
) | 1 | 12.5% |
( | 1 | 12.5% |
식물학명
Text
Distinct | 610 |
---|---|
Distinct (%) | 87.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
Length
Max length | 71 |
---|---|
Median length | 53 |
Mean length | 31.673324 |
Min length | 12 |
Characters and Unicode
Total characters | 22203 |
---|---|
Distinct characters | 60 |
Distinct categories | 7 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 535 |
---|---|
Unique (%) | 76.3% |
Sample
1st row | Fagus engleriana Seemen ex Diels |
---|---|
2nd row | Serratula coronata var. insularis (Iljin) Kitam. for. insularis |
3rd row | Ligusticum tachiroei (Franch. & Sav.) M.Hiroe & Constance |
4th row | Aconitum koreanum R.Raymund |
5th row | Lysimachia coreana Nakai |
Value | Count | Frequency (%) |
nakai | 259 | 9.3% |
var | 123 | 4.4% |
(empty token) | 76 | 2.7% |
l | 72 | 2.6% |
h.lev | 39 | 1.4% |
maxim | 39 | 1.4% |
ohwi | 36 | 1.3% |
ex | 32 | 1.1% |
makino | 22 | 0.8% |
saussurea | 21 | 0.8% |
Other values (1065) | 2073 |
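The token counts above look consistent with lowercasing each name, splitting on whitespace, and stripping punctuation from both ends of every token. That would explain entries such as `h.lev` (inner period kept) and a blank token left behind by a bare `&`. The sketch below illustrates that assumed procedure on the five sample rows; `word_counts` is a hypothetical helper, not the profiler's own function.

```python
import string
from collections import Counter

# Characters removed from both ends of each token.
STRIP_CHARS = string.punctuation + string.whitespace

def word_counts(values):
    """Lowercase, split on whitespace, strip edge punctuation, then count."""
    counts = Counter()
    for text in values:
        for token in text.lower().split():
            counts[token.strip(STRIP_CHARS)] += 1
    return counts

sample = [
    "Fagus engleriana Seemen ex Diels",
    "Serratula coronata var. insularis (Iljin) Kitam. for. insularis",
    "Ligusticum tachiroei (Franch. & Sav.) M.Hiroe & Constance",
    "Aconitum koreanum R.Raymund",
    "Lysimachia coreana Nakai",
]
counts = word_counts(sample)
print(counts.most_common(3))
```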
Most occurring characters
Value | Count | Frequency (%) |
a | 2738 | 12.3% |
(space) | 2091 | 9.4% |
i | 1981 | 8.9% |
e | 1298 | 5.8% |
s | 1118 | 5.0% |
r | 1102 | 5.0% |
o | 1055 | 4.8% |
n | 1026 | 4.6% |
u | 928 | 4.2% |
. | 833 | 3.8% |
Other values (50) | 8033 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 16843 | 75.9% |
Space Separator | 2091 | 9.4% |
Uppercase Letter | 1992 | 9.0% |
Other Punctuation | 917 | 4.1% |
Open Punctuation | 176 | 0.8% |
Close Punctuation | 176 | 0.8% |
Dash Punctuation | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 2738 | 16.3% |
i | 1981 | 11.8% |
e | 1298 | 7.7% |
s | 1118 | 6.6% |
r | 1102 | 6.5% |
o | 1055 | 6.3% |
n | 1026 | 6.1% |
u | 928 | 5.5% |
l | 805 | 4.8% |
t | 666 | 4.0% |
Other values (16) | 4126 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 276 | 13.9% |
L | 204 | 10.2% |
S | 172 | 8.6% |
C | 133 | 6.7% |
M | 122 | 6.1% |
H | 121 | 6.1% |
P | 115 | 5.8% |
K | 110 | 5.5% |
A | 107 | 5.4% |
T | 90 | 4.5% |
Other values (16) | 542 |
Other Punctuation
Value | Count | Frequency (%) |
. | 833 | 90.8% |
& | 76 | 8.3% |
, | 4 | 0.4% |
' | 4 | 0.4% |
Space Separator
Value | Count | Frequency (%) |
(space) | 2091 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 176 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 176 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 8 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 18835 | 84.8% |
Common | 3368 | 15.2% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 2738 | 14.5% |
i | 1981 | 10.5% |
e | 1298 | 6.9% |
s | 1118 | 5.9% |
r | 1102 | 5.9% |
o | 1055 | 5.6% |
n | 1026 | 5.4% |
u | 928 | 4.9% |
l | 805 | 4.3% |
t | 666 | 3.5% |
Other values (42) | 6118 |
Common
Value | Count | Frequency (%) |
(space) | 2091 | 62.1% |
. | 833 | 24.7% |
( | 176 | 5.2% |
) | 176 | 5.2% |
& | 76 | 2.3% |
- | 8 | 0.2% |
, | 4 | 0.1% |
' | 4 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 22203 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 2738 | 12.3% |
(space) | 2091 | 9.4% |
i | 1981 | 8.9% |
e | 1298 | 5.8% |
s | 1118 | 5.0% |
r | 1102 | 5.0% |
o | 1055 | 4.8% |
n | 1026 | 4.6% |
u | 928 | 4.2% |
. | 833 | 3.8% |
Other values (50) | 8033 |
희귀식물 분류
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
<NA> | 443 |
---|---|
희귀식물 | 258 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 희귀식물 |
---|---|
2nd row | <NA> |
3rd row | 희귀식물 |
4th row | 희귀식물 |
5th row | 희귀식물 |
Common Values
Value | Count | Frequency (%) |
<NA> | 443 | 63.2% |
희귀식물 | 258 | 36.8% |
1급멸종위기식물 분류
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
<NA> | 693 |
---|---|
1급멸종위기식물 | 8 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.0456491 |
Min length | 4 |
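The mean length reported here is what results when the placeholder `<NA>` is counted as an ordinary four-character string, which also suggests why the column shows no missing values. A quick check, assuming the value counts given for this column (693 and 8):

```python
# Value counts for this column, taken from the report.
value_counts = {"<NA>": 693, "1급멸종위기식물": 8}

total = sum(value_counts.values())

# Weighted mean of string lengths; "<NA>" contributes 4 characters per row.
mean_length = sum(len(value) * n for value, n in value_counts.items()) / total

print(total, round(mean_length, 7))
```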
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 693 | 98.9% |
1급멸종위기식물 | 8 | 1.1% |
2급멸종위기식물 분류
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
<NA> | 640 |
---|---|
2급멸종위기식물 | 61 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.3480742 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | 2급멸종위기식물 |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 640 | 91.3% |
2급멸종위기식물 | 61 | 8.7% |
특산식물 분류
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
특산식물 | 421 |
---|---|
<NA> | 280 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 특산식물 |
---|---|
2nd row | 특산식물 |
3rd row | <NA> |
4th row | <NA> |
5th row | 특산식물 |
Common Values
Value | Count | Frequency (%) |
특산식물 | 421 | 60.1% |
<NA> | 280 | 39.9% |
귀화식물 분류
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
<NA> | 632 |
---|---|
귀화식물 | 69 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 632 | 90.2% |
귀화식물 | 69 | 9.8% |
Variable | 1급멸종위기식물 분류 | 귀화식물 분류 | 희귀식물 분류 | 특산식물 분류 | 2급멸종위기식물 분류 |
---|---|---|---|---|---|
1급멸종위기식물 분류 | 1.000 | NaN | 1.000 | NaN | NaN |
귀화식물 분류 | NaN | 1.000 | NaN | NaN | NaN |
희귀식물 분류 | 1.000 | NaN | 1.000 | 1.000 | 1.000 |
특산식물 분류 | NaN | NaN | 1.000 | 1.000 | 1.000 |
2급멸종위기식물 분류 | NaN | NaN | 1.000 | 1.000 | 1.000 |
Variable | 희귀식물 분류 | 1급멸종위기식물 분류 | 2급멸종위기식물 분류 | 특산식물 분류 | 귀화식물 분류 |
---|---|---|---|---|---|
희귀식물 분류 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
1급멸종위기식물 분류 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 |
2급멸종위기식물 분류 | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 |
특산식물 분류 | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 |
귀화식물 분류 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
First rows
Row | 식물국문명 | 식물학명 | 희귀식물 분류 | 1급멸종위기식물 분류 | 2급멸종위기식물 분류 | 특산식물 분류 | 귀화식물 분류 |
---|---|---|---|---|---|---|---|
0 | 너도밤나무 | Fagus engleriana Seemen ex Diels | 희귀식물 | <NA> | <NA> | 특산식물 | <NA> |
1 | 산비장이 | Serratula coronata var. insularis (Iljin) Kitam. for. insularis | <NA> | <NA> | <NA> | 특산식물 | <NA> |
2 | 개회향 | Ligusticum tachiroei (Franch. & Sav.) M.Hiroe & Constance | 희귀식물 | <NA> | <NA> | <NA> | <NA> |
3 | 백부자 | Aconitum koreanum R.Raymund | 희귀식물 | <NA> | 2급멸종위기식물 | <NA> | <NA> |
4 | 참좁쌀풀 | Lysimachia coreana Nakai | 희귀식물 | <NA> | <NA> | 특산식물 | <NA> |
5 | 섬피나무 | Tilia insularis Nakai | <NA> | <NA> | <NA> | 특산식물 | <NA> |
6 | 섬거복꼬리 | Boehmeria taquetii Nakai | <NA> | <NA> | <NA> | 특산식물 | <NA> |
7 | 기생초 | Coreopsis tinctoria Nutt. | <NA> | <NA> | <NA> | <NA> | 귀화식물 |
8 | 흰정향나무 | Syringa patula var. kamibayshii for. lactea (Nakai) K.Kim | <NA> | <NA> | <NA> | 특산식물 | <NA> |
9 | 꽃잔대 | Adenophora koreana Kitam. | <NA> | <NA> | <NA> | 특산식물 | <NA> |
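Each classification column holds either its own label or the literal text `<NA>`, so the five columns behave like boolean flags. The following sketch converts a row to flags using only the standard library. It assumes rows are available as dictionaries (for example from `csv.DictReader`), and `to_flags` is a hypothetical helper, not part of the dataset or of ydata-profiling.

```python
FLAG_COLUMNS = [
    "희귀식물 분류",
    "1급멸종위기식물 분류",
    "2급멸종위기식물 분류",
    "특산식물 분류",
    "귀화식물 분류",
]

def to_flags(row, placeholder="<NA>"):
    """Map each classification column to True when it holds a real label."""
    return {col: row[col] != placeholder for col in FLAG_COLUMNS}

# Row 3 of the rows listed above (백부자).
row = {
    "식물국문명": "백부자",
    "식물학명": "Aconitum koreanum R.Raymund",
    "희귀식물 분류": "희귀식물",
    "1급멸종위기식물 분류": "<NA>",
    "2급멸종위기식물 분류": "2급멸종위기식물",
    "특산식물 분류": "<NA>",
    "귀화식물 분류": "<NA>",
}
flags = to_flags(row)
print(flags)
```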
Last rows
Row | 식물국문명 | 식물학명 | 희귀식물 분류 | 1급멸종위기식물 분류 | 2급멸종위기식물 분류 | 특산식물 분류 | 귀화식물 분류 |
---|---|---|---|---|---|---|---|
691 | 두루미천남성 | Arisaema heterophyllum Blume | 희귀식물 | <NA> | <NA> | <NA> | <NA> |
692 | 애기이삭사초 | Carex ochrochlamis Ohwi | <NA> | <NA> | <NA> | 특산식물 | <NA> |
693 | 갈퀴아재비 | Asperula lasiantha Nakai | <NA> | <NA> | <NA> | 특산식물 | <NA> |
694 | 박달목서 | Osmanthus insularis Koidz. | 희귀식물 | <NA> | 2급멸종위기식물 | <NA> | <NA> |
695 | 흰등괴불 | Lonicera maximowiczii var. latifolia (Ohwi) Hara | <NA> | <NA> | <NA> | 특산식물 | <NA> |
696 | 구상나무 | Abies koreana Wilson | 희귀식물 | <NA> | <NA> | 특산식물 | <NA> |
697 | 주걱개망초 | Erigeron strigosus Muhl. | <NA> | <NA> | <NA> | <NA> | 귀화식물 |
698 | 한라분취 | Saussurea maximowiczii var. triceps (H.Lev. & Vaniot) Kitam. | <NA> | <NA> | <NA> | 특산식물 | <NA> |
699 | 왕제비꽃 | Viola websteri Hemsl. | 희귀식물 | <NA> | 2급멸종위기식물 | <NA> | <NA> |
700 | 강활 | ostericum praeteritum | <NA> | <NA> | <NA> | 특산식물 | <NA> |
Most frequently occurring
Row | 식물국문명 | 식물학명 | 희귀식물 분류 | 1급멸종위기식물 분류 | 2급멸종위기식물 분류 | 특산식물 분류 | 귀화식물 분류 | # duplicates |
---|---|---|---|---|---|---|---|---|
0 | 개서나무 | Carpinus tschonoskii Maxim. var. tschonoskii | 희귀식물 | <NA> | <NA> | <NA> | <NA> | 2 |
1 | 서나무 | Carpinus laxiflora (Siebold & Zucc.) Blume var. laxiflora | <NA> | <NA> | <NA> | 특산식물 | <NA> | 2 |
2 | 털긴잎갈퀴 | Galium boreale var. koreanum Nakai | <NA> | <NA> | <NA> | 특산식물 | <NA> | 2 |
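A duplicate table like this one can be derived by counting identical rows. A minimal sketch with `collections.Counter`, assuming each row is represented as a tuple of its seven column values. The three-row input is illustrative: it repeats one entry from the table above and adds one unrelated row from the report, and `duplicate_rows` is a hypothetical helper.

```python
from collections import Counter

def duplicate_rows(rows):
    """Return {row: occurrences} for rows that appear more than once."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}

NA = "<NA>"
rows = [
    ("털긴잎갈퀴", "Galium boreale var. koreanum Nakai", NA, NA, NA, "특산식물", NA),
    ("강활", "ostericum praeteritum", NA, NA, NA, "특산식물", NA),
    ("털긴잎갈퀴", "Galium boreale var. koreanum Nakai", NA, NA, NA, "특산식물", NA),
]
dups = duplicate_rows(rows)
print(dups)
```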