Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 2222 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 52.2 KiB |
Average record size in memory | 24.1 B |
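The average record size can be cross-checked from the two figures above it. This is a minimal sketch, assuming that 1 KiB is 1024 bytes and that the average is simply total memory divided by the number of observations; the function and constant names are illustrative and not part of the report.

```python
# Figures copied from the Dataset statistics table.
TOTAL_SIZE_KIB = 52.2
N_OBSERVATIONS = 2222


def average_record_bytes(total_kib: float, n_rows: int) -> float:
    """Return the mean in-memory size of one row in bytes (1 KiB = 1024 B)."""
    return total_kib * 1024 / n_rows


avg = average_record_bytes(TOTAL_SIZE_KIB, N_OBSERVATIONS)
print(f"{avg:.1f} B")
```

Rounded to one decimal place, the result agrees with the 24.1 B shown in the table.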
Variable types
Categorical | 2 |
---|---|
Text | 1 |
Dataset
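Description | Designation status of regulated pests for plant quarantine, with data on quarantine status, classification, scientific name, common name, and so on. There are currently 2022 designated regulated pests. |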
---|---|
URL | https://www.data.go.kr/data/3055531/fileData.do |
Reproduction
Analysis started | 2023-12-12 22:08:04.929243 |
---|---|
Analysis finished | 2023-12-12 22:08:05.244793 |
Duration | 0.32 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
검역지위 (quarantine status)
Categorical
Alerts: High correlation, Imbalance
Distinct | 4 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 17.5 KiB |
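관리병해충 (controlled pest) | 2087 |
---|---|
금지병해충 (prohibited pest) | 77 |
규제비검역병해충 (regulated non-quarantine pest) | 51 |
금지병해충(매개충) (prohibited pest, vector insect) | 7 |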
Length
Max length | 10 |
---|---|
Median length | 5 |
Mean length | 5.0846085 |
Min length | 5 |
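The length figures can be reproduced from the four category labels and their row counts (the counts are the ones listed under Common Values). A small sketch, assuming length means the number of Unicode code points and that the labels are stored in precomposed (NFC) form; `length_stats` is an illustrative helper, not a ydata-profiling function.

```python
# Category labels and row counts copied from the Common Values table for 검역지위.
counts = {
    "관리병해충": 2087,
    "금지병해충": 77,
    "규제비검역병해충": 51,
    "금지병해충(매개충)": 7,
}


def length_stats(value_counts):
    """Return (min, max, mean) string length, with the mean weighted by row count."""
    total_rows = sum(value_counts.values())
    total_chars = sum(len(value) * n for value, n in value_counts.items())
    lengths = [len(value) for value in value_counts]
    return min(lengths), max(lengths), total_chars / total_rows


min_len, max_len, mean_len = length_stats(counts)
print(min_len, max_len, mean_len)
```

The weighted mean is 11298 characters over 2222 rows, which matches the reported mean length to the digits shown.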
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 금지병해충 |
---|---|
2nd row | 금지병해충 |
3rd row | 금지병해충 |
4th row | 금지병해충 |
5th row | 금지병해충 |
Common Values
Value | Count | Frequency (%) |
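관리병해충 (controlled pest) | 2087 | 93.9% |
금지병해충 (prohibited pest) | 77 | 3.5% |
규제비검역병해충 (regulated non-quarantine pest) | 51 | 2.3% |
금지병해충(매개충) (prohibited pest, vector insect) | 7 | 0.3% |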
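The Imbalance alert reflects how strongly a single value dominates the column. One common way to put a number on this is one minus the normalized Shannon entropy of the class frequencies. Whether the report uses exactly this formula and which threshold triggers the alert are assumptions here, and `imbalance_score` is an illustrative name.

```python
import math

# Row counts of the four 검역지위 values, copied from the Common Values table.
counts = [2087, 77, 51, 7]


def imbalance_score(class_counts):
    """Return 1 - H / H_max: 0 for a uniform column, close to 1 for a dominated one."""
    nonzero = [c for c in class_counts if c > 0]
    if len(nonzero) < 2:
        raise ValueError("need at least two non-empty classes")
    total = sum(nonzero)
    probs = [c / total for c in nonzero]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(nonzero))


score = imbalance_score(counts)
print(round(score, 3))
```

For these counts the score comes out at about 0.8, far from the 0 that a perfectly balanced four-class column would give.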
분류 (classification)
Categorical
Alerts: High correlation, Imbalance
Distinct | 11 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 17.5 KiB |
곤충 (insect) | 1517 |
---|---|
진균 (fungus) | 348 |
바이러스 (virus) | 112 |
세균 (bacterium) | 62 |
응애 (mite) | 57 |
Other values (6) | 126 |
Length
Max length | 7 |
---|---|
Median length | 2 |
Mean length | 2.139964 |
Min length | 2 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 진균 |
---|---|
2nd row | 진균 |
3rd row | 진균 |
4th row | 진균 |
5th row | 진균 |
Common Values
Value | Count | Frequency (%) |
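곤충 (insect) | 1517 | 68.3% |
진균 (fungus) | 348 | 15.7% |
바이러스 (virus) | 112 | 5.0% |
세균 (bacterium) | 62 | 2.8% |
응애 (mite) | 57 | 2.6% |
잡초 (weed) | 46 | 2.1% |
선충 (nematode) | 41 | 1.8% |
달팽이 (snail) | 21 | 0.9% |
바이로이드 (viroid) | 10 | 0.5% |
곤충(매개충) (insect, vector) | 7 | 0.3% |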
학명 (scientific name)
Text
Distinct | 2221 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 17.5 KiB |
Length
Max length | 341 |
---|---|
Median length | 146 |
Mean length | 34.132763 |
Min length | 11 |
Characters and Unicode
Total characters | 75843 |
---|---|
Distinct characters | 83 |
Distinct categories | 10 |
Distinct scripts | 3 |
Distinct blocks | 3 |
Unique
Unique | 2220 |
---|---|
Unique (%) | 99.9% |
Sample
1st row | Balansia oryzae-sativae |
---|---|
2nd row | Cronartium coleosporioides |
3rd row | Peronospora tabacina |
4th row | Phytophthora ramorum |
5th row | Synchytrium endobioticum |
Words
Value | Count | Frequency (%) |
(blank) | 305 | 3.5% |
virus | 106 | 1.2% |
fabricius | 92 | 1.1% |
linnaeus | 64 | 0.7% |
walker | 51 | 0.6% |
l | 46 | 0.5% |
et | 39 | 0.4% |
say | 35 | 0.4% |
cockerell | 31 | 0.4% |
al | 30 | 0.3% |
Other values (4651) | 7957 | 90.9% |
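The tokens in this table are lowercase and carry no punctuation, which suggests the values were lowercased and split into runs of letters before counting. The sketch below reproduces that idea on three 학명 values taken from the last rows of the dataset. The exact tokenizer the report uses is not stated, so the regular expression and the name `word_counts` are assumptions.

```python
import re
from collections import Counter

# Three 학명 values copied from the last rows of the dataset.
names = [
    "Capsella bursa-pastoris (L.) Medik.",
    "Persicaria hydropiper (L.) Spach",
    "Rotala indica (Willd.) Koehne",
]


def word_counts(values):
    """Lowercase each string, keep runs of letters, and tally the tokens."""
    tokens = []
    for value in values:
        # [^\W\d_]+ matches one or more Unicode letters (no digits, no underscore).
        tokens.extend(re.findall(r"[^\W\d_]+", value.lower()))
    return Counter(tokens)


counts = word_counts(names)
print(counts.most_common(3))
```

Under this tokenization the author abbreviation "(L.)" contributes the token `l`, which is how a single letter can end up among the most frequent words.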
Most occurring characters
Value | Count | Frequency (%) |
a | 6648 | 8.8% |
(space) | 6578 | 8.7% |
i | 5665 | 7.5% |
e | 5205 | 6.9% |
s | 4717 | 6.2% |
r | 4674 | 6.2% |
o | 4390 | 5.8% |
l | 3600 | 4.7% |
n | 3468 | 4.6% |
t | 3124 | 4.1% |
Other values (73) | 27774 | 36.6% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 58769 | 77.5% |
Space Separator | 6578 | 8.7% |
Uppercase Letter | 5807 | 7.7% |
Other Punctuation | 1446 | 1.9% |
Open Punctuation | 1388 | 1.8% |
Close Punctuation | 1386 | 1.8% |
Decimal Number | 269 | 0.4% |
Math Symbol | 160 | 0.2% |
Dash Punctuation | 27 | < 0.1% |
Other Letter | 13 | < 0.1% |
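The category names in this table match the Unicode general categories (Lowercase Letter is `Ll`, Space Separator is `Zs`, Other Letter is `Lo`, and so on). A minimal sketch using Python's standard `unicodedata` module on the first sample value; that the report derives its categories the same way is an assumption, and `category_counts` is an illustrative name.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Tally the two-letter Unicode general category of every character."""
    return Counter(unicodedata.category(ch) for ch in text)


# First sample value of 학명 from the report.
counts = category_counts("Balansia oryzae-sativae")
print(dict(counts))
```

Hangul syllables such as 하 fall under `Lo` (Other Letter), which is why the handful of Korean characters in this column appear as a separate category.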
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 6648 | 11.3% |
i | 5665 | 9.6% |
e | 5205 | 8.9% |
s | 4717 | 8.0% |
r | 4674 | 8.0% |
o | 4390 | 7.5% |
l | 3600 | 6.1% |
n | 3468 | 5.9% |
t | 3124 | 5.3% |
u | 3114 | 5.3% |
Other values (17) | 14164 | 24.1% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 623 | 10.7% |
P | 528 | 9.1% |
S | 487 | 8.4% |
M | 424 | 7.3% |
B | 389 | 6.7% |
L | 346 | 6.0% |
D | 316 | 5.4% |
A | 309 | 5.3% |
H | 299 | 5.1% |
F | 275 | 4.7% |
Other values (16) | 1811 | 31.2% |
Decimal Number
Value | Count | Frequency (%) |
1 | 72 | 26.8% |
9 | 56 | 20.8% |
8 | 38 | 14.1% |
7 | 23 | 8.6% |
3 | 16 | 5.9% |
5 | 15 | 5.6% |
4 | 14 | 5.2% |
0 | 13 | 4.8% |
2 | 12 | 4.5% |
6 | 10 | 3.7% |
Other Letter
Value | Count | Frequency (%) |
외 | 2 | 15.4% |
군 | 2 | 15.4% |
제 | 2 | 15.4% |
단 | 2 | 15.4% |
하 | 1 | 7.7% |
위 | 1 | 7.7% |
류 | 1 | 7.7% |
포 | 1 | 7.7% |
함 | 1 | 7.7% |
Other Punctuation
Value | Count | Frequency (%) |
. | 993 | 68.7% |
& | 270 | 18.7% |
, | 114 | 7.9% |
? | 64 | 4.4% |
' | 3 | 0.2% |
: | 2 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
(space) | 6578 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1388 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1386 | 100.0% |
Math Symbol
Value | Count | Frequency (%) |
= | 160 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 27 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 64576 | 85.1% |
Common | 11254 | 14.8% |
Hangul | 13 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 6648 | 10.3% |
i | 5665 | 8.8% |
e | 5205 | 8.1% |
s | 4717 | 7.3% |
r | 4674 | 7.2% |
o | 4390 | 6.8% |
l | 3600 | 5.6% |
n | 3468 | 5.4% |
t | 3124 | 4.8% |
u | 3114 | 4.8% |
Other values (43) | 19971 | 30.9% |
Common
Value | Count | Frequency (%) |
(space) | 6578 | 58.5% |
( | 1388 | 12.3% |
) | 1386 | 12.3% |
. | 993 | 8.8% |
& | 270 | 2.4% |
= | 160 | 1.4% |
, | 114 | 1.0% |
1 | 72 | 0.6% |
? | 64 | 0.6% |
9 | 56 | 0.5% |
Other values (11) | 173 | 1.5% |
Hangul
Value | Count | Frequency (%) |
외 | 2 | 15.4% |
군 | 2 | 15.4% |
제 | 2 | 15.4% |
단 | 2 | 15.4% |
하 | 1 | 7.7% |
위 | 1 | 7.7% |
류 | 1 | 7.7% |
포 | 1 | 7.7% |
함 | 1 | 7.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 75829 | > 99.9% |
Hangul | 13 | < 0.1% |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 6648 | 8.8% |
(space) | 6578 | 8.7% |
i | 5665 | 7.5% |
e | 5205 | 6.9% |
s | 4717 | 6.2% |
r | 4674 | 6.2% |
o | 4390 | 5.8% |
l | 3600 | 4.7% |
n | 3468 | 4.6% |
t | 3124 | 4.1% |
Other values (63) | 27760 | 36.6% |
Hangul
Value | Count | Frequency (%) |
외 | 2 | 15.4% |
군 | 2 | 15.4% |
제 | 2 | 15.4% |
단 | 2 | 15.4% |
하 | 1 | 7.7% |
위 | 1 | 7.7% |
류 | 1 | 7.7% |
포 | 1 | 7.7% |
함 | 1 | 7.7% |
None
Value | Count | Frequency (%) |
ø | 1 | 100.0% |
Correlations
Variable | 검역지위 | 분류 |
---|---|---|
검역지위 | 1.000 | 0.779 |
분류 | 0.779 | 1.000 |
Variable | 분류 | 검역지위 |
---|---|---|
분류 | 1.000 | 0.603 |
검역지위 | 0.603 | 1.000 |
Variable | 검역지위 | 분류 |
---|---|---|
검역지위 | 1.000 | 0.603 |
분류 | 0.603 | 1.000 |
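Both variables are categorical, so the association between them has to come from a measure built on a contingency table rather than from a linear correlation. The report does not say which statistic each matrix holds. As one illustration, the sketch below computes Cramér's V (without bias correction) in plain Python. The 2x2 tables are hypothetical and are not taken from this dataset, and `cramers_v` is an illustrative helper.

```python
import math


def cramers_v(table):
    """Cramér's V for a contingency table given as a list of rows of counts.

    Assumes every row and every column has a non-zero total.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Hypothetical tables, for illustration only.
perfect = cramers_v([[10, 0], [0, 10]])     # each row maps to exactly one column
independent = cramers_v([[5, 5], [5, 5]])   # rows tell nothing about columns
print(perfect, independent)
```

Values range from 0 (no association) to 1 (one variable fully determines the other), the same scale as the off-diagonal entries above.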
First rows
Row | 검역지위 (quarantine status) | 분류 (classification) | 학명 (scientific name) |
---|---|---|---|
0 | 금지병해충 | 진균 | Balansia oryzae-sativae |
1 | 금지병해충 | 진균 | Cronartium coleosporioides |
2 | 금지병해충 | 진균 | Peronospora tabacina |
3 | 금지병해충 | 진균 | Phytophthora ramorum |
4 | 금지병해충 | 진균 | Synchytrium endobioticum |
5 | 금지병해충 | 세균 | Candidatus Liberibacter solanacearum |
6 | 금지병해충 | 세균 | Citrus huanglongbing(greening) disease |
7 | 금지병해충 | 세균 | Xylella fastidiosa |
8 | 금지병해충 | 세균 | Erwinia amylovora |
9 | 금지병해충 | 세균 | Apple proliferation phytoplasma |
Last rows
Row | 검역지위 (quarantine status) | 분류 (classification) | 학명 (scientific name) |
---|---|---|---|
2212 | 규제비검역병해충 | 잡초 | Aneilema keisak Hassk. |
2213 | 규제비검역병해충 | 잡초 | Capsella bursa-pastoris (L.) Medik. |
2214 | 규제비검역병해충 | 잡초 | Cruciferae family |
2215 | 규제비검역병해충 | 잡초 | Cuscuta spp. |
2216 | 규제비검역병해충 | 잡초 | Echinochloa crus-galli (하위군류군 포함) (L.) Beauv. |
2217 | 규제비검역병해충 | 잡초 | Echinochloa utilis Ohwi et Yabuno |
2218 | 규제비검역병해충 | 잡초 | Monochoria vaginalis (Burn. f.) Presl |
2219 | 규제비검역병해충 | 잡초 | Persicaria hydropiper (L.) Spach |
2220 | 규제비검역병해충 | 잡초 | Rotala indica (Willd.) Koehne |
2221 | 규제비검역병해충 | 잡초 | Schoenoplectiella juncoides (Roxb.) Lye |