Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 3899 |
Missing cells | 6218 |
Missing cells (%) | 26.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 186.7 KiB |
Average record size in memory | 49.0 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 1 |
Text | 4 |
Dataset
Description | 식물병해충 예찰조사 과정 및 연구사업 등에서 확보한 표본 정보의 리스트로, 종합적인 표본의 보존, 관리 및 활용을 목적으로 한다. |
---|---|
Author | 농림축산식품부 농림축산검역본부 |
URL | https://www.data.go.kr/data/15107667/fileData.do |
Reproduction
Analysis started | 2023-12-12 07:36:45.058802 |
---|---|
Analysis finished | 2023-12-12 07:36:46.146820 |
Duration | 1.09 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
번호
Real number (ℝ)
UNIQUE
 
Distinct | 3899 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1999.8248 |
Minimum | 1 |
---|---|
Maximum | 3951 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 34.4 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 247.9 |
Q1 | 1027.5 |
median | 2002 |
Q3 | 2976.5 |
95-th percentile | 3756.1 |
Maximum | 3951 |
Range | 3950 |
Interquartile range (IQR) | 1949 |
Descriptive statistics
Standard deviation | 1129.3412 |
---|---|
Coefficient of variation (CV) | 0.56472007 |
Kurtosis | -1.1868309 |
Mean | 1999.8248 |
Median Absolute Deviation (MAD) | 975 |
Skewness | -0.010492774 |
Sum | 7797317 |
Variance | 1275411.6 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
2673 | 1 | < 0.1% |
2645 | 1 | < 0.1% |
2646 | 1 | < 0.1% |
2647 | 1 | < 0.1% |
2648 | 1 | < 0.1% |
2649 | 1 | < 0.1% |
2650 | 1 | < 0.1% |
2651 | 1 | < 0.1% |
2652 | 1 | < 0.1% |
Other values (3889) | 3889 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
3951 | 1 | |
3950 | 1 | |
3949 | 1 | |
3948 | 1 | |
3947 | 1 | |
3946 | 1 | |
3945 | 1 | |
3944 | 1 | |
3943 | 1 | |
3942 | 1 |
목
Categorical
Distinct | 10 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 30.6 KiB |
Lepidoptera | |
---|---|
<NA> | |
Coleoptera | |
Hemiptera | 181 |
Hymenoptera | 122 |
Other values (5) | 23 |
Length
Max length | 11 |
---|---|
Median length | 11 |
Mean length | 9.2680174 |
Min length | 4 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | Hemiptera |
---|---|
2nd row | Hemiptera |
3rd row | Hemiptera |
4th row | Hemiptera |
5th row | Hemiptera |
Common Values
Value | Count | Frequency (%) |
Lepidoptera | 2291 | |
<NA> | 839 | 21.5% |
Coleoptera | 443 | 11.4% |
Hemiptera | 181 | 4.6% |
Hymenoptera | 122 | 3.1% |
Diptera | 15 | 0.4% |
Orthoptera | 5 | 0.1% |
Odonata | 1 | < 0.1% |
Neoptera | 1 | < 0.1% |
Isoptera | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
lepidoptera | 2291 | |
na | 839 | 21.5% |
coleoptera | 443 | 11.4% |
hemiptera | 181 | 4.6% |
hymenoptera | 122 | 3.1% |
diptera | 15 | 0.4% |
orthoptera | 5 | 0.1% |
odonata | 1 | < 0.1% |
neoptera | 1 | < 0.1% |
isoptera | 1 | < 0.1% |
과
Text
MISSING
 
Distinct | 83 |
---|---|
Distinct (%) | 3.8% |
Missing | 1699 |
Missing (%) | 43.6% |
Memory size | 30.6 KiB |
Length
Max length | 16 |
---|---|
Median length | 15 |
Mean length | 10.383182 |
Min length | 7 |
Characters and Unicode
Total characters | 22843 |
---|---|
Distinct characters | 38 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 23 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | Velliidae |
---|---|
2nd row | Lygaeidae |
3rd row | Lygaeidae |
4th row | Lygaeidae |
5th row | Lygaeidae |
Value | Count | Frequency (%) |
geometridae | 374 | |
noctuidae | 340 | |
curculionidae | 185 | |
tortricidae | 181 | |
erebidae | 154 | 7.0% |
formicidae | 120 | 5.5% |
arctiidae | 117 | 5.3% |
cerambycidae | 117 | 5.3% |
pyralidae | 103 | 4.7% |
notodontidae | 60 | 2.7% |
Other values (73) | 449 |
Most occurring characters
Value | Count | Frequency (%) |
e | 3424 | |
i | 3074 | |
a | 2707 | |
d | 2304 | |
r | 1757 | |
o | 1616 | |
t | 1287 | 5.6% |
c | 1169 | 5.1% |
m | 751 | 3.3% |
u | 751 | 3.3% |
Other values (28) | 4003 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 20645 | |
Uppercase Letter | 2198 | 9.6% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 3424 | |
i | 3074 | |
a | 2707 | |
d | 2304 | |
r | 1757 | |
o | 1616 | |
t | 1287 | 6.2% |
c | 1169 | 5.7% |
m | 751 | 3.6% |
u | 751 | 3.6% |
Other values (10) | 1805 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 414 | |
G | 388 | |
C | 378 | |
T | 221 | |
P | 176 | |
E | 163 | 7.4% |
A | 125 | 5.7% |
F | 122 | 5.6% |
L | 54 | 2.5% |
B | 50 | 2.3% |
Other values (8) | 107 | 4.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 22843 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 3424 | |
i | 3074 | |
a | 2707 | |
d | 2304 | |
r | 1757 | |
o | 1616 | |
t | 1287 | 5.6% |
c | 1169 | 5.1% |
m | 751 | 3.3% |
u | 751 | 3.3% |
Other values (28) | 4003 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 22843 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 3424 | |
i | 3074 | |
a | 2707 | |
d | 2304 | |
r | 1757 | |
o | 1616 | |
t | 1287 | 5.6% |
c | 1169 | 5.1% |
m | 751 | 3.3% |
u | 751 | 3.3% |
Other values (28) | 4003 |
속
Text
MISSING
 
Distinct | 538 |
---|---|
Distinct (%) | 24.4% |
Missing | 1698 |
Missing (%) | 43.5% |
Memory size | 30.6 KiB |
Length
Max length | 16 |
---|---|
Median length | 14 |
Mean length | 8.7955475 |
Min length | 3 |
Characters and Unicode
Total characters | 19359 |
---|---|
Distinct characters | 49 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 239 ? |
---|---|
Unique (%) | 10.9% |
Sample
1st row | Microvelia |
---|---|
2nd row | Dimorphopterus |
3rd row | Dimorphopterus |
4th row | Dimorphopterus |
5th row | Paradieuches |
Value | Count | Frequency (%) |
platypus | 167 | 7.6% |
monochamus | 86 | 3.9% |
athetis | 58 | 2.6% |
meteima | 48 | 2.2% |
papilio | 41 | 1.9% |
cydia | 34 | 1.5% |
zanclognatha | 30 | 1.4% |
chiasmia | 26 | 1.2% |
scopula | 26 | 1.2% |
tetramorium | 24 | 1.1% |
Other values (527) | 1661 |
Most occurring characters
Value | Count | Frequency (%) |
a | 2577 | |
o | 1597 | 8.2% |
i | 1569 | 8.1% |
t | 1333 | 6.9% |
s | 1198 | 6.2% |
r | 1166 | 6.0% |
e | 1121 | 5.8% |
p | 886 | 4.6% |
l | 835 | 4.3% |
h | 831 | 4.3% |
Other values (39) | 6246 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 17159 | |
Uppercase Letter | 2200 | 11.4% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 2577 | |
o | 1597 | 9.3% |
i | 1569 | 9.1% |
t | 1333 | 7.8% |
s | 1198 | 7.0% |
r | 1166 | 6.8% |
e | 1121 | 6.5% |
p | 886 | 5.2% |
l | 835 | 4.9% |
h | 831 | 4.8% |
Other values (15) | 4046 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 405 | |
A | 295 | |
M | 230 | |
C | 227 | |
S | 208 | |
E | 114 | 5.2% |
T | 99 | 4.5% |
H | 97 | 4.4% |
O | 68 | 3.1% |
D | 61 | 2.8% |
Other values (14) | 396 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 19359 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 2577 | |
o | 1597 | 8.2% |
i | 1569 | 8.1% |
t | 1333 | 6.9% |
s | 1198 | 6.2% |
r | 1166 | 6.0% |
e | 1121 | 5.8% |
p | 886 | 4.6% |
l | 835 | 4.3% |
h | 831 | 4.3% |
Other values (39) | 6246 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 19359 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 2577 | |
o | 1597 | 8.2% |
i | 1569 | 8.1% |
t | 1333 | 6.9% |
s | 1198 | 6.2% |
r | 1166 | 6.0% |
e | 1121 | 5.8% |
p | 886 | 4.6% |
l | 835 | 4.3% |
h | 831 | 4.3% |
Other values (39) | 6246 |
종
Text
MISSING
 
Distinct | 719 |
---|---|
Distinct (%) | 32.7% |
Missing | 1700 |
Missing (%) | 43.6% |
Memory size | 30.6 KiB |
Length
Max length | 21 |
---|---|
Median length | 18 |
Mean length | 8.990905 |
Min length | 2 |
Characters and Unicode
Total characters | 19771 |
---|---|
Distinct characters | 35 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 399 ? |
---|---|
Unique (%) | 18.1% |
Sample
1st row | horvathi |
---|---|
2nd row | pallipes |
3rd row | pallipes |
4th row | pallipes |
5th row | dissimils |
Value | Count | Frequency (%) |
koryoensis | 160 | 7.2% |
saltuarius | 56 | 2.5% |
mediorufa | 48 | 2.2% |
japonica | 34 | 1.5% |
alternatus | 29 | 1.3% |
japonicus | 29 | 1.3% |
tsushimae | 24 | 1.1% |
debilitata | 24 | 1.1% |
hebesata | 23 | 1.0% |
flava | 22 | 1.0% |
Other values (708) | 1766 |
Most occurring characters
Value | Count | Frequency (%) |
a | 2957 | |
i | 2275 | |
s | 1849 | |
e | 1481 | 7.5% |
r | 1357 | 6.9% |
n | 1272 | 6.4% |
t | 1263 | 6.4% |
o | 1137 | 5.8% |
l | 1083 | 5.5% |
u | 992 | 5.0% |
Other values (25) | 4105 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 19684 | |
Space Separator | 67 | 0.3% |
Uppercase Letter | 10 | 0.1% |
Open Punctuation | 4 | < 0.1% |
Close Punctuation | 4 | < 0.1% |
Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 2957 | |
i | 2275 | |
s | 1849 | |
e | 1481 | 7.5% |
r | 1357 | 6.9% |
n | 1272 | 6.5% |
t | 1263 | 6.4% |
o | 1137 | 5.8% |
l | 1083 | 5.5% |
u | 992 | 5.0% |
Other values (16) | 4018 |
Uppercase Letter
Value | Count | Frequency (%) |
F | 4 | |
E | 2 | |
D | 2 | |
X | 1 | 10.0% |
R | 1 | 10.0% |
Space Separator
Value | Count | Frequency (%) |
67 |
Open Punctuation
Value | Count | Frequency (%) |
( | 4 |
Close Punctuation
Value | Count | Frequency (%) |
) | 4 |
Other Punctuation
Value | Count | Frequency (%) |
. | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 19694 | |
Common | 77 | 0.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 2957 | |
i | 2275 | |
s | 1849 | |
e | 1481 | 7.5% |
r | 1357 | 6.9% |
n | 1272 | 6.5% |
t | 1263 | 6.4% |
o | 1137 | 5.8% |
l | 1083 | 5.5% |
u | 992 | 5.0% |
Other values (21) | 4028 |
Common
Value | Count | Frequency (%) |
67 | ||
( | 4 | 5.2% |
) | 4 | 5.2% |
. | 2 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 19771 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 2957 | |
i | 2275 | |
s | 1849 | |
e | 1481 | 7.5% |
r | 1357 | 6.9% |
n | 1272 | 6.4% |
t | 1263 | 6.4% |
o | 1137 | 5.8% |
l | 1083 | 5.5% |
u | 992 | 5.0% |
Other values (25) | 4105 |
채집일
Text
MISSING
 
Distinct | 464 |
---|---|
Distinct (%) | 16.7% |
Missing | 1121 |
Missing (%) | 28.8% |
Memory size | 30.6 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 27780 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 230 ? |
---|---|
Unique (%) | 8.3% |
Sample
1st row | 2018-04-15 |
---|---|
2nd row | 2018-09-29 |
3rd row | 2018-09-29 |
4th row | 2018-09-29 |
5th row | 2018-08-25 |
Value | Count | Frequency (%) |
2010-08-19 | 109 | 3.9% |
2016-07-09 | 103 | 3.7% |
2011-01-01 | 97 | 3.5% |
2016-05-28 | 85 | 3.1% |
2010-08-20 | 84 | 3.0% |
2016-04-23 | 84 | 3.0% |
2016-04-10 | 80 | 2.9% |
2010-08-22 | 74 | 2.7% |
2010-09-07 | 67 | 2.4% |
2016-09-17 | 57 | 2.1% |
Other values (454) | 1938 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8046 | |
- | 5556 | |
2 | 4422 | |
1 | 4106 | |
9 | 1144 | 4.1% |
8 | 1078 | 3.9% |
6 | 1023 | 3.7% |
7 | 976 | 3.5% |
5 | 533 | 1.9% |
4 | 493 | 1.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 22224 | |
Dash Punctuation | 5556 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 8046 | |
2 | 4422 | |
1 | 4106 | |
9 | 1144 | 5.1% |
8 | 1078 | 4.9% |
6 | 1023 | 4.6% |
7 | 976 | 4.4% |
5 | 533 | 2.4% |
4 | 493 | 2.2% |
3 | 403 | 1.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 5556 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 27780 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 8046 | |
- | 5556 | |
2 | 4422 | |
1 | 4106 | |
9 | 1144 | 4.1% |
8 | 1078 | 3.9% |
6 | 1023 | 3.7% |
7 | 976 | 3.5% |
5 | 533 | 1.9% |
4 | 493 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 27780 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8046 | |
- | 5556 | |
2 | 4422 | |
1 | 4106 | |
9 | 1144 | 4.1% |
8 | 1078 | 3.9% |
6 | 1023 | 3.7% |
7 | 976 | 3.5% |
5 | 533 | 1.9% |
4 | 493 | 1.8% |
번호 | 목 | 과 | |
---|---|---|---|
번호 | 1.000 | 0.777 | 0.934 |
목 | 0.777 | 1.000 | 1.000 |
과 | 0.934 | 1.000 | 1.000 |
번호 | 목 | |
---|---|---|
번호 | 1.000 | 0.353 |
목 | 0.353 | 1.000 |
번호 | 목 | 과 | 속 | 종 | 채집일 | |
---|---|---|---|---|---|---|
0 | 1 | Hemiptera | Velliidae | Microvelia | horvathi | 2018-04-15 |
1 | 2 | Hemiptera | Lygaeidae | Dimorphopterus | pallipes | 2018-09-29 |
2 | 3 | Hemiptera | Lygaeidae | Dimorphopterus | pallipes | 2018-09-29 |
3 | 4 | Hemiptera | Lygaeidae | Dimorphopterus | pallipes | 2018-09-29 |
4 | 5 | Hemiptera | Lygaeidae | Paradieuches | dissimils | 2018-08-25 |
5 | 6 | Hemiptera | Lygaeidae | Paradieuches | dissimils | 2018-08-25 |
6 | 7 | Hemiptera | Tingidae | Stephanitis | pyrioides | 2018-08-04 |
7 | 8 | Hemiptera | Tingidae | Stephanitis | pyrioides | 2018-08-04 |
8 | 9 | Hemiptera | Beytidae | Yemma | exilis | 2018-08-19 |
9 | 10 | Hemiptera | Tingidae | Corythucha | marmorata | 2018-08-10 |
번호 | 목 | 과 | 속 | 종 | 채집일 | |
---|---|---|---|---|---|---|
3889 | 3942 | Lepidoptera | Noctuidae | Corgatha | dictaria | <NA> |
3890 | 3943 | Lepidoptera | Noctuidae | Mythimna | bani | <NA> |
3891 | 3944 | Lepidoptera | Noctuidae | Bryophilina | mollicula | <NA> |
3892 | 3945 | Lepidoptera | Noctuidae | Blasticorhinus | ussuriensis | <NA> |
3893 | 3946 | Coleoptera | Cerambycidae | Aegosoma | sinicum | 2018-07-14 |
3894 | 3947 | Lepidoptera | Sphingidae | Sphinx | morio | <NA> |
3895 | 3948 | Lepidoptera | Noctuidae | Mythimna | loreyi | <NA> |
3896 | 3949 | Lepidoptera | Noctuidae | Mythimna | loreyi | <NA> |
3897 | 3950 | Lepidoptera | Noctuidae | Mythimna | loreyi | <NA> |
3898 | 3951 | Isoptera | Rhinotermitidae | Coptotermes | gestroi | <NA> |