Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 2005 |
Missing cells | 3 |
Missing cells (%) | < 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 78.4 KiB |
Average record size in memory | 40.1 B |
Variable types
Text | 4 |
---|---|
Categorical | 1 |
Dataset
Description | 국내 외 최초로 검역관련 한국의 식물병해충 소장 표본 목록을 제공함으로써 국민의 알권리와 교육 자료로 활용하게 함으로써 해외에서 유입되는 외래 병해충의 국내유입을 미리 예방하여 국내 자연과 환경을 보호하고자 함 |
---|---|
Author | 공공데이터포털 |
URL | https://www.data.go.kr/data/15117735/fileData.do |
Reproduction
Analysis started | 2024-04-21 09:03:07.277443 |
---|---|
Analysis finished | 2024-04-21 09:03:08.179146 |
Duration | 0.9 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
표본번호
Text
UNIQUE
 
Distinct | 2005 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.8 KiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Characters and Unicode
Total characters | 24060 |
---|---|
Distinct characters | 14 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 2005 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | PQG 0000001 |
---|---|
2nd row | PQG 0000002 |
3rd row | PQG 0000003 |
4th row | PQG 0000004 |
5th row | PQG 0000005 |
Value | Count | Frequency (%) |
pqg | 2005 | |
0001333 | 1 | < 0.1% |
0001346 | 1 | < 0.1% |
0001345 | 1 | < 0.1% |
0001344 | 1 | < 0.1% |
0001343 | 1 | < 0.1% |
0001342 | 1 | < 0.1% |
0001341 | 1 | < 0.1% |
0001340 | 1 | < 0.1% |
0001339 | 1 | < 0.1% |
Other values (1996) | 1996 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 7624 | |
4010 | ||
P | 2005 | 8.3% |
Q | 2005 | 8.3% |
G | 2005 | 8.3% |
1 | 1601 | 6.7% |
2 | 607 | 2.5% |
3 | 601 | 2.5% |
4 | 601 | 2.5% |
5 | 601 | 2.5% |
Other values (4) | 2400 | 10.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 14035 | |
Uppercase Letter | 6015 | |
Space Separator | 4010 | 16.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 7624 | |
1 | 1601 | 11.4% |
2 | 607 | 4.3% |
3 | 601 | 4.3% |
4 | 601 | 4.3% |
5 | 601 | 4.3% |
6 | 600 | 4.3% |
7 | 600 | 4.3% |
8 | 600 | 4.3% |
9 | 600 | 4.3% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 2005 | |
Q | 2005 | |
G | 2005 |
Space Separator
Value | Count | Frequency (%) |
4010 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 18045 | |
Latin | 6015 | 25.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 7624 | |
4010 | ||
1 | 1601 | 8.9% |
2 | 607 | 3.4% |
3 | 601 | 3.3% |
4 | 601 | 3.3% |
5 | 601 | 3.3% |
6 | 600 | 3.3% |
7 | 600 | 3.3% |
8 | 600 | 3.3% |
Latin
Value | Count | Frequency (%) |
P | 2005 | |
Q | 2005 | |
G | 2005 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 24060 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 7624 | |
4010 | ||
P | 2005 | 8.3% |
Q | 2005 | 8.3% |
G | 2005 | 8.3% |
1 | 1601 | 6.7% |
2 | 607 | 2.5% |
3 | 601 | 2.5% |
4 | 601 | 2.5% |
5 | 601 | 2.5% |
Other values (4) | 2400 | 10.0% |
학명
Text
Distinct | 367 |
---|---|
Distinct (%) | 18.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.8 KiB |
Length
Max length | 52 |
---|---|
Median length | 47 |
Mean length | 32.715711 |
Min length | 11 |
Characters and Unicode
Total characters | 65595 |
---|---|
Distinct characters | 58 |
Distinct categories | 8 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 153 ? |
---|---|
Unique (%) | 7.6% |
Sample
1st row | Aleurocanthus spiniferus (Quaintance) |
---|---|
2nd row | Aleurocanthus spiniferus (Quaintance) |
3rd row | Aleurocanthus spiniferus (Quaintance) |
4th row | Aleurocanthus woglumi Ashby |
5th row | Aleurocanthus woglumi Ashby |
Value | Count | Frequency (%) |
takahashi | 222 | 3.5% |
lepidosaphes | 194 | 3.0% |
maskell | 190 | 3.0% |
trialeurodes | 156 | 2.4% |
westwood | 147 | 2.3% |
vaporariorum | 141 | 2.2% |
kuwana | 138 | 2.2% |
and | 128 | 2.0% |
green | 127 | 2.0% |
ceroplastes | 120 | 1.9% |
Other values (585) | 4818 |
Most occurring characters
Value | Count | Frequency (%) |
8549 | ||
a | 6349 | 9.7% |
e | 5138 | 7.8% |
i | 5081 | 7.7% |
s | 4755 | 7.2% |
r | 3701 | 5.6% |
o | 3592 | 5.5% |
l | 3076 | 4.7% |
u | 2928 | 4.5% |
c | 2355 | 3.6% |
Other values (48) | 20071 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 49793 | |
Space Separator | 8549 | 13.0% |
Uppercase Letter | 4154 | 6.3% |
Open Punctuation | 1475 | 2.2% |
Close Punctuation | 1475 | 2.2% |
Other Punctuation | 118 | 0.2% |
Connector Punctuation | 24 | < 0.1% |
Decimal Number | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 6349 | |
e | 5138 | |
i | 5081 | |
s | 4755 | |
r | 3701 | 7.4% |
o | 3592 | 7.2% |
l | 3076 | 6.2% |
u | 2928 | 5.9% |
c | 2355 | 4.7% |
n | 2341 | 4.7% |
Other values (16) | 10477 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 537 | |
T | 472 | |
C | 455 | |
A | 396 | |
M | 342 | |
L | 285 | 6.9% |
B | 261 | 6.3% |
G | 220 | 5.3% |
W | 216 | 5.2% |
D | 206 | 5.0% |
Other values (15) | 764 |
Other Punctuation
Value | Count | Frequency (%) |
. | 117 | |
, | 1 | 0.8% |
Space Separator
Value | Count | Frequency (%) |
8549 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1475 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1475 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 24 |
Decimal Number
Value | Count | Frequency (%) |
1 | 7 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 53947 | |
Common | 11648 | 17.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 6349 | |
e | 5138 | 9.5% |
i | 5081 | 9.4% |
s | 4755 | 8.8% |
r | 3701 | 6.9% |
o | 3592 | 6.7% |
l | 3076 | 5.7% |
u | 2928 | 5.4% |
c | 2355 | 4.4% |
n | 2341 | 4.3% |
Other values (41) | 14631 |
Common
Value | Count | Frequency (%) |
8549 | ||
( | 1475 | 12.7% |
) | 1475 | 12.7% |
. | 117 | 1.0% |
_ | 24 | 0.2% |
1 | 7 | 0.1% |
, | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 65595 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
8549 | ||
a | 6349 | 9.7% |
e | 5138 | 7.8% |
i | 5081 | 7.7% |
s | 4755 | 7.2% |
r | 3701 | 5.6% |
o | 3592 | 5.5% |
l | 3076 | 4.7% |
u | 2928 | 4.5% |
c | 2355 | 3.6% |
Other values (48) | 20071 |
국가
Text
Distinct | 51 |
---|---|
Distinct (%) | 2.5% |
Missing | 3 |
Missing (%) | 0.1% |
Memory size | 15.8 KiB |
Value | Count | Frequency (%) |
대한민국 | 1001 | |
라오스 | 269 | 13.3% |
베트남 | 122 | 6.1% |
미국 | 119 | 5.9% |
대만 | 77 | 3.8% |
일본 | 53 | 2.6% |
태국 | 46 | 2.3% |
중국 | 44 | 2.2% |
호주 | 40 | 2.0% |
필리핀 | 31 | 1.5% |
Other values (47) | 214 | 10.6% |
Most occurring characters
Value | Count | Frequency (%) |
국 | 1234 | |
대 | 1078 | |
한 | 1001 | |
민 | 1001 | |
스 | 302 | 4.4% |
라 | 280 | 4.0% |
오 | 269 | 3.9% |
남 | 140 | 2.0% |
트 | 128 | 1.9% |
베 | 126 | 1.8% |
Other values (86) | 1355 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 6900 | |
Space Separator | 14 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
국 | 1234 | |
대 | 1078 | |
한 | 1001 | |
민 | 1001 | |
스 | 302 | 4.4% |
라 | 280 | 4.1% |
오 | 269 | 3.9% |
남 | 140 | 2.0% |
트 | 128 | 1.9% |
베 | 126 | 1.8% |
Other values (85) | 1341 |
Space Separator
Value | Count | Frequency (%) |
14 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 6900 | |
Common | 14 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
국 | 1234 | |
대 | 1078 | |
한 | 1001 | |
민 | 1001 | |
스 | 302 | 4.4% |
라 | 280 | 4.1% |
오 | 269 | 3.9% |
남 | 140 | 2.0% |
트 | 128 | 1.9% |
베 | 126 | 1.8% |
Other values (85) | 1341 |
Common
Value | Count | Frequency (%) |
14 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 6900 | |
ASCII | 14 | 0.2% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
국 | 1234 | |
대 | 1078 | |
한 | 1001 | |
민 | 1001 | |
스 | 302 | 4.4% |
라 | 280 | 4.1% |
오 | 269 | 3.9% |
남 | 140 | 2.0% |
트 | 128 | 1.9% |
베 | 126 | 1.8% |
Other values (85) | 1341 |
ASCII
Value | Count | Frequency (%) |
14 |
검사_채집일
Text
Distinct | 610 |
---|---|
Distinct (%) | 30.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.8 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.9850374 |
Min length | 4 |
Characters and Unicode
Total characters | 20020 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 304 ? |
---|---|
Unique (%) | 15.2% |
Sample
1st row | 1990-06-18 |
---|---|
2nd row | 2011-01-14 |
3rd row | 2011-01-14 |
4th row | 1965-08-06 |
5th row | 2005-05-10 |
Value | Count | Frequency (%) |
2015-04-27 | 99 | 4.9% |
2015-04-28 | 47 | 2.3% |
2003-08-30 | 35 | 1.7% |
2003-10-03 | 31 | 1.5% |
1998-04-15 | 26 | 1.3% |
2015-04-29 | 24 | 1.2% |
2015-04-30 | 24 | 1.2% |
2008-03-18 | 21 | 1.0% |
2007-05-10 | 19 | 0.9% |
2003-10-04 | 18 | 0.9% |
Other values (600) | 1661 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 5208 | |
- | 4010 | |
2 | 2770 | |
1 | 2548 | |
9 | 1179 | 5.9% |
5 | 983 | 4.9% |
8 | 726 | 3.6% |
3 | 717 | 3.6% |
7 | 648 | 3.2% |
4 | 628 | 3.1% |
Other values (2) | 603 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 16000 | |
Dash Punctuation | 4010 | 20.0% |
Other Punctuation | 10 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 5208 | |
2 | 2770 | |
1 | 2548 | |
9 | 1179 | 7.4% |
5 | 983 | 6.1% |
8 | 726 | 4.5% |
3 | 717 | 4.5% |
7 | 648 | 4.0% |
4 | 628 | 3.9% |
6 | 593 | 3.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4010 |
Other Punctuation
Value | Count | Frequency (%) |
. | 10 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 20020 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 5208 | |
- | 4010 | |
2 | 2770 | |
1 | 2548 | |
9 | 1179 | 5.9% |
5 | 983 | 4.9% |
8 | 726 | 3.6% |
3 | 717 | 3.6% |
7 | 648 | 3.2% |
4 | 628 | 3.1% |
Other values (2) | 603 | 3.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 20020 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 5208 | |
- | 4010 | |
2 | 2770 | |
1 | 2548 | |
9 | 1179 | 5.9% |
5 | 983 | 4.9% |
8 | 726 | 3.6% |
3 | 717 | 3.6% |
7 | 648 | 3.2% |
4 | 628 | 3.1% |
Other values (2) | 603 | 3.0% |
라벨 종류
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.8 KiB |
슬라이드 | |
---|---|
해충(건조) | 2 |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 4.001995 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 슬라이드 |
---|---|
2nd row | 슬라이드 |
3rd row | 슬라이드 |
4th row | 슬라이드 |
5th row | 슬라이드 |
Common Values
Value | Count | Frequency (%) |
슬라이드 | 2003 | |
해충(건조) | 2 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
슬라이드 | 2003 | |
해충(건조 | 2 | 0.1% |
국가 | 라벨 종류 | |
---|---|---|
국가 | 1.000 | 0.133 |
라벨 종류 | 0.133 | 1.000 |
표본번호 | 학명 | 국가 | 검사_채집일 | 라벨 종류 | |
---|---|---|---|---|---|
0 | PQG 0000001 | Aleurocanthus spiniferus (Quaintance) | 대만 | 1990-06-18 | 슬라이드 |
1 | PQG 0000002 | Aleurocanthus spiniferus (Quaintance) | 대만 | 2011-01-14 | 슬라이드 |
2 | PQG 0000003 | Aleurocanthus spiniferus (Quaintance) | 대만 | 2011-01-14 | 슬라이드 |
3 | PQG 0000004 | Aleurocanthus woglumi Ashby | 케냐 | 1965-08-06 | 슬라이드 |
4 | PQG 0000005 | Aleurocanthus woglumi Ashby | 미국 | 2005-05-10 | 슬라이드 |
5 | PQG 0000006 | Aleuroduplidens eucalyptifolia Martin | 호주 | 2006-04-26 | 슬라이드 |
6 | PQG 0000007 | Aleurolobus marlatti (Quaintance) | 대만 | 2003-08-30 | 슬라이드 |
7 | PQG 0000008 | Aleurolobus marlatti (Quaintance) | 베트남 | 2013-02-06 | 슬라이드 |
8 | PQG 0000009 | Aleurolobus marlatti (Quaintance) | 베트남 | 2013-02-06 | 슬라이드 |
9 | PQG 0000010 | Aleurotrachelus dryandrae Solomon | 호주 | 2006-05-22 | 슬라이드 |
표본번호 | 학명 | 국가 | 검사_채집일 | 라벨 종류 | |
---|---|---|---|---|---|
1995 | PQG 0001996 | Pseudaulacaspis cockerelli (Cooley) | 대한민국 | 2014-03-13 | 슬라이드 |
1996 | PQG 0001997 | Pseudaulacaspis cockerelli (Cooley) | 대한민국 | 2014-04-02 | 슬라이드 |
1997 | PQG 0001998 | Pseudaulacaspis cockerelli (Cooley) | 대한민국 | 2014-04-01 | 슬라이드 |
1998 | PQG 0001999 | Pseudaulacaspis cockerelli (Cooley) | 대한민국 | 2014-04-01 | 슬라이드 |
1999 | PQG 0002000 | Pseudaulacaspis cockerelli (Cooley) | 대한민국 | 2014-07-27 | 슬라이드 |
2000 | PQG 0002001 | Tinocallis kahawaluokalani (Kirkaldy) | 대한민국 | 2014-05-20 | 슬라이드 |
2001 | PQG 0002002 | Tinocallis kahawaluokalani (Kirkaldy) | 대한민국 | 2014-05-20 | 슬라이드 |
2002 | PQG 0002003 | Tinocallis kahawaluokalani (Kirkaldy) | 대한민국 | 2014-05-20 | 슬라이드 |
2003 | PQG 0002004 | Aspidiotus chinensis Kuwana | 일본 | 2015-02-10 | 슬라이드 |
2004 | PQG 0002005 | Octaspidiotus stauntoniae (Takahashi) | 대한민국 | 2014-12-19 | 슬라이드 |