Overview

Dataset statistics

Number of variables: 3
Number of observations: 34
Missing cells: 4
Missing cells (%): 3.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 982.0 B
Average record size in memory: 28.9 B
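The two derived figures in this table follow from the raw counts. The short sketch below recomputes them; the variable names are illustrative and the inputs are copied from the table above.

```python
# Figures copied from the Overview table of this report.
n_variables = 3
n_observations = 34
missing_cells = 4
total_bytes = 982.0

total_cells = n_variables * n_observations        # every cell in the table
missing_pct = 100 * missing_cells / total_cells   # share of empty cells
avg_record_bytes = total_bytes / n_observations   # bytes per row

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```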

Variable types

Numeric: 1
Text: 2

Dataset

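Description: A list of regulated pests that reflects changes to the quarantine status of existing pests following pest risk analysis. When one of these pests is detected in imported plants, the consignment is subject to a quarantine measure such as disinfection, disposal, or return.
Author: 농림축산식품부 농림축산검역본부 (Ministry of Agriculture, Food and Rural Affairs, Animal and Plant Quarantine Agency)
URL: https://www.data.go.kr/data/15091842/fileData.do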

Alerts

Unnamed: 2 has 4 (11.8%) missing values [Missing]
잡초 has unique values [Unique]
잡초명 has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 15:26:50.853895
Analysis finished: 2023-12-12 15:26:51.262723
Duration: 0.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
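A report like this one can be regenerated roughly as sketched below. The function name `build_report`, the output path, and the `cp949` encoding are assumptions made for illustration (the report does not state how the file was read); `ProfileReport` and `to_file` are the ydata-profiling entry points.

```python
from pathlib import Path


def build_report(csv_path, out_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so the helper can be defined without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(
        df,
        title="Profiling report",
        # Dataset metadata shown in the Overview; URL copied from this report.
        dataset={"url": "https://www.data.go.kr/data/15091842/fileData.do"},
    )
    report.to_file(Path(out_path))
    return report
```

Calling `build_report("weeds.csv")` (a hypothetical file name) would write `report.html` to the working directory.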

Variables

잡초
Real number (ℝ)

UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17.5
Minimum: 1
Maximum: 34
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 438.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.65
Q1: 9.25
Median: 17.5
Q3: 25.75
95th percentile: 32.35
Maximum: 34
Range: 33
Interquartile range (IQR): 16.5
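The quantiles above are consistent with linear interpolation over the sorted values 1 to 34. The sketch below reproduces them with the standard library, assuming the column holds exactly the integers 1 through 34, as the distinct count, minimum, and maximum in this report suggest.

```python
from statistics import quantiles

# Assumed contents of the 잡초 column: the integers 1, 2, ..., 34.
values = list(range(1, 35))

# method="inclusive" interpolates linearly between the minimum and maximum.
q1, q2, q3 = quantiles(values, n=4, method="inclusive")
twentieths = quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

value_range = max(values) - min(values)
iqr = q3 - q1

print("5th percentile:", p5)
print("Q1, median, Q3:", q1, q2, q3)
print("95th percentile:", p95)
print("Range:", value_range, "IQR:", iqr)
```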

Descriptive statistics

Standard deviation: 9.9582462
Coefficient of variation (CV): 0.56904264
Kurtosis: -1.2
Mean: 17.5
Median Absolute Deviation (MAD): 8.5
Skewness: 0
Sum: 595
Variance: 99.166667
Monotonicity: Strictly increasing
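Most of these figures can be checked by hand. The sketch below recomputes them with the standard library, again assuming the column holds the integers 1 through 34; the variance and standard deviation are the sample (n - 1) versions, which is what the reported numbers match.

```python
from statistics import mean, median, stdev, variance

# Assumed contents of the 잡초 column: the integers 1, 2, ..., 34.
values = list(range(1, 35))

mu = mean(values)
var = variance(values)                        # sample variance (n - 1 denominator)
sd = stdev(values)                            # sample standard deviation
cv = sd / mu                                  # coefficient of variation
mad = median(abs(v - mu) for v in values)     # median absolute deviation
total = sum(values)
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(f"Mean {mu}, Sum {total}, Variance {var:.6f}")
print(f"Standard deviation {sd:.7f}, CV {cv:.8f}, MAD {mad}")
print("Strictly increasing:", strictly_increasing)
```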
Histogram with fixed size bins (bins=34)
Common values
Value | Count | Frequency (%)
1 | 1 | 2.9%
27 | 1 | 2.9%
21 | 1 | 2.9%
22 | 1 | 2.9%
23 | 1 | 2.9%
24 | 1 | 2.9%
25 | 1 | 2.9%
26 | 1 | 2.9%
28 | 1 | 2.9%
19 | 1 | 2.9%
Other values (24) | 24 | 70.6%
Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 2.9%
2 | 1 | 2.9%
3 | 1 | 2.9%
4 | 1 | 2.9%
5 | 1 | 2.9%
6 | 1 | 2.9%
7 | 1 | 2.9%
8 | 1 | 2.9%
9 | 1 | 2.9%
10 | 1 | 2.9%
Maximum 10 values
Value | Count | Frequency (%)
34 | 1 | 2.9%
33 | 1 | 2.9%
32 | 1 | 2.9%
31 | 1 | 2.9%
30 | 1 | 2.9%
29 | 1 | 2.9%
28 | 1 | 2.9%
27 | 1 | 2.9%
26 | 1 | 2.9%
25 | 1 | 2.9%

잡초명
Text

UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 60
Median length: 25
Mean length: 21.382353
Min length: 11

Characters and Unicode

Total characters: 727
Distinct characters: 48
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
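As a rough illustration of that idea, the sketch below counts Unicode general categories for one value of this column using Python's `unicodedata` module. It is not the profiler's own implementation; the two-letter codes correspond to the category names used in this report (for example `Ll` is Lowercase Letter and `Zs` is Space Separator).

```python
import unicodedata
from collections import Counter

# The longest value in the 잡초명 column (60 characters), taken from this report.
name = "Amsinckia menziesii var. intermedia (= Amsinckia intermedia)"

# Map every character to its Unicode general category and tally the result.
category_counts = Counter(unicodedata.category(ch) for ch in name)

for code, count in category_counts.most_common():
    print(code, count)
```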

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: Achyranthes aspera
2nd row: Alternanthera philoxeroides
3rd row: Amsinckia menziesii var. intermedia (= Amsinckia intermedia)
4th row: Brachiaria decumbens
5th row: Brassica tournefortii
Value | Count | Frequency (%)
(blank) | 4 | 4.9%
spp | 3 | 3.7%
centaurea | 2 | 2.4%
echioides | 2 | 2.4%
digitaria | 2 | 2.4%
salvinia | 2 | 2.4%
intermedia | 2 | 2.4%
repens | 2 | 2.4%
cirsium | 2 | 2.4%
amsinckia | 2 | 2.4%
Other values (59) | 59 | 72.0%

Most occurring characters

Value | Count | Frequency (%)
i | 82 | 11.3%
a | 78 | 10.7%
e | 54 | 7.4%
n | 50 | 6.9%
s | 46 | 6.3%
(space) | 44 | 6.1%
r | 41 | 5.6%
u | 38 | 5.2%
o | 33 | 4.5%
t | 31 | 4.3%
Other values (38) | 230 | 31.6%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 625 | 86.0%
Space Separator | 44 | 6.1%
Uppercase Letter | 38 | 5.2%
Close Punctuation | 4 | 0.6%
Math Symbol | 4 | 0.6%
Open Punctuation | 4 | 0.6%
Control | 4 | 0.6%
Other Punctuation | 4 | 0.6%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
i | 82 | 13.1%
a | 78 | 12.5%
e | 54 | 8.6%
n | 50 | 8.0%
s | 46 | 7.4%
r | 41 | 6.6%
u | 38 | 6.1%
o | 33 | 5.3%
t | 31 | 5.0%
m | 28 | 4.5%
Other values (16) | 144 | 23.0%

Uppercase Letter
Value | Count | Frequency (%)
C | 10 | 26.3%
S | 5 | 13.2%
A | 4 | 10.5%
O | 3 | 7.9%
D | 2 | 5.3%
B | 2 | 5.3%
L | 2 | 5.3%
M | 2 | 5.3%
I | 1 | 2.6%
R | 1 | 2.6%
Other values (6) | 6 | 15.8%

Space Separator
Value | Count | Frequency (%)
(space) | 44 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 4 | 100.0%

Math Symbol
Value | Count | Frequency (%)
= | 4 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 4 | 100.0%

Control
Value | Count | Frequency (%)
(control character) | 4 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 663 | 91.2%
Common | 64 | 8.8%

Most frequent character per script

Latin
Value | Count | Frequency (%)
i | 82 | 12.4%
a | 78 | 11.8%
e | 54 | 8.1%
n | 50 | 7.5%
s | 46 | 6.9%
r | 41 | 6.2%
u | 38 | 5.7%
o | 33 | 5.0%
t | 31 | 4.7%
m | 28 | 4.2%
Other values (32) | 182 | 27.5%

Common
Value | Count | Frequency (%)
(space) | 44 | 68.8%
) | 4 | 6.2%
= | 4 | 6.2%
( | 4 | 6.2%
(control character) | 4 | 6.2%
. | 4 | 6.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 727 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
i | 82 | 11.3%
a | 78 | 10.7%
e | 54 | 7.4%
n | 50 | 6.9%
s | 46 | 6.3%
(space) | 44 | 6.1%
r | 41 | 5.6%
u | 38 | 5.2%
o | 33 | 4.5%
t | 31 | 4.3%
Other values (38) | 230 | 31.6%

Unnamed: 2
Text

MISSING 

Distinct: 30
Distinct (%): 100.0%
Missing: 4
Missing (%): 11.8%
Memory size: 404.0 B

Length

Max length: 82
Median length: 22
Mean length: 18.533333
Min length: 6

Characters and Unicode

Total characters: 556
Distinct characters: 42
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 30
Unique (%): 100.0%

Sample

1st row: devil’s horsewhip, burweed, prickly chaff flower, rough chaff flower, chaff-flower
2nd row: Alligator weed
3rd row: Coast fiddleneck
4th row: signal grass
5th row: African mustard
Value | Count | Frequency (%)
thistle | 4 | 5.8%
prickly | 2 | 2.9%
chaff | 2 | 2.9%
burweed | 2 | 2.9%
flower | 2 | 2.9%
grass | 2 | 2.9%
witchweed | 1 | 1.4%
dayflower | 1 | 1.4%
spiny | 1 | 1.4%
chaff-flower | 1 | 1.4%
Other values (51) | 51 | 73.9%

Most occurring characters

Value | Count | Frequency (%)
e | 61 | 11.0%
r | 40 | 7.2%
(space) | 40 | 7.2%
a | 35 | 6.3%
i | 33 | 5.9%
s | 30 | 5.4%
l | 30 | 5.4%
o | 29 | 5.2%
d | 28 | 5.0%
t | 28 | 5.0%
Other values (32) | 202 | 36.3%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 473 | 85.1%
Space Separator | 40 | 7.2%
Uppercase Letter | 28 | 5.0%
Other Punctuation | 7 | 1.3%
Dash Punctuation | 7 | 1.3%
Final Punctuation | 1 | 0.2%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
e | 61 | 12.9%
r | 40 | 8.5%
a | 35 | 7.4%
i | 33 | 7.0%
s | 30 | 6.3%
l | 30 | 6.3%
o | 29 | 6.1%
d | 28 | 5.9%
t | 28 | 5.9%
n | 22 | 4.7%
Other values (14) | 137 | 29.0%

Uppercase Letter
Value | Count | Frequency (%)
S | 6 | 21.4%
C | 4 | 14.3%
K | 2 | 7.1%
W | 2 | 7.1%
D | 2 | 7.1%
R | 2 | 7.1%
P | 2 | 7.1%
A | 2 | 7.1%
F | 1 | 3.6%
N | 1 | 3.6%
Other values (4) | 4 | 14.3%

Space Separator
Value | Count | Frequency (%)
(space) | 40 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 7 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 7 | 100.0%

Final Punctuation
Value | Count | Frequency (%)
’ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 501 | 90.1%
Common | 55 | 9.9%

Most frequent character per script

Latin
Value | Count | Frequency (%)
e | 61 | 12.2%
r | 40 | 8.0%
a | 35 | 7.0%
i | 33 | 6.6%
s | 30 | 6.0%
l | 30 | 6.0%
o | 29 | 5.8%
d | 28 | 5.6%
t | 28 | 5.6%
n | 22 | 4.4%
Other values (28) | 165 | 32.9%

Common
Value | Count | Frequency (%)
(space) | 40 | 72.7%
, | 7 | 12.7%
- | 7 | 12.7%
’ | 1 | 1.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 555 | 99.8%
Punctuation | 1 | 0.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
e | 61 | 11.0%
r | 40 | 7.2%
(space) | 40 | 7.2%
a | 35 | 6.3%
i | 33 | 5.9%
s | 30 | 5.4%
l | 30 | 5.4%
o | 29 | 5.2%
d | 28 | 5.0%
t | 28 | 5.0%
Other values (31) | 201 | 36.2%

Punctuation
Value | Count | Frequency (%)
’ | 1 | 100.0%

Interactions


Correlations

 | 잡초 | 잡초명 | Unnamed: 2
잡초 | 1.000 | 1.000 | 1.000
잡초명 | 1.000 | 1.000 | 1.000
Unnamed: 2 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
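Both views can be approximated in plain text. The sketch below uses the first ten rows shown in the Sample section of this report, with `None` standing in for `<NA>`; the `#`/`.` rendering is an illustrative choice, not the report's own plot.

```python
# First ten sample rows from this report; None stands for <NA>.
columns = ["잡초", "잡초명", "Unnamed: 2"]
rows = [
    (1, "Achyranthes aspera", "devil’s horsewhip, burweed, prickly chaff flower, rough chaff flower, chaff-flower"),
    (2, "Alternanthera philoxeroides", "Alligator weed"),
    (3, "Amsinckia menziesii var. intermedia (= Amsinckia intermedia)", "Coast fiddleneck"),
    (4, "Brachiaria decumbens", "signal grass"),
    (5, "Brassica tournefortii", "African mustard"),
    (6, "Calopogonium mucunoides", "calopo, wild ground nut"),
    (7, "Carduus tenuiflorus", "Seaside thistle"),
    (8, "Cenchrus longispinus", "Longspine sandbur"),
    (9, "Centaurea solstitialis", None),
    (10, "Chondrilla juncea", "Rush skeletonweed"),
]

# Nullity by column: number of missing cells in each column.
missing = {col: sum(row[i] is None for row in rows) for i, col in enumerate(columns)}

# Nullity matrix: one string per row, '#' for a present cell and '.' for a missing one.
matrix = ["".join("." if cell is None else "#" for cell in row) for row in rows]

print(missing)
print("\n".join(matrix))
```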

Sample

First rows
(index) | 잡초 | 잡초명 | Unnamed: 2
0 | 1 | Achyranthes aspera | devil’s horsewhip, burweed, prickly chaff flower, rough chaff flower, chaff-flower
1 | 2 | Alternanthera philoxeroides | Alligator weed
2 | 3 | Amsinckia menziesii var. intermedia (= Amsinckia intermedia) | Coast fiddleneck
3 | 4 | Brachiaria decumbens | signal grass
4 | 5 | Brassica tournefortii | African mustard
5 | 6 | Calopogonium mucunoides | calopo, wild ground nut
6 | 7 | Carduus tenuiflorus | Seaside thistle
7 | 8 | Cenchrus longispinus | Longspine sandbur
8 | 9 | Centaurea solstitialis | <NA>
9 | 10 | Chondrilla juncea | Rush skeletonweed
Last rows
(index) | 잡초 | 잡초명 | Unnamed: 2
24 | 25 | Oenanthe pimpinelloides | Corky-fruited water-dropwort
25 | 26 | Onopordum acanthium | Scotch thistle
26 | 27 | Orobanche spp. | <NA>
27 | 28 | Rhaponticum repens (= Centaurea repens) | Russian Knapweed
28 | 29 | Salvinia adnata (= Salvinia molesta) | Karibaweed
29 | 30 | Senecio jacobaea | Stinking willie
30 | 31 | Solanum elaeagnifolium | White horsenettle
31 | 32 | Striga spp. | Witchweed
32 | 33 | Themeda quadrivalvis | Grader grass
33 | 34 | Xanthium spinosum | Prickly burweed