Overview

Dataset statistics

Number of variables: 31
Number of observations: 5083
Missing cells: 17651
Missing cells (%): 11.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.2 MiB
Average record size in memory: 249.0 B
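
The missing-cell percentage follows from the counts above. As a quick check, assuming the percentage is taken over all cells (variables × observations), the figure can be reproduced as follows; the variable names are illustrative only.

```python
# Counts copied from the Dataset statistics table.
n_variables = 31
n_observations = 5083
missing_cells = 17651

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```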

Variable types

Text: 11
Categorical: 19
Unsupported: 1

Dataset

Description: 종코드, 국명, 학명, 계명(영문), 계명(국문), 문명(영문), 문명(국문), 강명(영문), 강명(국문), 목명(영문), 목명(국문), 과명(영문), 과명(국문), 국명_이명, 형태특성, 생태특성, 서울시보호종여부, 고유종 여부, 멸종위기야생동식물 여부, 먹는자 처벌대상 야생동식물 여부, 포획금지 야생동식물 여부, 인공증식을 위한 포획허가대상 야생동물 여부, 유해야생동물 여부, 수렵동물 여부, 생태계교란 야생동식물 여부, 야생화된 동물 여부, 수출입 허가대상 야생동물 여부, 국외반출승인대상생물자원 여부, 국제적멸종위기종 여부, 외래식물 여부, 천연기념물 지정정보
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-2199/S/1/datasetView.do

Alerts

문명(영문) is highly imbalanced (51.9%) [Imbalance]
문명(국문) is highly imbalanced (51.9%) [Imbalance]
강명(영문) is highly imbalanced (53.8%) [Imbalance]
강명(국문) is highly imbalanced (53.4%) [Imbalance]
서울시보호종여부 is highly imbalanced (92.2%) [Imbalance]
고유종 여부 is highly imbalanced (74.3%) [Imbalance]
멸종위기야생동식물 여부 is highly imbalanced (93.0%) [Imbalance]
먹는자 처벌대상 야생동식물 여부 is highly imbalanced (96.0%) [Imbalance]
포획금지 야생동식물 여부 is highly imbalanced (73.6%) [Imbalance]
인공증식을 위한 포획허가대상 야생동물 여부 is highly imbalanced (97.6%) [Imbalance]
유해야생동물 여부 is highly imbalanced (97.6%) [Imbalance]
수렵동물 여부 is highly imbalanced (97.1%) [Imbalance]
생태계교란 야생동식물 여부 is highly imbalanced (98.7%) [Imbalance]
수출입 허가대상 야생동물 여부 is highly imbalanced (71.9%) [Imbalance]
국외반출승인대상생물자원 여부 is highly imbalanced (87.1%) [Imbalance]
국제적멸종위기종 여부 is highly imbalanced (97.4%) [Imbalance]
외래식물 여부 is highly imbalanced (77.9%) [Imbalance]
국명_이명 has 4961 (97.6%) missing values [Missing]
형태특성 has 1234 (24.3%) missing values [Missing]
생태특성 has 1286 (25.3%) missing values [Missing]
야생화된 동물 여부 has 5083 (100.0%) missing values [Missing]
천연기념물 지정정보 has 5067 (99.7%) missing values [Missing]
종코드 has unique values [Unique]
야생화된 동물 여부 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
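
The imbalance percentages in these alerts are consistent with a score of 1 minus the normalized Shannon entropy of the category counts. The sketch below assumes that definition and a 0.5 alert threshold; both are inferred from the numbers rather than stated in the report, and `imbalance_score` is a hypothetical helper. It is applied to 계명(영문), whose three category counts appear in full in this report and which is not flagged.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits) of the counts."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Animalia, Plantae, Fungi counts from the 계명(영문) section.
kingdom_counts = [2723, 1996, 364]
score = imbalance_score(kingdom_counts)

ALERT_THRESHOLD = 0.5  # assumed threshold, not stated in the report
print(f"score = {score:.1%}, flagged = {score > ALERT_THRESHOLD}")
```

A perfectly balanced column scores 0 and a column dominated by one value approaches 1, so the three-way kingdom split (roughly 19% under this definition) stays well below the assumed threshold.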

Reproduction

Analysis started: 2024-05-18 07:09:53.726349
Analysis finished: 2024-05-18 07:09:59.352290
Duration: 5.63 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
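
The reported duration can be recomputed from the two timestamps with the standard library. This is only a consistency check on the figures above.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-05-18 07:09:53.726349", FMT)
finished = datetime.strptime("2024-05-18 07:09:59.352290", FMT)

# Difference between the two timestamps, in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```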

Variables

종코드
Text

UNIQUE 

Distinct: 5083
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 39.8 KiB

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Characters and Unicode

Total characters: 25415
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
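
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories for the five sample 종코드 values listed in this report. Script and block lookups are not part of the standard library and are left out.

```python
import unicodedata
from collections import Counter

# The five sample values shown for 종코드.
sample_codes = ["s0001", "s0002", "s0003", "s0004", "s0005"]

# Count the Unicode general category of every character.
category_counts = Counter(
    unicodedata.category(ch) for code in sample_codes for ch in code
)
print(category_counts)
```

Here `Nd` is Decimal Number and `Ll` is Lowercase Letter, the two categories the report lists for this variable.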

Unique

Unique: 5083
Unique (%): 100.0%

Sample

1st row: s0001
2nd row: s0002
3rd row: s0003
4th row: s0004
5th row: s0005

Value | Count | Frequency (%)
s0001 | 1 | < 0.1%
s3573 | 1 | < 0.1%
s3582 | 1 | < 0.1%
s3581 | 1 | < 0.1%
s3580 | 1 | < 0.1%
s3579 | 1 | < 0.1%
s3577 | 1 | < 0.1%
s3576 | 1 | < 0.1%
s3586 | 1 | < 0.1%
s3572 | 1 | < 0.1%
Other values (5073) | 5073 | 99.8%

Most occurring characters

ValueCountFrequency (%)
s 5083
20.0%
2 2578
10.1%
1 2539
10.0%
0 2503
9.8%
3 2480
9.8%
4 2455
9.7%
5 1834
 
7.2%
7 1497
 
5.9%
6 1483
 
5.8%
8 1482
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 20332
80.0%
Lowercase Letter 5083
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 2578
12.7%
1 2539
12.5%
0 2503
12.3%
3 2480
12.2%
4 2455
12.1%
5 1834
9.0%
7 1497
7.4%
6 1483
7.3%
8 1482
7.3%
9 1481
7.3%
Lowercase Letter
ValueCountFrequency (%)
s 5083
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 20332
80.0%
Latin 5083
 
20.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 2578
12.7%
1 2539
12.5%
0 2503
12.3%
3 2480
12.2%
4 2455
12.1%
5 1834
9.0%
7 1497
7.4%
6 1483
7.3%
8 1482
7.3%
9 1481
7.3%
Latin
ValueCountFrequency (%)
s 5083
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25415
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 5083
20.0%
2 2578
10.1%
1 2539
10.0%
0 2503
9.8%
3 2480
9.8%
4 2455
9.7%
5 1834
 
7.2%
7 1497
 
5.9%
6 1483
 
5.8%
8 1482
 
5.8%

국명
Text

Distinct: 5068
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Memory size: 39.8 KiB

Length

Max length: 14
Median length: 12
Mean length: 5.3303167
Min length: 1

Characters and Unicode

Total characters: 27094
Distinct characters: 708
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5057
Unique (%): 99.5%
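
Distinct and Unique measure different things here: Distinct appears to count every different value, while Unique counts only values that occur exactly once. A small sketch of the distinction under that reading follows. The helper name and the toy list are illustrative and not taken from the dataset; the last lines apply the same logic to the figures reported for 국명.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique): different values, and values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Toy data: "b" and "c" repeat, so only "a" and "d" are unique.
toy = ["a", "b", "b", "c", "c", "c", "d"]
print(distinct_and_unique(toy))

# Figures reported for 국명 (no missing values).
observations, distinct, unique = 5083, 5068, 5057
repeated_values = distinct - unique      # values that occur more than once
repeated_rows = observations - unique    # rows holding those repeated values
print(repeated_values, repeated_rows)
```

Under this reading, a small number of names account for all the repetition in 국명, which is why Distinct and Unique differ only slightly.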

Sample

1st row: 가는가래
2nd row: 가는갈퀴
3rd row: 가는개여뀌
4th row: 가는괴불주머니
5th row: 가는금강아지

Value | Count | Frequency (%)
모기류 | 4 | 0.1%
소금쟁이류 | 3 | 0.1%
잎벌레류 | 3 | 0.1%
구주개밀 | 2 | < 0.1%
병아리꽃나무 | 2 | < 0.1%
꽃등에과류 | 2 | < 0.1%
주엽나무 | 2 | < 0.1%
플라나리아류 | 2 | < 0.1%
실잠자리류 | 2 | < 0.1%
애날도래류 | 2 | < 0.1%
Other values (5056) | 5060 | 99.5%

Most occurring characters

ValueCountFrequency (%)
1399
 
5.2%
871
 
3.2%
817
 
3.0%
670
 
2.5%
652
 
2.4%
484
 
1.8%
400
 
1.5%
384
 
1.4%
363
 
1.3%
362
 
1.3%
Other values (698) 20692
76.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27049
99.8%
Space Separator 17
 
0.1%
Other Punctuation 11
 
< 0.1%
Close Punctuation 7
 
< 0.1%
Open Punctuation 7
 
< 0.1%
Lowercase Letter 1
 
< 0.1%
Decimal Number 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1399
 
5.2%
871
 
3.2%
817
 
3.0%
670
 
2.5%
652
 
2.4%
484
 
1.8%
400
 
1.5%
384
 
1.4%
363
 
1.3%
362
 
1.3%
Other values (691) 20647
76.3%
Space Separator
ValueCountFrequency (%)
17
100.0%
Other Punctuation
ValueCountFrequency (%)
? 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Lowercase Letter
ValueCountFrequency (%)
f 1
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27049
99.8%
Common 44
 
0.2%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1399
 
5.2%
871
 
3.2%
817
 
3.0%
670
 
2.5%
652
 
2.4%
484
 
1.8%
400
 
1.5%
384
 
1.4%
363
 
1.3%
362
 
1.3%
Other values (691) 20647
76.3%
Common
ValueCountFrequency (%)
17
38.6%
? 11
25.0%
) 7
15.9%
( 7
15.9%
2 1
 
2.3%
- 1
 
2.3%
Latin
ValueCountFrequency (%)
f 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27049
99.8%
ASCII 45
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1399
 
5.2%
871
 
3.2%
817
 
3.0%
670
 
2.5%
652
 
2.4%
484
 
1.8%
400
 
1.5%
384
 
1.4%
363
 
1.3%
362
 
1.3%
Other values (691) 20647
76.3%
ASCII
ValueCountFrequency (%)
17
37.8%
? 11
24.4%
) 7
15.6%
( 7
15.6%
f 1
 
2.2%
2 1
 
2.2%
- 1
 
2.2%

학명
Text

Distinct: 5057
Distinct (%): 99.8%
Missing: 14
Missing (%): 0.3%
Memory size: 39.8 KiB

Length

Max length: 77
Median length: 62
Mean length: 28.124088
Min length: 5

Characters and Unicode

Total characters: 142561
Distinct characters: 75
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
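
The length statistics appear to be computed over non-missing values only. Under that assumption, the reported mean length of 학명 can be recovered from the total character count and the number of non-missing observations. The variable names are illustrative.

```python
# Figures reported for 학명.
observations = 5083
missing = 14
total_characters = 142561

# Assumption: missing values are excluded before averaging.
non_missing = observations - missing
mean_length = total_characters / non_missing
print(f"{mean_length:.6f}")
```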

Unique

Unique: 5045
Unique (%): 99.5%

Sample

1st row: Potamogeton cristatus Regel 4984
2nd row: Vicia angustifolia L. var. minor (Bertol.) Ohwi
3rd row: Persicaria trigonocarpa (Makino) Nakai
4th row: Corydalis ochotensis Turcz. var. raddeana Nakai
5th row: Setaria pallidefusca (Schumach.) Stapfe

Value | Count | Frequency (%)
l | 408 | 2.4%
var | 302 | 1.8%
fr | 226 | 1.3%
nakai | 187 | 1.1%
1 | 153 | 0.9%
sp | 145 | 0.9%
japonica | 138 | 0.8%
butler | 131 | 0.8%
maxim | 116 | 0.7%
thunb | 109 | 0.6%
Other values (7589) | 15133 | 88.8%

Most occurring characters

ValueCountFrequency (%)
a 14372
 
10.1%
12055
 
8.5%
i 11280
 
7.9%
e 9396
 
6.6%
s 8594
 
6.0%
r 8562
 
6.0%
o 7348
 
5.2%
n 6636
 
4.7%
u 6632
 
4.7%
l 6466
 
4.5%
Other values (65) 51220
35.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 112135
78.7%
Space Separator 12055
 
8.5%
Uppercase Letter 10973
 
7.7%
Other Punctuation 2922
 
2.0%
Close Punctuation 2082
 
1.5%
Open Punctuation 2080
 
1.5%
Decimal Number 252
 
0.2%
Dash Punctuation 61
 
< 0.1%
Other Letter 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 14372
12.8%
i 11280
10.1%
e 9396
 
8.4%
s 8594
 
7.7%
r 8562
 
7.6%
o 7348
 
6.6%
n 6636
 
5.9%
u 6632
 
5.9%
l 6466
 
5.8%
t 5756
 
5.1%
Other values (16) 27093
24.2%
Uppercase Letter
ValueCountFrequency (%)
L 1062
 
9.7%
S 978
 
8.9%
M 974
 
8.9%
C 959
 
8.7%
P 858
 
7.8%
B 741
 
6.8%
A 736
 
6.7%
H 503
 
4.6%
F 503
 
4.6%
T 488
 
4.4%
Other values (16) 3171
28.9%
Decimal Number
ValueCountFrequency (%)
1 180
71.4%
8 19
 
7.5%
9 14
 
5.6%
7 11
 
4.4%
4 8
 
3.2%
5 5
 
2.0%
6 5
 
2.0%
2 5
 
2.0%
0 4
 
1.6%
3 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 2802
95.9%
: 92
 
3.1%
, 21
 
0.7%
? 5
 
0.2%
; 1
 
< 0.1%
& 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 2077
99.8%
] 5
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 2075
99.8%
[ 5
 
0.2%
Space Separator
ValueCountFrequency (%)
12055
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 61
100.0%
Other Letter
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 123108
86.4%
Common 19452
 
13.6%
Hangul 1
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 14372
 
11.7%
i 11280
 
9.2%
e 9396
 
7.6%
s 8594
 
7.0%
r 8562
 
7.0%
o 7348
 
6.0%
n 6636
 
5.4%
u 6632
 
5.4%
l 6466
 
5.3%
t 5756
 
4.7%
Other values (42) 38066
30.9%
Common
ValueCountFrequency (%)
12055
62.0%
. 2802
 
14.4%
) 2077
 
10.7%
( 2075
 
10.7%
1 180
 
0.9%
: 92
 
0.5%
- 61
 
0.3%
, 21
 
0.1%
8 19
 
0.1%
9 14
 
0.1%
Other values (12) 56
 
0.3%
Hangul
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 142560
> 99.9%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 14372
 
10.1%
12055
 
8.5%
i 11280
 
7.9%
e 9396
 
6.6%
s 8594
 
6.0%
r 8562
 
6.0%
o 7348
 
5.2%
n 6636
 
4.7%
u 6632
 
4.7%
l 6466
 
4.5%
Other values (64) 51219
35.9%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

계명(영문)
Categorical

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 39.8 KiB

Animalia: 2723
Plantae: 1996
Fungi: 364

Length

Max length: 8
Median length: 8
Mean length: 7.3924848
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Plantae
2nd row: Plantae
3rd row: Plantae
4th row: Plantae
5th row: Plantae

Common Values

Value | Count | Frequency (%)
Animalia | 2723 | 53.6%
Plantae | 1996 | 39.3%
Fungi | 364 | 7.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
animalia | 2723 | 53.6%
plantae | 1996 | 39.3%
fungi | 364 | 7.2%

계명(국문)
Categorical

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 39.8 KiB

동물계: 2723
식물계: 1996
균계: 364

Length

Max length: 3
Median length: 3
Mean length: 2.9283887
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 식물계
2nd row: 식물계
3rd row: 식물계
4th row: 식물계
5th row: 식물계

Common Values

Value | Count | Frequency (%)
동물계 | 2723 | 53.6%
식물계 | 1996 | 39.3%
균계 | 364 | 7.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
동물계 | 2723 | 53.6%
식물계 | 1996 | 39.3%
균계 | 364 | 7.2%

문명(영문)
Categorical

IMBALANCE 

Distinct: 14
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 39.8 KiB

Arthropoda: 2324
Magnoliophyta: 1880
Chordata: 361
Basidiomycota: 346
Filicophyta: 58
Other values (9): 114

Length

Max length: 15
Median length: 14
Mean length: 11.166634
Min length: 8

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: Magnoliophyta
2nd row: Magnoliophyta
3rd row: Magnoliophyta
4th row: Magnoliophyta
5th row: Magnoliophyta

Common Values

Value | Count | Frequency (%)
Arthropoda | 2324 | 45.7%
Magnoliophyta | 1880 | 37.0%
Chordata | 361 | 7.1%
Basidiomycota | 346 | 6.8%
Filicophyta | 58 | 1.1%
Pinophyta | 52 | 1.0%
Ascomycota | 17 | 0.3%
Mollusca | 17 | 0.3%
Annelida | 16 | 0.3%
Sphenophyta | 4 | 0.1%
Other values (4) | 8 | 0.2%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
arthropoda | 2324 | 45.7%
magnoliophyta | 1880 | 37.0%
chordata | 361 | 7.1%
basidiomycota | 346 | 6.8%
filicophyta | 58 | 1.1%
pinophyta | 52 | 1.0%
ascomycota | 17 | 0.3%
mollusca | 17 | 0.3%
annelida | 16 | 0.3%
sphenophyta | 4 | 0.1%
Other values (4) | 8 | 0.2%

문명(국문)
Categorical

IMBALANCE 

Distinct: 14
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 39.8 KiB

절지동물문: 2324
목련문: 1880
척색동물문: 361
담자균문: 346
고사리문: 58
Other values (9): 114

Length

Max length: 6
Median length: 5
Mean length: 4.1544364
Min length: 3

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: 목련문
2nd row: 목련문
3rd row: 목련문
4th row: 목련문
5th row: 목련문

Common Values

Value | Count | Frequency (%)
절지동물문 | 2324 | 45.7%
목련문 | 1880 | 37.0%
척색동물문 | 361 | 7.1%
담자균문 | 346 | 6.8%
고사리문 | 58 | 1.1%
구과문 | 52 | 1.0%
자낭균문 | 17 | 0.3%
연체동물문 | 17 | 0.3%
환형동물문 | 16 | 0.3%
속새문 | 4 | 0.1%
Other values (4) | 8 | 0.2%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
절지동물문 | 2324 | 45.7%
목련문 | 1880 | 37.0%
척색동물문 | 361 | 7.1%
담자균문 | 346 | 6.8%
고사리문 | 58 | 1.1%
구과문 | 52 | 1.0%
자낭균문 | 17 | 0.3%
연체동물문 | 17 | 0.3%
환형동물문 | 16 | 0.3%
속새문 | 4 | 0.1%
Other values (4) | 8 | 0.2%

강명(영문)
Categorical

IMBALANCE 

Distinct: 34
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 39.8 KiB

Insecta: 2204
Magnoliopsida: 1604
Eubasidiomycetes: 329
Liliopsida: 276
Aves: 224
Other values (29): 446

Length

Max length: 20
Median length: 16
Mean length: 9.8589416
Min length: 4

Unique

Unique: 7
Unique (%): 0.1%

Sample

1st row: Liliopsida
2nd row: Magnoliopsida
3rd row: Magnoliopsida
4th row: Magnoliopsida
5th row: Magnoliopsida

Common Values

Value | Count | Frequency (%)
Insecta | 2204 | 43.4%
Magnoliopsida | 1604 | 31.6%
Eubasidiomycetes | 329 | 6.5%
Liliopsida | 276 | 5.4%
Aves | 224 | 4.4%
Arachnida | 110 | 2.2%
Actinopterygii | 75 | 1.5%
Filicopsida | 56 | 1.1%
Coniferopsida | 50 | 1.0%
Mammalia | 30 | 0.6%
Other values (24) | 125 | 2.5%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
insecta | 2204 | 43.4%
magnoliopsida | 1604 | 31.6%
eubasidiomycetes | 329 | 6.5%
liliopsida | 276 | 5.4%
aves | 224 | 4.4%
arachnida | 110 | 2.2%
actinopterygii | 75 | 1.5%
filicopsida | 56 | 1.1%
coniferopsida | 50 | 1.0%
mammalia | 30 | 0.6%
Other values (24) | 125 | 2.5%

강명(국문)
Categorical

IMBALANCE 

Distinct: 33
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 39.8 KiB

곤충강: 2204
목련강: 1604
진정담자균강: 329
백합강: 276
조류강: 224
Other values (28): 446

Length

Max length: 6
Median length: 3
Mean length: 3.2130632
Min length: 3

Unique

Unique: 6
Unique (%): 0.1%

Sample

1st row: 백합강
2nd row: 목련강
3rd row: 목련강
4th row: 목련강
5th row: 목련강

Common Values

Value | Count | Frequency (%)
곤충강 | 2204 | 43.4%
목련강 | 1604 | 31.6%
진정담자균강 | 329 | 6.5%
백합강 | 276 | 5.4%
조류강 | 224 | 4.4%
거미강 | 110 | 2.2%
조기강 | 75 | 1.5%
고사리강 | 56 | 1.1%
구과강 | 50 | 1.0%
포유강 | 30 | 0.6%
Other values (23) | 125 | 2.5%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
곤충강 | 2204 | 43.4%
목련강 | 1604 | 31.6%
진정담자균강 | 329 | 6.5%
백합강 | 276 | 5.4%
조류강 | 224 | 4.4%
거미강 | 110 | 2.2%
조기강 | 75 | 1.5%
고사리강 | 56 | 1.1%
구과강 | 50 | 1.0%
포유강 | 30 | 0.6%
Other values (23) | 125 | 2.5%

목명(영문)
Text

Distinct: 152
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 39.8 KiB

Length

Max length: 17
Median length: 15
Mean length: 10.036396
Min length: 5

Characters and Unicode

Total characters: 51015
Distinct characters: 47
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 20
Unique (%): 0.4%

Sample

1st row: Najadales
2nd row: Fabales
3rd row: Polygonales
4th row: Nepenthales
5th row: Asterales

Value | Count | Frequency (%)
lepidoptera | 857 | 16.9%
coleoptera | 537 | 10.6%
asterales | 432 | 8.5%
diptera | 159 | 3.1%
hemiptera | 155 | 3.0%
hymenoptera | 152 | 3.0%
agaricales | 140 | 2.8%
aphyllophorales | 128 | 2.5%
araneae | 105 | 2.1%
passeriformes | 103 | 2.0%
Other values (142) | 2315 | 45.5%

Most occurring characters

ValueCountFrequency (%)
e 7856
15.4%
a 6186
12.1%
r 4472
8.8%
l 4037
 
7.9%
p 3905
 
7.7%
o 3700
 
7.3%
s 3569
 
7.0%
t 2970
 
5.8%
i 2900
 
5.7%
d 1113
 
2.2%
Other values (37) 10307
20.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 45932
90.0%
Uppercase Letter 5083
 
10.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 7856
17.1%
a 6186
13.5%
r 4472
9.7%
l 4037
8.8%
p 3905
8.5%
o 3700
8.1%
s 3569
7.8%
t 2970
 
6.5%
i 2900
 
6.3%
d 1113
 
2.4%
Other values (16) 5224
11.4%
Uppercase Letter
ValueCountFrequency (%)
L 1045
20.6%
A 965
19.0%
C 911
17.9%
H 401
 
7.9%
P 349
 
6.9%
R 213
 
4.2%
D 210
 
4.1%
F 204
 
4.0%
S 196
 
3.9%
O 149
 
2.9%
Other values (11) 440
8.7%

Most occurring scripts

ValueCountFrequency (%)
Latin 51015
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 7856
15.4%
a 6186
12.1%
r 4472
8.8%
l 4037
 
7.9%
p 3905
 
7.7%
o 3700
 
7.3%
s 3569
 
7.0%
t 2970
 
5.8%
i 2900
 
5.7%
d 1113
 
2.2%
Other values (37) 10307
20.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 51015
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 7856
15.4%
a 6186
12.1%
r 4472
8.8%
l 4037
 
7.9%
p 3905
 
7.7%
o 3700
 
7.3%
s 3569
 
7.0%
t 2970
 
5.8%
i 2900
 
5.7%
d 1113
 
2.2%
Other values (37) 10307
20.2%

목명(국문)
Text

Distinct: 156
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 39.8 KiB

Length

Max length: 8
Median length: 3
Mean length: 3.6494196
Min length: 2

Characters and Unicode

Total characters: 18550
Distinct characters: 214
Distinct categories: 4
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 25
Unique (%): 0.5%

Sample

1st row: 나자스말목
2nd row: 콩목
3rd row: 마디풀목
4th row: 벌레잡이풀목
5th row: 국화목

Value | Count | Frequency (%)
나비목 | 857 | 16.9%
딱정벌레목 | 537 | 10.6%
국화목 | 432 | 8.5%
파리목 | 159 | 3.1%
노린재목 | 155 | 3.0%
벌목 | 152 | 3.0%
주름버섯목 | 140 | 2.8%
민주름버섯목 | 128 | 2.5%
거미목 | 105 | 2.1%
참새목 | 103 | 2.0%
Other values (146) | 2315 | 45.5%

Most occurring characters

ValueCountFrequency (%)
5103
27.5%
1067
 
5.8%
922
 
5.0%
710
 
3.8%
558
 
3.0%
543
 
2.9%
543
 
2.9%
432
 
2.3%
432
 
2.3%
377
 
2.0%
Other values (204) 7863
42.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18540
99.9%
Lowercase Letter 5
 
< 0.1%
Decimal Number 4
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5103
27.5%
1067
 
5.8%
922
 
5.0%
710
 
3.8%
558
 
3.0%
543
 
2.9%
543
 
2.9%
432
 
2.3%
432
 
2.3%
377
 
2.0%
Other values (197) 7853
42.4%
Lowercase Letter
ValueCountFrequency (%)
n 2
40.0%
k 1
20.0%
o 1
20.0%
w 1
20.0%
Decimal Number
ValueCountFrequency (%)
0 3
75.0%
1 1
 
25.0%
Uppercase Letter
ValueCountFrequency (%)
U 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18540
99.9%
Latin 6
 
< 0.1%
Common 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5103
27.5%
1067
 
5.8%
922
 
5.0%
710
 
3.8%
558
 
3.0%
543
 
2.9%
543
 
2.9%
432
 
2.3%
432
 
2.3%
377
 
2.0%
Other values (197) 7853
42.4%
Latin
ValueCountFrequency (%)
n 2
33.3%
k 1
16.7%
U 1
16.7%
o 1
16.7%
w 1
16.7%
Common
ValueCountFrequency (%)
0 3
75.0%
1 1
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18540
99.9%
ASCII 10
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5103
27.5%
1067
 
5.8%
922
 
5.0%
710
 
3.8%
558
 
3.0%
543
 
2.9%
543
 
2.9%
432
 
2.3%
432
 
2.3%
377
 
2.0%
Other values (197) 7853
42.4%
ASCII
ValueCountFrequency (%)
0 3
30.0%
n 2
20.0%
k 1
 
10.0%
U 1
 
10.0%
o 1
 
10.0%
w 1
 
10.0%
1 1
 
10.0%

과명(영문)
Text

Distinct: 612
Distinct (%): 12.0%
Missing: 0
Missing (%): 0.0%
Memory size: 39.8 KiB

Length

Max length: 18
Median length: 16
Mean length: 10.674602
Min length: 6

Characters and Unicode

Total characters: 54259
Distinct characters: 50
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 187
Unique (%): 3.7%

Sample

1st row: Potamogetonaceae
2nd row: Leguminosae
3rd row: Polygonaceae
4th row: Papaveraceae
5th row: Gramineae

Value | Count | Frequency (%)
noctuidae | 203 | 4.0%
asteraceae | 202 | 4.0%
gramineae | 184 | 3.6%
pyralidae | 145 | 2.9%
geometridae | 120 | 2.4%
tortricidae | 104 | 2.0%
cyperaceae | 97 | 1.9%
rosaceae | 95 | 1.9%
chrysomelidae | 93 | 1.8%
leguminosae | 92 | 1.8%
Other values (598) | 3748 | 73.7%

Most occurring characters

ValueCountFrequency (%)
e 9442
17.4%
a 9175
16.9%
i 5402
10.0%
c 3427
 
6.3%
d 3198
 
5.9%
r 3050
 
5.6%
o 2500
 
4.6%
l 2208
 
4.1%
t 1787
 
3.3%
n 1577
 
2.9%
Other values (40) 12493
23.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 49177
90.6%
Uppercase Letter 5076
 
9.4%
Space Separator 6
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 9442
19.2%
a 9175
18.7%
i 5402
11.0%
c 3427
 
7.0%
d 3198
 
6.5%
r 3050
 
6.2%
o 2500
 
5.1%
l 2208
 
4.5%
t 1787
 
3.6%
n 1577
 
3.2%
Other values (16) 7411
15.1%
Uppercase Letter
ValueCountFrequency (%)
C 881
17.4%
A 574
11.3%
P 505
9.9%
G 418
8.2%
L 408
8.0%
T 354
7.0%
S 344
 
6.8%
N 308
 
6.1%
R 284
 
5.6%
H 137
 
2.7%
Other values (13) 863
17.0%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 54253
> 99.9%
Common 6
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 9442
17.4%
a 9175
16.9%
i 5402
10.0%
c 3427
 
6.3%
d 3198
 
5.9%
r 3050
 
5.6%
o 2500
 
4.6%
l 2208
 
4.1%
t 1787
 
3.3%
n 1577
 
2.9%
Other values (39) 12487
23.0%
Common
ValueCountFrequency (%)
6
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 54259
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 9442
17.4%
a 9175
16.9%
i 5402
10.0%
c 3427
 
6.3%
d 3198
 
5.9%
r 3050
 
5.6%
o 2500
 
4.6%
l 2208
 
4.1%
t 1787
 
3.3%
n 1577
 
2.9%
Other values (40) 12493
23.0%

과명(국문)
Text

Distinct: 609
Distinct (%): 12.0%
Missing: 6
Missing (%): 0.1%
Memory size: 39.8 KiB

Length

Max length: 9
Median length: 8
Mean length: 4.1366949
Min length: 2

Characters and Unicode

Total characters: 21002
Distinct characters: 417
Distinct categories: 4
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 190
Unique (%): 3.7%

Sample

1st row: 가래과
2nd row: 콩과
3rd row: 마디풀과
4th row: 양귀비과
5th row: 벼과

Value | Count | Frequency (%)
밤나방과 | 203 | 4.0%
국화과 | 203 | 4.0%
벼과 | 184 | 3.6%
명나방과 | 145 | 2.9%
자나방과 | 120 | 2.4%
잎말이나방과 | 104 | 2.0%
사초과 | 97 | 1.9%
장미과 | 95 | 1.9%
잎벌레과 | 93 | 1.8%
콩과 | 92 | 1.8%
Other values (594) | 3741 | 73.7%

Most occurring characters

ValueCountFrequency (%)
5079
24.2%
1204
 
5.7%
798
 
3.8%
464
 
2.2%
433
 
2.1%
423
 
2.0%
414
 
2.0%
375
 
1.8%
350
 
1.7%
349
 
1.7%
Other values (407) 11113
52.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20984
99.9%
Lowercase Letter 10
 
< 0.1%
Space Separator 6
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5079
24.2%
1204
 
5.7%
798
 
3.8%
464
 
2.2%
433
 
2.1%
423
 
2.0%
414
 
2.0%
375
 
1.8%
350
 
1.7%
349
 
1.7%
Other values (401) 11095
52.9%
Lowercase Letter
ValueCountFrequency (%)
n 4
40.0%
k 2
20.0%
w 2
20.0%
o 2
20.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Uppercase Letter
ValueCountFrequency (%)
U 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20983
99.9%
Latin 12
 
0.1%
Common 6
 
< 0.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5079
24.2%
1204
 
5.7%
798
 
3.8%
464
 
2.2%
433
 
2.1%
423
 
2.0%
414
 
2.0%
375
 
1.8%
350
 
1.7%
349
 
1.7%
Other values (400) 11094
52.9%
Latin
ValueCountFrequency (%)
n 4
33.3%
U 2
16.7%
k 2
16.7%
w 2
16.7%
o 2
16.7%
Common
ValueCountFrequency (%)
6
100.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20983
99.9%
ASCII 18
 
0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5079
24.2%
1204
 
5.7%
798
 
3.8%
464
 
2.2%
433
 
2.1%
423
 
2.0%
414
 
2.0%
375
 
1.8%
350
 
1.7%
349
 
1.7%
Other values (400) 11094
52.9%
ASCII
ValueCountFrequency (%)
6
33.3%
n 4
22.2%
U 2
 
11.1%
k 2
 
11.1%
w 2
 
11.1%
o 2
 
11.1%
CJK
ValueCountFrequency (%)
1
100.0%

국명_이명
Text

MISSING 

Distinct: 122
Distinct (%): 100.0%
Missing: 4961
Missing (%): 97.6%
Memory size: 39.8 KiB

Length

Max length: 13
Median length: 11
Mean length: 5.6639344
Min length: 1

Characters and Unicode

Total characters: 691
Distinct characters: 231
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 122
Unique (%): 100.0%

Sample

1st row: 가는잎금불초
2nd row: 호밀풀
3rd row: 나사백
4th row: 쉬땅나무
5th row: 개쉽싸리

Value | Count | Frequency (%)
노랑검정바구미 | 1 | 0.8%
아카시재목버섯 | 1 | 0.8%
불로초 | 1 | 0.8%
외날개하루살이 | 1 | 0.8%
작은고추잠자리 | 1 | 0.8%
장수측범잠자리 | 1 | 0.8%
어리나나니벌 | 1 | 0.8%
종이꽃낙엽버섯 | 1 | 0.8%
이끼패랭이버섯 | 1 | 0.8%
애딱부리긴노린재 | 1 | 0.8%
Other values (123) | 123 | 92.5%

Most occurring characters

ValueCountFrequency (%)
35
 
5.1%
27
 
3.9%
18
 
2.6%
18
 
2.6%
16
 
2.3%
15
 
2.2%
14
 
2.0%
14
 
2.0%
12
 
1.7%
11
 
1.6%
Other values (221) 511
74.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 668
96.7%
Other Punctuation 11
 
1.6%
Space Separator 11
 
1.6%
Modifier Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
5.2%
27
 
4.0%
18
 
2.7%
18
 
2.7%
16
 
2.4%
15
 
2.2%
14
 
2.1%
14
 
2.1%
12
 
1.8%
11
 
1.6%
Other values (218) 488
73.1%
Other Punctuation
ValueCountFrequency (%)
, 11
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 668
96.7%
Common 23
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
5.2%
27
 
4.0%
18
 
2.7%
18
 
2.7%
16
 
2.4%
15
 
2.2%
14
 
2.1%
14
 
2.1%
12
 
1.8%
11
 
1.6%
Other values (218) 488
73.1%
Common
ValueCountFrequency (%)
, 11
47.8%
11
47.8%
` 1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 668
96.7%
ASCII 23
 
3.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
35
 
5.2%
27
 
4.0%
18
 
2.7%
18
 
2.7%
16
 
2.4%
15
 
2.2%
14
 
2.1%
14
 
2.1%
12
 
1.8%
11
 
1.6%
Other values (218) 488
73.1%
ASCII
ValueCountFrequency (%)
, 11
47.8%
11
47.8%
` 1
 
4.3%

형태특성
Text

MISSING 

Distinct: 3849
Distinct (%): 100.0%
Missing: 1234
Missing (%): 24.3%
Memory size: 39.8 KiB

Length

Max length: 599
Median length: 303
Mean length: 138.95609
Min length: 12

Characters and Unicode

Total characters: 534842
Distinct characters: 998
Distinct categories: 14
Distinct scripts: 5
Distinct blocks: 9
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3849
Unique (%): 100.0%

Sample

1st row뿌리는 지하경이 옆으로 길게 뻗는다. 잎은 길이 4~6cm, 나비 0.7mm 정도이며 물 위에 뜬다. 열매는 둥글고 대가 있으며 단단한 살에 싸여 있고 뒷면에 돌기가 있다. 꽃은 5-9월에 피며 황록색이다.
2nd row높이 90cm 가량으로 잎은 어긋나며 잎자루가 짧으며 끝은 덩굴손이다. 4월에 잎 겨드랑이에 홍자색의 꽃이 핀다.
3rd row잎은 길이 3-7cm, 폭 4-8mm의 선상 피침형으로 수분이 없으면 갈색을 띤다. 어긋나기이고 엽병은 거의 없다. 가장자리 부근의 맥 위에 잔 복모가 난다. 꽃은 길이 2-2.2mm의 홍색꽃으로 가지의 끝에 수상화서로 달리고 길이는 2-3cm 정도 된다. 개화시기는 8-10월에 개화한다. 열매는 세모진 난형의 수과로 적갈색이고 길이는 2mm 정도이다.
4th row높이 1m 이상이다. 잎은 어긋나고 잎자루가 길며 3장의 작은 잎이 나온 모양이다. 꽃은 7~8월에 황색으로 피는데 총상화서를 이룬다. 열매는 삭과이고 검은 씨앗이 2줄로 들어 있다.
5th row줄기는 높이 30-70cm이고 곧게 선다. 뿌리는 뿌리줄기가 옆으로 뻗어 자라 군집을 이룬다. 근생엽과 밑부분의 잎은 꽃이 피면 스러지고 중앙부의 잎은 길이 4-9cm, 폭 6-10mm의 선상 피침형 또는 선형으로 끝이 뾰족하다. 꽃은 지름 18-25mm로 가지와 원줄기 끝에 하나씩 6-8월에 핀다. 설상화는 황색으로 길이는 10mm 정도이다. 열매는 원주형의 길이 1mm 정도의 수과로 8-9월에 익으며 관모는 길이 3mm 정도이다.
Value | Count | Frequency (%)
있다 | 3390 | 2.8%
길이 | 1639 | 1.4%
잎은 | 1529 | 1.3%
꽃은 | 1482 | 1.2%
열매는 | 1283 | 1.1%
털이 | 1054 | 0.9%
몸길이는 | 910 | 0.8%
있으며 | 823 | 0.7%
띤다 | 809 | 0.7%
있고 | 782 | 0.7%
Other values (17715) | 106356 | 88.6%

Most occurring characters

ValueCountFrequency (%)
116538
 
21.8%
24912
 
4.7%
. 17285
 
3.2%
17051
 
3.2%
12545
 
2.3%
12424
 
2.3%
12072
 
2.3%
m 9265
 
1.7%
9023
 
1.7%
8017
 
1.5%
Other values (988) 295710
55.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 350820
65.6%
Space Separator 116540
 
21.8%
Decimal Number 25558
 
4.8%
Other Punctuation 22115
 
4.1%
Lowercase Letter 12678
 
2.4%
Dash Punctuation 3800
 
0.7%
Math Symbol 2914
 
0.5%
Other Symbol 230
 
< 0.1%
Open Punctuation 62
 
< 0.1%
Close Punctuation 61
 
< 0.1%
Other values (4) 64
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24912
 
7.1%
17051
 
4.9%
12545
 
3.6%
12424
 
3.5%
12072
 
3.4%
9023
 
2.6%
8017
 
2.3%
7825
 
2.2%
7736
 
2.2%
7421
 
2.1%
Other values (910) 231794
66.1%
Lowercase Letter
ValueCountFrequency (%)
m 9265
73.1%
c 3062
 
24.2%
x 73
 
0.6%
a 35
 
0.3%
l 27
 
0.2%
e 24
 
0.2%
g 21
 
0.2%
r 20
 
0.2%
i 20
 
0.2%
k 17
 
0.1%
Other values (14) 114
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
V 9
16.7%
U 8
14.8%
T 7
13.0%
P 6
11.1%
K 3
 
5.6%
M 3
 
5.6%
X 3
 
5.6%
W 2
 
3.7%
Y 2
 
3.7%
I 2
 
3.7%
Other values (8) 9
16.7%
Decimal Number
ValueCountFrequency (%)
1 4726
18.5%
5 3974
15.5%
0 3398
13.3%
2 3274
12.8%
3 2682
10.5%
4 1787
 
7.0%
8 1648
 
6.4%
6 1631
 
6.4%
7 1472
 
5.8%
9 966
 
3.8%
Other Punctuation
ValueCountFrequency (%)
. 17285
78.2%
, 4472
 
20.2%
? 281
 
1.3%
/ 52
 
0.2%
17
 
0.1%
: 4
 
< 0.1%
! 2
 
< 0.1%
2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 2891
99.2%
× 19
 
0.7%
2
 
0.1%
< 1
 
< 0.1%
> 1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
87
37.8%
85
37.0%
58
25.2%
Space Separator
ValueCountFrequency (%)
116538
> 99.9%
  2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 60
96.8%
[ 2
 
3.2%
Close Punctuation
ValueCountFrequency (%)
) 60
98.4%
] 1
 
1.6%
Dash Punctuation
ValueCountFrequency (%)
- 3800
100.0%
Letter Number
ValueCountFrequency (%)
6
100.0%
Other Number
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 350714
65.6%
Common 171284
32.0%
Latin 12731
 
2.4%
Han 106
 
< 0.1%
Greek 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24912
 
7.1%
17051
 
4.9%
12545
 
3.6%
12424
 
3.5%
12072
 
3.4%
9023
 
2.6%
8017
 
2.3%
7825
 
2.2%
7736
 
2.2%
7421
 
2.1%
Other values (853) 231688
66.1%
Han
ValueCountFrequency (%)
12
 
11.3%
8
 
7.5%
7
 
6.6%
7
 
6.6%
5
 
4.7%
4
 
3.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
Other values (47) 53
50.0%
Latin
ValueCountFrequency (%)
m 9265
72.8%
c 3062
 
24.1%
x 73
 
0.6%
a 35
 
0.3%
l 27
 
0.2%
e 24
 
0.2%
g 21
 
0.2%
r 20
 
0.2%
i 20
 
0.2%
k 17
 
0.1%
Other values (32) 167
 
1.3%
Common
ValueCountFrequency (%)
116538
68.0%
. 17285
 
10.1%
1 4726
 
2.8%
, 4472
 
2.6%
5 3974
 
2.3%
- 3800
 
2.2%
0 3398
 
2.0%
2 3274
 
1.9%
~ 2891
 
1.7%
3 2682
 
1.6%
Other values (25) 8244
 
4.8%
Greek
ValueCountFrequency (%)
μ 7
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 350710
65.6%
ASCII 183735
34.4%
CJK Compat 230
 
< 0.1%
CJK 104
 
< 0.1%
None 47
 
< 0.1%
Number Forms 8
 
< 0.1%
Compat Jamo 4
 
< 0.1%
Math Operators 2
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
116538
63.4%
. 17285
 
9.4%
m 9265
 
5.0%
1 4726
 
2.6%
, 4472
 
2.4%
5 3974
 
2.2%
- 3800
 
2.1%
0 3398
 
1.8%
2 3274
 
1.8%
c 3062
 
1.7%
Other values (57) 13941
 
7.6%
Hangul
ValueCountFrequency (%)
24912
 
7.1%
17051
 
4.9%
12545
 
3.6%
12424
 
3.5%
12072
 
3.4%
9023
 
2.6%
8017
 
2.3%
7825
 
2.2%
7736
 
2.2%
7421
 
2.1%
Other values (849) 231684
66.1%
CJK Compat
ValueCountFrequency (%)
87
37.8%
85
37.0%
58
25.2%
None
ValueCountFrequency (%)
× 19
40.4%
17
36.2%
μ 7
 
14.9%
  2
 
4.3%
2
 
4.3%
CJK
ValueCountFrequency (%)
12
 
11.5%
8
 
7.7%
7
 
6.7%
7
 
6.7%
5
 
4.8%
4
 
3.8%
3
 
2.9%
3
 
2.9%
2
 
1.9%
2
 
1.9%
Other values (45) 51
49.0%
Number Forms
ValueCountFrequency (%)
6
75.0%
2
 
25.0%
Math Operators
ValueCountFrequency (%)
2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

생태특성
Text

MISSING 

Distinct3493
Distinct (%)92.0%
Missing1286
Missing (%)25.3%
Memory size39.8 KiB

Length

Max length355
Median length185
Mean length48.75981
Min length6

Characters and Unicode

Total characters185141
Distinct characters820
Distinct categories11
Distinct scripts4
Distinct blocks5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
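
As a rough illustration of how per-category counts like the ones in this section can be derived, the sketch below uses Python's standard `unicodedata` module. It is not the profiler's own code; `category_counts` is a hypothetical helper name, and the input is the first sample row of this variable.

```python
import unicodedata
from collections import Counter


def category_counts(texts):
    """Count characters by Unicode general category code (e.g. 'Lo', 'Zs')."""
    counts = Counter()
    for text in texts:
        for ch in text:
            # unicodedata.category returns a two-letter code such as
            # 'Lo' (Other Letter), 'Zs' (Space Separator), 'Po' (Other Punctuation).
            counts[unicodedata.category(ch)] += 1
    return counts


# First sample row of this variable, taken from the report.
sample = ["우리나라가 원산지이며 다년생 수초이다."]
counts = category_counts(sample)
```

Hangul syllables fall under `Lo`, which is why "Other Letter" dominates the category tables for the Korean text columns.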

Unique

Unique3383
Unique (%)89.1%

Sample

1st row우리나라가 원산지이며 다년생 수초이다.
2nd row제주도, 울릉도 등지에 분포하며 어린 순을 나물로 한다.
3rd row일년생 초본식물로 우리나라 곳곳에 분포한다.
4th row우리나라, 일본, 만주 등지에 분포하는 이년생 식물이다.
5th row다년생 초본식물로 우리나라와 일본, 중국, 러시아에 분포한다.
ValueCountFrequency (%)
분포한다 1675
 
4.1%
일본 1422
 
3.5%
중국 1140
 
2.8%
한국 1134
 
2.8%
등지에 1064
 
2.6%
우리나라 727
 
1.8%
성충은 690
 
1.7%
다년생 553
 
1.3%
주로 511
 
1.2%
자란다 474
 
1.2%
Other values (7531) 31787
77.2%

Most occurring characters

ValueCountFrequency (%)
37501
 
20.3%
, 7078
 
3.8%
6428
 
3.5%
6155
 
3.3%
. 5490
 
3.0%
4900
 
2.6%
3901
 
2.1%
3469
 
1.9%
3399
 
1.8%
3160
 
1.7%
Other values (810) 103660
56.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 130197
70.3%
Space Separator 37501
 
20.3%
Other Punctuation 12693
 
6.9%
Decimal Number 3610
 
1.9%
Math Symbol 712
 
0.4%
Lowercase Letter 179
 
0.1%
Dash Punctuation 146
 
0.1%
Uppercase Letter 52
 
< 0.1%
Open Punctuation 25
 
< 0.1%
Close Punctuation 25
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6428
 
4.9%
6155
 
4.7%
4900
 
3.8%
3901
 
3.0%
3469
 
2.7%
3399
 
2.6%
3160
 
2.4%
2920
 
2.2%
2733
 
2.1%
2311
 
1.8%
Other values (770) 90821
69.8%
Decimal Number
ValueCountFrequency (%)
0 642
17.8%
1 572
15.8%
6 476
13.2%
8 431
11.9%
5 409
11.3%
7 348
9.6%
9 233
 
6.5%
4 202
 
5.6%
2 183
 
5.1%
3 108
 
3.0%
Other values (3) 6
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
m 161
89.9%
c 5
 
2.8%
l 4
 
2.2%
e 2
 
1.1%
u 2
 
1.1%
h 1
 
0.6%
i 1
 
0.6%
o 1
 
0.6%
t 1
 
0.6%
s 1
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
I 46
88.5%
Q 1
 
1.9%
C 1
 
1.9%
B 1
 
1.9%
D 1
 
1.9%
O 1
 
1.9%
A 1
 
1.9%
Other Punctuation
ValueCountFrequency (%)
, 7078
55.8%
. 5490
43.3%
? 122
 
1.0%
3
 
< 0.1%
Space Separator
ValueCountFrequency (%)
37501
100.0%
Math Symbol
ValueCountFrequency (%)
~ 712
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 146
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Other Symbol
ValueCountFrequency (%)
° 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 130187
70.3%
Common 54713
29.6%
Latin 231
 
0.1%
Han 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6428
 
4.9%
6155
 
4.7%
4900
 
3.8%
3901
 
3.0%
3469
 
2.7%
3399
 
2.6%
3160
 
2.4%
2920
 
2.2%
2733
 
2.1%
2311
 
1.8%
Other values (762) 90811
69.8%
Common
ValueCountFrequency (%)
37501
68.5%
, 7078
 
12.9%
. 5490
 
10.0%
~ 712
 
1.3%
0 642
 
1.2%
1 572
 
1.0%
6 476
 
0.9%
8 431
 
0.8%
5 409
 
0.7%
7 348
 
0.6%
Other values (13) 1054
 
1.9%
Latin
ValueCountFrequency (%)
m 161
69.7%
I 46
 
19.9%
c 5
 
2.2%
l 4
 
1.7%
e 2
 
0.9%
u 2
 
0.9%
Q 1
 
0.4%
C 1
 
0.4%
h 1
 
0.4%
i 1
 
0.4%
Other values (7) 7
 
3.0%
Han
ValueCountFrequency (%)
2
20.0%
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 130180
70.3%
ASCII 54934
29.7%
None 10
 
< 0.1%
CJK 10
 
< 0.1%
Compat Jamo 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
37501
68.3%
, 7078
 
12.9%
. 5490
 
10.0%
~ 712
 
1.3%
0 642
 
1.2%
1 572
 
1.0%
6 476
 
0.9%
8 431
 
0.8%
5 409
 
0.7%
7 348
 
0.6%
Other values (25) 1275
 
2.3%
Hangul
ValueCountFrequency (%)
6428
 
4.9%
6155
 
4.7%
4900
 
3.8%
3901
 
3.0%
3469
 
2.7%
3399
 
2.6%
3160
 
2.4%
2920
 
2.2%
2733
 
2.1%
2311
 
1.8%
Other values (759) 90804
69.8%
Compat Jamo
ValueCountFrequency (%)
5
71.4%
1
 
14.3%
1
 
14.3%
None
ValueCountFrequency (%)
3
30.0%
3
30.0%
2
20.0%
° 1
 
10.0%
1
 
10.0%
CJK
ValueCountFrequency (%)
2
20.0%
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

서울시보호종여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
5034 
서울시 보호종
 
49

Length

Max length7
Median length4
Mean length4.0289199
Min length4
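
The length statistics are consistent with `<NA>` being stored as a literal four-character string rather than as a true missing value, which would also explain why `Missing` is 0. This is an inference from the numbers, not something the report states. A quick check using the two counts reported for this variable:

```python
# Value counts reported for this variable.
value_counts = {"<NA>": 5034, "서울시 보호종": 49}

total = sum(value_counts.values())

# Mean string length, weighting each value's length by its count.
mean_length = sum(len(value) * n for value, n in value_counts.items()) / total
min_length = min(len(value) for value in value_counts)
max_length = max(len(value) for value in value_counts)
```

The result matches the reported minimum (4), maximum (7), and mean (about 4.0289), so any downstream analysis would likely need to convert the `<NA>` strings to real nulls first.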

Unique

Unique0
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5034
99.0%
서울시 보호종 49
 
1.0%
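
The IMBALANCE flag reflects how unevenly these two values are distributed. One common way to quantify this is one minus the Shannon entropy normalized by its maximum. Whether the profiler uses exactly this formula is an assumption here, and `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform split, near 1 for a dominant class."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no split to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts from the Common Values table above.
score = imbalance_score([5034, 49])
```

For these counts the score comes out at roughly 0.92, in line with the "highly imbalanced" alert listed for this variable in the report overview.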

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 5034
98.1%
서울시 49
 
1.0%
보호종 49
 
1.0%
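
The word table above appears to come from lowercasing each value, splitting on whitespace, and stripping surrounding punctuation, which would explain why `<NA>` shows up as `na`. The sketch below reproduces the counts under that assumption. It is not the profiler's implementation, and `word_frequencies` is a hypothetical helper name.

```python
import re
from collections import Counter


def word_frequencies(value_counts):
    """Aggregate per-word counts from a mapping of value -> occurrence count."""
    words = Counter()
    for value, n in value_counts.items():
        for token in value.lower().split():
            # Strip leading/trailing non-word characters, e.g. "<na>" -> "na".
            token = re.sub(r"^\W+|\W+$", "", token)
            if token:
                words[token] += n
    return words


freq = word_frequencies({"<NA>": 5034, "서울시 보호종": 49})
share_na = freq["na"] / sum(freq.values())
```

Because the denominator is the number of words rather than rows, the share of `na` here is about 98.1%, not the 99.0% shown in the Common Values table.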

고유종 여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
4863 
고유종
 
220

Length

Max length4
Median length4
Mean length3.9567185
Min length3

Unique

Unique0
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 4863
95.7%
고유종 220
 
4.3%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 4863
95.7%
고유종 220
 
4.3%

멸종위기야생동식물 여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
5040 
멸종위기 야생 동식물
 
43

Length

Max length11
Median length4
Mean length4.059217
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5040
99.2%
멸종위기 야생 동식물 43
 
0.8%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 5040
97.5%
멸종위기 43
 
0.8%
야생 43
 
0.8%
동식물 43
 
0.8%

먹는자 처벌대상 야생동식물 여부
Categorical

IMBALANCE

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
5061 
먹는자 처벌대상 야생 동식물
 
22

Length

Max length15
Median length4
Mean length4.0476097
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5061
99.6%
먹는자 처벌대상 야생 동식물 22
 
0.4%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 5061
98.3%
먹는자 22
 
0.4%
처벌대상 22
 
0.4%
야생 22
 
0.4%
동식물 22
 
0.4%

포획금지 야생동식물 여부
Categorical

IMBALANCE

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
4855 
포획금지 야생 동식물
 
228

Length

Max length11
Median length4
Mean length4.3139878
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 4855
95.5%
포획금지 야생 동식물 228
 
4.5%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 4855
87.7%
포획금지 228
 
4.1%
야생 228
 
4.1%
동식물 228
 
4.1%

인공증식을 위한 포획허가대상 야생동물 여부
Categorical

IMBALANCE

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
5071 
인공증식을 위한 포획허가대상 야생동물
 
12

Length

Max length20
Median length4
Mean length4.037773
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5071
99.8%
인공증식을 위한 포획허가대상 야생동물 12
 
0.2%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 5071
99.1%
인공증식을 12
 
0.2%
위한 12
 
0.2%
포획허가대상 12
 
0.2%
야생동물 12
 
0.2%

유해야생동물 여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
5071 
유해 야생 동물
 
12

Length

Max length8
Median length4
Mean length4.0094432
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5071
99.8%
유해 야생 동물 12
 
0.2%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 5071
99.3%
유해 12
 
0.2%
야생 12
 
0.2%
동물 12
 
0.2%

수렵동물 여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
5068 
수렵 동물
 
15

Length

Max length5
Median length4
Mean length4.002951
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5068
99.7%
수렵 동물 15
 
0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 5068
99.4%
수렵 15
 
0.3%
동물 15
 
0.3%

생태계교란 야생동식물 여부
Categorical

IMBALANCE

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
5077 
생태계 교란 야생 동식물
 
6

Length

Max length13
Median length4
Mean length4.0106236
Min length4

Unique

Unique0
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5077
99.9%
생태계 교란 야생 동식물 6
 
0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 5077
99.5%
생태계 6
 
0.1%
교란 6
 
0.1%
야생 6
 
0.1%
동식물 6
 
0.1%

야생화된 동물 여부
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5083
Missing (%)100.0%
Memory size44.8 KiB

수출입 허가대상 야생동물 여부
Categorical

IMBALANCE

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
4835 
수출입 허가대상 야생동물
 
248

Length

Max length13
Median length4
Mean length4.4391108
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 4835
95.1%
수출입 허가대상 야생동물 248
 
4.9%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 4835
86.7%
수출입 248
 
4.4%
허가대상 248
 
4.4%
야생동물 248
 
4.4%

국외반출승인대상생물자원 여부
Categorical

IMBALANCE

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
4992 
국외반출 승인대상 생물자원
 
91

Length

Max length14
Median length4
Mean length4.1790281
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 4992
98.2%
국외반출 승인대상 생물자원 91
 
1.8%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 4992
94.8%
국외반출 91
 
1.7%
승인대상 91
 
1.7%
생물자원 91
 
1.7%

국제적멸종위기종 여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
5070 
국제적 멸종위기종
 
13

Length

Max length9
Median length4
Mean length4.0127877
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5070
99.7%
국제적 멸종위기종 13
 
0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 5070
99.5%
국제적 13
 
0.3%
멸종위기종 13
 
0.3%

외래식물 여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.8 KiB
<NA>
4903 
외래식물
 
180

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 4903
96.5%
외래식물 180
 
3.5%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 4903
96.5%
외래식물 180
 
3.5%

천연기념물 지정정보
Text

MISSING

Distinct8
Distinct (%)50.0%
Missing5067
Missing (%)99.7%
Memory size39.8 KiB

Length

Max length10
Median length10
Mean length9.4375
Min length1

Characters and Unicode

Total characters151
Distinct characters14
Distinct categories3
Distinct scripts2
Distinct blocks2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4
Unique (%)25.0%

Sample

1st row천연기념물 323호
2nd row천연기념물 325호
3rd row천연기념물 201호
4th row천연기념물 453호
5th row천연기념물 243호
ValueCountFrequency (%)
천연기념물 15
48.4%
323호 4
 
12.9%
324호 4
 
12.9%
201호 2
 
6.5%
243호 2
 
6.5%
325호 1
 
3.2%
453호 1
 
3.2%
327호 1
 
3.2%
1 1
 
3.2%

Most occurring characters

ValueCountFrequency (%)
3 17
11.3%
15
9.9%
15
9.9%
15
9.9%
15
9.9%
15
9.9%
15
9.9%
15
9.9%
2 14
9.3%
4 7
4.6%
Other values (4) 8
5.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 90
59.6%
Decimal Number 46
30.5%
Space Separator 15
 
9.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 17
37.0%
2 14
30.4%
4 7
15.2%
1 3
 
6.5%
0 2
 
4.3%
5 2
 
4.3%
7 1
 
2.2%
Other Letter
ValueCountFrequency (%)
15
16.7%
15
16.7%
15
16.7%
15
16.7%
15
16.7%
15
16.7%
Space Separator
ValueCountFrequency (%)
15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 90
59.6%
Common 61
40.4%

Most frequent character per script

Common
ValueCountFrequency (%)
3 17
27.9%
15
24.6%
2 14
23.0%
4 7
11.5%
1 3
 
4.9%
0 2
 
3.3%
5 2
 
3.3%
7 1
 
1.6%
Hangul
ValueCountFrequency (%)
15
16.7%
15
16.7%
15
16.7%
15
16.7%
15
16.7%
15
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 90
59.6%
ASCII 61
40.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 17
27.9%
15
24.6%
2 14
23.0%
4 7
11.5%
1 3
 
4.9%
0 2
 
3.3%
5 2
 
3.3%
7 1
 
1.6%
Hangul
ValueCountFrequency (%)
15
16.7%
15
16.7%
15
16.7%
15
16.7%
15
16.7%
15
16.7%

Sample

First rows

(index) | 종코드 | 국명 | 학명 | 계명(영문) | 계명(국문) | 문명(영문) | 문명(국문) | 강명(영문) | 강명(국문) | 목명(영문) | 목명(국문) | 과명(영문) | 과명(국문) | 국명_이명 | 형태특성 | 생태특성 | 서울시보호종여부 | 고유종 여부 | 멸종위기야생동식물 여부 | 먹는자 처벌대상 야생동식물 여부 | 포획금지 야생동식물 여부 | 인공증식을 위한 포획허가대상 야생동물 여부 | 유해야생동물 여부 | 수렵동물 여부 | 생태계교란 야생동식물 여부 | 야생화된 동물 여부 | 수출입 허가대상 야생동물 여부 | 국외반출승인대상생물자원 여부 | 국제적멸종위기종 여부 | 외래식물 여부 | 천연기념물 지정정보
0 | s0001 | 가는가래 | Potamogeton cristatus Regel 4984 | Plantae | 식물계 | Magnoliophyta | 목련문 | Liliopsida | 백합강 | Najadales | 나자스말목 | Potamogetonaceae | 가래과 | <NA> | 뿌리는 지하경이 옆으로 길게 뻗는다. 잎은 길이 4~6cm, 나비 0.7mm 정도이며 물 위에 뜬다. 열매는 둥글고 대가 있으며 단단한 살에 싸여 있고 뒷면에 돌기가 있다. 꽃은 5-9월에 피며 황록색이다. | 우리나라가 원산지이며 다년생 수초이다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
1 | s0002 | 가는갈퀴 | Vicia angustifolia L. var. minor (Bertol.) Ohwi | Plantae | 식물계 | Magnoliophyta | 목련문 | Magnoliopsida | 목련강 | Fabales | 콩목 | Leguminosae | 콩과 | <NA> | 높이 90cm 가량으로 잎은 어긋나며 잎자루가 짧으며 끝은 덩굴손이다. 4월에 잎 겨드랑이에 홍자색의 꽃이 핀다. | 제주도, 울릉도 등지에 분포하며 어린 순을 나물로 한다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
2 | s0003 | 가는개여뀌 | Persicaria trigonocarpa (Makino) Nakai | Plantae | 식물계 | Magnoliophyta | 목련문 | Magnoliopsida | 목련강 | Polygonales | 마디풀목 | Polygonaceae | 마디풀과 | <NA> | 잎은 길이 3-7cm, 폭 4-8mm의 선상 피침형으로 수분이 없으면 갈색을 띤다. 어긋나기이고 엽병은 거의 없다. 가장자리 부근의 맥 위에 잔 복모가 난다. 꽃은 길이 2-2.2mm의 홍색꽃으로 가지의 끝에 수상화서로 달리고 길이는 2-3cm 정도 된다. 개화시기는 8-10월에 개화한다. 열매는 세모진 난형의 수과로 적갈색이고 길이는 2mm 정도이다. | 일년생 초본식물로 우리나라 곳곳에 분포한다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
3 | s0004 | 가는괴불주머니 | Corydalis ochotensis Turcz. var. raddeana Nakai | Plantae | 식물계 | Magnoliophyta | 목련문 | Magnoliopsida | 목련강 | Nepenthales | 벌레잡이풀목 | Papaveraceae | 양귀비과 | <NA> | 높이 1m 이상이다. 잎은 어긋나고 잎자루가 길며 3장의 작은 잎이 나온 모양이다. 꽃은 7~8월에 황색으로 피는데 총상화서를 이룬다. 열매는 삭과이고 검은 씨앗이 2줄로 들어 있다. | 우리나라, 일본, 만주 등지에 분포하는 이년생 식물이다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
4 | s0005 | 가는금강아지 | Setaria pallidefusca (Schumach.) Stapfe | Plantae | 식물계 | Magnoliophyta | 목련문 | Magnoliopsida | 목련강 | Asterales | 국화목 | Gramineae | 벼과 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
5 | s0006 | 가는금불초 | Inula britannica L. var. linariaefolia Regel | Plantae | 식물계 | Magnoliophyta | 목련문 | Magnoliopsida | 목련강 | Asterales | 국화목 | Asteraceae | 국화과 | 가는잎금불초 | 줄기는 높이 30-70cm이고 곧게 선다. 뿌리는 뿌리줄기가 옆으로 뻗어 자라 군집을 이룬다. 근생엽과 밑부분의 잎은 꽃이 피면 스러지고 중앙부의 잎은 길이 4-9cm, 폭 6-10mm의 선상 피침형 또는 선형으로 끝이 뾰족하다. 꽃은 지름 18-25mm로 가지와 원줄기 끝에 하나씩 6-8월에 핀다. 설상화는 황색으로 길이는 10mm 정도이다. 열매는 원주형의 길이 1mm 정도의 수과로 8-9월에 익으며 관모는 길이 3mm 정도이다. | 다년생 초본식물로 우리나라와 일본, 중국, 러시아에 분포한다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
6 | s0007 | 가는기름나물 | Peucedanum elegans Kom. | Plantae | 식물계 | Magnoliophyta | 목련문 | Magnoliopsida | 목련강 | Apiales | 산형목 | Umbelliferae | 산형과 | <NA> | 줄기 표면이 밋밋하며 높이가 90cm에 달한다. 뿌리잎은 밑부분이 넓어 본 줄기를 감싸안은 모양이다. 타원형의 열매는 분과이다. 흰색 꽃은 복산형화서로서 7~8월에 개화한다. 총포가 없다. | 다년초이고 우리나라 북부, 압록강 연안, 백두산 지역에서 자란다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
7 | s0008 | 가는기린초 | Sedum aizoon L. | Plantae | 식물계 | Magnoliophyta | 목련문 | Magnoliopsida | 목련강 | Rosales | 장미목 | Crassulaceae | 돌나물과 | <NA> | 줄기는 높이 20~50cm이다. 잎은 피침형이거나 타원형이고 길이 3~6cm, 폭 7~15mm이며 털이 없다. 꽃은 7~8월에 피고 지름 10~13mm이며 취산화서에 많은 꽃이 달린다. | 우리나라 전국 각처의 산지에서 자라는 다년생 초본식물이다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
8 | s0009 | 가는꽃녹슬은방아벌레 | Agrypnus fuliginosus (Candeze) | Animalia | 동물계 | Arthropoda | 절지동물문 | Insecta | 곤충강 | Coleoptera | 딱정벌레목 | Elateridae | 방아벌레과 | <NA> | 몸의 길이는 13~20mm정도이고 몸 색깔은 적갈색이다. 겉날개 옆부분에 연한 갈색 점이 있다. | 한국, 일본 등지에 분포한다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
9 | s0010 | 가는무늬밑빠진벌레 | Cryptarcha strigata (Fabricius) | Animalia | 동물계 | Arthropoda | 절지동물문 | Insecta | 곤충강 | Coleoptera | 딱정벌레목 | Nitidulidae | 밑빠진벌레과 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>

Last rows

(index) | 종코드 | 국명 | 학명 | 계명(영문) | 계명(국문) | 문명(영문) | 문명(국문) | 강명(영문) | 강명(국문) | 목명(영문) | 목명(국문) | 과명(영문) | 과명(국문) | 국명_이명 | 형태특성 | 생태특성 | 서울시보호종여부 | 고유종 여부 | 멸종위기야생동식물 여부 | 먹는자 처벌대상 야생동식물 여부 | 포획금지 야생동식물 여부 | 인공증식을 위한 포획허가대상 야생동물 여부 | 유해야생동물 여부 | 수렵동물 여부 | 생태계교란 야생동식물 여부 | 야생화된 동물 여부 | 수출입 허가대상 야생동물 여부 | 국외반출승인대상생물자원 여부 | 국제적멸종위기종 여부 | 외래식물 여부 | 천연기념물 지정정보
5073 | s5324 | 파리매류 | Asilidae sp. | Animalia | 동물계 | Arthropoda | 절지동물문 | Insecta | 곤충강 | Diptera | 파리목 | Asilidae | 파리매과 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
5074 | s5325 | 팔랑나비류 | Hesperiidae sp. | Animalia | 동물계 | Arthropoda | 절지동물문 | Insecta | 곤충강 | Lepidoptera | 나비목 | Hesperiidae | 팔랑나비과 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
5075 | s5326 | 한라푸른부전나비 | Udara dilecta | Animalia | 동물계 | Arthropoda | 절지동물문 | Insecta | 곤충강 | Lepidoptera | 나비목 | Lycaenidae | 부전나비과 | <NA> | 수컷은 날개 윗면이 밝은 남색이며, 뒷날개 제5~6실 부근에 백색 무늬가 있다. 암컷은 날개 외연이 흑갈색이며 기부(基部)에 청백색의 무늬가 나타난다. | 해발고도 1700m 이상인 한라산의 풀밭에서 산다. 한국,중국 서남부,일본,타이완,네팔,필리핀 등에 분포한다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
5076 | s5327 | 햇사초 | Carex pseudo-chinensis | Plantae | 식물계 | Magnoliophyta | 목련문 | Liliopsida | 백합강 | Cyperales | 사초목 | Cyperaceae | 사초과 | <NA> | <NA> | <NA> | <NA> | 고유종 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
5077 | s5328 | 허리노린재류 | Coreidae sp. | Animalia | 동물계 | Arthropoda | 절지동물문 | Insecta | 곤충강 | Hemiptera | 노린재목 | Coreidae | 허리노린재과 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
5078 | s5329 | 호랑무늬파리매 | Astochia virgatipes Coquillett | Animalia | 동물계 | Arthropoda | 절지동물문 | Insecta | 곤충강 | Diptera | 파리목 | Asilidae | 파리매과 | <NA> | 몸길이가 약 19~24mm이고 바탕색은 대체로 황갈색이다. 더듬이는 검고 가늘며 끝으로 갈수록 가늘어지며 털이 없습니다. 6개의 배마디 마디 검은색 바탕에 노란색 가로띠가 뚜렷하게 나타나 호랑무늬처럼 보이며 회백색 가루가 많다. | 한국,일본,대만에 분포한다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
5079 | s5330 | 호박벌류 | Bombus sp. | Animalia | 동물계 | Arthropoda | 절지동물문 | Insecta | 곤충강 | Hymenoptera | 벌목 | Unknow | 꿀벌과 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
5080 | s5331 | 혹가슴검정쇠똥풍뎅이 | Onthophagus atripennis | Animalia | 동물계 | Arthropoda | 절지동물문 | Insecta | 곤충강 | Coleoptera | 딱정벌레목 | Scarabaeidae | 소똥구리과 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
5081 | s5332 | 혹집게벌레 | Anechura harmandi | Animalia | 동물계 | Arthropoda | 절지동물문 | Insecta | 곤충강 | Dermaptera | 집게벌레목 | Forficulidae | 집게벌레과 | <NA> | <NA> | <NA> | <NA> | 고유종 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
5082 | s5333 | 흰줄바구미 | Cleonus japonicus Faust | Animalia | 동물계 | Arthropoda | 절지동물문 | Insecta | 곤충강 | Coleoptera | 딱정벌레목 | Curculionidae | 바구미과 | <NA> | 몸빛깔은 전체적으로 검은색을 나타내며, 더듬이와 다리의 발목마디 부분은 붉은색이다. 앞가슴등판 양 옆쪽으로 흰색의 아주 짧은 털이 있다. 몸길이 12mm이다. 몸은 긴 타원형을 취하고 있으며, 주둥이는 굵으면서 약간 길다. | 한국,일본에 분포하며 들판에 주로 서식한다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>