Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 10000 |
Missing cells | 5483 |
Missing cells (%) | 5.0% |
Duplicate rows | 6 |
Duplicate rows (%) | 0.1% |
Total size in memory | 966.8 KiB |
Average record size in memory | 99.0 B |
Variable types
Text | 7 |
---|---|
Numeric | 3 |
Categorical | 1 |
Dataset
Description | 종코드,국명,학명,서식지코드,서식지명,세부통계용명칭,출현년도,원전,X좌표,Y좌표,서식지비고정보 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-2200/S/1/datasetView.do |
Dataset has 6 (0.1%) duplicate rows | Duplicates |
X좌표 is highly overall correlated with 서식지비고정보 | High correlation |
Y좌표 is highly overall correlated with 서식지비고정보 | High correlation |
서식지비고정보 is highly overall correlated with X좌표 and 1 other fields | High correlation |
세부통계용명칭 has 1705 (17.1%) missing values | Missing |
X좌표 has 1885 (18.9%) missing values | Missing |
Y좌표 has 1885 (18.9%) missing values | Missing |
Reproduction
Analysis started | 2024-05-11 02:13:31.547094 |
---|---|
Analysis finished | 2024-05-11 02:13:40.678697 |
Duration | 9.13 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
종코드
Text
Distinct | 2636 |
---|---|
Distinct (%) | 26.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
s3918 | 35 | 0.4% |
s0214 | 33 | 0.3% |
s1261 | 32 | 0.3% |
s4502 | 31 | 0.3% |
s0712 | 31 | 0.3% |
s1725 | 30 | 0.3% |
s1978 | 30 | 0.3% |
s4526 | 29 | 0.3% |
s2078 | 28 | 0.3% |
s3837 | 26 | 0.3% |
Other values (2626) | 9695 |
Most occurring characters
Value | Count | Frequency (%) |
s | 10000 | |
2 | 5588 | |
1 | 5218 | |
3 | 5104 | |
0 | 5025 | |
4 | 4116 | |
5 | 3338 | 6.7% |
8 | 3018 | 6.0% |
9 | 2948 | 5.9% |
7 | 2884 | 5.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 40000 | |
Lowercase Letter | 10000 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 5588 | |
1 | 5218 | |
3 | 5104 | |
0 | 5025 | |
4 | 4116 | |
5 | 3338 | |
8 | 3018 | |
9 | 2948 | |
7 | 2884 | |
6 | 2761 |
Lowercase Letter
Value | Count | Frequency (%) |
s | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 40000 | |
Latin | 10000 | 20.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 5588 | |
1 | 5218 | |
3 | 5104 | |
0 | 5025 | |
4 | 4116 | |
5 | 3338 | |
8 | 3018 | |
9 | 2948 | |
7 | 2884 | |
6 | 2761 |
Latin
Value | Count | Frequency (%) |
s | 10000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 50000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
s | 10000 | |
2 | 5588 | |
1 | 5218 | |
3 | 5104 | |
0 | 5025 | |
4 | 4116 | |
5 | 3338 | 6.7% |
8 | 3018 | 6.0% |
9 | 2948 | 5.9% |
7 | 2884 | 5.8% |
국명
Text
Distinct | 2642 |
---|---|
Distinct (%) | 26.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
참새 | 35 | 0.4% |
개망초 | 33 | 0.3% |
닭의장풀 | 32 | 0.3% |
멧비둘기 | 31 | 0.3% |
까치 | 31 | 0.3% |
환삼덩굴 | 31 | 0.3% |
박새 | 30 | 0.3% |
황새냉이 | 29 | 0.3% |
뱀딸기 | 28 | 0.3% |
무당벌레 | 26 | 0.3% |
Other values (2620) | 9694 |
Most occurring characters
Value | Count | Frequency (%) |
나 | 2315 | 5.3% |
리 | 1768 | 4.1% |
무 | 1537 | 3.5% |
이 | 1104 | 2.5% |
비 | 822 | 1.9% |
개 | 775 | 1.8% |
기 | 728 | 1.7% |
풀 | 661 | 1.5% |
방 | 637 | 1.5% |
꽃 | 634 | 1.5% |
Other values (616) | 32325 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 43262 | |
Space Separator | 17 | < 0.1% |
Other Punctuation | 16 | < 0.1% |
Close Punctuation | 4 | < 0.1% |
Open Punctuation | 4 | < 0.1% |
Dash Punctuation | 2 | < 0.1% |
Lowercase Letter | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
나 | 2315 | 5.4% |
리 | 1768 | 4.1% |
무 | 1537 | 3.6% |
이 | 1104 | 2.6% |
비 | 822 | 1.9% |
개 | 775 | 1.8% |
기 | 728 | 1.7% |
풀 | 661 | 1.5% |
방 | 637 | 1.5% |
꽃 | 634 | 1.5% |
Other values (609) | 32281 |
Other Punctuation
Value | Count | Frequency (%) |
? | 15 | |
? | 1 | 6.2% |
Space Separator
Value | Count | Frequency (%) |
17 |
Close Punctuation
Value | Count | Frequency (%) |
) | 4 |
Open Punctuation
Value | Count | Frequency (%) |
( | 4 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Lowercase Letter
Value | Count | Frequency (%) |
f | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 43262 | |
Common | 43 | 0.1% |
Latin | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
나 | 2315 | 5.4% |
리 | 1768 | 4.1% |
무 | 1537 | 3.6% |
이 | 1104 | 2.6% |
비 | 822 | 1.9% |
개 | 775 | 1.8% |
기 | 728 | 1.7% |
풀 | 661 | 1.5% |
방 | 637 | 1.5% |
꽃 | 634 | 1.5% |
Other values (609) | 32281 |
Common
Value | Count | Frequency (%) |
17 | ||
? | 15 | |
) | 4 | 9.3% |
( | 4 | 9.3% |
- | 2 | 4.7% |
? | 1 | 2.3% |
Latin
Value | Count | Frequency (%) |
f | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 43262 | |
ASCII | 43 | 0.1% |
None | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
나 | 2315 | 5.4% |
리 | 1768 | 4.1% |
무 | 1537 | 3.6% |
이 | 1104 | 2.6% |
비 | 822 | 1.9% |
개 | 775 | 1.8% |
기 | 728 | 1.7% |
풀 | 661 | 1.5% |
방 | 637 | 1.5% |
꽃 | 634 | 1.5% |
Other values (609) | 32281 |
ASCII
Value | Count | Frequency (%) |
17 | ||
? | 15 | |
) | 4 | 9.3% |
( | 4 | 9.3% |
- | 2 | 4.7% |
f | 1 | 2.3% |
None
Value | Count | Frequency (%) |
? | 1 |
학명
Text
Distinct | 3037 |
---|---|
Distinct (%) | 30.4% |
Missing | 8 |
Missing (%) | 0.1% |
Memory size | 156.2 KiB |
Length
Max length | 77 |
---|---|
Median length | 61 |
Mean length | 26.818255 |
Min length | 7 |
Characters and Unicode
Total characters | 267968 |
---|---|
Distinct characters | 71 |
Distinct categories | 9 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 1555 ? |
---|---|
Unique (%) | 15.6% |
Sample
1st row | Disporum smilacinum A. Gray |
---|---|
2nd row | Agropyron tsukushiense var. transiens (Hack.) Ohwi |
3rd row | Streptopelia orientalis |
4th row | Plantago asiatica L. |
5th row | Galium spurium L. |
Value | Count | Frequency (%) |
l | 1507 | 4.5% |
var | 734 | 2.2% |
thunb | 427 | 1.3% |
japonica | 417 | 1.3% |
nakai | 362 | 1.1% |
siebold | 310 | 0.9% |
maxim | 279 | 0.8% |
miq | 207 | 0.6% |
makino | 191 | 0.6% |
188 | 0.6% | |
Other values (4523) | 28564 |
Most occurring characters
Value | Count | Frequency (%) |
a | 27231 | 10.2% |
23801 | 8.9% | |
i | 22119 | 8.3% |
e | 16806 | 6.3% |
s | 16087 | 6.0% |
r | 15770 | 5.9% |
o | 13298 | 5.0% |
n | 13201 | 4.9% |
u | 13052 | 4.9% |
l | 11647 | 4.3% |
Other values (61) | 94956 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 209847 | |
Space Separator | 23801 | 8.9% |
Uppercase Letter | 21321 | 8.0% |
Other Punctuation | 6635 | 2.5% |
Close Punctuation | 3081 | 1.1% |
Open Punctuation | 3078 | 1.1% |
Dash Punctuation | 179 | 0.1% |
Decimal Number | 25 | < 0.1% |
Other Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 27231 | |
i | 22119 | |
e | 16806 | 8.0% |
s | 16087 | 7.7% |
r | 15770 | 7.5% |
o | 13298 | 6.3% |
n | 13201 | 6.3% |
u | 13052 | 6.2% |
l | 11647 | 5.6% |
t | 9940 | 4.7% |
Other values (16) | 50696 |
Uppercase Letter
Value | Count | Frequency (%) |
L | 2525 | |
S | 2003 | 9.4% |
P | 1914 | 9.0% |
C | 1769 | 8.3% |
M | 1762 | 8.3% |
A | 1477 | 6.9% |
B | 1241 | 5.8% |
T | 1193 | 5.6% |
H | 932 | 4.4% |
R | 856 | 4.0% |
Other values (16) | 5649 |
Other Punctuation
Value | Count | Frequency (%) |
. | 6557 | |
: | 38 | 0.6% |
, | 26 | 0.4% |
& | 11 | 0.2% |
? | 2 | < 0.1% |
; | 1 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 14 | |
7 | 3 | 12.0% |
8 | 3 | 12.0% |
2 | 2 | 8.0% |
9 | 2 | 8.0% |
5 | 1 | 4.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 3079 | |
] | 2 | 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 3076 | |
[ | 2 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
23801 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 179 |
Other Letter
Value | Count | Frequency (%) |
ㅍ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 231168 | |
Common | 36799 | 13.7% |
Hangul | 1 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 27231 | 11.8% |
i | 22119 | 9.6% |
e | 16806 | 7.3% |
s | 16087 | 7.0% |
r | 15770 | 6.8% |
o | 13298 | 5.8% |
n | 13201 | 5.7% |
u | 13052 | 5.6% |
l | 11647 | 5.0% |
t | 9940 | 4.3% |
Other values (42) | 72017 |
Common
Value | Count | Frequency (%) |
23801 | ||
. | 6557 | 17.8% |
) | 3079 | 8.4% |
( | 3076 | 8.4% |
- | 179 | 0.5% |
: | 38 | 0.1% |
, | 26 | 0.1% |
1 | 14 | < 0.1% |
& | 11 | < 0.1% |
7 | 3 | < 0.1% |
Other values (8) | 15 | < 0.1% |
Hangul
Value | Count | Frequency (%) |
ㅍ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 267967 | |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 27231 | 10.2% |
23801 | 8.9% | |
i | 22119 | 8.3% |
e | 16806 | 6.3% |
s | 16087 | 6.0% |
r | 15770 | 5.9% |
o | 13298 | 5.0% |
n | 13201 | 4.9% |
u | 13052 | 4.9% |
l | 11647 | 4.3% |
Other values (60) | 94955 |
Compat Jamo
Value | Count | Frequency (%) |
ㅍ | 1 |
서식지코드
Text
Distinct | 244 |
---|---|
Distinct (%) | 2.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
p0102 | 649 | 6.5% |
p0058 | 561 | 5.6% |
p0198 | 529 | 5.3% |
p0050 | 335 | 3.4% |
p0036 | 324 | 3.2% |
p0243 | 295 | 2.9% |
p0271 | 244 | 2.4% |
p0279 | 237 | 2.4% |
p0249 | 237 | 2.4% |
p0148 | 218 | 2.2% |
Other values (234) | 6371 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 15109 | |
p | 10000 | |
2 | 4951 | 9.9% |
1 | 4082 | 8.2% |
3 | 3383 | 6.8% |
5 | 2763 | 5.5% |
9 | 2438 | 4.9% |
8 | 2358 | 4.7% |
4 | 1940 | 3.9% |
6 | 1591 | 3.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 40000 | |
Lowercase Letter | 10000 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 15109 | |
2 | 4951 | 12.4% |
1 | 4082 | 10.2% |
3 | 3383 | 8.5% |
5 | 2763 | 6.9% |
9 | 2438 | 6.1% |
8 | 2358 | 5.9% |
4 | 1940 | 4.9% |
6 | 1591 | 4.0% |
7 | 1385 | 3.5% |
Lowercase Letter
Value | Count | Frequency (%) |
p | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 40000 | |
Latin | 10000 | 20.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 15109 | |
2 | 4951 | 12.4% |
1 | 4082 | 10.2% |
3 | 3383 | 8.5% |
5 | 2763 | 6.9% |
9 | 2438 | 6.1% |
8 | 2358 | 5.9% |
4 | 1940 | 4.9% |
6 | 1591 | 4.0% |
7 | 1385 | 3.5% |
Latin
Value | Count | Frequency (%) |
p | 10000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 50000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 15109 | |
p | 10000 | |
2 | 4951 | 9.9% |
1 | 4082 | 8.2% |
3 | 3383 | 6.8% |
5 | 2763 | 5.5% |
9 | 2438 | 4.9% |
8 | 2358 | 4.7% |
4 | 1940 | 3.9% |
6 | 1591 | 3.2% |
서식지명
Text
Distinct | 278 |
---|---|
Distinct (%) | 2.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
생태경관보전지역 | 842 | 7.1% |
북한산 | 649 | 5.5% |
남산 | 627 | 5.3% |
월드컵공원 | 529 | 4.5% |
청계산 | 333 | 2.8% |
길동생태공원 | 326 | 2.7% |
관악산 | 325 | 2.7% |
탄천 | 237 | 2.0% |
헌인릉 | 237 | 2.0% |
수락산 | 218 | 1.8% |
Other values (321) | 7559 |
Most occurring characters
Value | Count | Frequency (%) |
산 | 3600 | 7.4% |
1882 | 3.9% | |
천 | 1723 | 3.5% |
원 | 1638 | 3.4% |
생 | 1358 | 2.8% |
태 | 1354 | 2.8% |
지 | 1307 | 2.7% |
관 | 1278 | 2.6% |
동 | 1207 | 2.5% |
공 | 1130 | 2.3% |
Other values (236) | 32070 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 42922 | |
Space Separator | 1882 | 3.9% |
Decimal Number | 1671 | 3.4% |
Uppercase Letter | 1510 | 3.1% |
Math Symbol | 358 | 0.7% |
Dash Punctuation | 54 | 0.1% |
Open Punctuation | 52 | 0.1% |
Close Punctuation | 52 | 0.1% |
Other Punctuation | 44 | 0.1% |
Lowercase Letter | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 3600 | 8.4% |
천 | 1723 | 4.0% |
원 | 1638 | 3.8% |
생 | 1358 | 3.2% |
태 | 1354 | 3.2% |
지 | 1307 | 3.0% |
관 | 1278 | 3.0% |
동 | 1207 | 2.8% |
공 | 1130 | 2.6% |
한 | 1033 | 2.4% |
Other values (208) | 27294 |
Decimal Number
Value | Count | Frequency (%) |
1 | 312 | |
2 | 266 | |
4 | 222 | |
5 | 219 | |
3 | 207 | |
6 | 134 | |
7 | 108 | 6.5% |
8 | 84 | 5.0% |
9 | 60 | 3.6% |
0 | 59 | 3.5% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 317 | |
A | 262 | |
H | 236 | |
E | 201 | |
G | 157 | |
F | 135 | |
B | 115 | 7.6% |
D | 87 | 5.8% |
Other Punctuation
Value | Count | Frequency (%) |
? | 42 | |
/ | 1 | 2.3% |
. | 1 | 2.3% |
Lowercase Letter
Value | Count | Frequency (%) |
k | 1 | |
m | 1 |
Space Separator
Value | Count | Frequency (%) |
1882 |
Math Symbol
Value | Count | Frequency (%) |
~ | 358 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 54 |
Open Punctuation
Value | Count | Frequency (%) |
( | 52 |
Close Punctuation
Value | Count | Frequency (%) |
) | 52 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 42922 | |
Common | 4113 | 8.5% |
Latin | 1512 | 3.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 3600 | 8.4% |
천 | 1723 | 4.0% |
원 | 1638 | 3.8% |
생 | 1358 | 3.2% |
태 | 1354 | 3.2% |
지 | 1307 | 3.0% |
관 | 1278 | 3.0% |
동 | 1207 | 2.8% |
공 | 1130 | 2.6% |
한 | 1033 | 2.4% |
Other values (208) | 27294 |
Common
Value | Count | Frequency (%) |
1882 | ||
~ | 358 | 8.7% |
1 | 312 | 7.6% |
2 | 266 | 6.5% |
4 | 222 | 5.4% |
5 | 219 | 5.3% |
3 | 207 | 5.0% |
6 | 134 | 3.3% |
7 | 108 | 2.6% |
8 | 84 | 2.0% |
Other values (8) | 321 | 7.8% |
Latin
Value | Count | Frequency (%) |
C | 317 | |
A | 262 | |
H | 236 | |
E | 201 | |
G | 157 | |
F | 135 | |
B | 115 | 7.6% |
D | 87 | 5.8% |
k | 1 | 0.1% |
m | 1 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 42922 | |
ASCII | 5625 | 11.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
산 | 3600 | 8.4% |
천 | 1723 | 4.0% |
원 | 1638 | 3.8% |
생 | 1358 | 3.2% |
태 | 1354 | 3.2% |
지 | 1307 | 3.0% |
관 | 1278 | 3.0% |
동 | 1207 | 2.8% |
공 | 1130 | 2.6% |
한 | 1033 | 2.4% |
Other values (208) | 27294 |
ASCII
Value | Count | Frequency (%) |
1882 | ||
~ | 358 | 6.4% |
C | 317 | 5.6% |
1 | 312 | 5.5% |
2 | 266 | 4.7% |
A | 262 | 4.7% |
H | 236 | 4.2% |
4 | 222 | 3.9% |
5 | 219 | 3.9% |
3 | 207 | 3.7% |
Other values (18) | 1344 |
세부통계용명칭
Text
MISSING
 
Distinct | 61 |
---|---|
Distinct (%) | 0.7% |
Missing | 1705 |
Missing (%) | 17.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
한강 | 1150 | 12.7% |
북한산 | 697 | 7.7% |
국립공원 | 697 | 7.7% |
남산 | 667 | 7.4% |
월드컵공원 | 621 | 6.9% |
청계산 | 427 | 4.7% |
길동생태공원 | 335 | 3.7% |
중랑천 | 325 | 3.6% |
관악산 | 324 | 3.6% |
청계천 | 289 | 3.2% |
Other values (52) | 3524 |
Most occurring characters
Value | Count | Frequency (%) |
산 | 3550 | 11.7% |
원 | 1986 | 6.6% |
공 | 1915 | 6.3% |
한 | 1847 | 6.1% |
강 | 1359 | 4.5% |
천 | 1326 | 4.4% |
761 | 2.5% | |
국 | 756 | 2.5% |
립 | 756 | 2.5% |
북 | 752 | 2.5% |
Other values (98) | 15232 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 29479 | |
Space Separator | 761 | 2.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 3550 | 12.0% |
원 | 1986 | 6.7% |
공 | 1915 | 6.5% |
한 | 1847 | 6.3% |
강 | 1359 | 4.6% |
천 | 1326 | 4.5% |
국 | 756 | 2.6% |
립 | 756 | 2.6% |
북 | 752 | 2.6% |
청 | 716 | 2.4% |
Other values (97) | 14516 |
Space Separator
Value | Count | Frequency (%) |
761 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 29479 | |
Common | 761 | 2.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 3550 | 12.0% |
원 | 1986 | 6.7% |
공 | 1915 | 6.5% |
한 | 1847 | 6.3% |
강 | 1359 | 4.6% |
천 | 1326 | 4.5% |
국 | 756 | 2.6% |
립 | 756 | 2.6% |
북 | 752 | 2.6% |
청 | 716 | 2.4% |
Other values (97) | 14516 |
Common
Value | Count | Frequency (%) |
761 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 29479 | |
ASCII | 761 | 2.5% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
산 | 3550 | 12.0% |
원 | 1986 | 6.7% |
공 | 1915 | 6.5% |
한 | 1847 | 6.3% |
강 | 1359 | 4.6% |
천 | 1326 | 4.5% |
국 | 756 | 2.6% |
립 | 756 | 2.6% |
북 | 752 | 2.6% |
청 | 716 | 2.4% |
Other values (97) | 14516 |
ASCII
Value | Count | Frequency (%) |
761 |
출현년도
Real number (ℝ)
Distinct | 24 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2002.5167 |
Minimum | 1948 |
---|---|
Maximum | 2012 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1948 |
---|---|
5-th percentile | 1994 |
Q1 | 2001 |
median | 2004 |
Q3 | 2006 |
95-th percentile | 2009 |
Maximum | 2012 |
Range | 64 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 6.7497761 |
---|---|
Coefficient of variation (CV) | 0.0033706466 |
Kurtosis | 32.408135 |
Mean | 2002.5167 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -4.7330307 |
Sum | 20025167 |
Variance | 45.559477 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2004 | 2271 | |
2007 | 1025 | |
2006 | 990 | |
2001 | 810 | 8.1% |
2002 | 809 | 8.1% |
2005 | 767 | 7.7% |
2009 | 642 | 6.4% |
2003 | 574 | 5.7% |
1999 | 511 | 5.1% |
1997 | 376 | 3.8% |
Other values (14) | 1225 |
Value | Count | Frequency (%) |
1948 | 76 | |
1972 | 53 | 0.5% |
1984 | 8 | 0.1% |
1986 | 76 | |
1987 | 72 | |
1989 | 33 | 0.3% |
1992 | 66 | |
1993 | 93 | |
1994 | 156 | |
1996 | 29 | 0.3% |
Value | Count | Frequency (%) |
2012 | 53 | 0.5% |
2009 | 642 | 6.4% |
2008 | 131 | 1.3% |
2007 | 1025 | |
2006 | 990 | |
2005 | 767 | 7.7% |
2004 | 2271 | |
2003 | 574 | 5.7% |
2002 | 809 | 8.1% |
2001 | 810 | 8.1% |
원전
Text
Distinct | 95 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 46 |
---|---|
Median length | 36 |
Mean length | 19.931 |
Min length | 6 |
Characters and Unicode
Total characters | 199310 |
---|---|
Distinct characters | 200 |
Distinct categories | 9 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 산림생태계조사 연구보고서 |
---|---|
2nd row | 2007년 한강생태계 조사 |
3rd row | 2007년 한강생태계 조사 |
4th row | 서울시 우수 생태계지역 정밀조사 연구 |
5th row | 서울시 도시숲(산림) 생태계 조사 학술 연구 |
Value | Count | Frequency (%) |
서울시 | 2813 | 7.1% |
한강생태계 | 1587 | 4.0% |
및 | 1531 | 3.9% |
생물다양성 | 1510 | 3.8% |
증진방안 | 1510 | 3.8% |
비오톱유형별 | 1510 | 3.8% |
조사 | 1482 | 3.8% |
조사연구 | 1075 | 2.7% |
2007년 | 1055 | 2.7% |
연구 | 1025 | 2.6% |
Other values (203) | 24369 |
Most occurring characters
Value | Count | Frequency (%) |
30481 | 15.3% | |
생 | 8642 | 4.3% |
태 | 6441 | 3.2% |
계 | 5628 | 2.8% |
연 | 5065 | 2.5% |
서 | 4841 | 2.4% |
관 | 4779 | 2.4% |
0 | 4577 | 2.3% |
조 | 4268 | 2.1% |
사 | 4210 | 2.1% |
Other values (190) | 120378 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 153715 | |
Space Separator | 30481 | 15.3% |
Decimal Number | 10430 | 5.2% |
Open Punctuation | 1745 | 0.9% |
Close Punctuation | 1745 | 0.9% |
Other Punctuation | 792 | 0.4% |
Dash Punctuation | 208 | 0.1% |
Lowercase Letter | 152 | 0.1% |
Uppercase Letter | 42 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
생 | 8642 | 5.6% |
태 | 6441 | 4.2% |
계 | 5628 | 3.7% |
연 | 5065 | 3.3% |
서 | 4841 | 3.1% |
관 | 4779 | 3.1% |
조 | 4268 | 2.8% |
사 | 4210 | 2.7% |
구 | 3954 | 2.6% |
시 | 3509 | 2.3% |
Other values (168) | 102378 |
Decimal Number
Value | Count | Frequency (%) |
0 | 4577 | |
2 | 2578 | |
7 | 1421 | 13.6% |
8 | 443 | 4.2% |
4 | 426 | 4.1% |
6 | 326 | 3.1% |
3 | 285 | 2.7% |
1 | 170 | 1.6% |
5 | 132 | 1.3% |
9 | 72 | 0.7% |
Other Punctuation
Value | Count | Frequency (%) |
, | 277 | |
? | 233 | |
. | 204 | |
: | 72 | 9.1% |
/ | 6 | 0.8% |
Uppercase Letter
Value | Count | Frequency (%) |
I | 21 | |
V | 21 |
Space Separator
Value | Count | Frequency (%) |
30481 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1745 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1745 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 208 |
Lowercase Letter
Value | Count | Frequency (%) |
p | 152 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 153715 | |
Common | 45401 | 22.8% |
Latin | 194 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
생 | 8642 | 5.6% |
태 | 6441 | 4.2% |
계 | 5628 | 3.7% |
연 | 5065 | 3.3% |
서 | 4841 | 3.1% |
관 | 4779 | 3.1% |
조 | 4268 | 2.8% |
사 | 4210 | 2.7% |
구 | 3954 | 2.6% |
시 | 3509 | 2.3% |
Other values (168) | 102378 |
Common
Value | Count | Frequency (%) |
30481 | ||
0 | 4577 | 10.1% |
2 | 2578 | 5.7% |
( | 1745 | 3.8% |
) | 1745 | 3.8% |
7 | 1421 | 3.1% |
8 | 443 | 1.0% |
4 | 426 | 0.9% |
6 | 326 | 0.7% |
3 | 285 | 0.6% |
Other values (9) | 1374 | 3.0% |
Latin
Value | Count | Frequency (%) |
p | 152 | |
I | 21 | 10.8% |
V | 21 | 10.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 153715 | |
ASCII | 45595 | 22.9% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
30481 | ||
0 | 4577 | 10.0% |
2 | 2578 | 5.7% |
( | 1745 | 3.8% |
) | 1745 | 3.8% |
7 | 1421 | 3.1% |
8 | 443 | 1.0% |
4 | 426 | 0.9% |
6 | 326 | 0.7% |
3 | 285 | 0.6% |
Other values (12) | 1568 | 3.4% |
Hangul
Value | Count | Frequency (%) |
생 | 8642 | 5.6% |
태 | 6441 | 4.2% |
계 | 5628 | 3.7% |
연 | 5065 | 3.3% |
서 | 4841 | 3.1% |
관 | 4779 | 3.1% |
조 | 4268 | 2.8% |
사 | 4210 | 2.7% |
구 | 3954 | 2.6% |
시 | 3509 | 2.3% |
Other values (168) | 102378 |
X좌표
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 144 |
---|---|
Distinct (%) | 1.8% |
Missing | 1885 |
Missing (%) | 18.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 202748.8 |
Minimum | 182204.3 |
---|---|
Maximum | 256839.46 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 182204.3 |
---|---|
5-th percentile | 189443.6 |
Q1 | 196429 |
median | 199406.4 |
Q3 | 207170.8 |
95-th percentile | 213686.3 |
Maximum | 256839.46 |
Range | 74635.157 |
Interquartile range (IQR) | 10741.8 |
Descriptive statistics
Standard deviation | 10891.678 |
---|---|
Coefficient of variation (CV) | 0.053720059 |
Kurtosis | 6.7382043 |
Mean | 202748.8 |
Median Absolute Deviation (MAD) | 6693.1 |
Skewness | 1.9060867 |
Sum | 1.6453065 × 109 |
Variance | 1.1862864 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
198426.6 | 649 | 6.5% |
199375.0 | 561 | 5.6% |
190107.9 | 529 | 5.3% |
213611.0 | 335 | 3.4% |
196429.0 | 324 | 3.2% |
203920.1 | 295 | 2.9% |
198658.9 | 244 | 2.4% |
208246.3 | 237 | 2.4% |
207170.8 | 237 | 2.4% |
206693.8 | 218 | 2.2% |
Other values (134) | 4486 | |
(Missing) | 1885 |
Value | Count | Frequency (%) |
182204.3 | 49 | |
182514.7 | 10 | 0.1% |
182711.8 | 1 | < 0.1% |
182726.2 | 60 | |
182949.6 | 5 | 0.1% |
184424.4 | 24 | 0.2% |
184641.9 | 11 | 0.1% |
185160.6 | 4 | < 0.1% |
185616.0 | 6 | 0.1% |
186082.6 | 1 | < 0.1% |
Value | Count | Frequency (%) |
256839.4568 | 27 | 0.3% |
254100.1354 | 10 | 0.1% |
252811.7026 | 81 | 0.8% |
246332.7914 | 25 | 0.2% |
234793.9968 | 62 | 0.6% |
233493.0421 | 40 | 0.4% |
232023.3238 | 59 | 0.6% |
214034.7 | 8 | 0.1% |
213686.3 | 127 | 1.3% |
213611.0 | 335 |
Y좌표
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 145 |
---|---|
Distinct (%) | 1.8% |
Missing | 1885 |
Missing (%) | 18.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 456607.36 |
Minimum | 437272.4 |
---|---|
Maximum | 608607.71 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 437272.4 |
---|---|
5-th percentile | 437564.6 |
Q1 | 445458 |
median | 451442.6 |
Q3 | 453647.8 |
95-th percentile | 465056.8 |
Maximum | 608607.71 |
Range | 171335.31 |
Interquartile range (IQR) | 8189.8 |
Descriptive statistics
Standard deviation | 30005.006 |
---|---|
Coefficient of variation (CV) | 0.065712927 |
Kurtosis | 14.95384 |
Mean | 456607.36 |
Median Absolute Deviation (MAD) | 4871.5 |
Skewness | 3.9732704 |
Sum | 3.7053687 × 109 |
Variance | 9.003004 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
461398.5 | 649 | 6.5% |
449747.8 | 561 | 5.6% |
451442.6 | 529 | 5.3% |
448693.0 | 335 | 3.4% |
440098.8 | 324 | 3.2% |
437272.4 | 295 | 2.9% |
445458.0 | 244 | 2.4% |
443827.0 | 237 | 2.4% |
440246.8 | 237 | 2.4% |
465056.8 | 218 | 2.2% |
Other values (135) | 4486 | |
(Missing) | 1885 |
Value | Count | Frequency (%) |
437272.4 | 295 | |
437564.6 | 132 | |
437765.4 | 7 | 0.1% |
440098.8 | 324 | |
440246.8 | 237 | |
440298.9 | 1 | < 0.1% |
440690.2 | 16 | 0.2% |
440774.0 | 5 | 0.1% |
440980.5 | 41 | 0.4% |
441372.6 | 115 | 1.1% |
Value | Count | Frequency (%) |
608607.7106 | 20 | 0.2% |
598855.5575 | 59 | 0.6% |
597511.3537 | 25 | 0.2% |
591534.6811 | 51 | 0.5% |
586144.1399 | 27 | 0.3% |
585739.9074 | 81 | 0.8% |
585505.296 | 10 | 0.1% |
580078.2372 | 62 | 0.6% |
578962.5137 | 40 | 0.4% |
465056.8 | 218 |
서식지비고정보
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
동경측지계 | |
---|---|
<NA> | |
세계측지계 | 603 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.8343 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 동경측지계 |
---|---|
2nd row | 동경측지계 |
3rd row | 동경측지계 |
4th row | 동경측지계 |
5th row | 세계측지계 |
Common Values
Value | Count | Frequency (%) |
동경측지계 | 7740 | |
<NA> | 1657 | 16.6% |
세계측지계 | 603 | 6.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
동경측지계 | 7740 | |
na | 1657 | 16.6% |
세계측지계 | 603 | 6.0% |
세부통계용명칭 | 출현년도 | 원전 | X좌표 | Y좌표 | 서식지비고정보 | |
---|---|---|---|---|---|---|
세부통계용명칭 | 1.000 | 0.784 | 0.992 | 0.987 | 0.993 | 0.903 |
출현년도 | 0.784 | 1.000 | 1.000 | 0.288 | 0.219 | 0.208 |
원전 | 0.992 | 1.000 | 1.000 | 0.933 | 0.948 | 0.946 |
X좌표 | 0.987 | 0.288 | 0.933 | 1.000 | 0.810 | 0.866 |
Y좌표 | 0.993 | 0.219 | 0.948 | 0.810 | 1.000 | 1.000 |
서식지비고정보 | 0.903 | 0.208 | 0.946 | 0.866 | 1.000 | 1.000 |
출현년도 | X좌표 | Y좌표 | 서식지비고정보 | |
---|---|---|---|---|
출현년도 | 1.000 | 0.130 | 0.063 | 0.151 |
X좌표 | 0.130 | 1.000 | 0.020 | 0.899 |
Y좌표 | 0.063 | 0.020 | 1.000 | 1.000 |
서식지비고정보 | 0.151 | 0.899 | 1.000 | 1.000 |
종코드 | 국명 | 학명 | 서식지코드 | 서식지명 | 세부통계용명칭 | 출현년도 | 원전 | X좌표 | Y좌표 | 서식지비고정보 | |
---|---|---|---|---|---|---|---|---|---|---|---|
38552 | s2975 | 애기나리 | Disporum smilacinum A. Gray | p0205 | 인왕산 | 인왕산 | 1998 | 산림생태계조사 연구보고서 | 196252.9 | 453421.1 | 동경측지계 |
40028 | s0226 | 개밀 | Agropyron tsukushiense var. transiens (Hack.) Ohwi | p0222 | 중랑천 | 중랑천 | 2007 | 2007년 한강생태계 조사 | 206389.4 | 453647.8 | 동경측지계 |
31807 | s1725 | 멧비둘기 | Streptopelia orientalis | p0175 | 여의도샛강 | 여의도샛강 | 2006 | 2007년 한강생태계 조사 | 192687.5 | 446182.8 | 동경측지계 |
259 | s3855 | 질경이 | Plantago asiatica L. | p0008 | 개포동 달터근린공원 | 달터근린공원 | 2001 | 서울시 우수 생태계지역 정밀조사 연구 | 204367.0 | 442054.9 | 동경측지계 |
53369 | s0169 | 갈퀴덩굴 | Galium spurium L. | p0293 | 천왕산 | 한강 | 2007 | 서울시 도시숲(산림) 생태계 조사 학술 연구 | 252811.7026 | 585739.9074 | 세계측지계 |
11790 | s1353 | 도둑놈의갈고리 | Desmodium oxyphyllum DC. | p0058 | 남산 | 남산 | 1948 | 남산의 식물 | 199375.0 | 449747.8 | 동경측지계 |
53147 | s0419 | 고양이 | Felis catus | p0292 | A1 | 초안산 | 2004 | 서울시 비오톱유형별 생물다양성 증진방안 | 207230.5258 | 608607.7106 | 세계측지계 |
17939 | s4609 | 흰눈썹황금새 | Ficedula zanthopygia | p0094 | 보라매공원 | 보라매공원 | 2006 | 소규모 생물서식공간 생태계 모니터링 | 192713.3 | 443357.8 | 동경측지계 |
5973 | s3409 | 이스라지 | Prunus japonica Thunb. var. nakaii (Lev.) Rehder | p0042 | 구룡산 물박달나무군집 | 구룡산 | 2001 | 서울시 우수 생태계지역 정밀조사 연구 | 205028.7 | 440690.2 | 동경측지계 |
7522 | s0517 | 구슬무당거저리 | Ceropria induta (Wiedemann) | p0050 | 길동생태공원 | 길동생태공원 | 2004 | 2004년 운영결과보고서 | 213611.0 | 448693.0 | 동경측지계 |
종코드 | 국명 | 학명 | 서식지코드 | 서식지명 | 세부통계용명칭 | 출현년도 | 원전 | X좌표 | Y좌표 | 서식지비고정보 | |
---|---|---|---|---|---|---|---|---|---|---|---|
55598 | s3617 | 조릿대 | Sasa borealis (Hack.) Makino | p0301 | B1 | <NA> | 2004 | 서울시 비오톱유형별 생물다양성 증진방안 | <NA> | <NA> | 세계측지계 |
11913 | s1739 | 명자꽃 | Chaenomeles lagenaria (Loisel) Koidz. | p0058 | 남산 | 남산 | 1986 | 남산공원의 자연환경실태 및 보존대책 pp.1-78 | 199375.0 | 449747.8 | 동경측지계 |
59428 | s2246 | 북쪽비단노린재 | Eurydema gebleri Kolenati | p0321 | C8 | <NA> | 2004 | 서울시 비오톱유형별 생물다양성 증진방안 | <NA> | <NA> | <NA> |
31768 | s0735 | 깝작도요 | Actitis hypoleucos | p0175 | 여의도샛강 | 여의도샛강 | 2001 | 서울시 우수 생태계지역 정밀조사 연구 | 192687.5 | 446182.8 | 동경측지계 |
58286 | s1754 | 모메뚜기 | Tetrix japonica (Bolivar) | p0313 | C12 | <NA> | 2004 | 서울시 비오톱유형별 생물다양성 증진방안 | <NA> | <NA> | <NA> |
5724 | s3561 | 점박이둥글노린재 | Eysarcoris guttiger (Thunberg) | p0038 | 광나루 | 한강 | 2006 | 2007년 한강생태계 조사 | 210903.0 | 450647.3 | 동경측지계 |
62671 | s4278 | 톱다리개미허리노린재 | Riptortus clavatus (Thunberg) | p0343 | G3 | <NA> | 2004 | 서울시 비오톱유형별 생물다양성 증진방안 | <NA> | <NA> | <NA> |
51448 | s4805 | 산거울 | Carex humilis | p0279 | 헌인릉 오리나무군집 | 헌인릉 | 2009 | 헌인릉 생태경관보전지역 관리계획 수립 연구3차년도 | 207170.8 | 440246.8 | 동경측지계 |
14630 | s3846 | 진득찰 | Siegesbeckia glabrescens Makino | p0065 | 대모산 | 대모산 | 1997 | 산림생태계조사 연구보고서 | 206564.3 | 441372.6 | 동경측지계 |
10853 | s0043 | 가래나무 | Juglans mandshurica Maxim. | p0058 | 남산 | 남산 | 1987 | 남산의 식물상, 자연보호 59:36-48 | 199375.0 | 449747.8 | 동경측지계 |
Most frequently occurring
종코드 | 국명 | 학명 | 서식지코드 | 서식지명 | 세부통계용명칭 | 출현년도 | 원전 | X좌표 | Y좌표 | 서식지비고정보 | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
4 | s4752 | 깔다구과류 | Chironomidae sp. | p0222 | 중랑천 | 중랑천 | 2007 | 2007년 한강생태계 조사 | 206389.4 | 453647.8 | 동경측지계 | 3 |
0 | s1754 | 모메뚜기 | Tetrix japonica (Bolivar) | p0222 | 중랑천 | 중랑천 | 2006 | 2007년 한강생태계 조사 | 206389.4 | 453647.8 | 동경측지계 | 2 |
1 | s1754 | 모메뚜기 | Tetrix japonica (Bolivar) | p0246 | 청계천 | 청계천 | 2007 | 2007년 한강생태계 조사 | 204171.6 | 451564.6 | 동경측지계 | 2 |
2 | s2011 | 방가지똥 | Sonchus oleraceus L. | p0246 | 청계천 | 청계천 | 2007 | 2007년 한강생태계 조사 | 204171.6 | 451564.6 | 동경측지계 | 2 |
3 | s4752 | 깔다구과류 | Chironomidae sp. | p0158 | 안양천 | 안양천 | 2006 | 2007년 한강생태계 조사 | 189443.6 | 447227.4 | 동경측지계 | 2 |
5 | s4752 | 깔다구과류 | Chironomidae sp. | p0298 | 백운천 | 항동수목원 | 2005 | 서울시 복개하천 복원 타당성 조사연구(2005) | 233493.0421 | 578962.5137 | 세계측지계 | 2 |