Overview

Dataset statistics

Number of variables10
Number of observations2962
Missing cells23
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory240.2 KiB
Average record size in memory83.0 B

Variable types

Numeric3
Text5
Boolean2

Dataset

Description국가표준식물종관리시스템은 전국 공, 공, 사립수목원을 대상으로 내부 식물관리용으로 개발된 시스템으로서 해당 데이터는 표준식물종정보로 학명, 결실주기, 멸종위기 등 관련 정보를 제공하고자 함
Author산림청
URLhttps://www.data.go.kr/data/15092915/fileData.do

Alerts

개화기시작(월) is highly overall correlated with 결실기(월)High correlation
결실기(월) is highly overall correlated with 개화기시작(월)High correlation
특산식물여부 is highly imbalanced (64.2%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:17:08.621720
Analysis finished2023-12-12 22:17:10.551317
Duration1.93 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct2962
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1481.5
Minimum1
Maximum2962
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.2 KiB
2023-12-13T07:17:10.616735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile149.05
Q1741.25
median1481.5
Q32221.75
95-th percentile2813.95
Maximum2962
Range2961
Interquartile range (IQR)1480.5

Descriptive statistics

Standard deviation855.20007
Coefficient of variation (CV)0.57725283
Kurtosis-1.2
Mean1481.5
Median Absolute Deviation (MAD)740.5
Skewness0
Sum4388203
Variance731367.17
MonotonicityStrictly increasing
2023-12-13T07:17:10.734768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1980 1
 
< 0.1%
1971 1
 
< 0.1%
1972 1
 
< 0.1%
1973 1
 
< 0.1%
1974 1
 
< 0.1%
1975 1
 
< 0.1%
1976 1
 
< 0.1%
1977 1
 
< 0.1%
1978 1
 
< 0.1%
Other values (2952) 2952
99.7%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2962 1
< 0.1%
2961 1
< 0.1%
2960 1
< 0.1%
2959 1
< 0.1%
2958 1
< 0.1%
2957 1
< 0.1%
2956 1
< 0.1%
2955 1
< 0.1%
2954 1
< 0.1%
2953 1
< 0.1%

국명
Text

Distinct2938
Distinct (%)> 99.9%
Missing23
Missing (%)0.8%
Memory size23.3 KiB
2023-12-13T07:17:11.004780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length4.3538619
Min length1

Characters and Unicode

Total characters12796
Distinct characters527
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2937 ?
Unique (%)99.9%

Sample

1st row실새삼
2nd row갯실새삼
3rd row애기메꽃
4th row큰메꽃
5th row선메꽃
ValueCountFrequency (%)
왕보리수나무 2
 
0.1%
섬말나리 1
 
< 0.1%
누른하늘말나리 1
 
< 0.1%
나도옥잠화 1
 
< 0.1%
실새삼 1
 
< 0.1%
참비비추 1
 
< 0.1%
좀비비추 1
 
< 0.1%
비비추 1
 
< 0.1%
각시원추리 1
 
< 0.1%
큰원추리 1
 
< 0.1%
Other values (2928) 2928
99.6%
2023-12-13T07:17:11.367861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
717
 
5.6%
538
 
4.2%
383
 
3.0%
299
 
2.3%
286
 
2.2%
285
 
2.2%
242
 
1.9%
201
 
1.6%
188
 
1.5%
184
 
1.4%
Other values (517) 9473
74.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12796
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
717
 
5.6%
538
 
4.2%
383
 
3.0%
299
 
2.3%
286
 
2.2%
285
 
2.2%
242
 
1.9%
201
 
1.6%
188
 
1.5%
184
 
1.4%
Other values (517) 9473
74.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12796
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
717
 
5.6%
538
 
4.2%
383
 
3.0%
299
 
2.3%
286
 
2.2%
285
 
2.2%
242
 
1.9%
201
 
1.6%
188
 
1.5%
184
 
1.4%
Other values (517) 9473
74.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12796
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
717
 
5.6%
538
 
4.2%
383
 
3.0%
299
 
2.3%
286
 
2.2%
285
 
2.2%
242
 
1.9%
201
 
1.6%
188
 
1.5%
184
 
1.4%
Other values (517) 9473
74.0%

학명
Text

Distinct2959
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size23.3 KiB
2023-12-13T07:17:11.618803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length104
Median length75
Mean length33.937205
Min length12

Characters and Unicode

Total characters100522
Distinct characters61
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2956 ?
Unique (%)99.8%

Sample

1st rowCuscuta australis R.Br.
2nd rowCuscuta chinensis Lam.
3rd rowCalystegia hederacea Wall.
4th rowCalystegia sepium (L.) R.Br.
5th rowCalystegia dahurica (Herb.) Choisy
ValueCountFrequency (%)
nakai 518
 
4.0%
var 510
 
3.9%
364
 
2.8%
l 340
 
2.6%
ex 251
 
1.9%
f 238
 
1.8%
maxim 227
 
1.7%
thunb 170
 
1.3%
makino 150
 
1.2%
japonica 118
 
0.9%
Other values (3462) 10128
77.8%
2023-12-13T07:17:11.982826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 10703
 
10.6%
10052
 
10.0%
i 8239
 
8.2%
e 5916
 
5.9%
r 5185
 
5.2%
s 4536
 
4.5%
. 4495
 
4.5%
n 4470
 
4.4%
o 4355
 
4.3%
u 4307
 
4.3%
Other values (51) 38264
38.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 74282
73.9%
Space Separator 10052
 
10.0%
Uppercase Letter 8989
 
8.9%
Other Punctuation 4950
 
4.9%
Open Punctuation 1106
 
1.1%
Close Punctuation 1106
 
1.1%
Dash Punctuation 37
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 10703
14.4%
i 8239
11.1%
e 5916
 
8.0%
r 5185
 
7.0%
s 4536
 
6.1%
n 4470
 
6.0%
o 4355
 
5.9%
u 4307
 
5.8%
l 3966
 
5.3%
c 3099
 
4.2%
Other values (16) 19506
26.3%
Uppercase Letter
ValueCountFrequency (%)
L 927
 
10.3%
M 797
 
8.9%
S 780
 
8.7%
C 665
 
7.4%
N 629
 
7.0%
T 556
 
6.2%
H 543
 
6.0%
P 499
 
5.6%
A 471
 
5.2%
K 450
 
5.0%
Other values (16) 2672
29.7%
Other Punctuation
ValueCountFrequency (%)
. 4495
90.8%
& 364
 
7.4%
? 78
 
1.6%
' 8
 
0.2%
, 5
 
0.1%
Space Separator
ValueCountFrequency (%)
10052
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1106
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1106
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 83271
82.8%
Common 17251
 
17.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 10703
 
12.9%
i 8239
 
9.9%
e 5916
 
7.1%
r 5185
 
6.2%
s 4536
 
5.4%
n 4470
 
5.4%
o 4355
 
5.2%
u 4307
 
5.2%
l 3966
 
4.8%
c 3099
 
3.7%
Other values (42) 28495
34.2%
Common
ValueCountFrequency (%)
10052
58.3%
. 4495
26.1%
( 1106
 
6.4%
) 1106
 
6.4%
& 364
 
2.1%
? 78
 
0.5%
- 37
 
0.2%
' 8
 
< 0.1%
, 5
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100522
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 10703
 
10.6%
10052
 
10.0%
i 8239
 
8.2%
e 5916
 
5.9%
r 5185
 
5.2%
s 4536
 
4.5%
. 4495
 
4.5%
n 4470
 
4.4%
o 4355
 
4.3%
u 4307
 
4.3%
Other values (51) 38264
38.1%
Distinct150
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size23.3 KiB
2023-12-13T07:17:12.213638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.5688724
Min length2

Characters and Unicode

Total characters10571
Distinct characters192
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)1.1%

Sample

1st row메꽃과
2nd row메꽃과
3rd row메꽃과
4th row메꽃과
5th row메꽃과
ValueCountFrequency (%)
국화과 231
 
7.8%
장미과 210
 
7.1%
사초과 200
 
6.8%
벼과 196
 
6.6%
미나리아재비과 123
 
4.2%
콩과 118
 
4.0%
백합과 104
 
3.5%
꿀풀과 96
 
3.2%
난초과 93
 
3.1%
현삼과 76
 
2.6%
Other values (140) 1515
51.1%
2023-12-13T07:17:12.541051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2962
28.0%
554
 
5.2%
391
 
3.7%
389
 
3.7%
339
 
3.2%
250
 
2.4%
246
 
2.3%
244
 
2.3%
217
 
2.1%
205
 
1.9%
Other values (182) 4774
45.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10571
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2962
28.0%
554
 
5.2%
391
 
3.7%
389
 
3.7%
339
 
3.2%
250
 
2.4%
246
 
2.3%
244
 
2.3%
217
 
2.1%
205
 
1.9%
Other values (182) 4774
45.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10571
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2962
28.0%
554
 
5.2%
391
 
3.7%
389
 
3.7%
339
 
3.2%
250
 
2.4%
246
 
2.3%
244
 
2.3%
217
 
2.1%
205
 
1.9%
Other values (182) 4774
45.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10571
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2962
28.0%
554
 
5.2%
391
 
3.7%
389
 
3.7%
339
 
3.2%
250
 
2.4%
246
 
2.3%
244
 
2.3%
217
 
2.1%
205
 
1.9%
Other values (182) 4774
45.2%

과명
Text

Distinct150
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size23.3 KiB
2023-12-13T07:17:12.745009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length10.343687
Min length7

Characters and Unicode

Total characters30638
Distinct characters45
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)1.1%

Sample

1st rowConvolvulaceae
2nd rowConvolvulaceae
3rd rowConvolvulaceae
4th rowConvolvulaceae
5th rowConvolvulaceae
ValueCountFrequency (%)
asteraceae 231
 
7.8%
rosaceae 210
 
7.1%
cyperaceae 200
 
6.8%
poaceae 196
 
6.6%
ranunculaceae 123
 
4.2%
fabaceae 118
 
4.0%
liliaceae 104
 
3.5%
lamiaceae 96
 
3.2%
orchidaceae 93
 
3.1%
scrophulariaceae 76
 
2.6%
Other values (140) 1515
51.1%
2023-12-13T07:17:13.048585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 7234
23.6%
e 6755
22.0%
c 3562
11.6%
r 1458
 
4.8%
i 1376
 
4.5%
l 1065
 
3.5%
o 1034
 
3.4%
n 751
 
2.5%
s 727
 
2.4%
p 662
 
2.2%
Other values (35) 6014
19.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 27676
90.3%
Uppercase Letter 2962
 
9.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 7234
26.1%
e 6755
24.4%
c 3562
12.9%
r 1458
 
5.3%
i 1376
 
5.0%
l 1065
 
3.8%
o 1034
 
3.7%
n 751
 
2.7%
s 727
 
2.6%
p 662
 
2.4%
Other values (14) 3052
11.0%
Uppercase Letter
ValueCountFrequency (%)
C 486
16.4%
A 412
13.9%
R 408
13.8%
P 382
12.9%
L 235
7.9%
O 169
 
5.7%
S 169
 
5.7%
F 146
 
4.9%
B 126
 
4.3%
E 96
 
3.2%
Other values (11) 333
11.2%

Most occurring scripts

ValueCountFrequency (%)
Latin 30638
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 7234
23.6%
e 6755
22.0%
c 3562
11.6%
r 1458
 
4.8%
i 1376
 
4.5%
l 1065
 
3.5%
o 1034
 
3.4%
n 751
 
2.5%
s 727
 
2.4%
p 662
 
2.2%
Other values (35) 6014
19.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 30638
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 7234
23.6%
e 6755
22.0%
c 3562
11.6%
r 1458
 
4.8%
i 1376
 
4.5%
l 1065
 
3.5%
o 1034
 
3.4%
n 751
 
2.5%
s 727
 
2.4%
p 662
 
2.2%
Other values (35) 6014
19.6%

개화기시작(월)
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.0955436
Minimum1
Maximum11
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.2 KiB
2023-12-13T07:17:13.155329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q15
median6
Q37
95-th percentile8
Maximum11
Range10
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.4687975
Coefficient of variation (CV)0.24096252
Kurtosis-0.55902823
Mean6.0955436
Median Absolute Deviation (MAD)1
Skewness0.057958035
Sum18055
Variance2.1573662
MonotonicityNot monotonic
2023-12-13T07:17:13.243727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
5 727
24.5%
7 689
23.3%
6 562
19.0%
8 448
15.1%
4 367
12.4%
9 90
 
3.0%
3 53
 
1.8%
10 17
 
0.6%
2 5
 
0.2%
1 2
 
0.1%
ValueCountFrequency (%)
1 2
 
0.1%
2 5
 
0.2%
3 53
 
1.8%
4 367
12.4%
5 727
24.5%
6 562
19.0%
7 689
23.3%
8 448
15.1%
9 90
 
3.0%
10 17
 
0.6%
ValueCountFrequency (%)
11 2
 
0.1%
10 17
 
0.6%
9 90
 
3.0%
8 448
15.1%
7 689
23.3%
6 562
19.0%
5 727
24.5%
4 367
12.4%
3 53
 
1.8%
2 5
 
0.2%

결실기(월)
Real number (ℝ)

HIGH CORRELATION 

Distinct10
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.1313302
Minimum3
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.2 KiB
2023-12-13T07:17:13.329787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile4
Q16
median7
Q38
95-th percentile10
Maximum12
Range9
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.6730286
Coefficient of variation (CV)0.23460261
Kurtosis-0.71453442
Mean7.1313302
Median Absolute Deviation (MAD)1
Skewness-0.055646491
Sum21123
Variance2.7990248
MonotonicityNot monotonic
2023-12-13T07:17:13.408618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
8 773
26.1%
6 518
17.5%
5 450
15.2%
7 446
15.1%
9 408
13.8%
10 193
 
6.5%
4 137
 
4.6%
11 18
 
0.6%
3 13
 
0.4%
12 6
 
0.2%
ValueCountFrequency (%)
3 13
 
0.4%
4 137
 
4.6%
5 450
15.2%
6 518
17.5%
7 446
15.1%
8 773
26.1%
9 408
13.8%
10 193
 
6.5%
11 18
 
0.6%
12 6
 
0.2%
ValueCountFrequency (%)
12 6
 
0.2%
11 18
 
0.6%
10 193
 
6.5%
9 408
13.8%
8 773
26.1%
7 446
15.1%
6 518
17.5%
5 450
15.2%
4 137
 
4.6%
3 13
 
0.4%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
False
2554 
True
408 
ValueCountFrequency (%)
False 2554
86.2%
True 408
 
13.8%
2023-12-13T07:17:13.488034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

특산식물여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
False
2761 
True
 
201
ValueCountFrequency (%)
False 2761
93.2%
True 201
 
6.8%
2023-12-13T07:17:13.549677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

크기
Text

Distinct1875
Distinct (%)63.3%
Missing0
Missing (%)0.0%
Memory size23.3 KiB
2023-12-13T07:17:13.749318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length46
Mean length12.612762
Min length2

Characters and Unicode

Total characters37359
Distinct characters160
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1446 ?
Unique (%)48.8%

Sample

1st row길이 50cm 내외이다.
2nd row줄기의 길이가 1m에 이른다.
3rd row길이가 20-70cm 정도로 자란다.
4th row길이가 20-70cm정도로 자란다.
5th row높이가 60cm 정도이다.
ValueCountFrequency (%)
높이 1915
24.1%
높이가 333
 
4.2%
자란다 324
 
4.1%
높이는 320
 
4.0%
길이 169
 
2.1%
1m 135
 
1.7%
정도 132
 
1.7%
달한다 125
 
1.6%
지름 123
 
1.5%
이른다 119
 
1.5%
Other values (1193) 4258
53.5%
2023-12-13T07:17:14.204910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5075
13.6%
3805
 
10.2%
0 3379
 
9.0%
m 2872
 
7.7%
2572
 
6.9%
. 2176
 
5.8%
c 1719
 
4.6%
1498
 
4.0%
1 1491
 
4.0%
5 1223
 
3.3%
Other values (150) 11549
30.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13606
36.4%
Decimal Number 9337
25.0%
Space Separator 5075
 
13.6%
Lowercase Letter 4591
 
12.3%
Other Punctuation 2408
 
6.4%
Dash Punctuation 1162
 
3.1%
Math Symbol 789
 
2.1%
Other Symbol 363
 
1.0%
Close Punctuation 14
 
< 0.1%
Open Punctuation 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3805
28.0%
2572
18.9%
1498
 
11.0%
561
 
4.1%
554
 
4.1%
475
 
3.5%
467
 
3.4%
340
 
2.5%
324
 
2.4%
255
 
1.9%
Other values (126) 2755
20.2%
Decimal Number
ValueCountFrequency (%)
0 3379
36.2%
1 1491
16.0%
5 1223
 
13.1%
2 934
 
10.0%
3 873
 
9.3%
4 447
 
4.8%
6 366
 
3.9%
8 279
 
3.0%
7 263
 
2.8%
9 82
 
0.9%
Other Punctuation
ValueCountFrequency (%)
. 2176
90.4%
, 226
 
9.4%
: 6
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 769
97.5%
19
 
2.4%
1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
m 2872
62.6%
c 1719
37.4%
Other Symbol
ValueCountFrequency (%)
362
99.7%
1
 
0.3%
Space Separator
ValueCountFrequency (%)
5075
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1162
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 19162
51.3%
Hangul 13588
36.4%
Latin 4591
 
12.3%
Han 18
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3805
28.0%
2572
18.9%
1498
 
11.0%
561
 
4.1%
554
 
4.1%
475
 
3.5%
467
 
3.4%
340
 
2.5%
324
 
2.4%
255
 
1.9%
Other values (124) 2737
20.1%
Common
ValueCountFrequency (%)
5075
26.5%
0 3379
17.6%
. 2176
11.4%
1 1491
 
7.8%
5 1223
 
6.4%
- 1162
 
6.1%
2 934
 
4.9%
3 873
 
4.6%
~ 769
 
4.0%
4 447
 
2.3%
Other values (12) 1633
 
8.5%
Latin
ValueCountFrequency (%)
m 2872
62.6%
c 1719
37.4%
Han
ValueCountFrequency (%)
9
50.0%
9
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 23370
62.6%
Hangul 13588
36.4%
CJK Compat 363
 
1.0%
Math Operators 19
 
0.1%
CJK 18
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5075
21.7%
0 3379
14.5%
m 2872
12.3%
. 2176
9.3%
c 1719
 
7.4%
1 1491
 
6.4%
5 1223
 
5.2%
- 1162
 
5.0%
2 934
 
4.0%
3 873
 
3.7%
Other values (10) 2466
10.6%
Hangul
ValueCountFrequency (%)
3805
28.0%
2572
18.9%
1498
 
11.0%
561
 
4.1%
554
 
4.1%
475
 
3.5%
467
 
3.4%
340
 
2.5%
324
 
2.4%
255
 
1.9%
Other values (124) 2737
20.1%
CJK Compat
ValueCountFrequency (%)
362
99.7%
1
 
0.3%
Math Operators
ValueCountFrequency (%)
19
100.0%
CJK
ValueCountFrequency (%)
9
50.0%
9
50.0%
None
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-13T07:17:10.036430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:17:09.328775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:17:09.548431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:17:10.111986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:17:09.406205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:17:09.867592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:17:10.185378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:17:09.478140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:17:09.953811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:17:14.303673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번개화기시작(월)결실기(월)보호식물여부특산식물여부
연번1.0000.3970.4860.1240.086
개화기시작(월)0.3971.0000.9070.0000.000
결실기(월)0.4860.9071.0000.0800.096
보호식물여부0.1240.0000.0801.0000.191
특산식물여부0.0860.0000.0960.1911.000
2023-12-13T07:17:14.411016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
보호식물여부특산식물여부
보호식물여부1.0000.123
특산식물여부0.1231.000
2023-12-13T07:17:14.538622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번개화기시작(월)결실기(월)보호식물여부특산식물여부
연번1.0000.1150.1370.0950.066
개화기시작(월)0.1151.0000.8690.0000.000
결실기(월)0.1370.8691.0000.0620.074
보호식물여부0.0950.0000.0621.0000.123
특산식물여부0.0660.0000.0740.1231.000

Missing values

2023-12-13T07:17:10.286083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:17:10.487200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번국명학명과국명과명개화기시작(월)결실기(월)보호식물여부특산식물여부크기
01실새삼Cuscuta australis R.Br.메꽃과Convolvulaceae78NN길이 50cm 내외이다.
12갯실새삼Cuscuta chinensis Lam.메꽃과Convolvulaceae78NN줄기의 길이가 1m에 이른다.
23애기메꽃Calystegia hederacea Wall.메꽃과Convolvulaceae68NN길이가 20-70cm 정도로 자란다.
34큰메꽃Calystegia sepium (L.) R.Br.메꽃과Convolvulaceae68NN길이가 20-70cm정도로 자란다.
45선메꽃Calystegia dahurica (Herb.) Choisy메꽃과Convolvulaceae68NN높이가 60cm 정도이다.
56갯메꽃Calystegia soldanella (L.) Roem. & Schultb.메꽃과Convolvulaceae56NN길이는2~3cm, 폭은 3~5cm.
67방울꽃Strobilanthes oliganthus Miq.쥐꼬리망초과Acanthaceae99NN높이 30~60cm이다.
78쥐꼬리망초Justicia procumbens L.쥐꼬리망초과Acanthaceae79NN높이 30cm이다.
89물잎풀Hygrophila salicifolia (Vahl) Nees쥐꼬리망초과Acanthaceae99NN높이 30-60cm.
910애기도라지Wahlenbergia marginata (Thunb.) A.DC.초롱꽃과Campanulaceae68NN높이 20~40cm이다.
연번국명학명과국명과명개화기시작(월)결실기(월)보호식물여부특산식물여부크기
29522953눈까치밥나무Ribes triste Pall.까치밥나무과Grossulariaceae55NN길이 1.5m이내
29532954개앵도나무Ribes mandshuricum (Maxim.) Kom. var. subglabrum Kom.까치밥나무과Grossulariaceae55NN높이 2m에 달한다.
29542955넓은잎까치밥나무Ribes latifolium Jancz.까치밥나무과Grossulariaceae67NN높이 1~2m
29552956까마귀밥나무Ribes fasciculatum Siebold & Zucc. var. chinense Maxim.까치밥나무과Grossulariaceae45NN높이 1 ~ 1.5m.
29562957명자순Ribes maximowiczianum Kom.까치밥나무과Grossulariaceae44NN높이는 1m 안팎
29572958좀꼬리까치밥나무Ribes komarovii Pojark. var. breviracemum (Nakai) T.B.Lee까치밥나무과Grossulariaceae44NN높이 2.5m에 달한다.
29582959겨우살이Viscum album L. var. coloratum (Kom.) Ohwi겨우살이과Viscaceae44NN높이 40~60cm, 지름 1m.
29592960붉은겨우살이Viscum album f. rubroauranticum (Makino) Ohwi겨우살이과Viscaceae44NN높이 40-60cm. 직경이 1m에 달하는 것도 있다.
29602961동백나무겨우살이Korthalsella japonica (Thunb.) Engl.겨우살이과Viscaceae45NN높이 5-30cm정도로 자란다.
29612962뚜껑덩굴Actinostemma lobatum (Maxim.) Maxim. ex Franch. & Sav.박과Cucurbitaceae89NN줄기의 길이가 2m에 달한다.