Overview

Dataset statistics

Number of variables50
Number of observations654
Missing cells3506
Missing cells (%)10.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory260.1 KiB
Average record size in memory407.2 B

Variable types

Numeric6
Text9
Categorical34
Unsupported1

Alerts

last_load_dttm has constant value ""Constant
platanus_orientalis is highly imbalanced (82.9%)Imbalance
platanus is highly imbalanced (80.3%)Imbalance
sophora_japonica is highly imbalanced (83.7%)Imbalance
metasequoia is highly imbalanced (82.4%)Imbalance
horse_chestnut is highly imbalanced (83.7%)Imbalance
acer_buergerianum is highly imbalanced (81.9%)Imbalance
celtis_sinensis is highly imbalanced (83.4%)Imbalance
tulipifera is highly imbalanced (83.1%)Imbalance
acer_palmatum is highly imbalanced (82.6%)Imbalance
firmiana_simplex is highly imbalanced (81.5%)Imbalance
persimmon is highly imbalanced (76.5%)Imbalance
cornus_kousa is highly imbalanced (81.0%)Imbalance
chinese_quince is highly imbalanced (76.5%)Imbalance
goldenrain_tree is highly imbalanced (79.1%)Imbalance
cinnamon_tree is highly imbalanced (76.5%)Imbalance
ailanthus_altissima is highly imbalanced (76.5%)Imbalance
amur_cork_tree is highly imbalanced (76.5%)Imbalance
babylon_willow is highly imbalanced (71.4%)Imbalance
three_flowered_maple is highly imbalanced (76.5%)Imbalance
japanese_elm is highly imbalanced (71.4%)Imbalance
jujube is highly imbalanced (76.5%)Imbalance
pinus_thunbergii is highly imbalanced (81.8%)Imbalance
myrsinaleaf_oak is highly imbalanced (81.6%)Imbalance
castanopsis_sieboldii is highly imbalanced (83.3%)Imbalance
cedrus_deodara is highly imbalanced (82.3%)Imbalance
camphor_tree is highly imbalanced (81.7%)Imbalance
torulosa is highly imbalanced (81.7%)Imbalance
neolitsea_sericea is highly imbalanced (79.1%)Imbalance
taxus_cuspidata is highly imbalanced (76.5%)Imbalance
sweet_viburnum is highly imbalanced (71.4%)Imbalance
etc_tree is highly imbalanced (76.5%)Imbalance
reference_date is highly imbalanced (51.0%)Imbalance
plant_distance has 120 (18.3%) missing valuesMissing
prunus_yedoensis has 372 (56.9%) missing valuesMissing
ginkgo has 359 (54.9%) missing valuesMissing
sawleaf_zelkova has 440 (67.3%) missing valuesMissing
chinese_fringe_tree has 498 (76.1%) missing valuesMissing
pin_oak has 654 (100.0%) missing valuesMissing
silver_magnolia has 531 (81.2%) missing valuesMissing
kurogane_holly has 525 (80.3%) missing valuesMissing
skey has unique valuesUnique
pin_oak is an unsupported type, check if it needs cleaning or further analysisUnsupported
chinese_fringe_tree has 49 (7.5%) zerosZeros

Reproduction

Analysis started2024-04-16 05:53:04.523603
Analysis finished2024-04-16 05:53:05.518042
Duration0.99 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

skey
Real number (ℝ)

UNIQUE 

Distinct654
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4480.3364
Minimum3641
Maximum4842
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2024-04-16T14:53:05.571421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3641
5-th percentile4192.65
Q14323.25
median4515.5
Q34678.75
95-th percentile4809.35
Maximum4842
Range1201
Interquartile range (IQR)355.5

Descriptive statistics

Standard deviation259.11567
Coefficient of variation (CV)0.057833976
Kurtosis2.1748093
Mean4480.3364
Median Absolute Deviation (MAD)178
Skewness-1.2202589
Sum2930140
Variance67140.928
MonotonicityNot monotonic
2024-04-16T14:53:05.691985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4544 1
 
0.2%
4378 1
 
0.2%
4398 1
 
0.2%
4399 1
 
0.2%
4400 1
 
0.2%
4801 1
 
0.2%
4203 1
 
0.2%
4204 1
 
0.2%
4205 1
 
0.2%
4206 1
 
0.2%
Other values (644) 644
98.5%
ValueCountFrequency (%)
3641 1
0.2%
3642 1
0.2%
3643 1
0.2%
3644 1
0.2%
3645 1
0.2%
3646 1
0.2%
3647 1
0.2%
3648 1
0.2%
3649 1
0.2%
3650 1
0.2%
ValueCountFrequency (%)
4842 1
0.2%
4841 1
0.2%
4840 1
0.2%
4839 1
0.2%
4838 1
0.2%
4837 1
0.2%
4836 1
0.2%
4835 1
0.2%
4834 1
0.2%
4833 1
0.2%

loc_nm
Text

Distinct627
Distinct (%)96.0%
Missing1
Missing (%)0.2%
Memory size5.2 KiB
2024-04-16T14:53:05.862963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length24
Mean length14.719755
Min length3

Characters and Unicode

Total characters9612
Distinct characters269
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique612 ?
Unique (%)93.7%

Sample

1st row부산광역시 해운대구 동백로
2nd row부산광역시 해운대구 달맞이길
3rd row부산광역시 해운대구 좌동순환로
4th row부산광역시 해운대구 좌동순환로 15번길
5th row부산광역시 해운대구 좌동순환로 468번길
ValueCountFrequency (%)
부산광역시 559
29.0%
해운대구 67
 
3.5%
부산진구 63
 
3.3%
기장군 60
 
3.1%
사하구 49
 
2.5%
북구 48
 
2.5%
동래구 48
 
2.5%
사상구 43
 
2.2%
영도구 30
 
1.6%
동구 29
 
1.5%
Other values (610) 929
48.3%
2024-04-16T14:53:06.146791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1389
 
14.5%
703
 
7.3%
643
 
6.7%
629
 
6.5%
582
 
6.1%
575
 
6.0%
559
 
5.8%
532
 
5.5%
226
 
2.4%
226
 
2.4%
Other values (259) 3548
36.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7553
78.6%
Space Separator 1389
 
14.5%
Decimal Number 505
 
5.3%
Open Punctuation 65
 
0.7%
Close Punctuation 65
 
0.7%
Other Punctuation 17
 
0.2%
Math Symbol 9
 
0.1%
Uppercase Letter 8
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
703
 
9.3%
643
 
8.5%
629
 
8.3%
582
 
7.7%
575
 
7.6%
559
 
7.4%
532
 
7.0%
226
 
3.0%
226
 
3.0%
167
 
2.2%
Other values (233) 2711
35.9%
Decimal Number
ValueCountFrequency (%)
1 97
19.2%
3 72
14.3%
2 67
13.3%
0 50
9.9%
4 49
9.7%
5 39
7.7%
9 37
 
7.3%
7 36
 
7.1%
6 34
 
6.7%
8 24
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
C 2
25.0%
I 1
12.5%
G 1
12.5%
L 1
12.5%
E 1
12.5%
P 1
12.5%
A 1
12.5%
Math Symbol
ValueCountFrequency (%)
~ 6
66.7%
+ 2
 
22.2%
1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
. 15
88.2%
, 2
 
11.8%
Space Separator
ValueCountFrequency (%)
1389
100.0%
Open Punctuation
ValueCountFrequency (%)
( 65
100.0%
Close Punctuation
ValueCountFrequency (%)
) 65
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7553
78.6%
Common 2051
 
21.3%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
703
 
9.3%
643
 
8.5%
629
 
8.3%
582
 
7.7%
575
 
7.6%
559
 
7.4%
532
 
7.0%
226
 
3.0%
226
 
3.0%
167
 
2.2%
Other values (233) 2711
35.9%
Common
ValueCountFrequency (%)
1389
67.7%
1 97
 
4.7%
3 72
 
3.5%
2 67
 
3.3%
( 65
 
3.2%
) 65
 
3.2%
0 50
 
2.4%
4 49
 
2.4%
5 39
 
1.9%
9 37
 
1.8%
Other values (9) 121
 
5.9%
Latin
ValueCountFrequency (%)
C 2
25.0%
I 1
12.5%
G 1
12.5%
L 1
12.5%
E 1
12.5%
P 1
12.5%
A 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7553
78.6%
ASCII 2058
 
21.4%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1389
67.5%
1 97
 
4.7%
3 72
 
3.5%
2 67
 
3.3%
( 65
 
3.2%
) 65
 
3.2%
0 50
 
2.4%
4 49
 
2.4%
5 39
 
1.9%
9 37
 
1.8%
Other values (15) 128
 
6.2%
Hangul
ValueCountFrequency (%)
703
 
9.3%
643
 
8.5%
629
 
8.3%
582
 
7.7%
575
 
7.6%
559
 
7.4%
532
 
7.0%
226
 
3.0%
226
 
3.0%
167
 
2.2%
Other values (233) 2711
35.9%
Math Operators
ValueCountFrequency (%)
1
100.0%

lat
Real number (ℝ)

Distinct616
Distinct (%)94.3%
Missing1
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean35.160187
Minimum35.030373
Maximum35.374455
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2024-04-16T14:53:06.254262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum35.030373
5-th percentile35.072638
Q135.101761
median35.153641
Q335.200214
95-th percentile35.310974
Maximum35.374455
Range0.344082
Interquartile range (IQR)0.098453

Descriptive statistics

Standard deviation0.068431644
Coefficient of variation (CV)0.0019462821
Kurtosis0.088836381
Mean35.160187
Median Absolute Deviation (MAD)0.050580917
Skewness0.73245927
Sum22959.602
Variance0.0046828898
MonotonicityNot monotonic
2024-04-16T14:53:06.372978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35.208078 4
 
0.6%
35.101713 3
 
0.5%
35.16230000334156 3
 
0.5%
35.15545936863982 3
 
0.5%
35.191714 3
 
0.5%
35.095758 3
 
0.5%
35.232297 2
 
0.3%
35.1256321 2
 
0.3%
35.197626 2
 
0.3%
35.197558 2
 
0.3%
Other values (606) 626
95.7%
ValueCountFrequency (%)
35.030373 1
0.2%
35.031321 1
0.2%
35.049232 1
0.2%
35.05039 1
0.2%
35.053596 1
0.2%
35.053981 1
0.2%
35.056847 1
0.2%
35.060066 1
0.2%
35.060767 1
0.2%
35.061426 1
0.2%
ValueCountFrequency (%)
35.374455 1
0.2%
35.370551 1
0.2%
35.357995 1
0.2%
35.351631 1
0.2%
35.344148 1
0.2%
35.340786 1
0.2%
35.340124 1
0.2%
35.340092 1
0.2%
35.338915 1
0.2%
35.337265 1
0.2%

lng
Real number (ℝ)

Distinct616
Distinct (%)94.3%
Missing1
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean129.04961
Minimum128.81688
Maximum129.27168
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2024-04-16T14:53:06.492364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum128.81688
5-th percentile128.88323
Q1129.00913
median129.05906
Q3129.09361
95-th percentile129.19396
Maximum129.27168
Range0.454801
Interquartile range (IQR)0.084477

Descriptive statistics

Standard deviation0.079907654
Coefficient of variation (CV)0.00061920103
Kurtosis0.9212496
Mean129.04961
Median Absolute Deviation (MAD)0.042769
Skewness-0.22853801
Sum84269.398
Variance0.0063852332
MonotonicityNot monotonic
2024-04-16T14:53:06.827258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
129.069547 4
 
0.6%
129.153866 3
 
0.5%
128.98593871486636 3
 
0.5%
129.100261 3
 
0.5%
128.868141 3
 
0.5%
128.99133288259324 3
 
0.5%
129.09549 2
 
0.3%
128.959676 2
 
0.3%
129.091161 2
 
0.3%
129.08786 2
 
0.3%
Other values (606) 626
95.7%
ValueCountFrequency (%)
128.816884 1
0.2%
128.81853 1
0.2%
128.820129 1
0.2%
128.827667 1
0.2%
128.83021 1
0.2%
128.831475 1
0.2%
128.831808 1
0.2%
128.831938 1
0.2%
128.833038 1
0.2%
128.833334 1
0.2%
ValueCountFrequency (%)
129.271685 1
0.2%
129.269173 1
0.2%
129.258504 1
0.2%
129.258446 1
0.2%
129.255467 1
0.2%
129.253591 1
0.2%
129.249412 1
0.2%
129.243941 1
0.2%
129.243469 1
0.2%
129.242121 1
0.2%
Distinct574
Distinct (%)87.9%
Missing1
Missing (%)0.2%
Memory size5.2 KiB
2024-04-16T14:53:07.052666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length6.0290965
Min length3

Characters and Unicode

Total characters3937
Distinct characters346
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique514 ?
Unique (%)78.7%

Sample

1st row운촌삼거리
2nd row미포육거리
3rd row중동 E-마트
4th row수영세무서
5th row동일아파트 앞 사거리
ValueCountFrequency (%)
10
 
1.3%
아파트 9
 
1.1%
8
 
1.0%
삼거리 6
 
0.8%
충렬대로 5
 
0.6%
백양대로 5
 
0.6%
입구 5
 
0.6%
경계 4
 
0.5%
사거리 4
 
0.5%
방통대 4
 
0.5%
Other values (620) 726
92.4%
2024-04-16T14:53:07.392743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
199
 
5.1%
180
 
4.6%
113
 
2.9%
92
 
2.3%
88
 
2.2%
83
 
2.1%
74
 
1.9%
72
 
1.8%
67
 
1.7%
66
 
1.7%
Other values (336) 2903
73.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3422
86.9%
Decimal Number 222
 
5.6%
Space Separator 199
 
5.1%
Uppercase Letter 62
 
1.6%
Dash Punctuation 13
 
0.3%
Open Punctuation 8
 
0.2%
Close Punctuation 8
 
0.2%
Math Symbol 1
 
< 0.1%
Other Symbol 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
180
 
5.3%
113
 
3.3%
92
 
2.7%
88
 
2.6%
83
 
2.4%
74
 
2.2%
72
 
2.1%
67
 
2.0%
66
 
1.9%
64
 
1.9%
Other values (304) 2523
73.7%
Uppercase Letter
ValueCountFrequency (%)
S 9
14.5%
K 9
14.5%
I 8
12.9%
C 7
11.3%
E 7
11.3%
W 4
6.5%
V 4
6.5%
L 4
6.5%
G 3
 
4.8%
N 2
 
3.2%
Other values (5) 5
8.1%
Decimal Number
ValueCountFrequency (%)
1 62
27.9%
2 36
16.2%
5 23
 
10.4%
0 19
 
8.6%
3 17
 
7.7%
6 16
 
7.2%
4 16
 
7.2%
7 13
 
5.9%
9 12
 
5.4%
8 8
 
3.6%
Space Separator
ValueCountFrequency (%)
199
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3423
86.9%
Common 452
 
11.5%
Latin 62
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
180
 
5.3%
113
 
3.3%
92
 
2.7%
88
 
2.6%
83
 
2.4%
74
 
2.2%
72
 
2.1%
67
 
2.0%
66
 
1.9%
64
 
1.9%
Other values (305) 2524
73.7%
Common
ValueCountFrequency (%)
199
44.0%
1 62
 
13.7%
2 36
 
8.0%
5 23
 
5.1%
0 19
 
4.2%
3 17
 
3.8%
6 16
 
3.5%
4 16
 
3.5%
- 13
 
2.9%
7 13
 
2.9%
Other values (6) 38
 
8.4%
Latin
ValueCountFrequency (%)
S 9
14.5%
K 9
14.5%
I 8
12.9%
C 7
11.3%
E 7
11.3%
W 4
6.5%
V 4
6.5%
L 4
6.5%
G 3
 
4.8%
N 2
 
3.2%
Other values (5) 5
8.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3422
86.9%
ASCII 514
 
13.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
199
38.7%
1 62
 
12.1%
2 36
 
7.0%
5 23
 
4.5%
0 19
 
3.7%
3 17
 
3.3%
6 16
 
3.1%
4 16
 
3.1%
- 13
 
2.5%
7 13
 
2.5%
Other values (21) 100
19.5%
Hangul
ValueCountFrequency (%)
180
 
5.3%
113
 
3.3%
92
 
2.7%
88
 
2.6%
83
 
2.4%
74
 
2.2%
72
 
2.1%
67
 
2.0%
66
 
1.9%
64
 
1.9%
Other values (304) 2523
73.7%
None
ValueCountFrequency (%)
1
100.0%
Distinct588
Distinct (%)90.2%
Missing2
Missing (%)0.3%
Memory size5.2 KiB
2024-04-16T14:53:07.633511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length17
Mean length6.2147239
Min length2

Characters and Unicode

Total characters4052
Distinct characters377
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique541 ?
Unique (%)83.0%

Sample

1st row해수욕장
2nd row송정터널
3rd row수영세무서
4th row중동지하차도
5th row동일아파트 101동
ValueCountFrequency (%)
입구 18
 
2.2%
경계 13
 
1.6%
아파트 12
 
1.5%
10
 
1.2%
10
 
1.2%
수영 4
 
0.5%
수영구경계 4
 
0.5%
부산은행 4
 
0.5%
동래구경계 4
 
0.5%
기장군 4
 
0.5%
Other values (644) 732
89.8%
2024-04-16T14:53:08.024529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
224
 
5.5%
122
 
3.0%
105
 
2.6%
96
 
2.4%
91
 
2.2%
89
 
2.2%
87
 
2.1%
83
 
2.0%
75
 
1.9%
61
 
1.5%
Other values (367) 3019
74.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3488
86.1%
Decimal Number 228
 
5.6%
Space Separator 224
 
5.5%
Uppercase Letter 67
 
1.7%
Dash Punctuation 14
 
0.3%
Close Punctuation 10
 
0.2%
Open Punctuation 10
 
0.2%
Other Punctuation 5
 
0.1%
Math Symbol 2
 
< 0.1%
Lowercase Letter 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
122
 
3.5%
105
 
3.0%
96
 
2.8%
91
 
2.6%
89
 
2.6%
87
 
2.5%
83
 
2.4%
75
 
2.2%
61
 
1.7%
59
 
1.7%
Other values (333) 2620
75.1%
Uppercase Letter
ValueCountFrequency (%)
I 12
17.9%
C 10
14.9%
S 9
13.4%
E 6
9.0%
K 6
9.0%
L 5
7.5%
W 5
7.5%
V 4
 
6.0%
G 3
 
4.5%
H 3
 
4.5%
Other values (3) 4
 
6.0%
Decimal Number
ValueCountFrequency (%)
1 57
25.0%
2 41
18.0%
3 26
11.4%
6 20
 
8.8%
5 19
 
8.3%
0 16
 
7.0%
4 15
 
6.6%
9 13
 
5.7%
7 11
 
4.8%
8 10
 
4.4%
Other Punctuation
ValueCountFrequency (%)
, 4
80.0%
; 1
 
20.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
50.0%
g 1
50.0%
Space Separator
ValueCountFrequency (%)
224
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3489
86.1%
Common 493
 
12.2%
Latin 70
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
122
 
3.5%
105
 
3.0%
96
 
2.8%
91
 
2.6%
89
 
2.6%
87
 
2.5%
83
 
2.4%
75
 
2.1%
61
 
1.7%
59
 
1.7%
Other values (334) 2621
75.1%
Common
ValueCountFrequency (%)
224
45.4%
1 57
 
11.6%
2 41
 
8.3%
3 26
 
5.3%
6 20
 
4.1%
5 19
 
3.9%
0 16
 
3.2%
4 15
 
3.0%
- 14
 
2.8%
9 13
 
2.6%
Other values (7) 48
 
9.7%
Latin
ValueCountFrequency (%)
I 12
17.1%
C 10
14.3%
S 9
12.9%
E 6
8.6%
K 6
8.6%
L 5
7.1%
W 5
7.1%
V 4
 
5.7%
G 3
 
4.3%
H 3
 
4.3%
Other values (6) 7
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3488
86.1%
ASCII 562
 
13.9%
None 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
224
39.9%
1 57
 
10.1%
2 41
 
7.3%
3 26
 
4.6%
6 20
 
3.6%
5 19
 
3.4%
0 16
 
2.8%
4 15
 
2.7%
- 14
 
2.5%
9 13
 
2.3%
Other values (22) 117
20.8%
Hangul
ValueCountFrequency (%)
122
 
3.5%
105
 
3.0%
96
 
2.8%
91
 
2.6%
89
 
2.6%
87
 
2.5%
83
 
2.4%
75
 
2.2%
61
 
1.7%
59
 
1.7%
Other values (333) 2620
75.1%
None
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

plant_distance
Text

MISSING 

Distinct342
Distinct (%)64.0%
Missing120
Missing (%)18.3%
Memory size5.2 KiB
2024-04-16T14:53:08.357576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.5543071
Min length1

Characters and Unicode

Total characters1898
Distinct characters13
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique250 ?
Unique (%)46.8%

Sample

1st row720
2nd row5200
3rd row6030
4th row200
5th row170
ValueCountFrequency (%)
100 13
 
2.4%
200 13
 
2.4%
500 12
 
2.2%
300 9
 
1.7%
900 8
 
1.5%
800 7
 
1.3%
350 7
 
1.3%
700 7
 
1.3%
0.2 6
 
1.1%
250 6
 
1.1%
Other values (293) 446
83.5%
2024-04-16T14:53:08.784267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 612
32.2%
1 198
 
10.4%
2 179
 
9.4%
5 132
 
7.0%
3 131
 
6.9%
4 110
 
5.8%
. 100
 
5.3%
6 99
 
5.2%
8 97
 
5.1%
77
 
4.1%
Other values (3) 163
 
8.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1691
89.1%
Other Punctuation 130
 
6.8%
Space Separator 77
 
4.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 612
36.2%
1 198
 
11.7%
2 179
 
10.6%
5 132
 
7.8%
3 131
 
7.7%
4 110
 
6.5%
6 99
 
5.9%
8 97
 
5.7%
7 69
 
4.1%
9 64
 
3.8%
Other Punctuation
ValueCountFrequency (%)
. 100
76.9%
, 30
 
23.1%
Space Separator
ValueCountFrequency (%)
77
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1898
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 612
32.2%
1 198
 
10.4%
2 179
 
9.4%
5 132
 
7.0%
3 131
 
6.9%
4 110
 
5.8%
. 100
 
5.3%
6 99
 
5.2%
8 97
 
5.1%
77
 
4.1%
Other values (3) 163
 
8.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1898
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 612
32.2%
1 198
 
10.4%
2 179
 
9.4%
5 132
 
7.0%
3 131
 
6.9%
4 110
 
5.8%
. 100
 
5.3%
6 99
 
5.2%
8 97
 
5.1%
77
 
4.1%
Other values (3) 163
 
8.6%

total
Real number (ℝ)

Distinct339
Distinct (%)51.9%
Missing1
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean252.88515
Minimum2
Maximum8659
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2024-04-16T14:53:08.913174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile13
Q141
median103
Q3252
95-th percentile976.6
Maximum8659
Range8657
Interquartile range (IQR)211

Descriptive statistics

Standard deviation517.83067
Coefficient of variation (CV)2.0476911
Kurtosis113.24475
Mean252.88515
Median Absolute Deviation (MAD)79
Skewness8.4259201
Sum165134
Variance268148.6
MonotonicityNot monotonic
2024-04-16T14:53:09.022255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20 11
 
1.7%
34 9
 
1.4%
13 8
 
1.2%
48 8
 
1.2%
25 8
 
1.2%
14 8
 
1.2%
62 6
 
0.9%
18 6
 
0.9%
16 6
 
0.9%
63 6
 
0.9%
Other values (329) 577
88.2%
ValueCountFrequency (%)
2 2
0.3%
3 1
 
0.2%
4 3
0.5%
5 2
0.3%
6 2
0.3%
7 2
0.3%
8 4
0.6%
9 3
0.5%
10 4
0.6%
11 4
0.6%
ValueCountFrequency (%)
8659 1
0.2%
3797 1
0.2%
2984 1
0.2%
2718 1
0.2%
2660 1
0.2%
2407 1
0.2%
2363 1
0.2%
2241 1
0.2%
2205 1
0.2%
1991 1
0.2%

prunus_yedoensis
Text

MISSING 

Distinct175
Distinct (%)62.1%
Missing372
Missing (%)56.9%
Memory size5.2 KiB
2024-04-16T14:53:09.285828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.2695035
Min length1

Characters and Unicode

Total characters640
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique123 ?
Unique (%)43.6%

Sample

1st row766
2nd row17
3rd row64
4th row384
5th row105
ValueCountFrequency (%)
0 32
 
11.4%
13 6
 
2.1%
28 5
 
1.8%
25 4
 
1.4%
17 4
 
1.4%
20 4
 
1.4%
45 4
 
1.4%
16 3
 
1.1%
46 3
 
1.1%
62 3
 
1.1%
Other values (159) 213
75.8%
2024-04-16T14:53:09.655704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 120
18.8%
2 87
13.6%
0 69
10.8%
4 65
10.2%
3 60
9.4%
5 53
8.3%
6 51
8.0%
8 48
 
7.5%
7 39
 
6.1%
9 38
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 630
98.4%
Space Separator 10
 
1.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 120
19.0%
2 87
13.8%
0 69
11.0%
4 65
10.3%
3 60
9.5%
5 53
8.4%
6 51
8.1%
8 48
 
7.6%
7 39
 
6.2%
9 38
 
6.0%
Space Separator
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 640
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 120
18.8%
2 87
13.6%
0 69
10.8%
4 65
10.2%
3 60
9.4%
5 53
8.3%
6 51
8.0%
8 48
 
7.5%
7 39
 
6.1%
9 38
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 640
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 120
18.8%
2 87
13.6%
0 69
10.8%
4 65
10.2%
3 60
9.4%
5 53
8.3%
6 51
8.0%
8 48
 
7.5%
7 39
 
6.1%
9 38
 
5.9%

ginkgo
Text

MISSING 

Distinct173
Distinct (%)58.6%
Missing359
Missing (%)54.9%
Memory size5.2 KiB
2024-04-16T14:53:09.918013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.159322
Min length1

Characters and Unicode

Total characters637
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique122 ?
Unique (%)41.4%

Sample

1st row23
2nd row0
3rd row168
4th row0
5th row119
ValueCountFrequency (%)
0 50
 
17.0%
41 7
 
2.4%
14 6
 
2.0%
66 6
 
2.0%
21 5
 
1.7%
2 4
 
1.4%
20 3
 
1.0%
44 3
 
1.0%
12 3
 
1.0%
23 3
 
1.0%
Other values (148) 204
69.4%
2024-04-16T14:53:10.308769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 105
16.5%
0 89
14.0%
2 81
12.7%
4 67
10.5%
3 64
10.0%
9 48
7.5%
6 47
7.4%
8 45
7.1%
5 38
 
6.0%
7 33
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 617
96.9%
Space Separator 20
 
3.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 105
17.0%
0 89
14.4%
2 81
13.1%
4 67
10.9%
3 64
10.4%
9 48
7.8%
6 47
7.6%
8 45
7.3%
5 38
 
6.2%
7 33
 
5.3%
Space Separator
ValueCountFrequency (%)
20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 637
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 105
16.5%
0 89
14.0%
2 81
12.7%
4 67
10.5%
3 64
10.0%
9 48
7.5%
6 47
7.4%
8 45
7.1%
5 38
 
6.0%
7 33
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 637
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 105
16.5%
0 89
14.0%
2 81
12.7%
4 67
10.5%
3 64
10.0%
9 48
7.5%
6 47
7.4%
8 45
7.1%
5 38
 
6.0%
7 33
 
5.2%

sawleaf_zelkova
Text

MISSING 

Distinct121
Distinct (%)56.5%
Missing440
Missing (%)67.3%
Memory size5.2 KiB
2024-04-16T14:53:10.523496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.0747664
Min length1

Characters and Unicode

Total characters444
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique80 ?
Unique (%)37.4%

Sample

1st row539
2nd row13
3rd row18
4th row333
5th row461
ValueCountFrequency (%)
0 35
 
16.4%
13 6
 
2.8%
16 5
 
2.3%
1 5
 
2.3%
17 4
 
1.9%
20 4
 
1.9%
27 4
 
1.9%
6 4
 
1.9%
31 3
 
1.4%
29 3
 
1.4%
Other values (104) 140
65.7%
2024-04-16T14:53:10.852181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 79
17.8%
0 68
15.3%
2 53
11.9%
3 50
11.3%
5 38
8.6%
7 33
7.4%
4 33
7.4%
6 30
 
6.8%
8 23
 
5.2%
9 22
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 429
96.6%
Space Separator 15
 
3.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 79
18.4%
0 68
15.9%
2 53
12.4%
3 50
11.7%
5 38
8.9%
7 33
7.7%
4 33
7.7%
6 30
 
7.0%
8 23
 
5.4%
9 22
 
5.1%
Space Separator
ValueCountFrequency (%)
15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 444
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 79
17.8%
0 68
15.3%
2 53
11.9%
3 50
11.3%
5 38
8.6%
7 33
7.4%
4 33
7.4%
6 30
 
6.8%
8 23
 
5.2%
9 22
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 444
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 79
17.8%
0 68
15.3%
2 53
11.9%
3 50
11.3%
5 38
8.6%
7 33
7.4%
4 33
7.4%
6 30
 
6.8%
8 23
 
5.2%
9 22
 
5.0%

platanus_orientalis
Categorical

IMBALANCE 

Distinct19
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
577 
0
59 
99
 
2
3
 
1
70
 
1
Other values (14)
 
14

Length

Max length4
Median length4
Mean length3.67737
Min length1

Unique

Unique16 ?
Unique (%)2.4%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 577
88.2%
0 59
 
9.0%
99 2
 
0.3%
3 1
 
0.2%
70 1
 
0.2%
66 1
 
0.2%
122 1
 
0.2%
464 1
 
0.2%
112 1
 
0.2%
130 1
 
0.2%
Other values (9) 9
 
1.4%

Length

2024-04-16T14:53:10.970203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 577
88.4%
0 59
 
9.0%
99 2
 
0.3%
8 1
 
0.2%
211 1
 
0.2%
54 1
 
0.2%
35 1
 
0.2%
44 1
 
0.2%
53 1
 
0.2%
30 1
 
0.2%
Other values (8) 8
 
1.2%

platanus
Categorical

IMBALANCE 

Distinct35
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
561 
0
 
56
1
 
2
3
 
2
27
 
2
Other values (30)
 
31

Length

Max length4
Median length4
Mean length3.6299694
Min length1

Unique

Unique29 ?
Unique (%)4.4%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 561
85.8%
0 56
 
8.6%
1 2
 
0.3%
3 2
 
0.3%
27 2
 
0.3%
7 2
 
0.3%
2 1
 
0.2%
11 1
 
0.2%
1
 
0.2%
21 1
 
0.2%
Other values (25) 25
 
3.8%

Length

2024-04-16T14:53:11.068104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 561
85.9%
0 56
 
8.6%
1 2
 
0.3%
3 2
 
0.3%
27 2
 
0.3%
7 2
 
0.3%
179 1
 
0.2%
49 1
 
0.2%
25 1
 
0.2%
3040 1
 
0.2%
Other values (24) 24
 
3.7%

chinese_fringe_tree
Real number (ℝ)

MISSING  ZEROS 

Distinct79
Distinct (%)50.6%
Missing498
Missing (%)76.1%
Infinite0
Infinite (%)0.0%
Mean90.320513
Minimum0
Maximum1686
Zeros49
Zeros (%)7.5%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2024-04-16T14:53:11.167184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median21
Q384.5
95-th percentile460.75
Maximum1686
Range1686
Interquartile range (IQR)84.5

Descriptive statistics

Standard deviation199.87089
Coefficient of variation (CV)2.212907
Kurtosis30.449419
Mean90.320513
Median Absolute Deviation (MAD)21
Skewness4.8580273
Sum14090
Variance39948.374
MonotonicityNot monotonic
2024-04-16T14:53:11.274889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 49
 
7.5%
17 4
 
0.6%
8 4
 
0.6%
34 3
 
0.5%
20 3
 
0.5%
48 3
 
0.5%
513 2
 
0.3%
96 2
 
0.3%
18 2
 
0.3%
50 2
 
0.3%
Other values (69) 82
 
12.5%
(Missing) 498
76.1%
ValueCountFrequency (%)
0 49
7.5%
1 2
 
0.3%
4 1
 
0.2%
5 2
 
0.3%
6 1
 
0.2%
8 4
 
0.6%
9 2
 
0.3%
10 1
 
0.2%
11 1
 
0.2%
12 1
 
0.2%
ValueCountFrequency (%)
1686 1
0.2%
1086 1
0.2%
771 1
0.2%
680 1
0.2%
574 1
0.2%
513 2
0.3%
481 1
0.2%
454 1
0.2%
329 1
0.2%
280 1
0.2%

sophora_japonica
Categorical

IMBALANCE 

Distinct21
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
583 
0
 
51
7
 
2
466
 
1
47
 
1
Other values (16)
 
16

Length

Max length4
Median length4
Mean length3.7110092
Min length1

Unique

Unique18 ?
Unique (%)2.8%

Sample

1st row<NA>
2nd row<NA>
3rd row466
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 583
89.1%
0 51
 
7.8%
7 2
 
0.3%
466 1
 
0.2%
47 1
 
0.2%
116 1
 
0.2%
66 1
 
0.2%
1
 
0.2%
14 1
 
0.2%
32 1
 
0.2%
Other values (11) 11
 
1.7%

Length

2024-04-16T14:53:11.392624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 583
89.3%
0 51
 
7.8%
7 2
 
0.3%
19 1
 
0.2%
325 1
 
0.2%
25 1
 
0.2%
9 1
 
0.2%
566 1
 
0.2%
923 1
 
0.2%
1149 1
 
0.2%
Other values (10) 10
 
1.5%

metasequoia
Categorical

IMBALANCE 

Distinct27
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
573 
0
 
55
45
 
2
398
 
1
163
 
1
Other values (22)
 
22

Length

Max length4
Median length4
Mean length3.6850153
Min length1

Unique

Unique24 ?
Unique (%)3.7%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 573
87.6%
0 55
 
8.4%
45 2
 
0.3%
398 1
 
0.2%
163 1
 
0.2%
416 1
 
0.2%
275 1
 
0.2%
13 1
 
0.2%
56 1
 
0.2%
7 1
 
0.2%
Other values (17) 17
 
2.6%

Length

2024-04-16T14:53:11.502771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 573
87.7%
0 55
 
8.4%
45 2
 
0.3%
16 1
 
0.2%
129 1
 
0.2%
134 1
 
0.2%
184 1
 
0.2%
40 1
 
0.2%
188 1
 
0.2%
33 1
 
0.2%
Other values (16) 16
 
2.5%

horse_chestnut
Categorical

IMBALANCE 

Distinct17
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
583 
0
 
56
183
 
1
103
 
1
337
 
1
Other values (12)
 
12

Length

Max length4
Median length4
Mean length3.7018349
Min length1

Unique

Unique15 ?
Unique (%)2.3%

Sample

1st row<NA>
2nd row<NA>
3rd row26
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 583
89.1%
0 56
 
8.6%
183 1
 
0.2%
103 1
 
0.2%
337 1
 
0.2%
38 1
 
0.2%
276 1
 
0.2%
1
 
0.2%
26 1
 
0.2%
194 1
 
0.2%
Other values (7) 7
 
1.1%

Length

2024-04-16T14:53:11.602966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 583
89.3%
0 56
 
8.6%
183 1
 
0.2%
103 1
 
0.2%
337 1
 
0.2%
38 1
 
0.2%
276 1
 
0.2%
26 1
 
0.2%
194 1
 
0.2%
3 1
 
0.2%
Other values (6) 6
 
0.9%

acer_buergerianum
Categorical

IMBALANCE 

Distinct7
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
592 
0
 
57
25
 
1
35
 
1
 
1
Other values (2)
 
2

Length

Max length4
Median length4
Mean length3.7262997
Min length1

Unique

Unique5 ?
Unique (%)0.8%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 592
90.5%
0 57
 
8.7%
25 1
 
0.2%
35 1
 
0.2%
1
 
0.2%
112 1
 
0.2%
2060 1
 
0.2%

Length

2024-04-16T14:53:11.703312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:11.793819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 592
90.7%
0 57
 
8.7%
25 1
 
0.2%
35 1
 
0.2%
112 1
 
0.2%
2060 1
 
0.2%

celtis_sinensis
Categorical

IMBALANCE 

Distinct11
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
588 
0
 
57
24
 
1
3
 
1
28
 
1
Other values (6)
 
6

Length

Max length4
Median length4
Mean length3.7125382
Min length1

Unique

Unique9 ?
Unique (%)1.4%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 588
89.9%
0 57
 
8.7%
24 1
 
0.2%
3 1
 
0.2%
28 1
 
0.2%
1
 
0.2%
47 1
 
0.2%
79 1
 
0.2%
288 1
 
0.2%
1061 1
 
0.2%

Length

2024-04-16T14:53:11.895076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 588
90.0%
0 57
 
8.7%
24 1
 
0.2%
3 1
 
0.2%
28 1
 
0.2%
47 1
 
0.2%
79 1
 
0.2%
288 1
 
0.2%
1061 1
 
0.2%
48 1
 
0.2%

tulipifera
Categorical

IMBALANCE 

Distinct21
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
577 
0
58 
98
 
1
12
 
1
3
 
1
Other values (16)
 
16

Length

Max length4
Median length4
Mean length3.6743119
Min length1

Unique

Unique19 ?
Unique (%)2.9%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 577
88.2%
0 58
 
8.9%
98 1
 
0.2%
12 1
 
0.2%
3 1
 
0.2%
51 1
 
0.2%
92 1
 
0.2%
36 1
 
0.2%
45 1
 
0.2%
167 1
 
0.2%
Other values (11) 11
 
1.7%

Length

2024-04-16T14:53:11.996344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 577
88.4%
0 58
 
8.9%
13 1
 
0.2%
47 1
 
0.2%
28 1
 
0.2%
1 1
 
0.2%
10 1
 
0.2%
8 1
 
0.2%
40 1
 
0.2%
141 1
 
0.2%
Other values (10) 10
 
1.5%

acer_palmatum
Categorical

IMBALANCE 

Distinct9
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
588 
0
59 
28
 
1
181
 
1
28
 
1
Other values (4)
 
4

Length

Max length4
Median length4
Mean length3.7110092
Min length1

Unique

Unique7 ?
Unique (%)1.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 588
89.9%
0 59
 
9.0%
28 1
 
0.2%
181 1
 
0.2%
28 1
 
0.2%
48 1
 
0.2%
63 1
 
0.2%
1
 
0.2%
15 1
 
0.2%

Length

2024-04-16T14:53:12.109804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:12.212401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 588
90.0%
0 59
 
9.0%
28 2
 
0.3%
181 1
 
0.2%
48 1
 
0.2%
63 1
 
0.2%
15 1
 
0.2%

firmiana_simplex
Categorical

IMBALANCE 

Distinct7
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
590 
0
 
59
1
 
1
117
 
1
3
 
1
Other values (2)
 
2

Length

Max length4
Median length4
Mean length3.7094801
Min length1

Unique

Unique5 ?
Unique (%)0.8%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 590
90.2%
0 59
 
9.0%
1 1
 
0.2%
117 1
 
0.2%
3 1
 
0.2%
5 1
 
0.2%
1
 
0.2%

Length

2024-04-16T14:53:12.317966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:12.407473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 590
90.4%
0 59
 
9.0%
1 1
 
0.2%
117 1
 
0.2%
3 1
 
0.2%
5 1
 
0.2%

pin_oak
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing654
Missing (%)100.0%
Memory size5.9 KiB

persimmon
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
593 
0
 
59
14
 
1
 
1

Length

Max length4
Median length4
Mean length3.7217125
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 593
90.7%
0 59
 
9.0%
14 1
 
0.2%
1
 
0.2%

Length

2024-04-16T14:53:12.513129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:12.606397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 593
90.8%
0 59
 
9.0%
14 1
 
0.2%

cornus_kousa
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
593 
0
 
57
 
1
85
 
1
44
 
1

Length

Max length4
Median length4
Mean length3.7247706
Min length1

Unique

Unique4 ?
Unique (%)0.6%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 593
90.7%
0 57
 
8.7%
1
 
0.2%
85 1
 
0.2%
44 1
 
0.2%
51 1
 
0.2%

Length

2024-04-16T14:53:12.712127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:12.806808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 593
90.8%
0 57
 
8.7%
85 1
 
0.2%
44 1
 
0.2%
51 1
 
0.2%

chinese_quince
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
593 
0
 
59
27
 
1
 
1

Length

Max length4
Median length4
Mean length3.7217125
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 593
90.7%
0 59
 
9.0%
27 1
 
0.2%
1
 
0.2%

Length

2024-04-16T14:53:12.904682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:12.992315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 593
90.8%
0 59
 
9.0%
27 1
 
0.2%

goldenrain_tree
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
592 
0
 
59
 
1
30
 
1
9
 
1

Length

Max length4
Median length4
Mean length3.7171254
Min length1

Unique

Unique3 ?
Unique (%)0.5%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 592
90.5%
0 59
 
9.0%
1
 
0.2%
30 1
 
0.2%
9 1
 
0.2%

Length

2024-04-16T14:53:13.094170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:13.203002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 592
90.7%
0 59
 
9.0%
30 1
 
0.2%
9 1
 
0.2%

cinnamon_tree
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
593 
0
 
59
14
 
1
 
1

Length

Max length4
Median length4
Mean length3.7217125
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row<NA>
2nd row14
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 593
90.7%
0 59
 
9.0%
14 1
 
0.2%
1
 
0.2%

Length

2024-04-16T14:53:13.318706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:13.411949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 593
90.8%
0 59
 
9.0%
14 1
 
0.2%

ailanthus_altissima
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
593 
0
 
59
8
 
1
 
1

Length

Max length4
Median length4
Mean length3.7217125
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 593
90.7%
0 59
 
9.0%
8 1
 
0.2%
1
 
0.2%

Length

2024-04-16T14:53:13.504297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:13.589004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 593
90.8%
0 59
 
9.0%
8 1
 
0.2%

amur_cork_tree
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
593 
0
 
59
 
1
7
 
1

Length

Max length4
Median length4
Mean length3.7201835
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 593
90.7%
0 59
 
9.0%
1
 
0.2%
7 1
 
0.2%

Length

2024-04-16T14:53:13.680035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:13.767792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 593
90.8%
0 59
 
9.0%
7 1
 
0.2%

babylon_willow
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
594 
0
 
59
 
1

Length

Max length4
Median length4
Mean length3.7247706
Min length1

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 594
90.8%
0 59
 
9.0%
1
 
0.2%

Length

2024-04-16T14:53:13.869687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:13.957139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 594
91.0%
0 59
 
9.0%

three_flowered_maple
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
593 
0
 
59
4
 
1
 
1

Length

Max length4
Median length4
Mean length3.7201835
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row<NA>
2nd row<NA>
3rd row4
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 593
90.7%
0 59
 
9.0%
4 1
 
0.2%
1
 
0.2%

Length

2024-04-16T14:53:14.045674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:14.130331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 593
90.8%
0 59
 
9.0%
4 1
 
0.2%

japanese_elm
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
594 
0
 
59
 
1

Length

Max length4
Median length4
Mean length3.7247706
Min length1

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 594
90.8%
0 59
 
9.0%
1
 
0.2%

Length

2024-04-16T14:53:14.228875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:14.317422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 594
91.0%
0 59
 
9.0%

jujube
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
593 
0
 
59
2
 
1
 
1

Length

Max length4
Median length4
Mean length3.7201835
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 593
90.7%
0 59
 
9.0%
2 1
 
0.2%
1
 
0.2%

Length

2024-04-16T14:53:14.403617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:14.726789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 593
90.8%
0 59
 
9.0%
2 1
 
0.2%

silver_magnolia
Text

MISSING 

Distinct51
Distinct (%)41.5%
Missing531
Missing (%)81.2%
Memory size5.2 KiB
2024-04-16T14:53:14.841266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length1
Mean length1.5284553
Min length1

Characters and Unicode

Total characters188
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)30.9%

Sample

1st row58
2nd row7
3rd row3
4th row0
5th row0
ValueCountFrequency (%)
0 57
46.7%
13 3
 
2.5%
14 3
 
2.5%
61 3
 
2.5%
3 3
 
2.5%
25 2
 
1.6%
12 2
 
1.6%
4 2
 
1.6%
2 2
 
1.6%
60 2
 
1.6%
Other values (39) 43
35.2%
2024-04-16T14:53:15.101117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 65
34.6%
1 25
 
13.3%
2 16
 
8.5%
5 15
 
8.0%
3 14
 
7.4%
8 13
 
6.9%
4 12
 
6.4%
6 11
 
5.9%
7 8
 
4.3%
9 6
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 185
98.4%
Space Separator 3
 
1.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 65
35.1%
1 25
 
13.5%
2 16
 
8.6%
5 15
 
8.1%
3 14
 
7.6%
8 13
 
7.0%
4 12
 
6.5%
6 11
 
5.9%
7 8
 
4.3%
9 6
 
3.2%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 188
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 65
34.6%
1 25
 
13.3%
2 16
 
8.5%
5 15
 
8.0%
3 14
 
7.4%
8 13
 
6.9%
4 12
 
6.4%
6 11
 
5.9%
7 8
 
4.3%
9 6
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 188
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 65
34.6%
1 25
 
13.3%
2 16
 
8.5%
5 15
 
8.0%
3 14
 
7.4%
8 13
 
6.9%
4 12
 
6.4%
6 11
 
5.9%
7 8
 
4.3%
9 6
 
3.2%

kurogane_holly
Text

MISSING 

Distinct60
Distinct (%)46.5%
Missing525
Missing (%)80.3%
Memory size5.2 KiB
2024-04-16T14:53:15.243325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length1
Mean length1.6511628
Min length1

Characters and Unicode

Total characters213
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)37.2%

Sample

1st row19
2nd row294
3rd row0
4th row0
5th row198
ValueCountFrequency (%)
0 54
42.2%
2 5
 
3.9%
1 3
 
2.3%
26 3
 
2.3%
13 3
 
2.3%
31 2
 
1.6%
6 2
 
1.6%
14 2
 
1.6%
3 2
 
1.6%
11 2
 
1.6%
Other values (47) 50
39.1%
2024-04-16T14:53:15.526004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 61
28.6%
1 35
16.4%
2 26
12.2%
3 21
 
9.9%
6 17
 
8.0%
7 12
 
5.6%
5 11
 
5.2%
8 10
 
4.7%
7
 
3.3%
9 7
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 206
96.7%
Space Separator 7
 
3.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 61
29.6%
1 35
17.0%
2 26
12.6%
3 21
 
10.2%
6 17
 
8.3%
7 12
 
5.8%
5 11
 
5.3%
8 10
 
4.9%
9 7
 
3.4%
4 6
 
2.9%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 213
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 61
28.6%
1 35
16.4%
2 26
12.2%
3 21
 
9.9%
6 17
 
8.0%
7 12
 
5.6%
5 11
 
5.2%
8 10
 
4.7%
7
 
3.3%
9 7
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 213
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 61
28.6%
1 35
16.4%
2 26
12.2%
3 21
 
9.9%
6 17
 
8.0%
7 12
 
5.6%
5 11
 
5.2%
8 10
 
4.7%
7
 
3.3%
9 7
 
3.3%

pinus_thunbergii
Categorical

IMBALANCE 

Distinct27
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
569 
0
58 
7
 
2
21
 
2
126
 
1
Other values (22)
 
22

Length

Max length4
Median length4
Mean length3.6559633
Min length1

Unique

Unique23 ?
Unique (%)3.5%

Sample

1st row24
2nd row<NA>
3rd row<NA>
4th row29
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 569
87.0%
0 58
 
8.9%
7 2
 
0.3%
21 2
 
0.3%
126 1
 
0.2%
29 1
 
0.2%
51 1
 
0.2%
67 1
 
0.2%
457 1
 
0.2%
42 1
 
0.2%
Other values (17) 17
 
2.6%

Length

2024-04-16T14:53:15.636008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 569
87.1%
0 58
 
8.9%
7 2
 
0.3%
21 2
 
0.3%
73 1
 
0.2%
24 1
 
0.2%
740 1
 
0.2%
49 1
 
0.2%
15 1
 
0.2%
95 1
 
0.2%
Other values (16) 16
 
2.5%

myrsinaleaf_oak
Categorical

IMBALANCE 

Distinct27
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
569 
0
57 
260
 
2
12
 
2
9
 
2
Other values (22)
 
22

Length

Max length4
Median length4
Mean length3.6651376
Min length1

Unique

Unique22 ?
Unique (%)3.4%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 569
87.0%
0 57
 
8.7%
260 2
 
0.3%
12 2
 
0.3%
9 2
 
0.3%
11 1
 
0.2%
27 1
 
0.2%
27 1
 
0.2%
21 1
 
0.2%
143 1
 
0.2%
Other values (17) 17
 
2.6%

Length

2024-04-16T14:53:15.743000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 569
87.1%
0 57
 
8.7%
260 2
 
0.3%
12 2
 
0.3%
9 2
 
0.3%
27 2
 
0.3%
512 1
 
0.2%
1 1
 
0.2%
42 1
 
0.2%
8 1
 
0.2%
Other values (15) 15
 
2.3%

castanopsis_sieboldii
Categorical

IMBALANCE 

Distinct12
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
585 
0
59 
14
 
1
 
1
29
 
1
Other values (7)
 
7

Length

Max length4
Median length4
Mean length3.6941896
Min length1

Unique

Unique10 ?
Unique (%)1.5%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 585
89.4%
0 59
 
9.0%
14 1
 
0.2%
1
 
0.2%
29 1
 
0.2%
15 1
 
0.2%
38 1
 
0.2%
50 1
 
0.2%
5 1
 
0.2%
10 1
 
0.2%
Other values (2) 2
 
0.3%

Length

2024-04-16T14:53:15.843626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 585
89.6%
0 59
 
9.0%
14 1
 
0.2%
29 1
 
0.2%
15 1
 
0.2%
38 1
 
0.2%
50 1
 
0.2%
5 1
 
0.2%
10 1
 
0.2%
2 1
 
0.2%

cedrus_deodara
Categorical

IMBALANCE 

Distinct12
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
584 
0
 
56
1
 
4
1
 
2
4
 
1
Other values (7)
 
7

Length

Max length4
Median length4
Mean length3.6911315
Min length1

Unique

Unique8 ?
Unique (%)1.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 584
89.3%
0 56
 
8.6%
1 4
 
0.6%
1 2
 
0.3%
4 1
 
0.2%
10 1
 
0.2%
1
 
0.2%
7 1
 
0.2%
16 1
 
0.2%
237 1
 
0.2%
Other values (2) 2
 
0.3%

Length

2024-04-16T14:53:15.942537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 584
89.4%
0 56
 
8.6%
1 6
 
0.9%
4 1
 
0.2%
10 1
 
0.2%
7 1
 
0.2%
16 1
 
0.2%
237 1
 
0.2%
3 1
 
0.2%
21 1
 
0.2%

camphor_tree
Categorical

IMBALANCE 

Distinct8
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
588 
0
59 
8
 
2
26
 
1
15
 
1
Other values (3)
 
3

Length

Max length4
Median length4
Mean length3.7033639
Min length1

Unique

Unique5 ?
Unique (%)0.8%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 588
89.9%
0 59
 
9.0%
8 2
 
0.3%
26 1
 
0.2%
15 1
 
0.2%
1
 
0.2%
33 1
 
0.2%
48 1
 
0.2%

Length

2024-04-16T14:53:16.051559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:16.163332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 588
90.0%
0 59
 
9.0%
8 2
 
0.3%
26 1
 
0.2%
15 1
 
0.2%
33 1
 
0.2%
48 1
 
0.2%

torulosa
Categorical

IMBALANCE 

Distinct7
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
591 
0
 
58
6
 
1
10
 
1
 
1
Other values (2)
 
2

Length

Max length4
Median length4
Mean length3.7186544
Min length1

Unique

Unique5 ?
Unique (%)0.8%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 591
90.4%
0 58
 
8.9%
6 1
 
0.2%
10 1
 
0.2%
1
 
0.2%
5 1
 
0.2%
110 1
 
0.2%

Length

2024-04-16T14:53:16.284719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:16.380552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 591
90.5%
0 58
 
8.9%
6 1
 
0.2%
10 1
 
0.2%
5 1
 
0.2%
110 1
 
0.2%

neolitsea_sericea
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
592 
0
 
59
 
1
8
 
1
5
 
1

Length

Max length4
Median length4
Mean length3.7155963
Min length1

Unique

Unique3 ?
Unique (%)0.5%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 592
90.5%
0 59
 
9.0%
1
 
0.2%
8 1
 
0.2%
5 1
 
0.2%

Length

2024-04-16T14:53:16.479347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:16.567417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 592
90.7%
0 59
 
9.0%
8 1
 
0.2%
5 1
 
0.2%

taxus_cuspidata
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
593 
0
 
59
 
1
3
 
1

Length

Max length4
Median length4
Mean length3.7201835
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 593
90.7%
0 59
 
9.0%
1
 
0.2%
3 1
 
0.2%

Length

2024-04-16T14:53:16.658002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:16.745334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 593
90.8%
0 59
 
9.0%
3 1
 
0.2%

sweet_viburnum
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
594 
0
 
59
 
1

Length

Max length4
Median length4
Mean length3.7247706
Min length1

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 594
90.8%
0 59
 
9.0%
1
 
0.2%

Length

2024-04-16T14:53:16.842921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:16.927389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 594
91.0%
0 59
 
9.0%

etc_tree
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
<NA>
593 
0
 
59
 
1
67
 
1

Length

Max length4
Median length4
Mean length3.7217125
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 593
90.7%
0 59
 
9.0%
1
 
0.2%
67 1
 
0.2%

Length

2024-04-16T14:53:17.016590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:17.102053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 593
90.8%
0 59
 
9.0%
67 1
 
0.2%

gugun
Categorical

Distinct17
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
부산광역시 해운대구
68 
부산광역시 부산진구
63 
부산광역시 기장군
60 
부산광역시 강서구
59 
부산광역시 사하구
49 
Other values (12)
355 

Length

Max length10
Median length9
Mean length8.9770642
Min length4

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row부산광역시 해운대구
2nd row부산광역시 해운대구
3rd row부산광역시 해운대구
4th row부산광역시 해운대구
5th row부산광역시 해운대구

Common Values

ValueCountFrequency (%)
부산광역시 해운대구 68
10.4%
부산광역시 부산진구 63
9.6%
부산광역시 기장군 60
 
9.2%
부산광역시 강서구 59
 
9.0%
부산광역시 사하구 49
 
7.5%
부산광역시 동래구 48
 
7.3%
부산광역시 북구 48
 
7.3%
부산광역시 사상구 43
 
6.6%
부산광역시 금정구 34
 
5.2%
부산광역시 영도구 30
 
4.6%
Other values (7) 152
23.2%

Length

2024-04-16T14:53:17.191921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부산광역시 653
50.0%
해운대구 68
 
5.2%
부산진구 63
 
4.8%
기장군 60
 
4.6%
강서구 59
 
4.5%
사하구 49
 
3.7%
동래구 48
 
3.7%
북구 48
 
3.7%
사상구 43
 
3.3%
금정구 34
 
2.6%
Other values (8) 182
 
13.9%

reference_date
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
2020-07-31
493 
2020-08-31
102 
2020-08-26
 
29
2020-09-11
 
29
<NA>
 
1

Length

Max length10
Median length10
Mean length9.9908257
Min length4

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row2020-08-31
2nd row2020-08-31
3rd row2020-08-31
4th row2020-08-31
5th row2020-08-31

Common Values

ValueCountFrequency (%)
2020-07-31 493
75.4%
2020-08-31 102
 
15.6%
2020-08-26 29
 
4.4%
2020-09-11 29
 
4.4%
<NA> 1
 
0.2%

Length

2024-04-16T14:53:17.293215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:17.385057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-07-31 493
75.4%
2020-08-31 102
 
15.6%
2020-08-26 29
 
4.4%
2020-09-11 29
 
4.4%
na 1
 
0.2%

instt_code
Real number (ℝ)

Distinct16
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3332171.3
Minimum3250000
Maximum3400000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2024-04-16T14:53:17.469780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3250000
5-th percentile3260000
Q13300000
median3330000
Q33360000
95-th percentile3400000
Maximum3400000
Range150000
Interquartile range (IQR)60000

Descriptive statistics

Standard deviation42374.34
Coefficient of variation (CV)0.012716735
Kurtosis-1.0248628
Mean3332171.3
Median Absolute Deviation (MAD)30000
Skewness-0.013489865
Sum2.17924 × 109
Variance1.7955847 × 109
MonotonicityNot monotonic
2024-04-16T14:53:17.567046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
3330000 68
10.4%
3290000 63
9.6%
3400000 60
 
9.2%
3360000 59
 
9.0%
3340000 49
 
7.5%
3300000 48
 
7.3%
3320000 48
 
7.3%
3390000 43
 
6.6%
3350000 34
 
5.2%
3280000 31
 
4.7%
Other values (6) 151
23.1%
ValueCountFrequency (%)
3250000 15
 
2.3%
3260000 20
 
3.1%
3270000 29
4.4%
3280000 31
4.7%
3290000 63
9.6%
3300000 48
7.3%
3310000 29
4.4%
3320000 48
7.3%
3330000 68
10.4%
3340000 49
7.5%
ValueCountFrequency (%)
3400000 60
9.2%
3390000 43
6.6%
3380000 29
4.4%
3370000 29
4.4%
3360000 59
9.0%
3350000 34
5.2%
3340000 49
7.5%
3330000 68
10.4%
3320000 48
7.3%
3310000 29
4.4%

last_load_dttm
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
2021-01-05 11:34:33
654 

Length

Max length19
Median length19
Mean length19
Min length19

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-01-05 11:34:33
2nd row2021-01-05 11:34:33
3rd row2021-01-05 11:34:33
4th row2021-01-05 11:34:33
5th row2021-01-05 11:34:33

Common Values

ValueCountFrequency (%)
2021-01-05 11:34:33 654
100.0%

Length

2024-04-16T14:53:17.664296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T14:53:17.741450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-01-05 654
50.0%
11:34:33 654
50.0%

Sample

skeyloc_nmlatlngsec_timepointsec_endpointplant_distancetotalprunus_yedoensisginkgosawleaf_zelkovaplatanus_orientalisplatanuschinese_fringe_treesophora_japonicametasequoiahorse_chestnutacer_buergerianumceltis_sinensistulipiferaacer_palmatumfirmiana_simplexpin_oakpersimmoncornus_kousachinese_quincegoldenrain_treecinnamon_treeailanthus_altissimaamur_cork_treebabylon_willowthree_flowered_maplejapanese_elmjujubesilver_magnoliakurogane_hollypinus_thunbergiimyrsinaleaf_oakcastanopsis_sieboldiicedrus_deodaracamphor_treetorulosaneolitsea_sericeataxus_cuspidatasweet_viburnumetc_treegugunreference_dateinstt_codelast_load_dttm
04544부산광역시 해운대구 동백로35.191714129.153866운촌삼거리해수욕장720101<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>581924<NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 해운대구2020-08-3133300002021-01-05 11:34:33
14545부산광역시 해운대구 달맞이길35.094076129.101808미포육거리송정터널5200780766<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>14<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 해운대구2020-08-3133300002021-01-05 11:34:33
24546부산광역시 해운대구 좌동순환로35.095758129.100261중동 E-마트수영세무서603014291723539<NA><NA>60466<NA>26<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>4<NA><NA><NA>294<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 해운대구2020-08-3133300002021-01-05 11:34:33
34547부산광역시 해운대구 좌동순환로 15번길35.100486129.095993수영세무서중동지하차도20042<NA><NA>13<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>29<NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 해운대구2020-08-3133300002021-01-05 11:34:33
44548부산광역시 해운대구 좌동순환로 468번길35.095802129.101827동일아파트 앞 사거리동일아파트 101동17018<NA><NA>18<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 해운대구2020-08-3133300002021-01-05 11:34:33
54549부산광역시 해운대구 수영강변대로35.100386129.074375수영1호교반여농산물6090731<NA><NA>333<NA><NA><NA><NA>398<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 해운대구2020-08-3133300002021-01-05 11:34:33
64550부산광역시 해운대구 해운대로(구.충렬로)35.113734129.065691원동IC올림픽교차로3550462<NA><NA>461<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>1<NA><NA><NA><NA><NA><NA>부산광역시 해운대구2020-08-3133300002021-01-05 11:34:33
74551부산광역시 해운대구 센텀2로35.100918129.075708센텀시티역센텀뷰라움3606464<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 해운대구2020-08-3133300002021-01-05 11:34:33
84552부산광역시 해운대구 센텀3로35.100856129.074055센텀역 교차로트럼프월드센텀Ⅱ36071<NA><NA>71<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 해운대구2020-08-3133300002021-01-05 11:34:33
94553부산광역시 해운대구 센텀4로35.100237129.105728신세계백화점홈플러스 센텀시티점620393384<NA>9<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 해운대구2020-08-3133300002021-01-05 11:34:33
skeyloc_nmlatlngsec_timepointsec_endpointplant_distancetotalprunus_yedoensisginkgosawleaf_zelkovaplatanus_orientalisplatanuschinese_fringe_treesophora_japonicametasequoiahorse_chestnutacer_buergerianumceltis_sinensistulipiferaacer_palmatumfirmiana_simplexpin_oakpersimmoncornus_kousachinese_quincegoldenrain_treecinnamon_treeailanthus_altissimaamur_cork_treebabylon_willowthree_flowered_maplejapanese_elmjujubesilver_magnoliakurogane_hollypinus_thunbergiimyrsinaleaf_oakcastanopsis_sieboldiicedrus_deodaracamphor_treetorulosaneolitsea_sericeataxus_cuspidatasweet_viburnumetc_treegugunreference_dateinstt_codelast_load_dttm
6444645부산광역시 부산진구 월드컵대로35.182954129.050462연제구경계시민도서관삼거리44089<NA><NA>89<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 부산진구2020-07-3132900002021-01-05 11:34:33
6454691부산광역시 사하구 하신중앙로35.077313128.963795하단오거리65호광장352452929898<NA>55<NA><NA>7<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>54<NA><NA>17<NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 사하구2020-07-3133400002021-01-05 11:34:33
6464747부산광역시 기장군 반룡로35.351631129.255467기룡교차로반룡교<NA>35318335<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 기장군2020-07-3134000002021-01-05 11:34:33
6474748부산광역시 기장군 기장대로35.197558129.205996송정1호교기장체육관<NA>12031129<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>74<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 기장군2020-07-3134000002021-01-05 11:34:33
6484749부산광역시 기장군 정관로35.323242129.196571달음교입구eg1차<NA>1086<NA><NA><NA><NA><NA>1086<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 기장군2020-07-3134000002021-01-05 11:34:33
6494750부산광역시 기장군 장곡길35.310137129.241097좌천마을문중마을<NA>206206<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 기장군2020-07-3134000002021-01-05 11:34:33
6504751부산광역시 기장군 대변로35.235843129.217788청강사거리무양교차로<NA>213213<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 기장군2020-07-3134000002021-01-05 11:34:33
6514752부산광역시 기장군 기장해안로35.182464129.208422송정2호교연화리<NA>1991510321272<NA><NA>513<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>115260<NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 기장군2020-07-3134000002021-01-05 11:34:33
6524753부산광역시 기장군 철마로35.283522129.125725신암마을금정경계<NA>26719933<NA><NA><NA><NA><NA>35<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 기장군2020-07-3134000002021-01-05 11:34:33
6534754부산광역시 기장군 곰내길35.294346129.168449웅천마을정관곰내제<NA>510510<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>부산광역시 기장군2020-07-3134000002021-01-05 11:34:33