Overview

Dataset statistics

Number of variables10
Number of observations1290
Missing cells160
Missing cells (%)1.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory104.7 KiB
Average record size in memory83.1 B

Variable types

Numeric3
Text5
Categorical2

Dataset

Description전북특별자치도 대아수목원 목본식물 보유 현황(일련번호, 학명, 과명, 국명, 수량, 보유년월 등)우리 기관에서는 더 이상 생성 불가 데이터입니다.
Author전북특별자치도
URLhttps://www.data.go.kr/data/15055681/fileData.do

Alerts

일련번호 is highly overall correlated with 산지High correlation
보유년월 is highly overall correlated with 산지High correlation
보유경위 is highly overall correlated with 산지High correlation
산지 is highly overall correlated with 일련번호 and 2 other fieldsHigh correlation
보유경위 is highly imbalanced (70.9%)Imbalance
보유년월 has 159 (12.3%) missing valuesMissing
수량 is highly skewed (γ1 = 24.71060086)Skewed
일련번호 has unique valuesUnique

Reproduction

Analysis started2024-03-14 14:53:16.532393
Analysis finished2024-03-14 14:53:21.609420
Duration5.08 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1290
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean645.5
Minimum1
Maximum1290
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.5 KiB
2024-03-14T23:53:21.837294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile65.45
Q1323.25
median645.5
Q3967.75
95-th percentile1225.55
Maximum1290
Range1289
Interquartile range (IQR)644.5

Descriptive statistics

Standard deviation372.53523
Coefficient of variation (CV)0.57712662
Kurtosis-1.2
Mean645.5
Median Absolute Deviation (MAD)322.5
Skewness0
Sum832695
Variance138782.5
MonotonicityStrictly increasing
2024-03-14T23:53:22.292956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
888 1
 
0.1%
866 1
 
0.1%
865 1
 
0.1%
864 1
 
0.1%
863 1
 
0.1%
862 1
 
0.1%
861 1
 
0.1%
860 1
 
0.1%
859 1
 
0.1%
Other values (1280) 1280
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1290 1
0.1%
1289 1
0.1%
1288 1
0.1%
1287 1
0.1%
1286 1
0.1%
1285 1
0.1%
1284 1
0.1%
1283 1
0.1%
1282 1
0.1%
1281 1
0.1%

학명
Text

Distinct1031
Distinct (%)79.9%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-03-14T23:53:23.548435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length42
Mean length24.108527
Min length9

Characters and Unicode

Total characters31100
Distinct characters70
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique992 ?
Unique (%)76.9%

Sample

1st rowCycas revoluta Thunb.
2nd rowZamia pumila L.
3rd rowGinkgo spp.
4th rowGinkgo biloba L.
5th rowCephalotaxus koreana Nakai
ValueCountFrequency (%)
spp 327
 
7.9%
rosa 139
 
3.4%
japonica 96
 
2.3%
var 90
 
2.2%
hibiscus 88
 
2.1%
camellia 73
 
1.8%
syriacus 70
 
1.7%
l 61
 
1.5%
magnolia 60
 
1.4%
et 54
 
1.3%
Other values (1392) 3085
74.5%
2024-03-14T23:53:25.231437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3296
 
10.6%
a 3207
 
10.3%
i 2286
 
7.4%
s 1987
 
6.4%
e 1687
 
5.4%
r 1585
 
5.1%
o 1561
 
5.0%
n 1487
 
4.8%
u 1408
 
4.5%
l 1225
 
3.9%
Other values (60) 11371
36.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 23386
75.2%
Space Separator 3296
 
10.6%
Uppercase Letter 2583
 
8.3%
Other Punctuation 1669
 
5.4%
Open Punctuation 64
 
0.2%
Close Punctuation 64
 
0.2%
Decimal Number 24
 
0.1%
Dash Punctuation 12
 
< 0.1%
Modifier Symbol 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 3207
13.7%
i 2286
 
9.8%
s 1987
 
8.5%
e 1687
 
7.2%
r 1585
 
6.8%
o 1561
 
6.7%
n 1487
 
6.4%
u 1408
 
6.0%
l 1225
 
5.2%
p 1181
 
5.1%
Other values (16) 5772
24.7%
Uppercase Letter
ValueCountFrequency (%)
R 285
11.0%
C 256
 
9.9%
M 212
 
8.2%
H 207
 
8.0%
P 206
 
8.0%
S 202
 
7.8%
L 162
 
6.3%
A 139
 
5.4%
T 137
 
5.3%
B 111
 
4.3%
Other values (16) 666
25.8%
Decimal Number
ValueCountFrequency (%)
6 7
29.2%
1 4
16.7%
2 4
16.7%
9 2
 
8.3%
8 2
 
8.3%
3 1
 
4.2%
5 1
 
4.2%
7 1
 
4.2%
0 1
 
4.2%
4 1
 
4.2%
Other Punctuation
ValueCountFrequency (%)
. 992
59.4%
' 639
38.3%
" 38
 
2.3%
Space Separator
ValueCountFrequency (%)
3296
100.0%
Open Punctuation
ValueCountFrequency (%)
( 64
100.0%
Close Punctuation
ValueCountFrequency (%)
) 64
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 25969
83.5%
Common 5131
 
16.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 3207
 
12.3%
i 2286
 
8.8%
s 1987
 
7.7%
e 1687
 
6.5%
r 1585
 
6.1%
o 1561
 
6.0%
n 1487
 
5.7%
u 1408
 
5.4%
l 1225
 
4.7%
p 1181
 
4.5%
Other values (42) 8355
32.2%
Common
ValueCountFrequency (%)
3296
64.2%
. 992
 
19.3%
' 639
 
12.5%
( 64
 
1.2%
) 64
 
1.2%
" 38
 
0.7%
- 12
 
0.2%
6 7
 
0.1%
1 4
 
0.1%
2 4
 
0.1%
Other values (8) 11
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 31100
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3296
 
10.6%
a 3207
 
10.3%
i 2286
 
7.4%
s 1987
 
6.4%
e 1687
 
5.4%
r 1585
 
5.1%
o 1561
 
5.0%
n 1487
 
4.8%
u 1408
 
4.5%
l 1225
 
3.9%
Other values (60) 11371
36.6%

과명
Text

Distinct90
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-03-14T23:53:26.180469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length2.996124
Min length1

Characters and Unicode

Total characters3865
Distinct characters143
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)2.2%

Sample

1st row소철
2nd row소철
3rd row은행나무
4th row은행나무
5th row주목
ValueCountFrequency (%)
장미 238
18.4%
아욱 88
 
6.8%
차나무 79
 
6.1%
목련 63
 
4.9%
미나리아재비 60
 
4.7%
진달래 60
 
4.7%
단풍나무 51
 
4.0%
측백나무 44
 
3.4%
37
 
2.9%
인동 37
 
2.9%
Other values (80) 533
41.3%
2024-03-14T23:53:27.507826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
545
 
14.1%
481
 
12.4%
299
 
7.7%
238
 
6.2%
153
 
4.0%
88
 
2.3%
80
 
2.1%
79
 
2.0%
71
 
1.8%
68
 
1.8%
Other values (133) 1763
45.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3859
99.8%
Space Separator 6
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
545
 
14.1%
481
 
12.5%
299
 
7.7%
238
 
6.2%
153
 
4.0%
88
 
2.3%
80
 
2.1%
79
 
2.0%
71
 
1.8%
68
 
1.8%
Other values (132) 1757
45.5%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3859
99.8%
Common 6
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
545
 
14.1%
481
 
12.5%
299
 
7.7%
238
 
6.2%
153
 
4.0%
88
 
2.3%
80
 
2.1%
79
 
2.0%
71
 
1.8%
68
 
1.8%
Other values (132) 1757
45.5%
Common
ValueCountFrequency (%)
6
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3859
99.8%
ASCII 6
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
545
 
14.1%
481
 
12.5%
299
 
7.7%
238
 
6.2%
153
 
4.0%
88
 
2.3%
80
 
2.1%
79
 
2.0%
71
 
1.8%
68
 
1.8%
Other values (132) 1757
45.5%
ASCII
ValueCountFrequency (%)
6
100.0%

국명
Text

Distinct1105
Distinct (%)85.7%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-03-14T23:53:28.668188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length14
Mean length6.0945736
Min length1

Characters and Unicode

Total characters7862
Distinct characters517
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1083 ?
Unique (%)84.0%

Sample

1st row소철
2nd row멕시코소철
3rd row왕방울은행나무*
4th row은행나무
5th row개비자나무
ValueCountFrequency (%)
동백나무(재배종 66
 
5.0%
목련(재배종 49
 
3.7%
철쭉류 18
 
1.4%
모란(재배종 12
 
0.9%
무궁화(품종 11
 
0.8%
수국(재배종 11
 
0.8%
단풍나무(재배종 6
 
0.5%
수국 6
 
0.5%
품종 5
 
0.4%
무궁화류 4
 
0.3%
Other values (1106) 1139
85.8%
2024-03-14T23:53:30.238830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
590
 
7.5%
497
 
6.3%
* 332
 
4.2%
( 291
 
3.7%
) 291
 
3.7%
- 230
 
2.9%
214
 
2.7%
173
 
2.2%
166
 
2.1%
154
 
2.0%
Other values (507) 4924
62.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6621
84.2%
Other Punctuation 371
 
4.7%
Open Punctuation 291
 
3.7%
Close Punctuation 291
 
3.7%
Dash Punctuation 230
 
2.9%
Space Separator 42
 
0.5%
Decimal Number 16
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
590
 
8.9%
497
 
7.5%
214
 
3.2%
173
 
2.6%
166
 
2.5%
154
 
2.3%
145
 
2.2%
138
 
2.1%
122
 
1.8%
106
 
1.6%
Other values (492) 4316
65.2%
Decimal Number
ValueCountFrequency (%)
2 4
25.0%
9 3
18.8%
1 3
18.8%
8 3
18.8%
4 1
 
6.2%
0 1
 
6.2%
7 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
* 332
89.5%
' 37
 
10.0%
? 1
 
0.3%
, 1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 291
100.0%
Close Punctuation
ValueCountFrequency (%)
) 291
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 230
100.0%
Space Separator
ValueCountFrequency (%)
42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6621
84.2%
Common 1241
 
15.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
590
 
8.9%
497
 
7.5%
214
 
3.2%
173
 
2.6%
166
 
2.5%
154
 
2.3%
145
 
2.2%
138
 
2.1%
122
 
1.8%
106
 
1.6%
Other values (492) 4316
65.2%
Common
ValueCountFrequency (%)
* 332
26.8%
( 291
23.4%
) 291
23.4%
- 230
18.5%
42
 
3.4%
' 37
 
3.0%
2 4
 
0.3%
9 3
 
0.2%
1 3
 
0.2%
8 3
 
0.2%
Other values (5) 5
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6621
84.2%
ASCII 1241
 
15.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
590
 
8.9%
497
 
7.5%
214
 
3.2%
173
 
2.6%
166
 
2.5%
154
 
2.3%
145
 
2.2%
138
 
2.1%
122
 
1.8%
106
 
1.6%
Other values (492) 4316
65.2%
ASCII
ValueCountFrequency (%)
* 332
26.8%
( 291
23.4%
) 291
23.4%
- 230
18.5%
42
 
3.4%
' 37
 
3.0%
2 4
 
0.3%
9 3
 
0.2%
1 3
 
0.2%
8 3
 
0.2%
Other values (5) 5
 
0.4%

수량
Real number (ℝ)

SKEWED 

Distinct130
Distinct (%)10.1%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean178.5384
Minimum1
Maximum68071
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.5 KiB
2024-03-14T23:53:30.647629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q320
95-th percentile123.4
Maximum68071
Range68070
Interquartile range (IQR)19

Descriptive statistics

Standard deviation2228.5323
Coefficient of variation (CV)12.48209
Kurtosis700.50091
Mean178.5384
Median Absolute Deviation (MAD)1
Skewness24.710601
Sum230136
Variance4966356.3
MonotonicityNot monotonic
2024-03-14T23:53:31.091408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 572
44.3%
2 132
 
10.2%
10 65
 
5.0%
3 64
 
5.0%
30 48
 
3.7%
20 40
 
3.1%
5 28
 
2.2%
4 28
 
2.2%
100 22
 
1.7%
28 16
 
1.2%
Other values (120) 274
21.2%
ValueCountFrequency (%)
1 572
44.3%
2 132
 
10.2%
3 64
 
5.0%
4 28
 
2.2%
5 28
 
2.2%
6 9
 
0.7%
7 10
 
0.8%
8 5
 
0.4%
9 3
 
0.2%
10 65
 
5.0%
ValueCountFrequency (%)
68071 1
0.1%
29970 1
0.1%
20000 1
0.1%
15170 1
0.1%
6444 1
0.1%
6435 1
0.1%
6205 1
0.1%
5030 1
0.1%
4830 1
0.1%
4555 1
0.1%

보유년월
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct20
Distinct (%)1.8%
Missing159
Missing (%)12.3%
Infinite0
Infinite (%)0.0%
Mean2001.1485
Minimum1970
Maximum2014
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.5 KiB
2024-03-14T23:53:31.489997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1970
5-th percentile1989
Q12000
median2002
Q32002
95-th percentile2011
Maximum2014
Range44
Interquartile range (IQR)2

Descriptive statistics

Standard deviation4.8378161
Coefficient of variation (CV)0.0024175197
Kurtosis5.1191664
Mean2001.1485
Median Absolute Deviation (MAD)1
Skewness-0.79142949
Sum2263299
Variance23.404465
MonotonicityNot monotonic
2024-03-14T23:53:31.856328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
2002 537
41.6%
2003 177
 
13.7%
2000 138
 
10.7%
1989 63
 
4.9%
2014 44
 
3.4%
2001 33
 
2.6%
2004 28
 
2.2%
1994 24
 
1.9%
1996 18
 
1.4%
2011 17
 
1.3%
Other values (10) 52
 
4.0%
(Missing) 159
 
12.3%
ValueCountFrequency (%)
1970 2
 
0.2%
1989 63
4.9%
1990 12
 
0.9%
1991 7
 
0.5%
1992 1
 
0.1%
1993 1
 
0.1%
1994 24
 
1.9%
1995 16
 
1.2%
1996 18
 
1.4%
1997 2
 
0.2%
ValueCountFrequency (%)
2014 44
 
3.4%
2012 1
 
0.1%
2011 17
 
1.3%
2004 28
 
2.2%
2003 177
 
13.7%
2002 537
41.6%
2001 33
 
2.6%
2000 138
 
10.7%
1999 7
 
0.5%
1998 3
 
0.2%

보유경위
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
수집
1126 
자생
158 
자생,수집
 
5
분양(국립수목원)
 
1

Length

Max length9
Median length2
Mean length2.0170543
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row수집
2nd row수집
3rd row수집
4th row수집
5th row자생

Common Values

ValueCountFrequency (%)
수집 1126
87.3%
자생 158
 
12.2%
자생,수집 5
 
0.4%
분양(국립수목원) 1
 
0.1%

Length

2024-03-14T23:53:32.385317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T23:53:32.722282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수집 1126
87.3%
자생 158
 
12.2%
자생,수집 5
 
0.4%
분양(국립수목원 1
 
0.1%
Distinct56
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-03-14T23:53:33.319601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length2
Mean length2.2503876
Min length2

Characters and Unicode

Total characters2903
Distinct characters68
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)2.6%

Sample

1st row한국
2nd row멕시코
3rd row중국
4th row중국
5th row한국
ValueCountFrequency (%)
한국 701
53.4%
일본 224
 
17.1%
중국 125
 
9.5%
프랑스 63
 
4.8%
유럽 44
 
3.4%
독일 40
 
3.0%
미국 26
 
2.0%
북아메리카 18
 
1.4%
아시아 12
 
0.9%
인도 8
 
0.6%
Other values (36) 51
 
3.9%
2024-03-14T23:53:34.384112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
855
29.5%
701
24.1%
265
 
9.1%
225
 
7.8%
131
 
4.5%
68
 
2.3%
66
 
2.3%
64
 
2.2%
62
 
2.1%
46
 
1.6%
Other values (58) 420
14.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2856
98.4%
Space Separator 24
 
0.8%
Other Punctuation 23
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
855
29.9%
701
24.5%
265
 
9.3%
225
 
7.9%
131
 
4.6%
68
 
2.4%
66
 
2.3%
64
 
2.2%
62
 
2.2%
46
 
1.6%
Other values (56) 373
13.1%
Space Separator
ValueCountFrequency (%)
24
100.0%
Other Punctuation
ValueCountFrequency (%)
, 23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2856
98.4%
Common 47
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
855
29.9%
701
24.5%
265
 
9.3%
225
 
7.9%
131
 
4.6%
68
 
2.4%
66
 
2.3%
64
 
2.2%
62
 
2.2%
46
 
1.6%
Other values (56) 373
13.1%
Common
ValueCountFrequency (%)
24
51.1%
, 23
48.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2856
98.4%
ASCII 47
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
855
29.9%
701
24.5%
265
 
9.3%
225
 
7.9%
131
 
4.6%
68
 
2.4%
66
 
2.3%
64
 
2.2%
62
 
2.2%
46
 
1.6%
Other values (56) 373
13.1%
ASCII
ValueCountFrequency (%)
24
51.1%
, 23
48.9%

산지
Categorical

HIGH CORRELATION 

Distinct32
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
완주
591 
천리포수목원
195 
뉴코리아장미
128 
전북산림
96 
미림종묘
 
46
Other values (27)
234 

Length

Max length6
Median length2
Mean length3.4348837
Min length2

Unique

Unique10 ?
Unique (%)0.8%

Sample

1st row완주
2nd row완주
3rd row전북산림
4th row완주
5th row완주

Common Values

ValueCountFrequency (%)
완주 591
45.8%
천리포수목원 195
 
15.1%
뉴코리아장미 128
 
9.9%
전북산림 96
 
7.4%
미림종묘 46
 
3.6%
프롬앤 44
 
3.4%
프롬엔 42
 
3.3%
순창 33
 
2.6%
전원생활 23
 
1.8%
천보식물원 16
 
1.2%
Other values (22) 76
 
5.9%

Length

2024-03-14T23:53:34.623716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
완주 591
45.7%
천리포수목원 195
 
15.1%
뉴코리아장미 128
 
9.9%
전북산림 96
 
7.4%
미림종묘 46
 
3.6%
프롬앤 44
 
3.4%
프롬엔 42
 
3.3%
순창 33
 
2.6%
전원생활 23
 
1.8%
천보식물원 16
 
1.2%
Other values (23) 78
 
6.0%
Distinct98
Distinct (%)7.6%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-03-14T23:53:35.166473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length4.1953488
Min length2

Characters and Unicode

Total characters5412
Distinct characters66
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)3.6%

Sample

1st row온실
2nd row온실
3rd row테마정원, 표본수원
4th row분재원, 약용수원
5th row자생
ValueCountFrequency (%)
표본수원 443
29.6%
온실 166
 
11.1%
자생 162
 
10.8%
장미원 117
 
7.8%
무궁화원 85
 
5.7%
동백원 60
 
4.0%
목련원 56
 
3.7%
관상수원 51
 
3.4%
묘포장 50
 
3.3%
약용수원 45
 
3.0%
Other values (25) 262
17.5%
2024-03-14T23:53:36.194496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1120
20.7%
650
12.0%
444
 
8.2%
444
 
8.2%
221
 
4.1%
, 207
 
3.8%
198
 
3.7%
173
 
3.2%
169
 
3.1%
166
 
3.1%
Other values (56) 1620
29.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4984
92.1%
Space Separator 221
 
4.1%
Other Punctuation 207
 
3.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1120
22.5%
650
13.0%
444
 
8.9%
444
 
8.9%
198
 
4.0%
173
 
3.5%
169
 
3.4%
166
 
3.3%
162
 
3.3%
118
 
2.4%
Other values (54) 1340
26.9%
Space Separator
ValueCountFrequency (%)
221
100.0%
Other Punctuation
ValueCountFrequency (%)
, 207
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4984
92.1%
Common 428
 
7.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1120
22.5%
650
13.0%
444
 
8.9%
444
 
8.9%
198
 
4.0%
173
 
3.5%
169
 
3.4%
166
 
3.3%
162
 
3.3%
118
 
2.4%
Other values (54) 1340
26.9%
Common
ValueCountFrequency (%)
221
51.6%
, 207
48.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4984
92.1%
ASCII 428
 
7.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1120
22.5%
650
13.0%
444
 
8.9%
444
 
8.9%
198
 
4.0%
173
 
3.5%
169
 
3.4%
166
 
3.3%
162
 
3.3%
118
 
2.4%
Other values (54) 1340
26.9%
ASCII
ValueCountFrequency (%)
221
51.6%
, 207
48.4%

Interactions

2024-03-14T23:53:19.761100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:53:18.154307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:53:18.952830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:53:20.024442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:53:18.419300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:53:19.226053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:53:20.296839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:53:18.695132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:53:19.499366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T23:53:36.455478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호과명수량보유년월보유경위원산지산지참고사항(식재위치)
일련번호1.0000.9910.0000.8160.2520.7370.8560.886
과명0.9911.0000.6140.7880.4520.9310.8710.926
수량0.0000.6141.0000.0780.0000.0000.0000.846
보유년월0.8160.7880.0781.0000.0940.6970.8770.886
보유경위0.2520.4520.0000.0941.0000.0000.8430.957
원산지0.7370.9310.0000.6970.0001.0000.8850.650
산지0.8560.8710.0000.8770.8430.8851.0000.962
참고사항(식재위치)0.8860.9260.8460.8860.9570.6500.9621.000
2024-03-14T23:53:36.738498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
보유경위산지
보유경위1.0000.609
산지0.6091.000
2024-03-14T23:53:36.979698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호수량보유년월보유경위산지
일련번호1.000-0.0340.1410.1530.509
수량-0.0341.000-0.1780.0000.000
보유년월0.141-0.1781.0000.0630.567
보유경위0.1530.0000.0631.0000.609
산지0.5090.0000.5670.6091.000

Missing values

2024-03-14T23:53:20.655156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T23:53:21.132656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T23:53:21.467188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

일련번호학명과명국명수량보유년월보유경위원산지산지참고사항(식재위치)
01Cycas revoluta Thunb.소철소철211994수집한국완주온실
12Zamia pumila L.소철멕시코소철11994수집멕시코완주온실
23Ginkgo spp.은행나무왕방울은행나무*22003수집중국전북산림테마정원, 표본수원
34Ginkgo biloba L.은행나무은행나무981989수집중국완주분재원, 약용수원
45Cephalotaxus koreana Nakai주목개비자나무1<NA>자생한국완주자생
56Cephalotaxus koreana var. nana Nak.주목눈개비자나무1<NA>자생한국완주자생
67Taxus spp.주목황금주목(팔방성)*12003수집한국완주온실, 표본수원
78Taxus baccata 'Aurea'주목황금주목32002수집한국완주표본수원
89Taxus spp.주목황금주목(변종)*32002수집한국완주표본수원
910Taxus cuspidata 'Nana'주목눈주목12002수집한국완주표본수원
일련번호학명과명국명수량보유년월보유경위원산지산지참고사항(식재위치)
12801281Hydrangea macrophylla "Eisvogel"범의귀수국 품종 '아이스보글'102014수집유럽프롬앤표본수원
12811282Hydrangea macrophylla "Hamburg"범의귀수국 품종 '함부르크'102014수집유럽프롬앤표본수원
12821283Hydrangea macrophylla "Libelle"범의귀수국 품종 '리벨르'102014수집유럽프롬앤표본수원
12831284Hydrangea mcarophylla "Alpengluchen"범의귀수국 품종 '알펜그루헨'102014수집유럽프롬앤표본수원
12841285Lonicera nitida "Lemon Beauty"인동동청괴불나무 '레몬 뷰티'102014수집유럽프롬앤표본수원
12851286Lonicera nitida "Maigun"인동동청괴불나무 '마이준'102014수집유럽프롬앤표본수원
12861287Lonicera pileata인동필레아타괴불나무102014수집유럽프롬앤표본수원
12871288Rosa cannia장미장미속류102014수집유럽프롬앤표본수원
12881289Rosa rubignosa장미장미속류102014수집유럽프롬앤표본수원
12891290Stephandra incisa "Crispa"장미국수나무 '크리스파'102014수집유럽, 아시아프롬앤표본수원