Overview

Dataset statistics

Number of variables14
Number of observations618
Missing cells685
Missing cells (%)7.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory68.3 KiB
Average record size in memory113.2 B

Variable types

Numeric1
Text7
Categorical3
DateTime3

Dataset

Description국립산림품종관리센터에 접수된 품종보호출원내역입니다. 작물명, 품종명칭, 대조품종, 출원인 업종구분, 출원일자 등 정보를 포함합니다.(출원이 되지 않은 경우 출원일자 등이 공란으로 표시되어 있습니다.)
Author산림청 국립산림품종관리센터
URLhttps://www.data.go.kr/data/15005206/fileData.do

Alerts

대조품종 has 21 (3.4%) missing valuesMissing
품종보호등록번호 has 323 (52.3%) missing valuesMissing
등록일자 has 323 (52.3%) missing valuesMissing
순번 has unique valuesUnique
출원번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:01:10.512286
Analysis finished2023-12-12 13:01:12.097909
Duration1.59 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct618
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean309.5
Minimum1
Maximum618
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2023-12-12T22:01:12.188131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile31.85
Q1155.25
median309.5
Q3463.75
95-th percentile587.15
Maximum618
Range617
Interquartile range (IQR)308.5

Descriptive statistics

Standard deviation178.54551
Coefficient of variation (CV)0.57688372
Kurtosis-1.2
Mean309.5
Median Absolute Deviation (MAD)154.5
Skewness0
Sum191271
Variance31878.5
MonotonicityStrictly increasing
2023-12-12T22:01:12.355601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
416 1
 
0.2%
409 1
 
0.2%
410 1
 
0.2%
411 1
 
0.2%
412 1
 
0.2%
413 1
 
0.2%
414 1
 
0.2%
415 1
 
0.2%
417 1
 
0.2%
Other values (608) 608
98.4%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
618 1
0.2%
617 1
0.2%
616 1
0.2%
615 1
0.2%
614 1
0.2%
613 1
0.2%
612 1
0.2%
611 1
0.2%
610 1
0.2%
609 1
0.2%

출원번호
Text

UNIQUE 

Distinct618
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
2023-12-12T22:01:12.760597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length6.7880259
Min length6

Characters and Unicode

Total characters4195
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique618 ?
Unique (%)100.0%

Sample

1st row2008-1
2nd row2008-2
3rd row2008-3
4th row2008-4
5th row2008-5
ValueCountFrequency (%)
2008-1 1
 
0.2%
2018-42 1
 
0.2%
2018-36 1
 
0.2%
2018-37 1
 
0.2%
2019-6 1
 
0.2%
2018-38 1
 
0.2%
2018-39 1
 
0.2%
2018-40 1
 
0.2%
2018-41 1
 
0.2%
2018-43 1
 
0.2%
Other values (608) 608
98.4%
2023-12-12T22:01:13.298585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 1065
25.4%
0 823
19.6%
1 703
16.8%
- 618
14.7%
3 220
 
5.2%
4 168
 
4.0%
9 148
 
3.5%
8 126
 
3.0%
7 111
 
2.6%
5 109
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3577
85.3%
Dash Punctuation 618
 
14.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 1065
29.8%
0 823
23.0%
1 703
19.7%
3 220
 
6.2%
4 168
 
4.7%
9 148
 
4.1%
8 126
 
3.5%
7 111
 
3.1%
5 109
 
3.0%
6 104
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 618
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4195
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 1065
25.4%
0 823
19.6%
1 703
16.8%
- 618
14.7%
3 220
 
5.2%
4 168
 
4.0%
9 148
 
3.5%
8 126
 
3.0%
7 111
 
2.6%
5 109
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4195
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 1065
25.4%
0 823
19.6%
1 703
16.8%
- 618
14.7%
3 220
 
5.2%
4 168
 
4.0%
9 148
 
3.5%
8 126
 
3.0%
7 111
 
2.6%
5 109
 
2.6%
Distinct156
Distinct (%)25.2%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
2023-12-12T22:01:13.635171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length3.4320388
Min length1

Characters and Unicode

Total characters2121
Distinct characters205
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique81 ?
Unique (%)13.1%

Sample

1st row밤나무
2nd row밤나무
3rd row백운풀
4th row표고
5th row밤나무
ValueCountFrequency (%)
표고 87
 
14.1%
잔디 40
 
6.5%
감나무 29
 
4.7%
다래 23
 
3.7%
나무수국 20
 
3.2%
밤나무 19
 
3.1%
구절초 18
 
2.9%
대추나무 12
 
1.9%
호두나무 12
 
1.9%
솔체꽃 10
 
1.6%
Other values (147) 349
56.4%
2023-12-12T22:01:14.033473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
202
 
9.5%
200
 
9.4%
89
 
4.2%
87
 
4.1%
65
 
3.1%
64
 
3.0%
64
 
3.0%
49
 
2.3%
40
 
1.9%
37
 
1.7%
Other values (195) 1224
57.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2081
98.1%
Math Symbol 33
 
1.6%
Uppercase Letter 3
 
0.1%
Other Punctuation 2
 
0.1%
Lowercase Letter 1
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
202
 
9.7%
200
 
9.6%
89
 
4.3%
87
 
4.2%
65
 
3.1%
64
 
3.1%
64
 
3.1%
49
 
2.4%
40
 
1.9%
37
 
1.8%
Other values (190) 1184
56.9%
Math Symbol
ValueCountFrequency (%)
× 33
100.0%
Uppercase Letter
ValueCountFrequency (%)
X 3
100.0%
Other Punctuation
ValueCountFrequency (%)
' 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
x 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2081
98.1%
Common 36
 
1.7%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
202
 
9.7%
200
 
9.6%
89
 
4.3%
87
 
4.2%
65
 
3.1%
64
 
3.1%
64
 
3.1%
49
 
2.4%
40
 
1.9%
37
 
1.8%
Other values (190) 1184
56.9%
Common
ValueCountFrequency (%)
× 33
91.7%
' 2
 
5.6%
1
 
2.8%
Latin
ValueCountFrequency (%)
X 3
75.0%
x 1
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2081
98.1%
None 33
 
1.6%
ASCII 7
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
202
 
9.7%
200
 
9.6%
89
 
4.3%
87
 
4.2%
65
 
3.1%
64
 
3.1%
64
 
3.1%
49
 
2.4%
40
 
1.9%
37
 
1.8%
Other values (190) 1184
56.9%
None
ValueCountFrequency (%)
× 33
100.0%
ASCII
ValueCountFrequency (%)
X 3
42.9%
' 2
28.6%
x 1
 
14.3%
1
 
14.3%

작물구분
Categorical

Distinct7
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
야생화
164 
산과수
129 
조경수
121 
버섯류
108 
특용
77 
Other values (2)
19 

Length

Max length3
Median length3
Mean length2.8446602
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row산과수
2nd row산과수
3rd row특용
4th row버섯류
5th row산과수

Common Values

ValueCountFrequency (%)
야생화 164
26.5%
산과수 129
20.9%
조경수 121
19.6%
버섯류 108
17.5%
특용 77
12.5%
산채 17
 
2.8%
기타 2
 
0.3%

Length

2023-12-12T22:01:14.233608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:01:14.366038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
야생화 164
26.5%
산과수 129
20.9%
조경수 121
19.6%
버섯류 108
17.5%
특용 77
12.5%
산채 17
 
2.8%
기타 2
 
0.3%
Distinct614
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
2023-12-12T22:01:14.656770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length4.0226537
Min length1

Characters and Unicode

Total characters2486
Distinct characters352
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique610 ?
Unique (%)98.7%

Sample

1st row대한
2nd row미풍
3rd row백약
4th row하나참
5th row대보
ValueCountFrequency (%)
엘씨 13
 
1.9%
1호 8
 
1.2%
한라그린 7
 
1.0%
산조 5
 
0.7%
한초 4
 
0.6%
15호 3
 
0.4%
핑크 3
 
0.4%
아우라 2
 
0.3%
2호 2
 
0.3%
블루 2
 
0.3%
Other values (635) 644
92.9%
2023-12-12T22:01:15.118328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
130
 
5.2%
1 114
 
4.6%
76
 
3.1%
71
 
2.9%
58
 
2.3%
0 55
 
2.2%
2 41
 
1.6%
38
 
1.5%
38
 
1.5%
36
 
1.4%
Other values (342) 1829
73.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2061
82.9%
Decimal Number 334
 
13.4%
Space Separator 76
 
3.1%
Dash Punctuation 13
 
0.5%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
130
 
6.3%
71
 
3.4%
58
 
2.8%
38
 
1.8%
38
 
1.8%
36
 
1.7%
35
 
1.7%
34
 
1.6%
32
 
1.6%
32
 
1.6%
Other values (328) 1557
75.5%
Decimal Number
ValueCountFrequency (%)
1 114
34.1%
0 55
16.5%
2 41
 
12.3%
7 31
 
9.3%
3 22
 
6.6%
5 21
 
6.3%
4 16
 
4.8%
9 15
 
4.5%
8 12
 
3.6%
6 7
 
2.1%
Uppercase Letter
ValueCountFrequency (%)
C 1
50.0%
S 1
50.0%
Space Separator
ValueCountFrequency (%)
76
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2061
82.9%
Common 423
 
17.0%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
130
 
6.3%
71
 
3.4%
58
 
2.8%
38
 
1.8%
38
 
1.8%
36
 
1.7%
35
 
1.7%
34
 
1.6%
32
 
1.6%
32
 
1.6%
Other values (328) 1557
75.5%
Common
ValueCountFrequency (%)
1 114
27.0%
76
18.0%
0 55
13.0%
2 41
 
9.7%
7 31
 
7.3%
3 22
 
5.2%
5 21
 
5.0%
4 16
 
3.8%
9 15
 
3.5%
- 13
 
3.1%
Other values (2) 19
 
4.5%
Latin
ValueCountFrequency (%)
C 1
50.0%
S 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2061
82.9%
ASCII 425
 
17.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
130
 
6.3%
71
 
3.4%
58
 
2.8%
38
 
1.8%
38
 
1.8%
36
 
1.7%
35
 
1.7%
34
 
1.6%
32
 
1.6%
32
 
1.6%
Other values (328) 1557
75.5%
ASCII
ValueCountFrequency (%)
1 114
26.8%
76
17.9%
0 55
12.9%
2 41
 
9.6%
7 31
 
7.3%
3 22
 
5.2%
5 21
 
4.9%
4 16
 
3.8%
9 15
 
3.5%
- 13
 
3.1%
Other values (4) 21
 
4.9%

대조품종
Text

MISSING 

Distinct304
Distinct (%)50.9%
Missing21
Missing (%)3.4%
Memory size5.0 KiB
2023-12-12T22:01:15.499195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length31
Mean length4.519263
Min length2

Characters and Unicode

Total characters2698
Distinct characters332
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique199 ?
Unique (%)33.3%

Sample

1st row유마
2nd row유마
3rd row백운풀
4th row균흥556
5th row유마
ValueCountFrequency (%)
일반종 42
 
6.3%
구절초 16
 
2.4%
산조701호 12
 
1.8%
라임라이트 11
 
1.6%
그린조아 11
 
1.6%
유마 9
 
1.3%
곰솔 8
 
1.2%
재래종 8
 
1.2%
상주둥시 7
 
1.0%
평창재래 6
 
0.9%
Other values (326) 537
80.5%
2023-12-12T22:01:16.036489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 88
 
3.3%
78
 
2.9%
78
 
2.9%
76
 
2.8%
73
 
2.7%
0 72
 
2.7%
55
 
2.0%
49
 
1.8%
48
 
1.8%
47
 
1.7%
Other values (322) 2034
75.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2081
77.1%
Decimal Number 299
 
11.1%
Lowercase Letter 94
 
3.5%
Uppercase Letter 77
 
2.9%
Space Separator 73
 
2.7%
Other Punctuation 30
 
1.1%
Open Punctuation 15
 
0.6%
Close Punctuation 15
 
0.6%
Dash Punctuation 13
 
0.5%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
78
 
3.7%
78
 
3.7%
76
 
3.7%
55
 
2.6%
49
 
2.4%
48
 
2.3%
47
 
2.3%
44
 
2.1%
40
 
1.9%
39
 
1.9%
Other values (265) 1527
73.4%
Lowercase Letter
ValueCountFrequency (%)
i 13
13.8%
n 10
10.6%
e 9
9.6%
a 9
9.6%
r 9
9.6%
s 6
 
6.4%
o 5
 
5.3%
l 5
 
5.3%
f 4
 
4.3%
u 4
 
4.3%
Other values (11) 20
21.3%
Uppercase Letter
ValueCountFrequency (%)
S 9
11.7%
H 8
 
10.4%
L 6
 
7.8%
R 6
 
7.8%
M 5
 
6.5%
J 5
 
6.5%
K 5
 
6.5%
B 4
 
5.2%
A 4
 
5.2%
P 4
 
5.2%
Other values (9) 21
27.3%
Decimal Number
ValueCountFrequency (%)
1 88
29.4%
0 72
24.1%
7 34
 
11.4%
2 32
 
10.7%
9 14
 
4.7%
8 14
 
4.7%
3 12
 
4.0%
6 11
 
3.7%
5 11
 
3.7%
4 11
 
3.7%
Other Punctuation
ValueCountFrequency (%)
, 22
73.3%
' 8
 
26.7%
Space Separator
ValueCountFrequency (%)
73
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Math Symbol
ValueCountFrequency (%)
× 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2081
77.1%
Common 446
 
16.5%
Latin 171
 
6.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
78
 
3.7%
78
 
3.7%
76
 
3.7%
55
 
2.6%
49
 
2.4%
48
 
2.3%
47
 
2.3%
44
 
2.1%
40
 
1.9%
39
 
1.9%
Other values (265) 1527
73.4%
Latin
ValueCountFrequency (%)
i 13
 
7.6%
n 10
 
5.8%
S 9
 
5.3%
e 9
 
5.3%
a 9
 
5.3%
r 9
 
5.3%
H 8
 
4.7%
L 6
 
3.5%
s 6
 
3.5%
R 6
 
3.5%
Other values (30) 86
50.3%
Common
ValueCountFrequency (%)
1 88
19.7%
73
16.4%
0 72
16.1%
7 34
 
7.6%
2 32
 
7.2%
, 22
 
4.9%
( 15
 
3.4%
) 15
 
3.4%
9 14
 
3.1%
8 14
 
3.1%
Other values (7) 67
15.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2081
77.1%
ASCII 616
 
22.8%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 88
 
14.3%
73
 
11.9%
0 72
 
11.7%
7 34
 
5.5%
2 32
 
5.2%
, 22
 
3.6%
( 15
 
2.4%
) 15
 
2.4%
9 14
 
2.3%
8 14
 
2.3%
Other values (46) 237
38.5%
Hangul
ValueCountFrequency (%)
78
 
3.7%
78
 
3.7%
76
 
3.7%
55
 
2.6%
49
 
2.4%
48
 
2.3%
47
 
2.3%
44
 
2.1%
40
 
1.9%
39
 
1.9%
Other values (265) 1527
73.4%
None
ValueCountFrequency (%)
× 1
100.0%

상태
Categorical

Distinct8
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
보호등록
277 
심사
218 
취하
75 
거절
 
18
소멸
 
18
Other values (3)
 
12

Length

Max length4
Median length2
Mean length2.8964401
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보호등록
2nd row보호등록
3rd row무효
4th row거절
5th row보호등록

Common Values

ValueCountFrequency (%)
보호등록 277
44.8%
심사 218
35.3%
취하 75
 
12.1%
거절 18
 
2.9%
소멸 18
 
2.9%
포기 7
 
1.1%
이관 3
 
0.5%
무효 2
 
0.3%

Length

2023-12-12T22:01:16.234084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:01:16.388262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보호등록 277
44.8%
심사 218
35.3%
취하 75
 
12.1%
거절 18
 
2.9%
소멸 18
 
2.9%
포기 7
 
1.1%
이관 3
 
0.5%
무효 2
 
0.3%
Distinct6
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
개인
247 
국가
127 
지자체
91 
종자업계
65 
외국
62 

Length

Max length4
Median length2
Mean length2.3576052
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국가
2nd row국가
3rd row개인
4th row종자업계
5th row국가

Common Values

ValueCountFrequency (%)
개인 247
40.0%
국가 127
20.6%
지자체 91
 
14.7%
종자업계 65
 
10.5%
외국 62
 
10.0%
기타 26
 
4.2%

Length

2023-12-12T22:01:16.567639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:01:16.708797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 247
40.0%
국가 127
20.6%
지자체 91
 
14.7%
종자업계 65
 
10.5%
외국 62
 
10.0%
기타 26
 
4.2%
Distinct325
Distinct (%)52.6%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
Minimum2008-04-21 00:00:00
Maximum2022-12-21 00:00:00
2023-12-12T22:01:16.887439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:01:17.342363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct612
Distinct (%)100.0%
Missing6
Missing (%)1.0%
Memory size5.0 KiB
2023-12-12T22:01:17.726983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length6.8088235
Min length2

Characters and Unicode

Total characters4167
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique612 ?
Unique (%)100.0%

Sample

1st row2008-1
2nd row2008-2
3rd row2008-4
4th row2008-3
5th row2008-5
ValueCountFrequency (%)
2008-9 1
 
0.2%
2019-6 1
 
0.2%
2019-9 1
 
0.2%
2019-16 1
 
0.2%
2019-1 1
 
0.2%
2019-2 1
 
0.2%
2019-3 1
 
0.2%
2019-4 1
 
0.2%
2019-5 1
 
0.2%
2019-7 1
 
0.2%
Other values (601) 601
98.4%
2023-12-12T22:01:18.315276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 1047
25.1%
0 829
19.9%
1 697
16.7%
- 611
14.7%
3 231
 
5.5%
4 151
 
3.6%
9 139
 
3.3%
5 124
 
3.0%
8 121
 
2.9%
7 111
 
2.7%
Other values (2) 106
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3554
85.3%
Dash Punctuation 611
 
14.7%
Space Separator 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 1047
29.5%
0 829
23.3%
1 697
19.6%
3 231
 
6.5%
4 151
 
4.2%
9 139
 
3.9%
5 124
 
3.5%
8 121
 
3.4%
7 111
 
3.1%
6 104
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 611
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4167
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 1047
25.1%
0 829
19.9%
1 697
16.7%
- 611
14.7%
3 231
 
5.5%
4 151
 
3.6%
9 139
 
3.3%
5 124
 
3.0%
8 121
 
2.9%
7 111
 
2.7%
Other values (2) 106
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4167
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 1047
25.1%
0 829
19.9%
1 697
16.7%
- 611
14.7%
3 231
 
5.5%
4 151
 
3.6%
9 139
 
3.3%
5 124
 
3.0%
8 121
 
2.9%
7 111
 
2.7%
Other values (2) 106
 
2.5%
Distinct144
Distinct (%)23.5%
Missing6
Missing (%)1.0%
Memory size5.0 KiB
Minimum2008-05-15 00:00:00
Maximum2023-01-15 00:00:00
2023-12-12T22:01:18.490426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:01:18.638586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct145
Distinct (%)23.7%
Missing6
Missing (%)1.0%
Memory size5.0 KiB
2023-12-12T22:01:18.891943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length5.0816993
Min length5

Characters and Unicode

Total characters3110
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)6.0%

Sample

1st row제119호
2nd row제119호
3rd row제120호
4th row제119호
5th row제121호
ValueCountFrequency (%)
50
 
7.6%
제198호 18
 
2.7%
제286호 18
 
2.7%
제270호 16
 
2.4%
제273호 15
 
2.3%
제128호 14
 
2.1%
제210호 13
 
2.0%
제186호 13
 
2.0%
제294호 11
 
1.7%
제258호 11
 
1.7%
Other values (136) 483
73.0%
2023-12-12T22:01:19.284313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
612
19.7%
612
19.7%
2 512
16.5%
1 318
10.2%
8 177
 
5.7%
6 147
 
4.7%
4 146
 
4.7%
3 130
 
4.2%
7 114
 
3.7%
5 105
 
3.4%
Other values (3) 237
 
7.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1836
59.0%
Other Letter 1224
39.4%
Space Separator 50
 
1.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 512
27.9%
1 318
17.3%
8 177
 
9.6%
6 147
 
8.0%
4 146
 
8.0%
3 130
 
7.1%
7 114
 
6.2%
5 105
 
5.7%
9 100
 
5.4%
0 87
 
4.7%
Other Letter
ValueCountFrequency (%)
612
50.0%
612
50.0%
Space Separator
ValueCountFrequency (%)
50
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1886
60.6%
Hangul 1224
39.4%

Most frequent character per script

Common
ValueCountFrequency (%)
2 512
27.1%
1 318
16.9%
8 177
 
9.4%
6 147
 
7.8%
4 146
 
7.7%
3 130
 
6.9%
7 114
 
6.0%
5 105
 
5.6%
9 100
 
5.3%
0 87
 
4.6%
Hangul
ValueCountFrequency (%)
612
50.0%
612
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1886
60.6%
Hangul 1224
39.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
612
50.0%
612
50.0%
ASCII
ValueCountFrequency (%)
2 512
27.1%
1 318
16.9%
8 177
 
9.4%
6 147
 
7.8%
4 146
 
7.7%
3 130
 
6.9%
7 114
 
6.0%
5 105
 
5.6%
9 100
 
5.3%
0 87
 
4.6%
Distinct294
Distinct (%)99.7%
Missing323
Missing (%)52.3%
Memory size5.0 KiB
2023-12-12T22:01:19.624527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.6338983
Min length3

Characters and Unicode

Total characters1367
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique293 ?
Unique (%)99.3%

Sample

1st row제1호
2nd row제2호
3rd row제3호
4th row제4호
5th row제5호
ValueCountFrequency (%)
제231호 2
 
0.7%
제173호 1
 
0.3%
제238호 1
 
0.3%
제191호 1
 
0.3%
제187호 1
 
0.3%
제267호 1
 
0.3%
제170호 1
 
0.3%
제169호 1
 
0.3%
제197호 1
 
0.3%
제1호 1
 
0.3%
Other values (284) 284
96.3%
2023-12-12T22:01:20.123749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
295
21.6%
295
21.6%
1 161
11.8%
2 155
11.3%
3 61
 
4.5%
5 60
 
4.4%
4 60
 
4.4%
7 59
 
4.3%
8 59
 
4.3%
6 59
 
4.3%
Other values (2) 103
 
7.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 777
56.8%
Other Letter 590
43.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 161
20.7%
2 155
19.9%
3 61
 
7.9%
5 60
 
7.7%
4 60
 
7.7%
7 59
 
7.6%
8 59
 
7.6%
6 59
 
7.6%
9 54
 
6.9%
0 49
 
6.3%
Other Letter
ValueCountFrequency (%)
295
50.0%
295
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 777
56.8%
Hangul 590
43.2%

Most frequent character per script

Common
ValueCountFrequency (%)
1 161
20.7%
2 155
19.9%
3 61
 
7.9%
5 60
 
7.7%
4 60
 
7.7%
7 59
 
7.6%
8 59
 
7.6%
6 59
 
7.6%
9 54
 
6.9%
0 49
 
6.3%
Hangul
ValueCountFrequency (%)
295
50.0%
295
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 777
56.8%
Hangul 590
43.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
295
50.0%
295
50.0%
ASCII
ValueCountFrequency (%)
1 161
20.7%
2 155
19.9%
3 61
 
7.9%
5 60
 
7.7%
4 60
 
7.7%
7 59
 
7.6%
8 59
 
7.6%
6 59
 
7.6%
9 54
 
6.9%
0 49
 
6.3%

등록일자
Date

MISSING 

Distinct196
Distinct (%)66.4%
Missing323
Missing (%)52.3%
Memory size5.0 KiB
Minimum2011-11-16 00:00:00
Maximum2022-07-22 00:00:00
2023-12-12T22:01:20.262482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:01:20.405394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T22:01:11.441874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:01:20.505710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번작물구분상태출원인 업종 구분
순번1.0000.2380.5870.372
작물구분0.2381.0000.2840.493
상태0.5870.2841.0000.291
출원인 업종 구분0.3720.4930.2911.000
2023-12-12T22:01:20.610100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상태출원인 업종 구분작물구분
상태1.0000.1660.156
출원인 업종 구분0.1661.0000.321
작물구분0.1560.3211.000
2023-12-12T22:01:20.704041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번작물구분상태출원인 업종 구분
순번1.0000.1220.3290.205
작물구분0.1221.0000.1560.321
상태0.3290.1561.0000.166
출원인 업종 구분0.2050.3210.1661.000

Missing values

2023-12-12T22:01:11.612568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:01:11.845032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T22:01:12.010703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번출원번호작물명작물구분품종명칭대조품종상태출원인 업종 구분출원일자공개번호공개일자공보 회차품종보호등록번호등록일자
012008-1밤나무산과수대한유마보호등록국가2008-04-212008-12008-05-15제119호제1호2011-11-16
122008-2밤나무산과수미풍유마보호등록국가2008-04-212008-22008-05-15제119호제2호2011-11-16
232008-3백운풀특용백약백운풀무효개인2008-05-142008-42008-07-15제120호<NA><NA>
342008-4표고버섯류하나참균흥556거절종자업계2008-05-262008-32008-06-15제119호<NA><NA>
452008-5밤나무산과수대보유마보호등록국가2008-06-092008-52008-08-15제121호제3호2011-11-16
562008-6밤나무산과수박미1호유마보호등록국가2008-06-092008-62008-08-15제121호제4호2011-11-16
672008-7밤나무산과수박미2호유마보호등록국가2008-06-092008-72008-08-15제121호제5호2011-11-16
782008-8표고버섯류산림9호<NA>보호등록국가2008-07-252008-82008-08-15제121호제57호2014-12-03
892008-9표고버섯류산조702호산조701호보호등록종자업계2008-12-052008-92008-12-15제125호제24호2013-02-25
9102008-10감나무산과수이대시상주대시거절개인2008-12-102008-102009-02-15제127호<NA><NA>
순번출원번호작물명작물구분품종명칭대조품종상태출원인 업종 구분출원일자공개번호공개일자공보 회차품종보호등록번호등록일자
6086092022-43구절초야생화국야황선구절초심사개인2022-12-012023-022023-01-15제294호<NA><NA>
6096102022-44구절초야생화국야반홍구절초심사개인2022-12-012023-032023-01-15제294호<NA><NA>
6106112022-45굴참나무조경수신안천사상수리나무 '미설목'심사지자체2022-12-022023-042023-01-15제294호<NA><NA>
6116122022-46나무수국조경수지알에이치피08라임라이트심사외국2022-12-092023-052023-01-15제294호<NA><NA>
6126132022-47부산꼬리풀X큰구와꼬리풀야생화에스제이-디스코부산꼬리풀심사기타2022-12-152023-062023-01-15제294호<NA><NA>
6136142022-48큰산꼬리풀X이삭꼬리풀 '알바'야생화에스제이-캔들큰산꼬리풀심사기타2022-12-152023-072023-01-15제294호<NA><NA>
6146152022-49벚나무조경수코리골드일반종심사개인2022-12-152023-082023-01-15제294호<NA><NA>
6156162022-50참억새야생화일장춘몽참억새 '그라실리무스'심사기타2022-12-192023-092023-01-15제294호<NA><NA>
6166172022-51표고버섯류산조722호산조718호심사종자업계2022-12-212023-102023-01-15제294호<NA><NA>
6176182022-52목이버섯류산조902호산조901호심사종자업계2022-12-212023-112023-01-15제294호<NA><NA>