Overview

Dataset statistics

Number of variables: 4
Number of observations: 67
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.3 KiB
Average record size in memory: 34.9 B

Variable types

Text: 2
Numeric: 1
Categorical: 1

Dataset

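Description: Patent registrations held by the Gyeongsangnam-do Agricultural Research and Extension Services (경상남도농업기술원) through 2019.
Author: Gyeongsangnam-do (경상남도)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15070938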

Alerts

종류 has constant value "특허" (Constant)
특허명 has unique values (Unique)
등록번호 has unique values (Unique)
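
The alerts above follow from distinct-value counts per column. The sketch below illustrates that idea on the first five sample rows of this report. `classify_columns` is a hypothetical helper written for illustration, not the profiler's own implementation, and the rule it applies (one distinct value means Constant, all values distinct means Unique) is a simplifying assumption.

```python
# First five sample rows from this report: (특허명, 등록년도, 등록번호, 종류)
columns = ["특허명", "등록년도", "등록번호", "종류"]
rows = [
    ("양파 발효주의 제조방법", 2000, "제0256772호", "특허"),
    ("양파 이식방법과 장치", 2001, "제0307388호", "특허"),
    ("양파당과 제조방법", 2002, "제0346923호", "특허"),
    ("황색고구마 국수 제조법", 2002, "제0334517호", "특허"),
    ("녹차 추출물을 함유한 액체세제의 조성물 및 제조 방법", 2003, "제0373391호", "특허"),
]


def classify_columns(columns, rows):
    """Label each column 'Constant', 'Unique', or None from its distinct-value count."""
    alerts = {}
    for i, name in enumerate(columns):
        values = [row[i] for row in rows]
        distinct = len(set(values))
        if distinct == 1:
            alerts[name] = "Constant"   # every row holds the same value
        elif distinct == len(values):
            alerts[name] = "Unique"     # no value repeats
        else:
            alerts[name] = None         # some values repeat, no alert
    return alerts


alerts = classify_columns(columns, rows)
for name, alert in alerts.items():
    print(name, alert)
```

On these five rows 등록년도 gets no alert because 2002 appears twice, which is consistent with the report listing only the other three columns.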

Reproduction

Analysis started: 2024-04-20 21:58:42.930479
Analysis finished: 2024-04-20 21:58:43.936177
Duration: 1.01 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
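
A report of this kind can be regenerated roughly as follows. This is a minimal sketch, assuming the dataset has been exported to a CSV file and that `pandas` and `ydata-profiling` are installed. The function name `build_report`, the file names, and the report title are assumptions made for illustration; the configuration actually used for this run is the one in config.json.

```python
def build_report(csv_path, output_html, title="Patent registrations"):
    """Profile a CSV file and write the result as an HTML report."""
    # Imported lazily so the function can be defined without the packages present.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title=title)
    profile.to_file(output_html)
    return output_html


# Example call (hypothetical file names):
# build_report("patents.csv", "patents_report.html")
```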

Variables

특허명
Text

UNIQUE 

Distinct: 67
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 664.0 B

Length

Max length: 68
Median length: 30
Mean length: 23.268657
Min length: 4

Characters and Unicode

Total characters: 1559
Distinct characters: 278
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
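
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using the first sample title from this report. `category_counts` is a hypothetical helper. The module returns two-letter codes (`Lo` for Other Letter, `Zs` for Space Separator, `Sm` for Math Symbol), whereas the report prints the long names.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "양파 발효주의 제조방법"  # 1st row of 특허명
counts = category_counts(sample)

# Hangul syllables fall under "Lo" and the ASCII space under "Zs".
for code, count in counts.most_common():
    print(code, count)
```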

Unique

Unique: 67
Unique (%): 100.0%

Sample

1st row: 양파 발효주의 제조방법
2nd row: 양파 이식방법과 장치
3rd row: 양파당과 제조방법
4th row: 황색고구마 국수 제조법
5th row: 녹차 추출물을 함유한 액체세제의 조성물 및 제조 방법
Value | Count | Frequency (%)
(not captured) | 27 | 6.9%
제조방법 | 17 | 4.4%
방법 | 12 | 3.1%
이용한 | 11 | 2.8%
(not captured) | 9 | 2.3%
제조 | 7 | 1.8%
조성물 | 7 | 1.8%
단감 | 6 | 1.5%
포함하는 | 5 | 1.3%
양파 | 4 | 1.0%
Other values (234) | 285 | 73.1%

Most occurring characters

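Value | Count | Frequency (%)
(space) | 325 | 20.8%
(not captured) | 45 | 2.9%
(not captured) | 45 | 2.9%
(not captured) | 44 | 2.8%
(not captured) | 40 | 2.6%
(not captured) | 33 | 2.1%
(not captured) | 32 | 2.1%
(not captured) | 28 | 1.8%
(not captured) | 27 | 1.7%
(not captured) | 26 | 1.7%
Other values (268) | 914 | 58.6%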

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1172 | 75.2%
Space Separator | 325 | 20.8%
Uppercase Letter | 27 | 1.7%
Decimal Number | 16 | 1.0%
Lowercase Letter | 6 | 0.4%
Other Punctuation | 4 | 0.3%
Close Punctuation | 3 | 0.2%
Open Punctuation | 3 | 0.2%
Dash Punctuation | 2 | 0.1%
Math Symbol | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 45 | 3.8%
(not captured) | 45 | 3.8%
(not captured) | 44 | 3.8%
(not captured) | 40 | 3.4%
(not captured) | 33 | 2.8%
(not captured) | 32 | 2.7%
(not captured) | 28 | 2.4%
(not captured) | 27 | 2.3%
(not captured) | 26 | 2.2%
(not captured) | 25 | 2.1%
Other values (238) | 827 | 70.6%
Uppercase Letter
Value | Count | Frequency (%)
C | 7 | 25.9%
P | 6 | 22.2%
R | 3 | 11.1%
A | 2 | 7.4%
M | 2 | 7.4%
T | 1 | 3.7%
B | 1 | 3.7%
X | 1 | 3.7%
D | 1 | 3.7%
E | 1 | 3.7%
Other values (2) | 2 | 7.4%
Decimal Number
Value | Count | Frequency (%)
1 | 6 | 37.5%
2 | 3 | 18.8%
4 | 2 | 12.5%
3 | 2 | 12.5%
5 | 2 | 12.5%
9 | 1 | 6.2%
Lowercase Letter
Value | Count | Frequency (%)
t | 1 | 16.7%
n | 1 | 16.7%
y | 1 | 16.7%
e | 1 | 16.7%
h | 1 | 16.7%
u | 1 | 16.7%
Space Separator
Value | Count | Frequency (%)
(space) | 325 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
, | 4 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%
Math Symbol
Value | Count | Frequency (%)
× | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1172 | 75.2%
Common | 354 | 22.7%
Latin | 33 | 2.1%

Most frequent character per script

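Hangul
Value | Count | Frequency (%)
(not captured) | 45 | 3.8%
(not captured) | 45 | 3.8%
(not captured) | 44 | 3.8%
(not captured) | 40 | 3.4%
(not captured) | 33 | 2.8%
(not captured) | 32 | 2.7%
(not captured) | 28 | 2.4%
(not captured) | 27 | 2.3%
(not captured) | 26 | 2.2%
(not captured) | 25 | 2.1%
Other values (238) | 827 | 70.6%
Latin
Value | Count | Frequency (%)
C | 7 | 21.2%
P | 6 | 18.2%
R | 3 | 9.1%
A | 2 | 6.1%
M | 2 | 6.1%
t | 1 | 3.0%
T | 1 | 3.0%
n | 1 | 3.0%
y | 1 | 3.0%
e | 1 | 3.0%
Other values (8) | 8 | 24.2%
Common
Value | Count | Frequency (%)
(space) | 325 | 91.8%
1 | 6 | 1.7%
, | 4 | 1.1%
) | 3 | 0.8%
2 | 3 | 0.8%
( | 3 | 0.8%
- | 2 | 0.6%
4 | 2 | 0.6%
3 | 2 | 0.6%
5 | 2 | 0.6%
Other values (2) | 2 | 0.6%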

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1172 | 75.2%
ASCII | 386 | 24.8%
None | 1 | 0.1%

Most frequent character per block

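ASCII
Value | Count | Frequency (%)
(space) | 325 | 84.2%
C | 7 | 1.8%
P | 6 | 1.6%
1 | 6 | 1.6%
, | 4 | 1.0%
) | 3 | 0.8%
R | 3 | 0.8%
2 | 3 | 0.8%
( | 3 | 0.8%
A | 2 | 0.5%
Other values (19) | 24 | 6.2%
Hangul
Value | Count | Frequency (%)
(not captured) | 45 | 3.8%
(not captured) | 45 | 3.8%
(not captured) | 44 | 3.8%
(not captured) | 40 | 3.4%
(not captured) | 33 | 2.8%
(not captured) | 32 | 2.7%
(not captured) | 28 | 2.4%
(not captured) | 27 | 2.3%
(not captured) | 26 | 2.2%
(not captured) | 25 | 2.1%
Other values (238) | 827 | 70.6%
None
Value | Count | Frequency (%)
× | 1 | 100.0%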

등록년도
Real number (ℝ)

Distinct: 20
Distinct (%): 29.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2011.8955
Minimum: 2000
Maximum: 2021
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 731.0 B

Quantile statistics

Minimum: 2000
5-th percentile: 2002.3
Q1: 2007
Median: 2014
Q3: 2017
95-th percentile: 2020
Maximum: 2021
Range: 21
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 6.0880766
Coefficient of variation (CV): 0.0030260401
Kurtosis: -1.1794633
Mean: 2011.8955
Median Absolute Deviation (MAD): 5
Skewness: -0.30869517
Sum: 134797
Variance: 37.064677
Monotonicity: Increasing
Histogram with fixed size bins (bins=20)
Common values
Value | Count | Frequency (%)
2014 | 7 | 10.4%
2012 | 6 | 9.0%
2008 | 6 | 9.0%
2017 | 5 | 7.5%
2003 | 5 | 7.5%
2020 | 5 | 7.5%
2015 | 5 | 7.5%
2005 | 4 | 6.0%
2019 | 4 | 6.0%
2004 | 3 | 4.5%
Other values (10) | 17 | 25.4%
Minimum 10 values
Value | Count | Frequency (%)
2000 | 1 | 1.5%
2001 | 1 | 1.5%
2002 | 2 | 3.0%
2003 | 5 | 7.5%
2004 | 3 | 4.5%
2005 | 4 | 6.0%
2007 | 2 | 3.0%
2008 | 6 | 9.0%
2009 | 1 | 1.5%
2011 | 1 | 1.5%
Maximum 10 values
Value | Count | Frequency (%)
2021 | 2 | 3.0%
2020 | 5 | 7.5%
2019 | 4 | 6.0%
2018 | 3 | 4.5%
2017 | 5 | 7.5%
2016 | 3 | 4.5%
2015 | 5 | 7.5%
2014 | 7 | 10.4%
2013 | 1 | 1.5%
2012 | 6 | 9.0%
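
The two extreme-value tables above together cover all 20 distinct years, so the reported summary statistics can be cross-checked from the counts alone. The sketch below does this with Python's standard `statistics` module. It assumes the report uses the sample variance (divisor n - 1) and linearly interpolated quantiles, which corresponds to `method="inclusive"` here; both assumptions reproduce the figures shown in this report.

```python
import statistics

# Year -> count, transcribed from the minimum/maximum value tables of 등록년도.
year_counts = {
    2000: 1, 2001: 1, 2002: 2, 2003: 5, 2004: 3, 2005: 4, 2007: 2,
    2008: 6, 2009: 1, 2011: 1, 2012: 6, 2013: 1, 2014: 7, 2015: 5,
    2016: 3, 2017: 5, 2018: 3, 2019: 4, 2020: 5, 2021: 2,
}

# Expand the frequency table back into one value per observation.
years = sorted(year for year, count in year_counts.items() for _ in range(count))

n = len(years)
total = sum(years)
mean = statistics.mean(years)
variance = statistics.variance(years)   # sample variance, divisor n - 1
stdev = statistics.stdev(years)
cv = stdev / mean                       # coefficient of variation
q1, median, q3 = statistics.quantiles(years, n=4, method="inclusive")

print(n, total, round(mean, 4), round(variance, 6), round(stdev, 7))
print(q1, median, q3, q3 - q1)
```

The coefficient of variation is tiny because calendar years sit far from zero relative to their spread; the standard deviation and IQR are the more informative measures here.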

등록번호
Text

UNIQUE 

Distinct: 67
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 664.0 B

Length

Max length: 16
Median length: 12
Mean length: 11.447761
Min length: 9

Characters and Unicode

Total characters: 767
Distinct characters: 14
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 67
Unique (%): 100.0%

Sample

1st row: 제0256772호
2nd row: 제0307388호
3rd row: 제0346923호
4th row: 제0334517호
5th row: 제0373391호
Value | Count | Frequency (%)
제0256772호 | 1 | 1.5%
제10-1366936호 | 1 | 1.5%
제10-1438214호 | 1 | 1.5%
제10-1431768호 | 1 | 1.5%
제10-1389628호 | 1 | 1.5%
제10-1348905호 | 1 | 1.5%
제10-1578925호 | 1 | 1.5%
제10-1507225호 | 1 | 1.5%
제10-1517678호 | 1 | 1.5%
제10-0009876호 | 1 | 1.5%
Other values (57) | 57 | 85.1%
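
The values above mix two layouts: a bare serial such as 제0256772호 and a hyphenated form such as 제10-1366936호. A minimal sketch for splitting both into parts follows. The pattern and the helper `parse_registration` are assumptions based only on the values visible in this report, not an official number format, and the serial is kept as a string so leading zeros survive.

```python
import re

# 제 + optional two-digit prefix and hyphen + serial digits + 호
PATTERN = re.compile(r"^제(?:(\d{2})-)?(\d+)호$")


def parse_registration(value):
    """Split a registration number into prefix and serial, or return None."""
    match = PATTERN.match(value.strip())  # tolerate leading/trailing spaces
    if match is None:
        return None
    prefix, serial = match.groups()
    return {"prefix": prefix, "serial": serial}


for value in ["제0256772호", "제10-1366936호", "제10-23042620000호"]:
    print(value, parse_registration(value))
```

Values containing an inner space or other unexpected characters return `None` rather than a guess, so they can be reviewed by hand.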

Most occurring characters

Value | Count | Frequency (%)
0 | 136 | 17.7%
1 | 113 | 14.7%
제 | 67 | 8.7%
호 | 67 | 8.7%
2 | 56 | 7.3%
- | 51 | 6.6%
8 | 51 | 6.6%
7 | 42 | 5.5%
3 | 40 | 5.2%
6 | 38 | 5.0%
Other values (4) | 106 | 13.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 581 | 75.7%
Other Letter | 134 | 17.5%
Dash Punctuation | 51 | 6.6%
Space Separator | 1 | 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 136 | 23.4%
1 | 113 | 19.4%
2 | 56 | 9.6%
8 | 51 | 8.8%
7 | 42 | 7.2%
3 | 40 | 6.9%
6 | 38 | 6.5%
4 | 38 | 6.5%
9 | 35 | 6.0%
5 | 32 | 5.5%
Other Letter
Value | Count | Frequency (%)
제 | 67 | 50.0%
호 | 67 | 50.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 51 | 100.0%
Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 633 | 82.5%
Hangul | 134 | 17.5%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 136 | 21.5%
1 | 113 | 17.9%
2 | 56 | 8.8%
- | 51 | 8.1%
8 | 51 | 8.1%
7 | 42 | 6.6%
3 | 40 | 6.3%
6 | 38 | 6.0%
4 | 38 | 6.0%
9 | 35 | 5.5%
Other values (2) | 33 | 5.2%
Hangul
Value | Count | Frequency (%)
제 | 67 | 50.0%
호 | 67 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 633 | 82.5%
Hangul | 134 | 17.5%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 136 | 21.5%
1 | 113 | 17.9%
2 | 56 | 8.8%
- | 51 | 8.1%
8 | 51 | 8.1%
7 | 42 | 6.6%
3 | 40 | 6.3%
6 | 38 | 6.0%
4 | 38 | 6.0%
9 | 35 | 5.5%
Other values (2) | 33 | 5.2%
Hangul
Value | Count | Frequency (%)
제 | 67 | 50.0%
호 | 67 | 50.0%

종류
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 664.0 B
특허: 67

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 특허
2nd row: 특허
3rd row: 특허
4th row: 특허
5th row: 특허

Common Values

Value | Count | Frequency (%)
특허 | 67 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
특허 | 67 | 100.0%

Interactions


Correlations

 | 특허명 | 등록년도 | 등록번호
특허명 | 1.000 | 1.000 | 1.000
등록년도 | 1.000 | 1.000 | 1.000
등록번호 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

 | 특허명 | 등록년도 | 등록번호 | 종류
0 | 양파 발효주의 제조방법 | 2000 | 제0256772호 | 특허
1 | 양파 이식방법과 장치 | 2001 | 제0307388호 | 특허
2 | 양파당과 제조방법 | 2002 | 제0346923호 | 특허
3 | 황색고구마 국수 제조법 | 2002 | 제0334517호 | 특허
4 | 녹차 추출물을 함유한 액체세제의 조성물 및 제조 방법 | 2003 | 제0373391호 | 특허
5 | 매실캔디 | 2003 | 제0392030호 | 특허
6 | 조미건조두부 제조법 | 2003 | 제0385461호 | 특허
7 | 부직포를 이용한 벼 육묘 방법 | 2003 | 제0402721호 | 특허
8 | 녹차 잎의 처리 방법을 이용한 아이스캔디 및 그 제조 방법 | 2003 | 제0375542호 | 특허
9 | 파프리카잼 | 2004 | 제0419075호 | 특허

Last rows

 | 특허명 | 등록년도 | 등록번호 | 종류
57 | 증숙단계를 포함하는 사과말랭이 제조방법 | 2019 | 제10-20200048호 | 특허
58 | 분쇄된 사과말랭이를 이용한 가공식품의 제조방법 | 2019 | 제10-20200053호 | 특허
59 | 기호도 및 기능성이 향상된 섬애쑥 발효물 및 이의 제조방법 | 2019 | 제10-2007569호 | 특허
60 | 곤충의 효소 가수분해물을 함유하는 미백용 조성물 | 2020 | 제10-2265019호 | 특허
61 | 곤충의 효소 가수분해물을 함유하는 항비만 조성물 | 2020 | 제10-2276689호 | 특허
62 | 항비만 효능을 갖는 펩타이드 및 이의 용도 | 2020 | 제10-2288716호 | 특허
63 | 고구마 라떼용 믹스제조방법 및 이를 이용하여 제조된 고구마 라떼 | 2020 | 제10-23042620000호 | 특허
64 | 고로쇠 수액을 이용한 전통주 제조방법 | 2020 | 제10-22800820000호 | 특허
65 | 파이토프토라 속 균주 특이 검출을 위한 프라이머 세트 및 이의 용도 | 2021 | 제10-2238486호 | 특허
66 | 단감잎차 및 단감 착즙액을 포함하는 2차 발효에 의한 감잎 발효음료 제조방법 | 2021 | 제10-2269917호 | 특허