Overview

Dataset statistics

Number of variables: 4
Number of observations: 82
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 1.2%
Total size in memory: 2.8 KiB
Average record size in memory: 34.6 B

Variable types

Text: 2
Numeric: 1
Categorical: 1

Dataset

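Description: Patent registrations of the Gyeongsangnam-do Agricultural Research and Extension Services (경상남도농업기술원) through 2023. The data includes the patent title (특허명), registration year (등록년도), registration number (등록번호), and type (종류).
Author: Gyeongsangnam-do (경상남도)
URL: https://www.data.go.kr/data/15070938/fileData.do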

Alerts

종류 has constant value "특허" (Constant)
Dataset has 1 (1.2%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2024-03-14 12:06:35.655770
Analysis finished: 2024-03-14 12:06:36.466420
Duration: 0.81 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

특허명 (patent title)
Text

Distinct: 81
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 784.0 B

Length

Max length: 68
Median length: 34.5
Mean length: 25.707317
Min length: 4

Characters and Unicode

Total characters: 2108
Distinct characters: 298
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
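
As a rough illustration of the character properties mentioned above, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The two sample strings are taken from this report; the helper name `category_counts` is hypothetical, and the profiling tool's own implementation may differ. The short codes map to the long names used in the tables here (for example `Lo` is Other Letter, `Zs` is Space Separator, `Lu` is Uppercase Letter, `Nd` is Decimal Number).

```python
import unicodedata
from collections import Counter

# Two sample values taken from this report; any list of strings would work.
samples = [
    "양파 발효주의 제조방법",
    "표고버섯 병재배가능 재배기간 단축 균주 BL48 (KACC93342P)",
]


def category_counts(values):
    """Count Unicode general categories (e.g. 'Lo', 'Zs') over all characters."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts


counts = category_counts(samples)
for category, count in counts.most_common():
    print(category, count)
```

Scripts and blocks are not exposed by `unicodedata`, so reproducing those tables would need an additional data source.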

Unique

Unique: 80
Unique (%): 97.6%

Sample

1st row: 양파 발효주의 제조방법
2nd row: 양파 이식방법과 장치
3rd row: 양파당과 제조방법
4th row: 황색고구마 국수 제조법
5th row: 녹차 추출물을 함유한 액체세제의 조성물 및 제조 방법
Value | Count | Frequency (%)
(not captured) | 32 | 6.2%
제조방법 | 19 | 3.7%
방법 | 13 | 2.5%
이용한 | 13 | 2.5%
조성물 | 12 | 2.3%
(not captured) | 9 | 1.8%
이의 | 8 | 1.6%
제조 | 8 | 1.6%
함유하는 | 7 | 1.4%
단감 | 6 | 1.2%
Other values (292) | 385 | 75.2%
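
A word-frequency table like the one above can be approximated by splitting each title on whitespace and counting tokens. This is a minimal sketch under that assumption; the report's actual tokenizer is not documented here and may differ. The five titles are the sample rows shown earlier, and `word_frequencies` is a hypothetical helper name.

```python
from collections import Counter

# The five sample titles listed in this report.
titles = [
    "양파 발효주의 제조방법",
    "양파 이식방법과 장치",
    "양파당과 제조방법",
    "황색고구마 국수 제조법",
    "녹차 추출물을 함유한 액체세제의 조성물 및 제조 방법",
]


def word_frequencies(values):
    """Return (word, count, share) tuples, most frequent first.

    Assumes whitespace tokenization with no normalization.
    """
    words = Counter()
    for value in values:
        words.update(value.split())
    total = sum(words.values())
    return [(word, n, n / total) for word, n in words.most_common()]


for word, n, share in word_frequencies(titles):
    print(f"{word} | {n} | {share:.1%}")
```

Note that whitespace splitting treats 제조방법 and 제조 방법 as different tokens, which is consistent with both forms appearing separately in the table.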

Most occurring characters

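Value | Count | Frequency (%)
(space) | 434 | 20.6%
(not captured) | 60 | 2.8%
(not captured) | 50 | 2.4%
(not captured) | 48 | 2.3%
(not captured) | 48 | 2.3%
(not captured) | 43 | 2.0%
(not captured) | 39 | 1.9%
(not captured) | 37 | 1.8%
(not captured) | 34 | 1.6%
(not captured) | 32 | 1.5%
Other values (288) | 1283 | 60.9%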

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1557 | 73.9%
Space Separator | 434 | 20.6%
Uppercase Letter | 44 | 2.1%
Decimal Number | 39 | 1.9%
Open Punctuation | 8 | 0.4%
Close Punctuation | 8 | 0.4%
Other Punctuation | 6 | 0.3%
Lowercase Letter | 6 | 0.3%
Dash Punctuation | 5 | 0.2%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 60 | 3.9%
(not captured) | 50 | 3.2%
(not captured) | 48 | 3.1%
(not captured) | 48 | 3.1%
(not captured) | 43 | 2.8%
(not captured) | 39 | 2.5%
(not captured) | 37 | 2.4%
(not captured) | 34 | 2.2%
(not captured) | 32 | 2.1%
(not captured) | 31 | 2.0%
Other values (254) | 1135 | 72.9%

Uppercase Letter
Value | Count | Frequency (%)
C | 11 | 25.0%
P | 8 | 18.2%
B | 4 | 9.1%
A | 4 | 9.1%
K | 3 | 6.8%
R | 3 | 6.8%
L | 2 | 4.5%
M | 2 | 4.5%
F | 2 | 4.5%
E | 1 | 2.3%
Other values (4) | 4 | 9.1%

Decimal Number
Value | Count | Frequency (%)
1 | 10 | 25.6%
3 | 6 | 15.4%
2 | 6 | 15.4%
5 | 5 | 12.8%
4 | 5 | 12.8%
8 | 3 | 7.7%
9 | 3 | 7.7%
0 | 1 | 2.6%

Lowercase Letter
Value | Count | Frequency (%)
y | 1 | 16.7%
e | 1 | 16.7%
n | 1 | 16.7%
t | 1 | 16.7%
u | 1 | 16.7%
h | 1 | 16.7%

Space Separator
Value | Count | Frequency (%)
(space) | 434 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 8 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 8 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 6 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 5 | 100.0%

Math Symbol
Value | Count | Frequency (%)
× | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1557 | 73.9%
Common | 501 | 23.8%
Latin | 50 | 2.4%

Most frequent character per script

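Hangul
Value | Count | Frequency (%)
(not captured) | 60 | 3.9%
(not captured) | 50 | 3.2%
(not captured) | 48 | 3.1%
(not captured) | 48 | 3.1%
(not captured) | 43 | 2.8%
(not captured) | 39 | 2.5%
(not captured) | 37 | 2.4%
(not captured) | 34 | 2.2%
(not captured) | 32 | 2.1%
(not captured) | 31 | 2.0%
Other values (254) | 1135 | 72.9%

Latin
Value | Count | Frequency (%)
C | 11 | 22.0%
P | 8 | 16.0%
B | 4 | 8.0%
A | 4 | 8.0%
K | 3 | 6.0%
R | 3 | 6.0%
L | 2 | 4.0%
M | 2 | 4.0%
F | 2 | 4.0%
E | 1 | 2.0%
Other values (10) | 10 | 20.0%

Common
Value | Count | Frequency (%)
(space) | 434 | 86.6%
1 | 10 | 2.0%
( | 8 | 1.6%
) | 8 | 1.6%
3 | 6 | 1.2%
2 | 6 | 1.2%
, | 6 | 1.2%
5 | 5 | 1.0%
4 | 5 | 1.0%
- | 5 | 1.0%
Other values (4) | 8 | 1.6%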

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1557 | 73.9%
ASCII | 550 | 26.1%
None | 1 | < 0.1%

Most frequent character per block

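ASCII
Value | Count | Frequency (%)
(space) | 434 | 78.9%
C | 11 | 2.0%
1 | 10 | 1.8%
P | 8 | 1.5%
( | 8 | 1.5%
) | 8 | 1.5%
3 | 6 | 1.1%
2 | 6 | 1.1%
, | 6 | 1.1%
5 | 5 | 0.9%
Other values (23) | 48 | 8.7%

Hangul
Value | Count | Frequency (%)
(not captured) | 60 | 3.9%
(not captured) | 50 | 3.2%
(not captured) | 48 | 3.1%
(not captured) | 48 | 3.1%
(not captured) | 43 | 2.8%
(not captured) | 39 | 2.5%
(not captured) | 37 | 2.4%
(not captured) | 34 | 2.2%
(not captured) | 32 | 2.1%
(not captured) | 31 | 2.0%
Other values (254) | 1135 | 72.9%

None
Value | Count | Frequency (%)
× | 1 | 100.0%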

등록년도 (registration year)
Real number (ℝ)

Distinct: 22
Distinct (%): 26.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2013.8293
Minimum: 2000
Maximum: 2023
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 866.0 B

Quantile statistics

Minimum: 2000
5th percentile: 2003
Q1: 2008
Median: 2015
Q3: 2020
95th percentile: 2023
Maximum: 2023
Range: 23
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 6.8687966
Coefficient of variation (CV): 0.0034108138
Kurtosis: -1.0913671
Mean: 2013.8293
Median Absolute Deviation (MAD): 6
Skewness: -0.36408905
Sum: 165134
Variance: 47.180367
Monotonicity: Increasing
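
Several of the figures above are derived from one another, which makes them easy to sanity-check. The sketch below recomputes the range, IQR, coefficient of variation, variance, and mean from the other reported values. It only uses numbers printed in this report; the variable names are illustrative.

```python
# Values copied from the statistics reported for 등록년도.
n_rows = 82
total = 165134
mean = 2013.8293
std = 6.8687966
minimum, q1, q3, maximum = 2000, 2008, 2020, 2023

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance is the squared standard deviation
mean_from_sum = total / n_rows    # Mean recomputed from Sum and row count

print(f"range={value_range}, IQR={iqr}")
print(f"CV={cv:.10f}, variance={variance:.6f}, mean={mean_from_sum:.4f}")
```

The very small CV reflects that calendar years sit far from zero, so it says little about the spread of registrations over time; the standard deviation and IQR are more informative here.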
Histogram with fixed size bins (bins=22)

Common values
Value | Count | Frequency (%)
2023 | 8 | 9.8%
2014 | 7 | 8.5%
2022 | 6 | 7.3%
2012 | 6 | 7.3%
2008 | 6 | 7.3%
2020 | 5 | 6.1%
2017 | 5 | 6.1%
2003 | 5 | 6.1%
2015 | 5 | 6.1%
2019 | 4 | 4.9%
Other values (12) | 25 | 30.5%

Minimum 10 values
Value | Count | Frequency (%)
2000 | 1 | 1.2%
2001 | 1 | 1.2%
2002 | 2 | 2.4%
2003 | 5 | 6.1%
2004 | 3 | 3.7%
2005 | 4 | 4.9%
2007 | 2 | 2.4%
2008 | 6 | 7.3%
2009 | 1 | 1.2%
2011 | 1 | 1.2%

Maximum 10 values
Value | Count | Frequency (%)
2023 | 8 | 9.8%
2022 | 6 | 7.3%
2021 | 3 | 3.7%
2020 | 5 | 6.1%
2019 | 4 | 4.9%
2018 | 3 | 3.7%
2017 | 5 | 6.1%
2016 | 3 | 3.7%
2015 | 5 | 6.1%
2014 | 7 | 8.5%

등록번호 (registration number)
Text

Distinct: 81
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 784.0 B

Length

Max length: 17
Median length: 12
Mean length: 12.463415
Min length: 9

Characters and Unicode

Total characters: 1022
Distinct characters: 14
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 80
Unique (%): 97.6%

Sample

1st row: 제0256772호
2nd row: 제0307388호
3rd row: 제0346923호
4th row: 제0334517호
5th row: 제0373391호
Value | Count | Frequency (%)
제10-2486956-0000호 | 2 | 2.4%
제0256772호 | 1 | 1.2%
제10-1790049호 | 1 | 1.2%
제10-20200053호 | 1 | 1.2%
제10-20200048호 | 1 | 1.2%
제10-2029913호 | 1 | 1.2%
제10-1825813호 | 1 | 1.2%
제10-1906683호 | 1 | 1.2%
제10-1835224호 | 1 | 1.2%
제10-1775778호 | 1 | 1.2%
Other values (71) | 71 | 86.6%
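
The values above mix several layouts (for example 제0256772호, 제10-1790049호, and 제10-2486956-0000호), and the category table further down shows one stray space. If the column needs to be compared or joined on, a normalizing parser can help. The following is a minimal sketch that assumes every value is wrapped in 제 ... 호 with an optional two-digit prefix and an optional four-digit suffix; that pattern is inferred from the listed values only, and the meaning of each part is not stated in the source. `parse_registration` is a hypothetical helper.

```python
import re

# Pattern inferred from the values shown in this report (an assumption):
# 제 + optional "NN-" prefix + digits + optional "-NNNN" suffix + 호
PATTERN = re.compile(r"^제(?:(\d{2})-)?(\d+)(?:-(\d{4}))?호$")


def parse_registration(value):
    """Split a registration number into (prefix, number, suffix).

    Returns None when the value does not fit the assumed pattern.
    Missing parts are returned as None.
    """
    compact = "".join(value.split())  # drop any stray whitespace
    match = PATTERN.match(compact)
    if match is None:
        return None
    return match.groups()


for raw in ["제0256772호", "제10-1790049호", "제10-2486956-0000호"]:
    print(raw, parse_registration(raw))
```

Values that return `None` would need manual review rather than silent coercion.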

Most occurring characters

Value | Count | Frequency (%)
0 | 219 | 21.4%
1 | 132 | 12.9%
제 | 82 | 8.0%
호 | 82 | 8.0%
- | 81 | 7.9%
2 | 79 | 7.7%
8 | 61 | 6.0%
4 | 51 | 5.0%
7 | 51 | 5.0%
9 | 49 | 4.8%
Other values (4) | 135 | 13.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 776 | 75.9%
Other Letter | 164 | 16.0%
Dash Punctuation | 81 | 7.9%
Space Separator | 1 | 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 219 | 28.2%
1 | 132 | 17.0%
2 | 79 | 10.2%
8 | 61 | 7.9%
4 | 51 | 6.6%
7 | 51 | 6.6%
9 | 49 | 6.3%
3 | 47 | 6.1%
6 | 45 | 5.8%
5 | 42 | 5.4%

Other Letter
Value | Count | Frequency (%)
제 | 82 | 50.0%
호 | 82 | 50.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 81 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 858 | 84.0%
Hangul | 164 | 16.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 219 | 25.5%
1 | 132 | 15.4%
- | 81 | 9.4%
2 | 79 | 9.2%
8 | 61 | 7.1%
4 | 51 | 5.9%
7 | 51 | 5.9%
9 | 49 | 5.7%
3 | 47 | 5.5%
6 | 45 | 5.2%
Other values (2) | 43 | 5.0%

Hangul
Value | Count | Frequency (%)
제 | 82 | 50.0%
호 | 82 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 858 | 84.0%
Hangul | 164 | 16.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 219 | 25.5%
1 | 132 | 15.4%
- | 81 | 9.4%
2 | 79 | 9.2%
8 | 61 | 7.1%
4 | 51 | 5.9%
7 | 51 | 5.9%
9 | 49 | 5.7%
3 | 47 | 5.5%
6 | 45 | 5.2%
Other values (2) | 43 | 5.0%

Hangul
Value | Count | Frequency (%)
제 | 82 | 50.0%
호 | 82 | 50.0%

종류 (type)
Categorical

Alert: Constant

Distinct: 1
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 784.0 B

Value | Count
특허 | 82

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 특허
2nd row: 특허
3rd row: 특허
4th row: 특허
5th row: 특허

Common Values

Value | Count | Frequency (%)
특허 | 82 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
특허 | 82 | 100.0%

Interactions


Correlations

Variable | 특허명 | 등록년도 | 등록번호
특허명 | 1.000 | 1.000 | 1.000
등록년도 | 1.000 | 1.000 | 1.000
등록번호 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Row | 특허명 | 등록년도 | 등록번호 | 종류
0 | 양파 발효주의 제조방법 | 2000 | 제0256772호 | 특허
1 | 양파 이식방법과 장치 | 2001 | 제0307388호 | 특허
2 | 양파당과 제조방법 | 2002 | 제0346923호 | 특허
3 | 황색고구마 국수 제조법 | 2002 | 제0334517호 | 특허
4 | 녹차 추출물을 함유한 액체세제의 조성물 및 제조 방법 | 2003 | 제0373391호 | 특허
5 | 매실캔디 | 2003 | 제0392030호 | 특허
6 | 조미건조두부 제조법 | 2003 | 제0385461호 | 특허
7 | 부직포를 이용한 벼 육묘 방법 | 2003 | 제0402721호 | 특허
8 | 녹차 잎의 처리 방법을 이용한 아이스캔디 및 그 제조 방법 | 2003 | 제0375542호 | 특허
9 | 파프리카잼 | 2004 | 제0419075호 | 특허

Last rows
Row | 특허명 | 등록년도 | 등록번호 | 종류
72 | 흰점박이꽃무지 유충의 효소 가수분해물을 함유하는 항비만 조성물 | 2022 | 제10-2429417-0000호 | 특허
73 | 표고버섯 병재배가능 재배기간 단축 균주 BL48 (KACC93342P) | 2022 | 제10-2479468-0000호 | 특허
74 | 갈색느티만가닥버섯과 공동재배가 가능한 백색느티만가닥버섯 신균주 BW80 (KACC93351P) | 2023 | 제10-2499480-0000호 | 특허
75 | 흰점박이꽃무지 유충의 효소 가수분해물을 함유하는 피부 미백용 조성물 | 2023 | 제10-2486956-0000호 | 특허
76 | 항비만 효능을 갖는 펩타이드(펩타이드 8) 및 이의 용도 | 2023 | 제10-2537908-0000호 | 특허
77 | 항비만 효능을 갖는 펩타이드(펩타이드 15) 및 이의 용도 | 2023 | 제10-2537907-0000호 | 특허
78 | 감 과실 자동 등분 장치 | 2023 | 제10-2490512-0000호 | 특허
79 | 분무 살포가 가능한 식물의 에틸렌 작용 억제제로서 1-(2,2-디메틸프로필)-사이클로프로펜의 제법 및 이의 용도 | 2023 | 제10-2533748-0000호 | 특허
80 | 진균에 대한 선택적 향균 활성을 갖는 바실러스 소노렌시스 FFB415 균주 | 2023 | 제10-2597882-0000호 | 특허
81 | 흰점박이꽃무지 유충의 효소 가수분해물을 함유하는 피부 미백용 조성물 | 2023 | 제10-2486956-0000호 | 특허

Duplicate rows

Most frequently occurring

Row | 특허명 | 등록년도 | 등록번호 | 종류 | # duplicates
0 | 흰점박이꽃무지 유충의 효소 가수분해물을 함유하는 피부 미백용 조성물 | 2023 | 제10-2486956-0000호 | 특허 | 2
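
The duplicate reported above can be found by counting whole rows. The sketch below does this with the standard library on three rows copied from the sample (rows 75, 78, and 81); it is an illustration of the idea, not the profiling tool's implementation, and `duplicate_summary` is a hypothetical helper.

```python
from collections import Counter

# Rows 75, 78 and 81 from the sample in this report, as tuples.
rows = [
    ("흰점박이꽃무지 유충의 효소 가수분해물을 함유하는 피부 미백용 조성물", 2023, "제10-2486956-0000호", "특허"),
    ("감 과실 자동 등분 장치", 2023, "제10-2490512-0000호", "특허"),
    ("흰점박이꽃무지 유충의 효소 가수분해물을 함유하는 피부 미백용 조성물", 2023, "제10-2486956-0000호", "특허"),
]


def duplicate_summary(records):
    """Return rows that occur more than once and the number of extra copies."""
    counts = Counter(records)
    repeated = {row: n for row, n in counts.items() if n > 1}
    extra_copies = sum(n - 1 for n in repeated.values())
    return repeated, extra_copies


repeated, extra_copies = duplicate_summary(rows)
for row, n in repeated.items():
    print(row[2], "occurs", n, "times")

# One extra copy among the 82 observations matches the reported 1.2%.
print(f"{extra_copies / 82:.1%}")
```

Dropping the extra copy would leave 81 rows, which agrees with the 81 distinct values reported for both 특허명 and 등록번호.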