Overview

Dataset statistics

Number of variables: 6
Number of observations: 37
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.9 KiB
Average record size in memory: 52.6 B

Variable types

Numeric: 1
Text: 2
Categorical: 2
DateTime: 1

Dataset

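Description: Provides the name, registration number, managing department, and related fields for the province-owned intellectual property rights (patents, utility models, designs) held by Chungcheongnam-do.
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=426&beforeMenuCd=DOM_000000201001001000&publicdatapk=15019724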

Alerts

분류 is highly overall correlated with 담당기관 (High correlation)
담당기관 is highly overall correlated with 분류 (High correlation)
분류 is highly imbalanced (59.4%) (Imbalance)
연번 has unique values (Unique)
명칭 has unique values (Unique)
특허번호 has unique values (Unique)
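
The 59.4% imbalance figure for 분류 can be reproduced from the class counts reported later in this report (특허 = 34, 디자인 = 3). The sketch below assumes the score is one minus the Shannon entropy of the class proportions, normalized by the maximum entropy for that number of classes; the function name `imbalance_score` is illustrative and not part of any library.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (in bits) of the class shares."""
    k = len(counts)
    if k <= 1:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Class counts for 분류 as reported: 특허 (patent) = 34, 디자인 (design) = 3
score = imbalance_score([34, 3])
print(f"{score:.1%}")
```

A score of 0 would mean perfectly balanced classes, and a score of 1 would mean a single class holds every row.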

Reproduction

Analysis started: 2024-01-09 19:58:28.443317
Analysis finished: 2024-01-09 19:58:29.179900
Duration: 0.74 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
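
As a quick check, the reported duration follows from the two timestamps above. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section
started = datetime.fromisoformat("2024-01-09 19:58:28.443317")
finished = datetime.fromisoformat("2024-01-09 19:58:29.179900")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```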

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 37
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19
Minimum: 1
Maximum: 37
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 465.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.8
Q1: 10
Median: 19
Q3: 28
95th percentile: 35.2
Maximum: 37
Range: 36
Interquartile range (IQR): 18
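
Because 연번 is simply the integers 1 through 37, the quantiles above can be checked by hand. The sketch below assumes linear interpolation between neighbouring order statistics, which is consistent with the fractional values 2.8 and 35.2; `quantile_linear` is a hypothetical helper, not a library function.

```python
def quantile_linear(sorted_values, q):
    """Quantile with linear interpolation between the two nearest order statistics."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    frac = pos - lower
    if lower + 1 >= len(sorted_values):
        return float(sorted_values[-1])
    return sorted_values[lower] + frac * (sorted_values[lower + 1] - sorted_values[lower])


values = list(range(1, 38))  # 연번 runs from 1 to 37 with no gaps
quantiles = {q: quantile_linear(values, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95)}
iqr = quantiles[0.75] - quantiles[0.25]
value_range = values[-1] - values[0]

for q, v in quantiles.items():
    print(f"{q:.2f}: {v:.1f}")
print("IQR:", iqr, "Range:", value_range)
```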

Descriptive statistics

Standard deviation: 10.824355
Coefficient of variation (CV): 0.56970291
Kurtosis: -1.2
Mean: 19
Median Absolute Deviation (MAD): 9
Skewness: 0
Sum: 703
Variance: 117.16667
Monotonicity: Strictly increasing
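
The descriptive statistics can likewise be recomputed for the sequence 1 to 37. This sketch assumes the variance and standard deviation use the sample (n - 1) denominator, which matches the reported 117.16667; it relies only on Python's `statistics` module.

```python
import statistics

values = list(range(1, 38))  # 연번 values 1..37

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = statistics.stdev(values)           # sample standard deviation
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, round(variance, 5), round(std, 6), round(cv, 8), mad, strictly_increasing)
```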
Histogram with fixed size bins (bins=37)
Value | Count | Frequency (%)
1 | 1 | 2.7%
29 | 1 | 2.7%
22 | 1 | 2.7%
23 | 1 | 2.7%
24 | 1 | 2.7%
25 | 1 | 2.7%
26 | 1 | 2.7%
27 | 1 | 2.7%
28 | 1 | 2.7%
30 | 1 | 2.7%
Other values (27) | 27 | 73.0%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 2.7%
2 | 1 | 2.7%
3 | 1 | 2.7%
4 | 1 | 2.7%
5 | 1 | 2.7%
6 | 1 | 2.7%
7 | 1 | 2.7%
8 | 1 | 2.7%
9 | 1 | 2.7%
10 | 1 | 2.7%

Maximum 10 values
Value | Count | Frequency (%)
37 | 1 | 2.7%
36 | 1 | 2.7%
35 | 1 | 2.7%
34 | 1 | 2.7%
33 | 1 | 2.7%
32 | 1 | 2.7%
31 | 1 | 2.7%
30 | 1 | 2.7%
29 | 1 | 2.7%
28 | 1 | 2.7%

명칭
Text

UNIQUE 

Distinct: 37
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Length

Max length: 67
Median length: 34
Mean length: 29.351351
Min length: 6

Characters and Unicode

Total characters: 1086
Distinct characters: 212
Distinct categories: 6
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
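
To illustrate how such character properties can be read, the sketch below counts Unicode general categories for the second sample title using Python's `unicodedata` module. The two-letter codes map to the category names used in this report: `Lo` = Other Letter, `Zs` = Space Separator, `Nd` = Decimal Number, `Lu` = Uppercase Letter, `Po` = Other Punctuation, `Pd` = Dash Punctuation. The helper name `category_counts` is illustrative, and this is not necessarily how the profiling tool computes its totals.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count Unicode general categories per character, after NFC normalization."""
    normalized = unicodedata.normalize("NFC", text)
    return Counter(unicodedata.category(ch) for ch in normalized)


# Second sample row of the 명칭 column
title = "NSB-1 균주, 이를 함유하는 탄저병 방제용 조성물 및 탄저병 방제방법"
counts = category_counts(title)

for code, n in counts.most_common():
    print(code, n)
```

Summing such counts over all 37 titles would give per-category totals comparable to the category table in this section.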

Unique

Unique: 37
Unique (%): 100.0%

Sample

1st row: 고순도의 4-엔-부틸레조시놀 결정을 제조하는 방법
2nd row: NSB-1 균주, 이를 함유하는 탄저병 방제용 조성물 및 탄저병 방제방법
3rd row: 액비제조 균주 공급이 가능한 탈취시스템 및 이를 구비한 액비화 장치
4th row: 도로안전시설물 가드레일 부착 차단막
5th row: 생산성 향상을 위한 어류용 사료 조성물 및 이의 제조방법
ValueCountFrequency (%)
21
 
7.7%
조성물 10
 
3.7%
이용한 9
 
3.3%
제조방법 9
 
3.3%
인삼 7
 
2.6%
기능성 6
 
2.2%
항비만 6
 
2.2%
함유하는 5
 
1.8%
제조된 5
 
1.8%
위한 5
 
1.8%
Other values (158) 189
69.5%

Most occurring characters

ValueCountFrequency (%)
235
 
21.6%
36
 
3.3%
32
 
2.9%
31
 
2.9%
24
 
2.2%
23
 
2.1%
22
 
2.0%
21
 
1.9%
21
 
1.9%
21
 
1.9%
Other values (202) 620
57.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 821
75.6%
Space Separator 235
 
21.6%
Decimal Number 14
 
1.3%
Uppercase Letter 7
 
0.6%
Other Punctuation 5
 
0.5%
Dash Punctuation 4
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
4.4%
32
 
3.9%
31
 
3.8%
24
 
2.9%
23
 
2.8%
22
 
2.7%
21
 
2.6%
21
 
2.6%
21
 
2.6%
19
 
2.3%
Other values (188) 571
69.5%
Uppercase Letter
ValueCountFrequency (%)
B 2
28.6%
P 1
14.3%
A 1
14.3%
F 1
14.3%
S 1
14.3%
N 1
14.3%
Decimal Number
ValueCountFrequency (%)
1 4
28.6%
2 3
21.4%
3 3
21.4%
0 2
14.3%
4 2
14.3%
Space Separator
ValueCountFrequency (%)
235
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 820
75.5%
Common 258
 
23.8%
Latin 7
 
0.6%
Han 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
4.4%
32
 
3.9%
31
 
3.8%
24
 
2.9%
23
 
2.8%
22
 
2.7%
21
 
2.6%
21
 
2.6%
21
 
2.6%
19
 
2.3%
Other values (187) 570
69.5%
Common
ValueCountFrequency (%)
235
91.1%
, 5
 
1.9%
- 4
 
1.6%
1 4
 
1.6%
2 3
 
1.2%
3 3
 
1.2%
0 2
 
0.8%
4 2
 
0.8%
Latin
ValueCountFrequency (%)
B 2
28.6%
P 1
14.3%
A 1
14.3%
F 1
14.3%
S 1
14.3%
N 1
14.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 820
75.5%
ASCII 265
 
24.4%
CJK 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
235
88.7%
, 5
 
1.9%
- 4
 
1.5%
1 4
 
1.5%
2 3
 
1.1%
3 3
 
1.1%
0 2
 
0.8%
B 2
 
0.8%
4 2
 
0.8%
P 1
 
0.4%
Other values (4) 4
 
1.5%
Hangul
ValueCountFrequency (%)
36
 
4.4%
32
 
3.9%
31
 
3.8%
24
 
2.9%
23
 
2.8%
22
 
2.7%
21
 
2.6%
21
 
2.6%
21
 
2.6%
19
 
2.3%
Other values (187) 570
69.5%
CJK
ValueCountFrequency (%)
1
100.0%

분류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 5.4%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B
특허: 34
디자인: 3

Length

Max length: 3
Median length: 2
Mean length: 2.0810811
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 특허
2nd row: 특허
3rd row: 특허
4th row: 특허
5th row: 특허

Common Values

Value | Count | Frequency (%)
특허 | 34 | 91.9%
디자인 | 3 | 8.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
특허 | 34 | 91.9%
디자인 | 3 | 8.1%

등록일
DateTime

Distinct: 34
Distinct (%): 91.9%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B
Minimum: 2009-04-22 00:00:00
Maximum: 2023-01-27 00:00:00
Histogram with fixed size bins (bins=34)

특허번호
Text

UNIQUE 

Distinct: 37
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B

Length

Max length: 16
Median length: 11
Mean length: 11.135135
Min length: 11

Characters and Unicode

Total characters: 412
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 37
Unique (%): 100.0%

Sample

1st row: 제10-0895586
2nd row: 제10-0976760
3rd row: 제10-1061690
4th row: 제10-1099966
5th row: 제10-1127311
Value | Count | Frequency (%)
제10-0895586 | 1 | 2.7%
제10-1558530 | 1 | 2.7%
제10-1632842 | 1 | 2.7%
제30-0870852 | 1 | 2.7%
제30-0909794 | 1 | 2.7%
제10-1761877 | 1 | 2.7%
제10-1765304 | 1 | 2.7%
제10-1999561 | 1 | 2.7%
제10-1976572 | 1 | 2.7%
제10-1998034 | 1 | 2.7%
Other values (27) | 27 | 73.0%
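
The 특허번호 values follow a regular shape: the prefix 제, a two-digit code, an optional four-digit group, and a seven-digit serial. The sketch below parses that shape with a regular expression. The mapping from code to 분류 (10 to 특허, 30 to 디자인) is an assumption inferred from the sample rows in this report, and the names `parse_registration_number`, `KIND_BY_CODE`, and `middle` are hypothetical.

```python
import re

# 제 + two-digit code + optional four-digit group + seven-digit serial
PATTERN = re.compile(r"^제(\d{2})-(?:(\d{4})-)?(\d{7})$")

# Assumption inferred from the sample rows: 10 = 특허 (patent), 30 = 디자인 (design)
KIND_BY_CODE = {"10": "특허", "30": "디자인"}


def parse_registration_number(number):
    """Split a registration number into its parts, or return None if the shape differs."""
    match = PATTERN.match(number)
    if match is None:
        return None
    code, middle, serial = match.groups()
    return {
        "code": code,
        "middle": middle,  # present only in the 16-character form
        "serial": serial,
        "kind": KIND_BY_CODE.get(code),
    }


for raw in ("제10-0895586", "제30-0870852", "제30-2021-0056464"):
    print(raw, parse_registration_number(raw))
```

The two shapes also account for the length statistics above: 36 values of 11 characters plus one of 16 characters give 412 characters in total.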

Most occurring characters

ValueCountFrequency (%)
1 80
19.4%
0 59
14.3%
- 38
9.2%
37
9.0%
2 36
8.7%
6 29
 
7.0%
4 28
 
6.8%
9 27
 
6.6%
5 22
 
5.3%
7 20
 
4.9%
Other values (2) 36
8.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 337
81.8%
Dash Punctuation 38
 
9.2%
Other Letter 37
 
9.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 80
23.7%
0 59
17.5%
2 36
10.7%
6 29
 
8.6%
4 28
 
8.3%
9 27
 
8.0%
5 22
 
6.5%
7 20
 
5.9%
3 19
 
5.6%
8 17
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%
Other Letter
ValueCountFrequency (%)
37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 375
91.0%
Hangul 37
 
9.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 80
21.3%
0 59
15.7%
- 38
10.1%
2 36
9.6%
6 29
 
7.7%
4 28
 
7.5%
9 27
 
7.2%
5 22
 
5.9%
7 20
 
5.3%
3 19
 
5.1%
Hangul
ValueCountFrequency (%)
37
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 375
91.0%
Hangul 37
 
9.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 80
21.3%
0 59
15.7%
- 38
10.1%
2 36
9.6%
6 29
 
7.7%
4 28
 
7.5%
9 27
 
7.2%
5 22
 
5.9%
7 20
 
5.3%
3 19
 
5.1%
Hangul
ValueCountFrequency (%)
37
100.0%

담당기관
Categorical

HIGH CORRELATION 

Distinct: 15
Distinct (%): 40.5%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B
Most frequent categories: 농림축산국 식량원예과, 산림자원연구소, 농업기술원, 축산기술연구소, 수산자원연구소
Other values (10): 11

Length

Max length: 13
Median length: 11
Mean length: 7.8918919
Min length: 5

Unique

Unique: 9
Unique (%): 24.3%

Sample

1st row: 산림자원연구소
2nd row: 농업기술원
3rd row: 축산기술연구소
4th row: 종합건설사업소
5th row: 수산자원연구소

Common Values

Value | Count | Frequency (%)
농림축산국 식량원예과 | 8 | 21.6%
산림자원연구소 | 6 | 16.2%
농업기술원 | 4 | 10.8%
축산기술연구소 | 4 | 10.8%
수산자원연구소 | 4 | 10.8%
홍성소방서 | 2 | 5.4%
종합건설사업소 | 1 | 2.7%
농업기술원구기자연구소 | 1 | 2.7%
문화체육관광국 문화유산과 | 1 | 2.7%
보건정책과 | 1 | 2.7%
Other values (5) | 5 | 13.5%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
농림축산국 | 8 | 16.7%
식량원예과 | 8 | 16.7%
산림자원연구소 | 6 | 12.5%
농업기술원 | 4 | 8.3%
축산기술연구소 | 4 | 8.3%
수산자원연구소 | 4 | 8.3%
소방본부 | 2 | 4.2%
홍성소방서 | 2 | 4.2%
천안서북소방서 | 1 | 2.1%
계룡소방서 | 1 | 2.1%
Other values (8) | 8 | 16.7%

Interactions


Correlations

Variable | 연번 | 명칭 | 분류 | 등록일 | 특허번호 | 담당기관
연번 | 1.000 | 1.000 | 0.521 | 0.915 | 1.000 | 0.473
명칭 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
분류 | 0.521 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
등록일 | 0.915 | 1.000 | 1.000 | 1.000 | 1.000 | 0.976
특허번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
담당기관 | 0.473 | 1.000 | 1.000 | 0.976 | 1.000 | 1.000

Variable | 담당기관 | 분류
담당기관 | 1.000 | 0.793
분류 | 0.793 | 1.000

Variable | 연번 | 분류 | 담당기관
연번 | 1.000 | 0.344 | 0.138
분류 | 0.344 | 1.000 | 0.793
담당기관 | 0.138 | 0.793 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 연번 | 명칭 | 분류 | 등록일 | 특허번호 | 담당기관
0 | 1 | 고순도의 4-엔-부틸레조시놀 결정을 제조하는 방법 | 특허 | 2009-04-22 | 제10-0895586 | 산림자원연구소
1 | 2 | NSB-1 균주, 이를 함유하는 탄저병 방제용 조성물 및 탄저병 방제방법 | 특허 | 2010-08-12 | 제10-0976760 | 농업기술원
2 | 3 | 액비제조 균주 공급이 가능한 탈취시스템 및 이를 구비한 액비화 장치 | 특허 | 2011-08-26 | 제10-1061690 | 축산기술연구소
3 | 4 | 도로안전시설물 가드레일 부착 차단막 | 특허 | 2011-12-21 | 제10-1099966 | 종합건설사업소
4 | 5 | 생산성 향상을 위한 어류용 사료 조성물 및 이의 제조방법 | 특허 | 2012-03-08 | 제10-1127311 | 수산자원연구소
5 | 6 | 먹넌출 추출물을 함유하는 항산화 또는 미백용 조성물 | 특허 | 2012-04-09 | 제10-1136822 | 산림자원연구소
6 | 7 | 한우의 지방산 조성을 확인하기 위한 FABP3 유전자의 마커 및 이를 이용한 한우의 선별방법 | 특허 | 2012-06-21 | 제10-1160794 | 축산기술연구소
7 | 8 | 양액식물 재배시스템 | 특허 | 2012-10-05 | 제10-1190139 | 농업기술원
8 | 9 | 황복 수정란 부화방법 | 특허 | 2013-05-02 | 제10-1262626 | 수산자원연구소
9 | 10 | 환경조절을 통한 황복의 性성숙유도 방법 | 특허 | 2013-06-26 | 제10-1281396 | 수산자원연구소

Last rows
Index | 연번 | 명칭 | 분류 | 등록일 | 특허번호 | 담당기관
27 | 28 | 비수리 추출물을 포함하는 천연 방부제 조성물 | 특허 | 2019-05-02 | 제10-1976572 | 산림자원연구소
28 | 29 | 선충분리장치 | 특허 | 2019-07-02 | 제10-1998034 | 산림자원연구소
29 | 30 | 분뇨 부숙 촉진용 조성물 및 이를 활용한 액상비료의 제조법 | 특허 | 2020-11-11 | 제10-2179646 | 축산기술연구소
30 | 31 | 책상형 안전사다리 | 특허 | 2021-03-08 | 제10-2227143 | 홍성소방서
31 | 32 | 유해물질 노출 차단장치 | 특허 | 2021-04-30 | 제10-2249012 | 소방본부 119특수구조단
32 | 33 | 소방 호스 배낭 | 특허 | 2022-02-10 | 제10-2362925 | 소방본부 예방안전과
33 | 34 | 소방호스 운반기구 | 디자인 | 2022-08-11 | 제30-2021-0056464 | 계룡소방서
34 | 35 | 가스팽창식 인명구조 안전매트 | 특허 | 2022-08-19 | 제10-2436059 | 홍성소방서
35 | 36 | 소화전 배수확인 점검기구 | 특허 | 2022-12-07 | 제10-2476803 | 천안서북소방서
36 | 37 | 액젓 폐기물을 이용한 해조류 양식 황백화 및 패류 양식 영양결핍 개선용 조성물 | 특허 | 2023-01-27 | 제10-2494254 | 수산자원과