Overview

Dataset statistics

Number of variables: 5
Number of observations: 34
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.5 KiB
Average record size in memory: 44.9 B

Variable types

Text: 3
Categorical: 1
Numeric: 1

Dataset

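Description: To strengthen the technological competitiveness of small and medium-sized enterprises, Korea Gas Corporation (한국가스공사) runs a technology-sharing program under which the intellectual property rights it holds (patents, utility models, programs) can be transferred free of charge. This dataset lists the items offered for technology sharing and includes the technology field (기술분야), intellectual property title (지식재산권명), registration number (등록번호), and registration year (등록년도).
Author: 공공데이터포털 (Public Data Portal)
URL: https://www.data.go.kr/data/15102686/fileData.do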

Alerts

등록구분 (registration type) is highly imbalanced (67.7%) [Imbalance]
지식재산권명 (intellectual property title) has unique values [Unique]
등록번호 (registration number) has unique values [Unique]
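
The 67.7% imbalance figure is consistent with one minus the normalized Shannon entropy of the class shares (32 vs. 2). The sketch below reproduces the number under that assumption; `imbalance_score` is a hypothetical helper name, and the report itself does not state the formula it uses.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the base-2 Shannon entropy
    of the class shares and k is the number of classes.
    Assumption: this mirrors how the alert's percentage is derived."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Class counts reported for 등록구분: 특허 = 32, 실용신안 = 2
score = imbalance_score([32, 2])
print(f"{score:.1%}")  # 67.7%
```

A perfectly balanced column scores 0 and a single-class column approaches 1, so a 32-to-2 split scoring about 0.68 is what triggers the alert.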

Reproduction

Analysis started: 2024-04-17 18:14:34.313787
Analysis finished: 2024-04-17 18:14:34.723132
Duration: 0.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

기술분야 (technology field)
Text

Distinct: 25
Distinct (%): 73.5%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 9
Median length: 8
Mean length: 4.6176471
Min length: 3

Characters and Unicode

Total characters: 157
Distinct characters: 70
Distinct categories: 3
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
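
As a minimal sketch of what those character properties look like in practice, Python's standard `unicodedata` module exposes the general category of each code point (`Lo` = Other Letter, `Lu` = Uppercase Letter, `Zs` = Space Separator). The helper name `category_counts` is hypothetical, and the two inputs are values taken from this column's sample rows. Script and block lookups are not available in the standard library, so they are left out here.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# Two 기술분야 values that appear in the report's sample rows
for value in ["저장탱크 공급배관", "PHC파일"]:
    print(value, dict(category_counts(value)))
```

Hangul syllables fall under Other Letter, which is why that category dominates the tables for the Korean text columns.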

Unique

Unique: 17
Unique (%): 50.0%
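
The difference between "Distinct" and "Unique" is easy to miss. The sketch below assumes that "Distinct" counts different values and "Unique" counts values that occur exactly once; `distinct_and_unique` is a hypothetical helper, and the input is the 기술분야 column of the ten last rows shown in the report's sample.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

# 기술분야 values from the last ten sample rows of the report
tail = ["내진시험", "비파괴검사", "천연가스차량", "천연가스차량",
        "천연가스 차량", "천연가스 차량", "설비관리", "전송기함",
        "도어락", "PHC파일"]

print(distinct_and_unique(tail))  # (8, 6)

# Stripping spaces merges the two spellings of the same field
normalized = [v.replace(" ", "") for v in tail]
print(distinct_and_unique(normalized))  # (7, 6)
```

The second call shows that 천연가스차량 and 천연가스 차량 differ only by a space, so distinct counts for this column may be slightly inflated by inconsistent spacing.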

Sample

1st row: 가스배관방향표시
2nd row: 소음기
3rd row: 저장탱크 공급배관
4th row: 볼밸브
5th row: 가스히터

Value | Count | Frequency (%)
기화기 | 3 | 7.7%
가스히터 | 2 | 5.1%
정압기 | 2 | 5.1%
천연가스 | 2 | 5.1%
차량 | 2 | 5.1%
가스버너 | 2 | 5.1%
천연가스차량 | 2 | 5.1%
이설공사 | 2 | 5.1%
볼밸브 | 2 | 5.1%
내진시험 | 1 | 2.6%
Other values (19) | 19 | 48.7%

Most occurring characters

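Value | Count | Frequency (%)
(value lost) | 12 | 7.6%
(value lost) | 10 | 6.4%
(value lost) | 9 | 5.7%
(space) | 5 | 3.2%
(value lost) | 4 | 2.5%
(value lost) | 4 | 2.5%
(value lost) | 4 | 2.5%
(value lost) | 4 | 2.5%
(value lost) | 4 | 2.5%
(value lost) | 4 | 2.5%
Other values (60) | 97 | 61.8%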

Most occurring categories

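Value | Count | Frequency (%)
Other Letter | 149 | 94.9%
Space Separator | 5 | 3.2%
Uppercase Letter | 3 | 1.9%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(value lost) | 12 | 8.1%
(value lost) | 10 | 6.7%
(value lost) | 9 | 6.0%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
Other values (56) | 90 | 60.4%

Uppercase Letter
Value | Count | Frequency (%)
H | 1 | 33.3%
P | 1 | 33.3%
C | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 5 | 100.0%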

Most occurring scripts

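Value | Count | Frequency (%)
Hangul | 149 | 94.9%
Common | 5 | 3.2%
Latin | 3 | 1.9%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(value lost) | 12 | 8.1%
(value lost) | 10 | 6.7%
(value lost) | 9 | 6.0%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
Other values (56) | 90 | 60.4%

Latin
Value | Count | Frequency (%)
H | 1 | 33.3%
P | 1 | 33.3%
C | 1 | 33.3%

Common
Value | Count | Frequency (%)
(space) | 5 | 100.0%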

Most occurring blocks

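Value | Count | Frequency (%)
Hangul | 149 | 94.9%
ASCII | 8 | 5.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(value lost) | 12 | 8.1%
(value lost) | 10 | 6.7%
(value lost) | 9 | 6.0%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
(value lost) | 4 | 2.7%
Other values (56) | 90 | 60.4%

ASCII
Value | Count | Frequency (%)
(space) | 5 | 62.5%
H | 1 | 12.5%
P | 1 | 12.5%
C | 1 | 12.5%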

지식재산권명 (intellectual property title)
Text

UNIQUE

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 54
Median length: 31
Mean length: 21.617647
Min length: 4

Characters and Unicode

Total characters: 735
Distinct characters: 179
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: 가스 배관라인 표시못장치(Indication Nail Apparatus for Gas Pipe)
2nd row: 방산탑 소음기 및 방산탑 소음기 제작방법
3rd row: 액화가스 저장탱크의 액화가스 공급용 배관
4th row: 천연가스 배관용 볼밸브의 누설 시험 장치 및 이를 이용한 천연가스 배관용 볼밸브의 누설 시험 방법
5th row: 가스히터 튜브번들 건전성 검사방법

Value | Count | Frequency (%)
장치 | 10 | 5.5%
(value lost) | 9 | 4.9%
이용한 | 6 | 3.3%
시험 | 4 | 2.2%
방법 | 3 | 1.6%
lng | 3 | 1.6%
천연가스 | 3 | 1.6%
액화가스 | 3 | 1.6%
배관의 | 3 | 1.6%
가스용 | 3 | 1.6%
Other values (116) | 136 | 74.3%

Most occurring characters

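Value | Count | Frequency (%)
(space) | 149 | 20.3%
(value lost) | 22 | 3.0%
(value lost) | 22 | 3.0%
(value lost) | 21 | 2.9%
(value lost) | 19 | 2.6%
(value lost) | 18 | 2.4%
(value lost) | 18 | 2.4%
(value lost) | 14 | 1.9%
(value lost) | 13 | 1.8%
(value lost) | 12 | 1.6%
Other values (169) | 427 | 58.1%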

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 517 | 70.3%
Space Separator | 149 | 20.3%
Lowercase Letter | 42 | 5.7%
Uppercase Letter | 23 | 3.1%
Open Punctuation | 2 | 0.3%
Close Punctuation | 2 | 0.3%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(value lost) | 22 | 4.3%
(value lost) | 22 | 4.3%
(value lost) | 21 | 4.1%
(value lost) | 19 | 3.7%
(value lost) | 18 | 3.5%
(value lost) | 18 | 3.5%
(value lost) | 14 | 2.7%
(value lost) | 13 | 2.5%
(value lost) | 12 | 2.3%
(value lost) | 12 | 2.3%
Other values (139) | 346 | 66.9%

Lowercase Letter
Value | Count | Frequency (%)
a | 7 | 16.7%
p | 5 | 11.9%
o | 5 | 11.9%
r | 4 | 9.5%
i | 4 | 9.5%
t | 3 | 7.1%
s | 3 | 7.1%
n | 2 | 4.8%
c | 2 | 4.8%
u | 2 | 4.8%
Other values (5) | 5 | 11.9%

Uppercase Letter
Value | Count | Frequency (%)
P | 3 | 13.0%
(value lost) | 3 | 13.0%
(value lost) | 3 | 13.0%
(value lost) | 3 | 13.0%
C | 2 | 8.7%
H | 2 | 8.7%
A | 2 | 8.7%
L | 1 | 4.3%
D | 1 | 4.3%
N | 1 | 4.3%
Other values (2) | 2 | 8.7%

Space Separator
Value | Count | Frequency (%)
(space) | 149 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 2 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 2 | 100.0%

Most occurring scripts

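Value | Count | Frequency (%)
Hangul | 517 | 70.3%
Common | 153 | 20.8%
Latin | 65 | 8.8%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(value lost) | 22 | 4.3%
(value lost) | 22 | 4.3%
(value lost) | 21 | 4.1%
(value lost) | 19 | 3.7%
(value lost) | 18 | 3.5%
(value lost) | 18 | 3.5%
(value lost) | 14 | 2.7%
(value lost) | 13 | 2.5%
(value lost) | 12 | 2.3%
(value lost) | 12 | 2.3%
Other values (139) | 346 | 66.9%

Latin
Value | Count | Frequency (%)
a | 7 | 10.8%
p | 5 | 7.7%
o | 5 | 7.7%
r | 4 | 6.2%
i | 4 | 6.2%
P | 3 | 4.6%
(value lost) | 3 | 4.6%
(value lost) | 3 | 4.6%
t | 3 | 4.6%
(value lost) | 3 | 4.6%
Other values (17) | 25 | 38.5%

Common
Value | Count | Frequency (%)
(space) | 149 | 97.4%
( | 2 | 1.3%
) | 2 | 1.3%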

Most occurring blocks

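Value | Count | Frequency (%)
Hangul | 517 | 70.3%
ASCII | 209 | 28.4%
None | 9 | 1.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 149 | 71.3%
a | 7 | 3.3%
p | 5 | 2.4%
o | 5 | 2.4%
r | 4 | 1.9%
i | 4 | 1.9%
P | 3 | 1.4%
t | 3 | 1.4%
s | 3 | 1.4%
C | 2 | 1.0%
Other values (17) | 24 | 11.5%

Hangul
Value | Count | Frequency (%)
(value lost) | 22 | 4.3%
(value lost) | 22 | 4.3%
(value lost) | 21 | 4.1%
(value lost) | 19 | 3.7%
(value lost) | 18 | 3.5%
(value lost) | 18 | 3.5%
(value lost) | 14 | 2.7%
(value lost) | 13 | 2.5%
(value lost) | 12 | 2.3%
(value lost) | 12 | 2.3%
Other values (139) | 346 | 66.9%

None
Value | Count | Frequency (%)
(value lost) | 3 | 33.3%
(value lost) | 3 | 33.3%
(value lost) | 3 | 33.3%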

등록구분 (registration type)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): 5.9%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B
특허 (patent): 32
실용신안 (utility model): 2

Length

Max length: 4
Median length: 2
Mean length: 2.1176471
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 실용신안
2nd row: 특허
3rd row: 특허
4th row: 특허
5th row: 특허

Common Values

Value | Count | Frequency (%)
특허 | 32 | 94.1%
실용신안 | 2 | 5.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values, 특허 (32, 94.1%) and 실용신안 (2, 5.9%)]

등록번호 (registration number)
Text

UNIQUE

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 340
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: 20-0481589
2nd row: 10-1752885
3rd row: 10-0964826
4th row: 10-1620262
5th row: 10-0869896

Value | Count | Frequency (%)
20-0481589 | 1 | 2.9%
10-1752885 | 1 | 2.9%
20-0479526 | 1 | 2.9%
10-1274883 | 1 | 2.9%
10-0840223 | 1 | 2.9%
10-1131507 | 1 | 2.9%
10-0923981 | 1 | 2.9%
10-0753264 | 1 | 2.9%
10-0708523 | 1 | 2.9%
10-0751974 | 1 | 2.9%
Other values (24) | 24 | 70.6%

Most occurring characters

Value | Count | Frequency (%)
1 | 68 | 20.0%
0 | 67 | 19.7%
- | 34 | 10.0%
7 | 28 | 8.2%
2 | 25 | 7.4%
6 | 24 | 7.1%
4 | 23 | 6.8%
8 | 23 | 6.8%
5 | 17 | 5.0%
9 | 17 | 5.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 306 | 90.0%
Dash Punctuation | 34 | 10.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 68 | 22.2%
0 | 67 | 21.9%
7 | 28 | 9.2%
2 | 25 | 8.2%
6 | 24 | 7.8%
4 | 23 | 7.5%
8 | 23 | 7.5%
5 | 17 | 5.6%
9 | 17 | 5.6%
3 | 14 | 4.6%

Dash Punctuation
Value | Count | Frequency (%)
- | 34 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 340 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
1 | 68 | 20.0%
0 | 67 | 19.7%
- | 34 | 10.0%
7 | 28 | 8.2%
2 | 25 | 7.4%
6 | 24 | 7.1%
4 | 23 | 6.8%
8 | 23 | 6.8%
5 | 17 | 5.0%
9 | 17 | 5.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 340 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 68 | 20.0%
0 | 67 | 19.7%
- | 34 | 10.0%
7 | 28 | 8.2%
2 | 25 | 7.4%
6 | 24 | 7.1%
4 | 23 | 6.8%
8 | 23 | 6.8%
5 | 17 | 5.0%
9 | 17 | 5.0%

등록년도 (registration year)
Real number (ℝ)

Distinct: 14
Distinct (%): 41.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2012.7941
Minimum: 2007
Maximum: 2020
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 438.0 B

Quantile statistics

Minimum: 2007
5-th percentile: 2007
Q1: 2010
Median: 2013
Q3: 2016
95-th percentile: 2018.35
Maximum: 2020
Range: 13
Interquartile range (IQR): 6
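
The quantile figures can be checked from the per-year counts listed in this variable's frequency tables. The sketch below assumes linear interpolation between order statistics, which is what `statistics.quantiles(..., method="inclusive")` does; the names `year_counts` and `years` are illustrative, not part of the report.

```python
import statistics

# Per-year counts of 등록년도, taken from the report's frequency tables
year_counts = {2007: 5, 2008: 1, 2009: 2, 2010: 2, 2011: 4, 2012: 1, 2013: 4,
               2014: 2, 2015: 2, 2016: 5, 2017: 3, 2018: 1, 2019: 1, 2020: 1}

# Expand the counts into the 34 individual observations, in sorted order
years = sorted(y for y, n in year_counts.items() for _ in range(n))

# Quartiles and 5th/95th percentiles with linear interpolation
q1, median, q3 = statistics.quantiles(years, n=4, method="inclusive")
twentieths = statistics.quantiles(years, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

print(q1, median, q3)         # 2010.0 2013.0 2016.0
print(p5, p95)                # 2007.0 2018.35
print(max(years) - min(years), q3 - q1)  # 13 6.0
```

The fractional 95th percentile (2018.35) arises because the position falls 35% of the way between the 32nd and 33rd sorted observations, 2018 and 2019.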

Descriptive statistics

Standard deviation: 3.8279626
Coefficient of variation (CV): 0.0019018153
Kurtosis: -1.0629934
Mean: 2012.7941
Median Absolute Deviation (MAD): 3
Skewness: -0.052827867
Sum: 68435
Variance: 14.653298
Monotonicity: Not monotonic
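
Several of these descriptive statistics can be reproduced from the per-year counts in the frequency tables. The sketch below assumes the report uses the sample variance (divisor n - 1), which matches the printed value; `year_counts` and `years` are illustrative names. Skewness and kurtosis are omitted because their bias corrections are not stated in the report.

```python
import statistics

# Per-year counts of 등록년도, taken from the report's frequency tables
year_counts = {2007: 5, 2008: 1, 2009: 2, 2010: 2, 2011: 4, 2012: 1, 2013: 4,
               2014: 2, 2015: 2, 2016: 5, 2017: 3, 2018: 1, 2019: 1, 2020: 1}
years = sorted(y for y, n in year_counts.items() for _ in range(n))

total = sum(years)                      # Sum
mean = statistics.fmean(years)          # Mean
variance = statistics.variance(years)   # sample variance, divisor n - 1
std = statistics.stdev(years)           # sample standard deviation
cv = std / mean                         # coefficient of variation
median = statistics.median(years)
mad = statistics.median([abs(y - median) for y in years])  # median absolute deviation

print(total, round(mean, 4), round(variance, 6))
print(round(std, 7), round(cv, 10), mad)
```

The tiny coefficient of variation is expected here: the standard deviation is a few years while the mean is a calendar year above 2000, so the ratio says little about the spread of registration dates.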
[Figure: Histogram with fixed size bins (bins=14)]

Common values
Value | Count | Frequency (%)
2016 | 5 | 14.7%
2007 | 5 | 14.7%
2013 | 4 | 11.8%
2011 | 4 | 11.8%
2017 | 3 | 8.8%
2014 | 2 | 5.9%
2015 | 2 | 5.9%
2010 | 2 | 5.9%
2009 | 2 | 5.9%
2008 | 1 | 2.9%
Other values (4) | 4 | 11.8%

Minimum 10 values
Value | Count | Frequency (%)
2007 | 5 | 14.7%
2008 | 1 | 2.9%
2009 | 2 | 5.9%
2010 | 2 | 5.9%
2011 | 4 | 11.8%
2012 | 1 | 2.9%
2013 | 4 | 11.8%
2014 | 2 | 5.9%
2015 | 2 | 5.9%
2016 | 5 | 14.7%

Maximum 10 values
Value | Count | Frequency (%)
2020 | 1 | 2.9%
2019 | 1 | 2.9%
2018 | 1 | 2.9%
2017 | 3 | 8.8%
2016 | 5 | 14.7%
2015 | 2 | 5.9%
2014 | 2 | 5.9%
2013 | 4 | 11.8%
2012 | 1 | 2.9%
2011 | 4 | 11.8%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

Variable | 기술분야 | 지식재산권명 | 등록구분 | 등록번호 | 등록년도
기술분야 | 1.000 | 1.000 | 1.000 | 1.000 | 0.834
지식재산권명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
등록구분 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000
등록번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
등록년도 | 0.834 | 1.000 | 0.000 | 1.000 | 1.000

[Figure: correlation heatmap]

Variable | 등록년도 | 등록구분
등록년도 | 1.000 | 0.081
등록구분 | 0.081 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 기술분야 | 지식재산권명 | 등록구분 | 등록번호 | 등록년도
0 | 가스배관방향표시 | 가스 배관라인 표시못장치(Indication Nail Apparatus for Gas Pipe) | 실용신안 | 20-0481589 | 2014
1 | 소음기 | 방산탑 소음기 및 방산탑 소음기 제작방법 | 특허 | 10-1752885 | 2015
2 | 저장탱크 공급배관 | 액화가스 저장탱크의 액화가스 공급용 배관 | 특허 | 10-0964826 | 2010
3 | 볼밸브 | 천연가스 배관용 볼밸브의 누설 시험 장치 및 이를 이용한 천연가스 배관용 볼밸브의 누설 시험 방법 | 특허 | 10-1620262 | 2016
4 | 가스히터 | 가스히터 튜브번들 건전성 검사방법 | 특허 | 10-0869896 | 2008
5 | 정압기 | 용량 가변형 가스용 정압기 | 특허 | 10-1721778 | 2017
6 | 전동기 정비지그 | 전동기 회전자용 정비지그 | 특허 | 10-1304012 | 2013
7 | 안전보호구 | 도장작업용 보안면 | 특허 | 10-2064687 | 2020
8 | 토목공사 설비 | 포스트 텐션 작업용 곤돌라 | 특허 | 10-1541044 | 2015
9 | 기화기 | 중간매체식 기화장치 | 특허 | 10-0751974 | 2007

Last rows

Index | 기술분야 | 지식재산권명 | 등록구분 | 등록번호 | 등록년도
24 | 내진시험 | 내진 성능 시험 장치 | 특허 | 10-1691722 | 2016
25 | 비파괴검사 | 배관의 비파괴 검사용 차폐 대차 | 특허 | 10-1957383 | 2018
26 | 천연가스차량 | LNG 자중을 이용한 무동력 LNG 충전 시스템 및 충전방법 | 특허 | 10-0708523 | 2007
27 | 천연가스차량 | 압력차이를 이용한 무동력 LNG 충전 시스템 및 충전방법 | 특허 | 10-0753264 | 2007
28 | 천연가스 차량 | 액화가스 저장용기의 포트어셈블리 용접구조 | 특허 | 10-0923981 | 2009
29 | 천연가스 차량 | 액화 및 압축가스 충전장치 및 상기 충전장치에서의 가스흐름 제어방법 | 특허 | 10-1131507 | 2012
30 | 설비관리 | 유틸리티파이프 리프팅 장치 및 그 장치를 이용한 도장방법 | 특허 | 10-0840223 | 2007
31 | 전송기함 | 전송기함 | 특허 | 10-1274883 | 2013
32 | 도어락 | 도어락 장치(Door Lock Apparatus) | 실용신안 | 20-0479526 | 2016
33 | PHC파일 | PHC 파일 인발 장치 및 이를 이용한 PHC 파일 인발 방법 | 특허 | 10-1787291 | 2017
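
In the sample rows, every 등록번호 has the shape of two digits, a hyphen, and seven digits, and the two-digit prefix appears to track 등록구분 (10 for 특허, 20 for 실용신안). The sketch below checks that pattern on a few sample rows. The prefix mapping is an assumption inferred from these rows rather than something the report states, and `matches_type` is a hypothetical helper.

```python
import re

# Two digits, a hyphen, seven digits (10 characters, as the length stats show)
REG_NO = re.compile(r"^(\d{2})-(\d{7})$")

# Assumption inferred from the sample rows, not stated in the report
PREFIX_TO_TYPE = {"10": "특허", "20": "실용신안"}

def matches_type(reg_type, reg_no):
    """Return True if reg_no is well formed and its prefix agrees with reg_type."""
    m = REG_NO.match(reg_no)
    if m is None:
        return False
    return PREFIX_TO_TYPE.get(m.group(1)) == reg_type

# (등록구분, 등록번호) pairs copied from the sample rows
rows = [("실용신안", "20-0481589"), ("특허", "10-1752885"),
        ("특허", "10-0964826"), ("실용신안", "20-0479526"),
        ("특허", "10-1787291")]

results = [matches_type(t, n) for t, n in rows]
print(all(results))  # True
```

If the mapping holds for the full dataset, 등록구분 is fully determined by 등록번호, which would fit the 1.000 correlation reported between the two columns.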