Overview

Dataset statistics

Number of variables: 6
Number of observations: 439
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 4
Duplicate rows (%): 0.9%
Total size in memory: 21.1 KiB
Average record size in memory: 49.3 B

Variable types

Categorical: 2
Numeric: 1
DateTime: 1
Text: 1
Boolean: 1

Dataset

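Description: Patent application status data for Korea Gas Technology Corporation ((주)한국가스기술공사), covering its registered intellectual property rights and including the applicant, application/registration number, and registration date.
URL: https://www.data.go.kr/data/15103285/fileData.do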

Alerts

출원인 (applicant) has constant value "㈜한국가스기술공사" [Constant]
Dataset has 4 (0.9%) duplicate rows [Duplicates]
등록번호 (registration number) is highly overall correlated with 권리 (right type) [High correlation]
권리 is highly overall correlated with 등록번호 [High correlation]
공동 (joint) is highly imbalanced (91.0%) [Imbalance]
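
The 91.0% imbalance figure for 공동 can be reproduced from its two class counts (434 False, 5 True, as listed in that variable's section). The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum, log2 of the number of classes. That formula is an assumption about how the profiler defines imbalance, and `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max for a list of class counts (assumed definition)."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # a single class leaves nothing to compare
    # Shannon entropy in bits, skipping empty classes
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)

# Class counts for 공동 as reported: 434 False, 5 True
score = imbalance_score([434, 5])
print(f"{score:.1%}")  # prints 91.0%
```

A perfectly balanced column scores 0 under this definition, and the score approaches 1 as one class dominates.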

Reproduction

Analysis started: 2023-12-12 18:56:04.654079
Analysis finished: 2023-12-12 18:56:05.320438
Duration: 0.67 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
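
The reported duration is the difference between the two timestamps above. A minimal check using only the standard library (the format string is an assumption matching how the timestamps are printed here):

```python
from datetime import datetime

# Timestamp layout as printed in the report (assumed format string)
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 18:56:04.654079", FMT)
finished = datetime.strptime("2023-12-12 18:56:05.320438", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # prints 0.67 seconds
```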

Variables

권리
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.6 KiB

실용신안 (utility model): 315
특허 (patent): 101
상표 (trademark): 15
디자인 (design): 8

Length

Max length: 4
Median length: 4
Mean length: 3.453303
Min length: 2
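
These length statistics follow directly from the category counts listed for 권리. The sketch below rebuilds them from those counts; it assumes length is measured in Unicode code points (Python's `len` on a composed Hangul string), which is what reproduces the reported mean.

```python
from statistics import median

# Category counts for 권리 as reported
counts = {"실용신안": 315, "특허": 101, "상표": 15, "디자인": 8}

# One length per observation, e.g. 315 copies of len("실용신안") == 4
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

mean_length = sum(lengths) / len(lengths)
print(round(mean_length, 6))                       # 3.453303
print(max(lengths), median(lengths), min(lengths))  # 4 4 2
```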

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 특허
2nd row: 특허
3rd row: 특허
4th row: 특허
5th row: 특허

Common Values

Value | Count | Frequency (%)
실용신안 | 315 | 71.8%
특허 | 101 | 23.0%
상표 | 15 | 3.4%
디자인 | 8 | 1.8%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

등록번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 165
Distinct (%): 37.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.8664265 × 10^12
Minimum: 1.0054 × 10^12
Maximum: 4.10395 × 10^12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 1.0054 × 10^12
5-th percentile: 1.008717 × 10^12
Q1: 2.00363 × 10^12
Median: 2.00486 × 10^12
Q3: 2.00495 × 10^12
95-th percentile: 3.010722 × 10^12
Maximum: 4.10395 × 10^12
Range: 3.09855 × 10^12
Interquartile range (IQR): 1.32 × 10^9

Descriptive statistics

Standard deviation: 6.0935386 × 10^11
Coefficient of variation (CV): 0.32648156
Kurtosis: 4.0936755
Mean: 1.8664265 × 10^12
Median Absolute Deviation (MAD): 1.1 × 10^8
Skewness: 1.1395023
Sum: 8.1936125 × 10^14
Variance: 3.7131212 × 10^23
Monotonicity: Not monotonic
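
Several of the figures above are derived from others: range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, and sum = mean × number of observations. As a consistency check, the sketch below recomputes them from the rounded values printed in the report. The tolerance is an assumption chosen to absorb that rounding.

```python
import math

# Values as printed in the report (already rounded)
n = 439
mean = 1.8664265e12
std = 6.0935386e11
minimum, maximum = 1.0054e12, 4.10395e12
q1, q3 = 2.00363e12, 2.00495e12

derived = {
    "range": maximum - minimum,
    "iqr": q3 - q1,
    "cv": std / mean,
    "variance": std ** 2,
    "sum": mean * n,
}

reported = {
    "range": 3.09855e12,
    "iqr": 1.32e9,
    "cv": 0.32648156,
    "variance": 3.7131212e23,
    "sum": 8.1936125e14,
}

# Compare each derived figure with the reported one, allowing for rounding
checks = {
    name: math.isclose(derived[name], reported[name], rel_tol=1e-6)
    for name in derived
}
print(checks)
```

All five checks agree within the tolerance, so the printed statistics are internally consistent.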
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
2004960000000 | 54 | 12.3%
2004950000000 | 33 | 7.5%
2004940000000 | 24 | 5.5%
2004970000000 | 23 | 5.2%
2004930000000 | 16 | 3.6%
2004900000000 | 15 | 3.4%
2004870000000 | 12 | 2.7%
2004840000000 | 9 | 2.1%
2004830000000 | 7 | 1.6%
4103950000000 | 7 | 1.6%
Other values (155) | 239 | 54.4%

Minimum 10 values

Value | Count | Frequency (%)
1005400000000 | 1 | 0.2%
1005900000000 | 1 | 0.2%
1006060000000 | 1 | 0.2%
1006530000000 | 1 | 0.2%
1006540000000 | 2 | 0.5%
1006610000000 | 1 | 0.2%
1006940000000 | 1 | 0.2%
1007160000000 | 1 | 0.2%
1007200000000 | 1 | 0.2%
1007380000000 | 1 | 0.2%

Maximum 10 values

Value | Count | Frequency (%)
4103950000000 | 7 | 1.6%
4101310000000 | 3 | 0.7%
4018870000000 | 3 | 0.7%
4013300000000 | 2 | 0.5%
3011560000000 | 1 | 0.2%
3011480000000 | 4 | 0.9%
3011100000000 | 1 | 0.2%
3011010000000 | 1 | 0.2%
3010690000000 | 1 | 0.2%
2004970000000 | 23 | 5.2%

등록일자
DateTime

Distinct: 276
Distinct (%): 62.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.6 KiB
Minimum: 2001-09-04 00:00:00
Maximum: 2023-07-17 00:00:00

[Figure: Histogram with fixed size bins (bins=50)]

명칭
Text

Distinct: 416
Distinct (%): 94.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.6 KiB

Length

Max length: 63
Median length: 40
Mean length: 15.487472
Min length: 2

Characters and Unicode

Total characters: 6799
Distinct characters: 411
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
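
As an illustration of such character properties, the sketch below tallies Unicode general categories for one value that appears in this dataset, using Python's standard `unicodedata` module. This is an assumption about tooling; the report does not say which library the profiler uses, and `category_counts` is a hypothetical helper. The two-letter codes map to the category names used in this report: `Lo` is Other Letter, `Zs` is Space Separator, `Nd` is Decimal Number, `Ps` is Open Punctuation, and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# A 명칭 value taken from the sample rows of this report
sample = "제 [37] 류"
counts = category_counts(sample)
print(dict(counts))
# {'Lo': 2, 'Zs': 2, 'Ps': 1, 'Nd': 2, 'Pe': 1}
```

The standard library exposes general categories but not scripts or blocks, so the script and block tables in this section would need a different data source.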

Unique

Unique: 406
Unique (%): 92.5%

Sample

1st row: 지하시설물의 촬영장치 및 그 촬영방법
2nd row: 미압 안전밸브의 성능시험장비
3rd row: 밸브조작기용 인디케이터
4th row: 계측 데이터 무선 송수신 방법 및 장치
5th row: 조인트부의 가스누설 점검장치

Value | Count | Frequency (%)
장치 | 58 | 3.3%
[missing] | 38 | 2.2%
지그 | 29 | 1.7%
분해 | 18 | 1.0%
가스 | 18 | 1.0%
[missing] | 15 | 0.9%
[missing] | 15 | 0.9%
37 | 15 | 0.9%
방법 | 15 | 0.9%
조립용 | 14 | 0.8%
Other values (920) | 1500 | 86.5%

Most occurring characters

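Value | Count | Frequency (%)
(space) | 1297 | 19.1%
[missing] | 253 | 3.7%
[missing] | 244 | 3.6%
[missing] | 196 | 2.9%
[missing] | 152 | 2.2%
[missing] | 127 | 1.9%
[missing] | 126 | 1.9%
[missing] | 122 | 1.8%
[missing] | 112 | 1.6%
[missing] | 105 | 1.5%
Other values (401) | 4065 | 59.8%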

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5298 | 77.9%
Space Separator | 1297 | 19.1%
Uppercase Letter | 98 | 1.4%
Lowercase Letter | 34 | 0.5%
Decimal Number | 31 | 0.5%
Close Punctuation | 17 | 0.3%
Open Punctuation | 17 | 0.3%
Other Punctuation | 5 | 0.1%
Dash Punctuation | 2 | < 0.1%

Most frequent character per category

Other Letter

Value | Count | Frequency (%)
[missing] | 253 | 4.8%
[missing] | 244 | 4.6%
[missing] | 196 | 3.7%
[missing] | 152 | 2.9%
[missing] | 127 | 2.4%
[missing] | 126 | 2.4%
[missing] | 122 | 2.3%
[missing] | 112 | 2.1%
[missing] | 105 | 2.0%
[missing] | 92 | 1.7%
Other values (353) | 3769 | 71.1%

Uppercase Letter

Value | Count | Frequency (%)
G | 14 | 14.3%
N | 13 | 13.3%
L | 13 | 13.3%
S | 11 | 11.2%
C | 6 | 6.1%
T | 6 | 6.1%
V | 5 | 5.1%
P | 5 | 5.1%
O | 4 | 4.1%
I | 4 | 4.1%
Other values (13) | 17 | 17.3%

Lowercase Letter

Value | Count | Frequency (%)
o | 6 | 17.6%
l | 5 | 14.7%
e | 4 | 11.8%
t | 3 | 8.8%
i | 2 | 5.9%
n | 2 | 5.9%
s | 2 | 5.9%
r | 2 | 5.9%
v | 2 | 5.9%
a | 2 | 5.9%
Other values (4) | 4 | 11.8%

Decimal Number

Value | Count | Frequency (%)
3 | 15 | 48.4%
7 | 15 | 48.4%
2 | 1 | 3.2%

Close Punctuation

Value | Count | Frequency (%)
] | 15 | 88.2%
) | 2 | 11.8%

Open Punctuation

Value | Count | Frequency (%)
[ | 15 | 88.2%
( | 2 | 11.8%

Other Punctuation

Value | Count | Frequency (%)
/ | 3 | 60.0%
, | 2 | 40.0%

Space Separator

Value | Count | Frequency (%)
(space) | 1297 | 100.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5298 | 77.9%
Common | 1369 | 20.1%
Latin | 132 | 1.9%

Most frequent character per script

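Hangul

Value | Count | Frequency (%)
[missing] | 253 | 4.8%
[missing] | 244 | 4.6%
[missing] | 196 | 3.7%
[missing] | 152 | 2.9%
[missing] | 127 | 2.4%
[missing] | 126 | 2.4%
[missing] | 122 | 2.3%
[missing] | 112 | 2.1%
[missing] | 105 | 2.0%
[missing] | 92 | 1.7%
Other values (353) | 3769 | 71.1%

Latin

Value | Count | Frequency (%)
G | 14 | 10.6%
N | 13 | 9.8%
L | 13 | 9.8%
S | 11 | 8.3%
C | 6 | 4.5%
o | 6 | 4.5%
T | 6 | 4.5%
l | 5 | 3.8%
V | 5 | 3.8%
P | 5 | 3.8%
Other values (27) | 48 | 36.4%

Common

Value | Count | Frequency (%)
(space) | 1297 | 94.7%
] | 15 | 1.1%
[ | 15 | 1.1%
3 | 15 | 1.1%
7 | 15 | 1.1%
/ | 3 | 0.2%
( | 2 | 0.1%
- | 2 | 0.1%
) | 2 | 0.1%
, | 2 | 0.1%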

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5298 | 77.9%
ASCII | 1495 | 22.0%
None | 6 | 0.1%

Most frequent character per block

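ASCII

Value | Count | Frequency (%)
(space) | 1297 | 86.8%
] | 15 | 1.0%
[ | 15 | 1.0%
3 | 15 | 1.0%
7 | 15 | 1.0%
G | 14 | 0.9%
N | 13 | 0.9%
L | 13 | 0.9%
S | 11 | 0.7%
C | 6 | 0.4%
Other values (32) | 81 | 5.4%

Hangul

Value | Count | Frequency (%)
[missing] | 253 | 4.8%
[missing] | 244 | 4.6%
[missing] | 196 | 3.7%
[missing] | 152 | 2.9%
[missing] | 127 | 2.4%
[missing] | 126 | 2.4%
[missing] | 122 | 2.3%
[missing] | 112 | 2.1%
[missing] | 105 | 2.0%
[missing] | 92 | 1.7%
Other values (353) | 3769 | 71.1%

None

Value | Count | Frequency (%)
[missing] | 1 | 16.7%
[missing] | 1 | 16.7%
[missing] | 1 | 16.7%
[missing] | 1 | 16.7%
[missing] | 1 | 16.7%
[missing] | 1 | 16.7%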

출원인
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.6 KiB

㈜한국가스기술공사: 439

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: ㈜한국가스기술공사
2nd row: ㈜한국가스기술공사
3rd row: ㈜한국가스기술공사
4th row: ㈜한국가스기술공사
5th row: ㈜한국가스기술공사

Common Values

Value | Count | Frequency (%)
㈜한국가스기술공사 | 439 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

공동
Boolean

IMBALANCE 

Distinct: 2
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 571.0 B

False: 434
True: 5

Value | Count | Frequency (%)
False | 434 | 98.9%
True | 5 | 1.1%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

Variable | 권리 | 등록번호 | 공동
권리 | 1.000 | 1.000 | 0.181
등록번호 | 1.000 | 1.000 | 0.183
공동 | 0.181 | 0.183 | 1.000

[Figure: correlation heatmap]

Variable | 공동 | 권리
공동 | 1.000 | 0.120
권리 | 0.120 | 1.000

[Figure: correlation heatmap]

Variable | 등록번호 | 권리 | 공동
등록번호 | 1.000 | 1.000 | 0.120
권리 | 1.000 | 1.000 | 0.120
공동 | 0.120 | 0.120 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly pick out patterns in data completion.]

Sample

First rows

Index | 권리 | 등록번호 | 등록일자 | 명칭 | 출원인 | 공동
0 | 특허 | 1005400000000 | 2005-12-26 | 지하시설물의 촬영장치 및 그 촬영방법 | ㈜한국가스기술공사 | N
1 | 특허 | 1005900000000 | 2006-06-07 | 미압 안전밸브의 성능시험장비 | ㈜한국가스기술공사 | N
2 | 특허 | 1006060000000 | 2006-07-20 | 밸브조작기용 인디케이터 | ㈜한국가스기술공사 | N
3 | 특허 | 1006530000000 | 2006-11-24 | 계측 데이터 무선 송수신 방법 및 장치 | ㈜한국가스기술공사 | N
4 | 특허 | 1006540000000 | 2006-11-28 | 조인트부의 가스누설 점검장치 | ㈜한국가스기술공사 | N
5 | 특허 | 1006540000000 | 2006-11-28 | 항타용 지그 | ㈜한국가스기술공사 | N
6 | 특허 | 1006610000000 | 2006-12-18 | 제품 이송용 완충장치 | ㈜한국가스기술공사 | N
7 | 특허 | 1006940000000 | 2007-03-06 | 시설물 점검함 조립체 | ㈜한국가스기술공사 | N
8 | 특허 | 1007160000000 | 2007-05-02 | 해수펌프의 커플링 분해방법 | ㈜한국가스기술공사 | N
9 | 특허 | 1007200000000 | 2007-05-11 | 매설물 표시용 라인마커 설치장치 | ㈜한국가스기술공사 | N

Last rows

Index | 권리 | 등록번호 | 등록일자 | 명칭 | 출원인 | 공동
429 | 상표 | 4103950000000 | 2017-04-24 | 제 [37] 류 | ㈜한국가스기술공사 | N
430 | 상표 | 4103950000000 | 2017-04-24 | 제 [37] 류 | ㈜한국가스기술공사 | N
431 | 상표 | 4103950000000 | 2017-04-24 | 제 [37] 류 | ㈜한국가스기술공사 | N
432 | 상표 | 4103950000000 | 2017-04-24 | 제 [37] 류 | ㈜한국가스기술공사 | N
433 | 상표 | 4103950000000 | 2017-04-24 | 제 [37] 류 | ㈜한국가스기술공사 | N
434 | 상표 | 4013300000000 | 2018-02-09 | 제 [37] 류 | ㈜한국가스기술공사 | N
435 | 상표 | 4013300000000 | 2018-02-09 | 제 [37] 류 | ㈜한국가스기술공사 | N
436 | 상표 | 4018870000000 | 2022-07-06 | 제 [37] 류 | ㈜한국가스기술공사 | N
437 | 상표 | 4018870000000 | 2022-07-06 | 제 [37] 류 | ㈜한국가스기술공사 | N
438 | 상표 | 4018870000000 | 2022-07-06 | 제 [37] 류 | ㈜한국가스기술공사 | N

Duplicate rows

Most frequently occurring

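Index | 권리 | 등록번호 | 등록일자 | 명칭 | 출원인 | 공동 | # duplicates
3 | 상표 | 4103950000000 | 2017-04-24 | 제 [37] 류 | ㈜한국가스기술공사 | N | 7
1 | 상표 | 4018870000000 | 2022-07-06 | 제 [37] 류 | ㈜한국가스기술공사 | N | 3
2 | 상표 | 4101310000000 | 2006-04-17 | 제 [37] 류 | ㈜한국가스기술공사 | N | 3
0 | 상표 | 4013300000000 | 2018-02-09 | 제 [37] 류 | ㈜한국가스기술공사 | N | 2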
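
The table lists four distinct rows that each occur more than once, which matches the "4 duplicate rows" figure in the overview (4 / 439 ≈ 0.9%). Reading that figure as a count of distinct repeated rows, rather than of surplus copies, is an interpretation of the numbers shown here and not something the report states. The sketch below applies the same grouping to the ten last rows of the sample using only the standard library. Because it sees only those ten rows, the counts are lower than in the full table, and 출원인 and 공동 are omitted since they are identical across these rows.

```python
from collections import Counter

# (권리, 등록번호, 등록일자, 명칭) for sample rows 429-438 of this report
rows = (
    [("상표", 4103950000000, "2017-04-24", "제 [37] 류")] * 5
    + [("상표", 4013300000000, "2018-02-09", "제 [37] 류")] * 2
    + [("상표", 4018870000000, "2022-07-06", "제 [37] 류")] * 3
)

# Count how often each full row occurs, then keep rows seen more than once
occurrences = Counter(rows)
duplicates = {row: n for row, n in occurrences.items() if n > 1}

for row, n in sorted(duplicates.items(), key=lambda item: -item[1]):
    print(row[1], row[2], n)

# Share of distinct duplicated rows in the full dataset, as reported
duplicate_pct = round(4 / 439 * 100, 1)
print(duplicate_pct)  # 0.9
```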