Overview

Dataset statistics

Number of variables: 6
Number of observations: 232
Missing cells: 16
Missing cells (%): 1.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.0 KiB
Average record size in memory: 48.6 B
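
The derived figures above follow from the raw counts. The sketch below recomputes them, assuming the missing-cell percentage is taken over all cells (variables times observations) and that the average record size is the total size divided by the number of observations. The variable names are illustrative.

```python
# Values copied from the Dataset statistics table.
n_variables = 6
n_observations = 232
missing_cells = 16
total_size_bytes = 11.0 * 1024  # 11.0 KiB as rounded in the report

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_bytes / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Both recomputed values round to the figures shown in the table.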

Variable types

Categorical: 1
Text: 3
DateTime: 2

Dataset

Description: Data on the intellectual property rights held by 한국산업기술시험원 (Korea Testing Laboratory). The data covers registration dates from 2006-03-03 to 2024-02-23.
Author: 한국산업기술시험원
URL: https://www.data.go.kr/data/15127339/fileData.do

Alerts

구분 is highly imbalanced (81.9%) [Imbalance]
등록번호 has 8 (3.4%) missing values [Missing]
등록일자 has 8 (3.4%) missing values [Missing]
출원번호 has unique values [Unique]
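
The 81.9% imbalance figure can be reproduced from the class counts of 구분 (222, 8, 2). The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum possible value. The report does not state this definition, so treat it as an assumption that happens to match the reported number. `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts."""
    if len(counts) < 2:
        return 0.0  # a single class has no meaningful imbalance
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(counts))

score = imbalance_score([222, 8, 2])  # 특허, PCT, 실용신안
missing_share = 8 / 232               # missing 등록번호 / 등록일자 values

print(f"imbalance: {score:.1%}")
print(f"missing:   {missing_share:.1%}")
```

A perfectly balanced column scores 0 under this definition, and a column dominated by one class approaches 1.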

Reproduction

Analysis started: 2024-04-06 08:14:27.748901
Analysis finished: 2024-04-06 08:14:28.920269
Duration: 1.17 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
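
The reported duration is the difference between the two timestamps. A minimal check with the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-04-06 08:14:27.748901", FMT)
finished = datetime.strptime("2024-04-06 08:14:28.920269", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```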

Variables

구분
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
특허: 222
PCT: 8
실용신안: 2

Length

Max length: 4
Median length: 2
Mean length: 2.0517241
Min length: 2
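
These length statistics follow directly from the three category values and their counts. The sketch below rebuilds the list of string lengths, assuming length means the number of Unicode code points (which is what Python's `len` returns for `str`).

```python
from statistics import mean, median

# Category values and counts copied from the 구분 summary.
counts = {"특허": 222, "PCT": 8, "실용신안": 2}

# One length entry per row of the dataset.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

print("max:", max(lengths))
print("median:", median(lengths))
print("mean:", round(mean(lengths), 7))
print("min:", min(lengths))
```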

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 특허
2nd row: 특허
3rd row: 특허
4th row: 특허
5th row: 특허

Common Values

Value     Count  Frequency (%)
특허      222    95.7%
PCT       8      3.4%
실용신안  2      0.9%

Length

Histogram of lengths of the category

Common Values (Plot)

Value     Count  Frequency (%)
특허      222    95.7%
pct       8      3.4%
실용신안  2      0.9%

출원번호
Text

UNIQUE 

Distinct: 232
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 17
Median length: 15
Mean length: 15.068966
Min length: 15

Characters and Unicode

Total characters: 3496
Distinct characters: 17
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
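
As an illustration of these character properties, the sketch below tallies Unicode general categories for one domestic application number and one PCT number taken from this report, then scales the tallies to the full column. It assumes all 224 non-PCT values share the 15-character domestic layout and all 8 PCT values share the 17-character layout. The short codes returned by `unicodedata.category` map to the report's labels as Nd = Decimal Number, Pd = Dash Punctuation, Lu = Uppercase Letter, Po = Other Punctuation.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories of the characters in text."""
    return Counter(unicodedata.category(ch) for ch in text)

domestic = "10-2003-0079480"   # sample value from this report
pct = "PCT/KR2014/004405"      # sample value from this report

# Assumption: 224 rows look like `domestic`, 8 rows look like `pct`.
totals = Counter()
for sample, n_rows in [(domestic, 224), (pct, 8)]:
    for category, n in category_counts(sample).items():
        totals[category] += n * n_rows

print(dict(totals))
print("total characters:", sum(totals.values()))
```

Under that assumption the totals agree with the category counts and the total character count reported for this variable.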

Unique

Unique: 232
Unique (%): 100.0%

Sample

1st row: 10-2003-0079480
2nd row: 10-2004-0078894
3rd row: 10-2005-0025991
4th row: 10-2008-0093253
5th row: 10-2008-0080471

Value               Count  Frequency (%)
10-2003-0079480     1      0.4%
10-2020-0124652     1      0.4%
10-2021-0046519     1      0.4%
10-2018-0130964     1      0.4%
10-2018-0097082     1      0.4%
10-2019-0108531     1      0.4%
10-2019-0125442     1      0.4%
10-2019-0036489     1      0.4%
10-2019-0108533     1      0.4%
10-2020-0139917     1      0.4%
Other values (222)  222    95.7%

Most occurring characters

Value             Count  Frequency (%)
0                 968    27.7%
1                 616    17.6%
-                 448    12.8%
2                 435    12.4%
3                 162    4.6%
4                 153    4.4%
5                 149    4.3%
9                 141    4.0%
8                 128    3.7%
7                 121    3.5%
Other values (7)  175    5.0%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     2992   85.6%
Dash Punctuation   448    12.8%
Uppercase Letter   40     1.1%
Other Punctuation  16     0.5%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      968    32.4%
1      616    20.6%
2      435    14.5%
3      162    5.4%
4      153    5.1%
5      149    5.0%
9      141    4.7%
8      128    4.3%
7      121    4.0%
6      119    4.0%

Uppercase Letter
Value  Count  Frequency (%)
P      8      20.0%
C      8      20.0%
T      8      20.0%
K      8      20.0%
R      8      20.0%

Dash Punctuation
Value  Count  Frequency (%)
-      448    100.0%

Other Punctuation
Value  Count  Frequency (%)
/      16     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  3456   98.9%
Latin   40     1.1%

Most frequent character per script

Common
Value             Count  Frequency (%)
0                 968    28.0%
1                 616    17.8%
-                 448    13.0%
2                 435    12.6%
3                 162    4.7%
4                 153    4.4%
5                 149    4.3%
9                 141    4.1%
8                 128    3.7%
7                 121    3.5%
Other values (2)  135    3.9%

Latin
Value  Count  Frequency (%)
P      8      20.0%
C      8      20.0%
T      8      20.0%
K      8      20.0%
R      8      20.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  3496   100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
0                 968    27.7%
1                 616    17.6%
-                 448    12.8%
2                 435    12.4%
3                 162    4.6%
4                 153    4.4%
5                 149    4.3%
9                 141    4.0%
8                 128    3.7%
7                 121    3.5%
Other values (7)  175    5.0%

출원일자
Date

Distinct: 190
Distinct (%): 81.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
Minimum: 2003-11-11 00:00:00
Maximum: 2023-12-28 00:00:00
Histogram with fixed size bins (bins=50)

등록번호
Text

MISSING 

Distinct: 224
Distinct (%): 100.0%
Missing: 8
Missing (%): 3.4%
Memory size: 1.9 KiB

Length

Max length: 16
Median length: 16
Mean length: 16
Min length: 16
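
A constant length of 16 suggests a fixed layout. The sketch below checks a few registration numbers from this report against an assumed pattern of digit groups (2-7-2-2) separated by hyphens. The pattern is inferred from the sample values, not taken from an official specification, and `is_registration_number` is a hypothetical helper.

```python
import re

# Assumed layout inferred from the samples: NN-NNNNNNN-NN-NN.
REGISTRATION_RE = re.compile(r"\d{2}-\d{7}-\d{2}-\d{2}")

def is_registration_number(value):
    """Return True if value matches the assumed 16-character layout."""
    return REGISTRATION_RE.fullmatch(value) is not None

samples = ["10-0559050-00-00", "10-0614811-00-00", "20-0484612-00-00"]
for s in samples:
    print(s, len(s), is_registration_number(s))

# 232 observations minus 8 missing values, each 16 characters long.
non_missing = 232 - 8
print("expected total characters:", non_missing * 16)
```

The expected total matches the total character count reported for this variable.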

Characters and Unicode

Total characters: 3584
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 224
Unique (%): 100.0%

Sample

1st row: 10-0559050-00-00
2nd row: 10-0614811-00-00
3rd row: 10-0693070-00-00
4th row: 10-0970277-00-00
5th row: 10-0999072-00-00

Value               Count  Frequency (%)
10-0614811-00-00    1      0.4%
10-1329152-00-00    1      0.4%
10-2266262-00-00    1      0.4%
10-2227514-00-00    1      0.4%
10-2230216-00-00    1      0.4%
10-2230349-00-00    1      0.4%
10-2232385-00-00    1      0.4%
10-2233463-00-00    1      0.4%
10-2237914-00-00    1      0.4%
10-2247355-00-00    1      0.4%
Other values (214)  214    95.5%

Most occurring characters

Value  Count  Frequency (%)
0      1238   34.5%
-      672    18.8%
1      454    12.7%
2      267    7.4%
6      153    4.3%
5      150    4.2%
7      139    3.9%
9      132    3.7%
4      131    3.7%
3      125    3.5%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    2912   81.2%
Dash Punctuation  672    18.8%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      1238   42.5%
1      454    15.6%
2      267    9.2%
6      153    5.3%
5      150    5.2%
7      139    4.8%
9      132    4.5%
4      131    4.5%
3      125    4.3%
8      123    4.2%

Dash Punctuation
Value  Count  Frequency (%)
-      672    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  3584   100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
0      1238   34.5%
-      672    18.8%
1      454    12.7%
2      267    7.4%
6      153    4.3%
5      150    4.2%
7      139    3.9%
9      132    3.7%
4      131    3.7%
3      125    3.5%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  3584   100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
0      1238   34.5%
-      672    18.8%
1      454    12.7%
2      267    7.4%
6      153    4.3%
5      150    4.2%
7      139    3.9%
9      132    3.7%
4      131    3.7%
3      125    3.5%

등록일자
Date

MISSING 

Distinct: 193
Distinct (%): 86.2%
Missing: 8
Missing (%): 3.4%
Memory size: 1.9 KiB
Minimum: 2006-03-03 00:00:00
Maximum: 2024-02-23 00:00:00
Histogram with fixed size bins (bins=50)

명칭
Text

Distinct: 221
Distinct (%): 95.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 86
Median length: 55
Mean length: 25.646552
Min length: 6

Characters and Unicode

Total characters: 5950
Distinct characters: 400
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 212
Unique (%): 91.4%

Sample

1st row: 권상기용 기어박스의 성능 및 신뢰성 평가용부하시험장치
2nd row: 체인을 구비한 감속기의 성능측정시스템
3rd row: 투명전도막의 접착력 시험방법
4th row: 전계 측정기용 교정 장치
5th row: 용접전류 측정기의 교정 시스템 및 교정 방법
ValueCountFrequency (%)
85
 
5.7%
시스템 57
 
3.8%
장치 48
 
3.2%
이용한 45
 
3.0%
방법 44
 
2.9%
이를 28
 
1.9%
성능 15
 
1.0%
13
 
0.9%
위한 13
 
0.9%
평가 12
 
0.8%
Other values (744) 1138
76.0%

Most occurring characters

ValueCountFrequency (%)
1268
 
21.3%
154
 
2.6%
141
 
2.4%
136
 
2.3%
129
 
2.2%
129
 
2.2%
115
 
1.9%
112
 
1.9%
100
 
1.7%
96
 
1.6%
Other values (390) 3570
60.0%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       4596   77.2%
Space Separator    1268   21.3%
Uppercase Letter   48     0.8%
Other Punctuation  18     0.3%
Decimal Number     8      0.1%
Dash Punctuation   5      0.1%
Lowercase Letter   5      0.1%
Other Symbol       1      < 0.1%
Math Symbol        1      < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
154
 
3.4%
141
 
3.1%
136
 
3.0%
129
 
2.8%
129
 
2.8%
115
 
2.5%
112
 
2.4%
100
 
2.2%
96
 
2.1%
95
 
2.1%
Other values (352) 3389
73.7%
Uppercase Letter
ValueCountFrequency (%)
O 6
12.5%
5
 
10.4%
D 4
 
8.3%
A 3
 
6.2%
U 3
 
6.2%
S 3
 
6.2%
3
 
6.2%
V 3
 
6.2%
B 2
 
4.2%
E 2
 
4.2%
Other values (12) 14
29.2%
Decimal Number
Value  Count  Frequency (%)
5      2      25.0%
0      2      25.0%
3      2      25.0%
2      1      12.5%
1      1      12.5%

Lowercase Letter
Value  Count  Frequency (%)
m      1      20.0%
n      1      20.0%
i      1      20.0%
v      1      20.0%
e      1      20.0%

Other Punctuation
Value  Count  Frequency (%)
,      15     83.3%
/      3      16.7%

Space Separator
Value    Count  Frequency (%)
(space)  1268   100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      5      100.0%

Other Symbol
ValueCountFrequency (%)
1
100.0%
Math Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  4596   77.2%
Common  1301   21.9%
Latin   53     0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
154
 
3.4%
141
 
3.1%
136
 
3.0%
129
 
2.8%
129
 
2.8%
115
 
2.5%
112
 
2.4%
100
 
2.2%
96
 
2.1%
95
 
2.1%
Other values (352) 3389
73.7%
Latin
ValueCountFrequency (%)
O 6
 
11.3%
5
 
9.4%
D 4
 
7.5%
A 3
 
5.7%
U 3
 
5.7%
S 3
 
5.7%
3
 
5.7%
V 3
 
5.7%
B 2
 
3.8%
E 2
 
3.8%
Other values (17) 19
35.8%
Common
ValueCountFrequency (%)
1268
97.5%
, 15
 
1.2%
- 5
 
0.4%
/ 3
 
0.2%
5 2
 
0.2%
0 2
 
0.2%
3 2
 
0.2%
2 1
 
0.1%
1
 
0.1%
1 1
 
0.1%

Most occurring blocks

Value           Count  Frequency (%)
Hangul          4596   77.2%
ASCII           1338   22.5%
None            14     0.2%
CJK Compat      1      < 0.1%
Math Operators  1      < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1268
94.8%
, 15
 
1.1%
O 6
 
0.4%
- 5
 
0.4%
D 4
 
0.3%
/ 3
 
0.2%
A 3
 
0.2%
U 3
 
0.2%
S 3
 
0.2%
V 3
 
0.2%
Other values (19) 25
 
1.9%
Hangul
ValueCountFrequency (%)
154
 
3.4%
141
 
3.1%
136
 
3.0%
129
 
2.8%
129
 
2.8%
115
 
2.5%
112
 
2.4%
100
 
2.2%
96
 
2.1%
95
 
2.1%
Other values (352) 3389
73.7%
None
ValueCountFrequency (%)
5
35.7%
3
21.4%
2
 
14.3%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
CJK Compat
ValueCountFrequency (%)
1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
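
In this dataset the two columns with missing values, 등록번호 and 등록일자, each lack 8 values, and the rows shown in the Sample section suggest they are missing together on the PCT rows. The sketch below checks that pattern on four rows copied from the Sample section, with `None` standing in for `<NA>`. It only inspects those rows, so the conclusion for the full dataset remains an assumption.

```python
# (구분, 등록번호, 등록일자) copied from the Sample section; None stands for <NA>.
rows = [
    ("특허", "10-0559050-00-00", "2006-03-03"),
    ("실용신안", "20-0484612-00-00", "2017-09-22"),
    ("PCT", None, None),  # application PCT/KR2014/004405
    ("PCT", None, None),  # application PCT/KR2023/021915
]

# Both registration fields should be present together or missing together.
consistent = all((number is None) == (date is None) for _, number, date in rows)

# Count missing cells per category.
missing_by_kind = {}
for kind, number, date in rows:
    n_missing = (number is None) + (date is None)
    if n_missing:
        missing_by_kind[kind] = missing_by_kind.get(kind, 0) + n_missing

print("missing together:", consistent)
print("missing cells by 구분:", missing_by_kind)
print("8 PCT rows x 2 columns =", 8 * 2, "missing cells")
```

If all 8 PCT rows follow this pattern, they account for the 16 missing cells listed in the dataset statistics.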

Sample

     구분      출원번호           출원일자    등록번호          등록일자    명칭
0    특허      10-2003-0079480    2003-11-11  10-0559050-00-00  2006-03-03  권상기용 기어박스의 성능 및 신뢰성 평가용부하시험장치
1    특허      10-2004-0078894    2004-10-05  10-0614811-00-00  2006-08-16  체인을 구비한 감속기의 성능측정시스템
2    특허      10-2005-0025991    2005-03-29  10-0693070-00-00  2007-03-05  투명전도막의 접착력 시험방법
3    특허      10-2008-0093253    2008-09-23  10-0970277-00-00  2010-07-07  전계 측정기용 교정 장치
4    특허      10-2008-0080471    2008-08-18  10-0999072-00-00  2010-12-01  용접전류 측정기의 교정 시스템 및 교정 방법
5    특허      10-2008-0098626    2008-10-08  10-1006248-00-00  2010-12-29  정전전압 측정기용 교정 장치
6    특허      10-2009-0112023    2009-11-19  10-1046604-00-00  2011-06-29  주파수 디텍팅 기능을 구비한 진동 측정 장치 및 그 동작 방법
7    특허      10-2009-0111143    2009-11-17  10-1104671-00-00  2012-01-04  토크렌치 교정 자동화 시스템
8    특허      10-2010-0019777    2010-03-05  10-1127429-00-00  2012-03-09  주파수 디텍팅 장치 및 방법
9    특허      10-2011-0113234    2011-11-02  10-1260038-00-00  2013-04-25  소음계 지향특성 측정장치

     구분      출원번호           출원일자    등록번호          등록일자    명칭
222  실용신안  20-2016-0001349    2016-03-14  20-0484612-00-00  2017-09-22  부유물질 차단배플을 구비한 경사판침전장치
223  실용신안  20-2020-0004682    2020-12-18  20-0495463-00-00  2022-05-24  공기 살균 및 정화장치의 유로구조
224  PCT       PCT/KR2014/004405  2014-05-16  <NA>              <NA>        이산화탄소 포집제 성능평가장치
225  PCT       PCT/KR2015/004280  2015-04-29  <NA>              <NA>        해양선박 폐열을 이용한 막증류 수처리 장치
226  PCT       PCT/KR2015/007592  2015-07-22  <NA>              <NA>        분리막 성능평가장치
227  PCT       PCT/KR2016/000801  2016-01-26  <NA>              <NA>        무대 시설물의 이상 감지 및 이를 이용한 고장 예측 시스템과, 그 방법
228  PCT       PCT/KR2017/006127  2017-06-13  <NA>              <NA>        연료증발가스 포집기 성능평가장치
229  PCT       PCT/KR2018/014402  2018-11-22  <NA>              <NA>        광트랩을 이용한 윈드라이다의 교정장치 및 이를 이용한 교정방법
230  PCT       PCT/KR2022/021559  2022-12-29  <NA>              <NA>        사고 통지 방법 및 이를 이용하는 차량과 이를 포함하는 지능형 교통 정보 제공 시스템
231  PCT       PCT/KR2023/021915  2023-12-28  <NA>              <NA>        무선 통신 시스템에서 핸드오버 절차를 수행하는 방법 및 장치