Overview

Dataset statistics

Number of variables: 5
Number of observations: 973
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 40.0 KiB
Average record size in memory: 42.1 B
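
As a quick consistency check, the average record size should be roughly the total size in memory divided by the number of observations. A minimal sketch, assuming 1 KiB = 1024 B and that the report rounds to one decimal place:

```python
# Figures copied from the dataset statistics above.
total_kib = 40.0        # total size in memory, in KiB
n_observations = 973

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")  # prints "42.1 B"
```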

Variable types

Numeric: 2
DateTime: 1
Text: 2

Dataset

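Description: Super citation information from the public patent information service, for which the Korean Intellectual Property Office (KIPO) has built a database of all the domestic and foreign intellectual property information it holds so that users can search and browse it over the internet.
Author: Korean Intellectual Property Office (특허청)
URL: https://www.data.go.kr/data/15089856/fileData.do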

Alerts

출원번호 (application number) has unique values (alert type: Unique)

Reproduction

Analysis started: 2023-12-12 00:54:43.974909
Analysis finished: 2023-12-12 00:54:45.342449
Duration: 1.37 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
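
The reported duration can be recomputed from the two timestamps. A minimal sketch using only the standard library; the variable names are illustrative:

```python
from datetime import datetime

# Timestamps copied from the reproduction details above.
started = datetime.fromisoformat("2023-12-12 00:54:43.974909")
finished = datetime.fromisoformat("2023-12-12 00:54:45.342449")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} s")  # prints "1.37 s"
```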

Variables

출원번호 (application number)
Real number (ℝ)

UNIQUE

Distinct: 973
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.0386997 × 10^12
Minimum: 1.0202 × 10^12
Maximum: 2.0202 × 10^12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.7 KiB

Quantile statistics

Minimum: 1.0202 × 10^12
5-th percentile: 1.0202 × 10^12
Q1: 1.0202 × 10^12
Median: 1.0202 × 10^12
Q3: 1.0202001 × 10^12
95-th percentile: 1.0202001 × 10^12
Maximum: 2.0202 × 10^12
Range: 1 × 10^12
Interquartile range (IQR): 55086

Descriptive statistics

Standard deviation: 1.3481813 × 10^11
Coefficient of variation (CV): 0.1297951
Kurtosis: 49.333713
Mean: 1.0386997 × 10^12
Median Absolute Deviation (MAD): 26046
Skewness: 7.157675
Sum: 1.0106548 × 10^15
Variance: 1.8175929 × 10^22
Monotonicity: Not monotonic
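
Several of these figures are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the square of the standard deviation, and the coefficient of variation is the standard deviation divided by the mean. A minimal sketch that checks the rounded values against each other; the tolerance of 1e-6 is an assumption chosen to absorb rounding in the report:

```python
import math

# Values copied from the descriptive statistics for 출원번호.
n = 973
mean = 1.0386997e12
std = 1.3481813e11
total = 1.0106548e15
variance = 1.8175929e22
cv = 0.1297951

# Each check compares a recomputed value with the reported one.
checks = {
    "mean == sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance == std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "cv == std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```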
[Figure: histogram with fixed-size bins (bins=50)]
Common values
Value | Count | Frequency (%)
1020200068848 | 1 | 0.1%
1020200029992 | 1 | 0.1%
1020200034497 | 1 | 0.1%
1020200034478 | 1 | 0.1%
1020200034305 | 1 | 0.1%
1020200034510 | 1 | 0.1%
1020200034524 | 1 | 0.1%
1020200033878 | 1 | 0.1%
1020200033835 | 1 | 0.1%
1020200033599 | 1 | 0.1%
Other values (963) | 963 | 99.0%

Extreme values (smallest first)
Value | Count | Frequency (%)
1020200000189 | 1 | 0.1%
1020200000191 | 1 | 0.1%
1020200000234 | 1 | 0.1%
1020200000385 | 1 | 0.1%
1020200000564 | 1 | 0.1%
1020200000665 | 1 | 0.1%
1020200000746 | 1 | 0.1%
1020200000981 | 1 | 0.1%
1020200001070 | 1 | 0.1%
1020200001104 | 1 | 0.1%

Extreme values (largest first)
Value | Count | Frequency (%)
2020200002606 | 1 | 0.1%
2020200002265 | 1 | 0.1%
2020200001853 | 1 | 0.1%
2020200001681 | 1 | 0.1%
2020200001375 | 1 | 0.1%
2020200001240 | 1 | 0.1%
2020200001148 | 1 | 0.1%
2020200000990 | 1 | 0.1%
2020200000850 | 1 | 0.1%
2020200000773 | 1 | 0.1%
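
The values listed for 출원번호 are all 13 digits long and begin with 10 or 20 followed by 2020, which suggests the column is an identifier rather than a quantity, so the mean, skewness, and similar statistics carry little meaning. A minimal sketch of splitting such a number into its parts, assuming the layout is a 2-digit right-type code (commonly 10 for patents and 20 for utility models), a 4-digit filing year, and a 7-digit serial number. This layout and the names `ApplicationNumber` and `split_application_number` are assumptions for illustration; the report itself does not state them.

```python
from typing import NamedTuple


class ApplicationNumber(NamedTuple):
    right_type: str  # assumed: "10" = patent, "20" = utility model
    year: int        # assumed: filing year
    serial: int      # assumed: running serial within the year


def split_application_number(value: int) -> ApplicationNumber:
    """Split a 13-digit application number under the assumed layout."""
    digits = str(value)
    if len(digits) != 13:
        raise ValueError(f"expected 13 digits, got {len(digits)}: {digits}")
    return ApplicationNumber(digits[:2], int(digits[2:6]), int(digits[6:]))


print(split_application_number(1020200068848))
print(split_application_number(2020200002606))
```

Reading the column as a string (or splitting it as above) before profiling would avoid treating the right-type prefix as a numeric outlier.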

출원일자 (filing date)
DateTime

Distinct: 230
Distinct (%): 23.6%
Missing: 0
Missing (%): 0.0%
Memory size: 7.7 KiB
Minimum: 2020-01-02 00:00:00
Maximum: 2021-05-11 00:00:00
[Figure: histogram with fixed-size bins (bins=50)]

발명의 명칭 (title of invention)
Text

Distinct: 964
Distinct (%): 99.1%
Missing: 0
Missing (%): 0.0%
Memory size: 7.7 KiB

Length

Max length: 196
Median length: 72
Mean length: 25.414183
Min length: 2

Characters and Unicode

Total characters: 24728
Distinct characters: 687
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
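
To illustrate how such per-category counts can be derived, here is a minimal sketch that tallies Unicode general categories for one title from this dataset using Python's standard `unicodedata` module. The two-letter codes map to the category names used in this report (for example `Lo` is Other Letter, `Zs` is Space Separator, `Lu` is Uppercase Letter, `Ps` and `Pe` are Open and Close Punctuation). This is not necessarily how ydata-profiling computes its figures.

```python
import unicodedata
from collections import Counter

# One title taken from the dataset.
title = "라텍스 수지를 포함하는 방수아스팔트(LMA) 콘크리트 조성물 및 이의 시공방법"

# Tally the Unicode general category of every character in the title.
category_counts = Counter(unicodedata.category(ch) for ch in title)

for category, count in category_counts.most_common():
    print(category, count)
```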

Unique

Unique: 958
Unique (%): 98.5%

Sample

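1st row: 감염예방 인증 키오스크 (Infection-prevention authentication kiosk)
2nd row: 가공송전선의 안정적 설치를 위한 지지기구 (Support device for stable installation of overhead transmission lines)
3rd row: 마이크로니들 패치를 이용한 최소 침습적 아토피 피부염 검사 방법 및 마이크로니들 패치를 포함하는 최소 침습적 아토피 검사 키트 (Minimally invasive atopic dermatitis testing method using a microneedle patch, and minimally invasive atopy test kit including the microneedle patch)
4th row: 셀룰로오스계 및 아크릴계 증점제, 분말형 고유동화제, 섬유를 함유한 수중 불분리성 친환경 폴리머 모르타르 조성물 및 이를 이용한 단면 보수 보강 공법 (Eco-friendly anti-washout underwater polymer mortar composition containing cellulose-based and acrylic thickeners, a powder-type superplasticizer, and fibers, and cross-section repair and reinforcement method using the same)
5th row: 라텍스 수지를 포함하는 방수아스팔트(LMA) 콘크리트 조성물 및 이의 시공방법 (Waterproof asphalt (LMA) concrete composition containing latex resin, and construction method thereof)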

Most frequent words
Value | Count | Frequency (%)
(token not preserved) | 408 | 6.3%
방법 (method) | 153 | 2.4%
이용한 (using) | 148 | 2.3%
시스템 (system) | 148 | 2.3%
장치 (device) | 136 | 2.1%
이를 (the same) | 94 | 1.4%
조성물 (composition) | 66 | 1.0%
(token not preserved) | 61 | 0.9%
위한 (for) | 54 | 0.8%
제조방법 (manufacturing method) | 42 | 0.6%
Other values (3009) | 5173 | 79.8%

Most occurring characters

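Value | Count | Frequency (%)
(space) | 5511 | 22.3%
(Hangul character not preserved) | 595 | 2.4%
(Hangul character not preserved) | 472 | 1.9%
(Hangul character not preserved) | 462 | 1.9%
(Hangul character not preserved) | 419 | 1.7%
(Hangul character not preserved) | 411 | 1.7%
(Hangul character not preserved) | 397 | 1.6%
(Hangul character not preserved) | 391 | 1.6%
(Hangul character not preserved) | 358 | 1.4%
(Hangul character not preserved) | 339 | 1.4%
Other values (677) | 15373 | 62.2%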

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 18712 | 75.7%
Space Separator | 5511 | 22.3%
Uppercase Letter | 242 | 1.0%
Other Punctuation | 91 | 0.4%
Lowercase Letter | 89 | 0.4%
Decimal Number | 38 | 0.2%
Close Punctuation | 15 | 0.1%
Open Punctuation | 15 | 0.1%
Dash Punctuation | 13 | 0.1%
Other Symbol | 2 | < 0.1%

Most frequent character per category

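Other Letter
Value | Count | Frequency (%)
(character not preserved) | 595 | 3.2%
(character not preserved) | 472 | 2.5%
(character not preserved) | 462 | 2.5%
(character not preserved) | 419 | 2.2%
(character not preserved) | 411 | 2.2%
(character not preserved) | 397 | 2.1%
(character not preserved) | 391 | 2.1%
(character not preserved) | 358 | 1.9%
(character not preserved) | 339 | 1.8%
(character not preserved) | 333 | 1.8%
Other values (614) | 14535 | 77.7%

Uppercase Letter
Value | Count | Frequency (%)
S | 34 | 14.0%
C | 30 | 12.4%
T | 18 | 7.4%
D | 18 | 7.4%
I | 17 | 7.0%
V | 15 | 6.2%
A | 15 | 6.2%
E | 12 | 5.0%
P | 12 | 5.0%
L | 11 | 4.5%
Other values (16) | 60 | 24.8%

Lowercase Letter
Value | Count | Frequency (%)
a | 9 | 10.1%
o | 8 | 9.0%
t | 8 | 9.0%
e | 8 | 9.0%
r | 8 | 9.0%
i | 6 | 6.7%
u | 5 | 5.6%
s | 5 | 5.6%
n | 5 | 5.6%
l | 4 | 4.5%
Other values (9) | 23 | 25.8%

Decimal Number
Value | Count | Frequency (%)
3 | 11 | 28.9%
2 | 9 | 23.7%
1 | 5 | 13.2%
4 | 3 | 7.9%
0 | 3 | 7.9%
9 | 2 | 5.3%
8 | 2 | 5.3%
5 | 2 | 5.3%
7 | 1 | 2.6%

Other Punctuation
Value | Count | Frequency (%)
, | 77 | 84.6%
/ | 9 | 9.9%
. | 3 | 3.3%
· | 2 | 2.2%

Space Separator
Value | Count | Frequency (%)
(space) | 5511 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 15 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 15 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 13 | 100.0%

Other Symbol
Value | Count | Frequency (%)
° | 2 | 100.0%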

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 18710 | 75.7%
Common | 5685 | 23.0%
Latin | 331 | 1.3%
Han | 2 | < 0.1%

Most frequent character per script

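Hangul
Value | Count | Frequency (%)
(character not preserved) | 595 | 3.2%
(character not preserved) | 472 | 2.5%
(character not preserved) | 462 | 2.5%
(character not preserved) | 419 | 2.2%
(character not preserved) | 411 | 2.2%
(character not preserved) | 397 | 2.1%
(character not preserved) | 391 | 2.1%
(character not preserved) | 358 | 1.9%
(character not preserved) | 339 | 1.8%
(character not preserved) | 333 | 1.8%
Other values (612) | 14533 | 77.7%

Latin
Value | Count | Frequency (%)
S | 34 | 10.3%
C | 30 | 9.1%
T | 18 | 5.4%
D | 18 | 5.4%
I | 17 | 5.1%
V | 15 | 4.5%
A | 15 | 4.5%
E | 12 | 3.6%
P | 12 | 3.6%
L | 11 | 3.3%
Other values (35) | 149 | 45.0%

Common
Value | Count | Frequency (%)
(space) | 5511 | 96.9%
, | 77 | 1.4%
) | 15 | 0.3%
( | 15 | 0.3%
- | 13 | 0.2%
3 | 11 | 0.2%
/ | 9 | 0.2%
2 | 9 | 0.2%
1 | 5 | 0.1%
. | 3 | 0.1%
Other values (8) | 17 | 0.3%

Han
Value | Count | Frequency (%)
(character not preserved) | 1 | 50.0%
(character not preserved) | 1 | 50.0%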

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 18710 | 75.7%
ASCII | 6010 | 24.3%
None | 6 | < 0.1%
CJK | 2 | < 0.1%

Most frequent character per block

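ASCII
Value | Count | Frequency (%)
(space) | 5511 | 91.7%
, | 77 | 1.3%
S | 34 | 0.6%
C | 30 | 0.5%
T | 18 | 0.3%
D | 18 | 0.3%
I | 17 | 0.3%
V | 15 | 0.2%
) | 15 | 0.2%
A | 15 | 0.2%
Other values (49) | 260 | 4.3%

Hangul
Value | Count | Frequency (%)
(character not preserved) | 595 | 3.2%
(character not preserved) | 472 | 2.5%
(character not preserved) | 462 | 2.5%
(character not preserved) | 419 | 2.2%
(character not preserved) | 411 | 2.2%
(character not preserved) | 397 | 2.1%
(character not preserved) | 391 | 2.1%
(character not preserved) | 358 | 1.9%
(character not preserved) | 339 | 1.8%
(character not preserved) | 333 | 1.8%
Other values (612) | 14533 | 77.7%

None
Value | Count | Frequency (%)
° | 2 | 33.3%
· | 2 | 33.3%
(character not preserved) | 1 | 16.7%
(character not preserved) | 1 | 16.7%

CJK
Value | Count | Frequency (%)
(character not preserved) | 1 | 50.0%
(character not preserved) | 1 | 50.0%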

분류정보 (classification information)
Text

Distinct: 695
Distinct (%): 71.4%
Missing: 0
Missing (%): 0.0%
Memory size: 7.7 KiB

Length

Max length: 12
Median length: 10
Mean length: 9.692703
Min length: 9

Characters and Unicode

Total characters: 9431
Distinct characters: 31
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 559
Unique (%): 57.5%

Sample

1st row: G07F 17/40
2nd row: H02G 7/20
3rd row: A61B 5/00
4th row: C04B 26/04
5th row: C04B 26/26

Most frequent words
Value | Count | Frequency (%)
g06q | 59 | 3.0%
h02g | 42 | 2.2%
c04b | 24 | 1.2%
e04h | 22 | 1.1%
a61l | 22 | 1.1%
b01d | 21 | 1.1%
g08g | 21 | 1.1%
g06f | 19 | 1.0%
a41d | 17 | 0.9%
a62b | 17 | 0.9%
Other values (686) | 1682 | 86.4%
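
The sample values have the shape of International Patent Classification (IPC) symbols, and the most frequent words listed for this column are the lowercased first parts of those symbols. A minimal sketch of splitting such a symbol into its parts, assuming the usual IPC layout of section letter, two-digit class, subclass letter, main group, and subgroup. The names `IpcSymbol`, `parse_ipc`, and `subclass_key` are hypothetical, and the report does not confirm that every value follows this pattern.

```python
import re
from typing import NamedTuple


class IpcSymbol(NamedTuple):
    section: str     # one letter, A to H
    ipc_class: str   # two digits
    subclass: str    # one letter
    main_group: str  # digits before the slash
    subgroup: str    # digits after the slash


# Assumed pattern, e.g. "G07F 17/40".
_IPC_PATTERN = re.compile(r"^([A-H])(\d{2})([A-Z])\s+(\d{1,4})/(\d{2,6})$")


def parse_ipc(symbol: str) -> IpcSymbol:
    """Parse one classification string under the assumed IPC layout."""
    match = _IPC_PATTERN.match(symbol.strip())
    if match is None:
        raise ValueError(f"not recognised as an IPC symbol: {symbol!r}")
    return IpcSymbol(*match.groups())


def subclass_key(symbol: str) -> str:
    """Return the lowercased subclass code, e.g. 'g07f'."""
    parsed = parse_ipc(symbol)
    return (parsed.section + parsed.ipc_class + parsed.subclass).lower()


print(parse_ipc("G07F 17/40"))
print(subclass_key("G06Q 50/06"))  # prints "g06q"
```

Grouping by `subclass_key` rather than by the full symbol would give a coarser view of the technology areas represented in the data.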

Most occurring characters

Value | Count | Frequency (%)
0 | 1490 | 15.8%
1 | 975 | 10.3%
(space) | 973 | 10.3%
/ | 973 | 10.3%
2 | 683 | 7.2%
6 | 514 | 5.5%
4 | 414 | 4.4%
3 | 411 | 4.4%
B | 336 | 3.6%
5 | 328 | 3.5%
Other values (21) | 2334 | 24.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 5539 | 58.7%
Uppercase Letter | 1946 | 20.6%
Space Separator | 973 | 10.3%
Other Punctuation | 973 | 10.3%

Most frequent character per category

Uppercase Letter
Value | Count | Frequency (%)
B | 336 | 17.3%
G | 317 | 16.3%
H | 205 | 10.5%
A | 191 | 9.8%
C | 153 | 7.9%
F | 151 | 7.8%
D | 134 | 6.9%
E | 129 | 6.6%
Q | 65 | 3.3%
L | 65 | 3.3%
Other values (9) | 200 | 10.3%

Decimal Number
Value | Count | Frequency (%)
0 | 1490 | 26.9%
1 | 975 | 17.6%
2 | 683 | 12.3%
6 | 514 | 9.3%
4 | 414 | 7.5%
3 | 411 | 7.4%
5 | 328 | 5.9%
7 | 248 | 4.5%
8 | 241 | 4.4%
9 | 235 | 4.2%

Space Separator
Value | Count | Frequency (%)
(space) | 973 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 973 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 7485 | 79.4%
Latin | 1946 | 20.6%

Most frequent character per script

Latin
Value | Count | Frequency (%)
B | 336 | 17.3%
G | 317 | 16.3%
H | 205 | 10.5%
A | 191 | 9.8%
C | 153 | 7.9%
F | 151 | 7.8%
D | 134 | 6.9%
E | 129 | 6.6%
Q | 65 | 3.3%
L | 65 | 3.3%
Other values (9) | 200 | 10.3%

Common
Value | Count | Frequency (%)
0 | 1490 | 19.9%
1 | 975 | 13.0%
(space) | 973 | 13.0%
/ | 973 | 13.0%
2 | 683 | 9.1%
6 | 514 | 6.9%
4 | 414 | 5.5%
3 | 411 | 5.5%
5 | 328 | 4.4%
7 | 248 | 3.3%
Other values (2) | 476 | 6.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 9431 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 1490 | 15.8%
1 | 975 | 10.3%
(space) | 973 | 10.3%
/ | 973 | 10.3%
2 | 683 | 7.2%
6 | 514 | 5.5%
4 | 414 | 4.4%
3 | 411 | 4.4%
B | 336 | 3.6%
5 | 328 | 3.5%
Other values (21) | 2334 | 24.7%

피인용_횟수 (number of times cited)
Real number (ℝ)

Distinct: 9
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.2476876
Minimum: 1
Maximum: 17
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 1
95-th percentile: 2
Maximum: 17
Range: 16
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.88267929
Coefficient of variation (CV): 0.70745218
Kurtosis: 132.45806
Mean: 1.2476876
Median Absolute Deviation (MAD): 0
Skewness: 9.3263735
Sum: 1214
Variance: 0.77912273
Monotonicity: Decreasing
[Figure: histogram with fixed-size bins (bins=9)]
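Common values
Value | Count | Frequency (%)
1 | 821 | 84.4%
2 | 114 | 11.7%
3 | 21 | 2.2%
4 | 8 | 0.8%
5 | 3 | 0.3%
7 | 2 | 0.2%
6 | 2 | 0.2%
17 | 1 | 0.1%
12 | 1 | 0.1%

Extreme values (smallest first)
Value | Count | Frequency (%)
1 | 821 | 84.4%
2 | 114 | 11.7%
3 | 21 | 2.2%
4 | 8 | 0.8%
5 | 3 | 0.3%
6 | 2 | 0.2%
7 | 2 | 0.2%
12 | 1 | 0.1%
17 | 1 | 0.1%

Extreme values (largest first)
Value | Count | Frequency (%)
17 | 1 | 0.1%
12 | 1 | 0.1%
7 | 2 | 0.2%
6 | 2 | 0.2%
5 | 3 | 0.3%
4 | 8 | 0.8%
3 | 21 | 2.2%
2 | 114 | 11.7%
1 | 821 | 84.4%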
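
Because 피인용_횟수 takes only nine distinct values, its summary statistics can be recomputed directly from the frequency table. A minimal sketch, assuming the variance is the sample variance (divisor n - 1); the assumption is consistent with the reported figures but is not stated in the report:

```python
import math

# Value -> count, copied from the frequency table for 피인용_횟수.
counts = {1: 821, 2: 114, 3: 21, 4: 8, 5: 3, 6: 2, 7: 2, 12: 1, 17: 1}

n = sum(counts.values())                           # number of observations
total = sum(value * count for value, count in counts.items())
mean = total / n

# Assumption: sample variance with divisor (n - 1).
variance = sum(count * (value - mean) ** 2
               for value, count in counts.items()) / (n - 1)
std = math.sqrt(variance)

print(n, total)                 # prints "973 1214"
print(f"mean={mean:.7f} variance={variance:.8f} std={std:.8f}")
```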

Interactions

[Figures: four pairwise interaction plots for the numeric variables 출원번호 and 피인용_횟수]

Correlations

[Figure: correlation heatmap]
 | 출원번호 | 피인용_횟수
출원번호 | 1.000 | 0.000
피인용_횟수 | 0.000 | 1.000

[Figure: correlation heatmap]
 | 출원번호 | 피인용_횟수
출원번호 | 1.000 | -0.065
피인용_횟수 | -0.065 | 1.000

Missing values

[Figure: count of non-missing values per column]
A simple visualization of nullity by column.
[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

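First rows
Index | 출원번호 (application number) | 출원일자 (filing date) | 발명의 명칭 (title of invention) | 분류정보 (classification information) | 피인용_횟수 (number of times cited)
0 | 1020200068848 | 2020-06-08 | 감염예방 인증 키오스크 (Infection-prevention authentication kiosk) | G07F 17/40 | 17
1 | 1020200023300 | 2020-02-26 | 가공송전선의 안정적 설치를 위한 지지기구 (Support device for stable installation of overhead transmission lines) | H02G 7/20 | 12
2 | 1020200041957 | 2020-04-07 | 마이크로니들 패치를 이용한 최소 침습적 아토피 피부염 검사 방법 및 마이크로니들 패치를 포함하는 최소 침습적 아토피 검사 키트 (Minimally invasive atopic dermatitis testing method using a microneedle patch, and minimally invasive atopy test kit including the microneedle patch) | A61B 5/00 | 7
3 | 1020200003991 | 2020-01-13 | 셀룰로오스계 및 아크릴계 증점제, 분말형 고유동화제, 섬유를 함유한 수중 불분리성 친환경 폴리머 모르타르 조성물 및 이를 이용한 단면 보수 보강 공법 (Eco-friendly anti-washout underwater polymer mortar composition containing cellulose-based and acrylic thickeners, a powder-type superplasticizer, and fibers, and cross-section repair and reinforcement method using the same) | C04B 26/04 | 7
4 | 1020200052102 | 2020-04-29 | 라텍스 수지를 포함하는 방수아스팔트(LMA) 콘크리트 조성물 및 이의 시공방법 (Waterproof asphalt (LMA) concrete composition containing latex resin, and construction method thereof) | C04B 26/26 | 6
5 | 1020200037937 | 2020-03-30 | 전염병 환자 추척 시스템 및 이를 이용한 전염병 환자 추적 방법 (Infectious disease patient tracking system and infectious disease patient tracking method using the same) | G16H 50/80 | 6
6 | 1020200109799 | 2020-08-31 | 사물인터넷을 이용한 송배전선로의 전력정보 측정, 수집 및 분석방법 및 시스템 (Method and system for measuring, collecting, and analyzing power information of transmission and distribution lines using the Internet of Things) | G06Q 50/06 | 5
7 | 1020200045027 | 2020-04-14 | 감염병 확산 방지를 위한 출입통제 시스템 (Access control system for preventing the spread of infectious disease) | G07C 9/00 | 5
8 | 1020200022827 | 2020-02-25 | 수중 불분리성 시멘트 모르타르 조성물 및 이를 이용한 수처리 구조물 보수보강공법 (Anti-washout underwater cement mortar composition and repair and reinforcement method for water-treatment structures using the same) | C04B 22/08 | 5
9 | 1020200074711 | 2020-06-19 | 안면인식 발열체크 및 방역용 전신 소독시스템 (Full-body disinfection system for quarantine with facial recognition and fever check) | A61L 2/24 | 4

Last rows
Index | 출원번호 (application number) | 출원일자 (filing date) | 발명의 명칭 (title of invention) | 분류정보 (classification information) | 피인용_횟수 (number of times cited)
963 | 1020200001504 | 2020-01-06 | 저속회전으로도 데드존의 발생을 최소화시켜 수처리 효율을 향상시킬 수 있는 수처리용 교반기 (Water-treatment agitator that minimizes dead zones even at low rotation speed to improve water-treatment efficiency) | B01F 7/00 | 1
964 | 1020200001366 | 2020-01-06 | 제과제빵용 스팀통 (Steam container for confectionery and baking) | A21B 1/40 | 1
965 | 1020200001104 | 2020-01-05 | 건물 구조체 (Building structure) | E04H 1/12 | 1
966 | 1020200000665 | 2020-01-03 | 겔타임 조절이 용이한 시멘트 광물계 친환경 그라우트 조성물 (Eco-friendly cement-mineral-based grout composition with easily adjustable gel time) | C09K 17/10 | 1
967 | 1020200001070 | 2020-01-03 | 무선 통신망에서 시간 민감 통신 보조 정보에 기초한 버스트 도착 시간 기준 클럭 지원 방법 및 장치 (Method and apparatus for supporting a burst arrival time reference clock based on time-sensitive communication assistance information in a wireless network) | H04L 7/02 | 1
968 | 1020200000564 | 2020-01-03 | 배차정보의 수정이 가능한 화물중개시스템 (Freight brokerage system allowing modification of dispatch information) | G06Q 10/08 | 1
969 | 1020200000746 | 2020-01-03 | 내용물 토출 용기 (Content-dispensing container) | B65D 51/32 | 1
970 | 1020200000981 | 2020-01-03 | 건강 베개 (Health pillow) | A47G 9/10 | 1
971 | 1020200000189 | 2020-01-02 | 펄스 미세 전자기장 방식의 키성장 보조장치 및 그 장치의 구동방법 (Pulsed micro-electromagnetic-field height-growth assist device and method of driving the device) | A61N 2/02 | 1
972 | 1020200000234 | 2020-01-02 | 배전선로 감시 시스템 (Distribution line monitoring system) | G01R 31/08 | 1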