Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory488.3 KiB
Average record size in memory50.0 B

Variable types

Numeric2
Text3

Dataset

Description한국지식재산연구원 지식재산전문도서관 지식재산관련 전문 단행본, 연구보고서, 전문자료 등 관련 서지정보입니다.
URLhttps://www.data.go.kr/data/15055899/fileData.do

Alerts

번호 is highly overall correlated with 발행년High correlation
발행년 is highly overall correlated with 번호High correlation
발행년 is highly skewed (γ1 = 99.60731934)Skewed
번호 has unique valuesUnique
자료명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:43:01.048968
Analysis finished2023-12-12 08:43:04.890202
Duration3.84 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8760.6792
Minimum1
Maximum17538
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:43:04.972061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile868.95
Q14438.75
median8717.5
Q313109.25
95-th percentile16685.1
Maximum17538
Range17537
Interquartile range (IQR)8670.5

Descriptive statistics

Standard deviation5048.5455
Coefficient of variation (CV)0.5762733
Kurtosis-1.1842273
Mean8760.6792
Median Absolute Deviation (MAD)4332
Skewness0.01187324
Sum87606792
Variance25487812
MonotonicityNot monotonic
2023-12-12T17:43:05.129180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5937 1
 
< 0.1%
148 1
 
< 0.1%
14234 1
 
< 0.1%
3925 1
 
< 0.1%
16923 1
 
< 0.1%
4598 1
 
< 0.1%
5720 1
 
< 0.1%
12617 1
 
< 0.1%
6688 1
 
< 0.1%
4589 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
ValueCountFrequency (%)
17538 1
< 0.1%
17537 1
< 0.1%
17536 1
< 0.1%
17533 1
< 0.1%
17530 1
< 0.1%
17529 1
< 0.1%
17527 1
< 0.1%
17521 1
< 0.1%
17520 1
< 0.1%
17518 1
< 0.1%

자료명
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:43:05.543640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length200
Median length157
Mean length32.7117
Min length1

Characters and Unicode

Total characters327117
Distinct characters2124
Distinct categories16 ?
Distinct scripts6 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowロイヤルティ料率デ?タハンドブック: 現代産業選書―知的財産?務シリ?ズ
2nd row特許訴訟 下?
3rd row(平年5年度)デ?タべ?スの法的保?に?する調査?究
4th row산업의 녹색기술개발과 표준화를 위한 법제연구[Ⅴ]: 한국의 녹색기술혁신을 위한 정책과 법의 비교연구
5th row사랑하기 때문에 : 기욤 뮈소 장편소설
ValueCountFrequency (%)
and 1554
 
2.8%
1333
 
2.4%
the 1092
 
2.0%
of 900
 
1.6%
intellectual 643
 
1.2%
in 614
 
1.1%
property 610
 
1.1%
law 536
 
1.0%
연구 413
 
0.7%
400
 
0.7%
Other values (19353) 47085
85.3%
2023-12-12T17:43:06.131171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
48181
 
14.7%
e 14686
 
4.5%
n 12172
 
3.7%
t 12051
 
3.7%
a 11115
 
3.4%
i 9885
 
3.0%
o 9793
 
3.0%
r 8665
 
2.6%
s 6294
 
1.9%
l 6251
 
1.9%
Other values (2114) 188024
57.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 124478
38.1%
Other Letter 116254
35.5%
Space Separator 48190
 
14.7%
Uppercase Letter 17159
 
5.2%
Other Punctuation 8656
 
2.6%
Decimal Number 7328
 
2.2%
Close Punctuation 2023
 
0.6%
Open Punctuation 2021
 
0.6%
Dash Punctuation 699
 
0.2%
Letter Number 154
 
< 0.1%
Other values (6) 155
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2577
 
2.2%
1649
 
1.4%
1634
 
1.4%
1627
 
1.4%
1590
 
1.4%
1524
 
1.3%
1430
 
1.2%
1358
 
1.2%
1208
 
1.0%
1126
 
1.0%
Other values (1973) 100531
86.5%
Lowercase Letter
ValueCountFrequency (%)
e 14686
11.8%
n 12172
9.8%
t 12051
9.7%
a 11115
 
8.9%
i 9885
 
7.9%
o 9793
 
7.9%
r 8665
 
7.0%
s 6294
 
5.1%
l 6251
 
5.0%
c 4822
 
3.9%
Other values (18) 28744
23.1%
Uppercase Letter
ValueCountFrequency (%)
P 1922
11.2%
I 1883
11.0%
T 1732
 
10.1%
C 1434
 
8.4%
A 1327
 
7.7%
S 1075
 
6.3%
E 987
 
5.8%
R 878
 
5.1%
L 791
 
4.6%
D 689
 
4.0%
Other values (18) 4441
25.9%
Other Punctuation
ValueCountFrequency (%)
: 3546
41.0%
? 1765
20.4%
, 1352
 
15.6%
. 717
 
8.3%
· 505
 
5.8%
& 245
 
2.8%
' 195
 
2.3%
/ 150
 
1.7%
! 88
 
1.0%
" 25
 
0.3%
Other values (13) 68
 
0.8%
Decimal Number
ValueCountFrequency (%)
0 1821
24.8%
2 1776
24.2%
1 1587
21.7%
4 401
 
5.5%
3 373
 
5.1%
9 317
 
4.3%
5 308
 
4.2%
7 273
 
3.7%
8 250
 
3.4%
6 218
 
3.0%
Other values (3) 4
 
0.1%
Letter Number
ValueCountFrequency (%)
48
31.2%
41
26.6%
21
13.6%
16
 
10.4%
9
 
5.8%
7
 
4.5%
6
 
3.9%
2
 
1.3%
2
 
1.3%
1
 
0.6%
Math Symbol
ValueCountFrequency (%)
~ 51
36.2%
= 33
23.4%
+ 21
14.9%
> 14
 
9.9%
< 13
 
9.2%
| 4
 
2.8%
2
 
1.4%
1
 
0.7%
1
 
0.7%
1
 
0.7%
Close Punctuation
ValueCountFrequency (%)
) 1918
94.8%
] 50
 
2.5%
38
 
1.9%
7
 
0.3%
5
 
0.2%
2
 
0.1%
1
 
< 0.1%
1
 
< 0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 1919
95.0%
[ 50
 
2.5%
37
 
1.8%
7
 
0.3%
5
 
0.2%
2
 
0.1%
1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 576
82.4%
122
 
17.5%
1
 
0.1%
Other Number
ValueCountFrequency (%)
1
33.3%
1
33.3%
² 1
33.3%
Space Separator
ValueCountFrequency (%)
48181
> 99.9%
  9
 
< 0.1%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 141791
43.3%
Hangul 92952
28.4%
Common 69072
21.1%
Han 15896
 
4.9%
Hiragana 4023
 
1.2%
Katakana 3383
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2577
 
2.8%
1649
 
1.8%
1634
 
1.8%
1627
 
1.8%
1590
 
1.7%
1524
 
1.6%
1430
 
1.5%
1358
 
1.5%
1208
 
1.3%
1126
 
1.2%
Other values (939) 77229
83.1%
Han
ValueCountFrequency (%)
733
 
4.6%
459
 
2.9%
411
 
2.6%
411
 
2.6%
406
 
2.6%
402
 
2.5%
387
 
2.4%
231
 
1.5%
200
 
1.3%
198
 
1.2%
Other values (882) 12058
75.9%
Katakana
ValueCountFrequency (%)
310
 
9.2%
171
 
5.1%
169
 
5.0%
133
 
3.9%
125
 
3.7%
124
 
3.7%
115
 
3.4%
113
 
3.3%
100
 
3.0%
94
 
2.8%
Other values (66) 1929
57.0%
Common
ValueCountFrequency (%)
48181
69.8%
: 3546
 
5.1%
( 1919
 
2.8%
) 1918
 
2.8%
0 1821
 
2.6%
2 1776
 
2.6%
? 1765
 
2.6%
1 1587
 
2.3%
, 1352
 
2.0%
. 717
 
1.0%
Other values (64) 4490
 
6.5%
Latin
ValueCountFrequency (%)
e 14686
 
10.4%
n 12172
 
8.6%
t 12051
 
8.5%
a 11115
 
7.8%
i 9885
 
7.0%
o 9793
 
6.9%
r 8665
 
6.1%
s 6294
 
4.4%
l 6251
 
4.4%
c 4822
 
3.4%
Other values (57) 46057
32.5%
Hiragana
ValueCountFrequency (%)
1020
25.4%
316
 
7.9%
315
 
7.8%
259
 
6.4%
150
 
3.7%
140
 
3.5%
129
 
3.2%
123
 
3.1%
93
 
2.3%
91
 
2.3%
Other values (56) 1387
34.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 209898
64.2%
Hangul 92880
28.4%
CJK 15778
 
4.8%
Hiragana 4023
 
1.2%
Katakana 3383
 
1.0%
None 682
 
0.2%
Number Forms 154
 
< 0.1%
Punctuation 125
 
< 0.1%
CJK Compat Ideographs 118
 
< 0.1%
Compat Jamo 72
 
< 0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
48181
23.0%
e 14686
 
7.0%
n 12172
 
5.8%
t 12051
 
5.7%
a 11115
 
5.3%
i 9885
 
4.7%
o 9793
 
4.7%
r 8665
 
4.1%
s 6294
 
3.0%
l 6251
 
3.0%
Other values (79) 70805
33.7%
Hangul
ValueCountFrequency (%)
2577
 
2.8%
1649
 
1.8%
1634
 
1.8%
1627
 
1.8%
1590
 
1.7%
1524
 
1.6%
1430
 
1.5%
1358
 
1.5%
1208
 
1.3%
1126
 
1.2%
Other values (937) 77157
83.1%
Hiragana
ValueCountFrequency (%)
1020
25.4%
316
 
7.9%
315
 
7.8%
259
 
6.4%
150
 
3.7%
140
 
3.5%
129
 
3.2%
123
 
3.1%
93
 
2.3%
91
 
2.3%
Other values (56) 1387
34.5%
CJK
ValueCountFrequency (%)
733
 
4.6%
459
 
2.9%
411
 
2.6%
411
 
2.6%
406
 
2.6%
402
 
2.5%
387
 
2.5%
231
 
1.5%
200
 
1.3%
198
 
1.3%
Other values (858) 11940
75.7%
None
ValueCountFrequency (%)
· 505
74.0%
38
 
5.6%
37
 
5.4%
22
 
3.2%
  9
 
1.3%
7
 
1.0%
7
 
1.0%
7
 
1.0%
5
 
0.7%
5
 
0.7%
Other values (26) 40
 
5.9%
Katakana
ValueCountFrequency (%)
310
 
9.2%
171
 
5.1%
169
 
5.0%
133
 
3.9%
125
 
3.7%
124
 
3.7%
115
 
3.4%
113
 
3.3%
100
 
3.0%
94
 
2.8%
Other values (66) 1929
57.0%
Punctuation
ValueCountFrequency (%)
122
97.6%
3
 
2.4%
Compat Jamo
ValueCountFrequency (%)
71
98.6%
1
 
1.4%
Number Forms
ValueCountFrequency (%)
48
31.2%
41
26.6%
21
13.6%
16
 
10.4%
9
 
5.8%
7
 
4.5%
6
 
3.9%
2
 
1.3%
2
 
1.3%
1
 
0.6%
CJK Compat Ideographs
ValueCountFrequency (%)
34
28.8%
13
 
11.0%
9
 
7.6%
8
 
6.8%
8
 
6.8%
4
 
3.4%
4
 
3.4%
4
 
3.4%
4
 
3.4%
4
 
3.4%
Other values (14) 26
22.0%
Math Operators
ValueCountFrequency (%)
2
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
50.0%
1
50.0%

저자
Text

Distinct6815
Distinct (%)68.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:43:06.486067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length111
Median length75
Mean length8.3839
Min length2

Characters and Unicode

Total characters83839
Distinct characters1644
Distinct categories11 ?
Distinct scripts6 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5792 ?
Unique (%)57.9%

Sample

1st row經濟産業省知的財産政策室
2nd row大?哲也
3rd rowIIP(知的財産硏究所)
4th row이유봉
5th row기욤 뮈소
ValueCountFrequency (%)
특허청 434
 
2.8%
한국지식재산연구원 244
 
1.6%
특허청(kipo 112
 
0.7%
특허심판원 97
 
0.6%
m 74
 
0.5%
united 73
 
0.5%
states 71
 
0.5%
한국지식재산연구원(kiip 70
 
0.4%
david 67
 
0.4%
a 60
 
0.4%
Other values (8594) 14323
91.7%
2023-12-12T17:43:07.097473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5638
 
6.7%
e 3639
 
4.3%
a 3254
 
3.9%
n 2631
 
3.1%
r 2545
 
3.0%
i 2440
 
2.9%
o 2190
 
2.6%
, 1895
 
2.3%
t 1834
 
2.2%
l 1690
 
2.0%
Other values (1634) 56083
66.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 36209
43.2%
Lowercase Letter 29414
35.1%
Uppercase Letter 8677
 
10.3%
Space Separator 5644
 
6.7%
Other Punctuation 2985
 
3.6%
Close Punctuation 381
 
0.5%
Open Punctuation 380
 
0.5%
Dash Punctuation 89
 
0.1%
Decimal Number 39
 
< 0.1%
Math Symbol 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
918
 
2.5%
836
 
2.3%
815
 
2.3%
814
 
2.2%
800
 
2.2%
724
 
2.0%
676
 
1.9%
655
 
1.8%
643
 
1.8%
642
 
1.8%
Other values (1547) 28686
79.2%
Lowercase Letter
ValueCountFrequency (%)
e 3639
12.4%
a 3254
11.1%
n 2631
8.9%
r 2545
 
8.7%
i 2440
 
8.3%
o 2190
 
7.4%
t 1834
 
6.2%
l 1690
 
5.7%
s 1515
 
5.2%
h 1157
 
3.9%
Other values (18) 6519
22.2%
Uppercase Letter
ValueCountFrequency (%)
P 752
 
8.7%
I 667
 
7.7%
S 647
 
7.5%
M 594
 
6.8%
C 522
 
6.0%
K 516
 
5.9%
A 515
 
5.9%
R 448
 
5.2%
B 404
 
4.7%
J 404
 
4.7%
Other values (16) 3208
37.0%
Other Punctuation
ValueCountFrequency (%)
, 1895
63.5%
? 463
 
15.5%
. 453
 
15.2%
; 96
 
3.2%
& 52
 
1.7%
' 8
 
0.3%
7
 
0.2%
· 5
 
0.2%
" 2
 
0.1%
: 2
 
0.1%
Other values (2) 2
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 13
33.3%
3 8
20.5%
2 6
15.4%
5 5
 
12.8%
4 3
 
7.7%
0 3
 
7.7%
6 1
 
2.6%
Math Symbol
ValueCountFrequency (%)
< 8
47.1%
> 8
47.1%
1
 
5.9%
Other Symbol
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Space Separator
ValueCountFrequency (%)
5638
99.9%
  6
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 371
97.6%
[ 9
 
2.4%
Close Punctuation
ValueCountFrequency (%)
) 371
97.4%
] 10
 
2.6%
Dash Punctuation
ValueCountFrequency (%)
- 66
74.2%
23
 
25.8%

Most occurring scripts

ValueCountFrequency (%)
Latin 38091
45.4%
Hangul 28470
34.0%
Common 9539
 
11.4%
Han 7191
 
8.6%
Katakana 495
 
0.6%
Hiragana 53
 
0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
194
 
2.7%
182
 
2.5%
177
 
2.5%
168
 
2.3%
166
 
2.3%
163
 
2.3%
148
 
2.1%
139
 
1.9%
135
 
1.9%
122
 
1.7%
Other values (789) 5597
77.8%
Hangul
ValueCountFrequency (%)
918
 
3.2%
836
 
2.9%
815
 
2.9%
814
 
2.9%
800
 
2.8%
724
 
2.5%
676
 
2.4%
655
 
2.3%
643
 
2.3%
642
 
2.3%
Other values (659) 20947
73.6%
Katakana
ValueCountFrequency (%)
57
 
11.5%
23
 
4.6%
21
 
4.2%
20
 
4.0%
20
 
4.0%
18
 
3.6%
18
 
3.6%
17
 
3.4%
17
 
3.4%
16
 
3.2%
Other values (55) 268
54.1%
Latin
ValueCountFrequency (%)
e 3639
 
9.6%
a 3254
 
8.5%
n 2631
 
6.9%
r 2545
 
6.7%
i 2440
 
6.4%
o 2190
 
5.7%
t 1834
 
4.8%
l 1690
 
4.4%
s 1515
 
4.0%
h 1157
 
3.0%
Other values (44) 15196
39.9%
Common
ValueCountFrequency (%)
5638
59.1%
, 1895
 
19.9%
? 463
 
4.9%
. 453
 
4.7%
( 371
 
3.9%
) 371
 
3.9%
; 96
 
1.0%
- 66
 
0.7%
& 52
 
0.5%
23
 
0.2%
Other values (23) 111
 
1.2%
Hiragana
ValueCountFrequency (%)
8
15.1%
4
 
7.5%
4
 
7.5%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
2
 
3.8%
Other values (14) 19
35.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 47580
56.8%
Hangul 28469
34.0%
CJK 7120
 
8.5%
Katakana 495
 
0.6%
CJK Compat Ideographs 71
 
0.1%
Hiragana 53
 
0.1%
None 24
 
< 0.1%
Punctuation 23
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5638
 
11.8%
e 3639
 
7.6%
a 3254
 
6.8%
n 2631
 
5.5%
r 2545
 
5.3%
i 2440
 
5.1%
o 2190
 
4.6%
, 1895
 
4.0%
t 1834
 
3.9%
l 1690
 
3.6%
Other values (66) 19824
41.7%
Hangul
ValueCountFrequency (%)
918
 
3.2%
836
 
2.9%
815
 
2.9%
814
 
2.9%
800
 
2.8%
724
 
2.5%
676
 
2.4%
655
 
2.3%
643
 
2.3%
642
 
2.3%
Other values (658) 20946
73.6%
CJK
ValueCountFrequency (%)
194
 
2.7%
182
 
2.6%
177
 
2.5%
168
 
2.4%
166
 
2.3%
163
 
2.3%
148
 
2.1%
139
 
2.0%
135
 
1.9%
122
 
1.7%
Other values (767) 5526
77.6%
Katakana
ValueCountFrequency (%)
57
 
11.5%
23
 
4.6%
21
 
4.2%
20
 
4.0%
20
 
4.0%
18
 
3.6%
18
 
3.6%
17
 
3.4%
17
 
3.4%
16
 
3.2%
Other values (55) 268
54.1%
CJK Compat Ideographs
ValueCountFrequency (%)
29
40.8%
9
 
12.7%
3
 
4.2%
3
 
4.2%
3
 
4.2%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
Other values (12) 14
19.7%
Punctuation
ValueCountFrequency (%)
23
100.0%
Hiragana
ValueCountFrequency (%)
8
15.1%
4
 
7.5%
4
 
7.5%
3
 
5.7%
3
 
5.7%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
2
 
3.8%
Other values (14) 19
35.8%
None
ValueCountFrequency (%)
7
29.2%
  6
25.0%
· 5
20.8%
ł 2
 
8.3%
1
 
4.2%
1
 
4.2%
ø 1
 
4.2%
1
 
4.2%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Distinct3440
Distinct (%)34.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:43:07.484941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length55
Mean length8.4295
Min length1

Characters and Unicode

Total characters84295
Distinct characters1018
Distinct categories11 ?
Distinct scripts6 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2284 ?
Unique (%)22.8%

Sample

1st row??産業調査?
2nd row本??格
3rd rowIIP(知的財産硏究所)
4th row한국법제연구원
5th row밝은세상
ValueCountFrequency (%)
특허청 525
 
4.0%
press 345
 
2.6%
한국지식재산연구원 232
 
1.8%
university 198
 
1.5%
박영사 181
 
1.4%
특허청(kipo 165
 
1.3%
대외경제정책연구원(kiep 147
 
1.1%
한빛지적소유권센터 140
 
1.1%
books 131
 
1.0%
한국지식재산연구원(kiip 129
 
1.0%
Other values (3071) 10859
83.2%
2023-12-12T17:43:08.088645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4468
 
5.3%
e 3174
 
3.8%
r 2602
 
3.1%
s 2383
 
2.8%
i 2279
 
2.7%
a 2040
 
2.4%
n 1897
 
2.3%
o 1894
 
2.2%
l 1589
 
1.9%
P 1546
 
1.8%
Other values (1008) 60423
71.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40143
47.6%
Lowercase Letter 26928
31.9%
Uppercase Letter 9752
 
11.6%
Space Separator 4469
 
5.3%
Other Punctuation 1013
 
1.2%
Open Punctuation 882
 
1.0%
Close Punctuation 881
 
1.0%
Decimal Number 141
 
0.2%
Dash Punctuation 79
 
0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1410
 
3.5%
1342
 
3.3%
1210
 
3.0%
1082
 
2.7%
1006
 
2.5%
977
 
2.4%
846
 
2.1%
843
 
2.1%
824
 
2.1%
747
 
1.9%
Other values (924) 29856
74.4%
Uppercase Letter
ValueCountFrequency (%)
P 1546
15.9%
I 1383
14.2%
K 824
 
8.4%
E 735
 
7.5%
S 566
 
5.8%
O 525
 
5.4%
A 465
 
4.8%
B 398
 
4.1%
C 363
 
3.7%
L 360
 
3.7%
Other values (19) 2587
26.5%
Lowercase Letter
ValueCountFrequency (%)
e 3174
11.8%
r 2602
 
9.7%
s 2383
 
8.8%
i 2279
 
8.5%
a 2040
 
7.6%
n 1897
 
7.0%
o 1894
 
7.0%
l 1589
 
5.9%
t 1481
 
5.5%
d 910
 
3.4%
Other values (16) 6679
24.8%
Other Punctuation
ValueCountFrequency (%)
? 733
72.4%
& 89
 
8.8%
. 62
 
6.1%
: 52
 
5.1%
, 32
 
3.2%
/ 15
 
1.5%
' 12
 
1.2%
; 6
 
0.6%
4
 
0.4%
# 4
 
0.4%
Other values (2) 4
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 67
47.5%
2 65
46.1%
8 4
 
2.8%
0 2
 
1.4%
5 2
 
1.4%
3 1
 
0.7%
Space Separator
ValueCountFrequency (%)
4468
> 99.9%
  1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 881
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 880
99.9%
] 1
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 63
79.7%
16
 
20.3%
Math Symbol
ValueCountFrequency (%)
| 4
80.0%
+ 1
 
20.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 36680
43.5%
Hangul 32933
39.1%
Common 7470
 
8.9%
Han 6452
 
7.7%
Katakana 649
 
0.8%
Hiragana 111
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1410
 
4.3%
1342
 
4.1%
1210
 
3.7%
1082
 
3.3%
1006
 
3.1%
977
 
3.0%
846
 
2.6%
843
 
2.6%
824
 
2.5%
747
 
2.3%
Other values (513) 22646
68.8%
Han
ValueCountFrequency (%)
447
 
6.9%
234
 
3.6%
232
 
3.6%
214
 
3.3%
197
 
3.1%
185
 
2.9%
163
 
2.5%
160
 
2.5%
159
 
2.5%
158
 
2.4%
Other values (317) 4303
66.7%
Katakana
ValueCountFrequency (%)
72
 
11.1%
41
 
6.3%
39
 
6.0%
33
 
5.1%
28
 
4.3%
26
 
4.0%
21
 
3.2%
20
 
3.1%
19
 
2.9%
18
 
2.8%
Other values (52) 332
51.2%
Latin
ValueCountFrequency (%)
e 3174
 
8.7%
r 2602
 
7.1%
s 2383
 
6.5%
i 2279
 
6.2%
a 2040
 
5.6%
n 1897
 
5.2%
o 1894
 
5.2%
l 1589
 
4.3%
P 1546
 
4.2%
t 1481
 
4.0%
Other values (45) 15795
43.1%
Common
ValueCountFrequency (%)
4468
59.8%
( 881
 
11.8%
) 880
 
11.8%
? 733
 
9.8%
& 89
 
1.2%
1 67
 
0.9%
2 65
 
0.9%
- 63
 
0.8%
. 62
 
0.8%
: 52
 
0.7%
Other values (18) 110
 
1.5%
Hiragana
ValueCountFrequency (%)
13
11.7%
11
9.9%
11
9.9%
11
9.9%
11
9.9%
11
9.9%
7
 
6.3%
6
 
5.4%
4
 
3.6%
4
 
3.6%
Other values (13) 22
19.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 44122
52.3%
Hangul 32930
39.1%
CJK 6443
 
7.6%
Katakana 649
 
0.8%
Hiragana 111
 
0.1%
Punctuation 16
 
< 0.1%
None 14
 
< 0.1%
CJK Compat Ideographs 9
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4468
 
10.1%
e 3174
 
7.2%
r 2602
 
5.9%
s 2383
 
5.4%
i 2279
 
5.2%
a 2040
 
4.6%
n 1897
 
4.3%
o 1894
 
4.3%
l 1589
 
3.6%
P 1546
 
3.5%
Other values (65) 20250
45.9%
Hangul
ValueCountFrequency (%)
1410
 
4.3%
1342
 
4.1%
1210
 
3.7%
1082
 
3.3%
1006
 
3.1%
977
 
3.0%
846
 
2.6%
843
 
2.6%
824
 
2.5%
747
 
2.3%
Other values (511) 22643
68.8%
CJK
ValueCountFrequency (%)
447
 
6.9%
234
 
3.6%
232
 
3.6%
214
 
3.3%
197
 
3.1%
185
 
2.9%
163
 
2.5%
160
 
2.5%
159
 
2.5%
158
 
2.5%
Other values (311) 4294
66.6%
Katakana
ValueCountFrequency (%)
72
 
11.1%
41
 
6.3%
39
 
6.0%
33
 
5.1%
28
 
4.3%
26
 
4.0%
21
 
3.2%
20
 
3.1%
19
 
2.9%
18
 
2.8%
Other values (52) 332
51.2%
Punctuation
ValueCountFrequency (%)
16
100.0%
Hiragana
ValueCountFrequency (%)
13
11.7%
11
9.9%
11
9.9%
11
9.9%
11
9.9%
11
9.9%
7
 
6.3%
6
 
5.4%
4
 
3.6%
4
 
3.6%
Other values (13) 22
19.8%
None
ValueCountFrequency (%)
4
28.6%
· 3
21.4%
2
14.3%
  1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
CJK Compat Ideographs
ValueCountFrequency (%)
2
22.2%
2
22.2%
2
22.2%
1
11.1%
1
11.1%
1
11.1%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

발행년
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct66
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2011.843
Minimum1600
Maximum20111
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:43:08.276757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1600
5-th percentile1996
Q12006
median2011
Q32016
95-th percentile2020
Maximum20111
Range18511
Interquartile range (IQR)10

Descriptive statistics

Standard deviation181.24629
Coefficient of variation (CV)0.090089681
Kurtosis9947.8683
Mean2011.843
Median Absolute Deviation (MAD)5
Skewness99.607319
Sum20118430
Variance32850.219
MonotonicityNot monotonic
2023-12-12T17:43:08.495476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2017 682
 
6.8%
2011 637
 
6.4%
2008 615
 
6.2%
2013 550
 
5.5%
2018 517
 
5.2%
2012 511
 
5.1%
2009 507
 
5.1%
2010 507
 
5.1%
2016 504
 
5.0%
2019 438
 
4.4%
Other values (56) 4532
45.3%
ValueCountFrequency (%)
1600 1
< 0.1%
1852 1
< 0.1%
1871 1
< 0.1%
1891 1
< 0.1%
1902 1
< 0.1%
1906 1
< 0.1%
1911 1
< 0.1%
1916 1
< 0.1%
1925 1
< 0.1%
1959 1
< 0.1%
ValueCountFrequency (%)
20111 1
 
< 0.1%
2023 8
 
0.1%
2022 133
 
1.3%
2021 248
 
2.5%
2020 308
3.1%
2019 438
4.4%
2018 517
5.2%
2017 682
6.8%
2016 504
5.0%
2015 427
4.3%

Interactions

2023-12-12T17:43:04.299322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:43:04.012757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:43:04.500929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:43:04.153809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:43:08.612792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호발행년
번호1.0000.000
발행년0.0001.000
2023-12-12T17:43:08.700469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호발행년
번호1.000-0.712
발행년-0.7121.000

Missing values

2023-12-12T17:43:04.698754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:43:04.834404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호자료명저자발행지발행년
59365937ロイヤルティ料率デ?タハンドブック: 現代産業選書―知的財産?務シリ?ズ經濟産業省知的財産政策室??産業調査?2010
84988499特許訴訟 下?大?哲也本??格2012
47794780(平年5年度)デ?タべ?スの法的保?に?する調査?究IIP(知的財産硏究所)IIP(知的財産硏究所)1996
88548855산업의 녹색기술개발과 표준화를 위한 법제연구[Ⅴ]: 한국의 녹색기술혁신을 위한 정책과 법의 비교연구이유봉한국법제연구원2012
1003410035사랑하기 때문에 : 기욤 뮈소 장편소설기욤 뮈소밝은세상2007
62376238特許廳 訓令例規集 (Ⅱ)特許廳 (특허청)특허청2014
98119812Competition Policy and Patent Law under Uncertainty : Regulating InnovationGeoffrey A. ManneCambridge University Press2011
53985399The Protection of Geographical Indications in India: A New Perspective on the French and European Experience (SAGE Law)Marie-Vivien, DelphineSAGE2015
22142215특허법 판례의 정해박병언고시계사2019
17294172952008년도 지식재산활동 실태조사한국지식재산연구원특허청2008
번호자료명저자발행지발행년
59175918Pharmaceutical Patents Issues and ConsiderationsFlorian AertsNovinka2013
56165617발명자의 지식재산 창출 실태 분석: 발명자-경영자 및 석·박사과정 발명자를 중심으로한국발명진흥회한국발명진흥회2008
1484314844Patent EthicsHricik, DavidOxford University Press2010
1736117362커뮤니케이션 중심의제시대정진홍지식산업사1998
35203521민족의 통일과 다문화사회의 갈등: 독일 문학의 예를 중심으로최윤영서울대학교출판문화원2016
1503515036Legal Writing in Plain English : A Text With ExercisesGarner, Bryan AUniversity Of Chicago Press2001
1539615397연구개발 조세지원제도 실무매뉴얼한국산업기술진흥협회한국산업기술진흥협회2009
1246912470Securing America's Industrial StrengthNational Research CouncilNational Academy1999
83828383대한민국 미래비전과 전략(대통령직속)미래기획위원회(대통령직속)미래기획위원회2013
23772378가족 파트너십 모델Davis, Hilton정담미디어2018