Overview

Dataset statistics

Number of variables4
Number of observations3373
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory108.8 KiB
Average record size in memory33.0 B

Variable types

Numeric1
Text2
DateTime1

Dataset

Description지식재산관련 인력정보(산업계,학계, 연구계, 정부, 기타) 자료로 활동정보 및 상세정보를 제공하는 자료입니다.
URLhttps://www.data.go.kr/data/15090917/fileData.do

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:03:07.886301
Analysis finished2023-12-12 09:03:08.852974
Duration0.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct3373
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1691.2117
Minimum1
Maximum3379
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size29.8 KiB
2023-12-12T18:03:08.959153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile169.6
Q1848
median1692
Q32535
95-th percentile3210.4
Maximum3379
Range3378
Interquartile range (IQR)1687

Descriptive statistics

Standard deviation975.28238
Coefficient of variation (CV)0.5766767
Kurtosis-1.1984404
Mean1691.2117
Median Absolute Deviation (MAD)844
Skewness-0.0016745488
Sum5704457
Variance951175.72
MonotonicityStrictly increasing
2023-12-12T18:03:09.180406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
2272 1
 
< 0.1%
2248 1
 
< 0.1%
2249 1
 
< 0.1%
2250 1
 
< 0.1%
2251 1
 
< 0.1%
2252 1
 
< 0.1%
2253 1
 
< 0.1%
2254 1
 
< 0.1%
2255 1
 
< 0.1%
Other values (3363) 3363
99.7%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
3379 1
< 0.1%
3378 1
< 0.1%
3377 1
< 0.1%
3376 1
< 0.1%
3375 1
< 0.1%
3374 1
< 0.1%
3373 1
< 0.1%
3372 1
< 0.1%
3371 1
< 0.1%
3370 1
< 0.1%

이름
Text

Distinct3367
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size26.5 KiB
2023-12-12T18:03:09.615497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.0501038
Min length2

Characters and Unicode

Total characters10288
Distinct characters277
Distinct categories6 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3361 ?
Unique (%)99.6%

Sample

1st row강경남
2nd row강경정
3rd row강구
4th row강기봉
5th row강동세
ValueCountFrequency (%)
김성은 2
 
0.1%
김동욱 2
 
0.1%
이재성 2
 
0.1%
나혜란 2
 
0.1%
김정민 2
 
0.1%
김인숙 2
 
0.1%
최윤효 1
 
< 0.1%
최종민 1
 
< 0.1%
최정아 1
 
< 0.1%
최재호 1
 
< 0.1%
Other values (3358) 3358
99.5%
2023-12-12T18:03:10.230381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
600
 
5.8%
469
 
4.6%
353
 
3.4%
301
 
2.9%
289
 
2.8%
208
 
2.0%
198
 
1.9%
184
 
1.8%
174
 
1.7%
170
 
1.7%
Other values (267) 7342
71.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10077
97.9%
Uppercase Letter 201
 
2.0%
Space Separator 6
 
0.1%
Decimal Number 2
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
600
 
6.0%
469
 
4.7%
353
 
3.5%
301
 
3.0%
289
 
2.9%
208
 
2.1%
198
 
2.0%
184
 
1.8%
174
 
1.7%
170
 
1.7%
Other values (257) 7131
70.8%
Uppercase Letter
ValueCountFrequency (%)
B 86
42.8%
A 84
41.8%
C 23
 
11.4%
D 7
 
3.5%
E 1
 
0.5%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10074
97.9%
Latin 201
 
2.0%
Common 10
 
0.1%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
600
 
6.0%
469
 
4.7%
353
 
3.5%
301
 
3.0%
289
 
2.9%
208
 
2.1%
198
 
2.0%
184
 
1.8%
174
 
1.7%
170
 
1.7%
Other values (254) 7128
70.8%
Latin
ValueCountFrequency (%)
B 86
42.8%
A 84
41.8%
C 23
 
11.4%
D 7
 
3.5%
E 1
 
0.5%
Common
ValueCountFrequency (%)
6
60.0%
1 1
 
10.0%
2 1
 
10.0%
) 1
 
10.0%
( 1
 
10.0%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10074
97.9%
ASCII 211
 
2.1%
CJK 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
600
 
6.0%
469
 
4.7%
353
 
3.5%
301
 
3.0%
289
 
2.9%
208
 
2.1%
198
 
2.0%
184
 
1.8%
174
 
1.7%
170
 
1.7%
Other values (254) 7128
70.8%
ASCII
ValueCountFrequency (%)
B 86
40.8%
A 84
39.8%
C 23
 
10.9%
D 7
 
3.3%
6
 
2.8%
1 1
 
0.5%
2 1
 
0.5%
) 1
 
0.5%
( 1
 
0.5%
E 1
 
0.5%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Distinct2073
Distinct (%)61.5%
Missing0
Missing (%)0.0%
Memory size26.5 KiB
2023-12-12T18:03:10.612601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length110
Median length68
Mean length9.1179958
Min length1

Characters and Unicode

Total characters30755
Distinct characters481
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1863 ?
Unique (%)55.2%

Sample

1st row신지식재산,지식재산 실태조사, 지식재산제도와 혁신활동
2nd row형사법
3rd row형사법
4th row지적재산권법
5th row지적재산권법
ValueCountFrequency (%)
지식재산 449
 
7.7%
특허 275
 
4.7%
저작권 151
 
2.6%
지식재산권 100
 
1.7%
기타 88
 
1.5%
특허법 63
 
1.1%
지적재산권법 63
 
1.1%
저작권법 62
 
1.1%
경영 62
 
1.1%
경영학 59
 
1.0%
Other values (2317) 4476
76.5%
2023-12-12T18:03:11.211457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 2639
 
8.6%
2508
 
8.2%
1150
 
3.7%
1124
 
3.7%
1011
 
3.3%
891
 
2.9%
835
 
2.7%
795
 
2.6%
779
 
2.5%
778
 
2.5%
Other values (471) 18245
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24930
81.1%
Other Punctuation 2705
 
8.8%
Space Separator 2508
 
8.2%
Uppercase Letter 345
 
1.1%
Lowercase Letter 243
 
0.8%
Decimal Number 8
 
< 0.1%
Close Punctuation 7
 
< 0.1%
Open Punctuation 7
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1150
 
4.6%
1124
 
4.5%
1011
 
4.1%
891
 
3.6%
835
 
3.3%
795
 
3.2%
779
 
3.1%
778
 
3.1%
770
 
3.1%
736
 
3.0%
Other values (417) 16061
64.4%
Lowercase Letter
ValueCountFrequency (%)
n 26
10.7%
e 26
10.7%
a 25
10.3%
t 25
10.3%
i 21
8.6%
l 19
 
7.8%
o 18
 
7.4%
r 12
 
4.9%
s 11
 
4.5%
d 8
 
3.3%
Other values (12) 52
21.4%
Uppercase Letter
ValueCountFrequency (%)
I 67
19.4%
P 48
13.9%
T 39
11.3%
D 36
10.4%
R 28
8.1%
C 20
 
5.8%
M 19
 
5.5%
A 17
 
4.9%
F 14
 
4.1%
S 12
 
3.5%
Other values (10) 45
13.0%
Other Punctuation
ValueCountFrequency (%)
, 2639
97.6%
/ 38
 
1.4%
& 18
 
0.7%
. 10
 
0.4%
Decimal Number
ValueCountFrequency (%)
4 4
50.0%
2 2
25.0%
3 1
 
12.5%
1 1
 
12.5%
Space Separator
ValueCountFrequency (%)
2508
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24930
81.1%
Common 5237
 
17.0%
Latin 588
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1150
 
4.6%
1124
 
4.5%
1011
 
4.1%
891
 
3.6%
835
 
3.3%
795
 
3.2%
779
 
3.1%
778
 
3.1%
770
 
3.1%
736
 
3.0%
Other values (417) 16061
64.4%
Latin
ValueCountFrequency (%)
I 67
 
11.4%
P 48
 
8.2%
T 39
 
6.6%
D 36
 
6.1%
R 28
 
4.8%
n 26
 
4.4%
e 26
 
4.4%
a 25
 
4.3%
t 25
 
4.3%
i 21
 
3.6%
Other values (32) 247
42.0%
Common
ValueCountFrequency (%)
, 2639
50.4%
2508
47.9%
/ 38
 
0.7%
& 18
 
0.3%
. 10
 
0.2%
) 7
 
0.1%
( 7
 
0.1%
4 4
 
0.1%
2 2
 
< 0.1%
- 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24925
81.0%
ASCII 5825
 
18.9%
Compat Jamo 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 2639
45.3%
2508
43.1%
I 67
 
1.2%
P 48
 
0.8%
T 39
 
0.7%
/ 38
 
0.7%
D 36
 
0.6%
R 28
 
0.5%
n 26
 
0.4%
e 26
 
0.4%
Other values (44) 370
 
6.4%
Hangul
ValueCountFrequency (%)
1150
 
4.6%
1124
 
4.5%
1011
 
4.1%
891
 
3.6%
835
 
3.4%
795
 
3.2%
779
 
3.1%
778
 
3.1%
770
 
3.1%
736
 
3.0%
Other values (413) 16056
64.4%
Compat Jamo
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Distinct27
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size26.5 KiB
Minimum2015-02-12 00:00:00
Maximum2022-11-18 00:00:00
2023-12-12T18:03:11.408469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:03:11.606969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)

Interactions

2023-12-12T18:03:08.503806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:03:11.736295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호등록일
번호1.0000.925
등록일0.9251.000

Missing values

2023-12-12T18:03:08.696441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:03:08.800974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호이름업종분류등록일
01강경남신지식재산,지식재산 실태조사, 지식재산제도와 혁신활동2015-02-12
12강경정형사법2015-02-12
23강구형사법2015-02-12
34강기봉지적재산권법2015-02-12
45강동세지적재산권법2015-02-12
56강명수A무역학, 상표전환2015-02-12
67강명수B지적재산권법, 공정거래법, 직무발명, 발명특허, 특허보호2015-02-12
78강민구정보법2015-02-12
89강병화개인정보보호법, 전자서명법, 저작권법, 프로그램저작물2015-02-12
910강보라경제법, 경쟁법, 지식재산권법, 특허괴물2015-02-12
번호이름업종분류등록일
33633370오민지농수해양학,농학2022-11-18
33643371오용준신소재공학2022-11-18
33653372오원용경영학2022-11-18
33663373오유원산업공학, 기계공학, 경영학2022-11-18
33673374오은식복합학, 과학기술학,기술정책2022-11-18
33683375오은해경영학2022-11-18
33693376오준호역사학,한의학, 한국어문학2022-11-18
33703377오진표농수해양학,농학2022-11-18
33713378오찬희문헌정보학2022-11-18
33723379오충현생물학,환경공학2022-11-18