Overview

Dataset statistics

Number of variables6
Number of observations185
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.3 KiB
Average record size in memory51.7 B

Variable types

Numeric3
Text1
Categorical2

Dataset

Description전 국민에게 발명 및 지식재산의 중요성을 인식시키고, 기업체, 초중고 학생, 발명교사, 대학생, 개인에 이르기까지 지식재산 분야의 다양한 온라인 교육 정보 안내
URLhttps://www.data.go.kr/data/15069205/fileData.do

Alerts

대상 is highly overall correlated with 번호 and 2 other fieldsHigh correlation
구분 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
번호 is highly overall correlated with 대상 and 1 other fieldsHigh correlation
차시 is highly overall correlated with 대상High correlation
대상 is highly imbalanced (50.5%)Imbalance
번호 has unique valuesUnique
콘텐츠명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 02:19:32.686922
Analysis finished2023-12-12 02:19:34.109605
Duration1.42 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct185
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean93
Minimum1
Maximum185
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-12T11:19:34.198061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10.2
Q147
median93
Q3139
95-th percentile175.8
Maximum185
Range184
Interquartile range (IQR)92

Descriptive statistics

Standard deviation53.549043
Coefficient of variation (CV)0.57579616
Kurtosis-1.2
Mean93
Median Absolute Deviation (MAD)46
Skewness0
Sum17205
Variance2867.5
MonotonicityStrictly increasing
2023-12-12T11:19:34.383878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
128 1
 
0.5%
119 1
 
0.5%
120 1
 
0.5%
121 1
 
0.5%
122 1
 
0.5%
123 1
 
0.5%
124 1
 
0.5%
125 1
 
0.5%
126 1
 
0.5%
Other values (175) 175
94.6%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
185 1
0.5%
184 1
0.5%
183 1
0.5%
182 1
0.5%
181 1
0.5%
180 1
0.5%
179 1
0.5%
178 1
0.5%
177 1
0.5%
176 1
0.5%

콘텐츠명
Text

UNIQUE 

Distinct185
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-12T11:19:34.636738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length40
Mean length25.605405
Min length8

Characters and Unicode

Total characters4737
Distinct characters310
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique185 ?
Unique (%)100.0%

Sample

1st row기술이전 및 사업화 전략(10차시)
2nd rowCIIP-특허법률영어(20차시)
3rd rowIP Panorama-글로벌 IP경영과정(10차시)
4th row실전! 의견서·보정서 작성(5차시)
5th row특허침해판단과 청구범위해석(4차시)
ValueCountFrequency (%)
26
 
4.2%
지식재산심판 20
 
3.2%
디자인 19
 
3.1%
경영과 16
 
2.6%
브랜드 16
 
2.6%
전략(브랜드 7
 
1.1%
지식재산권 7
 
1.1%
연구개발과 6
 
1.0%
인터넷과 6
 
1.0%
소송실무(특허 5
 
0.8%
Other values (422) 489
79.3%
2023-12-12T11:19:35.108904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
432
 
9.1%
) 282
 
6.0%
( 282
 
6.0%
184
 
3.9%
183
 
3.9%
146
 
3.1%
, 127
 
2.7%
122
 
2.6%
120
 
2.5%
118
 
2.5%
Other values (300) 2741
57.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3262
68.9%
Space Separator 432
 
9.1%
Close Punctuation 282
 
6.0%
Open Punctuation 282
 
6.0%
Decimal Number 230
 
4.9%
Other Punctuation 132
 
2.8%
Lowercase Letter 58
 
1.2%
Uppercase Letter 46
 
1.0%
Dash Punctuation 13
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
184
 
5.6%
183
 
5.6%
146
 
4.5%
122
 
3.7%
120
 
3.7%
118
 
3.6%
72
 
2.2%
64
 
2.0%
61
 
1.9%
56
 
1.7%
Other values (254) 2136
65.5%
Lowercase Letter
ValueCountFrequency (%)
o 8
13.8%
n 7
12.1%
a 7
12.1%
r 6
10.3%
t 6
10.3%
e 5
8.6%
i 4
6.9%
f 2
 
3.4%
u 2
 
3.4%
s 2
 
3.4%
Other values (7) 9
15.5%
Decimal Number
ValueCountFrequency (%)
1 56
24.3%
2 41
17.8%
6 29
12.6%
3 27
11.7%
5 23
10.0%
4 19
 
8.3%
0 15
 
6.5%
8 8
 
3.5%
7 7
 
3.0%
9 5
 
2.2%
Uppercase Letter
ValueCountFrequency (%)
I 14
30.4%
P 12
26.1%
T 4
 
8.7%
C 4
 
8.7%
R 3
 
6.5%
G 2
 
4.3%
Z 2
 
4.3%
O 2
 
4.3%
E 2
 
4.3%
D 1
 
2.2%
Other Punctuation
ValueCountFrequency (%)
, 127
96.2%
! 2
 
1.5%
· 1
 
0.8%
& 1
 
0.8%
? 1
 
0.8%
Space Separator
ValueCountFrequency (%)
432
100.0%
Close Punctuation
ValueCountFrequency (%)
) 282
100.0%
Open Punctuation
ValueCountFrequency (%)
( 282
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3262
68.9%
Common 1371
28.9%
Latin 104
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
184
 
5.6%
183
 
5.6%
146
 
4.5%
122
 
3.7%
120
 
3.7%
118
 
3.6%
72
 
2.2%
64
 
2.0%
61
 
1.9%
56
 
1.7%
Other values (254) 2136
65.5%
Latin
ValueCountFrequency (%)
I 14
13.5%
P 12
 
11.5%
o 8
 
7.7%
n 7
 
6.7%
a 7
 
6.7%
r 6
 
5.8%
t 6
 
5.8%
e 5
 
4.8%
i 4
 
3.8%
T 4
 
3.8%
Other values (17) 31
29.8%
Common
ValueCountFrequency (%)
432
31.5%
) 282
20.6%
( 282
20.6%
, 127
 
9.3%
1 56
 
4.1%
2 41
 
3.0%
6 29
 
2.1%
3 27
 
2.0%
5 23
 
1.7%
4 19
 
1.4%
Other values (9) 53
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3262
68.9%
ASCII 1474
31.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
432
29.3%
) 282
19.1%
( 282
19.1%
, 127
 
8.6%
1 56
 
3.8%
2 41
 
2.8%
6 29
 
2.0%
3 27
 
1.8%
5 23
 
1.6%
4 19
 
1.3%
Other values (35) 156
 
10.6%
Hangul
ValueCountFrequency (%)
184
 
5.6%
183
 
5.6%
146
 
4.5%
122
 
3.7%
120
 
3.7%
118
 
3.6%
72
 
2.2%
64
 
2.0%
61
 
1.9%
56
 
1.7%
Other values (254) 2136
65.5%
None
ValueCountFrequency (%)
· 1
100.0%

대상
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
일반인
153 
청소년
26 
교원
 
6

Length

Max length3
Median length3
Mean length2.9675676
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반인
2nd row일반인
3rd row일반인
4th row일반인
5th row일반인

Common Values

ValueCountFrequency (%)
일반인 153
82.7%
청소년 26
 
14.1%
교원 6
 
3.2%

Length

2023-12-12T11:19:35.289509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:19:35.406420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반인 153
82.7%
청소년 26
 
14.1%
교원 6
 
3.2%

구분
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
보호(1단계)
42 
경영(1단계)
23 
경영(2단계)
23 
권리화(1단계)
22 
초급과정
15 
Other values (10)
60 

Length

Max length8
Median length7
Mean length6.627027
Min length3

Unique

Unique2 ?
Unique (%)1.1%

Sample

1st row활용(1단계)
2nd row권리화(2단계)
3rd row경영(1단계)
4th row권리화(2단계)
5th row보호(1단계)

Common Values

ValueCountFrequency (%)
보호(1단계) 42
22.7%
경영(1단계) 23
12.4%
경영(2단계) 23
12.4%
권리화(1단계) 22
11.9%
초급과정 15
 
8.1%
창출(1단계) 14
 
7.6%
활용(1단계) 11
 
5.9%
권리화(2단계) 11
 
5.9%
연수용 6
 
3.2%
창출(2단계) 5
 
2.7%
Other values (5) 13
 
7.0%

Length

2023-12-12T11:19:35.553875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보호(1단계 42
22.7%
경영(1단계 23
12.4%
경영(2단계 23
12.4%
권리화(1단계 22
11.9%
초급과정 15
 
8.1%
창출(1단계 14
 
7.6%
활용(1단계 11
 
5.9%
권리화(2단계 11
 
5.9%
연수용 6
 
3.2%
창출(2단계 5
 
2.7%
Other values (5) 13
 
7.0%

차시
Real number (ℝ)

HIGH CORRELATION 

Distinct18
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.5243243
Minimum0
Maximum30
Zeros1
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-12T11:19:35.710231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q13
median5
Q310
95-th percentile26
Maximum30
Range30
Interquartile range (IQR)7

Descriptive statistics

Standard deviation7.513083
Coefficient of variation (CV)0.998506
Kurtosis1.4672531
Mean7.5243243
Median Absolute Deviation (MAD)3
Skewness1.5559501
Sum1392
Variance56.446416
MonotonicityNot monotonic
2023-12-12T11:19:35.880163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
1 28
15.1%
3 25
13.5%
4 18
9.7%
2 17
9.2%
5 16
8.6%
26 16
8.6%
6 13
7.0%
10 13
7.0%
15 11
 
5.9%
8 7
 
3.8%
Other values (8) 21
11.4%
ValueCountFrequency (%)
0 1
 
0.5%
1 28
15.1%
2 17
9.2%
3 25
13.5%
4 18
9.7%
5 16
8.6%
6 13
7.0%
7 6
 
3.2%
8 7
 
3.8%
9 5
 
2.7%
ValueCountFrequency (%)
30 2
 
1.1%
26 16
8.6%
20 2
 
1.1%
18 1
 
0.5%
17 1
 
0.5%
15 11
5.9%
12 3
 
1.6%
10 13
7.0%
9 5
 
2.7%
8 7
3.8%

개발연도
Real number (ℝ)

Distinct15
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2015.8703
Minimum2004
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-12T11:19:36.009936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2004
5-th percentile2009
Q12015
median2017
Q32018
95-th percentile2018
Maximum2019
Range15
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.1110659
Coefficient of variation (CV)0.0015432868
Kurtosis1.8836677
Mean2015.8703
Median Absolute Deviation (MAD)1
Skewness-1.6075906
Sum372936
Variance9.6787309
MonotonicityNot monotonic
2023-12-12T11:19:36.122991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
2018 83
44.9%
2017 42
22.7%
2013 13
 
7.0%
2015 11
 
5.9%
2012 8
 
4.3%
2010 5
 
2.7%
2011 5
 
2.7%
2009 4
 
2.2%
2016 4
 
2.2%
2007 3
 
1.6%
Other values (5) 7
 
3.8%
ValueCountFrequency (%)
2004 1
 
0.5%
2005 1
 
0.5%
2007 3
 
1.6%
2008 2
 
1.1%
2009 4
 
2.2%
2010 5
 
2.7%
2011 5
 
2.7%
2012 8
4.3%
2013 13
7.0%
2014 2
 
1.1%
ValueCountFrequency (%)
2019 1
 
0.5%
2018 83
44.9%
2017 42
22.7%
2016 4
 
2.2%
2015 11
 
5.9%
2014 2
 
1.1%
2013 13
 
7.0%
2012 8
 
4.3%
2011 5
 
2.7%
2010 5
 
2.7%

Interactions

2023-12-12T11:19:33.623748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:19:33.058023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:19:33.338838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:19:33.720232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:19:33.145816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:19:33.448907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:19:33.816429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:19:33.240058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:19:33.534648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:19:36.210733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호대상구분차시개발연도
번호1.0000.7970.8600.5070.882
대상0.7971.0001.0000.8300.630
구분0.8601.0001.0000.6550.689
차시0.5070.8300.6551.0000.567
개발연도0.8820.6300.6890.5671.000
2023-12-12T11:19:36.342082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대상구분
대상1.0000.966
구분0.9661.000
2023-12-12T11:19:36.449022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호차시개발연도대상구분
번호1.0000.0210.3690.6740.535
차시0.0211.000-0.2970.5350.318
개발연도0.369-0.2971.0000.4680.329
대상0.6740.5350.4681.0000.966
구분0.5350.3180.3290.9661.000

Missing values

2023-12-12T11:19:33.955311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:19:34.064437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호콘텐츠명대상구분차시개발연도
01기술이전 및 사업화 전략(10차시)일반인활용(1단계)102007
12CIIP-특허법률영어(20차시)일반인권리화(2단계)202007
23IP Panorama-글로벌 IP경영과정(10차시)일반인경영(1단계)102005
34실전! 의견서·보정서 작성(5차시)일반인권리화(2단계)52008
45특허침해판단과 청구범위해석(4차시)일반인보호(1단계)02004
56특허침해 손해배상액 산정실무(1차시,현장강의)일반인보호(1단계)12009
67선행기술검색 노하우(5차시)일반인경영(2단계)52009
78해외특허출원시 유의사항(10차시)일반인권리화(2단계)102009
89상표등록 왜 필요한가?(1차시)일반인보호(1단계)12009
910TRIZ를 활용한 기술혁신(15차시)일반인창출(1단계)152010
번호콘텐츠명대상구분차시개발연도
175176저작권과 친구될래요-중등용(3차시)청소년중급과정32015
176177플립러닝 우수학교 현장촬영(1차시)청소년교사과정12016
177178플립러닝 우수학교 현장촬영(4차시)청소년교사과정42017
178179트리즈로 멋진 발명품 만들기(3차시)청소년초급과정32018
179180교과 속에서 찾은 발명이야기교원연수용302014
180181실전발명영재교육교원연수용302015
181182교실을 바꾸는 G-러닝 발명이야기교원연수용152017
182183초등과학연계 발명수업의 실제교원연수용152017
183184내 발명수업을 레벨업 시키는 TRIZ교원연수용152018
184185선생님과 함께하는 에디슨 육성 프로젝트교원연수용152018