Overview

Dataset statistics

Number of variables7
Number of observations120
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.8 KiB
Average record size in memory58.1 B

Variable types

Categorical5
Text1
DateTime1

Dataset

DescriptionR&D전문기관인 한국에너지기술평가원에서 담당하는 에너지기술개발사업의 수행성과로서 논문 게재 및 특허 등록/출원 성과에 관한 정보
Author한국에너지기술평가원
URLhttps://www.data.go.kr/data/15104700/fileData.do

Alerts

데이터기준일 has constant value ""Constant
세부항목 is highly overall correlated with 구분 and 1 other fieldsHigh correlation
구분 is highly overall correlated with 구분상세 and 1 other fieldsHigh correlation
구분상세 is highly overall correlated with 구분 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 18:16:17.977012
Analysis finished2023-12-12 18:16:18.427613
Duration0.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
논문
60 
특허
60 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row논문
2nd row논문
3rd row논문
4th row논문
5th row논문

Common Values

ValueCountFrequency (%)
논문 60
50.0%
특허 60
50.0%

Length

2023-12-13T03:16:18.501370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:16:18.629515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
논문 60
50.0%
특허 60
50.0%

구분상세
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
SCI논문
30 
일반논문
30 
출원특허
30 
등록특허
30 

Length

Max length5
Median length4
Mean length4.25
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSCI논문
2nd rowSCI논문
3rd rowSCI논문
4th rowSCI논문
5th rowSCI논문

Common Values

ValueCountFrequency (%)
SCI논문 30
25.0%
일반논문 30
25.0%
출원특허 30
25.0%
등록특허 30
25.0%

Length

2023-12-13T03:16:18.752492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:16:18.956728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
sci논문 30
25.0%
일반논문 30
25.0%
출원특허 30
25.0%
등록특허 30
25.0%

국내외여부
Categorical

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
국내
60 
국외
60 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내
2nd row국내
3rd row국내
4th row국내
5th row국내

Common Values

ValueCountFrequency (%)
국내 60
50.0%
국외 60
50.0%

Length

2023-12-13T03:16:19.079220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:16:19.179870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내 60
50.0%
국외 60
50.0%

세부항목
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
종료연도별 논문 건수
20 
출연금 10억원당 논문 건수
20 
종료연도별 과제당 논문 건수
20 
종료연도별 특허 건수
20 
출연금 10억원당 특허 건수
20 

Length

Max length15
Median length13
Mean length12.666667
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종료연도별 논문 건수
2nd row종료연도별 논문 건수
3rd row종료연도별 논문 건수
4th row종료연도별 논문 건수
5th row종료연도별 논문 건수

Common Values

ValueCountFrequency (%)
종료연도별 논문 건수 20
16.7%
출연금 10억원당 논문 건수 20
16.7%
종료연도별 과제당 논문 건수 20
16.7%
종료연도별 특허 건수 20
16.7%
출연금 10억원당 특허 건수 20
16.7%
과제당 특허 건수 20
16.7%

Length

2023-12-13T03:16:19.319823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:16:19.505517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건수 120
28.6%
종료연도별 60
14.3%
논문 60
14.3%
특허 60
14.3%
출연금 40
 
9.5%
10억원당 40
 
9.5%
과제당 40
 
9.5%

연도
Categorical

Distinct5
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2016
24 
2017
24 
2018
24 
2019
24 
2020
24 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2016
2nd row2017
3rd row2018
4th row2019
5th row2020

Common Values

ValueCountFrequency (%)
2016 24
20.0%
2017 24
20.0%
2018 24
20.0%
2019 24
20.0%
2020 24
20.0%

Length

2023-12-13T03:16:19.663145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:16:20.151116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2016 24
20.0%
2017 24
20.0%
2018 24
20.0%
2019 24
20.0%
2020 24
20.0%
Distinct105
Distinct (%)87.5%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T03:16:20.499588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length4
Mean length4.1583333
Min length2

Characters and Unicode

Total characters499
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique91 ?
Unique (%)75.8%

Sample

1st row50.3
2nd row56.1
3rd row40.7
4th row40.6
5th row15.8
ValueCountFrequency (%)
0.06 3
 
2.5%
0.55 2
 
1.7%
0.51 2
 
1.7%
0.29 2
 
1.7%
0.07 2
 
1.7%
0.34 2
 
1.7%
0.16 2
 
1.7%
295.1 2
 
1.7%
0.17 2
 
1.7%
1.21 2
 
1.7%
Other values (95) 99
82.5%
2023-12-13T03:16:20.977022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 115
23.0%
0 73
14.6%
2 55
11.0%
1 52
10.4%
3 37
 
7.4%
6 36
 
7.2%
5 33
 
6.6%
7 29
 
5.8%
4 24
 
4.8%
9 23
 
4.6%
Other values (2) 22
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 382
76.6%
Other Punctuation 117
 
23.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 73
19.1%
2 55
14.4%
1 52
13.6%
3 37
9.7%
6 36
9.4%
5 33
8.6%
7 29
 
7.6%
4 24
 
6.3%
9 23
 
6.0%
8 20
 
5.2%
Other Punctuation
ValueCountFrequency (%)
. 115
98.3%
, 2
 
1.7%

Most occurring scripts

ValueCountFrequency (%)
Common 499
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
. 115
23.0%
0 73
14.6%
2 55
11.0%
1 52
10.4%
3 37
 
7.4%
6 36
 
7.2%
5 33
 
6.6%
7 29
 
5.8%
4 24
 
4.8%
9 23
 
4.6%
Other values (2) 22
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 499
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 115
23.0%
0 73
14.6%
2 55
11.0%
1 52
10.4%
3 37
 
7.4%
6 36
 
7.2%
5 33
 
6.6%
7 29
 
5.8%
4 24
 
4.8%
9 23
 
4.6%
Other values (2) 22
 
4.4%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
Minimum2022-09-01 00:00:00
Maximum2022-09-01 00:00:00
2023-12-13T03:16:21.101737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:16:21.225636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-13T03:16:21.316675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분구분상세국내외여부세부항목연도
구분1.0001.0000.0001.0000.000
구분상세1.0001.0000.0000.7190.000
국내외여부0.0000.0001.0000.0000.000
세부항목1.0000.7190.0001.0000.000
연도0.0000.0000.0000.0001.000
2023-12-13T03:16:21.444693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세부항목구분연도구분상세국내외여부
세부항목1.0000.9830.0000.5470.000
구분0.9831.0000.0000.9910.000
연도0.0000.0001.0000.0000.000
구분상세0.5470.9910.0001.0000.000
국내외여부0.0000.0000.0000.0001.000
2023-12-13T03:16:21.560024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분구분상세국내외여부세부항목연도
구분1.0000.9910.0000.9830.000
구분상세0.9911.0000.0000.5470.000
국내외여부0.0000.0001.0000.0000.000
세부항목0.9830.5470.0001.0000.000
연도0.0000.0000.0000.0001.000

Missing values

2023-12-13T03:16:18.264132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:16:18.382842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분구분상세국내외여부세부항목연도건수(개_건)데이터기준일
0논문SCI논문국내종료연도별 논문 건수201650.32022-09-01
1논문SCI논문국내종료연도별 논문 건수201756.12022-09-01
2논문SCI논문국내종료연도별 논문 건수201840.72022-09-01
3논문SCI논문국내종료연도별 논문 건수201940.62022-09-01
4논문SCI논문국내종료연도별 논문 건수202015.82022-09-01
5논문SCI논문국외종료연도별 논문 건수2016722.92022-09-01
6논문SCI논문국외종료연도별 논문 건수2017853.52022-09-01
7논문SCI논문국외종료연도별 논문 건수2018463.82022-09-01
8논문SCI논문국외종료연도별 논문 건수2019500.92022-09-01
9논문SCI논문국외종료연도별 논문 건수2020256.22022-09-01
구분구분상세국내외여부세부항목연도건수(개_건)데이터기준일
110특허등록특허국내과제당 특허 건수20161.882022-09-01
111특허등록특허국내과제당 특허 건수20171.612022-09-01
112특허등록특허국내과제당 특허 건수20182.232022-09-01
113특허등록특허국내과제당 특허 건수20192.312022-09-01
114특허등록특허국내과제당 특허 건수20201.262022-09-01
115특허등록특허국외과제당 특허 건수20160.342022-09-01
116특허등록특허국외과제당 특허 건수20170.172022-09-01
117특허등록특허국외과제당 특허 건수20180.292022-09-01
118특허등록특허국외과제당 특허 건수20190.172022-09-01
119특허등록특허국외과제당 특허 건수20200.042022-09-01