Overview

Dataset statistics

Number of variables9
Number of observations29
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory80.6 B

Variable types

Categorical2
Numeric4
Text1
DateTime2

Dataset

Description한국지역난방공사와 중소기업 간 R&D 과제 현황에 관한 정보로 과제명, 연구기간, 연구비, 지원비에 대한 정보를 제공합니다.
Author한국지역난방공사
URLhttps://www.data.go.kr/data/15090332/fileData.do

Alerts

단위 has constant value ""Constant
연도 is highly overall correlated with 과제종류High correlation
연구기간 is highly overall correlated with 총연구비_백만원 and 2 other fieldsHigh correlation
총연구비_백만원 is highly overall correlated with 연구기간 and 1 other fieldsHigh correlation
우리공사지원내역_백만원 is highly overall correlated with 연구기간 and 1 other fieldsHigh correlation
과제종류 is highly overall correlated with 연도 and 1 other fieldsHigh correlation
과제명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:15:21.925291
Analysis finished2023-12-12 23:15:24.487763
Duration2.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

과제종류
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)20.7%
Missing0
Missing (%)0.0%
Memory size364.0 B
민관공동투자기술개발사업
구매조건부신제품개발사업(공동투자형)
협력연구개발사업
구매조건부신제품개발사업
수·위탁기업간네트워크형공동사업

Length

Max length19
Median length16
Mean length13.931034
Min length8

Unique

Unique1 ?
Unique (%)3.4%

Sample

1st row협력연구개발사업
2nd row민관공동투자기술개발사업
3rd row민관공동투자기술개발사업
4th row민관공동투자기술개발사업
5th row민관공동투자기술개발사업

Common Values

ValueCountFrequency (%)
민관공동투자기술개발사업 9
31.0%
구매조건부신제품개발사업(공동투자형) 7
24.1%
협력연구개발사업 4
13.8%
구매조건부신제품개발사업 4
13.8%
수·위탁기업간네트워크형공동사업 4
13.8%
구매조건부신제품개발사업(구매연계형) 1
 
3.4%

Length

2023-12-13T08:15:24.579725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:15:24.684029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민관공동투자기술개발사업 9
31.0%
구매조건부신제품개발사업(공동투자형 7
24.1%
협력연구개발사업 4
13.8%
구매조건부신제품개발사업 4
13.8%
수·위탁기업간네트워크형공동사업 4
13.8%
구매조건부신제품개발사업(구매연계형 1
 
3.4%

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)27.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2018.2069
Minimum2015
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-13T08:15:24.792622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2015
5-th percentile2015
Q12016
median2018
Q32020
95-th percentile2021
Maximum2022
Range7
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.2579907
Coefficient of variation (CV)0.0011188103
Kurtosis-1.3544096
Mean2018.2069
Median Absolute Deviation (MAD)2
Skewness-0.036204695
Sum58528
Variance5.0985222
MonotonicityIncreasing
2023-12-13T08:15:24.916301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
2015 5
17.2%
2020 5
17.2%
2021 5
17.2%
2017 4
13.8%
2018 4
13.8%
2016 3
10.3%
2019 2
 
6.9%
2022 1
 
3.4%
ValueCountFrequency (%)
2015 5
17.2%
2016 3
10.3%
2017 4
13.8%
2018 4
13.8%
2019 2
 
6.9%
2020 5
17.2%
2021 5
17.2%
2022 1
 
3.4%
ValueCountFrequency (%)
2022 1
 
3.4%
2021 5
17.2%
2020 5
17.2%
2019 2
 
6.9%
2018 4
13.8%
2017 4
13.8%
2016 3
10.3%
2015 5
17.2%

과제명
Text

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-13T08:15:25.145111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length32
Mean length27.413793
Min length13

Characters and Unicode

Total characters795
Distinct characters219
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st row액화질소를이용한열수송관(300A)차수공법개발
2nd row다기능모듈형브리더국산화개발
3rd row유량조절이가능한이중기밀볼밸브개발
4th row급탕및난방수공급배관용보급형전자기장복합변조방식스케일제거장치개발
5th row열배관유효에너지활용과에너지효율향상을위한전력공급장치개발
ValueCountFrequency (%)
개발 6
 
8.6%
4
 
5.7%
진단 2
 
2.9%
가상발전소향 1
 
1.4%
태양광발전량 1
 
1.4%
예측 1
 
1.4%
통합운영 1
 
1.4%
플랫폼 1
 
1.4%
구역전기계통 1
 
1.4%
건전성 1
 
1.4%
Other values (51) 51
72.9%
2023-12-13T08:15:25.481659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41
 
5.2%
28
 
3.5%
27
 
3.4%
18
 
2.3%
e 13
 
1.6%
s 12
 
1.5%
12
 
1.5%
11
 
1.4%
11
 
1.4%
11
 
1.4%
Other values (209) 611
76.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 559
70.3%
Lowercase Letter 94
 
11.8%
Uppercase Letter 49
 
6.2%
Space Separator 41
 
5.2%
Decimal Number 25
 
3.1%
Open Punctuation 9
 
1.1%
Close Punctuation 9
 
1.1%
Other Punctuation 6
 
0.8%
Dash Punctuation 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
28
 
5.0%
27
 
4.8%
18
 
3.2%
12
 
2.1%
11
 
2.0%
11
 
2.0%
11
 
2.0%
10
 
1.8%
9
 
1.6%
9
 
1.6%
Other values (163) 413
73.9%
Lowercase Letter
ValueCountFrequency (%)
e 13
13.8%
s 12
12.8%
i 9
9.6%
n 8
8.5%
t 7
 
7.4%
r 7
 
7.4%
l 6
 
6.4%
a 5
 
5.3%
m 4
 
4.3%
g 4
 
4.3%
Other values (8) 19
20.2%
Uppercase Letter
ValueCountFrequency (%)
C 7
14.3%
T 5
10.2%
A 5
10.2%
S 4
8.2%
G 4
8.2%
M 3
 
6.1%
H 3
 
6.1%
D 3
 
6.1%
V 3
 
6.1%
I 3
 
6.1%
Other values (6) 9
18.4%
Decimal Number
ValueCountFrequency (%)
0 9
36.0%
2 4
16.0%
1 4
16.0%
3 4
16.0%
5 3
 
12.0%
6 1
 
4.0%
Other Punctuation
ValueCountFrequency (%)
, 5
83.3%
· 1
 
16.7%
Space Separator
ValueCountFrequency (%)
41
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 559
70.3%
Latin 143
 
18.0%
Common 93
 
11.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
 
5.0%
27
 
4.8%
18
 
3.2%
12
 
2.1%
11
 
2.0%
11
 
2.0%
11
 
2.0%
10
 
1.8%
9
 
1.6%
9
 
1.6%
Other values (163) 413
73.9%
Latin
ValueCountFrequency (%)
e 13
 
9.1%
s 12
 
8.4%
i 9
 
6.3%
n 8
 
5.6%
t 7
 
4.9%
r 7
 
4.9%
C 7
 
4.9%
l 6
 
4.2%
a 5
 
3.5%
T 5
 
3.5%
Other values (24) 64
44.8%
Common
ValueCountFrequency (%)
41
44.1%
( 9
 
9.7%
0 9
 
9.7%
) 9
 
9.7%
, 5
 
5.4%
2 4
 
4.3%
1 4
 
4.3%
3 4
 
4.3%
5 3
 
3.2%
- 3
 
3.2%
Other values (2) 2
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 559
70.3%
ASCII 235
29.6%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
41
 
17.4%
e 13
 
5.5%
s 12
 
5.1%
( 9
 
3.8%
0 9
 
3.8%
) 9
 
3.8%
i 9
 
3.8%
n 8
 
3.4%
t 7
 
3.0%
r 7
 
3.0%
Other values (35) 111
47.2%
Hangul
ValueCountFrequency (%)
28
 
5.0%
27
 
4.8%
18
 
3.2%
12
 
2.1%
11
 
2.0%
11
 
2.0%
11
 
2.0%
10
 
1.8%
9
 
1.6%
9
 
1.6%
Other values (163) 413
73.9%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct19
Distinct (%)65.5%
Missing0
Missing (%)0.0%
Memory size364.0 B
Minimum2015-06-18 00:00:00
Maximum2022-06-10 00:00:00
2023-12-13T08:15:25.590453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:25.697764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
Distinct22
Distinct (%)75.9%
Missing0
Missing (%)0.0%
Memory size364.0 B
Minimum2017-01-31 00:00:00
Maximum2023-09-12 00:00:00
2023-12-13T08:15:25.800146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:25.898339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)

연구기간
Real number (ℝ)

HIGH CORRELATION 

Distinct7
Distinct (%)24.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.586207
Minimum7
Maximum27
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-13T08:15:26.005070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7
5-th percentile7
Q112
median18
Q324
95-th percentile24
Maximum27
Range20
Interquartile range (IQR)12

Descriptive statistics

Standard deviation6.7162768
Coefficient of variation (CV)0.38190594
Kurtosis-1.5295849
Mean17.586207
Median Absolute Deviation (MAD)6
Skewness-0.27526947
Sum510
Variance45.108374
MonotonicityNot monotonic
2023-12-13T08:15:26.113666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
24 9
31.0%
12 7
24.1%
7 4
13.8%
23 4
13.8%
15 3
 
10.3%
18 1
 
3.4%
27 1
 
3.4%
ValueCountFrequency (%)
7 4
13.8%
12 7
24.1%
15 3
 
10.3%
18 1
 
3.4%
23 4
13.8%
24 9
31.0%
27 1
 
3.4%
ValueCountFrequency (%)
27 1
 
3.4%
24 9
31.0%
23 4
13.8%
18 1
 
3.4%
15 3
 
10.3%
12 7
24.1%
7 4
13.8%

단위
Categorical

CONSTANT 

Distinct1
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size364.0 B
개월
29 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개월
2nd row개월
3rd row개월
4th row개월
5th row개월

Common Values

ValueCountFrequency (%)
개월 29
100.0%

Length

2023-12-13T08:15:26.233806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:15:26.313501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개월 29
100.0%

총연구비_백만원
Real number (ℝ)

HIGH CORRELATION 

Distinct26
Distinct (%)89.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean483.51724
Minimum50
Maximum1350
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-13T08:15:26.415924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum50
5-th percentile70
Q1189
median380
Q3637
95-th percentile1288.4
Maximum1350
Range1300
Interquartile range (IQR)448

Descriptive statistics

Standard deviation389.20272
Coefficient of variation (CV)0.80494073
Kurtosis0.076909806
Mean483.51724
Median Absolute Deviation (MAD)211
Skewness1.0178954
Sum14022
Variance151478.76
MonotonicityNot monotonic
2023-12-13T08:15:26.565504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
100 3
 
10.3%
50 2
 
6.9%
512 1
 
3.4%
190 1
 
3.4%
360 1
 
3.4%
1220 1
 
3.4%
1123 1
 
3.4%
648 1
 
3.4%
1350 1
 
3.4%
169 1
 
3.4%
Other values (16) 16
55.2%
ValueCountFrequency (%)
50 2
6.9%
100 3
10.3%
169 1
 
3.4%
180 1
 
3.4%
189 1
 
3.4%
190 1
 
3.4%
231 1
 
3.4%
240 1
 
3.4%
275 1
 
3.4%
288 1
 
3.4%
ValueCountFrequency (%)
1350 1
3.4%
1334 1
3.4%
1220 1
3.4%
1123 1
3.4%
878 1
3.4%
870 1
3.4%
648 1
3.4%
637 1
3.4%
626 1
3.4%
575 1
3.4%

우리공사지원내역_백만원
Real number (ℝ)

HIGH CORRELATION 

Distinct24
Distinct (%)82.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean192
Minimum15
Maximum600
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-13T08:15:26.687188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile20.2
Q163
median103
Q3288
95-th percentile500
Maximum600
Range585
Interquartile range (IQR)225

Descriptive statistics

Standard deviation174.18607
Coefficient of variation (CV)0.9072191
Kurtosis-0.2648918
Mean192
Median Absolute Deviation (MAD)75
Skewness0.9625553
Sum5568
Variance30340.786
MonotonicityNot monotonic
2023-12-13T08:15:26.803856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
84 2
 
6.9%
30 2
 
6.9%
15 2
 
6.9%
75 2
 
6.9%
500 2
 
6.9%
384 1
 
3.4%
28 1
 
3.4%
270 1
 
3.4%
499 1
 
3.4%
288 1
 
3.4%
Other values (14) 14
48.3%
ValueCountFrequency (%)
15 2
6.9%
28 1
3.4%
30 2
6.9%
32 1
3.4%
62 1
3.4%
63 1
3.4%
75 2
6.9%
84 2
6.9%
87 1
3.4%
90 1
3.4%
ValueCountFrequency (%)
600 1
3.4%
500 2
6.9%
499 1
3.4%
390 1
3.4%
384 1
3.4%
326 1
3.4%
288 1
3.4%
270 1
3.4%
238 1
3.4%
216 1
3.4%

Interactions

2023-12-13T08:15:23.814695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:22.333369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:23.031065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:23.429195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:23.922698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:22.445068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:23.138181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:23.526225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:24.025481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:22.827228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:23.246757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:23.635543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:24.114724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:22.928776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:23.337814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:15:23.724992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:15:26.906647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과제종류연도과제명시작일종료일연구기간총연구비_백만원우리공사지원내역_백만원
과제종류1.0000.7131.0000.9560.9780.9270.4920.449
연도0.7131.0001.0001.0001.0000.3780.0000.405
과제명1.0001.0001.0001.0001.0001.0001.0001.000
시작일0.9561.0001.0001.0000.9940.9310.6450.275
종료일0.9781.0001.0000.9941.0000.9810.6190.000
연구기간0.9270.3781.0000.9310.9811.0000.6120.000
총연구비_백만원0.4920.0001.0000.6450.6190.6121.0000.757
우리공사지원내역_백만원0.4490.4051.0000.2750.0000.0000.7571.000
2023-12-13T08:15:27.046975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도연구기간총연구비_백만원우리공사지원내역_백만원과제종류
연도1.000-0.1610.1680.1850.510
연구기간-0.1611.0000.6860.5510.661
총연구비_백만원0.1680.6861.0000.8460.270
우리공사지원내역_백만원0.1850.5510.8461.0000.198
과제종류0.5100.6610.2700.1981.000

Missing values

2023-12-13T08:15:24.244778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:15:24.422101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

과제종류연도과제명시작일종료일연구기간단위총연구비_백만원우리공사지원내역_백만원
0협력연구개발사업2015액화질소를이용한열수송관(300A)차수공법개발2015-06-182017-06-1724개월512384
1민관공동투자기술개발사업2015다기능모듈형브리더국산화개발2015-08-012017-01-3118개월275103
2민관공동투자기술개발사업2015유량조절이가능한이중기밀볼밸브개발2015-10-012017-09-3024개월380143
3민관공동투자기술개발사업2015급탕및난방수공급배관용보급형전자기장복합변조방식스케일제거장치개발2015-12-012017-11-3024개월23187
4민관공동투자기술개발사업2015열배관유효에너지활용과에너지효율향상을위한전력공급장치개발2015-12-012017-11-3024개월870326
5민관공동투자기술개발사업2016배기가스의폐열회수및백연저감,오염물질저감설비개발2016-04-012017-03-0312개월550206
6민관공동투자기술개발사업2016지역난방중온수최적유량(초음파)측정기법및제품제작2016-10-012018-01-0315개월24090
7구매조건부신제품개발사업2016안전성이확보된고·저압격실구조의2단적내아크배전반개발2016-08-042017-08-0312개월40884
8수·위탁기업간네트워크형공동사업2017파주지사M501-F3연소기Outershell국산화(추가)2017-05-102017-11-307개월10030
9수·위탁기업간네트워크형공동사업2017GasTurbine압축기에어필터개발(추가)2017-05-102017-11-017개월5015
과제종류연도과제명시작일종료일연구기간단위총연구비_백만원우리공사지원내역_백만원
19구매조건부신제품개발사업(공동투자형)2020가스터빈용ExpansionJoints성능개선및국산화연구2020-12-022022-03-0115개월18984
20구매조건부신제품개발사업(구매연계형)2020안전관제시스템을적용한스마트배전반2020-10-202022-12-3127개월62663
21협력연구개발사업2020HITACHI-25가스터빈소모자재국산화개발2020-05-262021-05-2512개월10075
22협력연구개발사업2020M501F(3)GTSupportRingAssembly국산화개발2020-05-262021-05-2512개월180135
23구매조건부신제품개발사업(공동투자형)2021전동기 운전 중 상태 표시 및 진단 장치 개발2021-06-012022-05-3112개월16975
24구매조건부신제품개발사업(공동투자형)2021가상발전소향 태양광발전량 예측 및 통합운영 플랫폼 개발2021-10-012023-08-3123개월1350600
25구매조건부신제품개발사업(공동투자형)2021구역전기계통 건전성 진단 및 계통해석 프로그램 개발 연구2021-10-012023-08-3123개월648288
26구매조건부신제품개발사업(공동투자형)2021극저온 냉매를 이용한 냉각수 급냉 시스템 개발2021-10-012023-09-1223개월1123499
27구매조건부신제품개발사업(공동투자형)2021친환경 복합 탈질 환원제 개발 및 실증2021-10-012023-08-3123개월1220500
28협력연구개발사업2022Siemens SGT6-3000E GT Interstage Seal Housing Assembly 국산화 개발2022-06-102023-06-0912개월360270