Overview

Dataset statistics

Number of variables5
Number of observations71
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.0 KiB
Average record size in memory42.9 B

Variable types

Numeric1
Text2
Categorical2

Dataset

Description대덕연구단지에 있는 기관 중 데이터 제공에 동의한 한국조폐공사, 한국수자원공사의 연구과제 리스트에 대한 데이터 입니다. 해당 데이터는 2022년 기준입니다.
Author한국조폐공사
URLhttps://www.data.go.kr/data/15106336/fileData.do

Alerts

기준연도 has constant value ""Constant
순번 is highly overall correlated with 기관명High correlation
기관명 is highly overall correlated with 순번High correlation
순번 has unique valuesUnique
연구과제명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:51:54.314213
Analysis finished2023-12-12 15:51:54.967196
Duration0.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct71
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36
Minimum1
Maximum71
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size771.0 B
2023-12-13T00:51:55.053097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.5
Q118.5
median36
Q353.5
95-th percentile67.5
Maximum71
Range70
Interquartile range (IQR)35

Descriptive statistics

Standard deviation20.639767
Coefficient of variation (CV)0.57332687
Kurtosis-1.2
Mean36
Median Absolute Deviation (MAD)18
Skewness0
Sum2556
Variance426
MonotonicityStrictly increasing
2023-12-13T00:51:55.247201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.4%
2 1
 
1.4%
53 1
 
1.4%
52 1
 
1.4%
51 1
 
1.4%
50 1
 
1.4%
49 1
 
1.4%
48 1
 
1.4%
47 1
 
1.4%
46 1
 
1.4%
Other values (61) 61
85.9%
ValueCountFrequency (%)
1 1
1.4%
2 1
1.4%
3 1
1.4%
4 1
1.4%
5 1
1.4%
6 1
1.4%
7 1
1.4%
8 1
1.4%
9 1
1.4%
10 1
1.4%
ValueCountFrequency (%)
71 1
1.4%
70 1
1.4%
69 1
1.4%
68 1
1.4%
67 1
1.4%
66 1
1.4%
65 1
1.4%
64 1
1.4%
63 1
1.4%
62 1
1.4%

연구과제명
Text

UNIQUE 

Distinct71
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size700.0 B
2023-12-13T00:51:55.608514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length37
Mean length29.647887
Min length13

Characters and Unicode

Total characters2105
Distinct characters287
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)100.0%

Sample

1st row보안제품용 특수물질 감응솔루션 고도화 연구
2nd row은행권용지 품질 안정화 및 소재 고도화 연구
3rd row은행권용 요판잉크 응용기술 개발
4th row은행권, 신분증 등 주요제품용 미세소재 기반 광학 보안요소 양산화 연구
5th row블록체인 기반 DID 및 본인인증 기술개발
ValueCountFrequency (%)
연구 39
 
7.6%
30
 
5.8%
개발 13
 
2.5%
위한 12
 
2.3%
수립 9
 
1.7%
방안 7
 
1.4%
구축 7
 
1.4%
고도화 6
 
1.2%
6
 
1.2%
따른 6
 
1.2%
Other values (316) 380
73.8%
2023-12-13T00:51:56.093477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
444
 
21.1%
64
 
3.0%
52
 
2.5%
49
 
2.3%
34
 
1.6%
30
 
1.4%
30
 
1.4%
29
 
1.4%
28
 
1.3%
26
 
1.2%
Other values (277) 1319
62.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1500
71.3%
Space Separator 444
 
21.1%
Lowercase Letter 52
 
2.5%
Uppercase Letter 30
 
1.4%
Decimal Number 21
 
1.0%
Open Punctuation 16
 
0.8%
Close Punctuation 16
 
0.8%
Other Punctuation 14
 
0.7%
Dash Punctuation 11
 
0.5%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
64
 
4.3%
52
 
3.5%
49
 
3.3%
34
 
2.3%
30
 
2.0%
30
 
2.0%
29
 
1.9%
28
 
1.9%
26
 
1.7%
25
 
1.7%
Other values (239) 1133
75.5%
Lowercase Letter
ValueCountFrequency (%)
e 8
15.4%
t 8
15.4%
r 7
13.5%
a 6
11.5%
w 5
9.6%
o 5
9.6%
l 3
 
5.8%
m 2
 
3.8%
g 1
 
1.9%
s 1
 
1.9%
Other values (6) 6
11.5%
Uppercase Letter
ValueCountFrequency (%)
I 8
26.7%
K 7
23.3%
A 3
 
10.0%
D 3
 
10.0%
S 3
 
10.0%
P 2
 
6.7%
M 1
 
3.3%
B 1
 
3.3%
T 1
 
3.3%
O 1
 
3.3%
Decimal Number
ValueCountFrequency (%)
2 11
52.4%
1 5
23.8%
0 4
 
19.0%
6 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
· 8
57.1%
, 5
35.7%
/ 1
 
7.1%
Space Separator
ValueCountFrequency (%)
444
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1500
71.3%
Common 523
 
24.8%
Latin 82
 
3.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
64
 
4.3%
52
 
3.5%
49
 
3.3%
34
 
2.3%
30
 
2.0%
30
 
2.0%
29
 
1.9%
28
 
1.9%
26
 
1.7%
25
 
1.7%
Other values (239) 1133
75.5%
Latin
ValueCountFrequency (%)
e 8
 
9.8%
t 8
 
9.8%
I 8
 
9.8%
r 7
 
8.5%
K 7
 
8.5%
a 6
 
7.3%
w 5
 
6.1%
o 5
 
6.1%
A 3
 
3.7%
D 3
 
3.7%
Other values (16) 22
26.8%
Common
ValueCountFrequency (%)
444
84.9%
( 16
 
3.1%
) 16
 
3.1%
- 11
 
2.1%
2 11
 
2.1%
· 8
 
1.5%
, 5
 
1.0%
1 5
 
1.0%
0 4
 
0.8%
~ 1
 
0.2%
Other values (2) 2
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1499
71.2%
ASCII 597
 
28.4%
None 8
 
0.4%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
444
74.4%
( 16
 
2.7%
) 16
 
2.7%
- 11
 
1.8%
2 11
 
1.8%
e 8
 
1.3%
t 8
 
1.3%
I 8
 
1.3%
r 7
 
1.2%
K 7
 
1.2%
Other values (27) 61
 
10.2%
Hangul
ValueCountFrequency (%)
64
 
4.3%
52
 
3.5%
49
 
3.3%
34
 
2.3%
30
 
2.0%
30
 
2.0%
29
 
1.9%
28
 
1.9%
26
 
1.7%
25
 
1.7%
Other values (238) 1132
75.5%
None
ValueCountFrequency (%)
· 8
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

기준연도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size700.0 B
2021년
71 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021년
2nd row2021년
3rd row2021년
4th row2021년
5th row2021년

Common Values

ValueCountFrequency (%)
2021년 71
100.0%

Length

2023-12-13T00:51:56.224734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:51:56.355625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021년 71
100.0%
Distinct66
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size700.0 B
2023-12-13T00:51:56.649938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters213
Distinct characters87
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique61 ?
Unique (%)85.9%

Sample

1st row최일훈
2nd row허용대
3rd row서범준
4th row주성현
5th row이호상
ValueCountFrequency (%)
김성원 2
 
2.8%
이선홍 2
 
2.8%
최재원 2
 
2.8%
이규철 2
 
2.8%
남우성 2
 
2.8%
최일훈 1
 
1.4%
권미애 1
 
1.4%
지연숙 1
 
1.4%
박범수 1
 
1.4%
홍강택 1
 
1.4%
Other values (56) 56
78.9%
2023-12-13T00:51:57.188510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
6.1%
8
 
3.8%
8
 
3.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
4
 
1.9%
Other values (77) 150
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 213
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
6.1%
8
 
3.8%
8
 
3.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
4
 
1.9%
Other values (77) 150
70.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 213
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
6.1%
8
 
3.8%
8
 
3.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
4
 
1.9%
Other values (77) 150
70.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 213
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
13
 
6.1%
8
 
3.8%
8
 
3.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
4
 
1.9%
Other values (77) 150
70.4%

기관명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size700.0 B
한국수자원공사
61 
한국조폐공사
10 

Length

Max length7
Median length7
Mean length6.8591549
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한국조폐공사
2nd row한국조폐공사
3rd row한국조폐공사
4th row한국조폐공사
5th row한국조폐공사

Common Values

ValueCountFrequency (%)
한국수자원공사 61
85.9%
한국조폐공사 10
 
14.1%

Length

2023-12-13T00:51:57.382220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:51:57.511357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국수자원공사 61
85.9%
한국조폐공사 10
 
14.1%

Interactions

2023-12-13T00:51:54.603648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:51:57.622114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번연구과제명과제책임자기관명
순번1.0001.0000.6360.981
연구과제명1.0001.0001.0001.000
과제책임자0.6361.0001.0001.000
기관명0.9811.0001.0001.000
2023-12-13T00:51:57.738088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번기관명
순번1.0000.846
기관명0.8461.000

Missing values

2023-12-13T00:51:54.757819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:51:54.914099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번연구과제명기준연도과제책임자기관명
01보안제품용 특수물질 감응솔루션 고도화 연구2021년최일훈한국조폐공사
12은행권용지 품질 안정화 및 소재 고도화 연구2021년허용대한국조폐공사
23은행권용 요판잉크 응용기술 개발2021년서범준한국조폐공사
34은행권, 신분증 등 주요제품용 미세소재 기반 광학 보안요소 양산화 연구2021년주성현한국조폐공사
45블록체인 기반 DID 및 본인인증 기술개발2021년이호상한국조폐공사
56블록체인 코어기술 개발 및 디지털 결제 서비스 연구2021년현소선한국조폐공사
67차세대 전자여권 자재 국산화 기술 및 ID카드 보안요소 개발2021년채우석한국조폐공사
78차세대 보안모듈(KShell62) 및 응용기술 개발2021년진민식한국조폐공사
89보안패턴을 활용한 응용기술 개발2021년오창진한국조폐공사
910특수압인제품 다양화 연구2021년손희승한국조폐공사
순번연구과제명기준연도과제책임자기관명
6162IoT기반 물 정보 데이터 공유를 위한 네트워킹 기술 개발2021년김대욱한국수자원공사
6263가뭄-수질분석 등 이수운영을 고려한 장기 기상예측 고도화2021년김태국한국수자원공사
6364가상현실을 이용한 K-water 건설현장 안전사고 예방기술 연구2021년서덕영한국수자원공사
6465관 세척 기반 상수관망 설계운영관리 기술 개발2021년배철호한국수자원공사
6566도시유역의 종합 물순환 정량화 체계 구축 및 활용2021년김준성한국수자원공사
6667수자원 시설물 조기경보시스템 개선을 위한 관리기준 고도화 연구2021년윤국희한국수자원공사
6768염분침투 특성을 고려한 해양 콘크리트 구조물 잔존수명 예측2021년장봉석한국수자원공사
6869전력시장 신규제도 도입에 따른 K-water 최적전력거래 방안 및 전략 연구2021년임선택한국수자원공사
6970지속가능한 유역물관리 재정연구 (1차년도)2021년류문현한국수자원공사
7071하천 취수지점(미계측지역)의 가뭄평가/예측을 위한 하천-지하수위 분석체계구축 및 활용2021년남우성한국수자원공사