Overview

Dataset statistics

Number of variables8
Number of observations3563
Missing cells5361
Missing cells (%)18.8%
Duplicate rows27
Duplicate rows (%)0.8%
Total size in memory222.8 KiB
Average record size in memory64.0 B

Variable types

Categorical1
Text2
DateTime5

Dataset

Description전통의학정보포털 오아시스의 한의연구보고서 입력 정보입니다. 과제수행년도, 세부과제명, 총연구기간(시작일), 총연구기간(종료일), 당해년도연구기간(시작일), 당해년도연구기간(종료일), 발행일자, 연구관리기관으로 이루어져있습니다.
Author한국한의학연구원
URLhttps://www.data.go.kr/data/15086068/fileData.do

Alerts

Dataset has 27 (0.8%) duplicate rowsDuplicates
총연구기간(시작일) has 940 (26.4%) missing valuesMissing
총연구기간(종료일) has 940 (26.4%) missing valuesMissing
당해년도연구기간(시작일) has 1147 (32.2%) missing valuesMissing
당해년도연구기간(종료일) has 1147 (32.2%) missing valuesMissing
발행일자 has 1187 (33.3%) missing valuesMissing

Reproduction

Analysis started2023-12-12 22:16:13.855230
Analysis finished2023-12-12 22:16:15.132187
Duration1.28 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct40
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size28.0 KiB
2,007
331 
2,006
315 
2,005
282 
2,003
252 
2,004
227 
Other values (35)
2156 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique5 ?
Unique (%)0.1%

Sample

1st row2,011
2nd row2,003
3rd row2,012
4th row2,013
5th row2,008

Common Values

ValueCountFrequency (%)
2,007 331
 
9.3%
2,006 315
 
8.8%
2,005 282
 
7.9%
2,003 252
 
7.1%
2,004 227
 
6.4%
2,002 217
 
6.1%
2,001 177
 
5.0%
2,000 176
 
4.9%
2,008 152
 
4.3%
2,010 139
 
3.9%
Other values (30) 1295
36.3%

Length

2023-12-13T07:16:15.207677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2,007 331
 
9.3%
2,006 315
 
8.8%
2,005 282
 
7.9%
2,003 252
 
7.1%
2,004 227
 
6.4%
2,002 217
 
6.1%
2,001 177
 
5.0%
2,000 176
 
4.9%
2,008 152
 
4.3%
2,010 139
 
3.9%
Other values (30) 1295
36.3%
Distinct3072
Distinct (%)86.2%
Missing0
Missing (%)0.0%
Memory size28.0 KiB
2023-12-13T07:16:15.528659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length116
Median length80
Mean length30.596969
Min length5

Characters and Unicode

Total characters109017
Distinct characters1016
Distinct categories13 ?
Distinct scripts5 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2728 ?
Unique (%)76.6%

Sample

1st row사상체질에 따른 Gut hormone profiling을 통한 식욕의 개체 차이에 대한 기전 연구
2nd row대구한의대학교 한방생명자원연구센터 지역혁신센터사업
3rd rowUV/Microwave를 활용한 Phase Ⅱ enzyme 조절 항염증 천연물 metabolite 도출 연구
4th row복합천연추출물을 이용한 비만 예방 및 개선용 건강기능식품 개발 및 사업화
5th row한약자원 향장 소재은행
ValueCountFrequency (%)
1309
 
5.1%
연구 1239
 
4.8%
개발 1042
 
4.1%
이용한 434
 
1.7%
위한 375
 
1.5%
관한 357
 
1.4%
대한 246
 
1.0%
한약재 179
 
0.7%
의한 172
 
0.7%
통한 131
 
0.5%
Other values (8028) 20234
78.7%
2023-12-13T07:16:16.006145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22159
 
20.3%
3529
 
3.2%
2810
 
2.6%
2142
 
2.0%
2138
 
2.0%
1791
 
1.6%
1604
 
1.5%
1593
 
1.5%
1575
 
1.4%
1573
 
1.4%
Other values (1006) 68103
62.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 78536
72.0%
Space Separator 22159
 
20.3%
Lowercase Letter 4134
 
3.8%
Uppercase Letter 1587
 
1.5%
Other Punctuation 587
 
0.5%
Open Punctuation 540
 
0.5%
Close Punctuation 540
 
0.5%
Decimal Number 500
 
0.5%
Dash Punctuation 306
 
0.3%
Letter Number 94
 
0.1%
Other values (3) 34
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3529
 
4.5%
2810
 
3.6%
2142
 
2.7%
2138
 
2.7%
1791
 
2.3%
1604
 
2.0%
1593
 
2.0%
1575
 
2.0%
1573
 
2.0%
1396
 
1.8%
Other values (902) 58385
74.3%
Lowercase Letter
ValueCountFrequency (%)
e 438
10.6%
i 400
 
9.7%
o 384
 
9.3%
n 343
 
8.3%
a 321
 
7.8%
r 263
 
6.4%
l 253
 
6.1%
s 249
 
6.0%
t 241
 
5.8%
c 202
 
4.9%
Other values (21) 1040
25.2%
Uppercase Letter
ValueCountFrequency (%)
I 211
13.3%
D 142
 
8.9%
A 135
 
8.5%
P 115
 
7.2%
N 101
 
6.4%
C 98
 
6.2%
B 91
 
5.7%
M 89
 
5.6%
T 77
 
4.9%
S 70
 
4.4%
Other values (16) 458
28.9%
Other Punctuation
ValueCountFrequency (%)
, 352
60.0%
· 105
 
17.9%
/ 50
 
8.5%
: 34
 
5.8%
' 12
 
2.0%
. 10
 
1.7%
& 9
 
1.5%
; 7
 
1.2%
" 6
 
1.0%
# 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 106
21.2%
0 101
20.2%
2 93
18.6%
3 59
11.8%
9 43
8.6%
5 24
 
4.8%
4 23
 
4.6%
7 18
 
3.6%
6 17
 
3.4%
8 16
 
3.2%
Open Punctuation
ValueCountFrequency (%)
( 526
97.4%
6
 
1.1%
6
 
1.1%
1
 
0.2%
[ 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 526
97.4%
6
 
1.1%
6
 
1.1%
1
 
0.2%
] 1
 
0.2%
Letter Number
ValueCountFrequency (%)
37
39.4%
33
35.1%
19
20.2%
4
 
4.3%
1
 
1.1%
Math Symbol
ValueCountFrequency (%)
+ 12
66.7%
~ 3
 
16.7%
< 1
 
5.6%
> 1
 
5.6%
1
 
5.6%
Initial Punctuation
ValueCountFrequency (%)
7
87.5%
1
 
12.5%
Final Punctuation
ValueCountFrequency (%)
7
87.5%
1
 
12.5%
Space Separator
ValueCountFrequency (%)
22159
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 306
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 77852
71.4%
Common 24666
 
22.6%
Latin 5799
 
5.3%
Han 684
 
0.6%
Greek 16
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3529
 
4.5%
2810
 
3.6%
2142
 
2.8%
2138
 
2.7%
1791
 
2.3%
1604
 
2.1%
1593
 
2.0%
1575
 
2.0%
1573
 
2.0%
1396
 
1.8%
Other values (625) 57701
74.1%
Han
ValueCountFrequency (%)
25
 
3.7%
20
 
2.9%
20
 
2.9%
19
 
2.8%
17
 
2.5%
16
 
2.3%
16
 
2.3%
15
 
2.2%
12
 
1.8%
11
 
1.6%
Other values (267) 513
75.0%
Latin
ValueCountFrequency (%)
e 438
 
7.6%
i 400
 
6.9%
o 384
 
6.6%
n 343
 
5.9%
a 321
 
5.5%
r 263
 
4.5%
l 253
 
4.4%
s 249
 
4.3%
t 241
 
4.2%
I 211
 
3.6%
Other values (47) 2696
46.5%
Common
ValueCountFrequency (%)
22159
89.8%
( 526
 
2.1%
) 526
 
2.1%
, 352
 
1.4%
- 306
 
1.2%
1 106
 
0.4%
· 105
 
0.4%
0 101
 
0.4%
2 93
 
0.4%
3 59
 
0.2%
Other values (32) 333
 
1.4%
Greek
ValueCountFrequency (%)
κ 5
31.2%
β 4
25.0%
α 3
18.8%
δ 3
18.8%
γ 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 77826
71.4%
ASCII 30223
 
27.7%
CJK 669
 
0.6%
None 148
 
0.1%
Number Forms 94
 
0.1%
Compat Jamo 26
 
< 0.1%
Punctuation 16
 
< 0.1%
CJK Compat Ideographs 15
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
22159
73.3%
( 526
 
1.7%
) 526
 
1.7%
e 438
 
1.4%
i 400
 
1.3%
o 384
 
1.3%
, 352
 
1.2%
n 343
 
1.1%
a 321
 
1.1%
- 306
 
1.0%
Other values (72) 4468
 
14.8%
Hangul
ValueCountFrequency (%)
3529
 
4.5%
2810
 
3.6%
2142
 
2.8%
2138
 
2.7%
1791
 
2.3%
1604
 
2.1%
1593
 
2.0%
1575
 
2.0%
1573
 
2.0%
1396
 
1.8%
Other values (624) 57675
74.1%
None
ValueCountFrequency (%)
· 105
70.9%
6
 
4.1%
6
 
4.1%
6
 
4.1%
6
 
4.1%
κ 5
 
3.4%
β 4
 
2.7%
α 3
 
2.0%
δ 3
 
2.0%
γ 1
 
0.7%
Other values (3) 3
 
2.0%
Number Forms
ValueCountFrequency (%)
37
39.4%
33
35.1%
19
20.2%
4
 
4.3%
1
 
1.1%
Compat Jamo
ValueCountFrequency (%)
26
100.0%
CJK
ValueCountFrequency (%)
25
 
3.7%
20
 
3.0%
20
 
3.0%
19
 
2.8%
17
 
2.5%
16
 
2.4%
16
 
2.4%
15
 
2.2%
12
 
1.8%
11
 
1.6%
Other values (262) 498
74.4%
Punctuation
ValueCountFrequency (%)
7
43.8%
7
43.8%
1
 
6.2%
1
 
6.2%
CJK Compat Ideographs
ValueCountFrequency (%)
5
33.3%
4
26.7%
3
20.0%
2
 
13.3%
1
 
6.7%
Distinct426
Distinct (%)16.2%
Missing940
Missing (%)26.4%
Memory size28.0 KiB
Minimum1979-12-01 00:00:00
Maximum2020-08-01 00:00:00
2023-12-13T07:16:16.182895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:16:16.358922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct490
Distinct (%)18.7%
Missing940
Missing (%)26.4%
Memory size28.0 KiB
Minimum1980-12-01 00:00:00
Maximum2022-10-01 00:00:00
2023-12-13T07:16:16.542884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:16:16.703671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct438
Distinct (%)18.1%
Missing1147
Missing (%)32.2%
Memory size28.0 KiB
Minimum1979-12-01 00:00:00
Maximum2020-08-01 00:00:00
2023-12-13T07:16:16.872253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:16:17.415476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct473
Distinct (%)19.6%
Missing1147
Missing (%)32.2%
Memory size28.0 KiB
Minimum1980-12-01 00:00:00
Maximum2020-12-31 00:00:00
2023-12-13T07:16:17.643065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:16:17.827203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

발행일자
Date

MISSING 

Distinct1010
Distinct (%)42.5%
Missing1187
Missing (%)33.3%
Memory size28.0 KiB
Minimum1981-03-03 00:00:00
Maximum2021-02-15 00:00:00
2023-12-13T07:16:18.002677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:16:18.178213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct79
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size28.0 KiB
2023-12-13T07:16:18.466574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length13
Mean length6.3738423
Min length2

Characters and Unicode

Total characters22710
Distinct characters111
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)0.7%

Sample

1st row미래창조과학부
2nd row산업통상자원부
3rd row미래창조과학부
4th row연구개발특구진흥재단
5th row교육과학기술부
ValueCountFrequency (%)
없음 510
14.3%
보건복지부 402
11.3%
식품의약품안전청 337
 
9.5%
한국보건산업진흥원 279
 
7.8%
한국과학재단 240
 
6.7%
교육과학기술부 235
 
6.6%
한국연구재단 201
 
5.6%
농촌진흥청 181
 
5.1%
한국산업기술평가원 178
 
5.0%
한국학술진흥재단 140
 
3.9%
Other values (69) 862
24.2%
2023-12-13T07:16:18.870317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1352
 
6.0%
1314
 
5.8%
967
 
4.3%
935
 
4.1%
911
 
4.0%
899
 
4.0%
845
 
3.7%
770
 
3.4%
743
 
3.3%
724
 
3.2%
Other values (101) 13250
58.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22707
> 99.9%
Space Separator 2
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1352
 
6.0%
1314
 
5.8%
967
 
4.3%
935
 
4.1%
911
 
4.0%
899
 
4.0%
845
 
3.7%
770
 
3.4%
743
 
3.3%
724
 
3.2%
Other values (99) 13247
58.3%
Space Separator
ValueCountFrequency (%)
2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22708
> 99.9%
Common 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1352
 
6.0%
1314
 
5.8%
967
 
4.3%
935
 
4.1%
911
 
4.0%
899
 
4.0%
845
 
3.7%
770
 
3.4%
743
 
3.3%
724
 
3.2%
Other values (100) 13248
58.3%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22707
> 99.9%
ASCII 2
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1352
 
6.0%
1314
 
5.8%
967
 
4.3%
935
 
4.1%
911
 
4.0%
899
 
4.0%
845
 
3.7%
770
 
3.4%
743
 
3.3%
724
 
3.2%
Other values (99) 13247
58.3%
ASCII
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
1
100.0%

Correlations

2023-12-13T07:16:18.975757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과제수행년도연구관리기관
과제수행년도1.0000.875
연구관리기관0.8751.000

Missing values

2023-12-13T07:16:14.791092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:16:14.909891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T07:16:15.025615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

과제수행년도세부과제명총연구기간(시작일)총연구기간(종료일)당해년도연구기간(시작일)당해년도연구기간(종료일)발행일자연구관리기관
02,011사상체질에 따른 Gut hormone profiling을 통한 식욕의 개체 차이에 대한 기전 연구2011-06-012014-05-302011-06-012014-05-302014-06-25미래창조과학부
12,003대구한의대학교 한방생명자원연구센터 지역혁신센터사업2003-08-012013-03-282003-08-012013-03-282013-05-01산업통상자원부
22,012UV/Microwave를 활용한 Phase Ⅱ enzyme 조절 항염증 천연물 metabolite 도출 연구2012-10-012015-10-012012-10-012015-10-012015-10-23미래창조과학부
32,013복합천연추출물을 이용한 비만 예방 및 개선용 건강기능식품 개발 및 사업화2013-07-012015-07-012013-07-012015-07-012015-07-24연구개발특구진흥재단
42,008한약자원 향장 소재은행2008-05-012013-03-282008-05-012013-03-282012-12-27교육과학기술부
52,011한의학 경혈점 비접촉 자기장 집속 자극에 의한 생체신호 반응 조사2011-10-012014-10-012011-10-012014-10-012014-10-22미래창조과학부
62,011신체자가인식 변형 모델을 통한 침 치료 뇌신경생리학적 작용기전 연구2011-06-012014-05-302011-06-012014-05-302014-06-26미래창조과학부
72,014비알콜성 지방간 질환 조절 기전구명 및 기능성 천연물 소재 탐색<NA><NA><NA><NA>2015-03-28농촌진흥청
82,009한의학적 아토피 피부염 동물모델 개발 및 한방제제의 효과와 그 기전 규명2009-10-012012-10-012009-10-012012-10-012012-11-30교육과학기술부
92,012한의학 칠정(七情)에 기반을 둔 핵심감정평가도구 제작과 신뢰도ㆍ타당도 연구2012-06-012014-05-302012-06-012014-05-302014-06-25교육과학기술부
과제수행년도세부과제명총연구기간(시작일)총연구기간(종료일)당해년도연구기간(시작일)당해년도연구기간(종료일)발행일자연구관리기관
35532,012흰민들레로부터 알쯔하이머 예방 및 항노화 활성 연구와 기능성 물질의 분리 및 작용 메카니즘 규명2010-09-012013-08-312012-09-012013-08-312013-11-21한국연구재단
35542,017암환자 통증 및 악액질 완화 양·한방 통합 치료기술 개발2017-06-302019-12-312017-06-302019-12-312019-10-15과학기술정보통신부
35552,017암성 악액질 예방 및 치료를 위한 양한방 통합관리체계 개발2017-06-302019-12-312017-06-302019-12-312020-01-14과학기술정보통신부
35562,018뇌혈관 노화의 분자해부학적 분석과 황화수소 제어에 의한 뇌혈류 개선2016-06-012019-05-312018-04-012019-02-282019-06-24한국연구재단
35572,019온병(溫病) 변증이론에 기반한 대장질환 치료약물 탐색 및 AMPK를 중심으로 한 기전 구명2016-06-012019-05-312019-03-012019-05-312019-12-23한국연구재단
35582,019염증성 장질환 모델에서 Tumor necrosis factor-alpha inhibitor 및 한약재 병행투여에 따른 면역기전 연구2017-03-012020-02-292019-03-012020-02-292020-03-26한국연구재단
35592,019암성 통증 완화 통합 치료기술 연구2017-06-302022-06-292019-01-302019-12-312019-10-14한국연구재단
35602,018울금-커큐민 성분을 이용한 기능성 건강식품 개발2018-03-012018-11-302018-03-012018-11-302018-12-03농업기술실용화재단
35612,019암성 악액질 관리체계 개발 및 한방 치료기술 검증2017-06-302022-06-292019-01-302019-12-312020-01-14한국연구재단
35622,019의료기기중심 한의약 임상시험센터2014-06-012019-05-312019-02-012019-05-312019-07-12한국보건산업진흥원

Duplicate rows

Most frequently occurring

과제수행년도세부과제명총연구기간(시작일)총연구기간(종료일)당해년도연구기간(시작일)당해년도연구기간(종료일)발행일자연구관리기관# duplicates
12,005한약재 중 유해 물질 모니터링 자료 DB입력<NA><NA><NA><NA><NA>식품의약품안전청3
62,007간보호 및 간질환 치료 복합제제, CGX의 한방신약 제품화사업을 위한 개발연구2007-09-012009-08-312007-09-012009-08-312010-02-27보건복지가족부3
72,007한약재 생리활성 성분 효능 확인 및 효능유전자 연구2007-04-212008-12-302007-04-212008-12-302009-02-15식품의약품안전청3
182,008한약재 표준가공공정지침제정 연구2008-04-102008-12-302008-04-102008-12-302009-03-02식품의약품안전청3
02,005소아 만성질환 치료용 한방 신제형 개발2005-07-012008-05-012005-07-012008-05-012008-05-27보건복지가족부2
22,005한의약을 활용한 혈관질환 예방 및 치료제 개발2005-07-012008-05-012005-07-012008-05-012008-05-29보건복지가족부2
32,006국가자생식물 유래 한약재로부터 신경보호효과 식의약품개발에 관한 연구2006-06-012010-05-012006-06-012010-05-012010-06-30교육과학기술부2
42,006지능형 온침기기 제품화2006-12-012008-03-292006-12-012008-03-292008-05-01보건복지부2
52,007DNA 분석에 의한 한약재 종감별 연구(금은화, 인동, 황백, 연교, 황련, 독활)2007-05-172008-12-302007-05-172008-12-302009-03-13식품의약품안전청2
82,007한약재의 생리활성성분 분리 및 분석연구(황백, 연교)2007-03-282008-12-302007-03-282008-12-302008-12-30식품의약품안전청2