Overview

Dataset statistics

Number of variables5
Number of observations402
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.2 KiB
Average record size in memory41.3 B

Variable types

Numeric1
Text1
Categorical2
DateTime1

Dataset

Description국토교통과학진흥원 R&D 과제에 대한 대내외 현황요청 내역에 관한 정보(요청명, 요청업무 구분, 제출처, 요청일자) 제공합니다.
Author국토교통과학기술진흥원
URLhttps://www.data.go.kr/data/15060799/fileData.do

Alerts

순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:59:06.683274
Analysis finished2023-12-12 09:59:07.474018
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct402
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean201.5
Minimum1
Maximum402
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.7 KiB
2023-12-12T18:59:07.557610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile21.05
Q1101.25
median201.5
Q3301.75
95-th percentile381.95
Maximum402
Range401
Interquartile range (IQR)200.5

Descriptive statistics

Standard deviation116.19165
Coefficient of variation (CV)0.57663351
Kurtosis-1.2
Mean201.5
Median Absolute Deviation (MAD)100.5
Skewness0
Sum81003
Variance13500.5
MonotonicityStrictly increasing
2023-12-12T18:59:07.716962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
266 1
 
0.2%
276 1
 
0.2%
275 1
 
0.2%
274 1
 
0.2%
273 1
 
0.2%
272 1
 
0.2%
271 1
 
0.2%
270 1
 
0.2%
269 1
 
0.2%
Other values (392) 392
97.5%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
402 1
0.2%
401 1
0.2%
400 1
0.2%
399 1
0.2%
398 1
0.2%
397 1
0.2%
396 1
0.2%
395 1
0.2%
394 1
0.2%
393 1
0.2%
Distinct382
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-12T18:59:08.041369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length126
Median length46
Mean length26.838308
Min length5

Characters and Unicode

Total characters10789
Distinct characters320
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique368 ?
Unique (%)91.5%

Sample

1st row2020년 협약과제 세세부과제 현황 요청(연락처/이메일 포함)
2nd row철도기술연구사업 최근 3년간(18, 19, 20년도) 과제별(총괄기준) TRL 현황
3rd row20년 기술실시계약 및 기술료 실적정보 요청(업데이트)
4th row20년 사업화실적
5th row사업화제품화 상세 매출액 발생일 정보를 요청드립니다.
ValueCountFrequency (%)
요청 211
 
9.4%
현황 126
 
5.6%
57
 
2.5%
과제 40
 
1.8%
21년 31
 
1.4%
r&d 29
 
1.3%
정보 29
 
1.3%
종료과제 28
 
1.2%
자료 26
 
1.2%
기술료 24
 
1.1%
Other values (767) 1646
73.3%
2023-12-12T18:59:08.633671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1856
 
17.2%
2 366
 
3.4%
298
 
2.8%
287
 
2.7%
279
 
2.6%
271
 
2.5%
1 257
 
2.4%
222
 
2.1%
219
 
2.0%
209
 
1.9%
Other values (310) 6525
60.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6977
64.7%
Space Separator 1856
 
17.2%
Decimal Number 990
 
9.2%
Other Punctuation 334
 
3.1%
Uppercase Letter 190
 
1.8%
Close Punctuation 169
 
1.6%
Open Punctuation 169
 
1.6%
Math Symbol 58
 
0.5%
Dash Punctuation 30
 
0.3%
Lowercase Letter 9
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
298
 
4.3%
287
 
4.1%
279
 
4.0%
271
 
3.9%
222
 
3.2%
219
 
3.1%
209
 
3.0%
205
 
2.9%
172
 
2.5%
166
 
2.4%
Other values (261) 4649
66.6%
Uppercase Letter
ValueCountFrequency (%)
R 61
32.1%
D 56
29.5%
I 17
 
8.9%
S 12
 
6.3%
N 11
 
5.8%
T 9
 
4.7%
L 4
 
2.1%
A 3
 
1.6%
E 3
 
1.6%
O 3
 
1.6%
Other values (6) 11
 
5.8%
Decimal Number
ValueCountFrequency (%)
2 366
37.0%
1 257
26.0%
0 196
19.8%
5 34
 
3.4%
3 32
 
3.2%
7 28
 
2.8%
6 26
 
2.6%
9 23
 
2.3%
8 16
 
1.6%
4 12
 
1.2%
Other Punctuation
ValueCountFrequency (%)
. 93
27.8%
' 80
24.0%
, 80
24.0%
& 53
15.9%
/ 17
 
5.1%
" 8
 
2.4%
% 2
 
0.6%
: 1
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
o 2
22.2%
n 2
22.2%
u 1
11.1%
b 1
11.1%
l 1
11.1%
r 1
11.1%
d 1
11.1%
Close Punctuation
ValueCountFrequency (%)
) 146
86.4%
] 23
 
13.6%
Open Punctuation
ValueCountFrequency (%)
( 146
86.4%
[ 23
 
13.6%
Space Separator
ValueCountFrequency (%)
1856
100.0%
Math Symbol
ValueCountFrequency (%)
~ 58
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6977
64.7%
Common 3613
33.5%
Latin 199
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
298
 
4.3%
287
 
4.1%
279
 
4.0%
271
 
3.9%
222
 
3.2%
219
 
3.1%
209
 
3.0%
205
 
2.9%
172
 
2.5%
166
 
2.4%
Other values (261) 4649
66.6%
Common
ValueCountFrequency (%)
1856
51.4%
2 366
 
10.1%
1 257
 
7.1%
0 196
 
5.4%
) 146
 
4.0%
( 146
 
4.0%
. 93
 
2.6%
' 80
 
2.2%
, 80
 
2.2%
~ 58
 
1.6%
Other values (16) 335
 
9.3%
Latin
ValueCountFrequency (%)
R 61
30.7%
D 56
28.1%
I 17
 
8.5%
S 12
 
6.0%
N 11
 
5.5%
T 9
 
4.5%
L 4
 
2.0%
A 3
 
1.5%
E 3
 
1.5%
O 3
 
1.5%
Other values (13) 20
 
10.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6977
64.7%
ASCII 3812
35.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1856
48.7%
2 366
 
9.6%
1 257
 
6.7%
0 196
 
5.1%
) 146
 
3.8%
( 146
 
3.8%
. 93
 
2.4%
' 80
 
2.1%
, 80
 
2.1%
R 61
 
1.6%
Other values (39) 531
 
13.9%
Hangul
ValueCountFrequency (%)
298
 
4.3%
287
 
4.1%
279
 
4.0%
271
 
3.9%
222
 
3.2%
219
 
3.1%
209
 
3.0%
205
 
2.9%
172
 
2.5%
166
 
2.4%
Other values (261) 4649
66.6%
Distinct7
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
과제
169 
성과
78 
인력
64 
정산
42 
조사분석
19 
Other values (2)
30 

Length

Max length4
Median length2
Mean length2.1691542
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row과제
2nd row과제
3rd row성과
4th row성과
5th row성과

Common Values

ValueCountFrequency (%)
과제 169
42.0%
성과 78
19.4%
인력 64
 
15.9%
정산 42
 
10.4%
조사분석 19
 
4.7%
평가 15
 
3.7%
<NA> 15
 
3.7%

Length

2023-12-12T18:59:08.842321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:59:09.013163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
과제 169
42.0%
성과 78
19.4%
인력 64
 
15.9%
정산 42
 
10.4%
조사분석 19
 
4.7%
평가 15
 
3.7%
na 15
 
3.7%

제출처
Categorical

Distinct8
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
기타
211 
과기정통부
50 
국회
44 
국토부
36 
기재부
23 
Other values (3)
38 

Length

Max length5
Median length2
Mean length2.6716418
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국토부
2nd row국토부
3rd row기타
4th row기타
5th row기타

Common Values

ValueCountFrequency (%)
기타 211
52.5%
과기정통부 50
 
12.4%
국회 44
 
10.9%
국토부 36
 
9.0%
기재부 23
 
5.7%
<NA> 23
 
5.7%
감사 10
 
2.5%
중소기업부 5
 
1.2%

Length

2023-12-12T18:59:09.204625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:59:09.373282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 211
52.5%
과기정통부 50
 
12.4%
국회 44
 
10.9%
국토부 36
 
9.0%
기재부 23
 
5.7%
na 23
 
5.7%
감사 10
 
2.5%
중소기업부 5
 
1.2%
Distinct227
Distinct (%)56.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
Minimum2021-01-04 00:00:00
Maximum2022-07-13 00:00:00
2023-12-12T18:59:09.546311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:59:09.753105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T18:59:07.083938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:59:09.867944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번요청업무 구분제출처
순번1.0000.2160.373
요청업무 구분0.2161.0000.247
제출처0.3730.2471.000
2023-12-12T18:59:09.985762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
제출처요청업무 구분
제출처1.0000.149
요청업무 구분0.1491.000
2023-12-12T18:59:10.104443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번요청업무 구분제출처
순번1.0000.1140.198
요청업무 구분0.1141.0000.149
제출처0.1980.1491.000

Missing values

2023-12-12T18:59:07.282668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:59:07.430928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번요청내역요청업무 구분제출처요청일
012020년 협약과제 세세부과제 현황 요청(연락처/이메일 포함)과제국토부2021-01-04
12철도기술연구사업 최근 3년간(18, 19, 20년도) 과제별(총괄기준) TRL 현황과제국토부2021-01-04
2320년 기술실시계약 및 기술료 실적정보 요청(업데이트)성과기타2021-01-04
3420년 사업화실적성과기타2021-01-05
45사업화제품화 상세 매출액 발생일 정보를 요청드립니다.성과기타2021-01-11
56종료과제 정보 요청성과과기정통부2021-01-11
67국제공동연구현황 조사(2016년 이후)과제국토부2021-01-12
7820년 기술실시계약 및 기술료 실적정보 요청(업데이트)평가기타2021-01-12
89기술사업화지원사업 과제별 수행과제 목록(2011~2020)과제과기정통부2021-01-12
91020년 협약과제 현황 요청과제국토부2021-01-13
순번요청내역요청업무 구분제출처요청일
392393NTIS 등록을 위한 21년 기술료 및 정부납부기술료 정보(업데이트)성과기타2022-06-30
393394주관, 공동 연구책임자 현황(이메일, 핸드폰 포함)인력기타2022-07-01
394395전문기관 실태조사 자료작성 대응용 자료 요청성과과기정통부2022-07-04
395396전문기관 실태조사 자료작성 대응용 자료 요청(추가정보 요청)성과과기정통부2022-07-05
396397위탁회계법인지정을 위한 과제현황요청정산기타2022-07-07
397398과기부 결산 요구자료 요청정산과기정통부2022-07-07
398399표준 채택 지표 전체성과 요청성과기타2022-07-11
399400성과지표 전성과 요청 드립니다.성과기타2022-07-12
400401공공데이터 업데이트 자료 요청과제기타2022-07-13
401402(공공데이터) 공공데이터 포털 게시용 데이터 업데이트 요청과제기타2022-07-13