Overview

Dataset statistics

Number of variables: 4
Number of observations: 268
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.8 KiB
Average record size in memory: 33.5 B
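
The overview counts above can be reproduced in plain Python. The sketch below is illustrative only: the helper name `dataset_statistics` and the toy rows are assumptions, it treats `None` as the only missing marker, and it counts each repeat of an earlier row as one duplicate, which may differ in detail from what ydata-profiling does internally.

```python
from collections import Counter


def dataset_statistics(rows, columns):
    """Overview counts for a list of dict records (hypothetical helper)."""
    n_obs = len(rows)
    n_cells = n_obs * len(columns)
    # Assumption: only None marks a missing cell.
    missing = sum(1 for row in rows for col in columns if row.get(col) is None)
    # Assumption: each repeat of an already-seen row counts as one duplicate.
    seen = Counter(tuple(row.get(col) for col in columns) for row in rows)
    duplicates = sum(count - 1 for count in seen.values())
    return {
        "variables": len(columns),
        "observations": n_obs,
        "missing_cells": missing,
        "missing_cells_pct": 100 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_rows_pct": 100 * duplicates / n_obs if n_obs else 0.0,
    }


# Toy rows, not taken from the dataset.
columns = ["kind", "title"]
rows = [
    {"kind": "A", "title": "first"},
    {"kind": "A", "title": "first"},   # repeat of the row above
    {"kind": "B", "title": None},      # one missing cell
    {"kind": "B", "title": "second"},
]
stats = dataset_statistics(rows, columns)
print(stats)
```

For the dataset profiled here both counts are 0, so both percentages are 0.0%.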

Variable types

Categorical: 2
Text: 2

Dataset

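Description: A list of reports that collect overseas technology, industry, and policy information related to land, infrastructure and transport (국토교통) R&D, then analyze and process it into report form.
URL: https://www.data.go.kr/data/15025512/fileData.do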

Alerts

보고서 명 has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 08:03:45.840160
Analysis finished: 2023-12-12 08:03:46.527635
Duration: 0.69 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

리포트 종류 (report type)
Categorical

Distinct: 3
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB
기술리포트: 166
정책리포트: 84
산업리포트: 18

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 기술리포트
2nd row: 정책리포트
3rd row: 기술리포트
4th row: 정책리포트
5th row: 기술리포트

Common Values

Value | Count | Frequency (%)
기술리포트 (technology report) | 166 | 61.9%
정책리포트 (policy report) | 84 | 31.3%
산업리포트 (industry report) | 18 | 6.7%
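
The Frequency (%) column in the Common Values table is each count divided by the total number of observations. A minimal check using only the counts reported above (the variable names are illustrative):

```python
# Counts copied from the Common Values table for 리포트 종류.
counts = {"기술리포트": 166, "정책리포트": 84, "산업리포트": 18}

total = sum(counts.values())  # should equal the number of observations
frequencies = {value: round(100 * n / total, 1) for value, n in counts.items()}

for value, n in counts.items():
    print(f"{value}\t{n}\t{frequencies[value]}%")
```

The counts sum to 268, matching the number of observations in the overview.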

Length

Histogram of lengths of the category

Common Values (Plot)

Figure: bar plot of the value counts (same values as the Common Values table: 기술리포트 166, 정책리포트 84, 산업리포트 18).

보고서 명 (report title)
Text

UNIQUE 

Distinct: 268
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Length

Max length: 66
Median length: 42
Mean length: 28.66791
Min length: 8
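
The reported mean length agrees with the total character count given for this variable under Characters and Unicode (7683 characters over 268 observations). A quick arithmetic check, using only figures from this report:

```python
observations = 268        # "Number of observations" in the overview
total_characters = 7683   # "Total characters" reported for 보고서 명

# Mean length is the total character count divided by the number of rows.
mean_length = total_characters / observations
print(round(mean_length, 5))
```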

Characters and Unicode

Total characters: 7683
Distinct characters: 485
Distinct categories: 13
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
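
As a rough illustration of how such per-character properties can be looked up, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the report's own implementation; the helper name `category_counts` is hypothetical, and the input is one of the sample titles from this report. The standard library exposes the general category but not the script or block, so those breakdowns are not reproduced here.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this example.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
}


def category_counts(text):
    """Count characters by Unicode general category (hypothetical helper)."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the sample titles listed in this report.
title = "Colombia Bogota의 지능형 도시 및 스마트 교통 분야 거버넌스 사례연구"
counts = category_counts(title)
for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

Hangul syllables fall under Other Letter, which is why that category dominates the category table for this variable.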

Unique

Unique: 268
Unique (%): 100.0%

Sample

1st row: 대중교통에서 자기신원검증 사례 기반의 블록체인 지원 분산식별 관리방법
2nd row: 친환경 건축 재료의 환경 평가가 지속가능한 등급 시스템에 미치는 영향
3rd row: 새로운 건설 및 철거 폐기물 처리 및 재사용과 재활용 극대화를 위한 용도
4th row: Colombia Bogota의 지능형 도시 및 스마트 교통 분야 거버넌스 사례연구
5th row: 지능형 도시 구축을 위한 콜롬비아의 스마트 교통 사례연구

Value | Count | Frequency (%)
(missing in source) | 63 | 3.3%
위한 | 47 | 2.5%
대한 | 32 | 1.7%
스마트 | 23 | 1.2%
친환경 | 16 | 0.9%
건설 | 15 | 0.8%
(missing in source) | 14 | 0.7%
분석 | 13 | 0.7%
지속가능한 | 13 | 0.7%
영향 | 13 | 0.7%
Other values (1148) | 1632 | 86.8%

Most occurring characters

ValueCountFrequency (%)
1630
 
21.2%
183
 
2.4%
138
 
1.8%
118
 
1.5%
98
 
1.3%
98
 
1.3%
93
 
1.2%
87
 
1.1%
84
 
1.1%
82
 
1.1%
Other values (475) 5072
66.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5542 | 72.1%
Space Separator | 1630 | 21.2%
Uppercase Letter | 167 | 2.2%
Lowercase Letter | 126 | 1.6%
Decimal Number | 121 | 1.6%
Other Punctuation | 40 | 0.5%
Dash Punctuation | 29 | 0.4%
Open Punctuation | 9 | 0.1%
Close Punctuation | 9 | 0.1%
Connector Punctuation | 7 | 0.1%
Other values (3) | 3 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
183
 
3.3%
138
 
2.5%
118
 
2.1%
98
 
1.8%
98
 
1.8%
93
 
1.7%
87
 
1.6%
84
 
1.5%
82
 
1.5%
79
 
1.4%
Other values (412) 4482
80.9%
Lowercase Letter
ValueCountFrequency (%)
a 22
17.5%
i 16
12.7%
e 11
8.7%
l 10
 
7.9%
t 9
 
7.1%
o 8
 
6.3%
n 8
 
6.3%
r 7
 
5.6%
c 5
 
4.0%
d 4
 
3.2%
Other values (11) 26
20.6%
Uppercase Letter
ValueCountFrequency (%)
A 20
12.0%
C 18
10.8%
D 15
 
9.0%
O 12
 
7.2%
I 11
 
6.6%
V 10
 
6.0%
M 10
 
6.0%
T 9
 
5.4%
U 9
 
5.4%
E 9
 
5.4%
Other values (9) 44
26.3%
Decimal Number
ValueCountFrequency (%)
0 28
23.1%
1 26
21.5%
2 26
21.5%
9 9
 
7.4%
3 9
 
7.4%
4 8
 
6.6%
8 5
 
4.1%
5 5
 
4.1%
7 3
 
2.5%
6 2
 
1.7%
Other Punctuation
ValueCountFrequency (%)
: 19
47.5%
, 13
32.5%
. 5
 
12.5%
% 2
 
5.0%
· 1
 
2.5%
Space Separator
ValueCountFrequency (%)
1630
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 9
100.0%
Close Punctuation
ValueCountFrequency (%)
] 9
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 7
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5542 | 72.1%
Common | 1848 | 24.1%
Latin | 293 | 3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
183
 
3.3%
138
 
2.5%
118
 
2.1%
98
 
1.8%
98
 
1.8%
93
 
1.7%
87
 
1.6%
84
 
1.5%
82
 
1.5%
79
 
1.4%
Other values (412) 4482
80.9%
Latin
ValueCountFrequency (%)
a 22
 
7.5%
A 20
 
6.8%
C 18
 
6.1%
i 16
 
5.5%
D 15
 
5.1%
O 12
 
4.1%
I 11
 
3.8%
e 11
 
3.8%
V 10
 
3.4%
l 10
 
3.4%
Other values (30) 148
50.5%
Common
ValueCountFrequency (%)
1630
88.2%
- 29
 
1.6%
0 28
 
1.5%
1 26
 
1.4%
2 26
 
1.4%
: 19
 
1.0%
, 13
 
0.7%
[ 9
 
0.5%
9 9
 
0.5%
] 9
 
0.5%
Other values (13) 50
 
2.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5542 | 72.1%
ASCII | 2138 | 27.8%
Punctuation | 2 | < 0.1%
None | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1630
76.2%
- 29
 
1.4%
0 28
 
1.3%
1 26
 
1.2%
2 26
 
1.2%
a 22
 
1.0%
A 20
 
0.9%
: 19
 
0.9%
C 18
 
0.8%
i 16
 
0.7%
Other values (50) 304
 
14.2%
Hangul
ValueCountFrequency (%)
183
 
3.3%
138
 
2.5%
118
 
2.1%
98
 
1.8%
98
 
1.8%
93
 
1.7%
87
 
1.6%
84
 
1.5%
82
 
1.5%
79
 
1.4%
Other values (412) 4482
80.9%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
None
ValueCountFrequency (%)
· 1
100.0%

발행처 (publisher)
Text

Distinct: 184
Distinct (%): 68.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Length

Max length: 145
Median length: 71
Mean length: 38.473881
Min length: 3

Characters and Unicode

Total characters: 10311
Distinct characters: 137
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 162
Unique (%): 60.4%
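
Distinct and Unique differ for this variable (184 versus 162). A reading consistent with these figures is that Distinct counts the different values present, while Unique counts the values that occur exactly once. The sketch below illustrates that reading on toy values; the helper name `distinct_and_unique` and the toy list are assumptions, not part of the report.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Toy values: one publisher string repeats, the others appear once.
values = ["Journal A", "Journal B", "데이터 미집계", "데이터 미집계", "Journal C"]
distinct, unique = distinct_and_unique(values)
print(distinct, unique)

# The report's percentages are relative to the 268 observations.
distinct_pct = round(100 * 184 / 268, 1)
unique_pct = round(100 * 162 / 268, 1)
print(distinct_pct, unique_pct)
```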

Sample

1st row: Blockchain: Research and Applications
2nd row: 3Civil Engineering Department, Universiti Teknologi Petronas, Perak, Malaysia
3rd row: Advances in Building Energy Research
4th row: Ain Shams Engineering Journal 11 (2020) 2534
5th row: Ain Shams Engineering Journal xxx (xxxx) xxx

Value | Count | Frequency (%)
of | 81 | 6.0%
research | 51 | 3.8%
journal | 49 | 3.6%
and | 46 | 3.4%
engineering | 39 | 2.9%
(missing in source) | 30 | 2.2%
international | 27 | 2.0%
transportation | 25 | 1.8%
technology | 24 | 1.8%
board | 22 | 1.6%
Other values (432) | 961 | 70.9%

Most occurring characters

ValueCountFrequency (%)
1107
 
10.7%
n 732
 
7.1%
e 623
 
6.0%
i 585
 
5.7%
a 533
 
5.2%
o 505
 
4.9%
t 495
 
4.8%
r 478
 
4.6%
s 311
 
3.0%
c 275
 
2.7%
Other values (127) 4667
45.3%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 6174 | 59.9%
Uppercase Letter | 1618 | 15.7%
Space Separator | 1107 | 10.7%
Decimal Number | 643 | 6.2%
Other Punctuation | 325 | 3.2%
Other Letter | 240 | 2.3%
Close Punctuation | 92 | 0.9%
Open Punctuation | 92 | 0.9%
Dash Punctuation | 20 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
9.2%
18
 
7.5%
17
 
7.1%
16
 
6.7%
16
 
6.7%
16
 
6.7%
9
 
3.8%
8
 
3.3%
6
 
2.5%
6
 
2.5%
Other values (56) 106
44.2%
Lowercase Letter
ValueCountFrequency (%)
n 732
11.9%
e 623
10.1%
i 585
9.5%
a 533
 
8.6%
o 505
 
8.2%
t 495
 
8.0%
r 478
 
7.7%
s 311
 
5.0%
c 275
 
4.5%
l 229
 
3.7%
Other values (16) 1408
22.8%
Uppercase Letter
ValueCountFrequency (%)
R 181
11.2%
E 168
10.4%
T 160
9.9%
A 158
9.8%
S 142
8.8%
I 125
 
7.7%
N 99
 
6.1%
C 92
 
5.7%
O 88
 
5.4%
P 67
 
4.1%
Other values (15) 338
20.9%
Decimal Number
ValueCountFrequency (%)
0 168
26.1%
2 146
22.7%
1 132
20.5%
3 38
 
5.9%
9 35
 
5.4%
5 30
 
4.7%
6 28
 
4.4%
8 28
 
4.4%
4 24
 
3.7%
7 14
 
2.2%
Other Punctuation
ValueCountFrequency (%)
/ 111
34.2%
. 101
31.1%
, 59
18.2%
: 44
 
13.5%
& 6
 
1.8%
; 4
 
1.2%
Space Separator
ValueCountFrequency (%)
1107
100.0%
Close Punctuation
ValueCountFrequency (%)
) 92
100.0%
Open Punctuation
ValueCountFrequency (%)
( 92
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 7792 | 75.6%
Common | 2279 | 22.1%
Hangul | 230 | 2.2%
Han | 10 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
9.6%
18
 
7.8%
17
 
7.4%
16
 
7.0%
16
 
7.0%
16
 
7.0%
9
 
3.9%
8
 
3.5%
6
 
2.6%
6
 
2.6%
Other values (47) 96
41.7%
Latin
ValueCountFrequency (%)
n 732
 
9.4%
e 623
 
8.0%
i 585
 
7.5%
a 533
 
6.8%
o 505
 
6.5%
t 495
 
6.4%
r 478
 
6.1%
s 311
 
4.0%
c 275
 
3.5%
l 229
 
2.9%
Other values (41) 3026
38.8%
Common
ValueCountFrequency (%)
1107
48.6%
0 168
 
7.4%
2 146
 
6.4%
1 132
 
5.8%
/ 111
 
4.9%
. 101
 
4.4%
) 92
 
4.0%
( 92
 
4.0%
, 59
 
2.6%
: 44
 
1.9%
Other values (10) 227
 
10.0%
Han
ValueCountFrequency (%)
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 10071 | 97.7%
Hangul | 230 | 2.2%
CJK | 10 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1107
 
11.0%
n 732
 
7.3%
e 623
 
6.2%
i 585
 
5.8%
a 533
 
5.3%
o 505
 
5.0%
t 495
 
4.9%
r 478
 
4.7%
s 311
 
3.1%
c 275
 
2.7%
Other values (61) 4427
44.0%
Hangul
ValueCountFrequency (%)
22
 
9.6%
18
 
7.8%
17
 
7.4%
16
 
7.0%
16
 
7.0%
16
 
7.0%
9
 
3.9%
8
 
3.5%
6
 
2.6%
6
 
2.6%
Other values (47) 96
41.7%
CJK
ValueCountFrequency (%)
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

등록연도 (registration year)
Categorical

Distinct: 3
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB
2021: 113
2020: 86
2019: 69

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021
2nd row: 2020
3rd row: 2021
4th row: 2020
5th row: 2019

Common Values

Value | Count | Frequency (%)
2021 | 113 | 42.2%
2020 | 86 | 32.1%
2019 | 69 | 25.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Figure: bar plot of the value counts (same values as the Common Values table: 2021 113, 2020 86, 2019 69).

Correlations

Variable | 리포트 종류 | 등록연도
리포트 종류 | 1.000 | 0.284
등록연도 | 0.284 | 1.000

Variable | 리포트 종류 | 등록연도
리포트 종류 | 1.000 | 0.094
등록연도 | 0.094 | 1.000

Variable | 리포트 종류 | 등록연도
리포트 종류 | 1.000 | 0.094
등록연도 | 0.094 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
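
The per-column nullity view boils down to counting filled cells in each column. A minimal sketch, assuming `None` is the only missing marker; the helper name `completeness_by_column` and the toy rows are hypothetical, and only the column names come from this report:

```python
def completeness_by_column(rows, columns):
    """Count non-missing cells per column (hypothetical helper)."""
    return {
        col: sum(1 for row in rows if row.get(col) is not None)
        for col in columns
    }


# Column names from this report; the two rows are toy data.
columns = ["리포트 종류", "보고서 명", "발행처", "등록연도"]
rows = [
    {"리포트 종류": "기술리포트", "보고서 명": "toy title 1",
     "발행처": "toy publisher", "등록연도": "2021"},
    {"리포트 종류": "정책리포트", "보고서 명": "toy title 2",
     "발행처": None, "등록연도": "2020"},
]
filled = completeness_by_column(rows, columns)
for col in columns:
    print(col, filled[col], "of", len(rows))
```

In the profiled dataset every column reports 0 missing cells. Note that placeholder strings such as 데이터 미집계 in 발행처 are ordinary values under this kind of count, not missing cells.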

Sample

(index) | 리포트 종류 | 보고서 명 | 발행처 | 등록연도
0 | 기술리포트 | 대중교통에서 자기신원검증 사례 기반의 블록체인 지원 분산식별 관리방법 | Blockchain: Research and Applications | 2021
1 | 정책리포트 | 친환경 건축 재료의 환경 평가가 지속가능한 등급 시스템에 미치는 영향 | 3Civil Engineering Department, Universiti Teknologi Petronas, Perak, Malaysia | 2020
2 | 기술리포트 | 새로운 건설 및 철거 폐기물 처리 및 재사용과 재활용 극대화를 위한 용도 | Advances in Building Energy Research | 2021
3 | 정책리포트 | Colombia Bogota의 지능형 도시 및 스마트 교통 분야 거버넌스 사례연구 | Ain Shams Engineering Journal 11 (2020) 2534 | 2020
4 | 기술리포트 | 지능형 도시 구축을 위한 콜롬비아의 스마트 교통 사례연구 | Ain Shams Engineering Journal xxx (xxxx) xxx | 2019
5 | 기술리포트 | 복잡한 배관 설비에 대한 지능형 교육 플랫폼으로서의 BIM 기반 AR 유지관리 시스템 | applied sciences | 2021
6 | 정책리포트 | 스마트시티로 진화하기 위한 바르셀로나의 급진적인 시도 [자동차 수 억제 정책] | BBC News World 6 january 2020 | 2020
7 | 기술리포트 | 교통 분야의 빅 데이터에 대한 체계적인 문헌 검토: 개념 및 응용 | Big DataResearch17(2019)3544 | 2020
8 | 기술리포트 | 포트하커트에 폐기물 관리용으로 폐플라스틱병 벽돌 사용에 대한 정량적 분석 | Biodiversity international journal | 2021
9 | 정책리포트 | 건설산업의 미래 혁신을 위한 전략과제와 문제의 재정립 | Building Information Modelling (BIM) in Design, Construction and Operations | 2021

(index) | 리포트 종류 | 보고서 명 | 발행처 | 등록연도
258 | 기술리포트 | 수소연료전지열차실용화 | 데이터 미집계 | 2021
259 | 기술리포트 | 남아프리카 공화국 일부 선정된 도시에 풍력발전 수소충전소 최적설계 | 데이터 미집계 | 2021
260 | 기술리포트 | 글로벌 항공교통시스템을 선도하는 차세대 안전·레질리언스 프로그램 | 데이터 미집계 | 2021
261 | 기술리포트 | 영국 교통 비전 2050 미래 모빌리티에 투자하기 | 데이터 미집계 | 2021
262 | 기술리포트 | IEA 세계 에너지 전망 2021 | 데이터 미집계 | 2021
263 | 기술리포트 | 탄소중립시대 천연가스 배관망의 활용방안 | 데이터 미집계 | 2021
264 | 기술리포트 | 철도역사 수소에너지 공급시스템 | 데이터 미집계 | 2021
265 | 기술리포트 | 교량충격 대응에 대한 사례연구-미국 주별 관행 | 데이터 미집계 | 2021
266 | 기술리포트 | 티핑포인트 에너지저장시스템 | 데이터 미집계 | 2021
267 | 기술리포트 | 2021년 교통 및 환경통계 연차보고서 | 데이터 미집계 | 2021