Overview

Dataset statistics

Number of variables3
Number of observations9955
Missing cells0
Missing cells (%)0.0%
Duplicate rows33
Duplicate rows (%)0.3%
Total size in memory233.4 KiB
Average record size in memory24.0 B

Variable types

DateTime1
Text2

Dataset

Description국가공무원의 국외출장보고서 목록입니다. 국외출장보고서 등록일, 국외출장보고서 제목, 국외출장보고서 등록기관 자료를 제공합니다.
Author인사혁신처
URLhttps://www.data.go.kr/data/15085750/fileData.do

Alerts

Dataset has 33 (0.3%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-14 13:32:46.704398
Analysis finished2024-03-14 13:32:48.107445
Duration1.4 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct693
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size77.9 KiB
Minimum2021-01-05 00:00:00
Maximum2023-06-30 00:00:00
2024-03-14T22:32:48.254443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T22:32:48.516161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct9738
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size77.9 KiB
2024-03-14T22:32:49.735129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length120
Median length102
Mean length32.3334
Min length2

Characters and Unicode

Total characters321879
Distinct characters963
Distinct categories18 ?
Distinct scripts4 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9567 ?
Unique (%)96.1%

Sample

1st row2023 PNS annual meeting 참여
2nd row국제 포럼(International Forum of Japonica Rice with Good Quality) 참석
3rd row공무국외출장보고서 및 증빙서류
4th row사기범죄 대응체계 개선을 위한 싱가포르 공무국외출장 결과보고
5th row2023 GCTF 국제워크숍 참석
ValueCountFrequency (%)
3694
 
5.4%
참석 2645
 
3.9%
위한 1316
 
1.9%
2022 929
 
1.4%
참가 713
 
1.0%
발표 695
 
1.0%
결과 618
 
0.9%
보고서 578
 
0.9%
출장 528
 
0.8%
2023 526
 
0.8%
Other values (18032) 55742
82.0%
2024-03-14T22:32:51.244339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
58029
 
18.0%
2 8126
 
2.5%
5226
 
1.6%
4459
 
1.4%
e 4409
 
1.4%
4104
 
1.3%
n 4098
 
1.3%
4067
 
1.3%
3712
 
1.2%
3489
 
1.1%
Other values (953) 222160
69.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 180358
56.0%
Space Separator 58029
 
18.0%
Lowercase Letter 35276
 
11.0%
Uppercase Letter 24638
 
7.7%
Decimal Number 15605
 
4.8%
Close Punctuation 2230
 
0.7%
Open Punctuation 2230
 
0.7%
Other Punctuation 1908
 
0.6%
Dash Punctuation 1236
 
0.4%
Math Symbol 103
 
< 0.1%
Other values (8) 266
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5226
 
2.9%
4459
 
2.5%
4104
 
2.3%
4067
 
2.3%
3712
 
2.1%
3489
 
1.9%
3341
 
1.9%
3173
 
1.8%
2984
 
1.7%
2859
 
1.6%
Other values (844) 142944
79.3%
Lowercase Letter
ValueCountFrequency (%)
e 4409
12.5%
n 4098
11.6%
o 3310
9.4%
i 3021
8.6%
a 2931
 
8.3%
t 2655
 
7.5%
r 2559
 
7.3%
s 1727
 
4.9%
c 1588
 
4.5%
l 1588
 
4.5%
Other values (16) 7390
20.9%
Uppercase Letter
ValueCountFrequency (%)
C 2861
11.6%
A 2505
 
10.2%
I 2456
 
10.0%
S 2281
 
9.3%
E 2076
 
8.4%
M 1354
 
5.5%
P 1243
 
5.0%
T 1161
 
4.7%
O 1154
 
4.7%
R 898
 
3.6%
Other values (16) 6649
27.0%
Other Punctuation
ValueCountFrequency (%)
, 789
41.4%
. 302
 
15.8%
/ 248
 
13.0%
' 163
 
8.5%
· 144
 
7.5%
& 141
 
7.4%
: 49
 
2.6%
" 42
 
2.2%
* 9
 
0.5%
# 6
 
0.3%
Other values (4) 15
 
0.8%
Decimal Number
ValueCountFrequency (%)
2 8126
52.1%
0 3181
 
20.4%
3 1519
 
9.7%
1 1127
 
7.2%
4 342
 
2.2%
5 324
 
2.1%
6 280
 
1.8%
7 275
 
1.8%
8 218
 
1.4%
9 213
 
1.4%
Close Punctuation
ValueCountFrequency (%)
) 2158
96.8%
] 30
 
1.3%
25
 
1.1%
15
 
0.7%
1
 
< 0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 2156
96.7%
[ 31
 
1.4%
26
 
1.2%
15
 
0.7%
1
 
< 0.1%
1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 38
36.9%
+ 33
32.0%
> 16
15.5%
< 15
 
14.6%
1
 
1.0%
Letter Number
ValueCountFrequency (%)
7
70.0%
2
 
20.0%
1
 
10.0%
Final Punctuation
ValueCountFrequency (%)
43
51.2%
41
48.8%
Initial Punctuation
ValueCountFrequency (%)
40
60.6%
26
39.4%
Other Symbol
ValueCountFrequency (%)
7
70.0%
3
30.0%
Other Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
58029
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1236
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 91
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Control
ValueCountFrequency (%)
 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 180249
56.0%
Common 81597
25.4%
Latin 59924
 
18.6%
Han 109
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5226
 
2.9%
4459
 
2.5%
4104
 
2.3%
4067
 
2.3%
3712
 
2.1%
3489
 
1.9%
3341
 
1.9%
3173
 
1.8%
2984
 
1.7%
2859
 
1.6%
Other values (790) 142835
79.2%
Latin
ValueCountFrequency (%)
e 4409
 
7.4%
n 4098
 
6.8%
o 3310
 
5.5%
i 3021
 
5.0%
a 2931
 
4.9%
C 2861
 
4.8%
t 2655
 
4.4%
r 2559
 
4.3%
A 2505
 
4.2%
I 2456
 
4.1%
Other values (45) 29119
48.6%
Common
ValueCountFrequency (%)
58029
71.1%
2 8126
 
10.0%
0 3181
 
3.9%
) 2158
 
2.6%
( 2156
 
2.6%
3 1519
 
1.9%
- 1236
 
1.5%
1 1127
 
1.4%
, 789
 
1.0%
4 342
 
0.4%
Other values (44) 2934
 
3.6%
Han
ValueCountFrequency (%)
20
 
18.3%
7
 
6.4%
6
 
5.5%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.8%
2
 
1.8%
2
 
1.8%
Other values (44) 57
52.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 180234
56.0%
ASCII 141118
43.8%
None 230
 
0.1%
Punctuation 150
 
< 0.1%
CJK 108
 
< 0.1%
Compat Jamo 15
 
< 0.1%
Number Forms 10
 
< 0.1%
Geometric Shapes 10
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
58029
41.1%
2 8126
 
5.8%
e 4409
 
3.1%
n 4098
 
2.9%
o 3310
 
2.3%
0 3181
 
2.3%
i 3021
 
2.1%
a 2931
 
2.1%
C 2861
 
2.0%
t 2655
 
1.9%
Other values (77) 48497
34.4%
Hangul
ValueCountFrequency (%)
5226
 
2.9%
4459
 
2.5%
4104
 
2.3%
4067
 
2.3%
3712
 
2.1%
3489
 
1.9%
3341
 
1.9%
3173
 
1.8%
2984
 
1.7%
2859
 
1.6%
Other values (788) 142820
79.2%
None
ValueCountFrequency (%)
· 144
62.6%
26
 
11.3%
25
 
10.9%
15
 
6.5%
15
 
6.5%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
Punctuation
ValueCountFrequency (%)
43
28.7%
41
27.3%
40
26.7%
26
17.3%
CJK
ValueCountFrequency (%)
20
 
18.5%
7
 
6.5%
6
 
5.6%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (43) 56
51.9%
Compat Jamo
ValueCountFrequency (%)
14
93.3%
1
 
6.7%
Number Forms
ValueCountFrequency (%)
7
70.0%
2
 
20.0%
1
 
10.0%
Geometric Shapes
ValueCountFrequency (%)
7
70.0%
3
30.0%
Arrows
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct75
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size77.9 KiB
2024-03-14T22:32:51.958893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length3
Mean length3.9245605
Min length3

Characters and Unicode

Total characters39069
Distinct characters129
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)0.1%

Sample

1st row교육부
2nd row농촌진흥청
3rd row교육부
4th row경찰청
5th row국민권익위원회
ValueCountFrequency (%)
교육부 5209
52.3%
국방부 555
 
5.6%
방위사업청 338
 
3.4%
농촌진흥청 337
 
3.4%
문화체육관광부 282
 
2.8%
국토교통부 280
 
2.8%
농림축산식품부 278
 
2.8%
기획재정부 207
 
2.1%
과학기술정보통신부 157
 
1.6%
경찰청 155
 
1.6%
Other values (51) 2157
21.7%
2024-03-14T22:32:52.894353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7880
20.2%
5491
 
14.1%
5489
 
14.0%
1682
 
4.3%
1066
 
2.7%
942
 
2.4%
659
 
1.7%
647
 
1.7%
621
 
1.6%
554
 
1.4%
Other values (119) 14038
35.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 38841
99.4%
Space Separator 140
 
0.4%
Close Punctuation 43
 
0.1%
Open Punctuation 43
 
0.1%
Decimal Number 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7880
20.3%
5491
 
14.1%
5489
 
14.1%
1682
 
4.3%
1066
 
2.7%
942
 
2.4%
659
 
1.7%
647
 
1.7%
621
 
1.6%
554
 
1.4%
Other values (115) 13810
35.6%
Space Separator
ValueCountFrequency (%)
140
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%
Open Punctuation
ValueCountFrequency (%)
( 43
100.0%
Decimal Number
ValueCountFrequency (%)
4 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38841
99.4%
Common 228
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7880
20.3%
5491
 
14.1%
5489
 
14.1%
1682
 
4.3%
1066
 
2.7%
942
 
2.4%
659
 
1.7%
647
 
1.7%
621
 
1.6%
554
 
1.4%
Other values (115) 13810
35.6%
Common
ValueCountFrequency (%)
140
61.4%
) 43
 
18.9%
( 43
 
18.9%
4 2
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38841
99.4%
ASCII 228
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7880
20.3%
5491
 
14.1%
5489
 
14.1%
1682
 
4.3%
1066
 
2.7%
942
 
2.4%
659
 
1.7%
647
 
1.7%
621
 
1.6%
554
 
1.4%
Other values (115) 13810
35.6%
ASCII
ValueCountFrequency (%)
140
61.4%
) 43
 
18.9%
( 43
 
18.9%
4 2
 
0.9%

Missing values

2024-03-14T22:32:47.743354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T22:32:47.990007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

등록일국외출장보고서 목록부처명
02023-06-302023 PNS annual meeting 참여교육부
12023-06-30국제 포럼(International Forum of Japonica Rice with Good Quality) 참석농촌진흥청
22023-06-30공무국외출장보고서 및 증빙서류교육부
32023-06-30사기범죄 대응체계 개선을 위한 싱가포르 공무국외출장 결과보고경찰청
42023-06-302023 GCTF 국제워크숍 참석국민권익위원회
52023-06-30한.태국 예술교류지원사업 현지모니터링 및 국제협업방안 협의교육부
62023-06-30싱가포르 국립대 및 난양공대 에너지 센터 방문 협의에 관한 보고서교육부
72023-06-30공무국외출장(미국 AIAA Aviation 학회 참석)교육부
82023-06-3013th Advances in Cement Based Materials 학회 참석교육부
92023-06-30UN 제7회 PKO 기술협력 심포지움 (PKO관련 경험 공유, 혁신적 접근과 해결책 식별)국방부
등록일국외출장보고서 목록부처명
99452021-01-13캐나다 워털루대학교 기계학습관련 공동연구 파견교육부
99462021-01-12컴퓨팅 인텔리전스 국제학술회의교육부
99472021-01-12연구년에 해외 공동연구 진행교육부
99482021-01-12해외파견_연구년제연구교수교육부
99492021-01-11비행검사용 항공기 CL601-3R) 한정자격 취득국토교통부
99502021-01-07캄보디아 영농기술전수를 통한 농업생산성 증대사업 시설운영 PMC용역교육부
99512021-01-06미국 오레곤대학교 장기 연수교육부
99522021-01-05UAE 사막 벼 2차 실증시험 관련 토양평가 및 자문농촌진흥청
99532021-01-05통계 영역 교수 학습 및 평가 개선 방안 탐색교육부
99542021-01-05위험의 불평등성: 소도시의 코로나19 확산교육부

Duplicate rows

Most frequently occurring

등록일국외출장보고서 목록부처명# duplicates
22021-09-06카자흐스탄 안장 독립유공자 유해봉환국가보훈처3
62022-08-01항공사 역학조사 수행을 위한 국외출장고용노동부3
02021-08-05Georgia Southern University와 공동연구 추진교육부2
12021-09-02해외출장보고교육부2
32021-09-072021 SAGES 학회 발표 및 참석교육부2
42021-10-01UN DPO 초청 교관을 위한 교육참관국방부2
52022-05-17World Hydrogen 2022 참가를 통한 수소산업 기술개발과 발전 동향 파악교육부2
72022-08-02우즈베키스탄 국립아동병원 건립사업 컨설팅서비스 사후관리 현지방문 교육교육부2
82022-08-03해외 교류협력 대학과의 Mou 체결 및 협력논의, 파란사다리 사업 참가학생 인솔 및 안전관리를 위한 국외출장 결과보고서교육부2
92022-08-082022 코소보 세계 대학 핸드볼 월드컵 출전에 따른 국외 출장 보고교육부2