Overview

Dataset statistics

Number of variables3
Number of observations2812
Missing cells5
Missing cells (%)0.1%
Duplicate rows14
Duplicate rows (%)0.5%
Total size in memory66.0 KiB
Average record size in memory24.0 B

Variable types

Categorical1
Text2

Dataset

Description한국국제협력단 봉사단원이 파견된 기관에 대한 정보 데이터로 국가명, 파견기관명(국문), 파견기관명(영문) 항목을 제공합니다.
Author한국국제협력단
URLhttps://www.data.go.kr/data/15076582/fileData.do

Alerts

Dataset has 14 (0.5%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-14 14:34:42.785715
Analysis finished2024-03-14 14:34:43.856201
Duration1.07 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

국가명
Categorical

Distinct41
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size22.1 KiB
태국
254 
에콰도르
245 
몽골
193 
페루
 
174
도미니카공화국
 
138
Other values (36)
1808 

Length

Max length7
Median length6
Mean length3.4459459
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row엘살바도르
2nd row태국
3rd row캄보디아
4th row캄보디아
5th row캄보디아

Common Values

ValueCountFrequency (%)
태국 254
 
9.0%
에콰도르 245
 
8.7%
몽골 193
 
6.9%
페루 174
 
6.2%
도미니카공화국 138
 
4.9%
네팔 121
 
4.3%
우즈베키스탄 118
 
4.2%
르완다 116
 
4.1%
탄자니아 114
 
4.1%
피지 108
 
3.8%
Other values (31) 1231
43.8%

Length

2024-03-14T23:34:44.072983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
태국 254
 
9.0%
에콰도르 245
 
8.7%
몽골 193
 
6.9%
페루 174
 
6.2%
도미니카공화국 138
 
4.9%
네팔 121
 
4.3%
우즈베키스탄 118
 
4.2%
르완다 116
 
4.1%
탄자니아 114
 
4.1%
피지 108
 
3.8%
Other values (31) 1231
43.8%
Distinct2711
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size22.1 KiB
2024-03-14T23:34:45.180883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length39
Mean length11.611309
Min length3

Characters and Unicode

Total characters32651
Distinct characters789
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2652 ?
Unique (%)94.3%

Sample

1st row엘살바도르 적십자
2nd row피분송크람 라차팟 대학교
3rd row프레이벵 교육청
4th row꺼꽁 교육청
5th row파일린 교육청
ValueCountFrequency (%)
대학교 141
 
2.0%
시청 131
 
1.8%
초등학교 95
 
1.3%
학교 94
 
1.3%
병원 65
 
0.9%
국립 63
 
0.9%
고등학교 58
 
0.8%
직업훈련원 53
 
0.7%
아이막 51
 
0.7%
유치원 50
 
0.7%
Other values (3314) 6361
88.8%
2024-03-14T23:34:46.758772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4612
 
14.1%
1297
 
4.0%
1221
 
3.7%
668
 
2.0%
535
 
1.6%
491
 
1.5%
488
 
1.5%
488
 
1.5%
480
 
1.5%
432
 
1.3%
Other values (779) 21939
67.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27095
83.0%
Space Separator 4612
 
14.1%
Decimal Number 257
 
0.8%
Uppercase Letter 243
 
0.7%
Close Punctuation 141
 
0.4%
Open Punctuation 140
 
0.4%
Lowercase Letter 101
 
0.3%
Other Punctuation 47
 
0.1%
Connector Punctuation 6
 
< 0.1%
Other Symbol 3
 
< 0.1%
Other values (3) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1297
 
4.8%
1221
 
4.5%
668
 
2.5%
535
 
2.0%
491
 
1.8%
488
 
1.8%
488
 
1.8%
480
 
1.8%
432
 
1.6%
398
 
1.5%
Other values (710) 20597
76.0%
Lowercase Letter
ValueCountFrequency (%)
a 11
10.9%
i 10
 
9.9%
o 9
 
8.9%
n 9
 
8.9%
t 8
 
7.9%
e 7
 
6.9%
l 6
 
5.9%
f 5
 
5.0%
r 5
 
5.0%
d 5
 
5.0%
Other values (12) 26
25.7%
Uppercase Letter
ValueCountFrequency (%)
S 29
11.9%
A 27
11.1%
I 25
10.3%
N 23
9.5%
E 21
8.6%
T 16
 
6.6%
C 15
 
6.2%
O 13
 
5.3%
R 12
 
4.9%
M 12
 
4.9%
Other values (11) 50
20.6%
Decimal Number
ValueCountFrequency (%)
1 67
26.1%
2 40
15.6%
3 35
13.6%
4 30
11.7%
5 22
 
8.6%
6 18
 
7.0%
9 13
 
5.1%
8 11
 
4.3%
0 11
 
4.3%
7 10
 
3.9%
Other Punctuation
ValueCountFrequency (%)
, 30
63.8%
* 6
 
12.8%
. 4
 
8.5%
& 2
 
4.3%
; 2
 
4.3%
/ 2
 
4.3%
# 1
 
2.1%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
4612
100.0%
Close Punctuation
ValueCountFrequency (%)
) 141
100.0%
Open Punctuation
ValueCountFrequency (%)
( 140
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Math Symbol
ValueCountFrequency (%)
3
100.0%
Format
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27095
83.0%
Common 5210
 
16.0%
Latin 346
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1297
 
4.8%
1221
 
4.5%
668
 
2.5%
535
 
2.0%
491
 
1.8%
488
 
1.8%
488
 
1.8%
480
 
1.8%
432
 
1.6%
398
 
1.5%
Other values (710) 20597
76.0%
Latin
ValueCountFrequency (%)
S 29
 
8.4%
A 27
 
7.8%
I 25
 
7.2%
N 23
 
6.6%
E 21
 
6.1%
T 16
 
4.6%
C 15
 
4.3%
O 13
 
3.8%
R 12
 
3.5%
M 12
 
3.5%
Other values (35) 153
44.2%
Common
ValueCountFrequency (%)
4612
88.5%
) 141
 
2.7%
( 140
 
2.7%
1 67
 
1.3%
2 40
 
0.8%
3 35
 
0.7%
, 30
 
0.6%
4 30
 
0.6%
5 22
 
0.4%
6 18
 
0.3%
Other values (14) 75
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27095
83.0%
ASCII 5547
 
17.0%
Letterlike Symbols 3
 
< 0.1%
Math Operators 3
 
< 0.1%
Number Forms 2
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4612
83.1%
) 141
 
2.5%
( 140
 
2.5%
1 67
 
1.2%
2 40
 
0.7%
3 35
 
0.6%
, 30
 
0.5%
4 30
 
0.5%
S 29
 
0.5%
A 27
 
0.5%
Other values (54) 396
 
7.1%
Hangul
ValueCountFrequency (%)
1297
 
4.8%
1221
 
4.5%
668
 
2.5%
535
 
2.0%
491
 
1.8%
488
 
1.8%
488
 
1.8%
480
 
1.8%
432
 
1.6%
398
 
1.5%
Other values (710) 20597
76.0%
Letterlike Symbols
ValueCountFrequency (%)
3
100.0%
Math Operators
ValueCountFrequency (%)
3
100.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
Distinct2721
Distinct (%)96.9%
Missing5
Missing (%)0.2%
Memory size22.1 KiB
2024-03-14T23:34:48.006849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length192
Median length100
Mean length39.212683
Min length4

Characters and Unicode

Total characters110070
Distinct characters92
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2650 ?
Unique (%)94.4%

Sample

1st rowSalvadoran Red Cross
2nd rowPibulsongkram Rajabhat University
3rd rowPrey Veng Provincial Department of Education, Youth and Sport
4th rowKoh Kong Provincial Department of Education, Youth and Sport
5th rowPailin Provincial Department of Education, Youth and Sport
ValueCountFrequency (%)
of 1012
 
7.2%
school 442
 
3.1%
and 429
 
3.0%
university 302
 
2.1%
ministry 248
 
1.8%
de 232
 
1.6%
center 231
 
1.6%
national 177
 
1.3%
college 160
 
1.1%
education 158
 
1.1%
Other values (3538) 10740
76.0%
2024-03-14T23:34:49.769330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11359
 
10.3%
a 9033
 
8.2%
i 7999
 
7.3%
e 7694
 
7.0%
o 7675
 
7.0%
n 7669
 
7.0%
t 6132
 
5.6%
r 5618
 
5.1%
l 4583
 
4.2%
c 3611
 
3.3%
Other values (82) 38697
35.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 81692
74.2%
Uppercase Letter 14353
 
13.0%
Space Separator 11359
 
10.3%
Other Punctuation 984
 
0.9%
Open Punctuation 609
 
0.6%
Close Punctuation 606
 
0.6%
Decimal Number 413
 
0.4%
Other Letter 17
 
< 0.1%
Control 13
 
< 0.1%
Connector Punctuation 13
 
< 0.1%
Other values (3) 11
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 9033
11.1%
i 7999
9.8%
e 7694
9.4%
o 7675
9.4%
n 7669
9.4%
t 6132
 
7.5%
r 5618
 
6.9%
l 4583
 
5.6%
c 3611
 
4.4%
s 3117
 
3.8%
Other values (16) 18561
22.7%
Uppercase Letter
ValueCountFrequency (%)
S 1571
 
10.9%
C 1349
 
9.4%
M 1045
 
7.3%
A 1033
 
7.2%
T 996
 
6.9%
P 891
 
6.2%
D 806
 
5.6%
E 782
 
5.4%
N 755
 
5.3%
I 730
 
5.1%
Other values (16) 4395
30.6%
Other Letter
ValueCountFrequency (%)
3
17.6%
3
17.6%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (3) 3
17.6%
Decimal Number
ValueCountFrequency (%)
1 87
21.1%
3 74
17.9%
9 57
13.8%
2 49
11.9%
4 37
9.0%
5 29
 
7.0%
0 26
 
6.3%
6 23
 
5.6%
7 18
 
4.4%
8 13
 
3.1%
Other Punctuation
ValueCountFrequency (%)
, 411
41.8%
; 204
20.7%
& 172
17.5%
. 140
 
14.2%
# 42
 
4.3%
/ 10
 
1.0%
: 3
 
0.3%
' 2
 
0.2%
Letter Number
ValueCountFrequency (%)
2
50.0%
2
50.0%
Space Separator
ValueCountFrequency (%)
11359
100.0%
Open Punctuation
ValueCountFrequency (%)
( 609
100.0%
Close Punctuation
ValueCountFrequency (%)
) 606
100.0%
Control
ValueCountFrequency (%)
13
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 13
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 96049
87.3%
Common 14004
 
12.7%
Hangul 17
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 9033
 
9.4%
i 7999
 
8.3%
e 7694
 
8.0%
o 7675
 
8.0%
n 7669
 
8.0%
t 6132
 
6.4%
r 5618
 
5.8%
l 4583
 
4.8%
c 3611
 
3.8%
s 3117
 
3.2%
Other values (44) 32918
34.3%
Common
ValueCountFrequency (%)
11359
81.1%
( 609
 
4.3%
) 606
 
4.3%
, 411
 
2.9%
; 204
 
1.5%
& 172
 
1.2%
. 140
 
1.0%
1 87
 
0.6%
3 74
 
0.5%
9 57
 
0.4%
Other values (15) 285
 
2.0%
Hangul
ValueCountFrequency (%)
3
17.6%
3
17.6%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (3) 3
17.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 110042
> 99.9%
Hangul 17
 
< 0.1%
Letterlike Symbols 6
 
< 0.1%
Number Forms 4
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11359
 
10.3%
a 9033
 
8.2%
i 7999
 
7.3%
e 7694
 
7.0%
o 7675
 
7.0%
n 7669
 
7.0%
t 6132
 
5.6%
r 5618
 
5.1%
l 4583
 
4.2%
c 3611
 
3.3%
Other values (65) 38669
35.1%
Letterlike Symbols
ValueCountFrequency (%)
6
100.0%
Hangul
ValueCountFrequency (%)
3
17.6%
3
17.6%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (3) 3
17.6%
Number Forms
ValueCountFrequency (%)
2
50.0%
2
50.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

Missing values

2024-03-14T23:34:43.499094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T23:34:43.751166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

국가명기관명_한글기관명_영문
0엘살바도르엘살바도르 적십자Salvadoran Red Cross
1태국피분송크람 라차팟 대학교Pibulsongkram Rajabhat University
2캄보디아프레이벵 교육청Prey Veng Provincial Department of Education, Youth and Sport
3캄보디아꺼꽁 교육청Koh Kong Provincial Department of Education, Youth and Sport
4캄보디아파일린 교육청Pailin Provincial Department of Education, Youth and Sport
5에티오피아연방경찰병원Federal Police Hospital
6에콰도르에스메랄다스 체육협회Provincial Sports Federation of Esmeraldas
7라오스교육체육국 사완나켓Department of Education and Sports (Savannakhet Province)
8탄자니아마피가 중등학교Mafiga Secondary School
9탄자니아톤고니 중고등학교Tongoni Secondary School
국가명기관명_한글기관명_영문
2802에콰도르임바부라 주청 복지센터Provincial Health Center of Imbabura
2803필리핀알코이 지방 정부(LGU Alcoy, Cebu) Local Government Unit Municipality of Alcoy
2804에티오피아아드와 교사양성학교Adwa Colllege of Teachers Education
2805에티오피아아드와 중등학교Adwa Nigiste Saba Secondary School
2806에콰도르로메오 무리죠 파스미뇨 초등학교Elementary School Romeo Murillo Pazmino
2807에콰도르파스타사 주청 복지센터Provincial Health Center of Pastaza
2808라오스공공사업교통부Ministry of Public Works and Transport
2809우간다우간다 개발청Uganda Development Corporation
2810요르단문화부(파인아트센터)Ministry of Culture(Fine Art Center)
2811베트남베트남 여성연맹Vietnam Women&#39;s Union

Duplicate rows

Most frequently occurring

국가명기관명_한글기관명_영문# duplicates
2동티모르체육청소년청년스포츠사무국Secretary of State for Youth and Sport3
0가나호 티칭 병원Ho teaching hospital (Ho)2
1네팔버럿풀 보건과학대학Bharatpur School of Health Sciences2
3방글라데시청년개발청 다카Department of Youth Development(수도)2
4베트남공안부 경찰 아카데미People&#39;s Police Academy2
5스리랑카기능학교 반다라웰라Technical College (Bandarawela)2
6스리랑카한스직업훈련원 사푸스간다Sri Lanka Youth Training Centre (Korean Tech) (Sapugaskanda)2
7에콰도르센트로 파올라 디 로사Paola Di Rosa Center2
8온두라스국립산림부 국립보존협회ICF_Instituto de Conservacion Forestal2
9탄자니아부간도 메디컬센터Bugando Medical Centre2