Overview

Dataset statistics

Number of variables: 14
Number of observations: 10000
Missing cells: 13508
Missing cells (%): 9.6%
Duplicate rows: 2
Duplicate rows (%): < 0.1%
Total size in memory: 1.2 MiB
Average record size in memory: 122.0 B
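The figures above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and the per-variable missing counts are copied from the variable sections and alerts of this report.

```python
# Figures copied from the "Dataset statistics" table of this report.
n_variables = 14
n_observations = 10_000
missing_cells = 13_508
avg_record_bytes = 122.0

# Missing cells as a share of all cells (variables x observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Total size implied by the average record size (1 MiB = 2**20 bytes).
total_mib = avg_record_bytes * n_observations / 2**20

# Per-variable missing counts listed elsewhere in this report.
per_variable_missing = [31, 33, 6533, 2646, 2648, 1617]

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Total size: {total_mib:.1f} MiB")
print(f"Sum of per-variable missing counts: {sum(per_variable_missing)}")
```

The per-variable counts add up to the reported total, which suggests that the `<NA>` entries shown as ordinary values in some categorical variables were not counted as missing cells.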

Variable types

Text: 6
Categorical: 5
DateTime: 2
Numeric: 1

Dataset

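Description: 1. KOICA-ODA project information and KF public diplomacy project information list lookup: retrieves the list of KOICA-ODA project information and KF public diplomacy project information by Korean country name or ISO country code (use the ISO country codes in Reference 1) and by Korean project name
Author: 한국국제협력단 (Korea International Cooperation Agency, KOICA)
URL: https://www.data.go.kr/data/15099254/fileData.do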

Alerts

Dataset has 2 (< 0.1%) duplicate rows  [Duplicates]
다년구분코드명 is highly overall correlated with 사업유형코드 and 2 other fields  [High correlation]
다년구분코드 is highly overall correlated with 사업유형코드 and 2 other fields  [High correlation]
사업유형명 is highly overall correlated with 사업유형코드 and 2 other fields  [High correlation]
사업유형코드 is highly overall correlated with 사업유형명 and 2 other fields  [High correlation]
사업유형코드 is highly imbalanced (88.5%)  [Imbalance]
사업유형명 is highly imbalanced (88.5%)  [Imbalance]
다년구분코드 is highly imbalanced (59.8%)  [Imbalance]
다년구분코드명 is highly imbalanced (59.8%)  [Imbalance]
사업명(영문) has 6533 (65.3%) missing values  [Missing]
사업시작일 has 2646 (26.5%) missing values  [Missing]
사업종료일 has 2648 (26.5%) missing values  [Missing]
수혜기관명 has 1617 (16.2%) missing values  [Missing]
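The imbalance percentages in these alerts can be reproduced from the value counts listed in the variable sections. The sketch below assumes the score is one minus the Shannon entropy of the class shares, normalised by the logarithm of the number of classes. The report does not state the formula, but this assumption matches the 88.5% and 59.8% figures. The function name `imbalance_score` is made up for this example.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class shares and k is the number of classes.

    Assumed formula: 0 means perfectly balanced, 1 means a single class.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Value counts copied from the variable sections of this report.
type_code_counts = [9845, 155]                    # 사업유형코드: 1, 2
multiyear_counts = [7385, 2232, 233, 85, 64, 1]   # 다년구분코드: S, M, <NA>, MC, MN, SN

print(f"사업유형코드: {100 * imbalance_score(type_code_counts):.1f}%")
print(f"다년구분코드: {100 * imbalance_score(multiyear_counts):.1f}%")
```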

Reproduction

Analysis started: 2023-12-12 19:59:43.757361
Analysis finished: 2023-12-12 19:59:46.488736
Duration: 2.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

Distinct: 122
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 10
Median length: 9
Mean length: 3.0189
Min length: 2

Characters and Unicode

Total characters: 30189
Distinct characters: 144
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
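As a rough illustration of the category counts reported in these sections, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own implementation. `CATEGORY_NAMES` is a partial, hand-written mapping from two-letter category codes to the names used in this report. Scripts and blocks are not exposed by `unicodedata` and are left out.

```python
import unicodedata
from collections import Counter

# Partial mapping from general-category codes to the names used in this report
# (an assumption for illustration; unmapped codes are kept as-is).
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Sample values taken from the "Sample" rows of this report.
print(category_counts(["대한민국", "캐나다", "미국"]))
print(category_counts(["United States of America"]))
```

Hangul syllables fall under "Other Letter", which is consistent with the single category reported for a variable that contains only Korean country names.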

Unique

Unique: 11
Unique (%): 0.1%

Sample

1st row: 대한민국
2nd row: 캐나다
3rd row: 미국
4th row: 슬로베니아
5th row: 우크라이나
Value  Count  Frequency (%)
대한민국  2685  26.9%
미국  2450  24.5%
중국  529  5.3%
러시아  347  3.5%
일본  275  2.8%
독일  265  2.6%
베트남  233  2.3%
영국  230  2.3%
캐나다  161  1.6%
호주  161  1.6%
Other values (112)  2664  26.6%

Most occurring characters

ValueCountFrequency (%)
6029
20.0%
2685
 
8.9%
2685
 
8.9%
2685
 
8.9%
2499
 
8.3%
1059
 
3.5%
696
 
2.3%
557
 
1.8%
546
 
1.8%
530
 
1.8%
Other values (134) 10218
33.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30189
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6029
20.0%
2685
 
8.9%
2685
 
8.9%
2685
 
8.9%
2499
 
8.3%
1059
 
3.5%
696
 
2.3%
557
 
1.8%
546
 
1.8%
530
 
1.8%
Other values (134) 10218
33.8%

Most occurring scripts

ValueCountFrequency (%)
Hangul 30189
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6029
20.0%
2685
 
8.9%
2685
 
8.9%
2685
 
8.9%
2499
 
8.3%
1059
 
3.5%
696
 
2.3%
557
 
1.8%
546
 
1.8%
530
 
1.8%
Other values (134) 10218
33.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30189
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6029
20.0%
2685
 
8.9%
2685
 
8.9%
2685
 
8.9%
2499
 
8.3%
1059
 
3.5%
696
 
2.3%
557
 
1.8%
546
 
1.8%
530
 
1.8%
Other values (134) 10218
33.8%
Distinct: 121
Distinct (%): 1.2%
Missing: 31
Missing (%): 0.3%
Memory size: 156.2 KiB

Length

Max length: 26
Median length: 24
Mean length: 10.702779
Min length: 3

Characters and Unicode

Total characters: 106696
Distinct characters: 55
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 11
Unique (%): 0.1%

Sample

1st row: Korea
2nd row: Canada
3rd row: United States of America
4th row: Slovenia
5th row: Ukraine
Value  Count  Frequency (%)
united  2695  15.2%
korea  2685  15.1%
states  2450  13.8%
of  2450  13.8%
america  2450  13.8%
china  529  3.0%
russia  347  2.0%
japan  275  1.5%
germany  265  1.5%
vietnam  233  1.3%
Other values (128)  3393  19.1%
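The entries in this table are lowercase single words rather than whole country names, which suggests the counts are taken over tokens. A minimal sketch of such a word count, assuming lowercasing followed by a whitespace split, is shown below. The profiler's actual tokenisation is not stated in the report and may differ, for example in how it handles punctuation. The name `word_counts` is made up for this example.

```python
from collections import Counter


def word_counts(values):
    """Count lowercase, whitespace-separated tokens; None entries are skipped."""
    counts = Counter()
    for value in values:
        if value is None:
            continue
        counts.update(value.lower().split())
    return counts


# The five sample rows shown for this variable, plus one missing entry.
sample = ["Korea", "Canada", "United States of America", "Slovenia", "Ukraine", None]
counts = word_counts(sample)
print(counts.most_common(3))
```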

Most occurring characters

ValueCountFrequency (%)
a 13404
12.6%
e 12096
11.3%
t 8530
 
8.0%
i 8370
 
7.8%
7803
 
7.3%
r 6647
 
6.2%
n 6148
 
5.8%
o 6103
 
5.7%
s 3942
 
3.7%
d 3780
 
3.5%
Other values (45) 29873
28.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 83479
78.2%
Uppercase Letter 15376
 
14.4%
Space Separator 7803
 
7.3%
Other Punctuation 35
 
< 0.1%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 13404
16.1%
e 12096
14.5%
t 8530
10.2%
i 8370
10.0%
r 6647
8.0%
n 6148
7.4%
o 6103
7.3%
s 3942
 
4.7%
d 3780
 
4.5%
m 3474
 
4.2%
Other values (17) 10985
13.2%
Uppercase Letter
ValueCountFrequency (%)
K 3000
19.5%
U 2780
18.1%
A 2779
18.1%
S 2753
17.9%
C 889
 
5.8%
R 439
 
2.9%
I 430
 
2.8%
G 306
 
2.0%
J 302
 
2.0%
T 293
 
1.9%
Other values (13) 1405
9.1%
Other Punctuation
ValueCountFrequency (%)
: 15
42.9%
' 15
42.9%
& 5
 
14.3%
Space Separator
ValueCountFrequency (%)
7803
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 98855
92.7%
Common 7841
 
7.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 13404
13.6%
e 12096
12.2%
t 8530
 
8.6%
i 8370
 
8.5%
r 6647
 
6.7%
n 6148
 
6.2%
o 6103
 
6.2%
s 3942
 
4.0%
d 3780
 
3.8%
m 3474
 
3.5%
Other values (40) 26361
26.7%
Common
ValueCountFrequency (%)
7803
99.5%
: 15
 
0.2%
' 15
 
0.2%
& 5
 
0.1%
- 3
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 106681
> 99.9%
None 15
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 13404
12.6%
e 12096
11.3%
t 8530
 
8.0%
i 8370
 
7.8%
7803
 
7.3%
r 6647
 
6.2%
n 6148
 
5.8%
o 6103
 
5.7%
s 3942
 
3.7%
d 3780
 
3.5%
Other values (44) 29858
28.0%
None
ValueCountFrequency (%)
ô 15
100.0%
Distinct: 122
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Characters and Unicode

Total characters: 20000
Distinct characters: 26
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 11
Unique (%): 0.1%

Sample

1st row: KR
2nd row: CA
3rd row: US
4th row: SI
5th row: UA
Value  Count  Frequency (%)
kr  2685  26.9%
us  2450  24.5%
cn  529  5.3%
ru  347  3.5%
jp  275  2.8%
de  265  2.6%
vn  233  2.3%
gb  230  2.3%
ca  161  1.6%
au  161  1.6%
Other values (112)  2664  26.6%

Most occurring characters

ValueCountFrequency (%)
R 3480
17.4%
U 3078
15.4%
K 2899
14.5%
S 2692
13.5%
N 1079
 
5.4%
C 894
 
4.5%
E 628
 
3.1%
A 580
 
2.9%
I 477
 
2.4%
T 463
 
2.3%
Other values (16) 3730
18.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 20000
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
R 3480
17.4%
U 3078
15.4%
K 2899
14.5%
S 2692
13.5%
N 1079
 
5.4%
C 894
 
4.5%
E 628
 
3.1%
A 580
 
2.9%
I 477
 
2.4%
T 463
 
2.3%
Other values (16) 3730
18.6%

Most occurring scripts

ValueCountFrequency (%)
Latin 20000
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
R 3480
17.4%
U 3078
15.4%
K 2899
14.5%
S 2692
13.5%
N 1079
 
5.4%
C 894
 
4.5%
E 628
 
3.1%
A 580
 
2.9%
I 477
 
2.4%
T 463
 
2.3%
Other values (16) 3730
18.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
R 3480
17.4%
U 3078
15.4%
K 2899
14.5%
S 2692
13.5%
N 1079
 
5.4%
C 894
 
4.5%
E 628
 
3.1%
A 580
 
2.9%
I 477
 
2.4%
T 463
 
2.3%
Other values (16) 3730
18.6%

대륙명
Categorical

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

아시아: 4661
북아메리카: 2802
유럽: 1965
호주(오세아니아): 190
남아메리카: 187
Other values (2): 195

Length

Max length: 9
Median length: 5
Mean length: 3.5348
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 아시아
2nd row: 북아메리카
3rd row: 북아메리카
4th row: 유럽
5th row: 유럽

Common Values

Value  Count  Frequency (%)
아시아  4661  46.6%
북아메리카  2802  28.0%
유럽  1965  19.7%
호주(오세아니아)  190  1.9%
남아메리카  187  1.9%
아프리카  164  1.6%
<NA>  31  0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
아시아  4661  46.6%
북아메리카  2802  28.0%
유럽  1965  19.7%
호주(오세아니아  190  1.9%
남아메리카  187  1.9%
아프리카  164  1.6%
na  31  0.3%

사업유형코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

1: 9845
2: 155

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value  Count  Frequency (%)
1  9845  98.5%
2  155  1.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
1  9845  98.5%
2  155  1.6%

사업유형명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

KF: 9845
KOICA: 155

Length

Max length: 5
Median length: 2
Mean length: 2.0465
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: KF
2nd row: KF
3rd row: KF
4th row: KF
5th row: KF

Common Values

Value  Count  Frequency (%)
KF  9845  98.5%
KOICA  155  1.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
kf  9845  98.5%
koica  155  1.6%
Distinct: 7972
Distinct (%): 80.0%
Missing: 33
Missing (%): 0.3%
Memory size: 156.2 KiB

Length

Max length: 125
Median length: 89
Mean length: 24.001806
Min length: 3

Characters and Unicode

Total characters: 239226
Distinct characters: 926
Distinct categories: 17
Distinct scripts: 7
Distinct blocks: 10
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6918
Unique (%): 69.4%

Sample

1st row: 2003년도 뉴스레터 국문 8월
2nd row: 2007년도 UBC 한국법 기금교수직 설치(2/3)
3rd row: 1995 조지워싱턴대 동아시아연구센터한국연구프로그램운영
4th row: [전자자료지원] 2019 슬로베니아 류블라냐대
5th row: [지자체] <이주와 정주의 삶>
ValueCountFrequency (%)
한국어 1112
 
2.5%
미국 1108
 
2.5%
한국학 883
 
2.0%
객원교수 839
 
1.9%
지원 793
 
1.8%
뉴스레터 353
 
0.8%
중국 300
 
0.7%
289
 
0.7%
운영 278
 
0.6%
설치 275
 
0.6%
Other values (9775) 37788
85.8%

Most occurring characters

ValueCountFrequency (%)
34388
 
14.4%
7677
 
3.2%
5761
 
2.4%
2 5222
 
2.2%
0 5143
 
2.1%
4580
 
1.9%
3822
 
1.6%
1 3692
 
1.5%
3307
 
1.4%
] 3025
 
1.3%
Other values (916) 162609
68.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 138932
58.1%
Space Separator 34388
 
14.4%
Decimal Number 19830
 
8.3%
Lowercase Letter 19138
 
8.0%
Uppercase Letter 10485
 
4.4%
Close Punctuation 5692
 
2.4%
Open Punctuation 5691
 
2.4%
Dash Punctuation 2298
 
1.0%
Other Punctuation 2014
 
0.8%
Math Symbol 552
 
0.2%
Other values (7) 206
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7677
 
5.5%
5761
 
4.1%
4580
 
3.3%
3822
 
2.8%
3307
 
2.4%
2804
 
2.0%
2672
 
1.9%
2670
 
1.9%
2415
 
1.7%
2210
 
1.6%
Other values (816) 101014
72.7%
Lowercase Letter
ValueCountFrequency (%)
e 2305
12.0%
i 2079
10.9%
o 1780
9.3%
n 1743
9.1%
t 1641
8.6%
a 1637
8.6%
r 1454
 
7.6%
s 1212
 
6.3%
l 864
 
4.5%
u 617
 
3.2%
Other values (19) 3806
19.9%
Uppercase Letter
ValueCountFrequency (%)
S 1243
11.9%
C 1055
 
10.1%
I 963
 
9.2%
A 917
 
8.7%
K 756
 
7.2%
U 679
 
6.5%
F 574
 
5.5%
E 549
 
5.2%
T 545
 
5.2%
P 492
 
4.7%
Other values (16) 2712
25.9%
Other Punctuation
ValueCountFrequency (%)
/ 954
47.4%
, 358
 
17.8%
. 187
 
9.3%
' 140
 
7.0%
: 137
 
6.8%
" 119
 
5.9%
& 54
 
2.7%
· 51
 
2.5%
? 9
 
0.4%
! 4
 
0.2%
Decimal Number
ValueCountFrequency (%)
2 5222
26.3%
0 5143
25.9%
1 3692
18.6%
9 1622
 
8.2%
5 934
 
4.7%
3 714
 
3.6%
8 685
 
3.5%
6 670
 
3.4%
7 641
 
3.2%
4 507
 
2.6%
Close Punctuation
ValueCountFrequency (%)
] 3025
53.1%
) 2624
46.1%
34
 
0.6%
9
 
0.2%
Open Punctuation
ValueCountFrequency (%)
[ 3025
53.2%
( 2623
46.1%
34
 
0.6%
9
 
0.2%
Math Symbol
ValueCountFrequency (%)
> 272
49.3%
< 271
49.1%
~ 5
 
0.9%
+ 4
 
0.7%
Initial Punctuation
ValueCountFrequency (%)
16
53.3%
14
46.7%
Final Punctuation
ValueCountFrequency (%)
15
51.7%
14
48.3%
Letter Number
ValueCountFrequency (%)
5
83.3%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
34388
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2298
100.0%
Control
ValueCountFrequency (%)
94
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 44
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 138902
58.1%
Common 70663
29.5%
Latin 29627
 
12.4%
Han 29
 
< 0.1%
Hiragana 3
 
< 0.1%
Greek 1
 
< 0.1%
Cyrillic 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7677
 
5.5%
5761
 
4.1%
4580
 
3.3%
3822
 
2.8%
3307
 
2.4%
2804
 
2.0%
2672
 
1.9%
2670
 
1.9%
2415
 
1.7%
2210
 
1.6%
Other values (791) 100984
72.7%
Latin
ValueCountFrequency (%)
e 2305
 
7.8%
i 2079
 
7.0%
o 1780
 
6.0%
n 1743
 
5.9%
t 1641
 
5.5%
a 1637
 
5.5%
r 1454
 
4.9%
S 1243
 
4.2%
s 1212
 
4.1%
C 1055
 
3.6%
Other values (45) 13478
45.5%
Common
ValueCountFrequency (%)
34388
48.7%
2 5222
 
7.4%
0 5143
 
7.3%
1 3692
 
5.2%
] 3025
 
4.3%
[ 3025
 
4.3%
) 2624
 
3.7%
( 2623
 
3.7%
- 2298
 
3.3%
9 1622
 
2.3%
Other values (32) 7001
 
9.9%
Han
ValueCountFrequency (%)
5
 
17.2%
2
 
6.9%
2
 
6.9%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
Other values (13) 13
44.8%
Hiragana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Greek
ValueCountFrequency (%)
ο 1
100.0%
Cyrillic
ValueCountFrequency (%)
о 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 138891
58.1%
ASCII 100086
41.8%
None 141
 
0.1%
Punctuation 59
 
< 0.1%
CJK 29
 
< 0.1%
Compat Jamo 9
 
< 0.1%
Number Forms 6
 
< 0.1%
Hiragana 3
 
< 0.1%
Katakana 1
 
< 0.1%
Cyrillic 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
34388
34.4%
2 5222
 
5.2%
0 5143
 
5.1%
1 3692
 
3.7%
] 3025
 
3.0%
[ 3025
 
3.0%
) 2624
 
2.6%
( 2623
 
2.6%
e 2305
 
2.3%
- 2298
 
2.3%
Other values (74) 35741
35.7%
Hangul
ValueCountFrequency (%)
7677
 
5.5%
5761
 
4.1%
4580
 
3.3%
3822
 
2.8%
3307
 
2.4%
2804
 
2.0%
2672
 
1.9%
2670
 
1.9%
2415
 
1.7%
2210
 
1.6%
Other values (787) 100973
72.7%
None
ValueCountFrequency (%)
· 51
36.2%
34
24.1%
34
24.1%
9
 
6.4%
9
 
6.4%
2
 
1.4%
ô 1
 
0.7%
ο 1
 
0.7%
Punctuation
ValueCountFrequency (%)
16
27.1%
15
25.4%
14
23.7%
14
23.7%
Compat Jamo
ValueCountFrequency (%)
6
66.7%
2
 
22.2%
1
 
11.1%
Number Forms
ValueCountFrequency (%)
5
83.3%
1
 
16.7%
CJK
ValueCountFrequency (%)
5
 
17.2%
2
 
6.9%
2
 
6.9%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
Other values (13) 13
44.8%
Katakana
ValueCountFrequency (%)
1
100.0%
Hiragana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Cyrillic
ValueCountFrequency (%)
о 1
100.0%

사업명(영문)
Text

MISSING 

Distinct: 2035
Distinct (%): 58.7%
Missing: 6533
Missing (%): 65.3%
Memory size: 156.2 KiB

Length

Max length: 203
Median length: 147
Mean length: 46.26882
Min length: 3

Characters and Unicode

Total characters: 160414
Distinct characters: 100
Distinct categories: 15
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1606
Unique (%): 46.3%

Sample

1st row: 2003 Newsletter Korean August
2nd row: 2007 Establishment of Professorships Program
3rd row: Analysing Korea's Central Role in Northeast Asian Affairs
4th row: The Second Korea-Japan Journalist and Expert Dialogue
5th row: Opera <The Wedding>
Value  Count  Frequency (%)
of  1395  6.3%
program  1079  4.8%
the  716  3.2%
korean  710  3.2%
for  471  2.1%
and  427  1.9%
visiting  377  1.7%
korea  351  1.6%
staff  342  1.5%
teaching  341  1.5%
Other values (3003)  16058  72.1%

Most occurring characters

ValueCountFrequency (%)
18824
 
11.7%
e 12319
 
7.7%
o 11246
 
7.0%
r 10874
 
6.8%
a 9686
 
6.0%
n 8811
 
5.5%
i 8774
 
5.5%
t 8305
 
5.2%
s 8205
 
5.1%
l 4249
 
2.6%
Other values (90) 59121
36.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 111966
69.8%
Uppercase Letter 19897
 
12.4%
Space Separator 18824
 
11.7%
Decimal Number 7506
 
4.7%
Other Punctuation 967
 
0.6%
Dash Punctuation 570
 
0.4%
Open Punctuation 257
 
0.2%
Close Punctuation 254
 
0.2%
Math Symbol 63
 
< 0.1%
Final Punctuation 45
 
< 0.1%
Other values (5) 65
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 12319
11.0%
o 11246
10.0%
r 10874
9.7%
a 9686
 
8.7%
n 8811
 
7.9%
i 8774
 
7.8%
t 8305
 
7.4%
s 8205
 
7.3%
l 4249
 
3.8%
m 3772
 
3.4%
Other values (16) 25725
23.0%
Uppercase Letter
ValueCountFrequency (%)
P 2515
12.6%
S 1988
 
10.0%
E 1775
 
8.9%
K 1527
 
7.7%
A 1284
 
6.5%
T 1200
 
6.0%
C 1186
 
6.0%
N 994
 
5.0%
R 879
 
4.4%
F 871
 
4.4%
Other values (16) 5678
28.5%
Other Punctuation
ValueCountFrequency (%)
: 214
22.1%
, 203
21.0%
. 150
15.5%
" 144
14.9%
' 137
14.2%
& 79
 
8.2%
/ 21
 
2.2%
; 10
 
1.0%
? 4
 
0.4%
! 4
 
0.4%
Decimal Number
ValueCountFrequency (%)
0 2596
34.6%
2 1747
23.3%
1 1149
15.3%
9 819
 
10.9%
8 245
 
3.3%
7 236
 
3.1%
3 197
 
2.6%
6 189
 
2.5%
5 184
 
2.5%
4 144
 
1.9%
Other Letter
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Math Symbol
ValueCountFrequency (%)
< 26
41.3%
> 24
38.1%
| 8
 
12.7%
+ 5
 
7.9%
Open Punctuation
ValueCountFrequency (%)
( 144
56.0%
[ 112
43.6%
1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 141
55.5%
] 112
44.1%
1
 
0.4%
Letter Number
ValueCountFrequency (%)
3
42.9%
2
28.6%
2
28.6%
Dash Punctuation
ValueCountFrequency (%)
- 569
99.8%
1
 
0.2%
Initial Punctuation
ValueCountFrequency (%)
31
77.5%
9
 
22.5%
Final Punctuation
ValueCountFrequency (%)
27
60.0%
18
40.0%
Space Separator
ValueCountFrequency (%)
18824
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 12
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 131870
82.2%
Common 28539
 
17.8%
Han 5
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 12319
 
9.3%
o 11246
 
8.5%
r 10874
 
8.2%
a 9686
 
7.3%
n 8811
 
6.7%
i 8774
 
6.7%
t 8305
 
6.3%
s 8205
 
6.2%
l 4249
 
3.2%
m 3772
 
2.9%
Other values (45) 45629
34.6%
Common
ValueCountFrequency (%)
18824
66.0%
0 2596
 
9.1%
2 1747
 
6.1%
1 1149
 
4.0%
9 819
 
2.9%
- 569
 
2.0%
8 245
 
0.9%
7 236
 
0.8%
: 214
 
0.7%
, 203
 
0.7%
Other values (30) 1937
 
6.8%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 160314
99.9%
Punctuation 86
 
0.1%
Number Forms 7
 
< 0.1%
CJK 5
 
< 0.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18824
 
11.7%
e 12319
 
7.7%
o 11246
 
7.0%
r 10874
 
6.8%
a 9686
 
6.0%
n 8811
 
5.5%
i 8774
 
5.5%
t 8305
 
5.2%
s 8205
 
5.1%
l 4249
 
2.7%
Other values (75) 59021
36.8%
Punctuation
ValueCountFrequency (%)
31
36.0%
27
31.4%
18
20.9%
9
 
10.5%
1
 
1.2%
Number Forms
ValueCountFrequency (%)
3
42.9%
2
28.6%
2
28.6%
CJK
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%

사업시작일
Date

MISSING 

Distinct: 2133
Distinct (%): 29.0%
Missing: 2646
Missing (%): 26.5%
Memory size: 156.2 KiB
Minimum: 1992-01-01 00:00:00
Maximum: 2040-08-21 00:00:00
Histogram with fixed size bins (bins=50)

사업종료일
Date

MISSING 

Distinct: 2124
Distinct (%): 28.9%
Missing: 2648
Missing (%): 26.5%
Memory size: 156.2 KiB
Minimum: 1992-02-27 00:00:00
Maximum: 2024-07-31 00:00:00
Histogram with fixed size bins (bins=50)
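The reported maximum of 사업시작일 (2040-08-21) is later than the reported maximum of 사업종료일 (2024-07-31), so at least one start date may deserve a closer look. The report itself does not flag this, and the sketch below is only one possible check. The function name, the example rows, the pairing of the two maxima in one row, and the `reference_date` cutoff are all hypothetical.

```python
from datetime import date


def flag_date_issues(rows, reference_date):
    """Return (row_index, reason) pairs for suspicious start/end dates.

    rows: iterable of (start, end) tuples holding date objects or None.
    """
    issues = []
    for index, (start, end) in enumerate(rows):
        if start is not None and start > reference_date:
            issues.append((index, "start date after reference date"))
        if start is not None and end is not None and start > end:
            issues.append((index, "start date after end date"))
    return issues


# Hypothetical rows; only the individual extreme dates come from the report.
rows = [
    (date(1992, 1, 1), date(1992, 2, 27)),
    (date(2040, 8, 21), date(2024, 7, 31)),
    (None, None),  # missing dates are skipped, not flagged
]
issues = flag_date_issues(rows, reference_date=date(2024, 12, 31))
print(issues)
```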

다년구분코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

S: 7385
M: 2232
<NA>: 233
MC: 85
MN: 64

Length

Max length: 4
Median length: 1
Mean length: 1.0849
Min length: 1

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: S
2nd row: S
3rd row: S
4th row: S
5th row: S

Common Values

Value  Count  Frequency (%)
S  7385  73.9%
M  2232  22.3%
<NA>  233  2.3%
MC  85  0.9%
MN  64  0.6%
SN  1  < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
s  7385  73.9%
m  2232  22.3%
na  233  2.3%
mc  85  0.9%
mn  64  0.6%
sn  1  < 0.1%

다년구분코드명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

단년: 7385
다년: 2232
<NA>: 233
다년계속: 85
다년신규: 64

Length

Max length: 4
Median length: 2
Mean length: 2.0766
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 단년
2nd row: 단년
3rd row: 단년
4th row: 단년
5th row: 단년

Common Values

Value  Count  Frequency (%)
단년  7385  73.9%
다년  2232  22.3%
<NA>  233  2.3%
다년계속  85  0.9%
다년신규  64  0.6%
단년신규  1  < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
단년  7385  73.9%
다년  2232  22.3%
na  233  2.3%
다년계속  85  0.9%
다년신규  64  0.6%
단년신규  1  < 0.1%

수혜기관명
Text

MISSING 

Distinct: 3003
Distinct (%): 35.8%
Missing: 1617
Missing (%): 16.2%
Memory size: 156.2 KiB

Length

Max length: 100
Median length: 88
Mean length: 14.640344
Min length: 2

Characters and Unicode

Total characters: 122730
Distinct characters: 1154
Distinct categories: 18
Distinct scripts: 15
Distinct blocks: 18
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1631
Unique (%): 19.5%

Sample

1st row: 한국국제교류재단
2nd row: 브리티시컬럼비아대(UBC)
3rd row: 조지워싱턴대
4th row: Univerza v Ljubljani, Fakulteta za družbene vede, Raziskovalno središče za Vzhodno Azijo (EARL)
5th row: Andong Culture & Art Center
ValueCountFrequency (%)
of 577
 
3.1%
university 564
 
3.1%
한국국제교류재단 381
 
2.1%
357
 
1.9%
대사관 333
 
1.8%
대한민국 319
 
1.7%
and 194
 
1.1%
미국 170
 
0.9%
for 162
 
0.9%
studies 157
 
0.9%
Other values (3971) 15208
82.6%

Most occurring characters

ValueCountFrequency (%)
10094
 
8.2%
i 4783
 
3.9%
e 4669
 
3.8%
4225
 
3.4%
n 4100
 
3.3%
a 4037
 
3.3%
t 3596
 
2.9%
r 3515
 
2.9%
o 3112
 
2.5%
3081
 
2.5%
Other values (1144) 77518
63.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53123
43.3%
Lowercase Letter 44651
36.4%
Uppercase Letter 11311
 
9.2%
Space Separator 10094
 
8.2%
Close Punctuation 1123
 
0.9%
Open Punctuation 1117
 
0.9%
Other Punctuation 594
 
0.5%
Dash Punctuation 417
 
0.3%
Decimal Number 152
 
0.1%
Nonspacing Mark 86
 
0.1%
Other values (8) 62
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4225
 
8.0%
3081
 
5.8%
1826
 
3.4%
1819
 
3.4%
1651
 
3.1%
1213
 
2.3%
1161
 
2.2%
1064
 
2.0%
987
 
1.9%
951
 
1.8%
Other values (873) 35145
66.2%
Lowercase Letter
ValueCountFrequency (%)
i 4783
10.7%
e 4669
10.5%
n 4100
 
9.2%
a 4037
 
9.0%
t 3596
 
8.1%
r 3515
 
7.9%
o 3112
 
7.0%
s 2803
 
6.3%
l 1621
 
3.6%
u 1502
 
3.4%
Other values (117) 10913
24.4%
Uppercase Letter
ValueCountFrequency (%)
U 1266
 
11.2%
S 1112
 
9.8%
C 1070
 
9.5%
A 1020
 
9.0%
I 764
 
6.8%
L 600
 
5.3%
N 494
 
4.4%
E 470
 
4.2%
M 450
 
4.0%
K 393
 
3.5%
Other values (64) 3672
32.5%
Nonspacing Mark
ValueCountFrequency (%)
12
14.0%
̣ 9
10.5%
9
10.5%
7
8.1%
6
 
7.0%
6
 
7.0%
6
 
7.0%
̀ 6
 
7.0%
5
 
5.8%
4
 
4.7%
Other values (11) 16
18.6%
Other Punctuation
ValueCountFrequency (%)
, 311
52.4%
. 118
 
19.9%
/ 39
 
6.6%
& 32
 
5.4%
· 25
 
4.2%
' 25
 
4.2%
; 20
 
3.4%
" 11
 
1.9%
6
 
1.0%
: 4
 
0.7%
Other values (2) 3
 
0.5%
Decimal Number
ValueCountFrequency (%)
1 42
27.6%
2 35
23.0%
7 23
15.1%
0 14
 
9.2%
3 10
 
6.6%
5 9
 
5.9%
8 8
 
5.3%
4 7
 
4.6%
9 3
 
2.0%
6 1
 
0.7%
Spacing Mark
ValueCountFrequency (%)
ि 8
29.6%
7
25.9%
6
22.2%
2
 
7.4%
2
 
7.4%
2
 
7.4%
Math Symbol
ValueCountFrequency (%)
< 2
33.3%
> 2
33.3%
+ 1
16.7%
~ 1
16.7%
Open Punctuation
ValueCountFrequency (%)
( 1110
99.4%
5
 
0.4%
[ 2
 
0.2%
Final Punctuation
ValueCountFrequency (%)
» 1
33.3%
1
33.3%
1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 1121
99.8%
] 2
 
0.2%
Dash Punctuation
ValueCountFrequency (%)
- 416
99.8%
1
 
0.2%
Initial Punctuation
ValueCountFrequency (%)
6
85.7%
« 1
 
14.3%
Space Separator
ValueCountFrequency (%)
10094
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Modifier Letter
ValueCountFrequency (%)
6
100.0%
Format
ValueCountFrequency (%)
4
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 53837
43.9%
Hangul 51592
42.0%
Common 13520
 
11.0%
Cyrillic 1949
 
1.6%
Han 755
 
0.6%
Hebrew 231
 
0.2%
Thai 220
 
0.2%
Arabic 211
 
0.2%
Armenian 155
 
0.1%
Sinhala 74
 
0.1%
Other values (5) 186
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4225
 
8.2%
3081
 
6.0%
1826
 
3.5%
1819
 
3.5%
1651
 
3.2%
1213
 
2.4%
1161
 
2.3%
1064
 
2.1%
987
 
1.9%
951
 
1.8%
Other values (587) 33614
65.2%
Han
ValueCountFrequency (%)
66
 
8.7%
59
 
7.8%
34
 
4.5%
32
 
4.2%
25
 
3.3%
22
 
2.9%
20
 
2.6%
18
 
2.4%
17
 
2.3%
17
 
2.3%
Other values (148) 445
58.9%
Latin
ValueCountFrequency (%)
i 4783
 
8.9%
e 4669
 
8.7%
n 4100
 
7.6%
a 4037
 
7.5%
t 3596
 
6.7%
r 3515
 
6.5%
o 3112
 
5.8%
s 2803
 
5.2%
l 1621
 
3.0%
u 1502
 
2.8%
Other values (102) 20099
37.3%
Cyrillic
ValueCountFrequency (%)
е 190
 
9.7%
и 183
 
9.4%
т 157
 
8.1%
н 152
 
7.8%
а 132
 
6.8%
о 127
 
6.5%
с 127
 
6.5%
р 99
 
5.1%
в 88
 
4.5%
к 82
 
4.2%
Other values (41) 612
31.4%
Common
ValueCountFrequency (%)
10094
74.7%
) 1121
 
8.3%
( 1110
 
8.2%
- 416
 
3.1%
, 311
 
2.3%
. 118
 
0.9%
1 42
 
0.3%
/ 39
 
0.3%
2 35
 
0.3%
& 32
 
0.2%
Other values (31) 202
 
1.5%
Thai
ValueCountFrequency (%)
28
 
12.7%
20
 
9.1%
16
 
7.3%
12
 
5.5%
12
 
5.5%
12
 
5.5%
11
 
5.0%
9
 
4.1%
9
 
4.1%
8
 
3.6%
Other values (25) 83
37.7%
Armenian
ValueCountFrequency (%)
Ա 42
27.1%
Ե 13
 
8.4%
Ն 12
 
7.7%
Կ 9
 
5.8%
Ր 9
 
5.8%
ա 7
 
4.5%
Լ 6
 
3.9%
Տ 6
 
3.9%
Ի 6
 
3.9%
Վ 6
 
3.9%
Other values (18) 39
25.2%
Devanagari
ValueCountFrequency (%)
ि 8
11.9%
7
 
10.4%
7
 
10.4%
6
 
9.0%
6
 
9.0%
5
 
7.5%
3
 
4.5%
2
 
3.0%
2
 
3.0%
2
 
3.0%
Other values (15) 19
28.4%
Arabic
ValueCountFrequency (%)
ا 44
20.9%
ل 28
13.3%
ة 18
8.5%
م 16
 
7.6%
ن 15
 
7.1%
ع 14
 
6.6%
ي 13
 
6.2%
س 11
 
5.2%
ج 8
 
3.8%
د 6
 
2.8%
Other values (13) 38
18.0%
Lao
ValueCountFrequency (%)
9
19.6%
5
 
10.9%
4
 
8.7%
3
 
6.5%
3
 
6.5%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
1
 
2.2%
Other values (13) 13
28.3%
Hebrew
ValueCountFrequency (%)
י 46
19.9%
ר 22
9.5%
ב 20
8.7%
ו 19
8.2%
א 19
8.2%
ה 14
 
6.1%
ס 13
 
5.6%
נ 12
 
5.2%
ת 12
 
5.2%
ט 12
 
5.2%
Other values (11) 42
18.2%
Sinhala
ValueCountFrequency (%)
12
16.2%
6
 
8.1%
6
 
8.1%
6
 
8.1%
6
 
8.1%
6
 
8.1%
4
 
5.4%
4
 
5.4%
4
 
5.4%
2
 
2.7%
Other values (9) 18
24.3%
Georgian
ValueCountFrequency (%)
5
23.8%
4
19.0%
2
 
9.5%
2
 
9.5%
2
 
9.5%
2
 
9.5%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Katakana
ValueCountFrequency (%)
10
30.3%
8
24.2%
8
24.2%
2
 
6.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
1
 
3.0%
Inherited
ValueCountFrequency (%)
̣ 9
47.4%
̀ 6
31.6%
4
21.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 66904
54.5%
Hangul 51584
42.0%
Cyrillic 1949
 
1.6%
CJK 755
 
0.6%
None 343
 
0.3%
Hebrew 231
 
0.2%
Thai 220
 
0.2%
Arabic 211
 
0.2%
Armenian 155
 
0.1%
Latin Ext Additional 85
 
0.1%
Other values (8) 293
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10094
15.1%
i 4783
 
7.1%
e 4669
 
7.0%
n 4100
 
6.1%
a 4037
 
6.0%
t 3596
 
5.4%
r 3515
 
5.3%
o 3112
 
4.7%
s 2803
 
4.2%
l 1621
 
2.4%
Other values (72) 24574
36.7%
Hangul
ValueCountFrequency (%)
4225
 
8.2%
3081
 
6.0%
1826
 
3.5%
1819
 
3.5%
1651
 
3.2%
1213
 
2.4%
1161
 
2.3%
1064
 
2.1%
987
 
1.9%
951
 
1.8%
Other values (586) 33606
65.1%
Cyrillic
ValueCountFrequency (%)
е 190
 
9.7%
и 183
 
9.4%
т 157
 
8.1%
н 152
 
7.8%
а 132
 
6.8%
о 127
 
6.5%
с 127
 
6.5%
р 99
 
5.1%
в 88
 
4.5%
к 82
 
4.2%
Other values (41) 612
31.4%
CJK
ValueCountFrequency (%)
66
 
8.7%
59
 
7.8%
34
 
4.5%
32
 
4.2%
25
 
3.3%
22
 
2.9%
20
 
2.6%
18
 
2.4%
17
 
2.3%
17
 
2.3%
Other values (148) 445
58.9%
Hebrew
ValueCountFrequency (%)
י 46
19.9%
ר 22
9.5%
ב 20
8.7%
ו 19
8.2%
א 19
8.2%
ה 14
 
6.1%
ס 13
 
5.6%
נ 12
 
5.2%
ת 12
 
5.2%
ט 12
 
5.2%
Other values (11) 42
18.2%
Arabic
ValueCountFrequency (%)
ا 44
20.9%
ل 28
13.3%
ة 18
8.5%
م 16
 
7.6%
ن 15
 
7.1%
ع 14
 
6.6%
ي 13
 
6.2%
س 11
 
5.2%
ج 8
 
3.8%
د 6
 
2.8%
Other values (13) 38
18.0%
Armenian
ValueCountFrequency (%)
Ա 42
27.1%
Ե 13
 
8.4%
Ն 12
 
7.7%
Կ 9
 
5.8%
Ր 9
 
5.8%
ա 7
 
4.5%
Լ 6
 
3.9%
Տ 6
 
3.9%
Ի 6
 
3.9%
Վ 6
 
3.9%
Other values (18) 39
25.2%
None
ValueCountFrequency (%)
ä 29
 
8.5%
· 25
 
7.3%
á 25
 
7.3%
à 25
 
7.3%
é 24
 
7.0%
Đ 22
 
6.4%
ü 18
 
5.2%
ö 16
 
4.7%
š 11
 
3.2%
ư 10
 
2.9%
Other values (41) 138
40.2%
Thai
ValueCountFrequency (%)
28
 
12.7%
20
 
9.1%
16
 
7.3%
12
 
5.5%
12
 
5.5%
12
 
5.5%
11
 
5.0%
9
 
4.1%
9
 
4.1%
8
 
3.6%
Other values (25) 83
37.7%
Latin Ext Additional
ValueCountFrequency (%)
23
27.1%
21
24.7%
8
 
9.4%
8
 
9.4%
7
 
8.2%
5
 
5.9%
4
 
4.7%
4
 
4.7%
ế 2
 
2.4%
1
 
1.2%
Other values (2) 2
 
2.4%
Sinhala
ValueCountFrequency (%)
12
16.2%
6
 
8.1%
6
 
8.1%
6
 
8.1%
6
 
8.1%
6
 
8.1%
4
 
5.4%
4
 
5.4%
4
 
5.4%
2
 
2.7%
Other values (9) 18
24.3%
Katakana
ValueCountFrequency (%)
10
22.2%
8
17.8%
8
17.8%
6
13.3%
6
13.3%
2
 
4.4%
2
 
4.4%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Lao
ValueCountFrequency (%)
9
19.6%
5
 
10.9%
4
 
8.7%
3
 
6.5%
3
 
6.5%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
1
 
2.2%
Other values (13) 13
28.3%
Diacriticals
ValueCountFrequency (%)
̣ 9
60.0%
̀ 6
40.0%
Devanagari
ValueCountFrequency (%)
ि 8
11.9%
7
 
10.4%
7
 
10.4%
6
 
9.0%
6
 
9.0%
5
 
7.5%
3
 
4.5%
2
 
3.0%
2
 
3.0%
2
 
3.0%
Other values (15) 19
28.4%
IPA Ext
ValueCountFrequency (%)
ə 6
100.0%
Punctuation
ValueCountFrequency (%)
6
31.6%
5
26.3%
4
21.1%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Georgian
ValueCountFrequency (%)
5
23.8%
4
19.0%
2
 
9.5%
2
 
9.5%
2
 
9.5%
2
 
9.5%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%

사업연도
Real number (ℝ)

Distinct: 33
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2010.6041
Minimum: 1992
Maximum: 2024
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1992
5-th percentile: 1996
Q1: 2006
median: 2011
Q3: 2017
95-th percentile: 2021
Maximum: 2024
Range: 32
Interquartile range (IQR): 11

Descriptive statistics

Standard deviation: 7.5308323
Coefficient of variation (CV): 0.003745557
Kurtosis: -0.43254071
Mean: 2010.6041
Median Absolute Deviation (MAD): 5
Skewness: -0.54337017
Sum: 20106041
Variance: 56.713435
Monotonicity: Not monotonic
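
Several of the statistics above are derived from others, so they can be cross-checked by hand. The sketch below recomputes sum, variance, coefficient of variation, range, and IQR from the reported mean, standard deviation, extremes, and quartiles. The input numbers are copied from this report; the row count of 10000 comes from the dataset overview, and the variable names are only illustrative.

```python
import math

# Values copied from the report for 사업연도.
n = 10000                      # number of observations (dataset overview)
mean = 2010.6041
std = 7.5308323
minimum, maximum = 1992, 2024
q1, q3 = 2006, 2017

derived = {
    "sum": mean * n,           # sum = mean * number of rows
    "variance": std ** 2,      # variance = squared standard deviation
    "cv": std / mean,          # coefficient of variation = std / mean
    "range": maximum - minimum,
    "iqr": q3 - q1,
}

# Values the report itself states, for comparison.
reported = {
    "sum": 20106041,
    "variance": 56.713435,
    "cv": 0.003745557,
    "range": 32,
    "iqr": 11,
}

for name, value in derived.items():
    agrees = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.9g} reported={reported[name]} agrees={agrees}")
```

The very small CV reflects that the variable is a calendar year: the spread of about 7.5 years is tiny relative to a mean near 2010, so the CV says little about the variable here.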
Histogram with fixed size bins (bins=33)
Value | Count | Frequency (%)
2019 | 648 | 6.5%
2010 | 570 | 5.7%
2011 | 556 | 5.6%
2009 | 529 | 5.3%
2008 | 489 | 4.9%
2015 | 482 | 4.8%
2018 | 472 | 4.7%
2020 | 468 | 4.7%
2012 | 460 | 4.6%
2007 | 458 | 4.6%
Other values (23) | 4868 | 48.7%
Value | Count | Frequency (%)
1992 | 101 | 1.0%
1993 | 93 | 0.9%
1994 | 128 | 1.3%
1995 | 162 | 1.6%
1996 | 152 | 1.5%
1997 | 155 | 1.6%
1998 | 115 | 1.1%
1999 | 126 | 1.3%
2000 | 156 | 1.6%
2001 | 175 | 1.8%
Value | Count | Frequency (%)
2024 | 1 | < 0.1%
2023 | 1 | < 0.1%
2022 | 245 | 2.5%
2021 | 428 | 4.3%
2020 | 468 | 4.7%
2019 | 648 | 6.5%
2018 | 472 | 4.7%
2017 | 420 | 4.2%
2016 | 421 | 4.2%
2015 | 482 | 4.8%
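
The most-frequent-values table for 사업연도 lists ten of the 33 distinct years and folds the rest into an "Other values" row. The sketch below shows how that row and the percentage column follow from the counts. The counts, row total, and distinct count are copied from this report; the helper `pct` is a hypothetical name.

```python
# Counts copied from the most-frequent-values table for 사업연도.
top_counts = {
    2019: 648, 2010: 570, 2011: 556, 2009: 529, 2008: 489,
    2015: 482, 2018: 472, 2020: 468, 2012: 460, 2007: 458,
}
n_rows = 10000      # number of observations (dataset overview)
n_distinct = 33     # distinct years reported for 사업연도

# Whatever is not in the top ten ends up in the "Other values" row.
other_count = n_rows - sum(top_counts.values())
other_distinct = n_distinct - len(top_counts)


def pct(count, total=n_rows):
    """Share of rows as a percentage, rounded to one decimal as in the report."""
    return round(100 * count / total, 1)


print(f"Other values ({other_distinct}): {other_count} rows, {pct(other_count)}%")
for year, count in top_counts.items():
    print(year, count, f"{pct(count)}%")
```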

Interactions

[Figure: interaction plot]

Correlations

Variable | 대륙명 | 사업유형코드 | 사업유형명 | 다년구분코드 | 다년구분코드명 | 사업연도
대륙명 | 1.000 | 0.361 | 0.361 | 0.218 | 0.218 | 0.176
사업유형코드 | 0.361 | 1.000 | 1.000 | 1.000 | 1.000 | 0.374
사업유형명 | 0.361 | 1.000 | 1.000 | 1.000 | 1.000 | 0.374
다년구분코드 | 0.218 | 1.000 | 1.000 | 1.000 | 1.000 | 0.412
다년구분코드명 | 0.218 | 1.000 | 1.000 | 1.000 | 1.000 | 0.412
사업연도 | 0.176 | 0.374 | 0.374 | 0.412 | 0.412 | 1.000

Variable | 대륙명 | 다년구분코드명 | 다년구분코드 | 사업유형명 | 사업유형코드
대륙명 | 1.000 | 0.149 | 0.149 | 0.260 | 0.260
다년구분코드명 | 0.149 | 1.000 | 1.000 | 1.000 | 1.000
다년구분코드 | 0.149 | 1.000 | 1.000 | 1.000 | 1.000
사업유형명 | 0.260 | 1.000 | 1.000 | 1.000 | 0.997
사업유형코드 | 0.260 | 1.000 | 1.000 | 0.997 | 1.000

Variable | 사업연도 | 대륙명 | 사업유형코드 | 사업유형명 | 다년구분코드 | 다년구분코드명
사업연도 | 1.000 | 0.096 | 0.287 | 0.287 | 0.185 | 0.185
대륙명 | 0.096 | 1.000 | 0.260 | 0.260 | 0.149 | 0.149
사업유형코드 | 0.287 | 0.260 | 1.000 | 0.997 | 1.000 | 1.000
사업유형명 | 0.287 | 0.260 | 0.997 | 1.000 | 1.000 | 1.000
다년구분코드 | 0.185 | 0.149 | 1.000 | 1.000 | 1.000 | 1.000
다년구분코드명 | 0.185 | 0.149 | 1.000 | 1.000 | 1.000 | 1.000
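
The coefficients of 1.000 between 다년구분코드 and 다년구분코드명 suggest that a code column and its name column carry the same information. One simple way to confirm that, independent of whichever correlation measure the report used, is to test whether the two columns map one-to-one. The sketch below does this in plain Python; `is_one_to_one` is a hypothetical helper, and the pairs are taken from a few of the sample rows in this report.

```python
def is_one_to_one(pairs):
    """Return True if each left value maps to exactly one right value and vice versa."""
    forward, backward = {}, {}
    for left, right in pairs:
        # setdefault stores the first mapping seen and returns the stored one afterwards.
        if forward.setdefault(left, right) != right:
            return False
        if backward.setdefault(right, left) != left:
            return False
    return True


# (다년구분코드, 다년구분코드명) pairs as they appear in a few sample rows.
pairs = [("S", "단년"), ("S", "단년"), ("M", "다년"), ("S", "단년")]
print(is_one_to_one(pairs))

# A conflicting pair breaks the one-to-one property.
print(is_one_to_one(pairs + [("S", "다년")]))
```

If the check holds on the full data, one column of each pair can be dropped before modelling without losing information.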

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
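
Nullity correlation can be illustrated as the Pearson correlation between the "is present" indicators of two columns. The sketch below is a minimal pure-Python version under that assumption; it is not the profiling tool's own implementation, and `nullity_correlation` is a hypothetical name. The example values are the start and end dates of four sample rows from this report, with `None` standing in for `<NA>`.

```python
import math


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators (1 = present, 0 = missing)."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # A column that is always present or always missing has no defined correlation.
        return float("nan")
    return cov / math.sqrt(var_a * var_b)


# 사업시작일 and 사업종료일 from four sample rows; None stands in for <NA>.
start = [None, None, "2019-01-01", "2019-04-16"]
end = [None, None, "2019-12-31", "2019-05-17"]
print(nullity_correlation(start, end))
```

A value near 1 means the two columns tend to be missing together, which is consistent with 사업시작일 and 사업종료일 having almost identical missing counts (2646 and 2648).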

Sample

(index) | 국가명 | 국가영문명 | iso 2자리코드 | 대륙명 | 사업유형코드 | 사업유형명 | 사업명(국문) | 사업명(영문) | 사업시작일 | 사업종료일 | 다년구분코드 | 다년구분코드명 | 수혜기관명 | 사업연도
618 | 대한민국 | Korea | KR | 아시아 | 1 | KF | 2003년도 뉴스레터 국문 8월 | 2003 Newsletter Korean August | <NA> | <NA> | S | 단년 | 한국국제교류재단 | 2003
10531 | 캐나다 | Canada | CA | 북아메리카 | 1 | KF | 2007년도 UBC 한국법 기금교수직 설치(2/3) | 2007 Establishment of Professorships Program | <NA> | <NA> | S | 단년 | 브리티시컬럼비아대(UBC) | 2007
5014 | 미국 | United States of America | US | 북아메리카 | 1 | KF | 1995 조지워싱턴대 동아시아연구센터한국연구프로그램운영 | <NA> | <NA> | <NA> | S | 단년 | 조지워싱턴대 | 1995
8092 | 슬로베니아 | Slovenia | SI | 유럽 | 1 | KF | [전자자료지원] 2019 슬로베니아 류블라냐대 | <NA> | 2019-01-01 | 2019-12-31 | S | 단년 | Univerza v Ljubljani, Fakulteta za družbene vede, Raziskovalno središče za Vzhodno Azijo (EARL) | 2019
8821 | 우크라이나 | Ukraine | UA | 유럽 | 1 | KF | [지자체] <이주와 정주의 삶> | <NA> | 2019-04-16 | 2019-05-17 | S | 단년 | Andong Culture & Art Center | 2019
8523 | 영국 | United Kingdom | GB | 유럽 | 1 | KF | 제20차 한영미래포럼 | <NA> | 2012-06-13 | 2012-06-14 | S | 단년 | <NA> | 2012
11190 | 파라과이 | Paraguay | PY | 남아메리카 | 1 | KF | [전시] 한-파 이민 50주년 기념展 《순수의 땅으로》 | <NA> | 2015-06-24 | 2015-07-18 | S | 단년 | 주한 파라과이 대사관 | 2015
8033 | 스페인 | Spain | ES | 유럽 | 1 | KF | [서유럽] 2018-19 스페인 살라망카대 교원고용지원 사업 | <NA> | 2018-09-01 | 2018-12-31 | M | 다년 | Universidad de Salamanca | 2018
1146 | 대한민국 | Korea | KR | 아시아 | 1 | KF | 일제의 전시체제와 조선인 동원 | <NA> | 2006-03-03 | 2006-03-03 | S | 단년 | 낙성대경제연구소 | 2006
4435 | 미국 | United States of America | US | 북아메리카 | 1 | KF | 미국 브루킹스연구소 | Analysing Korea's Central Role in Northeast Asian Affairs | <NA> | <NA> | S | 단년 | 주 미국 대한민국 대사관 | 2003

(index) | 국가명 | 국가영문명 | iso 2자리코드 | 대륙명 | 사업유형코드 | 사업유형명 | 사업명(국문) | 사업명(영문) | 사업시작일 | 사업종료일 | 다년구분코드 | 다년구분코드명 | 수혜기관명 | 사업연도
10636 | 캐나다 | Canada | CA | 북아메리카 | 1 | KF | 캐나다 UBC 한국어 기금강사직 설치 지원(3/5) | <NA> | 2015-09-01 | 2016-08-31 | M | 다년 | 브리티시컬럼비아대(UBC) | 2015
281 | 대한민국 | Korea | KR | 아시아 | 1 | KF | 2010년도 뉴스레터 영문 12월 | 2010 Newsletter English December | <NA> | <NA> | S | 단년 | 와우이미지 | 2010
1871 | 대한민국 | Korea | KR | 아시아 | 1 | KF | 형 그리고 영, 한국 초상화 걸작선 | Great Korean Portraits | 2010-01-01 | 2010-12-31 | S | 단년 | 돌베개 | 2010
9476 | 일본 | Japan | JP | 아시아 | 1 | KF | 일본민예관 초청 쇳대박물관 소장유물전 | <NA> | 2008-09-09 | 2008-11-20 | S | 단년 | 쇳대박물관 | 2008
5774 | 미국 | United States of America | US | 북아메리카 | 1 | KF | 미국 컬럼비아대 WEAI | <NA> | 2009-01-01 | 2009-12-31 | S | 단년 | 컬럼비아대학교 | 2009
4725 | 미국 | United States of America | US | 북아메리카 | 1 | KF | AATK 연례회의, 워크샵 개최 | <NA> | <NA> | <NA> | S | 단년 | 미국한국어교육자협회(AATK) | 2001
7045 | 미국 | United States of America | US | 북아메리카 | 1 | KF | [현안세미나] CFTNI | <NA> | 2020-05-01 | 2021-05-30 | S | 단년 | Center for the National Interest (CFTNI) | 2020
7796 | 브라질 | Brazil | BR | 남아메리카 | 1 | KF | [중남미] 2016 브라질 상파울루대 한국어 객원교수 파견 (이나현) | <NA> | 2016-04-10 | 2016-12-20 | S | 단년 | 상파울루대 | 2016
8579 | 영국 | United Kingdom | GB | 유럽 | 1 | KF | [서유럽] 2017-22 영국 에딘버러대 한국학 부교수직 설치 지원 (1/5) | <NA> | 2017-09-18 | 2018-09-17 | M | 다년 | 에든버러대학교 국제처 | 2017
9276 | 인도네시아 | Indonesia | ID | 아시아 | 1 | KF | 2011 한인문예총종합예술제 | <NA> | 2011-11-27 | 2011-11-27 | <NA> | <NA> | <NA> | 2011

Duplicate rows

Most frequently occurring

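(index) | 국가명 | 국가영문명 | iso 2자리코드 | 대륙명 | 사업유형코드 | 사업유형명 | 사업명(국문) | 사업명(영문) | 사업시작일 | 사업종료일 | 다년구분코드 | 다년구분코드명 | 수혜기관명 | 사업연도 | # duplicates
0 | 미국 | United States of America | US | 북아메리카 | 1 | KF | 미국 CSIS | <NA> | 2002-01-01 | 2002-12-31 | S | 단년 | 주 미국 대한민국 대사관 | 2002 | 2
1 | 미국 | United States of America | US | 북아메리카 | 1 | KF | 미국 CSIS | <NA> | <NA> | <NA> | S | 단년 | 주 미국 대한민국 대사관 | 2004 | 2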
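
A duplicate row here is a row whose values match another row in every column. The sketch below shows one way to find such rows and count their occurrences with the standard library. `duplicate_rows` is a hypothetical helper, and the example rows are abbreviated to a few columns from the table above, with `None` standing in for `<NA>`.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return a mapping of row -> occurrence count for rows that appear more than once."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}


# Abbreviated rows: (국가명, 사업명(국문), 사업시작일, 사업종료일, 사업연도).
rows = [
    ("미국", "미국 CSIS", "2002-01-01", "2002-12-31", 2002),
    ("미국", "미국 CSIS", "2002-01-01", "2002-12-31", 2002),
    ("미국", "미국 CSIS", None, None, 2004),
]

for row, n in duplicate_rows(rows).items():
    print(n, row)
```

Rows are compared as whole tuples, so two projects with the same name but different years or dates are not treated as duplicates.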