gimi9 Pandas Profiling

Dataset statistics

Number of variables	14
Number of observations	10000
Missing cells	13508
Missing cells (%)	9.6%
Duplicate rows	2
Duplicate rows (%)	< 0.1%
Total size in memory	1.2 MiB
Average record size in memory	122.0 B

Variable types

Text	6
Categorical	5
DateTime	2
Numeric	1

Dataset

Description	1. KOICA-ODA 사업정보 KF-공공외교 사업 정보 목록 조회: 한글 국가명 또는 ISO국가코드(다.참고 1 ISO국가코드 이용), 한글 사업명으로 KOICA-ODA 사업정보 KF-공공외교 사업 정보 목록 조회
Author	한국국제협력단
URL	https://www.data.go.kr/data/15099254/fileData.do

Alerts

Dataset has 2 (< 0.1%) duplicate rows	Duplicates
`다년구분코드명` is highly overall correlated with `사업유형코드` and 2 other fields	High correlation
`다년구분코드` is highly overall correlated with `사업유형코드` and 2 other fields	High correlation
`사업유형명` is highly overall correlated with `사업유형코드` and 2 other fields	High correlation
`사업유형코드` is highly overall correlated with `사업유형명` and 2 other fields	High correlation
`사업유형코드` is highly imbalanced (88.5%)	Imbalance
`사업유형명` is highly imbalanced (88.5%)	Imbalance
`다년구분코드` is highly imbalanced (59.8%)	Imbalance
`다년구분코드명` is highly imbalanced (59.8%)	Imbalance
`사업명(영문)` has 6533 (65.3%) missing values	Missing
`사업시작일` has 2646 (26.5%) missing values	Missing
`사업종료일` has 2648 (26.5%) missing values	Missing
`수혜기관명` has 1617 (16.2%) missing values	Missing

Reproduction

Analysis started	2023-12-12 19:59:43.757361
Analysis finished	2023-12-12 19:59:46.488736
Duration	2.73 seconds
Software version	ydata-profiling vv4.5.1
Download configuration	config.json

국가명
Text

Distinct	122
Distinct (%)	1.2%
Missing	0
Missing (%)	0.0%
Memory size	156.2 KiB

Length

Max length	10
Median length	9
Mean length	3.0189
Min length	2

Characters and Unicode

Total characters	30189
Distinct characters	144
Distinct categories	1 ?
Distinct scripts	1 ?
Distinct blocks	1 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	11 ?
Unique (%)	0.1%

Sample

1st row	대한민국
2nd row	캐나다
3rd row	미국
4th row	슬로베니아
5th row	우크라이나

Value	Count	Frequency (%)
대한민국	2685	26.9%
미국	2450	24.5%
중국	529	5.3%
러시아	347	3.5%
일본	275	2.8%
독일	265	2.6%
베트남	233	2.3%
영국	230	2.3%
캐나다	161	1.6%
호주	161	1.6%
Other values (112)	2664	26.6%

Most occurring characters

Value	Count	Frequency (%)
국	6029	20.0%
대	2685	8.9%
한	2685	8.9%
민	2685	8.9%
미	2499	8.3%
아	1059	3.5%
스	696	2.3%
일	557	1.8%
시	546	1.8%
중	530	1.8%
Other values (134)	10218	33.8%

Most occurring categories

Value	Count	Frequency (%)
Other Letter	30189	100.0%

Most frequent character per category

Other Letter

Value	Count	Frequency (%)
국	6029	20.0%
대	2685	8.9%
한	2685	8.9%
민	2685	8.9%
미	2499	8.3%
아	1059	3.5%
스	696	2.3%
일	557	1.8%
시	546	1.8%
중	530	1.8%
Other values (134)	10218	33.8%

Most occurring scripts

Value	Count	Frequency (%)
Hangul	30189	100.0%

Most frequent character per script

Hangul

Value	Count	Frequency (%)
국	6029	20.0%
대	2685	8.9%
한	2685	8.9%
민	2685	8.9%
미	2499	8.3%
아	1059	3.5%
스	696	2.3%
일	557	1.8%
시	546	1.8%
중	530	1.8%
Other values (134)	10218	33.8%

Most occurring blocks

Value	Count	Frequency (%)
Hangul	30189	100.0%

Most frequent character per block

Hangul

Value	Count	Frequency (%)
국	6029	20.0%
대	2685	8.9%
한	2685	8.9%
민	2685	8.9%
미	2499	8.3%
아	1059	3.5%
스	696	2.3%
일	557	1.8%
시	546	1.8%
중	530	1.8%
Other values (134)	10218	33.8%

국가영문명
Text

Distinct	121
Distinct (%)	1.2%
Missing	31
Missing (%)	0.3%
Memory size	156.2 KiB

Length

Max length	26
Median length	24
Mean length	10.702779
Min length	3

Characters and Unicode

Total characters	106696
Distinct characters	55
Distinct categories	5 ?
Distinct scripts	2 ?
Distinct blocks	2 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	11 ?
Unique (%)	0.1%

Sample

1st row	Korea
2nd row	Canada
3rd row	United States of America
4th row	Slovenia
5th row	Ukraine

Value	Count	Frequency (%)
united	2695	15.2%
korea	2685	15.1%
states	2450	13.8%
of	2450	13.8%
america	2450	13.8%
china	529	3.0%
russia	347	2.0%
japan	275	1.5%
germany	265	1.5%
vietnam	233	1.3%
Other values (128)	3393	19.1%

Most occurring characters

Value	Count	Frequency (%)
a	13404	12.6%
e	12096	11.3%
t	8530	8.0%
i	8370	7.8%
	7803	7.3%
r	6647	6.2%
n	6148	5.8%
o	6103	5.7%
s	3942	3.7%
d	3780	3.5%
Other values (45)	29873	28.0%

Most occurring categories

Value	Count	Frequency (%)
Lowercase Letter	83479	78.2%
Uppercase Letter	15376	14.4%
Space Separator	7803	7.3%
Other Punctuation	35	< 0.1%
Dash Punctuation	3	< 0.1%

Most frequent character per category

Lowercase Letter

Value	Count	Frequency (%)
a	13404	16.1%
e	12096	14.5%
t	8530	10.2%
i	8370	10.0%
r	6647	8.0%
n	6148	7.4%
o	6103	7.3%
s	3942	4.7%
d	3780	4.5%
m	3474	4.2%
Other values (17)	10985	13.2%

Uppercase Letter

Value	Count	Frequency (%)
K	3000	19.5%
U	2780	18.1%
A	2779	18.1%
S	2753	17.9%
C	889	5.8%
R	439	2.9%
I	430	2.8%
G	306	2.0%
J	302	2.0%
T	293	1.9%
Other values (13)	1405	9.1%

Other Punctuation

Value	Count	Frequency (%)
:	15	42.9%
'	15	42.9%
&	5	14.3%

Space Separator

Value	Count	Frequency (%)
	7803	100.0%

Dash Punctuation

Value	Count	Frequency (%)
-	3	100.0%

Most occurring scripts

Value	Count	Frequency (%)
Latin	98855	92.7%
Common	7841	7.3%

Most frequent character per script

Latin

Value	Count	Frequency (%)
a	13404	13.6%
e	12096	12.2%
t	8530	8.6%
i	8370	8.5%
r	6647	6.7%
n	6148	6.2%
o	6103	6.2%
s	3942	4.0%
d	3780	3.8%
m	3474	3.5%
Other values (40)	26361	26.7%

Common

Value	Count	Frequency (%)
	7803	99.5%
:	15	0.2%
'	15	0.2%
&	5	0.1%
-	3	< 0.1%

Most occurring blocks

Value	Count	Frequency (%)
ASCII	106681	> 99.9%
None	15	< 0.1%

Most frequent character per block

ASCII

Value	Count	Frequency (%)
a	13404	12.6%
e	12096	11.3%
t	8530	8.0%
i	8370	7.8%
	7803	7.3%
r	6647	6.2%
n	6148	5.8%
o	6103	5.7%
s	3942	3.7%
d	3780	3.5%
Other values (44)	29858	28.0%

None

Value	Count	Frequency (%)
ô	15	100.0%

iso 2자리코드
Text

Distinct	122
Distinct (%)	1.2%
Missing	0
Missing (%)	0.0%
Memory size	156.2 KiB

Length

Max length	2
Median length	2
Mean length	2
Min length	2

Characters and Unicode

Total characters	20000
Distinct characters	26
Distinct categories	1 ?
Distinct scripts	1 ?
Distinct blocks	1 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	11 ?
Unique (%)	0.1%

Sample

1st row	KR
2nd row	CA
3rd row	US
4th row	SI
5th row	UA

Value	Count	Frequency (%)
kr	2685	26.9%
us	2450	24.5%
cn	529	5.3%
ru	347	3.5%
jp	275	2.8%
de	265	2.6%
vn	233	2.3%
gb	230	2.3%
ca	161	1.6%
au	161	1.6%
Other values (112)	2664	26.6%

Most occurring characters

Value	Count	Frequency (%)
R	3480	17.4%
U	3078	15.4%
K	2899	14.5%
S	2692	13.5%
N	1079	5.4%
C	894	4.5%
E	628	3.1%
A	580	2.9%
I	477	2.4%
T	463	2.3%
Other values (16)	3730	18.6%

Most occurring categories

Value	Count	Frequency (%)
Uppercase Letter	20000	100.0%

Most frequent character per category

Uppercase Letter

Value	Count	Frequency (%)
R	3480	17.4%
U	3078	15.4%
K	2899	14.5%
S	2692	13.5%
N	1079	5.4%
C	894	4.5%
E	628	3.1%
A	580	2.9%
I	477	2.4%
T	463	2.3%
Other values (16)	3730	18.6%

Most occurring scripts

Value	Count	Frequency (%)
Latin	20000	100.0%

Most frequent character per script

Latin

Value	Count	Frequency (%)
R	3480	17.4%
U	3078	15.4%
K	2899	14.5%
S	2692	13.5%
N	1079	5.4%
C	894	4.5%
E	628	3.1%
A	580	2.9%
I	477	2.4%
T	463	2.3%
Other values (16)	3730	18.6%

Most occurring blocks

Value	Count	Frequency (%)
ASCII	20000	100.0%

Most frequent character per block

ASCII

Value	Count	Frequency (%)
R	3480	17.4%
U	3078	15.4%
K	2899	14.5%
S	2692	13.5%
N	1079	5.4%
C	894	4.5%
E	628	3.1%
A	580	2.9%
I	477	2.4%
T	463	2.3%
Other values (16)	3730	18.6%

대륙명
Categorical

Distinct	7
Distinct (%)	0.1%
Missing	0
Missing (%)	0.0%
Memory size	156.2 KiB

아시아	4661
북아메리카	2802
유럽	1965
호주(오세아니아)	190
남아메리카	187
Other values (2)	195

Length

Max length	9
Median length	5
Mean length	3.5348
Min length	2

Unique

Unique	0 ?
Unique (%)	0.0%

Sample

1st row	아시아
2nd row	북아메리카
3rd row	북아메리카
4th row	유럽
5th row	유럽

Common Values

Value	Count	Frequency (%)
아시아	4661	46.6%
북아메리카	2802	28.0%
유럽	1965	19.7%
호주(오세아니아)	190	1.9%
남아메리카	187	1.9%
아프리카	164	1.6%
<NA>	31	0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value	Count	Frequency (%)
아시아	4661	46.6%
북아메리카	2802	28.0%
유럽	1965	19.7%
호주(오세아니아	190	1.9%
남아메리카	187	1.9%
아프리카	164	1.6%
na	31	0.3%

사업유형코드
Categorical

HIGH CORRELATION IMBALANCE

Distinct	2
Distinct (%)	< 0.1%
Missing	0
Missing (%)	0.0%
Memory size	156.2 KiB

1	9845
2	155

Length

Max length	1
Median length	1
Mean length	1
Min length	1

Unique

Unique	0 ?
Unique (%)	0.0%

Sample

1st row	1
2nd row	1
3rd row	1
4th row	1
5th row	1

Common Values

Value	Count	Frequency (%)
1	9845	98.5%
2	155	1.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value	Count	Frequency (%)
1	9845	98.5%
2	155	1.6%

사업유형명
Categorical

HIGH CORRELATION IMBALANCE

Distinct	2
Distinct (%)	< 0.1%
Missing	0
Missing (%)	0.0%
Memory size	156.2 KiB

KF	9845
KOICA	155

Length

Max length	5
Median length	2
Mean length	2.0465
Min length	2

Unique

Unique	0 ?
Unique (%)	0.0%

Sample

1st row	KF
2nd row	KF
3rd row	KF
4th row	KF
5th row	KF

Common Values

Value	Count	Frequency (%)
KF	9845	98.5%
KOICA	155	1.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value	Count	Frequency (%)
kf	9845	98.5%
koica	155	1.6%

사업명(국문)
Text

Distinct	7972
Distinct (%)	80.0%
Missing	33
Missing (%)	0.3%
Memory size	156.2 KiB

Length

Max length	125
Median length	89
Mean length	24.001806
Min length	3

Characters and Unicode

Total characters	239226
Distinct characters	926
Distinct categories	17 ?
Distinct scripts	7 ?
Distinct blocks	10 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	6918 ?
Unique (%)	69.4%

Sample

1st row	2003년도 뉴스레터 국문 8월
2nd row	2007년도 UBC 한국법 기금교수직 설치(2/3)
3rd row	1995 조지워싱턴대 동아시아연구센터한국연구프로그램운영
4th row	[전자자료지원] 2019 슬로베니아 류블라냐대
5th row	[지자체] <이주와 정주의 삶>

Value	Count	Frequency (%)
한국어	1112	2.5%
미국	1108	2.5%
한국학	883	2.0%
객원교수	839	1.9%
지원	793	1.8%
뉴스레터	353	0.8%
중국	300	0.7%
및	289	0.7%
운영	278	0.6%
설치	275	0.6%
Other values (9775)	37788	85.8%

Most occurring characters

Value	Count	Frequency (%)
	34388	14.4%
국	7677	3.2%
한	5761	2.4%
2	5222	2.2%
0	5143	2.1%
대	4580	1.9%
원	3822	1.6%
1	3692	1.5%
교	3307	1.4%
]	3025	1.3%
Other values (916)	162609	68.0%

Most occurring categories

Value	Count	Frequency (%)
Other Letter	138932	58.1%
Space Separator	34388	14.4%
Decimal Number	19830	8.3%
Lowercase Letter	19138	8.0%
Uppercase Letter	10485	4.4%
Close Punctuation	5692	2.4%
Open Punctuation	5691	2.4%
Dash Punctuation	2298	1.0%
Other Punctuation	2014	0.8%
Math Symbol	552	0.2%
Other values (7)	206	0.1%

Most frequent character per category

Other Letter

Value	Count	Frequency (%)
국	7677	5.5%
한	5761	4.1%
대	4580	3.3%
원	3822	2.8%
교	3307	2.4%
아	2804	2.0%
지	2672	1.9%
학	2670	1.9%
미	2415	1.7%
스	2210	1.6%
Other values (816)	101014	72.7%

Lowercase Letter

Value	Count	Frequency (%)
e	2305	12.0%
i	2079	10.9%
o	1780	9.3%
n	1743	9.1%
t	1641	8.6%
a	1637	8.6%
r	1454	7.6%
s	1212	6.3%
l	864	4.5%
u	617	3.2%
Other values (19)	3806	19.9%

Uppercase Letter

Value	Count	Frequency (%)
S	1243	11.9%
C	1055	10.1%
I	963	9.2%
A	917	8.7%
K	756	7.2%
U	679	6.5%
F	574	5.5%
E	549	5.2%
T	545	5.2%
P	492	4.7%
Other values (16)	2712	25.9%

Other Punctuation

Value	Count	Frequency (%)
/	954	47.4%
,	358	17.8%
.	187	9.3%
'	140	7.0%
:	137	6.8%
"	119	5.9%
&	54	2.7%
·	51	2.5%
?	9	0.4%
!	4	0.2%

Decimal Number

Value	Count	Frequency (%)
2	5222	26.3%
0	5143	25.9%
1	3692	18.6%
9	1622	8.2%
5	934	4.7%
3	714	3.6%
8	685	3.5%
6	670	3.4%
7	641	3.2%
4	507	2.6%

Close Punctuation

Value	Count	Frequency (%)
]	3025	53.1%
)	2624	46.1%
》	34	0.6%
」	9	0.2%

Open Punctuation

Value	Count	Frequency (%)
[	3025	53.2%
(	2623	46.1%
《	34	0.6%
「	9	0.2%

Math Symbol

Value	Count	Frequency (%)
>	272	49.3%
<	271	49.1%
~	5	0.9%
+	4	0.7%

Initial Punctuation

Value	Count	Frequency (%)
“	16	53.3%
‘	14	46.7%

Final Punctuation

Value	Count	Frequency (%)
’	15	51.7%
”	14	48.3%

Letter Number

Value	Count	Frequency (%)
Ⅱ	5	83.3%
Ⅲ	1	16.7%

Space Separator

Value	Count	Frequency (%)
	34388	100.0%

Dash Punctuation

Value	Count	Frequency (%)
-	2298	100.0%

Control

Value	Count	Frequency (%)
	94	100.0%

Connector Punctuation

Value	Count	Frequency (%)
_	44	100.0%

Other Symbol

Value	Count	Frequency (%)
㈜	2	100.0%

Modifier Symbol

Value	Count	Frequency (%)
`	1	100.0%

Most occurring scripts

Value	Count	Frequency (%)
Hangul	138902	58.1%
Common	70663	29.5%
Latin	29627	12.4%
Han	29	< 0.1%
Hiragana	3	< 0.1%
Greek	1	< 0.1%
Cyrillic	1	< 0.1%

Most frequent character per script

Hangul

Value	Count	Frequency (%)
국	7677	5.5%
한	5761	4.1%
대	4580	3.3%
원	3822	2.8%
교	3307	2.4%
아	2804	2.0%
지	2672	1.9%
학	2670	1.9%
미	2415	1.7%
스	2210	1.6%
Other values (791)	100984	72.7%

Latin

Value	Count	Frequency (%)
e	2305	7.8%
i	2079	7.0%
o	1780	6.0%
n	1743	5.9%
t	1641	5.5%
a	1637	5.5%
r	1454	4.9%
S	1243	4.2%
s	1212	4.1%
C	1055	3.6%
Other values (45)	13478	45.5%

Common

Value	Count	Frequency (%)
	34388	48.7%
2	5222	7.4%
0	5143	7.3%
1	3692	5.2%
]	3025	4.3%
[	3025	4.3%
)	2624	3.7%
(	2623	3.7%
-	2298	3.3%
9	1622	2.3%
Other values (32)	7001	9.9%

Han

Value	Count	Frequency (%)
展	5	17.2%
和	2	6.9%
美	2	6.9%
共	1	3.4%
在	1	3.4%
日	1	3.4%
本	1	3.4%
百	1	3.4%
濟	1	3.4%
文	1	3.4%
Other values (13)	13	44.8%

Hiragana

Value	Count	Frequency (%)
の	1	33.3%
を	1	33.3%
む	1	33.3%

Greek

Value	Count	Frequency (%)
ο	1	100.0%

Cyrillic

Value	Count	Frequency (%)
о	1	100.0%

Most occurring blocks

Value	Count	Frequency (%)
Hangul	138891	58.1%
ASCII	100086	41.8%
None	141	0.1%
Punctuation	59	< 0.1%
CJK	29	< 0.1%
Compat Jamo	9	< 0.1%
Number Forms	6	< 0.1%
Hiragana	3	< 0.1%
Katakana	1	< 0.1%
Cyrillic	1	< 0.1%

Most frequent character per block

ASCII

Value	Count	Frequency (%)
	34388	34.4%
2	5222	5.2%
0	5143	5.1%
1	3692	3.7%
]	3025	3.0%
[	3025	3.0%
)	2624	2.6%
(	2623	2.6%
e	2305	2.3%
-	2298	2.3%
Other values (74)	35741	35.7%

Hangul

Value	Count	Frequency (%)
국	7677	5.5%
한	5761	4.1%
대	4580	3.3%
원	3822	2.8%
교	3307	2.4%
아	2804	2.0%
지	2672	1.9%
학	2670	1.9%
미	2415	1.7%
스	2210	1.6%
Other values (787)	100973	72.7%

None

Value	Count	Frequency (%)
·	51	36.2%
》	34	24.1%
《	34	24.1%
」	9	6.4%
「	9	6.4%
㈜	2	1.4%
ô	1	0.7%
ο	1	0.7%

Punctuation

Value	Count	Frequency (%)
“	16	27.1%
’	15	25.4%
‘	14	23.7%
”	14	23.7%

Compat Jamo

Value	Count	Frequency (%)
ㆍ	6	66.7%
ㅇ	2	22.2%
ㄱ	1	11.1%

Number Forms

Value	Count	Frequency (%)
Ⅱ	5	83.3%
Ⅲ	1	16.7%

CJK

Value	Count	Frequency (%)
展	5	17.2%
和	2	6.9%
美	2	6.9%
共	1	3.4%
在	1	3.4%
日	1	3.4%
本	1	3.4%
百	1	3.4%
濟	1	3.4%
文	1	3.4%
Other values (13)	13	44.8%

Katakana

Value	Count	Frequency (%)
・	1	100.0%

Hiragana

Value	Count	Frequency (%)
の	1	33.3%
を	1	33.3%
む	1	33.3%

Cyrillic

Value	Count	Frequency (%)
о	1	100.0%

사업명(영문)
Text

MISSING

Distinct	2035
Distinct (%)	58.7%
Missing	6533
Missing (%)	65.3%
Memory size	156.2 KiB

Length

Max length	203
Median length	147
Mean length	46.26882
Min length	3

Characters and Unicode

Total characters	160414
Distinct characters	100
Distinct categories	15 ?
Distinct scripts	3 ?
Distinct blocks	5 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	1606 ?
Unique (%)	46.3%

Sample

1st row	2003 Newsletter Korean August
2nd row	2007 Establishment of Professorships Program
3rd row	Analysing Korea's Central Role in Northeast Asian Affairs
4th row	The Second Korea-Japan Journalist and Expert Dialogue
5th row	Opera <The Wedding>

Value	Count	Frequency (%)
of	1395	6.3%
program	1079	4.8%
the	716	3.2%
korean	710	3.2%
for	471	2.1%
and	427	1.9%
visiting	377	1.7%
korea	351	1.6%
staff	342	1.5%
teaching	341	1.5%
Other values (3003)	16058	72.1%

Most occurring characters

Value	Count	Frequency (%)
	18824	11.7%
e	12319	7.7%
o	11246	7.0%
r	10874	6.8%
a	9686	6.0%
n	8811	5.5%
i	8774	5.5%
t	8305	5.2%
s	8205	5.1%
l	4249	2.6%
Other values (90)	59121	36.9%

Most occurring categories

Value	Count	Frequency (%)
Lowercase Letter	111966	69.8%
Uppercase Letter	19897	12.4%
Space Separator	18824	11.7%
Decimal Number	7506	4.7%
Other Punctuation	967	0.6%
Dash Punctuation	570	0.4%
Open Punctuation	257	0.2%
Close Punctuation	254	0.2%
Math Symbol	63	< 0.1%
Final Punctuation	45	< 0.1%
Other values (5)	65	< 0.1%

Most frequent character per category

Lowercase Letter

Value	Count	Frequency (%)
e	12319	11.0%
o	11246	10.0%
r	10874	9.7%
a	9686	8.7%
n	8811	7.9%
i	8774	7.8%
t	8305	7.4%
s	8205	7.3%
l	4249	3.8%
m	3772	3.4%
Other values (16)	25725	23.0%

Uppercase Letter

Value	Count	Frequency (%)
P	2515	12.6%
S	1988	10.0%
E	1775	8.9%
K	1527	7.7%
A	1284	6.5%
T	1200	6.0%
C	1186	6.0%
N	994	5.0%
R	879	4.4%
F	871	4.4%
Other values (16)	5678	28.5%

Other Punctuation

Value	Count	Frequency (%)
:	214	22.1%
,	203	21.0%
.	150	15.5%
"	144	14.9%
'	137	14.2%
&	79	8.2%
/	21	2.2%
;	10	1.0%
?	4	0.4%
!	4	0.4%

Decimal Number

Value	Count	Frequency (%)
0	2596	34.6%
2	1747	23.3%
1	1149	15.3%
9	819	10.9%
8	245	3.3%
7	236	3.1%
3	197	2.6%
6	189	2.5%
5	184	2.5%
4	144	1.9%

Other Letter

Value	Count	Frequency (%)
九	1	20.0%
州	1	20.0%
大	1	20.0%
學	1	20.0%
校	1	20.0%

Math Symbol

Value	Count	Frequency (%)
<	26	41.3%
>	24	38.1%
\|	8	12.7%
+	5	7.9%

Open Punctuation

Value	Count	Frequency (%)
(	144	56.0%
[	112	43.6%
《	1	0.4%

Close Punctuation

Value	Count	Frequency (%)
)	141	55.5%
]	112	44.1%
》	1	0.4%

Letter Number

Value	Count	Frequency (%)
Ⅰ	3	42.9%
Ⅱ	2	28.6%
Ⅲ	2	28.6%

Dash Punctuation

Value	Count	Frequency (%)
-	569	99.8%
–	1	0.2%

Initial Punctuation

Value	Count	Frequency (%)
“	31	77.5%
‘	9	22.5%

Final Punctuation

Value	Count	Frequency (%)
”	27	60.0%
’	18	40.0%

Space Separator

Value	Count	Frequency (%)
	18824	100.0%

Connector Punctuation

Value	Count	Frequency (%)
_	12	100.0%

Modifier Symbol

Value	Count	Frequency (%)
`	1	100.0%

Most occurring scripts

Value	Count	Frequency (%)
Latin	131870	82.2%
Common	28539	17.8%
Han	5	< 0.1%

Most frequent character per script

Latin

Value	Count	Frequency (%)
e	12319	9.3%
o	11246	8.5%
r	10874	8.2%
a	9686	7.3%
n	8811	6.7%
i	8774	6.7%
t	8305	6.3%
s	8205	6.2%
l	4249	3.2%
m	3772	2.9%
Other values (45)	45629	34.6%

Common

Value	Count	Frequency (%)
	18824	66.0%
0	2596	9.1%
2	1747	6.1%
1	1149	4.0%
9	819	2.9%
-	569	2.0%
8	245	0.9%
7	236	0.8%
:	214	0.7%
,	203	0.7%
Other values (30)	1937	6.8%

Han

Value	Count	Frequency (%)
九	1	20.0%
州	1	20.0%
大	1	20.0%
學	1	20.0%
校	1	20.0%

Most occurring blocks

Value	Count	Frequency (%)
ASCII	160314	99.9%
Punctuation	86	0.1%
Number Forms	7	< 0.1%
CJK	5	< 0.1%
None	2	< 0.1%

Most frequent character per block

ASCII

Value	Count	Frequency (%)
	18824	11.7%
e	12319	7.7%
o	11246	7.0%
r	10874	6.8%
a	9686	6.0%
n	8811	5.5%
i	8774	5.5%
t	8305	5.2%
s	8205	5.1%
l	4249	2.7%
Other values (75)	59021	36.8%

Punctuation

Value	Count	Frequency (%)
“	31	36.0%
”	27	31.4%
’	18	20.9%
‘	9	10.5%
–	1	1.2%

Number Forms

Value	Count	Frequency (%)
Ⅰ	3	42.9%
Ⅱ	2	28.6%
Ⅲ	2	28.6%

CJK

Value	Count	Frequency (%)
九	1	20.0%
州	1	20.0%
大	1	20.0%
學	1	20.0%
校	1	20.0%

None

Value	Count	Frequency (%)
》	1	50.0%
《	1	50.0%

사업시작일
Date

MISSING

Distinct	2133
Distinct (%)	29.0%
Missing	2646
Missing (%)	26.5%
Memory size	156.2 KiB

Minimum	1992-01-01 00:00:00
Maximum	2040-08-21 00:00:00

Histogram

Histogram with fixed size bins (bins=50)

사업종료일
Date

MISSING

Distinct	2124
Distinct (%)	28.9%
Missing	2648
Missing (%)	26.5%
Memory size	156.2 KiB

Minimum	1992-02-27 00:00:00
Maximum	2024-07-31 00:00:00

Histogram

Histogram with fixed size bins (bins=50)

다년구분코드
Categorical

HIGH CORRELATION IMBALANCE

Distinct	6
Distinct (%)	0.1%
Missing	0
Missing (%)	0.0%
Memory size	156.2 KiB

S	7385
M	2232
<NA>	233
MC	85
MN	64

Length

Max length	4
Median length	1
Mean length	1.0849
Min length	1

Unique

Unique	1 ?
Unique (%)	< 0.1%

Sample

1st row	S
2nd row	S
3rd row	S
4th row	S
5th row	S

Common Values

Value	Count	Frequency (%)
S	7385	73.9%
M	2232	22.3%
<NA>	233	2.3%
MC	85	0.9%
MN	64	0.6%
SN	1	< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value	Count	Frequency (%)
s	7385	73.9%
m	2232	22.3%
na	233	2.3%
mc	85	0.9%
mn	64	0.6%
sn	1	< 0.1%

다년구분코드명
Categorical

HIGH CORRELATION IMBALANCE

Distinct	6
Distinct (%)	0.1%
Missing	0
Missing (%)	0.0%
Memory size	156.2 KiB

단년	7385
다년	2232
<NA>	233
다년계속	85
다년신규	64

Length

Max length	4
Median length	2
Mean length	2.0766
Min length	2

Unique

Unique	1 ?
Unique (%)	< 0.1%

Sample

1st row	단년
2nd row	단년
3rd row	단년
4th row	단년
5th row	단년

Common Values

Value	Count	Frequency (%)
단년	7385	73.9%
다년	2232	22.3%
<NA>	233	2.3%
다년계속	85	0.9%
다년신규	64	0.6%
단년신규	1	< 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value	Count	Frequency (%)
단년	7385	73.9%
다년	2232	22.3%
na	233	2.3%
다년계속	85	0.9%
다년신규	64	0.6%
단년신규	1	< 0.1%

수혜기관명
Text

MISSING

Distinct	3003
Distinct (%)	35.8%
Missing	1617
Missing (%)	16.2%
Memory size	156.2 KiB

Length

Max length	100
Median length	88
Mean length	14.640344
Min length	2

Characters and Unicode

Total characters	122730
Distinct characters	1154
Distinct categories	18 ?
Distinct scripts	15 ?
Distinct blocks	18 ?

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique	1631 ?
Unique (%)	19.5%

Sample

1st row	한국국제교류재단
2nd row	브리티시컬럼비아대(UBC)
3rd row	조지워싱턴대
4th row	Univerza v Ljubljani, Fakulteta za družbene vede, Raziskovalno središče za Vzhodno Azijo (EARL)
5th row	Andong Culture & Art Center

Value	Count	Frequency (%)
of	577	3.1%
university	564	3.1%
한국국제교류재단	381	2.1%
주	357	1.9%
대사관	333	1.8%
대한민국	319	1.7%
and	194	1.1%
미국	170	0.9%
for	162	0.9%
studies	157	0.9%
Other values (3971)	15208	82.6%

Most occurring characters

Value	Count	Frequency (%)
	10094	8.2%
i	4783	3.9%
e	4669	3.8%
대	4225	3.4%
n	4100	3.3%
a	4037	3.3%
t	3596	2.9%
r	3515	2.9%
o	3112	2.5%
국	3081	2.5%
Other values (1144)	77518	63.2%

Most occurring categories

Value	Count	Frequency (%)
Other Letter	53123	43.3%
Lowercase Letter	44651	36.4%
Uppercase Letter	11311	9.2%
Space Separator	10094	8.2%
Close Punctuation	1123	0.9%
Open Punctuation	1117	0.9%
Other Punctuation	594	0.5%
Dash Punctuation	417	0.3%
Decimal Number	152	0.1%
Nonspacing Mark	86	0.1%
Other values (8)	62	0.1%

Most frequent character per category

Other Letter

Value	Count	Frequency (%)
대	4225	8.0%
국	3081	5.8%
학	1826	3.4%
한	1819	3.4%
교	1651	3.1%
아	1213	2.3%
사	1161	2.2%
스	1064	2.0%
관	987	1.9%
제	951	1.8%
Other values (873)	35145	66.2%

Lowercase Letter

Value	Count	Frequency (%)
i	4783	10.7%
e	4669	10.5%
n	4100	9.2%
a	4037	9.0%
t	3596	8.1%
r	3515	7.9%
o	3112	7.0%
s	2803	6.3%
l	1621	3.6%
u	1502	3.4%
Other values (117)	10913	24.4%

Uppercase Letter

Value	Count	Frequency (%)
U	1266	11.2%
S	1112	9.8%
C	1070	9.5%
A	1020	9.0%
I	764	6.8%
L	600	5.3%
N	494	4.4%
E	470	4.2%
M	450	4.0%
K	393	3.5%
Other values (64)	3672	32.5%

Nonspacing Mark

Value	Count	Frequency (%)
ั	12	14.0%
̣	9	10.5%
ิ	9	10.5%
्	7	8.1%
්	6	7.0%
์	6	7.0%
ි	6	7.0%
̀	6	7.0%
่	5	5.8%
ี	4	4.7%
Other values (11)	16	18.6%

Other Punctuation

Value	Count	Frequency (%)
,	311	52.4%
.	118	19.9%
/	39	6.6%
&	32	5.4%
·	25	4.2%
'	25	4.2%
;	20	3.4%
"	11	1.9%
・	6	1.0%
:	4	0.7%
Other values (2)	3	0.5%

Decimal Number

Value	Count	Frequency (%)
1	42	27.6%
2	35	23.0%
7	23	15.1%
0	14	9.2%
3	10	6.6%
5	9	5.9%
8	8	5.3%
4	7	4.6%
9	3	2.0%
6	1	0.7%

Spacing Mark

Value	Count	Frequency (%)
ि	8	29.6%
ा	7	25.9%
ා	6	22.2%
ी	2	7.4%
ැ	2	7.4%
ං	2	7.4%

Math Symbol

Value	Count	Frequency (%)
<	2	33.3%
>	2	33.3%
+	1	16.7%
~	1	16.7%

Open Punctuation

Value	Count	Frequency (%)
(	1110	99.4%
„	5	0.4%
[	2	0.2%

Final Punctuation

Value	Count	Frequency (%)
»	1	33.3%
”	1	33.3%
’	1	33.3%

Close Punctuation

Value	Count	Frequency (%)
)	1121	99.8%
]	2	0.2%

Dash Punctuation

Value	Count	Frequency (%)
-	416	99.8%
–	1	0.2%

Initial Punctuation

Value	Count	Frequency (%)
“	6	85.7%
«	1	14.3%

Space Separator

Value	Count	Frequency (%)
	10094	100.0%

Other Symbol

Value	Count	Frequency (%)
㈜	8	100.0%

Modifier Letter

Value	Count	Frequency (%)
ー	6	100.0%

Format

Value	Count	Frequency (%)
‍	4	100.0%

Control

Value	Count	Frequency (%)
	1	100.0%

Most occurring scripts

Value	Count	Frequency (%)
Latin	53837	43.9%
Hangul	51592	42.0%
Common	13520	11.0%
Cyrillic	1949	1.6%
Han	755	0.6%
Hebrew	231	0.2%
Thai	220	0.2%
Arabic	211	0.2%
Armenian	155	0.1%
Sinhala	74	0.1%
Other values (5)	186	0.2%

Most frequent character per script

Hangul

Value	Count	Frequency (%)
대	4225	8.2%
국	3081	6.0%
학	1826	3.5%
한	1819	3.5%
교	1651	3.2%
아	1213	2.4%
사	1161	2.3%
스	1064	2.1%
관	987	1.9%
제	951	1.8%
Other values (587)	33614	65.2%

Han

Value	Count	Frequency (%)
学	66	8.7%
大	59	7.8%
社	34	4.5%
国	32	4.2%
究	25	3.3%
研	22	2.9%
學	20	2.6%
語	18	2.4%
京	17	2.3%
版	17	2.3%
Other values (148)	445	58.9%

Latin

Value	Count	Frequency (%)
i	4783	8.9%
e	4669	8.7%
n	4100	7.6%
a	4037	7.5%
t	3596	6.7%
r	3515	6.5%
o	3112	5.8%
s	2803	5.2%
l	1621	3.0%
u	1502	2.8%
Other values (102)	20099	37.3%

Cyrillic

Value	Count	Frequency (%)
е	190	9.7%
и	183	9.4%
т	157	8.1%
н	152	7.8%
а	132	6.8%
о	127	6.5%
с	127	6.5%
р	99	5.1%
в	88	4.5%
к	82	4.2%
Other values (41)	612	31.4%

Common

Value	Count	Frequency (%)
	10094	74.7%
)	1121	8.3%
(	1110	8.2%
-	416	3.1%
,	311	2.3%
.	118	0.9%
1	42	0.3%
/	39	0.3%
2	35	0.3%
&	32	0.2%
Other values (31)	202	1.5%

Thai

Value	Count	Frequency (%)
า	28	12.7%
ย	20	9.1%
ม	16	7.3%
ห	12	5.5%
ร	12	5.5%
ั	12	5.5%
ล	11	5.0%
ิ	9	4.1%
ว	9	4.1%
ท	8	3.6%
Other values (25)	83	37.7%

Armenian

Value	Count	Frequency (%)
Ա	42	27.1%
Ե	13	8.4%
Ն	12	7.7%
Կ	9	5.8%
Ր	9	5.8%
ա	7	4.5%
Լ	6	3.9%
Տ	6	3.9%
Ի	6	3.9%
Վ	6	3.9%
Other values (18)	39	25.2%

Devanagari

Value	Count	Frequency (%)
ि	8	11.9%
्	7	10.4%
ा	7	10.4%
व	6	9.0%
य	6	9.0%
ल	5	7.5%
द	3	4.5%
श	2	3.0%
ज	2	3.0%
ी	2	3.0%
Other values (15)	19	28.4%

Arabic

Value	Count	Frequency (%)
ا	44	20.9%
ل	28	13.3%
ة	18	8.5%
م	16	7.6%
ن	15	7.1%
ع	14	6.6%
ي	13	6.2%
س	11	5.2%
ج	8	3.8%
د	6	2.8%
Other values (13)	38	18.0%

Lao

Value	Count	Frequency (%)
າ	9	19.6%
ສ	5	10.9%
ະ	4	8.7%
ນ	3	6.5%
ພ	3	6.5%
ຸ	2	4.3%
ວ	2	4.3%
ົ	2	4.3%
ຫ	2	4.3%
ູ	1	2.2%
Other values (13)	13	28.3%

Hebrew

Value	Count	Frequency (%)
י	46	19.9%
ר	22	9.5%
ב	20	8.7%
ו	19	8.2%
א	19	8.2%
ה	14	6.1%
ס	13	5.6%
נ	12	5.2%
ת	12	5.2%
ט	12	5.2%
Other values (11)	42	18.2%

Sinhala

Value	Count	Frequency (%)
ය	12	16.2%
ා	6	8.1%
න	6	8.1%
්	6	8.1%
ව	6	8.1%
ි	6	8.1%
අ	4	5.4%
ල	4	5.4%
ශ	4	5.4%
ණ	2	2.7%
Other values (9)	18	24.3%

Georgian

Value	Count	Frequency (%)
ი	5	23.8%
ა	4	19.0%
ს	2	9.5%
ც	2	9.5%
რ	2	9.5%
ტ	2	9.5%
უ	1	4.8%
ო	1	4.8%
მ	1	4.8%
ე	1	4.8%

Katakana

Value	Count	Frequency (%)
ン	10	30.3%
タ	8	24.2%
セ	8	24.2%
オ	2	6.1%
ク	2	6.1%
ソ	1	3.0%
ウ	1	3.0%
ル	1	3.0%

Inherited

Value	Count	Frequency (%)
̣	9	47.4%
̀	6	31.6%
‍	4	21.1%

Most occurring blocks

Value	Count	Frequency (%)
ASCII	66904	54.5%
Hangul	51584	42.0%
Cyrillic	1949	1.6%
CJK	755	0.6%
None	343	0.3%
Hebrew	231	0.2%
Thai	220	0.2%
Arabic	211	0.2%
Armenian	155	0.1%
Latin Ext Additional	85	0.1%
Other values (8)	293	0.2%

Most frequent character per block

ASCII

Value	Count	Frequency (%)
	10094	15.1%
i	4783	7.1%
e	4669	7.0%
n	4100	6.1%
a	4037	6.0%
t	3596	5.4%
r	3515	5.3%
o	3112	4.7%
s	2803	4.2%
l	1621	2.4%
Other values (72)	24574	36.7%

Hangul

Value	Count	Frequency (%)
대	4225	8.2%
국	3081	6.0%
학	1826	3.5%
한	1819	3.5%
교	1651	3.2%
아	1213	2.4%
사	1161	2.3%
스	1064	2.1%
관	987	1.9%
제	951	1.8%
Other values (586)	33606	65.1%

Cyrillic

Value	Count	Frequency (%)
е	190	9.7%
и	183	9.4%
т	157	8.1%
н	152	7.8%
а	132	6.8%
о	127	6.5%
с	127	6.5%
р	99	5.1%
в	88	4.5%
к	82	4.2%
Other values (41)	612	31.4%

CJK

Value	Count	Frequency (%)
学	66	8.7%
大	59	7.8%
社	34	4.5%
国	32	4.2%
究	25	3.3%
研	22	2.9%
學	20	2.6%
語	18	2.4%
京	17	2.3%
版	17	2.3%
Other values (148)	445	58.9%

Hebrew

Value	Count	Frequency (%)
י	46	19.9%
ר	22	9.5%
ב	20	8.7%
ו	19	8.2%
א	19	8.2%
ה	14	6.1%
ס	13	5.6%
נ	12	5.2%
ת	12	5.2%
ט	12	5.2%
Other values (11)	42	18.2%

Arabic

Value	Count	Frequency (%)
ا	44	20.9%
ل	28	13.3%
ة	18	8.5%
م	16	7.6%
ن	15	7.1%
ع	14	6.6%
ي	13	6.2%
س	11	5.2%
ج	8	3.8%
د	6	2.8%
Other values (13)	38	18.0%

Armenian

Value	Count	Frequency (%)
Ա	42	27.1%
Ե	13	8.4%
Ն	12	7.7%
Կ	9	5.8%
Ր	9	5.8%
ա	7	4.5%
Լ	6	3.9%
Տ	6	3.9%
Ի	6	3.9%
Վ	6	3.9%
Other values (18)	39	25.2%

None

Value	Count	Frequency (%)
ä	29	8.5%
·	25	7.3%
á	25	7.3%
à	25	7.3%
é	24	7.0%
Đ	22	6.4%
ü	18	5.2%
ö	16	4.7%
š	11	3.2%
ư	10	2.9%
Other values (41)	138	40.2%

Thai

Value	Count	Frequency (%)
า	28	12.7%
ย	20	9.1%
ม	16	7.3%
ห	12	5.5%
ร	12	5.5%
ั	12	5.5%
ล	11	5.0%
ิ	9	4.1%
ว	9	4.1%
ท	8	3.6%
Other values (25)	83	37.7%

Latin Ext Additional

Value	Count	Frequency (%)
ạ	23	27.1%
ọ	21	24.7%
ộ	8	9.4%
ữ	8	9.4%
ờ	7	8.2%
ệ	5	5.9%
ố	4	4.7%
ẵ	4	4.7%
ế	2	2.4%
ồ	1	1.2%
Other values (2)	2	2.4%

Sinhala

Value	Count	Frequency (%)
ය	12	16.2%
ා	6	8.1%
න	6	8.1%
්	6	8.1%
ව	6	8.1%
ි	6	8.1%
අ	4	5.4%
ල	4	5.4%
ශ	4	5.4%
ණ	2	2.7%
Other values (9)	18	24.3%

Katakana

Value	Count	Frequency (%)
ン	10	22.2%
タ	8	17.8%
セ	8	17.8%
ー	6	13.3%
・	6	13.3%
オ	2	4.4%
ク	2	4.4%
ソ	1	2.2%
ウ	1	2.2%
ル	1	2.2%

Lao

Value	Count	Frequency (%)
າ	9	19.6%
ສ	5	10.9%
ະ	4	8.7%
ນ	3	6.5%
ພ	3	6.5%
ຸ	2	4.3%
ວ	2	4.3%
ົ	2	4.3%
ຫ	2	4.3%
ູ	1	2.2%
Other values (13)	13	28.3%

Diacriticals

Value	Count	Frequency (%)
̣	9	60.0%
̀	6	40.0%

Devanagari

Value	Count	Frequency (%)
ि	8	11.9%
्	7	10.4%
ा	7	10.4%
व	6	9.0%
य	6	9.0%
ल	5	7.5%
द	3	4.5%
श	2	3.0%
ज	2	3.0%
ी	2	3.0%
Other values (15)	19	28.4%

IPA Ext

Value	Count	Frequency (%)
ə	6	100.0%

Punctuation

Value	Count	Frequency (%)
“	6	31.6%
„	5	26.3%
‍	4	21.1%
–	1	5.3%
”	1	5.3%
’	1	5.3%
…	1	5.3%

Georgian

Value	Count	Frequency (%)
ი	5	23.8%
ა	4	19.0%
ს	2	9.5%
ც	2	9.5%
რ	2	9.5%
ტ	2	9.5%
უ	1	4.8%
ო	1	4.8%
მ	1	4.8%
ე	1	4.8%

사업연도
Real number (ℝ)

Distinct	33
Distinct (%)	0.3%
Missing	0
Missing (%)	0.0%
Infinite	0
Infinite (%)	0.0%
Mean	2010.6041

Minimum	1992
Maximum	2024
Zeros	0
Zeros (%)	0.0%
Negative	0
Negative (%)	0.0%
Memory size	166.0 KiB

Quantile statistics

Minimum	1992
5-th percentile	1996
Q1	2006
median	2011
Q3	2017
95-th percentile	2021
Maximum	2024
Range	32
Interquartile range (IQR)	11

Descriptive statistics

Standard deviation	7.5308323
Coefficient of variation (CV)	0.003745557
Kurtosis	-0.43254071
Mean	2010.6041
Median Absolute Deviation (MAD)	5
Skewness	-0.54337017
Sum	20106041
Variance	56.713435
Monotonicity	Not monotonic

Histogram with fixed size bins (bins=33)

Value	Count	Frequency (%)
2019	648	6.5%
2010	570	5.7%
2011	556	5.6%
2009	529	5.3%
2008	489	4.9%
2015	482	4.8%
2018	472	4.7%
2020	468	4.7%
2012	460	4.6%
2007	458	4.6%
Other values (23)	4868	48.7%

Minimum 10 values
Maximum 10 values

Value	Count	Frequency (%)
1992	101	1.0%
1993	93	0.9%
1994	128	1.3%
1995	162	1.6%
1996	152	1.5%
1997	155	1.6%
1998	115	1.1%
1999	126	1.3%
2000	156	1.6%
2001	175	1.8%

Value	Count	Frequency (%)
2024	1	< 0.1%
2023	1	< 0.1%
2022	245	2.5%
2021	428	4.3%
2020	468	4.7%
2019	648	6.5%
2018	472	4.7%
2017	420	4.2%
2016	421	4.2%
2015	482	4.8%

사업연도

사업연도

Heatmap
Table

	대륙명	사업유형코드	사업유형명	다년구분코드	다년구분코드명	사업연도
대륙명	1.000	0.361	0.361	0.218	0.218	0.176
사업유형코드	0.361	1.000	1.000	1.000	1.000	0.374
사업유형명	0.361	1.000	1.000	1.000	1.000	0.374
다년구분코드	0.218	1.000	1.000	1.000	1.000	0.412
다년구분코드명	0.218	1.000	1.000	1.000	1.000	0.412
사업연도	0.176	0.374	0.374	0.412	0.412	1.000

Heatmap
Table

	대륙명	다년구분코드명	다년구분코드	사업유형명	사업유형코드
대륙명	1.000	0.149	0.149	0.260	0.260
다년구분코드명	0.149	1.000	1.000	1.000	1.000
다년구분코드	0.149	1.000	1.000	1.000	1.000
사업유형명	0.260	1.000	1.000	1.000	0.997
사업유형코드	0.260	1.000	1.000	0.997	1.000

Heatmap
Table

	사업연도	대륙명	사업유형코드	사업유형명	다년구분코드	다년구분코드명
사업연도	1.000	0.096	0.287	0.287	0.185	0.185
대륙명	0.096	1.000	0.260	0.260	0.149	0.149
사업유형코드	0.287	0.260	1.000	0.997	1.000	1.000
사업유형명	0.287	0.260	0.997	1.000	1.000	1.000
다년구분코드	0.185	0.149	1.000	1.000	1.000	1.000
다년구분코드명	0.185	0.149	1.000	1.000	1.000	1.000

A simple visualization of nullity by column.

Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

First rows
Last rows

	국가명	국가영문명	iso 2자리코드	대륙명	사업유형코드	사업유형명	사업명(국문)	사업명(영문)	사업시작일	사업종료일	다년구분코드	다년구분코드명	수혜기관명	사업연도
618	대한민국	Korea	KR	아시아	1	KF	2003년도 뉴스레터 국문 8월	2003 Newsletter Korean August	<NA>	<NA>	S	단년	한국국제교류재단	2003
10531	캐나다	Canada	CA	북아메리카	1	KF	2007년도 UBC 한국법 기금교수직 설치(2/3)	2007 Establishment of Professorships Program	<NA>	<NA>	S	단년	브리티시컬럼비아대(UBC)	2007
5014	미국	United States of America	US	북아메리카	1	KF	1995 조지워싱턴대 동아시아연구센터한국연구프로그램운영	<NA>	<NA>	<NA>	S	단년	조지워싱턴대	1995
8092	슬로베니아	Slovenia	SI	유럽	1	KF	[전자자료지원] 2019 슬로베니아 류블라냐대	<NA>	2019-01-01	2019-12-31	S	단년	Univerza v Ljubljani, Fakulteta za družbene vede, Raziskovalno središče za Vzhodno Azijo (EARL)	2019
8821	우크라이나	Ukraine	UA	유럽	1	KF	[지자체] <이주와 정주의 삶>	<NA>	2019-04-16	2019-05-17	S	단년	Andong Culture & Art Center	2019
8523	영국	United Kingdom	GB	유럽	1	KF	제20차 한영미래포럼	<NA>	2012-06-13	2012-06-14	S	단년	<NA>	2012
11190	파라과이	Paraguay	PY	남아메리카	1	KF	[전시] 한-파 이민 50주년 기념展《순수의 땅으로》	<NA>	2015-06-24	2015-07-18	S	단년	주한 파라과이 대사관	2015
8033	스페인	Spain	ES	유럽	1	KF	[서유럽] 2018-19 스페인 살라망카대 교원고용지원 사업	<NA>	2018-09-01	2018-12-31	M	다년	Universidad de Salamanca	2018
1146	대한민국	Korea	KR	아시아	1	KF	일제의 전시체제와 조선인 동원	<NA>	2006-03-03	2006-03-03	S	단년	낙성대경제연구소	2006
4435	미국	United States of America	US	북아메리카	1	KF	미국 브루킹스연구소	Analysing Korea's Central Role in Northeast Asian Affairs	<NA>	<NA>	S	단년	주 미국 대한민국 대사관	2003

	국가명	국가영문명	iso 2자리코드	대륙명	사업유형코드	사업유형명	사업명(국문)	사업명(영문)	사업시작일	사업종료일	다년구분코드	다년구분코드명	수혜기관명	사업연도
10636	캐나다	Canada	CA	북아메리카	1	KF	캐나다 UBC 한국어 기금강사직 설치 지원(3/5)	<NA>	2015-09-01	2016-08-31	M	다년	브리티시컬럼비아대(UBC)	2015
281	대한민국	Korea	KR	아시아	1	KF	2010년도 뉴스레터 영문 12월	2010 Newsletter English December	<NA>	<NA>	S	단년	와우이미지	2010
1871	대한민국	Korea	KR	아시아	1	KF	형 그리고 영, 한국 초상화 걸작선	Great Korean Portraits	2010-01-01	2010-12-31	S	단년	돌베개	2010
9476	일본	Japan	JP	아시아	1	KF	일본민예관 초청 쇳대박물관 소장유물전	<NA>	2008-09-09	2008-11-20	S	단년	쇳대박물관	2008
5774	미국	United States of America	US	북아메리카	1	KF	미국 컬럼비아대 WEAI	<NA>	2009-01-01	2009-12-31	S	단년	컬럼비아대학교	2009
4725	미국	United States of America	US	북아메리카	1	KF	AATK 연례회의, 워크샵 개최	<NA>	<NA>	<NA>	S	단년	미국한국어교육자협회(AATK)	2001
7045	미국	United States of America	US	북아메리카	1	KF	[현안세미나] CFTNI	<NA>	2020-05-01	2021-05-30	S	단년	Center for the National Interest (CFTNI)	2020
7796	브라질	Brazil	BR	남아메리카	1	KF	[중남미] 2016 브라질 상파울루대 한국어 객원교수 파견 (이나현)	<NA>	2016-04-10	2016-12-20	S	단년	상파울루대	2016
8579	영국	United Kingdom	GB	유럽	1	KF	[서유럽] 2017-22 영국 에딘버러대 한국학 부교수직 설치 지원 (1/5)	<NA>	2017-09-18	2018-09-17	M	다년	에든버러대학교 국제처	2017
9276	인도네시아	Indonesia	ID	아시아	1	KF	2011 한인문예총종합예술제	<NA>	2011-11-27	2011-11-27	<NA>	<NA>	<NA>	2011

Most frequently occurring

	국가명	국가영문명	iso 2자리코드	대륙명	사업유형코드	사업유형명	사업명(국문)	사업명(영문)	사업시작일	사업종료일	다년구분코드	다년구분코드명	수혜기관명	사업연도	# duplicates
0	미국	United States of America	US	북아메리카	1	KF	미국 CSIS	<NA>	2002-01-01	2002-12-31	S	단년	주 미국 대한민국 대사관	2002	2
1	미국	United States of America	US	북아메리카	1	KF	미국 CSIS	<NA>	<NA>	<NA>	S	단년	주 미국 대한민국 대사관	2004	2

Overview

Variables

Most occurring characters

Most occurring categories

Most frequent character per category

Other Letter

Most occurring scripts

Most frequent character per script

Hangul

Most occurring blocks

Most frequent character per block

Hangul

Most occurring characters

Most occurring categories

Most frequent character per category

Lowercase Letter

Uppercase Letter

Other Punctuation

Space Separator

Dash Punctuation

Most occurring scripts

Most frequent character per script

Latin

Common

Most occurring blocks

Most frequent character per block

ASCII

None

Most occurring characters

Most occurring categories

Most frequent character per category

Uppercase Letter

Most occurring scripts

Most frequent character per script

Latin

Most occurring blocks

Most frequent character per block

ASCII

Common Values

Length

Common Values (Plot)

Common Values

Length

Common Values (Plot)

Common Values

Length

Common Values (Plot)

Most occurring characters

Most occurring categories

Most frequent character per category

Other Letter

Lowercase Letter

Uppercase Letter

Other Punctuation

Decimal Number

Close Punctuation

Open Punctuation

Math Symbol

Initial Punctuation

Final Punctuation

Letter Number

Space Separator

Dash Punctuation

Control

Connector Punctuation

Other Symbol

Modifier Symbol

Most occurring scripts

Most frequent character per script

Hangul

Latin

Common

Han

Hiragana

Greek

Cyrillic

Most occurring blocks

Most frequent character per block

ASCII

Hangul