Overview

Dataset statistics

Number of variables4
Number of observations702
Missing cells59
Missing cells (%)2.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory22.1 KiB
Average record size in memory32.2 B

Variable types

Text2
DateTime2

Dataset

Description1. 외교일지 목록 조회 : 대상 국가명을 이용하여 외교일지 목록을 조회 - 외교일지 관련 정보를 제공하는 공공파일데이터
Author외교부
URLhttps://www.data.go.kr/data/15099242/fileData.do

Alerts

대상 국가명 has 59 (8.4%) missing valuesMissing

Reproduction

Analysis started2024-04-17 20:53:53.359600
Analysis finished2024-04-17 20:53:53.950433
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

대상 국가명
Text

MISSING 

Distinct262
Distinct (%)40.7%
Missing59
Missing (%)8.4%
Memory size5.6 KiB
2024-04-18T05:53:54.060340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length32
Mean length7.5769829
Min length2

Characters and Unicode

Total characters4872
Distinct characters187
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique163 ?
Unique (%)25.3%

Sample

1st row한국, 호주, 중국, 태국, 미국, 베트남
2nd row한국, 베트남
3rd row한국, 미국
4th row한국, 미국
5th row북한
ValueCountFrequency (%)
한국 448
35.1%
미국 86
 
6.7%
일본 52
 
4.1%
중국 40
 
3.1%
베트남 25
 
2.0%
호주 25
 
2.0%
인도 24
 
1.9%
un 22
 
1.7%
뉴질랜드 18
 
1.4%
독일 16
 
1.3%
Other values (191) 520
40.8%
2024-04-18T05:53:54.359605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
634
 
13.0%
, 623
 
12.8%
610
 
12.5%
456
 
9.4%
107
 
2.2%
105
 
2.2%
E 91
 
1.9%
A 91
 
1.9%
N 85
 
1.7%
U 82
 
1.7%
Other values (177) 1988
40.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2856
58.6%
Uppercase Letter 729
 
15.0%
Space Separator 634
 
13.0%
Other Punctuation 623
 
12.8%
Decimal Number 19
 
0.4%
Lowercase Letter 10
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
610
21.4%
456
 
16.0%
107
 
3.7%
105
 
3.7%
71
 
2.5%
70
 
2.5%
56
 
2.0%
52
 
1.8%
47
 
1.6%
46
 
1.6%
Other values (143) 1236
43.3%
Uppercase Letter
ValueCountFrequency (%)
E 91
12.5%
A 91
12.5%
N 85
11.7%
U 82
11.2%
C 79
10.8%
O 50
6.9%
S 49
6.7%
P 35
 
4.8%
I 32
 
4.4%
R 21
 
2.9%
Other values (12) 114
15.6%
Lowercase Letter
ValueCountFrequency (%)
o 3
30.0%
n 2
20.0%
m 2
20.0%
e 2
20.0%
c 1
 
10.0%
Decimal Number
ValueCountFrequency (%)
0 8
42.1%
2 8
42.1%
4 2
 
10.5%
3 1
 
5.3%
Space Separator
ValueCountFrequency (%)
634
100.0%
Other Punctuation
ValueCountFrequency (%)
, 623
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2856
58.6%
Common 1277
26.2%
Latin 739
 
15.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
610
21.4%
456
 
16.0%
107
 
3.7%
105
 
3.7%
71
 
2.5%
70
 
2.5%
56
 
2.0%
52
 
1.8%
47
 
1.6%
46
 
1.6%
Other values (143) 1236
43.3%
Latin
ValueCountFrequency (%)
E 91
12.3%
A 91
12.3%
N 85
11.5%
U 82
11.1%
C 79
10.7%
O 50
6.8%
S 49
 
6.6%
P 35
 
4.7%
I 32
 
4.3%
R 21
 
2.8%
Other values (17) 124
16.8%
Common
ValueCountFrequency (%)
634
49.6%
, 623
48.8%
0 8
 
0.6%
2 8
 
0.6%
4 2
 
0.2%
- 1
 
0.1%
3 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2856
58.6%
ASCII 2016
41.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
634
31.4%
, 623
30.9%
E 91
 
4.5%
A 91
 
4.5%
N 85
 
4.2%
U 82
 
4.1%
C 79
 
3.9%
O 50
 
2.5%
S 49
 
2.4%
P 35
 
1.7%
Other values (24) 197
 
9.8%
Hangul
ValueCountFrequency (%)
610
21.4%
456
 
16.0%
107
 
3.7%
105
 
3.7%
71
 
2.5%
70
 
2.5%
56
 
2.0%
52
 
1.8%
47
 
1.6%
46
 
1.6%
Other values (143) 1236
43.3%
Distinct261
Distinct (%)37.2%
Missing0
Missing (%)0.0%
Memory size5.6 KiB
Minimum2020-01-01 00:00:00
Maximum2020-12-29 00:00:00
2024-04-18T05:53:54.496586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T05:53:54.617146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct257
Distinct (%)36.6%
Missing0
Missing (%)0.0%
Memory size5.6 KiB
Minimum2020-01-01 00:00:00
Maximum2020-12-31 00:00:00
2024-04-18T05:53:54.751228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T05:53:54.876812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct657
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Memory size5.6 KiB
2024-04-18T05:53:55.123008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length126
Median length78
Mean length31.01567
Min length5

Characters and Unicode

Total characters21773
Distinct characters519
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique628 ?
Unique (%)89.5%

Sample

1st row호주 정부, 중화인민공화국 정부, 대한민국정부, 타이왕국 정부, 미합중국 정부 그리고 베트남사회주의공화국 정부 간의 협정 발효(조약 제2446호)
2nd row대한민국과 베트남사회주의공화국 간의 쌀에 대한 저율관세할당물량에 관한 교환각서 발효(조약 제2447호)
3rd row대한민국과 미합중국 간의 쌀에 대한 저율관세할당물량에 관한 교환각서 발효(조약 제2448호)
4th row한국-미국 북핵수석대표 간 전화협의
5th row북한, 조선노동당 중앙위원회 제7기 5차전원회의(12.28.-31.) 결과 발표
ValueCountFrequency (%)
전화협의 182
 
5.1%
161
 
4.5%
외교장관 80
 
2.3%
코로나19 59
 
1.7%
관련 56
 
1.6%
외교부 49
 
1.4%
정상 45
 
1.3%
한국-미국 31
 
0.9%
유엔 27
 
0.8%
대응 27
 
0.8%
Other values (1431) 2836
79.8%
2024-04-18T05:53:55.478778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2854
 
13.1%
771
 
3.5%
) 741
 
3.4%
( 741
 
3.4%
606
 
2.8%
604
 
2.8%
550
 
2.5%
410
 
1.9%
371
 
1.7%
359
 
1.6%
Other values (509) 13766
63.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13378
61.4%
Space Separator 2854
 
13.1%
Lowercase Letter 1516
 
7.0%
Uppercase Letter 1128
 
5.2%
Decimal Number 880
 
4.0%
Close Punctuation 742
 
3.4%
Open Punctuation 742
 
3.4%
Dash Punctuation 332
 
1.5%
Other Punctuation 183
 
0.8%
Math Symbol 8
 
< 0.1%
Other values (2) 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
771
 
5.8%
606
 
4.5%
604
 
4.5%
550
 
4.1%
410
 
3.1%
371
 
2.8%
359
 
2.7%
346
 
2.6%
332
 
2.5%
258
 
1.9%
Other values (421) 8771
65.6%
Lowercase Letter
ValueCountFrequency (%)
a 213
14.1%
i 152
 
10.0%
e 137
 
9.0%
o 130
 
8.6%
n 128
 
8.4%
r 110
 
7.3%
s 69
 
4.6%
t 64
 
4.2%
u 61
 
4.0%
h 60
 
4.0%
Other values (26) 392
25.9%
Uppercase Letter
ValueCountFrequency (%)
A 136
 
12.1%
C 111
 
9.8%
S 97
 
8.6%
M 84
 
7.4%
E 82
 
7.3%
N 65
 
5.8%
O 62
 
5.5%
P 52
 
4.6%
I 50
 
4.4%
U 46
 
4.1%
Other values (16) 343
30.4%
Decimal Number
ValueCountFrequency (%)
2 216
24.5%
1 183
20.8%
0 131
14.9%
9 86
 
9.8%
3 63
 
7.2%
4 62
 
7.0%
5 45
 
5.1%
7 43
 
4.9%
6 29
 
3.3%
8 22
 
2.5%
Other Punctuation
ValueCountFrequency (%)
, 108
59.0%
. 29
 
15.8%
/ 21
 
11.5%
· 12
 
6.6%
' 9
 
4.9%
: 3
 
1.6%
1
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 741
99.9%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 741
99.9%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
2854
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 332
100.0%
Math Symbol
ValueCountFrequency (%)
+ 8
100.0%
Final Punctuation
ValueCountFrequency (%)
5
100.0%
Initial Punctuation
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13377
61.4%
Common 5751
26.4%
Latin 2644
 
12.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
771
 
5.8%
606
 
4.5%
604
 
4.5%
550
 
4.1%
410
 
3.1%
371
 
2.8%
359
 
2.7%
346
 
2.6%
332
 
2.5%
258
 
1.9%
Other values (420) 8770
65.6%
Latin
ValueCountFrequency (%)
a 213
 
8.1%
i 152
 
5.7%
e 137
 
5.2%
A 136
 
5.1%
o 130
 
4.9%
n 128
 
4.8%
C 111
 
4.2%
r 110
 
4.2%
S 97
 
3.7%
M 84
 
3.2%
Other values (52) 1346
50.9%
Common
ValueCountFrequency (%)
2854
49.6%
) 741
 
12.9%
( 741
 
12.9%
- 332
 
5.8%
2 216
 
3.8%
1 183
 
3.2%
0 131
 
2.3%
, 108
 
1.9%
9 86
 
1.5%
3 63
 
1.1%
Other values (16) 296
 
5.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13377
61.4%
ASCII 8354
38.4%
None 30
 
0.1%
Punctuation 11
 
0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2854
34.2%
) 741
 
8.9%
( 741
 
8.9%
- 332
 
4.0%
2 216
 
2.6%
a 213
 
2.5%
1 183
 
2.2%
i 152
 
1.8%
e 137
 
1.6%
A 136
 
1.6%
Other values (62) 2649
31.7%
Hangul
ValueCountFrequency (%)
771
 
5.8%
606
 
4.5%
604
 
4.5%
550
 
4.1%
410
 
3.1%
371
 
2.8%
359
 
2.7%
346
 
2.6%
332
 
2.5%
258
 
1.9%
Other values (420) 8770
65.6%
None
ValueCountFrequency (%)
· 12
40.0%
á 4
 
13.3%
ó 2
 
6.7%
í 2
 
6.7%
é 2
 
6.7%
ā 1
 
3.3%
ū 1
 
3.3%
ä 1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (3) 3
 
10.0%
Punctuation
ValueCountFrequency (%)
5
45.5%
5
45.5%
1
 
9.1%
CJK
ValueCountFrequency (%)
1
100.0%

Missing values

2024-04-18T05:53:53.922630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

대상 국가명시작일종료일일지 제목
0한국, 호주, 중국, 태국, 미국, 베트남2020-01-012020-01-01호주 정부, 중화인민공화국 정부, 대한민국정부, 타이왕국 정부, 미합중국 정부 그리고 베트남사회주의공화국 정부 간의 협정 발효(조약 제2446호)
1한국, 베트남2020-01-012020-01-01대한민국과 베트남사회주의공화국 간의 쌀에 대한 저율관세할당물량에 관한 교환각서 발효(조약 제2447호)
2한국, 미국2020-01-012020-01-01대한민국과 미합중국 간의 쌀에 대한 저율관세할당물량에 관한 교환각서 발효(조약 제2448호)
3한국, 미국2020-01-012020-01-01한국-미국 북핵수석대표 간 전화협의
4북한2020-01-012020-01-01북한, 조선노동당 중앙위원회 제7기 5차전원회의(12.28.-31.) 결과 발표
5크로아티아, EU2020-01-012020-06-30크로아티아, 2020년 상반기 EU 이사회 의장국
6한국, 미국2020-01-032020-01-03한국-미국 외교차관보급 협의(Washington, D.C.)
7한국, ANATF2020-01-062020-01-06우리나라, 아프간 군신탁기금(ANATF) 이사회 공동의장직 수임(1년 임기)
8한국, 라트비아2020-01-082020-01-12Ināra Mūrniece(이나라 무르니에쩨) 라트비아 국회의장 공식방한
9미국, 일본2020-01-102020-01-10미국-일본 북핵수석대표협의(Washington,D,C.)
대상 국가명시작일종료일일지 제목
692한국, 미국2020-12-222020-12-22한국-미국 북핵수석대표 간 전화협의
693한국, 미국2020-12-232020-12-23대한민국 정부와 미합중국 정부 간의 과학 및 기술협력에 관한 협정의 연장을 위한 교환각서 발효(고시 제938호)
694한국, 중국2020-12-232020-12-23한국-중국 외교차관회담(화상회의)
695한국, 인도2020-12-232020-12-23한국-인도 외교차관회담(화상회의)
696한국, 일본2020-12-232020-12-23한국-일본 북핵수석대표 간 전화협의
697한국, 터키2020-12-232020-12-23이원익 주터키대사 신임장 제정(Ankara)
698<NA>2020-12-252020-12-25최종문 외교부 제2차관 취임
699NEACHS2020-12-292020-12-29동북아 방역·보건 협력체(NEACHS) 출범 트랙1.5 실무회의(화상회의)
700한국, 러시아2020-12-292020-12-29한국-러시아 북핵수석대표 간 전화협의
701한국, 중국2020-12-292020-12-29노규덕 외교부 한반도평화교섭본부장, Wu Jianghao(우장하오) 중국 외교부 부장조리와 전화협의