Overview

Dataset statistics

Number of variables8
Number of observations1298
Missing cells985
Missing cells (%)9.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory81.3 KiB
Average record size in memory64.1 B

Variable types

Text6
DateTime2

Dataset

Description1. 공관별 휴일 현황 목록 조회: 공관명을 이용하여 공관별 휴일 현황 목록 조회
Author외교부
URLhttps://www.data.go.kr/data/15099234/fileData.do

Alerts

휴일설명 has 985 (75.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 08:34:39.864295
Analysis finished2023-12-12 08:34:41.143668
Duration1.28 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct85
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
2023-12-12T17:34:41.370512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length8
Mean length3.8389831
Min length2

Characters and Unicode

Total characters4983
Distinct characters129
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row네팔
2nd row네팔
3rd row네팔
4th row네팔
5th row네팔
ValueCountFrequency (%)
이란 27
 
2.1%
네팔 26
 
2.0%
태국 24
 
1.8%
방글라데시 21
 
1.6%
콜롬비아 21
 
1.6%
스리랑카 21
 
1.6%
탄자니아 21
 
1.6%
동티모르 21
 
1.6%
말레이시아 20
 
1.5%
인도 20
 
1.5%
Other values (75) 1076
82.9%
2023-12-12T17:34:41.865889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
349
 
7.0%
233
 
4.7%
185
 
3.7%
182
 
3.7%
159
 
3.2%
128
 
2.6%
111
 
2.2%
99
 
2.0%
96
 
1.9%
94
 
1.9%
Other values (119) 3347
67.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4931
99.0%
Uppercase Letter 26
 
0.5%
Close Punctuation 13
 
0.3%
Open Punctuation 13
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
349
 
7.1%
233
 
4.7%
185
 
3.8%
182
 
3.7%
159
 
3.2%
128
 
2.6%
111
 
2.3%
99
 
2.0%
96
 
1.9%
94
 
1.9%
Other values (115) 3295
66.8%
Uppercase Letter
ValueCountFrequency (%)
R 13
50.0%
D 13
50.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4931
99.0%
Common 26
 
0.5%
Latin 26
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
349
 
7.1%
233
 
4.7%
185
 
3.8%
182
 
3.7%
159
 
3.2%
128
 
2.6%
111
 
2.3%
99
 
2.0%
96
 
1.9%
94
 
1.9%
Other values (115) 3295
66.8%
Common
ValueCountFrequency (%)
) 13
50.0%
( 13
50.0%
Latin
ValueCountFrequency (%)
R 13
50.0%
D 13
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4931
99.0%
ASCII 52
 
1.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
349
 
7.1%
233
 
4.7%
185
 
3.8%
182
 
3.7%
159
 
3.2%
128
 
2.6%
111
 
2.3%
99
 
2.0%
96
 
1.9%
94
 
1.9%
Other values (115) 3295
66.8%
ASCII
ValueCountFrequency (%)
) 13
25.0%
R 13
25.0%
( 13
25.0%
D 13
25.0%
Distinct85
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
2023-12-12T17:34:42.221812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length21
Mean length8.0323575
Min length4

Characters and Unicode

Total characters10426
Distinct characters50
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNepal
2nd rowNepal
3rd rowNepal
4th rowNepal
5th rowNepal
ValueCountFrequency (%)
29
 
1.9%
new 29
 
1.9%
republic 29
 
1.9%
iran 27
 
1.7%
nepal 26
 
1.7%
thailand 24
 
1.5%
united 24
 
1.5%
timor-leste 21
 
1.3%
tanzania 21
 
1.3%
lanka 21
 
1.3%
Other values (90) 1315
84.0%
2023-12-12T17:34:42.736355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 1677
16.1%
i 906
 
8.7%
n 806
 
7.7%
e 682
 
6.5%
r 579
 
5.6%
o 454
 
4.4%
l 452
 
4.3%
d 376
 
3.6%
u 321
 
3.1%
t 319
 
3.1%
Other values (40) 3854
37.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 8502
81.5%
Uppercase Letter 1593
 
15.3%
Space Separator 268
 
2.6%
Other Punctuation 42
 
0.4%
Dash Punctuation 21
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 1677
19.7%
i 906
10.7%
n 806
9.5%
e 682
 
8.0%
r 579
 
6.8%
o 454
 
5.3%
l 452
 
5.3%
d 376
 
4.4%
u 321
 
3.8%
t 319
 
3.8%
Other values (14) 1930
22.7%
Uppercase Letter
ValueCountFrequency (%)
S 137
 
8.6%
C 134
 
8.4%
T 131
 
8.2%
A 115
 
7.2%
I 112
 
7.0%
B 105
 
6.6%
N 97
 
6.1%
R 88
 
5.5%
P 85
 
5.3%
U 79
 
5.0%
Other values (12) 510
32.0%
Other Punctuation
ValueCountFrequency (%)
: 24
57.1%
& 18
42.9%
Space Separator
ValueCountFrequency (%)
268
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 10095
96.8%
Common 331
 
3.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 1677
16.6%
i 906
 
9.0%
n 806
 
8.0%
e 682
 
6.8%
r 579
 
5.7%
o 454
 
4.5%
l 452
 
4.5%
d 376
 
3.7%
u 321
 
3.2%
t 319
 
3.2%
Other values (36) 3523
34.9%
Common
ValueCountFrequency (%)
268
81.0%
: 24
 
7.3%
- 21
 
6.3%
& 18
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10426
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 1677
16.1%
i 906
 
8.7%
n 806
 
7.7%
e 682
 
6.5%
r 579
 
5.6%
o 454
 
4.4%
l 452
 
4.3%
d 376
 
3.6%
u 321
 
3.1%
t 319
 
3.1%
Other values (40) 3854
37.0%
Distinct85
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
2023-12-12T17:34:43.022847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters2596
Distinct characters25
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNP
2nd rowNP
3rd rowNP
4th rowNP
5th rowNP
ValueCountFrequency (%)
ir 27
 
2.1%
np 26
 
2.0%
th 24
 
1.8%
bd 21
 
1.6%
co 21
 
1.6%
lk 21
 
1.6%
tz 21
 
1.6%
tl 21
 
1.6%
my 20
 
1.5%
in 20
 
1.5%
Other values (75) 1076
82.9%
2023-12-12T17:34:43.457031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
T 198
 
7.6%
R 178
 
6.9%
N 165
 
6.4%
A 156
 
6.0%
E 156
 
6.0%
C 133
 
5.1%
G 131
 
5.0%
I 130
 
5.0%
Z 130
 
5.0%
L 127
 
4.9%
Other values (15) 1092
42.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 2596
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
T 198
 
7.6%
R 178
 
6.9%
N 165
 
6.4%
A 156
 
6.0%
E 156
 
6.0%
C 133
 
5.1%
G 131
 
5.0%
I 130
 
5.0%
Z 130
 
5.0%
L 127
 
4.9%
Other values (15) 1092
42.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 2596
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 198
 
7.6%
R 178
 
6.9%
N 165
 
6.4%
A 156
 
6.0%
E 156
 
6.0%
C 133
 
5.1%
G 131
 
5.0%
I 130
 
5.0%
Z 130
 
5.0%
L 127
 
4.9%
Other values (15) 1092
42.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2596
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
T 198
 
7.6%
R 178
 
6.9%
N 165
 
6.4%
A 156
 
6.0%
E 156
 
6.0%
C 133
 
5.1%
G 131
 
5.0%
I 130
 
5.0%
Z 130
 
5.0%
L 127
 
4.9%
Other values (15) 1092
42.1%
Distinct85
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
2023-12-12T17:34:43.773290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length14.966872
Min length13

Characters and Unicode

Total characters19427
Distinct characters135
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주 네팔 대한민국 대사관
2nd row주 네팔 대한민국 대사관
3rd row주 네팔 대한민국 대사관
4th row주 네팔 대한민국 대사관
5th row주 네팔 대한민국 대사관
ValueCountFrequency (%)
1298
24.9%
대한민국 1298
24.9%
대사관 1298
24.9%
이란 27
 
0.5%
네팔 26
 
0.5%
태국 24
 
0.5%
방글라데시 21
 
0.4%
콜롬비아 21
 
0.4%
스리랑카 21
 
0.4%
탄자니아 21
 
0.4%
Other values (79) 1151
22.1%
2023-12-12T17:34:44.285368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3908
20.1%
2596
13.4%
1450
 
7.5%
1324
 
6.8%
1311
 
6.7%
1298
 
6.7%
1298
 
6.7%
1298
 
6.7%
349
 
1.8%
233
 
1.2%
Other values (125) 4362
22.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15519
79.9%
Space Separator 3908
 
20.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2596
16.7%
1450
 
9.3%
1324
 
8.5%
1311
 
8.4%
1298
 
8.4%
1298
 
8.4%
1298
 
8.4%
349
 
2.2%
233
 
1.5%
185
 
1.2%
Other values (124) 4177
26.9%
Space Separator
ValueCountFrequency (%)
3908
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15519
79.9%
Common 3908
 
20.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2596
16.7%
1450
 
9.3%
1324
 
8.5%
1311
 
8.4%
1298
 
8.4%
1298
 
8.4%
1298
 
8.4%
349
 
2.2%
233
 
1.5%
185
 
1.2%
Other values (124) 4177
26.9%
Common
ValueCountFrequency (%)
3908
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15519
79.9%
ASCII 3908
 
20.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3908
100.0%
Hangul
ValueCountFrequency (%)
2596
16.7%
1450
 
9.3%
1324
 
8.5%
1311
 
8.4%
1298
 
8.4%
1298
 
8.4%
1298
 
8.4%
349
 
2.2%
233
 
1.5%
185
 
1.2%
Other values (124) 4177
26.9%
Distinct849
Distinct (%)65.4%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
2023-12-12T17:34:44.694830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length140
Median length58.5
Mean length15.20339
Min length2

Characters and Unicode

Total characters19734
Distinct characters410
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique783 ?
Unique (%)60.3%

Sample

1st rowNew Year's Day
2nd rowMakar Sankranti Festival
3rd rowSonam Lhosar
4th rowShiva Ratri
5th rowInternational Women's Day
ValueCountFrequency (%)
day 235
 
6.7%
de 96
 
2.8%
88
 
2.5%
개천절 77
 
2.2%
광복절 76
 
2.2%
한글날 74
 
2.1%
la 44
 
1.3%
삼일절 43
 
1.2%
of 40
 
1.1%
new 39
 
1.1%
Other values (1279) 2673
76.7%
2023-12-12T17:34:45.323231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2228
 
11.3%
a 1601
 
8.1%
e 875
 
4.4%
i 720
 
3.6%
n 679
 
3.4%
o 667
 
3.4%
r 599
 
3.0%
d 565
 
2.9%
t 528
 
2.7%
( 485
 
2.5%
Other values (400) 10787
54.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 9323
47.2%
Other Letter 4981
25.2%
Space Separator 2228
 
11.3%
Uppercase Letter 1813
 
9.2%
Open Punctuation 485
 
2.5%
Close Punctuation 485
 
2.5%
Other Punctuation 196
 
1.0%
Decimal Number 135
 
0.7%
Dash Punctuation 76
 
0.4%
Final Punctuation 8
 
< 0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
430
 
8.6%
407
 
8.2%
224
 
4.5%
178
 
3.6%
138
 
2.8%
128
 
2.6%
122
 
2.4%
108
 
2.2%
89
 
1.8%
89
 
1.8%
Other values (299) 3068
61.6%
Lowercase Letter
ValueCountFrequency (%)
a 1601
17.2%
e 875
9.4%
i 720
 
7.7%
n 679
 
7.3%
o 667
 
7.2%
r 599
 
6.4%
d 565
 
6.1%
t 528
 
5.7%
y 463
 
5.0%
s 462
 
5.0%
Other values (38) 2164
23.2%
Uppercase Letter
ValueCountFrequency (%)
D 369
20.4%
N 130
 
7.2%
A 123
 
6.8%
M 115
 
6.3%
F 113
 
6.2%
S 104
 
5.7%
E 97
 
5.4%
C 96
 
5.3%
I 78
 
4.3%
P 77
 
4.2%
Other values (18) 511
28.2%
Decimal Number
ValueCountFrequency (%)
1 55
40.7%
3 38
28.1%
2 12
 
8.9%
5 10
 
7.4%
0 6
 
4.4%
9 5
 
3.7%
6 3
 
2.2%
4 2
 
1.5%
7 2
 
1.5%
8 2
 
1.5%
Other Punctuation
ValueCountFrequency (%)
' 66
33.7%
. 56
28.6%
* 31
15.8%
, 28
14.3%
/ 8
 
4.1%
· 4
 
2.0%
& 3
 
1.5%
Close Punctuation
ValueCountFrequency (%)
) 484
99.8%
1
 
0.2%
Space Separator
ValueCountFrequency (%)
2228
100.0%
Open Punctuation
ValueCountFrequency (%)
( 485
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 76
100.0%
Final Punctuation
ValueCountFrequency (%)
8
100.0%
Format
ValueCountFrequency (%)
2
100.0%
Control
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 11128
56.4%
Hangul 4976
25.2%
Common 3617
 
18.3%
Cyrillic 8
 
< 0.1%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
430
 
8.6%
407
 
8.2%
224
 
4.5%
178
 
3.6%
138
 
2.8%
128
 
2.6%
122
 
2.5%
108
 
2.2%
89
 
1.8%
89
 
1.8%
Other values (296) 3063
61.6%
Latin
ValueCountFrequency (%)
a 1601
14.4%
e 875
 
7.9%
i 720
 
6.5%
n 679
 
6.1%
o 667
 
6.0%
r 599
 
5.4%
d 565
 
5.1%
t 528
 
4.7%
y 463
 
4.2%
s 462
 
4.2%
Other values (59) 3969
35.7%
Common
ValueCountFrequency (%)
2228
61.6%
( 485
 
13.4%
) 484
 
13.4%
- 76
 
2.1%
' 66
 
1.8%
. 56
 
1.5%
1 55
 
1.5%
3 38
 
1.1%
* 31
 
0.9%
, 28
 
0.8%
Other values (15) 70
 
1.9%
Cyrillic
ValueCountFrequency (%)
а 2
25.0%
Р 1
12.5%
н 1
12.5%
д 1
12.5%
ц 1
12.5%
и 1
12.5%
о 1
12.5%
Han
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14585
73.9%
Hangul 4976
 
25.2%
None 150
 
0.8%
Punctuation 10
 
0.1%
Cyrillic 8
 
< 0.1%
CJK 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2228
 
15.3%
a 1601
 
11.0%
e 875
 
6.0%
i 720
 
4.9%
n 679
 
4.7%
o 667
 
4.6%
r 599
 
4.1%
d 565
 
3.9%
t 528
 
3.6%
( 485
 
3.3%
Other values (62) 5638
38.7%
Hangul
ValueCountFrequency (%)
430
 
8.6%
407
 
8.2%
224
 
4.5%
178
 
3.6%
138
 
2.8%
128
 
2.6%
122
 
2.5%
108
 
2.2%
89
 
1.8%
89
 
1.8%
Other values (296) 3063
61.6%
None
ValueCountFrequency (%)
í 57
38.0%
ó 18
 
12.0%
ñ 15
 
10.0%
ê 10
 
6.7%
ı 9
 
6.0%
é 7
 
4.7%
ü 7
 
4.7%
ç 5
 
3.3%
ã 4
 
2.7%
· 4
 
2.7%
Other values (10) 14
 
9.3%
Punctuation
ValueCountFrequency (%)
8
80.0%
2
 
20.0%
CJK
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Cyrillic
ValueCountFrequency (%)
а 2
25.0%
Р 1
12.5%
н 1
12.5%
д 1
12.5%
ц 1
12.5%
и 1
12.5%
о 1
12.5%
Distinct267
Distinct (%)20.6%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
Minimum2022-01-01 00:00:00
Maximum2022-12-31 00:00:00
2023-12-12T17:34:45.581991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:34:45.759611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct267
Distinct (%)20.6%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
Minimum2022-01-01 00:00:00
Maximum2022-12-31 00:00:00
2023-12-12T17:34:45.915253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:34:46.116827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴일설명
Text

MISSING 

Distinct84
Distinct (%)26.8%
Missing985
Missing (%)75.9%
Memory size10.3 KiB
2023-12-12T17:34:46.413549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length109
Median length8
Mean length11.303514
Min length2

Characters and Unicode

Total characters3538
Distinct characters225
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique60 ?
Unique (%)19.2%

Sample

1st row우리나라 국경일
2nd row우리나라 국경일
3rd row우리나라 국경일
4th row우리나라 국경일
5th row우리나라 국경일
ValueCountFrequency (%)
국경일 165
 
19.5%
우리나라 161
 
19.1%
15
 
1.8%
따라 14
 
1.7%
축제 14
 
1.7%
매년 13
 
1.5%
대체휴일 12
 
1.4%
있음 11
 
1.3%
ireland 10
 
1.2%
10
 
1.2%
Other values (218) 420
49.7%
2023-12-12T17:34:46.935456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
541
 
15.3%
267
 
7.5%
182
 
5.1%
182
 
5.1%
181
 
5.1%
176
 
5.0%
169
 
4.8%
165
 
4.7%
. 68
 
1.9%
2 68
 
1.9%
Other values (215) 1539
43.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2459
69.5%
Space Separator 541
 
15.3%
Decimal Number 233
 
6.6%
Other Punctuation 129
 
3.6%
Lowercase Letter 88
 
2.5%
Open Punctuation 30
 
0.8%
Close Punctuation 30
 
0.8%
Uppercase Letter 18
 
0.5%
Math Symbol 8
 
0.2%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
267
 
10.9%
182
 
7.4%
182
 
7.4%
181
 
7.4%
176
 
7.2%
169
 
6.9%
165
 
6.7%
51
 
2.1%
35
 
1.4%
33
 
1.3%
Other values (178) 1018
41.4%
Decimal Number
ValueCountFrequency (%)
2 68
29.2%
1 53
22.7%
0 39
16.7%
4 16
 
6.9%
5 14
 
6.0%
7 12
 
5.2%
9 12
 
5.2%
3 9
 
3.9%
8 6
 
2.6%
6 4
 
1.7%
Lowercase Letter
ValueCountFrequency (%)
a 17
19.3%
r 15
17.0%
e 14
15.9%
d 13
14.8%
l 12
13.6%
n 10
11.4%
o 4
 
4.5%
h 1
 
1.1%
t 1
 
1.1%
i 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
. 68
52.7%
* 26
 
20.2%
, 19
 
14.7%
/ 6
 
4.7%
: 6
 
4.7%
' 4
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
I 12
66.7%
K 4
 
22.2%
A 1
 
5.6%
F 1
 
5.6%
Math Symbol
ValueCountFrequency (%)
< 3
37.5%
> 3
37.5%
~ 2
25.0%
Space Separator
ValueCountFrequency (%)
541
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2453
69.3%
Common 973
 
27.5%
Latin 106
 
3.0%
Han 6
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
267
 
10.9%
182
 
7.4%
182
 
7.4%
181
 
7.4%
176
 
7.2%
169
 
6.9%
165
 
6.7%
51
 
2.1%
35
 
1.4%
33
 
1.3%
Other values (176) 1012
41.3%
Common
ValueCountFrequency (%)
541
55.6%
. 68
 
7.0%
2 68
 
7.0%
1 53
 
5.4%
0 39
 
4.0%
( 30
 
3.1%
) 30
 
3.1%
* 26
 
2.7%
, 19
 
2.0%
4 16
 
1.6%
Other values (13) 83
 
8.5%
Latin
ValueCountFrequency (%)
a 17
16.0%
r 15
14.2%
e 14
13.2%
d 13
12.3%
l 12
11.3%
I 12
11.3%
n 10
9.4%
o 4
 
3.8%
K 4
 
3.8%
h 1
 
0.9%
Other values (4) 4
 
3.8%
Han
ValueCountFrequency (%)
3
50.0%
3
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2453
69.3%
ASCII 1079
30.5%
CJK 6
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
541
50.1%
. 68
 
6.3%
2 68
 
6.3%
1 53
 
4.9%
0 39
 
3.6%
( 30
 
2.8%
) 30
 
2.8%
* 26
 
2.4%
, 19
 
1.8%
a 17
 
1.6%
Other values (27) 188
 
17.4%
Hangul
ValueCountFrequency (%)
267
 
10.9%
182
 
7.4%
182
 
7.4%
181
 
7.4%
176
 
7.2%
169
 
6.9%
165
 
6.7%
51
 
2.1%
35
 
1.4%
33
 
1.3%
Other values (176) 1012
41.3%
CJK
ValueCountFrequency (%)
3
50.0%
3
50.0%

Correlations

2023-12-12T17:34:47.386125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국가명국가영문명iso 2자리코드공관명휴일설명
국가명1.0001.0001.0001.0000.961
국가영문명1.0001.0001.0001.0000.961
iso 2자리코드1.0001.0001.0001.0000.961
공관명1.0001.0001.0001.0000.961
휴일설명0.9610.9610.9610.9611.000

Missing values

2023-12-12T17:34:40.912436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:34:41.084110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

국가명국가영문명iso 2자리코드공관명휴일명휴일시작일휴일종료일휴일설명
0네팔NepalNP주 네팔 대한민국 대사관New Year's Day2022-01-012022-01-01<NA>
1네팔NepalNP주 네팔 대한민국 대사관Makar Sankranti Festival2022-01-152022-01-15<NA>
2네팔NepalNP주 네팔 대한민국 대사관Sonam Lhosar2022-02-022022-02-02<NA>
3네팔NepalNP주 네팔 대한민국 대사관Shiva Ratri2022-03-012022-03-01<NA>
4네팔NepalNP주 네팔 대한민국 대사관International Women's Day2022-03-082022-03-08<NA>
5네팔NepalNP주 네팔 대한민국 대사관Fagu Purnima2022-03-172022-03-17<NA>
6네팔NepalNP주 네팔 대한민국 대사관Ghode Jatra2022-04-012022-04-01<NA>
7네팔NepalNP주 네팔 대한민국 대사관Nepali New Year2022-04-142022-04-14<NA>
8네팔NepalNP주 네팔 대한민국 대사관International Labor Day2022-05-012022-05-01<NA>
9네팔NepalNP주 네팔 대한민국 대사관Ramjan Edul Fikra2022-05-032022-05-03<NA>
국가명국가영문명iso 2자리코드공관명휴일명휴일시작일휴일종료일휴일설명
1288탄자니아TanzaniaTZ주 탄자니아 대한민국 대사관Eid al-Hajj2022-07-102022-07-10변경가능
1289탄자니아TanzaniaTZ주 탄자니아 대한민국 대사관Nanenane2022-08-082022-08-08<NA>
1290탄자니아TanzaniaTZ주 탄자니아 대한민국 대사관광복절2022-08-152022-08-15<NA>
1291탄자니아TanzaniaTZ주 탄자니아 대한민국 대사관개천절2022-10-032022-10-03<NA>
1292탄자니아TanzaniaTZ주 탄자니아 대한민국 대사관한글날2022-10-092022-10-09<NA>
1293탄자니아TanzaniaTZ주 탄자니아 대한민국 대사관Maulid Day2022-10-092022-10-09변경가능
1294탄자니아TanzaniaTZ주 탄자니아 대한민국 대사관Nyerere Day2022-10-142022-10-14<NA>
1295탄자니아TanzaniaTZ주 탄자니아 대한민국 대사관Independence Day2022-12-092022-12-09<NA>
1296탄자니아TanzaniaTZ주 탄자니아 대한민국 대사관Christmas2022-12-252022-12-25<NA>
1297탄자니아TanzaniaTZ주 탄자니아 대한민국 대사관Boxing Day2022-12-262022-12-26<NA>