Overview

Dataset statistics

Number of variables13
Number of observations197
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.5 KiB
Average record size in memory106.7 B

Variable types

Text10
Numeric2
Categorical1

Dataset

Description외교부 홈페이지에 공개 중인 각 국가별 약황 정보 중 일반정보를 CSV 형식으로 제공 합니다.(데이터 업데이트 주기: 12개월, 실시간 정보는 동명의 API 참고)
Author외교부
URLhttps://www.data.go.kr/data/15076557/fileData.do

Alerts

인구 is highly overall correlated with 면적High correlation
면적 is highly overall correlated with 인구High correlation
기후 is highly imbalanced (62.1%)Imbalance
한글국가명 has unique valuesUnique
영문국가명 has unique valuesUnique
수도 has unique valuesUnique
면적 has unique valuesUnique

Reproduction

Analysis started2024-04-21 01:43:36.912858
Analysis finished2024-04-21 01:43:40.270341
Duration3.36 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

한글국가명
Text

UNIQUE 

Distinct197
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-04-21T10:43:40.496706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length3.964467
Min length2

Characters and Unicode

Total characters781
Distinct characters180
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique197 ?
Unique (%)100.0%

Sample

1st row가나
2nd row가봉
3rd row가이아나
4th row감비아
5th row과테말라
ValueCountFrequency (%)
가나 1
 
0.5%
슬로베니아 1
 
0.5%
오스트리아 1
 
0.5%
온두라스 1
 
0.5%
요르단 1
 
0.5%
우간다 1
 
0.5%
우루과이 1
 
0.5%
우즈베키스탄 1
 
0.5%
우크라이나 1
 
0.5%
이라크 1
 
0.5%
Other values (187) 187
94.9%
2024-04-21T10:43:40.925806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
55
 
7.0%
31
 
4.0%
28
 
3.6%
24
 
3.1%
24
 
3.1%
23
 
2.9%
20
 
2.6%
16
 
2.0%
16
 
2.0%
14
 
1.8%
Other values (170) 530
67.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 781
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
55
 
7.0%
31
 
4.0%
28
 
3.6%
24
 
3.1%
24
 
3.1%
23
 
2.9%
20
 
2.6%
16
 
2.0%
16
 
2.0%
14
 
1.8%
Other values (170) 530
67.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 781
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
55
 
7.0%
31
 
4.0%
28
 
3.6%
24
 
3.1%
24
 
3.1%
23
 
2.9%
20
 
2.6%
16
 
2.0%
16
 
2.0%
14
 
1.8%
Other values (170) 530
67.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 781
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
55
 
7.0%
31
 
4.0%
28
 
3.6%
24
 
3.1%
24
 
3.1%
23
 
2.9%
20
 
2.6%
16
 
2.0%
16
 
2.0%
14
 
1.8%
Other values (170) 530
67.9%

영문국가명
Text

UNIQUE 

Distinct197
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-04-21T10:43:41.198030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length26
Mean length8.5380711
Min length3

Characters and Unicode

Total characters1682
Distinct characters57
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique197 ?
Unique (%)100.0%

Sample

1st rowGhana
2nd rowGabon
3rd rowGuyana
4th rowGambia
5th rowGuatemala
ValueCountFrequency (%)
republic 5
 
2.0%
5
 
2.0%
of 4
 
1.6%
united 3
 
1.2%
st 3
 
1.2%
guinea 3
 
1.2%
islands 3
 
1.2%
states 2
 
0.8%
sudan 2
 
0.8%
south 2
 
0.8%
Other values (217) 219
87.3%
2024-04-21T10:43:41.600701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 249
14.8%
i 149
 
8.9%
n 125
 
7.4%
e 117
 
7.0%
o 90
 
5.4%
r 90
 
5.4%
u 66
 
3.9%
t 64
 
3.8%
l 58
 
3.4%
s 57
 
3.4%
Other values (47) 617
36.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1361
80.9%
Uppercase Letter 251
 
14.9%
Space Separator 54
 
3.2%
Other Punctuation 12
 
0.7%
Dash Punctuation 4
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 249
18.3%
i 149
10.9%
n 125
9.2%
e 117
 
8.6%
o 90
 
6.6%
r 90
 
6.6%
u 66
 
4.8%
t 64
 
4.7%
l 58
 
4.3%
s 57
 
4.2%
Other values (16) 296
21.7%
Uppercase Letter
ValueCountFrequency (%)
S 29
 
11.6%
C 20
 
8.0%
M 20
 
8.0%
B 19
 
7.6%
A 17
 
6.8%
T 15
 
6.0%
G 15
 
6.0%
N 14
 
5.6%
L 13
 
5.2%
P 12
 
4.8%
Other values (14) 77
30.7%
Other Punctuation
ValueCountFrequency (%)
. 3
25.0%
& 3
25.0%
? 3
25.0%
: 2
16.7%
' 1
 
8.3%
Space Separator
ValueCountFrequency (%)
54
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1612
95.8%
Common 70
 
4.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 249
15.4%
i 149
 
9.2%
n 125
 
7.8%
e 117
 
7.3%
o 90
 
5.6%
r 90
 
5.6%
u 66
 
4.1%
t 64
 
4.0%
l 58
 
3.6%
s 57
 
3.5%
Other values (40) 547
33.9%
Common
ValueCountFrequency (%)
54
77.1%
- 4
 
5.7%
. 3
 
4.3%
& 3
 
4.3%
? 3
 
4.3%
: 2
 
2.9%
' 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1682
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 249
14.8%
i 149
 
8.9%
n 125
 
7.4%
e 117
 
7.0%
o 90
 
5.4%
r 90
 
5.4%
u 66
 
3.9%
t 64
 
3.8%
l 58
 
3.4%
s 57
 
3.4%
Other values (47) 617
36.7%
Distinct196
Distinct (%)100.0%
Missing1
Missing (%)0.5%
Memory size1.7 KiB
2024-04-21T10:43:41.975560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters392
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique196 ?
Unique (%)100.0%

Sample

1st rowGH
2nd rowGA
3rd rowGY
4th rowGM
5th rowGT
ValueCountFrequency (%)
gh 1
 
0.5%
si 1
 
0.5%
at 1
 
0.5%
hn 1
 
0.5%
jo 1
 
0.5%
ug 1
 
0.5%
uy 1
 
0.5%
uz 1
 
0.5%
ua 1
 
0.5%
iq 1
 
0.5%
Other values (186) 186
94.9%
2024-04-21T10:43:42.440375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
M 30
 
7.7%
S 27
 
6.9%
T 24
 
6.1%
G 23
 
5.9%
A 22
 
5.6%
C 21
 
5.4%
B 21
 
5.4%
N 19
 
4.8%
E 19
 
4.8%
L 19
 
4.8%
Other values (16) 167
42.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 392
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
M 30
 
7.7%
S 27
 
6.9%
T 24
 
6.1%
G 23
 
5.9%
A 22
 
5.6%
C 21
 
5.4%
B 21
 
5.4%
N 19
 
4.8%
E 19
 
4.8%
L 19
 
4.8%
Other values (16) 167
42.6%

Most occurring scripts

ValueCountFrequency (%)
Latin 392
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
M 30
 
7.7%
S 27
 
6.9%
T 24
 
6.1%
G 23
 
5.9%
A 22
 
5.6%
C 21
 
5.4%
B 21
 
5.4%
N 19
 
4.8%
E 19
 
4.8%
L 19
 
4.8%
Other values (16) 167
42.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 392
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
M 30
 
7.7%
S 27
 
6.9%
T 24
 
6.1%
G 23
 
5.9%
A 22
 
5.6%
C 21
 
5.4%
B 21
 
5.4%
N 19
 
4.8%
E 19
 
4.8%
L 19
 
4.8%
Other values (16) 167
42.6%

수도
Text

UNIQUE 

Distinct197
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-04-21T10:43:42.699035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length92
Median length49
Mean length17.771574
Min length2

Characters and Unicode

Total characters3501
Distinct characters312
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique197 ?
Unique (%)100.0%

Sample

1st row아크라(Accra)
2nd row리브르빌(Libreville)
3rd row조지타운(Georgetown)
4th row반줄(Banjul)
5th row과테말라시티(Guatemala City)
ValueCountFrequency (%)
15
 
3.4%
인구 13
 
3.0%
10
 
2.3%
7
 
1.6%
기준 5
 
1.1%
city 4
 
0.9%
행정수도 3
 
0.7%
수도 3
 
0.7%
la 2
 
0.5%
33만명 2
 
0.5%
Other values (366) 373
85.4%
2024-04-21T10:43:43.135152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
242
 
6.9%
a 205
 
5.9%
( 195
 
5.6%
) 195
 
5.6%
o 103
 
2.9%
, 99
 
2.8%
i 98
 
2.8%
n 88
 
2.5%
r 76
 
2.2%
75
 
2.1%
Other values (302) 2125
60.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1139
32.5%
Lowercase Letter 1096
31.3%
Decimal Number 270
 
7.7%
Space Separator 242
 
6.9%
Uppercase Letter 212
 
6.1%
Open Punctuation 195
 
5.6%
Close Punctuation 195
 
5.6%
Other Punctuation 146
 
4.2%
Dash Punctuation 4
 
0.1%
Initial Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
75
 
6.6%
71
 
6.2%
34
 
3.0%
32
 
2.8%
28
 
2.5%
28
 
2.5%
23
 
2.0%
21
 
1.8%
19
 
1.7%
19
 
1.7%
Other values (230) 789
69.3%
Lowercase Letter
ValueCountFrequency (%)
a 205
18.7%
o 103
9.4%
i 98
8.9%
n 88
 
8.0%
r 76
 
6.9%
e 71
 
6.5%
u 59
 
5.4%
t 56
 
5.1%
s 53
 
4.8%
l 46
 
4.2%
Other values (15) 241
22.0%
Uppercase Letter
ValueCountFrequency (%)
S 19
 
9.0%
B 19
 
9.0%
M 19
 
9.0%
P 18
 
8.5%
A 17
 
8.0%
D 15
 
7.1%
N 13
 
6.1%
T 13
 
6.1%
C 13
 
6.1%
L 12
 
5.7%
Other values (13) 54
25.5%
Decimal Number
ValueCountFrequency (%)
2 48
17.8%
1 47
17.4%
0 42
15.6%
7 30
11.1%
4 23
8.5%
3 21
7.8%
8 17
 
6.3%
6 17
 
6.3%
9 13
 
4.8%
5 12
 
4.4%
Other Punctuation
ValueCountFrequency (%)
, 99
67.8%
. 20
 
13.7%
: 8
 
5.5%
? 7
 
4.8%
' 7
 
4.8%
/ 2
 
1.4%
2
 
1.4%
· 1
 
0.7%
Space Separator
ValueCountFrequency (%)
242
100.0%
Open Punctuation
ValueCountFrequency (%)
( 195
100.0%
Close Punctuation
ValueCountFrequency (%)
) 195
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1308
37.4%
Hangul 1135
32.4%
Common 1054
30.1%
Han 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
75
 
6.6%
71
 
6.3%
34
 
3.0%
32
 
2.8%
28
 
2.5%
28
 
2.5%
23
 
2.0%
21
 
1.9%
19
 
1.7%
19
 
1.7%
Other values (227) 785
69.2%
Latin
ValueCountFrequency (%)
a 205
15.7%
o 103
 
7.9%
i 98
 
7.5%
n 88
 
6.7%
r 76
 
5.8%
e 71
 
5.4%
u 59
 
4.5%
t 56
 
4.3%
s 53
 
4.1%
l 46
 
3.5%
Other values (38) 453
34.6%
Common
ValueCountFrequency (%)
242
23.0%
( 195
18.5%
) 195
18.5%
, 99
9.4%
2 48
 
4.6%
1 47
 
4.5%
0 42
 
4.0%
7 30
 
2.8%
4 23
 
2.2%
3 21
 
2.0%
Other values (14) 112
10.6%
Han
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2357
67.3%
Hangul 1135
32.4%
CJK 4
 
0.1%
Punctuation 4
 
0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
242
 
10.3%
a 205
 
8.7%
( 195
 
8.3%
) 195
 
8.3%
o 103
 
4.4%
, 99
 
4.2%
i 98
 
4.2%
n 88
 
3.7%
r 76
 
3.2%
e 71
 
3.0%
Other values (58) 985
41.8%
Hangul
ValueCountFrequency (%)
75
 
6.6%
71
 
6.3%
34
 
3.0%
32
 
2.8%
28
 
2.5%
28
 
2.5%
23
 
2.0%
21
 
1.9%
19
 
1.7%
19
 
1.7%
Other values (227) 785
69.2%
CJK
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Punctuation
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
None
ValueCountFrequency (%)
· 1
100.0%

인구
Real number (ℝ)

HIGH CORRELATION 

Distinct195
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39521004
Minimum274
Maximum1.40967 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2024-04-21T10:43:43.274565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum274
5-th percentile54211
Q11580000
median8610000
Q328430000
95-th percentile1.2612 × 108
Maximum1.40967 × 109
Range1.4096697 × 109
Interquartile range (IQR)26850000

Descriptive statistics

Standard deviation1.4696817 × 108
Coefficient of variation (CV)3.7187357
Kurtosis76.290503
Mean39521004
Median Absolute Deviation (MAD)8170000
Skewness8.4222966
Sum7.7856378 × 109
Variance2.1599643 × 1016
MonotonicityNot monotonic
2024-04-21T10:43:43.409538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
440000 2
 
1.0%
110000 2
 
1.0%
32400000 1
 
0.5%
62390000 1
 
0.5%
11310000 1
 
0.5%
47000000 1
 
0.5%
3480000 1
 
0.5%
33900000 1
 
0.5%
41600000 1
 
0.5%
39650000 1
 
0.5%
Other values (185) 185
93.9%
ValueCountFrequency (%)
274 1
0.5%
1000 1
0.5%
1600 1
0.5%
10800 1
0.5%
11925 1
0.5%
17601 1
0.5%
18000 1
0.5%
31223 1
0.5%
33745 1
0.5%
39055 1
0.5%
ValueCountFrequency (%)
1409670000 1
0.5%
1407000000 1
0.5%
334910000 1
0.5%
277430000 1
0.5%
230000000 1
0.5%
218540000 1
0.5%
210000000 1
0.5%
170000000 1
0.5%
143200000 1
0.5%
130120000 1
0.5%
Distinct97
Distinct (%)49.2%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-04-21T10:43:43.669194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length20
Mean length11.253807
Min length5

Characters and Unicode

Total characters2217
Distinct characters111
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)37.6%

Sample

1st row('22, EIU)
2nd row(‘21 World Bank)
3rd row(2019 CIA)
4th row('22 World Bank)
5th row(2022, IMF)
ValueCountFrequency (%)
imf 45
 
11.0%
world 38
 
9.3%
bank 37
 
9.0%
2022 30
 
7.3%
2021 25
 
6.1%
22 19
 
4.6%
null 18
 
4.4%
21 14
 
3.4%
2023 12
 
2.9%
cia 12
 
2.9%
Other values (78) 160
39.0%
2024-04-21T10:43:44.017024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 333
 
15.0%
217
 
9.8%
) 174
 
7.8%
( 174
 
7.8%
0 110
 
5.0%
1 73
 
3.3%
I 69
 
3.1%
, 67
 
3.0%
' 54
 
2.4%
W 50
 
2.3%
Other values (101) 896
40.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 576
26.0%
Uppercase Letter 380
17.1%
Lowercase Letter 326
14.7%
Space Separator 217
 
9.8%
Close Punctuation 174
 
7.8%
Open Punctuation 174
 
7.8%
Other Punctuation 165
 
7.4%
Other Letter 157
 
7.1%
Dash Punctuation 24
 
1.1%
Initial Punctuation 19
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
6.4%
10
 
6.4%
10
 
6.4%
9
 
5.7%
9
 
5.7%
8
 
5.1%
8
 
5.1%
5
 
3.2%
5
 
3.2%
4
 
2.5%
Other values (50) 79
50.3%
Lowercase Letter
ValueCountFrequency (%)
o 50
15.3%
r 48
14.7%
d 43
13.2%
l 43
13.2%
k 38
11.7%
a 38
11.7%
n 37
11.3%
e 10
 
3.1%
t 6
 
1.8%
m 5
 
1.5%
Other values (5) 8
 
2.5%
Uppercase Letter
ValueCountFrequency (%)
I 69
18.2%
W 50
13.2%
M 47
12.4%
F 47
12.4%
B 45
11.8%
L 36
9.5%
U 26
 
6.8%
N 20
 
5.3%
A 15
 
3.9%
C 15
 
3.9%
Other values (4) 10
 
2.6%
Decimal Number
ValueCountFrequency (%)
2 333
57.8%
0 110
 
19.1%
1 73
 
12.7%
3 35
 
6.1%
7 8
 
1.4%
5 5
 
0.9%
9 4
 
0.7%
8 3
 
0.5%
6 3
 
0.5%
4 2
 
0.3%
Other Punctuation
ValueCountFrequency (%)
, 67
40.6%
' 54
32.7%
. 35
21.2%
/ 7
 
4.2%
% 1
 
0.6%
: 1
 
0.6%
Space Separator
ValueCountFrequency (%)
217
100.0%
Close Punctuation
ValueCountFrequency (%)
) 174
100.0%
Open Punctuation
ValueCountFrequency (%)
( 174
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Initial Punctuation
ValueCountFrequency (%)
19
100.0%
Final Punctuation
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1354
61.1%
Latin 706
31.8%
Hangul 157
 
7.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
6.4%
10
 
6.4%
10
 
6.4%
9
 
5.7%
9
 
5.7%
8
 
5.1%
8
 
5.1%
5
 
3.2%
5
 
3.2%
4
 
2.5%
Other values (50) 79
50.3%
Latin
ValueCountFrequency (%)
I 69
 
9.8%
W 50
 
7.1%
o 50
 
7.1%
r 48
 
6.8%
M 47
 
6.7%
F 47
 
6.7%
B 45
 
6.4%
d 43
 
6.1%
l 43
 
6.1%
k 38
 
5.4%
Other values (19) 226
32.0%
Common
ValueCountFrequency (%)
2 333
24.6%
217
16.0%
) 174
12.9%
( 174
12.9%
0 110
 
8.1%
1 73
 
5.4%
, 67
 
4.9%
' 54
 
4.0%
. 35
 
2.6%
3 35
 
2.6%
Other values (12) 82
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2036
91.8%
Hangul 157
 
7.1%
Punctuation 24
 
1.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 333
16.4%
217
 
10.7%
) 174
 
8.5%
( 174
 
8.5%
0 110
 
5.4%
1 73
 
3.6%
I 69
 
3.4%
, 67
 
3.3%
' 54
 
2.7%
W 50
 
2.5%
Other values (39) 715
35.1%
Punctuation
ValueCountFrequency (%)
19
79.2%
5
 
20.8%
Hangul
ValueCountFrequency (%)
10
 
6.4%
10
 
6.4%
10
 
6.4%
9
 
5.7%
9
 
5.7%
8
 
5.1%
8
 
5.1%
5
 
3.2%
5
 
3.2%
4
 
2.5%
Other values (50) 79
50.3%

면적
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct197
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean679697.84
Minimum0.44
Maximum17090000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2024-04-21T10:43:44.143448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.44
5-th percentile290.6
Q120770
median113370
Q3527968
95-th percentile2352234.6
Maximum17090000
Range17090000
Interquartile range (IQR)507198

Descriptive statistics

Standard deviation1907099.4
Coefficient of variation (CV)2.8058047
Kurtosis36.572186
Mean679697.84
Median Absolute Deviation (MAD)112914.7
Skewness5.6100353
Sum1.3390047 × 108
Variance3.6370282 × 1012
MonotonicityNot monotonic
2024-04-21T10:43:44.279113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
238537.0 1
 
0.5%
527968.0 1
 
0.5%
83879.0 1
 
0.5%
112492.0 1
 
0.5%
89342.0 1
 
0.5%
241038.0 1
 
0.5%
176000.0 1
 
0.5%
447400.0 1
 
0.5%
603500.0 1
 
0.5%
441839.0 1
 
0.5%
Other values (187) 187
94.9%
ValueCountFrequency (%)
0.44 1
0.5%
2.02 1
0.5%
21.0 1
0.5%
25.9 1
0.5%
60.5 1
0.5%
160.0 1
0.5%
182.0 1
0.5%
240.0 1
0.5%
259.0 1
0.5%
261.0 1
0.5%
ValueCountFrequency (%)
17090000.0 1
0.5%
9970000.0 1
0.5%
9830000.0 1
0.5%
9600000.0 1
0.5%
8510000.0 1
0.5%
7690000.0 1
0.5%
3287782.0 1
0.5%
2790000.0 1
0.5%
2724900.0 1
0.5%
2381741.0 1
0.5%
Distinct150
Distinct (%)76.1%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-04-21T10:43:44.616474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length47
Mean length13.670051
Min length6

Characters and Unicode

Total characters2693
Distinct characters174
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique125 ?
Unique (%)63.5%

Sample

1st row(한반도 1.1배)
2nd row(한반도의 1.2배)
3rd row(NULL)
4th row(한반도의 1/20)
5th row(한반도의 1/2)
ValueCountFrequency (%)
한반도의 130
23.0%
39
 
6.9%
한반도 14
 
2.5%
null 11
 
1.9%
세계 11
 
1.9%
구성 11
 
1.9%
1/4 10
 
1.8%
1/3 9
 
1.6%
도서로 7
 
1.2%
1/2 7
 
1.2%
Other values (207) 316
55.9%
2024-04-21T10:43:45.077538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
375
 
13.9%
( 201
 
7.5%
) 201
 
7.5%
173
 
6.4%
163
 
6.1%
158
 
5.9%
147
 
5.5%
1 133
 
4.9%
100
 
3.7%
/ 74
 
2.7%
Other values (164) 968
35.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1169
43.4%
Decimal Number 446
 
16.6%
Space Separator 375
 
13.9%
Open Punctuation 201
 
7.5%
Close Punctuation 201
 
7.5%
Other Punctuation 171
 
6.3%
Uppercase Letter 69
 
2.6%
Lowercase Letter 57
 
2.1%
Other Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
173
14.8%
163
13.9%
158
13.5%
147
12.6%
100
 
8.6%
45
 
3.8%
18
 
1.5%
16
 
1.4%
16
 
1.4%
15
 
1.3%
Other values (114) 318
27.2%
Lowercase Letter
ValueCountFrequency (%)
a 13
22.8%
o 7
12.3%
n 6
10.5%
i 5
 
8.8%
r 4
 
7.0%
u 4
 
7.0%
g 3
 
5.3%
e 3
 
5.3%
d 2
 
3.5%
t 2
 
3.5%
Other values (7) 8
14.0%
Uppercase Letter
ValueCountFrequency (%)
L 22
31.9%
U 12
17.4%
N 11
15.9%
A 7
 
10.1%
C 6
 
8.7%
I 5
 
7.2%
M 2
 
2.9%
R 1
 
1.4%
G 1
 
1.4%
B 1
 
1.4%
Decimal Number
ValueCountFrequency (%)
1 133
29.8%
2 64
14.3%
3 58
13.0%
5 47
 
10.5%
0 36
 
8.1%
4 32
 
7.2%
7 25
 
5.6%
6 21
 
4.7%
8 18
 
4.0%
9 12
 
2.7%
Other Punctuation
ValueCountFrequency (%)
/ 74
43.3%
. 61
35.7%
, 21
 
12.3%
% 6
 
3.5%
4
 
2.3%
: 3
 
1.8%
* 2
 
1.2%
Other Symbol
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Space Separator
ValueCountFrequency (%)
375
100.0%
Open Punctuation
ValueCountFrequency (%)
( 201
100.0%
Close Punctuation
ValueCountFrequency (%)
) 201
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1398
51.9%
Hangul 1163
43.2%
Latin 126
 
4.7%
Han 6
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
173
14.9%
163
14.0%
158
13.6%
147
12.6%
100
 
8.6%
45
 
3.9%
18
 
1.5%
16
 
1.4%
16
 
1.4%
15
 
1.3%
Other values (108) 312
26.8%
Latin
ValueCountFrequency (%)
L 22
17.5%
a 13
10.3%
U 12
 
9.5%
N 11
 
8.7%
A 7
 
5.6%
o 7
 
5.6%
C 6
 
4.8%
n 6
 
4.8%
I 5
 
4.0%
i 5
 
4.0%
Other values (18) 32
25.4%
Common
ValueCountFrequency (%)
375
26.8%
( 201
14.4%
) 201
14.4%
1 133
 
9.5%
/ 74
 
5.3%
2 64
 
4.6%
. 61
 
4.4%
3 58
 
4.1%
5 47
 
3.4%
0 36
 
2.6%
Other values (12) 148
 
10.6%
Han
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1516
56.3%
Hangul 1163
43.2%
CJK 6
 
0.2%
Punctuation 4
 
0.1%
CJK Compat 4
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
375
24.7%
( 201
13.3%
) 201
13.3%
1 133
 
8.8%
/ 74
 
4.9%
2 64
 
4.2%
. 61
 
4.0%
3 58
 
3.8%
5 47
 
3.1%
0 36
 
2.4%
Other values (37) 266
17.5%
Hangul
ValueCountFrequency (%)
173
14.9%
163
14.0%
158
13.6%
147
12.6%
100
 
8.6%
45
 
3.9%
18
 
1.5%
16
 
1.4%
16
 
1.4%
15
 
1.3%
Other values (108) 312
26.8%
Punctuation
ValueCountFrequency (%)
4
100.0%
CJK Compat
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
CJK
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

언어
Text

Distinct153
Distinct (%)77.7%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-04-21T10:43:45.297104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length70
Median length42
Mean length15.964467
Min length2

Characters and Unicode

Total characters3145
Distinct characters252
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique145 ?
Unique (%)73.6%

Sample

1st row영어(공용어), 아산테어, 에웨어 등
2nd row불어(공용어), Fang어
3rd row영어(공용어), Creole(현지 토속어)
4th row영어(공용어), Wolof어
5th row스페인어(공용어), 23개 공인 원주민어
ValueCountFrequency (%)
영어 43
 
8.2%
null 27
 
5.2%
영어(공용어 17
 
3.3%
스페인어 14
 
2.7%
아랍어 13
 
2.5%
통용 12
 
2.3%
불어 12
 
2.3%
불어(공용어 11
 
2.1%
11
 
2.1%
토착어 10
 
1.9%
Other values (267) 352
67.4%
2024-04-21T10:43:45.673197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
470
 
14.9%
329
 
10.5%
) 190
 
6.0%
( 190
 
6.0%
, 181
 
5.8%
108
 
3.4%
94
 
3.0%
82
 
2.6%
76
 
2.4%
55
 
1.7%
Other values (242) 1370
43.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1831
58.2%
Space Separator 329
 
10.5%
Other Punctuation 232
 
7.4%
Close Punctuation 190
 
6.0%
Open Punctuation 190
 
6.0%
Uppercase Letter 137
 
4.4%
Lowercase Letter 136
 
4.3%
Decimal Number 92
 
2.9%
Dash Punctuation 6
 
0.2%
Initial Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
470
25.7%
108
 
5.9%
94
 
5.1%
82
 
4.5%
76
 
4.2%
55
 
3.0%
30
 
1.6%
28
 
1.5%
27
 
1.5%
26
 
1.4%
Other values (182) 835
45.6%
Lowercase Letter
ValueCountFrequency (%)
a 19
14.0%
o 19
14.0%
i 14
10.3%
e 11
8.1%
l 9
 
6.6%
n 9
 
6.6%
h 8
 
5.9%
r 8
 
5.9%
g 7
 
5.1%
t 6
 
4.4%
Other values (11) 26
19.1%
Uppercase Letter
ValueCountFrequency (%)
L 55
40.1%
N 27
19.7%
U 27
19.7%
C 5
 
3.6%
S 4
 
2.9%
P 4
 
2.9%
K 3
 
2.2%
M 2
 
1.5%
W 2
 
1.5%
F 2
 
1.5%
Other values (5) 6
 
4.4%
Decimal Number
ValueCountFrequency (%)
1 17
18.5%
0 13
14.1%
2 12
13.0%
3 12
13.0%
5 11
12.0%
8 11
12.0%
9 5
 
5.4%
6 5
 
5.4%
4 3
 
3.3%
7 3
 
3.3%
Other Punctuation
ValueCountFrequency (%)
, 181
78.0%
% 34
 
14.7%
. 6
 
2.6%
: 4
 
1.7%
· 3
 
1.3%
? 2
 
0.9%
* 1
 
0.4%
1
 
0.4%
Space Separator
ValueCountFrequency (%)
329
100.0%
Close Punctuation
ValueCountFrequency (%)
) 190
100.0%
Open Punctuation
ValueCountFrequency (%)
( 190
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1830
58.2%
Common 1041
33.1%
Latin 273
 
8.7%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
470
25.7%
108
 
5.9%
94
 
5.1%
82
 
4.5%
76
 
4.2%
55
 
3.0%
30
 
1.6%
28
 
1.5%
27
 
1.5%
26
 
1.4%
Other values (181) 834
45.6%
Latin
ValueCountFrequency (%)
L 55
20.1%
N 27
 
9.9%
U 27
 
9.9%
a 19
 
7.0%
o 19
 
7.0%
i 14
 
5.1%
e 11
 
4.0%
l 9
 
3.3%
n 9
 
3.3%
h 8
 
2.9%
Other values (26) 75
27.5%
Common
ValueCountFrequency (%)
329
31.6%
) 190
18.3%
( 190
18.3%
, 181
17.4%
% 34
 
3.3%
1 17
 
1.6%
0 13
 
1.2%
2 12
 
1.2%
3 12
 
1.2%
5 11
 
1.1%
Other values (14) 52
 
5.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1830
58.2%
ASCII 1308
41.6%
None 3
 
0.1%
Punctuation 3
 
0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
470
25.7%
108
 
5.9%
94
 
5.1%
82
 
4.5%
76
 
4.2%
55
 
3.0%
30
 
1.6%
28
 
1.5%
27
 
1.5%
26
 
1.4%
Other values (181) 834
45.6%
ASCII
ValueCountFrequency (%)
329
25.2%
) 190
14.5%
( 190
14.5%
, 181
13.8%
L 55
 
4.2%
% 34
 
2.6%
N 27
 
2.1%
U 27
 
2.1%
a 19
 
1.5%
o 19
 
1.5%
Other values (46) 237
18.1%
None
ValueCountFrequency (%)
· 3
100.0%
Punctuation
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
CJK
ValueCountFrequency (%)
1
100.0%

종교
Text

Distinct191
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-04-21T10:43:45.907865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length45
Mean length28.187817
Min length6

Characters and Unicode

Total characters5553
Distinct characters183
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique189 ?
Unique (%)95.9%

Sample

1st row기독교(71%), 이슬람교(17.6%)
2nd row기독교(가톨릭 포함) 85%, 회교 9.8%, 토착종교
3rd row기독교 57%, 힌두교 33%, 회교 9%, 기타 1%
4th row회교(90%), 기독교(9%) 등
5th row가톨릭(41%), 개신교(38.8%), 기타(2.7%)
ValueCountFrequency (%)
38
 
5.0%
기타 25
 
3.3%
기독교 18
 
2.4%
가톨릭 15
 
2.0%
이슬람교 12
 
1.6%
개신교 10
 
1.3%
이슬람교(수니파 10
 
1.3%
시아파 9
 
1.2%
힌두교 8
 
1.0%
7
 
0.9%
Other values (499) 610
80.1%
2024-04-21T10:43:46.293095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
570
 
10.3%
% 481
 
8.7%
) 428
 
7.7%
( 428
 
7.7%
, 371
 
6.7%
360
 
6.5%
. 163
 
2.9%
1 140
 
2.5%
134
 
2.4%
2 124
 
2.2%
Other values (173) 2354
42.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2048
36.9%
Other Punctuation 1022
18.4%
Decimal Number 988
17.8%
Space Separator 570
 
10.3%
Close Punctuation 428
 
7.7%
Open Punctuation 428
 
7.7%
Lowercase Letter 35
 
0.6%
Uppercase Letter 26
 
0.5%
Dash Punctuation 7
 
0.1%
Final Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
360
17.6%
134
 
6.5%
106
 
5.2%
98
 
4.8%
98
 
4.8%
96
 
4.7%
95
 
4.6%
82
 
4.0%
75
 
3.7%
68
 
3.3%
Other values (129) 836
40.8%
Lowercase Letter
ValueCountFrequency (%)
n 5
14.3%
h 4
11.4%
i 4
11.4%
o 3
8.6%
a 3
8.6%
m 2
 
5.7%
c 2
 
5.7%
s 2
 
5.7%
t 2
 
5.7%
e 2
 
5.7%
Other values (4) 6
17.1%
Decimal Number
ValueCountFrequency (%)
1 140
14.2%
2 124
12.6%
5 119
12.0%
3 103
10.4%
7 93
9.4%
8 89
9.0%
9 89
9.0%
0 82
8.3%
4 78
7.9%
6 71
7.2%
Other Punctuation
ValueCountFrequency (%)
% 481
47.1%
, 371
36.3%
. 163
 
15.9%
: 2
 
0.2%
2
 
0.2%
* 1
 
0.1%
/ 1
 
0.1%
· 1
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
L 10
38.5%
U 5
19.2%
N 5
19.2%
C 2
 
7.7%
S 2
 
7.7%
R 1
 
3.8%
W 1
 
3.8%
Space Separator
ValueCountFrequency (%)
570
100.0%
Close Punctuation
ValueCountFrequency (%)
) 428
100.0%
Open Punctuation
ValueCountFrequency (%)
( 428
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3444
62.0%
Hangul 2045
36.8%
Latin 61
 
1.1%
Han 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
360
17.6%
134
 
6.6%
106
 
5.2%
98
 
4.8%
98
 
4.8%
96
 
4.7%
95
 
4.6%
82
 
4.0%
75
 
3.7%
68
 
3.3%
Other values (126) 833
40.7%
Common
ValueCountFrequency (%)
570
16.6%
% 481
14.0%
) 428
12.4%
( 428
12.4%
, 371
10.8%
. 163
 
4.7%
1 140
 
4.1%
2 124
 
3.6%
5 119
 
3.5%
3 103
 
3.0%
Other values (13) 517
15.0%
Latin
ValueCountFrequency (%)
L 10
16.4%
n 5
 
8.2%
U 5
 
8.2%
N 5
 
8.2%
h 4
 
6.6%
i 4
 
6.6%
o 3
 
4.9%
a 3
 
4.9%
m 2
 
3.3%
c 2
 
3.3%
Other values (11) 18
29.5%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3501
63.0%
Hangul 2045
36.8%
Punctuation 3
 
0.1%
CJK 3
 
0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
570
16.3%
% 481
13.7%
) 428
12.2%
( 428
12.2%
, 371
10.6%
. 163
 
4.7%
1 140
 
4.0%
2 124
 
3.5%
5 119
 
3.4%
3 103
 
2.9%
Other values (31) 574
16.4%
Hangul
ValueCountFrequency (%)
360
17.6%
134
 
6.6%
106
 
5.2%
98
 
4.8%
98
 
4.8%
96
 
4.7%
95
 
4.6%
82
 
4.0%
75
 
3.7%
68
 
3.3%
Other values (126) 833
40.7%
Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
None
ValueCountFrequency (%)
· 1
100.0%

민족
Text

Distinct156
Distinct (%)79.2%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-04-21T10:43:46.521985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length119
Median length61
Mean length29.060914
Min length3

Characters and Unicode

Total characters5725
Distinct characters296
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique155 ?
Unique (%)78.7%

Sample

1st row아칸족(48%), 몰다그바니족(17%), 에웨족(14%), 가아단베족(7%), 구르마족(6%), 구안족(4%)
2nd rowFang, Echira, Adouma 등 40여 종족
3rd row동인도계 39.8%, 흑인 29.2%, 혼혈 19.9%, 아메리카 인디안 10.5%
4th rowMandinka, Peul, Wolof 등
5th row메스티소 56%, 마야인 41.7%, 흑인?흑인계 혼혈 0.2%, Garifuna인 0.1%, 외국인 0.2%, Xinca 원주민 1.8%
ValueCountFrequency (%)
56
 
7.0%
null 42
 
5.3%
기타 20
 
2.5%
12
 
1.5%
혼혈 8
 
1.0%
유럽계 5
 
0.6%
2 5
 
0.6%
종족 4
 
0.5%
인도계 4
 
0.5%
백인 4
 
0.5%
Other values (581) 639
80.0%
2024-04-21T10:43:47.130028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
608
 
10.6%
% 416
 
7.3%
( 381
 
6.7%
) 381
 
6.7%
, 368
 
6.4%
190
 
3.3%
. 158
 
2.8%
1 143
 
2.5%
114
 
2.0%
2 112
 
2.0%
Other values (286) 2854
49.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1859
32.5%
Other Punctuation 953
16.6%
Decimal Number 854
14.9%
Space Separator 608
 
10.6%
Lowercase Letter 423
 
7.4%
Open Punctuation 381
 
6.7%
Close Punctuation 381
 
6.7%
Uppercase Letter 259
 
4.5%
Dash Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
190
 
10.2%
114
 
6.1%
100
 
5.4%
73
 
3.9%
58
 
3.1%
54
 
2.9%
49
 
2.6%
40
 
2.2%
38
 
2.0%
37
 
2.0%
Other values (218) 1106
59.5%
Lowercase Letter
ValueCountFrequency (%)
a 77
18.2%
e 49
11.6%
n 40
9.5%
o 33
7.8%
i 31
 
7.3%
u 29
 
6.9%
s 24
 
5.7%
l 24
 
5.7%
r 24
 
5.7%
h 12
 
2.8%
Other values (13) 80
18.9%
Uppercase Letter
ValueCountFrequency (%)
L 87
33.6%
N 43
16.6%
U 42
16.2%
B 12
 
4.6%
S 10
 
3.9%
M 9
 
3.5%
C 8
 
3.1%
K 8
 
3.1%
P 6
 
2.3%
A 5
 
1.9%
Other values (12) 29
 
11.2%
Decimal Number
ValueCountFrequency (%)
1 143
16.7%
2 112
13.1%
6 82
9.6%
5 82
9.6%
4 80
9.4%
3 77
9.0%
8 76
8.9%
7 70
8.2%
9 69
8.1%
0 63
7.4%
Other Punctuation
ValueCountFrequency (%)
% 416
43.7%
, 368
38.6%
. 158
 
16.6%
? 3
 
0.3%
: 2
 
0.2%
/ 2
 
0.2%
· 2
 
0.2%
' 1
 
0.1%
* 1
 
0.1%
Space Separator
ValueCountFrequency (%)
608
100.0%
Open Punctuation
ValueCountFrequency (%)
( 381
100.0%
Close Punctuation
ValueCountFrequency (%)
) 381
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3184
55.6%
Hangul 1857
32.4%
Latin 682
 
11.9%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
190
 
10.2%
114
 
6.1%
100
 
5.4%
73
 
3.9%
58
 
3.1%
54
 
2.9%
49
 
2.6%
40
 
2.2%
38
 
2.0%
37
 
2.0%
Other values (216) 1104
59.5%
Latin
ValueCountFrequency (%)
L 87
 
12.8%
a 77
 
11.3%
e 49
 
7.2%
N 43
 
6.3%
U 42
 
6.2%
n 40
 
5.9%
o 33
 
4.8%
i 31
 
4.5%
u 29
 
4.3%
s 24
 
3.5%
Other values (35) 227
33.3%
Common
ValueCountFrequency (%)
608
19.1%
% 416
13.1%
( 381
12.0%
) 381
12.0%
, 368
11.6%
. 158
 
5.0%
1 143
 
4.5%
2 112
 
3.5%
6 82
 
2.6%
5 82
 
2.6%
Other values (13) 453
14.2%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3864
67.5%
Hangul 1857
32.4%
None 2
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
608
15.7%
% 416
 
10.8%
( 381
 
9.9%
) 381
 
9.9%
, 368
 
9.5%
. 158
 
4.1%
1 143
 
3.7%
2 112
 
2.9%
L 87
 
2.3%
6 82
 
2.1%
Other values (57) 1128
29.2%
Hangul
ValueCountFrequency (%)
190
 
10.2%
114
 
6.1%
100
 
5.4%
73
 
3.9%
58
 
3.1%
54
 
2.9%
49
 
2.6%
40
 
2.2%
38
 
2.0%
37
 
2.0%
Other values (216) 1104
59.5%
None
ValueCountFrequency (%)
· 2
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

기후
Categorical

IMBALANCE 

Distinct44
Distinct (%)22.3%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
(NULL)
150 
열대성 기후
 
3
고온다습한 열대성 기후
 
3
고온 다습(5-10월 35-45도, 11월-4월 15-35도)
 
1
고온건조
 
1
Other values (39)
39 

Length

Max length52
Median length6
Mean length8.5228426
Min length4

Unique

Unique41 ?
Unique (%)20.8%

Sample

1st row(NULL)
2nd row(NULL)
3rd row(NULL)
4th row(NULL)
5th row(NULL)

Common Values

ValueCountFrequency (%)
(NULL) 150
76.1%
열대성 기후 3
 
1.5%
고온다습한 열대성 기후 3
 
1.5%
고온 다습(5-10월 35-45도, 11월-4월 15-35도) 1
 
0.5%
고온건조 1
 
0.5%
아열대, 사막성 건조기후 1
 
0.5%
대륙성, 아열대성(남부) 1
 
0.5%
열대우림, 사바나 1
 
0.5%
아열대 기후 1
 
0.5%
아열대성 해양기후(여름 33℃,겨울 13℃ 평균) 1
 
0.5%
Other values (34) 34
 
17.3%

Length

2024-04-21T10:43:47.266071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
null 150
46.4%
기후 16
 
5.0%
열대성 9
 
2.8%
대륙성 8
 
2.5%
고온다습 7
 
2.2%
건조 5
 
1.5%
열대몬순 5
 
1.5%
아열대 5
 
1.5%
고온다습한 4
 
1.2%
연평균 4
 
1.2%
Other values (97) 110
34.1%

건국
Text

Distinct75
Distinct (%)38.1%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-04-21T10:43:47.531106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length6
Mean length12.766497
Min length6

Characters and Unicode

Total characters2515
Distinct characters148
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)37.6%

Sample

1st row1957.3.6.(영국에서 독립)
2nd row1960.8.17. 프랑스로부터 독립
3rd row(NULL)
4th row1965.2.18(영국에서 독립)
5th row(NULL)
ValueCountFrequency (%)
null 123
33.6%
독립 58
15.8%
31
 
8.5%
독립일 5
 
1.4%
국경일 3
 
0.8%
프랑스로부터 2
 
0.5%
영국으로부터 2
 
0.5%
독립(국경일 2
 
0.5%
최초 2
 
0.5%
주권회복 2
 
0.5%
Other values (134) 136
37.2%
2024-04-21T10:43:47.939188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
L 246
 
9.8%
. 196
 
7.8%
( 192
 
7.6%
) 192
 
7.6%
171
 
6.8%
1 146
 
5.8%
U 124
 
4.9%
N 124
 
4.9%
9 95
 
3.8%
73
 
2.9%
Other values (138) 956
38.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 665
26.4%
Decimal Number 554
22.0%
Uppercase Letter 499
19.8%
Other Punctuation 221
 
8.8%
Open Punctuation 192
 
7.6%
Close Punctuation 192
 
7.6%
Space Separator 171
 
6.8%
Dash Punctuation 16
 
0.6%
Lowercase Letter 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
73
 
11.0%
70
 
10.5%
39
 
5.9%
39
 
5.9%
36
 
5.4%
35
 
5.3%
25
 
3.8%
24
 
3.6%
22
 
3.3%
21
 
3.2%
Other values (106) 281
42.3%
Decimal Number
ValueCountFrequency (%)
1 146
26.4%
9 95
17.1%
0 66
11.9%
2 58
 
10.5%
6 56
 
10.1%
7 38
 
6.9%
8 29
 
5.2%
3 22
 
4.0%
4 22
 
4.0%
5 22
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
L 246
49.3%
U 124
24.8%
N 124
24.8%
A 2
 
0.4%
G 1
 
0.2%
V 1
 
0.2%
S 1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
. 196
88.7%
: 15
 
6.8%
, 7
 
3.2%
' 1
 
0.5%
1
 
0.5%
* 1
 
0.5%
Lowercase Letter
ValueCountFrequency (%)
u 1
20.0%
s 1
20.0%
t 1
20.0%
a 1
20.0%
v 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 192
100.0%
Close Punctuation
ValueCountFrequency (%)
) 192
100.0%
Space Separator
ValueCountFrequency (%)
171
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1346
53.5%
Hangul 664
26.4%
Latin 504
 
20.0%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
73
 
11.0%
70
 
10.5%
39
 
5.9%
39
 
5.9%
36
 
5.4%
35
 
5.3%
25
 
3.8%
24
 
3.6%
22
 
3.3%
21
 
3.2%
Other values (105) 280
42.2%
Common
ValueCountFrequency (%)
. 196
14.6%
( 192
14.3%
) 192
14.3%
171
12.7%
1 146
10.8%
9 95
7.1%
0 66
 
4.9%
2 58
 
4.3%
6 56
 
4.2%
7 38
 
2.8%
Other values (10) 136
10.1%
Latin
ValueCountFrequency (%)
L 246
48.8%
U 124
24.6%
N 124
24.6%
A 2
 
0.4%
G 1
 
0.2%
u 1
 
0.2%
s 1
 
0.2%
t 1
 
0.2%
a 1
 
0.2%
v 1
 
0.2%
Other values (2) 2
 
0.4%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1849
73.5%
Hangul 664
 
26.4%
CJK 1
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
L 246
13.3%
. 196
10.6%
( 192
10.4%
) 192
10.4%
171
9.2%
1 146
7.9%
U 124
6.7%
N 124
6.7%
9 95
 
5.1%
0 66
 
3.6%
Other values (21) 297
16.1%
Hangul
ValueCountFrequency (%)
73
 
11.0%
70
 
10.5%
39
 
5.9%
39
 
5.9%
36
 
5.4%
35
 
5.3%
25
 
3.8%
24
 
3.6%
22
 
3.3%
21
 
3.2%
Other values (105) 280
42.2%
CJK
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

Interactions

2024-04-21T10:43:39.808739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:43:39.560749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:43:39.907480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T10:43:39.706709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T10:43:48.033482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인구인구설명면적기후건국
인구1.0000.9840.5730.0000.000
인구설명0.9841.0000.8820.9420.967
면적0.5730.8821.0000.0000.000
기후0.0000.9420.0001.0000.950
건국0.0000.9670.0000.9501.000
2024-04-21T10:43:48.131882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인구면적기후
인구1.0000.8260.000
면적0.8261.0000.000
기후0.0000.0001.000

Missing values

2024-04-21T10:43:40.051140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T10:43:40.204307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

한글국가명영문국가명국가코드(ISO 2자리)수도인구인구설명면적면적설명언어종교민족기후건국
0가나GhanaGH아크라(Accra)32400000('22, EIU)238537.0(한반도 1.1배)영어(공용어), 아산테어, 에웨어 등기독교(71%), 이슬람교(17.6%)아칸족(48%), 몰다그바니족(17%), 에웨족(14%), 가아단베족(7%), 구르마족(6%), 구안족(4%)(NULL)1957.3.6.(영국에서 독립)
1가봉GabonGA리브르빌(Libreville)2340000(‘21 World Bank)267000.0(한반도의 1.2배)불어(공용어), Fang어기독교(가톨릭 포함) 85%, 회교 9.8%, 토착종교Fang, Echira, Adouma 등 40여 종족(NULL)1960.8.17. 프랑스로부터 독립
2가이아나GuyanaGY조지타운(Georgetown)750000(2019 CIA)214969.0(NULL)영어(공용어), Creole(현지 토속어)기독교 57%, 힌두교 33%, 회교 9%, 기타 1%동인도계 39.8%, 흑인 29.2%, 혼혈 19.9%, 아메리카 인디안 10.5%(NULL)(NULL)
3감비아GambiaGM반줄(Banjul)2710000('22 World Bank)11295.0(한반도의 1/20)영어(공용어), Wolof어회교(90%), 기독교(9%) 등Mandinka, Peul, Wolof 등(NULL)1965.2.18(영국에서 독립)
4과테말라GuatemalaGT과테말라시티(Guatemala City)18640000(2022, IMF)108889.0(한반도의 1/2)스페인어(공용어), 23개 공인 원주민어가톨릭(41%), 개신교(38.8%), 기타(2.7%)메스티소 56%, 마야인 41.7%, 흑인?흑인계 혼혈 0.2%, Garifuna인 0.1%, 외국인 0.2%, Xinca 원주민 1.8%(NULL)(NULL)
5교황청VaticanVAVatican(로마 북서부위치)1000(NULL)0.44(성베드로 대성당 성베드로 광장 교황거처 및 사무실 등 : 서울 창경궁 정도의 면적)라틴어(공식언어), 이탈리아어, 불어, 영어가톨릭(Roman Catholic)(NULL)(NULL)(NULL)
6그레나다GrenadaGD세인트 조지스(St. George's)110000(NULL)334.0(한반도의 1/643)영어, 파투아(프랑스 크레올)가톨릭(53%), 성공회(13.8%), 개신교(33.2%)흑인(82%), 유색혼혈 (13%), 남아시아계 및 유럽계(5%)(NULL)(NULL)
7그리스GreeceGR아테네(Athens, 약370만명)10390000(NULL)131957.0(한반도의 2/3 한국의 1.3배) (본토81% 도서19%)그리스어그리스정교(90%), 기독교(3%), 이슬람교(1.3%) 등(NULL)(NULL)(NULL)
8기니GuineaGN코나크리(Conakry)13860000('22 World Bank)246000.0(한반도 크기)불어(공용어)회교(80%), 기독교(15%), 토착종교(5%)Peul(40%), Malink?(30%), Soussou(20%)(NULL)1958.10.2.(프랑스에서 독립)
9기니비사우Guinea-BissauGW비사우(Bissau)2110000(‘22 World Bank)36125.0(한반도의 1/7배)포르투갈어(공용어)토속신앙(65%), 회교(30%), 기독교(5%)Balantes, Fulas, Manjaca 등(NULL)1973.9.24(포르투갈로부터 독립)
한글국가명영문국가명국가코드(ISO 2자리)수도인구인구설명면적면적설명언어종교민족기후건국
187팔레스타인PalestinePS라말라(Ramallah: 임시 수도)5100000('20)6020.0(NULL)아랍어, 영어이슬람교(98%), 기독교(1.37%)(NULL)(NULL)(NULL)
188페루PeruPE리마(Lima, 1,000만명)33400000-20221280000.0(한반도의 약 6배)스페인어, 께추아어, 아이마라어(NULL)(NULL)(NULL)(NULL)
189포르투갈PortugalPT리스본(시내 57만명, 수도권내 287만명)10340000(2021년)92225.61(한반도의 약 2/5)포르투갈어(로망스어계)카톨릭(전체 인구의 80%이상이나 국교는 아님)이베리아족, 켈트족, 라틴족, 게르만족, 무어족 등의 혼혈대서양, 지중해 및 대륙성 혼합, 건기(5-10월)/우기(11-4월)(NULL)
190폴란드PolandPL바르샤바(171만명)38560000(2022년)312685.0(한반도의 1.4배)(NULL)가톨릭(87%), 정교회, 개신교 등폴란드인(96.9%), 독일인, 벨라루스인 등(NULL)(NULL)
191프랑스FranceFR파리(Paris)67810000-2022.1675417.0(속령 포함 / 한반도의 3.1배)프랑스어가톨릭, 신교, 유대교, 이슬람교골족, 프랑크족 등(NULL)(NULL)
192피지FijiFJ수바(Suva)903000(2021. World Bank)18333.0(세계 151위, 경상북도 크기 330여개의 도서로 구성)영어(공식어), 피지어, 힌두어, 로투만어기독교 64%, 힌두교 28%, 이슬람교 6%, 기타 2%피지 원주민 56.8%, 인도계 37.5%, 기타 5.7% 등(NULL)(NULL)
193핀란드FinlandFI헬싱키(Helsinki)5590000(2021.7월 / CIA 추정)338145.0(한반도의 약 1.5배)핀란드어(87.6%), 스웨덴어(5.2%)루터교(69.8%), 그리스정교(1.1%)핀란드인, 스웨덴인, 사미족 등(NULL)(NULL)
194필리핀PhilippinesPH마닐라(Manila, 약 184만 명)112890000(2023, IMF)300000.0(한반도의 1.3배)영어 및 타갈로그어천주교(79%), 개신교(7%), 이슬람교(6%) 등말레이계가 주인종이며 미국·스페인계 등 혼혈 다수고온다습한 아열대성 기후(NULL)
195헝가리HungaryHU부다페스트(170만명)9770000(NULL)93030.0(한반도의 2/5)(NULL)카톨릭(37.2%), 개신교(13.8%), 그리스정교(1.8%) 등마자르인(85.6%), 루마니아(3.2%), 독일(1.9%), 기타(2.6%)(NULL)(NULL)
196호주AustraliaAU캔버라(Canberra)25980000(2022. 기준)7690000.0(한반도의 35배, 세계 6위)영어기독교 44%, 무교 39%, 기타(불교, 이슬람교 등) 17%앵글로색슨 80%, 이시안, 원주민 및 기타 약 20%(NULL)(NULL)