Overview

Dataset statistics

Number of variables: 3
Number of observations: 192
Missing cells: 1
Missing cells (%): 0.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.8 KiB
Average record size in memory: 25.7 B
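
The percentage and per-record figures above follow from the raw counts. A minimal sketch, assuming the missing-cell percentage is the number of missing cells divided by (variables × observations) and the average record size is total memory divided by observations; the function names are illustrative and not part of ydata-profiling.

```python
def missing_cell_pct(n_missing: int, n_vars: int, n_obs: int) -> float:
    """Share of missing cells among all cells, as a percentage."""
    return 100.0 * n_missing / (n_vars * n_obs)


def avg_record_size_bytes(total_kib: float, n_obs: int) -> float:
    """Average bytes per row, given the total size in KiB."""
    return total_kib * 1024 / n_obs


# Counts taken from the dataset statistics above: 1 missing cell out of 3 * 192.
pct = missing_cell_pct(n_missing=1, n_vars=3, n_obs=192)
print(f"{pct:.1f}%")

# 4.8 KiB is already rounded, so this only approximates the reported 25.7 B.
avg = avg_record_size_bytes(total_kib=4.8, n_obs=192)
print(f"{avg:.1f} B")
```

The small gap between the recomputed record size and the reported 25.7 B is expected, since the total size is displayed with one decimal place.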

Variable types

Numeric: 1
Text: 2

Dataset

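Description: Contains the country names and country codes associated with the oral history materials of the 현대한국구술자료관 (Archive of Modern Korean Oral History).
Author: 한국학중앙연구원 (Academy of Korean Studies)
URL: https://www.data.go.kr/data/15049075/fileData.do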

Alerts

번호 (number) has unique values [Unique]
국가명 (country name) has unique values [Unique]
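
A "unique" alert means that no value in the column repeats. A minimal sketch of that check, assuming it reduces to comparing the number of distinct values with the number of values; `has_unique_values` is a hypothetical helper, and the numbers are the first ten 번호 values from this report's sample rows.

```python
def has_unique_values(values) -> bool:
    """Return True when no value occurs more than once."""
    values = list(values)
    return len(set(values)) == len(values)


# First ten 번호 values from the report's sample rows.
numbers = [184, 188, 191, 192, 196, 203, 204, 208, 212, 214]
print(has_unique_values(numbers))

# A repeated value would not trigger the alert.
print(has_unique_values(["도미니카", "도미니카"]))
```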

Reproduction

Analysis started: 2023-12-12 08:54:03.327616
Analysis finished: 2023-12-12 08:54:03.786912
Duration: 0.46 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호 (number)
Real number (ℝ)

Unique

Distinct: 192
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 522.23958
Minimum: 184
Maximum: 894
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 184
5th percentile: 216.2
Q1: 347
Median: 518
Q3: 691
95th percentile: 836.7
Maximum: 894
Range: 710
Interquartile range (IQR): 344

Descriptive statistics

Standard deviation: 202.75897
Coefficient of variation (CV): 0.38824895
Kurtosis: -1.1811906
Mean: 522.23958
Median Absolute Deviation (MAD): 173
Skewness: 0.040779166
Sum: 100270
Variance: 41111.199
Monotonicity: Strictly increasing
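
Several of the figures above are derived from one another, which allows a quick consistency check. A minimal sketch, assuming the usual definitions (mean = sum / n, CV = standard deviation / mean, variance = standard deviation squared, range = maximum - minimum, IQR = Q3 - Q1); the inputs are the values reported for 번호, and `is_strictly_increasing` is a hypothetical helper.

```python
# Values reported for 번호 in this report.
n = 192
total = 100270
std = 202.75897
minimum, q1, q3, maximum = 184, 347, 691, 894

mean = total / n              # reported: 522.23958
cv = std / mean               # reported: 0.38824895
variance = std ** 2           # reported: 41111.199
value_range = maximum - minimum
iqr = q3 - q1

print(round(mean, 5), round(cv, 6), round(variance, 1))
print(value_range, iqr)


def is_strictly_increasing(values) -> bool:
    """True when every value is greater than the one before it."""
    values = list(values)
    return all(a < b for a, b in zip(values, values[1:]))


# First ten 번호 values from the report's sample rows.
print(is_strictly_increasing([184, 188, 191, 192, 196, 203, 204, 208, 212, 214]))
```

A strictly increasing column is necessarily unique, which agrees with the alert raised for 번호.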
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
184 | 1 | 0.5%
524 | 1 | 0.5%
620 | 1 | 0.5%
624 | 1 | 0.5%
626 | 1 | 0.5%
630 | 1 | 0.5%
634 | 1 | 0.5%
638 | 1 | 0.5%
642 | 1 | 0.5%
643 | 1 | 0.5%
Other values (182) | 182 | 94.8%

Minimum 10 values

Value | Count | Frequency (%)
184 | 1 | 0.5%
188 | 1 | 0.5%
191 | 1 | 0.5%
192 | 1 | 0.5%
196 | 1 | 0.5%
203 | 1 | 0.5%
204 | 1 | 0.5%
208 | 1 | 0.5%
212 | 1 | 0.5%
214 | 1 | 0.5%

Maximum 10 values

Value | Count | Frequency (%)
894 | 1 | 0.5%
887 | 1 | 0.5%
882 | 1 | 0.5%
876 | 1 | 0.5%
862 | 1 | 0.5%
860 | 1 | 0.5%
858 | 1 | 0.5%
854 | 1 | 0.5%
850 | 1 | 0.5%
840 | 1 | 0.5%

국가명 (country name)
Text

Unique

Distinct: 192
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 17
Median length: 12.5
Mean length: 4.453125
Min length: 1

Characters and Unicode

Total characters: 855
Distinct characters: 199
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
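
The category counts in this section can be illustrated with Python's standard `unicodedata` module. A minimal sketch, assuming the report's "Other Letter" and "Space Separator" correspond to the Unicode general categories `Lo` and `Zs`; the names are the five sample rows of 국가명, so the counts cover those rows only and not the full column. The standard library exposes the general category but not the script or block, so those are left out.

```python
import unicodedata
from collections import Counter

# The five sample values of 국가명 shown in this report.
names = ["쿡 제도", "코스타리카", "크로아티아", "쿠바", "키프로스"]

# Count the Unicode general category of every character.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

for category, count in category_counts.most_common():
    share = 100 * count / sum(category_counts.values())
    print(category, count, f"{share:.1f}%")
```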

Unique

Unique: 192
Unique (%): 100.0%

Sample

1st row: 쿡 제도
2nd row: 코스타리카
3rd row: 크로아티아
4th row: 쿠바
5th row: 키프로스

Words

Value | Count | Frequency (%)
제도 | 11 | 4.7%
(not preserved) | 4 | 1.7%
공화국 | 3 | 1.3%
프랑스령 | 3 | 1.3%
도미니카 | 2 | 0.9%
기니 | 2 | 0.9%
미국령 | 2 | 0.9%
루마니아 | 1 | 0.4%
러시아 | 1 | 0.4%
앵귈라 | 1 | 0.4%
Other values (202) | 202 | 87.1%
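
The word table above counts whitespace-separated tokens rather than whole values. A minimal sketch of that kind of count, assuming tokens are split on whitespace and lower-cased (an assumption about the tool's behaviour, not something the report states); the names are a handful of 국가명 values from this report's sample rows, so the counts do not match the full table.

```python
from collections import Counter

# A few 국가명 values taken from the report's sample rows.
names = [
    "쿡 제도",
    "도미니카",
    "도미니카 공화국",
    "미국",
    "미국령 버진아일랜드",
    "왈리스 퓌튀나",
]

# Split each value on whitespace and lower-case the tokens.
tokens = [token.lower() for name in names for token in name.split()]
word_counts = Counter(tokens)

for word, count in word_counts.most_common(3):
    print(word, count, f"{100 * count / len(tokens):.1f}%")
```

Under the same assumption, the full column would yield 192 + 40 = 232 tokens (one per value plus one per space), which is consistent with 11 occurrences of 제도 being reported as 4.7%.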

Most occurring characters

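Value | Count | Frequency (%)
(not preserved) | 47 | 5.5%
(space) | 40 | 4.7%
(not preserved) | 34 | 4.0%
(not preserved) | 27 | 3.2%
(not preserved) | 22 | 2.6%
(not preserved) | 22 | 2.6%
(not preserved) | 21 | 2.5%
(not preserved) | 20 | 2.3%
(not preserved) | 18 | 2.1%
(not preserved) | 16 | 1.9%
Other values (189) | 588 | 68.8%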

Most occurring categories

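Value | Count | Frequency (%)
Other Letter | 815 | 95.3%
Space Separator | 40 | 4.7%

Most frequent character per category

Other Letter

Value | Count | Frequency (%)
(not preserved) | 47 | 5.8%
(not preserved) | 34 | 4.2%
(not preserved) | 27 | 3.3%
(not preserved) | 22 | 2.7%
(not preserved) | 22 | 2.7%
(not preserved) | 21 | 2.6%
(not preserved) | 20 | 2.5%
(not preserved) | 18 | 2.2%
(not preserved) | 16 | 2.0%
(not preserved) | 15 | 1.8%
Other values (188) | 573 | 70.3%

Space Separator

Value | Count | Frequency (%)
(space) | 40 | 100.0%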

Most occurring scripts

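Value | Count | Frequency (%)
Hangul | 815 | 95.3%
Common | 40 | 4.7%

Most frequent character per script

Hangul

Value | Count | Frequency (%)
(not preserved) | 47 | 5.8%
(not preserved) | 34 | 4.2%
(not preserved) | 27 | 3.3%
(not preserved) | 22 | 2.7%
(not preserved) | 22 | 2.7%
(not preserved) | 21 | 2.6%
(not preserved) | 20 | 2.5%
(not preserved) | 18 | 2.2%
(not preserved) | 16 | 2.0%
(not preserved) | 15 | 1.8%
Other values (188) | 573 | 70.3%

Common

Value | Count | Frequency (%)
(space) | 40 | 100.0%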

Most occurring blocks

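Value | Count | Frequency (%)
Hangul | 815 | 95.3%
ASCII | 40 | 4.7%

Most frequent character per block

Hangul

Value | Count | Frequency (%)
(not preserved) | 47 | 5.8%
(not preserved) | 34 | 4.2%
(not preserved) | 27 | 3.3%
(not preserved) | 22 | 2.7%
(not preserved) | 22 | 2.7%
(not preserved) | 21 | 2.6%
(not preserved) | 20 | 2.5%
(not preserved) | 18 | 2.2%
(not preserved) | 16 | 2.0%
(not preserved) | 15 | 1.8%
Other values (188) | 573 | 70.3%

ASCII

Value | Count | Frequency (%)
(space) | 40 | 100.0%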

국가코드 (country code)
Text

Distinct: 191
Distinct (%): 100.0%
Missing: 1
Missing (%): 0.5%
Memory size: 1.6 KiB

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Characters and Unicode

Total characters: 382
Distinct characters: 26
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
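
The length and character statistics above imply that every non-missing 국가코드 value is exactly two uppercase ASCII letters (191 values × 2 characters = 382 characters). A minimal sketch of a format check built on that reading; `is_two_letter_code` is a hypothetical helper, the codes are the five sample values from this report, and the trailing `None` is an illustrative stand-in for the one missing value.

```python
import re

# Exactly two uppercase ASCII letters.
CODE_PATTERN = re.compile(r"[A-Z]{2}")


def is_two_letter_code(value) -> bool:
    """True for a string made of exactly two uppercase ASCII letters."""
    return isinstance(value, str) and CODE_PATTERN.fullmatch(value) is not None


# Sample codes from this report, plus None as an illustrative missing value.
codes = ["CK", "CR", "HR", "CU", "CY", None]
valid = [code for code in codes if is_two_letter_code(code)]

print(len(valid), sum(len(code) for code in valid))
```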

Unique

Unique: 191
Unique (%): 100.0%

Sample

1st row: CK
2nd row: CR
3rd row: HR
4th row: CU
5th row: CY

Words

Value | Count | Frequency (%)
ck | 1 | 0.5%
sh | 1 | 0.5%
pl | 1 | 0.5%
pt | 1 | 0.5%
gw | 1 | 0.5%
tl | 1 | 0.5%
pr | 1 | 0.5%
qa | 1 | 0.5%
re | 1 | 0.5%
ro | 1 | 0.5%
Other values (181) | 181 | 94.8%

Most occurring characters

Value | Count | Frequency (%)
M | 33 | 8.6%
G | 28 | 7.3%
S | 27 | 7.1%
T | 23 | 6.0%
E | 21 | 5.5%
N | 21 | 5.5%
P | 19 | 5.0%
R | 18 | 4.7%
I | 18 | 4.7%
L | 17 | 4.5%
Other values (16) | 157 | 41.1%

Most occurring categories

Value | Count | Frequency (%)
Uppercase Letter | 382 | 100.0%

Most frequent character per category

Uppercase Letter

Value | Count | Frequency (%)
M | 33 | 8.6%
G | 28 | 7.3%
S | 27 | 7.1%
T | 23 | 6.0%
E | 21 | 5.5%
N | 21 | 5.5%
P | 19 | 5.0%
R | 18 | 4.7%
I | 18 | 4.7%
L | 17 | 4.5%
Other values (16) | 157 | 41.1%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 382 | 100.0%

Most frequent character per script

Latin

Value | Count | Frequency (%)
M | 33 | 8.6%
G | 28 | 7.3%
S | 27 | 7.1%
T | 23 | 6.0%
E | 21 | 5.5%
N | 21 | 5.5%
P | 19 | 5.0%
R | 18 | 4.7%
I | 18 | 4.7%
L | 17 | 4.5%
Other values (16) | 157 | 41.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 382 | 100.0%

Most frequent character per block

ASCII

Value | Count | Frequency (%)
M | 33 | 8.6%
G | 28 | 7.3%
S | 27 | 7.1%
T | 23 | 6.0%
E | 21 | 5.5%
N | 21 | 5.5%
P | 19 | 5.0%
R | 18 | 4.7%
I | 18 | 4.7%
L | 17 | 4.5%
Other values (16) | 157 | 41.1%

Interactions

[Figure: interactions plot]

Missing values

[Figure: nullity count by column] A simple visualization of nullity by column.
[Figure: nullity matrix] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
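
The nullity-by-column view boils down to a count of missing entries per column. A minimal sketch, assuming rows are held as dictionaries and a missing entry is represented by `None`; `missing_per_column` is a hypothetical helper, the first two rows come from this report's sample, and the third row is invented purely to show a missing 국가코드.

```python
def missing_per_column(rows):
    """Count None entries for each column across a list of row dicts."""
    counts = {}
    for row in rows:
        for column, value in row.items():
            counts[column] = counts.get(column, 0) + (value is None)
    return counts


rows = [
    {"번호": 184, "국가명": "쿡 제도", "국가코드": "CK"},
    {"번호": 188, "국가명": "코스타리카", "국가코드": "CR"},
    # Hypothetical row, not from the dataset: illustrates a missing code.
    {"번호": 0, "국가명": "(hypothetical)", "국가코드": None},
]

missing = missing_per_column(rows)
print(missing)
```

In the profiled dataset the only missing cell is in 국가코드, so the corresponding count would be 1 for that column and 0 for the other two.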

Sample

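First rows

Row | 번호 | 국가명 | 국가코드
0 | 184 | 쿡 제도 | CK
1 | 188 | 코스타리카 | CR
2 | 191 | 크로아티아 | HR
3 | 192 | 쿠바 | CU
4 | 196 | 키프로스 | CY
5 | 203 | 체코 | CZ
6 | 204 | 베냉 | BJ
7 | 208 | 덴마크 | DK
8 | 212 | 도미니카 | DM
9 | 214 | 도미니카 공화국 | DO

Last rows

Row | 번호 | 국가명 | 국가코드
182 | 840 | 미국 | US
183 | 850 | 미국령 버진아일랜드 | VI
184 | 854 | 부르키나파소 | BF
185 | 858 | 우루과이 | UY
186 | 860 | 우즈베키스탄 | UZ
187 | 862 | 베네수엘라 | VE
188 | 876 | 왈리스 퓌튀나 | WF
189 | 882 | 사모아 | WS
190 | 887 | 예멘 | YE
191 | 894 | 잠비아 | ZM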