Overview

Dataset statistics

Number of variables7
Number of observations169
Missing cells89
Missing cells (%)7.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.5 KiB
Average record size in memory57.8 B

Variable types

Text2
Categorical4
Numeric1

Dataset

Description한국국제교류재단이 한국에 대한 올바른 이해를 위해 발간한 자료(역사, 문화, 사회 등)에 관한 정보를 제공합니다.
Author한국국제교류재단
URLhttps://www.data.go.kr/data/15044309/fileData.do

Alerts

형식 is highly overall correlated with 연도 and 2 other fieldsHigh correlation
국제표준자료번호 유형 is highly overall correlated with 연도 and 2 other fieldsHigh correlation
연도 is highly overall correlated with 주제 and 2 other fieldsHigh correlation
주제 is highly overall correlated with 연도 and 2 other fieldsHigh correlation
국제표준자료번호 has 89 (52.7%) missing valuesMissing

Reproduction

Analysis started2024-03-14 11:32:57.254267
Analysis finished2024-03-14 11:32:58.843216
Duration1.59 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct94
Distinct (%)55.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-14T20:33:00.003283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length85
Median length71
Mean length39.272189
Min length8

Characters and Unicode

Total characters6637
Distinct characters125
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)42.6%

Sample

1st rowKoreana
2nd rowKoreana
3rd rowKoreana
4th rowKoreana
5th rowKoreana
ValueCountFrequency (%)
korean 80
 
7.8%
of 51
 
5.0%
korea 46
 
4.5%
the 42
 
4.1%
series 40
 
3.9%
culture 26
 
2.5%
and 24
 
2.3%
essentials 20
 
2.0%
traditional 14
 
1.4%
a 13
 
1.3%
Other values (251) 669
65.3%
2024-03-14T20:33:02.505557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
919
13.8%
e 690
 
10.4%
r 456
 
6.9%
o 452
 
6.8%
a 445
 
6.7%
n 392
 
5.9%
s 308
 
4.6%
i 292
 
4.4%
t 271
 
4.1%
l 166
 
2.5%
Other values (115) 2246
33.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4398
66.3%
Space Separator 919
 
13.8%
Uppercase Letter 776
 
11.7%
Other Letter 239
 
3.6%
Other Punctuation 130
 
2.0%
Decimal Number 99
 
1.5%
Close Punctuation 27
 
0.4%
Open Punctuation 27
 
0.4%
Final Punctuation 12
 
0.2%
Math Symbol 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
6.3%
12
 
5.0%
12
 
5.0%
12
 
5.0%
11
 
4.6%
11
 
4.6%
11
 
4.6%
11
 
4.6%
11
 
4.6%
11
 
4.6%
Other values (43) 122
51.0%
Lowercase Letter
ValueCountFrequency (%)
e 690
15.7%
r 456
10.4%
o 452
10.3%
a 445
10.1%
n 392
8.9%
s 308
 
7.0%
i 292
 
6.6%
t 271
 
6.2%
l 166
 
3.8%
u 156
 
3.5%
Other values (16) 770
17.5%
Uppercase Letter
ValueCountFrequency (%)
K 157
20.2%
S 106
13.7%
C 61
 
7.9%
T 59
 
7.6%
E 47
 
6.1%
B 45
 
5.8%
H 37
 
4.8%
A 34
 
4.4%
J 25
 
3.2%
D 25
 
3.2%
Other values (14) 180
23.2%
Decimal Number
ValueCountFrequency (%)
1 31
31.3%
0 17
17.2%
2 11
 
11.1%
4 11
 
11.1%
5 10
 
10.1%
3 6
 
6.1%
6 4
 
4.0%
9 3
 
3.0%
7 3
 
3.0%
8 3
 
3.0%
Other Punctuation
ValueCountFrequency (%)
, 79
60.8%
: 26
 
20.0%
. 10
 
7.7%
' 9
 
6.9%
! 4
 
3.1%
& 2
 
1.5%
Space Separator
ValueCountFrequency (%)
919
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Final Punctuation
ValueCountFrequency (%)
12
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5176
78.0%
Common 1222
 
18.4%
Hangul 239
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
6.3%
12
 
5.0%
12
 
5.0%
12
 
5.0%
11
 
4.6%
11
 
4.6%
11
 
4.6%
11
 
4.6%
11
 
4.6%
11
 
4.6%
Other values (43) 122
51.0%
Latin
ValueCountFrequency (%)
e 690
13.3%
r 456
 
8.8%
o 452
 
8.7%
a 445
 
8.6%
n 392
 
7.6%
s 308
 
6.0%
i 292
 
5.6%
t 271
 
5.2%
l 166
 
3.2%
K 157
 
3.0%
Other values (41) 1547
29.9%
Common
ValueCountFrequency (%)
919
75.2%
, 79
 
6.5%
1 31
 
2.5%
) 27
 
2.2%
( 27
 
2.2%
: 26
 
2.1%
0 17
 
1.4%
12
 
1.0%
2 11
 
0.9%
4 11
 
0.9%
Other values (11) 62
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6384
96.2%
Hangul 239
 
3.6%
Punctuation 12
 
0.2%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
919
14.4%
e 690
 
10.8%
r 456
 
7.1%
o 452
 
7.1%
a 445
 
7.0%
n 392
 
6.1%
s 308
 
4.8%
i 292
 
4.6%
t 271
 
4.2%
l 166
 
2.6%
Other values (60) 1993
31.2%
Hangul
ValueCountFrequency (%)
15
 
6.3%
12
 
5.0%
12
 
5.0%
12
 
5.0%
11
 
4.6%
11
 
4.6%
11
 
4.6%
11
 
4.6%
11
 
4.6%
11
 
4.6%
Other values (43) 122
51.0%
Punctuation
ValueCountFrequency (%)
12
100.0%
Number Forms
ValueCountFrequency (%)
2
100.0%

주제
Categorical

HIGH CORRELATION 

Distinct50
Distinct (%)29.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
Cultural Heritage
13 
Fine Arts
12 
Korean Literature
 
11
Literature, Housing, Clothing, Food, People, Cultural heritage
 
9
Policy, Economic Situation
 
9
Other values (45)
115 

Length

Max length81
Median length45
Mean length24.147929
Min length4

Unique

Unique22 ?
Unique (%)13.0%

Sample

1st rowLiterature, Housing, Clothing, Food, People, Cultural heritage
2nd rowLiterature, Housing, Clothing, Food, People, Cultural heritage
3rd rowLiterature, Housing, Clothing, Food, People, Cultural heritage
4th rowLiterature, Housing, Clothing, Food, People, Cultural heritage
5th rowLiterature, Housing, Clothing, Food, People, Cultural heritage

Common Values

ValueCountFrequency (%)
Cultural Heritage 13
 
7.7%
Fine Arts 12
 
7.1%
Korean Literature 11
 
6.5%
Literature, Housing, Clothing, Food, People, Cultural heritage 9
 
5.3%
Policy, Economic Situation 9
 
5.3%
Korea, Tourism, Food, Architecture, Fine Arts 9
 
5.3%
Food 7
 
4.1%
Performance 6
 
3.6%
Housing, Clothing, Festivals, Fine Arts, Music, Performance 6
 
3.6%
Language 5
 
3.0%
Other values (40) 82
48.5%

Length

2024-03-14T20:33:03.090278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
arts 40
 
8.0%
cultural 39
 
7.8%
heritage 34
 
6.8%
fine 30
 
6.0%
food 28
 
5.6%
clothing 26
 
5.2%
architecture 23
 
4.6%
culture 22
 
4.4%
korean 21
 
4.2%
literature 21
 
4.2%
Other values (44) 216
43.2%

언어
Categorical

Distinct13
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
English
82 
Spanish
19 
French
16 
Chinese
14 
Korean
11 
Other values (8)
27 

Length

Max length13
Median length7
Mean length6.9230769
Min length4

Unique

Unique2 ?
Unique (%)1.2%

Sample

1st rowArabic
2nd rowChinese
3rd rowEnglish
4th rowFrench
5th rowGerman

Common Values

ValueCountFrequency (%)
English 82
48.5%
Spanish 19
 
11.2%
French 16
 
9.5%
Chinese 14
 
8.3%
Korean 11
 
6.5%
Russian 6
 
3.6%
German 5
 
3.0%
Japanese 5
 
3.0%
Arabic 4
 
2.4%
Vietnamese 3
 
1.8%
Other values (3) 4
 
2.4%

Length

2024-03-14T20:33:03.506606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
english 82
48.5%
spanish 19
 
11.2%
french 16
 
9.5%
chinese 14
 
8.3%
korean 11
 
6.5%
russian 6
 
3.6%
german 5
 
3.0%
japanese 5
 
3.0%
arabic 4
 
2.4%
vietnamese 3
 
1.8%
Other values (3) 4
 
2.4%

형식
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
DVD
86 
Book
73 
Periodical (Quarterly)
Webzine (Monthly)
 
1

Length

Max length22
Median length3
Mean length4.5266272
Min length3

Unique

Unique1 ?
Unique (%)0.6%

Sample

1st rowPeriodical (Quarterly)
2nd rowPeriodical (Quarterly)
3rd rowPeriodical (Quarterly)
4th rowPeriodical (Quarterly)
5th rowPeriodical (Quarterly)

Common Values

ValueCountFrequency (%)
DVD 86
50.9%
Book 73
43.2%
Periodical (Quarterly) 9
 
5.3%
Webzine (Monthly) 1
 
0.6%

Length

2024-03-14T20:33:03.895823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:33:04.107334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
dvd 86
48.0%
book 73
40.8%
periodical 9
 
5.0%
quarterly 9
 
5.0%
webzine 1
 
0.6%
monthly 1
 
0.6%

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct26
Distinct (%)15.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2007.8462
Minimum1987
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2024-03-14T20:33:04.300536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1987
5-th percentile1987.8
Q12006
median2009
Q32011
95-th percentile2019
Maximum2021
Range34
Interquartile range (IQR)5

Descriptive statistics

Standard deviation6.953074
Coefficient of variation (CV)0.0034629516
Kurtosis2.9974351
Mean2007.8462
Median Absolute Deviation (MAD)2
Skewness-1.4260624
Sum339326
Variance48.345238
MonotonicityIncreasing
2024-03-14T20:33:04.520774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
2010 24
14.2%
2006 21
12.4%
2007 19
11.2%
2008 18
10.7%
2009 17
10.1%
2012 13
7.7%
1987 9
 
5.3%
2011 8
 
4.7%
2014 6
 
3.6%
2013 5
 
3.0%
Other values (16) 29
17.2%
ValueCountFrequency (%)
1987 9
5.3%
1989 1
 
0.6%
1992 1
 
0.6%
1993 1
 
0.6%
1994 1
 
0.6%
1996 1
 
0.6%
1997 2
 
1.2%
1998 1
 
0.6%
2004 2
 
1.2%
2005 5
3.0%
ValueCountFrequency (%)
2021 4
 
2.4%
2020 1
 
0.6%
2019 5
 
3.0%
2018 1
 
0.6%
2017 1
 
0.6%
2016 1
 
0.6%
2015 1
 
0.6%
2014 6
3.6%
2013 5
 
3.0%
2012 13
7.7%

국제표준자료번호 유형
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
<NA>
89 
ISBN
70 
ISSN
10 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowISSN
2nd rowISSN
3rd rowISSN
4th rowISSN
5th rowISSN

Common Values

ValueCountFrequency (%)
<NA> 89
52.7%
ISBN 70
41.4%
ISSN 10
 
5.9%

Length

2024-03-14T20:33:04.750426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:33:05.001236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 89
52.7%
isbn 70
41.4%
issn 10
 
5.9%
Distinct80
Distinct (%)100.0%
Missing89
Missing (%)52.7%
Memory size1.4 KiB
2024-03-14T20:33:05.716588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length12.375
Min length8

Characters and Unicode

Total characters990
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique80 ?
Unique (%)100.0%

Sample

1st row17386446
2nd row12258083
3rd row10160744
4th row12259101
5th row19750617
ValueCountFrequency (%)
9788986090338 1
 
1.2%
10160744 1
 
1.2%
9788991913875 1
 
1.2%
9788997639403 1
 
1.2%
9788997639397 1
 
1.2%
9788997639373 1
 
1.2%
9788997639045 1
 
1.2%
9788997639236 1
 
1.2%
9788997639076 1
 
1.2%
9788997639052 1
 
1.2%
Other values (70) 70
87.5%
2024-03-14T20:33:06.748137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 213
21.5%
8 157
15.9%
7 110
11.1%
1 99
10.0%
6 88
8.9%
3 73
 
7.4%
0 70
 
7.1%
5 67
 
6.8%
2 66
 
6.7%
4 40
 
4.0%
Other values (2) 7
 
0.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 983
99.3%
Space Separator 6
 
0.6%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 213
21.7%
8 157
16.0%
7 110
11.2%
1 99
10.1%
6 88
9.0%
3 73
 
7.4%
0 70
 
7.1%
5 67
 
6.8%
2 66
 
6.7%
4 40
 
4.1%
Space Separator
ValueCountFrequency (%)
6
100.0%
Uppercase Letter
ValueCountFrequency (%)
X 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 989
99.9%
Latin 1
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
9 213
21.5%
8 157
15.9%
7 110
11.1%
1 99
10.0%
6 88
8.9%
3 73
 
7.4%
0 70
 
7.1%
5 67
 
6.8%
2 66
 
6.7%
4 40
 
4.0%
Latin
ValueCountFrequency (%)
X 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 990
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 213
21.5%
8 157
15.9%
7 110
11.1%
1 99
10.0%
6 88
8.9%
3 73
 
7.4%
0 70
 
7.1%
5 67
 
6.8%
2 66
 
6.7%
4 40
 
4.0%
Other values (2) 7
 
0.7%

Interactions

2024-03-14T20:32:57.867804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T20:33:06.916475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자료명주제언어형식연도국제표준자료번호 유형국제표준자료번호
자료명1.0001.0000.0001.0000.9971.0001.000
주제1.0001.0000.0000.9890.8851.0001.000
언어0.0000.0001.0000.5190.2710.6591.000
형식1.0000.9890.5191.0000.7191.0001.000
연도0.9970.8850.2710.7191.0001.0001.000
국제표준자료번호 유형1.0001.0000.6591.0001.0001.0001.000
국제표준자료번호1.0001.0001.0001.0001.0001.0001.000
2024-03-14T20:33:07.101596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
형식주제언어국제표준자료번호 유형
형식1.0000.7900.3200.994
주제0.7901.0000.0000.760
언어0.3200.0001.0000.483
국제표준자료번호 유형0.9940.7600.4831.000
2024-03-14T20:33:07.284961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도주제언어형식국제표준자료번호 유형
연도1.0000.5490.1360.7080.954
주제0.5491.0000.0000.7900.760
언어0.1360.0001.0000.3200.483
형식0.7080.7900.3201.0000.994
국제표준자료번호 유형0.9540.7600.4830.9941.000

Missing values

2024-03-14T20:32:58.285608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T20:32:58.684899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자료명주제언어형식연도국제표준자료번호 유형국제표준자료번호
0KoreanaLiterature, Housing, Clothing, Food, People, Cultural heritageArabicPeriodical (Quarterly)1987ISSN17386446
1KoreanaLiterature, Housing, Clothing, Food, People, Cultural heritageChinesePeriodical (Quarterly)1987ISSN12258083
2KoreanaLiterature, Housing, Clothing, Food, People, Cultural heritageEnglishPeriodical (Quarterly)1987ISSN10160744
3KoreanaLiterature, Housing, Clothing, Food, People, Cultural heritageFrenchPeriodical (Quarterly)1987ISSN12259101
4KoreanaLiterature, Housing, Clothing, Food, People, Cultural heritageGermanPeriodical (Quarterly)1987ISSN19750617
5KoreanaLiterature, Housing, Clothing, Food, People, Cultural heritageJapanesePeriodical (Quarterly)1987ISSN12254592
6KoreanaLiterature, Housing, Clothing, Food, People, Cultural heritageRussianPeriodical (Quarterly)1987ISSN17388252
7KoreanaLiterature, Housing, Clothing, Food, People, Cultural heritageSpanishPeriodical (Quarterly)1987ISSN12254606
8KoreanaLiterature, Housing, Clothing, Food, People, Cultural heritageIndonesianPeriodical (Quarterly)1987ISSN22875565
9Korean Relics in the U.S. (Vol. 1~2)Arts, CraftworkEnglishBook1989<NA><NA>
자료명주제언어형식연도국제표준자료번호 유형국제표준자료번호
159공공미술로 읽는 베트남 사회와 문화: 벽화로 이어진 3년의 기록Public Art, Wall PaintingKoreanBook2019ISBN9791189688202
160한국현대단편소설선집 러시아어판Korean LiteratureRussianBook2019ISBN9791156043331
161한국현대단편소설선집 베트남어판 1권Korean LiteratureVietnameseBook2019ISBN9786046856528
162한국현대단편소설선집 인도네시아어판Korean LiteratureIndonesianBook2019ISBN9786020632179
163한국현대단편소설선집 태국어판Korean LiteratureThaiBook2019ISBN9786169241270
164한국현대단편소설선집 독일어판Korean LiteratureGermanBook2020ISBN9783862056361
165한국현대단편소설선집 베트남어판 2권Korean LiteratureVietnameseBook2021ISBN9786043353891
166한국현대단편소설선집 스페인어판 1권Korean LiteratureSpanishBook2021ISBN9788413374857
167한국현대단편소설선집 스페인어판 2권Korean LiteratureSpanishBook2021ISBN9788413374864
168한국현대단편소설선집 일본어판Korean LiteratureJapaneseBook2021ISBN9784910214238