Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 169 |
Missing cells | 89 |
Missing cells (%) | 7.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 9.5 KiB |
Average record size in memory | 57.8 B |
Variable types
Text | 2 |
---|---|
Categorical | 4 |
Numeric | 1 |
Dataset
Description | 한국국제교류재단이 한국에 대한 올바른 이해를 위해 발간한 자료(역사, 문화, 사회 등)에 관한 정보를 제공합니다. |
---|---|
Author | 한국국제교류재단 |
URL | https://www.data.go.kr/data/15044309/fileData.do |
형식 is highly overall correlated with 연도 and 2 other fields | High correlation |
국제표준자료번호 유형 is highly overall correlated with 연도 and 2 other fields | High correlation |
연도 is highly overall correlated with 주제 and 2 other fields | High correlation |
주제 is highly overall correlated with 연도 and 2 other fields | High correlation |
국제표준자료번호 has 89 (52.7%) missing values | Missing |
Reproduction
Analysis started | 2024-03-14 11:32:57.254267 |
---|---|
Analysis finished | 2024-03-14 11:32:58.843216 |
Duration | 1.59 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
자료명
Text
Distinct | 94 |
---|---|
Distinct (%) | 55.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
Value | Count | Frequency (%) |
korean | 80 | 7.8% |
of | 51 | 5.0% |
korea | 46 | 4.5% |
the | 42 | 4.1% |
series | 40 | 3.9% |
culture | 26 | 2.5% |
and | 24 | 2.3% |
essentials | 20 | 2.0% |
traditional | 14 | 1.4% |
a | 13 | 1.3% |
Other values (251) | 669 |
Most occurring characters
Value | Count | Frequency (%) |
919 | ||
e | 690 | 10.4% |
r | 456 | 6.9% |
o | 452 | 6.8% |
a | 445 | 6.7% |
n | 392 | 5.9% |
s | 308 | 4.6% |
i | 292 | 4.4% |
t | 271 | 4.1% |
l | 166 | 2.5% |
Other values (115) | 2246 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 4398 | |
Space Separator | 919 | 13.8% |
Uppercase Letter | 776 | 11.7% |
Other Letter | 239 | 3.6% |
Other Punctuation | 130 | 2.0% |
Decimal Number | 99 | 1.5% |
Close Punctuation | 27 | 0.4% |
Open Punctuation | 27 | 0.4% |
Final Punctuation | 12 | 0.2% |
Math Symbol | 8 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
편 | 15 | 6.3% |
나 | 12 | 5.0% |
국 | 12 | 5.0% |
라 | 12 | 5.0% |
현 | 11 | 4.6% |
한 | 11 | 4.6% |
대 | 11 | 4.6% |
단 | 11 | 4.6% |
선 | 11 | 4.6% |
어 | 11 | 4.6% |
Other values (43) | 122 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 690 | |
r | 456 | |
o | 452 | |
a | 445 | |
n | 392 | |
s | 308 | 7.0% |
i | 292 | 6.6% |
t | 271 | 6.2% |
l | 166 | 3.8% |
u | 156 | 3.5% |
Other values (16) | 770 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 157 | |
S | 106 | |
C | 61 | 7.9% |
T | 59 | 7.6% |
E | 47 | 6.1% |
B | 45 | 5.8% |
H | 37 | 4.8% |
A | 34 | 4.4% |
J | 25 | 3.2% |
D | 25 | 3.2% |
Other values (14) | 180 |
Decimal Number
Value | Count | Frequency (%) |
1 | 31 | |
0 | 17 | |
2 | 11 | 11.1% |
4 | 11 | 11.1% |
5 | 10 | 10.1% |
3 | 6 | 6.1% |
6 | 4 | 4.0% |
9 | 3 | 3.0% |
7 | 3 | 3.0% |
8 | 3 | 3.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 79 | |
: | 26 | 20.0% |
. | 10 | 7.7% |
' | 9 | 6.9% |
! | 4 | 3.1% |
& | 2 | 1.5% |
Space Separator
Value | Count | Frequency (%) |
919 |
Close Punctuation
Value | Count | Frequency (%) |
) | 27 |
Open Punctuation
Value | Count | Frequency (%) |
( | 27 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 12 |
Math Symbol
Value | Count | Frequency (%) |
~ | 8 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 5176 | |
Common | 1222 | 18.4% |
Hangul | 239 | 3.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
편 | 15 | 6.3% |
나 | 12 | 5.0% |
국 | 12 | 5.0% |
라 | 12 | 5.0% |
현 | 11 | 4.6% |
한 | 11 | 4.6% |
대 | 11 | 4.6% |
단 | 11 | 4.6% |
선 | 11 | 4.6% |
어 | 11 | 4.6% |
Other values (43) | 122 |
Latin
Value | Count | Frequency (%) |
e | 690 | |
r | 456 | 8.8% |
o | 452 | 8.7% |
a | 445 | 8.6% |
n | 392 | 7.6% |
s | 308 | 6.0% |
i | 292 | 5.6% |
t | 271 | 5.2% |
l | 166 | 3.2% |
K | 157 | 3.0% |
Other values (41) | 1547 |
Common
Value | Count | Frequency (%) |
919 | ||
, | 79 | 6.5% |
1 | 31 | 2.5% |
) | 27 | 2.2% |
( | 27 | 2.2% |
: | 26 | 2.1% |
0 | 17 | 1.4% |
’ | 12 | 1.0% |
2 | 11 | 0.9% |
4 | 11 | 0.9% |
Other values (11) | 62 | 5.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 6384 | |
Hangul | 239 | 3.6% |
Punctuation | 12 | 0.2% |
Number Forms | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
919 | ||
e | 690 | 10.8% |
r | 456 | 7.1% |
o | 452 | 7.1% |
a | 445 | 7.0% |
n | 392 | 6.1% |
s | 308 | 4.8% |
i | 292 | 4.6% |
t | 271 | 4.2% |
l | 166 | 2.6% |
Other values (60) | 1993 |
Hangul
Value | Count | Frequency (%) |
편 | 15 | 6.3% |
나 | 12 | 5.0% |
국 | 12 | 5.0% |
라 | 12 | 5.0% |
현 | 11 | 4.6% |
한 | 11 | 4.6% |
대 | 11 | 4.6% |
단 | 11 | 4.6% |
선 | 11 | 4.6% |
어 | 11 | 4.6% |
Other values (43) | 122 |
Punctuation
Value | Count | Frequency (%) |
’ | 12 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 2 |
주제
Categorical
HIGH CORRELATION
 
Distinct | 50 |
---|---|
Distinct (%) | 29.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
Cultural Heritage | |
---|---|
Fine Arts | |
Korean Literature | 11 |
Literature, Housing, Clothing, Food, People, Cultural heritage | 9 |
Policy, Economic Situation | 9 |
Other values (45) |
Length
Max length | 81 |
---|---|
Median length | 45 |
Mean length | 24.147929 |
Min length | 4 |
Unique
Unique | 22 ? |
---|---|
Unique (%) | 13.0% |
Sample
1st row | Literature, Housing, Clothing, Food, People, Cultural heritage |
---|---|
2nd row | Literature, Housing, Clothing, Food, People, Cultural heritage |
3rd row | Literature, Housing, Clothing, Food, People, Cultural heritage |
4th row | Literature, Housing, Clothing, Food, People, Cultural heritage |
5th row | Literature, Housing, Clothing, Food, People, Cultural heritage |
Common Values
Value | Count | Frequency (%) |
Cultural Heritage | 13 | 7.7% |
Fine Arts | 12 | 7.1% |
Korean Literature | 11 | 6.5% |
Literature, Housing, Clothing, Food, People, Cultural heritage | 9 | 5.3% |
Policy, Economic Situation | 9 | 5.3% |
Korea, Tourism, Food, Architecture, Fine Arts | 9 | 5.3% |
Food | 7 | 4.1% |
Performance | 6 | 3.6% |
Housing, Clothing, Festivals, Fine Arts, Music, Performance | 6 | 3.6% |
Language | 5 | 3.0% |
Other values (40) | 82 |
Length
Value | Count | Frequency (%) |
arts | 40 | 8.0% |
cultural | 39 | 7.8% |
heritage | 34 | 6.8% |
fine | 30 | 6.0% |
food | 28 | 5.6% |
clothing | 26 | 5.2% |
architecture | 23 | 4.6% |
culture | 22 | 4.4% |
korean | 21 | 4.2% |
literature | 21 | 4.2% |
Other values (44) | 216 |
언어
Categorical
Distinct | 13 |
---|---|
Distinct (%) | 7.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
English | |
---|---|
Spanish | |
French | |
Chinese | |
Korean | |
Other values (8) |
Length
Max length | 13 |
---|---|
Median length | 7 |
Mean length | 6.9230769 |
Min length | 4 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 1.2% |
Sample
1st row | Arabic |
---|---|
2nd row | Chinese |
3rd row | English |
4th row | French |
5th row | German |
Common Values
Value | Count | Frequency (%) |
English | 82 | |
Spanish | 19 | 11.2% |
French | 16 | 9.5% |
Chinese | 14 | 8.3% |
Korean | 11 | 6.5% |
Russian | 6 | 3.6% |
German | 5 | 3.0% |
Japanese | 5 | 3.0% |
Arabic | 4 | 2.4% |
Vietnamese | 3 | 1.8% |
Other values (3) | 4 | 2.4% |
Length
Value | Count | Frequency (%) |
english | 82 | |
spanish | 19 | 11.2% |
french | 16 | 9.5% |
chinese | 14 | 8.3% |
korean | 11 | 6.5% |
russian | 6 | 3.6% |
german | 5 | 3.0% |
japanese | 5 | 3.0% |
arabic | 4 | 2.4% |
vietnamese | 3 | 1.8% |
Other values (3) | 4 | 2.4% |
형식
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 2.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
DVD | |
---|---|
Book | |
Periodical (Quarterly) | |
Webzine (Monthly) | 1 |
Length
Max length | 22 |
---|---|
Median length | 3 |
Mean length | 4.5266272 |
Min length | 3 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.6% |
Sample
1st row | Periodical (Quarterly) |
---|---|
2nd row | Periodical (Quarterly) |
3rd row | Periodical (Quarterly) |
4th row | Periodical (Quarterly) |
5th row | Periodical (Quarterly) |
Common Values
Value | Count | Frequency (%) |
DVD | 86 | |
Book | 73 | |
Periodical (Quarterly) | 9 | 5.3% |
Webzine (Monthly) | 1 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
dvd | 86 | |
book | 73 | |
periodical | 9 | 5.0% |
quarterly | 9 | 5.0% |
webzine | 1 | 0.6% |
monthly | 1 | 0.6% |
연도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 26 |
---|---|
Distinct (%) | 15.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2007.8462 |
Minimum | 1987 |
---|---|
Maximum | 2021 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.6 KiB |
Quantile statistics
Minimum | 1987 |
---|---|
5-th percentile | 1987.8 |
Q1 | 2006 |
median | 2009 |
Q3 | 2011 |
95-th percentile | 2019 |
Maximum | 2021 |
Range | 34 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 6.953074 |
---|---|
Coefficient of variation (CV) | 0.0034629516 |
Kurtosis | 2.9974351 |
Mean | 2007.8462 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -1.4260624 |
Sum | 339326 |
Variance | 48.345238 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
2010 | 24 | |
2006 | 21 | |
2007 | 19 | |
2008 | 18 | |
2009 | 17 | |
2012 | 13 | |
1987 | 9 | 5.3% |
2011 | 8 | 4.7% |
2014 | 6 | 3.6% |
2013 | 5 | 3.0% |
Other values (16) | 29 |
Value | Count | Frequency (%) |
1987 | 9 | |
1989 | 1 | 0.6% |
1992 | 1 | 0.6% |
1993 | 1 | 0.6% |
1994 | 1 | 0.6% |
1996 | 1 | 0.6% |
1997 | 2 | 1.2% |
1998 | 1 | 0.6% |
2004 | 2 | 1.2% |
2005 | 5 |
Value | Count | Frequency (%) |
2021 | 4 | 2.4% |
2020 | 1 | 0.6% |
2019 | 5 | 3.0% |
2018 | 1 | 0.6% |
2017 | 1 | 0.6% |
2016 | 1 | 0.6% |
2015 | 1 | 0.6% |
2014 | 6 | |
2013 | 5 | 3.0% |
2012 | 13 |
국제표준자료번호 유형
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
<NA> | |
---|---|
ISBN | |
ISSN |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | ISSN |
---|---|
2nd row | ISSN |
3rd row | ISSN |
4th row | ISSN |
5th row | ISSN |
Common Values
Value | Count | Frequency (%) |
<NA> | 89 | |
ISBN | 70 | |
ISSN | 10 | 5.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 89 | |
isbn | 70 | |
issn | 10 | 5.9% |
국제표준자료번호
Text
MISSING
 
Distinct | 80 |
---|---|
Distinct (%) | 100.0% |
Missing | 89 |
Missing (%) | 52.7% |
Memory size | 1.4 KiB |
Value | Count | Frequency (%) |
9788986090338 | 1 | 1.2% |
10160744 | 1 | 1.2% |
9788991913875 | 1 | 1.2% |
9788997639403 | 1 | 1.2% |
9788997639397 | 1 | 1.2% |
9788997639373 | 1 | 1.2% |
9788997639045 | 1 | 1.2% |
9788997639236 | 1 | 1.2% |
9788997639076 | 1 | 1.2% |
9788997639052 | 1 | 1.2% |
Other values (70) | 70 |
Most occurring characters
Value | Count | Frequency (%) |
9 | 213 | |
8 | 157 | |
7 | 110 | |
1 | 99 | |
6 | 88 | |
3 | 73 | 7.4% |
0 | 70 | 7.1% |
5 | 67 | 6.8% |
2 | 66 | 6.7% |
4 | 40 | 4.0% |
Other values (2) | 7 | 0.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 983 | |
Space Separator | 6 | 0.6% |
Uppercase Letter | 1 | 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
9 | 213 | |
8 | 157 | |
7 | 110 | |
1 | 99 | |
6 | 88 | |
3 | 73 | 7.4% |
0 | 70 | 7.1% |
5 | 67 | 6.8% |
2 | 66 | 6.7% |
4 | 40 | 4.1% |
Space Separator
Value | Count | Frequency (%) |
6 |
Uppercase Letter
Value | Count | Frequency (%) |
X | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 989 | |
Latin | 1 | 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
9 | 213 | |
8 | 157 | |
7 | 110 | |
1 | 99 | |
6 | 88 | |
3 | 73 | 7.4% |
0 | 70 | 7.1% |
5 | 67 | 6.8% |
2 | 66 | 6.7% |
4 | 40 | 4.0% |
Latin
Value | Count | Frequency (%) |
X | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 990 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
9 | 213 | |
8 | 157 | |
7 | 110 | |
1 | 99 | |
6 | 88 | |
3 | 73 | 7.4% |
0 | 70 | 7.1% |
5 | 67 | 6.8% |
2 | 66 | 6.7% |
4 | 40 | 4.0% |
Other values (2) | 7 | 0.7% |
자료명 | 주제 | 언어 | 형식 | 연도 | 국제표준자료번호 유형 | 국제표준자료번호 | |
---|---|---|---|---|---|---|---|
자료명 | 1.000 | 1.000 | 0.000 | 1.000 | 0.997 | 1.000 | 1.000 |
주제 | 1.000 | 1.000 | 0.000 | 0.989 | 0.885 | 1.000 | 1.000 |
언어 | 0.000 | 0.000 | 1.000 | 0.519 | 0.271 | 0.659 | 1.000 |
형식 | 1.000 | 0.989 | 0.519 | 1.000 | 0.719 | 1.000 | 1.000 |
연도 | 0.997 | 0.885 | 0.271 | 0.719 | 1.000 | 1.000 | 1.000 |
국제표준자료번호 유형 | 1.000 | 1.000 | 0.659 | 1.000 | 1.000 | 1.000 | 1.000 |
국제표준자료번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
형식 | 주제 | 언어 | 국제표준자료번호 유형 | |
---|---|---|---|---|
형식 | 1.000 | 0.790 | 0.320 | 0.994 |
주제 | 0.790 | 1.000 | 0.000 | 0.760 |
언어 | 0.320 | 0.000 | 1.000 | 0.483 |
국제표준자료번호 유형 | 0.994 | 0.760 | 0.483 | 1.000 |
연도 | 주제 | 언어 | 형식 | 국제표준자료번호 유형 | |
---|---|---|---|---|---|
연도 | 1.000 | 0.549 | 0.136 | 0.708 | 0.954 |
주제 | 0.549 | 1.000 | 0.000 | 0.790 | 0.760 |
언어 | 0.136 | 0.000 | 1.000 | 0.320 | 0.483 |
형식 | 0.708 | 0.790 | 0.320 | 1.000 | 0.994 |
국제표준자료번호 유형 | 0.954 | 0.760 | 0.483 | 0.994 | 1.000 |
자료명 | 주제 | 언어 | 형식 | 연도 | 국제표준자료번호 유형 | 국제표준자료번호 | |
---|---|---|---|---|---|---|---|
0 | Koreana | Literature, Housing, Clothing, Food, People, Cultural heritage | Arabic | Periodical (Quarterly) | 1987 | ISSN | 17386446 |
1 | Koreana | Literature, Housing, Clothing, Food, People, Cultural heritage | Chinese | Periodical (Quarterly) | 1987 | ISSN | 12258083 |
2 | Koreana | Literature, Housing, Clothing, Food, People, Cultural heritage | English | Periodical (Quarterly) | 1987 | ISSN | 10160744 |
3 | Koreana | Literature, Housing, Clothing, Food, People, Cultural heritage | French | Periodical (Quarterly) | 1987 | ISSN | 12259101 |
4 | Koreana | Literature, Housing, Clothing, Food, People, Cultural heritage | German | Periodical (Quarterly) | 1987 | ISSN | 19750617 |
5 | Koreana | Literature, Housing, Clothing, Food, People, Cultural heritage | Japanese | Periodical (Quarterly) | 1987 | ISSN | 12254592 |
6 | Koreana | Literature, Housing, Clothing, Food, People, Cultural heritage | Russian | Periodical (Quarterly) | 1987 | ISSN | 17388252 |
7 | Koreana | Literature, Housing, Clothing, Food, People, Cultural heritage | Spanish | Periodical (Quarterly) | 1987 | ISSN | 12254606 |
8 | Koreana | Literature, Housing, Clothing, Food, People, Cultural heritage | Indonesian | Periodical (Quarterly) | 1987 | ISSN | 22875565 |
9 | Korean Relics in the U.S. (Vol. 1~2) | Arts, Craftwork | English | Book | 1989 | <NA> | <NA> |
자료명 | 주제 | 언어 | 형식 | 연도 | 국제표준자료번호 유형 | 국제표준자료번호 | |
---|---|---|---|---|---|---|---|
159 | 공공미술로 읽는 베트남 사회와 문화: 벽화로 이어진 3년의 기록 | Public Art, Wall Painting | Korean | Book | 2019 | ISBN | 9791189688202 |
160 | 한국현대단편소설선집 러시아어판 | Korean Literature | Russian | Book | 2019 | ISBN | 9791156043331 |
161 | 한국현대단편소설선집 베트남어판 1권 | Korean Literature | Vietnamese | Book | 2019 | ISBN | 9786046856528 |
162 | 한국현대단편소설선집 인도네시아어판 | Korean Literature | Indonesian | Book | 2019 | ISBN | 9786020632179 |
163 | 한국현대단편소설선집 태국어판 | Korean Literature | Thai | Book | 2019 | ISBN | 9786169241270 |
164 | 한국현대단편소설선집 독일어판 | Korean Literature | German | Book | 2020 | ISBN | 9783862056361 |
165 | 한국현대단편소설선집 베트남어판 2권 | Korean Literature | Vietnamese | Book | 2021 | ISBN | 9786043353891 |
166 | 한국현대단편소설선집 스페인어판 1권 | Korean Literature | Spanish | Book | 2021 | ISBN | 9788413374857 |
167 | 한국현대단편소설선집 스페인어판 2권 | Korean Literature | Spanish | Book | 2021 | ISBN | 9788413374864 |
168 | 한국현대단편소설선집 일본어판 | Korean Literature | Japanese | Book | 2021 | ISBN | 9784910214238 |