Overview

Dataset statistics

Number of variables4
Number of observations35
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory35.8 B

Variable types

Text4

Dataset

Description국내외 외국인을 대상으로 한국 전반에 대한 홍보 콘텐츠(교육, 행사 프로그램 등)를 서비스하고 있는 재외한국문화원 홈페이지에 대한 정보 제공(문화원 이름, 제공 언어, URL, 문화원 주소)
URLhttps://www.data.go.kr/data/3057781/fileData.do

Alerts

문화원명 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:06:12.096335
Analysis finished2023-12-12 06:06:12.473400
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

문화원명
Text

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-12T15:06:12.622148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length3.3428571
Min length2

Characters and Unicode

Total characters117
Distinct characters75
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row이집트
2nd row베트남
3rd row아르헨티나
4th row러시아
5th row태국
ValueCountFrequency (%)
이집트 1
 
2.9%
이탈리아 1
 
2.9%
홍콩 1
 
2.9%
워싱턴 1
 
2.9%
남아프리카공화국 1
 
2.9%
프랑스 1
 
2.9%
북경 1
 
2.9%
상해 1
 
2.9%
캐나다 1
 
2.9%
오사카 1
 
2.9%
Other values (25) 25
71.4%
2023-12-12T15:06:13.012519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8
 
6.8%
7
 
6.0%
5
 
4.3%
4
 
3.4%
4
 
3.4%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
Other values (65) 74
63.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 115
98.3%
Uppercase Letter 2
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
7.0%
7
 
6.1%
5
 
4.3%
4
 
3.5%
4
 
3.5%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
Other values (63) 72
62.6%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
A 1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 115
98.3%
Latin 2
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
7.0%
7
 
6.1%
5
 
4.3%
4
 
3.5%
4
 
3.5%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
Other values (63) 72
62.6%
Latin
ValueCountFrequency (%)
L 1
50.0%
A 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 115
98.3%
ASCII 2
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8
 
7.0%
7
 
6.1%
5
 
4.3%
4
 
3.5%
4
 
3.5%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
3
 
2.6%
Other values (63) 72
62.6%
ASCII
ValueCountFrequency (%)
L 1
50.0%
A 1
50.0%
Distinct19
Distinct (%)54.3%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-12T15:06:13.186824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length12
Mean length8.7142857
Min length7

Characters and Unicode

Total characters305
Distinct characters42
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)37.1%

Sample

1st row한국어, 아랍어
2nd row한국어, 베트남어
3rd row한국어, 스페인어
4th row한국어, 러시아어
5th row한국어, 태국어
ValueCountFrequency (%)
한국어 35
47.3%
영어 14
 
18.9%
스페인어 3
 
4.1%
독일어 2
 
2.7%
러시아어 2
 
2.7%
일본어 2
 
2.7%
중국어 2
 
2.7%
프랑스어 2
 
2.7%
아랍어 2
 
2.7%
베트남어 1
 
1.4%
Other values (9) 9
 
12.2%
2023-12-12T15:06:13.494649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
74
24.3%
, 39
12.8%
39
12.8%
38
12.5%
35
11.5%
14
 
4.6%
6
 
2.0%
5
 
1.6%
4
 
1.3%
4
 
1.3%
Other values (32) 47
15.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 227
74.4%
Other Punctuation 39
 
12.8%
Space Separator 39
 
12.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
74
32.6%
38
16.7%
35
15.4%
14
 
6.2%
6
 
2.6%
5
 
2.2%
4
 
1.8%
4
 
1.8%
3
 
1.3%
3
 
1.3%
Other values (30) 41
18.1%
Other Punctuation
ValueCountFrequency (%)
, 39
100.0%
Space Separator
ValueCountFrequency (%)
39
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 227
74.4%
Common 78
 
25.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
74
32.6%
38
16.7%
35
15.4%
14
 
6.2%
6
 
2.6%
5
 
2.2%
4
 
1.8%
4
 
1.8%
3
 
1.3%
3
 
1.3%
Other values (30) 41
18.1%
Common
ValueCountFrequency (%)
, 39
50.0%
39
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 227
74.4%
ASCII 78
 
25.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
74
32.6%
38
16.7%
35
15.4%
14
 
6.2%
6
 
2.6%
5
 
2.2%
4
 
1.8%
4
 
1.8%
3
 
1.3%
3
 
1.3%
Other values (30) 41
18.1%
ASCII
ValueCountFrequency (%)
, 39
50.0%
39
50.0%
Distinct34
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-12T15:06:13.736113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length26
Mean length21.828571
Min length12

Characters and Unicode

Total characters764
Distinct characters26
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)94.3%

Sample

1st rowegypt.korean-culture.org
2nd rowvietnam.korean-culture.org
3rd rowargentina.korean-culture.org
4th rowrussia.korean-culture.org
5th rowthailand.korean-culture.org
ValueCountFrequency (%)
c.kocenter.cn 2
 
5.7%
vietnam.korean-culture.org 1
 
2.9%
vienna.korean-culture.org 1
 
2.9%
www.koreanculture.org 1
 
2.9%
www.kccla.org 1
 
2.9%
www.kulturkorea.org 1
 
2.9%
kccuk.org.uk 1
 
2.9%
www.koreanculture.org.au 1
 
2.9%
www.koreanculture.jp 1
 
2.9%
phil.korean-culture.org 1
 
2.9%
Other values (24) 24
68.6%
2023-12-12T15:06:14.119302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
r 100
13.1%
e 74
9.7%
. 71
9.3%
u 69
 
9.0%
o 64
 
8.4%
a 49
 
6.4%
c 44
 
5.8%
n 42
 
5.5%
k 39
 
5.1%
t 39
 
5.1%
Other values (16) 173
22.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 667
87.3%
Other Punctuation 71
 
9.3%
Dash Punctuation 26
 
3.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 100
15.0%
e 74
11.1%
u 69
10.3%
o 64
9.6%
a 49
7.3%
c 44
6.6%
n 42
 
6.3%
k 39
 
5.8%
t 39
 
5.8%
l 38
 
5.7%
Other values (14) 109
16.3%
Other Punctuation
ValueCountFrequency (%)
. 71
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 667
87.3%
Common 97
 
12.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
r 100
15.0%
e 74
11.1%
u 69
10.3%
o 64
9.6%
a 49
7.3%
c 44
6.6%
n 42
 
6.3%
k 39
 
5.8%
t 39
 
5.8%
l 38
 
5.7%
Other values (14) 109
16.3%
Common
ValueCountFrequency (%)
. 71
73.2%
- 26
 
26.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 764
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r 100
13.1%
e 74
9.7%
. 71
9.3%
u 69
 
9.0%
o 64
 
8.4%
a 49
 
6.4%
c 44
 
5.8%
n 42
 
5.5%
k 39
 
5.1%
t 39
 
5.1%
Other values (16) 173
22.6%

주소
Text

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-12T15:06:14.513830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length101
Median length61
Mean length54.971429
Min length29

Characters and Unicode

Total characters1924
Distinct characters73
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row8 Boulus Hanna St., Dokki, Cairo, Egypt
2nd row49 Nguyen Du Street, Hai Ba Trung District, Hanoi, Vietnam
3rd rowAv. Maipu 972 C1006ACN, Buenos Aires, Argentina
4th rowMoscow, Arbat St.24 (3rd, 4th floor), 119002
5th row219/2 ( Sukhumvit 15 - 17 )Sukhumvit Road, Klongteoy-Nua, Wattana, Bangkok 10110 Thailand
ValueCountFrequency (%)
street 7
 
2.4%
road 6
 
2.0%
floor 5
 
1.7%
de 3
 
1.0%
district 3
 
1.0%
la 3
 
1.0%
2
 
0.7%
13 2
 
0.7%
rue 2
 
0.7%
sukhumvit 2
 
0.7%
Other values (249) 262
88.2%
2023-12-12T15:06:15.357996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
263
 
13.7%
a 136
 
7.1%
, 99
 
5.1%
e 99
 
5.1%
t 91
 
4.7%
o 87
 
4.5%
i 86
 
4.5%
n 84
 
4.4%
r 76
 
4.0%
l 53
 
2.8%
Other values (63) 850
44.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1055
54.8%
Space Separator 263
 
13.7%
Uppercase Letter 238
 
12.4%
Decimal Number 210
 
10.9%
Other Punctuation 134
 
7.0%
Dash Punctuation 9
 
0.5%
Open Punctuation 5
 
0.3%
Math Symbol 5
 
0.3%
Close Punctuation 5
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 136
12.9%
e 99
9.4%
t 91
 
8.6%
o 87
 
8.2%
i 86
 
8.2%
n 84
 
8.0%
r 76
 
7.2%
l 53
 
5.0%
u 52
 
4.9%
s 51
 
4.8%
Other values (18) 240
22.7%
Uppercase Letter
ValueCountFrequency (%)
S 25
 
10.5%
B 24
 
10.1%
C 18
 
7.6%
N 18
 
7.6%
A 17
 
7.1%
P 17
 
7.1%
R 12
 
5.0%
F 10
 
4.2%
L 10
 
4.2%
D 9
 
3.8%
Other values (15) 78
32.8%
Decimal Number
ValueCountFrequency (%)
0 49
23.3%
1 40
19.0%
2 33
15.7%
3 19
 
9.0%
4 17
 
8.1%
5 14
 
6.7%
7 11
 
5.2%
6 11
 
5.2%
9 9
 
4.3%
8 7
 
3.3%
Other Punctuation
ValueCountFrequency (%)
, 99
73.9%
. 22
 
16.4%
? 9
 
6.7%
/ 2
 
1.5%
" 2
 
1.5%
Space Separator
ValueCountFrequency (%)
263
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1293
67.2%
Common 631
32.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 136
 
10.5%
e 99
 
7.7%
t 91
 
7.0%
o 87
 
6.7%
i 86
 
6.7%
n 84
 
6.5%
r 76
 
5.9%
l 53
 
4.1%
u 52
 
4.0%
s 51
 
3.9%
Other values (43) 478
37.0%
Common
ValueCountFrequency (%)
263
41.7%
, 99
 
15.7%
0 49
 
7.8%
1 40
 
6.3%
2 33
 
5.2%
. 22
 
3.5%
3 19
 
3.0%
4 17
 
2.7%
5 14
 
2.2%
7 11
 
1.7%
Other values (10) 64
 
10.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1922
99.9%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
263
 
13.7%
a 136
 
7.1%
, 99
 
5.2%
e 99
 
5.2%
t 91
 
4.7%
o 87
 
4.5%
i 86
 
4.5%
n 84
 
4.4%
r 76
 
4.0%
l 53
 
2.8%
Other values (61) 848
44.1%
None
ValueCountFrequency (%)
ı 1
50.0%
ß 1
50.0%

Correlations

2023-12-12T15:06:15.491433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
문화원명홈페이지 언어홈페이지주소주소
문화원명1.0001.0001.0001.000
홈페이지 언어1.0001.0001.0001.000
홈페이지주소1.0001.0001.0001.000
주소1.0001.0001.0001.000

Missing values

2023-12-12T15:06:12.345565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:06:12.440136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

문화원명홈페이지 언어홈페이지주소주소
0이집트한국어, 아랍어egypt.korean-culture.org8 Boulus Hanna St., Dokki, Cairo, Egypt
1베트남한국어, 베트남어vietnam.korean-culture.org49 Nguyen Du Street, Hai Ba Trung District, Hanoi, Vietnam
2아르헨티나한국어, 스페인어argentina.korean-culture.orgAv. Maipu 972 C1006ACN, Buenos Aires, Argentina
3러시아한국어, 러시아어russia.korean-culture.orgMoscow, Arbat St.24 (3rd, 4th floor), 119002
4태국한국어, 태국어thailand.korean-culture.org219/2 ( Sukhumvit 15 - 17 )Sukhumvit Road, Klongteoy-Nua, Wattana, Bangkok 10110 Thailand
5폴란드한국어, 폴란드어pl.korean-culture.orgUl. Leona Kruczkowskiego 8(Nordic Park, Parter), 00-380, Warszawa, Poland
6카자흐스탄한국어, 러시아어kaz.korean-culture.orgNur-sultan, Imanov street 13, "Nursaulet-2" business center, 1st floor
7나이지리아한국어, 영어ngr.korean-culture.orgRivers State Building, 2nd Floor, Plot 83 Ralph Shodeinde Street, Central Business District, Abuja
8인도네시아한국어, 인도네시아어id.korean-culture.orgEquity Tower 17th Fl. Jl.Jend.Sudirman, SCBD, Lot 9, Jakarta, 12190
9필리핀한국어, 영어phil.korean-culture.org59 Bayani Road, Fort Bonifacio, Taguig City, Metro Manila 1630
문화원명홈페이지 언어홈페이지주소주소
25상해한국어, 중국어c.kocenter.cn2,3F, Huizhi Building, No.396, North Caoxi Road, Shanghai (200030)
26오사카한국어, 일본어www.k-culture.jp4th FL. Mindan Bldg. 2~4~2 Nakazaki, Kita~ku, Osaka, Japan
27동경한국어, 일본어www.koreanculture.jp4~4~10 Yotsuya. Shinjuku, Tokyo
28호주한국어, 영어www.koreanculture.org.auGround Floor, 255 Elizabeth Street, Sydney, 2000
29영국한국어, 영어kccuk.org.ukGrand Buildings, 1-3 Strand, London WC2N 5BW, United Kingdom
30독일한국어, 독일어www.kulturkorea.orgLeipziger Platz 3, 10117 Berlin
31LA한국어, 영어www.kccla.org5505 Wilshire Blvd. Los Angeles, CA 90036
32뉴욕한국어, 영어www.koreanculture.org460 Park Avenue, 6th Floor (at 57th Street) New York, NY 10022
33오스트리아한국어, 독일어vienna.korean-culture.orgK?rntner Straße 43, 1010 Wien, ?sterreich
34스웨덴한국어, 영어sweden.korean-culture.orgKungsholmsgatan 23, 112 27 Stockholm, Sweden