Overview

Dataset statistics

Number of variables6
Number of observations34
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory51.9 B

Variable types

Categorical5
Text1

Dataset

Description재외국민의 교육지원 등에 관한 법률 제8조에 의거하여 재외한국학교의 교육과정은 초·중등교육법 제23조의 규정에 따라 교육부장관이 정하는 교육과정에 준하여 편성하고 있는바, 전 세계 재외한국학교(34개교)에서 운영 중인 교육과정 현황을 제공하고자 합니다.
URLhttps://www.data.go.kr/data/15102661/fileData.do

Alerts

유치원 is highly overall correlated with 국가 and 3 other fieldsHigh correlation
초등학교 is highly overall correlated with 국가 and 3 other fieldsHigh correlation
고등학교 is highly overall correlated with 국가 and 3 other fieldsHigh correlation
국가 is highly overall correlated with 유치원 and 3 other fieldsHigh correlation
중학교 is highly overall correlated with 국가 and 3 other fieldsHigh correlation
초등학교 is highly imbalanced (80.9%)Imbalance
학교명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:34:51.782424
Analysis finished2023-12-12 13:34:52.215864
Duration0.43 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

국가
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)47.1%
Missing0
Missing (%)0.0%
Memory size404.0 B
중국
13 
일본
대만
베트남
사우디
Other values (11)
11 

Length

Max length5
Median length2
Mean length2.6470588
Min length2

Unique

Unique11 ?
Unique (%)32.4%

Sample

1st row일본
2nd row일본
3rd row일본
4th row일본
5th row중국

Common Values

ValueCountFrequency (%)
중국 13
38.2%
일본 4
 
11.8%
대만 2
 
5.9%
베트남 2
 
5.9%
사우디 2
 
5.9%
인도네시아 1
 
2.9%
싱가포르 1
 
2.9%
태국 1
 
2.9%
필리핀 1
 
2.9%
파라과이 1
 
2.9%
Other values (6) 6
17.6%

Length

2023-12-12T22:34:52.301742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중국 13
38.2%
일본 4
 
11.8%
대만 2
 
5.9%
베트남 2
 
5.9%
사우디 2
 
5.9%
인도네시아 1
 
2.9%
싱가포르 1
 
2.9%
태국 1
 
2.9%
필리핀 1
 
2.9%
파라과이 1
 
2.9%
Other values (6) 6
17.6%

학교명
Text

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
2023-12-12T22:34:52.545763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length7.9117647
Min length6

Characters and Unicode

Total characters269
Distinct characters84
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row동경한국학교
2nd row교토국제학교
3rd row오사카금강학교
4th row건국한국학교
5th row북경한국국제학교
ValueCountFrequency (%)
동경한국학교 1
 
2.9%
교토국제학교 1
 
2.9%
말레이시아한국국제학교 1
 
2.9%
카이로한국학교 1
 
2.9%
테헤란한국학교 1
 
2.9%
모스크바한국학교 1
 
2.9%
아르헨티나한국학교 1
 
2.9%
파라과이한국학교 1
 
2.9%
필리핀한국국제학교 1
 
2.9%
홍콩한국국제학교 1
 
2.9%
Other values (24) 24
70.6%
2023-12-12T22:34:52.923376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
50
18.6%
35
13.0%
34
12.6%
32
 
11.9%
17
 
6.3%
9
 
3.3%
3
 
1.1%
3
 
1.1%
3
 
1.1%
3
 
1.1%
Other values (74) 80
29.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 269
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
18.6%
35
13.0%
34
12.6%
32
 
11.9%
17
 
6.3%
9
 
3.3%
3
 
1.1%
3
 
1.1%
3
 
1.1%
3
 
1.1%
Other values (74) 80
29.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 269
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
18.6%
35
13.0%
34
12.6%
32
 
11.9%
17
 
6.3%
9
 
3.3%
3
 
1.1%
3
 
1.1%
3
 
1.1%
3
 
1.1%
Other values (74) 80
29.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 269
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
50
18.6%
35
13.0%
34
12.6%
32
 
11.9%
17
 
6.3%
9
 
3.3%
3
 
1.1%
3
 
1.1%
3
 
1.1%
3
 
1.1%
Other values (74) 80
29.7%

유치원
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
O
19 
<NA>
15 

Length

Max length4
Median length1
Mean length2.3235294
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th rowO
5th rowO

Common Values

ValueCountFrequency (%)
O 19
55.9%
<NA> 15
44.1%

Length

2023-12-12T22:34:53.110753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:34:53.250953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
o 19
55.9%
na 15
44.1%

초등학교
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
O
33 
<NA>
 
1

Length

Max length4
Median length1
Mean length1.0882353
Min length1

Unique

Unique1 ?
Unique (%)2.9%

Sample

1st rowO
2nd row<NA>
3rd rowO
4th rowO
5th rowO

Common Values

ValueCountFrequency (%)
O 33
97.1%
<NA> 1
 
2.9%

Length

2023-12-12T22:34:53.387084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:34:53.516477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
o 33
97.1%
na 1
 
2.9%

중학교
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
O
23 
<NA>
11 

Length

Max length4
Median length1
Mean length1.9705882
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowO
2nd rowO
3rd rowO
4th rowO
5th rowO

Common Values

ValueCountFrequency (%)
O 23
67.6%
<NA> 11
32.4%

Length

2023-12-12T22:34:53.627566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:34:53.728935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
o 23
67.6%
na 11
32.4%

고등학교
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
O
23 
<NA>
11 

Length

Max length4
Median length1
Mean length1.9705882
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowO
2nd rowO
3rd rowO
4th rowO
5th rowO

Common Values

ValueCountFrequency (%)
O 23
67.6%
<NA> 11
32.4%

Length

2023-12-12T22:34:53.851400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:34:53.946465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
o 23
67.6%
na 11
32.4%

Correlations

2023-12-12T22:34:54.004282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국가학교명
국가1.0001.000
학교명1.0001.000
2023-12-12T22:34:54.091822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유치원초등학교고등학교국가중학교
유치원1.0001.0001.0001.0001.000
초등학교1.0001.0001.0001.0001.000
고등학교1.0001.0001.0001.0001.000
국가1.0001.0001.0001.0001.000
중학교1.0001.0001.0001.0001.000
2023-12-12T22:34:54.222477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국가유치원초등학교중학교고등학교
국가1.0001.0001.0001.0001.000
유치원1.0001.0001.0001.0001.000
초등학교1.0001.0001.0001.0001.000
중학교1.0001.0001.0001.0001.000
고등학교1.0001.0001.0001.0001.000

Missing values

2023-12-12T22:34:52.038874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:34:52.166665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

국가학교명유치원초등학교중학교고등학교
0일본동경한국학교<NA>OOO
1일본교토국제학교<NA><NA>OO
2일본오사카금강학교<NA>OOO
3일본건국한국학교OOOO
4중국북경한국국제학교OOOO
5중국천진한국국제학교OOOO
6중국상해한국학교<NA>OOO
7중국무석한국학교OOOO
8중국소주한국학교OOOO
9중국홍콩한국국제학교OOOO
국가학교명유치원초등학교중학교고등학교
24싱가포르싱가포르한국국제학교OOOO
25태국방콕한국국제학교<NA>OOO
26필리핀필리핀한국국제학교OOOO
27파라과이파라과이한국학교OO<NA><NA>
28아르헨티나아르헨티나한국학교OO<NA><NA>
29러시아모스크바한국학교OO<NA><NA>
30이란테헤란한국학교<NA>O<NA><NA>
31이집트카이로한국학교<NA>O<NA><NA>
32말레이시아말레이시아한국국제학교OO<NA><NA>
33캄보디아프놈펜한국국제학교<NA>O<NA><NA>