Overview

Dataset statistics

Number of variables3
Number of observations46
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory27.9 B

Variable types

Numeric1
Categorical1
Text1

Dataset

Description한국연구재단이 보유하고 있는 '외국교육기관 및 외국인학교종합안내' 시스템 내의 '학교검색 리스트 목록' 데이터 입니다. 학교ID, 지역명, 학교명 의 컬럼이 있습니다
URLhttps://www.data.go.kr/data/15117893/fileData.do

Alerts

학교ID is highly overall correlated with 지역명High correlation
지역명 is highly overall correlated with 학교IDHigh correlation
학교명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:29:09.945744
Analysis finished2023-12-12 06:29:10.365567
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

학교ID
Real number (ℝ)

HIGH CORRELATION 

Distinct45
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.195652
Minimum1
Maximum82
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size546.0 B
2023-12-12T15:29:10.456267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.25
Q115.25
median26.5
Q343
95-th percentile77.5
Maximum82
Range81
Interquartile range (IQR)27.75

Descriptive statistics

Standard deviation21.833328
Coefficient of variation (CV)0.69988368
Kurtosis0.0015942145
Mean31.195652
Median Absolute Deviation (MAD)13
Skewness0.86506561
Sum1435
Variance476.6942
MonotonicityNot monotonic
2023-12-12T15:29:10.639872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
31 2
 
4.3%
1 1
 
2.2%
30 1
 
2.2%
22 1
 
2.2%
23 1
 
2.2%
24 1
 
2.2%
25 1
 
2.2%
26 1
 
2.2%
27 1
 
2.2%
28 1
 
2.2%
Other values (35) 35
76.1%
ValueCountFrequency (%)
1 1
2.2%
3 1
2.2%
4 1
2.2%
5 1
2.2%
6 1
2.2%
8 1
2.2%
10 1
2.2%
11 1
2.2%
12 1
2.2%
13 1
2.2%
ValueCountFrequency (%)
82 1
2.2%
81 1
2.2%
78 1
2.2%
76 1
2.2%
62 1
2.2%
59 1
2.2%
58 1
2.2%
57 1
2.2%
56 1
2.2%
47 1
2.2%

지역명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size500.0 B
서울
19 
전국_외국인학교
15 
경기
전국_외국교육기관

Length

Max length9
Median length2
Mean length4.8695652
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울
2nd row서울
3rd row서울
4th row서울
5th row서울

Common Values

ValueCountFrequency (%)
서울 19
41.3%
전국_외국인학교 15
32.6%
경기 6
 
13.0%
전국_외국교육기관 6
 
13.0%

Length

2023-12-12T15:29:10.804775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:29:10.909763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울 19
41.3%
전국_외국인학교 15
32.6%
경기 6
 
13.0%
전국_외국교육기관 6
 
13.0%

학교명
Text

UNIQUE 

Distinct46
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size500.0 B
2023-12-12T15:29:11.135984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length23
Mean length19.065217
Min length14

Characters and Unicode

Total characters877
Distinct characters125
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)100.0%

Sample

1st row1. 한국외국인학교-서울캠퍼스(강남구)
2nd row2. 한국켄트외국인학교(광진구)
3rd row3. 아시아퍼시픽국제외국인학교(노원구)
4th row4. 서울외국인학교(서대문구)
5th row5. 코리아외국인학교(서초구)
ValueCountFrequency (%)
부산 5
 
3.5%
연수구 5
 
3.5%
4 4
 
2.8%
2 4
 
2.8%
6 4
 
2.8%
5 4
 
2.8%
1 4
 
2.8%
3 4
 
2.8%
동구 3
 
2.1%
13 2
 
1.4%
Other values (89) 102
72.3%
2023-12-12T15:29:11.499531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
95
 
10.8%
55
 
6.3%
( 46
 
5.2%
) 46
 
5.2%
. 46
 
5.2%
44
 
5.0%
43
 
4.9%
42
 
4.8%
29
 
3.3%
1 22
 
2.5%
Other values (115) 409
46.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 578
65.9%
Space Separator 95
 
10.8%
Decimal Number 62
 
7.1%
Other Punctuation 49
 
5.6%
Open Punctuation 46
 
5.2%
Close Punctuation 46
 
5.2%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
55
 
9.5%
44
 
7.6%
43
 
7.4%
42
 
7.3%
29
 
5.0%
19
 
3.3%
18
 
3.1%
16
 
2.8%
16
 
2.8%
14
 
2.4%
Other values (98) 282
48.8%
Decimal Number
ValueCountFrequency (%)
1 22
35.5%
4 6
 
9.7%
2 6
 
9.7%
5 6
 
9.7%
3 6
 
9.7%
6 5
 
8.1%
9 3
 
4.8%
8 3
 
4.8%
7 3
 
4.8%
0 2
 
3.2%
Other Punctuation
ValueCountFrequency (%)
. 46
93.9%
· 2
 
4.1%
, 1
 
2.0%
Space Separator
ValueCountFrequency (%)
95
100.0%
Open Punctuation
ValueCountFrequency (%)
( 46
100.0%
Close Punctuation
ValueCountFrequency (%)
) 46
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 578
65.9%
Common 299
34.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
55
 
9.5%
44
 
7.6%
43
 
7.4%
42
 
7.3%
29
 
5.0%
19
 
3.3%
18
 
3.1%
16
 
2.8%
16
 
2.8%
14
 
2.4%
Other values (98) 282
48.8%
Common
ValueCountFrequency (%)
95
31.8%
( 46
15.4%
) 46
15.4%
. 46
15.4%
1 22
 
7.4%
4 6
 
2.0%
2 6
 
2.0%
5 6
 
2.0%
3 6
 
2.0%
6 5
 
1.7%
Other values (7) 15
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 578
65.9%
ASCII 297
33.9%
None 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
95
32.0%
( 46
15.5%
) 46
15.5%
. 46
15.5%
1 22
 
7.4%
4 6
 
2.0%
2 6
 
2.0%
5 6
 
2.0%
3 6
 
2.0%
6 5
 
1.7%
Other values (6) 13
 
4.4%
Hangul
ValueCountFrequency (%)
55
 
9.5%
44
 
7.6%
43
 
7.4%
42
 
7.3%
29
 
5.0%
19
 
3.3%
18
 
3.1%
16
 
2.8%
16
 
2.8%
14
 
2.4%
Other values (98) 282
48.8%
None
ValueCountFrequency (%)
· 2
100.0%

Interactions

2023-12-12T15:29:10.103469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:29:11.595344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학교ID지역명학교명
학교ID1.0000.8681.000
지역명0.8681.0001.000
학교명1.0001.0001.000
2023-12-12T15:29:11.695716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학교ID지역명
학교ID1.0000.718
지역명0.7181.000

Missing values

2023-12-12T15:29:10.234973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:29:10.324675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

학교ID지역명학교명
01서울1. 한국외국인학교-서울캠퍼스(강남구)
13서울2. 한국켄트외국인학교(광진구)
24서울3. 아시아퍼시픽국제외국인학교(노원구)
35서울4. 서울외국인학교(서대문구)
46서울5. 코리아외국인학교(서초구)
58서울6. 지구촌기독외국인학교(용산구)
610서울7. 서울용산국제학교(용산구)
711서울8. 프란치스코 외국인 유치원(용산구)
812서울9. 남산국제유치원(중구)
913서울10. 한국영등포화교소학교(영등포구)
학교ID지역명학교명
3640전국_외국인학교12. 원주화교소학교 (강원 원주)
3745전국_외국인학교13. 경남국제외국인학교 (경남 사천)
3844전국_외국인학교14. 애서튼국제외국인학교 (경남 거제)
3947전국_외국인학교15. 청라달튼외국인학교 (인천 서구)
4058전국_외국교육기관1. 대구국제학교(대구 동구)
4159전국_외국교육기관2. 채드윅송도국제학교(인천 연수구)
4278전국_외국교육기관3. 한국조지메이슨대학교 송도캠퍼스(인천 연수구)
4381전국_외국교육기관4. 유타 대학교 아시아 캠퍼스(인천 연수구)
4462전국_외국교육기관5. 한국뉴욕주립대학교(인천 연수구)
4582전국_외국교육기관6. 겐트대학교 글로벌캠퍼스(인천 연수구)