Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 160 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 11.2 KiB |
Average record size in memory | 71.8 B |
Variable types
Text | 1 |
---|---|
Numeric | 6 |
Categorical | 1 |
Dataset
Description | 국외인적자원관리시스템 내 정부초청외국인장학생의 국가별, 과정별 현황※ 시스템에 등록된 데이터 기준으로 실제 사업부서의 보유자료와 일부 차이가 있을 수 있음 |
---|---|
Author | 교육부 국립국제교육원 |
URL | https://www.data.go.kr/data/15052777/fileData.do |
학사 is highly overall correlated with 석사 and 2 other fields | High correlation |
석사 is highly overall correlated with 학사 and 2 other fields | High correlation |
박사 is highly overall correlated with 학사 and 3 other fields | High correlation |
연구 is highly overall correlated with 박사 | High correlation |
기타 is highly overall correlated with 학사 and 2 other fields | High correlation |
석박사 is highly imbalanced (75.6%) | Imbalance |
구분 has unique values | Unique |
학사 has 75 (46.9%) zeros | Zeros |
석사 has 7 (4.4%) zeros | Zeros |
박사 has 34 (21.2%) zeros | Zeros |
연구 has 108 (67.5%) zeros | Zeros |
연수 has 131 (81.9%) zeros | Zeros |
기타 has 16 (10.0%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 04:54:05.199623 |
---|---|
Analysis finished | 2023-12-12 04:54:09.193334 |
Duration | 3.99 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
구분
Text
UNIQUE
 
Distinct | 160 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
Value | Count | Frequency (%) |
기니 | 2 | 1.2% |
가나 | 1 | 0.6% |
중국 | 1 | 0.6% |
우루과이 | 1 | 0.6% |
우즈베키스탄 | 1 | 0.6% |
우크라이나 | 1 | 0.6% |
유고슬라비아 | 1 | 0.6% |
이라크 | 1 | 0.6% |
이란 | 1 | 0.6% |
이스라엘 | 1 | 0.6% |
Other values (152) | 152 |
Most occurring characters
Value | Count | Frequency (%) |
아 | 48 | 7.9% |
스 | 28 | 4.6% |
리 | 24 | 3.9% |
르 | 20 | 3.3% |
니 | 20 | 3.3% |
이 | 19 | 3.1% |
라 | 19 | 3.1% |
비 | 13 | 2.1% |
나 | 13 | 2.1% |
바 | 12 | 2.0% |
Other values (145) | 395 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 607 | |
Space Separator | 3 | 0.5% |
Other Punctuation | 1 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 48 | 7.9% |
스 | 28 | 4.6% |
리 | 24 | 4.0% |
르 | 20 | 3.3% |
니 | 20 | 3.3% |
이 | 19 | 3.1% |
라 | 19 | 3.1% |
비 | 13 | 2.1% |
나 | 13 | 2.1% |
바 | 12 | 2.0% |
Other values (143) | 391 |
Space Separator
Value | Count | Frequency (%) |
3 |
Other Punctuation
Value | Count | Frequency (%) |
· | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 607 | |
Common | 4 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 48 | 7.9% |
스 | 28 | 4.6% |
리 | 24 | 4.0% |
르 | 20 | 3.3% |
니 | 20 | 3.3% |
이 | 19 | 3.1% |
라 | 19 | 3.1% |
비 | 13 | 2.1% |
나 | 13 | 2.1% |
바 | 12 | 2.0% |
Other values (143) | 391 |
Common
Value | Count | Frequency (%) |
3 | ||
· | 1 | 25.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 607 | |
ASCII | 3 | 0.5% |
None | 1 | 0.2% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 48 | 7.9% |
스 | 28 | 4.6% |
리 | 24 | 4.0% |
르 | 20 | 3.3% |
니 | 20 | 3.3% |
이 | 19 | 3.1% |
라 | 19 | 3.1% |
비 | 13 | 2.1% |
나 | 13 | 2.1% |
바 | 12 | 2.0% |
Other values (143) | 391 |
ASCII
Value | Count | Frequency (%) |
3 |
None
Value | Count | Frequency (%) |
· | 1 |
학사
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 45 |
---|---|
Distinct (%) | 28.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.925 |
Minimum | 0 |
---|---|
Maximum | 128 |
Zeros | 75 |
Zeros (%) | 46.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 21.25 |
95-th percentile | 62.2 |
Maximum | 128 |
Range | 128 |
Interquartile range (IQR) | 21.25 |
Descriptive statistics
Standard deviation | 23.038265 |
---|---|
Coefficient of variation (CV) | 1.6544535 |
Kurtosis | 7.0395443 |
Mean | 13.925 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.4607369 |
Sum | 2228 |
Variance | 530.76164 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 75 | |
1 | 7 | 4.4% |
2 | 5 | 3.1% |
26 | 4 | 2.5% |
19 | 3 | 1.9% |
15 | 3 | 1.9% |
30 | 3 | 1.9% |
36 | 3 | 1.9% |
4 | 3 | 1.9% |
43 | 2 | 1.2% |
Other values (35) | 52 |
Value | Count | Frequency (%) |
0 | 75 | |
1 | 7 | 4.4% |
2 | 5 | 3.1% |
3 | 2 | 1.2% |
4 | 3 | 1.9% |
5 | 2 | 1.2% |
6 | 2 | 1.2% |
8 | 2 | 1.2% |
9 | 2 | 1.2% |
10 | 2 | 1.2% |
Value | Count | Frequency (%) |
128 | 1 | |
111 | 1 | |
104 | 1 | |
93 | 1 | |
83 | 1 | |
75 | 1 | |
67 | 1 | |
66 | 1 | |
62 | 1 | |
60 | 1 |
석사
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 81 |
---|---|
Distinct (%) | 50.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 52.55625 |
Minimum | 0 |
---|---|
Maximum | 413 |
Zeros | 7 |
Zeros (%) | 4.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 9 |
median | 20.5 |
Q3 | 58 |
95-th percentile | 239.8 |
Maximum | 413 |
Range | 413 |
Interquartile range (IQR) | 49 |
Descriptive statistics
Standard deviation | 79.398267 |
---|---|
Coefficient of variation (CV) | 1.5107293 |
Kurtosis | 6.3725476 |
Mean | 52.55625 |
Median Absolute Deviation (MAD) | 15.5 |
Skewness | 2.5367698 |
Sum | 8409 |
Variance | 6304.0849 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 7 | 4.4% |
4 | 6 | 3.8% |
11 | 6 | 3.8% |
10 | 6 | 3.8% |
16 | 6 | 3.8% |
7 | 5 | 3.1% |
18 | 5 | 3.1% |
2 | 5 | 3.1% |
26 | 5 | 3.1% |
12 | 5 | 3.1% |
Other values (71) | 104 |
Value | Count | Frequency (%) |
0 | 7 | |
1 | 3 | |
2 | 5 | |
3 | 1 | 0.6% |
4 | 6 | |
5 | 3 | |
6 | 4 | |
7 | 5 | |
8 | 3 | |
9 | 5 |
Value | Count | Frequency (%) |
413 | 1 | |
389 | 1 | |
355 | 1 | |
297 | 1 | |
285 | 1 | |
274 | 1 | |
266 | 1 | |
255 | 1 | |
239 | 1 | |
238 | 1 |
박사
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 44 |
---|---|
Distinct (%) | 27.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.74375 |
Minimum | 0 |
---|---|
Maximum | 206 |
Zeros | 34 |
Zeros (%) | 21.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 3 |
Q3 | 15 |
95-th percentile | 69.05 |
Maximum | 206 |
Range | 206 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 27.871052 |
---|---|
Coefficient of variation (CV) | 2.0279074 |
Kurtosis | 18.197645 |
Mean | 13.74375 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 3.8365849 |
Sum | 2199 |
Variance | 776.79556 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 34 | |
1 | 22 | |
2 | 18 | 11.2% |
3 | 17 | 10.6% |
4 | 5 | 3.1% |
6 | 4 | 2.5% |
8 | 4 | 2.5% |
15 | 4 | 2.5% |
17 | 3 | 1.9% |
9 | 3 | 1.9% |
Other values (34) | 46 |
Value | Count | Frequency (%) |
0 | 34 | |
1 | 22 | |
2 | 18 | |
3 | 17 | |
4 | 5 | 3.1% |
5 | 2 | 1.2% |
6 | 4 | 2.5% |
7 | 1 | 0.6% |
8 | 4 | 2.5% |
9 | 3 | 1.9% |
Value | Count | Frequency (%) |
206 | 1 | |
134 | 1 | |
124 | 1 | |
111 | 1 | |
103 | 1 | |
90 | 1 | |
80 | 1 | |
70 | 1 | |
69 | 1 | |
65 | 1 |
석박사
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
0 | |
---|---|
1 | 10 |
2 | 2 |
4 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.6% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 147 | |
1 | 10 | 6.2% |
2 | 2 | 1.2% |
4 | 1 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 147 | |
1 | 10 | 6.2% |
2 | 2 | 1.2% |
4 | 1 | 0.6% |
연구
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 11 |
---|---|
Distinct (%) | 6.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1 |
Minimum | 0 |
---|---|
Maximum | 17 |
Zeros | 108 |
Zeros (%) | 67.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 5 |
Maximum | 17 |
Range | 17 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 2.452056 |
---|---|
Coefficient of variation (CV) | 2.452056 |
Kurtosis | 20.711369 |
Mean | 1 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.1782162 |
Sum | 160 |
Variance | 6.0125786 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 108 | |
1 | 25 | 15.6% |
2 | 7 | 4.4% |
4 | 6 | 3.8% |
3 | 5 | 3.1% |
5 | 3 | 1.9% |
8 | 2 | 1.2% |
6 | 1 | 0.6% |
15 | 1 | 0.6% |
13 | 1 | 0.6% |
Value | Count | Frequency (%) |
0 | 108 | |
1 | 25 | 15.6% |
2 | 7 | 4.4% |
3 | 5 | 3.1% |
4 | 6 | 3.8% |
5 | 3 | 1.9% |
6 | 1 | 0.6% |
8 | 2 | 1.2% |
13 | 1 | 0.6% |
15 | 1 | 0.6% |
Value | Count | Frequency (%) |
17 | 1 | 0.6% |
15 | 1 | 0.6% |
13 | 1 | 0.6% |
8 | 2 | 1.2% |
6 | 1 | 0.6% |
5 | 3 | 1.9% |
4 | 6 | 3.8% |
3 | 5 | 3.1% |
2 | 7 | 4.4% |
1 | 25 |
연수
Real number (ℝ)
ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 3.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.3 |
Minimum | 0 |
---|---|
Maximum | 5 |
Zeros | 131 |
Zeros (%) | 81.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 2 |
Maximum | 5 |
Range | 5 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.75901293 |
---|---|
Coefficient of variation (CV) | 2.5300431 |
Kurtosis | 13.019067 |
Mean | 0.3 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.2806792 |
Sum | 48 |
Variance | 0.57610063 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 131 | |
1 | 16 | 10.0% |
2 | 10 | 6.2% |
4 | 1 | 0.6% |
3 | 1 | 0.6% |
5 | 1 | 0.6% |
Value | Count | Frequency (%) |
0 | 131 | |
1 | 16 | 10.0% |
2 | 10 | 6.2% |
3 | 1 | 0.6% |
4 | 1 | 0.6% |
5 | 1 | 0.6% |
Value | Count | Frequency (%) |
5 | 1 | 0.6% |
4 | 1 | 0.6% |
3 | 1 | 0.6% |
2 | 10 | 6.2% |
1 | 16 | 10.0% |
0 | 131 |
기타
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 44 |
---|---|
Distinct (%) | 27.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.1 |
Minimum | 0 |
---|---|
Maximum | 154 |
Zeros | 16 |
Zeros (%) | 10.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2 |
median | 4 |
Q3 | 13 |
95-th percentile | 58.45 |
Maximum | 154 |
Range | 154 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 21.601945 |
---|---|
Coefficient of variation (CV) | 1.6490034 |
Kurtosis | 13.030563 |
Mean | 13.1 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 3.1530826 |
Sum | 2096 |
Variance | 466.64403 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 22 | |
0 | 16 | 10.0% |
2 | 16 | 10.0% |
3 | 15 | 9.4% |
4 | 12 | 7.5% |
7 | 8 | 5.0% |
11 | 7 | 4.4% |
6 | 7 | 4.4% |
10 | 4 | 2.5% |
5 | 4 | 2.5% |
Other values (34) | 49 |
Value | Count | Frequency (%) |
0 | 16 | |
1 | 22 | |
2 | 16 | |
3 | 15 | |
4 | 12 | |
5 | 4 | 2.5% |
6 | 7 | 4.4% |
7 | 8 | 5.0% |
8 | 4 | 2.5% |
9 | 4 | 2.5% |
Value | Count | Frequency (%) |
154 | 1 | |
96 | 1 | |
80 | 1 | |
74 | 1 | |
72 | 1 | |
71 | 1 | |
69 | 1 | |
67 | 1 | |
58 | 1 | |
56 | 2 |
학사 | 석사 | 박사 | 석박사 | 연구 | 연수 | 기타 | |
---|---|---|---|---|---|---|---|
학사 | 1.000 | 0.954 | 0.804 | 0.644 | 0.635 | 0.668 | 0.801 |
석사 | 0.954 | 1.000 | 0.815 | 0.642 | 0.772 | 0.721 | 0.819 |
박사 | 0.804 | 0.815 | 1.000 | 0.655 | 0.937 | 0.496 | 0.862 |
석박사 | 0.644 | 0.642 | 0.655 | 1.000 | 0.465 | 0.118 | 0.823 |
연구 | 0.635 | 0.772 | 0.937 | 0.465 | 1.000 | 0.707 | 0.832 |
연수 | 0.668 | 0.721 | 0.496 | 0.118 | 0.707 | 1.000 | 0.355 |
기타 | 0.801 | 0.819 | 0.862 | 0.823 | 0.832 | 0.355 | 1.000 |
학사 | 석사 | 박사 | 연구 | 연수 | 기타 | 석박사 | |
---|---|---|---|---|---|---|---|
학사 | 1.000 | 0.666 | 0.526 | 0.182 | 0.038 | 0.741 | 0.436 |
석사 | 0.666 | 1.000 | 0.804 | 0.477 | 0.289 | 0.849 | 0.434 |
박사 | 0.526 | 0.804 | 1.000 | 0.516 | 0.231 | 0.732 | 0.362 |
연구 | 0.182 | 0.477 | 0.516 | 1.000 | 0.383 | 0.329 | 0.219 |
연수 | 0.038 | 0.289 | 0.231 | 0.383 | 1.000 | 0.162 | 0.074 |
기타 | 0.741 | 0.849 | 0.732 | 0.329 | 0.162 | 1.000 | 0.485 |
석박사 | 0.436 | 0.434 | 0.362 | 0.219 | 0.074 | 0.485 | 1.000 |
구분 | 학사 | 석사 | 박사 | 석박사 | 연구 | 연수 | 기타 | |
---|---|---|---|---|---|---|---|---|
0 | 가나 | 26 | 94 | 23 | 0 | 0 | 0 | 26 |
1 | 가봉 | 23 | 23 | 13 | 0 | 1 | 0 | 7 |
2 | 가이아나 | 0 | 3 | 0 | 0 | 0 | 0 | 0 |
3 | 감비아 | 1 | 9 | 1 | 0 | 0 | 0 | 3 |
4 | 과테말라 | 22 | 17 | 3 | 0 | 0 | 0 | 11 |
5 | 그리스 | 0 | 16 | 3 | 0 | 2 | 1 | 4 |
6 | 기니 | 1 | 12 | 3 | 0 | 0 | 0 | 4 |
7 | 기니비사우 | 2 | 0 | 0 | 0 | 0 | 0 | 2 |
8 | 나미비아 | 0 | 1 | 0 | 0 | 0 | 0 | 1 |
9 | 나이지리아 | 28 | 77 | 37 | 0 | 0 | 0 | 29 |
구분 | 학사 | 석사 | 박사 | 석박사 | 연구 | 연수 | 기타 | |
---|---|---|---|---|---|---|---|---|
150 | 팔레스타인 | 0 | 7 | 2 | 0 | 0 | 0 | 3 |
151 | 페루 | 29 | 71 | 2 | 0 | 0 | 1 | 16 |
152 | 포르투갈 | 0 | 11 | 3 | 0 | 0 | 0 | 1 |
153 | 폴란드 | 18 | 53 | 21 | 0 | 1 | 2 | 6 |
154 | 프랑스 | 3 | 61 | 19 | 0 | 1 | 5 | 7 |
155 | 피지 | 3 | 7 | 4 | 0 | 0 | 0 | 4 |
156 | 핀란드 | 1 | 21 | 3 | 0 | 3 | 1 | 4 |
157 | 필리핀 | 43 | 217 | 36 | 0 | 5 | 1 | 80 |
158 | 헝가리 | 2 | 48 | 14 | 0 | 4 | 0 | 4 |
159 | 홍콩 | 1 | 18 | 0 | 0 | 0 | 0 | 0 |