Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 918.0 KiB |
Average record size in memory | 94.0 B |
Variable types
Categorical | 6 |
---|---|
Numeric | 4 |
Dataset
Description | 약사 국가시험 응시자의 성적 현황을 분석할 수 있는 정보(연도, 직종, 회차, 일련번호, 과목명, 과목별 점수, 총점, 합격여부, 성별, 연령대)를 제공합니다. |
---|---|
URL | https://www.data.go.kr/data/15083505/fileData.do |
직종 has constant value "" | Constant |
회차 is highly overall correlated with 일련번호 and 1 other fields | High correlation |
일련번호 is highly overall correlated with 회차 and 1 other fields | High correlation |
과목별점수 is highly overall correlated with 총점 and 1 other fields | High correlation |
총점 is highly overall correlated with 과목별점수 and 1 other fields | High correlation |
연도 is highly overall correlated with 회차 and 1 other fields | High correlation |
합격여부 is highly overall correlated with 과목별점수 and 1 other fields | High correlation |
연령대 is highly imbalanced (60.8%) | Imbalance |
과목별점수 has 1562 (15.6%) zeros | Zeros |
총점 has 1546 (15.5%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 06:39:45.072894 |
---|---|
Analysis finished | 2023-12-12 06:39:48.024174 |
Duration | 2.95 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연도
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2000 | |
---|---|
2001 | |
2002 | |
2003 | |
2004 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2000 |
---|---|
2nd row | 2001 |
3rd row | 2000 |
4th row | 2000 |
5th row | 2000 |
Common Values
Value | Count | Frequency (%) |
2000 | 3694 | |
2001 | 1965 | |
2002 | 1935 | |
2003 | 1887 | |
2004 | 519 | 5.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2000 | 3694 | |
2001 | 1965 | |
2002 | 1935 | |
2003 | 1887 | |
2004 | 519 | 5.2% |
직종
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
약사(4년제) |
---|
Length
Max length | 36 |
---|---|
Median length | 36 |
Mean length | 36 |
Min length | 36 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 약사(4년제) |
---|---|
2nd row | 약사(4년제) |
3rd row | 약사(4년제) |
4th row | 약사(4년제) |
5th row | 약사(4년제) |
Common Values
Value | Count | Frequency (%) |
약사(4년제) | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
약사(4년제 | 10000 |
회차
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 52.1374 |
Minimum | 50 |
---|---|
Maximum | 55 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 50 |
---|---|
5-th percentile | 50 |
Q1 | 51 |
median | 52 |
Q3 | 53 |
95-th percentile | 55 |
Maximum | 55 |
Range | 5 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.5574222 |
---|---|
Coefficient of variation (CV) | 0.029871497 |
Kurtosis | -1.1807253 |
Mean | 52.1374 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.051393876 |
Sum | 521374 |
Variance | 2.4255638 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
50 | 2198 | |
52 | 1965 | |
53 | 1935 | |
54 | 1887 | |
51 | 1496 | |
55 | 519 | 5.2% |
Value | Count | Frequency (%) |
50 | 2198 | |
51 | 1496 | |
52 | 1965 | |
53 | 1935 | |
54 | 1887 | |
55 | 519 | 5.2% |
Value | Count | Frequency (%) |
55 | 519 | 5.2% |
54 | 1887 | |
53 | 1935 | |
52 | 1965 | |
51 | 1496 | |
50 | 2198 |
일련번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 5883 |
---|---|
Distinct (%) | 58.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3962.6133 |
Minimum | 1 |
---|---|
Maximum | 7941 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 392.95 |
Q1 | 1971.5 |
median | 3959 |
Q3 | 5937.25 |
95-th percentile | 7556 |
Maximum | 7941 |
Range | 7940 |
Interquartile range (IQR) | 3965.75 |
Descriptive statistics
Standard deviation | 2291.0541 |
---|---|
Coefficient of variation (CV) | 0.57816747 |
Kurtosis | -1.1977711 |
Mean | 3962.6133 |
Median Absolute Deviation (MAD) | 1982 |
Skewness | 0.0091735263 |
Sum | 39626133 |
Variance | 5248928.8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4369 | 7 | 0.1% |
5992 | 6 | 0.1% |
3880 | 6 | 0.1% |
6541 | 6 | 0.1% |
6763 | 5 | 0.1% |
4110 | 5 | 0.1% |
1206 | 5 | 0.1% |
7475 | 5 | 0.1% |
4234 | 5 | 0.1% |
5900 | 5 | 0.1% |
Other values (5873) | 9945 |
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
2 | 1 | < 0.1% |
3 | 1 | < 0.1% |
7 | 1 | < 0.1% |
9 | 1 | < 0.1% |
11 | 2 | |
12 | 3 | |
13 | 1 | < 0.1% |
14 | 1 | < 0.1% |
15 | 3 |
Value | Count | Frequency (%) |
7941 | 2 | |
7940 | 2 | |
7939 | 3 | |
7938 | 3 | |
7937 | 3 | |
7936 | 1 | < 0.1% |
7935 | 1 | < 0.1% |
7933 | 1 | < 0.1% |
7931 | 1 | < 0.1% |
7929 | 2 |
과목명
Categorical
Distinct | 13 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
정량분석학 | |
---|---|
위생화학 | |
생화학 | |
약전 | |
유기약품제조학 | |
Other values (8) |
Length
Max length | 18 |
---|---|
Median length | 6 |
Mean length | 4.4005 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 정성분석학 |
---|---|
2nd row | 무기약품제조학 |
3rd row | 정량분석학 |
4th row | 약전 |
5th row | 약물학 |
Common Values
Value | Count | Frequency (%) |
정량분석학 | 861 | |
위생화학 | 839 | |
생화학 | 839 | |
약전 | 836 | |
유기약품제조학 | 833 | |
약물학 | 832 | |
정성분석학 | 825 | |
약사관계법규 | 825 | |
미생물학 | 820 | |
생약학 | 819 | |
Other values (3) | 1671 |
Length
Value | Count | Frequency (%) |
정량분석학 | 861 | |
위생화학 | 839 | |
생화학 | 839 | |
약전 | 836 | |
유기약품제조학 | 833 | |
약물학 | 832 | |
정성분석학 | 825 | |
약사관계법규 | 825 | |
미생물학 | 820 | |
생약학 | 819 | |
Other values (8) | 1916 |
과목별점수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 26 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.5092 |
Minimum | 0 |
---|---|
Maximum | 25 |
Zeros | 1562 |
Zeros (%) | 15.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 13 |
median | 18 |
Q3 | 21 |
95-th percentile | 24 |
Maximum | 25 |
Range | 25 |
Interquartile range (IQR) | 8 |
Descriptive statistics
Standard deviation | 7.7098417 |
---|---|
Coefficient of variation (CV) | 0.49711408 |
Kurtosis | -0.12992912 |
Mean | 15.5092 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -1.0712501 |
Sum | 155092 |
Variance | 59.44166 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1562 | |
21 | 930 | |
20 | 873 | |
22 | 869 | |
19 | 824 | |
18 | 713 | 7.1% |
23 | 657 | 6.6% |
17 | 582 | 5.8% |
16 | 566 | 5.7% |
24 | 424 | 4.2% |
Other values (16) | 2000 |
Value | Count | Frequency (%) |
0 | 1562 | |
1 | 1 | < 0.1% |
2 | 1 | < 0.1% |
3 | 7 | 0.1% |
4 | 23 | 0.2% |
5 | 38 | 0.4% |
6 | 40 | 0.4% |
7 | 70 | 0.7% |
8 | 71 | 0.7% |
9 | 89 | 0.9% |
Value | Count | Frequency (%) |
25 | 172 | 1.7% |
24 | 424 | |
23 | 657 | |
22 | 869 | |
21 | 930 | |
20 | 873 | |
19 | 824 | |
18 | 713 | |
17 | 582 | |
16 | 566 |
총점
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 225 |
---|---|
Distinct (%) | 2.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 186.5266 |
Minimum | 0 |
---|---|
Maximum | 289 |
Zeros | 1546 |
Zeros (%) | 15.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 166 |
median | 222 |
Q3 | 247 |
95-th percentile | 267 |
Maximum | 289 |
Range | 289 |
Interquartile range (IQR) | 81 |
Descriptive statistics
Standard deviation | 88.2877 |
---|---|
Coefficient of variation (CV) | 0.47332498 |
Kurtosis | 0.28327079 |
Mean | 186.5266 |
Median Absolute Deviation (MAD) | 32 |
Skewness | -1.299229 |
Sum | 1865266 |
Variance | 7794.718 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1546 | 15.5% |
243 | 128 | 1.3% |
240 | 125 | 1.2% |
234 | 123 | 1.2% |
232 | 121 | 1.2% |
241 | 118 | 1.2% |
245 | 115 | 1.1% |
228 | 113 | 1.1% |
239 | 112 | 1.1% |
237 | 112 | 1.1% |
Other values (215) | 7387 |
Value | Count | Frequency (%) |
0 | 1546 | |
15 | 2 | < 0.1% |
21 | 2 | < 0.1% |
24 | 1 | < 0.1% |
28 | 1 | < 0.1% |
40 | 1 | < 0.1% |
46 | 2 | < 0.1% |
54 | 3 | < 0.1% |
58 | 2 | < 0.1% |
65 | 3 | < 0.1% |
Value | Count | Frequency (%) |
289 | 1 | < 0.1% |
288 | 1 | < 0.1% |
286 | 8 | 0.1% |
285 | 11 | |
284 | 5 | 0.1% |
283 | 6 | 0.1% |
282 | 11 | |
281 | 15 | |
280 | 26 | |
279 | 13 |
합격여부
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
합격 | |
---|---|
불합격 | |
결시 | |
응시결격 | 3 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.1578 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 결시 |
---|---|
2nd row | 합격 |
3rd row | 결시 |
4th row | 합격 |
5th row | 결시 |
Common Values
Value | Count | Frequency (%) |
합격 | 6885 | |
불합격 | 1572 | 15.7% |
결시 | 1540 | 15.4% |
응시결격 | 3 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
합격 | 6885 | |
불합격 | 1572 | 15.7% |
결시 | 1540 | 15.4% |
응시결격 | 3 | < 0.1% |
성별
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
여 | |
---|---|
남 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 여 |
---|---|
2nd row | 여 |
3rd row | 남 |
4th row | 남 |
5th row | 여 |
Common Values
Value | Count | Frequency (%) |
여 | 6659 | |
남 | 3341 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
여 | 6659 | |
남 | 3341 |
연령대
Categorical
IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
20 | |
---|---|
30 | |
40 | 260 |
50 | 72 |
60 | 15 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20 |
---|---|
2nd row | 20 |
3rd row | 20 |
4th row | 30 |
5th row | 20 |
Common Values
Value | Count | Frequency (%) |
20 | 7909 | |
30 | 1744 | 17.4% |
40 | 260 | 2.6% |
50 | 72 | 0.7% |
60 | 15 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20 | 7909 | |
30 | 1744 | 17.4% |
40 | 260 | 2.6% |
50 | 72 | 0.7% |
60 | 15 | 0.1% |
연도 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대 | |
---|---|---|---|---|---|---|---|---|---|
연도 | 1.000 | 1.000 | 0.985 | 0.275 | 0.480 | 0.490 | 0.307 | 0.031 | 0.127 |
회차 | 1.000 | 1.000 | 0.934 | 0.269 | 0.495 | 0.503 | 0.537 | 0.054 | 0.070 |
일련번호 | 0.985 | 0.934 | 1.000 | 0.170 | 0.578 | 0.592 | 0.559 | 0.136 | 0.181 |
과목명 | 0.275 | 0.269 | 0.170 | 1.000 | 0.163 | 0.020 | 0.000 | 0.000 | 0.000 |
과목별점수 | 0.480 | 0.495 | 0.578 | 0.163 | 1.000 | 0.888 | 0.840 | 0.282 | 0.351 |
총점 | 0.490 | 0.503 | 0.592 | 0.020 | 0.888 | 1.000 | 0.889 | 0.354 | 0.461 |
합격여부 | 0.307 | 0.537 | 0.559 | 0.000 | 0.840 | 0.889 | 1.000 | 0.344 | 0.222 |
성별 | 0.031 | 0.054 | 0.136 | 0.000 | 0.282 | 0.354 | 0.344 | 1.000 | 0.231 |
연령대 | 0.127 | 0.070 | 0.181 | 0.000 | 0.351 | 0.461 | 0.222 | 0.231 | 1.000 |
연도 | 합격여부 | 연령대 | 과목명 | 성별 | |
---|---|---|---|---|---|
연도 | 1.000 | 0.255 | 0.048 | 0.152 | 0.038 |
합격여부 | 0.255 | 1.000 | 0.183 | 0.000 | 0.230 |
연령대 | 0.048 | 0.183 | 1.000 | 0.000 | 0.282 |
과목명 | 0.152 | 0.000 | 0.000 | 1.000 | 0.000 |
성별 | 0.038 | 0.230 | 0.282 | 0.000 | 1.000 |
회차 | 일련번호 | 과목별점수 | 총점 | 연도 | 과목명 | 합격여부 | 성별 | 연령대 | |
---|---|---|---|---|---|---|---|---|---|
회차 | 1.000 | 0.982 | 0.346 | 0.363 | 1.000 | 0.137 | 0.375 | 0.039 | 0.047 |
일련번호 | 0.982 | 1.000 | 0.362 | 0.384 | 0.825 | 0.071 | 0.369 | 0.105 | 0.076 |
과목별점수 | 0.346 | 0.362 | 1.000 | 0.849 | 0.220 | 0.068 | 0.682 | 0.218 | 0.152 |
총점 | 0.363 | 0.384 | 0.849 | 1.000 | 0.225 | 0.008 | 0.763 | 0.272 | 0.209 |
연도 | 1.000 | 0.825 | 0.220 | 0.225 | 1.000 | 0.152 | 0.255 | 0.038 | 0.048 |
과목명 | 0.137 | 0.071 | 0.068 | 0.008 | 0.152 | 1.000 | 0.000 | 0.000 | 0.000 |
합격여부 | 0.375 | 0.369 | 0.682 | 0.763 | 0.255 | 0.000 | 1.000 | 0.230 | 0.183 |
성별 | 0.039 | 0.105 | 0.218 | 0.272 | 0.038 | 0.000 | 0.230 | 1.000 | 0.282 |
연령대 | 0.047 | 0.076 | 0.152 | 0.209 | 0.048 | 0.000 | 0.183 | 0.282 | 1.000 |
연도 | 직종 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대 | |
---|---|---|---|---|---|---|---|---|---|---|
3255 | 2000 | 약사(4년제) | 50 | 272 | 정성분석학 | 0 | 0 | 결시 | 여 | 20 |
47477 | 2001 | 약사(4년제) | 52 | 3957 | 무기약품제조학 | 18 | 221 | 합격 | 여 | 20 |
15432 | 2000 | 약사(4년제) | 50 | 1287 | 정량분석학 | 0 | 0 | 결시 | 남 | 20 |
414 | 2000 | 약사(4년제) | 50 | 35 | 약전 | 20 | 209 | 합격 | 남 | 30 |
17891 | 2000 | 약사(4년제) | 50 | 1491 | 약물학 | 0 | 0 | 결시 | 여 | 20 |
87177 | 2003 | 약사(4년제) | 54 | 7265 | 약제학 | 21 | 241 | 합격 | 남 | 20 |
30183 | 2000 | 약사(4년제) | 51 | 2516 | 정성분석학 | 21 | 247 | 합격 | 여 | 20 |
68883 | 2002 | 약사(4년제) | 53 | 5741 | 정성분석학 | 20 | 220 | 합격 | 여 | 20 |
52000 | 2001 | 약사(4년제) | 52 | 4334 | 미생물학 | 25 | 279 | 합격 | 여 | 20 |
33901 | 2000 | 약사(4년제) | 51 | 2826 | 위생화학 | 22 | 253 | 합격 | 여 | 20 |
연도 | 직종 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대 | |
---|---|---|---|---|---|---|---|---|---|---|
85822 | 2003 | 약사(4년제) | 54 | 7152 | 약사관계법규 | 18 | 218 | 합격 | 여 | 20 |
41107 | 2001 | 약사(4년제) | 52 | 3426 | 생화학 | 23 | 265 | 합격 | 여 | 20 |
54036 | 2002 | 약사(4년제) | 53 | 4504 | 정량분석학 | 0 | 0 | 결시 | 남 | 30 |
24610 | 2000 | 약사(4년제) | 51 | 2051 | 약사관계법규 | 13 | 141 | 불합격 | 여 | 20 |
67324 | 2002 | 약사(4년제) | 53 | 5611 | 미생물학 | 19 | 246 | 합격 | 여 | 20 |
60691 | 2002 | 약사(4년제) | 53 | 5058 | 생화학 | 20 | 227 | 합격 | 여 | 20 |
25371 | 2000 | 약사(4년제) | 51 | 2115 | 정성분석학 | 21 | 257 | 합격 | 여 | 20 |
38837 | 2001 | 약사(4년제) | 52 | 3237 | 무기약품제조학 | 17 | 217 | 합격 | 여 | 20 |
7727 | 2000 | 약사(4년제) | 50 | 644 | 약물학 | 0 | 0 | 결시 | 여 | 20 |
79846 | 2003 | 약사(4년제) | 54 | 6654 | 약사관계법규 | 21 | 239 | 합격 | 여 | 20 |