Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 918.0 KiB |
Average record size in memory | 94.0 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 5 |
Dataset
Description | 2급 언어재활사 국가시험 응시자의 성적 현황을 분석할 수 있는 정보(연도, 직종, 회차, 일련번호, 과목명, 과목별 점수, 총점, 합격여부, 성별)를 제공합니다. |
---|---|
URL | https://www.data.go.kr/data/15083528/fileData.do |
직종 has constant value "" | Constant |
연도 is highly overall correlated with 회차 and 1 other fields | High correlation |
회차 is highly overall correlated with 연도 and 1 other fields | High correlation |
일련번호 is highly overall correlated with 연도 and 1 other fields | High correlation |
과목별점수 is highly overall correlated with 총점 and 1 other fields | High correlation |
총점 is highly overall correlated with 과목별점수 and 1 other fields | High correlation |
합격여부 is highly overall correlated with 과목별점수 and 1 other fields | High correlation |
합격여부 is highly imbalanced (52.6%) | Imbalance |
성별 is highly imbalanced (52.9%) | Imbalance |
과목별점수 has 326 (3.3%) zeros | Zeros |
총점 has 323 (3.2%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 13:11:24.291352 |
---|---|
Analysis finished | 2023-12-12 13:11:28.971917 |
Duration | 4.68 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 10 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2017.0007 |
Minimum | 2013 |
---|---|
Maximum | 2022 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2013 |
---|---|
5-th percentile | 2014 |
Q1 | 2014 |
median | 2016 |
Q3 | 2020 |
95-th percentile | 2022 |
Maximum | 2022 |
Range | 9 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 2.8606849 |
---|---|
Coefficient of variation (CV) | 0.0014182865 |
Kurtosis | -1.2756077 |
Mean | 2017.0007 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.41436281 |
Sum | 20170007 |
Variance | 8.1835179 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2014 | 3191 | |
2015 | 946 | 9.5% |
2022 | 909 | 9.1% |
2016 | 899 | 9.0% |
2021 | 847 | 8.5% |
2020 | 795 | 8.0% |
2018 | 793 | 7.9% |
2017 | 780 | 7.8% |
2019 | 770 | 7.7% |
2013 | 70 | 0.7% |
Value | Count | Frequency (%) |
2013 | 70 | 0.7% |
2014 | 3191 | |
2015 | 946 | 9.5% |
2016 | 899 | 9.0% |
2017 | 780 | 7.8% |
2018 | 793 | 7.9% |
2019 | 770 | 7.7% |
2020 | 795 | 8.0% |
2021 | 847 | 8.5% |
2022 | 909 | 9.1% |
Value | Count | Frequency (%) |
2022 | 909 | 9.1% |
2021 | 847 | 8.5% |
2020 | 795 | 8.0% |
2019 | 770 | 7.7% |
2018 | 793 | 7.9% |
2017 | 780 | 7.8% |
2016 | 899 | 9.0% |
2015 | 946 | 9.5% |
2014 | 3191 | |
2013 | 70 | 0.7% |
직종
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2급언어재활사 |
---|
Length
Max length | 34 |
---|---|
Median length | 34 |
Mean length | 34 |
Min length | 34 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2급언어재활사 |
---|---|
2nd row | 2급언어재활사 |
3rd row | 2급언어재활사 |
4th row | 2급언어재활사 |
5th row | 2급언어재활사 |
Common Values
Value | Count | Frequency (%) |
2급언어재활사 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2급언어재활사 | 10000 |
회차
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.7682 |
Minimum | 1 |
---|---|
Maximum | 11 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 9 |
95-th percentile | 11 |
Maximum | 11 |
Range | 10 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 3.1259312 |
---|---|
Coefficient of variation (CV) | 0.5419249 |
Kurtosis | -1.3075636 |
Mean | 5.7682 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0.25383527 |
Sum | 57682 |
Variance | 9.7714459 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 2255 | |
4 | 946 | |
3 | 936 | |
11 | 909 | |
5 | 899 | 9.0% |
10 | 847 | 8.5% |
9 | 795 | 8.0% |
7 | 793 | 7.9% |
6 | 780 | 7.8% |
8 | 770 | 7.7% |
Value | Count | Frequency (%) |
1 | 70 | 0.7% |
2 | 2255 | |
3 | 936 | |
4 | 946 | |
5 | 899 | 9.0% |
6 | 780 | 7.8% |
7 | 793 | 7.9% |
8 | 770 | 7.7% |
9 | 795 | 8.0% |
10 | 847 | 8.5% |
Value | Count | Frequency (%) |
11 | 909 | |
10 | 847 | 8.5% |
9 | 795 | 8.0% |
8 | 770 | 7.7% |
7 | 793 | 7.9% |
6 | 780 | 7.8% |
5 | 899 | 9.0% |
4 | 946 | |
3 | 936 | |
2 | 2255 |
일련번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 7862 |
---|---|
Distinct (%) | 78.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8478.6392 |
Minimum | 1 |
---|---|
Maximum | 16975 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 872.9 |
Q1 | 4275.25 |
median | 8366.5 |
Q3 | 12774.5 |
95-th percentile | 16189.05 |
Maximum | 16975 |
Range | 16974 |
Interquartile range (IQR) | 8499.25 |
Descriptive statistics
Standard deviation | 4916.0113 |
---|---|
Coefficient of variation (CV) | 0.57981135 |
Kurtosis | -1.2092886 |
Mean | 8478.6392 |
Median Absolute Deviation (MAD) | 4255 |
Skewness | 0.021434157 |
Sum | 84786392 |
Variance | 24167167 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9503 | 4 | < 0.1% |
897 | 4 | < 0.1% |
6266 | 4 | < 0.1% |
9996 | 4 | < 0.1% |
10863 | 4 | < 0.1% |
15544 | 4 | < 0.1% |
11860 | 4 | < 0.1% |
11375 | 4 | < 0.1% |
12055 | 4 | < 0.1% |
1542 | 4 | < 0.1% |
Other values (7852) | 9960 |
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
2 | 1 | < 0.1% |
5 | 1 | < 0.1% |
8 | 1 | < 0.1% |
9 | 1 | < 0.1% |
12 | 1 | < 0.1% |
15 | 1 | < 0.1% |
18 | 2 | |
19 | 3 | |
21 | 1 | < 0.1% |
Value | Count | Frequency (%) |
16975 | 1 | < 0.1% |
16971 | 3 | |
16970 | 2 | |
16964 | 1 | < 0.1% |
16963 | 1 | < 0.1% |
16962 | 1 | < 0.1% |
16960 | 1 | < 0.1% |
16958 | 1 | < 0.1% |
16956 | 1 | < 0.1% |
16955 | 1 | < 0.1% |
과목명
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
신경언어장애 | |
---|---|
언어발달장애 | |
조음음운장애 | |
음성장애 | |
유창성장애 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 5.4036 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 신경언어장애 |
---|---|
2nd row | 조음음운장애 |
3rd row | 신경언어장애 |
4th row | 신경언어장애 |
5th row | 조음음운장애 |
Common Values
Value | Count | Frequency (%) |
신경언어장애 | 2038 | |
언어발달장애 | 1997 | |
조음음운장애 | 1996 | |
음성장애 | 1995 | |
유창성장애 | 1974 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
신경언어장애 | 2038 | |
언어발달장애 | 1997 | |
조음음운장애 | 1996 | |
음성장애 | 1995 | |
유창성장애 | 1974 |
과목별점수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 34 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20.8393 |
Minimum | 0 |
---|---|
Maximum | 35 |
Zeros | 326 |
Zeros (%) | 3.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 9 |
Q1 | 17 |
median | 21 |
Q3 | 26 |
95-th percentile | 31 |
Maximum | 35 |
Range | 35 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 6.9698015 |
---|---|
Coefficient of variation (CV) | 0.33445469 |
Kurtosis | 0.93370572 |
Mean | 20.8393 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -0.74550692 |
Sum | 208393 |
Variance | 48.578133 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
21 | 668 | 6.7% |
23 | 651 | 6.5% |
22 | 647 | 6.5% |
19 | 643 | 6.4% |
20 | 618 | 6.2% |
24 | 595 | 5.9% |
18 | 488 | 4.9% |
25 | 479 | 4.8% |
17 | 435 | 4.3% |
26 | 427 | 4.3% |
Other values (24) | 4349 |
Value | Count | Frequency (%) |
0 | 326 | |
3 | 1 | < 0.1% |
4 | 5 | 0.1% |
5 | 12 | 0.1% |
6 | 27 | 0.3% |
7 | 36 | 0.4% |
8 | 69 | 0.7% |
9 | 87 | 0.9% |
10 | 129 | 1.3% |
11 | 162 |
Value | Count | Frequency (%) |
35 | 15 | 0.1% |
34 | 77 | 0.8% |
33 | 142 | 1.4% |
32 | 195 | |
31 | 235 | |
30 | 299 | |
29 | 344 | |
28 | 380 | |
27 | 411 | |
26 | 427 |
총점
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 118 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 104.1544 |
Minimum | 0 |
---|---|
Maximum | 148 |
Zeros | 323 |
Zeros (%) | 3.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 56 |
Q1 | 91 |
median | 111 |
Q3 | 124 |
95-th percentile | 136 |
Maximum | 148 |
Range | 148 |
Interquartile range (IQR) | 33 |
Descriptive statistics
Standard deviation | 28.535595 |
---|---|
Coefficient of variation (CV) | 0.27397398 |
Kurtosis | 3.5686569 |
Mean | 104.1544 |
Median Absolute Deviation (MAD) | 15 |
Skewness | -1.668047 |
Sum | 1041544 |
Variance | 814.28019 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 323 | 3.2% |
121 | 233 | 2.3% |
128 | 212 | 2.1% |
127 | 209 | 2.1% |
118 | 205 | 2.1% |
129 | 204 | 2.0% |
119 | 202 | 2.0% |
120 | 202 | 2.0% |
126 | 200 | 2.0% |
125 | 199 | 2.0% |
Other values (108) | 7811 |
Value | Count | Frequency (%) |
0 | 323 | |
25 | 1 | < 0.1% |
32 | 1 | < 0.1% |
33 | 3 | < 0.1% |
34 | 3 | < 0.1% |
35 | 3 | < 0.1% |
36 | 2 | < 0.1% |
37 | 2 | < 0.1% |
39 | 1 | < 0.1% |
40 | 4 | < 0.1% |
Value | Count | Frequency (%) |
148 | 1 | < 0.1% |
147 | 3 | < 0.1% |
146 | 3 | < 0.1% |
145 | 6 | 0.1% |
144 | 24 | 0.2% |
143 | 18 | 0.2% |
142 | 25 | 0.2% |
141 | 61 | |
140 | 40 | |
139 | 83 |
합격여부
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
합격 | |
---|---|
불합격 | |
결시 | 321 |
응시결격 | 26 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.2083 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 불합격 |
---|---|
2nd row | 합격 |
3rd row | 합격 |
4th row | 합격 |
5th row | 합격 |
Common Values
Value | Count | Frequency (%) |
합격 | 7622 | |
불합격 | 2031 | 20.3% |
결시 | 321 | 3.2% |
응시결격 | 26 | 0.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
합격 | 7622 | |
불합격 | 2031 | 20.3% |
결시 | 321 | 3.2% |
응시결격 | 26 | 0.3% |
성별
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
여 | |
---|---|
남 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 남 |
---|---|
2nd row | 여 |
3rd row | 여 |
4th row | 여 |
5th row | 여 |
Common Values
Value | Count | Frequency (%) |
여 | 8993 | |
남 | 1007 | 10.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
여 | 8993 | |
남 | 1007 | 10.1% |
연령대
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
20 | |
---|---|
30 | |
40 | |
50 | 338 |
60 | 27 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20 |
---|---|
2nd row | 40 |
3rd row | 20 |
4th row | 20 |
5th row | 30 |
Common Values
Value | Count | Frequency (%) |
20 | 6842 | |
30 | 1712 | 17.1% |
40 | 1081 | 10.8% |
50 | 338 | 3.4% |
60 | 27 | 0.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20 | 6842 | |
30 | 1712 | 17.1% |
40 | 1081 | 10.8% |
50 | 338 | 3.4% |
60 | 27 | 0.3% |
연도 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대 | |
---|---|---|---|---|---|---|---|---|---|
연도 | 1.000 | 0.999 | 0.946 | 0.000 | 0.122 | 0.175 | 0.108 | 0.036 | 0.131 |
회차 | 0.999 | 1.000 | 0.985 | 0.000 | 0.194 | 0.260 | 0.157 | 0.068 | 0.199 |
일련번호 | 0.946 | 0.985 | 1.000 | 0.000 | 0.162 | 0.234 | 0.122 | 0.056 | 0.189 |
과목명 | 0.000 | 0.000 | 0.000 | 1.000 | 0.641 | 0.000 | 0.000 | 0.000 | 0.000 |
과목별점수 | 0.122 | 0.194 | 0.162 | 0.641 | 1.000 | 0.878 | 0.840 | 0.079 | 0.253 |
총점 | 0.175 | 0.260 | 0.234 | 0.000 | 0.878 | 1.000 | 0.910 | 0.083 | 0.283 |
합격여부 | 0.108 | 0.157 | 0.122 | 0.000 | 0.840 | 0.910 | 1.000 | 0.074 | 0.150 |
성별 | 0.036 | 0.068 | 0.056 | 0.000 | 0.079 | 0.083 | 0.074 | 1.000 | 0.028 |
연령대 | 0.131 | 0.199 | 0.189 | 0.000 | 0.253 | 0.283 | 0.150 | 0.028 | 1.000 |
성별 | 과목명 | 합격여부 | 연령대 | |
---|---|---|---|---|
성별 | 1.000 | 0.000 | 0.049 | 0.034 |
과목명 | 0.000 | 1.000 | 0.000 | 0.000 |
합격여부 | 0.049 | 0.000 | 1.000 | 0.123 |
연령대 | 0.034 | 0.000 | 0.123 | 1.000 |
연도 | 회차 | 일련번호 | 과목별점수 | 총점 | 과목명 | 합격여부 | 성별 | 연령대 | |
---|---|---|---|---|---|---|---|---|---|
연도 | 1.000 | 0.990 | 0.981 | -0.081 | -0.106 | 0.000 | 0.083 | 0.052 | 0.082 |
회차 | 0.990 | 1.000 | 0.991 | -0.090 | -0.115 | 0.000 | 0.083 | 0.034 | 0.079 |
일련번호 | 0.981 | 0.991 | 1.000 | -0.093 | -0.119 | 0.000 | 0.073 | 0.043 | 0.079 |
과목별점수 | -0.081 | -0.090 | -0.093 | 1.000 | 0.722 | 0.320 | 0.682 | 0.069 | 0.106 |
총점 | -0.106 | -0.115 | -0.119 | 0.722 | 1.000 | 0.000 | 0.799 | 0.063 | 0.121 |
과목명 | 0.000 | 0.000 | 0.000 | 0.320 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
합격여부 | 0.083 | 0.083 | 0.073 | 0.682 | 0.799 | 0.000 | 1.000 | 0.049 | 0.123 |
성별 | 0.052 | 0.034 | 0.043 | 0.069 | 0.063 | 0.000 | 0.049 | 1.000 | 0.034 |
연령대 | 0.082 | 0.079 | 0.079 | 0.106 | 0.121 | 0.000 | 0.123 | 0.034 | 1.000 |
연도 | 직종 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대 | |
---|---|---|---|---|---|---|---|---|---|---|
904 | 2014 | 2급언어재활사 | 2 | 181 | 신경언어장애 | 16 | 80 | 불합격 | 남 | 20 |
58485 | 2019 | 2급언어재활사 | 8 | 11698 | 조음음운장애 | 32 | 129 | 합격 | 여 | 40 |
48024 | 2017 | 2급언어재활사 | 6 | 9605 | 신경언어장애 | 24 | 132 | 합격 | 여 | 20 |
39564 | 2016 | 2급언어재활사 | 5 | 7913 | 신경언어장애 | 30 | 139 | 합격 | 여 | 20 |
50445 | 2018 | 2급언어재활사 | 7 | 10090 | 조음음운장애 | 27 | 119 | 합격 | 여 | 30 |
58757 | 2019 | 2급언어재활사 | 8 | 11752 | 음성장애 | 17 | 118 | 합격 | 남 | 20 |
82706 | 2022 | 2급언어재활사 | 11 | 16542 | 언어발달장애 | 0 | 0 | 결시 | 여 | 20 |
72507 | 2021 | 2급언어재활사 | 10 | 14502 | 음성장애 | 15 | 116 | 합격 | 여 | 20 |
8754 | 2014 | 2급언어재활사 | 2 | 1751 | 신경언어장애 | 24 | 115 | 합격 | 여 | 20 |
61993 | 2019 | 2급언어재활사 | 8 | 12399 | 유창성장애 | 22 | 115 | 합격 | 여 | 20 |
연도 | 직종 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대 | |
---|---|---|---|---|---|---|---|---|---|---|
59287 | 2019 | 2급언어재활사 | 8 | 11858 | 음성장애 | 19 | 126 | 합격 | 여 | 20 |
9409 | 2014 | 2급언어재활사 | 2 | 1882 | 신경언어장애 | 24 | 131 | 합격 | 여 | 20 |
42792 | 2016 | 2급언어재활사 | 5 | 8559 | 음성장애 | 11 | 77 | 불합격 | 여 | 30 |
4303 | 2014 | 2급언어재활사 | 2 | 861 | 유창성장애 | 20 | 99 | 합격 | 남 | 20 |
35379 | 2016 | 2급언어재활사 | 5 | 7076 | 신경언어장애 | 22 | 98 | 합격 | 여 | 20 |
61918 | 2019 | 2급언어재활사 | 8 | 12384 | 유창성장애 | 5 | 37 | 응시결격 | 여 | 20 |
9630 | 2014 | 2급언어재활사 | 2 | 1927 | 음성장애 | 23 | 136 | 합격 | 여 | 20 |
64924 | 2020 | 2급언어재활사 | 9 | 12985 | 신경언어장애 | 29 | 120 | 합격 | 여 | 20 |
46713 | 2017 | 2급언어재활사 | 6 | 9343 | 유창성장애 | 17 | 94 | 합격 | 여 | 40 |
17215 | 2014 | 2급언어재활사 | 2 | 3444 | 조음음운장애 | 32 | 128 | 합격 | 여 | 20 |