Overview

Dataset statistics

Number of variables: 10
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 918.0 KiB
Average record size in memory: 94.0 B
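
The average record size follows from the total in-memory size and the number of observations. A minimal sketch of that arithmetic, assuming the report's KiB means 1024 bytes; the variable names are illustrative and not part of the report:

```python
# Figures copied from the dataset statistics above.
total_size_kib = 918.0
n_observations = 10000

# Assumption: 1 KiB = 1024 bytes.
average_record_bytes = total_size_kib * 1024 / n_observations

print(f"{average_record_bytes:.1f} B")
```

The result rounds to the 94.0 B shown in the report.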

Variable types

Numeric: 5
Categorical: 5

Dataset

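Description: Provides information for analyzing the exam results of candidates who sat the national examination for Level 2 rehabilitation counselors for persons with disabilities (2급 장애인재활상담사). Fields: 연도 (year), 직종 (occupation type), 회차 (exam round), 일련번호 (serial number), 과목명 (subject name), 과목별점수 (score per subject), 총점 (total score), 합격여부 (pass/fail status), 성별 (sex), 연령대 (age group).
URL: https://www.data.go.kr/data/15083530/fileData.do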

Alerts

직종 has constant value "2급 장애인재활상담사" [Constant]
연도 is highly overall correlated with 회차 and 1 other field [High correlation]
회차 is highly overall correlated with 연도 and 1 other field [High correlation]
일련번호 is highly overall correlated with 연도 and 1 other field [High correlation]
과목별점수 is highly overall correlated with 총점 and 1 other field [High correlation]
총점 is highly overall correlated with 과목별점수 and 1 other field [High correlation]
합격여부 is highly overall correlated with 과목별점수 and 1 other field [High correlation]
합격여부 is highly imbalanced (52.8%) [Imbalance]
과목별점수 has 640 (6.4%) zeros [Zeros]
총점 has 638 (6.4%) zeros [Zeros]
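
The 52.8% imbalance figure for 합격여부 is consistent with a normalized-entropy score: one minus the Shannon entropy of the class distribution divided by the maximum possible entropy for that number of classes. The sketch below is a hedged reconstruction, not the library's own code. It uses the four 합격여부 counts from that variable's section of this report, and the function name `imbalance_score` is hypothetical.

```python
import math

# Class counts for 합격여부, copied from its Common Values table in this report.
outcome_counts = {"합격": 7969, "불합격": 1337, "결시": 638, "응시결격": 56}


def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy in bits.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    Assumption: this is how the report's imbalance percentage is defined.
    """
    total = sum(counts.values())
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum(
        (c / total) * math.log2(c / total) for c in counts.values() if c > 0
    )
    return 1 - entropy / math.log2(n_classes)


score = imbalance_score(outcome_counts)
print(f"{score:.1%}")
```

With these counts the score comes out at about 0.528, matching the alert.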

Reproduction

Analysis started: 2023-12-12 07:37:08.662109
Analysis finished: 2023-12-12 07:37:14.153643
Duration: 5.49 seconds
Software version: ydata-profiling v4.5.1

Variables

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2019.4816
Minimum: 2018
Maximum: 2023
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2018
5-th percentile: 2018
Q1: 2018
median: 2019
Q3: 2020
95-th percentile: 2023
Maximum: 2023
Range: 5
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.540097
Coefficient of variation (CV): 0.00076261995
Kurtosis: -0.082591938
Mean: 2019.4816
Median Absolute Deviation (MAD): 1
Skewness: 0.91655314
Sum: 20194816
Variance: 2.3718986
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)

Common values
Value  Count  Frequency (%)
2018   3604   36.0%
2019   2147   21.5%
2020   2081   20.8%
2021    997   10.0%
2023    832    8.3%
2022    339    3.4%

Minimum values
Value  Count  Frequency (%)
2018   3604   36.0%
2019   2147   21.5%
2020   2081   20.8%
2021    997   10.0%
2022    339    3.4%
2023    832    8.3%

Maximum values
Value  Count  Frequency (%)
2023    832    8.3%
2022    339    3.4%
2021    997   10.0%
2020   2081   20.8%
2019   2147   21.5%
2018   3604   36.0%
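
Because 연도 takes only six distinct values, its mean, variance, standard deviation, and coefficient of variation can be recomputed from the value counts alone. The sketch below assumes the reported variance is the sample variance (divisor n - 1); that assumption reproduces the reported 2.3718986. The helper name `weighted_summary` is illustrative.

```python
import math

# Value counts for 연도, copied from the frequency table above.
year_counts = {2018: 3604, 2019: 2147, 2020: 2081, 2021: 997, 2022: 339, 2023: 832}


def weighted_summary(counts):
    """Summary statistics of a sample described by {value: count}."""
    n = sum(counts.values())
    mean = sum(value * count for value, count in counts.items()) / n
    # Assumption: sample variance with divisor n - 1.
    variance = sum(count * (value - mean) ** 2 for value, count in counts.items()) / (n - 1)
    std = math.sqrt(variance)
    return {"n": n, "mean": mean, "variance": variance, "std": std, "cv": std / mean}


summary = weighted_summary(year_counts)
for name, value in summary.items():
    print(name, value)
```

The same helper can be applied to the 회차 counts, since that table also lists every distinct value.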

직종
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value  Count
2급 장애인재활상담사  10000

Length

Max length: 31
Median length: 31
Mean length: 31
Min length: 31

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2급 장애인재활상담사
2nd row: 2급 장애인재활상담사
3rd row: 2급 장애인재활상담사
4th row: 2급 장애인재활상담사
5th row: 2급 장애인재활상담사

Common Values

Value  Count  Frequency (%)
2급 장애인재활상담사  10000  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
2급  10000  50.0%
장애인재활상담사  10000  50.0%

회차
Real number (ℝ)

HIGH CORRELATION 

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.3067
Minimum: 1
Maximum: 7
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
median: 3
Q3: 4
95-th percentile: 7
Maximum: 7
Range: 6
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.7419927
Coefficient of variation (CV): 0.526807
Kurtosis: -0.43337712
Mean: 3.3067
Median Absolute Deviation (MAD): 1
Skewness: 0.53750009
Sum: 33067
Variance: 3.0345386
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=7)

Common values
Value  Count  Frequency (%)
3      2147   21.5%
4      2081   20.8%
2      1855   18.6%
1      1749   17.5%
5       997   10.0%
7       832    8.3%
6       339    3.4%

Minimum values
Value  Count  Frequency (%)
1      1749   17.5%
2      1855   18.6%
3      2147   21.5%
4      2081   20.8%
5       997   10.0%
6       339    3.4%
7       832    8.3%

Maximum values
Value  Count  Frequency (%)
7       832    8.3%
6       339    3.4%
5       997   10.0%
4      2081   20.8%
3      2147   21.5%
2      1855   18.6%
1      1749   17.5%
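
The quantile statistics for 회차 can be checked against its value counts. The sketch below walks the cumulative counts and interpolates linearly between neighbouring order statistics. The interpolation scheme is an assumption (it is a common default, but the report does not state which one it uses), and `quantile_from_counts` is a hypothetical helper.

```python
import math

# Value counts for 회차, copied from the frequency table above.
round_counts = {1: 1749, 2: 1855, 3: 2147, 4: 2081, 5: 997, 6: 339, 7: 832}


def quantile_from_counts(counts, q):
    """Quantile of the sample described by {value: count}.

    Assumption: linear interpolation between order statistics,
    with the fractional position q * (n - 1) in the sorted sample.
    """
    ordered_values = sorted(counts)
    n = sum(counts.values())
    position = q * (n - 1)
    lower, upper = math.floor(position), math.ceil(position)

    def value_at(index):
        # Find the value occupying the given zero-based rank.
        cumulative = 0
        for value in ordered_values:
            cumulative += counts[value]
            if index < cumulative:
                return value
        raise IndexError(index)

    low_value, high_value = value_at(lower), value_at(upper)
    return low_value + (high_value - low_value) * (position - lower)


quantiles = {q: quantile_from_counts(round_counts, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95)}
iqr = quantiles[0.75] - quantiles[0.25]
print(quantiles, iqr)
```

Under this assumption the five quantiles are 1, 2, 3, 4, and 7, and the interquartile range is 2, in agreement with the report.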

일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1898
Distinct (%): 19.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 947.5516
Minimum: 1
Maximum: 1898
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 91
Q1: 472
median: 949.5
Q3: 1419
95-th percentile: 1803
Maximum: 1898
Range: 1897
Interquartile range (IQR): 947

Descriptive statistics

Standard deviation: 546.95064
Coefficient of variation (CV): 0.57722517
Kurtosis: -1.1950916
Mean: 947.5516
Median Absolute Deviation (MAD): 473.5
Skewness: -0.0014415511
Sum: 9475516
Variance: 299155
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values
Value                Count  Frequency (%)
1639                     7   0.1%
1094                     7   0.1%
1760                     7   0.1%
1494                     7   0.1%
1220                     7   0.1%
1524                     7   0.1%
1563                     7   0.1%
315                      7   0.1%
805                      7   0.1%
361                      7   0.1%
Other values (1888)   9930  99.3%

Minimum values
Value  Count  Frequency (%)
1      5      0.1%
2      6      0.1%
3      6      0.1%
4      5      0.1%
5      5      0.1%
6      6      0.1%
7      5      0.1%
8      4      < 0.1%
9      6      0.1%
10     4      < 0.1%

Maximum values
Value  Count  Frequency (%)
1898   6      0.1%
1897   7      0.1%
1896   5      0.1%
1895   4      < 0.1%
1894   4      < 0.1%
1893   3      < 0.1%
1892   5      0.1%
1891   6      0.1%
1890   6      0.1%
1889   6      0.1%

과목명
Categorical

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value  Count
직업평가  1451
직무개발과 배치  1449
재활행정  1443
재활정책  1439
재활사례관리  1414
Other values (2)  2804

Length

Max length: 8
Median length: 4
Mean length: 5.1422
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 재활정책
2nd row: 재활사례관리
3rd row: 직무개발과 배치
4th row: 직업재활개론
5th row: 재활행정

Common Values

Value  Count  Frequency (%)
직업평가 (vocational assessment)  1451  14.5%
직무개발과 배치 (job development and placement)  1449  14.5%
재활행정 (rehabilitation administration)  1443  14.4%
재활정책 (rehabilitation policy)  1439  14.4%
재활사례관리 (rehabilitation case management)  1414  14.1%
재활상담 (rehabilitation counseling)  1405  14.1%
직업재활개론 (introduction to vocational rehabilitation)  1399  14.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
직업평가  1451  12.7%
직무개발과  1449  12.7%
배치  1449  12.7%
재활행정  1443  12.6%
재활정책  1439  12.6%
재활사례관리  1414  12.4%
재활상담  1405  12.3%
직업재활개론  1399  12.2%
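
The percentages in the Common Values (Plot) table differ from those in the Common Values table because the plot counts whitespace-separated words rather than whole category values. The subject 직무개발과 배치 therefore contributes two words, and the denominator grows from 10000 to 11449. A small sketch of that counting, assuming a plain whitespace split:

```python
from collections import Counter

# Category counts for 과목명, copied from the Common Values table above.
subject_counts = {
    "직업평가": 1451,
    "직무개발과 배치": 1449,
    "재활행정": 1443,
    "재활정책": 1439,
    "재활사례관리": 1414,
    "재활상담": 1405,
    "직업재활개론": 1399,
}

# Count each whitespace-separated word once per row in which it appears.
word_counts = Counter()
for subject, count in subject_counts.items():
    for word in subject.split():
        word_counts[word] += count

total_words = sum(word_counts.values())
word_share = {word: count / total_words for word, count in word_counts.items()}

for word, count in word_counts.most_common():
    print(word, count, f"{word_share[word]:.1%}")
```

The same effect explains why 직종 shows two words at 50.0% each even though the column holds a single constant value.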

과목별점수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 29
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14.7313
Minimum: 0
Maximum: 28
Zeros: 640
Zeros (%): 6.4%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 11
median: 16
Q3: 19
95-th percentile: 23
Maximum: 28
Range: 28
Interquartile range (IQR): 8

Descriptive statistics

Standard deviation: 6.1435067
Coefficient of variation (CV): 0.41703765
Kurtosis: 0.0071397461
Mean: 14.7313
Median Absolute Deviation (MAD): 4
Skewness: -0.72752426
Sum: 147313
Variance: 37.742675
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=29)

Common values
Value              Count  Frequency (%)
19                   852   8.5%
20                   850   8.5%
18                   727   7.3%
21                   685   6.9%
17                   667   6.7%
0                    640   6.4%
16                   582   5.8%
11                   527   5.3%
10                   519   5.2%
12                   464   4.6%
Other values (19)   3487  34.9%

Minimum values
Value  Count  Frequency (%)
0      640    6.4%
1        2    < 0.1%
2        8    0.1%
3       24    0.2%
4       41    0.4%
5       82    0.8%
6      128    1.3%
7      221    2.2%
8      332    3.3%
9      379    3.8%

Maximum values
Value  Count  Frequency (%)
28       5    0.1%
27      18    0.2%
26      50    0.5%
25      81    0.8%
24     153    1.5%
23     197    2.0%
22     441    4.4%
21     685    6.9%
20     850    8.5%
19     852    8.5%

총점
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 91
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 103.2493
Minimum: 0
Maximum: 144
Zeros: 638
Zeros (%): 6.4%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 99
median: 112
Q3: 121
95-th percentile: 130
Maximum: 144
Range: 144
Interquartile range (IQR): 22

Descriptive statistics

Standard deviation: 30.736884
Coefficient of variation (CV): 0.29769581
Kurtosis: 5.3568581
Mean: 103.2493
Median Absolute Deviation (MAD): 10
Skewness: -2.3572492
Sum: 1032493
Variance: 944.75603
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values
Value              Count  Frequency (%)
0                    638   6.4%
120                  304   3.0%
111                  296   3.0%
115                  293   2.9%
114                  293   2.9%
116                  287   2.9%
112                  283   2.8%
122                  279   2.8%
118                  271   2.7%
123                  271   2.7%
Other values (81)   6785  67.8%

Minimum values
Value  Count  Frequency (%)
0      638    6.4%
22       5    0.1%
37       6    0.1%
43       4    < 0.1%
50       7    0.1%
52       6    0.1%
55       6    0.1%
56      13    0.1%
60       5    0.1%
61       4    < 0.1%

Maximum values
Value  Count  Frequency (%)
144      3    < 0.1%
142     13    0.1%
141     22    0.2%
140      4    < 0.1%
139      5    0.1%
138     16    0.2%
137     21    0.2%
136     28    0.3%
135     43    0.4%
134     45    0.4%

합격여부
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value  Count
합격  7969
불합격  1337
결시  638
응시결격  56

Length

Max length: 4
Median length: 2
Mean length: 2.1449
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 합격
2nd row: 합격
3rd row: 합격
4th row: 합격
5th row: 합격

Common Values

Value  Count  Frequency (%)
합격 (pass)  7969  79.7%
불합격 (fail)  1337  13.4%
결시 (absent)  638  6.4%
응시결격 (disqualified)  56  0.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
합격  7969  79.7%
불합격  1337  13.4%
결시  638  6.4%
응시결격  56  0.6%

성별
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value  Count
(blank)  6144
(blank)  3856

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: (blank)
2nd row: (blank)
3rd row: (blank)
4th row: (blank)
5th row: (blank)

Common Values

Value  Count  Frequency (%)
(blank)  6144  61.4%
(blank)  3856  38.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
(blank)  6144  61.4%
(blank)  3856  38.6%

연령대
Categorical

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value  Count
20     6215
30     1791
40     1476
50      485
60       33

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20
2nd row: 20
3rd row: 20
4th row: 20
5th row: 20

Common Values

Value  Count  Frequency (%)
20     6215   62.2%
30     1791   17.9%
40     1476   14.8%
50      485    4.9%
60       33    0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
20     6215   62.2%
30     1791   17.9%
40     1476   14.8%
50      485    4.9%
60       33    0.3%

Interactions

[Figure: 25 pairwise interaction plots]

Correlations

Correlation matrix 1 of 3

           연도   회차   일련번호  과목명  과목별점수  총점   합격여부  성별   연령대
연도        1.000  1.000  0.865  0.000  0.466  0.592  0.204  0.075  0.531
회차        1.000  1.000  0.929  0.000  0.294  0.382  0.227  0.073  0.312
일련번호    0.865  0.929  1.000  0.000  0.340  0.438  0.229  0.088  0.445
과목명      0.000  0.000  0.000  1.000  0.615  0.000  0.000  0.000  0.000
과목별점수  0.466  0.294  0.340  0.615  1.000  0.812  0.785  0.068  0.255
총점        0.592  0.382  0.438  0.000  0.812  1.000  0.831  0.134  0.395
합격여부    0.204  0.227  0.229  0.000  0.785  0.831  1.000  0.091  0.160
성별        0.075  0.073  0.088  0.000  0.068  0.134  0.091  1.000  0.096
연령대      0.531  0.312  0.445  0.000  0.255  0.395  0.160  0.096  1.000

Correlation matrix 2 of 3

          과목명  합격여부  연령대  성별
과목명    1.000  0.000  0.000  0.000
합격여부  0.000  1.000  0.131  0.060
연령대    0.000  0.131  1.000  0.117
성별      0.000  0.060  0.117  1.000

Correlation matrix 3 of 3

           연도    회차    일련번호  과목별점수  총점    과목명  합격여부  성별   연령대
연도         1.000   0.982   0.966  -0.095  -0.138  0.000  0.155  0.073  0.201
회차         0.982   1.000   0.984  -0.093  -0.134  0.000  0.157  0.079  0.205
일련번호     0.966   0.984   1.000  -0.086  -0.125  0.000  0.139  0.067  0.200
과목별점수  -0.095  -0.093  -0.086   1.000   0.525  0.370  0.604  0.052  0.110
총점        -0.138  -0.134  -0.125   0.525   1.000  0.000  0.668  0.105  0.175
과목명       0.000   0.000   0.000   0.370   0.000  1.000  0.000  0.000  0.000
합격여부     0.155   0.157   0.139   0.604   0.668  0.000  1.000  0.060  0.131
성별         0.073   0.079   0.067   0.052   0.105  0.000  0.060  1.000  0.117
연령대       0.201   0.205   0.200   0.110   0.175  0.000  0.131  0.117  1.000
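
The High correlation alerts in the overview can be cross-checked against the last matrix above (the one containing negative entries). The sketch below flags, for each variable, every other variable whose absolute coefficient is at least 0.5. The 0.5 cutoff is an assumption: it reproduces the alerts listed in this report, but the report itself does not state the threshold. The function name is hypothetical.

```python
# Row and column order of the last correlation matrix above.
names = ["연도", "회차", "일련번호", "과목별점수", "총점", "과목명", "합격여부", "성별", "연령대"]

# Coefficients copied from that matrix, one list per row.
matrix = [
    [1.000, 0.982, 0.966, -0.095, -0.138, 0.000, 0.155, 0.073, 0.201],
    [0.982, 1.000, 0.984, -0.093, -0.134, 0.000, 0.157, 0.079, 0.205],
    [0.966, 0.984, 1.000, -0.086, -0.125, 0.000, 0.139, 0.067, 0.200],
    [-0.095, -0.093, -0.086, 1.000, 0.525, 0.370, 0.604, 0.052, 0.110],
    [-0.138, -0.134, -0.125, 0.525, 1.000, 0.000, 0.668, 0.105, 0.175],
    [0.000, 0.000, 0.000, 0.370, 0.000, 1.000, 0.000, 0.000, 0.000],
    [0.155, 0.157, 0.139, 0.604, 0.668, 0.000, 1.000, 0.060, 0.131],
    [0.073, 0.079, 0.067, 0.052, 0.105, 0.000, 0.060, 1.000, 0.117],
    [0.201, 0.205, 0.200, 0.110, 0.175, 0.000, 0.131, 0.117, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not stated in the report


def high_correlation_partners(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    flagged = {}
    for i, row in enumerate(matrix):
        partners = [
            names[j] for j, value in enumerate(row)
            if j != i and abs(value) >= threshold
        ]
        if partners:
            flagged[names[i]] = partners
    return flagged


flagged = high_correlation_partners(names, matrix, THRESHOLD)
for variable, partners in flagged.items():
    print(variable, "->", ", ".join(partners))
```

With this cutoff, six variables are flagged with two partners each, which agrees with the six "and 1 other field" alerts; 과목명, 성별, and 연령대 are not flagged.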

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index)  연도  직종  회차  일련번호  과목명  과목별점수  총점  합격여부  성별  연령대
2080   2018  2급 장애인재활상담사  1  298   재활정책  9   121  합격  (blank)  20
4343   2018  2급 장애인재활상담사  2  621   재활사례관리  21  126  합격  (blank)  20
1250   2018  2급 장애인재활상담사  1  179   직무개발과 배치  16  95   합격  (blank)  20
33     2018  2급 장애인재활상담사  1  5     직업재활개론  19  124  합격  (blank)  20
7623   2019  2급 장애인재활상담사  3  1090  재활행정  13  125  합격  (blank)  20
13088  2023  2급 장애인재활상담사  7  1870  직업재활개론  21  79   합격  (blank)  30
8272   2020  2급 장애인재활상담사  4  1182  직업재활개론  21  115  합격  (blank)  40
6471   2019  2급 장애인재활상담사  3  925   재활사례관리  17  101  합격  (blank)  20
1695   2018  2급 장애인재활상담사  1  243   재활정책  9   118  합격  (blank)  20
12142  2022  2급 장애인재활상담사  6  1735  직무개발과 배치  17  106  합격  (blank)  30

(index)  연도  직종  회차  일련번호  과목명  과목별점수  총점  합격여부  성별  연령대
11561  2021  2급 장애인재활상담사  5  1652  직무개발과 배치  21  115  합격  (blank)  30
756    2018  2급 장애인재활상담사  1  109   재활행정  11  103  합격  (blank)  30
11796  2022  2급 장애인재활상담사  6  1686  재활정책  0   0    결시  (blank)  40
3018   2018  2급 장애인재활상담사  2  432   재활정책  13  125  합격  (blank)  30
1235   2018  2급 장애인재활상담사  1  177   재활사례관리  18  120  합격  (blank)  20
3820   2018  2급 장애인재활상담사  2  546   직업재활개론  20  123  합격  (blank)  20
1917   2018  2급 장애인재활상담사  1  274   재활상담  19  116  합격  (blank)  20
918    2018  2급 장애인재활상담사  1  132   재활정책  12  134  합격  (blank)  20
4296   2018  2급 장애인재활상담사  2  614   직업재활개론  9   70   불합격  (blank)  40
2023   2018  2급 장애인재활상담사  1  290   재활행정  12  119  합격  (blank)  20