Overview

Dataset statistics

Number of variables: 10
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 918.0 KiB
Average record size in memory: 94.0 B

Variable types

Numeric: 5
Categorical: 5

Dataset

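Description: Provides information for analyzing the exam results of candidates for the national occupational therapist (작업치료사) licensing examination: year (연도), occupation (직종), exam round (회차), serial number (일련번호), subject name (과목명), score per subject (과목별점수), total score (총점), pass/fail status (합격여부), gender (성별), and age group (연령대).
URL: https://www.data.go.kr/data/15083514/fileData.do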

Alerts

직종 has constant value "작업치료사" (Constant)
연도 is highly overall correlated with 회차 and 1 other field (High correlation)
회차 is highly overall correlated with 연도 and 1 other field (High correlation)
일련번호 is highly overall correlated with 연도 and 1 other field (High correlation)
총점 is highly overall correlated with 합격여부 (High correlation)
합격여부 is highly overall correlated with 총점 (High correlation)
합격여부 is highly imbalanced (53.4%) (Imbalance)
연령대 is highly imbalanced (84.3%) (Imbalance)
과목별점수 has 199 (2.0%) zeros (Zeros)
총점 has 199 (2.0%) zeros (Zeros)
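
The two imbalance percentages above are consistent with a score defined as one minus the normalized Shannon entropy of the class distribution. The sketch below works under that assumption and is not taken from the profiling tool itself; `imbalance_score` is a hypothetical helper name, and the class counts are copied from the 합격여부 and 연령대 sections of this report.

```python
import math


def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum possible entropy) for class counts."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # a single class leaves nothing to balance
    # Entropy in bits; empty classes contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


# Class counts as reported in this document (assumed to be complete).
pass_fail_counts = [7400, 2397, 199, 4]   # 합격여부
age_group_counts = [9507, 430, 54, 9]     # 연령대

print(f"합격여부: {imbalance_score(pass_fail_counts):.1%}")
print(f"연령대: {imbalance_score(age_group_counts):.1%}")
```

A score of 0 would mean perfectly balanced classes, and a score close to 1 means that nearly all rows fall into one class.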

Reproduction

Analysis started: 2023-12-12 01:22:41.851648
Analysis finished: 2023-12-12 01:22:47.853424
Duration: 6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 14
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2009.4132
Minimum: 2000
Maximum: 2014
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2000
5-th percentile: 2003
Q1: 2008
Median: 2010
Q3: 2012
95-th percentile: 2013
Maximum: 2014
Range: 14
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 3.0962275
Coefficient of variation (CV): 0.0015408615
Kurtosis: -0.19970905
Mean: 2009.4132
Median Absolute Deviation (MAD): 2
Skewness: -0.64546349
Sum: 20094132
Variance: 9.5866244
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=14)
Value | Count | Frequency (%)
2009 | 1763 | 17.6%
2013 | 1510 | 15.1%
2012 | 1135 | 11.3%
2011 | 1050 | 10.5%
2010 | 941 | 9.4%
2008 | 748 | 7.5%
2007 | 648 | 6.5%
2006 | 565 | 5.7%
2003 | 521 | 5.2%
2005 | 470 | 4.7%
Other values (4) | 649 | 6.5%

Value | Count | Frequency (%)
2000 | 18 | 0.2%
2001 | 70 | 0.7%
2002 | 141 | 1.4%
2003 | 521 | 5.2%
2005 | 470 | 4.7%
2006 | 565 | 5.7%
2007 | 648 | 6.5%
2008 | 748 | 7.5%
2009 | 1763 | 17.6%
2010 | 941 | 9.4%

Value | Count | Frequency (%)
2014 | 420 | 4.2%
2013 | 1510 | 15.1%
2012 | 1135 | 11.3%
2011 | 1050 | 10.5%
2010 | 941 | 9.4%
2009 | 1763 | 17.6%
2008 | 748 | 7.5%
2007 | 648 | 6.5%
2006 | 565 | 5.7%
2005 | 470 | 4.7%
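
Several of the statistics reported for 연도 can be derived from one another. As a rough cross-check, the sketch below recomputes the range, the interquartile range, the coefficient of variation, and the variance from the other reported figures. The variable names are illustrative, and the inputs are the rounded values printed in this report, so agreement is only expected up to rounding.

```python
# Values copied from the 연도 quantile and descriptive statistics in this report.
mean = 2009.4132
std = 3.0962275
minimum, maximum = 2000, 2014
q1, q3 = 2008, 2012

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)
variance = std ** 2              # Variance

print("Range:", value_range)
print("IQR:", iqr)
print(f"CV: {cv:.10f}")
print(f"Variance: {variance:.4f}")
```

The same relationships can be used to sanity-check the other numeric variables in the report.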

직종
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
작업치료사 | 10000

Length

Max length: 35
Median length: 35
Mean length: 35
Min length: 35

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 작업치료사
2nd row: 작업치료사
3rd row: 작업치료사
4th row: 작업치료사
5th row: 작업치료사

Common Values

Value | Count | Frequency (%)
작업치료사 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
작업치료사 | 10000 | 100.0%

회차
Real number (ℝ)

HIGH CORRELATION 

Distinct: 16
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.0368
Minimum: 27
Maximum: 42
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 27
5-th percentile: 31
Q1: 35
Median: 38
Q3: 40
95-th percentile: 41
Maximum: 42
Range: 15
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 3.4467715
Coefficient of variation (CV): 0.093063426
Kurtosis: -0.55723444
Mean: 37.0368
Median Absolute Deviation (MAD): 3
Skewness: -0.5541578
Sum: 370368
Variance: 11.880234
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=16)
Value | Count | Frequency (%)
41 | 1510 | 15.1%
40 | 1135 | 11.3%
39 | 1050 | 10.5%
38 | 941 | 9.4%
37 | 894 | 8.9%
36 | 869 | 8.7%
35 | 748 | 7.5%
34 | 648 | 6.5%
33 | 565 | 5.7%
32 | 470 | 4.7%
Other values (6) | 1170 | 11.7%

Value | Count | Frequency (%)
27 | 18 | 0.2%
28 | 70 | 0.7%
29 | 141 | 1.4%
30 | 235 | 2.4%
31 | 286 | 2.9%
32 | 470 | 4.7%
33 | 565 | 5.7%
34 | 648 | 6.5%
35 | 748 | 7.5%
36 | 869 | 8.7%

Value | Count | Frequency (%)
42 | 420 | 4.2%
41 | 1510 | 15.1%
40 | 1135 | 11.3%
39 | 1050 | 10.5%
38 | 941 | 9.4%
37 | 894 | 8.9%
36 | 869 | 8.7%
35 | 748 | 7.5%
34 | 648 | 6.5%
33 | 565 | 5.7%

일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 7402
Distinct (%): 74.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6818.5914
Minimum: 1
Maximum: 14022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 618.95
Q1: 3423
Median: 6798.5
Q3: 10198
95-th percentile: 12950.1
Maximum: 14022
Range: 14021
Interquartile range (IQR): 6775

Descriptive statistics

Standard deviation: 3954.7638
Coefficient of variation (CV): 0.57999718
Kurtosis: -1.1747245
Mean: 6818.5914
Median Absolute Deviation (MAD): 3387
Skewness: 0.0085214053
Sum: 68185914
Variance: 15640157
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
545 | 5 | 0.1%
1213 | 5 | 0.1%
12593 | 5 | 0.1%
4298 | 5 | 0.1%
10915 | 4 | < 0.1%
8783 | 4 | < 0.1%
6989 | 4 | < 0.1%
311 | 4 | < 0.1%
12758 | 4 | < 0.1%
11765 | 4 | < 0.1%
Other values (7392) | 9956 | 99.6%

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
5 | 1 | < 0.1%
7 | 1 | < 0.1%
11 | 1 | < 0.1%
12 | 1 | < 0.1%
14 | 2 | < 0.1%
17 | 1 | < 0.1%
19 | 2 | < 0.1%

Value | Count | Frequency (%)
14022 | 1 | < 0.1%
14021 | 1 | < 0.1%
14020 | 1 | < 0.1%
14012 | 1 | < 0.1%
14008 | 1 | < 0.1%
14003 | 1 | < 0.1%
14002 | 1 | < 0.1%
14001 | 1 | < 0.1%
14000 | 1 | < 0.1%
13999 | 1 | < 0.1%

과목명
Categorical

Distinct: 10
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
작업치료사 실기 | 1440
의료관계법규 | 1435
공중보건학 개론 | 1405
일상생활동작 | 1400
수예 및 공작 | 1358
Other values (5) | 2962

Length

Max length: 8
Median length: 7
Mean length: 7.0314
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공중보건학 개론
2nd row: 일상생활동작
3rd row: 수예 및 공작
4th row: 수예 및 공작
5th row: 수예 및 공작

Common Values

Value | Count | Frequency (%)
작업치료사 실기 | 1440 | 14.4%
의료관계법규 | 1435 | 14.3%
공중보건학 개론 | 1405 | 14.1%
일상생활동작 | 1400 | 14.0%
수예 및 공작 | 1358 | 13.6%
작업치료 개론 | 978 | 9.8%
해부생리학 개요 | 873 | 8.7%
작업치료학 | 560 | 5.6%
해부생리학 개론 | 427 | 4.3%
작업치료학 기초 | 124 | 1.2%
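
The variable summary for 과목명 shows only the five most frequent subjects and folds the rest into a single "Other values" row. A minimal sketch of that aggregation follows, using the counts from the Common Values table; `common_values` is a hypothetical helper written for illustration, not part of any library.

```python
def common_values(counts, top_n):
    """Split value counts into the top_n most frequent rows and an 'Other values' summary."""
    ranked = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    top, rest = ranked[:top_n], ranked[top_n:]
    # The summary holds (number of remaining distinct values, their combined count).
    other = (len(rest), sum(count for _, count in rest))
    return top, other


# Subject counts copied from the 과목명 Common Values table.
subject_counts = {
    "작업치료사 실기": 1440,
    "의료관계법규": 1435,
    "공중보건학 개론": 1405,
    "일상생활동작": 1400,
    "수예 및 공작": 1358,
    "작업치료 개론": 978,
    "해부생리학 개요": 873,
    "작업치료학": 560,
    "해부생리학 개론": 427,
    "작업치료학 기초": 124,
}

total = sum(subject_counts.values())
top, other = common_values(subject_counts, top_n=5)
for value, count in top:
    print(f"{value} | {count} | {count / total:.1%}")
print(f"Other values ({other[0]}) | {other[1]} | {other[1] / total:.1%}")
```

With `top_n=5` the remainder covers five distinct subjects, which matches the "Other values (5)" row of the summary.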

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
개론 | 2810 | 15.6%
작업치료사 | 1440 | 8.0%
실기 | 1440 | 8.0%
의료관계법규 | 1435 | 8.0%
공중보건학 | 1405 | 7.8%
일상생활동작 | 1400 | 7.8%
수예 | 1358 | 7.6%
및 | 1358 | 7.6%
공작 | 1358 | 7.6%
해부생리학 | 1300 | 7.2%
Other values (4) | 2659 | 14.8%
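
The counts in the last table appear to be word-level rather than category-level: for example, 2810 for 개론 equals the combined count of the three subjects whose names contain that word. The sketch below reproduces the figures by splitting each subject name on whitespace and weighting each word by the subject's row count. Whitespace tokenization is an assumption made for illustration, not a documented detail of the profiling tool.

```python
from collections import Counter

# Subject counts copied from the 과목명 Common Values table.
subject_counts = {
    "작업치료사 실기": 1440,
    "의료관계법규": 1435,
    "공중보건학 개론": 1405,
    "일상생활동작": 1400,
    "수예 및 공작": 1358,
    "작업치료 개론": 978,
    "해부생리학 개요": 873,
    "작업치료학": 560,
    "해부생리학 개론": 427,
    "작업치료학 기초": 124,
}

# Count each whitespace-separated word once per row in which it occurs.
word_counts = Counter()
for value, count in subject_counts.items():
    for word in value.split():
        word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common(3):
    print(f"{word} | {count} | {count / total_words:.1%}")
```

Because the percentages are taken over all words rather than all rows, they do not match the per-category frequencies in the Common Values table.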

과목별점수
Real number (ℝ)

ZEROS 

Distinct: 101
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 28.93785
Minimum: 0
Maximum: 97.5
Zeros: 199
Zeros (%): 2.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 9
Q1: 13
Median: 19
Q3: 42
95-th percentile: 77.5
Maximum: 97.5
Range: 97.5
Interquartile range (IQR): 29

Descriptive statistics

Standard deviation: 21.978279
Coefficient of variation (CV): 0.75949939
Kurtosis: 0.15047124
Mean: 28.93785
Median Absolute Deviation (MAD): 7
Skewness: 1.1521891
Sum: 289378.5
Variance: 483.04477
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
13.0 | 554 | 5.5%
12.0 | 539 | 5.4%
14.0 | 499 | 5.0%
11.0 | 466 | 4.7%
15.0 | 452 | 4.5%
16.0 | 432 | 4.3%
18.0 | 389 | 3.9%
17.0 | 385 | 3.9%
10.0 | 326 | 3.3%
19.0 | 280 | 2.8%
Other values (91) | 5678 | 56.8%

Value | Count | Frequency (%)
0.0 | 199 | 2.0%
1.0 | 1 | < 0.1%
2.0 | 1 | < 0.1%
3.0 | 6 | 0.1%
4.0 | 6 | 0.1%
5.0 | 16 | 0.2%
6.0 | 37 | 0.4%
7.0 | 74 | 0.7%
8.0 | 135 | 1.4%
9.0 | 230 | 2.3%

Value | Count | Frequency (%)
97.5 | 1 | < 0.1%
95.0 | 6 | 0.1%
92.5 | 15 | 0.1%
90.0 | 30 | 0.3%
87.5 | 51 | 0.5%
85.0 | 79 | 0.8%
82.5 | 100 | 1.0%
82.0 | 1 | < 0.1%
81.0 | 3 | < 0.1%
80.0 | 120 | 1.2%

총점
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 320
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 185.7656
Minimum: 0
Maximum: 271.5
Zeros: 199
Zeros (%): 2.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 89
Q1: 153
Median: 204.5
Q3: 228.5
95-th percentile: 251
Maximum: 271.5
Range: 271.5
Interquartile range (IQR): 75.5

Descriptive statistics

Standard deviation: 57.462841
Coefficient of variation (CV): 0.30932983
Kurtosis: 0.52525124
Mean: 185.7656
Median Absolute Deviation (MAD): 30
Skewness: -1.0359609
Sum: 1857656
Variance: 3301.9781
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0.0 | 199 | 2.0%
118.0 | 108 | 1.1%
119.0 | 100 | 1.0%
113.0 | 100 | 1.0%
116.0 | 99 | 1.0%
115.0 | 87 | 0.9%
117.0 | 84 | 0.8%
218.0 | 81 | 0.8%
114.0 | 79 | 0.8%
112.0 | 74 | 0.7%
Other values (310) | 8989 | 89.9%

Value | Count | Frequency (%)
0.0 | 199 | 2.0%
46.0 | 2 | < 0.1%
47.0 | 1 | < 0.1%
52.0 | 3 | < 0.1%
54.0 | 4 | < 0.1%
55.0 | 3 | < 0.1%
58.0 | 1 | < 0.1%
59.0 | 2 | < 0.1%
60.0 | 2 | < 0.1%
61.0 | 1 | < 0.1%

Value | Count | Frequency (%)
271.5 | 3 | < 0.1%
271.0 | 8 | 0.1%
270.5 | 2 | < 0.1%
270.0 | 3 | < 0.1%
269.5 | 6 | 0.1%
269.0 | 7 | 0.1%
268.5 | 8 | 0.1%
268.0 | 3 | < 0.1%
267.5 | 3 | < 0.1%
267.0 | 5 | 0.1%

합격여부
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
합격 | 7400
불합격 | 2397
결시 | 199
응시결격 | 4

Length

Max length: 4
Median length: 2
Mean length: 2.2405
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 합격
2nd row: 합격
3rd row: 합격
4th row: 합격
5th row: 합격

Common Values

Value | Count | Frequency (%)
합격 | 7400 | 74.0%
불합격 | 2397 | 24.0%
결시 | 199 | 2.0%
응시결격 | 4 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
합격 | 7400 | 74.0%
불합격 | 2397 | 24.0%
결시 | 199 | 2.0%
응시결격 | 4 | < 0.1%

성별
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
 | 7522
 | 2478

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row:
2nd row:
3rd row:
4th row:
5th row:

Common Values

Value | Count | Frequency (%)
 | 7522 | 75.2%
 | 2478 | 24.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
 | 7522 | 75.2%
 | 2478 | 24.8%

연령대
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
20 | 9507
30 | 430
40 | 54
50 | 9

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20
2nd row: 20
3rd row: 20
4th row: 20
5th row: 20

Common Values

Value | Count | Frequency (%)
20 | 9507 | 95.1%
30 | 430 | 4.3%
40 | 54 | 0.5%
50 | 9 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
20 | 9507 | 95.1%
30 | 430 | 4.3%
40 | 54 | 0.5%
50 | 9 | 0.1%

Interactions

[Figure: pairwise interaction scatter plots of the numeric variables]

Correlations

 | 연도 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대
연도 | 1.000 | 0.999 | 0.971 | 0.526 | 0.309 | 0.656 | 0.178 | 0.140 | 0.042
회차 | 0.999 | 1.000 | 0.974 | 0.489 | 0.299 | 0.648 | 0.173 | 0.133 | 0.044
일련번호 | 0.971 | 0.974 | 1.000 | 0.590 | 0.329 | 0.635 | 0.164 | 0.110 | 0.066
과목명 | 0.526 | 0.489 | 0.590 | 1.000 | 0.874 | 0.243 | 0.034 | 0.037 | 0.000
과목별점수 | 0.309 | 0.299 | 0.329 | 0.874 | 1.000 | 0.622 | 0.524 | 0.050 | 0.061
총점 | 0.656 | 0.648 | 0.635 | 0.243 | 0.622 | 1.000 | 0.905 | 0.080 | 0.145
합격여부 | 0.178 | 0.173 | 0.164 | 0.034 | 0.524 | 0.905 | 1.000 | 0.094 | 0.208
성별 | 0.140 | 0.133 | 0.110 | 0.037 | 0.050 | 0.080 | 0.094 | 1.000 | 0.226
연령대 | 0.042 | 0.044 | 0.066 | 0.000 | 0.061 | 0.145 | 0.208 | 0.226 | 1.000

 | 합격여부 | 연령대 | 과목명 | 성별
합격여부 | 1.000 | 0.083 | 0.021 | 0.062
연령대 | 0.083 | 1.000 | 0.000 | 0.150
과목명 | 0.021 | 0.000 | 1.000 | 0.029
성별 | 0.062 | 0.150 | 0.029 | 1.000

 | 연도 | 회차 | 일련번호 | 과목별점수 | 총점 | 과목명 | 합격여부 | 성별 | 연령대
연도 | 1.000 | 0.998 | 0.993 | 0.025 | -0.313 | 0.185 | 0.107 | 0.108 | 0.025
회차 | 0.998 | 1.000 | 0.995 | 0.025 | -0.311 | 0.169 | 0.102 | 0.104 | 0.028
일련번호 | 0.993 | 0.995 | 1.000 | 0.027 | -0.306 | 0.217 | 0.098 | 0.084 | 0.040
과목별점수 | 0.025 | 0.025 | 0.027 | 1.000 | 0.246 | 0.455 | 0.341 | 0.038 | 0.036
총점 | -0.313 | -0.311 | -0.306 | 0.246 | 1.000 | 0.077 | 0.790 | 0.061 | 0.087
과목명 | 0.185 | 0.169 | 0.217 | 0.455 | 0.077 | 1.000 | 0.021 | 0.029 | 0.000
합격여부 | 0.107 | 0.102 | 0.098 | 0.341 | 0.790 | 0.021 | 1.000 | 0.062 | 0.083
성별 | 0.108 | 0.104 | 0.084 | 0.038 | 0.061 | 0.029 | 0.062 | 1.000 | 0.150
연령대 | 0.025 | 0.028 | 0.040 | 0.036 | 0.087 | 0.000 | 0.083 | 0.150 | 1.000
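
The High correlation alerts in the overview can be reproduced from the last matrix above by flagging every pair of columns whose absolute coefficient reaches a cutoff. The sketch below assumes a cutoff of 0.5, which is not stated anywhere in this report, and uses a hypothetical helper named `high_correlation_pairs`.

```python
from itertools import combinations

# Column order and coefficients copied from the last correlation matrix in this report.
columns = ["연도", "회차", "일련번호", "과목별점수", "총점", "과목명", "합격여부", "성별", "연령대"]
matrix = [
    [1.000, 0.998, 0.993, 0.025, -0.313, 0.185, 0.107, 0.108, 0.025],
    [0.998, 1.000, 0.995, 0.025, -0.311, 0.169, 0.102, 0.104, 0.028],
    [0.993, 0.995, 1.000, 0.027, -0.306, 0.217, 0.098, 0.084, 0.040],
    [0.025, 0.025, 0.027, 1.000, 0.246, 0.455, 0.341, 0.038, 0.036],
    [-0.313, -0.311, -0.306, 0.246, 1.000, 0.077, 0.790, 0.061, 0.087],
    [0.185, 0.169, 0.217, 0.455, 0.077, 1.000, 0.021, 0.029, 0.000],
    [0.107, 0.102, 0.098, 0.341, 0.790, 0.021, 1.000, 0.062, 0.083],
    [0.108, 0.104, 0.084, 0.038, 0.061, 0.029, 0.062, 1.000, 0.150],
    [0.025, 0.028, 0.040, 0.036, 0.087, 0.000, 0.083, 0.150, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff; the report does not state the value used


def high_correlation_pairs(columns, matrix, threshold):
    """Return (column_a, column_b, coefficient) for each pair at or above the threshold."""
    pairs = []
    for i, j in combinations(range(len(columns)), 2):
        if abs(matrix[i][j]) >= threshold:
            pairs.append((columns[i], columns[j], matrix[i][j]))
    return pairs


for a, b, value in high_correlation_pairs(columns, matrix, THRESHOLD):
    print(f"{a} - {b}: {value:.3f}")
```

Under this assumption the flagged pairs are 연도, 회차, and 일련번호 with one another, plus 총점 with 합격여부, which lines up with the alert list. The 과목명 and 과목별점수 pair (0.455) falls just under the assumed cutoff.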

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

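 | 연도 | 직종 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대
9894 | 2005 | 작업치료사 | 32 | 1414 | 공중보건학 개론 | 18.0 | 255.0 | 합격 |  | 20
55411 | 2010 | 작업치료사 | 38 | 7916 | 일상생활동작 | 24.0 | 237.0 | 합격 |  | 20
212 | 2001 | 작업치료사 | 28 | 31 | 수예 및 공작 | 11.0 | 247.5 | 합격 |  | 20
28709 | 2008 | 작업치료사 | 35 | 4102 | 수예 및 공작 | 8.0 | 207.0 | 합격 |  | 20
57969 | 2011 | 작업치료사 | 39 | 8282 | 수예 및 공작 | 11.0 | 231.0 | 합격 |  | 20
4229 | 2003 | 작업치료사 | 31 | 605 | 의료관계법규 | 18.0 | 224.5 | 합격 |  | 20
27544 | 2008 | 작업치료사 | 35 | 3935 | 일상생활동작 | 23.0 | 223.5 | 합격 |  | 20
49840 | 2010 | 작업치료사 | 38 | 7121 | 해부생리학 개요 | 23.0 | 251.0 | 합격 |  | 20
25172 | 2008 | 작업치료사 | 35 | 3597 | 해부생리학 개론 | 27.0 | 212.5 | 합격 |  | 20
35786 | 2009 | 작업치료사 | 36 | 5113 | 수예 및 공작 | 15.0 | 237.0 | 합격 |  | 20

 | 연도 | 직종 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대
51376 | 2010 | 작업치료사 | 38 | 7340 | 공중보건학 개론 | 15.0 | 216.5 | 합격 |  | 20
24880 | 2008 | 작업치료사 | 35 | 3555 | 수예 및 공작 | 6.0 | 212.0 | 합격 |  | 20
19353 | 2007 | 작업치료사 | 34 | 2765 | 작업치료학 | 52.0 | 198.0 | 합격 |  | 20
29891 | 2008 | 작업치료사 | 35 | 4271 | 의료관계법규 | 12.0 | 204.0 | 합격 |  | 20
63029 | 2011 | 작업치료사 | 39 | 9005 | 의료관계법규 | 16.0 | 205.5 | 합격 |  | 20
92274 | 2014 | 작업치료사 | 42 | 13268 | 작업치료학 | 61.0 | 170.0 | 합격 |  | 20
62191 | 2011 | 작업치료사 | 39 | 8885 | 공중보건학 개론 | 18.0 | 219.5 | 합격 |  | 20
75793 | 2012 | 작업치료사 | 40 | 10828 | 작업치료사 실기 | 75.0 | 212.0 | 합격 |  | 20
34457 | 2009 | 작업치료사 | 36 | 4923 | 공중보건학 개론 | 13.0 | 214.5 | 합격 |  | 20
71035 | 2012 | 작업치료사 | 40 | 10148 | 일상생활동작 | 20.0 | 214.0 | 합격 |  | 20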