Overview

Dataset statistics

Number of variables: 10
Number of observations: 435
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 36.7 KiB
Average record size in memory: 86.3 B

Variable types

Numeric: 5
Categorical: 5

Dataset

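Description: Provides information for analyzing the score results of candidates for the Level 1 Health Educator (보건교육사 1급) national examination: year (연도), occupation (직종), exam round (회차), serial number (일련번호), subject name (과목명), score per subject (과목별점수), total score (총점), pass/fail status (합격여부), sex (성별), and age group (연령대).
URL: https://www.data.go.kr/data/15083524/fileData.do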

Alerts

직종 has constant value "보건교육사 1급" (Constant)
연도 is highly overall correlated with 회차 and 2 other fields (High correlation)
회차 is highly overall correlated with 연도 and 2 other fields (High correlation)
일련번호 is highly overall correlated with 연도 and 2 other fields (High correlation)
과목별점수 is highly overall correlated with 총점 and 1 other field (High correlation)
총점 is highly overall correlated with 연도 and 4 other fields (High correlation)
합격여부 is highly overall correlated with 과목별점수 and 1 other field (High correlation)
과목별점수 has 54 (12.4%) zeros (Zeros)
총점 has 54 (12.4%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-12 12:48:59.099442
Analysis finished: 2023-12-12 12:49:02.577475
Duration: 3.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
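
The reported duration can be checked against the two timestamps above. The snippet below is a minimal sketch using only Python's standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of this report.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 12:48:59.099442", FMT)
finished = datetime.strptime("2023-12-12 12:49:02.577475", FMT)

# Elapsed time in seconds, rounded to two decimals as in the report.
duration_seconds = round((finished - started).total_seconds(), 2)
print(f"Duration: {duration_seconds} seconds")
```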

Variables

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 12
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2017.1724
Minimum: 2011
Maximum: 2023
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 2011
5th percentile: 2011
Q1: 2013
Median: 2018
Q3: 2021
95th percentile: 2023
Maximum: 2023
Range: 12
Interquartile range (IQR): 8

Descriptive statistics

Standard deviation: 3.9108937
Coefficient of variation (CV): 0.0019387999
Kurtosis: -1.2253403
Mean: 2017.1724
Median Absolute Deviation (MAD): 3
Skewness: -0.25543151
Sum: 877470
Variance: 15.29509
Monotonicity: Increasing
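
Several of the figures above are linked by simple identities: the mean is the sum divided by the number of observations, the variance is the square of the standard deviation, the coefficient of variation is the standard deviation divided by the mean, and the range and IQR are differences of quantiles. As a rough sketch in plain Python (values copied from this report, names illustrative), they can be cross-checked like this:

```python
import math

# Values copied from the 연도 statistics in this report.
n_observations = 435
total = 877470
mean = 2017.1724
std_dev = 3.9108937
variance = 15.29509
cv = 0.0019387999
minimum, q1, q3, maximum = 2011, 2013, 2021, 2023
value_range, iqr = 12, 8

# Tolerances allow for the rounding applied to the printed values.
checks = {
    "mean = sum / n": math.isclose(total / n_observations, mean, rel_tol=1e-6),
    "variance = std^2": math.isclose(std_dev ** 2, variance, rel_tol=1e-6),
    "CV = std / mean": math.isclose(std_dev / mean, cv, rel_tol=1e-6),
    "range = max - min": maximum - minimum == value_range,
    "IQR = Q3 - Q1": q3 - q1 == iqr,
}

for label, ok in checks.items():
    print(f"{label}: {'consistent' if ok else 'inconsistent'}")
```

The same identities apply to the other numeric variables in the report.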
Histogram with fixed size bins (bins=12)
Common values

Value             Count  Frequency (%)
2012              63     14.5%
2019              63     14.5%
2018              54     12.4%
2017              51     11.7%
2021              51     11.7%
2011              42     9.7%
2022              39     9.0%
2023              30     6.9%
2013              18     4.1%
2015              9      2.1%
Other values (2)  15     3.4%

Minimum 10 values

Value  Count  Frequency (%)
2011   42     9.7%
2012   63     14.5%
2013   18     4.1%
2014   6      1.4%
2015   9      2.1%
2016   9      2.1%
2017   51     11.7%
2018   54     12.4%
2019   63     14.5%
2021   51     11.7%

Maximum 10 values

Value  Count  Frequency (%)
2023   30     6.9%
2022   39     9.0%
2021   51     11.7%
2019   63     14.5%
2018   54     12.4%
2017   51     11.7%
2016   9      2.1%
2015   9      2.1%
2014   6      1.4%
2013   18     4.1%

직종
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.5 KiB
보건교육사 1급: 435

Length

Max length: 34
Median length: 34
Mean length: 34
Min length: 34

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 보건교육사 1급
2nd row: 보건교육사 1급
3rd row: 보건교육사 1급
4th row: 보건교육사 1급
5th row: 보건교육사 1급

Common Values

Value          Count  Frequency (%)
보건교육사 1급  435    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value       Count  Frequency (%)
보건교육사  435    50.0%
1급         435    50.0%

회차
Real number (ℝ)

HIGH CORRELATION 

Distinct: 13
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.6965517
Minimum: 2
Maximum: 14
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 2
5th percentile: 2
Q1: 5
Median: 10
Q3: 12
95th percentile: 14
Maximum: 14
Range: 12
Interquartile range (IQR): 7

Descriptive statistics

Standard deviation: 3.9037808
Coefficient of variation (CV): 0.44888835
Kurtosis: -1.1148623
Mean: 8.6965517
Median Absolute Deviation (MAD): 2
Skewness: -0.52749089
Sum: 3783
Variance: 15.239504
Monotonicity: Increasing
Histogram with fixed size bins (bins=13)
Common values

Value             Count  Frequency (%)
11                63     14.5%
10                54     12.4%
9                 51     11.7%
12                51     11.7%
3                 45     10.3%
2                 42     9.7%
13                39     9.0%
14                30     6.9%
4                 18     4.1%
5                 18     4.1%
Other values (3)  24     5.5%

Minimum 10 values

Value  Count  Frequency (%)
2      42     9.7%
3      45     10.3%
4      18     4.1%
5      18     4.1%
6      6      1.4%
7      9      2.1%
8      9      2.1%
9      51     11.7%
10     54     12.4%
11     63     14.5%

Maximum 10 values

Value  Count  Frequency (%)
14     30     6.9%
13     39     9.0%
12     51     11.7%
11     63     14.5%
10     54     12.4%
9      51     11.7%
8      9      2.1%
7      9      2.1%
6      6      1.4%
5      18     4.1%

일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 145
Distinct (%): 33.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 73
Minimum: 1
Maximum: 145
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 1
5th percentile: 8
Q1: 37
Median: 73
Q3: 109
95th percentile: 138
Maximum: 145
Range: 144
Interquartile range (IQR): 72

Descriptive statistics

Standard deviation: 41.905094
Coefficient of variation (CV): 0.57404238
Kurtosis: -1.2001026
Mean: 73
Median Absolute Deviation (MAD): 36
Skewness: 0
Sum: 31755
Variance: 1756.0369
Monotonicity: Increasing
Histogram with fixed size bins (bins=50)
Common values

Value               Count  Frequency (%)
1                   3      0.7%
110                 3      0.7%
94                  3      0.7%
95                  3      0.7%
96                  3      0.7%
97                  3      0.7%
98                  3      0.7%
99                  3      0.7%
100                 3      0.7%
101                 3      0.7%
Other values (135)  405    93.1%

Minimum 10 values

Value  Count  Frequency (%)
1      3      0.7%
2      3      0.7%
3      3      0.7%
4      3      0.7%
5      3      0.7%
6      3      0.7%
7      3      0.7%
8      3      0.7%
9      3      0.7%
10     3      0.7%

Maximum 10 values

Value  Count  Frequency (%)
145    3      0.7%
144    3      0.7%
143    3      0.7%
142    3      0.7%
141    3      0.7%
140    3      0.7%
139    3      0.7%
138    3      0.7%
137    3      0.7%
136    3      0.7%

과목명
Categorical

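Distinct: 3
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.5 KiB
보건교육방법론 (Health Education Methodology): 145
보건사업관리 (Health Program Management): 145
보건프로그램 개발 및 평가 (Health Program Development and Evaluation): 145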

Length

Max length: 14
Median length: 7
Mean length: 9
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 보건교육방법론
2nd row: 보건사업관리
3rd row: 보건프로그램 개발 및 평가
4th row: 보건사업관리
5th row: 보건교육방법론

Common Values

Value                      Count  Frequency (%)
보건교육방법론             145    33.3%
보건사업관리               145    33.3%
보건프로그램 개발 및 평가  145    33.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value           Count  Frequency (%)
보건교육방법론  145    16.7%
보건사업관리    145    16.7%
보건프로그램    145    16.7%
개발            145    16.7%
및              145    16.7%
평가            145    16.7%

과목별점수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 25
Distinct (%): 5.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.491954
Minimum: 0
Maximum: 30
Zeros: 54
Zeros (%): 12.4%
Negative: 0
Negative (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 9
Median: 12
Q3: 15
95th percentile: 20
Maximum: 30
Range: 30
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 5.8582963
Coefficient of variation (CV): 0.50977373
Kurtosis: 0.16637261
Mean: 11.491954
Median Absolute Deviation (MAD): 3
Skewness: -0.34164794
Sum: 4999
Variance: 34.319636
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=25)
Common values

Value              Count  Frequency (%)
0                  54     12.4%
11                 42     9.7%
10                 40     9.2%
13                 40     9.2%
12                 39     9.0%
14                 32     7.4%
15                 30     6.9%
16                 25     5.7%
9                  21     4.8%
8                  20     4.6%
Other values (15)  92     21.1%

Minimum 10 values

Value  Count  Frequency (%)
0      54     12.4%
3      1      0.2%
4      3      0.7%
5      4      0.9%
6      8      1.8%
7      7      1.6%
8      20     4.6%
9      21     4.8%
10     40     9.2%
11     42     9.7%

Maximum 10 values

Value  Count  Frequency (%)
30     1      0.2%
25     4      0.9%
24     4      0.9%
23     2      0.5%
22     5      1.1%
21     5      1.1%
20     11     2.5%
19     12     2.8%
18     10     2.3%
17     15     3.4%

총점
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 39
Distinct (%): 9.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 34.475862
Minimum: 0
Maximum: 65
Zeros: 54
Zeros (%): 12.4%
Negative: 0
Negative (%): 0.0%
Memory size: 4.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 31
Median: 36
Q3: 46
95th percentile: 54
Maximum: 65
Range: 65
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 15.850577
Coefficient of variation (CV): 0.45975867
Kurtosis: 0.47105326
Mean: 34.475862
Median Absolute Deviation (MAD): 7
Skewness: -0.87500587
Sum: 14997
Variance: 251.24078
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=39)
Common values

Value              Count  Frequency (%)
0                  54     12.4%
37                 27     6.2%
31                 24     5.5%
34                 24     5.5%
36                 21     4.8%
32                 18     4.1%
47                 18     4.1%
35                 15     3.4%
40                 15     3.4%
42                 15     3.4%
Other values (29)  204    46.9%

Minimum 10 values

Value  Count  Frequency (%)
0      54     12.4%
15     3      0.7%
20     3      0.7%
22     6      1.4%
24     6      1.4%
26     9      2.1%
27     6      1.4%
28     6      1.4%
29     9      2.1%
30     6      1.4%

Maximum 10 values

Value  Count  Frequency (%)
65     6      1.4%
64     3      0.7%
62     3      0.7%
58     6      1.4%
57     3      0.7%
54     6      1.4%
53     6      1.4%
52     12     2.8%
51     6      1.4%
50     9      2.1%

합격여부
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.5 KiB
불합격 (fail): 261
합격 (pass): 114
결시 (absent): 54
응시결격 (disqualified): 6

Length

Max length: 4
Median length: 3
Mean length: 2.6275862
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 불합격
2nd row: 불합격
3rd row: 불합격
4th row: 불합격
5th row: 불합격

Common Values

Value     Count  Frequency (%)
불합격    261    60.0%
합격      114    26.2%
결시      54     12.4%
응시결격  6      1.4%
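
Each serial number (일련번호) appears three times in this report, once per subject, so the row counts above are three times the number of candidates. The sketch below assumes that three-rows-per-candidate structure holds for every record. It recomputes the percentage shares, converts rows to candidates, and derives the pass rate among candidates who received a pass or fail result. All names in the code are illustrative.

```python
# Row counts per outcome, copied from the 합격여부 table in this report.
# 불합격 = fail, 합격 = pass, 결시 = absent, 응시결격 = disqualified.
row_counts = {"불합격": 261, "합격": 114, "결시": 54, "응시결격": 6}
ROWS_PER_CANDIDATE = 3  # assumption: one row per subject, three subjects

total_rows = sum(row_counts.values())

# Percentage share of rows per outcome, rounded to one decimal.
shares = {k: round(100 * v / total_rows, 1) for k, v in row_counts.items()}

# Convert row counts to candidate counts.
candidates = {k: v // ROWS_PER_CANDIDATE for k, v in row_counts.items()}

# Pass rate among candidates with a pass or fail result.
graded = candidates["합격"] + candidates["불합격"]
pass_rate = candidates["합격"] / graded

print(shares)
print(candidates)
print(f"Pass rate among graded candidates: {pass_rate:.1%}")
```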

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value     Count  Frequency (%)
불합격    261    60.0%
합격      114    26.2%
결시      54     12.4%
응시결격  6      1.4%

성별
Categorical

Distinct: 2
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.5 KiB
339 
96 

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

Value  Count  Frequency (%)
       339    77.9%
       96     22.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value  Count  Frequency (%)
       339    77.9%
       96     22.1%

연령대
Categorical

Distinct: 5
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.5 KiB
30: 156
40: 147
50: 84
20: 30
60: 18

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 30
2nd row: 30
3rd row: 30
4th row: 40
5th row: 40

Common Values

Value  Count  Frequency (%)
30     156    35.9%
40     147    33.8%
50     84     19.3%
20     30     6.9%
60     18     4.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value  Count  Frequency (%)
30     156    35.9%
40     147    33.8%
50     84     19.3%
20     30     6.9%
60     18     4.1%

Interactions

[Figure: pairwise interaction plots, 25 panels]

Correlations

            연도   회차   일련번호  과목명  과목별점수  총점   합격여부  성별   연령대
연도        1.000  1.000  0.904     0.000   0.508       0.759  0.394     0.133  0.382
회차        1.000  1.000  0.973     0.000   0.601       0.600  0.439     0.268  0.540
일련번호    0.904  0.973  1.000     0.000   0.518       0.596  0.413     0.239  0.530
과목명      0.000  0.000  0.000     1.000   0.302       0.000  0.000     0.000  0.000
과목별점수  0.508  0.601  0.518     0.302   1.000       0.782  0.783     0.268  0.300
총점        0.759  0.600  0.596     0.000   0.782       1.000  0.791     0.242  0.365
합격여부    0.394  0.439  0.413     0.000   0.783       0.791  1.000     0.235  0.207
성별        0.133  0.268  0.239     0.000   0.268       0.242  0.235     1.000  0.264
연령대      0.382  0.540  0.530     0.000   0.300       0.365  0.207     0.264  1.000

          과목명  성별   합격여부  연령대
과목명    1.000   0.000  0.000     0.000
성별      0.000   1.000  0.156     0.322
합격여부  0.000   0.156  1.000     0.170
연령대    0.000   0.322  0.170     1.000

            연도    회차    일련번호  과목별점수  총점    과목명  합격여부  성별   연령대
연도        1.000   0.999   0.993     -0.421      -0.549  0.000   0.274     0.204  0.234
회차        0.999   1.000   0.994     -0.418      -0.545  0.000   0.276     0.189  0.247
일련번호    0.993   0.994   1.000     -0.421      -0.547  0.000   0.261     0.177  0.250
과목별점수  -0.421  -0.418  -0.421    1.000       0.833   0.207   0.578     0.194  0.137
총점        -0.549  -0.545  -0.547    0.833       1.000   0.000   0.645     0.225  0.218
과목명      0.000   0.000   0.000     0.207       0.000   1.000   0.000     0.000  0.000
합격여부    0.274   0.276   0.261     0.578       0.645   0.000   1.000     0.156  0.170
성별        0.204   0.189   0.177     0.194       0.225   0.000   0.156     1.000  0.322
연령대      0.234   0.247   0.250     0.137       0.218   0.000   0.170     0.322  1.000
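
The "High correlation" alerts in the overview can be reproduced from the last matrix above by flagging every off-diagonal coefficient whose absolute value reaches a cutoff. The cutoff of 0.5 used below is an assumption, chosen because it yields the same variable pairs as the alerts in this report; the report itself does not state it. The function name is illustrative.

```python
# Column order and coefficients copied from the last correlation matrix.
columns = ["연도", "회차", "일련번호", "과목별점수", "총점",
           "과목명", "합격여부", "성별", "연령대"]
matrix = [
    [1.000, 0.999, 0.993, -0.421, -0.549, 0.000, 0.274, 0.204, 0.234],
    [0.999, 1.000, 0.994, -0.418, -0.545, 0.000, 0.276, 0.189, 0.247],
    [0.993, 0.994, 1.000, -0.421, -0.547, 0.000, 0.261, 0.177, 0.250],
    [-0.421, -0.418, -0.421, 1.000, 0.833, 0.207, 0.578, 0.194, 0.137],
    [-0.549, -0.545, -0.547, 0.833, 1.000, 0.000, 0.645, 0.225, 0.218],
    [0.000, 0.000, 0.000, 0.207, 0.000, 1.000, 0.000, 0.000, 0.000],
    [0.274, 0.276, 0.261, 0.578, 0.645, 0.000, 1.000, 0.156, 0.170],
    [0.204, 0.189, 0.177, 0.194, 0.225, 0.000, 0.156, 1.000, 0.322],
    [0.234, 0.247, 0.250, 0.137, 0.218, 0.000, 0.170, 0.322, 1.000],
]
THRESHOLD = 0.5  # assumed cutoff, not stated in the report


def highly_correlated(columns, matrix, threshold):
    """Map each column to the other columns with |coefficient| >= threshold."""
    flagged = {}
    for i, name in enumerate(columns):
        partners = [
            columns[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        if partners:
            flagged[name] = partners
    return flagged


alerts = highly_correlated(columns, matrix, THRESHOLD)
for name, partners in alerts.items():
    print(f"{name}: {', '.join(partners)}")
```

Under this cutoff, 과목명, 성별, and 연령대 are not flagged, which agrees with the absence of alerts for those variables.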

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

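First rows

Index | 연도 | 직종 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대
0 | 2011 | 보건교육사 1급 | 2 | 1 | 보건교육방법론 | 15 | 46 | 불합격 |  | 30
1 | 2011 | 보건교육사 1급 | 2 | 1 | 보건사업관리 | 18 | 46 | 불합격 |  | 30
2 | 2011 | 보건교육사 1급 | 2 | 1 | 보건프로그램 개발 및 평가 | 13 | 46 | 불합격 |  | 30
3 | 2011 | 보건교육사 1급 | 2 | 2 | 보건사업관리 | 20 | 51 | 불합격 |  | 40
4 | 2011 | 보건교육사 1급 | 2 | 2 | 보건교육방법론 | 11 | 51 | 불합격 |  | 40
5 | 2011 | 보건교육사 1급 | 2 | 2 | 보건프로그램 개발 및 평가 | 20 | 51 | 불합격 |  | 40
6 | 2011 | 보건교육사 1급 | 2 | 3 | 보건사업관리 | 24 | 65 | 합격 |  | 40
7 | 2011 | 보건교육사 1급 | 2 | 3 | 보건교육방법론 | 19 | 65 | 합격 |  | 40
8 | 2011 | 보건교육사 1급 | 2 | 3 | 보건프로그램 개발 및 평가 | 22 | 65 | 합격 |  | 40
9 | 2011 | 보건교육사 1급 | 2 | 4 | 보건사업관리 | 13 | 38 | 불합격 |  | 30

Last rows

Index | 연도 | 직종 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대
425 | 2023 | 보건교육사 1급 | 14 | 142 | 보건프로그램 개발 및 평가 | 0 | 0 | 결시 |  | 40
426 | 2023 | 보건교육사 1급 | 14 | 143 | 보건사업관리 | 0 | 0 | 결시 |  | 50
427 | 2023 | 보건교육사 1급 | 14 | 143 | 보건교육방법론 | 0 | 0 | 결시 |  | 50
428 | 2023 | 보건교육사 1급 | 14 | 143 | 보건프로그램 개발 및 평가 | 0 | 0 | 결시 |  | 50
429 | 2023 | 보건교육사 1급 | 14 | 144 | 보건사업관리 | 8 | 28 | 불합격 |  | 60
430 | 2023 | 보건교육사 1급 | 14 | 144 | 보건교육방법론 | 10 | 28 | 불합격 |  | 60
431 | 2023 | 보건교육사 1급 | 14 | 144 | 보건프로그램 개발 및 평가 | 10 | 28 | 불합격 |  | 60
432 | 2023 | 보건교육사 1급 | 14 | 145 | 보건사업관리 | 14 | 37 | 합격 |  | 50
433 | 2023 | 보건교육사 1급 | 14 | 145 | 보건교육방법론 | 11 | 37 | 합격 |  | 50
434 | 2023 | 보건교육사 1급 | 14 | 145 | 보건프로그램 개발 및 평가 | 12 | 37 | 합격 |  | 50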
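
In the rows shown above, 총점 equals the sum of a candidate's three 과목별점수 values. The sketch below checks this for the candidates whose three rows all appear in the sample, grouping by 일련번호. It also compares the dataset-wide sums reported earlier, which are consistent with the same relationship because each total is repeated on three rows. The names are illustrative, and the check says nothing definitive about rows not shown.

```python
from collections import defaultdict

# (일련번호, 과목별점수, 총점) for candidates fully visible in the sample.
# Candidates 4 and 142 are omitted because only one of their rows is shown.
sample_rows = [
    (1, 15, 46), (1, 18, 46), (1, 13, 46),
    (2, 20, 51), (2, 11, 51), (2, 20, 51),
    (3, 24, 65), (3, 19, 65), (3, 22, 65),
    (143, 0, 0), (143, 0, 0), (143, 0, 0),
    (144, 8, 28), (144, 10, 28), (144, 10, 28),
    (145, 14, 37), (145, 11, 37), (145, 12, 37),
]

subject_sum = defaultdict(int)
reported_total = {}
for serial, score, total in sample_rows:
    subject_sum[serial] += score
    reported_total[serial] = total

# Candidates whose summed subject scores differ from the reported total.
mismatches = {
    serial: (subject_sum[serial], reported_total[serial])
    for serial in subject_sum
    if subject_sum[serial] != reported_total[serial]
}
print("Mismatching candidates:", mismatches)

# Dataset-wide sums copied from the report's descriptive statistics.
sum_subject_scores = 4999
sum_totals = 14997
sums_consistent = sum_totals == 3 * sum_subject_scores
print("Sum of 총점 equals 3 x sum of 과목별점수:", sums_consistent)
```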