Overview

Dataset statistics

Number of variables10
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory918.0 KiB
Average record size in memory94.0 B

Variable types

Categorical6
Numeric4

Dataset

Description위생사 국가시험 응시자의 성적 현황을 분석할 수 있는 정보(연도, 직종, 회차, 일련번호, 과목명, 과목별 점수, 총점, 합격여부, 성별, 연령대)를 제공합니다.
URLhttps://www.data.go.kr/data/15083518/fileData.do

Alerts

연도 has constant value ""Constant
직종 has constant value ""Constant
회차 has constant value ""Constant
과목별점수 is highly overall correlated with 총점 and 1 other fieldsHigh correlation
총점 is highly overall correlated with 과목별점수 and 1 other fieldsHigh correlation
합격여부 is highly overall correlated with 과목별점수 and 1 other fieldsHigh correlation
과목별점수 has 1116 (11.2%) zerosZeros
총점 has 1115 (11.2%) zerosZeros

Reproduction

Analysis started2023-12-12 13:31:25.176799
Analysis finished2023-12-12 13:31:28.449229
Duration3.27 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 10000
100.0%

Length

2023-12-12T22:31:28.513856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:31:28.602997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 10000
100.0%

직종
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
위생사
10000 

Length

Max length37
Median length37
Mean length37
Min length37

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row위생사
2nd row위생사
3rd row위생사
4th row위생사
5th row위생사

Common Values

ValueCountFrequency (%)
위생사 10000
100.0%

Length

2023-12-12T22:31:28.694392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:31:28.811045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
위생사 10000
100.0%

회차
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
44
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row44
2nd row44
3rd row44
4th row44
5th row44

Common Values

ValueCountFrequency (%)
44 10000
100.0%

Length

2023-12-12T22:31:28.899243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:31:29.001525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
44 10000
100.0%

일련번호
Real number (ℝ)

Distinct6434
Distinct (%)64.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4648.6986
Minimum1
Maximum9255
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T22:31:29.108378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile469.8
Q12336.75
median4664
Q36971
95-th percentile8778
Maximum9255
Range9254
Interquartile range (IQR)4634.25

Descriptive statistics

Standard deviation2662.8512
Coefficient of variation (CV)0.57281648
Kurtosis-1.2009627
Mean4648.6986
Median Absolute Deviation (MAD)2316
Skewness-0.012498425
Sum46486986
Variance7090776.3
MonotonicityNot monotonic
2023-12-12T22:31:29.233582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1254 5
 
0.1%
7511 5
 
0.1%
3125 5
 
0.1%
6089 5
 
0.1%
1332 5
 
0.1%
6936 5
 
0.1%
1796 5
 
0.1%
3901 5
 
0.1%
4097 5
 
0.1%
3553 5
 
0.1%
Other values (6424) 9950
99.5%
ValueCountFrequency (%)
1 2
< 0.1%
2 1
< 0.1%
5 2
< 0.1%
8 2
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
15 1
< 0.1%
16 1
< 0.1%
ValueCountFrequency (%)
9255 2
< 0.1%
9253 1
< 0.1%
9252 1
< 0.1%
9251 1
< 0.1%
9249 1
< 0.1%
9248 2
< 0.1%
9247 2
< 0.1%
9246 1
< 0.1%
9245 1
< 0.1%
9244 1
< 0.1%

과목명
Categorical

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공중보건학
1701 
위생곤충학
1681 
위생 관계 법령
1680 
식품위생학
1675 
환경위생학
1632 

Length

Max length8
Median length5
Mean length5.3409
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row위생 관계 법령
2nd row위생 관계 법령
3rd row위생곤충학
4th row식품위생학
5th row위생 관계 법령

Common Values

ValueCountFrequency (%)
공중보건학 1701
17.0%
위생곤충학 1681
16.8%
위생 관계 법령 1680
16.8%
식품위생학 1675
16.8%
환경위생학 1632
16.3%
실기시험 1631
16.3%

Length

2023-12-12T22:31:29.391490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:31:29.507789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공중보건학 1701
12.7%
위생곤충학 1681
12.6%
위생 1680
12.6%
관계 1680
12.6%
법령 1680
12.6%
식품위생학 1675
12.5%
환경위생학 1632
12.2%
실기시험 1631
12.2%

과목별점수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct46
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.0674
Minimum0
Maximum47
Zeros1116
Zeros (%)11.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T22:31:29.625048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q115
median22
Q329
95-th percentile37
Maximum47
Range47
Interquartile range (IQR)14

Descriptive statistics

Standard deviation10.571974
Coefficient of variation (CV)0.50181674
Kurtosis-0.29686846
Mean21.0674
Median Absolute Deviation (MAD)7
Skewness-0.4176229
Sum210674
Variance111.76663
MonotonicityNot monotonic
2023-12-12T22:31:29.758130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)
ValueCountFrequency (%)
0 1116
 
11.2%
19 468
 
4.7%
21 452
 
4.5%
20 424
 
4.2%
18 416
 
4.2%
23 395
 
4.0%
17 394
 
3.9%
22 381
 
3.8%
24 368
 
3.7%
26 338
 
3.4%
Other values (36) 5248
52.5%
ValueCountFrequency (%)
0 1116
11.2%
3 3
 
< 0.1%
4 5
 
0.1%
5 9
 
0.1%
6 19
 
0.2%
7 27
 
0.3%
8 55
 
0.5%
9 80
 
0.8%
10 92
 
0.9%
11 154
 
1.5%
ValueCountFrequency (%)
47 1
 
< 0.1%
46 4
 
< 0.1%
45 10
 
0.1%
44 15
 
0.1%
43 21
 
0.2%
42 29
 
0.3%
41 50
0.5%
40 71
0.7%
39 96
1.0%
38 110
1.1%

총점
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct165
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean120.0317
Minimum0
Maximum209
Zeros1115
Zeros (%)11.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T22:31:29.897501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q190
median140
Q3161
95-th percentile184
Maximum209
Range209
Interquartile range (IQR)71

Descriptive statistics

Standard deviation54.606742
Coefficient of variation (CV)0.45493601
Kurtosis0.013842018
Mean120.0317
Median Absolute Deviation (MAD)35
Skewness-0.93284459
Sum1200317
Variance2981.8963
MonotonicityNot monotonic
2023-12-12T22:31:30.084059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1115
 
11.2%
161 152
 
1.5%
107 145
 
1.5%
155 140
 
1.4%
146 133
 
1.3%
150 132
 
1.3%
148 131
 
1.3%
152 126
 
1.3%
142 125
 
1.2%
164 125
 
1.2%
Other values (155) 7676
76.8%
ValueCountFrequency (%)
0 1115
11.2%
37 1
 
< 0.1%
40 1
 
< 0.1%
43 3
 
< 0.1%
44 1
 
< 0.1%
45 1
 
< 0.1%
46 2
 
< 0.1%
47 3
 
< 0.1%
48 2
 
< 0.1%
49 4
 
< 0.1%
ValueCountFrequency (%)
209 2
 
< 0.1%
208 4
 
< 0.1%
206 1
 
< 0.1%
204 6
 
0.1%
203 3
 
< 0.1%
202 6
 
0.1%
201 4
 
< 0.1%
200 4
 
< 0.1%
199 10
0.1%
198 17
0.2%

합격여부
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
합격
5431 
불합격
3454 
결시
1115 

Length

Max length3
Median length2
Mean length2.3454
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row합격
2nd row합격
3rd row불합격
4th row합격
5th row합격

Common Values

ValueCountFrequency (%)
합격 5431
54.3%
불합격 3454
34.5%
결시 1115
 
11.2%

Length

2023-12-12T22:31:30.233973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:31:30.331436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
합격 5431
54.3%
불합격 3454
34.5%
결시 1115
 
11.2%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
7854 
2146 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
7854
78.5%
2146
 
21.5%

Length

2023-12-12T22:31:30.440958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:31:30.557716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
7854
78.5%
2146
 
21.5%

연령대
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.064
Minimum20
Maximum70
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T22:31:30.677320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile20
Q120
median20
Q320
95-th percentile40
Maximum70
Range50
Interquartile range (IQR)0

Descriptive statistics

Standard deviation6.1370734
Coefficient of variation (CV)0.27814872
Kurtosis12.826631
Mean22.064
Median Absolute Deviation (MAD)0
Skewness3.4526269
Sum220640
Variance37.66367
MonotonicityNot monotonic
2023-12-12T22:31:30.796119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
20 8726
87.3%
30 712
 
7.1%
40 375
 
3.8%
50 151
 
1.5%
60 31
 
0.3%
70 5
 
0.1%
ValueCountFrequency (%)
20 8726
87.3%
30 712
 
7.1%
40 375
 
3.8%
50 151
 
1.5%
60 31
 
0.3%
70 5
 
0.1%
ValueCountFrequency (%)
70 5
 
0.1%
60 31
 
0.3%
50 151
 
1.5%
40 375
 
3.8%
30 712
 
7.1%
20 8726
87.3%

Interactions

2023-12-12T22:31:27.710308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:26.357463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:26.836804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:27.267094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:27.811147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:26.526388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:26.952549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:27.381041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:27.915612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:26.617176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:27.055865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:27.477294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:28.050997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:26.719510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:27.157781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:31:27.579996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:31:30.884793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호과목명과목별점수총점합격여부성별연령대
일련번호1.0000.0250.0740.0920.0690.0620.081
과목명0.0251.0000.5930.0000.0000.0000.000
과목별점수0.0740.5931.0000.8370.8590.0760.140
총점0.0920.0000.8371.0000.9660.0800.150
합격여부0.0690.0000.8590.9661.0000.0300.277
성별0.0620.0000.0760.0800.0301.0000.059
연령대0.0810.0000.1400.1500.2770.0591.000
2023-12-12T22:31:31.003506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별과목명합격여부
성별1.0000.0000.050
과목명0.0001.0000.000
합격여부0.0500.0001.000
2023-12-12T22:31:31.095200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호과목별점수총점연령대과목명합격여부성별
일련번호1.000-0.007-0.014-0.0270.0130.0410.048
과목별점수-0.0071.0000.691-0.1000.3630.7820.058
총점-0.0140.6911.000-0.1060.0000.9700.061
연령대-0.027-0.100-0.1061.0000.0000.1190.043
과목명0.0130.3630.0000.0001.0000.0000.000
합격여부0.0410.7820.9700.1190.0001.0000.050
성별0.0480.0580.0610.0430.0000.0501.000

Missing values

2023-12-12T22:31:28.209227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:31:28.371457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도직종회차일련번호과목명과목별점수총점합격여부성별연령대
331432022위생사445524위생 관계 법령13140합격20
20392022위생사44340위생 관계 법령17155합격20
203682022위생사443395위생곤충학1489불합격20
539172022위생사448987식품위생학32147합격20
423772022위생사447063위생 관계 법령21174합격20
36672022위생사44612식품위생학25155합격20
419122022위생사446986공중보건학2190불합격20
116022022위생사441934위생곤충학968불합격20
516172022위생사448603위생 관계 법령13100불합격20
232242022위생사443871위생곤충학1797불합격20
연도직종회차일련번호과목명과목별점수총점합격여부성별연령대
25222022위생사44421공중보건학23167합격20
132782022위생사442214실기시험21104불합격30
222312022위생사443706식품위생학00결시30
124722022위생사442079위생곤충학00결시30
363302022위생사446056실기시험28171합격20
175442022위생사442925실기시험1871불합격20
100552022위생사441676위생 관계 법령19170합격20
468732022위생사447813식품위생학1994불합격20
544902022위생사449082위생곤충학12106불합격20
144342022위생사442406위생곤충학22175합격20