Overview

Dataset statistics

Number of variables10
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory918.0 KiB
Average record size in memory94.0 B

Variable types

Categorical6
Numeric4

Dataset

Description영양사 국가시험 응시자의 성적 현황을 분석할 수 있는 정보(연도, 직종, 회차, 일련번호, 과목명, 과목별 점수, 총점, 합격여부, 성별, 연령대)를 제공합니다.
URLhttps://www.data.go.kr/data/15083506/fileData.do

Alerts

연도 has constant value ""Constant
직종 has constant value ""Constant
회차 has constant value ""Constant
과목별점수 is highly overall correlated with 총점 and 1 other fieldsHigh correlation
총점 is highly overall correlated with 과목별점수 and 1 other fieldsHigh correlation
합격여부 is highly overall correlated with 과목별점수 and 1 other fieldsHigh correlation
과목별점수 has 875 (8.8%) zerosZeros
총점 has 874 (8.7%) zerosZeros

Reproduction

Analysis started2023-12-12 08:00:56.829409
Analysis finished2023-12-12 08:01:00.212601
Duration3.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 10000
100.0%

Length

2023-12-12T17:01:00.296260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:01:00.426822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 10000
100.0%

직종
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영양사
10000 

Length

Max length37
Median length37
Mean length37
Min length37

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영양사
2nd row영양사
3rd row영양사
4th row영양사
5th row영양사

Common Values

ValueCountFrequency (%)
영양사 10000
100.0%

Length

2023-12-12T17:01:00.539437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:01:00.643798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영양사 10000
100.0%

회차
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
46
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row46
2nd row46
3rd row46
4th row46
5th row46

Common Values

ValueCountFrequency (%)
46 10000
100.0%

Length

2023-12-12T17:01:00.749831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:01:00.850335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
46 10000
100.0%

일련번호
Real number (ℝ)

Distinct5268
Distinct (%)52.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2966.999
Minimum1
Maximum5897
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:01:00.991456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile275
Q11454
median2990.5
Q34440.25
95-th percentile5599.05
Maximum5897
Range5896
Interquartile range (IQR)2986.25

Descriptive statistics

Standard deviation1712.5882
Coefficient of variation (CV)0.57721227
Kurtosis-1.2085234
Mean2966.999
Median Absolute Deviation (MAD)1491
Skewness-0.030077318
Sum29669990
Variance2932958.4
MonotonicityNot monotonic
2023-12-12T17:01:01.209519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
573 4
 
< 0.1%
4402 4
 
< 0.1%
2528 4
 
< 0.1%
2965 4
 
< 0.1%
1414 4
 
< 0.1%
692 4
 
< 0.1%
4607 4
 
< 0.1%
3877 4
 
< 0.1%
2211 4
 
< 0.1%
901 4
 
< 0.1%
Other values (5258) 9960
99.6%
ValueCountFrequency (%)
1 4
< 0.1%
2 2
< 0.1%
4 2
< 0.1%
5 3
< 0.1%
6 4
< 0.1%
7 1
 
< 0.1%
8 2
< 0.1%
9 2
< 0.1%
10 3
< 0.1%
11 3
< 0.1%
ValueCountFrequency (%)
5897 2
< 0.1%
5895 1
 
< 0.1%
5894 2
< 0.1%
5893 2
< 0.1%
5892 3
< 0.1%
5891 3
< 0.1%
5889 2
< 0.1%
5888 3
< 0.1%
5887 2
< 0.1%
5886 3
< 0.1%

과목명
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영양학 및 생화학
2539 
급식, 위생 및 관계법규
2528 
영양교육, 식사요법 및 생리학
2474 
식품학 및 조리원리
2459 

Length

Max length16
Median length13
Mean length11.9889
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영양교육, 식사요법 및 생리학
2nd row영양교육, 식사요법 및 생리학
3rd row영양학 및 생화학
4th row식품학 및 조리원리
5th row식품학 및 조리원리

Common Values

ValueCountFrequency (%)
영양학 및 생화학 2539
25.4%
급식, 위생 및 관계법규 2528
25.3%
영양교육, 식사요법 및 생리학 2474
24.7%
식품학 및 조리원리 2459
24.6%

Length

2023-12-12T17:01:01.379222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:01:01.499974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
10000
28.6%
영양학 2539
 
7.3%
생화학 2539
 
7.3%
급식 2528
 
7.2%
위생 2528
 
7.2%
관계법규 2528
 
7.2%
영양교육 2474
 
7.1%
식사요법 2474
 
7.1%
생리학 2474
 
7.1%
식품학 2459
 
7.0%

과목별점수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct56
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.6823
Minimum0
Maximum60
Zeros875
Zeros (%)8.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:01:01.636652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q126
median35
Q345
95-th percentile53
Maximum60
Range60
Interquartile range (IQR)19

Descriptive statistics

Standard deviation14.482484
Coefficient of variation (CV)0.42997313
Kurtosis0.24093863
Mean33.6823
Median Absolute Deviation (MAD)9
Skewness-0.79617032
Sum336823
Variance209.74234
MonotonicityNot monotonic
2023-12-12T17:01:01.816229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 875
 
8.8%
35 331
 
3.3%
32 331
 
3.3%
34 298
 
3.0%
30 291
 
2.9%
36 288
 
2.9%
33 284
 
2.8%
46 283
 
2.8%
31 282
 
2.8%
37 281
 
2.8%
Other values (46) 6456
64.6%
ValueCountFrequency (%)
0 875
8.8%
5 1
 
< 0.1%
7 1
 
< 0.1%
8 2
 
< 0.1%
9 4
 
< 0.1%
10 6
 
0.1%
11 9
 
0.1%
12 12
 
0.1%
13 17
 
0.2%
14 19
 
0.2%
ValueCountFrequency (%)
60 6
 
0.1%
59 20
 
0.2%
58 45
 
0.4%
57 69
 
0.7%
56 76
 
0.8%
55 108
1.1%
54 120
1.2%
53 139
1.4%
52 180
1.8%
51 223
2.2%

총점
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct162
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean134.5322
Minimum0
Maximum215
Zeros874
Zeros (%)8.7%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:01:01.957180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1112
median146
Q3173
95-th percentile194
Maximum215
Range215
Interquartile range (IQR)61

Descriptive statistics

Standard deviation52.334567
Coefficient of variation (CV)0.38901145
Kurtosis1.2117844
Mean134.5322
Median Absolute Deviation (MAD)30
Skewness-1.2756549
Sum1345322
Variance2738.9069
MonotonicityNot monotonic
2023-12-12T17:01:02.101213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 874
 
8.7%
166 127
 
1.3%
181 121
 
1.2%
172 117
 
1.2%
180 116
 
1.2%
185 115
 
1.1%
174 114
 
1.1%
176 113
 
1.1%
157 112
 
1.1%
160 111
 
1.1%
Other values (152) 8080
80.8%
ValueCountFrequency (%)
0 874
8.7%
35 1
 
< 0.1%
40 3
 
< 0.1%
47 2
 
< 0.1%
48 2
 
< 0.1%
51 1
 
< 0.1%
52 1
 
< 0.1%
55 1
 
< 0.1%
58 2
 
< 0.1%
63 2
 
< 0.1%
ValueCountFrequency (%)
215 2
 
< 0.1%
214 13
0.1%
213 1
 
< 0.1%
212 6
 
0.1%
211 6
 
0.1%
210 16
0.2%
209 3
 
< 0.1%
208 18
0.2%
207 21
0.2%
206 9
0.1%

합격여부
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
합격
6070 
불합격
2992 
결시
872 
응시결격
 
66

Length

Max length4
Median length2
Mean length2.3124
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row합격
2nd row불합격
3rd row결시
4th row불합격
5th row합격

Common Values

ValueCountFrequency (%)
합격 6070
60.7%
불합격 2992
29.9%
결시 872
 
8.7%
응시결격 66
 
0.7%

Length

2023-12-12T17:01:02.264655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:01:02.387717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
합격 6070
60.7%
불합격 2992
29.9%
결시 872
 
8.7%
응시결격 66
 
0.7%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
8857 
1143 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
8857
88.6%
1143
 
11.4%

Length

2023-12-12T17:01:02.532249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:01:02.635841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
8857
88.6%
1143
 
11.4%

연령대
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24.284
Minimum20
Maximum70
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:01:02.713287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile20
Q120
median20
Q320
95-th percentile50
Maximum70
Range50
Interquartile range (IQR)0

Descriptive statistics

Standard deviation9.1288377
Coefficient of variation (CV)0.37591985
Kurtosis3.7306311
Mean24.284
Median Absolute Deviation (MAD)0
Skewness2.1514666
Sum242840
Variance83.335678
MonotonicityNot monotonic
2023-12-12T17:01:02.849532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
20 7801
78.0%
30 847
 
8.5%
40 737
 
7.4%
50 503
 
5.0%
60 106
 
1.1%
70 6
 
0.1%
ValueCountFrequency (%)
20 7801
78.0%
30 847
 
8.5%
40 737
 
7.4%
50 503
 
5.0%
60 106
 
1.1%
70 6
 
0.1%
ValueCountFrequency (%)
70 6
 
0.1%
60 106
 
1.1%
50 503
 
5.0%
40 737
 
7.4%
30 847
 
8.5%
20 7801
78.0%

Interactions

2023-12-12T17:00:59.084694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:57.753260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:58.207515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:58.641004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:59.575598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:57.899188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:58.303420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:58.758959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:59.683326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:58.006369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:58.420404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:58.878515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:59.775169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:58.110013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:58.523973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:00:58.966943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:01:02.935774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호과목명과목별점수총점합격여부성별연령대
일련번호1.0000.0000.1300.1770.1430.0480.104
과목명0.0001.0000.4860.0000.0000.0000.000
과목별점수0.1300.4861.0000.9010.8470.0740.249
총점0.1770.0000.9011.0000.9110.1010.270
합격여부0.1430.0000.8470.9111.0000.0860.270
성별0.0480.0000.0740.1010.0861.0000.074
연령대0.1040.0000.2490.2700.2700.0741.000
2023-12-12T17:01:03.061424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과목명성별합격여부
과목명1.0000.0000.000
성별0.0001.0000.057
합격여부0.0000.0571.000
2023-12-12T17:01:03.189878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호과목별점수총점연령대과목명합격여부성별
일련번호1.000-0.035-0.0490.0070.0000.0860.037
과목별점수-0.0351.0000.833-0.2410.3110.6940.061
총점-0.0490.8331.000-0.2690.0000.8020.077
연령대0.007-0.241-0.2691.0000.0000.1770.053
과목명0.0000.3110.0000.0001.0000.0000.000
합격여부0.0860.6940.8020.1770.0001.0000.057
성별0.0370.0610.0770.0530.0000.0571.000

Missing values

2023-12-12T17:00:59.904822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:01:00.105705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도직종회차일련번호과목명과목별점수총점합격여부성별연령대
221982022영양사465550영양교육, 식사요법 및 생리학45167합격20
188862022영양사464722영양교육, 식사요법 및 생리학33113불합격20
191922022영양사464799영양학 및 생화학00결시20
44912022영양사461123식품학 및 조리원리19108불합격40
161232022영양사464031식품학 및 조리원리21140합격20
179422022영양사464486영양교육, 식사요법 및 생리학48175합격20
171362022영양사464285영양학 및 생화학51185합격20
233272022영양사465832식품학 및 조리원리35182합격20
93372022영양사462335급식, 위생 및 관계법규2597불합격20
175652022영양사464392급식, 위생 및 관계법규44160합격20
연도직종회차일련번호과목명과목별점수총점합격여부성별연령대
99362022영양사462485영양학 및 생화학51180합격20
21882022영양사46548영양학 및 생화학00결시20
160522022영양사464014영양학 및 생화학58207합격20
106132022영양사462654급식, 위생 및 관계법규54199합격20
75142022영양사461879영양교육, 식사요법 및 생리학45154합격40
232522022영양사465814영양학 및 생화학27103불합격20
83072022영양사462077식품학 및 조리원리23117불합격20
118992022영양사462975식품학 및 조리원리27152합격30
174412022영양사464361급식, 위생 및 관계법규35116불합격20
175162022영양사464380영양학 및 생화학45166합격20