Overview

Dataset statistics

Number of variables10
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory918.0 KiB
Average record size in memory94.0 B

Variable types

Numeric6
Categorical4

Dataset

Description안경사 국가시험 응시자의 성적 현황을 분석할 수 있는 정보(연도, 직종, 회차, 일련번호, 과목명, 과목별 점수, 총점, 합격여부, 성별, 연령대)를 제공합니다.
URLhttps://www.data.go.kr/data/15083516/fileData.do

Alerts

직종 has constant value ""Constant
연도 is highly overall correlated with 회차 and 1 other fieldsHigh correlation
회차 is highly overall correlated with 연도 and 1 other fieldsHigh correlation
일련번호 is highly overall correlated with 연도 and 1 other fieldsHigh correlation
과목별점수 is highly overall correlated with 총점 and 2 other fieldsHigh correlation
총점 is highly overall correlated with 과목별점수 and 1 other fieldsHigh correlation
과목명 is highly overall correlated with 과목별점수High correlation
합격여부 is highly overall correlated with 과목별점수 and 1 other fieldsHigh correlation
과목별점수 has 858 (8.6%) zerosZeros
총점 has 847 (8.5%) zerosZeros

Reproduction

Analysis started2023-12-12 23:20:45.638536
Analysis finished2023-12-12 23:20:50.919031
Duration5.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2003.9912
Minimum2000
Maximum2008
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T08:20:50.988311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2000
5-th percentile2000
Q12002
median2004
Q32006
95-th percentile2008
Maximum2008
Range8
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.4253063
Coefficient of variation (CV)0.001210238
Kurtosis-1.1718646
Mean2003.9912
Median Absolute Deviation (MAD)2
Skewness-0.09069974
Sum20039912
Variance5.8821108
MonotonicityNot monotonic
2023-12-13T08:20:51.111251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
2007 1363
13.6%
2006 1344
13.4%
2005 1321
13.2%
2003 1293
12.9%
2002 1086
10.9%
2001 1055
10.5%
2004 993
9.9%
2000 967
9.7%
2008 578
5.8%
ValueCountFrequency (%)
2000 967
9.7%
2001 1055
10.5%
2002 1086
10.9%
2003 1293
12.9%
2004 993
9.9%
2005 1321
13.2%
2006 1344
13.4%
2007 1363
13.6%
2008 578
5.8%
ValueCountFrequency (%)
2008 578
5.8%
2007 1363
13.6%
2006 1344
13.4%
2005 1321
13.2%
2004 993
9.9%
2003 1293
12.9%
2002 1086
10.9%
2001 1055
10.5%
2000 967
9.7%

직종
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
안경사
10000 

Length

Max length37
Median length37
Mean length37
Min length37

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안경사
2nd row안경사
3rd row안경사
4th row안경사
5th row안경사

Common Values

ValueCountFrequency (%)
안경사 10000
100.0%

Length

2023-12-13T08:20:51.238529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:20:51.329948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
안경사 10000
100.0%

회차
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.9912
Minimum12
Maximum20
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T08:20:51.436099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12
5-th percentile12
Q114
median16
Q318
95-th percentile20
Maximum20
Range8
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.4253063
Coefficient of variation (CV)0.15166506
Kurtosis-1.1718646
Mean15.9912
Median Absolute Deviation (MAD)2
Skewness-0.09069974
Sum159912
Variance5.8821108
MonotonicityNot monotonic
2023-12-13T08:20:51.612525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
19 1363
13.6%
18 1344
13.4%
17 1321
13.2%
15 1293
12.9%
14 1086
10.9%
13 1055
10.5%
16 993
9.9%
12 967
9.7%
20 578
5.8%
ValueCountFrequency (%)
12 967
9.7%
13 1055
10.5%
14 1086
10.9%
15 1293
12.9%
16 993
9.9%
17 1321
13.2%
18 1344
13.4%
19 1363
13.6%
20 578
5.8%
ValueCountFrequency (%)
20 578
5.8%
19 1363
13.6%
18 1344
13.4%
17 1321
13.2%
16 993
9.9%
15 1293
12.9%
14 1086
10.9%
13 1055
10.5%
12 967
9.7%

일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct8107
Distinct (%)81.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9575.5919
Minimum1
Maximum19058
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T08:20:51.751447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile970.8
Q14797
median9640.5
Q314343.25
95-th percentile18069
Maximum19058
Range19057
Interquartile range (IQR)9546.25

Descriptive statistics

Standard deviation5499.2581
Coefficient of variation (CV)0.57429955
Kurtosis-1.205982
Mean9575.5919
Median Absolute Deviation (MAD)4768.5
Skewness-0.012651541
Sum95755919
Variance30241840
MonotonicityNot monotonic
2023-12-13T08:20:51.906250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2288 4
 
< 0.1%
14765 4
 
< 0.1%
15789 4
 
< 0.1%
12325 4
 
< 0.1%
18083 4
 
< 0.1%
10475 4
 
< 0.1%
765 4
 
< 0.1%
1585 4
 
< 0.1%
81 4
 
< 0.1%
9432 4
 
< 0.1%
Other values (8097) 9960
99.6%
ValueCountFrequency (%)
1 2
< 0.1%
3 1
< 0.1%
4 2
< 0.1%
8 1
< 0.1%
14 1
< 0.1%
17 1
< 0.1%
23 2
< 0.1%
25 1
< 0.1%
27 2
< 0.1%
34 1
< 0.1%
ValueCountFrequency (%)
19058 1
< 0.1%
19056 1
< 0.1%
19054 1
< 0.1%
19043 1
< 0.1%
19042 2
< 0.1%
19037 1
< 0.1%
19035 1
< 0.1%
19033 1
< 0.1%
19032 1
< 0.1%
19031 2
< 0.1%

과목명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
안광학
2127 
의료관계법규
2007 
안과학
1982 
안경학
1949 
안경사 실기
1935 

Length

Max length6
Median length3
Mean length4.1826
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안과학
2nd row안경사 실기
3rd row안경사 실기
4th row의료관계법규
5th row안경학

Common Values

ValueCountFrequency (%)
안광학 2127
21.3%
의료관계법규 2007
20.1%
안과학 1982
19.8%
안경학 1949
19.5%
안경사 실기 1935
19.4%

Length

2023-12-13T08:20:52.062911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:20:52.185755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
안광학 2127
17.8%
의료관계법규 2007
16.8%
안과학 1982
16.6%
안경학 1949
16.3%
안경사 1935
16.2%
실기 1935
16.2%

과목별점수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct74
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.1777
Minimum0
Maximum98
Zeros858
Zeros (%)8.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T08:20:52.338811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q114
median20
Q334
95-th percentile76
Maximum98
Range98
Interquartile range (IQR)20

Descriptive statistics

Standard deviation21.576596
Coefficient of variation (CV)0.79390808
Kurtosis0.83994892
Mean27.1777
Median Absolute Deviation (MAD)8
Skewness1.2508311
Sum271777
Variance465.54948
MonotonicityNot monotonic
2023-12-13T08:20:52.527998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 858
 
8.6%
18 507
 
5.1%
19 461
 
4.6%
17 441
 
4.4%
15 396
 
4.0%
20 370
 
3.7%
16 360
 
3.6%
14 333
 
3.3%
22 300
 
3.0%
21 292
 
2.9%
Other values (64) 5682
56.8%
ValueCountFrequency (%)
0 858
8.6%
2 3
 
< 0.1%
3 5
 
0.1%
4 12
 
0.1%
5 29
 
0.3%
6 53
 
0.5%
7 94
 
0.9%
8 121
 
1.2%
9 154
 
1.5%
10 188
 
1.9%
ValueCountFrequency (%)
98 1
 
< 0.1%
96 7
 
0.1%
94 12
 
0.1%
92 9
 
0.1%
90 39
0.4%
88 45
0.4%
86 48
0.5%
84 62
0.6%
82 67
0.7%
80 80
0.8%

총점
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct177
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean122.5743
Minimum0
Maximum222
Zeros847
Zeros (%)8.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T08:20:52.710755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q164
median151
Q3174
95-th percentile196
Maximum222
Range222
Interquartile range (IQR)110

Descriptive statistics

Standard deviation63.243266
Coefficient of variation (CV)0.51595861
Kurtosis-1.0983854
Mean122.5743
Median Absolute Deviation (MAD)35
Skewness-0.57178977
Sum1225743
Variance3999.7107
MonotonicityNot monotonic
2023-12-13T08:20:52.865091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 847
 
8.5%
170 138
 
1.4%
176 137
 
1.4%
165 131
 
1.3%
171 131
 
1.3%
167 129
 
1.3%
169 126
 
1.3%
161 124
 
1.2%
168 123
 
1.2%
177 122
 
1.2%
Other values (167) 7992
79.9%
ValueCountFrequency (%)
0 847
8.5%
14 1
 
< 0.1%
18 1
 
< 0.1%
22 1
 
< 0.1%
24 1
 
< 0.1%
25 2
 
< 0.1%
26 4
 
< 0.1%
27 5
 
0.1%
28 8
 
0.1%
29 6
 
0.1%
ValueCountFrequency (%)
222 1
 
< 0.1%
221 1
 
< 0.1%
219 1
 
< 0.1%
216 1
 
< 0.1%
215 1
 
< 0.1%
214 4
 
< 0.1%
213 8
0.1%
212 10
0.1%
211 8
0.1%
210 6
0.1%

합격여부
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
합격
5743 
불합격
3411 
결시
838 
응시결격
 
8

Length

Max length4
Median length2
Mean length2.3427
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row합격
2nd row불합격
3rd row불합격
4th row합격
5th row합격

Common Values

ValueCountFrequency (%)
합격 5743
57.4%
불합격 3411
34.1%
결시 838
 
8.4%
응시결격 8
 
0.1%

Length

2023-12-13T08:20:53.026773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:20:53.154658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
합격 5743
57.4%
불합격 3411
34.1%
결시 838
 
8.4%
응시결격 8
 
0.1%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
5429 
4571 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
5429
54.3%
4571
45.7%

Length

2023-12-13T08:20:53.603165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:20:53.715089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5429
54.3%
4571
45.7%

연령대
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.827
Minimum20
Maximum70
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T08:20:53.828226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile20
Q120
median20
Q320
95-th percentile30
Maximum70
Range50
Interquartile range (IQR)0

Descriptive statistics

Standard deviation5.8464937
Coefficient of variation (CV)0.25612186
Kurtosis6.163588
Mean22.827
Median Absolute Deviation (MAD)0
Skewness2.325749
Sum228270
Variance34.181489
MonotonicityNot monotonic
2023-12-13T08:20:53.945201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
20 7757
77.6%
30 1753
 
17.5%
40 412
 
4.1%
50 63
 
0.6%
60 14
 
0.1%
70 1
 
< 0.1%
ValueCountFrequency (%)
20 7757
77.6%
30 1753
 
17.5%
40 412
 
4.1%
50 63
 
0.6%
60 14
 
0.1%
70 1
 
< 0.1%
ValueCountFrequency (%)
70 1
 
< 0.1%
60 14
 
0.1%
50 63
 
0.6%
40 412
 
4.1%
30 1753
 
17.5%
20 7757
77.6%

Interactions

2023-12-13T08:20:50.022851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:46.933308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:47.727527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:48.287053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:48.859238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:49.375930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:50.134689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:47.037803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:47.837438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:48.391547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:48.945060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:49.473160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:50.240420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:47.153305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:47.942161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:48.498441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:49.041495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:49.628941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:50.355063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:47.248954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:48.034047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:48.588373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:49.137183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:49.732140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:50.464345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:47.329660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:48.111492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:48.672371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:49.216856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:49.829745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:50.554644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:47.417327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:48.197260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:48.771246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:49.295326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:20:49.923230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:20:54.025964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도회차일련번호과목명과목별점수총점합격여부성별연령대
연도1.0001.0000.9460.0060.1530.1940.0840.1380.110
회차1.0001.0000.9590.0000.1510.1950.0560.0980.119
일련번호0.9460.9591.0000.0000.1940.2470.0890.1260.121
과목명0.0060.0000.0001.0000.9050.0000.0000.0000.000
과목별점수0.1530.1510.1940.9051.0000.7720.7030.1470.121
총점0.1940.1950.2470.0000.7721.0000.9030.2020.166
합격여부0.0840.0560.0890.0000.7030.9031.0000.2060.150
성별0.1380.0980.1260.0000.1470.2020.2061.0000.293
연령대0.1100.1190.1210.0000.1210.1660.1500.2931.000
2023-12-13T08:20:54.139218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
합격여부과목명성별
합격여부1.0000.0000.137
과목명0.0001.0000.000
성별0.1370.0001.000
2023-12-13T08:20:54.232467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도회차일련번호과목별점수총점연령대과목명합격여부성별
연도1.0001.0000.9930.0060.0470.1170.0000.0360.098
회차1.0001.0000.9930.0060.0470.1170.0000.0360.098
일련번호0.9930.9931.0000.0230.0730.0900.0000.0530.097
과목별점수0.0060.0060.0231.0000.554-0.0970.5980.5050.113
총점0.0470.0470.0730.5541.000-0.1190.0000.7870.155
연령대0.1170.1170.090-0.097-0.1191.0000.0000.0970.211
과목명0.0000.0000.0000.5980.0000.0001.0000.0000.000
합격여부0.0360.0360.0530.5050.7870.0970.0001.0000.137
성별0.0980.0980.0970.1130.1550.2110.0000.1371.000

Missing values

2023-12-13T08:20:50.690475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:20:50.848207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도직종회차일련번호과목명과목별점수총점합격여부성별연령대
728262006안경사1814566안과학27187합격30
280272002안경사145606안경사 실기5669불합격20
461322004안경사169227안경사 실기7670불합격20
508382004안경사1610168의료관계법규19204합격20
73142000안경사121463안경학33159합격40
437762004안경사168756안과학19150합격40
310132003안경사156203의료관계법규13163합격20
221802002안경사144437안광학21163합격20
163782001안경사133276의료관계법규19181합격20
674012006안경사1813481안과학19169합격20
연도직종회차일련번호과목명과목별점수총점합격여부성별연령대
70272000안경사121406안경사 실기090불합격20
534902005안경사1710699안광학00결시40
379642003안경사157593안경학40189합격20
330382003안경사156608의료관계법규00결시30
904932008안경사2018099의료관계법규967불합격20
405312003안경사158107안과학17166합격20
345452003안경사156910안광학18155합격20
905192008안경사2018104안경학2052불합격30
488582004안경사169772의료관계법규769불합격20
942452008안경사2018850안광학1870불합격20