Overview

Dataset statistics

Number of variables10
Number of observations1032
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory86.8 KiB
Average record size in memory86.1 B

Variable types

Categorical7
Numeric3

Dataset

Description한약조제자격 국가시험 응시자의 성적 현황을 분석할 수 있는 정보(연도, 직종, 회차, 일련번호, 과목명, 과목별 점수, 총점, 합격여부, 성별, 연령대)를 제공합니다. 한약조제자격 국가시험은 2009년도 이후 응시자 데이터가 없습니다.
URLhttps://www.data.go.kr/data/15083521/fileData.do

Alerts

직종 has constant value ""Constant
회차 is highly overall correlated with 일련번호 and 3 other fieldsHigh correlation
연도 is highly overall correlated with 일련번호 and 3 other fieldsHigh correlation
일련번호 is highly overall correlated with 연도 and 1 other fieldsHigh correlation
과목별점수 is highly overall correlated with 총점 and 1 other fieldsHigh correlation
총점 is highly overall correlated with 과목별점수 and 4 other fieldsHigh correlation
합격여부 is highly overall correlated with 과목별점수 and 1 other fieldsHigh correlation
연령대 is highly overall correlated with 총점 and 2 other fieldsHigh correlation
연령대 is highly imbalanced (61.8%)Imbalance
과목별점수 has 102 (9.9%) zerosZeros
총점 has 100 (9.7%) zerosZeros

Reproduction

Analysis started2023-12-12 16:15:17.565672
Analysis finished2023-12-12 16:15:18.879762
Duration1.31 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
2000
500 
2001
356 
2002
152 
2003
 
20
2009
 
4

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2000
2nd row2000
3rd row2000
4th row2000
5th row2000

Common Values

ValueCountFrequency (%)
2000 500
48.4%
2001 356
34.5%
2002 152
 
14.7%
2003 20
 
1.9%
2009 4
 
0.4%

Length

2023-12-13T01:15:18.942518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:15:19.043663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2000 500
48.4%
2001 356
34.5%
2002 152
 
14.7%
2003 20
 
1.9%
2009 4
 
0.4%

직종
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
한약조제자격
1032 

Length

Max length34
Median length34
Mean length34
Min length34

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한약조제자격
2nd row한약조제자격
3rd row한약조제자격
4th row한약조제자격
5th row한약조제자격

Common Values

ValueCountFrequency (%)
한약조제자격 1032
100.0%

Length

2023-12-13T01:15:19.141311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:15:19.214154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한약조제자격 1032
100.0%

회차
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
7
500 
8
356 
9
152 
10
 
20
11
 
4

Length

Max length2
Median length1
Mean length1.0232558
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row7
2nd row7
3rd row7
4th row7
5th row7

Common Values

ValueCountFrequency (%)
7 500
48.4%
8 356
34.5%
9 152
 
14.7%
10 20
 
1.9%
11 4
 
0.4%

Length

2023-12-13T01:15:19.293211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:15:19.375838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
7 500
48.4%
8 356
34.5%
9 152
 
14.7%
10 20
 
1.9%
11 4
 
0.4%

일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct258
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean129.5
Minimum1
Maximum258
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.2 KiB
2023-12-13T01:15:19.476809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.55
Q165
median129.5
Q3194
95-th percentile245.45
Maximum258
Range257
Interquartile range (IQR)129

Descriptive statistics

Standard deviation74.513736
Coefficient of variation (CV)0.57539564
Kurtosis-1.200034
Mean129.5
Median Absolute Deviation (MAD)64.5
Skewness0
Sum133644
Variance5552.2968
MonotonicityIncreasing
2023-12-13T01:15:19.597113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 4
 
0.4%
195 4
 
0.4%
165 4
 
0.4%
166 4
 
0.4%
167 4
 
0.4%
168 4
 
0.4%
169 4
 
0.4%
170 4
 
0.4%
171 4
 
0.4%
172 4
 
0.4%
Other values (248) 992
96.1%
ValueCountFrequency (%)
1 4
0.4%
2 4
0.4%
3 4
0.4%
4 4
0.4%
5 4
0.4%
6 4
0.4%
7 4
0.4%
8 4
0.4%
9 4
0.4%
10 4
0.4%
ValueCountFrequency (%)
258 4
0.4%
257 4
0.4%
256 4
0.4%
255 4
0.4%
254 4
0.4%
253 4
0.4%
252 4
0.4%
251 4
0.4%
250 4
0.4%
249 4
0.4%

과목명
Categorical

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
한약조제지침서
258 
본초학
258 
방제학
258 
50종 이상의 한약재 감별능력
258 

Length

Max length16
Median length11.5
Mean length7.25
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한약조제지침서
2nd row본초학
3rd row방제학
4th row50종 이상의 한약재 감별능력
5th row한약조제지침서

Common Values

ValueCountFrequency (%)
한약조제지침서 258
25.0%
본초학 258
25.0%
방제학 258
25.0%
50종 이상의 한약재 감별능력 258
25.0%

Length

2023-12-13T01:15:19.711083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:15:19.812188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한약조제지침서 258
14.3%
본초학 258
14.3%
방제학 258
14.3%
50종 258
14.3%
이상의 258
14.3%
한약재 258
14.3%
감별능력 258
14.3%

과목별점수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct55
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.993702
Minimum0
Maximum58
Zeros102
Zeros (%)9.9%
Negative0
Negative (%)0.0%
Memory size9.2 KiB
2023-12-13T01:15:19.917104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q128.5
median36
Q341
95-th percentile49
Maximum58
Range58
Interquartile range (IQR)12.5

Descriptive statistics

Standard deviation13.442849
Coefficient of variation (CV)0.40743683
Kurtosis1.0772731
Mean32.993702
Median Absolute Deviation (MAD)6
Skewness-1.250855
Sum34049.5
Variance180.71019
MonotonicityNot monotonic
2023-12-13T01:15:20.032489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 102
 
9.9%
36.0 85
 
8.2%
37.5 67
 
6.5%
33.0 66
 
6.4%
39.0 63
 
6.1%
34.5 62
 
6.0%
40.5 60
 
5.8%
42.0 41
 
4.0%
30.0 38
 
3.7%
31.5 36
 
3.5%
Other values (45) 412
39.9%
ValueCountFrequency (%)
0.0 102
9.9%
7.5 1
 
0.1%
10.5 1
 
0.1%
12.0 2
 
0.2%
13.5 4
 
0.4%
15.0 3
 
0.3%
16.5 7
 
0.7%
18.0 9
 
0.9%
19.5 8
 
0.8%
20.0 1
 
0.1%
ValueCountFrequency (%)
58.0 1
 
0.1%
57.0 1
 
0.1%
56.0 2
 
0.2%
54.0 3
 
0.3%
53.0 4
 
0.4%
52.5 1
 
0.1%
52.0 6
 
0.6%
51.0 15
1.5%
50.0 13
1.3%
49.5 5
 
0.5%

총점
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct130
Distinct (%)12.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean131.97481
Minimum0
Maximum218.5
Zeros100
Zeros (%)9.7%
Negative0
Negative (%)0.0%
Memory size9.2 KiB
2023-12-13T01:15:20.149338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1122
median149.25
Q3161
95-th percentile177
Maximum218.5
Range218.5
Interquartile range (IQR)39

Descriptive statistics

Standard deviation49.358743
Coefficient of variation (CV)0.37400125
Kurtosis2.2575092
Mean131.97481
Median Absolute Deviation (MAD)15
Skewness-1.7550627
Sum136198
Variance2436.2855
MonotonicityNot monotonic
2023-12-13T01:15:20.258355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 100
 
9.7%
155.5 24
 
2.3%
163.0 24
 
2.3%
162.0 20
 
1.9%
156.0 20
 
1.9%
156.5 20
 
1.9%
155.0 20
 
1.9%
137.0 20
 
1.9%
159.5 16
 
1.6%
147.5 16
 
1.6%
Other values (120) 752
72.9%
ValueCountFrequency (%)
0.0 100
9.7%
39.0 4
 
0.4%
51.0 4
 
0.4%
62.0 4
 
0.4%
85.5 4
 
0.4%
89.5 4
 
0.4%
92.0 4
 
0.4%
94.0 4
 
0.4%
95.5 4
 
0.4%
96.5 4
 
0.4%
ValueCountFrequency (%)
218.5 4
0.4%
193.5 4
0.4%
192.5 4
0.4%
187.0 8
0.8%
186.5 4
0.4%
183.5 4
0.4%
180.5 4
0.4%
179.5 4
0.4%
179.0 4
0.4%
178.5 4
0.4%

합격여부
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
합격
592 
불합격
336 
결시
100 
응시결격
 
4

Length

Max length4
Median length2
Mean length2.3333333
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row합격
2nd row합격
3rd row합격
4th row합격
5th row합격

Common Values

ValueCountFrequency (%)
합격 592
57.4%
불합격 336
32.6%
결시 100
 
9.7%
응시결격 4
 
0.4%

Length

2023-12-13T01:15:20.376861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:15:20.468668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
합격 592
57.4%
불합격 336
32.6%
결시 100
 
9.7%
응시결격 4
 
0.4%

성별
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
916 
116 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
916
88.8%
116
 
11.2%

Length

2023-12-13T01:15:20.568345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:15:20.644106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
916
88.8%
116
 
11.2%

연령대
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
20
896 
30
128 
40
 
8

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20
2nd row20
3rd row20
4th row20
5th row20

Common Values

ValueCountFrequency (%)
20 896
86.8%
30 128
 
12.4%
40 8
 
0.8%

Length

2023-12-13T01:15:20.730283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:15:20.813860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20 896
86.8%
30 128
 
12.4%
40 8
 
0.8%

Interactions

2023-12-13T01:15:18.427422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:15:17.986233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:15:18.191608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:15:18.504683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:15:18.052828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:15:18.266272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:15:18.577693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:15:18.124020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:15:18.342274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:15:20.872655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도회차일련번호과목명과목별점수총점합격여부성별연령대
연도1.0001.0000.9420.0000.5200.8800.1440.2160.562
회차1.0001.0000.9420.0000.5200.8800.1440.2160.562
일련번호0.9420.9421.0000.0000.2190.5440.2660.4420.289
과목명0.0000.0000.0001.0000.5140.0000.0000.0000.000
과목별점수0.5200.5200.2190.5141.0000.8940.8370.1040.317
총점0.8800.8800.5440.0000.8941.0000.8830.2280.659
합격여부0.1440.1440.2660.0000.8370.8831.0000.0000.499
성별0.2160.2160.4420.0000.1040.2280.0001.0000.000
연령대0.5620.5620.2890.0000.3170.6590.4990.0001.000
2023-12-13T01:15:20.973770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별과목명연령대회차합격여부연도
성별1.0000.0000.0000.2640.0000.264
과목명0.0001.0000.0000.0000.0000.000
연령대0.0000.0001.0000.5060.4980.506
회차0.2640.0000.5061.0000.1181.000
합격여부0.0000.0000.4980.1181.0000.118
연도0.2640.0000.5061.0000.1181.000
2023-12-13T01:15:21.062560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호과목별점수총점연도회차과목명합격여부성별연령대
일련번호1.000-0.091-0.1350.6720.6720.0000.1610.3380.180
과목별점수-0.0911.0000.7680.2410.2410.3320.6760.0800.200
총점-0.1350.7681.0000.5550.5550.0000.7500.1740.507
연도0.6720.2410.5551.0001.0000.0000.1180.2640.506
회차0.6720.2410.5551.0001.0000.0000.1180.2640.506
과목명0.0000.3320.0000.0000.0001.0000.0000.0000.000
합격여부0.1610.6760.7500.1180.1180.0001.0000.0000.498
성별0.3380.0800.1740.2640.2640.0000.0001.0000.000
연령대0.1800.2000.5070.5060.5060.0000.4980.0001.000

Missing values

2023-12-13T01:15:18.691258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:15:18.823387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도직종회차일련번호과목명과목별점수총점합격여부성별연령대
02000한약조제자격71한약조제지침서36.0162.0합격20
12000한약조제자격71본초학40.5162.0합격20
22000한약조제자격71방제학34.5162.0합격20
32000한약조제자격7150종 이상의 한약재 감별능력51.0162.0합격20
42000한약조제자격72한약조제지침서33.0153.5합격20
52000한약조제자격72본초학37.5153.5합격20
62000한약조제자격72방제학36.0153.5합격20
72000한약조제자격7250종 이상의 한약재 감별능력47.0153.5합격20
82000한약조제자격73한약조제지침서34.5162.0합격20
92000한약조제자격73본초학42.0162.0합격20
연도직종회차일련번호과목명과목별점수총점합격여부성별연령대
10222003한약조제자격10256방제학34.5163.5합격30
10232003한약조제자격1025650종 이상의 한약재 감별능력42.0163.5합격30
10242003한약조제자격10257한약조제지침서31.5136.0불합격20
10252003한약조제자격10257본초학33.0136.0불합격20
10262003한약조제자격10257방제학31.5136.0불합격20
10272003한약조제자격1025750종 이상의 한약재 감별능력40.0136.0불합격20
10282009한약조제자격11258한약조제지침서57.0218.5합격40
10292009한약조제자격11258본초학49.5218.5합격40
10302009한약조제자격11258방제학54.0218.5합격40
10312009한약조제자격1125850종 이상의 한약재 감별능력58.0218.5합격40