Overview

Dataset statistics

Number of variables10
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory918.0 KiB
Average record size in memory94.0 B

Variable types

Numeric5
Categorical5

Dataset

Description방사선사 국가시험 응시자의 성적 현황을 분석할 수 있는 정보(연도, 직종, 회차, 일련번호, 과목명, 과목별 점수, 총점, 합격여부, 성별, 연령대)를 제공합니다.
URLhttps://www.data.go.kr/data/15083508/fileData.do

Alerts

직종 has constant value ""Constant
연도 is highly overall correlated with 회차 and 1 other fieldsHigh correlation
회차 is highly overall correlated with 연도 and 1 other fieldsHigh correlation
일련번호 is highly overall correlated with 연도 and 1 other fieldsHigh correlation
총점 is highly overall correlated with 합격여부High correlation
합격여부 is highly overall correlated with 총점High correlation
연령대 is highly imbalanced (79.3%)Imbalance
과목별점수 has 704 (7.0%) zerosZeros
총점 has 689 (6.9%) zerosZeros

Reproduction

Analysis started2023-12-12 08:17:11.264053
Analysis finished2023-12-12 08:17:15.816387
Duration4.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2002.2903
Minimum2000
Maximum2005
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:17:15.884401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2000
5-th percentile2000
Q12001
median2002
Q32004
95-th percentile2005
Maximum2005
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.5146139
Coefficient of variation (CV)0.00075644072
Kurtosis-1.1140711
Mean2002.2903
Median Absolute Deviation (MAD)1
Skewness-0.025811445
Sum20022903
Variance2.2940553
MonotonicityNot monotonic
2023-12-12T17:17:16.016173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2003 2139
21.4%
2004 2015
20.2%
2002 1858
18.6%
2001 1770
17.7%
2000 1630
16.3%
2005 588
 
5.9%
ValueCountFrequency (%)
2000 1630
16.3%
2001 1770
17.7%
2002 1858
18.6%
2003 2139
21.4%
2004 2015
20.2%
2005 588
 
5.9%
ValueCountFrequency (%)
2005 588
 
5.9%
2004 2015
20.2%
2003 2139
21.4%
2002 1858
18.6%
2001 1770
17.7%
2000 1630
16.3%

직종
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
방사선사
10000 

Length

Max length36
Median length36
Mean length36
Min length36

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row방사선사
2nd row방사선사
3rd row방사선사
4th row방사선사
5th row방사선사

Common Values

ValueCountFrequency (%)
방사선사 10000
100.0%

Length

2023-12-12T17:17:16.189696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:17:16.298300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
방사선사 10000
100.0%

회차
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29.2903
Minimum27
Maximum32
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:17:16.400827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum27
5-th percentile27
Q128
median29
Q331
95-th percentile32
Maximum32
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.5146139
Coefficient of variation (CV)0.051710427
Kurtosis-1.1140711
Mean29.2903
Median Absolute Deviation (MAD)1
Skewness-0.025811445
Sum292903
Variance2.2940553
MonotonicityNot monotonic
2023-12-12T17:17:16.550718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
30 2139
21.4%
31 2015
20.2%
29 1858
18.6%
28 1770
17.7%
27 1630
16.3%
32 588
 
5.9%
ValueCountFrequency (%)
27 1630
16.3%
28 1770
17.7%
29 1858
18.6%
30 2139
21.4%
31 2015
20.2%
32 588
 
5.9%
ValueCountFrequency (%)
32 588
 
5.9%
31 2015
20.2%
30 2139
21.4%
29 1858
18.6%
28 1770
17.7%
27 1630
16.3%

일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct6694
Distinct (%)66.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5344.8803
Minimum1
Maximum10588
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:17:16.700845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile561.95
Q12721.75
median5380
Q37969
95-th percentile10073.05
Maximum10588
Range10587
Interquartile range (IQR)5247.25

Descriptive statistics

Standard deviation3040.3333
Coefficient of variation (CV)0.56883095
Kurtosis-1.1860301
Mean5344.8803
Median Absolute Deviation (MAD)2626.5
Skewness-0.013981146
Sum53448803
Variance9243626.7
MonotonicityNot monotonic
2023-12-12T17:17:16.864463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2382 5
 
0.1%
8876 5
 
0.1%
5965 5
 
0.1%
7895 5
 
0.1%
7804 5
 
0.1%
9999 5
 
0.1%
6446 5
 
0.1%
4047 5
 
0.1%
7255 5
 
0.1%
7082 5
 
0.1%
Other values (6684) 9950
99.5%
ValueCountFrequency (%)
1 3
< 0.1%
2 1
 
< 0.1%
3 1
 
< 0.1%
6 1
 
< 0.1%
7 1
 
< 0.1%
11 1
 
< 0.1%
13 1
 
< 0.1%
15 1
 
< 0.1%
16 3
< 0.1%
18 1
 
< 0.1%
ValueCountFrequency (%)
10588 1
 
< 0.1%
10587 1
 
< 0.1%
10584 2
< 0.1%
10583 1
 
< 0.1%
10582 1
 
< 0.1%
10581 4
< 0.1%
10580 2
< 0.1%
10579 2
< 0.1%
10578 3
< 0.1%
10577 1
 
< 0.1%

과목명
Categorical

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영상진단기술학
1160 
방사선이론
1158 
공중보건학 개론
1139 
방사선치료기술학
1116 
핵의학기술학
1113 
Other values (4)
4314 

Length

Max length8
Median length7
Mean length6.6683
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row방사선이론
2nd row해부생리학 개론
3rd row핵의학기술학
4th row방사선치료기술학
5th row공중보건학 개론

Common Values

ValueCountFrequency (%)
영상진단기술학 1160
11.6%
방사선이론 1158
11.6%
공중보건학 개론 1139
11.4%
방사선치료기술학 1116
11.2%
핵의학기술학 1113
11.1%
해부생리학 개론 1095
10.9%
방사선응용 1087
10.9%
방사선사 실기 1068
10.7%
의료관계법규 1064
10.6%

Length

2023-12-12T17:17:17.019245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:17:17.163453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개론 2234
16.8%
영상진단기술학 1160
8.7%
방사선이론 1158
8.7%
공중보건학 1139
8.6%
방사선치료기술학 1116
8.4%
핵의학기술학 1113
8.4%
해부생리학 1095
8.2%
방사선응용 1087
8.2%
방사선사 1068
8.0%
실기 1068
8.0%

과목별점수
Real number (ℝ)

ZEROS 

Distinct68
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.2273
Minimum0
Maximum96
Zeros704
Zeros (%)7.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:17:17.334018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q111
median15
Q323
95-th percentile66
Maximum96
Range96
Interquartile range (IQR)12

Descriptive statistics

Standard deviation17.234023
Coefficient of variation (CV)0.85201795
Kurtosis3.8517879
Mean20.2273
Median Absolute Deviation (MAD)5
Skewness1.9942891
Sum202273
Variance297.01154
MonotonicityNot monotonic
2023-12-12T17:17:17.840033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 704
 
7.0%
13 640
 
6.4%
12 596
 
6.0%
14 561
 
5.6%
15 559
 
5.6%
11 520
 
5.2%
17 516
 
5.2%
16 504
 
5.0%
18 439
 
4.4%
10 431
 
4.3%
Other values (58) 4530
45.3%
ValueCountFrequency (%)
0 704
7.0%
1 5
 
0.1%
2 10
 
0.1%
3 21
 
0.2%
4 43
 
0.4%
5 67
 
0.7%
6 118
 
1.2%
7 190
 
1.9%
8 268
 
2.7%
9 305
3.0%
ValueCountFrequency (%)
96 1
 
< 0.1%
92 3
 
< 0.1%
90 4
 
< 0.1%
88 5
 
0.1%
86 12
 
0.1%
84 21
 
0.2%
82 32
0.3%
80 49
0.5%
78 62
0.6%
76 63
0.6%

총점
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct218
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean167.7439
Minimum0
Maximum288
Zeros689
Zeros (%)6.9%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:17:18.073200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1109
median198
Q3223
95-th percentile246
Maximum288
Range288
Interquartile range (IQR)114

Descriptive statistics

Standard deviation71.664501
Coefficient of variation (CV)0.42722568
Kurtosis-0.28299801
Mean167.7439
Median Absolute Deviation (MAD)36
Skewness-0.87220977
Sum1677439
Variance5135.8007
MonotonicityNot monotonic
2023-12-12T17:17:18.264715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 689
 
6.9%
206 125
 
1.2%
205 124
 
1.2%
202 122
 
1.2%
213 118
 
1.2%
217 111
 
1.1%
228 111
 
1.1%
222 110
 
1.1%
220 109
 
1.1%
203 109
 
1.1%
Other values (208) 8272
82.7%
ValueCountFrequency (%)
0 689
6.9%
2 2
 
< 0.1%
19 1
 
< 0.1%
25 1
 
< 0.1%
27 3
 
< 0.1%
33 3
 
< 0.1%
36 7
 
0.1%
39 2
 
< 0.1%
40 1
 
< 0.1%
41 3
 
< 0.1%
ValueCountFrequency (%)
288 4
 
< 0.1%
273 3
 
< 0.1%
271 1
 
< 0.1%
270 1
 
< 0.1%
269 7
0.1%
268 5
0.1%
267 3
 
< 0.1%
266 1
 
< 0.1%
265 7
0.1%
264 10
0.1%

합격여부
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
합격
5635 
불합격
3682 
결시
682 
응시결격
 
1

Length

Max length4
Median length2
Mean length2.3684
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row합격
2nd row합격
3rd row합격
4th row합격
5th row합격

Common Values

ValueCountFrequency (%)
합격 5635
56.4%
불합격 3682
36.8%
결시 682
 
6.8%
응시결격 1
 
< 0.1%

Length

2023-12-12T17:17:18.449049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:17:18.595728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
합격 5635
56.4%
불합격 3682
36.8%
결시 682
 
6.8%
응시결격 1
 
< 0.1%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
6045 
3955 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
6045
60.5%
3955
39.6%

Length

2023-12-12T17:17:18.735004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:17:18.824558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6045
60.5%
3955
39.6%

연령대
Categorical

IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
20
9231 
30
 
731
40
 
37
50
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row20
2nd row30
3rd row20
4th row20
5th row20

Common Values

ValueCountFrequency (%)
20 9231
92.3%
30 731
 
7.3%
40 37
 
0.4%
50 1
 
< 0.1%

Length

2023-12-12T17:17:18.951493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:17:19.091993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20 9231
92.3%
30 731
 
7.3%
40 37
 
0.4%
50 1
 
< 0.1%

Interactions

2023-12-12T17:17:14.868045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:12.402653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:12.932816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:13.551728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:14.176748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:14.991325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:12.505531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:13.047281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:13.704740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:14.299930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:15.120928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:12.612289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:13.174033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:13.843460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:14.426171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:15.244663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:12.710901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:13.290946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:13.950108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:14.569889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:15.381743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:12.815249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:13.417424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:14.059031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:17:14.719074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:17:19.199591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도회차일련번호과목명과목별점수총점합격여부성별연령대
연도1.0001.0000.9150.0000.1770.3020.1110.0350.054
회차1.0001.0000.9350.0000.1160.2070.1310.0710.063
일련번호0.9150.9351.0000.0220.1690.3030.1910.1220.097
과목명0.0000.0000.0221.0000.7240.0000.0220.0000.008
과목별점수0.1770.1160.1690.7241.0000.6810.6050.0880.192
총점0.3020.2070.3030.0000.6811.0000.8920.1370.268
합격여부0.1110.1310.1910.0220.6050.8921.0000.0830.359
성별0.0350.0710.1220.0000.0880.1370.0831.0000.264
연령대0.0540.0630.0970.0080.1920.2680.3590.2641.000
2023-12-12T17:17:19.337420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과목명연령대합격여부성별
과목명1.0000.0050.0140.000
연령대0.0051.0000.1460.176
합격여부0.0140.1461.0000.055
성별0.0000.1760.0551.000
2023-12-12T17:17:19.462013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도회차일련번호과목별점수총점과목명합격여부성별연령대
연도1.0001.0000.9830.0550.0990.0000.0840.0510.041
회차1.0001.0000.9830.0550.0990.0000.0840.0510.041
일련번호0.9830.9831.0000.0730.1260.0100.1150.0940.058
과목별점수0.0550.0550.0731.0000.4960.4370.4090.0670.116
총점0.0990.0990.1260.4961.0000.0000.7660.1050.163
과목명0.0000.0000.0100.4370.0001.0000.0140.0000.005
합격여부0.0840.0840.1150.4090.7660.0141.0000.0550.146
성별0.0510.0510.0940.0670.1050.0000.0551.0000.176
연령대0.0410.0410.0580.1160.1630.0050.1460.1761.000

Missing values

2023-12-12T17:17:15.530097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:17:15.737217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도직종회차일련번호과목명과목별점수총점합격여부성별연령대
635772003방사선사307065방사선이론20204합격20
223712001방사선사282486해부생리학 개론15215합격30
78032000방사선사27868핵의학기술학12203합격20
854062004방사선사319490방사선치료기술학13232합격20
354312002방사선사293937공중보건학 개론17217합격20
745502004방사선사318284방사선응용26265합격20
838562004방사선사319318방사선응용24249합격20
437052002방사선사294857방사선이론28191합격20
53742000방사선사27598방사선이론29216합격20
627942003방사선사306978방사선이론28231합격20
연도직종회차일련번호과목명과목별점수총점합격여부성별연령대
47712000방사선사27531방사선이론31249합격20
521122003방사선사305791방사선사 실기60106불합격30
671812003방사선사307465방사선치료기술학12112불합격20
145982000방사선사271623핵의학기술학10187합격30
792182004방사선사318803핵의학기술학14243합격20
508792002방사선사295654방사선사 실기52181불합격20
746172004방사선사318291공중보건학 개론18209불합격20
664362003방사선사307382공중보건학 개론18229합격20
414312002방사선사294604의료관계법규18184불합격20
45932000방사선사27511방사선응용24232합격20