Overview

Dataset statistics

Number of variables: 10
Number of observations: 10000
Missing cells: 4
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 918.0 KiB
Average record size in memory: 94.0 B
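
The derived figures in this block can be rechecked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; neither assumption is stated by the report itself.

```python
# Values copied from the dataset statistics above.
n_variables = 10
n_observations = 10_000
missing_cells = 4
total_size_kib = 918.0

# Assumption: the percentage is relative to every cell in the table.
missing_pct = missing_cells / (n_variables * n_observations) * 100

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"missing cells: {missing_pct:.3f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share is far below 0.1%, which is consistent with the report showing "< 0.1%".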

Variable types

Numeric: 5
Categorical: 5

Dataset

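Description: Provides information for analyzing the score records of candidates for the Level 1 Emergency Medical Technician (1급 응급구조사) national examination: year (연도), occupation (직종), exam round (회차), serial number (일련번호), subject name (과목명), score per subject (과목별 점수), total score (총점), pass/fail status (합격여부), sex (성별), and age group (연령대).
URL: https://www.data.go.kr/data/15060453/fileData.do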

Alerts

직종 has constant value "응급구조사1급" (Constant)
연도 is highly overall correlated with 회차 and 1 other field (High correlation)
회차 is highly overall correlated with 연도 and 1 other field (High correlation)
일련번호 is highly overall correlated with 연도 and 1 other field (High correlation)
총점 is highly overall correlated with 합격여부 (High correlation)
합격여부 is highly overall correlated with 총점 (High correlation)
합격여부 is highly imbalanced (60.3%) (Imbalance)
연령대 is highly imbalanced (68.7%) (Imbalance)
과목별점수 has 509 (5.1%) zeros (Zeros)
총점 has 381 (3.8%) zeros (Zeros)
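
The two imbalance percentages can be approximated from the category counts listed later in this report. The sketch below assumes the score is one minus the Shannon entropy of the class shares, normalized by the maximum entropy for that number of classes. This is an assumption about how the figure is derived, not something the report states, and the function name `imbalance_score` is made up for illustration.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (bits) of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is given score 0 in this sketch
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Counts copied from the 합격여부 and 연령대 sections of this report.
pass_status_counts = [8246, 1469, 259, 26]  # 합격, 불합격, 결시, 응시결격
age_group_counts = [8893, 636, 444, 27]     # 20, 30, 40, 50

print(f"합격여부: {imbalance_score(pass_status_counts):.1%}")
print(f"연령대: {imbalance_score(age_group_counts):.1%}")
```

A score of 0 would mean perfectly balanced classes and 1 a single dominant class; both variables here are dominated by one category (합격 and the 20 age group).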

Reproduction

Analysis started: 2023-12-12 02:08:24.617707
Analysis finished: 2023-12-12 02:08:30.137420
Duration: 5.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 17
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2009.3387
Minimum: 2000
Maximum: 2016
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2000
5th percentile: 2000
Q1: 2006
Median: 2010
Q3: 2014
95th percentile: 2016
Maximum: 2016
Range: 16
Interquartile range (IQR): 8

Descriptive statistics

Standard deviation: 4.8814102
Coefficient of variation (CV): 0.0024293615
Kurtosis: -0.98769221
Mean: 2009.3387
Median Absolute Deviation (MAD): 4
Skewness: -0.46783849
Sum: 20093387
Variance: 23.828165
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=17)
Common values
Value | Count | Frequency (%)
2015 | 1012 | 10.1%
2014 | 933 | 9.3%
2013 | 802 | 8.0%
2012 | 766 | 7.7%
2011 | 752 | 7.5%
2010 | 657 | 6.6%
2009 | 617 | 6.2%
2016 | 614 | 6.1%
2002 | 524 | 5.2%
2000 | 505 | 5.1%
Other values (7) | 2818 | 28.2%

Minimum 10 values
Value | Count | Frequency (%)
2000 | 505 | 5.1%
2001 | 496 | 5.0%
2002 | 524 | 5.2%
2003 | 168 | 1.7%
2004 | 387 | 3.9%
2005 | 382 | 3.8%
2006 | 396 | 4.0%
2007 | 486 | 4.9%
2008 | 503 | 5.0%
2009 | 617 | 6.2%

Maximum 10 values
Value | Count | Frequency (%)
2016 | 614 | 6.1%
2015 | 1012 | 10.1%
2014 | 933 | 9.3%
2013 | 802 | 8.0%
2012 | 766 | 7.7%
2011 | 752 | 7.5%
2010 | 657 | 6.6%
2009 | 617 | 6.2%
2008 | 503 | 5.0%
2007 | 486 | 4.9%
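
Because 연도 takes only 17 distinct values and the frequency tables above together list all of them, the summary statistics can be recomputed from the counts alone. The sketch below is one way to do that; it assumes the reported variance is the sample variance (n - 1 in the denominator), which is an assumption rather than something the report states.

```python
import math

# Year -> count, read from the frequency tables above (the minimum and
# maximum tables together cover all 17 years).
year_counts = {
    2000: 505, 2001: 496, 2002: 524, 2003: 168, 2004: 387, 2005: 382,
    2006: 396, 2007: 486, 2008: 503, 2009: 617, 2010: 657, 2011: 752,
    2012: 766, 2013: 802, 2014: 933, 2015: 1012, 2016: 614,
}

n = sum(year_counts.values())
mean = sum(year * count for year, count in year_counts.items()) / n

# Assumption: sample variance, i.e. divide by n - 1.
variance = sum(count * (year - mean) ** 2 for year, count in year_counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(n, mean, variance, std, cv)
```

With these counts the results agree with the reported mean, variance, standard deviation, and CV to the precision shown in the report.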

직종
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
응급구조사1급: 10000

Length

Max length: 34
Median length: 34
Mean length: 34
Min length: 34

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 응급구조사1급
2nd row: 응급구조사1급
3rd row: 응급구조사1급
4th row: 응급구조사1급
5th row: 응급구조사1급

Common Values

Value | Count | Frequency (%)
응급구조사1급 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
응급구조사1급 | 10000 | 100.0%

회차
Real number (ℝ)

HIGH CORRELATION 

Distinct: 17
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15.3387
Minimum: 6
Maximum: 22
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 6
5th percentile: 6
Q1: 12
Median: 16
Q3: 20
95th percentile: 22
Maximum: 22
Range: 16
Interquartile range (IQR): 8

Descriptive statistics

Standard deviation: 4.8814102
Coefficient of variation (CV): 0.31824145
Kurtosis: -0.98769221
Mean: 15.3387
Median Absolute Deviation (MAD): 4
Skewness: -0.46783849
Sum: 153387
Variance: 23.828165
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=17)
Common values
Value | Count | Frequency (%)
21 | 1012 | 10.1%
20 | 933 | 9.3%
19 | 802 | 8.0%
18 | 766 | 7.7%
17 | 752 | 7.5%
16 | 657 | 6.6%
15 | 617 | 6.2%
22 | 614 | 6.1%
8 | 524 | 5.2%
6 | 505 | 5.1%
Other values (7) | 2818 | 28.2%

Minimum 10 values
Value | Count | Frequency (%)
6 | 505 | 5.1%
7 | 496 | 5.0%
8 | 524 | 5.2%
9 | 168 | 1.7%
10 | 387 | 3.9%
11 | 382 | 3.8%
12 | 396 | 4.0%
13 | 486 | 4.9%
14 | 503 | 5.0%
15 | 617 | 6.2%

Maximum 10 values
Value | Count | Frequency (%)
22 | 614 | 6.1%
21 | 1012 | 10.1%
20 | 933 | 9.3%
19 | 802 | 8.0%
18 | 766 | 7.7%
17 | 752 | 7.5%
16 | 657 | 6.6%
15 | 617 | 6.2%
14 | 503 | 5.0%
13 | 486 | 4.9%
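
The 회차 figures mirror those of 연도: the standard deviation, skewness, and kurtosis are identical, and the means differ by exactly 1994. The sketch below checks whether shifting the 연도 frequency table by that offset reproduces the 회차 table. The offset is inferred from the two minima (2000 and 6), and matching frequency tables are only consistent with a row-by-row relationship 회차 = 연도 - 1994; they do not prove it.

```python
# Counts read from the 연도 and 회차 frequency tables in this report.
year_counts = {
    2000: 505, 2001: 496, 2002: 524, 2003: 168, 2004: 387, 2005: 382,
    2006: 396, 2007: 486, 2008: 503, 2009: 617, 2010: 657, 2011: 752,
    2012: 766, 2013: 802, 2014: 933, 2015: 1012, 2016: 614,
}
round_counts = {
    6: 505, 7: 496, 8: 524, 9: 168, 10: 387, 11: 382,
    12: 396, 13: 486, 14: 503, 15: 617, 16: 657, 17: 752,
    18: 766, 19: 802, 20: 933, 21: 1012, 22: 614,
}

OFFSET = 1994  # inferred: minimum year 2000 minus minimum round 6

shifted = {year - OFFSET: count for year, count in year_counts.items()}
print("tables match after shifting:", shifted == round_counts)
```

If the relationship holds row by row, one of the two columns is redundant for modeling purposes, which would explain the High correlation alert between them.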

일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 7706
Distinct (%): 77.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7909.5822
Minimum: 1
Maximum: 15883
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5th percentile: 814.95
Q1: 3903.75
Median: 7915.5
Q3: 11858
95th percentile: 15041.2
Maximum: 15883
Range: 15882
Interquartile range (IQR): 7954.25

Descriptive statistics

Standard deviation: 4578.5028
Coefficient of variation (CV): 0.57885521
Kurtosis: -1.1983639
Mean: 7909.5822
Median Absolute Deviation (MAD): 3983.5
Skewness: 0.0023953891
Sum: 79095822
Variance: 20962688
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
4536 | 5 | 0.1%
10221 | 5 | 0.1%
9537 | 4 | < 0.1%
14127 | 4 | < 0.1%
13045 | 4 | < 0.1%
10199 | 4 | < 0.1%
8232 | 4 | < 0.1%
5209 | 4 | < 0.1%
6384 | 4 | < 0.1%
6063 | 4 | < 0.1%
Other values (7696) | 9958 | 99.6%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
3 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 2 | < 0.1%
13 | 1 | < 0.1%
14 | 1 | < 0.1%
16 | 1 | < 0.1%
19 | 2 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
15883 | 1 | < 0.1%
15882 | 2 | < 0.1%
15880 | 1 | < 0.1%
15877 | 1 | < 0.1%
15872 | 1 | < 0.1%
15870 | 1 | < 0.1%
15869 | 2 | < 0.1%
15868 | 1 | < 0.1%
15867 | 1 | < 0.1%
15863 | 1 | < 0.1%

과목명
Categorical

Distinct: 9
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
응급환자관리: 1687
기초의학: 1675
응급의료관련법령: 1627
전문응급처치학각론: 1439
1급응급구조사 실기: 1416
Other values (4): 2156

Length

Max length: 10
Median length: 9
Mean length: 7.35
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 응급의료관련법령
2nd row: 전문응급처치학총론
3rd row: 응급의료관련법령
4th row: 응급의료관련법령
5th row: 기초의학

Common Values

Value | Count | Frequency (%)
응급환자관리 | 1687 | 16.9%
기초의학 | 1675 | 16.8%
응급의료관련법령 | 1627 | 16.3%
전문응급처치학각론 | 1439 | 14.4%
1급응급구조사 실기 | 1416 | 14.2%
전문응급처치학총론 | 1391 | 13.9%
실기시험 | 279 | 2.8%
임상응급의학 | 260 | 2.6%
응급의학총론 | 226 | 2.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
응급환자관리 | 1687 | 14.8%
기초의학 | 1675 | 14.7%
응급의료관련법령 | 1627 | 14.3%
전문응급처치학각론 | 1439 | 12.6%
1급응급구조사 | 1416 | 12.4%
실기 | 1416 | 12.4%
전문응급처치학총론 | 1391 | 12.2%
실기시험 | 279 | 2.4%
임상응급의학 | 260 | 2.3%
응급의학총론 | 226 | 2.0%
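
The percentages in the table directly above differ from those under Common Values, and the category "1급응급구조사 실기" appears as two separate entries. This looks like a word-level count, where each label is split on whitespace and shares are taken over the total number of words. The sketch below tests that reading against the category counts; the whitespace split is an assumption about how the table was built.

```python
from collections import Counter

# Category counts copied from the Common Values table for 과목명.
category_counts = {
    "응급환자관리": 1687,
    "기초의학": 1675,
    "응급의료관련법령": 1627,
    "전문응급처치학각론": 1439,
    "1급응급구조사 실기": 1416,
    "전문응급처치학총론": 1391,
    "실기시험": 279,
    "임상응급의학": 260,
    "응급의학총론": 226,
}

# Assumption: labels are split on whitespace, so the one two-word label
# contributes its count to each of its words.
word_counts = Counter()
for label, count in category_counts.items():
    for word in label.split():
        word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word}: {count} ({count / total_words:.1%})")
```

Under this reading the denominator is 11416 words rather than 10000 rows, which accounts for the lower percentages.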

과목별점수
Real number (ℝ)

ZEROS 

Distinct: 233
Distinct (%): 2.3%
Missing: 4
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 34.723059
Minimum: 0
Maximum: 100
Zeros: 509
Zeros (%): 5.1%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 23
Median: 29
Q3: 50
95th percentile: 69
Maximum: 100
Range: 100
Interquartile range (IQR): 27

Descriptive statistics

Standard deviation: 18.452613
Coefficient of variation (CV): 0.53142245
Kurtosis: 0.22612849
Mean: 34.723059
Median Absolute Deviation (MAD): 8
Skewness: 0.64181307
Sum: 347091.7
Variance: 340.49893
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
0.0 | 509 | 5.1%
28.0 | 464 | 4.6%
27.0 | 436 | 4.4%
26.0 | 435 | 4.3%
25.0 | 422 | 4.2%
29.0 | 414 | 4.1%
24.0 | 384 | 3.8%
23.0 | 361 | 3.6%
30.0 | 360 | 3.6%
22.0 | 338 | 3.4%
Other values (223) | 5873 | 58.7%

Minimum 10 values
Value | Count | Frequency (%)
0.0 | 509 | 5.1%
6.0 | 3 | < 0.1%
6.5 | 1 | < 0.1%
7.0 | 3 | < 0.1%
8.0 | 6 | 0.1%
9.0 | 7 | 0.1%
10.0 | 18 | 0.2%
11.0 | 21 | 0.2%
12.0 | 34 | 0.3%
12.5 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
100.0 | 10 | 0.1%
97.0 | 22 | 0.2%
96.0 | 3 | < 0.1%
95.0 | 3 | < 0.1%
94.0 | 7 | 0.1%
93.0 | 8 | 0.1%
92.0 | 3 | < 0.1%
91.0 | 3 | < 0.1%
90.0 | 3 | < 0.1%
89.0 | 5 | 0.1%

총점
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 869
Distinct (%): 8.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 206.42718
Minimum: 0
Maximum: 323
Zeros: 381
Zeros (%): 3.8%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 33.4875
Q1: 195
Median: 219.225
Q3: 238
95th percentile: 262
Maximum: 323
Range: 323
Interquartile range (IQR): 43

Descriptive statistics

Standard deviation: 57.26282
Coefficient of variation (CV): 0.27739961
Kurtosis: 5.5467596
Mean: 206.42718
Median Absolute Deviation (MAD): 20.725
Skewness: -2.2432808
Sum: 2064271.8
Variance: 3279.0305
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
0.0 | 381 | 3.8%
224.0 | 82 | 0.8%
229.0 | 80 | 0.8%
231.0 | 80 | 0.8%
219.0 | 80 | 0.8%
223.0 | 79 | 0.8%
214.0 | 79 | 0.8%
230.0 | 79 | 0.8%
227.0 | 77 | 0.8%
235.0 | 75 | 0.8%
Other values (859) | 8908 | 89.1%

Minimum 10 values
Value | Count | Frequency (%)
0.0 | 381 | 3.8%
0.5 | 1 | < 0.1%
6.5 | 2 | < 0.1%
7.0 | 4 | < 0.1%
8.0 | 2 | < 0.1%
9.0 | 2 | < 0.1%
10.0 | 3 | < 0.1%
10.5 | 2 | < 0.1%
12.0 | 2 | < 0.1%
12.5 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
323.0 | 1 | < 0.1%
320.0 | 1 | < 0.1%
316.0 | 2 | < 0.1%
312.0 | 1 | < 0.1%
311.0 | 4 | < 0.1%
310.0 | 4 | < 0.1%
309.0 | 6 | 0.1%
308.0 | 3 | < 0.1%
307.0 | 3 | < 0.1%
305.0 | 2 | < 0.1%

합격여부
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
합격: 8246
불합격: 1469
결시: 259
응시결격: 26

Length

Max length: 4
Median length: 2
Mean length: 2.1521
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 불합격
2nd row: 합격
3rd row: 합격
4th row: 합격
5th row: 합격

Common Values

Value | Count | Frequency (%)
합격 | 8246 | 82.5%
불합격 | 1469 | 14.7%
결시 | 259 | 2.6%
응시결격 | 26 | 0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
합격 | 8246 | 82.5%
불합격 | 1469 | 14.7%
결시 | 259 | 2.6%
응시결격 | 26 | 0.3%

성별
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
5492 
4508 

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

Value | Count | Frequency (%)
(blank) | 5492 | 54.9%
(blank) | 4508 | 45.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
(blank) | 5492 | 54.9%
(blank) | 4508 | 45.1%

연령대
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
20: 8893
30: 636
40: 444
50: 27

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20
2nd row: 20
3rd row: 40
4th row: 20
5th row: 20

Common Values

Value | Count | Frequency (%)
20 | 8893 | 88.9%
30 | 636 | 6.4%
40 | 444 | 4.4%
50 | 27 | 0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
20 8893
88.9%
30 636
 
6.4%
40 444
 
4.4%
50 27
 
0.3%

Interactions

[25 interaction plots, one per ordered pair of the five numeric variables; images not preserved in this export.]

Correlations

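Variable | 연도 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대
연도 | 1.000 | 0.999 | 0.987 | 0.474 | 0.217 | 0.173 | 0.185 | 0.162 | 0.147
회차 | 0.999 | 1.000 | 0.984 | 0.476 | 0.328 | 0.457 | 0.196 | 0.151 | 0.147
일련번호 | 0.987 | 0.984 | 1.000 | 0.451 | 0.331 | 0.449 | 0.175 | 0.114 | 0.146
과목명 | 0.474 | 0.476 | 0.451 | 1.000 | 0.699 | 0.155 | 0.047 | 0.017 | 0.020
과목별점수 | 0.217 | 0.328 | 0.331 | 0.699 | 1.000 | 0.806 | 0.655 | 0.196 | 0.433
총점 | 0.173 | 0.457 | 0.449 | 0.155 | 0.806 | 1.000 | 0.778 | 0.256 | 0.514
합격여부 | 0.185 | 0.196 | 0.175 | 0.047 | 0.655 | 0.778 | 1.000 | 0.259 | 0.644
성별 | 0.162 | 0.151 | 0.114 | 0.017 | 0.196 | 0.256 | 0.259 | 1.000 | 0.351
연령대 | 0.147 | 0.147 | 0.146 | 0.020 | 0.433 | 0.514 | 0.644 | 0.351 | 1.000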
Variable | 성별 | 합격여부 | 연령대 | 과목명
성별 | 1.000 | 0.172 | 0.235 | 0.017
합격여부 | 0.172 | 1.000 | 0.300 | 0.030
연령대 | 0.235 | 0.300 | 1.000 | 0.013
과목명 | 0.017 | 0.030 | 0.013 | 1.000
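Variable | 연도 | 회차 | 일련번호 | 과목별점수 | 총점 | 과목명 | 합격여부 | 성별 | 연령대
연도 | 1.000 | 1.000 | 0.998 | -0.017 | -0.102 | 0.239 | 0.115 | 0.118 | 0.088
회차 | 1.000 | 1.000 | 0.998 | -0.017 | -0.102 | 0.239 | 0.115 | 0.118 | 0.088
일련번호 | 0.998 | 0.998 | 1.000 | -0.014 | -0.099 | 0.224 | 0.106 | 0.087 | 0.087
과목별점수 | -0.017 | -0.017 | -0.014 | 1.000 | 0.445 | 0.409 | 0.451 | 0.154 | 0.270
총점 | -0.102 | -0.102 | -0.099 | 0.445 | 1.000 | 0.071 | 0.595 | 0.196 | 0.333
과목명 | 0.239 | 0.239 | 0.224 | 0.409 | 0.071 | 1.000 | 0.030 | 0.017 | 0.013
합격여부 | 0.115 | 0.115 | 0.106 | 0.451 | 0.595 | 0.030 | 1.000 | 0.172 | 0.300
성별 | 0.118 | 0.118 | 0.087 | 0.154 | 0.196 | 0.017 | 0.172 | 1.000 | 0.235
연령대 | 0.088 | 0.088 | 0.087 | 0.270 | 0.333 | 0.013 | 0.300 | 0.235 | 1.000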
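
The High correlation alerts in the Overview appear to line up with the last matrix in this section. As a sketch, the snippet below lists every off-diagonal pair in that matrix whose absolute value reaches a cutoff of 0.5. The cutoff is an assumption, chosen only because it reproduces the alert list; the report does not state which threshold or which matrix drives the alerts.

```python
# Values copied from the last correlation matrix in this section.
names = ["연도", "회차", "일련번호", "과목별점수", "총점", "과목명", "합격여부", "성별", "연령대"]
matrix = [
    [1.000, 1.000, 0.998, -0.017, -0.102, 0.239, 0.115, 0.118, 0.088],
    [1.000, 1.000, 0.998, -0.017, -0.102, 0.239, 0.115, 0.118, 0.088],
    [0.998, 0.998, 1.000, -0.014, -0.099, 0.224, 0.106, 0.087, 0.087],
    [-0.017, -0.017, -0.014, 1.000, 0.445, 0.409, 0.451, 0.154, 0.270],
    [-0.102, -0.102, -0.099, 0.445, 1.000, 0.071, 0.595, 0.196, 0.333],
    [0.239, 0.239, 0.224, 0.409, 0.071, 1.000, 0.030, 0.017, 0.013],
    [0.115, 0.115, 0.106, 0.451, 0.595, 0.030, 1.000, 0.172, 0.300],
    [0.118, 0.118, 0.087, 0.154, 0.196, 0.017, 0.172, 1.000, 0.235],
    [0.088, 0.088, 0.087, 0.270, 0.333, 0.013, 0.300, 0.235, 1.000],
]

THRESHOLD = 0.5  # assumption: picked because it reproduces the alert list

# Scan the upper triangle so each pair is reported once.
high_pairs = [
    (names[i], names[j], matrix[i][j])
    for i in range(len(names))
    for j in range(i + 1, len(names))
    if abs(matrix[i][j]) >= THRESHOLD
]

for first, second, value in high_pairs:
    print(f"{first} ~ {second}: {value:.3f}")
```

With this cutoff the flagged pairs are the three among 연도, 회차, and 일련번호, plus 총점 with 합격여부, which is the same set of relationships named in the alerts.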

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

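First rows
Index | 연도 | 직종 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대
60143 | 2012 | 응급구조사1급 | 18 | 10024 | 응급의료관련법령 | 17.0 | 166.0 | 불합격 |  | 20
78717 | 2014 | 응급구조사1급 | 20 | 13120 | 전문응급처치학총론 | 27.0 | 220.5 | 합격 |  | 20
57989 | 2012 | 응급구조사1급 | 18 | 9665 | 응급의료관련법령 | 27.0 | 209.2 | 합격 |  | 40
7241 | 2001 | 응급구조사1급 | 7 | 1207 | 응급의료관련법령 | 24.0 | 256.0 | 합격 |  | 20
43142 | 2010 | 응급구조사1급 | 16 | 7191 | 기초의학 | 33.0 | 256.0 | 합격 |  | 20
11191 | 2002 | 응급구조사1급 | 8 | 1866 | 1급응급구조사 실기 | 47.5 | 212.5 | 합격 |  | 20
80460 | 2015 | 응급구조사1급 | 21 | 13411 | 응급환자관리 | 21.0 | 145.0 | 불합격 |  | 20
71116 | 2014 | 응급구조사1급 | 20 | 11853 | 1급응급구조사 실기 | 53.0 | 239.0 | 합격 |  | 20
53222 | 2011 | 응급구조사1급 | 17 | 8871 | 기초의학 | 29.0 | 247.0 | 합격 |  | 20
32 | 2000 | 응급구조사1급 | 6 | 6 | 응급환자관리 | 0.0 | 0.0 | 결시 |  | 30

Last rows
Index | 연도 | 직종 | 회차 | 일련번호 | 과목명 | 과목별점수 | 총점 | 합격여부 | 성별 | 연령대
21078 | 2005 | 응급구조사1급 | 11 | 3514 | 응급환자관리 | 27.0 | 215.5 | 합격 |  | 20
73333 | 2014 | 응급구조사1급 | 20 | 12223 | 전문응급처치학각론 | 72.0 | 229.0 | 합격 |  | 20
15260 | 2003 | 응급구조사1급 | 9 | 2544 | 기초의학 | 23.0 | 193.0 | 합격 |  | 20
75582 | 2014 | 응급구조사1급 | 20 | 12598 | 응급환자관리 | 27.0 | 222.0 | 합격 |  | 20
10196 | 2002 | 응급구조사1급 | 8 | 1700 | 응급환자관리 | 0.0 | 0.0 | 불합격 |  | 20
27624 | 2007 | 응급구조사1급 | 13 | 4605 | 응급환자관리 | 31.0 | 250.0 | 합격 |  | 20
47160 | 2010 | 응급구조사1급 | 16 | 7861 | 응급환자관리 | 0.0 | 31.9 | 불합격 |  | 20
72048 | 2014 | 응급구조사1급 | 20 | 12009 | 응급환자관리 | 0.0 | 0.0 | 불합격 |  | 40
21417 | 2005 | 응급구조사1급 | 11 | 3570 | 전문응급처치학총론 | 24.0 | 216.5 | 합격 |  | 20
86554 | 2015 | 응급구조사1급 | 21 | 14426 | 실기시험 | 19.0 | 19.0 | 불합격 |  | 40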