Overview

Dataset statistics

Number of variables10
Number of observations117
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.7 KiB
Average record size in memory85.1 B

Variable types

Categorical9
Numeric1

Dataset

Description3급 장애인재활상담사 국가시험 응시자의 현황을 분석할 수 있는 정보(연도, 직종, 회차, 성별, 연령대, 응시지역, 졸업여부, 합격여부, 학교소재지)를 개인을 식별할 수 없는 형태로 제공합니다. 3급 장애인재활상담사는 2021년도까지만 시행하여 이후 데이터는 없습니다.
Author공공데이터포털
URLhttps://www.data.go.kr/data/15083500/fileData.do

Alerts

직종 has constant value ""Constant
연도 is highly overall correlated with 일련번호 and 2 other fieldsHigh correlation
회차 is highly overall correlated with 일련번호 and 3 other fieldsHigh correlation
일련번호 is highly overall correlated with 연도 and 3 other fieldsHigh correlation
응시지역 is highly overall correlated with 일련번호 and 2 other fieldsHigh correlation
졸업여부 is highly overall correlated with 일련번호 and 1 other fieldsHigh correlation
합격여부 is highly imbalanced (53.8%)Imbalance
일련번호 has unique valuesUnique

Reproduction

Analysis started2024-04-20 22:53:20.869085
Analysis finished2024-04-20 22:53:22.685874
Duration1.82 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2018
66 
2020
33 
2019
10 
2021

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2018
3rd row2018
4th row2018
5th row2018

Common Values

ValueCountFrequency (%)
2018 66
56.4%
2020 33
28.2%
2019 10
 
8.5%
2021 8
 
6.8%

Length

2024-04-21T07:53:22.888105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:53:23.203215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 66
56.4%
2020 33
28.2%
2019 10
 
8.5%
2021 8
 
6.8%

직종
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
3급 장애인재활상담사
117 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3급 장애인재활상담사
2nd row3급 장애인재활상담사
3rd row3급 장애인재활상담사
4th row3급 장애인재활상담사
5th row3급 장애인재활상담사

Common Values

ValueCountFrequency (%)
3급 장애인재활상담사 117
100.0%

Length

2024-04-21T07:53:23.560205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:53:23.853308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3급 117
50.0%
장애인재활상담사 117
50.0%

회차
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2
39 
4
33 
1
27 
3
10 
5

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
2 39
33.3%
4 33
28.2%
1 27
23.1%
3 10
 
8.5%
5 8
 
6.8%

Length

2024-04-21T07:53:24.163173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:53:24.491603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 39
33.3%
4 33
28.2%
1 27
23.1%
3 10
 
8.5%
5 8
 
6.8%

일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct117
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59
Minimum1
Maximum117
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-04-21T07:53:24.863488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.8
Q130
median59
Q388
95-th percentile111.2
Maximum117
Range116
Interquartile range (IQR)58

Descriptive statistics

Standard deviation33.919021
Coefficient of variation (CV)0.57489866
Kurtosis-1.2
Mean59
Median Absolute Deviation (MAD)29
Skewness0
Sum6903
Variance1150.5
MonotonicityStrictly increasing
2024-04-21T07:53:25.305835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
75 1
 
0.9%
87 1
 
0.9%
86 1
 
0.9%
85 1
 
0.9%
84 1
 
0.9%
83 1
 
0.9%
82 1
 
0.9%
81 1
 
0.9%
80 1
 
0.9%
Other values (107) 107
91.5%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
117 1
0.9%
116 1
0.9%
115 1
0.9%
114 1
0.9%
113 1
0.9%
112 1
0.9%
111 1
0.9%
110 1
0.9%
109 1
0.9%
108 1
0.9%

성별
Categorical

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
90 
27 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
90
76.9%
27
 
23.1%

Length

2024-04-21T07:53:25.728730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:53:26.030713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
90
76.9%
27
 
23.1%

연령대
Categorical

Distinct5
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
20
43 
40
38 
50
24 
30
11 
60
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)0.9%

Sample

1st row20
2nd row40
3rd row40
4th row50
5th row20

Common Values

ValueCountFrequency (%)
20 43
36.8%
40 38
32.5%
50 24
20.5%
30 11
 
9.4%
60 1
 
0.9%

Length

2024-04-21T07:53:26.354669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:53:26.678080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20 43
36.8%
40 38
32.5%
50 24
20.5%
30 11
 
9.4%
60 1
 
0.9%

응시지역
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
서울특별시
87 
대구광역시
28 
전주
 
2

Length

Max length5
Median length5
Mean length4.9487179
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 87
74.4%
대구광역시 28
 
23.9%
전주 2
 
1.7%

Length

2024-04-21T07:53:27.065361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:53:27.397322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 87
74.4%
대구광역시 28
 
23.9%
전주 2
 
1.7%

졸업여부
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
졸업예정
72 
졸업
45 

Length

Max length4
Median length4
Mean length3.2307692
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row졸업
2nd row졸업
3rd row졸업
4th row졸업
5th row졸업

Common Values

ValueCountFrequency (%)
졸업예정 72
61.5%
졸업 45
38.5%

Length

2024-04-21T07:53:27.778332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:53:28.117702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
졸업예정 72
61.5%
졸업 45
38.5%

합격여부
Categorical

IMBALANCE 

Distinct4
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
합격
94 
불합격
17 
결시
 
4
응시결격
 
2

Length

Max length4
Median length2
Mean length2.1794872
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row불합격
2nd row합격
3rd row합격
4th row합격
5th row합격

Common Values

ValueCountFrequency (%)
합격 94
80.3%
불합격 17
 
14.5%
결시 4
 
3.4%
응시결격 2
 
1.7%

Length

2024-04-21T07:53:28.478729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:53:28.847712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
합격 94
80.3%
불합격 17
 
14.5%
결시 4
 
3.4%
응시결격 2
 
1.7%

학교소재지
Categorical

Distinct5
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
대구광역시
59 
경상북도
55 
전라남도
 
1
부산광역시
 
1
강원도
 
1

Length

Max length5
Median length5
Mean length4.5042735
Min length3

Unique

Unique3 ?
Unique (%)2.6%

Sample

1st row경상북도
2nd row경상북도
3rd row대구광역시
4th row대구광역시
5th row경상북도

Common Values

ValueCountFrequency (%)
대구광역시 59
50.4%
경상북도 55
47.0%
전라남도 1
 
0.9%
부산광역시 1
 
0.9%
강원도 1
 
0.9%

Length

2024-04-21T07:53:29.132568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:53:29.336837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구광역시 59
50.4%
경상북도 55
47.0%
전라남도 1
 
0.9%
부산광역시 1
 
0.9%
강원도 1
 
0.9%

Interactions

2024-04-21T07:53:21.690826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T07:53:29.473630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도회차일련번호성별연령대응시지역졸업여부합격여부학교소재지
연도1.0001.0000.8970.1940.2310.5290.4830.1570.223
회차1.0001.0000.9820.0760.4110.5780.5710.1080.402
일련번호0.8970.9821.0000.1120.0000.7630.8370.0000.311
성별0.1940.0760.1121.0000.2720.0000.0000.2050.017
연령대0.2310.4110.0000.2721.0000.0630.2270.0820.678
응시지역0.5290.5780.7630.0000.0631.0000.1520.1640.000
졸업여부0.4830.5710.8370.0000.2270.1521.0000.0000.256
합격여부0.1570.1080.0000.2050.0820.1640.0001.0000.000
학교소재지0.2230.4020.3110.0170.6780.0000.2560.0001.000
2024-04-21T07:53:29.682277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
합격여부연령대연도응시지역성별학교소재지졸업여부회차
합격여부1.0000.0650.0610.1540.1340.0000.0000.086
연령대0.0651.0000.1890.0440.3270.3120.2740.163
연도0.0610.1891.0000.5290.1260.1820.3240.996
응시지역0.1540.0440.5291.0000.0000.0000.2490.521
성별0.1340.3270.1260.0001.0000.0110.0000.090
학교소재지0.0000.3120.1820.0000.0111.0000.3080.159
졸업여부0.0000.2740.3240.2490.0000.3081.0000.681
회차0.0860.1630.9960.5210.0900.1590.6811.000
2024-04-21T07:53:29.882132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호연도회차성별연령대응시지역졸업여부합격여부학교소재지
일련번호1.0000.7560.7930.0790.0000.6190.6450.0000.129
연도0.7561.0000.9960.1260.1890.5290.3240.0610.182
회차0.7930.9961.0000.0900.1630.5210.6810.0860.159
성별0.0790.1260.0901.0000.3270.0000.0000.1340.011
연령대0.0000.1890.1630.3271.0000.0440.2740.0650.312
응시지역0.6190.5290.5210.0000.0441.0000.2490.1540.000
졸업여부0.6450.3240.6810.0000.2740.2491.0000.0000.308
합격여부0.0000.0610.0860.1340.0650.1540.0001.0000.000
학교소재지0.1290.1820.1590.0110.3120.0000.3080.0001.000

Missing values

2024-04-21T07:53:22.040021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T07:53:22.507096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도직종회차일련번호성별연령대응시지역졸업여부합격여부학교소재지
020183급 장애인재활상담사1120서울특별시졸업불합격경상북도
120183급 장애인재활상담사1240서울특별시졸업합격경상북도
220183급 장애인재활상담사1340서울특별시졸업합격대구광역시
320183급 장애인재활상담사1450서울특별시졸업합격대구광역시
420183급 장애인재활상담사1520서울특별시졸업합격경상북도
520183급 장애인재활상담사1630서울특별시졸업합격대구광역시
620183급 장애인재활상담사1750서울특별시졸업합격대구광역시
720183급 장애인재활상담사1850서울특별시졸업합격대구광역시
820183급 장애인재활상담사1920서울특별시졸업합격대구광역시
920183급 장애인재활상담사11040서울특별시졸업합격대구광역시
연도직종회차일련번호성별연령대응시지역졸업여부합격여부학교소재지
10720203급 장애인재활상담사410850전주졸업합격대구광역시
10820203급 장애인재활상담사410940전주졸업합격대구광역시
10920213급 장애인재활상담사511020대구광역시졸업예정합격경상북도
11020213급 장애인재활상담사511130대구광역시졸업예정합격경상북도
11120213급 장애인재활상담사511250대구광역시졸업예정합격경상북도
11220213급 장애인재활상담사511320대구광역시졸업예정합격경상북도
11320213급 장애인재활상담사511420대구광역시졸업예정합격경상북도
11420213급 장애인재활상담사511520대구광역시졸업예정합격경상북도
11520213급 장애인재활상담사511660대구광역시졸업예정합격경상북도
11620213급 장애인재활상담사511740대구광역시졸업예정합격경상북도