Overview

Dataset statistics

Number of variables: 11
Number of observations: 7081
Missing cells: 3446
Missing cells (%): 4.4%
Duplicate rows: 505
Duplicate rows (%): 7.1%
Total size in memory: 622.5 KiB
Average record size in memory: 90.0 B
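
The derived figures in this table follow from the raw counts. The sketch below redoes the arithmetic in plain Python using only numbers from this report; the variable names are illustrative, and it assumes missing cells are measured against rows times columns and that 1 KiB is 1024 bytes.

```python
# Figures copied from the Dataset statistics table.
n_variables = 11
n_observations = 7081
missing_cells = 3446
duplicate_rows = 505
total_size_kib = 622.5

# Missing cells are taken over the whole table (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Duplicate rows are a share of all observations.
duplicate_pct = 100 * duplicate_rows / n_observations

# Average record size: total bytes divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```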

Variable types

Categorical: 9
Text: 1
Numeric: 1

Dataset

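Description: Statistics on supported employment (지원고용) carried out by the Korea Employment Agency for Persons with Disabilities (한국장애인고용공단) in 2022. Fields provided: 계획년도 (plan year), 계획기관 (planning office), 훈련구분 (training type), 위탁기관 (contracted institution), 산업분류대분류 (industry classification, major category), 성별 (sex), 장애유형 (disability type), 중증여부 (severity), 훈련결과 (training outcome), 최종학력 (highest level of education), 연령 (age)
URL: https://www.data.go.kr/data/15046221/fileData.do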

Alerts

계획년도 has constant value "2022" [Constant]
Dataset has 505 (7.1%) duplicate rows [Duplicates]
장애유형 is highly imbalanced (64.7%) [Imbalance]
중증여부 is highly imbalanced (90.4%) [Imbalance]
훈련결과 is highly imbalanced (55.1%) [Imbalance]
위탁기관 has 3446 (48.7%) missing values [Missing]
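
The imbalance percentages are not simply the share of the largest class. One reading that matches the reported figures for 중증여부 and 훈련결과 is one minus the normalized Shannon entropy of the class counts. The sketch below works under that assumption, which the report itself does not confirm. The function name `imbalance_score` is illustrative, and the counts come from the Common Values tables of the two variables.

```python
import math


def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum possible entropy) for class counts.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    Returning 0.0 for fewer than two classes is a convention chosen here.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)


# Class counts copied from the report.
severity_score = imbalance_score([6993, 88])   # 중증여부: 중증, 경증
outcome_score = imbalance_score([6417, 664])   # 훈련결과: 수료, 탈락

print(f"중증여부: {severity_score:.3f}")
print(f"훈련결과: {outcome_score:.3f}")
```

Both scores land within rounding distance of the alert values (90.4% and 55.1%). The score for 장애유형 cannot be checked the same way because five of its fifteen classes are only reported as a combined "Other values" count.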

Reproduction

Analysis started: 2023-12-12 12:50:58.799367
Analysis finished: 2023-12-12 12:51:00.381076
Duration: 1.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

계획년도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 55.4 KiB

2022 | 7081

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value | Count | Frequency (%)
2022 | 7081 | 100.0%

Length

Histogram of lengths of the category


계획기관
Categorical

Distinct: 28
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 55.4 KiB

부산지역본부 | 501
서울남부지사 | 475
서울지역본부 | 453
서울동부지사 | 442
인천지사 | 420
Other values (23) | 4790

Length

Max length: 12
Median length: 6
Mean length: 5.3886457
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서울북부지사
2nd row: 서울북부지사
3rd row: 경기서부지사
4th row: 경기서부지사
5th row: 경기북부지사

Common Values

Value | Count | Frequency (%)
부산지역본부 | 501 | 7.1%
서울남부지사 | 475 | 6.7%
서울지역본부 | 453 | 6.4%
서울동부지사 | 442 | 6.2%
인천지사 | 420 | 5.9%
경기동부지사 | 403 | 5.7%
경기지역본부 | 382 | 5.4%
경기북부지사 | 370 | 5.2%
대전지역본부 | 326 | 4.6%
경남지사 | 314 | 4.4%
Other values (18) | 2995 | 42.3%

Length

Histogram of lengths of the category

훈련구분
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 55.4 KiB

민간위탁 | 3637
지원고용 | 3444

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 민간위탁
2nd row: 지원고용
3rd row: 민간위탁
4th row: 지원고용
5th row: 지원고용

Common Values

Value | Count | Frequency (%)
민간위탁 | 3637 | 51.4%
지원고용 | 3444 | 48.6%

Length

Histogram of lengths of the category


위탁기관
Text

MISSING 

Distinct: 178
Distinct (%): 4.9%
Missing: 3446
Missing (%): 48.7%
Memory size: 55.4 KiB

Length

Max length: 28
Median length: 17
Mean length: 10.952407
Min length: 3

Characters and Unicode

Total characters: 39812
Distinct characters: 214
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 성동장애인종합복지관
2nd row: 안양시수리장애인종합복지관
3rd row: 경기도시각장애인복지관
4th row: 안양시관악장애인복지관
5th row: 숭인사회복귀시설

Value | Count | Frequency (%)
대구장애인종합복지관 | 86 | 2.2%
사단법인 | 74 | 1.9%
한국교통장애인협회구미시장애인종합복지관 | 68 | 1.8%
서울장애인종합복지관 | 66 | 1.7%
부산진구장애인복지관 | 58 | 1.5%
영도구장애인복지관 | 57 | 1.5%
영천시장애인종합복지관 | 57 | 1.5%
경산시장애인종합복지관 | 54 | 1.4%
광주광역시장애인종합복지관 | 54 | 1.4%
혜원장애인종합복지관 | 51 | 1.3%
Other values (176) | 3211 | 83.7%

Most occurring characters

Value | Count | Frequency (%)
 | 3300 | 8.3%
 | 3121 | 7.8%
 | 3101 | 7.8%
 | 3040 | 7.6%
 | 2861 | 7.2%
 | 2791 | 7.0%
 | 1856 | 4.7%
 | 1779 | 4.5%
 | 1444 | 3.6%
 | 619 | 1.6%
Other values (204) | 15900 | 39.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 39308 | 98.7%
Space Separator | 201 | 0.5%
Close Punctuation | 169 | 0.4%
Open Punctuation | 134 | 0.3%

Most frequent character per category

Other Letter

Value | Count | Frequency (%)
 | 3300 | 8.4%
 | 3121 | 7.9%
 | 3101 | 7.9%
 | 3040 | 7.7%
 | 2861 | 7.3%
 | 2791 | 7.1%
 | 1856 | 4.7%
 | 1779 | 4.5%
 | 1444 | 3.7%
 | 619 | 1.6%
Other values (201) | 15396 | 39.2%

Space Separator

Value | Count | Frequency (%)
(space) | 201 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 169 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 134 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 39308 | 98.7%
Common | 504 | 1.3%

Most frequent character per script

Hangul

Value | Count | Frequency (%)
 | 3300 | 8.4%
 | 3121 | 7.9%
 | 3101 | 7.9%
 | 3040 | 7.7%
 | 2861 | 7.3%
 | 2791 | 7.1%
 | 1856 | 4.7%
 | 1779 | 4.5%
 | 1444 | 3.7%
 | 619 | 1.6%
Other values (201) | 15396 | 39.2%

Common

Value | Count | Frequency (%)
(space) | 201 | 39.9%
) | 169 | 33.5%
( | 134 | 26.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 39308 | 98.7%
ASCII | 504 | 1.3%

Most frequent character per block

Hangul

Value | Count | Frequency (%)
 | 3300 | 8.4%
 | 3121 | 7.9%
 | 3101 | 7.9%
 | 3040 | 7.7%
 | 2861 | 7.3%
 | 2791 | 7.1%
 | 1856 | 4.7%
 | 1779 | 4.5%
 | 1444 | 3.7%
 | 619 | 1.6%
Other values (201) | 15396 | 39.2%

ASCII

Value | Count | Frequency (%)
(space) | 201 | 39.9%
) | 169 | 33.5%
( | 134 | 26.6%

산업분류대분류
Categorical

Distinct: 19
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 55.4 KiB

제조업 | 1731
도매 및 소매업 | 919
숙박 및 음식점업 | 780
보건업 및 사회복지 서비스업 | 677
공공 행정, 국방 및 사회보장 행정 | 659
Other values (14) | 2315

Length

Max length: 24
Median length: 22
Mean length: 10.913289
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 제조업
2nd row: 사업시설 관리, 사업 지원 및 임대 서비스업
3rd row: 도매 및 소매업
4th row: 제조업
5th row: 도매 및 소매업

Common Values

Value | Count | Frequency (%)
제조업 | 1731 | 24.4%
도매 및 소매업 | 919 | 13.0%
숙박 및 음식점업 | 780 | 11.0%
보건업 및 사회복지 서비스업 | 677 | 9.6%
공공 행정, 국방 및 사회보장 행정 | 659 | 9.3%
협회 및 단체, 수리 및 기타 개인 서비스업 | 500 | 7.1%
사업시설 관리, 사업 지원 및 임대 서비스업 | 475 | 6.7%
교육 서비스업 | 437 | 6.2%
<NA> | 256 | 3.6%
전문, 과학 및 기술 서비스업 | 186 | 2.6%
Other values (9) | 461 | 6.5%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
 | 4942 | 20.4%
서비스업 | 2340 | 9.7%
제조업 | 1731 | 7.1%
행정 | 1318 | 5.4%
소매업 | 919 | 3.8%
도매 | 919 | 3.8%
숙박 | 780 | 3.2%
음식점업 | 780 | 3.2%
보건업 | 677 | 2.8%
사회복지 | 677 | 2.8%
Other values (43) | 9139 | 37.7%

성별
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 55.4 KiB

 | 4507
 | 2574

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row:
2nd row:
3rd row:
4th row:
5th row:

Common Values

Value | Count | Frequency (%)
 | 4507 | 63.6%
 | 2574 | 36.4%

Length

Histogram of lengths of the category


장애유형
Categorical

IMBALANCE 

Distinct: 15
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 55.4 KiB

지적장애 | 5327
정신장애 | 724
자폐성장애 | 509
청각장애 | 144
뇌병변장애 | 136
Other values (10) | 241

Length

Max length: 6
Median length: 4
Mean length: 4.0927835
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지적장애
2nd row: 지적장애
3rd row: 지적장애
4th row: 지적장애
5th row: 지적장애

Common Values

Value | Count | Frequency (%)
지적장애 | 5327 | 75.2%
정신장애 | 724 | 10.2%
자폐성장애 | 509 | 7.2%
청각장애 | 144 | 2.0%
뇌병변장애 | 136 | 1.9%
지체장애 | 130 | 1.8%
시각장애 | 42 | 0.6%
신장장애 | 35 | 0.5%
언어장애 | 15 | 0.2%
심장장애 | 4 | 0.1%
Other values (5) | 15 | 0.2%

Length

Histogram of lengths of the category

중증여부
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 55.4 KiB

중증 | 6993
경증 | 88

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 중증
2nd row: 중증
3rd row: 중증
4th row: 중증
5th row: 중증

Common Values

Value | Count | Frequency (%)
중증 | 6993 | 98.8%
경증 | 88 | 1.2%

Length

Histogram of lengths of the category


훈련결과
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 55.4 KiB

수료 | 6417
탈락 | 664

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수료
2nd row: 수료
3rd row: 탈락
4th row: 수료
5th row: 탈락

Common Values

Value | Count | Frequency (%)
수료 | 6417 | 90.6%
탈락 | 664 | 9.4%

Length

Histogram of lengths of the category


최종학력
Categorical

Distinct: 19
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 55.4 KiB

고교졸업 | 4144
특수학교 고교졸업 | 624
고교재학 | 614
초대졸업 | 402
특수학교 고교재학 | 368
Other values (14) | 929

Length

Max length: 9
Median length: 4
Mean length: 4.737184
Min length: 2

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: 고교졸업
2nd row: 고교졸업
3rd row: 고교졸업
4th row: 초대재학
5th row: 고교졸업

Common Values

Value | Count | Frequency (%)
고교졸업 | 4144 | 58.5%
특수학교 고교졸업 | 624 | 8.8%
고교재학 | 614 | 8.7%
초대졸업 | 402 | 5.7%
특수학교 고교재학 | 368 | 5.2%
대학교졸업 | 297 | 4.2%
중학졸업 | 182 | 2.6%
초등졸업 | 104 | 1.5%
무학 | 93 | 1.3%
고교중퇴 | 59 | 0.8%
Other values (9) | 194 | 2.7%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
고교졸업 | 4768 | 59.0%
특수학교 | 994 | 12.3%
고교재학 | 982 | 12.2%
초대졸업 | 402 | 5.0%
대학교졸업 | 297 | 3.7%
중학졸업 | 182 | 2.3%
초등졸업 | 104 | 1.3%
무학 | 93 | 1.2%
고교중퇴 | 61 | 0.8%
대학교중퇴 | 56 | 0.7%
Other values (7) | 136 | 1.7%

연령
Real number (ℝ)

Distinct: 59
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 29.640587
Minimum: 17
Maximum: 75
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 62.4 KiB

Quantile statistics

Minimum: 17
5-th percentile: 19
Q1: 21
Median: 26
Q3: 34
95-th percentile: 53
Maximum: 75
Range: 58
Interquartile range (IQR): 13

Descriptive statistics

Standard deviation: 11.004354
Coefficient of variation (CV): 0.37125964
Kurtosis: 1.4355114
Mean: 29.640587
Median Absolute Deviation (MAD): 5
Skewness: 1.3928379
Sum: 209885
Variance: 121.0958
Monotonicity: Not monotonic
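
Several of the 연령 statistics are derived from one another, so they can be cross-checked from the rounded values printed in this report. The sketch below does that with plain arithmetic. The variable names are illustrative, and small differences from the reported figures are expected because the inputs are already rounded.

```python
# Values copied from the 연령 statistics in this report.
mean = 29.640587
std = 11.004354
minimum, maximum = 17, 75
q1, q3 = 21, 34
n_observations = 7081

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance is the squared standard deviation
total = mean * n_observations     # Sum recovered from the mean and the row count

print(value_range, iqr)
print(f"CV = {cv:.6f}, variance = {variance:.4f}, sum = {total:.0f}")
```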
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
21 | 642 | 9.1%
20 | 638 | 9.0%
19 | 454 | 6.4%
22 | 441 | 6.2%
23 | 416 | 5.9%
24 | 353 | 5.0%
26 | 305 | 4.3%
27 | 286 | 4.0%
25 | 281 | 4.0%
28 | 251 | 3.5%
Other values (49) | 3014 | 42.6%

Value | Count | Frequency (%)
17 | 10 | 0.1%
18 | 148 | 2.1%
19 | 454 | 6.4%
20 | 638 | 9.0%
21 | 642 | 9.1%
22 | 441 | 6.2%
23 | 416 | 5.9%
24 | 353 | 5.0%
25 | 281 | 4.0%
26 | 305 | 4.3%

Value | Count | Frequency (%)
75 | 1 | < 0.1%
74 | 1 | < 0.1%
73 | 2 | < 0.1%
72 | 4 | 0.1%
71 | 5 | 0.1%
70 | 2 | < 0.1%
69 | 9 | 0.1%
68 | 6 | 0.1%
67 | 16 | 0.2%
66 | 11 | 0.2%

Interactions


Correlations

 | 계획기관 | 훈련구분 | 산업분류대분류 | 성별 | 장애유형 | 중증여부 | 훈련결과 | 최종학력 | 연령
계획기관 | 1.000 | 0.327 | 0.514 | 0.085 | 0.338 | 0.101 | 0.000 | 0.324 | 0.287
훈련구분 | 0.327 | 1.000 | 0.379 | 0.000 | 0.111 | 0.000 | 0.052 | 0.198 | 0.245
산업분류대분류 | 0.514 | 0.379 | 1.000 | 0.086 | 0.140 | 0.048 | 0.121 | 0.207 | 0.267
성별 | 0.085 | 0.000 | 0.086 | 1.000 | 0.194 | 0.000 | 0.049 | 0.065 | 0.073
장애유형 | 0.338 | 0.111 | 0.140 | 0.194 | 1.000 | 0.446 | 0.032 | 0.303 | 0.506
중증여부 | 0.101 | 0.000 | 0.048 | 0.000 | 0.446 | 1.000 | 0.000 | 0.063 | 0.226
훈련결과 | 0.000 | 0.052 | 0.121 | 0.049 | 0.032 | 0.000 | 1.000 | 0.052 | 0.088
최종학력 | 0.324 | 0.198 | 0.207 | 0.065 | 0.303 | 0.063 | 0.052 | 1.000 | 0.502
연령 | 0.287 | 0.245 | 0.267 | 0.073 | 0.506 | 0.226 | 0.088 | 0.502 | 1.000

 | 훈련결과 | 최종학력 | 훈련구분 | 계획기관 | 장애유형 | 성별 | 산업분류대분류 | 중증여부
훈련결과 | 1.000 | 0.046 | 0.033 | 0.000 | 0.030 | 0.031 | 0.096 | 0.000
최종학력 | 0.046 | 1.000 | 0.175 | 0.093 | 0.102 | 0.057 | 0.064 | 0.056
훈련구분 | 0.033 | 0.175 | 1.000 | 0.259 | 0.101 | 0.000 | 0.299 | 0.000
계획기관 | 0.000 | 0.093 | 0.259 | 1.000 | 0.108 | 0.067 | 0.166 | 0.080
장애유형 | 0.030 | 0.102 | 0.101 | 0.108 | 1.000 | 0.176 | 0.046 | 0.408
성별 | 0.031 | 0.057 | 0.000 | 0.067 | 0.176 | 1.000 | 0.068 | 0.000
산업분류대분류 | 0.096 | 0.064 | 0.299 | 0.166 | 0.046 | 0.068 | 1.000 | 0.038
중증여부 | 0.000 | 0.056 | 0.000 | 0.080 | 0.408 | 0.000 | 0.038 | 1.000

 | 연령 | 계획기관 | 훈련구분 | 산업분류대분류 | 성별 | 장애유형 | 중증여부 | 훈련결과 | 최종학력
연령 | 1.000 | 0.107 | 0.188 | 0.105 | 0.056 | 0.214 | 0.173 | 0.067 | 0.213
계획기관 | 0.107 | 1.000 | 0.259 | 0.166 | 0.067 | 0.108 | 0.080 | 0.000 | 0.093
훈련구분 | 0.188 | 0.259 | 1.000 | 0.299 | 0.000 | 0.101 | 0.000 | 0.033 | 0.175
산업분류대분류 | 0.105 | 0.166 | 0.299 | 1.000 | 0.068 | 0.046 | 0.038 | 0.096 | 0.064
성별 | 0.056 | 0.067 | 0.000 | 0.068 | 1.000 | 0.176 | 0.000 | 0.031 | 0.057
장애유형 | 0.214 | 0.108 | 0.101 | 0.046 | 0.176 | 1.000 | 0.408 | 0.030 | 0.102
중증여부 | 0.173 | 0.080 | 0.000 | 0.038 | 0.000 | 0.408 | 1.000 | 0.000 | 0.056
훈련결과 | 0.067 | 0.000 | 0.033 | 0.096 | 0.031 | 0.030 | 0.000 | 1.000 | 0.046
최종학력 | 0.213 | 0.093 | 0.175 | 0.064 | 0.057 | 0.102 | 0.056 | 0.046 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
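
The only column with missing values is 위탁기관. In the first ten rows of the Sample section of this report, it is empty exactly when 훈련구분 is 지원고용. The sketch below tallies that pattern for those ten rows only. It is an illustration of how such a check might look, not a statement about the full dataset, where the two counts (3446 missing values, 3444 지원고용 rows) are close but not equal.

```python
from collections import Counter

# (훈련구분, 위탁기관) pairs copied from rows 0-9 of the Sample section;
# None stands for <NA>.
head_rows = [
    ("민간위탁", "성동장애인종합복지관"),
    ("지원고용", None),
    ("민간위탁", "안양시수리장애인종합복지관"),
    ("지원고용", None),
    ("지원고용", None),
    ("민간위탁", "경기도시각장애인복지관"),
    ("민간위탁", "안양시관악장애인복지관"),
    ("지원고용", None),
    ("지원고용", None),
    ("지원고용", None),
]

# Count how often 위탁기관 is missing within each 훈련구분 value.
nullity_by_type = Counter(
    (training_type, institution is None)
    for training_type, institution in head_rows
)

for (training_type, is_missing), count in sorted(nullity_by_type.items()):
    print(training_type, "missing" if is_missing else "present", count)
```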

Sample

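 | 계획년도 | 계획기관 | 훈련구분 | 위탁기관 | 산업분류대분류 | 성별 | 장애유형 | 중증여부 | 훈련결과 | 최종학력 | 연령
0 | 2022 | 서울북부지사 | 민간위탁 | 성동장애인종합복지관 | 제조업 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 20
1 | 2022 | 서울북부지사 | 지원고용 | <NA> | 사업시설 관리, 사업 지원 및 임대 서비스업 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 21
2 | 2022 | 경기서부지사 | 민간위탁 | 안양시수리장애인종합복지관 | 도매 및 소매업 |  | 지적장애 | 중증 | 탈락 | 고교졸업 | 27
3 | 2022 | 경기서부지사 | 지원고용 | <NA> | 제조업 |  | 지적장애 | 중증 | 수료 | 초대재학 | 31
4 | 2022 | 경기북부지사 | 지원고용 | <NA> | 도매 및 소매업 |  | 지적장애 | 중증 | 탈락 | 고교졸업 | 25
5 | 2022 | 경기북부지사 | 민간위탁 | 경기도시각장애인복지관 | 도매 및 소매업 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 26
6 | 2022 | 경기지역본부 | 민간위탁 | 안양시관악장애인복지관 | 교육 서비스업 |  | 지적장애 | 중증 | 탈락 | 고교졸업 | 30
7 | 2022 | 서울동부지사 | 지원고용 | <NA> | 운수 및 창고업 |  | 자폐성장애 | 중증 | 수료 | 고교졸업 | 31
8 | 2022 | 울산지사 | 지원고용 | <NA> | 숙박 및 음식점업 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 22
9 | 2022 | 울산지사 | 지원고용 | <NA> | 숙박 및 음식점업 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 26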
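
 | 계획년도 | 계획기관 | 훈련구분 | 위탁기관 | 산업분류대분류 | 성별 | 장애유형 | 중증여부 | 훈련결과 | 최종학력 | 연령
7071 | 2022 | 서울지역본부 | 지원고용 | <NA> | 도매 및 소매업 |  | 정신장애 | 중증 | 수료 | 초대졸업 | 41
7072 | 2022 | 서울지역본부 | 지원고용 | <NA> | 도매 및 소매업 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 29
7073 | 2022 | 서울지역본부 | 지원고용 | <NA> | 도매 및 소매업 |  | 자폐성장애 | 중증 | 수료 | 고교졸업 | 31
7074 | 2022 | 서울지역본부 | 지원고용 | <NA> | 사업시설 관리, 사업 지원 및 임대 서비스업 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 34
7075 | 2022 | 서울지역본부 | 지원고용 | <NA> | 보건업 및 사회복지 서비스업 |  | 청각장애 | 중증 | 수료 | 특수학교 고교졸업 | 62
7076 | 2022 | 서울지역본부 | 지원고용 | <NA> | 숙박 및 음식점업 |  | 지적장애 | 중증 | 수료 | 특수학교 고교재학 | 23
7077 | 2022 | 서울지역본부 | 지원고용 | <NA> | 숙박 및 음식점업 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 22
7078 | 2022 | 서울지역본부 | 지원고용 | <NA> | 제조업 |  | 지적장애 | 중증 | 수료 | 고교재학 | 19
7079 | 2022 | 서울지역본부 | 지원고용 | <NA> | 숙박 및 음식점업 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 36
7080 | 2022 | 서울지역본부 | 지원고용 | <NA> | 보건업 및 사회복지 서비스업 |  | 자폐성장애 | 중증 | 수료 | 고교졸업 | 20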

Duplicate rows

Most frequently occurring

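 | 계획년도 | 계획기관 | 훈련구분 | 위탁기관 | 산업분류대분류 | 성별 | 장애유형 | 중증여부 | 훈련결과 | 최종학력 | 연령 | # duplicates
486 | 2022 | 충북지사 | 지원고용 | <NA> | 공공 행정, 국방 및 사회보장 행정 |  | 지적장애 | 중증 | 수료 | 고교재학 | 18 | 12
493 | 2022 | 충북지사 | 지원고용 | <NA> | 공공 행정, 국방 및 사회보장 행정 |  | 지적장애 | 중증 | 수료 | 고교재학 | 19 | 11
107 | 2022 | 경기지역본부 | 지원고용 | <NA> | 공공 행정, 국방 및 사회보장 행정 |  | 지적장애 | 중증 | 수료 | 고교재학 | 19 | 10
109 | 2022 | 경기지역본부 | 지원고용 | <NA> | 공공 행정, 국방 및 사회보장 행정 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 20 | 8
487 | 2022 | 충북지사 | 지원고용 | <NA> | 공공 행정, 국방 및 사회보장 행정 |  | 지적장애 | 중증 | 수료 | 고교재학 | 19 | 8
55 | 2022 | 경기동부지사 | 지원고용 | <NA> | 협회 및 단체, 수리 및 기타 개인 서비스업 |  | 지적장애 | 중증 | 수료 | 고교재학 | 19 | 7
302 | 2022 | 서울남부지사 | 지원고용 | <NA> | 사업시설 관리, 사업 지원 및 임대 서비스업 |  | 지적장애 | 중증 | 수료 | 고교재학 | 19 | 7
307 | 2022 | 서울남부지사 | 지원고용 | <NA> | 숙박 및 음식점업 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 21 | 7
124 | 2022 | 경기지역본부 | 지원고용 | <NA> | 제조업 |  | 지적장애 | 중증 | 수료 | 고교졸업 | 20 | 6
224 | 2022 | 대구지역본부 | 지원고용 | <NA> | 공공 행정, 국방 및 사회보장 행정 |  | 지적장애 | 중증 | 수료 | 특수학교 고교졸업 | 21 | 6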
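
A table like this one lists each row that occurs more than once together with how often it occurs. The sketch below shows one way to build such a listing in plain Python. The function name `most_frequent_duplicates` and the toy rows are hypothetical and not taken from the dataset, and the report's own counting convention for "# duplicates" is not confirmed here.

```python
from collections import Counter


def most_frequent_duplicates(rows, top=10):
    """Return (row, count) pairs for rows occurring more than once, most frequent first."""
    counts = Counter(tuple(row) for row in rows)
    duplicated = [(row, n) for row, n in counts.items() if n > 1]
    return sorted(duplicated, key=lambda item: item[1], reverse=True)[:top]


# Hypothetical toy rows (계획기관, 훈련구분, 연령) for illustration only.
toy_rows = [
    ("충북지사", "지원고용", 18),
    ("충북지사", "지원고용", 18),
    ("충북지사", "지원고용", 19),
    ("울산지사", "지원고용", 22),
    ("충북지사", "지원고용", 18),
    ("충북지사", "지원고용", 19),
]

result = most_frequent_duplicates(toy_rows)
for row, count in result:
    print(row, count)
```

Rows that occur only once, such as the 울산지사 row in the toy data, are left out of the result.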