Overview

Dataset statistics

Number of variables7
Number of observations54
Missing cells57
Missing cells (%)15.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory61.4 B

Variable types

Numeric2
Text1
Categorical3
Unsupported1

Dataset

Description장애학생 통합형 직업교육 거점학교 지정 운영 현황(연도별 거점학교 지정운영 현황)으로써 지정년도, 학교명, 교육청, 설립별, 지정유형을 포함하고 있습니다.자료는 매년 갱신됩니다.
Author교육부 국립특수교육원
URLhttps://www.data.go.kr/data/15047505/fileData.do

Alerts

순번 is highly overall correlated with 지정연도 and 1 other fieldsHigh correlation
지정연도 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
지정유형 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
설립별 is highly imbalanced (83.2%)Imbalance
순번 has 1 (1.9%) missing valuesMissing
지정연도 has 1 (1.9%) missing valuesMissing
학교명 has 1 (1.9%) missing valuesMissing
Unnamed: 6 has 54 (100.0%) missing valuesMissing
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 13:17:08.708441
Analysis finished2023-12-12 13:17:09.650454
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct53
Distinct (%)100.0%
Missing1
Missing (%)1.9%
Infinite0
Infinite (%)0.0%
Mean27
Minimum1
Maximum53
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size618.0 B
2023-12-12T22:17:09.722934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.6
Q114
median27
Q340
95-th percentile50.4
Maximum53
Range52
Interquartile range (IQR)26

Descriptive statistics

Standard deviation15.443445
Coefficient of variation (CV)0.57197945
Kurtosis-1.2
Mean27
Median Absolute Deviation (MAD)13
Skewness0
Sum1431
Variance238.5
MonotonicityStrictly increasing
2023-12-12T22:17:09.862817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.9%
41 1
 
1.9%
30 1
 
1.9%
31 1
 
1.9%
32 1
 
1.9%
33 1
 
1.9%
34 1
 
1.9%
35 1
 
1.9%
36 1
 
1.9%
37 1
 
1.9%
Other values (43) 43
79.6%
ValueCountFrequency (%)
1 1
1.9%
2 1
1.9%
3 1
1.9%
4 1
1.9%
5 1
1.9%
6 1
1.9%
7 1
1.9%
8 1
1.9%
9 1
1.9%
10 1
1.9%
ValueCountFrequency (%)
53 1
1.9%
52 1
1.9%
51 1
1.9%
50 1
1.9%
49 1
1.9%
48 1
1.9%
47 1
1.9%
46 1
1.9%
45 1
1.9%
44 1
1.9%

지정연도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct13
Distinct (%)24.5%
Missing1
Missing (%)1.9%
Infinite0
Infinite (%)0.0%
Mean2013.6981
Minimum2010
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size618.0 B
2023-12-12T22:17:09.964428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2010
5-th percentile2010
Q12011
median2012
Q32016
95-th percentile2022
Maximum2023
Range13
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.6877391
Coefficient of variation (CV)0.0018313267
Kurtosis0.47827153
Mean2013.6981
Median Absolute Deviation (MAD)2
Skewness1.1364706
Sum106726
Variance13.599419
MonotonicityIncreasing
2023-12-12T22:17:10.060499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
2011 11
20.4%
2010 9
16.7%
2012 8
14.8%
2015 5
9.3%
2013 4
 
7.4%
2017 4
 
7.4%
2016 3
 
5.6%
2014 2
 
3.7%
2022 2
 
3.7%
2023 2
 
3.7%
Other values (3) 3
 
5.6%
ValueCountFrequency (%)
2010 9
16.7%
2011 11
20.4%
2012 8
14.8%
2013 4
 
7.4%
2014 2
 
3.7%
2015 5
9.3%
2016 3
 
5.6%
2017 4
 
7.4%
2018 1
 
1.9%
2019 1
 
1.9%
ValueCountFrequency (%)
2023 2
 
3.7%
2022 2
 
3.7%
2021 1
 
1.9%
2019 1
 
1.9%
2018 1
 
1.9%
2017 4
7.4%
2016 3
5.6%
2015 5
9.3%
2014 2
 
3.7%
2013 4
7.4%

학교명
Text

MISSING 

Distinct53
Distinct (%)100.0%
Missing1
Missing (%)1.9%
Memory size564.0 B
2023-12-12T22:17:10.260703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length8.0754717
Min length5

Characters and Unicode

Total characters428
Distinct characters87
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)100.0%

Sample

1st row상암고등학교
2nd row대구과학기술고등학교
3rd row광주전자공업고등학교
4th row성남테크노과학고등학교
5th row이천제일고등학교
ValueCountFrequency (%)
상암고등학교 1
 
1.9%
전남기술과학고등학교 1
 
1.9%
대전전자디자인고등학교 1
 
1.9%
서울문화고등학교 1
 
1.9%
온양용화고등학교 1
 
1.9%
천안공업고등학교 1
 
1.9%
경기고등학교 1
 
1.9%
태안고등학교 1
 
1.9%
대구농업마이스터고등학교 1
 
1.9%
박문여자고등학교 1
 
1.9%
Other values (43) 43
81.1%
2023-12-12T22:17:10.576002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
63
14.7%
53
 
12.4%
52
 
12.1%
52
 
12.1%
14
 
3.3%
11
 
2.6%
10
 
2.3%
9
 
2.1%
8
 
1.9%
8
 
1.9%
Other values (77) 148
34.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 428
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
63
14.7%
53
 
12.4%
52
 
12.1%
52
 
12.1%
14
 
3.3%
11
 
2.6%
10
 
2.3%
9
 
2.1%
8
 
1.9%
8
 
1.9%
Other values (77) 148
34.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 428
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
63
14.7%
53
 
12.4%
52
 
12.1%
52
 
12.1%
14
 
3.3%
11
 
2.6%
10
 
2.3%
9
 
2.1%
8
 
1.9%
8
 
1.9%
Other values (77) 148
34.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 428
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
63
14.7%
53
 
12.4%
52
 
12.1%
52
 
12.1%
14
 
3.3%
11
 
2.6%
10
 
2.3%
9
 
2.1%
8
 
1.9%
8
 
1.9%
Other values (77) 148
34.6%

교육청
Categorical

Distinct16
Distinct (%)29.6%
Missing0
Missing (%)0.0%
Memory size564.0 B
충남
12 
경기
서울
인천
충북
Other values (11)
21 

Length

Max length4
Median length2
Mean length2.037037
Min length2

Unique

Unique5 ?
Unique (%)9.3%

Sample

1st row서울
2nd row대구
3rd row광주
4th row경기
5th row경기

Common Values

ValueCountFrequency (%)
충남 12
22.2%
경기 8
14.8%
서울 6
11.1%
인천 4
 
7.4%
충북 3
 
5.6%
전남 3
 
5.6%
제주 3
 
5.6%
대전 3
 
5.6%
전북 3
 
5.6%
대구 2
 
3.7%
Other values (6) 7
13.0%

Length

2023-12-12T22:17:10.694953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
충남 12
22.2%
경기 8
14.8%
서울 6
11.1%
인천 4
 
7.4%
충북 3
 
5.6%
전남 3
 
5.6%
제주 3
 
5.6%
대전 3
 
5.6%
전북 3
 
5.6%
대구 2
 
3.7%
Other values (6) 7
13.0%

설립별
Categorical

IMBALANCE 

Distinct3
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size564.0 B
공립
52 
사립
 
1
<NA>
 
1

Length

Max length4
Median length2
Mean length2.037037
Min length2

Unique

Unique2 ?
Unique (%)3.7%

Sample

1st row공립
2nd row공립
3rd row공립
4th row공립
5th row공립

Common Values

ValueCountFrequency (%)
공립 52
96.3%
사립 1
 
1.9%
<NA> 1
 
1.9%

Length

2023-12-12T22:17:10.821326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:17:10.926962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공립 52
96.3%
사립 1
 
1.9%
na 1
 
1.9%

지정유형
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)7.4%
Missing0
Missing (%)0.0%
Memory size564.0 B
교육부 지정
34 
시도 지정
12 
시도지정
<NA>
 
1

Length

Max length6
Median length6
Mean length5.4814815
Min length4

Unique

Unique1 ?
Unique (%)1.9%

Sample

1st row교육부 지정
2nd row교육부 지정
3rd row교육부 지정
4th row교육부 지정
5th row교육부 지정

Common Values

ValueCountFrequency (%)
교육부 지정 34
63.0%
시도 지정 12
 
22.2%
시도지정 7
 
13.0%
<NA> 1
 
1.9%

Length

2023-12-12T22:17:11.016141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:17:11.118399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지정 46
46.0%
교육부 34
34.0%
시도 12
 
12.0%
시도지정 7
 
7.0%
na 1
 
1.0%

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing54
Missing (%)100.0%
Memory size618.0 B

Interactions

2023-12-12T22:17:09.176209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:17:09.006068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:17:09.270543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:17:09.080451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:17:11.186648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번지정연도학교명교육청설립별지정유형
순번1.0000.9651.0000.4470.1740.830
지정연도0.9651.0001.0000.1550.0000.978
학교명1.0001.0001.0001.0001.0001.000
교육청0.4470.1551.0001.0000.0000.492
설립별0.1740.0001.0000.0001.0000.000
지정유형0.8300.9781.0000.4920.0001.000
2023-12-12T22:17:11.273747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설립별교육청지정유형
설립별1.0000.0000.000
교육청0.0001.0000.215
지정유형0.0000.2151.000
2023-12-12T22:17:11.354906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번지정연도교육청설립별지정유형
순번1.0000.9900.1340.1080.704
지정연도0.9901.0000.0000.0000.784
교육청0.1340.0001.0000.0000.215
설립별0.1080.0000.0001.0000.000
지정유형0.7040.7840.2150.0001.000

Missing values

2023-12-12T22:17:09.371543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:17:09.477175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T22:17:09.585255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번지정연도학교명교육청설립별지정유형Unnamed: 6
012010상암고등학교서울공립교육부 지정<NA>
122010대구과학기술고등학교대구공립교육부 지정<NA>
232010광주전자공업고등학교광주공립교육부 지정<NA>
342010성남테크노과학고등학교경기공립교육부 지정<NA>
452010이천제일고등학교경기공립교육부 지정<NA>
562010제천제일고등학교충북공립교육부 지정<NA>
672010공주생명과학고등학교충남공립교육부 지정<NA>
782010목포공업고등학교전남공립교육부 지정<NA>
892010제주고등학교제주공립교육부 지정<NA>
9102011경복고등학교서울공립교육부 지정<NA>
순번지정연도학교명교육청설립별지정유형Unnamed: 6
44452017청양고등학교충남공립시도 지정<NA>
45462017인동고등학교경북공립시도 지정<NA>
46472018서산중앙고등학교충남공립시도지정<NA>
47482019전주생명과학고등학교전북공립시도지정<NA>
48492021전북유니텍고등학교전북공립시도지정<NA>
49502022계산공업고등학교인천공립시도지정<NA>
50512022서귀포산업과학고등학교제주공립시도지정<NA>
51522023갈마중학교대전공립시도지정<NA>
52532023대전전자디자인고등학교대전공립시도지정<NA>
53<NA><NA><NA><NA><NA><NA><NA>