Overview

Dataset statistics

Number of variables14
Number of observations10000
Missing cells10006
Missing cells (%)7.1%
Duplicate rows36
Duplicate rows (%)0.4%
Total size in memory1.2 MiB
Average record size in memory124.0 B

Variable types

Categorical7
Text3
Boolean1
Numeric3

Dataset

Description표시과목별 교원 현황(중,고)(교과별)
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=30L1YTHD4GCWPPV7G73O23357104&infSeq=2

Alerts

제외사유 has constant value ""Constant
Dataset has 36 (0.4%) duplicate rowsDuplicates
제외여부 is highly overall correlated with 남성교원수(명) and 3 other fieldsHigh correlation
교과명 is highly overall correlated with 제외여부High correlation
지역명 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
시군명 is highly overall correlated with 지역교육청명 and 1 other fieldsHigh correlation
남성교원수(명) is highly overall correlated with 합계교원수(명) and 1 other fieldsHigh correlation
여성교원수(명) is highly overall correlated with 합계교원수(명) and 1 other fieldsHigh correlation
합계교원수(명) is highly overall correlated with 남성교원수(명) and 2 other fieldsHigh correlation
지역교육청명 is highly overall correlated with 시군명 and 2 other fieldsHigh correlation
학교급명 is highly overall correlated with 지역교육청명High correlation
제외여부 is highly imbalanced (99.7%)Imbalance
제외사유 has 9998 (> 99.9%) missing valuesMissing
남성교원수(명) has 3881 (38.8%) zerosZeros
여성교원수(명) has 1623 (16.2%) zerosZeros

Reproduction

Analysis started2023-12-10 22:29:51.437241
Analysis finished2023-12-10 22:29:54.091669
Duration2.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년도
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2018
3047 
2017
1774 
2019
1752 
2016
1739 
2015
1688 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2015
2nd row2018
3rd row2018
4th row2019
5th row2017

Common Values

ValueCountFrequency (%)
2018 3047
30.5%
2017 1774
17.7%
2019 1752
17.5%
2016 1739
17.4%
2015 1688
16.9%

Length

2023-12-11T07:29:54.142537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:29:54.227329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 3047
30.5%
2017 1774
17.7%
2019 1752
17.5%
2016 1739
17.4%
2015 1688
16.9%

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수원시
928 
용인시
791 
성남시
739 
고양시
700 
부천시
 
556
Other values (26)
6286 

Length

Max length4
Median length3
Mean length3.0909
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의왕시
2nd row남양주시
3rd row남양주시
4th row안산시
5th row부천시

Common Values

ValueCountFrequency (%)
수원시 928
 
9.3%
용인시 791
 
7.9%
성남시 739
 
7.4%
고양시 700
 
7.0%
부천시 556
 
5.6%
화성시 532
 
5.3%
안산시 472
 
4.7%
남양주시 467
 
4.7%
안양시 429
 
4.3%
평택시 397
 
4.0%
Other values (21) 3989
39.9%

Length

2023-12-11T07:29:54.331183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수원시 928
 
9.3%
용인시 791
 
7.9%
성남시 739
 
7.4%
고양시 700
 
7.0%
부천시 556
 
5.6%
화성시 532
 
5.3%
안산시 472
 
4.7%
남양주시 467
 
4.7%
안양시 429
 
4.3%
평택시 397
 
4.0%
Other values (21) 3989
39.9%

지역교육청명
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도교육청
3769 
경기도수원교육지원청
559 
경기도용인교육지원청
541 
경기도화성오산교육지원청
457 
경기도성남교육지원청
448 
Other values (21)
4226 

Length

Max length13
Median length12
Mean length8.9102
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도군포의왕교육지원청
2nd row경기도구리남양주교육지원청
3rd row경기도구리남양주교육지원청
4th row경기도안산교육지원청
5th row경기도교육청

Common Values

ValueCountFrequency (%)
경기도교육청 3769
37.7%
경기도수원교육지원청 559
 
5.6%
경기도용인교육지원청 541
 
5.4%
경기도화성오산교육지원청 457
 
4.6%
경기도성남교육지원청 448
 
4.5%
경기도고양교육지원청 417
 
4.2%
경기도구리남양주교육지원청 412
 
4.1%
경기도부천교육지원청 316
 
3.2%
경기도안산교육지원청 278
 
2.8%
경기도안양과천교육지원청 276
 
2.8%
Other values (16) 2527
25.3%

Length

2023-12-11T07:29:54.429688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도교육청 3769
37.7%
경기도수원교육지원청 559
 
5.6%
경기도용인교육지원청 541
 
5.4%
경기도화성오산교육지원청 457
 
4.6%
경기도성남교육지원청 448
 
4.5%
경기도고양교육지원청 417
 
4.2%
경기도구리남양주교육지원청 412
 
4.1%
경기도부천교육지원청 316
 
3.2%
경기도안산교육지원청 278
 
2.8%
경기도안양과천교육지원청 276
 
2.8%
Other values (16) 2527
25.3%

지역명
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도 부천시
 
556
경기도 화성시
 
532
경기도 남양주시
 
467
경기도 성남시 분당구
 
455
경기도 평택시
 
397
Other values (37)
7593 

Length

Max length12
Median length7
Mean length8.7567
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 의왕시
2nd row경기도 남양주시
3rd row경기도 남양주시
4th row경기도 안산시 단원구
5th row경기도 부천시

Common Values

ValueCountFrequency (%)
경기도 부천시 556
 
5.6%
경기도 화성시 532
 
5.3%
경기도 남양주시 467
 
4.7%
경기도 성남시 분당구 455
 
4.5%
경기도 평택시 397
 
4.0%
경기도 파주시 366
 
3.7%
경기도 의정부시 332
 
3.3%
경기도 시흥시 330
 
3.3%
경기도 김포시 328
 
3.3%
경기도 용인시 기흥구 315
 
3.1%
Other values (32) 5922
59.2%

Length

2023-12-11T07:29:54.533864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 10000
41.6%
수원시 928
 
3.9%
용인시 791
 
3.3%
성남시 739
 
3.1%
고양시 700
 
2.9%
부천시 556
 
2.3%
화성시 532
 
2.2%
안산시 472
 
2.0%
남양주시 467
 
1.9%
분당구 455
 
1.9%
Other values (39) 8419
35.0%
Distinct1130
Distinct (%)11.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:29:54.950144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length5
Mean length5.9982
Min length5

Characters and Unicode

Total characters59982
Distinct characters274
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row고천중학교
2nd row도농중학교
3rd row예봉중학교
4th row별망중학교
5th row부천고등학교
ValueCountFrequency (%)
늘푸른중학교 18
 
0.2%
화홍중학교 17
 
0.2%
마석중학교 17
 
0.2%
용호중학교 17
 
0.2%
상하중학교 17
 
0.2%
행신중학교 17
 
0.2%
향남중학교 17
 
0.2%
상현중학교 17
 
0.2%
대송중학교 16
 
0.2%
궁내중학교 16
 
0.2%
Other values (1121) 9841
98.3%
2023-12-11T07:29:55.281361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10224
17.0%
10176
17.0%
6453
 
10.8%
3987
 
6.6%
3808
 
6.3%
622
 
1.0%
611
 
1.0%
604
 
1.0%
597
 
1.0%
593
 
1.0%
Other values (264) 22307
37.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59797
99.7%
Lowercase Letter 135
 
0.2%
Uppercase Letter 40
 
0.1%
Space Separator 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10224
17.1%
10176
17.0%
6453
 
10.8%
3987
 
6.7%
3808
 
6.4%
622
 
1.0%
611
 
1.0%
604
 
1.0%
597
 
1.0%
593
 
1.0%
Other values (251) 22122
37.0%
Lowercase Letter
ValueCountFrequency (%)
s 40
29.6%
i 20
14.8%
n 20
14.8%
e 15
 
11.1%
g 10
 
7.4%
l 10
 
7.4%
h 10
 
7.4%
u 10
 
7.4%
Uppercase Letter
ValueCountFrequency (%)
E 10
25.0%
T 10
25.0%
I 10
25.0%
B 10
25.0%
Space Separator
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59797
99.7%
Latin 175
 
0.3%
Common 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10224
17.1%
10176
17.0%
6453
 
10.8%
3987
 
6.7%
3808
 
6.4%
622
 
1.0%
611
 
1.0%
604
 
1.0%
597
 
1.0%
593
 
1.0%
Other values (251) 22122
37.0%
Latin
ValueCountFrequency (%)
s 40
22.9%
i 20
11.4%
n 20
11.4%
e 15
 
8.6%
g 10
 
5.7%
l 10
 
5.7%
h 10
 
5.7%
E 10
 
5.7%
u 10
 
5.7%
T 10
 
5.7%
Other values (2) 20
11.4%
Common
ValueCountFrequency (%)
10
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59797
99.7%
ASCII 185
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10224
17.1%
10176
17.0%
6453
 
10.8%
3987
 
6.7%
3808
 
6.4%
622
 
1.0%
611
 
1.0%
604
 
1.0%
597
 
1.0%
593
 
1.0%
Other values (251) 22122
37.0%
ASCII
ValueCountFrequency (%)
s 40
21.6%
i 20
10.8%
n 20
10.8%
e 15
 
8.1%
10
 
5.4%
g 10
 
5.4%
l 10
 
5.4%
h 10
 
5.4%
E 10
 
5.4%
u 10
 
5.4%
Other values (3) 30
16.2%

학교급명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
중학교
4455 
고등학교
3730 
<NA>
1752 
방통고
 
39
방통중
 
24

Length

Max length4
Median length4
Mean length3.5482
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중학교
2nd row중학교
3rd row중학교
4th row<NA>
5th row고등학교

Common Values

ValueCountFrequency (%)
중학교 4455
44.5%
고등학교 3730
37.3%
<NA> 1752
 
17.5%
방통고 39
 
0.4%
방통중 24
 
0.2%

Length

2023-12-11T07:29:55.408467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:29:55.498604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중학교 4455
44.5%
고등학교 3730
37.3%
na 1752
 
17.5%
방통고 39
 
0.4%
방통중 24
 
0.2%

설립구분명
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공립
8148 
사립
1852 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공립
2nd row공립
3rd row공립
4th row공립
5th row공립

Common Values

ValueCountFrequency (%)
공립 8148
81.5%
사립 1852
 
18.5%

Length

2023-12-11T07:29:55.594990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:29:55.672865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공립 8148
81.5%
사립 1852
 
18.5%

제외여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
9998 
True
 
2
ValueCountFrequency (%)
False 9998
> 99.9%
True 2
 
< 0.1%
2023-12-11T07:29:55.743825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

제외사유
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)50.0%
Missing9998
Missing (%)> 99.9%
Memory size156.2 KiB
2023-12-11T07:29:55.857612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length29
Mean length29
Min length29

Characters and Unicode

Total characters58
Distinct characters24
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row본교는 해당항목에 대해서 통계자료가 없으므로 제외함.
2nd row본교는 해당항목에 대해서 통계자료가 없으므로 제외함.
ValueCountFrequency (%)
본교는 2
16.7%
해당항목에 2
16.7%
대해서 2
16.7%
통계자료가 2
16.7%
없으므로 2
16.7%
제외함 2
16.7%
2023-12-11T07:29:56.138803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
17.2%
4
 
6.9%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
2
 
3.4%
Other values (14) 28
48.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46
79.3%
Space Separator 10
 
17.2%
Other Punctuation 2
 
3.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
8.7%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
Other values (12) 24
52.2%
Space Separator
ValueCountFrequency (%)
10
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46
79.3%
Common 12
 
20.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
8.7%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
Other values (12) 24
52.2%
Common
ValueCountFrequency (%)
10
83.3%
. 2
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46
79.3%
ASCII 12
 
20.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10
83.3%
. 2
 
16.7%
Hangul
ValueCountFrequency (%)
4
 
8.7%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
2
 
4.3%
Other values (12) 24
52.2%
Distinct85
Distinct (%)0.9%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-11T07:29:56.331889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length33
Mean length17.194439
Min length2

Characters and Unicode

Total characters171910
Distinct characters106
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.1%

Sample

1st row공통교육과정-교과
2nd row공통교육과정-교과-체육
3rd row공통교육과정-교과-과학/기술·가정
4th row공통교육과정-교과-사회(역사포함)/도덕
5th row고등학교선택교육과정-교과-보통교과-생활교양-기술·가정/제2외국어/한문/교양
ValueCountFrequency (%)
공통교육과정-교과 1805
18.0%
공통교육과정-교과-국어 603
 
6.0%
공통교육과정-교과-사회(역사포함)/도덕 543
 
5.4%
공통교육과정-교과-수학 532
 
5.3%
공통교육과정-교과-체육 523
 
5.2%
고등학교선택교육과정-교과-보통교과-기초 499
 
5.0%
공통교육과정-교과-영어 478
 
4.8%
공통교육과정-교과-선택 456
 
4.5%
공통교육과정-교과-예술(음악/미술 453
 
4.5%
고등학교선택교육과정-교과-보통교과-체육예술 366
 
3.6%
Other values (80) 3791
37.7%
2023-12-11T07:29:56.664002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26087
15.2%
23444
13.6%
- 22654
13.2%
11623
 
6.8%
11083
 
6.4%
9463
 
5.5%
6133
 
3.6%
4590
 
2.7%
4128
 
2.4%
4128
 
2.4%
Other values (96) 48577
28.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 141582
82.4%
Dash Punctuation 22654
 
13.2%
Other Punctuation 3805
 
2.2%
Close Punctuation 1626
 
0.9%
Open Punctuation 1626
 
0.9%
Decimal Number 533
 
0.3%
Space Separator 51
 
< 0.1%
Letter Number 33
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26087
18.4%
23444
16.6%
11623
 
8.2%
11083
 
7.8%
9463
 
6.7%
6133
 
4.3%
4590
 
3.2%
4128
 
2.9%
4128
 
2.9%
3584
 
2.5%
Other values (83) 37319
26.4%
Decimal Number
ValueCountFrequency (%)
2 293
55.0%
0 160
30.0%
9 80
 
15.0%
Other Punctuation
ValueCountFrequency (%)
/ 2794
73.4%
· 1011
 
26.6%
Close Punctuation
ValueCountFrequency (%)
) 1405
86.4%
] 221
 
13.6%
Open Punctuation
ValueCountFrequency (%)
( 1405
86.4%
[ 221
 
13.6%
Letter Number
ValueCountFrequency (%)
27
81.8%
6
 
18.2%
Dash Punctuation
ValueCountFrequency (%)
- 22654
100.0%
Space Separator
ValueCountFrequency (%)
51
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 141582
82.4%
Common 30295
 
17.6%
Latin 33
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26087
18.4%
23444
16.6%
11623
 
8.2%
11083
 
7.8%
9463
 
6.7%
6133
 
4.3%
4590
 
3.2%
4128
 
2.9%
4128
 
2.9%
3584
 
2.5%
Other values (83) 37319
26.4%
Common
ValueCountFrequency (%)
- 22654
74.8%
/ 2794
 
9.2%
) 1405
 
4.6%
( 1405
 
4.6%
· 1011
 
3.3%
2 293
 
1.0%
] 221
 
0.7%
[ 221
 
0.7%
0 160
 
0.5%
9 80
 
0.3%
Latin
ValueCountFrequency (%)
27
81.8%
6
 
18.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 141582
82.4%
ASCII 29284
 
17.0%
None 1011
 
0.6%
Number Forms 33
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
26087
18.4%
23444
16.6%
11623
 
8.2%
11083
 
7.8%
9463
 
6.7%
6133
 
4.3%
4590
 
3.2%
4128
 
2.9%
4128
 
2.9%
3584
 
2.5%
Other values (83) 37319
26.4%
ASCII
ValueCountFrequency (%)
- 22654
77.4%
/ 2794
 
9.5%
) 1405
 
4.8%
( 1405
 
4.8%
2 293
 
1.0%
] 221
 
0.8%
[ 221
 
0.8%
0 160
 
0.5%
9 80
 
0.3%
51
 
0.2%
None
ValueCountFrequency (%)
· 1011
100.0%
Number Forms
ValueCountFrequency (%)
27
81.8%
6
 
18.2%

교과명
Categorical

HIGH CORRELATION 

Distinct46
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
국어
1286 
체육
1195 
수학
1144 
영어
1138 
예술(음악/미술)
1025 
Other values (41)
4212 

Length

Max length17
Median length2
Mean length5.058
Min length2

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row예술(음악/미술)
2nd row체육
3rd row과학/기술·가정
4th row사회(역사포함)/도덕
5th row기술·가정/제2외국어/한문/교양

Common Values

ValueCountFrequency (%)
국어 1286
12.9%
체육 1195
11.9%
수학 1144
11.4%
영어 1138
11.4%
예술(음악/미술) 1025
10.2%
선택 920
9.2%
사회(역사포함)/도덕 763
7.6%
과학/기술·가정 555
5.5%
사회(역사/도덕포함) 433
 
4.3%
과학 428
 
4.3%
Other values (36) 1113
11.1%

Length

2023-12-11T07:29:56.803181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
국어 1286
12.8%
체육 1196
11.9%
수학 1144
11.4%
영어 1138
11.3%
예술(음악/미술 1025
10.2%
선택 920
9.1%
사회(역사포함)/도덕 763
7.6%
과학/기술·가정 555
5.5%
사회(역사/도덕포함 433
 
4.3%
과학 430
 
4.3%
Other values (40) 1182
11.7%

남성교원수(명)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct30
Distinct (%)0.3%
Missing2
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean1.3759752
Minimum0
Maximum72
Zeros3881
Zeros (%)38.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:29:56.968652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile5
Maximum72
Range72
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.2525304
Coefficient of variation (CV)1.6370429
Kurtosis177.47282
Mean1.3759752
Median Absolute Deviation (MAD)1
Skewness8.9821814
Sum13757
Variance5.0738931
MonotonicityNot monotonic
2023-12-11T07:29:57.157016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
0 3881
38.8%
1 2979
29.8%
2 1462
 
14.6%
3 730
 
7.3%
4 399
 
4.0%
5 227
 
2.3%
6 119
 
1.2%
7 72
 
0.7%
8 33
 
0.3%
9 26
 
0.3%
Other values (20) 70
 
0.7%
ValueCountFrequency (%)
0 3881
38.8%
1 2979
29.8%
2 1462
 
14.6%
3 730
 
7.3%
4 399
 
4.0%
5 227
 
2.3%
6 119
 
1.2%
7 72
 
0.7%
8 33
 
0.3%
9 26
 
0.3%
ValueCountFrequency (%)
72 1
 
< 0.1%
53 1
 
< 0.1%
44 1
 
< 0.1%
38 1
 
< 0.1%
37 1
 
< 0.1%
33 2
< 0.1%
31 2
< 0.1%
30 1
 
< 0.1%
26 2
< 0.1%
25 3
< 0.1%

여성교원수(명)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct30
Distinct (%)0.3%
Missing2
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean2.6017203
Minimum0
Maximum45
Zeros1623
Zeros (%)16.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:29:57.289057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q34
95-th percentile7
Maximum45
Range45
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.5956938
Coefficient of variation (CV)0.99768363
Kurtosis18.859415
Mean2.6017203
Median Absolute Deviation (MAD)1
Skewness2.6756487
Sum26012
Variance6.7376263
MonotonicityNot monotonic
2023-12-11T07:29:57.415062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1 2547
25.5%
2 1915
19.1%
0 1623
16.2%
3 1217
12.2%
4 922
 
9.2%
5 609
 
6.1%
6 399
 
4.0%
7 272
 
2.7%
8 177
 
1.8%
9 121
 
1.2%
Other values (20) 196
 
2.0%
ValueCountFrequency (%)
0 1623
16.2%
1 2547
25.5%
2 1915
19.1%
3 1217
12.2%
4 922
 
9.2%
5 609
 
6.1%
6 399
 
4.0%
7 272
 
2.7%
8 177
 
1.8%
9 121
 
1.2%
ValueCountFrequency (%)
45 1
< 0.1%
33 2
< 0.1%
31 1
< 0.1%
29 1
< 0.1%
26 1
< 0.1%
25 2
< 0.1%
24 2
< 0.1%
22 1
< 0.1%
21 2
< 0.1%
20 2
< 0.1%

합계교원수(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct43
Distinct (%)0.4%
Missing2
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean3.9776955
Minimum0
Maximum86
Zeros27
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:29:57.568542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q35
95-th percentile11
Maximum86
Range86
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.7701813
Coefficient of variation (CV)0.94783053
Kurtosis59.717126
Mean3.9776955
Median Absolute Deviation (MAD)2
Skewness5.056238
Sum39769
Variance14.214267
MonotonicityNot monotonic
2023-12-11T07:29:57.736349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
2 2221
22.2%
1 2074
20.7%
3 1480
14.8%
4 1248
12.5%
5 754
 
7.5%
6 568
 
5.7%
7 376
 
3.8%
9 277
 
2.8%
8 265
 
2.6%
10 196
 
2.0%
Other values (33) 539
 
5.4%
ValueCountFrequency (%)
0 27
 
0.3%
1 2074
20.7%
2 2221
22.2%
3 1480
14.8%
4 1248
12.5%
5 754
 
7.5%
6 568
 
5.7%
7 376
 
3.8%
8 265
 
2.6%
9 277
 
2.8%
ValueCountFrequency (%)
86 1
< 0.1%
67 1
< 0.1%
66 1
< 0.1%
56 1
< 0.1%
47 1
< 0.1%
46 1
< 0.1%
45 1
< 0.1%
44 1
< 0.1%
43 1
< 0.1%
41 1
< 0.1%

Interactions

2023-12-11T07:29:53.466808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:52.936336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:53.206467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:53.550354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:53.020170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:53.301874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:53.620054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:53.112383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:29:53.393331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:29:57.850137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도시군명지역교육청명지역명학교급명설립구분명제외여부상위교과명교과명남성교원수(명)여성교원수(명)합계교원수(명)
기준년도1.0000.0570.3580.0590.0530.0630.0190.8570.3720.0780.2400.220
시군명0.0571.0000.9841.0000.1960.3740.1090.0000.0550.0700.1870.122
지역교육청명0.3580.9841.0000.9870.8180.3750.0000.6590.4470.0380.1980.270
지역명0.0591.0000.9871.0000.2470.4900.1090.0510.0160.1450.2200.157
학교급명0.0530.1960.8180.2471.0000.2850.0000.8760.5940.1150.1660.262
설립구분명0.0630.3740.3750.4900.2851.0000.0000.2580.1810.1640.1260.056
제외여부0.0190.1090.0000.1090.0000.0001.000NaNNaNNaNNaNNaN
상위교과명0.8570.0000.6590.0510.8760.258NaN1.0000.9990.5590.6010.688
교과명0.3720.0550.4470.0160.5940.181NaN0.9991.0000.6340.6120.676
남성교원수(명)0.0780.0700.0380.1450.1150.164NaN0.5590.6341.0000.6650.940
여성교원수(명)0.2400.1870.1980.2200.1660.126NaN0.6010.6120.6651.0000.919
합계교원수(명)0.2200.1220.2700.1570.2620.056NaN0.6880.6760.9400.9191.000
2023-12-11T07:29:58.004161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
제외여부지역교육청명교과명기준년도설립구분명지역명시군명학교급명
제외여부1.0000.0001.0000.0230.0000.0860.0930.000
지역교육청명0.0001.0000.1120.1800.2970.7740.7740.585
교과명1.0000.1121.0000.1700.1500.0030.0120.341
기준년도0.0230.1800.1701.0000.0770.0270.0270.021
설립구분명0.0000.2970.1500.0771.0000.3910.3190.189
지역명0.0860.7740.0030.0270.3911.0000.9990.127
시군명0.0930.7740.0120.0270.3190.9991.0000.103
학교급명0.0000.5850.3410.0210.1890.1270.1031.000
2023-12-11T07:29:58.146976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
남성교원수(명)여성교원수(명)합계교원수(명)기준년도시군명지역교육청명지역명학교급명설립구분명제외여부교과명
남성교원수(명)1.000-0.0370.5100.0450.0260.0140.0530.0740.1641.0000.266
여성교원수(명)-0.0371.0000.7900.1400.0690.0690.0820.0950.1271.0000.252
합계교원수(명)0.5100.7901.0000.1290.0460.1050.0570.1700.0561.0000.296
기준년도0.0450.1400.1291.0000.0270.1800.0270.0210.0770.0230.170
시군명0.0260.0690.0460.0271.0000.7740.9990.1030.3190.0930.012
지역교육청명0.0140.0690.1050.1800.7741.0000.7740.5850.2970.0000.112
지역명0.0530.0820.0570.0270.9990.7741.0000.1270.3910.0860.003
학교급명0.0740.0950.1700.0210.1030.5850.1271.0000.1890.0000.341
설립구분명0.1640.1270.0560.0770.3190.2970.3910.1891.0000.0000.150
제외여부1.0001.0001.0000.0230.0930.0000.0860.0000.0001.0001.000
교과명0.2660.2520.2960.1700.0120.1120.0030.3410.1501.0001.000

Missing values

2023-12-11T07:29:53.730001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:29:53.887323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:29:54.013562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유상위교과명교과명남성교원수(명)여성교원수(명)합계교원수(명)
527962015의왕시경기도군포의왕교육지원청경기도 의왕시고천중학교중학교공립N<NA>공통교육과정-교과예술(음악/미술)022
129792018남양주시경기도구리남양주교육지원청경기도 남양주시도농중학교중학교공립N<NA>공통교육과정-교과-체육체육202
134842018남양주시경기도구리남양주교육지원청경기도 남양주시예봉중학교중학교공립N<NA>공통교육과정-교과-과학/기술·가정과학/기술·가정257
49792019안산시경기도안산교육지원청경기도 안산시 단원구별망중학교<NA>공립N<NA>공통교육과정-교과-사회(역사포함)/도덕사회(역사포함)/도덕123
289922017부천시경기도교육청경기도 부천시부천고등학교고등학교공립N<NA>고등학교선택교육과정-교과-보통교과-생활교양-기술·가정/제2외국어/한문/교양기술·가정/제2외국어/한문/교양156
526582015용인시경기도용인교육지원청경기도 용인시 기흥구구성중학교중학교공립N<NA>공통교육과정-교과체육325
154332018성남시경기도성남교육지원청경기도 성남시 분당구낙원중학교중학교공립N<NA>공통교육과정-교과-선택선택011
329632017오산시경기도화성오산교육지원청경기도 오산시오산중학교중학교사립N<NA>공통교육과정-교과-국어국어415
128062018김포시경기도교육청경기도 김포시장기고등학교고등학교공립N<NA>선택중심교육과정-보통교과-기초-국어국어022
218132018용인시경기도용인교육지원청경기도 용인시 기흥구언동중학교중학교공립N<NA>공통교육과정-교과-체육체육202
기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유상위교과명교과명남성교원수(명)여성교원수(명)합계교원수(명)
127372018김포시경기도김포교육지원청경기도 김포시금파중학교중학교공립N<NA>기본교육과정-교과-선택선택011
360942016고양시경기도교육청경기도 고양시 일산동구중산고등학교고등학교공립N<NA>고등학교선택교육과정-교과-보통교과-체육예술예술(음악/미술)224
247852018평택시경기도교육청경기도 평택시현화고등학교고등학교공립N<NA>고등학교선택교육과정-교과-보통교과-탐구-사회(역사/도덕포함)사회(역사/도덕포함)257
343502017파주시경기도교육청경기도 파주시파주여자고등학교고등학교사립N<NA>고등학교선택교육과정-교과-보통교과-기초-국어국어224
314402017안산시경기도교육청경기도 안산시 상록구경기모바일과학고등학교고등학교공립N<NA>고등학교선택교육과정-교과-보통교과-기초-영어영어123
325462017양평군경기도양평교육지원청경기도 양평군개군중학교중학교사립N<NA>공통교육과정-교과-사회(역사포함)/도덕사회(역사포함)/도덕202
397902016수원시경기도교육청경기도 수원시 팔달구수원고등학교고등학교사립N<NA>고등학교선택교육과정-교과-보통교과-체육예술체육505
108582018고양시경기도고양교육지원청경기도 고양시 일산동구풍산중학교중학교공립N<NA>공통교육과정-교과-체육체육101
91232019화성시경기도화성오산교육지원청경기도 화성시예당중학교<NA>공립N<NA>공통교육과정-교과-수학수학022
433292016의왕시경기도군포의왕교육지원청경기도 의왕시갈뫼중학교중학교공립N<NA>공통교육과정-교과영어145

Duplicate rows

Most frequently occurring

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유상위교과명교과명남성교원수(명)여성교원수(명)합계교원수(명)# duplicates
02017하남시경기도교육청경기도 하남시한국애니메이션고등학교고등학교공립Y본교는 해당항목에 대해서 통계자료가 없으므로 제외함.<NA><NA><NA><NA><NA>2
12018부천시경기도부천교육지원청경기도 부천시까치울중학교중학교공립N<NA>공통교육과정-교과-선택선택0112
22018성남시경기도성남교육지원청경기도 성남시 분당구삼평중학교부설방송통신중학교방통중공립N<NA>공통교육과정-교과체육1012
32018수원시경기도수원교육지원청경기도 수원시 권선구서호중학교중학교공립N<NA>공통교육과정-교과-국어국어0112
42018안양시경기도안양과천교육지원청경기도 안양시 동안구신기중학교중학교공립N<NA>공통교육과정-교과-국어국어0222
52018안양시경기도안양과천교육지원청경기도 안양시 만안구박달중학교중학교공립N<NA>공통교육과정-교과-영어영어0112
62018양주시경기도동두천양주교육지원청경기도 양주시회천중학교중학교공립N<NA>기본교육과정-교과-선택선택0112
72018여주시경기도여주교육지원청경기도 여주시여주여자중학교중학교공립N<NA>공통교육과정-교과-선택선택0112
82018용인시경기도용인교육지원청경기도 용인시 기흥구용인신릉중학교중학교공립N<NA>공통교육과정-교과-예술(음악/미술)예술(음악/미술)0112
92018이천시경기도이천교육지원청경기도 이천시이천사동중학교중학교공립N<NA>공통교육과정-교과-예술(음악/미술)예술(음악/미술)0222