Overview

Dataset statistics

Number of variables14
Number of observations10000
Missing cells85
Missing cells (%)0.1%
Duplicate rows8
Duplicate rows (%)0.1%
Total size in memory1.2 MiB
Average record size in memory124.0 B

Variable types

Categorical7
Text3
Boolean1
Numeric3

Dataset

Description표시과목별 교원 현황(중,고)(과목별)
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=0CF5AJX4YVA0AH8ZDEL223349111&infSeq=2

Alerts

Dataset has 8 (0.1%) duplicate rowsDuplicates
기준년도 is highly overall correlated with 제외사유High correlation
지역명 is highly overall correlated with 시군명 and 2 other fieldsHigh correlation
학교급명 is highly overall correlated with 지역교육청명 and 1 other fieldsHigh correlation
제외여부 is highly overall correlated with 남성교원수(명) and 3 other fieldsHigh correlation
제외사유 is highly overall correlated with 기준년도 and 6 other fieldsHigh correlation
지역교육청명 is highly overall correlated with 시군명 and 3 other fieldsHigh correlation
설립구분명 is highly overall correlated with 제외사유High correlation
시군명 is highly overall correlated with 지역교육청명 and 2 other fieldsHigh correlation
남성교원수(명) is highly overall correlated with 제외여부High correlation
여성교원수(명) is highly overall correlated with 합계교원수(명) and 1 other fieldsHigh correlation
합계교원수(명) is highly overall correlated with 여성교원수(명) and 1 other fieldsHigh correlation
제외여부 is highly imbalanced (99.2%)Imbalance
제외사유 is highly imbalanced (99.2%)Imbalance
남성교원수(명) has 5436 (54.4%) zerosZeros
여성교원수(명) has 2892 (28.9%) zerosZeros
합계교원수(명) has 225 (2.2%) zerosZeros

Reproduction

Analysis started2023-12-10 21:43:54.544679
Analysis finished2023-12-10 21:43:57.229097
Duration2.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년도
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2018
4255 
2017
3488 
2016
2257 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2017
3rd row2018
4th row2018
5th row2016

Common Values

ValueCountFrequency (%)
2018 4255
42.5%
2017 3488
34.9%
2016 2257
22.6%

Length

2023-12-11T06:43:57.287390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:43:57.380640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 4255
42.5%
2017 3488
34.9%
2016 2257
22.6%

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수원시
1105 
고양시
910 
성남시
887 
부천시
733 
안산시
632 
Other values (26)
5733 

Length

Max length4
Median length3
Mean length3.0967
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고양시
2nd row양주시
3rd row성남시
4th row동두천시
5th row군포시

Common Values

ValueCountFrequency (%)
수원시 1105
 
11.1%
고양시 910
 
9.1%
성남시 887
 
8.9%
부천시 733
 
7.3%
안산시 632
 
6.3%
용인시 580
 
5.8%
남양주시 558
 
5.6%
화성시 397
 
4.0%
시흥시 376
 
3.8%
안양시 360
 
3.6%
Other values (21) 3462
34.6%

Length

2023-12-11T06:43:57.473946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수원시 1105
 
11.1%
고양시 910
 
9.1%
성남시 887
 
8.9%
부천시 733
 
7.3%
안산시 632
 
6.3%
용인시 580
 
5.8%
남양주시 558
 
5.6%
화성시 397
 
4.0%
시흥시 376
 
3.8%
안양시 360
 
3.6%
Other values (21) 3462
34.6%

지역교육청명
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도교육청
6374 
경기도수원교육지원청
 
365
경기도성남교육지원청
 
326
경기도고양교육지원청
 
277
경기도구리남양주교육지원청
 
277
Other values (21)
2381 

Length

Max length13
Median length6
Mean length7.6846
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도교육청
2nd row경기도동두천양주교육지원청
3rd row경기도교육청
4th row경기도교육청
5th row경기도교육청

Common Values

ValueCountFrequency (%)
경기도교육청 6374
63.7%
경기도수원교육지원청 365
 
3.6%
경기도성남교육지원청 326
 
3.3%
경기도고양교육지원청 277
 
2.8%
경기도구리남양주교육지원청 277
 
2.8%
경기도부천교육지원청 251
 
2.5%
경기도용인교육지원청 244
 
2.4%
경기도화성오산교육지원청 229
 
2.3%
경기도안산교육지원청 216
 
2.2%
경기도시흥교육지원청 142
 
1.4%
Other values (16) 1299
 
13.0%

Length

2023-12-11T06:43:57.613798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도교육청 6374
63.7%
경기도수원교육지원청 365
 
3.6%
경기도성남교육지원청 326
 
3.3%
경기도고양교육지원청 277
 
2.8%
경기도구리남양주교육지원청 277
 
2.8%
경기도부천교육지원청 251
 
2.5%
경기도용인교육지원청 244
 
2.4%
경기도화성오산교육지원청 229
 
2.3%
경기도안산교육지원청 216
 
2.2%
경기도시흥교육지원청 142
 
1.4%
Other values (16) 1299
 
13.0%

지역명
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도 부천시
 
733
경기도 남양주시
 
558
경기도 성남시 분당구
 
551
경기도 화성시
 
397
경기도 시흥시
 
376
Other values (37)
7385 

Length

Max length12
Median length11
Mean length8.9423
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 고양시 일산서구
2nd row경기도 양주시
3rd row경기도 성남시 중원구
4th row경기도 동두천시
5th row경기도 군포시

Common Values

ValueCountFrequency (%)
경기도 부천시 733
 
7.3%
경기도 남양주시 558
 
5.6%
경기도 성남시 분당구 551
 
5.5%
경기도 화성시 397
 
4.0%
경기도 시흥시 376
 
3.8%
경기도 고양시 덕양구 350
 
3.5%
경기도 김포시 349
 
3.5%
경기도 수원시 영통구 338
 
3.4%
경기도 평택시 325
 
3.2%
경기도 고양시 일산서구 324
 
3.2%
Other values (32) 5699
57.0%

Length

2023-12-11T06:43:57.746299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 10000
40.9%
수원시 1105
 
4.5%
고양시 910
 
3.7%
성남시 887
 
3.6%
부천시 733
 
3.0%
안산시 632
 
2.6%
용인시 580
 
2.4%
남양주시 558
 
2.3%
분당구 551
 
2.3%
화성시 397
 
1.6%
Other values (39) 8121
33.2%
Distinct1118
Distinct (%)11.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T06:43:57.995225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length18
Mean length6.419
Min length5

Characters and Unicode

Total characters64190
Distinct characters273
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)0.2%

Sample

1st row대화고등학교
2nd row덕정중학교
3rd row성남테크노과학고등학교
4th row동두천외국어고등학교
5th row군포e비즈니스고등학교
ValueCountFrequency (%)
수원농생명과학고등학교 32
 
0.3%
삼일공업고등학교 29
 
0.3%
신일비즈니스고등학교 29
 
0.3%
부천공업고등학교 29
 
0.3%
분당대진고등학교 29
 
0.3%
동두천중앙고등학교 28
 
0.3%
오남고등학교 27
 
0.3%
고양예술고등학교 27
 
0.3%
경기영상과학고등학교 27
 
0.3%
유신고등학교 27
 
0.3%
Other values (1109) 9728
97.2%
2023-12-11T06:43:58.395665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10311
16.1%
10178
15.9%
6658
 
10.4%
6426
 
10.0%
3894
 
6.1%
730
 
1.1%
631
 
1.0%
628
 
1.0%
621
 
1.0%
609
 
0.9%
Other values (263) 23504
36.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 63966
99.7%
Lowercase Letter 174
 
0.3%
Uppercase Letter 38
 
0.1%
Space Separator 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10311
16.1%
10178
15.9%
6658
 
10.4%
6426
 
10.0%
3894
 
6.1%
730
 
1.1%
631
 
1.0%
628
 
1.0%
621
 
1.0%
609
 
1.0%
Other values (250) 23280
36.4%
Lowercase Letter
ValueCountFrequency (%)
s 48
27.6%
e 30
17.2%
n 24
13.8%
i 24
13.8%
g 12
 
6.9%
l 12
 
6.9%
h 12
 
6.9%
u 12
 
6.9%
Uppercase Letter
ValueCountFrequency (%)
E 12
31.6%
B 12
31.6%
T 7
18.4%
I 7
18.4%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 63966
99.7%
Latin 212
 
0.3%
Common 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10311
16.1%
10178
15.9%
6658
 
10.4%
6426
 
10.0%
3894
 
6.1%
730
 
1.1%
631
 
1.0%
628
 
1.0%
621
 
1.0%
609
 
1.0%
Other values (250) 23280
36.4%
Latin
ValueCountFrequency (%)
s 48
22.6%
e 30
14.2%
n 24
11.3%
i 24
11.3%
E 12
 
5.7%
g 12
 
5.7%
l 12
 
5.7%
h 12
 
5.7%
u 12
 
5.7%
B 12
 
5.7%
Other values (2) 14
 
6.6%
Common
ValueCountFrequency (%)
12
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 63966
99.7%
ASCII 224
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10311
16.1%
10178
15.9%
6658
 
10.4%
6426
 
10.0%
3894
 
6.1%
730
 
1.1%
631
 
1.0%
628
 
1.0%
621
 
1.0%
609
 
1.0%
Other values (250) 23280
36.4%
ASCII
ValueCountFrequency (%)
s 48
21.4%
e 30
13.4%
n 24
10.7%
i 24
10.7%
E 12
 
5.4%
g 12
 
5.4%
l 12
 
5.4%
h 12
 
5.4%
12
 
5.4%
u 12
 
5.4%
Other values (3) 26
11.6%

학교급명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
고등학교
6322 
중학교
3603 
방통고
 
52
방통중
 
23

Length

Max length4
Median length4
Mean length3.6322
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고등학교
2nd row중학교
3rd row고등학교
4th row고등학교
5th row고등학교

Common Values

ValueCountFrequency (%)
고등학교 6322
63.2%
중학교 3603
36.0%
방통고 52
 
0.5%
방통중 23
 
0.2%

Length

2023-12-11T06:43:58.554296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:43:58.661027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고등학교 6322
63.2%
중학교 3603
36.0%
방통고 52
 
0.5%
방통중 23
 
0.2%

설립구분명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공립
7833 
사립
2167 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공립
2nd row공립
3rd row공립
4th row공립
5th row공립

Common Values

ValueCountFrequency (%)
공립 7833
78.3%
사립 2167
 
21.7%

Length

2023-12-11T06:43:58.767305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:43:58.871163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공립 7833
78.3%
사립 2167
 
21.7%

제외여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
9993 
True
 
7
ValueCountFrequency (%)
False 9993
99.9%
True 7
 
0.1%
2023-12-11T06:43:58.947176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

제외사유
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9993 
본교는 해당항목에 대해서 통계자료가 없으므로 제외함.
 
7

Length

Max length29
Median length4
Mean length4.0175
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9993
99.9%
본교는 해당항목에 대해서 통계자료가 없으므로 제외함. 7
 
0.1%

Length

2023-12-11T06:43:59.056561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:43:59.168715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9993
99.6%
본교는 7
 
0.1%
해당항목에 7
 
0.1%
대해서 7
 
0.1%
통계자료가 7
 
0.1%
없으므로 7
 
0.1%
제외함 7
 
0.1%
Distinct78
Distinct (%)0.8%
Missing7
Missing (%)0.1%
Memory size156.2 KiB
2023-12-11T06:43:59.384726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length37
Mean length23.621135
Min length12

Characters and Unicode

Total characters236046
Distinct characters109
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.1%

Sample

1st row고등학교선택교육과정-교과-보통교과-기초-영어
2nd row공통교육과정-교과-국어
3rd row선택중심교육과정-전문교과-전문교과Ⅱ-건설
4th row고등학교선택교육과정-교과-보통교과-탐구-사회(역사/도덕포함)
5th row고등학교선택교육과정-교과-전문교과-상업정보에관한교과
ValueCountFrequency (%)
고등학교선택교육과정-교과-보통교과-탐구-사회(역사/도덕포함 1004
 
10.0%
고등학교선택교육과정-교과-보통교과-탐구-과학 777
 
7.7%
고등학교선택교육과정-교과-보통교과-생활교양-기술·가정/제2외국어/한문/교양 727
 
7.2%
공통교육과정-교과-사회(역사포함)/도덕 685
 
6.8%
공통교육과정-교과-선택 593
 
5.9%
고등학교선택교육과정-교과-보통교과-기초-수학 585
 
5.8%
고등학교선택교육과정-교과-보통교과-기초-국어 474
 
4.7%
공통교육과정-교과-과학/기술·가정 468
 
4.7%
고등학교선택교육과정-교과-보통교과-기초-영어 448
 
4.5%
공통교육과정-교과-예술(음악/미술 443
 
4.4%
Other values (71) 3842
38.2%
2023-12-11T06:43:59.810467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33227
 
14.1%
- 30881
 
13.1%
27477
 
11.6%
11952
 
5.1%
11649
 
4.9%
9048
 
3.8%
8004
 
3.4%
6943
 
2.9%
6943
 
2.9%
5862
 
2.5%
Other values (99) 84060
35.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 190882
80.9%
Dash Punctuation 30881
 
13.1%
Other Punctuation 7575
 
3.2%
Open Punctuation 2695
 
1.1%
Close Punctuation 2695
 
1.1%
Decimal Number 1227
 
0.5%
Space Separator 53
 
< 0.1%
Letter Number 38
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33227
17.4%
27477
14.4%
11952
 
6.3%
11649
 
6.1%
9048
 
4.7%
8004
 
4.2%
6943
 
3.6%
6943
 
3.6%
5862
 
3.1%
5486
 
2.9%
Other values (85) 64291
33.7%
Other Punctuation
ValueCountFrequency (%)
/ 5813
76.7%
· 1760
 
23.2%
2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 948
77.3%
0 186
 
15.2%
9 93
 
7.6%
Open Punctuation
ValueCountFrequency (%)
( 2556
94.8%
[ 139
 
5.2%
Close Punctuation
ValueCountFrequency (%)
) 2556
94.8%
] 139
 
5.2%
Letter Number
ValueCountFrequency (%)
32
84.2%
6
 
15.8%
Dash Punctuation
ValueCountFrequency (%)
- 30881
100.0%
Space Separator
ValueCountFrequency (%)
53
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 190882
80.9%
Common 45126
 
19.1%
Latin 38
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33227
17.4%
27477
14.4%
11952
 
6.3%
11649
 
6.1%
9048
 
4.7%
8004
 
4.2%
6943
 
3.6%
6943
 
3.6%
5862
 
3.1%
5486
 
2.9%
Other values (85) 64291
33.7%
Common
ValueCountFrequency (%)
- 30881
68.4%
/ 5813
 
12.9%
( 2556
 
5.7%
) 2556
 
5.7%
· 1760
 
3.9%
2 948
 
2.1%
0 186
 
0.4%
[ 139
 
0.3%
] 139
 
0.3%
9 93
 
0.2%
Other values (2) 55
 
0.1%
Latin
ValueCountFrequency (%)
32
84.2%
6
 
15.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 190882
80.9%
ASCII 43364
 
18.4%
None 1762
 
0.7%
Number Forms 38
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
33227
17.4%
27477
14.4%
11952
 
6.3%
11649
 
6.1%
9048
 
4.7%
8004
 
4.2%
6943
 
3.6%
6943
 
3.6%
5862
 
3.1%
5486
 
2.9%
Other values (85) 64291
33.7%
ASCII
ValueCountFrequency (%)
- 30881
71.2%
/ 5813
 
13.4%
( 2556
 
5.9%
) 2556
 
5.9%
2 948
 
2.2%
0 186
 
0.4%
[ 139
 
0.3%
] 139
 
0.3%
9 93
 
0.2%
53
 
0.1%
None
ValueCountFrequency (%)
· 1760
99.9%
2
 
0.1%
Number Forms
ValueCountFrequency (%)
32
84.2%
6
 
15.8%
Distinct652
Distinct (%)6.6%
Missing57
Missing (%)0.6%
Memory size156.2 KiB
2023-12-11T06:44:00.079259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length23
Mean length3.6049482
Min length2

Characters and Unicode

Total characters35844
Distinct characters338
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique348 ?
Unique (%)3.5%

Sample

1st row영어Ⅱ
2nd row국어
3rd row기초 제도
4th row사회·문화
5th row웹애니메이션
ValueCountFrequency (%)
수학 405
 
3.9%
기술·가정 385
 
3.7%
국어 381
 
3.7%
체육 365
 
3.5%
과학 358
 
3.4%
영어 355
 
3.4%
사회 327
 
3.1%
음악 289
 
2.8%
미술 272
 
2.6%
도덕 234
 
2.2%
Other values (701) 7029
67.6%
2023-12-11T06:44:00.429394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1967
 
5.5%
1861
 
5.2%
1640
 
4.6%
1321
 
3.7%
1177
 
3.3%
1072
 
3.0%
1054
 
2.9%
882
 
2.5%
872
 
2.4%
871
 
2.4%
Other values (328) 23127
64.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32603
91.0%
Letter Number 1688
 
4.7%
Other Punctuation 510
 
1.4%
Uppercase Letter 477
 
1.3%
Space Separator 457
 
1.3%
Lowercase Letter 81
 
0.2%
Decimal Number 10
 
< 0.1%
Open Punctuation 8
 
< 0.1%
Close Punctuation 8
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1967
 
6.0%
1861
 
5.7%
1640
 
5.0%
1177
 
3.6%
1072
 
3.3%
1054
 
3.2%
882
 
2.7%
872
 
2.7%
871
 
2.7%
856
 
2.6%
Other values (280) 20351
62.4%
Uppercase Letter
ValueCountFrequency (%)
I 415
87.0%
A 16
 
3.4%
P 10
 
2.1%
D 8
 
1.7%
C 5
 
1.0%
S 4
 
0.8%
T 3
 
0.6%
E 3
 
0.6%
B 3
 
0.6%
L 2
 
0.4%
Other values (7) 8
 
1.7%
Lowercase Letter
ValueCountFrequency (%)
i 11
13.6%
s 10
12.3%
t 9
11.1%
a 8
9.9%
r 8
9.9%
e 7
8.6%
c 5
6.2%
n 5
6.2%
l 4
 
4.9%
h 3
 
3.7%
Other values (7) 11
13.6%
Letter Number
ValueCountFrequency (%)
1321
78.3%
354
 
21.0%
10
 
0.6%
3
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 6
60.0%
3 3
30.0%
2 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
· 509
99.8%
1
 
0.2%
Space Separator
ValueCountFrequency (%)
457
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32603
91.0%
Latin 2246
 
6.3%
Common 995
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1967
 
6.0%
1861
 
5.7%
1640
 
5.0%
1177
 
3.6%
1072
 
3.3%
1054
 
3.2%
882
 
2.7%
872
 
2.7%
871
 
2.7%
856
 
2.6%
Other values (280) 20351
62.4%
Latin
ValueCountFrequency (%)
1321
58.8%
I 415
 
18.5%
354
 
15.8%
A 16
 
0.7%
i 11
 
0.5%
s 10
 
0.4%
10
 
0.4%
P 10
 
0.4%
t 9
 
0.4%
a 8
 
0.4%
Other values (28) 82
 
3.7%
Common
ValueCountFrequency (%)
· 509
51.2%
457
45.9%
( 8
 
0.8%
) 8
 
0.8%
1 6
 
0.6%
3 3
 
0.3%
1
 
0.1%
- 1
 
0.1%
_ 1
 
0.1%
2 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32598
90.9%
Number Forms 1688
 
4.7%
ASCII 1043
 
2.9%
None 510
 
1.4%
Compat Jamo 5
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1967
 
6.0%
1861
 
5.7%
1640
 
5.0%
1177
 
3.6%
1072
 
3.3%
1054
 
3.2%
882
 
2.7%
872
 
2.7%
871
 
2.7%
856
 
2.6%
Other values (279) 20346
62.4%
Number Forms
ValueCountFrequency (%)
1321
78.3%
354
 
21.0%
10
 
0.6%
3
 
0.2%
None
ValueCountFrequency (%)
· 509
99.8%
1
 
0.2%
ASCII
ValueCountFrequency (%)
457
43.8%
I 415
39.8%
A 16
 
1.5%
i 11
 
1.1%
s 10
 
1.0%
P 10
 
1.0%
t 9
 
0.9%
( 8
 
0.8%
) 8
 
0.8%
a 8
 
0.8%
Other values (32) 91
 
8.7%
Compat Jamo
ValueCountFrequency (%)
5
100.0%

남성교원수(명)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct8
Distinct (%)0.1%
Missing7
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean0.59991994
Minimum0
Maximum8
Zeros5436
Zeros (%)54.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:44:00.534888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile2
Maximum8
Range8
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.79066308
Coefficient of variation (CV)1.3179477
Kurtosis4.1902187
Mean0.59991994
Median Absolute Deviation (MAD)0
Skewness1.640223
Sum5995
Variance0.62514811
MonotonicityNot monotonic
2023-12-11T06:44:00.631409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0 5436
54.4%
1 3474
34.7%
2 827
 
8.3%
3 182
 
1.8%
4 53
 
0.5%
5 19
 
0.2%
8 1
 
< 0.1%
6 1
 
< 0.1%
(Missing) 7
 
0.1%
ValueCountFrequency (%)
0 5436
54.4%
1 3474
34.7%
2 827
 
8.3%
3 182
 
1.8%
4 53
 
0.5%
5 19
 
0.2%
6 1
 
< 0.1%
8 1
 
< 0.1%
ValueCountFrequency (%)
8 1
 
< 0.1%
6 1
 
< 0.1%
5 19
 
0.2%
4 53
 
0.5%
3 182
 
1.8%
2 827
 
8.3%
1 3474
34.7%
0 5436
54.4%

여성교원수(명)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct11
Distinct (%)0.1%
Missing7
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean1.1450015
Minimum0
Maximum10
Zeros2892
Zeros (%)28.9%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:44:00.736404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile3
Maximum10
Range10
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.14915
Coefficient of variation (CV)1.0036231
Kurtosis4.4397616
Mean1.1450015
Median Absolute Deviation (MAD)1
Skewness1.7132525
Sum11442
Variance1.3205457
MonotonicityNot monotonic
2023-12-11T06:44:00.831486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
1 4594
45.9%
0 2892
28.9%
2 1451
 
14.5%
3 588
 
5.9%
4 271
 
2.7%
5 119
 
1.2%
6 54
 
0.5%
7 17
 
0.2%
8 4
 
< 0.1%
9 2
 
< 0.1%
(Missing) 7
 
0.1%
ValueCountFrequency (%)
0 2892
28.9%
1 4594
45.9%
2 1451
 
14.5%
3 588
 
5.9%
4 271
 
2.7%
5 119
 
1.2%
6 54
 
0.5%
7 17
 
0.2%
8 4
 
< 0.1%
9 2
 
< 0.1%
ValueCountFrequency (%)
10 1
 
< 0.1%
9 2
 
< 0.1%
8 4
 
< 0.1%
7 17
 
0.2%
6 54
 
0.5%
5 119
 
1.2%
4 271
 
2.7%
3 588
 
5.9%
2 1451
 
14.5%
1 4594
45.9%

합계교원수(명)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct11
Distinct (%)0.1%
Missing7
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean1.7449214
Minimum0
Maximum12
Zeros225
Zeros (%)2.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:44:00.933460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q32
95-th percentile4
Maximum12
Range12
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.1931552
Coefficient of variation (CV)0.68378735
Kurtosis4.0974994
Mean1.7449214
Median Absolute Deviation (MAD)0
Skewness1.8093095
Sum17437
Variance1.4236194
MonotonicityNot monotonic
2023-12-11T06:44:01.046754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
1 5566
55.7%
2 2258
22.6%
3 1028
 
10.3%
4 541
 
5.4%
0 225
 
2.2%
5 219
 
2.2%
6 100
 
1.0%
7 43
 
0.4%
8 9
 
0.1%
9 3
 
< 0.1%
(Missing) 7
 
0.1%
ValueCountFrequency (%)
0 225
 
2.2%
1 5566
55.7%
2 2258
22.6%
3 1028
 
10.3%
4 541
 
5.4%
5 219
 
2.2%
6 100
 
1.0%
7 43
 
0.4%
8 9
 
0.1%
9 3
 
< 0.1%
ValueCountFrequency (%)
12 1
 
< 0.1%
9 3
 
< 0.1%
8 9
 
0.1%
7 43
 
0.4%
6 100
 
1.0%
5 219
 
2.2%
4 541
 
5.4%
3 1028
 
10.3%
2 2258
22.6%
1 5566
55.7%

Interactions

2023-12-11T06:43:56.474923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:43:55.940397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:43:56.216390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:43:56.564166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:43:56.031003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:43:56.308509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:43:56.643328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:43:56.124101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:43:56.394251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:44:01.337663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도시군명지역교육청명지역명학교급명설립구분명제외여부상위교과명남성교원수(명)여성교원수(명)합계교원수(명)
기준년도1.0000.4650.2790.4950.0750.0210.0200.5610.0690.1290.189
시군명0.4651.0000.9571.0000.2000.3570.2610.2490.1350.1560.149
지역교육청명0.2790.9571.0000.9630.8210.2930.0000.6970.1510.2780.245
지역명0.4951.0000.9631.0000.2600.4980.2760.3750.1780.2090.176
학교급명0.0750.2000.8210.2601.0000.2730.0160.8310.1620.2420.174
설립구분명0.0210.3570.2930.4980.2731.0000.0000.3000.3220.3480.070
제외여부0.0200.2610.0000.2760.0160.0001.000NaNNaNNaNNaN
상위교과명0.5610.2490.6970.3750.8310.300NaN1.0000.4360.5560.550
남성교원수(명)0.0690.1350.1510.1780.1620.322NaN0.4361.0000.4380.549
여성교원수(명)0.1290.1560.2780.2090.2420.348NaN0.5560.4381.0000.871
합계교원수(명)0.1890.1490.2450.1760.1740.070NaN0.5500.5490.8711.000
2023-12-11T06:44:01.453945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도지역명학교급명제외여부제외사유지역교육청명설립구분명시군명
기준년도1.0000.2660.0710.0331.0000.1480.0350.266
지역명0.2661.0000.1340.2201.0000.6080.3980.999
학교급명0.0710.1341.0000.0101.0000.5890.1820.105
제외여부0.0330.2200.0101.0001.0000.0000.0000.222
제외사유1.0001.0001.0001.0001.0001.0001.0001.000
지역교육청명0.1480.6080.5890.0001.0001.0000.2320.606
설립구분명0.0350.3980.1820.0001.0000.2321.0000.304
시군명0.2660.9990.1050.2221.0000.6060.3041.000
2023-12-11T06:44:01.556586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
남성교원수(명)여성교원수(명)합계교원수(명)기준년도시군명지역교육청명지역명학교급명설립구분명제외여부제외사유
남성교원수(명)1.000-0.4480.3360.0430.0530.0610.0680.0740.2421.0000.000
여성교원수(명)-0.4481.0000.6300.0680.0380.0910.0560.1210.1781.0000.000
합계교원수(명)0.3360.6301.0000.0830.0530.0890.0620.1110.0701.0000.000
기준년도0.0430.0680.0831.0000.2660.1480.2660.0710.0350.0331.000
시군명0.0530.0380.0530.2661.0000.6060.9990.1050.3040.2221.000
지역교육청명0.0610.0910.0890.1480.6061.0000.6080.5890.2320.0001.000
지역명0.0680.0560.0620.2660.9990.6081.0000.1340.3980.2201.000
학교급명0.0740.1210.1110.0710.1050.5890.1341.0000.1820.0101.000
설립구분명0.2420.1780.0700.0350.3040.2320.3980.1821.0000.0001.000
제외여부1.0001.0001.0000.0330.2220.0000.2200.0100.0001.0001.000
제외사유0.0000.0000.0001.0001.0001.0001.0001.0001.0001.0001.000

Missing values

2023-12-11T06:43:56.774088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:43:56.981916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T06:43:57.138175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유상위교과명과목명남성교원수(명)여성교원수(명)합계교원수(명)
20692018고양시경기도교육청경기도 고양시 일산서구대화고등학교고등학교공립N<NA>고등학교선택교육과정-교과-보통교과-기초-영어영어Ⅱ033
476702017양주시경기도동두천양주교육지원청경기도 양주시덕정중학교중학교공립N<NA>공통교육과정-교과-국어국어224
109072018성남시경기도교육청경기도 성남시 중원구성남테크노과학고등학교고등학교공립N<NA>선택중심교육과정-전문교과-전문교과Ⅱ-건설기초 제도202
76262018동두천시경기도교육청경기도 동두천시동두천외국어고등학교고등학교공립N<NA>고등학교선택교육과정-교과-보통교과-탐구-사회(역사/도덕포함)사회·문화011
616962016군포시경기도교육청경기도 군포시군포e비즈니스고등학교고등학교공립N<NA>고등학교선택교육과정-교과-전문교과-상업정보에관한교과웹애니메이션011
595302016고양시경기도고양교육지원청경기도 고양시 일산서구덕이중학교중학교공립N<NA>공통교육과정-교과-체육체육527
293582018화성시경기도교육청경기도 화성시삼괴고등학교고등학교사립N<NA>고등학교선택교육과정-교과-보통교과-체육예술-예술(음악/미술)미술문화101
118972018성남시경기도성남교육지원청경기도 성남시 중원구숭신여자중학교중학교사립N<NA>공통교육과정-교과-영어영어011
299782018화성시경기도화성오산교육지원청경기도 화성시푸른중학교중학교공립N<NA>공통교육과정-교과-예술(음악/미술)미술011
44852018구리시경기도구리남양주교육지원청경기도 구리시구리여자중학교중학교공립N<NA>공통교육과정-교과-수학수학022
기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유상위교과명과목명남성교원수(명)여성교원수(명)합계교원수(명)
181692018안성시경기도안성교육지원청경기도 안성시안성여자중학교중학교공립N<NA>공통교육과정-교과-영어영어112
185882018안양시경기도안양과천교육지원청경기도 안양시 동안구인덕원중학교중학교공립N<NA>공통교육과정-교과-예술(음악/미술)미술011
165162018안산시경기도안산교육지원청경기도 안산시 상록구성포중학교중학교공립N<NA>공통교육과정-교과-과학/기술·가정/정보과학011
613372016구리시경기도교육청경기도 구리시토평고등학교고등학교공립N<NA>고등학교선택교육과정-교과-보통교과-생활교양-기술·가정/제2외국어/한문/교양중국어Ⅱ011
117502018성남시경기도교육청경기도 성남시 분당구불곡고등학교고등학교공립N<NA>선택중심교육과정-보통교과-기초-국어국어033
540892017평택시경기도교육청경기도 평택시평택여자고등학교고등학교공립N<NA>고등학교선택교육과정-교과-보통교과-기초-국어고전044
194332018안양시경기도교육청경기도 안양시 만안구안양여자상업고등학교고등학교사립N<NA>선택중심교육과정-보통교과-기초-국어국어112
23522018고양시경기도교육청경기도 고양시 일산동구백신고등학교고등학교공립N<NA>고등학교선택교육과정-교과-보통교과-탐구-과학화학Ⅱ101
663172016성남시경기도교육청경기도 성남시 수정구복정고등학교고등학교공립N<NA>고등학교선택교육과정-교과-보통교과-기초-국어문학112
569732017화성시경기도교육청경기도 화성시반송고등학교고등학교공립N<NA>고등학교선택교육과정-교과-보통교과-탐구-사회(역사/도덕포함)한국사022

Duplicate rows

Most frequently occurring

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유상위교과명과목명남성교원수(명)여성교원수(명)합계교원수(명)# duplicates
02017하남시경기도교육청경기도 하남시한국애니메이션고등학교고등학교공립Y본교는 해당항목에 대해서 통계자료가 없으므로 제외함.<NA><NA><NA><NA><NA>7
12018고양시경기도고양교육지원청경기도 고양시 일산서구대송중학교중학교공립N<NA>공통교육과정-교과-과학/기술·가정기술·가정0112
22018구리시경기도구리남양주교육지원청경기도 구리시토평중학교중학교공립N<NA>공통교육과정-교과-사회(역사포함)/도덕사회0112
32018부천시경기도부천교육지원청경기도 부천시부인중학교중학교공립N<NA>공통교육과정-교과-예술(음악/미술)음악0112
42018성남시경기도성남교육지원청경기도 성남시 수정구창성중학교중학교공립N<NA>공통교육과정-교과-예술(음악/미술)미술0112
52018수원시경기도수원교육지원청경기도 수원시 영통구영덕중학교중학교공립N<NA>공통교육과정-교과-예술(음악/미술)음악0112
62018이천시경기도이천교육지원청경기도 이천시이천중학교중학교공립N<NA>공통교육과정-교과-사회(역사포함)/도덕사회0112
72018하남시경기도광주하남교육지원청경기도 하남시윤슬중학교중학교공립N<NA>공통교육과정-교과-사회(역사포함)/도덕도덕0112