Overview

Dataset statistics

Number of variables18
Number of observations10000
Missing cells120
Missing cells (%)0.1%
Duplicate rows516
Duplicate rows (%)5.2%
Total size in memory1.5 MiB
Average record size in memory162.0 B

Variable types

Categorical10
Text1
Boolean1
Numeric6

Alerts

Dataset has 516 (5.2%) duplicate rowsDuplicates
지역교육청명 is highly overall correlated with 시군명 and 5 other fieldsHigh correlation
설립구분명 is highly overall correlated with 일반직남자수(명) and 7 other fieldsHigh correlation
시군명 is highly overall correlated with 지역교육청명 and 4 other fieldsHigh correlation
기준년도 is highly overall correlated with 제외사유 and 3 other fieldsHigh correlation
제외사유 is highly overall correlated with 기준년도 and 4 other fieldsHigh correlation
지역명 is highly overall correlated with 시군명 and 5 other fieldsHigh correlation
별정직여자수(명) is highly overall correlated with 일반직남자수(명) and 14 other fieldsHigh correlation
학교급명 is highly overall correlated with 제외여부 and 4 other fieldsHigh correlation
제외여부 is highly overall correlated with 일반직남자수(명) and 10 other fieldsHigh correlation
별정직남자수(명) is highly overall correlated with 일반직남자수(명) and 14 other fieldsHigh correlation
별정직합계수(명) is highly overall correlated with 일반직남자수(명) and 14 other fieldsHigh correlation
일반직남자수(명) is highly overall correlated with 일반직합계수(명) and 5 other fieldsHigh correlation
일반직여자수(명) is highly overall correlated with 일반직합계수(명) and 7 other fieldsHigh correlation
일반직합계수(명) is highly overall correlated with 일반직남자수(명) and 6 other fieldsHigh correlation
기타직남자수(명) is highly overall correlated with 제외여부 and 3 other fieldsHigh correlation
기타직여자수(명) is highly overall correlated with 일반직여자수(명) and 5 other fieldsHigh correlation
기타직합계(명) is highly overall correlated with 일반직여자수(명) and 5 other fieldsHigh correlation
설립구분명 is highly imbalanced (69.7%)Imbalance
제외여부 is highly imbalanced (97.9%)Imbalance
제외사유 is highly imbalanced (99.2%)Imbalance
일반직남자수(명) has 856 (8.6%) zerosZeros
일반직여자수(명) has 728 (7.3%) zerosZeros
기타직남자수(명) has 7784 (77.8%) zerosZeros
기타직여자수(명) has 182 (1.8%) zerosZeros
기타직합계(명) has 166 (1.7%) zerosZeros

Reproduction

Analysis started2023-12-10 23:07:54.985923
Analysis finished2023-12-10 23:08:01.156775
Duration6.17 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년도
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2019
2233 
2017
2221 
2016
2204 
2018
2191 
2020
1151 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2020
3rd row2016
4th row2020
5th row2018

Common Values

ValueCountFrequency (%)
2019 2233
22.3%
2017 2221
22.2%
2016 2204
22.0%
2018 2191
21.9%
2020 1151
11.5%

Length

2023-12-11T08:08:01.243437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:08:01.328804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 2233
22.3%
2017 2221
22.2%
2016 2204
22.0%
2018 2191
21.9%
2020 1151
11.5%

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수원시
827 
용인시
789 
고양시
694 
성남시
654 
화성시
 
612
Other values (26)
6424 

Length

Max length4
Median length3
Mean length3.0891
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의왕시
2nd row안산시
3rd row하남시
4th row포천시
5th row오산시

Common Values

ValueCountFrequency (%)
수원시 827
 
8.3%
용인시 789
 
7.9%
고양시 694
 
6.9%
성남시 654
 
6.5%
화성시 612
 
6.1%
부천시 536
 
5.4%
남양주시 497
 
5.0%
안산시 442
 
4.4%
평택시 427
 
4.3%
파주시 427
 
4.3%
Other values (21) 4095
40.9%

Length

2023-12-11T08:08:01.426588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수원시 827
 
8.3%
용인시 789
 
7.9%
고양시 694
 
6.9%
성남시 654
 
6.5%
화성시 612
 
6.1%
부천시 536
 
5.4%
남양주시 497
 
5.0%
안산시 442
 
4.4%
평택시 427
 
4.3%
파주시 427
 
4.3%
Other values (21) 4095
40.9%

지역교육청명
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도교육청
2156 
경기도용인교육지원청
657 
경기도화성오산교육지원청
625 
경기도수원교육지원청
617 
경기도고양교육지원청
 
534
Other values (22)
5411 

Length

Max length13
Median length10
Mean length9.6634
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도군포의왕교육지원청
2nd row경기도안산교육지원청
3rd row경기도광주하남교육지원청
4th row경기도포천교육지원청
5th row경기도교육청

Common Values

ValueCountFrequency (%)
경기도교육청 2156
21.6%
경기도용인교육지원청 657
 
6.6%
경기도화성오산교육지원청 625
 
6.2%
경기도수원교육지원청 617
 
6.2%
경기도고양교육지원청 534
 
5.3%
경기도구리남양주교육지원청 496
 
5.0%
경기도성남교육지원청 487
 
4.9%
경기도부천교육지원청 400
 
4.0%
경기도파주교육지원청 337
 
3.4%
경기도안산교육지원청 335
 
3.4%
Other values (17) 3356
33.6%

Length

2023-12-11T08:08:01.538605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도교육청 2156
21.6%
경기도용인교육지원청 657
 
6.6%
경기도화성오산교육지원청 625
 
6.2%
경기도수원교육지원청 617
 
6.2%
경기도고양교육지원청 534
 
5.3%
경기도구리남양주교육지원청 496
 
5.0%
경기도성남교육지원청 487
 
4.9%
경기도부천교육지원청 400
 
4.0%
경기도파주교육지원청 337
 
3.4%
경기도안산교육지원청 335
 
3.4%
Other values (17) 3356
33.6%

지역명
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도 화성시
 
612
경기도 부천시
 
536
경기도 남양주시
 
497
경기도 파주시
 
427
경기도 평택시
 
427
Other values (37)
7501 

Length

Max length12
Median length7
Mean length8.6321
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 의왕시
2nd row경기도 안산시 단원구
3rd row경기도 하남시
4th row경기도 포천시
5th row경기도 오산시

Common Values

ValueCountFrequency (%)
경기도 화성시 612
 
6.1%
경기도 부천시 536
 
5.4%
경기도 남양주시 497
 
5.0%
경기도 파주시 427
 
4.3%
경기도 평택시 427
 
4.3%
경기도 성남시 분당구 376
 
3.8%
경기도 시흥시 337
 
3.4%
경기도 김포시 322
 
3.2%
경기도 용인시 기흥구 315
 
3.1%
경기도 고양시 덕양구 296
 
3.0%
Other values (32) 5855
58.6%

Length

2023-12-11T08:08:01.687730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 10000
42.1%
수원시 827
 
3.5%
용인시 789
 
3.3%
고양시 694
 
2.9%
성남시 654
 
2.8%
화성시 612
 
2.6%
부천시 536
 
2.3%
남양주시 497
 
2.1%
안산시 442
 
1.9%
평택시 427
 
1.8%
Other values (39) 8280
34.9%
Distinct2475
Distinct (%)24.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T08:08:01.901055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length6
Mean length6.2643
Min length4

Characters and Unicode

Total characters62643
Distinct characters335
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique121 ?
Unique (%)1.2%

Sample

1st row고천중학교
2nd row송호초등학교
3rd row은가람중학교
4th row신봉초등학교
5th row성호고등학교
ValueCountFrequency (%)
석천초등학교 11
 
0.1%
상원초등학교 10
 
0.1%
용인백현중학교 8
 
0.1%
언동초등학교 8
 
0.1%
덕성초등학교 8
 
0.1%
서해초등학교 8
 
0.1%
여강고등학교 8
 
0.1%
신곡중학교 8
 
0.1%
천천중학교 8
 
0.1%
송림고등학교 8
 
0.1%
Other values (2466) 9919
99.2%
2023-12-11T08:08:02.221735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10218
16.3%
10150
16.2%
7335
 
11.7%
5339
 
8.5%
2743
 
4.4%
2240
 
3.6%
649
 
1.0%
619
 
1.0%
604
 
1.0%
604
 
1.0%
Other values (325) 22142
35.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62569
99.9%
Lowercase Letter 56
 
0.1%
Uppercase Letter 14
 
< 0.1%
Space Separator 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10218
16.3%
10150
16.2%
7335
 
11.7%
5339
 
8.5%
2743
 
4.4%
2240
 
3.6%
649
 
1.0%
619
 
1.0%
604
 
1.0%
604
 
1.0%
Other values (312) 22068
35.3%
Lowercase Letter
ValueCountFrequency (%)
s 16
28.6%
n 8
14.3%
i 8
14.3%
e 8
14.3%
u 4
 
7.1%
h 4
 
7.1%
g 4
 
7.1%
l 4
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
B 4
28.6%
E 4
28.6%
I 3
21.4%
T 3
21.4%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62569
99.9%
Latin 70
 
0.1%
Common 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10218
16.3%
10150
16.2%
7335
 
11.7%
5339
 
8.5%
2743
 
4.4%
2240
 
3.6%
649
 
1.0%
619
 
1.0%
604
 
1.0%
604
 
1.0%
Other values (312) 22068
35.3%
Latin
ValueCountFrequency (%)
s 16
22.9%
n 8
11.4%
i 8
11.4%
e 8
11.4%
u 4
 
5.7%
B 4
 
5.7%
h 4
 
5.7%
E 4
 
5.7%
g 4
 
5.7%
l 4
 
5.7%
Other values (2) 6
 
8.6%
Common
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62569
99.9%
ASCII 74
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10218
16.3%
10150
16.2%
7335
 
11.7%
5339
 
8.5%
2743
 
4.4%
2240
 
3.6%
649
 
1.0%
619
 
1.0%
604
 
1.0%
604
 
1.0%
Other values (312) 22068
35.3%
ASCII
ValueCountFrequency (%)
s 16
21.6%
n 8
10.8%
i 8
10.8%
e 8
10.8%
u 4
 
5.4%
B 4
 
5.4%
h 4
 
5.4%
E 4
 
5.4%
g 4
 
5.4%
l 4
 
5.4%
Other values (3) 10
13.5%

학교급명
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
6740 
초등학교
1702 
중학교
837 
고등학교
 
663
특수학교
 
45
Other values (2)
 
13

Length

Max length4
Median length4
Mean length3.915
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 6740
67.4%
초등학교 1702
 
17.0%
중학교 837
 
8.4%
고등학교 663
 
6.6%
특수학교 45
 
0.4%
방통고 7
 
0.1%
방통중 6
 
0.1%

Length

2023-12-11T08:08:02.354071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:08:02.448221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 6740
67.4%
초등학교 1702
 
17.0%
중학교 837
 
8.4%
고등학교 663
 
6.6%
특수학교 45
 
0.4%
방통고 7
 
0.1%
방통중 6
 
0.1%

설립구분명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공립
8981 
사립
1012 
국립
 
7

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공립
2nd row공립
3rd row공립
4th row공립
5th row공립

Common Values

ValueCountFrequency (%)
공립 8981
89.8%
사립 1012
 
10.1%
국립 7
 
0.1%

Length

2023-12-11T08:08:02.542130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:08:02.616427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공립 8981
89.8%
사립 1012
 
10.1%
국립 7
 
0.1%

제외여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
9980 
True
 
20
ValueCountFrequency (%)
False 9980
99.8%
True 20
 
0.2%
2023-12-11T08:08:02.694071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

제외사유
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9980 
본교는 해당항목에 대해서 통계자료가 없으므로 제외함.
 
8
방송통신고등학교로 본교에서 출석수업일에 근무를 지원하고 있음.
 
3
수원여자고등학교부설방송통신고등학교로 본교에서 정보공시
 
2
수성고등학교와 겸임직원으로 배정되어 수성고등학교에서 공시함
 
2
Other values (4)
 
5

Length

Max length34
Median length4
Mean length4.049
Min length4

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9980
99.8%
본교는 해당항목에 대해서 통계자료가 없으므로 제외함. 8
 
0.1%
방송통신고등학교로 본교에서 출석수업일에 근무를 지원하고 있음. 3
 
< 0.1%
수원여자고등학교부설방송통신고등학교로 본교에서 정보공시 2
 
< 0.1%
수성고등학교와 겸임직원으로 배정되어 수성고등학교에서 공시함 2
 
< 0.1%
본교행정실 직원 겸임으로 본교에서 정보공시 2
 
< 0.1%
서현고등학교 정보공시로 대체함 1
 
< 0.1%
서현고등학교(본교) 정보공시로 대체함. 1
 
< 0.1%
수성고등학교 사무직원이 겸임하므로 수성고등학교에서 공시함 1
 
< 0.1%

Length

2023-12-11T08:08:02.799802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:08:02.972477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9980
99.0%
해당항목에 8
 
0.1%
대해서 8
 
0.1%
통계자료가 8
 
0.1%
없으므로 8
 
0.1%
제외함 8
 
0.1%
본교는 8
 
0.1%
본교에서 7
 
0.1%
정보공시 4
 
< 0.1%
있음 3
 
< 0.1%
Other values (20) 41
 
0.4%

일반직남자수(명)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct18
Distinct (%)0.2%
Missing20
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean1.9145291
Minimum0
Maximum19
Zeros856
Zeros (%)8.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:08:03.121492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q32
95-th percentile4
Maximum19
Range19
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.4646208
Coefficient of variation (CV)0.76500317
Kurtosis16.036287
Mean1.9145291
Median Absolute Deviation (MAD)1
Skewness2.6954414
Sum19107
Variance2.1451141
MonotonicityNot monotonic
2023-12-11T08:08:03.290123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
1 3550
35.5%
2 3178
31.8%
3 1428
14.3%
0 856
 
8.6%
4 557
 
5.6%
5 207
 
2.1%
6 76
 
0.8%
8 41
 
0.4%
7 25
 
0.2%
9 23
 
0.2%
Other values (8) 39
 
0.4%
(Missing) 20
 
0.2%
ValueCountFrequency (%)
0 856
 
8.6%
1 3550
35.5%
2 3178
31.8%
3 1428
14.3%
4 557
 
5.6%
5 207
 
2.1%
6 76
 
0.8%
7 25
 
0.2%
8 41
 
0.4%
9 23
 
0.2%
ValueCountFrequency (%)
19 1
 
< 0.1%
17 4
 
< 0.1%
16 2
 
< 0.1%
15 1
 
< 0.1%
13 8
 
0.1%
12 6
 
0.1%
11 8
 
0.1%
10 9
 
0.1%
9 23
0.2%
8 41
0.4%

일반직여자수(명)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct14
Distinct (%)0.1%
Missing20
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean2.2851703
Minimum0
Maximum21
Zeros728
Zeros (%)7.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:08:03.452908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q33
95-th percentile4
Maximum21
Range21
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.4067681
Coefficient of variation (CV)0.61560753
Kurtosis20.537229
Mean2.2851703
Median Absolute Deviation (MAD)1
Skewness2.1270838
Sum22806
Variance1.9789964
MonotonicityNot monotonic
2023-12-11T08:08:03.580937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
2 3249
32.5%
3 2392
23.9%
1 2048
20.5%
4 1107
 
11.1%
0 728
 
7.3%
5 309
 
3.1%
6 79
 
0.8%
7 26
 
0.3%
9 14
 
0.1%
8 14
 
0.1%
Other values (4) 14
 
0.1%
(Missing) 20
 
0.2%
ValueCountFrequency (%)
0 728
 
7.3%
1 2048
20.5%
2 3249
32.5%
3 2392
23.9%
4 1107
 
11.1%
5 309
 
3.1%
6 79
 
0.8%
7 26
 
0.3%
8 14
 
0.1%
9 14
 
0.1%
ValueCountFrequency (%)
21 3
 
< 0.1%
20 4
 
< 0.1%
11 4
 
< 0.1%
10 3
 
< 0.1%
9 14
 
0.1%
8 14
 
0.1%
7 26
 
0.3%
6 79
 
0.8%
5 309
 
3.1%
4 1107
11.1%

일반직합계수(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct27
Distinct (%)0.3%
Missing20
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean4.1996994
Minimum0
Maximum37
Zeros46
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:08:03.706131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q13
median4
Q35
95-th percentile7
Maximum37
Range37
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.0886505
Coefficient of variation (CV)0.49733334
Kurtosis46.352385
Mean4.1996994
Median Absolute Deviation (MAD)1
Skewness4.3363765
Sum41913
Variance4.3624611
MonotonicityNot monotonic
2023-12-11T08:08:03.834842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
4 2638
26.4%
3 2587
25.9%
5 1800
18.0%
6 1040
 
10.4%
2 1017
 
10.2%
7 315
 
3.1%
1 184
 
1.8%
8 116
 
1.2%
9 63
 
0.6%
0 46
 
0.5%
Other values (17) 174
 
1.7%
ValueCountFrequency (%)
0 46
 
0.5%
1 184
 
1.8%
2 1017
 
10.2%
3 2587
25.9%
4 2638
26.4%
5 1800
18.0%
6 1040
 
10.4%
7 315
 
3.1%
8 116
 
1.2%
9 63
 
0.6%
ValueCountFrequency (%)
37 4
 
< 0.1%
36 1
 
< 0.1%
34 1
 
< 0.1%
33 1
 
< 0.1%
23 1
 
< 0.1%
21 2
 
< 0.1%
20 6
0.1%
19 8
0.1%
18 3
 
< 0.1%
17 11
0.1%

별정직남자수(명)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
6606 
<NA>
3394 

Length

Max length4
Median length1
Mean length2.0182
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row0
4th row<NA>
5th row0

Common Values

ValueCountFrequency (%)
0 6606
66.1%
<NA> 3394
33.9%

Length

2023-12-11T08:08:03.992089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:08:04.093795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 6606
66.1%
na 3394
33.9%

별정직여자수(명)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
6606 
<NA>
3394 

Length

Max length4
Median length1
Mean length2.0182
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row0
4th row<NA>
5th row0

Common Values

ValueCountFrequency (%)
0 6606
66.1%
<NA> 3394
33.9%

Length

2023-12-11T08:08:04.199621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:08:04.293233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 6606
66.1%
na 3394
33.9%

별정직합계수(명)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
6606 
<NA>
3394 

Length

Max length4
Median length1
Mean length2.0182
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row0
4th row<NA>
5th row0

Common Values

ValueCountFrequency (%)
0 6606
66.1%
<NA> 3394
33.9%

Length

2023-12-11T08:08:04.423719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:08:04.530493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 6606
66.1%
na 3394
33.9%

기타직남자수(명)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct14
Distinct (%)0.1%
Missing20
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean0.3240481
Minimum0
Maximum14
Zeros7784
Zeros (%)77.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:08:04.618402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum14
Range14
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.86020739
Coefficient of variation (CV)2.654567
Kurtosis56.656074
Mean0.3240481
Median Absolute Deviation (MAD)0
Skewness5.9522761
Sum3234
Variance0.73995675
MonotonicityNot monotonic
2023-12-11T08:08:04.727852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
0 7784
77.8%
1 1691
 
16.9%
2 325
 
3.2%
3 72
 
0.7%
5 35
 
0.4%
6 20
 
0.2%
4 17
 
0.2%
7 17
 
0.2%
9 6
 
0.1%
8 5
 
0.1%
Other values (4) 8
 
0.1%
(Missing) 20
 
0.2%
ValueCountFrequency (%)
0 7784
77.8%
1 1691
 
16.9%
2 325
 
3.2%
3 72
 
0.7%
4 17
 
0.2%
5 35
 
0.4%
6 20
 
0.2%
7 17
 
0.2%
8 5
 
0.1%
9 6
 
0.1%
ValueCountFrequency (%)
14 2
 
< 0.1%
13 3
 
< 0.1%
12 2
 
< 0.1%
10 1
 
< 0.1%
9 6
 
0.1%
8 5
 
0.1%
7 17
0.2%
6 20
0.2%
5 35
0.4%
4 17
0.2%

기타직여자수(명)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct36
Distinct (%)0.4%
Missing20
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean11.58497
Minimum0
Maximum35
Zeros182
Zeros (%)1.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:08:04.868087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q19
median12
Q315
95-th percentile19
Maximum35
Range35
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.8800724
Coefficient of variation (CV)0.42124169
Kurtosis0.15433419
Mean11.58497
Median Absolute Deviation (MAD)3
Skewness-0.21643213
Sum115618
Variance23.815106
MonotonicityNot monotonic
2023-12-11T08:08:05.003798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
12 969
 
9.7%
14 894
 
8.9%
13 869
 
8.7%
15 813
 
8.1%
11 762
 
7.6%
16 632
 
6.3%
10 616
 
6.2%
9 552
 
5.5%
17 479
 
4.8%
7 459
 
4.6%
Other values (26) 2935
29.3%
ValueCountFrequency (%)
0 182
 
1.8%
1 157
 
1.6%
2 195
 
1.9%
3 180
 
1.8%
4 201
 
2.0%
5 304
3.0%
6 354
3.5%
7 459
4.6%
8 458
4.6%
9 552
5.5%
ValueCountFrequency (%)
35 1
 
< 0.1%
34 1
 
< 0.1%
33 1
 
< 0.1%
32 2
 
< 0.1%
31 1
 
< 0.1%
30 3
 
< 0.1%
29 3
 
< 0.1%
28 2
 
< 0.1%
27 12
0.1%
26 9
0.1%

기타직합계(명)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct40
Distinct (%)0.4%
Missing20
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean11.909018
Minimum0
Maximum42
Zeros166
Zeros (%)1.7%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:08:05.146704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q19
median12
Q315
95-th percentile19
Maximum42
Range42
Interquartile range (IQR)6

Descriptive statistics

Standard deviation5.0838385
Coefficient of variation (CV)0.42688982
Kurtosis1.0430085
Mean11.909018
Median Absolute Deviation (MAD)3
Skewness0.049946233
Sum118852
Variance25.845414
MonotonicityNot monotonic
2023-12-11T08:08:05.270901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
12 899
 
9.0%
14 878
 
8.8%
13 877
 
8.8%
15 784
 
7.8%
11 747
 
7.5%
16 674
 
6.7%
10 614
 
6.1%
9 540
 
5.4%
17 511
 
5.1%
7 440
 
4.4%
Other values (30) 3016
30.2%
ValueCountFrequency (%)
0 166
 
1.7%
1 139
 
1.4%
2 161
 
1.6%
3 192
 
1.9%
4 216
 
2.2%
5 278
2.8%
6 335
3.4%
7 440
4.4%
8 436
4.4%
9 540
5.4%
ValueCountFrequency (%)
42 1
 
< 0.1%
41 2
 
< 0.1%
40 1
 
< 0.1%
39 1
 
< 0.1%
36 3
 
< 0.1%
35 4
 
< 0.1%
34 4
 
< 0.1%
33 5
0.1%
32 11
0.1%
31 3
 
< 0.1%

Interactions

2023-12-11T08:07:59.869130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:57.036399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:57.539258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:58.086843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:58.626544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:59.266035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:59.948268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:57.115644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:57.624092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:58.188716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:58.731712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:59.380871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:08:00.025220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:57.188124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:57.700721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:58.266970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:58.861979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:59.464621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:08:00.121665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:57.269494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:57.795760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:58.358573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:58.963754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:59.563631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:08:00.203724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:57.353409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:57.909018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:58.446625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:59.074471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:59.670725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:08:00.288924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:57.438821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:58.011699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:58.542563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:59.169112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:07:59.782402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:08:05.365315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도시군명지역교육청명지역명학교급명설립구분명제외여부제외사유일반직남자수(명)일반직여자수(명)일반직합계수(명)기타직남자수(명)기타직여자수(명)기타직합계(명)
기준년도1.0000.0000.0000.0000.0000.0000.0100.7700.1280.0000.0280.0400.1300.142
시군명0.0001.0000.9921.0000.1420.2760.0790.6570.2240.2560.2690.1810.3920.400
지역교육청명0.0000.9921.0000.9930.7570.9380.0630.0000.6520.7430.7840.2270.5430.434
지역명0.0001.0000.9931.0000.1940.3780.1560.8070.2980.3340.3250.2190.4250.433
학교급명0.0000.1420.7570.1941.0000.6340.7581.0000.4950.3080.4500.2650.3750.385
설립구분명0.0000.2760.9380.3780.6341.0000.003NaN0.7480.7760.9430.1600.6420.491
제외여부0.0100.0790.0630.1560.7580.0031.000NaNNaNNaNNaNNaNNaNNaN
제외사유0.7700.6570.0000.8071.000NaNNaN1.000NaNNaNNaNNaNNaNNaN
일반직남자수(명)0.1280.2240.6520.2980.4950.748NaNNaN1.0000.6270.8520.4010.5590.423
일반직여자수(명)0.0000.2560.7430.3340.3080.776NaNNaN0.6271.0000.7750.3550.4980.387
일반직합계수(명)0.0280.2690.7840.3250.4500.943NaNNaN0.8520.7751.0000.2940.5090.402
기타직남자수(명)0.0400.1810.2270.2190.2650.160NaNNaN0.4010.3550.2941.0000.5180.773
기타직여자수(명)0.1300.3920.5430.4250.3750.642NaNNaN0.5590.4980.5090.5181.0000.962
기타직합계(명)0.1420.4000.4340.4330.3850.491NaNNaN0.4230.3870.4020.7730.9621.000
2023-12-11T08:08:05.526090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역교육청명설립구분명시군명기준년도제외사유지역명별정직여자수(명)학교급명제외여부별정직남자수(명)별정직합계수(명)
지역교육청명1.0000.7500.8540.0000.0000.8551.0000.4540.0541.0001.000
설립구분명0.7501.0000.1440.0001.0000.1901.0000.3280.0051.0001.000
시군명0.8540.1441.0000.0000.4080.9991.0000.0620.0671.0001.000
기준년도0.0000.0000.0001.0000.5370.0001.0000.0000.0131.0001.000
제외사유0.0001.0000.4080.5371.0000.561NaN1.0001.000NaNNaN
지역명0.8550.1900.9990.0000.5611.0001.0000.0790.1241.0001.000
별정직여자수(명)1.0001.0001.0001.000NaN1.0001.0001.0001.0001.0001.000
학교급명0.4540.3280.0620.0001.0000.0791.0001.0000.5651.0001.000
제외여부0.0540.0050.0670.0131.0000.1241.0000.5651.0001.0001.000
별정직남자수(명)1.0001.0001.0001.000NaN1.0001.0001.0001.0001.0001.000
별정직합계수(명)1.0001.0001.0001.000NaN1.0001.0001.0001.0001.0001.000
2023-12-11T08:08:05.700591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일반직남자수(명)일반직여자수(명)일반직합계수(명)기타직남자수(명)기타직여자수(명)기타직합계(명)기준년도시군명지역교육청명지역명학교급명설립구분명제외여부제외사유별정직남자수(명)별정직여자수(명)별정직합계수(명)
일반직남자수(명)1.000-0.2070.5520.1650.0580.0780.0540.0800.3000.1070.3020.6191.0000.0001.0001.0001.000
일반직여자수(명)-0.2071.0000.6500.0530.5100.5060.0000.1090.4240.1320.1890.7181.0000.0001.0001.0001.000
일반직합계수(명)0.5520.6501.0000.1690.4780.4900.0160.1030.3890.1230.2690.7191.0000.0001.0001.0001.000
기타직남자수(명)0.1650.0530.1691.0000.0830.1920.0180.0600.0840.0740.1520.0971.0000.0001.0001.0001.000
기타직여자수(명)0.0580.5100.4780.0831.0000.9900.0580.1500.2270.1620.1980.4821.0000.0001.0001.0001.000
기타직합계(명)0.0780.5060.4900.1920.9901.0000.0610.1510.1680.1620.2140.3341.0000.0001.0001.0001.000
기준년도0.0540.0000.0160.0180.0580.0611.0000.0000.0000.0000.0000.0000.0130.5371.0001.0001.000
시군명0.0800.1090.1030.0600.1500.1510.0001.0000.8540.9990.0620.1440.0670.4081.0001.0001.000
지역교육청명0.3000.4240.3890.0840.2270.1680.0000.8541.0000.8550.4540.7500.0540.0001.0001.0001.000
지역명0.1070.1320.1230.0740.1620.1620.0000.9990.8551.0000.0790.1900.1240.5611.0001.0001.000
학교급명0.3020.1890.2690.1520.1980.2140.0000.0620.4540.0791.0000.3280.5651.0001.0001.0001.000
설립구분명0.6190.7180.7190.0970.4820.3340.0000.1440.7500.1900.3281.0000.0051.0001.0001.0001.000
제외여부1.0001.0001.0001.0001.0001.0000.0130.0670.0540.1240.5650.0051.0001.0001.0001.0001.000
제외사유0.0000.0000.0000.0000.0000.0000.5370.4080.0000.5611.0001.0001.0001.0000.0000.0000.000
별정직남자수(명)1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0001.0001.0001.000
별정직여자수(명)1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0001.0001.0001.000
별정직합계수(명)1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.0001.0001.0001.000

Missing values

2023-12-11T08:08:00.403674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:08:00.837478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T08:08:01.035891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유일반직남자수(명)일반직여자수(명)일반직합계수(명)별정직남자수(명)별정직여자수(명)별정직합계수(명)기타직남자수(명)기타직여자수(명)기타직합계(명)
158282017의왕시경기도군포의왕교육지원청경기도 의왕시고천중학교<NA>공립N<NA>123000189
12682020안산시경기도안산교육지원청경기도 안산시 단원구송호초등학교<NA>공립N<NA>145<NA><NA><NA>01515
214882016하남시경기도광주하남교육지원청경기도 하남시은가람중학교<NA>공립N<NA>224000033
22592020포천시경기도포천교육지원청경기도 포천시신봉초등학교<NA>공립N<NA>123<NA><NA><NA>01313
105892018오산시경기도교육청경기도 오산시성호고등학교<NA>공립N<NA>32500011415
200502016여주시경기도여주교육지원청경기도 여주시이포초등학교<NA>공립N<NA>202000066
93012018수원시경기도수원교육지원청경기도 수원시 영통구광교초등학교초등학교공립N<NA>04400001616
138932017성남시경기도성남교육지원청경기도 성남시 수정구위례중앙초등학교초등학교공립N<NA>000000066
181612016동두천시경기도동두천양주교육지원청경기도 동두천시보산초등학교<NA>공립N<NA>123000088
141262017수원시경기도수원교육지원청경기도 수원시 권선구권선초등학교초등학교공립N<NA>14500001414
기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유일반직남자수(명)일반직여자수(명)일반직합계수(명)별정직남자수(명)별정직여자수(명)별정직합계수(명)기타직남자수(명)기타직여자수(명)기타직합계(명)
122462017가평군경기도가평교육지원청경기도 가평군청심국제중학교중학교사립N<NA>325000415
137362017성남시경기도성남교육지원청경기도 성남시 분당구오리초등학교<NA>공립N<NA>03300001010
103042018양주시경기도동두천양주교육지원청경기도 양주시덕계초등학교초등학교공립N<NA>12300001111
10022020수원시경기도수원교육지원청경기도 수원시 권선구오목초등학교<NA>공립N<NA>224<NA><NA><NA>01414
9952020수원시경기도수원교육지원청경기도 수원시 권선구안룡초등학교<NA>공립N<NA>123<NA><NA><NA>01010
16212020오산시경기도화성오산교육지원청경기도 오산시세미초등학교<NA>공립N<NA>134<NA><NA><NA>01919
204132016용인시경기도용인교육지원청경기도 용인시 수지구성복중학교<NA>공립N<NA>13400001313
198672016양주시경기도동두천양주교육지원청경기도 양주시양주덕현초등학교초등학교공립N<NA>15600001616
145632017안산시경기도안산교육지원청경기도 안산시 상록구이호초등학교<NA>공립N<NA>32500011516
12062020안산시경기도안산교육지원청경기도 안산시 단원구신길초등학교<NA>공립N<NA>134<NA><NA><NA>11415

Duplicate rows

Most frequently occurring

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유일반직남자수(명)일반직여자수(명)일반직합계수(명)별정직남자수(명)별정직여자수(명)별정직합계수(명)기타직남자수(명)기타직여자수(명)기타직합계(명)# duplicates
02019가평군경기도가평교육지원청경기도 가평군목동초등학교명지분교장<NA>공립N<NA>101<NA><NA><NA>0222
12019가평군경기도가평교육지원청경기도 가평군미원초등학교<NA>공립N<NA>314<NA><NA><NA>015152
22019가평군경기도가평교육지원청경기도 가평군상면초등학교<NA>공립N<NA>123<NA><NA><NA>0882
32019가평군경기도가평교육지원청경기도 가평군상색초등학교<NA>공립N<NA>314<NA><NA><NA>0882
42019가평군경기도가평교육지원청경기도 가평군연하초등학교<NA>공립N<NA>123<NA><NA><NA>1892
52019가평군경기도가평교육지원청경기도 가평군율길초등학교<NA>공립N<NA>011<NA><NA><NA>2572
62019가평군경기도가평교육지원청경기도 가평군조종중학교<NA>공립N<NA>303<NA><NA><NA>1232
72019가평군경기도교육청경기도 가평군설악고등학교<NA>공립N<NA>112<NA><NA><NA>37102
82019고양시경기도고양교육지원청경기도 고양시 덕양구도래울중학교<NA>공립N<NA>134<NA><NA><NA>011112
92019고양시경기도고양교육지원청경기도 고양시 덕양구서정중학교<NA>공립N<NA>112<NA><NA><NA>012122