Overview

Dataset statistics

Number of variables15
Number of observations10000
Missing cells26005
Missing cells (%)17.3%
Duplicate rows343
Duplicate rows (%)3.4%
Total size in memory1.3 MiB
Average record size in memory134.0 B

Variable types

Numeric5
Categorical8
Text1
Boolean1

Dataset

Description학생교육활동에 필요한 지원시설 현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=HJSP6IWYVEIXBR6MSCW023292516&infSeq=2

Alerts

Dataset has 343 (3.4%) duplicate rowsDuplicates
시군명 is highly overall correlated with 지역교육청명 and 1 other fieldsHigh correlation
수영장수(개) is highly overall correlated with 체육관수(개) and 4 other fieldsHigh correlation
지역명 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
제외사유 is highly overall correlated with 학교급명 and 2 other fieldsHigh correlation
제외여부 is highly overall correlated with 체육관수(개) and 5 other fieldsHigh correlation
체육관수(개) is highly overall correlated with 제외여부 and 1 other fieldsHigh correlation
강당수(개) is highly overall correlated with 제외여부 and 1 other fieldsHigh correlation
기숙사재실인원수(명) is highly overall correlated with 제외여부 and 1 other fieldsHigh correlation
진로상담실수(개) is highly overall correlated with 제외여부High correlation
지역교육청명 is highly overall correlated with 시군명 and 2 other fieldsHigh correlation
학교급명 is highly overall correlated with 제외사유 and 1 other fieldsHigh correlation
설립구분명 is highly overall correlated with 지역교육청명High correlation
설립유형명 is highly overall correlated with 제외사유High correlation
학교급명 is highly imbalanced (54.4%)Imbalance
설립구분명 is highly imbalanced (68.7%)Imbalance
제외사유 is highly imbalanced (85.2%)Imbalance
수영장수(개) is highly imbalanced (95.3%)Imbalance
설립유형명 is highly imbalanced (60.3%)Imbalance
체육관수(개) has 4802 (48.0%) missing valuesMissing
강당수(개) has 7007 (70.1%) missing valuesMissing
기숙사재실인원수(명) has 9076 (90.8%) missing valuesMissing
진로상담실수(개) has 5120 (51.2%) missing valuesMissing
기숙사재실인원수(명) has 119 (1.2%) zerosZeros

Reproduction

Analysis started2023-12-10 22:31:30.609317
Analysis finished2023-12-10 22:31:35.304503
Duration4.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년도
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2017.3067
Minimum2015
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:31:35.388919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2015
5-th percentile2015
Q12016
median2017
Q32019
95-th percentile2020
Maximum2020
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.598903
Coefficient of variation (CV)0.00079259291
Kurtosis-1.1575088
Mean2017.3067
Median Absolute Deviation (MAD)1
Skewness0.064913206
Sum20173067
Variance2.5564908
MonotonicityNot monotonic
2023-12-11T07:31:35.518714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2018 1846
18.5%
2017 1842
18.4%
2019 1817
18.2%
2016 1792
17.9%
2015 1746
17.5%
2020 957
9.6%
ValueCountFrequency (%)
2015 1746
17.5%
2016 1792
17.9%
2017 1842
18.4%
2018 1846
18.5%
2019 1817
18.2%
2020 957
9.6%
ValueCountFrequency (%)
2020 957
9.6%
2019 1817
18.2%
2018 1846
18.5%
2017 1842
18.4%
2016 1792
17.9%
2015 1746
17.5%

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수원시
826 
용인시
753 
고양시
705 
성남시
 
634
화성시
 
633
Other values (26)
6449 

Length

Max length4
Median length3
Mean length3.0869
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양주시
2nd row부천시
3rd row군포시
4th row포천시
5th row의정부시

Common Values

ValueCountFrequency (%)
수원시 826
 
8.3%
용인시 753
 
7.5%
고양시 705
 
7.0%
성남시 634
 
6.3%
화성시 633
 
6.3%
부천시 531
 
5.3%
남양주시 483
 
4.8%
평택시 448
 
4.5%
파주시 432
 
4.3%
안산시 428
 
4.3%
Other values (21) 4127
41.3%

Length

2023-12-11T07:31:35.660278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수원시 826
 
8.3%
용인시 753
 
7.5%
고양시 705
 
7.0%
성남시 634
 
6.3%
화성시 633
 
6.3%
부천시 531
 
5.3%
남양주시 483
 
4.8%
평택시 448
 
4.5%
파주시 432
 
4.3%
안산시 428
 
4.3%
Other values (21) 4127
41.3%

지역교육청명
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도교육청
2032 
경기도화성오산교육지원청
660 
경기도수원교육지원청
657 
경기도용인교육지원청
634 
경기도고양교육지원청
 
541
Other values (22)
5476 

Length

Max length13
Median length10
Mean length9.728
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도동두천양주교육지원청
2nd row경기도부천교육지원청
3rd row경기도군포의왕교육지원청
4th row경기도포천교육지원청
5th row경기도의정부교육지원청

Common Values

ValueCountFrequency (%)
경기도교육청 2032
20.3%
경기도화성오산교육지원청 660
 
6.6%
경기도수원교육지원청 657
 
6.6%
경기도용인교육지원청 634
 
6.3%
경기도고양교육지원청 541
 
5.4%
경기도구리남양주교육지원청 501
 
5.0%
경기도성남교육지원청 486
 
4.9%
경기도부천교육지원청 398
 
4.0%
경기도평택교육지원청 360
 
3.6%
경기도파주교육지원청 345
 
3.5%
Other values (17) 3386
33.9%

Length

2023-12-11T07:31:36.053675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도교육청 2032
20.3%
경기도화성오산교육지원청 660
 
6.6%
경기도수원교육지원청 657
 
6.6%
경기도용인교육지원청 634
 
6.3%
경기도고양교육지원청 541
 
5.4%
경기도구리남양주교육지원청 501
 
5.0%
경기도성남교육지원청 486
 
4.9%
경기도부천교육지원청 398
 
4.0%
경기도평택교육지원청 360
 
3.6%
경기도파주교육지원청 345
 
3.5%
Other values (17) 3386
33.9%

지역명
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도 화성시
 
633
경기도 부천시
 
531
경기도 남양주시
 
483
경기도 평택시
 
448
경기도 파주시
 
432
Other values (37)
7473 

Length

Max length12
Median length7
Mean length8.6129
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 양주시
2nd row경기도 부천시
3rd row경기도 군포시
4th row경기도 포천시
5th row경기도 의정부시

Common Values

ValueCountFrequency (%)
경기도 화성시 633
 
6.3%
경기도 부천시 531
 
5.3%
경기도 남양주시 483
 
4.8%
경기도 평택시 448
 
4.5%
경기도 파주시 432
 
4.3%
경기도 성남시 분당구 370
 
3.7%
경기도 김포시 327
 
3.3%
경기도 시흥시 322
 
3.2%
경기도 용인시 기흥구 301
 
3.0%
경기도 의정부시 284
 
2.8%
Other values (32) 5869
58.7%

Length

2023-12-11T07:31:36.184017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 10000
42.2%
수원시 826
 
3.5%
용인시 753
 
3.2%
고양시 705
 
3.0%
성남시 634
 
2.7%
화성시 633
 
2.7%
부천시 531
 
2.2%
남양주시 483
 
2.0%
평택시 448
 
1.9%
파주시 432
 
1.8%
Other values (39) 8264
34.9%
Distinct2474
Distinct (%)24.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:31:36.479099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length6
Mean length6.2849
Min length4

Characters and Unicode

Total characters62849
Distinct characters341
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique151 ?
Unique (%)1.5%

Sample

1st row삼숭중학교
2nd row도원초등학교
3rd row수리초등학교
4th row포천중학교
5th row효자중학교
ValueCountFrequency (%)
초당초등학교 11
 
0.1%
대선초등학교 10
 
0.1%
화성월문초등학교 9
 
0.1%
수성고등학교부설방송통신고등학교 9
 
0.1%
파주여자고등학교 9
 
0.1%
평택도곡초등학교 9
 
0.1%
안화고등학교 9
 
0.1%
한솔초등학교 9
 
0.1%
진안초등학교 9
 
0.1%
안양서초등학교 9
 
0.1%
Other values (2465) 9913
99.1%
2023-12-11T07:31:36.874917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10230
16.3%
10155
16.2%
7231
 
11.5%
5376
 
8.6%
2868
 
4.6%
2128
 
3.4%
616
 
1.0%
602
 
1.0%
601
 
1.0%
586
 
0.9%
Other values (331) 22456
35.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62679
99.7%
Lowercase Letter 81
 
0.1%
Close Punctuation 32
 
0.1%
Open Punctuation 32
 
0.1%
Uppercase Letter 18
 
< 0.1%
Space Separator 6
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10230
16.3%
10155
16.2%
7231
 
11.5%
5376
 
8.6%
2868
 
4.6%
2128
 
3.4%
616
 
1.0%
602
 
1.0%
601
 
1.0%
586
 
0.9%
Other values (315) 22286
35.6%
Lowercase Letter
ValueCountFrequency (%)
s 24
29.6%
i 12
14.8%
n 12
14.8%
e 9
 
11.1%
h 6
 
7.4%
u 6
 
7.4%
l 6
 
7.4%
g 6
 
7.4%
Uppercase Letter
ValueCountFrequency (%)
B 6
33.3%
E 6
33.3%
I 3
16.7%
T 3
16.7%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62679
99.7%
Latin 99
 
0.2%
Common 71
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10230
16.3%
10155
16.2%
7231
 
11.5%
5376
 
8.6%
2868
 
4.6%
2128
 
3.4%
616
 
1.0%
602
 
1.0%
601
 
1.0%
586
 
0.9%
Other values (315) 22286
35.6%
Latin
ValueCountFrequency (%)
s 24
24.2%
i 12
12.1%
n 12
12.1%
e 9
 
9.1%
h 6
 
6.1%
u 6
 
6.1%
B 6
 
6.1%
E 6
 
6.1%
l 6
 
6.1%
g 6
 
6.1%
Other values (2) 6
 
6.1%
Common
ValueCountFrequency (%)
) 32
45.1%
( 32
45.1%
6
 
8.5%
1 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62679
99.7%
ASCII 170
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10230
16.3%
10155
16.2%
7231
 
11.5%
5376
 
8.6%
2868
 
4.6%
2128
 
3.4%
616
 
1.0%
602
 
1.0%
601
 
1.0%
586
 
0.9%
Other values (315) 22286
35.6%
ASCII
ValueCountFrequency (%)
) 32
18.8%
( 32
18.8%
s 24
14.1%
i 12
 
7.1%
n 12
 
7.1%
e 9
 
5.3%
h 6
 
3.5%
u 6
 
3.5%
B 6
 
3.5%
6
 
3.5%
Other values (6) 25
14.7%

학교급명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
6422 
초등학교
1887 
중학교
913 
고등학교
707 
특수학교
 
39
Other values (5)
 
32

Length

Max length7
Median length4
Mean length3.9127
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중학교
2nd row초등학교
3rd row<NA>
4th row중학교
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 6422
64.2%
초등학교 1887
 
18.9%
중학교 913
 
9.1%
고등학교 707
 
7.1%
특수학교 39
 
0.4%
방통고 9
 
0.1%
각종학교(중) 8
 
0.1%
각종학교(초) 5
 
0.1%
각종학교(고) 5
 
0.1%
방통중 5
 
0.1%

Length

2023-12-11T07:31:37.003583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:31:37.123985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 6422
64.2%
초등학교 1887
 
18.9%
중학교 913
 
9.1%
고등학교 707
 
7.1%
특수학교 39
 
0.4%
방통고 9
 
0.1%
각종학교(중 8
 
0.1%
각종학교(초 5
 
< 0.1%
각종학교(고 5
 
< 0.1%
방통중 5
 
< 0.1%

설립구분명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공립
8933 
사립
1059 
국립
 
8

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공립
2nd row공립
3rd row공립
4th row공립
5th row공립

Common Values

ValueCountFrequency (%)
공립 8933
89.3%
사립 1059
 
10.6%
국립 8
 
0.1%

Length

2023-12-11T07:31:37.244980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:31:37.328573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공립 8933
89.3%
사립 1059
 
10.6%
국립 8
 
0.1%

제외여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
8469 
True
1531 
ValueCountFrequency (%)
False 8469
84.7%
True 1531
 
15.3%
2023-12-11T07:31:37.400319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

제외사유
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8470 
본교는 해당항목에 대해서 통계자료가 없으므로 제외함.
1502 
본교인 삼평중학교에 따름
 
3
서현고등학교(본교)에서 관리하므로 제외 처리함.
 
3
수원여자고등학교부설방송통신고등학교로 본교에서 정보공시
 
2
Other values (16)
 
20

Length

Max length75
Median length4
Mean length7.8062
Min length3

Unique

Unique12 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8470
84.7%
본교는 해당항목에 대해서 통계자료가 없으므로 제외함. 1502
 
15.0%
본교인 삼평중학교에 따름 3
 
< 0.1%
서현고등학교(본교)에서 관리하므로 제외 처리함. 3
 
< 0.1%
수원여자고등학교부설방송통신고등학교로 본교에서 정보공시 2
 
< 0.1%
하늘꿈학교(고) 해당항목 열람 바람 2
 
< 0.1%
경기체육고등학교에서 통합관리하고 있으므로 경기체육고등학교에서 입력함 2
 
< 0.1%
서현고등학교 정보공시로 대체함 2
 
< 0.1%
수성고등학교 부설 학교로 본교 시설을 사용하고 있음. 2
 
< 0.1%
본교는 해당항목에 대해서 통계자료가 없으므로 제외함.(학생교육활동에 필요한 지원시설 체육관, 강당, 기숙사, 수영장, 진로상담실 없음) 1
 
< 0.1%
Other values (11) 11
 
0.1%

Length

2023-12-11T07:31:37.500795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 8470
48.1%
해당항목에 1503
 
8.5%
대해서 1503
 
8.5%
통계자료가 1503
 
8.5%
없으므로 1503
 
8.5%
본교는 1503
 
8.5%
제외함 1502
 
8.5%
제외 5
 
< 0.1%
경기체육고등학교에서 4
 
< 0.1%
처리함 4
 
< 0.1%
Other values (53) 101
 
0.6%

체육관수(개)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct15
Distinct (%)0.3%
Missing4802
Missing (%)48.0%
Infinite0
Infinite (%)0.0%
Mean1.1060023
Minimum0
Maximum16
Zeros21
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:31:37.598194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q31
95-th percentile2
Maximum16
Range16
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.7981353
Coefficient of variation (CV)0.72163981
Kurtosis202.53476
Mean1.1060023
Median Absolute Deviation (MAD)0
Skewness13.160271
Sum5749
Variance0.63701996
MonotonicityNot monotonic
2023-12-11T07:31:37.719644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
1 4896
49.0%
2 211
 
2.1%
3 29
 
0.3%
0 21
 
0.2%
4 12
 
0.1%
5 7
 
0.1%
9 4
 
< 0.1%
12 4
 
< 0.1%
14 3
 
< 0.1%
16 3
 
< 0.1%
Other values (5) 8
 
0.1%
(Missing) 4802
48.0%
ValueCountFrequency (%)
0 21
 
0.2%
1 4896
49.0%
2 211
 
2.1%
3 29
 
0.3%
4 12
 
0.1%
5 7
 
0.1%
6 2
 
< 0.1%
8 1
 
< 0.1%
9 4
 
< 0.1%
11 1
 
< 0.1%
ValueCountFrequency (%)
16 3
< 0.1%
15 2
 
< 0.1%
14 3
< 0.1%
13 2
 
< 0.1%
12 4
< 0.1%
11 1
 
< 0.1%
9 4
< 0.1%
8 1
 
< 0.1%
6 2
 
< 0.1%
5 7
0.1%

강당수(개)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct14
Distinct (%)0.5%
Missing7007
Missing (%)70.1%
Infinite0
Infinite (%)0.0%
Mean1.1299699
Minimum0
Maximum30
Zeros48
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:31:37.822186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q31
95-th percentile2
Maximum30
Range30
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.1414706
Coefficient of variation (CV)1.0101778
Kurtosis360.36927
Mean1.1299699
Median Absolute Deviation (MAD)0
Skewness16.842967
Sum3382
Variance1.3029551
MonotonicityNot monotonic
2023-12-11T07:31:37.918872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
1 2762
 
27.6%
2 123
 
1.2%
0 48
 
0.5%
3 31
 
0.3%
4 6
 
0.1%
7 5
 
0.1%
9 4
 
< 0.1%
5 4
 
< 0.1%
10 3
 
< 0.1%
22 2
 
< 0.1%
Other values (4) 5
 
0.1%
(Missing) 7007
70.1%
ValueCountFrequency (%)
0 48
 
0.5%
1 2762
27.6%
2 123
 
1.2%
3 31
 
0.3%
4 6
 
0.1%
5 4
 
< 0.1%
7 5
 
0.1%
8 1
 
< 0.1%
9 4
 
< 0.1%
10 3
 
< 0.1%
ValueCountFrequency (%)
30 2
 
< 0.1%
22 2
 
< 0.1%
13 1
 
< 0.1%
11 1
 
< 0.1%
10 3
< 0.1%
9 4
< 0.1%
8 1
 
< 0.1%
7 5
0.1%
5 4
< 0.1%
4 6
0.1%

기숙사재실인원수(명)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct243
Distinct (%)26.3%
Missing9076
Missing (%)90.8%
Infinite0
Infinite (%)0.0%
Mean119.30952
Minimum0
Maximum1124
Zeros119
Zeros (%)1.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:31:38.031874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q115
median56
Q3121.75
95-th percentile592.7
Maximum1124
Range1124
Interquartile range (IQR)106.75

Descriptive statistics

Standard deviation181.55616
Coefficient of variation (CV)1.521724
Kurtosis8.8935414
Mean119.30952
Median Absolute Deviation (MAD)47
Skewness2.7897028
Sum110242
Variance32962.641
MonotonicityNot monotonic
2023-12-11T07:31:38.154861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 119
 
1.2%
200 23
 
0.2%
14 16
 
0.2%
40 14
 
0.1%
80 14
 
0.1%
120 14
 
0.1%
7 13
 
0.1%
16 13
 
0.1%
102 12
 
0.1%
33 12
 
0.1%
Other values (233) 674
 
6.7%
(Missing) 9076
90.8%
ValueCountFrequency (%)
0 119
1.2%
1 2
 
< 0.1%
2 3
 
< 0.1%
3 3
 
< 0.1%
4 5
 
0.1%
5 6
 
0.1%
6 9
 
0.1%
7 13
 
0.1%
8 6
 
0.1%
9 11
 
0.1%
ValueCountFrequency (%)
1124 1
< 0.1%
1123 1
< 0.1%
1102 1
< 0.1%
1097 2
< 0.1%
1082 1
< 0.1%
1081 2
< 0.1%
1039 1
< 0.1%
897 1
< 0.1%
699 1
< 0.1%
696 1
< 0.1%

수영장수(개)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9861 
1
 
74
0
 
60
2
 
3
17
 
1

Length

Max length4
Median length4
Mean length3.9585
Min length1

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9861
98.6%
1 74
 
0.7%
0 60
 
0.6%
2 3
 
< 0.1%
17 1
 
< 0.1%
13 1
 
< 0.1%

Length

2023-12-11T07:31:38.272459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:31:38.363601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9861
98.6%
1 74
 
0.7%
0 60
 
0.6%
2 3
 
< 0.1%
17 1
 
< 0.1%
13 1
 
< 0.1%

진로상담실수(개)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct8
Distinct (%)0.2%
Missing5120
Missing (%)51.2%
Infinite0
Infinite (%)0.0%
Mean1.2405738
Minimum0
Maximum7
Zeros23
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:31:38.444347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q31
95-th percentile3
Maximum7
Range7
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.68413437
Coefficient of variation (CV)0.5514661
Kurtosis16.363004
Mean1.2405738
Median Absolute Deviation (MAD)0
Skewness3.5834159
Sum6054
Variance0.46803984
MonotonicityNot monotonic
2023-12-11T07:31:38.538791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1 4093
40.9%
2 495
 
5.0%
3 154
 
1.5%
4 86
 
0.9%
0 23
 
0.2%
5 16
 
0.2%
7 7
 
0.1%
6 6
 
0.1%
(Missing) 5120
51.2%
ValueCountFrequency (%)
0 23
 
0.2%
1 4093
40.9%
2 495
 
5.0%
3 154
 
1.5%
4 86
 
0.9%
5 16
 
0.2%
6 6
 
0.1%
7 7
 
0.1%
ValueCountFrequency (%)
7 7
 
0.1%
6 6
 
0.1%
5 16
 
0.2%
4 86
 
0.9%
3 154
 
1.5%
2 495
 
5.0%
1 4093
40.9%
0 23
 
0.2%

설립유형명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
단설
8093 
<NA>
1154 
병설
 
698
부속
 
28
부설
 
27

Length

Max length4
Median length2
Mean length2.2308
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row단설
2nd row단설
3rd row단설
4th row단설
5th row단설

Common Values

ValueCountFrequency (%)
단설 8093
80.9%
<NA> 1154
 
11.5%
병설 698
 
7.0%
부속 28
 
0.3%
부설 27
 
0.3%

Length

2023-12-11T07:31:38.659357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:31:38.759629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
단설 8093
80.9%
na 1154
 
11.5%
병설 698
 
7.0%
부속 28
 
0.3%
부설 27
 
0.3%

Interactions

2023-12-11T07:31:34.084486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:32.089805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:32.530016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:33.057597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:33.556611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:34.175263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:32.177515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:32.620401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:33.143108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:33.646861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:34.289948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:32.259195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:32.715027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:33.237583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:33.758046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:34.393375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:32.345819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:32.819989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:33.331035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:33.850153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:34.481665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:32.437818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:32.938767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:33.447056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:31:33.964421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:31:38.839194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도시군명지역교육청명지역명학교급명설립구분명제외여부제외사유체육관수(개)강당수(개)기숙사재실인원수(명)수영장수(개)진로상담실수(개)설립유형명
기준년도1.0000.0000.0000.0000.0000.0000.0190.1310.0660.0590.0000.0000.0000.050
시군명0.0001.0000.9921.0000.2230.2710.2540.0000.1510.1910.6260.5840.3150.242
지역교육청명0.0000.9921.0000.9940.7590.9390.3260.2040.1640.0870.0000.6120.2480.276
지역명0.0001.0000.9941.0000.2430.3960.2970.4750.2450.3060.7330.8350.3600.294
학교급명0.0000.2230.7590.2431.0000.6340.2070.9130.0000.0000.0000.9010.5170.575
설립구분명0.0000.2710.9390.3960.6341.0000.0050.1270.0860.1090.1330.2190.1490.164
제외여부0.0190.2540.3260.2970.2070.0051.000NaNNaNNaNNaNNaNNaN0.188
제외사유0.1310.0000.2040.4750.9130.127NaN1.000NaNNaNNaNNaNNaN0.772
체육관수(개)0.0660.1510.1640.2450.0000.086NaNNaN1.0000.6070.1100.9910.0000.000
강당수(개)0.0590.1910.0870.3060.0000.109NaNNaN0.6071.0000.0000.9800.0460.000
기숙사재실인원수(명)0.0000.6260.0000.7330.0000.133NaNNaN0.1100.0001.0000.7840.4080.349
수영장수(개)0.0000.5840.6120.8350.9010.219NaNNaN0.9910.9800.7841.0000.1500.132
진로상담실수(개)0.0000.3150.2480.3600.5170.149NaNNaN0.0000.0460.4080.1501.0000.253
설립유형명0.0500.2420.2760.2940.5750.1640.1880.7720.0000.0000.3490.1320.2531.000
2023-12-11T07:31:39.004989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학교급명시군명설립유형명수영장수(개)지역교육청명지역명제외사유제외여부설립구분명
학교급명1.0000.0840.4070.5980.3640.0900.7520.2070.352
시군명0.0841.0000.1280.2730.8600.9990.0000.2160.141
설립유형명0.4070.1281.0000.1060.1480.1520.6300.1250.155
수영장수(개)0.5980.2730.1061.0000.3850.487NaN1.0000.167
지역교육청명0.3640.8600.1480.3851.0000.8600.0560.2800.751
지역명0.0900.9990.1520.4870.8601.0000.1350.2360.201
제외사유0.7520.0000.630NaN0.0560.1351.0001.0000.100
제외여부0.2070.2160.1251.0000.2800.2361.0001.0000.009
설립구분명0.3520.1410.1550.1670.7510.2010.1000.0091.000
2023-12-11T07:31:39.134544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도체육관수(개)강당수(개)기숙사재실인원수(명)진로상담실수(개)시군명지역교육청명지역명학교급명설립구분명제외여부제외사유수영장수(개)설립유형명
기준년도1.000-0.031-0.0230.0060.0360.0000.0000.0000.0000.0000.0330.0640.0000.042
체육관수(개)-0.0311.0000.2170.0900.0880.0550.0540.0850.0000.0351.0000.0000.8630.000
강당수(개)-0.0230.2171.0000.1540.2160.0520.0000.1130.0000.0601.0000.0000.8140.000
기숙사재실인원수(명)0.0060.0900.1541.0000.0900.2860.0000.3630.0000.0581.0000.0000.8370.230
진로상담실수(개)0.0360.0880.2160.0901.0000.1280.1010.1450.2840.0951.0000.0000.1400.115
시군명0.0000.0550.0520.2860.1281.0000.8600.9990.0840.1410.2160.0000.2730.128
지역교육청명0.0000.0540.0000.0000.1010.8601.0000.8600.3640.7510.2800.0560.3850.148
지역명0.0000.0850.1130.3630.1450.9990.8601.0000.0900.2010.2360.1350.4870.152
학교급명0.0000.0000.0000.0000.2840.0840.3640.0901.0000.3520.2070.7520.5980.407
설립구분명0.0000.0350.0600.0580.0950.1410.7510.2010.3521.0000.0090.1000.1670.155
제외여부0.0331.0001.0001.0001.0000.2160.2800.2360.2070.0091.0001.0001.0000.125
제외사유0.0640.0000.0000.0000.0000.0000.0560.1350.7520.1001.0001.0000.0000.630
수영장수(개)0.0000.8630.8140.8370.1400.2730.3850.4870.5980.1671.0000.0001.0000.106
설립유형명0.0420.0000.0000.2300.1150.1280.1480.1520.4070.1550.1250.6300.1061.000

Missing values

2023-12-11T07:31:34.644560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:31:34.906295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:31:35.124313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유체육관수(개)강당수(개)기숙사재실인원수(명)수영장수(개)진로상담실수(개)설립유형명
152102017양주시경기도동두천양주교육지원청경기도 양주시삼숭중학교중학교공립N<NA><NA>1<NA><NA>1단설
230962015부천시경기도부천교육지원청경기도 부천시도원초등학교초등학교공립N<NA>11<NA><NA><NA>단설
32112019군포시경기도군포의왕교육지원청경기도 군포시수리초등학교<NA>공립N<NA><NA><NA><NA><NA>1단설
214582016포천시경기도포천교육지원청경기도 포천시포천중학교중학교공립N<NA>2<NA><NA><NA><NA>단설
111922018의정부시경기도의정부교육지원청경기도 의정부시효자중학교<NA>공립N<NA>1<NA><NA><NA><NA>단설
160632017의정부시경기도의정부교육지원청경기도 의정부시의정부효자초등학교<NA>공립Y본교는 해당항목에 대해서 통계자료가 없으므로 제외함.<NA><NA><NA><NA><NA><NA>
88962018성남시경기도성남교육지원청경기도 성남시 수정구풍생중학교<NA>사립Y본교는 해당항목에 대해서 통계자료가 없으므로 제외함.<NA><NA><NA><NA><NA><NA>
27332019고양시경기도교육청경기도 고양시 일산동구정발고등학교<NA>공립N<NA>2<NA><NA><NA><NA>단설
178522016군포시경기도군포의왕교육지원청경기도 군포시한얼초등학교<NA>공립N<NA><NA>1<NA><NA><NA>단설
137322017부천시경기도부천교육지원청경기도 부천시부천북초등학교<NA>공립N<NA>1<NA><NA><NA>1단설
기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유체육관수(개)강당수(개)기숙사재실인원수(명)수영장수(개)진로상담실수(개)설립유형명
147772017안산시경기도안산교육지원청경기도 안산시 상록구안산중학교중학교사립N<NA><NA><NA><NA><NA>1단설
90992018성남시경기도성남교육지원청경기도 성남시 분당구성남화랑초등학교초등학교공립N<NA><NA>1<NA><NA><NA>단설
185152016부천시경기도부천교육지원청경기도 부천시부천북초등학교초등학교공립N<NA>1<NA><NA><NA>1단설
234642015성남시경기도교육청경기도 성남시 분당구운중고등학교고등학교공립N<NA>1<NA><NA><NA>3단설
58552019용인시경기도용인교육지원청경기도 용인시 처인구둔전제일초등학교<NA>공립N<NA><NA>1<NA><NA><NA>단설
130092017군포시경기도군포의왕교육지원청경기도 군포시용호중학교<NA>공립N<NA>1<NA><NA><NA>1단설
197082016안성시경기도안성교육지원청경기도 안성시일죽초등학교초등학교공립N<NA>1<NA><NA><NA><NA>단설
244842015안성시경기도교육청경기도 안성시안성여자고등학교<NA>공립N<NA><NA><NA><NA><NA>1단설
138812017성남시경기도성남교육지원청경기도 성남시 분당구신백현초등학교<NA>공립N<NA>1<NA><NA><NA><NA>단설
153482017여주시경기도여주교육지원청경기도 여주시금당초등학교초등학교공립Y본교는 해당항목에 대해서 통계자료가 없으므로 제외함.<NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유체육관수(개)강당수(개)기숙사재실인원수(명)수영장수(개)진로상담실수(개)설립유형명# duplicates
02019가평군경기도가평교육지원청경기도 가평군상색초등학교<NA>공립N<NA><NA>1<NA><NA><NA>단설2
12019가평군경기도가평교육지원청경기도 가평군상천초등학교<NA>공립N<NA><NA>1<NA><NA><NA>병설2
22019가평군경기도가평교육지원청경기도 가평군조종중학교<NA>공립N<NA><NA><NA><NA><NA>1병설2
32019가평군경기도교육청경기도 가평군청평고등학교<NA>공립N<NA>11<NA><NA>2단설2
42019고양시경기도고양교육지원청경기도 고양시 덕양구가람초등학교<NA>공립N<NA><NA>2<NA><NA>1단설2
52019고양시경기도고양교육지원청경기도 고양시 덕양구내유초등학교<NA>공립N<NA>1<NA><NA><NA><NA>단설2
62019고양시경기도고양교육지원청경기도 고양시 덕양구목암초등학교<NA>공립N<NA>1<NA><NA><NA>1단설2
72019고양시경기도고양교육지원청경기도 고양시 덕양구신원초등학교<NA>공립N<NA>1<NA><NA><NA>1단설2
82019고양시경기도고양교육지원청경기도 고양시 덕양구용두초등학교<NA>공립Y본교는 해당항목에 대해서 통계자료가 없으므로 제외함.<NA><NA><NA><NA><NA>단설2
92019고양시경기도고양교육지원청경기도 고양시 덕양구행신초등학교<NA>공립N<NA>11<NA><NA><NA>단설2