Overview

Dataset statistics

Number of variables: 14
Number of observations: 10000
Missing cells: 10141
Missing cells (%): 7.2%
Duplicate rows: 348
Duplicate rows (%): 3.5%
Total size in memory: 1.2 MiB
Average record size in memory: 126.0 B
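
The two percentages follow from the counts. The sketch below redoes the arithmetic, assuming the missing-cell percentage is taken over all cells (variables × observations). The per-variable missing counts are the ones reported in the Variables section of this report; the dictionary and helper names are illustrative only.

```python
# Per-variable missing counts as reported in the Variables section of this report.
missing_by_column = {
    "제외사유": 9952,
    "도서자료수(권)": 31,
    "비도서자료수(개)": 31,
    "자료수합계(권)": 48,
    "전체학생수(명)": 48,
    "1인당장서비율(%)": 31,
}

n_variables = 14
n_observations = 10_000
duplicate_rows = 348


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


missing_cells = sum(missing_by_column.values())
# Assumption: the denominator is every cell in the table.
missing_pct = percent(missing_cells, n_variables * n_observations)
duplicate_pct = percent(duplicate_rows, n_observations)

print(missing_cells, missing_pct, duplicate_pct)
```

The per-variable counts add up to the reported total, so the 학교급명 value `<NA>` appears to be counted as an ordinary category rather than as a missing cell.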

Variable types

Numeric: 6
Categorical: 5
Text: 2
Boolean: 1

Dataset

Description: School library status (collection holdings); original title: 학교도서관 현황(자료 보유현황)
Author: Gyeonggi-do (경기도)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=YTT145F1BD9SI8GVWPY423493075&infSeq=2

Column names (English meaning)

기준년도: reference year
시군명: city/county name
지역교육청명: regional education office name
지역명: region name
학교명: school name
학교급명: school level
설립구분명: establishment type
제외여부: excluded flag
제외사유: reason for exclusion
도서자료수(권): book holdings (volumes)
비도서자료수(개): non-book holdings (items)
자료수합계(권): total holdings (volumes)
전체학생수(명): total number of students (persons)
1인당장서비율(%): holdings per student

Alerts

Dataset has 348 (3.5%) duplicate rows (Duplicates)
시군명 is highly overall correlated with 지역교육청명 and 1 other field (High correlation)
지역명 is highly overall correlated with 시군명 and 1 other field (High correlation)
제외여부 is highly overall correlated with 자료수합계(권) and 2 other fields (High correlation)
학교급명 is highly overall correlated with 제외여부 (High correlation)
도서자료수(권) is highly overall correlated with 비도서자료수(개) and 1 other field (High correlation)
비도서자료수(개) is highly overall correlated with 도서자료수(권) and 1 other field (High correlation)
자료수합계(권) is highly overall correlated with 도서자료수(권) and 2 other fields (High correlation)
전체학생수(명) is highly overall correlated with 1인당장서비율(%) and 1 other field (High correlation)
1인당장서비율(%) is highly overall correlated with 전체학생수(명) (High correlation)
지역교육청명 is highly overall correlated with 시군명 and 2 other fields (High correlation)
설립구분명 is highly overall correlated with 지역교육청명 (High correlation)
설립구분명 is highly imbalanced (70.3%) (Imbalance)
제외여부 is highly imbalanced (95.6%) (Imbalance)
제외사유 has 9952 (99.5%) missing values (Missing)
비도서자료수(개) is highly skewed (γ1 = 65.73113936) (Skewed)
도서자료수(권) has 145 (1.5%) zeros (Zeros)
비도서자료수(개) has 596 (6.0%) zeros (Zeros)
자료수합계(권) has 128 (1.3%) zeros (Zeros)
1인당장서비율(%) has 146 (1.5%) zeros (Zeros)
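
The report does not say how the imbalance percentages are computed. The sketch below assumes the score is one minus the Shannon entropy of the class distribution, normalized by the log of the number of classes. That assumption reproduces both figures in the alerts (95.6% and 70.3%) from the class counts listed later in this report; the function name is illustrative.

```python
import math


def imbalance_score(counts: list[int]) -> float:
    """One minus the Shannon entropy of the class distribution, normalized by log2(k).

    Assumed formula: 0 means perfectly balanced classes, 1 means a single class.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Class counts taken from the value tables of this report.
score_exclusion = imbalance_score([9952, 48])           # 제외여부: False / True
score_establishment = imbalance_score([9004, 993, 3])   # 설립구분명: 공립 / 사립 / 국립

print(f"{score_exclusion:.1%}", f"{score_establishment:.1%}")
```

Under this reading, 제외여부 scores higher than 설립구분명 even though both are dominated by a single class, because its minority class holds only 0.5% of the rows.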

Reproduction

Analysis started: 2024-03-08 14:53:26.564734
Analysis finished: 2024-03-08 14:53:36.921614
Duration: 10.36 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
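
As a quick consistency check, the reported duration can be recomputed from the two timestamps. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction block of this report.
started = datetime.strptime("2024-03-08 14:53:26.564734", FMT)
finished = datetime.strptime("2024-03-08 14:53:36.921614", FMT)

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```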

Variables

기준년도
Real number (ℝ)

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2017.3162
Minimum: 2015
Maximum: 2020
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2015
5th percentile: 2015
Q1: 2016
Median: 2017
Q3: 2019
95th percentile: 2020
Maximum: 2020
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.6012102
Coefficient of variation (CV): 0.00079373286
Kurtosis: -1.1657973
Mean: 2017.3162
Median Absolute Deviation (MAD): 1
Skewness: 0.050471352
Sum: 20173162
Variance: 2.5638739
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=6)

Values by frequency
Value | Count | Frequency (%)
2019 | 1869 | 18.7%
2017 | 1846 | 18.5%
2018 | 1827 | 18.3%
2015 | 1753 | 17.5%
2016 | 1753 | 17.5%
2020 | 952 | 9.5%

Values in ascending order
Value | Count | Frequency (%)
2015 | 1753 | 17.5%
2016 | 1753 | 17.5%
2017 | 1846 | 18.5%
2018 | 1827 | 18.3%
2019 | 1869 | 18.7%
2020 | 952 | 9.5%

Values in descending order
Value | Count | Frequency (%)
2020 | 952 | 9.5%
2019 | 1869 | 18.7%
2018 | 1827 | 18.3%
2017 | 1846 | 18.5%
2016 | 1753 | 17.5%
2015 | 1753 | 17.5%
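
The value counts for 기준년도 cover all 10000 rows, so the descriptive statistics can be recomputed from them. The sketch below assumes the reported variance is the sample variance (divisor n - 1), which matches the reported figure; the variable names are illustrative.

```python
import math

# Value counts for 기준년도 as listed in this section.
year_counts = {2015: 1753, 2016: 1753, 2017: 1846, 2018: 1827, 2019: 1869, 2020: 952}

n = sum(year_counts.values())
total = sum(year * count for year, count in year_counts.items())
mean = total / n

# Assumption: sample variance (ddof = 1), not population variance.
sum_sq_dev = sum(count * (year - mean) ** 2 for year, count in year_counts.items())
variance = sum_sq_dev / (n - 1)
std_dev = math.sqrt(variance)
cv = std_dev / mean  # coefficient of variation

print(n, total, round(mean, 4), round(variance, 5), round(std_dev, 5))
```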

시군명
Categorical

HIGH CORRELATION 

Distinct: 31
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Top categories: 수원시 871; 용인시 755; 성남시 672; 고양시 657; 화성시 628; Other values (26) 6417

Length

Max length: 4
Median length: 3
Mean length: 3.0898
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 김포시
2nd row: 수원시
3rd row: 가평군
4th row: 연천군
5th row: 김포시

Common Values

Value | Count | Frequency (%)
수원시 | 871 | 8.7%
용인시 | 755 | 7.5%
성남시 | 672 | 6.7%
고양시 | 657 | 6.6%
화성시 | 628 | 6.3%
부천시 | 506 | 5.1%
남양주시 | 505 | 5.1%
평택시 | 456 | 4.6%
파주시 | 410 | 4.1%
안산시 | 406 | 4.1%
Other values (21) | 4134 | 41.3%

Length

Histogram of lengths of the category

지역교육청명
Categorical

HIGH CORRELATION 

Distinct: 27
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Top categories: 경기도교육청 2059; 경기도수원교육지원청 673; 경기도화성오산교육지원청 669; 경기도용인교육지원청 630; 경기도고양교육지원청 507; Other values (22) 5462

Length

Max length: 13
Median length: 10
Mean length: 9.7289
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도김포교육지원청
2nd row: 경기도수원교육지원청
3rd row: 경기도가평교육지원청
4th row: 경기도연천교육지원청
5th row: 경기도김포교육지원청

Common Values

Value | Count | Frequency (%)
경기도교육청 | 2059 | 20.6%
경기도수원교육지원청 | 673 | 6.7%
경기도화성오산교육지원청 | 669 | 6.7%
경기도용인교육지원청 | 630 | 6.3%
경기도고양교육지원청 | 507 | 5.1%
경기도구리남양주교육지원청 | 507 | 5.1%
경기도성남교육지원청 | 500 | 5.0%
경기도부천교육지원청 | 383 | 3.8%
경기도평택교육지원청 | 368 | 3.7%
경기도파주교육지원청 | 330 | 3.3%
Other values (17) | 3374 | 33.7%

Length

Histogram of lengths of the category

지역명
Categorical

HIGH CORRELATION 

Distinct: 42
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Top categories: 경기도 화성시 628; 경기도 부천시 506; 경기도 남양주시 505; 경기도 평택시 456; 경기도 파주시 410; Other values (37) 7495

Length

Max length: 12
Median length: 7
Mean length: 8.6301
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도 김포시
2nd row: 경기도 수원시 권선구
3rd row: 경기도 가평군
4th row: 경기도 연천군
5th row: 경기도 김포시

Common Values

Value | Count | Frequency (%)
경기도 화성시 | 628 | 6.3%
경기도 부천시 | 506 | 5.1%
경기도 남양주시 | 505 | 5.1%
경기도 평택시 | 456 | 4.6%
경기도 파주시 | 410 | 4.1%
경기도 성남시 분당구 | 379 | 3.8%
경기도 시흥시 | 350 | 3.5%
경기도 김포시 | 337 | 3.4%
경기도 용인시 기흥구 | 318 | 3.2%
경기도 의정부시 | 296 | 3.0%
Other values (32) | 5815 | 58.1%

Length

Histogram of lengths of the category

Word frequencies
Value | Count | Frequency (%)
경기도 | 10000 | 42.1%
수원시 | 871 | 3.7%
용인시 | 755 | 3.2%
성남시 | 672 | 2.8%
고양시 | 657 | 2.8%
화성시 | 628 | 2.6%
부천시 | 506 | 2.1%
남양주시 | 505 | 2.1%
평택시 | 456 | 1.9%
파주시 | 410 | 1.7%
Other values (39) | 8294 | 34.9%

학교명
Text

Distinct: 2470
Distinct (%): 24.7%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 24
Median length: 6
Mean length: 6.2839
Min length: 4

Characters and Unicode

Total characters: 62839
Distinct characters: 345
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
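
To show what such character properties look like, the sketch below tallies the Unicode general category of each character in one school name that appears in this report, using Python's standard `unicodedata` module. The helper name is illustrative. The two-letter codes correspond to the category names used in the tables: `Lo` is Other Letter, `Ps` is Open Punctuation, `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters of `text` by Unicode general category (e.g. 'Lo', 'Ps')."""
    return Counter(unicodedata.category(ch) for ch in text)


name = "티엘비유글로벌학교(중)"  # a school name that appears in this report
counts = category_counts(name)
print(dict(counts))
```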

Unique

Unique: 166
Unique (%): 1.7%

Sample

1st row: 푸른솔초등학교
2nd row: 수원중촌초등학교
3rd row: 대성초등학교
4th row: 군남중학교
5th row: 나비초등학교

Value | Count | Frequency (%)
초당초등학교 | 13 | 0.1%
상원초등학교 | 11 | 0.1%
호계중학교 | 9 | 0.1%
평촌중학교 | 9 | 0.1%
고암초등학교 | 9 | 0.1%
원일초등학교 | 9 | 0.1%
동오초등학교 | 9 | 0.1%
삼성초등학교 | 9 | 0.1%
토평고등학교 | 9 | 0.1%
지평중학교 | 9 | 0.1%
Other values (2461) | 9909 | 99.0%

Most occurring characters

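Value | Count | Frequency (%)
(not preserved) | 10242 | 16.3%
(not preserved) | 10158 | 16.2%
(not preserved) | 7341 | 11.7%
(not preserved) | 5366 | 8.5%
(not preserved) | 2878 | 4.6%
(not preserved) | 2243 | 3.6%
(not preserved) | 617 | 1.0%
(not preserved) | 606 | 1.0%
(not preserved) | 599 | 1.0%
(not preserved) | 578 | 0.9%
Other values (335) | 22211 | 35.3%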

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 62718 | 99.8%
Lowercase Letter | 70 | 0.1%
Uppercase Letter | 20 | < 0.1%
Open Punctuation | 13 | < 0.1%
Close Punctuation | 13 | < 0.1%
Space Separator | 5 | < 0.1%

Most frequent character per category

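Other Letter
Value | Count | Frequency (%)
(not preserved) | 10242 | 16.3%
(not preserved) | 10158 | 16.2%
(not preserved) | 7341 | 11.7%
(not preserved) | 5366 | 8.6%
(not preserved) | 2878 | 4.6%
(not preserved) | 2243 | 3.6%
(not preserved) | 617 | 1.0%
(not preserved) | 606 | 1.0%
(not preserved) | 599 | 1.0%
(not preserved) | 578 | 0.9%
Other values (320) | 22090 | 35.2%

Lowercase Letter
Value | Count | Frequency (%)
s | 20 | 28.6%
n | 10 | 14.3%
e | 10 | 14.3%
i | 10 | 14.3%
l | 5 | 7.1%
g | 5 | 7.1%
h | 5 | 7.1%
u | 5 | 7.1%

Uppercase Letter
Value | Count | Frequency (%)
I | 5 | 25.0%
T | 5 | 25.0%
E | 5 | 25.0%
B | 5 | 25.0%

Open Punctuation
Value | Count | Frequency (%)
( | 13 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 13 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 5 | 100.0%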

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 62718 | 99.8%
Latin | 90 | 0.1%
Common | 31 | < 0.1%

Most frequent character per script

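Hangul
Value | Count | Frequency (%)
(not preserved) | 10242 | 16.3%
(not preserved) | 10158 | 16.2%
(not preserved) | 7341 | 11.7%
(not preserved) | 5366 | 8.6%
(not preserved) | 2878 | 4.6%
(not preserved) | 2243 | 3.6%
(not preserved) | 617 | 1.0%
(not preserved) | 606 | 1.0%
(not preserved) | 599 | 1.0%
(not preserved) | 578 | 0.9%
Other values (320) | 22090 | 35.2%

Latin
Value | Count | Frequency (%)
s | 20 | 22.2%
n | 10 | 11.1%
e | 10 | 11.1%
i | 10 | 11.1%
l | 5 | 5.6%
I | 5 | 5.6%
T | 5 | 5.6%
E | 5 | 5.6%
g | 5 | 5.6%
h | 5 | 5.6%
Other values (2) | 10 | 11.1%

Common
Value | Count | Frequency (%)
( | 13 | 41.9%
) | 13 | 41.9%
(space) | 5 | 16.1%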

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 62718 | 99.8%
ASCII | 121 | 0.2%

Most frequent character per block

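Hangul
Value | Count | Frequency (%)
(not preserved) | 10242 | 16.3%
(not preserved) | 10158 | 16.2%
(not preserved) | 7341 | 11.7%
(not preserved) | 5366 | 8.6%
(not preserved) | 2878 | 4.6%
(not preserved) | 2243 | 3.6%
(not preserved) | 617 | 1.0%
(not preserved) | 606 | 1.0%
(not preserved) | 599 | 1.0%
(not preserved) | 578 | 0.9%
Other values (320) | 22090 | 35.2%

ASCII
Value | Count | Frequency (%)
s | 20 | 16.5%
( | 13 | 10.7%
) | 13 | 10.7%
n | 10 | 8.3%
e | 10 | 8.3%
i | 10 | 8.3%
l | 5 | 4.1%
I | 5 | 4.1%
T | 5 | 4.1%
E | 5 | 4.1%
Other values (5) | 25 | 20.7%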

학교급명
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Top categories: <NA> 6389; 초등학교 1932; 중학교 941; 고등학교 725; 방통고 7

Length

Max length: 4
Median length: 4
Mean length: 3.9046
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 6389 | 63.9%
초등학교 | 1932 | 19.3%
중학교 | 941 | 9.4%
고등학교 | 725 | 7.2%
방통고 | 7 | 0.1%
방통중 | 6 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

[Bar chart of the common values; image not preserved.]

설립구분명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Top categories: 공립 9004; 사립 993; 국립 3

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공립
2nd row: 공립
3rd row: 공립
4th row: 공립
5th row: 공립

Common Values

Value | Count | Frequency (%)
공립 | 9004 | 90.0%
사립 | 993 | 9.9%
국립 | 3 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

[Bar chart of the common values; image not preserved.]

제외여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 87.9 KiB

Value | Count | Frequency (%)
False | 9952 | 99.5%
True | 48 | 0.5%

제외사유
Text

MISSING 

Distinct: 26
Distinct (%): 54.2%
Missing: 9952
Missing (%): 99.5%
Memory size: 156.2 KiB

Length

Max length: 44
Median length: 37
Mean length: 21.520833
Min length: 3

Characters and Unicode

Total characters: 1033
Distinct characters: 104
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 14
Unique (%): 29.2%

Sample

1st row: 부설교
2nd row: 방통중 출석수업(일요일) 운영에 한계가 있어 힉교도서관 미운영으로 제외 처리함.
3rd row: 본교는 해당항목에 대해서 통계자료가 없으므로 제외함.
4th row: 수원여자고등학교부설방송통신고등학교로 본교에서 정보공시
5th row: 방통중 출석수업(일요일) 여건상의 한계로 도서관 미운영에 따라 제외처리함.

Value | Count | Frequency (%)
학교도서관 | 10 | 5.0%
미설치 | 8 | 4.0%
제외 | 7 | 3.5%
처리함 | 6 | 3.0%
제외함 | 6 | 3.0%
부설교 | 5 | 2.5%
없으므로 | 5 | 2.5%
방송통신고용 | 5 | 2.5%
통계자료가 | 5 | 2.5%
해당항목에 | 5 | 2.5%
Other values (71) | 138 | 69.0%

Most occurring characters

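Value | Count | Frequency (%)
(space) | 152 | 14.7%
(not preserved) | 58 | 5.6%
(not preserved) | 40 | 3.9%
(not preserved) | 32 | 3.1%
(not preserved) | 26 | 2.5%
(not preserved) | 25 | 2.4%
(not preserved) | 23 | 2.2%
(not preserved) | 20 | 1.9%
(not preserved) | 20 | 1.9%
(not preserved) | 20 | 1.9%
Other values (94) | 617 | 59.7%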

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 848 | 82.1%
Space Separator | 152 | 14.7%
Other Punctuation | 19 | 1.8%
Close Punctuation | 6 | 0.6%
Open Punctuation | 6 | 0.6%
Decimal Number | 2 | 0.2%

Most frequent character per category

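Other Letter
Value | Count | Frequency (%)
(not preserved) | 58 | 6.8%
(not preserved) | 40 | 4.7%
(not preserved) | 32 | 3.8%
(not preserved) | 26 | 3.1%
(not preserved) | 25 | 2.9%
(not preserved) | 23 | 2.7%
(not preserved) | 20 | 2.4%
(not preserved) | 20 | 2.4%
(not preserved) | 20 | 2.4%
(not preserved) | 19 | 2.2%
Other values (89) | 565 | 66.6%

Space Separator
Value | Count | Frequency (%)
(space) | 152 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 19 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 6 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 6 | 100.0%

Decimal Number
Value | Count | Frequency (%)
2 | 2 | 100.0%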

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 848 | 82.1%
Common | 185 | 17.9%

Most frequent character per script

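Hangul
Value | Count | Frequency (%)
(not preserved) | 58 | 6.8%
(not preserved) | 40 | 4.7%
(not preserved) | 32 | 3.8%
(not preserved) | 26 | 3.1%
(not preserved) | 25 | 2.9%
(not preserved) | 23 | 2.7%
(not preserved) | 20 | 2.4%
(not preserved) | 20 | 2.4%
(not preserved) | 20 | 2.4%
(not preserved) | 19 | 2.2%
Other values (89) | 565 | 66.6%

Common
Value | Count | Frequency (%)
(space) | 152 | 82.2%
. | 19 | 10.3%
) | 6 | 3.2%
( | 6 | 3.2%
2 | 2 | 1.1%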

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 848 | 82.1%
ASCII | 185 | 17.9%

Most frequent character per block

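ASCII
Value | Count | Frequency (%)
(space) | 152 | 82.2%
. | 19 | 10.3%
) | 6 | 3.2%
( | 6 | 3.2%
2 | 2 | 1.1%

Hangul
Value | Count | Frequency (%)
(not preserved) | 58 | 6.8%
(not preserved) | 40 | 4.7%
(not preserved) | 32 | 3.8%
(not preserved) | 26 | 3.1%
(not preserved) | 25 | 2.9%
(not preserved) | 23 | 2.7%
(not preserved) | 20 | 2.4%
(not preserved) | 20 | 2.4%
(not preserved) | 20 | 2.4%
(not preserved) | 19 | 2.2%
Other values (89) | 565 | 66.6%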

도서자료수(권)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 6778
Distinct (%): 68.0%
Missing: 31
Missing (%): 0.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 16952.136
Minimum: 0
Maximum: 60197
Zeros: 145
Zeros (%): 1.5%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 5662.8
Q1: 12270
Median: 16661
Q3: 21343
95th percentile: 28649.8
Maximum: 60197
Range: 60197
Interquartile range (IQR): 9073

Descriptive statistics

Standard deviation: 7075.3437
Coefficient of variation (CV): 0.41737181
Kurtosis: 0.96751728
Mean: 16952.136
Median Absolute Deviation (MAD): 4518
Skewness: 0.32098839
Sum: 1.6899584 × 10^8
Variance: 50060488
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Values by frequency
Value | Count | Frequency (%)
0 | 145 | 1.5%
12659 | 6 | 0.1%
13124 | 6 | 0.1%
13261 | 6 | 0.1%
16885 | 6 | 0.1%
18282 | 6 | 0.1%
9499 | 5 | 0.1%
5187 | 5 | 0.1%
16717 | 5 | 0.1%
16687 | 5 | 0.1%
Other values (6768) | 9774 | 97.7%
(Missing) | 31 | 0.3%

Smallest values
Value | Count | Frequency (%)
0 | 145 | 1.5%
100 | 1 | < 0.1%
222 | 1 | < 0.1%
300 | 1 | < 0.1%
512 | 1 | < 0.1%
564 | 1 | < 0.1%
753 | 1 | < 0.1%
833 | 1 | < 0.1%
876 | 1 | < 0.1%
939 | 1 | < 0.1%

Largest values
Value | Count | Frequency (%)
60197 | 1 | < 0.1%
59908 | 1 | < 0.1%
59391 | 2 | < 0.1%
58245 | 1 | < 0.1%
49013 | 1 | < 0.1%
46860 | 1 | < 0.1%
46162 | 1 | < 0.1%
45545 | 1 | < 0.1%
45182 | 1 | < 0.1%
45170 | 2 | < 0.1%
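
Several of the figures for 도서자료수(권) can be cross-checked against one another. The sketch below reads the report's Sum as 1.6899584 × 10^8 (an interpretation, which the mean check supports) and assumes the mean is taken over non-missing rows only; the variable names are illustrative.

```python
# Figures reported for 도서자료수(권) in this section.
n_observations = 10_000
n_missing = 31
reported_sum = 1.6899584e8   # interpreted as 1.6899584 × 10^8
reported_mean = 16952.136
reported_std = 7075.3437
q1, q3 = 12270, 21343

n_valid = n_observations - n_missing       # rows with a value
mean_from_sum = reported_sum / n_valid     # assumption: mean ignores missing rows
cv = reported_std / reported_mean          # coefficient of variation
iqr = q3 - q1                              # interquartile range

print(n_valid, round(mean_from_sum, 1), round(cv, 5), iqr)
```

The recomputed mean agrees with the reported one to within the rounding of the sum, and the CV and IQR match the reported 0.41737181 and 9073.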

비도서자료수(개)
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct: 1399
Distinct (%): 14.0%
Missing: 31
Missing (%): 0.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 431.37035
Minimum: 0
Maximum: 98206
Zeros: 596
Zeros (%): 6.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 119
Median: 298
Q3: 550
95th percentile: 1196.6
Maximum: 98206
Range: 98206
Interquartile range (IQR): 431

Descriptive statistics

Standard deviation: 1135.1213
Coefficient of variation (CV): 2.6314311
Kurtosis: 5549.1434
Mean: 431.37035
Median Absolute Deviation (MAD): 206
Skewness: 65.731139
Sum: 4300331
Variance: 1288500.5
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Values by frequency
Value | Count | Frequency (%)
0 | 596 | 6.0%
1 | 46 | 0.5%
3 | 40 | 0.4%
50 | 38 | 0.4%
10 | 37 | 0.4%
20 | 33 | 0.3%
81 | 30 | 0.3%
100 | 30 | 0.3%
77 | 30 | 0.3%
80 | 29 | 0.3%
Other values (1389) | 9060 | 90.6%
(Missing) | 31 | 0.3%

Smallest values
Value | Count | Frequency (%)
0 | 596 | 6.0%
1 | 46 | 0.5%
2 | 24 | 0.2%
3 | 40 | 0.4%
4 | 25 | 0.2%
5 | 12 | 0.1%
6 | 14 | 0.1%
7 | 22 | 0.2%
8 | 18 | 0.2%
9 | 16 | 0.2%

Largest values
Value | Count | Frequency (%)
98206 | 1 | < 0.1%
25304 | 1 | < 0.1%
10659 | 2 | < 0.1%
10080 | 1 | < 0.1%
8772 | 1 | < 0.1%
7235 | 1 | < 0.1%
6955 | 1 | < 0.1%
6768 | 1 | < 0.1%
6721 | 1 | < 0.1%
6395 | 1 | < 0.1%

자료수합계(권)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 6866
Distinct (%): 69.0%
Missing: 48
Missing (%): 0.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 17413.201
Minimum: 0
Maximum: 107734
Zeros: 128
Zeros (%): 1.3%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 5872.3
Q1: 12565
Median: 17170
Q3: 21883.25
95th percentile: 29401.6
Maximum: 107734
Range: 107734
Interquartile range (IQR): 9318.25

Descriptive statistics

Standard deviation: 7301.4353
Coefficient of variation (CV): 0.41930461
Kurtosis: 3.4412025
Mean: 17413.201
Median Absolute Deviation (MAD): 4649.5
Skewness: 0.53958276
Sum: 1.7329617 × 10^8
Variance: 53310958
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Values by frequency
Value | Count | Frequency (%)
0 | 128 | 1.3%
9506 | 6 | 0.1%
13328 | 6 | 0.1%
16679 | 6 | 0.1%
18102 | 6 | 0.1%
14341 | 6 | 0.1%
12450 | 5 | 0.1%
22370 | 5 | 0.1%
14740 | 5 | 0.1%
10243 | 5 | 0.1%
Other values (6856) | 9774 | 97.7%
(Missing) | 48 | 0.5%

Smallest values
Value | Count | Frequency (%)
0 | 128 | 1.3%
100 | 1 | < 0.1%
222 | 1 | < 0.1%
564 | 1 | < 0.1%
753 | 1 | < 0.1%
800 | 1 | < 0.1%
840 | 1 | < 0.1%
876 | 1 | < 0.1%
954 | 1 | < 0.1%
995 | 2 | < 0.1%

Largest values
Value | Count | Frequency (%)
107734 | 1 | < 0.1%
66676 | 1 | < 0.1%
64966 | 1 | < 0.1%
64395 | 2 | < 0.1%
62129 | 1 | < 0.1%
50684 | 1 | < 0.1%
49755 | 1 | < 0.1%
48934 | 2 | < 0.1%
47298 | 1 | < 0.1%
47119 | 1 | < 0.1%

전체학생수(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 1515
Distinct (%): 15.2%
Missing: 48
Missing (%): 0.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 648.16821
Minimum: 0
Maximum: 2243
Zeros: 2
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 61
Q1: 359
Median: 652
Q3: 914
95th percentile: 1271
Maximum: 2243
Range: 2243
Interquartile range (IQR): 555

Descriptive statistics

Standard deviation: 378.34629
Coefficient of variation (CV): 0.58371622
Kurtosis: -0.4769534
Mean: 648.16821
Median Absolute Deviation (MAD): 276
Skewness: 0.21800611
Sum: 6450570
Variance: 143145.92
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Values by frequency
Value | Count | Frequency (%)
65 | 24 | 0.2%
706 | 24 | 0.2%
50 | 22 | 0.2%
42 | 22 | 0.2%
948 | 21 | 0.2%
61 | 21 | 0.2%
51 | 21 | 0.2%
640 | 21 | 0.2%
71 | 20 | 0.2%
745 | 20 | 0.2%
Other values (1505) | 9736 | 97.4%
(Missing) | 48 | 0.5%

Smallest values
Value | Count | Frequency (%)
0 | 2 | < 0.1%
2 | 2 | < 0.1%
3 | 3 | < 0.1%
4 | 1 | < 0.1%
5 | 4 | < 0.1%
6 | 1 | < 0.1%
7 | 3 | < 0.1%
8 | 2 | < 0.1%
9 | 5 | 0.1%
10 | 7 | 0.1%

Largest values
Value | Count | Frequency (%)
2243 | 1 | < 0.1%
2193 | 1 | < 0.1%
2006 | 1 | < 0.1%
1958 | 1 | < 0.1%
1904 | 1 | < 0.1%
1849 | 1 | < 0.1%
1843 | 2 | < 0.1%
1842 | 2 | < 0.1%
1831 | 1 | < 0.1%
1830 | 1 | < 0.1%

1인당장서비율(%)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 879
Distinct (%): 8.8%
Missing: 31
Missing (%): 0.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 50.043425
Minimum: 0
Maximum: 2531
Zeros: 146
Zeros (%): 1.5%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5th percentile: 9
Q1: 19
Median: 29
Q3: 47
95th percentile: 183
Maximum: 2531
Range: 2531
Interquartile range (IQR): 28

Descriptive statistics

Standard deviation: 76.914462
Coefficient of variation (CV): 1.5369544
Kurtosis: 250.36436
Mean: 50.043425
Median Absolute Deviation (MAD): 12
Skewness: 10.6849
Sum: 498882.9
Variance: 5915.8345
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Values by frequency
Value | Count | Frequency (%)
23.0 | 270 | 2.7%
17.0 | 254 | 2.5%
19.0 | 251 | 2.5%
20.0 | 247 | 2.5%
18.0 | 240 | 2.4%
21.0 | 237 | 2.4%
22.0 | 232 | 2.3%
25.0 | 222 | 2.2%
16.0 | 214 | 2.1%
26.0 | 214 | 2.1%
Other values (869) | 7588 | 75.9%

Smallest values
Value | Count | Frequency (%)
0.0 | 146 | 1.5%
0.6 | 1 | < 0.1%
1.0 | 5 | 0.1%
2.0 | 9 | 0.1%
2.7 | 1 | < 0.1%
3.0 | 15 | 0.1%
3.5 | 1 | < 0.1%
4.0 | 25 | 0.2%
5.0 | 40 | 0.4%
6.0 | 56 | 0.6%

Largest values
Value | Count | Frequency (%)
2531.0 | 1 | < 0.1%
2255.0 | 1 | < 0.1%
2155.0 | 1 | < 0.1%
1013.0 | 2 | < 0.1%
991.0 | 2 | < 0.1%
741.4 | 1 | < 0.1%
734.1 | 1 | < 0.1%
723.0 | 1 | < 0.1%
701.0 | 2 | < 0.1%
683.0 | 1 | < 0.1%

Interactions

[Pairwise scatter plots of the six numeric variables; images not preserved.]

Correlations

Variable | 기준년도 | 시군명 | 지역교육청명 | 지역명 | 학교급명 | 설립구분명 | 제외여부 | 제외사유 | 도서자료수(권) | 비도서자료수(개) | 자료수합계(권) | 전체학생수(명) | 1인당장서비율(%)
기준년도 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.838 | 0.115 | 0.000 | 0.069 | 0.086 | 0.050
시군명 | 0.000 | 1.000 | 0.992 | 1.000 | 0.111 | 0.265 | 0.112 | 0.967 | 0.436 | 0.000 | 0.364 | 0.476 | 0.246
지역교육청명 | 0.000 | 0.992 | 1.000 | 0.994 | 0.770 | 0.936 | 0.090 | 0.976 | 0.425 | 0.039 | 0.375 | 0.497 | 0.272
지역명 | 0.000 | 1.000 | 0.994 | 1.000 | 0.204 | 0.363 | 0.183 | 0.966 | 0.477 | 0.073 | 0.410 | 0.499 | 0.263
학교급명 | 0.000 | 0.111 | 0.770 | 0.204 | 1.000 | 0.314 | 0.845 | 1.000 | 0.363 | 0.000 | 0.330 | 0.402 | 0.053
설립구분명 | 0.000 | 0.265 | 0.936 | 0.363 | 0.314 | 1.000 | 0.000 | 0.861 | 0.260 | 0.000 | 0.228 | 0.059 | 0.028
제외여부 | 0.000 | 0.112 | 0.090 | 0.183 | 0.845 | 0.000 | 1.000 | NaN | 0.220 | 0.000 | NaN | NaN | 0.000
제외사유 | 0.838 | 0.967 | 0.976 | 0.966 | 1.000 | 0.861 | NaN | 1.000 | NaN | NaN | NaN | NaN | NaN
도서자료수(권) | 0.115 | 0.436 | 0.425 | 0.477 | 0.363 | 0.260 | 0.220 | NaN | 1.000 | 0.012 | 0.898 | 0.503 | 0.091
비도서자료수(개) | 0.000 | 0.000 | 0.039 | 0.073 | 0.000 | 0.000 | 0.000 | NaN | 0.012 | 1.000 | 0.886 | 0.000 | 0.546
자료수합계(권) | 0.069 | 0.364 | 0.375 | 0.410 | 0.330 | 0.228 | NaN | NaN | 0.898 | 0.886 | 1.000 | 0.336 | 0.492
전체학생수(명) | 0.086 | 0.476 | 0.497 | 0.499 | 0.402 | 0.059 | NaN | NaN | 0.503 | 0.000 | 0.336 | 1.000 | 0.248
1인당장서비율(%) | 0.050 | 0.246 | 0.272 | 0.263 | 0.053 | 0.028 | 0.000 | NaN | 0.091 | 0.546 | 0.492 | 0.248 | 1.000

Variable | 시군명 | 지역명 | 제외여부 | 학교급명 | 지역교육청명 | 설립구분명
시군명 | 1.000 | 0.999 | 0.096 | 0.052 | 0.859 | 0.137
지역명 | 0.999 | 1.000 | 0.146 | 0.095 | 0.859 | 0.181
제외여부 | 0.096 | 0.146 | 1.000 | 0.963 | 0.077 | 0.000
학교급명 | 0.052 | 0.095 | 0.963 | 1.000 | 0.499 | 0.382
지역교육청명 | 0.859 | 0.859 | 0.077 | 0.499 | 1.000 | 0.746
설립구분명 | 0.137 | 0.181 | 0.000 | 0.382 | 0.746 | 1.000

Variable | 기준년도 | 도서자료수(권) | 비도서자료수(개) | 자료수합계(권) | 전체학생수(명) | 1인당장서비율(%) | 시군명 | 지역교육청명 | 지역명 | 학교급명 | 설립구분명 | 제외여부
기준년도 | 1.000 | 0.130 | -0.015 | 0.127 | -0.079 | 0.144 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.012
도서자료수(권) | 0.130 | 1.000 | 0.518 | 0.997 | 0.407 | 0.150 | 0.168 | 0.167 | 0.184 | 0.234 | 0.160 | 0.169
비도서자료수(개) | -0.015 | 0.518 | 1.000 | 0.556 | 0.227 | 0.103 | 0.000 | 0.021 | 0.037 | 0.000 | 0.000 | 0.000
자료수합계(권) | 0.127 | 0.997 | 0.556 | 1.000 | 0.408 | 0.146 | 0.150 | 0.159 | 0.169 | 0.222 | 0.148 | 1.000
전체학생수(명) | -0.079 | 0.407 | 0.227 | 0.408 | 1.000 | -0.763 | 0.188 | 0.203 | 0.195 | 0.264 | 0.035 | 1.000
1인당장서비율(%) | 0.144 | 0.150 | 0.103 | 0.146 | -0.763 | 1.000 | 0.104 | 0.116 | 0.102 | 0.050 | 0.019 | 0.000
시군명 | 0.000 | 0.168 | 0.000 | 0.150 | 0.188 | 0.104 | 1.000 | 0.859 | 0.999 | 0.052 | 0.137 | 0.096
지역교육청명 | 0.000 | 0.167 | 0.021 | 0.159 | 0.203 | 0.116 | 0.859 | 1.000 | 0.859 | 0.499 | 0.746 | 0.077
지역명 | 0.000 | 0.184 | 0.037 | 0.169 | 0.195 | 0.102 | 0.999 | 0.859 | 1.000 | 0.095 | 0.181 | 0.146
학교급명 | 0.000 | 0.234 | 0.000 | 0.222 | 0.264 | 0.050 | 0.052 | 0.499 | 0.095 | 1.000 | 0.382 | 0.963
설립구분명 | 0.000 | 0.160 | 0.000 | 0.148 | 0.035 | 0.019 | 0.137 | 0.746 | 0.181 | 0.382 | 1.000 | 0.000
제외여부 | 0.012 | 0.169 | 0.000 | 1.000 | 1.000 | 0.000 | 0.096 | 0.077 | 0.146 | 0.963 | 0.000 | 1.000
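
The High correlation alerts can be related back to these coefficients. The sketch below takes the six-variable matrix from this section and lists, for each variable, the others whose absolute coefficient reaches a cut-off. The cut-off of 0.5 is an assumption, not something the report states, although it is consistent with the alerts for these variables (for example, 0.499 between 학교급명 and 지역교육청명 is not flagged). The function name is illustrative.

```python
# Six-variable correlation matrix from this section (categorical and boolean variables).
names = ["시군명", "지역명", "제외여부", "학교급명", "지역교육청명", "설립구분명"]
matrix = [
    [1.000, 0.999, 0.096, 0.052, 0.859, 0.137],
    [0.999, 1.000, 0.146, 0.095, 0.859, 0.181],
    [0.096, 0.146, 1.000, 0.963, 0.077, 0.000],
    [0.052, 0.095, 0.963, 1.000, 0.499, 0.382],
    [0.859, 0.859, 0.077, 0.499, 1.000, 0.746],
    [0.137, 0.181, 0.000, 0.382, 0.746, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off; not stated in the report


def highly_correlated(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    result = {}
    for i, row in enumerate(matrix):
        result[names[i]] = [
            names[j]
            for j, value in enumerate(row)
            if j != i and abs(value) >= threshold
        ]
    return result


pairs = highly_correlated(names, matrix, THRESHOLD)
for name, partners in pairs.items():
    print(name, "->", partners)
```

Within this matrix 제외여부 pairs only with 학교급명; the two further fields named in its alert come from the matrix that also includes the numeric variables.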

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

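Index | 기준년도 | 시군명 | 지역교육청명 | 지역명 | 학교명 | 학교급명 | 설립구분명 | 제외여부 | 제외사유 | 도서자료수(권) | 비도서자료수(개) | 자료수합계(권) | 전체학생수(명) | 1인당장서비율(%)
415 | 2020 | 김포시 | 경기도김포교육지원청 | 경기도 김포시 | 푸른솔초등학교 | <NA> | 공립 | N | <NA> | 17450 | 88 | 17538 | 1349 | 13.0
4495 | 2019 | 수원시 | 경기도수원교육지원청 | 경기도 수원시 권선구 | 수원중촌초등학교 | <NA> | 공립 | N | <NA> | 17049 | 136 | 17185 | 839 | 20.0
21665 | 2015 | 가평군 | 경기도가평교육지원청 | 경기도 가평군 | 대성초등학교 | <NA> | 공립 | N | <NA> | 13295 | 59 | 13354 | 93 | 144.0
20000 | 2016 | 연천군 | 경기도연천교육지원청 | 경기도 연천군 | 군남중학교 | <NA> | 공립 | N | <NA> | 18119 | 604 | 18723 | 65 | 288.0
8181 | 2018 | 김포시 | 경기도김포교육지원청 | 경기도 김포시 | 나비초등학교 | <NA> | 공립 | N | <NA> | 5053 | 51 | 5104 | 923 | 6.0
14866 | 2017 | 안양시 | 경기도교육청 | 경기도 안양시 만안구 | 성문고등학교 | <NA> | 사립 | N | <NA> | 11981 | 596 | 12577 | 1148 | 11.0
25319 | 2015 | 의정부시 | 경기도의정부교육지원청 | 경기도 의정부시 | 의정부호동초등학교 | 초등학교 | 공립 | N | <NA> | 28997 | 1112 | 30109 | 1580 | 19.0
13763 | 2017 | 성남시 | 경기도교육청 | 경기도 성남시 분당구 | 계원예술고등학교 | <NA> | 사립 | N | <NA> | 9810 | 18 | 9828 | 970 | 10.0
21478 | 2016 | 화성시 | 경기도화성오산교육지원청 | 경기도 화성시 | 동탄중앙초등학교 | 초등학교 | 공립 | N | <NA> | 13794 | 107 | 13901 | 1219 | 11.0
19222 | 2016 | 안산시 | 경기도안산교육지원청 | 경기도 안산시 단원구 | 안산양지초등학교 | <NA> | 공립 | N | <NA> | 28183 | 1781 | 29964 | 1051 | 29.0

Index | 기준년도 | 시군명 | 지역교육청명 | 지역명 | 학교명 | 학교급명 | 설립구분명 | 제외여부 | 제외사유 | 도서자료수(권) | 비도서자료수(개) | 자료수합계(권) | 전체학생수(명) | 1인당장서비율(%)
11753 | 2018 | 포천시 | 경기도포천교육지원청 | 경기도 포천시 | 축석초등학교 | <NA> | 공립 | N | <NA> | 11750 | 254 | 12004 | 72 | 167.0
26232 | 2015 | 화성시 | 경기도화성오산교육지원청 | 경기도 화성시 | 한마음초등학교 | 초등학교 | 공립 | N | <NA> | 25722 | 77 | 25799 | 1168 | 22.0
20713 | 2016 | 이천시 | 경기도이천교육지원청 | 경기도 이천시 | 이천사동초등학교 | 초등학교 | 공립 | N | <NA> | 9512 | 82 | 9594 | 542 | 18.0
21029 | 2016 | 평택시 | 경기도평택교육지원청 | 경기도 평택시 | 오성중학교 | <NA> | 사립 | N | <NA> | 14631 | 1177 | 15808 | 174 | 91.0
6617 | 2019 | 파주시 | 경기도파주교육지원청 | 경기도 파주시 | 한가람중학교 | <NA> | 공립 | N | <NA> | 13307 | 465 | 13772 | 865 | 15.0
1519 | 2020 | 양평군 | 경기도교육청 | 경기도 양평군 | 용문고등학교 | <NA> | 사립 | N | <NA> | 15087 | 27 | 15114 | 425 | 35.6
5759 | 2019 | 오산시 | 경기도교육청 | 경기도 오산시 | 성호고등학교 | <NA> | 공립 | N | <NA> | 18671 | 851 | 19522 | 883 | 22.0
10828 | 2018 | 용인시 | 경기도용인교육지원청 | 경기도 용인시 처인구 | 용신중학교 | <NA> | 공립 | N | <NA> | 16705 | 743 | 17448 | 927 | 19.0
24582 | 2015 | 여주시 | 경기도여주교육지원청 | 경기도 여주시 | 천남초등학교 | <NA> | 공립 | N | <NA> | 9804 | 162 | 9966 | 78 | 128.0
2863 | 2019 | 고양시 | 경기도고양교육지원청 | 경기도 고양시 일산서구 | 장촌초등학교 | <NA> | 공립 | N | <NA> | 18864 | 124 | 18988 | 351 | 54.0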
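
In the sample rows, the numeric columns appear to be linked: 자료수합계(권) equals 도서자료수(권) plus 비도서자료수(개), and 1인당장서비율(%) is close to 자료수합계(권) divided by 전체학생수(명). The sketch below checks both relations on values read from three of the sample rows. The tolerance of 1.0 is an assumption, chosen because the rounding of the per-student figure appears to vary between rows; the function name is illustrative.

```python
# (books, non-books, total, students, per-student figure) read from three sample rows.
rows = {
    "푸른솔초등학교": (17450, 88, 17538, 1349, 13.0),
    "군남중학교": (18119, 604, 18723, 65, 288.0),
    "용문고등학교": (15087, 27, 15114, 425, 35.6),
}


def check_row(books, non_books, total, students, per_student, tolerance=1.0):
    """Return (sum_ok, ratio_ok) for one row.

    sum_ok:   total equals books + non_books.
    ratio_ok: per_student is within `tolerance` of total / students (assumed relation).
    """
    sum_ok = books + non_books == total
    ratio_ok = abs(total / students - per_student) < tolerance
    return sum_ok, ratio_ok


results = {name: check_row(*values) for name, values in rows.items()}
for name, outcome in results.items():
    print(name, outcome)
```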

Duplicate rows

Most frequently occurring

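Index | 기준년도 | 시군명 | 지역교육청명 | 지역명 | 학교명 | 학교급명 | 설립구분명 | 제외여부 | 제외사유 | 도서자료수(권) | 비도서자료수(개) | 자료수합계(권) | 전체학생수(명) | 1인당장서비율(%) | # duplicates
0 | 2019 | 가평군 | 경기도가평교육지원청 | 경기도 가평군 | 목동초등학교명지분교장 | <NA> | 공립 | N | <NA> | 2110 | 0 | 2110 | 14 | 150.0 | 2
1 | 2019 | 가평군 | 경기도가평교육지원청 | 경기도 가평군 | 상색초등학교 | <NA> | 공립 | N | <NA> | 10228 | 56 | 10284 | 39 | 263.0 | 2
2 | 2019 | 가평군 | 경기도가평교육지원청 | 경기도 가평군 | 조종중학교 | <NA> | 공립 | N | <NA> | 15666 | 612 | 16278 | 205 | 79.0 | 2
3 | 2019 | 가평군 | 경기도교육청 | 경기도 가평군 | 청평고등학교 | <NA> | 공립 | N | <NA> | 9688 | 147 | 9835 | 254 | 38.0 | 2
4 | 2019 | 고양시 | 경기도고양교육지원청 | 경기도 고양시 덕양구 | 고양화수초등학교 | <NA> | 공립 | N | <NA> | 31575 | 485 | 32060 | 843 | 38.0 | 2
5 | 2019 | 고양시 | 경기도고양교육지원청 | 경기도 고양시 덕양구 | 서정초등학교 | <NA> | 공립 | N | <NA> | 18774 | 296 | 19070 | 571 | 33.0 | 2
6 | 2019 | 고양시 | 경기도고양교육지원청 | 경기도 고양시 덕양구 | 성라초등학교 | <NA> | 공립 | N | <NA> | 23275 | 496 | 23771 | 592 | 40.0 | 2
7 | 2019 | 고양시 | 경기도고양교육지원청 | 경기도 고양시 덕양구 | 신능초등학교 | <NA> | 공립 | N | <NA> | 25608 | 425 | 26033 | 295 | 88.0 | 2
8 | 2019 | 고양시 | 경기도고양교육지원청 | 경기도 고양시 덕양구 | 티엘비유글로벌학교(중) | <NA> | 사립 | N | <NA> | 0 | 0 | 0 | 41 | 0.0 | 2
9 | 2019 | 고양시 | 경기도고양교육지원청 | 경기도 고양시 덕양구 | 티엘비유글로벌학교(초) | <NA> | 사립 | N | <NA> | 0 | 0 | 0 | 11 | 0.0 | 2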
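
A "# duplicates" figure of this kind can be obtained by counting identical records. The sketch below uses a few illustrative records that keep only three of the columns, and it reports both the group sizes and the number of surplus copies. The report does not say which of these conventions its total of 348 follows, so neither is claimed here.

```python
from collections import Counter

# Illustrative records only: (기준년도, 학교명, 자료수합계). The first two repeat one
# row of the duplicate table; the third is unique.
records = [
    (2019, "상색초등학교", 10284),
    (2019, "상색초등학교", 10284),
    (2019, "조종중학교", 16278),
]

counts = Counter(records)

# Records that occur more than once, with their group size.
duplicated = {row: n for row, n in counts.items() if n > 1}

# Copies beyond the first occurrence of each duplicated record.
n_surplus_rows = sum(n - 1 for n in duplicated.values())

print(duplicated, n_surplus_rows)
```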