Overview

Dataset statistics

Number of variables16
Number of observations10000
Missing cells10030
Missing cells (%)6.3%
Duplicate rows358
Duplicate rows (%)3.6%
Total size in memory1.4 MiB
Average record size in memory144.0 B

Variable types

Numeric4
Categorical9
Text2
Boolean1

Dataset

Description학교도서관 현황(운영현황)
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=OSL02MD6FXIBGCJGYHEM23487593&infSeq=2

Alerts

Dataset has 358 (3.6%) duplicate rowsDuplicates
학교급명 is highly overall correlated with 제외여부High correlation
도서관직원수(명) is highly overall correlated with 사서자격증보유수(개) and 1 other fieldsHigh correlation
시군명 is highly overall correlated with 지역교육청명 and 1 other fieldsHigh correlation
제외여부 is highly overall correlated with 학교급명High correlation
사서자격증보유수(개) is highly overall correlated with 도서관직원수(명)High correlation
사서자격증미보유수(개) is highly overall correlated with 도서관직원수(명)High correlation
지역명 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
자료구입비예산액(원) is highly overall correlated with 운영비예산액(원)High correlation
운영비예산액(원) is highly overall correlated with 자료구입비예산액(원)High correlation
지역교육청명 is highly overall correlated with 시군명 and 2 other fieldsHigh correlation
설립구분명 is highly overall correlated with 지역교육청명High correlation
설립구분명 is highly imbalanced (70.9%)Imbalance
제외여부 is highly imbalanced (96.3%)Imbalance
도서관수(개) is highly imbalanced (92.9%)Imbalance
도서관직원수(명) is highly imbalanced (56.1%)Imbalance
제외사유 has 9961 (99.6%) missing valuesMissing
도서관(실)총좌석수(석) has 228 (2.3%) zerosZeros
운영비예산액(원) has 133 (1.3%) zerosZeros

Reproduction

Analysis started2023-12-10 21:47:36.776487
Analysis finished2023-12-10 21:47:40.277321
Duration3.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년도
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2017.2952
Minimum2015
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:47:40.321984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2015
5-th percentile2015
Q12016
median2017
Q32019
95-th percentile2020
Maximum2020
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.6008477
Coefficient of variation (CV)0.00079356143
Kurtosis-1.1705563
Mean2017.2952
Median Absolute Deviation (MAD)1
Skewness0.068603777
Sum20172952
Variance2.5627132
MonotonicityNot monotonic
2023-12-11T06:47:40.424358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2019 1857
18.6%
2017 1820
18.2%
2016 1815
18.1%
2018 1808
18.1%
2015 1771
17.7%
2020 929
9.3%
ValueCountFrequency (%)
2015 1771
17.7%
2016 1815
18.1%
2017 1820
18.2%
2018 1808
18.1%
2019 1857
18.6%
2020 929
9.3%
ValueCountFrequency (%)
2020 929
9.3%
2019 1857
18.6%
2018 1808
18.1%
2017 1820
18.2%
2016 1815
18.1%
2015 1771
17.7%

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수원시
874 
용인시
783 
고양시
664 
성남시
653 
화성시
 
631
Other values (26)
6395 

Length

Max length4
Median length3
Mean length3.0898
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안산시
2nd row포천시
3rd row광명시
4th row안산시
5th row화성시

Common Values

ValueCountFrequency (%)
수원시 874
 
8.7%
용인시 783
 
7.8%
고양시 664
 
6.6%
성남시 653
 
6.5%
화성시 631
 
6.3%
부천시 509
 
5.1%
남양주시 491
 
4.9%
안산시 447
 
4.5%
평택시 436
 
4.4%
파주시 421
 
4.2%
Other values (21) 4091
40.9%

Length

2023-12-11T06:47:40.539507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수원시 874
 
8.7%
용인시 783
 
7.8%
고양시 664
 
6.6%
성남시 653
 
6.5%
화성시 631
 
6.3%
부천시 509
 
5.1%
남양주시 491
 
4.9%
안산시 447
 
4.5%
평택시 436
 
4.4%
파주시 421
 
4.2%
Other values (21) 4091
40.9%

지역교육청명
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도교육청
2021 
경기도수원교육지원청
681 
경기도화성오산교육지원청
657 
경기도용인교육지원청
656 
경기도고양교육지원청
 
516
Other values (22)
5469 

Length

Max length13
Median length10
Mean length9.7232
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도안산교육지원청
2nd row경기도포천교육지원청
3rd row경기도광명교육지원청
4th row경기도안산교육지원청
5th row경기도화성오산교육지원청

Common Values

ValueCountFrequency (%)
경기도교육청 2021
20.2%
경기도수원교육지원청 681
 
6.8%
경기도화성오산교육지원청 657
 
6.6%
경기도용인교육지원청 656
 
6.6%
경기도고양교육지원청 516
 
5.2%
경기도성남교육지원청 500
 
5.0%
경기도구리남양주교육지원청 488
 
4.9%
경기도부천교육지원청 404
 
4.0%
경기도안산교육지원청 349
 
3.5%
경기도평택교육지원청 338
 
3.4%
Other values (17) 3390
33.9%

Length

2023-12-11T06:47:40.649557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도교육청 2021
20.2%
경기도수원교육지원청 681
 
6.8%
경기도화성오산교육지원청 657
 
6.6%
경기도용인교육지원청 656
 
6.6%
경기도고양교육지원청 516
 
5.2%
경기도성남교육지원청 500
 
5.0%
경기도구리남양주교육지원청 488
 
4.9%
경기도부천교육지원청 404
 
4.0%
경기도안산교육지원청 349
 
3.5%
경기도평택교육지원청 338
 
3.4%
Other values (17) 3390
33.9%

지역명
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도 화성시
 
631
경기도 부천시
 
509
경기도 남양주시
 
491
경기도 평택시
 
436
경기도 파주시
 
421
Other values (37)
7512 

Length

Max length12
Median length7
Mean length8.6365
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 안산시 단원구
2nd row경기도 포천시
3rd row경기도 광명시
4th row경기도 안산시 단원구
5th row경기도 화성시

Common Values

ValueCountFrequency (%)
경기도 화성시 631
 
6.3%
경기도 부천시 509
 
5.1%
경기도 남양주시 491
 
4.9%
경기도 평택시 436
 
4.4%
경기도 파주시 421
 
4.2%
경기도 성남시 분당구 378
 
3.8%
경기도 시흥시 359
 
3.6%
경기도 김포시 320
 
3.2%
경기도 의정부시 312
 
3.1%
경기도 용인시 기흥구 298
 
3.0%
Other values (32) 5845
58.5%

Length

2023-12-11T06:47:40.759237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 10000
42.1%
수원시 874
 
3.7%
용인시 783
 
3.3%
고양시 664
 
2.8%
성남시 653
 
2.7%
화성시 631
 
2.7%
부천시 509
 
2.1%
남양주시 491
 
2.1%
안산시 447
 
1.9%
평택시 436
 
1.8%
Other values (39) 8282
34.8%
Distinct2471
Distinct (%)24.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T06:47:40.962602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length6
Mean length6.2706
Min length4

Characters and Unicode

Total characters62706
Distinct characters343
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique161 ?
Unique (%)1.6%

Sample

1st row대동초등학교
2nd row중리초등학교
3rd row하안북중학교
4th row안산대월초등학교
5th row화성장안초등학교
ValueCountFrequency (%)
삼성초등학교 11
 
0.1%
성남금융고등학교 11
 
0.1%
오산초등학교 10
 
0.1%
원일초등학교 10
 
0.1%
팔곡초등학교 9
 
0.1%
계남초등학교 9
 
0.1%
석호중학교 9
 
0.1%
장명초등학교 9
 
0.1%
도장중학교 9
 
0.1%
청학고등학교 9
 
0.1%
Other values (2462) 9907
99.0%
2023-12-11T06:47:41.349793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10225
16.3%
10143
16.2%
7277
 
11.6%
5333
 
8.5%
2914
 
4.6%
2200
 
3.5%
646
 
1.0%
640
 
1.0%
625
 
1.0%
605
 
1.0%
Other values (333) 22098
35.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62637
99.9%
Lowercase Letter 43
 
0.1%
Uppercase Letter 12
 
< 0.1%
Open Punctuation 5
 
< 0.1%
Close Punctuation 5
 
< 0.1%
Space Separator 3
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10225
16.3%
10143
16.2%
7277
 
11.6%
5333
 
8.5%
2914
 
4.7%
2200
 
3.5%
646
 
1.0%
640
 
1.0%
625
 
1.0%
605
 
1.0%
Other values (317) 22029
35.2%
Lowercase Letter
ValueCountFrequency (%)
s 12
27.9%
e 7
16.3%
n 6
14.0%
i 6
14.0%
h 3
 
7.0%
g 3
 
7.0%
l 3
 
7.0%
u 3
 
7.0%
Uppercase Letter
ValueCountFrequency (%)
I 3
25.0%
T 3
25.0%
E 3
25.0%
B 3
25.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62637
99.9%
Latin 55
 
0.1%
Common 14
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10225
16.3%
10143
16.2%
7277
 
11.6%
5333
 
8.5%
2914
 
4.7%
2200
 
3.5%
646
 
1.0%
640
 
1.0%
625
 
1.0%
605
 
1.0%
Other values (317) 22029
35.2%
Latin
ValueCountFrequency (%)
s 12
21.8%
e 7
12.7%
n 6
10.9%
i 6
10.9%
h 3
 
5.5%
I 3
 
5.5%
T 3
 
5.5%
g 3
 
5.5%
l 3
 
5.5%
E 3
 
5.5%
Other values (2) 6
10.9%
Common
ValueCountFrequency (%)
( 5
35.7%
) 5
35.7%
3
21.4%
1 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62637
99.9%
ASCII 69
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10225
16.3%
10143
16.2%
7277
 
11.6%
5333
 
8.5%
2914
 
4.7%
2200
 
3.5%
646
 
1.0%
640
 
1.0%
625
 
1.0%
605
 
1.0%
Other values (317) 22029
35.2%
ASCII
ValueCountFrequency (%)
s 12
17.4%
e 7
10.1%
n 6
 
8.7%
i 6
 
8.7%
( 5
 
7.2%
) 5
 
7.2%
h 3
 
4.3%
I 3
 
4.3%
T 3
 
4.3%
g 3
 
4.3%
Other values (6) 16
23.2%

학교급명
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
6362 
초등학교
1900 
중학교
1014 
고등학교
711 
방통중
 
8

Length

Max length4
Median length4
Mean length3.8973
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row초등학교
2nd row초등학교
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 6362
63.6%
초등학교 1900
 
19.0%
중학교 1014
 
10.1%
고등학교 711
 
7.1%
방통중 8
 
0.1%
방통고 5
 
0.1%

Length

2023-12-11T06:47:41.467174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:47:41.565469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 6362
63.6%
초등학교 1900
 
19.0%
중학교 1014
 
10.1%
고등학교 711
 
7.1%
방통중 8
 
0.1%
방통고 5
 
< 0.1%

설립구분명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공립
9033 
사립
964 
국립
 
3

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공립
2nd row공립
3rd row공립
4th row공립
5th row공립

Common Values

ValueCountFrequency (%)
공립 9033
90.3%
사립 964
 
9.6%
국립 3
 
< 0.1%

Length

2023-12-11T06:47:41.667638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:47:41.963531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공립 9033
90.3%
사립 964
 
9.6%
국립 3
 
< 0.1%

제외여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
9961 
True
 
39
ValueCountFrequency (%)
False 9961
99.6%
True 39
 
0.4%
2023-12-11T06:47:42.039109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

제외사유
Text

MISSING 

Distinct25
Distinct (%)64.1%
Missing9961
Missing (%)99.6%
Memory size156.2 KiB
2023-12-11T06:47:42.225554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length29
Mean length20.794872
Min length3

Characters and Unicode

Total characters811
Distinct characters100
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)35.9%

Sample

1st row본교 삼평중학교에 따름
2nd row방통중 출석수업은 일요일 도서관 운영에 한계가 있어 제외처리함.
3rd row운영하지 않음
4th row방송통신중학교 도서관을 별도로 운영하지 않으므로 제외처리 함.
5th row방통중 출석수업은 일요일 도서관 운영에 한계가 있어 제외처리함.
ValueCountFrequency (%)
학교도서관 6
 
3.9%
운영하지 6
 
3.9%
미설치 5
 
3.2%
않음 4
 
2.6%
따름 4
 
2.6%
삼평중학교에 4
 
2.6%
제외함 4
 
2.6%
본교에서 4
 
2.6%
학교도서관을 4
 
2.6%
운영에 3
 
1.9%
Other values (63) 111
71.6%
2023-12-11T06:47:42.582310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
119
 
14.7%
53
 
6.5%
37
 
4.6%
26
 
3.2%
20
 
2.5%
20
 
2.5%
18
 
2.2%
17
 
2.1%
15
 
1.8%
15
 
1.8%
Other values (90) 471
58.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 664
81.9%
Space Separator 119
 
14.7%
Other Punctuation 15
 
1.8%
Open Punctuation 6
 
0.7%
Close Punctuation 6
 
0.7%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
8.0%
37
 
5.6%
26
 
3.9%
20
 
3.0%
20
 
3.0%
18
 
2.7%
17
 
2.6%
15
 
2.3%
15
 
2.3%
15
 
2.3%
Other values (85) 428
64.5%
Space Separator
ValueCountFrequency (%)
119
100.0%
Other Punctuation
ValueCountFrequency (%)
. 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 664
81.9%
Common 147
 
18.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
8.0%
37
 
5.6%
26
 
3.9%
20
 
3.0%
20
 
3.0%
18
 
2.7%
17
 
2.6%
15
 
2.3%
15
 
2.3%
15
 
2.3%
Other values (85) 428
64.5%
Common
ValueCountFrequency (%)
119
81.0%
. 15
 
10.2%
( 6
 
4.1%
) 6
 
4.1%
2 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 664
81.9%
ASCII 147
 
18.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
119
81.0%
. 15
 
10.2%
( 6
 
4.1%
) 6
 
4.1%
2 1
 
0.7%
Hangul
ValueCountFrequency (%)
53
 
8.0%
37
 
5.6%
26
 
3.9%
20
 
3.0%
20
 
3.0%
18
 
2.7%
17
 
2.6%
15
 
2.3%
15
 
2.3%
15
 
2.3%
Other values (85) 428
64.5%

도서관수(개)
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
9797 
0
 
153
2
 
26
<NA>
 
22
3
 
2

Length

Max length4
Median length1
Mean length1.0066
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 9797
98.0%
0 153
 
1.5%
2 26
 
0.3%
<NA> 22
 
0.2%
3 2
 
< 0.1%

Length

2023-12-11T06:47:42.743638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:47:42.850752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 9797
98.0%
0 153
 
1.5%
2 26
 
0.3%
na 22
 
0.2%
3 2
 
< 0.1%

도서관직원수(명)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
7788 
1
1193 
<NA>
951 
2
 
65
3
 
3

Length

Max length4
Median length1
Mean length1.2853
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row<NA>
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 7788
77.9%
1 1193
 
11.9%
<NA> 951
 
9.5%
2 65
 
0.7%
3 3
 
< 0.1%

Length

2023-12-11T06:47:42.946214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:47:43.057550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 7788
77.9%
1 1193
 
11.9%
na 951
 
9.5%
2 65
 
0.7%
3 3
 
< 0.1%

도서관(실)총좌석수(석)
Real number (ℝ)

ZEROS 

Distinct184
Distinct (%)1.8%
Missing22
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean59.757968
Minimum0
Maximum592
Zeros228
Zeros (%)2.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:47:43.182657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile20
Q140
median54
Q374
95-th percentile110
Maximum592
Range592
Interquartile range (IQR)34

Descriptive statistics

Standard deviation35.739567
Coefficient of variation (CV)0.598072
Kurtosis43.433312
Mean59.757968
Median Absolute Deviation (MAD)16
Skewness4.2103165
Sum596265
Variance1277.3167
MonotonicityNot monotonic
2023-12-11T06:47:43.327437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
36 652
 
6.5%
60 584
 
5.8%
50 544
 
5.4%
40 512
 
5.1%
80 420
 
4.2%
48 338
 
3.4%
30 320
 
3.2%
70 292
 
2.9%
100 246
 
2.5%
0 228
 
2.3%
Other values (174) 5842
58.4%
ValueCountFrequency (%)
0 228
2.3%
3 3
 
< 0.1%
4 6
 
0.1%
5 2
 
< 0.1%
6 18
 
0.2%
8 10
 
0.1%
9 2
 
< 0.1%
10 12
 
0.1%
11 3
 
< 0.1%
12 30
 
0.3%
ValueCountFrequency (%)
592 6
0.1%
465 3
< 0.1%
380 3
< 0.1%
375 3
< 0.1%
373 1
 
< 0.1%
356 3
< 0.1%
298 3
< 0.1%
282 7
0.1%
278 5
0.1%
276 3
< 0.1%

사서자격증보유수(개)
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
5497 
0
2690 
<NA>
1788 
2
 
25

Length

Max length4
Median length1
Mean length1.5364
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row0
5th row1

Common Values

ValueCountFrequency (%)
1 5497
55.0%
0 2690
26.9%
<NA> 1788
 
17.9%
2 25
 
0.2%

Length

2023-12-11T06:47:43.451156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:47:43.539980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 5497
55.0%
0 2690
26.9%
na 1788
 
17.9%
2 25
 
0.2%

사서자격증미보유수(개)
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
7230 
<NA>
1788 
1
975 
2
 
7

Length

Max length4
Median length1
Mean length1.5364
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 7230
72.3%
<NA> 1788
 
17.9%
1 975
 
9.8%
2 7
 
0.1%

Length

2023-12-11T06:47:43.653852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:47:43.757299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 7230
72.3%
na 1788
 
17.9%
1 975
 
9.8%
2 7
 
0.1%

자료구입비예산액(원)
Real number (ℝ)

HIGH CORRELATION 

Distinct2618
Distinct (%)26.2%
Missing22
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean11235668
Minimum0
Maximum80000000
Zeros99
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:47:43.865937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4000000
Q18100000
median11100000
Q314000000
95-th percentile18879000
Maximum80000000
Range80000000
Interquartile range (IQR)5900000

Descriptive statistics

Standard deviation4702333.2
Coefficient of variation (CV)0.41851836
Kurtosis7.2523443
Mean11235668
Median Absolute Deviation (MAD)2900000
Skewness0.8796057
Sum1.1210949 × 1011
Variance2.2111937 × 1013
MonotonicityNot monotonic
2023-12-11T06:47:44.001926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12000000 482
 
4.8%
10000000 392
 
3.9%
15000000 241
 
2.4%
6000000 205
 
2.1%
8000000 197
 
2.0%
9000000 187
 
1.9%
14000000 184
 
1.8%
11000000 165
 
1.7%
13000000 162
 
1.6%
7000000 157
 
1.6%
Other values (2608) 7606
76.1%
ValueCountFrequency (%)
0 99
1.0%
600 1
 
< 0.1%
1300 1
 
< 0.1%
10000 1
 
< 0.1%
11000 1
 
< 0.1%
14000 1
 
< 0.1%
14260 1
 
< 0.1%
300000 1
 
< 0.1%
334000 1
 
< 0.1%
500000 1
 
< 0.1%
ValueCountFrequency (%)
80000000 1
 
< 0.1%
60000000 1
 
< 0.1%
43900000 1
 
< 0.1%
40000000 4
< 0.1%
36991960 1
 
< 0.1%
35101000 1
 
< 0.1%
34500000 2
< 0.1%
34125000 1
 
< 0.1%
33300000 2
< 0.1%
32570000 1
 
< 0.1%

운영비예산액(원)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct2691
Distinct (%)27.0%
Missing25
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean4040732.2
Minimum0
Maximum52600000
Zeros133
Zeros (%)1.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:47:44.141208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile700000
Q12500000
median3750000
Q35000000
95-th percentile8050000
Maximum52600000
Range52600000
Interquartile range (IQR)2500000

Descriptive statistics

Standard deviation2787762.8
Coefficient of variation (CV)0.68991528
Kurtosis44.222679
Mean4040732.2
Median Absolute Deviation (MAD)1250000
Skewness4.2583695
Sum4.0306303 × 1010
Variance7.7716217 × 1012
MonotonicityNot monotonic
2023-12-11T06:47:44.265343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4000000 193
 
1.9%
1000000 175
 
1.8%
3000000 173
 
1.7%
2000000 168
 
1.7%
0 133
 
1.3%
5000000 115
 
1.1%
3500000 112
 
1.1%
2500000 112
 
1.1%
1500000 102
 
1.0%
4500000 98
 
1.0%
Other values (2681) 8594
85.9%
ValueCountFrequency (%)
0 133
1.3%
250 1
 
< 0.1%
2071 1
 
< 0.1%
2870 1
 
< 0.1%
3672 1
 
< 0.1%
6006 1
 
< 0.1%
50000 2
 
< 0.1%
58790 2
 
< 0.1%
60000 1
 
< 0.1%
72000 1
 
< 0.1%
ValueCountFrequency (%)
52600000 1
< 0.1%
48620000 1
< 0.1%
45253010 2
< 0.1%
38872000 1
< 0.1%
38486000 1
< 0.1%
38150000 1
< 0.1%
36646000 1
< 0.1%
31840000 1
< 0.1%
31800000 1
< 0.1%
27051600 2
< 0.1%

Interactions

2023-12-11T06:47:39.310373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:38.317396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:38.626571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:38.980691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:39.392572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:38.395514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:38.697651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:39.058033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:39.522401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:38.472701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:38.790855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:39.138969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:39.656724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:38.554926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:38.894230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:47:39.229180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:47:44.363591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도시군명지역교육청명지역명학교급명설립구분명제외여부제외사유도서관수(개)도서관직원수(명)도서관(실)총좌석수(석)사서자격증보유수(개)사서자격증미보유수(개)자료구입비예산액(원)운영비예산액(원)
기준년도1.0000.0000.0000.0000.0000.0160.0030.7890.025NaN0.0280.0800.0920.1690.263
시군명0.0001.0000.9921.0000.1110.2560.1021.0000.1380.0720.3010.3460.1950.3540.151
지역교육청명0.0000.9921.0000.9940.7710.9360.0711.0000.1330.0300.2780.3750.2040.3490.140
지역명0.0001.0000.9941.0000.2120.3620.1671.0000.2140.0940.3770.3880.2720.3740.193
학교급명0.0000.1110.7710.2121.0000.2961.0001.0000.1820.0380.2210.3310.2350.2330.061
설립구분명0.0160.2560.9360.3620.2961.0000.0001.0000.1600.0370.3030.4310.2270.1160.040
제외여부0.0030.1020.0710.1671.0000.0001.000NaN0.4330.0000.0230.0380.0040.0890.000
제외사유0.7891.0001.0001.0001.0001.000NaN1.0001.000NaNNaNNaNNaNNaNNaN
도서관수(개)0.0250.1380.1330.2140.1820.1600.4331.0001.0000.0420.1010.1660.0630.1960.032
도서관직원수(명)NaN0.0720.0300.0940.0380.0370.000NaN0.0421.0000.035NaNNaN0.0360.092
도서관(실)총좌석수(석)0.0280.3010.2780.3770.2210.3030.023NaN0.1010.0351.0000.2880.0990.2020.066
사서자격증보유수(개)0.0800.3460.3750.3880.3310.4310.038NaN0.166NaN0.2881.0000.5450.3250.148
사서자격증미보유수(개)0.0920.1950.2040.2720.2350.2270.004NaN0.063NaN0.0990.5451.0000.0800.034
자료구입비예산액(원)0.1690.3540.3490.3740.2330.1160.089NaN0.1960.0360.2020.3250.0801.0000.611
운영비예산액(원)0.2630.1510.1400.1930.0610.0400.000NaN0.0320.0920.0660.1480.0340.6111.000
2023-12-11T06:47:44.509679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학교급명지역교육청명도서관직원수(명)시군명제외여부사서자격증보유수(개)사서자격증미보유수(개)도서관수(개)지역명설립구분명
학교급명1.0000.5000.0360.0521.0000.1140.0750.0560.0990.361
지역교육청명0.5001.0000.0160.8620.0610.1880.0950.0700.8620.745
도서관직원수(명)0.0360.0161.0000.0370.0001.0001.0000.0170.0470.035
시군명0.0520.8620.0371.0000.0870.1860.0990.0720.9990.132
제외여부1.0000.0610.0000.0871.0000.0630.0060.2910.1330.000
사서자격증보유수(개)0.1140.1881.0000.1860.0631.0000.2320.1570.1960.163
사서자격증미보유수(개)0.0750.0951.0000.0990.0060.2321.0000.0600.1300.072
도서관수(개)0.0560.0700.0170.0720.2910.1570.0601.0000.1090.151
지역명0.0990.8620.0470.9990.1330.1960.1300.1091.0000.181
설립구분명0.3610.7450.0350.1320.0000.1630.0720.1510.1811.000
2023-12-11T06:47:44.630025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도도서관(실)총좌석수(석)자료구입비예산액(원)운영비예산액(원)시군명지역교육청명지역명학교급명설립구분명제외여부도서관수(개)도서관직원수(명)사서자격증보유수(개)사서자격증미보유수(개)
기준년도1.000-0.0570.2480.2770.0000.0000.0000.0000.0110.0000.0200.4710.0600.069
도서관(실)총좌석수(석)-0.0571.0000.3510.2880.1160.0970.1450.1430.1400.0230.0650.0230.1320.043
자료구입비예산액(원)0.2480.3511.0000.6850.1450.1470.1520.0990.0730.0670.0890.0250.2190.050
운영비예산액(원)0.2770.2880.6851.0000.0530.0510.0680.0400.0240.0000.0190.0590.0890.020
시군명0.0000.1160.1450.0531.0000.8620.9990.0520.1320.0870.0720.0370.1860.099
지역교육청명0.0000.0970.1470.0510.8621.0000.8620.5000.7450.0610.0700.0160.1880.095
지역명0.0000.1450.1520.0680.9990.8621.0000.0990.1810.1330.1090.0470.1960.130
학교급명0.0000.1430.0990.0400.0520.5000.0991.0000.3611.0000.0560.0360.1140.075
설립구분명0.0110.1400.0730.0240.1320.7450.1810.3611.0000.0000.1510.0350.1630.072
제외여부0.0000.0230.0670.0000.0870.0610.1331.0000.0001.0000.2910.0000.0630.006
도서관수(개)0.0200.0650.0890.0190.0720.0700.1090.0560.1510.2911.0000.0170.1570.060
도서관직원수(명)0.4710.0230.0250.0590.0370.0160.0470.0360.0350.0000.0171.0001.0001.000
사서자격증보유수(개)0.0600.1320.2190.0890.1860.1880.1960.1140.1630.0630.1571.0001.0000.232
사서자격증미보유수(개)0.0690.0430.0500.0200.0990.0950.1300.0750.0720.0060.0601.0000.2321.000

Missing values

2023-12-11T06:47:39.779432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:47:40.000024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T06:47:40.167524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유도서관수(개)도서관직원수(명)도서관(실)총좌석수(석)사서자격증보유수(개)사서자격증미보유수(개)자료구입비예산액(원)운영비예산액(원)
191942016안산시경기도안산교육지원청경기도 안산시 단원구대동초등학교초등학교공립N<NA>10240069000001300000
211822016포천시경기도포천교육지원청경기도 포천시중리초등학교초등학교공립N<NA>10200053500001784000
2152020광명시경기도광명교육지원청경기도 광명시하안북중학교<NA>공립N<NA>1<NA>4910180000008400000
50692019안산시경기도안산교육지원청경기도 안산시 단원구안산대월초등학교<NA>공립N<NA>104100188000005100000
120682018화성시경기도화성오산교육지원청경기도 화성시화성장안초등학교<NA>공립N<NA>105410134530004486000
134822017부천시경기도부천교육지원청경기도 부천시부천동초등학교초등학교공립N<NA>104611101000003376000
105622018오산시경기도화성오산교육지원청경기도 오산시수청초등학교초등학교공립N<NA>105610113000003840000
112932018이천시경기도교육청경기도 이천시이천양정여자고등학교<NA>사립N<NA>107210125000004300000
119142018화성시경기도화성오산교육지원청경기도 화성시화성반월초등학교초등학교공립N<NA>105510186000006200000
155412017용인시경기도용인교육지원청경기도 용인시 기흥구마북초등학교<NA>공립N<NA>105210117800004220000
기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유도서관수(개)도서관직원수(명)도서관(실)총좌석수(석)사서자격증보유수(개)사서자격증미보유수(개)자료구입비예산액(원)운영비예산액(원)
210252016평택시경기도평택교육지원청경기도 평택시오성중학교<NA>사립N<NA>10500065000001500000
124252017고양시경기도고양교육지원청경기도 고양시 일산동구하늘초등학교<NA>공립N<NA>101061094065203212000
198252016양평군경기도양평교육지원청경기도 양평군강하중학교<NA>공립N<NA>10300063000002900000
260812015화성시경기도화성오산교육지원청경기도 화성시기안초등학교초등학교공립N<NA>1172<NA><NA>90000003120000
64572019이천시경기도교육청경기도 이천시효양고등학교<NA>공립N<NA>108010137000004000000
214972016화성시경기도화성오산교육지원청경기도 화성시동탄중학교<NA>공립N<NA>105000129270604309020
259862015하남시경기도광주하남교육지원청경기도 하남시하남풍산초등학교<NA>공립N<NA>1046<NA><NA>80000001880000
126182017광명시경기도광명교육지원청경기도 광명시연서초등학교초등학교공립N<NA>10501197380003246000
37852019부천시경기도부천교육지원청경기도 부천시부천부흥초등학교<NA>공립N<NA>104310160000005088000
184032016성남시경기도교육청경기도 성남시 중원구동광고등학교<NA>사립N<NA>1080102200000038872000

Duplicate rows

Most frequently occurring

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유도서관수(개)도서관직원수(명)도서관(실)총좌석수(석)사서자격증보유수(개)사서자격증미보유수(개)자료구입비예산액(원)운영비예산액(원)# duplicates
02019가평군경기도가평교육지원청경기도 가평군목동초등학교명지분교장<NA>공립N<NA>1000010000005000002
12019가평군경기도가평교육지원청경기도 가평군율길초등학교<NA>공립N<NA>102410633100021110002
22019가평군경기도교육청경기도 가평군가평고등학교<NA>공립N<NA>1090101445871048899902
32019가평군경기도교육청경기도 가평군설악고등학교<NA>공립N<NA>105010872500035000002
42019고양시경기도고양교육지원청경기도 고양시 덕양구고양오금초등학교<NA>공립N<NA>1036001123600037460002
52019고양시경기도고양교육지원청경기도 고양시 덕양구고양초등학교<NA>공립N<NA>1046001235000043150002
62019고양시경기도고양교육지원청경기도 고양시 덕양구능곡초등학교<NA>공립N<NA>1060101310000038000002
72019고양시경기도고양교육지원청경기도 고양시 덕양구도래울초등학교<NA>공립N<NA>1036101760000076400002
82019고양시경기도고양교육지원청경기도 고양시 덕양구성사초등학교<NA>공립N<NA>1054101853000080000002
92019고양시경기도고양교육지원청경기도 고양시 덕양구행신중학교<NA>공립N<NA>1060101364700046000002