Overview

Dataset statistics

Number of variables: 6
Number of observations: 1550
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 310
Duplicate rows (%): 20.0%
Total size in memory: 80.4 KiB
Average record size in memory: 53.1 B

Variable types

Numeric: 1
Categorical: 5

Dataset

Description: Special schools, aggregate counts by school type (특수학교 학교유형별 집계현황)
Author: 경기복지재단 (경기도장애인복지종합지원센터)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=9PNNGZH4VHKHT7UK1GPB25227233&infSeq=1

Alerts

Dataset has 310 (20.0%) duplicate rows [Duplicates]
국립학교수(개) is highly overall correlated with 시군명 and 2 other fields [High correlation]
시군명 is highly overall correlated with 총학교수(개) and 3 other fields [High correlation]
공립학교수(개) is highly overall correlated with 시군명 [High correlation]
총학교수(개) is highly overall correlated with 시군명 and 2 other fields [High correlation]
사립학교수(개) is highly overall correlated with 시군명 and 2 other fields [High correlation]
국립학교수(개) is highly imbalanced (65.5%) [Imbalance]

Reproduction

Analysis started: 2023-12-10 21:17:49.871213
Analysis finished: 2023-12-10 21:17:50.441985
Duration: 0.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

기준년도
Real number (ℝ)

Distinct: 11
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2016.04
Minimum: 2012
Maximum: 2023
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 13.8 KiB

Quantile statistics

Minimum: 2012
5-th percentile: 2012
Q1: 2014
Median: 2016
Q3: 2018
95-th percentile: 2022
Maximum: 2023
Range: 11
Interquartile range (IQR): 4
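
The quantile figures above can be cross-checked from the value counts that this report lists for 기준년도. The following is a minimal sketch, not the profiler's own code: it assumes quantiles are taken by linear interpolation between neighbouring order statistics, and the helper names `expand` and `quantile` are made up for illustration.

```python
# Value counts for 기준년도 as listed in this report (2019 does not appear).
year_counts = {
    2012: 186, 2013: 186, 2014: 186, 2015: 186, 2016: 186, 2017: 186,
    2018: 155, 2020: 93, 2021: 93, 2022: 62, 2023: 31,
}


def expand(counts):
    """Return the sorted list of observations implied by the value counts."""
    values = []
    for value in sorted(counts):
        values.extend([value] * counts[value])
    return values


def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    position = (len(sorted_values) - 1) * q
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


years = expand(year_counts)
summary = {
    "min": years[0],
    "p5": quantile(years, 0.05),
    "q1": quantile(years, 0.25),
    "median": quantile(years, 0.50),
    "q3": quantile(years, 0.75),
    "p95": quantile(years, 0.95),
    "max": years[-1],
}
summary["range"] = summary["max"] - summary["min"]
summary["iqr"] = summary["q3"] - summary["q1"]
print(summary)
```

Under that interpolation assumption the sketch gives the same minimum, percentiles, range, and IQR as the table above.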

Descriptive statistics

Standard deviation: 3.0007014
Coefficient of variation (CV): 0.0014884136
Kurtosis: -0.59615211
Mean: 2016.04
Median Absolute Deviation (MAD): 2
Skewness: 0.54438043
Sum: 3124862
Variance: 9.0042092
Monotonicity: Decreasing

[Figure: Histogram with fixed size bins (bins=11)]

Common values

Value  Count  Frequency (%)
2017   186    12.0%
2016   186    12.0%
2015   186    12.0%
2014   186    12.0%
2013   186    12.0%
2012   186    12.0%
2018   155    10.0%
2021   93     6.0%
2020   93     6.0%
2022   62     4.0%

Minimum 10 values

Value  Count  Frequency (%)
2012   186    12.0%
2013   186    12.0%
2014   186    12.0%
2015   186    12.0%
2016   186    12.0%
2017   186    12.0%
2018   155    10.0%
2020   93     6.0%
2021   93     6.0%
2022   62     4.0%

Maximum 10 values

Value  Count  Frequency (%)
2023   31     2.0%
2022   62     4.0%
2021   93     6.0%
2020   93     6.0%
2018   155    10.0%
2017   186    12.0%
2016   186    12.0%
2015   186    12.0%
2014   186    12.0%
2013   186    12.0%
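
The frequency tables above are enough to recompute the descriptive statistics for 기준년도. The sketch below does this with the standard library only. It assumes the reported variance is the sample variance (divisor n - 1), which is a guess that happens to match the published figures and is not something the report states.

```python
import math

# Value counts for 기준년도 taken from the frequency tables in this report.
year_counts = {
    2012: 186, 2013: 186, 2014: 186, 2015: 186, 2016: 186, 2017: 186,
    2018: 155, 2020: 93, 2021: 93, 2022: 62, 2023: 31,
}

n = sum(year_counts.values())
total = sum(value * count for value, count in year_counts.items())
mean = total / n

# Assumption: sample variance, i.e. the squared deviations divided by n - 1.
squared_deviation = sum(
    count * (value - mean) ** 2 for value, count in year_counts.items()
)
variance = squared_deviation / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(f"n={n} sum={total} mean={mean:.2f}")
print(f"variance={variance:.7f} std={std:.7f} cv={cv:.10f}")
```

The results agree with the reported sum (3124862), mean (2016.04), variance, standard deviation, and coefficient of variation to the precision shown in the report.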

시군명
Categorical

HIGH CORRELATION 

Distinct: 31
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 12.2 KiB

Value  Count
가평군  50
고양시  50
과천시  50
광명시  50
광주시  50
Other values (26)  1300

Length

Max length: 4
Median length: 3
Mean length: 3.0967742
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가평군
2nd row: 고양시
3rd row: 과천시
4th row: 광명시
5th row: 광주시

Common Values

Value  Count  Frequency (%)
가평군  50  3.2%
고양시  50  3.2%
과천시  50  3.2%
광명시  50  3.2%
광주시  50  3.2%
구리시  50  3.2%
군포시  50  3.2%
김포시  50  3.2%
남양주시  50  3.2%
동두천시  50  3.2%
Other values (21)  1050  67.7%

Length

[Figure: Histogram of lengths of the category]

Value  Count  Frequency (%)
가평군  50  3.2%
안양시  50  3.2%
하남시  50  3.2%
포천시  50  3.2%
평택시  50  3.2%
파주시  50  3.2%
이천시  50  3.2%
의정부시  50  3.2%
의왕시  50  3.2%
용인시  50  3.2%
Other values (21)  1050  67.7%

총학교수(개)
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 12.2 KiB

Value  Count
0  621
1  429
2  330
3  100
4  70

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 4
3rd row: 0
4th row: 0
5th row: 4

Common Values

Value  Count  Frequency (%)
0  621  40.1%
1  429  27.7%
2  330  21.3%
3  100  6.5%
4  70  4.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value  Count  Frequency (%)
0  621  40.1%
1  429  27.7%
2  330  21.3%
3  100  6.5%
4  70  4.5%

국립학교수(개)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.2 KiB

Value  Count
0  1450
1  100

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 1
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0  1450  93.5%
1  100  6.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value  Count  Frequency (%)
0  1450  93.5%
1  100  6.5%
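
The 65.5% imbalance figure in the alert for 국립학교수(개) can be related to the two counts above. One definition that reproduces it is one minus the Shannon entropy of the class distribution, normalised by the maximum entropy for the same number of classes. The sketch below is written under that assumption; the report does not state the formula, and `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """One minus Shannon entropy, normalised by the maximum entropy log2(k).

    Assumed definition: 0 means perfectly balanced classes,
    values near 1 mean a single class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts of the values 0 and 1 for 국립학교수(개), taken from the table above.
national_counts = [1450, 100]
score = imbalance_score(national_counts)
print(f"{score:.1%}")  # 65.5%
```

With a 93.5% / 6.5% split the entropy is about 0.345 bits out of a possible 1 bit, which gives the 65.5% shown in the alert.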

공립학교수(개)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 12.2 KiB

Value  Count
0  1101
1  399
2  50

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0  1101  71.0%
1  399  25.7%
2  50  3.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value  Count  Frequency (%)
0  1101  71.0%
1  399  25.7%
2  50  3.2%

사립학교수(개)
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 12.2 KiB

Value  Count
0  850
1  450
2  150
3  80
4  20

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 3
3rd row: 0
4th row: 0
5th row: 4

Common Values

Value  Count  Frequency (%)
0  850  54.8%
1  450  29.0%
2  150  9.7%
3  80  5.2%
4  20  1.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value  Count  Frequency (%)
0  850  54.8%
1  450  29.0%
2  150  9.7%
3  80  5.2%
4  20  1.3%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

              기준년도  시군명  총학교수(개)  국립학교수(개)  공립학교수(개)  사립학교수(개)
기준년도        1.000  0.000  0.000  0.000  0.127  0.000
시군명          0.000  1.000  0.981  1.000  0.984  0.979
총학교수(개)    0.000  0.981  1.000  0.516  0.531  0.939
국립학교수(개)  0.000  1.000  0.516  1.000  0.099  0.467
공립학교수(개)  0.127  0.984  0.531  0.099  1.000  0.255
사립학교수(개)  0.000  0.979  0.939  0.467  0.255  1.000

[Figure: correlation heatmap]

              국립학교수(개)  시군명  사립학교수(개)  공립학교수(개)  총학교수(개)
국립학교수(개)  1.000  0.991  0.568  0.164  0.626
시군명          0.991  1.000  0.892  0.935  0.897
사립학교수(개)  0.568  0.892  1.000  0.198  0.655
공립학교수(개)  0.164  0.935  0.198  1.000  0.470
총학교수(개)    0.626  0.897  0.655  0.470  1.000

[Figure: correlation heatmap]

              기준년도  시군명  총학교수(개)  국립학교수(개)  공립학교수(개)  사립학교수(개)
기준년도        1.000  0.000  0.000  0.000  0.091  0.030
시군명          0.000  1.000  0.897  0.991  0.935  0.892
총학교수(개)    0.000  0.897  1.000  0.626  0.470  0.655
국립학교수(개)  0.000  0.991  0.626  1.000  0.164  0.568
공립학교수(개)  0.091  0.935  0.470  0.164  1.000  0.198
사립학교수(개)  0.030  0.892  0.655  0.568  0.198  1.000
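
The "High correlation" alerts near the top of this report can be matched against the last matrix above. The sketch below lists, for each variable, the other variables whose coefficient exceeds a cut-off. The cut-off of 0.5 is an assumption chosen because it reproduces the alert counts, and `correlated_fields` is a hypothetical helper name; neither comes from the report.

```python
columns = [
    "기준년도", "시군명", "총학교수(개)",
    "국립학교수(개)", "공립학교수(개)", "사립학교수(개)",
]

# Last correlation matrix in this section, rows in the same order as columns.
matrix = [
    [1.000, 0.000, 0.000, 0.000, 0.091, 0.030],
    [0.000, 1.000, 0.897, 0.991, 0.935, 0.892],
    [0.000, 0.897, 1.000, 0.626, 0.470, 0.655],
    [0.000, 0.991, 0.626, 1.000, 0.164, 0.568],
    [0.091, 0.935, 0.470, 0.164, 1.000, 0.198],
    [0.030, 0.892, 0.655, 0.568, 0.198, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not stated in the report


def correlated_fields(names, values, threshold):
    """Map each variable to the other variables above the threshold."""
    result = {}
    for i, name in enumerate(names):
        partners = [
            names[j]
            for j, coefficient in enumerate(values[i])
            if j != i and abs(coefficient) > threshold
        ]
        if partners:
            result[name] = partners
    return result


alerts = correlated_fields(columns, matrix, THRESHOLD)
for name, partners in alerts.items():
    print(f"{name}: {', '.join(partners)}")
```

With this cut-off, 시군명 is paired with four fields, 공립학교수(개) only with 시군명, the other three count columns with three fields each, and 기준년도 with none, which is the pattern the alerts describe. The first matrix would not give the same pattern at 0.5, since it has 0.531 between 총학교수(개) and 공립학교수(개).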

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

First rows

Index  기준년도  시군명  총학교수(개)  국립학교수(개)  공립학교수(개)  사립학교수(개)
0  2023  가평군  0  0  0  0
1  2023  고양시  4  1  0  3
2  2023  과천시  0  0  0  0
3  2023  광명시  0  0  0  0
4  2023  광주시  4  0  0  4
5  2023  구리시  0  0  0  0
6  2023  군포시  0  0  0  0
7  2023  김포시  1  0  1  0
8  2023  남양주시  1  0  1  0
9  2023  동두천시  0  0  0  0

Last rows

Index  기준년도  시군명  총학교수(개)  국립학교수(개)  공립학교수(개)  사립학교수(개)
1540  2012  하남시  1  0  0  1
1541  2012  하남시  1  0  0  1
1542  2012  하남시  1  0  0  1
1543  2012  하남시  1  0  0  1
1544  2012  화성시  2  0  0  2
1545  2012  화성시  2  0  0  2
1546  2012  화성시  2  0  0  2
1547  2012  화성시  2  0  0  2
1548  2012  화성시  2  0  0  2
1549  2012  화성시  2  0  0  2
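
The column names suggest that 총학교수(개) (total schools) is the sum of 국립학교수(개) (national), 공립학교수(개) (public), and 사립학교수(개) (private). The report does not say so, so the sketch below only checks whether the published numbers are consistent with that reading: first on a few of the sample rows above, then on the per-column value counts from the Variables section.

```python
# A few sample rows: (year, city, total, national, public, private).
sample_rows = [
    (2023, "가평군", 0, 0, 0, 0),
    (2023, "고양시", 4, 1, 0, 3),
    (2023, "광주시", 4, 0, 0, 4),
    (2023, "김포시", 1, 0, 1, 0),
    (2023, "남양주시", 1, 0, 1, 0),
    (2012, "하남시", 1, 0, 0, 1),
    (2012, "화성시", 2, 0, 0, 2),
]

rows_consistent = all(
    total == national + public + private
    for _, _, total, national, public, private in sample_rows
)

# Value counts per column, as listed in the Variables section.
value_counts = {
    "total": {0: 621, 1: 429, 2: 330, 3: 100, 4: 70},
    "national": {0: 1450, 1: 100},
    "public": {0: 1101, 1: 399, 2: 50},
    "private": {0: 850, 1: 450, 2: 150, 3: 80, 4: 20},
}

# Sum of each column over all 1550 rows, reconstructed from the counts.
column_sums = {
    name: sum(value * count for value, count in counts.items())
    for name, counts in value_counts.items()
}
parts = column_sums["national"] + column_sums["public"] + column_sums["private"]

print(rows_consistent, column_sums, parts)
```

Both checks agree with the additive reading: every listed sample row adds up, and the column totals satisfy 100 + 499 + 1070 = 1669. This shows consistency for the published aggregates only, not that every individual row adds up.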

Duplicate rows

Most frequently occurring

Index  기준년도  시군명  총학교수(개)  국립학교수(개)  공립학교수(개)  사립학교수(개)  # duplicates
0  2012  가평군  0  0  0  0  6
1  2012  고양시  4  1  0  3  6
2  2012  과천시  0  0  0  0  6
3  2012  광명시  0  0  0  0  6
4  2012  광주시  3  0  0  3  6
5  2012  구리시  0  0  0  0  6
6  2012  군포시  0  0  0  0  6
7  2012  김포시  0  0  0  0  6
8  2012  남양주시  1  0  1  0  6
9  2012  동두천시  0  0  0  0  6
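
The duplicate figure of 310 (20.0%) can look low next to a table in which every 2012 row occurs 6 times. One reading that fits the numbers is that 310 counts the distinct row patterns that occur more than once, not the redundant copies. The sketch below works this through from the year counts. It assumes each year's rows are spread evenly over the 31 시군명 values and that all copies of a city-year row are identical; both are inferences from this report, not stated facts.

```python
# Value counts for 기준년도 as listed in this report.
year_counts = {
    2012: 186, 2013: 186, 2014: 186, 2015: 186, 2016: 186, 2017: 186,
    2018: 155, 2020: 93, 2021: 93, 2022: 62, 2023: 31,
}
N_CITIES = 31  # distinct 시군명 values

# Assumption: every city appears equally often within a year.
assert all(count % N_CITIES == 0 for count in year_counts.values())
copies_per_year = {year: count // N_CITIES for year, count in year_counts.items()}

n_rows = sum(year_counts.values())
rows_per_city = sum(copies_per_year.values())

# One distinct row pattern per (year, city) pair under the assumptions above.
distinct_patterns = N_CITIES * len(year_counts)
repeated_patterns = N_CITIES * sum(
    1 for copies in copies_per_year.values() if copies > 1
)
redundant_rows = n_rows - distinct_patterns

print(copies_per_year)
print(rows_per_city, distinct_patterns, repeated_patterns, redundant_rows)
print(f"{repeated_patterns / n_rows:.1%}")
```

Under these assumptions each city has 50 rows, matching the 시군명 counts, and 310 of the 341 distinct patterns repeat, which is 20.0% of 1550. Dropping exact duplicates would leave 341 rows, so about 1209 rows are redundant copies.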