Overview

Dataset statistics

Number of variables: 6
Number of observations: 285
Missing cells: 36
Missing cells (%): 2.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 14.0 KiB
Average record size in memory: 50.5 B
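
The percentages above follow from the raw counts. The sketch below redoes the arithmetic, assuming that "Missing cells (%)" means missing cells divided by all cells (observations × variables) and that all 36 missing cells sit in a single column; the variable names are illustrative only.

```python
# Counts copied from the dataset statistics above.
n_variables = 6
n_observations = 285
missing_cells = 36

# Assumption: the percentage is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_cells_pct = 100 * missing_cells / total_cells

# If all missing cells belong to one column, its missing rate is per row.
column_missing_pct = 100 * missing_cells / n_observations

print(total_cells)                   # 1710
print(round(missing_cells_pct, 1))   # 2.1
print(round(column_missing_pct, 1))  # 12.6
```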

Variable types

Numeric: 2
Categorical: 4

Dataset

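Description: Provides information on the status of private academies and reading rooms located in Jeju Province. Source: KOSIS (Korean Statistical Information Service, 국가통계포털)
Author: Jeju Special Self-Governing Province, Future Growth Division (제주특별자치도 미래성장과)
URL: https://www.jejudatahub.net/data/view/data/997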

Alerts

High correlation: 개수 is highly overall correlated with 구분대분류
High correlation: 구분대분류 is highly overall correlated with 개수 and 1 other field
High correlation: 구분중분류 is highly overall correlated with 구분대분류
Imbalance: 구분대분류 is highly imbalanced (60.7%)
Missing: 개수 has 36 (12.6%) missing values
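
The 60.7% imbalance figure can be reproduced from the value counts of 구분대분류 reported later in this document. The sketch below assumes the score is one minus the Shannon entropy normalized by its maximum, log2(k) for k classes. This definition is an assumption, not something stated in the report, though it does yield the reported number; the function name is hypothetical.

```python
import math

# Value counts of 구분대분류, copied from its Common Values table.
counts = {"학원수": 233, "독서실": 35, "강의실수": 13, "사무실수": 2, "실험실습실수": 2}


def imbalance_score(value_counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    (Assumed definition, see the lead-in.)
    """
    total = sum(value_counts)
    k = len(value_counts)
    if k <= 1:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in value_counts if c > 0)
    return 1 - entropy / math.log2(k)


score = imbalance_score(list(counts.values()))
print(f"{score:.1%}")  # 60.7%
```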

Reproduction

Analysis started: 2023-12-11 19:37:58.501546
Analysis finished: 2023-12-11 19:37:59.629289
Duration: 1.13 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

기준 연도 (reference year)
Real number (ℝ)

Distinct: 9
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2017.0281
Minimum: 2012
Maximum: 2020
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.6 KiB

Quantile statistics

Minimum: 2012
5-th percentile: 2012
Q1: 2015
Median: 2018
Q3: 2019
95-th percentile: 2020
Maximum: 2020
Range: 8
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 2.4608022
Coefficient of variation (CV): 0.0012200138
Kurtosis: -0.65351184
Mean: 2017.0281
Median Absolute Deviation (MAD): 1
Skewness: -0.66935019
Sum: 574853
Variance: 6.0555473
Monotonicity: Increasing
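
Several of these statistics can be recomputed from the per-year counts in the frequency table of this variable. The sketch below expands the counts into one value per observation and uses Python's standard library. It assumes the reported variance is the sample variance (divisor n - 1), which is an assumption, though it matches the reported figures.

```python
import math
import statistics

# Observations per year, copied from the frequency table of 기준 연도.
counts = {2012: 20, 2013: 20, 2014: 17, 2015: 16, 2016: 16,
          2017: 51, 2018: 51, 2019: 46, 2020: 48}

# Expand the counts into one entry per observation (285 values).
years = [year for year, n in counts.items() for _ in range(n)]

mean = statistics.fmean(years)
variance = statistics.variance(years)  # sample variance (divisor n - 1)
std = math.sqrt(variance)
cv = std / mean                        # coefficient of variation

print(len(years), sum(years))          # 285 574853
print(round(mean, 4))                  # 2017.0281
print(round(variance, 4))              # 6.0555
print(statistics.median(years))        # 2018
```

The coefficient of variation is tiny because the standard deviation of about 2.46 years is divided by a mean near 2017.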
[Figure: histogram with fixed size bins (bins=9)]
Value  Count  Frequency (%)
2017   51     17.9%
2018   51     17.9%
2020   48     16.8%
2019   46     16.1%
2012   20     7.0%
2013   20     7.0%
2014   17     6.0%
2015   16     5.6%
2016   16     5.6%

시군구 (city/county/district)
Categorical

Distinct: 3
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

합계 (total): 156
제주시 (Jeju City): 67
서귀포시 (Seogwipo City): 62

Length

Max length: 4
Median length: 2
Mean length: 2.6701754
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 합계
2nd row: 합계
3rd row: 합계
4th row: 합계
5th row: 합계

Common Values

Value                      Count  Frequency (%)
합계 (total)               156    54.7%
제주시 (Jeju City)         67     23.5%
서귀포시 (Seogwipo City)   62     21.8%
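
The length statistics for 시군구 can be derived from these value counts. The sketch below assumes that "length" means the number of Unicode characters in each value, as returned by Python's `len()`. That reading is an assumption, though it matches the reported minimum, median, mean, and maximum.

```python
import statistics

# Value counts of 시군구, copied from the Common Values table.
counts = {"합계": 156, "제주시": 67, "서귀포시": 62}

# One character count per observation (285 entries in total).
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

print(min(lengths), max(lengths))           # 2 4
print(statistics.median(lengths))           # 2
print(round(statistics.fmean(lengths), 7))  # 2.6701754
```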

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the values listed in the Common Values table]

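구분대분류 (major category)
Categorical

Alerts: HIGH CORRELATION, IMBALANCE

Distinct: 5
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

학원수 (number of academies): 233
독서실 (reading rooms): 35
강의실수 (number of lecture rooms): 13
사무실수 (number of offices): 2
실험실습실수 (number of laboratory and practice rooms): 2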

Length

Max length: 6
Median length: 3
Mean length: 3.0736842
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 강의실수
2nd row: 독서실
3rd row: 독서실
4th row: 독서실
5th row: 사무실수

Common Values

Value          Count  Frequency (%)
학원수         233    81.8%
독서실         35     12.3%
강의실수       13     4.6%
사무실수       2      0.7%
실험실습실수   2      0.7%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the values listed in the Common Values table]

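구분중분류 (middle category)
Categorical

Alerts: HIGH CORRELATION

Distinct: 6
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

학교교과교습학원 (school-curriculum tutoring academies): 116
평생직업교육학원 (lifelong vocational education academies): 100
소계 (subtotal): 34
독서실수 (number of reading rooms): 16
열람실수 (number of reading halls): 16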

Length

Max length: 8
Median length: 8
Mean length: 6.8035088
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 소계
2nd row: 독서실수
3rd row: 열람실수
4th row: 열람좌석수
5th row: 소계

Common Values

Value                                  Count  Frequency (%)
학교교과교습학원                       116    40.7%
평생직업교육학원                       100    35.1%
소계                                   34     11.9%
독서실수                               16     5.6%
열람실수                               16     5.6%
열람좌석수 (number of reading seats)   3      1.1%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the values listed in the Common Values table]

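구분소분류 (minor category)
Categorical

Distinct: 10
Distinct (%): 3.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

소계 (subtotal): 103
국제화 (internationalization): 34
종합 (comprehensive): 33
기예 (arts and crafts): 17
직업기술 (vocational skills): 17
Other values (5): 81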

Length

Max length: 7
Median length: 2
Mean length: 2.7473684
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 소계
2nd row: 소계
3rd row: 소계
4th row: 소계
5th row: 소계

Common Values

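Value                                                 Count  Frequency (%)
소계 (subtotal)                                       103    36.1%
국제화 (internationalization)                         34     11.9%
종합 (comprehensive)                                  33     11.6%
기예 (arts and crafts)                                17     6.0%
직업기술 (vocational skills)                          17     6.0%
기타 (other)                                          17     6.0%
예능 (performing arts)                                17     6.0%
입시검정및보습 (exam preparation and supplementary)   17     6.0%
인문사회 (humanities and social sciences)             16     5.6%
특수교육 (special education)                          14     4.9%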

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the values listed in the Common Values table]

개수 (count)
Real number (ℝ)

Alerts: HIGH CORRELATION, MISSING

Distinct: 147
Distinct (%): 59.0%
Missing: 36
Missing (%): 12.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 443.35743
Minimum: 1
Maximum: 6973
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 9
Median: 52
Q3: 348
95-th percentile: 2566.2
Maximum: 6973
Range: 6972
Interquartile range (IQR): 339

Descriptive statistics

Standard deviation: 1074.3119
Coefficient of variation (CV): 2.4231282
Kurtosis: 16.348272
Mean: 443.35743
Median Absolute Deviation (MAD): 48
Skewness: 3.9597327
Sum: 110396
Variance: 1154146.1
Monotonicity: Not monotonic
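
The figures for 개수 are internally consistent only if the statistics are computed over the 249 non-missing observations rather than all 285 rows. The sketch below checks this from the reported numbers alone. That missing values are excluded from the denominators is an assumption, though the arithmetic supports it.

```python
import math

# Figures copied from the report for 개수.
n_observations = 285
n_missing = 36
n_distinct = 147
total = 110396          # Sum
variance = 1154146.1    # Variance
q1, q3 = 9, 348
minimum, maximum = 1, 6973

# Assumption: missing values are dropped before computing the statistics.
n_valid = n_observations - n_missing

mean = total / n_valid
std = math.sqrt(variance)
cv = std / mean
distinct_pct = 100 * n_distinct / n_valid

print(n_valid)                  # 249
print(round(mean, 2))           # 443.36
print(round(distinct_pct, 1))   # 59.0
print(round(cv, 4))             # 2.4231
print(q3 - q1)                  # 339
print(maximum - minimum)        # 6972
```

A mean of about 443 against a median of 52 agrees with the strong right skew reported above.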
[Figure: histogram with fixed size bins (bins=50)]
Most common values

Value               Count  Frequency (%)
4                   19     6.7%
6                   9      3.2%
5                   9      3.2%
3                   8      2.8%
2                   6      2.1%
10                  6      2.1%
31                  4      1.4%
28                  4      1.4%
25                  4      1.4%
30                  4      1.4%
Other values (137)  176    61.8%
(Missing)           36     12.6%

Smallest 10 values

Value  Count  Frequency (%)
1      4      1.4%
2      6      2.1%
3      8      2.8%
4      19     6.7%
5      9      3.2%
6      9      3.2%
7      4      1.4%
8      2      0.7%
9      4      1.4%
10     6      2.1%

Largest 10 values

Value  Count  Frequency (%)
6973   1      0.4%
5709   1      0.4%
5651   1      0.4%
5603   1      0.4%
5545   1      0.4%
5442   1      0.4%
4639   1      0.4%
4541   1      0.4%
3858   1      0.4%
3369   1      0.4%

Correlations

[Figure: correlation heatmap]

             기준 연도  시군구  구분대분류  구분중분류  구분소분류  개수
기준 연도    1.000      0.515   0.060       0.000       0.000       0.046
시군구       0.515      1.000   0.000       0.000       0.000       0.109
구분대분류   0.060      0.000   1.000       0.727       0.556       0.819
구분중분류   0.000      0.000   0.727       1.000       0.676       0.726
구분소분류   0.000      0.000   0.556       0.676       1.000       0.303
개수         0.046      0.109   0.819       0.726       0.303       1.000

[Figure: correlation heatmap]

             시군구  구분대분류  구분중분류  구분소분류
시군구       1.000   0.000       0.000       0.000
구분대분류   0.000   1.000       0.594       0.260
구분중분류   0.000   0.594       1.000       0.436
구분소분류   0.000   0.260       0.436       1.000

[Figure: correlation heatmap]

             기준 연도  개수    시군구  구분대분류  구분중분류  구분소분류
기준 연도    1.000      -0.111  0.402   0.000       0.000       0.000
개수         -0.111     1.000   0.046   0.648       0.461       0.101
시군구       0.402      0.046   1.000   0.000       0.000       0.000
구분대분류   0.000      0.648   0.000   1.000       0.594       0.260
구분중분류   0.000      0.461   0.000   0.594       1.000       0.436
구분소분류   0.000      0.101   0.000   0.260       0.436       1.000

Missing values

[Figure: a simple visualization of nullity by column.]
[Figure: nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

First rows

Index  기준 연도  시군구  구분대분류    구분중분류         구분소분류  개수
0      2012       합계    강의실수      소계               소계        2791
1      2012       합계    독서실        독서실수           소계        65
2      2012       합계    독서실        열람실수           소계        220
3      2012       합계    독서실        열람좌석수         소계        6973
4      2012       합계    사무실수      소계               소계        1032
5      2012       합계    실험실습실수  소계               소계        2229
6      2012       합계    학원수        소계               소계        980
7      2012       합계    학원수        평생직업교육학원   국제화      5
8      2012       합계    학원수        평생직업교육학원   기예        1
9      2012       합계    학원수        평생직업교육학원   소계        43

Last rows

Index  기준 연도  시군구  구분대분류  구분중분류         구분소분류       개수
275    2020       합계    학원수      평생직업교육학원   인문사회         4
276    2020       합계    학원수      평생직업교육학원   종합             10
277    2020       합계    학원수      평생직업교육학원   직업기술         36
278    2020       합계    학원수      학교교과교습학원   국제화           121
279    2020       합계    학원수      학교교과교습학원   기타             25
280    2020       합계    학원수      학교교과교습학원   소계             987
281    2020       합계    학원수      학교교과교습학원   예능             301
282    2020       합계    학원수      학교교과교습학원   입시검정및보습   499
283    2020       합계    학원수      학교교과교습학원   종합             41
284    2020       합계    학원수      학교교과교습학원   특수교육         <NA>