Overview

Dataset statistics

Number of variables6
Number of observations1098
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory55.9 KiB
Average record size in memory52.1 B

Variable types

Categorical2
Text1
Numeric3

Dataset

Description김해시에서 통계기반 도시현황 파악을 위해 개발한 통계지수 중 하나로서, 통계연도, 시도명, 시군구명, 유아천명당 어린이집의 수(개), 보육시설수(개), 0에서5세아동수(명)로 구성되어 있습니다. 김해시 중심의 통계지수로서, 데이터 수집, 가공 등의 어려움으로 김해시 외 지역의 정보는 누락될 수 있습니다.
Author경상남도 김해시
URLhttps://www.data.go.kr/data/15110149/fileData.do

Alerts

유아천명당 어린이집의 수(개) is highly overall correlated with 보육시설수(개)High correlation
보육시설수(개) is highly overall correlated with 유아천명당 어린이집의 수(개) and 1 other fieldsHigh correlation
0에서5세아동수(명) is highly overall correlated with 보육시설수(개)High correlation

Reproduction

Analysis started2023-12-12 05:54:07.458698
Analysis finished2023-12-12 05:54:09.051007
Duration1.59 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

통계연도
Categorical

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
2014
227 
2015
227 
2016
227 
2012
226 
2013
191 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2012
2nd row2012
3rd row2012
4th row2012
5th row2012

Common Values

ValueCountFrequency (%)
2014 227
20.7%
2015 227
20.7%
2016 227
20.7%
2012 226
20.6%
2013 191
17.4%

Length

2023-12-12T14:54:09.136765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:54:09.313378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2014 227
20.7%
2015 227
20.7%
2016 227
20.7%
2012 226
20.6%
2013 191
17.4%

시도명
Categorical

Distinct16
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
경기도
154 
경상북도
115 
전라남도
110 
서울특별시
100 
강원도
90 
Other values (11)
529 

Length

Max length7
Median length5
Mean length4.1147541
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
경기도 154
14.0%
경상북도 115
10.5%
전라남도 110
10.0%
서울특별시 100
9.1%
강원도 90
8.2%
경상남도 90
8.2%
부산광역시 80
7.3%
충청남도 75
6.8%
전라북도 70
6.4%
인천광역시 45
 
4.1%
Other values (6) 169
15.4%

Length

2023-12-12T14:54:09.450162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 154
14.0%
경상북도 115
10.5%
전라남도 110
10.0%
서울특별시 100
9.1%
강원도 90
8.2%
경상남도 90
8.2%
부산광역시 80
7.3%
충청남도 75
6.8%
전라북도 70
6.4%
인천광역시 45
 
4.1%
Other values (6) 169
15.4%
Distinct205
Distinct (%)18.7%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
2023-12-12T14:54:09.823138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9253188
Min length2

Characters and Unicode

Total characters3212
Distinct characters130
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종로구
2nd row중구
3rd row용산구
4th row성동구
5th row광진구
ValueCountFrequency (%)
동구 30
 
2.7%
중구 29
 
2.6%
서구 25
 
2.3%
남구 20
 
1.8%
북구 20
 
1.8%
고성군 10
 
0.9%
강서구 9
 
0.8%
동해시 5
 
0.5%
군위군 5
 
0.5%
계룡시 5
 
0.5%
Other values (195) 940
85.6%
2023-12-12T14:54:10.422954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
417
 
13.0%
386
 
12.0%
339
 
10.6%
105
 
3.3%
97
 
3.0%
88
 
2.7%
87
 
2.7%
80
 
2.5%
78
 
2.4%
62
 
1.9%
Other values (120) 1473
45.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3212
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
417
 
13.0%
386
 
12.0%
339
 
10.6%
105
 
3.3%
97
 
3.0%
88
 
2.7%
87
 
2.7%
80
 
2.5%
78
 
2.4%
62
 
1.9%
Other values (120) 1473
45.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3212
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
417
 
13.0%
386
 
12.0%
339
 
10.6%
105
 
3.3%
97
 
3.0%
88
 
2.7%
87
 
2.7%
80
 
2.5%
78
 
2.4%
62
 
1.9%
Other values (120) 1473
45.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3212
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
417
 
13.0%
386
 
12.0%
339
 
10.6%
105
 
3.3%
97
 
3.0%
88
 
2.7%
87
 
2.7%
80
 
2.5%
78
 
2.4%
62
 
1.9%
Other values (120) 1473
45.9%

유아천명당 어린이집의 수(개)
Real number (ℝ)

HIGH CORRELATION 

Distinct755
Distinct (%)68.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.052286
Minimum2.31
Maximum28.25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.8 KiB
2023-12-12T14:54:10.613608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.31
5-th percentile8.2285
Q111.4525
median13.565
Q316.845
95-th percentile20.5545
Maximum28.25
Range25.94
Interquartile range (IQR)5.3925

Descriptive statistics

Standard deviation3.9160243
Coefficient of variation (CV)0.27867525
Kurtosis0.33690403
Mean14.052286
Median Absolute Deviation (MAD)2.535
Skewness0.34056284
Sum15429.41
Variance15.335246
MonotonicityNot monotonic
2023-12-12T14:54:10.802134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
14.41 6
 
0.5%
13.15 6
 
0.5%
17.12 5
 
0.5%
12.76 5
 
0.5%
13.38 4
 
0.4%
11.0 4
 
0.4%
14.86 4
 
0.4%
17.33 4
 
0.4%
13.26 4
 
0.4%
19.25 4
 
0.4%
Other values (745) 1052
95.8%
ValueCountFrequency (%)
2.31 1
0.1%
2.32 1
0.1%
2.6 1
0.1%
2.79 1
0.1%
3.3 1
0.1%
4.14 1
0.1%
4.65 1
0.1%
5.51 1
0.1%
5.77 2
0.2%
5.85 1
0.1%
ValueCountFrequency (%)
28.25 1
0.1%
27.62 1
0.1%
27.3 1
0.1%
26.02 1
0.1%
25.96 1
0.1%
25.95 1
0.1%
25.79 1
0.1%
25.78 1
0.1%
25.37 1
0.1%
25.22 1
0.1%

보육시설수(개)
Real number (ℝ)

HIGH CORRELATION 

Distinct427
Distinct (%)38.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean184.63843
Minimum1
Maximum1311
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.8 KiB
2023-12-12T14:54:10.984718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile11
Q125.25
median104
Q3250.75
95-th percentile677.45
Maximum1311
Range1310
Interquartile range (IQR)225.5

Descriptive statistics

Standard deviation223.09692
Coefficient of variation (CV)1.2082908
Kurtosis5.2785685
Mean184.63843
Median Absolute Deviation (MAD)89
Skewness2.1089641
Sum202733
Variance49772.237
MonotonicityNot monotonic
2023-12-12T14:54:11.172631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
14 32
 
2.9%
13 27
 
2.5%
11 24
 
2.2%
17 21
 
1.9%
12 21
 
1.9%
16 16
 
1.5%
15 14
 
1.3%
26 12
 
1.1%
25 12
 
1.1%
39 12
 
1.1%
Other values (417) 907
82.6%
ValueCountFrequency (%)
1 1
 
0.1%
2 4
0.4%
3 2
 
0.2%
4 3
 
0.3%
5 6
0.5%
6 6
0.5%
7 4
0.4%
8 8
0.7%
9 6
0.5%
10 9
0.8%
ValueCountFrequency (%)
1311 1
0.1%
1284 1
0.1%
1266 1
0.1%
1254 1
0.1%
1187 1
0.1%
1182 1
0.1%
1166 1
0.1%
1161 1
0.1%
1134 1
0.1%
1114 1
0.1%

0에서5세아동수(명)
Real number (ℝ)

HIGH CORRELATION 

Distinct1056
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11827.185
Minimum291
Maximum71849
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.8 KiB
2023-12-12T14:54:11.347106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum291
5-th percentile1013.6
Q12227.75
median6863.5
Q317745.75
95-th percentile35789.65
Maximum71849
Range71558
Interquartile range (IQR)15518

Descriptive statistics

Standard deviation12860.77
Coefficient of variation (CV)1.0873906
Kurtosis3.9464356
Mean11827.185
Median Absolute Deviation (MAD)5528
Skewness1.8215064
Sum12986249
Variance1.653994 × 108
MonotonicityNot monotonic
2023-12-12T14:54:11.518728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1429 3
 
0.3%
10356 2
 
0.2%
14542 2
 
0.2%
1117 2
 
0.2%
3499 2
 
0.2%
3732 2
 
0.2%
1034 2
 
0.2%
746 2
 
0.2%
2632 2
 
0.2%
2325 2
 
0.2%
Other values (1046) 1077
98.1%
ValueCountFrequency (%)
291 1
0.1%
303 1
0.1%
316 1
0.1%
323 1
0.1%
342 1
0.1%
559 1
0.1%
563 1
0.1%
569 1
0.1%
574 1
0.1%
578 1
0.1%
ValueCountFrequency (%)
71849 1
0.1%
71795 1
0.1%
70161 1
0.1%
70147 1
0.1%
69027 1
0.1%
67766 1
0.1%
67435 1
0.1%
66464 1
0.1%
65496 1
0.1%
64446 1
0.1%

Interactions

2023-12-12T14:54:08.476815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:07.780066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:08.115044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:08.580589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:07.888025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:08.231807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:08.694592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:07.997652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:54:08.360228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:54:11.632504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
통계연도시도명유아천명당 어린이집의 수(개)보육시설수(개)0에서5세아동수(명)
통계연도1.0000.0000.2070.0000.000
시도명0.0001.0000.5400.5840.606
유아천명당 어린이집의 수(개)0.2070.5401.0000.5730.472
보육시설수(개)0.0000.5840.5731.0000.949
0에서5세아동수(명)0.0000.6060.4720.9491.000
2023-12-12T14:54:11.761773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
통계연도시도명
통계연도1.0000.000
시도명0.0001.000
2023-12-12T14:54:11.865634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유아천명당 어린이집의 수(개)보육시설수(개)0에서5세아동수(명)통계연도시도명
유아천명당 어린이집의 수(개)1.0000.6290.4840.0870.245
보육시설수(개)0.6291.0000.9810.0000.273
0에서5세아동수(명)0.4840.9811.0000.0000.288
통계연도0.0870.0000.0001.0000.000
시도명0.2450.2730.2880.0001.000

Missing values

2023-12-12T14:54:08.868171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:54:08.987107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

통계연도시도명시군구명유아천명당 어린이집의 수(개)보육시설수(개)0에서5세아동수(명)
02012서울특별시종로구11.89715971
12012서울특별시중구9.38576076
22012서울특별시용산구11.0113011811
32012서울특별시성동구11.6517615110
42012서울특별시광진구13.1522316953
52012서울특별시동대문구13.5722616653
62012서울특별시중랑구14.2527319160
72012서울특별시성북구12.7631924994
82012서울특별시강북구13.3321015759
92012서울특별시도봉구17.5629616858
통계연도시도명시군구명유아천명당 어린이집의 수(개)보육시설수(개)0에서5세아동수(명)
10882016경상남도창녕군10.48242291
10892016경상남도고성군12.83272104
10902016경상남도남해군15.31161045
10912016경상남도하동군12.0171417
10922016경상남도산청군10.8121111
10932016경상남도함양군11.77151274
10942016경상남도거창군12.94332550
10952016경상남도합천군10.48131240
10962016제주특별자치도제주시14.6942028585
10972016제주특별자치도서귀포시14.411238535