Overview

Dataset statistics

Number of variables7
Number of observations684
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory40.9 KiB
Average record size in memory61.2 B

Variable types

Categorical2
Text1
Numeric4

Dataset

Description김해시에서 통계기반 도시현황 파악을 위해 개발한 통계지수 중 하나로서, 통계연도, 시도명, 시군구명, 학교 평균 접근시간(분), 초등학교 평균 접근시간(분), 중학교 평균 접근시간(분), 고등학교 평균 접근시간(분)으로 구성되어 있습니다. 김해시 중심의 통계지수로서, 데이터 수집, 가공 등의 어려움으로 김해시 외 지역의 정보는 누락될 수 있습니다.
Author경상남도 김해시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15110188

Alerts

학교 평균 접근시간(분) is highly overall correlated with 초등학교 평균 접근시간(분) and 2 other fieldsHigh correlation
초등학교 평균 접근시간(분) is highly overall correlated with 학교 평균 접근시간(분) and 2 other fieldsHigh correlation
중학교 평균 접근시간(분) is highly overall correlated with 학교 평균 접근시간(분) and 2 other fieldsHigh correlation
고등학교 평균 접근시간(분) is highly overall correlated with 학교 평균 접근시간(분) and 2 other fieldsHigh correlation

Reproduction

Analysis started2023-12-10 23:09:21.656703
Analysis finished2023-12-10 23:09:23.503552
Duration1.85 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

통계연도
Categorical

Distinct3
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2017
228 
2018
228 
2019
228 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2017 228
33.3%
2018 228
33.3%
2019 228
33.3%

Length

2023-12-11T08:09:23.603291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:09:23.716371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 228
33.3%
2018 228
33.3%
2019 228
33.3%

시도명
Categorical

Distinct16
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
경기도
93 
서울특별시
75 
경상북도
69 
전라남도
66 
강원도
54 
Other values (11)
327 

Length

Max length7
Median length5
Mean length4.1359649
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원도
2nd row강원도
3rd row강원도
4th row강원도
5th row강원도

Common Values

ValueCountFrequency (%)
경기도 93
13.6%
서울특별시 75
11.0%
경상북도 69
10.1%
전라남도 66
9.6%
강원도 54
7.9%
경상남도 54
7.9%
부산광역시 48
7.0%
충청남도 45
6.6%
전라북도 42
 
6.1%
충청북도 33
 
4.8%
Other values (6) 105
15.4%

Length

2023-12-11T08:09:23.837444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 93
13.6%
서울특별시 75
11.0%
경상북도 69
10.1%
전라남도 66
9.6%
강원도 54
7.9%
경상남도 54
7.9%
부산광역시 48
7.0%
충청남도 45
6.6%
전라북도 42
 
6.1%
충청북도 33
 
4.8%
Other values (6) 105
15.4%
Distinct206
Distinct (%)30.1%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-11T08:09:24.208625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9342105
Min length2

Characters and Unicode

Total characters2007
Distinct characters132
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강릉시
2nd row고성군
3rd row동해시
4th row삼척시
5th row속초시
ValueCountFrequency (%)
중구 18
 
2.6%
동구 18
 
2.6%
서구 15
 
2.2%
북구 12
 
1.8%
남구 12
 
1.8%
고성군 6
 
0.9%
강서구 6
 
0.9%
아산시 3
 
0.4%
태안군 3
 
0.4%
청양군 3
 
0.4%
Other values (196) 588
86.0%
2023-12-11T08:09:24.703116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
255
 
12.7%
234
 
11.7%
222
 
11.1%
66
 
3.3%
60
 
3.0%
54
 
2.7%
54
 
2.7%
51
 
2.5%
48
 
2.4%
39
 
1.9%
Other values (122) 924
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2007
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
255
 
12.7%
234
 
11.7%
222
 
11.1%
66
 
3.3%
60
 
3.0%
54
 
2.7%
54
 
2.7%
51
 
2.5%
48
 
2.4%
39
 
1.9%
Other values (122) 924
46.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2007
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
255
 
12.7%
234
 
11.7%
222
 
11.1%
66
 
3.3%
60
 
3.0%
54
 
2.7%
54
 
2.7%
51
 
2.5%
48
 
2.4%
39
 
1.9%
Other values (122) 924
46.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2007
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
255
 
12.7%
234
 
11.7%
222
 
11.1%
66
 
3.3%
60
 
3.0%
54
 
2.7%
54
 
2.7%
51
 
2.5%
48
 
2.4%
39
 
1.9%
Other values (122) 924
46.0%

학교 평균 접근시간(분)
Real number (ℝ)

HIGH CORRELATION 

Distinct518
Distinct (%)75.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.342982
Minimum4.79
Maximum85.97
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.1 KiB
2023-12-11T08:09:24.859818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4.79
5-th percentile5.2315
Q15.9275
median9.21
Q316.6975
95-th percentile27.2975
Maximum85.97
Range81.18
Interquartile range (IQR)10.77

Descriptive statistics

Standard deviation8.7764814
Coefficient of variation (CV)0.7110503
Kurtosis11.998568
Mean12.342982
Median Absolute Deviation (MAD)3.605
Skewness2.5878482
Sum8442.6
Variance77.026626
MonotonicityNot monotonic
2023-12-11T08:09:24.996964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5.61 6
 
0.9%
5.48 5
 
0.7%
5.81 5
 
0.7%
5.82 5
 
0.7%
5.83 5
 
0.7%
5.88 5
 
0.7%
5.54 5
 
0.7%
5.6 5
 
0.7%
5.85 5
 
0.7%
5.68 4
 
0.6%
Other values (508) 634
92.7%
ValueCountFrequency (%)
4.79 1
 
0.1%
4.89 1
 
0.1%
4.91 1
 
0.1%
4.92 1
 
0.1%
4.93 3
0.4%
4.96 1
 
0.1%
4.97 2
0.3%
4.98 1
 
0.1%
5.0 1
 
0.1%
5.05 2
0.3%
ValueCountFrequency (%)
85.97 1
0.1%
67.29 1
0.1%
59.87 1
0.1%
54.19 1
0.1%
53.35 1
0.1%
46.06 1
0.1%
44.15 1
0.1%
44.12 1
0.1%
43.1 1
0.1%
41.0 1
0.1%

초등학교 평균 접근시간(분)
Real number (ℝ)

HIGH CORRELATION 

Distinct470
Distinct (%)68.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.8479678
Minimum3.36
Maximum75.73
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.1 KiB
2023-12-11T08:09:25.168860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.36
5-th percentile3.7015
Q14.2
median5.79
Q312.1175
95-th percentile21.1715
Maximum75.73
Range72.37
Interquartile range (IQR)7.9175

Descriptive statistics

Standard deviation6.6244706
Coefficient of variation (CV)0.74869967
Kurtosis16.618895
Mean8.8479678
Median Absolute Deviation (MAD)1.985
Skewness2.7471227
Sum6052.01
Variance43.883611
MonotonicityNot monotonic
2023-12-11T08:09:25.329319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.95 6
 
0.9%
4.3 6
 
0.9%
4.0 5
 
0.7%
3.9 5
 
0.7%
4.17 5
 
0.7%
3.94 5
 
0.7%
4.24 5
 
0.7%
3.68 5
 
0.7%
4.14 4
 
0.6%
4.2 4
 
0.6%
Other values (460) 634
92.7%
ValueCountFrequency (%)
3.36 1
0.1%
3.39 1
0.1%
3.4 1
0.1%
3.41 1
0.1%
3.43 1
0.1%
3.46 2
0.3%
3.48 1
0.1%
3.49 1
0.1%
3.5 1
0.1%
3.52 1
0.1%
ValueCountFrequency (%)
75.73 1
0.1%
40.03 1
0.1%
39.21 1
0.1%
38.25 1
0.1%
30.63 1
0.1%
30.37 1
0.1%
30.08 1
0.1%
28.31 1
0.1%
27.74 1
0.1%
27.6 1
0.1%

중학교 평균 접근시간(분)
Real number (ℝ)

HIGH CORRELATION 

Distinct525
Distinct (%)76.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.480921
Minimum4.61
Maximum92.21
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.1 KiB
2023-12-11T08:09:25.723678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4.61
5-th percentile5.19
Q15.92
median8.88
Q316.71
95-th percentile28.562
Maximum92.21
Range87.6
Interquartile range (IQR)10.79

Descriptive statistics

Standard deviation9.0598788
Coefficient of variation (CV)0.72589826
Kurtosis13.261287
Mean12.480921
Median Absolute Deviation (MAD)3.42
Skewness2.6698712
Sum8536.95
Variance82.081404
MonotonicityNot monotonic
2023-12-11T08:09:25.861234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5.73 6
 
0.9%
5.56 6
 
0.9%
5.49 5
 
0.7%
5.4 5
 
0.7%
5.5 5
 
0.7%
5.58 5
 
0.7%
5.31 4
 
0.6%
5.83 4
 
0.6%
5.82 4
 
0.6%
5.66 4
 
0.6%
Other values (515) 636
93.0%
ValueCountFrequency (%)
4.61 1
0.1%
4.64 1
0.1%
4.68 1
0.1%
4.76 1
0.1%
4.8 1
0.1%
4.86 1
0.1%
4.87 1
0.1%
4.88 1
0.1%
4.89 1
0.1%
4.91 1
0.1%
ValueCountFrequency (%)
92.21 1
0.1%
72.59 1
0.1%
56.51 1
0.1%
51.27 1
0.1%
48.94 1
0.1%
48.03 1
0.1%
44.51 1
0.1%
44.25 1
0.1%
43.48 1
0.1%
42.52 1
0.1%

고등학교 평균 접근시간(분)
Real number (ℝ)

HIGH CORRELATION 

Distinct557
Distinct (%)81.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.705132
Minimum5.13
Maximum120
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.1 KiB
2023-12-11T08:09:26.014025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5.13
5-th percentile6.5615
Q17.78
median12.26
Q319.4125
95-th percentile34.116
Maximum120
Range114.87
Interquartile range (IQR)11.6325

Descriptive statistics

Standard deviation12.644355
Coefficient of variation (CV)0.8051098
Kurtosis27.058841
Mean15.705132
Median Absolute Deviation (MAD)4.84
Skewness4.183183
Sum10742.31
Variance159.87972
MonotonicityNot monotonic
2023-12-11T08:09:26.161799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7.3 6
 
0.9%
7.23 4
 
0.6%
8.67 4
 
0.6%
7.65 4
 
0.6%
6.85 4
 
0.6%
8.41 3
 
0.4%
7.55 3
 
0.4%
8.48 3
 
0.4%
7.06 3
 
0.4%
13.43 3
 
0.4%
Other values (547) 647
94.6%
ValueCountFrequency (%)
5.13 1
0.1%
5.35 1
0.1%
5.4 1
0.1%
5.58 1
0.1%
5.68 1
0.1%
5.69 1
0.1%
5.77 1
0.1%
5.83 1
0.1%
5.87 1
0.1%
5.94 1
0.1%
ValueCountFrequency (%)
120.0 3
0.4%
109.6 1
 
0.1%
89.12 1
 
0.1%
82.72 1
 
0.1%
82.07 1
 
0.1%
53.85 1
 
0.1%
51.77 1
 
0.1%
51.52 1
 
0.1%
47.4 1
 
0.1%
45.45 1
 
0.1%

Interactions

2023-12-11T08:09:22.956955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:21.947713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:22.325018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:22.641480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:23.039704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:22.037645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:22.408648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:22.716109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:23.121720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:22.129618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:22.493062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:22.817883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:23.197772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:22.227562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:22.564481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:09:22.885822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:09:26.255592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
통계연도시도명학교 평균 접근시간(분)초등학교 평균 접근시간(분)중학교 평균 접근시간(분)고등학교 평균 접근시간(분)
통계연도1.0000.0000.2600.2210.2360.136
시도명0.0001.0000.4800.4170.5410.533
학교 평균 접근시간(분)0.2600.4801.0000.8950.9300.899
초등학교 평균 접근시간(분)0.2210.4170.8951.0000.8440.784
중학교 평균 접근시간(분)0.2360.5410.9300.8441.0000.928
고등학교 평균 접근시간(분)0.1360.5330.8990.7840.9281.000
2023-12-11T08:09:26.373361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
통계연도시도명
통계연도1.0000.000
시도명0.0001.000
2023-12-11T08:09:26.471267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학교 평균 접근시간(분)초등학교 평균 접근시간(분)중학교 평균 접근시간(분)고등학교 평균 접근시간(분)통계연도시도명
학교 평균 접근시간(분)1.0000.9530.9740.9780.1180.219
초등학교 평균 접근시간(분)0.9531.0000.9450.9030.1510.204
중학교 평균 접근시간(분)0.9740.9451.0000.9290.1530.217
고등학교 평균 접근시간(분)0.9780.9030.9291.0000.0860.214
통계연도0.1180.1510.1530.0861.0000.000
시도명0.2190.2040.2170.2140.0001.000

Missing values

2023-12-11T08:09:23.330921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:09:23.447048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

통계연도시도명시군구명학교 평균 접근시간(분)초등학교 평균 접근시간(분)중학교 평균 접근시간(분)고등학교 평균 접근시간(분)
02017강원도강릉시10.386.7710.3913.99
12017강원도고성군17.110.5717.4123.33
22017강원도동해시7.955.249.29.4
32017강원도삼척시11.527.1212.5514.89
42017강원도속초시7.874.819.389.42
52017강원도양구군15.578.9612.4525.32
62017강원도양양군19.3413.020.5724.46
72017강원도영월군15.1212.4216.9116.05
82017강원도원주시8.695.788.8811.41
92017강원도인제군17.312.5818.0521.26
통계연도시도명시군구명학교 평균 접근시간(분)초등학교 평균 접근시간(분)중학교 평균 접근시간(분)고등학교 평균 접근시간(분)
6742019인천광역시중구8.074.868.8810.47
6752019인천광역시강화군17.0114.6120.016.42
6762019인천광역시계양구5.153.615.336.53
6772019인천광역시남동구5.193.644.976.98
6782019인천광역시부평구4.933.435.186.2
6792019인천광역시연수구5.393.855.496.85
6802019인천광역시옹진군67.2927.692.2182.07
6812019인천광역시미추홀구6.394.317.097.77
6822019제주특별자치도제주시9.666.049.4713.47
6832019제주특별자치도서귀포시13.118.5512.4718.32