Overview

Dataset statistics

Number of variables7
Number of observations1700
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory99.7 KiB
Average record size in memory60.1 B

Variable types

Categorical3
Text1
Numeric3

Dataset

Description김해시에서 통계기반 도시현황 파악을 위해 개발한 통계지수 중 하나로서, 통계연기, 시도명, 시군구명, 성별, 실업률(퍼센트), 실업자(천명), 경제활동인구수(천명)로 구성되어 있습니다. 김해시 중심의 통계지수로서, 데이터 수집, 가공 등의 어려움으로 김해시 외 지역의 정보는 누락될 수 있습니다.
Author경상남도 김해시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15110099

Alerts

실업률(퍼센트) is highly overall correlated with 실업자(천명) and 1 other fieldsHigh correlation
실업자(천명) is highly overall correlated with 실업률(퍼센트) and 1 other fieldsHigh correlation
경제활동인구수(천명) is highly overall correlated with 실업률(퍼센트) and 1 other fieldsHigh correlation
실업률(퍼센트) has 73 (4.3%) zerosZeros
실업자(천명) has 73 (4.3%) zerosZeros

Reproduction

Analysis started2023-12-10 23:23:22.079463
Analysis finished2023-12-10 23:23:23.647835
Duration1.57 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

통계연도
Categorical

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
2021
456 
2020
313 
2019
312 
2018
310 
2017
309 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2021 456
26.8%
2020 313
18.4%
2019 312
18.4%
2018 310
18.2%
2017 309
18.2%

Length

2023-12-11T08:23:23.721905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:23:23.856108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 456
26.8%
2020 313
18.4%
2019 312
18.4%
2018 310
18.2%
2017 309
18.2%

분기
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
하반기
851 
상반기
849 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상반기
2nd row상반기
3rd row상반기
4th row상반기
5th row상반기

Common Values

ValueCountFrequency (%)
하반기 851
50.1%
상반기 849
49.9%

Length

2023-12-11T08:23:24.003330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:23:24.133391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
하반기 851
50.1%
상반기 849
49.9%

시도명
Categorical

Distinct16
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
경기도
311 
경상북도
230 
전라남도
220 
강원도
180 
경상남도
180 
Other values (11)
579 

Length

Max length7
Median length4
Mean length3.8335294
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row경기도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 311
18.3%
경상북도 230
13.5%
전라남도 220
12.9%
강원도 180
10.6%
경상남도 180
10.6%
충청남도 157
9.2%
전라북도 140
8.2%
충청북도 114
 
6.7%
서울특별시 50
 
2.9%
부산광역시 32
 
1.9%
Other values (6) 86
 
5.1%

Length

2023-12-11T08:23:24.302127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 311
18.3%
경상북도 230
13.5%
전라남도 220
12.9%
강원도 180
10.6%
경상남도 180
10.6%
충청남도 157
9.2%
전라북도 140
8.2%
충청북도 114
 
6.7%
서울특별시 50
 
2.9%
부산광역시 32
 
1.9%
Other values (6) 86
 
5.1%
Distinct209
Distinct (%)12.3%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
2023-12-11T08:23:24.606271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.0011765
Min length2

Characters and Unicode

Total characters5102
Distinct characters132
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row수원시
2nd row성남시
3rd row의정부시
4th row안양시
5th row부천시
ValueCountFrequency (%)
고성군 20
 
1.2%
동구 12
 
0.7%
중구 12
 
0.7%
성남시 10
 
0.6%
김천시 10
 
0.6%
수원시 10
 
0.6%
장성군 10
 
0.6%
강진군 10
 
0.6%
해남군 10
 
0.6%
영암군 10
 
0.6%
Other values (199) 1586
93.3%
2023-12-11T08:23:25.004357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
822
 
16.1%
780
 
15.3%
204
 
4.0%
193
 
3.8%
180
 
3.5%
164
 
3.2%
140
 
2.7%
136
 
2.7%
110
 
2.2%
96
 
1.9%
Other values (122) 2277
44.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5102
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
822
 
16.1%
780
 
15.3%
204
 
4.0%
193
 
3.8%
180
 
3.5%
164
 
3.2%
140
 
2.7%
136
 
2.7%
110
 
2.2%
96
 
1.9%
Other values (122) 2277
44.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5102
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
822
 
16.1%
780
 
15.3%
204
 
4.0%
193
 
3.8%
180
 
3.5%
164
 
3.2%
140
 
2.7%
136
 
2.7%
110
 
2.2%
96
 
1.9%
Other values (122) 2277
44.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5102
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
822
 
16.1%
780
 
15.3%
204
 
4.0%
193
 
3.8%
180
 
3.5%
164
 
3.2%
140
 
2.7%
136
 
2.7%
110
 
2.2%
96
 
1.9%
Other values (122) 2277
44.6%

실업률(퍼센트)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct516
Distinct (%)30.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.3409118
Minimum0
Maximum7.1
Zeros73
Zeros (%)4.3%
Negative0
Negative (%)0.0%
Memory size15.1 KiB
2023-12-11T08:23:25.139527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.2995
Q11.13
median2.1
Q33.44
95-th percentile5.0505
Maximum7.1
Range7.1
Interquartile range (IQR)2.31

Descriptive statistics

Standard deviation1.5100428
Coefficient of variation (CV)0.64506609
Kurtosis-0.61526265
Mean2.3409118
Median Absolute Deviation (MAD)1.13
Skewness0.48350468
Sum3979.55
Variance2.2802292
MonotonicityNot monotonic
2023-12-11T08:23:25.276466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 73
 
4.3%
0.62 15
 
0.9%
0.67 12
 
0.7%
1.03 10
 
0.6%
0.92 10
 
0.6%
2.42 9
 
0.5%
1.35 9
 
0.5%
2.32 9
 
0.5%
1.74 9
 
0.5%
1.15 9
 
0.5%
Other values (506) 1535
90.3%
ValueCountFrequency (%)
0.0 73
4.3%
0.24 4
 
0.2%
0.25 7
 
0.4%
0.29 1
 
0.1%
0.3 1
 
0.1%
0.31 3
 
0.2%
0.32 3
 
0.2%
0.33 1
 
0.1%
0.34 1
 
0.1%
0.35 3
 
0.2%
ValueCountFrequency (%)
7.1 1
0.1%
6.98 1
0.1%
6.7 1
0.1%
6.59 1
0.1%
6.35 1
0.1%
6.24 1
0.1%
6.23 1
0.1%
6.21 1
0.1%
6.19 1
0.1%
6.15 1
0.1%

실업자(천명)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct207
Distinct (%)12.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.4900588
Minimum0
Maximum30.6
Zeros73
Zeros (%)4.3%
Negative0
Negative (%)0.0%
Memory size15.1 KiB
2023-12-11T08:23:25.410621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.1
Q10.3
median1
Q34.425
95-th percentile15.705
Maximum30.6
Range30.6
Interquartile range (IQR)4.125

Descriptive statistics

Standard deviation5.1676463
Coefficient of variation (CV)1.480676
Kurtosis4.4026021
Mean3.4900588
Median Absolute Deviation (MAD)0.9
Skewness2.1259799
Sum5933.1
Variance26.704569
MonotonicityNot monotonic
2023-12-11T08:23:25.530270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.1 165
 
9.7%
0.2 151
 
8.9%
0.3 116
 
6.8%
0.4 91
 
5.4%
0.0 73
 
4.3%
0.5 70
 
4.1%
0.6 49
 
2.9%
0.8 47
 
2.8%
0.7 39
 
2.3%
0.9 32
 
1.9%
Other values (197) 867
51.0%
ValueCountFrequency (%)
0.0 73
4.3%
0.1 165
9.7%
0.2 151
8.9%
0.3 116
6.8%
0.4 91
5.4%
0.5 70
4.1%
0.6 49
 
2.9%
0.7 39
 
2.3%
0.8 47
 
2.8%
0.9 32
 
1.9%
ValueCountFrequency (%)
30.6 1
 
0.1%
28.1 1
 
0.1%
26.4 1
 
0.1%
26.2 1
 
0.1%
25.8 1
 
0.1%
25.0 1
 
0.1%
24.4 3
0.2%
24.3 1
 
0.1%
24.1 1
 
0.1%
24.0 1
 
0.1%

경제활동인구수(천명)
Real number (ℝ)

HIGH CORRELATION 

Distinct1034
Distinct (%)60.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean104.07159
Minimum0
Maximum655.4
Zeros12
Zeros (%)0.7%
Negative0
Negative (%)0.0%
Memory size15.1 KiB
2023-12-11T08:23:25.947099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile14.795
Q124.9
median52.4
Q3140.2
95-th percentile388.465
Maximum655.4
Range655.4
Interquartile range (IQR)115.3

Descriptive statistics

Standard deviation122.14999
Coefficient of variation (CV)1.1737112
Kurtosis4.1535459
Mean104.07159
Median Absolute Deviation (MAD)33.45
Skewness2.0508973
Sum176921.7
Variance14920.62
MonotonicityNot monotonic
2023-12-11T08:23:26.102522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 12
 
0.7%
21.7 11
 
0.6%
16.1 9
 
0.5%
23.6 9
 
0.5%
16.4 9
 
0.5%
16.6 9
 
0.5%
14.4 8
 
0.5%
16.9 7
 
0.4%
21.8 7
 
0.4%
25.9 7
 
0.4%
Other values (1024) 1612
94.8%
ValueCountFrequency (%)
0.0 12
0.7%
5.6 1
 
0.1%
5.7 2
 
0.1%
5.8 1
 
0.1%
6.1 1
 
0.1%
6.2 3
 
0.2%
6.3 1
 
0.1%
6.4 1
 
0.1%
10.3 3
 
0.2%
10.4 2
 
0.1%
ValueCountFrequency (%)
655.4 1
0.1%
654.7 1
0.1%
651.2 1
0.1%
650.6 1
0.1%
635.7 1
0.1%
634.5 1
0.1%
632.4 1
0.1%
630.4 1
0.1%
620.3 1
0.1%
612.5 1
0.1%

Interactions

2023-12-11T08:23:23.085134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:23:22.447742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:23:22.771915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:23:23.200624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:23:22.547768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:23:22.879826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:23:23.309070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:23:22.660084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:23:22.965827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:23:26.211353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
통계연도분기시도명실업률(퍼센트)실업자(천명)경제활동인구수(천명)
통계연도1.0000.0000.4340.2490.1870.166
분기0.0001.0000.0000.0610.0000.000
시도명0.4340.0001.0000.5820.5310.574
실업률(퍼센트)0.2490.0610.5821.0000.7060.643
실업자(천명)0.1870.0000.5310.7061.0000.904
경제활동인구수(천명)0.1660.0000.5740.6430.9041.000
2023-12-11T08:23:26.317800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분기통계연도시도명
분기1.0000.0000.000
통계연도0.0001.0000.237
시도명0.0000.2371.000
2023-12-11T08:23:26.432624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
실업률(퍼센트)실업자(천명)경제활동인구수(천명)통계연도분기시도명
실업률(퍼센트)1.0000.9010.7180.1060.0470.272
실업자(천명)0.9011.0000.9360.0790.0000.239
경제활동인구수(천명)0.7180.9361.0000.0700.0000.266
통계연도0.1060.0790.0701.0000.0000.237
분기0.0470.0000.0000.0001.0000.000
시도명0.2720.2390.2660.2370.0001.000

Missing values

2023-12-11T08:23:23.466056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:23:23.595971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

통계연도분기시도명시군구명실업률(퍼센트)실업자(천명)경제활동인구수(천명)
02017상반기경기도수원시3.6122.1612.5
12017상반기경기도성남시2.6813.0485.4
22017상반기경기도의정부시5.010.5210.0
32017상반기경기도안양시3.3210.3310.6
42017상반기경기도부천시4.4920.0445.3
52017상반기경기도광명시4.527.5165.8
62017상반기경기도평택시2.034.9241.5
72017상반기경기도동두천시4.822.347.7
82017상반기경기도안산시4.2917.2400.7
92017상반기경기도고양시3.6918.0487.7
통계연도분기시도명시군구명실업률(퍼센트)실업자(천명)경제활동인구수(천명)
16902021하반기경상남도창녕군1.370.536.6
16912021하반기경상남도고성군1.960.630.6
16922021하반기경상남도남해군1.160.325.9
16932021하반기경상남도하동군1.190.325.2
16942021하반기경상남도산청군0.440.122.5
16952021하반기경상남도함양군0.880.222.8
16962021하반기경상남도거창군0.830.336.0
16972021하반기경상남도합천군0.40.124.7
16982021하반기제주특별자치도제주시2.67.3280.6
16992021하반기제주특별자치도서귀포시1.351.5111.2