Overview

Dataset statistics

Number of variables: 7
Number of observations: 2232
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 130.9 KiB
Average record size in memory: 60.1 B
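
The headline figures above can be cross-checked with a little arithmetic. The sketch below uses the distinct counts reported for the five grouping columns in the Variables section (31, 3, 2, 6 and 2) and assumes the table holds one row per combination of them. That layout is an inference from the numbers, not something the report states.

```python
# Cross-check the headline statistics reported in the overview.
from math import prod

n_rows = 2232      # "Number of observations"
total_kib = 130.9  # "Total size in memory", in KiB

# Distinct counts taken from the Variables section of the report.
distinct = {"시군명": 31, "근로형태": 3, "성별": 2, "조사년도": 6, "조사반기": 2}

# Assumption: one row per combination of the five grouping columns.
expected_rows = prod(distinct.values())

# Average record size in bytes = total bytes / number of rows.
avg_record_bytes = total_kib * 1024 / n_rows

print(expected_rows)               # 2232
print(round(avg_record_bytes, 1))  # 60.1
```

If the assumption holds, the dataset is a balanced grid: every city, employment type, gender, year and half-year combination appears exactly once.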

Variable types

Categorical: 4
Numeric: 3

Dataset

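Description: Average wage by employment type and gender, related to non-regular employment in Gyeonggi Province (경기도 비정규직 관련 근로형태별 성별 평균임금)
Author: Gyeonggi Province (경기도)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=QJ02BXV512A41KQ6442N30069568&infSeq=1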

Alerts

High correlation: 근로자수 is highly correlated overall with 평균임금(만원)
High correlation: 평균임금(만원) is highly correlated overall with 근로자수 and 2 other fields
High correlation: 근로형태 is highly correlated overall with 평균임금(만원)
High correlation: 성별 is highly correlated overall with 평균임금(만원)

Reproduction

Analysis started: 2023-12-10 22:32:19.752843
Analysis finished: 2023-12-10 22:32:21.222383
Duration: 1.47 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
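
The reported duration follows directly from the two timestamps. A minimal check using only the standard library:

```python
# Recompute the analysis duration from the reported start and finish times.
from datetime import datetime

started = datetime.fromisoformat("2023-12-10 22:32:19.752843")
finished = datetime.fromisoformat("2023-12-10 22:32:21.222383")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 1.47 seconds
```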

Variables

시군명 (city/county name)
Categorical

Distinct: 31
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB

의정부시: 72
안양시: 72
부천시: 72
광명시: 72
평택시: 72
Other values (26): 1872

Length

Max length: 4
Median length: 3
Mean length: 3.0967742
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 의정부시
2nd row: 의정부시
3rd row: 의정부시
4th row: 의정부시
5th row: 의정부시

Common Values

Value | Count | Frequency (%)
의정부시 | 72 | 3.2%
안양시 | 72 | 3.2%
부천시 | 72 | 3.2%
광명시 | 72 | 3.2%
평택시 | 72 | 3.2%
동두천시 | 72 | 3.2%
안산시 | 72 | 3.2%
고양시 | 72 | 3.2%
과천시 | 72 | 3.2%
구리시 | 72 | 3.2%
Other values (21) | 1512 | 67.7%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
의정부시 | 72 | 3.2%
용인시 | 72 | 3.2%
수원시 | 72 | 3.2%
양평군 | 72 | 3.2%
가평군 | 72 | 3.2%
연천군 | 72 | 3.2%
여주시 | 72 | 3.2%
포천시 | 72 | 3.2%
양주시 | 72 | 3.2%
광주시 | 72 | 3.2%
Other values (21) | 1512 | 67.7%

근로형태 (employment type)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB

상용직 (regular): 744
임시직 (temporary): 744
일용직 (daily): 744

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 상용직
2nd row: 임시직
3rd row: 임시직
4th row: 일용직
5th row: 일용직

Common Values

Value | Count | Frequency (%)
상용직 | 744 | 33.3%
임시직 | 744 | 33.3%
일용직 | 744 | 33.3%

Length

Histogram of lengths of the category

성별 (gender)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB

여성 (female): 1116
남성 (male): 1116

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 여성
2nd row: 남성
3rd row: 여성
4th row: 남성
5th row: 여성

Common Values

Value | Count | Frequency (%)
여성 | 1116 | 50.0%
남성 | 1116 | 50.0%

Length

Histogram of lengths of the category

조사년도 (survey year)
Real number (ℝ)

Distinct: 6
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2015.5
Minimum: 2013
Maximum: 2018
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 19.7 KiB

Quantile statistics

Minimum: 2013
5-th percentile: 2013
Q1: 2014
Median: 2015.5
Q3: 2017
95-th percentile: 2018
Maximum: 2018
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.7082078
Coefficient of variation (CV): 0.00084753552
Kurtosis: -1.2687248
Mean: 2015.5
Median Absolute Deviation (MAD): 1.5
Skewness: 0
Sum: 4498596
Variance: 2.917974
Monotonicity: Not monotonic
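
Because 조사년도 takes each of the six years 2013 to 2018 exactly 372 times, most of the descriptive statistics above can be reproduced from the frequency table alone. The sketch below rebuilds the column from those counts and assumes the report uses the sample (n - 1) variance, which is consistent with the reported value; kurtosis is left out because the exact estimator is not stated in the report.

```python
# Rebuild 조사년도 from its frequency table and recompute the reported statistics.
import statistics

years = [year for year in range(2013, 2019) for _ in range(372)]

mean = statistics.fmean(years)
median = statistics.median(years)
variance = statistics.variance(years)  # sample variance, divides by n - 1
std = statistics.stdev(years)          # square root of the sample variance
cv = std / mean                        # coefficient of variation

# Median absolute deviation: median of |x - median|.
mad = statistics.median(abs(y - median) for y in years)

print(len(years), sum(years))  # 2232 4498596
print(mean, median, mad)       # 2015.5 2015.5 1.5
print(round(variance, 6))      # 2.917974
```

The skewness of 0 needs no computation: the counts are symmetric around 2015.5.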
Histogram with fixed size bins (bins=6)

Value | Count | Frequency (%)
2014 | 372 | 16.7%
2013 | 372 | 16.7%
2016 | 372 | 16.7%
2015 | 372 | 16.7%
2017 | 372 | 16.7%
2018 | 372 | 16.7%

Value | Count | Frequency (%)
2013 | 372 | 16.7%
2014 | 372 | 16.7%
2015 | 372 | 16.7%
2016 | 372 | 16.7%
2017 | 372 | 16.7%
2018 | 372 | 16.7%

Value | Count | Frequency (%)
2018 | 372 | 16.7%
2017 | 372 | 16.7%
2016 | 372 | 16.7%
2015 | 372 | 16.7%
2014 | 372 | 16.7%
2013 | 372 | 16.7%

조사반기 (survey half-year)
Categorical

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB

2: 1116
1: 1116

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 2
3rd row: 2
4th row: 2
5th row: 2

Common Values

Value | Count | Frequency (%)
2 | 1116 | 50.0%
1 | 1116 | 50.0%

Length

Histogram of lengths of the category

근로자수 (number of workers)
Real number (ℝ)

HIGH CORRELATION

Distinct: 1975
Distinct (%): 88.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 25554.552
Minimum: 97
Maximum: 251881
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 19.7 KiB

Quantile statistics

Minimum: 97
5-th percentile: 1004.2
Q1: 3862
Median: 10371.5
Q3: 29605
95-th percentile: 110692.4
Maximum: 251881
Range: 251784
Interquartile range (IQR): 25743

Descriptive statistics

Standard deviation: 37967.719
Coefficient of variation (CV): 1.4857517
Kurtosis: 7.8192695
Mean: 25554.552
Median Absolute Deviation (MAD): 8112.5
Skewness: 2.6741398
Sum: 57037761
Variance: 1.4415477 × 10^9
Monotonicity: Not monotonic
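
Several of the 근로자수 statistics above are derived from others, so they can be checked against each other. The sketch below uses only the rounded values printed in the report, which means the comparisons are approximate; the dictionary keys are illustrative names, not identifiers from the profiling tool.

```python
# Consistency checks between the reported 근로자수 statistics.
reported = {
    "n": 2232,
    "min": 97,
    "max": 251881,
    "q1": 3862,
    "q3": 29605,
    "median": 10371.5,
    "mean": 25554.552,
    "std": 37967.719,
    "sum": 57037761,
}

value_range = reported["max"] - reported["min"]  # Range
iqr = reported["q3"] - reported["q1"]            # Interquartile range
mean = reported["sum"] / reported["n"]           # Mean = Sum / observations
cv = reported["std"] / reported["mean"]          # Coefficient of variation
variance = reported["std"] ** 2                  # Variance = std squared

print(value_range, iqr)      # 251784 25743
print(round(mean, 3))        # 25554.552
print(f"{variance:.4e}")
print(round(cv, 5))
```

The mean is roughly 2.5 times the median, which fits the strongly positive skewness (2.67) reported for this column.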
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
4358 | 4 | 0.2%
3258 | 4 | 0.2%
6183 | 3 | 0.1%
2102 | 3 | 0.1%
13394 | 3 | 0.1%
3353 | 3 | 0.1%
9878 | 3 | 0.1%
5429 | 3 | 0.1%
6673 | 3 | 0.1%
12452 | 3 | 0.1%
Other values (1965) | 2200 | 98.6%

Value | Count | Frequency (%)
97 | 1 | < 0.1%
123 | 1 | < 0.1%
142 | 1 | < 0.1%
143 | 1 | < 0.1%
144 | 1 | < 0.1%
145 | 1 | < 0.1%
156 | 1 | < 0.1%
157 | 1 | < 0.1%
182 | 1 | < 0.1%
191 | 1 | < 0.1%

Value | Count | Frequency (%)
251881 | 1 | < 0.1%
236630 | 1 | < 0.1%
232337 | 1 | < 0.1%
226894 | 1 | < 0.1%
222775 | 2 | 0.1%
217759 | 1 | < 0.1%
217085 | 1 | < 0.1%
216098 | 1 | < 0.1%
216022 | 1 | < 0.1%
209497 | 1 | < 0.1%

평균임금(만원) (average wage, in units of 10,000 KRW)
Real number (ℝ)

HIGH CORRELATION

Distinct: 1396
Distinct (%): 62.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 175.23763
Minimum: 35
Maximum: 506.5
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 19.7 KiB

Quantile statistics

Minimum: 35
5-th percentile: 72.82
Q1: 122.475
Median: 163.55
Q3: 205.625
95-th percentile: 334.59
Maximum: 506.5
Range: 471.5
Interquartile range (IQR): 83.15

Descriptive statistics

Standard deviation: 78.676646
Coefficient of variation (CV): 0.44897117
Kurtosis: 0.88350952
Mean: 175.23763
Median Absolute Deviation (MAD): 41.45
Skewness: 0.99747575
Sum: 391130.4
Variance: 6190.0146
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
124.6 | 8 | 0.4%
170.9 | 6 | 0.3%
130.6 | 6 | 0.3%
179.0 | 5 | 0.2%
140.1 | 5 | 0.2%
87.3 | 5 | 0.2%
198.7 | 5 | 0.2%
307.6 | 5 | 0.2%
121.9 | 5 | 0.2%
121.5 | 5 | 0.2%
Other values (1386) | 2177 | 97.5%

Value | Count | Frequency (%)
35.0 | 1 | < 0.1%
37.2 | 1 | < 0.1%
38.9 | 1 | < 0.1%
40.6 | 1 | < 0.1%
42.5 | 1 | < 0.1%
43.3 | 1 | < 0.1%
43.8 | 1 | < 0.1%
46.6 | 1 | < 0.1%
48.3 | 1 | < 0.1%
49.3 | 1 | < 0.1%

Value | Count | Frequency (%)
506.5 | 1 | < 0.1%
504.7 | 1 | < 0.1%
489.2 | 2 | 0.1%
468.9 | 1 | < 0.1%
464.1 | 1 | < 0.1%
447.7 | 1 | < 0.1%
442.9 | 1 | < 0.1%
441.5 | 1 | < 0.1%
440.2 | 1 | < 0.1%
437.2 | 1 | < 0.1%

Interactions

[Figures: nine pairwise interaction plots]

Correlations

 | 시군명 | 근로형태 | 성별 | 조사년도 | 조사반기 | 근로자수 | 평균임금(만원)
시군명 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.650 | 0.469
근로형태 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.620 | 0.764
성별 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.234 | 0.808
조사년도 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.198
조사반기 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.034
근로자수 | 0.650 | 0.620 | 0.234 | 0.000 | 0.000 | 1.000 | 0.790
평균임금(만원) | 0.469 | 0.764 | 0.808 | 0.198 | 0.034 | 0.790 | 1.000

 | 근로형태 | 시군명 | 성별 | 조사반기
근로형태 | 1.000 | 0.000 | 0.000 | 0.000
시군명 | 0.000 | 1.000 | 0.000 | 0.000
성별 | 0.000 | 0.000 | 1.000 | 0.000
조사반기 | 0.000 | 0.000 | 0.000 | 1.000
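
The all-zero off-diagonal entries in the categorical-only matrix above are what an association measure such as Cramér's V produces when every combination of categories occurs equally often. The sketch below illustrates this with a plain, uncorrected Cramér's V on a hypothetical balanced layout matching the report's distinct counts. The placeholder city labels and the `cramers_v` helper are assumptions for illustration, and the coefficient the report actually computed may differ, for example by applying a bias correction.

```python
# Plain Cramér's V on a hypothetical balanced layout (stdlib only).
from collections import Counter
from itertools import product
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for r, n_r in row_totals.items():
        for c, n_c in col_totals.items():
            expected = n_r * n_c / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Hypothetical data: one row per combination, with placeholder city labels.
cities = [f"city_{i}" for i in range(31)]
rows = list(product(cities, ["상용직", "임시직", "일용직"], ["여성", "남성"],
                    range(2013, 2019), ["1", "2"]))
employment = [row[1] for row in rows]
gender = [row[2] for row in rows]

print(len(rows))                      # 2232
print(cramers_v(employment, gender))  # 0.0
print(cramers_v(gender, gender))      # 1.0
```

In a balanced grid the observed count in every cell equals its expected count, so the chi-squared statistic, and with it the coefficient, is exactly zero.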

 | 조사년도 | 근로자수 | 평균임금(만원) | 시군명 | 근로형태 | 성별 | 조사반기
조사년도 | 1.000 | 0.007 | 0.154 | 0.000 | 0.000 | 0.000 | 0.000
근로자수 | 0.007 | 1.000 | 0.645 | 0.290 | 0.464 | 0.179 | 0.000
평균임금(만원) | 0.154 | 0.645 | 1.000 | 0.183 | 0.639 | 0.642 | 0.026
시군명 | 0.000 | 0.290 | 0.183 | 1.000 | 0.000 | 0.000 | 0.000
근로형태 | 0.000 | 0.464 | 0.639 | 0.000 | 1.000 | 0.000 | 0.000
성별 | 0.000 | 0.179 | 0.642 | 0.000 | 0.000 | 1.000 | 0.000
조사반기 | 0.000 | 0.000 | 0.026 | 0.000 | 0.000 | 0.000 | 1.000

Missing values

[Figure: a simple visualization of nullity by column.]
[Figure: nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

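 | 시군명 | 근로형태 | 성별 | 조사년도 | 조사반기 | 근로자수 | 평균임금(만원)
0 | 의정부시 | 상용직 | 여성 | 2014 | 2 | 39038 | 180.5
1 | 의정부시 | 임시직 | 남성 | 2014 | 2 | 14753 | 169.6
2 | 의정부시 | 임시직 | 여성 | 2014 | 2 | 17371 | 116.6
3 | 의정부시 | 일용직 | 남성 | 2014 | 2 | 6672 | 125.2
4 | 의정부시 | 일용직 | 여성 | 2014 | 2 | 3248 | 77.7
5 | 안양시 | 상용직 | 남성 | 2014 | 2 | 108727 | 338.8
6 | 안양시 | 상용직 | 여성 | 2014 | 2 | 60328 | 206.6
7 | 안양시 | 임시직 | 남성 | 2014 | 2 | 19329 | 144.3
8 | 안양시 | 임시직 | 여성 | 2014 | 2 | 30630 | 103.8
9 | 안양시 | 일용직 | 남성 | 2014 | 2 | 7083 | 137.3

 | 시군명 | 근로형태 | 성별 | 조사년도 | 조사반기 | 근로자수 | 평균임금(만원)
2222 | 의정부시 | 임시직 | 남성 | 2017 | 2 | 10962 | 179.7
2223 | 의정부시 | 임시직 | 여성 | 2017 | 2 | 17381 | 134.3
2224 | 의정부시 | 일용직 | 남성 | 2017 | 2 | 6934 | 173.3
2225 | 의정부시 | 일용직 | 여성 | 2017 | 2 | 1999 | 106.0
2226 | 안양시 | 상용직 | 남성 | 2017 | 2 | 116007 | 376.1
2227 | 안양시 | 상용직 | 여성 | 2017 | 2 | 72946 | 239.4
2228 | 안양시 | 임시직 | 남성 | 2017 | 2 | 13573 | 178.1
2229 | 안양시 | 임시직 | 여성 | 2017 | 2 | 21143 | 120.6
2230 | 안양시 | 일용직 | 남성 | 2017 | 2 | 5875 | 177.4
2231 | 안양시 | 일용직 | 여성 | 2017 | 2 | 3620 | 85.5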