Overview

Dataset statistics

Number of variables5
Number of observations648
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory27.3 KiB
Average record size in memory43.2 B

Variable types

Categorical3
Numeric2

Dataset

Description취업자격 체류외국인(C-4, E-1~E-7, E-9~E-10, H-2) 중 특정활동(E-7) 외국인의 유형별(전문인력/준전문인력/일반기능인력/숙련기능인력 등) 및 국적(지역)별 현황을 월별로 제공
Author법무부
URLhttps://www.data.go.kr/data/15100033/fileData.do

Alerts

인원 has 81 (12.5%) zerosZeros

Reproduction

Analysis started2024-04-29 22:59:26.091473
Analysis finished2024-04-29 22:59:28.132963
Duration2.04 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables


Categorical

Distinct3
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
2022
288 
2023
288 
2024
72 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 288
44.4%
2023 288
44.4%
2024 72
 
11.1%

Length

2024-04-30T07:59:28.196259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:59:28.308661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 288
44.4%
2023 288
44.4%
2024 72
 
11.1%


Real number (ℝ)

Distinct12
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.8 KiB
2024-04-30T07:59:28.426330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q39
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.5617754
Coefficient of variation (CV)0.59362924
Kurtosis-1.2762043
Mean6
Median Absolute Deviation (MAD)3
Skewness0.17787037
Sum3888
Variance12.686244
MonotonicityNot monotonic
2024-04-30T07:59:28.530054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
1 72
11.1%
2 72
11.1%
3 72
11.1%
4 48
7.4%
5 48
7.4%
6 48
7.4%
7 48
7.4%
8 48
7.4%
9 48
7.4%
10 48
7.4%
Other values (2) 96
14.8%
ValueCountFrequency (%)
1 72
11.1%
2 72
11.1%
3 72
11.1%
4 48
7.4%
5 48
7.4%
6 48
7.4%
7 48
7.4%
8 48
7.4%
9 48
7.4%
10 48
7.4%
ValueCountFrequency (%)
12 48
7.4%
11 48
7.4%
10 48
7.4%
9 48
7.4%
8 48
7.4%
7 48
7.4%
6 48
7.4%
5 48
7.4%
4 48
7.4%
3 72
11.1%

유형별
Categorical

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
전문인력E71
162 
준전문인력E72
162 
일반기능인력E73
162 
숙련기능인력E74
162 

Length

Max length9
Median length8.5
Mean length8.25
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전문인력E71
2nd row전문인력E71
3rd row전문인력E71
4th row전문인력E71
5th row전문인력E71

Common Values

ValueCountFrequency (%)
전문인력E71 162
25.0%
준전문인력E72 162
25.0%
일반기능인력E73 162
25.0%
숙련기능인력E74 162
25.0%

Length

2024-04-30T07:59:28.656535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:59:28.806416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전문인력e71 162
25.0%
준전문인력e72 162
25.0%
일반기능인력e73 162
25.0%
숙련기능인력e74 162
25.0%

국적
Categorical

Distinct6
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
중국
108 
베트남
108 
미국
108 
네팔
108 
인도
108 

Length

Max length3
Median length2
Mean length2.1666667
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중국
2nd row베트남
3rd row미국
4th row네팔
5th row인도

Common Values

ValueCountFrequency (%)
중국 108
16.7%
베트남 108
16.7%
미국 108
16.7%
네팔 108
16.7%
인도 108
16.7%
기타 108
16.7%

Length

2024-04-30T07:59:28.936082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:59:29.043722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중국 108
16.7%
베트남 108
16.7%
미국 108
16.7%
네팔 108
16.7%
인도 108
16.7%
기타 108
16.7%

인원
Real number (ℝ)

ZEROS 

Distinct421
Distinct (%)65.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1201.966
Minimum0
Maximum11281
Zeros81
Zeros (%)12.5%
Negative0
Negative (%)0.0%
Memory size5.8 KiB
2024-04-30T07:59:29.167399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q147
median379.5
Q31452
95-th percentile5642.6
Maximum11281
Range11281
Interquartile range (IQR)1405

Descriptive statistics

Standard deviation1842.4715
Coefficient of variation (CV)1.5328815
Kurtosis5.0323007
Mean1201.966
Median Absolute Deviation (MAD)379.5
Skewness2.1893774
Sum778874
Variance3394701.3
MonotonicityNot monotonic
2024-04-30T07:59:29.310608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 81
 
12.5%
6 12
 
1.9%
5 12
 
1.9%
4 11
 
1.7%
105 7
 
1.1%
14 7
 
1.1%
17 5
 
0.8%
131 4
 
0.6%
21 4
 
0.6%
135 4
 
0.6%
Other values (411) 501
77.3%
ValueCountFrequency (%)
0 81
12.5%
4 11
 
1.7%
5 12
 
1.9%
6 12
 
1.9%
11 2
 
0.3%
13 3
 
0.5%
14 7
 
1.1%
17 5
 
0.8%
19 2
 
0.3%
21 4
 
0.6%
ValueCountFrequency (%)
11281 1
0.2%
10745 1
0.2%
10342 1
0.2%
9847 1
0.2%
8102 1
0.2%
7126 1
0.2%
7010 1
0.2%
6949 1
0.2%
6817 1
0.2%
6780 1
0.2%

Interactions

2024-04-30T07:59:27.821998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:59:27.472663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:59:27.897697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:59:27.603391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T07:59:29.396847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
유형별국적인원
1.0000.4870.0000.0000.280
0.4871.0000.0000.0000.000
유형별0.0000.0001.0000.0000.497
국적0.0000.0000.0001.0000.590
인원0.2800.0000.4970.5901.000
2024-04-30T07:59:29.506105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국적유형별
국적1.0000.0000.000
0.0001.0000.000
유형별0.0000.0001.000
2024-04-30T07:59:29.600017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인원유형별국적
1.0000.0180.3340.0000.000
인원0.0181.0000.1740.3180.360
0.3340.1741.0000.0000.000
유형별0.0000.3180.0001.0000.000
국적0.0000.3600.0000.0001.000

Missing values

2024-04-30T07:59:28.009081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T07:59:28.094962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

유형별국적인원
020221전문인력E71중국870
120221전문인력E71베트남1252
220221전문인력E71미국1305
320221전문인력E71네팔50
420221전문인력E71인도535
520221전문인력E71기타4194
620221준전문인력E72중국5735
720221준전문인력E72베트남134
820221준전문인력E72미국5
920221준전문인력E72네팔368
유형별국적인원
63820243일반기능인력E73네팔23
63920243일반기능인력E73미국0
64020243일반기능인력E73인도274
64120243일반기능인력E73기타4797
64220243숙련기능인력E74중국65
64320243숙련기능인력E74베트남3179
64420243숙련기능인력E74네팔4880
64520243숙련기능인력E74미국0
64620243숙련기능인력E74인도0
64720243숙련기능인력E74기타11281