Overview

Dataset statistics

Number of variables: 5
Number of observations: 1080
Missing cells: 1
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 45.5 KiB
Average record size in memory: 43.1 B
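
The derived figures above follow from the raw counts. The sketch below redoes the arithmetic, assuming that 1 KiB = 1024 bytes and that the missing-cell percentage is taken over all cells (variables × observations). The variable names are illustrative and are not part of the report.

```python
# Dataset-level figures reported in the overview.
n_variables = 5
n_observations = 1080
missing_cells = 1
total_size_kib = 45.5

# Share of missing cells over all cells (assumed denominator: rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share is well below 0.1% and the average record size rounds to the reported 43.1 B.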

Variable types

Categorical: 3
Numeric: 2

Dataset

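Description: Monthly counts, by nationality (region), of Working Visit (H-2) visa holders among foreign residents with employment-eligible status (C-4, E-1 to E-7, E-9 to E-10, H-2).
Author: Ministry of Justice (법무부)
URL: https://www.data.go.kr/data/15100032/fileData.do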

Alerts

방문취업자수 has 406 (37.6%) zeros (alert type: Zeros)

Reproduction

Analysis started: 2024-04-29 22:59:18.393040
Analysis finished: 2024-04-29 22:59:20.860513
Duration: 2.47 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
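
The reported duration can be checked against the two timestamps. This is a minimal sketch using only the Python standard library. The variable names are illustrative.

```python
from datetime import datetime

# Timestamps as printed in the Reproduction section.
started = datetime.fromisoformat("2024-04-29 22:59:18.393040")
finished = datetime.fromisoformat("2024-04-29 22:59:20.860513")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```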

Variables


Categorical

Distinct: 3
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 8.6 KiB

2022: 480
2023: 480
2024: 120

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value  Count  Frequency (%)
2022   480    44.4%
2023   480    44.4%
2024   120    11.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]


Real number (ℝ)

Distinct: 12
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 9.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 6
Q3: 9
95-th percentile: 12
Maximum: 12
Range: 11
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 3.5606749
Coefficient of variation (CV): 0.59344582
Kurtosis: -1.2759725
Mean: 6
Median Absolute Deviation (MAD): 3
Skewness: 0.17770527
Sum: 6480
Variance: 12.678406
Monotonicity: Not monotonic
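
Several of the statistics above can be re-derived from this variable's frequency table (values 1 to 3 occur 120 times each, values 4 to 12 occur 80 times each). The sketch below rebuilds the 1080 observations and recomputes them with Python's standard library. It assumes the report uses the sample variance (n − 1 denominator) and linearly interpolated quantiles, which is what the reported figures are consistent with. The variable names are illustrative.

```python
import statistics

# Frequency table reported for this variable: values 1-3 occur 120 times each,
# values 4-12 occur 80 times each.
counts = {value: (120 if value <= 3 else 80) for value in range(1, 13)}

# Expand the table back into the individual observations.
data = [value for value, n in counts.items() for _ in range(n)]

mean = statistics.mean(data)
variance = statistics.variance(data)  # sample variance (n - 1 denominator)
std = statistics.stdev(data)          # square root of the sample variance
cv = std / mean                       # coefficient of variation

# Quartiles with linear interpolation over the sorted data.
q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")

print(len(data), sum(data), mean)
print(round(variance, 6), round(std, 7), round(cv, 8))
print(q1, median, q3, q3 - q1)
```
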
[Figure: Histogram with fixed size bins (bins=12)]

Common values

Value             Count  Frequency (%)
1                 120    11.1%
2                 120    11.1%
3                 120    11.1%
4                 80     7.4%
5                 80     7.4%
6                 80     7.4%
7                 80     7.4%
8                 80     7.4%
9                 80     7.4%
10                80     7.4%
Other values (2)  160    14.8%

Minimum 10 values

Value  Count  Frequency (%)
1      120    11.1%
2      120    11.1%
3      120    11.1%
4      80     7.4%
5      80     7.4%
6      80     7.4%
7      80     7.4%
8      80     7.4%
9      80     7.4%
10     80     7.4%

Maximum 10 values

Value  Count  Frequency (%)
12     80     7.4%
11     80     7.4%
10     80     7.4%
9      80     7.4%
8      80     7.4%
7      80     7.4%
6      80     7.4%
5      80     7.4%
4      80     7.4%
3      120    11.1%

방문취업자격
Categorical

Distinct: 8
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 8.6 KiB

연고방취H21: 135
유학방취H22: 135
자진방취H23: 135
연수방취H24: 135
추첨방취H25: 135
Other values (3): 405

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 연고방취H21
2nd row: 연고방취H21
3rd row: 연고방취H21
4th row: 연고방취H21
5th row: 연고방취H21

Common Values

Value        Count  Frequency (%)
연고방취H21  135    12.5%
유학방취H22  135    12.5%
자진방취H23  135    12.5%
연수방취H24  135    12.5%
추첨방취H25  135    12.5%
변경방취H26  135    12.5%
만기방취H27  135    12.5%
기타방취H29  135    12.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

국적지역 (nationality/region)
Categorical

Distinct: 5
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 8.6 KiB

중국: 216
우즈베키스탄: 216
카자흐스탄: 216
키르기즈: 216
기타: 216

Length

Max length: 6
Median length: 5
Mean length: 3.8
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 중국
2nd row: 우즈베키스탄
3rd row: 카자흐스탄
4th row: 키르기즈
5th row: 기타

Common Values

Value                      Count  Frequency (%)
중국 (China)               216    20.0%
우즈베키스탄 (Uzbekistan)  216    20.0%
카자흐스탄 (Kazakhstan)    216    20.0%
키르기즈 (Kyrgyzstan)      216    20.0%
기타 (Other)               216    20.0%
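
The category counts in this profile fit a fully crossed layout. The sketch below checks that reading, assuming one row per combination of year, month, 방문취업자격 value, and 국적지역 value. The report does not state this layout, so treat it as an assumption. The variable names are illustrative.

```python
# Category counts reported in this profile.
year_counts = {"2022": 480, "2023": 480, "2024": 120}
n_subtypes = 8       # distinct values of 방문취업자격
n_regions = 5        # distinct values of 국적지역
n_observations = 1080

# Assumption: one row per (year, month, subtype, region) combination.
rows_per_month = n_subtypes * n_regions
months_per_year = {year: n // rows_per_month for year, n in year_counts.items()}
total_months = sum(months_per_year.values())

# Row counts implied for each subtype and each region.
rows_per_subtype = total_months * n_regions
rows_per_region = total_months * n_subtypes

print(months_per_year, total_months)
print(total_months * rows_per_month == n_observations)
print(rows_per_subtype, rows_per_region)
```

Under that assumption the data cover 27 months (12 + 12 + 3), and the implied per-category counts match the 135 rows per 방문취업자격 value and 216 rows per 국적지역 value reported in this profile.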

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

방문취업자수 (number of Working Visit holders)
Real number (ℝ)

ZEROS

Distinct: 380
Distinct (%): 35.2%
Missing: 1
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2832.342
Minimum: 0
Maximum: 155578
Zeros: 406
Zeros (%): 37.6%
Negative: 0
Negative (%): 0.0%
Memory size: 9.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 3
Q3: 415
95-th percentile: 6779.5
Maximum: 155578
Range: 155578
Interquartile range (IQR): 415

Descriptive statistics

Standard deviation: 12685.749
Coefficient of variation (CV): 4.4788901
Kurtosis: 44.384653
Mean: 2832.342
Median Absolute Deviation (MAD): 3
Skewness: 6.3425586
Sum: 3056097
Variance: 1.6092822 × 10^8
Monotonicity: Not monotonic
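
The mean, variance, and coefficient of variation above are tied together by simple identities. The sketch below recomputes them from the reported sum and standard deviation. It assumes the mean is taken over the 1079 non-missing values, which is what the reported 2832.342 is consistent with (dividing by all 1080 rows would give about 2829.7). The variable names are illustrative.

```python
# Figures reported for 방문취업자수.
n_observations = 1080
n_missing = 1
total = 3056097      # reported Sum
std = 12685.749      # reported standard deviation

# Assumption: the mean is taken over non-missing values only.
n_valid = n_observations - n_missing
mean = total / n_valid

variance = std ** 2  # variance is the square of the standard deviation
cv = std / mean      # coefficient of variation

print(round(mean, 3))
print(f"{variance:.4e}")
print(round(cv, 4))
```

Small differences in the last digits are expected because the inputs are already rounded.
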
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value               Count  Frequency (%)
0                   406    37.6%
1                   93     8.6%
2                   34     3.1%
8                   27     2.5%
3                   16     1.5%
4                   12     1.1%
5                   8      0.7%
386                 7      0.6%
9                   7      0.6%
379                 6      0.6%
Other values (370)  463    42.9%
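
As a consistency check, the counts in the common-values table for 방문취업자수 should add up to the number of non-missing observations. The sketch below sums them and recomputes two of the percentages, assuming the percentages are taken over all 1080 rows. The variable names are illustrative.

```python
# Counts from the common-values table for 방문취업자수.
top_counts = {0: 406, 1: 93, 2: 34, 8: 27, 3: 16, 4: 12, 5: 8, 386: 7, 9: 7, 379: 6}
other_count = 463    # "Other values (370)"
n_observations = 1080

# Rows accounted for by the table; the remainder are missing values.
listed_total = sum(top_counts.values()) + other_count
n_missing = n_observations - listed_total

# Percentages, assuming the denominator is the full row count.
zeros_pct = 100 * top_counts[0] / n_observations
other_pct = 100 * other_count / n_observations

print(listed_total, n_missing)
print(f"{zeros_pct:.1f}% zeros, {other_pct:.1f}% other values")
```
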
Minimum 10 values

Value  Count  Frequency (%)
0      406    37.6%
1      93     8.6%
2      34     3.1%
3      16     1.5%
4      12     1.1%
5      8      0.7%
6      4      0.4%
7      2      0.2%
8      27     2.5%
9      7      0.6%

Maximum 10 values

Value   Count  Frequency (%)
155578  1      0.1%
89931   1      0.1%
88475   1      0.1%
86790   1      0.1%
85868   1      0.1%
84925   1      0.1%
83110   1      0.1%
81549   1      0.1%
79848   1      0.1%
78647   1      0.1%

Interactions

[Figures: four interaction plots]

Correlations

[Figure: correlation heatmap]

              (unnamed)  (unnamed)  방문취업자격  국적지역  방문취업자수
(unnamed)     1.000      0.496      0.000         0.000     0.138
(unnamed)     0.496      1.000      0.000         0.000     0.000
방문취업자격  0.000      0.000      1.000         0.000     0.345
국적지역      0.000      0.000      0.000         1.000     0.419
방문취업자수  0.138      0.000      0.345         0.419     1.000

[Figure: correlation heatmap]

              방문취업자격  국적지역  (unnamed)
방문취업자격  1.000         0.000     0.000
국적지역      0.000         1.000     0.000
(unnamed)     0.000         0.000     1.000

[Figure: correlation heatmap]

              (unnamed)  방문취업자수  (unnamed)  방문취업자격  국적지역
(unnamed)     1.000      -0.013        0.342      0.000         0.000
방문취업자수  -0.013     1.000         0.104      0.220         0.167
(unnamed)     0.342      0.104         1.000      0.000         0.000
방문취업자격  0.000      0.220         0.000      1.000         0.000
국적지역      0.000      0.167         0.000      0.000         1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

First rows

Index  (unnamed)  (unnamed)  방문취업자격  국적지역      방문취업자수
0      2022       1          연고방취H21   중국          508
1      2022       1          연고방취H21   우즈베키스탄  21
2      2022       1          연고방취H21   카자흐스탄    3
3      2022       1          연고방취H21   키르기즈      1
4      2022       1          연고방취H21   기타          1
5      2022       1          유학방취H22   중국          0
6      2022       1          유학방취H22   우즈베키스탄  0
7      2022       1          유학방취H22   카자흐스탄    0
8      2022       1          유학방취H22   키르기즈      0
9      2022       1          유학방취H22   기타          0

Last rows

Index  (unnamed)  (unnamed)  방문취업자격  국적지역      방문취업자수
1070   2024       3          만기방취H27   중국          63359
1071   2024       3          만기방취H27   우즈베키스탄  5353
1072   2024       3          만기방취H27   카자흐스탄    1791
1073   2024       3          만기방취H27   키르기즈      367
1074   2024       3          만기방취H27   기타          276
1075   2024       3          기타방취H29   중국          3618
1076   2024       3          기타방취H29   우즈베키스탄  78
1077   2024       3          기타방취H29   카자흐스탄    35
1078   2024       3          기타방취H29   키르기즈      4
1079   2024       3          기타방취H29   기타          141