Overview

Dataset statistics

Number of variables4
Number of observations468
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.1 KiB
Average record size in memory35.3 B

Variable types

Categorical2
Numeric2

Dataset

Description국내 체류중인 영주(F-5) 자격 소지자의 국적(지역)별 현황을 월별로 제공*외국국적동포 : 재외동포의 출입국과 법적지위에 관한 법률 제2조에 따라 대한민국의 국적을 보유하였던 자(대한민국정부 수립 전에 국외로 이주한 동포 포함) 또는 그 직계비속으로서 외국국적을 취득한 자 중 대통령령으로 정하는 자
Author법무부
URLhttps://www.data.go.kr/data/15100045/fileData.do

Alerts

영주F5자격소지자수 is highly overall correlated with 국적지역High correlation
국적지역 is highly overall correlated with 영주F5자격소지자수High correlation
영주F5자격소지자수 has 22 (4.7%) zerosZeros

Reproduction

Analysis started2024-04-29 22:59:53.431637
Analysis finished2024-04-29 22:59:55.168178
Duration1.74 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables


Categorical

Distinct4
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2021
144 
2022
144 
2023
144 
2024
36 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 144
30.8%
2022 144
30.8%
2023 144
30.8%
2024 36
 
7.7%

Length

2024-04-30T07:59:55.247568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:59:55.348394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 144
30.8%
2022 144
30.8%
2023 144
30.8%
2024 36
 
7.7%


Real number (ℝ)

Distinct12
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.1538462
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.2 KiB
2024-04-30T07:59:55.444010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q39
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.5377811
Coefficient of variation (CV)0.57488943
Kurtosis-1.2701622
Mean6.1538462
Median Absolute Deviation (MAD)3
Skewness0.12067394
Sum2880
Variance12.515895
MonotonicityNot monotonic
2024-04-30T07:59:55.537596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
1 48
10.3%
2 48
10.3%
3 48
10.3%
4 36
7.7%
5 36
7.7%
6 36
7.7%
7 36
7.7%
8 36
7.7%
9 36
7.7%
10 36
7.7%
Other values (2) 72
15.4%
ValueCountFrequency (%)
1 48
10.3%
2 48
10.3%
3 48
10.3%
4 36
7.7%
5 36
7.7%
6 36
7.7%
7 36
7.7%
8 36
7.7%
9 36
7.7%
10 36
7.7%
ValueCountFrequency (%)
12 36
7.7%
11 36
7.7%
10 36
7.7%
9 36
7.7%
8 36
7.7%
7 36
7.7%
6 36
7.7%
5 36
7.7%
4 36
7.7%
3 48
10.3%

국적지역
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
중국
39 
러시아
39 
미국
39 
우즈베키스탄
39 
캐나다
39 
Other values (7)
273 

Length

Max length7
Median length5.5
Mean length4.25
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중국
2nd row러시아
3rd row미국
4th row우즈베키스탄
5th row캐나다

Common Values

ValueCountFrequency (%)
중국 39
8.3%
러시아 39
8.3%
미국 39
8.3%
우즈베키스탄 39
8.3%
캐나다 39
8.3%
오스트레일리아 39
8.3%
카자흐스탄 39
8.3%
키르기즈 39
8.3%
우크라이나 39
8.3%
투르크메니스탄 39
8.3%
Other values (2) 78
16.7%

Length

2024-04-30T07:59:55.648407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중국 39
8.3%
러시아 39
8.3%
미국 39
8.3%
우즈베키스탄 39
8.3%
캐나다 39
8.3%
오스트레일리아 39
8.3%
카자흐스탄 39
8.3%
키르기즈 39
8.3%
우크라이나 39
8.3%
투르크메니스탄 39
8.3%
Other values (2) 78
16.7%

영주F5자격소지자수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct313
Distinct (%)66.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9953.8846
Minimum0
Maximum128328
Zeros22
Zeros (%)4.7%
Negative0
Negative (%)0.0%
Memory size4.2 KiB
2024-04-30T07:59:55.766081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q133.5
median195.5
Q3625
95-th percentile112595.45
Maximum128328
Range128328
Interquartile range (IQR)591.5

Descriptive statistics

Standard deviation32005.502
Coefficient of variation (CV)3.215378
Kurtosis7.3618841
Mean9953.8846
Median Absolute Deviation (MAD)188.5
Skewness3.0446739
Sum4658418
Variance1.0243521 × 109
MonotonicityNot monotonic
2024-04-30T07:59:55.894010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 22
 
4.7%
1 19
 
4.1%
7 14
 
3.0%
2 7
 
1.5%
24 6
 
1.3%
6 6
 
1.3%
19 5
 
1.1%
5 5
 
1.1%
4 5
 
1.1%
13 4
 
0.9%
Other values (303) 375
80.1%
ValueCountFrequency (%)
0 22
4.7%
1 19
4.1%
2 7
 
1.5%
3 2
 
0.4%
4 5
 
1.1%
5 5
 
1.1%
6 6
 
1.3%
7 14
3.0%
8 3
 
0.6%
9 1
 
0.2%
ValueCountFrequency (%)
128328 1
0.2%
127692 1
0.2%
126419 1
0.2%
125492 1
0.2%
124697 1
0.2%
124216 1
0.2%
123714 1
0.2%
123400 1
0.2%
122946 1
0.2%
122781 1
0.2%

Interactions

2024-04-30T07:59:54.862966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:59:54.521329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:59:54.941914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:59:54.648392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T07:59:55.989818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국적지역영주F5자격소지자수
1.0000.3410.0000.304
0.3411.0000.0000.000
국적지역0.0000.0001.0000.861
영주F5자격소지자수0.3040.0000.8611.000
2024-04-30T07:59:56.078698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국적지역
국적지역1.0000.000
0.0001.000
2024-04-30T07:59:56.155183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영주F5자격소지자수국적지역
1.0000.0200.2090.000
영주F5자격소지자수0.0201.0000.1230.558
0.2090.1231.0000.000
국적지역0.0000.5580.0001.000

Missing values

2024-04-30T07:59:55.050991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T07:59:55.132167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

국적지역영주F5자격소지자수
020211중국106189
120211러시아498
220211미국427
320211우즈베키스탄403
420211캐나다187
520211오스트레일리아66
620211카자흐스탄42
720211키르기즈13
820211우크라이나6
920211투르크메니스탄0
국적지역영주F5자격소지자수
45820243우즈베키스탄1476
45920243미국772
46020243캐나다335
46120243카자흐스탄318
46220243오스트레일리아149
46320243키르기즈81
46420243우크라이나57
46520243타지키스탄8
46620243투르크메니스탄8
46720243기타303