Overview

Dataset statistics

Number of variables: 3
Number of observations: 453
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.6 KiB
Average record size in memory: 26.3 B

Variable types

Numeric: 2
Categorical: 1

Dataset

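Description: Provides the number of foreign entrants per year and the number of foreign entrants per status of stay, from 2011 up to the time of the latest update (연도 [year], 체류자격 [status of stay], 입국자수 [number of entrants]).
URL: https://www.data.go.kr/data/15099990/fileData.do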

Reproduction

Analysis started: 2023-12-12 11:42:12.110015
Analysis finished: 2023-12-12 11:42:13.092039
Duration: 0.98 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables


Real number (ℝ)

Distinct: 12
Distinct (%): 2.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2016.5121
Minimum: 2011
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.1 KiB

Quantile statistics

Minimum: 2011
5-th percentile: 2011
Q1: 2014
Median: 2017
Q3: 2019
95-th percentile: 2022
Maximum: 2022
Range: 11
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 3.4497605
Coefficient of variation (CV): 0.0017107562
Kurtosis: -1.2114164
Mean: 2016.5121
Median Absolute Deviation (MAD): 3
Skewness: -0.0015998041
Sum: 913480
Variance: 11.900848
Monotonicity: Increasing
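
Several of the figures above are derived from the others: the range is the maximum minus the minimum, the IQR is Q3 minus Q1, and the coefficient of variation is the standard deviation divided by the mean. The following is a minimal sketch that recomputes them from the values printed in this report; the variable names are illustrative and not part of the report.

```python
# Values copied from the report for the first numeric variable
# (the one whose values run from 2011 to 2022).
minimum, maximum = 2011, 2022
q1, q3 = 2014, 2019
mean, std = 2016.5121, 3.4497605

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)

print("Range:", value_range)
print("IQR:", iqr)
print("CV:", cv)
```

Because the mean and standard deviation are shown rounded, the recomputed CV should agree with the reported 0.0017107562 only up to rounding.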
Histogram with fixed size bins (bins=12)
Value             Count  Frequency (%)
2012              38     8.4%
2014              38     8.4%
2015              38     8.4%
2016              38     8.4%
2017              38     8.4%
2018              38     8.4%
2019              38     8.4%
2021              38     8.4%
2022              38     8.4%
2011              37     8.2%
Other values (2)  74     16.3%

Value  Count  Frequency (%)
2011   37     8.2%
2012   38     8.4%
2013   37     8.2%
2014   38     8.4%
2015   38     8.4%
2016   38     8.4%
2017   38     8.4%
2018   38     8.4%
2019   38     8.4%
2020   37     8.2%

Value  Count  Frequency (%)
2022   38     8.4%
2021   38     8.4%
2020   37     8.2%
2019   38     8.4%
2018   38     8.4%
2017   38     8.4%
2016   38     8.4%
2015   38     8.4%
2014   38     8.4%
2013   37     8.2%

체류자격 (status of stay)
Categorical

Distinct: 42
Distinct (%): 9.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.7 KiB

Value              Count
외교A1             12
단기취업C4         12
구직D10            12
방문동거F1         12
사증면제B1         12
Other values (37)  393

Length

Max length: 8
Median length: 6
Mean length: 5.2781457
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 외교A1
2nd row: 공무A2
3rd row: 협정A3
4th row: 사증면제B1
5th row: 관광통과B2

Common Values

Value              Count  Frequency (%)
외교A1             12     2.6%
단기취업C4         12     2.6%
구직D10            12     2.6%
방문동거F1         12     2.6%
사증면제B1         12     2.6%
관광통과B2         12     2.6%
일시취재C1         12     2.6%
무역경영D9         12     2.6%
문화예술D1         12     2.6%
회화E2             12     2.6%
Other values (32)  333    73.5%
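
Each Frequency (%) entry is the row count divided by the total number of observations (453, from the overview), shown to one decimal place. A small sketch of that arithmetic, assuming ordinary rounding; the helper name `frequency_pct` is hypothetical and not something the report defines:

```python
N_OBSERVATIONS = 453  # "Number of observations" in the overview


def frequency_pct(count: int, total: int = N_OBSERVATIONS) -> str:
    """Format a count as a percentage of all rows, to one decimal place."""
    return f"{100 * count / total:.1f}%"


print(frequency_pct(12))   # one 체류자격 value that occurs 12 times
print(frequency_pct(333))  # the "Other values (32)" bucket
```

The same calculation applies to the other frequency tables in this report, for example a count of 38 or 37 rows per year.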

Length

Histogram of lengths of the category
Value              Count  Frequency (%)
외교a1             12     2.6%
예술흥행e6         12     2.6%
공무a2             12     2.6%
단기취업c4         12     2.6%
전문직업e5         12     2.6%
관광취업h1         12     2.6%
특정활동e7         12     2.6%
비전문취업e9       12     2.6%
거주f2             12     2.6%
방문취업h2         12     2.6%
Other values (32)  333    73.5%

입국자수 (number of entrants)
Real number (ℝ)

Distinct: 452
Distinct (%): 99.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 284071.16
Minimum: 22
Maximum: 7158538
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.1 KiB

Quantile statistics

Minimum: 22
5-th percentile: 387.4
Q1: 3655
Median: 17137
Q3: 93857
95-th percentile: 1527865.6
Maximum: 7158538
Range: 7158516
Interquartile range (IQR): 90202

Descriptive statistics

Standard deviation: 950622.95
Coefficient of variation (CV): 3.3464254
Kurtosis: 23.538931
Mean: 284071.16
Median Absolute Deviation (MAD): 16572
Skewness: 4.7640258
Sum: 1.2868424 × 10^8
Variance: 9.0368399 × 10^11
Monotonicity: Not monotonic
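
The sum and variance are shown in scientific notation, and both can be cross-checked against the other figures: the sum is the mean times the number of observations (453), and the variance is the square of the standard deviation. The sketch below is only an illustration of that check with the rounded values printed in this report, so small differences in the last digits are expected; the variable names are not part of the report.

```python
import math

n = 453              # number of observations
mean = 284071.16     # reported mean of 입국자수
std = 950622.95      # reported standard deviation

total = mean * n     # should be close to 1.2868424e8
variance = std ** 2  # should be close to 9.0368399e11

print(f"Sum      ~ {total:.7e}")
print(f"Variance ~ {variance:.7e}")
print(math.isclose(total, 1.2868424e8, rel_tol=1e-6))
print(math.isclose(variance, 9.0368399e11, rel_tol=1e-6))
```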
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
1934                2      0.4%
7543                1      0.2%
258067              1      0.2%
24507               1      0.2%
5184417             1      0.2%
1117                1      0.2%
7158538             1      0.2%
2053173             1      0.2%
111848              1      0.2%
45927               1      0.2%
Other values (442)  442    97.6%

Value  Count  Frequency (%)
22     1      0.2%
71     1      0.2%
72     1      0.2%
76     1      0.2%
77     1      0.2%
98     1      0.2%
117    1      0.2%
119    1      0.2%
128    1      0.2%
196    1      0.2%

Value    Count  Frequency (%)
7158538  1      0.2%
6378362  1      0.2%
5833225  1      0.2%
5524120  1      0.2%
5335626  1      0.2%
5269825  1      0.2%
5184417  1      0.2%
5040354  1      0.2%
5026435  1      0.2%
4969884  1      0.2%

Interactions

[Figures: four interaction plots]

Correlations

           (unnamed)  체류자격  입국자수
(unnamed)  1.000      0.000     0.000
체류자격   0.000      1.000     0.700
입국자수   0.000      0.700     1.000

           (unnamed)  입국자수  체류자격
(unnamed)  1.000      -0.096    0.000
입국자수   -0.096     1.000     0.312
체류자격   0.000      0.312     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

     (unnamed)  체류자격     입국자수
0    2011       외교A1       7543
1    2011       공무A2       13242
2    2011       협정A3       87054
3    2011       사증면제B1   909930
4    2011       관광통과B2   4969884
5    2011       일시취재C1   1605
6    2011       단기상용C2   182404
7    2011       단기방문C3   1324640
8    2011       단기취업C4   11799
9    2011       문화예술D1   255

     (unnamed)  체류자격     입국자수
443  2022       방문동거F1   46485
444  2022       거주F2       12238
445  2022       동반F3       21388
446  2022       재외동포F4   123610
447  2022       영주F5       24322
448  2022       결혼이민F6   66847
449  2022       기타G1       4868
450  2022       관광취업H1   3093
451  2022       방문취업H2   29480
452  2022       승무원       506081