Overview

Dataset statistics

Number of variables: 3
Number of observations: 447
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.5 KiB
Average record size in memory: 26.3 B

Variable types

Numeric: 2
Categorical: 1

Dataset

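Description: Yearly figures, broken down by status of stay, for foreigners staying in Korea. Covers short-term visitors who enter for tourism, visiting relatives, or similar purposes and stay for up to 90 days, and long-term residents, including registered foreigners and overseas Koreans of foreign nationality who have filed a domestic residence report.
URL: https://www.data.go.kr/data/15100015/fileData.do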

Reproduction

Analysis started: 2023-12-12 11:30:46.218645
Analysis finished: 2023-12-12 11:30:47.232199
Duration: 1.01 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables


Real number (ℝ)

Distinct: 12
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2016.5168
Minimum: 2011
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.1 KiB

Quantile statistics

Minimum: 2011
5-th percentile: 2011
Q1: 2014
Median: 2017
Q3: 2020
95-th percentile: 2022
Maximum: 2022
Range: 11
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 3.4626852
Coefficient of variation (CV): 0.0017171616
Kurtosis: -1.2215696
Mean: 2016.5168
Median Absolute Deviation (MAD): 3
Skewness: -0.0014810891
Sum: 901383
Variance: 11.990189
Monotonicity: Increasing
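
The descriptive statistics for this variable can be cross-checked against the value counts listed in the report's frequency tables. The following is a minimal sketch using only the Python standard library. The variable names are illustrative, and it assumes the reported variance is the sample variance (divisor n - 1), which is what reproduces the reported figure.

```python
import math
import statistics

# Value counts copied from the frequency tables in this report.
year_counts = {
    2011: 37, 2012: 37, 2013: 37, 2014: 38, 2015: 37, 2016: 37,
    2017: 37, 2018: 37, 2019: 37, 2020: 37, 2021: 38, 2022: 38,
}

# Expand the counts back into one value per observation.
years = [year for year, n in year_counts.items() for _ in range(n)]

n_obs = len(years)                      # number of observations
total = sum(years)                      # "Sum" in the report
mean = total / n_obs                    # "Mean"
median = statistics.median(years)       # "median"
variance = statistics.variance(years)   # sample variance, divisor n - 1 (assumption)
std_dev = math.sqrt(variance)           # "Standard deviation"
cv = std_dev / mean                     # "Coefficient of variation (CV)"

print(f"n={n_obs} sum={total} mean={mean:.4f} median={median}")
print(f"variance={variance:.6f} std={std_dev:.7f} cv={cv:.10f}")
```

Because every year appears 37 or 38 times, the distribution is nearly uniform, which is consistent with the near-zero skewness and negative kurtosis reported above.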
Histogram with fixed size bins (bins=12)
Value             Count  Frequency (%)
2014              38     8.5%
2021              38     8.5%
2022              38     8.5%
2011              37     8.3%
2012              37     8.3%
2013              37     8.3%
2015              37     8.3%
2016              37     8.3%
2017              37     8.3%
2018              37     8.3%
Other values (2)  74     16.6%

Value  Count  Frequency (%)
2011   37     8.3%
2012   37     8.3%
2013   37     8.3%
2014   38     8.5%
2015   37     8.3%
2016   37     8.3%
2017   37     8.3%
2018   37     8.3%
2019   37     8.3%
2020   37     8.3%

Value  Count  Frequency (%)
2022   38     8.5%
2021   38     8.5%
2020   37     8.3%
2019   37     8.3%
2018   37     8.3%
2017   37     8.3%
2016   37     8.3%
2015   37     8.3%
2014   38     8.5%
2013   37     8.3%

체류자격 (status of stay)
Categorical

Distinct: 42
Distinct (%): 9.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.6 KiB
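Value              English gloss              Count
외교A1             Diplomat                   12
단기취업C4         Short-term employment      12
구직D10            Job seeking                12
방문동거F1         Visiting or joining family 12
사증면제B1         Visa exemption             12
Other values (37)                             387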

Length

Max length: 7
Median length: 6
Mean length: 5.2259508
Min length: 2

Unique

Unique: 2
Unique (%): 0.4%

Sample

1st row: 외교A1 (Diplomat)
2nd row: 공무A2 (Official)
3rd row: 협정A3 (Agreement)
4th row: 사증면제B1 (Visa exemption)
5th row: 관광통과B2 (Tourist/transit)

Common Values

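Value              English gloss              Count  Frequency (%)
외교A1             Diplomat                   12     2.7%
단기취업C4         Short-term employment      12     2.7%
구직D10            Job seeking                12     2.7%
방문동거F1         Visiting or joining family 12     2.7%
사증면제B1         Visa exemption             12     2.7%
관광통과B2         Tourist/transit            12     2.7%
일시취재C1         Temporary news coverage    12     2.7%
무역경영D9         Trade management           12     2.7%
문화예술D1         Culture and arts           12     2.7%
회화E2             Language instruction       12     2.7%
Other values (32)                             327    73.2%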
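
The percentages in this table follow directly from the counts and the 447 observations. As a small illustrative check (the helper name `pct` is hypothetical, and rounding to one decimal place is assumed to mirror the report's display):

```python
N_OBS = 447                      # number of observations in the dataset
top_counts = [12] * 10           # the ten listed categories, 12 rows each


def pct(count: int, total: int = N_OBS) -> float:
    """Share of `count` in `total`, as a percentage rounded to one decimal."""
    return round(100 * count / total, 1)


other_count = N_OBS - sum(top_counts)   # rows not covered by the ten listed values

print(pct(12))            # frequency of a category with 12 rows
print(other_count)        # count shown for "Other values (32)"
print(pct(other_count))   # frequency shown for "Other values (32)"
print(pct(42))            # "Distinct (%)" for 42 distinct categories
```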

Length

Histogram of lengths of the category
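Value              English gloss               Count  Frequency (%)
외교a1             Diplomat                    12     2.7%
예술흥행e6         Arts and entertainment      12     2.7%
공무a2             Official                    12     2.7%
단기취업c4         Short-term employment       12     2.7%
전문직업e5         Professional employment     12     2.7%
관광취업h1         Working holiday             12     2.7%
특정활동e7         Specific activities         12     2.7%
비전문취업e9       Non-professional employment 12     2.7%
거주f2             Resident                    12     2.7%
방문취업h2         Working visit               12     2.7%
Other values (32)                              327    73.2%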

체류외국인 수 (number of foreign residents)
Real number (ℝ)

Distinct: 433
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 52515.26
Minimum: 18
Maximum: 502451
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.1 KiB

Quantile statistics

Minimum: 18
5-th percentile: 77.4
Q1: 1719
Median: 7236
Q3: 65091.5
95-th percentile: 248135.6
Maximum: 502451
Range: 502433
Interquartile range (IQR): 63372.5

Descriptive statistics

Standard deviation: 87700.713
Coefficient of variation (CV): 1.6700044
Kurtosis: 6.4191543
Mean: 52515.26
Median Absolute Deviation (MAD): 7164
Skewness: 2.4052542
Sum: 23474321
Variance: 7.6914151 × 10^9
Monotonicity: Not monotonic
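
Several of the figures above are derived from others in the same report: the range and IQR from the quantiles, the mean from the sum, and the variance and CV from the standard deviation. The sketch below recomputes them from the reported values only, since the full column is not reproduced here. The variable names are illustrative.

```python
# Figures copied from this report for the 체류외국인 수 column.
n_obs = 447
total = 23474321
minimum, maximum = 18, 502451
q1, q3 = 1719, 65091.5
median = 7236
std_dev = 87700.713

mean = total / n_obs             # "Mean"
value_range = maximum - minimum  # "Range"
iqr = q3 - q1                    # "Interquartile range (IQR)"
variance = std_dev ** 2          # "Variance"
cv = std_dev / mean              # "Coefficient of variation (CV)"

print(f"mean={mean:.2f} range={value_range} iqr={iqr}")
print(f"variance={variance:.7e} cv={cv:.7f}")
print(f"mean/median ratio={mean / median:.1f}")
```

The mean is several times larger than the median, which is consistent with the strong positive skewness reported for this variable: a few statuses with very large counts pull the mean upward.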
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
1719                4      0.9%
28                  2      0.4%
91                  2      0.4%
2053                2      0.4%
2637                2      0.4%
89                  2      0.4%
27                  2      0.4%
75                  2      0.4%
606                 2      0.4%
257                 2      0.4%
Other values (423)  425    95.1%

Value  Count  Frequency (%)
18     1      0.2%
20     1      0.2%
21     2      0.4%
23     1      0.2%
26     1      0.2%
27     2      0.4%
28     2      0.4%
31     1      0.2%
36     1      0.2%
38     1      0.2%

Value   Count  Frequency (%)
502451  1      0.2%
478442  1      0.2%
466682  1      0.2%
464152  1      0.2%
444880  1      0.2%
415121  1      0.2%
372533  1      0.2%
328187  1      0.2%
303368  1      0.2%
289427  1      0.2%

Interactions


Correlations

               (unnamed)  체류자격  체류외국인 수
(unnamed)      1.000      0.000     0.000
체류자격       0.000      1.000     0.829
체류외국인 수  0.000      0.829     1.000
               (unnamed)  체류외국인 수  체류자격
(unnamed)      1.000      0.034          0.000
체류외국인 수  0.034      1.000          0.446
체류자격       0.000      0.446          1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

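     (unnamed)  체류자격    English gloss            체류외국인 수
0    2011       외교A1      Diplomat                 1737
1    2011       공무A2      Official                 956
2    2011       협정A3      Agreement                44791
3    2011       사증면제B1  Visa exemption           36639
4    2011       관광통과B2  Tourist/transit          88976
5    2011       일시취재C1  Temporary news coverage  31
6    2011       단기상용C2  Short-term business      19377
7    2011       단기방문C3  Short-term visit         68104
8    2011       단기취업C4  Short-term employment    679
9    2011       문화예술D1  Culture and arts         75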
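     (unnamed)  체류자격    English gloss        체류외국인 수
437  2022       거주F2      Resident             44561
438  2022       동반F3      Dependent family     24917
439  2022       재외동포F4  Overseas Korean      502451
440  2022       영주F5      Permanent residence  176107
441  2022       결혼이민F6  Marriage migrant     136266
442  2022       기타G1      Miscellaneous        36446
443  2022       관광취업H1  Working holiday      2337
444  2022       방문취업H2  Working visit        105567
445  2022       관광상륙T1  Tourist landing      216
446  2022       기타        Other                70761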