Overview

Dataset statistics

Number of variables: 3
Number of observations: 48
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.3 KiB
Average record size in memory: 28.8 B

Variable types

Numeric: 2
Categorical: 1

Dataset

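Description: Provides the status of (short- and long-term) foreign residents staying in Korea, by year. *(Short- and long-term) foreign residents: all foreigners staying in the Republic of Korea who do not hold Korean nationality, such as 'foreigners on short-term stays of 90 days or less for purposes such as tourism' and 'registered foreigners residing long-term for 91 days or more, and overseas Koreans with foreign nationality who have filed a residence report' (foreigners staying illegally past the expiry of their period of stay are also included).
Author: 공공데이터포털 (Public Data Portal)
URL: https://www.data.go.kr/data/15100007/fileData.do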

Alerts

체류외국인수 is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 체류외국인수 (High correlation)
체류외국인수 has unique values (Unique)

Reproduction

Analysis started: 2024-04-17 09:18:52.068329
Analysis finished: 2024-04-17 09:18:52.517053
Duration: 0.45 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables


Real number (ℝ)

Distinct: 12
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2016.5
Minimum: 2011
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 564.0 B

Quantile statistics

Minimum: 2011
5-th percentile: 2011
Q1: 2013.75
Median: 2016.5
Q3: 2019.25
95-th percentile: 2022
Maximum: 2022
Range: 11
Interquartile range (IQR): 5.5

Descriptive statistics

Standard deviation: 3.4885832
Coefficient of variation (CV): 0.0017300189
Kurtosis: -1.2175129
Mean: 2016.5
Median Absolute Deviation (MAD): 3
Skewness: 0
Sum: 96792
Variance: 12.170213
Monotonicity: Increasing
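
The quantile and descriptive statistics above can be reproduced from the frequency table for this variable, in which each year from 2011 to 2022 occurs four times. The following is a minimal sketch using only the Python standard library. It assumes the sample (n - 1) variance and linearly interpolated quartiles; both assumptions are consistent with the reported values but are not stated in the report itself.

```python
import statistics

# Each year 2011..2022 appears 4 times (48 observations), per the frequency table.
years = [year for year in range(2011, 2023) for _ in range(4)]

mean = statistics.mean(years)
variance = statistics.variance(years)   # sample variance (n - 1 denominator)
std = statistics.stdev(years)           # sample standard deviation
cv = std / mean                         # coefficient of variation

# Quartiles with linear interpolation between the closest ranks.
q1, median, q3 = statistics.quantiles(years, n=4, method="inclusive")
iqr = q3 - q1

# Median absolute deviation: median of |x - median|.
mad = statistics.median([abs(y - median) for y in years])

print(mean, round(variance, 6), round(std, 7), round(cv, 10))
print(q1, median, q3, iqr, mad)
```

Because the year values are large relative to their spread, the coefficient of variation is close to zero and carries little information for this variable.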
Histogram with fixed size bins (bins=12)
Value             Count  Frequency (%)
2011              4      8.3%
2012              4      8.3%
2013              4      8.3%
2014              4      8.3%
2015              4      8.3%
2016              4      8.3%
2017              4      8.3%
2018              4      8.3%
2019              4      8.3%
2020              4      8.3%
Other values (2)  8      16.7%

Value  Count  Frequency (%)
2011   4      8.3%
2012   4      8.3%
2013   4      8.3%
2014   4      8.3%
2015   4      8.3%
2016   4      8.3%
2017   4      8.3%
2018   4      8.3%
2019   4      8.3%
2020   4      8.3%

Value  Count  Frequency (%)
2022   4      8.3%
2021   4      8.3%
2020   4      8.3%
2019   4      8.3%
2018   4      8.3%
2017   4      8.3%
2016   4      8.3%
2015   4      8.3%
2014   4      8.3%
2013   4      8.3%

구분
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B
총계: 12
장기체류등록: 12
장기체류거소: 12
단기체류: 12

Length

Max length: 6
Median length: 5
Mean length: 4.5
Min length: 2
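
The length statistics follow directly from the four category labels, each of which occurs 12 times. A minimal sketch, assuming length is counted in Unicode code points (what Python's `len` returns for these precomposed Hangul strings):

```python
import statistics

# Category labels of the 구분 variable; each occurs 12 times per the report.
categories = ["총계", "장기체류등록", "장기체류거소", "단기체류"]
lengths = [len(label) for label in categories for _ in range(12)]

max_length = max(lengths)
median_length = statistics.median(lengths)
mean_length = statistics.mean(lengths)
min_length = min(lengths)

print(max_length, median_length, mean_length, min_length)
```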

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 총계
2nd row: 장기체류등록
3rd row: 장기체류거소
4th row: 단기체류
5th row: 총계

Common Values

Value         Count  Frequency (%)
총계          12     25.0%
장기체류등록  12     25.0%
장기체류거소  12     25.0%
단기체류      12     25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value         Count  Frequency (%)
총계          12     25.0%
장기체류등록  12     25.0%
장기체류거소  12     25.0%
단기체류      12     25.0%

체류외국인수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 48
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 978096.71
Minimum: 135020
Maximum: 2524656
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 564.0 B

Quantile statistics

Minimum: 135020
5-th percentile: 248783.45
Q1: 424232.25
Median: 862918
Q3: 1302624.5
95-th percentile: 2223017.1
Maximum: 2524656
Range: 2389636
Interquartile range (IQR): 878392.25

Descriptive statistics

Standard deviation: 672303.22
Coefficient of variation (CV): 0.68735864
Kurtosis: -0.56463982
Mean: 978096.71
Median Absolute Deviation (MAD): 440205.5
Skewness: 0.75147146
Sum: 46948642
Variance: 4.5199162 × 10^11
Monotonicity: Not monotonic
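
Several of the figures above are derived from one another, which gives a quick way to check the report for internal consistency. The sketch below uses only numbers copied from this section; the variable names are illustrative, and because the inputs are already rounded, the results agree with the report only to within rounding.

```python
# Figures copied from the report for 체류외국인수.
n = 48                                  # Number of observations
total = 46948642                        # Sum
std = 672303.22                         # Standard deviation (rounded in the report)
q1, q3 = 424232.25, 1302624.5           # Q1 and Q3
minimum, maximum = 135020, 2524656      # Minimum and Maximum

mean = total / n                        # Mean = Sum / count
cv = std / mean                         # CV = standard deviation / mean
variance = std ** 2                     # Variance = standard deviation squared
iqr = q3 - q1                           # IQR = Q3 - Q1
value_range = maximum - minimum         # Range = Maximum - Minimum

print(round(mean, 2), round(cv, 8), f"{variance:.7e}", iqr, value_range)
```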
Histogram with fixed size bins (bins=48)
Value              Count  Frequency (%)
1395077            1      2.1%
1171762            1      2.1%
597399             1      2.1%
2367607            1      2.1%
1246626            1      2.1%
441107             1      2.1%
679874             1      2.1%
2524656            1      2.1%
1271807            1      2.1%
459996             1      2.1%
Other values (38)  38     79.2%

Value   Count  Frequency (%)
135020  1      2.1%
187616  1      2.1%
233269  1      2.1%
277596  1      2.1%
286414  1      2.1%
324504  1      2.1%
324786  1      2.1%
356842  1      2.1%
368862  1      2.1%
386945  1      2.1%

Value    Count  Frequency (%)
2524656  1      2.1%
2367607  1      2.1%
2245912  1      2.1%
2180498  1      2.1%
2049441  1      2.1%
2036075  1      2.1%
1956781  1      2.1%
1899519  1      2.1%
1797618  1      2.1%
1576034  1      2.1%

Interactions

[Four interaction plots (Matplotlib v3.7.2); images not preserved.]

Correlations

              (unnamed)  구분     체류외국인수
(unnamed)     1.000      0.000    0.000
구분          0.000      1.000    0.908
체류외국인수  0.000      0.908    1.000

              (unnamed)  체류외국인수  구분
(unnamed)     1.000      0.283         0.000
체류외국인수  0.283      1.000         0.744
구분          0.000      0.744         1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

    (unnamed)  구분          체류외국인수
0   2011       총계          1395077
1   2011       장기체류등록  982461
2   2011       장기체류거소  135020
3   2011       단기체류      277596
4   2012       총계          1445103
5   2012       장기체류등록  932983
6   2012       장기체류거소  187616
7   2012       단기체류      324504
8   2013       총계          1576034
9   2013       장기체류등록  985923

    (unnamed)  구분          체류외국인수
38  2020       장기체류거소  464783
39  2020       단기체류      425752
40  2021       총계          1956781
41  2021       장기체류등록  1093891
42  2021       장기체류거소  475945
43  2021       단기체류      386945
44  2022       총계          2245912
45  2022       장기체류등록  1189585
46  2022       장기체류거소  499270
47  2022       단기체류      557057
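
In the sample rows, the 총계 ("total") row for a year appears to equal the sum of the other three categories for that year. If that holds for the whole dataset, statistics computed over all 48 rows (such as the Sum and Mean of 체류외국인수) mix totals with their own components. The sketch below checks this only for the four years whose rows are fully visible in the sample; the field name `year` is a hypothetical label for the unnamed first column.

```python
# Rows copied from the sample, limited to years with all four categories visible:
# (year, category, count). "year" is a hypothetical name for the unnamed column.
rows = [
    (2011, "총계", 1395077), (2011, "장기체류등록", 982461),
    (2011, "장기체류거소", 135020), (2011, "단기체류", 277596),
    (2012, "총계", 1445103), (2012, "장기체류등록", 932983),
    (2012, "장기체류거소", 187616), (2012, "단기체류", 324504),
    (2021, "총계", 1956781), (2021, "장기체류등록", 1093891),
    (2021, "장기체류거소", 475945), (2021, "단기체류", 386945),
    (2022, "총계", 2245912), (2022, "장기체류등록", 1189585),
    (2022, "장기체류거소", 499270), (2022, "단기체류", 557057),
]

# Group counts by year, then compare the total row with the sum of the parts.
by_year = {}
for year, category, count in rows:
    by_year.setdefault(year, {})[category] = count

matches = {}
for year, counts in sorted(by_year.items()):
    parts = sum(value for key, value in counts.items() if key != "총계")
    matches[year] = (counts["총계"] == parts)
    print(year, counts["총계"], parts, matches[year])
```

For analysis, one option is to filter out the 총계 rows before aggregating, or to analyse them separately from the three component categories.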