Overview

Dataset statistics

Number of variables: 5
Number of observations: 6757
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 283.9 KiB
Average record size in memory: 43.0 B
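
The derived figures in this block follow from the raw ones by simple arithmetic. The sketch below recomputes the average record size and the missing-cell percentage from the reported totals. It assumes 1 KiB = 1024 bytes and that the percentage is taken over all cells (rows times columns); the variable names are illustrative and not part of the profiler's API.

```python
# Figures copied from the "Dataset statistics" block of this report.
total_size_kib = 283.9   # total size in memory, in KiB
n_observations = 6757    # number of rows
n_variables = 5          # number of columns
missing_cells = 0

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

# Assumption: the missing share is computed over all cells (rows x columns).
n_cells = n_observations * n_variables
missing_pct = 100 * missing_cells / n_cells

print(f"average record size: {avg_record_bytes:.1f} B")
print(f"cells: {n_cells}, missing: {missing_pct:.1f}%")
```

The recomputed average rounds to the 43.0 B shown above.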

Variable types

Categorical: 2
Numeric: 2
Text: 1

Dataset

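Description: Monthly counts, by area of residence (si/gun/gu), of foreign nationals who intend to stay in the Republic of Korea for 91 days or more from the date of entry, who have completed alien registration with the head of the regional immigration office having jurisdiction over their place of stay under Article 31 of the Immigration Act (출입국관리법), and who have been assigned a unique registration number.
Author: Ministry of Justice (법무부)
URL: https://www.data.go.kr/data/15100022/fileData.do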

Reproduction

Analysis started: 2024-04-29 22:58:50.402664
Analysis finished: 2024-04-29 22:58:52.686229
Duration: 2.28 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables


Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 52.9 KiB

Value  Count
2022   3000
2023   3000
2024   757

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value  Count  Frequency (%)
2022   3000   44.4%
2023   3000   44.4%
2024   757    11.2%
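
The Frequency (%) column is each count divided by the total number of rows. A minimal sketch, using only the counts listed in this table (the variable names are illustrative), reproduces the percentages and confirms that the counts add up to the number of observations:

```python
from collections import Counter

# Counts as listed in the Common Values table for this variable.
counts = Counter({"2022": 3000, "2023": 3000, "2024": 757})

# The counts should add up to the number of observations in the dataset.
n_rows = sum(counts.values())

# Frequency (%) = count / total rows, rounded to one decimal place.
freq_pct = {value: round(100 * c / n_rows, 1) for value, c in counts.items()}

for value, c in counts.most_common():
    print(f"{value}  {c:>5}  {freq_pct[value]:.1f}%")
```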

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the common values 2022, 2023, and 2024]


Real number (ℝ)

Distinct: 12
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.9957082
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 59.5 KiB

Quantile statistics

Minimum: 1
5th percentile: 1
Q1: 3
Median: 6
Q3: 9
95th percentile: 12
Maximum: 12
Range: 11
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 3.5600423
Coefficient of variation (CV): 0.59376511
Kurtosis: -1.2757187
Mean: 5.9957082
Median Absolute Deviation (MAD): 3
Skewness: 0.17899138
Sum: 40513
Variance: 12.673901
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=12)
Common values

Value             Count  Frequency (%)
1                 753    11.1%
2                 752    11.1%
3                 752    11.1%
4                 500    7.4%
5                 500    7.4%
6                 500    7.4%
7                 500    7.4%
8                 500    7.4%
9                 500    7.4%
10                500    7.4%
Other values (2)  1000   14.8%

Minimum 10 values

Value  Count  Frequency (%)
1      753    11.1%
2      752    11.1%
3      752    11.1%
4      500    7.4%
5      500    7.4%
6      500    7.4%
7      500    7.4%
8      500    7.4%
9      500    7.4%
10     500    7.4%

Maximum 10 values

Value  Count  Frequency (%)
12     500    7.4%
11     500    7.4%
10     500    7.4%
9      500    7.4%
8      500    7.4%
7      500    7.4%
6      500    7.4%
5      500    7.4%
4      500    7.4%
3      752    11.1%
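
Because this variable takes only the values 1 through 12, its descriptive statistics can be recomputed from the value counts listed above. The sketch below does that with the standard library only. It assumes the reported variance is the sample variance (divisor n - 1); that is an inference from the numbers, not something the report states.

```python
import math

# Value counts taken from the tables above (values 1 through 12).
counts = {1: 753, 2: 752, 3: 752}
counts.update({v: 500 for v in range(4, 13)})

n = sum(counts.values())                        # number of observations
total = sum(v * c for v, c in counts.items())   # reported as "Sum"
mean = total / n

# Assumption: sample variance (divide by n - 1), chosen because it
# reproduces the reported figure.
ss = sum(c * (v - mean) ** 2 for v, c in counts.items())
variance = ss / (n - 1)
std = math.sqrt(variance)
cv = std / mean                                 # coefficient of variation

print(n, total, round(mean, 7), round(variance, 6), round(std, 7), round(cv, 8))
```

Under that assumption the results agree with the reported Sum, Mean, Variance, Standard deviation, and CV to the precision shown.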

시도 (province / metropolitan city)
Categorical

Distinct: 18
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 52.9 KiB

Value              Count
경기도             966
서울특별시         675
경상북도           639
전라남도           594
경상남도           594
Other values (13)  3289

Length

Max length: 7
Median length: 5
Mean length: 4.1180997
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 강원도
2nd row: 강원도
3rd row: 강원도
4th row: 강원도
5th row: 강원도

Common Values

Value             Count  Frequency (%)
경기도            966    14.3%
서울특별시        675    10.0%
경상북도          639    9.5%
전라남도          594    8.8%
경상남도          594    8.8%
강원도            486    7.2%
충청남도          432    6.4%
부산광역시        432    6.4%
전라북도          405    6.0%
충청북도          378    5.6%
Other values (8)  1156   17.1%

Length

Histogram of lengths of the category

Value             Count  Frequency (%)
경기도            1141   16.9%
서울특별시        675    10.0%
경상북도          639    9.5%
전라남도          594    8.8%
경상남도          594    8.8%
강원도            486    7.2%
충청남도          432    6.4%
부산광역시        432    6.4%
전라북도          405    6.0%
충청북도          378    5.6%
Other values (7)  981    14.5%

시군구 (city / county / district)
Text

Distinct: 232
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Memory size: 52.9 KiB

Length

Max length: 9
Median length: 3
Mean length: 3.4864585
Min length: 2

Characters and Unicode

Total characters: 23558
Distinct characters: 148
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
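
As a rough illustration of what these character properties look like, the sketch below classifies the characters of two values that appear in this report's sample rows, using Python's standard `unicodedata` module. The block test uses the Hangul Syllables code point range U+AC00 to U+D7A3. The helper name `in_hangul_syllables` is hypothetical, and this is not necessarily how the profiler itself derives its categories, scripts, and blocks.

```python
import unicodedata
from collections import Counter

# Two values taken from the report's sample rows; the second contains a space.
values = ["강릉시", "청주시 상당구"]

# General category of every character:
# "Lo" = Other Letter, "Zs" = Space Separator.
categories = Counter(unicodedata.category(ch) for v in values for ch in v)

def in_hangul_syllables(ch: str) -> bool:
    """Hypothetical helper: True if ch lies in the Hangul Syllables block."""
    return 0xAC00 <= ord(ch) <= 0xD7A3

# Number of characters that fall in the Hangul Syllables block.
hangul = sum(in_hangul_syllables(ch) for v in values for ch in v)

print(dict(categories), hangul)
```

The same two categories, Other Letter and Space Separator, are the ones reported for this variable.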

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 강릉시
2nd row: 고성군
3rd row: 동해시
4th row: 삼척시
5th row: 속초시

Value               Count  Frequency (%)
중구                162    2.1%
동구                162    2.1%
서구                135    1.8%
남구                135    1.8%
북구                135    1.8%
창원시              135    1.8%
수원시              108    1.4%
청주시              108    1.4%
성남시              81     1.1%
고양시              81     1.1%
Other values (231)  6388   83.7%

Most occurring characters

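Value               Count  Frequency (%)
(missing)           2871   12.2%
(missing)           2707   11.5%
(missing)           2295   9.7%
(space)             873    3.7%
(missing)           648    2.8%
(missing)           628    2.7%
(missing)           621    2.6%
(missing)           594    2.5%
(missing)           567    2.4%
(missing)           540    2.3%
Other values (138)  11214  47.6%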

Most occurring categories

Value            Count  Frequency (%)
Other Letter     22685  96.3%
Space Separator  873    3.7%

Most frequent character per category

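Other Letter

Value               Count  Frequency (%)
(missing)           2871   12.7%
(missing)           2707   11.9%
(missing)           2295   10.1%
(missing)           648    2.9%
(missing)           628    2.8%
(missing)           621    2.7%
(missing)           594    2.6%
(missing)           567    2.5%
(missing)           540    2.4%
(missing)           489    2.2%
Other values (137)  10725  47.3%

Space Separator

Value    Count  Frequency (%)
(space)  873    100.0%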

Most occurring scripts

Value   Count  Frequency (%)
Hangul  22685  96.3%
Common  873    3.7%

Most frequent character per script

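Hangul

Value               Count  Frequency (%)
(missing)           2871   12.7%
(missing)           2707   11.9%
(missing)           2295   10.1%
(missing)           648    2.9%
(missing)           628    2.8%
(missing)           621    2.7%
(missing)           594    2.6%
(missing)           567    2.5%
(missing)           540    2.4%
(missing)           489    2.2%
Other values (137)  10725  47.3%

Common

Value    Count  Frequency (%)
(space)  873    100.0%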

Most occurring blocks

Value   Count  Frequency (%)
Hangul  22685  96.3%
ASCII   873    3.7%

Most frequent character per block

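Hangul

Value               Count  Frequency (%)
(missing)           2871   12.7%
(missing)           2707   11.9%
(missing)           2295   10.1%
(missing)           648    2.9%
(missing)           628    2.8%
(missing)           621    2.7%
(missing)           594    2.6%
(missing)           567    2.5%
(missing)           540    2.4%
(missing)           489    2.2%
Other values (137)  10725  47.3%

ASCII

Value    Count  Frequency (%)
(space)  873    100.0%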

등록외국인 수 (number of registered foreigners)
Real number (ℝ)

Distinct: 4676
Distinct (%): 69.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4865.0672
Minimum: 61
Maximum: 47094
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 59.5 KiB

Quantile statistics

Minimum: 61
5th percentile: 327
Q1: 1081
Median: 2877
Q3: 5722
95th percentile: 15226.8
Maximum: 47094
Range: 47033
Interquartile range (IQR): 4641

Descriptive statistics

Standard deviation: 5966.6489
Coefficient of variation (CV): 1.2264268
Kurtosis: 10.098422
Mean: 4865.0672
Median Absolute Deviation (MAD): 1989
Skewness: 2.7596173
Sum: 32873259
Variance: 35600899
Monotonicity: Not monotonic
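
Several of the figures above are derived from others, so they can be cross-checked with a few lines of arithmetic. The sketch below uses only numbers reported for this variable, plus the dataset's 6757 observations; the variable names are illustrative.

```python
# Reported figures for this variable, copied from the statistics above.
n = 6757                      # number of observations
total = 32873259              # "Sum"
minimum, q1, median, q3, maximum = 61, 1081, 2877, 5722, 47094
std = 5966.6489               # "Standard deviation"

mean = total / n              # should match "Mean"
value_range = maximum - minimum
iqr = q3 - q1
cv = std / mean               # coefficient of variation
variance = std ** 2

# A mean well above the median is consistent with the positive skewness
# reported: a long right tail of districts with large counts.
mean_over_median = mean / median

print(round(mean, 4), value_range, iqr, round(cv, 7), round(variance))
```

The mean is roughly 1.7 times the median, which matches the strongly right-skewed shape implied by the reported skewness of 2.76.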
Histogram with fixed size bins (bins=50)
Common values

Value                Count  Frequency (%)
935                  9      0.1%
956                  9      0.1%
415                  8      0.1%
917                  7      0.1%
952                  7      0.1%
436                  7      0.1%
323                  7      0.1%
482                  7      0.1%
915                  7      0.1%
1584                 6      0.1%
Other values (4666)  6683   98.9%

Minimum 10 values

Value  Count  Frequency (%)
61     1      < 0.1%
115    1      < 0.1%
116    1      < 0.1%
118    1      < 0.1%
121    1      < 0.1%
122    1      < 0.1%
123    1      < 0.1%
126    1      < 0.1%
127    4      0.1%
129    2      < 0.1%

Maximum 10 values

Value  Count  Frequency (%)
47094  1      < 0.1%
46608  1      < 0.1%
46126  1      < 0.1%
45334  1      < 0.1%
44733  1      < 0.1%
44070  1      < 0.1%
43550  1      < 0.1%
42921  1      < 0.1%
42263  1      < 0.1%
41845  1      < 0.1%

Interactions

[Figures: four pairwise interaction plots]

Correlations

Variable        (unnamed)  (unnamed)  시도   등록외국인 수
(unnamed)       1.000      0.507      0.476  0.122
(unnamed)       0.507      1.000      0.092  0.000
시도            0.476      0.092      1.000  0.520
등록외국인 수   0.122      0.000      0.520  1.000

Variable   (unnamed)  시도
(unnamed)  1.000      0.249
시도       0.249      1.000

Variable        (unnamed)  등록외국인 수  (unnamed)  시도
(unnamed)       1.000      0.017          0.353      0.035
등록외국인 수   0.017      1.000          0.073      0.228
(unnamed)       0.353      0.073          1.000      0.249
시도            0.035      0.228          0.249      1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index  (unnamed)  (unnamed)  시도    시군구  등록외국인 수
0      2022       1          강원도  강릉시  2344
1      2022       1          강원도  고성군  1202
2      2022       1          강원도  동해시  724
3      2022       1          강원도  삼척시  632
4      2022       1          강원도  속초시  1035
5      2022       1          강원도  양구군  323
6      2022       1          강원도  양양군  290
7      2022       1          강원도  영월군  245
8      2022       1          강원도  원주시  3298
9      2022       1          강원도  인제군  296

Index  (unnamed)  (unnamed)  시도      시군구         등록외국인 수
6747   2024       3          충청북도  옥천군         976
6748   2024       3          충청북도  음성군         12467
6749   2024       3          충청북도  제천시         2231
6750   2024       3          충청북도  증평군         1132
6751   2024       3          충청북도  진천군         7926
6752   2024       3          충청북도  청주시 상당구  1317
6753   2024       3          충청북도  청주시 서원구  3516
6754   2024       3          충청북도  청주시 청원구  5453
6755   2024       3          충청북도  청주시 흥덕구  6400
6756   2024       3          충청북도  충주시         5814