Overview

Dataset statistics

Number of variables5
Number of observations7679
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory322.6 KiB
Average record size in memory43.0 B

Variable types

Categorical2
Numeric2
Text1

Dataset

Description국내 체류중인 결혼이민자의 국적(지역)별, 성별 현황을 월별로 제공*결혼이민자 : 국민과의 혼인관계를 바탕으로 국내에 체류하고 있는 외국인으로서, F-2-1(국민의배우자 구 체류자격), F-5-2(결혼이민 영주자격), F-6-1(국민의배우자), F-6-2(자녀양육), F-6-3(혼인단절) 체류자격 소지자** 귀화 등으로 한국국적을 취득한자(혼인귀화자)는 체류 외국인 통계에서 제외
Author법무부
URLhttps://www.data.go.kr/data/15100035/fileData.do

Reproduction

Analysis started2024-04-29 22:59:33.316580
Analysis finished2024-04-29 22:59:35.458866
Duration2.14 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables


Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size60.1 KiB
2023
3449 
2022
3372 
2024
858 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2023 3449
44.9%
2022 3372
43.9%
2024 858
 
11.2%

Length

2024-04-30T07:59:35.516823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:59:35.605298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 3449
44.9%
2022 3372
43.9%
2024 858
 
11.2%


Real number (ℝ)

Distinct12
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.0109389
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size67.6 KiB
2024-04-30T07:59:35.707181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q39
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.559546
Coefficient of variation (CV)0.59217803
Kurtosis-1.2775836
Mean6.0109389
Median Absolute Deviation (MAD)3
Skewness0.17300004
Sum46158
Variance12.670367
MonotonicityNot monotonic
2024-04-30T07:59:35.809305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
2 851
11.1%
3 851
11.1%
1 849
11.1%
10 572
7.4%
11 572
7.4%
6 570
7.4%
8 570
7.4%
9 570
7.4%
12 570
7.4%
5 569
7.4%
Other values (2) 1135
14.8%
ValueCountFrequency (%)
1 849
11.1%
2 851
11.1%
3 851
11.1%
4 566
7.4%
5 569
7.4%
6 570
7.4%
7 569
7.4%
8 570
7.4%
9 570
7.4%
10 572
7.4%
ValueCountFrequency (%)
12 570
7.4%
11 572
7.4%
10 572
7.4%
9 570
7.4%
8 570
7.4%
7 569
7.4%
6 570
7.4%
5 569
7.4%
4 566
7.4%
3 851
11.1%
Distinct176
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size60.1 KiB
2024-04-30T07:59:36.046852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length3.8924339
Min length2

Characters and Unicode

Total characters29890
Distinct characters174
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가나
2nd row가나
3rd row감비아
4th row과테말라
5th row과테말라
ValueCountFrequency (%)
가나 54
 
0.7%
쿠바 54
 
0.7%
오스트레일리아 54
 
0.7%
오스트리아 54
 
0.7%
온두라스 54
 
0.7%
요르단 54
 
0.7%
우간다 54
 
0.7%
우루과이 54
 
0.7%
우크라이나 54
 
0.7%
일본 54
 
0.7%
Other values (166) 7139
93.0%
2024-04-30T07:59:36.403116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2247
 
7.5%
1315
 
4.4%
1082
 
3.6%
977
 
3.3%
958
 
3.2%
860
 
2.9%
859
 
2.9%
535
 
1.8%
533
 
1.8%
519
 
1.7%
Other values (164) 20005
66.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29755
99.5%
Close Punctuation 50
 
0.2%
Open Punctuation 50
 
0.2%
Dash Punctuation 35
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2247
 
7.6%
1315
 
4.4%
1082
 
3.6%
977
 
3.3%
958
 
3.2%
860
 
2.9%
859
 
2.9%
535
 
1.8%
533
 
1.8%
519
 
1.7%
Other values (161) 19870
66.8%
Close Punctuation
ValueCountFrequency (%)
) 50
100.0%
Open Punctuation
ValueCountFrequency (%)
( 50
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 35
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29755
99.5%
Common 135
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2247
 
7.6%
1315
 
4.4%
1082
 
3.6%
977
 
3.3%
958
 
3.2%
860
 
2.9%
859
 
2.9%
535
 
1.8%
533
 
1.8%
519
 
1.7%
Other values (161) 19870
66.8%
Common
ValueCountFrequency (%)
) 50
37.0%
( 50
37.0%
- 35
25.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29755
99.5%
ASCII 135
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2247
 
7.6%
1315
 
4.4%
1082
 
3.6%
977
 
3.3%
958
 
3.2%
860
 
2.9%
859
 
2.9%
535
 
1.8%
533
 
1.8%
519
 
1.7%
Other values (161) 19870
66.8%
ASCII
ValueCountFrequency (%)
) 50
37.0%
( 50
37.0%
- 35
25.9%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size60.1 KiB
남성
3998 
여성
3681 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남성
2nd row여성
3rd row남성
4th row남성
5th row여성

Common Values

ValueCountFrequency (%)
남성 3998
52.1%
여성 3681
47.9%

Length

2024-04-30T07:59:36.525790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:59:36.609289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남성 3998
52.1%
여성 3681
47.9%

결혼이민자수
Real number (ℝ)

Distinct940
Distinct (%)12.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean604.28285
Minimum1
Maximum46177
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size67.6 KiB
2024-04-30T07:59:36.718105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median12
Q378
95-th percentile1323.2
Maximum46177
Range46176
Interquartile range (IQR)75

Descriptive statistics

Standard deviation3700.5457
Coefficient of variation (CV)6.1238635
Kurtosis102.4696
Mean604.28285
Median Absolute Deviation (MAD)11
Skewness9.6734782
Sum4640288
Variance13694038
MonotonicityNot monotonic
2024-04-30T07:59:36.861652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1216
 
15.8%
2 551
 
7.2%
3 450
 
5.9%
4 365
 
4.8%
5 266
 
3.5%
8 224
 
2.9%
9 182
 
2.4%
6 180
 
2.3%
13 141
 
1.8%
12 139
 
1.8%
Other values (930) 3965
51.6%
ValueCountFrequency (%)
1 1216
15.8%
2 551
7.2%
3 450
 
5.9%
4 365
 
4.8%
5 266
 
3.5%
6 180
 
2.3%
7 123
 
1.6%
8 224
 
2.9%
9 182
 
2.4%
10 124
 
1.6%
ValueCountFrequency (%)
46177 1
< 0.1%
46152 1
< 0.1%
46073 1
< 0.1%
46047 1
< 0.1%
46045 1
< 0.1%
46044 1
< 0.1%
46009 1
< 0.1%
45999 1
< 0.1%
45995 1
< 0.1%
45989 2
< 0.1%

Interactions

2024-04-30T07:59:35.131243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:59:34.909167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:59:35.217441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T07:59:35.049534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T07:59:36.969269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별결혼이민자수
1.0000.5080.0000.000
0.5081.0000.0000.000
성별0.0000.0001.0000.143
결혼이민자수0.0000.0000.1431.000
2024-04-30T07:59:37.064659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별
1.0000.000
성별0.0001.000
2024-04-30T07:59:37.149201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
결혼이민자수성별
1.0000.0010.3540.000
결혼이민자수0.0011.0000.0000.107
0.3540.0001.0000.000
성별0.0000.1070.0001.000

Missing values

2024-04-30T07:59:35.325473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T07:59:35.415732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

국적지역성별결혼이민자수
020221가나남성23
120221가나여성2
220221감비아남성1
320221과테말라남성2
420221과테말라여성12
520221그리스남성18
620221그리스여성9
720221기니남성5
820221기니비사우남성1
920221나이지리아남성176
국적지역성별결혼이민자수
766920243피지여성2
767020243핀란드남성23
767120243핀란드여성23
767220243필리핀남성614
767320243필리핀여성12029
767420243헝가리남성13
767520243헝가리여성41
767620243홍콩남성88
767720243홍콩여성632
767820243홍콩거주난민여성1