Overview

Dataset statistics

Number of variables23
Number of observations81
Missing cells1369
Missing cells (%)73.5%
Duplicate rows7
Duplicate rows (%)8.6%
Total size in memory14.7 KiB
Average record size in memory185.6 B

Variable types

Text1
Unsupported22

Dataset

Description북한이탈주민 정책 관련 △북한이탈주민 입국현황 △북한이탈주민 연령별 현황 △북한이탈주민 지역별 거주현황 등 통계 데이터 제공
Author통일부
URLhttps://www.data.go.kr/data/15019661/fileData.do

Alerts

Dataset has 7 (8.6%) duplicate rowsDuplicates
입국 현황(’20.3월말 입국자 기준) has 20 (24.7%) missing valuesMissing
Unnamed: 1 has 45 (55.6%) missing valuesMissing
Unnamed: 2 has 45 (55.6%) missing valuesMissing
Unnamed: 3 has 45 (55.6%) missing valuesMissing
Unnamed: 4 has 44 (54.3%) missing valuesMissing
Unnamed: 5 has 45 (55.6%) missing valuesMissing
Unnamed: 6 has 45 (55.6%) missing valuesMissing
Unnamed: 7 has 45 (55.6%) missing valuesMissing
Unnamed: 8 has 43 (53.1%) missing valuesMissing
Unnamed: 9 has 48 (59.3%) missing valuesMissing
Unnamed: 10 has 65 (80.2%) missing valuesMissing
Unnamed: 11 has 65 (80.2%) missing valuesMissing
Unnamed: 12 has 65 (80.2%) missing valuesMissing
Unnamed: 13 has 71 (87.7%) missing valuesMissing
Unnamed: 14 has 71 (87.7%) missing valuesMissing
Unnamed: 15 has 76 (93.8%) missing valuesMissing
Unnamed: 16 has 76 (93.8%) missing valuesMissing
Unnamed: 17 has 76 (93.8%) missing valuesMissing
Unnamed: 18 has 76 (93.8%) missing valuesMissing
Unnamed: 19 has 75 (92.6%) missing valuesMissing
Unnamed: 20 has 76 (93.8%) missing valuesMissing
Unnamed: 21 has 76 (93.8%) missing valuesMissing
Unnamed: 22 has 76 (93.8%) missing valuesMissing
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 20 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 21 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 22 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-20 13:48:02.068802
Analysis finished2024-04-20 13:48:02.269021
Duration0.2 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct39
Distinct (%)63.9%
Missing20
Missing (%)24.7%
Memory size776.0 B
2024-04-20T22:48:03.099633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length76
Median length61
Mean length12.065574
Min length1

Characters and Unicode

Total characters736
Distinct characters128
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)54.1%

Sample

1st row입국 현황표
2nd row구분
3rd row남(명)
4th row여(명)
5th row합계(명)
ValueCountFrequency (%)
8
 
4.2%
구분 7
 
3.7%
재북 6
 
3.2%
6
 
3.2%
6
 
3.2%
있는 5
 
2.6%
합계(명 5
 
2.6%
최근 5
 
2.6%
북한이탈주민 5
 
2.6%
입국인원과 4
 
2.1%
Other values (79) 132
69.8%
2024-04-20T22:48:04.543129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
127
 
17.3%
( 15
 
2.0%
) 15
 
2.0%
14
 
1.9%
14
 
1.9%
13
 
1.8%
13
 
1.8%
13
 
1.8%
13
 
1.8%
13
 
1.8%
Other values (118) 486
66.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 525
71.3%
Space Separator 127
 
17.3%
Other Punctuation 26
 
3.5%
Decimal Number 20
 
2.7%
Open Punctuation 15
 
2.0%
Close Punctuation 15
 
2.0%
Final Punctuation 5
 
0.7%
Control 2
 
0.3%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
2.7%
14
 
2.7%
13
 
2.5%
13
 
2.5%
13
 
2.5%
13
 
2.5%
13
 
2.5%
12
 
2.3%
12
 
2.3%
11
 
2.1%
Other values (99) 397
75.6%
Other Punctuation
ValueCountFrequency (%)
8
30.8%
, 7
26.9%
. 5
19.2%
· 2
 
7.7%
: 1
 
3.8%
' 1
 
3.8%
* 1
 
3.8%
/ 1
 
3.8%
Decimal Number
ValueCountFrequency (%)
0 7
35.0%
2 5
25.0%
3 5
25.0%
1 2
 
10.0%
9 1
 
5.0%
Space Separator
ValueCountFrequency (%)
127
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Final Punctuation
ValueCountFrequency (%)
5
100.0%
Control
ValueCountFrequency (%)
2
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 525
71.3%
Common 211
28.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
2.7%
14
 
2.7%
13
 
2.5%
13
 
2.5%
13
 
2.5%
13
 
2.5%
13
 
2.5%
12
 
2.3%
12
 
2.3%
11
 
2.1%
Other values (99) 397
75.6%
Common
ValueCountFrequency (%)
127
60.2%
( 15
 
7.1%
) 15
 
7.1%
8
 
3.8%
, 7
 
3.3%
0 7
 
3.3%
2 5
 
2.4%
. 5
 
2.4%
3 5
 
2.4%
5
 
2.4%
Other values (9) 12
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 525
71.3%
ASCII 196
 
26.6%
Punctuation 13
 
1.8%
None 2
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
127
64.8%
( 15
 
7.7%
) 15
 
7.7%
, 7
 
3.6%
0 7
 
3.6%
2 5
 
2.6%
. 5
 
2.6%
3 5
 
2.6%
2
 
1.0%
1 2
 
1.0%
Other values (6) 6
 
3.1%
Hangul
ValueCountFrequency (%)
14
 
2.7%
14
 
2.7%
13
 
2.5%
13
 
2.5%
13
 
2.5%
13
 
2.5%
13
 
2.5%
12
 
2.3%
12
 
2.3%
11
 
2.1%
Other values (99) 397
75.6%
Punctuation
ValueCountFrequency (%)
8
61.5%
5
38.5%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing45
Missing (%)55.6%
Memory size776.0 B

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing45
Missing (%)55.6%
Memory size776.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing45
Missing (%)55.6%
Memory size776.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing44
Missing (%)54.3%
Memory size776.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing45
Missing (%)55.6%
Memory size776.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing45
Missing (%)55.6%
Memory size776.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing45
Missing (%)55.6%
Memory size776.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing43
Missing (%)53.1%
Memory size776.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing48
Missing (%)59.3%
Memory size776.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing65
Missing (%)80.2%
Memory size776.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing65
Missing (%)80.2%
Memory size776.0 B

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing65
Missing (%)80.2%
Memory size776.0 B

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing71
Missing (%)87.7%
Memory size776.0 B

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing71
Missing (%)87.7%
Memory size776.0 B

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing76
Missing (%)93.8%
Memory size776.0 B

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing76
Missing (%)93.8%
Memory size776.0 B

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing76
Missing (%)93.8%
Memory size776.0 B

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing76
Missing (%)93.8%
Memory size776.0 B

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing75
Missing (%)92.6%
Memory size776.0 B

Unnamed: 20
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing76
Missing (%)93.8%
Memory size776.0 B

Unnamed: 21
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing76
Missing (%)93.8%
Memory size776.0 B

Unnamed: 22
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing76
Missing (%)93.8%
Memory size776.0 B

Sample

입국 현황(’20.3월말 입국자 기준)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 21Unnamed: 22
0<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN(단위 : 명)NaNNaNNaN
1입국 현황표NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
2구분~'98~'01'02'03'04'05'06'07'08'09'10'11'12'13'14’15’16’17’18’19’20.3\n(잠정)합계
3남(명)831565510474626424515573608662591795404369305251302188168202399402
4여(명)1164786328111272960151319812195225218111911109811451092102411169399698459624256
5합계(명)947104311421285189813842028255428032914240227061502151413971275141811271137104713533658
6여성비율0.1224920.4582930.5534150.6311280.6701790.6936420.7460550.7756460.783090.7728210.7539550.7062080.7310250.7562750.7816750.8031370.7870240.8331850.8522430.8070680.7110.720661
7<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
8<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
9연령대별 입국현황(’20.3월말)NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
입국 현황(’20.3월말 입국자 기준)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 21Unnamed: 22
71<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
72<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
73북한이탈주민 경제활동 현황NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
74<NA>NaNNaNNaNNaNNaNNaNNaNNaN(단위 : %)NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
75북한이탈주민 경제활동 현황표NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
76구분’08’09’10’11’12’13’14’15’16’17’18’19NaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
77경제활동 참가율49.648.642.656.554.156.956.659.457.961.264.862.1NaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
78고용률44.941.938.749.75051.453.154.65556.960.458.2NaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
79실업률9.513.79.212.17.59.76.24.85.176.96.3NaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
80※ 출처 : 남북하나재단 '19년 북한이탈주민 실태조사NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN

Duplicate rows

Most frequently occurring

입국 현황(’20.3월말 입국자 기준)# duplicates
6<NA>20
0구분7
16
26
5합계(명)5
3지역2
4합계2