Overview

Dataset statistics

Number of variables24
Number of observations67
Missing cells909
Missing cells (%)56.5%
Duplicate rows6
Duplicate rows (%)9.0%
Total size in memory12.7 KiB
Average record size in memory194.0 B

Variable types

Text1
Unsupported22
Categorical1

Dataset

Description△남북 교역액 현황 △연도별 남북교역액 현황 △연도별 남북교역 건수 현황 △연도별 남북교역 품목수 현황 △유형별 남북교역액 현황 △금강산 관광객 현황 △남북 협력사업 승인
Author통일부
URLhttps://www.data.go.kr/data/15053277/fileData.do

Alerts

Dataset has 6 (9.0%) duplicate rowsDuplicates
Unnamed: 22 is highly imbalanced (68.9%)Imbalance
남북교역 현황 - 연도별 남북교역액 현황 has 29 (43.3%) missing valuesMissing
Unnamed: 1 has 40 (59.7%) missing valuesMissing
Unnamed: 2 has 38 (56.7%) missing valuesMissing
Unnamed: 3 has 33 (49.3%) missing valuesMissing
Unnamed: 4 has 34 (50.7%) missing valuesMissing
Unnamed: 5 has 34 (50.7%) missing valuesMissing
Unnamed: 6 has 34 (50.7%) missing valuesMissing
Unnamed: 7 has 34 (50.7%) missing valuesMissing
Unnamed: 8 has 34 (50.7%) missing valuesMissing
Unnamed: 9 has 34 (50.7%) missing valuesMissing
Unnamed: 10 has 34 (50.7%) missing valuesMissing
Unnamed: 11 has 34 (50.7%) missing valuesMissing
Unnamed: 12 has 33 (49.3%) missing valuesMissing
Unnamed: 13 has 40 (59.7%) missing valuesMissing
Unnamed: 14 has 40 (59.7%) missing valuesMissing
Unnamed: 15 has 40 (59.7%) missing valuesMissing
Unnamed: 16 has 40 (59.7%) missing valuesMissing
Unnamed: 17 has 39 (58.2%) missing valuesMissing
Unnamed: 18 has 49 (73.1%) missing valuesMissing
Unnamed: 19 has 49 (73.1%) missing valuesMissing
Unnamed: 20 has 46 (68.7%) missing valuesMissing
Unnamed: 21 has 61 (91.0%) missing valuesMissing
Unnamed: 23 has 60 (89.6%) missing valuesMissing
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 20 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 21 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 23 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 22:10:52.967753
Analysis finished2023-12-12 22:10:53.126733
Duration0.16 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct27
Distinct (%)71.1%
Missing29
Missing (%)43.3%
Memory size668.0 B
2023-12-13T07:10:53.332956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length85
Median length41
Mean length11.973684
Min length1

Characters and Unicode

Total characters455
Distinct characters98
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)60.5%

Sample

1st row남북교역 현황 - 연도별 남북교역액 현황표
2nd row구분
3rd row반입
4th row반출
5th row
ValueCountFrequency (%)
현황 12
 
9.0%
9
 
6.7%
남북교역 9
 
6.7%
현황표 6
 
4.5%
연도별 6
 
4.5%
구분 6
 
4.5%
남북 6
 
4.5%
교역액 3
 
2.2%
유형별 3
 
2.2%
반입 3
 
2.2%
Other values (52) 71
53.0%
2023-12-13T07:10:53.714887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
96
21.1%
18
 
4.0%
18
 
4.0%
18
 
4.0%
16
 
3.5%
16
 
3.5%
13
 
2.9%
13
 
2.9%
10
 
2.2%
9
 
2.0%
Other values (88) 228
50.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 330
72.5%
Space Separator 96
 
21.1%
Other Punctuation 13
 
2.9%
Dash Punctuation 8
 
1.8%
Final Punctuation 2
 
0.4%
Initial Punctuation 2
 
0.4%
Decimal Number 2
 
0.4%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
5.5%
18
 
5.5%
18
 
5.5%
16
 
4.8%
16
 
4.8%
13
 
3.9%
13
 
3.9%
10
 
3.0%
9
 
2.7%
8
 
2.4%
Other values (75) 191
57.9%
Other Punctuation
ValueCountFrequency (%)
' 4
30.8%
* 3
23.1%
/ 3
23.1%
, 2
15.4%
. 1
 
7.7%
Decimal Number
ValueCountFrequency (%)
0 1
50.0%
1 1
50.0%
Space Separator
ValueCountFrequency (%)
96
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
1
100.0%
Close Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 330
72.5%
Common 125
 
27.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
5.5%
18
 
5.5%
18
 
5.5%
16
 
4.8%
16
 
4.8%
13
 
3.9%
13
 
3.9%
10
 
3.0%
9
 
2.7%
8
 
2.4%
Other values (75) 191
57.9%
Common
ValueCountFrequency (%)
96
76.8%
- 8
 
6.4%
' 4
 
3.2%
* 3
 
2.4%
/ 3
 
2.4%
, 2
 
1.6%
2
 
1.6%
2
 
1.6%
0 1
 
0.8%
1 1
 
0.8%
Other values (3) 3
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 330
72.5%
ASCII 119
 
26.2%
Punctuation 4
 
0.9%
None 2
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
96
80.7%
- 8
 
6.7%
' 4
 
3.4%
* 3
 
2.5%
/ 3
 
2.5%
, 2
 
1.7%
0 1
 
0.8%
1 1
 
0.8%
. 1
 
0.8%
Hangul
ValueCountFrequency (%)
18
 
5.5%
18
 
5.5%
18
 
5.5%
16
 
4.8%
16
 
4.8%
13
 
3.9%
13
 
3.9%
10
 
3.0%
9
 
2.7%
8
 
2.4%
Other values (75) 191
57.9%
Punctuation
ValueCountFrequency (%)
2
50.0%
2
50.0%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing40
Missing (%)59.7%
Memory size668.0 B

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing38
Missing (%)56.7%
Memory size668.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing33
Missing (%)49.3%
Memory size668.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)50.7%
Memory size668.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)50.7%
Memory size668.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)50.7%
Memory size668.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)50.7%
Memory size668.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)50.7%
Memory size668.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)50.7%
Memory size668.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)50.7%
Memory size668.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing34
Missing (%)50.7%
Memory size668.0 B

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing33
Missing (%)49.3%
Memory size668.0 B

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing40
Missing (%)59.7%
Memory size668.0 B

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing40
Missing (%)59.7%
Memory size668.0 B

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing40
Missing (%)59.7%
Memory size668.0 B

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing40
Missing (%)59.7%
Memory size668.0 B

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing39
Missing (%)58.2%
Memory size668.0 B

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing49
Missing (%)73.1%
Memory size668.0 B

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing49
Missing (%)73.1%
Memory size668.0 B

Unnamed: 20
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing46
Missing (%)68.7%
Memory size668.0 B

Unnamed: 21
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing61
Missing (%)91.0%
Memory size668.0 B

Unnamed: 22
Categorical

IMBALANCE 

Distinct3
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size668.0 B
<NA>
61 
-
 
5
´20
 
1

Length

Max length4
Median length4
Mean length3.761194
Min length1

Unique

Unique1 ?
Unique (%)1.5%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 61
91.0%
- 5
 
7.5%
´20 1
 
1.5%

Length

2023-12-13T07:10:53.887241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:10:54.014869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 61
91.0%
5
 
7.5%
´20 1
 
1.5%

Unnamed: 23
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing60
Missing (%)89.6%
Memory size668.0 B

Sample

남북교역 현황 - 연도별 남북교역액 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 21Unnamed: 22Unnamed: 23
0<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN(단위 : 백만달러)NaN<NA>NaN
1남북교역 현황 - 연도별 남북교역액 현황표NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN
2구분‘89~02‘03‘04‘05‘06‘07‘08‘09‘10‘11‘12‘13‘14‘15‘16‘17‘18‘19‘20NaN<NA>NaN
3반입206628925834052076593293410449141074615120614521860110012607NaN<NA>NaN
4반출15054354397158301033888745868800897521113612621471217412254NaN<NA>NaN
53571724697105613501798182016791912171419711136234327143331317424861NaN<NA>NaN
6<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN
7*반올림에 의해 년도별 반입/반출의 합계와 전체금액의 합계가 다를 수 있음NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN
8<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN
9<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN
남북교역 현황 - 연도별 남북교역액 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 21Unnamed: 22Unnamed: 23
57남북 협력사업 승인 현황NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN
58<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>(단위 : 건)
59남북 협력사업 승인 현황표NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN
60구분NaNNaN´91~´02´03´04´05´06´07´08´09´10´11´12´13´14´15´16´17´18´19´20
61<NA>NaNNaN´01NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>NaN
62경제민간경협NaN1612610469119--19-------93
63<NA>개성승인---1726151635310616533-----308
64<NA>공단신고--------121118221027-----82
65사회문화NaNNaN23713164726193-11--12--65-170
66NaNNaN39815398345188652337202834612--65-653

Duplicate rows

Most frequently occurring

남북교역 현황 - 연도별 남북교역액 현황Unnamed: 22# duplicates
5<NA><NA>27
1구분<NA>5
2반입<NA>3
3반출<NA>3
0<NA>2
4<NA>-2