Overview

Dataset statistics

Number of variables4
Number of observations126
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.2 KiB
Average record size in memory34.0 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description2017년도 대한무역투자진흥공사 해외무역관 정보에 대한 데이터로 본부명, 무역관명, 무역관 주소 정보를 제공합니다.
Author대한무역투자진흥공사
URLhttps://www.data.go.kr/data/15044524/fileData.do

Alerts

연번 is highly overall correlated with 본부명High correlation
본부명 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
무역관명 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:05:33.274793
Analysis finished2023-12-12 21:05:33.824482
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct126
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean63.5
Minimum1
Maximum126
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-13T06:05:33.911338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.25
Q132.25
median63.5
Q394.75
95-th percentile119.75
Maximum126
Range125
Interquartile range (IQR)62.5

Descriptive statistics

Standard deviation36.517119
Coefficient of variation (CV)0.57507274
Kurtosis-1.2
Mean63.5
Median Absolute Deviation (MAD)31.5
Skewness0
Sum8001
Variance1333.5
MonotonicityStrictly increasing
2023-12-13T06:05:34.073593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
81 1
 
0.8%
94 1
 
0.8%
93 1
 
0.8%
92 1
 
0.8%
91 1
 
0.8%
90 1
 
0.8%
89 1
 
0.8%
88 1
 
0.8%
87 1
 
0.8%
Other values (116) 116
92.1%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
126 1
0.8%
125 1
0.8%
124 1
0.8%
123 1
0.8%
122 1
0.8%
121 1
0.8%
120 1
0.8%
119 1
0.8%
118 1
0.8%
117 1
0.8%

본부명
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)7.9%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
유럽
24 
중국
20 
동남아대양주
14 
중남미
14 
중동
14 
Other values (5)
40 

Length

Max length6
Median length2
Mean length2.952381
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCIS
2nd rowCIS
3rd rowCIS
4th rowCIS
5th rowCIS

Common Values

ValueCountFrequency (%)
유럽 24
19.0%
중국 20
15.9%
동남아대양주 14
11.1%
중남미 14
11.1%
중동 14
11.1%
북미 10
7.9%
아프리카 10
7.9%
CIS 9
 
7.1%
서남아시아 7
 
5.6%
일본 4
 
3.2%

Length

2023-12-13T06:05:34.231529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:05:34.361596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유럽 24
19.0%
중국 20
15.9%
동남아대양주 14
11.1%
중남미 14
11.1%
중동 14
11.1%
북미 10
7.9%
아프리카 10
7.9%
cis 9
 
7.1%
서남아시아 7
 
5.6%
일본 4
 
3.2%

무역관명
Text

UNIQUE 

Distinct126
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T06:05:34.710058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length3.5
Min length1

Characters and Unicode

Total characters441
Distinct characters170
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique126 ?
Unique (%)100.0%

Sample

1st row노보시비르스크
2nd row모스크바
3rd row민스크
4th row바쿠
5th row블라디보스톡
ValueCountFrequency (%)
노보시비르스크 1
 
0.8%
광저우 1
 
0.8%
창사 1
 
0.8%
정저우 1
 
0.8%
울란바토르 1
 
0.8%
우한 1
 
0.8%
시안 1
 
0.8%
선양 1
 
0.8%
샤먼 1
 
0.8%
선전 1
 
0.8%
Other values (116) 116
92.1%
2023-12-13T06:05:35.246187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20
 
4.5%
16
 
3.6%
15
 
3.4%
13
 
2.9%
13
 
2.9%
12
 
2.7%
11
 
2.5%
11
 
2.5%
9
 
2.0%
8
 
1.8%
Other values (160) 313
71.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 441
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
4.5%
16
 
3.6%
15
 
3.4%
13
 
2.9%
13
 
2.9%
12
 
2.7%
11
 
2.5%
11
 
2.5%
9
 
2.0%
8
 
1.8%
Other values (160) 313
71.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 441
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
4.5%
16
 
3.6%
15
 
3.4%
13
 
2.9%
13
 
2.9%
12
 
2.7%
11
 
2.5%
11
 
2.5%
9
 
2.0%
8
 
1.8%
Other values (160) 313
71.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 441
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
20
 
4.5%
16
 
3.6%
15
 
3.4%
13
 
2.9%
13
 
2.9%
12
 
2.7%
11
 
2.5%
11
 
2.5%
9
 
2.0%
8
 
1.8%
Other values (160) 313
71.0%

주소
Text

UNIQUE 

Distinct126
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T06:05:35.581051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length165
Median length96.5
Mean length78.928571
Min length16

Characters and Unicode

Total characters9945
Distinct characters78
Distinct categories11 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique126 ?
Unique (%)100.0%

Sample

1st rowOffice 11-03, Bldg. 40 (Business Center 'MOST'), Kommunisticheskaya Street, Novosibirsk, Russia, 630007
2nd rowRm. 908, Office Bldg., World Trade Center, 12, Krasnopresnenskaya nab., Moscow, Russia(123610)
3rd rowOffice 509, Myasnikova Street 70, 220030, Minsk, Republic of Belarus
4th rowMaryam Plaza, office 304, 12 Basti Bagirova str., Baku, Azerbaijan(AZ1065)
5th row1th Floor, Hyundai Hotel, 29 Semenovskaya st., Vladivostok, Russia(690091)
ValueCountFrequency (%)
floor 31
 
2.1%
25
 
1.7%
center 24
 
1.6%
road 18
 
1.2%
tower 17
 
1.2%
street 13
 
0.9%
st 12
 
0.8%
of 11
 
0.7%
office 10
 
0.7%
district 10
 
0.7%
Other values (984) 1301
88.4%
2023-12-13T06:05:36.196655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1373
 
13.8%
a 736
 
7.4%
, 553
 
5.6%
e 524
 
5.3%
o 453
 
4.6%
n 450
 
4.5%
i 442
 
4.4%
r 389
 
3.9%
t 322
 
3.2%
l 274
 
2.8%
Other values (68) 4429
44.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 5230
52.6%
Space Separator 1373
 
13.8%
Uppercase Letter 1317
 
13.2%
Decimal Number 1133
 
11.4%
Other Punctuation 769
 
7.7%
Dash Punctuation 62
 
0.6%
Open Punctuation 28
 
0.3%
Close Punctuation 28
 
0.3%
Other Letter 3
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 736
14.1%
e 524
10.0%
o 453
 
8.7%
n 450
 
8.6%
i 442
 
8.5%
r 389
 
7.4%
t 322
 
6.2%
l 274
 
5.2%
s 215
 
4.1%
u 208
 
4.0%
Other values (16) 1217
23.3%
Uppercase Letter
ValueCountFrequency (%)
C 144
 
10.9%
S 110
 
8.4%
A 107
 
8.1%
B 99
 
7.5%
T 83
 
6.3%
P 75
 
5.7%
R 75
 
5.7%
F 67
 
5.1%
N 66
 
5.0%
M 54
 
4.1%
Other values (16) 437
33.2%
Decimal Number
ValueCountFrequency (%)
0 252
22.2%
1 239
21.1%
2 150
13.2%
3 109
9.6%
4 77
 
6.8%
5 72
 
6.4%
6 71
 
6.3%
8 63
 
5.6%
7 55
 
4.9%
9 45
 
4.0%
Other Punctuation
ValueCountFrequency (%)
, 553
71.9%
. 163
 
21.2%
/ 15
 
2.0%
: 10
 
1.3%
* 9
 
1.2%
' 9
 
1.2%
# 7
 
0.9%
& 2
 
0.3%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
1373
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 62
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Other Letter
ValueCountFrequency (%)
º 3
100.0%
Other Symbol
ValueCountFrequency (%)
° 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 6550
65.9%
Common 3395
34.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 736
 
11.2%
e 524
 
8.0%
o 453
 
6.9%
n 450
 
6.9%
i 442
 
6.7%
r 389
 
5.9%
t 322
 
4.9%
l 274
 
4.2%
s 215
 
3.3%
u 208
 
3.2%
Other values (43) 2537
38.7%
Common
ValueCountFrequency (%)
1373
40.4%
, 553
16.3%
0 252
 
7.4%
1 239
 
7.0%
. 163
 
4.8%
2 150
 
4.4%
3 109
 
3.2%
4 77
 
2.3%
5 72
 
2.1%
6 71
 
2.1%
Other values (15) 336
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9940
99.9%
None 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1373
 
13.8%
a 736
 
7.4%
, 553
 
5.6%
e 524
 
5.3%
o 453
 
4.6%
n 450
 
4.5%
i 442
 
4.4%
r 389
 
3.9%
t 322
 
3.2%
l 274
 
2.8%
Other values (65) 4424
44.5%
None
ValueCountFrequency (%)
º 3
60.0%
° 1
 
20.0%
1
 
20.0%

Interactions

2023-12-13T06:05:33.506093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:05:36.305465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번본부명
연번1.0000.982
본부명0.9821.000
2023-12-13T06:05:36.398987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번본부명
연번1.0000.760
본부명0.7601.000

Missing values

2023-12-13T06:05:33.668363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:05:33.778907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번본부명무역관명주소
01CIS노보시비르스크Office 11-03, Bldg. 40 (Business Center 'MOST'), Kommunisticheskaya Street, Novosibirsk, Russia, 630007
12CIS모스크바Rm. 908, Office Bldg., World Trade Center, 12, Krasnopresnenskaya nab., Moscow, Russia(123610)
23CIS민스크Office 509, Myasnikova Street 70, 220030, Minsk, Republic of Belarus
34CIS바쿠Maryam Plaza, office 304, 12 Basti Bagirova str., Baku, Azerbaijan(AZ1065)
45CIS블라디보스톡1th Floor, Hyundai Hotel, 29 Semenovskaya st., Vladivostok, Russia(690091)
56CIS상트페테르부르크Nekrasova 32-A, St. Petersburg, Russia(191014)
67CIS알마티Pavilion No.15/107, 3rd Floor, Timiryazev str. 42, Almaty, 050057, Republic of Kazakhstan
78CIS키예프19a Khreschatik Street, 01001, Kiev, Ukraine
89CIS타슈켄트4th floor C6, International Business Center, 107-B, Amir Temur prospect, Tashkent 100084, Uzbekistan
910동남아대양주마닐라UNIT 1504, 15F, BDO Equitable Tower, 8751 Paseo de Roxas St., Makati City, Philippines
연번본부명무역관명주소
116117중동무스카트Al Fardan Building (1st Fl. Office 517), Al Azaiba, Muscat, Sultanate of Oman, * P.O.Box : 149, Postal Code 118
117118중동바그다드International Zone, Al-Kindi Area, District 15, Street 9, house no.12, Baghdad, Iraq
118119중동알제1, Rue Hamdani Lahcen, Les Cretes A, Hydra 16016, Alger, Algerie 16035
119120중동암만Office No. 604/605, 6th Floor, Zahran Plaza Complex at the 7th Circle, Amman, Jordan. * P.O.Box : 940593 Amman 11194, Jordan
120121중동카사블랑카Ghandi Mall, Boulevard Ghandi Immeuble 8, 2eme etage No.4, 20100 Casablanca, Morocco
121122중동카이로44 PalestineSt. 1st Floor, New Maadi, Cairo, Egypt
122123중동쿠웨이트5th Floor, Al-Hajeri Bldg, Al-Shuhada`a St, Mirqab, Kuwait City, Kuwait. * P.O.Box : 20771 SAFAT 13068
123124중동테헤란7th Floor, No. 18, Jim Alley, Mahmudiyeh, Tehran, Iran
124125중동텔아비브8th Fl., Beit Amot, 48 Menachem Begin st Tel Aviv 6618003, Israel
125126중동트리폴리Tripoli, Libya