Overview

Dataset statistics

Number of variables19
Number of observations10000
Missing cells24616
Missing cells (%)13.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 MiB
Average record size in memory161.0 B

Variable types

Categorical5
Text13
Numeric1

Dataset

Description전세계 124개국의 해외진출기업의 회사명, 주소, 홈페이지, 진출년도, 진출형태, 투자형태 등의 정보 제공 (2016년 발행본)
URLhttps://www.data.go.kr/data/15003297/fileData.do

Alerts

투자형태 is highly imbalanced (72.4%)Imbalance
회사명(영문) has 116 (1.2%) missing valuesMissing
홈페이지 has 3833 (38.3%) missing valuesMissing
진출년도 has 435 (4.3%) missing valuesMissing
외국사 합작지분 has 9703 (97.0%) missing valuesMissing
업종2 has 3709 (37.1%) missing valuesMissing
취급분야 has 304 (3.0%) missing valuesMissing
본사파견 has 2782 (27.8%) missing valuesMissing
현지채용 has 1541 (15.4%) missing valuesMissing
모기업명 has 2167 (21.7%) missing valuesMissing
현지채용 is highly skewed (γ1 = 33.45363541)Skewed

Reproduction

Analysis started2023-12-12 21:32:44.115161
Analysis finished2023-12-12 21:32:49.169967
Duration5.05 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
아시아
6860 
아시아
777 
북미
695 
유럽
 
595
중남미
 
383
Other values (4)
690 

Length

Max length4
Median length4
Mean length3.6131
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row아시아
2nd rowCIS
3rd row유럽
4th row아시아
5th row아시아

Common Values

ValueCountFrequency (%)
아시아 6860
68.6%
아시아 777
 
7.8%
북미 695
 
7.0%
유럽 595
 
5.9%
중남미 383
 
3.8%
CIS 280
 
2.8%
중동 192
 
1.9%
중동 126
 
1.3%
아프리카 92
 
0.9%

Length

2023-12-13T06:32:49.260128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:32:49.447346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
아시아 7637
76.4%
북미 695
 
7.0%
유럽 595
 
5.9%
중남미 383
 
3.8%
중동 318
 
3.2%
cis 280
 
2.8%
아프리카 92
 
0.9%
Distinct89
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:32:49.695558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length3.0164
Min length2

Characters and Unicode

Total characters30164
Distinct characters123
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)0.1%

Sample

1st row중국
2nd row러시아
3rd row덴마크
4th row인도
5th row인도네시아
ValueCountFrequency (%)
중국 3033
30.3%
베트남 2295
22.9%
미국 659
 
6.6%
인도네시아 433
 
4.3%
일본 333
 
3.3%
태국 305
 
3.0%
인도 236
 
2.4%
필리핀 190
 
1.9%
미얀마 168
 
1.7%
멕시코 163
 
1.6%
Other values (76) 2185
21.9%
2023-12-13T06:32:50.021640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4079
13.5%
3033
 
10.1%
2440
 
8.1%
2359
 
7.8%
2315
 
7.7%
2059
 
6.8%
1230
 
4.1%
954
 
3.2%
943
 
3.1%
687
 
2.3%
Other values (113) 10065
33.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28105
93.2%
Space Separator 2059
 
6.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4079
14.5%
3033
 
10.8%
2440
 
8.7%
2359
 
8.4%
2315
 
8.2%
1230
 
4.4%
954
 
3.4%
943
 
3.4%
687
 
2.4%
675
 
2.4%
Other values (112) 9390
33.4%
Space Separator
ValueCountFrequency (%)
2059
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28105
93.2%
Common 2059
 
6.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4079
14.5%
3033
 
10.8%
2440
 
8.7%
2359
 
8.4%
2315
 
8.2%
1230
 
4.4%
954
 
3.4%
943
 
3.4%
687
 
2.4%
675
 
2.4%
Other values (112) 9390
33.4%
Common
ValueCountFrequency (%)
2059
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28105
93.2%
ASCII 2059
 
6.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4079
14.5%
3033
 
10.8%
2440
 
8.7%
2359
 
8.4%
2315
 
8.2%
1230
 
4.4%
954
 
3.4%
943
 
3.4%
687
 
2.4%
675
 
2.4%
Other values (112) 9390
33.4%
ASCII
ValueCountFrequency (%)
2059
100.0%
Distinct497
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:32:50.350022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length29
Mean length12.5264
Min length3

Characters and Unicode

Total characters125264
Distinct characters454
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique204 ?
Unique (%)2.0%

Sample

1st row화동-상하이(上海)
2nd row모스크바(Moscow)
3rd row내스트베드(Naestved)
4th row첸나이(Chennai)
5th row자카르타(Jakarta)
ValueCountFrequency (%)
호치민(ho 768
 
5.3%
minh 768
 
5.3%
chi 768
 
5.3%
화동-상하이(上海 746
 
5.1%
화동-산둥성(山省 667
 
4.6%
하노이(ha 400
 
2.8%
noi 400
 
2.8%
duong 354
 
2.4%
화북-톈진(天津 334
 
2.3%
화북-베이징(北京 318
 
2.2%
Other values (580) 8971
61.9%
2023-12-13T06:32:50.763221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 9966
 
8.0%
) 9966
 
8.0%
a 6557
 
5.2%
i 5473
 
4.4%
n 5266
 
4.2%
o 5000
 
4.0%
4498
 
3.6%
h 3432
 
2.7%
- 2891
 
2.3%
2494
 
2.0%
Other values (444) 69721
55.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43297
34.6%
Lowercase Letter 43191
34.5%
Uppercase Letter 11455
 
9.1%
Open Punctuation 9966
 
8.0%
Close Punctuation 9966
 
8.0%
Space Separator 4498
 
3.6%
Dash Punctuation 2891
 
2.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2494
 
5.8%
2365
 
5.5%
2063
 
4.8%
1420
 
3.3%
1420
 
3.3%
1344
 
3.1%
947
 
2.2%
851
 
2.0%
820
 
1.9%
792
 
1.8%
Other values (389) 28781
66.5%
Lowercase Letter
ValueCountFrequency (%)
a 6557
15.2%
i 5473
12.7%
n 5266
12.2%
o 5000
11.6%
h 3432
7.9%
e 2385
 
5.5%
g 1961
 
4.5%
r 1788
 
4.1%
u 1599
 
3.7%
t 1442
 
3.3%
Other values (16) 8288
19.2%
Uppercase Letter
ValueCountFrequency (%)
C 1558
13.6%
H 1475
12.9%
N 1168
10.2%
M 1160
10.1%
D 941
8.2%
B 910
7.9%
P 567
 
4.9%
T 562
 
4.9%
J 513
 
4.5%
S 460
 
4.0%
Other values (15) 2141
18.7%
Open Punctuation
ValueCountFrequency (%)
( 9966
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9966
100.0%
Space Separator
ValueCountFrequency (%)
4498
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2891
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 54646
43.6%
Hangul 37715
30.1%
Common 27321
21.8%
Han 5582
 
4.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2494
 
6.6%
2365
 
6.3%
2063
 
5.5%
1420
 
3.8%
1344
 
3.6%
947
 
2.5%
851
 
2.3%
820
 
2.2%
792
 
2.1%
786
 
2.1%
Other values (357) 23833
63.2%
Latin
ValueCountFrequency (%)
a 6557
 
12.0%
i 5473
 
10.0%
n 5266
 
9.6%
o 5000
 
9.1%
h 3432
 
6.3%
e 2385
 
4.4%
g 1961
 
3.6%
r 1788
 
3.3%
u 1599
 
2.9%
C 1558
 
2.9%
Other values (41) 19627
35.9%
Han
ValueCountFrequency (%)
1420
25.4%
752
13.5%
746
13.4%
667
11.9%
362
 
6.5%
334
 
6.0%
334
 
6.0%
318
 
5.7%
188
 
3.4%
104
 
1.9%
Other values (22) 357
 
6.4%
Common
ValueCountFrequency (%)
( 9966
36.5%
) 9966
36.5%
4498
16.5%
- 2891
 
10.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 81967
65.4%
Hangul 37715
30.1%
CJK 5582
 
4.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 9966
 
12.2%
) 9966
 
12.2%
a 6557
 
8.0%
i 5473
 
6.7%
n 5266
 
6.4%
o 5000
 
6.1%
4498
 
5.5%
h 3432
 
4.2%
- 2891
 
3.5%
e 2385
 
2.9%
Other values (45) 26533
32.4%
Hangul
ValueCountFrequency (%)
2494
 
6.6%
2365
 
6.3%
2063
 
5.5%
1420
 
3.8%
1344
 
3.6%
947
 
2.5%
851
 
2.3%
820
 
2.2%
792
 
2.1%
786
 
2.1%
Other values (357) 23833
63.2%
CJK
ValueCountFrequency (%)
1420
25.4%
752
13.5%
746
13.4%
667
11.9%
362
 
6.5%
334
 
6.0%
334
 
6.0%
318
 
5.7%
188
 
3.4%
104
 
1.9%
Other values (22) 357
 
6.4%
Distinct9503
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:32:51.056716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length37
Mean length11.0269
Min length2

Characters and Unicode

Total characters110269
Distinct characters1835
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9241 ?
Unique (%)92.4%

Sample

1st row커이나신식기술유한공사/上海可以拿信息技有限公司
2nd row아시아나항공모스크바화물지점
3rd row노벤코 마린 앤드 오프쇼어
4th row동아인디아
5th row수출입은행
ValueCountFrequency (%)
삼성전자 18
 
0.2%
lg전자 16
 
0.2%
현대종합상사 12
 
0.1%
아시아나항공 10
 
0.1%
한진해운 10
 
0.1%
삼성전자㈜ 10
 
0.1%
현대건설 9
 
0.1%
포스코대우 9
 
0.1%
현대상선 8
 
0.1%
범한판토스 8
 
0.1%
Other values (9611) 10111
98.9%
2023-12-13T06:32:51.530559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3489
 
3.2%
3274
 
3.0%
/ 2960
 
2.7%
2755
 
2.5%
2661
 
2.4%
2648
 
2.4%
2625
 
2.4%
2521
 
2.3%
2500
 
2.3%
2078
 
1.9%
Other values (1825) 82758
75.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 97982
88.9%
Uppercase Letter 3359
 
3.0%
Other Punctuation 3127
 
2.8%
Open Punctuation 1848
 
1.7%
Close Punctuation 1844
 
1.7%
Lowercase Letter 979
 
0.9%
Other Symbol 625
 
0.6%
Space Separator 346
 
0.3%
Decimal Number 104
 
0.1%
Dash Punctuation 54
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3489
 
3.6%
3274
 
3.3%
2755
 
2.8%
2661
 
2.7%
2648
 
2.7%
2625
 
2.7%
2521
 
2.6%
2500
 
2.6%
2078
 
2.1%
1777
 
1.8%
Other values (1751) 71654
73.1%
Uppercase Letter
ValueCountFrequency (%)
S 515
15.3%
L 319
 
9.5%
G 318
 
9.5%
C 299
 
8.9%
K 295
 
8.8%
T 172
 
5.1%
A 146
 
4.3%
I 139
 
4.1%
D 123
 
3.7%
J 114
 
3.4%
Other values (16) 919
27.4%
Lowercase Letter
ValueCountFrequency (%)
n 124
12.7%
i 108
11.0%
e 101
10.3%
a 101
10.3%
o 84
 
8.6%
t 70
 
7.2%
r 54
 
5.5%
h 41
 
4.2%
l 38
 
3.9%
s 37
 
3.8%
Other values (14) 221
22.6%
Decimal Number
ValueCountFrequency (%)
1 35
33.7%
2 17
16.3%
3 13
 
12.5%
0 13
 
12.5%
4 7
 
6.7%
9 7
 
6.7%
5 5
 
4.8%
6 3
 
2.9%
8 3
 
2.9%
7 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
/ 2960
94.7%
& 109
 
3.5%
. 49
 
1.6%
, 4
 
0.1%
· 3
 
0.1%
: 1
 
< 0.1%
' 1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
624
99.8%
1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 1848
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1844
100.0%
Space Separator
ValueCountFrequency (%)
346
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 54
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 73571
66.7%
Han 25036
 
22.7%
Common 7324
 
6.6%
Latin 4338
 
3.9%

Most frequent character per script

Han
ValueCountFrequency (%)
2661
 
10.6%
2648
 
10.6%
2521
 
10.1%
2500
 
10.0%
772
 
3.1%
628
 
2.5%
420
 
1.7%
391
 
1.6%
359
 
1.4%
315
 
1.3%
Other values (945) 11821
47.2%
Hangul
ValueCountFrequency (%)
3489
 
4.7%
3274
 
4.5%
2755
 
3.7%
2625
 
3.6%
2078
 
2.8%
1777
 
2.4%
1213
 
1.6%
1175
 
1.6%
1134
 
1.5%
1034
 
1.4%
Other values (798) 53017
72.1%
Latin
ValueCountFrequency (%)
S 515
 
11.9%
L 319
 
7.4%
G 318
 
7.3%
C 299
 
6.9%
K 295
 
6.8%
T 172
 
4.0%
A 146
 
3.4%
I 139
 
3.2%
n 124
 
2.9%
D 123
 
2.8%
Other values (40) 1888
43.5%
Common
ValueCountFrequency (%)
/ 2960
40.4%
( 1848
25.2%
) 1844
25.2%
346
 
4.7%
& 109
 
1.5%
- 54
 
0.7%
. 49
 
0.7%
1 35
 
0.5%
2 17
 
0.2%
3 13
 
0.2%
Other values (12) 49
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 72946
66.2%
CJK 24997
 
22.7%
ASCII 11659
 
10.6%
None 628
 
0.6%
CJK Compat Ideographs 39
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3489
 
4.8%
3274
 
4.5%
2755
 
3.8%
2625
 
3.6%
2078
 
2.8%
1777
 
2.4%
1213
 
1.7%
1175
 
1.6%
1134
 
1.6%
1034
 
1.4%
Other values (796) 52392
71.8%
ASCII
ValueCountFrequency (%)
/ 2960
25.4%
( 1848
15.9%
) 1844
15.8%
S 515
 
4.4%
346
 
3.0%
L 319
 
2.7%
G 318
 
2.7%
C 299
 
2.6%
K 295
 
2.5%
T 172
 
1.5%
Other values (61) 2743
23.5%
CJK
ValueCountFrequency (%)
2661
 
10.6%
2648
 
10.6%
2521
 
10.1%
2500
 
10.0%
772
 
3.1%
628
 
2.5%
420
 
1.7%
391
 
1.6%
359
 
1.4%
315
 
1.3%
Other values (934) 11782
47.1%
None
ValueCountFrequency (%)
624
99.4%
· 3
 
0.5%
1
 
0.2%
CJK Compat Ideographs
ValueCountFrequency (%)
17
43.6%
11
28.2%
2
 
5.1%
2
 
5.1%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%

회사명(영문)
Text

MISSING 

Distinct9661
Distinct (%)97.7%
Missing116
Missing (%)1.2%
Memory size156.2 KiB
2023-12-13T06:32:51.945813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length97
Median length66
Mean length26.21641
Min length2

Characters and Unicode

Total characters259123
Distinct characters82
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9520 ?
Unique (%)96.3%

Sample

1st rowKOINA Co., Ltd.
2nd rowAsiana Airlines Repesentative Moscow Cargo Sales Office
3rd rowNovenco Marine & Offishore A/S
4th rowDong-A India
5th rowKorea Eximbank
ValueCountFrequency (%)
ltd 4962
 
12.4%
co 4405
 
11.0%
vina 688
 
1.7%
office 517
 
1.3%
inc 504
 
1.3%
electronics 441
 
1.1%
410
 
1.0%
vietnam 351
 
0.9%
corp 343
 
0.9%
shanghai 322
 
0.8%
Other values (8058) 27179
67.7%
2023-12-13T06:32:52.573538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30870
 
11.9%
n 19136
 
7.4%
a 16786
 
6.5%
o 16753
 
6.5%
i 15286
 
5.9%
e 13816
 
5.3%
t 13027
 
5.0%
. 12638
 
4.9%
r 8615
 
3.3%
d 8553
 
3.3%
Other values (72) 103643
40.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 161245
62.2%
Uppercase Letter 46222
 
17.8%
Space Separator 30874
 
11.9%
Other Punctuation 18367
 
7.1%
Open Punctuation 915
 
0.4%
Close Punctuation 914
 
0.4%
Dash Punctuation 444
 
0.2%
Decimal Number 131
 
0.1%
Final Punctuation 5
 
< 0.1%
Other Letter 4
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 19136
11.9%
a 16786
10.4%
o 16753
10.4%
i 15286
9.5%
e 13816
 
8.6%
t 13027
 
8.1%
r 8615
 
5.3%
d 8553
 
5.3%
c 6407
 
4.0%
s 6207
 
3.8%
Other values (17) 36659
22.7%
Uppercase Letter
ValueCountFrequency (%)
C 7660
16.6%
L 6527
14.1%
S 4505
 
9.7%
T 2335
 
5.1%
I 2207
 
4.8%
H 1940
 
4.2%
A 1857
 
4.0%
E 1830
 
4.0%
P 1750
 
3.8%
K 1748
 
3.8%
Other values (16) 13863
30.0%
Decimal Number
ValueCountFrequency (%)
1 35
26.7%
2 27
20.6%
3 22
16.8%
0 13
 
9.9%
4 12
 
9.2%
5 7
 
5.3%
9 6
 
4.6%
8 4
 
3.1%
6 4
 
3.1%
7 1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
. 12638
68.8%
, 4937
 
26.9%
& 709
 
3.9%
' 51
 
0.3%
/ 19
 
0.1%
7
 
< 0.1%
· 4
 
< 0.1%
! 1
 
< 0.1%
: 1
 
< 0.1%
Other Letter
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Space Separator
ValueCountFrequency (%)
30870
> 99.9%
  4
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 915
100.0%
Close Punctuation
ValueCountFrequency (%)
) 914
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 444
100.0%
Final Punctuation
ValueCountFrequency (%)
5
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 207467
80.1%
Common 51652
 
19.9%
Hangul 4
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 19136
 
9.2%
a 16786
 
8.1%
o 16753
 
8.1%
i 15286
 
7.4%
e 13816
 
6.7%
t 13027
 
6.3%
r 8615
 
4.2%
d 8553
 
4.1%
C 7660
 
3.7%
L 6527
 
3.1%
Other values (43) 81308
39.2%
Common
ValueCountFrequency (%)
30870
59.8%
. 12638
24.5%
, 4937
 
9.6%
( 915
 
1.8%
) 914
 
1.8%
& 709
 
1.4%
- 444
 
0.9%
' 51
 
0.1%
1 35
 
0.1%
2 27
 
0.1%
Other values (16) 112
 
0.2%
Hangul
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 259098
> 99.9%
None 16
 
< 0.1%
Punctuation 5
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Hangul 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30870
 
11.9%
n 19136
 
7.4%
a 16786
 
6.5%
o 16753
 
6.5%
i 15286
 
5.9%
e 13816
 
5.3%
t 13027
 
5.0%
. 12638
 
4.9%
r 8615
 
3.3%
d 8553
 
3.3%
Other values (64) 103618
40.0%
None
ValueCountFrequency (%)
7
43.8%
· 4
25.0%
  4
25.0%
ł 1
 
6.2%
Punctuation
ValueCountFrequency (%)
5
100.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

주소
Text

Distinct9639
Distinct (%)96.6%
Missing26
Missing (%)0.3%
Memory size156.2 KiB
2023-12-13T06:32:53.342184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length199
Median length127
Mean length53.744335
Min length1

Characters and Unicode

Total characters536046
Distinct characters937
Distinct categories16 ?
Distinct scripts5 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9415 ?
Unique (%)94.4%

Sample

1st row中上海市古北新水城南路55明珠大801室(:201103)
2nd row Office 29, Block 2, Berezovay Alley 28, Domodedovo District St., District, Airport Domodedovo, Moscow142015., Russia
3rd rowIndustivej 22, DK-4700 Naestved, Denmark
4th rowNew No. 55, Thandalam Village, Sriperumbudur Taluk, Kanchipuram District, Chennai, India
5th rowMenara Mulia, Suite 2007 20th Fl Jl. Jend. Gatot Subroto Kav. 9-11 Jakarta Selatan 12930, Indonesia
ValueCountFrequency (%)
vietnam 1816
 
2.2%
dist 1273
 
1.5%
st 990
 
1.2%
road 903
 
1.1%
ward 846
 
1.0%
hcmc 768
 
0.9%
binh 729
 
0.9%
city 650
 
0.8%
usa 643
 
0.8%
1 598
 
0.7%
Other values (17911) 74044
88.9%
2023-12-13T06:32:53.998103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
73956
 
13.8%
a 37773
 
7.0%
n 28384
 
5.3%
, 28375
 
5.3%
i 24099
 
4.5%
o 22534
 
4.2%
e 20285
 
3.8%
t 16140
 
3.0%
r 14602
 
2.7%
u 11875
 
2.2%
Other values (927) 258023
48.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 257931
48.1%
Uppercase Letter 78430
 
14.6%
Space Separator 73957
 
13.8%
Decimal Number 53712
 
10.0%
Other Punctuation 38248
 
7.1%
Other Letter 28962
 
5.4%
Dash Punctuation 3771
 
0.7%
Open Punctuation 356
 
0.1%
Close Punctuation 354
 
0.1%
Math Symbol 284
 
0.1%
Other values (6) 41
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3177
 
11.0%
2823
 
9.7%
1763
 
6.1%
1304
 
4.5%
1084
 
3.7%
1045
 
3.6%
801
 
2.8%
724
 
2.5%
651
 
2.2%
537
 
1.9%
Other values (813) 15053
52.0%
Uppercase Letter
ValueCountFrequency (%)
C 7158
 
9.1%
S 6623
 
8.4%
T 6029
 
7.7%
B 5496
 
7.0%
D 5126
 
6.5%
P 4657
 
5.9%
A 4462
 
5.7%
N 4166
 
5.3%
H 3901
 
5.0%
M 3847
 
4.9%
Other values (20) 26965
34.4%
Lowercase Letter
ValueCountFrequency (%)
a 37773
14.6%
n 28384
11.0%
i 24099
 
9.3%
o 22534
 
8.7%
e 20285
 
7.9%
t 16140
 
6.3%
r 14602
 
5.7%
u 11875
 
4.6%
h 11543
 
4.5%
l 10367
 
4.0%
Other values (19) 60329
23.4%
Decimal Number
ValueCountFrequency (%)
1 11480
21.4%
0 9252
17.2%
2 7150
13.3%
3 5182
9.6%
5 4285
 
8.0%
4 3870
 
7.2%
6 3413
 
6.4%
8 3146
 
5.9%
7 3128
 
5.8%
9 2796
 
5.2%
Other values (8) 10
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
, 28375
74.2%
. 8018
 
21.0%
/ 1016
 
2.7%
# 489
 
1.3%
' 118
 
0.3%
& 108
 
0.3%
: 43
 
0.1%
" 40
 
0.1%
31
 
0.1%
6
 
< 0.1%
Other values (3) 4
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
221
77.8%
~ 34
 
12.0%
+ 12
 
4.2%
> 9
 
3.2%
< 8
 
2.8%
Letter Number
ValueCountFrequency (%)
4
57.1%
1
 
14.3%
1
 
14.3%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
73956
> 99.9%
  1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 3765
99.8%
6
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 355
99.7%
[ 1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 353
99.7%
] 1
 
0.3%
Final Punctuation
ValueCountFrequency (%)
8
72.7%
3
 
27.3%
Initial Punctuation
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Connector Punctuation
ValueCountFrequency (%)
_ 14
100.0%
Other Symbol
ValueCountFrequency (%)
° 4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 336372
62.8%
Common 170716
31.8%
Han 28939
 
5.4%
Hangul 18
 
< 0.1%
Cyrillic 1
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
3177
 
11.0%
2823
 
9.8%
1763
 
6.1%
1304
 
4.5%
1084
 
3.7%
1045
 
3.6%
801
 
2.8%
724
 
2.5%
651
 
2.2%
537
 
1.9%
Other values (795) 15030
51.9%
Latin
ValueCountFrequency (%)
a 37773
 
11.2%
n 28384
 
8.4%
i 24099
 
7.2%
o 22534
 
6.7%
e 20285
 
6.0%
t 16140
 
4.8%
r 14602
 
4.3%
u 11875
 
3.5%
h 11543
 
3.4%
l 10367
 
3.1%
Other values (53) 138770
41.3%
Common
ValueCountFrequency (%)
73956
43.3%
, 28375
 
16.6%
1 11480
 
6.7%
0 9252
 
5.4%
. 8018
 
4.7%
2 7150
 
4.2%
3 5182
 
3.0%
5 4285
 
2.5%
4 3870
 
2.3%
- 3765
 
2.2%
Other values (41) 15383
 
9.0%
Hangul
ValueCountFrequency (%)
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (7) 7
38.9%
Cyrillic
ValueCountFrequency (%)
А 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 506764
94.5%
CJK 28901
 
5.4%
None 302
 
0.1%
CJK Compat Ideographs 38
 
< 0.1%
Hangul 18
 
< 0.1%
Punctuation 15
 
< 0.1%
Number Forms 7
 
< 0.1%
Cyrillic 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
73956
 
14.6%
a 37773
 
7.5%
n 28384
 
5.6%
, 28375
 
5.6%
i 24099
 
4.8%
o 22534
 
4.4%
e 20285
 
4.0%
t 16140
 
3.2%
r 14602
 
2.9%
u 11875
 
2.3%
Other values (75) 228741
45.1%
CJK
ValueCountFrequency (%)
3177
 
11.0%
2823
 
9.8%
1763
 
6.1%
1304
 
4.5%
1084
 
3.8%
1045
 
3.6%
801
 
2.8%
724
 
2.5%
651
 
2.3%
537
 
1.9%
Other values (780) 14992
51.9%
None
ValueCountFrequency (%)
221
73.2%
31
 
10.3%
ł 8
 
2.6%
6
 
2.0%
ı 6
 
2.0%
6
 
2.0%
º 5
 
1.7%
° 4
 
1.3%
2
 
0.7%
2
 
0.7%
Other values (11) 11
 
3.6%
CJK Compat Ideographs
ValueCountFrequency (%)
9
23.7%
6
15.8%
6
15.8%
3
 
7.9%
3
 
7.9%
2
 
5.3%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
Other values (5) 5
13.2%
Punctuation
ValueCountFrequency (%)
8
53.3%
3
 
20.0%
3
 
20.0%
1
 
6.7%
Number Forms
ValueCountFrequency (%)
4
57.1%
1
 
14.3%
1
 
14.3%
1
 
14.3%
Hangul
ValueCountFrequency (%)
2
 
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (7) 7
38.9%
Cyrillic
ValueCountFrequency (%)
А 1
100.0%

홈페이지
Text

MISSING 

Distinct4367
Distinct (%)70.8%
Missing3833
Missing (%)38.3%
Memory size156.2 KiB
2023-12-13T06:32:54.299935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length100
Median length47
Mean length16.889574
Min length8

Characters and Unicode

Total characters104158
Distinct characters61
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3717 ?
Unique (%)60.3%

Sample

1st rowwww.koina.net
2nd rowwww.asianacargo.com
3rd rowwww.novencogroup.com
4th rowwww.dacm.com
5th rowwww.cheil.com
ValueCountFrequency (%)
www.hanjin.com 44
 
0.7%
www.koreanair.com 43
 
0.7%
www.samsung.com 40
 
0.6%
www.daewoo.com 35
 
0.6%
www.flyasiana.com 25
 
0.4%
www.samsungcnt.com 24
 
0.4%
www.hmm21.com 23
 
0.4%
www.hyundaicorp.com 21
 
0.3%
www.cj.net 20
 
0.3%
www.pantos.com 19
 
0.3%
Other values (4320) 5893
95.2%
2023-12-13T06:32:54.730581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
w 19212
18.4%
. 14858
14.3%
o 9589
 
9.2%
c 8097
 
7.8%
m 5358
 
5.1%
n 4991
 
4.8%
a 4666
 
4.5%
e 4204
 
4.0%
r 4160
 
4.0%
s 3690
 
3.5%
Other values (51) 25333
24.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 87866
84.4%
Other Punctuation 15052
 
14.5%
Dash Punctuation 476
 
0.5%
Decimal Number 421
 
0.4%
Space Separator 235
 
0.2%
Uppercase Letter 102
 
0.1%
Connector Punctuation 5
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 19212
21.9%
o 9589
10.9%
c 8097
 
9.2%
m 5358
 
6.1%
n 4991
 
5.7%
a 4666
 
5.3%
e 4204
 
4.8%
r 4160
 
4.7%
s 3690
 
4.2%
k 3462
 
3.9%
Other values (20) 20437
23.3%
Uppercase Letter
ValueCountFrequency (%)
S 31
30.4%
J 20
19.6%
C 18
17.6%
K 8
 
7.8%
I 6
 
5.9%
P 5
 
4.9%
O 5
 
4.9%
L 3
 
2.9%
E 2
 
2.0%
M 1
 
1.0%
Other values (3) 3
 
2.9%
Decimal Number
ValueCountFrequency (%)
1 130
30.9%
2 108
25.7%
0 45
 
10.7%
3 26
 
6.2%
4 25
 
5.9%
8 24
 
5.7%
6 17
 
4.0%
9 17
 
4.0%
7 16
 
3.8%
5 13
 
3.1%
Other Punctuation
ValueCountFrequency (%)
. 14858
98.7%
/ 171
 
1.1%
, 23
 
0.2%
Space Separator
ValueCountFrequency (%)
230
97.9%
  5
 
2.1%
Dash Punctuation
ValueCountFrequency (%)
- 476
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 87962
84.5%
Common 16190
 
15.5%
Cyrillic 6
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 19212
21.8%
o 9589
10.9%
c 8097
 
9.2%
m 5358
 
6.1%
n 4991
 
5.7%
a 4666
 
5.3%
e 4204
 
4.8%
r 4160
 
4.7%
s 3690
 
4.2%
k 3462
 
3.9%
Other values (29) 20533
23.3%
Common
ValueCountFrequency (%)
. 14858
91.8%
- 476
 
2.9%
230
 
1.4%
/ 171
 
1.1%
1 130
 
0.8%
2 108
 
0.7%
0 45
 
0.3%
3 26
 
0.2%
4 25
 
0.2%
8 24
 
0.1%
Other values (8) 97
 
0.6%
Cyrillic
ValueCountFrequency (%)
у 2
33.3%
р 2
33.3%
э 1
16.7%
к 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 104147
> 99.9%
Cyrillic 6
 
< 0.1%
None 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 19212
18.4%
. 14858
14.3%
o 9589
 
9.2%
c 8097
 
7.8%
m 5358
 
5.1%
n 4991
 
4.8%
a 4666
 
4.5%
e 4204
 
4.0%
r 4160
 
4.0%
s 3690
 
3.5%
Other values (46) 25322
24.3%
None
ValueCountFrequency (%)
  5
100.0%
Cyrillic
ValueCountFrequency (%)
у 2
33.3%
р 2
33.3%
э 1
16.7%
к 1
16.7%

진출년도
Text

MISSING 

Distinct68
Distinct (%)0.7%
Missing435
Missing (%)4.3%
Memory size156.2 KiB
2023-12-13T06:32:54.997200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length4
Mean length4.0026137
Min length4

Characters and Unicode

Total characters38285
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)0.2%

Sample

1st row2003
2nd row2009
3rd row2013
4th row2002
5th row1992
ValueCountFrequency (%)
2007 658
 
6.9%
2006 640
 
6.7%
2005 604
 
6.3%
2008 564
 
5.9%
2004 532
 
5.6%
2003 526
 
5.5%
2002 460
 
4.8%
2011 428
 
4.5%
2009 422
 
4.4%
2010 396
 
4.1%
Other values (58) 4335
45.3%
2023-12-13T06:32:55.413733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 13198
34.5%
2 8354
21.8%
1 5443
14.2%
9 4664
 
12.2%
5 1187
 
3.1%
7 1142
 
3.0%
4 1107
 
2.9%
8 1095
 
2.9%
3 1095
 
2.9%
6 995
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 38280
> 99.9%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 13198
34.5%
2 8354
21.8%
1 5443
14.2%
9 4664
 
12.2%
5 1187
 
3.1%
7 1142
 
3.0%
4 1107
 
2.9%
8 1095
 
2.9%
3 1095
 
2.9%
6 995
 
2.6%
Other Punctuation
ValueCountFrequency (%)
/ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 38285
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 13198
34.5%
2 8354
21.8%
1 5443
14.2%
9 4664
 
12.2%
5 1187
 
3.1%
7 1142
 
3.0%
4 1107
 
2.9%
8 1095
 
2.9%
3 1095
 
2.9%
6 995
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 38285
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 13198
34.5%
2 8354
21.8%
1 5443
14.2%
9 4664
 
12.2%
5 1187
 
3.1%
7 1142
 
3.0%
4 1107
 
2.9%
8 1095
 
2.9%
3 1095
 
2.9%
6 995
 
2.6%

진출형태
Categorical

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
생산법인
4400 
서비스법인
2064 
판매법인
1417 
지점
861 
연락사무소
833 
Other values (4)
 
425

Length

Max length6
Median length4
Mean length4.1184
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row서비스법인
2nd row지점
3rd row판매법인
4th row생산법인
5th row서비스법인

Common Values

ValueCountFrequency (%)
생산법인 4400
44.0%
서비스법인 2064
20.6%
판매법인 1417
 
14.2%
지점 861
 
8.6%
연락사무소 833
 
8.3%
<NA> 419
 
4.2%
생산법인 3
 
< 0.1%
연락사무소 2
 
< 0.1%
서비스법인 1
 
< 0.1%

Length

2023-12-13T06:32:55.592325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:32:55.753817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
생산법인 4403
44.0%
서비스법인 2065
20.6%
판매법인 1417
 
14.2%
지점 861
 
8.6%
연락사무소 835
 
8.3%
na 419
 
4.2%

투자형태
Categorical

IMBALANCE 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
단독
8476 
합작
 
662
<NA>
 
483
합자
 
340
M&A
 
21
Other values (4)
 
18

Length

Max length6
Median length2
Mean length2.1007
Min length2

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row합자
2nd row단독
3rd row합작
4th row단독
5th row합작

Common Values

ValueCountFrequency (%)
단독 8476
84.8%
합작 662
 
6.6%
<NA> 483
 
4.8%
합자 340
 
3.4%
M&A 21
 
0.2%
단독 15
 
0.1%
단독 1
 
< 0.1%
없음 1
 
< 0.1%
합자 1
 
< 0.1%

Length

2023-12-13T06:32:55.948004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:32:56.086818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
단독 8492
84.9%
합작 662
 
6.6%
na 483
 
4.8%
합자 341
 
3.4%
m&a 21
 
0.2%
없음 1
 
< 0.1%
Distinct51
Distinct (%)17.2%
Missing9703
Missing (%)97.0%
Memory size156.2 KiB
2023-12-13T06:32:56.263878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length2.9158249
Min length2

Characters and Unicode

Total characters866
Distinct characters19
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)8.4%

Sample

1st row85%
2nd row50%
3rd row40%
4th row49%
5th row50%
ValueCountFrequency (%)
50 67
22.5%
51 39
13.1%
49 24
 
8.1%
30 23
 
7.7%
10 17
 
5.7%
0 15
 
5.0%
20 13
 
4.4%
40 9
 
3.0%
5 9
 
3.0%
60 8
 
2.7%
Other values (42) 74
24.8%
2023-12-13T06:32:56.586761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
% 292
33.7%
0 168
19.4%
5 137
15.8%
1 70
 
8.1%
4 47
 
5.4%
3 41
 
4.7%
9 33
 
3.8%
2 24
 
2.8%
6 18
 
2.1%
7 13
 
1.5%
Other values (9) 23
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 557
64.3%
Other Punctuation 292
33.7%
Other Letter 16
 
1.8%
Space Separator 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 168
30.2%
5 137
24.6%
1 70
12.6%
4 47
 
8.4%
3 41
 
7.4%
9 33
 
5.9%
2 24
 
4.3%
6 18
 
3.2%
7 13
 
2.3%
8 6
 
1.1%
Other Letter
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
% 292
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 850
98.2%
Hangul 16
 
1.8%

Most frequent character per script

Common
ValueCountFrequency (%)
% 292
34.4%
0 168
19.8%
5 137
16.1%
1 70
 
8.2%
4 47
 
5.5%
3 41
 
4.8%
9 33
 
3.9%
2 24
 
2.8%
6 18
 
2.1%
7 13
 
1.5%
Other values (2) 7
 
0.8%
Hangul
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 850
98.2%
Hangul 16
 
1.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
% 292
34.4%
0 168
19.8%
5 137
16.1%
1 70
 
8.2%
4 47
 
5.5%
3 41
 
4.8%
9 33
 
3.9%
2 24
 
2.8%
6 18
 
2.1%
7 13
 
1.5%
Other values (2) 7
 
0.8%
Hangul
ValueCountFrequency (%)
4
25.0%
4
25.0%
4
25.0%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%

업종1
Categorical

Distinct16
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
제조업
4825 
도매 및 소매업
1749 
서비스업
1346 
운수업
706 
건설·공사업
583 
Other values (11)
791 

Length

Max length10
Median length3
Mean length4.345
Min length2

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row서비스업
2nd row운수업
3rd row도매·소매업
4th row제조업
5th row기타

Common Values

ValueCountFrequency (%)
제조업 4825
48.2%
도매 및 소매업 1749
 
17.5%
서비스업 1346
 
13.5%
운수업 706
 
7.1%
건설·공사업 583
 
5.8%
기타 300
 
3.0%
금융·보험업 270
 
2.7%
부동산 및 임대업 70
 
0.7%
광업·자원개발 66
 
0.7%
농업·임업 및 어업 54
 
0.5%
Other values (6) 31
 
0.3%

Length

2023-12-13T06:32:56.755479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제조업 4830
35.1%
1873
 
13.6%
소매업 1749
 
12.7%
도매 1749
 
12.7%
서비스업 1347
 
9.8%
운수업 706
 
5.1%
건설·공사업 583
 
4.2%
기타 300
 
2.2%
금융·보험업 270
 
2.0%
임대업 70
 
0.5%
Other values (8) 269
 
2.0%

업종2
Text

MISSING 

Distinct61
Distinct (%)1.0%
Missing3709
Missing (%)37.1%
Memory size156.2 KiB
2023-12-13T06:32:56.984565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length8.3463678
Min length2

Characters and Unicode

Total characters52507
Distinct characters122
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)0.2%

Sample

1st row출판·영상·방송통신 및 정보
2nd rowMarine ventilation
3rd row고무·플라스틱
4th row공공행정
5th row출판·영상·방송통신 및 정보
ValueCountFrequency (%)
전기·전자·정밀기기·부품 1164
 
11.7%
1022
 
10.3%
기타 803
 
8.1%
의복·잡화류 648
 
6.5%
제조업 503
 
5.1%
제품 500
 
5.0%
자동차·자동차부품 478
 
4.8%
섬유·피혁 444
 
4.5%
기계·장비 378
 
3.8%
전문·과학 337
 
3.4%
Other values (63) 3636
36.7%
2023-12-13T06:32:57.379705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
· 7165
 
13.6%
5048
 
9.6%
3654
 
7.0%
2680
 
5.1%
2288
 
4.4%
2164
 
4.1%
1729
 
3.3%
1520
 
2.9%
1164
 
2.2%
1065
 
2.0%
Other values (112) 24030
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41624
79.3%
Other Punctuation 7165
 
13.6%
Space Separator 3654
 
7.0%
Lowercase Letter 26
 
< 0.1%
Open Punctuation 18
 
< 0.1%
Close Punctuation 18
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5048
 
12.1%
2680
 
6.4%
2288
 
5.5%
2164
 
5.2%
1729
 
4.2%
1520
 
3.7%
1164
 
2.8%
1065
 
2.6%
1061
 
2.5%
1024
 
2.5%
Other values (95) 21881
52.6%
Lowercase Letter
ValueCountFrequency (%)
i 4
15.4%
n 4
15.4%
e 3
11.5%
t 3
11.5%
a 2
7.7%
r 2
7.7%
l 2
7.7%
o 2
7.7%
c 2
7.7%
v 1
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
M 1
50.0%
E 1
50.0%
Other Punctuation
ValueCountFrequency (%)
· 7165
100.0%
Space Separator
ValueCountFrequency (%)
3654
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41624
79.3%
Common 10855
 
20.7%
Latin 28
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5048
 
12.1%
2680
 
6.4%
2288
 
5.5%
2164
 
5.2%
1729
 
4.2%
1520
 
3.7%
1164
 
2.8%
1065
 
2.6%
1061
 
2.5%
1024
 
2.5%
Other values (95) 21881
52.6%
Latin
ValueCountFrequency (%)
i 4
14.3%
n 4
14.3%
e 3
10.7%
t 3
10.7%
a 2
7.1%
r 2
7.1%
l 2
7.1%
o 2
7.1%
c 2
7.1%
M 1
 
3.6%
Other values (3) 3
10.7%
Common
ValueCountFrequency (%)
· 7165
66.0%
3654
33.7%
( 18
 
0.2%
) 18
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41584
79.2%
None 7165
 
13.6%
ASCII 3718
 
7.1%
Compat Jamo 40
 
0.1%

Most frequent character per block

None
ValueCountFrequency (%)
· 7165
100.0%
Hangul
ValueCountFrequency (%)
5048
 
12.1%
2680
 
6.4%
2288
 
5.5%
2164
 
5.2%
1729
 
4.2%
1520
 
3.7%
1164
 
2.8%
1065
 
2.6%
1061
 
2.6%
1024
 
2.5%
Other values (94) 21841
52.5%
ASCII
ValueCountFrequency (%)
3654
98.3%
( 18
 
0.5%
) 18
 
0.5%
i 4
 
0.1%
n 4
 
0.1%
e 3
 
0.1%
t 3
 
0.1%
a 2
 
0.1%
r 2
 
0.1%
l 2
 
0.1%
Other values (6) 8
 
0.2%
Compat Jamo
ValueCountFrequency (%)
40
100.0%

취급분야
Text

MISSING 

Distinct7788
Distinct (%)80.3%
Missing304
Missing (%)3.0%
Memory size156.2 KiB
2023-12-13T06:32:57.736943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length200
Median length98
Mean length13.556312
Min length1

Characters and Unicode

Total characters131442
Distinct characters863
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7179 ?
Unique (%)74.0%

Sample

1st row종합포털사이트, 모바일컨텐츠
2nd row항공화물운송
3rd row선박용 공조시스템 제조 및 판매
4th row자동차, 세탁기, 산업제품 등에 사용되는 고무제품
5th rowLeasing, 금융
ValueCountFrequency (%)
1452
 
4.6%
제조 544
 
1.7%
438
 
1.4%
부품 419
 
1.3%
판매 394
 
1.2%
생산 331
 
1.0%
자동차 322
 
1.0%
의류 302
 
1.0%
서비스 237
 
0.8%
건설 198
 
0.6%
Other values (9452) 26895
85.3%
2023-12-13T06:32:58.299878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23115
 
17.6%
, 7502
 
5.7%
2182
 
1.7%
2098
 
1.6%
2069
 
1.6%
1986
 
1.5%
1759
 
1.3%
1475
 
1.1%
1384
 
1.1%
1212
 
0.9%
Other values (853) 86660
65.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 82980
63.1%
Space Separator 23121
 
17.6%
Lowercase Letter 8642
 
6.6%
Other Punctuation 8446
 
6.4%
Uppercase Letter 6039
 
4.6%
Open Punctuation 949
 
0.7%
Close Punctuation 948
 
0.7%
Decimal Number 200
 
0.2%
Dash Punctuation 88
 
0.1%
Math Symbol 10
 
< 0.1%
Other values (4) 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2182
 
2.6%
2098
 
2.5%
2069
 
2.5%
1986
 
2.4%
1759
 
2.1%
1475
 
1.8%
1384
 
1.7%
1212
 
1.5%
1143
 
1.4%
1123
 
1.4%
Other values (765) 66549
80.2%
Lowercase Letter
ValueCountFrequency (%)
e 1040
12.0%
i 740
 
8.6%
a 739
 
8.6%
r 731
 
8.5%
o 707
 
8.2%
t 691
 
8.0%
n 688
 
8.0%
s 546
 
6.3%
l 536
 
6.2%
c 343
 
4.0%
Other values (16) 1881
21.8%
Uppercase Letter
ValueCountFrequency (%)
C 657
 
10.9%
P 641
 
10.6%
T 502
 
8.3%
S 451
 
7.5%
E 393
 
6.5%
D 367
 
6.1%
L 323
 
5.3%
A 313
 
5.2%
I 310
 
5.1%
M 291
 
4.8%
Other values (16) 1791
29.7%
Other Punctuation
ValueCountFrequency (%)
, 7502
88.8%
/ 545
 
6.5%
& 127
 
1.5%
· 115
 
1.4%
. 71
 
0.8%
: 47
 
0.6%
' 23
 
0.3%
% 9
 
0.1%
4
 
< 0.1%
; 3
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 52
26.0%
1 40
20.0%
3 38
19.0%
2 28
14.0%
6 10
 
5.0%
4 9
 
4.5%
5 8
 
4.0%
8 7
 
3.5%
7 5
 
2.5%
9 3
 
1.5%
Math Symbol
ValueCountFrequency (%)
~ 5
50.0%
> 2
 
20.0%
+ 2
 
20.0%
| 1
 
10.0%
Space Separator
ValueCountFrequency (%)
23115
> 99.9%
  6
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 948
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 947
99.9%
] 1
 
0.1%
Final Punctuation
ValueCountFrequency (%)
4
57.1%
3
42.9%
Dash Punctuation
ValueCountFrequency (%)
- 88
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 82985
63.1%
Common 33775
25.7%
Latin 14681
 
11.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2182
 
2.6%
2098
 
2.5%
2069
 
2.5%
1986
 
2.4%
1759
 
2.1%
1475
 
1.8%
1384
 
1.7%
1212
 
1.5%
1143
 
1.4%
1123
 
1.4%
Other values (765) 66554
80.2%
Latin
ValueCountFrequency (%)
e 1040
 
7.1%
i 740
 
5.0%
a 739
 
5.0%
r 731
 
5.0%
o 707
 
4.8%
t 691
 
4.7%
n 688
 
4.7%
C 657
 
4.5%
P 641
 
4.4%
s 546
 
3.7%
Other values (42) 7501
51.1%
Common
ValueCountFrequency (%)
23115
68.4%
, 7502
 
22.2%
( 948
 
2.8%
) 947
 
2.8%
/ 545
 
1.6%
& 127
 
0.4%
· 115
 
0.3%
- 88
 
0.3%
. 71
 
0.2%
0 52
 
0.2%
Other values (25) 265
 
0.8%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 82971
63.1%
ASCII 48322
36.8%
None 131
 
0.1%
Punctuation 9
 
< 0.1%
Compat Jamo 8
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23115
47.8%
, 7502
 
15.5%
e 1040
 
2.2%
( 948
 
2.0%
) 947
 
2.0%
i 740
 
1.5%
a 739
 
1.5%
r 731
 
1.5%
o 707
 
1.5%
t 691
 
1.4%
Other values (71) 11162
23.1%
Hangul
ValueCountFrequency (%)
2182
 
2.6%
2098
 
2.5%
2069
 
2.5%
1986
 
2.4%
1759
 
2.1%
1475
 
1.8%
1384
 
1.7%
1212
 
1.5%
1143
 
1.4%
1123
 
1.4%
Other values (762) 66540
80.2%
None
ValueCountFrequency (%)
· 115
87.8%
  6
 
4.6%
6
 
4.6%
4
 
3.1%
Compat Jamo
ValueCountFrequency (%)
7
87.5%
1
 
12.5%
Punctuation
ValueCountFrequency (%)
4
44.4%
3
33.3%
2
22.2%
CJK
ValueCountFrequency (%)
1
100.0%

본사파견
Text

MISSING 

Distinct96
Distinct (%)1.3%
Missing2782
Missing (%)27.8%
Memory size156.2 KiB
2023-12-13T06:32:58.484658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length1
Mean length1.092962
Min length1

Characters and Unicode

Total characters7889
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)0.6%

Sample

1st row2
2nd row1
3rd row3
4th row2
5th row4
ValueCountFrequency (%)
1 2323
32.2%
2 1485
20.6%
3 988
13.7%
4 620
 
8.6%
5 421
 
5.8%
6 256
 
3.5%
7 175
 
2.4%
8 140
 
1.9%
10 136
 
1.9%
9 100
 
1.4%
Other values (84) 574
 
8.0%
2023-12-13T06:32:58.798605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 2808
35.6%
2 1642
20.8%
3 1080
 
13.7%
4 679
 
8.6%
5 529
 
6.7%
0 338
 
4.3%
6 308
 
3.9%
7 217
 
2.8%
8 177
 
2.2%
9 108
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7886
> 99.9%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2808
35.6%
2 1642
20.8%
3 1080
 
13.7%
4 679
 
8.6%
5 529
 
6.7%
0 338
 
4.3%
6 308
 
3.9%
7 217
 
2.8%
8 177
 
2.2%
9 108
 
1.4%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7889
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 2808
35.6%
2 1642
20.8%
3 1080
 
13.7%
4 679
 
8.6%
5 529
 
6.7%
0 338
 
4.3%
6 308
 
3.9%
7 217
 
2.8%
8 177
 
2.2%
9 108
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7889
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 2808
35.6%
2 1642
20.8%
3 1080
 
13.7%
4 679
 
8.6%
5 529
 
6.7%
0 338
 
4.3%
6 308
 
3.9%
7 217
 
2.8%
8 177
 
2.2%
9 108
 
1.4%

현지채용
Real number (ℝ)

MISSING  SKEWED 

Distinct608
Distinct (%)7.2%
Missing1541
Missing (%)15.4%
Infinite0
Infinite (%)0.0%
Mean290.19317
Minimum1
Maximum110000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:32:58.934933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q17
median30
Q3150
95-th percentile1093.7
Maximum110000
Range109999
Interquartile range (IQR)143

Descriptive statistics

Standard deviation1887.669
Coefficient of variation (CV)6.5048705
Kurtosis1574.5517
Mean290.19317
Median Absolute Deviation (MAD)27
Skewness33.453635
Sum2454744
Variance3563294.1
MonotonicityNot monotonic
2023-12-13T06:32:59.055832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 386
 
3.9%
2 380
 
3.8%
3 375
 
3.8%
5 369
 
3.7%
10 305
 
3.0%
4 291
 
2.9%
20 249
 
2.5%
30 241
 
2.4%
6 227
 
2.3%
100 207
 
2.1%
Other values (598) 5429
54.3%
(Missing) 1541
 
15.4%
ValueCountFrequency (%)
1 386
3.9%
2 380
3.8%
3 375
3.8%
4 291
2.9%
5 369
3.7%
6 227
2.3%
7 185
1.8%
8 187
1.9%
9 92
 
0.9%
10 305
3.0%
ValueCountFrequency (%)
110000 1
< 0.1%
55000 1
< 0.1%
42414 1
< 0.1%
42409 1
< 0.1%
42378 1
< 0.1%
42374 1
< 0.1%
36000 1
< 0.1%
23002 1
< 0.1%
22500 1
< 0.1%
22000 1
< 0.1%

내수_수출
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
3892 
수출 및 내수 병행
1715 
100% 주재국 내수시장 공급
1444 
100% 주재국 내수 공급
1345 
100% 제3국 수출
1270 
Other values (2)
 
334

Length

Max length16
Median length15
Mean length9.2312
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row100% 주재국 내수 공급
2nd row<NA>
3rd row수출 및 내수 병행
4th row수출 및 내수 병행
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 3892
38.9%
수출 및 내수 병행 1715
17.2%
100% 주재국 내수시장 공급 1444
 
14.4%
100% 주재국 내수 공급 1345
 
13.5%
100% 제3국 수출 1270
 
12.7%
수출 및 내수 병행 330
 
3.3%
100% 주재국 내수 공급 4
 
< 0.1%

Length

2023-12-13T06:32:59.191790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:32:59.292715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
100 4063
15.0%
na 3892
14.4%
내수 3394
12.5%
수출 3315
12.3%
주재국 2793
10.3%
공급 2793
10.3%
2045
7.6%
병행 2045
7.6%
내수시장 1444
 
5.3%
제3국 1270
 
4.7%

모기업명
Text

MISSING 

Distinct4034
Distinct (%)51.5%
Missing2167
Missing (%)21.7%
Memory size156.2 KiB
2023-12-13T06:32:59.552415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length22
Mean length5.7921614
Min length2

Characters and Unicode

Total characters45370
Distinct characters663
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3346 ?
Unique (%)42.7%

Sample

1st row커이나㈜
2nd row아시아나항공㈜
3rd row하이에어코리아
4th row동아화성㈜
5th row한국수출입은행
ValueCountFrequency (%)
현지단독진출 1237
 
15.5%
삼성전자㈜ 92
 
1.2%
lg전자㈜ 82
 
1.0%
포스코대우㈜ 51
 
0.6%
한진해운㈜ 49
 
0.6%
삼성물산㈜ 48
 
0.6%
포스코㈜ 47
 
0.6%
대한항공㈜ 46
 
0.6%
현대자동차㈜ 41
 
0.5%
신한은행㈜ 40
 
0.5%
Other values (4086) 6242
78.3%
2023-12-13T06:32:59.996871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5082
 
11.2%
1548
 
3.4%
1531
 
3.4%
1508
 
3.3%
1272
 
2.8%
1259
 
2.8%
1245
 
2.7%
1133
 
2.5%
854
 
1.9%
848
 
1.9%
Other values (653) 29090
64.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 36489
80.4%
Other Symbol 5083
 
11.2%
Uppercase Letter 2675
 
5.9%
Lowercase Letter 486
 
1.1%
Space Separator 363
 
0.8%
Other Punctuation 130
 
0.3%
Open Punctuation 52
 
0.1%
Close Punctuation 52
 
0.1%
Dash Punctuation 22
 
< 0.1%
Decimal Number 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1548
 
4.2%
1531
 
4.2%
1508
 
4.1%
1272
 
3.5%
1259
 
3.5%
1245
 
3.4%
1133
 
3.1%
854
 
2.3%
848
 
2.3%
744
 
2.0%
Other values (585) 24547
67.3%
Uppercase Letter
ValueCountFrequency (%)
S 391
14.6%
K 291
10.9%
G 267
10.0%
C 265
9.9%
L 248
 
9.3%
T 139
 
5.2%
E 115
 
4.3%
J 108
 
4.0%
I 107
 
4.0%
B 103
 
3.9%
Other values (16) 641
24.0%
Lowercase Letter
ValueCountFrequency (%)
o 58
11.9%
i 49
10.1%
e 47
9.7%
a 41
 
8.4%
t 36
 
7.4%
r 35
 
7.2%
n 34
 
7.0%
c 27
 
5.6%
s 27
 
5.6%
l 25
 
5.1%
Other values (14) 107
22.0%
Decimal Number
ValueCountFrequency (%)
2 7
38.9%
1 4
22.2%
3 3
16.7%
4 2
 
11.1%
0 1
 
5.6%
8 1
 
5.6%
Other Punctuation
ValueCountFrequency (%)
& 65
50.0%
. 48
36.9%
/ 10
 
7.7%
, 6
 
4.6%
: 1
 
0.8%
Other Symbol
ValueCountFrequency (%)
5082
> 99.9%
1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
362
99.7%
  1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 52
100.0%
Close Punctuation
ValueCountFrequency (%)
) 52
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41572
91.6%
Latin 3161
 
7.0%
Common 637
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5082
 
12.2%
1548
 
3.7%
1531
 
3.7%
1508
 
3.6%
1272
 
3.1%
1259
 
3.0%
1245
 
3.0%
1133
 
2.7%
854
 
2.1%
848
 
2.0%
Other values (587) 25292
60.8%
Latin
ValueCountFrequency (%)
S 391
 
12.4%
K 291
 
9.2%
G 267
 
8.4%
C 265
 
8.4%
L 248
 
7.8%
T 139
 
4.4%
E 115
 
3.6%
J 108
 
3.4%
I 107
 
3.4%
B 103
 
3.3%
Other values (40) 1127
35.7%
Common
ValueCountFrequency (%)
362
56.8%
& 65
 
10.2%
( 52
 
8.2%
) 52
 
8.2%
. 48
 
7.5%
- 22
 
3.5%
/ 10
 
1.6%
2 7
 
1.1%
, 6
 
0.9%
1 4
 
0.6%
Other values (6) 9
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 36489
80.4%
None 5084
 
11.2%
ASCII 3797
 
8.4%

Most frequent character per block

None
ValueCountFrequency (%)
5082
> 99.9%
1
 
< 0.1%
  1
 
< 0.1%
Hangul
ValueCountFrequency (%)
1548
 
4.2%
1531
 
4.2%
1508
 
4.1%
1272
 
3.5%
1259
 
3.5%
1245
 
3.4%
1133
 
3.1%
854
 
2.3%
848
 
2.3%
744
 
2.0%
Other values (585) 24547
67.3%
ASCII
ValueCountFrequency (%)
S 391
 
10.3%
362
 
9.5%
K 291
 
7.7%
G 267
 
7.0%
C 265
 
7.0%
L 248
 
6.5%
T 139
 
3.7%
E 115
 
3.0%
J 108
 
2.8%
I 107
 
2.8%
Other values (55) 1504
39.6%
Distinct123
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:33:00.291624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length6
Mean length6.153
Min length3

Characters and Unicode

Total characters61530
Distinct characters168
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row상하이무역관
2nd row모스크바 무역관
3rd row코펜하겐
4th row첸나이무역관
5th row자카르타무역관
ValueCountFrequency (%)
호치민무역관 1483
 
14.6%
하노이무역관 812
 
8.0%
상하이무역관 755
 
7.4%
칭다오무역관 667
 
6.6%
자카르타무역관 405
 
4.0%
베이징무역관 348
 
3.4%
톈진무역관 334
 
3.3%
방콕무역관 305
 
3.0%
도쿄무역관 225
 
2.2%
뉴욕무역관 194
 
1.9%
Other values (114) 4650
45.7%
2023-12-13T06:33:00.765436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9931
16.1%
9926
16.1%
9926
16.1%
2504
 
4.1%
1641
 
2.7%
1500
 
2.4%
1485
 
2.4%
1483
 
2.4%
969
 
1.6%
938
 
1.5%
Other values (158) 21227
34.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 61315
99.7%
Space Separator 215
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9931
16.2%
9926
16.2%
9926
16.2%
2504
 
4.1%
1641
 
2.7%
1500
 
2.4%
1485
 
2.4%
1483
 
2.4%
969
 
1.6%
938
 
1.5%
Other values (157) 21012
34.3%
Space Separator
ValueCountFrequency (%)
215
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 61315
99.7%
Common 215
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9931
16.2%
9926
16.2%
9926
16.2%
2504
 
4.1%
1641
 
2.7%
1500
 
2.4%
1485
 
2.4%
1483
 
2.4%
969
 
1.6%
938
 
1.5%
Other values (157) 21012
34.3%
Common
ValueCountFrequency (%)
215
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 61315
99.7%
ASCII 215
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9931
16.2%
9926
16.2%
9926
16.2%
2504
 
4.1%
1641
 
2.7%
1500
 
2.4%
1485
 
2.4%
1483
 
2.4%
969
 
1.6%
938
 
1.5%
Other values (157) 21012
34.3%
ASCII
ValueCountFrequency (%)
215
100.0%

Interactions

2023-12-13T06:32:48.297705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:33:00.906942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분국가명진출년도진출형태투자형태외국사 합작지분업종1업종2본사파견현지채용내수_수출
구분1.0001.0000.3120.3280.1230.6580.3280.6750.3880.0590.612
국가명1.0001.0000.5080.5340.4420.7140.7390.8560.7600.1220.849
진출년도0.3120.5081.0000.3010.0000.0000.2610.5190.6030.0280.283
진출형태0.3280.5340.3011.0000.1580.6900.7630.7600.2390.0000.305
투자형태0.1230.4420.0000.1581.0000.9070.1190.1680.0000.0000.162
외국사 합작지분0.6580.7140.0000.6900.9071.0000.6690.0000.167NaN0.207
업종10.3280.7390.2610.7630.1190.6691.0000.9490.1010.0000.376
업종20.6750.8560.5190.7600.1680.0000.9491.0000.0000.0000.562
본사파견0.3880.7600.6030.2390.0000.1670.1010.0001.0000.8040.071
현지채용0.0590.1220.0280.0000.000NaN0.0000.0000.8041.0000.028
내수_수출0.6120.8490.2830.3050.1620.2070.3760.5620.0710.0281.000
2023-12-13T06:33:01.041716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
내수_수출진출형태구분투자형태업종1
내수_수출1.0000.1750.3580.0600.185
진출형태0.1751.0000.1670.0530.458
구분0.3580.1671.0000.0600.140
투자형태0.0600.0530.0601.0000.051
업종10.1850.4580.1400.0511.000
2023-12-13T06:33:01.157578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
현지채용구분진출형태투자형태업종1내수_수출
현지채용1.0000.0290.0000.0000.0000.019
구분0.0291.0000.1670.0600.1400.358
진출형태0.0000.1671.0000.0530.4580.175
투자형태0.0000.0600.0531.0000.0510.060
업종10.0000.1400.4580.0511.0000.185
내수_수출0.0190.3580.1750.0600.1851.000

Missing values

2023-12-13T06:32:48.467419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:32:48.711324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T06:32:48.946667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분국가명진출지역회사명(국문)회사명(영문)주소홈페이지진출년도진출형태투자형태외국사 합작지분업종1업종2취급분야본사파견현지채용내수_수출모기업명관할무역관
7926아시아중국화동-상하이(上海)커이나신식기술유한공사/上海可以拿信息技有限公司KOINA Co., Ltd.中上海市古北新水城南路55明珠大801室(:201103)www.koina.net2003서비스법인합자<NA>서비스업출판·영상·방송통신 및 정보종합포털사이트, 모바일컨텐츠212100% 주재국 내수 공급커이나㈜상하이무역관
9925CIS러시아모스크바(Moscow)아시아나항공모스크바화물지점Asiana Airlines Repesentative Moscow Cargo Sales OfficeOffice 29, Block 2, Berezovay Alley 28, Domodedovo District St., District, Airport Domodedovo, Moscow142015., Russiawww.asianacargo.com2009지점단독<NA>운수업<NA>항공화물운송13<NA>아시아나항공㈜모스크바 무역관
9218유럽덴마크내스트베드(Naestved)노벤코 마린 앤드 오프쇼어Novenco Marine & Offishore A/SIndustivej 22, DK-4700 Naestved, Denmarkwww.novencogroup.com2013판매법인합작<NA>도매·소매업Marine ventilation선박용 공조시스템 제조 및 판매3100수출 및 내수 병행하이에어코리아코펜하겐
5448아시아인도첸나이(Chennai)동아인디아Dong-A IndiaNew No. 55, Thandalam Village, Sriperumbudur Taluk, Kanchipuram District, Chennai, Indiawww.dacm.com2002생산법인단독<NA>제조업고무·플라스틱자동차, 세탁기, 산업제품 등에 사용되는 고무제품21수출 및 내수 병행동아화성㈜첸나이무역관
3720아시아인도네시아자카르타(Jakarta)수출입은행Korea EximbankMenara Mulia, Suite 2007 20th Fl Jl. Jend. Gatot Subroto Kav. 9-11 Jakarta Selatan 12930, Indonesia<NA>1992서비스법인합작85%기타공공행정Leasing, 금융425<NA>한국수출입은행자카르타무역관
11520중동사우디아라비아제다(Jeddah)제일기획㈜제다지사Cheil Worldwide Inc.P.O Box 7864, #703 Sakura Plaza, 7th Floor, Madinah Road, Across Aramex, Jeddah 23525 Additional No. 3889, Saudi Arabiawww.cheil.com2012지점단독<NA>서비스업출판·영상·방송통신 및 정보광고솔루션120100% 주재국 내수 공급제일기획㈜리야드무역관
10302북미미국뉴욕(New York)교보생명자산운용미국현지법인Kyobo Life Asset Management(America) Co., Ltd.19 West 44th St., Suite 518, New York, NY 10036, USAwww.kyobo.co.kr1996서비스법인단독<NA>금융·보험업<NA>자산운용2<NA><NA>교보생명보험㈜뉴욕무역관
5822아시아중국서남-충칭(重)LG상사연락사무소(충칭)/LG商事代表(重)LG International Chongqing Representative Office中重市江北洋河一路68信中心C2308室www.lgicorp.com2012연락사무소단독<NA>도매 및 소매업<NA>자동차부품, 철강17<NA>LG상사㈜충칭무역관
5283아시아인도뉴델리(New Delhi)브릭스인디아BRICS India Trade Pvt., Ltd.21/3&4, 2nd Floor, Yusuf Sarai Main Market, New Delhi 110016, Indiawww.bricsindia.com2009판매법인단독<NA>도매 및 소매업<NA>온라인 판매<NA>2100% 주재국 내수시장 공급현지단독진출뉴델리무역관
1920아시아베트남하노이(Ha Noi)제이케이태크JK Tech VietnamFloor 12, Hh3 Building, My Dinh - Me Tri, My Dinh Ward 1, Nam Tu Liem District, Hanoiwww.jktech.co.kr2012서비스법인단독<NA>서비스업전문·과학 및 기술기계공학 관련 설계도<NA>21100% 주재국 내수시장 공급제이케이테크㈜하노이무역관
구분국가명진출지역회사명(국문)회사명(영문)주소홈페이지진출년도진출형태투자형태외국사 합작지분업종1업종2취급분야본사파견현지채용내수_수출모기업명관할무역관
456아시아베트남동나이(Dong Nai)코리아훅Koreahook Vn Co., Ltd.Doc 47 Industrial Group, Tam Phuoc Commune, Bien Hoa City, Dong Nai, Vietnam<NA>2013생산법인단독<NA>제조업기타 제품 제조업낚시바늘 제조110<NA><NA>호치민무역관
4460아시아필리핀카비테(Cavite)인성필리핀Insung Elec. Phils. Inc.Cezia Rd., Phase 3, PEZA, Rosario, Cavite, Philippines<NA>1996생산법인단독<NA>제조업전기·전자·정밀기기·부품디지털 부품 조립110수출 및 내수 병행<NA>마닐라무역관
4063아시아태국방콕(Bangkok)하이빙글로벌Hiliving Global (Thailand)525/18 Soi Soonvijai 4 Rama Ix Rd. Bang Kapi, Huai Khwang, Bangkok 10310<NA>2006<NA>합작<NA>도매 및 소매업<NA>화장품 도매<NA><NA><NA><NA>방콕무역관
5488아시아인도푸네(Pune)금강공업인도법인Kumkang Kind India Pvt., Ltd.#5, 1st Floor, Destination Center, Magarpatta City, Hadapsar, Pune, 411013, Indiawww.kumkangkind.com2009생산법인단독<NA>제조업기계·장비건설용 거푸집 제작243100% 주재국 내수시장 공급금강공업㈜뭄바이무역관
10479북미미국뉴저지(New Jersey)한국전력공사북미지사Korea Electric Power Corp.(KEPCO), North America Office400 Kelby St., Parker Plaza 7th Fl., Fort Lee, NJ 07024, USAwww.kepco.co.kr1977연락사무소단독<NA>기타<NA>발전, 송배전 및 판매업31100% 주재국 내수시장 공급한국전력공사뉴욕무역관
4136아시아태국방콕(Bangkok)현대엔지니어링Hyundai Engineering (Thailand) Co., Ltd.252/91 Muang Thai-Phatra Complex Tower 2, 16Th Fl., Ratchadaphisek Road Huaykwang Bangkok 10311www.hec.co.kr2008지점단독<NA>건설·공사업<NA>건설277수출 및 내수 병행현대엔지니어링㈜방콕무역관
1639아시아베트남하노이(Ha Noi)와프인터내셔날Waf International Co., Ltd.No96, Hoang Van Thai, Khoang Mai, Hanoi<NA>2010판매법인단독<NA>도매 및 소매업<NA>바이오디젤 원료 수출, 건설자재 수입16수출 및 내수 병행<NA>하노이무역관
2995아시아베트남호치민(Ho Chi Minh)현대해상화재보험Hyundai Marine & Fire Insurance#710, 7th Fl., Sun Wah Tower, 115 Nguyen Hue St., Dist. 1, HCMC, Vietnamwww.hi.co.kr1997연락사무소단독<NA>금융·보험업<NA>손해 보험12<NA>현대해상화재보험㈜호치민무역관
10089CIS우즈베키스탄사마르칸트(Samarkand)영원무역Youngone Samarkand LLC.140119, Str. U.U.Dzhurakulova 166- A, Samarkand, Uzbekistan<NA>2014생산법인단독<NA>제조업섬유·피혁봉제<NA><NA><NA>영원무역㈜타슈켄트무역관
3087아시아베트남흥옌(Hung Yen)세원이씨에스비나Sewon ECS Vina Co., Ltd.Thon Moc Ty, xa Trung Trac, Huyen Van Lam, Hung Yen, Vietnamwww.yura.co.kr2007생산법인단독<NA>제조업자동차·자동차부품자동차부품51000100% 제3국 수출유라코퍼레이션㈜하노이무역관