Overview

Dataset statistics

Number of variables12
Number of observations10000
Missing cells4287
Missing cells (%)3.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1015.6 KiB
Average record size in memory104.0 B

Variable types

Categorical3
Text9

Dataset

Description해외진출기업 데이터는 성공적으로 진출한 기업의 성공사례 정보를 제공함으로써 해외진출을 처음 시도하는 기업에 도움이 되고자한다.
URLhttps://www.data.go.kr/data/15034787/fileData.do

Alerts

모기업명 has 4286 (42.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 15:46:51.332146
Analysis finished2023-12-12 15:46:54.578168
Duration3.25 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Categorical

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
동남아대양주
5053 
중국 (홍콩, 대만 포함)
2025 
유럽
717 
북미
633 
서남아
 
389
Other values (5)
1183 

Length

Max length14
Median length6
Mean length6.5522
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동남아대양주
2nd row서남아
3rd rowCIS
4th row동남아대양주
5th row중국 (홍콩, 대만 포함)

Common Values

ValueCountFrequency (%)
동남아대양주 5053
50.5%
중국 (홍콩, 대만 포함) 2025
20.2%
유럽 717
 
7.2%
북미 633
 
6.3%
서남아 389
 
3.9%
일본 381
 
3.8%
CIS 283
 
2.8%
중남미 238
 
2.4%
중동 231
 
2.3%
아프리카 50
 
0.5%

Length

2023-12-13T00:46:54.670373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:46:54.845393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동남아대양주 5053
31.4%
중국 2025
12.6%
홍콩 2025
12.6%
대만 2025
12.6%
포함 2025
12.6%
유럽 717
 
4.5%
북미 633
 
3.9%
서남아 389
 
2.4%
일본 381
 
2.4%
cis 283
 
1.8%
Other values (3) 519
 
3.2%
Distinct86
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:46:55.437111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length2.956
Min length2

Characters and Unicode

Total characters29560
Distinct characters123
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row베트남
2nd row방글라데시
3rd row러시아
4th row캄보디아
5th row중국
ValueCountFrequency (%)
베트남 3155
31.6%
중국 1848
18.5%
인도네시아 885
 
8.8%
미국 603
 
6.0%
일본 381
 
3.8%
태국 318
 
3.2%
인도 244
 
2.4%
말레이시아 178
 
1.8%
필리핀 155
 
1.6%
폴란드 147
 
1.5%
Other values (76) 2086
20.9%
2023-12-13T00:46:55.873513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3267
 
11.1%
3217
 
10.9%
3169
 
10.7%
2866
 
9.7%
1848
 
6.3%
1540
 
5.2%
1384
 
4.7%
1155
 
3.9%
1140
 
3.9%
929
 
3.1%
Other values (113) 9045
30.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29560
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3267
 
11.1%
3217
 
10.9%
3169
 
10.7%
2866
 
9.7%
1848
 
6.3%
1540
 
5.2%
1384
 
4.7%
1155
 
3.9%
1140
 
3.9%
929
 
3.1%
Other values (113) 9045
30.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29560
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3267
 
11.1%
3217
 
10.9%
3169
 
10.7%
2866
 
9.7%
1848
 
6.3%
1540
 
5.2%
1384
 
4.7%
1155
 
3.9%
1140
 
3.9%
929
 
3.1%
Other values (113) 9045
30.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29560
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3267
 
11.1%
3217
 
10.9%
3169
 
10.7%
2866
 
9.7%
1848
 
6.3%
1540
 
5.2%
1384
 
4.7%
1155
 
3.9%
1140
 
3.9%
929
 
3.1%
Other values (113) 9045
30.6%
Distinct124
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:46:56.213853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.2463
Min length1

Characters and Unicode

Total characters32463
Distinct characters167
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row하노이
2nd row다카
3rd row모스크바
4th row프놈펜
5th row다롄
ValueCountFrequency (%)
호치민 1934
19.3%
하노이 1177
 
11.8%
자카르타 881
 
8.8%
상하이 562
 
5.6%
방콕 318
 
3.2%
칭다오 308
 
3.1%
도쿄 301
 
3.0%
베이징 298
 
3.0%
쿠알라룸푸르 178
 
1.8%
톈진 170
 
1.7%
Other values (114) 3873
38.7%
2023-12-13T00:46:56.727926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2581
 
8.0%
1956
 
6.0%
1938
 
6.0%
1934
 
6.0%
1806
 
5.6%
1600
 
4.9%
1201
 
3.7%
1172
 
3.6%
1082
 
3.3%
886
 
2.7%
Other values (157) 16307
50.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32463
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2581
 
8.0%
1956
 
6.0%
1938
 
6.0%
1934
 
6.0%
1806
 
5.6%
1600
 
4.9%
1201
 
3.7%
1172
 
3.6%
1082
 
3.3%
886
 
2.7%
Other values (157) 16307
50.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32463
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2581
 
8.0%
1956
 
6.0%
1938
 
6.0%
1934
 
6.0%
1806
 
5.6%
1600
 
4.9%
1201
 
3.7%
1172
 
3.6%
1082
 
3.3%
886
 
2.7%
Other values (157) 16307
50.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32463
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2581
 
8.0%
1956
 
6.0%
1938
 
6.0%
1934
 
6.0%
1806
 
5.6%
1600
 
4.9%
1201
 
3.7%
1172
 
3.6%
1082
 
3.3%
886
 
2.7%
Other values (157) 16307
50.2%
Distinct9586
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:46:57.128822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length43
Mean length8.8481
Min length1

Characters and Unicode

Total characters88481
Distinct characters945
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9359 ?
Unique (%)93.6%

Sample

1st row시노펙스 베트남(주)
2nd rowLG전자
3rd rowHS애드 모스크바법인 ((구)LG애드 모스크바법인)
4th row송가네
5th row대련창조기계유한공사
ValueCountFrequency (%)
비나 400
 
2.4%
베트남 385
 
2.3%
법인 232
 
1.4%
주식회사 166
 
1.0%
인도네시아 118
 
0.7%
폴란드 85
 
0.5%
글로벌 66
 
0.4%
아메리카 61
 
0.4%
지사 53
 
0.3%
52
 
0.3%
Other values (9991) 14783
90.1%
2023-12-13T00:46:57.727018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6402
 
7.2%
2668
 
3.0%
2659
 
3.0%
2567
 
2.9%
2406
 
2.7%
1916
 
2.2%
1683
 
1.9%
1649
 
1.9%
1452
 
1.6%
1360
 
1.5%
Other values (935) 63719
72.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 74661
84.4%
Space Separator 6402
 
7.2%
Uppercase Letter 3656
 
4.1%
Open Punctuation 1111
 
1.3%
Close Punctuation 1109
 
1.3%
Lowercase Letter 1000
 
1.1%
Other Punctuation 260
 
0.3%
Other Symbol 120
 
0.1%
Decimal Number 74
 
0.1%
Dash Punctuation 54
 
0.1%
Other values (2) 34
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2668
 
3.6%
2659
 
3.6%
2567
 
3.4%
2406
 
3.2%
1916
 
2.6%
1683
 
2.3%
1649
 
2.2%
1452
 
1.9%
1360
 
1.8%
1252
 
1.7%
Other values (852) 55049
73.7%
Uppercase Letter
ValueCountFrequency (%)
S 379
 
10.4%
C 310
 
8.5%
L 265
 
7.2%
A 258
 
7.1%
T 227
 
6.2%
I 215
 
5.9%
K 206
 
5.6%
N 198
 
5.4%
G 193
 
5.3%
E 192
 
5.3%
Other values (24) 1213
33.2%
Lowercase Letter
ValueCountFrequency (%)
a 107
10.7%
n 102
10.2%
i 101
10.1%
o 87
 
8.7%
e 80
 
8.0%
t 78
 
7.8%
r 65
 
6.5%
s 51
 
5.1%
l 50
 
5.0%
d 44
 
4.4%
Other values (14) 235
23.5%
Decimal Number
ValueCountFrequency (%)
1 19
25.7%
2 16
21.6%
4 15
20.3%
3 9
12.2%
5 3
 
4.1%
0 3
 
4.1%
6 3
 
4.1%
9 3
 
4.1%
7 2
 
2.7%
8 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 107
41.2%
& 88
33.8%
/ 35
 
13.5%
, 26
 
10.0%
: 2
 
0.8%
· 2
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 1110
99.9%
1
 
0.1%
Other Symbol
ValueCountFrequency (%)
119
99.2%
1
 
0.8%
Space Separator
ValueCountFrequency (%)
6402
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1109
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 54
100.0%
Control
ValueCountFrequency (%)
32
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 74768
84.5%
Common 9045
 
10.2%
Latin 4647
 
5.3%
Han 12
 
< 0.1%
Cyrillic 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2668
 
3.6%
2659
 
3.6%
2567
 
3.4%
2406
 
3.2%
1916
 
2.6%
1683
 
2.3%
1649
 
2.2%
1452
 
1.9%
1360
 
1.8%
1252
 
1.7%
Other values (841) 55156
73.8%
Latin
ValueCountFrequency (%)
S 379
 
8.2%
C 310
 
6.7%
L 265
 
5.7%
A 258
 
5.6%
T 227
 
4.9%
I 215
 
4.6%
K 206
 
4.4%
N 198
 
4.3%
G 193
 
4.2%
E 192
 
4.1%
Other values (43) 2204
47.4%
Common
ValueCountFrequency (%)
6402
70.8%
( 1110
 
12.3%
) 1109
 
12.3%
. 107
 
1.2%
& 88
 
1.0%
- 54
 
0.6%
/ 35
 
0.4%
32
 
0.4%
, 26
 
0.3%
1 19
 
0.2%
Other values (14) 63
 
0.7%
Han
ValueCountFrequency (%)
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Other values (2) 2
16.7%
Cyrillic
ValueCountFrequency (%)
О 3
33.3%
Н 2
22.2%
С 2
22.2%
Ф 1
 
11.1%
Г 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 74649
84.4%
ASCII 13685
 
15.5%
None 125
 
0.1%
CJK 12
 
< 0.1%
Cyrillic 9
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6402
46.8%
( 1110
 
8.1%
) 1109
 
8.1%
S 379
 
2.8%
C 310
 
2.3%
L 265
 
1.9%
A 258
 
1.9%
T 227
 
1.7%
I 215
 
1.6%
K 206
 
1.5%
Other values (61) 3204
23.4%
Hangul
ValueCountFrequency (%)
2668
 
3.6%
2659
 
3.6%
2567
 
3.4%
2406
 
3.2%
1916
 
2.6%
1683
 
2.3%
1649
 
2.2%
1452
 
1.9%
1360
 
1.8%
1252
 
1.7%
Other values (840) 55037
73.7%
None
ValueCountFrequency (%)
119
95.2%
· 2
 
1.6%
1
 
0.8%
1
 
0.8%
1
 
0.8%
1
 
0.8%
Cyrillic
ValueCountFrequency (%)
О 3
33.3%
Н 2
22.2%
С 2
22.2%
Ф 1
 
11.1%
Г 1
 
11.1%
CJK
ValueCountFrequency (%)
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
Other values (2) 2
16.7%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct9669
Distinct (%)96.7%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T00:46:58.221879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length90
Median length65
Mean length25.259826
Min length1

Characters and Unicode

Total characters252573
Distinct characters115
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9445 ?
Unique (%)94.5%

Sample

1st rowSYNOPEX VIETNAM./JSC.
2nd rowLG
3rd rowGIIR RUS LLC
4th rowSONG''S FAMILY
5th rowDALIAN CHUANGZAO MACHINERY CO.
ValueCountFrequency (%)
ltd 4219
 
10.6%
co 3760
 
9.4%
vina 979
 
2.5%
vietnam 570
 
1.4%
inc 446
 
1.1%
400
 
1.0%
korea 317
 
0.8%
office 315
 
0.8%
indonesia 295
 
0.7%
international 285
 
0.7%
Other values (8082) 28206
70.9%
2023-12-13T00:46:58.920318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29787
 
11.8%
N 19955
 
7.9%
A 19165
 
7.6%
I 17818
 
7.1%
O 17337
 
6.9%
E 15360
 
6.1%
T 15150
 
6.0%
C 12766
 
5.1%
L 11122
 
4.4%
S 10521
 
4.2%
Other values (105) 83592
33.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 205100
81.2%
Space Separator 29793
 
11.8%
Other Punctuation 15113
 
6.0%
Open Punctuation 862
 
0.3%
Close Punctuation 858
 
0.3%
Dash Punctuation 425
 
0.2%
Decimal Number 185
 
0.1%
Lowercase Letter 185
 
0.1%
Other Letter 33
 
< 0.1%
Final Punctuation 8
 
< 0.1%
Other values (4) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
6.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
Other values (21) 21
63.6%
Uppercase Letter
ValueCountFrequency (%)
N 19955
 
9.7%
A 19165
 
9.3%
I 17818
 
8.7%
O 17337
 
8.5%
E 15360
 
7.5%
T 15150
 
7.4%
C 12766
 
6.2%
L 11122
 
5.4%
S 10521
 
5.1%
D 9598
 
4.7%
Other values (17) 56308
27.5%
Lowercase Letter
ValueCountFrequency (%)
a 25
13.5%
i 22
11.9%
o 19
10.3%
n 18
9.7%
h 17
9.2%
e 16
8.6%
t 15
8.1%
g 13
7.0%
r 9
 
4.9%
d 7
 
3.8%
Other values (11) 24
13.0%
Other Punctuation
ValueCountFrequency (%)
. 10416
68.9%
, 3887
 
25.7%
& 618
 
4.1%
/ 87
 
0.6%
' 75
 
0.5%
" 19
 
0.1%
: 4
 
< 0.1%
3
 
< 0.1%
· 2
 
< 0.1%
# 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 55
29.7%
1 50
27.0%
4 23
12.4%
3 18
 
9.7%
9 10
 
5.4%
0 9
 
4.9%
7 6
 
3.2%
5 6
 
3.2%
8 5
 
2.7%
6 3
 
1.6%
Open Punctuation
ValueCountFrequency (%)
( 857
99.4%
[ 4
 
0.5%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
29787
> 99.9%
  6
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 854
99.5%
] 4
 
0.5%
Control
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
+ 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 425
100.0%
Final Punctuation
ValueCountFrequency (%)
8
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 205285
81.3%
Common 47255
 
18.7%
Hangul 23
 
< 0.1%
Han 10
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 19955
 
9.7%
A 19165
 
9.3%
I 17818
 
8.7%
O 17337
 
8.4%
E 15360
 
7.5%
T 15150
 
7.4%
C 12766
 
6.2%
L 11122
 
5.4%
S 10521
 
5.1%
D 9598
 
4.7%
Other values (38) 56493
27.5%
Common
ValueCountFrequency (%)
29787
63.0%
. 10416
 
22.0%
, 3887
 
8.2%
( 857
 
1.8%
) 854
 
1.8%
& 618
 
1.3%
- 425
 
0.9%
/ 87
 
0.2%
' 75
 
0.2%
2 55
 
0.1%
Other values (26) 194
 
0.4%
Hangul
ValueCountFrequency (%)
2
 
8.7%
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (11) 11
47.8%
Han
ValueCountFrequency (%)
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 252518
> 99.9%
Hangul 22
 
< 0.1%
None 13
 
< 0.1%
CJK 10
 
< 0.1%
Punctuation 8
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
29787
 
11.8%
N 19955
 
7.9%
A 19165
 
7.6%
I 17818
 
7.1%
O 17337
 
6.9%
E 15360
 
6.1%
T 15150
 
6.0%
C 12766
 
5.1%
L 11122
 
4.4%
S 10521
 
4.2%
Other values (67) 83537
33.1%
Punctuation
ValueCountFrequency (%)
8
100.0%
None
ValueCountFrequency (%)
  6
46.2%
3
23.1%
· 2
 
15.4%
Ł 1
 
7.7%
1
 
7.7%
Hangul
ValueCountFrequency (%)
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (10) 10
45.5%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

주소
Text

Distinct9752
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:46:59.481009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length201
Median length138
Mean length64.6889
Min length1

Characters and Unicode

Total characters646889
Distinct characters755
Distinct categories17 ?
Distinct scripts6 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9581 ?
Unique (%)95.8%

Sample

1st rowDONG THO MULTI - COMPLEX I.Z, DONG THO, YEN PHONG, BAC NINH
2nd rowSYMPHONY (6TH FLOOR), PLOT-SE(F)-9, ROAD-142SOUTH AVENUE, GULSHAN-1, DHAKA-1212
3rd row6 FLOOR, 4TH LESNOY PER., MOSCOW, RUSSIA
4th rowNO 49A (VIMEAN PHNOM PENH. ST 209, SANGKAT CHRAING CHAM RESS 1, KHAN REUSSEY KEO, PHNOM PENH CAMBODIA)
5th rowROOM 408, VIENNA BUILDING, NO.31, LIAOHE WEST ROAD, DALIAN CITY, LIAONING PROVINCE, CHINA
ValueCountFrequency (%)
vietnam 1664
 
1.5%
road 1620
 
1.5%
district 1227
 
1.1%
dist 1196
 
1.1%
ward 1149
 
1.1%
city 1024
 
1.0%
china 933
 
0.9%
no 922
 
0.9%
1 904
 
0.8%
indonesia 877
 
0.8%
Other values (17631) 96115
89.3%
2023-12-13T00:47:00.313797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
98441
15.2%
A 52492
 
8.1%
N 47618
 
7.4%
I 37495
 
5.8%
, 35084
 
5.4%
O 30709
 
4.7%
T 29910
 
4.6%
E 26910
 
4.2%
H 24203
 
3.7%
R 22773
 
3.5%
Other values (745) 241254
37.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 446807
69.1%
Space Separator 98445
 
15.2%
Decimal Number 46882
 
7.2%
Other Punctuation 44890
 
6.9%
Other Letter 5638
 
0.9%
Dash Punctuation 3343
 
0.5%
Open Punctuation 369
 
0.1%
Close Punctuation 368
 
0.1%
Math Symbol 50
 
< 0.1%
Connector Punctuation 47
 
< 0.1%
Other values (7) 50
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
405
 
7.2%
350
 
6.2%
176
 
3.1%
163
 
2.9%
152
 
2.7%
134
 
2.4%
113
 
2.0%
108
 
1.9%
107
 
1.9%
101
 
1.8%
Other values (625) 3829
67.9%
Uppercase Letter
ValueCountFrequency (%)
A 52492
 
11.7%
N 47618
 
10.7%
I 37495
 
8.4%
O 30709
 
6.9%
T 29910
 
6.7%
E 26910
 
6.0%
H 24203
 
5.4%
R 22773
 
5.1%
S 17923
 
4.0%
D 17920
 
4.0%
Other values (48) 138854
31.1%
Decimal Number
ValueCountFrequency (%)
1 10578
22.6%
0 7036
15.0%
2 6742
14.4%
3 4581
9.8%
5 3887
 
8.3%
4 3329
 
7.1%
6 2895
 
6.2%
7 2809
 
6.0%
8 2731
 
5.8%
9 2280
 
4.9%
Other values (10) 14
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
, 35084
78.2%
. 8750
 
19.5%
# 455
 
1.0%
; 166
 
0.4%
' 143
 
0.3%
& 136
 
0.3%
: 114
 
0.3%
11
 
< 0.1%
10
 
< 0.1%
" 8
 
< 0.1%
Other values (3) 13
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 26
52.0%
| 8
 
16.0%
< 7
 
14.0%
> 6
 
12.0%
2
 
4.0%
+ 1
 
2.0%
Open Punctuation
ValueCountFrequency (%)
( 353
95.7%
15
 
4.1%
1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 352
95.7%
15
 
4.1%
1
 
0.3%
Control
ValueCountFrequency (%)
7
77.8%
 1
 
11.1%
1
 
11.1%
Space Separator
ValueCountFrequency (%)
98441
> 99.9%
  4
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 3342
> 99.9%
1
 
< 0.1%
Final Punctuation
ValueCountFrequency (%)
13
86.7%
2
 
13.3%
Other Symbol
ValueCountFrequency (%)
° 9
90.0%
1
 
10.0%
Initial Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Connector Punctuation
ValueCountFrequency (%)
_ 47
100.0%
Lowercase Letter
ValueCountFrequency (%)
ß 7
100.0%
Letter Number
ValueCountFrequency (%)
4
100.0%
Other Number
ValueCountFrequency (%)
½ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 446143
69.0%
Common 194433
30.1%
Han 5165
 
0.8%
Cyrillic 678
 
0.1%
Hangul 444
 
0.1%
Katakana 26
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
405
 
7.8%
350
 
6.8%
176
 
3.4%
163
 
3.2%
152
 
2.9%
134
 
2.6%
113
 
2.2%
108
 
2.1%
107
 
2.1%
101
 
2.0%
Other values (448) 3356
65.0%
Hangul
ValueCountFrequency (%)
23
 
5.2%
18
 
4.1%
12
 
2.7%
9
 
2.0%
9
 
2.0%
9
 
2.0%
8
 
1.8%
8
 
1.8%
8
 
1.8%
8
 
1.8%
Other values (152) 332
74.8%
Common
ValueCountFrequency (%)
98441
50.6%
, 35084
 
18.0%
1 10578
 
5.4%
. 8750
 
4.5%
0 7036
 
3.6%
2 6742
 
3.5%
3 4581
 
2.4%
5 3887
 
2.0%
- 3342
 
1.7%
4 3329
 
1.7%
Other values (50) 12663
 
6.5%
Cyrillic
ValueCountFrequency (%)
О 79
 
11.7%
С 60
 
8.8%
А 57
 
8.4%
Р 46
 
6.8%
К 45
 
6.6%
Е 39
 
5.8%
Н 36
 
5.3%
И 35
 
5.2%
Д 30
 
4.4%
Л 29
 
4.3%
Other values (21) 222
32.7%
Latin
ValueCountFrequency (%)
A 52492
 
11.8%
N 47618
 
10.7%
I 37495
 
8.4%
O 30709
 
6.9%
T 29910
 
6.7%
E 26910
 
6.0%
H 24203
 
5.4%
R 22773
 
5.1%
S 17923
 
4.0%
D 17920
 
4.0%
Other values (20) 138190
31.0%
Katakana
ValueCountFrequency (%)
4
15.4%
4
15.4%
4
15.4%
2
7.7%
2
7.7%
2
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (4) 4
15.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 640430
99.0%
CJK 5165
 
0.8%
Cyrillic 678
 
0.1%
Hangul 444
 
0.1%
None 123
 
< 0.1%
Katakana 26
 
< 0.1%
Punctuation 18
 
< 0.1%
Number Forms 4
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
98441
15.4%
A 52492
 
8.2%
N 47618
 
7.4%
I 37495
 
5.9%
, 35084
 
5.5%
O 30709
 
4.8%
T 29910
 
4.7%
E 26910
 
4.2%
H 24203
 
3.8%
R 22773
 
3.6%
Other values (47) 234795
36.7%
CJK
ValueCountFrequency (%)
405
 
7.8%
350
 
6.8%
176
 
3.4%
163
 
3.2%
152
 
2.9%
134
 
2.6%
113
 
2.2%
108
 
2.1%
107
 
2.1%
101
 
2.0%
Other values (448) 3356
65.0%
Cyrillic
ValueCountFrequency (%)
О 79
 
11.7%
С 60
 
8.8%
А 57
 
8.4%
Р 46
 
6.8%
К 45
 
6.6%
Е 39
 
5.8%
Н 36
 
5.3%
И 35
 
5.2%
Д 30
 
4.4%
Л 29
 
4.3%
Other values (21) 222
32.7%
Hangul
ValueCountFrequency (%)
23
 
5.2%
18
 
4.1%
12
 
2.7%
9
 
2.0%
9
 
2.0%
9
 
2.0%
8
 
1.8%
8
 
1.8%
8
 
1.8%
8
 
1.8%
Other values (152) 332
74.8%
None
ValueCountFrequency (%)
15
12.2%
15
12.2%
Ł 15
12.2%
11
8.9%
10
 
8.1%
° 9
 
7.3%
¿ 7
 
5.7%
ß 7
 
5.7%
· 5
 
4.1%
  4
 
3.3%
Other values (17) 25
20.3%
Punctuation
ValueCountFrequency (%)
13
72.2%
2
 
11.1%
2
 
11.1%
1
 
5.6%
Katakana
ValueCountFrequency (%)
4
15.4%
4
15.4%
4
15.4%
2
7.7%
2
7.7%
2
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (4) 4
15.4%
Number Forms
ValueCountFrequency (%)
4
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct2978
Distinct (%)29.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:47:00.851255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length21
Mean length5.4081
Min length1

Characters and Unicode

Total characters54081
Distinct characters64
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2025 ?
Unique (%)20.2%

Sample

1st row16000
2nd row1212
3rd row125047
4th row.
5th row116000
ValueCountFrequency (%)
700000 1870
 
18.3%
10000 425
 
4.2%
292
 
2.9%
16000 153
 
1.5%
201103 134
 
1.3%
100000 90
 
0.9%
15710 81
 
0.8%
10110 71
 
0.7%
200000 71
 
0.7%
116000 69
 
0.7%
Other values (3066) 6935
68.1%
2023-12-13T00:47:01.522114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 24159
44.7%
1 8298
 
15.3%
2 4107
 
7.6%
7 3484
 
6.4%
3 2691
 
5.0%
5 2615
 
4.8%
6 2481
 
4.6%
4 2296
 
4.2%
8 1321
 
2.4%
9 1223
 
2.3%
Other values (54) 1406
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 52675
97.4%
Uppercase Letter 534
 
1.0%
Other Punctuation 307
 
0.6%
Dash Punctuation 306
 
0.6%
Space Separator 194
 
0.4%
Lowercase Letter 64
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 39
 
7.3%
C 32
 
6.0%
N 31
 
5.8%
T 31
 
5.8%
B 29
 
5.4%
A 29
 
5.4%
M 28
 
5.2%
X 27
 
5.1%
P 26
 
4.9%
L 24
 
4.5%
Other values (16) 238
44.6%
Lowercase Letter
ValueCountFrequency (%)
o 6
 
9.4%
b 6
 
9.4%
i 5
 
7.8%
a 5
 
7.8%
e 5
 
7.8%
y 4
 
6.2%
n 4
 
6.2%
s 4
 
6.2%
h 3
 
4.7%
l 3
 
4.7%
Other values (12) 19
29.7%
Decimal Number
ValueCountFrequency (%)
0 24159
45.9%
1 8298
 
15.8%
2 4107
 
7.8%
7 3484
 
6.6%
3 2691
 
5.1%
5 2615
 
5.0%
6 2481
 
4.7%
4 2296
 
4.4%
8 1321
 
2.5%
9 1223
 
2.3%
Other Punctuation
ValueCountFrequency (%)
. 303
98.7%
/ 3
 
1.0%
: 1
 
0.3%
Dash Punctuation
ValueCountFrequency (%)
- 306
100.0%
Space Separator
ValueCountFrequency (%)
194
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 53483
98.9%
Latin 598
 
1.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 39
 
6.5%
C 32
 
5.4%
N 31
 
5.2%
T 31
 
5.2%
B 29
 
4.8%
A 29
 
4.8%
M 28
 
4.7%
X 27
 
4.5%
P 26
 
4.3%
L 24
 
4.0%
Other values (38) 302
50.5%
Common
ValueCountFrequency (%)
0 24159
45.2%
1 8298
 
15.5%
2 4107
 
7.7%
7 3484
 
6.5%
3 2691
 
5.0%
5 2615
 
4.9%
6 2481
 
4.6%
4 2296
 
4.3%
8 1321
 
2.5%
9 1223
 
2.3%
Other values (6) 808
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 54081
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 24159
44.7%
1 8298
 
15.3%
2 4107
 
7.6%
7 3484
 
6.4%
3 2691
 
5.0%
5 2615
 
4.8%
6 2481
 
4.6%
4 2296
 
4.2%
8 1321
 
2.4%
9 1223
 
2.3%
Other values (54) 1406
 
2.6%
Distinct271
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:47:01.734797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length4
Mean length5.0924
Min length1

Characters and Unicode

Total characters50924
Distinct characters240
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique196 ?
Unique (%)2.0%

Sample

1st row생산법인
2nd row해외지사
3rd row서비스법인
4th row생산법인
5th row생산법인
ValueCountFrequency (%)
생산법인 3955
35.2%
서비스법인 2674
23.8%
판매법인 1867
16.6%
해외지사 1372
 
12.2%
연락사무소 486
 
4.3%
기타 183
 
1.6%
137
 
1.2%
기타(법인 62
 
0.6%
기타(현지법인 29
 
0.3%
기타(건설 13
 
0.1%
Other values (279) 449
 
4.0%
2023-12-13T00:47:02.095109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8657
17.0%
8653
17.0%
3962
 
7.8%
3957
 
7.8%
2698
 
5.3%
2694
 
5.3%
2683
 
5.3%
1921
 
3.8%
1869
 
3.7%
1869
 
3.7%
Other values (230) 11961
23.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47303
92.9%
Space Separator 1247
 
2.4%
Other Punctuation 1247
 
2.4%
Open Punctuation 404
 
0.8%
Close Punctuation 404
 
0.8%
Lowercase Letter 235
 
0.5%
Uppercase Letter 82
 
0.2%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8657
18.3%
8653
18.3%
3962
8.4%
3957
8.4%
2698
 
5.7%
2694
 
5.7%
2683
 
5.7%
1921
 
4.1%
1869
 
4.0%
1869
 
4.0%
Other values (182) 8340
17.6%
Lowercase Letter
ValueCountFrequency (%)
e 33
14.0%
r 22
9.4%
c 20
 
8.5%
a 20
 
8.5%
i 19
 
8.1%
t 18
 
7.7%
n 18
 
7.7%
o 14
 
6.0%
p 11
 
4.7%
f 9
 
3.8%
Other values (12) 51
21.7%
Uppercase Letter
ValueCountFrequency (%)
I 9
11.0%
T 8
 
9.8%
R 8
 
9.8%
S 7
 
8.5%
C 6
 
7.3%
A 6
 
7.3%
M 5
 
6.1%
B 5
 
6.1%
E 4
 
4.9%
N 4
 
4.9%
Other values (8) 20
24.4%
Other Punctuation
ValueCountFrequency (%)
, 1071
85.9%
. 148
 
11.9%
/ 23
 
1.8%
& 5
 
0.4%
Space Separator
ValueCountFrequency (%)
1247
100.0%
Open Punctuation
ValueCountFrequency (%)
( 404
100.0%
Close Punctuation
ValueCountFrequency (%)
) 404
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47303
92.9%
Common 3304
 
6.5%
Latin 317
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8657
18.3%
8653
18.3%
3962
8.4%
3957
8.4%
2698
 
5.7%
2694
 
5.7%
2683
 
5.7%
1921
 
4.1%
1869
 
4.0%
1869
 
4.0%
Other values (182) 8340
17.6%
Latin
ValueCountFrequency (%)
e 33
 
10.4%
r 22
 
6.9%
c 20
 
6.3%
a 20
 
6.3%
i 19
 
6.0%
t 18
 
5.7%
n 18
 
5.7%
o 14
 
4.4%
p 11
 
3.5%
I 9
 
2.8%
Other values (30) 133
42.0%
Common
ValueCountFrequency (%)
1247
37.7%
, 1071
32.4%
( 404
 
12.2%
) 404
 
12.2%
. 148
 
4.5%
/ 23
 
0.7%
& 5
 
0.2%
- 2
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47303
92.9%
ASCII 3621
 
7.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8657
18.3%
8653
18.3%
3962
8.4%
3957
8.4%
2698
 
5.7%
2694
 
5.7%
2683
 
5.7%
1921
 
4.1%
1869
 
4.0%
1869
 
4.0%
Other values (182) 8340
17.6%
ASCII
ValueCountFrequency (%)
1247
34.4%
, 1071
29.6%
( 404
 
11.2%
) 404
 
11.2%
. 148
 
4.1%
e 33
 
0.9%
/ 23
 
0.6%
r 22
 
0.6%
c 20
 
0.6%
a 20
 
0.6%
Other values (38) 229
 
6.3%
Distinct63
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T00:47:02.252763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length4
Mean length3.29
Min length1

Characters and Unicode

Total characters32900
Distinct characters24
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)0.4%

Sample

1st row.
2nd row단독투자
3rd row.
4th row단독투자
5th row단독투자
ValueCountFrequency (%)
단독투자 6419
64.2%
2653
26.5%
합작투자 500
 
5.0%
합자투자 202
 
2.0%
m&a 42
 
0.4%
합작투자(50 32
 
0.3%
합작투자(60 15
 
0.1%
합작투자(51 12
 
0.1%
합작투자(40 12
 
0.1%
합작투자(49 12
 
0.1%
Other values (51) 101
 
1.0%
2023-12-13T00:47:02.610432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7488
22.8%
7286
22.1%
6419
19.5%
6419
19.5%
. 2653
 
8.1%
867
 
2.6%
665
 
2.0%
( 184
 
0.6%
) 184
 
0.6%
% 184
 
0.6%
Other values (14) 551
 
1.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29144
88.6%
Other Punctuation 2898
 
8.8%
Decimal Number 363
 
1.1%
Open Punctuation 184
 
0.6%
Close Punctuation 184
 
0.6%
Uppercase Letter 122
 
0.4%
Space Separator 5
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 96
26.4%
5 70
19.3%
9 51
14.0%
1 29
 
8.0%
6 28
 
7.7%
4 27
 
7.4%
7 19
 
5.2%
8 16
 
4.4%
2 15
 
4.1%
3 12
 
3.3%
Other Letter
ValueCountFrequency (%)
7488
25.7%
7286
25.0%
6419
22.0%
6419
22.0%
867
 
3.0%
665
 
2.3%
Other Punctuation
ValueCountFrequency (%)
. 2653
91.5%
% 184
 
6.3%
& 61
 
2.1%
Uppercase Letter
ValueCountFrequency (%)
A 61
50.0%
M 61
50.0%
Open Punctuation
ValueCountFrequency (%)
( 184
100.0%
Close Punctuation
ValueCountFrequency (%)
) 184
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29144
88.6%
Common 3634
 
11.0%
Latin 122
 
0.4%

Most frequent character per script

Common
ValueCountFrequency (%)
. 2653
73.0%
( 184
 
5.1%
) 184
 
5.1%
% 184
 
5.1%
0 96
 
2.6%
5 70
 
1.9%
& 61
 
1.7%
9 51
 
1.4%
1 29
 
0.8%
6 28
 
0.8%
Other values (6) 94
 
2.6%
Hangul
ValueCountFrequency (%)
7488
25.7%
7286
25.0%
6419
22.0%
6419
22.0%
867
 
3.0%
665
 
2.3%
Latin
ValueCountFrequency (%)
A 61
50.0%
M 61
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29144
88.6%
ASCII 3756
 
11.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7488
25.7%
7286
25.0%
6419
22.0%
6419
22.0%
867
 
3.0%
665
 
2.3%
ASCII
ValueCountFrequency (%)
. 2653
70.6%
( 184
 
4.9%
) 184
 
4.9%
% 184
 
4.9%
0 96
 
2.6%
5 70
 
1.9%
A 61
 
1.6%
& 61
 
1.6%
M 61
 
1.6%
9 51
 
1.4%
Other values (8) 151
 
4.0%

모기업명
Text

MISSING 

Distinct3983
Distinct (%)69.7%
Missing4286
Missing (%)42.9%
Memory size156.2 KiB
2023-12-13T00:47:02.934838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length255
Median length52
Mean length6.8865943
Min length1

Characters and Unicode

Total characters39350
Distinct characters665
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3411 ?
Unique (%)59.7%

Sample

1st row시노펙스
2nd rowLG전자
3rd rowHS애드
4th row소울기어
5th row장백산업기계
ValueCountFrequency (%)
주식회사 214
 
3.0%
162
 
2.3%
ltd 85
 
1.2%
co 74
 
1.0%
69
 
1.0%
신한은행 52
 
0.7%
삼성전자 50
 
0.7%
우리은행 38
 
0.5%
포스코 37
 
0.5%
lg전자 32
 
0.4%
Other values (4131) 6317
88.6%
2023-12-13T00:47:03.516880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1539
 
3.9%
1438
 
3.7%
) 1223
 
3.1%
( 1205
 
3.1%
1077
 
2.7%
911
 
2.3%
692
 
1.8%
524
 
1.3%
493
 
1.3%
C 477
 
1.2%
Other values (655) 29771
75.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27319
69.4%
Uppercase Letter 4522
 
11.5%
Lowercase Letter 2790
 
7.1%
Space Separator 1438
 
3.7%
Close Punctuation 1223
 
3.1%
Open Punctuation 1206
 
3.1%
Other Punctuation 585
 
1.5%
Other Symbol 141
 
0.4%
Decimal Number 74
 
0.2%
Control 30
 
0.1%
Other values (3) 22
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1539
 
5.6%
1077
 
3.9%
911
 
3.3%
692
 
2.5%
524
 
1.9%
493
 
1.8%
421
 
1.5%
415
 
1.5%
391
 
1.4%
388
 
1.4%
Other values (576) 20468
74.9%
Uppercase Letter
ValueCountFrequency (%)
C 477
 
10.5%
S 449
 
9.9%
L 409
 
9.0%
O 312
 
6.9%
T 280
 
6.2%
I 267
 
5.9%
G 262
 
5.8%
K 244
 
5.4%
E 233
 
5.2%
N 231
 
5.1%
Other values (16) 1358
30.0%
Lowercase Letter
ValueCountFrequency (%)
o 371
13.3%
n 299
10.7%
a 253
9.1%
e 226
 
8.1%
i 217
 
7.8%
r 205
 
7.3%
t 204
 
7.3%
l 137
 
4.9%
c 133
 
4.8%
s 130
 
4.7%
Other values (16) 615
22.0%
Decimal Number
ValueCountFrequency (%)
2 17
23.0%
1 13
17.6%
4 12
16.2%
3 10
13.5%
8 6
 
8.1%
0 5
 
6.8%
7 5
 
6.8%
6 4
 
5.4%
5 1
 
1.4%
9 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 349
59.7%
/ 130
 
22.2%
, 56
 
9.6%
& 43
 
7.4%
¿ 3
 
0.5%
' 2
 
0.3%
% 1
 
0.2%
: 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 1205
99.9%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
1438
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1223
100.0%
Other Symbol
ValueCountFrequency (%)
141
100.0%
Control
ValueCountFrequency (%)
30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Math Symbol
ValueCountFrequency (%)
| 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27447
69.8%
Latin 7312
 
18.6%
Common 4578
 
11.6%
Han 13
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1539
 
5.6%
1077
 
3.9%
911
 
3.3%
692
 
2.5%
524
 
1.9%
493
 
1.8%
421
 
1.5%
415
 
1.5%
391
 
1.4%
388
 
1.4%
Other values (565) 20596
75.0%
Latin
ValueCountFrequency (%)
C 477
 
6.5%
S 449
 
6.1%
L 409
 
5.6%
o 371
 
5.1%
O 312
 
4.3%
n 299
 
4.1%
T 280
 
3.8%
I 267
 
3.7%
G 262
 
3.6%
a 253
 
3.5%
Other values (42) 3933
53.8%
Common
ValueCountFrequency (%)
1438
31.4%
) 1223
26.7%
( 1205
26.3%
. 349
 
7.6%
/ 130
 
2.8%
, 56
 
1.2%
& 43
 
0.9%
30
 
0.7%
- 20
 
0.4%
2 17
 
0.4%
Other values (16) 67
 
1.5%
Han
ValueCountFrequency (%)
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
Other values (2) 2
15.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27306
69.4%
ASCII 11886
30.2%
None 145
 
0.4%
CJK 13
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1539
 
5.6%
1077
 
3.9%
911
 
3.3%
692
 
2.5%
524
 
1.9%
493
 
1.8%
421
 
1.5%
415
 
1.5%
391
 
1.4%
388
 
1.4%
Other values (564) 20455
74.9%
ASCII
ValueCountFrequency (%)
1438
 
12.1%
) 1223
 
10.3%
( 1205
 
10.1%
C 477
 
4.0%
S 449
 
3.8%
L 409
 
3.4%
o 371
 
3.1%
. 349
 
2.9%
O 312
 
2.6%
n 299
 
2.5%
Other values (66) 5354
45.0%
None
ValueCountFrequency (%)
141
97.2%
¿ 3
 
2.1%
1
 
0.7%
CJK
ValueCountFrequency (%)
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
Other values (2) 2
15.4%

업종 대분류
Categorical

Distinct23
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
C. 제조업
4658 
G. 도매 및 소매업
1445 
H. 운수 및 창고업
644 
F. 건설업
602 
M. 전문, 과학 및 기술 서비스업
554 
Other values (18)
2097 

Length

Max length36
Median length6
Mean length9.5251
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowC. 제조업
2nd rowG. 도매 및 소매업
3rd rowJ. 정보통신업
4th rowA. 농업, 임업 및 어업
5th rowC. 제조업

Common Values

ValueCountFrequency (%)
C. 제조업 4658
46.6%
G. 도매 및 소매업 1445
 
14.4%
H. 운수 및 창고업 644
 
6.4%
F. 건설업 602
 
6.0%
M. 전문, 과학 및 기술 서비스업 554
 
5.5%
K. 금융 및 보험업 386
 
3.9%
J. 정보통신업 304
 
3.0%
. 193
 
1.9%
N. 사업시설 관리, 사업 지원 및 임대 서비스업 181
 
1.8%
S. 협회 및 단체, 수리 및 기타 개인 서비스업 146
 
1.5%
Other values (13) 887
 
8.9%

Length

2023-12-13T00:47:03.711122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
c 4659
14.7%
제조업 4659
14.7%
4169
 
13.1%
g 1445
 
4.6%
도매 1445
 
4.6%
소매업 1445
 
4.6%
서비스업 1058
 
3.3%
h 644
 
2.0%
운수 644
 
2.0%
창고업 644
 
2.0%
Other values (76) 10915
34.4%

업종 중분류
Categorical

Distinct28
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
5340 
전자 부품, 컴퓨터, 영상, 음향 및 기통신장비 제조업
638 
기타 제품 제조업
627 
섬유제품 제조업 (의복 제외)
538 
자동차 및 트레일러 제조업
 
370
Other values (23)
2487 

Length

Max length30
Median length4
Mean length10.0098
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row의료, 정밀, 광학 기기 및 시계 제조업
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row자동차 및 트레일러 제조업

Common Values

ValueCountFrequency (%)
<NA> 5340
53.4%
전자 부품, 컴퓨터, 영상, 음향 및 기통신장비 제조업 638
 
6.4%
기타 제품 제조업 627
 
6.3%
섬유제품 제조업 (의복 제외) 538
 
5.4%
자동차 및 트레일러 제조업 370
 
3.7%
기타 기계 및 장비 제조업 311
 
3.1%
고무 및 플라스틱제품 제조업 293
 
2.9%
의복, 의복 액세서리 및 모피제품 제조업 274
 
2.7%
화학 물질 및 화학제품 제조업 (의약품 제외) 257
 
2.6%
금속 가공제품 제조업 (기계 및 가구 제외) 244
 
2.4%
Other values (18) 1108
 
11.1%

Length

2023-12-13T00:47:03.879135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 5340
19.1%
제조업 4511
16.2%
2920
 
10.5%
의복 1086
 
3.9%
제외 1073
 
3.8%
기타 990
 
3.5%
기통신장비 638
 
2.3%
전자 638
 
2.3%
음향 638
 
2.3%
컴퓨터 638
 
2.3%
Other values (52) 9436
33.8%

Correlations

2023-12-13T00:47:03.976804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역진출국가투자형태업종 대분류업종 중분류
지역1.0001.0000.4720.2930.437
진출국가1.0001.0000.8030.4900.641
투자형태0.4720.8031.0000.3240.389
업종 대분류0.2930.4900.3241.0000.219
업종 중분류0.4370.6410.3890.2191.000
2023-12-13T00:47:04.104457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종 중분류지역업종 대분류
업종 중분류1.0000.1730.103
지역0.1731.0000.112
업종 대분류0.1030.1121.000
2023-12-13T00:47:04.198807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역업종 대분류업종 중분류
지역1.0000.1120.173
업종 대분류0.1121.0000.103
업종 중분류0.1730.1031.000

Missing values

2023-12-13T00:46:54.122082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:46:54.303387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T00:46:54.483451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

지역진출국가관할무역관기업명(국문)기업명(영문)주소우편번호진출형태투자형태모기업명업종 대분류업종 중분류
8336동남아대양주베트남하노이시노펙스 베트남(주)SYNOPEX VIETNAM./JSC.DONG THO MULTI - COMPLEX I.Z, DONG THO, YEN PHONG, BAC NINH16000생산법인.시노펙스C. 제조업의료, 정밀, 광학 기기 및 시계 제조업
586서남아방글라데시다카LG전자LGSYMPHONY (6TH FLOOR), PLOT-SE(F)-9, ROAD-142SOUTH AVENUE, GULSHAN-1, DHAKA-12121212해외지사단독투자LG전자G. 도매 및 소매업<NA>
1883CIS러시아모스크바HS애드 모스크바법인 ((구)LG애드 모스크바법인)GIIR RUS LLC6 FLOOR, 4TH LESNOY PER., MOSCOW, RUSSIA125047서비스법인.HS애드J. 정보통신업<NA>
7505동남아대양주캄보디아프놈펜송가네SONG''S FAMILYNO 49A (VIMEAN PHNOM PENH. ST 209, SANGKAT CHRAING CHAM RESS 1, KHAN REUSSEY KEO, PHNOM PENH CAMBODIA).생산법인단독투자<NA>A. 농업, 임업 및 어업<NA>
506중국 (홍콩, 대만 포함)중국다롄대련창조기계유한공사DALIAN CHUANGZAO MACHINERY CO.ROOM 408, VIENNA BUILDING, NO.31, LIAOHE WEST ROAD, DALIAN CITY, LIAONING PROVINCE, CHINA116000생산법인단독투자<NA>C. 제조업자동차 및 트레일러 제조업
7840동남아대양주베트남하노이SEY CARTOON MANUFACTURING CO.,LTDSEY CARTOON MANUFACTURING CO., LTD.9TH FLOOR, MITEC BUILDING, LOT E2, CAU GIAY, YEN HOA, CAU GIAY, HANOI10000생산법인단독투자<NA>R. 예술, 스포츠 및 여가관련 서비스업<NA>
10194동남아대양주베트남호치민소울기어THE SOULGEAR VINA CO., LTD.LOT M-1-CN, ROAD NA7, MY PHUOC 2 IP, BEN CAT TOWN, BINH DUONG, VIETNAM700000생산법인단독투자소울기어C. 제조업가죽, 가방 및 신발 제조업
5910동남아대양주인도네시아자카르타켄리KENLEE (PARUNG FACTORY)JL. RAYA PARUNG KM20, BOGOR, INDONESIA16330생산법인.<NA>C. 제조업섬유제품 제조업 (의복 제외)
7247중국 (홍콩, 대만 포함)중국톈진천진덕인과기유한공사DUKIN TIANJIN TECHNOLOGY CO., LTD.天津市南水西道代城3021003300381판매법인단독투자<NA>G. 도매 및 소매업<NA>
502중국 (홍콩, 대만 포함)중국다롄대련장백물류유한공사DALIAN JANGBAEK LOGISTICS CO., LTD.CHANGXING ISLAND ECONOMIC ZONE, DALIAN, LIAONING PROVINCE, CHINA116000서비스법인단독투자장백산업기계N. 사업시설 관리, 사업 지원 및 임대 서비스업<NA>
지역진출국가관할무역관기업명(국문)기업명(영문)주소우편번호진출형태투자형태모기업명업종 대분류업종 중분류
6089동남아대양주인도네시아자카르타한솔인도끌라뗀HANSOLL INDO KLATENJL.BUGISAN RAYA 01,06 PRAMBANAN KAT KLATEN, JAWA TENGAH, INDONESIA57454생산법인.<NA>C. 제조업섬유제품 제조업 (의복 제외)
9909동남아대양주베트남호치민보성비나BU SUNG VINA CO., LTD.LOT E7-2, MINH HUNG-KOREA IP, MINH HUNG COMMUNE, CHON THANH DIST., BINH PHUOC, VIETNAM700000생산법인단독투자<NA>C. 제조업고무 및 플라스틱제품 제조업
3239CIS러시아상트페테르부르크대원루스DAEWON RUSRUSSIA, ST.PETERSBURG, LEVASHOVO, GORSKOE SHOSSE, 165, BLOCK 4, LIT. A.194361생산법인.주식회사 대원총업C. 제조업기타 제품 제조업
3398중국 (홍콩, 대만 포함)중국상하이경기섬유마케팅센터(GTC)GYEONGGI TEXTILE CENTER9B20, SHANGHAI, CHINA MART, 2299 YAN'AN WEST ROAD, CHENGNING DISTRICT, SHANGHAI, CHINA200051서비스법인단독투자걍기도경제과학진흥원M. 전문, 과학 및 기술 서비스업<NA>
4450동남아대양주싱가포르싱가포르어니언 테크놀로지ONION TECHNOLOGY PTE LTD.81 UBI AVE 4, #07-17 UB ONE, SINGAPORE408830서비스법인, 판매법인단독투자(주)어니언소프트웨어J. 정보통신업<NA>
6419중국 (홍콩, 대만 포함)중국칭다오연대지덕각륜공업유한공사YANTAI G-DOK CASTER & WHEEL MANUFACTURING CO., LTD.FUSHAN HIGH TECH INDUSTRIAL ZONE, YANTAI CITY, SHANDONG PROVINCE, CHINA265500생산법인단독투자<NA>C. 제조업고무 및 플라스틱제품 제조업
6535중국 (홍콩, 대만 포함)중국칭다오청도남경전자유한공사QINGDAO NANQING ELECTRONIC CO., LTD.NO. 14, TIANHE INDUSTRIAL PARK, 252 YANHE ROAD, QINGDAO ECONOMIC AND TECHNOLOGICAL DEVELOPMENT ZONE266500생산법인단독투자(주)대성하이테크C. 제조업자동차 및 트레일러 제조업
6643중국 (홍콩, 대만 포함)중국칭다오칭다오스베이사무기기유한공사QINGDAO SIBEI OFFICE EQUIPMENT CO., LTD.NO.5 ROAD, JIHONGTAN STREET, CHENGYANG DISTRICT, QINGDAO CITY266111생산법인합작투자(51%)(주)에스피씨C. 제조업기타 기계 및 장비 제조업
7959동남아대양주베트남하노이대선해운항공주식회사DAI SON TRADING & FORWARDING HANOI BRANCHLO11 BT1 ME TRI HA, TU LIEM, HANOI, VIETNAM10000서비스법인.Daesun Air&Sea TransportationH. 운수 및 창고업<NA>
7602유럽독일프랑크푸르트(주)보성상사 유럽지점BOSUNG ENGINEERING CO., HAMBURG BRANCHHEIDENKAMPSWEG 100, 8. OG20097기타(지점).<NA>G. 도매 및 소매업<NA>