Overview

Dataset statistics

Number of variables22
Number of observations10000
Missing cells3
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 MiB
Average record size in memory184.0 B

Variable types

Text16
Categorical6

Dataset

Description온라인수출 B2B플랫폼(고비즈코리아)에 등록된 회원사들의 업체 기본정보(지역, 수출규모, 공장여부, 국제인증 여부, 주 생산품 등)의 정보를 제공합니다
URLhttps://www.data.go.kr/data/15119047/fileData.do

Alerts

업체 국가(영문) is highly imbalanced (95.8%)Imbalance
기업정보개방여부(API) is highly imbalanced (71.2%)Imbalance
업체 일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:32:09.874439
Analysis finished2023-12-12 23:32:11.543096
Duration1.67 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체 일련번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:11.723824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length14.8916
Min length14

Characters and Unicode

Total characters148916
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowCP2018031456990
2nd rowCP2018031473203
3rd rowCP2018031468557
4th rowCP2018031461652
5th rowCP2018031485564
ValueCountFrequency (%)
cp2018031456990 1
 
< 0.1%
cp2018031469891 1
 
< 0.1%
cp2018031473657 1
 
< 0.1%
cp2018031459896 1
 
< 0.1%
cp2018031472409 1
 
< 0.1%
cp2018031464671 1
 
< 0.1%
cp2018031471618 1
 
< 0.1%
cp2018031459097 1
 
< 0.1%
cp2018031491056 1
 
< 0.1%
cp2018031480906 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-13T08:32:12.048650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 26204
17.6%
1 24554
16.5%
2 15885
10.7%
3 13694
9.2%
8 13093
8.8%
4 11306
7.6%
C 10000
 
6.7%
P 10000
 
6.7%
7 7853
 
5.3%
6 6188
 
4.2%
Other values (2) 10139
 
6.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 128916
86.6%
Uppercase Letter 20000
 
13.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 26204
20.3%
1 24554
19.0%
2 15885
12.3%
3 13694
10.6%
8 13093
10.2%
4 11306
8.8%
7 7853
 
6.1%
6 6188
 
4.8%
5 6030
 
4.7%
9 4109
 
3.2%
Uppercase Letter
ValueCountFrequency (%)
C 10000
50.0%
P 10000
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 128916
86.6%
Latin 20000
 
13.4%

Most frequent character per script

Common
ValueCountFrequency (%)
0 26204
20.3%
1 24554
19.0%
2 15885
12.3%
3 13694
10.6%
8 13093
10.2%
4 11306
8.8%
7 7853
 
6.1%
6 6188
 
4.8%
5 6030
 
4.7%
9 4109
 
3.2%
Latin
ValueCountFrequency (%)
C 10000
50.0%
P 10000
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 148916
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 26204
17.6%
1 24554
16.5%
2 15885
10.7%
3 13694
9.2%
8 13093
8.8%
4 11306
7.6%
C 10000
 
6.7%
P 10000
 
6.7%
7 7853
 
5.3%
6 6188
 
4.2%
Other values (2) 10139
 
6.8%
Distinct1621
Distinct (%)16.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:12.378126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length7
Mean length6.5297
Min length1

Characters and Unicode

Total characters65297
Distinct characters47
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1341 ?
Unique (%)13.4%

Sample

1st row데이터 미집계
2nd row데이터 미집계
3rd row데이터 미집계
4th row데이터 미집계
5th row데이터 미집계
ValueCountFrequency (%)
데이터 7907
44.1%
미집계 7907
44.1%
8511 9
 
0.1%
15432 8
 
< 0.1%
10048 8
 
< 0.1%
8390 8
 
< 0.1%
16229 8
 
< 0.1%
48059 8
 
< 0.1%
14449 7
 
< 0.1%
13207 6
 
< 0.1%
Other values (1616) 2035
 
11.4%
2023-12-13T08:32:12.820093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7911
12.1%
7907
12.1%
7907
12.1%
7907
12.1%
7907
12.1%
7907
12.1%
7907
12.1%
1 1579
 
2.4%
2 1175
 
1.8%
5 1085
 
1.7%
Other values (37) 6105
9.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47442
72.7%
Decimal Number 9891
 
15.1%
Space Separator 7911
 
12.1%
Lowercase Letter 30
 
< 0.1%
Uppercase Letter 17
 
< 0.1%
Dash Punctuation 6
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 4
13.3%
a 4
13.3%
x 3
10.0%
g 3
10.0%
o 2
 
6.7%
s 2
 
6.7%
t 2
 
6.7%
i 2
 
6.7%
m 1
 
3.3%
d 1
 
3.3%
Other values (6) 6
20.0%
Uppercase Letter
ValueCountFrequency (%)
P 2
11.8%
T 2
11.8%
H 2
11.8%
R 2
11.8%
A 1
 
5.9%
G 1
 
5.9%
C 1
 
5.9%
D 1
 
5.9%
S 1
 
5.9%
J 1
 
5.9%
Other values (3) 3
17.6%
Decimal Number
ValueCountFrequency (%)
1 1579
16.0%
2 1175
11.9%
5 1085
11.0%
4 1051
10.6%
0 1022
10.3%
3 963
9.7%
6 869
8.8%
8 799
8.1%
7 748
7.6%
9 600
 
6.1%
Other Letter
ValueCountFrequency (%)
7907
16.7%
7907
16.7%
7907
16.7%
7907
16.7%
7907
16.7%
7907
16.7%
Space Separator
ValueCountFrequency (%)
7911
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47442
72.7%
Common 17808
 
27.3%
Latin 47
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 4
 
8.5%
a 4
 
8.5%
x 3
 
6.4%
g 3
 
6.4%
o 2
 
4.3%
P 2
 
4.3%
T 2
 
4.3%
s 2
 
4.3%
H 2
 
4.3%
t 2
 
4.3%
Other values (19) 21
44.7%
Common
ValueCountFrequency (%)
7911
44.4%
1 1579
 
8.9%
2 1175
 
6.6%
5 1085
 
6.1%
4 1051
 
5.9%
0 1022
 
5.7%
3 963
 
5.4%
6 869
 
4.9%
8 799
 
4.5%
7 748
 
4.2%
Other values (2) 606
 
3.4%
Hangul
ValueCountFrequency (%)
7907
16.7%
7907
16.7%
7907
16.7%
7907
16.7%
7907
16.7%
7907
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47442
72.7%
ASCII 17855
 
27.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7911
44.3%
1 1579
 
8.8%
2 1175
 
6.6%
5 1085
 
6.1%
4 1051
 
5.9%
0 1022
 
5.7%
3 963
 
5.4%
6 869
 
4.9%
8 799
 
4.5%
7 748
 
4.2%
Other values (31) 653
 
3.7%
Hangul
ValueCountFrequency (%)
7907
16.7%
7907
16.7%
7907
16.7%
7907
16.7%
7907
16.7%
7907
16.7%

업체 국가(영문)
Categorical

IMBALANCE 

Distinct49
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
KR
9799 
28
 
33
US
 
14
AF
 
14
CN
 
14
Other values (44)
 
126

Length

Max length7
Median length2
Mean length2.006
Min length2

Unique

Unique23 ?
Unique (%)0.2%

Sample

1st rowKR
2nd rowKR
3rd rowKR
4th rowKR
5th rowKR

Common Values

ValueCountFrequency (%)
KR 9799
98.0%
28 33
 
0.3%
US 14
 
0.1%
AF 14
 
0.1%
CN 14
 
0.1%
데이터 미집계 12
 
0.1%
ID 12
 
0.1%
IN 11
 
0.1%
SG 8
 
0.1%
PH 7
 
0.1%
Other values (39) 76
 
0.8%

Length

2023-12-13T08:32:12.955607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
kr 9799
97.9%
28 33
 
0.3%
us 14
 
0.1%
af 14
 
0.1%
cn 14
 
0.1%
데이터 12
 
0.1%
미집계 12
 
0.1%
id 12
 
0.1%
in 11
 
0.1%
sg 8
 
0.1%
Other values (40) 83
 
0.8%
Distinct354
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:13.257556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length42
Mean length5.4134
Min length3

Characters and Unicode

Total characters54134
Distinct characters106
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique186 ?
Unique (%)1.9%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row강원도
5th row경기도
ValueCountFrequency (%)
경기도 2245
20.1%
서울특별시 1904
17.1%
데이터 1070
 
9.6%
미집계 1070
 
9.6%
부산광역시 467
 
4.2%
인천광역시 431
 
3.9%
경상남도 400
 
3.6%
대구광역시 351
 
3.2%
경상북도 260
 
2.3%
충청남도 214
 
1.9%
Other values (350) 2730
24.5%
2023-12-13T08:32:13.788522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3675
 
6.8%
3523
 
6.5%
2905
 
5.4%
2245
 
4.1%
1970
 
3.6%
1939
 
3.6%
1939
 
3.6%
1904
 
3.5%
n 1896
 
3.5%
1759
 
3.2%
Other values (96) 30379
56.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 36494
67.4%
Lowercase Letter 13049
 
24.1%
Uppercase Letter 1827
 
3.4%
Dash Punctuation 1464
 
2.7%
Space Separator 1142
 
2.1%
Other Punctuation 132
 
0.2%
Decimal Number 24
 
< 0.1%
Math Symbol 1
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3675
 
10.1%
3523
 
9.7%
2905
 
8.0%
2245
 
6.2%
1970
 
5.4%
1939
 
5.3%
1939
 
5.3%
1904
 
5.2%
1759
 
4.8%
1613
 
4.4%
Other values (27) 13022
35.7%
Lowercase Letter
ValueCountFrequency (%)
n 1896
14.5%
g 1758
13.5%
o 1441
11.0%
u 1432
11.0%
e 1135
8.7%
a 1030
7.9%
i 989
7.6%
s 975
7.5%
h 388
 
3.0%
m 370
 
2.8%
Other values (16) 1635
12.5%
Uppercase Letter
ValueCountFrequency (%)
G 463
25.3%
S 389
21.3%
Y 119
 
6.5%
B 101
 
5.5%
D 93
 
5.1%
J 91
 
5.0%
C 85
 
4.7%
A 77
 
4.2%
H 76
 
4.2%
N 69
 
3.8%
Other values (13) 264
14.4%
Decimal Number
ValueCountFrequency (%)
0 6
25.0%
2 4
16.7%
3 3
12.5%
1 3
12.5%
5 2
 
8.3%
7 2
 
8.3%
8 2
 
8.3%
9 1
 
4.2%
6 1
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 117
88.6%
? 6
 
4.5%
. 5
 
3.8%
: 1
 
0.8%
# 1
 
0.8%
& 1
 
0.8%
; 1
 
0.8%
Dash Punctuation
ValueCountFrequency (%)
- 1464
100.0%
Space Separator
ValueCountFrequency (%)
1142
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 36494
67.4%
Latin 14876
27.5%
Common 2764
 
5.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 1896
12.7%
g 1758
11.8%
o 1441
9.7%
u 1432
9.6%
e 1135
 
7.6%
a 1030
 
6.9%
i 989
 
6.6%
s 975
 
6.6%
G 463
 
3.1%
S 389
 
2.6%
Other values (39) 3368
22.6%
Hangul
ValueCountFrequency (%)
3675
 
10.1%
3523
 
9.7%
2905
 
8.0%
2245
 
6.2%
1970
 
5.4%
1939
 
5.3%
1939
 
5.3%
1904
 
5.2%
1759
 
4.8%
1613
 
4.4%
Other values (27) 13022
35.7%
Common
ValueCountFrequency (%)
- 1464
53.0%
1142
41.3%
, 117
 
4.2%
0 6
 
0.2%
? 6
 
0.2%
. 5
 
0.2%
2 4
 
0.1%
3 3
 
0.1%
1 3
 
0.1%
5 2
 
0.1%
Other values (10) 12
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 36494
67.4%
ASCII 17640
32.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3675
 
10.1%
3523
 
9.7%
2905
 
8.0%
2245
 
6.2%
1970
 
5.4%
1939
 
5.3%
1939
 
5.3%
1904
 
5.2%
1759
 
4.8%
1613
 
4.4%
Other values (27) 13022
35.7%
ASCII
ValueCountFrequency (%)
n 1896
10.7%
g 1758
10.0%
- 1464
 
8.3%
o 1441
 
8.2%
u 1432
 
8.1%
1142
 
6.5%
e 1135
 
6.4%
a 1030
 
5.8%
i 989
 
5.6%
s 975
 
5.5%
Other values (59) 4378
24.8%
Distinct7473
Distinct (%)74.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:14.234838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length72
Median length47
Mean length15.1991
Min length3

Characters and Unicode

Total characters151991
Distinct characters108
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6676 ?
Unique (%)66.8%

Sample

1st rowJun**
2nd rowcon******
3rd rowEHW**************
4th rowHAN************
5th rowKor******************
ValueCountFrequency (%)
데이터 1047
 
9.1%
미집계 1047
 
9.1%
410
 
3.6%
dae 318
 
2.8%
han 265
 
2.3%
don 210
 
1.8%
kor 169
 
1.5%
shi 115
 
1.0%
sam 111
 
1.0%
seo 85
 
0.7%
Other values (2745) 7709
67.1%
2023-12-13T08:32:14.845305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 117803
77.5%
A 1567
 
1.0%
1495
 
1.0%
E 1430
 
0.9%
S 1297
 
0.9%
N 1232
 
0.8%
O 1218
 
0.8%
1049
 
0.7%
1049
 
0.7%
1047
 
0.7%
Other values (98) 22804
 
15.0%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 118041
77.7%
Uppercase Letter 17080
 
11.2%
Lowercase Letter 8786
 
5.8%
Other Letter 6329
 
4.2%
Space Separator 1495
 
1.0%
Decimal Number 129
 
0.1%
Dash Punctuation 115
 
0.1%
Open Punctuation 8
 
< 0.1%
Close Punctuation 7
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1049
16.6%
1049
16.6%
1047
16.5%
1047
16.5%
1047
16.5%
1047
16.5%
7
 
0.1%
4
 
0.1%
2
 
< 0.1%
2
 
< 0.1%
Other values (27) 28
 
0.4%
Uppercase Letter
ValueCountFrequency (%)
A 1567
 
9.2%
E 1430
 
8.4%
S 1297
 
7.6%
N 1232
 
7.2%
O 1218
 
7.1%
D 1040
 
6.1%
I 924
 
5.4%
H 918
 
5.4%
M 904
 
5.3%
C 838
 
4.9%
Other values (16) 5712
33.4%
Lowercase Letter
ValueCountFrequency (%)
a 1044
11.9%
e 1017
11.6%
o 1003
11.4%
n 746
 
8.5%
i 570
 
6.5%
u 485
 
5.5%
r 467
 
5.3%
m 368
 
4.2%
s 361
 
4.1%
h 345
 
3.9%
Other values (16) 2380
27.1%
Decimal Number
ValueCountFrequency (%)
2 37
28.7%
6 25
19.4%
1 20
15.5%
3 16
12.4%
5 15
11.6%
4 8
 
6.2%
0 6
 
4.7%
9 2
 
1.6%
Other Punctuation
ValueCountFrequency (%)
* 117803
99.8%
& 138
 
0.1%
. 92
 
0.1%
, 4
 
< 0.1%
' 3
 
< 0.1%
/ 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
1495
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 115
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 119796
78.8%
Latin 25866
 
17.0%
Hangul 6329
 
4.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 1567
 
6.1%
E 1430
 
5.5%
S 1297
 
5.0%
N 1232
 
4.8%
O 1218
 
4.7%
a 1044
 
4.0%
D 1040
 
4.0%
e 1017
 
3.9%
o 1003
 
3.9%
I 924
 
3.6%
Other values (42) 14094
54.5%
Hangul
ValueCountFrequency (%)
1049
16.6%
1049
16.6%
1047
16.5%
1047
16.5%
1047
16.5%
1047
16.5%
7
 
0.1%
4
 
0.1%
2
 
< 0.1%
2
 
< 0.1%
Other values (27) 28
 
0.4%
Common
ValueCountFrequency (%)
* 117803
98.3%
1495
 
1.2%
& 138
 
0.1%
- 115
 
0.1%
. 92
 
0.1%
2 37
 
< 0.1%
6 25
 
< 0.1%
1 20
 
< 0.1%
3 16
 
< 0.1%
5 15
 
< 0.1%
Other values (9) 40
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 145662
95.8%
Hangul 6329
 
4.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 117803
80.9%
A 1567
 
1.1%
1495
 
1.0%
E 1430
 
1.0%
S 1297
 
0.9%
N 1232
 
0.8%
O 1218
 
0.8%
a 1044
 
0.7%
D 1040
 
0.7%
e 1017
 
0.7%
Other values (61) 16519
 
11.3%
Hangul
ValueCountFrequency (%)
1049
16.6%
1049
16.6%
1047
16.5%
1047
16.5%
1047
16.5%
1047
16.5%
7
 
0.1%
4
 
0.1%
2
 
< 0.1%
2
 
< 0.1%
Other values (27) 28
 
0.4%
Distinct214
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:15.046903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length7
Mean length7.3932
Min length1

Characters and Unicode

Total characters73932
Distinct characters63
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique165 ?
Unique (%)1.7%

Sample

1st row데이터 미집계
2nd row데이터 미집계
3rd row데이터 미집계
4th row데이터 미집계
5th row데이터 미집계
ValueCountFrequency (%)
데이터 8259
44.9%
미집계 8259
44.9%
gyeonggi-do 431
 
2.3%
seoul 370
 
2.0%
korea 221
 
1.2%
incheon 87
 
0.5%
busan 61
 
0.3%
gyeongsangnam-do 61
 
0.3%
gyeongsangbuk-do 46
 
0.3%
daegu 45
 
0.2%
Other values (208) 552
 
3.0%
2023-12-13T08:32:15.353937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8392
11.4%
8259
11.2%
8259
11.2%
8259
11.2%
8259
11.2%
8259
11.2%
8259
11.2%
o 2350
 
3.2%
e 1640
 
2.2%
g 1520
 
2.1%
Other values (53) 10476
14.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 49554
67.0%
Lowercase Letter 13107
 
17.7%
Space Separator 8392
 
11.4%
Uppercase Letter 1914
 
2.6%
Dash Punctuation 834
 
1.1%
Other Punctuation 109
 
0.1%
Close Punctuation 11
 
< 0.1%
Open Punctuation 11
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 2350
17.9%
e 1640
12.5%
g 1520
11.6%
n 1484
11.3%
a 913
 
7.0%
u 870
 
6.6%
d 765
 
5.8%
y 584
 
4.5%
i 563
 
4.3%
l 547
 
4.2%
Other values (15) 1871
14.3%
Uppercase Letter
ValueCountFrequency (%)
G 629
32.9%
S 407
21.3%
K 248
 
13.0%
C 102
 
5.3%
I 97
 
5.1%
J 87
 
4.5%
D 81
 
4.2%
B 74
 
3.9%
N 37
 
1.9%
A 36
 
1.9%
Other values (13) 116
 
6.1%
Other Letter
ValueCountFrequency (%)
8259
16.7%
8259
16.7%
8259
16.7%
8259
16.7%
8259
16.7%
8259
16.7%
Other Punctuation
ValueCountFrequency (%)
, 82
75.2%
/ 12
 
11.0%
? 8
 
7.3%
. 6
 
5.5%
: 1
 
0.9%
Space Separator
ValueCountFrequency (%)
8392
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 834
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 49554
67.0%
Latin 15021
 
20.3%
Common 9357
 
12.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 2350
15.6%
e 1640
10.9%
g 1520
10.1%
n 1484
9.9%
a 913
 
6.1%
u 870
 
5.8%
d 765
 
5.1%
G 629
 
4.2%
y 584
 
3.9%
i 563
 
3.7%
Other values (38) 3703
24.7%
Common
ValueCountFrequency (%)
8392
89.7%
- 834
 
8.9%
, 82
 
0.9%
/ 12
 
0.1%
) 11
 
0.1%
( 11
 
0.1%
? 8
 
0.1%
. 6
 
0.1%
: 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
8259
16.7%
8259
16.7%
8259
16.7%
8259
16.7%
8259
16.7%
8259
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 49554
67.0%
ASCII 24378
33.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8392
34.4%
o 2350
 
9.6%
e 1640
 
6.7%
g 1520
 
6.2%
n 1484
 
6.1%
a 913
 
3.7%
u 870
 
3.6%
- 834
 
3.4%
d 765
 
3.1%
G 629
 
2.6%
Other values (47) 4981
20.4%
Hangul
ValueCountFrequency (%)
8259
16.7%
8259
16.7%
8259
16.7%
8259
16.7%
8259
16.7%
8259
16.7%
Distinct690
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:15.657811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length40
Mean length6.2676
Min length1

Characters and Unicode

Total characters62676
Distinct characters230
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique296 ?
Unique (%)3.0%

Sample

1st row경기도 광명시
2nd row
3rd row경기 시흥시
4th row강원도 춘천시
5th row
ValueCountFrequency (%)
경기도 1772
 
11.2%
서울특별시 1326
 
8.4%
경기 681
 
4.3%
서울 547
 
3.5%
인천광역시 328
 
2.1%
부산광역시 321
 
2.0%
강남구 312
 
2.0%
성남시 268
 
1.7%
대구광역시 255
 
1.6%
경상남도 234
 
1.5%
Other values (621) 9723
61.7%
2023-12-13T08:32:16.108770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10000
 
16.0%
6379
 
10.2%
4162
 
6.6%
3219
 
5.1%
2751
 
4.4%
2667
 
4.3%
2494
 
4.0%
2078
 
3.3%
1716
 
2.7%
1574
 
2.5%
Other values (220) 25636
40.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 49723
79.3%
Space Separator 10000
 
16.0%
Lowercase Letter 1458
 
2.3%
Decimal Number 641
 
1.0%
Uppercase Letter 366
 
0.6%
Other Punctuation 262
 
0.4%
Dash Punctuation 214
 
0.3%
Close Punctuation 4
 
< 0.1%
Open Punctuation 4
 
< 0.1%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6379
 
12.8%
4162
 
8.4%
3219
 
6.5%
2751
 
5.5%
2667
 
5.4%
2494
 
5.0%
2078
 
4.2%
1716
 
3.5%
1574
 
3.2%
1358
 
2.7%
Other values (150) 21325
42.9%
Lowercase Letter
ValueCountFrequency (%)
n 233
16.0%
o 208
14.3%
g 149
10.2%
a 136
9.3%
e 108
 
7.4%
i 81
 
5.6%
d 77
 
5.3%
u 72
 
4.9%
r 54
 
3.7%
h 46
 
3.2%
Other values (15) 294
20.2%
Uppercase Letter
ValueCountFrequency (%)
S 39
 
10.7%
G 36
 
9.8%
D 24
 
6.6%
A 23
 
6.3%
N 23
 
6.3%
B 20
 
5.5%
H 19
 
5.2%
Y 18
 
4.9%
O 18
 
4.9%
R 18
 
4.9%
Other values (14) 128
35.0%
Decimal Number
ValueCountFrequency (%)
1 137
21.4%
3 77
12.0%
2 74
11.5%
0 66
10.3%
6 58
9.0%
4 55
8.6%
5 52
 
8.1%
7 48
 
7.5%
9 38
 
5.9%
8 36
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 207
79.0%
# 29
 
11.1%
. 20
 
7.6%
/ 3
 
1.1%
: 2
 
0.8%
@ 1
 
0.4%
Space Separator
ValueCountFrequency (%)
10000
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 214
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 49723
79.3%
Common 11129
 
17.8%
Latin 1824
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6379
 
12.8%
4162
 
8.4%
3219
 
6.5%
2751
 
5.5%
2667
 
5.4%
2494
 
5.0%
2078
 
4.2%
1716
 
3.5%
1574
 
3.2%
1358
 
2.7%
Other values (150) 21325
42.9%
Latin
ValueCountFrequency (%)
n 233
 
12.8%
o 208
 
11.4%
g 149
 
8.2%
a 136
 
7.5%
e 108
 
5.9%
i 81
 
4.4%
d 77
 
4.2%
u 72
 
3.9%
r 54
 
3.0%
h 46
 
2.5%
Other values (39) 660
36.2%
Common
ValueCountFrequency (%)
10000
89.9%
- 214
 
1.9%
, 207
 
1.9%
1 137
 
1.2%
3 77
 
0.7%
2 74
 
0.7%
0 66
 
0.6%
6 58
 
0.5%
4 55
 
0.5%
5 52
 
0.5%
Other values (11) 189
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 49723
79.3%
ASCII 12953
 
20.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10000
77.2%
n 233
 
1.8%
- 214
 
1.7%
o 208
 
1.6%
, 207
 
1.6%
g 149
 
1.2%
1 137
 
1.1%
a 136
 
1.0%
e 108
 
0.8%
i 81
 
0.6%
Other values (60) 1480
 
11.4%
Hangul
ValueCountFrequency (%)
6379
 
12.8%
4162
 
8.4%
3219
 
6.5%
2751
 
5.5%
2667
 
5.4%
2494
 
5.0%
2078
 
4.2%
1716
 
3.5%
1574
 
3.2%
1358
 
2.7%
Other values (150) 21325
42.9%
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
법인
6856 
개인
2968 
데이터 미집계
 
176

Length

Max length7
Median length2
Mean length2.088
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row법인
2nd row개인
3rd row개인
4th row개인
5th row법인

Common Values

ValueCountFrequency (%)
법인 6856
68.6%
개인 2968
29.7%
데이터 미집계 176
 
1.8%

Length

2023-12-13T08:32:16.243782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:32:16.347456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
법인 6856
67.4%
개인 2968
29.2%
데이터 176
 
1.7%
미집계 176
 
1.7%
Distinct313
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:16.708834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length6.7184
Min length1

Characters and Unicode

Total characters67184
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique234 ?
Unique (%)2.3%

Sample

1st row데이터 미집계
2nd row데이터 미집계
3rd row데이터 미집계
4th row데이터 미집계
5th row데이터 미집계
ValueCountFrequency (%)
데이터 9248
48.0%
미집계 9248
48.0%
0 42
 
0.2%
100 38
 
0.2%
1000 24
 
0.1%
50 22
 
0.1%
200 21
 
0.1%
10 19
 
0.1%
500 17
 
0.1%
300 16
 
0.1%
Other values (304) 553
 
2.9%
2023-12-13T08:32:17.300238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9248
13.8%
9248
13.8%
9248
13.8%
9248
13.8%
9248
13.8%
9248
13.8%
9248
13.8%
0 1131
 
1.7%
1 283
 
0.4%
2 210
 
0.3%
Other values (7) 824
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 55488
82.6%
Space Separator 9248
 
13.8%
Decimal Number 2448
 
3.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1131
46.2%
1 283
 
11.6%
2 210
 
8.6%
5 186
 
7.6%
3 171
 
7.0%
4 115
 
4.7%
6 102
 
4.2%
8 100
 
4.1%
7 90
 
3.7%
9 60
 
2.5%
Other Letter
ValueCountFrequency (%)
9248
16.7%
9248
16.7%
9248
16.7%
9248
16.7%
9248
16.7%
9248
16.7%
Space Separator
ValueCountFrequency (%)
9248
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 55488
82.6%
Common 11696
 
17.4%

Most frequent character per script

Common
ValueCountFrequency (%)
9248
79.1%
0 1131
 
9.7%
1 283
 
2.4%
2 210
 
1.8%
5 186
 
1.6%
3 171
 
1.5%
4 115
 
1.0%
6 102
 
0.9%
8 100
 
0.9%
7 90
 
0.8%
Hangul
ValueCountFrequency (%)
9248
16.7%
9248
16.7%
9248
16.7%
9248
16.7%
9248
16.7%
9248
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 55488
82.6%
ASCII 11696
 
17.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9248
16.7%
9248
16.7%
9248
16.7%
9248
16.7%
9248
16.7%
9248
16.7%
ASCII
ValueCountFrequency (%)
9248
79.1%
0 1131
 
9.7%
1 283
 
2.4%
2 210
 
1.8%
5 186
 
1.6%
3 171
 
1.5%
4 115
 
1.0%
6 102
 
0.9%
8 100
 
0.9%
7 90
 
0.8%
Distinct18
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도
2880 
서울시
2540 
데이터 미집계
648 
인천시
585 
부산시
564 
Other values (13)
2783 

Length

Max length7
Median length3
Mean length3.4274
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도
2nd row경기도
3rd row데이터 미집계
4th row강원도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 2880
28.8%
서울시 2540
25.4%
데이터 미집계 648
 
6.5%
인천시 585
 
5.9%
부산시 564
 
5.6%
경상남도 487
 
4.9%
대구시 436
 
4.4%
경상북도 329
 
3.3%
충청남도 273
 
2.7%
충청북도 267
 
2.7%
Other values (8) 991
 
9.9%

Length

2023-12-13T08:32:17.453876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 2880
27.0%
서울시 2540
23.9%
데이터 648
 
6.1%
미집계 648
 
6.1%
인천시 585
 
5.5%
부산시 564
 
5.3%
경상남도 487
 
4.6%
대구시 436
 
4.1%
경상북도 329
 
3.1%
충청남도 273
 
2.6%
Other values (9) 1258
11.8%
Distinct922
Distinct (%)9.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:17.708125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length1
Mean length2.8
Min length1

Characters and Unicode

Total characters28000
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique674 ?
Unique (%)6.7%

Sample

1st row0
2nd row100000000
3rd row데이터 미집계
4th row0
5th row136
ValueCountFrequency (%)
0 4746
40.5%
미집계 1732
 
14.8%
데이터 1732
 
14.8%
100 231
 
2.0%
10 168
 
1.4%
1000 148
 
1.3%
1 145
 
1.2%
50 109
 
0.9%
200 103
 
0.9%
300 88
 
0.8%
Other values (913) 2530
21.6%
2023-12-13T08:32:18.147178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 9696
34.6%
1732
 
6.2%
1732
 
6.2%
1732
 
6.2%
1732
 
6.2%
1732
 
6.2%
1732
 
6.2%
1732
 
6.2%
1 1626
 
5.8%
2 971
 
3.5%
Other values (7) 3583
 
12.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 15876
56.7%
Other Letter 10392
37.1%
Space Separator 1732
 
6.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 9696
61.1%
1 1626
 
10.2%
2 971
 
6.1%
5 896
 
5.6%
3 692
 
4.4%
4 472
 
3.0%
6 431
 
2.7%
8 409
 
2.6%
7 376
 
2.4%
9 307
 
1.9%
Other Letter
ValueCountFrequency (%)
1732
16.7%
1732
16.7%
1732
16.7%
1732
16.7%
1732
16.7%
1732
16.7%
Space Separator
ValueCountFrequency (%)
1732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 17608
62.9%
Hangul 10392
37.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 9696
55.1%
1732
 
9.8%
1 1626
 
9.2%
2 971
 
5.5%
5 896
 
5.1%
3 692
 
3.9%
4 472
 
2.7%
6 431
 
2.4%
8 409
 
2.3%
7 376
 
2.1%
Hangul
ValueCountFrequency (%)
1732
16.7%
1732
16.7%
1732
16.7%
1732
16.7%
1732
16.7%
1732
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17608
62.9%
Hangul 10392
37.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 9696
55.1%
1732
 
9.8%
1 1626
 
9.2%
2 971
 
5.5%
5 896
 
5.1%
3 692
 
3.9%
4 472
 
2.7%
6 431
 
2.4%
8 409
 
2.3%
7 376
 
2.1%
Hangul
ValueCountFrequency (%)
1732
16.7%
1732
16.7%
1732
16.7%
1732
16.7%
1732
16.7%
1732
16.7%
Distinct462
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:18.434356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length154
Median length7
Mean length9.3784
Min length1

Characters and Unicode

Total characters93784
Distinct characters202
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique456 ?
Unique (%)4.6%

Sample

1st row데이터 미집계
2nd row데이터 미집계
3rd row데이터 미집계
4th row데이터 미집계
5th row데이터 미집계
ValueCountFrequency (%)
데이터 9530
43.0%
미집계 9530
43.0%
korea 304
 
1.4%
gyeonggi-do 113
 
0.5%
of 69
 
0.3%
rep 61
 
0.3%
seoul 28
 
0.1%
busan 28
 
0.1%
28
 
0.1%
incheon 27
 
0.1%
Other values (1713) 2454
 
11.1%
2023-12-13T08:32:18.889049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12302
13.1%
9533
10.2%
9533
10.2%
9531
10.2%
9530
10.2%
9530
10.2%
9530
10.2%
o 2093
 
2.2%
n 1932
 
2.1%
, 1601
 
1.7%
Other values (192) 18669
19.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 57608
61.4%
Lowercase Letter 14315
 
15.3%
Space Separator 12302
 
13.1%
Uppercase Letter 3339
 
3.6%
Decimal Number 2849
 
3.0%
Other Punctuation 1790
 
1.9%
Dash Punctuation 1508
 
1.6%
Open Punctuation 32
 
< 0.1%
Close Punctuation 29
 
< 0.1%
Math Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9533
16.5%
9533
16.5%
9531
16.5%
9530
16.5%
9530
16.5%
9530
16.5%
27
 
< 0.1%
20
 
< 0.1%
18
 
< 0.1%
16
 
< 0.1%
Other values (113) 340
 
0.6%
Lowercase Letter
ValueCountFrequency (%)
o 2093
14.6%
n 1932
13.5%
g 1486
10.4%
e 1438
10.0%
a 1261
8.8%
u 850
 
5.9%
i 789
 
5.5%
r 570
 
4.0%
s 505
 
3.5%
d 489
 
3.4%
Other values (16) 2902
20.3%
Uppercase Letter
ValueCountFrequency (%)
G 437
13.1%
K 365
 
10.9%
S 303
 
9.1%
O 221
 
6.6%
R 203
 
6.1%
A 195
 
5.8%
N 181
 
5.4%
E 180
 
5.4%
D 167
 
5.0%
C 158
 
4.7%
Other values (16) 929
27.8%
Decimal Number
ValueCountFrequency (%)
1 494
17.3%
2 362
12.7%
4 322
11.3%
0 316
11.1%
3 306
10.7%
8 244
8.6%
5 241
8.5%
7 211
7.4%
6 209
7.3%
9 144
 
5.1%
Other Punctuation
ValueCountFrequency (%)
, 1601
89.4%
. 121
 
6.8%
# 46
 
2.6%
' 10
 
0.6%
/ 5
 
0.3%
: 3
 
0.2%
? 2
 
0.1%
& 2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 30
93.8%
[ 2
 
6.2%
Close Punctuation
ValueCountFrequency (%)
) 27
93.1%
] 2
 
6.9%
Math Symbol
ValueCountFrequency (%)
~ 6
85.7%
= 1
 
14.3%
Space Separator
ValueCountFrequency (%)
12302
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1508
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 57608
61.4%
Common 18522
 
19.7%
Latin 17654
 
18.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9533
16.5%
9533
16.5%
9531
16.5%
9530
16.5%
9530
16.5%
9530
16.5%
27
 
< 0.1%
20
 
< 0.1%
18
 
< 0.1%
16
 
< 0.1%
Other values (113) 340
 
0.6%
Latin
ValueCountFrequency (%)
o 2093
 
11.9%
n 1932
 
10.9%
g 1486
 
8.4%
e 1438
 
8.1%
a 1261
 
7.1%
u 850
 
4.8%
i 789
 
4.5%
r 570
 
3.2%
s 505
 
2.9%
d 489
 
2.8%
Other values (42) 6241
35.4%
Common
ValueCountFrequency (%)
12302
66.4%
, 1601
 
8.6%
- 1508
 
8.1%
1 494
 
2.7%
2 362
 
2.0%
4 322
 
1.7%
0 316
 
1.7%
3 306
 
1.7%
8 244
 
1.3%
5 241
 
1.3%
Other values (17) 826
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 57608
61.4%
ASCII 36176
38.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12302
34.0%
o 2093
 
5.8%
n 1932
 
5.3%
, 1601
 
4.4%
- 1508
 
4.2%
g 1486
 
4.1%
e 1438
 
4.0%
a 1261
 
3.5%
u 850
 
2.3%
i 789
 
2.2%
Other values (69) 10916
30.2%
Hangul
ValueCountFrequency (%)
9533
16.5%
9533
16.5%
9531
16.5%
9530
16.5%
9530
16.5%
9530
16.5%
27
 
< 0.1%
20
 
< 0.1%
18
 
< 0.1%
16
 
< 0.1%
Other values (113) 340
 
0.6%
Distinct5383
Distinct (%)53.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:19.403499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length135
Median length81
Mean length12.5263
Min length1

Characters and Unicode

Total characters125263
Distinct characters131
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5365 ?
Unique (%)53.6%

Sample

1st rowwww.juncs.co.kr
2nd row데이터 미집계
3rd row데이터 미집계
4th rowboatman.co.kr
5th row데이터 미집계
ValueCountFrequency (%)
데이터 4486
31.2%
미집계 4486
31.2%
3
 
< 0.1%
www.nicetech.net 2
 
< 0.1%
www.ant21.net 2
 
< 0.1%
www.kyungwonmedical.com 2
 
< 0.1%
www.bestlogis.co.kr 2
 
< 0.1%
www.goodsheet.co.kr 2
 
< 0.1%
swtextile.koreasme.com 2
 
< 0.1%
www.bioneer.co.kr 2
 
< 0.1%
Other values (5384) 5393
37.5%
2023-12-13T08:32:19.788088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
w 13898
 
11.1%
. 12868
 
10.3%
o 8907
 
7.1%
c 7215
 
5.8%
e 5478
 
4.4%
r 4977
 
4.0%
m 4776
 
3.8%
4665
 
3.7%
4489
 
3.6%
4486
 
3.6%
Other values (121) 53504
42.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 78187
62.4%
Other Letter 26980
 
21.5%
Other Punctuation 13815
 
11.0%
Space Separator 4665
 
3.7%
Decimal Number 844
 
0.7%
Dash Punctuation 445
 
0.4%
Uppercase Letter 279
 
0.2%
Connector Punctuation 34
 
< 0.1%
Math Symbol 8
 
< 0.1%
Open Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4489
16.6%
4486
16.6%
4486
16.6%
4486
16.6%
4486
16.6%
4486
16.6%
4
 
< 0.1%
2
 
< 0.1%
2
 
< 0.1%
2
 
< 0.1%
Other values (45) 51
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
w 13898
17.8%
o 8907
11.4%
c 7215
 
9.2%
e 5478
 
7.0%
r 4977
 
6.4%
m 4776
 
6.1%
k 4226
 
5.4%
a 3904
 
5.0%
n 3813
 
4.9%
s 2917
 
3.7%
Other values (16) 18076
23.1%
Uppercase Letter
ValueCountFrequency (%)
W 39
14.0%
C 30
 
10.8%
O 28
 
10.0%
E 20
 
7.2%
M 18
 
6.5%
N 14
 
5.0%
K 14
 
5.0%
R 13
 
4.7%
S 12
 
4.3%
L 11
 
3.9%
Other values (14) 80
28.7%
Other Punctuation
ValueCountFrequency (%)
. 12868
93.1%
/ 831
 
6.0%
: 78
 
0.6%
, 12
 
0.1%
% 9
 
0.1%
? 8
 
0.1%
@ 5
 
< 0.1%
& 2
 
< 0.1%
# 1
 
< 0.1%
! 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 221
26.2%
2 215
25.5%
0 119
14.1%
3 61
 
7.2%
4 54
 
6.4%
5 41
 
4.9%
7 37
 
4.4%
8 36
 
4.3%
9 34
 
4.0%
6 26
 
3.1%
Space Separator
ValueCountFrequency (%)
4665
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 445
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 34
100.0%
Math Symbol
ValueCountFrequency (%)
= 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 78466
62.6%
Hangul 26980
 
21.5%
Common 19817
 
15.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4489
16.6%
4486
16.6%
4486
16.6%
4486
16.6%
4486
16.6%
4486
16.6%
4
 
< 0.1%
2
 
< 0.1%
2
 
< 0.1%
2
 
< 0.1%
Other values (45) 51
 
0.2%
Latin
ValueCountFrequency (%)
w 13898
17.7%
o 8907
11.4%
c 7215
 
9.2%
e 5478
 
7.0%
r 4977
 
6.3%
m 4776
 
6.1%
k 4226
 
5.4%
a 3904
 
5.0%
n 3813
 
4.9%
s 2917
 
3.7%
Other values (40) 18355
23.4%
Common
ValueCountFrequency (%)
. 12868
64.9%
4665
 
23.5%
/ 831
 
4.2%
- 445
 
2.2%
1 221
 
1.1%
2 215
 
1.1%
0 119
 
0.6%
: 78
 
0.4%
3 61
 
0.3%
4 54
 
0.3%
Other values (16) 260
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 98283
78.5%
Hangul 26980
 
21.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 13898
14.1%
. 12868
13.1%
o 8907
 
9.1%
c 7215
 
7.3%
e 5478
 
5.6%
r 4977
 
5.1%
m 4776
 
4.9%
4665
 
4.7%
k 4226
 
4.3%
a 3904
 
4.0%
Other values (66) 27369
27.8%
Hangul
ValueCountFrequency (%)
4489
16.6%
4486
16.6%
4486
16.6%
4486
16.6%
4486
16.6%
4486
16.6%
4
 
< 0.1%
2
 
< 0.1%
2
 
< 0.1%
2
 
< 0.1%
Other values (45) 51
 
0.2%
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
데이터 미집계
6290 
N
2150 
Y
1560 

Length

Max length7
Median length7
Mean length4.774
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row데이터 미집계
2nd rowY
3rd row데이터 미집계
4th row데이터 미집계
5th rowN

Common Values

ValueCountFrequency (%)
데이터 미집계 6290
62.9%
N 2150
 
21.5%
Y 1560
 
15.6%

Length

2023-12-13T08:32:19.910325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:32:20.016591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
데이터 6290
38.6%
미집계 6290
38.6%
n 2150
 
13.2%
y 1560
 
9.6%
Distinct274
Distinct (%)2.7%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T08:32:20.395129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length98
Median length7
Mean length7.1815363
Min length1

Characters and Unicode

Total characters71801
Distinct characters212
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique235 ?
Unique (%)2.4%

Sample

1st row데이터 미집계
2nd row데이터 미집계
3rd row데이터 미집계
4th row데이터 미집계
5th row데이터 미집계
ValueCountFrequency (%)
데이터 9562
47.8%
미집계 9562
47.8%
iso 94
 
0.5%
ce 71
 
0.4%
iso9001 66
 
0.3%
9001 38
 
0.2%
없음 31
 
0.2%
27
 
0.1%
iso14001 22
 
0.1%
14001 21
 
0.1%
Other values (321) 528
 
2.6%
2023-12-13T08:32:20.932463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10043
14.0%
9569
13.3%
9565
13.3%
9563
13.3%
9562
13.3%
9562
13.3%
9562
13.3%
0 576
 
0.8%
1 341
 
0.5%
S 313
 
0.4%
Other values (202) 3145
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 57900
80.6%
Space Separator 10043
 
14.0%
Uppercase Letter 1696
 
2.4%
Decimal Number 1416
 
2.0%
Other Punctuation 355
 
0.5%
Lowercase Letter 329
 
0.5%
Dash Punctuation 26
 
< 0.1%
Close Punctuation 17
 
< 0.1%
Open Punctuation 17
 
< 0.1%
Final Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9569
16.5%
9565
16.5%
9563
16.5%
9562
16.5%
9562
16.5%
9562
16.5%
34
 
0.1%
32
 
0.1%
31
 
0.1%
30
 
0.1%
Other values (132) 390
 
0.7%
Uppercase Letter
ValueCountFrequency (%)
S 313
18.5%
I 255
15.0%
O 255
15.0%
C 210
12.4%
E 119
 
7.0%
A 75
 
4.4%
F 57
 
3.4%
L 52
 
3.1%
P 51
 
3.0%
T 46
 
2.7%
Other values (16) 263
15.5%
Lowercase Letter
ValueCountFrequency (%)
e 44
13.4%
o 37
11.2%
s 32
9.7%
i 29
8.8%
c 27
8.2%
a 23
 
7.0%
t 22
 
6.7%
n 20
 
6.1%
r 17
 
5.2%
p 15
 
4.6%
Other values (13) 63
19.1%
Decimal Number
ValueCountFrequency (%)
0 576
40.7%
1 341
24.1%
9 189
 
13.3%
4 101
 
7.1%
2 76
 
5.4%
8 48
 
3.4%
6 29
 
2.0%
5 23
 
1.6%
3 22
 
1.6%
7 11
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 278
78.3%
: 27
 
7.6%
/ 24
 
6.8%
. 18
 
5.1%
& 8
 
2.3%
Space Separator
ValueCountFrequency (%)
10043
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 57899
80.6%
Common 11876
 
16.5%
Latin 2025
 
2.8%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9569
16.5%
9565
16.5%
9563
16.5%
9562
16.5%
9562
16.5%
9562
16.5%
34
 
0.1%
32
 
0.1%
31
 
0.1%
30
 
0.1%
Other values (131) 389
 
0.7%
Latin
ValueCountFrequency (%)
S 313
15.5%
I 255
12.6%
O 255
12.6%
C 210
 
10.4%
E 119
 
5.9%
A 75
 
3.7%
F 57
 
2.8%
L 52
 
2.6%
P 51
 
2.5%
T 46
 
2.3%
Other values (39) 592
29.2%
Common
ValueCountFrequency (%)
10043
84.6%
0 576
 
4.9%
1 341
 
2.9%
, 278
 
2.3%
9 189
 
1.6%
4 101
 
0.9%
2 76
 
0.6%
8 48
 
0.4%
6 29
 
0.2%
: 27
 
0.2%
Other values (11) 168
 
1.4%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 57899
80.6%
ASCII 13899
 
19.4%
Punctuation 2
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10043
72.3%
0 576
 
4.1%
1 341
 
2.5%
S 313
 
2.3%
, 278
 
2.0%
I 255
 
1.8%
O 255
 
1.8%
C 210
 
1.5%
9 189
 
1.4%
E 119
 
0.9%
Other values (58) 1320
 
9.5%
Hangul
ValueCountFrequency (%)
9569
16.5%
9565
16.5%
9563
16.5%
9562
16.5%
9562
16.5%
9562
16.5%
34
 
0.1%
32
 
0.1%
31
 
0.1%
30
 
0.1%
Other values (131) 389
 
0.7%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct151
Distinct (%)1.5%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T08:32:21.294881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length993
Median length7
Mean length8.0261026
Min length1

Characters and Unicode

Total characters80253
Distinct characters631
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique147 ?
Unique (%)1.5%

Sample

1st row데이터 미집계
2nd row데이터 미집계
3rd row데이터 미집계
4th row데이터 미집계
5th row남미, 인도네시아, 미국 등으로 시장확대 예정
ValueCountFrequency (%)
데이터 9823
44.5%
미집계 9823
44.5%
52
 
0.2%
없음 28
 
0.1%
23
 
0.1%
있는 16
 
0.1%
있습니다 13
 
0.1%
12
 
0.1%
수출 11
 
< 0.1%
통해 9
 
< 0.1%
Other values (1753) 2256
 
10.2%
2023-12-13T08:32:21.828962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12153
15.1%
9990
12.4%
9866
12.3%
9842
12.3%
9841
12.3%
9830
12.2%
9827
12.2%
135
 
0.2%
. 130
 
0.2%
123
 
0.2%
Other values (621) 8516
10.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 66025
82.3%
Space Separator 12153
 
15.1%
Lowercase Letter 856
 
1.1%
Decimal Number 448
 
0.6%
Other Punctuation 345
 
0.4%
Uppercase Letter 262
 
0.3%
Dash Punctuation 44
 
0.1%
Close Punctuation 43
 
0.1%
Open Punctuation 40
 
< 0.1%
Control 28
 
< 0.1%
Other values (2) 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9990
15.1%
9866
14.9%
9842
14.9%
9841
14.9%
9830
14.9%
9827
14.9%
135
 
0.2%
123
 
0.2%
122
 
0.2%
120
 
0.2%
Other values (536) 6329
9.6%
Lowercase Letter
ValueCountFrequency (%)
e 90
 
10.5%
n 77
 
9.0%
o 74
 
8.6%
a 69
 
8.1%
r 66
 
7.7%
i 61
 
7.1%
t 58
 
6.8%
l 49
 
5.7%
s 44
 
5.1%
c 32
 
3.7%
Other values (15) 236
27.6%
Uppercase Letter
ValueCountFrequency (%)
S 28
 
10.7%
A 23
 
8.8%
B 22
 
8.4%
E 21
 
8.0%
C 18
 
6.9%
P 17
 
6.5%
D 16
 
6.1%
M 14
 
5.3%
T 13
 
5.0%
O 11
 
4.2%
Other values (13) 79
30.2%
Other Punctuation
ValueCountFrequency (%)
. 130
37.7%
, 118
34.2%
: 35
 
10.1%
" 19
 
5.5%
* 9
 
2.6%
' 9
 
2.6%
· 6
 
1.7%
& 4
 
1.2%
/ 3
 
0.9%
% 3
 
0.9%
Other values (6) 9
 
2.6%
Decimal Number
ValueCountFrequency (%)
0 114
25.4%
1 94
21.0%
2 90
20.1%
6 33
 
7.4%
7 29
 
6.5%
5 27
 
6.0%
4 21
 
4.7%
3 20
 
4.5%
9 14
 
3.1%
8 6
 
1.3%
Math Symbol
ValueCountFrequency (%)
> 2
28.6%
< 2
28.6%
~ 1
14.3%
= 1
14.3%
+ 1
14.3%
Space Separator
ValueCountFrequency (%)
12153
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 44
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%
Open Punctuation
ValueCountFrequency (%)
( 40
100.0%
Control
ValueCountFrequency (%)
28
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 66024
82.3%
Common 13110
 
16.3%
Latin 1118
 
1.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9990
15.1%
9866
14.9%
9842
14.9%
9841
14.9%
9830
14.9%
9827
14.9%
135
 
0.2%
123
 
0.2%
122
 
0.2%
120
 
0.2%
Other values (535) 6328
9.6%
Latin
ValueCountFrequency (%)
e 90
 
8.1%
n 77
 
6.9%
o 74
 
6.6%
a 69
 
6.2%
r 66
 
5.9%
i 61
 
5.5%
t 58
 
5.2%
l 49
 
4.4%
s 44
 
3.9%
c 32
 
2.9%
Other values (38) 498
44.5%
Common
ValueCountFrequency (%)
12153
92.7%
. 130
 
1.0%
, 118
 
0.9%
0 114
 
0.9%
1 94
 
0.7%
2 90
 
0.7%
- 44
 
0.3%
) 43
 
0.3%
( 40
 
0.3%
: 35
 
0.3%
Other values (27) 249
 
1.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 66022
82.3%
ASCII 14221
 
17.7%
None 6
 
< 0.1%
Compat Jamo 2
 
< 0.1%
CJK 1
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12153
85.5%
. 130
 
0.9%
, 118
 
0.8%
0 114
 
0.8%
1 94
 
0.7%
e 90
 
0.6%
2 90
 
0.6%
n 77
 
0.5%
o 74
 
0.5%
a 69
 
0.5%
Other values (73) 1212
 
8.5%
Hangul
ValueCountFrequency (%)
9990
15.1%
9866
14.9%
9842
14.9%
9841
14.9%
9830
14.9%
9827
14.9%
135
 
0.2%
123
 
0.2%
122
 
0.2%
120
 
0.2%
Other values (533) 6326
9.6%
None
ValueCountFrequency (%)
· 6
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
Distinct1251
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:22.211486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length80
Median length7
Mean length7.1555
Min length1

Characters and Unicode

Total characters71555
Distinct characters634
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1154 ?
Unique (%)11.5%

Sample

1st row데이터 미집계
2nd row데이터 미집계
3rd row데이터 미집계
4th row데이터 미집계
5th row데이터 미집계
ValueCountFrequency (%)
데이터 8485
42.7%
미집계 8485
42.7%
화장품 129
 
0.6%
128
 
0.6%
38
 
0.2%
led 30
 
0.2%
마스크팩 24
 
0.1%
부품 23
 
0.1%
제품 23
 
0.1%
21
 
0.1%
Other values (1720) 2477
 
12.5%
2023-12-13T08:32:22.797941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9910
13.8%
8666
12.1%
8556
12.0%
8536
11.9%
8534
11.9%
8494
11.9%
8485
11.9%
394
 
0.6%
, 353
 
0.5%
350
 
0.5%
Other values (624) 9277
13.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59625
83.3%
Space Separator 9910
 
13.8%
Lowercase Letter 996
 
1.4%
Uppercase Letter 547
 
0.8%
Other Punctuation 385
 
0.5%
Close Punctuation 35
 
< 0.1%
Open Punctuation 35
 
< 0.1%
Decimal Number 12
 
< 0.1%
Dash Punctuation 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8666
14.5%
8556
14.3%
8536
14.3%
8534
14.3%
8494
14.2%
8485
14.2%
394
 
0.7%
350
 
0.6%
253
 
0.4%
234
 
0.4%
Other values (557) 7123
11.9%
Lowercase Letter
ValueCountFrequency (%)
e 119
11.9%
a 93
 
9.3%
i 91
 
9.1%
t 79
 
7.9%
r 72
 
7.2%
o 64
 
6.4%
l 54
 
5.4%
s 54
 
5.4%
c 52
 
5.2%
n 52
 
5.2%
Other values (15) 266
26.7%
Uppercase Letter
ValueCountFrequency (%)
E 68
12.4%
D 59
10.8%
P 56
10.2%
C 56
10.2%
L 52
9.5%
T 33
 
6.0%
S 33
 
6.0%
V 28
 
5.1%
R 19
 
3.5%
H 18
 
3.3%
Other values (15) 125
22.9%
Other Punctuation
ValueCountFrequency (%)
, 353
91.7%
/ 15
 
3.9%
& 8
 
2.1%
. 4
 
1.0%
· 2
 
0.5%
# 1
 
0.3%
? 1
 
0.3%
% 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
3 7
58.3%
1 2
 
16.7%
4 1
 
8.3%
2 1
 
8.3%
7 1
 
8.3%
Space Separator
ValueCountFrequency (%)
9910
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 35
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59625
83.3%
Common 10387
 
14.5%
Latin 1543
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8666
14.5%
8556
14.3%
8536
14.3%
8534
14.3%
8494
14.2%
8485
14.2%
394
 
0.7%
350
 
0.6%
253
 
0.4%
234
 
0.4%
Other values (557) 7123
11.9%
Latin
ValueCountFrequency (%)
e 119
 
7.7%
a 93
 
6.0%
i 91
 
5.9%
t 79
 
5.1%
r 72
 
4.7%
E 68
 
4.4%
o 64
 
4.1%
D 59
 
3.8%
P 56
 
3.6%
C 56
 
3.6%
Other values (40) 786
50.9%
Common
ValueCountFrequency (%)
9910
95.4%
, 353
 
3.4%
) 35
 
0.3%
( 35
 
0.3%
/ 15
 
0.1%
- 10
 
0.1%
& 8
 
0.1%
3 7
 
0.1%
. 4
 
< 0.1%
· 2
 
< 0.1%
Other values (7) 8
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59625
83.3%
ASCII 11928
 
16.7%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9910
83.1%
, 353
 
3.0%
e 119
 
1.0%
a 93
 
0.8%
i 91
 
0.8%
t 79
 
0.7%
r 72
 
0.6%
E 68
 
0.6%
o 64
 
0.5%
D 59
 
0.5%
Other values (56) 1020
 
8.6%
Hangul
ValueCountFrequency (%)
8666
14.5%
8556
14.3%
8536
14.3%
8534
14.3%
8494
14.2%
8485
14.2%
394
 
0.7%
350
 
0.6%
253
 
0.4%
234
 
0.4%
Other values (557) 7123
11.9%
None
ValueCountFrequency (%)
· 2
100.0%

기업정보개방여부(API)
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
데이터 미집계
9246 
N
 
481
Y
 
273

Length

Max length7
Median length7
Mean length6.5476
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row데이터 미집계
2nd row데이터 미집계
3rd row데이터 미집계
4th row데이터 미집계
5th row데이터 미집계

Common Values

ValueCountFrequency (%)
데이터 미집계 9246
92.5%
N 481
 
4.8%
Y 273
 
2.7%

Length

2023-12-13T08:32:22.973839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:32:23.107482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
데이터 9246
48.0%
미집계 9246
48.0%
n 481
 
2.5%
y 273
 
1.4%

등급코드
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
GOLD
4122 
MEMBER
4075 
PLATINUM
1716 
데이터 미집계
 
86
SILVER
 
1

Length

Max length8
Median length7
Mean length5.5274
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowPLATINUM
2nd rowMEMBER
3rd rowPLATINUM
4th rowPLATINUM
5th rowMEMBER

Common Values

ValueCountFrequency (%)
GOLD 4122
41.2%
MEMBER 4075
40.8%
PLATINUM 1716
17.2%
데이터 미집계 86
 
0.9%
SILVER 1
 
< 0.1%

Length

2023-12-13T08:32:23.240357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:32:23.356334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
gold 4122
40.9%
member 4075
40.4%
platinum 1716
17.0%
데이터 86
 
0.9%
미집계 86
 
0.9%
silver 1
 
< 0.1%
Distinct309
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:23.782796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length2.9768
Min length1

Characters and Unicode

Total characters29768
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)0.7%

Sample

1st row840
2nd row150
3rd row850
4th row850
5th row150
ValueCountFrequency (%)
850 1001
 
9.9%
250 623
 
6.2%
50 585
 
5.8%
650 578
 
5.7%
200 536
 
5.3%
300 497
 
4.9%
150 478
 
4.7%
100 460
 
4.6%
450 448
 
4.4%
400 443
 
4.4%
Other values (300) 4437
44.0%
2023-12-13T08:32:24.392976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 12134
40.8%
5 6766
22.7%
1 1847
 
6.2%
2 1780
 
6.0%
8 1569
 
5.3%
6 1497
 
5.0%
3 1356
 
4.6%
4 1279
 
4.3%
7 629
 
2.1%
9 309
 
1.0%
Other values (7) 602
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 29166
98.0%
Other Letter 516
 
1.7%
Space Separator 86
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 12134
41.6%
5 6766
23.2%
1 1847
 
6.3%
2 1780
 
6.1%
8 1569
 
5.4%
6 1497
 
5.1%
3 1356
 
4.6%
4 1279
 
4.4%
7 629
 
2.2%
9 309
 
1.1%
Other Letter
ValueCountFrequency (%)
86
16.7%
86
16.7%
86
16.7%
86
16.7%
86
16.7%
86
16.7%
Space Separator
ValueCountFrequency (%)
86
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 29252
98.3%
Hangul 516
 
1.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 12134
41.5%
5 6766
23.1%
1 1847
 
6.3%
2 1780
 
6.1%
8 1569
 
5.4%
6 1497
 
5.1%
3 1356
 
4.6%
4 1279
 
4.4%
7 629
 
2.2%
9 309
 
1.1%
Hangul
ValueCountFrequency (%)
86
16.7%
86
16.7%
86
16.7%
86
16.7%
86
16.7%
86
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29252
98.3%
Hangul 516
 
1.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 12134
41.5%
5 6766
23.1%
1 1847
 
6.3%
2 1780
 
6.1%
8 1569
 
5.4%
6 1497
 
5.1%
3 1356
 
4.6%
4 1279
 
4.4%
7 629
 
2.2%
9 309
 
1.1%
Hangul
ValueCountFrequency (%)
86
16.7%
86
16.7%
86
16.7%
86
16.7%
86
16.7%
86
16.7%
Distinct639
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:24.790482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length232
Median length7
Mean length8.8433
Min length1

Characters and Unicode

Total characters88433
Distinct characters297
Distinct categories16 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique561 ?
Unique (%)5.6%

Sample

1st row데이터 미집계
2nd row데이터 미집계
3rd row데이터 미집계
4th row데이터 미집계
5th row데이터 미집계
ValueCountFrequency (%)
데이터 8693
41.1%
미집계 8693
41.1%
established 486
 
2.3%
company 358
 
1.7%
of 153
 
0.7%
establishment 139
 
0.7%
설립 132
 
0.6%
co 96
 
0.5%
estabilshed 76
 
0.4%
a 73
 
0.3%
Other values (1209) 2236
 
10.6%
2023-12-13T08:32:25.345122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11201
12.7%
8705
9.8%
8697
9.8%
8696
9.8%
8694
9.8%
8694
9.8%
8693
9.8%
e 2013
 
2.3%
s 2008
 
2.3%
a 1985
 
2.2%
Other values (287) 19047
21.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53169
60.1%
Lowercase Letter 20134
 
22.8%
Space Separator 11201
 
12.7%
Uppercase Letter 2945
 
3.3%
Other Punctuation 402
 
0.5%
Decimal Number 357
 
0.4%
Open Punctuation 54
 
0.1%
Close Punctuation 51
 
0.1%
Control 48
 
0.1%
Dash Punctuation 47
 
0.1%
Other values (6) 25
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8705
16.4%
8697
16.4%
8696
16.4%
8694
16.4%
8694
16.4%
8693
16.3%
167
 
0.3%
162
 
0.3%
51
 
0.1%
45
 
0.1%
Other values (202) 565
 
1.1%
Lowercase Letter
ValueCountFrequency (%)
e 2013
 
10.0%
s 2008
 
10.0%
a 1985
 
9.9%
t 1626
 
8.1%
o 1502
 
7.5%
i 1479
 
7.3%
n 1416
 
7.0%
d 1046
 
5.2%
l 1017
 
5.1%
h 974
 
4.8%
Other values (16) 5068
25.2%
Uppercase Letter
ValueCountFrequency (%)
E 654
22.2%
C 415
14.1%
S 174
 
5.9%
I 140
 
4.8%
T 132
 
4.5%
A 128
 
4.3%
O 126
 
4.3%
L 116
 
3.9%
F 114
 
3.9%
N 111
 
3.8%
Other values (16) 835
28.4%
Decimal Number
ValueCountFrequency (%)
1 102
28.6%
2 81
22.7%
0 59
16.5%
5 21
 
5.9%
4 20
 
5.6%
3 19
 
5.3%
8 16
 
4.5%
9 14
 
3.9%
6 14
 
3.9%
7 11
 
3.1%
Other Punctuation
ValueCountFrequency (%)
. 216
53.7%
; 63
 
15.7%
& 63
 
15.7%
, 36
 
9.0%
/ 12
 
3.0%
: 6
 
1.5%
· 5
 
1.2%
* 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 51
94.4%
[ 3
 
5.6%
Close Punctuation
ValueCountFrequency (%)
) 48
94.1%
] 3
 
5.9%
Final Punctuation
ValueCountFrequency (%)
7
63.6%
4
36.4%
Initial Punctuation
ValueCountFrequency (%)
5
55.6%
4
44.4%
Space Separator
ValueCountFrequency (%)
11201
100.0%
Control
ValueCountFrequency (%)
48
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 47
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Currency Symbol
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53171
60.1%
Latin 23079
26.1%
Common 12183
 
13.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8705
16.4%
8697
16.4%
8696
16.4%
8694
16.4%
8694
16.4%
8693
16.3%
167
 
0.3%
162
 
0.3%
51
 
0.1%
45
 
0.1%
Other values (203) 567
 
1.1%
Latin
ValueCountFrequency (%)
e 2013
 
8.7%
s 2008
 
8.7%
a 1985
 
8.6%
t 1626
 
7.0%
o 1502
 
6.5%
i 1479
 
6.4%
n 1416
 
6.1%
d 1046
 
4.5%
l 1017
 
4.4%
h 974
 
4.2%
Other values (42) 8013
34.7%
Common
ValueCountFrequency (%)
11201
91.9%
. 216
 
1.8%
1 102
 
0.8%
2 81
 
0.7%
; 63
 
0.5%
& 63
 
0.5%
0 59
 
0.5%
( 51
 
0.4%
48
 
0.4%
) 48
 
0.4%
Other values (22) 251
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53169
60.1%
ASCII 35236
39.8%
Punctuation 20
 
< 0.1%
None 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11201
31.8%
e 2013
 
5.7%
s 2008
 
5.7%
a 1985
 
5.6%
t 1626
 
4.6%
o 1502
 
4.3%
i 1479
 
4.2%
n 1416
 
4.0%
d 1046
 
3.0%
l 1017
 
2.9%
Other values (68) 9943
28.2%
Hangul
ValueCountFrequency (%)
8705
16.4%
8697
16.4%
8696
16.4%
8694
16.4%
8694
16.4%
8693
16.3%
167
 
0.3%
162
 
0.3%
51
 
0.1%
45
 
0.1%
Other values (202) 565
 
1.1%
Punctuation
ValueCountFrequency (%)
7
35.0%
5
25.0%
4
20.0%
4
20.0%
None
ValueCountFrequency (%)
· 5
62.5%
2
 
25.0%
1
 
12.5%
Distinct218
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:32:25.685725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length7
Mean length7.3921
Min length7

Characters and Unicode

Total characters73921
Distinct characters18
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique86 ?
Unique (%)0.9%

Sample

1st row데이터 미집계
2nd row데이터 미집계
3rd row데이터 미집계
4th row데이터 미집계
5th row데이터 미집계
ValueCountFrequency (%)
데이터 8693
46.5%
미집계 8693
46.5%
2018-12-17 500
 
2.7%
2017-12-08 37
 
0.2%
2017-11-28 22
 
0.1%
2017-11-21 21
 
0.1%
2017-12-11 20
 
0.1%
2017-11-22 19
 
0.1%
2017-11-20 19
 
0.1%
2018-01-23 17
 
0.1%
Other values (209) 652
 
3.5%
2023-12-13T08:32:26.099434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8693
11.8%
8693
11.8%
8693
11.8%
8693
11.8%
8693
11.8%
8693
11.8%
8693
11.8%
1 3451
 
4.7%
- 2614
 
3.5%
2 2486
 
3.4%
Other values (8) 4519
6.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52158
70.6%
Decimal Number 10456
 
14.1%
Space Separator 8693
 
11.8%
Dash Punctuation 2614
 
3.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 3451
33.0%
2 2486
23.8%
0 2014
19.3%
8 1113
 
10.6%
7 953
 
9.1%
9 124
 
1.2%
3 111
 
1.1%
4 84
 
0.8%
6 73
 
0.7%
5 47
 
0.4%
Other Letter
ValueCountFrequency (%)
8693
16.7%
8693
16.7%
8693
16.7%
8693
16.7%
8693
16.7%
8693
16.7%
Space Separator
ValueCountFrequency (%)
8693
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2614
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 52158
70.6%
Common 21763
29.4%

Most frequent character per script

Common
ValueCountFrequency (%)
8693
39.9%
1 3451
 
15.9%
- 2614
 
12.0%
2 2486
 
11.4%
0 2014
 
9.3%
8 1113
 
5.1%
7 953
 
4.4%
9 124
 
0.6%
3 111
 
0.5%
4 84
 
0.4%
Other values (2) 120
 
0.6%
Hangul
ValueCountFrequency (%)
8693
16.7%
8693
16.7%
8693
16.7%
8693
16.7%
8693
16.7%
8693
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 52158
70.6%
ASCII 21763
29.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8693
16.7%
8693
16.7%
8693
16.7%
8693
16.7%
8693
16.7%
8693
16.7%
ASCII
ValueCountFrequency (%)
8693
39.9%
1 3451
 
15.9%
- 2614
 
12.0%
2 2486
 
11.4%
0 2014
 
9.3%
8 1113
 
5.1%
7 953
 
4.4%
9 124
 
0.6%
3 111
 
0.5%
4 84
 
0.4%
Other values (2) 120
 
0.6%

Sample

업체 일련번호업체국제우편번호업체 국가(영문)업체주소_도시명(영문)업체명(영문)업체주소_시군구(영문)업체_기본주소(국문)업체_사업구분업체연매출액업체_지역구분(국문)수출액공장주소영문홈페이지(URL)자가공장여부국제인증특이사항주생산품기업정보개방여부(API)등급코드마일리지연혁내용등록일
23243CP2018031456990데이터 미집계KR경기도Jun**데이터 미집계경기도 광명시법인데이터 미집계경기도0데이터 미집계www.juncs.co.kr데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계PLATINUM840데이터 미집계데이터 미집계
36345CP2018031473203데이터 미집계KR경기도con******데이터 미집계개인데이터 미집계경기도100000000데이터 미집계데이터 미집계Y데이터 미집계데이터 미집계데이터 미집계데이터 미집계MEMBER150데이터 미집계데이터 미집계
33704CP2018031468557데이터 미집계KR경기도EHW**************데이터 미집계경기 시흥시개인데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계PLATINUM850데이터 미집계데이터 미집계
28207CP2018031461652데이터 미집계KR강원도HAN************데이터 미집계강원도 춘천시개인데이터 미집계강원도0데이터 미집계boatman.co.kr데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계PLATINUM850데이터 미집계데이터 미집계
18246CP2018031485564데이터 미집계KR경기도Kor******************데이터 미집계법인데이터 미집계경기도136데이터 미집계데이터 미집계N데이터 미집계남미, 인도네시아, 미국 등으로 시장확대 예정데이터 미집계데이터 미집계MEMBER150데이터 미집계데이터 미집계
1442CP20171030083616690KRSuwon-siCre****************Gyeonggi-do경기 수원시법인데이터 미집계경기도13382F Hanrim Postec, 59, Omokcheon-ro 152beon-gil, Gwonseon-gu, Suwon-si, Gyonggi-do, Koreawww.cremotech.co.krYKC,FCC,CE,HDMI,WiFi,CCC,Eye-safety, 블루투스,미국 KDC와 1,000만불 수출계약체결(2016.09)빔프로젝터NMEMBER145Cremotech Established2017-12-11
18657CP2018031479592데이터 미집계KR경상남도Kiy***********데이터 미집계경남 진주시법인데이터 미집계경상남도데이터 미집계데이터 미집계www.kiyum.co.kr/데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계GOLD350데이터 미집계데이터 미집계
23076CP2018031456937데이터 미집계KR경기도SEM*******************데이터 미집계부산 서구법인데이터 미집계경기도0데이터 미집계www.searte.co.kr데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계GOLD650데이터 미집계데이터 미집계
28078CP2018031461984데이터 미집계KR경기도Bri***************데이터 미집계경기 성남시법인데이터 미집계경기도0데이터 미집계www.bridgeitc.com데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계GOLD600데이터 미집계데이터 미집계
5906CP201710303781데이터 미집계KR서울특별시데이터 미집계데이터 미집계서울특별시 성북구법인데이터 미집계서울시50데이터 미집계www.jinwoobio.comN데이터 미집계데이터 미집계데이터 미집계데이터 미집계MEMBER220데이터 미집계데이터 미집계
업체 일련번호업체국제우편번호업체 국가(영문)업체주소_도시명(영문)업체명(영문)업체주소_시군구(영문)업체_기본주소(국문)업체_사업구분업체연매출액업체_지역구분(국문)수출액공장주소영문홈페이지(URL)자가공장여부국제인증특이사항주생산품기업정보개방여부(API)등급코드마일리지연혁내용등록일
24934CP2018031459394데이터 미집계KR경기도seg***************데이터 미집계경기도 안산시법인데이터 미집계경기도데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계GOLD500데이터 미집계데이터 미집계
21342CP2018031454785데이터 미집계KR경기도WON*****************************데이터 미집계경기 의정부시법인데이터 미집계경기도0데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계PLATINUM850데이터 미집계데이터 미집계
34081CP2018031470476데이터 미집계KR인천광역시GAP****************데이터 미집계인천 남구개인데이터 미집계인천시데이터 미집계데이터 미집계www.kbit.co.kr데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계GOLD700데이터 미집계데이터 미집계
23302CP2018031456994데이터 미집계KR경기도OTI*************************데이터 미집계경기도 성남시법인데이터 미집계경기도0데이터 미집계oticom.koreasme.com데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계GOLD700데이터 미집계데이터 미집계
26443CP2018031460033데이터 미집계KR경기도Asi***************데이터 미집계법인데이터 미집계경기도0데이터 미집계데이터 미집계Y데이터 미집계데이터 미집계데이터 미집계데이터 미집계MEMBER100데이터 미집계데이터 미집계
15502CP2018031481785데이터 미집계KR경기도Eco***********데이터 미집계경기도 광명시개인데이터 미집계경기도데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계GOLD350데이터 미집계데이터 미집계
16532CP201803148372156181KRJeongeup-siMIJ*****************Jeollabuk-do전북 정읍시개인데이터 미집계전라북도30데이터 미집계www.mjint.krNiso데이터 미집계데이터 미집계데이터 미집계MEMBER340데이터 미집계데이터 미집계
38864CP2018031474295데이터 미집계KR서울특별시Hub*********************데이터 미집계Unit 1519법인데이터 미집계서울시5데이터 미집계www.eagleeye.kr데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계MEMBER300데이터 미집계데이터 미집계
12924CP20180314858113328KREunpyeong-guNTT**************Seoul서울 은평구법인데이터 미집계서울시1000B1, NTTWORKS Bldg, 5-10, Tongil-ro 89-gil, Eunpyeong-gu, Seoul 122-810, Rep. of KOREAwww.syscallglobal.com데이터 미집계데이터 미집계데이터 미집계데이터 미집계데이터 미집계GOLD415데이터 미집계데이터 미집계
16853CP2018031483148데이터 미집계KR경기도mem*데이터 미집계법인데이터 미집계경기도0데이터 미집계데이터 미집계N데이터 미집계데이터 미집계데이터 미집계데이터 미집계MEMBER150데이터 미집계데이터 미집계