Overview

Dataset statistics

Number of variables8
Number of observations659
Missing cells600
Missing cells (%)11.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory41.3 KiB
Average record size in memory64.2 B

Variable types

Text6
Categorical2

Dataset

Description충청남도 금산군의 제조업체의 관한 사항으로 회사명, 대표명, 회사 주소, 회사 번호, 팩스 번호 등 의 자료를 포함하고 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=395&beforeMenuCd=DOM_000000201001001000&publicdatapk=15034906

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 182 (27.6%) missing valuesMissing
팩스번호 has 418 (63.4%) missing valuesMissing

Reproduction

Analysis started2024-01-09 22:58:14.126852
Analysis finished2024-01-09 22:58:15.115459
Duration0.99 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct648
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size5.3 KiB
2024-01-10T07:58:15.317556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length7.3793627
Min length2

Characters and Unicode

Total characters4863
Distinct characters396
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique637 ?
Unique (%)96.7%

Sample

1st row(주)BDC
2nd row(주)BDC 제2공장
3rd row(주)EG
4th row(주)가나텍
5th row(주)갈산
ValueCountFrequency (%)
농업회사법인 19
 
2.6%
제2공장 13
 
1.8%
영농조합법인 5
 
0.7%
금산공장 4
 
0.6%
제1공장 3
 
0.4%
중부대학교 3
 
0.4%
주)휴온스네이처 3
 
0.4%
2공장 3
 
0.4%
지점 2
 
0.3%
주)믿음의나무 2
 
0.3%
Other values (645) 670
92.2%
2024-01-10T07:58:15.711286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
396
 
8.1%
( 389
 
8.0%
) 389
 
8.0%
150
 
3.1%
138
 
2.8%
98
 
2.0%
92
 
1.9%
82
 
1.7%
68
 
1.4%
68
 
1.4%
Other values (386) 2993
61.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3908
80.4%
Open Punctuation 389
 
8.0%
Close Punctuation 389
 
8.0%
Space Separator 68
 
1.4%
Uppercase Letter 40
 
0.8%
Decimal Number 36
 
0.7%
Other Symbol 22
 
0.5%
Other Punctuation 4
 
0.1%
Lowercase Letter 4
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
396
 
10.1%
150
 
3.8%
138
 
3.5%
98
 
2.5%
92
 
2.4%
82
 
2.1%
68
 
1.7%
68
 
1.7%
67
 
1.7%
67
 
1.7%
Other values (354) 2682
68.6%
Uppercase Letter
ValueCountFrequency (%)
E 7
17.5%
D 5
12.5%
C 4
10.0%
I 4
10.0%
N 4
10.0%
P 3
7.5%
G 3
7.5%
B 3
7.5%
H 2
 
5.0%
A 1
 
2.5%
Other values (4) 4
10.0%
Decimal Number
ValueCountFrequency (%)
2 22
61.1%
1 8
 
22.2%
7 2
 
5.6%
8 2
 
5.6%
6 1
 
2.8%
3 1
 
2.8%
Lowercase Letter
ValueCountFrequency (%)
c 1
25.0%
t 1
25.0%
h 1
25.0%
e 1
25.0%
Other Punctuation
ValueCountFrequency (%)
& 2
50.0%
: 2
50.0%
Open Punctuation
ValueCountFrequency (%)
( 389
100.0%
Close Punctuation
ValueCountFrequency (%)
) 389
100.0%
Space Separator
ValueCountFrequency (%)
68
100.0%
Other Symbol
ValueCountFrequency (%)
22
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3930
80.8%
Common 889
 
18.3%
Latin 44
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
396
 
10.1%
150
 
3.8%
138
 
3.5%
98
 
2.5%
92
 
2.3%
82
 
2.1%
68
 
1.7%
68
 
1.7%
67
 
1.7%
67
 
1.7%
Other values (355) 2704
68.8%
Latin
ValueCountFrequency (%)
E 7
15.9%
D 5
11.4%
C 4
9.1%
I 4
9.1%
N 4
9.1%
P 3
 
6.8%
G 3
 
6.8%
B 3
 
6.8%
H 2
 
4.5%
A 1
 
2.3%
Other values (8) 8
18.2%
Common
ValueCountFrequency (%)
( 389
43.8%
) 389
43.8%
68
 
7.6%
2 22
 
2.5%
1 8
 
0.9%
- 2
 
0.2%
7 2
 
0.2%
8 2
 
0.2%
& 2
 
0.2%
: 2
 
0.2%
Other values (3) 3
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3908
80.4%
ASCII 933
 
19.2%
None 22
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
396
 
10.1%
150
 
3.8%
138
 
3.5%
98
 
2.5%
92
 
2.4%
82
 
2.1%
68
 
1.7%
68
 
1.7%
67
 
1.7%
67
 
1.7%
Other values (354) 2682
68.6%
ASCII
ValueCountFrequency (%)
( 389
41.7%
) 389
41.7%
68
 
7.3%
2 22
 
2.4%
1 8
 
0.9%
E 7
 
0.8%
D 5
 
0.5%
C 4
 
0.4%
I 4
 
0.4%
N 4
 
0.4%
Other values (21) 33
 
3.5%
None
ValueCountFrequency (%)
22
100.0%

주소
Text

Distinct618
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Memory size5.3 KiB
2024-01-10T07:58:16.011318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length46
Mean length26.370258
Min length8

Characters and Unicode

Total characters17378
Distinct characters289
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique583 ?
Unique (%)88.5%

Sample

1st row충청남도 금산군 추부면 신평공단1로 62
2nd row충청남도 금산군 추부면 신평공단1로 85
3rd row충청남도 금산군 추부면 서대산로 459 (㈜EG)
4th row충청남도 금산군 복수면 다복로 537-18
5th row충청남도 금산군 복수면 복수공단길 37, (용진리 115-18) (풍국타올)
ValueCountFrequency (%)
금산군 653
 
16.7%
충청남도 652
 
16.7%
추부면 264
 
6.7%
복수면 155
 
4.0%
115
 
2.9%
금성면 84
 
2.1%
1필지 64
 
1.6%
군북면 58
 
1.5%
다복로 49
 
1.3%
추풍로 42
 
1.1%
Other values (873) 1777
45.4%
2024-01-10T07:58:16.740686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3255
18.7%
842
 
4.8%
796
 
4.6%
739
 
4.3%
677
 
3.9%
657
 
3.8%
655
 
3.8%
653
 
3.8%
643
 
3.7%
1 562
 
3.2%
Other values (279) 7899
45.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10475
60.3%
Space Separator 3255
 
18.7%
Decimal Number 2627
 
15.1%
Open Punctuation 283
 
1.6%
Close Punctuation 283
 
1.6%
Dash Punctuation 271
 
1.6%
Other Punctuation 129
 
0.7%
Uppercase Letter 35
 
0.2%
Other Symbol 20
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
842
 
8.0%
796
 
7.6%
739
 
7.1%
677
 
6.5%
657
 
6.3%
655
 
6.3%
653
 
6.2%
643
 
6.1%
351
 
3.4%
325
 
3.1%
Other values (248) 4137
39.5%
Uppercase Letter
ValueCountFrequency (%)
C 5
14.3%
E 5
14.3%
D 3
8.6%
R 3
8.6%
M 3
8.6%
H 3
8.6%
J 2
 
5.7%
S 2
 
5.7%
A 2
 
5.7%
T 2
 
5.7%
Other values (4) 5
14.3%
Decimal Number
ValueCountFrequency (%)
1 562
21.4%
2 356
13.6%
3 282
10.7%
4 261
9.9%
5 240
9.1%
6 216
 
8.2%
0 195
 
7.4%
7 187
 
7.1%
8 175
 
6.7%
9 153
 
5.8%
Other Punctuation
ValueCountFrequency (%)
, 128
99.2%
& 1
 
0.8%
Space Separator
ValueCountFrequency (%)
3255
100.0%
Open Punctuation
ValueCountFrequency (%)
( 283
100.0%
Close Punctuation
ValueCountFrequency (%)
) 283
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 271
100.0%
Other Symbol
ValueCountFrequency (%)
20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10495
60.4%
Common 6848
39.4%
Latin 35
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
842
 
8.0%
796
 
7.6%
739
 
7.0%
677
 
6.5%
657
 
6.3%
655
 
6.2%
653
 
6.2%
643
 
6.1%
351
 
3.3%
325
 
3.1%
Other values (249) 4157
39.6%
Common
ValueCountFrequency (%)
3255
47.5%
1 562
 
8.2%
2 356
 
5.2%
( 283
 
4.1%
) 283
 
4.1%
3 282
 
4.1%
- 271
 
4.0%
4 261
 
3.8%
5 240
 
3.5%
6 216
 
3.2%
Other values (6) 839
 
12.3%
Latin
ValueCountFrequency (%)
C 5
14.3%
E 5
14.3%
D 3
8.6%
R 3
8.6%
M 3
8.6%
H 3
8.6%
J 2
 
5.7%
S 2
 
5.7%
A 2
 
5.7%
T 2
 
5.7%
Other values (4) 5
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10475
60.3%
ASCII 6883
39.6%
None 20
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3255
47.3%
1 562
 
8.2%
2 356
 
5.2%
( 283
 
4.1%
) 283
 
4.1%
3 282
 
4.1%
- 271
 
3.9%
4 261
 
3.8%
5 240
 
3.5%
6 216
 
3.1%
Other values (20) 874
 
12.7%
Hangul
ValueCountFrequency (%)
842
 
8.0%
796
 
7.6%
739
 
7.1%
677
 
6.5%
657
 
6.3%
655
 
6.3%
653
 
6.2%
643
 
6.1%
351
 
3.4%
325
 
3.1%
Other values (248) 4137
39.5%
None
ValueCountFrequency (%)
20
100.0%

전화번호
Text

MISSING 

Distinct448
Distinct (%)93.9%
Missing182
Missing (%)27.6%
Memory size5.3 KiB
2024-01-10T07:58:16.996471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.077568
Min length9

Characters and Unicode

Total characters5761
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique421 ?
Unique (%)88.3%

Sample

1st row041-751-6262
2nd row041-751-6262
3rd row041-0750-7777
4th row041-752-1197
5th row070-7602-7895
ValueCountFrequency (%)
041-753-7141 3
 
0.6%
041-753-6981 3
 
0.6%
041-751-6111 2
 
0.4%
041-752-3243 2
 
0.4%
041-753-4291 2
 
0.4%
041-752-9992 2
 
0.4%
041-752-5583 2
 
0.4%
041-752-9945 2
 
0.4%
041-754-1551 2
 
0.4%
041-752-5304 2
 
0.4%
Other values (438) 455
95.4%
2024-01-10T07:58:17.423543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 950
16.5%
0 825
14.3%
1 775
13.5%
4 731
12.7%
5 626
10.9%
7 612
10.6%
3 349
 
6.1%
2 334
 
5.8%
8 212
 
3.7%
6 188
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4811
83.5%
Dash Punctuation 950
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 825
17.1%
1 775
16.1%
4 731
15.2%
5 626
13.0%
7 612
12.7%
3 349
7.3%
2 334
6.9%
8 212
 
4.4%
6 188
 
3.9%
9 159
 
3.3%
Dash Punctuation
ValueCountFrequency (%)
- 950
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5761
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 950
16.5%
0 825
14.3%
1 775
13.5%
4 731
12.7%
5 626
10.9%
7 612
10.6%
3 349
 
6.1%
2 334
 
5.8%
8 212
 
3.7%
6 188
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5761
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 950
16.5%
0 825
14.3%
1 775
13.5%
4 731
12.7%
5 626
10.9%
7 612
10.6%
3 349
 
6.1%
2 334
 
5.8%
8 212
 
3.7%
6 188
 
3.3%

팩스번호
Text

MISSING 

Distinct232
Distinct (%)96.3%
Missing418
Missing (%)63.4%
Memory size5.3 KiB
2024-01-10T07:58:17.652773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.161826
Min length12

Characters and Unicode

Total characters2931
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique224 ?
Unique (%)92.9%

Sample

1st row041-751-6265
2nd row041-750-7749
3rd row041-752-1173
4th row041-753-5421
5th row041-752-9988
ValueCountFrequency (%)
041-753-7145 3
 
1.2%
041-751-2508 2
 
0.8%
041-751-8636 2
 
0.8%
041-753-0047 2
 
0.8%
041-753-5304 2
 
0.8%
041-753-8907 2
 
0.8%
041-753-6983 2
 
0.8%
041-752-6863 2
 
0.8%
041-752-8174 1
 
0.4%
041-0753-9607 1
 
0.4%
Other values (222) 222
92.1%
2024-01-10T07:58:17.992257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 482
16.4%
0 388
13.2%
4 368
12.6%
1 361
12.3%
5 323
11.0%
7 305
10.4%
3 185
 
6.3%
2 176
 
6.0%
8 130
 
4.4%
6 108
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2449
83.6%
Dash Punctuation 482
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 388
15.8%
4 368
15.0%
1 361
14.7%
5 323
13.2%
7 305
12.5%
3 185
7.6%
2 176
7.2%
8 130
 
5.3%
6 108
 
4.4%
9 105
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 482
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2931
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 482
16.4%
0 388
13.2%
4 368
12.6%
1 361
12.3%
5 323
11.0%
7 305
10.4%
3 185
 
6.3%
2 176
 
6.0%
8 130
 
4.4%
6 108
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2931
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 482
16.4%
0 388
13.2%
4 368
12.6%
1 361
12.3%
5 323
11.0%
7 305
10.4%
3 185
 
6.3%
2 176
 
6.0%
8 130
 
4.4%
6 108
 
3.7%

업종
Categorical

Distinct10
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size5.3 KiB
음식료
180 
기계
151 
석유화학
116 
전기+전자
56 
목재+종이+출판
44 
Other values (5)
112 

Length

Max length8
Median length5
Mean length3.522003
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기계
2nd row섬유의복
3rd row섬유의복
4th row기계
5th row철강

Common Values

ValueCountFrequency (%)
음식료 180
27.3%
기계 151
22.9%
석유화학 116
17.6%
전기+전자 56
 
8.5%
목재+종이+출판 44
 
6.7%
기타 35
 
5.3%
비금속소재 29
 
4.4%
섬유의복 24
 
3.6%
철강 12
 
1.8%
운송장비 12
 
1.8%

Length

2024-01-10T07:58:18.126145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:58:18.235816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
음식료 180
27.3%
기계 151
22.9%
석유화학 116
17.6%
전기+전자 56
 
8.5%
목재+종이+출판 44
 
6.7%
기타 35
 
5.3%
비금속소재 29
 
4.4%
섬유의복 24
 
3.6%
철강 12
 
1.8%
운송장비 12
 
1.8%
Distinct580
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Memory size5.3 KiB
2024-01-10T07:58:18.506872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length35
Mean length8.6358118
Min length1

Characters and Unicode

Total characters5691
Distinct characters490
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique534 ?
Unique (%)81.0%

Sample

1st row강관, 방송장비, 조명장치 등
2nd row무선통신용강관주
3rd row산화철
4th row실험실 기자재
5th row우레탄 수지매트
ValueCountFrequency (%)
홍삼액 32
 
2.7%
26
 
2.2%
홍삼 14
 
1.2%
14
 
1.2%
플라스틱 12
 
1.0%
홍삼정과 12
 
1.0%
철구조물 10
 
0.8%
태양광 7
 
0.6%
알루미늄 7
 
0.6%
건축용 7
 
0.6%
Other values (844) 1056
88.2%
2024-01-10T07:58:18.912601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
539
 
9.5%
, 340
 
6.0%
158
 
2.8%
130
 
2.3%
116
 
2.0%
104
 
1.8%
98
 
1.7%
85
 
1.5%
84
 
1.5%
72
 
1.3%
Other values (480) 3965
69.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4605
80.9%
Space Separator 539
 
9.5%
Other Punctuation 357
 
6.3%
Uppercase Letter 99
 
1.7%
Open Punctuation 36
 
0.6%
Close Punctuation 36
 
0.6%
Lowercase Letter 8
 
0.1%
Decimal Number 6
 
0.1%
Dash Punctuation 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
158
 
3.4%
130
 
2.8%
116
 
2.5%
104
 
2.3%
98
 
2.1%
85
 
1.8%
84
 
1.8%
72
 
1.6%
72
 
1.6%
71
 
1.5%
Other values (446) 3615
78.5%
Uppercase Letter
ValueCountFrequency (%)
C 18
18.2%
P 15
15.2%
E 14
14.1%
V 9
9.1%
L 8
8.1%
T 8
8.1%
H 6
 
6.1%
D 5
 
5.1%
M 3
 
3.0%
N 2
 
2.0%
Other values (8) 11
11.1%
Lowercase Letter
ValueCountFrequency (%)
d 2
25.0%
e 1
12.5%
r 1
12.5%
a 1
12.5%
c 1
12.5%
o 1
12.5%
l 1
12.5%
Decimal Number
ValueCountFrequency (%)
2 3
50.0%
4 2
33.3%
1 1
 
16.7%
Other Punctuation
ValueCountFrequency (%)
, 340
95.2%
. 17
 
4.8%
Space Separator
ValueCountFrequency (%)
539
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4605
80.9%
Common 979
 
17.2%
Latin 107
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
158
 
3.4%
130
 
2.8%
116
 
2.5%
104
 
2.3%
98
 
2.1%
85
 
1.8%
84
 
1.8%
72
 
1.6%
72
 
1.6%
71
 
1.5%
Other values (446) 3615
78.5%
Latin
ValueCountFrequency (%)
C 18
16.8%
P 15
14.0%
E 14
13.1%
V 9
8.4%
L 8
7.5%
T 8
7.5%
H 6
 
5.6%
D 5
 
4.7%
M 3
 
2.8%
N 2
 
1.9%
Other values (15) 19
17.8%
Common
ValueCountFrequency (%)
539
55.1%
, 340
34.7%
( 36
 
3.7%
) 36
 
3.7%
. 17
 
1.7%
- 5
 
0.5%
2 3
 
0.3%
4 2
 
0.2%
1 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4605
80.9%
ASCII 1086
 
19.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
539
49.6%
, 340
31.3%
( 36
 
3.3%
) 36
 
3.3%
C 18
 
1.7%
. 17
 
1.6%
P 15
 
1.4%
E 14
 
1.3%
V 9
 
0.8%
L 8
 
0.7%
Other values (24) 54
 
5.0%
Hangul
ValueCountFrequency (%)
158
 
3.4%
130
 
2.8%
116
 
2.5%
104
 
2.3%
98
 
2.1%
85
 
1.8%
84
 
1.8%
72
 
1.6%
72
 
1.6%
71
 
1.5%
Other values (446) 3615
78.5%
Distinct648
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size5.3 KiB
2024-01-10T07:58:19.187185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length7.3793627
Min length2

Characters and Unicode

Total characters4863
Distinct characters396
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique637 ?
Unique (%)96.7%

Sample

1st row(주)BDC
2nd row(주)BDC 제2공장
3rd row(주)EG
4th row(주)가나텍
5th row(주)갈산
ValueCountFrequency (%)
농업회사법인 19
 
2.6%
제2공장 13
 
1.8%
영농조합법인 5
 
0.7%
금산공장 4
 
0.6%
제1공장 3
 
0.4%
중부대학교 3
 
0.4%
주)휴온스네이처 3
 
0.4%
2공장 3
 
0.4%
지점 2
 
0.3%
주)믿음의나무 2
 
0.3%
Other values (645) 670
92.2%
2024-01-10T07:58:19.593563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
396
 
8.1%
( 389
 
8.0%
) 389
 
8.0%
150
 
3.1%
138
 
2.8%
98
 
2.0%
92
 
1.9%
82
 
1.7%
68
 
1.4%
68
 
1.4%
Other values (386) 2993
61.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3908
80.4%
Open Punctuation 389
 
8.0%
Close Punctuation 389
 
8.0%
Space Separator 68
 
1.4%
Uppercase Letter 40
 
0.8%
Decimal Number 36
 
0.7%
Other Symbol 22
 
0.5%
Other Punctuation 4
 
0.1%
Lowercase Letter 4
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
396
 
10.1%
150
 
3.8%
138
 
3.5%
98
 
2.5%
92
 
2.4%
82
 
2.1%
68
 
1.7%
68
 
1.7%
67
 
1.7%
67
 
1.7%
Other values (354) 2682
68.6%
Uppercase Letter
ValueCountFrequency (%)
E 7
17.5%
D 5
12.5%
C 4
10.0%
I 4
10.0%
N 4
10.0%
P 3
7.5%
G 3
7.5%
B 3
7.5%
H 2
 
5.0%
A 1
 
2.5%
Other values (4) 4
10.0%
Decimal Number
ValueCountFrequency (%)
2 22
61.1%
1 8
 
22.2%
7 2
 
5.6%
8 2
 
5.6%
6 1
 
2.8%
3 1
 
2.8%
Lowercase Letter
ValueCountFrequency (%)
c 1
25.0%
t 1
25.0%
h 1
25.0%
e 1
25.0%
Other Punctuation
ValueCountFrequency (%)
& 2
50.0%
: 2
50.0%
Open Punctuation
ValueCountFrequency (%)
( 389
100.0%
Close Punctuation
ValueCountFrequency (%)
) 389
100.0%
Space Separator
ValueCountFrequency (%)
68
100.0%
Other Symbol
ValueCountFrequency (%)
22
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3930
80.8%
Common 889
 
18.3%
Latin 44
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
396
 
10.1%
150
 
3.8%
138
 
3.5%
98
 
2.5%
92
 
2.3%
82
 
2.1%
68
 
1.7%
68
 
1.7%
67
 
1.7%
67
 
1.7%
Other values (355) 2704
68.8%
Latin
ValueCountFrequency (%)
E 7
15.9%
D 5
11.4%
C 4
9.1%
I 4
9.1%
N 4
9.1%
P 3
 
6.8%
G 3
 
6.8%
B 3
 
6.8%
H 2
 
4.5%
A 1
 
2.3%
Other values (8) 8
18.2%
Common
ValueCountFrequency (%)
( 389
43.8%
) 389
43.8%
68
 
7.6%
2 22
 
2.5%
1 8
 
0.9%
- 2
 
0.2%
7 2
 
0.2%
8 2
 
0.2%
& 2
 
0.2%
: 2
 
0.2%
Other values (3) 3
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3908
80.4%
ASCII 933
 
19.2%
None 22
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
396
 
10.1%
150
 
3.8%
138
 
3.5%
98
 
2.5%
92
 
2.4%
82
 
2.1%
68
 
1.7%
68
 
1.7%
67
 
1.7%
67
 
1.7%
Other values (354) 2682
68.6%
ASCII
ValueCountFrequency (%)
( 389
41.7%
) 389
41.7%
68
 
7.3%
2 22
 
2.4%
1 8
 
0.9%
E 7
 
0.8%
D 5
 
0.5%
C 4
 
0.4%
I 4
 
0.4%
N 4
 
0.4%
Other values (21) 33
 
3.5%
None
ValueCountFrequency (%)
22
100.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size5.3 KiB
2022-02-04
659 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-02-04
2nd row2022-02-04
3rd row2022-02-04
4th row2022-02-04
5th row2022-02-04

Common Values

ValueCountFrequency (%)
2022-02-04 659
100.0%

Length

2024-01-10T07:58:19.718153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:58:19.801093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-02-04 659
100.0%

Missing values

2024-01-10T07:58:14.870548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:58:14.989327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T07:58:15.073448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

회사명주소전화번호팩스번호업종생산품대표명데이터기준일자
0(주)BDC충청남도 금산군 추부면 신평공단1로 62041-751-6262041-751-6265기계강관, 방송장비, 조명장치 등(주)BDC2022-02-04
1(주)BDC 제2공장충청남도 금산군 추부면 신평공단1로 85041-751-6262<NA>섬유의복무선통신용강관주(주)BDC 제2공장2022-02-04
2(주)EG충청남도 금산군 추부면 서대산로 459 (㈜EG)041-0750-7777041-750-7749섬유의복산화철(주)EG2022-02-04
3(주)가나텍충청남도 금산군 복수면 다복로 537-18041-752-1197041-752-1173기계실험실 기자재(주)가나텍2022-02-04
4(주)갈산충청남도 금산군 복수면 복수공단길 37, (용진리 115-18) (풍국타올)070-7602-7895<NA>철강우레탄 수지매트(주)갈산2022-02-04
5(주)건양전력충청남도 금산군 추부면 자부리 165-1번지<NA><NA>석유화학배전(분전)반 케비넷, 태양광 발전장치(주)건양전력2022-02-04
6(주)고구려식품충청남도 금산군 추부면 서대산로 180041-753-6311<NA>음식료돈육(주)고구려식품2022-02-04
7(주)광성화학충청남도 금산군 추부면 미삭길 20 외 1필지041-752-6000<NA>음식료중질탄산칼슘(주)광성화학2022-02-04
8(주)광진산업충청남도 금산군 추부면 추풍로 146041-754-1305<NA>기계냅킨, 화장지(주)광진산업2022-02-04
9(주)광진포장충청남도 금산군 복수면 다복동길 4041-752-9858<NA>석유화학골판지박스(주)광진포장2022-02-04
회사명주소전화번호팩스번호업종생산품대표명데이터기준일자
649현창 I&D충청남도 금산군 복수면 복수공단길 64, (용진리 115-31) ((주)우리금풍)041-753-1310<NA>목재+종이+출판인조대리석현창 I&D2022-02-04
650호진산업충청남도 금산군 복수면 다복리 217번지<NA><NA>목재+종이+출판철재 및 철근가공품호진산업2022-02-04
651홍도고려홍삼공사충청남도 금산군 남일면 홍도1길 83<NA><NA>비금속소재홍삼액, 홍삼절편홍도고려홍삼공사2022-02-04
652홍원바이오아그로충청남도 금산군 추부면 추풍로 344 (유원물류창고)<NA><NA>음식료동물사료및조제식품홍원바이오아그로2022-02-04
653효성산업충청남도 금산군 복수면 구례리 332-4번지 외 2필지<NA><NA>전기+전자PE 비닐효성산업2022-02-04
654효성하나로(주)충청남도 금산군 추부면 동당말길 13041-752-9955<NA>석유화학프라이팬효성하나로(주)2022-02-04
655효원엔지니어링충청남도 금산군 추부면 다복로 721-5, (마전리 784번지)042-670-6900<NA>전기+전자가공공작기계효원엔지니어링2022-02-04
656후드코리아충청남도 금산군 복수면 학평길 23-5042-0222-4551<NA>석유화학한약제엑기스, 배.사과과즙후드코리아2022-02-04
657흥농산업충청남도 금산군 금성면 상가리 252 번지041-753-8906041-753-8907기계차광망, 각종 끈 가공품흥농산업2022-02-04
658힐링파머스(주)충청남도 금산군 군북면 군북로 892061-755-9495061-755-9496철강미생물배양기 진탕배양기 및 발효기힐링파머스(주)2022-02-04