Overview

Dataset statistics

Number of variables7
Number of observations853
Missing cells207
Missing cells (%)3.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory46.8 KiB
Average record size in memory56.2 B

Variable types

Text6
Categorical1

Dataset

Description충청남도 당진시 관내에 입주한 공장등록 현황입니다(연번, 회사명, 전화번호,팩스번호, 생산품, 공장대표주소)
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=432&beforeMenuCd=DOM_000000201001001000&publicdatapk=15052053

Alerts

단지명 is highly imbalanced (55.4%)Imbalance
전화번호 has 78 (9.1%) missing valuesMissing
팩스번호 has 129 (15.1%) missing valuesMissing

Reproduction

Analysis started2024-01-09 22:37:25.761883
Analysis finished2024-01-09 22:37:26.642409
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct818
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2024-01-10T07:37:26.779434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length7.6951934
Min length2

Characters and Unicode

Total characters6564
Distinct characters402
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique787 ?
Unique (%)92.3%

Sample

1st row(유)고려소재 신평공장
2nd row(유)제이씨테크
3rd row(주)SIMPAC 당진공장
4th row(주)가온기업
5th row(주)강원엔티에스
ValueCountFrequency (%)
주식회사 25
 
2.7%
당진공장 12
 
1.3%
현대제철(주 4
 
0.4%
주)에스앤씨산업 4
 
0.4%
농업회사법인 3
 
0.3%
영진철강(주 3
 
0.3%
당진2공장 3
 
0.3%
주)기린산업 3
 
0.3%
주)해나루싱싱닭 3
 
0.3%
주)서연오토비전 3
 
0.3%
Other values (826) 862
93.2%
2024-01-10T07:37:27.106166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
698
 
10.6%
( 662
 
10.1%
) 662
 
10.1%
175
 
2.7%
156
 
2.4%
149
 
2.3%
115
 
1.8%
112
 
1.7%
103
 
1.6%
94
 
1.4%
Other values (392) 3638
55.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5111
77.9%
Open Punctuation 662
 
10.1%
Close Punctuation 662
 
10.1%
Space Separator 73
 
1.1%
Uppercase Letter 28
 
0.4%
Decimal Number 20
 
0.3%
Other Punctuation 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
698
 
13.7%
175
 
3.4%
156
 
3.1%
149
 
2.9%
115
 
2.3%
112
 
2.2%
103
 
2.0%
94
 
1.8%
76
 
1.5%
74
 
1.4%
Other values (367) 3359
65.7%
Uppercase Letter
ValueCountFrequency (%)
M 4
14.3%
A 4
14.3%
N 4
14.3%
G 2
7.1%
S 2
7.1%
B 2
7.1%
C 2
7.1%
P 2
7.1%
I 1
 
3.6%
J 1
 
3.6%
Other values (4) 4
14.3%
Decimal Number
ValueCountFrequency (%)
2 7
35.0%
1 6
30.0%
3 4
20.0%
5 1
 
5.0%
6 1
 
5.0%
4 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
. 7
87.5%
& 1
 
12.5%
Open Punctuation
ValueCountFrequency (%)
( 662
100.0%
Close Punctuation
ValueCountFrequency (%)
) 662
100.0%
Space Separator
ValueCountFrequency (%)
73
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5111
77.9%
Common 1425
 
21.7%
Latin 28
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
698
 
13.7%
175
 
3.4%
156
 
3.1%
149
 
2.9%
115
 
2.3%
112
 
2.2%
103
 
2.0%
94
 
1.8%
76
 
1.5%
74
 
1.4%
Other values (367) 3359
65.7%
Latin
ValueCountFrequency (%)
M 4
14.3%
A 4
14.3%
N 4
14.3%
G 2
7.1%
S 2
7.1%
B 2
7.1%
C 2
7.1%
P 2
7.1%
I 1
 
3.6%
J 1
 
3.6%
Other values (4) 4
14.3%
Common
ValueCountFrequency (%)
( 662
46.5%
) 662
46.5%
73
 
5.1%
2 7
 
0.5%
. 7
 
0.5%
1 6
 
0.4%
3 4
 
0.3%
5 1
 
0.1%
6 1
 
0.1%
& 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5111
77.9%
ASCII 1453
 
22.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
698
 
13.7%
175
 
3.4%
156
 
3.1%
149
 
2.9%
115
 
2.3%
112
 
2.2%
103
 
2.0%
94
 
1.8%
76
 
1.5%
74
 
1.4%
Other values (367) 3359
65.7%
ASCII
ValueCountFrequency (%)
( 662
45.6%
) 662
45.6%
73
 
5.0%
2 7
 
0.5%
. 7
 
0.5%
1 6
 
0.4%
M 4
 
0.3%
A 4
 
0.3%
N 4
 
0.3%
3 4
 
0.3%
Other values (15) 20
 
1.4%

단지명
Categorical

IMBALANCE 

Distinct17
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
<NA>
591 
아산국가산업단지(고대부곡지구)
86 
석문국가산업단지
 
50
당진합덕지방산업단지
 
25
당진합덕농공단지
 
24
Other values (12)
77 

Length

Max length16
Median length4
Mean length6.1664713
Min length4

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 591
69.3%
아산국가산업단지(고대부곡지구) 86
 
10.1%
석문국가산업단지 50
 
5.9%
당진합덕지방산업단지 25
 
2.9%
당진합덕농공단지 24
 
2.8%
당진송악농공단지 21
 
2.5%
당진당진농공단지 11
 
1.3%
당진송산2일반산업단지 10
 
1.2%
당진신평농공단지 10
 
1.2%
당진면천농공단지 9
 
1.1%
Other values (7) 16
 
1.9%

Length

2024-01-10T07:37:27.237651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 591
69.3%
아산국가산업단지(고대부곡지구 86
 
10.1%
석문국가산업단지 50
 
5.9%
당진합덕지방산업단지 25
 
2.9%
당진합덕농공단지 24
 
2.8%
당진송악농공단지 21
 
2.5%
당진당진농공단지 11
 
1.3%
당진신평농공단지 10
 
1.2%
당진송산2일반산업단지 10
 
1.2%
당진면천농공단지 9
 
1.1%
Other values (7) 16
 
1.9%
Distinct772
Distinct (%)90.5%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2024-01-10T07:37:27.514224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length3
Mean length3.2860492
Min length2

Characters and Unicode

Total characters2803
Distinct characters210
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique703 ?
Unique (%)82.4%

Sample

1st row권민호
2nd row이준철
3rd row송효석
4th row김원일
5th row전창열
ValueCountFrequency (%)
이경호 6
 
0.7%
강학서 3
 
0.3%
김창환 3
 
0.3%
김종현 3
 
0.3%
민남규 3
 
0.3%
전오환 3
 
0.3%
이대호 3
 
0.3%
우유철 3
 
0.3%
김영선 3
 
0.3%
안정인 3
 
0.3%
Other values (790) 856
96.3%
2024-01-10T07:37:27.927729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
164
 
5.9%
139
 
5.0%
84
 
3.0%
75
 
2.7%
73
 
2.6%
69
 
2.5%
67
 
2.4%
52
 
1.9%
49
 
1.7%
/ 48
 
1.7%
Other values (200) 1983
70.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2704
96.5%
Other Punctuation 48
 
1.7%
Space Separator 37
 
1.3%
Uppercase Letter 11
 
0.4%
Decimal Number 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
164
 
6.1%
139
 
5.1%
84
 
3.1%
75
 
2.8%
73
 
2.7%
69
 
2.6%
67
 
2.5%
52
 
1.9%
49
 
1.8%
45
 
1.7%
Other values (190) 1887
69.8%
Uppercase Letter
ValueCountFrequency (%)
E 3
27.3%
K 2
18.2%
U 2
18.2%
Y 1
 
9.1%
N 1
 
9.1%
L 1
 
9.1%
D 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
/ 48
100.0%
Space Separator
ValueCountFrequency (%)
37
100.0%
Decimal Number
ValueCountFrequency (%)
1 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2704
96.5%
Common 88
 
3.1%
Latin 11
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
164
 
6.1%
139
 
5.1%
84
 
3.1%
75
 
2.8%
73
 
2.7%
69
 
2.6%
67
 
2.5%
52
 
1.9%
49
 
1.8%
45
 
1.7%
Other values (190) 1887
69.8%
Latin
ValueCountFrequency (%)
E 3
27.3%
K 2
18.2%
U 2
18.2%
Y 1
 
9.1%
N 1
 
9.1%
L 1
 
9.1%
D 1
 
9.1%
Common
ValueCountFrequency (%)
/ 48
54.5%
37
42.0%
1 3
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2704
96.5%
ASCII 99
 
3.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
164
 
6.1%
139
 
5.1%
84
 
3.1%
75
 
2.8%
73
 
2.7%
69
 
2.6%
67
 
2.5%
52
 
1.9%
49
 
1.8%
45
 
1.7%
Other values (190) 1887
69.8%
ASCII
ValueCountFrequency (%)
/ 48
48.5%
37
37.4%
1 3
 
3.0%
E 3
 
3.0%
K 2
 
2.0%
U 2
 
2.0%
Y 1
 
1.0%
N 1
 
1.0%
L 1
 
1.0%
D 1
 
1.0%

전화번호
Text

MISSING 

Distinct731
Distinct (%)94.3%
Missing78
Missing (%)9.1%
Memory size6.8 KiB
2024-01-10T07:37:28.160745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.014194
Min length11

Characters and Unicode

Total characters9311
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique693 ?
Unique (%)89.4%

Sample

1st row032-812-3438
2nd row041-355-8735
3rd row041-360-0122
4th row02-2624-0970
5th row041-357-3655
ValueCountFrequency (%)
041-355-3321 4
 
0.5%
041-356-5961 4
 
0.5%
041-357-4671 3
 
0.4%
041-358-9908 3
 
0.4%
041-356-0339 2
 
0.3%
041-354-8801 2
 
0.3%
041-358-9400 2
 
0.3%
041-355-5559 2
 
0.3%
041-680-1255 2
 
0.3%
041-355-8361 2
 
0.3%
Other values (721) 749
96.6%
2024-01-10T07:37:28.506847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1551
16.7%
0 1355
14.6%
1 1161
12.5%
3 1132
12.2%
4 1023
11.0%
5 861
9.2%
6 567
 
6.1%
2 501
 
5.4%
7 475
 
5.1%
8 400
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7760
83.3%
Dash Punctuation 1551
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1355
17.5%
1 1161
15.0%
3 1132
14.6%
4 1023
13.2%
5 861
11.1%
6 567
7.3%
2 501
 
6.5%
7 475
 
6.1%
8 400
 
5.2%
9 285
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 1551
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9311
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1551
16.7%
0 1355
14.6%
1 1161
12.5%
3 1132
12.2%
4 1023
11.0%
5 861
9.2%
6 567
 
6.1%
2 501
 
5.4%
7 475
 
5.1%
8 400
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9311
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1551
16.7%
0 1355
14.6%
1 1161
12.5%
3 1132
12.2%
4 1023
11.0%
5 861
9.2%
6 567
 
6.1%
2 501
 
5.4%
7 475
 
5.1%
8 400
 
4.3%

팩스번호
Text

MISSING 

Distinct664
Distinct (%)91.7%
Missing129
Missing (%)15.1%
Memory size6.8 KiB
2024-01-10T07:37:28.740246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.002762
Min length11

Characters and Unicode

Total characters8690
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique617 ?
Unique (%)85.2%

Sample

1st row041-363-3438
2nd row041-355-8736
3rd row041-360-0190
4th row041-362-0072
5th row02-2624-0985
ValueCountFrequency (%)
041-355-3224 4
 
0.6%
041-352-9917 4
 
0.6%
041-357-4674 4
 
0.6%
041-354-5905 4
 
0.6%
041-358-9995 3
 
0.4%
041-358-3108 3
 
0.4%
041-356-5965 3
 
0.4%
041-362-0397 3
 
0.4%
041-363-1841 3
 
0.4%
041-357-0438 2
 
0.3%
Other values (654) 691
95.4%
2024-01-10T07:37:29.074187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1448
16.7%
0 1124
12.9%
3 1091
12.6%
4 1015
11.7%
1 1012
11.6%
5 791
9.1%
6 546
 
6.3%
2 469
 
5.4%
7 424
 
4.9%
8 415
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7242
83.3%
Dash Punctuation 1448
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1124
15.5%
3 1091
15.1%
4 1015
14.0%
1 1012
14.0%
5 791
10.9%
6 546
7.5%
2 469
6.5%
7 424
 
5.9%
8 415
 
5.7%
9 355
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 1448
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8690
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1448
16.7%
0 1124
12.9%
3 1091
12.6%
4 1015
11.7%
1 1012
11.6%
5 791
9.1%
6 546
 
6.3%
2 469
 
5.4%
7 424
 
4.9%
8 415
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8690
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1448
16.7%
0 1124
12.9%
3 1091
12.6%
4 1015
11.7%
1 1012
11.6%
5 791
9.1%
6 546
 
6.3%
2 469
 
5.4%
7 424
 
4.9%
8 415
 
4.8%
Distinct747
Distinct (%)87.6%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2024-01-10T07:37:29.341978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length27
Mean length8.9284877
Min length1

Characters and Unicode

Total characters7616
Distinct characters509
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique693 ?
Unique (%)81.2%

Sample

1st row차량용 차음재
2nd row철구조물
3rd row가공철판/ 합금철
4th row스테인레스철망/ 방법창
5th row산업용보일러/ 플랜트 설비
ValueCountFrequency (%)
41
 
2.7%
철구조물 40
 
2.6%
30
 
2.0%
자동차 12
 
0.8%
철판 10
 
0.7%
알루미늄 10
 
0.7%
9
 
0.6%
자동차부품 9
 
0.6%
레미콘 9
 
0.6%
부품 9
 
0.6%
Other values (1018) 1343
88.2%
2024-01-10T07:37:29.747673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
678
 
8.9%
/ 425
 
5.6%
189
 
2.5%
158
 
2.1%
130
 
1.7%
129
 
1.7%
128
 
1.7%
125
 
1.6%
115
 
1.5%
100
 
1.3%
Other values (499) 5439
71.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5853
76.9%
Space Separator 678
 
8.9%
Other Punctuation 434
 
5.7%
Uppercase Letter 324
 
4.3%
Lowercase Letter 231
 
3.0%
Open Punctuation 40
 
0.5%
Close Punctuation 40
 
0.5%
Dash Punctuation 12
 
0.2%
Decimal Number 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
189
 
3.2%
158
 
2.7%
130
 
2.2%
129
 
2.2%
128
 
2.2%
125
 
2.1%
115
 
2.0%
100
 
1.7%
98
 
1.7%
96
 
1.6%
Other values (441) 4585
78.3%
Lowercase Letter
ValueCountFrequency (%)
e 31
13.4%
l 20
 
8.7%
a 19
 
8.2%
r 16
 
6.9%
t 15
 
6.5%
s 15
 
6.5%
o 14
 
6.1%
i 12
 
5.2%
p 12
 
5.2%
n 10
 
4.3%
Other values (14) 67
29.0%
Uppercase Letter
ValueCountFrequency (%)
E 33
 
10.2%
P 32
 
9.9%
L 26
 
8.0%
R 24
 
7.4%
H 22
 
6.8%
C 21
 
6.5%
T 20
 
6.2%
A 17
 
5.2%
B 16
 
4.9%
I 16
 
4.9%
Other values (12) 97
29.9%
Other Punctuation
ValueCountFrequency (%)
/ 425
97.9%
. 6
 
1.4%
' 2
 
0.5%
& 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
4 1
25.0%
2 1
25.0%
1 1
25.0%
3 1
25.0%
Space Separator
ValueCountFrequency (%)
678
100.0%
Open Punctuation
ValueCountFrequency (%)
( 40
100.0%
Close Punctuation
ValueCountFrequency (%)
) 40
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5853
76.9%
Common 1208
 
15.9%
Latin 555
 
7.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
189
 
3.2%
158
 
2.7%
130
 
2.2%
129
 
2.2%
128
 
2.2%
125
 
2.1%
115
 
2.0%
100
 
1.7%
98
 
1.7%
96
 
1.6%
Other values (441) 4585
78.3%
Latin
ValueCountFrequency (%)
E 33
 
5.9%
P 32
 
5.8%
e 31
 
5.6%
L 26
 
4.7%
R 24
 
4.3%
H 22
 
4.0%
C 21
 
3.8%
l 20
 
3.6%
T 20
 
3.6%
a 19
 
3.4%
Other values (36) 307
55.3%
Common
ValueCountFrequency (%)
678
56.1%
/ 425
35.2%
( 40
 
3.3%
) 40
 
3.3%
- 12
 
1.0%
. 6
 
0.5%
' 2
 
0.2%
4 1
 
0.1%
2 1
 
0.1%
1 1
 
0.1%
Other values (2) 2
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5853
76.9%
ASCII 1763
 
23.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
678
38.5%
/ 425
24.1%
( 40
 
2.3%
) 40
 
2.3%
E 33
 
1.9%
P 32
 
1.8%
e 31
 
1.8%
L 26
 
1.5%
R 24
 
1.4%
H 22
 
1.2%
Other values (48) 412
23.4%
Hangul
ValueCountFrequency (%)
189
 
3.2%
158
 
2.7%
130
 
2.2%
129
 
2.2%
128
 
2.2%
125
 
2.1%
115
 
2.0%
100
 
1.7%
98
 
1.7%
96
 
1.6%
Other values (441) 4585
78.3%
Distinct760
Distinct (%)89.1%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2024-01-10T07:37:30.038282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length45
Mean length26.443142
Min length17

Characters and Unicode

Total characters22556
Distinct characters197
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique689 ?
Unique (%)80.8%

Sample

1st row충청남도 당진시 신평면 한정리 48번지 외 7 필지
2nd row충청남도 당진시 순성면 옥호리 647-4번지
3rd row충청남도 당진시 정미면 신시리 303-9번지 (신시리 303-9) 외 22 필지
4th row충청남도 당진시 순성면 본리 617-3번지
5th row충청남도 당진시 순성면 백석리 467-30번지 외 2 필지
ValueCountFrequency (%)
충청남도 853
 
16.8%
당진시 852
 
16.7%
246
 
4.8%
필지 239
 
4.7%
송악읍 230
 
4.5%
신평면 98
 
1.9%
합덕읍 96
 
1.9%
순성면 93
 
1.8%
1 87
 
1.7%
석문면 68
 
1.3%
Other values (940) 2230
43.8%
2024-01-10T07:37:30.446427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4267
18.9%
1122
 
5.0%
932
 
4.1%
893
 
4.0%
882
 
3.9%
880
 
3.9%
874
 
3.9%
864
 
3.8%
853
 
3.8%
838
 
3.7%
Other values (187) 10151
45.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13833
61.3%
Space Separator 4267
 
18.9%
Decimal Number 3764
 
16.7%
Dash Punctuation 640
 
2.8%
Close Punctuation 20
 
0.1%
Open Punctuation 20
 
0.1%
Uppercase Letter 9
 
< 0.1%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1122
 
8.1%
932
 
6.7%
893
 
6.5%
882
 
6.4%
880
 
6.4%
874
 
6.3%
864
 
6.2%
853
 
6.2%
838
 
6.1%
817
 
5.9%
Other values (166) 4878
35.3%
Decimal Number
ValueCountFrequency (%)
1 763
20.3%
2 524
13.9%
3 507
13.5%
4 414
11.0%
6 347
9.2%
5 294
 
7.8%
0 266
 
7.1%
8 256
 
6.8%
7 205
 
5.4%
9 188
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
A 4
44.4%
L 1
 
11.1%
B 1
 
11.1%
S 1
 
11.1%
P 1
 
11.1%
C 1
 
11.1%
Space Separator
ValueCountFrequency (%)
4267
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 640
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13833
61.3%
Common 8714
38.6%
Latin 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1122
 
8.1%
932
 
6.7%
893
 
6.5%
882
 
6.4%
880
 
6.4%
874
 
6.3%
864
 
6.2%
853
 
6.2%
838
 
6.1%
817
 
5.9%
Other values (166) 4878
35.3%
Common
ValueCountFrequency (%)
4267
49.0%
1 763
 
8.8%
- 640
 
7.3%
2 524
 
6.0%
3 507
 
5.8%
4 414
 
4.8%
6 347
 
4.0%
5 294
 
3.4%
0 266
 
3.1%
8 256
 
2.9%
Other values (5) 436
 
5.0%
Latin
ValueCountFrequency (%)
A 4
44.4%
L 1
 
11.1%
B 1
 
11.1%
S 1
 
11.1%
P 1
 
11.1%
C 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13833
61.3%
ASCII 8723
38.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4267
48.9%
1 763
 
8.7%
- 640
 
7.3%
2 524
 
6.0%
3 507
 
5.8%
4 414
 
4.7%
6 347
 
4.0%
5 294
 
3.4%
0 266
 
3.0%
8 256
 
2.9%
Other values (11) 445
 
5.1%
Hangul
ValueCountFrequency (%)
1122
 
8.1%
932
 
6.7%
893
 
6.5%
882
 
6.4%
880
 
6.4%
874
 
6.3%
864
 
6.2%
853
 
6.2%
838
 
6.1%
817
 
5.9%
Other values (166) 4878
35.3%

Missing values

2024-01-10T07:37:26.395443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:37:26.499684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T07:37:26.592912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

회사명단지명대표자명전화번호팩스번호생산품공장대표주소
0(유)고려소재 신평공장<NA>권민호032-812-3438041-363-3438차량용 차음재충청남도 당진시 신평면 한정리 48번지 외 7 필지
1(유)제이씨테크<NA>이준철041-355-8735041-355-8736철구조물충청남도 당진시 순성면 옥호리 647-4번지
2(주)SIMPAC 당진공장<NA>송효석041-360-0122041-360-0190가공철판/ 합금철충청남도 당진시 정미면 신시리 303-9번지 (신시리 303-9) 외 22 필지
3(주)가온기업<NA>김원일<NA>041-362-0072스테인레스철망/ 방법창충청남도 당진시 순성면 본리 617-3번지
4(주)강원엔티에스<NA>전창열02-2624-097002-2624-0985산업용보일러/ 플랜트 설비충청남도 당진시 순성면 백석리 467-30번지 외 2 필지
5(주)경성플랜트<NA>최진이041-357-3655041-357-3656철골조립구조재/ 금속절삭가공기계충청남도 당진시 송악읍 월곡리 48-10번지
6(주)경수제철<NA>백종서041-360-0751041-360-0729강구조물충청남도 당진시 신평면 도성리 456번지 외 8 필지
7(주)경인<NA>성락인041-354-2022041-354-2025음료용방청제/탈청제/냉각수처리제/중화방청제/보일러수처리제충청남도 당진시 순성면 봉소리 296-4번지
8(주)경일엔텍<NA>박찬웅041-358-2318041-358-2314산업기계제작충청남도 당진시 송악읍 청금리 산 31-16번지 외 1 필지
9(주)고대철강<NA>문평환041-430-7885041-430-7886철구조물/ 소부재충청남도 당진시 신평면 거산리 449-22번지 외 10 필지
회사명단지명대표자명전화번호팩스번호생산품공장대표주소
843인성산업(주)당진합덕지방산업단지유기연041-360-3600041-363-3655드럼통충청남도 당진시 합덕읍 소소리 646번지
844케이엠텍(주)당진합덕지방산업단지김성이041-363-1522041-363-1523방범창충청남도 당진시 합덕읍 소소리 641번지
845케이티씨(주)당진합덕지방산업단지김명동041-362-9323<NA>크레인충청남도 당진시 합덕읍 소소리 627번지
846코리아스틸(주)당진합덕지방산업단지황현민041-363-5700041-363-4411스틸보빈충청남도 당진시 합덕읍 소소리 640번지
847태성몰드산업(주)당진합덕지방산업단지인덕교041-363-9497041-363-9496자동차부품 BRKT류충청남도 당진시 합덕읍 소소리 632번지
848한국메탈주식회사당진합덕지방산업단지김영호031-319-3197031-319-3199기계제작/ 철망충청남도 당진시 합덕읍 소소리 641번지
849현대제철(주)당진현대제철산업단지우유철/ 강학서041-680-1255041-680-1199슬라브/ 열연강판/ 후판 등충청남도 당진시 송악읍 고대리 167-32번지
850쿠퍼스탠다드코리아(유)송산2-1외국인투자지역안봉헌031-678-9968031-681-4624자동차 부품(고무 패킹 등)충청남도 당진시 송산면 가곡리 654번지
851(주)페로텍어드밴스드머티리얼즈코리아송산2외국인투자지역야마무라 타케루041-355-3310041-355-3309SIC 코팅 그라파이트충청남도 당진시 송산면 동곡리 387번지
852해윤광업(주)송산2외국인투자지역찐홍옌041-352-8650041-352-8651활석 등 분체가공품충청남도 당진시 송산면 동곡리 384번지