Overview

Dataset statistics

Number of variables: 6
Number of observations: 9334
Missing cells: 12
Missing cells (%): < 0.1%
Duplicate rows: 28
Duplicate rows (%): 0.3%
Total size in memory: 437.7 KiB
Average record size in memory: 48.0 B
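
The derived figures can be checked from the raw counts. The following is a minimal sketch of that arithmetic, assuming the report defines the missing-cell share over all cells (variables × observations), the duplicate share over all rows, and the average record size as total memory divided by the number of rows. These definitions are assumptions, not something the report states.

```python
# Counts taken from the dataset statistics above.
n_variables = 6
n_observations = 9334
missing_cells = 12
duplicate_rows = 28
total_size_kib = 437.7

# Assumed definitions: share of all cells, share of all rows, bytes per row.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.3f}%")        # below the 0.1% threshold
print(f"duplicate rows: {duplicate_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the results agree with the reported "< 0.1%", "0.3%", and "48.0 B".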

Variable types

Categorical: 2
Text: 4

Dataset

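Description: Open data on manufacturing companies in Chungcheongnam-do, listing for each company the city/county code (시군기호), industry code (업종코드), company name (기업체명), representative (대표자), location (소재지), and main products (주생산품).
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=411&beforeMenuCd=DOM_000000201001001000&publicdatapk=3039901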

Alerts

Dataset has 28 (0.3%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2024-01-09 22:44:54.848078
Analysis finished: 2024-01-09 22:44:56.330345
Duration: 1.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
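
A report of this kind can be regenerated with ydata-profiling. The following is a minimal sketch, not the author's actual configuration (the referenced config.json is not reproduced here). The function name, file paths, title, and encoding default are all hypothetical.

```python
def build_profile_report(csv_path, html_path, encoding="utf-8"):
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported lazily so this module loads even where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the source file.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
    return report
```

A call such as `build_profile_report("manufacturers.csv", "report.html")` would write the report, where both file names are placeholders.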

Variables

시군기호
Categorical

Distinct: 15
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 73.1 KiB

천안 | 2762
아산 | 2193
당진 | 799
논산 | 622
공주 | 594
Other values (10) | 2364

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 천안
2nd row: 천안
3rd row: 천안
4th row: 천안
5th row: 천안

Common Values

Value | Count | Frequency (%)
천안 | 2762 | 29.6%
아산 | 2193 | 23.5%
당진 | 799 | 8.6%
논산 | 622 | 6.7%
공주 | 594 | 6.4%
금산 | 434 | 4.6%
서산 | 368 | 3.9%
홍성 | 322 | 3.4%
예산 | 320 | 3.4%
보령 | 297 | 3.2%
Other values (5) | 623 | 6.7%
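
A frequency table of this shape, with the most common values listed and the remainder collapsed into an "Other values (k)" row, can be sketched with the standard library. The function name and the toy input are hypothetical. Only the final list of counts is taken from the table above.

```python
from collections import Counter


def top_values_with_other(values, n=10):
    """Return (label, count, percent) rows for the n most common values,
    followed by one 'Other values (k)' row covering the remaining k values."""
    counts = Counter(values)
    total = sum(counts.values())
    top = counts.most_common(n)
    rows = [(value, count, 100 * count / total) for value, count in top]
    remaining = len(counts) - len(top)
    if remaining > 0:
        rest = total - sum(count for _, count in top)
        rows.append((f"Other values ({remaining})", rest, 100 * rest / total))
    return rows


# Toy input, not the real column.
values = ["천안"] * 5 + ["아산"] * 3 + ["당진"] + ["논산"]
rows = top_values_with_other(values, n=2)
for label, count, pct in rows:
    print(f"{label} | {count} | {pct:.1f}%")

# Counts from the table above, including the "Other values (5)" row.
report_counts = [2762, 2193, 799, 622, 594, 434, 368, 322, 320, 297, 623]
print(sum(report_counts))
```

The counts in the table sum to 9334, the number of observations, which is consistent with the column having no missing values.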

Length

[Figure: Histogram of lengths of the category]

업종코드
Categorical

Distinct: 24
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 73.1 KiB

기타제품 | 1864
금속가공제품;기계및가구제외 | 1303
식료품 | 988
기타기계및장비 | 952
자동차및트레일러 | 566
Other values (19) | 3661

Length

Max length: 19
Median length: 14
Mean length: 8.2733019
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 식료품
2nd row: 식료품
3rd row: 식료품
4th row: 식료품
5th row: 식료품

Common Values

Value | Count | Frequency (%)
기타제품 | 1864 | 20.0%
금속가공제품;기계및가구제외 | 1303 | 14.0%
식료품 | 988 | 10.6%
기타기계및장비 | 952 | 10.2%
자동차및트레일러 | 566 | 6.1%
화학물질및화학제품;의약품제외 | 526 | 5.6%
고무제품및플라스틱제품 | 520 | 5.6%
전기장비 | 426 | 4.6%
비금속광물제품 | 417 | 4.5%
전자부품,컴퓨터,영상,음향및통신장비 | 395 | 4.2%
Other values (14) | 1377 | 14.8%

Length

[Figure: Histogram of lengths of the category]

기업체명
Text

Distinct: 8914
Distinct (%): 95.5%
Missing: 0
Missing (%): 0.0%
Memory size: 73.1 KiB

Length

Max length: 26
Median length: 23
Mean length: 7.35976
Min length: 2

Characters and Unicode

Total characters: 68696
Distinct characters: 793
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
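
The category labels used in the tables below correspond to Unicode general categories, which Python exposes through the standard `unicodedata` module. The following is a small sketch that counts categories for the first sample value of this variable. The helper name is hypothetical, and the standard library does not expose script or block properties, so only categories are shown.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# First sample value of this variable, taken from the report.
name = "농업회사법인에스에스바이오팜(주)"
counts = category_counts(name)
for code, count in counts.most_common():
    print(code, count)

# Two-letter codes map to the report's labels, for example:
# Lo = Other Letter, Ps = Open Punctuation, Pe = Close Punctuation,
# Zs = Space Separator, So = Other Symbol, Nd = Decimal Number.
print(unicodedata.category("㈜"))
```

Hangul syllables fall under Other Letter (Lo), and the enclosed form ㈜ is classified as Other Symbol (So), so a name written with ㈜ contributes to a different category than one written as (주).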

Unique

Unique: 8534
Unique (%): 91.4%
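
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. This is a minimal sketch of that distinction on toy data. The function name and the input are hypothetical.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for count in counts.values() if count == 1)
    return distinct, unique


# Toy input: "b" and "c" repeat, "a" and "d" occur once.
result = distinct_and_unique(["a", "b", "b", "c", "c", "c", "d"])
print(result)
```

With the figures above, 8914 distinct values of which 8534 are unique means that 380 company names appear more than once.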

Sample

1st row: 농업회사법인에스에스바이오팜(주)
2nd row: 천안 티 엠 알 영농조합
3rd row: 장인촌
4th row: 신흥산업(주)
5th row: 농업회사법인 나래푸드주식회사

Value | Count | Frequency (%)
주식회사 | 404 | 3.8%
농업회사법인 | 114 | 1.1%
제2공장 | 50 | 0.5%
2공장 | 44 | 0.4%
영농조합법인 | 28 | 0.3%
(not shown) | 25 | 0.2%
천안공장 | 20 | 0.2%
아산공장 | 20 | 0.2%
제1공장 | 17 | 0.2%
논산공장 | 17 | 0.2%
Other values (8987) | 9781 | 93.0%

Most occurring characters

ValueCountFrequency (%)
5843
 
8.5%
) 5325
 
7.8%
( 5324
 
7.8%
1939
 
2.8%
1579
 
2.3%
1451
 
2.1%
1363
 
2.0%
1227
 
1.8%
1162
 
1.7%
1016
 
1.5%
Other values (783) 42467
61.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 55094 | 80.2%
Close Punctuation | 5328 | 7.8%
Open Punctuation | 5327 | 7.8%
Space Separator | 1227 | 1.8%
Other Symbol | 730 | 1.1%
Uppercase Letter | 526 | 0.8%
Decimal Number | 274 | 0.4%
Lowercase Letter | 90 | 0.1%
Other Punctuation | 89 | 0.1%
Dash Punctuation | 11 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5843
 
10.6%
1939
 
3.5%
1579
 
2.9%
1451
 
2.6%
1363
 
2.5%
1162
 
2.1%
1016
 
1.8%
883
 
1.6%
781
 
1.4%
777
 
1.4%
Other values (719) 38300
69.5%
Uppercase Letter
ValueCountFrequency (%)
S 58
 
11.0%
N 57
 
10.8%
E 50
 
9.5%
G 42
 
8.0%
C 38
 
7.2%
T 28
 
5.3%
M 27
 
5.1%
P 25
 
4.8%
D 24
 
4.6%
B 23
 
4.4%
Other values (14) 154
29.3%
Lowercase Letter
ValueCountFrequency (%)
e 13
14.4%
o 9
10.0%
n 9
10.0%
c 8
8.9%
r 7
 
7.8%
a 6
 
6.7%
y 5
 
5.6%
t 5
 
5.6%
h 5
 
5.6%
u 4
 
4.4%
Other values (10) 19
21.1%
Decimal Number
ValueCountFrequency (%)
2 169
61.7%
1 54
 
19.7%
3 30
 
10.9%
4 8
 
2.9%
5 7
 
2.6%
6 4
 
1.5%
7 2
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 56
62.9%
& 22
 
24.7%
, 5
 
5.6%
· 3
 
3.4%
/ 2
 
2.2%
' 1
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 5325
99.9%
] 3
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 5324
99.9%
[ 3
 
0.1%
Space Separator
ValueCountFrequency (%)
1227
100.0%
Other Symbol
ValueCountFrequency (%)
730
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 55824 | 81.3%
Common | 12256 | 17.8%
Latin | 616 | 0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5843
 
10.5%
1939
 
3.5%
1579
 
2.8%
1451
 
2.6%
1363
 
2.4%
1162
 
2.1%
1016
 
1.8%
883
 
1.6%
781
 
1.4%
777
 
1.4%
Other values (720) 39030
69.9%
Latin
ValueCountFrequency (%)
S 58
 
9.4%
N 57
 
9.3%
E 50
 
8.1%
G 42
 
6.8%
C 38
 
6.2%
T 28
 
4.5%
M 27
 
4.4%
P 25
 
4.1%
D 24
 
3.9%
B 23
 
3.7%
Other values (34) 244
39.6%
Common
ValueCountFrequency (%)
) 5325
43.4%
( 5324
43.4%
1227
 
10.0%
2 169
 
1.4%
. 56
 
0.5%
1 54
 
0.4%
3 30
 
0.2%
& 22
 
0.2%
- 11
 
0.1%
4 8
 
0.1%
Other values (9) 30
 
0.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 55094 | 80.2%
ASCII | 12869 | 18.7%
None | 733 | 1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5843
 
10.6%
1939
 
3.5%
1579
 
2.9%
1451
 
2.6%
1363
 
2.5%
1162
 
2.1%
1016
 
1.8%
883
 
1.6%
781
 
1.4%
777
 
1.4%
Other values (719) 38300
69.5%
ASCII
ValueCountFrequency (%)
) 5325
41.4%
( 5324
41.4%
1227
 
9.5%
2 169
 
1.3%
S 58
 
0.5%
N 57
 
0.4%
. 56
 
0.4%
1 54
 
0.4%
E 50
 
0.4%
G 42
 
0.3%
Other values (52) 507
 
3.9%
None
ValueCountFrequency (%)
730
99.6%
· 3
 
0.4%

대표자
Text

Distinct: 7449
Distinct (%): 79.8%
Missing: 0
Missing (%): 0.0%
Memory size: 73.1 KiB

Length

Max length: 23
Median length: 3
Mean length: 3.2505892
Min length: 2

Characters and Unicode

Total characters: 30341
Distinct characters: 390
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6155
Unique (%): 65.9%

Sample

1st row: 김옥희
2nd row: 이창길
3rd row: 공영화
4th row: 김상렬
5th row: 박성미

Value | Count | Frequency (%)
(not shown) | 17 | 0.2%
김영민 | 10 | 0.1%
김영호 | 10 | 0.1%
1인 | 10 | 0.1%
김광수 | 9 | 0.1%
(not shown) | 9 | 0.1%
김종호 | 8 | 0.1%
김정수 | 8 | 0.1%
김수현 | 7 | 0.1%
이성우 | 7 | 0.1%
Other values (7529) | 9586 | 99.0%

Most occurring characters

ValueCountFrequency (%)
1917
 
6.3%
1593
 
5.3%
964
 
3.2%
811
 
2.7%
793
 
2.6%
565
 
1.9%
538
 
1.8%
514
 
1.7%
471
 
1.6%
447
 
1.5%
Other values (380) 21728
71.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 29438 | 97.0%
Other Punctuation | 403 | 1.3%
Space Separator | 367 | 1.2%
Decimal Number | 50 | 0.2%
Uppercase Letter | 43 | 0.1%
Lowercase Letter | 38 | 0.1%
Close Punctuation | 1 | < 0.1%
Open Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1917
 
6.5%
1593
 
5.4%
964
 
3.3%
811
 
2.8%
793
 
2.7%
565
 
1.9%
538
 
1.8%
514
 
1.7%
471
 
1.6%
447
 
1.5%
Other values (339) 20825
70.7%
Uppercase Letter
ValueCountFrequency (%)
A 6
14.0%
I 5
11.6%
D 4
9.3%
K 4
9.3%
E 4
9.3%
N 3
 
7.0%
H 3
 
7.0%
S 2
 
4.7%
R 2
 
4.7%
T 2
 
4.7%
Other values (7) 8
18.6%
Lowercase Letter
ValueCountFrequency (%)
e 6
15.8%
i 5
13.2%
r 4
10.5%
n 4
10.5%
k 3
7.9%
a 3
7.9%
f 2
 
5.3%
l 2
 
5.3%
u 2
 
5.3%
s 2
 
5.3%
Other values (5) 5
13.2%
Other Punctuation
ValueCountFrequency (%)
, 396
98.3%
. 6
 
1.5%
/ 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 45
90.0%
2 4
 
8.0%
0 1
 
2.0%
Space Separator
ValueCountFrequency (%)
367
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 29438 | 97.0%
Common | 822 | 2.7%
Latin | 81 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1917
 
6.5%
1593
 
5.4%
964
 
3.3%
811
 
2.8%
793
 
2.7%
565
 
1.9%
538
 
1.8%
514
 
1.7%
471
 
1.6%
447
 
1.5%
Other values (339) 20825
70.7%
Latin
ValueCountFrequency (%)
A 6
 
7.4%
e 6
 
7.4%
i 5
 
6.2%
I 5
 
6.2%
D 4
 
4.9%
r 4
 
4.9%
n 4
 
4.9%
K 4
 
4.9%
E 4
 
4.9%
N 3
 
3.7%
Other values (22) 36
44.4%
Common
ValueCountFrequency (%)
, 396
48.2%
367
44.6%
1 45
 
5.5%
. 6
 
0.7%
2 4
 
0.5%
0 1
 
0.1%
/ 1
 
0.1%
) 1
 
0.1%
( 1
 
0.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 29438 | 97.0%
ASCII | 903 | 3.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1917
 
6.5%
1593
 
5.4%
964
 
3.3%
811
 
2.8%
793
 
2.7%
565
 
1.9%
538
 
1.8%
514
 
1.7%
471
 
1.6%
447
 
1.5%
Other values (339) 20825
70.7%
ASCII
ValueCountFrequency (%)
, 396
43.9%
367
40.6%
1 45
 
5.0%
A 6
 
0.7%
e 6
 
0.7%
. 6
 
0.7%
i 5
 
0.6%
I 5
 
0.6%
2 4
 
0.4%
D 4
 
0.4%
Other values (31) 59
 
6.5%

소 재 지
Text

Distinct: 8454
Distinct (%): 90.6%
Missing: 0
Missing (%): 0.0%
Memory size: 73.1 KiB

Length

Max length: 71
Median length: 64
Mean length: 26.567924
Min length: 12

Characters and Unicode

Total characters: 247985
Distinct characters: 555
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7774
Unique (%): 83.3%

Sample

1st row: 충청남도 천안시 동남구 각원사길 69 -4(안서동, 상명대학교창작스튜디오) 201, 203, 205호
2nd row: 충청남도 천안시 동남구 광덕면 광덕로 219
3rd row: 충청남도 천안시 동남구 광덕면 광치마을길 81
4th row: 충청남도 천안시 동남구 광덕면 대평교길 5-13
5th row: 충청남도 천안시 동남구 광덕면 휴암2길 20 -6

Value | Count | Frequency (%)
충청남도 | 9324 | 17.1%
천안시 | 2762 | 5.1%
아산시 | 2190 | 4.0%
서북구 | 1833 | 3.4%
동남구 | 929 | 1.7%
당진시 | 800 | 1.5%
필지 | 689 | 1.3%
(not shown) | 668 | 1.2%
논산시 | 633 | 1.2%
음봉면 | 595 | 1.1%
Other values (8060) | 34229 | 62.6%

Most occurring characters

ValueCountFrequency (%)
46629
 
18.8%
10971
 
4.4%
9656
 
3.9%
9548
 
3.9%
9496
 
3.8%
7865
 
3.2%
1 7844
 
3.2%
7375
 
3.0%
2 6117
 
2.5%
6076
 
2.5%
Other values (545) 126408
51.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 151450 | 61.1%
Space Separator | 46629 | 18.8%
Decimal Number | 38856 | 15.7%
Dash Punctuation | 4063 | 1.6%
Open Punctuation | 2836 | 1.1%
Close Punctuation | 2824 | 1.1%
Other Punctuation | 926 | 0.4%
Uppercase Letter | 341 | 0.1%
Other Symbol | 26 | < 0.1%
Math Symbol | 18 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10971
 
7.2%
9656
 
6.4%
9548
 
6.3%
9496
 
6.3%
7865
 
5.2%
7375
 
4.9%
6076
 
4.0%
5180
 
3.4%
4067
 
2.7%
3922
 
2.6%
Other values (489) 77294
51.0%
Uppercase Letter
ValueCountFrequency (%)
B 82
24.0%
M 34
10.0%
A 34
10.0%
C 34
10.0%
I 24
 
7.0%
E 21
 
6.2%
T 20
 
5.9%
N 14
 
4.1%
S 13
 
3.8%
G 11
 
3.2%
Other values (13) 54
15.8%
Decimal Number
ValueCountFrequency (%)
1 7844
20.2%
2 6117
15.7%
3 4725
12.2%
4 3646
9.4%
5 3407
8.8%
6 3129
 
8.1%
0 2782
 
7.2%
7 2728
 
7.0%
8 2353
 
6.1%
9 2125
 
5.5%
Lowercase Letter
ValueCountFrequency (%)
e 3
18.8%
n 2
12.5%
a 2
12.5%
r 2
12.5%
o 2
12.5%
h 1
 
6.2%
c 1
 
6.2%
s 1
 
6.2%
i 1
 
6.2%
t 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 892
96.3%
: 16
 
1.7%
. 9
 
1.0%
/ 6
 
0.6%
& 3
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 2833
99.9%
[ 3
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 2821
99.9%
] 3
 
0.1%
Space Separator
ValueCountFrequency (%)
46629
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4063
100.0%
Other Symbol
ValueCountFrequency (%)
26
100.0%
Math Symbol
ValueCountFrequency (%)
~ 18
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 151476 | 61.1%
Common | 96152 | 38.8%
Latin | 357 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10971
 
7.2%
9656
 
6.4%
9548
 
6.3%
9496
 
6.3%
7865
 
5.2%
7375
 
4.9%
6076
 
4.0%
5180
 
3.4%
4067
 
2.7%
3922
 
2.6%
Other values (490) 77320
51.0%
Latin
ValueCountFrequency (%)
B 82
23.0%
M 34
9.5%
A 34
9.5%
C 34
9.5%
I 24
 
6.7%
E 21
 
5.9%
T 20
 
5.6%
N 14
 
3.9%
S 13
 
3.6%
G 11
 
3.1%
Other values (23) 70
19.6%
Common
ValueCountFrequency (%)
46629
48.5%
1 7844
 
8.2%
2 6117
 
6.4%
3 4725
 
4.9%
- 4063
 
4.2%
4 3646
 
3.8%
5 3407
 
3.5%
6 3129
 
3.3%
( 2833
 
2.9%
) 2821
 
2.9%
Other values (12) 10938
 
11.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 151450 | 61.1%
ASCII | 96509 | 38.9%
None | 26 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
46629
48.3%
1 7844
 
8.1%
2 6117
 
6.3%
3 4725
 
4.9%
- 4063
 
4.2%
4 3646
 
3.8%
5 3407
 
3.5%
6 3129
 
3.2%
( 2833
 
2.9%
) 2821
 
2.9%
Other values (45) 11295
 
11.7%
Hangul
ValueCountFrequency (%)
10971
 
7.2%
9656
 
6.4%
9548
 
6.3%
9496
 
6.3%
7865
 
5.2%
7375
 
4.9%
6076
 
4.0%
5180
 
3.4%
4067
 
2.7%
3922
 
2.6%
Other values (489) 77294
51.0%
None
ValueCountFrequency (%)
26
100.0%

주생산품
Text

Distinct: 7401
Distinct (%): 79.4%
Missing: 12
Missing (%): 0.1%
Memory size: 73.1 KiB

Length

Max length: 68
Median length: 53
Mean length: 9.3288994
Min length: 1

Characters and Unicode

Total characters: 86964
Distinct characters: 887
Distinct categories: 13
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6655
Unique (%): 71.4%

Sample

1st row: 홍삼제품/기타가공식품
2nd row: 사료
3rd row: 장류, 메주
4th row: 식물성 사료 등
5th row: 육포류,떡갈비류

Value | Count | Frequency (%)
(not shown) | 435 | 2.6%
(not shown) | 265 | 1.6%
(not shown) | 247 | 1.5%
자동차 | 182 | 1.1%
제조업 | 176 | 1.0%
부품 | 173 | 1.0%
자동차부품 | 162 | 1.0%
반도체 | 131 | 0.8%
플라스틱 | 113 | 0.7%
철구조물 | 94 | 0.6%
Other values (8545) | 14951 | 88.3%

Most occurring characters

ValueCountFrequency (%)
7676
 
8.8%
, 4023
 
4.6%
2195
 
2.5%
1856
 
2.1%
1514
 
1.7%
1462
 
1.7%
1385
 
1.6%
1318
 
1.5%
1290
 
1.5%
1242
 
1.4%
Other values (877) 63003
72.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 65687 | 75.5%
Space Separator | 7676 | 8.8%
Uppercase Letter | 5302 | 6.1%
Other Punctuation | 4310 | 5.0%
Lowercase Letter | 2305 | 2.7%
Close Punctuation | 679 | 0.8%
Open Punctuation | 679 | 0.8%
Decimal Number | 222 | 0.3%
Dash Punctuation | 98 | 0.1%
Modifier Symbol | 2 | < 0.1%
Other values (3) | 4 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2195
 
3.3%
1856
 
2.8%
1514
 
2.3%
1462
 
2.2%
1385
 
2.1%
1318
 
2.0%
1290
 
2.0%
1242
 
1.9%
1204
 
1.8%
1109
 
1.7%
Other values (798) 51112
77.8%
Uppercase Letter
ValueCountFrequency (%)
C 570
 
10.8%
E 497
 
9.4%
P 474
 
8.9%
L 439
 
8.3%
D 385
 
7.3%
S 315
 
5.9%
A 303
 
5.7%
T 300
 
5.7%
R 285
 
5.4%
O 235
 
4.4%
Other values (16) 1499
28.3%
Lowercase Letter
ValueCountFrequency (%)
e 293
12.7%
r 204
 
8.9%
a 201
 
8.7%
o 164
 
7.1%
s 158
 
6.9%
t 154
 
6.7%
l 153
 
6.6%
i 141
 
6.1%
n 114
 
4.9%
p 108
 
4.7%
Other values (16) 615
26.7%
Decimal Number
ValueCountFrequency (%)
1 80
36.0%
2 50
22.5%
3 28
 
12.6%
0 21
 
9.5%
4 16
 
7.2%
5 10
 
4.5%
6 7
 
3.2%
8 5
 
2.3%
9 5
 
2.3%
Other Punctuation
ValueCountFrequency (%)
, 4023
93.3%
. 143
 
3.3%
/ 92
 
2.1%
' 25
 
0.6%
· 14
 
0.3%
& 8
 
0.2%
: 3
 
0.1%
% 2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 677
99.7%
] 2
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 677
99.7%
[ 2
 
0.3%
Space Separator
ValueCountFrequency (%)
7676
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 98
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Other Number
ValueCountFrequency (%)
² 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 65686 | 75.5%
Common | 13670 | 15.7%
Latin | 7607 | 8.7%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2195
 
3.3%
1856
 
2.8%
1514
 
2.3%
1462
 
2.2%
1385
 
2.1%
1318
 
2.0%
1290
 
2.0%
1242
 
1.9%
1204
 
1.8%
1109
 
1.7%
Other values (797) 51111
77.8%
Latin
ValueCountFrequency (%)
C 570
 
7.5%
E 497
 
6.5%
P 474
 
6.2%
L 439
 
5.8%
D 385
 
5.1%
S 315
 
4.1%
A 303
 
4.0%
T 300
 
3.9%
e 293
 
3.9%
R 285
 
3.7%
Other values (42) 3746
49.2%
Common
ValueCountFrequency (%)
7676
56.2%
, 4023
29.4%
) 677
 
5.0%
( 677
 
5.0%
. 143
 
1.0%
- 98
 
0.7%
/ 92
 
0.7%
1 80
 
0.6%
2 50
 
0.4%
3 28
 
0.2%
Other values (17) 126
 
0.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 65677 | 75.5%
ASCII | 21261 | 24.4%
None | 15 | < 0.1%
Compat Jamo | 9 | < 0.1%
CJK | 1 | < 0.1%
Punctuation | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7676
36.1%
, 4023
18.9%
) 677
 
3.2%
( 677
 
3.2%
C 570
 
2.7%
E 497
 
2.3%
P 474
 
2.2%
L 439
 
2.1%
D 385
 
1.8%
S 315
 
1.5%
Other values (66) 5528
26.0%
Hangul
ValueCountFrequency (%)
2195
 
3.3%
1856
 
2.8%
1514
 
2.3%
1462
 
2.2%
1385
 
2.1%
1318
 
2.0%
1290
 
2.0%
1242
 
1.9%
1204
 
1.8%
1109
 
1.7%
Other values (795) 51102
77.8%
None
ValueCountFrequency (%)
· 14
93.3%
² 1
 
6.7%
Compat Jamo
ValueCountFrequency (%)
8
88.9%
1
 
11.1%
CJK
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

Correlations

 | 시군기호 | 업종코드
시군기호 | 1.000 | 0.632
업종코드 | 0.632 | 1.000

 | 시군기호 | 업종코드
시군기호 | 1.000 | 0.246
업종코드 | 0.246 | 1.000

 | 시군기호 | 업종코드
시군기호 | 1.000 | 0.246
업종코드 | 0.246 | 1.000
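
The text does not say which association measure each matrix uses. As an illustration of how an association between two categorical columns can be quantified, here is a sketch of the uncorrected Cramér's V, a common choice for such pairs. It is not claimed to be the measure behind any of the matrices above. The function name and the toy inputs are hypothetical.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return math.sqrt(chi2 / (n * k))


# Toy inputs: one perfectly associated pair, one independent pair.
perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(perfect, independent)
```

The value ranges from 0 (no association) to 1 (one column fully determines the other), which is the same scale as the off-diagonal entries above.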

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.]
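
The per-column nullity that these figures visualize amounts to counting missing cells in each column. This is a minimal standard-library sketch, assuming missing cells are represented as `None`. The function name and the toy rows are hypothetical, and only the column names come from the report.

```python
def missing_per_column(rows, columns):
    """Count cells that are None, per column, in a list of dict rows."""
    counts = {column: 0 for column in columns}
    for row in rows:
        for column in columns:
            if row.get(column) is None:
                counts[column] += 1
    return counts


# Toy rows, not real records.
columns = ["기업체명", "주생산품"]
rows = [
    {"기업체명": "company A", "주생산품": "product X"},
    {"기업체명": "company B", "주생산품": None},
    {"기업체명": "company C", "주생산품": "product Y"},
]
nullity = missing_per_column(rows, columns)
print(nullity)
```

In this report, 주생산품 is the only variable with missing values (12 cells).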

Sample

Index | 시군기호 | 업종코드 | 기업체명 | 대표자 | 소 재 지 | 주생산품
0 | 천안 | 식료품 | 농업회사법인에스에스바이오팜(주) | 김옥희 | 충청남도 천안시 동남구 각원사길 69 -4(안서동, 상명대학교창작스튜디오) 201, 203, 205호 | 홍삼제품/기타가공식품
1 | 천안 | 식료품 | 천안 티 엠 알 영농조합 | 이창길 | 충청남도 천안시 동남구 광덕면 광덕로 219 | 사료
2 | 천안 | 식료품 | 장인촌 | 공영화 | 충청남도 천안시 동남구 광덕면 광치마을길 81 | 장류, 메주
3 | 천안 | 식료품 | 신흥산업(주) | 김상렬 | 충청남도 천안시 동남구 광덕면 대평교길 5-13 | 식물성 사료 등
4 | 천안 | 식료품 | 농업회사법인 나래푸드주식회사 | 박성미 | 충청남도 천안시 동남구 광덕면 휴암2길 20 -6 | 육포류,떡갈비류
5 | 천안 | 식료품 | 대해식품 | 권영미 | 충청남도 천안시 동남구 구성5길 6-5 (구성동) | 초고추장.떡볶이소스.불고기양념.매운탕양념등
6 | 천안 | 식료품 | (주)베어스 | 서정근 | 충청남도 천안시 동남구 단대로 119 (안서동) | 홍차, 보이차
7 | 천안 | 식료품 | (주)지노바이오텍 | 정형진 | 충청남도 천안시 동남구 단대로 119, 단국대학교 생명공학창업보육센터 306호 (안서동) | 미네랄영양제(단미사료)
8 | 천안 | 식료품 | 세화산업 | 이문철 | 충청남도 천안시 동남구 동면 수남리 229-2번지 | 원형뻥과자
9 | 천안 | 식료품 | 반개정미소 | 윤건섭 | 충청남도 천안시 동남구 동면 충절로 2121-5 |

Index | 시군기호 | 업종코드 | 기업체명 | 대표자 | 소 재 지 | 주생산품
9324 | 태안 | 비금속광물제품 | 한국서부발전(주) 태안발전본부 정제공장 | 김병숙 | 충청남도 태안군 원북면 방갈리 831-4번지 | 태안화력 석탄정제회
9325 | 태안 | 식료품 | 한국홍원주식회사 | 안주영 | 충청남도 태안군 소원면 대소산길 290-78 | 건조해삼
9326 | 태안 | 기타운송장비 | 한길조선소 | 한광길 | 충청남도 태안군 태안읍 원이로 341-37 | F.R.P 선박건조
9327 | 태안 | 식료품 | 해가연 농업회사법인(주) | 이인선 | 충청남도 태안군 근흥면 근흥로 242-28 (총 2 필지) | 생들깨 기름
9328 | 태안 | 금속가공제품;기계및가구제외 | ㈜서해철망 | 전필수 | 충청남도 태안군 태안읍 원이로 341-68 | 용접철망
9329 | 태안 | 식료품 | 끌림언니 | 김혜영 | 충청남도 태안군 근흥면 근흥로 242-16 | 생강즙
9330 | 태안 | 식료품 | 대현수산영어영농조합법인 | 전병년 | 충청남도 태안군 고남면 누동리 1645-1 | 액젓, 기타젓갈류, 냉동바지락
9331 | 태안 | 식료품 | 에이치엠오건강드림영농조합법인(HMO) | 손진성 | 충청남도 태안군 안면대로 208-37 | 굼벵이 추출액, 추출분말
9332 | 태안 | 금속가공제품;기계및가구제외 | 신호개발 | 최재선 | 충청남도 태안군 태안읍 원이로 341-31 | 난간(알루미늄 파이프)
9333 | 태안 | 식료품 | 태안앤스페인영농조합법인 | 이재룡 | 충청남도 태안군 소원면 천리포1길 122-21 | 흑마늘 추출액, 흑삼추출액

Duplicate rows

Most frequently occurring

Index | 시군기호 | 업종코드 | 기업체명 | 대표자 | 소 재 지 | 주생산품 | # duplicates
0 | 공주 | 고무제품및플라스틱제품 | (주)이큐온 | 서종규 | 충청남도 공주시 우성면 보흥리 산 76-11 외 1필지 | 이형지, 이형필름 | 2
1 | 공주 | 금속가공제품;기계및가구제외 | 주식회사 신우종합상사 | 엄복순 | 충청남도 공주시 우성면 심산길 38-12 | 용접철망,건축자재,안전용품 | 2
2 | 공주 | 기타제품 | (주)삼보광통신 | 김영련 | 충청남도 공주시 우성면 심산길 120-10 | 방송기기,통신기기 | 2
3 | 공주 | 기타제품 | (주)엔씨원 | 유세아 | 충청남도 공주시 우성면 보흥1길 243-29 (총 4 필지) | 콘크리트 블럭 | 2
4 | 공주 | 기타제품 | (주)자연과환경 | 이병용 | 충청남도 공주시 우성면 보흥리 431 외 1필지 | 콘크리트블럭 | 2
5 | 공주 | 기타제품 | (주)환경과 사람들 | 이옥자 | 충청남도 공주시 우성면 역골2길 6-13, 도천리 323-1 | 플라스틱 재생 원료 | 2
6 | 공주 | 기타제품 | 아진산업 주식회사 | 김종호 | 충청남도 공주시 우성면 질마고개길 26-24 | 아스콘 | 2
7 | 공주 | 기타제품 | 아진산업 주식회사 | 김종호 | 충청남도 공주시 우성면 질마고개길 46 (총 2 필지) | 레미콘 | 2
8 | 공주 | 기타제품 | 정호산업(주) | 정상근 | 충청남도 공주시 우성면 약천길 71, ,171-1,171-2,135-5 | 레디믹스트콘크리트 | 2
9 | 공주 | 기타제품 | 주식회사 쌍마산업 | 심상용 | 충청남도 공주시 우성면 지게실길 100 | 스테인레스수세미 | 2
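
A table like this, with each repeated row listed once alongside its number of occurrences, can be produced by counting identical rows. The following is a minimal standard-library sketch. The function name and the toy rows are hypothetical, and it makes no claim about how the report derives its total of 28 duplicate rows.

```python
from collections import Counter


def duplicated_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(tuple(row) for row in rows)
    return {row: count for row, count in counts.items() if count > 1}


# Toy rows, not real records: the first two are identical in every column.
rows = [
    ("공주", "기타제품", "company A"),
    ("공주", "기타제품", "company A"),
    ("공주", "기타제품", "company B"),
]
groups = duplicated_groups(rows)
for row, count in groups.items():
    print(" | ".join(row), "|", count)
```

Rows count as duplicates only when every column matches. Rows 6 and 7 above share a company and representative but differ in address and product, so they are two separate duplicate groups.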