Overview

Dataset statistics

Number of variables10
Number of observations179
Missing cells106
Missing cells (%)5.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.1 KiB
Average record size in memory80.7 B

Variable types

Text8
Categorical2

Dataset

Description충청남도 부여군 내 제조업체 등록현황(회사명, 전화번호, 팩스번호, 주소, 업종명, 생산품, 대표자명, 운영구분, 데이터기준일자)
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=432&beforeMenuCd=DOM_000000201001001000&publicdatapk=15053310

Alerts

데이터기준일자 has constant value ""Constant
운영구분 is highly imbalanced (50.4%)Imbalance
전화번호 has 27 (15.1%) missing valuesMissing
팩스번호 has 79 (44.1%) missing valuesMissing

Reproduction

Analysis started2024-01-09 23:15:12.325331
Analysis finished2024-01-09 23:15:13.280026
Duration0.95 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct177
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-01-10T08:15:13.406570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length14
Mean length7.4916201
Min length3

Characters and Unicode

Total characters1341
Distinct characters231
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique175 ?
Unique (%)97.8%

Sample

1st row(유)미주산업
2nd row(주) 건영에스피지
3rd row(주)계림폴리콘
4th row(주)골든구스
5th row(주)광일산업
ValueCountFrequency (%)
주식회사 13
 
6.4%
동양산업 2
 
1.0%
주)뉴제일이엘이씨 2
 
1.0%
이스코인더스트리(유 2
 
1.0%
부여지점 2
 
1.0%
농업회사법인 2
 
1.0%
세도석재산업 1
 
0.5%
신실천막 1
 
0.5%
유)미주산업 1
 
0.5%
씨지주식회사 1
 
0.5%
Other values (177) 177
86.8%
2024-01-10T08:15:14.007517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
112
 
8.4%
( 91
 
6.8%
) 91
 
6.8%
36
 
2.7%
34
 
2.5%
32
 
2.4%
30
 
2.2%
27
 
2.0%
26
 
1.9%
25
 
1.9%
Other values (221) 837
62.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1124
83.8%
Open Punctuation 91
 
6.8%
Close Punctuation 91
 
6.8%
Space Separator 25
 
1.9%
Uppercase Letter 8
 
0.6%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
112
 
10.0%
36
 
3.2%
34
 
3.0%
32
 
2.8%
30
 
2.7%
27
 
2.4%
26
 
2.3%
25
 
2.2%
22
 
2.0%
18
 
1.6%
Other values (209) 762
67.8%
Uppercase Letter
ValueCountFrequency (%)
B 1
12.5%
F 1
12.5%
N 1
12.5%
T 1
12.5%
M 1
12.5%
P 1
12.5%
L 1
12.5%
R 1
12.5%
Open Punctuation
ValueCountFrequency (%)
( 91
100.0%
Close Punctuation
ValueCountFrequency (%)
) 91
100.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1124
83.8%
Common 209
 
15.6%
Latin 8
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
112
 
10.0%
36
 
3.2%
34
 
3.0%
32
 
2.8%
30
 
2.7%
27
 
2.4%
26
 
2.3%
25
 
2.2%
22
 
2.0%
18
 
1.6%
Other values (209) 762
67.8%
Latin
ValueCountFrequency (%)
B 1
12.5%
F 1
12.5%
N 1
12.5%
T 1
12.5%
M 1
12.5%
P 1
12.5%
L 1
12.5%
R 1
12.5%
Common
ValueCountFrequency (%)
( 91
43.5%
) 91
43.5%
25
 
12.0%
. 2
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1124
83.8%
ASCII 217
 
16.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
112
 
10.0%
36
 
3.2%
34
 
3.0%
32
 
2.8%
30
 
2.7%
27
 
2.4%
26
 
2.3%
25
 
2.2%
22
 
2.0%
18
 
1.6%
Other values (209) 762
67.8%
ASCII
ValueCountFrequency (%)
( 91
41.9%
) 91
41.9%
25
 
11.5%
. 2
 
0.9%
B 1
 
0.5%
F 1
 
0.5%
N 1
 
0.5%
T 1
 
0.5%
M 1
 
0.5%
P 1
 
0.5%
Other values (2) 2
 
0.9%

전화번호
Text

MISSING 

Distinct143
Distinct (%)94.1%
Missing27
Missing (%)15.1%
Memory size1.5 KiB
2024-01-10T08:15:14.238158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1824
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)88.8%

Sample

1st row041-843-5112
2nd row041-835-0030
3rd row041-833-3865
4th row041-836-0210
5th row041-837-0780
ValueCountFrequency (%)
041-837-4416 3
 
2.0%
041-836-5548 2
 
1.3%
041-835-8555 2
 
1.3%
041-832-1286 2
 
1.3%
041-734-7178 2
 
1.3%
041-837-0153 2
 
1.3%
041-832-1571 2
 
1.3%
041-837-0151 2
 
1.3%
041-834-6556 1
 
0.7%
041-836-2030 1
 
0.7%
Other values (133) 133
87.5%
2024-01-10T08:15:14.578857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 304
16.7%
0 248
13.6%
1 222
12.2%
3 220
12.1%
4 218
12.0%
8 218
12.0%
2 97
 
5.3%
5 91
 
5.0%
7 90
 
4.9%
6 81
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1520
83.3%
Dash Punctuation 304
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 248
16.3%
1 222
14.6%
3 220
14.5%
4 218
14.3%
8 218
14.3%
2 97
 
6.4%
5 91
 
6.0%
7 90
 
5.9%
6 81
 
5.3%
9 35
 
2.3%
Dash Punctuation
ValueCountFrequency (%)
- 304
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1824
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 304
16.7%
0 248
13.6%
1 222
12.2%
3 220
12.1%
4 218
12.0%
8 218
12.0%
2 97
 
5.3%
5 91
 
5.0%
7 90
 
4.9%
6 81
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1824
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 304
16.7%
0 248
13.6%
1 222
12.2%
3 220
12.1%
4 218
12.0%
8 218
12.0%
2 97
 
5.3%
5 91
 
5.0%
7 90
 
4.9%
6 81
 
4.4%

팩스번호
Text

MISSING 

Distinct94
Distinct (%)94.0%
Missing79
Missing (%)44.1%
Memory size1.5 KiB
2024-01-10T08:15:14.822502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1200
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique88 ?
Unique (%)88.0%

Sample

1st row041-834-3865
2nd row041-836-0215
3rd row041-837-0781
4th row041-837-8889
5th row041-832-3232
ValueCountFrequency (%)
041-837-0153 2
 
2.0%
041-833-8503 2
 
2.0%
041-836-8553 2
 
2.0%
041-736-7177 2
 
2.0%
041-833-4415 2
 
2.0%
041-834-7707 2
 
2.0%
041-830-2698 1
 
1.0%
041-836-2568 1
 
1.0%
041-833-1546 1
 
1.0%
041-832-1359 1
 
1.0%
Other values (84) 84
84.0%
2024-01-10T08:15:15.173020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 200
16.7%
4 157
13.1%
3 153
12.8%
8 147
12.2%
1 142
11.8%
0 140
11.7%
7 68
 
5.7%
6 60
 
5.0%
5 52
 
4.3%
2 50
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1000
83.3%
Dash Punctuation 200
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 157
15.7%
3 153
15.3%
8 147
14.7%
1 142
14.2%
0 140
14.0%
7 68
6.8%
6 60
 
6.0%
5 52
 
5.2%
2 50
 
5.0%
9 31
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 200
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1200
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 200
16.7%
4 157
13.1%
3 153
12.8%
8 147
12.2%
1 142
11.8%
0 140
11.7%
7 68
 
5.7%
6 60
 
5.0%
5 52
 
4.3%
2 50
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1200
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 200
16.7%
4 157
13.1%
3 153
12.8%
8 147
12.2%
1 142
11.8%
0 140
11.7%
7 68
 
5.7%
6 60
 
5.0%
5 52
 
4.3%
2 50
 
4.2%
Distinct167
Distinct (%)93.3%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-01-10T08:15:15.405035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length41
Mean length24.659218
Min length18

Characters and Unicode

Total characters4414
Distinct characters117
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique159 ?
Unique (%)88.8%

Sample

1st row충청남도 부여군 은산면 충의로 602-18
2nd row충청남도 부여군 규암면 내동로 11
3rd row충청남도 부여군 임천면 가림로 618 (총 6 필지)
4th row충청남도 부여군 석성면 석성남로 45
5th row충청남도 부여군 구룡면 흥수로 397-5
ValueCountFrequency (%)
충청남도 179
 
17.1%
부여군 179
 
17.1%
필지 42
 
4.0%
42
 
4.0%
은산면 34
 
3.2%
석성면 34
 
3.2%
초촌면 19
 
1.8%
장암면 18
 
1.7%
충의로 18
 
1.7%
규암면 15
 
1.4%
Other values (254) 469
44.7%
2024-01-10T08:15:15.747119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
872
19.8%
221
 
5.0%
193
 
4.4%
192
 
4.3%
189
 
4.3%
182
 
4.1%
181
 
4.1%
180
 
4.1%
169
 
3.8%
153
 
3.5%
Other values (107) 1882
42.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2635
59.7%
Space Separator 872
 
19.8%
Decimal Number 733
 
16.6%
Dash Punctuation 71
 
1.6%
Close Punctuation 48
 
1.1%
Open Punctuation 48
 
1.1%
Other Punctuation 7
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
221
 
8.4%
193
 
7.3%
192
 
7.3%
189
 
7.2%
182
 
6.9%
181
 
6.9%
180
 
6.8%
169
 
6.4%
153
 
5.8%
80
 
3.0%
Other values (92) 895
34.0%
Decimal Number
ValueCountFrequency (%)
2 132
18.0%
1 108
14.7%
4 83
11.3%
6 80
10.9%
3 79
10.8%
5 73
10.0%
0 60
8.2%
9 43
 
5.9%
7 38
 
5.2%
8 37
 
5.0%
Space Separator
ValueCountFrequency (%)
872
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 71
100.0%
Close Punctuation
ValueCountFrequency (%)
) 48
100.0%
Open Punctuation
ValueCountFrequency (%)
( 48
100.0%
Other Punctuation
ValueCountFrequency (%)
, 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2635
59.7%
Common 1779
40.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
221
 
8.4%
193
 
7.3%
192
 
7.3%
189
 
7.2%
182
 
6.9%
181
 
6.9%
180
 
6.8%
169
 
6.4%
153
 
5.8%
80
 
3.0%
Other values (92) 895
34.0%
Common
ValueCountFrequency (%)
872
49.0%
2 132
 
7.4%
1 108
 
6.1%
4 83
 
4.7%
6 80
 
4.5%
3 79
 
4.4%
5 73
 
4.1%
- 71
 
4.0%
0 60
 
3.4%
) 48
 
2.7%
Other values (5) 173
 
9.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2635
59.7%
ASCII 1779
40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
872
49.0%
2 132
 
7.4%
1 108
 
6.1%
4 83
 
4.7%
6 80
 
4.5%
3 79
 
4.4%
5 73
 
4.1%
- 71
 
4.0%
0 60
 
3.4%
) 48
 
2.7%
Other values (5) 173
 
9.7%
Hangul
ValueCountFrequency (%)
221
 
8.4%
193
 
7.3%
192
 
7.3%
189
 
7.2%
182
 
6.9%
181
 
6.9%
180
 
6.8%
169
 
6.4%
153
 
5.8%
80
 
3.0%
Other values (92) 895
34.0%
Distinct165
Distinct (%)92.2%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-01-10T08:15:16.035384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length40
Mean length25.027933
Min length20

Characters and Unicode

Total characters4480
Distinct characters111
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique155 ?
Unique (%)86.6%

Sample

1st row충청남도 부여군 은산면 은산리 20-2번지
2nd row충청남도 부여군 규암면 외리 94-2번지
3rd row충청남도 부여군 임천면 군사리 516번지 외 5 필지
4th row충청남도 부여군 석성면 석성리 896-1번지
5th row충청남도 부여군 구룡면 구봉리 511-4번지
ValueCountFrequency (%)
충청남도 179
17.4%
부여군 179
17.4%
39
 
3.8%
필지 37
 
3.6%
석성면 34
 
3.3%
은산면 34
 
3.3%
은산리 25
 
2.4%
증산리 25
 
2.4%
초촌면 19
 
1.9%
장암면 18
 
1.8%
Other values (241) 438
42.6%
2024-01-10T08:15:16.450457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
854
19.1%
228
 
5.1%
189
 
4.2%
189
 
4.2%
183
 
4.1%
183
 
4.1%
182
 
4.1%
181
 
4.0%
181
 
4.0%
181
 
4.0%
Other values (101) 1929
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2824
63.0%
Space Separator 854
 
19.1%
Decimal Number 680
 
15.2%
Dash Punctuation 116
 
2.6%
Close Punctuation 3
 
0.1%
Open Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
228
 
8.1%
189
 
6.7%
189
 
6.7%
183
 
6.5%
183
 
6.5%
182
 
6.4%
181
 
6.4%
181
 
6.4%
181
 
6.4%
181
 
6.4%
Other values (87) 946
33.5%
Decimal Number
ValueCountFrequency (%)
1 128
18.8%
2 113
16.6%
3 78
11.5%
6 64
9.4%
0 62
9.1%
5 56
8.2%
4 55
8.1%
7 47
 
6.9%
9 40
 
5.9%
8 37
 
5.4%
Space Separator
ValueCountFrequency (%)
854
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 116
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2824
63.0%
Common 1656
37.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
228
 
8.1%
189
 
6.7%
189
 
6.7%
183
 
6.5%
183
 
6.5%
182
 
6.4%
181
 
6.4%
181
 
6.4%
181
 
6.4%
181
 
6.4%
Other values (87) 946
33.5%
Common
ValueCountFrequency (%)
854
51.6%
1 128
 
7.7%
- 116
 
7.0%
2 113
 
6.8%
3 78
 
4.7%
6 64
 
3.9%
0 62
 
3.7%
5 56
 
3.4%
4 55
 
3.3%
7 47
 
2.8%
Other values (4) 83
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2824
63.0%
ASCII 1656
37.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
854
51.6%
1 128
 
7.7%
- 116
 
7.0%
2 113
 
6.8%
3 78
 
4.7%
6 64
 
3.9%
0 62
 
3.7%
5 56
 
3.4%
4 55
 
3.3%
7 47
 
2.8%
Other values (4) 83
 
5.0%
Hangul
ValueCountFrequency (%)
228
 
8.1%
189
 
6.7%
189
 
6.7%
183
 
6.5%
183
 
6.5%
182
 
6.4%
181
 
6.4%
181
 
6.4%
181
 
6.4%
181
 
6.4%
Other values (87) 946
33.5%
Distinct114
Distinct (%)63.7%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-01-10T08:15:16.764300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length24
Mean length17.452514
Min length6

Characters and Unicode

Total characters3124
Distinct characters192
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)46.4%

Sample

1st row설치용 및 위생용 플라스틱제품 제조업
2nd row금속 문, 창, 셔터 및 관련제품 제조업 외 1 종
3rd row콘크리트 관 및 기타 구조용 콘크리트 제품 제조업 외 2 종
4th row부직포 및 펠트 제조업
5th row판지 상자 및 용기 제조업
ValueCountFrequency (%)
제조업 151
 
14.6%
99
 
9.6%
88
 
8.5%
85
 
8.2%
1 50
 
4.8%
기타 33
 
3.2%
콘크리트 18
 
1.7%
제품 17
 
1.6%
3 15
 
1.5%
곡물 15
 
1.5%
Other values (188) 461
44.7%
2024-01-10T08:15:17.215792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
853
27.3%
225
 
7.2%
184
 
5.9%
180
 
5.8%
101
 
3.2%
91
 
2.9%
85
 
2.7%
73
 
2.3%
61
 
2.0%
1 52
 
1.7%
Other values (182) 1219
39.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2145
68.7%
Space Separator 853
 
27.3%
Decimal Number 91
 
2.9%
Other Punctuation 31
 
1.0%
Open Punctuation 2
 
0.1%
Close Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
225
 
10.5%
184
 
8.6%
180
 
8.4%
101
 
4.7%
91
 
4.2%
85
 
4.0%
73
 
3.4%
61
 
2.8%
46
 
2.1%
38
 
1.8%
Other values (171) 1061
49.5%
Decimal Number
ValueCountFrequency (%)
1 52
57.1%
3 16
 
17.6%
2 13
 
14.3%
5 5
 
5.5%
4 4
 
4.4%
7 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
, 30
96.8%
. 1
 
3.2%
Space Separator
ValueCountFrequency (%)
853
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2145
68.7%
Common 979
31.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
225
 
10.5%
184
 
8.6%
180
 
8.4%
101
 
4.7%
91
 
4.2%
85
 
4.0%
73
 
3.4%
61
 
2.8%
46
 
2.1%
38
 
1.8%
Other values (171) 1061
49.5%
Common
ValueCountFrequency (%)
853
87.1%
1 52
 
5.3%
, 30
 
3.1%
3 16
 
1.6%
2 13
 
1.3%
5 5
 
0.5%
4 4
 
0.4%
( 2
 
0.2%
) 2
 
0.2%
7 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2143
68.6%
ASCII 979
31.3%
Compat Jamo 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
853
87.1%
1 52
 
5.3%
, 30
 
3.1%
3 16
 
1.6%
2 13
 
1.3%
5 5
 
0.5%
4 4
 
0.4%
( 2
 
0.2%
) 2
 
0.2%
7 1
 
0.1%
Hangul
ValueCountFrequency (%)
225
 
10.5%
184
 
8.6%
180
 
8.4%
101
 
4.7%
91
 
4.2%
85
 
4.0%
73
 
3.4%
61
 
2.8%
46
 
2.1%
38
 
1.8%
Other values (170) 1059
49.4%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Distinct155
Distinct (%)86.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-01-10T08:15:17.500618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length26
Mean length8.6927374
Min length1

Characters and Unicode

Total characters1556
Distinct characters303
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique146 ?
Unique (%)81.6%

Sample

1st row정화조
2nd row하이샷시
3rd row폴리머콘크리트제품, 합성수지제측구수로관, 밸브실
4th row부직포
5th row포장용 판지상자
ValueCountFrequency (%)
백미 11
 
3.8%
레미콘 6
 
2.1%
6
 
2.1%
건축용석재 3
 
1.0%
아스콘 3
 
1.0%
3
 
1.0%
조미식품 2
 
0.7%
절임식품 2
 
0.7%
활성탄소 2
 
0.7%
가공품 2
 
0.7%
Other values (243) 250
86.2%
2024-01-10T08:15:17.900650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 112
 
7.2%
111
 
7.1%
34
 
2.2%
30
 
1.9%
29
 
1.9%
29
 
1.9%
28
 
1.8%
28
 
1.8%
24
 
1.5%
23
 
1.5%
Other values (293) 1108
71.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1250
80.3%
Other Punctuation 118
 
7.6%
Space Separator 111
 
7.1%
Uppercase Letter 57
 
3.7%
Open Punctuation 8
 
0.5%
Close Punctuation 8
 
0.5%
Lowercase Letter 2
 
0.1%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
2.7%
30
 
2.4%
29
 
2.3%
29
 
2.3%
28
 
2.2%
28
 
2.2%
24
 
1.9%
23
 
1.8%
18
 
1.4%
17
 
1.4%
Other values (266) 990
79.2%
Uppercase Letter
ValueCountFrequency (%)
P 11
19.3%
E 9
15.8%
C 6
10.5%
T 4
 
7.0%
R 3
 
5.3%
F 3
 
5.3%
V 3
 
5.3%
L 3
 
5.3%
S 3
 
5.3%
A 2
 
3.5%
Other values (9) 10
17.5%
Other Punctuation
ValueCountFrequency (%)
, 112
94.9%
. 4
 
3.4%
· 2
 
1.7%
Space Separator
ValueCountFrequency (%)
111
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Lowercase Letter
ValueCountFrequency (%)
p 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1250
80.3%
Common 247
 
15.9%
Latin 59
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
2.7%
30
 
2.4%
29
 
2.3%
29
 
2.3%
28
 
2.2%
28
 
2.2%
24
 
1.9%
23
 
1.8%
18
 
1.4%
17
 
1.4%
Other values (266) 990
79.2%
Latin
ValueCountFrequency (%)
P 11
18.6%
E 9
15.3%
C 6
10.2%
T 4
 
6.8%
R 3
 
5.1%
F 3
 
5.1%
V 3
 
5.1%
L 3
 
5.1%
S 3
 
5.1%
p 2
 
3.4%
Other values (10) 12
20.3%
Common
ValueCountFrequency (%)
, 112
45.3%
111
44.9%
( 8
 
3.2%
) 8
 
3.2%
. 4
 
1.6%
· 2
 
0.8%
- 2
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1250
80.3%
ASCII 304
 
19.5%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 112
36.8%
111
36.5%
P 11
 
3.6%
E 9
 
3.0%
( 8
 
2.6%
) 8
 
2.6%
C 6
 
2.0%
. 4
 
1.3%
T 4
 
1.3%
R 3
 
1.0%
Other values (16) 28
 
9.2%
Hangul
ValueCountFrequency (%)
34
 
2.7%
30
 
2.4%
29
 
2.3%
29
 
2.3%
28
 
2.2%
28
 
2.2%
24
 
1.9%
23
 
1.8%
18
 
1.4%
17
 
1.4%
Other values (266) 990
79.2%
None
ValueCountFrequency (%)
· 2
100.0%
Distinct169
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-01-10T08:15:18.207276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.0893855
Min length2

Characters and Unicode

Total characters553
Distinct characters130
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique161 ?
Unique (%)89.9%

Sample

1st row이현주
2nd row정문순
3rd row민병윤
4th row김학자
5th row김옥기
ValueCountFrequency (%)
강신황 3
 
1.7%
차선희 3
 
1.7%
홍영관 2
 
1.1%
천성룡 2
 
1.1%
김종완 2
 
1.1%
이현정 2
 
1.1%
이주현 2
 
1.1%
윤신근 2
 
1.1%
오규환 1
 
0.6%
윤여정 1
 
0.6%
Other values (161) 161
89.0%
2024-01-10T08:15:18.632405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
31
 
5.6%
28
 
5.1%
21
 
3.8%
17
 
3.1%
15
 
2.7%
14
 
2.5%
14
 
2.5%
13
 
2.4%
13
 
2.4%
12
 
2.2%
Other values (120) 375
67.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 546
98.7%
Decimal Number 3
 
0.5%
Space Separator 2
 
0.4%
Other Punctuation 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
5.7%
28
 
5.1%
21
 
3.8%
17
 
3.1%
15
 
2.7%
14
 
2.6%
14
 
2.6%
13
 
2.4%
13
 
2.4%
12
 
2.2%
Other values (116) 368
67.4%
Decimal Number
ValueCountFrequency (%)
2 2
66.7%
1 1
33.3%
Space Separator
ValueCountFrequency (%)
2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 546
98.7%
Common 7
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
5.7%
28
 
5.1%
21
 
3.8%
17
 
3.1%
15
 
2.7%
14
 
2.6%
14
 
2.6%
13
 
2.4%
13
 
2.4%
12
 
2.2%
Other values (116) 368
67.4%
Common
ValueCountFrequency (%)
2
28.6%
2 2
28.6%
, 2
28.6%
1 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 546
98.7%
ASCII 7
 
1.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
31
 
5.7%
28
 
5.1%
21
 
3.8%
17
 
3.1%
15
 
2.7%
14
 
2.6%
14
 
2.6%
13
 
2.4%
13
 
2.4%
12
 
2.2%
Other values (116) 368
67.4%
ASCII
ValueCountFrequency (%)
2
28.6%
2 2
28.6%
, 2
28.6%
1 1
14.3%

운영구분
Categorical

IMBALANCE 

Distinct5
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
<NA>
129 
휴업
36 
폐업
 
12
폐업예정
 
1
휴업예정
 
1

Length

Max length4
Median length4
Mean length3.4636872
Min length2

Unique

Unique2 ?
Unique (%)1.1%

Sample

1st row휴업
2nd row<NA>
3rd row<NA>
4th row휴업
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 129
72.1%
휴업 36
 
20.1%
폐업 12
 
6.7%
폐업예정 1
 
0.6%
휴업예정 1
 
0.6%

Length

2024-01-10T08:15:18.774866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T08:15:18.885131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 129
72.1%
휴업 36
 
20.1%
폐업 12
 
6.7%
폐업예정 1
 
0.6%
휴업예정 1
 
0.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2021-09-24
179 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-09-24
2nd row2021-09-24
3rd row2021-09-24
4th row2021-09-24
5th row2021-09-24

Common Values

ValueCountFrequency (%)
2021-09-24 179
100.0%

Length

2024-01-10T08:15:18.995327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T08:15:19.086013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-09-24 179
100.0%

Correlations

2024-01-10T08:15:19.142498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
팩스번호운영구분
팩스번호1.0001.000
운영구분1.0001.000

Missing values

2024-01-10T08:15:13.004431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T08:15:13.136459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T08:15:13.233247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

회사명전화번호팩스번호도로명주소지번주소업종명생산품대표자명운영구분데이터기준일자
0(유)미주산업041-843-5112<NA>충청남도 부여군 은산면 충의로 602-18충청남도 부여군 은산면 은산리 20-2번지설치용 및 위생용 플라스틱제품 제조업정화조이현주휴업2021-09-24
1(주) 건영에스피지041-835-0030<NA>충청남도 부여군 규암면 내동로 11충청남도 부여군 규암면 외리 94-2번지금속 문, 창, 셔터 및 관련제품 제조업 외 1 종하이샷시정문순<NA>2021-09-24
2(주)계림폴리콘041-833-3865041-834-3865충청남도 부여군 임천면 가림로 618 (총 6 필지)충청남도 부여군 임천면 군사리 516번지 외 5 필지콘크리트 관 및 기타 구조용 콘크리트 제품 제조업 외 2 종폴리머콘크리트제품, 합성수지제측구수로관, 밸브실민병윤<NA>2021-09-24
3(주)골든구스041-836-0210041-836-0215충청남도 부여군 석성면 석성남로 45충청남도 부여군 석성면 석성리 896-1번지부직포 및 펠트 제조업부직포김학자휴업2021-09-24
4(주)광일산업<NA><NA>충청남도 부여군 구룡면 흥수로 397-5충청남도 부여군 구룡면 구봉리 511-4번지판지 상자 및 용기 제조업포장용 판지상자김옥기<NA>2021-09-24
5(주)그린켐텍041-837-0780041-837-0781충청남도 부여군 석성면 석성로 224-7 (총 2 필지)충청남도 부여군 석성면 봉정리 780번지 외 1 필지천막, 텐트 및 유사 제품 제조업 외 1 종PE타포린강대영<NA>2021-09-24
6(주)금강041-837-8880041-837-8889충청남도 부여군 은산면 가중리 558번지충청남도 부여군 은산면 가중리 558번지육상 금속 골조 구조재 제조업 외 13 종제진기, 수문, 밸브, 펌프이종현<NA>2021-09-24
7(주)금강필텍041-832-3222041-832-3232충청남도 부여군 은산면 충의로622번길 23충청남도 부여군 은산면 은산리 13번지기체 여과기 제조업 외 1 종필터(차량용)이석현휴업2021-09-24
8(주)네비엔 부여사업소041-837-0126<NA>충청남도 부여군 은산면 충의로622번길 38충청남도 부여군 은산면 은산리 17번지그 외 기타 분류 안된 화학제품 제조업 외 3 종활성탄소 외 관련제품국만호<NA>2021-09-24
9(주)네오캠041-837-3711<NA>충청남도 부여군 석성면 증산천길 140 (총 3 필지)충청남도 부여군 석성면 증산리 369번지 외 2 필지천막, 텐트 및 유사 제품 제조업 외 3 종방수천막오광수<NA>2021-09-24
회사명전화번호팩스번호도로명주소지번주소업종명생산품대표자명운영구분데이터기준일자
169한국유기농업개발(주)041-834-8991041-834-0104충청남도 부여군 임천면 부흥로171번길 15충청남도 부여군 임천면 칠산리 291번지복합비료 및 기타 화학비료 제조업 외 3 종미생물, 단미사료신통열<NA>2021-09-24
170한국장애인편의증진협의회(주)<NA><NA>충청남도 부여군 은산면 충의로 600충청남도 부여군 은산면 은산리 25-13번지콘크리트 타일, 기와, 벽돌 및 블록 제조업 외 1 종콘크리트타일, 보도블록, 벽돌, 볼라드송미연<NA>2021-09-24
171한국조폐공사 부여조폐창041-830-5203<NA>충청남도 부여군 부여읍 염창로180번길 67 (총 5 필지)충청남도 부여군 부여읍 염창리 17번지 외 4 필지인쇄용 및 필기용 원지 제조업 외 3 종은행권용지,특수용지,전자카드조용만<NA>2021-09-24
172한신이엔에스041-832-2888<NA>충청남도 부여군 장암면 위덕로445번길 40충청남도 부여군 장암면 상황리 328-5번지플라스틱 선, 봉, 관 및 호스 제조업상하수도 파이프정은희<NA>2021-09-24
173한타산업041-836-2914<NA>충청남도 부여군 석성면 왕릉로 657충청남도 부여군 석성면 증산리 1308-6번지천막, 텐트 및 유사 제품 제조업 외 1 종PE방수천막김삼영<NA>2021-09-24
174현일섬유041-835-6027<NA>충청남도 부여군 규암면 충절로 2266충청남도 부여군 규암면 외리 264-5번지속옷 및 잠옷 제조업메리야스서유자휴업2021-09-24
175홍산농업협동조합041-835-1280041-836-1282충청남도 부여군 홍산면 대백제로 822충청남도 부여군 홍산면 좌홍리 178-10번지곡물 도정업백미김영채<NA>2021-09-24
176홍산정미소041-835-1077041-836-1077충청남도 부여군 홍산면 남촌리 122번지 외 2필지충청남도 부여군 홍산면 남촌리 122번지 외 2필지곡물 도정업백미이재희<NA>2021-09-24
177후레시팜041-832-9479041-834-8443충청남도 부여군 은산면 충의로 602-21충청남도 부여군 은산면 은산리 25-28번지면류, 마카로니 및 유사식품 제조업 외 3 종면류,육수류,조미료류이강영폐업2021-09-24
178휴먼스화공(주)041-832-7701041-832-7726충청남도 부여군 초촌면 신암로 244충청남도 부여군 초촌면 신암리 147번지화약 및 불꽃제품 제조업 외 1 종폭죽송민규<NA>2021-09-24