Overview

Dataset statistics

Number of variables5
Number of observations236
Missing cells38
Missing cells (%)3.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.3 KiB
Average record size in memory40.6 B

Variable types

Text4
Categorical1

Dataset

Description기업명, 주소, 전화번호, 생산품 등의 기준으로 구분되어 기재되어있는 충청북도 제천시 기업체 현황에 대한 정보를 제공합니다.
Author충청북도 제천시
URLhttps://www.data.go.kr/data/15064632/fileData.do

Alerts

데이터기준일 has constant value ""Constant
전화번호 has 38 (16.1%) missing valuesMissing

Reproduction

Analysis started2023-12-12 11:57:31.157930
Analysis finished2023-12-12 11:57:31.870560
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct232
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-12T20:57:32.138443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length13
Mean length7.7754237
Min length2

Characters and Unicode

Total characters1835
Distinct characters274
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique229 ?
Unique (%)97.0%

Sample

1st row(유)진우건설
2nd row(재)충북테크노파크
3rd row(주)게스코리아
4th row(주)경방
5th row(주)경우크린텍
ValueCountFrequency (%)
주식회사 32
 
11.3%
제천공장 3
 
1.1%
대림비앤코(주 3
 
1.1%
제2공장 3
 
1.1%
주)원일바이오 3
 
1.1%
주)위매스 3
 
1.1%
주)혜성 2
 
0.7%
주)풀잎라인 2
 
0.7%
주)휴온스 2
 
0.7%
성만기계공업(주 1
 
0.4%
Other values (228) 228
80.9%
2023-12-12T20:57:32.555285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
194
 
10.6%
) 166
 
9.0%
( 166
 
9.0%
56
 
3.1%
55
 
3.0%
46
 
2.5%
38
 
2.1%
37
 
2.0%
37
 
2.0%
32
 
1.7%
Other values (264) 1008
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1420
77.4%
Close Punctuation 166
 
9.0%
Open Punctuation 166
 
9.0%
Space Separator 46
 
2.5%
Uppercase Letter 16
 
0.9%
Decimal Number 12
 
0.7%
Lowercase Letter 4
 
0.2%
Other Symbol 3
 
0.2%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
194
 
13.7%
56
 
3.9%
55
 
3.9%
38
 
2.7%
37
 
2.6%
37
 
2.6%
32
 
2.3%
28
 
2.0%
27
 
1.9%
23
 
1.6%
Other values (238) 893
62.9%
Uppercase Letter
ValueCountFrequency (%)
H 2
12.5%
F 2
12.5%
N 2
12.5%
M 1
 
6.2%
T 1
 
6.2%
S 1
 
6.2%
U 1
 
6.2%
D 1
 
6.2%
A 1
 
6.2%
K 1
 
6.2%
Other values (3) 3
18.8%
Decimal Number
ValueCountFrequency (%)
2 7
58.3%
1 2
 
16.7%
3 2
 
16.7%
4 1
 
8.3%
Lowercase Letter
ValueCountFrequency (%)
o 1
25.0%
r 1
25.0%
e 1
25.0%
a 1
25.0%
Close Punctuation
ValueCountFrequency (%)
) 166
100.0%
Open Punctuation
ValueCountFrequency (%)
( 166
100.0%
Space Separator
ValueCountFrequency (%)
46
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Other Punctuation
ValueCountFrequency (%)
& 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1423
77.5%
Common 392
 
21.4%
Latin 20
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
194
 
13.6%
56
 
3.9%
55
 
3.9%
38
 
2.7%
37
 
2.6%
37
 
2.6%
32
 
2.2%
28
 
2.0%
27
 
1.9%
23
 
1.6%
Other values (239) 896
63.0%
Latin
ValueCountFrequency (%)
H 2
 
10.0%
F 2
 
10.0%
N 2
 
10.0%
M 1
 
5.0%
T 1
 
5.0%
o 1
 
5.0%
S 1
 
5.0%
U 1
 
5.0%
D 1
 
5.0%
A 1
 
5.0%
Other values (7) 7
35.0%
Common
ValueCountFrequency (%)
) 166
42.3%
( 166
42.3%
46
 
11.7%
2 7
 
1.8%
1 2
 
0.5%
3 2
 
0.5%
& 2
 
0.5%
4 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1420
77.4%
ASCII 412
 
22.5%
None 3
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
194
 
13.7%
56
 
3.9%
55
 
3.9%
38
 
2.7%
37
 
2.6%
37
 
2.6%
32
 
2.3%
28
 
2.0%
27
 
1.9%
23
 
1.6%
Other values (238) 893
62.9%
ASCII
ValueCountFrequency (%)
) 166
40.3%
( 166
40.3%
46
 
11.2%
2 7
 
1.7%
H 2
 
0.5%
1 2
 
0.5%
3 2
 
0.5%
F 2
 
0.5%
& 2
 
0.5%
N 2
 
0.5%
Other values (15) 15
 
3.6%
None
ValueCountFrequency (%)
3
100.0%

주소
Text

Distinct189
Distinct (%)80.1%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-12T20:57:32.880827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length50
Mean length26.631356
Min length17

Characters and Unicode

Total characters6285
Distinct characters124
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique169 ?
Unique (%)71.6%

Sample

1st row충청북도 제천시 장평로 124 (왕암동)
2nd row충청북도 제천시 바이오밸리2로 41 (왕암동)
3rd row충청북도 제천시 제2바이오밸리로3길 26, 4동 (왕암동)
4th row충청북도 제천시 바이오밸리2로 41, 102호, 103호 (왕암동)
5th row충청북도 제천시 강저로2길 50, 강저농공단지 (강제동)
ValueCountFrequency (%)
제천시 237
19.0%
충청북도 236
18.9%
왕암동 118
 
9.5%
금성면 35
 
2.8%
바이오밸리1로 30
 
2.4%
바이오밸리2로 23
 
1.8%
한방엑스포로 23
 
1.8%
내토로73길 22
 
1.8%
고암동 21
 
1.7%
강제동 21
 
1.7%
Other values (195) 482
38.6%
2023-12-12T20:57:33.360795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1012
 
16.1%
303
 
4.8%
255
 
4.1%
241
 
3.8%
240
 
3.8%
240
 
3.8%
240
 
3.8%
238
 
3.8%
236
 
3.8%
197
 
3.1%
Other values (114) 3083
49.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3911
62.2%
Space Separator 1012
 
16.1%
Decimal Number 867
 
13.8%
Close Punctuation 195
 
3.1%
Open Punctuation 195
 
3.1%
Other Punctuation 55
 
0.9%
Dash Punctuation 33
 
0.5%
Uppercase Letter 17
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
303
 
7.7%
255
 
6.5%
241
 
6.2%
240
 
6.1%
240
 
6.1%
240
 
6.1%
238
 
6.1%
236
 
6.0%
197
 
5.0%
168
 
4.3%
Other values (94) 1553
39.7%
Decimal Number
ValueCountFrequency (%)
1 172
19.8%
2 171
19.7%
4 96
11.1%
3 89
10.3%
6 76
8.8%
0 69
8.0%
5 66
 
7.6%
7 58
 
6.7%
8 45
 
5.2%
9 25
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
G 13
76.5%
C 1
 
5.9%
B 1
 
5.9%
V 1
 
5.9%
A 1
 
5.9%
Space Separator
ValueCountFrequency (%)
1012
100.0%
Close Punctuation
ValueCountFrequency (%)
) 195
100.0%
Open Punctuation
ValueCountFrequency (%)
( 195
100.0%
Other Punctuation
ValueCountFrequency (%)
, 55
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3911
62.2%
Common 2357
37.5%
Latin 17
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
303
 
7.7%
255
 
6.5%
241
 
6.2%
240
 
6.1%
240
 
6.1%
240
 
6.1%
238
 
6.1%
236
 
6.0%
197
 
5.0%
168
 
4.3%
Other values (94) 1553
39.7%
Common
ValueCountFrequency (%)
1012
42.9%
) 195
 
8.3%
( 195
 
8.3%
1 172
 
7.3%
2 171
 
7.3%
4 96
 
4.1%
3 89
 
3.8%
6 76
 
3.2%
0 69
 
2.9%
5 66
 
2.8%
Other values (5) 216
 
9.2%
Latin
ValueCountFrequency (%)
G 13
76.5%
C 1
 
5.9%
B 1
 
5.9%
V 1
 
5.9%
A 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3911
62.2%
ASCII 2374
37.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1012
42.6%
) 195
 
8.2%
( 195
 
8.2%
1 172
 
7.2%
2 171
 
7.2%
4 96
 
4.0%
3 89
 
3.7%
6 76
 
3.2%
0 69
 
2.9%
5 66
 
2.8%
Other values (10) 233
 
9.8%
Hangul
ValueCountFrequency (%)
303
 
7.7%
255
 
6.5%
241
 
6.2%
240
 
6.1%
240
 
6.1%
240
 
6.1%
238
 
6.1%
236
 
6.0%
197
 
5.0%
168
 
4.3%
Other values (94) 1553
39.7%

전화번호
Text

MISSING 

Distinct182
Distinct (%)91.9%
Missing38
Missing (%)16.1%
Memory size2.0 KiB
2023-12-12T20:57:33.624766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.055556
Min length11

Characters and Unicode

Total characters2387
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique172 ?
Unique (%)86.9%

Sample

1st row043-647-8851
2nd row043-270-2600
3rd row070-7841-7766
4th row043-648-3777
5th row043-648-1701
ValueCountFrequency (%)
043-652-8711 4
 
2.0%
043-653-9700 3
 
1.5%
043-648-2620 3
 
1.5%
043-653-9181 3
 
1.5%
043-648-8878 3
 
1.5%
043-643-7500 2
 
1.0%
043-642-2151 2
 
1.0%
043-647-8851 2
 
1.0%
043-652-0057 2
 
1.0%
043-645-4100 2
 
1.0%
Other values (172) 172
86.9%
2023-12-12T20:57:34.059024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 396
16.6%
0 347
14.5%
4 346
14.5%
3 281
11.8%
6 242
10.1%
5 163
6.8%
1 148
 
6.2%
7 141
 
5.9%
2 130
 
5.4%
8 124
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1991
83.4%
Dash Punctuation 396
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 347
17.4%
4 346
17.4%
3 281
14.1%
6 242
12.2%
5 163
8.2%
1 148
7.4%
7 141
7.1%
2 130
 
6.5%
8 124
 
6.2%
9 69
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 396
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2387
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 396
16.6%
0 347
14.5%
4 346
14.5%
3 281
11.8%
6 242
10.1%
5 163
6.8%
1 148
 
6.2%
7 141
 
5.9%
2 130
 
5.4%
8 124
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2387
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 396
16.6%
0 347
14.5%
4 346
14.5%
3 281
11.8%
6 242
10.1%
5 163
6.8%
1 148
 
6.2%
7 141
 
5.9%
2 130
 
5.4%
8 124
 
5.2%
Distinct209
Distinct (%)88.6%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-12T20:57:34.355662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length90
Median length23
Mean length9.7923729
Min length1

Characters and Unicode

Total characters2311
Distinct characters365
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique197 ?
Unique (%)83.5%

Sample

1st row금속제창, 플라스틱 창호
2nd row인진쑥민들레진액
3rd row차량용 연료첨가제
4th row전병과자
5th rowFRP합병정화조
ValueCountFrequency (%)
31
 
7.0%
자동차용 19
 
4.3%
볼베어링 16
 
3.6%
화장품 6
 
1.4%
창호 5
 
1.1%
플라스틱 5
 
1.1%
건강기능식품 4
 
0.9%
부품 4
 
0.9%
자동차베어링 3
 
0.7%
마스크팩 3
 
0.7%
Other values (312) 348
78.4%
2023-12-12T20:57:34.874986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
295
 
12.8%
, 110
 
4.8%
57
 
2.5%
48
 
2.1%
42
 
1.8%
41
 
1.8%
40
 
1.7%
35
 
1.5%
34
 
1.5%
32
 
1.4%
Other values (355) 1577
68.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1747
75.6%
Space Separator 295
 
12.8%
Other Punctuation 112
 
4.8%
Uppercase Letter 75
 
3.2%
Lowercase Letter 36
 
1.6%
Close Punctuation 17
 
0.7%
Open Punctuation 17
 
0.7%
Decimal Number 12
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
3.3%
48
 
2.7%
42
 
2.4%
41
 
2.3%
40
 
2.3%
35
 
2.0%
34
 
1.9%
32
 
1.8%
32
 
1.8%
30
 
1.7%
Other values (307) 1356
77.6%
Uppercase Letter
ValueCountFrequency (%)
E 9
12.0%
L 8
10.7%
P 8
10.7%
R 7
9.3%
C 7
9.3%
A 5
 
6.7%
D 5
 
6.7%
O 4
 
5.3%
T 4
 
5.3%
I 3
 
4.0%
Other values (11) 15
20.0%
Lowercase Letter
ValueCountFrequency (%)
e 8
22.2%
d 4
11.1%
r 4
11.1%
l 4
11.1%
o 3
 
8.3%
s 2
 
5.6%
t 2
 
5.6%
p 2
 
5.6%
u 1
 
2.8%
i 1
 
2.8%
Other values (5) 5
13.9%
Decimal Number
ValueCountFrequency (%)
0 4
33.3%
1 3
25.0%
3 1
 
8.3%
4 1
 
8.3%
8 1
 
8.3%
6 1
 
8.3%
2 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
, 110
98.2%
& 2
 
1.8%
Space Separator
ValueCountFrequency (%)
295
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1747
75.6%
Common 453
 
19.6%
Latin 111
 
4.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
3.3%
48
 
2.7%
42
 
2.4%
41
 
2.3%
40
 
2.3%
35
 
2.0%
34
 
1.9%
32
 
1.8%
32
 
1.8%
30
 
1.7%
Other values (307) 1356
77.6%
Latin
ValueCountFrequency (%)
E 9
 
8.1%
e 8
 
7.2%
L 8
 
7.2%
P 8
 
7.2%
R 7
 
6.3%
C 7
 
6.3%
A 5
 
4.5%
D 5
 
4.5%
d 4
 
3.6%
O 4
 
3.6%
Other values (26) 46
41.4%
Common
ValueCountFrequency (%)
295
65.1%
, 110
 
24.3%
) 17
 
3.8%
( 17
 
3.8%
0 4
 
0.9%
1 3
 
0.7%
& 2
 
0.4%
3 1
 
0.2%
4 1
 
0.2%
8 1
 
0.2%
Other values (2) 2
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1747
75.6%
ASCII 564
 
24.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
295
52.3%
, 110
 
19.5%
) 17
 
3.0%
( 17
 
3.0%
E 9
 
1.6%
e 8
 
1.4%
L 8
 
1.4%
P 8
 
1.4%
R 7
 
1.2%
C 7
 
1.2%
Other values (38) 78
 
13.8%
Hangul
ValueCountFrequency (%)
57
 
3.3%
48
 
2.7%
42
 
2.4%
41
 
2.3%
40
 
2.3%
35
 
2.0%
34
 
1.9%
32
 
1.8%
32
 
1.8%
30
 
1.7%
Other values (307) 1356
77.6%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-09-01
236 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-09-01
2nd row2023-09-01
3rd row2023-09-01
4th row2023-09-01
5th row2023-09-01

Common Values

ValueCountFrequency (%)
2023-09-01 236
100.0%

Length

2023-12-12T20:57:35.037215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:57:35.515373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-09-01 236
100.0%

Missing values

2023-12-12T20:57:31.681610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:57:31.815066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기업명주소전화번호생산품데이터기준일
0(유)진우건설충청북도 제천시 장평로 124 (왕암동)043-647-8851금속제창, 플라스틱 창호2023-09-01
1(재)충북테크노파크충청북도 제천시 바이오밸리2로 41 (왕암동)043-270-2600인진쑥민들레진액2023-09-01
2(주)게스코리아충청북도 제천시 제2바이오밸리로3길 26, 4동 (왕암동)070-7841-7766차량용 연료첨가제2023-09-01
3(주)경방충청북도 제천시 바이오밸리2로 41, 102호, 103호 (왕암동)043-648-3777전병과자2023-09-01
4(주)경우크린텍충청북도 제천시 강저로2길 50, 강저농공단지 (강제동)043-648-1701FRP합병정화조2023-09-01
5(주)광무 제천지점충청북도 제천시 바이오밸리로 107 (왕암동)043-653-0911MCPC, OPCI22023-09-01
6(주)굿21하우징충청북도 제천시 강저로2길 16 (강제동)043-643-1900목재마루재,통나무,목블록2023-09-01
7(주)그린스케이프충청북도 제천시 금성면 양월로 46-38<NA>재활용선별압착기2023-09-01
8(주)금화충청북도 제천시 한방엑스포로 165 (왕암동)<NA>자동차용 볼베어링 등2023-09-01
9(주)넥스켐충청북도 제천시 한방엑스포로5길 45 (왕암동)043-647-1213미끄럼방지포장재2023-09-01
기업명주소전화번호생산품데이터기준일
226태양신소재(주)충청북도 제천시 금성면 청풍호로24길 8<NA>규사(실리카)2023-09-01
227테라코코리아(주)충청북도 제천시 송학면 송학로10길 21 (송학면) 외 1필지043-645-8814핸디코트,테라코트,투수골재포장재2023-09-01
228한국메탈실리콘(주)충청북도 제천시 제3산단로 277(왕암동)043-645-4164실리콘 파우더2023-09-01
229한국유기농보르도(주)충청북도 제천시 금성면 청풍호로24길 8043-647-0620비료생산(골드보르도,무색보르도)2023-09-01
230한국코러스 주식회사충청북도 제천시 강저로 30 (강제동)043-644-8457디디비캅셀외2023-09-01
231한서정밀기계충청북도 제천시 바이오밸리1로 60 (왕암동)043-648-7786철구조물2023-09-01
232한연산업(주)충청북도 제천시 바이오밸리3로 24 (왕암동)043-651-6933산업용필터,여과기2023-09-01
233홍원정공(주)충청북도 제천시 제2바이오밸리로 63 (왕암동)053-356-8955단조금형2023-09-01
234SUNDA Korea㈜충청북도 제천시 양월로 46-66<NA>태양열집열기2023-09-01
235정도충청북도 제천시 송학면 송학로10길 41<NA>접착제2023-09-01