Overview

Dataset statistics

Number of variables4
Number of observations108
Missing cells1
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.5 KiB
Average record size in memory33.2 B

Variable types

Text4

Dataset

Description충청남도 및 시군에 등록된 통합환경관리 대상 사업장에 대한 일반 현황을 제공합니다. 상호, 소재지, 전화번호, 업종 등을 개방하고 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=373&beforeMenuCd=DOM_000000201001001000&publicdatapk=15075648

Alerts

상호 has unique valuesUnique

Reproduction

Analysis started2024-01-09 19:45:52.524537
Analysis finished2024-01-09 19:45:53.146755
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

UNIQUE 

Distinct108
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size996.0 B
2024-01-10T04:45:53.270410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length15
Mean length8.3703704
Min length3

Characters and Unicode

Total characters904
Distinct characters199
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)100.0%

Sample

1st row㈜세람그린에너지
2nd row농업회사법인(주)신우에프에스
3rd row대주중공업㈜ 공주지점
4th row엘에스메탈㈜
5th row㈜휴스틸
ValueCountFrequency (%)
대산공장 2
 
1.5%
아산공장 2
 
1.5%
㈜세람그린에너지 1
 
0.8%
당진지점 1
 
0.8%
하나마이크론㈜ 1
 
0.8%
서산지점 1
 
0.8%
㈜크레아 1
 
0.8%
현대트랜시스㈜지곡 1
 
0.8%
한국지엠㈜보령공장 1
 
0.8%
성연 1
 
0.8%
Other values (118) 118
90.8%
2024-01-10T04:45:53.551687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
89
 
9.8%
32
 
3.5%
30
 
3.3%
24
 
2.7%
22
 
2.4%
22
 
2.4%
20
 
2.2%
18
 
2.0%
17
 
1.9%
) 16
 
1.8%
Other values (189) 614
67.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 721
79.8%
Other Symbol 89
 
9.8%
Space Separator 32
 
3.5%
Close Punctuation 16
 
1.8%
Open Punctuation 16
 
1.8%
Uppercase Letter 14
 
1.5%
Decimal Number 12
 
1.3%
Dash Punctuation 4
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
4.2%
24
 
3.3%
22
 
3.1%
22
 
3.1%
20
 
2.8%
18
 
2.5%
17
 
2.4%
16
 
2.2%
16
 
2.2%
14
 
1.9%
Other values (165) 522
72.4%
Uppercase Letter
ValueCountFrequency (%)
L 2
14.3%
A 2
14.3%
M 2
14.3%
T 1
7.1%
E 1
7.1%
C 1
7.1%
P 1
7.1%
I 1
7.1%
S 1
7.1%
O 1
7.1%
Decimal Number
ValueCountFrequency (%)
1 4
33.3%
8 2
16.7%
2 1
 
8.3%
0 1
 
8.3%
3 1
 
8.3%
7 1
 
8.3%
6 1
 
8.3%
4 1
 
8.3%
Other Symbol
ValueCountFrequency (%)
89
100.0%
Space Separator
ValueCountFrequency (%)
32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 810
89.6%
Common 80
 
8.8%
Latin 14
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
 
11.0%
30
 
3.7%
24
 
3.0%
22
 
2.7%
22
 
2.7%
20
 
2.5%
18
 
2.2%
17
 
2.1%
16
 
2.0%
16
 
2.0%
Other values (166) 536
66.2%
Common
ValueCountFrequency (%)
32
40.0%
) 16
20.0%
( 16
20.0%
1 4
 
5.0%
- 4
 
5.0%
8 2
 
2.5%
2 1
 
1.2%
0 1
 
1.2%
3 1
 
1.2%
7 1
 
1.2%
Other values (2) 2
 
2.5%
Latin
ValueCountFrequency (%)
L 2
14.3%
A 2
14.3%
M 2
14.3%
T 1
7.1%
E 1
7.1%
C 1
7.1%
P 1
7.1%
I 1
7.1%
S 1
7.1%
O 1
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 721
79.8%
ASCII 94
 
10.4%
None 89
 
9.8%

Most frequent character per block

None
ValueCountFrequency (%)
89
100.0%
ASCII
ValueCountFrequency (%)
32
34.0%
) 16
17.0%
( 16
17.0%
1 4
 
4.3%
- 4
 
4.3%
8 2
 
2.1%
L 2
 
2.1%
A 2
 
2.1%
M 2
 
2.1%
T 1
 
1.1%
Other values (13) 13
13.8%
Hangul
ValueCountFrequency (%)
30
 
4.2%
24
 
3.3%
22
 
3.1%
22
 
3.1%
20
 
2.8%
18
 
2.5%
17
 
2.4%
16
 
2.2%
16
 
2.2%
14
 
1.9%
Other values (165) 522
72.4%
Distinct102
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size996.0 B
2024-01-10T04:45:53.812507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length25
Mean length20.203704
Min length14

Characters and Unicode

Total characters2182
Distinct characters144
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique97 ?
Unique (%)89.8%

Sample

1st row충남 금산군 부리면 선원리 663-4.3
2nd row충남 서산시 서산시 고북면 고북1로 343-22
3rd row충남 공주시 공단길 14-29
4th row충남 서천군 장항읍 화송길 123
5th row충남 당진시 송악읍 부곡공단로 131
ValueCountFrequency (%)
충남 108
 
19.5%
서산시 24
 
4.3%
천안시 22
 
4.0%
당진시 15
 
2.7%
아산시 15
 
2.7%
대산읍 14
 
2.5%
동남구 12
 
2.2%
송악읍 9
 
1.6%
서북구 9
 
1.6%
보령시 5
 
0.9%
Other values (238) 320
57.9%
2024-01-10T04:45:54.151742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
445
20.4%
123
 
5.6%
117
 
5.4%
89
 
4.1%
88
 
4.0%
1 78
 
3.6%
76
 
3.5%
2 57
 
2.6%
56
 
2.6%
3 49
 
2.2%
Other values (134) 1004
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1293
59.3%
Space Separator 445
 
20.4%
Decimal Number 409
 
18.7%
Dash Punctuation 34
 
1.6%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
123
 
9.5%
117
 
9.0%
89
 
6.9%
88
 
6.8%
76
 
5.9%
56
 
4.3%
40
 
3.1%
39
 
3.0%
36
 
2.8%
32
 
2.5%
Other values (121) 597
46.2%
Decimal Number
ValueCountFrequency (%)
1 78
19.1%
2 57
13.9%
3 49
12.0%
4 47
11.5%
5 43
10.5%
6 37
9.0%
8 35
8.6%
7 27
 
6.6%
0 21
 
5.1%
9 15
 
3.7%
Space Separator
ValueCountFrequency (%)
445
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 34
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1293
59.3%
Common 889
40.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
123
 
9.5%
117
 
9.0%
89
 
6.9%
88
 
6.8%
76
 
5.9%
56
 
4.3%
40
 
3.1%
39
 
3.0%
36
 
2.8%
32
 
2.5%
Other values (121) 597
46.2%
Common
ValueCountFrequency (%)
445
50.1%
1 78
 
8.8%
2 57
 
6.4%
3 49
 
5.5%
4 47
 
5.3%
5 43
 
4.8%
6 37
 
4.2%
8 35
 
3.9%
- 34
 
3.8%
7 27
 
3.0%
Other values (3) 37
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1293
59.3%
ASCII 889
40.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
445
50.1%
1 78
 
8.8%
2 57
 
6.4%
3 49
 
5.5%
4 47
 
5.3%
5 43
 
4.8%
6 37
 
4.2%
8 35
 
3.9%
- 34
 
3.8%
7 27
 
3.0%
Other values (3) 37
 
4.2%
Hangul
ValueCountFrequency (%)
123
 
9.5%
117
 
9.0%
89
 
6.9%
88
 
6.8%
76
 
5.9%
56
 
4.3%
40
 
3.1%
39
 
3.0%
36
 
2.8%
32
 
2.5%
Other values (121) 597
46.2%
Distinct106
Distinct (%)99.1%
Missing1
Missing (%)0.9%
Memory size996.0 B
2024-01-10T04:45:54.346024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.981308
Min length11

Characters and Unicode

Total characters1282
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique105 ?
Unique (%)98.1%

Sample

1st row041-663-7250
2nd row041-854-0101
3rd row041-955-3232
4th row041-350-8114
5th row041-666-3565
ValueCountFrequency (%)
041-554-7811 2
 
1.9%
041-623-1081 1
 
0.9%
041-680-1268 1
 
0.9%
041-357-4346 1
 
0.9%
041-350-2500 1
 
0.9%
041-539-6522 1
 
0.9%
041-660-4700 1
 
0.9%
041-661-9217 1
 
0.9%
041-939-9058 1
 
0.9%
041-661-7622 1
 
0.9%
Other values (96) 96
89.7%
2024-01-10T04:45:54.624431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 214
16.7%
0 194
15.1%
1 192
15.0%
4 158
12.3%
5 110
8.6%
6 98
7.6%
3 96
7.5%
9 61
 
4.8%
2 61
 
4.8%
8 55
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1068
83.3%
Dash Punctuation 214
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 194
18.2%
1 192
18.0%
4 158
14.8%
5 110
10.3%
6 98
9.2%
3 96
9.0%
9 61
 
5.7%
2 61
 
5.7%
8 55
 
5.1%
7 43
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 214
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1282
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 214
16.7%
0 194
15.1%
1 192
15.0%
4 158
12.3%
5 110
8.6%
6 98
7.6%
3 96
7.5%
9 61
 
4.8%
2 61
 
4.8%
8 55
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1282
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 214
16.7%
0 194
15.1%
1 192
15.0%
4 158
12.3%
5 110
8.6%
6 98
7.6%
3 96
7.5%
9 61
 
4.8%
2 61
 
4.8%
8 55
 
4.3%

업종
Text

Distinct52
Distinct (%)48.1%
Missing0
Missing (%)0.0%
Memory size996.0 B
2024-01-10T04:45:54.816727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length21
Mean length18.027778
Min length6

Characters and Unicode

Total characters1947
Distinct characters143
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)28.7%

Sample

1st row폐기물처리업 38210(폐기물종합재활용업)
2nd row가금류 가공 및 저장 처리업
3rd row강관·제조업(24132)
4th row강관제조업(24132)
5th row강관제조업(24132)
ValueCountFrequency (%)
지정외폐기물처리업(38210 14
 
11.6%
알루미늄제련,정련및합금제조업(24212 8
 
6.6%
자동차엔진용부품제조업(30310 6
 
5.0%
석유화학계기초화학물질제조업(20119 6
 
5.0%
그외기타자동차부품제조업(30399 4
 
3.3%
강관제조업(24132 3
 
2.5%
3
 
2.5%
그외기타종이및판지제품제조업(17909 3
 
2.5%
위생용종이제품제조업(17902 3
 
2.5%
자동차용동력전달장치제조업(30391 3
 
2.5%
Other values (52) 68
56.2%
2024-01-10T04:45:55.118474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 146
 
7.5%
1 127
 
6.5%
109
 
5.6%
107
 
5.5%
) 106
 
5.4%
( 106
 
5.4%
77
 
4.0%
0 77
 
4.0%
3 65
 
3.3%
58
 
3.0%
Other values (133) 969
49.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1175
60.3%
Decimal Number 526
27.0%
Close Punctuation 106
 
5.4%
Open Punctuation 106
 
5.4%
Other Punctuation 20
 
1.0%
Space Separator 14
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
109
 
9.3%
107
 
9.1%
77
 
6.6%
58
 
4.9%
37
 
3.1%
31
 
2.6%
29
 
2.5%
29
 
2.5%
27
 
2.3%
23
 
2.0%
Other values (118) 648
55.1%
Decimal Number
ValueCountFrequency (%)
2 146
27.8%
1 127
24.1%
0 77
14.6%
3 65
12.4%
9 41
 
7.8%
4 29
 
5.5%
8 17
 
3.2%
7 9
 
1.7%
6 9
 
1.7%
5 6
 
1.1%
Other Punctuation
ValueCountFrequency (%)
, 19
95.0%
· 1
 
5.0%
Close Punctuation
ValueCountFrequency (%)
) 106
100.0%
Open Punctuation
ValueCountFrequency (%)
( 106
100.0%
Space Separator
ValueCountFrequency (%)
14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1175
60.3%
Common 772
39.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
109
 
9.3%
107
 
9.1%
77
 
6.6%
58
 
4.9%
37
 
3.1%
31
 
2.6%
29
 
2.5%
29
 
2.5%
27
 
2.3%
23
 
2.0%
Other values (118) 648
55.1%
Common
ValueCountFrequency (%)
2 146
18.9%
1 127
16.5%
) 106
13.7%
( 106
13.7%
0 77
10.0%
3 65
8.4%
9 41
 
5.3%
4 29
 
3.8%
, 19
 
2.5%
8 17
 
2.2%
Other values (5) 39
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1175
60.3%
ASCII 771
39.6%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 146
18.9%
1 127
16.5%
) 106
13.7%
( 106
13.7%
0 77
10.0%
3 65
8.4%
9 41
 
5.3%
4 29
 
3.8%
, 19
 
2.5%
8 17
 
2.2%
Other values (4) 38
 
4.9%
Hangul
ValueCountFrequency (%)
109
 
9.3%
107
 
9.1%
77
 
6.6%
58
 
4.9%
37
 
3.1%
31
 
2.6%
29
 
2.5%
29
 
2.5%
27
 
2.3%
23
 
2.0%
Other values (118) 648
55.1%
None
ValueCountFrequency (%)
· 1
100.0%

Missing values

2024-01-10T04:45:52.819615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T04:45:53.118416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호소재지전화번호업종
0㈜세람그린에너지충남 금산군 부리면 선원리 663-4.3<NA>폐기물처리업 38210(폐기물종합재활용업)
1농업회사법인(주)신우에프에스충남 서산시 서산시 고북면 고북1로 343-22041-663-7250가금류 가공 및 저장 처리업
2대주중공업㈜ 공주지점충남 공주시 공단길 14-29041-854-0101강관·제조업(24132)
3엘에스메탈㈜충남 서천군 장항읍 화송길 123041-955-3232강관제조업(24132)
4㈜휴스틸충남 당진시 송악읍 부곡공단로 131041-350-8114강관제조업(24132)
5㈜위스코충남 서산시 성연면 성연3로 133-14041-666-3565강관제조업(24132)
6에이케이켐텍㈜충남 청양군 정산면 충의로 1547-95041-940-6112계면활성제제조업(20421)
7㈜우창충남 서산시 지곡면 빗돌머리길 2-24041-664-6671그외기타분류안된화학제품제조업(20499)
8성우오토텍㈜충남 천안시 동남구 동면 충절로 2294-42041-523-5851그외기타자동차부품제조업(30399)
9새론오토모티브㈜충남 천안시 동남구 병천면 가전5길 133041-560-4492그외기타자동차부품제조업(30399)
상호소재지전화번호업종
98국일제지㈜ 아산공장충남 아산시 시민로 485번길 47041-549-0811크라프트지및상자용판지제조업(17123)
99대성에코에너지센터충남 당진시 석문면 장고항리 1421031-498-1451폐기물처리업
100에스케이씨하이테크앤마케팅㈜충남 천안시 서북구 성거읍 성거길 112041-550-9999플라스틱적층,도포및기타표면처리제품제조업(22291)
101선영화학㈜충남 천안시 동남구 동면 충절로 1995041-523-3030플라스틱필름,시트및판제조업(22212)
102삼성디스플레이㈜-탕정로380-2충남 아산시 탕정면 탕정로 380-2041-535-1331플라즈마및기타평판디스플레이제조업(26219)
103(주)SIMPAC METALLOY㈜당진공장충남 당진시 정미면 정미로 438041-360-0100합금철제조업(24113)
104서울화인테크㈜충남 아산시 영인면 토정로 412041-544-2650합성고무제조업(20201)
105애경화학㈜충남 청양군 정산면 충의로 1547-64041-940-6300합성수지및기타플라스틱물질제조업(20202)
106코오롱인더스트리㈜대산공장충남 서산시 대산읍 대죽1로 100041-661-5781합성수지및기타플라스틱물질제조업(20202)
107내포그린에너지㈜충남 홍성군 홍북읍 신경리 279-3041-631-2901화력발전업(35113)