Overview

Dataset statistics

Number of variables5
Number of observations190
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.6 KiB
Average record size in memory40.7 B

Variable types

Categorical2
Text3

Dataset

Description서산시 관내 소재한 대기오염물질 배출사업장 현황(3~5종)이며, 1,2종 사업장의 경우 충청남도, 환경부 관할 사업장으로 해당 데이터 없음을 제공합니다.
Author충청남도 서산시
URLhttps://www.data.go.kr/data/15080890/fileData.do

Alerts

업무구분 has constant value ""Constant

Reproduction

Analysis started2023-12-12 21:07:32.455383
Analysis finished2023-12-12 21:07:32.977077
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업무구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
대기배출업소관리
190 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대기배출업소관리
2nd row대기배출업소관리
3rd row대기배출업소관리
4th row대기배출업소관리
5th row대기배출업소관리

Common Values

ValueCountFrequency (%)
대기배출업소관리 190
100.0%

Length

2023-12-13T06:07:33.046523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:07:33.149246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대기배출업소관리 190
100.0%
Distinct186
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-13T06:07:33.366347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length8.8368421
Min length3

Characters and Unicode

Total characters1679
Distinct characters256
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique182 ?
Unique (%)95.8%

Sample

1st row서산현대자동차공업사(주)
2nd row계림탕
3rd row한일산업(주)
4th row농업회사법인(주)새들만
5th row서령목욕탕
ValueCountFrequency (%)
주식회사 4
 
1.8%
농업회사법인 3
 
1.3%
서산공장 2
 
0.9%
주)동남합성 2
 
0.9%
충청남도 2
 
0.9%
대산공장 2
 
0.9%
주)한농화성 2
 
0.9%
서산 2
 
0.9%
제국모터스 2
 
0.9%
금강레미콘(주 2
 
0.9%
Other values (201) 202
89.8%
2023-12-13T06:07:33.834098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 124
 
7.4%
) 124
 
7.4%
124
 
7.4%
75
 
4.5%
52
 
3.1%
47
 
2.8%
42
 
2.5%
36
 
2.1%
36
 
2.1%
34
 
2.0%
Other values (246) 985
58.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1368
81.5%
Open Punctuation 124
 
7.4%
Close Punctuation 124
 
7.4%
Space Separator 36
 
2.1%
Decimal Number 21
 
1.3%
Uppercase Letter 6
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
124
 
9.1%
75
 
5.5%
52
 
3.8%
47
 
3.4%
42
 
3.1%
36
 
2.6%
34
 
2.5%
29
 
2.1%
22
 
1.6%
22
 
1.6%
Other values (228) 885
64.7%
Decimal Number
ValueCountFrequency (%)
2 6
28.6%
1 4
19.0%
8 3
14.3%
0 2
 
9.5%
9 2
 
9.5%
3 1
 
4.8%
6 1
 
4.8%
5 1
 
4.8%
4 1
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
B 1
16.7%
K 1
16.7%
E 1
16.7%
A 1
16.7%
P 1
16.7%
I 1
16.7%
Open Punctuation
ValueCountFrequency (%)
( 124
100.0%
Close Punctuation
ValueCountFrequency (%)
) 124
100.0%
Space Separator
ValueCountFrequency (%)
36
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1368
81.5%
Common 305
 
18.2%
Latin 6
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
124
 
9.1%
75
 
5.5%
52
 
3.8%
47
 
3.4%
42
 
3.1%
36
 
2.6%
34
 
2.5%
29
 
2.1%
22
 
1.6%
22
 
1.6%
Other values (228) 885
64.7%
Common
ValueCountFrequency (%)
( 124
40.7%
) 124
40.7%
36
 
11.8%
2 6
 
2.0%
1 4
 
1.3%
8 3
 
1.0%
0 2
 
0.7%
9 2
 
0.7%
3 1
 
0.3%
6 1
 
0.3%
Other values (2) 2
 
0.7%
Latin
ValueCountFrequency (%)
B 1
16.7%
K 1
16.7%
E 1
16.7%
A 1
16.7%
P 1
16.7%
I 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1368
81.5%
ASCII 311
 
18.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 124
39.9%
) 124
39.9%
36
 
11.6%
2 6
 
1.9%
1 4
 
1.3%
8 3
 
1.0%
0 2
 
0.6%
9 2
 
0.6%
3 1
 
0.3%
B 1
 
0.3%
Other values (8) 8
 
2.6%
Hangul
ValueCountFrequency (%)
124
 
9.1%
75
 
5.5%
52
 
3.8%
47
 
3.4%
42
 
3.1%
36
 
2.6%
34
 
2.5%
29
 
2.1%
22
 
1.6%
22
 
1.6%
Other values (228) 885
64.7%
Distinct179
Distinct (%)94.2%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-13T06:07:34.260176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length35
Mean length23.478947
Min length19

Characters and Unicode

Total characters4461
Distinct characters161
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique170 ?
Unique (%)89.5%

Sample

1st row충청남도 서산시 고운로 6-7 (예천동)
2nd row충청남도 서산시 번화2로 27 (읍내동)
3rd row충청남도 서산시 음암면 장나다리길 48
4th row충청남도 서산시 해미면 내포로 2500-6
5th row충청남도 서산시 읍내1로 5 서령상가 (읍내동)
ValueCountFrequency (%)
충청남도 190
19.2%
서산시 190
19.2%
대산읍 28
 
2.8%
성연면 27
 
2.7%
음암면 24
 
2.4%
고북면 16
 
1.6%
수석동 16
 
1.6%
운산면 15
 
1.5%
해미면 14
 
1.4%
수석산업로 10
 
1.0%
Other values (296) 460
46.5%
2023-12-13T06:07:34.856856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
827
18.5%
266
 
6.0%
203
 
4.6%
200
 
4.5%
198
 
4.4%
192
 
4.3%
192
 
4.3%
191
 
4.3%
1 182
 
4.1%
146
 
3.3%
Other values (151) 1864
41.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2696
60.4%
Space Separator 827
 
18.5%
Decimal Number 729
 
16.3%
Dash Punctuation 93
 
2.1%
Open Punctuation 57
 
1.3%
Close Punctuation 57
 
1.3%
Other Punctuation 1
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
266
 
9.9%
203
 
7.5%
200
 
7.4%
198
 
7.3%
192
 
7.1%
192
 
7.1%
191
 
7.1%
146
 
5.4%
114
 
4.2%
72
 
2.7%
Other values (133) 922
34.2%
Decimal Number
ValueCountFrequency (%)
1 182
25.0%
2 106
14.5%
3 73
10.0%
4 69
 
9.5%
5 65
 
8.9%
7 64
 
8.8%
8 46
 
6.3%
6 45
 
6.2%
9 45
 
6.2%
0 34
 
4.7%
Open Punctuation
ValueCountFrequency (%)
( 56
98.2%
[ 1
 
1.8%
Close Punctuation
ValueCountFrequency (%)
) 56
98.2%
] 1
 
1.8%
Space Separator
ValueCountFrequency (%)
827
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 93
100.0%
Other Punctuation
ValueCountFrequency (%)
* 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2696
60.4%
Common 1764
39.5%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
266
 
9.9%
203
 
7.5%
200
 
7.4%
198
 
7.3%
192
 
7.1%
192
 
7.1%
191
 
7.1%
146
 
5.4%
114
 
4.2%
72
 
2.7%
Other values (133) 922
34.2%
Common
ValueCountFrequency (%)
827
46.9%
1 182
 
10.3%
2 106
 
6.0%
- 93
 
5.3%
3 73
 
4.1%
4 69
 
3.9%
5 65
 
3.7%
7 64
 
3.6%
( 56
 
3.2%
) 56
 
3.2%
Other values (7) 173
 
9.8%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2696
60.4%
ASCII 1765
39.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
827
46.9%
1 182
 
10.3%
2 106
 
6.0%
- 93
 
5.3%
3 73
 
4.1%
4 69
 
3.9%
5 65
 
3.7%
7 64
 
3.6%
( 56
 
3.2%
) 56
 
3.2%
Other values (8) 174
 
9.9%
Hangul
ValueCountFrequency (%)
266
 
9.9%
203
 
7.5%
200
 
7.4%
198
 
7.3%
192
 
7.1%
192
 
7.1%
191
 
7.1%
146
 
5.4%
114
 
4.2%
72
 
2.7%
Other values (133) 922
34.2%
Distinct83
Distinct (%)43.7%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2023-12-13T06:07:35.256957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length21
Mean length10.657895
Min length1

Characters and Unicode

Total characters2025
Distinct characters159
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)25.3%

Sample

1st row자동차 종합 수리업
2nd row욕탕업
3rd row레미콘 제조업
4th row곡물 도정업
5th row욕탕업
ValueCountFrequency (%)
제조업 81
 
14.4%
38
 
6.7%
자동차 33
 
5.9%
수리업 28
 
5.0%
기타 28
 
5.0%
종합 21
 
3.7%
폐기물 18
 
3.2%
처리업 18
 
3.2%
곡물 12
 
2.1%
도정업 12
 
2.1%
Other values (139) 274
48.7%
2023-12-13T06:07:35.827497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
413
20.4%
170
 
8.4%
122
 
6.0%
97
 
4.8%
67
 
3.3%
62
 
3.1%
47
 
2.3%
46
 
2.3%
45
 
2.2%
44
 
2.2%
Other values (149) 912
45.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1610
79.5%
Space Separator 413
 
20.4%
Decimal Number 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
170
 
10.6%
122
 
7.6%
97
 
6.0%
67
 
4.2%
62
 
3.9%
47
 
2.9%
46
 
2.9%
45
 
2.8%
44
 
2.7%
41
 
2.5%
Other values (147) 869
54.0%
Space Separator
ValueCountFrequency (%)
413
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1610
79.5%
Common 415
 
20.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
170
 
10.6%
122
 
7.6%
97
 
6.0%
67
 
4.2%
62
 
3.9%
47
 
2.9%
46
 
2.9%
45
 
2.8%
44
 
2.7%
41
 
2.5%
Other values (147) 869
54.0%
Common
ValueCountFrequency (%)
413
99.5%
1 2
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1610
79.5%
ASCII 415
 
20.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
413
99.5%
1 2
 
0.5%
Hangul
ValueCountFrequency (%)
170
 
10.6%
122
 
7.6%
97
 
6.0%
67
 
4.2%
62
 
3.9%
47
 
2.9%
46
 
2.9%
45
 
2.8%
44
 
2.7%
41
 
2.5%
Other values (147) 869
54.0%


Categorical

Distinct3
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
5종
101 
4종
72 
3종
17 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5종
2nd row4종
3rd row4종
4th row4종
5th row5종

Common Values

ValueCountFrequency (%)
5종 101
53.2%
4종 72
37.9%
3종 17
 
8.9%

Length

2023-12-13T06:07:35.991566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:07:36.423522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5종 101
53.2%
4종 72
37.9%
3종 17
 
8.9%

Correlations

2023-12-13T06:07:36.497530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대표업종
대표업종1.0000.498
0.4981.000

Missing values

2023-12-13T06:07:32.822725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:07:32.940705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업무구분사업장명도로명소재지대표업종
0대기배출업소관리서산현대자동차공업사(주)충청남도 서산시 고운로 6-7 (예천동)자동차 종합 수리업5종
1대기배출업소관리계림탕충청남도 서산시 번화2로 27 (읍내동)욕탕업4종
2대기배출업소관리한일산업(주)충청남도 서산시 음암면 장나다리길 48레미콘 제조업4종
3대기배출업소관리농업회사법인(주)새들만충청남도 서산시 해미면 내포로 2500-6곡물 도정업4종
4대기배출업소관리서령목욕탕충청남도 서산시 읍내1로 5 서령상가 (읍내동)욕탕업5종
5대기배출업소관리케이와이산업(주)충청남도 서산시 운산면 장생동로 575아스콘 제조업3종
6대기배출업소관리강산레미콘(주)충청남도 서산시 대산읍 망일산로 644종
7대기배출업소관리동진파일(주)충청남도 서산시 고북면 신상날새길 22-15콘크리트관 및 기타 구조용 콘크리트제품 제조업4종
8대기배출업소관리(주)대한화학충청남도 서산시 운산면 장생동로 511-22플라스틱 발포 성형제품 제조업4종
9대기배출업소관리대호산업(주)충청남도 서산시 대산읍 충의로 2813시멘트 석회 플라스터 및 그 제품 제조업5종
업무구분사업장명도로명소재지대표업종
180대기배출업소관리디아이에프에프충청남도 서산시 성연면 성연3로 173 4동도장 및 기타 피막처리업5종
181대기배출업소관리(주)에코필충청남도 서산시 대산읍 평신1로 531-112토양 및 지하수 정화업4종
182대기배출업소관리(주)우성금속충청남도 서산시 성연면 해성길 99 (주)우성금속선철주물 주조업3종
183대기배출업소관리주택관리공단(주) 서산석림3관리소충청남도 서산시 석림4로 83 (석림동 주공아파트)주거용 부동산 관리업4종
184대기배출업소관리(주)시스턴충청남도 서산시 운산면 장생동로 681-24설치용 금속탱크 및 저장용기 제조업5종
185대기배출업소관리BK금속(서산자원)충청남도 서산시 음암면 석동로 271폐기물 처리업5종
186대기배출업소관리대전지방법원 대전가정법원 서산지원충청남도 서산시 공림4로 24 대전지방법원서산지원 (예천동)법원5종
187대기배출업소관리(주)느티나무충청남도 서산시 고북면 고수관로 17-7콘크리트 타일 기와 벽돌 및 블록 제조업4종
188대기배출업소관리(주)유진글로벌충청남도 서산시 음암면 음암로 132-10폐기물 처리업5종
189대기배출업소관리농업회사법인 (주)봉락미곡처리장충청남도 서산시 고북면 농장중앙길 158곡물 도정업5종