Overview

Dataset statistics

Number of variables: 2
Number of observations: 275
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.4 KiB
Average record size in memory: 16.5 B

Variable types

Text: 2

Dataset

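Description: Data on wastewater-discharging businesses in Suncheon-si, Jeollanam-do, providing fields such as the business name, address, and the industry type associated with the discharge facility.
URL: https://www.data.go.kr/data/3072882/fileData.do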

Reproduction

Analysis started: 2023-12-12 15:44:47.493414
Analysis finished: 2023-12-12 15:44:47.990882
Duration: 0.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
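
A report of this kind can be regenerated roughly as follows. This is a minimal sketch, not the exact configuration used above: the CSV path is hypothetical, the `cp949` encoding is an assumption, and the report title is illustrative.

```python
from pathlib import Path


def build_report(csv_path: str, html_path: str = "report.html",
                 encoding: str = "cp949") -> Path:
    """Profile a CSV export of the dataset and write an HTML report."""
    # Imported lazily so this function can be defined even where the
    # third-party libraries are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is cp949-encoded; pass encoding="utf-8" if not.
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is illustrative and does not come from the original report.
    report = ProfileReport(df, title="Suncheon wastewater discharge facilities")
    report.to_file(html_path)
    return Path(html_path)
```

Calling `build_report("suncheon_wastewater.csv")` (a hypothetical file name) would write `report.html` to the working directory.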

Variables

사업장명 (business name)

Distinct: 271
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 25
Median length: 21
Mean length: 7.6472727
Min length: 2
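
As a rough illustration of how these length statistics arise, the sketch below computes them for the five sample business names listed in this report. It assumes length means the number of Unicode code points, which is what Python's `len` returns for a string.

```python
from statistics import mean, median

# Five sample values from this report (not the full column).
names = ["제일세차장", "대박세차장", "매일식품(주)", "호성산업주식회사", "대일세차장"]

# Length is measured in code points (an assumption about the profiler).
lengths = [len(name) for name in names]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}

# For the full column, the mean equals total characters / observations.
full_column_mean = 2103 / 275
```

For the full column, 2103 total characters over 275 observations gives the reported mean of about 7.647.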

Characters and Unicode

Total characters: 2103
Distinct characters: 309
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
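
A minimal sketch of this kind of analysis, using Python's standard `unicodedata` module to count general categories. The mapping from two-letter codes to the long names used in this report is written out by hand, and the function name is illustrative.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Zs": "Space Separator",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not listed above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


sample_counts = category_counts(["제일세차장", "매일식품(주)", "유워시 셀프세차장"])
```

On these three sample names, Hangul syllables fall under Other Letter, the parentheses under Open and Close Punctuation, and the space under Space Separator.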

Unique

Unique: 267
Unique (%): 97.1%
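
The difference between the Distinct and Unique figures can be sketched as follows, assuming "distinct" counts the different values present and "unique" counts values that occur exactly once. The function name is illustrative.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Four business names taken from the sample rows of this report.
sample = ["(주)광남토건", "(주)삼화기업", "(주)광남토건", "(주)강진산업"]
distinct, unique = distinct_and_unique(sample)  # (3, 2)
```

Under that reading, 271 distinct and 267 unique values among 275 rows mean that 4 values are repeated and together account for the remaining 8 rows.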

Sample

1st row: 제일세차장
2nd row: 대박세차장
3rd row: 매일식품(주)
4th row: 호성산업주식회사
5th row: 대일세차장

Value | Count | Frequency (%)
주식회사 | 5 | 1.5%
주)광남토건 | 2 | 0.6%
농업회사법인 | 2 | 0.6%
lpg | 2 | 0.6%
현대세차장 | 2 | 0.6%
순천시 | 2 | 0.6%
순천지점 | 2 | 0.6%
(value not shown) | 2 | 0.6%
셀프세차장 | 2 | 0.6%
주유소 | 2 | 0.6%
Other values (295) | 301 | 92.9%

Most occurring characters

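Value | Count | Frequency (%)
(Hangul, not shown) | 109 | 5.2%
( | 87 | 4.1%
) | 87 | 4.1%
(Hangul, not shown) | 68 | 3.2%
(Hangul, not shown) | 68 | 3.2%
(Hangul, not shown) | 63 | 3.0%
(Hangul, not shown) | 55 | 2.6%
(Hangul, not shown) | 53 | 2.5%
(Hangul, not shown) | 50 | 2.4%
(space) | 49 | 2.3%
Other values (299) | 1414 | 67.2%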

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1828 | 86.9%
Open Punctuation | 87 | 4.1%
Close Punctuation | 87 | 4.1%
Space Separator | 49 | 2.3%
Uppercase Letter | 42 | 2.0%
Decimal Number | 8 | 0.4%
Other Punctuation | 2 | 0.1%

Most frequent character per category

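Other Letter
Value | Count | Frequency (%)
(Hangul, not shown) | 109 | 6.0%
(Hangul, not shown) | 68 | 3.7%
(Hangul, not shown) | 68 | 3.7%
(Hangul, not shown) | 63 | 3.4%
(Hangul, not shown) | 55 | 3.0%
(Hangul, not shown) | 53 | 2.9%
(Hangul, not shown) | 50 | 2.7%
(Hangul, not shown) | 46 | 2.5%
(Hangul, not shown) | 36 | 2.0%
(Hangul, not shown) | 31 | 1.7%
Other values (276) | 1249 | 68.3%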

Uppercase Letter
Value | Count | Frequency (%)
P | 8 | 19.0%
C | 7 | 16.7%
L | 6 | 14.3%
G | 6 | 14.3%
R | 3 | 7.1%
S | 2 | 4.8%
E | 2 | 4.8%
D | 2 | 4.8%
J | 1 | 2.4%
Y | 1 | 2.4%
Other values (4) | 4 | 9.5%

Decimal Number
Value | Count | Frequency (%)
2 | 4 | 50.0%
4 | 1 | 12.5%
9 | 1 | 12.5%
1 | 1 | 12.5%
5 | 1 | 12.5%

Open Punctuation
Value | Count | Frequency (%)
( | 87 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 87 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 49 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1828 | 86.9%
Common | 233 | 11.1%
Latin | 42 | 2.0%

Most frequent character per script

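Hangul
Value | Count | Frequency (%)
(Hangul, not shown) | 109 | 6.0%
(Hangul, not shown) | 68 | 3.7%
(Hangul, not shown) | 68 | 3.7%
(Hangul, not shown) | 63 | 3.4%
(Hangul, not shown) | 55 | 3.0%
(Hangul, not shown) | 53 | 2.9%
(Hangul, not shown) | 50 | 2.7%
(Hangul, not shown) | 46 | 2.5%
(Hangul, not shown) | 36 | 2.0%
(Hangul, not shown) | 31 | 1.7%
Other values (276) | 1249 | 68.3%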

Latin
Value | Count | Frequency (%)
P | 8 | 19.0%
C | 7 | 16.7%
L | 6 | 14.3%
G | 6 | 14.3%
R | 3 | 7.1%
S | 2 | 4.8%
E | 2 | 4.8%
D | 2 | 4.8%
J | 1 | 2.4%
Y | 1 | 2.4%
Other values (4) | 4 | 9.5%

Common
Value | Count | Frequency (%)
( | 87 | 37.3%
) | 87 | 37.3%
(space) | 49 | 21.0%
2 | 4 | 1.7%
& | 2 | 0.9%
4 | 1 | 0.4%
9 | 1 | 0.4%
1 | 1 | 0.4%
5 | 1 | 0.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1828 | 86.9%
ASCII | 275 | 13.1%

Most frequent character per block

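Hangul
Value | Count | Frequency (%)
(Hangul, not shown) | 109 | 6.0%
(Hangul, not shown) | 68 | 3.7%
(Hangul, not shown) | 68 | 3.7%
(Hangul, not shown) | 63 | 3.4%
(Hangul, not shown) | 55 | 3.0%
(Hangul, not shown) | 53 | 2.9%
(Hangul, not shown) | 50 | 2.7%
(Hangul, not shown) | 46 | 2.5%
(Hangul, not shown) | 36 | 2.0%
(Hangul, not shown) | 31 | 1.7%
Other values (276) | 1249 | 68.3%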

ASCII
Value | Count | Frequency (%)
( | 87 | 31.6%
) | 87 | 31.6%
(space) | 49 | 17.8%
P | 8 | 2.9%
C | 7 | 2.5%
L | 6 | 2.2%
G | 6 | 2.2%
2 | 4 | 1.5%
R | 3 | 1.1%
& | 2 | 0.7%
Other values (13) | 16 | 5.8%

소재지 (address)

Distinct: 272
Distinct (%): 98.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 45
Median length: 42
Mean length: 22.334545
Min length: 17

Characters and Unicode

Total characters: 6142
Distinct characters: 199
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 269
Unique (%): 97.8%

Sample

1st row: 전라남도 순천시 장천4길 43 (장천동)
2nd row: 전라남도 순천시 팔마로 172 (덕암동)
3rd row: 전라남도 순천시 서면 산단1길 16
4th row: 전라남도 순천시 서면 산단4길 56
5th row: 전라남도 순천시 하풍동길 14 (풍덕동)

Value | Count | Frequency (%)
전라남도 | 275 | 19.4%
순천시 | 275 | 19.4%
서면 | 50 | 3.5%
조례동 | 35 | 2.5%
해룡면 | 31 | 2.2%
별량면 | 27 | 1.9%
가곡동 | 20 | 1.4%
풍덕동 | 18 | 1.3%
중앙로 | 14 | 1.0%
연향동 | 14 | 1.0%
Other values (414) | 655 | 46.3%
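
A word-frequency table like this one can be approximated with the sketch below. The tokenization rule is an assumption rather than the profiler's documented behavior: values are lowercased, split on whitespace, and stripped of leading and trailing ASCII punctuation, which is consistent with district names such as 조례동 appearing without their parentheses.

```python
import string
from collections import Counter


def word_counts(values):
    """Count whitespace-separated tokens, ignoring surrounding punctuation."""
    counts = Counter()
    for value in values:
        for token in value.lower().split():
            # Assumption: edge punctuation such as "(" and ")" is dropped.
            token = token.strip(string.punctuation)
            if token:
                counts[token] += 1
    return counts


# The five sample addresses shown in this report.
addresses = [
    "전라남도 순천시 장천4길 43 (장천동)",
    "전라남도 순천시 팔마로 172 (덕암동)",
    "전라남도 순천시 서면 산단1길 16",
    "전라남도 순천시 서면 산단4길 56",
    "전라남도 순천시 하풍동길 14 (풍덕동)",
]
address_words = word_counts(addresses)
```

On the five sample rows, the province and city tokens each appear five times, mirroring their count of 275 in the full column.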

Most occurring characters

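Value | Count | Frequency (%)
(space) | 1202 | 19.6%
(Hangul, not shown) | 301 | 4.9%
(Hangul, not shown) | 300 | 4.9%
(Hangul, not shown) | 292 | 4.8%
(Hangul, not shown) | 288 | 4.7%
(Hangul, not shown) | 277 | 4.5%
(Hangul, not shown) | 275 | 4.5%
(Hangul, not shown) | 275 | 4.5%
1 | 177 | 2.9%
(Hangul, not shown) | 153 | 2.5%
Other values (189) | 2602 | 42.4%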

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3733 | 60.8%
Space Separator | 1202 | 19.6%
Decimal Number | 872 | 14.2%
Close Punctuation | 135 | 2.2%
Open Punctuation | 135 | 2.2%
Dash Punctuation | 64 | 1.0%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

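Other Letter
Value | Count | Frequency (%)
(Hangul, not shown) | 301 | 8.1%
(Hangul, not shown) | 300 | 8.0%
(Hangul, not shown) | 292 | 7.8%
(Hangul, not shown) | 288 | 7.7%
(Hangul, not shown) | 277 | 7.4%
(Hangul, not shown) | 275 | 7.4%
(Hangul, not shown) | 275 | 7.4%
(Hangul, not shown) | 153 | 4.1%
(Hangul, not shown) | 135 | 3.6%
(Hangul, not shown) | 131 | 3.5%
Other values (174) | 1306 | 35.0%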

Decimal Number
Value | Count | Frequency (%)
1 | 177 | 20.3%
2 | 138 | 15.8%
3 | 106 | 12.2%
4 | 103 | 11.8%
6 | 74 | 8.5%
5 | 66 | 7.6%
9 | 56 | 6.4%
7 | 54 | 6.2%
0 | 50 | 5.7%
8 | 48 | 5.5%

Space Separator
Value | Count | Frequency (%)
(space) | 1202 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 135 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 135 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 64 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3733 | 60.8%
Common | 2409 | 39.2%

Most frequent character per script

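Hangul
Value | Count | Frequency (%)
(Hangul, not shown) | 301 | 8.1%
(Hangul, not shown) | 300 | 8.0%
(Hangul, not shown) | 292 | 7.8%
(Hangul, not shown) | 288 | 7.7%
(Hangul, not shown) | 277 | 7.4%
(Hangul, not shown) | 275 | 7.4%
(Hangul, not shown) | 275 | 7.4%
(Hangul, not shown) | 153 | 4.1%
(Hangul, not shown) | 135 | 3.6%
(Hangul, not shown) | 131 | 3.5%
Other values (174) | 1306 | 35.0%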

Common
Value | Count | Frequency (%)
(space) | 1202 | 49.9%
1 | 177 | 7.3%
2 | 138 | 5.7%
) | 135 | 5.6%
( | 135 | 5.6%
3 | 106 | 4.4%
4 | 103 | 4.3%
6 | 74 | 3.1%
5 | 66 | 2.7%
- | 64 | 2.7%
Other values (5) | 209 | 8.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3733 | 60.8%
ASCII | 2409 | 39.2%
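
Python's standard library does not expose Unicode blocks directly, so the sketch below classifies characters by code-point range. It assumes the report's "Hangul" block corresponds to Hangul Syllables (U+AC00 to U+D7AF) and "ASCII" to Basic Latin (U+0000 to U+007F). The function names are illustrative.

```python
from collections import Counter


def block_of(ch):
    """Map a character to a coarse Unicode block label by code-point range."""
    cp = ord(ch)
    if cp < 0x80:
        return "ASCII"   # Basic Latin
    if 0xAC00 <= cp <= 0xD7AF:
        return "Hangul"  # Hangul Syllables (assumed match for the report's label)
    return "Other"


def block_counts(values):
    """Count characters per block across all values."""
    return Counter(block_of(ch) for value in values for ch in value)


# One sample address from this report.
blocks = block_counts(["전라남도 순천시 서면 산단1길 16"])
```

For that sample address, the 12 Hangul syllables land in the Hangul block, and the 4 spaces and 3 digits land in ASCII.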

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 1202 | 49.9%
1 | 177 | 7.3%
2 | 138 | 5.7%
) | 135 | 5.6%
( | 135 | 5.6%
3 | 106 | 4.4%
4 | 103 | 4.3%
6 | 74 | 3.1%
5 | 66 | 2.7%
- | 64 | 2.7%
Other values (5) | 209 | 8.7%
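
Hangul
Value | Count | Frequency (%)
(Hangul, not shown) | 301 | 8.1%
(Hangul, not shown) | 300 | 8.0%
(Hangul, not shown) | 292 | 7.8%
(Hangul, not shown) | 288 | 7.7%
(Hangul, not shown) | 277 | 7.4%
(Hangul, not shown) | 275 | 7.4%
(Hangul, not shown) | 275 | 7.4%
(Hangul, not shown) | 153 | 4.1%
(Hangul, not shown) | 135 | 3.6%
(Hangul, not shown) | 131 | 3.5%
Other values (174) | 1306 | 35.0%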

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
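
The data behind both displays can be sketched in plain Python. This is a minimal illustration on three sample rows, assuming that `None` and empty strings count as missing. The helper names are hypothetical.

```python
# Three sample rows from this report, keyed by the original column names.
rows = [
    {"사업장명": "제일세차장", "소재지": "전라남도 순천시 장천4길 43 (장천동)"},
    {"사업장명": "대박세차장", "소재지": "전라남도 순천시 팔마로 172 (덕암동)"},
    {"사업장명": "매일식품(주)", "소재지": "전라남도 순천시 서면 산단1길 16"},
]
columns = ["사업장명", "소재지"]


def is_missing(value):
    # Assumption: None and empty strings are treated as missing.
    return value is None or value == ""


def nullity_matrix(records, cols):
    """One row per record, one boolean per column: True where a value is present."""
    return [[not is_missing(r.get(c)) for c in cols] for r in records]


def missing_per_column(records, cols):
    """Number of missing cells in each column."""
    return {c: sum(is_missing(r.get(c)) for r in records) for c in cols}


matrix = nullity_matrix(rows, columns)
missing = missing_per_column(rows, columns)
```

Every cell in these rows is present, in line with the 0 missing cells reported for the full dataset.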

Sample

Index | 사업장명 (business name) | 소재지 (address)
0 | 제일세차장 | 전라남도 순천시 장천4길 43 (장천동)
1 | 대박세차장 | 전라남도 순천시 팔마로 172 (덕암동)
2 | 매일식품(주) | 전라남도 순천시 서면 산단1길 16
3 | 호성산업주식회사 | 전라남도 순천시 서면 산단4길 56
4 | 대일세차장 | 전라남도 순천시 하풍동길 14 (풍덕동)
5 | 근로복지공단 순천병원 | 전라남도 순천시 조례1길 24 (조례동)
6 | 한국신광마이크로애렉트로닉스(주) | 전라남도 순천시 서면 산단1길 32
7 | 순천중앙병원 | 전라남도 순천시 장명로 5 (장천동)
8 | 성동세차장 | 전라남도 순천시 성동2길 6 (동외동)
9 | (주)일성레미콘 | 전라남도 순천시 서면 청소길 177

Index | 사업장명 (business name) | 소재지 (address)
265 | 순천수지 | 전라남도 순천시 별량면 친환경길 122
266 | 코오롱글로벌(주)순천센터 | 전라남도 순천시 서면 압곡리 764-1 764-3 764-4 . 764-10
267 | 유워시 셀프세차장 | 전라남도 순천시 서면 산단4길 2
268 | (유)이십일세기관광전세 | 전라남도 순천시 서면 압곡길 62-2
269 | 삼호개발(주) | 전라남도 순천시 송광면 봉산리 670 671번지
270 | (주)광남토건 | 전라남도 순천시 송광면 오봉리 9번지
271 | (주)삼화기업 | 전라남도 순천시 서면 구랑실재길 133
272 | (주)광남토건 | 전라남도 순천시 송광면 신흥리 산74-3번지
273 | (주)강진산업 | 전라남도 순천시 서면 구랑실재길 130
274 | 워시랜드 순천점 | 전라남도 순천시 해룡면 상삼리 209-1