Overview

Dataset statistics

Number of variables4
Number of observations98
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory33.3 B

Variable types

Text3
Categorical1

Dataset

Description물환경보전법에 따른 연수구 내 폐수 배출시설 설치 사업장 현황에 대한 데이터로 사업장명, 소재지, 배출 물질을 제공합니다.
Author인천광역시 연수구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15065225&srcSe=7661IVAWM27C61E190

Reproduction

Analysis started2024-01-28 13:38:21.781816
Analysis finished2024-01-28 13:38:22.669311
Duration0.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct96
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size916.0 B
2024-01-28T22:38:22.820345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length9.3979592
Min length3

Characters and Unicode

Total characters921
Distinct characters221
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)95.9%

Sample

1st row인천적십자병원
2nd row강인여객
3rd row도영운수㈜외2개소
4th row뉴신용카세차장
5th row팔팔현대공업사
ValueCountFrequency (%)
주식회사 4
 
3.2%
가천대학교 2
 
1.6%
현대오일뱅크(주 2
 
1.6%
주)엔에프씨 2
 
1.6%
연구소 2
 
1.6%
풍림산업(주 2
 
1.6%
제2공장 2
 
1.6%
㈜셀트리온 2
 
1.6%
직영 2
 
1.6%
송도 2
 
1.6%
Other values (104) 104
82.5%
2024-01-28T22:38:23.157050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35
 
3.8%
30
 
3.3%
27
 
2.9%
( 25
 
2.7%
) 25
 
2.7%
20
 
2.2%
19
 
2.1%
18
 
2.0%
18
 
2.0%
18
 
2.0%
Other values (211) 686
74.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 781
84.8%
Space Separator 30
 
3.3%
Other Symbol 27
 
2.9%
Open Punctuation 25
 
2.7%
Close Punctuation 25
 
2.7%
Uppercase Letter 13
 
1.4%
Decimal Number 12
 
1.3%
Lowercase Letter 4
 
0.4%
Dash Punctuation 3
 
0.3%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
4.5%
20
 
2.6%
19
 
2.4%
18
 
2.3%
18
 
2.3%
18
 
2.3%
16
 
2.0%
15
 
1.9%
15
 
1.9%
14
 
1.8%
Other values (188) 593
75.9%
Uppercase Letter
ValueCountFrequency (%)
K 3
23.1%
D 2
15.4%
R 2
15.4%
O 2
15.4%
S 2
15.4%
T 1
 
7.7%
A 1
 
7.7%
Decimal Number
ValueCountFrequency (%)
2 4
33.3%
0 2
16.7%
1 2
16.7%
8 2
16.7%
3 1
 
8.3%
4 1
 
8.3%
Lowercase Letter
ValueCountFrequency (%)
s 1
25.0%
l 1
25.0%
f 1
25.0%
e 1
25.0%
Space Separator
ValueCountFrequency (%)
30
100.0%
Other Symbol
ValueCountFrequency (%)
27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 808
87.7%
Common 96
 
10.4%
Latin 17
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
4.3%
27
 
3.3%
20
 
2.5%
19
 
2.4%
18
 
2.2%
18
 
2.2%
18
 
2.2%
16
 
2.0%
15
 
1.9%
15
 
1.9%
Other values (189) 607
75.1%
Common
ValueCountFrequency (%)
30
31.2%
( 25
26.0%
) 25
26.0%
2 4
 
4.2%
- 3
 
3.1%
0 2
 
2.1%
1 2
 
2.1%
8 2
 
2.1%
& 1
 
1.0%
3 1
 
1.0%
Latin
ValueCountFrequency (%)
K 3
17.6%
D 2
11.8%
R 2
11.8%
O 2
11.8%
S 2
11.8%
s 1
 
5.9%
l 1
 
5.9%
T 1
 
5.9%
f 1
 
5.9%
A 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 781
84.8%
ASCII 113
 
12.3%
None 27
 
2.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
35
 
4.5%
20
 
2.6%
19
 
2.4%
18
 
2.3%
18
 
2.3%
18
 
2.3%
16
 
2.0%
15
 
1.9%
15
 
1.9%
14
 
1.8%
Other values (188) 593
75.9%
ASCII
ValueCountFrequency (%)
30
26.5%
( 25
22.1%
) 25
22.1%
2 4
 
3.5%
K 3
 
2.7%
- 3
 
2.7%
D 2
 
1.8%
R 2
 
1.8%
O 2
 
1.8%
0 2
 
1.8%
Other values (12) 15
13.3%
None
ValueCountFrequency (%)
27
100.0%

업종
Categorical

Distinct45
Distinct (%)45.9%
Missing0
Missing (%)0.0%
Memory size916.0 B
세차
28 
이화학실험시설
생물학적제제제조
세차시설
화장품제조업
 
4
Other values (40)
46 

Length

Max length24
Median length19.5
Mean length6.8367347
Min length2

Unique

Unique36 ?
Unique (%)36.7%

Sample

1st row의료업
2nd row운수
3rd row운수
4th row세차
5th row세차

Common Values

ValueCountFrequency (%)
세차 28
28.6%
이화학실험시설 8
 
8.2%
생물학적제제제조 6
 
6.1%
세차시설 6
 
6.1%
화장품제조업 4
 
4.1%
자동차세차업(95213) 3
 
3.1%
운수 3
 
3.1%
의료업 2
 
2.0%
토목건설업 2
 
2.0%
사진용화학제품 및 감광재료제조업 1
 
1.0%
Other values (35) 35
35.7%

Length

2024-01-28T22:38:23.278819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
세차 29
 
18.6%
11
 
7.1%
이화학실험시설 9
 
5.8%
세차시설 6
 
3.8%
생물학적제제제조 6
 
3.8%
제조업 6
 
3.8%
기타 5
 
3.2%
화장품제조업 4
 
2.6%
자동차세차업(95213 3
 
1.9%
운수 3
 
1.9%
Other values (62) 74
47.4%
Distinct97
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size916.0 B
2024-01-28T22:38:23.466217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length40
Mean length26.959184
Min length18

Characters and Unicode

Total characters2642
Distinct characters139
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique96 ?
Unique (%)98.0%

Sample

1st row인천광역시 연수구 원인재로 263(연수동)
2nd row인천광역시 연수구 먼우금로 1(동춘동)
3rd row인천광역시 연수구 먼우금로 2(동춘동)
4th row인천광역시 연수구 동곡재로 198(동춘동)
5th row인천광역시 연수구 독배로 33(옥련동)
ValueCountFrequency (%)
인천광역시 98
20.8%
연수구 96
20.4%
송도동 24
 
5.1%
갯벌로 8
 
1.7%
송도과학로 6
 
1.3%
아카데미로 5
 
1.1%
송도미래로 5
 
1.1%
아암대로 5
 
1.1%
먼우금로 5
 
1.1%
비류대로 4
 
0.8%
Other values (190) 215
45.6%
2024-01-28T22:38:23.777295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
375
 
14.2%
106
 
4.0%
1 105
 
4.0%
103
 
3.9%
103
 
3.9%
102
 
3.9%
101
 
3.8%
101
 
3.8%
98
 
3.7%
98
 
3.7%
Other values (129) 1350
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1581
59.8%
Decimal Number 449
 
17.0%
Space Separator 375
 
14.2%
Open Punctuation 89
 
3.4%
Close Punctuation 88
 
3.3%
Dash Punctuation 32
 
1.2%
Other Punctuation 14
 
0.5%
Uppercase Letter 9
 
0.3%
Math Symbol 2
 
0.1%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
106
 
6.7%
103
 
6.5%
103
 
6.5%
102
 
6.5%
101
 
6.4%
101
 
6.4%
98
 
6.2%
98
 
6.2%
98
 
6.2%
96
 
6.1%
Other values (102) 575
36.4%
Decimal Number
ValueCountFrequency (%)
1 105
23.4%
3 52
11.6%
0 45
10.0%
4 45
10.0%
5 42
 
9.4%
7 42
 
9.4%
2 41
 
9.1%
9 29
 
6.5%
8 25
 
5.6%
6 23
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
B 4
44.4%
A 2
22.2%
E 1
 
11.1%
R 1
 
11.1%
D 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 11
78.6%
# 1
 
7.1%
& 1
 
7.1%
: 1
 
7.1%
Lowercase Letter
ValueCountFrequency (%)
c 1
50.0%
i 1
50.0%
Space Separator
ValueCountFrequency (%)
375
100.0%
Open Punctuation
ValueCountFrequency (%)
( 89
100.0%
Close Punctuation
ValueCountFrequency (%)
) 88
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1582
59.9%
Common 1049
39.7%
Latin 11
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
106
 
6.7%
103
 
6.5%
103
 
6.5%
102
 
6.4%
101
 
6.4%
101
 
6.4%
98
 
6.2%
98
 
6.2%
98
 
6.2%
96
 
6.1%
Other values (103) 576
36.4%
Common
ValueCountFrequency (%)
375
35.7%
1 105
 
10.0%
( 89
 
8.5%
) 88
 
8.4%
3 52
 
5.0%
0 45
 
4.3%
4 45
 
4.3%
5 42
 
4.0%
7 42
 
4.0%
2 41
 
3.9%
Other values (9) 125
 
11.9%
Latin
ValueCountFrequency (%)
B 4
36.4%
A 2
18.2%
c 1
 
9.1%
E 1
 
9.1%
i 1
 
9.1%
R 1
 
9.1%
D 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1581
59.8%
ASCII 1060
40.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
375
35.4%
1 105
 
9.9%
( 89
 
8.4%
) 88
 
8.3%
3 52
 
4.9%
0 45
 
4.2%
4 45
 
4.2%
5 42
 
4.0%
7 42
 
4.0%
2 41
 
3.9%
Other values (16) 136
 
12.8%
Hangul
ValueCountFrequency (%)
106
 
6.7%
103
 
6.5%
103
 
6.5%
102
 
6.5%
101
 
6.4%
101
 
6.4%
98
 
6.2%
98
 
6.2%
98
 
6.2%
96
 
6.1%
Other values (102) 575
36.4%
None
ValueCountFrequency (%)
1
100.0%
Distinct65
Distinct (%)66.3%
Missing0
Missing (%)0.0%
Memory size916.0 B
2024-01-28T22:38:23.957083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length233
Median length103
Mean length46.673469
Min length19

Characters and Unicode

Total characters4574
Distinct characters166
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)58.2%

Sample

1st rowpH, BOD, COD, SS, n-H, 페놀, CN, Cr, Zn, Cu, Pb, T-N, T-P,대장균, 음이온계면활성제
2nd rowpH, COD, SS, n-H, ABS
3rd rowpH, COD, SS, n-H,ABS
4th rowpH, COD, SS, n-H,ABS, T-N, T-P
5th rowpH, COD, SS, n-H(광),ABS, T-N, T-P
ValueCountFrequency (%)
ph 90
 
11.8%
ss 80
 
10.5%
t-n 69
 
9.1%
t-p 68
 
8.9%
cod 67
 
8.8%
n-h(광),abs 28
 
3.7%
n-h(광 28
 
3.7%
bod 21
 
2.8%
abs 20
 
2.6%
toc 19
 
2.5%
Other values (122) 272
35.7%
2024-01-28T22:38:24.251940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 850
18.6%
671
14.7%
- 291
 
6.4%
S 262
 
5.7%
H 203
 
4.4%
T 200
 
4.4%
C 165
 
3.6%
O 126
 
2.8%
N 125
 
2.7%
n 117
 
2.6%
Other values (156) 1564
34.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1500
32.8%
Other Punctuation 857
18.7%
Other Letter 695
15.2%
Space Separator 671
14.7%
Lowercase Letter 335
 
7.3%
Dash Punctuation 291
 
6.4%
Open Punctuation 91
 
2.0%
Close Punctuation 91
 
2.0%
Decimal Number 37
 
0.8%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
11.4%
50
 
7.2%
28
 
4.0%
23
 
3.3%
23
 
3.3%
23
 
3.3%
23
 
3.3%
16
 
2.3%
14
 
2.0%
14
 
2.0%
Other values (106) 402
57.8%
Lowercase Letter
ValueCountFrequency (%)
n 117
34.9%
p 88
26.3%
r 23
 
6.9%
e 20
 
6.0%
u 19
 
5.7%
b 13
 
3.9%
l 8
 
2.4%
i 7
 
2.1%
o 7
 
2.1%
d 7
 
2.1%
Other values (10) 26
 
7.8%
Uppercase Letter
ValueCountFrequency (%)
S 262
17.5%
H 203
13.5%
T 200
13.3%
C 165
11.0%
O 126
8.4%
N 125
8.3%
P 117
7.8%
D 108
7.2%
B 90
 
6.0%
A 59
 
3.9%
Other values (5) 45
 
3.0%
Decimal Number
ValueCountFrequency (%)
1 14
37.8%
6 7
18.9%
4 6
16.2%
2 4
 
10.8%
8 3
 
8.1%
7 2
 
5.4%
9 1
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 850
99.2%
. 7
 
0.8%
Space Separator
ValueCountFrequency (%)
671
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 291
100.0%
Open Punctuation
ValueCountFrequency (%)
( 91
100.0%
Close Punctuation
ValueCountFrequency (%)
) 91
100.0%
Math Symbol
ValueCountFrequency (%)
+ 4
100.0%
Control
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2044
44.7%
Latin 1835
40.1%
Hangul 695
 
15.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
11.4%
50
 
7.2%
28
 
4.0%
23
 
3.3%
23
 
3.3%
23
 
3.3%
23
 
3.3%
16
 
2.3%
14
 
2.0%
14
 
2.0%
Other values (106) 402
57.8%
Latin
ValueCountFrequency (%)
S 262
14.3%
H 203
11.1%
T 200
10.9%
C 165
9.0%
O 126
 
6.9%
N 125
 
6.8%
n 117
 
6.4%
P 117
 
6.4%
D 108
 
5.9%
B 90
 
4.9%
Other values (25) 322
17.5%
Common
ValueCountFrequency (%)
, 850
41.6%
671
32.8%
- 291
 
14.2%
( 91
 
4.5%
) 91
 
4.5%
1 14
 
0.7%
. 7
 
0.3%
6 7
 
0.3%
4 6
 
0.3%
2 4
 
0.2%
Other values (5) 12
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3879
84.8%
Hangul 695
 
15.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 850
21.9%
671
17.3%
- 291
 
7.5%
S 262
 
6.8%
H 203
 
5.2%
T 200
 
5.2%
C 165
 
4.3%
O 126
 
3.2%
N 125
 
3.2%
n 117
 
3.0%
Other values (40) 869
22.4%
Hangul
ValueCountFrequency (%)
79
 
11.4%
50
 
7.2%
28
 
4.0%
23
 
3.3%
23
 
3.3%
23
 
3.3%
23
 
3.3%
16
 
2.3%
14
 
2.0%
14
 
2.0%
Other values (106) 402
57.8%

Correlations

2024-01-28T22:38:24.335940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명업종도로명 주소배출물질
업소명1.0000.9950.9970.995
업종0.9951.0000.0000.998
도로명 주소0.9970.0001.0000.980
배출물질0.9950.9980.9801.000

Missing values

2024-01-28T22:38:22.565106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T22:38:22.639444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명업종도로명 주소배출물질
0인천적십자병원의료업인천광역시 연수구 원인재로 263(연수동)pH, BOD, COD, SS, n-H, 페놀, CN, Cr, Zn, Cu, Pb, T-N, T-P,대장균, 음이온계면활성제
1강인여객운수인천광역시 연수구 먼우금로 1(동춘동)pH, COD, SS, n-H, ABS
2도영운수㈜외2개소운수인천광역시 연수구 먼우금로 2(동춘동)pH, COD, SS, n-H,ABS
3뉴신용카세차장세차인천광역시 연수구 동곡재로 198(동춘동)pH, COD, SS, n-H,ABS, T-N, T-P
4팔팔현대공업사세차인천광역시 연수구 독배로 33(옥련동)pH, COD, SS, n-H(광),ABS, T-N, T-P
5새한공업사대성덴트세차인천광역시 연수구 비류대로256번길 8-4pH, COD, SS, n-H(광),ABS, T-N, T-P
6에덴세차장세차인천광역시 연수구 학나래로 67(선학동)pH, COD, SS, n-H(광),ABS
7한국가스공사인천기지본부가스제조인천광역시 연수구 인천신항대로 960 (송도동)78. 산업시설의 정수시설 pH, COD, SS, 82. 그 밖의 배출시설 pH, COD, SS, n-H(광), 79. 이화학시험시설 전항목
8동춘카세차장세차인천광역시 연수구 앵고개로 119(동춘동)pH, COD, SS, n-H(광),ABS, T-N, T-P
9아우토반자동차공업사세차인천광역시 연수구 비류대로278번길 8-22(청학동)pH, COD, SS, n-H(광),ABS
업소명업종도로명 주소배출물질
88(주)와이지-원교육연구시설(연구소)인천광역시 연수구 송도과학로16번길 13-40 (송도동)pH, TOC, SS, n-H(광), T-N, T-P, 생태독성
89(주)듀비스영상 및 음향기기 제조인천광역시 연수구 갯벌로 84 (송도동)pH, TOC, SS, n-H(광),T-N, T-P, 생태독성
90풍림산업(주)토목건설업인천광역시 연수구 송도동 386일대pH, BOD, TOC, SS, n-H(광),T-N, T-P, 생태독성
91헬러만타이툰(유)기타 플라스틱 발포 성형제품 제조업인천광역시 벤처로12번길 28 (송도동)pH, TOC, SS, n-H(광), T-N, T-P, 생태독성
92(주)태리네트웍스인천 송도신도시지점주유소운영업인천광역시 연수구 아암대로 809(동춘동 913-3)pH, TOC, SS, n-H(광), ABS, T-N, T-P, 생태독성
93풍림산업(주)토목건설업인천광역시 연수구 송도동 431-4 (수직구 #3)pH, BOD, TOC, SS, n-H(광),T-N, T-P, 생태독성
94소버린이피에스(주)자동차세차업(95213)인천광역시 연수구 인천타워대로197번길 16, 지하1층(B105) (송도동)pH, TOC, SS, n-H(광), ABS, T-N, T-P
95주식회사 와이제이양지자동차세차업(95213)인천광역시 연수구 먼우금로 222번길 46(연수동 594)pH, TOC, SS, n-H(광), ABS, T-N, T-P
96주식회사 노터스전문, 과학 및 기술서비스업(730003)인천광역시 아카데미로 19번길 38(송도동)pH, TOC, SS, n-H(광), 자일렌, T-N, T-P, 생태독성
97워시존 실내세차장 연수점자동차세차업(95213)인천광역시 연수구 한나루로 153(옥련동)pH, TOC, SS, n-H(광), ABS, T-N, T-P