Overview

Dataset statistics

Number of variables: 5
Number of observations: 29
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.3 KiB
Average record size in memory: 44.6 B
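
The memory figures can be cross-checked with simple arithmetic. The sketch below rests on assumptions the report does not state: that each column stores one 8-byte value per row, and that a 132-byte row index is counted once in the dataset total and also included in the 364.0 B "Memory size" reported for each variable. Under that reading the numbers line up.

```python
# Assumptions (not stated in the report): 8 bytes per cell, and a 132-byte
# row index counted once in the dataset total but included in every
# per-variable "Memory size" figure.
N_ROWS = 29
N_COLS = 5
BYTES_PER_CELL = 8
INDEX_BYTES = 132

column_bytes = N_ROWS * BYTES_PER_CELL            # data held by one column
per_variable = column_bytes + INDEX_BYTES         # figure shown per variable
total_bytes = N_COLS * column_bytes + INDEX_BYTES # whole dataset

print(f"per variable: {per_variable} B")
print(f"total: {total_bytes / 1024:.1f} KiB")
print(f"average record: {total_bytes / N_ROWS:.1f} B")
```

If the assumed index size or cell width differs in another environment, the per-variable and total figures shift accordingly.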

Variable types

Text: 3
DateTime: 1
Categorical: 1

Dataset

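Description: Public information on businesses within Yongin City found in violation of environmental regulations.
Author: 경기도 용인시 (Yongin City, Gyeonggi Province)
URL: https://www.data.go.kr/data/3072098/fileData.do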

Alerts

처분일 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 08:46:27.597698
Analysis finished: 2023-12-12 08:46:28.017404
Duration: 0.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
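
The reported duration follows directly from the two timestamps. A minimal check using only the standard library (the format string is an assumption about how the timestamps are written):

```python
from datetime import datetime

# Timestamp layout assumed from the values shown in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 08:46:27.597698", FMT)
finished = datetime.strptime("2023-12-12 08:46:28.017404", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```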

Variables

업소명 (business name)
Text

Distinct: 21
Distinct (%): 72.4%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 13
Median length: 11
Mean length: 9
Min length: 5

Characters and Unicode

Total characters: 261
Distinct characters: 90
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
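
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The three input strings are sample values from this variable; the helper name `category_counts` is hypothetical. The two-letter codes map to the names used in the report: `Lo` is Other Letter, `Ps` is Open Punctuation, `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter

# Three sample values taken from this variable.
samples = ["(주)조양", "(주)아우조건설", "경희대학교국제캠퍼스"]

def category_counts(values):
    """Count the Unicode general category of every character in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

counts = category_counts(samples)
for code, n in counts.most_common():
    print(code, n)
```

The standard library does not expose script or block properties, so those two breakdowns would need a lookup table or a third-party package.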

Unique

Unique: 16
Unique (%): 55.2%
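
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. The sketch below reproduces this variable's figures from its value counts. The labels are hypothetical placeholders; only the counts (4, 3, 2, 2, 2 and sixteen singletons) come from the report.

```python
from collections import Counter

# Hypothetical labels; the repeat counts are the ones reported for this variable.
values = (
    ["value_a"] * 4
    + ["value_b"] * 3
    + ["value_c"] * 2
    + ["value_d"] * 2
    + ["value_e"] * 2
    + [f"single_{i}" for i in range(16)]  # values that occur exactly once
)

freq = Counter(values)
n = len(values)
distinct = len(freq)                                # different values
unique = sum(1 for c in freq.values() if c == 1)    # values seen once

print(f"Distinct: {distinct} ({distinct / n:.1%})")
print(f"Unique: {unique} ({unique / n:.1%})")
```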

Sample

1st row: (주)조양
2nd row: (주)아우조건설
3rd row: 경희대학교국제캠퍼스
4th row: 도원자동차공업(주)
5th row: (주)현대삼일자동차공업사
Value | Count | Frequency (%)
해창개발(주 | 4 | 13.8%
삼화콘덴서공업(주 | 3 | 10.3%
주)조양 | 2 | 6.9%
현암자동차공업사(유 | 2 | 6.9%
알지비하이텍(주 | 2 | 6.9%
주)현대삼일자동차공업사 | 1 | 3.4%
주)엠스틸이앤씨 | 1 | 3.4%
삼성특수기기 | 1 | 3.4%
백석토건(주 | 1 | 3.4%
경희대학교국제캠퍼스 | 1 | 3.4%
Other values (11) | 11 | 37.9%

Most occurring characters

ValueCountFrequency (%)
( 26
 
10.0%
) 26
 
10.0%
24
 
9.2%
9
 
3.4%
9
 
3.4%
7
 
2.7%
6
 
2.3%
6
 
2.3%
5
 
1.9%
5
 
1.9%
Other values (80) 138
52.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 209 | 80.1%
Open Punctuation | 26 | 10.0%
Close Punctuation | 26 | 10.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
11.5%
9
 
4.3%
9
 
4.3%
7
 
3.3%
6
 
2.9%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
Other values (78) 130
62.2%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 209 | 80.1%
Common | 52 | 19.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
11.5%
9
 
4.3%
9
 
4.3%
7
 
3.3%
6
 
2.9%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
Other values (78) 130
62.2%
Common
ValueCountFrequency (%)
( 26
50.0%
) 26
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 209 | 80.1%
ASCII | 52 | 19.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 26
50.0%
) 26
50.0%
Hangul
ValueCountFrequency (%)
24
 
11.5%
9
 
4.3%
9
 
4.3%
7
 
3.3%
6
 
2.9%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
Other values (78) 130
62.2%

소재지 (location)
Text

Distinct: 27
Distinct (%): 93.1%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 29
Median length: 22
Mean length: 16.275862
Min length: 10

Characters and Unicode

Total characters: 472
Distinct characters: 62
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 25
Unique (%): 86.2%

Sample

1st row: 용인시 처인구 양지면 은이로 16
2nd row: 경기도 용인시 수지구 풍덕천동 741번지
3rd row: 경기도 용인시 기흥구 덕영대로 1732
4th row: 용인시 양지면 남곡리 489-7
5th row: 용인시 수지구 풍덕천동 55
Value | Count | Frequency (%)
용인시 | 12 | 10.6%
수지구 | 10 | 8.8%
처인구 | 10 | 8.8%
기흥구 | 7 | 6.2%
풍덕천동 | 6 | 5.3%
양지면 | 5 | 4.4%
경기도 | 4 | 3.5%
124 | 3 | 2.7%
북리 | 3 | 2.7%
남사면 | 3 | 2.7%
Other values (43) | 50 | 44.2%

Most occurring characters

ValueCountFrequency (%)
84
 
17.8%
29
 
6.1%
1 24
 
5.1%
22
 
4.7%
18
 
3.8%
2 17
 
3.6%
17
 
3.6%
- 16
 
3.4%
3 14
 
3.0%
0 13
 
2.8%
Other values (52) 218
46.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 259 | 54.9%
Decimal Number | 113 | 23.9%
Space Separator | 84 | 17.8%
Dash Punctuation | 16 | 3.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
11.2%
22
 
8.5%
18
 
6.9%
17
 
6.6%
12
 
4.6%
12
 
4.6%
11
 
4.2%
10
 
3.9%
10
 
3.9%
10
 
3.9%
Other values (40) 108
41.7%
Decimal Number
ValueCountFrequency (%)
1 24
21.2%
2 17
15.0%
3 14
12.4%
0 13
11.5%
7 12
10.6%
5 12
10.6%
4 10
8.8%
8 7
 
6.2%
6 2
 
1.8%
9 2
 
1.8%
Space Separator
ValueCountFrequency (%)
84
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 259 | 54.9%
Common | 213 | 45.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
11.2%
22
 
8.5%
18
 
6.9%
17
 
6.6%
12
 
4.6%
12
 
4.6%
11
 
4.2%
10
 
3.9%
10
 
3.9%
10
 
3.9%
Other values (40) 108
41.7%
Common
ValueCountFrequency (%)
84
39.4%
1 24
 
11.3%
2 17
 
8.0%
- 16
 
7.5%
3 14
 
6.6%
0 13
 
6.1%
7 12
 
5.6%
5 12
 
5.6%
4 10
 
4.7%
8 7
 
3.3%
Other values (2) 4
 
1.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 259 | 54.9%
ASCII | 213 | 45.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
84
39.4%
1 24
 
11.3%
2 17
 
8.0%
- 16
 
7.5%
3 14
 
6.6%
0 13
 
6.1%
7 12
 
5.6%
5 12
 
5.6%
4 10
 
4.7%
8 7
 
3.3%
Other values (2) 4
 
1.9%
Hangul
ValueCountFrequency (%)
29
 
11.2%
22
 
8.5%
18
 
6.9%
17
 
6.6%
12
 
4.6%
12
 
4.6%
11
 
4.2%
10
 
3.9%
10
 
3.9%
10
 
3.9%
Other values (40) 108
41.7%

위반법규 (violated regulation)
Text

Distinct: 17
Distinct (%): 58.6%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 30
Median length: 27
Mean length: 23.344828
Min length: 12

Characters and Unicode

Total characters: 677
Distinct characters: 63
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 11
Unique (%): 37.9%

Sample

1st row: 법 제32조 배출허용기준 초과
2nd row: 법 제32조 배출허용기준 초과
3rd row: 법 제33조제2항(변경신고 미이행)
4th row: 미신고 폐수배출시설 설치 운영 법 제31조1항
5th row: 법 제38조제3항(폐수배출시설 운영일지 미기록)
ValueCountFrequency (%)
수질 19
11.1%
보전에 19
11.1%
관한 19
11.1%
19
11.1%
수생태계 18
10.5%
법률 18
10.5%
제39조 13
7.6%
6
 
3.5%
초과 4
 
2.3%
배출허용기준 4
 
2.3%
Other values (26) 32
18.7%

Most occurring characters

ValueCountFrequency (%)
142
21.0%
44
 
6.5%
33
 
4.9%
3 32
 
4.7%
27
 
4.0%
27
 
4.0%
21
 
3.1%
21
 
3.1%
21
 
3.1%
21
 
3.1%
Other values (53) 288
42.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 448 | 66.2%
Space Separator | 142 | 21.0%
Decimal Number | 60 | 8.9%
Uppercase Letter | 17 | 2.5%
Close Punctuation | 3 | 0.4%
Open Punctuation | 3 | 0.4%
Lowercase Letter | 2 | 0.3%
Other Punctuation | 2 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
9.8%
33
 
7.4%
27
 
6.0%
27
 
6.0%
21
 
4.7%
21
 
4.7%
21
 
4.7%
21
 
4.7%
21
 
4.7%
21
 
4.7%
Other values (37) 191
42.6%
Decimal Number
ValueCountFrequency (%)
3 32
53.3%
9 14
23.3%
2 7
 
11.7%
1 3
 
5.0%
5 2
 
3.3%
8 2
 
3.3%
Uppercase Letter
ValueCountFrequency (%)
S 6
35.3%
D 3
17.6%
O 3
17.6%
C 3
17.6%
N 2
 
11.8%
Space Separator
ValueCountFrequency (%)
142
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Lowercase Letter
ValueCountFrequency (%)
i 2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 448 | 66.2%
Common | 210 | 31.0%
Latin | 19 | 2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
9.8%
33
 
7.4%
27
 
6.0%
27
 
6.0%
21
 
4.7%
21
 
4.7%
21
 
4.7%
21
 
4.7%
21
 
4.7%
21
 
4.7%
Other values (37) 191
42.6%
Common
ValueCountFrequency (%)
142
67.6%
3 32
 
15.2%
9 14
 
6.7%
2 7
 
3.3%
) 3
 
1.4%
( 3
 
1.4%
1 3
 
1.4%
, 2
 
1.0%
5 2
 
1.0%
8 2
 
1.0%
Latin
ValueCountFrequency (%)
S 6
31.6%
D 3
15.8%
O 3
15.8%
C 3
15.8%
i 2
 
10.5%
N 2
 
10.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 448 | 66.2%
ASCII | 229 | 33.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
142
62.0%
3 32
 
14.0%
9 14
 
6.1%
2 7
 
3.1%
S 6
 
2.6%
D 3
 
1.3%
) 3
 
1.3%
O 3
 
1.3%
( 3
 
1.3%
1 3
 
1.3%
Other values (6) 13
 
5.7%
Hangul
ValueCountFrequency (%)
44
 
9.8%
33
 
7.4%
27
 
6.0%
27
 
6.0%
21
 
4.7%
21
 
4.7%
21
 
4.7%
21
 
4.7%
21
 
4.7%
21
 
4.7%
Other values (37) 191
42.6%

처분일
Date

UNIQUE 

Distinct: 29
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B
Minimum: 2010-03-17 00:00:00
Maximum: 2014-03-05 00:00:00
Histogram with fixed size bins (bins=29)

행정처분내용 (administrative action)
Categorical

Distinct: 12
Distinct (%): 41.4%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 14
Median length: 4
Mean length: 5.3793103
Min length: 2

Unique

Unique: 10
Unique (%): 34.5%

Sample

1st row: 개선명령
2nd row: 개선명령
3rd row: 경고
4th row: 사용중지명령
5th row: 경고

Common Values

Value | Count | Frequency (%)
개선명령 | 16 | 55.2%
경고 | 3 | 10.3%
사용중지명령 | 1 | 3.4%
1차 개선명령 | 1 | 3.4%
폐쇄 | 1 | 3.4%
경고 및 과태료60만원 | 1 | 3.4%
경고 및 과태료 100만원 | 1 | 3.4%
경고 및 과태료 60만원 | 1 | 3.4%
과태료 200만원 | 1 | 3.4%
고발 | 1 | 3.4%
Other values (2) | 2 | 6.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
개선명령 17
38.6%
경고 8
18.2%
5
 
11.4%
과태료 4
 
9.1%
사용중지명령 1
 
2.3%
1차 1
 
2.3%
폐쇄 1
 
2.3%
과태료60만원 1
 
2.3%
100만원 1
 
2.3%
60만원 1
 
2.3%
Other values (4) 4
 
9.1%

Correlations

 | 업소명 | 소재지 | 위반법규 | 처분일 | 행정처분내용
업소명 | 1.000 | 1.000 | 0.816 | 1.000 | 0.824
소재지 | 1.000 | 1.000 | 0.967 | 1.000 | 0.953
위반법규 | 0.816 | 0.967 | 1.000 | 1.000 | 0.953
처분일 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
행정처분내용 | 0.824 | 0.953 | 0.953 | 1.000 | 1.000
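
The report does not state which association measure fills this matrix. One common choice for non-numeric columns is Cramér's V; the sketch below is a plain, uncorrected version, offered as an assumption rather than as the tool's actual computation. Under that assumption it shows why a column whose values are all unique, such as 처분일, would score 1.000 against any other column: each row forms its own category, so the other column is fully determined. The function name `cramers_v` is hypothetical, and the inputs are the first five sample rows.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    rows, cols = Counter(xs), Counter(ys)
    chi2 = 0.0
    for x, row_total in rows.items():
        for y, col_total in cols.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(rows), len(cols)) - 1
    if k == 0:  # a constant column carries no association
        return 0.0
    return math.sqrt(chi2 / (n * k))

# First five sample rows: 처분일 (all unique) and 행정처분내용.
dates = ["2014-03-05", "2014-02-18", "2014-02-04", "2013-11-06", "2013-10-30"]
actions = ["개선명령", "개선명령", "경고", "사용중지명령", "경고"]

v = cramers_v(dates, actions)
print(round(v, 3))
```

With only 29 rows and many near-unique text columns, values close to 1 under such a measure say more about the column cardinalities than about any real relationship.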

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

 | 업소명 | 소재지 | 위반법규 | 처분일 | 행정처분내용
0 | (주)조양 | 용인시 처인구 양지면 은이로 16 | 법 제32조 배출허용기준 초과 | 2014-03-05 | 개선명령
1 | (주)아우조건설 | 경기도 용인시 수지구 풍덕천동 741번지 | 법 제32조 배출허용기준 초과 | 2014-02-18 | 개선명령
2 | 경희대학교국제캠퍼스 | 경기도 용인시 기흥구 덕영대로 1732 | 법 제33조제2항(변경신고 미이행) | 2014-02-04 | 경고
3 | 도원자동차공업(주) | 용인시 양지면 남곡리 489-7 | 미신고 폐수배출시설 설치 운영 법 제31조1항 | 2013-11-06 | 사용중지명령
4 | (주)현대삼일자동차공업사 | 용인시 수지구 풍덕천동 55 | 법 제38조제3항(폐수배출시설 운영일지 미기록) | 2013-10-30 | 경고
5 | 삼화콘덴서공업(주) | 용인시 남사면 북리 124 | Ni 배출허용기준 초과 | 2013-09-16 | 1차 개선명령
6 | 삼화콘덴서공업(주) | 용인시 처인구 남사면 북리 124 | 제39조 위반 (COD,SS,Ni 배출허용기준 초과) | 2013-09-11 | 개선명령
7 | 현대자동차공업사 | 처인구 역북동 528-2 | 수질 및 수생태계 보전에 관한 법률 제 | 2010-09-15 | 폐쇄
8 | 삼화콘덴서공업(주) | 처인구 남사면 북리 124 | 수질 및 수생태계 보전에 관한 법률 제39조 | 2010-06-03 | 개선명령
9 | (주)케이씨씨중앙연구소 | 기흥구 마북동 83 | 수질 및 수생태계 보전에 관한 법률제39조 | 2010-04-05 | 개선명령

Last rows

 | 업소명 | 소재지 | 위반법규 | 처분일 | 행정처분내용
19 | (주)대홍에이스건업 | 기흥구 고매동 576-1 | 수질 및 수생태계 보전에 관한 법률 제39조 COD초과 | 2013-07-23 | 개선명령
20 | 성신양회(주)용인공장 | 처인구 마평동 540-1 | 수질 및 수생태계 보전에 관한 법률 제35조제2항 | 2013-04-15 | 과태료 200만원
21 | 해창개발(주) | 수지구 풍덕천동 1033 | 법 제15조 공공수역 토사유출 | 2013-03-14 | 고발
22 | 백석토건(주) | 수지구 동천동 320 | 수질 및 수생태계 보전에 관한 법률 제39조 COD초과 | 2013-01-30 | 개선명령
23 | 해창개발(주) | 수지구 풍덕천동 1033 | 수질 및 수생태계 보전에 관한 법률 제39조 SS초과 | 2013-01-22 | 개선명령
24 | 해창개발(주) | 용인시 수지구 상현동 195-1외 1필지 | 수질 및 수생태계 보전에 관한 법률 제39조 | 2013-01-08 | 개선명령
25 | 해창개발(주) | 용인시 수지구 풍덕천동 1033 | 수질 및 수생태계 보전에 관한 법률 제39조 | 2013-01-04 | 개선명령
26 | 삼성특수기기 | 경기도 용인시 처인구 원삼면 생안로 28 | 수질및수생태계보전에관한법률제33조제2항 | 2011-12-28 | 경고 및 과태료부과
27 | 알지비하이텍(주) | 경기도 용인시 처인구 양지면 주북로173번길 82-7 | 수질및수생태계보전에관한법률제33조제2항 | 2011-12-13 | 경고 및 과태료 부과
28 | 알지비하이텍(주) | 용인시 처인구 양지면 주북로 173번길 82-7 | 수질 및 생태계 보전에 관한 법률 제32조 | 2011-11-15 | 경고