Overview

Dataset statistics

Number of variables5
Number of observations1090
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory42.7 KiB
Average record size in memory40.1 B

Variable types

Categorical4
Text1

Dataset

Description관내 환경오염물질 배출시설 현황에 대한 데이터로 시군명, 시설구분, 시설명, 관리기관명 데이터기준일 항목을 제공합니다.
Author경기도 양주시
URLhttps://www.data.go.kr/data/3076910/fileData.do

Alerts

시군명 has constant value ""Constant
관리기관명 has constant value ""Constant
데이터기준일 has constant value ""Constant

Reproduction

Analysis started2023-12-12 10:03:26.271926
Analysis finished2023-12-12 10:03:26.730998
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.6 KiB
양주시
1090 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양주시
2nd row양주시
3rd row양주시
4th row양주시
5th row양주시

Common Values

ValueCountFrequency (%)
양주시 1090
100.0%

Length

2023-12-12T19:03:26.795758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:03:26.893370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양주시 1090
100.0%

시설구분
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size8.6 KiB
대기오염물질배출시설
468 
수질오염물질배출시설
406 
수질오염물질배출시설, 대기오염물질배출시설
216 

Length

Max length22
Median length10
Mean length12.377982
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대기오염물질배출시설
2nd row수질오염물질배출시설
3rd row대기오염물질배출시설
4th row대기오염물질배출시설
5th row대기오염물질배출시설

Common Values

ValueCountFrequency (%)
대기오염물질배출시설 468
42.9%
수질오염물질배출시설 406
37.2%
수질오염물질배출시설, 대기오염물질배출시설 216
19.8%

Length

2023-12-12T19:03:27.043726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:03:27.197249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대기오염물질배출시설 684
52.4%
수질오염물질배출시설 622
47.6%
Distinct1079
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size8.6 KiB
2023-12-12T19:03:27.464643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length19
Mean length5.7605505
Min length2

Characters and Unicode

Total characters6279
Distinct characters454
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1069 ?
Unique (%)98.1%

Sample

1st row㈜경기북부자동차공업사
2nd row㈜미진식품
3rd row㈜보성공업
4th row㈜부림화이바
5th row㈜비엠(B.M) 케미칼
ValueCountFrequency (%)
주식회사 19
 
1.6%
농업회사법인 4
 
0.3%
우일섬유 3
 
0.3%
양주지점 3
 
0.3%
태강 2
 
0.2%
㈜피제이메텍 2
 
0.2%
㈜영신물산 2
 
0.2%
커스텀튜브매뉴팩처링(유 2
 
0.2%
옥정신도시주유소 2
 
0.2%
양주점 2
 
0.2%
Other values (1110) 1121
96.5%
2023-12-12T19:03:28.021367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
434
 
6.9%
220
 
3.5%
153
 
2.4%
152
 
2.4%
136
 
2.2%
128
 
2.0%
119
 
1.9%
116
 
1.8%
111
 
1.8%
97
 
1.5%
Other values (444) 4613
73.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5565
88.6%
Other Symbol 434
 
6.9%
Uppercase Letter 107
 
1.7%
Space Separator 72
 
1.1%
Decimal Number 30
 
0.5%
Open Punctuation 22
 
0.4%
Close Punctuation 22
 
0.4%
Other Punctuation 18
 
0.3%
Dash Punctuation 9
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
220
 
4.0%
153
 
2.7%
152
 
2.7%
136
 
2.4%
128
 
2.3%
119
 
2.1%
116
 
2.1%
111
 
2.0%
97
 
1.7%
86
 
1.5%
Other values (406) 4247
76.3%
Uppercase Letter
ValueCountFrequency (%)
S 12
 
11.2%
E 10
 
9.3%
C 9
 
8.4%
T 9
 
8.4%
M 8
 
7.5%
P 6
 
5.6%
K 6
 
5.6%
L 6
 
5.6%
G 5
 
4.7%
D 5
 
4.7%
Other values (11) 31
29.0%
Decimal Number
ValueCountFrequency (%)
2 13
43.3%
1 6
20.0%
7 2
 
6.7%
4 2
 
6.7%
3 2
 
6.7%
5 2
 
6.7%
6 1
 
3.3%
9 1
 
3.3%
8 1
 
3.3%
Other Punctuation
ValueCountFrequency (%)
& 10
55.6%
. 7
38.9%
, 1
 
5.6%
Other Symbol
ValueCountFrequency (%)
434
100.0%
Space Separator
ValueCountFrequency (%)
72
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5999
95.5%
Common 173
 
2.8%
Latin 107
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
434
 
7.2%
220
 
3.7%
153
 
2.6%
152
 
2.5%
136
 
2.3%
128
 
2.1%
119
 
2.0%
116
 
1.9%
111
 
1.9%
97
 
1.6%
Other values (407) 4333
72.2%
Latin
ValueCountFrequency (%)
S 12
 
11.2%
E 10
 
9.3%
C 9
 
8.4%
T 9
 
8.4%
M 8
 
7.5%
P 6
 
5.6%
K 6
 
5.6%
L 6
 
5.6%
G 5
 
4.7%
D 5
 
4.7%
Other values (11) 31
29.0%
Common
ValueCountFrequency (%)
72
41.6%
( 22
 
12.7%
) 22
 
12.7%
2 13
 
7.5%
& 10
 
5.8%
- 9
 
5.2%
. 7
 
4.0%
1 6
 
3.5%
7 2
 
1.2%
4 2
 
1.2%
Other values (6) 8
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5565
88.6%
None 434
 
6.9%
ASCII 280
 
4.5%

Most frequent character per block

None
ValueCountFrequency (%)
434
100.0%
Hangul
ValueCountFrequency (%)
220
 
4.0%
153
 
2.7%
152
 
2.7%
136
 
2.4%
128
 
2.3%
119
 
2.1%
116
 
2.1%
111
 
2.0%
97
 
1.7%
86
 
1.5%
Other values (406) 4247
76.3%
ASCII
ValueCountFrequency (%)
72
25.7%
( 22
 
7.9%
) 22
 
7.9%
2 13
 
4.6%
S 12
 
4.3%
E 10
 
3.6%
& 10
 
3.6%
- 9
 
3.2%
C 9
 
3.2%
T 9
 
3.2%
Other values (27) 92
32.9%

관리기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.6 KiB
양주시 허가과
1090 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양주시 허가과
2nd row양주시 허가과
3rd row양주시 허가과
4th row양주시 허가과
5th row양주시 허가과

Common Values

ValueCountFrequency (%)
양주시 허가과 1090
100.0%

Length

2023-12-12T19:03:28.207807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:03:28.336157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양주시 1090
50.0%
허가과 1090
50.0%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.6 KiB
2023-11-22
1090 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-11-22
2nd row2023-11-22
3rd row2023-11-22
4th row2023-11-22
5th row2023-11-22

Common Values

ValueCountFrequency (%)
2023-11-22 1090
100.0%

Length

2023-12-12T19:03:28.467235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:03:28.599916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-11-22 1090
100.0%

Missing values

2023-12-12T19:03:26.577433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:03:26.691524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명시설구분시설명관리기관명데이터기준일
0양주시대기오염물질배출시설㈜경기북부자동차공업사양주시 허가과2023-11-22
1양주시수질오염물질배출시설㈜미진식품양주시 허가과2023-11-22
2양주시대기오염물질배출시설㈜보성공업양주시 허가과2023-11-22
3양주시대기오염물질배출시설㈜부림화이바양주시 허가과2023-11-22
4양주시대기오염물질배출시설㈜비엠(B.M) 케미칼양주시 허가과2023-11-22
5양주시수질오염물질배출시설, 대기오염물질배출시설㈜서광비엠비양주시 허가과2023-11-22
6양주시대기오염물질배출시설㈜애니테크 상수지점양주시 허가과2023-11-22
7양주시수질오염물질배출시설, 대기오염물질배출시설㈜일신분체양주시 허가과2023-11-22
8양주시수질오염물질배출시설㈜제이에스푸드양주시 허가과2023-11-22
9양주시수질오염물질배출시설, 대기오염물질배출시설㈜청해염업-폐업양주시 허가과2023-11-22
시군명시설구분시설명관리기관명데이터기준일
1080양주시대기오염물질배출시설㈜두림2공장양주시 허가과2023-11-22
1081양주시수질오염물질배출시설㈜엘제이앤텍양주시 허가과2023-11-22
1082양주시대기오염물질배출시설㈜지텍양주시 허가과2023-11-22
1083양주시대기오염물질배출시설강남자동차정비검사소양주시 허가과2023-11-22
1084양주시대기오염물질배출시설㈜다윈텍스타일양주시 허가과2023-11-22
1085양주시대기오염물질배출시설신일컴퍼니양주시 허가과2023-11-22
1086양주시대기오염물질배출시설블루모터스양주시 허가과2023-11-22
1087양주시대기오염물질배출시설㈜카우스양주시 허가과2023-11-22
1088양주시수질오염물질배출시설워시카카(양주옥정점)양주시 허가과2023-11-22
1089양주시수질오염물질배출시설㈜바른이앤씨양주시 허가과2023-11-22