Overview

Dataset statistics

Number of variables5
Number of observations361
Missing cells0
Missing cells (%)0.0%
Duplicate rows3
Duplicate rows (%)0.8%
Total size in memory14.2 KiB
Average record size in memory40.4 B

Variable types

Categorical3
Text1
DateTime1

Dataset

Description성남시 환경오염물질배출시설 점검 현황입니다.(시설구분,시설명,점검일자 등) ※ 올해 점검 시설에 대한 데이터임
URLhttps://www.data.go.kr/data/15037068/fileData.do

Alerts

시군명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
Dataset has 3 (0.8%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 23:14:01.231719
Analysis finished2023-12-12 23:14:01.624138
Duration0.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
성남시
361 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row성남시
2nd row성남시
3rd row성남시
4th row성남시
5th row성남시

Common Values

ValueCountFrequency (%)
성남시 361
100.0%

Length

2023-12-13T08:14:01.717676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:14:01.873155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
성남시 361
100.0%

시설구분
Categorical

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
폐수배출시설
204 
대기오염물질배출시설
157 

Length

Max length10
Median length6
Mean length7.7396122
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐수배출시설
2nd row폐수배출시설
3rd row폐수배출시설
4th row폐수배출시설
5th row폐수배출시설

Common Values

ValueCountFrequency (%)
폐수배출시설 204
56.5%
대기오염물질배출시설 157
43.5%

Length

2023-12-13T08:14:02.009143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:14:02.148093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐수배출시설 204
56.5%
대기오염물질배출시설 157
43.5%
Distinct289
Distinct (%)80.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-13T08:14:02.418522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length22
Mean length10.263158
Min length3

Characters and Unicode

Total characters3705
Distinct characters346
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique228 ?
Unique (%)63.2%

Sample

1st row(주)대일교통
2nd row에스케이판교충전소
3rd row고등카 WASH
4th row(주)새서울석유 동판교
5th row(주)새서울석유 세종
ValueCountFrequency (%)
주식회사 17
 
3.5%
분당점 7
 
1.4%
현대오일뱅크(주)직영 6
 
1.2%
한성자동차(주 5
 
1.0%
농협성남유통센터 4
 
0.8%
주)농협하나로유통 4
 
0.8%
클린손세차 3
 
0.6%
분당차병원(여성병원 3
 
0.6%
세차장 3
 
0.6%
판교 3
 
0.6%
Other values (339) 432
88.7%
2023-12-13T08:14:02.839280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
203
 
5.5%
( 172
 
4.6%
) 172
 
4.6%
126
 
3.4%
108
 
2.9%
82
 
2.2%
82
 
2.2%
76
 
2.1%
65
 
1.8%
63
 
1.7%
Other values (336) 2556
69.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3175
85.7%
Open Punctuation 172
 
4.6%
Close Punctuation 172
 
4.6%
Space Separator 126
 
3.4%
Uppercase Letter 28
 
0.8%
Decimal Number 17
 
0.5%
Lowercase Letter 11
 
0.3%
Other Symbol 3
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
203
 
6.4%
108
 
3.4%
82
 
2.6%
82
 
2.6%
76
 
2.4%
65
 
2.0%
63
 
2.0%
61
 
1.9%
60
 
1.9%
58
 
1.8%
Other values (305) 2317
73.0%
Uppercase Letter
ValueCountFrequency (%)
K 4
14.3%
S 4
14.3%
H 4
14.3%
C 2
7.1%
G 2
7.1%
V 2
7.1%
I 2
7.1%
P 2
7.1%
W 2
7.1%
A 2
7.1%
Other values (2) 2
7.1%
Decimal Number
ValueCountFrequency (%)
1 9
52.9%
6 2
 
11.8%
5 2
 
11.8%
8 1
 
5.9%
4 1
 
5.9%
3 1
 
5.9%
9 1
 
5.9%
Lowercase Letter
ValueCountFrequency (%)
f 2
18.2%
l 2
18.2%
e 2
18.2%
s 2
18.2%
c 1
9.1%
r 1
9.1%
a 1
9.1%
Open Punctuation
ValueCountFrequency (%)
( 172
100.0%
Close Punctuation
ValueCountFrequency (%)
) 172
100.0%
Space Separator
ValueCountFrequency (%)
126
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3178
85.8%
Common 488
 
13.2%
Latin 39
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
203
 
6.4%
108
 
3.4%
82
 
2.6%
82
 
2.6%
76
 
2.4%
65
 
2.0%
63
 
2.0%
61
 
1.9%
60
 
1.9%
58
 
1.8%
Other values (306) 2320
73.0%
Latin
ValueCountFrequency (%)
K 4
 
10.3%
S 4
 
10.3%
H 4
 
10.3%
C 2
 
5.1%
G 2
 
5.1%
f 2
 
5.1%
l 2
 
5.1%
e 2
 
5.1%
s 2
 
5.1%
V 2
 
5.1%
Other values (9) 13
33.3%
Common
ValueCountFrequency (%)
( 172
35.2%
) 172
35.2%
126
25.8%
1 9
 
1.8%
6 2
 
0.4%
5 2
 
0.4%
8 1
 
0.2%
, 1
 
0.2%
4 1
 
0.2%
3 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3175
85.7%
ASCII 527
 
14.2%
None 3
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
203
 
6.4%
108
 
3.4%
82
 
2.6%
82
 
2.6%
76
 
2.4%
65
 
2.0%
63
 
2.0%
61
 
1.9%
60
 
1.9%
58
 
1.8%
Other values (305) 2317
73.0%
ASCII
ValueCountFrequency (%)
( 172
32.6%
) 172
32.6%
126
23.9%
1 9
 
1.7%
K 4
 
0.8%
S 4
 
0.8%
H 4
 
0.8%
C 2
 
0.4%
6 2
 
0.4%
G 2
 
0.4%
Other values (20) 30
 
5.7%
None
ValueCountFrequency (%)
3
100.0%
Distinct132
Distinct (%)36.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
Minimum2021-02-19 00:00:00
Maximum2023-06-01 00:00:00
2023-12-13T08:14:03.015670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:14:03.189821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-06-09
361 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-09
2nd row2023-06-09
3rd row2023-06-09
4th row2023-06-09
5th row2023-06-09

Common Values

ValueCountFrequency (%)
2023-06-09 361
100.0%

Length

2023-12-13T08:14:03.345409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:14:03.451017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-09 361
100.0%

Missing values

2023-12-13T08:14:01.457873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:14:01.577080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명시설구분시설명점검일자데이터기준일자
0성남시폐수배출시설(주)대일교통2021-02-192023-06-09
1성남시폐수배출시설에스케이판교충전소2021-03-172023-06-09
2성남시폐수배출시설고등카 WASH2021-03-172023-06-09
3성남시폐수배출시설(주)새서울석유 동판교2021-03-192023-06-09
4성남시폐수배출시설(주)새서울석유 세종2021-03-192023-06-09
5성남시대기오염물질배출시설VIP자동차공업사2021-03-242023-06-09
6성남시대기오염물질배출시설동경공업사2021-03-242023-06-09
7성남시대기오염물질배출시설서현자동차서비스2021-03-242023-06-09
8성남시대기오염물질배출시설조영자동차공업사2021-03-242023-06-09
9성남시대기오염물질배출시설대림자동차정비서비스(분당판교자동차서비스)2021-03-242023-06-09
시군명시설구분시설명점검일자데이터기준일자
351성남시대기오염물질배출시설위본모터스(주)아우디센터분당2023-05-242023-06-09
352성남시대기오염물질배출시설분당현대서비스(주)2023-05-242023-06-09
353성남시대기오염물질배출시설분당서비스기아오토큐(주)2023-05-242023-06-09
354성남시대기오염물질배출시설중앙자동차공업사2023-05-302023-06-09
355성남시대기오염물질배출시설VIP자동차공업사2023-05-302023-06-09
356성남시대기오염물질배출시설복정현대자동차서비스2023-05-302023-06-09
357성남시대기오염물질배출시설대림자동차정비서비스(분당판교자동차서비스)2023-06-012023-06-09
358성남시대기오염물질배출시설조영자동차공업사2023-06-012023-06-09
359성남시대기오염물질배출시설남성공업사2023-06-012023-06-09
360성남시대기오염물질배출시설쌍용자동차성남정비사업소(주)2023-06-012023-06-09

Duplicate rows

Most frequently occurring

시군명시설구분시설명점검일자데이터기준일자# duplicates
0성남시대기오염물질배출시설성남도시개발공사2022-08-292023-06-092
1성남시폐수배출시설(주)경기고속2021-03-262023-06-092
2성남시폐수배출시설(주)엔케이맥스2021-07-232023-06-092