Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory468.8 KiB
Average record size in memory48.0 B

Variable types

DateTime1
Categorical1
Text3

Dataset

Description김해도시개발공사 하수처리시설별에 대한 월별 가동시간 현황을 조회하는 서비스로 기준연월,하수처리장구분명, 가동시간 등의 정보를 제공
Author김해시도시개발공사
URLhttps://www.data.go.kr/data/15096568/fileData.do

Reproduction

Analysis started2023-12-12 08:07:33.536400
Analysis finished2023-12-12 08:07:34.229457
Duration0.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct81
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2011-03-01 00:00:00
Maximum2021-09-01 00:00:00
2023-12-12T17:07:34.594832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:07:34.778447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
진영맑은물사업소
2090 
진영맑은물사업소 HANT반응조
1742 
안하 하수처리장
1218 
상동 공공하수처리시설
1195 
대동 공공하수처리시설
454 
Other values (19)
3301 

Length

Max length16
Median length15
Mean length10.8699
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대동 공공하수처리시설
2nd row진영맑은물사업소
3rd row상동 공공하수처리시설
4th row상동여차 마을하수처리장
5th row진영맑은물사업소 HANT반응조

Common Values

ValueCountFrequency (%)
진영맑은물사업소 2090
20.9%
진영맑은물사업소 HANT반응조 1742
17.4%
안하 하수처리장 1218
12.2%
상동 공공하수처리시설 1195
11.9%
대동 공공하수처리시설 454
 
4.5%
하사촌마을 하수처리장 410
 
4.1%
낙산마을 하수처리장 351
 
3.5%
생철마을 하수처리장 331
 
3.3%
진례 하수처리장 235
 
2.4%
진영 하수처리장(산본마을) 222
 
2.2%
Other values (14) 1752
17.5%

Length

2023-12-12T17:07:34.973664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
하수처리장 3838
21.4%
진영맑은물사업소 3832
21.4%
hant반응조 1742
9.7%
공공하수처리시설 1672
9.3%
안하 1218
 
6.8%
상동 1195
 
6.7%
대동 454
 
2.5%
하사촌마을 410
 
2.3%
낙산마을 351
 
2.0%
생철마을 331
 
1.8%
Other values (19) 2867
16.0%
Distinct319
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:07:35.325533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length6.8708
Min length2

Characters and Unicode

Total characters68708
Distinct characters209
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row약품펌프B
2nd row잡배수펌프
3rd row부상슬러지제거장치
4th row백학2.중계펌프A
5th row잉여슬러지펌프
ValueCountFrequency (%)
교반기 1217
 
8.2%
펌프 383
 
2.6%
슬러지 380
 
2.6%
공급펌프 372
 
2.5%
반송펌프 322
 
2.2%
송풍기 314
 
2.1%
무산소조 262
 
1.8%
유량조정조 261
 
1.8%
흡입펌프 234
 
1.6%
바닥배수펌프 231
 
1.6%
Other values (296) 10886
73.2%
2023-12-12T17:07:35.809446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5126
 
7.5%
5093
 
7.4%
4862
 
7.1%
3502
 
5.1%
2900
 
4.2%
2068
 
3.0%
1847
 
2.7%
1839
 
2.7%
1647
 
2.4%
A 1478
 
2.2%
Other values (199) 38346
55.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56372
82.0%
Uppercase Letter 6299
 
9.2%
Space Separator 4862
 
7.1%
Lowercase Letter 495
 
0.7%
Other Punctuation 226
 
0.3%
Decimal Number 190
 
0.3%
Close Punctuation 119
 
0.2%
Open Punctuation 119
 
0.2%
Dash Punctuation 26
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5126
 
9.1%
5093
 
9.0%
3502
 
6.2%
2900
 
5.1%
2068
 
3.7%
1847
 
3.3%
1839
 
3.3%
1647
 
2.9%
1212
 
2.2%
1174
 
2.1%
Other values (164) 29964
53.2%
Uppercase Letter
ValueCountFrequency (%)
A 1478
23.5%
B 1190
18.9%
O 506
 
8.0%
N 434
 
6.9%
C 393
 
6.2%
L 325
 
5.2%
P 321
 
5.1%
U 279
 
4.4%
M 266
 
4.2%
R 211
 
3.3%
Other values (13) 896
14.2%
Lowercase Letter
ValueCountFrequency (%)
a 296
59.8%
l 83
 
16.8%
u 58
 
11.7%
m 58
 
11.7%
Other Punctuation
ValueCountFrequency (%)
/ 159
70.4%
. 67
29.6%
Decimal Number
ValueCountFrequency (%)
2 114
60.0%
1 76
40.0%
Space Separator
ValueCountFrequency (%)
4862
100.0%
Close Punctuation
ValueCountFrequency (%)
) 119
100.0%
Open Punctuation
ValueCountFrequency (%)
( 119
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56372
82.0%
Latin 6794
 
9.9%
Common 5542
 
8.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5126
 
9.1%
5093
 
9.0%
3502
 
6.2%
2900
 
5.1%
2068
 
3.7%
1847
 
3.3%
1839
 
3.3%
1647
 
2.9%
1212
 
2.2%
1174
 
2.1%
Other values (164) 29964
53.2%
Latin
ValueCountFrequency (%)
A 1478
21.8%
B 1190
17.5%
O 506
 
7.4%
N 434
 
6.4%
C 393
 
5.8%
L 325
 
4.8%
P 321
 
4.7%
a 296
 
4.4%
U 279
 
4.1%
M 266
 
3.9%
Other values (17) 1306
19.2%
Common
ValueCountFrequency (%)
4862
87.7%
/ 159
 
2.9%
) 119
 
2.1%
( 119
 
2.1%
2 114
 
2.1%
1 76
 
1.4%
. 67
 
1.2%
- 26
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56372
82.0%
ASCII 12336
 
18.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5126
 
9.1%
5093
 
9.0%
3502
 
6.2%
2900
 
5.1%
2068
 
3.7%
1847
 
3.3%
1839
 
3.3%
1647
 
2.9%
1212
 
2.2%
1174
 
2.1%
Other values (164) 29964
53.2%
ASCII
ValueCountFrequency (%)
4862
39.4%
A 1478
 
12.0%
B 1190
 
9.6%
O 506
 
4.1%
N 434
 
3.5%
C 393
 
3.2%
L 325
 
2.6%
P 321
 
2.6%
a 296
 
2.4%
U 279
 
2.3%
Other values (25) 2252
18.3%

태그
Text

Distinct608
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:07:36.326943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length6.4597
Min length2

Characters and Unicode

Total characters64597
Distinct characters136
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.1%

Sample

1st rowMOP-601
2nd rowM-7-11C
3rd rowM-101-1
4th row백학2.중계펌프A
5th row2M-4-18G
ValueCountFrequency (%)
펌프b 178
 
1.6%
펌프a 178
 
1.6%
유량조정조 138
 
1.3%
교반기 133
 
1.2%
슬러지 125
 
1.2%
저류조 107
 
1.0%
부로와b 98
 
0.9%
부로와a 96
 
0.9%
내부순환 94
 
0.9%
송풍기a 94
 
0.9%
Other values (597) 9601
88.6%
2023-12-12T17:07:36.970275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 11120
17.2%
M 7191
 
11.1%
2 6136
 
9.5%
1 3594
 
5.6%
B 3458
 
5.4%
A 3336
 
5.2%
0 3317
 
5.1%
4 2942
 
4.6%
3 2177
 
3.4%
1395
 
2.2%
Other values (126) 19931
30.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 21762
33.7%
Uppercase Letter 18269
28.3%
Other Letter 12493
19.3%
Dash Punctuation 11120
17.2%
Space Separator 842
 
1.3%
Other Punctuation 67
 
0.1%
Open Punctuation 22
 
< 0.1%
Close Punctuation 22
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1395
 
11.2%
1395
 
11.2%
677
 
5.4%
615
 
4.9%
489
 
3.9%
390
 
3.1%
387
 
3.1%
317
 
2.5%
293
 
2.3%
286
 
2.3%
Other values (88) 6249
50.0%
Uppercase Letter
ValueCountFrequency (%)
M 7191
39.4%
B 3458
18.9%
A 3336
18.3%
C 970
 
5.3%
D 846
 
4.6%
S 671
 
3.7%
P 348
 
1.9%
H 276
 
1.5%
F 248
 
1.4%
E 193
 
1.1%
Other values (13) 732
 
4.0%
Decimal Number
ValueCountFrequency (%)
2 6136
28.2%
1 3594
16.5%
0 3317
15.2%
4 2942
13.5%
3 2177
 
10.0%
6 853
 
3.9%
7 822
 
3.8%
5 738
 
3.4%
8 625
 
2.9%
9 558
 
2.6%
Dash Punctuation
ValueCountFrequency (%)
- 11120
100.0%
Space Separator
ValueCountFrequency (%)
842
100.0%
Other Punctuation
ValueCountFrequency (%)
. 67
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 33835
52.4%
Latin 18269
28.3%
Hangul 12493
 
19.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1395
 
11.2%
1395
 
11.2%
677
 
5.4%
615
 
4.9%
489
 
3.9%
390
 
3.1%
387
 
3.1%
317
 
2.5%
293
 
2.3%
286
 
2.3%
Other values (88) 6249
50.0%
Latin
ValueCountFrequency (%)
M 7191
39.4%
B 3458
18.9%
A 3336
18.3%
C 970
 
5.3%
D 846
 
4.6%
S 671
 
3.7%
P 348
 
1.9%
H 276
 
1.5%
F 248
 
1.4%
E 193
 
1.1%
Other values (13) 732
 
4.0%
Common
ValueCountFrequency (%)
- 11120
32.9%
2 6136
18.1%
1 3594
 
10.6%
0 3317
 
9.8%
4 2942
 
8.7%
3 2177
 
6.4%
6 853
 
2.5%
842
 
2.5%
7 822
 
2.4%
5 738
 
2.2%
Other values (5) 1294
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 52104
80.7%
Hangul 12493
 
19.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 11120
21.3%
M 7191
13.8%
2 6136
11.8%
1 3594
 
6.9%
B 3458
 
6.6%
A 3336
 
6.4%
0 3317
 
6.4%
4 2942
 
5.6%
3 2177
 
4.2%
C 970
 
1.9%
Other values (28) 7863
15.1%
Hangul
ValueCountFrequency (%)
1395
 
11.2%
1395
 
11.2%
677
 
5.4%
615
 
4.9%
489
 
3.9%
390
 
3.1%
387
 
3.1%
317
 
2.5%
293
 
2.3%
286
 
2.3%
Other values (88) 6249
50.0%
Distinct4470
Distinct (%)44.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:07:37.442564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length3.2196
Min length1

Characters and Unicode

Total characters32196
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3726 ?
Unique (%)37.3%

Sample

1st row0
2nd row0
3rd row18675
4th row1733
5th row1349
ValueCountFrequency (%)
0 3124
31.2%
44640 446
 
4.5%
43200 186
 
1.9%
40320 73
 
0.7%
1 59
 
0.6%
41760 50
 
0.5%
2 42
 
0.4%
3 36
 
0.4%
4 19
 
0.2%
39403 19
 
0.2%
Other values (4460) 5946
59.5%
2023-12-12T17:07:38.076508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6222
19.3%
4 4878
15.2%
1 4056
12.6%
2 3192
9.9%
3 3184
9.9%
6 2651
8.2%
5 2053
 
6.4%
7 2039
 
6.3%
9 2032
 
6.3%
8 1879
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 32186
> 99.9%
Dash Punctuation 10
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6222
19.3%
4 4878
15.2%
1 4056
12.6%
2 3192
9.9%
3 3184
9.9%
6 2651
8.2%
5 2053
 
6.4%
7 2039
 
6.3%
9 2032
 
6.3%
8 1879
 
5.8%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 32196
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6222
19.3%
4 4878
15.2%
1 4056
12.6%
2 3192
9.9%
3 3184
9.9%
6 2651
8.2%
5 2053
 
6.4%
7 2039
 
6.3%
9 2032
 
6.3%
8 1879
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 32196
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6222
19.3%
4 4878
15.2%
1 4056
12.6%
2 3192
9.9%
3 3184
9.9%
6 2651
8.2%
5 2053
 
6.4%
7 2039
 
6.3%
9 2032
 
6.3%
8 1879
 
5.8%

Correlations

2023-12-12T17:07:38.226214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연월하수처리장구분명
기준연월1.0000.736
하수처리장구분명0.7361.000

Missing values

2023-12-12T17:07:34.083896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:07:34.183625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연월하수처리장구분명태그설명태그가동시간
217502020-05대동 공공하수처리시설약품펌프BMOP-6010
182442019-10진영맑은물사업소잡배수펌프M-7-11C0
258732020-12상동 공공하수처리시설부상슬러지제거장치M-101-118675
213182020-04상동여차 마을하수처리장백학2.중계펌프A백학2.중계펌프A1733
122502018-11진영맑은물사업소 HANT반응조잉여슬러지펌프2M-4-18G1349
115832018-10진영맑은물사업소무산소용 교반기M-4-2J44640
160912019-06진영맑은물사업소혼합조 교반기M-4-143200
71852018-01진영맑은물사업소탈취팬(신설)NM-8-1A10625
267732021-01진영맑은물사업소무산소용 교반기M-4-2K44640
287912021-05(증설)안하 하수처리장폴리머 투입펌프2M-508B0
기준연월하수처리장구분명태그설명태그가동시간
194202019-12진영맑은물사업소 HANT반응조미세목 스크린2M-3-6D5131
6482015-11안하 하수처리장방류펌프M-236A0
209382020-03진영맑은물사업소사여과기 역세펌프M-6-3B0
43472017-08낙산마을 하수처리장부로와A부로와A41674
256662020-11진영맑은물사업소 HANT반응조잉여슬러지펌프2M-4-18F1506
249302020-10진영맑은물사업소무산소용 교반기M-4-2H44595
200442020-01하사촌마을 하수처리장슬러지 이송펌프B슬러지 이송펌프B0
3102015-06안하 하수처리장교대반응조 교반기M-204B43119
217222020-05대동 공공하수처리시설PAC저장탱크교반기MD-3140
100722018-07진영맑은물사업소급기/배기팬F-60