Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 566.4 KiB
Average record size in memory: 58.0 B
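
The average record size follows from the two figures above: total memory divided by the number of observations. A minimal sketch of that arithmetic, assuming the report uses 1 KiB = 1024 bytes (the variable names are illustrative, not taken from the report):

```python
KIB = 1024  # bytes per kibibyte (assumed unit convention)

total_kib = 566.4   # "Total size in memory" from the report
n_rows = 10_000     # "Number of observations" from the report

# Convert the total to bytes, then spread it over all rows.
avg_record_bytes = total_kib * KIB / n_rows
print(f"{avg_record_bytes:.1f} B")
```

Rounded to one decimal place this agrees with the 58.0 B shown in the report.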

Variable types

DateTime: 1
Numeric: 2
Categorical: 3

Dataset

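Description: A service for querying the hourly operating time of each sewage treatment facility in Saengnim (생림), run by the Gimhae Urban Development Corporation (김해도시개발공사). It provides the reference date (기준연월일), reference hour (기준시간), treatment plant name (하수처리장구분명), operating time (가동시간), and related fields.
Author: 김해시도시개발공사 (Gimhae City Urban Development Corporation)
URL: https://www.data.go.kr/data/15096557/fileData.do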

Alerts

하수처리장구분명 has constant value "신안,안양마을 하수처리장" (Constant)
태그설명 is highly overall correlated with 태그 (High correlation)
태그 is highly overall correlated with 태그설명 (High correlation)
기준시간 has 429 (4.3%) zeros (Zeros)
가동시간 has 5705 (57.0%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-11 23:35:28.351463
Analysis finished: 2023-12-11 23:35:29.495045
Duration: 1.14 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
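
The reported duration is the difference between the two timestamps above. A small sketch that recomputes it with the standard library (variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-11 23:35:28.351463")
finished = datetime.fromisoformat("2023-12-11 23:35:29.495045")

# Subtracting two datetimes yields a timedelta.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

To two decimal places this gives the 1.14 seconds listed under Duration.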

Variables

기준연월일
DateTime

Distinct: 1923
Distinct (%): 19.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2015-12-31 00:00:00
Maximum: 2021-07-24 00:00:00
Histogram with fixed size bins (bins=50)

기준시간
Real number (ℝ)

ZEROS 

Distinct: 24
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.5036
Minimum: 0
Maximum: 23
Zeros: 429
Zeros (%): 4.3%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 6
Median: 11
Q3: 18
95-th percentile: 22
Maximum: 23
Range: 23
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 6.9193912
Coefficient of variation (CV): 0.6014979
Kurtosis: -1.2049082
Mean: 11.5036
Median Absolute Deviation (MAD): 6
Skewness: 0.0036211244
Sum: 115036
Variance: 47.877975
Monotonicity: Not monotonic
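
Several of the 기준시간 statistics above are derived from one another, so they can be cross-checked by hand. The sketch below uses the standard definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, range = maximum - minimum); that the report uses exactly these definitions is an assumption, although the numbers agree.

```python
# Values copied from the 기준시간 statistics in the report.
n_rows = 10_000
total = 115_036          # Sum
std = 6.9193912          # Standard deviation
q1, q3 = 6, 18
minimum, maximum = 0, 23

mean = total / n_rows              # reported Mean: 11.5036
cv = std / mean                    # reported CV: 0.6014979
variance = std ** 2                # reported Variance: 47.877975
iqr = q3 - q1                      # reported IQR: 12
value_range = maximum - minimum    # reported Range: 23

print(mean, round(cv, 7), round(variance, 6), iqr, value_range)
```
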
Histogram with fixed size bins (bins=24)
Value              Count  Frequency (%)
20                 451    4.5%
9                  446    4.5%
4                  444    4.4%
0                  429    4.3%
11                 427    4.3%
19                 426    4.3%
14                 426    4.3%
6                  426    4.3%
15                 423    4.2%
22                 421    4.2%
Other values (14)  5681   56.8%
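
The "Other values" row in the frequency table above is a remainder: the observations and distinct values not covered by the ten listed entries. A short sketch of how that row can be reproduced from the listed counts (the dictionary name is illustrative):

```python
# Ten most frequent 기준시간 values and their counts, copied from the table.
top10 = {20: 451, 9: 446, 4: 444, 0: 429, 11: 427,
         19: 426, 14: 426, 6: 426, 15: 423, 22: 421}

n_rows = 10_000     # Number of observations
n_distinct = 24     # Distinct values of 기준시간

other_count = n_rows - sum(top10.values())   # rows outside the top ten
other_distinct = n_distinct - len(top10)     # distinct values outside the top ten
other_share = other_count / n_rows

print(f"Other values ({other_distinct})  {other_count}  {other_share:.1%}")
```
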
Value  Count  Frequency (%)
0      429    4.3%
1      392    3.9%
2      404    4.0%
3      412    4.1%
4      444    4.4%
5      408    4.1%
6      426    4.3%
7      419    4.2%
8      418    4.2%
9      446    4.5%
Value  Count  Frequency (%)
23     416    4.2%
22     421    4.2%
21     393    3.9%
20     451    4.5%
19     426    4.3%
18     416    4.2%
17     412    4.1%
16     382    3.8%
15     423    4.2%
14     426    4.3%

하수처리장구분명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
신안,안양마을 하수처리장: 10000

Length

Max length: 13
Median length: 13
Mean length: 13
Min length: 13

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 신안,안양마을 하수처리장
2nd row: 신안,안양마을 하수처리장
3rd row: 신안,안양마을 하수처리장
4th row: 신안,안양마을 하수처리장
5th row: 신안,안양마을 하수처리장

Common Values

Value                     Count  Frequency (%)
신안,안양마을 하수처리장  10000  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value          Count  Frequency (%)
신안,안양마을  10000  50.0%
하수처리장     10000  50.0%

태그설명
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
원수이송펌프A: 4907
원수이송펌프B: 4822
교반기: 271

Length

Max length: 7
Median length: 7
Mean length: 6.8916
Min length: 3
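
The length statistics above follow directly from the three category counts. As a sketch, assuming length is measured in characters (which Python's `len` does for Unicode strings, and which matches the reported minimum of 3 and maximum of 7):

```python
import statistics

# Category counts for 태그설명, copied from the report.
counts = {"원수이송펌프A": 4907, "원수이송펌프B": 4822, "교반기": 271}

total = sum(counts.values())

# Count-weighted mean of the string lengths.
mean_length = sum(len(value) * n for value, n in counts.items()) / total

# Expand to one length per row to obtain the median.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]
median_length = statistics.median(lengths)

print(mean_length, median_length, min(lengths), max(lengths))
```

The 271 short 교반기 rows pull the mean slightly below 7, while the median stays at 7.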

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 원수이송펌프A
2nd row: 원수이송펌프A
3rd row: 원수이송펌프B
4th row: 원수이송펌프B
5th row: 원수이송펌프B

Common Values

Value          Count  Frequency (%)
원수이송펌프A  4907   49.1%
원수이송펌프B  4822   48.2%
교반기         271    2.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value          Count  Frequency (%)
원수이송펌프a  4907   49.1%
원수이송펌프b  4822   48.2%
교반기         271    2.7%

태그
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
원수이송펌프A: 4907
원수이송펌프B: 4822
교반기: 271

Length

Max length: 7
Median length: 7
Mean length: 6.8916
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 원수이송펌프A
2nd row: 원수이송펌프A
3rd row: 원수이송펌프B
4th row: 원수이송펌프B
5th row: 원수이송펌프B

Common Values

Value          Count  Frequency (%)
원수이송펌프A  4907   49.1%
원수이송펌프B  4822   48.2%
교반기         271    2.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value          Count  Frequency (%)
원수이송펌프a  4907   49.1%
원수이송펌프b  4822   48.2%
교반기         271    2.7%

가동시간
Real number (ℝ)

ZEROS 

Distinct: 61
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 24.087
Minimum: 0
Maximum: 60
Zeros: 5705
Zeros (%): 57.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 60
95-th percentile: 60
Maximum: 60
Range: 60
Interquartile range (IQR): 60

Descriptive statistics

Standard deviation: 28.82582
Coefficient of variation (CV): 1.1967377
Kurtosis: -1.8050154
Mean: 24.087
Median Absolute Deviation (MAD): 0
Skewness: 0.40154615
Sum: 240870
Variance: 830.92792
Monotonicity: Not monotonic
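
The 가동시간 quantiles look unusual (median 0, MAD 0, Q3 60) but follow from the distribution being concentrated at the two extremes: 5705 rows are 0 and 3735 rows are 60. The sketch below rebuilds a stand-in column to illustrate this. The exact values of the remaining 560 rows are not reproduced here, so they are replaced by a hypothetical placeholder value of 30; any value strictly between 0 and 60 gives the same quantiles. `statistics.quantiles` may also interpolate differently from the profiler, which does not matter here because the neighbouring values are equal.

```python
import statistics

n_rows, zeros, sixties = 10_000, 5_705, 3_735   # counts from the report
others = n_rows - zeros - sixties               # rows strictly between 0 and 60

STAND_IN = 30  # hypothetical placeholder for the unlisted in-between values
values = [0] * zeros + [STAND_IN] * others + [60] * sixties

median = statistics.median(values)
# MAD: median of the absolute deviations from the median.
mad = statistics.median(abs(v - median) for v in values)
q1, _, q3 = statistics.quantiles(values, n=4)

print(others, median, mad, q1, q3)
```

Because more than half of the rows are zero, both the median and the MAD collapse to 0, and because fewer than three quarters of the rows lie below 60, Q3 lands on 60.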
Histogram with fixed size bins (bins=50)
Value              Count  Frequency (%)
0                  5705   57.0%
60                 3735   37.4%
30                 21     0.2%
8                  19     0.2%
48                 16     0.2%
21                 15     0.1%
37                 14     0.1%
10                 14     0.1%
43                 14     0.1%
7                  13     0.1%
Other values (51)  434    4.3%
Value  Count  Frequency (%)
0      5705   57.0%
1      11     0.1%
2      11     0.1%
3      8      0.1%
4      13     0.1%
5      7      0.1%
6      5      0.1%
7      13     0.1%
8      19     0.2%
9      5      0.1%
Value  Count  Frequency (%)
60     3735   37.4%
59     9      0.1%
58     12     0.1%
57     7      0.1%
56     9      0.1%
55     6      0.1%
54     11     0.1%
53     11     0.1%
52     7      0.1%
51     7      0.1%

Interactions


Correlations

          기준시간  태그설명  태그   가동시간
기준시간  1.000     0.000     0.000  0.112
태그설명  0.000     1.000     1.000  0.300
태그      0.000     1.000     1.000  0.300
가동시간  0.112     0.300     0.300  1.000

          태그설명  태그
태그설명  1.000     1.000
태그      1.000     1.000

          기준시간  가동시간  태그설명  태그
기준시간  1.000     0.049     0.000     0.000
가동시간  0.049     1.000     0.188     0.188
태그설명  0.000     0.188     1.000     1.000
태그      0.000     0.188     1.000     1.000
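
The 1.000 between 태그설명 and 태그 is what a categorical association measure returns when one column fully determines the other. The report does not say which measure produced each matrix, so the sketch below uses plain (uncorrected) Cramér's V purely as an illustration. It also assumes that the two columns hold the same value on every row, which is consistent with their identical counts and the sample rows but is not stated outright; `cramers_v` is a helper written for this example.

```python
import math

def cramers_v(table):
    """Plain Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals))
    return math.sqrt(chi2 / (n * (k - 1)))

# Assumed contingency table: 태그설명 (rows) vs 태그 (columns), all mass on the diagonal.
table = [
    [4907, 0, 0],   # 원수이송펌프A
    [0, 4822, 0],   # 원수이송펌프B
    [0, 0, 271],    # 교반기
]
v = cramers_v(table)
print(round(v, 3))
```

With all counts on the diagonal the statistic reaches its maximum of 1, which matches the alert that the two columns are highly correlated and suggests that one of them is redundant.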

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index)  기준연월일  기준시간  하수처리장구분명          태그설명       태그           가동시간
14916    2017-12-01  12        신안,안양마을 하수처리장  원수이송펌프A  원수이송펌프A  0
16547    2018-02-07  11        신안,안양마을 하수처리장  원수이송펌프A  원수이송펌프A  0
55332    2017-03-26  12        신안,안양마을 하수처리장  원수이송펌프B  원수이송펌프B  60
82355    2020-05-02  11        신안,안양마을 하수처리장  원수이송펌프B  원수이송펌프B  0
65896    2018-06-11  16        신안,안양마을 하수처리장  원수이송펌프B  원수이송펌프B  0
75693    2019-07-28  21        신안,안양마을 하수처리장  원수이송펌프B  원수이송펌프B  0
48162    2016-03-14  18        신안,안양마을 하수처리장  원수이송펌프B  원수이송펌프B  0
78041    2019-11-04  17        신안,안양마을 하수처리장  원수이송펌프B  원수이송펌프B  0
31632    2019-11-03  0         신안,안양마을 하수처리장  원수이송펌프A  원수이송펌프A  60
33062    2020-01-01  14        신안,안양마을 하수처리장  원수이송펌프A  원수이송펌프A  0

(index)  기준연월일  기준시간  하수처리장구분명          태그설명       태그           가동시간
54439    2017-02-17  7         신안,안양마을 하수처리장  원수이송펌프B  원수이송펌프B  60
1270     2016-02-21  22        신안,안양마을 하수처리장  원수이송펌프A  원수이송펌프A  1
74582    2019-06-11  14        신안,안양마을 하수처리장  원수이송펌프B  원수이송펌프B  0
24828    2019-01-20  12        신안,안양마을 하수처리장  원수이송펌프A  원수이송펌프A  60
46418    2016-01-02  2         신안,안양마을 하수처리장  원수이송펌프B  원수이송펌프B  0
64414    2018-04-10  22        신안,안양마을 하수처리장  원수이송펌프B  원수이송펌프B  0
16247    2018-01-25  23        신안,안양마을 하수처리장  원수이송펌프A  원수이송펌프A  0
28812    2019-07-07  12        신안,안양마을 하수처리장  원수이송펌프A  원수이송펌프A  0
29970    2019-08-24  18        신안,안양마을 하수처리장  원수이송펌프A  원수이송펌프A  60
87193    2020-12-05  1         신안,안양마을 하수처리장  원수이송펌프B  원수이송펌프B  60