Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

DateTime1
Numeric2
Categorical3

Dataset

Description김해도시개발공사 안하 하수처리시설별에 대한 시간대별 가동시간 현황을 조회하는 서비스로 기준연월일, 기준시간, 하수처리장구분명, 가동시간 등의 정보를 제공
Author김해시도시개발공사
URLhttps://www.data.go.kr/data/15096559/fileData.do

Alerts

하수처리장구분명 has constant value ""Constant
태그설명 is highly overall correlated with 태그High correlation
태그 is highly overall correlated with 태그설명High correlation
기준시간 has 440 (4.4%) zerosZeros
가동시간 has 9230 (92.3%) zerosZeros

Reproduction

Analysis started2023-12-12 05:56:30.767619
Analysis finished2023-12-12 05:56:31.651690
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1404
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2015-11-11 00:00:00
Maximum2021-09-01 00:00:00
2023-12-12T14:56:31.714358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:56:31.878073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기준시간
Real number (ℝ)

ZEROS 

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.4996
Minimum0
Maximum23
Zeros440
Zeros (%)4.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T14:56:32.008064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q16
median11
Q317
95-th percentile22
Maximum23
Range23
Interquartile range (IQR)11

Descriptive statistics

Standard deviation6.8962566
Coefficient of variation (CV)0.59969535
Kurtosis-1.1871438
Mean11.4996
Median Absolute Deviation (MAD)6
Skewness0.0063406983
Sum114996
Variance47.558356
MonotonicityNot monotonic
2023-12-12T14:56:32.118823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
6 450
 
4.5%
8 441
 
4.4%
0 440
 
4.4%
11 438
 
4.4%
20 437
 
4.4%
9 434
 
4.3%
22 428
 
4.3%
13 427
 
4.3%
12 426
 
4.3%
10 423
 
4.2%
Other values (14) 5656
56.6%
ValueCountFrequency (%)
0 440
4.4%
1 389
3.9%
2 401
4.0%
3 394
3.9%
4 399
4.0%
5 417
4.2%
6 450
4.5%
7 409
4.1%
8 441
4.4%
9 434
4.3%
ValueCountFrequency (%)
23 407
4.1%
22 428
4.3%
21 411
4.1%
20 437
4.4%
19 404
4.0%
18 409
4.1%
17 417
4.2%
16 386
3.9%
15 401
4.0%
14 412
4.1%

하수처리장구분명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
가달 하수처리장
10000 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가달 하수처리장
2nd row가달 하수처리장
3rd row가달 하수처리장
4th row가달 하수처리장
5th row가달 하수처리장

Common Values

ValueCountFrequency (%)
가달 하수처리장 10000
100.0%

Length

2023-12-12T14:56:32.237576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:56:32.340695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가달 10000
50.0%
하수처리장 10000
50.0%

태그설명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정량펌프
3552 
교반기상
3481 
교반기하
2967 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정량펌프
2nd row정량펌프
3rd row정량펌프
4th row교반기상
5th row교반기하

Common Values

ValueCountFrequency (%)
정량펌프 3552
35.5%
교반기상 3481
34.8%
교반기하 2967
29.7%

Length

2023-12-12T14:56:32.433502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:56:32.529716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정량펌프 3552
35.5%
교반기상 3481
34.8%
교반기하 2967
29.7%

태그
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정량펌프
3552 
교반기상
3481 
교반기하
2967 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정량펌프
2nd row정량펌프
3rd row정량펌프
4th row교반기상
5th row교반기하

Common Values

ValueCountFrequency (%)
정량펌프 3552
35.5%
교반기상 3481
34.8%
교반기하 2967
29.7%

Length

2023-12-12T14:56:32.636737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:56:32.723571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정량펌프 3552
35.5%
교반기상 3481
34.8%
교반기하 2967
29.7%

가동시간
Real number (ℝ)

ZEROS 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.606
Minimum0
Maximum60
Zeros9230
Zeros (%)92.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T14:56:32.803751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile60
Maximum60
Range60
Interquartile range (IQR)0

Descriptive statistics

Standard deviation15.967744
Coefficient of variation (CV)3.466727
Kurtosis8.117707
Mean4.606
Median Absolute Deviation (MAD)0
Skewness3.1802274
Sum46060
Variance254.96886
MonotonicityNot monotonic
2023-12-12T14:56:33.141294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 9230
92.3%
60 765
 
7.6%
22 2
 
< 0.1%
1 1
 
< 0.1%
59 1
 
< 0.1%
56 1
 
< 0.1%
ValueCountFrequency (%)
0 9230
92.3%
1 1
 
< 0.1%
22 2
 
< 0.1%
56 1
 
< 0.1%
59 1
 
< 0.1%
60 765
 
7.6%
ValueCountFrequency (%)
60 765
 
7.6%
59 1
 
< 0.1%
56 1
 
< 0.1%
22 2
 
< 0.1%
1 1
 
< 0.1%
0 9230
92.3%

Interactions

2023-12-12T14:56:31.251160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:56:31.096929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:56:31.359767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:56:31.172736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:56:33.218111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준시간태그설명태그가동시간
기준시간1.0000.0220.0220.037
태그설명0.0221.0001.0000.100
태그0.0221.0001.0000.100
가동시간0.0370.1000.1001.000
2023-12-12T14:56:33.304095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
태그설명태그
태그설명1.0001.000
태그1.0001.000
2023-12-12T14:56:33.381463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준시간가동시간태그설명태그
기준시간1.000-0.0070.0130.013
가동시간-0.0071.0000.0300.030
태그설명0.0130.0301.0001.000
태그0.0130.0301.0001.000

Missing values

2023-12-12T14:56:31.477921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:56:31.595156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연월일기준시간하수처리장구분명태그설명태그가동시간
28302016-03-0822가달 하수처리장정량펌프정량펌프0
224092018-09-2417가달 하수처리장정량펌프정량펌프0
333662021-08-156가달 하수처리장정량펌프정량펌프0
547712018-07-273가달 하수처리장교반기상교반기상0
752222017-01-166가달 하수처리장교반기하교반기하0
489152017-11-253가달 하수처리장교반기상교반기상0
332015-11-129가달 하수처리장정량펌프정량펌프60
893592018-08-297가달 하수처리장교반기하교반기하0
553372018-08-1917가달 하수처리장교반기상교반기상0
542242018-07-048가달 하수처리장교반기상교반기상0
기준연월일기준시간하수처리장구분명태그설명태그가동시간
657512021-06-1715가달 하수처리장교반기상교반기상0
125752017-08-1023가달 하수처리장정량펌프정량펌프0
834422017-12-2518가달 하수처리장교반기하교반기하0
476602017-10-0320가달 하수처리장교반기상교반기상0
552852018-08-1713가달 하수처리장교반기상교반기상0
409152016-12-2519가달 하수처리장교반기상교반기상0
902222018-10-046가달 하수처리장교반기하교반기하0
480132017-10-1813가달 하수처리장교반기상교반기상0
596622019-02-1522가달 하수처리장교반기상교반기상0
779692017-05-1117가달 하수처리장교반기하교반기하0