Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory644.5 KiB
Average record size in memory66.0 B

Variable types

DateTime1
Numeric2
Categorical4

Dataset

Description김해도시개발공사 안하 하수처리시설별에 대한 시간대별 계측 현황을 조회하는 서비스로 기준연월일, 기준시간, 하수처리장구분명, 계측구분명, 계측값 등의 정보를 제공
Author김해시도시개발공사
URLhttps://www.data.go.kr/data/15096560/fileData.do

Alerts

하수처리장구분명 has constant value ""Constant
계측단위 has constant value ""Constant
계측태그명 is highly overall correlated with 계측구분명High correlation
계측구분명 is highly overall correlated with 계측태그명High correlation
기준시간 has 411 (4.1%) zerosZeros
계측값 has 1557 (15.6%) zerosZeros

Reproduction

Analysis started2023-12-12 13:54:23.371153
Analysis finished2023-12-12 13:54:24.791735
Duration1.42 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct740
Distinct (%)7.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-01-01 00:00:00
Maximum2021-09-01 00:00:00
2023-12-12T22:54:24.868008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:54:25.012119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기준시간
Real number (ℝ)

ZEROS 

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.4806
Minimum0
Maximum23
Zeros411
Zeros (%)4.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T22:54:25.182145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q15
median12
Q317
95-th percentile22
Maximum23
Range23
Interquartile range (IQR)12

Descriptive statistics

Standard deviation6.9400029
Coefficient of variation (CV)0.60449827
Kurtosis-1.2132675
Mean11.4806
Median Absolute Deviation (MAD)6
Skewness-0.0055351445
Sum114806
Variance48.16364
MonotonicityNot monotonic
2023-12-12T22:54:25.324322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
13 458
 
4.6%
4 456
 
4.6%
14 447
 
4.5%
3 445
 
4.5%
1 444
 
4.4%
16 443
 
4.4%
17 432
 
4.3%
5 425
 
4.2%
23 422
 
4.2%
12 419
 
4.2%
Other values (14) 5609
56.1%
ValueCountFrequency (%)
0 411
4.1%
1 444
4.4%
2 396
4.0%
3 445
4.5%
4 456
4.6%
5 425
4.2%
6 395
4.0%
7 393
3.9%
8 399
4.0%
9 388
3.9%
ValueCountFrequency (%)
23 422
4.2%
22 409
4.1%
21 410
4.1%
20 411
4.1%
19 407
4.1%
18 406
4.1%
17 432
4.3%
16 443
4.4%
15 404
4.0%
14 447
4.5%

하수처리장구분명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
안하 하수처리장
10000 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안하 하수처리장
2nd row안하 하수처리장
3rd row안하 하수처리장
4th row안하 하수처리장
5th row안하 하수처리장

Common Values

ValueCountFrequency (%)
안하 하수처리장 10000
100.0%

Length

2023-12-12T22:54:25.477890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:54:25.615668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
안하 10000
50.0%
하수처리장 10000
50.0%

계측구분명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
처리수흡입펌프 토출유량계
3812 
유입유량
1877 
처리수방류유량
1823 
역세펌프 토출유량계
1816 
방류펌프 토출유량
672 

Length

Max length13
Median length10
Mean length9.4033
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row처리수방류유량
2nd row유입유량
3rd row처리수흡입펌프 토출유량계
4th row역세펌프 토출유량계
5th row처리수방류유량

Common Values

ValueCountFrequency (%)
처리수흡입펌프 토출유량계 3812
38.1%
유입유량 1877
18.8%
처리수방류유량 1823
18.2%
역세펌프 토출유량계 1816
18.2%
방류펌프 토출유량 672
 
6.7%

Length

2023-12-12T22:54:25.744089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:54:25.852001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
토출유량계 5628
34.5%
처리수흡입펌프 3812
23.4%
유입유량 1877
 
11.5%
처리수방류유량 1823
 
11.2%
역세펌프 1816
 
11.1%
방류펌프 672
 
4.1%
토출유량 672
 
4.1%

계측태그명
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
FIT-301B
1933 
FIT-301A
1879 
FIT-101
1877 
FIT-304
1823 
FIT-303
1816 

Length

Max length8
Median length7
Mean length7.3812
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFIT-304
2nd rowFIT-101
3rd rowFIT-301A
4th rowFIT-303
5th rowFIT-304

Common Values

ValueCountFrequency (%)
FIT-301B 1933
19.3%
FIT-301A 1879
18.8%
FIT-101 1877
18.8%
FIT-304 1823
18.2%
FIT-303 1816
18.2%
FIT-305 672
 
6.7%

Length

2023-12-12T22:54:25.980565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:54:26.087467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
fit-301b 1933
19.3%
fit-301a 1879
18.8%
fit-101 1877
18.8%
fit-304 1823
18.2%
fit-303 1816
18.2%
fit-305 672
 
6.7%

계측단위
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
㎥/hr
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row㎥/hr
2nd row㎥/hr
3rd row㎥/hr
4th row㎥/hr
5th row㎥/hr

Common Values

ValueCountFrequency (%)
㎥/hr 10000
100.0%

Length

2023-12-12T22:54:26.217257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:54:26.309028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
㎥/hr 10000
100.0%

계측값
Real number (ℝ)

ZEROS 

Distinct2337
Distinct (%)23.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.603379
Minimum0
Maximum58.62
Zeros1557
Zeros (%)15.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T22:54:26.400903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10.03
median17.22
Q323.89
95-th percentile32.8815
Maximum58.62
Range58.62
Interquartile range (IQR)23.86

Descriptive statistics

Standard deviation12.1348
Coefficient of variation (CV)0.83095835
Kurtosis-0.86479616
Mean14.603379
Median Absolute Deviation (MAD)9.35
Skewness0.15242112
Sum146033.79
Variance147.25336
MonotonicityNot monotonic
2023-12-12T22:54:26.532762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 1557
 
15.6%
0.03 488
 
4.9%
0.04 453
 
4.5%
0.01 337
 
3.4%
0.02 306
 
3.1%
0.05 189
 
1.9%
0.06 53
 
0.5%
23.98 28
 
0.3%
24.0 27
 
0.3%
23.93 20
 
0.2%
Other values (2327) 6542
65.4%
ValueCountFrequency (%)
0.0 1557
15.6%
0.01 337
 
3.4%
0.02 306
 
3.1%
0.03 488
 
4.9%
0.04 453
 
4.5%
0.05 189
 
1.9%
0.06 53
 
0.5%
0.07 7
 
0.1%
0.08 19
 
0.2%
0.11 3
 
< 0.1%
ValueCountFrequency (%)
58.62 1
< 0.1%
56.68 1
< 0.1%
56.61 1
< 0.1%
56.55 1
< 0.1%
52.67 1
< 0.1%
52.44 1
< 0.1%
51.2 1
< 0.1%
50.99 1
< 0.1%
50.84 1
< 0.1%
50.24 1
< 0.1%

Interactions

2023-12-12T22:54:23.972698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:54:23.792262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:54:24.090873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:54:23.873852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:54:26.616844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준시간계측구분명계측태그명계측값
기준시간1.0000.0000.0000.046
계측구분명0.0001.0001.0000.835
계측태그명0.0001.0001.0000.681
계측값0.0460.8350.6811.000
2023-12-12T22:54:26.698534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계측태그명계측구분명
계측태그명1.0001.000
계측구분명1.0001.000
2023-12-12T22:54:26.773717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준시간계측값계측구분명계측태그명
기준시간1.0000.0020.0000.000
계측값0.0021.0000.4960.444
계측구분명0.0000.4961.0001.000
계측태그명0.0000.4441.0001.000

Missing values

2023-12-12T22:54:24.549503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:54:24.727079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연월일기준시간하수처리장구분명계측구분명계측태그명계측단위계측값
750052018-06-156안하 하수처리장처리수방류유량FIT-304㎥/hr20.46
168402021-07-2517안하 하수처리장유입유량FIT-101㎥/hr28.07
333892021-06-056안하 하수처리장처리수흡입펌프 토출유량계FIT-301A㎥/hr0.0
645692019-04-1610안하 하수처리장역세펌프 토출유량계FIT-303㎥/hr0.04
876192021-07-1420안하 하수처리장처리수방류유량FIT-304㎥/hr36.16
793562018-12-1313안하 하수처리장처리수방류유량FIT-304㎥/hr18.33
224232018-07-148안하 하수처리장처리수흡입펌프 토출유량계FIT-301A㎥/hr0.01
644052019-04-0914안하 하수처리장역세펌프 토출유량계FIT-303㎥/hr0.03
421402018-10-0321안하 하수처리장처리수흡입펌프 토출유량계FIT-301B㎥/hr15.59
395542018-06-183안하 하수처리장처리수흡입펌프 토출유량계FIT-301B㎥/hr0.03
기준연월일기준시간하수처리장구분명계측구분명계측태그명계측단위계측값
930492018-06-272안하 하수처리장방류펌프 토출유량FIT-305㎥/hr0.0
536582018-01-1619안하 하수처리장역세펌프 토출유량계FIT-303㎥/hr0.01
184282018-01-2821안하 하수처리장처리수흡입펌프 토출유량계FIT-301A㎥/hr20.66
424872018-10-188안하 하수처리장처리수흡입펌프 토출유량계FIT-301B㎥/hr0.03
322032021-04-1620안하 하수처리장처리수흡입펌프 토출유량계FIT-301A㎥/hr15.37
165122021-07-121안하 하수처리장유입유량FIT-101㎥/hr34.03
159412021-06-186안하 하수처리장유입유량FIT-101㎥/hr33.97
254392018-11-160안하 하수처리장처리수흡입펌프 토출유량계FIT-301A㎥/hr16.99
436842018-12-075안하 하수처리장처리수흡입펌프 토출유량계FIT-301B㎥/hr0.02
332782021-05-3115안하 하수처리장처리수흡입펌프 토출유량계FIT-301A㎥/hr15.71