Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

DateTime1
Numeric2
Categorical3

Dataset

Description김해도시개발공사 한림 하수처리시설별에 대한 시간대별 가동시간 현황을 조회하는 서비스로 기준연월일, 기준시간, 하수처리장구분명, 가동시간 등의 정보를 제공
Author김해시도시개발공사
URLhttps://www.data.go.kr/data/15096566/fileData.do

Alerts

하수처리장구분명 has constant value ""Constant
태그설명 has constant value ""Constant
기준시간 has 398 (4.0%) zerosZeros
가동시간 has 183 (1.8%) zerosZeros

Reproduction

Analysis started2023-12-12 12:29:27.346203
Analysis finished2023-12-12 12:29:28.474192
Duration1.13 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct777
Distinct (%)7.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2019-07-03 00:00:00
Maximum2021-09-01 00:00:00
2023-12-12T21:29:28.577262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:29:28.762271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기준시간
Real number (ℝ)

ZEROS 

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.5951
Minimum0
Maximum23
Zeros398
Zeros (%)4.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:29:28.916582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q16
median12
Q318
95-th percentile22
Maximum23
Range23
Interquartile range (IQR)12

Descriptive statistics

Standard deviation6.9197069
Coefficient of variation (CV)0.59677855
Kurtosis-1.2119645
Mean11.5951
Median Absolute Deviation (MAD)6
Skewness-0.024602822
Sum115951
Variance47.882344
MonotonicityNot monotonic
2023-12-12T21:29:29.081491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
16 460
 
4.6%
17 453
 
4.5%
15 443
 
4.4%
6 434
 
4.3%
21 434
 
4.3%
1 431
 
4.3%
4 430
 
4.3%
3 428
 
4.3%
22 427
 
4.3%
11 422
 
4.2%
Other values (14) 5638
56.4%
ValueCountFrequency (%)
0 398
4.0%
1 431
4.3%
2 384
3.8%
3 428
4.3%
4 430
4.3%
5 381
3.8%
6 434
4.3%
7 405
4.0%
8 415
4.2%
9 422
4.2%
ValueCountFrequency (%)
23 405
4.0%
22 427
4.3%
21 434
4.3%
20 421
4.2%
19 409
4.1%
18 419
4.2%
17 453
4.5%
16 460
4.6%
15 443
4.4%
14 386
3.9%

하수처리장구분명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
(증설)한림 하수처리장
10000 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row(증설)한림 하수처리장
2nd row(증설)한림 하수처리장
3rd row(증설)한림 하수처리장
4th row(증설)한림 하수처리장
5th row(증설)한림 하수처리장

Common Values

ValueCountFrequency (%)
(증설)한림 하수처리장 10000
100.0%

Length

2023-12-12T21:29:29.218694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:29:29.334227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
증설)한림 10000
50.0%
하수처리장 10000
50.0%

태그설명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
잉여/반송펌프
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row잉여/반송펌프
2nd row잉여/반송펌프
3rd row잉여/반송펌프
4th row잉여/반송펌프
5th row잉여/반송펌프

Common Values

ValueCountFrequency (%)
잉여/반송펌프 10000
100.0%

Length

2023-12-12T21:29:29.454777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:29:29.573934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
잉여/반송펌프 10000
100.0%

태그
Categorical

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
M309A
2000 
M309D
1972 
M309B
1961 
M309C
1942 
M309E
1911 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowM309C
2nd rowM309E
3rd rowM309B
4th rowM309C
5th rowM309D

Common Values

ValueCountFrequency (%)
M309A 2000
20.0%
M309D 1972
19.7%
M309B 1961
19.6%
M309C 1942
19.4%
M309E 1911
19.1%
M309F 214
 
2.1%

Length

2023-12-12T21:29:29.683820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:29:29.839261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m309a 2000
20.0%
m309d 1972
19.7%
m309b 1961
19.6%
m309c 1942
19.4%
m309e 1911
19.1%
m309f 214
 
2.1%

가동시간
Real number (ℝ)

ZEROS 

Distinct35
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.8659
Minimum0
Maximum60
Zeros183
Zeros (%)1.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:29:29.986124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile8
Q111
median15
Q318
95-th percentile21
Maximum60
Range60
Interquartile range (IQR)7

Descriptive statistics

Standard deviation4.8761763
Coefficient of variation (CV)0.32801083
Kurtosis2.1259324
Mean14.8659
Median Absolute Deviation (MAD)3
Skewness0.026972426
Sum148659
Variance23.777095
MonotonicityNot monotonic
2023-12-12T21:29:30.173620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
20 1418
14.2%
18 1379
13.8%
10 982
9.8%
16 867
8.7%
14 762
7.6%
9 649
 
6.5%
12 617
 
6.2%
13 533
 
5.3%
11 530
 
5.3%
15 466
 
4.7%
Other values (25) 1797
18.0%
ValueCountFrequency (%)
0 183
 
1.8%
3 1
 
< 0.1%
5 1
 
< 0.1%
6 2
 
< 0.1%
7 161
 
1.6%
8 227
 
2.3%
9 649
6.5%
10 982
9.8%
11 530
5.3%
12 617
6.2%
ValueCountFrequency (%)
60 1
 
< 0.1%
54 1
 
< 0.1%
39 1
 
< 0.1%
36 3
 
< 0.1%
35 2
 
< 0.1%
33 33
0.3%
32 7
 
0.1%
31 8
 
0.1%
30 28
0.3%
29 1
 
< 0.1%

Interactions

2023-12-12T21:29:27.949965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:29:27.707902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:29:28.063910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:29:27.827797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:29:30.285127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준시간태그가동시간
기준시간1.0000.0000.017
태그0.0001.0000.184
가동시간0.0170.1841.000
2023-12-12T21:29:30.413128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준시간가동시간태그
기준시간1.0000.0160.000
가동시간0.0161.0000.159
태그0.0000.1591.000

Missing values

2023-12-12T21:29:28.227201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:29:28.397338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연월일기준시간하수처리장구분명태그설명태그가동시간
461382020-07-2010(증설)한림 하수처리장잉여/반송펌프M309C18
902822021-05-0118(증설)한림 하수처리장잉여/반송펌프M309E24
247542020-03-1310(증설)한림 하수처리장잉여/반송펌프M309B8
533442021-05-1616(증설)한림 하수처리장잉여/반송펌프M309C14
691802021-01-1912(증설)한림 하수처리장잉여/반송펌프M309D16
239952020-02-1019(증설)한림 하수처리장잉여/반송펌프M309B16
851812020-10-015(증설)한림 하수처리장잉여/반송펌프M309E10
936132019-07-1813(증설)한림 하수처리장잉여/반송펌프M309F20
664582020-09-282(증설)한림 하수처리장잉여/반송펌프M309D12
723252021-05-3013(증설)한림 하수처리장잉여/반송펌프M309D16
기준연월일기준시간하수처리장구분명태그설명태그가동시간
43142019-12-2918(증설)한림 하수처리장잉여/반송펌프M309A15
57332020-02-2621(증설)한림 하수처리장잉여/반송펌프M309A16
564752019-07-253(증설)한림 하수처리장잉여/반송펌프M309D9
308472020-12-077(증설)한림 하수처리장잉여/반송펌프M309B14
196062019-08-1122(증설)한림 하수처리장잉여/반송펌프M309B20
476312020-09-2015(증설)한림 하수처리장잉여/반송펌프M309C17
221352019-11-257(증설)한림 하수처리장잉여/반송펌프M309B13
886842021-02-244(증설)한림 하수처리장잉여/반송펌프M309E20
205212019-09-191(증설)한림 하수처리장잉여/반송펌프M309B16
474112020-09-1111(증설)한림 하수처리장잉여/반송펌프M309C15