Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 644.5 KiB |
Average record size in memory | 66.0 B |
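As a quick sanity check, the average record size follows from the total memory figure and the row count. This is a minimal sketch that assumes 1 KiB = 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table above.
total_kib = 644.5   # Total size in memory (KiB)
n_rows = 10_000     # Number of observations

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_kib * 1024 / n_rows
print(f"{avg_record_bytes:.1f} B")
```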
Variable types
DateTime | 1 |
---|---|
Numeric | 2 |
Categorical | 4 |
Dataset
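Description | A service for querying hourly measurement readings at the Sangdong sewage treatment facility operated by the Gimhae Urban Development Corporation. It provides the reference date (기준연월일), reference hour (기준시간), sewage treatment plant name (하수처리장구분명), measurement type (계측구분명), measured value (계측값), and related fields. |
---|---|
Author | Gimhae Urban Development Corporation (김해시도시개발공사) |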
URL | https://www.data.go.kr/data/15096555/fileData.do |
하수처리장구분명 has constant value "상동 공공하수처리시설" | Constant |
계측단위 is highly overall correlated with 기준시간 and 3 other fields | High correlation |
계측태그명 is highly overall correlated with 계측값 and 2 other fields | High correlation |
계측구분명 is highly overall correlated with 계측값 and 2 other fields | High correlation |
기준시간 is highly overall correlated with 계측단위 | High correlation |
계측값 is highly overall correlated with 계측구분명 and 2 other fields | High correlation |
계측단위 is highly imbalanced (89.3%) | Imbalance |
기준시간 has 396 (4.0%) zeros | Zeros |
계측값 has 775 (7.8%) zeros | Zeros |
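The report does not state how the imbalance figure is computed. One definition that reproduces the 89.3% reported for 계측단위 is one minus the normalized Shannon entropy of the class counts. The sketch below uses that definition as an assumption, with the counts taken from the 계측단위 "Common Values" table; `imbalance_score` is a hypothetical helper, not a function of the profiling library.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    max_entropy = math.log2(len(counts))
    # A single class has no entropy to normalize against; treat it as score 0.
    return 1.0 - entropy / max_entropy if max_entropy > 0 else 0.0

# Counts for 계측단위: 9859 rows of <NA> and 141 rows of pH.
score = imbalance_score([9859, 141])
print(f"{score:.1%}")
```

A score near 100% means almost every row falls into one class; a score of 0% means the classes are evenly populated.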
Reproduction
Analysis started | 2023-12-12 14:21:16.927146 |
---|---|
Analysis finished | 2023-12-12 14:21:18.248732 |
Duration | 1.32 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
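A report of this kind can be regenerated along the following lines. This is a sketch only: the CSV path, report title, and output file name are hypothetical, and it assumes the data is available as a CSV whose date column is named 기준연월일, as in this report.

```python
def build_report(csv_path, title="Sangdong sewage treatment measurements"):
    """Profile a CSV export of the dataset (path and title are hypothetical)."""
    # Imported lazily so this module loads even without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse the date column so it is profiled as a date rather than as text.
    df = pd.read_csv(csv_path, parse_dates=["기준연월일"])
    return ProfileReport(df, title=title)

# Usage (hypothetical file names):
# build_report("sangdong_measurements.csv").to_file("report.html")
```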
기준연월일 (reference date)
Date
Distinct | 1302 |
---|---|
Distinct (%) | 13.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2018-01-01 00:00:00 |
---|---|
Maximum | 2021-08-25 00:00:00 |
기준시간 (reference hour)
Real number (ℝ)
HIGH CORRELATION
ZEROS
Distinct | 24 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11.5807 |
Minimum | 0 |
---|---|
Maximum | 23 |
Zeros | 396 |
Zeros (%) | 4.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 6 |
median | 12 |
Q3 | 18 |
95-th percentile | 22 |
Maximum | 23 |
Range | 23 |
Interquartile range (IQR) | 12 |
Descriptive statistics
Standard deviation | 6.9359569 |
---|---|
Coefficient of variation (CV) | 0.59892381 |
Kurtosis | -1.2151545 |
Mean | 11.5807 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -0.0057697122 |
Sum | 115807 |
Variance | 48.107498 |
Monotonicity | Not monotonic |
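The descriptive statistics above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below only recombines figures already shown in the 기준시간 tables; the tolerances are arbitrary choices.

```python
import math

# Figures copied from the 기준시간 tables above.
n = 10_000
mean = 11.5807
std = 6.9359569

variance = std ** 2   # reported: 48.107498
cv = std / mean       # reported: 0.59892381
total = mean * n      # reported: 115807

assert math.isclose(variance, 48.107498, rel_tol=1e-6)
assert math.isclose(cv, 0.59892381, rel_tol=1e-6)
assert math.isclose(total, 115807, rel_tol=1e-9)
```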
Value | Count | Frequency (%) |
19 | 456 | 4.6% |
23 | 455 | 4.5% |
6 | 453 | 4.5% |
9 | 439 | 4.4% |
16 | 437 | 4.4% |
12 | 436 | 4.4% |
5 | 434 | 4.3% |
2 | 429 | 4.3% |
15 | 426 | 4.3% |
20 | 425 | 4.2% |
Other values (14) | 5610 | 56.1% |
Value | Count | Frequency (%) |
0 | 396 | |
1 | 397 | |
2 | 429 | |
3 | 405 | |
4 | 418 | |
5 | 434 | |
6 | 453 | |
7 | 385 | |
8 | 406 | |
9 | 439 |
Value | Count | Frequency (%) |
23 | 455 | |
22 | 400 | |
21 | 414 | |
20 | 425 | |
19 | 456 | |
18 | 394 | |
17 | 420 | |
16 | 437 | |
15 | 426 | |
14 | 405 |
하수처리장구분명 (sewage treatment plant name)
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
상동 공공하수처리시설 | 10000 |
---|---|
Length
Max length | 11 |
---|---|
Median length | 11 |
Mean length | 11 |
Min length | 11 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 상동 공공하수처리시설 |
---|---|
2nd row | 상동 공공하수처리시설 |
3rd row | 상동 공공하수처리시설 |
4th row | 상동 공공하수처리시설 |
5th row | 상동 공공하수처리시설 |
Common Values
Value | Count | Frequency (%) |
상동 공공하수처리시설 | 10000 | 100.0% |
Length: plot not shown
Common Values (Plot): plot not shown
Words
Value | Count | Frequency (%) |
상동 | 10000 | 50.0% |
공공하수처리시설 | 10000 | 50.0% |
계측구분명 (measurement type)
Categorical
HIGH CORRELATION
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
유입유량 적산 | 6559 |
---|---|
유량조정조 수위 | 3300 |
반응조PH | 141 |
Length
Max length | 8 |
---|---|
Median length | 7 |
Mean length | 7.3018 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 유입유량 적산 |
---|---|
2nd row | 유입유량 적산 |
3rd row | 유량조정조 수위 |
4th row | 유입유량 적산 |
5th row | 유입유량 적산 |
Common Values
Value | Count | Frequency (%) |
유입유량 적산 | 6559 | 65.6% |
유량조정조 수위 | 3300 | 33.0% |
반응조PH | 141 | 1.4% |
Length: plot not shown
Common Values (Plot): plot not shown
Words
Value | Count | Frequency (%) |
유입유량 | 6559 | 33.0% |
적산 | 6559 | 33.0% |
유량조정조 | 3300 | 16.6% |
수위 | 3300 | 16.6% |
반응조ph | 141 | 0.7% |
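The word table above reports 0.7% for 반응조ph, while the category table reports 1.4% for 반응조PH. The difference is consistent with word frequencies being taken over all tokens rather than over rows. The sketch below reproduces the word counts under an assumed tokenization (lowercase, then split on whitespace); the report itself does not document the tokenizer.

```python
from collections import Counter

# Category counts from the 계측구분명 "Common Values" table above.
category_counts = {"유입유량 적산": 6559, "유량조정조 수위": 3300, "반응조PH": 141}

# Assumed tokenization: lowercase, then split on whitespace.
word_counts = Counter()
for value, count in category_counts.items():
    for word in value.lower().split():
        word_counts[word] += count

# Frequencies are relative to the total number of tokens, not rows.
total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word} | {count} | {count / total_words:.1%}")
```

Two of the three categories contain two words each, so the token total (19859) is almost double the row count, which roughly halves each percentage.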
계측태그명 (measurement tag name)
Categorical
HIGH CORRELATION
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
FIT-101A | 3334 |
---|---|
LIT-101 | 3300 |
FIT-101B | 3225 |
PHIT-202A | 141 |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 7.6841 |
Min length | 7 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | FIT-101B |
---|---|
2nd row | FIT-101B |
3rd row | LIT-101 |
4th row | FIT-101B |
5th row | FIT-101A |
Common Values
Value | Count | Frequency (%) |
FIT-101A | 3334 | |
LIT-101 | 3300 | |
FIT-101B | 3225 | |
PHIT-202A | 141 | 1.4% |
Length: plot not shown
Common Values (Plot): plot not shown
Words
Value | Count | Frequency (%) |
fit-101a | 3334 | |
lit-101 | 3300 | |
fit-101b | 3225 | |
phit-202a | 141 | 1.4% |
계측단위 (measurement unit)
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | 9859 |
---|---|
pH | 141 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9718 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9859 | 98.6% |
pH | 141 | 1.4% |
Length: plot not shown
Common Values (Plot): plot not shown
Words
Value | Count | Frequency (%) |
na | 9859 | 98.6% |
ph | 141 | 1.4% |
계측값 (measured value)
Real number (ℝ)
HIGH CORRELATION
ZEROS
Distinct | 4592 |
---|---|
Distinct (%) | 45.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 538578.56 |
Minimum | 0 |
---|---|
Maximum | 1164880 |
Zeros | 775 |
Zeros (%) | 7.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2.25 |
median | 700074 |
Q3 | 954121.75 |
95-th percentile | 1102982.8 |
Maximum | 1164880 |
Range | 1164880 |
Interquartile range (IQR) | 954119.5 |
Descriptive statistics
Standard deviation | 454152.36 |
---|---|
Coefficient of variation (CV) | 0.84324256 |
Kurtosis | -1.7090882 |
Mean | 538578.56 |
Median Absolute Deviation (MAD) | 372967 |
Skewness | -0.1965468 |
Sum | 5.3857856 × 10^9 |
Variance | 2.0625437 × 10^11 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 775 | 7.8% |
2.04 | 34 | 0.3% |
2.05 | 32 | 0.3% |
2.06 | 30 | 0.3% |
2.13 | 29 | 0.3% |
2.07 | 29 | 0.3% |
1.89 | 29 | 0.3% |
2.43 | 29 | 0.3% |
1.96 | 27 | 0.3% |
2.15 | 26 | 0.3% |
Other values (4582) | 8960 | 89.6% |
Value | Count | Frequency (%) |
0.0 | 775 | 7.8% |
0.13 | 1 | < 0.1% |
0.53 | 1 | < 0.1% |
0.67 | 1 | < 0.1% |
0.79 | 1 | < 0.1% |
0.83 | 1 | < 0.1% |
0.86 | 1 | < 0.1% |
0.88 | 1 | < 0.1% |
0.92 | 1 | < 0.1% |
0.98 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1164880.0 | 1 | < 0.1% |
1164728.0 | 1 | < 0.1% |
1164631.0 | 2 | < 0.1% |
1164478.0 | 1 | < 0.1% |
1164325.0 | 1 | < 0.1% |
1164172.0 | 3 | < 0.1% |
1163715.0 | 1 | < 0.1% |
1162514.0 | 1 | < 0.1% |
1162375.0 | 2 | < 0.1% |
1162230.0 | 1 | < 0.1% |
Variable | 기준시간 | 계측구분명 | 계측태그명 | 계측값 |
---|---|---|---|---|
기준시간 | 1.000 | 0.000 | 0.000 | 0.000 |
계측구분명 | 0.000 | 1.000 | 1.000 | 0.714 |
계측태그명 | 0.000 | 1.000 | 1.000 | 0.670 |
계측값 | 0.000 | 0.714 | 0.670 | 1.000 |
Variable | 계측단위 | 계측태그명 | 계측구분명 |
---|---|---|---|
계측단위 | 1.000 | 1.000 | 1.000 |
계측태그명 | 1.000 | 1.000 | 1.000 |
계측구분명 | 1.000 | 1.000 | 1.000 |
Variable | 기준시간 | 계측값 | 계측구분명 | 계측태그명 | 계측단위 |
---|---|---|---|---|---|
기준시간 | 1.000 | 0.006 | 0.000 | 0.000 | 1.000 |
계측값 | 0.006 | 1.000 | 0.633 | 0.532 | 1.000 |
계측구분명 | 0.000 | 0.633 | 1.000 | 1.000 | 1.000 |
계측태그명 | 0.000 | 0.532 | 1.000 | 1.000 | 1.000 |
계측단위 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
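The correlation tables above do not name the statistic behind each coefficient. For pairs of categorical columns, one common association measure is Cramér's V, and the sketch below shows how a value of 1.000 arises when one column fully determines another. Both the choice of plain (uncorrected) Cramér's V and the cross-tabulation are assumptions: the table presumes every 반응조PH row carries the pH unit and no other row does, which matches the marginal counts but is not shown directly in the report.

```python
import math

def cramers_v(table):
    """Plain (uncorrected) Cramér's V for a contingency table given as rows of counts."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Hypothetical cross-tabulation: 계측구분명 (rows) against 계측단위 (columns: <NA>, pH).
table = [
    [6559, 0],   # 유입유량 적산
    [3300, 0],   # 유량조정조 수위
    [0, 141],    # 반응조PH
]
v = cramers_v(table)
print(round(v, 3))
```

A perfect association like this reflects the data layout (the unit is only recorded for pH readings) rather than an independent relationship between the columns.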
Row index | 기준연월일 | 기준시간 | 하수처리장구분명 | 계측구분명 | 계측태그명 | 계측단위 | 계측값 |
---|---|---|---|---|---|---|---|
74434 | 2019-05-22 | 10 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101B | <NA> | 797040.0 |
74008 | 2019-05-04 | 16 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101B | <NA> | 788980.0 |
25254 | 2020-12-02 | 6 | 상동 공공하수처리시설 | 유량조정조 수위 | LIT-101 | <NA> | 2.45 |
82598 | 2020-04-28 | 14 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101B | <NA> | 965562.0 |
35008 | 2018-06-13 | 16 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101A | <NA> | 693249.0 |
92210 | 2021-06-19 | 2 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101B | <NA> | 1086672.0 |
45958 | 2019-09-14 | 22 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101A | <NA> | 913444.0 |
4656 | 2018-07-21 | 0 | 상동 공공하수처리시설 | 유량조정조 수위 | LIT-101 | <NA> | 1.6 |
77973 | 2019-10-16 | 21 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101B | <NA> | 873780.0 |
10510 | 2019-03-23 | 22 | 상동 공공하수처리시설 | 유량조정조 수위 | LIT-101 | <NA> | 2.39 |
Row index | 기준연월일 | 기준시간 | 하수처리장구분명 | 계측구분명 | 계측태그명 | 계측단위 | 계측값 |
---|---|---|---|---|---|---|---|
46144 | 2019-09-22 | 16 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101A | <NA> | 917044.0 |
58523 | 2021-02-24 | 11 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101A | <NA> | 1109549.0 |
41946 | 2019-03-31 | 18 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101A | <NA> | 830981.0 |
36018 | 2018-07-25 | 18 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101A | <NA> | 714997.0 |
27438 | 2021-03-03 | 6 | 상동 공공하수처리시설 | 유량조정조 수위 | LIT-101 | <NA> | 1.76 |
23696 | 2020-09-26 | 8 | 상동 공공하수처리시설 | 유량조정조 수위 | LIT-101 | <NA> | 2.66 |
45837 | 2019-09-09 | 21 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101A | <NA> | 911194.0 |
81523 | 2020-03-14 | 19 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101B | <NA> | 945267.0 |
63459 | 2018-02-15 | 3 | 상동 공공하수처리시설 | 유입유량 적산 | FIT-101B | <NA> | 580704.0 |
6843 | 2018-10-21 | 3 | 상동 공공하수처리시설 | 유량조정조 수위 | LIT-101 | <NA> | 2.1 |