Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 905 |
Missing cells (%) | 1.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 556.6 KiB |
Average record size in memory | 57.0 B |
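The percentage and per-record figures in this table follow from the raw counts. A minimal arithmetic check, using only numbers reported in this document (the two per-column missing counts are the ones listed in the 계측태그명 and 계측값 sections); the variable names are illustrative:

```python
# Figures copied from the report; nothing here is measured independently.
n_variables = 6
n_observations = 10_000
missing_cells = 905
total_size_kib = 556.6

# Per-column missing counts as reported in the variable sections.
missing_by_column = {"계측태그명": 605, "계측값": 300}

# Missing cells are counted over every cell (rows x columns).
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
print("Per-column counts add up:", sum(missing_by_column.values()) == missing_cells)
```

All missing cells are therefore accounted for by two of the six columns.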
Variable types
DateTime | 1 |
---|---|
Categorical | 2 |
Text | 2 |
Numeric | 1 |
Dataset
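Description | A service for querying the daily measurement status of each sewage treatment facility operated by 김해도시개발공사 (Gimhae Urban Development Corporation). It provides the reference date (기준연월일), treatment plant name (하수처리장구분명), measurement name (계측구분명), measured value (계측값), and related fields. |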
---|---|
Author | 김해시도시개발공사 |
URL | https://www.data.go.kr/data/15096571/fileData.do |
Reproduction
Analysis started | 2023-12-12 12:54:39.057428 |
---|---|
Analysis finished | 2023-12-12 12:54:39.791445 |
Duration | 0.73 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
기준연월일
Date
Distinct | 200 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2018-01-01 00:00:00 |
---|---|
Maximum | 2018-07-19 00:00:00 |
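The minimum and maximum can be compared against the distinct count to see whether any day is skipped. A small sketch using only the values reported for 기준연월일:

```python
from datetime import date

# Values copied from the 기준연월일 section of the report.
first_day = date(2018, 1, 1)
last_day = date(2018, 7, 19)
distinct_dates = 200

# Inclusive number of calendar days between the two endpoints.
calendar_days = (last_day - first_day).days + 1

print(calendar_days, calendar_days == distinct_dates)
```

Because the number of distinct dates equals the number of calendar days in the range, every day between the two endpoints appears at least once in the profiled rows.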
하수처리장구분명
Categorical
HIGH CORRELATION
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 16 |
---|---|
Median length | 13 |
Mean length | 10.904 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 진영맑은물사업소 |
---|---|
2nd row | 진영맑은물사업소 |
3rd row | 장유 하수처리장 |
4th row | 안하 하수처리장 |
5th row | 진영맑은물사업소 HANT반응조 |
Common Values
Value | Count | Frequency (%) |
진영맑은물사업소 HANT반응조 | 2463 | 24.6% |
장유 하수처리장 | 1506 | 15.1% |
(증설)진례 하수처리장 | 1362 | 13.6% |
진영맑은물사업소 | 1347 | 13.5% |
안하 하수처리장 | 1025 | 10.2% |
상동 공공하수처리시설 | 972 | 9.7% |
진례 하수처리장 | 527 | 5.3% |
생림 하수처리장 | 487 | 4.9% |
낙산마을 하수처리장 | 65 | 0.7% |
용산마을 하수처리장 | 63 | 0.6% |
Other values (6) | 183 | 1.8% |
Words
Value | Count | Frequency (%) |
하수처리장 | 5178 | |
진영맑은물사업소 | 3810 | |
hant반응조 | 2463 | |
장유 | 1506 | 8.1% |
증설)진례 | 1362 | 7.3% |
안하 | 1025 | 5.5% |
상동 | 972 | 5.2% |
공공하수처리시설 | 972 | 5.2% |
진례 | 527 | 2.8% |
생림 | 487 | 2.6% |
Other values (9) | 351 | 1.9% |
계측구분명
Text
Distinct | 257 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 23 |
---|---|
Median length | 18 |
Mean length | 7.9533 |
Min length | 2 |
Characters and Unicode
Total characters | 79533 |
---|---|
Distinct characters | 166 |
Distinct categories | 9 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 포기조 용존산소량 |
---|---|
2nd row | 케잌호퍼 중량 |
3rd row | 2지슬러지량 |
4th row | 구연산 공급펌프 토출유량 적산 |
5th row | 여과막 토출량계 |
Value | Count | Frequency (%) |
수위 | 997 | 5.5% |
여과막 | 824 | 4.6% |
토출량계 | 824 | 4.6% |
슬러지 | 390 | 2.2% |
반응조 | 375 | 2.1% |
유량 | 365 | 2.0% |
분리막 | 345 | 1.9% |
압력계 | 345 | 1.9% |
흡입 | 345 | 1.9% |
공급유량 | 272 | 1.5% |
Other values (236) | 12954 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 8036 | 10.1% |
량 | 5034 | 6.3% |
조 | 3560 | 4.5% |
수 | 3533 | 4.4% |
유 | 3012 | 3.8% |
계 | 2570 | 3.2% |
위 | 1957 | 2.5% |
입 | 1861 | 2.3% |
지 | 1831 | 2.3% |
반 | 1752 | 2.2% |
Other values (156) | 46387 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 60576 | |
Space Separator | 8036 | 10.1% |
Uppercase Letter | 7996 | 10.1% |
Close Punctuation | 949 | 1.2% |
Open Punctuation | 949 | 1.2% |
Decimal Number | 655 | 0.8% |
Lowercase Letter | 314 | 0.4% |
Other Punctuation | 40 | 0.1% |
Dash Punctuation | 18 | < 0.1% |
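The category names in this table are Unicode general categories. The sketch below shows one way such counts could be reproduced with Python's standard `unicodedata` module; it is an illustration applied to the five sample rows of 계측구분명, not the profiler's own implementation, and `category_counts` and `CATEGORY_NAMES` are hypothetical names.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        # NFC keeps each Hangul syllable as a single code point.
        for ch in unicodedata.normalize("NFC", value):
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The five sample rows shown for 계측구분명.
sample = [
    "포기조 용존산소량",
    "케잌호퍼 중량",
    "2지슬러지량",
    "구연산 공급펌프 토출유량 적산",
    "여과막 토출량계",
]

counts = category_counts(sample)
for name, count in counts.most_common():
    print(name, count)
```

Hangul syllables fall under "Other Letter" because Hangul has no upper or lower case, which is why that category dominates the column.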
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
량 | 5034 | 8.3% |
조 | 3560 | 5.9% |
수 | 3533 | 5.8% |
유 | 3012 | 5.0% |
계 | 2570 | 4.2% |
위 | 1957 | 3.2% |
입 | 1861 | 3.1% |
지 | 1831 | 3.0% |
반 | 1752 | 2.9% |
토 | 1441 | 2.4% |
Other values (122) | 34025 |
Uppercase Letter
Value | Count | Frequency (%) |
O | 1338 | |
L | 1101 | |
S | 828 | |
H | 750 | |
P | 682 | |
M | 524 | 6.6% |
R | 483 | 6.0% |
D | 407 | 5.1% |
A | 390 | 4.9% |
N | 378 | 4.7% |
Other values (8) | 1115 |
Decimal Number
Value | Count | Frequency (%) |
2 | 215 | |
3 | 178 | |
1 | 149 | |
4 | 44 | 6.7% |
5 | 21 | 3.2% |
6 | 18 | 2.7% |
7 | 17 | 2.6% |
8 | 13 | 2.0% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 235 | |
p | 62 | 19.7% |
h | 17 | 5.4% |
Space Separator
Value | Count | Frequency (%) |
(space) | 8036 |
Close Punctuation
Value | Count | Frequency (%) |
) | 949 |
Open Punctuation
Value | Count | Frequency (%) |
( | 949 |
Other Punctuation
Value | Count | Frequency (%) |
# | 40 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 18 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 60576 | |
Common | 10647 | 13.4% |
Latin | 8310 | 10.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
량 | 5034 | 8.3% |
조 | 3560 | 5.9% |
수 | 3533 | 5.8% |
유 | 3012 | 5.0% |
계 | 2570 | 4.2% |
위 | 1957 | 3.2% |
입 | 1861 | 3.1% |
지 | 1831 | 3.0% |
반 | 1752 | 2.9% |
토 | 1441 | 2.4% |
Other values (122) | 34025 |
Latin
Value | Count | Frequency (%) |
O | 1338 | |
L | 1101 | |
S | 828 | |
H | 750 | |
P | 682 | |
M | 524 | 6.3% |
R | 483 | 5.8% |
D | 407 | 4.9% |
A | 390 | 4.7% |
N | 378 | 4.5% |
Other values (11) | 1429 |
Common
Value | Count | Frequency (%) |
(space) | 8036 | |
) | 949 | 8.9% |
( | 949 | 8.9% |
2 | 215 | 2.0% |
3 | 178 | 1.7% |
1 | 149 | 1.4% |
4 | 44 | 0.4% |
# | 40 | 0.4% |
5 | 21 | 0.2% |
6 | 18 | 0.2% |
Other values (3) | 48 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 60576 | |
ASCII | 18957 | 23.8% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 8036 | |
O | 1338 | 7.1% |
L | 1101 | 5.8% |
) | 949 | 5.0% |
( | 949 | 5.0% |
S | 828 | 4.4% |
H | 750 | 4.0% |
P | 682 | 3.6% |
M | 524 | 2.8% |
R | 483 | 2.5% |
Other values (24) | 3317 |
Hangul
Value | Count | Frequency (%) |
량 | 5034 | 8.3% |
조 | 3560 | 5.9% |
수 | 3533 | 5.8% |
유 | 3012 | 5.0% |
계 | 2570 | 4.2% |
위 | 1957 | 3.2% |
입 | 1861 | 3.1% |
지 | 1831 | 3.0% |
반 | 1752 | 2.9% |
토 | 1441 | 2.4% |
Other values (122) | 34025 |
계측태그명
Text
MISSING
Distinct | 397 |
---|---|
Distinct (%) | 4.2% |
Missing | 605 |
Missing (%) | 6.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
lit-302 | 130 | 1.4% |
lit-301 | 113 | 1.2% |
lit-401 | 107 | 1.1% |
lit-101 | 107 | 1.1% |
lit-201a | 83 | 0.9% |
lit-201b | 80 | 0.9% |
fit-404 | 71 | 0.8% |
fit-402h | 55 | 0.6% |
mlss-204a | 53 | 0.6% |
fit-201 | 52 | 0.6% |
Other values (387) | 8544 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8716 | |
- | 8113 | |
1 | 6340 | 9.2% |
I | 5939 | 8.6% |
T | 5187 | 7.5% |
F | 4485 | 6.5% |
2 | 4326 | 6.3% |
4 | 3459 | 5.0% |
L | 2492 | 3.6% |
A | 2478 | 3.6% |
Other values (25) | 17494 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 32441 | |
Decimal Number | 28475 | |
Dash Punctuation | 8113 | 11.8% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
I | 5939 | |
T | 5187 | |
F | 4485 | |
L | 2492 | |
A | 2478 | |
B | 1809 | 5.6% |
P | 1573 | 4.8% |
R | 1333 | 4.1% |
Q | 1279 | 3.9% |
O | 1036 | 3.2% |
Other values (14) | 4830 |
Decimal Number
Value | Count | Frequency (%) |
0 | 8716 | |
1 | 6340 | |
2 | 4326 | |
4 | 3459 | 12.1% |
3 | 2246 | 7.9% |
5 | 1411 | 5.0% |
6 | 659 | 2.3% |
9 | 581 | 2.0% |
8 | 533 | 1.9% |
7 | 204 | 0.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 8113 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 36588 | |
Latin | 32441 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
I | 5939 | |
T | 5187 | |
F | 4485 | |
L | 2492 | |
A | 2478 | |
B | 1809 | 5.6% |
P | 1573 | 4.8% |
R | 1333 | 4.1% |
Q | 1279 | 3.9% |
O | 1036 | 3.2% |
Other values (14) | 4830 |
Common
Value | Count | Frequency (%) |
0 | 8716 | |
- | 8113 | |
1 | 6340 | |
2 | 4326 | |
4 | 3459 | 9.5% |
3 | 2246 | 6.1% |
5 | 1411 | 3.9% |
6 | 659 | 1.8% |
9 | 581 | 1.6% |
8 | 533 | 1.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 69029 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8716 | |
- | 8113 | |
1 | 6340 | 9.2% |
I | 5939 | 8.6% |
T | 5187 | 7.5% |
F | 4485 | 6.5% |
2 | 4326 | 6.3% |
4 | 3459 | 5.0% |
L | 2492 | 3.6% |
A | 2478 | 3.6% |
Other values (25) | 17494 |
계측단위
Categorical
HIGH CORRELATION
Distinct | 42 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 2.6025 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | ppm |
---|---|
2nd row | ton |
3rd row | ㎥/h |
4th row | ㎥ |
5th row | ㎥ |
Common Values
Value | Count | Frequency (%) |
<NA> | 1504 | 15.0% |
㎥ | 1387 | 13.9% |
㎥/H | 921 | 9.2% |
M | 801 | 8.0% |
m | 688 | 6.9% |
㎥/hr | 654 | 6.5% |
㎥/h | 473 | 4.7% |
mmHg | 377 | 3.8% |
mV | 324 | 3.2% |
% | 317 | 3.2% |
Other values (32) | 2554 |
Words
Value | Count | Frequency (%) |
na | 1504 | |
m | 1489 | |
㎥/h | 1394 | |
㎥ | 1387 | |
㎥/hr | 654 | 6.5% |
mv | 448 | 4.5% |
mmhg | 377 | 3.8% |
% | 317 | 3.2% |
ppm | 305 | 3.0% |
mg/l | 274 | 2.7% |
Other values (22) | 1851 |
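The two tables for 계측단위 show the same unit under several spellings (㎥/H, ㎥/hr, ㎥/h; M and m), and a literal `<NA>` category of 1504 rows even though the column reports 0 missing values. The sketch below merges those spellings before counting. The alias table is an assumption made for illustration: the report does not state that "M" means metres or that the three flow-rate spellings are equivalent, so `UNIT_ALIASES`, `MISSING_MARKERS` and `normalize_unit` are hypothetical names to adapt after checking the source data.

```python
from collections import Counter

# Counts copied from the Common Values table for 계측단위.
reported = {
    "<NA>": 1504,
    "㎥": 1387,
    "㎥/H": 921,
    "M": 801,
    "m": 688,
    "㎥/hr": 654,
    "㎥/h": 473,
    "mmHg": 377,
    "mV": 324,
    "%": 317,
}

# ASSUMPTION: these spellings denote the same unit (not confirmed by the report).
UNIT_ALIASES = {"㎥/H": "㎥/h", "㎥/hr": "㎥/h", "M": "m"}

# ASSUMPTION: the literal string "<NA>" stands for a missing unit.
MISSING_MARKERS = {"", "<NA>"}


def normalize_unit(raw):
    """Return a canonical unit string, or None when the unit is missing."""
    if raw is None:
        return None
    unit = raw.strip()
    if unit in MISSING_MARKERS:
        return None
    # Explicit aliases only: blanket lowercasing would corrupt mV or mmHg.
    return UNIT_ALIASES.get(unit, unit)


merged = Counter()
for unit, count in reported.items():
    merged[normalize_unit(unit)] += count

for unit, count in merged.most_common():
    print(unit, count)
```

An explicit alias table is used instead of case folding because case carries meaning in unit symbols. Under these assumptions the merged count for "m" is 801 + 688 = 1489, the same figure the lowercased word table reports.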
계측값
Real number (ℝ)
MISSING, SKEWED, ZEROS
Distinct | 3862 |
---|---|
Distinct (%) | 39.8% |
Missing | 300 |
Missing (%) | 3.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 60210.967 |
Minimum | -1500 |
---|---|
Maximum | 1.7874988 × 10^8 |
Zeros | 1830 |
Zeros (%) | 18.3% |
Negative | 765 |
Negative (%) | 7.6% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -1500 |
---|---|
5-th percentile | -256.271 |
Q1 | 0 |
median | 2.55 |
Q3 | 24.505 |
95-th percentile | 2734.1265 |
Maximum | 1.7874988 × 10^8 |
Range | 1.7875138 × 10^8 |
Interquartile range (IQR) | 24.505 |
Descriptive statistics
Standard deviation | 3143729.9 |
---|---|
Coefficient of variation (CV) | 52.211915 |
Kurtosis | 3227.3437 |
Mean | 60210.967 |
Median Absolute Deviation (MAD) | 2.55 |
Skewness | 56.8099 |
Sum | 5.8404638 × 10^8 |
Variance | 9.8830376 × 10^12 |
Monotonicity | Not monotonic |
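Several of these statistics are derived from one another, so they can be cross-checked. The sketch below uses only figures reported for 계측값. The 1.5 × IQR fences at the end are the common Tukey convention, added here as an assumption to show how far the extremes lie from the bulk of the data; the report itself does not compute them.

```python
import math

# Figures copied from the 계측값 section of the report.
mean = 60210.967
std = 3143729.9
q1, q3 = 0.0, 24.505
minimum, maximum = -1500.0, 178749875.0
n_non_missing = 10_000 - 300  # observations minus missing values

cv = std / mean                 # coefficient of variation
variance = std ** 2
iqr = q3 - q1                   # interquartile range
value_range = maximum - minimum
total = mean * n_non_missing    # should match the reported Sum

# ASSUMPTION: Tukey's 1.5 x IQR rule, not part of the report.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(f"CV       = {cv:.6f}")
print(f"Variance = {variance:.7e}")
print(f"Sum      = {total:.7e}")
print(f"Range    = {value_range:.7e}")
print(f"Fences   = [{lower_fence:.4f}, {upper_fence:.4f}]")
print("Variance matches report:", math.isclose(variance, 9.8830376e12, rel_tol=1e-6))
```

Both the 5th percentile (-256.271) and the 95th percentile (2734.1265) fall outside these fences, which is consistent with the SKEWED alert and the very large kurtosis.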
Value | Count | Frequency (%) |
0.0 | 1830 | 18.3% |
0.01 | 76 | 0.8% |
1.0 | 51 | 0.5% |
4.8 | 48 | 0.5% |
4.7 | 48 | 0.5% |
2.5 | 47 | 0.5% |
0.5 | 47 | 0.5% |
3.2 | 44 | 0.4% |
0.02 | 42 | 0.4% |
0.04 | 39 | 0.4% |
Other values (3852) | 7428 | |
(Missing) | 300 | 3.0% |
Value | Count | Frequency (%) |
-1500.0 | 4 | |
-855.96 | 1 | < 0.1% |
-700.0 | 3 | |
-675.67 | 1 | < 0.1% |
-672.34 | 1 | < 0.1% |
-671.31 | 1 | < 0.1% |
-670.82 | 1 | < 0.1% |
-665.49 | 1 | < 0.1% |
-662.67 | 1 | < 0.1% |
-656.82 | 1 | < 0.1% |
Value | Count | Frequency (%) |
178749875.0 | 1 | < 0.1% |
178749348.5 | 1 | < 0.1% |
178740852.08 | 1 | < 0.1% |
1110303.0 | 23 | |
712008.12 | 1 | < 0.1% |
709004.38 | 1 | < 0.1% |
690174.96 | 1 | < 0.1% |
672942.75 | 1 | < 0.1% |
671262.17 | 1 | < 0.1% |
656615.96 | 1 | < 0.1% |
 | 하수처리장구분명 | 계측단위 | 계측값 |
---|---|---|---|
하수처리장구분명 | 1.000 | 0.898 | 0.000 |
계측단위 | 0.898 | 1.000 | 0.000 |
계측값 | 0.000 | 0.000 | 1.000 |
 | 하수처리장구분명 | 계측단위 |
---|---|---|
하수처리장구분명 | 1.000 | 0.616 |
계측단위 | 0.616 | 1.000 |
 | 계측값 | 하수처리장구분명 | 계측단위 |
---|---|---|---|
계측값 | 1.000 | 0.000 | 0.000 |
하수처리장구분명 | 0.000 | 1.000 | 0.616 |
계측단위 | 0.000 | 0.616 | 1.000 |
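The matrices above do not name the statistic behind each coefficient. For two categorical columns such as 하수처리장구분명 and 계측단위, one widely used association measure is Cramér's V; the sketch below is a plain-Python version without bias correction, offered as an illustration of the idea and not as the method that produced 0.898 or 0.616. The function name and toy inputs are hypothetical.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Cramér's V (no bias correction) for two equal-length categorical sequences."""
    if len(xs) != len(ys):
        raise ValueError("sequences must have the same length")
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    joint = Counter(zip(xs, ys))

    # Degrees-of-freedom style normaliser: min(rows, cols) - 1.
    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k <= 0:
        return 0.0

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, n_r in row_totals.items():
        for c, n_c in col_totals.items():
            expected = n_r * n_c / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    return sqrt(chi2 / (n * k))


# Toy data: each plant uses exactly one unit, so association is perfect.
plants = ["A", "A", "B", "B"]
units_dependent = ["m", "m", "Hz", "Hz"]
units_independent = ["m", "Hz", "m", "Hz"]

print(cramers_v(plants, units_dependent))
print(cramers_v(plants, units_independent))
```

A value near 1 means that knowing the plant almost determines the unit, while a value near 0 means the two columns vary independently.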
 | 기준연월일 | 하수처리장구분명 | 계측구분명 | 계측태그명 | 계측단위 | 계측값 |
---|---|---|---|---|---|---|
22654 | 2018-02-17 | 진영맑은물사업소 | 포기조 용존산소량 | DOIA-101A | ppm | 0.09 |
15757 | 2018-02-03 | 진영맑은물사업소 | 케잌호퍼 중량 | WIA-101 | ton | 4.6 |
69334 | 2018-05-30 | 장유 하수처리장 | 2지슬러지량 | FI311 | ㎥/h | 332.9 |
89362 | 2018-07-08 | 안하 하수처리장 | 구연산 공급펌프 토출유량 적산 | FIQ-402 | ㎥ | <NA> |
45641 | 2018-04-05 | 진영맑은물사업소 HANT반응조 | 여과막 토출량계 | FIT-402L | ㎥ | 28.21 |
81157 | 2018-06-22 | 안하 하수처리장 | 용존산소저감조 차압 | DPI-204A | mbar | -0.34 |
8195 | 2018-01-18 | 진영맑은물사업소 HANT반응조 | NaOCL 수위계 | LIT-901E | m | 0.45 |
7235 | 2018-01-16 | 진영맑은물사업소 HANT반응조 | 반응조 수위 | LIA-401A | m | 5.0 |
62064 | 2018-05-16 | 안하 하수처리장 | 용존산소 저감조 수위계 | LIT-205B | m | 3.0 |
48901 | 2018-04-14 | (증설)진례 하수처리장 | 반송펌프 | P-206A | Hz | 46.78 |
 | 기준연월일 | 하수처리장구분명 | 계측구분명 | 계측태그명 | 계측단위 | 계측값 |
---|---|---|---|---|---|---|
53199 | 2018-04-24 | 진영맑은물사업소 | 한트반응조유입유량 | FRQ-101 | ㎥ | 614.29 |
51897 | 2018-04-21 | 진례 하수처리장 | 방류하수량 | FT-302 | ㎥/hr | 0.0 |
37046 | 2018-03-18 | 진영맑은물사업소 HANT반응조 | 호기조 DO계 | DO-401B | mg/ℓ | 0.17 |
11533 | 2018-01-25 | 진영맑은물사업소 | 슬러지 저장조액위 | LIA-201 | m | 1.73 |
76738 | 2018-06-13 | 진영맑은물사업소 HANT반응조 | ALUM주입량계 | FIT-901C | ℓ | 0.0 |
42053 | 2018-03-29 | 상동 공공하수처리시설 | 반응조 수위설정(LO) | LIT-201B | <NA> | 3.2 |
49595 | 2018-04-15 | 진영맑은물사업소 HANT반응조 | 여과막 토출량계 | FIT-402E | ㎥ | 0.0 |
78296 | 2018-06-16 | 진영맑은물사업소 HANT반응조 | NaOH주입량계 | FIT-901D | ℓ/min | 0.0 |
78702 | 2018-06-17 | 장유 하수처리장 | 탈질조 | ORP2301A | mV | -1500.0 |
70515 | 2018-06-01 | 진영맑은물사업소 | 초침슬러지 반송유량 | FRQ-103A | ㎥ | 0.0 |