Overview

Dataset statistics

Number of variables8
Number of observations371
Missing cells13
Missing cells (%)0.4%
Duplicate rows1
Duplicate rows (%)0.3%
Total size in memory23.7 KiB
Average record size in memory65.4 B

Variable types

Numeric1
Unsupported7

Dataset

Description2022년도 대전 흑석하수처리장 방류수질을 일일데이터로 측정한 결과로 BOD, TOC, SS, T-N, T-P, 총대장균군수를 측정한 결과를 포함한다.
Author대전광역시
URLhttps://www.data.go.kr/data/15112167/fileData.do

Alerts

Dataset has 1 (0.3%) duplicate rowsDuplicates
Unnamed: 0 has 6 (1.6%) missing valuesMissing
구 분 is an unsupported type, check if it needs cleaning or further analysisUnsupported
방 류 농 도 (mg/L, 개/mL) is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 20:11:18.771939
Analysis finished2023-12-12 20:11:19.460482
Duration0.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Unnamed: 0
Real number (ℝ)

MISSING 

Distinct365
Distinct (%)100.0%
Missing6
Missing (%)1.6%
Infinite0
Infinite (%)0.0%
Mean183
Minimum1
Maximum365
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.4 KiB
2023-12-13T05:11:19.552453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile19.2
Q192
median183
Q3274
95-th percentile346.8
Maximum365
Range364
Interquartile range (IQR)182

Descriptive statistics

Standard deviation105.51066
Coefficient of variation (CV)0.576561
Kurtosis-1.2
Mean183
Median Absolute Deviation (MAD)91
Skewness0
Sum66795
Variance11132.5
MonotonicityStrictly increasing
2023-12-13T05:11:19.701541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
252 1
 
0.3%
250 1
 
0.3%
249 1
 
0.3%
248 1
 
0.3%
247 1
 
0.3%
246 1
 
0.3%
245 1
 
0.3%
244 1
 
0.3%
243 1
 
0.3%
242 1
 
0.3%
Other values (355) 355
95.7%
(Missing) 6
 
1.6%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
365 1
0.3%
364 1
0.3%
363 1
0.3%
362 1
0.3%
361 1
0.3%
360 1
0.3%
359 1
0.3%
358 1
0.3%
357 1
0.3%
356 1
0.3%

구 분
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)0.3%
Memory size3.0 KiB

방 류 농 도 (mg/L, 개/mL)
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)0.3%
Memory size3.0 KiB

Unnamed: 3
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)0.3%
Memory size3.0 KiB

Unnamed: 4
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)0.3%
Memory size3.0 KiB

Unnamed: 5
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)0.3%
Memory size3.0 KiB

Unnamed: 6
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)0.3%
Memory size3.0 KiB

Unnamed: 7
Unsupported

REJECTED  UNSUPPORTED 

Missing1
Missing (%)0.3%
Memory size3.0 KiB

Interactions

2023-12-13T05:11:18.830899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T05:11:19.003789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:11:19.165403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T05:11:19.326203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Unnamed: 0구 분방 류 농 도 (mg/L, 개/mL)Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
0<NA>NaNBODTOCSST-NT-P대장균군수
1<NA>연평균NaNNaNNaNNaNNaNNaN
2<NA>합계207.95694.4739.81680.87643.195638
3<NA>연최고1.754.310.5670.2746
4<NA>연최저0.10.311.6170.031
5<NA>표준편차0.243310.5325770.6488282.0310030.0425240.861364
612022-01-01 00:00:000.221.96.7520.085
722022-01-02 00:00:000.62.11.35.7760.0683
832022-01-03 00:00:000.12.21.17.9840.0891
942022-01-04 00:00:000.10.918.7590.0693
Unnamed: 0구 분방 류 농 도 (mg/L, 개/mL)Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
3613562022-12-22 00:00:000.31.62.28.3250.0982
3623572022-12-23 00:00:000.31.51.59.6050.0862
3633582022-12-24 00:00:000.11.71.510.0390.0752
3643592022-12-25 00:00:000.421.59.7650.0891
3653602022-12-26 00:00:000.622.38.3760.112
3663612022-12-27 00:00:000.51.92.17.920.1061
3673622022-12-28 00:00:000.21.91.89.590.1061
3683632022-12-29 00:00:000.21.82.57.0230.0921
3693642022-12-30 00:00:000.422.65.0630.12
3703652022-12-31 00:00:000.521.98.0830.093

Duplicate rows

Most frequently occurring

Unnamed: 0# duplicates
0<NA>6