Overview

Dataset statistics

Number of variables: 8
Number of observations: 62
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.2 KiB
Average record size in memory: 69.1 B

Variable types

Categorical: 6
Numeric: 1
DateTime: 1

Dataset

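Description: Tap water quality test results, including measurement date, category (청계통합정수장 or 포일정수장 water treatment plant), taste, odor, color, pH, turbidity, and residual chlorine.
Author: 경기도 안양시 (Anyang City, Gyeonggi Province)
URL: https://www.data.go.kr/data/3038213/fileData.do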

Alerts

(unnamed variable) has constant value "" [Constant]
냄새 has constant value "" [Constant]
색도 has constant value "" [Constant]
잔류염소 is highly overall correlated with 구분 [High correlation]
구분 is highly overall correlated with 잔류염소 and 1 other field [High correlation]
탁도 is highly overall correlated with 구분 [High correlation]
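
A "Constant" alert corresponds to a column that holds a single distinct value. The following is a minimal sketch of that check, assuming the columns are available as plain Python lists. The `columns` dict and the `constant_columns` helper are hypothetical names used for illustration and are not part of ydata-profiling; the counts are taken from the variable summaries in this report.

```python
# Hypothetical in-memory columns; counts follow the per-variable summaries.
columns = {
    "구분": ["청계통합정수장"] * 31 + ["포일정수장"] * 31,
    "냄새": ["없음"] * 62,
    "색도": ["1이하"] * 62,
    "탁도": ["0.03"] * 35 + ["0.04"] * 27,
}


def constant_columns(cols):
    """Return the names of columns that contain exactly one distinct value."""
    return [name for name, values in cols.items() if len(set(values)) == 1]


flagged = constant_columns(columns)
print(flagged)
```

Constant columns carry no information for modeling or correlation analysis, which is why they are typically dropped or at least reviewed.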

Reproduction

Analysis started: 2024-04-21 02:41:23.269090
Analysis finished: 2024-04-21 02:41:24.913990
Duration: 1.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

구분 (category: water treatment plant)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Memory size: 628.0 B
청계통합정수장: 31
포일정수장: 31

Length

Max length: 7
Median length: 6
Mean length: 6
Min length: 5
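
The length statistics above follow directly from the character lengths of the two category values and their counts. A small sketch using only the standard library (the variable names are illustrative):

```python
from statistics import mean, median

# 31 observations per plant, as listed in the value counts for 구분.
values = ["청계통합정수장"] * 31 + ["포일정수장"] * 31
lengths = [len(v) for v in values]  # character counts: 7 and 5

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(length_stats)
```

Because the two values occur equally often, both the median and the mean fall midway between 5 and 7.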

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 청계통합정수장
2nd row: 청계통합정수장
3rd row: 청계통합정수장
4th row: 청계통합정수장
5th row: 청계통합정수장

Common Values

Value | Count | Frequency (%)
청계통합정수장 | 31 | 50.0%
포일정수장 | 31 | 50.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]


(unnamed variable)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 628.0 B
없음 (none): 62

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 없음
2nd row: 없음
3rd row: 없음
4th row: 없음
5th row: 없음

Common Values

Value | Count | Frequency (%)
없음 | 62 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

냄새 (odor)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 628.0 B
없음 (none): 62

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 없음
2nd row: 없음
3rd row: 없음
4th row: 없음
5th row: 없음

Common Values

Value | Count | Frequency (%)
없음 | 62 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

색도 (color)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 628.0 B
1이하 (1 or less): 62

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1이하
2nd row: 1이하
3rd row: 1이하
4th row: 1이하
5th row: 1이하

Common Values

Value | Count | Frequency (%)
1이하 | 62 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

수소 이온 농도(pH) (hydrogen ion concentration)
Categorical

Distinct: 5
Distinct (%): 8.1%
Missing: 0
Missing (%): 0.0%
Memory size: 628.0 B
7.4: 31
7.5: 18
7.6: 6
7.3: 4
7.2: 3

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 7.3
2nd row: 7.4
3rd row: 7.4
4th row: 7.4
5th row: 7.4

Common Values

Value | Count | Frequency (%)
7.4 | 31 | 50.0%
7.5 | 18 | 29.0%
7.6 | 6 | 9.7%
7.3 | 4 | 6.5%
7.2 | 3 | 4.8%
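
The frequency percentages in the table above are each count divided by the 62 observations. As a hedged illustration (not the profiler's own code), the same figures can be reproduced with `collections.Counter`; the list `ph_values` is rebuilt here from the reported counts rather than read from the original file.

```python
from collections import Counter

# pH values rebuilt from the counts in the Common Values table.
ph_values = ["7.4"] * 31 + ["7.5"] * 18 + ["7.6"] * 6 + ["7.3"] * 4 + ["7.2"] * 3

counts = Counter(ph_values)
total = sum(counts.values())

# Frequency (%) per value, rounded to one decimal as in the report.
frequencies = {value: round(100 * n / total, 1) for value, n in counts.items()}
distinct_pct = round(100 * len(counts) / total, 1)

for value, n in counts.most_common():
    print(f"{value} | {n} | {frequencies[value]}%")
```

Since pH is stored as text here, it is profiled as a categorical variable; converting it to a numeric type before profiling would yield quantile and descriptive statistics instead.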

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

탁도 (turbidity)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Memory size: 628.0 B
0.03: 35
0.04: 27

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0.03
2nd row: 0.03
3rd row: 0.03
4th row: 0.03
5th row: 0.03

Common Values

Value | Count | Frequency (%)
0.03 | 35 | 56.5%
0.04 | 27 | 43.5%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

잔류염소 (residual chlorine)
Real number (ℝ)

HIGH CORRELATION

Distinct: 20
Distinct (%): 32.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.83790323
Minimum: 0.71
Maximum: 1.01
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 690.0 B

Quantile statistics

Minimum: 0.71
5-th percentile: 0.7705
Q1: 0.81
Median: 0.83
Q3: 0.8575
95-th percentile: 0.9195
Maximum: 1.01
Range: 0.3
Interquartile range (IQR): 0.0475

Descriptive statistics

Standard deviation: 0.04879321
Coefficient of variation (CV): 0.058232513
Kurtosis: 2.4626824
Mean: 0.83790323
Median Absolute Deviation (MAD): 0.02
Skewness: 0.88036084
Sum: 51.95
Variance: 0.0023807774
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=20)]

Common values

Value | Count | Frequency (%)
0.83 | 12 | 19.4%
0.82 | 8 | 12.9%
0.85 | 6 | 9.7%
0.8 | 6 | 9.7%
0.81 | 5 | 8.1%
0.89 | 5 | 8.1%
0.88 | 3 | 4.8%
0.77 | 2 | 3.2%
0.79 | 2 | 3.2%
0.84 | 2 | 3.2%
Other values (10) | 11 | 17.7%

Minimum 10 values

Value | Count | Frequency (%)
0.71 | 1 | 1.6%
0.76 | 1 | 1.6%
0.77 | 2 | 3.2%
0.78 | 1 | 1.6%
0.79 | 2 | 3.2%
0.8 | 6 | 9.7%
0.81 | 5 | 8.1%
0.82 | 8 | 12.9%
0.83 | 12 | 19.4%
0.84 | 2 | 3.2%

Maximum 10 values

Value | Count | Frequency (%)
1.01 | 1 | 1.6%
0.97 | 1 | 1.6%
0.93 | 1 | 1.6%
0.92 | 1 | 1.6%
0.91 | 1 | 1.6%
0.89 | 5 | 8.1%
0.88 | 3 | 4.8%
0.87 | 2 | 3.2%
0.86 | 1 | 1.6%
0.85 | 6 | 9.7%
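
The ten smallest and ten largest values listed for 잔류염소 together cover all 20 distinct values, so the full distribution can be rebuilt from the counts and the reported statistics cross-checked. The sketch below is an illustration under two assumptions: that the standard deviation uses the sample (n - 1) denominator, and that quartiles use linear interpolation between order statistics. Neither convention is stated in the report text.

```python
from statistics import mean, quantiles, stdev

# (value, count) pairs transcribed from the extreme-value listings.
value_counts = [
    (0.71, 1), (0.76, 1), (0.77, 2), (0.78, 1), (0.79, 2),
    (0.80, 6), (0.81, 5), (0.82, 8), (0.83, 12), (0.84, 2),
    (0.85, 6), (0.86, 1), (0.87, 2), (0.88, 3), (0.89, 5),
    (0.91, 1), (0.92, 1), (0.93, 1), (0.97, 1), (1.01, 1),
]
data = sorted(v for v, n in value_counts for _ in range(n))

total = sum(data)
avg = mean(data)
sd = stdev(data)        # sample standard deviation (assumed convention)
cv = sd / avg           # coefficient of variation
# "inclusive" interpolates linearly between order statistics (assumed convention).
q1, med, q3 = quantiles(data, n=4, method="inclusive")
iqr = q3 - q1
value_range = max(data) - min(data)

print(len(data), round(total, 2), round(avg, 8), round(sd, 8))
print(round(q1, 4), round(med, 4), round(q3, 4), round(iqr, 4), round(value_range, 2))
```

Under these assumptions the recomputed count, sum, mean, standard deviation, quartiles, IQR, and range agree with the figures reported for this variable.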

측정일 (measurement date)
DateTime

Distinct: 31
Distinct (%): 50.0%
Missing: 0
Missing (%): 0.0%
Memory size: 628.0 B
Minimum: 2024-03-06 00:00:00
Maximum: 2024-04-05 00:00:00

[Figure: histogram with fixed size bins (bins=31)]
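
The distinct count of 31 matches the number of calendar days between the reported minimum and maximum dates, inclusive. A short check, assuming every day in the range appears once for each of the two plants (the report gives only the minimum, maximum, and distinct count, so this is an inference):

```python
from datetime import date, timedelta

start, end = date(2024, 3, 6), date(2024, 4, 5)

# Every calendar day from start to end, inclusive.
days = [start + timedelta(days=i) for i in range((end - start).days + 1)]

n_days = len(days)
n_plants = 2                       # 청계통합정수장 and 포일정수장
expected_rows = n_days * n_plants  # one row per plant per day (assumed)
distinct_pct = round(100 * n_days / expected_rows, 1)

print(n_days, expected_rows, distinct_pct)
```

This is consistent with the 62 observations and the 50.0% distinct ratio shown for this variable.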

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

Variable | 구분 | 수소 이온 농도(pH) | 탁도 | 잔류염소 | 측정일
구분 | 1.000 | 0.288 | 0.970 | 0.776 | 0.000
수소 이온 농도(pH) | 0.288 | 1.000 | 0.294 | 0.725 | 0.480
탁도 | 0.970 | 0.294 | 1.000 | 0.638 | 0.000
잔류염소 | 0.776 | 0.725 | 0.638 | 1.000 | 0.000
측정일 | 0.000 | 0.480 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 구분 | 수소 이온 농도(pH) | 탁도
구분 | 1.000 | 0.342 | 0.843
수소 이온 농도(pH) | 0.342 | 1.000 | 0.348
탁도 | 0.843 | 0.348 | 1.000

[Figure: correlation heatmap]

Variable | 잔류염소 | 구분 | 수소 이온 농도(pH) | 탁도
잔류염소 | 1.000 | 0.631 | 0.352 | 0.499
구분 | 0.631 | 1.000 | 0.342 | 0.843
수소 이온 농도(pH) | 0.352 | 0.342 | 1.000 | 0.348
탁도 | 0.499 | 0.843 | 0.348 | 1.000
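
The extracted text does not name the measure behind each matrix. The 3×3 matrix covers only categorical variables, and a bias-corrected Cramér's V with Yates continuity correction is one measure consistent with it. The sketch below illustrates that calculation for 구분 and 탁도. The cross-tabulation is an assumption inferred from the marginal counts (31/31 and 35/27) and the sample rows; it is not given in the report. `cramers_v_2x2` is a hypothetical helper written for this example.

```python
from math import sqrt


def cramers_v_2x2(table, yates=True):
    """Bias-corrected Cramér's V for a 2x2 contingency table."""
    (a, b), (c, d) = table
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        # Yates continuity correction for a 2x2 table.
        diff = max(0.0, diff - n / 2)
    chi2 = n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    phi2 = chi2 / n
    r = k = 2  # number of rows and columns
    # Bias correction for small samples.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


# Assumed cross-tabulation: rows are the two plants (구분),
# columns are 탁도 values 0.03 and 0.04.
table = [[31, 0], [4, 27]]
v = cramers_v_2x2(table)
print(round(v, 3))
```

Under these assumptions the result rounds to 0.843, the value shown for 구분 and 탁도 in the 3×3 matrix. With only two plants and two turbidity levels, the association mostly reflects that each plant reports a nearly fixed turbidity value.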

Missing values

[Figure: a simple visualization of nullity by column]
[Figure: nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion]

Sample

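First rows

Index | 구분 | (unnamed) | 냄새 | 색도 | 수소 이온 농도(pH) | 탁도 | 잔류염소 | 측정일
0 | 청계통합정수장 | 없음 | 없음 | 1이하 | 7.3 | 0.03 | 0.85 | 2024-03-06
1 | 청계통합정수장 | 없음 | 없음 | 1이하 | 7.4 | 0.03 | 0.88 | 2024-03-07
2 | 청계통합정수장 | 없음 | 없음 | 1이하 | 7.4 | 0.03 | 0.91 | 2024-03-08
3 | 청계통합정수장 | 없음 | 없음 | 1이하 | 7.4 | 0.03 | 0.97 | 2024-03-09
4 | 청계통합정수장 | 없음 | 없음 | 1이하 | 7.4 | 0.03 | 1.01 | 2024-03-10
5 | 청계통합정수장 | 없음 | 없음 | 1이하 | 7.4 | 0.03 | 0.93 | 2024-03-11
6 | 청계통합정수장 | 없음 | 없음 | 1이하 | 7.5 | 0.03 | 0.89 | 2024-03-12
7 | 청계통합정수장 | 없음 | 없음 | 1이하 | 7.5 | 0.03 | 0.85 | 2024-03-13
8 | 청계통합정수장 | 없음 | 없음 | 1이하 | 7.5 | 0.03 | 0.82 | 2024-03-14
9 | 청계통합정수장 | 없음 | 없음 | 1이하 | 7.5 | 0.03 | 0.87 | 2024-03-15

Last rows

Index | 구분 | (unnamed) | 냄새 | 색도 | 수소 이온 농도(pH) | 탁도 | 잔류염소 | 측정일
52 | 포일정수장 | 없음 | 없음 | 1이하 | 7.6 | 0.04 | 0.8 | 2024-03-27
53 | 포일정수장 | 없음 | 없음 | 1이하 | 7.5 | 0.04 | 0.83 | 2024-03-28
54 | 포일정수장 | 없음 | 없음 | 1이하 | 7.5 | 0.04 | 0.82 | 2024-03-29
55 | 포일정수장 | 없음 | 없음 | 1이하 | 7.4 | 0.04 | 0.82 | 2024-03-30
56 | 포일정수장 | 없음 | 없음 | 1이하 | 7.4 | 0.04 | 0.8 | 2024-03-31
57 | 포일정수장 | 없음 | 없음 | 1이하 | 7.4 | 0.04 | 0.82 | 2024-04-01
58 | 포일정수장 | 없음 | 없음 | 1이하 | 7.4 | 0.04 | 0.83 | 2024-04-02
59 | 포일정수장 | 없음 | 없음 | 1이하 | 7.4 | 0.04 | 0.8 | 2024-04-03
60 | 포일정수장 | 없음 | 없음 | 1이하 | 7.5 | 0.04 | 0.8 | 2024-04-04
61 | 포일정수장 | 없음 | 없음 | 1이하 | 7.4 | 0.04 | 0.82 | 2024-04-05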