Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows53
Duplicate rows (%)0.5%
Total size in memory478.5 KiB
Average record size in memory49.0 B

Variable types

Categorical4
DateTime1

Dataset

Description일산 호수공원 대장균 정보
Author고양시
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=WJ00ZHN900OJ6GNKFMSE26402625&infSeq=1

Alerts

보고형태 has constant value ""Constant
Dataset has 53 (0.5%) duplicate rowsDuplicates
디바이스명 is highly overall correlated with 설치장소명High correlation
설치장소명 is highly overall correlated with 디바이스명High correlation
대장균측정값(0:불검출,1:검출) is highly imbalanced (99.9%)Imbalance

Reproduction

Analysis started2023-12-10 21:19:12.232220
Analysis finished2023-12-10 21:19:12.773542
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

설치장소명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
계단바닦분수(우측)
6258 
계단바닦분수(좌측)
3742 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row계단바닦분수(우측)
2nd row계단바닦분수(우측)
3rd row계단바닦분수(우측)
4th row계단바닦분수(우측)
5th row계단바닦분수(우측)

Common Values

ValueCountFrequency (%)
계단바닦분수(우측) 6258
62.6%
계단바닦분수(좌측) 3742
37.4%

Length

2023-12-11T06:19:12.825561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:19:12.906104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
계단바닦분수(우측 6258
62.6%
계단바닦분수(좌측 3742
37.4%

디바이스명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
coliform2
6255 
coliform1
3745 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowcoliform2
2nd rowcoliform2
3rd rowcoliform2
4th rowcoliform2
5th rowcoliform2

Common Values

ValueCountFrequency (%)
coliform2 6255
62.5%
coliform1 3745
37.5%

Length

2023-12-11T06:19:12.996180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:19:13.090651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
coliform2 6255
62.5%
coliform1 3745
37.5%
Distinct9658
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2017-11-03 00:20:00
Maximum2020-01-08 23:28:22
2023-12-11T06:19:13.180399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:19:13.322261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

보고형태
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
report
10000 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowreport
2nd rowreport
3rd rowreport
4th rowreport
5th rowreport

Common Values

ValueCountFrequency (%)
report 10000
100.0%

Length

2023-12-11T06:19:13.475149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:19:13.558891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
report 10000
100.0%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9999 
24
 
1

Length

Max length2
Median length1
Mean length1.0001
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9999
> 99.9%
24 1
 
< 0.1%

Length

2023-12-11T06:19:13.678154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:19:13.777554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9999
> 99.9%
24 1
 
< 0.1%

Correlations

2023-12-11T06:19:13.842860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설치장소명디바이스명대장균측정값(0:불검출,1:검출)
설치장소명1.0001.0000.000
디바이스명1.0001.0000.000
대장균측정값(0:불검출,1:검출)0.0000.0001.000
2023-12-11T06:19:13.982746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
디바이스명설치장소명대장균측정값(0:불검출,1:검출)
디바이스명1.0000.9990.000
설치장소명0.9991.0000.000
대장균측정값(0:불검출,1:검출)0.0000.0001.000
2023-12-11T06:19:14.082965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설치장소명디바이스명대장균측정값(0:불검출,1:검출)
설치장소명1.0000.9990.000
디바이스명0.9991.0000.000
대장균측정값(0:불검출,1:검출)0.0000.0001.000

Missing values

2023-12-11T06:19:12.652436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:19:12.732726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

설치장소명디바이스명수집시간보고형태대장균측정값(0:불검출,1:검출)
28764계단바닦분수(우측)coliform22019-01-17 09:48:23report0
2018계단바닦분수(우측)coliform22019-12-27 04:13:22report0
3229계단바닦분수(우측)coliform22019-12-22 13:13:22report0
66796계단바닦분수(우측)coliform22018-10-03 10:18:22report0
66792계단바닦분수(우측)coliform22018-10-03 10:38:22report0
1308계단바닦분수(우측)coliform22019-12-31 19:48:22report0
31128계단바닦분수(좌측)coliform12019-01-13 07:08:23report0
22340계단바닦분수(좌측)coliform12019-10-20 06:38:22report0
51938계단바닦분수(우측)coliform22018-12-03 11:03:22report0
12391계단바닦분수(우측)coliform22019-11-19 01:03:22report0
설치장소명디바이스명수집시간보고형태대장균측정값(0:불검출,1:검출)
1341계단바닦분수(우측)coliform22019-12-31 17:03:22report0
23934계단바닦분수(우측)coliform22019-01-27 08:38:22report0
69541계단바닦분수(우측)coliform22018-09-23 21:08:22report0
20849계단바닦분수(좌측)coliform12019-10-25 11:13:22report0
69316계단바닦분수(우측)coliform22018-09-24 15:58:22report0
33805계단바닦분수(좌측)coliform12019-01-08 15:23:22report0
31722계단바닦분수(좌측)coliform12019-01-12 06:18:22report0
83429계단바닦분수(좌측)coliform12017-11-03 19:05:00report0
58009계단바닦분수(좌측)coliform12018-11-22 21:33:22report0
70645계단바닦분수(우측)coliform22018-09-19 15:23:22report0

Duplicate rows

Most frequently occurring

설치장소명디바이스명수집시간보고형태대장균측정값(0:불검출,1:검출)# duplicates
0계단바닦분수(우측)coliform12018-01-21 08:07:00report03
27계단바닦분수(우측)coliform22018-10-24 03:48:22report03
34계단바닦분수(우측)coliform22018-10-24 07:33:22report03
41계단바닦분수(우측)coliform22018-10-24 12:28:22report03
1계단바닦분수(우측)coliform22018-05-25 21:58:22report02
2계단바닦분수(우측)coliform22018-05-26 04:13:22report02
3계단바닦분수(우측)coliform22018-05-26 06:03:22report02
4계단바닦분수(우측)coliform22018-05-26 12:43:22report02
5계단바닦분수(우측)coliform22018-05-26 14:33:22report02
6계단바닦분수(우측)coliform22018-05-26 15:48:22report02