Overview

Dataset statistics

Number of variables9
Number of observations43
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.3 KiB
Average record size in memory78.1 B

Variable types

Categorical6
Text1
Numeric2

Dataset

Description인천광역시 지정 해수욕장 수질 조사 정보 데이터로, 2023년 기준 조사지점 및 측정항목에 따른 결과를 제공함. 해양수산부령에 의거해 해수욕장의 수질 조사 및 분석을 해당 해수욕장 개장 전 1회, 개장 중 월2회, 폐장 후 1회 실시함. 해수욕장의 환경관리에 관한 지침에 따라 시료 6개 중 4개 이상이 수질기준에 적합한 경우 해수욕장 수질로서 적절한 것으로 판단함. 아래 붙임문서는 개장 후 4차 ~ 폐장 후 결과를 제공함
Author인천광역시
URLhttps://www.data.go.kr/data/15119712/fileData.do

Alerts

기준 연도 has constant value ""Constant
측정 항목 has constant value ""Constant
측정 항목.1 has constant value ""Constant
적합 여부 has constant value ""Constant
측정값(MPN_100mL) is highly overall correlated with 관할구High correlation
측정값(MPN_100mL).1 is highly overall correlated with 관할구High correlation
관할구 is highly overall correlated with 측정값(MPN_100mL) and 1 other fieldsHigh correlation
대상 해변 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:22:29.029244
Analysis finished2023-12-12 12:22:30.298713
Duration1.27 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준 연도
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023
43 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 43
100.0%

Length

2023-12-12T21:22:30.372997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:22:30.489777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 43
100.0%

관할구
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size476.0 B
옹진군
34 
중구

Length

Max length3
Median length3
Mean length2.7906977
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row옹진군
2nd row옹진군
3rd row옹진군
4th row옹진군
5th row옹진군

Common Values

ValueCountFrequency (%)
옹진군 34
79.1%
중구 9
 
20.9%

Length

2023-12-12T21:22:30.605904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:22:30.730605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
옹진군 34
79.1%
중구 9
 
20.9%

대상 해변
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-12T21:22:30.906718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length7.4186047
Min length4

Characters and Unicode

Total characters319
Distinct characters32
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row십리포 해수욕장-1
2nd row십리포 해수욕장-2
3rd row십리포 해수욕장-3
4th row장경리 해수욕장-1
5th row장경리 해수욕장-2
ValueCountFrequency (%)
3 9
 
11.2%
1 9
 
11.2%
2 9
 
11.2%
서포리해수욕장 5
 
6.2%
옹암해수욕장 5
 
6.2%
하나개 3
 
3.8%
떼뿌루해수욕장 3
 
3.8%
이일레 3
 
3.8%
수기해수욕장 3
 
3.8%
장골 3
 
3.8%
Other values (15) 28
35.0%
2023-12-12T21:22:31.251164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
37
11.6%
37
11.6%
31
 
9.7%
28
 
8.8%
28
 
8.8%
20
 
6.3%
1 13
 
4.1%
3 13
 
4.1%
2 13
 
4.1%
- 12
 
3.8%
Other values (22) 87
27.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 227
71.2%
Decimal Number 43
 
13.5%
Space Separator 37
 
11.6%
Dash Punctuation 12
 
3.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
16.3%
31
13.7%
28
12.3%
28
12.3%
20
8.8%
11
 
4.8%
6
 
2.6%
6
 
2.6%
6
 
2.6%
5
 
2.2%
Other values (15) 49
21.6%
Decimal Number
ValueCountFrequency (%)
1 13
30.2%
3 13
30.2%
2 13
30.2%
4 2
 
4.7%
5 2
 
4.7%
Space Separator
ValueCountFrequency (%)
37
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 227
71.2%
Common 92
28.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
16.3%
31
13.7%
28
12.3%
28
12.3%
20
8.8%
11
 
4.8%
6
 
2.6%
6
 
2.6%
6
 
2.6%
5
 
2.2%
Other values (15) 49
21.6%
Common
ValueCountFrequency (%)
37
40.2%
1 13
 
14.1%
3 13
 
14.1%
2 13
 
14.1%
- 12
 
13.0%
4 2
 
2.2%
5 2
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 227
71.2%
ASCII 92
28.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
37
40.2%
1 13
 
14.1%
3 13
 
14.1%
2 13
 
14.1%
- 12
 
13.0%
4 2
 
2.2%
5 2
 
2.2%
Hangul
ValueCountFrequency (%)
37
16.3%
31
13.7%
28
12.3%
28
12.3%
20
8.8%
11
 
4.8%
6
 
2.6%
6
 
2.6%
6
 
2.6%
5
 
2.2%
Other values (15) 49
21.6%

측정 구분
Categorical

Distinct2
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size476.0 B
폐장 후
37 
개장 중 4차

Length

Max length7
Median length4
Mean length4.4186047
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개장 중 4차
2nd row개장 중 4차
3rd row개장 중 4차
4th row개장 중 4차
5th row개장 중 4차

Common Values

ValueCountFrequency (%)
폐장 후 37
86.0%
개장 중 4차 6
 
14.0%

Length

2023-12-12T21:22:31.437128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:22:31.598886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐장 37
40.2%
37
40.2%
개장 6
 
6.5%
6
 
6.5%
4차 6
 
6.5%

측정 항목
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
대장균
43 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대장균
2nd row대장균
3rd row대장균
4th row대장균
5th row대장균

Common Values

ValueCountFrequency (%)
대장균 43
100.0%

Length

2023-12-12T21:22:31.720949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:22:31.832889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대장균 43
100.0%

측정값(MPN_100mL)
Real number (ℝ)

HIGH CORRELATION 

Distinct17
Distinct (%)39.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean415.11628
Minimum240
Maximum490
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-12T21:22:31.962177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum240
5-th percentile284
Q1385
median430
Q3460
95-th percentile480
Maximum490
Range250
Interquartile range (IQR)75

Descriptive statistics

Standard deviation62.65446
Coefficient of variation (CV)0.15093231
Kurtosis0.79138146
Mean415.11628
Median Absolute Deviation (MAD)40
Skewness-1.1659696
Sum17850
Variance3925.5814
MonotonicityNot monotonic
2023-12-12T21:22:32.111139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
430 5
11.6%
480 5
11.6%
460 5
11.6%
440 4
9.3%
390 4
9.3%
470 3
 
7.0%
450 3
 
7.0%
420 2
 
4.7%
330 2
 
4.7%
360 2
 
4.7%
Other values (7) 8
18.6%
ValueCountFrequency (%)
240 1
 
2.3%
260 1
 
2.3%
280 1
 
2.3%
320 1
 
2.3%
330 2
4.7%
350 1
 
2.3%
360 2
4.7%
380 2
4.7%
390 4
9.3%
420 2
4.7%
ValueCountFrequency (%)
490 1
 
2.3%
480 5
11.6%
470 3
7.0%
460 5
11.6%
450 3
7.0%
440 4
9.3%
430 5
11.6%
420 2
 
4.7%
390 4
9.3%
380 2
 
4.7%

측정 항목.1
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
장구균
43 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row장구균
2nd row장구균
3rd row장구균
4th row장구균
5th row장구균

Common Values

ValueCountFrequency (%)
장구균 43
100.0%

Length

2023-12-12T21:22:32.245589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:22:32.398328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
장구균 43
100.0%

측정값(MPN_100mL).1
Real number (ℝ)

HIGH CORRELATION 

Distinct29
Distinct (%)67.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72.953488
Minimum30
Maximum96
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-12T21:22:32.519417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum30
5-th percentile34.6
Q168.5
median76
Q386
95-th percentile93.8
Maximum96
Range66
Interquartile range (IQR)17.5

Descriptive statistics

Standard deviation18.046176
Coefficient of variation (CV)0.24736549
Kurtosis0.18980382
Mean72.953488
Median Absolute Deviation (MAD)10
Skewness-1.0577608
Sum3137
Variance325.66445
MonotonicityNot monotonic
2023-12-12T21:22:32.659401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
86 3
 
7.0%
90 3
 
7.0%
74 3
 
7.0%
54 2
 
4.7%
81 2
 
4.7%
87 2
 
4.7%
73 2
 
4.7%
82 2
 
4.7%
43 2
 
4.7%
83 2
 
4.7%
Other values (19) 20
46.5%
ValueCountFrequency (%)
30 1
2.3%
32 1
2.3%
34 1
2.3%
40 1
2.3%
43 2
4.7%
50 1
2.3%
54 2
4.7%
65 1
2.3%
68 1
2.3%
69 1
2.3%
ValueCountFrequency (%)
96 1
 
2.3%
95 1
 
2.3%
94 1
 
2.3%
92 1
 
2.3%
90 3
7.0%
87 2
4.7%
86 3
7.0%
85 1
 
2.3%
84 1
 
2.3%
83 2
4.7%

적합 여부
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
적합
43 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row적합
2nd row적합
3rd row적합
4th row적합
5th row적합

Common Values

ValueCountFrequency (%)
적합 43
100.0%

Length

2023-12-12T21:22:32.822429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:22:32.953771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
적합 43
100.0%

Interactions

2023-12-12T21:22:29.794649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:22:29.565120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:22:29.893702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:22:29.675155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:22:33.043108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관할구대상 해변측정 구분측정값(MPN_100mL)측정값(MPN_100mL).1
관할구1.0001.0000.0000.8680.813
대상 해변1.0001.0001.0001.0001.000
측정 구분0.0001.0001.0000.0000.650
측정값(MPN_100mL)0.8681.0000.0001.0000.830
측정값(MPN_100mL).10.8131.0000.6500.8301.000
2023-12-12T21:22:33.173223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관할구측정 구분
관할구1.0000.000
측정 구분0.0001.000
2023-12-12T21:22:33.294526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정값(MPN_100mL)측정값(MPN_100mL).1관할구측정 구분
측정값(MPN_100mL)1.0000.2710.6680.000
측정값(MPN_100mL).10.2711.0000.5830.453
관할구0.6680.5831.0000.000
측정 구분0.0000.4530.0001.000

Missing values

2023-12-12T21:22:30.041238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:22:30.227420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준 연도관할구대상 해변측정 구분측정 항목측정값(MPN_100mL)측정 항목.1측정값(MPN_100mL).1적합 여부
02023옹진군십리포 해수욕장-1개장 중 4차대장균470장구균92적합
12023옹진군십리포 해수욕장-2개장 중 4차대장균450장구균90적합
22023옹진군십리포 해수욕장-3개장 중 4차대장균430장구균90적합
32023옹진군장경리 해수욕장-1개장 중 4차대장균450장구균86적합
42023옹진군장경리 해수욕장-2개장 중 4차대장균430장구균86적합
52023옹진군장경리 해수욕장-3개장 중 4차대장균440장구균90적합
62023중구을왕리 1폐장 후대장균330장구균72적합
72023중구을왕리 2폐장 후대장균360장구균68적합
82023중구을왕리 3폐장 후대장균330장구균65적합
92023중구왕산 1폐장 후대장균380장구균73적합
기준 연도관할구대상 해변측정 구분측정 항목측정값(MPN_100mL)측정 항목.1측정값(MPN_100mL).1적합 여부
332023옹진군장골 2폐장 후대장균480장구균43적합
342023옹진군장골 3폐장 후대장균480장구균34적합
352023옹진군떼뿌루해수욕장 1폐장 후대장균420장구균81적합
362023옹진군떼뿌루해수욕장 2폐장 후대장균440장구균82적합
372023옹진군떼뿌루해수욕장 3폐장 후대장균440장구균82적합
382023옹진군서포리해수욕장 1폐장 후대장균390장구균69적합
392023옹진군서포리해수욕장 2폐장 후대장균430장구균71적합
402023옹진군서포리해수욕장 3폐장 후대장균440장구균74적합
412023옹진군서포리해수욕장 4폐장 후대장균430장구균75적합
422023옹진군서포리해수욕장 5폐장 후대장균430장구균76적합