Overview

Dataset statistics

Number of variables14
Number of observations9946
Missing cells269
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 MiB
Average record size in memory119.0 B

Variable types

Numeric7
Text3
Categorical3
DateTime1

Dataset

Description한국농어촌공사가 관리하는 농업용수로 사용되는 저수지의 주소,담당관리기관,조사일자, 수질정보 등
Author한국농어촌공사
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20191014000000001275

Alerts

조사구분 has constant value ""Constant
관리구분 has constant value ""Constant
COD is highly overall correlated with TOC and 1 other fieldsHigh correlation
TOC is highly overall correlated with CODHigh correlation
T-P is highly overall correlated with CODHigh correlation
시설구분 is highly imbalanced (97.6%)Imbalance
SS has 106 (1.1%) missing valuesMissing
SS is highly skewed (γ1 = 24.50815622)Skewed

Reproduction

Analysis started2023-12-11 03:14:48.122118
Analysis finished2023-12-11 03:14:56.861152
Duration8.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

표준코드
Real number (ℝ)

Distinct2521
Distinct (%)25.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.5789264 × 109
Minimum2.6710101 × 109
Maximum5.01301 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size87.5 KiB
2023-12-11T12:14:56.956193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.6710101 × 109
5-th percentile3.1710103 × 109
Q14.5720101 × 109
median4.6830101 × 109
Q34.7750101 × 109
95-th percentile4.8860102 × 109
Maximum5.01301 × 109
Range2.3419999 × 109
Interquartile range (IQR)2.0299997 × 108

Descriptive statistics

Standard deviation4.0161523 × 108
Coefficient of variation (CV)0.087709475
Kurtosis9.977121
Mean4.5789264 × 109
Median Absolute Deviation (MAD)1.0200004 × 108
Skewness-3.1594586
Sum4.5542002 × 1013
Variance1.6129479 × 1017
MonotonicityIncreasing
2023-12-11T12:14:57.117850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2671010085 4
 
< 0.1%
4717010086 4
 
< 0.1%
4715010041 4
 
< 0.1%
4715010143 4
 
< 0.1%
4715010177 4
 
< 0.1%
4717010016 4
 
< 0.1%
4717010017 4
 
< 0.1%
4717010062 4
 
< 0.1%
4717010063 4
 
< 0.1%
4717010087 4
 
< 0.1%
Other values (2511) 9906
99.6%
ValueCountFrequency (%)
2671010085 4
< 0.1%
2671010098 4
< 0.1%
2714010023 4
< 0.1%
2723010006 4
< 0.1%
2723010007 4
< 0.1%
2723010012 4
< 0.1%
2726010001 4
< 0.1%
2726010006 4
< 0.1%
2726010008 4
< 0.1%
2726010019 4
< 0.1%
ValueCountFrequency (%)
5013010001 4
< 0.1%
4971010003 4
< 0.1%
4971010002 4
< 0.1%
4971010001 4
< 0.1%
4889010614 4
< 0.1%
4889010613 4
< 0.1%
4889010610 4
< 0.1%
4889010609 4
< 0.1%
4889010608 4
< 0.1%
4889010607 4
< 0.1%
Distinct1973
Distinct (%)19.8%
Missing0
Missing (%)0.0%
Memory size77.8 KiB
2023-12-11T12:14:57.577727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length2
Mean length2.2273276
Min length1

Characters and Unicode

Total characters22153
Distinct characters368
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)0.1%

Sample

1st row송정
2nd row송정
3rd row송정
4th row송정
5th row임기
ValueCountFrequency (%)
연화 36
 
0.4%
대곡 36
 
0.4%
학동 36
 
0.4%
신기 32
 
0.3%
방축 32
 
0.3%
연동 28
 
0.3%
백운 28
 
0.3%
신흥 26
 
0.3%
신풍 24
 
0.2%
가곡 24
 
0.2%
Other values (1963) 9644
97.0%
2023-12-11T12:14:58.265375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
862
 
3.9%
755
 
3.4%
713
 
3.2%
510
 
2.3%
431
 
1.9%
408
 
1.8%
377
 
1.7%
330
 
1.5%
316
 
1.4%
316
 
1.4%
Other values (358) 17135
77.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21220
95.8%
Decimal Number 619
 
2.8%
Open Punctuation 155
 
0.7%
Close Punctuation 155
 
0.7%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
862
 
4.1%
755
 
3.6%
713
 
3.4%
510
 
2.4%
431
 
2.0%
408
 
1.9%
377
 
1.8%
330
 
1.6%
316
 
1.5%
316
 
1.5%
Other values (349) 16202
76.4%
Decimal Number
ValueCountFrequency (%)
1 292
47.2%
2 271
43.8%
3 36
 
5.8%
4 12
 
1.9%
5 4
 
0.6%
6 4
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 155
100.0%
Close Punctuation
ValueCountFrequency (%)
) 155
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21220
95.8%
Common 933
 
4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
862
 
4.1%
755
 
3.6%
713
 
3.4%
510
 
2.4%
431
 
2.0%
408
 
1.9%
377
 
1.8%
330
 
1.6%
316
 
1.5%
316
 
1.5%
Other values (349) 16202
76.4%
Common
ValueCountFrequency (%)
1 292
31.3%
2 271
29.0%
( 155
16.6%
) 155
16.6%
3 36
 
3.9%
4 12
 
1.3%
- 4
 
0.4%
5 4
 
0.4%
6 4
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21220
95.8%
ASCII 933
 
4.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
862
 
4.1%
755
 
3.6%
713
 
3.4%
510
 
2.4%
431
 
2.0%
408
 
1.9%
377
 
1.8%
330
 
1.6%
316
 
1.5%
316
 
1.5%
Other values (349) 16202
76.4%
ASCII
ValueCountFrequency (%)
1 292
31.3%
2 271
29.0%
( 155
16.6%
) 155
16.6%
3 36
 
3.9%
4 12
 
1.3%
- 4
 
0.4%
5 4
 
0.4%
6 4
 
0.4%

조사구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size77.8 KiB
공사조사
9946 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공사조사
2nd row공사조사
3rd row공사조사
4th row공사조사
5th row공사조사

Common Values

ValueCountFrequency (%)
공사조사 9946
100.0%

Length

2023-12-11T12:14:58.793477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:14:58.907304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사조사 9946
100.0%

주소
Text

Distinct1787
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Memory size77.8 KiB
2023-12-11T12:14:59.269421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length16
Mean length15.853408
Min length11

Characters and Unicode

Total characters157678
Distinct characters310
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row부산광역시 기장군 철마면 송정리
2nd row부산광역시 기장군 철마면 송정리
3rd row부산광역시 기장군 철마면 송정리
4th row부산광역시 기장군 철마면 송정리
5th row부산광역시 기장군 철마면 임기리
ValueCountFrequency (%)
전라남도 3243
 
8.2%
경상북도 1963
 
5.0%
경상남도 1759
 
4.5%
전라북도 1257
 
3.2%
나주시 632
 
1.6%
영암군 514
 
1.3%
충청남도 478
 
1.2%
충청북도 428
 
1.1%
영천시 328
 
0.8%
합천군 288
 
0.7%
Other values (2189) 28627
72.4%
2023-12-11T12:14:59.896520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29571
18.8%
10009
 
6.3%
9426
 
6.0%
8043
 
5.1%
6836
 
4.3%
6407
 
4.1%
4880
 
3.1%
4544
 
2.9%
4343
 
2.8%
4280
 
2.7%
Other values (300) 69339
44.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 128103
81.2%
Space Separator 29571
 
18.8%
Decimal Number 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10009
 
7.8%
9426
 
7.4%
8043
 
6.3%
6836
 
5.3%
6407
 
5.0%
4880
 
3.8%
4544
 
3.5%
4343
 
3.4%
4280
 
3.3%
4237
 
3.3%
Other values (298) 65098
50.8%
Space Separator
ValueCountFrequency (%)
29571
100.0%
Decimal Number
ValueCountFrequency (%)
1 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 128103
81.2%
Common 29575
 
18.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10009
 
7.8%
9426
 
7.4%
8043
 
6.3%
6836
 
5.3%
6407
 
5.0%
4880
 
3.8%
4544
 
3.5%
4343
 
3.4%
4280
 
3.3%
4237
 
3.3%
Other values (298) 65098
50.8%
Common
ValueCountFrequency (%)
29571
> 99.9%
1 4
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 128103
81.2%
ASCII 29575
 
18.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
29571
> 99.9%
1 4
 
< 0.1%
Hangul
ValueCountFrequency (%)
10009
 
7.8%
9426
 
7.4%
8043
 
6.3%
6836
 
5.3%
6407
 
5.0%
4880
 
3.8%
4544
 
3.5%
4343
 
3.4%
4280
 
3.3%
4237
 
3.3%
Other values (298) 65098
50.8%

시설구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size77.8 KiB
저수지
9922 
담수호
 
24

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row저수지
2nd row저수지
3rd row저수지
4th row저수지
5th row저수지

Common Values

ValueCountFrequency (%)
저수지 9922
99.8%
담수호 24
 
0.2%

Length

2023-12-11T12:15:00.092282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:15:00.226116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
저수지 9922
99.8%
담수호 24
 
0.2%

관리구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size77.8 KiB
공사
9946 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공사
2nd row공사
3rd row공사
4th row공사
5th row공사

Common Values

ValueCountFrequency (%)
공사 9946
100.0%

Length

2023-12-11T12:15:00.389598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:15:00.536973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사 9946
100.0%
Distinct94
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size77.8 KiB
2023-12-11T12:15:00.875484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length4
Mean length4.8128896
Min length4

Characters and Unicode

Total characters47869
Distinct characters100
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row울산지사
2nd row울산지사
3rd row울산지사
4th row울산지사
5th row울산지사
ValueCountFrequency (%)
나주지사 624
 
6.3%
영암지사 514
 
5.2%
영천지사 328
 
3.3%
무안신안지사 291
 
2.9%
합천지사 288
 
2.9%
해남완도지사 284
 
2.9%
전주완주임실지사 281
 
2.8%
울산지사 264
 
2.7%
남원지사 264
 
2.7%
의성군위지사 240
 
2.4%
Other values (84) 6568
66.0%
2023-12-11T12:15:01.430896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10166
21.2%
9926
20.7%
2083
 
4.4%
1617
 
3.4%
1569
 
3.3%
1104
 
2.3%
1021
 
2.1%
934
 
2.0%
880
 
1.8%
757
 
1.6%
Other values (90) 17812
37.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47869
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10166
21.2%
9926
20.7%
2083
 
4.4%
1617
 
3.4%
1569
 
3.3%
1104
 
2.3%
1021
 
2.1%
934
 
2.0%
880
 
1.8%
757
 
1.6%
Other values (90) 17812
37.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47869
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10166
21.2%
9926
20.7%
2083
 
4.4%
1617
 
3.4%
1569
 
3.3%
1104
 
2.3%
1021
 
2.1%
934
 
2.0%
880
 
1.8%
757
 
1.6%
Other values (90) 17812
37.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47869
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10166
21.2%
9926
20.7%
2083
 
4.4%
1617
 
3.4%
1569
 
3.3%
1104
 
2.3%
1021
 
2.1%
934
 
2.0%
880
 
1.8%
757
 
1.6%
Other values (90) 17812
37.2%
Distinct149
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size77.8 KiB
Minimum2017-02-28 00:00:00
Maximum2017-12-13 00:00:00
2023-12-11T12:15:01.628489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:15:01.813227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

pH
Real number (ℝ)

Distinct50
Distinct (%)0.5%
Missing18
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean7.4161261
Minimum5.6
Maximum10.8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size87.5 KiB
2023-12-11T12:15:02.008214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5.6
5-th percentile6.5
Q17
median7.3
Q37.7
95-th percentile8.7
Maximum10.8
Range5.2
Interquartile range (IQR)0.7

Descriptive statistics

Standard deviation0.65346156
Coefficient of variation (CV)0.088113599
Kurtosis1.4671749
Mean7.4161261
Median Absolute Deviation (MAD)0.4
Skewness0.97994696
Sum73627.3
Variance0.42701201
MonotonicityNot monotonic
2023-12-11T12:15:02.232617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7.0 738
 
7.4%
7.2 726
 
7.3%
7.1 707
 
7.1%
7.3 685
 
6.9%
7.4 658
 
6.6%
7.5 600
 
6.0%
6.9 596
 
6.0%
7.6 553
 
5.6%
6.8 552
 
5.5%
7.7 462
 
4.6%
Other values (40) 3651
36.7%
ValueCountFrequency (%)
5.6 1
 
< 0.1%
5.7 1
 
< 0.1%
5.8 1
 
< 0.1%
5.9 1
 
< 0.1%
6.0 14
 
0.1%
6.1 26
 
0.3%
6.2 48
 
0.5%
6.3 92
0.9%
6.4 137
1.4%
6.5 196
2.0%
ValueCountFrequency (%)
10.8 1
 
< 0.1%
10.6 1
 
< 0.1%
10.3 3
 
< 0.1%
10.2 9
 
0.1%
10.1 6
 
0.1%
10.0 4
 
< 0.1%
9.9 10
0.1%
9.8 15
0.2%
9.7 10
0.1%
9.6 23
0.2%

COD
Real number (ℝ)

HIGH CORRELATION 

Distinct264
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.3560828
Minimum0
Maximum118.5
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size87.5 KiB
2023-12-11T12:15:02.396039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2.6
Q14.7
median6.6
Q39
95-th percentile14.8
Maximum118.5
Range118.5
Interquartile range (IQR)4.3

Descriptive statistics

Standard deviation4.5006058
Coefficient of variation (CV)0.61182098
Kurtosis87.795442
Mean7.3560828
Median Absolute Deviation (MAD)2.1
Skewness5.5761455
Sum73163.6
Variance20.255453
MonotonicityNot monotonic
2023-12-11T12:15:02.558003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5.4 201
 
2.0%
4.8 192
 
1.9%
6.2 191
 
1.9%
4.4 188
 
1.9%
5.2 184
 
1.8%
6.0 180
 
1.8%
5.0 180
 
1.8%
7.0 176
 
1.8%
5.6 176
 
1.8%
6.8 175
 
1.8%
Other values (254) 8103
81.5%
ValueCountFrequency (%)
0.0 1
 
< 0.1%
0.5 1
 
< 0.1%
0.6 5
 
0.1%
0.8 6
 
0.1%
0.9 1
 
< 0.1%
1.0 11
0.1%
1.1 3
 
< 0.1%
1.2 25
0.3%
1.3 10
 
0.1%
1.4 25
0.3%
ValueCountFrequency (%)
118.5 1
< 0.1%
106.5 1
< 0.1%
77.8 1
< 0.1%
77.4 1
< 0.1%
74.9 1
< 0.1%
62.1 1
< 0.1%
51.1 1
< 0.1%
50.0 1
< 0.1%
48.1 1
< 0.1%
47.5 1
< 0.1%

TOC
Real number (ℝ)

HIGH CORRELATION 

Distinct167
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.2859743
Minimum0.2
Maximum74.7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size87.5 KiB
2023-12-11T12:15:02.719220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.2
5-th percentile1.4
Q12.6
median3.9
Q35.4
95-th percentile8.4
Maximum74.7
Range74.5
Interquartile range (IQR)2.8

Descriptive statistics

Standard deviation2.4418015
Coefficient of variation (CV)0.56971912
Kurtosis76.390101
Mean4.2859743
Median Absolute Deviation (MAD)1.4
Skewness3.9515183
Sum42628.3
Variance5.9623945
MonotonicityNot monotonic
2023-12-11T12:15:02.869605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2.4 224
 
2.3%
2.2 220
 
2.2%
3.3 220
 
2.2%
3.8 212
 
2.1%
4.2 205
 
2.1%
2.6 204
 
2.1%
2.7 204
 
2.1%
3.2 201
 
2.0%
3.4 200
 
2.0%
3.6 197
 
2.0%
Other values (157) 7859
79.0%
ValueCountFrequency (%)
0.2 1
 
< 0.1%
0.3 3
 
< 0.1%
0.4 7
 
0.1%
0.5 16
 
0.2%
0.6 15
 
0.2%
0.7 16
 
0.2%
0.8 24
 
0.2%
0.9 48
0.5%
1.0 61
0.6%
1.1 71
0.7%
ValueCountFrequency (%)
74.7 1
 
< 0.1%
35.9 1
 
< 0.1%
30.3 1
 
< 0.1%
23.5 1
 
< 0.1%
20.7 1
 
< 0.1%
20.5 1
 
< 0.1%
20.4 1
 
< 0.1%
19.5 1
 
< 0.1%
19.2 1
 
< 0.1%
18.5 3
< 0.1%

T-N
Real number (ℝ)

Distinct2934
Distinct (%)29.7%
Missing73
Missing (%)0.7%
Infinite0
Infinite (%)0.0%
Mean1.2990533
Minimum0.002
Maximum22.277
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size87.5 KiB
2023-12-11T12:15:03.055499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.002
5-th percentile0.3416
Q10.643
median0.954
Q31.557
95-th percentile3.2334
Maximum22.277
Range22.275
Interquartile range (IQR)0.914

Descriptive statistics

Standard deviation1.2188044
Coefficient of variation (CV)0.9382251
Kurtosis55.050891
Mean1.2990533
Median Absolute Deviation (MAD)0.39
Skewness5.3180171
Sum12825.553
Variance1.4854841
MonotonicityNot monotonic
2023-12-11T12:15:03.251986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.803 23
 
0.2%
0.625 23
 
0.2%
0.662 21
 
0.2%
0.985 21
 
0.2%
0.704 20
 
0.2%
0.854 19
 
0.2%
0.907 19
 
0.2%
0.672 19
 
0.2%
0.719 18
 
0.2%
0.657 18
 
0.2%
Other values (2924) 9672
97.2%
(Missing) 73
 
0.7%
ValueCountFrequency (%)
0.002 1
< 0.1%
0.004 1
< 0.1%
0.02 1
< 0.1%
0.026 1
< 0.1%
0.038 1
< 0.1%
0.042 1
< 0.1%
0.054 1
< 0.1%
0.056 1
< 0.1%
0.061 2
< 0.1%
0.062 1
< 0.1%
ValueCountFrequency (%)
22.277 1
< 0.1%
20.641 1
< 0.1%
20.155 1
< 0.1%
20.017 1
< 0.1%
19.547 1
< 0.1%
18.679 1
< 0.1%
16.762 1
< 0.1%
16.201 1
< 0.1%
14.526 1
< 0.1%
14.3 1
< 0.1%

T-P
Real number (ℝ)

HIGH CORRELATION 

Distinct505
Distinct (%)5.1%
Missing72
Missing (%)0.7%
Infinite0
Infinite (%)0.0%
Mean0.070552765
Minimum0
Maximum2.1
Zeros22
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size87.5 KiB
2023-12-11T12:15:03.401105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.009
Q10.021
median0.039
Q30.079
95-th percentile0.21335
Maximum2.1
Range2.1
Interquartile range (IQR)0.058

Descriptive statistics

Standard deviation0.11357988
Coefficient of variation (CV)1.6098573
Kurtosis85.690665
Mean0.070552765
Median Absolute Deviation (MAD)0.023
Skewness7.2863463
Sum696.638
Variance0.01290039
MonotonicityNot monotonic
2023-12-11T12:15:03.552459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.021 197
 
2.0%
0.018 191
 
1.9%
0.016 177
 
1.8%
0.012 174
 
1.7%
0.019 174
 
1.7%
0.02 173
 
1.7%
0.023 173
 
1.7%
0.022 169
 
1.7%
0.014 160
 
1.6%
0.015 155
 
1.6%
Other values (495) 8131
81.8%
ValueCountFrequency (%)
0.0 22
 
0.2%
0.001 14
 
0.1%
0.002 12
 
0.1%
0.003 34
 
0.3%
0.004 42
 
0.4%
0.005 65
0.7%
0.006 71
0.7%
0.007 101
1.0%
0.008 105
1.1%
0.009 130
1.3%
ValueCountFrequency (%)
2.1 1
< 0.1%
2.081 1
< 0.1%
2.013 1
< 0.1%
1.975 1
< 0.1%
1.911 1
< 0.1%
1.82 1
< 0.1%
1.805 1
< 0.1%
1.788 1
< 0.1%
1.709 1
< 0.1%
1.536 1
< 0.1%

SS
Real number (ℝ)

MISSING  SKEWED 

Distinct473
Distinct (%)4.8%
Missing106
Missing (%)1.1%
Infinite0
Infinite (%)0.0%
Mean8.961311
Minimum0
Maximum988
Zeros3
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size87.5 KiB
2023-12-11T12:15:03.736172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q13.3
median5.8
Q39.2
95-th percentile27
Maximum988
Range988
Interquartile range (IQR)5.9

Descriptive statistics

Standard deviation16.999515
Coefficient of variation (CV)1.8969897
Kurtosis1187.1427
Mean8.961311
Median Absolute Deviation (MAD)2.8
Skewness24.508156
Sum88179.3
Variance288.98351
MonotonicityNot monotonic
2023-12-11T12:15:03.911950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.0 310
 
3.1%
2.0 292
 
2.9%
3.0 259
 
2.6%
6.0 254
 
2.6%
1.0 218
 
2.2%
5.0 206
 
2.1%
8.0 198
 
2.0%
5.2 184
 
1.8%
2.5 167
 
1.7%
7.0 154
 
1.5%
Other values (463) 7598
76.4%
ValueCountFrequency (%)
0.0 3
 
< 0.1%
0.1 4
 
< 0.1%
0.2 15
 
0.2%
0.3 16
 
0.2%
0.4 39
0.4%
0.5 67
0.7%
0.6 42
0.4%
0.7 22
 
0.2%
0.8 62
0.6%
0.9 21
 
0.2%
ValueCountFrequency (%)
988.0 1
< 0.1%
409.0 1
< 0.1%
333.0 1
< 0.1%
246.4 1
< 0.1%
223.0 1
< 0.1%
206.0 1
< 0.1%
202.0 1
< 0.1%
201.0 1
< 0.1%
195.9 1
< 0.1%
186.0 1
< 0.1%

Interactions

2023-12-11T12:14:55.469144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:50.635562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:51.498540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:52.325449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:53.175028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:54.073073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:54.830656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:55.572522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:50.747336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:51.612207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:52.463158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:53.282301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:54.181860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:54.923682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:55.687213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:50.857955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:51.721686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:52.596800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:53.403771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:54.286057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:55.009065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:55.842060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:50.977229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:51.868497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:52.738528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:53.551582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:54.408457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:55.105519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:55.943875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:51.089996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:52.000896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:52.856278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:53.717946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:54.534734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:55.196342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:56.048278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:51.196422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:52.117518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:52.968719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:53.851060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:54.636239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:55.285038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:56.173559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:51.352316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:52.226121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:53.077195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:53.949563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:54.736625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:14:55.373632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T12:15:04.045875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
표준코드시설구분관리기관pHCODTOCT-NT-PSS
표준코드1.0000.0640.9950.1400.0430.0350.0790.0950.038
시설구분0.0641.0000.7090.0000.0390.0580.0650.0090.000
관리기관0.9950.7091.0000.4830.2490.2830.2810.2730.059
pH0.1400.0000.4831.0000.1550.1080.0550.0950.083
COD0.0430.0390.2490.1551.0000.8460.3920.6980.464
TOC0.0350.0580.2830.1080.8461.0000.5990.2870.180
T-N0.0790.0650.2810.0550.3920.5991.0000.4090.000
T-P0.0950.0090.2730.0950.6980.2870.4091.0000.125
SS0.0380.0000.0590.0830.4640.1800.0000.1251.000
2023-12-11T12:15:04.177741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
표준코드pHCODTOCT-NT-PSS시설구분
표준코드1.0000.1030.0650.070-0.1430.0150.0650.068
pH0.1031.000-0.046-0.020-0.031-0.145-0.0360.000
COD0.065-0.0461.0000.9180.0510.5240.4470.038
TOC0.070-0.0200.9181.0000.0330.4810.3860.042
T-N-0.143-0.0310.0510.0331.0000.2020.0860.050
T-P0.015-0.1450.5240.4810.2021.0000.4230.009
SS0.065-0.0360.4470.3860.0860.4231.0000.000
시설구분0.0680.0000.0380.0420.0500.0090.0001.000

Missing values

2023-12-11T12:14:56.354243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:14:56.620280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T12:14:56.774998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

표준코드시설명조사구분주소시설구분관리구분관리기관조사일자pHCODTOCT-NT-PSS
02671010085송정공사조사부산광역시 기장군 철마면 송정리저수지공사울산지사2017-04-197.22.01.40.8480.0110.7
12671010085송정공사조사부산광역시 기장군 철마면 송정리저수지공사울산지사2017-06-137.62.81.70.4660.013.0
22671010085송정공사조사부산광역시 기장군 철마면 송정리저수지공사울산지사2017-09-196.63.62.30.2840.0055.8
32671010085송정공사조사부산광역시 기장군 철마면 송정리저수지공사울산지사2017-11-238.64.22.40.5210.0195.5
42671010098임기공사조사부산광역시 기장군 철마면 임기리저수지공사울산지사2017-04-197.41.20.60.8920.0064.1
52671010098임기공사조사부산광역시 기장군 철마면 임기리저수지공사울산지사2017-06-137.11.60.80.8260.0141.9
62671010098임기공사조사부산광역시 기장군 철마면 임기리저수지공사울산지사2017-09-196.33.02.20.4960.0039.2
72671010098임기공사조사부산광역시 기장군 철마면 임기리저수지공사울산지사2017-11-238.52.21.30.3820.0132.3
82714010023공사조사대구광역시 동구 각산동저수지공사경산청도지사2017-04-048.37.73.30.9220.0215.2
92714010023공사조사대구광역시 동구 각산동저수지공사경산청도지사2017-06-138.211.48.20.5090.0299.2
표준코드시설명조사구분주소시설구분관리구분관리기관조사일자pHCODTOCT-NT-PSS
99364971010002귀엄공사조사제주특별자치도 제주시 애월읍 수산리저수지공사제주본부2017-08-287.68.65.10.5090.01614.0
99374971010002귀엄공사조사제주특별자치도 제주시 애월읍 수산리저수지공사제주본부2017-03-216.87.04.31.3380.04215.2
99384971010003용수공사조사제주특별자치도 제주시 한경면 용수리저수지공사제주본부2017-03-218.39.84.81.1040.05219.0
99394971010003용수공사조사제주특별자치도 제주시 한경면 용수리저수지공사제주본부2017-06-128.616.07.80.6970.05224.7
99404971010003용수공사조사제주특별자치도 제주시 한경면 용수리저수지공사제주본부2017-08-2810.222.410.21.0850.04338.1
99414971010003용수공사조사제주특별자치도 제주시 한경면 용수리저수지공사제주본부2017-03-219.031.313.11.6990.08998.4
99425013010001성읍공사조사제주특별자치도 서귀포시 표선읍 성읍리저수지공사제주본부2017-03-217.26.64.00.3590.0243.1
99435013010001성읍공사조사제주특별자치도 서귀포시 표선읍 성읍리저수지공사제주본부2017-06-126.86.82.90.2330.0125.0
99445013010001성읍공사조사제주특별자치도 서귀포시 표선읍 성읍리저수지공사제주본부2017-08-287.810.06.60.450.0218.7
99455013010001성읍공사조사제주특별자치도 서귀포시 표선읍 성읍리저수지공사제주본부2017-03-216.39.46.41.4130.0395.7