Overview

Dataset statistics

Number of variables14
Number of observations10000
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 MiB
Average record size in memory127.0 B

Variable types

Numeric7
Text3
Categorical3
DateTime1

Dataset

Description한국농어촌공사가 관리하는 농업용수로 사용되는 저수지의 주소,담당관리기관,조사일자, 수질정보 등
Author한국농어촌공사
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20191014000000001276

Alerts

조사구분 has constant value ""Constant
관리구분 has constant value ""Constant
COD is highly overall correlated with TOC and 2 other fieldsHigh correlation
TOC is highly overall correlated with COD and 2 other fieldsHigh correlation
T-P is highly overall correlated with COD and 2 other fieldsHigh correlation
SS is highly overall correlated with COD and 2 other fieldsHigh correlation
시설구분 is highly imbalanced (99.1%)Imbalance

Reproduction

Analysis started2023-12-11 03:33:19.745723
Analysis finished2023-12-11 03:33:27.994044
Duration8.25 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

표준코드
Real number (ℝ)

Distinct2517
Distinct (%)25.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.5761678 × 109
Minimum2.6710101 × 109
Maximum5.01301 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T12:33:28.113009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.6710101 × 109
5-th percentile3.1710103 × 109
Q14.5720101 × 109
median4.6830101 × 109
Q34.7750101 × 109
95-th percentile4.8860102 × 109
Maximum5.01301 × 109
Range2.3419999 × 109
Interquartile range (IQR)2.0299997 × 108

Descriptive statistics

Standard deviation4.0659163 × 108
Coefficient of variation (CV)0.088849807
Kurtosis9.5710464
Mean4.5761678 × 109
Median Absolute Deviation (MAD)1.0200004 × 108
Skewness-3.1102011
Sum4.5761678 × 1013
Variance1.6531675 × 1017
MonotonicityNot monotonic
2023-12-11T12:33:28.314564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4889010186 4
 
< 0.1%
4874010192 4
 
< 0.1%
4377010042 4
 
< 0.1%
4272010015 4
 
< 0.1%
4514010045 4
 
< 0.1%
4688010031 4
 
< 0.1%
4678010070 4
 
< 0.1%
4792010067 4
 
< 0.1%
4579010099 4
 
< 0.1%
4783010052 4
 
< 0.1%
Other values (2507) 9960
99.6%
ValueCountFrequency (%)
2671010085 4
< 0.1%
2671010098 4
< 0.1%
2714010023 4
< 0.1%
2723010006 4
< 0.1%
2723010007 4
< 0.1%
2723010012 4
< 0.1%
2723010022 2
< 0.1%
2726010001 4
< 0.1%
2726010006 4
< 0.1%
2726010008 4
< 0.1%
ValueCountFrequency (%)
5013010001 4
< 0.1%
4971010003 4
< 0.1%
4971010002 4
< 0.1%
4971010001 4
< 0.1%
4889010614 4
< 0.1%
4889010613 4
< 0.1%
4889010610 4
< 0.1%
4889010609 4
< 0.1%
4889010608 4
< 0.1%
4889010607 4
< 0.1%
Distinct1968
Distinct (%)19.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T12:33:28.807217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length2
Mean length2.2281
Min length1

Characters and Unicode

Total characters22281
Distinct characters369
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row상부1
2nd row미동
3rd row효정
4th row천곡
5th row한개
ValueCountFrequency (%)
대곡 39
 
0.4%
학동 36
 
0.4%
연화 36
 
0.4%
방축 32
 
0.3%
신기 32
 
0.3%
연동 28
 
0.3%
백운 28
 
0.3%
신흥 27
 
0.3%
대동 24
 
0.2%
구룡 24
 
0.2%
Other values (1957) 9693
96.9%
2023-12-11T12:33:29.388264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
869
 
3.9%
766
 
3.4%
717
 
3.2%
515
 
2.3%
434
 
1.9%
410
 
1.8%
381
 
1.7%
336
 
1.5%
319
 
1.4%
313
 
1.4%
Other values (359) 17221
77.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21345
95.8%
Decimal Number 619
 
2.8%
Close Punctuation 156
 
0.7%
Open Punctuation 156
 
0.7%
Dash Punctuation 4
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
869
 
4.1%
766
 
3.6%
717
 
3.4%
515
 
2.4%
434
 
2.0%
410
 
1.9%
381
 
1.8%
336
 
1.6%
319
 
1.5%
313
 
1.5%
Other values (349) 16285
76.3%
Decimal Number
ValueCountFrequency (%)
1 292
47.2%
2 271
43.8%
3 36
 
5.8%
4 12
 
1.9%
5 4
 
0.6%
6 4
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 156
100.0%
Open Punctuation
ValueCountFrequency (%)
( 156
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21345
95.8%
Common 936
 
4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
869
 
4.1%
766
 
3.6%
717
 
3.4%
515
 
2.4%
434
 
2.0%
410
 
1.9%
381
 
1.8%
336
 
1.6%
319
 
1.5%
313
 
1.5%
Other values (349) 16285
76.3%
Common
ValueCountFrequency (%)
1 292
31.2%
2 271
29.0%
) 156
16.7%
( 156
16.7%
3 36
 
3.8%
4 12
 
1.3%
5 4
 
0.4%
- 4
 
0.4%
6 4
 
0.4%
1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21345
95.8%
ASCII 936
 
4.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
869
 
4.1%
766
 
3.6%
717
 
3.4%
515
 
2.4%
434
 
2.0%
410
 
1.9%
381
 
1.8%
336
 
1.6%
319
 
1.5%
313
 
1.5%
Other values (349) 16285
76.3%
ASCII
ValueCountFrequency (%)
1 292
31.2%
2 271
29.0%
) 156
16.7%
( 156
16.7%
3 36
 
3.8%
4 12
 
1.3%
5 4
 
0.4%
- 4
 
0.4%
6 4
 
0.4%
1
 
0.1%

조사구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공사조사
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공사조사
2nd row공사조사
3rd row공사조사
4th row공사조사
5th row공사조사

Common Values

ValueCountFrequency (%)
공사조사 10000
100.0%

Length

2023-12-11T12:33:29.536251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:33:29.622995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사조사 10000
100.0%

주소
Text

Distinct1783
Distinct (%)17.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T12:33:29.933805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length16
Mean length15.8595
Min length11

Characters and Unicode

Total characters158595
Distinct characters310
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row경상남도 합천군 적중면 상부리
2nd row전라북도 고창군 성송면 하고리
3rd row경상북도 영천시 화산면 효정리
4th row경상남도 의령군 대의면 천곡리
5th row경상남도 사천시 서포면 조도리
ValueCountFrequency (%)
전라남도 3273
 
8.2%
경상북도 1968
 
5.0%
경상남도 1767
 
4.4%
전라북도 1242
 
3.1%
나주시 632
 
1.6%
영암군 516
 
1.3%
충청남도 479
 
1.2%
충청북도 431
 
1.1%
영천시 322
 
0.8%
합천군 288
 
0.7%
Other values (2187) 28822
72.5%
2023-12-11T12:33:30.437212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29744
18.8%
10066
 
6.3%
9480
 
6.0%
8109
 
5.1%
6886
 
4.3%
6461
 
4.1%
4903
 
3.1%
4559
 
2.9%
4342
 
2.7%
4303
 
2.7%
Other values (300) 69742
44.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 128847
81.2%
Space Separator 29744
 
18.8%
Decimal Number 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10066
 
7.8%
9480
 
7.4%
8109
 
6.3%
6886
 
5.3%
6461
 
5.0%
4903
 
3.8%
4559
 
3.5%
4342
 
3.4%
4303
 
3.3%
4259
 
3.3%
Other values (298) 65479
50.8%
Space Separator
ValueCountFrequency (%)
29744
100.0%
Decimal Number
ValueCountFrequency (%)
1 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 128847
81.2%
Common 29748
 
18.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10066
 
7.8%
9480
 
7.4%
8109
 
6.3%
6886
 
5.3%
6461
 
5.0%
4903
 
3.8%
4559
 
3.5%
4342
 
3.4%
4303
 
3.3%
4259
 
3.3%
Other values (298) 65479
50.8%
Common
ValueCountFrequency (%)
29744
> 99.9%
1 4
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 128847
81.2%
ASCII 29748
 
18.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
29744
> 99.9%
1 4
 
< 0.1%
Hangul
ValueCountFrequency (%)
10066
 
7.8%
9480
 
7.4%
8109
 
6.3%
6886
 
5.3%
6461
 
5.0%
4903
 
3.8%
4559
 
3.5%
4342
 
3.4%
4303
 
3.3%
4259
 
3.3%
Other values (298) 65479
50.8%

시설구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
저수지
9992 
담수호
 
8

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row저수지
2nd row저수지
3rd row저수지
4th row저수지
5th row저수지

Common Values

ValueCountFrequency (%)
저수지 9992
99.9%
담수호 8
 
0.1%

Length

2023-12-11T12:33:30.571279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:33:30.667061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
저수지 9992
99.9%
담수호 8
 
0.1%

관리구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공사
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공사
2nd row공사
3rd row공사
4th row공사
5th row공사

Common Values

ValueCountFrequency (%)
공사 10000
100.0%

Length

2023-12-11T12:33:30.807599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:33:30.924965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사 10000
100.0%
Distinct94
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T12:33:31.235374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length4
Mean length4.8068
Min length4

Characters and Unicode

Total characters48068
Distinct characters100
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row합천지사
2nd row고창지사
3rd row영천지사
4th row의령지사
5th row사천지사
ValueCountFrequency (%)
나주지사 624
 
6.2%
영암지사 516
 
5.2%
영천지사 322
 
3.2%
무안신안지사 291
 
2.9%
합천지사 288
 
2.9%
울산지사 287
 
2.9%
해남완도지사 283
 
2.8%
전주완주임실지사 268
 
2.7%
남원지사 263
 
2.6%
사천지사 243
 
2.4%
Other values (84) 6615
66.1%
2023-12-11T12:33:31.679366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10227
21.3%
9980
20.8%
2071
 
4.3%
1624
 
3.4%
1585
 
3.3%
1129
 
2.3%
1034
 
2.2%
934
 
1.9%
891
 
1.9%
753
 
1.6%
Other values (90) 17840
37.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 48068
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10227
21.3%
9980
20.8%
2071
 
4.3%
1624
 
3.4%
1585
 
3.3%
1129
 
2.3%
1034
 
2.2%
934
 
1.9%
891
 
1.9%
753
 
1.6%
Other values (90) 17840
37.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 48068
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10227
21.3%
9980
20.8%
2071
 
4.3%
1624
 
3.4%
1585
 
3.3%
1129
 
2.3%
1034
 
2.2%
934
 
1.9%
891
 
1.9%
753
 
1.6%
Other values (90) 17840
37.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 48068
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10227
21.3%
9980
20.8%
2071
 
4.3%
1624
 
3.4%
1585
 
3.3%
1129
 
2.3%
1034
 
2.2%
934
 
1.9%
891
 
1.9%
753
 
1.6%
Other values (90) 17840
37.1%
Distinct133
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-02-27 00:00:00
Maximum2018-12-13 00:00:00
2023-12-11T12:33:31.837021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:31.968559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

pH
Real number (ℝ)

Distinct60
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.709269
Minimum3.1
Maximum11.1
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T12:33:32.105728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.1
5-th percentile6.8
Q17.2
median7.6
Q38
95-th percentile9.2
Maximum11.1
Range8
Interquartile range (IQR)0.8

Descriptive statistics

Standard deviation0.71607411
Coefficient of variation (CV)0.092884826
Kurtosis1.0381414
Mean7.709269
Median Absolute Deviation (MAD)0.4
Skewness0.93607284
Sum77092.69
Variance0.51276213
MonotonicityNot monotonic
2023-12-11T12:33:32.256141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7.4 740
 
7.4%
7.3 732
 
7.3%
7.5 716
 
7.2%
7.2 667
 
6.7%
7.6 629
 
6.3%
7.7 605
 
6.0%
7.8 541
 
5.4%
7.1 513
 
5.1%
7.9 476
 
4.8%
7.0 460
 
4.6%
Other values (50) 3921
39.2%
ValueCountFrequency (%)
3.1 1
 
< 0.1%
5.9 2
 
< 0.1%
6.0 3
 
< 0.1%
6.1 5
 
0.1%
6.2 4
 
< 0.1%
6.3 12
 
0.1%
6.4 30
 
0.3%
6.5 68
 
0.7%
6.6 112
1.1%
6.7 186
1.9%
ValueCountFrequency (%)
11.1 1
 
< 0.1%
10.9 1
 
< 0.1%
10.7 2
 
< 0.1%
10.6 4
 
< 0.1%
10.5 1
 
< 0.1%
10.4 4
 
< 0.1%
10.3 4
 
< 0.1%
10.2 10
0.1%
10.1 13
0.1%
10.0 20
0.2%

COD
Real number (ℝ)

HIGH CORRELATION 

Distinct203
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.46885
Minimum0.6
Maximum96.1
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T12:33:32.423859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.6
5-th percentile2.8
Q14.6
median6.8
Q39
95-th percentile15.2
Maximum96.1
Range95.5
Interquartile range (IQR)4.4

Descriptive statistics

Standard deviation4.4124542
Coefficient of variation (CV)0.59078093
Kurtosis57.056204
Mean7.46885
Median Absolute Deviation (MAD)2.2
Skewness4.4974362
Sum74688.5
Variance19.469752
MonotonicityNot monotonic
2023-12-11T12:33:32.577145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.6 275
 
2.8%
5.6 274
 
2.7%
4.2 274
 
2.7%
6.6 268
 
2.7%
4.0 263
 
2.6%
5.0 257
 
2.6%
8.0 253
 
2.5%
5.2 251
 
2.5%
7.6 248
 
2.5%
4.8 245
 
2.5%
Other values (193) 7392
73.9%
ValueCountFrequency (%)
0.6 1
 
< 0.1%
1.0 5
 
0.1%
1.2 12
 
0.1%
1.4 17
 
0.2%
1.6 31
0.3%
1.7 1
 
< 0.1%
1.8 34
0.3%
1.9 1
 
< 0.1%
2.0 59
0.6%
2.1 1
 
< 0.1%
ValueCountFrequency (%)
96.1 1
< 0.1%
86.1 1
< 0.1%
82.2 1
< 0.1%
82.1 1
< 0.1%
80.1 1
< 0.1%
58.1 1
< 0.1%
47.2 1
< 0.1%
43.2 1
< 0.1%
41.7 1
< 0.1%
40.9 2
< 0.1%

TOC
Real number (ℝ)

HIGH CORRELATION 

Distinct161
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.62453
Minimum0.5
Maximum38.6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T12:33:32.717881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.5
5-th percentile1.6
Q12.9
median4.3
Q35.9
95-th percentile8.8
Maximum38.6
Range38.1
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.3979262
Coefficient of variation (CV)0.51852323
Kurtosis18.232293
Mean4.62453
Median Absolute Deviation (MAD)1.5
Skewness2.2472493
Sum46245.3
Variance5.7500503
MonotonicityNot monotonic
2023-12-11T12:33:32.887444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.0 212
 
2.1%
3.7 204
 
2.0%
4.0 198
 
2.0%
3.9 194
 
1.9%
4.3 193
 
1.9%
3.5 193
 
1.9%
4.1 191
 
1.9%
3.8 188
 
1.9%
2.3 186
 
1.9%
2.8 185
 
1.8%
Other values (151) 8056
80.6%
ValueCountFrequency (%)
0.5 3
 
< 0.1%
0.6 4
 
< 0.1%
0.7 15
 
0.1%
0.8 16
 
0.2%
0.9 28
 
0.3%
1.0 31
0.3%
1.1 40
0.4%
1.2 47
0.5%
1.3 63
0.6%
1.4 72
0.7%
ValueCountFrequency (%)
38.6 1
< 0.1%
36.6 1
< 0.1%
35.8 1
< 0.1%
33.6 1
< 0.1%
32.7 1
< 0.1%
32.6 1
< 0.1%
25.9 1
< 0.1%
22.3 1
< 0.1%
21.6 1
< 0.1%
21.5 1
< 0.1%

T-N
Real number (ℝ)

Distinct3235
Distinct (%)32.4%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean1.5057781
Minimum0.019
Maximum48.192
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T12:33:33.030883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.019
5-th percentile0.326
Q10.657
median1.087
Q31.769
95-th percentile3.8542
Maximum48.192
Range48.173
Interquartile range (IQR)1.112

Descriptive statistics

Standard deviation1.8416862
Coefficient of variation (CV)1.2230794
Kurtosis169.10539
Mean1.5057781
Median Absolute Deviation (MAD)0.509
Skewness9.5731196
Sum15056.275
Variance3.3918079
MonotonicityNot monotonic
2023-12-11T12:33:33.213329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.708 16
 
0.2%
0.89 15
 
0.1%
0.893 14
 
0.1%
0.56 14
 
0.1%
0.347 13
 
0.1%
0.611 13
 
0.1%
0.769 12
 
0.1%
0.939 12
 
0.1%
0.482 12
 
0.1%
0.546 12
 
0.1%
Other values (3225) 9866
98.7%
ValueCountFrequency (%)
0.019 1
< 0.1%
0.02 1
< 0.1%
0.044 1
< 0.1%
0.067 1
< 0.1%
0.072 1
< 0.1%
0.073 1
< 0.1%
0.095 1
< 0.1%
0.103 1
< 0.1%
0.104 1
< 0.1%
0.105 1
< 0.1%
ValueCountFrequency (%)
48.192 1
< 0.1%
44.873 1
< 0.1%
44.129 1
< 0.1%
43.904 1
< 0.1%
36.112 1
< 0.1%
30.689 1
< 0.1%
23.431 1
< 0.1%
22.481 1
< 0.1%
21.426 1
< 0.1%
21.085 1
< 0.1%

T-P
Real number (ℝ)

HIGH CORRELATION 

Distinct480
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0599825
Minimum0.001
Maximum4.972
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T12:33:33.377866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.001
5-th percentile0.009
Q10.017
median0.029
Q30.054
95-th percentile0.20605
Maximum4.972
Range4.971
Interquartile range (IQR)0.037

Descriptive statistics

Standard deviation0.13572262
Coefficient of variation (CV)2.2627036
Kurtosis377.55285
Mean0.0599825
Median Absolute Deviation (MAD)0.014
Skewness15.134082
Sum599.825
Variance0.018420629
MonotonicityNot monotonic
2023-12-11T12:33:33.891778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.016 287
 
2.9%
0.017 270
 
2.7%
0.015 266
 
2.7%
0.013 262
 
2.6%
0.018 257
 
2.6%
0.014 249
 
2.5%
0.019 246
 
2.5%
0.012 246
 
2.5%
0.021 240
 
2.4%
0.023 227
 
2.3%
Other values (470) 7450
74.5%
ValueCountFrequency (%)
0.001 2
 
< 0.1%
0.002 8
 
0.1%
0.003 11
 
0.1%
0.004 25
 
0.2%
0.005 48
 
0.5%
0.006 78
0.8%
0.007 103
1.0%
0.008 122
1.2%
0.009 166
1.7%
0.01 192
1.9%
ValueCountFrequency (%)
4.972 1
< 0.1%
3.907 1
< 0.1%
3.788 1
< 0.1%
3.046 1
< 0.1%
2.971 1
< 0.1%
2.301 1
< 0.1%
2.276 1
< 0.1%
2.222 1
< 0.1%
1.878 1
< 0.1%
1.523 1
< 0.1%

SS
Real number (ℝ)

HIGH CORRELATION 

Distinct482
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.3501
Minimum0
Maximum660.7
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T12:33:34.054947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1.1
Q12.7
median4.8
Q39.1
95-th percentile24.805
Maximum660.7
Range660.7
Interquartile range (IQR)6.4

Descriptive statistics

Standard deviation15.961989
Coefficient of variation (CV)1.9115926
Kurtosis482.89717
Mean8.3501
Median Absolute Deviation (MAD)2.7
Skewness16.63179
Sum83501
Variance254.7851
MonotonicityNot monotonic
2023-12-11T12:33:34.235441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2.0 190
 
1.9%
3.0 183
 
1.8%
4.0 165
 
1.7%
2.3 164
 
1.6%
2.8 161
 
1.6%
3.2 153
 
1.5%
1.8 152
 
1.5%
2.4 151
 
1.5%
3.3 145
 
1.5%
1.6 140
 
1.4%
Other values (472) 8396
84.0%
ValueCountFrequency (%)
0.0 1
 
< 0.1%
0.1 17
 
0.2%
0.2 30
 
0.3%
0.3 27
 
0.3%
0.4 36
0.4%
0.5 37
0.4%
0.6 33
0.3%
0.7 74
0.7%
0.8 82
0.8%
0.9 74
0.7%
ValueCountFrequency (%)
660.7 1
< 0.1%
508.4 1
< 0.1%
390.5 1
< 0.1%
371.0 1
< 0.1%
312.0 1
< 0.1%
300.0 1
< 0.1%
272.0 1
< 0.1%
205.4 1
< 0.1%
205.3 1
< 0.1%
187.0 1
< 0.1%

Interactions

2023-12-11T12:33:26.689613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:21.706353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:22.467898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:23.248301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:23.979576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:24.757551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:25.577378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:26.818543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:21.813805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:22.595336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:23.354632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:24.077995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:24.868885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:25.694711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:26.940514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:21.913272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:22.716613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:23.461170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:24.192047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:24.972230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:25.805443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:27.073548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:22.008296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:22.836911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:23.562065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:24.301486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:25.071069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:25.902393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:27.180401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:22.121723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:22.941171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:23.667908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:24.417986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:25.216315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:26.345526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:27.310663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:22.227739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:23.045258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:23.767565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:24.523081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:25.312505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:26.473876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:27.443819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:22.337973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:23.143795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:23.870658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:24.638529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:25.437006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:26.586147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T12:33:34.370796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
표준코드시설구분관리기관pHCODTOCT-NT-PSS
표준코드1.0000.0070.9940.1280.0760.1130.0000.0130.044
시설구분0.0071.0000.1700.0320.0650.0000.0000.0000.000
관리기관0.9940.1701.0000.4450.3180.4460.1520.0000.000
pH0.1280.0320.4451.0000.3680.2050.1570.0350.043
COD0.0760.0650.3180.3681.0000.8410.2680.2740.540
TOC0.1130.0000.4460.2050.8411.0000.3790.3560.351
T-N0.0000.0000.1520.1570.2680.3791.0000.4240.218
T-P0.0130.0000.0000.0350.2740.3560.4241.0000.457
SS0.0440.0000.0000.0430.5400.3510.2180.4571.000
2023-12-11T12:33:34.517743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
표준코드pHCODTOCT-NT-PSS시설구분
표준코드1.0000.141-0.0050.042-0.031-0.034-0.0810.008
pH0.1411.0000.0590.0860.147-0.0450.0060.040
COD-0.0050.0591.0000.9490.0150.6050.5880.049
TOC0.0420.0860.9491.0000.0370.5770.5030.000
T-N-0.0310.1470.0150.0371.0000.2670.1190.000
T-P-0.034-0.0450.6050.5770.2671.0000.5720.000
SS-0.0810.0060.5880.5030.1190.5721.0000.000
시설구분0.0080.0400.0490.0000.0000.0000.0001.000

Missing values

2023-12-11T12:33:27.620070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:33:27.888878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

표준코드시설명조사구분주소시설구분관리구분관리기관조사일자pHCODTOCT-NT-PSS
98884889010186상부1공사조사경상남도 합천군 적중면 상부리저수지공사합천지사2018-09-177.76.24.32.2840.0822.1
28804579010230미동공사조사전라북도 고창군 성송면 하고리저수지공사고창지사2018-11-217.48.44.71.1270.0439.0
68764723010300효정공사조사경상북도 영천시 화산면 효정리저수지공사영천지사2018-11-297.48.66.70.8010.0233.6
88254872010051천곡공사조사경상남도 의령군 대의면 천곡리저수지공사의령지사2018-05-237.84.83.40.830.0131.6
85974824010156한개공사조사경상남도 사천시 서포면 조도리저수지공사사천지사2018-06-187.37.25.01.4920.0346.8
40924672010219매월공사조사전라남도 곡성군 입면 매월리저수지공사곡성지사2018-11-137.34.82.80.6510.0081.5
10894374010038오리곡공사조사충청북도 영동군 매곡면 공수리저수지공사옥천영동지사2018-06-217.88.65.81.430.112.0
99604889010340큰골공사조사경상남도 합천군 율곡면 영전리저수지공사합천지사2018-09-187.94.22.31.320.010.9
67664721010125성곡공사조사경상북도 봉화군 재산면 동면리저수지공사영주봉화지사2018-06-208.65.84.43.7060.0171.5
85014824010131지진개공사조사경상남도 사천시 서포면 구랑리저수지공사사천지사2018-06-187.510.05.90.470.0573.6
표준코드시설명조사구분주소시설구분관리구분관리기관조사일자pHCODTOCT-NT-PSS
88214872010050심지상공사조사경상남도 의령군 대의면 심지리저수지공사의령지사2018-05-237.85.03.30.2520.0133.2
71314725010209앞실공사조사경상북도 상주시 은척면 남곡리저수지공사상주지사2018-11-217.41.40.81.1290.0060.2
9474371010140부연공사조사충청북도 청주시 청원구 북이면 부연리저수지공사청주지사2018-10-307.06.64.81.8670.03817.0
31634617010032오봉공사조사전라남도 나주시 세지면 오봉리저수지공사나주지사2018-11-197.26.03.90.3860.026.4
37704671010007고가공사조사전라남도 담양군 담양읍 가산리저수지공사담양지사2018-08-307.64.62.40.9980.0314.3
22671010085송정공사조사부산광역시 기장군 철마면 송정리저수지공사울산지사2018-09-277.35.23.51.10.024.4
97704889010116원당3공사조사경상남도 합천군 초계면 원당리저수지공사합천지사2018-03-287.74.02.32.8370.0273.5
28794579010230미동공사조사전라북도 고창군 성송면 하고리저수지공사고창지사2018-08-098.610.47.00.5390.0510.3
75934775010179대원상공사조사경상북도 청송군 진보면 신촌리저수지공사청송영양지사2018-06-208.45.84.30.9730.0153.7
81804790010106마전1공사조사경상북도 예천군 지보면 만화리저수지공사예천지사2018-10-247.013.28.52.3010.04213.3