Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1445
Duplicate rows (%): 14.4%
Total size in memory: 664.1 KiB
Average record size in memory: 68.0 B
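
The derived figures in this table follow from the raw counts. The sketch below recomputes them, assuming that 1 KiB is 1024 bytes and that the average record size is the total size divided by the number of observations; the variable names are illustrative and not part of the report.

```python
# Values copied from the dataset statistics above.
n_rows = 10_000
duplicate_rows = 1_445
total_size_kib = 664.1

# Share of duplicate rows as a fraction of all observations.
duplicate_share = duplicate_rows / n_rows

# Average record size in bytes, assuming 1 KiB = 1024 B.
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"duplicate share: {duplicate_share:.4f}")
print(f"average record size: {avg_record_bytes:.1f} B")
```

The duplicate share works out to 14.45%, which the report displays as 14.4%.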

Variable types

Numeric: 4
Categorical: 2
DateTime: 1

Dataset

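Description: Water level data for rivers, reservoirs, and similar sites in Icheon-si, recorded in relation to rainfall. Fields include station code, station name, classification, time, average water level, minimum water level, and maximum water level.
Author: Icheon-si, Gyeonggi-do (경기도 이천시)
URL: https://www.data.go.kr/data/15038194/fileData.do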

Alerts

분류(M 10분) (classification) has constant value "M" [Constant]
Dataset has 1445 (14.4%) duplicate rows [Duplicates]
지점코드 (station code) is highly overall correlated with 지점명 (station name) [High correlation]
평균수위(cm) (average water level) is highly overall correlated with 최저수위(cm) (minimum water level) and 2 other fields [High correlation]
최저수위(cm) is highly overall correlated with 평균수위(cm) and 2 other fields [High correlation]
최고수위(cm) (maximum water level) is highly overall correlated with 평균수위(cm) and 2 other fields [High correlation]
지점명 is highly overall correlated with 지점코드 and 3 other fields [High correlation]
평균수위(cm) has 1850 (18.5%) zeros [Zeros]
최저수위(cm) has 1850 (18.5%) zeros [Zeros]
최고수위(cm) has 1850 (18.5%) zeros [Zeros]

Reproduction

Analysis started: 2023-12-12 15:57:50.714331
Analysis finished: 2023-12-12 15:57:54.102529
Duration: 3.39 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
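
The reported duration can be checked against the two timestamps. This is a small sketch using only the standard library; it assumes both timestamps are in the same time zone.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-12 15:57:50.714331")
finished = datetime.fromisoformat("2023-12-12 15:57:54.102529")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```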

Variables

지점코드 (station code)
Real number (ℝ)

HIGH CORRELATION

Distinct: 8
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.1500324 × 10^9
Minimum: 4.150025 × 10^9
Maximum: 4.150037 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4.150025 × 10^9
5-th percentile: 4.150025 × 10^9
Q1: 4.150031 × 10^9
Median: 4.150034 × 10^9
Q3: 4.150036 × 10^9
95-th percentile: 4.150037 × 10^9
Maximum: 4.150037 × 10^9
Range: 12021
Interquartile range (IQR): 4993
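
The Range and IQR rows can be recovered from the value counts listed for this variable. The sketch below expands the counts into a sorted list and takes quartiles by linear interpolation (`method="inclusive"`). That the report computes quantiles this way is an assumption, although it does reproduce the figures above.

```python
from statistics import quantiles

# Value counts copied from the frequency table of 지점코드.
counts = {
    4150025003: 1260,
    4150025004: 1204,
    4150031029: 1318,
    4150034021: 1288,
    4150034029: 1254,
    4150036022: 1196,
    4150037022: 1233,
    4150037024: 1247,
}

# Expand the counts into one sorted observation per row.
values = sorted(v for v, c in counts.items() for _ in range(c))

# Quartiles by linear interpolation between order statistics (assumed method).
q1, median, q3 = quantiles(values, n=4, method="inclusive")

value_range = max(values) - min(values)  # Maximum - Minimum
iqr = q3 - q1                            # Q3 - Q1

print(value_range, iqr)
```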

Descriptive statistics

Standard deviation: 4611.9872
Coefficient of variation (CV): 1.1113135 × 10^-6
Kurtosis: -1.0750166
Mean: 4.1500324 × 10^9
Median Absolute Deviation (MAD): 2992
Skewness: -0.69443154
Sum: 4.1500324 × 10^13
Variance: 21270426
Monotonicity: Not monotonic
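
Several of these statistics are tied together by simple identities: the variance is the square of the standard deviation, the coefficient of variation is the standard deviation divided by the mean, and the sum is the mean times the number of observations. The sketch below checks them using the rounded values printed above, so agreement is only expected up to rounding.

```python
# Rounded values copied from the report.
mean = 4.1500324e9
std = 4611.9872
n_rows = 10_000

variance = std ** 2   # should be close to the reported 21270426
cv = std / mean       # should be close to the reported 1.1113135e-6
total = mean * n_rows # should be close to the reported 4.1500324e13

print(round(variance), f"{cv:.7e}", f"{total:.7e}")
```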
Histogram with fixed size bins (bins=8)
Value        Count   Frequency (%)
4150031029   1318    13.2%
4150034021   1288    12.9%
4150025003   1260    12.6%
4150034029   1254    12.5%
4150037024   1247    12.5%
4150037022   1233    12.3%
4150025004   1204    12.0%
4150036022   1196    12.0%

지점명 (station name)
Categorical

HIGH CORRELATION

Distinct: 8
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

학암저수지: 1318
오천1교: 1288
장호원교: 1260
각평저수지: 1254
고당교: 1247
Other values (3): 3633

Length

Max length: 5
Median length: 5
Mean length: 4.255
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 오천1교
2nd row: 고당교
3rd row: 복하교
4th row: 장호원교
5th row: 복하교

Common Values

Value        Count   Frequency (%)
학암저수지   1318    13.2%
오천1교      1288    12.9%
장호원교     1260    12.6%
각평저수지   1254    12.5%
고당교       1247    12.5%
성호저수지   1233    12.3%
복하교       1204    12.0%
서경저수지   1196    12.0%
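
The frequency percentages and the mean name length can both be derived from the counts in the Common Values table. The sketch below assumes that length is measured in Unicode code points after NFC normalization, which is an assumption about how the report counts characters.

```python
import unicodedata

# Counts copied from the Common Values table of 지점명.
counts = {
    "학암저수지": 1318,
    "오천1교": 1288,
    "장호원교": 1260,
    "각평저수지": 1254,
    "고당교": 1247,
    "성호저수지": 1233,
    "복하교": 1204,
    "서경저수지": 1196,
}

total = sum(counts.values())

# Frequency of each station name as a percentage of all rows.
percent = {name: 100 * c / total for name, c in counts.items()}

# Count-weighted mean of the name lengths (NFC code points, assumed).
mean_length = sum(
    len(unicodedata.normalize("NFC", name)) * c for name, c in counts.items()
) / total

print(total, round(percent["학암저수지"], 1), mean_length)
```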

Length

Histogram of lengths of the category


분류(M 10분) (classification)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

M: 10000

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: M
2nd row: M
3rd row: M
4th row: M
5th row: M

Common Values

Value   Count   Frequency (%)
M       10000   100.0%

Length

Histogram of lengths of the category


날짜 (date)
Date

Distinct: 83
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2020-07-28 00:00:00
Maximum: 2020-10-18 00:00:00
Histogram with fixed size bins (bins=50)
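
The distinct count can be compared with the length of the date span. The sketch below counts the calendar days between the minimum and maximum, inclusive. If every value is a midnight-aligned date, which is an assumption, a span equal to the distinct count means every day in the range appears at least once.

```python
from datetime import date

# Minimum and maximum copied from the report.
first = date(2020, 7, 28)
last = date(2020, 10, 18)
distinct_reported = 83

# Number of calendar days in the span, counting both endpoints.
days_in_span = (last - first).days + 1

print(days_in_span, days_in_span == distinct_reported)
```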

평균수위(cm) (average water level, cm)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 368
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 92.0612
Minimum: -17
Maximum: 759
Zeros: 1850
Zeros (%): 18.5%
Negative: 338
Negative (%): 3.4%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -17
5-th percentile: 0
Q1: 13
Median: 99
Q3: 168
95-th percentile: 230
Maximum: 759
Range: 776
Interquartile range (IQR): 155

Descriptive statistics

Standard deviation: 89.600836
Coefficient of variation (CV): 0.9732747
Kurtosis: 4.1844004
Mean: 92.0612
Median Absolute Deviation (MAD): 81
Skewness: 1.1907028
Sum: 920612
Variance: 8028.3099
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count   Frequency (%)
0                    1850    18.5%
34                   413     4.1%
35                   321     3.2%
36                   141     1.4%
99                   140     1.4%
13                   136     1.4%
100                  134     1.3%
101                  105     1.1%
109                  104     1.0%
14                   103     1.0%
Other values (358)   6553    65.5%
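
The "Other values" row is the remainder once the ten most frequent values are set aside. A minimal sketch of that bookkeeping, using the counts from the table and the totals reported for this variable:

```python
# Ten most frequent values and their counts, copied from the table.
top_counts = {
    0: 1850, 34: 413, 35: 321, 36: 141, 99: 140,
    13: 136, 100: 134, 101: 105, 109: 104, 14: 103,
}
n_rows = 10_000   # Number of observations
n_distinct = 368  # Distinct values of this variable

# Distinct values and rows not covered by the ten listed entries.
other_distinct = n_distinct - len(top_counts)
other_count = n_rows - sum(top_counts.values())

print(other_distinct, other_count, f"{100 * other_count / n_rows:.1f}%")
```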
Minimum 10 values

Value   Count   Frequency (%)
-17     41      0.4%
-16     43      0.4%
-15     48      0.5%
-14     41      0.4%
-13     25      0.2%
-12     24      0.2%
-11     31      0.3%
-10     26      0.3%
-9      12      0.1%
-8      17      0.2%

Maximum 10 values

Value   Count   Frequency (%)
759     1       < 0.1%
758     1       < 0.1%
755     1       < 0.1%
730     1       < 0.1%
728     2       < 0.1%
727     1       < 0.1%
726     3       < 0.1%
725     1       < 0.1%
724     1       < 0.1%
723     1       < 0.1%


최저수위(cm) (minimum water level, cm)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 368
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 92.0612
Minimum: -17
Maximum: 759
Zeros: 1850
Zeros (%): 18.5%
Negative: 338
Negative (%): 3.4%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -17
5-th percentile: 0
Q1: 13
Median: 99
Q3: 168
95-th percentile: 230
Maximum: 759
Range: 776
Interquartile range (IQR): 155

Descriptive statistics

Standard deviation: 89.600836
Coefficient of variation (CV): 0.9732747
Kurtosis: 4.1844004
Mean: 92.0612
Median Absolute Deviation (MAD): 81
Skewness: 1.1907028
Sum: 920612
Variance: 8028.3099
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value                Count   Frequency (%)
0                    1850    18.5%
34                   413     4.1%
35                   321     3.2%
36                   141     1.4%
99                   140     1.4%
13                   136     1.4%
100                  134     1.3%
101                  105     1.1%
109                  104     1.0%
14                   103     1.0%
Other values (358)   6553    65.5%

Minimum 10 values

Value   Count   Frequency (%)
-17     41      0.4%
-16     43      0.4%
-15     48      0.5%
-14     41      0.4%
-13     25      0.2%
-12     24      0.2%
-11     31      0.3%
-10     26      0.3%
-9      12      0.1%
-8      17      0.2%

Maximum 10 values

Value   Count   Frequency (%)
759     1       < 0.1%
758     1       < 0.1%
755     1       < 0.1%
730     1       < 0.1%
728     2       < 0.1%
727     1       < 0.1%
726     3       < 0.1%
725     1       < 0.1%
724     1       < 0.1%
723     1       < 0.1%

최고수위(cm) (maximum water level, cm)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 368
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 92.0612
Minimum: -17
Maximum: 759
Zeros: 1850
Zeros (%): 18.5%
Negative: 338
Negative (%): 3.4%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -17
5-th percentile: 0
Q1: 13
Median: 99
Q3: 168
95-th percentile: 230
Maximum: 759
Range: 776
Interquartile range (IQR): 155

Descriptive statistics

Standard deviation: 89.600836
Coefficient of variation (CV): 0.9732747
Kurtosis: 4.1844004
Mean: 92.0612
Median Absolute Deviation (MAD): 81
Skewness: 1.1907028
Sum: 920612
Variance: 8028.3099
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value                Count   Frequency (%)
0                    1850    18.5%
34                   413     4.1%
35                   321     3.2%
36                   141     1.4%
99                   140     1.4%
13                   136     1.4%
100                  134     1.3%
101                  105     1.1%
109                  104     1.0%
14                   103     1.0%
Other values (358)   6553    65.5%

Minimum 10 values

Value   Count   Frequency (%)
-17     41      0.4%
-16     43      0.4%
-15     48      0.5%
-14     41      0.4%
-13     25      0.2%
-12     24      0.2%
-11     31      0.3%
-10     26      0.3%
-9      12      0.1%
-8      17      0.2%

Maximum 10 values

Value   Count   Frequency (%)
759     1       < 0.1%
758     1       < 0.1%
755     1       < 0.1%
730     1       < 0.1%
728     2       < 0.1%
727     1       < 0.1%
726     3       < 0.1%
725     1       < 0.1%
724     1       < 0.1%
723     1       < 0.1%

Interactions


Correlations

               지점코드   지점명   날짜    평균수위(cm)   최저수위(cm)   최고수위(cm)
지점코드       1.000      1.000    0.077   0.679          0.679          0.679
지점명         1.000      1.000    0.000   0.775          0.775          0.775
날짜           0.077      0.000    1.000   0.506          0.506          0.506
평균수위(cm)   0.679      0.775    0.506   1.000          1.000          1.000
최저수위(cm)   0.679      0.775    0.506   1.000          1.000          1.000
최고수위(cm)   0.679      0.775    0.506   1.000          1.000          1.000

               지점코드   평균수위(cm)   최저수위(cm)   최고수위(cm)   지점명
지점코드       1.000      0.353          0.353          0.353          1.000
평균수위(cm)   0.353      1.000          1.000          1.000          0.517
최저수위(cm)   0.353      1.000          1.000          1.000          0.517
최고수위(cm)   0.353      1.000          1.000          1.000          0.517
지점명         1.000      0.517          0.517          0.517          1.000
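
The High correlation alerts can be related to this second matrix by applying a cutoff to its off-diagonal entries. The sketch below uses a cutoff of 0.5. That value is an assumption chosen for illustration, since the report does not state its threshold, but with it the flagged pairs match the alerts listed in the Alerts section. The helper name `correlated_with` is hypothetical.

```python
# Second correlation matrix, copied from the report.
names = ["지점코드", "평균수위(cm)", "최저수위(cm)", "최고수위(cm)", "지점명"]
matrix = [
    [1.000, 0.353, 0.353, 0.353, 1.000],
    [0.353, 1.000, 1.000, 1.000, 0.517],
    [0.353, 1.000, 1.000, 1.000, 0.517],
    [0.353, 1.000, 1.000, 1.000, 0.517],
    [1.000, 0.517, 0.517, 0.517, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not stated in the report


def correlated_with(matrix, names, threshold):
    """Map each variable to the other variables whose |correlation| >= threshold."""
    flagged = {}
    for i, row in enumerate(matrix):
        flagged[names[i]] = [
            names[j] for j, value in enumerate(row)
            if j != i and abs(value) >= threshold
        ]
    return flagged


flagged = correlated_with(matrix, names, THRESHOLD)
for name, others in flagged.items():
    print(name, "->", others)
```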

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index   지점코드     지점명       분류(M 10분)   날짜         평균수위(cm)   최저수위(cm)   최고수위(cm)
65665   4150034021   오천1교      M              2020-09-23   104            104            104
66137   4150037024   고당교       M              2020-09-23   201            201            201
34071   4150025004   복하교       M              2020-08-26   14             14             14
72728   4150025003   장호원교     M              2020-09-29   107            107            107
9375    4150025004   복하교       M              2020-08-05   52             52             52
46997   4150025004   복하교       M              2020-09-06   21             21             21
59397   4150036022   서경저수지   M              2020-09-17   0              0              0
90534   4150037022   성호저수지   M              2020-10-14   202            202            202
53374   4150034021   오천1교      M              2020-09-12   113            113            113
39622   4150034021   오천1교      M              2020-08-31   110            110            110

Index   지점코드     지점명       분류(M 10분)   날짜         평균수위(cm)   최저수위(cm)   최고수위(cm)
1019    4150025004   복하교       M              2020-07-28   -1             -1             -1
51931   4150034029   각평저수지   M              2020-09-11   36             36             36
37859   4150036022   서경저수지   M              2020-08-29   0              0              0
90763   4150025004   복하교       M              2020-10-14   -16            -16            -16
72808   4150034029   각평저수지   M              2020-09-29   34             34             34
43010   4150025004   복하교       M              2020-09-03   122            122            122
23416   4150034029   각평저수지   M              2020-08-17   37             37             37
19168   4150031029   학암저수지   M              2020-08-13   12             12             12
29725   4150037022   성호저수지   M              2020-08-22   204            204            204
12953   4150036022   서경저수지   M              2020-08-08   0              0              0
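
The sample rows give a concrete view of why 지점코드 and 지점명 are reported as perfectly correlated: each code appears with exactly one name. The sketch below checks this on the 20 sample rows only, so it illustrates the pattern rather than proving it for all 10000 observations.

```python
# (지점코드, 지점명) pairs copied from the two sample tables above.
pairs = [
    (4150034021, "오천1교"), (4150037024, "고당교"), (4150025004, "복하교"),
    (4150025003, "장호원교"), (4150025004, "복하교"), (4150025004, "복하교"),
    (4150036022, "서경저수지"), (4150037022, "성호저수지"),
    (4150034021, "오천1교"), (4150034021, "오천1교"),
    (4150025004, "복하교"), (4150034029, "각평저수지"),
    (4150036022, "서경저수지"), (4150025004, "복하교"),
    (4150034029, "각평저수지"), (4150025004, "복하교"),
    (4150034029, "각평저수지"), (4150031029, "학암저수지"),
    (4150037022, "성호저수지"), (4150036022, "서경저수지"),
]

# Collect every name seen for each code, and every code seen for each name.
code_to_names = {}
name_to_codes = {}
for code, name in pairs:
    code_to_names.setdefault(code, set()).add(name)
    name_to_codes.setdefault(name, set()).add(code)

# The mapping is one-to-one if no code or name has more than one partner.
one_to_one = (
    all(len(names) == 1 for names in code_to_names.values())
    and all(len(codes) == 1 for codes in name_to_codes.values())
)

print(len(code_to_names), one_to_one)
```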

Duplicate rows

Most frequently occurring

Index   지점코드     지점명       분류(M 10분)   날짜         평균수위(cm)   최저수위(cm)   최고수위(cm)   # duplicates
512     4150031029   학암저수지   M              2020-09-16   0              0              0              24
955     4150036022   서경저수지   M              2020-09-13   0              0              0              24
909     4150036022   서경저수지   M              2020-07-29   0              0              0              23
965     4150036022   서경저수지   M              2020-09-23   0              0              0              23
1054    4150037022   성호저수지   M              2020-08-21   205            205            205            23
201     4150025003   장호원교     M              2020-10-17   99             99             99             22
392     4150025004   복하교       M              2020-10-11   -15            -15            -15            22
477     4150031029   학암저수지   M              2020-09-01   14             14             14             22
931     4150036022   서경저수지   M              2020-08-20   0              0              0              22
933     4150036022   서경저수지   M              2020-08-22   0              0              0              22
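
A table like this can be produced by counting how often each complete row occurs. The sketch below shows the idea with `collections.Counter` on a hypothetical miniature input. The rows and their counts are made up for illustration and are not taken from the dataset, and reading "# duplicates" as the number of occurrences of a row is an assumption.

```python
from collections import Counter

# Hypothetical records (station code, date, water level); illustrative only.
rows = [
    (4150036022, "2020-09-13", 0),
    (4150036022, "2020-09-13", 0),
    (4150036022, "2020-09-13", 0),
    (4150031029, "2020-09-16", 0),
    (4150031029, "2020-09-16", 0),
    (4150037022, "2020-08-21", 205),
]

# Count occurrences of each identical row.
counts = Counter(rows)

# Keep only rows that occur more than once.
duplicate_groups = {row: n for row, n in counts.items() if n > 1}

# Most frequently occurring row and its count.
most_frequent = counts.most_common(1)[0]

print(duplicate_groups)
print(most_frequent)
```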