Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.1 KiB
Average record size in memory72.3 B

Variable types

Categorical5
Numeric3

Alerts

댐이름 has constant value ""Constant
저수위(m) has constant value ""Constant
저수율 is highly overall correlated with 일자/시간(t) and 2 other fieldsHigh correlation
강우량(mm) is highly overall correlated with 일자/시간(t) and 2 other fieldsHigh correlation
유입량(ms) is highly overall correlated with 일자/시간(t) and 2 other fieldsHigh correlation
일자/시간(t) is highly overall correlated with 강우량(mm) and 2 other fieldsHigh correlation
일자/시간(t) has unique valuesUnique
방류량(ms) has 11 (11.0%) zerosZeros

Reproduction

Analysis started2023-12-10 10:44:38.183072
Analysis finished2023-12-10 10:44:40.551631
Duration2.37 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

댐이름
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
낙동강하굿둑
100 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row낙동강하굿둑
2nd row낙동강하굿둑
3rd row낙동강하굿둑
4th row낙동강하굿둑
5th row낙동강하굿둑

Common Values

ValueCountFrequency (%)
낙동강하굿둑 100
100.0%

Length

2023-12-10T19:44:40.672234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:44:40.841707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
낙동강하굿둑 100
100.0%

일자/시간(t)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0190302 × 1011
Minimum2.0190301 × 1011
Maximum2.0190306 × 1011
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:44:41.036336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0190301 × 1011
5-th percentile2.0190301 × 1011
Q12.0190301 × 1011
median2.0190302 × 1011
Q32.0190302 × 1011
95-th percentile2.0190303 × 1011
Maximum2.0190306 × 1011
Range51200
Interquartile range (IQR)11945

Descriptive statistics

Standard deviation8572.4172
Coefficient of variation (CV)4.2458093 × 10-8
Kurtosis3.4566806
Mean2.0190302 × 1011
Median Absolute Deviation (MAD)8160
Skewness0.87561221
Sum2.0190302 × 1013
Variance73486336
MonotonicityNot monotonic
2023-12-10T19:44:41.490644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
201903010520 1
 
1.0%
201903022230 1
 
1.0%
201903022050 1
 
1.0%
201903022100 1
 
1.0%
201903022110 1
 
1.0%
201903022120 1
 
1.0%
201903022130 1
 
1.0%
201903022140 1
 
1.0%
201903022150 1
 
1.0%
201903022200 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
201903010010 1
1.0%
201903010020 1
1.0%
201903010030 1
1.0%
201903010040 1
1.0%
201903010050 1
1.0%
201903010100 1
1.0%
201903010110 1
1.0%
201903010120 1
1.0%
201903010130 1
1.0%
201903010140 1
1.0%
ValueCountFrequency (%)
201903061210 1
1.0%
201903030350 1
1.0%
201903030340 1
1.0%
201903030330 1
1.0%
201903030320 1
1.0%
201903030310 1
1.0%
201903030300 1
1.0%
201903030250 1
1.0%
201903030240 1
1.0%
201903030230 1
1.0%

저수위(m)
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 100
100.0%

Length

2023-12-10T19:44:41.753523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:44:42.202397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 100
100.0%

강우량(mm)
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
301.908
51 
302.372
21 
302.835
15 
301.445
12 
299.136
 
1

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row302.835
2nd row302.835
3rd row302.835
4th row302.835
5th row302.835

Common Values

ValueCountFrequency (%)
301.908 51
51.0%
302.372 21
21.0%
302.835 15
 
15.0%
301.445 12
 
12.0%
299.136 1
 
1.0%

Length

2023-12-10T19:44:42.376250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:44:42.562860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
301.908 51
51.0%
302.372 21
21.0%
302.835 15
 
15.0%
301.445 12
 
12.0%
299.136 1
 
1.0%

유입량(ms)
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
98.3
51 
98.4
21 
98.6
15 
98.1
12 
97.4
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row98.6
2nd row98.6
3rd row98.6
4th row98.6
5th row98.6

Common Values

ValueCountFrequency (%)
98.3 51
51.0%
98.4 21
21.0%
98.6 15
 
15.0%
98.1 12
 
12.0%
97.4 1
 
1.0%

Length

2023-12-10T19:44:42.794183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:44:42.969909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
98.3 51
51.0%
98.4 21
21.0%
98.6 15
 
15.0%
98.1 12
 
12.0%
97.4 1
 
1.0%

방류량(ms)
Real number (ℝ)

ZEROS 

Distinct88
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean45.30397
Minimum0
Maximum259.962
Zeros11
Zeros (%)11.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:44:43.189315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11.4335
median1.826
Q32.2435
95-th percentile259.30575
Maximum259.962
Range259.962
Interquartile range (IQR)0.81

Descriptive statistics

Standard deviation97.226227
Coefficient of variation (CV)2.1460862
Kurtosis1.2058613
Mean45.30397
Median Absolute Deviation (MAD)0.418
Skewness1.7837567
Sum4530.397
Variance9452.9391
MonotonicityNot monotonic
2023-12-10T19:44:43.454800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 11
 
11.0%
1.512 2
 
2.0%
1.979 2
 
2.0%
1.202 1
 
1.0%
1.686 1
 
1.0%
2.184 1
 
1.0%
1.989 1
 
1.0%
1.833 1
 
1.0%
1.351 1
 
1.0%
2.056 1
 
1.0%
Other values (78) 78
78.0%
ValueCountFrequency (%)
0.0 11
11.0%
1.012 1
 
1.0%
1.067 1
 
1.0%
1.071 1
 
1.0%
1.075 1
 
1.0%
1.162 1
 
1.0%
1.168 1
 
1.0%
1.202 1
 
1.0%
1.247 1
 
1.0%
1.248 1
 
1.0%
ValueCountFrequency (%)
259.962 1
1.0%
259.827 1
1.0%
259.803 1
1.0%
259.703 1
1.0%
259.548 1
1.0%
259.293 1
1.0%
259.26 1
1.0%
259.19 1
1.0%
259.094 1
1.0%
259.014 1
1.0%

저수량(백만m3)
Real number (ℝ)

Distinct95
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.78722
Minimum1
Maximum3.877
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:44:43.711960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.01285
Q11.29725
median1.6845
Q32.0035
95-th percentile3.24155
Maximum3.877
Range2.877
Interquartile range (IQR)0.70625

Descriptive statistics

Standard deviation0.65857571
Coefficient of variation (CV)0.36849169
Kurtosis1.6547663
Mean1.78722
Median Absolute Deviation (MAD)0.372
Skewness1.3000549
Sum178.722
Variance0.43372197
MonotonicityNot monotonic
2023-12-10T19:44:43.979374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.0 4
 
4.0%
1.979 2
 
2.0%
1.876 2
 
2.0%
1.561 1
 
1.0%
2.429 1
 
1.0%
1.322 1
 
1.0%
1.773 1
 
1.0%
1.963 1
 
1.0%
2.817 1
 
1.0%
1.989 1
 
1.0%
Other values (85) 85
85.0%
ValueCountFrequency (%)
1.0 4
4.0%
1.01 1
 
1.0%
1.013 1
 
1.0%
1.027 1
 
1.0%
1.071 1
 
1.0%
1.089 1
 
1.0%
1.091 1
 
1.0%
1.096 1
 
1.0%
1.113 1
 
1.0%
1.115 1
 
1.0%
ValueCountFrequency (%)
3.877 1
1.0%
3.828 1
1.0%
3.662 1
1.0%
3.499 1
1.0%
3.423 1
1.0%
3.232 1
1.0%
3.171 1
1.0%
2.827 1
1.0%
2.817 1
1.0%
2.803 1
1.0%

저수율
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0.86
51 
0.87
21 
0.88
15 
0.85
12 
0.8
 
1

Length

Max length4
Median length4
Mean length3.99
Min length3

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row0.88
2nd row0.88
3rd row0.88
4th row0.88
5th row0.88

Common Values

ValueCountFrequency (%)
0.86 51
51.0%
0.87 21
21.0%
0.88 15
 
15.0%
0.85 12
 
12.0%
0.8 1
 
1.0%

Length

2023-12-10T19:44:44.232603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:44:44.420720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0.86 51
51.0%
0.87 21
21.0%
0.88 15
 
15.0%
0.85 12
 
12.0%
0.8 1
 
1.0%

Interactions

2023-12-10T19:44:39.631039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:44:38.606235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:44:39.133493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:44:39.777997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:44:38.759654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:44:39.315728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:44:39.930079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:44:38.953855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:44:39.463082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:44:44.561254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일자/시간(t)강우량(mm)유입량(ms)방류량(ms)저수량(백만m3)저수율
일자/시간(t)1.0000.8010.8010.0000.2140.801
강우량(mm)0.8011.0001.0000.1470.2801.000
유입량(ms)0.8011.0001.0000.1470.2801.000
방류량(ms)0.0000.1470.1471.0000.0000.147
저수량(백만m3)0.2140.2800.2800.0001.0000.280
저수율0.8011.0001.0000.1470.2801.000
2023-12-10T19:44:44.810695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
저수율강우량(mm)유입량(ms)
저수율1.0001.0001.000
강우량(mm)1.0001.0001.000
유입량(ms)1.0001.0001.000
2023-12-10T19:44:45.003577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일자/시간(t)방류량(ms)저수량(백만m3)강우량(mm)유입량(ms)저수율
일자/시간(t)1.000-0.031-0.0560.7560.7560.756
방류량(ms)-0.0311.0000.3570.1760.1760.176
저수량(백만m3)-0.0560.3571.0000.1130.1130.113
강우량(mm)0.7560.1760.1131.0001.0001.000
유입량(ms)0.7560.1760.1131.0001.0001.000
저수율0.7560.1760.1131.0001.0001.000

Missing values

2023-12-10T19:44:40.182295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:44:40.448265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

댐이름일자/시간(t)저수위(m)강우량(mm)유입량(ms)방류량(ms)저수량(백만m3)저수율
0낙동강하굿둑2019030105200302.83598.61.6211.5610.88
1낙동강하굿둑2019030105100302.83598.61.8531.7570.88
2낙동강하굿둑2019030105000302.83598.61.9791.9790.88
3낙동강하굿둑2019030104500302.83598.62.0322.2580.88
4낙동강하굿둑2019030104400302.83598.61.9052.1340.88
5낙동강하굿둑2019030104300302.83598.61.8191.7050.88
6낙동강하굿둑2019030104200302.83598.61.8811.8760.88
7낙동강하굿둑2019030104100302.83598.61.8771.8760.88
8낙동강하굿둑2019030104000302.83598.61.7071.7070.88
9낙동강하굿둑2019030103500302.83598.61.8641.8640.88
댐이름일자/시간(t)저수위(m)강우량(mm)유입량(ms)방류량(ms)저수량(백만m3)저수율
90낙동강하굿둑2019030218100301.90898.3258.6771.0130.86
91낙동강하굿둑2019030218000301.44598.11.5361.5360.85
92낙동강하굿둑2019030217500301.44598.11.6082.4750.85
93낙동강하굿둑2019030217400301.44598.11.2471.0890.85
94낙동강하굿둑2019030217300301.44598.10.01.2170.85
95낙동강하굿둑2019030217200301.44598.11.1681.3930.85
96낙동강하굿둑2019030217100301.44598.11.0671.00.85
97낙동강하굿둑2019030217000301.90898.31.0711.0710.86
98낙동강하굿둑2019030216500301.44598.10.01.0910.85
99낙동강하굿둑2019030612100299.13697.40.01.9420.8