Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.1 KiB
Average record size in memory72.3 B

Variable types

Categorical5
Numeric3

Alerts

댐이름 has constant value ""Constant
강우량(mm) has constant value ""Constant
저수위(m) is highly overall correlated with 일자/시간(t) and 3 other fieldsHigh correlation
저수량(백만m3) is highly overall correlated with 일자/시간(t) and 3 other fieldsHigh correlation
저수율 is highly overall correlated with 일자/시간(t) and 3 other fieldsHigh correlation
일자/시간(t) is highly overall correlated with 유입량(ms) and 4 other fieldsHigh correlation
유입량(ms) is highly overall correlated with 일자/시간(t) and 1 other fieldsHigh correlation
방류량(ms) is highly overall correlated with 일자/시간(t) and 4 other fieldsHigh correlation

Reproduction

Analysis started2023-12-10 13:17:09.235943
Analysis finished2023-12-10 13:17:12.173475
Duration2.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

댐이름
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
강천보
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강천보
2nd row강천보
3rd row강천보
4th row강천보
5th row강천보

Common Values

ValueCountFrequency (%)
강천보 100
100.0%

Length

2023-12-10T22:17:12.417616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:17:12.768098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강천보 100
100.0%

일자/시간(t)
Real number (ℝ)

HIGH CORRELATION 

Distinct75
Distinct (%)75.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0200509 × 1011
Minimum2.0200508 × 1011
Maximum2.0200509 × 1011
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:17:13.083652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0200508 × 1011
5-th percentile2.0200509 × 1011
Q12.0200509 × 1011
median2.0200509 × 1011
Q32.0200509 × 1011
95-th percentile2.0200509 × 1011
Maximum2.0200509 × 1011
Range9820
Interquartile range (IQR)612.5

Descriptive statistics

Standard deviation1437.7965
Coefficient of variation (CV)7.1176251 × 10-9
Kurtosis27.773108
Mean2.0200509 × 1011
Median Absolute Deviation (MAD)305
Skewness-4.8297146
Sum2.0200509 × 1013
Variance2067258.8
MonotonicityNot monotonic
2023-12-10T22:17:13.403043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
202005091230 2
 
2.0%
202005091930 2
 
2.0%
202005092030 2
 
2.0%
202005092130 2
 
2.0%
202005092100 2
 
2.0%
202005091900 2
 
2.0%
202005091700 2
 
2.0%
202005091630 2
 
2.0%
202005090200 2
 
2.0%
202005091430 2
 
2.0%
Other values (65) 80
80.0%
ValueCountFrequency (%)
202005082400 2
2.0%
202005090010 1
1.0%
202005090020 1
1.0%
202005090030 2
2.0%
202005090040 1
1.0%
202005090050 1
1.0%
202005090100 2
2.0%
202005090110 1
1.0%
202005090120 1
1.0%
202005090130 2
2.0%
ValueCountFrequency (%)
202005092220 1
1.0%
202005092210 1
1.0%
202005092200 2
2.0%
202005092150 1
1.0%
202005092140 1
1.0%
202005092130 2
2.0%
202005092120 1
1.0%
202005092110 1
1.0%
202005092100 2
2.0%
202005092050 1
1.0%

저수위(m)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
38.2
53 
38.19
47 

Length

Max length5
Median length4
Mean length4.47
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row38.19
2nd row38.19
3rd row38.19
4th row38.19
5th row38.19

Common Values

ValueCountFrequency (%)
38.2 53
53.0%
38.19 47
47.0%

Length

2023-12-10T22:17:13.675949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:17:13.950888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
38.2 53
53.0%
38.19 47
47.0%

강우량(mm)
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 100
100.0%

Length

2023-12-10T22:17:14.140885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:17:14.277841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 100
100.0%

유입량(ms)
Real number (ℝ)

HIGH CORRELATION 

Distinct68
Distinct (%)68.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean111.91745
Minimum109.493
Maximum136.904
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:17:14.439427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum109.493
5-th percentile109.623
Q1109.9045
median111.785
Q3111.923
95-th percentile112.06035
Maximum136.904
Range27.411
Interquartile range (IQR)2.0185

Descriptive statistics

Standard deviation5.0868526
Coefficient of variation (CV)0.045451827
Kurtosis19.275325
Mean111.91745
Median Absolute Deviation (MAD)1.722
Skewness4.4585826
Sum11191.745
Variance25.876069
MonotonicityNot monotonic
2023-12-10T22:17:15.048224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
111.847 3
 
3.0%
109.897 3
 
3.0%
112.017 3
 
3.0%
111.903 3
 
3.0%
111.923 2
 
2.0%
111.89 2
 
2.0%
109.993 2
 
2.0%
112.06 2
 
2.0%
111.877 2
 
2.0%
111.76 2
 
2.0%
Other values (58) 76
76.0%
ValueCountFrequency (%)
109.493 1
1.0%
109.553 1
1.0%
109.573 2
2.0%
109.623 2
2.0%
109.627 1
1.0%
109.69 2
2.0%
109.73 1
1.0%
109.733 1
1.0%
109.74 1
1.0%
109.783 1
1.0%
ValueCountFrequency (%)
136.904 1
 
1.0%
136.197 2
2.0%
135.511 1
 
1.0%
112.067 1
 
1.0%
112.06 2
2.0%
112.04 1
 
1.0%
112.02 1
 
1.0%
112.017 3
3.0%
112.0 2
2.0%
111.99 2
2.0%

방류량(ms)
Real number (ℝ)

HIGH CORRELATION 

Distinct77
Distinct (%)77.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean110.93744
Minimum108.97
Maximum112.2
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:17:15.568287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum108.97
5-th percentile109.65815
Q1109.89925
median111.585
Q3111.92
95-th percentile112.063
Maximum112.2
Range3.23
Interquartile range (IQR)2.02075

Descriptive statistics

Standard deviation1.0358717
Coefficient of variation (CV)0.0093374404
Kurtosis-1.8622723
Mean110.93744
Median Absolute Deviation (MAD)0.57
Skewness-0.13063068
Sum11093.744
Variance1.0730302
MonotonicityNot monotonic
2023-12-10T22:17:15.840940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
111.86 3
 
3.0%
111.83 3
 
3.0%
111.89 3
 
3.0%
112.0 2
 
2.0%
111.91 2
 
2.0%
109.84 2
 
2.0%
110.01 2
 
2.0%
109.87 2
 
2.0%
111.92 2
 
2.0%
111.76 2
 
2.0%
Other values (67) 77
77.0%
ValueCountFrequency (%)
108.97 1
1.0%
109.5 1
1.0%
109.53 1
1.0%
109.573 1
1.0%
109.623 1
1.0%
109.66 1
1.0%
109.67 1
1.0%
109.69 2
2.0%
109.78 1
1.0%
109.8 2
2.0%
ValueCountFrequency (%)
112.2 1
1.0%
112.18 1
1.0%
112.16 1
1.0%
112.15 1
1.0%
112.12 1
1.0%
112.06 1
1.0%
112.05 1
1.0%
112.04 2
2.0%
112.017 1
1.0%
112.01 1
1.0%

저수량(백만m3)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
9.633
53 
9.588
47 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row9.588
2nd row9.588
3rd row9.588
4th row9.588
5th row9.588

Common Values

ValueCountFrequency (%)
9.633 53
53.0%
9.588 47
47.0%

Length

2023-12-10T22:17:16.153080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:17:16.408853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
9.633 53
53.0%
9.588 47
47.0%

저수율
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
110.4
53 
109.9
47 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row109.9
2nd row109.9
3rd row109.9
4th row109.9
5th row109.9

Common Values

ValueCountFrequency (%)
110.4 53
53.0%
109.9 47
47.0%

Length

2023-12-10T22:17:16.610396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:17:16.808775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
110.4 53
53.0%
109.9 47
47.0%

Interactions

2023-12-10T22:17:10.753918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:17:09.679806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:17:10.279341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:17:10.958454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:17:09.871448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:17:10.416768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:17:11.133255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:17:10.106281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:17:10.566172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:17:16.918005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일자/시간(t)저수위(m)유입량(ms)방류량(ms)저수량(백만m3)저수율
일자/시간(t)1.0000.3350.0000.5110.3350.335
저수위(m)0.3351.0000.1571.0000.9990.999
유입량(ms)0.0000.1571.0000.6410.1570.157
방류량(ms)0.5111.0000.6411.0001.0001.000
저수량(백만m3)0.3350.9990.1571.0001.0000.999
저수율0.3350.9990.1571.0000.9991.000
2023-12-10T22:17:17.079795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
저수위(m)저수량(백만m3)저수율
저수위(m)1.0000.9800.980
저수량(백만m3)0.9801.0000.980
저수율0.9800.9801.000
2023-12-10T22:17:17.214595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일자/시간(t)유입량(ms)방류량(ms)저수위(m)저수량(백만m3)저수율
일자/시간(t)1.0000.7750.7990.5390.5390.539
유입량(ms)0.7751.0000.8880.1000.1000.100
방류량(ms)0.7990.8881.0000.9740.9740.974
저수위(m)0.5390.1000.9741.0000.9800.980
저수량(백만m3)0.5390.1000.9740.9801.0000.980
저수율0.5390.1000.9740.9800.9801.000

Missing values

2023-12-10T22:17:11.602084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:17:11.988679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

댐이름일자/시간(t)저수위(m)강우량(mm)유입량(ms)방류량(ms)저수량(백만m3)저수율
0강천보20200509002038.190109.627109.539.588109.9
1강천보20200509001038.190109.493109.669.588109.9
2강천보20200509003038.190109.69109.699.588109.9
3강천보20200509010038.190109.623110.099.588109.9
4강천보20200509004038.190109.74109.819.588109.9
5강천보20200509005038.190109.553108.979.588109.9
6강천보20200509003038.190109.69109.889.588109.9
7강천보20200509013038.190109.803109.789.588109.9
8강천보20200509014038.190109.733109.929.588109.9
9강천보20200509010038.190109.623109.6239.588109.9
댐이름일자/시간(t)저수위(m)강우량(mm)유입량(ms)방류량(ms)저수량(백만m3)저수율
90강천보20200509164038.20111.96112.049.633110.4
91강천보20200509163038.20112.0111.839.633110.4
92강천보20200509162038.20112.04112.019.633110.4
93강천보20200509161038.20136.904112.169.633110.4
94강천보20200509152038.190109.95109.879.588109.9
95강천보20200509153038.190109.91109.899.588109.9
96강천보20200509151038.190109.947109.979.588109.9
97강천보20200509150038.190109.957110.019.588109.9
98강천보20200508240038.190109.573109.699.588109.9
99강천보20200508240038.190109.573109.5739.588109.9