Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.1 KiB
Average record size in memory72.3 B

Variable types

Categorical5
Numeric3

Alerts

댐이름 has constant value ""Constant
강우량(mm) has constant value ""Constant
저수율 is highly overall correlated with 일자/시간(t) and 4 other fieldsHigh correlation
저수위(m) is highly overall correlated with 일자/시간(t) and 4 other fieldsHigh correlation
저수량(백만m3) is highly overall correlated with 일자/시간(t) and 4 other fieldsHigh correlation
일자/시간(t) is highly overall correlated with 유입량(ms) and 4 other fieldsHigh correlation
유입량(ms) is highly overall correlated with 일자/시간(t) and 4 other fieldsHigh correlation
방류량(ms) is highly overall correlated with 일자/시간(t) and 4 other fieldsHigh correlation

Reproduction

Analysis started2024-04-21 16:23:56.853043
Analysis finished2024-04-21 16:24:00.331649
Duration3.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

댐이름
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
강천보
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강천보
2nd row강천보
3rd row강천보
4th row강천보
5th row강천보

Common Values

ValueCountFrequency (%)
강천보 100
100.0%

Length

2024-04-22T01:24:00.532458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T01:24:00.818770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
강천보 100
100.0%

일자/시간(t)
Real number (ℝ)

HIGH CORRELATION 

Distinct76
Distinct (%)76.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0200407 × 1011
Minimum2.0200406 × 1011
Maximum2.0200409 × 1011
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-22T01:24:01.139981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0200406 × 1011
5-th percentile2.0200406 × 1011
Q12.0200406 × 1011
median2.0200408 × 1011
Q32.0200408 × 1011
95-th percentile2.0200409 × 1011
Maximum2.0200409 × 1011
Range29500
Interquartile range (IQR)19812.5

Descriptive statistics

Standard deviation10800.073
Coefficient of variation (CV)5.3464629 × 10-8
Kurtosis-1.4804041
Mean2.0200407 × 1011
Median Absolute Deviation (MAD)430
Skewness-0.31790515
Sum2.0200407 × 1013
Variance1.1664157 × 108
MonotonicityNot monotonic
2024-04-22T01:24:01.587985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
202004090100 2
 
2.0%
202004060930 2
 
2.0%
202004080830 2
 
2.0%
202004080730 2
 
2.0%
202004080700 2
 
2.0%
202004080530 2
 
2.0%
202004080500 2
 
2.0%
202004080400 2
 
2.0%
202004080330 2
 
2.0%
202004060900 2
 
2.0%
Other values (66) 80
80.0%
ValueCountFrequency (%)
202004060700 2
2.0%
202004060710 1
1.0%
202004060720 1
1.0%
202004060730 2
2.0%
202004060740 1
1.0%
202004060750 1
1.0%
202004060800 2
2.0%
202004060810 1
1.0%
202004060820 1
1.0%
202004060830 2
2.0%
ValueCountFrequency (%)
202004090200 1
1.0%
202004090150 1
1.0%
202004090140 1
1.0%
202004090130 2
2.0%
202004090120 1
1.0%
202004090110 1
1.0%
202004090100 2
2.0%
202004090050 1
1.0%
202004090040 1
1.0%
202004090030 1
1.0%

저수위(m)
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
38.15
40 
38.16
36 
38.14
20 
38.13
 
4

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row38.14
2nd row38.14
3rd row38.14
4th row38.14
5th row38.14

Common Values

ValueCountFrequency (%)
38.15 40
40.0%
38.16 36
36.0%
38.14 20
20.0%
38.13 4
 
4.0%

Length

2024-04-22T01:24:01.999775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T01:24:02.311207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
38.15 40
40.0%
38.16 36
36.0%
38.14 20
20.0%
38.13 4
 
4.0%

강우량(mm)
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
0
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 100
100.0%

Length

2024-04-22T01:24:02.655034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T01:24:02.941450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 100
100.0%

유입량(ms)
Real number (ℝ)

HIGH CORRELATION 

Distinct69
Distinct (%)69.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean100.09249
Minimum73.132
Maximum104.017
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-22T01:24:03.245048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum73.132
5-th percentile75.61665
Q1101.7205
median101.857
Q3103.85425
95-th percentile103.967
Maximum104.017
Range30.885
Interquartile range (IQR)2.13375

Descriptive statistics

Standard deviation7.6145632
Coefficient of variation (CV)0.07607527
Kurtosis7.3800947
Mean100.09249
Median Absolute Deviation (MAD)1.972
Skewness-2.9623661
Sum10009.249
Variance57.981573
MonotonicityNot monotonic
2024-04-22T01:24:03.668885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
101.893 4
 
4.0%
101.87 3
 
3.0%
101.77 3
 
3.0%
101.773 3
 
3.0%
99.863 2
 
2.0%
101.797 2
 
2.0%
101.8 2
 
2.0%
101.89 2
 
2.0%
101.743 2
 
2.0%
103.82 2
 
2.0%
Other values (59) 75
75.0%
ValueCountFrequency (%)
73.132 1
1.0%
73.886 1
1.0%
74.389 2
2.0%
75.04 1
1.0%
75.647 1
1.0%
76.226 1
1.0%
76.227 1
1.0%
99.71 1
1.0%
99.74 1
1.0%
99.763 1
1.0%
ValueCountFrequency (%)
104.017 2
2.0%
103.98 1
1.0%
103.973 1
1.0%
103.967 2
2.0%
103.963 1
1.0%
103.953 1
1.0%
103.937 1
1.0%
103.933 2
2.0%
103.93 2
2.0%
103.92 2
2.0%

방류량(ms)
Real number (ℝ)

HIGH CORRELATION 

Distinct76
Distinct (%)76.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean102.06945
Minimum97.86
Maximum104.11
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-22T01:24:04.079608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum97.86
5-th percentile99.6915
Q1101.6275
median101.875
Q3103.825
95-th percentile104.02
Maximum104.11
Range6.25
Interquartile range (IQR)2.1975

Descriptive statistics

Standard deviation1.6401558
Coefficient of variation (CV)0.016069018
Kurtosis-0.72119215
Mean102.06945
Median Absolute Deviation (MAD)1.9
Skewness-0.39768639
Sum10206.945
Variance2.6901112
MonotonicityNot monotonic
2024-04-22T01:24:04.520004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
104.02 4
 
4.0%
103.78 3
 
3.0%
103.88 3
 
3.0%
101.89 3
 
3.0%
99.97 2
 
2.0%
101.93 2
 
2.0%
101.8 2
 
2.0%
101.78 2
 
2.0%
101.9 2
 
2.0%
101.97 2
 
2.0%
Other values (66) 75
75.0%
ValueCountFrequency (%)
97.86 1
1.0%
98.02 1
1.0%
98.85 1
1.0%
99.5 1
1.0%
99.53 1
1.0%
99.7 1
1.0%
99.71 2
2.0%
99.77 1
1.0%
99.81 2
2.0%
99.817 1
1.0%
ValueCountFrequency (%)
104.11 1
 
1.0%
104.07 1
 
1.0%
104.06 1
 
1.0%
104.04 1
 
1.0%
104.02 4
4.0%
104.017 1
 
1.0%
103.98 2
2.0%
103.97 1
 
1.0%
103.967 1
 
1.0%
103.96 1
 
1.0%

저수량(백만m3)
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
9.407
40 
9.452
36 
9.362
20 
9.316
 
4

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row9.362
2nd row9.362
3rd row9.362
4th row9.362
5th row9.362

Common Values

ValueCountFrequency (%)
9.407 40
40.0%
9.452 36
36.0%
9.362 20
20.0%
9.316 4
 
4.0%

Length

2024-04-22T01:24:04.929835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T01:24:05.239810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
9.407 40
40.0%
9.452 36
36.0%
9.362 20
20.0%
9.316 4
 
4.0%

저수율
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size928.0 B
107.8
40 
108.3
36 
107.3
20 
106.7
 
4

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row107.3
2nd row107.3
3rd row107.3
4th row107.3
5th row107.3

Common Values

ValueCountFrequency (%)
107.8 40
40.0%
108.3 36
36.0%
107.3 20
20.0%
106.7 4
 
4.0%

Length

2024-04-22T01:24:05.584707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T01:24:05.895064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
107.8 40
40.0%
108.3 36
36.0%
107.3 20
20.0%
106.7 4
 
4.0%

Interactions

2024-04-22T01:23:58.901617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T01:23:57.187768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T01:23:57.950668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T01:23:59.156326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T01:23:57.441958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T01:23:58.203739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T01:23:59.404625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T01:23:57.694361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T01:23:58.653008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-22T01:24:06.102976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일자/시간(t)저수위(m)유입량(ms)방류량(ms)저수량(백만m3)저수율
일자/시간(t)1.0000.8010.6020.8920.8010.801
저수위(m)0.8011.0000.9570.9991.0001.000
유입량(ms)0.6020.9571.0000.9940.9570.957
방류량(ms)0.8920.9990.9941.0000.9990.999
저수량(백만m3)0.8011.0000.9570.9991.0001.000
저수율0.8011.0000.9570.9991.0001.000
2024-04-22T01:24:06.373065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
저수율저수위(m)저수량(백만m3)
저수율1.0001.0001.000
저수위(m)1.0001.0001.000
저수량(백만m3)1.0001.0001.000
2024-04-22T01:24:06.629449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일자/시간(t)유입량(ms)방류량(ms)저수위(m)저수량(백만m3)저수율
일자/시간(t)1.000-0.825-0.8600.8590.8590.859
유입량(ms)-0.8251.0000.9430.7200.7200.720
방류량(ms)-0.8600.9431.0000.9360.9360.936
저수위(m)0.8590.7200.9361.0001.0001.000
저수량(백만m3)0.8590.7200.9361.0001.0001.000
저수율0.8590.7200.9361.0001.0001.000

Missing values

2024-04-22T01:23:59.753519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-22T01:24:00.169013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

댐이름일자/시간(t)저수위(m)강우량(mm)유입량(ms)방류량(ms)저수량(백만m3)저수율
0강천보20200409010038.14099.86399.979.362107.3
1강천보20200409005038.14099.77799.919.362107.3
2강천보20200409013038.14099.81799.779.362107.3
3강천보20200409011038.14099.8699.79.362107.3
4강천보20200409012038.14099.88399.989.362107.3
5강천보20200409010038.14099.86399.8639.362107.3
6강천보20200406104038.160103.92103.889.452108.3
7강천보20200408062038.150101.747101.639.407107.8
8강천보20200409014038.14099.90799.979.362107.3
9강천보20200409015038.14099.87399.889.362107.3
댐이름일자/시간(t)저수위(m)강우량(mm)유입량(ms)방류량(ms)저수량(백만m3)저수율
90강천보20200408035038.150101.743101.729.407107.8
91강천보20200408034038.150101.713101.629.407107.8
92강천보20200408033038.150101.757101.7579.407107.8
93강천보20200406112038.160103.953103.969.452108.3
94강천보20200406103038.160103.967104.049.452108.3
95강천보20200406103038.160103.967103.9679.452108.3
96강천보20200406102038.160103.92103.849.452108.3
97강천보20200406101038.160103.85104.029.452108.3
98강천보20200409003038.14099.7199.719.362107.3
99강천보20200409004038.14099.76399.719.362107.3