Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory400.4 KiB
Average record size in memory41.0 B

Variable types

Categorical2
DateTime1
Numeric1

Dataset

Description경기도 광주시 AWS(자동기상관측장비)의 강우 및 수위 관련 정보(비, 날씨 등)에 대한 데이터로 위치, 측정 일시, 측정 주기, 측정값 등을 제공합니다. 측정주기는 매시정각(H) 및 10분간격(M)을 의미합니다.
Author경기도 광주시
URLhttps://www.data.go.kr/data/15036890/fileData.do

Alerts

측정값 is highly overall correlated with 측정주기High correlation
측정주기 is highly overall correlated with 측정값High correlation
측정주기 is highly imbalanced (71.8%)Imbalance
측정값 is highly skewed (γ1 = 88.32751644)Skewed
측정값 has 9667 (96.7%) zerosZeros

Reproduction

Analysis started2023-12-12 04:32:54.185527
Analysis finished2023-12-12 04:32:54.679747
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

위치
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
초월읍
6881 
오포읍
3119 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row초월읍
2nd row오포읍
3rd row초월읍
4th row오포읍
5th row오포읍

Common Values

ValueCountFrequency (%)
초월읍 6881
68.8%
오포읍 3119
31.2%

Length

2023-12-12T13:32:54.766899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:32:54.907261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
초월읍 6881
68.8%
오포읍 3119
31.2%

측정주기
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
M
8497 
H
1433 
D
 
65
N
 
4
Y
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowM
2nd rowH
3rd rowM
4th rowM
5th rowM

Common Values

ValueCountFrequency (%)
M 8497
85.0%
H 1433
 
14.3%
D 65
 
0.7%
N 4
 
< 0.1%
Y 1
 
< 0.1%

Length

2023-12-12T13:32:55.002038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:32:55.134345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m 8497
85.0%
h 1433
 
14.3%
d 65
 
0.7%
n 4
 
< 0.1%
y 1
 
< 0.1%
Distinct9414
Distinct (%)94.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-08-01 02:00:00
Maximum2023-08-24 14:30:00
2023-12-12T13:32:55.254126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:32:55.407777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

측정값
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct41
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.775
Minimum0
Maximum110500
Zeros9667
Zeros (%)96.7%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T13:32:55.564883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum110500
Range110500
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1155.733
Coefficient of variation (CV)41.610551
Kurtosis8363.9824
Mean27.775
Median Absolute Deviation (MAD)0
Skewness88.327516
Sum277750
Variance1335718.9
MonotonicityNot monotonic
2023-12-12T13:32:55.765137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
0 9667
96.7%
50 152
 
1.5%
100 58
 
0.6%
150 29
 
0.3%
200 23
 
0.2%
250 8
 
0.1%
300 6
 
0.1%
450 5
 
0.1%
500 5
 
0.1%
400 4
 
< 0.1%
Other values (31) 43
 
0.4%
ValueCountFrequency (%)
0 9667
96.7%
50 152
 
1.5%
100 58
 
0.6%
150 29
 
0.3%
200 23
 
0.2%
250 8
 
0.1%
300 6
 
0.1%
350 2
 
< 0.1%
400 4
 
< 0.1%
450 5
 
0.1%
ValueCountFrequency (%)
110500 1
< 0.1%
19900 1
< 0.1%
14450 1
< 0.1%
10000 1
< 0.1%
9800 1
< 0.1%
9250 1
< 0.1%
8750 1
< 0.1%
5350 1
< 0.1%
5050 1
< 0.1%
4750 1
< 0.1%

Interactions

2023-12-12T13:32:54.411276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:32:55.865641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위치측정주기측정값
위치1.0000.0000.004
측정주기0.0001.0000.833
측정값0.0040.8331.000
2023-12-12T13:32:55.978995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위치측정주기
위치1.0000.000
측정주기0.0001.000
2023-12-12T13:32:56.095586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정값위치측정주기
측정값1.0000.0070.866
위치0.0071.0000.000
측정주기0.8660.0001.000

Missing values

2023-12-12T13:32:54.540318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:32:54.632725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

위치측정주기측정일시측정값
9888초월읍M2022-09-28 12:300
70734오포읍H2022-08-30 22:000
2833초월읍M2022-08-17 18:400
95226오포읍M2023-01-22 19:100
78916오포읍M2022-10-18 07:300
88688오포읍H2022-12-15 03:000
9101초월읍M2022-09-23 20:400
42043초월읍M2023-04-06 18:300
44897초월읍M2023-04-23 15:500
83256오포읍M2022-11-12 23:500
위치측정주기측정일시측정값
35488초월읍M2023-02-27 00:000
74556오포읍M2022-09-22 12:300
14769초월읍M2022-10-27 09:300
79082오포읍M2022-10-19 07:000
10693초월읍M2022-10-03 06:40100
77000오포읍M2022-10-06 23:300
85868오포읍M2022-11-28 10:400
79924오포읍M2022-10-24 06:400
72960오포읍M2022-09-13 01:500
15259초월읍H2022-10-30 07:000