Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory576.2 KiB
Average record size in memory59.0 B

Variable types

Categorical2
DateTime1
Numeric3

Dataset

Description경기도 광주시 AWS(자동기상관측장비) 습도계의 대기습도 측정값으로 위치, 측정일시, 측정주기, 평균값, 최소값, 최대값 등의 항목을 제공합니다.
Author경기도 광주시
URLhttps://www.data.go.kr/data/15036889/fileData.do

Alerts

평균값 is highly overall correlated with 최소값 and 1 other fieldsHigh correlation
최소값 is highly overall correlated with 평균값 and 1 other fieldsHigh correlation
최대값 is highly overall correlated with 평균값 and 1 other fieldsHigh correlation
측정주기 is highly imbalanced (76.2%)Imbalance
평균값 is highly skewed (γ1 = -67.04107151)Skewed
최소값 is highly skewed (γ1 = -44.0746841)Skewed
최대값 is highly skewed (γ1 = -93.53523196)Skewed

Reproduction

Analysis started2023-12-12 21:40:26.481855
Analysis finished2023-12-12 21:40:28.262656
Duration1.78 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

위치
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
곤지암읍
5539 
양벌배수펌프
4461 

Length

Max length6
Median length4
Mean length4.8922
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row곤지암읍
2nd row곤지암읍
3rd row양벌배수펌프
4th row곤지암읍
5th row곤지암읍

Common Values

ValueCountFrequency (%)
곤지암읍 5539
55.4%
양벌배수펌프 4461
44.6%

Length

2023-12-13T06:40:28.322662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:40:28.411729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
곤지암읍 5539
55.4%
양벌배수펌프 4461
44.6%
Distinct9513
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-08-01 01:20:00
Maximum2023-08-24 15:50:00
2023-12-13T06:40:28.525127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:40:28.656756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

측정주기
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
M
9326 
H
 
637
D
 
37

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowM
2nd rowM
3rd rowM
4th rowM
5th rowM

Common Values

ValueCountFrequency (%)
M 9326
93.3%
H 637
 
6.4%
D 37
 
0.4%

Length

2023-12-13T06:40:28.760787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:40:28.855557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m 9326
93.3%
h 637
 
6.4%
d 37
 
0.4%

평균값
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct3662
Distinct (%)36.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6723.3964
Minimum-999900
Maximum10000
Zeros0
Zeros (%)0.0%
Negative5
Negative (%)< 0.1%
Memory size166.0 KiB
2023-12-13T06:40:28.951351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-999900
5-th percentile2865.95
Q15360
median7340
Q38800
95-th percentile9697
Maximum10000
Range1009900
Interquartile range (IQR)3440

Descriptive statistics

Standard deviation13606.101
Coefficient of variation (CV)2.0236947
Kurtosis4691.1577
Mean6723.3964
Median Absolute Deviation (MAD)1620
Skewness-67.041072
Sum67233964
Variance1.8512599 × 108
MonotonicityNot monotonic
2023-12-13T06:40:29.071487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9990 202
 
2.0%
9140 30
 
0.3%
10000 29
 
0.3%
9160 27
 
0.3%
8590 24
 
0.2%
9150 24
 
0.2%
8820 24
 
0.2%
8940 23
 
0.2%
8830 23
 
0.2%
8580 22
 
0.2%
Other values (3652) 9572
95.7%
ValueCountFrequency (%)
-999900 1
< 0.1%
-866168 1
< 0.1%
-114201 1
< 0.1%
-91775 1
< 0.1%
-60969 1
< 0.1%
900 1
< 0.1%
910 1
< 0.1%
1030 1
< 0.1%
1140 1
< 0.1%
1170 1
< 0.1%
ValueCountFrequency (%)
10000 29
 
0.3%
9990 202
2.0%
9989 1
 
< 0.1%
9988 2
 
< 0.1%
9987 4
 
< 0.1%
9986 4
 
< 0.1%
9984 2
 
< 0.1%
9983 2
 
< 0.1%
9980 2
 
< 0.1%
9979 1
 
< 0.1%

최소값
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct1520
Distinct (%)15.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6384.547
Minimum-999900
Maximum10000
Zeros0
Zeros (%)0.0%
Negative5
Negative (%)< 0.1%
Memory size166.0 KiB
2023-12-13T06:40:29.204451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-999900
5-th percentile2800
Q15300
median7270
Q38770
95-th percentile9680
Maximum10000
Range1009900
Interquartile range (IQR)3470

Descriptive statistics

Standard deviation22612.958
Coefficient of variation (CV)3.5418265
Kurtosis1959.1291
Mean6384.547
Median Absolute Deviation (MAD)1651.5
Skewness-44.074684
Sum63845470
Variance5.1134587 × 108
MonotonicityNot monotonic
2023-12-13T06:40:29.349508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9990 202
 
2.0%
9140 44
 
0.4%
6969 39
 
0.4%
8940 37
 
0.4%
9150 35
 
0.4%
8990 34
 
0.3%
9000 33
 
0.3%
8590 31
 
0.3%
9500 30
 
0.3%
9040 30
 
0.3%
Other values (1510) 9485
94.8%
ValueCountFrequency (%)
-999900 5
0.1%
900 1
 
< 0.1%
910 1
 
< 0.1%
1030 1
 
< 0.1%
1130 1
 
< 0.1%
1140 1
 
< 0.1%
1170 1
 
< 0.1%
1189 1
 
< 0.1%
1200 2
 
< 0.1%
1210 1
 
< 0.1%
ValueCountFrequency (%)
10000 29
 
0.3%
9990 202
2.0%
9979 8
 
0.1%
9969 10
 
0.1%
9968 1
 
< 0.1%
9958 4
 
< 0.1%
9957 1
 
< 0.1%
9950 12
 
0.1%
9940 15
 
0.1%
9929 8
 
0.1%

최대값
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct1478
Distinct (%)14.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6888.7952
Minimum-999900
Maximum10000
Zeros0
Zeros (%)0.0%
Negative1
Negative (%)< 0.1%
Memory size166.0 KiB
2023-12-13T06:40:29.459235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-999900
5-th percentile2937.6
Q15428
median7420
Q38830
95-th percentile9740
Maximum10000
Range1009900
Interquartile range (IQR)3402

Descriptive statistics

Standard deviation10295.867
Coefficient of variation (CV)1.4945817
Kurtosis9146.6819
Mean6888.7952
Median Absolute Deviation (MAD)1600
Skewness-93.535232
Sum68887952
Variance1.0600488 × 108
MonotonicityNot monotonic
2023-12-13T06:40:29.592790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9990 246
 
2.5%
9150 37
 
0.4%
9140 37
 
0.4%
9200 37
 
0.4%
6969 36
 
0.4%
8940 35
 
0.4%
8590 33
 
0.3%
9300 31
 
0.3%
9040 31
 
0.3%
9390 30
 
0.3%
Other values (1468) 9447
94.5%
ValueCountFrequency (%)
-999900 1
< 0.1%
900 1
< 0.1%
910 1
< 0.1%
1030 1
< 0.1%
1140 1
< 0.1%
1170 1
< 0.1%
1200 2
< 0.1%
1210 1
< 0.1%
1220 1
< 0.1%
1240 1
< 0.1%
ValueCountFrequency (%)
10000 29
 
0.3%
9990 246
2.5%
9979 11
 
0.1%
9969 12
 
0.1%
9958 10
 
0.1%
9957 1
 
< 0.1%
9950 11
 
0.1%
9940 10
 
0.1%
9929 7
 
0.1%
9919 5
 
0.1%

Interactions

2023-12-13T06:40:27.826537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:40:27.271443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:40:27.545963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:40:27.914935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:40:27.361705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:40:27.649526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:40:28.027912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:40:27.451061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:40:27.742674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:40:29.672450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위치측정주기평균값최소값최대값
위치1.0000.1820.008NaNNaN
측정주기0.1821.0000.098NaNNaN
평균값0.0080.0981.000NaNNaN
최소값NaNNaNNaN1.000NaN
최대값NaNNaNNaNNaN1.000
2023-12-13T06:40:29.748569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정주기위치
측정주기1.0000.299
위치0.2991.000
2023-12-13T06:40:29.814558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
평균값최소값최대값위치측정주기
평균값1.0000.9970.9950.0140.028
최소값0.9971.0000.9840.0180.047
최대값0.9950.9841.0000.0000.000
위치0.0140.0180.0001.0000.299
측정주기0.0280.0470.0000.2991.000

Missing values

2023-12-13T06:40:28.138713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:40:28.224426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

위치측정일시측정주기평균값최소값최대값
6636곤지암읍2022-09-16 03:00M671067106710
8858곤지암읍2022-10-01 13:30M372037203720
78245양벌배수펌프2022-12-12 12:30M470846594739
2470곤지암읍2022-08-18 04:20M913091309130
7778곤지암읍2022-09-24 01:30M860086008600
67645양벌배수펌프2022-10-10 19:30M740173697450
47738곤지암읍2023-06-29 14:10M927092709270
70856양벌배수펌프2022-10-29 19:30M750074797540
6794곤지암읍2022-09-17 05:20M916091609160
84773양벌배수펌프2023-01-20 03:10M784778007958
위치측정일시측정주기평균값최소값최대값
29792곤지암읍2023-02-24 09:20M714071407140
33775곤지암읍2023-03-24 04:50M717071707170
83835양벌배수펌프2023-01-14 14:00H999099909990
47942곤지암읍2023-07-01 00:10M929092909290
13565곤지암읍2022-11-03 07:00M842084208420
78483양벌배수펌프2022-12-13 22:20M601759786050
86588양벌배수펌프2023-01-30 21:00H618557686500
7886곤지암읍2022-09-24 19:30M699069906990
83339양벌배수펌프2023-01-11 15:40M392439003959
90276양벌배수펌프2023-02-21 16:40M228022382319