Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory576.2 KiB
Average record size in memory59.0 B

Variable types

Categorical2
DateTime1
Numeric3

Dataset

Description경기도 광주시 AWS(자동기상관측장비)의 기온 측정 정보에 대한 데이터로 위치, 측정 일시, 측정 주기, 측정값 등을 제공합니다. 측정주기는 매시정각(H) 및 10분간격(M)을 의미합니다.
Author경기도 광주시
URLhttps://www.data.go.kr/data/15036891/fileData.do

Alerts

평균값 is highly overall correlated with 최소값 and 1 other fieldsHigh correlation
최소값 is highly overall correlated with 평균값 and 1 other fieldsHigh correlation
최대값 is highly overall correlated with 평균값 and 1 other fieldsHigh correlation
측정주기 is highly imbalanced (68.0%)Imbalance
평균값 is highly skewed (γ1 = -95.7906839)Skewed
최소값 is highly skewed (γ1 = -70.01355448)Skewed
최대값 is highly skewed (γ1 = -98.07715016)Skewed

Reproduction

Analysis started2023-12-12 05:50:37.577724
Analysis finished2023-12-12 05:50:39.690451
Duration2.11 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

위치
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
곤지암읍
6484 
양벌배수펌프
3516 

Length

Max length6
Median length4
Mean length4.7032
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row곤지암읍
2nd row곤지암읍
3rd row곤지암읍
4th row곤지암읍
5th row양벌배수펌프

Common Values

ValueCountFrequency (%)
곤지암읍 6484
64.8%
양벌배수펌프 3516
35.2%

Length

2023-12-12T14:50:39.767892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:50:39.893198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
곤지암읍 6484
64.8%
양벌배수펌프 3516
35.2%
Distinct9451
Distinct (%)94.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-08-01 00:40:00
Maximum2023-08-24 15:40:00
2023-12-12T14:50:40.017428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:50:40.212905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

측정주기
Categorical

IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
M
8532 
H
1407 
D
 
60
N
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowM
2nd rowM
3rd rowM
4th rowM
5th rowM

Common Values

ValueCountFrequency (%)
M 8532
85.3%
H 1407
 
14.1%
D 60
 
0.6%
N 1
 
< 0.1%

Length

2023-12-12T14:50:40.371845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:50:40.577478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
m 8532
85.3%
h 1407
 
14.1%
d 60
 
0.6%
n 1
 
< 0.1%

평균값
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct2868
Distinct (%)28.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1066.6733
Minimum-999900
Maximum3497
Zeros25
Zeros (%)0.2%
Negative1951
Negative (%)19.5%
Memory size166.0 KiB
2023-12-12T14:50:40.722629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-999900
5-th percentile-689.05
Q1239
median1270
Q32184
95-th percentile2828.05
Maximum3497
Range1003397
Interquartile range (IQR)1945

Descriptive statistics

Standard deviation10163.13
Coefficient of variation (CV)9.5278754
Kurtosis9416.0649
Mean1066.6733
Median Absolute Deviation (MAD)966
Skewness-95.790684
Sum10666733
Variance1.0328922 × 108
MonotonicityNot monotonic
2023-12-12T14:50:40.898499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2320 52
 
0.5%
2450 48
 
0.5%
2220 46
 
0.5%
2400 46
 
0.5%
2300 45
 
0.4%
2170 45
 
0.4%
2420 43
 
0.4%
2200 42
 
0.4%
2270 40
 
0.4%
1820 37
 
0.4%
Other values (2858) 9556
95.6%
ValueCountFrequency (%)
-999900 1
< 0.1%
-131916 1
< 0.1%
-1820 1
< 0.1%
-1800 1
< 0.1%
-1740 1
< 0.1%
-1710 2
< 0.1%
-1703 1
< 0.1%
-1682 1
< 0.1%
-1680 2
< 0.1%
-1672 1
< 0.1%
ValueCountFrequency (%)
3497 1
 
< 0.1%
3490 2
< 0.1%
3480 1
 
< 0.1%
3460 2
< 0.1%
3450 2
< 0.1%
3430 2
< 0.1%
3420 3
< 0.1%
3410 1
 
< 0.1%
3400 1
 
< 0.1%
3390 1
 
< 0.1%

최소값
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct966
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean966.3488
Minimum-999900
Maximum3490
Zeros37
Zeros (%)0.4%
Negative1986
Negative (%)19.9%
Memory size166.0 KiB
2023-12-12T14:50:41.062388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-999900
5-th percentile-708
Q1220
median1260
Q32170
95-th percentile2810
Maximum3490
Range1003390
Interquartile range (IQR)1950

Descriptive statistics

Standard deviation14202.62
Coefficient of variation (CV)14.697198
Kurtosis4932.8786
Mean966.3488
Median Absolute Deviation (MAD)960
Skewness-70.013554
Sum9663488
Variance2.0171441 × 108
MonotonicityNot monotonic
2023-12-12T14:50:41.246065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2400 61
 
0.6%
2300 60
 
0.6%
2450 58
 
0.6%
2320 55
 
0.5%
2200 53
 
0.5%
2220 53
 
0.5%
2170 52
 
0.5%
2420 51
 
0.5%
2600 48
 
0.5%
2350 45
 
0.4%
Other values (956) 9464
94.6%
ValueCountFrequency (%)
-999900 2
< 0.1%
-1820 1
 
< 0.1%
-1800 1
 
< 0.1%
-1740 1
 
< 0.1%
-1710 3
< 0.1%
-1700 1
 
< 0.1%
-1680 3
< 0.1%
-1660 1
 
< 0.1%
-1640 1
 
< 0.1%
-1638 1
 
< 0.1%
ValueCountFrequency (%)
3490 2
< 0.1%
3480 1
 
< 0.1%
3460 2
< 0.1%
3450 2
< 0.1%
3430 2
< 0.1%
3420 4
< 0.1%
3410 1
 
< 0.1%
3400 1
 
< 0.1%
3390 1
 
< 0.1%
3380 2
< 0.1%

최대값
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct966
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1094.5714
Minimum-999900
Maximum3570
Zeros48
Zeros (%)0.5%
Negative1902
Negative (%)19.0%
Memory size166.0 KiB
2023-12-12T14:50:41.451430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-999900
5-th percentile-680
Q1250
median1288
Q32200
95-th percentile2850
Maximum3570
Range1003470
Interquartile range (IQR)1950

Descriptive statistics

Standard deviation10075.944
Coefficient of variation (CV)9.2053787
Kurtosis9744.3793
Mean1094.5714
Median Absolute Deviation (MAD)962
Skewness-98.07715
Sum10945714
Variance1.0152465 × 108
MonotonicityNot monotonic
2023-12-12T14:50:41.630732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2300 59
 
0.6%
2400 58
 
0.6%
2320 58
 
0.6%
2200 57
 
0.6%
2450 57
 
0.6%
2220 52
 
0.5%
2500 51
 
0.5%
0 48
 
0.5%
2170 47
 
0.5%
2420 47
 
0.5%
Other values (956) 9466
94.7%
ValueCountFrequency (%)
-999900 1
 
< 0.1%
-1820 1
 
< 0.1%
-1800 1
 
< 0.1%
-1740 1
 
< 0.1%
-1710 2
< 0.1%
-1700 1
 
< 0.1%
-1680 2
< 0.1%
-1660 3
< 0.1%
-1620 1
 
< 0.1%
-1610 1
 
< 0.1%
ValueCountFrequency (%)
3570 1
 
< 0.1%
3490 2
< 0.1%
3480 1
 
< 0.1%
3460 2
< 0.1%
3450 2
< 0.1%
3430 3
< 0.1%
3420 3
< 0.1%
3410 1
 
< 0.1%
3400 1
 
< 0.1%
3390 2
< 0.1%

Interactions

2023-12-12T14:50:39.170698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:50:38.198127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:50:38.818138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:50:39.275493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:50:38.301416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:50:38.948433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:50:39.379903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:50:38.703902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:50:39.056301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:50:41.741819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위치측정주기평균값최소값최대값
위치1.0000.0300.000NaNNaN
측정주기0.0301.0000.027NaNNaN
평균값0.0000.0271.000NaNNaN
최소값NaNNaNNaN1.000NaN
최대값NaNNaNNaNNaN1.000
2023-12-12T14:50:41.855498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정주기위치
측정주기1.0000.020
위치0.0201.000
2023-12-12T14:50:41.959098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
평균값최소값최대값위치측정주기
평균값1.0000.9990.9990.0130.004
최소값0.9991.0000.9960.0060.000
최대값0.9990.9961.0000.0000.000
위치0.0130.0060.0001.0000.020
측정주기0.0040.0000.0000.0201.000

Missing values

2023-12-12T14:50:39.515388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:50:39.642030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

위치측정일시측정주기평균값최소값최대값
28428곤지암읍2023-01-16 09:50M-450-450-450
5985곤지암읍2022-09-05 10:30M197019701970
46956곤지암읍2023-05-06 15:20M112011201120
24272곤지암읍2022-12-22 17:40M-780-780-780
78646양벌배수펌프2022-10-18 11:20M125012301268
98377양벌배수펌프2023-02-12 04:40M-299-308-289
80550양벌배수펌프2022-10-29 17:50M157115591579
56887곤지암읍2023-07-04 11:30M290029002900
69554양벌배수펌프2022-08-25 16:30M226722602269
51191곤지암읍2023-05-31 17:40M271027102710
위치측정일시측정주기평균값최소값최대값
43597곤지암읍2023-04-16 16:40M153015301530
57209곤지암읍2023-07-06 09:10M263026302630
98501양벌배수펌프2023-02-12 22:20M618608629
74047양벌배수펌프2022-09-21 06:20M126212591268
61939곤지암읍2023-08-04 06:00M235023502350
44000곤지암읍2023-04-19 02:00H122712201240
97845양벌배수펌프2023-02-09 01:00H-305-338-257
56694곤지암읍2023-07-03 08:00H243523502530
51810곤지암읍2023-06-04 09:50M224022402240
83658양벌배수펌프2022-11-17 03:00M408408408