Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 576.2 KiB
Average record size in memory: 59.0 B
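
The average record size follows from the two figures above it. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes (the variable names are illustrative, not taken from the report):

```python
# Figures copied from the "Dataset statistics" block above.
total_kib = 576.2   # total size in memory, in KiB
n_rows = 10_000     # number of observations

# Convert KiB to bytes, then divide by the row count.
bytes_per_row = total_kib * 1024 / n_rows
print(f"{bytes_per_row:.1f} B per record")
```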

Variable types

Categorical: 2
DateTime: 1
Numeric: 3

Dataset

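Description: Barometric pressure measurements from the AWS (automatic weather station) equipment in Gwangju-si, Gyeonggi-do. The data provide the location (위치), measurement timestamp (측정일시), measurement interval (측정주기), mean value (평균), minimum value (최소), and maximum value (최대). The measurement interval is either on the hour (H) or every 10 minutes (M). Pressure is reported in hPa (hectopascals).
Author: Gwangju-si, Gyeonggi-do (경기도 광주시)
URL: https://www.data.go.kr/data/15036888/fileData.do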

Alerts

평균 is highly overall correlated with 최소 and 1 other field [High correlation]
최소 is highly overall correlated with 평균 and 1 other field [High correlation]
최대 is highly overall correlated with 평균 and 1 other field [High correlation]
측정주기 is highly imbalanced (76.3%) [Imbalance]
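
The 76.3% imbalance figure can be reproduced from the 측정주기 value counts listed later in this report (M 9324, H 645, D 31). The sketch below assumes the score is defined as one minus the Shannon entropy divided by the maximum possible entropy; that definition is an assumption that happens to match the reported number, and `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for class counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    max_entropy = math.log2(len(counts))  # entropy of a uniform split
    return 1.0 - entropy / max_entropy if max_entropy > 0 else 0.0

# Counts for M, H and D taken from the 측정주기 section of this report.
score = imbalance_score([9324, 645, 31])
print(f"{score:.1%}")
```

A score of 0 would mean the three categories are equally frequent; a score near 1 means a single category dominates.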

Reproduction

Analysis started: 2023-12-12 15:38:49.537160
Analysis finished: 2023-12-12 15:38:51.538688
Duration: 2 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
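
The rounded duration can be checked against the two timestamps. A minimal sketch using only the standard library, with the timestamps copied from the block above:

```python
from datetime import datetime

# Start and finish times as printed in the "Reproduction" block.
started = datetime.fromisoformat("2023-12-12 15:38:49.537160")
finished = datetime.fromisoformat("2023-12-12 15:38:51.538688")

elapsed = (finished - started).total_seconds()
print(f"{elapsed:.6f} s")
```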

Variables

위치 (location)
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value            Count
곤지암읍         5555
양벌배수펌프장   4445

Length

Max length: 7
Median length: 4
Mean length: 5.3335
Min length: 4
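
The length statistics follow directly from the two category values and their counts. A minimal sketch, assuming lengths are counted in Unicode code points on precomposed Hangul syllables (the dictionary below is built by hand from the counts in this section):

```python
from statistics import median

# Category values and counts from the 위치 section.
value_counts = {"곤지암읍": 5555, "양벌배수펌프장": 4445}

# Expand to one length per observation.
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

mean_length = sum(lengths) / len(lengths)
median_length = median(lengths)
print(min(lengths), median_length, mean_length, max(lengths))
```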

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 곤지암읍
2nd row: 곤지암읍
3rd row: 곤지암읍
4th row: 곤지암읍
5th row: 곤지암읍

Common Values

Value            Count  Frequency (%)
곤지암읍         5555   55.5%
양벌배수펌프장   4445   44.5%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; same counts as the Common Values table]

측정일시 (measurement timestamp)
DateTime

Distinct: 9508
Distinct (%): 95.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2022-08-01 00:40:00
Maximum: 2023-08-24 13:10:00

[Figure: histogram with fixed size bins (bins=50)]
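
To put the timestamp range and the 50-bin histogram in perspective, the covered time span and the resulting bin width can be computed from the reported minimum and maximum. This is a sketch; equal-width bins between the minimum and maximum are an assumption about how the histogram was drawn.

```python
from datetime import datetime

# Minimum and maximum timestamps reported for the DateTime variable.
first = datetime(2022, 8, 1, 0, 40)
last = datetime(2023, 8, 24, 13, 10)

span = last - first      # total period covered by the data
bin_width = span / 50    # width of one bin if the 50 bins are equal

print(span)
print(bin_width)
```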

측정주기 (measurement interval)
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value  Count
M      9324
H      645
D      31

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: M
2nd row: M
3rd row: M
4th row: M
5th row: M

Common Values

Value  Count  Frequency (%)
M      9324   93.2%
H      645    6.5%
D      31     0.3%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; same counts as the Common Values table]

평균 (mean)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 2530
Distinct (%): 25.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100812.76
Minimum: 63850
Maximum: 103381
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 63850
5-th percentile: 99230
Q1: 100010
Median: 100850
Q3: 101560
95-th percentile: 102423
Maximum: 103381
Range: 39531
Interquartile range (IQR): 1550
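
The minimum of 63850 sits far below the 5-th percentile, which suggests a few extreme low readings. One way to make that concrete is Tukey's 1.5 × IQR fences. The rule is an assumption introduced here for illustration; the report itself does not apply it. The candidate values are the lowest 평균 values listed later in this section.

```python
# Quartiles reported for 평균.
q1, q3 = 100010, 101560
iqr = q3 - q1

# Tukey fences: values outside [lower, upper] are flagged as outliers.
lower = q1 - 1.5 * iqr
upper = q3 + 1.5 * iqr

# The four lowest 평균 values and the maximum, copied from this report.
candidates = [63850, 75403, 97930, 97940, 103381]
flagged = [v for v in candidates if v < lower or v > upper]
print(lower, upper, flagged)
```

Under this rule only the two lowest readings fall outside the fences, which is consistent with the strongly negative skewness and very large kurtosis reported for this variable.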

Descriptive statistics

Standard deviation: 1095.7728
Coefficient of variation (CV): 0.010869386
Kurtosis: 157.07627
Mean: 100812.76
Median Absolute Deviation (MAD): 770
Skewness: -5.1096466
Sum: 1.0081276 × 10^9
Variance: 1200718.1
Monotonicity: Not monotonic
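
Several of these descriptive statistics are tied together by simple identities: CV = standard deviation / mean, variance = standard deviation squared, and sum = mean × number of observations. A minimal sketch checking the reported 평균 figures against each other (small differences are expected because the report rounds its inputs):

```python
import math

# Reported values for 평균.
mean = 100812.76
std = 1095.7728
n = 10_000  # number of observations

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the squared standard deviation
total = mean * n       # sum of all observations

print(f"CV={cv:.9f}  variance={variance:.1f}  sum={total:.7e}")
```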

[Figure: histogram with fixed size bins (bins=50)]

Common values

Value                Count  Frequency (%)
100580               39     0.4%
100660               39     0.4%
100550               38     0.4%
100620               36     0.4%
101410               33     0.3%
99330                33     0.3%
99810                33     0.3%
99800                32     0.3%
99390                31     0.3%
100590               30     0.3%
Other values (2520)  9656   96.6%

Minimum 10 values

Value   Count  Frequency (%)
63850   1      < 0.1%
75403   1      < 0.1%
97930   1      < 0.1%
97940   1      < 0.1%
97960   1      < 0.1%
98050   1      < 0.1%
98080   1      < 0.1%
98110   2      < 0.1%
98120   1      < 0.1%
98130   1      < 0.1%

Maximum 10 values

Value   Count  Frequency (%)
103381  1      < 0.1%
103369  1      < 0.1%
103329  1      < 0.1%
103323  1      < 0.1%
103314  1      < 0.1%
103295  1      < 0.1%
103270  1      < 0.1%
103229  1      < 0.1%
103220  1      < 0.1%
103218  1      < 0.1%

최소 (minimum)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 965
Distinct (%): 9.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100794.59
Minimum: 63850
Maximum: 103379
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 63850
5-th percentile: 99228
Q1: 100010
Median: 100850
Q3: 101550
95-th percentile: 102410.35
Maximum: 103379
Range: 39529
Interquartile range (IQR): 1540

Descriptive statistics

Standard deviation: 1294.3181
Coefficient of variation (CV): 0.012841146
Kurtosis: 329.93325
Mean: 100794.59
Median Absolute Deviation (MAD): 770
Skewness: -11.628964
Sum: 1.0079459 × 10^9
Variance: 1675259.3
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]

Common values

Value               Count  Frequency (%)
101300              48     0.5%
101200              48     0.5%
99650               41     0.4%
100550              41     0.4%
100850              40     0.4%
101400              40     0.4%
100800              40     0.4%
101850              39     0.4%
100750              39     0.4%
101450              39     0.4%
Other values (955)  9585   95.9%

Minimum 10 values

Value   Count  Frequency (%)
63850   5      0.1%
97930   1      < 0.1%
97940   1      < 0.1%
97960   1      < 0.1%
98050   1      < 0.1%
98080   1      < 0.1%
98110   2      < 0.1%
98120   1      < 0.1%
98130   1      < 0.1%
98230   2      < 0.1%

Maximum 10 values

Value   Count  Frequency (%)
103379  1      < 0.1%
103369  1      < 0.1%
103329  1      < 0.1%
103308  2      < 0.1%
103290  1      < 0.1%
103269  1      < 0.1%
103219  2      < 0.1%
103208  2      < 0.1%
103200  2      < 0.1%
103190  1      < 0.1%

최대 (maximum)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 955
Distinct (%): 9.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100821.5
Minimum: 63850
Maximum: 103440
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 63850
5-th percentile: 99230
Q1: 100019
Median: 100859
Q3: 101569
95-th percentile: 102440
Maximum: 103440
Range: 39590
Interquartile range (IQR): 1550

Descriptive statistics

Standard deviation: 1069.3956
Coefficient of variation (CV): 0.010606821
Kurtosis: 141.73071
Mean: 100821.5
Median Absolute Deviation (MAD): 770
Skewness: -4.1602221
Sum: 1.008215 × 10^9
Variance: 1143606.9
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]

Common values

Value               Count  Frequency (%)
101450              47     0.5%
100550              45     0.4%
101200              44     0.4%
101250              42     0.4%
101000              42     0.4%
101350              40     0.4%
100800              39     0.4%
101400              39     0.4%
100750              39     0.4%
99800               39     0.4%
Other values (945)  9584   95.8%

Minimum 10 values

Value   Count  Frequency (%)
63850   1      < 0.1%
97930   1      < 0.1%
97940   1      < 0.1%
97960   1      < 0.1%
98050   1      < 0.1%
98080   1      < 0.1%
98110   2      < 0.1%
98120   1      < 0.1%
98130   1      < 0.1%
98230   2      < 0.1%

Maximum 10 values

Value   Count  Frequency (%)
103440  1      < 0.1%
103390  1      < 0.1%
103369  1      < 0.1%
103340  1      < 0.1%
103329  2      < 0.1%
103319  1      < 0.1%
103300  1      < 0.1%
103279  1      < 0.1%
103240  1      < 0.1%
103229  1      < 0.1%

Interactions

[Figures: nine pairwise interaction plots]

Correlations

[Figure: correlation plot]

          위치    측정주기  평균    최소    최대
위치      1.000   0.183     0.142   0.359   0.386
측정주기  0.183   1.000     0.154   0.040   0.046
평균      0.142   0.154     1.000   1.000   0.669
최소      0.359   0.040     1.000   1.000   0.993
최대      0.386   0.046     0.669   0.993   1.000

[Figure: correlation plot]

          측정주기  위치
측정주기  1.000     0.301
위치      0.301     1.000

[Figure: correlation plot]

          평균    최소    최대    위치    측정주기
평균      1.000   0.999   0.999   0.234   0.046
최소      0.999   1.000   0.997   0.235   0.176
최대      0.999   0.997   1.000   0.253   0.053
위치      0.234   0.235   0.253   1.000   0.301
측정주기  0.046   0.176   0.053   0.301   1.000
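
As a rough illustration of why 평균 and 최대 score close to 1, the sketch below computes Pearson's r on ten rows taken from the Sample section of this report. Pearson's r is an assumption made for the example: the extracted text does not say which coefficient each matrix uses, and ten rows will not reproduce figures computed on all 10000 observations. `pearson` is a hypothetical helper.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    norm_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (norm_x * norm_y)

# 평균 and 최대 from the first ten rows of the Sample section.
mean_vals = [100010, 100390, 100590, 99690, 100850,
             100580, 101090, 101070, 102126, 100260]
max_vals = [100010, 100390, 100590, 99690, 100850,
            100580, 101090, 101070, 102128, 100260]

r = pearson(mean_vals, max_vals)
print(f"{r:.6f}")
```

In most of these rows the mean, minimum and maximum are identical, so the three pressure columns carry nearly the same information, which is what the high-correlation alerts point at.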

Missing values

[Figure: nullity count by column]
A simple visualization of nullity by column.

[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

(index)  위치            측정일시          측정주기  평균    최소    최대
40147    곤지암읍        2023-05-07 19:00  M         100010  100010  100010
10211    곤지암읍        2022-10-10 23:20  M         100390  100390  100390
15392    곤지암읍        2022-11-16 00:00  M         100590  100590  100590
41122    곤지암읍        2023-05-14 13:40  M         99690   99690   99690
36086    곤지암읍        2023-04-09 10:40  M         100850  100850  100850
8234     곤지암읍        2022-09-27 05:30  M         100580  100580  100580
23189    곤지암읍        2023-01-09 06:10  M         101090  101090  101090
18417    곤지암읍        2022-12-07 00:50  M         101070  101070  101070
70721    양벌배수펌프장  2022-10-29 00:10  M         102126  102119  102128
51965    곤지암읍        2023-07-29 20:10  M         100260  100260  100260

(index)  위치            측정일시          측정주기  평균    최소    최대
66596    양벌배수펌프장  2022-10-04 14:30  M         100519  100519  100519
10233    곤지암읍        2022-10-11 03:00  M         100610  100610  100610
66536    양벌배수펌프장  2022-10-04 06:00  M         100122  100109  100139
45594    곤지암읍        2023-06-14 16:20  M         99290   99290   99290
84872    양벌배수펌프장  2023-01-20 17:20  M         102502  102500  102508
95138    양벌배수펌프장  2023-03-22 11:00  M         100101  100089  100119
6260     곤지암읍        2022-09-13 12:20  M         100400  100400  100400
40645    곤지암읍        2023-05-11 06:00  M         100730  100730  100730
25233    곤지암읍        2023-01-23 12:30  M         100990  100990  100990
77681    양벌배수펌프장  2022-12-09 04:20  M         102052  102039  102059
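
A natural sanity check on rows like these is that each minimum should not exceed the mean, and each mean should not exceed the maximum. The sketch below applies that check to the second sample table; the tuple layout and the name `violations` are illustrative choices, not part of the dataset.

```python
# (index, 평균, 최소, 최대) copied from the second sample table above.
rows = [
    (66596, 100519, 100519, 100519),
    (10233, 100610, 100610, 100610),
    (66536, 100122, 100109, 100139),
    (45594, 99290, 99290, 99290),
    (84872, 102502, 102500, 102508),
    (95138, 100101, 100089, 100119),
    (6260, 100400, 100400, 100400),
    (40645, 100730, 100730, 100730),
    (25233, 100990, 100990, 100990),
    (77681, 102052, 102039, 102059),
]

# Collect the index of every row where min <= mean <= max does not hold.
violations = [idx for idx, mean, low, high in rows if not (low <= mean <= high)]
print(violations)
```

An empty list means every sampled row is internally consistent; the same check could be run over the full file to locate suspect readings.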