Overview

Dataset statistics

Number of variables12
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.7 KiB
Average record size in memory109.3 B

Variable types

Numeric4
Categorical8

Dataset

Description샘플 데이터
Author한국기상산업기술원
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=a3c04d80-a186-11ea-abba-b94a7d658096

Alerts

j has constant value ""Constant
1시간기온 has constant value ""Constant
1시간강수량 has constant value ""Constant
강수형태 has constant value ""Constant
풍향 has constant value ""Constant
습도 has constant value ""Constant
뇌전 has constant value ""Constant
i is highly overall correlated with 풍속 and 2 other fieldsHigh correlation
풍속 is highly overall correlated with i and 2 other fieldsHigh correlation
남북바람성분 is highly overall correlated with i and 2 other fieldsHigh correlation
하늘형태 is highly overall correlated with i and 2 other fieldsHigh correlation
하늘형태 is highly imbalanced (58.2%)Imbalance
i has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:37:18.834284
Analysis finished2023-12-10 10:37:23.142925
Duration4.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

i
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:37:23.296409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T19:37:23.602236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

j
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 100
100.0%

Length

2023-12-10T19:37:23.892230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:37:24.064214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

1시간기온
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-50
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-50
2nd row-50
3rd row-50
4th row-50
5th row-50

Common Values

ValueCountFrequency (%)
-50 100
100.0%

Length

2023-12-10T19:37:24.292123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:37:24.517951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
50 100
100.0%

1시간강수량
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-1
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-1
2nd row-1
3rd row-1
4th row-1
5th row-1

Common Values

ValueCountFrequency (%)
-1 100
100.0%

Length

2023-12-10T19:37:24.734299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:37:24.912413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

강수형태
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 100
100.0%

Length

2023-12-10T19:37:25.085557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:37:25.352006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 100
100.0%

하늘형태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
85 
3
14 
4
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row3
2nd row3
3rd row1
4th row3
5th row3

Common Values

ValueCountFrequency (%)
1 85
85.0%
3 14
 
14.0%
4 1
 
1.0%

Length

2023-12-10T19:37:25.530842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:37:25.718798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 85
85.0%
3 14
 
14.0%
4 1
 
1.0%

풍향
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
267
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row267
2nd row267
3rd row267
4th row267
5th row267

Common Values

ValueCountFrequency (%)
267 100
100.0%

Length

2023-12-10T19:37:25.923352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:37:26.139677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
267 100
100.0%

풍속
Real number (ℝ)

HIGH CORRELATION 

Distinct57
Distinct (%)57.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.825
Minimum2.7
Maximum9.2
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:37:26.393100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.7
5-th percentile2.9
Q13.375
median3.9
Q36.15
95-th percentile8.605
Maximum9.2
Range6.5
Interquartile range (IQR)2.775

Descriptive statistics

Standard deviation1.9054613
Coefficient of variation (CV)0.39491426
Kurtosis-0.48528433
Mean4.825
Median Absolute Deviation (MAD)0.85
Skewness0.93036186
Sum482.5
Variance3.6307828
MonotonicityNot monotonic
2023-12-10T19:37:26.849506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.9 12
 
12.0%
2.9 5
 
5.0%
3.0 5
 
5.0%
3.3 5
 
5.0%
3.7 4
 
4.0%
3.2 4
 
4.0%
3.4 4
 
4.0%
3.8 4
 
4.0%
3.1 3
 
3.0%
3.6 3
 
3.0%
Other values (47) 51
51.0%
ValueCountFrequency (%)
2.7 1
 
1.0%
2.8 2
 
2.0%
2.9 5
5.0%
3.0 5
5.0%
3.1 3
3.0%
3.2 4
4.0%
3.3 5
5.0%
3.4 4
4.0%
3.5 2
 
2.0%
3.6 3
3.0%
ValueCountFrequency (%)
9.2 1
1.0%
9.1 1
1.0%
9.0 1
1.0%
8.8 1
1.0%
8.7 1
1.0%
8.6 1
1.0%
8.5 1
1.0%
8.4 1
1.0%
8.3 1
1.0%
8.2 1
1.0%

동서바람성분
Real number (ℝ)

Distinct12
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-3.125
Minimum-3.6
Maximum-2.5
Zeros0
Zeros (%)0.0%
Negative100
Negative (%)100.0%
Memory size1.0 KiB
2023-12-10T19:37:27.472362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-3.6
5-th percentile-3.6
Q1-3.4
median-3.1
Q3-2.875
95-th percentile-2.7
Maximum-2.5
Range1.1
Interquartile range (IQR)0.525

Descriptive statistics

Standard deviation0.29995791
Coefficient of variation (CV)-0.095986531
Kurtosis-1.2213084
Mean-3.125
Median Absolute Deviation (MAD)0.3
Skewness-0.037952022
Sum-312.5
Variance0.089974747
MonotonicityNot monotonic
2023-12-10T19:37:27.725880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
-2.8 17
17.0%
-2.9 12
12.0%
-3.2 11
11.0%
-3.5 10
10.0%
-3.4 10
10.0%
-3.6 9
9.0%
-3.3 9
9.0%
-3.0 8
8.0%
-3.1 6
 
6.0%
-2.7 5
 
5.0%
Other values (2) 3
 
3.0%
ValueCountFrequency (%)
-3.6 9
9.0%
-3.5 10
10.0%
-3.4 10
10.0%
-3.3 9
9.0%
-3.2 11
11.0%
-3.1 6
 
6.0%
-3.0 8
8.0%
-2.9 12
12.0%
-2.8 17
17.0%
-2.7 5
 
5.0%
ValueCountFrequency (%)
-2.5 1
 
1.0%
-2.6 2
 
2.0%
-2.7 5
 
5.0%
-2.8 17
17.0%
-2.9 12
12.0%
-3.0 8
8.0%
-3.1 6
 
6.0%
-3.2 11
11.0%
-3.3 9
9.0%
-3.4 10
10.0%

남북바람성분
Real number (ℝ)

HIGH CORRELATION 

Distinct87
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.543
Minimum-4.6
Maximum5.8
Zeros1
Zeros (%)1.0%
Negative43
Negative (%)43.0%
Memory size1.0 KiB
2023-12-10T19:37:27.993778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-4.6
5-th percentile-4.3
Q1-2.225
median0.65
Q33.125
95-th percentile5.4
Maximum5.8
Range10.4
Interquartile range (IQR)5.35

Descriptive statistics

Standard deviation3.1555708
Coefficient of variation (CV)5.8113643
Kurtosis-1.2124712
Mean0.543
Median Absolute Deviation (MAD)2.65
Skewness-0.031653547
Sum54.3
Variance9.9576273
MonotonicityDecreasing
2023-12-10T19:37:28.312884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-4.3 3
 
3.0%
-1.6 2
 
2.0%
-3.4 2
 
2.0%
-3.7 2
 
2.0%
2.2 2
 
2.0%
0.9 2
 
2.0%
0.7 2
 
2.0%
2.4 2
 
2.0%
5.2 2
 
2.0%
5.4 2
 
2.0%
Other values (77) 79
79.0%
ValueCountFrequency (%)
-4.6 2
2.0%
-4.5 1
 
1.0%
-4.4 1
 
1.0%
-4.3 3
3.0%
-4.2 1
 
1.0%
-4.1 1
 
1.0%
-4.0 1
 
1.0%
-3.9 1
 
1.0%
-3.7 2
2.0%
-3.6 1
 
1.0%
ValueCountFrequency (%)
5.8 1
1.0%
5.7 1
1.0%
5.6 1
1.0%
5.5 1
1.0%
5.4 2
2.0%
5.2 2
2.0%
5.1 1
1.0%
5.0 1
1.0%
4.9 1
1.0%
4.8 1
1.0%

습도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-1
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-1
2nd row-1
3rd row-1
4th row-1
5th row-1

Common Values

ValueCountFrequency (%)
-1 100
100.0%

Length

2023-12-10T19:37:28.597677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:37:28.785389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

뇌전
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
-1
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-1
2nd row-1
3rd row-1
4th row-1
5th row-1

Common Values

ValueCountFrequency (%)
-1 100
100.0%

Length

2023-12-10T19:37:28.983879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:37:29.134527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

Interactions

2023-12-10T19:37:21.747535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:19.217813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:19.964242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:20.804835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:21.934310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:19.373941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:20.117867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:21.033409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:22.133692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:19.582961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:20.315580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:21.245510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:22.311111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:19.778392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:20.523215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:37:21.521142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:37:29.294871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
i하늘형태풍속동서바람성분남북바람성분
i1.0000.6780.9570.8470.996
하늘형태0.6781.0000.7460.3980.691
풍속0.9570.7461.0000.8730.960
동서바람성분0.8470.3980.8731.0000.886
남북바람성분0.9960.6910.9600.8861.000
2023-12-10T19:37:29.496091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
i풍속동서바람성분남북바람성분하늘형태
i1.000-0.9970.211-1.0000.510
풍속-0.9971.000-0.2270.9970.595
동서바람성분0.211-0.2271.000-0.2110.250
남북바람성분-1.0000.997-0.2111.0000.525
하늘형태0.5100.5950.2500.5251.000

Missing values

2023-12-10T19:37:22.656310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:37:23.026210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

ij1시간기온1시간강수량강수형태하늘형태풍향풍속동서바람성분남북바람성분습도뇌전
011-50-1032679.2-3.65.8-1-1
121-50-1032679.1-3.65.7-1-1
231-50-1012679.0-3.65.6-1-1
341-50-1032678.8-3.55.5-1-1
451-50-1032678.7-3.55.4-1-1
561-50-1032678.6-3.55.4-1-1
671-50-1042678.5-3.45.2-1-1
781-50-1032678.4-3.45.2-1-1
891-50-1032678.3-3.45.1-1-1
9101-50-1032678.2-3.45.0-1-1
ij1시간기온1시간강수량강수형태하늘형태풍향풍속동서바람성분남북바람성분습도뇌전
90911-50-1012673.0-2.9-4.0-1-1
91921-50-1012673.0-2.9-4.1-1-1
92931-50-1012672.9-2.8-4.2-1-1
93941-50-1012672.9-2.8-4.3-1-1
94951-50-1012672.9-2.8-4.3-1-1
95961-50-1012672.9-2.7-4.3-1-1
96971-50-1012672.9-2.7-4.4-1-1
97981-50-1012672.8-2.6-4.5-1-1
98991-50-1012672.8-2.6-4.6-1-1
991001-50-1012672.7-2.5-4.6-1-1