Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.3 KiB
Average record size in memory54.3 B

Variable types

Categorical3
DateTime1
Numeric2

Dataset

Description샘플 데이터
Author순천향대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=6ea74040-4078-11eb-bdda-25af7f339cc7

Alerts

장비ID has constant value ""Constant
위도 has constant value ""Constant
경도 has constant value ""Constant
초미세먼지값(ug/m3) is highly overall correlated with 미세먼지값(ug/m3)High correlation
미세먼지값(ug/m3) is highly overall correlated with 초미세먼지값(ug/m3)High correlation
데이터발생일시 has unique valuesUnique

Reproduction

Analysis started2023-12-10 11:28:47.646836
Analysis finished2023-12-10 11:28:48.531908
Duration0.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

장비ID
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
64969
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row64969
2nd row64969
3rd row64969
4th row64969
5th row64969

Common Values

ValueCountFrequency (%)
64969 100
100.0%

Length

2023-12-10T20:28:48.602248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:28:48.719521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
64969 100
100.0%
Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2020-11-01 00:01:00
Maximum2020-11-01 03:19:00
2023-12-10T20:28:48.831656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:28:48.999192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

위도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
36.772892
100 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row36.772892
2nd row36.772892
3rd row36.772892
4th row36.772892
5th row36.772892

Common Values

ValueCountFrequency (%)
36.772892 100
100.0%

Length

2023-12-10T20:28:49.176449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:28:49.277676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
36.772892 100
100.0%

경도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
127.021792
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row127.021792
2nd row127.021792
3rd row127.021792
4th row127.021792
5th row127.021792

Common Values

ValueCountFrequency (%)
127.021792 100
100.0%

Length

2023-12-10T20:28:49.422930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:28:49.523004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
127.021792 100
100.0%

초미세먼지값(ug/m3)
Real number (ℝ)

HIGH CORRELATION 

Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.5051
Minimum18.04
Maximum48.66
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:28:49.662091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum18.04
5-th percentile20.4595
Q125.9825
median32.57
Q338.2175
95-th percentile44.111
Maximum48.66
Range30.62
Interquartile range (IQR)12.235

Descriptive statistics

Standard deviation7.2985881
Coefficient of variation (CV)0.22453671
Kurtosis-0.77245257
Mean32.5051
Median Absolute Deviation (MAD)5.875
Skewness-0.007779529
Sum3250.51
Variance53.269389
MonotonicityNot monotonic
2023-12-10T20:28:49.827402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
25.2 2
 
2.0%
44.13 1
 
1.0%
39.04 1
 
1.0%
25.42 1
 
1.0%
25.46 1
 
1.0%
31.64 1
 
1.0%
31.27 1
 
1.0%
33.61 1
 
1.0%
30.05 1
 
1.0%
36.33 1
 
1.0%
Other values (89) 89
89.0%
ValueCountFrequency (%)
18.04 1
1.0%
19.49 1
1.0%
20.36 1
1.0%
20.43 1
1.0%
20.45 1
1.0%
20.46 1
1.0%
20.86 1
1.0%
21.38 1
1.0%
21.45 1
1.0%
22.04 1
1.0%
ValueCountFrequency (%)
48.66 1
1.0%
47.59 1
1.0%
46.82 1
1.0%
46.42 1
1.0%
44.13 1
1.0%
44.11 1
1.0%
42.81 1
1.0%
42.17 1
1.0%
41.96 1
1.0%
41.77 1
1.0%

미세먼지값(ug/m3)
Real number (ℝ)

HIGH CORRELATION 

Distinct97
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.9767
Minimum20.23
Maximum55.62
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:28:50.003362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20.23
5-th percentile22.4345
Q128.5275
median36.28
Q342.565
95-th percentile49.223
Maximum55.62
Range35.39
Interquartile range (IQR)14.0375

Descriptive statistics

Standard deviation8.1814772
Coefficient of variation (CV)0.22741044
Kurtosis-0.72408361
Mean35.9767
Median Absolute Deviation (MAD)6.48
Skewness-0.010230299
Sum3597.67
Variance66.93657
MonotonicityNot monotonic
2023-12-10T20:28:50.194305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
42.76 2
 
2.0%
46.46 2
 
2.0%
38.65 2
 
2.0%
49.22 1
 
1.0%
40.95 1
 
1.0%
32.93 1
 
1.0%
28.19 1
 
1.0%
28.14 1
 
1.0%
35.0 1
 
1.0%
33.7 1
 
1.0%
Other values (87) 87
87.0%
ValueCountFrequency (%)
20.23 1
1.0%
21.28 1
1.0%
21.71 1
1.0%
21.83 1
1.0%
22.14 1
1.0%
22.45 1
1.0%
23.48 1
1.0%
23.62 1
1.0%
23.66 1
1.0%
24.03 1
1.0%
ValueCountFrequency (%)
55.62 1
1.0%
51.89 1
1.0%
51.64 1
1.0%
51.43 1
1.0%
49.28 1
1.0%
49.22 1
1.0%
47.06 1
1.0%
46.5 1
1.0%
46.46 2
2.0%
45.53 1
1.0%

Interactions

2023-12-10T20:28:48.027387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:28:47.772573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:28:48.162080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:28:47.895276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T20:28:50.362573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
데이터발생일시초미세먼지값(ug/m3)미세먼지값(ug/m3)
데이터발생일시1.0001.0001.000
초미세먼지값(ug/m3)1.0001.0000.980
미세먼지값(ug/m3)1.0000.9801.000
2023-12-10T20:28:50.487009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
초미세먼지값(ug/m3)미세먼지값(ug/m3)
초미세먼지값(ug/m3)1.0000.994
미세먼지값(ug/m3)0.9941.000

Missing values

2023-12-10T20:28:48.335076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T20:28:48.478099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

장비ID데이터발생일시위도경도초미세먼지값(ug/m3)미세먼지값(ug/m3)
0649692020-11-01 0:0336.772892127.02179244.1349.22
1649692020-11-01 0:0536.772892127.02179246.8251.89
2649692020-11-01 0:0736.772892127.02179247.5951.43
3649692020-11-01 0:0936.772892127.02179242.8146.5
4649692020-11-01 0:1136.772892127.02179239.9645.49
5649692020-11-01 0:1336.772892127.02179236.6539.39
6649692020-11-01 0:1536.772892127.02179230.9834.66
7649692020-11-01 0:1736.772892127.02179233.037.39
8649692020-11-01 0:1936.772892127.02179227.0231.2
9649692020-11-01 0:2136.772892127.02179223.9826.79
장비ID데이터발생일시위도경도초미세먼지값(ug/m3)미세먼지값(ug/m3)
90649692020-11-01 3:0336.772892127.02179220.3621.71
91649692020-11-01 3:0536.772892127.02179222.6925.6
92649692020-11-01 3:0736.772892127.02179226.027.88
93649692020-11-01 3:0936.772892127.02179222.0423.48
94649692020-11-01 3:1136.772892127.02179220.4522.14
95649692020-11-01 3:1336.772892127.02179220.4323.66
96649692020-11-01 3:1536.772892127.02179221.4524.03
97649692020-11-01 3:1736.772892127.02179222.3224.37
98649692020-11-01 3:1936.772892127.02179225.4328.4
99649692020-11-01 0:0136.772892127.02179236.8741.65