Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory53.3 B

Variable types

Categorical3
DateTime1
Numeric2

Dataset

Description샘플 데이터
Author순천향대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=a7b2c560-49a3-11eb-8ff3-e7c20661cf87

Alerts

기기 ID has constant value ""Constant
위도 has constant value ""Constant
경도 has constant value ""Constant
미세먼지 is highly overall correlated with 초미세먼지High correlation
초미세먼지 is highly overall correlated with 미세먼지High correlation
데이터 측정 일시 has unique valuesUnique

Reproduction

Analysis started2023-12-10 13:11:08.468222
Analysis finished2023-12-10 13:11:09.532072
Duration1.06 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기기 ID
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1911KT149
100 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1911KT149
2nd row1911KT149
3rd row1911KT149
4th row1911KT149
5th row1911KT149

Common Values

ValueCountFrequency (%)
1911KT149 100
100.0%

Length

2023-12-10T22:11:09.628454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:11:09.768866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1911kt149 100
100.0%
Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2020-06-01 00:00:00
Maximum2020-06-05 03:00:00
2023-12-10T22:11:09.948155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:10.235529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

위도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
37.53785
100 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row37.53785
2nd row37.53785
3rd row37.53785
4th row37.53785
5th row37.53785

Common Values

ValueCountFrequency (%)
37.53785 100
100.0%

Length

2023-12-10T22:11:10.465363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:11:10.675174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
37.53785 100
100.0%

경도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
126.986885
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row126.986885
2nd row126.986885
3rd row126.986885
4th row126.986885
5th row126.986885

Common Values

ValueCountFrequency (%)
126.986885 100
100.0%

Length

2023-12-10T22:11:10.834169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:11:10.976182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
126.986885 100
100.0%

미세먼지
Real number (ℝ)

HIGH CORRELATION 

Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.7818
Minimum3.12
Maximum55.9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:11:11.144186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.12
5-th percentile4.777
Q19.6925
median21.915
Q331.8
95-th percentile42.296
Maximum55.9
Range52.78
Interquartile range (IQR)22.1075

Descriptive statistics

Standard deviation12.85767
Coefficient of variation (CV)0.5902942
Kurtosis-0.65251178
Mean21.7818
Median Absolute Deviation (MAD)11.29
Skewness0.32274759
Sum2178.18
Variance165.31969
MonotonicityNot monotonic
2023-12-10T22:11:11.437154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
17.19 2
 
2.0%
4.79 1
 
1.0%
33.89 1
 
1.0%
31.6 1
 
1.0%
31.66 1
 
1.0%
22.76 1
 
1.0%
21.46 1
 
1.0%
12.65 1
 
1.0%
9.71 1
 
1.0%
14.51 1
 
1.0%
Other values (89) 89
89.0%
ValueCountFrequency (%)
3.12 1
1.0%
3.4 1
1.0%
3.61 1
1.0%
3.75 1
1.0%
4.53 1
1.0%
4.79 1
1.0%
4.81 1
1.0%
4.99 1
1.0%
5.56 1
1.0%
5.59 1
1.0%
ValueCountFrequency (%)
55.9 1
1.0%
52.51 1
1.0%
51.81 1
1.0%
43.95 1
1.0%
42.41 1
1.0%
42.29 1
1.0%
40.53 1
1.0%
39.42 1
1.0%
37.84 1
1.0%
37.81 1
1.0%

초미세먼지
Real number (ℝ)

HIGH CORRELATION 

Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.3937
Minimum4.18
Maximum69.91
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:11:11.709332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4.18
5-th percentile5.968
Q112.5275
median27.48
Q339.8375
95-th percentile52.9605
Maximum69.91
Range65.73
Interquartile range (IQR)27.31

Descriptive statistics

Standard deviation16.040598
Coefficient of variation (CV)0.58555793
Kurtosis-0.64363182
Mean27.3937
Median Absolute Deviation (MAD)14.055
Skewness0.33041823
Sum2739.37
Variance257.30079
MonotonicityNot monotonic
2023-12-10T22:11:11.971611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
27.48 2
 
2.0%
5.97 1
 
1.0%
34.35 1
 
1.0%
39.65 1
 
1.0%
28.67 1
 
1.0%
27.07 1
 
1.0%
16.26 1
 
1.0%
12.61 1
 
1.0%
18.37 1
 
1.0%
31.18 1
 
1.0%
Other values (89) 89
89.0%
ValueCountFrequency (%)
4.18 1
1.0%
4.53 1
1.0%
4.76 1
1.0%
5.02 1
1.0%
5.93 1
1.0%
5.97 1
1.0%
6.06 1
1.0%
6.57 1
1.0%
7.27 1
1.0%
7.3 1
1.0%
ValueCountFrequency (%)
69.91 1
1.0%
66.02 1
1.0%
64.94 1
1.0%
55.14 1
1.0%
52.97 1
1.0%
52.96 1
1.0%
50.91 1
1.0%
49.43 1
1.0%
47.79 1
1.0%
47.16 1
1.0%

Interactions

2023-12-10T22:11:08.881418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:08.618980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:09.009756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:08.741788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:11:12.160141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
데이터 측정 일시미세먼지초미세먼지
데이터 측정 일시1.0001.0001.000
미세먼지1.0001.0001.000
초미세먼지1.0001.0001.000
2023-12-10T22:11:12.351281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
미세먼지초미세먼지
미세먼지1.0001.000
초미세먼지1.0001.000

Missing values

2023-12-10T22:11:09.216369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:11:09.464391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기기 ID데이터 측정 일시위도경도미세먼지초미세먼지
01911KT1492020-06-01 01:00:0037.53785126.9868854.795.97
11911KT1492020-06-01 02:00:0037.53785126.98688510.713.28
21911KT1492020-06-01 03:00:0037.53785126.9868856.137.62
31911KT1492020-06-01 04:00:0037.53785126.9868857.789.67
41911KT1492020-06-01 05:00:0037.53785126.9868854.816.06
51911KT1492020-06-01 06:00:0037.53785126.9868859.5412.09
61911KT1492020-06-01 07:00:0037.53785126.98688510.5513.48
71911KT1492020-06-01 08:00:0037.53785126.9868859.0211.69
81911KT1492020-06-01 09:00:0037.53785126.9868856.558.65
91911KT1492020-06-01 10:00:0037.53785126.9868856.788.94
기기 ID데이터 측정 일시위도경도미세먼지초미세먼지
901911KT1492020-06-04 19:00:0037.53785126.98688530.437.94
911911KT1492020-06-04 20:00:0037.53785126.98688528.8136.24
921911KT1492020-06-04 21:00:0037.53785126.98688530.0438.01
931911KT1492020-06-04 22:00:0037.53785126.98688523.5729.86
941911KT1492020-06-04 23:00:0037.53785126.98688529.7437.65
951911KT1492020-06-05 00:00:0037.53785126.98688537.8147.79
961911KT1492020-06-05 01:00:0037.53785126.98688534.0242.82
971911KT1492020-06-05 02:00:0037.53785126.98688532.7441.11
981911KT1492020-06-05 03:00:0037.53785126.98688552.5166.02
991911KT1492020-06-01 00:00:0037.53785126.9868855.887.31