Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory53.3 B

Variable types

Categorical3
DateTime1
Numeric2

Dataset

Description샘플 데이터
Author순천향대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=a7b2c560-49a3-11eb-8ff3-e7c20661cf87

Alerts

기기 ID has constant value ""Constant
위도 has constant value ""Constant
경도 has constant value ""Constant
미세먼지 is highly overall correlated with 초미세먼지High correlation
초미세먼지 is highly overall correlated with 미세먼지High correlation
데이터 측정 일시 has unique valuesUnique

Reproduction

Analysis started2023-12-10 13:11:18.413809
Analysis finished2023-12-10 13:11:19.524587
Duration1.11 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기기 ID
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1911KT149
100 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1911KT149
2nd row1911KT149
3rd row1911KT149
4th row1911KT149
5th row1911KT149

Common Values

ValueCountFrequency (%)
1911KT149 100
100.0%

Length

2023-12-10T22:11:19.624609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:11:19.772951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1911kt149 100
100.0%
Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2020-04-01 00:00:00
Maximum2020-04-05 03:00:00
2023-12-10T22:11:19.993707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:20.271457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

위도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
37.53785
100 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row37.53785
2nd row37.53785
3rd row37.53785
4th row37.53785
5th row37.53785

Common Values

ValueCountFrequency (%)
37.53785 100
100.0%

Length

2023-12-10T22:11:20.578190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:11:20.735574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
37.53785 100
100.0%

경도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
126.986885
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row126.986885
2nd row126.986885
3rd row126.986885
4th row126.986885
5th row126.986885

Common Values

ValueCountFrequency (%)
126.986885 100
100.0%

Length

2023-12-10T22:11:20.881887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:11:21.085795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
126.986885 100
100.0%

미세먼지
Real number (ℝ)

HIGH CORRELATION 

Distinct98
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23.9544
Minimum3.34
Maximum76.84
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:11:21.274928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.34
5-th percentile4.524
Q110.135
median17.865
Q334.945
95-th percentile62.414
Maximum76.84
Range73.5
Interquartile range (IQR)24.81

Descriptive statistics

Standard deviation18.9837
Coefficient of variation (CV)0.79249325
Kurtosis0.51765423
Mean23.9544
Median Absolute Deviation (MAD)10.83
Skewness1.1392652
Sum2395.44
Variance360.38088
MonotonicityNot monotonic
2023-12-10T22:11:21.503520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5.56 2
 
2.0%
8.15 2
 
2.0%
22.14 1
 
1.0%
70.44 1
 
1.0%
57.13 1
 
1.0%
50.58 1
 
1.0%
47.8 1
 
1.0%
44.22 1
 
1.0%
37.8 1
 
1.0%
26.8 1
 
1.0%
Other values (88) 88
88.0%
ValueCountFrequency (%)
3.34 1
1.0%
3.78 1
1.0%
4.04 1
1.0%
4.38 1
1.0%
4.41 1
1.0%
4.53 1
1.0%
4.76 1
1.0%
4.91 1
1.0%
4.97 1
1.0%
5.16 1
1.0%
ValueCountFrequency (%)
76.84 1
1.0%
75.85 1
1.0%
74.36 1
1.0%
70.44 1
1.0%
66.29 1
1.0%
62.21 1
1.0%
60.36 1
1.0%
59.26 1
1.0%
57.13 1
1.0%
56.95 1
1.0%

초미세먼지
Real number (ℝ)

HIGH CORRELATION 

Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30.2868
Minimum4.6
Maximum97.14
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:11:21.725725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4.6
5-th percentile6.0035
Q112.78
median22.9
Q343.7525
95-th percentile78.698
Maximum97.14
Range92.54
Interquartile range (IQR)30.9725

Descriptive statistics

Standard deviation23.832234
Coefficient of variation (CV)0.78688517
Kurtosis0.56491263
Mean30.2868
Median Absolute Deviation (MAD)13.86
Skewness1.1560089
Sum3028.68
Variance567.97536
MonotonicityNot monotonic
2023-12-10T22:11:21.957026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
31.65 2
 
2.0%
28.17 1
 
1.0%
32.95 1
 
1.0%
72.06 1
 
1.0%
63.68 1
 
1.0%
60.04 1
 
1.0%
55.46 1
 
1.0%
47.4 1
 
1.0%
33.66 1
 
1.0%
38.37 1
 
1.0%
Other values (89) 89
89.0%
ValueCountFrequency (%)
4.6 1
1.0%
5.1 1
1.0%
5.41 1
1.0%
5.87 1
1.0%
5.88 1
1.0%
6.01 1
1.0%
6.35 1
1.0%
6.49 1
1.0%
6.51 1
1.0%
6.72 1
1.0%
ValueCountFrequency (%)
97.14 1
1.0%
95.49 1
1.0%
94.08 1
1.0%
88.96 1
1.0%
83.22 1
1.0%
78.46 1
1.0%
76.15 1
1.0%
74.64 1
1.0%
72.36 1
1.0%
72.06 1
1.0%

Interactions

2023-12-10T22:11:18.965856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:18.683559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:19.119685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:18.832129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:11:22.092458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
데이터 측정 일시미세먼지초미세먼지
데이터 측정 일시1.0001.0001.000
미세먼지1.0001.0000.998
초미세먼지1.0000.9981.000
2023-12-10T22:11:22.224956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
미세먼지초미세먼지
미세먼지1.0001.000
초미세먼지1.0001.000

Missing values

2023-12-10T22:11:19.289004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:11:19.456423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기기 ID데이터 측정 일시위도경도미세먼지초미세먼지
01911KT1492020-04-01 01:00:0037.53785126.98688522.1428.17
11911KT1492020-04-01 02:00:0037.53785126.98688524.9231.65
21911KT1492020-04-01 03:00:0037.53785126.98688527.3634.53
31911KT1492020-04-01 04:00:0037.53785126.98688525.9632.69
41911KT1492020-04-01 05:00:0037.53785126.98688521.6427.29
51911KT1492020-04-01 06:00:0037.53785126.98688521.2626.82
61911KT1492020-04-01 07:00:0037.53785126.98688521.8627.54
71911KT1492020-04-01 08:00:0037.53785126.98688523.7629.96
81911KT1492020-04-01 09:00:0037.53785126.98688527.4334.58
91911KT1492020-04-01 10:00:0037.53785126.98688530.6438.56
기기 ID데이터 측정 일시위도경도미세먼지초미세먼지
901911KT1492020-04-04 19:00:0037.53785126.9868854.536.01
911911KT1492020-04-04 20:00:0037.53785126.9868855.346.98
921911KT1492020-04-04 21:00:0037.53785126.9868855.597.26
931911KT1492020-04-04 22:00:0037.53785126.9868854.976.49
941911KT1492020-04-04 23:00:0037.53785126.9868855.166.72
951911KT1492020-04-05 00:00:0037.53785126.9868855.567.22
961911KT1492020-04-05 01:00:0037.53785126.9868855.667.36
971911KT1492020-04-05 02:00:0037.53785126.9868856.298.19
981911KT1492020-04-05 03:00:0037.53785126.9868859.612.42
991911KT1492020-04-01 00:00:0037.53785126.98688516.8621.58