Overview

Dataset statistics

Number of variables: 6
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.2 KiB
Average record size in memory: 53.3 B

Variable types

Categorical: 3
DateTime: 1
Numeric: 2

Dataset

Description: Sample data
Author: 순천향대학교 산학협력단 (Soonchunhyang University Industry-Academic Cooperation Foundation)
URL: https://www.bigdata-environment.kr/user/data_market/detail.do?id=a7b2c560-49a3-11eb-8ff3-e7c20661cf87

Alerts

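기기 ID (device ID) has constant value "1911KT149" [Constant]
위도 (latitude) has constant value "37.53785" [Constant]
경도 (longitude) has constant value "126.986885" [Constant]
미세먼지 (fine dust) is highly overall correlated with 초미세먼지 (ultrafine dust) [High correlation]
초미세먼지 is highly overall correlated with 미세먼지 [High correlation]
데이터 측정 일시 (measurement timestamp) has unique values [Unique]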
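
The Constant alerts flag columns that hold a single value in every row. A minimal sketch of that check, using three rows copied from the Sample section; the helper name `constant_columns` is hypothetical and is not part of ydata-profiling:

```python
# Three rows copied from the report's Sample section (column names as in the report).
rows = [
    {"기기 ID": "1911KT149", "데이터 측정 일시": "2020-01-01 01:00:00",
     "위도": "37.53785", "경도": "126.986885", "미세먼지": 17.9, "초미세먼지": 22.88},
    {"기기 ID": "1911KT149", "데이터 측정 일시": "2020-01-01 02:00:00",
     "위도": "37.53785", "경도": "126.986885", "미세먼지": 18.89, "초미세먼지": 24.13},
    {"기기 ID": "1911KT149", "데이터 측정 일시": "2020-01-01 03:00:00",
     "위도": "37.53785", "경도": "126.986885", "미세먼지": 20.53, "초미세먼지": 26.23},
]


def constant_columns(rows):
    """Return {column: value} for every column with exactly one distinct value."""
    result = {}
    for name in rows[0]:
        distinct = {row[name] for row in rows}
        if len(distinct) == 1:
            result[name] = next(iter(distinct))
    return result


constant = constant_columns(rows)
print(constant)
```

Columns reported this way carry no information for modelling and are usually dropped or stored once as metadata.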

Reproduction

Analysis started: 2023-12-10 13:11:32.556586
Analysis finished: 2023-12-10 13:11:33.725518
Duration: 1.17 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
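
A report of this kind can be regenerated with ydata-profiling along the following lines. This is a sketch under assumptions: the CSV path is hypothetical, the report title is a guess, and the settings stored in config.json are not reproduced here, so inferred variable types may differ from those shown above.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report to out_path."""
    # Third-party imports stay inside the function so this module can be
    # loaded even where pandas / ydata-profiling are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse the timestamp column so it is profiled as a date/time variable.
    df = pd.read_csv(csv_path, parse_dates=["데이터 측정 일시"])

    # Title is an assumption; the original report's title is not known.
    profile = ProfileReport(df, title="Sample data")
    profile.to_file(out_path)
```

Usage, assuming the dataset was saved as `sample.csv` (a hypothetical file name): `build_report("sample.csv")`.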

Variables

기기 ID
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
1911KT149: 100

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1911KT149
2nd row: 1911KT149
3rd row: 1911KT149
4th row: 1911KT149
5th row: 1911KT149

Common Values

Value      Count  Frequency (%)
1911KT149  100    100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: common values plot

데이터 측정 일시
DateTime

UNIQUE

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
Minimum: 2020-01-01 00:00:00
Maximum: 2020-01-05 10:00:00

Figure: Histogram with fixed size bins (bins=50)
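
The reported minimum, maximum and distinct count allow a quick completeness check on the time axis. The sketch below assumes an hourly cadence throughout, which is what the rows in the Sample section suggest but which the report does not state:

```python
from datetime import datetime, timedelta

first = datetime(2020, 1, 1, 0, 0)   # reported Minimum
last = datetime(2020, 1, 5, 10, 0)   # reported Maximum
observed = 100                       # reported Distinct count

step = timedelta(hours=1)            # assumption: one reading per hour

# Number of hourly timestamps in the closed interval [first, last].
expected = (last - first) // step + 1
missing = expected - observed

print(expected, missing)  # prints "107 7"
```

Under that assumption the span would hold 107 hourly timestamps, so 7 hours have no reading even though the report shows no missing cells.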

위도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
37.53785: 100

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 37.53785
2nd row: 37.53785
3rd row: 37.53785
4th row: 37.53785
5th row: 37.53785

Common Values

Value     Count  Frequency (%)
37.53785  100    100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: common values plot

경도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
126.986885: 100

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 126.986885
2nd row: 126.986885
3rd row: 126.986885
4th row: 126.986885
5th row: 126.986885

Common Values

Value       Count  Frequency (%)
126.986885  100    100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: common values plot

미세먼지
Real number (ℝ)

HIGH CORRELATION 

Distinct: 99
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 48.4947
Minimum: 17.9
Maximum: 84.79
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 17.9
5-th percentile: 20.9765
Q1: 32.8175
Median: 52.34
Q3: 61.3725
95-th percentile: 69.1935
Maximum: 84.79
Range: 66.89
Interquartile range (IQR): 28.555

Descriptive statistics

Standard deviation: 16.635911
Coefficient of variation (CV): 0.34304597
Kurtosis: -1.0499725
Mean: 48.4947
Median Absolute Deviation (MAD): 11.42
Skewness: -0.26918588
Sum: 4849.47
Variance: 276.75355
Monotonicity: Not monotonic
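
Several of the figures above are derived from others, which gives a cheap consistency check. The sketch below recomputes range, IQR, coefficient of variation, variance and mean for 미세먼지 from the reported values only; no raw data is used, so agreement is limited by the rounding of the inputs.

```python
import math

# Values as reported for 미세먼지.
minimum, maximum = 17.9, 84.79
q1, q3 = 32.8175, 61.3725
mean, std = 48.4947, 16.635911
total, n = 4849.47, 100

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
mean_from_sum = total / n         # Mean recomputed from Sum

# Compare against the reported figures, allowing for rounding.
checks = {
    "range": math.isclose(value_range, 66.89, abs_tol=1e-9),
    "iqr": math.isclose(iqr, 28.555, abs_tol=1e-9),
    "cv": math.isclose(cv, 0.34304597, abs_tol=1e-6),
    "variance": math.isclose(variance, 276.75355, abs_tol=1e-3),
    "mean": math.isclose(mean_from_sum, mean, abs_tol=1e-9),
}
print(checks)
```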

Figure: Histogram with fixed size bins (bins=50)

Common values

Value              Count  Frequency (%)
61.73              2      2.0%
17.9               1      1.0%
53.03              1      1.0%
77.74              1      1.0%
62.48              1      1.0%
57.05              1      1.0%
63.53              1      1.0%
61.07              1      1.0%
61.56              1      1.0%
60.39              1      1.0%
Other values (89)  89     89.0%

Minimum 10 values

Value  Count  Frequency (%)
17.9   1      1.0%
18.89  1      1.0%
18.98  1      1.0%
19.76  1      1.0%
20.53  1      1.0%
21.0   1      1.0%
22.27  1      1.0%
22.53  1      1.0%
22.95  1      1.0%
23.06  1      1.0%

Maximum 10 values

Value  Count  Frequency (%)
84.79  1      1.0%
77.74  1      1.0%
75.83  1      1.0%
74.4   1      1.0%
70.78  1      1.0%
69.11  1      1.0%
69.05  1      1.0%
68.45  1      1.0%
68.37  1      1.0%
68.2   1      1.0%

초미세먼지
Real number (ℝ)

HIGH CORRELATION 

Distinct: 99
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 61.984
Minimum: 22.88
Maximum: 109.72
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 22.88
5-th percentile: 26.724
Q1: 41.64
Median: 66.66
Q3: 78.605
95-th percentile: 88.7185
Maximum: 109.72
Range: 86.84
Interquartile range (IQR): 36.965

Descriptive statistics

Standard deviation: 21.390711
Coefficient of variation (CV): 0.34510053
Kurtosis: -1.0435717
Mean: 61.984
Median Absolute Deviation (MAD): 14.63
Skewness: -0.25675025
Sum: 6198.4
Variance: 457.56252
Monotonicity: Not monotonic

Figure: Histogram with fixed size bins (bins=50)

Common values

Value              Count  Frequency (%)
88.06              2      2.0%
22.88              1      1.0%
79.46              1      1.0%
72.91              1      1.0%
81.38              1      1.0%
78.1               1      1.0%
78.65              1      1.0%
77.43              1      1.0%
75.41              1      1.0%
75.58              1      1.0%
Other values (89)  89     89.0%

Minimum 10 values

Value  Count  Frequency (%)
22.88  1      1.0%
24.13  1      1.0%
24.29  1      1.0%
25.22  1      1.0%
26.23  1      1.0%
26.75  1      1.0%
28.44  1      1.0%
28.71  1      1.0%
29.28  1      1.0%
29.4   1      1.0%

Maximum 10 values

Value   Count  Frequency (%)
109.72  1      1.0%
99.46   1      1.0%
97.45   1      1.0%
95.15   1      1.0%
90.4    1      1.0%
88.63   1      1.0%
88.11   1      1.0%
88.06   2      2.0%
87.37   1      1.0%
83.53   1      1.0%

Interactions

Figures: four interaction plots

Correlations

Figure: correlation heatmap

                  데이터 측정 일시  미세먼지  초미세먼지
데이터 측정 일시  1.000             1.000     1.000
미세먼지          1.000             1.000     0.998
초미세먼지        1.000             0.998     1.000

Figure: correlation heatmap

            미세먼지  초미세먼지
미세먼지    1.000     0.999
초미세먼지  0.999     1.000
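
The near-perfect correlation between 미세먼지 and 초미세먼지 can be spot-checked by hand. The sketch below computes Pearson's r on the ten first rows listed in the Sample section; choosing Pearson is an assumption made for illustration, since the text does not say which coefficient each matrix uses, and ten rows will not reproduce the full-data figures exactly.

```python
import math

# First ten rows from the Sample section.
fine_dust = [17.9, 18.89, 20.53, 22.27, 23.06, 23.57, 24.75, 22.95, 19.76, 21.0]
ultrafine_dust = [22.88, 24.13, 26.23, 28.44, 29.4, 30.05, 31.51, 29.28, 25.22, 26.75]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    norm_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (norm_x * norm_y)


r = pearson(fine_dust, ultrafine_dust)
print(round(r, 4))
```

With a correlation this high the two columns are close to redundant, so one of them is usually enough as a model input.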

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

    기기 ID    데이터 측정 일시     위도      경도        미세먼지  초미세먼지
0   1911KT149  2020-01-01 01:00:00  37.53785  126.986885  17.9      22.88
1   1911KT149  2020-01-01 02:00:00  37.53785  126.986885  18.89     24.13
2   1911KT149  2020-01-01 03:00:00  37.53785  126.986885  20.53     26.23
3   1911KT149  2020-01-01 04:00:00  37.53785  126.986885  22.27     28.44
4   1911KT149  2020-01-01 05:00:00  37.53785  126.986885  23.06     29.4
5   1911KT149  2020-01-01 06:00:00  37.53785  126.986885  23.57     30.05
6   1911KT149  2020-01-01 07:00:00  37.53785  126.986885  24.75     31.51
7   1911KT149  2020-01-01 08:00:00  37.53785  126.986885  22.95     29.28
8   1911KT149  2020-01-01 09:00:00  37.53785  126.986885  19.76     25.22
9   1911KT149  2020-01-01 10:00:00  37.53785  126.986885  21.0      26.75

Last rows

    기기 ID    데이터 측정 일시     위도      경도        미세먼지  초미세먼지
90  1911KT149  2020-01-05 02:00:00  37.53785  126.986885  59.69     76.66
91  1911KT149  2020-01-05 03:00:00  37.53785  126.986885  64.2      82.79
92  1911KT149  2020-01-05 04:00:00  37.53785  126.986885  84.79     109.72
93  1911KT149  2020-01-05 05:00:00  37.53785  126.986885  62.37     80.12
94  1911KT149  2020-01-05 06:00:00  37.53785  126.986885  61.57     79.11
95  1911KT149  2020-01-05 07:00:00  37.53785  126.986885  61.24     78.68
96  1911KT149  2020-01-05 08:00:00  37.53785  126.986885  57.65     73.98
97  1911KT149  2020-01-05 09:00:00  37.53785  126.986885  68.45     88.06
98  1911KT149  2020-01-05 10:00:00  37.53785  126.986885  75.83     97.45
99  1911KT149  2020-01-01 00:00:00  37.53785  126.986885  18.98     24.29
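
Row 99 carries the earliest timestamp in the dataset (2020-01-01 00:00:00, the reported Minimum), so the rows are not stored in chronological order. A small sketch of detecting and fixing this, using the last three rows of the sample; only the index and timestamp are kept for brevity:

```python
from datetime import datetime

# (row index, timestamp) for the last three rows of the sample.
last_rows = [
    (97, "2020-01-05 09:00:00"),
    (98, "2020-01-05 10:00:00"),
    (99, "2020-01-01 00:00:00"),
]

parsed = [(idx, datetime.strptime(ts, "%Y-%m-%d %H:%M:%S")) for idx, ts in last_rows]

# True only if every timestamp is strictly later than the one before it.
in_order = all(a[1] < b[1] for a, b in zip(parsed, parsed[1:]))

# Row indices rearranged into chronological order.
sorted_idx = [idx for idx, _ in sorted(parsed, key=lambda pair: pair[1])]

print(in_order, sorted_idx)  # prints "False [99, 97, 98]"
```

Sorting by 데이터 측정 일시 before any time-based step (differencing, resampling, lag features) avoids treating row 99 as the latest reading.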