Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory53.3 B

Variable types

Categorical3
DateTime1
Numeric2

Dataset

Description샘플 데이터
Author순천향대학교 산학협력단
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=a7b2c560-49a3-11eb-8ff3-e7c20661cf87

Alerts

기기 ID has constant value ""Constant
위도 has constant value ""Constant
경도 has constant value ""Constant
미세먼지 is highly overall correlated with 초미세먼지High correlation
초미세먼지 is highly overall correlated with 미세먼지High correlation
데이터 측정 일시 has unique valuesUnique
초미세먼지 has unique valuesUnique

Reproduction

Analysis started2023-12-10 13:11:28.005874
Analysis finished2023-12-10 13:11:28.990714
Duration0.98 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기기 ID
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1911KT149
100 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1911KT149
2nd row1911KT149
3rd row1911KT149
4th row1911KT149
5th row1911KT149

Common Values

ValueCountFrequency (%)
1911KT149 100
100.0%

Length

2023-12-10T22:11:29.105199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:11:29.291833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1911kt149 100
100.0%
Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2020-02-01 00:00:00
Maximum2020-02-05 03:00:00
2023-12-10T22:11:29.449589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:29.740139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

위도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
37.53785
100 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row37.53785
2nd row37.53785
3rd row37.53785
4th row37.53785
5th row37.53785

Common Values

ValueCountFrequency (%)
37.53785 100
100.0%

Length

2023-12-10T22:11:29.946196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:11:30.076235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
37.53785 100
100.0%

경도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
126.986885
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row126.986885
2nd row126.986885
3rd row126.986885
4th row126.986885
5th row126.986885

Common Values

ValueCountFrequency (%)
126.986885 100
100.0%

Length

2023-12-10T22:11:30.220951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:11:30.352959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
126.986885 100
100.0%

미세먼지
Real number (ℝ)

HIGH CORRELATION 

Distinct98
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean40.9588
Minimum6.36
Maximum112.79
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:11:30.510901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6.36
5-th percentile10.141
Q115.2525
median23.28
Q366.805
95-th percentile97.726
Maximum112.79
Range106.43
Interquartile range (IQR)51.5525

Descriptive statistics

Standard deviation31.904543
Coefficient of variation (CV)0.77894233
Kurtosis-0.76789645
Mean40.9588
Median Absolute Deviation (MAD)13.09
Skewness0.81718813
Sum4095.88
Variance1017.8999
MonotonicityNot monotonic
2023-12-10T22:11:30.682735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
17.09 2
 
2.0%
10.93 2
 
2.0%
13.24 1
 
1.0%
16.05 1
 
1.0%
15.0 1
 
1.0%
17.21 1
 
1.0%
15.46 1
 
1.0%
12.46 1
 
1.0%
10.34 1
 
1.0%
10.15 1
 
1.0%
Other values (88) 88
88.0%
ValueCountFrequency (%)
6.36 1
1.0%
8.07 1
1.0%
8.09 1
1.0%
8.34 1
1.0%
9.97 1
1.0%
10.15 1
1.0%
10.34 1
1.0%
10.93 2
2.0%
11.11 1
1.0%
11.21 1
1.0%
ValueCountFrequency (%)
112.79 1
1.0%
110.31 1
1.0%
109.55 1
1.0%
107.5 1
1.0%
97.84 1
1.0%
97.72 1
1.0%
97.62 1
1.0%
95.67 1
1.0%
95.54 1
1.0%
94.54 1
1.0%

초미세먼지
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean52.1031
Minimum8.3
Maximum144.93
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:11:30.856939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8.3
5-th percentile13.043
Q119.3125
median29.405
Q385.0675
95-th percentile124.814
Maximum144.93
Range136.63
Interquartile range (IQR)65.755

Descriptive statistics

Standard deviation40.71521
Coefficient of variation (CV)0.78143547
Kurtosis-0.73198577
Mean52.1031
Median Absolute Deviation (MAD)16.42
Skewness0.83262362
Sum5210.31
Variance1657.7283
MonotonicityNot monotonic
2023-12-10T22:11:31.201779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
57.53 1
 
1.0%
16.63 1
 
1.0%
18.57 1
 
1.0%
20.49 1
 
1.0%
19.12 1
 
1.0%
21.95 1
 
1.0%
19.77 1
 
1.0%
15.95 1
 
1.0%
14.05 1
 
1.0%
13.33 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
8.3 1
1.0%
10.48 1
1.0%
10.51 1
1.0%
10.82 1
1.0%
12.72 1
1.0%
13.06 1
1.0%
13.33 1
1.0%
14.05 1
1.0%
14.06 1
1.0%
14.08 1
1.0%
ValueCountFrequency (%)
144.93 1
1.0%
141.48 1
1.0%
140.67 1
1.0%
137.98 1
1.0%
124.89 1
1.0%
124.81 1
1.0%
124.71 1
1.0%
122.42 1
1.0%
121.87 1
1.0%
120.03 1
1.0%

Interactions

2023-12-10T22:11:28.438248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:28.200748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:28.576437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:11:28.311109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:11:31.349610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
데이터 측정 일시미세먼지초미세먼지
데이터 측정 일시1.0001.0001.000
미세먼지1.0001.0000.999
초미세먼지1.0000.9991.000
2023-12-10T22:11:31.483822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
미세먼지초미세먼지
미세먼지1.0001.000
초미세먼지1.0001.000

Missing values

2023-12-10T22:11:28.754568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:11:28.921136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기기 ID데이터 측정 일시위도경도미세먼지초미세먼지
01911KT1492020-02-01 01:00:0037.53785126.98688545.5257.53
11911KT1492020-02-01 02:00:0037.53785126.98688546.7659.13
21911KT1492020-02-01 03:00:0037.53785126.98688536.3345.9
31911KT1492020-02-01 04:00:0037.53785126.98688529.0836.7
41911KT1492020-02-01 05:00:0037.53785126.98688537.0546.84
51911KT1492020-02-01 06:00:0037.53785126.98688543.2754.73
61911KT1492020-02-01 07:00:0037.53785126.98688545.9158.08
71911KT1492020-02-01 08:00:0037.53785126.98688550.7764.22
81911KT1492020-02-01 09:00:0037.53785126.98688556.5371.62
91911KT1492020-02-01 10:00:0037.53785126.98688564.1881.28
기기 ID데이터 측정 일시위도경도미세먼지초미세먼지
901911KT1492020-02-04 19:00:0037.53785126.98688522.1227.63
911911KT1492020-02-04 20:00:0037.53785126.98688523.6129.68
921911KT1492020-02-04 21:00:0037.53785126.98688518.9323.79
931911KT1492020-02-04 22:00:0037.53785126.98688518.0622.71
941911KT1492020-02-04 23:00:0037.53785126.9868859.9712.72
951911KT1492020-02-05 00:00:0037.53785126.9868856.368.3
961911KT1492020-02-05 01:00:0037.53785126.9868858.0710.51
971911KT1492020-02-05 02:00:0037.53785126.9868858.3410.82
981911KT1492020-02-05 03:00:0037.53785126.9868858.0910.48
991911KT1492020-02-01 00:00:0037.53785126.98688543.8355.32