Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 693.4 KiB
Average record size in memory: 71.0 B

Variable types

Numeric: 4
Categorical: 3

Dataset

Description: File download
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

지자체 기준초과 구분 (local government standard exceedance flag) has constant value "" [Constant]
측정항목 (measurement item) is highly correlated with 평균값 (average value) [High correlation]
평균값 is highly correlated with 측정항목 [High correlation]
측정기 상태 (instrument status) is highly imbalanced (95.0%) [Imbalance]
국가 기준초과 구분 (national standard exceedance flag) is highly imbalanced (99.7%) [Imbalance]
평균값 is highly skewed (γ1 = -26.36700805) [Skewed]

Reproduction

Analysis started: 2024-04-27 12:03:58.692912
Analysis finished: 2024-04-27 12:04:04.664500
Duration: 5.97 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

측정일시 (measurement date and time)
Real number (ℝ)

Distinct: 435
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.014011 × 10^9
Minimum: 2.0140101 × 10^9
Maximum: 2.0140119 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2.0140101 × 10^9
5-th percentile: 2.0140101 × 10^9
Q1: 2.0140105 × 10^9
Median: 2.0140109 × 10^9
Q3: 2.0140114 × 10^9
95-th percentile: 2.0140118 × 10^9
Maximum: 2.0140119 × 10^9
Range: 1802
Interquartile range (IQR): 902

Descriptive statistics

Standard deviation: 522.63245
Coefficient of variation (CV): 2.5949831 × 10^-7
Kurtosis: -1.2032852
Mean: 2.014011 × 10^9
Median Absolute Deviation (MAD): 480
Skewness: 0.011515467
Sum: 2.014011 × 10^13
Variance: 273144.68
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
2014011214 | 38 | 0.4%
2014010213 | 37 | 0.4%
2014011409 | 36 | 0.4%
2014011523 | 34 | 0.3%
2014011200 | 33 | 0.3%
2014010815 | 33 | 0.3%
2014010407 | 33 | 0.3%
2014011704 | 33 | 0.3%
2014011612 | 32 | 0.3%
2014010217 | 32 | 0.3%
Other values (425) | 9659 | 96.6%

Smallest values

Value | Count | Frequency (%)
2014010100 | 21 | 0.2%
2014010101 | 25 | 0.2%
2014010102 | 26 | 0.3%
2014010103 | 32 | 0.3%
2014010104 | 25 | 0.2%
2014010105 | 23 | 0.2%
2014010106 | 23 | 0.2%
2014010107 | 21 | 0.2%
2014010108 | 22 | 0.2%
2014010109 | 29 | 0.3%

Largest values

Value | Count | Frequency (%)
2014011902 | 20 | 0.2%
2014011901 | 18 | 0.2%
2014011900 | 27 | 0.3%
2014011823 | 26 | 0.3%
2014011822 | 28 | 0.3%
2014011821 | 26 | 0.3%
2014011820 | 18 | 0.2%
2014011819 | 21 | 0.2%
2014011818 | 29 | 0.3%
2014011817 | 22 | 0.2%
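
The values of 측정일시 look like timestamps packed into integers as YYYYMMDDHH, which would make the mean, sum, and standard deviation reported for this variable hard to read as quantities. The sketch below assumes that layout (the report does not state it) and parses the smallest and largest reported values. `parse_measurement_time` is a hypothetical helper name, not part of the profiling tool.

```python
from datetime import datetime


def parse_measurement_time(stamp: int) -> datetime:
    """Parse an integer such as 2014011214, assuming a YYYYMMDDHH layout."""
    return datetime.strptime(str(stamp), "%Y%m%d%H")


# Smallest and largest values listed in the report for 측정일시.
first = parse_measurement_time(2014010100)
last = parse_measurement_time(2014011902)

# Number of hourly slots between the two, counting both endpoints.
span_hours = int((last - first).total_seconds() // 3600) + 1

print(first.isoformat(), last.isoformat(), span_hours)
```

Under that assumption the data runs from 2014-01-01 00:00 to 2014-01-19 02:00, a span of 435 hourly slots, which agrees with the 435 distinct values reported for this variable.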

측정소 코드 (station code)
Real number (ℝ)

Distinct: 25
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 112.9675
Minimum: 101
Maximum: 125
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 101
5-th percentile: 102
Q1: 107
Median: 113
Q3: 119
95-th percentile: 124
Maximum: 125
Range: 24
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 7.2033071
Coefficient of variation (CV): 0.063764419
Kurtosis: -1.1984696
Mean: 112.9675
Median Absolute Deviation (MAD): 6
Skewness: -0.013492096
Sum: 1129675
Variance: 51.887633
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=25)
Common values

Value | Count | Frequency (%)
115 | 456 | 4.6%
118 | 435 | 4.3%
101 | 430 | 4.3%
124 | 424 | 4.2%
109 | 413 | 4.1%
102 | 412 | 4.1%
114 | 409 | 4.1%
119 | 405 | 4.0%
121 | 403 | 4.0%
110 | 399 | 4.0%
Other values (15) | 5814 | 58.1%

Smallest values

Value | Count | Frequency (%)
101 | 430 | 4.3%
102 | 412 | 4.1%
103 | 397 | 4.0%
104 | 384 | 3.8%
105 | 397 | 4.0%
106 | 394 | 3.9%
107 | 389 | 3.9%
108 | 384 | 3.8%
109 | 413 | 4.1%
110 | 399 | 4.0%

Largest values

Value | Count | Frequency (%)
125 | 357 | 3.6%
124 | 424 | 4.2%
123 | 392 | 3.9%
122 | 387 | 3.9%
121 | 403 | 4.0%
120 | 395 | 4.0%
119 | 405 | 4.0%
118 | 435 | 4.3%
117 | 390 | 3.9%
116 | 386 | 3.9%

측정항목 (measurement item)
Real number (ℝ)

HIGH CORRELATION

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.3793
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 6
Q3: 8
95-th percentile: 9
Maximum: 9
Range: 8
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 2.7326504
Coefficient of variation (CV): 0.50799368
Kurtosis: -1.1807181
Mean: 5.3793
Median Absolute Deviation (MAD): 2
Skewness: -0.22757046
Sum: 53793
Variance: 7.4673782
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
Common values

Value | Count | Frequency (%)
5 | 1719 | 17.2%
9 | 1687 | 16.9%
8 | 1686 | 16.9%
6 | 1675 | 16.8%
3 | 1622 | 16.2%
1 | 1611 | 16.1%

Smallest values

Value | Count | Frequency (%)
1 | 1611 | 16.1%
3 | 1622 | 16.2%
5 | 1719 | 17.2%
6 | 1675 | 16.8%
8 | 1686 | 16.9%
9 | 1687 | 16.9%

Largest values

Value | Count | Frequency (%)
9 | 1687 | 16.9%
8 | 1686 | 16.9%
6 | 1675 | 16.8%
5 | 1719 | 17.2%
3 | 1622 | 16.2%
1 | 1611 | 16.1%

평균값 (average value)
Real number (ℝ)

HIGH CORRELATION, SKEWED

Distinct: 302
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.8998312
Minimum: -9999
Maximum: 195
Zeros: 19
Zeros (%): 0.2%
Negative: 27
Negative (%): 0.3%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -9999
5-th percentile: 0.003
Q1: 0.011
Median: 0.2
Q3: 22
95-th percentile: 73.05
Maximum: 195
Range: 10194
Interquartile range (IQR): 21.989

Descriptive statistics

Standard deviation: 375.90935
Coefficient of variation (CV): 417.7554
Kurtosis: 698.60273
Mean: 0.8998312
Median Absolute Deviation (MAD): 0.198
Skewness: -26.367008
Sum: 8998.312
Variance: 141307.84
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
0.005 | 311 | 3.1%
0.003 | 309 | 3.1%
0.006 | 307 | 3.1%
0.007 | 298 | 3.0%
0.004 | 258 | 2.6%
0.008 | 254 | 2.5%
0.002 | 244 | 2.4%
0.6 | 219 | 2.2%
0.5 | 215 | 2.1%
0.7 | 197 | 2.0%
Other values (292) | 7388 | 73.9%

Smallest values

Value | Count | Frequency (%)
-9999.0 | 14 | 0.1%
-999.9 | 3 | < 0.1%
-9.999 | 9 | 0.1%
-0.021 | 1 | < 0.1%
0.0 | 19 | 0.2%
0.001 | 75 | 0.8%
0.002 | 244 | 2.4%
0.003 | 309 | 3.1%
0.004 | 258 | 2.6%
0.005 | 311 | 3.1%

Largest values

Value | Count | Frequency (%)
195.0 | 2 | < 0.1%
193.0 | 1 | < 0.1%
191.0 | 1 | < 0.1%
188.0 | 1 | < 0.1%
185.0 | 1 | < 0.1%
182.0 | 2 | < 0.1%
180.0 | 1 | < 0.1%
179.0 | 1 | < 0.1%
178.0 | 2 | < 0.1%
176.0 | 2 | < 0.1%
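
The smallest values listed for 평균값 (-9999, -999.9 and -9.999, 26 rows in total) look like placeholder codes rather than measurements, and they would explain the extreme skewness and the mean of 0.8998312 sitting far below Q3. The sketch below is a rough check using only figures printed in this report. Treating those three values as missing-data codes is an assumption, not something the report states.

```python
# Figures copied from the 평균값 section of this report.
n_total = 10000
reported_sum = 8998.312
reported_mean = 0.8998312
reported_std = 375.90935

# Assumption: these values are missing-data codes, mapped to their row counts.
suspected_codes = {-9999.0: 14, -999.9: 3, -9.999: 9}

code_count = sum(suspected_codes.values())
code_sum = sum(value * count for value, count in suspected_codes.items())

# Mean of the remaining rows once the suspected codes are set aside.
adjusted_mean = (reported_sum - code_sum) / (n_total - code_count)

# Sanity check on the report itself: CV is standard deviation over mean.
cv = reported_std / reported_mean

print(f"rows set aside: {code_count}")
print(f"adjusted mean:  {adjusted_mean:.3f}")
print(f"CV from report: {cv:.4f}")
```

With the 26 suspected rows set aside, the mean of the remaining 9974 rows comes out at about 15.25, and the recomputed CV matches the reported 417.7554. Summary statistics for this variable are therefore probably best read only after deciding how those codes should be handled.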

측정기 상태 (instrument status)
Categorical

IMBALANCE

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Category counts: 0 (9872), 1 (78), 2 (32), 9 (16), 8 (2)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 9872 | 98.7%
1 | 78 | 0.8%
2 | 32 | 0.3%
9 | 16 | 0.2%
8 | 2 | < 0.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]
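
The Alerts section reports 측정기 상태 as 95.0% imbalanced, although its most common category covers 98.7% of rows, so the score is clearly not the majority share. One formula that reproduces the reported figures is 1 minus the Shannon entropy of the category counts divided by the maximum possible entropy. The sketch below applies it to the counts in this report. Treating this as the profiler's definition is an assumption, and `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the counts.

    0 means perfectly balanced categories; values near 1 mean one category
    dominates. A single-category input is treated as fully imbalanced.
    """
    k = len(counts)
    if k < 2:
        return 1.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Category counts copied from the report.
status_score = imbalance_score([9872, 78, 32, 16, 2])  # 측정기 상태
national_score = imbalance_score([9998, 2])            # 국가 기준초과 구분

print(f"{status_score:.1%}")
print(f"{national_score:.1%}")
```

This prints 95.0% and 99.7%, the same values shown in the two imbalance alerts.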

국가 기준초과 구분 (national standard exceedance flag)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Category counts: 0 (9998), 1 (2)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 9998 | > 99.9%
1 | 2 | < 0.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

지자체 기준초과 구분 (local government standard exceedance flag)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Category counts: 0 (10000)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 10000 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Interactions

[Figures: pairwise interaction plots between variables (16 panels)]

Correlations

[Figure: correlation heatmap]

 | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분
측정일시 | 1.000 | 0.000 | 0.000 | 0.000 | 0.109 | 0.000
측정소 코드 | 0.000 | 1.000 | 0.000 | 0.000 | 0.123 | 0.050
측정항목 | 0.000 | 0.000 | 1.000 | 0.043 | 0.045 | 0.032
평균값 | 0.000 | 0.000 | 0.043 | 1.000 | 0.306 | 0.000
측정기 상태 | 0.109 | 0.123 | 0.045 | 0.306 | 1.000 | 0.000
국가 기준초과 구분 | 0.000 | 0.050 | 0.032 | 0.000 | 0.000 | 1.000

[Figure: correlation heatmap]

 | 측정기 상태 | 국가 기준초과 구분
측정기 상태 | 1.000 | 0.000
국가 기준초과 구분 | 0.000 | 1.000

[Figure: correlation heatmap]

 | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분
측정일시 | 1.000 | 0.007 | -0.004 | 0.010 | 0.046 | 0.000
측정소 코드 | 0.007 | 1.000 | 0.005 | 0.007 | 0.051 | 0.038
측정항목 | -0.004 | 0.005 | 1.000 | 0.677 | 0.030 | 0.023
평균값 | 0.010 | 0.007 | 0.677 | 1.000 | 0.374 | 0.000
측정기 상태 | 0.046 | 0.051 | 0.030 | 0.374 | 1.000 | 0.000
국가 기준초과 구분 | 0.000 | 0.038 | 0.023 | 0.000 | 0.000 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

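(row index) | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분
53316 | 2014011519 | 112 | 1 | 0.009 | 0 | 0 | 0
4786 | 2014010207 | 123 | 8 | 61.0 | 0 | 0 | 0
41406 | 2014011212 | 102 | 1 | 0.014 | 0 | 0 | 0
33313 | 2014011006 | 103 | 3 | 0.042 | 0 | 0 | 0
48823 | 2014011413 | 113 | 3 | 0.027 | 0 | 0 | 0
22696 | 2014010707 | 108 | 8 | 77.0 | 0 | 0 | 0
49472 | 2014011417 | 121 | 5 | 0.5 | 0 | 0 | 0
19577 | 2014010610 | 113 | 9 | 39.0 | 0 | 0 | 0
15947 | 2014010510 | 108 | 9 | 39.0 | 0 | 0 | 0
50044 | 2014011421 | 116 | 8 | 66.0 | 0 | 0 | 0

(row index) | 측정일시 | 측정소 코드 | 측정항목 | 평균값 | 측정기 상태 | 국가 기준초과 구분 | 지자체 기준초과 구분
1404 | 2014010109 | 110 | 1 | 0.008 | 0 | 0 | 0
7034 | 2014010222 | 123 | 5 | 0.8 | 0 | 0 | 0
19471 | 2014010609 | 121 | 3 | 0.045 | 0 | 0 | 0
18573 | 2014010603 | 121 | 6 | 0.003 | 0 | 0 | 0
5361 | 2014010211 | 119 | 6 | 0.013 | 0 | 0 | 0
44036 | 2014011305 | 115 | 5 | 0.4 | 0 | 0 | 0
56725 | 2014011618 | 105 | 3 | 0.066 | 0 | 0 | 0
49121 | 2014011415 | 112 | 9 | 22.0 | 0 | 0 | 0
57126 | 2014011620 | 122 | 1 | 0.009 | 0 | 0 | 0
1303 | 2014010108 | 118 | 3 | 0.026 | 0 | 0 | 0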