Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory693.4 KiB
Average record size in memory71.0 B

Variable types

Numeric5
Categorical2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분High correlation
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분High correlation
국가 기준초과 구분 is highly imbalanced (96.1%)Imbalance
지자체 기준초과 구분 is highly imbalanced (91.1%)Imbalance
평균값 has 216 (2.2%) zerosZeros
측정기 상태 has 9112 (91.1%) zerosZeros

Reproduction

Analysis started2024-05-04 03:59:48.316233
Analysis finished2024-05-04 03:59:57.833802
Duration9.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

측정일시
Real number (ℝ)

Distinct584
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0060113 × 109
Minimum2.0060101 × 109
Maximum2.0060125 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T03:59:58.058202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0060101 × 109
5-th percentile2.0060102 × 109
Q12.0060106 × 109
median2.0060113 × 109
Q32.0060119 × 109
95-th percentile2.0060124 × 109
Maximum2.0060125 × 109
Range2407
Interquartile range (IQR)1285

Descriptive statistics

Standard deviation704.20647
Coefficient of variation (CV)3.5104811 × 10-7
Kurtosis-1.2110538
Mean2.0060113 × 109
Median Absolute Deviation (MAD)604
Skewness-0.00038946948
Sum2.0060113 × 1013
Variance495906.75
MonotonicityNot monotonic
2024-05-04T03:59:58.467934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2006011919 31
 
0.3%
2006012222 29
 
0.3%
2006010609 29
 
0.3%
2006012106 29
 
0.3%
2006011909 28
 
0.3%
2006010605 27
 
0.3%
2006012009 27
 
0.3%
2006010120 27
 
0.3%
2006010804 27
 
0.3%
2006010314 27
 
0.3%
Other values (574) 9719
97.2%
ValueCountFrequency (%)
2006010100 24
0.2%
2006010101 11
0.1%
2006010102 15
0.1%
2006010103 12
0.1%
2006010104 13
0.1%
2006010105 17
0.2%
2006010106 20
0.2%
2006010107 15
0.1%
2006010108 24
0.2%
2006010109 19
0.2%
ValueCountFrequency (%)
2006012507 14
0.1%
2006012506 16
0.2%
2006012505 14
0.1%
2006012504 19
0.2%
2006012503 24
0.2%
2006012502 13
0.1%
2006012501 20
0.2%
2006012500 12
0.1%
2006012423 17
0.2%
2006012422 16
0.2%

측정소 코드
Real number (ℝ)

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean112.8313
Minimum101
Maximum125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T03:59:58.850211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile102
Q1107
median113
Q3119
95-th percentile124
Maximum125
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.2200175
Coefficient of variation (CV)0.063989492
Kurtosis-1.1965161
Mean112.8313
Median Absolute Deviation (MAD)6
Skewness0.033736457
Sum1128313
Variance52.128653
MonotonicityNot monotonic
2024-05-04T03:59:59.167764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
109 448
 
4.5%
101 435
 
4.3%
107 426
 
4.3%
104 421
 
4.2%
112 418
 
4.2%
102 413
 
4.1%
111 407
 
4.1%
117 407
 
4.1%
114 401
 
4.0%
122 401
 
4.0%
Other values (15) 5823
58.2%
ValueCountFrequency (%)
101 435
4.3%
102 413
4.1%
103 393
3.9%
104 421
4.2%
105 377
3.8%
106 399
4.0%
107 426
4.3%
108 401
4.0%
109 448
4.5%
110 399
4.0%
ValueCountFrequency (%)
125 389
3.9%
124 399
4.0%
123 389
3.9%
122 401
4.0%
121 399
4.0%
120 357
3.6%
119 373
3.7%
118 378
3.8%
117 407
4.1%
116 400
4.0%

측정항목
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.3376
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T03:59:59.601219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q38
95-th percentile9
Maximum9
Range8
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.7583305
Coefficient of variation (CV)0.51677355
Kurtosis-1.2254251
Mean5.3376
Median Absolute Deviation (MAD)3
Skewness-0.19910636
Sum53376
Variance7.6083871
MonotonicityNot monotonic
2024-05-04T03:59:59.800513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
3 1725
17.2%
9 1698
17.0%
8 1669
16.7%
1 1655
16.6%
6 1647
16.5%
5 1606
16.1%
ValueCountFrequency (%)
1 1655
16.6%
3 1725
17.2%
5 1606
16.1%
6 1647
16.5%
8 1669
16.7%
9 1698
17.0%
ValueCountFrequency (%)
9 1698
17.0%
8 1669
16.7%
6 1647
16.5%
5 1606
16.1%
3 1725
17.2%
1 1655
16.6%

평균값
Real number (ℝ)

ZEROS 

Distinct314
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-356.7894
Minimum-9999
Maximum2240
Zeros216
Zeros (%)2.2%
Negative461
Negative (%)4.6%
Memory size166.0 KiB
2024-05-04T04:00:00.104859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-9999
5-th percentile0
Q10.008
median0.05
Q318
95-th percentile68
Maximum2240
Range12239
Interquartile range (IQR)17.992

Descriptive statistics

Standard deviation1890.9833
Coefficient of variation (CV)-5.2999985
Kurtosis22.031257
Mean-356.7894
Median Absolute Deviation (MAD)0.05
Skewness-4.8988821
Sum-3567894
Variance3575817.8
MonotonicityNot monotonic
2024-05-04T04:00:00.457917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-9999.0 370
 
3.7%
0.001 269
 
2.7%
0.006 268
 
2.7%
0.004 261
 
2.6%
0.002 259
 
2.6%
0.005 253
 
2.5%
0.003 244
 
2.4%
0.007 235
 
2.4%
0.008 232
 
2.3%
0.0 216
 
2.2%
Other values (304) 7393
73.9%
ValueCountFrequency (%)
-9999.0 370
3.7%
-999.9 12
 
0.1%
-9.999 79
 
0.8%
0.0 216
2.2%
0.001 269
2.7%
0.002 259
2.6%
0.003 244
2.4%
0.004 261
2.6%
0.005 253
2.5%
0.006 268
2.7%
ValueCountFrequency (%)
2240.0 1
< 0.1%
1415.0 1
< 0.1%
1298.0 1
< 0.1%
1183.0 1
< 0.1%
1061.0 1
< 0.1%
1031.0 1
< 0.1%
910.0 1
< 0.1%
728.0 1
< 0.1%
547.0 1
< 0.1%
511.0 1
< 0.1%

측정기 상태
Real number (ℝ)

ZEROS 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3489
Minimum0
Maximum9
Zeros9112
Zeros (%)91.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T04:00:00.814152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile4
Maximum9
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.2997453
Coefficient of variation (CV)3.7252658
Kurtosis22.042973
Mean0.3489
Median Absolute Deviation (MAD)0
Skewness4.4822555
Sum3489
Variance1.6893377
MonotonicityNot monotonic
2024-05-04T04:00:01.223289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 9112
91.1%
4 445
 
4.5%
2 248
 
2.5%
9 80
 
0.8%
1 61
 
0.6%
8 54
 
0.5%
ValueCountFrequency (%)
0 9112
91.1%
1 61
 
0.6%
2 248
 
2.5%
4 445
 
4.5%
8 54
 
0.5%
9 80
 
0.8%
ValueCountFrequency (%)
9 80
 
0.8%
8 54
 
0.5%
4 445
 
4.5%
2 248
 
2.5%
1 61
 
0.6%
0 9112
91.1%

국가 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9958 
1
 
42

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9958
99.6%
1 42
 
0.4%

Length

2024-05-04T04:00:01.508296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T04:00:01.792588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9958
99.6%
1 42
 
0.4%

지자체 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9887 
1
 
113

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9887
98.9%
1 113
 
1.1%

Length

2024-05-04T04:00:02.097636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T04:00:02.395978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9887
98.9%
1 113
 
1.1%

Interactions

2024-05-04T03:59:56.006970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:50.210366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:51.639927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:53.015833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:54.526299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:56.213521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:50.473835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:51.901329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:53.364294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:54.799004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:56.420105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:50.738752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:52.185089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:53.639222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:55.072116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:56.709272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:51.113621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:52.466641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:53.962454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:55.431204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:56.962541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:51.388904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:52.730876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:54.260742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:59:55.748872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-04T04:00:02.587056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정일시측정소 코드측정항목평균값측정기 상태국가 기준초과 구분지자체 기준초과 구분
측정일시1.0000.0000.0150.0850.1510.1750.254
측정소 코드0.0001.0000.0000.1290.3460.0450.056
측정항목0.0150.0001.0000.1380.5100.1880.320
평균값0.0850.1290.1381.0000.5970.1130.068
측정기 상태0.1510.3460.5100.5971.0000.0770.057
국가 기준초과 구분0.1750.0450.1880.1130.0771.0000.809
지자체 기준초과 구분0.2540.0560.3200.0680.0570.8091.000
2024-05-04T04:00:02.862154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지자체 기준초과 구분국가 기준초과 구분
지자체 기준초과 구분1.0000.600
국가 기준초과 구분0.6001.000
2024-05-04T04:00:03.108864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
측정일시측정소 코드측정항목평균값측정기 상태국가 기준초과 구분지자체 기준초과 구분
측정일시1.000-0.0250.010-0.0110.0590.1340.195
측정소 코드-0.0251.0000.0080.104-0.1330.0350.043
측정항목0.0100.0081.0000.4600.2290.1350.230
평균값-0.0110.1040.4601.000-0.3320.1870.114
측정기 상태0.059-0.1330.229-0.3321.0000.0550.041
국가 기준초과 구분0.1340.0350.1350.1870.0551.0000.600
지자체 기준초과 구분0.1950.0430.2300.1140.0410.6001.000

Missing values

2024-05-04T03:59:57.272207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-04T03:59:57.662030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

측정일시측정소 코드측정항목평균값측정기 상태국가 기준초과 구분지자체 기준초과 구분
5366200601021112050.8000
7219200601030010430.023000
69889200601200912430.047000
21319200601062210430.023000
8617200601030911230.024000
23995200601071512530.018000
53041200601151711630.063000
9393200601031411660.023000
400602006011203102878.0000
22849200601070810930.036000
측정일시측정소 코드측정항목평균값측정기 상태국가 기준초과 구분지자체 기준초과 구분
27165200601081310360.002000
4026200601020212210.007000
73484200601210912350.6000
788202006012221112828.0000
32043200601092111660.0000
5757200601021411060.014000
69930200601201010610.007000
3842200601020111651.1000
61393200601180110830.043000
7698200601030310910.004000