Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory693.4 KiB
Average record size in memory71.0 B

Variable types

Numeric5
Categorical2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분High correlation
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분High correlation
측정항목 is highly overall correlated with 평균값High correlation
평균값 is highly overall correlated with 측정항목High correlation
국가 기준초과 구분 is highly imbalanced (69.8%)Imbalance
지자체 기준초과 구분 is highly imbalanced (69.8%)Imbalance
평균값 has 228 (2.3%) zerosZeros
측정기 상태 has 9635 (96.4%) zerosZeros

Reproduction

Analysis started2024-04-13 05:59:38.380042
Analysis finished2024-04-13 05:59:48.791267
Duration10.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

측정일시
Real number (ℝ)

Distinct470
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.018011 × 109
Minimum2.0180101 × 109
Maximum2.018012 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-13T05:59:48.942643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0180101 × 109
5-th percentile2.0180101 × 109
Q12.0180105 × 109
median2.018011 × 109
Q32.0180115 × 109
95-th percentile2.0180119 × 109
Maximum2.018012 × 109
Range1913
Interquartile range (IQR)992

Descriptive statistics

Standard deviation563.67488
Coefficient of variation (CV)2.79322 × 10-7
Kurtosis-1.1892393
Mean2.018011 × 109
Median Absolute Deviation (MAD)496
Skewness0.0062115151
Sum2.018011 × 1013
Variance317729.36
MonotonicityNot monotonic
2024-04-13T05:59:49.367907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2018011620 36
 
0.4%
2018010710 33
 
0.3%
2018010918 31
 
0.3%
2018011212 31
 
0.3%
2018010517 31
 
0.3%
2018011922 31
 
0.3%
2018011203 30
 
0.3%
2018010705 30
 
0.3%
2018011023 30
 
0.3%
2018010905 30
 
0.3%
Other values (460) 9687
96.9%
ValueCountFrequency (%)
2018010100 22
0.2%
2018010101 14
0.1%
2018010102 17
0.2%
2018010103 21
0.2%
2018010104 21
0.2%
2018010105 19
0.2%
2018010106 18
0.2%
2018010107 18
0.2%
2018010108 20
0.2%
2018010109 24
0.2%
ValueCountFrequency (%)
2018012013 11
 
0.1%
2018012012 13
0.1%
2018012011 28
0.3%
2018012010 20
0.2%
2018012009 24
0.2%
2018012008 19
0.2%
2018012007 23
0.2%
2018012006 16
0.2%
2018012005 26
0.3%
2018012004 19
0.2%

측정소 코드
Real number (ℝ)

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean112.956
Minimum101
Maximum125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-13T05:59:49.761713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile102
Q1107
median113
Q3119
95-th percentile124
Maximum125
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.1815751
Coefficient of variation (CV)0.063578519
Kurtosis-1.2018567
Mean112.956
Median Absolute Deviation (MAD)6
Skewness0.020404375
Sum1129560
Variance51.575022
MonotonicityNot monotonic
2024-04-13T05:59:50.215609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
118 447
 
4.5%
106 434
 
4.3%
109 429
 
4.3%
107 420
 
4.2%
103 417
 
4.2%
111 416
 
4.2%
110 411
 
4.1%
113 411
 
4.1%
119 408
 
4.1%
122 408
 
4.1%
Other values (15) 5799
58.0%
ValueCountFrequency (%)
101 364
3.6%
102 403
4.0%
103 417
4.2%
104 402
4.0%
105 379
3.8%
106 434
4.3%
107 420
4.2%
108 390
3.9%
109 429
4.3%
110 411
4.1%
ValueCountFrequency (%)
125 394
3.9%
124 397
4.0%
123 401
4.0%
122 408
4.1%
121 367
3.7%
120 379
3.8%
119 408
4.1%
118 447
4.5%
117 378
3.8%
116 383
3.8%

측정항목
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.3151
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-13T05:59:50.547302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q38
95-th percentile9
Maximum9
Range8
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.7703753
Coefficient of variation (CV)0.52122732
Kurtosis-1.2257992
Mean5.3151
Median Absolute Deviation (MAD)3
Skewness-0.19760913
Sum53151
Variance7.6749795
MonotonicityNot monotonic
2024-04-13T05:59:50.898279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 1730
17.3%
5 1709
17.1%
9 1693
16.9%
8 1653
16.5%
3 1625
16.2%
6 1590
15.9%
ValueCountFrequency (%)
1 1730
17.3%
3 1625
16.2%
5 1709
17.1%
6 1590
15.9%
8 1653
16.5%
9 1693
16.9%
ValueCountFrequency (%)
9 1693
16.9%
8 1653
16.5%
6 1590
15.9%
5 1709
17.1%
3 1625
16.2%
1 1730
17.3%

평균값
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct264
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-8.23688
Minimum-9999
Maximum3577
Zeros228
Zeros (%)2.3%
Negative26
Negative (%)0.3%
Memory size166.0 KiB
2024-04-13T05:59:51.303962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-9999
5-th percentile0.002
Q10.008
median0.07
Q325
95-th percentile76
Maximum3577
Range13576
Interquartile range (IQR)24.992

Descriptive statistics

Standard deviation505.85834
Coefficient of variation (CV)-61.413829
Kurtosis378.61729
Mean-8.23688
Median Absolute Deviation (MAD)0.07
Skewness-19.126439
Sum-82368.8
Variance255892.66
MonotonicityNot monotonic
2024-04-13T05:59:51.675509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.005 486
 
4.9%
0.006 471
 
4.7%
0.004 433
 
4.3%
0.007 288
 
2.9%
0.002 287
 
2.9%
0.6 280
 
2.8%
0.5 243
 
2.4%
0.003 238
 
2.4%
0.0 228
 
2.3%
0.4 226
 
2.3%
Other values (254) 6820
68.2%
ValueCountFrequency (%)
-9999.0 25
 
0.2%
-209.0 1
 
< 0.1%
0.0 228
2.3%
0.001 42
 
0.4%
0.002 287
2.9%
0.003 238
2.4%
0.004 433
4.3%
0.005 486
4.9%
0.006 471
4.7%
0.007 288
2.9%
ValueCountFrequency (%)
3577.0 1
< 0.1%
3561.0 1
< 0.1%
3533.0 1
< 0.1%
3493.0 1
< 0.1%
157.0 2
< 0.1%
154.0 1
< 0.1%
151.0 1
< 0.1%
150.0 1
< 0.1%
149.0 1
< 0.1%
147.0 2
< 0.1%

측정기 상태
Real number (ℝ)

ZEROS 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2411
Minimum0
Maximum9
Zeros9635
Zeros (%)96.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-13T05:59:51.892260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum9
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.3497974
Coefficient of variation (CV)5.598496
Kurtosis30.437417
Mean0.2411
Median Absolute Deviation (MAD)0
Skewness5.6522394
Sum2411
Variance1.821953
MonotonicityNot monotonic
2024-04-13T05:59:52.226090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 9635
96.4%
8 215
 
2.1%
1 56
 
0.6%
9 55
 
0.5%
4 31
 
0.3%
2 8
 
0.1%
ValueCountFrequency (%)
0 9635
96.4%
1 56
 
0.6%
2 8
 
0.1%
4 31
 
0.3%
8 215
 
2.1%
9 55
 
0.5%
ValueCountFrequency (%)
9 55
 
0.5%
8 215
 
2.1%
4 31
 
0.3%
2 8
 
0.1%
1 56
 
0.6%
0 9635
96.4%

국가 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9463 
1
 
537

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9463
94.6%
1 537
 
5.4%

Length

2024-04-13T05:59:52.510922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-13T05:59:52.805407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9463
94.6%
1 537
 
5.4%

지자체 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9463 
1
 
537

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9463
94.6%
1 537
 
5.4%

Length

2024-04-13T05:59:53.112708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-13T05:59:53.376520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9463
94.6%
1 537
 
5.4%

Interactions

2024-04-13T05:59:46.836015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:40.053035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:41.663234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:43.631719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:45.329510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:47.103565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:40.419477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:42.158727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:44.058596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:45.630669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:47.360443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:40.723302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T05:59:42.495660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/