Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 8
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 693.4 KiB
Average record size in memory: 71.0 B
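
The derived figures in this overview follow from the raw counts. A minimal Python sketch that recomputes them, using only the numbers reported above (the variable names are illustrative and not part of the report):

```python
# Figures copied from the dataset statistics above.
n_variables = 7
n_observations = 10_000
missing_cells = 8
total_size_kib = 693.4

# Share of missing cells across the whole table (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size: total memory in bytes divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"missing cells: {missing_pct:.3f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

The missing share comes out near 0.011%, which the report displays as "< 0.1%", and the record size rounds to the reported 71.0 B.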

Variable types

Numeric: 5
Categorical: 2

Dataset

Description: File download
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

국가 기준초과 구분 (national standard exceedance flag) is highly overall correlated with 평균값 (average value) and 1 other field [High correlation]
지자체 기준초과 구분 (local government standard exceedance flag) is highly overall correlated with 평균값 and 1 other field [High correlation]
측정항목 (measurement item) is highly overall correlated with 평균값 [High correlation]
평균값 is highly overall correlated with 측정항목 and 2 other fields [High correlation]
국가 기준초과 구분 is highly imbalanced (82.5%) [Imbalance]
지자체 기준초과 구분 is highly imbalanced (82.5%) [Imbalance]
측정기 상태 (instrument status) has 9838 (98.4%) zeros [Zeros]

Reproduction

Analysis started: 2024-04-13 06:07:42.782866
Analysis finished: 2024-04-13 06:07:51.680120
Duration: 8.9 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

측정일시 (measurement date and time)
Real number (ℝ)

Distinct: 468
Distinct (%): 4.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.016011 × 10^9
Minimum: 2.0160101 × 10^9
Maximum: 2.016012 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2.0160101 × 10^9
5-th percentile: 2.0160101 × 10^9
Q1: 2.0160105 × 10^9
Median: 2.016011 × 10^9
Q3: 2.0160115 × 10^9
95-th percentile: 2.0160119 × 10^9
Maximum: 2.016012 × 10^9
Range: 1911
Interquartile range (IQR): 995

Descriptive statistics

Standard deviation: 562.43958
Coefficient of variation (CV): 2.7898636 × 10^-7
Kurtosis: -1.1996464
Mean: 2.016011 × 10^9
Median Absolute Deviation (MAD): 498
Skewness: -0.0053504085
Sum: 2.016011 × 10^13
Variance: 316338.28
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value               Count  Frequency (%)
2016010904          34     0.3%
2016011612          34     0.3%
2016011817          32     0.3%
2016011420          32     0.3%
2016011714          32     0.3%
2016011416          31     0.3%
2016011203          31     0.3%
2016011704          31     0.3%
2016010712          30     0.3%
2016010101          30     0.3%
Other values (458)  9683   96.8%

Minimum values

Value       Count  Frequency (%)
2016010100  17     0.2%
2016010101  30     0.3%
2016010102  20     0.2%
2016010103  18     0.2%
2016010104  26     0.3%
2016010105  22     0.2%
2016010106  23     0.2%
2016010107  22     0.2%
2016010108  23     0.2%
2016010109  18     0.2%

Maximum values

Value       Count  Frequency (%)
2016012011  17     0.2%
2016012010  20     0.2%
2016012009  21     0.2%
2016012008  28     0.3%
2016012007  20     0.2%
2016012006  22     0.2%
2016012005  19     0.2%
2016012004  20     0.2%
2016012003  21     0.2%
2016012002  20     0.2%
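
The 측정일시 values look like integers in YYYYMMDDHH form (for example 2016010100), which would explain why the report profiles the column as a real number. The sketch below assumes that encoding, which is inferred from the values and not stated in the report, and uses only the Python standard library; parse_measurement_time is a hypothetical helper name.

```python
from datetime import datetime, timedelta


def parse_measurement_time(value: int) -> datetime:
    """Parse an integer such as 2016010100, assumed to be YYYYMMDDHH."""
    return datetime.strptime(str(value), "%Y%m%d%H")


first = parse_measurement_time(2016010100)  # minimum in the report
last = parse_measurement_time(2016012011)   # maximum in the report

# Number of hourly slots between the two timestamps, counting both ends.
hourly_slots = int((last - first) / timedelta(hours=1)) + 1

print(first.isoformat(), last.isoformat(), hourly_slots)
```

Under this assumption the span covers 468 hourly slots, the same as the reported Distinct count, which suggests every hour in the period appears at least once. The reported Range of 1911 is just the integer difference 2016012011 - 2016010100 and has no meaning as a duration.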

측정소 코드 (monitoring station code)
Real number (ℝ)

Distinct: 25
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 112.9959
Minimum: 101
Maximum: 125
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 101
5-th percentile: 102
Q1: 107
Median: 113
Q3: 119
95-th percentile: 124
Maximum: 125
Range: 24
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 7.2501683
Coefficient of variation (CV): 0.064163109
Kurtosis: -1.2204895
Mean: 112.9959
Median Absolute Deviation (MAD): 6
Skewness: 0.0070515809
Sum: 1129959
Variance: 52.56494
Monotonicity: Not monotonic
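
Several of the statistics above are derived from others: the range and IQR are differences of quantiles, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. A short sketch that recomputes them from the reported figures (variable names are illustrative):

```python
# Figures copied from the 측정소 코드 statistics above.
mean = 112.9959
std = 7.2501683
minimum, maximum = 101, 125
q1, q3 = 107, 119

value_range = maximum - minimum  # difference between the extremes
iqr = q3 - q1                    # spread of the middle 50% of values
cv = std / mean                  # dispersion relative to the mean
variance = std ** 2              # squared standard deviation

print(value_range, iqr, round(cv, 6), round(variance, 4))
```

The results agree with the reported Range (24), IQR (12), CV (0.064163109) and Variance (52.56494) up to rounding.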
[Figure: Histogram with fixed size bins (bins=25)]

Common values

Value              Count  Frequency (%)
122                429    4.3%
114                427    4.3%
121                424    4.2%
104                423    4.2%
105                422    4.2%
124                417    4.2%
106                415    4.2%
102                409    4.1%
111                406    4.1%
107                403    4.0%
Other values (15)  5825   58.2%

Minimum values

Value  Count  Frequency (%)
101    395    4.0%
102    409    4.1%
103    385    3.9%
104    423    4.2%
105    422    4.2%
106    415    4.2%
107    403    4.0%
108    394    3.9%
109    375    3.8%
110    390    3.9%

Maximum values

Value  Count  Frequency (%)
125    400    4.0%
124    417    4.2%
123    402    4.0%
122    429    4.3%
121    424    4.2%
120    370    3.7%
119    383    3.8%
118    369    3.7%
117    397    4.0%
116    398    4.0%

측정항목 (measurement item)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.3283
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 6
Q3: 8
95-th percentile: 9
Maximum: 9
Range: 8
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 2.742056
Coefficient of variation (CV): 0.51462117
Kurtosis: -1.1989732
Mean: 5.3283
Median Absolute Deviation (MAD): 2
Skewness: -0.20930009
Sum: 53283
Variance: 7.518871
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=6)]

Common values

Value  Count  Frequency (%)
6      1696   17.0%
8      1679   16.8%
5      1677   16.8%
1      1673   16.7%
3      1643   16.4%
9      1632   16.3%

Minimum values

Value  Count  Frequency (%)
1      1673   16.7%
3      1643   16.4%
5      1677   16.8%
6      1696   17.0%
8      1679   16.8%
9      1632   16.3%

Maximum values

Value  Count  Frequency (%)
9      1632   16.3%
8      1679   16.8%
6      1696   17.0%
5      1677   16.8%
3      1643   16.4%
1      1673   16.7%

평균값 (average value)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 432
Distinct (%): 4.3%
Missing: 8
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.145517
Minimum: -34
Maximum: 296
Zeros: 39
Zeros (%): 0.4%
Negative: 5
Negative (%): < 0.1%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -34
5-th percentile: 0.004
Q1: 0.011
Median: 0.065
Q3: 21
95-th percentile: 63
Maximum: 296
Range: 330
Interquartile range (IQR): 20.989

Descriptive statistics

Standard deviation: 23.473167
Coefficient of variation (CV): 1.7856404
Kurtosis: 7.7707341
Mean: 13.145517
Median Absolute Deviation (MAD): 0.064
Skewness: 2.3399506
Sum: 131350.01
Variance: 550.98958
Monotonicity: Not monotonic
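
The reported Mean for 평균값 is consistent with dividing the Sum by the number of non-missing values, not by all 10000 rows. A small check using only the figures above (variable names are illustrative):

```python
# Figures copied from the 평균값 statistics above.
n_observations = 10_000
n_missing = 8
reported_sum = 131350.01

# Mean over the rows that actually carry a value.
n_valid = n_observations - n_missing
mean_valid = reported_sum / n_valid

# For contrast: dividing by every row, as if missing values were present.
mean_all = reported_sum / n_observations

print(f"{mean_valid:.6f}")
print(f"{mean_all:.6f}")
```

Only the first figure matches the reported 13.145517, so the 8 missing cells appear to be excluded from the denominator.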
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value               Count  Frequency (%)
0.005               529    5.3%
0.006               506    5.1%
0.007               395    4.0%
0.004               235    2.4%
0.008               212    2.1%
0.002               171    1.7%
0.003               140    1.4%
0.016               99     1.0%
0.009               97     1.0%
0.02                93     0.9%
Other values (422)  7515   75.1%

Minimum values

Value  Count  Frequency (%)
-34.0  1      < 0.1%
-14.0  1      < 0.1%
-12.0  1      < 0.1%
-2.0   1      < 0.1%
-1.0   1      < 0.1%
0.0    39     0.4%
0.001  53     0.5%
0.002  171    1.7%
0.003  140    1.4%
0.004  235    2.4%

Maximum values

Value  Count  Frequency (%)
296.0  1      < 0.1%
188.0  1      < 0.1%
177.0  1      < 0.1%
175.0  1      < 0.1%
174.0  1      < 0.1%
172.0  1      < 0.1%
171.0  1      < 0.1%
167.0  2      < 0.1%
166.0  2      < 0.1%
162.0  1      < 0.1%

측정기 상태 (instrument status)
Real number (ℝ)

ZEROS 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.0576
Minimum: 0
Maximum: 9
Zeros: 9838
Zeros (%): 98.4%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 9
Range: 9
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.61491467
Coefficient of variation (CV): 10.675602
Kurtosis: 181.71998
Mean: 0.0576
Median Absolute Deviation (MAD): 0
Skewness: 13.15889
Sum: 576
Variance: 0.37812005
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=6)]

Common values

Value  Count  Frequency (%)
0      9838   98.4%
1      76     0.8%
9      38     0.4%
2      25     0.2%
4      19     0.2%
8      4      < 0.1%

Minimum values

Value  Count  Frequency (%)
0      9838   98.4%
1      76     0.8%
2      25     0.2%
4      19     0.2%
8      4      < 0.1%
9      38     0.4%

Maximum values

Value  Count  Frequency (%)
9      38     0.4%
8      4      < 0.1%
4      19     0.2%
2      25     0.2%
1      76     0.8%
0      9838   98.4%
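
Because 측정기 상태 takes only six distinct values, its Sum, Mean and Zeros (%) can be recomputed directly from the frequency table. A minimal sketch (status_counts is an illustrative name for the table above):

```python
# Value -> count, copied from the 측정기 상태 frequency table above.
status_counts = {0: 9838, 1: 76, 2: 25, 4: 19, 8: 4, 9: 38}

n = sum(status_counts.values())

# Weighted sum: each value multiplied by how often it occurs.
total = sum(value * count for value, count in status_counts.items())
mean = total / n

# Share of rows whose status is exactly zero.
zeros_pct = 100 * status_counts[0] / n

print(n, total, mean, f"{zeros_pct:.1f}%")
```

The counts add up to the 10000 observations, and the weighted sum, mean and zero share agree with the reported 576, 0.0576 and 98.4%.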

국가 기준초과 구분 (national standard exceedance flag)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Value counts: 0 (9737), 1 (263)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      9737   97.4%
1      263    2.6%
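
The 82.5% imbalance figure in the alerts is consistent with an entropy-based score: one minus the Shannon entropy of the class shares, normalized by the maximum entropy for that number of classes. The sketch below assumes that definition; the report itself does not state the formula, and imbalance_score is a hypothetical helper written for illustration.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the base-2 Shannon entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # A single class has no balance to measure; treated as 0 here by assumption.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Class counts from the Common Values table above.
score = imbalance_score([9737, 263])
print(f"{100 * score:.1f}%")
```

A perfectly balanced column scores 0 under this definition and a column dominated by one class approaches 1, so a 97.4% / 2.6% split lands at roughly 0.82.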

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot; 0: 9737 (97.4%), 1: 263 (2.6%)]

지자체 기준초과 구분 (local government standard exceedance flag)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Value counts: 0 (9737), 1 (263)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value  Count  Frequency (%)
0      9737   97.4%
1      263    2.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot; 0: 9737 (97.4%), 1: 263 (2.6%)]

Interactions
