Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 1223 |
Missing cells | 16 |
Missing cells (%) | 0.2% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 83.7 KiB |
Average record size in memory | 70.1 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
DateTime | 1 |
Dataset
Description | 대장관리번호,자치구명,자치구 코드,집수면적,처리용량,시설용량,이용량,설치일자 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15646/S/1/datasetView.do |
대장관리번호 is highly overall correlated with 집수면적 | High correlation |
자치구 코드 is highly overall correlated with 자치구명 | High correlation |
집수면적 is highly overall correlated with 대장관리번호 | High correlation |
처리용량 is highly overall correlated with 시설용량 | High correlation |
시설용량 is highly overall correlated with 처리용량 | High correlation |
자치구명 is highly overall correlated with 자치구 코드 | High correlation |
이용량 is highly imbalanced (98.6%) | Imbalance |
설치일자 has 16 (1.3%) missing values | Missing |
대장관리번호 has unique values | Unique |
집수면적 has 976 (79.8%) zeros | Zeros |
처리용량 has 302 (24.7%) zeros | Zeros |
시설용량 has 42 (3.4%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-10 22:15:43.144203 |
---|---|
Analysis finished | 2024-05-10 22:15:54.583262 |
Duration | 11.44 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
대장관리번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 1223 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3161.1594 |
Minimum | 14 |
---|---|
Maximum | 4162 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 10.9 KiB |
Quantile statistics
Minimum | 14 |
---|---|
5-th percentile | 430.1 |
Q1 | 3139.5 |
median | 3517 |
Q3 | 3850.5 |
95-th percentile | 4097.9 |
Maximum | 4162 |
Range | 4148 |
Interquartile range (IQR) | 711 |
Descriptive statistics
Standard deviation | 1101.3145 |
---|---|
Coefficient of variation (CV) | 0.34838941 |
Kurtosis | 1.5353706 |
Mean | 3161.1594 |
Median Absolute Deviation (MAD) | 354 |
Skewness | -1.6827366 |
Sum | 3866098 |
Variance | 1212893.6 |
Monotonicity | Strictly decreasing |
Value | Count | Frequency (%) |
4162 | 1 | 0.1% |
3276 | 1 | 0.1% |
3269 | 1 | 0.1% |
3270 | 1 | 0.1% |
3271 | 1 | 0.1% |
3272 | 1 | 0.1% |
3273 | 1 | 0.1% |
3274 | 1 | 0.1% |
3275 | 1 | 0.1% |
3277 | 1 | 0.1% |
Other values (1213) | 1213 |
Value | Count | Frequency (%) |
14 | 1 | |
67 | 1 | |
68 | 1 | |
71 | 1 | |
84 | 1 | |
100 | 1 | |
101 | 1 | |
105 | 1 | |
106 | 1 | |
115 | 1 |
Value | Count | Frequency (%) |
4162 | 1 | |
4161 | 1 | |
4160 | 1 | |
4159 | 1 | |
4158 | 1 | |
4157 | 1 | |
4156 | 1 | |
4155 | 1 | |
4154 | 1 | |
4153 | 1 |
자치구명
Categorical
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.7 KiB |
서초구 | |
---|---|
은평구 | |
송파구 | |
도봉구 | 76 |
강서구 | 76 |
Other values (20) |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0801308 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강남구 |
---|---|
2nd row | 영등포구 |
3rd row | 영등포구 |
4th row | 도봉구 |
5th row | 관악구 |
Common Values
Value | Count | Frequency (%) |
서초구 | 119 | 9.7% |
은평구 | 81 | 6.6% |
송파구 | 80 | 6.5% |
도봉구 | 76 | 6.2% |
강서구 | 76 | 6.2% |
성북구 | 73 | 6.0% |
광진구 | 64 | 5.2% |
노원구 | 58 | 4.7% |
강동구 | 54 | 4.4% |
동대문구 | 53 | 4.3% |
Other values (15) | 489 |
Length
Value | Count | Frequency (%) |
서초구 | 119 | 9.7% |
은평구 | 81 | 6.6% |
송파구 | 80 | 6.5% |
도봉구 | 76 | 6.2% |
강서구 | 76 | 6.2% |
성북구 | 73 | 6.0% |
광진구 | 64 | 5.2% |
노원구 | 58 | 4.7% |
강동구 | 54 | 4.4% |
동대문구 | 53 | 4.3% |
Other values (15) | 489 |
자치구 코드
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11446.672 |
Minimum | 11110 |
---|---|
Maximum | 11740 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 10.9 KiB |
Quantile statistics
Minimum | 11110 |
---|---|
5-th percentile | 11170 |
Q1 | 11290 |
median | 11440 |
Q3 | 11620 |
95-th percentile | 11710 |
Maximum | 11740 |
Range | 630 |
Interquartile range (IQR) | 330 |
Descriptive statistics
Standard deviation | 182.80045 |
---|---|
Coefficient of variation (CV) | 0.015969746 |
Kurtosis | -1.303038 |
Mean | 11446.672 |
Median Absolute Deviation (MAD) | 150 |
Skewness | 0.053969132 |
Sum | 13999280 |
Variance | 33416.003 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11650 | 119 | 9.7% |
11380 | 81 | 6.6% |
11710 | 80 | 6.5% |
11320 | 76 | 6.2% |
11500 | 76 | 6.2% |
11290 | 73 | 6.0% |
11215 | 64 | 5.2% |
11350 | 58 | 4.7% |
11740 | 54 | 4.4% |
11230 | 53 | 4.3% |
Other values (15) | 489 |
Value | Count | Frequency (%) |
11110 | 17 | 1.4% |
11140 | 20 | 1.6% |
11170 | 30 | 2.5% |
11200 | 29 | 2.4% |
11215 | 64 | |
11230 | 53 | |
11260 | 40 | |
11290 | 73 | |
11305 | 33 | |
11320 | 76 |
Value | Count | Frequency (%) |
11740 | 54 | |
11710 | 80 | |
11680 | 50 | |
11650 | 119 | |
11620 | 53 | |
11590 | 19 | 1.6% |
11560 | 31 | 2.5% |
11545 | 31 | 2.5% |
11530 | 40 | 3.3% |
11500 | 76 |
집수면적
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 112 |
---|---|
Distinct (%) | 9.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 545.62796 |
Minimum | 0 |
---|---|
Maximum | 32140 |
Zeros | 976 |
Zeros (%) | 79.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 10.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 2980 |
Maximum | 32140 |
Range | 32140 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2751.5602 |
---|---|
Coefficient of variation (CV) | 5.0429237 |
Kurtosis | 56.973135 |
Mean | 545.62796 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 7.1361345 |
Sum | 667303 |
Variance | 7571083.4 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 976 | |
20 | 31 | 2.5% |
40 | 25 | 2.0% |
12 | 10 | 0.8% |
200 | 8 | 0.7% |
1000 | 5 | 0.4% |
7 | 5 | 0.4% |
600 | 5 | 0.4% |
6 | 4 | 0.3% |
60 | 4 | 0.3% |
Other values (102) | 150 | 12.3% |
Value | Count | Frequency (%) |
0 | 976 | |
4 | 2 | 0.2% |
6 | 4 | 0.3% |
7 | 5 | 0.4% |
8 | 3 | 0.2% |
10 | 2 | 0.2% |
12 | 10 | 0.8% |
13 | 3 | 0.2% |
19 | 1 | 0.1% |
20 | 31 | 2.5% |
Value | Count | Frequency (%) |
32140 | 1 | |
27140 | 1 | |
26060 | 1 | |
24000 | 1 | |
23964 | 1 | |
23600 | 1 | |
23278 | 2 | |
21040 | 1 | |
19000 | 2 | |
18208 | 2 |
처리용량
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 123 |
---|---|
Distinct (%) | 10.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.009812 |
Minimum | 0 |
---|---|
Maximum | 1520 |
Zeros | 302 |
Zeros (%) | 24.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 10.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 2 |
Q3 | 10 |
95-th percentile | 101.8 |
Maximum | 1520 |
Range | 1520 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 113.29591 |
---|---|
Coefficient of variation (CV) | 4.0448652 |
Kurtosis | 69.66391 |
Mean | 28.009812 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 7.5657625 |
Sum | 34256 |
Variance | 12835.964 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 302 | |
1 | 293 | |
2 | 129 | |
3 | 39 | 3.2% |
9 | 31 | 2.5% |
5 | 31 | 2.5% |
6 | 28 | 2.3% |
7 | 24 | 2.0% |
8 | 18 | 1.5% |
4 | 17 | 1.4% |
Other values (113) | 311 |
Value | Count | Frequency (%) |
0 | 302 | |
1 | 293 | |
2 | 129 | |
3 | 39 | 3.2% |
4 | 17 | 1.4% |
5 | 31 | 2.5% |
6 | 28 | 2.3% |
7 | 24 | 2.0% |
8 | 18 | 1.5% |
9 | 31 | 2.5% |
Value | Count | Frequency (%) |
1520 | 1 | |
1350 | 1 | |
1260 | 1 | |
1004 | 1 | |
915 | 1 | |
908 | 1 | |
830 | 1 | |
800 | 1 | |
720 | 1 | |
642 | 1 |
시설용량
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 260 |
---|---|
Distinct (%) | 21.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 135.8937 |
Minimum | 0 |
---|---|
Maximum | 3840 |
Zeros | 42 |
Zeros (%) | 3.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 10.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 13 |
Q3 | 148 |
95-th percentile | 643.8 |
Maximum | 3840 |
Range | 3840 |
Interquartile range (IQR) | 147 |
Descriptive statistics
Standard deviation | 285.8763 |
---|---|
Coefficient of variation (CV) | 2.1036758 |
Kurtosis | 37.252051 |
Mean | 135.8937 |
Median Absolute Deviation (MAD) | 12 |
Skewness | 4.7199771 |
Sum | 166198 |
Variance | 81725.257 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 340 | |
2 | 156 | 12.8% |
0 | 42 | 3.4% |
10 | 21 | 1.7% |
40 | 20 | 1.6% |
50 | 20 | 1.6% |
30 | 20 | 1.6% |
150 | 17 | 1.4% |
100 | 17 | 1.4% |
5 | 17 | 1.4% |
Other values (250) | 553 |
Value | Count | Frequency (%) |
0 | 42 | 3.4% |
1 | 340 | |
2 | 156 | |
3 | 8 | 0.7% |
4 | 9 | 0.7% |
5 | 17 | 1.4% |
6 | 5 | 0.4% |
7 | 1 | 0.1% |
8 | 8 | 0.7% |
9 | 1 | 0.1% |
Value | Count | Frequency (%) |
3840 | 1 | 0.1% |
3000 | 1 | 0.1% |
2000 | 1 | 0.1% |
1800 | 1 | 0.1% |
1607 | 1 | 0.1% |
1520 | 1 | 0.1% |
1400 | 4 | |
1357 | 1 | 0.1% |
1350 | 1 | 0.1% |
1316 | 1 | 0.1% |
이용량
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.7 KiB |
0 | |
---|---|
175 | 1 |
340 | 1 |
1300 | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0057236 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 1220 | |
175 | 1 | 0.1% |
340 | 1 | 0.1% |
1300 | 1 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 1220 | |
175 | 1 | 0.1% |
340 | 1 | 0.1% |
1300 | 1 | 0.1% |
설치일자
Date
MISSING
 
Distinct | 226 |
---|---|
Distinct (%) | 18.7% |
Missing | 16 |
Missing (%) | 1.3% |
Memory size | 9.7 KiB |
Minimum | 2003-01-01 00:00:00 |
---|---|
Maximum | 2024-01-04 00:00:00 |
대장관리번호 | 자치구명 | 자치구 코드 | 집수면적 | 처리용량 | 시설용량 | 이용량 | |
---|---|---|---|---|---|---|---|
대장관리번호 | 1.000 | 0.623 | 0.408 | 0.428 | 0.132 | 0.060 | 0.179 |
자치구명 | 0.623 | 1.000 | 1.000 | 0.347 | 0.209 | 0.255 | 0.290 |
자치구 코드 | 0.408 | 1.000 | 1.000 | 0.263 | 0.131 | 0.183 | 0.132 |
집수면적 | 0.428 | 0.347 | 0.263 | 1.000 | 0.000 | 0.675 | 0.480 |
처리용량 | 0.132 | 0.209 | 0.131 | 0.000 | 1.000 | 0.535 | 0.000 |
시설용량 | 0.060 | 0.255 | 0.183 | 0.675 | 0.535 | 1.000 | 0.237 |
이용량 | 0.179 | 0.290 | 0.132 | 0.480 | 0.000 | 0.237 | 1.000 |
자치구명 | 이용량 | |
---|---|---|
자치구명 | 1.000 | 0.156 |
이용량 | 0.156 | 1.000 |
대장관리번호 | 자치구 코드 | 집수면적 | 처리용량 | 시설용량 | 자치구명 | 이용량 | |
---|---|---|---|---|---|---|---|
대장관리번호 | 1.000 | -0.167 | -0.523 | 0.040 | -0.204 | 0.320 | 0.124 |
자치구 코드 | -0.167 | 1.000 | 0.062 | -0.201 | -0.035 | 0.994 | 0.069 |
집수면적 | -0.523 | 0.062 | 1.000 | -0.051 | 0.026 | 0.128 | 0.306 |
처리용량 | 0.040 | -0.201 | -0.051 | 1.000 | 0.737 | 0.081 | 0.000 |
시설용량 | -0.204 | -0.035 | 0.026 | 0.737 | 1.000 | 0.103 | 0.108 |
자치구명 | 0.320 | 0.994 | 0.128 | 0.081 | 0.103 | 1.000 | 0.156 |
이용량 | 0.124 | 0.069 | 0.306 | 0.000 | 0.108 | 0.156 | 1.000 |
대장관리번호 | 자치구명 | 자치구 코드 | 집수면적 | 처리용량 | 시설용량 | 이용량 | 설치일자 | |
---|---|---|---|---|---|---|---|---|
0 | 4162 | 강남구 | 11680 | 0 | 0 | 0 | 0 | 2024-01-04 00:00:00.0 |
1 | 4161 | 영등포구 | 11560 | 0 | 1 | 1 | 0 | 2023-05-24 00:00:00.0 |
2 | 4160 | 영등포구 | 11560 | 0 | 1 | 1 | 0 | 2023-05-24 00:00:00.0 |
3 | 4159 | 도봉구 | 11320 | 0 | 1 | 1 | 0 | 2023-05-02 00:00:00.0 |
4 | 4158 | 관악구 | 11620 | 0 | 1 | 1 | 0 | 2023-09-05 00:00:00.0 |
5 | 4157 | 관악구 | 11620 | 0 | 1 | 1 | 0 | 2023-09-05 00:00:00.0 |
6 | 4156 | 강서구 | 11500 | 0 | 1 | 1 | 0 | 2023-05-09 00:00:00.0 |
7 | 4155 | 관악구 | 11620 | 0 | 1 | 1 | 0 | 2023-05-17 00:00:00.0 |
8 | 4154 | 강북구 | 11305 | 0 | 1 | 1 | 0 | 2023-10-26 00:00:00.0 |
9 | 4153 | 강북구 | 11305 | 0 | 1 | 1 | 0 | 2023-05-02 00:00:00.0 |
대장관리번호 | 자치구명 | 자치구 코드 | 집수면적 | 처리용량 | 시설용량 | 이용량 | 설치일자 | |
---|---|---|---|---|---|---|---|---|
1213 | 115 | 성동구 | 11200 | 1760 | 5 | 88 | 0 | 2013-08-01 00:00:00.0 |
1214 | 106 | 도봉구 | 11320 | 1100 | 3 | 55 | 0 | 2013-05-01 00:00:00.0 |
1215 | 105 | 노원구 | 11350 | 9260 | 26 | 463 | 0 | 2013-08-01 00:00:00.0 |
1216 | 101 | 중구 | 11140 | 6400 | 18 | 320 | 0 | 2013-07-01 00:00:00.0 |
1217 | 100 | 종로구 | 11110 | 40 | 0 | 2 | 0 | 2013-06-01 00:00:00.0 |
1218 | 84 | 마포구 | 11440 | 600 | 2 | 30 | 0 | 2013-01-01 00:00:00.0 |
1219 | 71 | 서대문구 | 11410 | 80 | 0 | 4 | 0 | 2013-01-01 00:00:00.0 |
1220 | 68 | 서대문구 | 11410 | 40 | 0 | 2 | 0 | 2013-04-01 00:00:00.0 |
1221 | 67 | 서대문구 | 11410 | 20 | 0 | 1 | 0 | 2013-01-01 00:00:00.0 |
1222 | 14 | 중구 | 11140 | 24000 | 68 | 1200 | 1300 | 2013-03-15 00:00:00.0 |