Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 2622 |
Missing cells | 7 |
Missing cells (%) | < 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 156.3 KiB |
Average record size in memory | 61.0 B |
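The derived figures in this table can be cross-checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes. The variable names are illustrative and do not come from the report.

```python
# Cross-check the derived dataset statistics from the raw counts.
# Assumptions: the percentage is over all cells; 1 KiB = 1024 bytes.
n_variables = 7
n_observations = 2622
missing_cells = 7
total_size_kib = 156.3

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.3f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share falls well below 0.1% and the record size rounds to the reported 61.0 B.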
Variable types
Numeric | 5 |
---|---|
Categorical | 2 |
Dataset
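Description | Civil-petition application data (급수공사_신청승낙: water supply works, application approval) from the customer information system that the Busan Metropolitan City Waterworks Headquarters (부산광역시 상수도사업본부) operates to calculate and collect water and sewerage charges. |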
---|---|
Author | 부산광역시 상수도사업본부 |
URL | https://www.data.go.kr/data/15083686/fileData.do |
사업소코드 is highly overall correlated with 사업소명 | High correlation |
전수 is highly overall correlated with 상수도업종 | High correlation |
사업소명 is highly overall correlated with 사업소코드 | High correlation |
상수도업종 is highly overall correlated with 전수 | High correlation |
상수도업종 is highly imbalanced (68.6%) | Imbalance |
월사용량 is highly skewed (γ1 = 21.1077881) | Skewed |
전수 is highly skewed (γ1 = 27.07038882) | Skewed |
연번 has unique values | Unique |
월사용량 has 38 (1.4%) zeros | Zeros |
전수 has 27 (1.0%) zeros | Zeros |
Reproduction
Analysis started | 2024-03-14 19:14:23.894131 |
---|---|
Analysis finished | 2024-03-14 19:14:31.567563 |
Duration | 7.67 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
UNIQUE
Distinct | 2622 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1311.5 |
Minimum | 1 |
---|---|
Maximum | 2622 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 23.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 132.05 |
Q1 | 656.25 |
median | 1311.5 |
Q3 | 1966.75 |
95-th percentile | 2490.95 |
Maximum | 2622 |
Range | 2621 |
Interquartile range (IQR) | 1310.5 |
Descriptive statistics
Standard deviation | 757.05053 |
---|---|
Coefficient of variation (CV) | 0.5772402 |
Kurtosis | -1.2 |
Mean | 1311.5 |
Median Absolute Deviation (MAD) | 655.5 |
Skewness | 0 |
Sum | 3438753 |
Variance | 573125.5 |
Monotonicity | Strictly increasing |
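The statistics above are what a plain row counter would produce. As a hedged check, the sketch below assumes 연번 is exactly the sequence 1..2622 (consistent with the reported minimum, maximum, distinct count and strict monotonicity) and recomputes several figures with the standard library. The `inclusive` quantile method is used because it interpolates linearly between data points, which is an assumption about how the report computed its percentiles.

```python
import statistics

# Assumption: 연번 is a plain row counter 1..N with N = 2622.
n = 2622
serial = list(range(1, n + 1))

total = sum(serial)
mean = statistics.mean(serial)
variance = statistics.variance(serial)  # sample variance (n - 1 denominator)
stdev = statistics.stdev(serial)
# Linear interpolation between order statistics.
q1, median, q3 = statistics.quantiles(serial, n=4, method="inclusive")

print(total, mean, variance, round(stdev, 5))
print(q1, median, q3, q3 - q1)
```

The sum, mean, variance, quartiles and IQR recomputed this way agree with the reported values, which suggests the column carries no information beyond row order.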
Common values
Value | Count | Frequency (%) |
---|---|---|
1 | 1 | < 0.1% |
1763 | 1 | < 0.1% |
1745 | 1 | < 0.1% |
1746 | 1 | < 0.1% |
1747 | 1 | < 0.1% |
1748 | 1 | < 0.1% |
1749 | 1 | < 0.1% |
1750 | 1 | < 0.1% |
1751 | 1 | < 0.1% |
1752 | 1 | < 0.1% |
Other values (2612) | 2612 | 99.6% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
1 | 1 | < 0.1% |
2 | 1 | < 0.1% |
3 | 1 | < 0.1% |
4 | 1 | < 0.1% |
5 | 1 | < 0.1% |
6 | 1 | < 0.1% |
7 | 1 | < 0.1% |
8 | 1 | < 0.1% |
9 | 1 | < 0.1% |
10 | 1 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
2622 | 1 | < 0.1% |
2621 | 1 | < 0.1% |
2620 | 1 | < 0.1% |
2619 | 1 | < 0.1% |
2618 | 1 | < 0.1% |
2617 | 1 | < 0.1% |
2616 | 1 | < 0.1% |
2615 | 1 | < 0.1% |
2614 | 1 | < 0.1% |
2613 | 1 | < 0.1% |
사업소코드
Real number (ℝ)
HIGH CORRELATION
Distinct | 12 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 296.18078 |
Minimum | 201 |
---|---|
Maximum | 312 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 23.2 KiB |
Quantile statistics
Minimum | 201 |
---|---|
5-th percentile | 244 |
Q1 | 303 |
median | 307 |
Q3 | 311 |
95-th percentile | 312 |
Maximum | 312 |
Range | 111 |
Interquartile range (IQR) | 8 |
Descriptive statistics
Standard deviation | 25.699926 |
---|---|
Coefficient of variation (CV) | 0.08677108 |
Kurtosis | 1.4139794 |
Mean | 296.18078 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -1.7273778 |
Sum | 776586 |
Variance | 660.48619 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
244 | 450 | 17.2% |
311 | 437 | 16.7% |
312 | 422 | 16.1% |
306 | 271 | 10.3% |
307 | 233 | 8.9% |
304 | 228 | 8.7% |
308 | 164 | 6.3% |
309 | 150 | 5.7% |
301 | 105 | 4.0% |
303 | 75 | 2.9% |
Other values (2) | 87 | 3.3% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
201 | 20 | 0.8% |
244 | 450 | 17.2% |
301 | 105 | 4.0% |
302 | 67 | 2.6% |
303 | 75 | 2.9% |
304 | 228 | 8.7% |
306 | 271 | 10.3% |
307 | 233 | 8.9% |
308 | 164 | 6.3% |
309 | 150 | 5.7% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
312 | 422 | 16.1% |
311 | 437 | 16.7% |
309 | 150 | 5.7% |
308 | 164 | 6.3% |
307 | 233 | 8.9% |
306 | 271 | 10.3% |
304 | 228 | 8.7% |
303 | 75 | 2.9% |
302 | 67 | 2.6% |
301 | 105 | 4.0% |
사업소명
Categorical
HIGH CORRELATION
Distinct | 12 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 20.6 KiB |
동래통합사업소 | 450 |
---|---|
강서사업소 | 437 |
기장사업소 | 422 |
남부사업소 | 271 |
북부사업소 | 233 |
Other values (7) | 809 |
Length
Max length | 9 |
---|---|
Median length | 5 |
Mean length | 5.82418 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 동래통합사업소 |
---|---|
2nd row | 남부사업소 |
3rd row | 동래통합사업소 |
4th row | 동래통합사업소 |
5th row | 동래통합사업소 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
동래통합사업소 | 450 | 17.2% |
강서사업소 | 437 | 16.7% |
기장사업소 | 422 | 16.1% |
남부사업소 | 271 | 10.3% |
북부사업소 | 233 | 8.9% |
부산진 사업소 | 228 | 8.7% |
해운대사업소 | 164 | 6.3% |
사하사업소 | 150 | 5.7% |
중동부사업소 | 105 | 4.0% |
영도사업소 | 75 | 2.9% |
Other values (2) | 87 | 3.3% |
Words
Value | Count | Frequency (%) |
---|---|---|
동래통합사업소 | 450 | 15.4% |
강서사업소 | 437 | 15.0% |
기장사업소 | 422 | 14.5% |
사업소 | 295 | 10.1% |
남부사업소 | 271 | 9.3% |
북부사업소 | 233 | 8.0% |
부산진 | 228 | 7.8% |
해운대사업소 | 164 | 5.6% |
사하사업소 | 150 | 5.1% |
중동부사업소 | 105 | 3.6% |
Other values (3) | 162 | 5.6% |
상수도업종
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 6 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 20.6 KiB |
<NA> | 2186 |
---|---|
일반용 | 305 |
가정용 | 125 |
공업용수 | 3 |
업무용 | 2 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8348589 |
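The reported mean length only works out if the `<NA>` label is counted as a four-character string rather than skipped as missing, which would also explain why this variable shows 0 missing values. The sketch below is a hedged check of that reading: it weights each label's character length by the counts listed in the Common Values table for 상수도업종. The dictionary name is illustrative.

```python
# Category counts copied from the Common Values table for 상수도업종.
# Assumption: "<NA>" is stored as a literal 4-character string.
counts = {
    "<NA>": 2186,
    "일반용": 305,
    "가정용": 125,
    "공업용수": 3,
    "업무용": 2,
    "욕탕용": 1,
}

total = sum(counts.values())
# len() counts Unicode code points, so each Hangul syllable is one character.
mean_length = sum(len(label) * c for label, c in counts.items()) / total

print(total, round(mean_length, 7))
```

If `<NA>` really is a placeholder string, the true missing rate for this column is far higher than the 0.0% shown above, and the category statistics should be read with that in mind.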
Min length | 3 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
---|---|---|
<NA> | 2186 | 83.4% |
일반용 | 305 | 11.6% |
가정용 | 125 | 4.8% |
공업용수 | 3 | 0.1% |
업무용 | 2 | 0.1% |
욕탕용 | 1 | < 0.1% |
Words
Value | Count | Frequency (%) |
---|---|---|
na | 2186 | 83.4% |
일반용 | 305 | 11.6% |
가정용 | 125 | 4.8% |
공업용수 | 3 | 0.1% |
업무용 | 2 | 0.1% |
욕탕용 | 1 | < 0.1% |
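The 68.6% imbalance figure reported for 상수도업종 appears consistent with one minus the normalized Shannon entropy of the class shares. This is an interpretation checked against the counts above, not a statement of how the profiling tool is implemented. The function name `imbalance_score` is hypothetical.

```python
import math

# Class counts for 상수도업종, copied from the Common Values table.
counts = [2186, 305, 125, 3, 2, 1]


def imbalance_score(class_counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the shares.

    0 means perfectly balanced classes, 1 means a single class holds all rows.
    """
    total = sum(class_counts)
    k = len(class_counts)
    if k < 2:
        return 0.0
    entropy = -sum(
        (c / total) * math.log(c / total) for c in class_counts if c > 0
    )
    return 1 - entropy / math.log(k)


score = imbalance_score(counts)
print(f"{score:.1%}")
```

The ratio of entropies does not depend on the logarithm base, so natural logarithms are used here for simplicity.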
월사용량
Real number (ℝ)
SKEWED, ZEROS
Distinct | 66 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 99.827231 |
Minimum | 0 |
---|---|
Maximum | 26160 |
Zeros | 38 |
Zeros (%) | 1.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 23.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 10 |
median | 10 |
Q3 | 22.75 |
95-th percentile | 200 |
Maximum | 26160 |
Range | 26160 |
Interquartile range (IQR) | 12.75 |
Descriptive statistics
Standard deviation | 820.49361 |
---|---|
Coefficient of variation (CV) | 8.2191362 |
Kurtosis | 548.28918 |
Mean | 99.827231 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 21.107788 |
Sum | 261747 |
Variance | 673209.76 |
Monotonicity | Not monotonic |
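Several of the descriptive statistics are functions of one another, so they can be checked for internal consistency. The sketch below uses only figures printed in the 월사용량 tables and assumes the usual definitions (mean = sum / n, variance = std², CV = std / mean, IQR = Q3 − Q1). The `checks` dictionary is illustrative.

```python
import math

# Figures copied from the 월사용량 tables in this report.
n_rows = 2622
total = 261747
mean = 99.827231
std = 820.49361
q1, q3 = 10, 22.75

# Each entry maps a relation to (recomputed value, reported value).
checks = {
    "mean = sum / n": (total / n_rows, mean),
    "variance = std^2": (std ** 2, 673209.76),
    "CV = std / mean": (std / mean, 8.2191362),
    "IQR = Q3 - Q1": (q3 - q1, 12.75),
}

for name, (computed, reported) in checks.items():
    ok = math.isclose(computed, reported, rel_tol=1e-6)
    print(f"{name}: computed={computed:.6f} reported={reported} match={ok}")
```

A CV above 8 together with a median of 10 and a maximum of 26160 shows that a handful of very large values dominate the mean, which is why the report flags the column as skewed.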
Common values
Value | Count | Frequency (%) |
---|---|---|
10 | 1280 | 48.8% |
1 | 542 | 20.7% |
100 | 179 | 6.8% |
50 | 175 | 6.7% |
30 | 82 | 3.1% |
20 | 51 | 1.9% |
200 | 43 | 1.6% |
500 | 41 | 1.6% |
0 | 38 | 1.4% |
15 | 28 | 1.1% |
Other values (56) | 163 | 6.2% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 38 | 1.4% |
1 | 542 | 20.7% |
2 | 3 | 0.1% |
3 | 1 | < 0.1% |
4 | 3 | 0.1% |
5 | 13 | 0.5% |
10 | 1280 | 48.8% |
11 | 5 | 0.2% |
15 | 28 | 1.1% |
20 | 51 | 1.9% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
26160 | 1 | < 0.1% |
19500 | 1 | < 0.1% |
10000 | 4 | 0.2% |
9780 | 1 | < 0.1% |
5235 | 1 | < 0.1% |
5000 | 3 | 0.1% |
4500 | 1 | < 0.1% |
4080 | 1 | < 0.1% |
3000 | 2 | 0.1% |
2430 | 1 | < 0.1% |
구경
Real number (ℝ)
Distinct | 14 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 31.488177 |
Minimum | 13 |
---|---|
Maximum | 400 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 23.2 KiB |
Quantile statistics
Minimum | 13 |
---|---|
5-th percentile | 15 |
Q1 | 15 |
median | 15 |
Q3 | 25 |
95-th percentile | 100 |
Maximum | 400 |
Range | 387 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 37.117173 |
---|---|
Coefficient of variation (CV) | 1.1787654 |
Kurtosis | 23.781453 |
Mean | 31.488177 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.2921058 |
Sum | 82562 |
Variance | 1377.6845 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
15 | 1334 | 50.9% |
25 | 397 | 15.1% |
20 | 256 | 9.8% |
40 | 151 | 5.8% |
100 | 145 | 5.5% |
50 | 132 | 5.0% |
32 | 94 | 3.6% |
80 | 44 | 1.7% |
150 | 29 | 1.1% |
300 | 15 | 0.6% |
Other values (4) | 25 | 1.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
13 | 3 | 0.1% |
15 | 1334 | 50.9% |
20 | 256 | 9.8% |
25 | 397 | 15.1% |
32 | 94 | 3.6% |
40 | 151 | 5.8% |
50 | 132 | 5.0% |
80 | 44 | 1.7% |
100 | 145 | 5.5% |
150 | 29 | 1.1% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
400 | 1 | < 0.1% |
300 | 15 | 0.6% |
250 | 7 | 0.3% |
200 | 14 | 0.5% |
150 | 29 | 1.1% |
100 | 145 | 5.5% |
80 | 44 | 1.7% |
50 | 132 | 5.0% |
40 | 151 | 5.8% |
32 | 94 | 3.6% |
전수
Real number (ℝ)
HIGH CORRELATION, SKEWED, ZEROS
Distinct | 36 |
---|---|
Distinct (%) | 1.4% |
Missing | 7 |
Missing (%) | 0.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.4535373 |
Minimum | 0 |
---|---|
Maximum | 1360 |
Zeros | 27 |
Zeros (%) | 1.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 23.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 1 |
95-th percentile | 6 |
Maximum | 1360 |
Range | 1360 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 43.16906 |
---|---|
Coefficient of variation (CV) | 12.499955 |
Kurtosis | 765.58876 |
Mean | 3.4535373 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 27.070389 |
Sum | 9031 |
Variance | 1863.5678 |
Monotonicity | Not monotonic |
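Because 전수 has 7 missing cells, it matters which denominator the mean uses. The sketch below compares the two candidates using only figures from this report. It suggests, without confirming how the tool works internally, that the reported mean is taken over non-missing rows.

```python
# Figures copied from the 전수 tables in this report.
n_rows = 2622
n_missing = 7
total = 9031
reported_mean = 3.4535373

mean_all_rows = total / n_rows                    # missing cells counted in n
mean_non_missing = total / (n_rows - n_missing)   # missing cells skipped

print(round(mean_all_rows, 7), round(mean_non_missing, 7))
```

Only the second figure reproduces the reported mean, so the other statistics for this variable are presumably also based on the 2615 non-missing observations.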
Common values
Value | Count | Frequency (%) |
---|---|---|
1 | 2350 | |
8 | 43 | 1.6% |
2 | 32 | 1.2% |
4 | 31 | 1.2% |
0 | 27 | 1.0% |
5 | 20 | 0.8% |
3 | 19 | 0.7% |
6 | 12 | 0.5% |
7 | 11 | 0.4% |
12 | 8 | 0.3% |
Other values (26) | 62 | 2.4% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 27 | 1.0% |
1 | 2350 | |
2 | 32 | 1.2% |
3 | 19 | 0.7% |
4 | 31 | 1.2% |
5 | 20 | 0.8% |
6 | 12 | 0.5% |
7 | 11 | 0.4% |
8 | 43 | 1.6% |
9 | 4 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
1360 | 1 | < 0.1% |
1183 | 1 | < 0.1% |
1124 | 1 | < 0.1% |
481 | 1 | < 0.1% |
322 | 1 | < 0.1% |
130 | 1 | < 0.1% |
43 | 1 | < 0.1% |
42 | 2 | 0.1% |
40 | 3 | 0.1% |
39 | 1 | < 0.1% |
Correlations
Variable | 연번 | 사업소코드 | 사업소명 | 상수도업종 | 월사용량 | 구경 | 전수 |
---|---|---|---|---|---|---|---|
연번 | 1.000 | 0.060 | 0.086 | 0.071 | 0.000 | 0.000 | 0.071 |
사업소코드 | 0.060 | 1.000 | 1.000 | 0.000 | 0.000 | 0.133 | 0.072 |
사업소명 | 0.086 | 1.000 | 1.000 | 0.444 | 0.224 | 0.328 | 0.000 |
상수도업종 | 0.071 | 0.000 | 0.444 | 1.000 | 0.000 | 0.489 | NaN |
월사용량 | 0.000 | 0.000 | 0.224 | 0.000 | 1.000 | 0.310 | 0.000 |
구경 | 0.000 | 0.133 | 0.328 | 0.489 | 0.310 | 1.000 | 0.188 |
전수 | 0.071 | 0.072 | 0.000 | NaN | 0.000 | 0.188 | 1.000 |
Variable | 사업소명 | 상수도업종 |
---|---|---|
사업소명 | 1.000 | 0.262 |
상수도업종 | 0.262 | 1.000 |
Variable | 연번 | 사업소코드 | 월사용량 | 구경 | 전수 | 사업소명 | 상수도업종 |
---|---|---|---|---|---|---|---|
연번 | 1.000 | -0.013 | 0.017 | -0.010 | 0.011 | 0.036 | 0.028 |
사업소코드 | -0.013 | 1.000 | -0.001 | -0.188 | -0.101 | 0.998 | 0.278 |
월사용량 | 0.017 | -0.001 | 1.000 | 0.092 | 0.043 | 0.089 | 0.000 |
구경 | -0.010 | -0.188 | 0.092 | 1.000 | -0.240 | 0.145 | 0.340 |
전수 | 0.011 | -0.101 | 0.043 | -0.240 | 1.000 | 0.000 | 1.000 |
사업소명 | 0.036 | 0.998 | 0.089 | 0.145 | 0.000 | 1.000 | 0.262 |
상수도업종 | 0.028 | 0.278 | 0.000 | 0.340 | 1.000 | 0.262 | 1.000 |
First rows
Index | 연번 | 사업소코드 | 사업소명 | 상수도업종 | 월사용량 | 구경 | 전수 |
---|---|---|---|---|---|---|---|
0 | 1 | 244 | 동래통합사업소 | <NA> | 30 | 15 | 1 |
1 | 2 | 306 | 남부사업소 | <NA> | 1 | 25 | 1 |
2 | 3 | 244 | 동래통합사업소 | <NA> | 1 | 15 | 1 |
3 | 4 | 244 | 동래통합사업소 | <NA> | 1 | 15 | 1 |
4 | 5 | 244 | 동래통합사업소 | <NA> | 1 | 40 | 1 |
5 | 6 | 307 | 북부사업소 | <NA> | 30 | 32 | 1 |
6 | 7 | 312 | 기장사업소 | <NA> | 10 | 15 | 1 |
7 | 8 | 311 | 강서사업소 | <NA> | 10 | 15 | 1 |
8 | 9 | 312 | 기장사업소 | <NA> | 10 | 15 | 1 |
9 | 10 | 303 | 영도사업소 | <NA> | 10 | 15 | 1 |
Last rows
Index | 연번 | 사업소코드 | 사업소명 | 상수도업종 | 월사용량 | 구경 | 전수 |
---|---|---|---|---|---|---|---|
2612 | 2613 | 311 | 강서사업소 | <NA> | 100 | 100 | 2 |
2613 | 2614 | 307 | 북부사업소 | <NA> | 1 | 15 | 6 |
2614 | 2615 | 304 | 부산진 사업소 | <NA> | 10 | 25 | 1 |
2615 | 2616 | 312 | 기장사업소 | <NA> | 10 | 15 | 16 |
2616 | 2617 | 301 | 중동부사업소 | 가정용 | 0 | 15 | 1 |
2617 | 2618 | 307 | 북부사업소 | <NA> | 10 | 150 | 0 |
2618 | 2619 | 301 | 중동부사업소 | <NA> | 50 | 20 | 1 |
2619 | 2620 | 244 | 동래통합사업소 | <NA> | 30 | 20 | 1 |
2620 | 2621 | 244 | 동래통합사업소 | <NA> | 30 | 15 | 1 |
2621 | 2622 | 244 | 동래통합사업소 | <NA> | 10 | 25 | 1 |