Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 603 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 40.2 KiB |
Average record size in memory | 68.2 B |
Variable types
Numeric | 4 |
---|---|
Text | 1 |
Categorical | 3 |
Dataset
Description | 부산광역시 상수도사업본부에서 상하수도 요금 계산 및 징수를 위해 운영하는 수용가정보시스템에 사용되는 요금계산 관련 정보(추징계산 이력) 자료입니다. |
---|---|
Author | 부산광역시 상수도사업본부 |
URL | https://www.data.go.kr/data/15083669/fileData.do |
연번 is highly overall correlated with 추징발생년월 and 2 other fields | High correlation |
추징금액(상) is highly overall correlated with 추징금액(하) and 1 other fields | High correlation |
추징금액(하) is highly overall correlated with 추징금액(상) and 1 other fields | High correlation |
추징금액(물) is highly overall correlated with 추징금액(상) and 1 other fields | High correlation |
추징발생년월 is highly overall correlated with 연번 and 2 other fields | High correlation |
고지년월 is highly overall correlated with 연번 and 2 other fields | High correlation |
계산년월 is highly overall correlated with 연번 and 2 other fields | High correlation |
연번 has unique values | Unique |
추징금액(상) has 170 (28.2%) zeros | Zeros |
추징금액(하) has 97 (16.1%) zeros | Zeros |
추징금액(물) has 193 (32.0%) zeros | Zeros |
Reproduction
Analysis started | 2024-03-14 09:21:28.541103 |
---|---|
Analysis finished | 2024-03-14 09:21:32.370843 |
Duration | 3.83 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 603 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 302 |
Minimum | 1 |
---|---|
Maximum | 603 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.4 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 31.1 |
Q1 | 151.5 |
median | 302 |
Q3 | 452.5 |
95-th percentile | 572.9 |
Maximum | 603 |
Range | 602 |
Interquartile range (IQR) | 301 |
Descriptive statistics
Standard deviation | 174.21538 |
---|---|
Coefficient of variation (CV) | 0.57687213 |
Kurtosis | -1.2 |
Mean | 302 |
Median Absolute Deviation (MAD) | 151 |
Skewness | 0 |
Sum | 182106 |
Variance | 30351 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.2% |
398 | 1 | 0.2% |
400 | 1 | 0.2% |
401 | 1 | 0.2% |
402 | 1 | 0.2% |
403 | 1 | 0.2% |
404 | 1 | 0.2% |
405 | 1 | 0.2% |
406 | 1 | 0.2% |
407 | 1 | 0.2% |
Other values (593) | 593 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
603 | 1 | |
602 | 1 | |
601 | 1 | |
600 | 1 | |
599 | 1 | |
598 | 1 | |
597 | 1 | |
596 | 1 | |
595 | 1 | |
594 | 1 |
고객번호
Text
Distinct | 130 |
---|---|
Distinct (%) | 21.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.8 KiB |
Value | Count | Frequency (%) |
20*29 | 103 | |
94*17 | 53 | 8.8% |
21*88 | 51 | 8.5% |
87*20 | 36 | 6.0% |
87*15 | 27 | 4.5% |
98*78 | 26 | 4.3% |
11*32 | 22 | 3.6% |
95*02 | 17 | 2.8% |
95*98 | 17 | 2.8% |
53*28 | 13 | 2.2% |
Other values (120) | 238 |
Most occurring characters
Value | Count | Frequency (%) |
* | 1206 | |
2 | 425 | 11.7% |
9 | 350 | 9.7% |
8 | 308 | 8.5% |
1 | 298 | 8.2% |
0 | 278 | 7.7% |
7 | 231 | 6.4% |
5 | 188 | 5.2% |
3 | 134 | 3.7% |
4 | 125 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 2412 | |
Other Punctuation | 1206 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 425 | |
9 | 350 | |
8 | 308 | |
1 | 298 | |
0 | 278 | |
7 | 231 | |
5 | 188 | |
3 | 134 | 5.6% |
4 | 125 | 5.2% |
6 | 75 | 3.1% |
Other Punctuation
Value | Count | Frequency (%) |
* | 1206 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 3618 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
* | 1206 | |
2 | 425 | 11.7% |
9 | 350 | 9.7% |
8 | 308 | 8.5% |
1 | 298 | 8.2% |
0 | 278 | 7.7% |
7 | 231 | 6.4% |
5 | 188 | 5.2% |
3 | 134 | 3.7% |
4 | 125 | 3.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3618 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
* | 1206 | |
2 | 425 | 11.7% |
9 | 350 | 9.7% |
8 | 308 | 8.5% |
1 | 298 | 8.2% |
0 | 278 | 7.7% |
7 | 231 | 6.4% |
5 | 188 | 5.2% |
3 | 134 | 3.7% |
4 | 125 | 3.5% |
추징발생년월
Categorical
HIGH CORRELATION
 
Distinct | 12 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.8 KiB |
2023-04 | |
---|---|
2023-03 | |
2023-09 | |
2023-12 | |
2023-10 | |
Other values (7) |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2023-01 |
---|---|
2nd row | 2023-01 |
3rd row | 2023-01 |
4th row | 2023-01 |
5th row | 2023-01 |
Common Values
Value | Count | Frequency (%) |
2023-04 | 117 | |
2023-03 | 83 | |
2023-09 | 83 | |
2023-12 | 82 | |
2023-10 | 53 | |
2023-06 | 41 | 6.8% |
2023-05 | 39 | 6.5% |
2023-08 | 32 | 5.3% |
2023-02 | 25 | 4.1% |
2023-11 | 22 | 3.6% |
Other values (2) | 26 | 4.3% |
Length
Value | Count | Frequency (%) |
2023-04 | 117 | |
2023-03 | 83 | |
2023-09 | 83 | |
2023-12 | 82 | |
2023-10 | 53 | |
2023-06 | 41 | 6.8% |
2023-05 | 39 | 6.5% |
2023-08 | 32 | 5.3% |
2023-02 | 25 | 4.1% |
2023-11 | 22 | 3.6% |
Other values (2) | 26 | 4.3% |
고지년월
Categorical
HIGH CORRELATION
 
Distinct | 12 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.8 KiB |
2023-12 | |
---|---|
2023-03 | |
2023-10 | |
2023-09 | |
2023-06 | |
Other values (7) |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2023-01 |
---|---|
2nd row | 2023-01 |
3rd row | 2023-01 |
4th row | 2023-01 |
5th row | 2023-01 |
Common Values
Value | Count | Frequency (%) |
2023-12 | 100 | |
2023-03 | 79 | |
2023-10 | 71 | |
2023-09 | 65 | |
2023-06 | 59 | |
2023-04 | 55 | |
2023-08 | 51 | |
2023-05 | 42 | |
2023-11 | 36 | 6.0% |
2023-02 | 23 | 3.8% |
Other values (2) | 22 | 3.6% |
Length
Value | Count | Frequency (%) |
2023-12 | 100 | |
2023-03 | 79 | |
2023-10 | 71 | |
2023-09 | 65 | |
2023-06 | 59 | |
2023-04 | 55 | |
2023-08 | 51 | |
2023-05 | 42 | |
2023-11 | 36 | 6.0% |
2023-02 | 23 | 3.8% |
Other values (2) | 22 | 3.6% |
계산년월
Categorical
HIGH CORRELATION
 
Distinct | 13 |
---|---|
Distinct (%) | 2.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.8 KiB |
2023-11 | |
---|---|
2023-09 | |
2023-03 | |
2023-08 | |
2023-02 | |
Other values (8) |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2022-12 |
---|---|
2nd row | 2023-01 |
3rd row | 2023-01 |
4th row | 2022-12 |
5th row | 2023-01 |
Common Values
Value | Count | Frequency (%) |
2023-11 | 84 | |
2023-09 | 83 | |
2023-03 | 70 | |
2023-08 | 54 | |
2023-02 | 50 | |
2023-04 | 50 | |
2023-05 | 50 | |
2023-10 | 38 | |
2023-12 | 36 | |
2023-06 | 35 | |
Other values (3) | 53 |
Length
Value | Count | Frequency (%) |
2023-11 | 84 | |
2023-09 | 83 | |
2023-03 | 70 | |
2023-08 | 54 | |
2023-02 | 50 | |
2023-04 | 50 | |
2023-05 | 50 | |
2023-10 | 38 | |
2023-12 | 36 | |
2023-06 | 35 | |
Other values (3) | 53 |
추징금액(상)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 194 |
---|---|
Distinct (%) | 32.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10025.705 |
Minimum | -2023050 |
---|---|
Maximum | 1788990 |
Zeros | 170 |
Zeros (%) | 28.2% |
Negative | 329 |
Negative (%) | 54.6% |
Memory size | 5.4 KiB |
Quantile statistics
Minimum | -2023050 |
---|---|
5-th percentile | -49350 |
Q1 | -10570 |
median | -1880 |
Q3 | 0 |
95-th percentile | 141858 |
Maximum | 1788990 |
Range | 3812040 |
Interquartile range (IQR) | 10570 |
Descriptive statistics
Standard deviation | 185538.86 |
---|---|
Coefficient of variation (CV) | 18.506316 |
Kurtosis | 64.551054 |
Mean | 10025.705 |
Median Absolute Deviation (MAD) | 4360 |
Skewness | 1.8939531 |
Sum | 6045500 |
Variance | 3.4424668 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 170 | |
-4020 | 22 | 3.6% |
-1880 | 17 | 2.8% |
-11850 | 16 | 2.7% |
-49350 | 16 | 2.7% |
-1200 | 15 | 2.5% |
-6640 | 15 | 2.5% |
-20080 | 15 | 2.5% |
-39730 | 15 | 2.5% |
-7200 | 10 | 1.7% |
Other values (184) | 292 |
Value | Count | Frequency (%) |
-2023050 | 1 | |
-1000000 | 1 | |
-864180 | 1 | |
-494530 | 1 | |
-486140 | 1 | |
-470410 | 1 | |
-412000 | 1 | |
-291440 | 1 | |
-286920 | 1 | |
-214680 | 1 |
Value | Count | Frequency (%) |
1788990 | 2 | |
1648380 | 1 | 0.2% |
972120 | 1 | 0.2% |
600000 | 4 | |
500000 | 2 | |
404610 | 1 | 0.2% |
329580 | 3 | |
329540 | 1 | 0.2% |
243070 | 2 | |
200000 | 4 |
추징금액(하)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 306 |
---|---|
Distinct (%) | 50.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -35328.889 |
Minimum | -16155750 |
---|---|
Maximum | 15645610 |
Zeros | 97 |
Zeros (%) | 16.1% |
Negative | 415 |
Negative (%) | 68.8% |
Memory size | 5.4 KiB |
Quantile statistics
Minimum | -16155750 |
---|---|
5-th percentile | -87888 |
Q1 | -14620 |
median | -3000 |
Q3 | 0 |
95-th percentile | 140550 |
Maximum | 15645610 |
Range | 31801360 |
Interquartile range (IQR) | 14620 |
Descriptive statistics
Standard deviation | 1165779.7 |
---|---|
Coefficient of variation (CV) | -32.997916 |
Kurtosis | 137.9365 |
Mean | -35328.889 |
Median Absolute Deviation (MAD) | 4690 |
Skewness | -1.8703012 |
Sum | -21303320 |
Variance | 1.3590423 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 97 | 16.1% |
-2800 | 17 | 2.8% |
-4500 | 13 | 2.2% |
-1350 | 13 | 2.2% |
140550 | 9 | 1.5% |
-2250 | 9 | 1.5% |
-9190 | 8 | 1.3% |
-2920 | 8 | 1.3% |
-2020 | 7 | 1.2% |
-2470 | 7 | 1.2% |
Other values (296) | 415 |
Value | Count | Frequency (%) |
-16155750 | 1 | |
-10962830 | 1 | |
-8817560 | 1 | |
-5385250 | 1 | |
-1217180 | 1 | |
-1199980 | 1 | |
-835460 | 1 | |
-635250 | 1 | |
-547950 | 2 | |
-502320 | 1 |
Value | Count | Frequency (%) |
15645610 | 1 | |
8817560 | 1 | |
716400 | 1 | |
537080 | 1 | |
502320 | 1 | |
395550 | 1 | |
387000 | 1 | |
376200 | 1 | |
360000 | 1 | |
324050 | 1 |
추징금액(물)
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 172 |
---|---|
Distinct (%) | 28.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 905.87065 |
Minimum | -2204390 |
---|---|
Maximum | 2204390 |
Zeros | 193 |
Zeros (%) | 32.0% |
Negative | 319 |
Negative (%) | 52.9% |
Memory size | 5.4 KiB |
Quantile statistics
Minimum | -2204390 |
---|---|
5-th percentile | -7650 |
Q1 | -1740 |
median | -300 |
Q3 | 0 |
95-th percentile | 17950 |
Maximum | 2204390 |
Range | 4408780 |
Interquartile range (IQR) | 1740 |
Descriptive statistics
Standard deviation | 128989.9 |
---|---|
Coefficient of variation (CV) | 142.39329 |
Kurtosis | 283.25408 |
Mean | 905.87065 |
Median Absolute Deviation (MAD) | 610 |
Skewness | -0.0084849768 |
Sum | 546240 |
Variance | 1.6638395 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 193 | |
-610 | 22 | 3.6% |
-370 | 17 | 2.8% |
-7650 | 16 | 2.7% |
-6580 | 16 | 2.7% |
-2060 | 16 | 2.7% |
-1170 | 15 | 2.5% |
-3210 | 15 | 2.5% |
-450 | 10 | 1.7% |
-1510 | 9 | 1.5% |
Other values (162) | 274 |
Value | Count | Frequency (%) |
-2204390 | 1 | |
-233230 | 1 | |
-97830 | 1 | |
-75900 | 1 | |
-68690 | 2 | |
-57300 | 1 | |
-44420 | 1 | |
-34080 | 1 | |
-33540 | 1 | |
-33390 | 1 |
Value | Count | Frequency (%) |
2204390 | 1 | 0.2% |
210550 | 2 | |
175100 | 1 | 0.2% |
141500 | 1 | 0.2% |
131910 | 1 | 0.2% |
76070 | 4 | |
46650 | 1 | 0.2% |
37370 | 3 | |
37360 | 1 | 0.2% |
35100 | 1 | 0.2% |
연번 | 추징발생년월 | 고지년월 | 계산년월 | 추징금액(상) | 추징금액(하) | 추징금액(물) | |
---|---|---|---|---|---|---|---|
연번 | 1.000 | 0.940 | 0.881 | 0.813 | 0.145 | 0.091 | 0.301 |
추징발생년월 | 0.940 | 1.000 | 0.987 | 0.855 | 0.268 | 0.124 | 0.433 |
고지년월 | 0.881 | 0.987 | 1.000 | 0.924 | 0.260 | 0.133 | 0.313 |
계산년월 | 0.813 | 0.855 | 0.924 | 1.000 | 0.207 | 0.102 | 0.182 |
추징금액(상) | 0.145 | 0.268 | 0.260 | 0.207 | 1.000 | 0.451 | 0.300 |
추징금액(하) | 0.091 | 0.124 | 0.133 | 0.102 | 0.451 | 1.000 | 0.941 |
추징금액(물) | 0.301 | 0.433 | 0.313 | 0.182 | 0.300 | 0.941 | 1.000 |
고지년월 | 계산년월 | 추징발생년월 | |
---|---|---|---|
고지년월 | 1.000 | 0.691 | 0.783 |
계산년월 | 0.691 | 1.000 | 0.544 |
추징발생년월 | 0.783 | 0.544 | 1.000 |
연번 | 추징금액(상) | 추징금액(하) | 추징금액(물) | 추징발생년월 | 고지년월 | 계산년월 | |
---|---|---|---|---|---|---|---|
연번 | 1.000 | -0.019 | 0.123 | 0.016 | 0.777 | 0.634 | 0.511 |
추징금액(상) | -0.019 | 1.000 | 0.613 | 0.893 | 0.122 | 0.116 | 0.082 |
추징금액(하) | 0.123 | 0.613 | 1.000 | 0.661 | 0.038 | 0.057 | 0.041 |
추징금액(물) | 0.016 | 0.893 | 0.661 | 1.000 | 0.169 | 0.114 | 0.062 |
추징발생년월 | 0.777 | 0.122 | 0.038 | 0.169 | 1.000 | 0.783 | 0.544 |
고지년월 | 0.634 | 0.116 | 0.057 | 0.114 | 0.783 | 1.000 | 0.691 |
계산년월 | 0.511 | 0.082 | 0.041 | 0.062 | 0.544 | 0.691 | 1.000 |
연번 | 고객번호 | 추징발생년월 | 고지년월 | 계산년월 | 추징금액(상) | 추징금액(하) | 추징금액(물) | |
---|---|---|---|---|---|---|---|---|
0 | 1 | *11*53 | 2023-01 | 2023-01 | 2022-12 | -291440 | -4800 | -44420 |
1 | 2 | *19*30 | 2023-01 | 2023-01 | 2023-01 | 600000 | 0 | 76070 |
2 | 3 | *74*85 | 2023-01 | 2023-01 | 2023-01 | 137700 | 0 | 20980 |
3 | 4 | *77*32 | 2023-01 | 2023-01 | 2022-12 | 120000 | 0 | 0 |
4 | 5 | *59*77 | 2023-01 | 2023-01 | 2023-01 | -115140 | -195270 | -17720 |
5 | 6 | *04*06 | 2023-01 | 2023-02 | 2023-02 | -33340 | -52420 | -5120 |
6 | 7 | *35*02 | 2023-01 | 2023-01 | 2023-01 | 0 | 0 | -9160 |
7 | 8 | *41*80 | 2023-01 | 2023-01 | 2022-12 | 329580 | 198240 | 37370 |
8 | 9 | *30*60 | 2023-01 | 2023-01 | 2023-01 | 0 | -95940 | 0 |
9 | 10 | *30*60 | 2023-01 | 2023-02 | 2023-02 | 0 | -109850 | 0 |
연번 | 고객번호 | 추징발생년월 | 고지년월 | 계산년월 | 추징금액(상) | 추징금액(하) | 추징금액(물) | |
---|---|---|---|---|---|---|---|---|
593 | 594 | *95*98 | 2023-12 | 2023-12 | 2023-11 | -1880 | -2800 | -370 |
594 | 595 | *95*98 | 2023-12 | 2023-12 | 2023-11 | -1880 | -2800 | -370 |
595 | 596 | *95*98 | 2023-12 | 2023-12 | 2023-11 | -1880 | -2800 | -370 |
596 | 597 | *95*98 | 2023-12 | 2023-12 | 2023-11 | -1880 | -2800 | -370 |
597 | 598 | *95*98 | 2023-12 | 2023-12 | 2023-11 | -1880 | -2800 | -370 |
598 | 599 | *95*98 | 2023-12 | 2023-12 | 2023-11 | -1880 | -2800 | -370 |
599 | 600 | *34*65 | 2023-12 | 2023-12 | 2023-12 | 130060 | 166630 | 15570 |
600 | 601 | *12*54 | 2023-12 | 2023-12 | 2023-11 | 0 | 537080 | 141500 |
601 | 602 | *19*60 | 2023-12 | 2023-12 | 2023-12 | -6040 | -14840 | -510 |
602 | 603 | *15*16 | 2023-12 | 2023-12 | 2023-12 | 0 | 387000 | 0 |