Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 737 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 49.1 KiB |
Average record size in memory | 68.2 B |
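The average record size follows directly from the two figures above it. A minimal arithmetic check, assuming the report divides the total in-memory size by the number of observations (variable names are illustrative):

```python
# Figures copied from the "Dataset statistics" table.
total_kib = 49.1   # Total size in memory, in KiB
n_rows = 737       # Number of observations

# 1 KiB = 1024 bytes, so convert before dividing by the row count.
avg_bytes = total_kib * 1024 / n_rows
print(f"{avg_bytes:.1f} B per record")  # prints "68.2 B per record"
```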
Variable types
Numeric | 4 |
---|---|
Text | 1 |
DateTime | 3 |
Dataset
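Description | 부산광역시상수도사업본부_수용가정보시스템_요금계산관련정보_추징계산이력_20230126 (Busan Metropolitan City Waterworks Headquarters, customer information system, billing calculation information, back-charge calculation history, 2023-01-26) |
---|---|
Author | 부산광역시 상수도사업본부 (Busan Metropolitan City Waterworks Headquarters) |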
URL | http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15083669 |
Alerts
추징금액(상) is highly overall correlated with 추징금액(물) | High correlation |
추징금액(하) is highly overall correlated with 추징금액(물) | High correlation |
추징금액(물) is highly overall correlated with 추징금액(상) and 1 other field | High correlation |
추징금액(하) is highly skewed (γ1 = 24.31880472) | Skewed |
연번 has unique values | Unique |
추징금액(상) has 142 (19.3%) zeros | Zeros |
추징금액(하) has 198 (26.9%) zeros | Zeros |
추징금액(물) has 291 (39.5%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 17:13:32.499279 |
---|---|
Analysis finished | 2023-12-10 17:13:38.201687 |
Duration | 5.7 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
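The Reproduction table records only timings and the tool version. As a rough sketch of how a comparable report could be regenerated, assuming the dataset has been downloaded as a CSV file from the URL listed under Dataset: the function name `build_report`, the file paths, the report title and the `cp949` encoding are all assumptions for illustration and do not come from the report itself.

```python
def build_report(csv_path: str, output_html: str = "report.html",
                 encoding: str = "cp949"):
    """Profile a CSV export of the dataset and write an HTML report.

    csv_path, output_html and encoding are hypothetical defaults;
    adjust them to match the file actually downloaded.
    """
    # Imported lazily so this snippet loads even where the
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Keep the masked customer number as text so values such as
    # "*54*01" are never coerced to a numeric type.
    df = pd.read_csv(csv_path, encoding=encoding, dtype={"고객번호": str})

    report = ProfileReport(df, title="추징계산이력 profile")
    report.to_file(output_html)
    return report

# Example call (not executed here):
# build_report("busan_backcharge_20230126.csv")
```

Column types in the regenerated report may differ from the ones shown here unless the three year-month columns are parsed as dates before profiling.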
연번 (serial number)
Real number (ℝ)
UNIQUE
Distinct | 737 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 369 |
Minimum | 1 |
---|---|
Maximum | 737 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 37.8 |
Q1 | 185 |
median | 369 |
Q3 | 553 |
95-th percentile | 700.2 |
Maximum | 737 |
Range | 736 |
Interquartile range (IQR) | 368 |
Descriptive statistics
Standard deviation | 212.89786 |
---|---|
Coefficient of variation (CV) | 0.57695898 |
Kurtosis | -1.2 |
Mean | 369 |
Median Absolute Deviation (MAD) | 184 |
Skewness | 0 |
Sum | 271953 |
Variance | 45325.5 |
Monotonicity | Strictly increasing |
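Because 연번 is unique, strictly increasing, and spans 1 to 737 with 737 distinct values, it must be the plain sequence 1, 2, ..., 737, so the statistics above can be checked without the data. A small sketch under that reading, assuming sample variance (n − 1 denominator) and linearly interpolated percentiles; the helper `percentile` is illustrative:

```python
import math
import statistics

values = range(1, 738)          # 1..737, no gaps
n = len(values)

total = sum(values)                      # 271953
mean = total / n                         # 369.0
variance = statistics.variance(values)   # sample variance, 45325.5
std = math.sqrt(variance)
cv = std / mean                          # coefficient of variation

def percentile(sorted_vals, q):
    """Linear-interpolation percentile for q in [0, 1]."""
    pos = q * (len(sorted_vals) - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (pos - lo) * (sorted_vals[hi] - sorted_vals[lo])

p05 = percentile(values, 0.05)   # about 37.8
p95 = percentile(values, 0.95)   # about 700.2
print(total, mean, variance, round(std, 5), round(cv, 8))
```

The zero skewness and the kurtosis of about -1.2 are likewise what a uniformly spaced sequence is expected to give, so this column carries no information beyond row order.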
Value | Count | Frequency (%) |
1 | 1 | 0.1% |
496 | 1 | 0.1% |
487 | 1 | 0.1% |
488 | 1 | 0.1% |
489 | 1 | 0.1% |
490 | 1 | 0.1% |
491 | 1 | 0.1% |
492 | 1 | 0.1% |
493 | 1 | 0.1% |
494 | 1 | 0.1% |
Other values (727) | 727 | 98.6% |
Value | Count | Frequency (%) |
1 | 1 | 0.1% |
2 | 1 | 0.1% |
3 | 1 | 0.1% |
4 | 1 | 0.1% |
5 | 1 | 0.1% |
6 | 1 | 0.1% |
7 | 1 | 0.1% |
8 | 1 | 0.1% |
9 | 1 | 0.1% |
10 | 1 | 0.1% |
Value | Count | Frequency (%) |
737 | 1 | 0.1% |
736 | 1 | 0.1% |
735 | 1 | 0.1% |
734 | 1 | 0.1% |
733 | 1 | 0.1% |
732 | 1 | 0.1% |
731 | 1 | 0.1% |
730 | 1 | 0.1% |
729 | 1 | 0.1% |
728 | 1 | 0.1% |
고객번호 (customer number, masked)
Text
Distinct | 104 |
---|---|
Distinct (%) | 14.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.9 KiB |
Value | Count | Frequency (%) |
*30*58 | 242 | 32.8% |
*50*11 | 174 | 23.6% |
*54*01 | 26 | 3.5% |
*02*08 | 20 | 2.7% |
*78*85 | 18 | 2.4% |
*17*43 | 16 | 2.2% |
*18*35 | 13 | 1.8% |
*02*90 | 13 | 1.8% |
*12*84 | 12 | 1.6% |
*94*90 | 11 | 1.5% |
Other values (94) | 192 | 26.1% |
Most occurring characters
Value | Count | Frequency (%) |
* | 1474 | 33.3% |
0 | 622 | 14.1% |
5 | 543 | 12.3% |
1 | 523 | 11.8% |
8 | 379 | 8.6% |
3 | 343 | 7.8% |
7 | 126 | 2.8% |
9 | 126 | 2.8% |
4 | 122 | 2.8% |
2 | 96 | 2.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 2948 | 66.7% |
Other Punctuation | 1474 | 33.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 622 | 21.1% |
5 | 543 | 18.4% |
1 | 523 | 17.7% |
8 | 379 | 12.9% |
3 | 343 | 11.6% |
7 | 126 | 4.3% |
9 | 126 | 4.3% |
4 | 122 | 4.1% |
2 | 96 | 3.3% |
6 | 68 | 2.3% |
Other Punctuation
Value | Count | Frequency (%) |
* | 1474 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4422 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
* | 1474 | 33.3% |
0 | 622 | 14.1% |
5 | 543 | 12.3% |
1 | 523 | 11.8% |
8 | 379 | 8.6% |
3 | 343 | 7.8% |
7 | 126 | 2.8% |
9 | 126 | 2.8% |
4 | 122 | 2.8% |
2 | 96 | 2.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4422 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
* | 1474 | 33.3% |
0 | 622 | 14.1% |
5 | 543 | 12.3% |
1 | 523 | 11.8% |
8 | 379 | 8.6% |
3 | 343 | 7.8% |
7 | 126 | 2.8% |
9 | 126 | 2.8% |
4 | 122 | 2.8% |
2 | 96 | 2.2% |
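The character tables above group every character of 고객번호 by its Unicode general category. The totals are consistent with each of the 737 six-character values holding two asterisks (1474 = 2 × 737) and four digits (2948 = 4 × 737). A minimal sketch of that kind of tally using the standard `unicodedata` module, run on four masked IDs taken from the sample rows of this report rather than on the full column:

```python
import unicodedata
from collections import Counter

# Masked customer numbers as they appear in the sample rows.
sample_ids = ["*17*10", "*26*63", "*54*01", "*59*95"]

char_counts = Counter()
category_counts = Counter()
for customer_id in sample_ids:
    for ch in customer_id:
        char_counts[ch] += 1
        # 'Nd' is Decimal Number, 'Po' is Other Punctuation.
        category_counts[unicodedata.category(ch)] += 1

print(char_counts.most_common(3))
print(dict(category_counts))
```

With these four IDs the tally gives 8 characters in `Po` and 16 in `Nd`, the same 1:2 ratio as the full-column totals.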
추징발생년월 (back-charge occurrence year-month)
Date
Distinct | 12 |
---|---|
Distinct (%) | 1.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.9 KiB |
Minimum | 2022-01-01 00:00:00 |
---|---|
Maximum | 2022-12-01 00:00:00 |
고지년월 (billing notice year-month)
Date
Distinct | 13 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.9 KiB |
Minimum | 2022-01-01 00:00:00 |
---|---|
Maximum | 2023-01-01 00:00:00 |
계산년월 (calculation year-month)
Date
Distinct | 14 |
---|---|
Distinct (%) | 1.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.9 KiB |
Minimum | 2021-12-01 00:00:00 |
---|---|
Maximum | 2023-01-01 00:00:00 |
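For each of the three date columns, the distinct count can be compared with the number of calendar months between the minimum and maximum. A short sketch of that check, assuming every value is a first-of-month date as the minimum and maximum suggest (the helper `months_inclusive` is illustrative):

```python
from datetime import date

def months_inclusive(start: date, end: date) -> int:
    """Number of calendar months from start to end, counting both ends."""
    return (end.year - start.year) * 12 + (end.month - start.month) + 1

# (minimum, maximum, distinct count) as reported for each column.
columns = {
    "추징발생년월": (date(2022, 1, 1), date(2022, 12, 1), 12),
    "고지년월": (date(2022, 1, 1), date(2023, 1, 1), 13),
    "계산년월": (date(2021, 12, 1), date(2023, 1, 1), 14),
}

spans = {}
for name, (lo, hi, distinct) in columns.items():
    spans[name] = months_inclusive(lo, hi)
    status = "no missing months" if spans[name] == distinct else "gaps"
    print(name, spans[name], distinct, status)
```

Under that assumption all three spans (12, 13 and 14 months) equal the reported distinct counts, so no month in any range is absent.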
추징금액(상) (back-charge amount; 상 presumably 상수도, water supply)
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 198 |
---|---|
Distinct (%) | 26.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2312.7273 |
Minimum | -3000000 |
---|---|
Maximum | 3787810 |
Zeros | 142 |
Zeros (%) | 19.3% |
Negative | 531 |
Negative (%) | 72.0% |
Memory size | 6.6 KiB |
Quantile statistics
Minimum | -3000000 |
---|---|
5-th percentile | -23584 |
Q1 | -6960 |
median | -1920 |
Q3 | 0 |
95-th percentile | 60490 |
Maximum | 3787810 |
Range | 6787810 |
Interquartile range (IQR) | 6960 |
Descriptive statistics
Standard deviation | 230908.53 |
---|---|
Coefficient of variation (CV) | 99.842524 |
Kurtosis | 184.96832 |
Mean | 2312.7273 |
Median Absolute Deviation (MAD) | 2050 |
Skewness | 6.1784181 |
Sum | 1704480 |
Variance | 5.3318748 × 10^10 |
Monotonicity | Not monotonic |
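Several of the figures above are derived from others in the same tables, which makes them easy to cross-check. A small sketch using only the reported values for 추징금액(상) (variable names are illustrative):

```python
n = 737
total = 1704480                       # Sum
std = 230908.53                       # Standard deviation
minimum, maximum = -3000000, 3787810
q1, q3 = -6960, 0

mean = total / n                      # about 2312.7273
cv = std / mean                       # about 99.84
value_range = maximum - minimum       # 6787810
iqr = q3 - q1                         # 6960
variance = std ** 2                   # about 5.33e10

print(round(mean, 4), round(cv, 3), value_range, iqr, f"{variance:.4e}")
```

The coefficient of variation near 100 reflects a mean that is tiny relative to the spread: the median is negative (-1920) while the mean is positive, so a handful of large positive amounts pull the mean above zero.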
Value | Count | Frequency (%) |
0 | 142 | 19.3% |
-1200 | 121 | 16.4% |
-6960 | 17 | 2.3% |
-15600 | 17 | 2.3% |
-5880 | 14 | 1.9% |
-5160 | 14 | 1.9% |
-4800 | 14 | 1.9% |
-7680 | 13 | 1.8% |
-3090 | 12 | 1.6% |
-7200 | 11 | 1.5% |
Other values (188) | 362 | 49.1% |
Value | Count | Frequency (%) |
-3000000 | 1 | 0.1% |
-1230380 | 1 | 0.1% |
-1000000 | 1 | 0.1% |
-686900 | 1 | 0.1% |
-591200 | 1 | 0.1% |
-515390 | 1 | 0.1% |
-435240 | 1 | 0.1% |
-194130 | 1 | 0.1% |
-162270 | 1 | 0.1% |
-142260 | 1 | 0.1% |
Value | Count | Frequency (%) |
3787810 | 1 | 0.1% |
3159000 | 1 | 0.1% |
600000 | 1 | 0.1% |
598360 | 1 | 0.1% |
571900 | 1 | 0.1% |
526540 | 1 | 0.1% |
329580 | 1 | 0.1% |
200000 | 1 | 0.1% |
198000 | 4 | 0.5% |
187120 | 1 | 0.1% |
추징금액(하) (back-charge amount; 하 presumably 하수도, sewerage)
Real number (ℝ)
HIGH CORRELATION, SKEWED, ZEROS
Distinct | 236 |
---|---|
Distinct (%) | 32.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9332.4966 |
Minimum | -921370 |
---|---|
Maximum | 12249360 |
Zeros | 198 |
Zeros (%) | 26.9% |
Negative | 492 |
Negative (%) | 66.8% |
Memory size | 6.6 KiB |
Quantile statistics
Minimum | -921370 |
---|---|
5-th percentile | -48958 |
Q1 | -5950 |
median | -2470 |
Q3 | 0 |
95-th percentile | 14110 |
Maximum | 12249360 |
Range | 13170730 |
Interquartile range (IQR) | 5950 |
Descriptive statistics
Standard deviation | 469409.46 |
---|---|
Coefficient of variation (CV) | 50.29838 |
Kurtosis | 631.30017 |
Mean | 9332.4966 |
Median Absolute Deviation (MAD) | 2470 |
Skewness | 24.318805 |
Sum | 6878050 |
Variance | 2.2034524 × 10^11 |
Monotonicity | Not monotonic |
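The skewness of 24.3 is what triggers the Skewed alert for this column. As a hedged sketch of how such a coefficient is computed, the function below implements the adjusted Fisher-Pearson estimator, which is the one pandas uses for `Series.skew`; that the report relies on the same estimator is an assumption here, and the toy inputs are made up for illustration.

```python
import math

def adjusted_skewness(xs):
    """Adjusted Fisher-Pearson sample skewness (G1)."""
    n = len(xs)
    if n < 3:
        raise ValueError("need at least three values")
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n   # second central moment
    m3 = sum((x - mean) ** 3 for x in xs) / n   # third central moment
    if m2 == 0:
        return 0.0                              # constant data
    g1 = m3 / m2 ** 1.5                         # biased skewness
    # Small-sample correction factor.
    return math.sqrt(n * (n - 1)) / (n - 2) * g1

symmetric = [1, 2, 3, 4, 5]        # toy data, not from the dataset
one_outlier = [0, 0, 0, 0, 100]    # toy data, not from the dataset
print(adjusted_skewness(symmetric), adjusted_skewness(one_outlier))
```

A single extreme positive value is enough to push the coefficient well above zero, which matches the shape of this column: the maximum of 12249360 is more than five times the next largest value.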
Value | Count | Frequency (%) |
0 | 198 | 26.9% |
-4500 | 21 | 2.8% |
-3600 | 18 | 2.4% |
-4050 | 17 | 2.3% |
-4270 | 17 | 2.3% |
-2250 | 15 | 2.0% |
-2470 | 14 | 1.9% |
-2920 | 13 | 1.8% |
-16530 | 10 | 1.4% |
-220 | 9 | 1.2% |
Other values (226) | 405 | 55.0% |
Value | Count | Frequency (%) |
-921370 | 1 | 0.1% |
-911180 | 1 | 0.1% |
-906750 | 1 | 0.1% |
-826190 | 1 | 0.1% |
-604500 | 1 | 0.1% |
-380820 | 1 | 0.1% |
-271800 | 1 | 0.1% |
-257040 | 1 | 0.1% |
-207560 | 1 | 0.1% |
-187680 | 1 | 0.1% |
Value | Count | Frequency (%) |
12249360 | 1 | 0.1% |
2349000 | 1 | 0.1% |
1452700 | 1 | 0.1% |
198240 | 1 | 0.1% |
174760 | 1 | 0.1% |
150000 | 4 | 0.5% |
100260 | 1 | 0.1% |
74620 | 1 | 0.1% |
65960 | 3 | 0.4% |
62910 | 1 | 0.1% |
추징금액(물) (back-charge amount; 물 presumably 물이용부담금, water-use charge)
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 151 |
---|---|
Distinct (%) | 20.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | -332.42877 |
Minimum | -380350 |
---|---|
Maximum | 603860 |
Zeros | 291 |
Zeros (%) | 39.5% |
Negative | 397 |
Negative (%) | 53.9% |
Memory size | 6.6 KiB |
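The report lists zero and negative counts for the three amount columns but not the positive counts. Since no values are missing, the positives follow by subtraction. A minimal sketch using the reported counts:

```python
n = 737  # observations, none missing

# Zero and negative counts as reported for each amount column.
columns = {
    "추징금액(상)": {"zeros": 142, "negative": 531},
    "추징금액(하)": {"zeros": 198, "negative": 492},
    "추징금액(물)": {"zeros": 291, "negative": 397},
}

for name, counts in columns.items():
    # Whatever is neither zero nor negative must be positive.
    counts["positive"] = n - counts["zeros"] - counts["negative"]
    shares = {k: round(100 * v / n, 1) for k, v in counts.items()}
    print(name, counts["positive"], shares)
```

This gives 64, 47 and 49 positive rows respectively, so in every column fewer than 9% of the amounts are positive.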
Quantile statistics
Minimum | -380350 |
---|---|
5-th percentile | -3034 |
Q1 | -1200 |
median | -150 |
Q3 | 0 |
95-th percentile | 4780 |
Maximum | 603860 |
Range | 984210 |
Interquartile range (IQR) | 1200 |
Descriptive statistics
Standard deviation | 29006.773 |
---|---|
Coefficient of variation (CV) | -87.257109 |
Kurtosis | 302.26003 |
Mean | -332.42877 |
Median Absolute Deviation (MAD) | 320 |
Skewness | 8.4384567 |
Sum | -245000 |
Variance | 8.4139289 × 10^8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 291 | 39.5% |
-1200 | 20 | 2.7% |
-1500 | 17 | 2.3% |
-750 | 15 | 2.0% |
-1920 | 14 | 1.9% |
-820 | 14 | 1.9% |
-70 | 13 | 1.8% |
-970 | 13 | 1.8% |
-470 | 13 | 1.8% |
-980 | 12 | 1.6% |
Other values (141) | 315 | 42.7% |
Value | Count | Frequency (%) |
-380350 | 1 | 0.1% |
-241900 | 1 | 0.1% |
-103780 | 1 | 0.1% |
-69360 | 1 | 0.1% |
-60770 | 1 | 0.1% |
-48580 | 1 | 0.1% |
-26080 | 1 | 0.1% |
-23440 | 1 | 0.1% |
-19760 | 1 | 0.1% |
-17610 | 1 | 0.1% |
Value | Count | Frequency (%) |
603860 | 1 | 0.1% |
76070 | 1 | 0.1% |
67080 | 1 | 0.1% |
64110 | 1 | 0.1% |
62360 | 1 | 0.1% |
37370 | 1 | 0.1% |
22340 | 1 | 0.1% |
20700 | 4 | 0.5% |
14910 | 6 | 0.8% |
13690 | 1 | 0.1% |
Variable | 연번 | 추징발생년월 | 고지년월 | 계산년월 | 추징금액(상) | 추징금액(하) | 추징금액(물) |
---|---|---|---|---|---|---|---|
연번 | 1.000 | 0.891 | 0.832 | 0.752 | 0.069 | 0.006 | 0.168 |
추징발생년월 | 0.891 | 1.000 | 0.921 | 0.811 | 0.114 | 0.000 | 0.184 |
고지년월 | 0.832 | 0.921 | 1.000 | 0.934 | 0.192 | 0.000 | 0.163 |
계산년월 | 0.752 | 0.811 | 0.934 | 1.000 | 0.160 | 0.000 | 0.116 |
추징금액(상) | 0.069 | 0.114 | 0.192 | 0.160 | 1.000 | 0.706 | 0.916 |
추징금액(하) | 0.006 | 0.000 | 0.000 | 0.000 | 0.706 | 1.000 | 0.658 |
추징금액(물) | 0.168 | 0.184 | 0.163 | 0.116 | 0.916 | 0.658 | 1.000 |
Variable | 연번 | 추징금액(상) | 추징금액(하) | 추징금액(물) |
---|---|---|---|---|
연번 | 1.000 | 0.171 | 0.102 | 0.199 |
추징금액(상) | 0.171 | 1.000 | 0.483 | 0.839 |
추징금액(하) | 0.102 | 0.483 | 1.000 | 0.616 |
추징금액(물) | 0.199 | 0.839 | 0.616 | 1.000 |
Index | 연번 | 고객번호 | 추징발생년월 | 고지년월 | 계산년월 | 추징금액(상) | 추징금액(하) | 추징금액(물) |
---|---|---|---|---|---|---|---|---|
0 | 1 | *17*10 | 2022-01 | 2022-01 | 2021-12 | -9860 | 0 | 0 |
1 | 2 | *26*63 | 2022-01 | 2022-01 | 2022-01 | 60490 | 50470 | 9210 |
2 | 3 | *54*01 | 2022-01 | 2022-01 | 2021-12 | -6070 | 0 | 0 |
3 | 4 | *54*01 | 2022-01 | 2022-01 | 2022-01 | -7050 | 0 | 0 |
4 | 5 | *54*01 | 2022-01 | 2022-03 | 2022-02 | -1280 | -840 | 0 |
5 | 6 | *54*01 | 2022-01 | 2022-03 | 2022-03 | 0 | -2320 | -610 |
6 | 7 | *54*01 | 2022-01 | 2022-05 | 2022-04 | 0 | -3190 | -830 |
7 | 8 | *54*01 | 2022-01 | 2022-05 | 2022-05 | 0 | -2650 | -830 |
8 | 9 | *54*01 | 2022-01 | 2022-07 | 2022-06 | 0 | 0 | -710 |
9 | 10 | *59*95 | 2022-01 | 2022-03 | 2022-02 | -770 | 0 | 0 |
Index | 연번 | 고객번호 | 추징발생년월 | 고지년월 | 계산년월 | 추징금액(상) | 추징금액(하) | 추징금액(물) |
---|---|---|---|---|---|---|---|---|
727 | 728 | *03*91 | 2022-12 | 2022-12 | 2022-12 | -11910 | -7110 | -2180 |
728 | 729 | *14*39 | 2022-12 | 2022-12 | 2022-12 | 1280 | 0 | 0 |
729 | 730 | *12*57 | 2022-12 | 2022-12 | 2022-12 | -7200 | -4500 | -1500 |
730 | 731 | *99*63 | 2022-12 | 2022-12 | 2022-11 | 21880 | 14110 | 4560 |
731 | 732 | *11*09 | 2022-12 | 2022-12 | 2022-12 | 92760 | 45120 | 12180 |
732 | 733 | *18*35 | 2022-12 | 2022-12 | 2022-11 | -3550 | -5280 | -680 |
733 | 734 | *18*35 | 2022-12 | 2022-12 | 2022-11 | -3550 | -5280 | -680 |
734 | 735 | *18*35 | 2022-12 | 2022-12 | 2022-11 | -3550 | -2700 | -680 |
735 | 736 | *18*35 | 2022-12 | 2022-12 | 2022-12 | 0 | -2580 | 0 |
736 | 737 | *18*35 | 2022-12 | 2022-12 | 2022-11 | -3550 | -5280 | -680 |
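The correlation matrices earlier in the report do not say which coefficient each one uses. As an illustration only, the sketch below computes a Spearman rank correlation (chosen here as an assumption, since it tolerates the heavy tails seen in the amount columns) on the 20 sample rows shown above. Twenty rows out of 737 are not representative, so the results are not expected to match the report's matrices.

```python
import math

# (상, 하, 물) amounts from the 20 sample rows shown above.
rows = [
    (-9860, 0, 0), (60490, 50470, 9210), (-6070, 0, 0), (-7050, 0, 0),
    (-1280, -840, 0), (0, -2320, -610), (0, -3190, -830), (0, -2650, -830),
    (0, 0, -710), (-770, 0, 0),
    (-11910, -7110, -2180), (1280, 0, 0), (-7200, -4500, -1500),
    (21880, 14110, 4560), (92760, 45120, 12180), (-3550, -5280, -680),
    (-3550, -5280, -680), (-3550, -2700, -680), (0, -2580, 0),
    (-3550, -5280, -680),
]

def average_ranks(xs):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    ranks = [0.0] * len(xs)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and xs[order[j + 1]] == xs[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson(a, b):
    """Pearson correlation of two equal-length sequences."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb)

def spearman(a, b):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(a), average_ranks(b))

sang, ha, mul = (list(col) for col in zip(*rows))
print(round(spearman(sang, ha), 3),
      round(spearman(sang, mul), 3),
      round(spearman(ha, mul), 3))
```

Ranking first means the single extreme values in each column count no more than any other row, which is the main reason to prefer a rank-based coefficient for data this skewed.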