Overview

Dataset statistics

Number of variables: 6
Number of observations: 3638
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 177.8 KiB
Average record size in memory: 50.0 B

Variable types

Numeric: 2
Categorical: 2
DateTime: 2

Dataset

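Description: Complaint application records (billing disputes) from the customer information system that the Busan Metropolitan City Waterworks Headquarters operates to calculate and collect water and sewerage charges.
Author: 부산광역시 상수도사업본부 (Busan Metropolitan City Waterworks Headquarters)
URL: https://www.data.go.kr/data/15083682/fileData.do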

Alerts

사업소코드 is highly overall correlated with 사업소명 (High correlation)
사업소명 is highly overall correlated with 사업소코드 (High correlation)
연번 has unique values (Unique)
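
The two High correlation alerts are what one would expect if every office code maps to exactly one office name and vice versa. The following is a minimal Python sketch of that check, using only the first ten rows listed in the Sample section of this report. The helper name `is_one_to_one` is hypothetical and not part of ydata-profiling, and ten rows cannot confirm the property for all 3638 observations.

```python
from collections import defaultdict

# (사업소코드, 사업소명) pairs copied from the first ten rows of the Sample section.
rows = [
    (306, "남부사업소"),
    (244, "동래통합사업소"),
    (244, "동래통합사업소"),
    (244, "동래통합사업소"),
    (244, "동래통합사업소"),
    (244, "동래통합사업소"),
    (309, "사하사업소"),
    (307, "북부사업소"),
    (307, "북부사업소"),
    (307, "북부사업소"),
]


def is_one_to_one(pairs):
    """Return True if each code has one name and each name has one code."""
    names_by_code = defaultdict(set)
    codes_by_name = defaultdict(set)
    for code, name in pairs:
        names_by_code[code].add(name)
        codes_by_name[name].add(code)
    return (all(len(names) == 1 for names in names_by_code.values())
            and all(len(codes) == 1 for codes in codes_by_name.values()))


print(is_one_to_one(rows))
```

If the mapping holds across the full dataset, one of the two columns is redundant for modelling purposes and can be dropped or used only as a label.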

Reproduction

Analysis started: 2024-03-14 09:11:04.698615
Analysis finished: 2024-03-14 09:11:06.805854
Duration: 2.11 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
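
A report of this kind is typically produced with the `ProfileReport` class of ydata-profiling. The sketch below shows one way to do so with Python. It is not the configuration used for this report, because config.json is not reproduced here. The function name `build_report`, the file paths, the default encoding, and the report title are all assumptions chosen for illustration.

```python
def build_report(csv_path, html_path, encoding="utf-8"):
    """Profile a CSV file and write the result as an HTML report.

    Assumptions: the data is a single CSV file; the encoding may need to
    be changed (for example to "cp949") depending on how the file was
    published. Default profiling settings are used.
    """
    # Imports are deferred so that defining this helper does not require
    # the third-party packages to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Billing dispute records")  # hypothetical title
    report.to_file(html_path)
    return report


# Example call with hypothetical file names:
# build_report("billing_disputes.csv", "billing_disputes_report.html")
```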

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 3638
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1819.5
Minimum: 1
Maximum: 3638
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 32.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 182.85
Q1: 910.25
Median: 1819.5
Q3: 2728.75
95-th percentile: 3456.15
Maximum: 3638
Range: 3637
Interquartile range (IQR): 1818.5

Descriptive statistics

Standard deviation: 1050.3445
Coefficient of variation (CV): 0.57727094
Kurtosis: -1.2
Mean: 1819.5
Median Absolute Deviation (MAD): 909.5
Skewness: 0
Sum: 6619341
Variance: 1103223.5
Monotonicity: Strictly increasing
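
Because 연번 is unique, strictly increasing, and runs from 1 to 3638, the statistics above are consistent with the plain integer sequence 1, 2, ..., 3638. The Python sketch below recomputes them under that assumption, using the sample variance (n - 1 denominator) and linear interpolation for quantiles. The helper `linear_quantile` is written for this illustration and is not taken from ydata-profiling.

```python
import math
import statistics

n = 3638                          # number of observations in the report
values = list(range(1, n + 1))    # assumption: 연번 is exactly 1, 2, ..., n


def linear_quantile(sorted_values, q):
    """Quantile with linear interpolation between the two closest ranks."""
    pos = q * (len(sorted_values) - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])


mean = statistics.mean(values)
variance = statistics.variance(values)      # sample variance, n - 1 denominator
std = math.sqrt(variance)
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])
q1 = linear_quantile(values, 0.25)
q3 = linear_quantile(values, 0.75)
p05 = linear_quantile(values, 0.05)
p95 = linear_quantile(values, 0.95)

print("sum:", sum(values))
print("mean:", mean, "median:", median, "MAD:", mad)
print("variance:", variance, "std:", round(std, 4), "CV:", round(std / mean, 6))
print("5%:", round(p05, 2), "Q1:", q1, "Q3:", q3, "95%:", round(p95, 2), "IQR:", q3 - q1)
```

For a sequence of this form the closed-form results are sum = n(n + 1)/2, mean = (n + 1)/2, and sample variance = n(n + 1)/12, which match the reported 6619341, 1819.5, and 1103223.5.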
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2446 | 1 | < 0.1%
2420 | 1 | < 0.1%
2421 | 1 | < 0.1%
2422 | 1 | < 0.1%
2423 | 1 | < 0.1%
2424 | 1 | < 0.1%
2425 | 1 | < 0.1%
2426 | 1 | < 0.1%
2427 | 1 | < 0.1%
Other values (3628) | 3628 | 99.7%

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Value | Count | Frequency (%)
3638 | 1 | < 0.1%
3637 | 1 | < 0.1%
3636 | 1 | < 0.1%
3635 | 1 | < 0.1%
3634 | 1 | < 0.1%
3633 | 1 | < 0.1%
3632 | 1 | < 0.1%
3631 | 1 | < 0.1%
3630 | 1 | < 0.1%
3629 | 1 | < 0.1%

사업소코드 (office code)
Real number (ℝ)

HIGH CORRELATION

Distinct: 11
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 296.2735
Minimum: 244
Maximum: 312
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 32.1 KiB

Quantile statistics

Minimum: 244
5-th percentile: 244
Q1: 301
Median: 306
Q3: 308
95-th percentile: 311
Maximum: 312
Range: 68
Interquartile range (IQR): 7

Descriptive statistics

Standard deviation: 22.932309
Coefficient of variation (CV): 0.077402498
Kurtosis: 1.3454427
Mean: 296.2735
Median Absolute Deviation (MAD): 3
Skewness: -1.792447
Sum: 1077843
Variance: 525.89081
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=11)

Value | Count | Frequency (%)
244 | 578 | 15.9%
307 | 461 | 12.7%
301 | 422 | 11.6%
306 | 408 | 11.2%
311 | 368 | 10.1%
308 | 316 | 8.7%
303 | 297 | 8.2%
304 | 294 | 8.1%
309 | 209 | 5.7%
312 | 162 | 4.5%

Value | Count | Frequency (%)
244 | 578 | 15.9%
301 | 422 | 11.6%
302 | 123 | 3.4%
303 | 297 | 8.2%
304 | 294 | 8.1%
306 | 408 | 11.2%
307 | 461 | 12.7%
308 | 316 | 8.7%
309 | 209 | 5.7%
311 | 368 | 10.1%

Value | Count | Frequency (%)
312 | 162 | 4.5%
311 | 368 | 10.1%
309 | 209 | 5.7%
308 | 316 | 8.7%
307 | 461 | 12.7%
306 | 408 | 11.2%
304 | 294 | 8.1%
303 | 297 | 8.2%
302 | 123 | 3.4%
301 | 422 | 11.6%
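
Since 사업소코드 takes only 11 distinct values, its summary statistics can be recovered from the frequency tables alone. The Python sketch below does this with the counts listed above. The dictionary `counts` is assembled by hand from those tables and is the only input.

```python
import statistics

# Value counts for 사업소코드, copied from the frequency tables above.
counts = {
    244: 578, 301: 422, 302: 123, 303: 297, 304: 294, 306: 408,
    307: 461, 308: 316, 309: 209, 311: 368, 312: 162,
}

n = sum(counts.values())                              # number of observations
total = sum(code * c for code, c in counts.items())   # sum of all codes
mean = total / n

# Expand the table back into one sorted value per observation.
expanded = sorted(code for code, c in counts.items() for _ in range(c))
median = statistics.median(expanded)
share_244 = 100 * counts[244] / n                     # frequency of code 244 in %

print(n, total, round(mean, 4), median, round(share_244, 1))
```

The weighted sum and the count reproduce the reported Sum (1077843), Mean (296.2735), and Median (306), which is a quick consistency check on the tables.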

사업소명 (office name)
Categorical

HIGH CORRELATION

Distinct: 11
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 28.5 KiB

동래통합사업소: 578
북부사업소: 461
중동부사업소: 422
남부사업소: 408
강서사업소: 368
Other values (6): 1401

Length

Max length: 9
Median length: 5
Mean length: 5.8982958
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 남부사업소
2nd row: 동래통합사업소
3rd row: 동래통합사업소
4th row: 동래통합사업소
5th row: 동래통합사업소

Common Values

Value | Count | Frequency (%)
동래통합사업소 | 578 | 15.9%
북부사업소 | 461 | 12.7%
중동부사업소 | 422 | 11.6%
남부사업소 | 408 | 11.2%
강서사업소 | 368 | 10.1%
해운대사업소 | 316 | 8.7%
영도사업소 | 297 | 8.2%
부산진 사업소 | 294 | 8.1%
사하사업소 | 209 | 5.7%
기장사업소 | 162 | 4.5%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
동래통합사업소 | 578 | 14.3%
북부사업소 | 461 | 11.4%
중동부사업소 | 422 | 10.4%
사업소 | 417 | 10.3%
남부사업소 | 408 | 10.1%
강서사업소 | 368 | 9.1%
해운대사업소 | 316 | 7.8%
영도사업소 | 297 | 7.3%
부산진 | 294 | 7.3%
사하사업소 | 209 | 5.2%
Other values (2) | 285 | 7.0%

요금이의시작년월 (billing dispute start year-month)
DateTime

Distinct: 51
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 28.5 KiB
Minimum: 2013-03-01 00:00:00
Maximum: 2023-12-01 00:00:00
Histogram with fixed size bins (bins=50)

요금이의만료년월 (billing dispute end year-month)
DateTime

Distinct: 51
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 28.5 KiB
Minimum: 2013-03-01 00:00:00
Maximum: 2023-12-01 00:00:00
Histogram with fixed size bins (bins=50)

경정사유 (reason for correction)
Categorical

Distinct: 21
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 28.5 KiB

기타: 1560
옥내누수감면액 정산: 634
검침착오: 495
인정조정과다: 375
수급자등감면착오: 186
Other values (16): 388

Length

Max length: 10
Median length: 9
Mean length: 4.9219351
Min length: 2

Unique

Unique: 2
Unique (%): 0.1%

Sample

1st row: 옥내누수감면액 정산
2nd row: 인정조정과다
3rd row: 검침착오
4th row: 검침착오
5th row: 분할승인

Common Values

Value | Count | Frequency (%)
기타 | 1560 | 42.9%
옥내누수감면액 정산 | 634 | 17.4%
검침착오 | 495 | 13.6%
인정조정과다 | 375 | 10.3%
수급자등감면착오 | 186 | 5.1%
사용량합산착오 | 79 | 2.2%
수복정산착오 | 60 | 1.6%
잔량처리(기타) | 49 | 1.3%
오탁수에 의한 감면 | 43 | 1.2%
하수도감량(률)정정 | 32 | 0.9%
Other values (11) | 125 | 3.4%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
기타 | 1560 | 35.8%
옥내누수감면액 | 634 | 14.5%
정산 | 634 | 14.5%
검침착오 | 495 | 11.4%
인정조정과다 | 375 | 8.6%
수급자등감면착오 | 186 | 4.3%
사용량합산착오 | 79 | 1.8%
수복정산착오 | 60 | 1.4%
잔량처리(기타 | 49 | 1.1%
오탁수에 | 43 | 1.0%
Other values (14) | 243 | 5.6%
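
The second frequency table for 경정사유 lists fragments such as 옥내누수감면액, 정산, and 잔량처리(기타 rather than whole category values. This is consistent with the values being split on whitespace and stripped of leading and trailing punctuation. The Python sketch below applies that rule to the ten most common values from the Common Values table. The tokenization rule is an assumption inferred from the table, not a statement about ydata-profiling internals, and `word_counts` is a hypothetical helper.

```python
import string
from collections import Counter

# Ten most common 경정사유 values and their counts, from the Common Values table.
common_values = {
    "기타": 1560,
    "옥내누수감면액 정산": 634,
    "검침착오": 495,
    "인정조정과다": 375,
    "수급자등감면착오": 186,
    "사용량합산착오": 79,
    "수복정산착오": 60,
    "잔량처리(기타)": 49,
    "오탁수에 의한 감면": 43,
    "하수도감량(률)정정": 32,
}


def word_counts(value_counts):
    """Split each value on whitespace, strip edge punctuation, sum the counts."""
    words = Counter()
    for value, count in value_counts.items():
        for token in value.split():
            token = token.strip(string.punctuation)
            if token:
                words[token] += count
    return words


words = word_counts(common_values)
for word, count in words.most_common(5):
    print(word, count)
```

Under this rule a two-word value contributes its count to both words, which is why 옥내누수감면액 and 정산 each appear with 634, and why the percentages in the word table use a larger denominator than the 3638 rows.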

Interactions

[Four interaction plots; images not preserved]

Correlations

Variable | 연번 | 사업소코드 | 사업소명 | 요금이의시작년월 | 요금이의만료년월 | 경정사유
연번 | 1.000 | 0.129 | 0.227 | 0.650 | 0.650 | 0.255
사업소코드 | 0.129 | 1.000 | 1.000 | 0.125 | 0.125 | 0.220
사업소명 | 0.227 | 1.000 | 1.000 | 0.383 | 0.383 | 0.423
요금이의시작년월 | 0.650 | 0.125 | 0.383 | 1.000 | 1.000 | 0.194
요금이의만료년월 | 0.650 | 0.125 | 0.383 | 1.000 | 1.000 | 0.194
경정사유 | 0.255 | 0.220 | 0.423 | 0.194 | 0.194 | 1.000

Variable | 사업소명 | 경정사유
사업소명 | 1.000 | 0.167
경정사유 | 0.167 | 1.000

Variable | 연번 | 사업소코드 | 사업소명 | 경정사유
연번 | 1.000 | 0.108 | 0.098 | 0.083
사업소코드 | 0.108 | 1.000 | 0.999 | 0.212
사업소명 | 0.098 | 0.999 | 1.000 | 0.167
경정사유 | 0.083 | 0.212 | 0.167 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

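(index) | 연번 | 사업소코드 | 사업소명 | 요금이의시작년월 | 요금이의만료년월 | 경정사유
0 | 1 | 306 | 남부사업소 | 2023-01 | 2023-01 | 옥내누수감면액 정산
1 | 2 | 244 | 동래통합사업소 | 2022-11 | 2022-11 | 인정조정과다
2 | 3 | 244 | 동래통합사업소 | 2023-01 | 2023-01 | 검침착오
3 | 4 | 244 | 동래통합사업소 | 2022-09 | 2022-09 | 검침착오
4 | 5 | 244 | 동래통합사업소 | 2022-12 | 2022-12 | 분할승인
5 | 6 | 244 | 동래통합사업소 | 2023-01 | 2023-01 | 검침착오
6 | 7 | 309 | 사하사업소 | 2023-03 | 2023-03 | 오탁수에 의한 감면
7 | 8 | 307 | 북부사업소 | 2023-03 | 2023-03 | 옥내누수감면액 정산
8 | 9 | 307 | 북부사업소 | 2023-03 | 2023-03 | 기타
9 | 10 | 307 | 북부사업소 | 2023-03 | 2023-03 | 옥내누수감면액 정산

(index) | 연번 | 사업소코드 | 사업소명 | 요금이의시작년월 | 요금이의만료년월 | 경정사유
3628 | 3629 | 244 | 동래통합사업소 | 2023-09 | 2023-09 | 검침착오
3629 | 3630 | 244 | 동래통합사업소 | 2023-09 | 2023-09 | 기타
3630 | 3631 | 303 | 영도사업소 | 2023-09 | 2023-09 | 기타
3631 | 3632 | 303 | 영도사업소 | 2023-09 | 2023-09 | 수급자등감면착오
3632 | 3633 | 244 | 동래통합사업소 | 2023-09 | 2023-09 | 하수도감량(률)정정
3633 | 3634 | 301 | 중동부사업소 | 2023-09 | 2023-09 | 기타
3634 | 3635 | 309 | 사하사업소 | 2018-09 | 2018-09 | 인정조정과다
3635 | 3636 | 244 | 동래통합사업소 | 2023-07 | 2023-07 | 기타
3636 | 3637 | 311 | 강서사업소 | 2023-12 | 2023-12 | 사용량합산착오
3637 | 3638 | 311 | 강서사업소 | 2023-12 | 2023-12 | 기타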