Overview

Dataset statistics

Number of variables6
Number of observations21
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory58.1 B

Variable types

Numeric3
Categorical2
DateTime1

Dataset

Description부산광역시 상수도사업본부에서 상하수도 요금 계산 및 징수를 위해 운영하는 수용가정보시스템에 사용되는 민원 신청 정보(급수탑 사용 내역) 자료입니다.
Author부산광역시 상수도사업본부
URLhttps://www.data.go.kr/data/15083680/fileData.do

Alerts

급수탑사용량 is highly overall correlated with 급수탑사용액High correlation
급수탑사용액 is highly overall correlated with 급수탑사용량High correlation
사업소코드 is highly imbalanced (72.4%)Imbalance
사업소명 is highly imbalanced (72.4%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 19:56:05.383496
Analysis finished2024-03-14 19:56:08.277026
Duration2.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11
Minimum1
Maximum21
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size317.0 B
2024-03-15T04:56:08.465569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q16
median11
Q316
95-th percentile20
Maximum21
Range20
Interquartile range (IQR)10

Descriptive statistics

Standard deviation6.2048368
Coefficient of variation (CV)0.56407607
Kurtosis-1.2
Mean11
Median Absolute Deviation (MAD)5
Skewness0
Sum231
Variance38.5
MonotonicityStrictly increasing
2024-03-15T04:56:08.873784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
1 1
 
4.8%
2 1
 
4.8%
21 1
 
4.8%
20 1
 
4.8%
19 1
 
4.8%
18 1
 
4.8%
17 1
 
4.8%
16 1
 
4.8%
15 1
 
4.8%
14 1
 
4.8%
Other values (11) 11
52.4%
ValueCountFrequency (%)
1 1
4.8%
2 1
4.8%
3 1
4.8%
4 1
4.8%
5 1
4.8%
6 1
4.8%
7 1
4.8%
8 1
4.8%
9 1
4.8%
10 1
4.8%
ValueCountFrequency (%)
21 1
4.8%
20 1
4.8%
19 1
4.8%
18 1
4.8%
17 1
4.8%
16 1
4.8%
15 1
4.8%
14 1
4.8%
13 1
4.8%
12 1
4.8%

사업소코드
Categorical

IMBALANCE 

Distinct2
Distinct (%)9.5%
Missing0
Missing (%)0.0%
Memory size296.0 B
308
20 
312
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)4.8%

Sample

1st row308
2nd row308
3rd row308
4th row308
5th row308

Common Values

ValueCountFrequency (%)
308 20
95.2%
312 1
 
4.8%

Length

2024-03-15T04:56:09.266850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T04:56:09.438787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
308 20
95.2%
312 1
 
4.8%

사업소명
Categorical

IMBALANCE 

Distinct2
Distinct (%)9.5%
Missing0
Missing (%)0.0%
Memory size296.0 B
해운대사업소
20 
기장사업소
 
1

Length

Max length6
Median length6
Mean length5.952381
Min length5

Unique

Unique1 ?
Unique (%)4.8%

Sample

1st row해운대사업소
2nd row해운대사업소
3rd row해운대사업소
4th row해운대사업소
5th row해운대사업소

Common Values

ValueCountFrequency (%)
해운대사업소 20
95.2%
기장사업소 1
 
4.8%

Length

2024-03-15T04:56:09.624729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T04:56:09.846974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해운대사업소 20
95.2%
기장사업소 1
 
4.8%
Distinct20
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Memory size296.0 B
Minimum2013-05-20 00:00:00
Maximum2023-07-20 00:00:00
2024-03-15T04:56:10.153450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T04:56:10.589617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)

급수탑사용량
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)52.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.761905
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size317.0 B
2024-03-15T04:56:11.036424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5
Q17
median8
Q39
95-th percentile16
Maximum100
Range99
Interquartile range (IQR)2

Descriptive statistics

Standard deviation20.258097
Coefficient of variation (CV)1.5873882
Kurtosis19.733359
Mean12.761905
Median Absolute Deviation (MAD)1
Skewness4.38574
Sum268
Variance410.39048
MonotonicityNot monotonic
2024-03-15T04:56:11.325056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
8 5
23.8%
7 4
19.0%
9 3
14.3%
6 2
 
9.5%
5 1
 
4.8%
13 1
 
4.8%
15 1
 
4.8%
1 1
 
4.8%
100 1
 
4.8%
11 1
 
4.8%
ValueCountFrequency (%)
1 1
 
4.8%
5 1
 
4.8%
6 2
 
9.5%
7 4
19.0%
8 5
23.8%
9 3
14.3%
11 1
 
4.8%
13 1
 
4.8%
15 1
 
4.8%
16 1
 
4.8%
ValueCountFrequency (%)
100 1
 
4.8%
16 1
 
4.8%
15 1
 
4.8%
13 1
 
4.8%
11 1
 
4.8%
9 3
14.3%
8 5
23.8%
7 4
19.0%
6 2
 
9.5%
5 1
 
4.8%

급수탑사용액
Real number (ℝ)

HIGH CORRELATION 

Distinct15
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean38630.952
Minimum30
Maximum343180
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size317.0 B
2024-03-15T04:56:11.677971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum30
5-th percentile30
Q120590
median23540
Q327280
95-th percentile52860
Maximum343180
Range343150
Interquartile range (IQR)6690

Descriptive statistics

Standard deviation70788.391
Coefficient of variation (CV)1.8324268
Kurtosis19.647286
Mean38630.952
Median Absolute Deviation (MAD)3740
Skewness4.3713718
Sum811250
Variance5.0109963 × 109
MonotonicityNot monotonic
2024-03-15T04:56:12.078009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
23540 3
14.3%
24250 2
 
9.5%
27280 2
 
9.5%
20730 2
 
9.5%
30 2
 
9.5%
20590 1
 
4.8%
17650 1
 
4.8%
25140 1
 
4.8%
14710 1
 
4.8%
38500 1
 
4.8%
Other values (5) 5
23.8%
ValueCountFrequency (%)
30 2
9.5%
14710 1
 
4.8%
17650 1
 
4.8%
17770 1
 
4.8%
20590 1
 
4.8%
20730 2
9.5%
21220 1
 
4.8%
23540 3
14.3%
24250 2
9.5%
25140 1
 
4.8%
ValueCountFrequency (%)
343180 1
 
4.8%
52860 1
 
4.8%
44430 1
 
4.8%
38500 1
 
4.8%
27280 2
9.5%
25140 1
 
4.8%
24250 2
9.5%
23540 3
14.3%
21220 1
 
4.8%
20730 2
9.5%

Interactions

2024-03-15T04:56:07.295830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T04:56:05.619510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T04:56:06.619623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T04:56:07.503223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T04:56:06.101328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T04:56:06.881400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T04:56:07.737809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T04:56:06.355737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T04:56:07.120518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T04:56:12.428704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업소코드사업소명급수탑사용일자급수탑사용량급수탑사용액
연번1.0000.0000.0000.9690.6690.590
사업소코드0.0001.0000.6281.0000.2200.294
사업소명0.0000.6281.0001.0000.2200.294
급수탑사용일자0.9691.0001.0001.0000.6321.000
급수탑사용량0.6690.2200.2200.6321.0000.995
급수탑사용액0.5900.2940.2941.0000.9951.000
2024-03-15T04:56:12.719747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업소명사업소코드
사업소명1.0000.430
사업소코드0.4301.000
2024-03-15T04:56:13.001062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번급수탑사용량급수탑사용액사업소코드사업소명
연번1.0000.2820.0750.1620.162
급수탑사용량0.2821.0000.8230.3440.344
급수탑사용액0.0750.8231.0000.4590.459
사업소코드0.1620.3440.4591.0000.430
사업소명0.1620.3440.4590.4301.000

Missing values

2024-03-15T04:56:07.952799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T04:56:08.144504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업소코드사업소명급수탑사용일자급수탑사용량급수탑사용액
01308해운대사업소2014-07-18824250
12308해운대사업소2016-08-17720590
23308해운대사업소2016-10-28617650
34308해운대사업소2014-06-02927280
45308해운대사업소2015-08-24925140
56308해운대사업소2016-06-20514710
67308해운대사업소2016-07-25823540
78308해운대사업소2013-06-19720730
89308해운대사업소2013-07-221338500
910308해운대사업소2013-08-131544430
연번사업소코드사업소명급수탑사용일자급수탑사용량급수탑사용액
1112308해운대사업소2014-03-25927280
1213308해운대사업소2013-05-20617770
1314308해운대사업소2014-06-25721220
1415308해운대사업소2014-08-04824250
1516308해운대사업소2016-08-05823540
1617308해운대사업소2016-09-07823540
1718308해운대사업소2019-07-23130
1819308해운대사업소2023-07-20100343180
1920308해운대사업소2019-07-231130
2021312기장사업소2018-04-131652860