Overview

Dataset statistics

Number of variables9
Number of observations1434
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory108.0 KiB
Average record size in memory77.1 B

Variable types

Categorical2
DateTime2
Numeric5

Dataset

Description경상북도 구미시의 유수율제고블록시스템 수도요금검침집계(중블록) 테이블 데이터로 중블록에 따른 월별 사용량 데이터를 제공합니다.
Author경상북도 구미시
URLhttps://www.data.go.kr/data/15049706/fileData.do

Alerts

당월사용량 is highly overall correlated with 확정사용량 and 3 other fieldsHigh correlation
확정사용량 is highly overall correlated with 당월사용량 and 3 other fieldsHigh correlation
가사용량 is highly overall correlated with 당월사용량 and 3 other fieldsHigh correlation
조정량 is highly overall correlated with 당월사용량 and 3 other fieldsHigh correlation
급수전수 is highly overall correlated with 당월사용량 and 3 other fieldsHigh correlation
당월사용량 has 78 (5.4%) zerosZeros
확정사용량 has 78 (5.4%) zerosZeros
가사용량 has 82 (5.7%) zerosZeros
조정량 has 82 (5.7%) zerosZeros

Reproduction

Analysis started2023-12-12 08:27:04.621678
Analysis finished2023-12-12 08:27:08.905052
Duration4.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

중블록
Categorical

Distinct30
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size11.3 KiB
WH-1
 
89
SP-6
 
85
SP-2
 
85
WH-4
 
82
SP-1
 
81
Other values (25)
1012 

Length

Max length4
Median length4
Mean length3.9916318
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4F-1
2nd row4F-1
3rd row4F-1
4th rowHP-1
5th rowHP-1

Common Values

ValueCountFrequency (%)
WH-1 89
 
6.2%
SP-6 85
 
5.9%
SP-2 85
 
5.9%
WH-4 82
 
5.7%
SP-1 81
 
5.6%
4F-1 80
 
5.6%
ID-1 75
 
5.2%
SP-5 75
 
5.2%
ID-2 75
 
5.2%
SS-1 75
 
5.2%
Other values (20) 632
44.1%

Length

2023-12-12T17:27:09.014619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
wh-1 125
 
8.7%
wh-4 118
 
8.2%
sp-6 85
 
5.9%
sp-2 85
 
5.9%
sp-1 81
 
5.6%
4f-1 80
 
5.6%
id-1 75
 
5.2%
sp-5 75
 
5.2%
id-2 75
 
5.2%
ss-1 75
 
5.2%
Other values (18) 560
39.1%

업종
Categorical

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.3 KiB
가정용
556 
영업용
541 
욕탕용
337 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가정용
2nd row영업용
3rd row욕탕용
4th row가정용
5th row영업용

Common Values

ValueCountFrequency (%)
가정용 556
38.8%
영업용 541
37.7%
욕탕용 337
23.5%

Length

2023-12-12T17:27:09.186668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:27:09.340909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가정용 556
38.8%
영업용 541
37.7%
욕탕용 337
23.5%
Distinct25
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size11.3 KiB
Minimum2019-03-01 00:00:00
Maximum2021-03-01 00:00:00
2023-12-12T17:27:09.450452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:09.582546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
Distinct25
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size11.3 KiB
Minimum2019-04-01 00:00:00
Maximum2021-04-01 00:00:00
2023-12-12T17:27:09.734021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:09.878867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)

당월사용량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1158
Distinct (%)80.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean71028.817
Minimum0
Maximum691922
Zeros78
Zeros (%)5.4%
Negative0
Negative (%)0.0%
Memory size12.7 KiB
2023-12-12T17:27:10.036784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1171
median15303
Q391779.75
95-th percentile315651.5
Maximum691922
Range691922
Interquartile range (IQR)91608.75

Descriptive statistics

Standard deviation116812.15
Coefficient of variation (CV)1.644574
Kurtosis6.95707
Mean71028.817
Median Absolute Deviation (MAD)15298
Skewness2.4417386
Sum1.0185532 × 108
Variance1.3645078 × 1010
MonotonicityNot monotonic
2023-12-12T17:27:10.530360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 78
 
5.4%
4 13
 
0.9%
5 11
 
0.8%
1 9
 
0.6%
9 9
 
0.6%
3 8
 
0.6%
10 7
 
0.5%
6 6
 
0.4%
39 6
 
0.4%
8 6
 
0.4%
Other values (1148) 1281
89.3%
ValueCountFrequency (%)
0 78
5.4%
1 9
 
0.6%
2 5
 
0.3%
3 8
 
0.6%
4 13
 
0.9%
5 11
 
0.8%
6 6
 
0.4%
7 5
 
0.3%
8 6
 
0.4%
9 9
 
0.6%
ValueCountFrequency (%)
691922 1
0.1%
684127 1
0.1%
679247 1
0.1%
668493 1
0.1%
660618 1
0.1%
658411 1
0.1%
651928 1
0.1%
631049 1
0.1%
626545 1
0.1%
619791 1
0.1%

확정사용량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1158
Distinct (%)80.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean71028.817
Minimum0
Maximum691922
Zeros78
Zeros (%)5.4%
Negative0
Negative (%)0.0%
Memory size12.7 KiB
2023-12-12T17:27:10.719376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1171
median15303
Q391779.75
95-th percentile315651.5
Maximum691922
Range691922
Interquartile range (IQR)91608.75

Descriptive statistics

Standard deviation116812.15
Coefficient of variation (CV)1.644574
Kurtosis6.95707
Mean71028.817
Median Absolute Deviation (MAD)15298
Skewness2.4417386
Sum1.0185532 × 108
Variance1.3645078 × 1010
MonotonicityNot monotonic
2023-12-12T17:27:10.878093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 78
 
5.4%
4 13
 
0.9%
5 11
 
0.8%
1 9
 
0.6%
9 9
 
0.6%
3 8
 
0.6%
10 7
 
0.5%
6 6
 
0.4%
39 6
 
0.4%
8 6
 
0.4%
Other values (1148) 1281
89.3%
ValueCountFrequency (%)
0 78
5.4%
1 9
 
0.6%
2 5
 
0.3%
3 8
 
0.6%
4 13
 
0.9%
5 11
 
0.8%
6 6
 
0.4%
7 5
 
0.3%
8 6
 
0.4%
9 9
 
0.6%
ValueCountFrequency (%)
691922 1
0.1%
684127 1
0.1%
679247 1
0.1%
668493 1
0.1%
660618 1
0.1%
658411 1
0.1%
651928 1
0.1%
631049 1
0.1%
626545 1
0.1%
619791 1
0.1%

가사용량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1150
Distinct (%)80.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean71538.179
Minimum0
Maximum752238
Zeros82
Zeros (%)5.7%
Negative0
Negative (%)0.0%
Memory size12.7 KiB
2023-12-12T17:27:11.051556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1154
median15154
Q391428.25
95-th percentile315640.65
Maximum752238
Range752238
Interquartile range (IQR)91274.25

Descriptive statistics

Standard deviation119714.51
Coefficient of variation (CV)1.6734352
Kurtosis8.2550376
Mean71538.179
Median Absolute Deviation (MAD)15149
Skewness2.6044874
Sum1.0258575 × 108
Variance1.4331564 × 1010
MonotonicityNot monotonic
2023-12-12T17:27:11.201830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 82
 
5.7%
4 13
 
0.9%
5 11
 
0.8%
1 9
 
0.6%
9 9
 
0.6%
3 8
 
0.6%
10 7
 
0.5%
6 6
 
0.4%
39 6
 
0.4%
8 6
 
0.4%
Other values (1140) 1277
89.1%
ValueCountFrequency (%)
0 82
5.7%
1 9
 
0.6%
2 5
 
0.3%
3 8
 
0.6%
4 13
 
0.9%
5 11
 
0.8%
6 6
 
0.4%
7 5
 
0.3%
8 6
 
0.4%
9 9
 
0.6%
ValueCountFrequency (%)
752238 1
0.1%
742875 1
0.1%
725214 1
0.1%
724058 1
0.1%
705307 1
0.1%
686501 1
0.1%
682318 1
0.1%
665689 1
0.1%
664748 1
0.1%
659594 1
0.1%

조정량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1150
Distinct (%)80.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean71538.179
Minimum0
Maximum752238
Zeros82
Zeros (%)5.7%
Negative0
Negative (%)0.0%
Memory size12.7 KiB
2023-12-12T17:27:11.354828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1154
median15154
Q391428.25
95-th percentile315640.65
Maximum752238
Range752238
Interquartile range (IQR)91274.25

Descriptive statistics

Standard deviation119714.51
Coefficient of variation (CV)1.6734352
Kurtosis8.2550376
Mean71538.179
Median Absolute Deviation (MAD)15149
Skewness2.6044874
Sum1.0258575 × 108
Variance1.4331564 × 1010
MonotonicityNot monotonic
2023-12-12T17:27:11.498737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 82
 
5.7%
4 13
 
0.9%
5 11
 
0.8%
1 9
 
0.6%
9 9
 
0.6%
3 8
 
0.6%
10 7
 
0.5%
6 6
 
0.4%
39 6
 
0.4%
8 6
 
0.4%
Other values (1140) 1277
89.1%
ValueCountFrequency (%)
0 82
5.7%
1 9
 
0.6%
2 5
 
0.3%
3 8
 
0.6%
4 13
 
0.9%
5 11
 
0.8%
6 6
 
0.4%
7 5
 
0.3%
8 6
 
0.4%
9 9
 
0.6%
ValueCountFrequency (%)
752238 1
0.1%
742875 1
0.1%
725214 1
0.1%
724058 1
0.1%
705307 1
0.1%
686501 1
0.1%
682318 1
0.1%
665689 1
0.1%
664748 1
0.1%
659594 1
0.1%

급수전수
Real number (ℝ)

HIGH CORRELATION 

Distinct496
Distinct (%)34.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean727.65621
Minimum1
Maximum4480
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB
2023-12-12T17:27:11.654963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median212
Q31312
95-th percentile2794
Maximum4480
Range4479
Interquartile range (IQR)1309

Descriptive statistics

Standard deviation997.65763
Coefficient of variation (CV)1.3710563
Kurtosis2.0710302
Mean727.65621
Median Absolute Deviation (MAD)211
Skewness1.5575112
Sum1043459
Variance995320.75
MonotonicityNot monotonic
2023-12-12T17:27:11.824809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 184
 
12.8%
2 111
 
7.7%
3 103
 
7.2%
4 68
 
4.7%
7 31
 
2.2%
5 29
 
2.0%
212 18
 
1.3%
60 14
 
1.0%
59 11
 
0.8%
104 11
 
0.8%
Other values (486) 854
59.6%
ValueCountFrequency (%)
1 184
12.8%
2 111
7.7%
3 103
7.2%
4 68
 
4.7%
5 29
 
2.0%
6 2
 
0.1%
7 31
 
2.2%
8 1
 
0.1%
9 3
 
0.2%
10 3
 
0.2%
ValueCountFrequency (%)
4480 1
 
0.1%
4475 1
 
0.1%
4474 2
0.1%
4465 1
 
0.1%
4458 1
 
0.1%
4457 1
 
0.1%
4404 2
0.1%
4403 3
0.2%
4400 1
 
0.1%
4398 2
0.1%

Interactions

2023-12-12T17:27:07.972190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:05.176328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:05.835885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:06.476228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:07.251138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:08.088537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:05.311631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:05.955360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:06.703861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:07.386960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:08.238070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:05.449553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:06.057752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:06.869017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:07.525297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:08.371851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:05.577299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:06.161990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:07.006333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:07.676803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:08.501243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:05.705327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:06.327660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:07.133740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:27:07.826230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:27:11.938340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
중블록업종사용년월납기년월당월사용량확정사용량가사용량조정량급수전수
중블록1.0000.4840.0000.0000.7370.7370.6840.6840.830
업종0.4841.0000.0000.0000.5160.5160.6290.6290.669
사용년월0.0000.0001.0001.0000.0000.0000.0000.0000.000
납기년월0.0000.0001.0001.0000.0000.0000.0000.0000.000
당월사용량0.7370.5160.0000.0001.0001.0000.9360.9360.718
확정사용량0.7370.5160.0000.0001.0001.0000.9360.9360.718
가사용량0.6840.6290.0000.0000.9360.9361.0001.0000.836
조정량0.6840.6290.0000.0000.9360.9361.0001.0000.836
급수전수0.8300.6690.0000.0000.7180.7180.8360.8361.000
2023-12-12T17:27:12.060887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
중블록업종
중블록1.0000.255
업종0.2551.000
2023-12-12T17:27:12.167232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
당월사용량확정사용량가사용량조정량급수전수중블록업종
당월사용량1.0001.0000.9970.9970.8420.3270.361
확정사용량1.0001.0000.9970.9970.8420.3270.361
가사용량0.9970.9971.0001.0000.8390.3300.347
조정량0.9970.9971.0001.0000.8390.3300.347
급수전수0.8420.8420.8390.8391.0000.4880.380
중블록0.3270.3270.3300.3300.4881.0000.255
업종0.3610.3610.3470.3470.3800.2551.000

Missing values

2023-12-12T17:27:08.635256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:27:08.832447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

중블록업종사용년월납기년월당월사용량확정사용량가사용량조정량급수전수
04F-1가정용2019-03-012019-04-013328733328733324323324322621
14F-1영업용2019-03-012019-04-012354432354432355302355301851
24F-1욕탕용2019-03-012019-04-011761761761763
3HP-1가정용2019-03-012019-04-01155711557115549155491518
4HP-1영업용2019-03-012019-04-015701570157015701235
5ID-1가정용2019-03-012019-04-012658052658052656562656561529
6ID-1영업용2019-03-012019-04-01849978499784997849971195
7ID-1욕탕용2019-03-012019-04-0153835383538353832
8ID-2가정용2019-03-012019-04-0170594705947059470594515
9ID-2영업용2019-03-012019-04-0127010270102623726237325
중블록업종사용년월납기년월당월사용량확정사용량가사용량조정량급수전수
1424SP-1영업용2021-03-012021-04-01141414143
1425SP-2가정용2021-03-012021-04-0124424424424416
1426SP-2영업용2021-03-012021-04-01242424242
1427SP-6가정용2021-03-012021-04-0188881
1428SP-6영업용2021-03-012021-04-01353535354
1429SS-2가정용2021-03-012021-04-018181818110
1430SS-2영업용2021-03-012021-04-01333333331
1431WH-1가정용2021-03-012021-04-0100001
1432WH-1영업용2021-03-012021-04-011841841841847
1433WH-4영업용2021-03-012021-04-0115515587877