Overview

Dataset statistics

Number of variables8
Number of observations26
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory71.1 B

Variable types

Categorical5
Text1
Numeric1
DateTime1

Dataset

Description상하수도 요금정보를 제공합니다. 상/하수도별, 업종별, 단계별, 사용량별 금액을 제공하고, 관리기관과 연락처 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15093688/fileData.do

Alerts

관리기관 has constant value ""Constant
연락처 has constant value ""Constant
데이터기준일자 has constant value ""Constant
세제곱미터당 금액(원) has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:23:46.974742
Analysis finished2023-12-12 22:23:47.436043
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct2
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size340.0 B
상수도
13 
하수도
13 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상수도
2nd row상수도
3rd row상수도
4th row상수도
5th row상수도

Common Values

ValueCountFrequency (%)
상수도 13
50.0%
하수도 13
50.0%

Length

2023-12-13T07:23:47.500170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:23:47.597506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상수도 13
50.0%
하수도 13
50.0%

업종
Categorical

Distinct4
Distinct (%)15.4%
Missing0
Missing (%)0.0%
Memory size340.0 B
가정용
10 
일반용
대중탕용
전용공업용

Length

Max length5
Median length3
Mean length3.3846154
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가정용
2nd row가정용
3rd row가정용
4th row가정용
5th row가정용

Common Values

ValueCountFrequency (%)
가정용 10
38.5%
일반용 8
30.8%
대중탕용 6
23.1%
전용공업용 2
 
7.7%

Length

2023-12-13T07:23:47.697243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:23:47.798491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가정용 10
38.5%
일반용 8
30.8%
대중탕용 6
23.1%
전용공업용 2
 
7.7%

단계
Categorical

Distinct6
Distinct (%)23.1%
Missing0
Missing (%)0.0%
Memory size340.0 B
1
2
3
4
5

Length

Max length4
Median length1
Mean length1.2307692
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row3
4th row4
5th row5

Common Values

ValueCountFrequency (%)
1 6
23.1%
2 6
23.1%
3 6
23.1%
4 4
15.4%
5 2
 
7.7%
<NA> 2
 
7.7%

Length

2023-12-13T07:23:47.929677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:23:48.051728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 6
23.1%
2 6
23.1%
3 6
23.1%
4 4
15.4%
5 2
 
7.7%
na 2
 
7.7%
Distinct14
Distinct (%)53.8%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T07:23:48.185888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length5.6923077
Min length4

Characters and Unicode

Total characters148
Distinct characters16
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)7.7%

Sample

1st row1~10
2nd row11~20
3rd row21~30
4th row31~40
5th row41 이상
ValueCountFrequency (%)
이상 6
18.2%
1~10 2
 
6.1%
11~20 2
 
6.1%
21~30 2
 
6.1%
31~40 2
 
6.1%
41 2
 
6.1%
1~50 2
 
6.1%
51~100 2
 
6.1%
101~300 2
 
6.1%
301 2
 
6.1%
Other values (6) 9
27.3%
2023-12-13T07:23:48.517667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 38
25.7%
1 37
25.0%
~ 18
12.2%
3 8
 
5.4%
5 8
 
5.4%
7
 
4.7%
6
 
4.1%
6
 
4.1%
2 4
 
2.7%
4 4
 
2.7%
Other values (6) 12
 
8.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 99
66.9%
Other Letter 24
 
16.2%
Math Symbol 18
 
12.2%
Space Separator 7
 
4.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
25.0%
6
25.0%
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%
Decimal Number
ValueCountFrequency (%)
0 38
38.4%
1 37
37.4%
3 8
 
8.1%
5 8
 
8.1%
2 4
 
4.0%
4 4
 
4.0%
Math Symbol
ValueCountFrequency (%)
~ 18
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 124
83.8%
Hangul 24
 
16.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 38
30.6%
1 37
29.8%
~ 18
14.5%
3 8
 
6.5%
5 8
 
6.5%
7
 
5.6%
2 4
 
3.2%
4 4
 
3.2%
Hangul
ValueCountFrequency (%)
6
25.0%
6
25.0%
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 124
83.8%
Hangul 24
 
16.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 38
30.6%
1 37
29.8%
~ 18
14.5%
3 8
 
6.5%
5 8
 
6.5%
7
 
5.6%
2 4
 
3.2%
4 4
 
3.2%
Hangul
ValueCountFrequency (%)
6
25.0%
6
25.0%
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%
2
 
8.3%

세제곱미터당 금액(원)
Real number (ℝ)

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean998.23077
Minimum141
Maximum4760
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2023-12-13T07:23:48.680816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum141
5-th percentile177
Q1327.5
median635
Q31265.5
95-th percentile3331.75
Maximum4760
Range4619
Interquartile range (IQR)938

Descriptive statistics

Standard deviation1097.4789
Coefficient of variation (CV)1.099424
Kurtosis5.5587346
Mean998.23077
Median Absolute Deviation (MAD)338
Skewness2.3118746
Sum25954
Variance1204459.9
MonotonicityNot monotonic
2023-12-13T07:23:48.820441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
260 1
 
3.8%
165 1
 
3.8%
253 1
 
3.8%
2467 1
 
3.8%
1533 1
 
3.8%
952 1
 
3.8%
801 1
 
3.8%
784 1
 
3.8%
483 1
 
3.8%
427 1
 
3.8%
Other values (16) 16
61.5%
ValueCountFrequency (%)
141 1
3.8%
165 1
3.8%
213 1
3.8%
253 1
3.8%
260 1
3.8%
284 1
3.8%
310 1
3.8%
380 1
3.8%
390 1
3.8%
427 1
3.8%
ValueCountFrequency (%)
4760 1
3.8%
3620 1
3.8%
2467 1
3.8%
1533 1
3.8%
1450 1
3.8%
1400 1
3.8%
1370 1
3.8%
952 1
3.8%
900 1
3.8%
850 1
3.8%

관리기관
Categorical

CONSTANT 

Distinct1
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size340.0 B
양구군 상하수도사업소
26 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양구군 상하수도사업소
2nd row양구군 상하수도사업소
3rd row양구군 상하수도사업소
4th row양구군 상하수도사업소
5th row양구군 상하수도사업소

Common Values

ValueCountFrequency (%)
양구군 상하수도사업소 26
100.0%

Length

2023-12-13T07:23:48.970687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:23:49.391463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양구군 26
50.0%
상하수도사업소 26
50.0%

연락처
Categorical

CONSTANT 

Distinct1
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size340.0 B
033-480-7812
26 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row033-480-7812
2nd row033-480-7812
3rd row033-480-7812
4th row033-480-7812
5th row033-480-7812

Common Values

ValueCountFrequency (%)
033-480-7812 26
100.0%

Length

2023-12-13T07:23:49.507564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:23:49.607507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
033-480-7812 26
100.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size340.0 B
Minimum2023-06-28 00:00:00
Maximum2023-06-28 00:00:00
2023-12-13T07:23:49.696285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:23:49.806944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T07:23:47.171850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:23:49.880496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분업종단계사용량(세제곱미터)세제곱미터당 금액(원)
구분1.0000.0000.0000.0000.245
업종0.0001.0000.0001.0000.528
단계0.0000.0001.0001.0000.000
사용량(세제곱미터)0.0001.0001.0001.0000.000
세제곱미터당 금액(원)0.2450.5280.0000.0001.000
2023-12-13T07:23:49.975100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분단계업종
구분1.0000.0000.000
단계0.0001.0000.000
업종0.0000.0001.000
2023-12-13T07:23:50.054839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세제곱미터당 금액(원)구분업종단계
세제곱미터당 금액(원)1.0000.2410.3700.000
구분0.2411.0000.0000.000
업종0.3700.0001.0000.000
단계0.0000.0000.0001.000

Missing values

2023-12-13T07:23:47.264976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:23:47.384198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분업종단계사용량(세제곱미터)세제곱미터당 금액(원)관리기관연락처데이터기준일자
0상수도가정용11~10260양구군 상하수도사업소033-480-78122023-06-28
1상수도가정용211~20310양구군 상하수도사업소033-480-78122023-06-28
2상수도가정용321~30390양구군 상하수도사업소033-480-78122023-06-28
3상수도가정용431~40520양구군 상하수도사업소033-480-78122023-06-28
4상수도가정용541 이상900양구군 상하수도사업소033-480-78122023-06-28
5상수도일반용11~50750양구군 상하수도사업소033-480-78122023-06-28
6상수도일반용251~100850양구군 상하수도사업소033-480-78122023-06-28
7상수도일반용3101~3001370양구군 상하수도사업소033-480-78122023-06-28
8상수도일반용4301 이상1400양구군 상하수도사업소033-480-78122023-06-28
9상수도대중탕용11~5001450양구군 상하수도사업소033-480-78122023-06-28
구분업종단계사용량(세제곱미터)세제곱미터당 금액(원)관리기관연락처데이터기준일자
16하수도가정용431~40284양구군 상하수도사업소033-480-78122023-06-28
17하수도가정용541 이상491양구군 상하수도사업소033-480-78122023-06-28
18하수도일반용11~50427양구군 상하수도사업소033-480-78122023-06-28
19하수도일반용251~100483양구군 상하수도사업소033-480-78122023-06-28
20하수도일반용3101~300784양구군 상하수도사업소033-480-78122023-06-28
21하수도일반용4301 이상801양구군 상하수도사업소033-480-78122023-06-28
22하수도대중탕용11~500952양구군 상하수도사업소033-480-78122023-06-28
23하수도대중탕용2501~10001533양구군 상하수도사업소033-480-78122023-06-28
24하수도대중탕용31001 이상2467양구군 상하수도사업소033-480-78122023-06-28
25하수도전용공업용<NA>세제곱미터당253양구군 상하수도사업소033-480-78122023-06-28