Overview

Dataset statistics

Number of variables8
Number of observations25
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory71.3 B

Variable types

Categorical6
Text1
Numeric1

Dataset

Description상수도 하수도의 구분 , 업종, 단계 , 사용량, 범위당 금액 , 관리기관, 연락처, 데이터기준일자등을 알수 있습니다.
Author경기도 이천시
URLhttps://www.data.go.kr/data/15093685/fileData.do

Alerts

관리기관 has constant value ""Constant
데이터기준일자 has constant value ""Constant
구분 is highly overall correlated with 연락처High correlation
연락처 is highly overall correlated with 구분High correlation
세제곱미터당 금액(원) has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:58:34.458198
Analysis finished2023-12-11 23:58:35.033314
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
하수도
13 
상수도
12 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상수도
2nd row상수도
3rd row상수도
4th row상수도
5th row상수도

Common Values

ValueCountFrequency (%)
하수도 13
52.0%
상수도 12
48.0%

Length

2023-12-12T08:58:35.091612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:58:35.173503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
하수도 13
52.0%
상수도 12
48.0%

업종
Categorical

Distinct4
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
일반용
10 
대중탕용
가정용
산업용
 
1

Length

Max length4
Median length3
Mean length3.32
Min length3

Unique

Unique1 ?
Unique (%)4.0%

Sample

1st row가정용
2nd row가정용
3rd row가정용
4th row일반용
5th row일반용

Common Values

ValueCountFrequency (%)
일반용 10
40.0%
대중탕용 8
32.0%
가정용 6
24.0%
산업용 1
 
4.0%

Length

2023-12-12T08:58:35.265073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:58:35.363011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반용 10
40.0%
대중탕용 8
32.0%
가정용 6
24.0%
산업용 1
 
4.0%

단계
Categorical

Distinct6
Distinct (%)24.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
1
2
3
4
5

Length

Max length4
Median length1
Mean length1.12
Min length1

Unique

Unique1 ?
Unique (%)4.0%

Sample

1st row1
2nd row2
3rd row3
4th row1
5th row2

Common Values

ValueCountFrequency (%)
1 6
24.0%
2 6
24.0%
3 6
24.0%
4 4
16.0%
5 2
 
8.0%
<NA> 1
 
4.0%

Length

2023-12-12T08:58:35.468341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:58:35.564464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 6
24.0%
2 6
24.0%
3 6
24.0%
4 4
16.0%
5 2
 
8.0%
na 1
 
4.0%
Distinct13
Distinct (%)52.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-12T08:58:35.721543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length5.84
Min length4

Characters and Unicode

Total characters146
Distinct characters13
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)4.0%

Sample

1st row0~20
2nd row21~30
3rd row31이상
4th row0~50
5th row51~100
ValueCountFrequency (%)
0~20 2
 
7.4%
21~30 2
 
7.4%
31이상 2
 
7.4%
0~50 2
 
7.4%
51~100 2
 
7.4%
101~300 2
 
7.4%
301~500 2
 
7.4%
501 2
 
7.4%
이상 2
 
7.4%
0~500 2
 
7.4%
Other values (4) 7
25.9%
2023-12-12T08:58:36.003302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 52
35.6%
1 30
20.5%
~ 18
 
12.3%
5 16
 
11.0%
3 8
 
5.5%
6
 
4.1%
6
 
4.1%
2 4
 
2.7%
2
 
1.4%
1
 
0.7%
Other values (3) 3
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 110
75.3%
Math Symbol 18
 
12.3%
Other Letter 16
 
11.0%
Space Separator 2
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
37.5%
6
37.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Decimal Number
ValueCountFrequency (%)
0 52
47.3%
1 30
27.3%
5 16
 
14.5%
3 8
 
7.3%
2 4
 
3.6%
Math Symbol
ValueCountFrequency (%)
~ 18
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 130
89.0%
Hangul 16
 
11.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 52
40.0%
1 30
23.1%
~ 18
 
13.8%
5 16
 
12.3%
3 8
 
6.2%
2 4
 
3.1%
2
 
1.5%
Hangul
ValueCountFrequency (%)
6
37.5%
6
37.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 130
89.0%
Hangul 16
 
11.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 52
40.0%
1 30
23.1%
~ 18
 
13.8%
5 16
 
12.3%
3 8
 
6.2%
2 4
 
3.1%
2
 
1.5%
Hangul
ValueCountFrequency (%)
6
37.5%
6
37.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%

세제곱미터당 금액(원)
Real number (ℝ)

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1415.72
Minimum554
Maximum2212
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size357.0 B
2023-12-12T08:58:36.122142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum554
5-th percentile649
Q1890
median1453
Q31881
95-th percentile2160
Maximum2212
Range1658
Interquartile range (IQR)991

Descriptive statistics

Standard deviation538.20779
Coefficient of variation (CV)0.38016542
Kurtosis-1.4869548
Mean1415.72
Median Absolute Deviation (MAD)507
Skewness-0.035228538
Sum35393
Variance289667.63
MonotonicityNot monotonic
2023-12-12T08:58:36.235309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
890 1
 
4.0%
1200 1
 
4.0%
851 1
 
4.0%
1174 1
 
4.0%
1030 1
 
4.0%
881 1
 
4.0%
733 1
 
4.0%
2212 1
 
4.0%
1881 1
 
4.0%
1750 1
 
4.0%
Other values (15) 15
60.0%
ValueCountFrequency (%)
554 1
4.0%
628 1
4.0%
733 1
4.0%
851 1
4.0%
864 1
4.0%
881 1
4.0%
890 1
4.0%
942 1
4.0%
1030 1
4.0%
1174 1
4.0%
ValueCountFrequency (%)
2212 1
4.0%
2170 1
4.0%
2120 1
4.0%
2070 1
4.0%
1960 1
4.0%
1920 1
4.0%
1881 1
4.0%
1850 1
4.0%
1840 1
4.0%
1750 1
4.0%

관리기관
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
경기도 이천시청
25 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 이천시청
2nd row경기도 이천시청
3rd row경기도 이천시청
4th row경기도 이천시청
5th row경기도 이천시청

Common Values

ValueCountFrequency (%)
경기도 이천시청 25
100.0%

Length

2023-12-12T08:58:36.365414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:58:36.449524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 25
50.0%
이천시청 25
50.0%

연락처
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
031-644-4217
15 
031-644-4277
10 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row031-644-4217
2nd row031-644-4217
3rd row031-644-4217
4th row031-644-4217
5th row031-644-4217

Common Values

ValueCountFrequency (%)
031-644-4217 15
60.0%
031-644-4277 10
40.0%

Length

2023-12-12T08:58:36.547589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:58:36.632168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
031-644-4217 15
60.0%
031-644-4277 10
40.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-10-28
25 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-28
2nd row2023-10-28
3rd row2023-10-28
4th row2023-10-28
5th row2023-10-28

Common Values

ValueCountFrequency (%)
2023-10-28 25
100.0%

Length

2023-12-12T08:58:36.732360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:58:36.823652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-28 25
100.0%

Interactions

2023-12-12T08:58:34.726568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:58:36.883988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분업종단계사용량(세제곱미터)세제곱미터당 금액(원)연락처
구분1.0000.0000.0000.0000.0000.882
업종0.0001.0000.0001.0000.0000.000
단계0.0000.0001.0001.0000.5860.000
사용량(세제곱미터)0.0001.0001.0001.0000.2280.000
세제곱미터당 금액(원)0.0000.0000.5860.2281.0000.000
연락처0.8820.0000.0000.0000.0001.000
2023-12-12T08:58:37.014632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종연락처단계구분
업종1.0000.0000.0000.000
연락처0.0001.0000.0000.687
단계0.0000.0001.0000.000
구분0.0000.6870.0001.000
2023-12-12T08:58:37.108986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세제곱미터당 금액(원)구분업종단계연락처
세제곱미터당 금액(원)1.0000.1580.0000.1630.000
구분0.1581.0000.0000.0000.687
업종0.0000.0001.0000.0000.000
단계0.1630.0000.0001.0000.000
연락처0.0000.6870.0000.0001.000

Missing values

2023-12-12T08:58:34.873854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:58:34.990480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분업종단계사용량(세제곱미터)세제곱미터당 금액(원)관리기관연락처데이터기준일자
0상수도가정용10~20890경기도 이천시청031-644-42172023-10-28
1상수도가정용221~301200경기도 이천시청031-644-42172023-10-28
2상수도가정용331이상1650경기도 이천시청031-644-42172023-10-28
3상수도일반용10~501500경기도 이천시청031-644-42172023-10-28
4상수도일반용251~1001850경기도 이천시청031-644-42172023-10-28
5상수도일반용3101~3001960경기도 이천시청031-644-42172023-10-28
6상수도일반용4301~5002070경기도 이천시청031-644-42172023-10-28
7상수도일반용5501 이상2170경기도 이천시청031-644-42172023-10-28
8상수도대중탕용10~5001270경기도 이천시청031-644-42172023-10-28
9상수도대중탕용2501~10001840경기도 이천시청031-644-42172023-10-28
구분업종단계사용량(세제곱미터)세제곱미터당 금액(원)관리기관연락처데이터기준일자
15하수도일반용10~50942경기도 이천시청031-644-42772023-10-28
16하수도일반용251~1001453경기도 이천시청031-644-42772023-10-28
17하수도일반용3101~3001750경기도 이천시청031-644-42772023-10-28
18하수도일반용4301~5001881경기도 이천시청031-644-42772023-10-28
19하수도일반용5501 이상2212경기도 이천시청031-644-42772023-10-28
20하수도대중탕용10~500733경기도 이천시청031-644-42172023-10-28
21하수도대중탕용2501~1000881경기도 이천시청031-644-42772023-10-28
22하수도대중탕용31001~15001030경기도 이천시청031-644-42772023-10-28
23하수도대중탕용41501이상1174경기도 이천시청031-644-42172023-10-28
24하수도산업용<NA>단일구간851경기도 이천시청031-644-42172023-10-28