Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows325
Duplicate rows (%)3.2%
Total size in memory654.3 KiB
Average record size in memory67.0 B

Variable types

Numeric3
Categorical4

Dataset

Description충청남도 부여군 상하수도요금 고지 현황 정보입니다.(고지번호, 지역, 업종, 사용량, 사용료, 납기마감일, 데이터기준일자)
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=410&beforeMenuCd=DOM_000000201001001000&publicdatapk=15040581

Alerts

납기마감일 has constant value ""Constant
데이터기준일자 has constant value ""Constant
Dataset has 325 (3.2%) duplicate rowsDuplicates
수용가번호 is highly overall correlated with 지역High correlation
사용량 is highly overall correlated with 사용료High correlation
사용료 is highly overall correlated with 사용량High correlation
지역 is highly overall correlated with 수용가번호High correlation
업종 is highly imbalanced (76.6%)Imbalance
사용량 is highly skewed (γ1 = 49.09885872)Skewed
사용료 is highly skewed (γ1 = 32.69453164)Skewed
사용량 has 2223 (22.2%) zerosZeros

Reproduction

Analysis started2024-01-09 22:59:46.456831
Analysis finished2024-01-09 22:59:47.890540
Duration1.43 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

수용가번호
Real number (ℝ)

HIGH CORRELATION 

Distinct8241
Distinct (%)82.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.7418159 × 109
Minimum1.0110003 × 109
Maximum5.0111 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T07:59:47.965045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.0110003 × 109
5-th percentile1.0320003 × 109
Q11.1410186 × 109
median2.333011 × 109
Q31.3809 × 1010
95-th percentile1.8020001 × 1010
Maximum5.0111 × 1010
Range4.91 × 1010
Interquartile range (IQR)1.2667982 × 1010

Descriptive statistics

Standard deviation6.5138666 × 109
Coefficient of variation (CV)0.96618874
Kurtosis-1.038971
Mean6.7418159 × 109
Median Absolute Deviation (MAD)1.3009928 × 109
Skewness0.67123724
Sum6.7418159 × 1013
Variance4.2430458 × 1019
MonotonicityNot monotonic
2024-01-10T07:59:48.106520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1343010900 5
 
0.1%
1343008400 5
 
0.1%
1333016700 5
 
0.1%
2336003300 4
 
< 0.1%
1061002500 4
 
< 0.1%
2338012400 4
 
< 0.1%
2343005100 4
 
< 0.1%
2333011000 4
 
< 0.1%
3333008000 4
 
< 0.1%
1011003800 4
 
< 0.1%
Other values (8231) 9957
99.6%
ValueCountFrequency (%)
1011000300 2
< 0.1%
1011000800 1
 
< 0.1%
1011001400 1
 
< 0.1%
1011003700 1
 
< 0.1%
1011003800 4
< 0.1%
1011005200 1
 
< 0.1%
1011005300 1
 
< 0.1%
1011005500 1
 
< 0.1%
1011006100 1
 
< 0.1%
1011006200 1
 
< 0.1%
ValueCountFrequency (%)
50111000300 1
< 0.1%
20023004500 1
< 0.1%
20023004400 1
< 0.1%
20023004300 1
< 0.1%
20023004100 2
< 0.1%
20023003900 2
< 0.1%
20023003800 1
< 0.1%
20023003700 1
< 0.1%
20023003600 1
< 0.1%
20023003400 1
< 0.1%

지역
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
충청남도 부여군 부여읍
3910 
충청남도 부여군 규암면
1339 
충청남도 부여군 세도면
650 
충청남도 부여군 석성면
524 
충청남도 부여군 임천면
478 
Other values (10)
3099 

Length

Max length12
Median length12
Mean length11.9657
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청남도 부여군 규암면
2nd row충청남도 부여군 홍산면
3rd row충청남도 부여군 은산면
4th row충청남도 부여군 장암면
5th row충청남도 부여군 부여읍

Common Values

ValueCountFrequency (%)
충청남도 부여군 부여읍 3910
39.1%
충청남도 부여군 규암면 1339
 
13.4%
충청남도 부여군 세도면 650
 
6.5%
충청남도 부여군 석성면 524
 
5.2%
충청남도 부여군 임천면 478
 
4.8%
충청남도 부여군 장암면 426
 
4.3%
충청남도 부여군 초촌면 401
 
4.0%
충청남도 부여군 구룡면 389
 
3.9%
충청남도 부여군 홍산면 344
 
3.4%
충청남도 부여군 남면 343
 
3.4%
Other values (5) 1196
 
12.0%

Length

2024-01-10T07:59:48.238967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
충청남도 10000
33.3%
부여군 10000
33.3%
부여읍 3910
 
13.0%
규암면 1339
 
4.5%
세도면 650
 
2.2%
석성면 524
 
1.7%
임천면 478
 
1.6%
장암면 426
 
1.4%
초촌면 401
 
1.3%
구룡면 389
 
1.3%
Other values (7) 1883
 
6.3%

업종
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
가정용
8697 
일반용
1239 
일반겸용
 
35
일반(교육)용
 
21
대중탕용
 
6

Length

Max length9
Median length3
Mean length3.0137
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가정용
2nd row가정용
3rd row가정용
4th row가정용
5th row가정용

Common Values

ValueCountFrequency (%)
가정용 8697
87.0%
일반용 1239
 
12.4%
일반겸용 35
 
0.4%
일반(교육)용 21
 
0.2%
대중탕용 6
 
0.1%
일반용(가구분할) 2
 
< 0.1%

Length

2024-01-10T07:59:48.348825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:59:48.444449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가정용 8697
87.0%
일반용 1239
 
12.4%
일반겸용 35
 
0.4%
일반(교육)용 21
 
0.2%
대중탕용 6
 
0.1%
일반용(가구분할 2
 
< 0.1%

사용량
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct200
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18.4274
Minimum0
Maximum9730
Zeros2223
Zeros (%)22.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T07:59:48.561421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median10
Q319
95-th percentile44
Maximum9730
Range9730
Interquartile range (IQR)18

Descriptive statistics

Standard deviation137.76062
Coefficient of variation (CV)7.4758577
Kurtosis2929.7447
Mean18.4274
Median Absolute Deviation (MAD)9
Skewness49.098859
Sum184274
Variance18977.989
MonotonicityNot monotonic
2024-01-10T07:59:48.690146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2223
22.2%
1 361
 
3.6%
7 324
 
3.2%
5 317
 
3.2%
4 310
 
3.1%
8 298
 
3.0%
12 297
 
3.0%
3 293
 
2.9%
10 292
 
2.9%
6 291
 
2.9%
Other values (190) 4994
49.9%
ValueCountFrequency (%)
0 2223
22.2%
1 361
 
3.6%
2 279
 
2.8%
3 293
 
2.9%
4 310
 
3.1%
5 317
 
3.2%
6 291
 
2.9%
7 324
 
3.2%
8 298
 
3.0%
9 288
 
2.9%
ValueCountFrequency (%)
9730 1
< 0.1%
5492 1
< 0.1%
4475 1
< 0.1%
3657 1
< 0.1%
3549 1
< 0.1%
2011 1
< 0.1%
1821 1
< 0.1%
1694 1
< 0.1%
1121 1
< 0.1%
950 1
< 0.1%

사용료
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct2528
Distinct (%)25.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29327.207
Minimum0
Maximum11329780
Zeros29
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T07:59:48.819948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1140
Q13280
median11380
Q322860
95-th percentile72842.5
Maximum11329780
Range11329780
Interquartile range (IQR)19580

Descriptive statistics

Standard deviation196265.27
Coefficient of variation (CV)6.6922593
Kurtosis1424.6857
Mean29327.207
Median Absolute Deviation (MAD)9020
Skewness32.694532
Sum2.9327207 × 108
Variance3.8520057 × 1010
MonotonicityNot monotonic
2024-01-10T07:59:48.946629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1140 1373
 
13.7%
3280 127
 
1.3%
1640 116
 
1.2%
1230 74
 
0.7%
1950 74
 
0.7%
1360 73
 
0.7%
6890 65
 
0.7%
2720 64
 
0.6%
9190 61
 
0.6%
2290 59
 
0.6%
Other values (2518) 7914
79.1%
ValueCountFrequency (%)
0 29
0.3%
20 1
 
< 0.1%
170 1
 
< 0.1%
330 1
 
< 0.1%
680 1
 
< 0.1%
1080 2
 
< 0.1%
1090 1
 
< 0.1%
1100 1
 
< 0.1%
1110 2
 
< 0.1%
1120 24
0.2%
ValueCountFrequency (%)
11329780 1
< 0.1%
6153400 1
< 0.1%
5607550 1
< 0.1%
5046590 1
< 0.1%
4744110 1
< 0.1%
4251310 1
< 0.1%
3814000 1
< 0.1%
3715550 1
< 0.1%
3645090 1
< 0.1%
3642390 1
< 0.1%

납기마감일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2021-09-30
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-09-30
2nd row2021-09-30
3rd row2021-09-30
4th row2021-09-30
5th row2021-09-30

Common Values

ValueCountFrequency (%)
2021-09-30 10000
100.0%

Length

2024-01-10T07:59:49.066178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:59:49.144108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-09-30 10000
100.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022-08-31
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-08-31
2nd row2022-08-31
3rd row2022-08-31
4th row2022-08-31
5th row2022-08-31

Common Values

ValueCountFrequency (%)
2022-08-31 10000
100.0%

Length

2024-01-10T07:59:49.226724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:59:49.314595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-08-31 10000
100.0%

Interactions

2024-01-10T07:59:47.442472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:59:46.894060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:59:47.165599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:59:47.532958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:59:46.987147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:59:47.260138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:59:47.625182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:59:47.077406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:59:47.347781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:59:49.371482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수용가번호지역업종사용량사용료
수용가번호1.0000.9910.2030.0000.000
지역0.9911.0000.2170.0000.000
업종0.2030.2171.0000.0000.046
사용량0.0000.0000.0001.0000.973
사용료0.0000.0000.0460.9731.000
2024-01-10T07:59:49.455164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종지역
업종1.0000.103
지역0.1031.000
2024-01-10T07:59:49.540330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수용가번호사용량사용료지역업종
수용가번호1.000-0.136-0.2340.8650.138
사용량-0.1361.0000.9350.0000.000
사용료-0.2340.9351.0000.0000.027
지역0.8650.0000.0001.0000.103
업종0.1380.0000.0270.1031.000

Missing values

2024-01-10T07:59:47.737508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:59:47.840407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

수용가번호지역업종사용량사용료납기마감일데이터기준일자
759912343006000충청남도 부여군 규암면가정용064902021-09-302022-08-31
115977011006800충청남도 부여군 홍산면가정용345702021-09-302022-08-31
321563011024100충청남도 부여군 은산면가정용011402021-09-302022-08-31
5951113804002800충청남도 부여군 장암면가정용13115502021-09-302022-08-31
492591151026800충청남도 부여군 부여읍가정용11100502021-09-302022-08-31
8038012171000300충청남도 부여군 임천면일반용24319802021-09-302022-08-31
329326031006600충청남도 부여군 구룡면가정용13159102021-09-302022-08-31
1865216001012200충청남도 부여군 초촌면가정용568702021-09-302022-08-31
8588816034006700충청남도 부여군 초촌면가정용22194902021-09-302022-08-31
532182103007100충청남도 부여군 규암면가정용094602021-09-302022-08-31
수용가번호지역업종사용량사용료납기마감일데이터기준일자
334286046002200충청남도 부여군 구룡면가정용35351802021-09-302022-08-31
8831820023000100충청남도 부여군 내산면가정용011402021-09-302022-08-31
219181343005900충청남도 부여군 부여읍가정용016202021-09-302022-08-31
447611031002000충청남도 부여군 부여읍일반용30652702021-09-302022-08-31
4012115131003700충청남도 부여군 석성면가정용011402021-09-302022-08-31
233901041032500충청남도 부여군 부여읍가정용26330202021-09-302022-08-31
76182031006900충청남도 부여군 규암면가정용011402021-09-302022-08-31
8262714026001500충청남도 부여군 세도면가정용1118702021-09-302022-08-31
1961217004001600충청남도 부여군 남면가정용011402021-09-302022-08-31
244101071013000충청남도 부여군 부여읍가정용8102802021-09-302022-08-31

Duplicate rows

Most frequently occurring

수용가번호지역업종사용량사용료납기마감일데이터기준일자# duplicates
811343008400충청남도 부여군 부여읍가정용032802021-09-302022-08-314
871343010900충청남도 부여군 부여읍가정용032802021-09-302022-08-314
1552338009700충청남도 부여군 규암면가정용016402021-09-302022-08-314
1572338012400충청남도 부여군 규암면가정용065602021-09-302022-08-314
341151023100충청남도 부여군 부여읍가정용011402021-09-302022-08-313
551257001300충청남도 부여군 부여읍가정용011402021-09-302022-08-313
641333010200충청남도 부여군 부여읍가정용016402021-09-302022-08-313
711333021900충청남도 부여군 부여읍일반용002021-09-302022-08-313
881366000500충청남도 부여군 부여읍가정용032802021-09-302022-08-313
901366001200충청남도 부여군 부여읍가정용049202021-09-302022-08-313