Overview

Dataset statistics

Number of variables4
Number of observations46
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory37.9 B

Variable types

DateTime1
Numeric3

Dataset

Description한국가스공사 인천기지본부의 연간 상수도 사용량으로, 용도별(설비용, 사무용) 상수도 사용량(㎥)에 대한 정보를 나타냄
Author한국가스공사
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15087687&srcSe=7661IVAWM27C61E190

Alerts

설비용사용량 is highly overall correlated with 합계High correlation
합계 is highly overall correlated with 설비용사용량High correlation
사용 월 has unique valuesUnique
설비용사용량 has unique valuesUnique

Reproduction

Analysis started2024-04-06 09:45:58.747508
Analysis finished2024-04-06 09:46:00.462439
Duration1.71 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사용 월
Date

UNIQUE 

Distinct46
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size500.0 B
Minimum2020-01-01 00:00:00
Maximum2023-10-01 00:00:00
2024-04-06T18:46:00.569384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:46:00.777170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)

설비용사용량
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct46
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14580.565
Minimum4300
Maximum50749
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size546.0 B
2024-04-06T18:46:00.982981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4300
5-th percentile6585.25
Q17540.25
median11209.5
Q316595
95-th percentile46128.25
Maximum50749
Range46449
Interquartile range (IQR)9054.75

Descriptive statistics

Standard deviation11352.423
Coefficient of variation (CV)0.77859964
Kurtosis4.1378713
Mean14580.565
Median Absolute Deviation (MAD)3882.5
Skewness2.174403
Sum670706
Variance1.288775 × 108
MonotonicityNot monotonic
2024-04-06T18:46:01.194052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)
ValueCountFrequency (%)
8729 1
 
2.2%
18032 1
 
2.2%
17489 1
 
2.2%
12302 1
 
2.2%
42829 1
 
2.2%
50749 1
 
2.2%
13526 1
 
2.2%
8708 1
 
2.2%
10211 1
 
2.2%
17287 1
 
2.2%
Other values (36) 36
78.3%
ValueCountFrequency (%)
4300 1
2.2%
5927 1
2.2%
6580 1
2.2%
6601 1
2.2%
6696 1
2.2%
6744 1
2.2%
6810 1
2.2%
6883 1
2.2%
7100 1
2.2%
7155 1
2.2%
ValueCountFrequency (%)
50749 1
2.2%
47718 1
2.2%
47228 1
2.2%
42829 1
2.2%
29044 1
2.2%
23924 1
2.2%
20933 1
2.2%
18032 1
2.2%
17489 1
2.2%
17287 1
2.2%

사무용사용량
Real number (ℝ)

Distinct45
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3247.3696
Minimum1813
Maximum6598
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size546.0 B
2024-04-06T18:46:01.744628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1813
5-th percentile1852.25
Q12473.5
median2924
Q33512.25
95-th percentile5828.25
Maximum6598
Range4785
Interquartile range (IQR)1038.75

Descriptive statistics

Standard deviation1198.8809
Coefficient of variation (CV)0.36918524
Kurtosis1.0339177
Mean3247.3696
Median Absolute Deviation (MAD)576.5
Skewness1.2692456
Sum149379
Variance1437315.4
MonotonicityNot monotonic
2024-04-06T18:46:01.994915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
1813 2
 
4.3%
2567 1
 
2.2%
6073 1
 
2.2%
3709 1
 
2.2%
1922 1
 
2.2%
2248 1
 
2.2%
4539 1
 
2.2%
4284 1
 
2.2%
4482 1
 
2.2%
4146 1
 
2.2%
Other values (35) 35
76.1%
ValueCountFrequency (%)
1813 2
4.3%
1829 1
2.2%
1922 1
2.2%
2020 1
2.2%
2140 1
2.2%
2183 1
2.2%
2238 1
2.2%
2248 1
2.2%
2351 1
2.2%
2381 1
2.2%
ValueCountFrequency (%)
6598 1
2.2%
6073 1
2.2%
5835 1
2.2%
5808 1
2.2%
5658 1
2.2%
4539 1
2.2%
4482 1
2.2%
4337 1
2.2%
4284 1
2.2%
4146 1
2.2%

합계
Real number (ℝ)

HIGH CORRELATION 

Distinct45
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17827.935
Minimum7321
Maximum52997
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size546.0 B
2024-04-06T18:46:02.206241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7321
5-th percentile8886.75
Q110011
median14596.5
Q319733.5
95-th percentile47980.5
Maximum52997
Range45676
Interquartile range (IQR)9722.5

Descriptive statistics

Standard deviation11270.854
Coefficient of variation (CV)0.6322019
Kurtosis3.360267
Mean17827.935
Median Absolute Deviation (MAD)4899
Skewness1.9610622
Sum820085
Variance1.2703216 × 108
MonotonicityNot monotonic
2024-04-06T18:46:02.419465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
9682 2
 
4.3%
11296 1
 
2.2%
27006 1
 
2.2%
16011 1
 
2.2%
44751 1
 
2.2%
52997 1
 
2.2%
18065 1
 
2.2%
12992 1
 
2.2%
14693 1
 
2.2%
21433 1
 
2.2%
Other values (35) 35
76.1%
ValueCountFrequency (%)
7321 1
2.2%
7740 1
2.2%
8716 1
2.2%
9399 1
2.2%
9438 1
2.2%
9451 1
2.2%
9478 1
2.2%
9500 1
2.2%
9623 1
2.2%
9682 2
4.3%
ValueCountFrequency (%)
52997 1
2.2%
49531 1
2.2%
49057 1
2.2%
44751 1
2.2%
33381 1
2.2%
27184 1
2.2%
27006 1
2.2%
23867 1
2.2%
21433 1
2.2%
20993 1
2.2%

Interactions

2024-04-06T18:45:59.726099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:45:58.933326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:45:59.343616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:45:59.969012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:45:59.067986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:45:59.465840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:46:00.094795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:45:59.212987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T18:45:59.593283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T18:46:02.576170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용 월설비용사용량사무용사용량합계
사용 월1.0001.0001.0001.000
설비용사용량1.0001.0000.6050.992
사무용사용량1.0000.6051.0000.768
합계1.0000.9920.7681.000
2024-04-06T18:46:02.746288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설비용사용량사무용사용량합계
설비용사용량1.0000.2210.982
사무용사용량0.2211.0000.322
합계0.9820.3221.000

Missing values

2024-04-06T18:46:00.245224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T18:46:00.405903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사용 월설비용사용량사무용사용량합계
02020-01-01 00:008729256711296
12020-02-01 00:00710023519451
22020-03-01 00:00592718137740
32020-04-01 00:00715524689623
42020-05-01 00:00752523819906
52020-06-01 00:00674426559399
62020-07-01 00:00430030217321
72020-08-01 00:00658028989478
82020-09-01 00:00660130819682
92020-10-01 00:00688325559438
사용 월설비용사용량사무용사용량합계
362023-01-01 00:0020933607327006
372023-02-01 00:0012882659819480
382023-03-01 00:0011400580817208
392023-04-01 00:008361565814019
402023-05-01 00:008786214010926
412023-06-01 00:0010327295013277
422023-07-01 00:00681026909500
432023-08-01 00:007701347011171
442023-09-01 00:007586274010326
452023-10-01 00:00669620208716