Overview

Dataset statistics

Number of variables: 4
Number of observations: 47
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.7 KiB
Average record size in memory: 37.8 B

Variable types

DateTime: 1
Numeric: 3

Dataset

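Description: Annual tap water usage at the Incheon Terminal Division of Korea Gas Corporation (한국가스공사 인천기지본부), reported as usage volume (㎥) by purpose (facility use and office use).
Author: 한국가스공사 (Korea Gas Corporation)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15087687&srcSe=7661IVAWM27C61E190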

Alerts

설비용사용량 is highly overall correlated with 합계 (High correlation)
합계 is highly overall correlated with 설비용사용량 (High correlation)
사용 월 has unique values (Unique)
설비용사용량 has unique values (Unique)

Reproduction

Analysis started: 2024-04-06 09:45:45.257766
Analysis finished: 2024-04-06 09:45:47.085650
Duration: 1.83 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
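
A report of this kind can be regenerated roughly as follows. This is a minimal sketch, assuming the dataset has been downloaded as a CSV file; the helper name `build_report`, the file names, and the report title are hypothetical, and the settings stored in the original config.json are not reproduced here.

```python
def build_report(csv_path: str, html_path: str) -> None:
    """Profile a CSV file and write an HTML report (sketch, not the original run)."""
    # Imported lazily so this sketch can be loaded without the third-party
    # packages installed; both are required to actually build a report.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the month column is named "사용 월", as in this report.
    df = pd.read_csv(csv_path, parse_dates=["사용 월"])

    # Default profiling settings; the original config.json is unknown.
    profile = ProfileReport(df, title="Incheon water usage")
    profile.to_file(html_path)


# Hypothetical usage (file names are placeholders):
# build_report("incheon_water_usage.csv", "report.html")
```

Results may differ slightly from the figures in this report if the installed ydata-profiling version or its configuration differs from the ones listed above.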

Variables

사용 월 (month of use)
Date

UNIQUE 

Distinct: 47
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 508.0 B
Minimum: 2020-01-01 00:00:00
Maximum: 2023-11-01 00:00:00
Histogram with fixed size bins (bins=47)

설비용사용량 (facility water usage, ㎥)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 47
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14512.66
Minimum: 4300
Maximum: 50749
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 555.0 B

Quantile statistics

Minimum: 4300
5-th percentile: 6586.3
Q1: 7555.5
Median: 11389
Q3: 16205
95-th percentile: 45908.3
Maximum: 50749
Range: 46449
Interquartile range (IQR): 8649.5

Descriptive statistics

Standard deviation: 11237.996
Coefficient of variation (CV): 0.7743581
Kurtosis: 4.3167035
Mean: 14512.66
Median Absolute Deviation (MAD): 3890
Skewness: 2.2090685
Sum: 682095
Variance: 1.2629254 × 10^8
Monotonicity: Not monotonic
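
Several of the figures above are derived from others, so they can be cross-checked by hand. The sketch below recomputes the mean, coefficient of variation, variance, range, and IQR for 설비용사용량 from the values reported in this section. The variable names are illustrative, and the tolerance only allows for the rounding in the report.

```python
import math

# Values copied from the report for 설비용사용량.
n = 47                      # number of observations
total = 682095              # Sum
mean = 14512.66             # Mean
std = 11237.996             # Standard deviation
q1, q3 = 7555.5, 16205      # Q1 and Q3
minimum, maximum = 4300, 50749

derived = {
    "mean": total / n,           # Sum / number of observations
    "cv": std / mean,            # coefficient of variation
    "variance": std ** 2,        # square of the standard deviation
    "range": maximum - minimum,
    "iqr": q3 - q1,
}

reported = {
    "mean": 14512.66,
    "cv": 0.7743581,
    "variance": 1.2629254e8,
    "range": 46449,
    "iqr": 8649.5,
}

for name, value in derived.items():
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.8g} reported={reported[name]:.8g} match={ok}")
```

The same relations (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1) apply to the other numeric variables in this report.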
Histogram with fixed size bins (bins=47)
Common values
Value | Count | Frequency (%)
8729 | 1 | 2.1%
7100 | 1 | 2.1%
17489 | 1 | 2.1%
12302 | 1 | 2.1%
42829 | 1 | 2.1%
50749 | 1 | 2.1%
13526 | 1 | 2.1%
8708 | 1 | 2.1%
10211 | 1 | 2.1%
17287 | 1 | 2.1%
Other values (37) | 37 | 78.7%
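
Each percentage in the value counts above is the count divided by the 47 observations, rounded to one decimal place. A minimal sketch of that arithmetic (the helper name `share` is illustrative):

```python
N_OBSERVATIONS = 47  # "Number of observations" from the Overview


def share(count: int, total: int = N_OBSERVATIONS) -> str:
    """Format count / total as a percentage with one decimal place."""
    return f"{100 * count / total:.1f}%"


print(share(1))   # a value that occurs once
print(share(37))  # the "Other values (37)" row
```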

Minimum 10 values
Value | Count | Frequency (%)
4300 | 1 | 2.1%
5927 | 1 | 2.1%
6580 | 1 | 2.1%
6601 | 1 | 2.1%
6696 | 1 | 2.1%
6744 | 1 | 2.1%
6810 | 1 | 2.1%
6883 | 1 | 2.1%
7100 | 1 | 2.1%
7155 | 1 | 2.1%

Maximum 10 values
Value | Count | Frequency (%)
50749 | 1 | 2.1%
47718 | 1 | 2.1%
47228 | 1 | 2.1%
42829 | 1 | 2.1%
29044 | 1 | 2.1%
23924 | 1 | 2.1%
20933 | 1 | 2.1%
18032 | 1 | 2.1%
17489 | 1 | 2.1%
17287 | 1 | 2.1%

사무용사용량 (office water usage, ㎥)
Real number (ℝ)

Distinct: 46
Distinct (%): 97.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3242.5319
Minimum: 1813
Maximum: 6598
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 555.0 B

Quantile statistics

Minimum: 1813
5-th percentile: 1856.9
Q1: 2479
Median: 2950
Q3: 3511.5
95-th percentile: 5826.9
Maximum: 6598
Range: 4785
Interquartile range (IQR): 1032.5

Descriptive statistics

Standard deviation: 1186.2417
Coefficient of variation (CV): 0.3658381
Kurtosis: 1.1366676
Mean: 3242.5319
Median Absolute Deviation (MAD): 560
Skewness: 1.2931464
Sum: 152399
Variance: 1407169.4
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=46)
Common values
Value | Count | Frequency (%)
1813 | 2 | 4.3%
2567 | 1 | 2.1%
6073 | 1 | 2.1%
3709 | 1 | 2.1%
1922 | 1 | 2.1%
2248 | 1 | 2.1%
4539 | 1 | 2.1%
4284 | 1 | 2.1%
4482 | 1 | 2.1%
4146 | 1 | 2.1%
Other values (36) | 36 | 76.6%

Minimum 10 values
Value | Count | Frequency (%)
1813 | 2 | 4.3%
1829 | 1 | 2.1%
1922 | 1 | 2.1%
2020 | 1 | 2.1%
2140 | 1 | 2.1%
2183 | 1 | 2.1%
2238 | 1 | 2.1%
2248 | 1 | 2.1%
2351 | 1 | 2.1%
2381 | 1 | 2.1%

Maximum 10 values
Value | Count | Frequency (%)
6598 | 1 | 2.1%
6073 | 1 | 2.1%
5835 | 1 | 2.1%
5808 | 1 | 2.1%
5658 | 1 | 2.1%
4539 | 1 | 2.1%
4482 | 1 | 2.1%
4337 | 1 | 2.1%
4284 | 1 | 2.1%
4146 | 1 | 2.1%

합계 (total, ㎥)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 46
Distinct (%): 97.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17755.191
Minimum: 7321
Maximum: 52997
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 555.0 B

Quantile statistics

Minimum: 7321
5-th percentile: 8920.9
Q1: 10116
Median: 14582
Q3: 19649
95-th percentile: 47765.2
Maximum: 52997
Range: 45676
Interquartile range (IQR): 9533

Descriptive statistics

Standard deviation: 11158.821
Coefficient of variation (CV): 0.62848217
Kurtosis: 3.5215898
Mean: 17755.191
Median Absolute Deviation (MAD): 4898
Skewness: 1.994658
Sum: 834494
Variance: 1.2451929 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=46)
Common values
Value | Count | Frequency (%)
9682 | 2 | 4.3%
11296 | 1 | 2.1%
27006 | 1 | 2.1%
16011 | 1 | 2.1%
44751 | 1 | 2.1%
52997 | 1 | 2.1%
18065 | 1 | 2.1%
12992 | 1 | 2.1%
14693 | 1 | 2.1%
21433 | 1 | 2.1%
Other values (36) | 36 | 76.6%

Minimum 10 values
Value | Count | Frequency (%)
7321 | 1 | 2.1%
7740 | 1 | 2.1%
8716 | 1 | 2.1%
9399 | 1 | 2.1%
9438 | 1 | 2.1%
9451 | 1 | 2.1%
9478 | 1 | 2.1%
9500 | 1 | 2.1%
9623 | 1 | 2.1%
9682 | 2 | 4.3%

Maximum 10 values
Value | Count | Frequency (%)
52997 | 1 | 2.1%
49531 | 1 | 2.1%
49057 | 1 | 2.1%
44751 | 1 | 2.1%
33381 | 1 | 2.1%
27184 | 1 | 2.1%
27006 | 1 | 2.1%
23867 | 1 | 2.1%
21433 | 1 | 2.1%
20993 | 1 | 2.1%

Interactions

[Figure: 9 interaction plots, images not preserved]

Correlations

Variable | 사용 월 | 설비용사용량 | 사무용사용량 | 합계
사용 월 | 1.000 | 1.000 | 1.000 | 1.000
설비용사용량 | 1.000 | 1.000 | 0.630 | 0.992
사무용사용량 | 1.000 | 0.630 | 1.000 | 0.780
합계 | 1.000 | 0.992 | 0.780 | 1.000

Variable | 설비용사용량 | 사무용사용량 | 합계
설비용사용량 | 1.000 | 0.226 | 0.983
사무용사용량 | 0.226 | 1.000 | 0.325
합계 | 0.983 | 0.325 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 사용 월 | 설비용사용량 | 사무용사용량 | 합계
0 | 2020-01-01 00:00 | 8729 | 2567 | 11296
1 | 2020-02-01 00:00 | 7100 | 2351 | 9451
2 | 2020-03-01 00:00 | 5927 | 1813 | 7740
3 | 2020-04-01 00:00 | 7155 | 2468 | 9623
4 | 2020-05-01 00:00 | 7525 | 2381 | 9906
5 | 2020-06-01 00:00 | 6744 | 2655 | 9399
6 | 2020-07-01 00:00 | 4300 | 3021 | 7321
7 | 2020-08-01 00:00 | 6580 | 2898 | 9478
8 | 2020-09-01 00:00 | 6601 | 3081 | 9682
9 | 2020-10-01 00:00 | 6883 | 2555 | 9438

Last rows
Index | 사용 월 | 설비용사용량 | 사무용사용량 | 합계
37 | 2023-02-01 00:00 | 12882 | 6598 | 19480
38 | 2023-03-01 00:00 | 11400 | 5808 | 17208
39 | 2023-04-01 00:00 | 8361 | 5658 | 14019
40 | 2023-05-01 00:00 | 8786 | 2140 | 10926
41 | 2023-06-01 00:00 | 10327 | 2950 | 13277
42 | 2023-07-01 00:00 | 6810 | 2690 | 9500
43 | 2023-08-01 00:00 | 7701 | 3470 | 11171
44 | 2023-09-01 00:00 | 7586 | 2740 | 10326
45 | 2023-10-01 00:00 | 6696 | 2020 | 8716
46 | 2023-11-01 00:00 | 11389 | 3020 | 14409
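
One plausible reading of the high-correlation alert is that 합계 is simply 설비용사용량 + 사무용사용량, so it moves closely with its larger component. The sketch below checks that identity on the 20 sample rows listed above and computes a Pearson correlation on them. It uses only these 20 rows, not the full 47, so its coefficients will differ from the matrices in the Correlations section; the function name `pearson` is illustrative.

```python
import math

# (설비용사용량, 사무용사용량, 합계) for the 20 sample rows shown above.
ROWS = [
    (8729, 2567, 11296), (7100, 2351, 9451), (5927, 1813, 7740),
    (7155, 2468, 9623), (7525, 2381, 9906), (6744, 2655, 9399),
    (4300, 3021, 7321), (6580, 2898, 9478), (6601, 3081, 9682),
    (6883, 2555, 9438), (12882, 6598, 19480), (11400, 5808, 17208),
    (8361, 5658, 14019), (8786, 2140, 10926), (10327, 2950, 13277),
    (6810, 2690, 9500), (7701, 3470, 11171), (7586, 2740, 10326),
    (6696, 2020, 8716), (11389, 3020, 14409),
]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


facility = [row[0] for row in ROWS]
office = [row[1] for row in ROWS]
total = [row[2] for row in ROWS]

# Check that every sampled total is the sum of the two usage columns.
sums_match = all(f + o == t for f, o, t in ROWS)
print("합계 == 설비용사용량 + 사무용사용량:", sums_match)
print("r(설비용사용량, 합계):", round(pearson(facility, total), 3))
print("r(사무용사용량, 합계):", round(pearson(office, total), 3))
```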