Overview

Dataset statistics

Number of variables: 4
Number of observations: 49
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.8 KiB
Average record size in memory: 37.7 B

Variable types

DateTime: 1
Numeric: 3

Dataset

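Description: Annual water supply usage at the Korea Gas Corporation (한국가스공사) Incheon Terminal Division (인천기지본부), giving water usage in ㎥ by purpose (equipment use, office use).
Author: 한국가스공사 (Korea Gas Corporation)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15087687&srcSe=7661IVAWM27C61E190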

Alerts

설비용사용량 is highly overall correlated with 합계 (High correlation)
합계 is highly overall correlated with 설비용사용량 (High correlation)
사용 월 has unique values (Unique)
설비용사용량 has unique values (Unique)

Reproduction

Analysis started: 2024-04-06 09:46:03.425407
Analysis finished: 2024-04-06 09:46:04.982119
Duration: 1.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
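
The reported duration follows directly from the two timestamps above. A minimal sketch using only the Python standard library (the variable names are illustrative and not part of the report):

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of the report.
started = datetime.fromisoformat("2024-04-06 09:46:03.425407")
finished = datetime.fromisoformat("2024-04-06 09:46:04.982119")

# Elapsed wall-clock time of the profiling run, in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} s")
```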

Variables

사용 월 (month of use)
Date

UNIQUE 

Distinct: 49
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B
Minimum: 2020-01-01 00:00:00
Maximum: 2024-01-01 00:00:00
Histogram with fixed size bins (bins=49)

설비용사용량 (equipment-use water usage)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 49
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14490.653
Minimum: 4300
Maximum: 50749
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 573.0 B

Quantile statistics

Minimum: 4300
5th percentile: 6588.4
Q1: 7586
Median: 11400
Q3: 16108
95th percentile: 45468.4
Maximum: 50749
Range: 46449
Interquartile range (IQR): 8522
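
The last two rows are derived from the others: the range is maximum minus minimum, and the IQR is Q3 minus Q1. A small sketch that recomputes both from the values listed above (variable names are illustrative):

```python
# Quantile statistics for 설비용사용량, copied from the report.
minimum, q1, median, q3, maximum = 4300, 7586, 11400, 16108, 50749

# Spread of the full data set.
value_range = maximum - minimum

# Spread of the middle 50% of observations.
iqr = q3 - q1

print(value_range, iqr)
```

The median (11400) sits much closer to Q1 than to the maximum, which is consistent with the strong right skew reported under the descriptive statistics.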

Descriptive statistics

Standard deviation: 11010.532
Coefficient of variation (CV): 0.75983682
Kurtosis: 4.6039139
Mean: 14490.653
Median Absolute Deviation (MAD): 3901
Skewness: 2.2528887
Sum: 710042
Variance: 1.2123181 × 10^8
Monotonicity: Not monotonic
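
Several of these figures can be cross-checked against each other: the mean is the sum divided by the number of observations, the CV is the standard deviation divided by the mean, and the variance is the standard deviation squared. A sketch of that arithmetic, assuming the 49 observations stated in the overview (results differ from the report only by rounding of the inputs):

```python
# Values copied from the report for 설비용사용량.
n_observations = 49
total = 710042
std_dev = 11010.532

mean = total / n_observations   # arithmetic mean
cv = std_dev / mean             # coefficient of variation
variance = std_dev ** 2         # variance from the standard deviation

print(round(mean, 3), round(cv, 6), f"{variance:.4e}")
```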
Histogram with fixed size bins (bins=49)
Common values

Value              Count  Frequency (%)
8729               1      2.0%
12882              1      2.0%
12302              1      2.0%
42829              1      2.0%
50749              1      2.0%
13526              1      2.0%
8708               1      2.0%
10211              1      2.0%
17287              1      2.0%
8323               1      2.0%
Other values (39)  39     79.6%
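
Each frequency in this table is the row's count divided by the 49 observations, rounded to one decimal place. A minimal sketch (the helper name `frequency_pct` is hypothetical, not a ydata-profiling function):

```python
N_OBSERVATIONS = 49  # number of observations stated in the overview


def frequency_pct(count: int, n: int = N_OBSERVATIONS) -> float:
    """Return the share of `count` among `n` rows as a percentage, 1 decimal."""
    return round(100 * count / n, 1)


# A single unique value, and the 39 values grouped under "Other values".
print(frequency_pct(1), frequency_pct(39))
```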

Minimum 10 values

Value  Count  Frequency (%)
4300   1      2.0%
5927   1      2.0%
6580   1      2.0%
6601   1      2.0%
6696   1      2.0%
6744   1      2.0%
6810   1      2.0%
6883   1      2.0%
7100   1      2.0%
7155   1      2.0%

Maximum 10 values

Value  Count  Frequency (%)
50749  1      2.0%
47718  1      2.0%
47228  1      2.0%
42829  1      2.0%
29044  1      2.0%
23924  1      2.0%
20933  1      2.0%
18032  1      2.0%
17489  1      2.0%
17287  1      2.0%

사무용사용량 (office-use water usage)
Real number (ℝ)

Distinct: 48
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3244.0612
Minimum: 1813
Maximum: 6598
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 573.0 B

Quantile statistics

Minimum: 1813
5th percentile: 1866.2
Q1: 2490
Median: 2950
Q3: 3513
95th percentile: 5824.2
Maximum: 6598
Range: 4785
Interquartile range (IQR): 1023

Descriptive statistics

Standard deviation: 1166.7037
Coefficient of variation (CV): 0.35964295
Kurtosis: 1.2214545
Mean: 3244.0612
Median Absolute Deviation (MAD): 560
Skewness: 1.2970462
Sum: 158959
Variance: 1361197.6
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=48)
Common values

Value              Count  Frequency (%)
1813               2      4.1%
2567               1      2.0%
5808               1      2.0%
1922               1      2.0%
2248               1      2.0%
4539               1      2.0%
4284               1      2.0%
4482               1      2.0%
4146               1      2.0%
3513               1      2.0%
Other values (38)  38     77.6%

Minimum 10 values

Value  Count  Frequency (%)
1813   2      4.1%
1829   1      2.0%
1922   1      2.0%
2020   1      2.0%
2140   1      2.0%
2183   1      2.0%
2238   1      2.0%
2248   1      2.0%
2351   1      2.0%
2381   1      2.0%

Maximum 10 values

Value  Count  Frequency (%)
6598   1      2.0%
6073   1      2.0%
5835   1      2.0%
5808   1      2.0%
5658   1      2.0%
4539   1      2.0%
4482   1      2.0%
4337   1      2.0%
4284   1      2.0%
4146   1      2.0%

합계 (total)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 48
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17734.714
Minimum: 7321
Maximum: 52997
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 573.0 B

Quantile statistics

Minimum: 7321
5th percentile: 8989.2
Q1: 10326
Median: 14582
Q3: 19818
95th percentile: 47334.6
Maximum: 52997
Range: 45676
Interquartile range (IQR): 9492

Descriptive statistics

Standard deviation: 10938.067
Coefficient of variation (CV): 0.61676029
Kurtosis: 3.7655609
Mean: 17734.714
Median Absolute Deviation (MAD): 4898
Skewness: 2.0314521
Sum: 869001
Variance: 1.1964132 × 10^8
Monotonicity: Not monotonic
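
The reported sums suggest that 합계 is simply 설비용사용량 plus 사무용사용량, which would also explain the high-correlation alert. A short consistency check on the column sums, assuming the 49 observations stated in the overview (variable names are illustrative):

```python
# Column sums copied from the three numeric variable sections.
sum_facility = 710042   # 설비용사용량
sum_office = 158959     # 사무용사용량
sum_total = 869001      # 합계
n_observations = 49

# If the total column is the row-wise sum, the column sums must add up too.
sums_match = sum_facility + sum_office == sum_total

# The mean of the total column then follows from its sum.
mean_total = sum_total / n_observations

print(sums_match, round(mean_total, 3))
```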
Histogram with fixed size bins (bins=48)
Common values

Value              Count  Frequency (%)
9682               2      4.1%
11296              1      2.0%
17208              1      2.0%
44751              1      2.0%
52997              1      2.0%
18065              1      2.0%
12992              1      2.0%
14693              1      2.0%
21433              1      2.0%
11836              1      2.0%
Other values (38)  38     77.6%

Minimum 10 values

Value  Count  Frequency (%)
7321   1      2.0%
7740   1      2.0%
8716   1      2.0%
9399   1      2.0%
9438   1      2.0%
9451   1      2.0%
9478   1      2.0%
9500   1      2.0%
9623   1      2.0%
9682   2      4.1%

Maximum 10 values

Value  Count  Frequency (%)
52997  1      2.0%
49531  1      2.0%
49057  1      2.0%
44751  1      2.0%
33381  1      2.0%
27184  1      2.0%
27006  1      2.0%
23867  1      2.0%
21433  1      2.0%
20993  1      2.0%

Interactions


Correlations

              사용 월  설비용사용량  사무용사용량  합계
사용 월       1.000    1.000         1.000         1.000
설비용사용량  1.000    1.000         0.671         0.992
사무용사용량  1.000    0.671         1.000         0.709
합계          1.000    0.992         0.709         1.000

              설비용사용량  사무용사용량  합계
설비용사용량  1.000         0.239         0.983
사무용사용량  0.239         1.000         0.339
합계          0.983         0.339         1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    사용 월           설비용사용량  사무용사용량  합계
0   2020-01-01 00:00  8729          2567          11296
1   2020-02-01 00:00  7100          2351          9451
2   2020-03-01 00:00  5927          1813          7740
3   2020-04-01 00:00  7155          2468          9623
4   2020-05-01 00:00  7525          2381          9906
5   2020-06-01 00:00  6744          2655          9399
6   2020-07-01 00:00  4300          3021          7321
7   2020-08-01 00:00  6580          2898          9478
8   2020-09-01 00:00  6601          3081          9682
9   2020-10-01 00:00  6883          2555          9438

Last rows

    사용 월           설비용사용량  사무용사용량  합계
39  2023-04-01 00:00  8361          5658          14019
40  2023-05-01 00:00  8786          2140          10926
41  2023-06-01 00:00  10327         2950          13277
42  2023-07-01 00:00  6810          2690          9500
43  2023-08-01 00:00  7701          3470          11171
44  2023-09-01 00:00  7586          2740          10326
45  2023-10-01 00:00  6696          2020          8716
46  2023-11-01 00:00  11389         3020          14409
47  2023-12-01 00:00  16108         3830          19938
48  2024-01-01 00:00  11839         2730          14569
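
The sample rows make it possible to check how the columns relate. The sketch below verifies that 합계 equals 설비용사용량 plus 사무용사용량 in every sampled row, then computes a Pearson coefficient between 설비용사용량 and 합계. It uses only the 20 sample rows shown in this report, not the full 49, so the coefficient will not match the values in the Correlations section exactly; the `pearson` helper is written here for illustration and is not ydata-profiling code.

```python
from math import sqrt

# (설비용사용량, 사무용사용량, 합계) for the 20 sample rows shown in the report.
rows = [
    (8729, 2567, 11296), (7100, 2351, 9451), (5927, 1813, 7740),
    (7155, 2468, 9623), (7525, 2381, 9906), (6744, 2655, 9399),
    (4300, 3021, 7321), (6580, 2898, 9478), (6601, 3081, 9682),
    (6883, 2555, 9438), (8361, 5658, 14019), (8786, 2140, 10926),
    (10327, 2950, 13277), (6810, 2690, 9500), (7701, 3470, 11171),
    (7586, 2740, 10326), (6696, 2020, 8716), (11389, 3020, 14409),
    (16108, 3830, 19938), (11839, 2730, 14569),
]

# Check that the total column is the row-wise sum of the two usage columns.
all_consistent = all(facility + office == total for facility, office, total in rows)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


facility = [r[0] for r in rows]
total = [r[2] for r in rows]
r_facility_total = pearson(facility, total)

print(all_consistent, round(r_facility_total, 3))
```

Because 설비용사용량 is by far the larger component of each total, a strong positive correlation between the two columns is expected, in line with the alert raised in the overview.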