Overview

Dataset statistics

Number of variables: 4
Number of observations: 50
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.8 KiB
Average record size in memory: 37.6 B

Variable types

DateTime: 1
Numeric: 3

Dataset

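Description: Annual tap water usage at the Incheon Terminal Division (인천기지본부) of the Korea Gas Corporation, giving water usage in cubic meters (㎥) by purpose (facility use, office use).
Author: 한국가스공사 (Korea Gas Corporation)
URL: https://www.data.go.kr/data/15087687/fileData.do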

Alerts

설비용사용량 is highly correlated overall with 합계 [High correlation]
합계 is highly correlated overall with 설비용사용량 [High correlation]
사용 월 has unique values [Unique]
설비용사용량 has unique values [Unique]

Reproduction

Analysis started: 2024-03-23 05:37:48.502593
Analysis finished: 2024-03-23 05:37:50.735477
Duration: 2.23 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
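
A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the configuration actually used for this run: the function name `build_report`, the `cp949` encoding, and the report title are assumptions, and the CSV path must point to the file downloaded from the URL given under Dataset. Only `pandas.read_csv`, `ProfileReport`, and `ProfileReport.to_file` are relied on.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html") -> Path:
    """Profile a CSV file with ydata-profiling and write an HTML report."""
    # Third-party imports are done lazily so this module can be imported
    # even where pandas / ydata-profiling are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is cp949-encoded; change this if it is UTF-8.
    # "사용 월" is the month column named in this report.
    df = pd.read_csv(csv_path, encoding="cp949", parse_dates=["사용 월"])

    # Hypothetical title; the original report's title is not known.
    profile = ProfileReport(df, title="Water usage profile")
    profile.to_file(out_path)
    return Path(out_path)
```

Calling `build_report("water_usage.csv")` (a hypothetical file name) would write `report.html` to the working directory.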

Variables

사용 월 (usage month)
Date

UNIQUE

Distinct: 50
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 532.0 B
Minimum: 2020-01-01 00:00:00
Maximum: 2024-02-01 00:00:00
Histogram with fixed size bins (bins=50)

설비용사용량 (facility water usage)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 50
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14439.86
Minimum: 4300
Maximum: 50749
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 582.0 B

Quantile statistics

Minimum: 4300
5th percentile: 6589.45
Q1: 7614.75
Median: 11592
Q3: 15937.25
95th percentile: 45248.45
Maximum: 50749
Range: 46449
Interquartile range (IQR): 8322.5

Descriptive statistics

Standard deviation: 10903.517
Coefficient of variation (CV): 0.75509854
Kurtosis: 4.7792315
Mean: 14439.86
Median Absolute Deviation (MAD): 4036.5
Skewness: 2.2848882
Sum: 721993
Variance: 1.1888669 × 10^8
Monotonicity: Not monotonic
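
Several of the figures above follow from others by simple arithmetic. The sketch below recomputes the mean, range, IQR, coefficient of variation, and variance for 설비용사용량 from the values reported in this section. The variable names are illustrative, and the small differences that appear come from the rounding of the reported standard deviation.

```python
import math

# Values copied from the 설비용사용량 section of this report.
n_observations = 50
total = 721993
minimum, maximum = 4300, 50749
q1, q3 = 7614.75, 15937.25
std_dev = 10903.517

mean = total / n_observations        # Sum divided by number of observations
value_range = maximum - minimum      # Range = Maximum - Minimum
iqr = q3 - q1                        # Interquartile range = Q3 - Q1
cv = std_dev / mean                  # Coefficient of variation = std / mean
variance = std_dev ** 2              # Variance = std squared

print(f"mean={mean:.2f}, range={value_range}, IQR={iqr}")
print(f"CV={cv:.6f}, variance={variance:.4e}")
```

The same relations can be checked against the figures reported for 사무용사용량 and 합계.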
Histogram with fixed size bins (bins=50)
Common values
Value              Count  Frequency (%)
8729               1      2.0%
11400              1      2.0%
42829              1      2.0%
50749              1      2.0%
13526              1      2.0%
8708               1      2.0%
10211              1      2.0%
17287              1      2.0%
8323               1      2.0%
18032              1      2.0%
Other values (40)  40     80.0%

Minimum 10 values
Value  Count  Frequency (%)
4300   1      2.0%
5927   1      2.0%
6580   1      2.0%
6601   1      2.0%
6696   1      2.0%
6744   1      2.0%
6810   1      2.0%
6883   1      2.0%
7100   1      2.0%
7155   1      2.0%

Maximum 10 values
Value  Count  Frequency (%)
50749  1      2.0%
47718  1      2.0%
47228  1      2.0%
42829  1      2.0%
29044  1      2.0%
23924  1      2.0%
20933  1      2.0%
18032  1      2.0%
17489  1      2.0%
17287  1      2.0%

사무용사용량 (office water usage)
Real number (ℝ)

Distinct: 49
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3239.78
Minimum: 1813
Maximum: 6598
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 582.0 B

Quantile statistics

Minimum: 1813
5th percentile: 1870.85
Q1: 2506.25
Median: 2953
Q3: 3512.25
95th percentile: 5822.85
Maximum: 6598
Range: 4785
Interquartile range (IQR): 1006

Descriptive statistics

Standard deviation: 1155.134
Coefficient of variation (CV): 0.35654704
Kurtosis: 1.3212027
Mean: 3239.78
Median Absolute Deviation (MAD): 554
Skewness: 1.3194712
Sum: 161989
Variance: 1334334.5
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=49)
Common values
Value              Count  Frequency (%)
1813               2      4.0%
2567               1      2.0%
5658               1      2.0%
2248               1      2.0%
4539               1      2.0%
4284               1      2.0%
4482               1      2.0%
4146               1      2.0%
3513               1      2.0%
5835               1      2.0%
Other values (39)  39     78.0%

Minimum 10 values
Value  Count  Frequency (%)
1813   2      4.0%
1829   1      2.0%
1922   1      2.0%
2020   1      2.0%
2140   1      2.0%
2183   1      2.0%
2238   1      2.0%
2248   1      2.0%
2351   1      2.0%
2381   1      2.0%

Maximum 10 values
Value  Count  Frequency (%)
6598   1      2.0%
6073   1      2.0%
5835   1      2.0%
5808   1      2.0%
5658   1      2.0%
4539   1      2.0%
4482   1      2.0%
4337   1      2.0%
4284   1      2.0%
4146   1      2.0%

합계 (total)
Real number (ℝ)

HIGH CORRELATION

Distinct: 49
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17679.64
Minimum: 7321
Maximum: 52997
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 582.0 B

Quantile statistics

Minimum: 7321
5th percentile: 9023.35
Q1: 10476
Median: 14596.5
Q3: 19733.5
95th percentile: 47119.3
Maximum: 52997
Range: 45676
Interquartile range (IQR): 9257.5

Descriptive statistics

Standard deviation: 10832.881
Coefficient of variation (CV): 0.61273201
Kurtosis: 3.9238212
Mean: 17679.64
Median Absolute Deviation (MAD): 4787
Skewness: 2.0623229
Sum: 883982
Variance: 1.1735132 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=49)
Common values
Value              Count  Frequency (%)
9682               2      4.0%
11296              1      2.0%
14019              1      2.0%
52997              1      2.0%
18065              1      2.0%
12992              1      2.0%
14693              1      2.0%
21433              1      2.0%
11836              1      2.0%
23867              1      2.0%
Other values (39)  39     78.0%

Minimum 10 values
Value  Count  Frequency (%)
7321   1      2.0%
7740   1      2.0%
8716   1      2.0%
9399   1      2.0%
9438   1      2.0%
9451   1      2.0%
9478   1      2.0%
9500   1      2.0%
9623   1      2.0%
9682   2      4.0%

Maximum 10 values
Value  Count  Frequency (%)
52997  1      2.0%
49531  1      2.0%
49057  1      2.0%
44751  1      2.0%
33381  1      2.0%
27184  1      2.0%
27006  1      2.0%
23867  1      2.0%
21433  1      2.0%
20993  1      2.0%

Interactions

[Nine interaction plots (Matplotlib v3.7.2); images not included]

Correlations

              사용 월  설비용사용량  사무용사용량  합계
사용 월        1.000   1.000        1.000        1.000
설비용사용량    1.000   1.000        0.688        0.993
사무용사용량    1.000   0.688        1.000        0.724
합계           1.000   0.993        0.724        1.000
              설비용사용량  사무용사용량  합계
설비용사용량    1.000        0.247        0.983
사무용사용량    0.247        1.000        0.346
합계           0.983        0.346        1.000
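
The report does not state which coefficient each matrix uses. As an illustration only, the sketch below computes a plain Pearson coefficient between 설비용사용량 and 합계 for the first ten rows listed under Sample. The choice of Pearson and the helper name `pearson` are assumptions, and because only ten of the fifty observations are used, the result will not match the values in the tables above.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# First ten rows from the Sample section (2020-01 to 2020-10).
facility = [8729, 7100, 5927, 7155, 7525, 6744, 4300, 6580, 6601, 6883]
total = [11296, 9451, 7740, 9623, 9906, 9399, 7321, 9478, 9682, 9438]

r = pearson(facility, total)
print(f"Pearson r over the first ten rows: {r:.3f}")
```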

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completeness.

Sample

First rows
    사용 월            설비용사용량  사무용사용량  합계
0   2020-01-01 00:00  8729         2567         11296
1   2020-02-01 00:00  7100         2351         9451
2   2020-03-01 00:00  5927         1813         7740
3   2020-04-01 00:00  7155         2468         9623
4   2020-05-01 00:00  7525         2381         9906
5   2020-06-01 00:00  6744         2655         9399
6   2020-07-01 00:00  4300         3021         7321
7   2020-08-01 00:00  6580         2898         9478
8   2020-09-01 00:00  6601         3081         9682
9   2020-10-01 00:00  6883         2555         9438

Last rows
    사용 월            설비용사용량  사무용사용량  합계
40  2023-05-01 00:00  8786         2140         10926
41  2023-06-01 00:00  10327        2950         13277
42  2023-07-01 00:00  6810         2690         9500
43  2023-08-01 00:00  7701         3470         11171
44  2023-09-01 00:00  7586         2740         10326
45  2023-10-01 00:00  6696         2020         8716
46  2023-11-01 00:00  11389        3020         14409
47  2023-12-01 00:00  16108        3830         19938
48  2024-01-01 00:00  11839        2730         14569
49  2024-02-01 00:00  11951        3030         14981
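
The sample rows suggest that 합계 is simply 설비용사용량 plus 사무용사용량, which would be consistent with the High correlation alert between 합계 and 설비용사용량. The sketch below checks that relation on the twenty sample rows only. It says nothing about the thirty rows not shown, and the tuple layout is an illustrative choice.

```python
# (month, 설비용사용량, 사무용사용량, 합계) for the first and last ten rows.
rows = [
    ("2020-01", 8729, 2567, 11296),
    ("2020-02", 7100, 2351, 9451),
    ("2020-03", 5927, 1813, 7740),
    ("2020-04", 7155, 2468, 9623),
    ("2020-05", 7525, 2381, 9906),
    ("2020-06", 6744, 2655, 9399),
    ("2020-07", 4300, 3021, 7321),
    ("2020-08", 6580, 2898, 9478),
    ("2020-09", 6601, 3081, 9682),
    ("2020-10", 6883, 2555, 9438),
    ("2023-05", 8786, 2140, 10926),
    ("2023-06", 10327, 2950, 13277),
    ("2023-07", 6810, 2690, 9500),
    ("2023-08", 7701, 3470, 11171),
    ("2023-09", 7586, 2740, 10326),
    ("2023-10", 6696, 2020, 8716),
    ("2023-11", 11389, 3020, 14409),
    ("2023-12", 16108, 3830, 19938),
    ("2024-01", 11839, 2730, 14569),
    ("2024-02", 11951, 3030, 14981),
]

# Collect every row where facility + office usage differs from the total.
mismatches = [
    (month, facility, office, total)
    for month, facility, office, total in rows
    if facility + office != total
]

print(f"{len(rows)} rows checked, {len(mismatches)} mismatches")
```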