Overview

Dataset statistics

Number of variables3
Number of observations377
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.0 KiB
Average record size in memory24.4 B

Variable types

DateTime1
Categorical2

Dataset

Description대구광역시 상수도사업본부의 2020년 8월 1일부터 2023년 6월 30일까지 휴일과 요금조정일의 일자를 포함하고 있습니다.
Author공공데이터포털
URLhttps://www.data.go.kr/data/15116751/fileData.do

Alerts

휴일여부 is highly imbalanced (55.4%)Imbalance
요금조정일 is highly imbalanced (55.4%)Imbalance
일자 has unique valuesUnique

Reproduction

Analysis started2024-04-17 16:12:16.918507
Analysis finished2024-04-17 16:12:17.089245
Duration0.17 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일자
Date

UNIQUE 

Distinct377
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
Minimum2020-08-01 00:00:00
Maximum2023-06-25 00:00:00
2024-04-18T01:12:17.158728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T01:12:17.277890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴일여부
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
휴일
342 
<NA>
35 

Length

Max length4
Median length2
Mean length2.1856764
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row휴일
2nd row휴일
3rd row휴일
4th row휴일
5th row<NA>

Common Values

ValueCountFrequency (%)
휴일 342
90.7%
<NA> 35
 
9.3%

Length

2024-04-18T01:12:17.380791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T01:12:17.464015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
휴일 342
90.7%
na 35
 
9.3%

요금조정일
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
<NA>
342 
요금조정일
35 

Length

Max length5
Median length4
Mean length4.0928382
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row요금조정일

Common Values

ValueCountFrequency (%)
<NA> 342
90.7%
요금조정일 35
 
9.3%

Length

2024-04-18T01:12:17.549485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T01:12:17.628770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 342
90.7%
요금조정일 35
 
9.3%

Correlations

2024-04-18T01:12:17.680501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
휴일여부요금조정일
휴일여부1.000NaN
요금조정일NaN1.000
2024-04-18T01:12:17.746219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
휴일여부요금조정일
휴일여부1.0000.000
요금조정일0.0001.000

Missing values

2024-04-18T01:12:16.993988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-18T01:12:17.061239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일자휴일여부요금조정일
02020-08-01휴일<NA>
12020-08-02휴일<NA>
22020-08-08휴일<NA>
32020-08-09휴일<NA>
42020-08-10<NA>요금조정일
52020-08-15휴일<NA>
62020-08-16휴일<NA>
72020-08-17휴일<NA>
82020-08-22휴일<NA>
92020-08-23휴일<NA>
일자휴일여부요금조정일
3672023-06-03휴일<NA>
3682023-06-04휴일<NA>
3692023-06-06휴일<NA>
3702023-06-10휴일<NA>
3712023-06-11휴일<NA>
3722023-06-12<NA>요금조정일
3732023-06-17휴일<NA>
3742023-06-18휴일<NA>
3752023-06-24휴일<NA>
3762023-06-25휴일<NA>