Overview

Dataset statistics

Number of variables7
Number of observations2557
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory145.0 KiB
Average record size in memory58.1 B

Variable types

DateTime1
Categorical4
Boolean2

Dataset

Description환경신기술 업무 기준 달력 정보(2020-10-26 기준 / 일자, 요일, 주간, 월 마지막 주, 휴일여부, 휴일사유, 일요일 구분 등)
Author한국환경산업기술원
URLhttps://www.data.go.kr/data/15071523/fileData.do

Alerts

일요일 구분 is highly overall correlated with 요일High correlation
휴일사유 is highly overall correlated with 주간 and 2 other fieldsHigh correlation
휴일여부 is highly overall correlated with 휴일사유High correlation
요일 is highly overall correlated with 일요일 구분High correlation
주간 is highly overall correlated with 휴일사유High correlation
월마지막주 is highly overall correlated with 휴일사유High correlation
월마지막주 is highly imbalanced (69.4%)Imbalance
휴일여부 is highly imbalanced (81.3%)Imbalance
휴일사유 is highly imbalanced (93.3%)Imbalance
일자 has unique valuesUnique

Reproduction

Analysis started2023-12-12 16:37:08.134611
Analysis finished2023-12-12 16:37:08.802657
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일자
Date

UNIQUE 

Distinct2557
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size20.1 KiB
Minimum2014-01-01 00:00:00
Maximum2020-12-31 00:00:00
2023-12-13T01:37:09.273871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:37:09.446116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

요일
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size20.1 KiB
366 
366 
365 
365 
365 
Other values (2)
730 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
366
14.3%
366
14.3%
365
14.3%
365
14.3%
365
14.3%
365
14.3%
365
14.3%

Length

2023-12-13T01:37:09.617513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:37:09.781119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
366
14.3%
366
14.3%
365
14.3%
365
14.3%
365
14.3%
365
14.3%
365
14.3%

주간
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size20.1 KiB
2
588 
3
588 
4
588 
1
588 
5
205 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 588
23.0%
3 588
23.0%
4 588
23.0%
1 588
23.0%
5 205
 
8.0%

Length

2023-12-13T01:37:09.955543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:37:10.095654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 588
23.0%
3 588
23.0%
4 588
23.0%
1 588
23.0%
5 205
 
8.0%

월마지막주
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size20.1 KiB
5
2417 
4
 
140

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5
2nd row5
3rd row5
4th row5
5th row5

Common Values

ValueCountFrequency (%)
5 2417
94.5%
4 140
 
5.5%

Length

2023-12-13T01:37:10.220935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:37:10.317172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 2417
94.5%
4 140
 
5.5%

휴일여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
False
2484 
True
 
73
ValueCountFrequency (%)
False 2484
97.1%
True 73
 
2.9%
2023-12-13T01:37:10.395469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

휴일사유
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct22
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size20.1 KiB
<NA>
2484 
삼일절
 
7
어린이날
 
7
현충일
 
7
광복절
 
7
Other values (17)
 
45

Length

Max length9
Median length4
Mean length3.9898318
Min length2

Unique

Unique12 ?
Unique (%)0.5%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 2484
97.1%
삼일절 7
 
0.3%
어린이날 7
 
0.3%
현충일 7
 
0.3%
광복절 7
 
0.3%
개천절 7
 
0.3%
한글날 7
 
0.3%
크리스마스 7
 
0.3%
신정 7
 
0.3%
석가탄신일 5
 
0.2%
Other values (12) 12
 
0.5%

Length

2023-12-13T01:37:10.519074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 2484
97.1%
어린이날 7
 
0.3%
현충일 7
 
0.3%
광복절 7
 
0.3%
개천절 7
 
0.3%
한글날 7
 
0.3%
크리스마스 7
 
0.3%
신정 7
 
0.3%
삼일절 7
 
0.3%
석가탄신일 5
 
0.2%
Other values (12) 12
 
0.5%

일요일 구분
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
False
1832 
True
725 
ValueCountFrequency (%)
False 1832
71.6%
True 725
 
28.4%
2023-12-13T01:37:10.644967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:37:10.729459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
요일주간월마지막주휴일여부휴일사유일요일 구분
요일1.0000.0000.0000.0000.0000.948
주간0.0001.0000.0480.0871.0000.000
월마지막주0.0000.0481.0000.0001.0000.000
휴일여부0.0000.0870.0001.000NaN0.077
휴일사유0.0001.0001.000NaN1.0000.000
일요일 구분0.9480.0000.0000.0770.0001.000
2023-12-13T01:37:10.845453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
월마지막주일요일 구분주간휴일사유휴일여부요일
월마지막주1.0000.0000.0590.8560.0000.000
일요일 구분0.0001.0000.0000.0000.0490.994
주간0.0590.0001.0000.8680.1070.000
휴일사유0.8560.0000.8681.0001.0000.000
휴일여부0.0000.0490.1071.0001.0000.000
요일0.0000.9940.0000.0000.0001.000
2023-12-13T01:37:10.984568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
요일주간월마지막주휴일여부휴일사유일요일 구분
요일1.0000.0000.0000.0000.0000.994
주간0.0001.0000.0590.1070.8680.000
월마지막주0.0000.0591.0000.0000.8560.000
휴일여부0.0000.1070.0001.0001.0000.049
휴일사유0.0000.8680.8561.0001.0000.000
일요일 구분0.9940.0000.0000.0490.0001.000

Missing values

2023-12-13T01:37:08.562011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:37:08.736498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일자요일주간월마지막주휴일여부휴일사유일요일 구분
02014-11-1025N<NA>N
12014-11-1125N<NA>N
22014-11-1225N<NA>N
32014-11-1325N<NA>N
42014-11-1425N<NA>N
52014-11-1535N<NA>Y
62014-11-1635N<NA>Y
72014-11-1735N<NA>N
82014-11-1835N<NA>N
92014-11-1935N<NA>N
일자요일주간월마지막주휴일여부휴일사유일요일 구분
25472020-12-2245N<NA>N
25482020-12-2345N<NA>N
25492020-12-2445N<NA>N
25502020-12-2545Y크리스마스N
25512020-12-2645N<NA>Y
25522020-12-2745N<NA>Y
25532020-12-2845N<NA>N
25542020-12-2955N<NA>N
25552020-12-3055N<NA>N
25562020-12-3155N<NA>N