Overview

Dataset statistics

Number of variables3
Number of observations1273
Missing cells608
Missing cells (%)15.9%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory32.4 KiB
Average record size in memory26.1 B

Variable types

DateTime1
Categorical1
Numeric1

Dataset

Description2021. 1월부터 2023. 8월까지의 코로나19로 인한 일별 사망자 현황입니다. 날짜별 사망자수 및 누계를 제공합니다.
Author대구광역시 서구
URLhttps://www.data.go.kr/data/15098734/fileData.do

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates
사망자 수 is highly imbalanced (50.3%)Imbalance
일자 has 304 (23.9%) missing valuesMissing
누계 has 304 (23.9%) missing valuesMissing

Reproduction

Analysis started2024-03-14 09:55:25.762590
Analysis finished2024-03-14 09:55:26.622649
Duration0.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일자
Date

MISSING 

Distinct969
Distinct (%)100.0%
Missing304
Missing (%)23.9%
Memory size10.1 KiB
Minimum2021-01-05 00:00:00
Maximum2023-08-31 00:00:00
2024-03-14T18:55:26.748492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T18:55:26.975327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사망자 수
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size10.1 KiB
0
848 
<NA>
304 
1
96 
2
 
23
3
 
1

Length

Max length4
Median length1
Mean length1.7164179
Min length1

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 848
66.6%
<NA> 304
 
23.9%
1 96
 
7.5%
2 23
 
1.8%
3 1
 
0.1%
4 1
 
0.1%

Length

2024-03-14T18:55:27.198206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T18:55:27.480910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 848
66.6%
na 304
 
23.9%
1 96
 
7.5%
2 23
 
1.8%
3 1
 
0.1%
4 1
 
0.1%

누계
Real number (ℝ)

MISSING 

Distinct122
Distinct (%)12.6%
Missing304
Missing (%)23.9%
Infinite0
Infinite (%)0.0%
Mean117.93911
Minimum46
Maximum195
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.3 KiB
2024-03-14T18:55:27.733271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum46
5-th percentile46
Q147
median139
Q3178
95-th percentile190
Maximum195
Range149
Interquartile range (IQR)131

Descriptive statistics

Standard deviation57.43161
Coefficient of variation (CV)0.48695983
Kurtosis-1.6336724
Mean117.93911
Median Absolute Deviation (MAD)50
Skewness-0.089272276
Sum114283
Variance3298.3899
MonotonicityIncreasing
2024-03-14T18:55:28.189779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
47 147
 
11.5%
46 115
 
9.0%
141 42
 
3.3%
189 39
 
3.1%
186 37
 
2.9%
185 32
 
2.5%
139 30
 
2.4%
87 28
 
2.2%
86 28
 
2.2%
188 25
 
2.0%
Other values (112) 446
35.0%
(Missing) 304
23.9%
ValueCountFrequency (%)
46 115
9.0%
47 147
11.5%
48 12
 
0.9%
49 21
 
1.6%
51 3
 
0.2%
52 1
 
0.1%
53 1
 
0.1%
54 1
 
0.1%
55 1
 
0.1%
56 1
 
0.1%
ValueCountFrequency (%)
195 3
 
0.2%
194 6
 
0.5%
193 11
 
0.9%
192 10
 
0.8%
191 13
 
1.0%
190 22
1.7%
189 39
3.1%
188 25
2.0%
187 8
 
0.6%
186 37
2.9%

Interactions

2024-03-14T18:55:25.881032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T18:55:28.471444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사망자 수누계
사망자 수1.0000.562
누계0.5621.000
2024-03-14T18:55:28.607990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
누계사망자 수
누계1.0000.266
사망자 수0.2661.000

Missing values

2024-03-14T18:55:26.255493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T18:55:26.388453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T18:55:26.535663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

일자사망자 수누계
02021-01-05046
12021-01-06046
22021-01-07046
32021-01-08046
42021-01-09046
52021-01-10046
62021-01-11046
72021-01-12046
82021-01-13046
92021-01-14046
일자사망자 수누계
1263<NA><NA><NA>
1264<NA><NA><NA>
1265<NA><NA><NA>
1266<NA><NA><NA>
1267<NA><NA><NA>
1268<NA><NA><NA>
1269<NA><NA><NA>
1270<NA><NA><NA>
1271<NA><NA><NA>
1272<NA><NA><NA>

Duplicate rows

Most frequently occurring

일자사망자 수누계# duplicates
0<NA><NA><NA>304