Overview

Dataset statistics

Number of variables: 4
Number of observations: 43
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 KiB
Average record size in memory: 37.1 B

Variable types

Categorical: 2
DateTime: 1
Numeric: 1

Dataset

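Description: Data on confirmed COVID-19 cases and deaths in Gangjin-gun, Jeollanam-do (전라남도 강진군), including monthly counts of confirmed cases and deaths from the first occurrence through 2021.
URL: https://www.data.go.kr/data/15098678/fileData.do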

Alerts

시군명 has constant value "전라남도 강진군" (Constant)
확진자수 is highly overall correlated with 사망자수 (High correlation)
사망자수 is highly overall correlated with 확진자수 (High correlation)
사망자수 is highly imbalanced (72.9%) (Imbalance)
발생년월 has unique values (Unique)
확진자수 has 15 (34.9%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-12 21:16:49.248866
Analysis finished: 2023-12-12 21:16:49.599693
Duration: 0.35 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시군명 (city/county name)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.3%
Missing: 0
Missing (%): 0.0%
Memory size: 476.0 B
전라남도 강진군: 43

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전라남도 강진군
2nd row: 전라남도 강진군
3rd row: 전라남도 강진군
4th row: 전라남도 강진군
5th row: 전라남도 강진군

Common Values

Value              Count   Frequency (%)
전라남도 강진군    43      100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value       Count   Frequency (%)
전라남도    43      50.0%
강진군      43      50.0%

발생년월 (year and month of occurrence)
Date

UNIQUE 

Distinct: 43
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 476.0 B
Minimum: 2020-01-01 00:00:00
Maximum: 2023-07-01 00:00:00
Histogram with fixed size bins (bins=43)
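
The reported date range and the uniqueness flag are consistent with one record per calendar month. As a quick check, and assuming the monthly series has no gaps (an assumption; the report does not state this directly), counting the months between the reported minimum and maximum should give the number of observations. The helper name `months_inclusive` is illustrative only.

```python
from datetime import date


def months_inclusive(start: date, end: date) -> int:
    """Count calendar months from start to end, including both endpoints."""
    return (end.year - start.year) * 12 + (end.month - start.month) + 1


# Minimum and maximum of 발생년월 as reported above.
first_month = date(2020, 1, 1)
last_month = date(2023, 7, 1)

n_months = months_inclusive(first_month, last_month)
print(n_months)  # 43, matching the number of observations
```
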

확진자수 (number of confirmed cases)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 25
Distinct (%): 58.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 380.86047
Minimum: 0
Maximum: 5843
Zeros: 15
Zeros (%): 34.9%
Negative: 0
Negative (%): 0.0%
Memory size: 519.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 5
Q3: 292
95-th percentile: 1960.1
Maximum: 5843
Range: 5843
Interquartile range (IQR): 292

Descriptive statistics

Standard deviation: 988.45913
Coefficient of variation (CV): 2.5953314
Kurtosis: 23.065478
Mean: 380.86047
Median Absolute Deviation (MAD): 5
Skewness: 4.5060708
Sum: 16377
Variance: 977051.46
Monotonicity: Not monotonic
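
Several of these figures follow from others in the report, which gives a simple way to sanity-check the extraction. The sketch below recomputes the derived statistics from the reported sum, count, standard deviation, and quantiles. It uses only numbers copied from this report, not the underlying data, and the variable names are illustrative.

```python
# Values copied from the report for 확진자수.
n = 43
total = 16377
std = 988.45913
minimum, q1, q3, maximum = 0, 0, 292, 5843
zeros = 15
distinct = 25

mean = total / n                   # reported mean: 380.86047
cv = std / mean                    # reported CV: 2.5953314
variance = std ** 2                # reported variance: 977051.46
iqr = q3 - q1                      # reported IQR: 292
value_range = maximum - minimum    # reported range: 5843
zeros_pct = 100 * zeros / n        # reported zeros: 34.9%
distinct_pct = 100 * distinct / n  # reported distinct: 58.1%

print(f"mean={mean:.5f} cv={cv:.7f} variance={variance:.2f}")
print(f"iqr={iqr} range={value_range}")
print(f"zeros={zeros_pct:.1f}% distinct={distinct_pct:.1f}%")
```

Small differences in the last printed digit are expected because the inputs are themselves rounded.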
Histogram with fixed size bins (bins=25)
Common values

Value               Count   Frequency (%)
0                   15      34.9%
3                   3       7.0%
1                   2       4.7%
11                  2       4.7%
248                 1       2.3%
468                 1       2.3%
152                 1       2.3%
254                 1       2.3%
215                 1       2.3%
169                 1       2.3%
Other values (15)   15      34.9%

Minimum 10 values

Value   Count   Frequency (%)
0       15      34.9%
1       2       4.7%
3       3       7.0%
4       1       2.3%
5       1       2.3%
11      2       4.7%
59      1       2.3%
135     1       2.3%
152     1       2.3%
169     1       2.3%

Maximum 10 values

Value   Count   Frequency (%)
5843    1       2.3%
2347    1       2.3%
2060    1       2.3%
1061    1       2.3%
700     1       2.3%
628     1       2.3%
572     1       2.3%
549     1       2.3%
468     1       2.3%
372     1       2.3%

사망자수 (number of deaths)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 7.0%
Missing: 0
Missing (%): 0.0%
Memory size: 476.0 B
0: 40
2: 2
10: 1

Length

Max length: 2
Median length: 1
Mean length: 1.0232558
Min length: 1
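
Because 사망자수 was typed as categorical, these length statistics describe the string form of each value ("0", "2", "10"). A short sketch, using the category counts reported for this variable, shows where the mean length of 1.0232558 comes from:

```python
# Category counts for 사망자수, keyed by the value as a string.
counts = {"0": 40, "2": 2, "10": 1}

total = sum(counts.values())
# Weighted mean of string lengths: (40*1 + 2*1 + 1*2) / 43 = 44 / 43.
mean_length = sum(len(value) * count for value, count in counts.items()) / total
max_length = max(len(value) for value in counts)
min_length = min(len(value) for value in counts)

print(round(mean_length, 7), max_length, min_length)
```
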

Unique

Unique: 1
Unique (%): 2.3%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value   Count   Frequency (%)
0       40      93.0%
2       2       4.7%
10      1       2.3%
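
The 72.9% imbalance figure in the alerts can be reproduced from these counts. The sketch below assumes the score is one minus the Shannon entropy of the class proportions, normalized by the log of the number of classes. That definition is an assumption and is not stated in the report, although it does reproduce the reported value. The function name `imbalance_score` is illustrative.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits of the
    class proportions and k is the number of classes. 0 means perfectly
    balanced; values near 1 mean one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # Convention of this sketch: a single class has no balance to measure.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Counts for the values 0, 2, and 10 of 사망자수.
death_counts = [40, 2, 1]
score = imbalance_score(death_counts)
print(f"{score:.1%}")  # 72.9%
```
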

Length

Histogram of lengths of the category

Common Values (Plot)

Value   Count   Frequency (%)
0       40      93.0%
2       2       4.7%
10      1       2.3%

Interactions

[Figure: interactions plot]

Correlations

            발생년월   확진자수   사망자수
발생년월    1.000      1.000      1.000
확진자수    1.000      1.000      1.000
사망자수    1.000      1.000      1.000

            확진자수   사망자수
확진자수    1.000      0.975
사망자수    0.975      1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

      시군명             발생년월   확진자수   사망자수
0     전라남도 강진군   2020-01    0          0
1     전라남도 강진군   2020-02    0          0
2     전라남도 강진군   2020-03    0          0
3     전라남도 강진군   2020-04    0          0
4     전라남도 강진군   2020-05    0          0
5     전라남도 강진군   2020-06    0          0
6     전라남도 강진군   2020-07    0          0
7     전라남도 강진군   2020-08    0          0
8     전라남도 강진군   2020-09    0          0
9     전라남도 강진군   2020-10    0          0

Last rows

      시군명             발생년월   확진자수   사망자수
33    전라남도 강진군   2022-10    248        0
34    전라남도 강진군   2022-11    572        0
35    전라남도 강진군   2022-12    1061       0
36    전라남도 강진군   2023-01    549        0
37    전라남도 강진군   2023-02    173        0
38    전라남도 강진군   2023-03    169        0
39    전라남도 강진군   2023-04    215        0
40    전라남도 강진군   2023-05    254        0
41    전라남도 강진군   2023-06    152        0
42    전라남도 강진군   2023-07    468        0