Overview

Dataset statistics

Number of variables3
Number of observations24
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory756.0 B
Average record size in memory31.5 B

Variable types

DateTime1
Numeric1
Categorical1

Dataset

Description세종특별자치시 코로나19 확진자 및 사망자 현황(2020.1월 에서 2021. 12월 까지) 월별 데이터 총 24건을 등록하여 추진
Author세종특별자치시
URLhttps://www.data.go.kr/data/15098910/fileData.do

Alerts

확진자 is highly overall correlated with 사망자High correlation
사망자 is highly overall correlated with 확진자High correlation
사망자 is highly imbalanced (58.5%)Imbalance
구분 has unique valuesUnique
확진자 has 3 (12.5%) zerosZeros

Reproduction

Analysis started2023-12-12 10:57:31.932045
Analysis finished2023-12-12 10:57:32.446113
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Date

UNIQUE 

Distinct24
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size324.0 B
Minimum2020-01-31 00:00:00
Maximum2021-12-31 00:00:00
2023-12-12T19:57:32.556489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:57:32.777745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)

확진자
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct21
Distinct (%)87.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean89.916667
Minimum0
Maximum578
Zeros3
Zeros (%)12.5%
Negative0
Negative (%)0.0%
Memory size348.0 B
2023-12-12T19:57:32.996931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15.25
median45.5
Q3120.5
95-th percentile255.9
Maximum578
Range578
Interquartile range (IQR)115.25

Descriptive statistics

Standard deviation131.44082
Coefficient of variation (CV)1.461807
Kurtosis7.7181316
Mean89.916667
Median Absolute Deviation (MAD)43.5
Skewness2.5220327
Sum2158
Variance17276.688
MonotonicityNot monotonic
2023-12-12T19:57:33.226480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
0 3
 
12.5%
1 2
 
8.3%
578 1
 
4.2%
213 1
 
4.2%
120 1
 
4.2%
244 1
 
4.2%
258 1
 
4.2%
180 1
 
4.2%
82 1
 
4.2%
122 1
 
4.2%
Other values (11) 11
45.8%
ValueCountFrequency (%)
0 3
12.5%
1 2
8.3%
3 1
 
4.2%
6 1
 
4.2%
9 1
 
4.2%
17 1
 
4.2%
20 1
 
4.2%
31 1
 
4.2%
45 1
 
4.2%
46 1
 
4.2%
ValueCountFrequency (%)
578 1
4.2%
258 1
4.2%
244 1
4.2%
213 1
4.2%
180 1
4.2%
122 1
4.2%
120 1
4.2%
82 1
4.2%
78 1
4.2%
56 1
4.2%

사망자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size324.0 B
0
21 
1
 
2
2
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)4.2%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 21
87.5%
1 2
 
8.3%
2 1
 
4.2%

Length

2023-12-12T19:57:33.443845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:57:33.595807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 21
87.5%
1 2
 
8.3%
2 1
 
4.2%

Interactions

2023-12-12T19:57:32.048288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:57:33.709002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분확진자사망자
구분1.0001.0001.000
확진자1.0001.0000.943
사망자1.0000.9431.000
2023-12-12T19:57:33.859586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
확진자사망자
확진자1.0000.654
사망자0.6541.000

Missing values

2023-12-12T19:57:32.263426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:57:32.397252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분확진자사망자
02020-01-3100
12020-02-2910
22020-03-31450
32020-04-3000
42020-05-3110
52020-06-3030
62020-07-3100
72020-08-31170
82020-09-3090
92020-10-3160
구분확진자사망자
142021-03-31560
152021-04-30780
162021-05-311220
172021-06-30820
182021-07-311800
192021-08-312580
202021-09-302440
212021-10-311201
222021-11-302130
232021-12-315782