Overview

Dataset statistics

Number of variables3
Number of observations27
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory807.0 B
Average record size in memory29.9 B

Variable types

DateTime1
Categorical2

Dataset

Description전라북도 고창군 코로나19 확진자 및 사망자 현황 입니다. 데이터는 사망자수, 확진자수 등으로 구성되어 있습니다.
Author전라북도 고창군
URLhttps://www.data.go.kr/data/15099547/fileData.do

Alerts

확진자 is highly overall correlated with 사망자High correlation
사망자 is highly overall correlated with 확진자High correlation
구분 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:45:52.249659
Analysis finished2023-12-12 09:45:52.491698
Duration0.24 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Date

UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size348.0 B
Minimum2020-01-31 00:00:00
Maximum2022-03-31 00:00:00
2023-12-12T18:45:52.563779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:45:52.713261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)

확진자
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)48.1%
Missing0
Missing (%)0.0%
Memory size348.0 B
0
1
2
6
3
Other values (8)

Length

Max length5
Median length1
Mean length1.5925926
Min length1

Unique

Unique8 ?
Unique (%)29.6%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 8
29.6%
1 4
14.8%
2 3
 
11.1%
6 2
 
7.4%
3 2
 
7.4%
11 1
 
3.7%
15 1
 
3.7%
9 1
 
3.7%
121 1
 
3.7%
114 1
 
3.7%
Other values (3) 3
 
11.1%

Length

2023-12-12T18:45:52.921411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
0 8
29.6%
1 4
14.8%
2 3
 
11.1%
6 2
 
7.4%
3 2
 
7.4%
11 1
 
3.7%
15 1
 
3.7%
9 1
 
3.7%
121 1
 
3.7%
114 1
 
3.7%
Other values (3) 3
 
11.1%

사망자
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)18.5%
Missing0
Missing (%)0.0%
Memory size348.0 B
0
20 
1
3
 
2
2
 
1
9
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique2 ?
Unique (%)7.4%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 20
74.1%
1 3
 
11.1%
3 2
 
7.4%
2 1
 
3.7%
9 1
 
3.7%

Length

2023-12-12T18:45:53.072218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:45:53.196470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 20
74.1%
1 3
 
11.1%
3 2
 
7.4%
2 1
 
3.7%
9 1
 
3.7%

Correlations

2023-12-12T18:45:53.281836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분확진자사망자
구분1.0001.0001.000
확진자1.0001.0000.868
사망자1.0000.8681.000
2023-12-12T18:45:53.419860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
확진자사망자
확진자1.0000.555
사망자0.5551.000
2023-12-12T18:45:53.514812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
확진자사망자
확진자1.0000.555
사망자0.5551.000

Missing values

2023-12-12T18:45:52.371592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:45:52.455484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분확진자사망자
02020-01-3100
12020-02-2900
22020-03-3100
32020-04-3000
42020-05-3100
52020-06-3000
62020-07-3110
72020-08-3110
82020-09-3000
92020-10-3120
구분확진자사망자
172021-06-3030
182021-07-3120
192021-08-31150
202021-09-3090
212021-10-3160
222021-11-301211
232021-12-311149
242022-01-311011
252022-02-281,3033
262022-03-314,8973