Overview

Dataset statistics

Number of variables3
Number of observations228
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.4%
Total size in memory5.7 KiB
Average record size in memory25.6 B

Variable types

Categorical1
DateTime1
Numeric1

Dataset

Description경기도 포천시에서 제공하는 2020년 2월(포천시 발생시점) ~ 2021년 8월까지 코로나19(COVID-19) 확진자 현황입니다.
Author경기도 포천시
URLhttps://www.data.go.kr/data/15090310/fileData.do

Alerts

구분 has constant value ""Constant
Dataset has 1 (0.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 08:59:13.691764
Analysis finished2023-12-12 08:59:14.061266
Duration0.37 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
포천시
228 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row포천시
2nd row포천시
3rd row포천시
4th row포천시
5th row포천시

Common Values

ValueCountFrequency (%)
포천시 228
100.0%

Length

2023-12-12T17:59:14.137778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:59:14.245922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
포천시 228
100.0%
Distinct227
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
Minimum2020-02-22 00:00:00
Maximum2021-08-31 00:00:00
2023-12-12T17:59:14.380776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:59:14.575588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

확진자수(명)
Real number (ℝ)

Distinct15
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.3201754
Minimum1
Maximum30
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-12T17:59:14.751902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q34
95-th percentile10
Maximum30
Range29
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.6059344
Coefficient of variation (CV)1.0860674
Kurtosis18.136301
Mean3.3201754
Median Absolute Deviation (MAD)1
Skewness3.4995604
Sum757
Variance13.002763
MonotonicityNot monotonic
2023-12-12T17:59:14.941051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
1 83
36.4%
2 49
21.5%
3 27
 
11.8%
4 19
 
8.3%
5 15
 
6.6%
6 7
 
3.1%
7 7
 
3.1%
12 5
 
2.2%
8 4
 
1.8%
9 4
 
1.8%
Other values (5) 8
 
3.5%
ValueCountFrequency (%)
1 83
36.4%
2 49
21.5%
3 27
 
11.8%
4 19
 
8.3%
5 15
 
6.6%
6 7
 
3.1%
7 7
 
3.1%
8 4
 
1.8%
9 4
 
1.8%
10 4
 
1.8%
ValueCountFrequency (%)
30 1
 
0.4%
24 1
 
0.4%
18 1
 
0.4%
13 1
 
0.4%
12 5
2.2%
10 4
1.8%
9 4
1.8%
8 4
1.8%
7 7
3.1%
6 7
3.1%

Interactions

2023-12-12T17:59:13.757944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T17:59:13.927421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:59:14.022064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분격리(확진)일자확진자수(명)
0포천시2020-02-226
1포천시2020-04-024
2포천시2020-04-051
3포천시2020-04-131
4포천시2020-04-141
5포천시2020-04-151
6포천시2020-04-161
7포천시2020-04-172
8포천시2020-04-221
9포천시2020-06-071
구분격리(확진)일자확진자수(명)
218포천시2021-08-212
219포천시2021-08-226
220포천시2021-08-233
221포천시2021-08-245
222포천시2021-08-254
223포천시2021-08-2612
224포천시2021-08-283
225포천시2021-08-291
226포천시2021-08-302
227포천시2021-08-315

Duplicate rows

Most frequently occurring

구분격리(확진)일자확진자수(명)# duplicates
0포천시2021-03-1812