Overview

Dataset statistics

Number of variables3
Number of observations894
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory22.0 KiB
Average record size in memory25.1 B

Variable types

Categorical1
DateTime1
Numeric1

Dataset

Description2020년 2월 부터 2022년 7월까지 대구광역시 북구 관내 코로나19 확진자 현황(날짜, 확진자수) 정보를 제공합니다.
Author대구광역시 북구
URLhttps://www.data.go.kr/data/15080665/fileData.do

Alerts

구분 has constant value ""Constant
날짜 has unique valuesUnique
확진자 수(명) has 252 (28.2%) zerosZeros

Reproduction

Analysis started2023-12-12 14:16:35.516146
Analysis finished2023-12-12 14:16:35.840484
Duration0.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
대구광역시 북구
894 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구광역시 북구
2nd row대구광역시 북구
3rd row대구광역시 북구
4th row대구광역시 북구
5th row대구광역시 북구

Common Values

ValueCountFrequency (%)
대구광역시 북구 894
100.0%

Length

2023-12-12T23:16:35.906887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:16:36.036786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구광역시 894
50.0%
북구 894
50.0%

날짜
Date

UNIQUE 

Distinct894
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
Minimum2020-02-19 00:00:00
Maximum2022-07-31 00:00:00
2023-12-12T23:16:36.163635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:16:36.331981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

확진자 수(명)
Real number (ℝ)

ZEROS 

Distinct208
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean146.76398
Minimum0
Maximum4069
Zeros252
Zeros (%)28.2%
Negative0
Negative (%)0.0%
Memory size8.0 KiB
2023-12-12T23:16:36.511250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median4
Q324
95-th percentile843.7
Maximum4069
Range4069
Interquartile range (IQR)24

Descriptive statistics

Standard deviation463.29876
Coefficient of variation (CV)3.1567606
Kurtosis27.780549
Mean146.76398
Median Absolute Deviation (MAD)4
Skewness4.889685
Sum131207
Variance214645.74
MonotonicityNot monotonic
2023-12-12T23:16:36.724108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 252
28.2%
1 95
 
10.6%
2 52
 
5.8%
4 38
 
4.3%
5 25
 
2.8%
3 22
 
2.5%
8 21
 
2.3%
7 19
 
2.1%
6 18
 
2.0%
13 17
 
1.9%
Other values (198) 335
37.5%
ValueCountFrequency (%)
0 252
28.2%
1 95
 
10.6%
2 52
 
5.8%
3 22
 
2.5%
4 38
 
4.3%
5 25
 
2.8%
6 18
 
2.0%
7 19
 
2.1%
8 21
 
2.3%
9 15
 
1.7%
ValueCountFrequency (%)
4069 1
0.1%
3958 1
0.1%
3827 1
0.1%
3434 1
0.1%
3220 1
0.1%
2825 1
0.1%
2617 1
0.1%
2606 1
0.1%
2587 1
0.1%
2583 1
0.1%

Interactions

2023-12-12T23:16:35.574270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T23:16:35.714467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:16:35.803089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분날짜확진자 수(명)
0대구광역시 북구2020-02-191
1대구광역시 북구2020-02-202
2대구광역시 북구2020-02-217
3대구광역시 북구2020-02-2210
4대구광역시 북구2020-02-2327
5대구광역시 북구2020-02-2441
6대구광역시 북구2020-02-2520
7대구광역시 북구2020-02-2622
8대구광역시 북구2020-02-2718
9대구광역시 북구2020-02-2851
구분날짜확진자 수(명)
884대구광역시 북구2022-07-22387
885대구광역시 북구2022-07-23387
886대구광역시 북구2022-07-24135
887대구광역시 북구2022-07-25777
888대구광역시 북구2022-07-26593
889대구광역시 북구2022-07-27587
890대구광역시 북구2022-07-28589
891대구광역시 북구2022-07-29589
892대구광역시 북구2022-07-30511
893대구광역시 북구2022-07-31158