Overview

Dataset statistics

Number of variables: 6
Number of observations: 35
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.8 KiB
Average record size in memory: 53.8 B

Variable types

DateTime: 1
Numeric: 2
Categorical: 3

Dataset

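Description: Data on confirmed COVID-19 cases in Dalseo-gu, Daegu Metropolitan City, including monthly and cumulative confirmed case counts. (Columns: 연월 [year-month], 월별 확진자수 [monthly confirmed cases], 누적 확진자수 [cumulative confirmed cases], 시군구 [district])
URL: https://www.data.go.kr/data/15099531/fileData.do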

Alerts

시군구 has constant value "달서구" (Constant)
관리부서 has constant value "보건행정과" (Constant)
기준일자 has constant value "2023-03-24" (Constant)
월별 확진자수 is highly overall correlated with 누적 확진자수 (High correlation)
누적 확진자수 is highly overall correlated with 월별 확진자수 (High correlation)
연월 has unique values (Unique)
누적 확진자수 has unique values (Unique)
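
The Constant and Unique alerts are consistent with the distinct counts reported for each variable. The following is a minimal sketch of that rule, assuming a column is flagged Constant when it has exactly one distinct value and Unique when every observation is distinct. The function name `classify` is hypothetical, and this is a simplification, not ydata-profiling's own implementation.

```python
# Distinct counts copied from the Variables section of this report.
N_OBSERVATIONS = 35
distinct_counts = {
    "연월": 35,
    "월별 확진자수": 34,
    "누적 확진자수": 35,
    "시군구": 1,
    "관리부서": 1,
    "기준일자": 1,
}


def classify(distinct: int, n_rows: int) -> str:
    """Return a simplified alert label based only on the distinct count."""
    if distinct == 1:
        return "Constant"
    if distinct == n_rows:
        return "Unique"
    return "none"


alerts = {name: classify(d, N_OBSERVATIONS) for name, d in distinct_counts.items()}

for name, label in alerts.items():
    pct = 100 * distinct_counts[name] / N_OBSERVATIONS
    print(f"{name}: distinct={distinct_counts[name]} ({pct:.1f}%), alert={label}")
```

Under these assumptions the three categorical columns come out as Constant, and 연월 and 누적 확진자수 come out as Unique, which matches the alerts listed above.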

Reproduction

Analysis started: 2023-12-12 15:56:34.596406
Analysis finished: 2023-12-12 15:56:35.822886
Duration: 1.23 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연월
Date

UNIQUE 

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B
Minimum: 2020-02-29 00:00:00
Maximum: 2022-12-31 00:00:00
Histogram with fixed size bins (bins=35)
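
The minimum, maximum, and distinct count of 연월 are consistent with one month-end date per month. The sketch below checks that reading, assuming every month from February 2020 through December 2022 is present (the report only shows the first and last ten rows, so the months in between are an assumption). The helper name `month_ends` is hypothetical.

```python
import calendar
from datetime import date


def month_ends(start_year, start_month, end_year, end_month):
    """Return the last calendar day of every month in the inclusive range."""
    out = []
    year, month = start_year, start_month
    while (year, month) <= (end_year, end_month):
        last_day = calendar.monthrange(year, month)[1]  # number of days in the month
        out.append(date(year, month, last_day))
        year, month = (year + 1, 1) if month == 12 else (year, month + 1)
    return out


dates = month_ends(2020, 2, 2022, 12)
print(len(dates), dates[0], dates[-1])  # 35 2020-02-29 2022-12-31
```

The count of 35 matches the number of observations, and the first and last dates match the reported minimum and maximum.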

월별 확진자수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 34
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8276.9143
Minimum: 6
Maximum: 89592
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 6
5-th percentile: 9.4
Q1: 50.5
median: 341
Q3: 9883
95-th percentile: 40314.8
Maximum: 89592
Range: 89586
Interquartile range (IQR): 9832.5
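
The quantiles above can be reproduced from the ten smallest and ten largest values listed for this variable, assuming linear interpolation between order statistics (the default in NumPy and pandas; the report does not state the method, so this is an assumption). The names `known` and `quantile` are hypothetical, and only quantiles that fall inside the known ranks can be evaluated.

```python
# Known order statistics of 월별 확진자수, keyed by 0-based rank in ascending order.
# Ranks 0-10 come from the smallest values (10 occurs twice); ranks 25-34 from the largest.
N = 35
known = {
    0: 6, 1: 8, 2: 10, 3: 10, 4: 12, 5: 13, 6: 17, 7: 22, 8: 36, 9: 65, 10: 89,
    25: 8346, 26: 11420, 27: 12056, 28: 15439, 29: 18897,
    30: 18944, 31: 21993, 32: 39062, 33: 43238, 34: 89592,
}


def quantile(q: float) -> float:
    """Linearly interpolate the q-quantile between the two neighbouring ranks."""
    pos = q * (N - 1)          # fractional rank of the requested quantile
    lo = int(pos)              # rank just below (or at) the position
    hi = min(lo + 1, N - 1)    # rank just above
    frac = pos - lo
    return known[lo] + frac * (known[hi] - known[lo])


for label, q in [("5-th percentile", 0.05), ("Q1", 0.25), ("Q3", 0.75), ("95-th percentile", 0.95)]:
    print(label, round(quantile(q), 1))
```

For example, Q3 sits at rank 0.75 × 34 = 25.5, halfway between 8346 and 11420, which gives 9883.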

Descriptive statistics

Standard deviation: 17844.443
Coefficient of variation (CV): 2.1559294
Kurtosis: 12.732498
Mean: 8276.9143
Median Absolute Deviation (MAD): 331
Skewness: 3.3083834
Sum: 289692
Variance: 3.1842415 × 10^8
Monotonicity: Not monotonic
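
Several of these figures are tied together by simple identities (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean). The sketch below checks them against the values printed in this report; the dictionary name `checks` and the tolerance are illustrative choices, not part of the report.

```python
import math

# Figures copied from the 월별 확진자수 statistics in this report.
n = 35
total = 289692
mean = 8276.9143
std = 17844.443
variance = 3.1842415e8
cv = 2.1559294
minimum, maximum = 6, 89592
q1, q3 = 50.5, 9883

# Each entry is True when the reported figures agree with the identity,
# allowing for rounding in the printed values.
checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance = std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "cv = std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "range = max - min": maximum - minimum == 89586,
    "iqr = q3 - q1": q3 - q1 == 9832.5,
}

for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'inconsistent'}")
```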
Histogram with fixed size bins (bins=34)

Common Values

Value              Count  Frequency (%)
10                 2      5.7%
459                1      2.9%
11420              1      2.9%
285                1      2.9%
931                1      2.9%
2217               1      2.9%
21993              1      2.9%
89592              1      2.9%
43238              1      2.9%
3244               1      2.9%
Other values (24)  24     68.6%

Minimum 10 values

Value  Count  Frequency (%)
6      1      2.9%
8      1      2.9%
10     2      5.7%
12     1      2.9%
13     1      2.9%
17     1      2.9%
22     1      2.9%
36     1      2.9%
65     1      2.9%
89     1      2.9%

Maximum 10 values

Value  Count  Frequency (%)
89592  1      2.9%
43238  1      2.9%
39062  1      2.9%
21993  1      2.9%
18944  1      2.9%
18897  1      2.9%
15439  1      2.9%
12056  1      2.9%
11420  1      2.9%
8346   1      2.9%

누적 확진자수
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 62906.686
Minimum: 459
Maximum: 289692
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 459
5-th percentile: 1606.4
Q1: 1684
median: 2708
Q3: 140665
95-th percentile: 259940.7
Maximum: 289692
Range: 289233
Interquartile range (IQR): 138981

Descriptive statistics

Standard deviation: 99365.023
Coefficient of variation (CV): 1.5795622
Kurtosis: -0.12517418
Mean: 62906.686
Median Absolute Deviation (MAD): 1095
Skewness: 1.245793
Sum: 2201734
Variance: 9.8734078 × 10^9
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=35)

Common Values

Value              Count  Frequency (%)
459                1      2.9%
1591               1      2.9%
4028               1      2.9%
4313               1      2.9%
5244               1      2.9%
7461               1      2.9%
29454              1      2.9%
119046             1      2.9%
162284             1      2.9%
173704             1      2.9%
Other values (25)  25     71.4%

Minimum 10 values

Value  Count  Frequency (%)
459    1      2.9%
1591   1      2.9%
1613   1      2.9%
1626   1      2.9%
1634   1      2.9%
1644   1      2.9%
1661   1      2.9%
1673   1      2.9%
1679   1      2.9%
1689   1      2.9%

Maximum 10 values

Value   Count  Frequency (%)
289692  1      2.9%
270748  1      2.9%
255309  1      2.9%
246963  1      2.9%
228066  1      2.9%
189004  1      2.9%
176948  1      2.9%
173704  1      2.9%
162284  1      2.9%
119046  1      2.9%

시군구
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 달서구
2nd row: 달서구
3rd row: 달서구
4th row: 달서구
5th row: 달서구

Common Values

Value   Count  Frequency (%)
달서구  35     100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot; 달서구 accounts for all 35 rows (100.0%)]

관리부서
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 보건행정과
2nd row: 보건행정과
3rd row: 보건행정과
4th row: 보건행정과
5th row: 보건행정과

Common Values

Value       Count  Frequency (%)
보건행정과  35     100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot; 보건행정과 accounts for all 35 rows (100.0%)]

기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-03-24
2nd row: 2023-03-24
3rd row: 2023-03-24
4th row: 2023-03-24
5th row: 2023-03-24

Common Values

Value       Count  Frequency (%)
2023-03-24  35     100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot; 2023-03-24 accounts for all 35 rows (100.0%)]

Interactions

[Figures: 4 interaction plots]

Correlations

               연월   월별 확진자수  누적 확진자수
연월           1.000  1.000          1.000
월별 확진자수  1.000  1.000          0.887
누적 확진자수  1.000  0.887          1.000

               월별 확진자수  누적 확진자수
월별 확진자수  1.000          0.809
누적 확진자수  0.809          1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    연월        월별 확진자수  누적 확진자수  시군구  관리부서    기준일자
0   2020-02-29  459            459            달서구  보건행정과  2023-03-24
1   2020-03-31  1132           1591           달서구  보건행정과  2023-03-24
2   2020-04-30  22             1613           달서구  보건행정과  2023-03-24
3   2020-05-31  13             1626           달서구  보건행정과  2023-03-24
4   2020-06-30  8              1634           달서구  보건행정과  2023-03-24
5   2020-07-31  10             1644           달서구  보건행정과  2023-03-24
6   2020-08-31  17             1661           달서구  보건행정과  2023-03-24
7   2020-09-30  12             1673           달서구  보건행정과  2023-03-24
8   2020-10-31  6              1679           달서구  보건행정과  2023-03-24
9   2020-11-30  10             1689           달서구  보건행정과  2023-03-24

Last rows

    연월        월별 확진자수  누적 확진자수  시군구  관리부서    기준일자
25  2022-03-31  89592          119046         달서구  보건행정과  2023-03-24
26  2022-04-30  43238          162284         달서구  보건행정과  2023-03-24
27  2022-05-31  11420          173704         달서구  보건행정과  2023-03-24
28  2022-06-30  3244           176948         달서구  보건행정과  2023-03-24
29  2022-07-31  12056          189004         달서구  보건행정과  2023-03-24
30  2022-08-31  39062          228066         달서구  보건행정과  2023-03-24
31  2022-09-30  18897          246963         달서구  보건행정과  2023-03-24
32  2022-10-31  8346           255309         달서구  보건행정과  2023-03-24
33  2022-11-30  15439          270748         달서구  보건행정과  2023-03-24
34  2022-12-31  18944          289692         달서구  보건행정과  2023-03-24
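
The sample rows suggest that 누적 확진자수 is the running total of 월별 확진자수, which would explain both the high correlation between the two columns and the "Strictly increasing" monotonicity. The sketch below checks this on the twenty rows shown above; the variable names are illustrative, and rows 10 to 24 are not in the sample, so they are not covered.

```python
from itertools import accumulate

# Values copied from the first ten and last ten sample rows.
first_monthly = [459, 1132, 22, 13, 8, 10, 17, 12, 6, 10]
first_cumulative = [459, 1591, 1613, 1626, 1634, 1644, 1661, 1673, 1679, 1689]
last_monthly = [89592, 43238, 11420, 3244, 12056, 39062, 18897, 8346, 15439, 18944]
last_cumulative = [119046, 162284, 173704, 176948, 189004,
                   228066, 246963, 255309, 270748, 289692]

# Head of the series: the running sum starts from zero.
first_ok = list(accumulate(first_monthly)) == first_cumulative

# Tail of the series: seed the running sum with the total just before row 25.
seed = last_cumulative[0] - last_monthly[0]
last_ok = list(accumulate(last_monthly, initial=seed))[1:] == last_cumulative

# Positive monthly counts imply a strictly increasing cumulative column.
combined = first_cumulative + last_cumulative
strictly_increasing = all(b > a for a, b in zip(combined, combined[1:]))

print(first_ok, last_ok, strictly_increasing)
```

The final cumulative value, 289692, also equals the reported sum of 월별 확진자수.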