Overview

Dataset statistics

Number of variables: 4
Number of observations: 31
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.2 KiB
Average record size in memory: 38.3 B

Variable types

DateTime: 2
Numeric: 1
Categorical: 1

Dataset

Description: Monthly counts of confirmed COVID-19 cases and deaths in Jeungpyeong-gun, Chungcheongbuk-do. (July figures are as of July 31.)
Author: Jeungpyeong-gun, Chungcheongbuk-do (충청북도 증평군)
URL: https://www.data.go.kr/data/15098848/fileData.do

Alerts

데이터 기준일 (data reference date) has constant value "2022-07-31" [Constant]
확진자 (confirmed cases) is highly overall correlated with 사망자 (deaths) [High correlation]
사망자 is highly overall correlated with 확진자 [High correlation]
사망자 is highly imbalanced (69.4%) [Imbalance]
날짜 (date) has unique values [Unique]
확진자 has 10 (32.3%) zeros [Zeros]
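
The percentages quoted in the alerts and in the per-variable tables are ratios over the 31 observations. A minimal sketch of that arithmetic, with the counts copied from this report; the helper name `pct` is hypothetical and not part of ydata-profiling:

```python
# Number of observations, as listed under "Dataset statistics".
n_obs = 31

def pct(count: int, total: int = n_obs) -> str:
    """Format count/total as a percentage with one decimal place."""
    return f"{100 * count / total:.1f}%"

zeros_pct = pct(10)      # rows where 확진자 is 0
distinct_pct = pct(19)   # distinct values of 확진자
na_pct = pct(28)         # rows where 사망자 is "<NA>"

print(zeros_pct, distinct_pct, na_pct)
```

The same helper applies to any count in the frequency tables, for example a value that occurs once out of 31 rows.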

Reproduction

Analysis started: 2023-12-12 04:41:08.855402
Analysis finished: 2023-12-12 04:41:09.281479
Duration: 0.43 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
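
The reported duration can be checked from the two timestamps. A small sketch using only the Python standard library; the format string is an assumption that matches how the timestamps are printed in this report:

```python
from datetime import datetime

# Timestamp layout as printed in the report (assumed format string).
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 04:41:08.855402", FMT)
finished = datetime.strptime("2023-12-12 04:41:09.281479", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```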

Variables

날짜 (date)
Date

UNIQUE 

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B
Minimum: 2020-01-01 00:00:00
Maximum: 2022-07-01 00:00:00
Histogram with fixed size bins (bins=31)
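
The 31 distinct dates are consistent with one row per calendar month between the minimum and maximum. A short check, assuming the series is strictly monthly as the dataset description suggests; `months_inclusive` is a hypothetical helper:

```python
from datetime import date

def months_inclusive(start: date, end: date) -> int:
    """Count calendar months from start to end, including both endpoints."""
    return (end.year - start.year) * 12 + (end.month - start.month) + 1

# Minimum and maximum of the 날짜 column, as reported above.
n_months = months_inclusive(date(2020, 1, 1), date(2022, 7, 1))
print(n_months)
```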

확진자 (confirmed cases)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 19
Distinct (%): 61.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 558.64516
Minimum: 0
Maximum: 9572
Zeros: 10
Zeros (%): 32.3%
Negative: 0
Negative (%): 0.0%
Memory size: 411.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 9
Q3: 42.5
95-th percentile: 2508
Maximum: 9572
Range: 9572
Interquartile range (IQR): 42.5
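
The last two rows are derived from the others: the range is maximum minus minimum, and the IQR is Q3 minus Q1. A minimal sketch using the values listed above (variable names are illustrative only):

```python
# Quantile values copied from the report.
minimum, q1, q3, maximum = 0, 0, 42.5, 9572

value_range = maximum - minimum   # spread of the full data
iqr = q3 - q1                     # spread of the middle 50% of the data

print(value_range, iqr)
```

The gap between the IQR and the range reflects how a few large monthly counts dominate the spread.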

Descriptive statistics

Standard deviation: 1818.4262
Coefficient of variation (CV): 3.2550647
Kurtosis: 21.579552
Mean: 558.64516
Median Absolute Deviation (MAD): 9
Skewness: 4.4854874
Sum: 17318
Variance: 3306673.7
Monotonicity: Not monotonic
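
Several of these figures can be cross-checked against each other: the mean is the sum divided by the number of observations, the variance is the square of the standard deviation, and the CV is the standard deviation divided by the mean. A small consistency check with the inputs copied from this report (rounded as printed, so results agree only up to rounding):

```python
# Inputs as printed in the report.
n_obs = 31
total = 17318
std = 1818.4262

mean = total / n_obs      # reported as 558.64516
variance = std ** 2       # reported as 3306673.7
cv = std / mean           # reported as 3.2550647

print(round(mean, 5), round(variance, 1), round(cv, 5))
```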
Histogram with fixed size bins (bins=19)
Common values

Value             Count  Frequency (%)
0                    10  32.3%
1                     3  9.7%
16                    2  6.5%
39                    1  3.2%
1431                  1  3.2%
288                   1  3.2%
802                   1  3.2%
3585                  1  3.2%
9572                  1  3.2%
1264                  1  3.2%
Other values (9)      9  29.0%

Minimum 10 values

Value  Count  Frequency (%)
0         10  32.3%
1          3  9.7%
2          1  3.2%
5          1  3.2%
9          1  3.2%
13         1  3.2%
16         2  6.5%
19         1  3.2%
29         1  3.2%
34         1  3.2%

Maximum 10 values

Value  Count  Frequency (%)
9572       1  3.2%
3585       1  3.2%
1431       1  3.2%
1264       1  3.2%
802        1  3.2%
288        1  3.2%
145        1  3.2%
46         1  3.2%
39         1  3.2%
34         1  3.2%

사망자 (deaths)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): 12.9%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B
Value  Count
<NA>      28
2          1
6          1
1          1
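
The 69.4% imbalance reported for this variable is consistent with an entropy-based score, one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy for that number of classes. The sketch below is an assumption about how such a score can be computed, not a confirmed description of the tool's internals; `imbalance_score` is a hypothetical name:

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k) for class counts: 0 when the classes are
    uniform, approaching 1 when a single class dominates."""
    n_classes = len(counts)
    if n_classes <= 1:
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)  # Shannon entropy in bits
    return 1 - entropy / math.log2(n_classes)

# Class counts for 사망자: "<NA>" 28 times, and "2", "6", "1" once each.
score = imbalance_score([28, 1, 1, 1])
print(f"{score:.1%}")
```

With four equally frequent classes the same function returns 0, which is why a column dominated by one value scores high.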

Length

Max length: 4
Median length: 4
Mean length: 3.7096774
Min length: 1
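
These length statistics follow from the value counts if the 28 empty entries are stored as the four-character string "<NA>". That storage detail is an assumption, though it agrees with the maximum length of 4 and with the missing count of 0. A minimal sketch:

```python
from statistics import mean, median

# Reconstructed column: 28 literal "<NA>" strings plus three one-digit values.
values = ["<NA>"] * 28 + ["2", "6", "1"]
lengths = [len(v) for v in values]

max_len = max(lengths)
median_len = median(lengths)
mean_len = mean(lengths)   # (28 * 4 + 3 * 1) / 31
min_len = min(lengths)

print(max_len, median_len, round(mean_len, 7), min_len)
```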

Unique

Unique: 3
Unique (%): 9.7%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value  Count  Frequency (%)
<NA>      28  90.3%
2          1  3.2%
6          1  3.2%
1          1  3.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
na        28  90.3%
2          1  3.2%
6          1  3.2%
1          1  3.2%

데이터 기준일 (data reference date)
Date

CONSTANT 

Distinct: 1
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B
Minimum: 2022-07-31 00:00:00
Maximum: 2022-07-31 00:00:00
Histogram with fixed size bins (bins=1)

Interactions

[Figure: interaction plot]

Correlations

          날짜     확진자   사망자
날짜      1.000   1.000   1.000
확진자    1.000   1.000   1.000
사망자    1.000   1.000   1.000

          확진자   사망자
확진자    1.000   1.000
사망자    1.000   1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    날짜     확진자  사망자  데이터 기준일
0   2020-01       0  <NA>    2022-07-31
1   2020-02       1  <NA>    2022-07-31
2   2020-03       1  <NA>    2022-07-31
3   2020-04       0  <NA>    2022-07-31
4   2020-05       0  <NA>    2022-07-31
5   2020-06       0  <NA>    2022-07-31
6   2020-07       0  <NA>    2022-07-31
7   2020-08       0  <NA>    2022-07-31
8   2020-09       1  <NA>    2022-07-31
9   2020-10       0  <NA>    2022-07-31

Last rows

    날짜     확진자  사망자  데이터 기준일
21  2021-10      16  <NA>    2022-07-31
22  2021-11      29  <NA>    2022-07-31
23  2021-12      39  <NA>    2022-07-31
24  2022-01     145  <NA>    2022-07-31
25  2022-02    1264  <NA>    2022-07-31
26  2022-03    9572  6       2022-07-31
27  2022-04    3585  1       2022-07-31
28  2022-05     802  <NA>    2022-07-31
29  2022-06     288  <NA>    2022-07-31
30  2022-07    1431  <NA>    2022-07-31