Overview

Dataset statistics

Number of variables6
Number of observations41
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory56.2 B

Variable types

Numeric3
Categorical3

Dataset

Description충청남도 청양군의 코로나19 확진자 및 사망자 데이터입니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=109&beforeMenuCd=DOM_000000201001001000&publicdatapk=15098718

Alerts

시군명 has constant value ""Constant
연번 is highly overall correlated with 확진자수 and 1 other fieldsHigh correlation
확진자수 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
기준연도 is highly overall correlated with 연번High correlation
사망자수 is highly overall correlated with 확진자수High correlation
사망자수 is highly imbalanced (60.7%)Imbalance
연번 has unique valuesUnique
확진자수 has 6 (14.6%) zerosZeros

Reproduction

Analysis started2024-01-09 19:48:14.100582
Analysis finished2024-01-09 19:48:15.043352
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21
Minimum1
Maximum41
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size501.0 B
2024-01-10T04:48:15.106518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q111
median21
Q331
95-th percentile39
Maximum41
Range40
Interquartile range (IQR)20

Descriptive statistics

Standard deviation11.979149
Coefficient of variation (CV)0.57043565
Kurtosis-1.2
Mean21
Median Absolute Deviation (MAD)10
Skewness0
Sum861
Variance143.5
MonotonicityStrictly increasing
2024-01-10T04:48:15.225607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
1 1
 
2.4%
32 1
 
2.4%
24 1
 
2.4%
25 1
 
2.4%
26 1
 
2.4%
27 1
 
2.4%
28 1
 
2.4%
29 1
 
2.4%
30 1
 
2.4%
31 1
 
2.4%
Other values (31) 31
75.6%
ValueCountFrequency (%)
1 1
2.4%
2 1
2.4%
3 1
2.4%
4 1
2.4%
5 1
2.4%
6 1
2.4%
7 1
2.4%
8 1
2.4%
9 1
2.4%
10 1
2.4%
ValueCountFrequency (%)
41 1
2.4%
40 1
2.4%
39 1
2.4%
38 1
2.4%
37 1
2.4%
36 1
2.4%
35 1
2.4%
34 1
2.4%
33 1
2.4%
32 1
2.4%

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size460.0 B
충청남도 청양군
41 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청남도 청양군
2nd row충청남도 청양군
3rd row충청남도 청양군
4th row충청남도 청양군
5th row충청남도 청양군

Common Values

ValueCountFrequency (%)
충청남도 청양군 41
100.0%

Length

2024-01-10T04:48:15.347582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T04:48:15.436863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청남도 41
50.0%
청양군 41
50.0%

기준연도
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size460.0 B
2021
12 
2022
12 
2023
12 
2020

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2021 12
29.3%
2022 12
29.3%
2023 12
29.3%
2020 5
12.2%

Length

2024-01-10T04:48:15.511998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T04:48:15.589891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 12
29.3%
2022 12
29.3%
2023 12
29.3%
2020 5
12.2%

기준월
Real number (ℝ)

Distinct12
Distinct (%)29.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.9268293
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size501.0 B
2024-01-10T04:48:15.669003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median7
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.5099163
Coefficient of variation (CV)0.50671326
Kurtosis-1.2037235
Mean6.9268293
Median Absolute Deviation (MAD)3
Skewness-0.19134676
Sum284
Variance12.319512
MonotonicityNot monotonic
2024-01-10T04:48:15.746792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
8 4
9.8%
9 4
9.8%
10 4
9.8%
11 4
9.8%
12 4
9.8%
1 3
7.3%
2 3
7.3%
3 3
7.3%
4 3
7.3%
5 3
7.3%
Other values (2) 6
14.6%
ValueCountFrequency (%)
1 3
7.3%
2 3
7.3%
3 3
7.3%
4 3
7.3%
5 3
7.3%
6 3
7.3%
7 3
7.3%
8 4
9.8%
9 4
9.8%
10 4
9.8%
ValueCountFrequency (%)
12 4
9.8%
11 4
9.8%
10 4
9.8%
9 4
9.8%
8 4
9.8%
7 3
7.3%
6 3
7.3%
5 3
7.3%
4 3
7.3%
3 3
7.3%

확진자수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct32
Distinct (%)78.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean353.14634
Minimum0
Maximum3820
Zeros6
Zeros (%)14.6%
Negative0
Negative (%)0.0%
Memory size501.0 B
2024-01-10T04:48:15.834438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q14
median49
Q3418
95-th percentile1867
Maximum3820
Range3820
Interquartile range (IQR)414

Descriptive statistics

Standard deviation723.42603
Coefficient of variation (CV)2.0485163
Kurtosis13.634614
Mean353.14634
Median Absolute Deviation (MAD)49
Skewness3.4477545
Sum14479
Variance523345.23
MonotonicityNot monotonic
2024-01-10T04:48:15.938346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
0 6
 
14.6%
1 2
 
4.9%
11 2
 
4.9%
4 2
 
4.9%
10 2
 
4.9%
96 1
 
2.4%
592 1
 
2.4%
765 1
 
2.4%
530 1
 
2.4%
147 1
 
2.4%
Other values (22) 22
53.7%
ValueCountFrequency (%)
0 6
14.6%
1 2
 
4.9%
3 1
 
2.4%
4 2
 
4.9%
5 1
 
2.4%
10 2
 
4.9%
11 2
 
4.9%
13 1
 
2.4%
23 1
 
2.4%
24 1
 
2.4%
ValueCountFrequency (%)
3820 1
2.4%
2082 1
2.4%
1867 1
2.4%
854 1
2.4%
772 1
2.4%
765 1
2.4%
624 1
2.4%
592 1
2.4%
530 1
2.4%
447 1
2.4%

사망자수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Memory size460.0 B
0
36 
1
5
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)2.4%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 36
87.8%
1 4
 
9.8%
5 1
 
2.4%

Length

2024-01-10T04:48:16.035675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T04:48:16.111168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 36
87.8%
1 4
 
9.8%
5 1
 
2.4%

Interactions

2024-01-10T04:48:14.684133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:14.254676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:14.477843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:14.758218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:14.332895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:14.552085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:14.828102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:14.399727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:14.611767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T04:48:16.164307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번기준연도기준월확진자수사망자수
연번1.0000.9600.0000.3840.000
기준연도0.9601.0000.0000.3870.000
기준월0.0000.0001.0000.0000.324
확진자수0.3840.3870.0001.0000.961
사망자수0.0000.0000.3240.9611.000
2024-01-10T04:48:16.239403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연도사망자수
기준연도1.0000.000
사망자수0.0001.000
2024-01-10T04:48:16.555998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번기준월확진자수기준연도사망자수
연번1.0000.0720.5140.9150.211
기준월0.0721.000-0.1710.0000.170
확진자수0.514-0.1711.0000.2450.729
기준연도0.9150.0000.2451.0000.000
사망자수0.2110.1700.7290.0001.000

Missing values

2024-01-10T04:48:14.926659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T04:48:15.009342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시군명기준연도기준월확진자수사망자수
01충청남도 청양군2020810
12충청남도 청양군20209230
23충청남도 청양군20201000
34충청남도 청양군20201100
45충청남도 청양군202012260
56충청남도 청양군2021131
67충청남도 청양군20212240
78충청남도 청양군2021300
89충청남도 청양군2021410
910충청남도 청양군2021500
연번시군명기준연도기준월확진자수사망자수
3132충청남도 청양군202331560
3233충청남도 청양군20234960
3334충청남도 청양군202351800
3435충청남도 청양군202361770
3536충청남도 청양군202373391
3637충청남도 청양군202387720
3738충청남도 청양군20239110
3839충청남도 청양군202310110
3940충청남도 청양군202311100
4041충청남도 청양군20231250