Overview

Dataset statistics

Number of variables6
Number of observations29
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory57.6 B

Variable types

Numeric3
Categorical3

Dataset

Description충청남도 청양군의 코로나19 확진자 및 사망자 데이터입니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=109&beforeMenuCd=DOM_000000201001001000&publicdatapk=15098718

Alerts

시군명 has constant value ""Constant
연번 is highly overall correlated with 확진자수 and 1 other fieldsHigh correlation
확진자수 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
기준연도 is highly overall correlated with 연번High correlation
사망자수 is highly overall correlated with 확진자수High correlation
사망자수 is highly imbalanced (56.4%)Imbalance
연번 has unique valuesUnique
확진자수 has 6 (20.7%) zerosZeros

Reproduction

Analysis started2024-01-09 19:48:11.339036
Analysis finished2024-01-09 19:48:12.450890
Duration1.11 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15
Minimum1
Maximum29
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2024-01-10T04:48:12.496049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.4
Q18
median15
Q322
95-th percentile27.6
Maximum29
Range28
Interquartile range (IQR)14

Descriptive statistics

Standard deviation8.5146932
Coefficient of variation (CV)0.56764621
Kurtosis-1.2
Mean15
Median Absolute Deviation (MAD)7
Skewness0
Sum435
Variance72.5
MonotonicityStrictly increasing
2024-01-10T04:48:12.620853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
1 1
 
3.4%
2 1
 
3.4%
29 1
 
3.4%
28 1
 
3.4%
27 1
 
3.4%
26 1
 
3.4%
25 1
 
3.4%
24 1
 
3.4%
23 1
 
3.4%
22 1
 
3.4%
Other values (19) 19
65.5%
ValueCountFrequency (%)
1 1
3.4%
2 1
3.4%
3 1
3.4%
4 1
3.4%
5 1
3.4%
6 1
3.4%
7 1
3.4%
8 1
3.4%
9 1
3.4%
10 1
3.4%
ValueCountFrequency (%)
29 1
3.4%
28 1
3.4%
27 1
3.4%
26 1
3.4%
25 1
3.4%
24 1
3.4%
23 1
3.4%
22 1
3.4%
21 1
3.4%
20 1
3.4%

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size364.0 B
충청남도 청양군
29 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청남도 청양군
2nd row충청남도 청양군
3rd row충청남도 청양군
4th row충청남도 청양군
5th row충청남도 청양군

Common Values

ValueCountFrequency (%)
충청남도 청양군 29
100.0%

Length

2024-01-10T04:48:12.717324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T04:48:12.796048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청남도 29
50.0%
청양군 29
50.0%

기준연도
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)10.3%
Missing0
Missing (%)0.0%
Memory size364.0 B
2021
12 
2022
12 
2020

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2021 12
41.4%
2022 12
41.4%
2020 5
17.2%

Length

2024-01-10T04:48:12.893360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T04:48:12.972234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 12
41.4%
2022 12
41.4%
2020 5
17.2%

기준월
Real number (ℝ)

Distinct12
Distinct (%)41.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.1034483
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2024-01-10T04:48:13.049926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.4
Q14
median8
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.5187744
Coefficient of variation (CV)0.49536145
Kurtosis-1.1577136
Mean7.1034483
Median Absolute Deviation (MAD)3
Skewness-0.27701021
Sum206
Variance12.381773
MonotonicityNot monotonic
2024-01-10T04:48:13.136219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
8 3
10.3%
9 3
10.3%
10 3
10.3%
11 3
10.3%
12 3
10.3%
1 2
6.9%
2 2
6.9%
3 2
6.9%
4 2
6.9%
5 2
6.9%
Other values (2) 4
13.8%
ValueCountFrequency (%)
1 2
6.9%
2 2
6.9%
3 2
6.9%
4 2
6.9%
5 2
6.9%
6 2
6.9%
7 2
6.9%
8 3
10.3%
9 3
10.3%
10 3
10.3%
ValueCountFrequency (%)
12 3
10.3%
11 3
10.3%
10 3
10.3%
9 3
10.3%
8 3
10.3%
7 2
6.9%
6 2
6.9%
5 2
6.9%
4 2
6.9%
3 2
6.9%

확진자수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct22
Distinct (%)75.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean415.34483
Minimum0
Maximum3820
Zeros6
Zeros (%)20.7%
Negative0
Negative (%)0.0%
Memory size393.0 B
2024-01-10T04:48:13.240257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median24
Q3447
95-th percentile1996
Maximum3820
Range3820
Interquartile range (IQR)446

Descriptive statistics

Standard deviation843.74007
Coefficient of variation (CV)2.0314207
Kurtosis9.4942442
Mean415.34483
Median Absolute Deviation (MAD)24
Skewness2.9420085
Sum12045
Variance711897.31
MonotonicityNot monotonic
2024-01-10T04:48:13.344254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
0 6
20.7%
1 2
 
6.9%
4 2
 
6.9%
2082 1
 
3.4%
765 1
 
3.4%
592 1
 
3.4%
284 1
 
3.4%
624 1
 
3.4%
1867 1
 
3.4%
447 1
 
3.4%
Other values (12) 12
41.4%
ValueCountFrequency (%)
0 6
20.7%
1 2
 
6.9%
3 1
 
3.4%
4 2
 
6.9%
10 1
 
3.4%
13 1
 
3.4%
23 1
 
3.4%
24 1
 
3.4%
26 1
 
3.4%
49 1
 
3.4%
ValueCountFrequency (%)
3820 1
3.4%
2082 1
3.4%
1867 1
3.4%
854 1
3.4%
765 1
3.4%
624 1
3.4%
592 1
3.4%
447 1
3.4%
418 1
3.4%
284 1
3.4%

사망자수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)10.3%
Missing0
Missing (%)0.0%
Memory size364.0 B
0
25 
1
5
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)3.4%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 25
86.2%
1 3
 
10.3%
5 1
 
3.4%

Length

2024-01-10T04:48:13.446722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T04:48:13.524091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 25
86.2%
1 3
 
10.3%
5 1
 
3.4%

Interactions

2024-01-10T04:48:12.089060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:11.483272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:11.899553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:12.163475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:11.544691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:11.959269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:12.231575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:11.603636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:48:12.019876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T04:48:13.578904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번기준연도기준월확진자수사망자수
연번1.0000.9430.4590.4090.000
기준연도0.9431.0000.0000.7420.000
기준월0.4590.0001.0000.3040.473
확진자수0.4090.7420.3041.0000.969
사망자수0.0000.0000.4730.9691.000
2024-01-10T04:48:13.657207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연도사망자수
기준연도1.0000.000
사망자수0.0001.000
2024-01-10T04:48:13.731674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번기준월확진자수기준연도사망자수
연번1.0000.1000.7650.7930.000
기준월0.1001.000-0.0390.0000.256
확진자수0.765-0.0391.0000.3890.737
기준연도0.7930.0000.3891.0000.000
사망자수0.0000.2560.7370.0001.000

Missing values

2024-01-10T04:48:12.341550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T04:48:12.420426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시군명기준연도기준월확진자수사망자수
01충청남도 청양군2020810
12충청남도 청양군20209230
23충청남도 청양군20201000
34충청남도 청양군20201100
45충청남도 청양군202012260
56충청남도 청양군2021131
67충청남도 청양군20212240
78충청남도 청양군2021300
89충청남도 청양군2021410
910충청남도 청양군2021500
연번시군명기준연도기준월확진자수사망자수
1920충청남도 청양군2022338205
2021충청남도 청양군2022420820
2122충청남도 청양군202254181
2223충청남도 청양군20226600
2324충청남도 청양군202274470
2425충청남도 청양군2022818671
2526충청남도 청양군202296240
2627충청남도 청양군2022102840
2728충청남도 청양군2022115920
2829충청남도 청양군2022127650