Overview

Dataset statistics

Number of variables5
Number of observations34
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory47.8 B

Variable types

Categorical2
Numeric3

Dataset

Description보성군 코로나19 확진자 및 사망자 현황에 대한 데이터로 시군구명, 발생연도, 발생월, 확진인원, 사망인원에 등의 항목을 제공합니다.
Author전라남도 보성군
URLhttps://www.data.go.kr/data/15098859/fileData.do

Alerts

시군구명 has constant value ""Constant
확진인원 is highly overall correlated with 사망인원High correlation
사망인원 is highly overall correlated with 확진인원 High correlation
확진인원 has 2 (5.9%) zerosZeros
사망인원 has 20 (58.8%) zerosZeros

Reproduction

Analysis started2024-04-13 12:31:13.308650
Analysis finished2024-04-13 12:31:17.427482
Duration4.12 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size400.0 B
전라남도 보성군
34 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전라남도 보성군
2nd row전라남도 보성군
3rd row전라남도 보성군
4th row전라남도 보성군
5th row전라남도 보성군

Common Values

ValueCountFrequency (%)
전라남도 보성군 34
100.0%

Length

2024-04-13T21:31:17.645072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-13T21:31:17.962123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전라남도 34
50.0%
보성군 34
50.0%

발생연도
Categorical

Distinct4
Distinct (%)11.8%
Missing0
Missing (%)0.0%
Memory size400.0 B
2021
12 
2022
12 
2023
2020

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 12
35.3%
2022 12
35.3%
2023 8
23.5%
2020 2
 
5.9%

Length

2024-04-13T21:31:18.287060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-13T21:31:18.611983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 12
35.3%
2022 12
35.3%
2023 8
23.5%
2020 2
 
5.9%

발생월
Real number (ℝ)

Distinct12
Distinct (%)35.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.3235294
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size434.0 B
2024-04-13T21:31:18.938603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13.25
median6
Q39
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)5.75

Descriptive statistics

Standard deviation3.5224009
Coefficient of variation (CV)0.55703085
Kurtosis-1.1769739
Mean6.3235294
Median Absolute Deviation (MAD)3
Skewness0.11673213
Sum215
Variance12.407308
MonotonicityNot monotonic
2024-04-13T21:31:19.303044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
11 3
8.8%
12 3
8.8%
1 3
8.8%
2 3
8.8%
3 3
8.8%
4 3
8.8%
5 3
8.8%
6 3
8.8%
7 3
8.8%
8 3
8.8%
Other values (2) 4
11.8%
ValueCountFrequency (%)
1 3
8.8%
2 3
8.8%
3 3
8.8%
4 3
8.8%
5 3
8.8%
6 3
8.8%
7 3
8.8%
8 3
8.8%
9 2
5.9%
10 2
5.9%
ValueCountFrequency (%)
12 3
8.8%
11 3
8.8%
10 2
5.9%
9 2
5.9%
8 3
8.8%
7 3
8.8%
6 3
8.8%
5 3
8.8%
4 3
8.8%
3 3
8.8%

확진인원
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct30
Distinct (%)88.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean635.44118
Minimum0
Maximum5445
Zeros2
Zeros (%)5.9%
Negative0
Negative (%)0.0%
Memory size434.0 B
2024-04-13T21:31:19.666860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.65
Q14.75
median148.5
Q3776.25
95-th percentile2880.05
Maximum5445
Range5445
Interquartile range (IQR)771.5

Descriptive statistics

Standard deviation1143.7099
Coefficient of variation (CV)1.7998675
Kurtosis9.7186597
Mean635.44118
Median Absolute Deviation (MAD)147.5
Skewness2.944871
Sum21605
Variance1308072.3
MonotonicityNot monotonic
2024-04-13T21:31:20.072194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1 3
 
8.8%
4 2
 
5.9%
0 2
 
5.9%
2592 1
 
2.9%
1025 1
 
2.9%
709 1
 
2.9%
150 1
 
2.9%
390 1
 
2.9%
280 1
 
2.9%
147 1
 
2.9%
Other values (20) 20
58.8%
ValueCountFrequency (%)
0 2
5.9%
1 3
8.8%
2 1
 
2.9%
3 1
 
2.9%
4 2
5.9%
7 1
 
2.9%
8 1
 
2.9%
11 1
 
2.9%
17 1
 
2.9%
48 1
 
2.9%
ValueCountFrequency (%)
5445 1
2.9%
3415 1
2.9%
2592 1
2.9%
1368 1
2.9%
1230 1
2.9%
1149 1
2.9%
1025 1
2.9%
867 1
2.9%
785 1
2.9%
750 1
2.9%

사망인원
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct7
Distinct (%)20.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1176471
Minimum0
Maximum9
Zeros20
Zeros (%)58.8%
Negative0
Negative (%)0.0%
Memory size434.0 B
2024-04-13T21:31:20.428773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31.75
95-th percentile4.7
Maximum9
Range9
Interquartile range (IQR)1.75

Descriptive statistics

Standard deviation1.981195
Coefficient of variation (CV)1.7726482
Kurtosis7.4570699
Mean1.1176471
Median Absolute Deviation (MAD)0
Skewness2.5596598
Sum38
Variance3.9251337
MonotonicityNot monotonic
2024-04-13T21:31:20.753644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
0 20
58.8%
1 5
 
14.7%
2 4
 
11.8%
3 2
 
5.9%
9 1
 
2.9%
6 1
 
2.9%
4 1
 
2.9%
ValueCountFrequency (%)
0 20
58.8%
1 5
 
14.7%
2 4
 
11.8%
3 2
 
5.9%
4 1
 
2.9%
6 1
 
2.9%
9 1
 
2.9%
ValueCountFrequency (%)
9 1
 
2.9%
6 1
 
2.9%
4 1
 
2.9%
3 2
 
5.9%
2 4
 
11.8%
1 5
 
14.7%
0 20
58.8%

Interactions

2024-04-13T21:31:16.201621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T21:31:14.783303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T21:31:15.496223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T21:31:16.435813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T21:31:15.027405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T21:31:15.734223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T21:31:16.666628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T21:31:15.265790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-13T21:31:15.971008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-13T21:31:20.973443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발생연도발생월확진인원사망인원
발생연도1.0000.0000.3680.360
발생월0.0001.0000.0000.498
확진인원0.3680.0001.0000.832
사망인원0.3600.4980.8321.000
2024-04-13T21:31:21.229740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발생월확진인원사망인원발생연도
발생월1.000-0.002-0.1950.000
확진인원-0.0021.0000.7330.226
사망인원-0.1950.7331.0000.230
발생연도0.0000.2260.2301.000

Missing values

2024-04-13T21:31:16.970786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-13T21:31:17.288371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군구명발생연도발생월확진인원사망인원
0전라남도 보성군20201110
1전라남도 보성군20201230
2전라남도 보성군2021140
3전라남도 보성군2021200
4전라남도 보성군2021300
5전라남도 보성군2021410
6전라남도 보성군2021580
7전라남도 보성군2021610
8전라남도 보성군2021720
9전라남도 보성군2021840
시군구명발생연도발생월확진인원사망인원
24전라남도 보성군2022117502
25전라남도 보성군20221213681
26전라남도 보성군202318674
27전라남도 보성군202321840
28전라남도 보성군202331472
29전라남도 보성군202342800
30전라남도 보성군202353901
31전라남도 보성군202361500
32전라남도 보성군202377093
33전라남도 보성군2023810252