Overview

Dataset statistics

Number of variables5
Number of observations28
Missing cells27
Missing cells (%)19.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory47.6 B

Variable types

Numeric2
DateTime2
Categorical1

Dataset

Description2021년 2월 전라남도 장흥군 코로나19 최초 확진자 발생 이후부터 2023년 8월까지 전라남도 장흥군의 월별 코로나 19 확진자 및 사망자 현황
Author전라남도 장흥군
URLhttps://www.data.go.kr/data/15099314/fileData.do

Alerts

최초 확진 년월일 has constant value ""Constant
연번 is highly overall correlated with 확진자수High correlation
확진자수 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
사망자수 is highly overall correlated with 확진자수High correlation
최초 확진 년월일 has 27 (96.4%) missing valuesMissing
연번 has unique valuesUnique
연월 has unique valuesUnique

Reproduction

Analysis started2024-03-15 02:16:49.815924
Analysis finished2024-03-15 02:16:52.598348
Duration2.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.5
Minimum1
Maximum28
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size380.0 B
2024-03-15T11:16:52.753376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.35
Q17.75
median14.5
Q321.25
95-th percentile26.65
Maximum28
Range27
Interquartile range (IQR)13.5

Descriptive statistics

Standard deviation8.2259751
Coefficient of variation (CV)0.56730863
Kurtosis-1.2
Mean14.5
Median Absolute Deviation (MAD)7
Skewness0
Sum406
Variance67.666667
MonotonicityStrictly increasing
2024-03-15T11:16:52.992473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
1 1
 
3.6%
16 1
 
3.6%
28 1
 
3.6%
27 1
 
3.6%
26 1
 
3.6%
25 1
 
3.6%
24 1
 
3.6%
23 1
 
3.6%
22 1
 
3.6%
21 1
 
3.6%
Other values (18) 18
64.3%
ValueCountFrequency (%)
1 1
3.6%
2 1
3.6%
3 1
3.6%
4 1
3.6%
5 1
3.6%
6 1
3.6%
7 1
3.6%
8 1
3.6%
9 1
3.6%
10 1
3.6%
ValueCountFrequency (%)
28 1
3.6%
27 1
3.6%
26 1
3.6%
25 1
3.6%
24 1
3.6%
23 1
3.6%
22 1
3.6%
21 1
3.6%
20 1
3.6%
19 1
3.6%

연월
Date

UNIQUE 

Distinct28
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size352.0 B
Minimum2021-02-21 00:00:00
Maximum2023-08-23 00:00:00
2024-03-15T11:16:53.205830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T11:16:53.430739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)

확진자수
Real number (ℝ)

HIGH CORRELATION 

Distinct27
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean728.64286
Minimum1
Maximum5498
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size380.0 B
2024-03-15T11:16:53.757710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5
Q124.75
median226.5
Q3811.5
95-th percentile3160.95
Maximum5498
Range5497
Interquartile range (IQR)786.75

Descriptive statistics

Standard deviation1226.689
Coefficient of variation (CV)1.6835258
Kurtosis8.8590622
Mean728.64286
Median Absolute Deviation (MAD)221.5
Skewness2.8694882
Sum20402
Variance1504766
MonotonicityNot monotonic
2024-03-15T11:16:54.016999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
5 2
 
7.1%
1 1
 
3.6%
1139 1
 
3.6%
936 1
 
3.6%
178 1
 
3.6%
285 1
 
3.6%
197 1
 
3.6%
195 1
 
3.6%
201 1
 
3.6%
580 1
 
3.6%
Other values (17) 17
60.7%
ValueCountFrequency (%)
1 1
3.6%
5 2
7.1%
10 1
3.6%
13 1
3.6%
23 1
3.6%
24 1
3.6%
25 1
3.6%
62 1
3.6%
110 1
3.6%
178 1
3.6%
ValueCountFrequency (%)
5498 1
3.6%
3645 1
3.6%
2262 1
3.6%
1185 1
3.6%
1139 1
3.6%
946 1
3.6%
936 1
3.6%
770 1
3.6%
664 1
3.6%
604 1
3.6%

사망자수
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)17.9%
Missing0
Missing (%)0.0%
Memory size352.0 B
0
16 
2
1
4
9

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 16
57.1%
2 4
 
14.3%
1 3
 
10.7%
4 3
 
10.7%
9 2
 
7.1%

Length

2024-03-15T11:16:54.290942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T11:16:54.567014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 16
57.1%
2 4
 
14.3%
1 3
 
10.7%
4 3
 
10.7%
9 2
 
7.1%

최초 확진 년월일
Date

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing27
Missing (%)96.4%
Memory size352.0 B
Minimum2021-02-17 00:00:00
Maximum2021-02-17 00:00:00
2024-03-15T11:16:54.759591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T11:16:55.077004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-15T11:16:51.046641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T11:16:50.309285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T11:16:51.436351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T11:16:50.655767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T11:16:55.292344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번연월확진자수사망자수
연번1.0001.0000.4410.554
연월1.0001.0001.0001.000
확진자수0.4411.0001.0000.835
사망자수0.5541.0000.8351.000
2024-03-15T11:16:55.547801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번확진자수사망자수
연번1.0000.5780.202
확진자수0.5781.0000.719
사망자수0.2020.7191.000

Missing values

2024-03-15T11:16:51.950636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T11:16:52.329016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번연월확진자수사망자수최초 확진 년월일
012021-02-21502021-02-17
122021-03-2150<NA>
232021-04-2110<NA>
342021-08-21230<NA>
452021-09-21130<NA>
562021-10-21100<NA>
672021-11-21251<NA>
782021-12-21240<NA>
892022-01-22621<NA>
9102022-02-226642<NA>
연번연월확진자수사망자수최초 확진 년월일
18192022-11-229460<NA>
19202022-12-2211854<NA>
20212023-01-235802<NA>
21222023-02-232010<NA>
22232023-03-231950<NA>
23242023-04-231970<NA>
24252023-05-232850<NA>
25262023-06-231780<NA>
26272023-07-239362<NA>
27282023-08-2311394<NA>