Overview

Dataset statistics

Number of variables: 5
Number of observations: 25
Missing cells: 20
Missing cells (%): 16.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.2 KiB
Average record size in memory: 49.3 B
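
The missing-cell percentage follows from the table dimensions and the per-variable missing counts reported elsewhere in this profile. A minimal sketch of that arithmetic; the dictionary and variable names are illustrative, and "(unnamed)" is a placeholder key for the column whose name is blank in the report:

```python
# Per-variable missing counts as reported in this profile.
missing_per_variable = {
    "년도": 0,
    "(unnamed)": 1,   # placeholder key: this column's name is blank in the report
    "확진자 수": 2,
    "사망자수": 17,
    "데이터기준일자": 0,
}

n_variables = 5
n_observations = 25

total_cells = n_variables * n_observations
missing_cells = sum(missing_per_variable.values())
missing_pct = 100 * missing_cells / total_cells

print(f"{missing_cells} of {total_cells} cells missing ({missing_pct:.1f}%)")
# prints: 20 of 125 cells missing (16.0%)
```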

Variable types

Categorical: 2
Numeric: 3

Dataset

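Description: Provides the number of confirmed COVID-19 cases and deaths in Bupyeong-gu, Incheon Metropolitan City, by year and month, from the first month of the outbreak through December 2021.
Author: Bupyeong-gu, Incheon Metropolitan City
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15098733&srcSe=7661IVAWM27C61E190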

Alerts

년도 is highly overall correlated with 데이터기준일자 (High correlation)
데이터기준일자 is highly overall correlated with (unnamed column) and 3 other fields (High correlation)
(unnamed column) is highly overall correlated with 사망자수 and 1 other field (High correlation)
확진자 수 is highly overall correlated with 사망자수 and 1 other field (High correlation)
사망자수 is highly overall correlated with (unnamed column) and 2 other fields (High correlation)
데이터기준일자 is highly imbalanced (75.8%) (Imbalance)
(unnamed column) has 1 (4.0%) missing value (Missing)
확진자 수 has 2 (8.0%) missing values (Missing)
사망자수 has 17 (68.0%) missing values (Missing)

Reproduction

Analysis started: 2024-01-28 12:17:32.508478
Analysis finished: 2024-01-28 12:17:33.484011
Duration: 0.98 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
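
The reported duration is the difference between the two timestamps above. A small sketch of that check using only the Python standard library:

```python
from datetime import datetime

# Start and finish timestamps as printed in the Reproduction section.
started = datetime.fromisoformat("2024-01-28 12:17:32.508478")
finished = datetime.fromisoformat("2024-01-28 12:17:33.484011")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
# prints: 0.98 seconds
```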

Variables

년도
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 12.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B

2020: 12
2021: 12
<NA>: 1

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 1
Unique (%): 4.0%

Sample

1st row: 2020
2nd row: 2020
3rd row: 2020
4th row: 2020
5th row: 2020

Common Values

Value   Count   Frequency (%)
2020    12      48.0%
2021    12      48.0%
<NA>    1       4.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Value   Count   Frequency (%)
2020    12      48.0%
2021    12      48.0%
na      1       4.0%


(unnamed column)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 12
Distinct (%): 50.0%
Missing: 1
Missing (%): 4.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.5
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 357.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1.15
Q1: 3.75
Median: 6.5
Q3: 9.25
95-th percentile: 11.85
Maximum: 12
Range: 11
Interquartile range (IQR): 5.5

Descriptive statistics

Standard deviation: 3.5262987
Coefficient of variation (CV): 0.54250749
Kurtosis: -1.2156934
Mean: 6.5
Median Absolute Deviation (MAD): 3
Skewness: 0
Sum: 156
Variance: 12.434783
Monotonicity: Not monotonic
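
These figures can be reproduced from the frequency tables for this variable, in which each of the values 1 to 12 occurs twice. A minimal sketch using only the Python standard library; it assumes the report uses the sample (n - 1) variance, which is what the reported numbers are consistent with:

```python
import statistics

# Each of the 12 distinct values occurs twice (24 non-missing observations).
values = [v for v in range(1, 13) for _ in range(2)]

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation

print(sum(values), mean, round(variance, 6), round(std, 7), round(cv, 8))
```

Dividing by n instead of n - 1 would give a variance of about 11.92, which does not match the report.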
[Figure: histogram with fixed size bins (bins=12)]

Common values

Value              Count   Frequency (%)
1                  2       8.0%
2                  2       8.0%
3                  2       8.0%
4                  2       8.0%
5                  2       8.0%
6                  2       8.0%
7                  2       8.0%
8                  2       8.0%
9                  2       8.0%
10                 2       8.0%
Other values (2)   4       16.0%

Smallest values

Value   Count   Frequency (%)
1       2       8.0%
2       2       8.0%
3       2       8.0%
4       2       8.0%
5       2       8.0%
6       2       8.0%
7       2       8.0%
8       2       8.0%
9       2       8.0%
10      2       8.0%

Largest values

Value   Count   Frequency (%)
12      2       8.0%
11      2       8.0%
10      2       8.0%
9       2       8.0%
8       2       8.0%
7       2       8.0%
6       2       8.0%
5       2       8.0%
4       2       8.0%
3       2       8.0%

확진자 수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 21
Distinct (%): 91.3%
Missing: 2
Missing (%): 8.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 309.6087
Minimum: 2
Maximum: 2202
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 357.0 B

Quantile statistics

Minimum: 2
5-th percentile: 2.9
Q1: 31
Median: 92
Q3: 403
95-th percentile: 1152.6
Maximum: 2202
Range: 2200
Interquartile range (IQR): 372

Descriptive statistics

Standard deviation: 513.18274
Coefficient of variation (CV): 1.6575204
Kurtosis: 8.2446339
Mean: 309.6087
Median Absolute Deviation (MAD): 79
Skewness: 2.7061791
Sum: 7121
Variance: 263356.52
Monotonicity: Not monotonic
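
The quantile statistics for 확진자 수 can be checked against the 23 non-missing values, which are recoverable from the frequency tables and sample rows in this report (the value 115 appears only in the Sample section). The sketch below assumes quantiles are computed by linear interpolation between adjacent order statistics; that is an assumption about the tool, but it reproduces every reported figure. The helper name quantile is illustrative.

```python
import statistics

# The 23 non-missing values of 확진자 수, reconstructed from the frequency
# tables and sample rows of this report.
cases = [2, 2, 11, 13, 27, 31, 31, 35, 41, 47, 74, 92, 115, 125, 139,
         193, 393, 413, 506, 703, 726, 1200, 2202]


def quantile(sorted_values, q):
    """Quantile by linear interpolation between adjacent order statistics."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


data = sorted(cases)
q1, median, q3 = (quantile(data, q) for q in (0.25, 0.5, 0.75))
iqr = q3 - q1
# Median absolute deviation: median of absolute distances from the median.
mad = statistics.median(abs(x - median) for x in data)

print(sum(data), q1, median, q3, iqr, mad)
```

The gap between the median (92) and the mean (309.6087) reflects the strong right skew caused by the two largest monthly counts, 1200 and 2202.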
[Figure: histogram with fixed size bins (bins=21)]

Common values

Value               Count   Frequency (%)
2                   2       8.0%
31                  2       8.0%
35                  1       4.0%
125                 1       4.0%
2202                1       4.0%
1200                1       4.0%
703                 1       4.0%
726                 1       4.0%
506                 1       4.0%
393                 1       4.0%
Other values (11)   11      44.0%
(Missing)           2       8.0%

Smallest values

Value   Count   Frequency (%)
2       2       8.0%
11      1       4.0%
13      1       4.0%
27      1       4.0%
31      2       8.0%
35      1       4.0%
41      1       4.0%
47      1       4.0%
74      1       4.0%
92      1       4.0%

Largest values

Value   Count   Frequency (%)
2202    1       4.0%
1200    1       4.0%
726     1       4.0%
703     1       4.0%
506     1       4.0%
413     1       4.0%
393     1       4.0%
193     1       4.0%
139     1       4.0%
125     1       4.0%

사망자수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 6
Distinct (%): 75.0%
Missing: 17
Missing (%): 68.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.375
Minimum: 1
Maximum: 13
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 357.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1.75
Median: 3
Q3: 5.5
95-th percentile: 10.9
Maximum: 13
Range: 12
Interquartile range (IQR): 3.75

Descriptive statistics

Standard deviation: 4.0333432
Coefficient of variation (CV): 0.92190701
Kurtosis: 2.717878
Mean: 4.375
Median Absolute Deviation (MAD): 2
Skewness: 1.6386457
Sum: 35
Variance: 16.267857
Monotonicity: Not monotonic
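
With only 8 non-missing values, the skewness of 사망자수 is easy to verify by hand. The sketch below assumes the bias-adjusted sample skewness (the estimator pandas uses by default); this is an assumption about the tool, but it agrees with the reported value to the printed precision.

```python
import math

# The 8 non-missing values of 사망자수, from the frequency table in this report.
deaths = [1, 1, 2, 3, 3, 5, 7, 13]

n = len(deaths)
mean = sum(deaths) / n
m2 = sum((x - mean) ** 2 for x in deaths) / n   # second central moment
m3 = sum((x - mean) ** 3 for x in deaths) / n   # third central moment

g1 = m3 / m2 ** 1.5                               # biased (population) skewness
skewness = g1 * math.sqrt(n * (n - 1)) / (n - 2)  # bias-adjusted sample skewness

print(round(mean, 3), round(skewness, 4))
```

Because 17 of the 25 rows are missing for this variable, any statistic here rests on 8 observations and should be read with that in mind.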
[Figure: histogram with fixed size bins (bins=6)]

Common values

Value       Count   Frequency (%)
1           2       8.0%
3           2       8.0%
7           1       4.0%
2           1       4.0%
5           1       4.0%
13          1       4.0%
(Missing)   17      68.0%

Smallest values

Value   Count   Frequency (%)
1       2       8.0%
2       1       4.0%
3       2       8.0%
5       1       4.0%
7       1       4.0%
13      1       4.0%

Largest values

Value   Count   Frequency (%)
13      1       4.0%
7       1       4.0%
5       1       4.0%
3       2       8.0%
2       1       4.0%
1       2       8.0%

데이터기준일자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B

2022-02-07: 24
<NA>: 1

Length

Max length: 10
Median length: 10
Mean length: 9.76
Min length: 4
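
These length statistics are consistent with the missing marker being stored as the literal four-character string "<NA>" rather than as a true null, which would also explain why this variable reports 0 missing values. A short sketch of the arithmetic, using the category counts reported for this variable:

```python
# Category counts for 데이터기준일자 as reported in this profile.
counts = {"2022-02-07": 24, "<NA>": 1}

total = sum(counts.values())
# Weighted mean of string lengths: (24 * 10 + 1 * 4) / 25.
mean_length = sum(len(value) * count for value, count in counts.items()) / total
min_length = min(len(value) for value in counts)

print(mean_length, min_length)
# prints: 9.76 4
```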

Unique

Unique: 1
Unique (%): 4.0%

Sample

1st row: 2022-02-07
2nd row: 2022-02-07
3rd row: 2022-02-07
4th row: 2022-02-07
5th row: 2022-02-07

Common Values

Value        Count   Frequency (%)
2022-02-07   24      96.0%
<NA>         1       4.0%
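
The 75.8% imbalance flagged for this variable can be reproduced from the two counts above with an entropy-based score: one minus the Shannon entropy of the category distribution divided by the entropy of a uniform distribution over the same number of categories. Treating this as the report's definition is an assumption, but it yields the reported figure.

```python
import math

# Category counts for 데이터기준일자 as reported in this profile.
counts = [24, 1]

total = sum(counts)
probabilities = [c / total for c in counts]
entropy = -sum(p * math.log2(p) for p in probabilities)   # Shannon entropy in bits
max_entropy = math.log2(len(counts))                      # entropy of a uniform split
imbalance = 1 - entropy / max_entropy

print(f"{100 * imbalance:.1f}%")
# prints: 75.8%
```

A score of 0% would mean the categories are evenly split and 100% would mean a single category holds every row.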

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Value        Count   Frequency (%)
2022-02-07   24      96.0%
na           1       4.0%

Interactions

[Figures: 9 interaction plots]

Correlations

[Figure: correlation heatmap]

             년도    (unnamed)   확진자 수   사망자수
년도         1.000   0.000       0.322       0.273
(unnamed)    0.000   1.000       0.000       0.000
확진자 수    0.322   0.000       1.000       1.000
사망자수     0.273   0.000       1.000       1.000

[Figure: correlation heatmap]

                 년도    데이터기준일자
년도             1.000   1.000
데이터기준일자   1.000   1.000

[Figure: correlation heatmap]

                 (unnamed)   확진자 수   사망자수   년도    데이터기준일자
(unnamed)        1.000       0.456       0.836      0.000   1.000
확진자 수        0.456       1.000       0.843      0.184   1.000
사망자수         0.836       0.843       1.000      0.000   1.000
년도             0.000       0.184       0.000      1.000   1.000
데이터기준일자   1.000       1.000       1.000      1.000   1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
[Figure] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

Index   년도   (unnamed)   확진자 수   사망자수   데이터기준일자
0       2020   1           <NA>        <NA>       2022-02-07
1       2020   2           2           <NA>       2022-02-07
2       2020   3           13          <NA>       2022-02-07
3       2020   4           2           <NA>       2022-02-07
4       2020   5           31          <NA>       2022-02-07
5       2020   6           35          <NA>       2022-02-07
6       2020   7           11          <NA>       2022-02-07
7       2020   8           47          <NA>       2022-02-07
8       2020   9           31          1          2022-02-07
9       2020   10          27          <NA>       2022-02-07

Last rows

Index   년도   (unnamed)   확진자 수   사망자수   데이터기준일자
15      2021   4           115         <NA>       2022-02-07
16      2021   5           125         1          2022-02-07
17      2021   6           139         <NA>       2022-02-07
18      2021   7           393         <NA>       2022-02-07
19      2021   8           506         3          2022-02-07
20      2021   9           726         <NA>       2022-02-07
21      2021   10          703         3          2022-02-07
22      2021   11          1200        5          2022-02-07
23      2021   12          2202        13         2022-02-07
24      <NA>   <NA>        <NA>        <NA>       <NA>