Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 25 |
Missing cells | 20 |
Missing cells (%) | 16.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.2 KiB |
Average record size in memory | 49.3 B |
Variable types
Categorical | 2 |
---|---|
Numeric | 3 |
Dataset
Description | 인천광역시 부평구의 코로나19 발생 최초월부터 2021년 12월까지 연도별, 월별 확진자 수, 사망자 수 데이터를 제공합니다. |
---|---|
Author | 인천광역시 부평구 |
URL | https://data.incheon.go.kr/findData/publicDataDetail?dataId=15098733&srcSe=7661IVAWM27C61E190 |
년도 is highly overall correlated with 데이터기준일자 | High correlation |
데이터기준일자 is highly overall correlated with 월 and 3 other fields | High correlation |
월 is highly overall correlated with 사망자수 and 1 other fields | High correlation |
확진자 수 is highly overall correlated with 사망자수 and 1 other fields | High correlation |
사망자수 is highly overall correlated with 월 and 2 other fields | High correlation |
데이터기준일자 is highly imbalanced (75.8%) | Imbalance |
월 has 1 (4.0%) missing values | Missing |
확진자 수 has 2 (8.0%) missing values | Missing |
사망자수 has 17 (68.0%) missing values | Missing |
Reproduction
Analysis started | 2024-01-28 12:17:32.508478 |
---|---|
Analysis finished | 2024-01-28 12:17:33.484011 |
Duration | 0.98 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
년도
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 12.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 332.0 B |
2020 | |
---|---|
2021 | |
<NA> | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 4.0% |
Sample
1st row | 2020 |
---|---|
2nd row | 2020 |
3rd row | 2020 |
4th row | 2020 |
5th row | 2020 |
Common Values
Value | Count | Frequency (%) |
2020 | 12 | |
2021 | 12 | |
<NA> | 1 | 4.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020 | 12 | |
2021 | 12 | |
na | 1 | 4.0% |
월
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | 50.0% |
Missing | 1 |
Missing (%) | 4.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.5 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 357.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1.15 |
Q1 | 3.75 |
median | 6.5 |
Q3 | 9.25 |
95-th percentile | 11.85 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 5.5 |
Descriptive statistics
Standard deviation | 3.5262987 |
---|---|
Coefficient of variation (CV) | 0.54250749 |
Kurtosis | -1.2156934 |
Mean | 6.5 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0 |
Sum | 156 |
Variance | 12.434783 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 2 | |
2 | 2 | |
3 | 2 | |
4 | 2 | |
5 | 2 | |
6 | 2 | |
7 | 2 | |
8 | 2 | |
9 | 2 | |
10 | 2 | |
Other values (2) | 4 |
Value | Count | Frequency (%) |
1 | 2 | |
2 | 2 | |
3 | 2 | |
4 | 2 | |
5 | 2 | |
6 | 2 | |
7 | 2 | |
8 | 2 | |
9 | 2 | |
10 | 2 |
Value | Count | Frequency (%) |
12 | 2 | |
11 | 2 | |
10 | 2 | |
9 | 2 | |
8 | 2 | |
7 | 2 | |
6 | 2 | |
5 | 2 | |
4 | 2 | |
3 | 2 |
확진자 수
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 21 |
---|---|
Distinct (%) | 91.3% |
Missing | 2 |
Missing (%) | 8.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 309.6087 |
Minimum | 2 |
---|---|
Maximum | 2202 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 357.0 B |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 2.9 |
Q1 | 31 |
median | 92 |
Q3 | 403 |
95-th percentile | 1152.6 |
Maximum | 2202 |
Range | 2200 |
Interquartile range (IQR) | 372 |
Descriptive statistics
Standard deviation | 513.18274 |
---|---|
Coefficient of variation (CV) | 1.6575204 |
Kurtosis | 8.2446339 |
Mean | 309.6087 |
Median Absolute Deviation (MAD) | 79 |
Skewness | 2.7061791 |
Sum | 7121 |
Variance | 263356.52 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 2 | 8.0% |
31 | 2 | 8.0% |
35 | 1 | 4.0% |
125 | 1 | 4.0% |
2202 | 1 | 4.0% |
1200 | 1 | 4.0% |
703 | 1 | 4.0% |
726 | 1 | 4.0% |
506 | 1 | 4.0% |
393 | 1 | 4.0% |
Other values (11) | 11 | |
(Missing) | 2 | 8.0% |
Value | Count | Frequency (%) |
2 | 2 | |
11 | 1 | |
13 | 1 | |
27 | 1 | |
31 | 2 | |
35 | 1 | |
41 | 1 | |
47 | 1 | |
74 | 1 | |
92 | 1 |
Value | Count | Frequency (%) |
2202 | 1 | |
1200 | 1 | |
726 | 1 | |
703 | 1 | |
506 | 1 | |
413 | 1 | |
393 | 1 | |
193 | 1 | |
139 | 1 | |
125 | 1 |
사망자수
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 75.0% |
Missing | 17 |
Missing (%) | 68.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.375 |
Minimum | 1 |
---|---|
Maximum | 13 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 357.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1.75 |
median | 3 |
Q3 | 5.5 |
95-th percentile | 10.9 |
Maximum | 13 |
Range | 12 |
Interquartile range (IQR) | 3.75 |
Descriptive statistics
Standard deviation | 4.0333432 |
---|---|
Coefficient of variation (CV) | 0.92190701 |
Kurtosis | 2.717878 |
Mean | 4.375 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 1.6386457 |
Sum | 35 |
Variance | 16.267857 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 2 | 8.0% |
3 | 2 | 8.0% |
7 | 1 | 4.0% |
2 | 1 | 4.0% |
5 | 1 | 4.0% |
13 | 1 | 4.0% |
(Missing) | 17 |
Value | Count | Frequency (%) |
1 | 2 | |
2 | 1 | |
3 | 2 | |
5 | 1 | |
7 | 1 | |
13 | 1 |
Value | Count | Frequency (%) |
13 | 1 | |
7 | 1 | |
5 | 1 | |
3 | 2 | |
2 | 1 | |
1 | 2 |
데이터기준일자
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 8.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 332.0 B |
2022-02-07 | |
---|---|
<NA> | 1 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.76 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 4.0% |
Sample
1st row | 2022-02-07 |
---|---|
2nd row | 2022-02-07 |
3rd row | 2022-02-07 |
4th row | 2022-02-07 |
5th row | 2022-02-07 |
Common Values
Value | Count | Frequency (%) |
2022-02-07 | 24 | |
<NA> | 1 | 4.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2022-02-07 | 24 | |
na | 1 | 4.0% |
년도 | 월 | 확진자 수 | 사망자수 | |
---|---|---|---|---|
년도 | 1.000 | 0.000 | 0.322 | 0.273 |
월 | 0.000 | 1.000 | 0.000 | 0.000 |
확진자 수 | 0.322 | 0.000 | 1.000 | 1.000 |
사망자수 | 0.273 | 0.000 | 1.000 | 1.000 |
년도 | 데이터기준일자 | |
---|---|---|
년도 | 1.000 | 1.000 |
데이터기준일자 | 1.000 | 1.000 |
월 | 확진자 수 | 사망자수 | 년도 | 데이터기준일자 | |
---|---|---|---|---|---|
월 | 1.000 | 0.456 | 0.836 | 0.000 | 1.000 |
확진자 수 | 0.456 | 1.000 | 0.843 | 0.184 | 1.000 |
사망자수 | 0.836 | 0.843 | 1.000 | 0.000 | 1.000 |
년도 | 0.000 | 0.184 | 0.000 | 1.000 | 1.000 |
데이터기준일자 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
년도 | 월 | 확진자 수 | 사망자수 | 데이터기준일자 | |
---|---|---|---|---|---|
0 | 2020 | 1 | <NA> | <NA> | 2022-02-07 |
1 | 2020 | 2 | 2 | <NA> | 2022-02-07 |
2 | 2020 | 3 | 13 | <NA> | 2022-02-07 |
3 | 2020 | 4 | 2 | <NA> | 2022-02-07 |
4 | 2020 | 5 | 31 | <NA> | 2022-02-07 |
5 | 2020 | 6 | 35 | <NA> | 2022-02-07 |
6 | 2020 | 7 | 11 | <NA> | 2022-02-07 |
7 | 2020 | 8 | 47 | <NA> | 2022-02-07 |
8 | 2020 | 9 | 31 | 1 | 2022-02-07 |
9 | 2020 | 10 | 27 | <NA> | 2022-02-07 |
년도 | 월 | 확진자 수 | 사망자수 | 데이터기준일자 | |
---|---|---|---|---|---|
15 | 2021 | 4 | 115 | <NA> | 2022-02-07 |
16 | 2021 | 5 | 125 | 1 | 2022-02-07 |
17 | 2021 | 6 | 139 | <NA> | 2022-02-07 |
18 | 2021 | 7 | 393 | <NA> | 2022-02-07 |
19 | 2021 | 8 | 506 | 3 | 2022-02-07 |
20 | 2021 | 9 | 726 | <NA> | 2022-02-07 |
21 | 2021 | 10 | 703 | 3 | 2022-02-07 |
22 | 2021 | 11 | 1200 | 5 | 2022-02-07 |
23 | 2021 | 12 | 2202 | 13 | 2022-02-07 |
24 | <NA> | <NA> | <NA> | <NA> | <NA> |