Overview

Dataset statistics

Number of variables: 8
Number of observations: 36
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.6 KiB
Average record size in memory: 72.6 B

Variable types

Numeric: 4
Categorical: 4

Dataset

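Description: COVID-19 confirmed cases and deaths for Yuseong-gu, Daejeon Metropolitan City. Fields: 연번 (serial number), 시도 (province/city), 시군구 (district), 연 (year), 월 (month), 확진자 (confirmed cases), 사망자 (deaths), 데이터기준일자 (data reference date).
Author: Yuseong-gu, Daejeon Metropolitan City (대전광역시 유성구)
URL: https://www.data.go.kr/data/15099446/fileData.do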

Alerts

시도 has constant value "대전광역시" (Constant)
시군구 has constant value "유성구" (Constant)
데이터기준일자 is highly overall correlated with 연번 and 2 other fields (High correlation)
연 is highly overall correlated with 연번 and 1 other field (High correlation)
연번 is highly overall correlated with 확진자 and 3 other fields (High correlation)
확진자 is highly overall correlated with 연번 and 2 other fields (High correlation)
사망자 is highly overall correlated with 연번 and 1 other field (High correlation)
연번 has unique values (Unique)
사망자 has 16 (44.4%) zeros (Zeros)

Reproduction

Analysis started: 2024-03-14 21:25:02.096427
Analysis finished: 2024-03-14 21:25:06.818854
Duration: 4.72 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
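
A report of this kind is typically generated in a few lines. The sketch below is an assumption about the workflow, not the script that produced this report: the helper name `build_report`, the file names in the usage note, the report title, and the `cp949` encoding are all hypothetical. `ProfileReport` and its `to_file` method are the public entry points of ydata-profiling.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "cp949") -> None:
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so that defining this helper does not require the
    # heavy third-party dependencies to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is illustrative and not taken from the original report.
    profile = ProfileReport(df, title="Yuseong-gu COVID-19 cases and deaths")
    profile.to_file(html_path)
```

Calling `build_report("yuseong_covid19.csv", "report.html")` with a local copy of the dataset would write the report next to the script; both file names are placeholders.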

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 36
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18.5
Minimum: 1
Maximum: 36
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 452.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.75
Q1: 9.75
Median: 18.5
Q3: 27.25
95th percentile: 34.25
Maximum: 36
Range: 35
Interquartile range (IQR): 17.5
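
The fractional percentiles above are consistent with linear interpolation between order statistics. A minimal check, assuming 연번 takes every integer from 1 to 36 (36 distinct values with minimum 1, maximum 36, and sum 666), can be written with the standard library alone:

```python
import statistics

serial = list(range(1, 37))  # assumed values of 연번: 1, 2, ..., 36

# method="inclusive" interpolates linearly between order statistics and
# treats the minimum and maximum of the data as the 0th and 100th percentiles.
q1, median, q3 = statistics.quantiles(serial, n=4, method="inclusive")
cuts = statistics.quantiles(serial, n=20, method="inclusive")
p5, p95 = cuts[0], cuts[-1]  # 5th and 95th percentiles

print(p5, q1, median, q3, p95)  # 2.75 9.75 18.5 27.25 34.25
print("range:", max(serial) - min(serial), "IQR:", q3 - q1)
```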

Descriptive statistics

Standard deviation: 10.535654
Coefficient of variation (CV): 0.5694948
Kurtosis: -1.2
Mean: 18.5
Median Absolute Deviation (MAD): 9
Skewness: 0
Sum: 666
Variance: 111
Monotonicity: Strictly increasing
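
Several of these figures follow directly from one another. The sketch below again assumes that 연번 is exactly 1 through 36 and uses the sample (n - 1) definitions of variance and standard deviation, which reproduce the reported values:

```python
import statistics

serial = list(range(1, 37))  # assumed values of 연번: 1, 2, ..., 36

mean = statistics.fmean(serial)
variance = statistics.variance(serial)  # sample variance (divides by n - 1)
std = statistics.stdev(serial)          # square root of the sample variance
cv = std / mean                         # coefficient of variation
median = statistics.median(serial)
# Median absolute deviation: median distance of each value from the median.
mad = statistics.median([abs(x - median) for x in serial])

print(sum(serial), mean)            # 666 18.5
print(round(std, 6), round(cv, 7))  # 10.535654 0.5694948
print(variance, mad)
```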
Histogram with fixed size bins (bins=36)
Common values

Value              Count  Frequency (%)
1                  1      2.8%
20                 1      2.8%
22                 1      2.8%
23                 1      2.8%
24                 1      2.8%
25                 1      2.8%
26                 1      2.8%
27                 1      2.8%
28                 1      2.8%
29                 1      2.8%
Other values (26)  26     72.2%

Minimum 10 values

Value  Count  Frequency (%)
1      1      2.8%
2      1      2.8%
3      1      2.8%
4      1      2.8%
5      1      2.8%
6      1      2.8%
7      1      2.8%
8      1      2.8%
9      1      2.8%
10     1      2.8%

Maximum 10 values

Value  Count  Frequency (%)
36     1      2.8%
35     1      2.8%
34     1      2.8%
33     1      2.8%
32     1      2.8%
31     1      2.8%
30     1      2.8%
29     1      2.8%
28     1      2.8%
27     1      2.8%

시도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 416.0 B
대전광역시: 36

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대전광역시
2nd row: 대전광역시
3rd row: 대전광역시
4th row: 대전광역시
5th row: 대전광역시

Common Values

Value       Count  Frequency (%)
대전광역시  36     100.0%


시군구
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 416.0 B
유성구: 36

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유성구
2nd row: 유성구
3rd row: 유성구
4th row: 유성구
5th row: 유성구

Common Values

Value   Count  Frequency (%)
유성구  36     100.0%



연
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Memory size: 416.0 B
2020: 12
2021: 12
2022: 12

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2020
3rd row: 2020
4th row: 2020
5th row: 2020

Common Values

Value  Count  Frequency (%)
2020   12     33.3%
2021   12     33.3%
2022   12     33.3%



월
Real number (ℝ)

Distinct: 12
Distinct (%): 33.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.5
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 452.0 B

Quantile statistics

Minimum: 1
5th percentile: 1
Q1: 3.75
Median: 6.5
Q3: 9.25
95th percentile: 12
Maximum: 12
Range: 11
Interquartile range (IQR): 5.5

Descriptive statistics

Standard deviation: 3.5010203
Coefficient of variation (CV): 0.5386185
Kurtosis: -1.217232
Mean: 6.5
Median Absolute Deviation (MAD): 3
Skewness: 0
Sum: 234
Variance: 12.257143
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=12)
Common values

Value             Count  Frequency (%)
1                 3      8.3%
2                 3      8.3%
3                 3      8.3%
4                 3      8.3%
5                 3      8.3%
6                 3      8.3%
7                 3      8.3%
8                 3      8.3%
9                 3      8.3%
10                3      8.3%
Other values (2)  6      16.7%

Minimum 10 values

Value  Count  Frequency (%)
1      3      8.3%
2      3      8.3%
3      3      8.3%
4      3      8.3%
5      3      8.3%
6      3      8.3%
7      3      8.3%
8      3      8.3%
9      3      8.3%
10     3      8.3%

Maximum 10 values

Value  Count  Frequency (%)
12     3      8.3%
11     3      8.3%
10     3      8.3%
9      3      8.3%
8      3      8.3%
7      3      8.3%
6      3      8.3%
5      3      8.3%
4      3      8.3%
3      3      8.3%

확진자
Real number (ℝ)

HIGH CORRELATION 

Distinct: 34
Distinct (%): 94.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6494.2778
Minimum: 1
Maximum: 72801
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 452.0 B

Quantile statistics

Minimum: 1
5th percentile: 4.75
Q1: 27
Median: 145
Q3: 7248
95th percentile: 32219
Maximum: 72801
Range: 72800
Interquartile range (IQR): 7221

Descriptive statistics

Standard deviation: 14265.258
Coefficient of variation (CV): 2.1965889
Kurtosis: 13.381268
Mean: 6494.2778
Median Absolute Deviation (MAD): 142.5
Skewness: 3.3767677
Sum: 233794
Variance: 2.0349759 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=34)
Common values

Value              Count  Frequency (%)
15                 3      8.3%
1                  1      2.8%
8754               1      2.8%
457                1      2.8%
946                1      2.8%
1480               1      2.8%
16134              1      2.8%
72801              1      2.8%
33419              1      2.8%
2461               1      2.8%
Other values (24)  24     66.7%

Minimum 10 values

Value  Count  Frequency (%)
1      1      2.8%
4      1      2.8%
5      1      2.8%
7      1      2.8%
15     3      8.3%
18     1      2.8%
21     1      2.8%
29     1      2.8%
31     1      2.8%
32     1      2.8%

Maximum 10 values

Value  Count  Frequency (%)
72801  1      2.8%
33419  1      2.8%
31819  1      2.8%
17393  1      2.8%
16134  1      2.8%
14123  1      2.8%
12866  1      2.8%
12313  1      2.8%
8754   1      2.8%
6746   1      2.8%

사망자
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 8
Distinct (%): 22.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.6944444
Minimum: 0
Maximum: 49
Zeros: 16
Zeros (%): 44.4%
Negative: 0
Negative (%): 0.0%
Memory size: 452.0 B

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 1
Q3: 1
95th percentile: 9.25
Maximum: 49
Range: 49
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 8.287062
Coefficient of variation (CV): 3.0756106
Kurtosis: 29.829538
Mean: 2.6944444
Median Absolute Deviation (MAD): 1
Skewness: 5.2938965
Sum: 97
Variance: 68.675397
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=8)
Common values

Value  Count  Frequency (%)
0      16     44.4%
1      13     36.1%
3      2      5.6%
6      1      2.8%
10     1      2.8%
49     1      2.8%
9      1      2.8%
4      1      2.8%

Minimum 10 values

Value  Count  Frequency (%)
0      16     44.4%
1      13     36.1%
3      2      5.6%
4      1      2.8%
6      1      2.8%
9      1      2.8%
10     1      2.8%
49     1      2.8%

Maximum 10 values

Value  Count  Frequency (%)
49     1      2.8%
10     1      2.8%
9      1      2.8%
6      1      2.8%
4      1      2.8%
3      2      5.6%
1      13     36.1%
0      16     44.4%
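
Because 사망자 has only 8 distinct values, its frequency table lists every observation, so the summary statistics can be recomputed from the table alone. The sketch below is an illustrative cross-check rather than part of the report. It expands the value/count pairs back into 36 records and assumes the sample (n - 1) variance.

```python
from collections import Counter
import statistics

# Value -> count, copied from the frequency table for 사망자.
death_counts = Counter({0: 16, 1: 13, 3: 2, 4: 1, 6: 1, 9: 1, 10: 1, 49: 1})

deaths = sorted(death_counts.elements())  # one entry per monthly record
n = len(deaths)

total = sum(deaths)
mean = total / n
zero_share = 100 * death_counts[0] / n    # percentage of months with no deaths
variance = statistics.variance(deaths)    # sample variance (n - 1 denominator)

print(n, total)              # 36 97
print(round(mean, 7))        # 2.6944444
print(round(zero_share, 1))  # 44.4
print(round(variance, 6))    # 68.675397
```

The single month with 49 deaths accounts for about half of the total of 97, which is why the mean (2.69) sits well above the median (1) and the skewness is large.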

데이터기준일자
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 416.0 B
2022-03-17: 24
2023-03-14: 12

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022-03-17
2nd row: 2022-03-17
3rd row: 2022-03-17
4th row: 2022-03-17
5th row: 2022-03-17

Common Values

Value       Count  Frequency (%)
2022-03-17  24     66.7%
2023-03-14  12     33.3%


Interactions

[Figure: 16 pairwise interaction plots]

Correlations

                연번   연     월     확진자  사망자  데이터기준일자
연번            1.000  0.936  0.000  0.679  0.000  0.995
연              0.936  1.000  0.000  0.556  0.000  1.000
월              0.000  0.000  1.000  0.000  0.000  0.000
확진자          0.679  0.556  0.000  1.000  0.463  0.660
사망자          0.000  0.000  0.000  0.463  1.000  0.000
데이터기준일자  0.995  1.000  0.000  0.660  0.000  1.000

                데이터기준일자  연
데이터기준일자  1.000           0.985
연              0.985           1.000

                연번   월     확진자  사망자  연     데이터기준일자
연번            1.000  0.332  0.937  0.689  0.814  0.824
월              0.332  1.000  0.274  0.144  0.000  0.000
확진자          0.937  0.274  1.000  0.733  0.482  0.754
사망자          0.689  0.144  0.733  1.000  0.000  0.000
연              0.814  0.000  0.482  0.000  1.000  0.985
데이터기준일자  0.824  0.000  0.754  0.000  0.985  1.000
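
The "High correlation" alerts in the Overview can be cross-checked against the last matrix above. The sketch below rests on two assumptions that the report itself does not state: that the rows of that matrix are ordered 연번, 월, 확진자, 사망자, 연, 데이터기준일자, and that an alert is raised when a coefficient exceeds 0.5. The function name `correlated_fields` is hypothetical.

```python
THRESHOLD = 0.5  # assumed cut-off for the "High correlation" alert

# Assumed row/column order of the last correlation matrix.
labels = ["연번", "월", "확진자", "사망자", "연", "데이터기준일자"]
matrix = [
    [1.000, 0.332, 0.937, 0.689, 0.814, 0.824],
    [0.332, 1.000, 0.274, 0.144, 0.000, 0.000],
    [0.937, 0.274, 1.000, 0.733, 0.482, 0.754],
    [0.689, 0.144, 0.733, 1.000, 0.000, 0.000],
    [0.814, 0.000, 0.482, 0.000, 1.000, 0.985],
    [0.824, 0.000, 0.754, 0.000, 0.985, 1.000],
]


def correlated_fields(labels, matrix, threshold):
    """Map each variable to the other variables whose coefficient exceeds threshold."""
    result = {}
    for i, name in enumerate(labels):
        result[name] = [
            other
            for j, other in enumerate(labels)
            if i != j and abs(matrix[i][j]) > threshold
        ]
    return result


alerts = correlated_fields(labels, matrix, THRESHOLD)
for name, others in alerts.items():
    print(name, len(others), others)
```

Under these assumptions the counts agree with the alerts: 연번 has four correlated fields, 확진자 and 데이터기준일자 three each, 사망자 and 연 two each, and 월 none.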

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    연번  시도        시군구  연    월  확진자  사망자  데이터기준일자
0   1     대전광역시  유성구  2020  1   1       0       2022-03-17
1   2     대전광역시  유성구  2020  2   7       0       2022-03-17
2   3     대전광역시  유성구  2020  3   21      0       2022-03-17
3   4     대전광역시  유성구  2020  4   4       0       2022-03-17
4   5     대전광역시  유성구  2020  5   5       0       2022-03-17
5   6     대전광역시  유성구  2020  6   18      0       2022-03-17
6   7     대전광역시  유성구  2020  7   15      0       2022-03-17
7   8     대전광역시  유성구  2020  8   29      0       2022-03-17
8   9     대전광역시  유성구  2020  9   15      0       2022-03-17
9   10    대전광역시  유성구  2020  10  43      1       2022-03-17

Last rows

    연번  시도        시군구  연    월  확진자  사망자  데이터기준일자
26  27    대전광역시  유성구  2022  3   72801   1       2023-03-14
27  28    대전광역시  유성구  2022  4   33419   49      2023-03-14
28  29    대전광역시  유성구  2022  5   8754    1       2023-03-14
29  30    대전광역시  유성구  2022  6   2461    3       2023-03-14
30  31    대전광역시  유성구  2022  7   12866   1       2023-03-14
31  32    대전광역시  유성구  2022  8   31819   9       2023-03-14
32  33    대전광역시  유성구  2022  9   12313   1       2023-03-14
33  34    대전광역시  유성구  2022  10  6746    0       2023-03-14
34  35    대전광역시  유성구  2022  11  14123   3       2023-03-14
35  36    대전광역시  유성구  2022  12  17393   4       2023-03-14