Overview

Dataset statistics

Number of variables: 3
Number of observations: 26
Missing cells: 5
Missing cells (%): 6.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 782.0 B
Average record size in memory: 30.1 B
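
The two derived figures above can be checked with simple arithmetic. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is total memory divided by the number of observations. These formulas are inferred from the reported numbers, and the variable names are illustrative only.

```python
# Figures copied from the dataset statistics above.
n_variables = 3
n_observations = 26
missing_cells = 5
total_memory_bytes = 782.0

# Assumed formulas (inferred from the reported values, not stated in the report).
total_cells = n_variables * n_observations
missing_pct = missing_cells / total_cells * 100
avg_record_bytes = total_memory_bytes / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```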

Variable types

DateTime: 1
Text: 1
Categorical: 1

Dataset

Description: Provides data on the number of confirmed COVID-19 cases and deaths in Miryang-si, Gyeongsangnam-do, from February 2020 through April 2023.
Author: Miryang-si, Gyeongsangnam-do
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15098719

Alerts

사망자수 (deaths) is highly imbalanced (60.1%) [Imbalance]
확진자수 (confirmed cases) has 5 (19.2%) missing values [Missing]
월별 (month) has unique values [Unique]

Reproduction

Analysis started: 2024-04-18 01:58:28.001235
Analysis finished: 2024-04-18 01:58:29.184927
Duration: 1.18 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

월별 (month)
Date

UNIQUE

Distinct: 26
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B
Minimum: 2020-02-01 00:00:00
Maximum: 2022-03-01 00:00:00
Histogram with fixed size bins (bins=26)

확진자수 (confirmed cases)
Text

MISSING

Distinct: 17
Distinct (%): 81.0%
Missing: 5
Missing (%): 19.2%
Memory size: 340.0 B

Length

Max length: 6
Median length: 5
Mean length: 2.0952381
Min length: 1

Characters and Unicode

Total characters: 44
Distinct characters: 10
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
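
As an illustration of these character properties, the sketch below uses Python's standard `unicodedata` module to count general categories over a few cell values. The list `values` is a hypothetical subset of the column, not the full data. The codes `Nd` and `Po` are the Unicode general categories for "Decimal Number" and "Other Punctuation", the two categories this report lists for the variable.

```python
import unicodedata
from collections import Counter

# Hypothetical subset of cell values; "2,516" and "16,940" appear in the report's sample.
values = ["4", "1", "2,516", "16,940"]

# Count the Unicode general category of every character in every value.
category_counts = Counter(
    unicodedata.category(ch) for value in values for ch in value
)

for category, count in category_counts.most_common():
    print(category, count)
```

The thousands separators are the only non-digit characters here, which is why a column of counts ends up with a punctuation category at all.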

Unique

Unique: 14
Unique (%): 66.7%
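
The percentages for this variable are consistent with a denominator of 21 non-missing values (26 observations minus 5 missing) rather than 26. The sketch below checks that reading against the reported figures. The choice of denominator is an inference from the numbers, not something the report states.

```python
# Figures copied from the report for 확진자수.
n_observations = 26
n_missing = 5
n_distinct = 17
n_unique = 14
total_characters = 44

# Assumption: ratios are computed over non-missing values only.
n_present = n_observations - n_missing

distinct_pct = n_distinct / n_present * 100
unique_pct = n_unique / n_present * 100
mean_length = total_characters / n_present

print(f"Distinct (%): {distinct_pct:.1f}%")
print(f"Unique (%): {unique_pct:.1f}%")
print(f"Mean length: {mean_length:.7f}")
```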

Sample

1st row: 4
2nd row: 1
3rd row: 2
4th row: 1
5th row: 1
Value             Count  Frequency (%)
1                 3      14.3%
44                2      9.5%
2                 2      9.5%
31                1      4.8%
4                 1      4.8%
21                1      4.8%
2,516             1      4.8%
190               1      4.8%
219               1      4.8%
43                1      4.8%
Other values (7)  7      33.3%

Most occurring characters

Value  Count  Frequency (%)
1      13     29.5%
2      7      15.9%
4      7      15.9%
9      4      9.1%
6      3      6.8%
0      3      6.8%
5      2      4.5%
3      2      4.5%
,      2      4.5%
7      1      2.3%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     42     95.5%
Other Punctuation  2      4.5%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
1      13     31.0%
2      7      16.7%
4      7      16.7%
9      4      9.5%
6      3      7.1%
0      3      7.1%
5      2      4.8%
3      2      4.8%
7      1      2.4%

Other Punctuation
Value  Count  Frequency (%)
,      2      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  44     100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
1      13     29.5%
2      7      15.9%
4      7      15.9%
9      4      9.1%
6      3      6.8%
0      3      6.8%
5      2      4.5%
3      2      4.5%
,      2      4.5%
7      1      2.3%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  44     100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
1      13     29.5%
2      7      15.9%
4      7      15.9%
9      4      9.1%
6      3      6.8%
0      3      6.8%
5      2      4.5%
3      2      4.5%
,      2      4.5%
7      1      2.3%

사망자수 (deaths)
Categorical

IMBALANCE

Distinct: 5
Distinct (%): 19.2%
Missing: 0
Missing (%): 0.0%
Memory size: 340.0 B

Value  Count
<NA>   22
3      1
1      1
2      1
22     1

Length

Max length: 4
Median length: 4
Mean length: 3.5769231
Min length: 1

Unique

Unique: 4
Unique (%): 15.4%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value  Count  Frequency (%)
<NA>   22     84.6%
3      1      3.8%
1      1      3.8%
2      1      3.8%
22     1      3.8%
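
The 60.1% imbalance figure reported for this variable can be reproduced from the counts above with a normalized-entropy score: one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy for that number of classes. The sketch below is one formula that matches the reported value. Treating it as the exact computation used by the profiling tool is an assumption, and `imbalance_score` is an illustrative name.

```python
import math

# Value counts copied from the Common Values table for 사망자수.
counts = {"<NA>": 22, "3": 1, "1": 1, "2": 1, "22": 1}


def imbalance_score(value_counts):
    """Return 1 - H / log2(k): 0 for a uniform distribution, 1 for a single class."""
    total = sum(value_counts.values())
    n_classes = len(value_counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum(
        (c / total) * math.log2(c / total)
        for c in value_counts.values()
        if c > 0
    )
    return 1.0 - entropy / math.log2(n_classes)


score = imbalance_score(counts)
print(f"Imbalance: {score:.1%}")
```

A score near 1 means a single value dominates. Here `<NA>` accounts for 22 of the 26 rows.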

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
na     22     84.6%
3      1      3.8%
1      1      3.8%
2      1      3.8%
22     1      3.8%

Correlations

          월별   확진자수  사망자수
월별      1.000  1.000     1.000
확진자수  1.000  1.000     1.000
사망자수  1.000  1.000     1.000

Missing values

The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.

Sample

First rows

    월별     확진자수  사망자수
0   2020-02  4         <NA>
1   2020-03  1         <NA>
2   2020-04  <NA>      <NA>
3   2020-05  <NA>      <NA>
4   2020-06  <NA>      <NA>
5   2020-07  <NA>      <NA>
6   2020-08  2         <NA>
7   2020-09  1         <NA>
8   2020-10  <NA>      <NA>
9   2020-11  1         <NA>

Last rows

    월별     확진자수  사망자수
16  2021-06  6         <NA>
17  2021-07  43        <NA>
18  2021-08  44        <NA>
19  2021-09  21        <NA>
20  2021-10  31        <NA>
21  2021-11  17        <NA>
22  2021-12  219       1
23  2022-01  190       <NA>
24  2022-02  2,516     2
25  2022-03  16,940    22
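
The last rows include values such as `2,516` and `16,940`, whose thousands separators are a plausible reason the 확진자수 column was profiled as Text rather than as a number. The sketch below shows one way such strings could be converted to integers before re-profiling. The helper `parse_count` is hypothetical, and treating the literal string `<NA>` as the missing marker is an assumption based on how the report displays missing cells.

```python
from typing import Optional


def parse_count(raw: Optional[str]) -> Optional[int]:
    """Convert a count such as '2,516' to an int; return None for missing values."""
    # Assumption: missing cells arrive as None or as the literal string "<NA>".
    if raw is None or raw.strip() in ("", "<NA>"):
        return None
    return int(raw.replace(",", ""))


# Values taken from the last rows of the sample, plus one missing marker.
raw_cases = ["190", "2,516", "16,940", "<NA>"]
parsed = [parse_count(value) for value in raw_cases]
print(parsed)
```

With the column stored as integers, numeric summaries such as minimum, maximum, and mean become available in place of character-level statistics.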