Overview

Dataset statistics

Number of variables2
Number of observations314
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.3 KiB
Average record size in memory17.4 B

Variable types

Text1
Numeric1

Dataset

Description부산광역시금정구_일자별코로나확진자수현황_20230926
Author부산광역시 금정구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15124001

Alerts

날짜 has unique valuesUnique
확진자수 has 268 (85.4%) zerosZeros

Reproduction

Analysis started2023-12-10 16:51:29.305311
Analysis finished2023-12-10 16:51:29.647845
Duration0.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

날짜
Text

UNIQUE 

Distinct314
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-11T01:51:29.959044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length10
Mean length10.041401
Min length10

Characters and Unicode

Total characters3153
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique314 ?
Unique (%)100.0%

Sample

1st row2020-01-01 ~ 2020-02-22
2nd row2020-02-23
3rd row2020-02-24
4th row2020-02-25
5th row2020-02-26
ValueCountFrequency (%)
2020-01-01 1
 
0.3%
2020-09-13 1
 
0.3%
2020-09-20 1
 
0.3%
2020-09-19 1
 
0.3%
2020-09-18 1
 
0.3%
2020-09-17 1
 
0.3%
2020-09-16 1
 
0.3%
2020-09-27 1
 
0.3%
2020-09-14 1
 
0.3%
2020-09-12 1
 
0.3%
Other values (306) 306
96.8%
2023-12-11T01:51:30.434763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1005
31.9%
2 808
25.6%
- 630
20.0%
1 260
 
8.2%
3 78
 
2.5%
7 62
 
2.0%
8 62
 
2.0%
5 62
 
2.0%
4 61
 
1.9%
6 61
 
1.9%
Other values (3) 64
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2520
79.9%
Dash Punctuation 630
 
20.0%
Space Separator 2
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1005
39.9%
2 808
32.1%
1 260
 
10.3%
3 78
 
3.1%
7 62
 
2.5%
8 62
 
2.5%
5 62
 
2.5%
4 61
 
2.4%
6 61
 
2.4%
9 61
 
2.4%
Dash Punctuation
ValueCountFrequency (%)
- 630
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3153
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1005
31.9%
2 808
25.6%
- 630
20.0%
1 260
 
8.2%
3 78
 
2.5%
7 62
 
2.0%
8 62
 
2.0%
5 62
 
2.0%
4 61
 
1.9%
6 61
 
1.9%
Other values (3) 64
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3153
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1005
31.9%
2 808
25.6%
- 630
20.0%
1 260
 
8.2%
3 78
 
2.5%
7 62
 
2.0%
8 62
 
2.0%
5 62
 
2.0%
4 61
 
1.9%
6 61
 
1.9%
Other values (3) 64
 
2.0%

확진자수
Real number (ℝ)

ZEROS 

Distinct7
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.24203822
Minimum0
Maximum6
Zeros268
Zeros (%)85.4%
Negative0
Negative (%)0.0%
Memory size2.9 KiB
2023-12-11T01:51:30.560659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1.35
Maximum6
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.72738557
Coefficient of variation (CV)3.0052509
Kurtosis22.421552
Mean0.24203822
Median Absolute Deviation (MAD)0
Skewness4.2551011
Sum76
Variance0.52908976
MonotonicityNot monotonic
2023-12-11T01:51:30.672036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
0 268
85.4%
1 30
 
9.6%
2 8
 
2.5%
3 5
 
1.6%
5 1
 
0.3%
4 1
 
0.3%
6 1
 
0.3%
ValueCountFrequency (%)
0 268
85.4%
1 30
 
9.6%
2 8
 
2.5%
3 5
 
1.6%
4 1
 
0.3%
5 1
 
0.3%
6 1
 
0.3%
ValueCountFrequency (%)
6 1
 
0.3%
5 1
 
0.3%
4 1
 
0.3%
3 5
 
1.6%
2 8
 
2.5%
1 30
 
9.6%
0 268
85.4%

Interactions

2023-12-11T01:51:29.354972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T01:51:29.480797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:51:29.612869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

날짜확진자수
02020-01-01 ~ 2020-02-220
12020-02-231
22020-02-242
32020-02-250
42020-02-261
52020-02-270
62020-02-280
72020-02-290
82020-03-010
92020-03-020
날짜확진자수
3042020-12-226
3052020-12-233
3062020-12-242
3072020-12-253
3082020-12-262
3092020-12-270
3102020-12-283
3112020-12-290
3122020-12-301
3132020-12-311