Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory322.3 KiB
Average record size in memory33.0 B

Variable types

Numeric1
Text1
DateTime1

Dataset

Description충청남도 코로나바이러스감염증19 발생현황입니다. 2020년 3월 부터 2021년 12월까지의 시군별 확진자 현황을 제공합니다.
Author충청남도
URLhttps://www.data.go.kr/data/15106644/fileData.do

Alerts

연번 has unique valuesUnique
확진자번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:45:02.878957
Analysis finished2023-12-12 22:45:03.268136
Duration0.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9033.848
Minimum4
Maximum18170
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T07:45:03.340741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile877.85
Q14489.75
median9004.5
Q313580.25
95-th percentile17251.2
Maximum18170
Range18166
Interquartile range (IQR)9090.5

Descriptive statistics

Standard deviation5245.973
Coefficient of variation (CV)0.58070194
Kurtosis-1.1936212
Mean9033.848
Median Absolute Deviation (MAD)4549
Skewness0.013191309
Sum90338480
Variance27520233
MonotonicityNot monotonic
2023-12-13T07:45:03.515441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15768 1
 
< 0.1%
17993 1
 
< 0.1%
8157 1
 
< 0.1%
12056 1
 
< 0.1%
16911 1
 
< 0.1%
12429 1
 
< 0.1%
10963 1
 
< 0.1%
10712 1
 
< 0.1%
8557 1
 
< 0.1%
8659 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
4 1
< 0.1%
5 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
15 1
< 0.1%
16 1
< 0.1%
ValueCountFrequency (%)
18170 1
< 0.1%
18169 1
< 0.1%
18168 1
< 0.1%
18166 1
< 0.1%
18165 1
< 0.1%
18163 1
< 0.1%
18160 1
< 0.1%
18159 1
< 0.1%
18156 1
< 0.1%
18155 1
< 0.1%

확진자번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T07:45:03.843866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length6.3724
Min length3

Characters and Unicode

Total characters63724
Distinct characters37
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st row보령#548
2nd row당진#892
3rd row부여#365
4th row아산#960
5th row천안#5517
ValueCountFrequency (%)
아산 3
 
< 0.1%
당진#1132 1
 
< 0.1%
보령#140 1
 
< 0.1%
천안#3573 1
 
< 0.1%
천안#2856 1
 
< 0.1%
천안#4077 1
 
< 0.1%
천안#5986 1
 
< 0.1%
공주#346 1
 
< 0.1%
천안#3682 1
 
< 0.1%
보령#548 1
 
< 0.1%
Other values (9991) 9991
99.9%
2023-12-13T07:45:04.306993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
# 9862
15.5%
1 5087
 
8.0%
2 4268
 
6.7%
3727
 
5.8%
3717
 
5.8%
3 3701
 
5.8%
4 3626
 
5.7%
3428
 
5.4%
5 3359
 
5.3%
6 3068
 
4.8%
Other values (27) 19881
31.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 33576
52.7%
Other Letter 20274
31.8%
Other Punctuation 9862
 
15.5%
Space Separator 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3727
18.4%
3717
18.3%
3428
16.9%
1649
8.1%
959
 
4.7%
959
 
4.7%
788
 
3.9%
627
 
3.1%
475
 
2.3%
475
 
2.3%
Other values (15) 3470
17.1%
Decimal Number
ValueCountFrequency (%)
1 5087
15.2%
2 4268
12.7%
3 3701
11.0%
4 3626
10.8%
5 3359
10.0%
6 3068
9.1%
7 2806
8.4%
8 2630
7.8%
0 2518
7.5%
9 2513
7.5%
Other Punctuation
ValueCountFrequency (%)
# 9862
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 43450
68.2%
Hangul 20274
31.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3727
18.4%
3717
18.3%
3428
16.9%
1649
8.1%
959
 
4.7%
959
 
4.7%
788
 
3.9%
627
 
3.1%
475
 
2.3%
475
 
2.3%
Other values (15) 3470
17.1%
Common
ValueCountFrequency (%)
# 9862
22.7%
1 5087
11.7%
2 4268
9.8%
3 3701
 
8.5%
4 3626
 
8.3%
5 3359
 
7.7%
6 3068
 
7.1%
7 2806
 
6.5%
8 2630
 
6.1%
0 2518
 
5.8%
Other values (2) 2525
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 43450
68.2%
Hangul 20274
31.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
# 9862
22.7%
1 5087
11.7%
2 4268
9.8%
3 3701
 
8.5%
4 3626
 
8.3%
5 3359
 
7.7%
6 3068
 
7.1%
7 2806
 
6.5%
8 2630
 
6.1%
0 2518
 
5.8%
Other values (2) 2525
 
5.8%
Hangul
ValueCountFrequency (%)
3727
18.4%
3717
18.3%
3428
16.9%
1649
8.1%
959
 
4.7%
959
 
4.7%
788
 
3.9%
627
 
3.1%
475
 
2.3%
475
 
2.3%
Other values (15) 3470
17.1%
Distinct534
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-03-01 00:00:00
Maximum2021-12-31 00:00:00
2023-12-13T07:45:04.437810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:45:04.550599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-13T07:45:03.037170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T07:45:03.152167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:45:03.228014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번확진자번호확진일자
1576715768보령#5482021-12-16
91649165당진#8922021-10-01
92969297부여#3652021-10-04
55115512아산#9602021-08-10
1578615787천안#55172021-12-17
1035110352태안#1942021-10-28
94849485부여#3702021-10-08
31603161아산#5612021-05-13
31763177논산#832021-05-15
98149815천안#33712021-10-16
연번확진자번호확진일자
47114712천안#17662021-07-27
88758876부여#3402021-09-26
64416442천안#23542021-08-24
1511915120천안#52982021-12-13
51365137논산#2442021-08-04
90289029천안#31252021-09-29
75807581천안#27042021-09-08
1651316514당진#16832021-12-20
348349금산#122020-09-10
97059706천안#33312021-10-13