Overview

Dataset statistics

Number of variables10
Number of observations910
Missing cells5460
Missing cells (%)60.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory79.2 KiB
Average record size in memory89.1 B

Variable types

Numeric3
DateTime1
Unsupported6

Dataset

Description"대구광역시 서구_코로나 확진자 일자별 현황"에 대한 데이터로 2020년부터 대구광역시 서구에서 발생한 코로나 확진자에 대한 일자별 현황을 제공합니다.
Author대구광역시 서구
URLhttps://www.data.go.kr/data/15080623/fileData.do

Alerts

순번 is highly overall correlated with 일일 확진자수 and 1 other fieldsHigh correlation
일일 확진자수 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
확진자 누계 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
Unnamed: 4 has 910 (100.0%) missing valuesMissing
Unnamed: 5 has 910 (100.0%) missing valuesMissing
Unnamed: 6 has 910 (100.0%) missing valuesMissing
Unnamed: 7 has 910 (100.0%) missing valuesMissing
Unnamed: 8 has 910 (100.0%) missing valuesMissing
Unnamed: 9 has 910 (100.0%) missing valuesMissing
순번 has unique valuesUnique
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
일일 확진자수 has 332 (36.5%) zerosZeros

Reproduction

Analysis started2023-12-12 07:18:49.682749
Analysis finished2023-12-12 07:18:51.056386
Duration1.37 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct910
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean505.54286
Minimum1
Maximum967
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.1 KiB
2023-12-12T16:18:51.145561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48.45
Q1278.25
median509.5
Q3739.75
95-th percentile921.55
Maximum967
Range966
Interquartile range (IQR)461.5

Descriptive statistics

Standard deviation271.49973
Coefficient of variation (CV)0.53704592
Kurtosis-1.1176939
Mean505.54286
Median Absolute Deviation (MAD)231
Skewness-0.07241807
Sum460044
Variance73712.103
MonotonicityStrictly increasing
2023-12-12T16:18:51.317171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
682 1
 
0.1%
656 1
 
0.1%
657 1
 
0.1%
658 1
 
0.1%
659 1
 
0.1%
660 1
 
0.1%
661 1
 
0.1%
662 1
 
0.1%
663 1
 
0.1%
Other values (900) 900
98.9%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
967 1
0.1%
966 1
0.1%
965 1
0.1%
964 1
0.1%
963 1
0.1%
962 1
0.1%
961 1
0.1%
960 1
0.1%
959 1
0.1%
958 1
0.1%

일자
Date

Distinct909
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size7.2 KiB
Minimum2020-02-18 00:00:00
Maximum2022-09-21 00:00:00
2023-12-12T16:18:51.495710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:18:51.947980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

일일 확진자수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct180
Distinct (%)19.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean66.389011
Minimum0
Maximum1383
Zeros332
Zeros (%)36.5%
Negative0
Negative (%)0.0%
Memory size8.1 KiB
2023-12-12T16:18:52.103992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q316
95-th percentile446.1
Maximum1383
Range1383
Interquartile range (IQR)16

Descriptive statistics

Standard deviation179.32981
Coefficient of variation (CV)2.7011971
Kurtosis14.947623
Mean66.389011
Median Absolute Deviation (MAD)2
Skewness3.6788254
Sum60414
Variance32159.18
MonotonicityNot monotonic
2023-12-12T16:18:52.252179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 332
36.5%
1 100
 
11.0%
2 52
 
5.7%
4 29
 
3.2%
3 27
 
3.0%
7 27
 
3.0%
6 23
 
2.5%
5 20
 
2.2%
8 18
 
2.0%
9 10
 
1.1%
Other values (170) 272
29.9%
ValueCountFrequency (%)
0 332
36.5%
1 100
 
11.0%
2 52
 
5.7%
3 27
 
3.0%
4 29
 
3.2%
5 20
 
2.2%
6 23
 
2.5%
7 27
 
3.0%
8 18
 
2.0%
9 10
 
1.1%
ValueCountFrequency (%)
1383 1
0.1%
1234 1
0.1%
1184 1
0.1%
1055 1
0.1%
1045 1
0.1%
1012 1
0.1%
981 1
0.1%
959 1
0.1%
942 1
0.1%
906 1
0.1%

확진자 누계
Real number (ℝ)

HIGH CORRELATION 

Distinct577
Distinct (%)63.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9308.2462
Minimum1
Maximum63355
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.1 KiB
2023-12-12T16:18:52.421921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile516.35
Q1560
median700
Q32099.25
95-th percentile47230.4
Maximum63355
Range63354
Interquartile range (IQR)1539.25

Descriptive statistics

Standard deviation17644.472
Coefficient of variation (CV)1.8955743
Kurtosis1.3703849
Mean9308.2462
Median Absolute Deviation (MAD)163.5
Skewness1.7595988
Sum8470504
Variance3.113274 × 108
MonotonicityNot monotonic
2023-12-12T16:18:52.561000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
560 45
 
4.9%
545 42
 
4.6%
542 42
 
4.6%
567 18
 
2.0%
559 17
 
1.9%
634 17
 
1.9%
569 11
 
1.2%
570 8
 
0.9%
629 7
 
0.8%
566 7
 
0.8%
Other values (567) 696
76.5%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
5 1
0.1%
11 1
0.1%
25 1
0.1%
41 1
0.1%
46 1
0.1%
84 1
0.1%
92 1
0.1%
104 1
0.1%
ValueCountFrequency (%)
63355 1
0.1%
63247 1
0.1%
63111 1
0.1%
62921 1
0.1%
62611 1
0.1%
62436 1
0.1%
62195 1
0.1%
61787 1
0.1%
60937 1
0.1%
60684 1
0.1%

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing910
Missing (%)100.0%
Memory size8.1 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing910
Missing (%)100.0%
Memory size8.1 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing910
Missing (%)100.0%
Memory size8.1 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing910
Missing (%)100.0%
Memory size8.1 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing910
Missing (%)100.0%
Memory size8.1 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing910
Missing (%)100.0%
Memory size8.1 KiB

Interactions

2023-12-12T16:18:50.493387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:18:49.839397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:18:50.151268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:18:50.595398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:18:49.937044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:18:50.261771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:18:50.712910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:18:50.038892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:18:50.375195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:18:52.695172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번일일 확진자수확진자 누계
순번1.0000.6160.845
일일 확진자수0.6161.0000.900
확진자 누계0.8450.9001.000
2023-12-12T16:18:52.792826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번일일 확진자수확진자 누계
순번1.0000.7621.000
일일 확진자수0.7621.0000.763
확진자 누계1.0000.7631.000

Missing values

2023-12-12T16:18:50.840120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:18:50.992424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번일자일일 확진자수확진자 누계Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
012020-02-1811<NA><NA><NA><NA><NA><NA>
122020-02-1912<NA><NA><NA><NA><NA><NA>
232020-02-2035<NA><NA><NA><NA><NA><NA>
342020-02-21611<NA><NA><NA><NA><NA><NA>
452020-02-221425<NA><NA><NA><NA><NA><NA>
562020-02-231641<NA><NA><NA><NA><NA><NA>
672020-02-24546<NA><NA><NA><NA><NA><NA>
782020-02-253884<NA><NA><NA><NA><NA><NA>
892020-02-26892<NA><NA><NA><NA><NA><NA>
9102020-02-2712104<NA><NA><NA><NA><NA><NA>
순번일자일일 확진자수확진자 누계Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
9009582022-09-0630060684<NA><NA><NA><NA><NA><NA>
9019592022-09-0725360937<NA><NA><NA><NA><NA><NA>
9029602022-09-1285061787<NA><NA><NA><NA><NA><NA>
9039612022-09-1340862195<NA><NA><NA><NA><NA><NA>
9049622022-09-1424162436<NA><NA><NA><NA><NA><NA>
9059632022-09-1517562611<NA><NA><NA><NA><NA><NA>
9069642022-09-1831062921<NA><NA><NA><NA><NA><NA>
9079652022-09-1919063111<NA><NA><NA><NA><NA><NA>
9089662022-09-2013663247<NA><NA><NA><NA><NA><NA>
9099672022-09-2110863355<NA><NA><NA><NA><NA><NA>