Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory488.3 KiB
Average record size in memory50.0 B

Variable types

Numeric2
Categorical2
DateTime1

Dataset

Description2022년 9월 21일 기준 인천광역시 코로나19 확진자 현황(연번,관리기관,확진일,성별,나이) 데이터 입니다. * 기준일: 2020-01-20 ~2022-09-21
Author인천광역시
URLhttps://www.data.go.kr/data/15085834/fileData.do

Alerts

인천 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:11:53.464301
Analysis finished2023-12-12 23:11:54.326407
Duration0.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인천
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20142.827
Minimum1
Maximum40627
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T08:11:54.394309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1945.8
Q19992.75
median20034.5
Q330266
95-th percentile38564.9
Maximum40627
Range40626
Interquartile range (IQR)20273.25

Descriptive statistics

Standard deviation11721.031
Coefficient of variation (CV)0.58189602
Kurtosis-1.1992451
Mean20142.827
Median Absolute Deviation (MAD)10117.5
Skewness0.023190102
Sum2.0142827 × 108
Variance1.3738256 × 108
MonotonicityNot monotonic
2023-12-13T08:11:54.544429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37347 1
 
< 0.1%
36655 1
 
< 0.1%
13781 1
 
< 0.1%
4720 1
 
< 0.1%
31736 1
 
< 0.1%
3641 1
 
< 0.1%
9512 1
 
< 0.1%
10177 1
 
< 0.1%
22557 1
 
< 0.1%
22935 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
4 1
< 0.1%
8 1
< 0.1%
10 1
< 0.1%
16 1
< 0.1%
22 1
< 0.1%
36 1
< 0.1%
39 1
< 0.1%
41 1
< 0.1%
49 1
< 0.1%
ValueCountFrequency (%)
40627 1
< 0.1%
40624 1
< 0.1%
40621 1
< 0.1%
40620 1
< 0.1%
40617 1
< 0.1%
40614 1
< 0.1%
40605 1
< 0.1%
40598 1
< 0.1%
40597 1
< 0.1%
40595 1
< 0.1%

관리기관
Categorical

Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
부평구
1922 
남동구
1734 
서구
1641 
미추홀구
1407 
연수구
1401 
Other values (6)
1895 

Length

Max length8
Median length3
Mean length2.9038
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row연수구
2nd row연수구
3rd row남동구
4th row연수구
5th row계양구

Common Values

ValueCountFrequency (%)
부평구 1922
19.2%
남동구 1734
17.3%
서구 1641
16.4%
미추홀구 1407
14.1%
연수구 1401
14.0%
계양구 933
9.3%
중구 532
 
5.3%
동구 201
 
2.0%
강화군 197
 
2.0%
옹진군 31
 
0.3%

Length

2023-12-13T08:11:54.689553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부평구 1922
19.2%
남동구 1734
17.3%
서구 1641
16.4%
미추홀구 1407
14.1%
연수구 1401
14.0%
계양구 933
9.3%
중구 532
 
5.3%
동구 201
 
2.0%
강화군 197
 
2.0%
옹진군 31
 
0.3%
Other values (2) 2
 
< 0.1%
Distinct551
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-01-20 00:00:00
Maximum2022-01-17 00:00:00
2023-12-13T08:11:54.814404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:11:54.982167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
5055 
4945 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
5055
50.5%
4945
49.5%

Length

2023-12-13T08:11:55.094391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:11:55.204461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5055
50.5%
4945
49.5%

나이
Real number (ℝ)

Distinct103
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean40.6454
Minimum-1
Maximum101
Zeros36
Zeros (%)0.4%
Negative1
Negative (%)< 0.1%
Memory size166.0 KiB
2023-12-13T08:11:55.337697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-1
5-th percentile6
Q124
median40
Q358
95-th percentile76
Maximum101
Range102
Interquartile range (IQR)34

Descriptive statistics

Standard deviation21.632528
Coefficient of variation (CV)0.53222573
Kurtosis-0.79229146
Mean40.6454
Median Absolute Deviation (MAD)17
Skewness0.087897687
Sum406454
Variance467.96626
MonotonicityNot monotonic
2023-12-13T08:11:55.544316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
61 190
 
1.9%
40 183
 
1.8%
29 177
 
1.8%
62 176
 
1.8%
41 173
 
1.7%
30 170
 
1.7%
49 169
 
1.7%
6 168
 
1.7%
60 168
 
1.7%
28 166
 
1.7%
Other values (93) 8260
82.6%
ValueCountFrequency (%)
-1 1
 
< 0.1%
0 36
 
0.4%
1 57
 
0.6%
2 58
 
0.6%
3 62
 
0.6%
4 83
0.8%
5 108
1.1%
6 168
1.7%
7 121
1.2%
8 99
1.0%
ValueCountFrequency (%)
101 1
 
< 0.1%
100 1
 
< 0.1%
99 1
 
< 0.1%
98 1
 
< 0.1%
97 3
 
< 0.1%
96 3
 
< 0.1%
95 4
 
< 0.1%
94 7
0.1%
93 12
0.1%
92 11
0.1%

Interactions

2023-12-13T08:11:53.931929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:11:53.731591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:11:54.103090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:11:53.848489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:11:55.641248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인천관리기관성별나이
인천1.0000.1450.0900.308
관리기관0.1451.0000.0330.127
성별0.0900.0331.0000.110
나이0.3080.1270.1101.000
2023-12-13T08:11:55.740421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별관리기관
성별1.0000.032
관리기관0.0321.000
2023-12-13T08:11:55.833602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인천나이관리기관성별
인천1.000-0.0810.0620.069
나이-0.0811.0000.0540.085
관리기관0.0620.0541.0000.032
성별0.0690.0850.0321.000

Missing values

2023-12-13T08:11:54.204551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:11:54.289489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인천관리기관확진일성별나이
3734637347연수구2022-01-0120
2470224703연수구2021-11-2972
3782637827남동구2022-01-049
19451946연수구2020-12-1549
2168821689계양구2021-11-1533
1377313774계양구2021-09-1524
81058106부평구2021-07-2050
1905919060남동구2021-10-285
2961729618부평구2021-12-139
2106621067서구2021-11-1165
인천관리기관확진일성별나이
3027930280미추홀구2021-12-1471
1945019451부평구2021-10-318
1747517476연수구2021-10-1351
11211122서구2020-11-1852
71637164미추홀구2021-07-0927
87978798미추홀구2021-07-2821
1159511596미추홀구2021-08-2715
2188221883부평구2021-11-1764
93419342서구2021-08-0372
3615136152계양구2021-12-2825