Overview

Dataset statistics

Number of variables: 6
Number of observations: 54
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 8
Duplicate rows (%): 14.8%
Total size in memory: 2.7 KiB
Average record size in memory: 50.4 B
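
The derived figures in these statistics can be checked with simple arithmetic. The sketch below uses only the counts reported above. The variable names are illustrative, and the memory check is approximate because the report rounds the average record size.

```python
# Counts taken from the dataset statistics in this report.
n_observations = 54
n_duplicate_rows = 8
avg_record_bytes = 50.4  # rounded value as reported

# Share of duplicate rows, as a percentage of all observations.
duplicate_pct = 100 * n_duplicate_rows / n_observations

# Approximate total size: average record size times row count, in KiB.
total_kib = avg_record_bytes * n_observations / 1024

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Total size in memory: {total_kib:.1f} KiB")
```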

Variable types

Categorical: 6

Dataset

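Description: Information on suspected cases identified during quarantine inspection under the Quarantine Act and its Enforcement Rules (quarantine date and time, quarantine station, nationality, sex, passenger/crew classification, test result)
Author: 질병관리청 (Korea Disease Control and Prevention Agency)
URL: https://www.data.go.kr/data/3074710/fileData.do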

Alerts

승객승무원구분 has constant value "승객" (passenger) [Constant]
검사결과 has constant value "음성" (negative) [Constant]
Dataset has 8 (14.8%) duplicate rows [Duplicates]
검역소 is highly overall correlated with 검역일시 and 2 other fields [High correlation]
국적 is highly overall correlated with 검역일시 and 2 other fields [High correlation]
검역일시 is highly overall correlated with 검역소 and 1 other field [High correlation]
성별 is highly overall correlated with 검역소 and 1 other field [High correlation]
검역소 is highly imbalanced (86.7%) [Imbalance]
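
The 86.7% imbalance figure for 검역소 is consistent with a score of one minus the normalized Shannon entropy of the class counts (53 and 1). The report does not state the formula, so treat the sketch below as an assumption that happens to reproduce the reported value. `imbalance_score` is a hypothetical helper name, not a library function.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / log2(number of classes)).

    0 means perfectly balanced; values near 1 mean one class dominates.
    Assumed formula: the report itself does not document it.
    """
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # choice made for this sketch: a single class scores 0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

# Class counts for 검역소 as listed in this report.
score = imbalance_score([53, 1])
print(f"{score:.1%}")
```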

Reproduction

Analysis started: 2023-12-12 05:13:41.641870
Analysis finished: 2023-12-12 05:13:42.045799
Duration: 0.4 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

검역일시 (quarantine date and time)
Categorical

HIGH CORRELATION 

Distinct: 23
Distinct (%): 42.6%
Missing: 0
Missing (%): 0.0%
Memory size: 564.0 B

Value              Count
2022-02-04 16:35   10
2022-02-01 16:25   7
2022-01-31 16:50   7
2022-11-24 16:50   4
2022-12-28 11:35   3
Other values (18)  23

Length

Max length: 16
Median length: 16
Mean length: 16
Min length: 16

Unique

Unique: 15
Unique (%): 27.8%

Sample

1st row: 2022-01-25 16:28
2nd row: 2022-01-27 16:35
3rd row: 2022-01-27 16:35
4th row: 2022-01-29 16:50
5th row: 2022-01-31 16:35

Common Values

Value              Count  Frequency (%)
2022-02-04 16:35   10     18.5%
2022-02-01 16:25   7      13.0%
2022-01-31 16:50   7      13.0%
2022-11-24 16:50   4      7.4%
2022-12-28 11:35   3      5.6%
2022-12-11 10:30   3      5.6%
2022-11-21 11:35   3      5.6%
2022-01-27 16:35   2      3.7%
2022-11-22 12:05   1      1.9%
2022-01-29 16:50   1      1.9%
Other values (13)  13     24.1%

Length

[Figure: Histogram of lengths of the category]

Words

Value              Count  Frequency (%)
16:35              14     13.0%
16:50              14     13.0%
2022-02-04         11     10.2%
2022-01-31         8      7.4%
2022-02-01         7      6.5%
16:25              7      6.5%
11:35              6      5.6%
2022-11-24         4      3.7%
2022-12-11         4      3.7%
16:55              3      2.8%
Other values (23)  30     27.8%
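
The word-level counts above appear to come from splitting each value on whitespace, so every timestamp contributes a date token and a time token (54 values, 108 tokens, which is why 14 occurrences show as 13.0%). The following is a minimal sketch of that counting, using only the first ten rows listed in the Sample section of this report. The exact tokenizer the tool uses is an assumption.

```python
from collections import Counter

# First ten 검역일시 values, copied from the Sample section of this report.
sample_values = [
    "2022-01-25 16:28",
    "2022-01-27 16:35",
    "2022-01-27 16:35",
    "2022-01-29 16:50",
    "2022-01-31 16:35",
    "2022-01-31 16:50",
    "2022-01-31 16:50",
    "2022-01-31 16:50",
    "2022-01-31 16:50",
    "2022-01-31 16:50",
]

# Split every value on whitespace and count the resulting tokens.
word_counts = Counter(token for value in sample_values for token in value.split())
total_tokens = sum(word_counts.values())

for token, count in word_counts.most_common():
    print(f"{token:<12}{count:>3}  {100 * count / total_tokens:.1f}%")
```

Because this runs on ten rows only, its counts are smaller than those in the full table.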

검역소 (quarantine station)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 2
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 564.0 B

Value               Count
국립인천공항검역소  53
국립평택검역소      1

Length

Max length: 9
Median length: 9
Mean length: 8.962963
Min length: 7

Unique

Unique: 1
Unique (%): 1.9%

Sample

1st row: 국립인천공항검역소
2nd row: 국립인천공항검역소
3rd row: 국립인천공항검역소
4th row: 국립인천공항검역소
5th row: 국립인천공항검역소
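
The length statistics for 검역소 follow directly from the two category values and their counts (53 and 1). The sketch below recomputes them by counting characters with Python's `len`, which assumes the strings are stored as precomposed Hangul syllables.

```python
# Category values and counts for 검역소 as listed in this report.
value_counts = {"국립인천공항검역소": 53, "국립평택검역소": 1}

n_rows = sum(value_counts.values())
lengths = {value: len(value) for value in value_counts}

# Count-weighted mean of the string lengths.
mean_length = sum(lengths[v] * c for v, c in value_counts.items()) / n_rows
max_length = max(lengths.values())
min_length = min(lengths.values())

print(f"Max length: {max_length}")
print(f"Mean length: {mean_length:.6f}")
print(f"Min length: {min_length}")
```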

Common Values

Value               Count  Frequency (%)
국립인천공항검역소  53     98.1%
국립평택검역소      1      1.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Words

Value               Count  Frequency (%)
국립인천공항검역소  53     98.1%
국립평택검역소      1      1.9%

국적 (nationality)
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 564.0 B

Value         Count
<NA>          34
한국 (Korea)  20

Length

Max length: 4
Median length: 4
Mean length: 3.2592593
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value         Count  Frequency (%)
<NA>          34     63.0%
한국 (Korea)  20     37.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Words

Value  Count  Frequency (%)
na     34     63.0%
한국   20     37.0%
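
The report counts `<NA>` as an ordinary category of length 4 while listing 0 missing cells. This suggests that missing nationality values reached the profiler as the literal string `<NA>`. The sketch below shows how the reported mean length arises from that string, and what the missing share would be if the token were converted to a real null first. The row order and the name `NA_TOKEN` are assumptions for illustration.

```python
NA_TOKEN = "<NA>"  # literal string seen in the report, assumed to stand for missing

# Rebuild the column from the reported counts (row order is not meaningful here).
nationality = [NA_TOKEN] * 34 + ["한국"] * 20

# With the token kept as text, it contributes 4 characters per row.
mean_length = sum(len(v) for v in nationality) / len(nationality)

# Converting the token to None makes the gap visible as missing data.
cleaned = [None if v == NA_TOKEN else v for v in nationality]
missing = sum(v is None for v in cleaned)
missing_pct = 100 * missing / len(cleaned)

print(f"Mean length with token as text: {mean_length:.7f}")
print(f"Missing after conversion: {missing} ({missing_pct:.1f}%)")
```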

성별 (sex)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 564.0 B

Value    Count
<NA>     34
(blank)  13

Length

Max length: 4
Median length: 4
Mean length: 2.8888889
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value    Count  Frequency (%)
<NA>     34     63.0%
(blank)  13     24.1%
(blank)  7      13.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Words

Value    Count  Frequency (%)
na       34     63.0%
(blank)  13     24.1%
(blank)  7      13.0%

승객승무원구분 (passenger/crew classification)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 564.0 B

Value             Count
승객 (passenger)  54

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 승객
2nd row: 승객
3rd row: 승객
4th row: 승객
5th row: 승객

Common Values

Value             Count  Frequency (%)
승객 (passenger)  54     100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Words

Value  Count  Frequency (%)
승객   54     100.0%

검사결과 (test result)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 564.0 B

Value            Count
음성 (negative)  54

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 음성
2nd row: 음성
3rd row: 음성
4th row: 음성
5th row: 음성

Common Values

Value            Count  Frequency (%)
음성 (negative)  54     100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Words

Value  Count  Frequency (%)
음성   54     100.0%

Correlations

[Figure: correlation heatmap]

          검역일시  검역소  성별
검역일시  1.000     1.000   0.000
검역소    1.000     1.000   NaN
성별      0.000     NaN     1.000

[Figure: correlation heatmap]

          검역소  국적   검역일시  성별
검역소    1.000   1.000  0.772     1.000
국적      1.000   1.000  1.000     1.000
검역일시  0.772   1.000  1.000     0.000
성별      1.000   1.000  0.000     1.000

[Figure: correlation heatmap]

          검역일시  검역소  국적   성별
검역일시  1.000     0.772   1.000  0.000
검역소    0.772     1.000   1.000  1.000
국적      1.000     1.000   1.000  1.000
성별      0.000     1.000   1.000  1.000
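
The report does not name the association measure behind these matrices. A common choice for pairs of categorical fields is Cramér's V, and the sketch below computes the plain (uncorrected) version for 국적 against 성별. The contingency table is an assumption inferred from the counts in this report: all 34 rows with `<NA>` nationality also have `<NA>` sex, and the 20 한국 rows split 13 and 7 between the two sex values that appear blank here. Under that assumption the result is 1.0, which agrees with the 1.000 shown for this pair. The profiler may apply a bias correction that this sketch omits.

```python
import math

def cramers_v(table):
    """Plain Cramér's V for a contingency table given as a list of rows.

    Assumes every row and column total is non-zero.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Rows: 국적 = <NA>, 한국. Columns: 성별 = <NA>, first blank value, second blank value.
# Assumed cross-tabulation; the report shows only marginal counts.
nationality_by_sex = [
    [34, 0, 0],
    [0, 13, 7],
]

v = cramers_v(nationality_by_sex)
print(f"Cramér's V: {v:.3f}")
```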

Missing values

[Figure: nullity bar chart] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index  검역일시          검역소              국적  성별     승객승무원구분  검사결과
0      2022-01-25 16:28  국립인천공항검역소  <NA>  <NA>     승객            음성
1      2022-01-27 16:35  국립인천공항검역소  <NA>  <NA>     승객            음성
2      2022-01-27 16:35  국립인천공항검역소  <NA>  <NA>     승객            음성
3      2022-01-29 16:50  국립인천공항검역소  <NA>  <NA>     승객            음성
4      2022-01-31 16:35  국립인천공항검역소  <NA>  <NA>     승객            음성
5      2022-01-31 16:50  국립인천공항검역소  <NA>  <NA>     승객            음성
6      2022-01-31 16:50  국립인천공항검역소  <NA>  <NA>     승객            음성
7      2022-01-31 16:50  국립인천공항검역소  <NA>  <NA>     승객            음성
8      2022-01-31 16:50  국립인천공항검역소  <NA>  <NA>     승객            음성
9      2022-01-31 16:50  국립인천공항검역소  <NA>  <NA>     승객            음성

Last rows

Index  검역일시          검역소              국적  성별     승객승무원구분  검사결과
44     2022-12-06 16:50  국립인천공항검역소  한국  (blank)  승객            음성
45     2022-12-11 10:30  국립인천공항검역소  한국  (blank)  승객            음성
46     2022-12-11 10:30  국립인천공항검역소  한국  (blank)  승객            음성
47     2022-12-11 10:30  국립인천공항검역소  한국  (blank)  승객            음성
48     2022-12-11 16:55  국립인천공항검역소  한국  (blank)  승객            음성
49     2022-12-21 16:50  국립인천공항검역소  한국  (blank)  승객            음성
50     2022-12-23 18:30  국립인천공항검역소  한국  (blank)  승객            음성
51     2022-12-28 11:35  국립인천공항검역소  한국  (blank)  승객            음성
52     2022-12-28 11:35  국립인천공항검역소  한국  (blank)  승객            음성
53     2022-12-28 11:35  국립인천공항검역소  한국  (blank)  승객            음성

Duplicate rows

Most frequently occurring

Index  검역일시          검역소              국적  성별     승객승무원구분  검사결과  # duplicates
3      2022-02-04 16:35  국립인천공항검역소  <NA>  <NA>     승객            음성      10
1      2022-01-31 16:50  국립인천공항검역소  <NA>  <NA>     승객            음성      7
2      2022-02-01 16:25  국립인천공항검역소  <NA>  <NA>     승객            음성      7
5      2022-11-24 16:50  국립인천공항검역소  한국  (blank)  승객            음성      3
0      2022-01-27 16:35  국립인천공항검역소  <NA>  <NA>     승객            음성      2
4      2022-11-21 11:35  국립인천공항검역소  한국  (blank)  승객            음성      2
6      2022-12-11 10:30  국립인천공항검역소  한국  (blank)  승객            음성      2
7      2022-12-28 11:35  국립인천공항검역소  한국  (blank)  승객            음성      2
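
The duplicate table lists 8 distinct row patterns, which matches the "8 duplicate rows" figure in the overview. The "# duplicates" column gives how many times each pattern occurs. A minimal sketch of that grouping is shown below, using only the first ten rows from the Sample section, so its counts are smaller than those in the full table. The tuple layout is an illustrative choice.

```python
from collections import Counter

STATION = "국립인천공항검역소"

# First ten rows from the Sample section:
# (검역일시, 검역소, 국적, 성별, 승객승무원구분, 검사결과)
timestamps = [
    "2022-01-25 16:28", "2022-01-27 16:35", "2022-01-27 16:35",
    "2022-01-29 16:50", "2022-01-31 16:35", "2022-01-31 16:50",
    "2022-01-31 16:50", "2022-01-31 16:50", "2022-01-31 16:50",
    "2022-01-31 16:50",
]
rows = [(ts, STATION, "<NA>", "<NA>", "승객", "음성") for ts in timestamps]

# Count identical rows and keep only the patterns that occur more than once.
row_counts = Counter(rows)
duplicated = {row: count for row, count in row_counts.items() if count > 1}

for row, count in sorted(duplicated.items(), key=lambda item: -item[1]):
    print(row[0], count)
```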