Overview

Dataset statistics

Number of variables11
Number of observations49
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.3 KiB
Average record size in memory90.7 B

Variable types

Categorical11

Dataset

Description검역구역내 채취 가검물에 대한 검사정보 (구분, 채취일자, 가검물분류, 상세가검물, 채취장소, 검사기관, 검사종류, 검출일자, 검출균, 법정감염병, 법정군)
Author질병관리청
URLhttps://www.data.go.kr/data/3074718/fileData.do

Alerts

구분 has constant value ""Constant
검사종류 has constant value ""Constant
법정감염병 is highly overall correlated with 검출균 and 1 other fieldsHigh correlation
검사기관 is highly overall correlated with 채취일자 and 2 other fieldsHigh correlation
검출균 is highly overall correlated with 법정감염병 and 1 other fieldsHigh correlation
가검물분류 is highly overall correlated with 상세가검물 and 1 other fieldsHigh correlation
채취일자 is highly overall correlated with 검사기관 and 1 other fieldsHigh correlation
검출일자 is highly overall correlated with 채취일자High correlation
법정군 is highly overall correlated with 검출균 and 1 other fieldsHigh correlation
상세가검물 is highly overall correlated with 가검물분류 and 2 other fieldsHigh correlation
채취장소 is highly overall correlated with 가검물분류 and 2 other fieldsHigh correlation
검사기관 is highly imbalanced (66.8%)Imbalance

Reproduction

Analysis started2023-12-12 02:30:46.271866
Analysis finished2023-12-12 02:30:47.150112
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
선박
49 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row선박
2nd row선박
3rd row선박
4th row선박
5th row선박

Common Values

ValueCountFrequency (%)
선박 49
100.0%

Length

2023-12-12T11:30:47.222094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:30:47.330871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
선박 49
100.0%

채취일자
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)16.3%
Missing0
Missing (%)0.0%
Memory size524.0 B
2022-09-19
10 
2022-07-18
2022-08-22
2022-08-08
2022-07-04
Other values (3)
11 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-07-04
2nd row2022-07-04
3rd row2022-07-04
4th row2022-07-04
5th row2022-07-04

Common Values

ValueCountFrequency (%)
2022-09-19 10
20.4%
2022-07-18 8
16.3%
2022-08-22 8
16.3%
2022-08-08 7
14.3%
2022-07-04 5
10.2%
2022-09-05 5
10.2%
2022-07-15 3
 
6.1%
2022-07-19 3
 
6.1%

Length

2023-12-12T11:30:47.444402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:30:47.580899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-09-19 10
20.4%
2022-07-18 8
16.3%
2022-08-22 8
16.3%
2022-08-08 7
14.3%
2022-07-04 5
10.2%
2022-09-05 5
10.2%
2022-07-15 3
 
6.1%
2022-07-19 3
 
6.1%

가검물분류
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
해수
36 
기타
13 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row해수
2nd row해수
3rd row해수
4th row해수
5th row해수

Common Values

ValueCountFrequency (%)
해수 36
73.5%
기타 13
 
26.5%

Length

2023-12-12T11:30:47.719812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:30:47.826314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해수 36
73.5%
기타 13
 
26.5%

상세가검물
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)10.2%
Missing0
Missing (%)0.0%
Memory size524.0 B
해수
33 
내항
외항
조위관측소
 
3
비브리오넷
 
3

Length

Max length5
Median length2
Mean length2.3673469
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row해수
2nd row해수
3rd row해수
4th row해수
5th row해수

Common Values

ValueCountFrequency (%)
해수 33
67.3%
내항 5
 
10.2%
외항 5
 
10.2%
조위관측소 3
 
6.1%
비브리오넷 3
 
6.1%

Length

2023-12-12T11:30:48.260313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:30:48.388607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해수 33
67.3%
내항 5
 
10.2%
외항 5
 
10.2%
조위관측소 3
 
6.1%
비브리오넷 3
 
6.1%

채취장소
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)20.4%
Missing0
Missing (%)0.0%
Memory size524.0 B
신국제
13 
물량장
10 
인천조위소
삼척어판장
후진항
Other values (5)

Length

Max length9
Median length3
Mean length4
Min length3

Unique

Unique4 ?
Unique (%)8.2%

Sample

1st row물량장
2nd row신국제
3rd row신국제
4th row인천조위소
5th row인천조위소

Common Values

ValueCountFrequency (%)
신국제 13
26.5%
물량장 10
20.4%
인천조위소 9
18.4%
삼척어판장 5
 
10.2%
후진항 5
 
10.2%
묵호조위소 3
 
6.1%
경포천 1
 
2.0%
군산외항조위관측소 1
 
2.0%
외항 7부두 1
 
2.0%
동해항 여객터미널 1
 
2.0%

Length

2023-12-12T11:30:48.540366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:30:48.699480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신국제 13
25.5%
물량장 10
19.6%
인천조위소 9
17.6%
삼척어판장 5
 
9.8%
후진항 5
 
9.8%
묵호조위소 3
 
5.9%
경포천 1
 
2.0%
군산외항조위관측소 1
 
2.0%
외항 1
 
2.0%
7부두 1
 
2.0%
Other values (2) 2
 
3.9%

검사기관
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
수도권질병대응센터
46 
충청권질병대응센터
 
3

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수도권질병대응센터
2nd row수도권질병대응센터
3rd row수도권질병대응센터
4th row수도권질병대응센터
5th row수도권질병대응센터

Common Values

ValueCountFrequency (%)
수도권질병대응센터 46
93.9%
충청권질병대응센터 3
 
6.1%

Length

2023-12-12T11:30:48.876930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:30:48.981548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수도권질병대응센터 46
93.9%
충청권질병대응센터 3
 
6.1%

검사종류
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
Vibrio
49 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowVibrio
2nd rowVibrio
3rd rowVibrio
4th rowVibrio
5th rowVibrio

Common Values

ValueCountFrequency (%)
Vibrio 49
100.0%

Length

2023-12-12T11:30:49.111277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:30:49.233725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
vibrio 49
100.0%

검출일자
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size524.0 B
2022-07-26
11 
2022-09-27
10 
2022-08-30
2022-08-16
2022-07-12
Other values (2)

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-07-12
2nd row2022-07-12
3rd row2022-07-12
4th row2022-07-12
5th row2022-07-12

Common Values

ValueCountFrequency (%)
2022-07-26 11
22.4%
2022-09-27 10
20.4%
2022-08-30 8
16.3%
2022-08-16 7
14.3%
2022-07-12 5
10.2%
2022-09-13 5
10.2%
2022-07-15 3
 
6.1%

Length

2023-12-12T11:30:49.357106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:30:49.506076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-07-26 11
22.4%
2022-09-27 10
20.4%
2022-08-30 8
16.3%
2022-08-16 7
14.3%
2022-07-12 5
10.2%
2022-09-13 5
10.2%
2022-07-15 3
 
6.1%

검출균
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
Vibrio parahaemolyticus
32 
Vibrio vulnificus
14 
Vibrio cholerae non O1
 
3

Length

Max length23
Median length23
Mean length21.22449
Min length17

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowVibrio parahaemolyticus
2nd rowVibrio parahaemolyticus
3rd rowVibrio vulnificus
4th rowVibrio parahaemolyticus
5th rowVibrio vulnificus

Common Values

ValueCountFrequency (%)
Vibrio parahaemolyticus 32
65.3%
Vibrio vulnificus 14
28.6%
Vibrio cholerae non O1 3
 
6.1%

Length

2023-12-12T11:30:49.677412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:30:49.809901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
vibrio 49
47.1%
parahaemolyticus 32
30.8%
vulnificus 14
 
13.5%
cholerae 3
 
2.9%
non 3
 
2.9%
o1 3
 
2.9%

법정감염병
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
장염비브리오균 감염증
32 
비브리오패혈증
14 
<NA>
 
3

Length

Max length11
Median length11
Mean length9.4285714
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row장염비브리오균 감염증
2nd row장염비브리오균 감염증
3rd row비브리오패혈증
4th row장염비브리오균 감염증
5th row비브리오패혈증

Common Values

ValueCountFrequency (%)
장염비브리오균 감염증 32
65.3%
비브리오패혈증 14
28.6%
<NA> 3
 
6.1%

Length

2023-12-12T11:30:49.966588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:30:50.086749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
장염비브리오균 32
39.5%
감염증 32
39.5%
비브리오패혈증 14
17.3%
na 3
 
3.7%

법정군
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size524.0 B
지정
32 
3군
14 
<NA>
 
3

Length

Max length4
Median length2
Mean length2.122449
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지정
2nd row지정
3rd row3군
4th row지정
5th row3군

Common Values

ValueCountFrequency (%)
지정 32
65.3%
3군 14
28.6%
<NA> 3
 
6.1%

Length

2023-12-12T11:30:50.210213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:30:50.337739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지정 32
65.3%
3군 14
28.6%
na 3
 
6.1%

Correlations

2023-12-12T11:30:50.438701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
채취일자가검물분류상세가검물채취장소검사기관검출일자검출균법정감염병법정군
채취일자1.0000.5660.6470.3491.0001.0000.3220.3780.378
가검물분류0.5661.0001.0001.0000.0000.4020.2430.5020.502
상세가검물0.6471.0001.0001.0001.0000.2030.3550.3820.382
채취장소0.3491.0001.0001.0001.0000.0000.2270.5100.510
검사기관1.0000.0001.0001.0001.0000.3200.2180.4090.409
검출일자1.0000.4020.2030.0000.3201.0000.0000.0000.000
검출균0.3220.2430.3550.2270.2180.0001.0000.9970.997
법정감염병0.3780.5020.3820.5100.4090.0000.9971.0000.997
법정군0.3780.5020.3820.5100.4090.0000.9970.9971.000
2023-12-12T11:30:50.568914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정감염병검사기관검출균가검물분류채취일자검출일자법정군상세가검물채취장소
법정감염병1.0000.2680.9470.3340.2580.0000.9470.4470.350
검사기관0.2681.0000.3520.0000.9340.3200.2680.9680.911
검출균0.9470.3521.0000.3920.1960.0000.9470.2780.111
가검물분류0.3340.0000.3921.0000.3960.4050.3340.9680.911
채취일자0.2580.9340.1960.3961.0000.9880.2580.4510.157
검출일자0.0000.3200.0000.4050.9881.0000.0000.1170.000
법정군0.9470.2680.9470.3340.2580.0001.0000.4470.350
상세가검물0.4470.9680.2780.9680.4510.1170.4471.0000.941
채취장소0.3500.9110.1110.9110.1570.0000.3500.9411.000
2023-12-12T11:30:50.715580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
채취일자가검물분류상세가검물채취장소검사기관검출일자검출균법정감염병법정군
채취일자1.0000.3960.4510.1570.9340.9880.1960.2580.258
가검물분류0.3961.0000.9680.9110.0000.4050.3920.3340.334
상세가검물0.4510.9681.0000.9410.9680.1170.2780.4470.447
채취장소0.1570.9110.9411.0000.9110.0000.1110.3500.350
검사기관0.9340.0000.9680.9111.0000.3200.3520.2680.268
검출일자0.9880.4050.1170.0000.3201.0000.0000.0000.000
검출균0.1960.3920.2780.1110.3520.0001.0000.9470.947
법정감염병0.2580.3340.4470.3500.2680.0000.9471.0000.947
법정군0.2580.3340.4470.3500.2680.0000.9470.9471.000

Missing values

2023-12-12T11:30:46.881810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:30:47.089058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분채취일자가검물분류상세가검물채취장소검사기관검사종류검출일자검출균법정감염병법정군
0선박2022-07-04해수해수물량장수도권질병대응센터Vibrio2022-07-12Vibrio parahaemolyticus장염비브리오균 감염증지정
1선박2022-07-04해수해수신국제수도권질병대응센터Vibrio2022-07-12Vibrio parahaemolyticus장염비브리오균 감염증지정
2선박2022-07-04해수해수신국제수도권질병대응센터Vibrio2022-07-12Vibrio vulnificus비브리오패혈증3군
3선박2022-07-04해수해수인천조위소수도권질병대응센터Vibrio2022-07-12Vibrio parahaemolyticus장염비브리오균 감염증지정
4선박2022-07-04해수해수인천조위소수도권질병대응센터Vibrio2022-07-12Vibrio vulnificus비브리오패혈증3군
5선박2022-07-15기타내항삼척어판장수도권질병대응센터Vibrio2022-07-15Vibrio parahaemolyticus장염비브리오균 감염증지정
6선박2022-07-15기타외항후진항수도권질병대응센터Vibrio2022-07-15Vibrio parahaemolyticus장염비브리오균 감염증지정
7선박2022-07-15기타조위관측소묵호조위소수도권질병대응센터Vibrio2022-07-15Vibrio parahaemolyticus장염비브리오균 감염증지정
8선박2022-07-18기타내항삼척어판장수도권질병대응센터Vibrio2022-07-26Vibrio parahaemolyticus장염비브리오균 감염증지정
9선박2022-07-18기타외항후진항수도권질병대응센터Vibrio2022-07-26Vibrio parahaemolyticus장염비브리오균 감염증지정
구분채취일자가검물분류상세가검물채취장소검사기관검사종류검출일자검출균법정감염병법정군
39선박2022-09-19기타내항삼척어판장수도권질병대응센터Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정
40선박2022-09-19기타외항후진항수도권질병대응센터Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정
41선박2022-09-19해수해수동해항 여객터미널수도권질병대응센터Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정
42선박2022-09-19해수해수물량장수도권질병대응센터Vibrio2022-09-27Vibrio vulnificus비브리오패혈증3군
43선박2022-09-19해수해수물량장수도권질병대응센터Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정
44선박2022-09-19해수해수신국제수도권질병대응센터Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정
45선박2022-09-19해수해수신국제수도권질병대응센터Vibrio2022-09-27Vibrio cholerae non O1<NA><NA>
46선박2022-09-19해수해수신국제수도권질병대응센터Vibrio2022-09-27Vibrio vulnificus비브리오패혈증3군
47선박2022-09-19해수해수인천조위소수도권질병대응센터Vibrio2022-09-27Vibrio vulnificus비브리오패혈증3군
48선박2022-09-19해수해수인천조위소수도권질병대응센터Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정