Overview

Dataset statistics

Number of variables12
Number of observations58
Missing cells0
Missing cells (%)0.0%
Duplicate rows12
Duplicate rows (%)20.7%
Total size in memory5.6 KiB
Average record size in memory98.3 B

Variable types

Categorical10
DateTime2

Dataset

Description검역소별 검사결과 정보 (채취기관, 검사기관, 채취일자, 구분, 가검물구분, 가검물분류, 상세가검물, 검사종류, 검출일자, 검출균, 법정감염병, 법정군)
Author질병관리청
URLhttps://www.data.go.kr/data/3074719/fileData.do

Alerts

구분 has constant value ""Constant
Dataset has 12 (20.7%) duplicate rowsDuplicates
법정군 is highly overall correlated with 검출균 and 1 other fieldsHigh correlation
검사기관 is highly overall correlated with 채취기관 and 1 other fieldsHigh correlation
법정감염병 is highly overall correlated with 가검물구분 and 5 other fieldsHigh correlation
채취기관 is highly overall correlated with 검사기관 and 2 other fieldsHigh correlation
상세가검물 is highly overall correlated with 채취기관 and 4 other fieldsHigh correlation
검사종류 is highly overall correlated with 검출균 and 1 other fieldsHigh correlation
가검물분류 is highly overall correlated with 채취기관 and 3 other fieldsHigh correlation
가검물구분 is highly overall correlated with 가검물분류 and 3 other fieldsHigh correlation
검출균 is highly overall correlated with 가검물구분 and 3 other fieldsHigh correlation
검사기관 is highly imbalanced (70.6%)Imbalance
검사종류 is highly imbalanced (78.4%)Imbalance

Reproduction

Analysis started2023-12-12 11:21:49.945389
Analysis finished2023-12-12 11:21:51.591873
Duration1.65 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

채취기관
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)5.2%
Missing0
Missing (%)0.0%
Memory size596.0 B
국립인천검역소
41 
국립동해검역소
14 
국립군산검역소
 
3

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국립군산검역소
2nd row국립군산검역소
3rd row국립군산검역소
4th row국립인천검역소
5th row국립인천검역소

Common Values

ValueCountFrequency (%)
국립인천검역소 41
70.7%
국립동해검역소 14
 
24.1%
국립군산검역소 3
 
5.2%

Length

2023-12-12T20:21:51.700764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:51.869516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국립인천검역소 41
70.7%
국립동해검역소 14
 
24.1%
국립군산검역소 3
 
5.2%

검사기관
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size596.0 B
수도권질병대응센터
55 
충청권질병대응센터
 
3

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청권질병대응센터
2nd row충청권질병대응센터
3rd row충청권질병대응센터
4th row수도권질병대응센터
5th row수도권질병대응센터

Common Values

ValueCountFrequency (%)
수도권질병대응센터 55
94.8%
충청권질병대응센터 3
 
5.2%

Length

2023-12-12T20:21:52.062780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:52.218958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수도권질병대응센터 55
94.8%
충청권질병대응센터 3
 
5.2%
Distinct15
Distinct (%)25.9%
Missing0
Missing (%)0.0%
Memory size596.0 B
Minimum2022-03-18 00:00:00
Maximum2022-11-01 00:00:00
2023-12-12T20:21:52.366139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:21:52.596268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size596.0 B
선박
58 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row선박
2nd row선박
3rd row선박
4th row선박
5th row선박

Common Values

ValueCountFrequency (%)
선박 58
100.0%

Length

2023-12-12T20:21:52.844606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:52.967448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
선박 58
100.0%

가검물구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size596.0 B
검역구역내
49 
운송수단

Length

Max length5
Median length5
Mean length4.8448276
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row검역구역내
2nd row검역구역내
3rd row검역구역내
4th row운송수단
5th row검역구역내

Common Values

ValueCountFrequency (%)
검역구역내 49
84.5%
운송수단 9
 
15.5%

Length

2023-12-12T20:21:53.121530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:53.307112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
검역구역내 49
84.5%
운송수단 9
 
15.5%

가검물분류
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size596.0 B
해수
36 
기타
13 
선박기타
선박변기오수
 
1

Length

Max length6
Median length2
Mean length2.3448276
Min length2

Unique

Unique1 ?
Unique (%)1.7%

Sample

1st row해수
2nd row해수
3rd row해수
4th row선박기타
5th row해수

Common Values

ValueCountFrequency (%)
해수 36
62.1%
기타 13
 
22.4%
선박기타 8
 
13.8%
선박변기오수 1
 
1.7%

Length

2023-12-12T20:21:53.489679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:53.663115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해수 36
62.1%
기타 13
 
22.4%
선박기타 8
 
13.8%
선박변기오수 1
 
1.7%

상세가검물
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)15.5%
Missing0
Missing (%)0.0%
Memory size596.0 B
해수
33 
주방오수
내항
외항
비브리오넷
 
3
Other values (4)

Length

Max length5
Median length2
Mean length2.5689655
Min length2

Unique

Unique2 ?
Unique (%)3.4%

Sample

1st row비브리오넷
2nd row비브리오넷
3rd row비브리오넷
4th row주방오수
5th row해수

Common Values

ValueCountFrequency (%)
해수 33
56.9%
주방오수 5
 
8.6%
내항 5
 
8.6%
외항 5
 
8.6%
비브리오넷 3
 
5.2%
조위관측소 3
 
5.2%
칼도마 2
 
3.4%
B/W 1
 
1.7%
<NA> 1
 
1.7%

Length

2023-12-12T20:21:53.843483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:54.046171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해수 33
56.9%
주방오수 5
 
8.6%
내항 5
 
8.6%
외항 5
 
8.6%
비브리오넷 3
 
5.2%
조위관측소 3
 
5.2%
칼도마 2
 
3.4%
b/w 1
 
1.7%
na 1
 
1.7%

검사종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size596.0 B
Vibrio
56 
Escherichia coli
 
2

Length

Max length16
Median length6
Mean length6.3448276
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowVibrio
2nd rowVibrio
3rd rowVibrio
4th rowEscherichia coli
5th rowVibrio

Common Values

ValueCountFrequency (%)
Vibrio 56
96.6%
Escherichia coli 2
 
3.4%

Length

2023-12-12T20:21:54.231471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:54.382001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
vibrio 56
93.3%
escherichia 2
 
3.3%
coli 2
 
3.3%
Distinct16
Distinct (%)27.6%
Missing0
Missing (%)0.0%
Memory size596.0 B
Minimum2022-03-19 00:00:00
Maximum2022-11-05 00:00:00
2023-12-12T20:21:54.529994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:21:54.739229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)

검출균
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size596.0 B
Vibrio parahaemolyticus
35 
Vibrio vulnificus
14 
Vibrio cholerae non O1
ETEC LT
 
1
ETEC LT, ST
 
1

Length

Max length23
Median length23
Mean length20.948276
Min length7

Unique

Unique2 ?
Unique (%)3.4%

Sample

1st rowVibrio vulnificus
2nd rowVibrio vulnificus
3rd rowVibrio vulnificus
4th rowETEC LT
5th rowVibrio parahaemolyticus

Common Values

ValueCountFrequency (%)
Vibrio parahaemolyticus 35
60.3%
Vibrio vulnificus 14
 
24.1%
Vibrio cholerae non O1 7
 
12.1%
ETEC LT 1
 
1.7%
ETEC LT, ST 1
 
1.7%

Length

2023-12-12T20:21:54.946199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:55.153472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
vibrio 56
42.7%
parahaemolyticus 35
26.7%
vulnificus 14
 
10.7%
cholerae 7
 
5.3%
non 7
 
5.3%
o1 7
 
5.3%
etec 2
 
1.5%
lt 2
 
1.5%
st 1
 
0.8%

법정감염병
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size596.0 B
장염비브리오균 감염증
35 
비브리오패혈증
14 
<NA>
장독소성대장균 (ETEC)감염증
 
2

Length

Max length17
Median length11
Mean length9.3965517
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row비브리오패혈증
2nd row비브리오패혈증
3rd row비브리오패혈증
4th row장독소성대장균 (ETEC)감염증
5th row장염비브리오균 감염증

Common Values

ValueCountFrequency (%)
장염비브리오균 감염증 35
60.3%
비브리오패혈증 14
 
24.1%
<NA> 7
 
12.1%
장독소성대장균 (ETEC)감염증 2
 
3.4%

Length

2023-12-12T20:21:55.365139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:55.539326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
장염비브리오균 35
36.8%
감염증 35
36.8%
비브리오패혈증 14
 
14.7%
na 7
 
7.4%
장독소성대장균 2
 
2.1%
etec)감염증 2
 
2.1%

법정군
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)5.2%
Missing0
Missing (%)0.0%
Memory size596.0 B
지정
37 
3군
14 
<NA>

Length

Max length4
Median length2
Mean length2.2413793
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3군
2nd row3군
3rd row3군
4th row지정
5th row지정

Common Values

ValueCountFrequency (%)
지정 37
63.8%
3군 14
 
24.1%
<NA> 7
 
12.1%

Length

2023-12-12T20:21:55.755850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:55.928813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지정 37
63.8%
3군 14
 
24.1%
na 7
 
12.1%

Correlations

2023-12-12T20:21:56.068582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
채취기관검사기관채취일자가검물구분가검물분류상세가검물검사종류검출일자검출균법정감염병법정군
채취기관1.0001.0000.9430.1260.6370.9580.0000.2420.4080.6800.302
검사기관1.0001.0001.0000.0000.0001.0000.0000.0000.2720.2210.430
채취일자0.9431.0001.0001.0000.9330.8610.8470.9940.9140.8360.162
가검물구분0.1260.0001.0001.0001.0001.0000.4311.0000.5020.3830.000
가검물분류0.6370.0000.9331.0001.0001.0000.6140.9810.4730.5360.576
상세가검물0.9581.0000.8611.0001.0001.0000.6350.8550.6520.7560.586
검사종류0.0000.0000.8470.4310.6140.6351.0001.0001.0001.0000.000
검출일자0.2420.0000.9941.0000.9810.8551.0001.0000.9070.9010.000
검출균0.4080.2720.9140.5020.4730.6521.0000.9071.0001.0001.000
법정감염병0.6800.2210.8360.3830.5360.7561.0000.9011.0001.0001.000
법정군0.3020.4300.1620.0000.5760.5860.0000.0001.0001.0001.000
2023-12-12T20:21:56.327230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정군검사기관법정감염병채취기관상세가검물검사종류가검물분류가검물구분검출균
법정군1.0000.2820.9900.4820.4120.0000.3860.0000.979
검사기관0.2821.0000.3570.9910.9440.0000.0000.0000.321
법정감염병0.9900.3571.0000.3370.6240.9900.5330.5990.990
채취기관0.4820.9910.3371.0000.9270.0000.6530.2040.331
상세가검물0.4120.9440.6240.9271.0000.4530.9530.9440.459
검사종류0.0000.0000.9900.0000.4531.0000.4170.2830.973
가검물분류0.3860.0000.5330.6530.9530.4171.0000.9820.398
가검물구분0.0000.0000.5990.2040.9440.2830.9821.0000.591
검출균0.9790.3210.9900.3310.4590.9730.3980.5911.000
2023-12-12T20:21:56.535965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
채취기관검사기관가검물구분가검물분류상세가검물검사종류검출균법정감염병법정군
채취기관1.0000.9910.2040.6530.9270.0000.3310.3370.482
검사기관0.9911.0000.0000.0000.9440.0000.3210.3570.282
가검물구분0.2040.0001.0000.9820.9440.2830.5910.5990.000
가검물분류0.6530.0000.9821.0000.9530.4170.3980.5330.386
상세가검물0.9270.9440.9440.9531.0000.4530.4590.6240.412
검사종류0.0000.0000.2830.4170.4531.0000.9730.9900.000
검출균0.3310.3210.5910.3980.4590.9731.0000.9900.979
법정감염병0.3370.3570.5990.5330.6240.9900.9901.0000.990
법정군0.4820.2820.0000.3860.4120.0000.9790.9901.000

Missing values

2023-12-12T20:21:51.123055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:21:51.473021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

채취기관검사기관채취일자구분가검물구분가검물분류상세가검물검사종류검출일자검출균법정감염병법정군
0국립군산검역소충청권질병대응센터2022-07-19선박검역구역내해수비브리오넷Vibrio2022-07-26Vibrio vulnificus비브리오패혈증3군
1국립군산검역소충청권질병대응센터2022-07-19선박검역구역내해수비브리오넷Vibrio2022-07-26Vibrio vulnificus비브리오패혈증3군
2국립군산검역소충청권질병대응센터2022-07-19선박검역구역내해수비브리오넷Vibrio2022-07-26Vibrio vulnificus비브리오패혈증3군
3국립인천검역소수도권질병대응센터2022-03-18선박운송수단선박기타주방오수Escherichia coli2022-03-19ETEC LT장독소성대장균 (ETEC)감염증지정
4국립인천검역소수도권질병대응센터2022-07-04선박검역구역내해수해수Vibrio2022-07-12Vibrio parahaemolyticus장염비브리오균 감염증지정
5국립인천검역소수도권질병대응센터2022-07-04선박검역구역내해수해수Vibrio2022-07-12Vibrio vulnificus비브리오패혈증3군
6국립인천검역소수도권질병대응센터2022-07-04선박검역구역내해수해수Vibrio2022-07-12Vibrio vulnificus비브리오패혈증3군
7국립인천검역소수도권질병대응센터2022-07-04선박검역구역내해수해수Vibrio2022-07-12Vibrio parahaemolyticus장염비브리오균 감염증지정
8국립인천검역소수도권질병대응센터2022-07-04선박검역구역내해수해수Vibrio2022-07-12Vibrio parahaemolyticus장염비브리오균 감염증지정
9국립동해검역소수도권질병대응센터2022-07-15선박검역구역내기타내항Vibrio2022-07-15Vibrio parahaemolyticus장염비브리오균 감염증지정
채취기관검사기관채취일자구분가검물구분가검물분류상세가검물검사종류검출일자검출균법정감염병법정군
48국립동해검역소수도권질병대응센터2022-09-19선박검역구역내기타외항Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정
49국립인천검역소수도권질병대응센터2022-09-19선박검역구역내해수해수Vibrio2022-09-27Vibrio cholerae non O1<NA><NA>
50국립동해검역소수도권질병대응센터2022-09-19선박검역구역내해수해수Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정
51국립인천검역소수도권질병대응센터2022-09-19선박검역구역내해수해수Vibrio2022-09-27Vibrio vulnificus비브리오패혈증3군
52국립인천검역소수도권질병대응센터2022-09-19선박검역구역내해수해수Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정
53국립인천검역소수도권질병대응센터2022-09-19선박검역구역내해수해수Vibrio2022-09-27Vibrio vulnificus비브리오패혈증3군
54국립인천검역소수도권질병대응센터2022-09-19선박검역구역내해수해수Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정
55국립인천검역소수도권질병대응센터2022-09-19선박검역구역내해수해수Vibrio2022-09-27Vibrio vulnificus비브리오패혈증3군
56국립인천검역소수도권질병대응센터2022-09-19선박검역구역내해수해수Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정
57국립인천검역소수도권질병대응센터2022-11-01선박운송수단선박기타주방오수Vibrio2022-11-05Vibrio cholerae non O1<NA><NA>

Duplicate rows

Most frequently occurring

채취기관검사기관채취일자구분가검물구분가검물분류상세가검물검사종류검출일자검출균법정감염병법정군# duplicates
0국립군산검역소충청권질병대응센터2022-07-19선박검역구역내해수비브리오넷Vibrio2022-07-26Vibrio vulnificus비브리오패혈증3군3
1국립인천검역소수도권질병대응센터2022-07-04선박검역구역내해수해수Vibrio2022-07-12Vibrio parahaemolyticus장염비브리오균 감염증지정3
4국립인천검역소수도권질병대응센터2022-07-18선박검역구역내해수해수Vibrio2022-07-26Vibrio parahaemolyticus장염비브리오균 감염증지정3
5국립인천검역소수도권질병대응센터2022-08-08선박검역구역내해수해수Vibrio2022-08-16Vibrio parahaemolyticus장염비브리오균 감염증지정3
6국립인천검역소수도권질병대응센터2022-08-22선박검역구역내해수해수Vibrio2022-08-30Vibrio parahaemolyticus장염비브리오균 감염증지정3
8국립인천검역소수도권질병대응센터2022-09-05선박검역구역내해수해수Vibrio2022-09-13Vibrio parahaemolyticus장염비브리오균 감염증지정3
10국립인천검역소수도권질병대응센터2022-09-19선박검역구역내해수해수Vibrio2022-09-27Vibrio parahaemolyticus장염비브리오균 감염증지정3
11국립인천검역소수도권질병대응센터2022-09-19선박검역구역내해수해수Vibrio2022-09-27Vibrio vulnificus비브리오패혈증3군3
2국립인천검역소수도권질병대응센터2022-07-04선박검역구역내해수해수Vibrio2022-07-12Vibrio vulnificus비브리오패혈증3군2
3국립인천검역소수도권질병대응센터2022-07-18선박검역구역내해수해수Vibrio2022-07-26Vibrio cholerae non O1<NA><NA>2