Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 116 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.6 KiB |
Average record size in memory | 58.1 B |
Variable types
Numeric | 1 |
---|---|
Text | 3 |
Categorical | 3 |
Dataset
Description | Animal disease information |
---|---|
Author | 농림축산검역본부 (Animal and Plant Quarantine Agency) |
URL | https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220214000000001884 |
Reproduction
Analysis started | 2023-12-11 03:46:38.485779 |
---|---|
Analysis finished | 2023-12-11 03:46:39.192415 |
Duration | 0.71 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
DISS_NO
Real number (ℝ)
UNIQUE
 
Distinct | 116 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 60.051724 |
Minimum | 1 |
---|---|
Maximum | 121 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 6.75 |
Q1 | 30.75 |
median | 59.5 |
Q3 | 89.25 |
95-th percentile | 115.25 |
Maximum | 121 |
Range | 120 |
Interquartile range (IQR) | 58.5 |
Descriptive statistics
Standard deviation | 34.549988 |
---|---|
Coefficient of variation (CV) | 0.57533715 |
Kurtosis | -1.1629478 |
Mean | 60.051724 |
Median Absolute Deviation (MAD) | 29.5 |
Skewness | 0.03094647 |
Sum | 6966 |
Variance | 1193.7016 |
Monotonicity | Not monotonic |
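Several of the DISS_NO figures above are derived from the others by simple arithmetic. The following is a minimal Python sketch, using only values copied from this report (the variable names are illustrative, not part of the report), that re-derives the mean, range, IQR, variance, and coefficient of variation. Small differences in the last digits are expected because the report shows rounded inputs.

```python
# Values copied from the DISS_NO tables in this report.
n = 116                      # Number of observations
total = 6966                 # Sum
minimum, maximum = 1, 121    # Minimum, Maximum
q1, q3 = 30.75, 89.25        # Q1, Q3
std = 34.549988              # Standard deviation (rounded in the report)

mean = total / n                  # report shows 60.051724
value_range = maximum - minimum   # report shows 120
iqr = q3 - q1                     # report shows 58.5
variance = std ** 2               # report shows 1193.7016
cv = std / mean                   # report shows 0.57533715

print(mean, value_range, iqr, variance, cv)
```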
Value | Count | Frequency (%) |
28 | 1 | 0.9% |
103 | 1 | 0.9% |
118 | 1 | 0.9% |
117 | 1 | 0.9% |
116 | 1 | 0.9% |
115 | 1 | 0.9% |
113 | 1 | 0.9% |
112 | 1 | 0.9% |
110 | 1 | 0.9% |
109 | 1 | 0.9% |
Other values (106) | 106 | 91.4% |
Value | Count | Frequency (%) |
1 | 1 | 0.9% |
2 | 1 | 0.9% |
3 | 1 | 0.9% |
4 | 1 | 0.9% |
5 | 1 | 0.9% |
6 | 1 | 0.9% |
7 | 1 | 0.9% |
8 | 1 | 0.9% |
9 | 1 | 0.9% |
11 | 1 | 0.9% |
Value | Count | Frequency (%) |
121 | 1 | 0.9% |
120 | 1 | 0.9% |
119 | 1 | 0.9% |
118 | 1 | 0.9% |
117 | 1 | 0.9% |
116 | 1 | 0.9% |
115 | 1 | 0.9% |
113 | 1 | 0.9% |
112 | 1 | 0.9% |
110 | 1 | 0.9% |
DISS_NM
Text
UNIQUE
 
Distinct | 116 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.0 KiB |
Length
Max length | 15 |
---|---|
Median length | 10 |
Mean length | 5.9913793 |
Min length | 2 |
Characters and Unicode
Total characters | 695 |
---|---|
Distinct characters | 198 |
Distinct categories | 3 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 116 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 돼지단독 |
---|---|
2nd row | 돼지로타바이러스감염증 |
3rd row | 돼지생식기호흡기증후군 |
4th row | 돼지수포병 |
5th row | 돼지써코바이러스감염증 |
Value | Count | Frequency (%) |
돼지단독 | 1 | 0.9% |
전염성F낭병 | 1 | 0.9% |
파스튜렐라폐렴 | 1 | 0.9% |
소부제병 | 1 | 0.9% |
개코로나바이러스감염증 | 1 | 0.9% |
토끼바이러스성출혈병 | 1 | 0.9% |
탄저 | 1 | 0.9% |
클라미디아병 | 1 | 0.9% |
크립토스포리디움증 | 1 | 0.9% |
큐열 | 1 | 0.9% |
Other values (107) | 107 | 91.5% |
Most occurring characters
Value | Count | Frequency (%) |
병 | 38 | 5.5% |
염 | 35 | 5.0% |
스 | 32 | 4.6% |
증 | 31 | 4.5% |
이 | 25 | 3.6% |
성 | 21 | 3.0% |
바 | 21 | 3.0% |
러 | 18 | 2.6% |
지 | 17 | 2.4% |
소 | 16 | 2.3% |
Other values (188) | 441 | 63.5% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 691 | 99.4% |
Uppercase Letter | 3 | 0.4% |
Space Separator | 1 | 0.1% |
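The category names in this table correspond to Unicode general categories ("Other Letter" is `Lo`, "Uppercase Letter" is `Lu`, "Space Separator" is `Zs`). As a rough sketch of how such counts can be reproduced with Python's standard library, the snippet below tallies categories for the five DISS_NM sample rows shown earlier. It is an illustration on a small sample, not the profiler's own implementation, and the helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count characters by Unicode general category code (e.g. 'Lo', 'Lu', 'Zs')."""
    return Counter(unicodedata.category(ch) for text in values for ch in text)

# The five DISS_NM sample rows from this report (not the full column).
sample = [
    "돼지단독",
    "돼지로타바이러스감염증",
    "돼지생식기호흡기증후군",
    "돼지수포병",
    "돼지써코바이러스감염증",
]

counts = category_counts(sample)
print(counts)  # every character in this sample is a Hangul syllable, category 'Lo'
```

Running the same function over the full column should yield the three categories listed above, assuming the data matches what the report describes.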
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
병 | 38 | 5.5% |
염 | 35 | 5.1% |
스 | 32 | 4.6% |
증 | 31 | 4.5% |
이 | 25 | 3.6% |
성 | 21 | 3.0% |
바 | 21 | 3.0% |
러 | 18 | 2.6% |
지 | 17 | 2.5% |
소 | 16 | 2.3% |
Other values (184) | 437 | 63.2% |
Uppercase Letter
Value | Count | Frequency (%) |
F | 1 | 33.3% |
S | 1 | 33.3% |
R | 1 | 33.3% |
Space Separator
Value | Count | Frequency (%) |
(space) | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 691 | 99.4% |
Latin | 3 | 0.4% |
Common | 1 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
병 | 38 | 5.5% |
염 | 35 | 5.1% |
스 | 32 | 4.6% |
증 | 31 | 4.5% |
이 | 25 | 3.6% |
성 | 21 | 3.0% |
바 | 21 | 3.0% |
러 | 18 | 2.6% |
지 | 17 | 2.5% |
소 | 16 | 2.3% |
Other values (184) | 437 | 63.2% |
Latin
Value | Count | Frequency (%) |
F | 1 | 33.3% |
S | 1 | 33.3% |
R | 1 | 33.3% |
Common
Value | Count | Frequency (%) |
(space) | 1 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 691 | 99.4% |
ASCII | 4 | 0.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
병 | 38 | 5.5% |
염 | 35 | 5.1% |
스 | 32 | 4.6% |
증 | 31 | 4.5% |
이 | 25 | 3.6% |
성 | 21 | 3.0% |
바 | 21 | 3.0% |
러 | 18 | 2.6% |
지 | 17 | 2.5% |
소 | 16 | 2.3% |
Other values (184) | 437 | 63.2% |
ASCII
Value | Count | Frequency (%) |
F | 1 | 25.0% |
S | 1 | 25.0% |
(space) | 1 | 25.0% |
R | 1 | 25.0% |
ENG_DISS_NM
Text
Distinct | 115 |
---|---|
Distinct (%) | 99.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.0 KiB |
Length
Max length | 44 |
---|---|
Median length | 28 |
Mean length | 18.517241 |
Min length | 6 |
Characters and Unicode
Total characters | 2148 |
---|---|
Distinct characters | 52 |
Distinct categories | 6 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 114 |
---|---|
Unique (%) | 98.3% |
Sample
1st row | Swine erysipelas |
---|---|
2nd row | Porcine rotavirus infection |
3rd row | Porcine reproductive and respiratory syndrom |
4th row | Swine vesicular disease |
5th row | PCV-2 infection |
Value | Count | Frequency (%) |
disease | 19 | 7.6% |
infection | 14 | 5.6% |
bovine | 8 | 3.2% |
fever | 7 | 2.8% |
porcine | 6 | 2.4% |
swine | 5 | 2.0% |
infectious | 4 | 1.6% |
viral | 4 | 1.6% |
fowl | 3 | 1.2% |
respiratory | 3 | 1.2% |
Other values (154) | 178 | 70.9% |
Most occurring characters
Value | Count | Frequency (%) |
i | 244 | 11.4% |
e | 214 | 10.0% |
s | 195 | 9.1% |
o | 164 | 7.6% |
a | 138 | 6.4% |
n | 136 | 6.3% |
(space) | 135 | 6.3% |
r | 125 | 5.8% |
t | 98 | 4.6% |
l | 79 | 3.7% |
Other values (42) | 620 | 28.9% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1874 | 87.2% |
Space Separator | 135 | 6.3% |
Uppercase Letter | 130 | 6.1% |
Other Punctuation | 7 | 0.3% |
Dash Punctuation | 1 | < 0.1% |
Decimal Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
i | 244 | 13.0% |
e | 214 | 11.4% |
s | 195 | 10.4% |
o | 164 | 8.8% |
a | 138 | 7.4% |
n | 136 | 7.3% |
r | 125 | 6.7% |
t | 98 | 5.2% |
l | 79 | 4.2% |
c | 77 | 4.1% |
Other values (16) | 404 | 21.6% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 18 | 13.8% |
A | 16 | 12.3% |
P | 14 | 10.8% |
B | 12 | 9.2% |
S | 11 | 8.5% |
E | 8 | 6.2% |
R | 7 | 5.4% |
F | 7 | 5.4% |
L | 7 | 5.4% |
I | 6 | 4.6% |
Other values (9) | 24 | 18.5% |
Other Punctuation
Value | Count | Frequency (%) |
' | 4 | 57.1% |
: | 1 | 14.3% |
. | 1 | 14.3% |
, | 1 | 14.3% |
Space Separator
Value | Count | Frequency (%) |
(space) | 135 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 | 100.0% |
Decimal Number
Value | Count | Frequency (%) |
2 | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2004 | 93.3% |
Common | 144 | 6.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
i | 244 | 12.2% |
e | 214 | 10.7% |
s | 195 | 9.7% |
o | 164 | 8.2% |
a | 138 | 6.9% |
n | 136 | 6.8% |
r | 125 | 6.2% |
t | 98 | 4.9% |
l | 79 | 3.9% |
c | 77 | 3.8% |
Other values (35) | 534 | 26.6% |
Common
Value | Count | Frequency (%) |
(space) | 135 | 93.8% |
' | 4 | 2.8% |
- | 1 | 0.7% |
: | 1 | 0.7% |
. | 1 | 0.7% |
, | 1 | 0.7% |
2 | 1 | 0.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2148 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
i | 244 | 11.4% |
e | 214 | 10.0% |
s | 195 | 9.1% |
o | 164 | 7.6% |
a | 138 | 6.4% |
n | 136 | 6.3% |
(space) | 135 | 6.3% |
r | 125 | 5.8% |
t | 98 | 4.6% |
l | 79 | 3.7% |
Other values (42) | 620 | 28.9% |
INFO_OFFER_NM
Categorical
Distinct | 42 |
---|---|
Distinct (%) | 36.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.0 KiB |
Value | Count |
---|---|
정우석 | 9 |
예정용 | 8 |
이희수 | 7 |
최강석 | 7 |
양동군 | 5 |
Other values (37) | 80 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.9482759 |
Min length | 2 |
Unique
Unique | 9 |
---|---|
Unique (%) | 7.8% |
Sample
1st row | 문진산 |
---|---|
2nd row | 김병한 |
3rd row | 배유찬 |
4th row | 박종현 |
5th row | 양동군 |
Common Values
Value | Count | Frequency (%) |
정우석 | 9 | 7.8% |
예정용 | 8 | 6.9% |
이희수 | 7 | 6.0% |
최강석 | 7 | 6.0% |
양동군 | 5 | 4.3% |
장환 | 4 | 3.4% |
박종현 | 4 | 3.4% |
최세은 | 4 | 3.4% |
조성준 | 4 | 3.4% |
강현미 | 3 | 2.6% |
Other values (32) | 61 | 52.6% |
Length
Value | Count | Frequency (%) |
정우석 | 9 | 7.8% |
예정용 | 8 | 6.9% |
이희수 | 7 | 6.0% |
최강석 | 7 | 6.0% |
양동군 | 5 | 4.3% |
장환 | 4 | 3.4% |
박종현 | 4 | 3.4% |
최세은 | 4 | 3.4% |
조성준 | 4 | 3.4% |
정병열 | 3 | 2.6% |
Other values (32) | 61 | 52.6% |
RGSDE
Categorical
IMBALANCE
 
Distinct | 8 |
---|---|
Distinct (%) | 6.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.0 KiB |
Value | Count |
---|---|
2007-09-18 | 103 |
2007-10-25 | 4 |
2007-11-07 | 3 |
2007-06-04 | 2 |
2007-06-12 | 1 |
Other values (3) | 3 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 4 |
---|---|
Unique (%) | 3.4% |
Sample
1st row | 2007-09-18 |
---|---|
2nd row | 2007-09-18 |
3rd row | 2007-09-18 |
4th row | 2007-09-18 |
5th row | 2007-09-18 |
Common Values
Value | Count | Frequency (%) |
2007-09-18 | 103 | 88.8% |
2007-10-25 | 4 | 3.4% |
2007-11-07 | 3 | 2.6% |
2007-06-04 | 2 | 1.7% |
2007-06-12 | 1 | 0.9% |
2007-06-07 | 1 | 0.9% |
2007-05-28 | 1 | 0.9% |
2007-10-30 | 1 | 0.9% |
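The Frequency (%) column is each count divided by the 116 observations. A short Python sketch, using the RGSDE counts from the table above (the dictionary and variable names are illustrative), shows the arithmetic and makes the imbalance toward a single registration date explicit:

```python
# RGSDE value counts copied from the Common Values table in this report.
counts = {
    "2007-09-18": 103,
    "2007-10-25": 4,
    "2007-11-07": 3,
    "2007-06-04": 2,
    "2007-06-12": 1,
    "2007-06-07": 1,
    "2007-05-28": 1,
    "2007-10-30": 1,
}

total = sum(counts.values())  # should equal the number of observations
# Percentage share of each value, rounded to one decimal as in the report.
shares = {value: round(100 * count / total, 1) for value, count in counts.items()}

for value, share in shares.items():
    print(f"{value} | {counts[value]} | {share}% |")
```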
MAIN_INFC_ANIMAL
Text
Distinct | 51 |
---|---|
Distinct (%) | 44.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.0 KiB |
Value | Count | Frequency (%) |
돼지 | 19 | 13.8% |
소 | 18 | 13.0% |
닭 | 8 | 5.8% |
미분류 | 8 | 5.8% |
미분류,돼지 | 6 | 4.3% |
야생조류-기타 | 4 | 2.9% |
소,기타 | 4 | 2.9% |
소,돼지 | 4 | 2.9% |
쥐-랫트 | 4 | 2.9% |
개 | 4 | 2.9% |
Other values (47) | 59 | 42.8% |
Most occurring characters
Value | Count | Frequency (%) |
, | 132 | 17.9% |
소 | 50 | 6.8% |
돼 | 41 | 5.6% |
지 | 41 | 5.6% |
양 | 41 | 5.6% |
류 | 34 | 4.6% |
면 | 31 | 4.2% |
기 | 29 | 3.9% |
타 | 29 | 3.9% |
- | 24 | 3.3% |
Other values (36) | 285 | 38.7% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 559 | 75.8% |
Other Punctuation | 132 | 17.9% |
Dash Punctuation | 24 | 3.3% |
Space Separator | 22 | 3.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
소 | 50 | 8.9% |
돼 | 41 | 7.3% |
지 | 41 | 7.3% |
양 | 41 | 7.3% |
류 | 34 | 6.1% |
면 | 31 | 5.5% |
기 | 29 | 5.2% |
타 | 29 | 5.2% |
미 | 22 | 3.9% |
분 | 22 | 3.9% |
Other values (33) | 219 | 39.2% |
Other Punctuation
Value | Count | Frequency (%) |
, | 132 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 24 | 100.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 22 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 559 | 75.8% |
Common | 178 | 24.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
소 | 50 | 8.9% |
돼 | 41 | 7.3% |
지 | 41 | 7.3% |
양 | 41 | 7.3% |
류 | 34 | 6.1% |
면 | 31 | 5.5% |
기 | 29 | 5.2% |
타 | 29 | 5.2% |
미 | 22 | 3.9% |
분 | 22 | 3.9% |
Other values (33) | 219 | 39.2% |
Common
Value | Count | Frequency (%) |
, | 132 | 74.2% |
- | 24 | 13.5% |
(space) | 22 | 12.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 559 | 75.8% |
ASCII | 178 | 24.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
, | 132 | 74.2% |
- | 24 | 13.5% |
(space) | 22 | 12.4% |
Hangul
Value | Count | Frequency (%) |
소 | 50 | 8.9% |
돼 | 41 | 7.3% |
지 | 41 | 7.3% |
양 | 41 | 7.3% |
류 | 34 | 6.1% |
면 | 31 | 5.5% |
기 | 29 | 5.2% |
타 | 29 | 5.2% |
미 | 22 | 3.9% |
분 | 22 | 3.9% |
Other values (33) | 219 | 39.2% |
CAUSE_CMMN_CL
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 5.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.0 KiB |
Value | Count |
---|---|
기타 | 50 |
바이러스 | 28 |
세균 | 25 |
기생충 | 9 |
곰팡이 | 3 |
Length
Max length | 7 |
---|---|
Median length | 2 |
Mean length | 2.6293103 |
Min length | 2 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.9% |
Sample
1st row | 세균 |
---|---|
2nd row | 바이러스 |
3rd row | 바이러스 |
4th row | 기타 |
5th row | 바이러스 |
Common Values
Value | Count | Frequency (%) |
기타 | 50 | 43.1% |
바이러스 | 28 | 24.1% |
세균 | 25 | 21.6% |
기생충 | 9 | 7.8% |
곰팡이 | 3 | 2.6% |
기타,바이러스 | 1 | 0.9% |
Variable | DISS_NO | INFO_OFFER_NM | RGSDE | MAIN_INFC_ANIMAL | CAUSE_CMMN_CL |
---|---|---|---|---|---|
DISS_NO | 1.000 | 0.448 | 0.473 | 0.840 | 0.214 |
INFO_OFFER_NM | 0.448 | 1.000 | 0.000 | 0.847 | 0.694 |
RGSDE | 0.473 | 0.000 | 1.000 | 0.000 | 0.246 |
MAIN_INFC_ANIMAL | 0.840 | 0.847 | 0.000 | 1.000 | 0.000 |
CAUSE_CMMN_CL | 0.214 | 0.694 | 0.246 | 0.000 | 1.000 |
Variable | RGSDE | CAUSE_CMMN_CL | INFO_OFFER_NM |
---|---|---|---|
RGSDE | 1.000 | 0.136 | 0.000 |
CAUSE_CMMN_CL | 0.136 | 1.000 | 0.291 |
INFO_OFFER_NM | 0.000 | 0.291 | 1.000 |
Variable | DISS_NO | INFO_OFFER_NM | RGSDE | CAUSE_CMMN_CL |
---|---|---|---|---|
DISS_NO | 1.000 | 0.150 | 0.244 | 0.120 |
INFO_OFFER_NM | 0.150 | 1.000 | 0.000 | 0.291 |
RGSDE | 0.244 | 0.000 | 1.000 | 0.136 |
CAUSE_CMMN_CL | 0.120 | 0.291 | 0.136 | 1.000 |
Index | DISS_NO | DISS_NM | ENG_DISS_NM | INFO_OFFER_NM | RGSDE | MAIN_INFC_ANIMAL | CAUSE_CMMN_CL |
---|---|---|---|---|---|---|---|
0 | 28 | 돼지단독 | Swine erysipelas | 문진산 | 2007-09-18 | 돼지 | 세균 |
1 | 29 | 돼지로타바이러스감염증 | Porcine rotavirus infection | 김병한 | 2007-09-18 | 돼지 | 바이러스 |
2 | 30 | 돼지생식기호흡기증후군 | Porcine reproductive and respiratory syndrom | 배유찬 | 2007-09-18 | 돼지 | 바이러스 |
3 | 31 | 돼지수포병 | Swine vesicular disease | 박종현 | 2007-09-18 | 돼지 | 기타 |
4 | 32 | 돼지써코바이러스감염증 | PCV-2 infection | 양동군 | 2007-09-18 | 돼지 | 바이러스 |
5 | 33 | 돼지유행성설사병 | Porcine epidemic diarrhea | 현방훈 | 2007-09-18 | 돼지 | 바이러스 |
6 | 34 | 돼지적리 | Swine dysentery | 임숙경 | 2007-09-18 | 돼지 | 세균 |
7 | 35 | 돼지열병 | Classical swine fever | 송재영 | 2007-09-18 | 돼지 | 바이러스 |
8 | 36 | 돼지파보바이러스감염증 | Porcine parvovirus infection | 김성희 | 2007-09-18 | 돼지 | 기타 |
9 | 37 | 돼지호흡기코로나바이러스감염증 | Porcine respiratory coronaviral infection | 노인순 | 2007-09-18 | 돼지 | 바이러스 |
Index | DISS_NO | DISS_NM | ENG_DISS_NM | INFO_OFFER_NM | RGSDE | MAIN_INFC_ANIMAL | CAUSE_CMMN_CL |
---|---|---|---|---|---|---|---|
106 | 18 | 네오스포라병 | Neosporosis | 정우석 | 2007-09-18 | 소,개,면양,기타 미분류 | 기생충 |
107 | 19 | 노제마병 | Nosema disease | 장환 | 2007-09-18 | 벌-꿀벌 | 기타 |
108 | 20 | 뇌척수염 | Avian encephalomyelitis | 이윤정 | 2007-09-18 | 닭,꿩,칠면조,메추리 | 바이러스 |
109 | 21 | 뉴캣슬병 | Newcastle disease | 최강석 | 2007-09-18 | 닭,꿩,메추리 | 기타 |
110 | 22 | 니파바이러스감염증 | Nipahvirus infection | 나진주 | 2007-09-18 | 고양이,개,돼지,산양,면양,쥐-랫트 | 기타 |
111 | 23 | 닭세망내피증 | Reticuloendotheliosis | 최강석 | 2007-09-18 | 닭,오리,칠면조 | 바이러스 |
112 | 24 | 닭콕시듐증 | Coccidiosis | 장환 | 2007-06-04 | 닭 | 기타 |
113 | 25 | 대장균증 | Colibacillosis | 이희수 | 2007-09-18 | 소,돼지 | 세균 |
114 | 26 | 돼지게타바이러스감염증 | Porcine getahvirus disease | 최강석 | 2007-09-18 | 소,돼지,쥐-랫트 | 바이러스 |
115 | 27 | 돼지뇌심근염 | Encephalomyocarditis | 김성희 | 2007-09-18 | 돼지 | 기타,바이러스 |
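The sample rows show that MAIN_INFC_ANIMAL can hold several animals in one cell, which is why the column has 51 distinct values. The sketch below is one possible way to split such cells before counting animals individually. It assumes the comma is the only delimiter and that a hyphen marks a sub-type to keep intact (for example 벌-꿀벌); neither is documented in the report, and the helper name `split_animals` is hypothetical.

```python
from collections import Counter

def split_animals(field):
    """Split a comma-separated MAIN_INFC_ANIMAL cell into trimmed, non-empty names."""
    return [part.strip() for part in field.split(",") if part.strip()]

# A few MAIN_INFC_ANIMAL cells copied from the sample rows above.
rows = [
    "소,개,면양,기타 미분류",
    "벌-꿀벌",
    "닭,꿩,칠면조,메추리",
    "소,돼지",
    "돼지",
]

# Count how many of these rows mention each animal.
per_animal = Counter(animal for row in rows for animal in split_animals(row))
print(per_animal.most_common())
```

Counting per animal rather than per raw cell gives a more direct view of which hosts appear most often, at the cost of losing the original combinations.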