Overview

Dataset statistics

Number of variables: 8
Number of observations: 475
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 30.3 KiB
Average record size in memory: 65.3 B
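
The average record size can be cross-checked from the total size and the number of observations. The sketch below assumes the report divides total memory by the row count and that 1 KiB equals 1024 bytes; the variable names are illustrative only.

```python
# Figures taken from the dataset statistics above.
total_size_kib = 30.3
n_observations = 475

# Assumption: 1 KiB = 1024 bytes. The reported total is itself rounded,
# so the result is only expected to agree to one decimal place.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```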

Variable types

Categorical: 4
Text: 2
DateTime: 1
Numeric: 1

Dataset

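Description: Data on legally designated livestock infectious diseases that occurred in Jeju Special Self-Governing Province (제주특별자치도). It provides the disease name (가축전염병명), farm name (농장명), farm location (농장위치), date of occurrence (발생일자), livestock species (축종), number of affected animals (발생두수), diagnosing institution (진단기관), and related information.
URL: https://www.data.go.kr/data/15047554/fileData.do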

Alerts

데이터기준일자 has constant value "" [Constant]
발생두수(마리) is highly overall correlated with 축종(품종) [High correlation]
가축전염병명 is highly overall correlated with 축종(품종) and 1 other fields [High correlation]
축종(품종) is highly overall correlated with 발생두수(마리) and 2 other fields [High correlation]
진단기관 is highly overall correlated with 가축전염병명 and 1 other fields [High correlation]
가축전염병명 is highly imbalanced (64.2%) [Imbalance]
진단기관 is highly imbalanced (87.4%) [Imbalance]
발생두수(마리) has 7 (1.5%) zeros [Zeros]
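
The two imbalance percentages are consistent with a normalised-entropy score, 1 - H / log(k), where H is the Shannon entropy of the class frequencies and k is the number of classes. The sketch below is a reading of the numbers under that assumption, not a copy of the library's implementation; `imbalance_score` is a hypothetical helper, and the counts are the ones listed in the Common Values tables of this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), with H the Shannon entropy of the counts.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    if len(probs) < 2:
        return 0.0  # a single class gives no meaningful score
    entropy = -sum(p * math.log(p) for p in probs)
    return 1.0 - entropy / math.log(len(probs))

# Class counts as listed in this report.
disease_counts = [384, 40, 25, 7, 5, 5, 4, 4, 1]   # 가축전염병명
diagnosis_counts = [453, 9, 4, 4, 2, 1, 1, 1]      # 진단기관

print(f"{imbalance_score(disease_counts):.1%}")    # 64.2%
print(f"{imbalance_score(diagnosis_counts):.1%}")  # 87.4%
```

Both results match the alert values, which supports the assumed formula but does not prove it is the one used.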

Reproduction

Analysis started: 2023-12-12 23:44:28.291051
Analysis finished: 2023-12-12 23:44:28.861621
Duration: 0.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
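
The reported duration is the difference between the two timestamps. A minimal check using only the standard library (variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2023-12-12 23:44:28.291051")
finished = datetime.fromisoformat("2023-12-12 23:44:28.861621")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")  # 0.57 seconds
```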

Variables

가축전염병명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 9
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

돼지생식기호흡기증후군: 384
결핵병: 40
낭충봉아부패병: 25
고병원성조류인플루엔자: 7
가금티푸스: 5
Other values (4): 14

Length

Max length: 16
Median length: 11
Mean length: 9.9557895
Min length: 3

Unique

Unique: 1
Unique (%): 0.2%

Sample

1st row: 돼지생식기호흡기증후군
2nd row: 돼지생식기호흡기증후군
3rd row: 돼지생식기호흡기증후군
4th row: 낭충봉아부패병
5th row: 돼지생식기호흡기증후군

Common Values

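Value | Count | Frequency (%)
돼지생식기호흡기증후군 (porcine reproductive and respiratory syndrome, PRRS) | 384 | 80.8%
결핵병 (tuberculosis) | 40 | 8.4%
낭충봉아부패병 (sacbrood disease) | 25 | 5.3%
고병원성조류인플루엔자 (highly pathogenic avian influenza) | 7 | 1.5%
가금티푸스 (fowl typhoid) | 5 | 1.1%
뉴캣슬병 (Newcastle disease) | 5 | 1.1%
돼지생식기호흡기증후군-생식기형 (PRRS, reproductive form) | 4 | 0.8%
브루셀라병 (brucellosis) | 4 | 0.8%
돼지열병 (classical swine fever) | 1 | 0.2%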
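
The frequency column and the Distinct (%) figure both follow from the raw counts. The sketch below recomputes them, assuming each percentage is the count divided by the 475 observations and rounded to one decimal place.

```python
# Counts per disease name, copied from the Common Values table above.
counts = {
    "돼지생식기호흡기증후군": 384,
    "결핵병": 40,
    "낭충봉아부패병": 25,
    "고병원성조류인플루엔자": 7,
    "가금티푸스": 5,
    "뉴캣슬병": 5,
    "돼지생식기호흡기증후군-생식기형": 4,
    "브루셀라병": 4,
    "돼지열병": 1,
}

total = sum(counts.values())  # should equal the number of observations
freq = {name: round(100 * n / total, 1) for name, n in counts.items()}
distinct_pct = round(100 * len(counts) / total, 1)

for name, pct in freq.items():
    print(f"{name}: {pct}%")
print(f"Distinct (%): {distinct_pct}%")
```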

Length

Histogram of lengths of the category

Common Values (Plot)

농장명
Text

Distinct: 208
Distinct (%): 43.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

Length

Max length: 26
Median length: 25
Mean length: 4.92
Min length: 2

Characters and Unicode

Total characters: 2337
Distinct characters: 193
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
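
As a small illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The short codes it returns correspond to the long names used in this report (`Lo` = Other Letter, `Po` = Other Punctuation, `Zs` = Space Separator, `Ps` = Open Punctuation, `Pe` = Close Punctuation, `Nd` = Decimal Number, `Lu` = Uppercase Letter). The helper name `category_counts` is hypothetical, and `김**` is a masked farm name that appears in the Sample section of this report.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the Unicode general category of every character in text."""
    return Counter(unicodedata.category(ch) for ch in text)

# One Hangul syllable (Other Letter) followed by two asterisks (Other Punctuation).
print(category_counts("김**"))

# Parentheses, a Hangul syllable, and a space fall into four different categories.
print(category_counts("(주) "))
```

Script and block statistics are not available from `unicodedata` directly, so they are left out of this sketch.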

Unique

Unique: 115
Unique (%): 24.2%

Sample

1st row: 청솔농장
2nd row: 덕림농장
3rd row: 현소농장
4th row: 꿀벌연구소
5th row: 우경농장

Value | Count | Frequency (%)
[not shown] | 25 | 4.8%
[not shown] | 19 | 3.6%
농업회사법인 | 16 | 3.0%
주식회사 | 14 | 2.7%
안성종돈장 | 13 | 2.5%
[not shown] | 11 | 2.1%
[not shown] | 11 | 2.1%
[not shown] | 9 | 1.7%
[not shown] | 9 | 1.7%
제일양돈영농조합법인 | 8 | 1.5%
Other values (207) | 390 | 74.3%

Most occurring characters

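Value | Count | Frequency (%)
* | 280 | 12.0%
[not shown] | 229 | 9.8%
[not shown] | 199 | 8.5%
(space) | 50 | 2.1%
[not shown] | 49 | 2.1%
[not shown] | 48 | 2.1%
[not shown] | 47 | 2.0%
[not shown] | 47 | 2.0%
[not shown] | 44 | 1.9%
[not shown] | 42 | 1.8%
Other values (183) | 1302 | 55.7%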

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1956 | 83.7%
Other Punctuation | 280 | 12.0%
Space Separator | 50 | 2.1%
Open Punctuation | 16 | 0.7%
Close Punctuation | 16 | 0.7%
Decimal Number | 10 | 0.4%
Uppercase Letter | 9 | 0.4%

Most frequent character per category

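Other Letter
Value | Count | Frequency (%)
[not shown] | 229 | 11.7%
[not shown] | 199 | 10.2%
[not shown] | 49 | 2.5%
[not shown] | 48 | 2.5%
[not shown] | 47 | 2.4%
[not shown] | 47 | 2.4%
[not shown] | 44 | 2.2%
[not shown] | 42 | 2.1%
[not shown] | 42 | 2.1%
[not shown] | 42 | 2.1%
Other values (171) | 1167 | 59.7%

Decimal Number
Value | Count | Frequency (%)
2 | 6 | 60.0%
1 | 2 | 20.0%
6 | 1 | 10.0%
4 | 1 | 10.0%

Uppercase Letter
Value | Count | Frequency (%)
K | 4 | 44.4%
J | 3 | 33.3%
S | 1 | 11.1%
B | 1 | 11.1%

Other Punctuation
Value | Count | Frequency (%)
* | 280 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 50 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 16 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 16 | 100.0%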

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1956 | 83.7%
Common | 372 | 15.9%
Latin | 9 | 0.4%

Most frequent character per script

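Hangul
Value | Count | Frequency (%)
[not shown] | 229 | 11.7%
[not shown] | 199 | 10.2%
[not shown] | 49 | 2.5%
[not shown] | 48 | 2.5%
[not shown] | 47 | 2.4%
[not shown] | 47 | 2.4%
[not shown] | 44 | 2.2%
[not shown] | 42 | 2.1%
[not shown] | 42 | 2.1%
[not shown] | 42 | 2.1%
Other values (171) | 1167 | 59.7%

Common
Value | Count | Frequency (%)
* | 280 | 75.3%
(space) | 50 | 13.4%
( | 16 | 4.3%
) | 16 | 4.3%
2 | 6 | 1.6%
1 | 2 | 0.5%
6 | 1 | 0.3%
4 | 1 | 0.3%

Latin
Value | Count | Frequency (%)
K | 4 | 44.4%
J | 3 | 33.3%
S | 1 | 11.1%
B | 1 | 11.1%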

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1956 | 83.7%
ASCII | 381 | 16.3%

Most frequent character per block

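ASCII
Value | Count | Frequency (%)
* | 280 | 73.5%
(space) | 50 | 13.1%
( | 16 | 4.2%
) | 16 | 4.2%
2 | 6 | 1.6%
K | 4 | 1.0%
J | 3 | 0.8%
1 | 2 | 0.5%
S | 1 | 0.3%
B | 1 | 0.3%
Other values (2) | 2 | 0.5%

Hangul
Value | Count | Frequency (%)
[not shown] | 229 | 11.7%
[not shown] | 199 | 10.2%
[not shown] | 49 | 2.5%
[not shown] | 48 | 2.5%
[not shown] | 47 | 2.4%
[not shown] | 47 | 2.4%
[not shown] | 44 | 2.2%
[not shown] | 42 | 2.1%
[not shown] | 42 | 2.1%
[not shown] | 42 | 2.1%
Other values (171) | 1167 | 59.7%
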
농장위치
Text

Distinct: 71
Distinct (%): 14.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

Length

Max length: 20
Median length: 19
Mean length: 19.016842
Min length: 14

Characters and Unicode

Total characters: 9033
Distinct characters: 101
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26
Unique (%): 5.5%

Sample

1st row: 제주특별자치도 제주시 한림읍 상대리
2nd row: 제주특별자치도 제주시 한림읍 명월리
3rd row: 제주특별자치도 제주시 한림읍 금악리
4th row: 제주특별자치도 제주시 영평동
5th row: 제주특별자치도 제주시 한림읍 금능리

Value | Count | Frequency (%)
제주특별자치도 | 475 | 25.5%
제주시 | 330 | 17.7%
한림읍 | 190 | 10.2%
서귀포시 | 145 | 7.8%
금악리 | 100 | 5.4%
대정읍 | 82 | 4.4%
애월읍 | 67 | 3.6%
동일리 | 41 | 2.2%
광령리 | 30 | 1.6%
상명리 | 27 | 1.5%
Other values (72) | 375 | 20.1%

Most occurring characters

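Value | Count | Frequency (%)
(space) | 1387 | 15.4%
[not shown] | 805 | 8.9%
[not shown] | 805 | 8.9%
[not shown] | 486 | 5.4%
[not shown] | 485 | 5.4%
[not shown] | 475 | 5.3%
[not shown] | 475 | 5.3%
[not shown] | 475 | 5.3%
[not shown] | 475 | 5.3%
[not shown] | 437 | 4.8%
Other values (91) | 2728 | 30.2%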

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 7640 | 84.6%
Space Separator | 1387 | 15.4%
Decimal Number | 6 | 0.1%

Most frequent character per category

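Other Letter
Value | Count | Frequency (%)
[not shown] | 805 | 10.5%
[not shown] | 805 | 10.5%
[not shown] | 486 | 6.4%
[not shown] | 485 | 6.3%
[not shown] | 475 | 6.2%
[not shown] | 475 | 6.2%
[not shown] | 475 | 6.2%
[not shown] | 475 | 6.2%
[not shown] | 437 | 5.7%
[not shown] | 404 | 5.3%
Other values (88) | 2318 | 30.3%

Decimal Number
Value | Count | Frequency (%)
1 | 3 | 50.0%
2 | 3 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1387 | 100.0%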

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 7640 | 84.6%
Common | 1393 | 15.4%

Most frequent character per script

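Hangul
Value | Count | Frequency (%)
[not shown] | 805 | 10.5%
[not shown] | 805 | 10.5%
[not shown] | 486 | 6.4%
[not shown] | 485 | 6.3%
[not shown] | 475 | 6.2%
[not shown] | 475 | 6.2%
[not shown] | 475 | 6.2%
[not shown] | 475 | 6.2%
[not shown] | 437 | 5.7%
[not shown] | 404 | 5.3%
Other values (88) | 2318 | 30.3%

Common
Value | Count | Frequency (%)
(space) | 1387 | 99.6%
1 | 3 | 0.2%
2 | 3 | 0.2%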

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 7640 | 84.6%
ASCII | 1393 | 15.4%

Most frequent character per block

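ASCII
Value | Count | Frequency (%)
(space) | 1387 | 99.6%
1 | 3 | 0.2%
2 | 3 | 0.2%

Hangul
Value | Count | Frequency (%)
[not shown] | 805 | 10.5%
[not shown] | 805 | 10.5%
[not shown] | 486 | 6.4%
[not shown] | 485 | 6.3%
[not shown] | 475 | 6.2%
[not shown] | 475 | 6.2%
[not shown] | 475 | 6.2%
[not shown] | 475 | 6.2%
[not shown] | 437 | 5.7%
[not shown] | 404 | 5.3%
Other values (88) | 2318 | 30.3%
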
발생일자
DateTime

Distinct: 398
Distinct (%): 83.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Minimum: 2000-06-21 00:00:00
Maximum: 2023-04-12 00:00:00
Histogram with fixed size bins (bins=50)

축종(품종)
Categorical

HIGH CORRELATION 

Distinct: 31
Distinct (%): 6.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

돼지-육성돈: 69
돼지-일반: 52
돼지-자돈: 45
돼지-이유자돈: 39
돼지-비육돈: 36
Other values (26): 234

Length

Max length: 7
Median length: 6
Mean length: 5.4694737
Min length: 1

Unique

Unique: 6
Unique (%): 1.3%

Sample

1st row: 돼지-육성돈
2nd row: 돼지-비분류
3rd row: 돼지-이유자돈
4th row: 벌-재래종
5th row: 돼지-자돈

Common Values

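Value | Count | Frequency (%)
돼지-육성돈 (pig, grower) | 69 | 14.5%
돼지-일반 (pig, general) | 52 | 10.9%
돼지-자돈 (pig, piglet) | 45 | 9.5%
돼지-이유자돈 (pig, weaned piglet) | 39 | 8.2%
돼지-비육돈 (pig, finisher) | 36 | 7.6%
돼지-삼원교잡 (pig, three-way cross) | 35 | 7.4%
소-한우 (cattle, Hanwoo) | 34 | 7.2%
돼지-비분류 (pig, unclassified) | 31 | 6.5%
돼지-종돈 (pig, breeding stock) | 18 | 3.8%
돼지-기타 (pig, other) | 17 | 3.6%
Other values (21) | 99 | 20.8%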

Length

Histogram of lengths of the category

발생두수(마리)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 35
Distinct (%): 7.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 46.442105
Minimum: 0
Maximum: 7000
Zeros: 7
Zeros (%): 1.5%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 1
median: 2
Q3: 5
95-th percentile: 20
Maximum: 7000
Range: 7000
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 395.66957
Coefficient of variation (CV): 8.5196303
Kurtosis: 213.56894
Mean: 46.442105
Median Absolute Deviation (MAD): 1
Skewness: 13.500186
Sum: 22060
Variance: 156554.41
Monotonicity: Not monotonic
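
Several of these statistics are derived from one another, which gives a quick consistency check. The sketch below recomputes the mean, coefficient of variation, variance, interquartile range, and range from the other reported figures. It uses only values printed in this report, with illustrative variable names, and the tolerances allow for the rounding of the printed numbers.

```python
import math

# Reported figures for 발생두수(마리).
n_observations = 475
total = 22060            # Sum
std = 395.66957          # Standard deviation
q1, q3 = 1, 5            # Q1 and Q3
minimum, maximum = 0, 7000

mean = total / n_observations   # Mean = Sum / number of observations
cv = std / mean                 # Coefficient of variation
variance = std ** 2             # Variance = standard deviation squared
iqr = q3 - q1                   # Interquartile range
value_range = maximum - minimum

print(f"mean={mean:.6f} cv={cv:.7f} variance={variance:.2f} "
      f"iqr={iqr} range={value_range}")
```

The mean (about 46.4) sits far above the median (2) and even the 95th percentile (20), so a handful of large outbreaks, up to 7000 animals, dominate the sum.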
Histogram with fixed size bins (bins=35)

Common values

Value | Count | Frequency (%)
1 | 148 | 31.2%
2 | 124 | 26.1%
3 | 40 | 8.4%
5 | 35 | 7.4%
4 | 33 | 6.9%
6 | 20 | 4.2%
8 | 9 | 1.9%
9 | 9 | 1.9%
7 | 8 | 1.7%
0 | 7 | 1.5%
Other values (25) | 42 | 8.8%

Minimum 10 values

Value | Count | Frequency (%)
0 | 7 | 1.5%
1 | 148 | 31.2%
2 | 124 | 26.1%
3 | 40 | 8.4%
4 | 33 | 6.9%
5 | 35 | 7.4%
6 | 20 | 4.2%
7 | 8 | 1.7%
8 | 9 | 1.9%
9 | 9 | 1.9%

Maximum 10 values

Value | Count | Frequency (%)
7000 | 1 | 0.2%
3000 | 1 | 0.2%
2550 | 1 | 0.2%
2000 | 2 | 0.4%
1100 | 1 | 0.2%
1000 | 1 | 0.2%
500 | 1 | 0.2%
350 | 1 | 0.2%
200 | 2 | 0.4%
120 | 1 | 0.2%

진단기관
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 8
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

제주 동물위생시험소: 453
조류질병과: 9
질병진단센터: 4
병리진단과: 4
질병진단과: 2
Other values (3): 3

Length

Max length: 12
Median length: 10
Mean length: 9.8042105
Min length: 5

Unique

Unique: 3
Unique (%): 0.6%

Sample

1st row: 제주 동물위생시험소
2nd row: 제주 동물위생시험소
3rd row: 제주 동물위생시험소
4th row: 제주 동물위생시험소
5th row: 제주 동물위생시험소

Common Values

Value | Count | Frequency (%)
제주 동물위생시험소 | 453 | 95.4%
조류질병과 | 9 | 1.9%
질병진단센터 | 4 | 0.8%
병리진단과 | 4 | 0.8%
질병진단과 | 2 | 0.4%
(주)포스트바이오 | 1 | 0.2%
조류인플루엔자연구진단과 | 1 | 0.2%
바이러스질병과 | 1 | 0.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
제주 | 453 | 48.8%
동물위생시험소 | 453 | 48.8%
조류질병과 | 9 | 1.0%
질병진단센터 | 4 | 0.4%
병리진단과 | 4 | 0.4%
질병진단과 | 2 | 0.2%
주)포스트바이오 | 1 | 0.1%
조류인플루엔자연구진단과 | 1 | 0.1%
바이러스질병과 | 1 | 0.1%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

2023-07-20: 475

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-07-20
2nd row: 2023-07-20
3rd row: 2023-07-20
4th row: 2023-07-20
5th row: 2023-07-20

Common Values

Value | Count | Frequency (%)
2023-07-20 | 475 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)


Interactions


Correlations

Variable | 가축전염병명 | 농장위치 | 축종(품종) | 발생두수(마리) | 진단기관
가축전염병명 | 1.000 | 0.871 | 0.974 | 0.736 | 0.814
농장위치 | 0.871 | 1.000 | 0.922 | 0.624 | 0.687
축종(품종) | 0.974 | 0.922 | 1.000 | 0.854 | 0.882
발생두수(마리) | 0.736 | 0.624 | 0.854 | 1.000 | 0.146
진단기관 | 0.814 | 0.687 | 0.882 | 0.146 | 1.000

Variable | 가축전염병명 | 진단기관 | 축종(품종)
가축전염병명 | 1.000 | 0.578 | 0.826
진단기관 | 0.578 | 1.000 | 0.586
축종(품종) | 0.826 | 0.586 | 1.000

Variable | 발생두수(마리) | 가축전염병명 | 축종(품종) | 진단기관
발생두수(마리) | 1.000 | 0.474 | 0.523 | 0.081
가축전염병명 | 0.474 | 1.000 | 0.826 | 0.578
축종(품종) | 0.523 | 0.826 | 1.000 | 0.586
진단기관 | 0.081 | 0.578 | 0.586 | 1.000
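
The High correlation alerts at the top of the report can be cross-checked against the last matrix above. The sketch below assumes an alert is raised when the absolute coefficient between two variables exceeds 0.5; that threshold is an assumption and is not stated anywhere in this report. The helper `correlated_with` is hypothetical.

```python
THRESHOLD = 0.5  # assumed alert threshold, not given in the report

# Values copied from the last correlation matrix above.
labels = ["발생두수(마리)", "가축전염병명", "축종(품종)", "진단기관"]
matrix = [
    [1.000, 0.474, 0.523, 0.081],
    [0.474, 1.000, 0.826, 0.578],
    [0.523, 0.826, 1.000, 0.586],
    [0.081, 0.578, 0.586, 1.000],
]

def correlated_with(matrix, labels, threshold):
    """Map each variable to the other variables whose |coefficient| > threshold."""
    result = {}
    for i, name in enumerate(labels):
        partners = [
            labels[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) > threshold
        ]
        if partners:
            result[name] = partners
    return result

high = correlated_with(matrix, labels, THRESHOLD)
for name, partners in high.items():
    print(f"{name}: {', '.join(partners)}")
```

Under this assumption the output reproduces the four alerts: 발생두수(마리) pairs only with 축종(품종), 축종(품종) pairs with three fields, and 가축전염병명 and 진단기관 pair with two fields each.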

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 가축전염병명 | 농장명 | 농장위치 | 발생일자 | 축종(품종) | 발생두수(마리) | 진단기관 | 데이터기준일자
0 | 돼지생식기호흡기증후군 | 청솔농장 | 제주특별자치도 제주시 한림읍 상대리 | 2023-04-12 | 돼지-육성돈 | 2 | 제주 동물위생시험소 | 2023-07-20
1 | 돼지생식기호흡기증후군 | 덕림농장 | 제주특별자치도 제주시 한림읍 명월리 | 2023-03-14 | 돼지-비분류 | 3 | 제주 동물위생시험소 | 2023-07-20
2 | 돼지생식기호흡기증후군 | 현소농장 | 제주특별자치도 제주시 한림읍 금악리 | 2023-03-02 | 돼지-이유자돈 | 3 | 제주 동물위생시험소 | 2023-07-20
3 | 낭충봉아부패병 | 꿀벌연구소 | 제주특별자치도 제주시 영평동 | 2023-02-20 | 벌-재래종 | 1 | 제주 동물위생시험소 | 2023-07-20
4 | 돼지생식기호흡기증후군 | 우경농장 | 제주특별자치도 제주시 한림읍 금능리 | 2023-02-01 | 돼지-자돈 | 1 | 제주 동물위생시험소 | 2023-07-20
5 | 돼지생식기호흡기증후군 | 은성농장 | 제주특별자치도 제주시 한림읍 상명리 | 2023-01-31 | 돼지-비분류 | 1 | 제주 동물위생시험소 | 2023-07-20
6 | 돼지생식기호흡기증후군 | 성주농장 | 제주특별자치도 제주시 한림읍 명월리 | 2022-12-21 | 돼지-이유자돈 | 2 | 제주 동물위생시험소 | 2023-07-20
7 | 돼지생식기호흡기증후군 | 농업회사법인 유한회사 한솔 | 제주특별자치도 제주시 한림읍 상명리 | 2022-12-02 | 돼지-자돈 | 2 | 제주 동물위생시험소 | 2023-07-20
8 | 돼지생식기호흡기증후군 | 서흥축산영농조합법인 | 제주특별자치도 서귀포시 대포동 | 2022-11-29 | 돼지-일반 | 2 | 제주 동물위생시험소 | 2023-07-20
9 | 돼지생식기호흡기증후군 | 은성농장 | 제주특별자치도 제주시 한림읍 상명리 | 2022-11-10 | 돼지-이유자돈 | 1 | 제주 동물위생시험소 | 2023-07-20

Last rows

Index | 가축전염병명 | 농장명 | 농장위치 | 발생일자 | 축종(품종) | 발생두수(마리) | 진단기관 | 데이터기준일자
465 | 돼지생식기호흡기증후군 | 김** | 제주특별자치도 제주시 구좌읍 세화리 | 2004-12-07 | 돼지-육성돈 | 2 | 병리진단과 | 2023-07-20
466 | 뉴캣슬병 | 양** | 제주특별자치도 제주시 애월읍 고성리 | 2004-06-23 | 닭-육계 | 7000 | 제주 동물위생시험소 | 2023-07-20
467 | 브루셀라병 | 문** | 제주특별자치도 서귀포시 대정읍 신도리 | 2004-04-07 | 소-한우 | 1 | 제주 동물위생시험소 | 2023-07-20
468 | 브루셀라병 | 이** | 제주특별자치도 서귀포시 표선면 가시리 | 2004-03-06 | 소-한우 | 10 | 제주 동물위생시험소 | 2023-07-20
469 | 돼지생식기호흡기증후군 | 진** | 제주특별자치도 제주시 한림읍 명월리 | 2001-06-08 | 돼지-육성돈 | 100 | 병리진단과 | 2023-07-20
470 | 돼지생식기호흡기증후군 | 김** | 제주특별자치도 제주시 한림읍 금악리 | 2001-02-05 | 돼지-육성돈 | 350 | 병리진단과 | 2023-07-20
471 | 브루셀라병 | 이** | 제주특별자치도 제주시 한림읍 금악리 | 2000-11-23 | 소-육우 | 1 | 제주 동물위생시험소 | 2023-07-20
472 | 브루셀라병 | 이** | 제주특별자치도 제주시 조천읍 선흘리 | 2000-11-11 | 소-육우 | 1 | 제주 동물위생시험소 | 2023-07-20
473 | 뉴캣슬병 | 강** | 제주특별자치도 제주시 한림읍 금능리 | 2000-06-25 | [not shown] | 1000 | 조류질병과 | 2023-07-20
474 | 뉴캣슬병 | 설** | 제주특별자치도 제주시 한림읍 금능리 | 2000-06-21 | [not shown] | 2000 | 조류질병과 | 2023-07-20