Overview

Dataset statistics

Number of variables6
Number of observations245
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.6 KiB
Average record size in memory52.5 B

Variable types

Categorical3
Numeric3

Dataset

Description경상남도 수산생물 질병 발생 현황 월별 조사결과입니다.(해양, 어류, 갑각류에 대한 질병발생 품종, 질병, 발생건수, 발생률 데이터를 제공합니다.)
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3076265

Alerts

발생건수 is highly overall correlated with 발생률 and 1 other fieldsHigh correlation
발생률 is highly overall correlated with 발생건수 and 1 other fieldsHigh correlation
품종 is highly overall correlated with 발생건수 and 1 other fieldsHigh correlation
발생건수 has 5 (2.0%) zerosZeros
발생률 has 5 (2.0%) zerosZeros

Reproduction

Analysis started2023-12-11 00:06:06.928174
Analysis finished2023-12-11 00:06:08.480926
Duration1.55 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Categorical

Distinct3
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2022
110 
2023
69 
2021
66 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2022 110
44.9%
2023 69
28.2%
2021 66
26.9%

Length

2023-12-11T09:06:08.564419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:06:08.656989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 110
44.9%
2023 69
28.2%
2021 66
26.9%


Real number (ℝ)

Distinct12
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.044898
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-11T09:06:08.748174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q15
median7
Q39
95-th percentile11.8
Maximum12
Range11
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.9017421
Coefficient of variation (CV)0.4118927
Kurtosis-0.67895578
Mean7.044898
Median Absolute Deviation (MAD)2
Skewness-0.2478871
Sum1726
Variance8.4201071
MonotonicityNot monotonic
2023-12-11T09:06:08.867228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
7 48
19.6%
9 28
11.4%
8 24
9.8%
10 21
8.6%
6 21
8.6%
11 20
8.2%
4 20
8.2%
5 17
 
6.9%
12 13
 
5.3%
2 12
 
4.9%
Other values (2) 21
8.6%
ValueCountFrequency (%)
1 9
 
3.7%
2 12
 
4.9%
3 12
 
4.9%
4 20
8.2%
5 17
 
6.9%
6 21
8.6%
7 48
19.6%
8 24
9.8%
9 28
11.4%
10 21
8.6%
ValueCountFrequency (%)
12 13
 
5.3%
11 20
8.2%
10 21
8.6%
9 28
11.4%
8 24
9.8%
7 48
19.6%
6 21
8.6%
5 17
 
6.9%
4 20
8.2%
3 12
 
4.9%

품종
Categorical

HIGH CORRELATION 

Distinct46
Distinct (%)18.8%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
조피볼락
55 
넙치
54 
참돔
23 
감성돔
22 
숭어
16 
Other values (41)
75 

Length

Max length127
Median length117
Mean length10.453061
Min length2

Unique

Unique33 ?
Unique (%)13.5%

Sample

1st row조피볼락
2nd row조피볼락
3rd row넙치
4th row넙치
5th row넙치

Common Values

ValueCountFrequency (%)
조피볼락 55
22.4%
넙치 54
22.0%
참돔 23
9.4%
감성돔 22
 
9.0%
숭어 16
 
6.5%
돌돔 11
 
4.5%
방어 11
 
4.5%
점농어 5
 
2.0%
자주복 5
 
2.0%
말쥐치 3
 
1.2%
Other values (36) 40
16.3%

Length

2023-12-11T09:06:08.996333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
조피볼락 80
 
12.7%
넙치 79
 
12.6%
참돔 49
 
7.8%
감성돔 45
 
7.2%
숭어 41
 
6.5%
돌돔 35
 
5.6%
방어 26
 
4.1%
흰다리새우 20
 
3.2%
말쥐치 19
 
3.0%
잉어 17
 
2.7%
Other values (51) 217
34.6%

병명
Categorical

Distinct28
Distinct (%)11.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
비브리오병
42 
연쇄구균증
37 
아가미흡충
32 
없음
25 
비브리오증
19 
Other values (23)
90 

Length

Max length8
Median length5
Mean length4.6040816
Min length2

Unique

Unique10 ?
Unique (%)4.1%

Sample

1st row아가미흡충
2nd row연쇄구균
3rd row스쿠티카충
4th row트리코디나충
5th row연쇄구균

Common Values

ValueCountFrequency (%)
비브리오병 42
17.1%
연쇄구균증 37
15.1%
아가미흡충 32
13.1%
없음 25
10.2%
비브리오증 19
7.8%
스쿠티카충 15
 
6.1%
트리코디나 13
 
5.3%
연쇄구균 12
 
4.9%
트리코디나충 7
 
2.9%
활주세균증 7
 
2.9%
Other values (18) 36
14.7%

Length

2023-12-11T09:06:09.121769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
비브리오병 42
17.1%
연쇄구균증 37
15.1%
아가미흡충 32
13.1%
없음 25
10.2%
비브리오증 19
7.8%
스쿠티카충 15
 
6.1%
트리코디나 13
 
5.3%
연쇄구균 12
 
4.9%
트리코디나충 7
 
2.9%
활주세균증 7
 
2.9%
Other values (18) 36
14.7%

발생건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct43
Distinct (%)17.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.2734694
Minimum0
Maximum94
Zeros5
Zeros (%)2.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-11T09:06:09.265223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q34
95-th percentile57.4
Maximum94
Range94
Interquartile range (IQR)3

Descriptive statistics

Standard deviation19.537341
Coefficient of variation (CV)2.1067995
Kurtosis7.9528244
Mean9.2734694
Median Absolute Deviation (MAD)1
Skewness2.9086128
Sum2272
Variance381.70769
MonotonicityNot monotonic
2023-12-11T09:06:09.432027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
1 117
47.8%
2 39
 
15.9%
3 17
 
6.9%
4 10
 
4.1%
0 5
 
2.0%
6 5
 
2.0%
7 4
 
1.6%
33 3
 
1.2%
5 3
 
1.2%
28 3
 
1.2%
Other values (33) 39
 
15.9%
ValueCountFrequency (%)
0 5
 
2.0%
1 117
47.8%
2 39
 
15.9%
3 17
 
6.9%
4 10
 
4.1%
5 3
 
1.2%
6 5
 
2.0%
7 4
 
1.6%
8 1
 
0.4%
9 2
 
0.8%
ValueCountFrequency (%)
94 1
0.4%
91 1
0.4%
90 1
0.4%
88 2
0.8%
83 1
0.4%
82 1
0.4%
81 1
0.4%
76 1
0.4%
75 1
0.4%
67 2
0.8%

발생률
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct82
Distinct (%)33.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.212163
Minimum0
Maximum96.4
Zeros5
Zeros (%)2.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-11T09:06:09.571848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11.1
median1.8
Q34.6
95-th percentile66.4
Maximum96.4
Range96.4
Interquartile range (IQR)3.5

Descriptive statistics

Standard deviation21.498591
Coefficient of variation (CV)2.1051946
Kurtosis7.5137173
Mean10.212163
Median Absolute Deviation (MAD)0.8
Skewness2.863544
Sum2501.98
Variance462.18943
MonotonicityNot monotonic
2023-12-11T09:06:09.724676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.0 43
17.6%
1.2 34
 
13.9%
1.1 24
 
9.8%
2.0 13
 
5.3%
1.4 13
 
5.3%
2.4 10
 
4.1%
3.0 5
 
2.0%
0.0 5
 
2.0%
3.1 5
 
2.0%
4.1 4
 
1.6%
Other values (72) 89
36.3%
ValueCountFrequency (%)
0.0 5
 
2.0%
0.9 3
 
1.2%
1.0 43
17.6%
1.1 24
9.8%
1.2 34
13.9%
1.4 13
 
5.3%
1.8 2
 
0.8%
2.0 13
 
5.3%
2.1 4
 
1.6%
2.2 1
 
0.4%
ValueCountFrequency (%)
96.4 1
0.4%
95.9 1
0.4%
95.7 1
0.4%
92.2 1
0.4%
91.7 1
0.4%
91.0 1
0.4%
90.54 1
0.4%
90.5 1
0.4%
89.1 1
0.4%
88.2 1
0.4%

Interactions

2023-12-11T09:06:08.020017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:06:07.276559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:06:07.512682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:06:08.117322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:06:07.351244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:06:07.852535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:06:08.212329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:06:07.437330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:06:07.926538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:06:09.819659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도품종병명발생건수발생률
연도1.0000.5890.1130.6490.1510.175
0.5891.0000.3260.4970.3510.343
품종0.1130.3261.0000.5610.9770.964
병명0.6490.4970.5611.0000.2560.422
발생건수0.1510.3510.9770.2561.0000.939
발생률0.1750.3430.9640.4220.9391.000
2023-12-11T09:06:09.939434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도병명품종
연도1.0000.4030.041
병명0.4031.0000.138
품종0.0410.1381.000
2023-12-11T09:06:10.053370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발생건수발생률연도품종병명
1.0000.0090.1870.4260.1040.193
발생건수0.0091.0000.9460.0880.7660.088
발생률0.1870.9461.0000.0750.7200.163
연도0.4260.0880.0751.0000.0410.403
품종0.1040.7660.7200.0411.0000.138
병명0.1930.0880.1630.4030.1381.000

Missing values

2023-12-11T09:06:08.324448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:06:08.440160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도품종병명발생건수발생률
020217조피볼락아가미흡충3131.0
120217조피볼락연쇄구균33.0
220217넙치스쿠티카충77.0
320217넙치트리코디나충33.0
420217넙치연쇄구균11.0
520217참돔구두충11.0
620217참돔비브리오병11.0
720217참돔녹간증11.0
820217감성돔닥틸로자이로스11.0
920217돌돔베네데니아11.0
연도품종병명발생건수발생률
23520237점농어연쇄구균증33.1
23620237넙치비브리오증00.0
23720237넙치활주세균증11.0
23820237넙치연쇄구균증11.0
23920237넙치트리코디나44.1
24020237넙치스쿠티카충11.0
24120237말쥐치연쇄구균증22.0
24220237방어베네데니아11.0
24320237방어연쇄구균증11.0
24420237넙치, 볼락, 조피볼락, 쥐치, 돌돔, 강도다리, 참돔, 숭어, 농어, 감성돔, 말쥐치, 방어, 메기, 잉어, 장어, 흰다리새우, 철갑상어, 우렁이없음4040.8