Overview

Dataset statistics

Number of variables15
Number of observations31
Missing cells287
Missing cells (%)61.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.2 KiB
Average record size in memory137.3 B

Variable types

Categorical4
Unsupported9
Numeric1
DateTime1

Dataset

Description홍천군 가축전염병 발생현황(발생연도, 읍면동, 구제역, 돼지열병, 돼지오제스키병, 돼지생식기호흡기 중후군, 브루셀라병, 결핵병, 고병원성조류인플루엔자, 추백리, 기금티푸스, 뉴캣슬병, 사슴만성소모성질병, 낭충봉아부패병, 기준일자)
Author강원도 홍천군
URLhttps://www.data.go.kr/data/15092394/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
가금티푸스(건) is highly imbalanced (79.4%)Imbalance
구제역(건) has 31 (100.0%) missing valuesMissing
돼지열병(건) has 31 (100.0%) missing valuesMissing
돼지오제스키병(건) has 31 (100.0%) missing valuesMissing
돼지생식기호흡기 증후군(건) has 31 (100.0%) missing valuesMissing
브루셀라병(건) has 31 (100.0%) missing valuesMissing
고병원성조류인플루엔자(건) has 31 (100.0%) missing valuesMissing
추백리(건) has 31 (100.0%) missing valuesMissing
뉴캣슬병(건) has 31 (100.0%) missing valuesMissing
사슴만성소모성질병(건) has 31 (100.0%) missing valuesMissing
낭충봉아부패병(건) has 8 (25.8%) missing valuesMissing
구제역(건) is an unsupported type, check if it needs cleaning or further analysisUnsupported
돼지열병(건) is an unsupported type, check if it needs cleaning or further analysisUnsupported
돼지오제스키병(건) is an unsupported type, check if it needs cleaning or further analysisUnsupported
돼지생식기호흡기 증후군(건) is an unsupported type, check if it needs cleaning or further analysisUnsupported
브루셀라병(건) is an unsupported type, check if it needs cleaning or further analysisUnsupported
고병원성조류인플루엔자(건) is an unsupported type, check if it needs cleaning or further analysisUnsupported
추백리(건) is an unsupported type, check if it needs cleaning or further analysisUnsupported
뉴캣슬병(건) is an unsupported type, check if it needs cleaning or further analysisUnsupported
사슴만성소모성질병(건) is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 10:42:40.194596
Analysis finished2023-12-12 10:42:40.978381
Duration0.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

발생년도
Categorical

Distinct5
Distinct (%)16.1%
Missing0
Missing (%)0.0%
Memory size380.0 B
2018
10 
2019
10 
2016
2020
2017

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2016
2nd row2016
3rd row2016
4th row2016
5th row2017

Common Values

ValueCountFrequency (%)
2018 10
32.3%
2019 10
32.3%
2016 4
 
12.9%
2020 4
 
12.9%
2017 3
 
9.7%

Length

2023-12-12T19:42:41.064720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:42:41.200703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018 10
32.3%
2019 10
32.3%
2016 4
 
12.9%
2020 4
 
12.9%
2017 3
 
9.7%

읍면동
Categorical

Distinct10
Distinct (%)32.3%
Missing0
Missing (%)0.0%
Memory size380.0 B
화촌면
북방면
서면
두촌면
홍천읍
Other values (5)
12 

Length

Max length4
Median length3
Mean length2.8064516
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row두촌면
2nd row북방면
3rd row화촌면
4th row서면
5th row홍천읍

Common Values

ValueCountFrequency (%)
화촌면 5
16.1%
북방면 4
12.9%
서면 4
12.9%
두촌면 3
9.7%
홍천읍 3
9.7%
내면 3
9.7%
영귀미면 3
9.7%
남면 2
 
6.5%
내촌면 2
 
6.5%
서석면 2
 
6.5%

Length

2023-12-12T19:42:41.380299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:42:41.534025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
화촌면 5
16.1%
북방면 4
12.9%
서면 4
12.9%
두촌면 3
9.7%
홍천읍 3
9.7%
내면 3
9.7%
영귀미면 3
9.7%
남면 2
 
6.5%
내촌면 2
 
6.5%
서석면 2
 
6.5%

구제역(건)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)100.0%
Memory size411.0 B

돼지열병(건)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)100.0%
Memory size411.0 B

돼지오제스키병(건)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)100.0%
Memory size411.0 B

돼지생식기호흡기 증후군(건)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)100.0%
Memory size411.0 B

브루셀라병(건)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)100.0%
Memory size411.0 B

결핵병(건)
Categorical

Distinct4
Distinct (%)12.9%
Missing0
Missing (%)0.0%
Memory size380.0 B
<NA>
23 
1
2
 
1
5
 
1

Length

Max length4
Median length4
Mean length3.2258065
Min length1

Unique

Unique2 ?
Unique (%)6.5%

Sample

1st row<NA>
2nd row2
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
<NA> 23
74.2%
1 6
 
19.4%
2 1
 
3.2%
5 1
 
3.2%

Length

2023-12-12T19:42:41.715376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:42:41.889486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 23
74.2%
1 6
 
19.4%
2 1
 
3.2%
5 1
 
3.2%

고병원성조류인플루엔자(건)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)100.0%
Memory size411.0 B

추백리(건)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)100.0%
Memory size411.0 B

가금티푸스(건)
Categorical

IMBALANCE 

Distinct2
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size380.0 B
<NA>
30 
1
 
1

Length

Max length4
Median length4
Mean length3.9032258
Min length1

Unique

Unique1 ?
Unique (%)3.2%

Sample

1st row1
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 30
96.8%
1 1
 
3.2%

Length

2023-12-12T19:42:42.041973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:42:42.175461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 30
96.8%
1 1
 
3.2%

뉴캣슬병(건)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)100.0%
Memory size411.0 B

사슴만성소모성질병(건)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)100.0%
Memory size411.0 B

낭충봉아부패병(건)
Real number (ℝ)

MISSING 

Distinct10
Distinct (%)43.5%
Missing8
Missing (%)25.8%
Infinite0
Infinite (%)0.0%
Mean4.5652174
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-12T19:42:42.296194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q37
95-th percentile10.8
Maximum12
Range11
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.258919
Coefficient of variation (CV)0.71385846
Kurtosis-0.15708594
Mean4.5652174
Median Absolute Deviation (MAD)2
Skewness0.91860927
Sum105
Variance10.620553
MonotonicityNot monotonic
2023-12-12T19:42:42.437199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
2 5
16.1%
3 4
12.9%
1 3
 
9.7%
7 2
 
6.5%
4 2
 
6.5%
8 2
 
6.5%
5 2
 
6.5%
9 1
 
3.2%
12 1
 
3.2%
11 1
 
3.2%
(Missing) 8
25.8%
ValueCountFrequency (%)
1 3
9.7%
2 5
16.1%
3 4
12.9%
4 2
 
6.5%
5 2
 
6.5%
7 2
 
6.5%
8 2
 
6.5%
9 1
 
3.2%
11 1
 
3.2%
12 1
 
3.2%
ValueCountFrequency (%)
12 1
 
3.2%
11 1
 
3.2%
9 1
 
3.2%
8 2
 
6.5%
7 2
 
6.5%
5 2
 
6.5%
4 2
 
6.5%
3 4
12.9%
2 5
16.1%
1 3
9.7%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size380.0 B
Minimum2021-10-19 00:00:00
Maximum2021-10-19 00:00:00
2023-12-12T19:42:42.562387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:42:42.703460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T19:42:40.481521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:42:42.794530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발생년도읍면동결핵병(건)낭충봉아부패병(건)
발생년도1.0000.0000.0000.000
읍면동0.0001.0000.0000.675
결핵병(건)0.0000.0001.000NaN
낭충봉아부패병(건)0.0000.675NaN1.000
2023-12-12T19:42:42.916714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가금티푸스(건)결핵병(건)발생년도읍면동
가금티푸스(건)1.000NaNNaNNaN
결핵병(건)NaN1.0000.0000.000
발생년도NaN0.0001.0000.000
읍면동NaN0.0000.0001.000
2023-12-12T19:42:43.021608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
낭충봉아부패병(건)발생년도읍면동결핵병(건)가금티푸스(건)
낭충봉아부패병(건)1.0000.0000.347NaN0.000
발생년도0.0001.0000.0000.000NaN
읍면동0.3470.0001.0000.000NaN
결핵병(건)NaN0.0000.0001.0000.000
가금티푸스(건)0.000NaNNaN0.0001.000

Missing values

2023-12-12T19:42:40.635567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:42:40.882779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

발생년도읍면동구제역(건)돼지열병(건)돼지오제스키병(건)돼지생식기호흡기 증후군(건)브루셀라병(건)결핵병(건)고병원성조류인플루엔자(건)추백리(건)가금티푸스(건)뉴캣슬병(건)사슴만성소모성질병(건)낭충봉아부패병(건)데이터기준일자
02016두촌면<NA><NA><NA><NA><NA><NA><NA><NA>1<NA><NA><NA>2021-10-19
12016북방면<NA><NA><NA><NA><NA>2<NA><NA><NA><NA><NA><NA>2021-10-19
22016화촌면<NA><NA><NA><NA><NA>1<NA><NA><NA><NA><NA><NA>2021-10-19
32016서면<NA><NA><NA><NA><NA>1<NA><NA><NA><NA><NA><NA>2021-10-19
42017홍천읍<NA><NA><NA><NA><NA>1<NA><NA><NA><NA><NA><NA>2021-10-19
52017화촌면<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>12021-10-19
62017내면<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>12021-10-19
72018남면<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>72021-10-19
82018내면<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>12021-10-19
92018내촌면<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>92021-10-19
발생년도읍면동구제역(건)돼지열병(건)돼지오제스키병(건)돼지생식기호흡기 증후군(건)브루셀라병(건)결핵병(건)고병원성조류인플루엔자(건)추백리(건)가금티푸스(건)뉴캣슬병(건)사슴만성소모성질병(건)낭충봉아부패병(건)데이터기준일자
212019두촌면<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>42021-10-19
222019북방면<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>22021-10-19
232019서면<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>72021-10-19
242019서석면<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>52021-10-19
252019홍천읍<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>32021-10-19
262019화촌면<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>82021-10-19
272020영귀미면<NA><NA><NA><NA><NA>5<NA><NA><NA><NA><NA><NA>2021-10-19
282020북방면<NA><NA><NA><NA><NA>1<NA><NA><NA><NA><NA><NA>2021-10-19
292020화촌면<NA><NA><NA><NA><NA>1<NA><NA><NA><NA><NA><NA>2021-10-19
302020서면<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>22021-10-19