Overview

Dataset statistics

Number of variables7
Number of observations21
Missing cells8
Missing cells (%)5.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory62.3 B

Variable types

Text3
Categorical4

Dataset

Description가축전염병발생현황201511
Author전라북도
URLhttps://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=202574

Alerts

Unnamed: 1 is highly overall correlated with Unnamed: 2 and 2 other fieldsHigh correlation
Unnamed: 2 is highly overall correlated with Unnamed: 1 and 2 other fieldsHigh correlation
Unnamed: 3 is highly overall correlated with Unnamed: 1 and 2 other fieldsHigh correlation
Unnamed: 5 is highly overall correlated with Unnamed: 1 and 2 other fieldsHigh correlation
Unnamed: 4 has 4 (19.0%) missing valuesMissing
Unnamed: 6 has 4 (19.0%) missing valuesMissing

Reproduction

Analysis started2024-03-14 02:40:06.954870
Analysis finished2024-03-14 02:40:07.505215
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct13
Distinct (%)61.9%
Missing0
Missing (%)0.0%
Memory size300.0 B
2024-03-14T11:40:07.603508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length5.0952381
Min length2

Characters and Unicode

Total characters107
Distinct characters52
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)38.1%

Sample

1st row구분
2nd row2013
3rd row가금티프스
4th row결핵병
5th row돼지생식기호흡기증후군
ValueCountFrequency (%)
가금티프스 3
13.0%
결핵병 3
13.0%
브루셀라병 3
13.0%
고병원성 2
8.7%
조류인플루엔자 2
8.7%
총합계 2
8.7%
구분 1
 
4.3%
2013 1
 
4.3%
돼지생식기호흡기증후군 1
 
4.3%
추백리 1
 
4.3%
Other values (4) 4
17.4%
2024-03-14T11:40:07.838227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8
 
7.5%
5
 
4.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2 3
 
2.8%
0 3
 
2.8%
1 3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (42) 70
65.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 93
86.9%
Decimal Number 12
 
11.2%
Space Separator 2
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
8.6%
5
 
5.4%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (35) 56
60.2%
Decimal Number
ValueCountFrequency (%)
2 3
25.0%
0 3
25.0%
1 3
25.0%
5 1
 
8.3%
4 1
 
8.3%
3 1
 
8.3%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 93
86.9%
Common 14
 
13.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
8.6%
5
 
5.4%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (35) 56
60.2%
Common
ValueCountFrequency (%)
2 3
21.4%
0 3
21.4%
1 3
21.4%
2
14.3%
5 1
 
7.1%
4 1
 
7.1%
3 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 93
86.9%
ASCII 14
 
13.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8
 
8.6%
5
 
5.4%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
Other values (35) 56
60.2%
ASCII
ValueCountFrequency (%)
2 3
21.4%
0 3
21.4%
1 3
21.4%
2
14.3%
5 1
 
7.1%
4 1
 
7.1%
3 1
 
7.1%

Unnamed: 1
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)42.9%
Missing0
Missing (%)0.0%
Memory size300.0 B
-
10 
<NA>
1
 
1
12
 
1
Other values (4)

Length

Max length4
Median length1
Mean length1.6190476
Min length1

Unique

Unique6 ?
Unique (%)28.6%

Sample

1st row
2nd row<NA>
3rd row12
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 10
47.6%
<NA> 3
 
14.3%
1 2
 
9.5%
1
 
4.8%
12 1
 
4.8%
2 1
 
4.8%
16 1
 
4.8%
20 1
 
4.8%
36 1
 
4.8%

Length

2024-03-14T11:40:07.944060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T11:40:08.042579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
10
47.6%
na 3
 
14.3%
1 2
 
9.5%
1
 
4.8%
12 1
 
4.8%
2 1
 
4.8%
16 1
 
4.8%
20 1
 
4.8%
36 1
 
4.8%

Unnamed: 2
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size300.0 B
-
13 
<NA>
1
돼지
 
1
13
 
1

Length

Max length4
Median length1
Mean length1.5714286
Min length1

Unique

Unique3 ?
Unique (%)14.3%

Sample

1st row돼지
2nd row<NA>
3rd row-
4th row-
5th row1

Common Values

ValueCountFrequency (%)
- 13
61.9%
<NA> 3
 
14.3%
1 2
 
9.5%
돼지 1
 
4.8%
13 1
 
4.8%
14 1
 
4.8%

Length

2024-03-14T11:40:08.152871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T11:40:08.274360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
13
61.9%
na 3
 
14.3%
1 2
 
9.5%
돼지 1
 
4.8%
13 1
 
4.8%
14 1
 
4.8%

Unnamed: 3
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
-
14 
<NA>
1
사슴
 
1

Length

Max length4
Median length1
Mean length1.4761905
Min length1

Unique

Unique1 ?
Unique (%)4.8%

Sample

1st row사슴
2nd row<NA>
3rd row-
4th row1
5th row-

Common Values

ValueCountFrequency (%)
- 14
66.7%
<NA> 3
 
14.3%
1 3
 
14.3%
사슴 1
 
4.8%

Length

2024-03-14T11:40:08.404052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T11:40:08.511294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
14
66.7%
na 3
 
14.3%
1 3
 
14.3%
사슴 1
 
4.8%

Unnamed: 4
Text

MISSING 

Distinct11
Distinct (%)64.7%
Missing4
Missing (%)19.0%
Memory size300.0 B
2024-03-14T11:40:08.608016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length1
Mean length1.3529412
Min length1

Characters and Unicode

Total characters23
Distinct characters11
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)58.8%

Sample

1st row
2nd row-
3rd row50
4th row-
5th row24
ValueCountFrequency (%)
7
41.2%
1
 
5.9%
50 1
 
5.9%
24 1
 
5.9%
3 1
 
5.9%
2 1
 
5.9%
79 1
 
5.9%
48 1
 
5.9%
1 1
 
5.9%
5 1
 
5.9%
2024-03-14T11:40:08.872494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 7
30.4%
3 3
13.0%
5 2
 
8.7%
2 2
 
8.7%
4 2
 
8.7%
1 2
 
8.7%
1
 
4.3%
0 1
 
4.3%
7 1
 
4.3%
9 1
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 15
65.2%
Dash Punctuation 7
30.4%
Other Letter 1
 
4.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 3
20.0%
5 2
13.3%
2 2
13.3%
4 2
13.3%
1 2
13.3%
0 1
 
6.7%
7 1
 
6.7%
9 1
 
6.7%
8 1
 
6.7%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Other Letter
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 22
95.7%
Hangul 1
 
4.3%

Most frequent character per script

Common
ValueCountFrequency (%)
- 7
31.8%
3 3
13.6%
5 2
 
9.1%
2 2
 
9.1%
4 2
 
9.1%
1 2
 
9.1%
0 1
 
4.5%
7 1
 
4.5%
9 1
 
4.5%
8 1
 
4.5%
Hangul
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 22
95.7%
Hangul 1
 
4.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 7
31.8%
3 3
13.6%
5 2
 
9.1%
2 2
 
9.1%
4 2
 
9.1%
1 2
 
9.1%
0 1
 
4.5%
7 1
 
4.5%
9 1
 
4.5%
8 1
 
4.5%
Hangul
ValueCountFrequency (%)
1
100.0%

Unnamed: 5
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
-
14 
<NA>
6
오리
 
1

Length

Max length4
Median length1
Mean length1.4761905
Min length1

Unique

Unique1 ?
Unique (%)4.8%

Sample

1st row오리
2nd row<NA>
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 14
66.7%
<NA> 3
 
14.3%
6 3
 
14.3%
오리 1
 
4.8%

Length

2024-03-14T11:40:08.977997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T11:40:09.065159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
14
66.7%
na 3
 
14.3%
6 3
 
14.3%
오리 1
 
4.8%

Unnamed: 6
Text

MISSING 

Distinct14
Distinct (%)82.4%
Missing4
Missing (%)19.0%
Memory size300.0 B
2024-03-14T11:40:09.159440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2
Mean length1.7058824
Min length1

Characters and Unicode

Total characters29
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)70.6%

Sample

1st row총합계
2nd row12
3rd row51
4th row1
5th row24
ValueCountFrequency (%)
1 3
17.6%
2 2
11.8%
총합계 1
 
5.9%
12 1
 
5.9%
51 1
 
5.9%
24 1
 
5.9%
3 1
 
5.9%
7 1
 
5.9%
103 1
 
5.9%
20 1
 
5.9%
Other values (4) 4
23.5%
2024-03-14T11:40:09.368773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 8
27.6%
2 5
17.2%
3 3
 
10.3%
0 3
 
10.3%
5 2
 
6.9%
4 2
 
6.9%
1
 
3.4%
1
 
3.4%
1
 
3.4%
7 1
 
3.4%
Other values (2) 2
 
6.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 26
89.7%
Other Letter 3
 
10.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 8
30.8%
2 5
19.2%
3 3
 
11.5%
0 3
 
11.5%
5 2
 
7.7%
4 2
 
7.7%
7 1
 
3.8%
8 1
 
3.8%
9 1
 
3.8%
Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common 26
89.7%
Hangul 3
 
10.3%

Most frequent character per script

Common
ValueCountFrequency (%)
1 8
30.8%
2 5
19.2%
3 3
 
11.5%
0 3
 
11.5%
5 2
 
7.7%
4 2
 
7.7%
7 1
 
3.8%
8 1
 
3.8%
9 1
 
3.8%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 26
89.7%
Hangul 3
 
10.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 8
30.8%
2 5
19.2%
3 3
 
11.5%
0 3
 
11.5%
5 2
 
7.7%
4 2
 
7.7%
7 1
 
3.8%
8 1
 
3.8%
9 1
 
3.8%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Correlations

2024-03-14T11:40:09.441120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가축전염병 발생현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6
가축전염병 발생현황1.0000.4580.9780.9010.5670.9230.649
Unnamed: 10.4581.0000.7800.8720.0000.9180.878
Unnamed: 20.9780.7801.0000.7810.8880.7810.911
Unnamed: 30.9010.8720.7811.0001.0000.9731.000
Unnamed: 40.5670.0000.8881.0001.0000.8530.907
Unnamed: 50.9230.9180.7810.9730.8531.0001.000
Unnamed: 60.6490.8780.9111.0000.9071.0001.000
2024-03-14T11:40:09.540583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 5Unnamed: 2Unnamed: 1Unnamed: 3
Unnamed: 51.0000.7350.7430.796
Unnamed: 20.7351.0000.5360.735
Unnamed: 10.7430.5361.0000.678
Unnamed: 30.7960.7350.6781.000
2024-03-14T11:40:09.670390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 5
Unnamed: 11.0000.5360.6780.743
Unnamed: 20.5361.0000.7350.735
Unnamed: 30.6780.7351.0000.796
Unnamed: 50.7430.7350.7961.000

Missing values

2024-03-14T11:40:07.212195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T11:40:07.326102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T11:40:07.419095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

가축전염병 발생현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6
0구분돼지사슴오리총합계
12013<NA><NA><NA><NA><NA><NA>
2가금티프스12----12
3결핵병--150-51
4돼지생식기호흡기증후군-1---1
5브루셀라병---24-24
6추백리1----1
72014<NA><NA><NA><NA><NA><NA>
8가금티프스2----2
9결핵병---3-3
가축전염병 발생현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6
11브루셀라병---2-2
12총합계1611796103
132015<NA><NA><NA><NA><NA><NA>
14가금티프스20----20
15결핵병---48-48
16브루셀라병---1-1
17기종저---5-5
18고병원성 조류인플루엔자-----<NA>
19돼지유행성설사-13-<NA>-13
20총합계361411336190