Overview

Dataset statistics

Number of variables3
Number of observations1366
Missing cells0
Missing cells (%)0.0%
Duplicate rows194
Duplicate rows (%)14.2%
Total size in memory32.1 KiB
Average record size in memory24.1 B

Variable types

Text1
Categorical2

Dataset

Description전라남도 영암군 축산업 현황에 관한 데이터로 대표자명, 사육축종(돼지,한우,젖소 등), 사업장소재지의 정보를 제공하고 있습니다.
Author전라남도 영암군
URLhttps://www.data.go.kr/data/15006766/fileData.do

Alerts

Dataset has 194 (14.2%) duplicate rowsDuplicates
사육축종 is highly imbalanced (65.6%)Imbalance

Reproduction

Analysis started2024-04-21 01:00:04.770976
Analysis finished2024-04-21 01:00:06.126750
Duration1.36 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct67
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
2024-04-21T10:00:06.237244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters4098
Distinct characters68
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)1.2%

Sample

1st row김○○
2nd row이○○
3rd row성○○
4th row강○○
5th row서○○
ValueCountFrequency (%)
김○○ 301
22.0%
이○○ 173
12.7%
박○○ 147
 
10.8%
최○○ 97
 
7.1%
조○○ 52
 
3.8%
정○○ 51
 
3.7%
강○○ 50
 
3.7%
임○○ 32
 
2.3%
서○○ 31
 
2.3%
양○○ 30
 
2.2%
Other values (57) 402
29.4%
2024-04-21T10:00:06.526683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2732
66.7%
301
 
7.3%
173
 
4.2%
147
 
3.6%
97
 
2.4%
52
 
1.3%
51
 
1.2%
50
 
1.2%
32
 
0.8%
31
 
0.8%
Other values (58) 432
 
10.5%

Most occurring categories

ValueCountFrequency (%)
Other Symbol 2732
66.7%
Other Letter 1366
33.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
301
22.0%
173
12.7%
147
 
10.8%
97
 
7.1%
52
 
3.8%
51
 
3.7%
50
 
3.7%
32
 
2.3%
31
 
2.3%
30
 
2.2%
Other values (57) 402
29.4%
Other Symbol
ValueCountFrequency (%)
2732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2732
66.7%
Hangul 1366
33.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
301
22.0%
173
12.7%
147
 
10.8%
97
 
7.1%
52
 
3.8%
51
 
3.7%
50
 
3.7%
32
 
2.3%
31
 
2.3%
30
 
2.2%
Other values (57) 402
29.4%
Common
ValueCountFrequency (%)
2732
100.0%

Most occurring blocks

ValueCountFrequency (%)
Geometric Shapes 2732
66.7%
Hangul 1366
33.3%

Most frequent character per block

Geometric Shapes
ValueCountFrequency (%)
2732
100.0%
Hangul
ValueCountFrequency (%)
301
22.0%
173
12.7%
147
 
10.8%
97
 
7.1%
52
 
3.8%
51
 
3.7%
50
 
3.7%
32
 
2.3%
31
 
2.3%
30
 
2.2%
Other values (57) 402
29.4%

사육축종
Categorical

IMBALANCE 

Distinct13
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
한우
1087 
오리
 
79
육계
 
71
젖소
 
43
돼지
 
36
Other values (8)
 
50

Length

Max length6
Median length2
Mean length2.0204978
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row돼지
2nd row한우
3rd row돼지
4th row돼지
5th row종계/산란계

Common Values

ValueCountFrequency (%)
한우 1087
79.6%
오리 79
 
5.8%
육계 71
 
5.2%
젖소 43
 
3.1%
돼지 36
 
2.6%
염소 24
 
1.8%
육우 7
 
0.5%
산양 7
 
0.5%
종계/산란계 6
 
0.4%
사슴 2
 
0.1%
Other values (3) 4
 
0.3%

Length

2024-04-21T10:00:06.652910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 1087
79.6%
오리 79
 
5.8%
육계 71
 
5.2%
젖소 43
 
3.1%
돼지 36
 
2.6%
염소 24
 
1.8%
육우 7
 
0.5%
산양 7
 
0.5%
종계/산란계 6
 
0.4%
사슴 2
 
0.1%
Other values (3) 4
 
0.3%
Distinct12
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
도포면
195 
시종면
181 
서호면
160 
신북면
159 
덕진면
159 
Other values (7)
512 

Length

Max length3
Median length3
Mean length2.9985359
Min length1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row서호면
2nd row신북면
3rd row미암면
4th row미암면
5th row삼호읍

Common Values

ValueCountFrequency (%)
도포면 195
14.3%
시종면 181
13.3%
서호면 160
11.7%
신북면 159
11.6%
덕진면 159
11.6%
학산면 126
9.2%
군서면 122
8.9%
미암면 97
7.1%
영암읍 63
 
4.6%
삼호읍 59
 
4.3%
Other values (2) 45
 
3.3%

Length

2024-04-21T10:00:06.779701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
도포면 195
14.3%
시종면 181
13.3%
서호면 160
11.7%
신북면 159
11.6%
덕진면 159
11.6%
학산면 126
9.2%
군서면 122
8.9%
미암면 97
7.1%
영암읍 63
 
4.6%
삼호읍 59
 
4.3%
Other values (2) 45
 
3.3%

Correlations

2024-04-21T10:00:06.861659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대표자명사육축종사업장소재지
대표자명1.0000.3710.369
사육축종0.3711.0000.260
사업장소재지0.3690.2601.000
2024-04-21T10:00:06.939269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육축종사업장소재지
사육축종1.0000.102
사업장소재지0.1021.000
2024-04-21T10:00:07.008721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육축종사업장소재지
사육축종1.0000.102
사업장소재지0.1021.000

Missing values

2024-04-21T10:00:05.986715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T10:00:06.096395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

대표자명사육축종사업장소재지
0김○○돼지서호면
1이○○한우신북면
2성○○돼지미암면
3강○○돼지미암면
4서○○종계/산란계삼호읍
5김○○돼지미암면
6강○○돼지미암면
7이○○돼지학산면
8이○○종계/산란계삼호읍
9류○○육계신북면
대표자명사육축종사업장소재지
1356이○○한우서호면
1357김○○한우서호면
1358강○○한우시종면
1359박○○한우덕진면
1360이○○한우서호면
1361김○○염소영암읍
1362이○○한우도포면
1363정○○한우덕진면
1364이○○한우군서면
1365박○○한우덕진면

Duplicate rows

Most frequently occurring

대표자명사육축종사업장소재지# duplicates
34김○○한우도포면37
37김○○한우서호면34
41김○○한우학산면34
131이○○한우서호면31
38김○○한우시종면28
33김○○한우덕진면26
31김○○한우군서면25
35김○○한우미암면20
128이○○한우도포면17
39김○○한우신북면16