Overview

Dataset statistics

Number of variables4
Number of observations1045
Missing cells8
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory33.8 KiB
Average record size in memory33.1 B

Variable types

Text2
Categorical1
Numeric1

Dataset

Description보성군 축산농가현황에 대한 데이터로 농장명, 주사육업종, 사업장소재지. 적정사육두수 데이터를 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15064365/fileData.do

Alerts

주사육업종 is highly imbalanced (74.2%)Imbalance

Reproduction

Analysis started2023-12-12 19:33:05.194678
Analysis finished2023-12-12 19:33:05.691208
Duration0.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct994
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
2023-12-13T04:33:05.887557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length5
Mean length4.8449761
Min length3

Characters and Unicode

Total characters5063
Distinct characters302
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique946 ?
Unique (%)90.5%

Sample

1st row화원농장
2nd row윤동원목장
3rd row덕림축산
4th row보성녹우영농법인
5th row수종목장
ValueCountFrequency (%)
농장 43
 
3.9%
축산 4
 
0.4%
형제농장 3
 
0.3%
승지농장 3
 
0.3%
늘푸른농장 3
 
0.3%
흑염소농장 3
 
0.3%
배태욱농장 2
 
0.2%
임병관농장 2
 
0.2%
신성농장 2
 
0.2%
봉황농장 2
 
0.2%
Other values (997) 1040
93.9%
2023-12-13T04:33:06.276683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
995
 
19.7%
948
 
18.7%
111
 
2.2%
94
 
1.9%
82
 
1.6%
68
 
1.3%
62
 
1.2%
60
 
1.2%
58
 
1.1%
54
 
1.1%
Other values (292) 2531
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4970
98.2%
Space Separator 62
 
1.2%
Decimal Number 19
 
0.4%
Open Punctuation 6
 
0.1%
Close Punctuation 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
995
20.0%
948
 
19.1%
111
 
2.2%
94
 
1.9%
82
 
1.6%
68
 
1.4%
60
 
1.2%
58
 
1.2%
54
 
1.1%
54
 
1.1%
Other values (285) 2446
49.2%
Decimal Number
ValueCountFrequency (%)
2 14
73.7%
3 2
 
10.5%
1 2
 
10.5%
4 1
 
5.3%
Space Separator
ValueCountFrequency (%)
62
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4970
98.2%
Common 93
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
995
20.0%
948
 
19.1%
111
 
2.2%
94
 
1.9%
82
 
1.6%
68
 
1.4%
60
 
1.2%
58
 
1.2%
54
 
1.1%
54
 
1.1%
Other values (285) 2446
49.2%
Common
ValueCountFrequency (%)
62
66.7%
2 14
 
15.1%
( 6
 
6.5%
) 6
 
6.5%
3 2
 
2.2%
1 2
 
2.2%
4 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4970
98.2%
ASCII 93
 
1.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
995
20.0%
948
 
19.1%
111
 
2.2%
94
 
1.9%
82
 
1.6%
68
 
1.4%
60
 
1.2%
58
 
1.2%
54
 
1.1%
54
 
1.1%
Other values (285) 2446
49.2%
ASCII
ValueCountFrequency (%)
62
66.7%
2 14
 
15.1%
( 6
 
6.5%
) 6
 
6.5%
3 2
 
2.2%
1 2
 
2.2%
4 1
 
1.1%

주사육업종
Categorical

IMBALANCE 

Distinct13
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
한우
907 
산양
 
30
젖소
 
26
염소
 
21
오리
 
19
Other values (8)
 
42

Length

Max length6
Median length2
Mean length2.0401914
Min length2

Unique

Unique4 ?
Unique (%)0.4%

Sample

1st row돼지
2nd row한우
3rd row돼지
4th row한우
5th row젖소

Common Values

ValueCountFrequency (%)
한우 907
86.8%
산양 30
 
2.9%
젖소 26
 
2.5%
염소 21
 
2.0%
오리 19
 
1.8%
돼지 16
 
1.5%
종계/산란계 10
 
1.0%
육계 9
 
0.9%
사슴 3
 
0.3%
타조 1
 
0.1%
Other values (3) 3
 
0.3%

Length

2023-12-13T04:33:06.412360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 907
86.8%
산양 30
 
2.9%
젖소 26
 
2.5%
염소 21
 
2.0%
오리 19
 
1.8%
돼지 16
 
1.5%
종계/산란계 10
 
1.0%
육계 9
 
0.9%
사슴 3
 
0.3%
타조 1
 
0.1%
Other values (3) 3
 
0.3%
Distinct114
Distinct (%)11.0%
Missing8
Missing (%)0.8%
Memory size8.3 KiB
2023-12-13T04:33:06.684101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length16
Min length16

Characters and Unicode

Total characters16592
Distinct characters113
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)0.6%

Sample

1st row전라남도 보성군 미력면 용정리
2nd row전라남도 보성군 복내면 장천리
3rd row전라남도 보성군 웅치면 용반리
4th row전라남도 보성군 보성읍 쾌상리
5th row전라남도 보성군 보성읍 봉산리
ValueCountFrequency (%)
전라남도 1037
25.0%
보성군 1037
25.0%
보성읍 161
 
3.9%
벌교읍 158
 
3.8%
득량면 125
 
3.0%
조성면 107
 
2.6%
겸백면 96
 
2.3%
웅치면 81
 
2.0%
복내면 78
 
1.9%
미력면 69
 
1.7%
Other values (115) 1199
28.9%
2023-12-13T04:33:07.055051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3111
18.8%
1321
 
8.0%
1203
 
7.3%
1091
 
6.6%
1065
 
6.4%
1056
 
6.4%
1041
 
6.3%
1037
 
6.2%
1037
 
6.2%
718
 
4.3%
Other values (103) 3912
23.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13481
81.2%
Space Separator 3111
 
18.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1321
 
9.8%
1203
 
8.9%
1091
 
8.1%
1065
 
7.9%
1056
 
7.8%
1041
 
7.7%
1037
 
7.7%
1037
 
7.7%
718
 
5.3%
334
 
2.5%
Other values (102) 3578
26.5%
Space Separator
ValueCountFrequency (%)
3111
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13481
81.2%
Common 3111
 
18.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1321
 
9.8%
1203
 
8.9%
1091
 
8.1%
1065
 
7.9%
1056
 
7.8%
1041
 
7.7%
1037
 
7.7%
1037
 
7.7%
718
 
5.3%
334
 
2.5%
Other values (102) 3578
26.5%
Common
ValueCountFrequency (%)
3111
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13481
81.2%
ASCII 3111
 
18.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3111
100.0%
Hangul
ValueCountFrequency (%)
1321
 
9.8%
1203
 
8.9%
1091
 
8.1%
1065
 
7.9%
1056
 
7.8%
1041
 
7.7%
1037
 
7.7%
1037
 
7.7%
718
 
5.3%
334
 
2.5%
Other values (102) 3578
26.5%

적정사육두수
Real number (ℝ)

Distinct520
Distinct (%)49.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean62.472057
Minimum0.7
Maximum1210.3
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.3 KiB
2023-12-13T04:33:07.187611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.7
5-th percentile2
Q18
median26
Q375
95-th percentile230.68
Maximum1210.3
Range1209.6
Interquartile range (IQR)67

Descriptive statistics

Standard deviation100.95698
Coefficient of variation (CV)1.6160341
Kurtosis29.466276
Mean62.472057
Median Absolute Deviation (MAD)21.8
Skewness4.3841101
Sum65283.3
Variance10192.311
MonotonicityNot monotonic
2023-12-13T04:33:07.549405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5.0 19
 
1.8%
6.0 13
 
1.2%
16.0 13
 
1.2%
9.6 12
 
1.1%
20.0 12
 
1.1%
19.2 11
 
1.1%
3.2 11
 
1.1%
60.0 10
 
1.0%
4.5 9
 
0.9%
4.0 8
 
0.8%
Other values (510) 927
88.7%
ValueCountFrequency (%)
0.7 1
 
0.1%
0.8 2
 
0.2%
0.9 4
0.4%
1.0 3
 
0.3%
1.1 2
 
0.2%
1.2 5
0.5%
1.3 3
 
0.3%
1.4 5
0.5%
1.5 8
0.8%
1.6 7
0.7%
ValueCountFrequency (%)
1210.3 1
0.1%
840.8 1
0.1%
760.3 1
0.1%
746.0 1
0.1%
744.5 1
0.1%
660.8 1
0.1%
594.1 1
0.1%
577.7 1
0.1%
569.3 1
0.1%
507.4 1
0.1%

Interactions

2023-12-13T04:33:05.446098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:33:07.619446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주사육업종적정사육두수
주사육업종1.0000.652
적정사육두수0.6521.000
2023-12-13T04:33:07.694757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
적정사육두수주사육업종
적정사육두수1.0000.365
주사육업종0.3651.000

Missing values

2023-12-13T04:33:05.566285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:33:05.655208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명칭주사육업종사업장소재지적정사육두수
0화원농장돼지전라남도 보성군 미력면 용정리302.5
1윤동원목장한우전라남도 보성군 복내면 장천리35.4
2덕림축산돼지전라남도 보성군 웅치면 용반리247.8
3보성녹우영농법인한우전라남도 보성군 보성읍 쾌상리357.2
4수종목장젖소전라남도 보성군 보성읍 봉산리232.1
5현정목장한우전라남도 보성군 복내면 동교리43.7
6내판농장한우전라남도 보성군 복내면 동교리47.4
7승준농장(4)돼지전라남도 보성군 보성읍 쾌상리60.8
8명진농장돼지전라남도 보성군 조성면 대곡리117.5
9낙안축산육계전라남도 보성군 조성면 은곡리123.2
사업장명칭주사육업종사업장소재지적정사육두수
1035송태연 농장한우전라남도 보성군 율어면 금천리65.4
1036낙원농장육용오리전라남도 보성군 겸백면 수남리507.4
1037망골농장한우전라남도 보성군 보성읍 쾌상리108.0
1038형제농장한우전라남도 보성군 조성면 용전리120.0
1039천우농장한우전라남도 보성군 노동면 거석리80.0
1040정병준농장한우전라남도 보성군 미력면 덕림리100.8
1041정재원농장한우전라남도 보성군 미력면 덕림리110.0
1042안세환농장한우전라남도 보성군 웅치면 봉산리57.6
1043양지은농장한우전라남도 보성군 보성읍 쾌상리187.2
1044하늘소 농장한우전라남도 보성군 웅치면 강산리110.0