Overview

Dataset statistics

Number of variables: 4
Number of observations: 1096
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 4
Duplicate rows (%): 0.4%
Total size in memory: 35.4 KiB
Average record size in memory: 33.1 B

Variable types

Text: 1
Categorical: 1
Numeric: 1
DateTime: 1

Dataset

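Description: Status of livestock farms in Seosan-si, Chungcheongnam-do. Provides general information on each farm, including farm name, main livestock species raised, and number of animals raised.
Author: Chungcheongnam-do
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=432&beforeMenuCd=DOM_000000201001001000&publicdatapk=15034929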

Alerts

데이터 기준일자 (data reference date) has constant value "2022-10-28" (Constant)
Dataset has 4 (0.4%) duplicate rows (Duplicates)
주사육업종 (main livestock type) is highly imbalanced (73.8%) (Imbalance)
사육두수 (number of animals raised) has 12 (1.1%) zeros (Zeros)

Reproduction

Analysis started: 2024-01-09 20:08:41.328028
Analysis finished: 2024-01-09 20:08:41.875134
Duration: 0.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

농장명 (farm name)
Text

Distinct: 946
Distinct (%): 86.3%
Missing: 0
Missing (%): 0.0%
Memory size: 8.7 KiB

Length

Max length: 16
Median length: 4
Mean length: 4.25
Min length: 2

Characters and Unicode

Total characters: 4658
Distinct characters: 331
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
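As a rough illustration of how such character properties can be tallied, the sketch below counts Unicode general categories and a coarse script label for a handful of farm names taken from the report's sample rows. It uses only Python's standard `unicodedata` module. The helper names and the name-prefix script heuristic are assumptions made for this example, not the profiler's own implementation.

```python
import unicodedata
from collections import Counter

# Farm names copied from the report's sample rows (a small subset, not the full column).
names = ["단성목장", "예삼목장", "우미관농장", "부장한우", "한양농장", "샘이나 목장"]


def category_counts(values):
    """Count characters by Unicode general category, e.g. 'Lo' (Other Letter), 'Zs' (Space Separator)."""
    return Counter(unicodedata.category(ch) for text in values for ch in text)


def script_counts(values):
    """Coarse script split based on the character name prefix.

    Assumption for illustration: a character whose Unicode name starts with
    'HANGUL' is counted as Hangul, everything else as Common.
    """
    counts = Counter()
    for text in values:
        for ch in text:
            name = unicodedata.name(ch, "")
            counts["Hangul" if name.startswith("HANGUL") else "Common"] += 1
    return counts


print(category_counts(names))
print(script_counts(names))
```

On this subset almost every character is a Hangul letter, which matches the pattern the report shows for the full column (98.4% Other Letter / Hangul).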

Unique

Unique: 848
Unique (%): 77.4%

Sample

1st row: 단성목장
2nd row: 예삼목장
3rd row: 우미관농장
4th row: 부장한우
5th row: 한양농장
Value                 Count   Frequency (%)
농장                  31      2.7%
한우농장              8       0.7%
영락농원              6       0.5%
형제농장              5       0.4%
용암농장              5       0.4%
와우농장              5       0.4%
목장                  5       0.4%
애정농장              5       0.4%
대성목장              4       0.3%
갈산농장              4       0.3%
Other values (953)    1077    93.2%

Most occurring characters

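Value                 Count   Frequency (%)
[missing]             992     21.3%
[missing]             781     16.8%
[missing]             225     4.8%
[missing]             104     2.2%
[missing]             70      1.5%
[missing]             69      1.5%
[missing]             65      1.4%
[missing]             63      1.4%
[missing]             60      1.3%
(space)               59      1.3%
Other values (321)    2170    46.6%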

Most occurring categories

Value                 Count   Frequency (%)
Other Letter          4583    98.4%
Space Separator       59      1.3%
Decimal Number        7       0.2%
Dash Punctuation      5       0.1%
Open Punctuation      2       < 0.1%
Close Punctuation     2       < 0.1%

Most frequent character per category

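Other Letter
Value                 Count   Frequency (%)
[missing]             992     21.6%
[missing]             781     17.0%
[missing]             225     4.9%
[missing]             104     2.3%
[missing]             70      1.5%
[missing]             69      1.5%
[missing]             65      1.4%
[missing]             63      1.4%
[missing]             60      1.3%
[missing]             55      1.2%
Other values (311)    2099    45.8%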
Decimal Number
Value      Count   Frequency (%)
1          2       28.6%
2          1       14.3%
0          1       14.3%
6          1       14.3%
5          1       14.3%
4          1       14.3%
Space Separator
Value      Count   Frequency (%)
(space)    59      100.0%
Dash Punctuation
Value      Count   Frequency (%)
-          5       100.0%
Open Punctuation
Value      Count   Frequency (%)
(          2       100.0%
Close Punctuation
Value      Count   Frequency (%)
)          2       100.0%

Most occurring scripts

Value      Count   Frequency (%)
Hangul     4583    98.4%
Common     75      1.6%

Most frequent character per script

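Hangul
Value                 Count   Frequency (%)
[missing]             992     21.6%
[missing]             781     17.0%
[missing]             225     4.9%
[missing]             104     2.3%
[missing]             70      1.5%
[missing]             69      1.5%
[missing]             65      1.4%
[missing]             63      1.4%
[missing]             60      1.3%
[missing]             55      1.2%
Other values (311)    2099    45.8%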
Common
Value      Count   Frequency (%)
(space)    59      78.7%
-          5       6.7%
(          2       2.7%
)          2       2.7%
1          2       2.7%
2          1       1.3%
0          1       1.3%
6          1       1.3%
5          1       1.3%
4          1       1.3%

Most occurring blocks

Value      Count   Frequency (%)
Hangul     4583    98.4%
ASCII      75      1.6%

Most frequent character per block

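Hangul
Value                 Count   Frequency (%)
[missing]             992     21.6%
[missing]             781     17.0%
[missing]             225     4.9%
[missing]             104     2.3%
[missing]             70      1.5%
[missing]             69      1.5%
[missing]             65      1.4%
[missing]             63      1.4%
[missing]             60      1.3%
[missing]             55      1.2%
Other values (311)    2099    45.8%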
ASCII
Value      Count   Frequency (%)
(space)    59      78.7%
-          5       6.7%
(          2       2.7%
)          2       2.7%
1          2       2.7%
2          1       1.3%
0          1       1.3%
6          1       1.3%
5          1       1.3%
4          1       1.3%

주사육업종 (main livestock type)
Categorical

IMBALANCE 

Distinct: 15
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 8.7 KiB
Value                      Count
한우 (Hanwoo cattle)       932
젖소 (dairy cattle)        48
육계 (broiler chickens)    37
돼지 (pigs)                30
염소 (goats)               17
Other values (10)          32

Length

Max length: 6
Median length: 2
Mean length: 2.0465328
Min length: 2

Unique

Unique: 5
Unique (%): 0.5%

Sample

1st row: 젖소 (dairy cattle)
2nd row: 한우 (Hanwoo cattle)
3rd row: 한우
4th row: 한우
5th row: 한우

Common Values

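Value                                  Count   Frequency (%)
한우 (Hanwoo cattle)                   932     85.0%
젖소 (dairy cattle)                    48      4.4%
육계 (broiler chickens)                37      3.4%
돼지 (pigs)                            30      2.7%
염소 (goats)                           17      1.6%
종계/산란계 (breeder/layer chickens)   11      1.0%
사슴 (deer)                            8       0.7%
육우 (beef cattle)                     4       0.4%
산양 (dairy goats)                     2       0.2%
기러기 (geese)                         2       0.2%
Other values (5)                       5       0.5%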
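The 73.8% figure in the IMBALANCE alert can be reproduced from the counts above. The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum possible value, log2 of the number of classes. That definition is an assumption about how the profiler computes the score. It also assumes the five values grouped under "Other values (5)" occur once each, which is consistent with the reported Unique count of 5.

```python
import math

# Class counts from the Common Values table; the last five entries are the
# "Other values (5)" group, assumed here to have one occurrence each.
counts = [932, 48, 37, 30, 17, 11, 8, 4, 2, 2, 1, 1, 1, 1, 1]


def imbalance_score(class_counts):
    """Return 1 - H / log2(k), where H is Shannon entropy (bits) and k the class count.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    total = sum(class_counts)
    n_classes = len(class_counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum(
        (c / total) * math.log2(c / total) for c in class_counts if c > 0
    )
    return 1.0 - entropy / math.log2(n_classes)


score = imbalance_score(counts)
print(f"{score:.1%}")
```

Under these assumptions the result rounds to 73.8%, matching the alert. The score is driven mostly by 한우 accounting for 85.0% of the rows.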

Length

Histogram of lengths of the category

사육두수 (number of animals raised)
Real number (ℝ)

ZEROS 

Distinct: 175
Distinct (%): 16.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1614.6843
Minimum: 0
Maximum: 150000
Zeros: 12
Zeros (%): 1.1%
Negative: 0
Negative (%): 0.0%
Memory size: 9.8 KiB

Quantile statistics

Minimum: 0
5-th percentile: 3
Q1: 10
Median: 20
Q3: 45
95-th percentile: 1250
Maximum: 150000
Range: 150000
Interquartile range (IQR): 35

Descriptive statistics

Standard deviation: 9819.4075
Coefficient of variation (CV): 6.0813172
Kurtosis: 86.679685
Mean: 1614.6843
Median Absolute Deviation (MAD): 14
Skewness: 8.3651811
Sum: 1769694
Variance: 96420763
Monotonicity: Not monotonic
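Several of these statistics are derived from one another, so they can be cross-checked using only the numbers printed in the report. The sketch below is a plain arithmetic check with illustrative variable names. Because the inputs are the rounded values shown above, the results agree with the report only up to rounding.

```python
# Values copied from the report for 사육두수.
n = 1096                 # number of observations
mean = 1614.6843
std = 9819.4075
q1, q3 = 10, 45
minimum, maximum = 0, 150000

cv = std / mean                    # coefficient of variation
variance = std ** 2                # variance is the squared standard deviation
total = mean * n                   # sum of all values
iqr = q3 - q1                      # interquartile range
value_range = maximum - minimum    # range

print(f"CV       = {cv:.4f}")
print(f"Variance = {variance:.0f}")
print(f"Sum      = {total:.0f}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")
```

The mean (about 1615) is far above the median (20), and the CV is roughly 6. Both point to a heavily right-skewed distribution dominated by a few very large poultry farms, consistent with the reported skewness of about 8.4.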
Histogram with fixed size bins (bins=50)
Common values
Value                 Count   Frequency (%)
10                    59      5.4%
20                    59      5.4%
15                    45      4.1%
5                     44      4.0%
4                     37      3.4%
40                    35      3.2%
30                    31      2.8%
3                     29      2.6%
12                    26      2.4%
6                     26      2.4%
Other values (165)    705     64.3%
Minimum 10 values
Value      Count   Frequency (%)
0          12      1.1%
1          7       0.6%
2          25      2.3%
3          29      2.6%
4          37      3.4%
5          44      4.0%
6          26      2.4%
7          17      1.6%
8          24      2.2%
9          20      1.8%
Maximum 10 values
Value      Count   Frequency (%)
150000     1       0.1%
120000     1       0.1%
80000      1       0.1%
76000      1       0.1%
75000      1       0.1%
70000      2       0.2%
65000      1       0.1%
60000      1       0.1%
55000      1       0.1%
54000      1       0.1%

데이터 기준일자 (data reference date)
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 8.7 KiB
Minimum: 2022-10-28 00:00:00
Maximum: 2022-10-28 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

              주사육업종   사육두수
주사육업종    1.000        0.786
사육두수      0.786        1.000

              사육두수     주사육업종
사육두수      1.000        0.485
주사육업종    0.485        1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index   농장명          주사육업종   사육두수   데이터 기준일자
0       단성목장        젖소         130        2022-10-28
1       예삼목장        한우         29         2022-10-28
2       우미관농장      한우         64         2022-10-28
3       부장한우        한우         50         2022-10-28
4       한양농장        한우         56         2022-10-28
5       지산농장        한우         36         2022-10-28
6       강당농장        육계         38000      2022-10-28
7       창리양계농장    육계         30000      2022-10-28
8       되동목장        한우         27         2022-10-28
9       진장농장        육계         70000      2022-10-28

Last rows
Index   농장명          주사육업종   사육두수   데이터 기준일자
1086    철이농장        염소         4          2022-10-28
1087    태영축산        한우         85         2022-10-28
1088    강정목장        한우         36         2022-10-28
1089    도원농장        한우         14         2022-10-28
1090    심지농장        한우         12         2022-10-28
1091    채원목장        한우         5          2022-10-28
1092    연화농장        육계         12000      2022-10-28
1093    샘이나 목장     젖소         14         2022-10-28
1094    대룡농장        한우         173        2022-10-28
1095    산속농원        한우         10         2022-10-28

Duplicate rows

Most frequently occurring

Index   농장명      주사육업종   사육두수   데이터 기준일자   # duplicates
0       갈산농장    한우         40         2022-10-28        2
1       운산목장    한우         40         2022-10-28        2
2       원벌농장    한우         21         2022-10-28        2
3       현호농장    한우         10         2022-10-28        2
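One way to find such rows is to count identical records and keep those seen more than once. The sketch below does this with the standard library on a small hand-built list: the four duplicated records from the table above, each entered twice, plus two unrelated sample rows. The list and the helper name are illustrative, and this is not the profiler's own routine.

```python
from collections import Counter

# (농장명, 주사육업종, 사육두수, 데이터 기준일자)
# Illustrative subset: the four duplicated records appear twice each,
# followed by two non-duplicated rows from the report's sample.
rows = [
    ("갈산농장", "한우", 40, "2022-10-28"),
    ("갈산농장", "한우", 40, "2022-10-28"),
    ("운산목장", "한우", 40, "2022-10-28"),
    ("운산목장", "한우", 40, "2022-10-28"),
    ("원벌농장", "한우", 21, "2022-10-28"),
    ("원벌농장", "한우", 21, "2022-10-28"),
    ("현호농장", "한우", 10, "2022-10-28"),
    ("현호농장", "한우", 10, "2022-10-28"),
    ("단성목장", "젖소", 130, "2022-10-28"),
    ("예삼목장", "한우", 29, "2022-10-28"),
]


def duplicate_groups(records):
    """Map each record that occurs more than once to its number of occurrences."""
    counts = Counter(records)
    return {row: n for row, n in counts.items() if n > 1}


groups = duplicate_groups(rows)
extra_rows = sum(n - 1 for n in groups.values())  # rows beyond the first copy

for row, n in groups.items():
    print(row, n)

# Share of surplus rows relative to the 1096 observations in the full dataset.
print(f"{100 * extra_rows / 1096:.1f}%")
```

Four surplus rows out of 1096 observations is about 0.4%, in line with the figure in the dataset statistics.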