Overview

Dataset statistics

Number of variables4
Number of observations1131
Missing cells0
Missing cells (%)0.0%
Duplicate rows3
Duplicate rows (%)0.3%
Total size in memory36.6 KiB
Average record size in memory33.1 B

Variable types

Text1
Categorical2
Numeric1

Dataset

Description충청남도 서산시의 축산농가 현황으로 농장명, 주사육축종, 사육두수 등 축산농가에 대한 전반적인 정보를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=432&beforeMenuCd=DOM_000000201001001000&publicdatapk=15034929

Alerts

데이터 기준일자 has constant value ""Constant
Dataset has 3 (0.3%) duplicate rowsDuplicates
주사육업종 is highly imbalanced (68.1%)Imbalance
사육두수 has 15 (1.3%) zerosZeros

Reproduction

Analysis started2024-01-09 20:08:38.132199
Analysis finished2024-01-09 20:08:38.674807
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct973
Distinct (%)86.0%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2024-01-10T05:08:38.909939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length4
Mean length4.2608311
Min length2

Characters and Unicode

Total characters4819
Distinct characters327
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique873 ?
Unique (%)77.2%

Sample

1st row단성목장
2nd row예삼목장
3rd row진복목장
4th row우미관농장
5th row부장한우
ValueCountFrequency (%)
농장 24
 
2.0%
영락농원 8
 
0.7%
한우농장 7
 
0.6%
애정농장 6
 
0.5%
지산농장 5
 
0.4%
갈산농장 5
 
0.4%
목장 5
 
0.4%
와우농장 5
 
0.4%
용암농장 5
 
0.4%
장요농장 4
 
0.3%
Other values (982) 1112
93.8%
2024-01-10T05:08:39.380990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1027
21.3%
812
 
16.8%
232
 
4.8%
102
 
2.1%
73
 
1.5%
70
 
1.5%
65
 
1.3%
63
 
1.3%
62
 
1.3%
61
 
1.3%
Other values (317) 2252
46.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4736
98.3%
Space Separator 55
 
1.1%
Decimal Number 11
 
0.2%
Dash Punctuation 8
 
0.2%
Close Punctuation 4
 
0.1%
Open Punctuation 4
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1027
21.7%
812
 
17.1%
232
 
4.9%
102
 
2.2%
73
 
1.5%
70
 
1.5%
65
 
1.4%
63
 
1.3%
62
 
1.3%
61
 
1.3%
Other values (304) 2169
45.8%
Decimal Number
ValueCountFrequency (%)
1 4
36.4%
0 1
 
9.1%
5 1
 
9.1%
4 1
 
9.1%
2 1
 
9.1%
9 1
 
9.1%
6 1
 
9.1%
3 1
 
9.1%
Space Separator
ValueCountFrequency (%)
55
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4736
98.3%
Common 83
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1027
21.7%
812
 
17.1%
232
 
4.9%
102
 
2.2%
73
 
1.5%
70
 
1.5%
65
 
1.4%
63
 
1.3%
62
 
1.3%
61
 
1.3%
Other values (304) 2169
45.8%
Common
ValueCountFrequency (%)
55
66.3%
- 8
 
9.6%
1 4
 
4.8%
) 4
 
4.8%
( 4
 
4.8%
, 1
 
1.2%
0 1
 
1.2%
5 1
 
1.2%
4 1
 
1.2%
2 1
 
1.2%
Other values (3) 3
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4736
98.3%
ASCII 83
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1027
21.7%
812
 
17.1%
232
 
4.9%
102
 
2.2%
73
 
1.5%
70
 
1.5%
65
 
1.4%
63
 
1.3%
62
 
1.3%
61
 
1.3%
Other values (304) 2169
45.8%
ASCII
ValueCountFrequency (%)
55
66.3%
- 8
 
9.6%
1 4
 
4.8%
) 4
 
4.8%
( 4
 
4.8%
, 1
 
1.2%
0 1
 
1.2%
5 1
 
1.2%
4 1
 
1.2%
2 1
 
1.2%
Other values (3) 3
 
3.6%

주사육업종
Categorical

IMBALANCE 

Distinct12
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
한우
929 
젖소
 
53
돼지
 
53
육계
 
46
염소
 
17
Other values (7)
 
33

Length

Max length3
Median length2
Mean length2.0123784
Min length2

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row젖소
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 929
82.1%
젖소 53
 
4.7%
돼지 53
 
4.7%
육계 46
 
4.1%
염소 17
 
1.5%
산란계 13
 
1.1%
사슴 7
 
0.6%
육우 6
 
0.5%
산양 3
 
0.3%
면양 2
 
0.2%
Other values (2) 2
 
0.2%

Length

2024-01-10T05:08:39.555956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 929
82.1%
젖소 53
 
4.7%
돼지 53
 
4.7%
육계 46
 
4.1%
염소 17
 
1.5%
산란계 13
 
1.1%
사슴 7
 
0.6%
육우 6
 
0.5%
산양 3
 
0.3%
면양 2
 
0.2%
Other values (2) 2
 
0.2%

사육두수
Real number (ℝ)

ZEROS 

Distinct159
Distinct (%)14.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1706.0698
Minimum0
Maximum150000
Zeros15
Zeros (%)1.3%
Negative0
Negative (%)0.0%
Memory size10.1 KiB
2024-01-10T05:08:39.725347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q110
median20
Q342
95-th percentile1550
Maximum150000
Range150000
Interquartile range (IQR)32

Descriptive statistics

Standard deviation9648.952
Coefficient of variation (CV)5.6556606
Kurtosis85.506853
Mean1706.0698
Median Absolute Deviation (MAD)13
Skewness8.1152964
Sum1929565
Variance93102274
MonotonicityNot monotonic
2024-01-10T05:08:39.909544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20 69
 
6.1%
10 61
 
5.4%
15 50
 
4.4%
5 37
 
3.3%
30 37
 
3.3%
40 35
 
3.1%
4 31
 
2.7%
6 31
 
2.7%
12 30
 
2.7%
11 29
 
2.6%
Other values (149) 721
63.7%
ValueCountFrequency (%)
0 15
1.3%
1 7
 
0.6%
2 28
2.5%
3 29
2.6%
4 31
2.7%
5 37
3.3%
6 31
2.7%
7 19
1.7%
8 27
2.4%
9 23
2.0%
ValueCountFrequency (%)
150000 1
 
0.1%
120000 1
 
0.1%
80000 1
 
0.1%
75000 1
 
0.1%
70000 1
 
0.1%
60000 2
0.2%
54000 1
 
0.1%
50030 1
 
0.1%
50000 4
0.4%
45000 2
0.2%

데이터 기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2021-09-29
1131 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-09-29
2nd row2021-09-29
3rd row2021-09-29
4th row2021-09-29
5th row2021-09-29

Common Values

ValueCountFrequency (%)
2021-09-29 1131
100.0%

Length

2024-01-10T05:08:40.088679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:08:40.200780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-09-29 1131
100.0%

Interactions

2024-01-10T05:08:38.364256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:08:40.288023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주사육업종사육두수
주사육업종1.0000.794
사육두수0.7941.000
2024-01-10T05:08:40.787166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육두수주사육업종
사육두수1.0000.486
주사육업종0.4861.000

Missing values

2024-01-10T05:08:38.524153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:08:38.625809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

농장명주사육업종사육두수데이터 기준일자
0단성목장젖소1302021-09-29
1예삼목장한우292021-09-29
2진복목장한우202021-09-29
3우미관농장한우642021-09-29
4부장한우한우502021-09-29
5한양농장한우562021-09-29
6지헤농장육계280002021-09-29
7지산농장한우272021-09-29
8강당농장육계500002021-09-29
9창리양계농장육계300002021-09-29
농장명주사육업종사육두수데이터 기준일자
1121소유목장염소602021-09-29
1122광명사슴농장사슴22021-09-29
1123인수농장한우3992021-09-29
1124가야농장산양342021-09-29
1125원벌농장한우202021-09-29
1126관유농장돼지362021-09-29
1127초원농장한우32021-09-29
1128황대헌산양492021-09-29
1129우리숲산란계350002021-09-29
1130대륜농장염소302021-09-29

Duplicate rows

Most frequently occurring

농장명주사육업종사육두수데이터 기준일자# duplicates
0갈산농장한우402021-09-292
1운산목장한우402021-09-292
2원벌농장한우212021-09-292