Overview

Dataset statistics

Number of variables6
Number of observations1004
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory48.2 KiB
Average record size in memory49.1 B

Variable types

Text2
Categorical3
Numeric1

Dataset

Description충청남도 논산시 축산농가 현황 데이터로 농장명, 축종, 사육두수, 사육규모, 행정구역, 소재지 정보를 제공하고 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=389&beforeMenuCd=DOM_000000201001001000&publicdatapk=15034227

Alerts

사육두수 is highly overall correlated with 사육규모High correlation
사육규모 is highly overall correlated with 사육두수High correlation
축종 is highly imbalanced (55.3%)Imbalance
사육규모 is highly imbalanced (57.5%)Imbalance

Reproduction

Analysis started2024-01-09 21:14:51.951416
Analysis finished2024-01-09 21:14:52.512060
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct494
Distinct (%)49.2%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
2024-01-10T06:14:52.673099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length3.4631474
Min length2

Characters and Unicode

Total characters3477
Distinct characters281
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique467 ?
Unique (%)46.5%

Sample

1st row임리목장
2nd row태영농장
3rd row농장
4th row농장
5th row농장
ValueCountFrequency (%)
농장 487
45.1%
축사 33
 
3.1%
농업회사법인 5
 
0.5%
청운농장 4
 
0.4%
대성농장 4
 
0.4%
하나농장 4
 
0.4%
대광농장 4
 
0.4%
한우 4
 
0.4%
한솔농장 3
 
0.3%
대인농장 3
 
0.3%
Other values (500) 529
49.0%
2024-01-10T06:14:53.034719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
911
26.2%
870
25.0%
76
 
2.2%
67
 
1.9%
67
 
1.9%
52
 
1.5%
40
 
1.2%
38
 
1.1%
34
 
1.0%
33
 
0.9%
Other values (271) 1289
37.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3349
96.3%
Space Separator 76
 
2.2%
Decimal Number 23
 
0.7%
Close Punctuation 9
 
0.3%
Open Punctuation 9
 
0.3%
Uppercase Letter 7
 
0.2%
Other Punctuation 2
 
0.1%
Letter Number 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
911
27.2%
870
26.0%
67
 
2.0%
67
 
2.0%
52
 
1.6%
40
 
1.2%
38
 
1.1%
34
 
1.0%
33
 
1.0%
27
 
0.8%
Other values (251) 1210
36.1%
Decimal Number
ValueCountFrequency (%)
2 14
60.9%
1 3
 
13.0%
3 2
 
8.7%
6 1
 
4.3%
5 1
 
4.3%
4 1
 
4.3%
7 1
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
B 1
14.3%
R 1
14.3%
A 1
14.3%
F 1
14.3%
G 1
14.3%
E 1
14.3%
M 1
14.3%
Space Separator
ValueCountFrequency (%)
76
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3349
96.3%
Common 120
 
3.5%
Latin 8
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
911
27.2%
870
26.0%
67
 
2.0%
67
 
2.0%
52
 
1.6%
40
 
1.2%
38
 
1.1%
34
 
1.0%
33
 
1.0%
27
 
0.8%
Other values (251) 1210
36.1%
Common
ValueCountFrequency (%)
76
63.3%
2 14
 
11.7%
) 9
 
7.5%
( 9
 
7.5%
1 3
 
2.5%
3 2
 
1.7%
. 2
 
1.7%
6 1
 
0.8%
5 1
 
0.8%
- 1
 
0.8%
Other values (2) 2
 
1.7%
Latin
ValueCountFrequency (%)
B 1
12.5%
1
12.5%
R 1
12.5%
A 1
12.5%
F 1
12.5%
G 1
12.5%
E 1
12.5%
M 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3349
96.3%
ASCII 127
 
3.7%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
911
27.2%
870
26.0%
67
 
2.0%
67
 
2.0%
52
 
1.6%
40
 
1.2%
38
 
1.1%
34
 
1.0%
33
 
1.0%
27
 
0.8%
Other values (251) 1210
36.1%
ASCII
ValueCountFrequency (%)
76
59.8%
2 14
 
11.0%
) 9
 
7.1%
( 9
 
7.1%
1 3
 
2.4%
3 2
 
1.6%
. 2
 
1.6%
6 1
 
0.8%
B 1
 
0.8%
5 1
 
0.8%
Other values (9) 9
 
7.1%
Number Forms
ValueCountFrequency (%)
1
100.0%

축종
Categorical

IMBALANCE 

Distinct7
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
한우
767 
돼지
124 
젖소
 
41
염소
 
31
산양
 
26
Other values (2)
 
15

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사슴
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 767
76.4%
돼지 124
 
12.4%
젖소 41
 
4.1%
염소 31
 
3.1%
산양 26
 
2.6%
사슴 9
 
0.9%
육우 6
 
0.6%

Length

2024-01-10T06:14:53.149022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:14:53.238035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한우 767
76.4%
돼지 124
 
12.4%
젖소 41
 
4.1%
염소 31
 
3.1%
산양 26
 
2.6%
사슴 9
 
0.9%
육우 6
 
0.6%

사육두수
Real number (ℝ)

HIGH CORRELATION 

Distinct179
Distinct (%)17.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean221.82669
Minimum1
Maximum7000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.0 KiB
2024-01-10T06:14:53.349750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q110
median24
Q370
95-th percentile1700
Maximum7000
Range6999
Interquartile range (IQR)60

Descriptive statistics

Standard deviation625.57399
Coefficient of variation (CV)2.8201024
Kurtosis24.881951
Mean221.82669
Median Absolute Deviation (MAD)19
Skewness4.4148437
Sum222714
Variance391342.82
MonotonicityIncreasing
2024-01-10T06:14:53.469373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5 52
 
5.2%
10 49
 
4.9%
20 39
 
3.9%
50 35
 
3.5%
3 29
 
2.9%
2 29
 
2.9%
7 28
 
2.8%
4 27
 
2.7%
30 26
 
2.6%
6 25
 
2.5%
Other values (169) 665
66.2%
ValueCountFrequency (%)
1 16
 
1.6%
2 29
2.9%
3 29
2.9%
4 27
2.7%
5 52
5.2%
6 25
2.5%
7 28
2.8%
8 19
 
1.9%
9 14
 
1.4%
10 49
4.9%
ValueCountFrequency (%)
7000 1
 
0.1%
4000 3
 
0.3%
3600 2
 
0.2%
3400 1
 
0.1%
3300 2
 
0.2%
3000 6
0.6%
2865 1
 
0.1%
2800 1
 
0.1%
2600 1
 
0.1%
2500 9
0.9%

사육규모
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct7
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
100마리 미만
787 
100마리 ~ 500마리 미만
108 
500마리 ~ 1,000마리 미만
 
36
1,000마리 ~ 2,000마리 미만
 
31
2,000마리 ~ 3,000마리 미만
 
27
Other values (2)
 
15

Length

Max length20
Median length8
Mean length10.051793
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row100마리 미만
2nd row100마리 미만
3rd row100마리 미만
4th row100마리 미만
5th row100마리 미만

Common Values

ValueCountFrequency (%)
100마리 미만 787
78.4%
100마리 ~ 500마리 미만 108
 
10.8%
500마리 ~ 1,000마리 미만 36
 
3.6%
1,000마리 ~ 2,000마리 미만 31
 
3.1%
2,000마리 ~ 3,000마리 미만 27
 
2.7%
3,000마리 ~ 4,000마리 미만 11
 
1.1%
4,000마리 이상 4
 
0.4%

Length

2024-01-10T06:14:53.583243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:14:53.691264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미만 1000
41.1%
100마리 895
36.8%
213
 
8.8%
500마리 144
 
5.9%
1,000마리 67
 
2.8%
2,000마리 58
 
2.4%
3,000마리 38
 
1.6%
4,000마리 15
 
0.6%
이상 4
 
0.2%

행정구역
Categorical

Distinct15
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
광석면
167 
연무읍
155 
양촌면
106 
연산면
88 
성동면
86 
Other values (10)
402 

Length

Max length4
Median length3
Mean length3.0766932
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row연산면
2nd row상월면
3rd row노성면
4th row노성면
5th row은진면

Common Values

ValueCountFrequency (%)
광석면 167
16.6%
연무읍 155
15.4%
양촌면 106
10.6%
연산면 88
8.8%
성동면 86
8.6%
가야곡면 77
7.7%
부적면 75
7.5%
노성면 74
7.4%
상월면 61
 
6.1%
은진면 38
 
3.8%
Other values (5) 77
7.7%

Length

2024-01-10T06:14:53.826814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
광석면 167
16.6%
연무읍 155
15.4%
양촌면 106
10.6%
연산면 88
8.8%
성동면 86
8.6%
가야곡면 77
7.7%
부적면 75
7.5%
노성면 74
7.4%
상월면 61
 
6.1%
은진면 38
 
3.8%
Other values (5) 77
7.7%
Distinct990
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size8.0 KiB
2024-01-10T06:14:54.108291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length35
Mean length25.224104
Min length13

Characters and Unicode

Total characters25325
Distinct characters150
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique976 ?
Unique (%)97.2%

Sample

1st row충청남도 논산시 연산면 임리 114번지 1
2nd row충청남도 논산시 상월면 대촌리 65번지 4
3rd row충청남도 논산시 노성면 암리 33번지 4
4th row충청남도 논산시 노성면 교촌리 81번지
5th row충청남도 논산시 은진면 방축리 679번지
ValueCountFrequency (%)
충청남도 1004
 
17.6%
논산시 1004
 
17.6%
1 178
 
3.1%
광석면 167
 
2.9%
연무읍 155
 
2.7%
2 114
 
2.0%
양촌면 106
 
1.9%
연산면 88
 
1.5%
성동면 86
 
1.5%
가야곡면 77
 
1.3%
Other values (768) 2731
47.8%
2024-01-10T06:14:54.711793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6559
25.9%
1205
 
4.8%
1041
 
4.1%
1029
 
4.1%
1021
 
4.0%
1012
 
4.0%
1011
 
4.0%
1011
 
4.0%
1004
 
4.0%
1001
 
4.0%
Other values (140) 9431
37.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15053
59.4%
Space Separator 6559
25.9%
Decimal Number 3704
 
14.6%
Other Punctuation 4
 
< 0.1%
Dash Punctuation 3
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1205
 
8.0%
1041
 
6.9%
1029
 
6.8%
1021
 
6.8%
1012
 
6.7%
1011
 
6.7%
1011
 
6.7%
1004
 
6.7%
1001
 
6.6%
991
 
6.6%
Other values (126) 4727
31.4%
Decimal Number
ValueCountFrequency (%)
1 744
20.1%
2 508
13.7%
3 487
13.1%
4 355
9.6%
6 330
8.9%
5 314
8.5%
7 267
 
7.2%
8 256
 
6.9%
0 252
 
6.8%
9 191
 
5.2%
Space Separator
ValueCountFrequency (%)
6559
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15053
59.4%
Common 10272
40.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1205
 
8.0%
1041
 
6.9%
1029
 
6.8%
1021
 
6.8%
1012
 
6.7%
1011
 
6.7%
1011
 
6.7%
1004
 
6.7%
1001
 
6.6%
991
 
6.6%
Other values (126) 4727
31.4%
Common
ValueCountFrequency (%)
6559
63.9%
1 744
 
7.2%
2 508
 
4.9%
3 487
 
4.7%
4 355
 
3.5%
6 330
 
3.2%
5 314
 
3.1%
7 267
 
2.6%
8 256
 
2.5%
0 252
 
2.5%
Other values (4) 200
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15053
59.4%
ASCII 10272
40.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6559
63.9%
1 744
 
7.2%
2 508
 
4.9%
3 487
 
4.7%
4 355
 
3.5%
6 330
 
3.2%
5 314
 
3.1%
7 267
 
2.6%
8 256
 
2.5%
0 252
 
2.5%
Other values (4) 200
 
1.9%
Hangul
ValueCountFrequency (%)
1205
 
8.0%
1041
 
6.9%
1029
 
6.8%
1021
 
6.8%
1012
 
6.7%
1011
 
6.7%
1011
 
6.7%
1004
 
6.7%
1001
 
6.6%
991
 
6.6%
Other values (126) 4727
31.4%

Interactions

2024-01-10T06:14:52.276800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:14:54.791832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축종사육두수사육규모행정구역
축종1.0000.7160.7710.333
사육두수0.7161.0000.9610.159
사육규모0.7710.9611.0000.240
행정구역0.3330.1590.2401.000
2024-01-10T06:14:54.874874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정구역사육규모축종
행정구역1.0000.1100.157
사육규모0.1101.0000.368
축종0.1570.3681.000
2024-01-10T06:14:54.951582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육두수축종사육규모행정구역
사육두수1.0000.3220.6970.072
축종0.3221.0000.3680.157
사육규모0.6970.3681.0000.110
행정구역0.0720.1570.1101.000

Missing values

2024-01-10T06:14:52.386633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:14:52.477025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

농장명축종사육두수사육규모행정구역소재지
0임리목장사슴1100마리 미만연산면충청남도 논산시 연산면 임리 114번지 1
1태영농장한우1100마리 미만상월면충청남도 논산시 상월면 대촌리 65번지 4
2농장한우1100마리 미만노성면충청남도 논산시 노성면 암리 33번지 4
3농장한우1100마리 미만노성면충청남도 논산시 노성면 교촌리 81번지
4농장한우1100마리 미만은진면충청남도 논산시 은진면 방축리 679번지
5농장한우1100마리 미만상월면충청남도 논산시 상월면 대명리 249번지 4
6농장한우1100마리 미만상월면충청남도 논산시 상월면 월오리 320번지 12
7농장한우1100마리 미만부적면충청남도 논산시 부적면 아리 530번지 4
8농장한우1100마리 미만채운면충청남도 논산시 채운면 화정리 298번지 1
9농장한우1100마리 미만노성면충청남도 논산시 노성면 교촌리 279번지 1
농장명축종사육두수사육규모행정구역소재지
994문성농원돼지30003,000마리 ~ 4,000마리 미만연무읍충청남도 논산시 연무읍 고내리 562번지 3
995현대농장돼지33003,000마리 ~ 4,000마리 미만연무읍충청남도 논산시 연무읍 양지리 21번지 4
996원농장돼지33003,000마리 ~ 4,000마리 미만연무읍충청남도 논산시 연무읍 마전리 1062번지 9 ,1062-17
997덕진 2 농장돼지34003,000마리 ~ 4,000마리 미만은진면충청남도 논산시 은진면 시묘리 57번지 1
998농업회사법인(주)팜스코바이오인티돼지36003,000마리 ~ 4,000마리 미만연무읍충청남도 논산시 연무읍 봉동리 1207번지 11
999석원농장돼지36003,000마리 ~ 4,000마리 미만은진면충청남도 논산시 은진면 토양리 515번지 2
1000대광농장돼지40004,000마리 이상광석면충청남도 논산시 광석면 중리 522번지 84 ,32
1001(주)에스더블유디에프돼지40004,000마리 이상가야곡면충청남도 논산시 가야곡면 등리 709번지 7
1002연합농장돼지40004,000마리 이상양촌면충청남도 논산시 양촌면 모촌리 142번지
1003사포농장돼지70004,000마리 이상연산면충청남도 논산시 연산면 사포리 194번지 3 , 194번지 11