Overview

Dataset statistics

Number of variables6
Number of observations1118
Missing cells197
Missing cells (%)2.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory53.6 KiB
Average record size in memory49.1 B

Variable types

Text3
Categorical2
Numeric1

Dataset

Description충청남도 논산시 축산농가 현황 데이터로 농장명, 축종, 사육두수, 사육규모, 행정구역, 소재지 정보를 제공하고 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=389&beforeMenuCd=DOM_000000201001001000&publicdatapk=15034227

Alerts

축종 is highly imbalanced (50.8%)Imbalance
소재지도로명주소 has 197 (17.6%) missing valuesMissing
사육두수 is highly skewed (γ1 = 21.850528)Skewed

Reproduction

Analysis started2024-01-09 21:14:46.564917
Analysis finished2024-01-09 21:14:47.190788
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct523
Distinct (%)46.8%
Missing0
Missing (%)0.0%
Memory size8.9 KiB
2024-01-10T06:14:47.359244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length3.3506261
Min length2

Characters and Unicode

Total characters3746
Distinct characters304
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique486 ?
Unique (%)43.5%

Sample

1st row(주)도드람양돈서비스
2nd row(주)에스더블유디에프
3rd row(주)지산농원
4th row(주)친환경식품축산농업회사법인 직영점
5th row(주)해피팜스
ValueCountFrequency (%)
농장 563
48.2%
축사 14
 
1.2%
청운농장 4
 
0.3%
농업회사법인 4
 
0.3%
하나농장 4
 
0.3%
대광농장 3
 
0.3%
염소농장 3
 
0.3%
주식회사 3
 
0.3%
광석한우단지 3
 
0.3%
대인농장 3
 
0.3%
Other values (528) 564
48.3%
2024-01-10T06:14:47.691633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1047
27.9%
1004
26.8%
53
 
1.4%
50
 
1.3%
43
 
1.1%
43
 
1.1%
39
 
1.0%
39
 
1.0%
33
 
0.9%
32
 
0.9%
Other values (294) 1363
36.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3637
97.1%
Space Separator 50
 
1.3%
Decimal Number 24
 
0.6%
Uppercase Letter 12
 
0.3%
Open Punctuation 10
 
0.3%
Close Punctuation 10
 
0.3%
Other Punctuation 2
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1047
28.8%
1004
27.6%
53
 
1.5%
43
 
1.2%
43
 
1.2%
39
 
1.1%
39
 
1.1%
33
 
0.9%
32
 
0.9%
32
 
0.9%
Other values (274) 1272
35.0%
Uppercase Letter
ValueCountFrequency (%)
F 3
25.0%
A 2
16.7%
L 2
16.7%
E 1
 
8.3%
G 1
 
8.3%
R 1
 
8.3%
M 1
 
8.3%
B 1
 
8.3%
Decimal Number
ValueCountFrequency (%)
2 16
66.7%
1 3
 
12.5%
6 1
 
4.2%
7 1
 
4.2%
5 1
 
4.2%
4 1
 
4.2%
3 1
 
4.2%
Space Separator
ValueCountFrequency (%)
50
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3637
97.1%
Common 97
 
2.6%
Latin 12
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1047
28.8%
1004
27.6%
53
 
1.5%
43
 
1.2%
43
 
1.2%
39
 
1.1%
39
 
1.1%
33
 
0.9%
32
 
0.9%
32
 
0.9%
Other values (274) 1272
35.0%
Common
ValueCountFrequency (%)
50
51.5%
2 16
 
16.5%
( 10
 
10.3%
) 10
 
10.3%
1 3
 
3.1%
. 2
 
2.1%
6 1
 
1.0%
7 1
 
1.0%
5 1
 
1.0%
4 1
 
1.0%
Other values (2) 2
 
2.1%
Latin
ValueCountFrequency (%)
F 3
25.0%
A 2
16.7%
L 2
16.7%
E 1
 
8.3%
G 1
 
8.3%
R 1
 
8.3%
M 1
 
8.3%
B 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3637
97.1%
ASCII 109
 
2.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1047
28.8%
1004
27.6%
53
 
1.5%
43
 
1.2%
43
 
1.2%
39
 
1.1%
39
 
1.1%
33
 
0.9%
32
 
0.9%
32
 
0.9%
Other values (274) 1272
35.0%
ASCII
ValueCountFrequency (%)
50
45.9%
2 16
 
14.7%
( 10
 
9.2%
) 10
 
9.2%
F 3
 
2.8%
1 3
 
2.8%
. 2
 
1.8%
A 2
 
1.8%
L 2
 
1.8%
E 1
 
0.9%
Other values (10) 10
 
9.2%

축종
Categorical

IMBALANCE 

Distinct12
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size8.9 KiB
한우
743 
돼지
151 
육계
81 
젖소
 
41
염소
 
34
Other values (7)
 
68

Length

Max length3
Median length2
Mean length2.0125224
Min length2

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row돼지
2nd row돼지
3rd row산란계
4th row한우
5th row돼지

Common Values

ValueCountFrequency (%)
한우 743
66.5%
돼지 151
 
13.5%
육계 81
 
7.2%
젖소 41
 
3.7%
염소 34
 
3.0%
산양 31
 
2.8%
산란계 12
 
1.1%
사슴 10
 
0.9%
오리 7
 
0.6%
육우 6
 
0.5%
Other values (2) 2
 
0.2%

Length

2024-01-10T06:14:47.818618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 743
66.5%
돼지 151
 
13.5%
육계 81
 
7.2%
젖소 41
 
3.7%
염소 34
 
3.0%
산양 31
 
2.8%
산란계 12
 
1.1%
사슴 10
 
0.9%
오리 7
 
0.6%
육우 6
 
0.5%
Other values (2) 2
 
0.2%

사육두수
Real number (ℝ)

SKEWED 

Distinct192
Distinct (%)17.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4513.8462
Minimum0
Maximum900000
Zeros9
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size10.0 KiB
2024-01-10T06:14:47.937433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q110
median27
Q3124.5
95-th percentile30000
Maximum900000
Range900000
Interquartile range (IQR)114.5

Descriptive statistics

Standard deviation31413.181
Coefficient of variation (CV)6.9592936
Kurtosis597.25382
Mean4513.8462
Median Absolute Deviation (MAD)22
Skewness21.850528
Sum5046480
Variance9.8678792 × 108
MonotonicityNot monotonic
2024-01-10T06:14:48.057197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5 53
 
4.7%
10 44
 
3.9%
20 41
 
3.7%
50 34
 
3.0%
7 31
 
2.8%
2 30
 
2.7%
30 30
 
2.7%
3 28
 
2.5%
4 28
 
2.5%
25 26
 
2.3%
Other values (182) 773
69.1%
ValueCountFrequency (%)
0 9
 
0.8%
1 15
 
1.3%
2 30
2.7%
3 28
2.5%
4 28
2.5%
5 53
4.7%
6 24
2.1%
7 31
2.8%
8 20
 
1.8%
9 13
 
1.2%
ValueCountFrequency (%)
900000 1
 
0.1%
240000 1
 
0.1%
175000 1
 
0.1%
170000 1
 
0.1%
160000 2
0.2%
110000 2
0.2%
94000 1
 
0.1%
80000 1
 
0.1%
70000 3
0.3%
65000 1
 
0.1%

행정동
Categorical

Distinct15
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size8.9 KiB
연무읍
174 
광석면
170 
양촌면
103 
성동면
103 
연산면
99 
Other values (10)
469 

Length

Max length4
Median length3
Mean length3.078712
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양촌면
2nd row가야곡면
3rd row연산면
4th row가야곡면
5th row연산면

Common Values

ValueCountFrequency (%)
연무읍 174
15.6%
광석면 170
15.2%
양촌면 103
9.2%
성동면 103
9.2%
연산면 99
8.9%
가야곡면 88
7.9%
부적면 87
7.8%
노성면 81
7.2%
상월면 69
 
6.2%
은진면 49
 
4.4%
Other values (5) 95
8.5%

Length

2024-01-10T06:14:48.164777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
연무읍 174
15.6%
광석면 170
15.2%
양촌면 103
9.2%
성동면 103
9.2%
연산면 99
8.9%
가야곡면 88
7.9%
부적면 87
7.8%
노성면 81
7.2%
상월면 69
 
6.2%
은진면 49
 
4.4%
Other values (5) 95
8.5%
Distinct892
Distinct (%)96.9%
Missing197
Missing (%)17.6%
Memory size8.9 KiB
2024-01-10T06:14:48.411564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length27
Mean length23.212812
Min length15

Characters and Unicode

Total characters21379
Distinct characters159
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique864 ?
Unique (%)93.8%

Sample

1st row충청남도 논산시 양촌면 매죽헌로1369번길 3-96
2nd row충청남도 논산시 가야곡면 가야로 192-10
3rd row충청남도 논산시 연산면 화악2길 38-5
4th row충청남도 논산시 가야곡면 원앙로842번길 41-3
5th row충청남도 논산시 연산면 화악길 263-6
ValueCountFrequency (%)
충청남도 921
20.1%
논산시 921
20.1%
연무읍 144
 
3.1%
광석면 142
 
3.1%
양촌면 89
 
1.9%
성동면 86
 
1.9%
연산면 76
 
1.7%
부적면 74
 
1.6%
노성면 68
 
1.5%
가야곡면 66
 
1.4%
Other values (1101) 2006
43.7%
2024-01-10T06:14:48.805861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3672
 
17.2%
1076
 
5.0%
941
 
4.4%
932
 
4.4%
927
 
4.3%
923
 
4.3%
923
 
4.3%
921
 
4.3%
1 862
 
4.0%
758
 
3.5%
Other values (149) 9444
44.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12864
60.2%
Decimal Number 4253
 
19.9%
Space Separator 3672
 
17.2%
Dash Punctuation 590
 
2.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1076
 
8.4%
941
 
7.3%
932
 
7.2%
927
 
7.2%
923
 
7.2%
923
 
7.2%
921
 
7.2%
758
 
5.9%
631
 
4.9%
572
 
4.4%
Other values (137) 4260
33.1%
Decimal Number
ValueCountFrequency (%)
1 862
20.3%
2 588
13.8%
3 500
11.8%
4 429
10.1%
5 399
9.4%
7 334
 
7.9%
8 325
 
7.6%
6 310
 
7.3%
9 275
 
6.5%
0 231
 
5.4%
Space Separator
ValueCountFrequency (%)
3672
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 590
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12864
60.2%
Common 8515
39.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1076
 
8.4%
941
 
7.3%
932
 
7.2%
927
 
7.2%
923
 
7.2%
923
 
7.2%
921
 
7.2%
758
 
5.9%
631
 
4.9%
572
 
4.4%
Other values (137) 4260
33.1%
Common
ValueCountFrequency (%)
3672
43.1%
1 862
 
10.1%
- 590
 
6.9%
2 588
 
6.9%
3 500
 
5.9%
4 429
 
5.0%
5 399
 
4.7%
7 334
 
3.9%
8 325
 
3.8%
6 310
 
3.6%
Other values (2) 506
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12864
60.2%
ASCII 8515
39.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3672
43.1%
1 862
 
10.1%
- 590
 
6.9%
2 588
 
6.9%
3 500
 
5.9%
4 429
 
5.0%
5 399
 
4.7%
7 334
 
3.9%
8 325
 
3.8%
6 310
 
3.6%
Other values (2) 506
 
5.9%
Hangul
ValueCountFrequency (%)
1076
 
8.4%
941
 
7.3%
932
 
7.2%
927
 
7.2%
923
 
7.2%
923
 
7.2%
921
 
7.2%
758
 
5.9%
631
 
4.9%
572
 
4.4%
Other values (137) 4260
33.1%
Distinct1097
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size8.9 KiB
2024-01-10T06:14:49.126438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length24
Mean length21.357782
Min length15

Characters and Unicode

Total characters23878
Distinct characters139
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1077 ?
Unique (%)96.3%

Sample

1st row충청남도 논산시 양촌면 석서리 397-1
2nd row충청남도 논산시 가야곡면 등리 709-7
3rd row충청남도 논산시 연산면 화악리 307
4th row충청남도 논산시 가야곡면 두월리 44
5th row충청남도 논산시 연산면 화악리 587-2
ValueCountFrequency (%)
충청남도 1118
20.0%
논산시 1118
20.0%
연무읍 174
 
3.1%
광석면 170
 
3.0%
성동면 103
 
1.8%
양촌면 103
 
1.8%
연산면 99
 
1.8%
가야곡면 88
 
1.6%
부적면 87
 
1.6%
노성면 81
 
1.4%
Other values (1141) 2463
44.0%
2024-01-10T06:14:49.886881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4492
18.8%
1347
 
5.6%
1148
 
4.8%
1140
 
4.8%
1127
 
4.7%
1126
 
4.7%
1125
 
4.7%
1118
 
4.7%
1105
 
4.6%
924
 
3.9%
Other values (129) 9226
38.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14560
61.0%
Space Separator 4492
 
18.8%
Decimal Number 4078
 
17.1%
Dash Punctuation 748
 
3.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1347
 
9.3%
1148
 
7.9%
1140
 
7.8%
1127
 
7.7%
1126
 
7.7%
1125
 
7.7%
1118
 
7.7%
1105
 
7.6%
924
 
6.3%
282
 
1.9%
Other values (117) 4118
28.3%
Decimal Number
ValueCountFrequency (%)
1 800
19.6%
2 576
14.1%
3 541
13.3%
4 382
9.4%
5 351
8.6%
6 351
8.6%
7 304
 
7.5%
8 282
 
6.9%
0 270
 
6.6%
9 221
 
5.4%
Space Separator
ValueCountFrequency (%)
4492
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 748
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14560
61.0%
Common 9318
39.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1347
 
9.3%
1148
 
7.9%
1140
 
7.8%
1127
 
7.7%
1126
 
7.7%
1125
 
7.7%
1118
 
7.7%
1105
 
7.6%
924
 
6.3%
282
 
1.9%
Other values (117) 4118
28.3%
Common
ValueCountFrequency (%)
4492
48.2%
1 800
 
8.6%
- 748
 
8.0%
2 576
 
6.2%
3 541
 
5.8%
4 382
 
4.1%
5 351
 
3.8%
6 351
 
3.8%
7 304
 
3.3%
8 282
 
3.0%
Other values (2) 491
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14560
61.0%
ASCII 9318
39.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4492
48.2%
1 800
 
8.6%
- 748
 
8.0%
2 576
 
6.2%
3 541
 
5.8%
4 382
 
4.1%
5 351
 
3.8%
6 351
 
3.8%
7 304
 
3.3%
8 282
 
3.0%
Other values (2) 491
 
5.3%
Hangul
ValueCountFrequency (%)
1347
 
9.3%
1148
 
7.9%
1140
 
7.8%
1127
 
7.7%
1126
 
7.7%
1125
 
7.7%
1118
 
7.7%
1105
 
7.6%
924
 
6.3%
282
 
1.9%
Other values (117) 4118
28.3%

Interactions

2024-01-10T06:14:46.944506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:14:50.238051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축종사육두수행정동
축종1.0000.6240.331
사육두수0.6241.0000.000
행정동0.3310.0001.000
2024-01-10T06:14:50.541501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동축종
행정동1.0000.127
축종0.1271.000
2024-01-10T06:14:50.721789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육두수축종행정동
사육두수1.0000.3320.000
축종0.3321.0000.127
행정동0.0000.1271.000

Missing values

2024-01-10T06:14:47.051258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:14:47.147006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

농장명축종사육두수행정동소재지도로명주소소재지지번주소
0(주)도드람양돈서비스돼지70양촌면충청남도 논산시 양촌면 매죽헌로1369번길 3-96충청남도 논산시 양촌면 석서리 397-1
1(주)에스더블유디에프돼지4000가야곡면충청남도 논산시 가야곡면 가야로 192-10충청남도 논산시 가야곡면 등리 709-7
2(주)지산농원산란계5000연산면충청남도 논산시 연산면 화악2길 38-5충청남도 논산시 연산면 화악리 307
3(주)친환경식품축산농업회사법인 직영점한우2가야곡면충청남도 논산시 가야곡면 원앙로842번길 41-3충청남도 논산시 가야곡면 두월리 44
4(주)해피팜스돼지2000연산면충청남도 논산시 연산면 화악길 263-6충청남도 논산시 연산면 화악리 587-2
5E.G.FARM한우40성동면충청남도 논산시 성동면 금백로585번길 74-14충청남도 논산시 성동면 우곤리 762
6가곡농장산양10노성면<NA>충청남도 논산시 노성면 가곡리 375-6
7가나안농장산양50광석면<NA>충청남도 논산시 광석면 사월리 230-21
8가브리엘 농장돼지1000은진면충청남도 논산시 은진면 탑정로 185충청남도 논산시 은진면 남산리 84
9가야농장한우55가야곡면충청남도 논산시 가야곡면 매죽헌로 726-87충청남도 논산시 가야곡면 육곡리 528
농장명축종사육두수행정동소재지도로명주소소재지지번주소
1108황산농장한우10연산면충청남도 논산시 연산면 신양길 150충청남도 논산시 연산면 신양리 182-1
1109황산벌한우한우20광석면충청남도 논산시 광석면 사계로39번길 61충청남도 논산시 광석면 득윤리 639
1110황새울농장한우13연산면충청남도 논산시 연산면 한전2길 66충청남도 논산시 연산면 한전리 354
1111황영자농장한우25성동면충청남도 논산시 성동면 원봉길 25충청남도 논산시 성동면 정지리 377-2
1112황용목장젖소57부적면충청남도 논산시 부적면 감곡길 83-57충청남도 논산시 부적면 감곡리 2-1
1113황지산농장한우50부적면충청남도 논산시 부적면 예학로1길 106충청남도 논산시 부적면 신교리 13-2
1114황화농장돼지300연무읍<NA>충청남도 논산시 연무읍 황화정리 1039-1
1115황화목장젖소100연무읍충청남도 논산시 연무읍 봉황로 241충청남도 논산시 연무읍 황화정리 459-1
1116훈이목장젖소60연무읍<NA>충청남도 논산시 연무읍 고내리 1040-8
1117휴산농장한우100광석면충청남도 논산시 광석면 장마루로 109-14충청남도 논산시 광석면 이사리 391-6