Overview

Dataset statistics

Number of variables: 4
Number of observations: 751
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 29
Duplicate rows (%): 3.9%
Total size in memory: 24.3 KiB
Average record size in memory: 33.2 B

Variable types

Text: 2
Categorical: 1
Numeric: 1

Dataset

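Description: Status of registered livestock farms in Namhae-gun. The data include, by eup/myeon of location, the business name, registered livestock species, road-name address of the business site, and farm size.
Author: Namhae-gun, Gyeongsangnam-do (경상남도 남해군)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15031717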

Alerts

Dataset has 29 (3.9%) duplicate rows (Duplicates)
축종 is highly imbalanced (85.0%) (Imbalance)
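
The 85.0% imbalance figure can be reproduced from the 축종 category counts listed under Common Values. The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum possible value (log2 of the number of classes). The function name `imbalance_score` is illustrative and does not come from the report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy (in bits) of the counts."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # assumption: a single class is treated as not imbalanced
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)

# Counts of the 8 distinct 축종 values, copied from the Common Values table.
counts = [707, 20, 7, 7, 6, 2, 1, 1]
score = imbalance_score(counts)
print(f"{score:.1%}")
```

A score near 0 means the classes are evenly spread; a score near 1 means one class dominates, as 한우 does here with 707 of 751 rows.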

Reproduction

Analysis started: 2023-12-10 22:56:40.905049
Analysis finished: 2023-12-10 22:56:41.397622
Duration: 0.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

농장명
Text

Distinct: 130
Distinct (%): 17.3%
Missing: 0
Missing (%): 0.0%
Memory size: 6.0 KiB

Length

Max length: 11
Median length: 4
Mean length: 4.1078562
Min length: 1

Characters and Unicode

Total characters: 3085
Distinct characters: 172
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
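
As a rough illustration of how the category counts can be derived, the sketch below uses Python's standard `unicodedata` module to tally the general category of every character in a list of strings. The helper name `category_counts` is hypothetical. The codes `Lo`, `Zs`, and `Nd` correspond to the report's "Other Letter", "Space Separator", and "Decimal Number". Script and block properties are not exposed by `unicodedata`, so they are left out here.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over all characters in the given strings."""
    return Counter(unicodedata.category(ch) for text in values for ch in text)

# The five sample rows of 농장명 shown in this report.
sample = ["호산농장", "불매골농장", "애영농장", "미리내농장", "장골농장"]
counts = category_counts(sample)
print(counts)
```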

Unique

Unique: 124
Unique (%): 16.5%

Sample

1st row: 호산농장
2nd row: 불매골농장
3rd row: 애영농장
4th row: 미리내농장
5th row: 장골농장

| Value | Count | Frequency (%) |
| 개인농장 | 617 | 81.0% |
| 망운농장 | 3 | 0.4% |
| 영지축산 | 3 | 0.4% |
| 갱문농장 | 2 | 0.3% |
| 농장 | 2 | 0.3% |
| 흑염소농장 | 2 | 0.3% |
| 미리내농장 | 2 | 0.3% |
| 옥석농장 | 2 | 0.3% |
| 호산농장 | 2 | 0.3% |
| 우리농장 | 1 | 0.1% |
| Other values (126) | 126 | 16.5% |

Most occurring characters

| Value | Count | Frequency (%) |
|  | 728 | 23.6% |
|  | 719 | 23.3% |
|  | 619 | 20.1% |
|  | 618 | 20.0% |
|  | 21 | 0.7% |
|  | 19 | 0.6% |
|  | 11 | 0.4% |
|  | 11 | 0.4% |
|  | 11 | 0.4% |
|  | 9 | 0.3% |
| Other values (162) | 319 | 10.3% |

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 3069 | 99.5% |
| Space Separator | 11 | 0.4% |
| Decimal Number | 5 | 0.2% |

Most frequent character per category

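Other Letter
| Value | Count | Frequency (%) |
|  | 728 | 23.7% |
|  | 719 | 23.4% |
|  | 619 | 20.2% |
|  | 618 | 20.1% |
|  | 21 | 0.7% |
|  | 19 | 0.6% |
|  | 11 | 0.4% |
|  | 11 | 0.4% |
|  | 9 | 0.3% |
|  | 8 | 0.3% |
| Other values (159) | 306 | 10.0% |

Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | 60.0% |
| 1 | 2 | 40.0% |

Space Separator
| Value | Count | Frequency (%) |
| (space) | 11 | 100.0% |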

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 3069 | 99.5% |
| Common | 16 | 0.5% |

Most frequent character per script

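Hangul
| Value | Count | Frequency (%) |
|  | 728 | 23.7% |
|  | 719 | 23.4% |
|  | 619 | 20.2% |
|  | 618 | 20.1% |
|  | 21 | 0.7% |
|  | 19 | 0.6% |
|  | 11 | 0.4% |
|  | 11 | 0.4% |
|  | 9 | 0.3% |
|  | 8 | 0.3% |
| Other values (159) | 306 | 10.0% |

Common
| Value | Count | Frequency (%) |
| (space) | 11 | 68.8% |
| 2 | 3 | 18.8% |
| 1 | 2 | 12.5% |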

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 3069 | 99.5% |
| ASCII | 16 | 0.5% |

Most frequent character per block

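Hangul
| Value | Count | Frequency (%) |
|  | 728 | 23.7% |
|  | 719 | 23.4% |
|  | 619 | 20.2% |
|  | 618 | 20.1% |
|  | 21 | 0.7% |
|  | 19 | 0.6% |
|  | 11 | 0.4% |
|  | 11 | 0.4% |
|  | 9 | 0.3% |
|  | 8 | 0.3% |
| Other values (159) | 306 | 10.0% |

ASCII
| Value | Count | Frequency (%) |
| (space) | 11 | 68.8% |
| 2 | 3 | 18.8% |
| 1 | 2 | 12.5% |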

축종
Categorical

IMBALANCE 

Distinct: 8
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.0 KiB

| Value | Count |
| 한우 | 707 |
| 산양 | 20 |
| 젖소 | 7 |
| 돼지 | 7 |
| 육계 | 6 |
| Other values (3) | 4 |

Length

Max length: 3
Median length: 2
Mean length: 2.0013316
Min length: 1

Unique

Unique: 2
Unique (%): 0.3%

Sample

1st row: 한우
2nd row: 한우
3rd row: 한우
4th row: 한우
5th row: 한우

Common Values

| Value | Count | Frequency (%) |
| 한우 | 707 | 94.1% |
| 산양 | 20 | 2.7% |
| 젖소 | 7 | 0.9% |
| 돼지 | 7 | 0.9% |
| 육계 | 6 | 0.8% |
| 산란계 | 2 | 0.3% |
| 면양 | 1 | 0.1% |
|  | 1 | 0.1% |

Length

Histogram of lengths of the category

소재지
Text

Distinct: 73
Distinct (%): 9.7%
Missing: 0
Missing (%): 0.0%
Memory size: 6.0 KiB

Length

Max length: 16
Median length: 16
Mean length: 16
Min length: 16

Characters and Unicode

Total characters: 12016
Distinct characters: 92
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9
Unique (%): 1.2%

Sample

1st row: 경상남도 남해군 남면 상가리
2nd row: 경상남도 남해군 서면 정포리
3rd row: 경상남도 남해군 서면 서호리
4th row: 경상남도 남해군 이동면 다정리
5th row: 경상남도 남해군 삼동면 봉화리

| Value | Count | Frequency (%) |
| 경상남도 | 751 | 25.0% |
| 남해군 | 751 | 25.0% |
| 서면 | 149 | 5.0% |
| 남면 | 147 | 4.9% |
| 설천면 | 103 | 3.4% |
| 이동면 | 90 | 3.0% |
| 고현면 | 80 | 2.7% |
| 남해읍 | 61 | 2.0% |
| 창선면 | 56 | 1.9% |
| 삼동면 | 51 | 1.7% |
| Other values (74) | 765 | 25.5% |

Most occurring characters

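| Value | Count | Frequency (%) |
| (space) | 2563 | 21.3% |
|  | 1771 | 14.7% |
|  | 872 | 7.3% |
|  | 812 | 6.8% |
|  | 763 | 6.3% |
|  | 751 | 6.2% |
|  | 751 | 6.2% |
|  | 751 | 6.2% |
|  | 694 | 5.8% |
|  | 179 | 1.5% |
| Other values (82) | 2109 | 17.6% |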

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 9453 | 78.7% |
| Space Separator | 2563 | 21.3% |

Most frequent character per category

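Other Letter
| Value | Count | Frequency (%) |
|  | 1771 | 18.7% |
|  | 872 | 9.2% |
|  | 812 | 8.6% |
|  | 763 | 8.1% |
|  | 751 | 7.9% |
|  | 751 | 7.9% |
|  | 751 | 7.9% |
|  | 694 | 7.3% |
|  | 179 | 1.9% |
|  | 154 | 1.6% |
| Other values (81) | 1955 | 20.7% |

Space Separator
| Value | Count | Frequency (%) |
| (space) | 2563 | 100.0% |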

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 9453 | 78.7% |
| Common | 2563 | 21.3% |

Most frequent character per script

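Hangul
| Value | Count | Frequency (%) |
|  | 1771 | 18.7% |
|  | 872 | 9.2% |
|  | 812 | 8.6% |
|  | 763 | 8.1% |
|  | 751 | 7.9% |
|  | 751 | 7.9% |
|  | 751 | 7.9% |
|  | 694 | 7.3% |
|  | 179 | 1.9% |
|  | 154 | 1.6% |
| Other values (81) | 1955 | 20.7% |

Common
| Value | Count | Frequency (%) |
| (space) | 2563 | 100.0% |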

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 9453 | 78.7% |
| ASCII | 2563 | 21.3% |

Most frequent character per block

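ASCII
| Value | Count | Frequency (%) |
| (space) | 2563 | 100.0% |

Hangul
| Value | Count | Frequency (%) |
|  | 1771 | 18.7% |
|  | 872 | 9.2% |
|  | 812 | 8.6% |
|  | 763 | 8.1% |
|  | 751 | 7.9% |
|  | 751 | 7.9% |
|  | 751 | 7.9% |
|  | 694 | 7.3% |
|  | 179 | 1.9% |
|  | 154 | 1.6% |
| Other values (81) | 1955 | 20.7% |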

농장규모(㎡)
Real number (ℝ)

Distinct: 353
Distinct (%): 47.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 241.43449
Minimum: 10
Maximum: 7438
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.7 KiB

Quantile statistics

Minimum: 10
5-th percentile: 11
Q1: 22
Median: 80
Q3: 288
95-th percentile: 900
Maximum: 7438
Range: 7428
Interquartile range (IQR): 266

Descriptive statistics

Standard deviation: 503.06436
Coefficient of variation (CV): 2.0836475
Kurtosis: 93.411696
Mean: 241.43449
Median Absolute Deviation (MAD): 66
Skewness: 7.9139134
Sum: 181317.3
Variance: 253073.75
Monotonicity: Not monotonic
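
Several of these figures follow from one another by simple arithmetic. The sketch below recomputes the derived values from the reported ones as a consistency check. All inputs are copied from this report. The variable names are illustrative.

```python
# Values copied from the statistics reported for 농장규모(㎡).
n = 751                      # number of observations
total = 181317.3             # Sum
mean = 241.43449             # Mean
std = 503.06436              # Standard deviation
q1, q3 = 22, 288             # Q1 and Q3
minimum, maximum = 10, 7438  # Minimum and Maximum

mean_from_sum = total / n        # should match the reported Mean
cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(round(mean_from_sum, 5), round(cv, 7), round(variance, 2), iqr, value_range)
```

A CV above 2, together with a skewness of about 7.9 and a median (80) well below the mean, indicates a strongly right-skewed distribution driven by a few very large farms.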
Histogram with fixed size bins (bins=50)
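| Value | Count | Frequency (%) |
| 10.0 | 34 | 4.5% |
| 12.0 | 23 | 3.1% |
| 64.0 | 18 | 2.4% |
| 20.0 | 18 | 2.4% |
| 32.0 | 14 | 1.9% |
| 15.0 | 13 | 1.7% |
| 16.0 | 13 | 1.7% |
| 18.0 | 13 | 1.7% |
| 192.0 | 11 | 1.5% |
| 96.0 | 10 | 1.3% |
| Other values (343) | 584 | 77.8% |

| Value | Count | Frequency (%) |
| 10.0 | 34 | 4.5% |
| 10.2 | 1 | 0.1% |
| 10.8 | 1 | 0.1% |
| 11.0 | 7 | 0.9% |
| 11.5 | 1 | 0.1% |
| 12.0 | 23 | 3.1% |
| 13.0 | 8 | 1.1% |
| 13.2 | 2 | 0.3% |
| 13.22 | 1 | 0.1% |
| 13.3 | 1 | 0.1% |

| Value | Count | Frequency (%) |
| 7438.0 | 1 | 0.1% |
| 6621.78 | 1 | 0.1% |
| 2927.33 | 1 | 0.1% |
| 2767.5 | 1 | 0.1% |
| 2649.6 | 1 | 0.1% |
| 2641.5 | 1 | 0.1% |
| 2572.84 | 1 | 0.1% |
| 2470.5 | 1 | 0.1% |
| 1839.5 | 1 | 0.1% |
| 1835.0 | 1 | 0.1% |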

Interactions


Correlations

|  | 축종 | 소재지 | 농장규모(㎡) |
| 축종 | 1.000 | 0.000 | 0.253 |
| 소재지 | 0.000 | 1.000 | 0.000 |
| 농장규모(㎡) | 0.253 | 0.000 | 1.000 |

|  | 농장규모(㎡) | 축종 |
| 농장규모(㎡) | 1.000 | 0.143 |
| 축종 | 0.143 | 1.000 |

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

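|  | 농장명 | 축종 | 소재지 | 농장규모(㎡) |
| 0 | 호산농장 | 한우 | 경상남도 남해군 남면 상가리 | 1160.0 |
| 1 | 불매골농장 | 한우 | 경상남도 남해군 서면 정포리 | 2927.33 |
| 2 | 애영농장 | 한우 | 경상남도 남해군 서면 서호리 | 941.94 |
| 3 | 미리내농장 | 한우 | 경상남도 남해군 이동면 다정리 | 400.0 |
| 4 | 장골농장 | 한우 | 경상남도 남해군 삼동면 봉화리 | 736.0 |
| 5 | 복이네농장 | 한우 | 경상남도 남해군 삼동면 영지리 | 436.0 |
| 6 | 누렁이농장 | 한우 | 경상남도 남해군 남해읍 평리 | 1120.0 |
| 7 | 개인농장 | 한우 | 경상남도 남해군 서면 남상리 | 576.0 |
| 8 | 분대골농장 | 한우 | 경상남도 남해군 이동면 다정리 | 436.0 |
| 9 | 개인농장 | 한우 | 경상남도 남해군 고현면 대곡리 | 352.0 |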
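
|  | 농장명 | 축종 | 소재지 | 농장규모(㎡) |
| 741 | 바람이농장 | 한우 | 경상남도 남해군 남면 덕월리 | 320.0 |
| 742 | 개인농장 | 한우 | 경상남도 남해군 남해읍 아산리 | 82.82 |
| 743 | 개인농장 | 한우 | 경상남도 남해군 상주면 양아리 | 169.0 |
| 744 | 명안농장 | 한우 | 경상남도 남해군 고현면 차면리 | 128.0 |
| 745 | 개인농장 | 한우 | 경상남도 남해군 삼동면 영지리 | 972.0 |
| 746 | 개인농장 | 한우 | 경상남도 남해군 삼동면 영지리 | 256.0 |
| 747 | 개인농장 | 한우 | 경상남도 남해군 서면 중현리 | 80.0 |
| 748 | 태하농장 | 한우 | 경상남도 남해군 창선면 광천리 | 7438.0 |
| 749 | 개인농장 | 한우 | 경상남도 남해군 창선면 상신리 | 40.0 |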
750한우경상남도 남해군 창선면 옥천리32.0

Duplicate rows

Most frequently occurring

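|  | 농장명 | 축종 | 소재지 | 농장규모(㎡) | # duplicates |
| 3 | 개인농장 | 한우 | 경상남도 남해군 남면 덕월리 | 10.0 | 4 |
| 14 | 개인농장 | 한우 | 경상남도 남해군 서면 남상리 | 64.0 | 3 |
| 24 | 개인농장 | 한우 | 경상남도 남해군 설천면 덕신리 | 10.0 | 3 |
| 27 | 개인농장 | 한우 | 경상남도 남해군 이동면 다정리 | 10.0 | 3 |
| 0 | 개인농장 | 한우 | 경상남도 남해군 남면 당항리 | 16.0 | 2 |
| 1 | 개인농장 | 한우 | 경상남도 남해군 남면 당항리 | 64.0 | 2 |
| 2 | 개인농장 | 한우 | 경상남도 남해군 남면 당항리 | 192.0 | 2 |
| 4 | 개인농장 | 한우 | 경상남도 남해군 남면 덕월리 | 18.0 | 2 |
| 5 | 개인농장 | 한우 | 경상남도 남해군 남면 덕월리 | 64.0 | 2 |
| 6 | 개인농장 | 한우 | 경상남도 남해군 남면 상가리 | 12.0 | 2 |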
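
A table like the one above can be approximated by counting identical rows and keeping those that occur more than once. The sketch below is one minimal way to do that with the standard library. The helper name `duplicate_groups` and the toy input are illustrative only; the input is not the profiled dataset, and the profiling tool's own implementation may differ.

```python
from collections import Counter

def duplicate_groups(rows):
    """Return (row, count) pairs for rows that occur more than once, most frequent first."""
    counts = Counter(rows)
    return [(row, n) for row, n in counts.most_common() if n > 1]

# Toy input for illustration only; it is not the profiled dataset.
rows = [
    ("개인농장", "한우", "경상남도 남해군 남면 덕월리", 10.0),
    ("개인농장", "한우", "경상남도 남해군 서면 남상리", 64.0),
    ("개인농장", "한우", "경상남도 남해군 남면 덕월리", 10.0),
    ("호산농장", "한우", "경상남도 남해군 남면 상가리", 1160.0),
    ("개인농장", "한우", "경상남도 남해군 남면 덕월리", 10.0),
    ("개인농장", "한우", "경상남도 남해군 서면 남상리", 64.0),
]
groups = duplicate_groups(rows)
for row, n in groups:
    print(n, row)
```

Because most farms share the generic name 개인농장 and the same 축종, rows that agree on village and size are flagged as duplicates even though they may describe different farms.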