Overview

Dataset statistics

Number of variables5
Number of observations888
Missing cells6
Missing cells (%)0.1%
Duplicate rows2
Duplicate rows (%)0.2%
Total size in memory35.7 KiB
Average record size in memory41.1 B

Variable types

Text3
Categorical1
Numeric1

Dataset

Description경상남도 창녕군의 축산정보_농장정보 데이터를 제공하고 있습니다.(사업장 명칭, 주사육업종, 사업장주소, 사육두수, 사육면적)
URLhttps://www.data.go.kr/data/15031861/fileData.do

Alerts

Dataset has 2 (0.2%) duplicate rowsDuplicates
주사육업종 is highly imbalanced (71.8%)Imbalance

Reproduction

Analysis started2023-12-12 14:44:28.316401
Analysis finished2023-12-12 14:44:28.957795
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct821
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
2023-12-12T23:44:29.268716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length4
Mean length4.4234234
Min length2

Characters and Unicode

Total characters3928
Distinct characters310
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique765 ?
Unique (%)86.1%

Sample

1st row건회목장
2nd row송곡목장
3rd row성화농장
4th row자우목장
5th row송원축산
ValueCountFrequency (%)
농장 6
 
0.7%
대성농장 4
 
0.4%
창녕축산 4
 
0.4%
양정축산 3
 
0.3%
원동양계장 3
 
0.3%
벧엘농장 3
 
0.3%
행복농장 3
 
0.3%
금농영농조합법인 3
 
0.3%
수복농장 3
 
0.3%
대성축산 3
 
0.3%
Other values (824) 881
96.2%
2023-12-12T23:44:29.910933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
500
 
12.7%
441
 
11.2%
369
 
9.4%
352
 
9.0%
68
 
1.7%
65
 
1.7%
63
 
1.6%
62
 
1.6%
2 49
 
1.2%
46
 
1.2%
Other values (300) 1913
48.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3800
96.7%
Decimal Number 80
 
2.0%
Space Separator 28
 
0.7%
Uppercase Letter 6
 
0.2%
Open Punctuation 5
 
0.1%
Close Punctuation 5
 
0.1%
Other Punctuation 1
 
< 0.1%
Lowercase Letter 1
 
< 0.1%
Other Symbol 1
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
500
 
13.2%
441
 
11.6%
369
 
9.7%
352
 
9.3%
68
 
1.8%
65
 
1.7%
63
 
1.7%
62
 
1.6%
46
 
1.2%
45
 
1.2%
Other values (286) 1789
47.1%
Decimal Number
ValueCountFrequency (%)
2 49
61.3%
1 25
31.2%
3 5
 
6.2%
6 1
 
1.2%
Uppercase Letter
ValueCountFrequency (%)
O 2
33.3%
K 2
33.3%
J 2
33.3%
Space Separator
ValueCountFrequency (%)
28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3801
96.8%
Common 119
 
3.0%
Latin 8
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
500
 
13.2%
441
 
11.6%
369
 
9.7%
352
 
9.3%
68
 
1.8%
65
 
1.7%
63
 
1.7%
62
 
1.6%
46
 
1.2%
45
 
1.2%
Other values (287) 1790
47.1%
Common
ValueCountFrequency (%)
2 49
41.2%
28
23.5%
1 25
21.0%
( 5
 
4.2%
3 5
 
4.2%
) 5
 
4.2%
6 1
 
0.8%
& 1
 
0.8%
Latin
ValueCountFrequency (%)
O 2
25.0%
K 2
25.0%
J 2
25.0%
e 1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3800
96.7%
ASCII 126
 
3.2%
None 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
500
 
13.2%
441
 
11.6%
369
 
9.7%
352
 
9.3%
68
 
1.8%
65
 
1.7%
63
 
1.7%
62
 
1.6%
46
 
1.2%
45
 
1.2%
Other values (286) 1789
47.1%
ASCII
ValueCountFrequency (%)
2 49
38.9%
28
22.2%
1 25
19.8%
( 5
 
4.0%
3 5
 
4.0%
) 5
 
4.0%
O 2
 
1.6%
K 2
 
1.6%
J 2
 
1.6%
6 1
 
0.8%
Other values (2) 2
 
1.6%
None
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

주사육업종
Categorical

IMBALANCE 

Distinct12
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
한우
756 
육계
 
42
돼지
 
29
종계/산란계
 
18
젖소
 
17
Other values (7)
 
26

Length

Max length6
Median length2
Mean length2.088964
Min length2

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row한우
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 756
85.1%
육계 42
 
4.7%
돼지 29
 
3.3%
종계/산란계 18
 
2.0%
젖소 17
 
1.9%
염소 8
 
0.9%
육우 5
 
0.6%
오리 5
 
0.6%
산양 4
 
0.5%
산란육성계 2
 
0.2%
Other values (2) 2
 
0.2%

Length

2023-12-12T23:44:30.074532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 756
85.1%
육계 42
 
4.7%
돼지 29
 
3.3%
종계/산란계 18
 
2.0%
젖소 17
 
1.9%
염소 8
 
0.9%
육우 5
 
0.6%
오리 5
 
0.6%
산양 4
 
0.5%
산란육성계 2
 
0.2%
Other values (2) 2
 
0.2%
Distinct850
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
2023-12-12T23:44:30.431180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length57
Mean length25.897523
Min length18

Characters and Unicode

Total characters22997
Distinct characters147
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique812 ?
Unique (%)91.4%

Sample

1st row경상남도 창녕군 길곡면 마천리 106번지 11호
2nd row경상남도 창녕군 이방면 장천리 482번지 3호
3rd row경상남도 창녕군 도천면 송진리 439번지
4th row경상남도 창녕군 도천면 일리 330번지 1호
5th row경상남도 창녕군 도천면 송진리 71번지
ValueCountFrequency (%)
경상남도 888
 
17.7%
창녕군 888
 
17.7%
1호 189
 
3.8%
대합면 141
 
2.8%
계성면 96
 
1.9%
이방면 89
 
1.8%
2호 84
 
1.7%
창녕읍 81
 
1.6%
유어면 73
 
1.5%
장마면 70
 
1.4%
Other values (809) 2405
48.1%
2023-12-12T23:44:30.967465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5830
25.4%
1039
 
4.5%
991
 
4.3%
990
 
4.3%
980
 
4.3%
936
 
4.1%
896
 
3.9%
891
 
3.9%
888
 
3.9%
888
 
3.9%
Other values (137) 8668
37.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13758
59.8%
Space Separator 5830
25.4%
Decimal Number 3384
 
14.7%
Open Punctuation 12
 
0.1%
Close Punctuation 12
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1039
 
7.6%
991
 
7.2%
990
 
7.2%
980
 
7.1%
936
 
6.8%
896
 
6.5%
891
 
6.5%
888
 
6.5%
888
 
6.5%
850
 
6.2%
Other values (123) 4409
32.0%
Decimal Number
ValueCountFrequency (%)
1 701
20.7%
2 433
12.8%
3 356
10.5%
4 337
10.0%
5 291
8.6%
7 281
8.3%
6 267
 
7.9%
8 258
 
7.6%
9 235
 
6.9%
0 225
 
6.6%
Space Separator
ValueCountFrequency (%)
5830
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Other Punctuation
ValueCountFrequency (%)
: 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13758
59.8%
Common 9239
40.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1039
 
7.6%
991
 
7.2%
990
 
7.2%
980
 
7.1%
936
 
6.8%
896
 
6.5%
891
 
6.5%
888
 
6.5%
888
 
6.5%
850
 
6.2%
Other values (123) 4409
32.0%
Common
ValueCountFrequency (%)
5830
63.1%
1 701
 
7.6%
2 433
 
4.7%
3 356
 
3.9%
4 337
 
3.6%
5 291
 
3.1%
7 281
 
3.0%
6 267
 
2.9%
8 258
 
2.8%
9 235
 
2.5%
Other values (4) 250
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13758
59.8%
ASCII 9239
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5830
63.1%
1 701
 
7.6%
2 433
 
4.7%
3 356
 
3.9%
4 337
 
3.6%
5 291
 
3.1%
7 281
 
3.0%
6 267
 
2.9%
8 258
 
2.8%
9 235
 
2.5%
Other values (4) 250
 
2.7%
Hangul
ValueCountFrequency (%)
1039
 
7.6%
991
 
7.2%
990
 
7.2%
980
 
7.1%
936
 
6.8%
896
 
6.5%
891
 
6.5%
888
 
6.5%
888
 
6.5%
850
 
6.2%
Other values (123) 4409
32.0%
Distinct205
Distinct (%)23.2%
Missing6
Missing (%)0.7%
Memory size7.1 KiB
2023-12-12T23:44:31.356686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length2
Mean length2.0873016
Min length1

Characters and Unicode

Total characters1841
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique102 ?
Unique (%)11.6%

Sample

1st row54
2nd row5
3rd row56
4th row38
5th row288
ValueCountFrequency (%)
2 26
 
3.7%
5 24
 
3.4%
4 21
 
3.0%
6 20
 
2.9%
3 18
 
2.6%
9 15
 
2.1%
7 13
 
1.9%
12 13
 
1.9%
16 13
 
1.9%
8 13
 
1.9%
Other values (194) 523
74.8%
2023-12-12T23:44:31.806136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
366
19.9%
1 266
14.4%
2 205
11.1%
0 170
9.2%
3 158
8.6%
4 135
 
7.3%
5 134
 
7.3%
6 120
 
6.5%
7 97
 
5.3%
8 97
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1475
80.1%
Space Separator 366
 
19.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 266
18.0%
2 205
13.9%
0 170
11.5%
3 158
10.7%
4 135
9.2%
5 134
9.1%
6 120
8.1%
7 97
 
6.6%
8 97
 
6.6%
9 93
 
6.3%
Space Separator
ValueCountFrequency (%)
366
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1841
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
366
19.9%
1 266
14.4%
2 205
11.1%
0 170
9.2%
3 158
8.6%
4 135
 
7.3%
5 134
 
7.3%
6 120
 
6.5%
7 97
 
5.3%
8 97
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1841
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
366
19.9%
1 266
14.4%
2 205
11.1%
0 170
9.2%
3 158
8.6%
4 135
 
7.3%
5 134
 
7.3%
6 120
 
6.5%
7 97
 
5.3%
8 97
 
5.3%
Distinct596
Distinct (%)67.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean928.71171
Minimum10
Maximum23655
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2023-12-12T23:44:31.970781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile50
Q1181.5
median456.5
Q31040
95-th percentile3300.65
Maximum23655
Range23645
Interquartile range (IQR)858.5

Descriptive statistics

Standard deviation1631.1373
Coefficient of variation (CV)1.7563441
Kurtosis67.987239
Mean928.71171
Median Absolute Deviation (MAD)335.5
Skewness6.7730609
Sum824696
Variance2660609
MonotonicityNot monotonic
2023-12-12T23:44:32.113860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99 10
 
1.1%
846 8
 
0.9%
200 8
 
0.9%
165 8
 
0.9%
288 8
 
0.9%
396 7
 
0.8%
400 7
 
0.8%
150 6
 
0.7%
198 6
 
0.7%
90 6
 
0.7%
Other values (586) 814
91.7%
ValueCountFrequency (%)
10 2
0.2%
15 3
0.3%
17 3
0.3%
20 1
 
0.1%
22 1
 
0.1%
24 1
 
0.1%
25 2
0.2%
26 1
 
0.1%
27 1
 
0.1%
30 2
0.2%
ValueCountFrequency (%)
23655 1
0.1%
15823 1
0.1%
15060 1
0.1%
14952 1
0.1%
14059 1
0.1%
9420 1
0.1%
8510 1
0.1%
8114 1
0.1%
6897 1
0.1%
6346 1
0.1%

Interactions

2023-12-12T23:44:28.682343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:44:32.185975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주사육업종사육면적(제곱미터)
주사육업종1.0000.673
사육면적(제곱미터)0.6731.000
2023-12-12T23:44:32.259273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육면적(제곱미터)주사육업종
사육면적(제곱미터)1.0000.408
주사육업종0.4081.000

Missing values

2023-12-12T23:44:28.806678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:44:28.910513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명칭주사육업종사업장소재지(지번)사육두수(두수)사육면적(제곱미터)
0건회목장한우경상남도 창녕군 길곡면 마천리 106번지 11호54700
1송곡목장한우경상남도 창녕군 이방면 장천리 482번지 3호<NA>793
2성화농장한우경상남도 창녕군 도천면 송진리 439번지5410
3자우목장한우경상남도 창녕군 도천면 일리 330번지 1호56700
4송원축산한우경상남도 창녕군 도천면 송진리 71번지38678
5훈이농장한우경상남도 창녕군 장마면 신구리 392번지 외 3필지 및 38212882453
6만복농장한우경상남도 창녕군 대지면 용소리 787번지<NA>296
7이화목장한우경상남도 창녕군 도천면 예리 1019번지 2호5594
8대몽농장한우경상남도 창녕군 대합면 이방리 474번지13490
9우산목장한우경상남도 창녕군 이방면 장천리 717번지 5호6734938
사업장명칭주사육업종사업장소재지(지번)사육두수(두수)사육면적(제곱미터)
878금농영농조합법인돼지경상남도 창녕군 성산면 대산리 865번지14711435
879가복농장돼지경상남도 창녕군 성산면 가복리 28번지120
880준이양돈돼지경상남도 창녕군 장마면 산지리 677번지1260
881성일축산돼지경상남도 창녕군 유어면 광산리 108번지27321745
882범골농장돼지경상남도 창녕군 남지읍 월하리 531번지 1호396
883장개축산돼지경상남도 창녕군 계성면 봉산리 1467번지 1호29952208
884거부농장돼지경상남도 창녕군 창녕읍 퇴천리 650번지20771859
885농업회사법인 ㈜일오삼축산돼지경상남도 창녕군 이방면 안리 111번지3301
886해돌이농장돼지경상남도 창녕군 대합면 모전리 1221번지19802251
887도방육종돼지경상남도 창녕군 영산면 월령리 568번지 3호59084950

Duplicate rows

Most frequently occurring

사업장명칭주사육업종사업장소재지(지번)사육두수(두수)사육면적(제곱미터)# duplicates
0대가축산한우경상남도 창녕군 유어면 선소리 76번지1198462
1벧엘농장종계/산란계경상남도 창녕군 계성면 봉산리 674번지 1호22722