Overview

Dataset statistics

Number of variables6
Number of observations986
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory47.3 KiB
Average record size in memory49.1 B

Variable types

Categorical3
Text2
Numeric1

Dataset

Description전라북도 부안군 축산업현황을 축산업 종류, 사업장 명칭, 주사육업종, 사업장소재지, 사육두수, 데이터 기준일자 등의 항목으로 제공합니다.
Author전라북도 부안군
URLhttps://www.data.go.kr/data/15034276/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
사육두수 is highly overall correlated with 주사육업종High correlation
축산업종류 is highly overall correlated with 주사육업종High correlation
주사육업종 is highly overall correlated with 사육두수 and 1 other fieldsHigh correlation
축산업종류 is highly imbalanced (87.4%)Imbalance
주사육업종 is highly imbalanced (65.2%)Imbalance
사육두수 has 46 (4.7%) zerosZeros

Reproduction

Analysis started2023-12-11 23:51:41.973651
Analysis finished2023-12-11 23:51:42.551596
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

축산업종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
가축사육업
969 
종축업
 
17

Length

Max length5
Median length5
Mean length4.9655172
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가축사육업
2nd row가축사육업
3rd row가축사육업
4th row가축사육업
5th row가축사육업

Common Values

ValueCountFrequency (%)
가축사육업 969
98.3%
종축업 17
 
1.7%

Length

2023-12-12T08:51:42.620908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:51:42.708641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가축사육업 969
98.3%
종축업 17
 
1.7%
Distinct856
Distinct (%)86.8%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
2023-12-12T08:51:42.930883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length4
Mean length4.5922921
Min length2

Characters and Unicode

Total characters4528
Distinct characters354
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique772 ?
Unique (%)78.3%

Sample

1st row서율농장
2nd row감교농장
3rd row월제농장
4th row길호농장
5th row광열농장
ValueCountFrequency (%)
농장 23
 
2.2%
대성농장 8
 
0.8%
유정농장 6
 
0.6%
태양농장 6
 
0.6%
부자농장 6
 
0.6%
푸른농장 6
 
0.6%
우리농장 5
 
0.5%
한우농장 5
 
0.5%
가야축산 4
 
0.4%
주산농장 4
 
0.4%
Other values (858) 955
92.9%
2023-12-12T08:51:43.317291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
957
21.1%
921
20.3%
73
 
1.6%
63
 
1.4%
60
 
1.3%
60
 
1.3%
57
 
1.3%
42
 
0.9%
42
 
0.9%
42
 
0.9%
Other values (344) 2211
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4410
97.4%
Space Separator 42
 
0.9%
Decimal Number 34
 
0.8%
Lowercase Letter 16
 
0.4%
Uppercase Letter 12
 
0.3%
Close Punctuation 6
 
0.1%
Open Punctuation 6
 
0.1%
Dash Punctuation 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
957
21.7%
921
20.9%
73
 
1.7%
63
 
1.4%
60
 
1.4%
60
 
1.4%
57
 
1.3%
42
 
1.0%
42
 
1.0%
41
 
0.9%
Other values (316) 2094
47.5%
Lowercase Letter
ValueCountFrequency (%)
f 3
18.8%
o 2
12.5%
m 2
12.5%
a 2
12.5%
r 2
12.5%
s 1
 
6.2%
j 1
 
6.2%
d 1
 
6.2%
n 1
 
6.2%
i 1
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
S 3
25.0%
F 2
16.7%
K 2
16.7%
H 1
 
8.3%
G 1
 
8.3%
Y 1
 
8.3%
D 1
 
8.3%
O 1
 
8.3%
Decimal Number
ValueCountFrequency (%)
2 20
58.8%
1 10
29.4%
3 2
 
5.9%
6 1
 
2.9%
5 1
 
2.9%
Space Separator
ValueCountFrequency (%)
42
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4410
97.4%
Common 90
 
2.0%
Latin 28
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
957
21.7%
921
20.9%
73
 
1.7%
63
 
1.4%
60
 
1.4%
60
 
1.4%
57
 
1.3%
42
 
1.0%
42
 
1.0%
41
 
0.9%
Other values (316) 2094
47.5%
Latin
ValueCountFrequency (%)
f 3
 
10.7%
S 3
 
10.7%
F 2
 
7.1%
o 2
 
7.1%
K 2
 
7.1%
m 2
 
7.1%
a 2
 
7.1%
r 2
 
7.1%
H 1
 
3.6%
G 1
 
3.6%
Other values (8) 8
28.6%
Common
ValueCountFrequency (%)
42
46.7%
2 20
22.2%
1 10
 
11.1%
) 6
 
6.7%
( 6
 
6.7%
3 2
 
2.2%
- 1
 
1.1%
' 1
 
1.1%
6 1
 
1.1%
5 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4410
97.4%
ASCII 118
 
2.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
957
21.7%
921
20.9%
73
 
1.7%
63
 
1.4%
60
 
1.4%
60
 
1.4%
57
 
1.3%
42
 
1.0%
42
 
1.0%
41
 
0.9%
Other values (316) 2094
47.5%
ASCII
ValueCountFrequency (%)
42
35.6%
2 20
16.9%
1 10
 
8.5%
) 6
 
5.1%
( 6
 
5.1%
f 3
 
2.5%
S 3
 
2.5%
F 2
 
1.7%
o 2
 
1.7%
3 2
 
1.7%
Other values (18) 22
18.6%

주사육업종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct15
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
한우
766 
육계
84 
오리
 
57
돼지
 
18
염소
 
15
Other values (10)
 
46

Length

Max length6
Median length2
Mean length2.0070994
Min length1

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row한우
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 766
77.7%
육계 84
 
8.5%
오리 57
 
5.8%
돼지 18
 
1.8%
염소 15
 
1.5%
12
 
1.2%
젖소 11
 
1.1%
사슴 7
 
0.7%
육우 5
 
0.5%
종계/산란계 3
 
0.3%
Other values (5) 8
 
0.8%

Length

2023-12-12T08:51:43.439394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 766
77.7%
육계 84
 
8.5%
오리 57
 
5.8%
돼지 18
 
1.8%
염소 15
 
1.5%
12
 
1.2%
젖소 11
 
1.1%
사슴 7
 
0.7%
육우 5
 
0.5%
종계/산란계 3
 
0.3%
Other values (5) 8
 
0.8%
Distinct956
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
2023-12-12T08:51:43.984329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length102
Median length73
Mean length29.802231
Min length4

Characters and Unicode

Total characters29385
Distinct characters120
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique933 ?
Unique (%)94.6%

Sample

1st row전라북도 부안군 백산면 하청리 245번지 외 3필지(243-1 246-2 246-10)
2nd row전라북도 부안군 상서면 감교리 811번지 19호 811-20 811-52.주산면 사산리1178-7 1178-8 1178-30
3rd row전라북도 부안군 하서면 백련리 567번지 6호
4th row전라북도 부안군 하서면 백련리 470번지
5th row전라북도 부안군 보안면 유천리 439번지
ValueCountFrequency (%)
전라북도 977
 
15.7%
부안군 977
 
15.7%
보안면 190
 
3.1%
184
 
3.0%
1호 174
 
2.8%
주산면 126
 
2.0%
백산면 112
 
1.8%
부안읍 96
 
1.5%
줄포면 93
 
1.5%
3호 84
 
1.3%
Other values (1195) 3211
51.6%
2023-12-12T08:51:44.384858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7112
24.2%
1345
 
4.6%
1153
 
3.9%
1 1085
 
3.7%
1078
 
3.7%
1010
 
3.4%
1003
 
3.4%
983
 
3.3%
978
 
3.3%
977
 
3.3%
Other values (110) 12661
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16033
54.6%
Space Separator 7112
24.2%
Decimal Number 5619
 
19.1%
Dash Punctuation 309
 
1.1%
Close Punctuation 154
 
0.5%
Open Punctuation 154
 
0.5%
Other Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1345
 
8.4%
1153
 
7.2%
1078
 
6.7%
1010
 
6.3%
1003
 
6.3%
983
 
6.1%
978
 
6.1%
977
 
6.1%
977
 
6.1%
977
 
6.1%
Other values (95) 5552
34.6%
Decimal Number
ValueCountFrequency (%)
1 1085
19.3%
2 735
13.1%
4 621
11.1%
3 601
10.7%
5 565
10.1%
8 432
 
7.7%
7 412
 
7.3%
6 409
 
7.3%
9 402
 
7.2%
0 357
 
6.4%
Space Separator
ValueCountFrequency (%)
7112
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 309
100.0%
Close Punctuation
ValueCountFrequency (%)
) 154
100.0%
Open Punctuation
ValueCountFrequency (%)
( 154
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16033
54.6%
Common 13352
45.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1345
 
8.4%
1153
 
7.2%
1078
 
6.7%
1010
 
6.3%
1003
 
6.3%
983
 
6.1%
978
 
6.1%
977
 
6.1%
977
 
6.1%
977
 
6.1%
Other values (95) 5552
34.6%
Common
ValueCountFrequency (%)
7112
53.3%
1 1085
 
8.1%
2 735
 
5.5%
4 621
 
4.7%
3 601
 
4.5%
5 565
 
4.2%
8 432
 
3.2%
7 412
 
3.1%
6 409
 
3.1%
9 402
 
3.0%
Other values (5) 978
 
7.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16033
54.6%
ASCII 13352
45.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7112
53.3%
1 1085
 
8.1%
2 735
 
5.5%
4 621
 
4.7%
3 601
 
4.5%
5 565
 
4.2%
8 432
 
3.2%
7 412
 
3.1%
6 409
 
3.1%
9 402
 
3.0%
Other values (5) 978
 
7.3%
Hangul
ValueCountFrequency (%)
1345
 
8.4%
1153
 
7.2%
1078
 
6.7%
1010
 
6.3%
1003
 
6.3%
983
 
6.1%
978
 
6.1%
977
 
6.1%
977
 
6.1%
977
 
6.1%
Other values (95) 5552
34.6%

사육두수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct239
Distinct (%)24.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8238.2333
Minimum0
Maximum504000
Zeros46
Zeros (%)4.7%
Negative0
Negative (%)0.0%
Memory size8.8 KiB
2023-12-12T08:51:44.498265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q110
median32
Q3150
95-th percentile65000
Maximum504000
Range504000
Interquartile range (IQR)140

Descriptive statistics

Standard deviation27468.521
Coefficient of variation (CV)3.3342733
Kurtosis111.79202
Mean8238.2333
Median Absolute Deviation (MAD)28
Skewness7.793444
Sum8122898
Variance7.5451966 × 108
MonotonicityNot monotonic
2023-12-12T08:51:44.607372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 46
 
4.7%
10 44
 
4.5%
20 37
 
3.8%
5 34
 
3.4%
4 30
 
3.0%
6 25
 
2.5%
30 24
 
2.4%
100 24
 
2.4%
3 24
 
2.4%
50 20
 
2.0%
Other values (229) 678
68.8%
ValueCountFrequency (%)
0 46
4.7%
1 3
 
0.3%
2 15
 
1.5%
3 24
2.4%
4 30
3.0%
5 34
3.4%
6 25
2.5%
7 14
 
1.4%
8 14
 
1.4%
9 18
 
1.8%
ValueCountFrequency (%)
504000 1
 
0.1%
151678 1
 
0.1%
140000 2
0.2%
135233 1
 
0.1%
130000 1
 
0.1%
120000 2
0.2%
112800 1
 
0.1%
110000 1
 
0.1%
100000 4
0.4%
98000 2
0.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.8 KiB
2023-12-05
986 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-12-05
2nd row2023-12-05
3rd row2023-12-05
4th row2023-12-05
5th row2023-12-05

Common Values

ValueCountFrequency (%)
2023-12-05 986
100.0%

Length

2023-12-12T08:51:44.702089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:51:44.768897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-12-05 986
100.0%

Interactions

2023-12-12T08:51:42.284437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:51:44.811361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축산업종류주사육업종사육두수
축산업종류1.0000.8760.041
주사육업종0.8761.0000.845
사육두수0.0410.8451.000
2023-12-12T08:51:44.877375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축산업종류주사육업종
축산업종류1.0000.841
주사육업종0.8411.000
2023-12-12T08:51:44.940903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육두수축산업종류주사육업종
사육두수1.0000.0500.526
축산업종류0.0501.0000.841
주사육업종0.5260.8411.000

Missing values

2023-12-12T08:51:42.390503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:51:42.499217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

축산업종류사업장명칭주사육업종사업장소재지사육두수데이터기준일자
0가축사육업서율농장한우전라북도 부안군 백산면 하청리 245번지 외 3필지(243-1 246-2 246-10)872023-12-05
1가축사육업감교농장한우전라북도 부안군 상서면 감교리 811번지 19호 811-20 811-52.주산면 사산리1178-7 1178-8 1178-307002023-12-05
2가축사육업월제농장한우전라북도 부안군 하서면 백련리 567번지 6호252023-12-05
3가축사육업길호농장한우전라북도 부안군 하서면 백련리 470번지302023-12-05
4가축사육업광열농장한우전라북도 부안군 보안면 유천리 439번지152023-12-05
5가축사육업종문농장한우전라북도 부안군 동진면 안성리 1298번지1242023-12-05
6가축사육업조원제농장한우전라북도 부안군 하서면 백련리 566번지 35호152023-12-05
7가축사육업윤재필농장한우전라북도 부안군 하서면 장신리 207번지622023-12-05
8가축사육업이윤기한우전라북도 부안군 보안면 월천리 451번지1002023-12-05
9가축사육업계화누렁이 농장한우전라북도 부안군 계화면 창북리 4134번지 3호702023-12-05
축산업종류사업장명칭주사육업종사업장소재지사육두수데이터기준일자
976종축업신흥농장전라북도 부안군 부안읍 신흥리 279번지 22호 외 10필220002023-12-05
977종축업갑도농장전라북도 부안군 진서면 진서리 63번지 1호519002023-12-05
978종축업서원농장전라북도 부안군 보안면 상림리 219번지 1호 외 3필(219-2 219-3 532)290002023-12-05
979종축업가농보안농장전라북도 부안군 보안면 월천리 1011번지 외 2필1100002023-12-05
980종축업평교 종계장전라북도 부안군 부안읍 신흥리 산 82번지 1호140002023-12-05
981종축업다산2농장오리전라북도 부안군 진서면 진서리 1273번지 5호55002023-12-05
982종축업다산1농장오리전라북도 부안군 진서면 진서리 1273번지 6호55002023-12-05
983종축업증산농장전라북도 부안군 동진면 증산리 85번지 1호 증산농장310002023-12-05
984종축업진서1농장오리전라북도 부안군 진서면 운호리 238번지 11호 진서종계장108002023-12-05
985종축업진서2농장오리전라북도 부안군 진서면 운호리 238번지 11호 진서종계장78002023-12-05