Overview

Dataset statistics

Number of variables: 4
Number of observations: 474
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 15.4 KiB
Average record size in memory: 33.3 B
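
The average record size appears to be the total in-memory size divided by the number of observations. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 B (the function name is hypothetical, not part of the profiling tool):

```python
# Values copied from the dataset statistics in this report.
TOTAL_SIZE_KIB = 15.4
N_OBSERVATIONS = 474

def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Convert KiB to bytes (assuming 1 KiB = 1024 B) and divide by the row count."""
    return total_kib * 1024 / n_rows

avg = average_record_size_bytes(TOTAL_SIZE_KIB, N_OBSERVATIONS)
print(f"{avg:.1f} B")  # 33.3 B
```

The result agrees with the reported 33.3 B after rounding to one decimal place.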

Variable types

Text: 2
Categorical: 1
Numeric: 1

Dataset

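Description: Data on the status of livestock farming businesses managed by Gunwi-gun, Gyeongsangbuk-do (경상북도 군위군). It provides four fields: 농장명 (farm name), 축종 (livestock type), 지번주소 (lot-number address), and 사육두수 (head count).
Author: 경상북도 군위군 (Gunwi-gun, Gyeongsangbuk-do)
URL: https://www.data.go.kr/data/15034432/fileData.do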

Alerts

Imbalance: 축종 is highly imbalanced (65.7%)
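
The 65.7% figure can be reproduced from the 축종 category counts listed under Common Values, assuming the score is one minus the Shannon entropy normalised by the logarithm of the number of classes. That formula is an assumption that happens to match the reported value; the function below is an illustrative sketch, not the profiler's own code.

```python
import math

# Counts per 축종 category, copied from the Common Values table in this report.
COUNTS = [389, 42, 13, 8, 8, 7, 3, 2, 2]

def imbalance_score(counts):
    """Return 1 - H / log2(k): H is Shannon entropy in bits, k the class count.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k <= 1:
        return 0.0  # assumption: a single class is reported as not imbalanced
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

score = imbalance_score(COUNTS)
print(f"{score:.1%}")
```

With 389 of 474 rows in a single class (한우), the entropy is low relative to the nine-class maximum, which is what pushes the score up.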

Reproduction

Analysis started: 2024-03-11 03:19:28.954219
Analysis finished: 2024-03-11 03:19:30.356576
Duration: 1.4 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

농장명
Text

Distinct: 435
Distinct (%): 91.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

Length

Max length: 17
Median length: 4
Mean length: 4.2763713
Min length: 3

Characters and Unicode

Total characters: 2027
Distinct characters: 257
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 403
Unique (%): 85.0%
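
The Distinct and Unique counts for this variable differ (435 versus 403). A plausible reading, taken here as an assumption, is that Distinct counts the different values present while Unique counts the values that occur exactly once. The sketch below illustrates that reading on hypothetical farm names; the helper function is not part of the profiling tool.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values occurring exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Hypothetical names, chosen only to illustrate the two counts.
names = ["원산농장", "원산농장", "지호농장", "구일농장", "목우농장"]
print(distinct_and_unique(names))  # (4, 3)
```

Under this reading, the gap of 32 between the two figures would be the number of farm names shared by more than one row.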

Sample

1st row: 구일농장
2nd row: 목우농장
3rd row: 율림농장
4th row: 흥진농장
5th row: 농업회사법인유한회사둥지농장
Value | Count | Frequency (%)
원산농장 | 4 | 0.8%
지호농장 | 4 | 0.8%
매성농장 | 3 | 0.6%
대성농장 | 3 | 0.6%
십리골농장 | 3 | 0.6%
주식회사 | 3 | 0.6%
이화농장 | 2 | 0.4%
팔공농장 | 2 | 0.4%
형제농장 | 2 | 0.4%
석이농장 | 2 | 0.4%
Other values (430) | 455 | 94.2%

Most occurring characters

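Value | Count | Frequency (%)
(not preserved) | 439 | 21.7%
(not preserved) | 428 | 21.1%
(not preserved) | 46 | 2.3%
(not preserved) | 33 | 1.6%
(not preserved) | 32 | 1.6%
(not preserved) | 29 | 1.4%
(not preserved) | 27 | 1.3%
(not preserved) | 27 | 1.3%
(not preserved) | 24 | 1.2%
(not preserved) | 21 | 1.0%
Other values (247) | 921 | 45.4%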

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1989 | 98.1%
Decimal Number | 19 | 0.9%
Space Separator | 9 | 0.4%
Uppercase Letter | 6 | 0.3%
Close Punctuation | 2 | 0.1%
Open Punctuation | 2 | 0.1%

Most frequent character per category

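Other Letter
Value | Count | Frequency (%)
(not preserved) | 439 | 22.1%
(not preserved) | 428 | 21.5%
(not preserved) | 46 | 2.3%
(not preserved) | 33 | 1.7%
(not preserved) | 32 | 1.6%
(not preserved) | 29 | 1.5%
(not preserved) | 27 | 1.4%
(not preserved) | 27 | 1.4%
(not preserved) | 24 | 1.2%
(not preserved) | 21 | 1.1%
Other values (233) | 883 | 44.4%

Uppercase Letter
Value | Count | Frequency (%)
K | 1 | 16.7%
O | 1 | 16.7%
M | 1 | 16.7%
J | 1 | 16.7%
B | 1 | 16.7%
L | 1 | 16.7%

Decimal Number
Value | Count | Frequency (%)
2 | 13 | 68.4%
3 | 2 | 10.5%
7 | 2 | 10.5%
4 | 1 | 5.3%
1 | 1 | 5.3%

Space Separator
Value | Count | Frequency (%)
(space) | 9 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 2 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 2 | 100.0%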

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1989 | 98.1%
Common | 32 | 1.6%
Latin | 6 | 0.3%

Most frequent character per script

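Hangul
Value | Count | Frequency (%)
(not preserved) | 439 | 22.1%
(not preserved) | 428 | 21.5%
(not preserved) | 46 | 2.3%
(not preserved) | 33 | 1.7%
(not preserved) | 32 | 1.6%
(not preserved) | 29 | 1.5%
(not preserved) | 27 | 1.4%
(not preserved) | 27 | 1.4%
(not preserved) | 24 | 1.2%
(not preserved) | 21 | 1.1%
Other values (233) | 883 | 44.4%

Common
Value | Count | Frequency (%)
2 | 13 | 40.6%
(space) | 9 | 28.1%
3 | 2 | 6.2%
7 | 2 | 6.2%
) | 2 | 6.2%
( | 2 | 6.2%
4 | 1 | 3.1%
1 | 1 | 3.1%

Latin
Value | Count | Frequency (%)
K | 1 | 16.7%
O | 1 | 16.7%
M | 1 | 16.7%
J | 1 | 16.7%
B | 1 | 16.7%
L | 1 | 16.7%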

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1989 | 98.1%
ASCII | 38 | 1.9%

Most frequent character per block

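Hangul
Value | Count | Frequency (%)
(not preserved) | 439 | 22.1%
(not preserved) | 428 | 21.5%
(not preserved) | 46 | 2.3%
(not preserved) | 33 | 1.7%
(not preserved) | 32 | 1.6%
(not preserved) | 29 | 1.5%
(not preserved) | 27 | 1.4%
(not preserved) | 27 | 1.4%
(not preserved) | 24 | 1.2%
(not preserved) | 21 | 1.1%
Other values (233) | 883 | 44.4%

ASCII
Value | Count | Frequency (%)
2 | 13 | 34.2%
(space) | 9 | 23.7%
3 | 2 | 5.3%
7 | 2 | 5.3%
) | 2 | 5.3%
( | 2 | 5.3%
K | 1 | 2.6%
O | 1 | 2.6%
4 | 1 | 2.6%
M | 1 | 2.6%
Other values (4) | 4 | 10.5%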

축종
Categorical

IMBALANCE 

Distinct: 9
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB
Value | Count
한우 | 389
돼지 | 42
육계 | 13
젖소 | 8
종계/산란계 | 8
Other values (4) | 14

Length

Max length: 6
Median length: 2
Mean length: 2.0864979
Min length: 2
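
These length statistics follow from the category counts listed under Common Values: the mean is a count-weighted average of the label lengths. A short sketch of that calculation, assuming length is measured in Unicode code points (which is what Python's `len` returns for these precomposed Hangul strings):

```python
# Category counts for 축종, copied from the Common Values table in this report.
COUNTS = {
    "한우": 389, "돼지": 42, "육계": 13, "젖소": 8, "종계/산란계": 8,
    "육우": 7, "산란육성계": 3, "사슴": 2, "산양": 2,
}

def length_stats(counts):
    """Return (min length, max length, count-weighted mean length)."""
    total_rows = sum(counts.values())
    total_chars = sum(len(value) * n for value, n in counts.items())
    lengths = [len(value) for value in counts]
    return min(lengths), max(lengths), total_chars / total_rows

shortest, longest, mean_length = length_stats(COUNTS)
print(shortest, longest, round(mean_length, 7))  # 2 6 2.0864979
```

Only 종계/산란계 (6 characters) and 산란육성계 (5 characters) are longer than two characters, which is why the mean sits just above 2.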

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 한우
2nd row: 한우
3rd row: 돼지
4th row: 돼지
5th row: 돼지

Common Values

Value | Count | Frequency (%)
한우 | 389 | 82.1%
돼지 | 42 | 8.9%
육계 | 13 | 2.7%
젖소 | 8 | 1.7%
종계/산란계 | 8 | 1.7%
육우 | 7 | 1.5%
산란육성계 | 3 | 0.6%
사슴 | 2 | 0.4%
산양 | 2 | 0.4%

Length

Histogram of lengths of the category

Common Values (Plot)

지번주소
Text

Distinct: 473
Distinct (%): 99.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

Length

Max length: 30
Median length: 29
Mean length: 24.291139
Min length: 18

Characters and Unicode

Total characters: 11514
Distinct characters: 115
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
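
As an illustration of those character properties, the sketch below uses Python's standard `unicodedata` module to count general categories in the first sample address listed for this variable. This is a standalone example and not necessarily how the profiling tool computes its figures. The codes map to the category names used in this report: `Lo` is Other Letter, `Zs` is Space Separator, `Nd` is Decimal Number, and `Pd` is Dash Punctuation.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category code (e.g. 'Lo', 'Nd')."""
    return Counter(unicodedata.category(ch) for ch in text)

# First sample row of 지번주소, as listed in this report.
address = "경상북도 군위군 소보면 신계리 121-3번지"
counts = category_counts(address)
for code, n in counts.most_common():
    print(code, n)
```

The four categories found in this one address are the same four reported for the whole column.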

Unique

Unique: 472
Unique (%): 99.6%

Sample

1st row: 경상북도 군위군 소보면 신계리 121-3번지
2nd row: 경상북도 군위군 효령면 장군리 1051-2번지
3rd row: 경상북도 군위군 효령면 금매리 995-23번지
4th row: 경상북도 군위군 효령면 금매리 1447-96번지
5th row: 경상북도 군위군 효령면 금매리 408-12번지
Value | Count | Frequency (%)
경상북도 | 474 | 20.0%
군위군 | 474 | 20.0%
효령면 | 135 | 5.7%
의흥면 | 94 | 4.0%
군위읍 | 82 | 3.5%
소보면 | 70 | 3.0%
우보면 | 49 | 2.1%
장군리 | 21 | 0.9%
금매리 | 21 | 0.9%
부계면 | 19 | 0.8%
Other values (548) | 931 | 39.3%
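
The word counts above look like frequencies of whitespace-separated tokens: 경상북도 appears 474 times, once per row, and accounts for 20.0% of all tokens. A minimal sketch under that assumption, applied to the five sample rows listed for this variable (the helper name is hypothetical):

```python
from collections import Counter

# The five sample rows listed for 지번주소 in this report.
addresses = [
    "경상북도 군위군 소보면 신계리 121-3번지",
    "경상북도 군위군 효령면 장군리 1051-2번지",
    "경상북도 군위군 효령면 금매리 995-23번지",
    "경상북도 군위군 효령면 금매리 1447-96번지",
    "경상북도 군위군 효령면 금매리 408-12번지",
]

def token_frequencies(rows):
    """Split each row on whitespace and return (token, count, share) by descending count."""
    counts = Counter(token for row in rows for token in row.split())
    total = sum(counts.values())
    return [(token, n, n / total) for token, n in counts.most_common()]

for token, n, share in token_frequencies(addresses)[:4]:
    print(f"{token} | {n} | {share:.1%}")
```

Each sample address splits into five tokens, so a token present in every row takes a 20% share, consistent with the full-column table.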

Most occurring characters

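Value | Count | Frequency (%)
(space) | 2598 | 22.6%
(not preserved) | 1052 | 9.1%
(not preserved) | 561 | 4.9%
(not preserved) | 495 | 4.3%
(not preserved) | 485 | 4.2%
(not preserved) | 478 | 4.2%
(not preserved) | 475 | 4.1%
(not preserved) | 457 | 4.0%
(not preserved) | 435 | 3.8%
(not preserved) | 428 | 3.7%
Other values (105) | 4050 | 35.2%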

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 7044 | 61.2%
Space Separator | 2598 | 22.6%
Decimal Number | 1664 | 14.5%
Dash Punctuation | 208 | 1.8%

Most frequent character per category

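Other Letter
Value | Count | Frequency (%)
(not preserved) | 1052 | 14.9%
(not preserved) | 561 | 8.0%
(not preserved) | 495 | 7.0%
(not preserved) | 485 | 6.9%
(not preserved) | 478 | 6.8%
(not preserved) | 475 | 6.7%
(not preserved) | 457 | 6.5%
(not preserved) | 435 | 6.2%
(not preserved) | 428 | 6.1%
(not preserved) | 392 | 5.6%
Other values (93) | 1786 | 25.4%

Decimal Number
Value | Count | Frequency (%)
1 | 319 | 19.2%
2 | 179 | 10.8%
3 | 165 | 9.9%
4 | 158 | 9.5%
6 | 153 | 9.2%
8 | 147 | 8.8%
5 | 147 | 8.8%
7 | 144 | 8.7%
0 | 139 | 8.4%
9 | 113 | 6.8%

Space Separator
Value | Count | Frequency (%)
(space) | 2598 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 208 | 100.0%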

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 7044 | 61.2%
Common | 4470 | 38.8%

Most frequent character per script

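Hangul
Value | Count | Frequency (%)
(not preserved) | 1052 | 14.9%
(not preserved) | 561 | 8.0%
(not preserved) | 495 | 7.0%
(not preserved) | 485 | 6.9%
(not preserved) | 478 | 6.8%
(not preserved) | 475 | 6.7%
(not preserved) | 457 | 6.5%
(not preserved) | 435 | 6.2%
(not preserved) | 428 | 6.1%
(not preserved) | 392 | 5.6%
Other values (93) | 1786 | 25.4%

Common
Value | Count | Frequency (%)
(space) | 2598 | 58.1%
1 | 319 | 7.1%
- | 208 | 4.7%
2 | 179 | 4.0%
3 | 165 | 3.7%
4 | 158 | 3.5%
6 | 153 | 3.4%
8 | 147 | 3.3%
5 | 147 | 3.3%
7 | 144 | 3.2%
Other values (2) | 252 | 5.6%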

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 7044 | 61.2%
ASCII | 4470 | 38.8%

Most frequent character per block

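ASCII
Value | Count | Frequency (%)
(space) | 2598 | 58.1%
1 | 319 | 7.1%
- | 208 | 4.7%
2 | 179 | 4.0%
3 | 165 | 3.7%
4 | 158 | 3.5%
6 | 153 | 3.4%
8 | 147 | 3.3%
5 | 147 | 3.3%
7 | 144 | 3.2%
Other values (2) | 252 | 5.6%

Hangul
Value | Count | Frequency (%)
(not preserved) | 1052 | 14.9%
(not preserved) | 561 | 8.0%
(not preserved) | 495 | 7.0%
(not preserved) | 485 | 6.9%
(not preserved) | 478 | 6.8%
(not preserved) | 475 | 6.7%
(not preserved) | 457 | 6.5%
(not preserved) | 435 | 6.2%
(not preserved) | 428 | 6.1%
(not preserved) | 392 | 5.6%
Other values (93) | 1786 | 25.4%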

사육두수
Real number (ℝ)

Distinct: 167
Distinct (%): 35.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2757.2848
Minimum: 1
Maximum: 206020
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 1
5th percentile: 2
Q1: 7
Median: 22
Q3: 74
95th percentile: 5673.15
Maximum: 206020
Range: 206019
Interquartile range (IQR): 67

Descriptive statistics

Standard deviation: 14644.213
Coefficient of variation (CV): 5.3110993
Kurtosis: 93.919203
Mean: 2757.2848
Median Absolute Deviation (MAD): 18
Skewness: 8.5629546
Sum: 1306953
Variance: 2.1445299 × 10^8
Monotonicity: Not monotonic
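
Several of these figures are tied together by standard definitions: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. The sketch below checks those relationships using only numbers printed in this report; small differences in the last digits are expected because the inputs are already rounded.

```python
# Figures copied from the 사육두수 statistics in this report.
N_OBSERVATIONS = 474
TOTAL = 1306953
STD = 14644.213

mean = TOTAL / N_OBSERVATIONS   # arithmetic mean = sum / count
cv = STD / mean                 # coefficient of variation = std / mean
variance = STD ** 2             # variance = std squared

print(f"mean={mean:.4f} cv={cv:.4f} variance={variance:.4e}")
```

A coefficient of variation above 5, together with a median of 22 and a mean near 2757, points to a small number of very large values dominating the distribution.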
Histogram with fixed size bins (bins=50)
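
To see what 50 fixed-size bins mean for data this skewed, the sketch below assumes equal-width bins spanning the reported minimum and maximum, with the last bin closed on the right (the convention `numpy.histogram` uses by default). Whether the report's plot uses exactly this range is an assumption, and the helper is hypothetical.

```python
def bin_index(value, lo, hi, bins):
    """Index of the equal-width bin containing value; the last bin includes hi."""
    if not lo <= value <= hi:
        raise ValueError("value outside histogram range")
    width = (hi - lo) / bins
    return min(int((value - lo) / width), bins - 1)

# Minimum, maximum, and bin count taken from this report.
LO, HI, BINS = 1, 206020, 50
print("bin width:", (HI - LO) / BINS)

# Median, Q3, 95th percentile, and maximum from the quantile statistics.
for v in (22, 74, 5673.15, 206020):
    print(v, "-> bin", bin_index(v, LO, HI, BINS))
```

Under these assumptions each bin is roughly 4120 head wide, so the median and Q3 both fall in the first bin and even the 95th percentile lands in the second.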
Most frequent values
Value | Count | Frequency (%)
2 | 30 | 6.3%
4 | 23 | 4.9%
7 | 21 | 4.4%
5 | 17 | 3.6%
6 | 16 | 3.4%
8 | 15 | 3.2%
24 | 13 | 2.7%
15 | 11 | 2.3%
3 | 11 | 2.3%
12 | 10 | 2.1%
Other values (157) | 307 | 64.8%

Smallest values
Value | Count | Frequency (%)
1 | 10 | 2.1%
2 | 30 | 6.3%
3 | 11 | 2.3%
4 | 23 | 4.9%
5 | 17 | 3.6%
6 | 16 | 3.4%
7 | 21 | 4.4%
8 | 15 | 3.2%
9 | 9 | 1.9%
10 | 7 | 1.5%

Largest values
Value | Count | Frequency (%)
206020 | 1 | 0.2%
120000 | 1 | 0.2%
100000 | 1 | 0.2%
78000 | 1 | 0.2%
62000 | 1 | 0.2%
60000 | 1 | 0.2%
57000 | 1 | 0.2%
51600 | 1 | 0.2%
51000 | 1 | 0.2%
50000 | 2 | 0.4%

Interactions


Correlations

Variable | 축종 | 사육두수
축종 | 1.000 | 0.728
사육두수 | 0.728 | 1.000

Variable | 사육두수 | 축종
사육두수 | 1.000 | 0.498
축종 | 0.498 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 농장명 | 축종 | 지번주소 | 사육두수
0 | 구일농장 | 한우 | 경상북도 군위군 소보면 신계리 121-3번지 | 9
1 | 목우농장 | 한우 | 경상북도 군위군 효령면 장군리 1051-2번지 | 95
2 | 율림농장 | 돼지 | 경상북도 군위군 효령면 금매리 995-23번지 | 1057
3 | 흥진농장 | 돼지 | 경상북도 군위군 효령면 금매리 1447-96번지 | 2695
4 | 농업회사법인유한회사둥지농장 | 돼지 | 경상북도 군위군 효령면 금매리 408-12번지 | 945
5 | 용대목장 | 한우 | 경상북도 군위군 군위읍 용대리 산25-1번지 | 318
6 | 봉실농장 | 한우 | 경상북도 군위군 군위읍 외량리 170번지 | 197
7 | 칠사사농장 | 한우 | 경상북도 군위군 군위읍 하곡리 600번지 | 4
8 | 영우농장 | 한우 | 경상북도 군위군 효령면 고곡리 598-1번지 | 12
9 | 새들농장 | 돼지 | 경상북도 군위군 효령면 금매리 1447-48번지 | 2493
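
Index | 농장명 | 축종 | 지번주소 | 사육두수
464 | 부강농장 | 돼지 | 경상북도 군위군 부계면 창평리 164번지 | 900
465 | 진호농장 | 한우 | 경상북도 군위군 부계면 창평리 1565-8번지 | 57
466 | 대동목장 | 육계 | 경상북도 군위군 부계면 남산리 146번지 | 70
467 | 대천농장 | 한우 | 경상북도 군위군 부계면 신화1길 4-2640 (지번주소 and 사육두수 are fused in the source; the boundary is unclear)
468 | 연암축산 | 한우 | 경상북도 군위군 부계면 신화리 349번지 | 101
469 | 산너머숲속에 | 돼지 | 경상북도 군위군 부계면 창평리 146번지 | 2100
470 | 원진농장 | 한우 | 경상북도 군위군 부계면 신화리 1098-1번지 | 3
471 | 절골농장 | 한우 | 경상북도 군위군 부계면 명산리 536번지 | 37
472 | 신화농장 | 한우 | 경상북도 군위군 부계면 신화리 1096번지 | 15
473 | 덕성농장 | 한우 | 경상북도 군위군 부계면 가호리 1248-9번지 | 19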