Overview

Dataset statistics

Number of variables5
Number of observations1552
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory63.8 KiB
Average record size in memory42.1 B

Variable types

Numeric2
Text2
Categorical1

Dataset

Description경상북도 구미시 관내에 등록된 축산농가 현황 데이터로 사육장 명칭, 주사육 업종, 사업장소재지, 사육두수 데이터를 제공합니다.
Author경상북도 구미시
URLhttps://www.data.go.kr/data/15034289/fileData.do

Alerts

주사육업종 is highly imbalanced (83.2%)Imbalance
연번 has unique valuesUnique
사육두수 has 84 (5.4%) zerosZeros

Reproduction

Analysis started2024-03-14 12:59:06.676150
Analysis finished2024-03-14 12:59:08.860120
Duration2.18 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1552
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean776.5
Minimum1
Maximum1552
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.8 KiB
2024-03-14T21:59:09.068467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile78.55
Q1388.75
median776.5
Q31164.25
95-th percentile1474.45
Maximum1552
Range1551
Interquartile range (IQR)775.5

Descriptive statistics

Standard deviation448.16812
Coefficient of variation (CV)0.57716436
Kurtosis-1.2
Mean776.5
Median Absolute Deviation (MAD)388
Skewness0
Sum1205128
Variance200854.67
MonotonicityStrictly increasing
2024-03-14T21:59:09.521999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1021 1
 
0.1%
1043 1
 
0.1%
1042 1
 
0.1%
1041 1
 
0.1%
1040 1
 
0.1%
1039 1
 
0.1%
1038 1
 
0.1%
1037 1
 
0.1%
1036 1
 
0.1%
Other values (1542) 1542
99.4%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1552 1
0.1%
1551 1
0.1%
1550 1
0.1%
1549 1
0.1%
1548 1
0.1%
1547 1
0.1%
1546 1
0.1%
1545 1
0.1%
1544 1
0.1%
1543 1
0.1%
Distinct1393
Distinct (%)89.8%
Missing0
Missing (%)0.0%
Memory size12.2 KiB
2024-03-14T21:59:10.492970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length4.8260309
Min length2

Characters and Unicode

Total characters7490
Distinct characters396
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1277 ?
Unique (%)82.3%

Sample

1st row천혜농장
2nd row현대농장
3rd row강준농장
4th row실로암
5th row초일농장
ValueCountFrequency (%)
농장 434
 
21.3%
대성농장 8
 
0.4%
2농장 8
 
0.4%
목장 6
 
0.3%
우리농장 4
 
0.2%
현대농장 4
 
0.2%
송곡농장 4
 
0.2%
우림농장 4
 
0.2%
신우농장 4
 
0.2%
형제농장 4
 
0.2%
Other values (1399) 1561
76.5%
2024-03-14T21:59:11.886236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1488
19.9%
1415
18.9%
489
 
6.5%
125
 
1.7%
117
 
1.6%
105
 
1.4%
98
 
1.3%
97
 
1.3%
78
 
1.0%
2 77
 
1.0%
Other values (386) 3401
45.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6848
91.4%
Space Separator 489
 
6.5%
Decimal Number 125
 
1.7%
Uppercase Letter 17
 
0.2%
Open Punctuation 3
 
< 0.1%
Close Punctuation 3
 
< 0.1%
Other Punctuation 2
 
< 0.1%
Lowercase Letter 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1488
21.7%
1415
20.7%
125
 
1.8%
117
 
1.7%
105
 
1.5%
98
 
1.4%
97
 
1.4%
78
 
1.1%
68
 
1.0%
64
 
0.9%
Other values (360) 3193
46.6%
Uppercase Letter
ValueCountFrequency (%)
H 4
23.5%
M 2
11.8%
A 2
11.8%
S 2
11.8%
G 1
 
5.9%
B 1
 
5.9%
J 1
 
5.9%
R 1
 
5.9%
F 1
 
5.9%
C 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
2 77
61.6%
1 26
 
20.8%
3 13
 
10.4%
4 5
 
4.0%
6 1
 
0.8%
8 1
 
0.8%
7 1
 
0.8%
5 1
 
0.8%
Lowercase Letter
ValueCountFrequency (%)
k 1
50.0%
o 1
50.0%
Space Separator
ValueCountFrequency (%)
489
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6848
91.4%
Common 623
 
8.3%
Latin 19
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1488
21.7%
1415
20.7%
125
 
1.8%
117
 
1.7%
105
 
1.5%
98
 
1.4%
97
 
1.4%
78
 
1.1%
68
 
1.0%
64
 
0.9%
Other values (360) 3193
46.6%
Common
ValueCountFrequency (%)
489
78.5%
2 77
 
12.4%
1 26
 
4.2%
3 13
 
2.1%
4 5
 
0.8%
( 3
 
0.5%
) 3
 
0.5%
. 2
 
0.3%
6 1
 
0.2%
8 1
 
0.2%
Other values (3) 3
 
0.5%
Latin
ValueCountFrequency (%)
H 4
21.1%
M 2
10.5%
A 2
10.5%
S 2
10.5%
k 1
 
5.3%
o 1
 
5.3%
G 1
 
5.3%
B 1
 
5.3%
J 1
 
5.3%
R 1
 
5.3%
Other values (3) 3
15.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6848
91.4%
ASCII 642
 
8.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1488
21.7%
1415
20.7%
125
 
1.8%
117
 
1.7%
105
 
1.5%
98
 
1.4%
97
 
1.4%
78
 
1.1%
68
 
1.0%
64
 
0.9%
Other values (360) 3193
46.6%
ASCII
ValueCountFrequency (%)
489
76.2%
2 77
 
12.0%
1 26
 
4.0%
3 13
 
2.0%
4 5
 
0.8%
H 4
 
0.6%
( 3
 
0.5%
) 3
 
0.5%
M 2
 
0.3%
. 2
 
0.3%
Other values (16) 18
 
2.8%

주사육업종
Categorical

IMBALANCE 

Distinct10
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size12.2 KiB
한우
1439 
육우
 
33
젖소
 
30
돼지
 
20
육계
 
15
Other values (5)
 
15

Length

Max length6
Median length2
Mean length2.0045103
Min length1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row한우
2nd row젖소
3rd row한우
4th row돼지
5th row젖소

Common Values

ValueCountFrequency (%)
한우 1439
92.7%
육우 33
 
2.1%
젖소 30
 
1.9%
돼지 20
 
1.3%
육계 15
 
1.0%
산양 8
 
0.5%
종계/산란계 2
 
0.1%
사슴 2
 
0.1%
염소 2
 
0.1%
1
 
0.1%

Length

2024-03-14T21:59:12.313334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T21:59:12.672212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한우 1439
92.7%
육우 33
 
2.1%
젖소 30
 
1.9%
돼지 20
 
1.3%
육계 15
 
1.0%
산양 8
 
0.5%
종계/산란계 2
 
0.1%
사슴 2
 
0.1%
염소 2
 
0.1%
1
 
0.1%
Distinct1538
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size12.2 KiB
2024-03-14T21:59:13.926996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length55
Mean length25.529639
Min length18

Characters and Unicode

Total characters39622
Distinct characters126
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1524 ?
Unique (%)98.2%

Sample

1st row경상북도 구미시 옥성면 초곡리 1080번지 32호
2nd row경상북도 구미시 옥성면 주아리 911번지
3rd row경상북도 구미시 고아읍 괴평리 1112번지 405호 외 3필지
4th row경상북도 구미시 도개면 다곡리 162번지
5th row경상북도 구미시 옥성면 초곡리 243번지
ValueCountFrequency (%)
경상북도 1552
18.3%
구미시 1552
18.3%
해평면 337
 
4.0%
선산읍 329
 
3.9%
고아읍 281
 
3.3%
1호 262
 
3.1%
도개면 183
 
2.2%
옥성면 135
 
1.6%
산동읍 120
 
1.4%
2호 111
 
1.3%
Other values (1090) 3615
42.6%
2024-03-14T21:59:15.820297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9992
25.2%
1768
 
4.5%
1616
 
4.1%
1582
 
4.0%
1569
 
4.0%
1558
 
3.9%
1555
 
3.9%
1552
 
3.9%
1552
 
3.9%
1552
 
3.9%
Other values (116) 15326
38.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23866
60.2%
Space Separator 9992
25.2%
Decimal Number 5699
 
14.4%
Other Punctuation 37
 
0.1%
Dash Punctuation 25
 
0.1%
Uppercase Letter 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1768
 
7.4%
1616
 
6.8%
1582
 
6.6%
1569
 
6.6%
1558
 
6.5%
1555
 
6.5%
1552
 
6.5%
1552
 
6.5%
1552
 
6.5%
1540
 
6.5%
Other values (100) 8022
33.6%
Decimal Number
ValueCountFrequency (%)
1 1147
20.1%
2 710
12.5%
4 591
10.4%
3 555
9.7%
5 514
9.0%
6 475
8.3%
7 442
 
7.8%
0 427
 
7.5%
8 422
 
7.4%
9 416
 
7.3%
Space Separator
ValueCountFrequency (%)
9992
100.0%
Other Punctuation
ValueCountFrequency (%)
, 37
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%
Uppercase Letter
ValueCountFrequency (%)
K 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 23866
60.2%
Common 15755
39.8%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1768
 
7.4%
1616
 
6.8%
1582
 
6.6%
1569
 
6.6%
1558
 
6.5%
1555
 
6.5%
1552
 
6.5%
1552
 
6.5%
1552
 
6.5%
1540
 
6.5%
Other values (100) 8022
33.6%
Common
ValueCountFrequency (%)
9992
63.4%
1 1147
 
7.3%
2 710
 
4.5%
4 591
 
3.8%
3 555
 
3.5%
5 514
 
3.3%
6 475
 
3.0%
7 442
 
2.8%
0 427
 
2.7%
8 422
 
2.7%
Other values (5) 480
 
3.0%
Latin
ValueCountFrequency (%)
K 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 23866
60.2%
ASCII 15756
39.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9992
63.4%
1 1147
 
7.3%
2 710
 
4.5%
4 591
 
3.8%
3 555
 
3.5%
5 514
 
3.3%
6 475
 
3.0%
7 442
 
2.8%
0 427
 
2.7%
8 422
 
2.7%
Other values (6) 481
 
3.1%
Hangul
ValueCountFrequency (%)
1768
 
7.4%
1616
 
6.8%
1582
 
6.6%
1569
 
6.6%
1558
 
6.5%
1555
 
6.5%
1552
 
6.5%
1552
 
6.5%
1552
 
6.5%
1540
 
6.5%
Other values (100) 8022
33.6%

사육두수
Real number (ℝ)

ZEROS 

Distinct155
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean308.72552
Minimum0
Maximum50000
Zeros84
Zeros (%)5.4%
Negative0
Negative (%)0.0%
Memory size13.8 KiB
2024-03-14T21:59:16.235980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15
median18
Q340
95-th percentile150
Maximum50000
Range50000
Interquartile range (IQR)35

Descriptive statistics

Standard deviation2827.8021
Coefficient of variation (CV)9.1595995
Kurtosis169.05123
Mean308.72552
Median Absolute Deviation (MAD)15
Skewness12.565971
Sum479142
Variance7996464.5
MonotonicityNot monotonic
2024-03-14T21:59:16.685267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 141
 
9.1%
0 84
 
5.4%
20 79
 
5.1%
30 61
 
3.9%
10 57
 
3.7%
4 54
 
3.5%
5 51
 
3.3%
50 51
 
3.3%
40 49
 
3.2%
3 45
 
2.9%
Other values (145) 880
56.7%
ValueCountFrequency (%)
0 84
5.4%
1 141
9.1%
2 35
 
2.3%
3 45
 
2.9%
4 54
 
3.5%
5 51
 
3.3%
6 43
 
2.8%
7 32
 
2.1%
8 35
 
2.3%
9 15
 
1.0%
ValueCountFrequency (%)
50000 1
0.1%
40000 2
0.1%
35000 2
0.1%
33000 1
0.1%
30000 1
0.1%
28000 1
0.1%
25000 1
0.1%
16000 1
0.1%
15000 2
0.1%
8000 1
0.1%

Interactions

2024-03-14T21:59:07.795206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T21:59:07.226721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T21:59:08.070675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T21:59:07.511856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T21:59:16.950566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주사육업종사육두수
연번1.0000.3440.089
주사육업종0.3441.0000.658
사육두수0.0890.6581.000
2024-03-14T21:59:17.191975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사육두수주사육업종
연번1.000-0.3580.112
사육두수-0.3581.0000.372
주사육업종0.1120.3721.000

Missing values

2024-03-14T21:59:08.413158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T21:59:08.729487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업장명칭주사육업종사업장소재지(지번)사육두수
01천혜농장한우경상북도 구미시 옥성면 초곡리 1080번지 32호48
12현대농장젖소경상북도 구미시 옥성면 주아리 911번지128
23강준농장한우경상북도 구미시 고아읍 괴평리 1112번지 405호 외 3필지130
34실로암돼지경상북도 구미시 도개면 다곡리 162번지2500
45초일농장젖소경상북도 구미시 옥성면 초곡리 243번지74
56문숙농장젖소경상북도 구미시 옥성면 초곡리 262번지180
67선화양돈단지돼지경상북도 구미시 옥성면 산촌리 761번지5300
78나녕농장돼지경상북도 구미시 고아읍 문성리 400번지1620
89이실농장한우경상북도 구미시 옥성면 농소리 929번지 1호59
910인철농장한우경상북도 구미시 해평면 월호리 67번지 3호54
연번사업장명칭주사육업종사업장소재지(지번)사육두수
15421543최춘옥 농장한우경상북도 구미시 도개면 월림리 243번지0
15431544한솔농장한우경상북도 구미시 선산읍 생곡리 1040번지0
15441545이동훈 농장한우경상북도 구미시 고아읍 괴평리 859번지 8600
15451546호야농장한우경상북도 구미시 해평면 송곡리 198번지 1호0
15461547무등농장한우경상북도 구미시 무을면 무등리 936번지0
15471548성민농장한우경상북도 구미시 무을면 원리 190번지 2호0
15481549행복농장한우경상북도 구미시 선산읍 포상리 983번지 1호0
15491550손 농장한우경상북도 구미시 산동읍 송산리 180번지0
15501551현진농장한우경상북도 구미시 옥성면 농소리 336번지 3호5
15511552부부농장한우경상북도 구미시 고아읍 황산리 208번지0