Overview

Dataset statistics

Number of variables: 6
Number of observations: 871
Missing cells: 153
Missing cells (%): 2.9%
Duplicate rows: 1
Duplicate rows (%): 0.1%
Total size in memory: 41.8 KiB
Average record size in memory: 49.2 B
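
The percentages in this table follow from the raw counts. As a minimal sketch (the helper name `percent` is hypothetical and not part of ydata-profiling), the missing-cell share is the number of missing cells divided by variables × observations, and the duplicate share is duplicate rows divided by observations:

```python
def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Counts copied from the dataset statistics in this report.
n_variables = 6
n_observations = 871
missing_cells = 153
duplicate_rows = 1

# Every row has one cell per variable.
total_cells = n_variables * n_observations

missing_pct = percent(missing_cells, total_cells)
duplicate_pct = percent(duplicate_rows, n_observations)

print(f"total cells: {total_cells}")
print(f"missing cells (%): {missing_pct}")
print(f"duplicate rows (%): {duplicate_pct}")
```

The same division by the 871 observations gives the per-variable figures quoted in the alerts, such as 152 missing values (17.5%) and 181 zeros (20.8%).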

Variable types

Text: 3
Categorical: 2
Numeric: 1

Dataset

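Description: Data on the status of poultry livestock farms located in Pohang-si, Gyeongsangbuk-do, providing information such as farm name, main livestock type, species, number of animals raised, and location.
URL: https://www.data.go.kr/data/15034203/fileData.do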

Alerts

데이터기준일자 has constant value "" [Constant]
Dataset has 1 (0.1%) duplicate rows [Duplicates]
주사육업종 is highly imbalanced (70.6%) [Imbalance]
도로명소재지 has 152 (17.5%) missing values [Missing]
사육두수 has 181 (20.8%) zeros [Zeros]

Reproduction

Analysis started: 2023-12-12 01:59:26.094360
Analysis finished: 2023-12-12 01:59:27.082543
Duration: 0.99 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

농가명
Text

Distinct: 753
Distinct (%): 86.5%
Missing: 0
Missing (%): 0.0%
Memory size: 6.9 KiB

Length

Max length: 14
Median length: 4
Mean length: 4.3432836
Min length: 3

Characters and Unicode

Total characters: 3783
Distinct characters: 327
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 675
Unique (%): 77.5%

Sample

1st row: 송학농장
2nd row: 장포농장
3rd row: 송학농장
4th row: 별농장
5th row: 예훈농장
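
The report lists "Distinct" and "Unique" as separate figures for this variable. A reasonable reading, stated here as an assumption rather than as the library's documented definition, is that distinct counts the different values present, while unique counts the values that occur exactly once. The sketch below applies that reading to the five sample rows above, using only the standard library:

```python
from collections import Counter

# The five sample rows shown above; the first and third are the same farm name.
sample = ["송학농장", "장포농장", "송학농장", "별농장", "예훈농장"]

counts = Counter(sample)

# Distinct: how many different values appear at all.
distinct = len(counts)

# Unique: how many values appear exactly once (assumed definition).
unique = sum(1 for n in counts.values() if n == 1)

print(f"distinct={distinct}, unique={unique}")
```

Under this reading, a column always has at least as many distinct values as unique ones, which is consistent with the figures reported for this variable.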
Value | Count | Frequency (%)
농장 | 30 | 3.2%
대성농장 | 9 | 1.0%
성곡농장 | 8 | 0.9%
푸른농장 | 5 | 0.5%
한우농장 | 5 | 0.5%
죽성농장 | 4 | 0.4%
덕실농장 | 4 | 0.4%
동해농장 | 4 | 0.4%
용곡농장 | 4 | 0.4%
신성농장 | 4 | 0.4%
Other values (755) | 847 | 91.7%

Most occurring characters

ValueCountFrequency (%)
828
21.9%
756
20.0%
85
 
2.2%
73
 
1.9%
60
 
1.6%
58
 
1.5%
53
 
1.4%
53
 
1.4%
49
 
1.3%
39
 
1.0%
Other values (317) 1729
45.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3707 | 98.0%
Space Separator | 53 | 1.4%
Uppercase Letter | 10 | 0.3%
Decimal Number | 9 | 0.2%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%
Dash Punctuation | 1 | < 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
828
22.3%
756
20.4%
85
 
2.3%
73
 
2.0%
60
 
1.6%
58
 
1.6%
53
 
1.4%
49
 
1.3%
39
 
1.1%
37
 
1.0%
Other values (301) 1669
45.0%
Uppercase Letter
ValueCountFrequency (%)
K 2
20.0%
I 2
20.0%
L 1
10.0%
H 1
10.0%
D 1
10.0%
A 1
10.0%
O 1
10.0%
S 1
10.0%
Decimal Number
ValueCountFrequency (%)
2 5
55.6%
1 3
33.3%
3 1
 
11.1%
Space Separator
ValueCountFrequency (%)
53
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3707 | 98.0%
Common | 66 | 1.7%
Latin | 10 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
828
22.3%
756
20.4%
85
 
2.3%
73
 
2.0%
60
 
1.6%
58
 
1.6%
53
 
1.4%
49
 
1.3%
39
 
1.1%
37
 
1.0%
Other values (301) 1669
45.0%
Common
ValueCountFrequency (%)
53
80.3%
2 5
 
7.6%
1 3
 
4.5%
3 1
 
1.5%
( 1
 
1.5%
) 1
 
1.5%
- 1
 
1.5%
~ 1
 
1.5%
Latin
ValueCountFrequency (%)
K 2
20.0%
I 2
20.0%
L 1
10.0%
H 1
10.0%
D 1
10.0%
A 1
10.0%
O 1
10.0%
S 1
10.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3707 | 98.0%
ASCII | 76 | 2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
828
22.3%
756
20.4%
85
 
2.3%
73
 
2.0%
60
 
1.6%
58
 
1.6%
53
 
1.4%
49
 
1.3%
39
 
1.1%
37
 
1.0%
Other values (301) 1669
45.0%
ASCII
ValueCountFrequency (%)
53
69.7%
2 5
 
6.6%
1 3
 
3.9%
K 2
 
2.6%
I 2
 
2.6%
3 1
 
1.3%
L 1
 
1.3%
H 1
 
1.3%
D 1
 
1.3%
( 1
 
1.3%
Other values (6) 6
 
7.9%

주사육업종
Categorical

IMBALANCE 

Distinct: 9
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 6.9 KiB

Value | Count
한우 | 752
젖소 | 38
종계/산란계 | 22
돼지 | 20
육우 | 16
Other values (4) | 23

Length

Max length: 6
Median length: 2
Mean length: 2.1010333
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 돼지
2nd row: 한우
3rd row: 한우
4th row: 한우
5th row: 한우

Common Values

Value | Count | Frequency (%)
한우 | 752 | 86.3%
젖소 | 38 | 4.4%
종계/산란계 | 22 | 2.5%
돼지 | 20 | 2.3%
육우 | 16 | 1.8%
산양 | 8 | 0.9%
염소 | 7 | 0.8%
사슴 | 6 | 0.7%
육계 | 2 | 0.2%
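
The 70.6% imbalance figure reported for 주사육업종 can be reproduced from the counts above if the score is taken to be one minus the Shannon entropy of the class distribution, normalized by the maximum entropy for that number of classes. That formula is an assumption about how the profiler scores imbalance; the function name `imbalance_score` is hypothetical. A minimal sketch:

```python
import math

# Value counts for 주사육업종, copied from the Common Values table.
counts = {
    "한우": 752,
    "젖소": 38,
    "종계/산란계": 22,
    "돼지": 20,
    "육우": 16,
    "산양": 8,
    "염소": 7,
    "사슴": 6,
    "육계": 2,
}


def imbalance_score(class_counts):
    """Return 1 - H / log2(k): 0 for a uniform split, near 1 for a dominant class."""
    total = sum(class_counts)
    k = len(class_counts)
    if k <= 1:
        # A single class has no spread to measure; 0.0 is an arbitrary choice here.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in class_counts if c > 0)
    return 1 - entropy / math.log2(k)


score = imbalance_score(list(counts.values()))
print(f"imbalance: {score:.1%}")
```

With 한우 accounting for 752 of the 871 rows, the entropy is far below its maximum of log2(9), which is why the score lands close to the value shown in the alert.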

Length

Histogram of lengths of the category

Common Values (Plot)

Bar chart of the value counts for 주사육업종; the values match the Common Values table for this variable.

지번소재지
Text

Distinct: 863
Distinct (%): 99.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.9 KiB

Length

Max length: 41
Median length: 40
Mean length: 28.486797
Min length: 4

Characters and Unicode

Total characters: 24812
Distinct characters: 130
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 856
Unique (%): 98.3%

Sample

1st row: 경상북도 포항시 남구 연일읍 학전리 340번지
2nd row: 경상북도 포항시 남구 장기면 산서리 416번지
3rd row: 경상북도 포항시 남구 연일읍 학전리 산 86번지 1호
4th row: 경상북도 포항시 북구 기계면 내단리 911번지 2호 외 4필지
5th row: 경상북도 포항시 북구 기계면 내단리 962번지 10호
Value | Count | Frequency (%)
경상북도 | 868 | 15.4%
포항시 | 868 | 15.4%
북구 | 658 | 11.6%
기계면 | 215 | 3.8%
남구 | 210 | 3.7%
신광면 | 151 | 2.7%
1호 | 133 | 2.4%
흥해읍 | 119 | 2.1%
장기면 | 86 | 1.5%
2호 | 81 | 1.4%
Other values (780) | 2260 | 40.0%

Most occurring characters

ValueCountFrequency (%)
6498
26.2%
1581
 
6.4%
935
 
3.8%
903
 
3.6%
891
 
3.6%
887
 
3.6%
869
 
3.5%
868
 
3.5%
868
 
3.5%
868
 
3.5%
Other values (120) 9644
38.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 15271 | 61.5%
Space Separator | 6498 | 26.2%
Decimal Number | 3027 | 12.2%
Close Punctuation | 5 | < 0.1%
Dash Punctuation | 5 | < 0.1%
Open Punctuation | 5 | < 0.1%
Other Punctuation | 1 | < 0.1%
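
The category names in this table correspond to Unicode general categories: "Other Letter" is `Lo`, "Space Separator" is `Zs`, and "Decimal Number" is `Nd`. The sketch below tallies those categories for the first 지번소재지 sample row using Python's standard `unicodedata` module. Whether the profiler uses this module internally is an assumption; the example only illustrates the classification itself.

```python
import unicodedata
from collections import Counter

# First sample row of 지번소재지 from this report.
address = "경상북도 포항시 남구 연일읍 학전리 340번지"

# unicodedata.category returns the two-letter general category of a character.
category_counts = Counter(unicodedata.category(ch) for ch in address)

for code, n in category_counts.most_common():
    print(code, n)
```

Hangul syllables fall under `Lo`, which is why "Other Letter" dominates the address columns, followed by spaces and digits.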

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1581
 
10.4%
935
 
6.1%
903
 
5.9%
891
 
5.8%
887
 
5.8%
869
 
5.7%
868
 
5.7%
868
 
5.7%
868
 
5.7%
866
 
5.7%
Other values (105) 5735
37.6%
Decimal Number
ValueCountFrequency (%)
1 537
17.7%
2 369
12.2%
3 342
11.3%
4 301
9.9%
6 296
9.8%
5 286
9.4%
7 253
8.4%
8 222
7.3%
9 215
7.1%
0 206
 
6.8%
Space Separator
ValueCountFrequency (%)
6498
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 15271 | 61.5%
Common | 9541 | 38.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1581
 
10.4%
935
 
6.1%
903
 
5.9%
891
 
5.8%
887
 
5.8%
869
 
5.7%
868
 
5.7%
868
 
5.7%
868
 
5.7%
866
 
5.7%
Other values (105) 5735
37.6%
Common
ValueCountFrequency (%)
6498
68.1%
1 537
 
5.6%
2 369
 
3.9%
3 342
 
3.6%
4 301
 
3.2%
6 296
 
3.1%
5 286
 
3.0%
7 253
 
2.7%
8 222
 
2.3%
9 215
 
2.3%
Other values (5) 222
 
2.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 15271 | 61.5%
ASCII | 9541 | 38.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6498
68.1%
1 537
 
5.6%
2 369
 
3.9%
3 342
 
3.6%
4 301
 
3.2%
6 296
 
3.1%
5 286
 
3.0%
7 253
 
2.7%
8 222
 
2.3%
9 215
 
2.3%
Other values (5) 222
 
2.3%
Hangul
ValueCountFrequency (%)
1581
 
10.4%
935
 
6.1%
903
 
5.9%
891
 
5.8%
887
 
5.8%
869
 
5.7%
868
 
5.7%
868
 
5.7%
868
 
5.7%
866
 
5.7%
Other values (105) 5735
37.6%

도로명소재지
Text

MISSING 

Distinct: 704
Distinct (%): 97.9%
Missing: 152
Missing (%): 17.5%
Memory size: 6.9 KiB

Length

Max length: 38
Median length: 35
Mean length: 26.955494
Min length: 21

Characters and Unicode

Total characters: 19381
Distinct characters: 138
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 689
Unique (%): 95.8%

Sample

1st row: 경상북도 포항시 남구 연일읍 새마을로 615-23
2nd row: 경상북도 포항시 남구 장기면 산서길 113-27
3rd row: 경상북도 포항시 남구 연일읍 새마을로 615-23
4th row: 경상북도 포항시 북구 기계면 새마을로1525번길 147-20
5th row: 경상북도 포항시 북구 기계면 새마을로 1347-25
Value | Count | Frequency (%)
경상북도 | 719 | 16.6%
포항시 | 719 | 16.6%
북구 | 554 | 12.8%
기계면 | 184 | 4.2%
남구 | 165 | 3.8%
신광면 | 137 | 3.2%
흥해읍 | 103 | 2.4%
장기면 | 65 | 1.5%
기북면 | 43 | 1.0%
호미곶면 | 38 | 0.9%
Other values (855) | 1610 | 37.1%

Most occurring characters

ValueCountFrequency (%)
3618
18.7%
1353
 
7.0%
746
 
3.8%
743
 
3.8%
733
 
3.8%
732
 
3.8%
730
 
3.8%
720
 
3.7%
719
 
3.7%
719
 
3.7%
Other values (128) 8568
44.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 11869 | 61.2%
Space Separator | 3618 | 18.7%
Decimal Number | 3478 | 17.9%
Dash Punctuation | 371 | 1.9%
Open Punctuation | 22 | 0.1%
Close Punctuation | 22 | 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1353
 
11.4%
746
 
6.3%
743
 
6.3%
733
 
6.2%
732
 
6.2%
730
 
6.2%
720
 
6.1%
719
 
6.1%
719
 
6.1%
576
 
4.9%
Other values (113) 4098
34.5%
Decimal Number
ValueCountFrequency (%)
1 714
20.5%
2 436
12.5%
3 378
10.9%
5 344
9.9%
4 314
9.0%
6 289
8.3%
8 287
8.3%
7 278
 
8.0%
9 225
 
6.5%
0 213
 
6.1%
Space Separator
ValueCountFrequency (%)
3618
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 371
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 11869 | 61.2%
Common | 7512 | 38.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1353
 
11.4%
746
 
6.3%
743
 
6.3%
733
 
6.2%
732
 
6.2%
730
 
6.2%
720
 
6.1%
719
 
6.1%
719
 
6.1%
576
 
4.9%
Other values (113) 4098
34.5%
Common
ValueCountFrequency (%)
3618
48.2%
1 714
 
9.5%
2 436
 
5.8%
3 378
 
5.0%
- 371
 
4.9%
5 344
 
4.6%
4 314
 
4.2%
6 289
 
3.8%
8 287
 
3.8%
7 278
 
3.7%
Other values (5) 483
 
6.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 11869 | 61.2%
ASCII | 7512 | 38.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3618
48.2%
1 714
 
9.5%
2 436
 
5.8%
3 378
 
5.0%
- 371
 
4.9%
5 344
 
4.6%
4 314
 
4.2%
6 289
 
3.8%
8 287
 
3.8%
7 278
 
3.7%
Other values (5) 483
 
6.4%
Hangul
ValueCountFrequency (%)
1353
 
11.4%
746
 
6.3%
743
 
6.3%
733
 
6.2%
732
 
6.2%
730
 
6.2%
720
 
6.1%
719
 
6.1%
719
 
6.1%
576
 
4.9%
Other values (113) 4098
34.5%

사육두수
Real number (ℝ)

ZEROS 

Distinct: 159
Distinct (%): 18.3%
Missing: 1
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 664.68161
Minimum: 0
Maximum: 180000
Zeros: 181
Zeros (%): 20.8%
Negative: 0
Negative (%): 0.0%
Memory size: 7.8 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 2
median: 16
Q3: 46
95-th percentile: 144.65
Maximum: 180000
Range: 180000
Interquartile range (IQR): 44

Descriptive statistics

Standard deviation: 8129.6014
Coefficient of variation (CV): 12.23082
Kurtosis: 318.13879
Mean: 664.68161
Median Absolute Deviation (MAD): 16
Skewness: 16.769428
Sum: 578273
Variance: 66090420
Monotonicity: Not monotonic
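
Several of these statistics are derived from one another, which gives a quick consistency check on the reported figures. The sketch below recomputes the mean, coefficient of variation, variance, and interquartile range from the other values in this section. It assumes the mean is taken over the non-missing observations only (871 rows minus 1 missing value); all variable names are illustrative.

```python
# Figures copied from the 사육두수 statistics in this report.
reported_mean = 664.68161
reported_std = 8129.6014
reported_sum = 578273
n_observations = 871
n_missing = 1
q1, q3 = 2, 46

# Assumption: missing values are excluded before averaging.
n_valid = n_observations - n_missing

mean_check = reported_sum / n_valid   # sum divided by the count of valid rows
cv_check = reported_std / reported_mean  # coefficient of variation = std / mean
variance_check = reported_std ** 2    # variance = std squared
iqr_check = q3 - q1                   # interquartile range = Q3 - Q1

print(f"mean     {mean_check:.5f}")
print(f"CV       {cv_check:.5f}")
print(f"variance {variance_check:.0f}")
print(f"IQR      {iqr_check}")
```

The large gap between the mean (about 665) and the median (16) reflects the handful of very large farms; the high skewness and kurtosis values point the same way.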
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 181 | 20.8%
4 | 33 | 3.8%
2 | 31 | 3.6%
6 | 22 | 2.5%
3 | 20 | 2.3%
8 | 18 | 2.1%
7 | 16 | 1.8%
5 | 16 | 1.8%
11 | 14 | 1.6%
30 | 13 | 1.5%
Other values (149) | 506 | 58.1%

Minimum 10 values

Value | Count | Frequency (%)
0 | 181 | 20.8%
1 | 13 | 1.5%
2 | 31 | 3.6%
3 | 20 | 2.3%
4 | 33 | 3.8%
5 | 16 | 1.8%
6 | 22 | 2.5%
7 | 16 | 1.8%
8 | 18 | 2.1%
9 | 9 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
180000 | 1 | 0.1%
97500 | 1 | 0.1%
87000 | 1 | 0.1%
74000 | 1 | 0.1%
43498 | 1 | 0.1%
27000 | 1 | 0.1%
10000 | 1 | 0.1%
9000 | 1 | 0.1%
4600 | 1 | 0.1%
3050 | 1 | 0.1%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.9 KiB

Value | Count
2023-04-14 | 871

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-04-14
2nd row: 2023-04-14
3rd row: 2023-04-14
4th row: 2023-04-14
5th row: 2023-04-14

Common Values

Value | Count | Frequency (%)
2023-04-14 | 871 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Bar chart of the value counts for 데이터기준일자 (2023-04-14: 871 rows, 100.0%).

Interactions


Correlations

 | 주사육업종 | 사육두수
주사육업종 | 1.000 | 0.515
사육두수 | 0.515 | 1.000

 | 사육두수 | 주사육업종
사육두수 | 1.000 | 0.285
주사육업종 | 0.285 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

 | 농가명 | 주사육업종 | 지번소재지 | 도로명소재지 + 사육두수 | 데이터기준일자
0 | 송학농장 | 돼지 | 경상북도 포항시 남구 연일읍 학전리 340번지 | 경상북도 포항시 남구 연일읍 새마을로 615-231649 | 2023-04-14
1 | 장포농장 | 한우 | 경상북도 포항시 남구 장기면 산서리 416번지 | 경상북도 포항시 남구 장기면 산서길 113-2741 | 2023-04-14
2 | 송학농장 | 한우 | 경상북도 포항시 남구 연일읍 학전리 산 86번지 1호 | 경상북도 포항시 남구 연일읍 새마을로 615-230 | 2023-04-14
3 | 별농장 | 한우 | 경상북도 포항시 북구 기계면 내단리 911번지 2호 외 4필지 | 경상북도 포항시 북구 기계면 새마을로1525번길 147-20224 | 2023-04-14
4 | 예훈농장 | 한우 | 경상북도 포항시 북구 기계면 내단리 962번지 10호 | 경상북도 포항시 북구 기계면 새마을로 1347-25117 | 2023-04-14
5 | 덕장농장 | 한우 | 경상북도 포항시 북구 흥해읍 덕장리 746번지 1호 | 경상북도 포항시 북구 흥해읍 덕실마을길174번길 40-651 | 2023-04-14
6 | 덕실농장 | 한우 | 경상북도 포항시 북구 흥해읍 덕성리 436번지 2호 | 경상북도 포항시 북구 흥해읍 덕실마을길 490-1271 | 2023-04-14
7 | 병태농장 | 한우 | 경상북도 포항시 북구 흥해읍 덕성리 555번지 | 경상북도 포항시 북구 흥해읍 덕실마을길 516-516 | 2023-04-14
8 | 대성농장 | 한우 | 경상북도 포항시 북구 기계면 미현리 169번지 1호 | 경상북도 포항시 북구 기계면 기동지길 52259 | 2023-04-14
9 | 부림농장 | 한우 | 경상북도 포항시 북구 기계면 미현리 18번지 5호 | 경상북도 포항시 북구 기계면 기동지길 5513 | 2023-04-14

Last rows

 | 농가명 | 주사육업종 | 지번소재지 | 도로명소재지 + 사육두수 | 데이터기준일자
861 | 세욱농장 | 육우 | 경상북도 포항시 북구 기계면 화봉리 305번지 | 경상북도 포항시 북구 기계면 화대길114번길 114-936 | 2023-04-14
862 | 우야농장2 | 한우 | 경상북도 포항시 북구 기북면 대곡리 923번지 | 경상북도 포항시 북구 기북면 기북로 334-2039 | 2023-04-14
863 | 광명농장2 | 한우 | 경상북도 포항시 북구 흥해읍 학천리 695번지 | 경상북도 포항시 북구 흥해읍 도음로 650-2563 | 2023-04-14
864 | 학산농원 | 한우 | 경상북도 포항시 북구 신광면 안덕리 516번지 | 경상북도 포항시 북구 신광면 비학로1197번길 97-190 | 2023-04-14
865 | 가나안목장 | 젖소 | 경상북도 포항시 북구 신광면 만석리 784번지 | 경상북도 포항시 북구 신광면 비학로1241번길 9-13964 | 2023-04-14
866 | 우리한우농장 | 한우 | 경상북도 포항시 북구 신광면 안덕리 894번지 2호 | 경상북도 포항시 북구 신광면 안덕길55번길 41-20185 | 2023-04-14
867 | 푸르미농장 | 한우 | 경상북도 포항시 북구 기계면 현내리 692번지 3호 | 경상북도 포항시 북구 기계면 새마을로1757번길 3115 | 2023-04-14
868 | 준축사 | 한우 | 경상북도 포항시 북구 흥해읍 흥안리 936번지 6호 | 경상북도 포항시 북구 흥해읍 칠포로 15820 | 2023-04-14
869 | ASK축산 | 한우 | 경상북도 포항시 북구 신광면 흥곡리 781번지 1호 | 경상북도 포항시 북구 신광면 흥곡길 119-9662 | 2023-04-14
870 | 한우농장 | 한우 | 경상북도 포항시 북구 기계면 내단리 134번지 2호 | 경상북도 포항시 북구 기계면 내단천길 8017 | 2023-04-14

Duplicate rows

Most frequently occurring

 | 농가명 | 주사육업종 | 지번소재지 | 도로명소재지 + 사육두수 | 데이터기준일자 | # duplicates
0 | 덕실농장 | 한우 | 경상북도 포항시 북구 흥해읍 덕성리 417번지 | 경상북도 포항시 북구 흥해읍 덕실마을길 490-837 | 2023-04-14 | 2