Overview

Dataset statistics

Number of variables5
Number of observations1936
Missing cells0
Missing cells (%)0.0%
Duplicate rows9
Duplicate rows (%)0.5%
Total size in memory77.6 KiB
Average record size in memory41.1 B

Variable types

Text2
Categorical2
Numeric1

Dataset

Description경상북도 예천군에서 축산현황 및 가금류 농가현황 (사업장명칭, 주사육업종, 사업장 소재지, 면적(㎡) 등)입니다.
Author경상북도 예천군
URLhttps://www.data.go.kr/data/15034264/fileData.do

Alerts

기준일자 has constant value ""Constant
Dataset has 9 (0.5%) duplicate rowsDuplicates
면적 is highly overall correlated with 주사육업종High correlation
주사육업종 is highly overall correlated with 면적High correlation
주사육업종 is highly imbalanced (81.3%)Imbalance

Reproduction

Analysis started2024-03-14 13:53:35.548907
Analysis finished2024-03-14 13:53:36.508032
Duration0.96 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct754
Distinct (%)38.9%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2024-03-14T22:53:37.572279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length4
Mean length4.3734504
Min length2

Characters and Unicode

Total characters8467
Distinct characters300
Distinct categories6 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique563 ?
Unique (%)29.1%

Sample

1st row안0농장
2nd row경0농장
3rd row하0농장
4th row옥0농장
5th row부0목장
ValueCountFrequency (%)
농장 103
 
5.0%
영0농장 56
 
2.7%
재0농장 54
 
2.6%
종0농장 35
 
1.7%
대0농장 32
 
1.5%
상0농장 31
 
1.5%
성0농장 31
 
1.5%
병0농장 27
 
1.3%
창0농장 24
 
1.2%
태0농장 24
 
1.2%
Other values (761) 1657
79.9%
2024-03-14T22:53:39.214115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1864
22.0%
1759
20.8%
1732
20.5%
141
 
1.7%
138
 
1.6%
119
 
1.4%
71
 
0.8%
65
 
0.8%
62
 
0.7%
58
 
0.7%
Other values (290) 2458
29.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6382
75.4%
Decimal Number 1921
 
22.7%
Space Separator 138
 
1.6%
Uppercase Letter 12
 
0.1%
Open Punctuation 7
 
0.1%
Close Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1759
27.6%
1732
27.1%
141
 
2.2%
119
 
1.9%
71
 
1.1%
65
 
1.0%
62
 
1.0%
58
 
0.9%
51
 
0.8%
50
 
0.8%
Other values (277) 2274
35.6%
Decimal Number
ValueCountFrequency (%)
0 1864
97.0%
2 47
 
2.4%
1 5
 
0.3%
3 4
 
0.2%
4 1
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
S 4
33.3%
R 2
16.7%
M 2
16.7%
A 2
16.7%
F 2
16.7%
Space Separator
ValueCountFrequency (%)
138
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6381
75.4%
Common 2073
 
24.5%
Latin 12
 
0.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1759
27.6%
1732
27.1%
141
 
2.2%
119
 
1.9%
71
 
1.1%
65
 
1.0%
62
 
1.0%
58
 
0.9%
51
 
0.8%
50
 
0.8%
Other values (276) 2273
35.6%
Common
ValueCountFrequency (%)
0 1864
89.9%
138
 
6.7%
2 47
 
2.3%
( 7
 
0.3%
) 7
 
0.3%
1 5
 
0.2%
3 4
 
0.2%
4 1
 
< 0.1%
Latin
ValueCountFrequency (%)
S 4
33.3%
R 2
16.7%
M 2
16.7%
A 2
16.7%
F 2
16.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6381
75.4%
ASCII 2085
 
24.6%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1864
89.4%
138
 
6.6%
2 47
 
2.3%
( 7
 
0.3%
) 7
 
0.3%
1 5
 
0.2%
S 4
 
0.2%
3 4
 
0.2%
R 2
 
0.1%
M 2
 
0.1%
Other values (3) 5
 
0.2%
Hangul
ValueCountFrequency (%)
1759
27.6%
1732
27.1%
141
 
2.2%
119
 
1.9%
71
 
1.1%
65
 
1.0%
62
 
1.0%
58
 
0.9%
51
 
0.8%
50
 
0.8%
Other values (276) 2273
35.6%
CJK
ValueCountFrequency (%)
1
100.0%

주사육업종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct12
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
한우
1764 
염소
 
59
돼지
 
32
종계/산란계
 
23
육계
 
21
Other values (7)
 
37

Length

Max length6
Median length2
Mean length2.0485537
Min length2

Unique

Unique3 ?
Unique (%)0.2%

Sample

1st row돼지
2nd row돼지
3rd row한우
4th row돼지
5th row젖소

Common Values

ValueCountFrequency (%)
한우 1764
91.1%
염소 59
 
3.0%
돼지 32
 
1.7%
종계/산란계 23
 
1.2%
육계 21
 
1.1%
육우 15
 
0.8%
오리 9
 
0.5%
젖소 7
 
0.4%
사슴 3
 
0.2%
산양 1
 
0.1%
Other values (2) 2
 
0.1%

Length

2024-03-14T22:53:39.463894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 1764
91.1%
염소 59
 
3.0%
돼지 32
 
1.7%
종계/산란계 23
 
1.2%
육계 21
 
1.1%
육우 15
 
0.8%
오리 9
 
0.5%
젖소 7
 
0.4%
사슴 3
 
0.2%
산양 1
 
0.1%
Other values (2) 2
 
0.1%
Distinct252
Distinct (%)13.0%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2024-03-14T22:53:40.808425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length81
Median length16
Mean length16.380165
Min length16

Characters and Unicode

Total characters31712
Distinct characters154
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)4.9%

Sample

1st row경상북도 예천군 풍양면 흔효리
2nd row경상북도 예천군 개포면 동송리
3rd row경상북도 예천군 유천면 화지리
4th row경상북도 예천군 지보면 수월리
5th row경상북도 예천군 은풍면 부초리
ValueCountFrequency (%)
경상북도 1936
24.6%
예천군 1936
24.6%
풍양면 290
 
3.7%
감천면 278
 
3.5%
예천읍 259
 
3.3%
지보면 188
 
2.4%
용문면 179
 
2.3%
유천면 161
 
2.0%
호명면 142
 
1.8%
용궁면 134
 
1.7%
Other values (270) 2360
30.0%
2024-03-14T22:53:42.411245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6202
19.6%
2725
 
8.6%
2196
 
6.9%
2009
 
6.3%
1980
 
6.2%
1944
 
6.1%
1936
 
6.1%
1936
 
6.1%
1900
 
6.0%
1677
 
5.3%
Other values (144) 7207
22.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25215
79.5%
Space Separator 6202
 
19.6%
Decimal Number 281
 
0.9%
Other Punctuation 7
 
< 0.1%
Dash Punctuation 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2725
10.8%
2196
 
8.7%
2009
 
8.0%
1980
 
7.9%
1944
 
7.7%
1936
 
7.7%
1936
 
7.7%
1900
 
7.5%
1677
 
6.7%
390
 
1.5%
Other values (131) 6522
25.9%
Decimal Number
ValueCountFrequency (%)
2 48
17.1%
1 46
16.4%
3 44
15.7%
5 32
11.4%
6 27
9.6%
4 24
8.5%
7 23
8.2%
8 13
 
4.6%
9 12
 
4.3%
0 12
 
4.3%
Space Separator
ValueCountFrequency (%)
6202
100.0%
Other Punctuation
ValueCountFrequency (%)
, 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25215
79.5%
Common 6497
 
20.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2725
10.8%
2196
 
8.7%
2009
 
8.0%
1980
 
7.9%
1944
 
7.7%
1936
 
7.7%
1936
 
7.7%
1900
 
7.5%
1677
 
6.7%
390
 
1.5%
Other values (131) 6522
25.9%
Common
ValueCountFrequency (%)
6202
95.5%
2 48
 
0.7%
1 46
 
0.7%
3 44
 
0.7%
5 32
 
0.5%
6 27
 
0.4%
4 24
 
0.4%
7 23
 
0.4%
8 13
 
0.2%
9 12
 
0.2%
Other values (3) 26
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25215
79.5%
ASCII 6497
 
20.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6202
95.5%
2 48
 
0.7%
1 46
 
0.7%
3 44
 
0.7%
5 32
 
0.5%
6 27
 
0.4%
4 24
 
0.4%
7 23
 
0.4%
8 13
 
0.2%
9 12
 
0.2%
Other values (3) 26
 
0.4%
Hangul
ValueCountFrequency (%)
2725
10.8%
2196
 
8.7%
2009
 
8.0%
1980
 
7.9%
1944
 
7.7%
1936
 
7.7%
1936
 
7.7%
1900
 
7.5%
1677
 
6.7%
390
 
1.5%
Other values (131) 6522
25.9%

면적
Real number (ℝ)

HIGH CORRELATION 

Distinct785
Distinct (%)40.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean579.92319
Minimum0
Maximum35150
Zeros2
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size17.1 KiB
2024-03-14T22:53:42.654840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile21
Q182.45
median325.745
Q3768
95-th percentile1788
Maximum35150
Range35150
Interquartile range (IQR)685.55

Descriptive statistics

Standard deviation1085.5546
Coefficient of variation (CV)1.8718938
Kurtosis538.9575
Mean579.92319
Median Absolute Deviation (MAD)276.745
Skewness18.064341
Sum1122731.3
Variance1178428.9
MonotonicityNot monotonic
2024-03-14T22:53:43.086258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33.0 53
 
2.7%
320.0 41
 
2.1%
66.0 34
 
1.8%
384.0 32
 
1.7%
192.0 28
 
1.4%
640.0 28
 
1.4%
40.0 27
 
1.4%
50.0 27
 
1.4%
30.0 23
 
1.2%
99.0 23
 
1.2%
Other values (775) 1620
83.7%
ValueCountFrequency (%)
0.0 2
0.1%
1.0 3
0.2%
2.0 4
0.2%
3.0 4
0.2%
4.0 4
0.2%
5.0 4
0.2%
6.0 3
0.2%
7.0 1
 
0.1%
8.0 3
0.2%
9.01 1
 
0.1%
ValueCountFrequency (%)
35150.0 1
0.1%
11772.59 1
0.1%
7227.04 1
0.1%
5978.0 1
0.1%
5504.18 1
0.1%
5376.19 1
0.1%
5045.68 1
0.1%
4860.0 1
0.1%
4669.35 1
0.1%
4456.0 1
0.1%

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2024-01-09
1936 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-01-09
2nd row2024-01-09
3rd row2024-01-09
4th row2024-01-09
5th row2024-01-09

Common Values

ValueCountFrequency (%)
2024-01-09 1936
100.0%

Length

2024-03-14T22:53:43.499104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T22:53:43.805877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-01-09 1936
100.0%

Interactions

2024-03-14T22:53:36.000898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T22:53:43.982409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주사육업종면적
주사육업종1.0000.760
면적0.7601.000
2024-03-14T22:53:44.206631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적주사육업종
면적1.0000.552
주사육업종0.5521.000

Missing values

2024-03-14T22:53:36.269973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T22:53:36.438727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명칭주사육업종사업장소재지면적기준일자
0안0농장돼지경상북도 예천군 풍양면 흔효리11772.592024-01-09
1경0농장돼지경상북도 예천군 개포면 동송리3585.592024-01-09
2하0농장한우경상북도 예천군 유천면 화지리894.12024-01-09
3옥0농장돼지경상북도 예천군 지보면 수월리2748.72024-01-09
4부0목장젖소경상북도 예천군 은풍면 부초리1556.442024-01-09
5수0농장한우경상북도 예천군 유천면 수심리964.12024-01-09
6대0농장종계/산란계경상북도 예천군 호명면 종산리5045.682024-01-09
7승0농장육계경상북도 예천군 보문면 승본리2039.12024-01-09
8고0리골 제1농장한우경상북도 예천군 용문면 덕신리906.02024-01-09
9용0목장젖소경상북도 예천군 용궁면 금남리2266.622024-01-09
사업장명칭주사육업종사업장소재지면적기준일자
1926희망농장한우경상북도 예천군 유천면 화지리 263번지17.02024-01-09
1927희망농장한우경상북도 예천군 풍양면 고산리 197번지12.02024-01-09
1928희망농장한우경상북도 예천군 예천읍 통명리 452번지10.02024-01-09
1929희삼축산한우경상북도 예천군 풍양면 흔효리 180번지51.02024-01-09
1930희상농장한우경상북도 예천군 용문면 상금곡리 348번지8.02024-01-09
1931희석농장한우경상북도 예천군 유천면 고산리 200번지 1호30.02024-01-09
1932희야농장한우경상북도 예천군 지보면 마전리 741번지18.02024-01-09
1933희영농장한우경상북도 예천군 예천읍 지내리 453번지24.02024-01-09
1934희주농장한우경상북도 예천군 감천면 포리 279번지 1호15.02024-01-09
1935희찬농장한우경상북도 예천군 유천면 광전리 340번지 광전리 34633.02024-01-09

Duplicate rows

Most frequently occurring

사업장명칭주사육업종사업장소재지면적기준일자# duplicates
0강0농장한우경상북도 예천군 효자면 백석리40.02024-01-092
1건0농장한우경상북도 예천군 용문면 덕신리288.02024-01-092
2군0농장한우경상북도 예천군 용문면 하금곡33.02024-01-092
3기0농장한우경상북도 예천군 예천읍 고평리30.02024-01-092
4은0농장한우경상북도 예천군 은풍면 우곡리723.072024-01-092
5재0농장한우경상북도 예천군 용궁면 금남리320.02024-01-092
6재0농장한우경상북도 예천군 지보면 도화리33.02024-01-092
7정0농장한우경상북도 예천군 감천면 현내리800.02024-01-092
8주0농장한우경상북도 예천군 효자면 석묘리65.02024-01-092