Overview

Dataset statistics

Number of variables4
Number of observations683
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory22.1 KiB
Average record size in memory33.2 B

Variable types

Text2
Categorical1
Numeric1

Dataset

Description충청북도 옥천군의 축사 현황을 나타냅니다. 데이터 항목으로 사업장명칭, 주사육업종, 사업장소재지(지번) 및 축사면적(제곱미터)이 있습니다.
Author충청북도 옥천군
URLhttps://www.data.go.kr/data/15125379/fileData.do

Alerts

주사육업종 is highly imbalanced (80.8%)Imbalance

Reproduction

Analysis started2023-12-12 05:36:22.910915
Analysis finished2023-12-12 05:36:23.525278
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct649
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-12T14:36:23.733793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length4
Mean length4.5461201
Min length2

Characters and Unicode

Total characters3105
Distinct characters303
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique621 ?
Unique (%)90.9%

Sample

1st row한덕농장
2nd row능월농장
3rd row월수목장
4th row대성목장
5th row연일농장
ValueCountFrequency (%)
농장 16
 
2.2%
한비농장 5
 
0.7%
동이농장 3
 
0.4%
수정농장 3
 
0.4%
옥천농장 3
 
0.4%
농업회사법인 3
 
0.4%
형제목장 3
 
0.4%
가풍목장 3
 
0.4%
청정목장 3
 
0.4%
그린농장 3
 
0.4%
Other values (651) 687
93.9%
2023-12-12T14:36:24.207729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
609
 
19.6%
343
 
11.0%
281
 
9.0%
66
 
2.1%
2 54
 
1.7%
49
 
1.6%
47
 
1.5%
46
 
1.5%
45
 
1.4%
39
 
1.3%
Other values (293) 1526
49.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2939
94.7%
Decimal Number 91
 
2.9%
Space Separator 49
 
1.6%
Open Punctuation 12
 
0.4%
Close Punctuation 12
 
0.4%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
609
20.7%
343
 
11.7%
281
 
9.6%
66
 
2.2%
47
 
1.6%
46
 
1.6%
45
 
1.5%
39
 
1.3%
38
 
1.3%
36
 
1.2%
Other values (282) 1389
47.3%
Decimal Number
ValueCountFrequency (%)
2 54
59.3%
1 15
 
16.5%
3 12
 
13.2%
4 4
 
4.4%
5 3
 
3.3%
0 2
 
2.2%
6 1
 
1.1%
Space Separator
ValueCountFrequency (%)
49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Other Punctuation
ValueCountFrequency (%)
' 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2939
94.7%
Common 166
 
5.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
609
20.7%
343
 
11.7%
281
 
9.6%
66
 
2.2%
47
 
1.6%
46
 
1.6%
45
 
1.5%
39
 
1.3%
38
 
1.3%
36
 
1.2%
Other values (282) 1389
47.3%
Common
ValueCountFrequency (%)
2 54
32.5%
49
29.5%
1 15
 
9.0%
3 12
 
7.2%
( 12
 
7.2%
) 12
 
7.2%
4 4
 
2.4%
5 3
 
1.8%
' 2
 
1.2%
0 2
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2939
94.7%
ASCII 166
 
5.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
609
20.7%
343
 
11.7%
281
 
9.6%
66
 
2.2%
47
 
1.6%
46
 
1.6%
45
 
1.5%
39
 
1.3%
38
 
1.3%
36
 
1.2%
Other values (282) 1389
47.3%
ASCII
ValueCountFrequency (%)
2 54
32.5%
49
29.5%
1 15
 
9.0%
3 12
 
7.2%
( 12
 
7.2%
) 12
 
7.2%
4 4
 
2.4%
5 3
 
1.8%
' 2
 
1.2%
0 2
 
1.2%

주사육업종
Categorical

IMBALANCE 

Distinct11
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
한우
622 
염소
 
25
산란계
 
12
돼지
 
7
젖소
 
6
Other values (6)
 
11

Length

Max length3
Median length2
Mean length2.0204978
Min length2

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row한우
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 622
91.1%
염소 25
 
3.7%
산란계 12
 
1.8%
돼지 7
 
1.0%
젖소 6
 
0.9%
육계 3
 
0.4%
육우 2
 
0.3%
메추리 2
 
0.3%
사슴 2
 
0.3%
산양 1
 
0.1%

Length

2023-12-12T14:36:24.373448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 622
91.1%
염소 25
 
3.7%
산란계 12
 
1.8%
돼지 7
 
1.0%
젖소 6
 
0.9%
육계 3
 
0.4%
육우 2
 
0.3%
메추리 2
 
0.3%
사슴 2
 
0.3%
산양 1
 
0.1%
Distinct677
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-12T14:36:24.727833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length43
Mean length25.860908
Min length20

Characters and Unicode

Total characters17663
Distinct characters133
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique673 ?
Unique (%)98.5%

Sample

1st row충청북도 옥천군 안내면 도율리 270번지 1호
2nd row충청북도 옥천군 청성면 능월리 522번지 6호
3rd row충청북도 옥천군 옥천읍 매화리 131번지 1호
4th row충청북도 옥천군 안내면 현리 132번지 7호
5th row충청북도 옥천군 옥천읍 가풍리 663번지 2호
ValueCountFrequency (%)
충청북도 683
17.9%
옥천군 683
17.9%
옥천읍 216
 
5.7%
동이면 127
 
3.3%
1호 121
 
3.2%
2호 67
 
1.8%
안내면 61
 
1.6%
삼청리 59
 
1.5%
이원면 58
 
1.5%
구일리 52
 
1.4%
Other values (680) 1683
44.2%
2023-12-12T14:36:25.331624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4410
25.0%
924
 
5.2%
900
 
5.1%
857
 
4.9%
760
 
4.3%
757
 
4.3%
737
 
4.2%
708
 
4.0%
693
 
3.9%
684
 
3.9%
Other values (123) 6233
35.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10681
60.5%
Space Separator 4410
25.0%
Decimal Number 2489
 
14.1%
Dash Punctuation 34
 
0.2%
Other Punctuation 22
 
0.1%
Close Punctuation 13
 
0.1%
Open Punctuation 13
 
0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
924
 
8.7%
900
 
8.4%
857
 
8.0%
760
 
7.1%
757
 
7.1%
737
 
6.9%
708
 
6.6%
693
 
6.5%
684
 
6.4%
683
 
6.4%
Other values (107) 2978
27.9%
Decimal Number
ValueCountFrequency (%)
1 523
21.0%
2 291
11.7%
3 266
10.7%
4 237
9.5%
5 229
9.2%
6 206
 
8.3%
8 196
 
7.9%
9 191
 
7.7%
0 184
 
7.4%
7 166
 
6.7%
Space Separator
ValueCountFrequency (%)
4410
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 34
100.0%
Other Punctuation
ValueCountFrequency (%)
, 22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10681
60.5%
Common 6981
39.5%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
924
 
8.7%
900
 
8.4%
857
 
8.0%
760
 
7.1%
757
 
7.1%
737
 
6.9%
708
 
6.6%
693
 
6.5%
684
 
6.4%
683
 
6.4%
Other values (107) 2978
27.9%
Common
ValueCountFrequency (%)
4410
63.2%
1 523
 
7.5%
2 291
 
4.2%
3 266
 
3.8%
4 237
 
3.4%
5 229
 
3.3%
6 206
 
3.0%
8 196
 
2.8%
9 191
 
2.7%
0 184
 
2.6%
Other values (5) 248
 
3.6%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10681
60.5%
ASCII 6982
39.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4410
63.2%
1 523
 
7.5%
2 291
 
4.2%
3 266
 
3.8%
4 237
 
3.4%
5 229
 
3.3%
6 206
 
3.0%
8 196
 
2.8%
9 191
 
2.7%
0 184
 
2.6%
Other values (6) 249
 
3.6%
Hangul
ValueCountFrequency (%)
924
 
8.7%
900
 
8.4%
857
 
8.0%
760
 
7.1%
757
 
7.1%
737
 
6.9%
708
 
6.6%
693
 
6.5%
684
 
6.4%
683
 
6.4%
Other values (107) 2978
27.9%

축사면적
Real number (ℝ)

Distinct452
Distinct (%)66.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean768.73375
Minimum0
Maximum14532.06
Zeros1
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size6.1 KiB
2023-12-12T14:36:25.540138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile80.02
Q1317.5
median550
Q3900
95-th percentile1875.36
Maximum14532.06
Range14532.06
Interquartile range (IQR)582.5

Descriptive statistics

Standard deviation1053.7354
Coefficient of variation (CV)1.3707417
Kurtosis66.9465
Mean768.73375
Median Absolute Deviation (MAD)274
Skewness6.8884101
Sum525045.15
Variance1110358.2
MonotonicityNot monotonic
2023-12-12T14:36:25.713578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
448.0 14
 
2.0%
640.0 13
 
1.9%
768.0 12
 
1.8%
384.0 12
 
1.8%
320.0 11
 
1.6%
192.0 8
 
1.2%
300.0 8
 
1.2%
256.0 8
 
1.2%
576.0 7
 
1.0%
800.0 7
 
1.0%
Other values (442) 583
85.4%
ValueCountFrequency (%)
0.0 1
0.1%
16.0 1
0.1%
31.45 1
0.1%
31.72 1
0.1%
32.0 1
0.1%
33.0 1
0.1%
33.6 1
0.1%
35.0 1
0.1%
36.0 2
0.3%
39.0 1
0.1%
ValueCountFrequency (%)
14532.06 1
0.1%
10444.61 1
0.1%
9710.84 1
0.1%
8085.47 1
0.1%
7566.0 1
0.1%
5993.0 1
0.1%
5954.78 1
0.1%
4497.68 1
0.1%
3873.2 1
0.1%
3591.25 1
0.1%

Interactions

2023-12-12T14:36:23.247949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:36:25.800833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주사육업종축사면적
주사육업종1.0000.502
축사면적0.5021.000
2023-12-12T14:36:25.878796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축사면적주사육업종
축사면적1.0000.254
주사육업종0.2541.000

Missing values

2023-12-12T14:36:23.383347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:36:23.483141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명칭주사육업종사업장소재지(지번)축사면적
0한덕농장한우충청북도 옥천군 안내면 도율리 270번지 1호532.0
1능월농장한우충청북도 옥천군 청성면 능월리 522번지 6호910.11
2월수목장한우충청북도 옥천군 옥천읍 매화리 131번지 1호377.4
3대성목장한우충청북도 옥천군 안내면 현리 132번지 7호416.0
4연일농장한우충청북도 옥천군 옥천읍 가풍리 663번지 2호762.5
5대호축산한우충청북도 옥천군 안남면 지수리 1094번지398.25
6대성농장(1)산란계충청북도 옥천군 안남면 도농리 696번지 1호5993.0
7수일목장한우충청북도 옥천군 안남면 화학리 78번지780.0
8건내뜰농장한우충청북도 옥천군 안내면 방하목리 121번지432.0
9싸리재농장한우충청북도 옥천군 안내면 오덕리 453번지 2호562.6
사업장명칭주사육업종사업장소재지(지번)축사면적
673귀죽목장5호한우충청북도 옥천군 옥천읍 구일리 880번지1024.0
674상진농장4한우충청북도 옥천군 동이면 금암리 641번지 번지 외 1필지(641-1)1127.0
675해성농장1한우충청북도 옥천군 옥천읍 마암리 276번지462.68
676동이농장한우충청북도 옥천군 안내면 오덕리 1175번지640.0
677조천농장한우충청북도 옥천군 청성면 조천리 498번지 1호 ,499-160.0
678재일농장한우충청북도 옥천군 청산면 장위리 356번지 1호900.0
679하나농장한우충청북도 옥천군 동이면 금암리 629번지704.0
680시은목장염소충청북도 옥천군 군북면 자모리 96번지75.0
681종삼목장한우충청북도 옥천군 옥천읍 가풍리 569번지 1호650.0
682청정농장한우충청북도 옥천군 안남면 청정리 840번지448.0