Overview

Dataset statistics

Number of variables: 5
Number of observations: 909
Missing cells: 35
Missing cells (%): 0.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 37.4 KiB
Average record size in memory: 42.1 B
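
The two derived figures in this table can be recomputed from the raw counts. The following is a minimal sketch, assuming that the missing-cell share is taken over all cells (variables × observations) and that the average record size is the total memory divided by the number of observations; the variable names are illustrative and are not part of ydata-profiling.

```python
# Values copied from the "Dataset statistics" table.
n_variables = 5
n_observations = 909
missing_cells = 35
total_size_kib = 37.4

# Share of missing cells among all cells (assumed definition).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average bytes per record, using 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions both values round to the reported figures.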

Variable types

Numeric: 2
Text: 2
Categorical: 1

Dataset

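Description: Status data on livestock farms registered in Boeun-gun, Chungcheongbuk-do. It provides the farm name, livestock type, business location, and head count for each farm.
URL: https://www.data.go.kr/data/15112822/fileData.do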

Alerts

사육업종 (livestock type) is highly imbalanced (65.5%) [Imbalance]
사육두수 (head count) has 35 (3.9%) missing values [Missing]
순번 (serial number) has unique values [Unique]
사육두수 (head count) has 26 (2.9%) zeros [Zeros]

Reproduction

Analysis started: 2023-12-12 05:24:26.545042
Analysis finished: 2023-12-12 05:24:27.305053
Duration: 0.76 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
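
The reported duration is the difference between the two timestamps. A minimal sketch using only the Python standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2023-12-12 05:24:26.545042")
finished = datetime.fromisoformat("2023-12-12 05:24:27.305053")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```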

Variables

순번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 909
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 455
Minimum: 1
Maximum: 909
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 8.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 46.4
Q1: 228
Median: 455
Q3: 682
95-th percentile: 863.6
Maximum: 909
Range: 908
Interquartile range (IQR): 454

Descriptive statistics

Standard deviation: 262.55
Coefficient of variation (CV): 0.57703296
Kurtosis: -1.2
Mean: 455
Median Absolute Deviation (MAD): 227
Skewness: 0
Sum: 413595
Variance: 68932.5
Monotonicity: Strictly increasing
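
Because 순번 is simply the sequence 1 to 909, most of these figures can be checked by hand. The sketch below assumes that the variance uses the sample (n − 1) denominator and that percentiles are linearly interpolated between neighbouring values; both assumptions reproduce the reported numbers, but the report itself does not state the method. The helper `percentile` is hypothetical.

```python
import statistics

values = list(range(1, 910))  # 순번 runs from 1 to 909


def percentile(sorted_values, q):
    """Linearly interpolated percentile for q in [0, 1] (assumed method)."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)
cv = std_dev / mean                     # coefficient of variation

p05 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)

print(mean, variance, round(std_dev, 2), round(cv, 5))
print(p05, q1, q3, q3 - q1)
```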
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
1                   1      0.1%
626                 1      0.1%
600                 1      0.1%
601                 1      0.1%
602                 1      0.1%
603                 1      0.1%
604                 1      0.1%
605                 1      0.1%
606                 1      0.1%
607                 1      0.1%
Other values (899)  899    98.9%

Minimum 10 values

Value  Count  Frequency (%)
1      1      0.1%
2      1      0.1%
3      1      0.1%
4      1      0.1%
5      1      0.1%
6      1      0.1%
7      1      0.1%
8      1      0.1%
9      1      0.1%
10     1      0.1%

Maximum 10 values

Value  Count  Frequency (%)
909    1      0.1%
908    1      0.1%
907    1      0.1%
906    1      0.1%
905    1      0.1%
904    1      0.1%
903    1      0.1%
902    1      0.1%
901    1      0.1%
900    1      0.1%

농가명 (farm name)
Text

Distinct: 75
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Memory size: 7.2 KiB

Length

Max length: 15
Median length: 5
Mean length: 5.2464246
Min length: 5

Characters and Unicode

Total characters: 4769
Distinct characters: 100
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
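
As an illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module to count general categories in one farm name. It is not the profiler's own code. The masking circle is assumed to be U+25CB (WHITE CIRCLE), which the extracted text does not confirm; script and block lookups are not available in the standard library, so only the general category is shown.

```python
import unicodedata
from collections import Counter

# Most frequent 농가명 value in this report. The masking circle is assumed
# to be U+25CB (WHITE CIRCLE); this code point is an assumption.
sample = "김" + "\u25cb" * 2 + "농장"

# Two-letter Unicode general category per character,
# e.g. "Lo" = Other Letter, "So" = Other Symbol.
category_counts = Counter(unicodedata.category(ch) for ch in sample)

for category, count in category_counts.most_common():
    print(category, count)
```

Hangul syllables fall under Other Letter and the masking circles under Other Symbol, the two dominant categories reported for this variable.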

Unique

Unique: 19
Unique (%): 2.1%

Sample

1st row: 최○○농장
2nd row: 변○○농장
3rd row: 이○○농장
4th row: 최○○농장
5th row: 이○○농장

Value              Count  Frequency (%)
김○○농장           191    20.8%
이○○농장           159    17.3%
박○○농장           57     6.2%
최○○농장           54     5.9%
송○○농장           32     3.5%
정○○농장           30     3.3%
조○○농장           24     2.6%
양○○농장           22     2.4%
황○○농장           21     2.3%
안○○농장           19     2.1%
Other values (65)  311    33.8%

Most occurring characters

ValueCountFrequency (%)
1748
36.7%
908
19.0%
893
18.7%
191
 
4.0%
159
 
3.3%
57
 
1.2%
54
 
1.1%
40
 
0.8%
32
 
0.7%
31
 
0.7%
Other values (90) 656
 
13.8%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       2985   62.6%
Other Symbol       1757   36.8%
Space Separator    11     0.2%
Open Punctuation   7      0.1%
Close Punctuation  7      0.1%
Other Punctuation  2      < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
908
30.4%
893
29.9%
191
 
6.4%
159
 
5.3%
57
 
1.9%
54
 
1.8%
40
 
1.3%
32
 
1.1%
31
 
1.0%
25
 
0.8%
Other values (83) 595
19.9%
Other Symbol
ValueCountFrequency (%)
1748
99.5%
9
 
0.5%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
, 1
50.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  2994   62.8%
Common  1775   37.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
908
30.3%
893
29.8%
191
 
6.4%
159
 
5.3%
57
 
1.9%
54
 
1.8%
40
 
1.3%
32
 
1.1%
31
 
1.0%
25
 
0.8%
Other values (84) 604
20.2%
Common
ValueCountFrequency (%)
1748
98.5%
11
 
0.6%
( 7
 
0.4%
) 7
 
0.4%
. 1
 
0.1%
, 1
 
0.1%

Most occurring blocks

Value             Count  Frequency (%)
Hangul            2985   62.6%
Geometric Shapes  1748   36.7%
ASCII             27     0.6%
None              9      0.2%

Most frequent character per block

Geometric Shapes
ValueCountFrequency (%)
1748
100.0%
Hangul
ValueCountFrequency (%)
908
30.4%
893
29.9%
191
 
6.4%
159
 
5.3%
57
 
1.9%
54
 
1.8%
40
 
1.3%
32
 
1.1%
31
 
1.0%
25
 
0.8%
Other values (83) 595
19.9%
ASCII
ValueCountFrequency (%)
11
40.7%
( 7
25.9%
) 7
25.9%
. 1
 
3.7%
, 1
 
3.7%
None
ValueCountFrequency (%)
9
100.0%

주소 (address)
Text

Distinct: 144
Distinct (%): 15.8%
Missing: 0
Missing (%): 0.0%
Memory size: 7.2 KiB

Length

Max length: 17
Median length: 16
Mean length: 16.026403
Min length: 15

Characters and Unicode

Total characters: 14568
Distinct characters: 135
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 15
Unique (%): 1.7%

Sample

1st row: 충청북도 보은군 내북면 대안리
2nd row: 충청북도 보은군 내북면 대안리
3rd row: 충청북도 보은군 내북면 대안리
4th row: 충청북도 보은군 내북면 도원리
5th row: 충청북도 보은군 내북면 도원리

Value               Count  Frequency (%)
충청북도            909    25.0%
보은군              909    25.0%
보은읍              195    5.4%
탄부면              133    3.7%
삼승면              113    3.1%
수한면              103    2.8%
마로면              101    2.8%
산외면              80     2.2%
내북면              58     1.6%
장안면              49     1.3%
Other values (146)  986    27.1%

Most occurring characters

ValueCountFrequency (%)
2727
18.7%
1104
 
7.6%
1104
 
7.6%
981
 
6.7%
939
 
6.4%
916
 
6.3%
913
 
6.3%
909
 
6.2%
909
 
6.2%
714
 
4.9%
Other values (125) 3352
23.0%

Most occurring categories

Value            Count  Frequency (%)
Other Letter     11841  81.3%
Space Separator  2727   18.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1104
 
9.3%
1104
 
9.3%
981
 
8.3%
939
 
7.9%
916
 
7.7%
913
 
7.7%
909
 
7.7%
909
 
7.7%
714
 
6.0%
195
 
1.6%
Other values (124) 3157
26.7%
Space Separator
ValueCountFrequency (%)
2727
100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  11841  81.3%
Common  2727   18.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1104
 
9.3%
1104
 
9.3%
981
 
8.3%
939
 
7.9%
916
 
7.7%
913
 
7.7%
909
 
7.7%
909
 
7.7%
714
 
6.0%
195
 
1.6%
Other values (124) 3157
26.7%
Common
ValueCountFrequency (%)
2727
100.0%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  11841  81.3%
ASCII   2727   18.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2727
100.0%
Hangul
ValueCountFrequency (%)
1104
 
9.3%
1104
 
9.3%
981
 
8.3%
939
 
7.9%
916
 
7.7%
913
 
7.7%
909
 
7.7%
909
 
7.7%
714
 
6.0%
195
 
1.6%
Other values (124) 3157
26.7%

사육두수 (head count)
Real number (ℝ)

MISSING  ZEROS

Distinct: 231
Distinct (%): 26.4%
Missing: 35
Missing (%): 3.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 2905.5011
Minimum: 0
Maximum: 136000
Zeros: 26
Zeros (%): 2.9%
Negative: 0
Negative (%): 0.0%
Memory size: 8.1 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 12
Median: 37
Q3: 94
95-th percentile: 15000
Maximum: 136000
Range: 136000
Interquartile range (IQR): 82

Descriptive statistics

Standard deviation: 14067.652
Coefficient of variation (CV): 4.8417301
Kurtosis: 46.589849
Mean: 2905.5011
Median Absolute Deviation (MAD): 30
Skewness: 6.4192667
Sum: 2539408
Variance: 1.9789884 × 10^8
Monotonicity: Not monotonic
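
Several of these figures are tied together by simple arithmetic. The sketch below assumes that the mean and the distinct percentage are computed over the 874 non-missing values (909 observations minus 35 missing); that assumption reproduces the reported 2905.5011 and 26.4%, but it is inferred from the numbers rather than stated in the report. All variable names are illustrative.

```python
# Values copied from the 사육두수 statistics.
n_observations = 909
n_missing = 35
total = 2539408       # Sum
std_dev = 14067.652   # Standard deviation
q1, q3 = 12, 94
n_distinct = 231

# Assumption: statistics are computed over non-missing values only.
n_valid = n_observations - n_missing

mean = total / n_valid
cv = std_dev / mean                      # coefficient of variation
iqr = q3 - q1                            # interquartile range
variance = std_dev ** 2
distinct_pct = 100 * n_distinct / n_valid

print(f"mean={mean:.4f} cv={cv:.4f} iqr={iqr}")
print(f"variance={variance:.4e} distinct={distinct_pct:.1f}%")
```

The mean (about 2905) lies far above the median (37) and Q3 (94), which is consistent with the large positive skewness: a small number of very large farms dominate the sum.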
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
0                   26     2.9%
3                   24     2.6%
7                   22     2.4%
1                   19     2.1%
6                   17     1.9%
2                   17     1.9%
4                   17     1.9%
9                   16     1.8%
12                  15     1.7%
10                  15     1.7%
Other values (221)  686    75.5%
(Missing)           35     3.9%

Minimum 10 values

Value  Count  Frequency (%)
0      26     2.9%
1      19     2.1%
2      17     1.9%
3      24     2.6%
4      17     1.9%
5      13     1.4%
6      17     1.9%
7      22     2.4%
8      8      0.9%
9      16     1.8%

Maximum 10 values

Value   Count  Frequency (%)
136000  1      0.1%
134800  1      0.1%
133500  1      0.1%
133000  1      0.1%
93955   2      0.2%
93000   1      0.1%
91600   2      0.2%
80500   1      0.1%
78500   1      0.1%
66000   1      0.1%

사육업종 (livestock type)
Categorical

IMBALANCE

Distinct: 10
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 7.2 KiB

한우: 729
양계: 93
젖소: 27
양돈: 22
염소: 16
Other values (5): 22

Length

Max length: 3
Median length: 2
Mean length: 2.0022002
Min length: 2

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 한우
2nd row: 한우
3rd row: 한우
4th row: 한우
5th row: 한우

Common Values

Value   Count  Frequency (%)
한우    729    80.2%
양계    93     10.2%
젖소    27     3.0%
양돈    22     2.4%
염소    16     1.8%
산양    13     1.4%
육우    3      0.3%
사슴    3      0.3%
메추리  2      0.2%
면양    1      0.1%
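
The 65.5% imbalance figure in the Alerts section can be reproduced from these counts. The sketch below assumes that the score is one minus the Shannon entropy of the category shares, normalized by the logarithm of the number of categories; this definition matches the reported value here, but the report does not state its formula, so treat it as an assumption.

```python
import math

# Category counts copied from the "Common Values" table.
counts = {
    "한우": 729, "양계": 93, "젖소": 27, "양돈": 22, "염소": 16,
    "산양": 13, "육우": 3, "사슴": 3, "메추리": 2, "면양": 1,
}

total = sum(counts.values())
shares = {name: count / total for name, count in counts.items()}

# Shannon entropy in bits, then normalized by the maximum possible
# entropy for this number of categories (assumed imbalance definition).
entropy = -sum(p * math.log2(p) for p in shares.values())
imbalance = 1 - entropy / math.log2(len(counts))

print(f"Largest share: {100 * max(shares.values()):.1f}%")
print(f"Imbalance score: {100 * imbalance:.1f}%")
```

A score of 0 would mean perfectly even categories and a score of 1 a single category; here one category (한우) holds about four fifths of the rows.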

Length

Histogram of lengths of the category

Common Values (Plot)


Interactions


Correlations

          순번   농가명  사육두수  사육업종
순번      1.000  0.441   0.429     0.786
농가명    0.441  1.000   0.794     0.405
사육두수  0.429  0.794   1.000     0.508
사육업종  0.786  0.405   0.508     1.000

          순번   사육두수  사육업종
순번      1.000  0.137     0.351
사육두수  0.137  1.000     0.342
사육업종  0.351  0.342     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

     순번  농가명    주소                          사육두수  사육업종
0    1     최○○농장  충청북도 보은군 내북면 대안리  89        한우
1    2     변○○농장  충청북도 보은군 내북면 대안리  80        한우
2    3     이○○농장  충청북도 보은군 내북면 대안리  22        한우
3    4     최○○농장  충청북도 보은군 내북면 도원리  17        한우
4    5     이○○농장  충청북도 보은군 내북면 도원리  7         한우
5    6     김○○농장  충청북도 보은군 내북면 동산리  3         한우
6    7     이○○농장  충청북도 보은군 내북면 동산리  3         한우
7    8     백○○농장  충청북도 보은군 내북면 두평리  24        한우
8    9     김○○농장  충청북도 보은군 내북면 두평리  20        한우
9    10    천○○농장  충청북도 보은군 내북면 두평리  181       한우

Last rows

     순번  농가명    주소                          사육두수  사육업종
899  900   문○○농장  충청북도 보은군 수한면 오정리  <NA>      염소
900  901   최○○농장  충청북도 보은군 보은읍 용암리  <NA>      염소
901  902   김○○농장  충청북도 보은군 수한면 오정리  <NA>      면양
902  903   박○○농장  충청북도 보은군 내북면 동산리  <NA>      메추리
903  904   이○○농장  충청북도 보은군 산외면 원평리  <NA>      염소
904  905   권○○농장  충청북도 보은군 수한면 발산리  <NA>      염소
905  906   김○○농장  충청북도 보은군 보은읍 중초리  <NA>      염소
906  907   전○○농장  충청북도 보은군 마로면 갈전리  <NA>      산양
907  908   문○○농장  충청북도 보은군 삼승면 원남리  <NA>      산양
908  909   정○○농장  충청북도 보은군 산외면 길탕리  <NA>      염소