Overview

Dataset statistics

Number of variables5
Number of observations1850
Missing cells18
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory77.8 KiB
Average record size in memory43.1 B

Variable types

Numeric3
Text1
Categorical1

Dataset

Description전북특별자치도 김제시 축산업현황입니다.
Author전라북도
URLhttps://www.bigdatahub.go.kr/index.jeonbuk?startPage=7&menuCd=DOM_000000103007001000&pListTypeStr=&pId=15034296

Alerts

사육두수 is highly overall correlated with 면적High correlation
면적 is highly overall correlated with 사육두수High correlation
주사육업종 is highly imbalanced (57.0%)Imbalance
사육두수 has 19 (1.0%) zerosZeros

Reproduction

Analysis started2024-03-14 00:07:50.626317
Analysis finished2024-03-14 00:07:51.787273
Duration1.16 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

Distinct1832
Distinct (%)100.0%
Missing18
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean916.5
Minimum1
Maximum1832
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size16.4 KiB
2024-03-14T09:07:51.839776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile92.55
Q1458.75
median916.5
Q31374.25
95-th percentile1740.45
Maximum1832
Range1831
Interquartile range (IQR)915.5

Descriptive statistics

Standard deviation528.99716
Coefficient of variation (CV)0.57719276
Kurtosis-1.2
Mean916.5
Median Absolute Deviation (MAD)458
Skewness0
Sum1679028
Variance279838
MonotonicityStrictly increasing
2024-03-14T09:07:51.945929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1232 1
 
0.1%
1230 1
 
0.1%
1229 1
 
0.1%
1228 1
 
0.1%
1227 1
 
0.1%
1226 1
 
0.1%
1225 1
 
0.1%
1224 1
 
0.1%
1223 1
 
0.1%
1222 1
 
0.1%
Other values (1822) 1822
98.5%
(Missing) 18
 
1.0%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1832 1
0.1%
1831 1
0.1%
1830 1
0.1%
1829 1
0.1%
1828 1
0.1%
1827 1
0.1%
1826 1
0.1%
1825 1
0.1%
1824 1
0.1%
1823 1
0.1%
Distinct1415
Distinct (%)76.5%
Missing0
Missing (%)0.0%
Memory size14.6 KiB
2024-03-14T09:07:52.177318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length4
Mean length4.4621622
Min length2

Characters and Unicode

Total characters8255
Distinct characters399
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1148 ?
Unique (%)62.1%

Sample

1st row길정해농장
2nd row형제목장
3rd row라니농장
4th row쌍용농장
5th row신흥농장
ValueCountFrequency (%)
대성농장 12
 
0.6%
우리농장 12
 
0.6%
희망농장 10
 
0.5%
보람농장 9
 
0.5%
농장 9
 
0.5%
형제농장 8
 
0.4%
신흥농장 8
 
0.4%
농업회사법인 6
 
0.3%
영농조합법인 6
 
0.3%
쌍용농장 6
 
0.3%
Other values (1418) 1804
95.4%
2024-03-14T09:07:52.609283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1725
20.9%
1717
20.8%
146
 
1.8%
134
 
1.6%
122
 
1.5%
119
 
1.4%
94
 
1.1%
88
 
1.1%
86
 
1.0%
78
 
0.9%
Other values (389) 3946
47.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8115
98.3%
Decimal Number 50
 
0.6%
Space Separator 40
 
0.5%
Uppercase Letter 19
 
0.2%
Open Punctuation 12
 
0.1%
Close Punctuation 12
 
0.1%
Lowercase Letter 4
 
< 0.1%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1725
21.3%
1717
21.2%
146
 
1.8%
134
 
1.7%
122
 
1.5%
119
 
1.5%
94
 
1.2%
88
 
1.1%
86
 
1.1%
78
 
1.0%
Other values (366) 3806
46.9%
Uppercase Letter
ValueCountFrequency (%)
D 4
21.1%
K 3
15.8%
I 3
15.8%
A 2
10.5%
J 2
10.5%
S 1
 
5.3%
P 1
 
5.3%
G 1
 
5.3%
C 1
 
5.3%
N 1
 
5.3%
Decimal Number
ValueCountFrequency (%)
2 33
66.0%
1 11
 
22.0%
3 5
 
10.0%
4 1
 
2.0%
Lowercase Letter
ValueCountFrequency (%)
m 1
25.0%
a 1
25.0%
e 1
25.0%
r 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 2
66.7%
& 1
33.3%
Space Separator
ValueCountFrequency (%)
40
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8115
98.3%
Common 117
 
1.4%
Latin 23
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1725
21.3%
1717
21.2%
146
 
1.8%
134
 
1.7%
122
 
1.5%
119
 
1.5%
94
 
1.2%
88
 
1.1%
86
 
1.1%
78
 
1.0%
Other values (366) 3806
46.9%
Latin
ValueCountFrequency (%)
D 4
17.4%
K 3
13.0%
I 3
13.0%
A 2
8.7%
J 2
8.7%
S 1
 
4.3%
m 1
 
4.3%
a 1
 
4.3%
e 1
 
4.3%
r 1
 
4.3%
Other values (4) 4
17.4%
Common
ValueCountFrequency (%)
40
34.2%
2 33
28.2%
( 12
 
10.3%
) 12
 
10.3%
1 11
 
9.4%
3 5
 
4.3%
, 2
 
1.7%
& 1
 
0.9%
4 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8115
98.3%
ASCII 140
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1725
21.3%
1717
21.2%
146
 
1.8%
134
 
1.7%
122
 
1.5%
119
 
1.5%
94
 
1.2%
88
 
1.1%
86
 
1.1%
78
 
1.0%
Other values (366) 3806
46.9%
ASCII
ValueCountFrequency (%)
40
28.6%
2 33
23.6%
( 12
 
8.6%
) 12
 
8.6%
1 11
 
7.9%
3 5
 
3.6%
D 4
 
2.9%
K 3
 
2.1%
I 3
 
2.1%
A 2
 
1.4%
Other values (13) 15
 
10.7%

주사육업종
Categorical

IMBALANCE 

Distinct11
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size14.6 KiB
한우
1329 
돼지
209 
산란계
 
132
육계
 
103
젖소
 
28
Other values (6)
 
49

Length

Max length3
Median length2
Mean length2.0718919
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row돼지
2nd row한우
3rd row한우
4th row한우
5th row돼지

Common Values

ValueCountFrequency (%)
한우 1329
71.8%
돼지 209
 
11.3%
산란계 132
 
7.1%
육계 103
 
5.6%
젖소 28
 
1.5%
오리 21
 
1.1%
염소 10
 
0.5%
사슴 10
 
0.5%
육우 5
 
0.3%
산양 2
 
0.1%

Length

2024-03-14T09:07:52.732282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 1329
71.8%
돼지 209
 
11.3%
산란계 132
 
7.1%
육계 103
 
5.6%
젖소 28
 
1.5%
오리 21
 
1.1%
염소 10
 
0.5%
사슴 10
 
0.5%
육우 5
 
0.3%
산양 2
 
0.1%

사육두수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct239
Distinct (%)12.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4117.5897
Minimum0
Maximum300000
Zeros19
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size16.4 KiB
2024-03-14T09:07:52.852313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q110
median28.5
Q3177.5
95-th percentile28550
Maximum300000
Range300000
Interquartile range (IQR)167.5

Descriptive statistics

Standard deviation16031.075
Coefficient of variation (CV)3.8933153
Kurtosis111.29733
Mean4117.5897
Median Absolute Deviation (MAD)23.5
Skewness8.6675121
Sum7617541
Variance2.5699536 × 108
MonotonicityNot monotonic
2024-03-14T09:07:52.956957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 112
 
6.1%
5 82
 
4.4%
50 74
 
4.0%
20 74
 
4.0%
30 69
 
3.7%
2 52
 
2.8%
15 50
 
2.7%
3 49
 
2.6%
100 48
 
2.6%
4 48
 
2.6%
Other values (229) 1192
64.4%
ValueCountFrequency (%)
0 19
 
1.0%
1 13
 
0.7%
2 52
2.8%
3 49
2.6%
4 48
2.6%
5 82
4.4%
6 47
2.5%
7 45
2.4%
8 40
2.2%
9 37
2.0%
ValueCountFrequency (%)
300000 1
 
0.1%
230000 1
 
0.1%
200000 1
 
0.1%
168000 1
 
0.1%
140000 1
 
0.1%
120030 1
 
0.1%
120000 1
 
0.1%
110000 2
0.1%
85000 1
 
0.1%
80000 3
0.2%

면적
Real number (ℝ)

HIGH CORRELATION 

Distinct1086
Distinct (%)58.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean998.77568
Minimum0
Maximum30119
Zeros5
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size16.4 KiB
2024-03-14T09:07:53.078783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile50
Q1197
median560
Q31315.5
95-th percentile3377.75
Maximum30119
Range30119
Interquartile range (IQR)1118.5

Descriptive statistics

Standard deviation1416.253
Coefficient of variation (CV)1.4179891
Kurtosis110.14742
Mean998.77568
Median Absolute Deviation (MAD)424
Skewness7.0958887
Sum1847735
Variance2005772.7
MonotonicityNot monotonic
2024-03-14T09:07:53.184029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
300 17
 
0.9%
150 17
 
0.9%
360 13
 
0.7%
160 13
 
0.7%
600 12
 
0.6%
192 11
 
0.6%
200 11
 
0.6%
50 11
 
0.6%
60 11
 
0.6%
90 10
 
0.5%
Other values (1076) 1724
93.2%
ValueCountFrequency (%)
0 5
0.3%
10 1
 
0.1%
11 2
 
0.1%
12 4
0.2%
13 2
 
0.1%
15 1
 
0.1%
18 1
 
0.1%
19 1
 
0.1%
20 5
0.3%
21 1
 
0.1%
ValueCountFrequency (%)
30119 1
0.1%
16608 1
0.1%
12791 1
0.1%
10781 1
0.1%
8814 1
0.1%
8113 1
0.1%
7968 1
0.1%
7920 1
0.1%
7825 1
0.1%
7734 1
0.1%

Interactions

2024-03-14T09:07:51.366280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:07:50.893794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:07:51.124905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:07:51.475951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:07:50.973623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:07:51.203527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:07:51.564849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:07:51.047788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T09:07:51.273545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T09:07:53.250583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주사육업종사육두수면적
연번1.0000.3810.0990.110
주사육업종0.3811.0000.7080.316
사육두수0.0990.7081.0000.806
면적0.1100.3160.8061.000
2024-03-14T09:07:53.323198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사육두수면적주사육업종
연번1.000-0.347-0.2580.172
사육두수-0.3471.0000.7330.419
면적-0.2580.7331.0000.161
주사육업종0.1720.4190.1611.000

Missing values

2024-03-14T09:07:51.663715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T09:07:51.757312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업장명칭주사육업종사육두수면적
01길정해농장돼지151311
12형제목장한우1701828
23라니농장한우37560
34쌍용농장한우25360
45신흥농장돼지9001055
56천일농장돼지22002553
67으뜸농장돼지45064321
78정말농장돼지10004187
89윤농장한우301075
910덕산농장한우75755
연번사업장명칭주사육업종사육두수면적
1840<NA>남문농장한우49890
1841<NA>삼성농장돼지120255
1842<NA>상옥농장한우1903460
1843<NA>민희농장한우2255
1844<NA>배미농장한우14372
1845<NA>영천농장한우8814
1846<NA>삼백농장한우1601656
1847<NA>주식회사 하림육계100001387
1848<NA>도담농장한우30384
1849<NA>은정농장한우30232