Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells618
Missing cells (%)0.9%
Duplicate rows1290
Duplicate rows (%)12.9%
Total size in memory683.6 KiB
Average record size in memory70.0 B

Variable types

Numeric5
Categorical2

Dataset

Description가축분뇨 전자인계관리시스템과 농림사업정보시스템(Agrix)간 연계하여 송수/신하는 가축분뇨 및 퇴액비 관리대장 정보입니다.
Author한국환경공단
URLhttps://www.data.go.kr/data/15041936/fileData.do

Alerts

액비살포 면적 has constant value ""Constant
Dataset has 1290 (12.9%) duplicate rowsDuplicates
자가처리량 is highly overall correlated with 위탁축분High correlation
퇴액비생산량 is highly overall correlated with 퇴액비처분량 and 1 other fieldsHigh correlation
퇴액비처분량 is highly overall correlated with 퇴액비생산량 and 1 other fieldsHigh correlation
액비살포량 is highly overall correlated with 위탁축분High correlation
위탁축분 is highly overall correlated with 자가처리량 and 3 other fieldsHigh correlation
퇴액비생산량 has 618 (6.2%) missing valuesMissing
자가처리량 is highly skewed (γ1 = 53.4394363)Skewed
퇴액비생산량 is highly skewed (γ1 = 27.9991341)Skewed
자가처리량 has 9984 (99.8%) zerosZeros
위탁량 has 1320 (13.2%) zerosZeros
퇴액비생산량 has 9343 (93.4%) zerosZeros
퇴액비처분량 has 9357 (93.6%) zerosZeros
액비살포량 has 9606 (96.1%) zerosZeros

Reproduction

Analysis started2023-12-12 08:53:19.228249
Analysis finished2023-12-12 08:53:23.798619
Duration4.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자가처리량
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0845
Minimum0
Maximum225
Zeros9984
Zeros (%)99.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:53:23.891035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum225
Range225
Interquartile range (IQR)0

Descriptive statistics

Standard deviation3.0297323
Coefficient of variation (CV)35.85482
Kurtosis3421.8472
Mean0.0845
Median Absolute Deviation (MAD)0
Skewness53.439436
Sum845
Variance9.1792777
MonotonicityNot monotonic
2023-12-12T17:53:24.065247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
0 9984
99.8%
20 2
 
< 0.1%
15 2
 
< 0.1%
37 1
 
< 0.1%
11 1
 
< 0.1%
10 1
 
< 0.1%
108 1
 
< 0.1%
30 1
 
< 0.1%
96 1
 
< 0.1%
41 1
 
< 0.1%
Other values (5) 5
 
0.1%
ValueCountFrequency (%)
0 9984
99.8%
10 1
 
< 0.1%
11 1
 
< 0.1%
15 2
 
< 0.1%
20 2
 
< 0.1%
21 1
 
< 0.1%
30 1
 
< 0.1%
36 1
 
< 0.1%
37 1
 
< 0.1%
41 1
 
< 0.1%
ValueCountFrequency (%)
225 1
< 0.1%
108 1
< 0.1%
96 1
< 0.1%
94 1
< 0.1%
66 1
< 0.1%
41 1
< 0.1%
37 1
< 0.1%
36 1
< 0.1%
30 1
< 0.1%
21 1
< 0.1%

위탁축분
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
혼합
7350 
<NA>
1073 
 
707
액비
 
690
 
164

Length

Max length4
Median length2
Mean length2.1275
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row혼합
2nd row<NA>
3rd row혼합
4th row
5th row혼합

Common Values

ValueCountFrequency (%)
혼합 7350
73.5%
<NA> 1073
 
10.7%
707
 
7.1%
액비 690
 
6.9%
164
 
1.6%
퇴비 16
 
0.2%

Length

2023-12-12T17:53:24.207178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:53:24.364122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
혼합 7350
73.5%
na 1073
 
10.7%
707
 
7.1%
액비 690
 
6.9%
164
 
1.6%
퇴비 16
 
0.2%

위탁량
Real number (ℝ)

ZEROS 

Distinct3743
Distinct (%)37.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44.990478
Minimum0
Maximum1495
Zeros1320
Zeros (%)13.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:53:24.519499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q114.6775
median30.155
Q362.8525
95-th percentile125.883
Maximum1495
Range1495
Interquartile range (IQR)48.175

Descriptive statistics

Standard deviation52.018222
Coefficient of variation (CV)1.1562051
Kurtosis118.80036
Mean44.990478
Median Absolute Deviation (MAD)19.845
Skewness6.3819282
Sum449904.78
Variance2705.8954
MonotonicityNot monotonic
2023-12-12T17:53:24.677022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 1320
 
13.2%
48.0 142
 
1.4%
23.0 140
 
1.4%
25.0 139
 
1.4%
15.0 136
 
1.4%
24.0 134
 
1.3%
30.0 126
 
1.3%
40.0 117
 
1.2%
45.0 100
 
1.0%
20.0 98
 
1.0%
Other values (3733) 7548
75.5%
ValueCountFrequency (%)
0.0 1320
13.2%
0.89 1
 
< 0.1%
1.0 32
 
0.3%
1.65 1
 
< 0.1%
1.68 1
 
< 0.1%
2.6 1
 
< 0.1%
3.47 1
 
< 0.1%
3.6 1
 
< 0.1%
3.63 1
 
< 0.1%
4.0 41
 
0.4%
ValueCountFrequency (%)
1495.0 1
< 0.1%
1372.7 1
< 0.1%
778.5 1
< 0.1%
700.0 1
< 0.1%
576.0 1
< 0.1%
551.0 1
< 0.1%
536.0 1
< 0.1%
529.0 1
< 0.1%
505.0 1
< 0.1%
483.0 1
< 0.1%

퇴액비생산량
Real number (ℝ)

HIGH CORRELATION  MISSING  SKEWED  ZEROS 

Distinct13
Distinct (%)0.1%
Missing618
Missing (%)6.2%
Infinite0
Infinite (%)0.0%
Mean0.12848433
Minimum0
Maximum140
Zeros9343
Zeros (%)93.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:53:24.833269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum140
Range140
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.8990434
Coefficient of variation (CV)22.5634
Kurtosis925.14787
Mean0.12848433
Median Absolute Deviation (MAD)0
Skewness27.999134
Sum1205.44
Variance8.4044526
MonotonicityNot monotonic
2023-12-12T17:53:24.966701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
0.0 9343
93.4%
60.0 11
 
0.1%
5.0 10
 
0.1%
7.0 6
 
0.1%
0.6 3
 
< 0.1%
75.0 2
 
< 0.1%
15.0 1
 
< 0.1%
56.0 1
 
< 0.1%
2.64 1
 
< 0.1%
65.0 1
 
< 0.1%
Other values (3) 3
 
< 0.1%
(Missing) 618
 
6.2%
ValueCountFrequency (%)
0.0 9343
93.4%
0.6 3
 
< 0.1%
2.64 1
 
< 0.1%
3.0 1
 
< 0.1%
5.0 10
 
0.1%
7.0 6
 
0.1%
15.0 1
 
< 0.1%
20.0 1
 
< 0.1%
56.0 1
 
< 0.1%
60.0 11
 
0.1%
ValueCountFrequency (%)
140.0 1
 
< 0.1%
75.0 2
 
< 0.1%
65.0 1
 
< 0.1%
60.0 11
0.1%
56.0 1
 
< 0.1%
20.0 1
 
< 0.1%
15.0 1
 
< 0.1%
7.0 6
0.1%
5.0 10
0.1%
3.0 1
 
< 0.1%

퇴액비처분량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct264
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.3154937
Minimum0
Maximum1247
Zeros9357
Zeros (%)93.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:53:25.098549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile35.05
Maximum1247
Range1247
Interquartile range (IQR)0

Descriptive statistics

Standard deviation41.785176
Coefficient of variation (CV)5.7118737
Kurtosis202.45383
Mean7.3154937
Median Absolute Deviation (MAD)0
Skewness11.207262
Sum73154.937
Variance1746.0009
MonotonicityNot monotonic
2023-12-12T17:53:25.271582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 9357
93.6%
46.0 26
 
0.3%
15.0 20
 
0.2%
60.0 19
 
0.2%
23.0 18
 
0.2%
48.0 14
 
0.1%
44.0 12
 
0.1%
24.0 12
 
0.1%
22.0 12
 
0.1%
144.0 11
 
0.1%
Other values (254) 499
 
5.0%
ValueCountFrequency (%)
0.0 9357
93.6%
2.64 1
 
< 0.1%
5.0 3
 
< 0.1%
6.0 1
 
< 0.1%
6.4 1
 
< 0.1%
6.5 2
 
< 0.1%
7.0 4
 
< 0.1%
7.5 1
 
< 0.1%
8.0 3
 
< 0.1%
10.0 3
 
< 0.1%
ValueCountFrequency (%)
1247.0 1
< 0.1%
1151.4 1
< 0.1%
748.4 1
< 0.1%
718.5 1
< 0.1%
709.0 1
< 0.1%
655.0 1
< 0.1%
616.1 1
< 0.1%
574.0 1
< 0.1%
564.8 1
< 0.1%
536.0 1
< 0.1%

액비살포량
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct176
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.3923681
Minimum0
Maximum1248
Zeros9606
Zeros (%)96.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:53:25.798438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum1248
Range1248
Interquartile range (IQR)0

Descriptive statistics

Standard deviation33.731657
Coefficient of variation (CV)7.6796061
Kurtosis488.8649
Mean4.3923681
Median Absolute Deviation (MAD)0
Skewness17.391915
Sum43923.681
Variance1137.8247
MonotonicityNot monotonic
2023-12-12T17:53:26.000924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 9606
96.1%
23.0 23
 
0.2%
46.0 13
 
0.1%
48.0 12
 
0.1%
92.0 10
 
0.1%
24.0 10
 
0.1%
60.0 9
 
0.1%
72.0 8
 
0.1%
30.0 8
 
0.1%
22.0 7
 
0.1%
Other values (166) 294
 
2.9%
ValueCountFrequency (%)
0.0 9606
96.1%
7.0 1
 
< 0.1%
7.5 1
 
< 0.1%
8.0 2
 
< 0.1%
10.0 3
 
< 0.1%
14.0 1
 
< 0.1%
15.0 6
 
0.1%
16.0 4
 
< 0.1%
18.0 3
 
< 0.1%
20.0 4
 
< 0.1%
ValueCountFrequency (%)
1248.0 1
 
< 0.1%
1247.0 1
 
< 0.1%
1013.0 1
 
< 0.1%
483.0 1
 
< 0.1%
464.0 1
 
< 0.1%
423.0 1
 
< 0.1%
416.0 1
 
< 0.1%
384.0 2
< 0.1%
375.0 3
< 0.1%
359.0 1
 
< 0.1%

액비살포 면적
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 10000
100.0%

Length

2023-12-12T17:53:26.178985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:53:26.283301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 10000
100.0%

Interactions

2023-12-12T17:53:22.821227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:20.356748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:20.975704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:21.563509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:22.119533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:22.931423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:20.472096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:21.078513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:21.664494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:22.238044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:23.073760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:20.607741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:21.204569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:21.791470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:22.368081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:23.220644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:20.755665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:21.343297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:21.896971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:22.492086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:23.334818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:20.878147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:21.444676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:22.000890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:53:22.685543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:53:26.351043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자가처리량위탁축분위탁량퇴액비생산량퇴액비처분량액비살포량
자가처리량1.000NaN0.0000.0000.0000.000
위탁축분NaN1.0000.248NaNNaNNaN
위탁량0.0000.2481.0000.0000.0000.000
퇴액비생산량0.000NaN0.0001.0001.0000.000
퇴액비처분량0.000NaN0.0001.0001.0000.000
액비살포량0.000NaN0.0000.0000.0001.000
2023-12-12T17:53:26.509741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자가처리량위탁량퇴액비생산량퇴액비처분량액비살포량위탁축분
자가처리량1.000-0.060-0.003-0.010-0.0081.000
위탁량-0.0601.000-0.104-0.394-0.3050.161
퇴액비생산량-0.003-0.1041.0000.832-0.0141.000
퇴액비처분량-0.010-0.3940.8321.000-0.0531.000
액비살포량-0.008-0.305-0.014-0.0531.0001.000
위탁축분1.0000.1611.0001.0001.0001.000

Missing values

2023-12-12T17:53:23.502435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:53:23.714069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자가처리량위탁축분위탁량퇴액비생산량퇴액비처분량액비살포량액비살포 면적
359740혼합49.850.00.00.00
294360<NA>0.00.00.080.00
360830혼합45.00.00.00.00
5531040.00.00.00.00
260470혼합30.20.00.00.00
31589039.840.00.00.00
4206054.770.00.00.00
298650혼합119.00.00.00.00
150880혼합32.110.00.00.00
48130혼합10.860.00.00.00
자가처리량위탁축분위탁량퇴액비생산량퇴액비처분량액비살포량액비살포 면적
53920혼합75.180.00.00.00
263580혼합46.750.00.00.00
328670혼합6.220.00.00.00
251030혼합20.470.00.00.00
205000혼합178.430.00.00.00
27421015.00.00.00.00
85260혼합97.490.00.00.00
188390혼합24.00.00.00.00
100050혼합119.080.00.00.00
241190혼합25.00.00.00.00

Duplicate rows

Most frequently occurring

자가처리량위탁축분위탁량퇴액비생산량퇴액비처분량액비살포량액비살포 면적# duplicates
1540혼합0.00.00.00.00239
5340혼합25.00.00.00.00130
4490혼합23.00.00.00.00119
8730혼합48.00.00.00.00113
4970혼합24.00.00.00.00107
7260혼합40.00.00.00.00107
2860혼합15.00.00.00.0091
8970혼합50.00.00.00.0081
8240혼합46.00.00.00.0080
6240혼합30.00.00.00.0072