Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 10000 |
Missing cells | 134 |
Missing cells (%) | 0.1% |
Duplicate rows | 1577 |
Duplicate rows (%) | 15.8% |
Total size in memory | 820.3 KiB |
Average record size in memory | 84.0 B |
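The derived figures in this table follow from the raw counts. A minimal sketch, assuming that missing cells are divided by observations × variables and that 1 KiB = 1024 B (the variable names are illustrative, not taken from the report):

```python
# Raw counts copied from the Dataset statistics table.
n_variables = 9
n_observations = 10_000
missing_cells = 134
duplicate_rows = 1_577
total_memory_kib = 820.3

# Assumption: the missing-cell share is taken over all cells (rows x columns).
total_cells = n_observations * n_variables
missing_pct = 100 * missing_cells / total_cells

# Duplicate share is taken over rows.
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_memory_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the three printed values agree with the percentages and record size reported in the table.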
Variable types
Numeric | 3 |
---|---|
DateTime | 3 |
Categorical | 2 |
Boolean | 1 |
Dataset
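Description | Registered information on handover slips for the transport of liquid fertilizer, one of the livestock manure types managed in the Livestock Manure Electronic Handover Management System (가축분뇨 전자인계관리시스템). |
---|---|
Author | Korea Environment Corporation (한국환경공단) |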
URL | https://www.data.go.kr/data/15041900/fileData.do |
Dataset has 1577 (15.8%) duplicate rows | Duplicates |
인수량(톤) is highly overall correlated with 살포량(톤) | High correlation |
살포량(톤) is highly overall correlated with 인수량(톤) | High correlation |
보관장소 경유여부 is highly imbalanced (70.9%) | Imbalance |
살포일자 has 134 (1.3%) missing values | Missing |
살포량(톤) has 167 (1.7%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 05:58:32.113444 |
---|---|
Analysis finished | 2023-12-12 05:58:34.170936 |
Duration | 2.06 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
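The reported duration is the difference between the two timestamps. A small sketch using only the standard library, with the timestamps copied from the table:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2023-12-12 05:58:32.113444")
finished = datetime.fromisoformat("2023-12-12 05:58:34.170936")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```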
운반업체번호 (transport company number)
Real number (ℝ)
Distinct | 342 |
---|---|
Distinct (%) | 3.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0156373 × 10⁹ |
Minimum | 2.0130001 × 10⁹ |
---|---|
Maximum | 2.021 × 10⁹ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0130001 × 10⁹ |
---|---|
5-th percentile | 2.0130004 × 10⁹ |
Q1 | 2.0150003 × 10⁹ |
median | 2.0160008 × 10⁹ |
Q3 | 2.0160033 × 10⁹ |
95-th percentile | 2.0180105 × 10⁹ |
Maximum | 2.021 × 10⁹ |
Range | 7999876 |
Interquartile range (IQR) | 1002948 |
Descriptive statistics
Standard deviation | 1532742.8 |
---|---|
Coefficient of variation (CV) | 0.00076042592 |
Kurtosis | 2.6708767 |
Mean | 2.0156373 × 10⁹ |
Median Absolute Deviation (MAD) | 1000399 |
Skewness | 0.89062485 |
Sum | 2.0156373 × 10¹³ |
Variance | 2.3493006 × 10¹² |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2016000829 | 255 | 2.5% |
2021000007 | 237 | 2.4% |
2015000203 | 200 | 2.0% |
2015000311 | 149 | 1.5% |
2013000410 | 132 | 1.3% |
2013000369 | 130 | 1.3% |
2017000168 | 125 | 1.2% |
2016001702 | 122 | 1.2% |
2016002010 | 119 | 1.2% |
2015000094 | 117 | 1.2% |
Other values (332) | 8414 | 84.1% |
Value | Count | Frequency (%) |
2013000149 | 4 | < 0.1% |
2013000160 | 51 | 0.5% |
2013000202 | 1 | < 0.1% |
2013000217 | 7 | 0.1% |
2013000238 | 1 | < 0.1% |
2013000256 | 1 | < 0.1% |
2013000259 | 4 | < 0.1% |
2013000277 | 17 | 0.2% |
2013000286 | 1 | < 0.1% |
2013000297 | 2 | < 0.1% |
Value | Count | Frequency (%) |
2021000025 | 10 | 0.1% |
2021000007 | 237 | 2.4% |
2020000737 | 9 | 0.1% |
2020000645 | 2 | < 0.1% |
2020000642 | 7 | 0.1% |
2020000618 | 3 | < 0.1% |
2020000530 | 1 | < 0.1% |
2020000527 | 1 | < 0.1% |
2020000476 | 8 | 0.1% |
2020000413 | 20 | 0.2% |
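The Frequency (%) column in the tables above can be reproduced from the counts and the 10000 observations. The sketch below is an assumption about the display rule, inferred from the values shown (very small positive shares appear as "< 0.1%"); `format_frequency` is a hypothetical helper, not a function of the profiling library:

```python
def format_frequency(count: int, total: int) -> str:
    """Format count/total in the style of the Frequency (%) column."""
    share = count / total
    # Assumed rule: a positive share that rounds to 0.000 is shown as "< 0.1%".
    if share > 0 and round(share, 3) == 0:
        return "< 0.1%"
    return f"{share * 100:.1f}%"


total_rows = 10_000
# (value, count) pairs copied from the tables above.
for value, count in [(2021000007, 237), (2013000160, 51), (2013000149, 4)]:
    print(value, count, format_frequency(count, total_rows))
```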
인수일자 (receipt date)
Date
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2021-01-01 00:00:00 |
---|---|
Maximum | 2021-04-01 00:00:00 |
인수량(톤) (received quantity, tons)
Real number (ℝ)
HIGH CORRELATION
Distinct | 856 |
---|---|
Distinct (%) | 8.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 18.133698 |
Minimum | 2 |
---|---|
Maximum | 253 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 6.14 |
Q1 | 13.2925 |
median | 16.6 |
Q3 | 23 |
95-th percentile | 25 |
Maximum | 253 |
Range | 251 |
Interquartile range (IQR) | 9.7075 |
Descriptive statistics
Standard deviation | 12.216639 |
---|---|
Coefficient of variation (CV) | 0.67369817 |
Kurtosis | 119.46595 |
Mean | 18.133698 |
Median Absolute Deviation (MAD) | 6.4 |
Skewness | 8.387076 |
Sum | 181336.98 |
Variance | 149.24627 |
Monotonicity | Not monotonic |
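Several of the statistics above are simple functions of the others, which gives a quick consistency check. A minimal sketch using the rounded values printed for 인수량(톤), so agreement is only expected up to rounding:

```python
# Values copied from the statistics tables for 인수량(톤).
mean = 18.133698
std_dev = 12.216639
q1, q3 = 13.2925, 23.0
minimum, maximum = 2.0, 253.0
n_observations = 10_000

cv = std_dev / mean              # coefficient of variation
variance = std_dev ** 2          # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
total = mean * n_observations    # sum, valid because no values are missing

print(f"CV: {cv:.8f}")
print(f"Variance: {variance:.5f}")
print(f"IQR: {iqr:.4f}")
print(f"Range: {value_range:g}")
print(f"Sum: {total:.2f}")
```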
Value | Count | Frequency (%) |
15.0 | 1259 | 12.6% |
23.0 | 1136 | 11.4% |
24.0 | 869 | 8.7% |
22.0 | 463 | 4.6% |
8.0 | 458 | 4.6% |
14.0 | 452 | 4.5% |
20.0 | 450 | 4.5% |
16.0 | 446 | 4.5% |
21.0 | 370 | 3.7% |
7.0 | 356 | 3.6% |
Other values (846) | 3741 | 37.4% |
Value | Count | Frequency (%) |
2.0 | 5 | 0.1% |
2.46 | 1 | < 0.1% |
3.0 | 4 | < 0.1% |
3.5 | 1 | < 0.1% |
3.8 | 1 | < 0.1% |
3.9 | 1 | < 0.1% |
4.0 | 22 | 0.2% |
4.4 | 1 | < 0.1% |
4.5 | 20 | 0.2% |
4.89 | 1 | < 0.1% |
Value | Count | Frequency (%) |
253.0 | 2 | < 0.1% |
240.0 | 1 | < 0.1% |
230.0 | 3 | < 0.1% |
225.0 | 1 | < 0.1% |
200.0 | 2 | < 0.1% |
192.0 | 2 | < 0.1% |
184.0 | 2 | < 0.1% |
180.0 | 1 | < 0.1% |
175.0 | 1 | < 0.1% |
168.0 | 1 | < 0.1% |
살포일자 (spreading date)
Date
MISSING
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 134 |
Missing (%) | 1.3% |
Memory size | 156.2 KiB |
Minimum | 2021-01-01 00:00:00 |
---|---|
Maximum | 2021-04-01 00:00:00 |
살포량(톤) (spread quantity, tons)
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 857 |
---|---|
Distinct (%) | 8.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17.838678 |
Minimum | 0 |
---|---|
Maximum | 253 |
Zeros | 167 |
Zeros (%) | 1.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 6 |
Q1 | 13 |
median | 16 |
Q3 | 23 |
95-th percentile | 25 |
Maximum | 253 |
Range | 253 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 12.380399 |
---|---|
Coefficient of variation (CV) | 0.69401997 |
Kurtosis | 113.75996 |
Mean | 17.838678 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 8.0517992 |
Sum | 178386.78 |
Variance | 153.27427 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
15.0 | 1222 | 12.2% |
23.0 | 1126 | 11.3% |
24.0 | 848 | 8.5% |
8.0 | 458 | 4.6% |
14.0 | 449 | 4.5% |
16.0 | 446 | 4.5% |
20.0 | 439 | 4.4% |
22.0 | 431 | 4.3% |
21.0 | 360 | 3.6% |
7.0 | 356 | 3.6% |
Other values (847) | 3865 | 38.65% |
Value | Count | Frequency (%) |
0.0 | 167 | 1.7% |
2.0 | 5 | 0.1% |
2.46 | 1 | < 0.1% |
3.0 | 4 | < 0.1% |
3.5 | 1 | < 0.1% |
3.8 | 1 | < 0.1% |
3.9 | 1 | < 0.1% |
4.0 | 22 | 0.2% |
4.4 | 1 | < 0.1% |
4.5 | 19 | 0.2% |
Value | Count | Frequency (%) |
253.0 | 2 | < 0.1% |
240.0 | 1 | < 0.1% |
230.0 | 3 | < 0.1% |
225.0 | 1 | < 0.1% |
200.0 | 2 | < 0.1% |
192.0 | 2 | < 0.1% |
184.0 | 2 | < 0.1% |
180.0 | 1 | < 0.1% |
175.0 | 1 | < 0.1% |
168.0 | 1 | < 0.1% |
마감처리일자 (closing date)
Date
Distinct | 9 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2021-01-01 00:00:00 |
---|---|
Maximum | 2022-12-01 00:00:00 |
인계량입력업체구분 (type of company that entered the handover quantity)
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | E1 |
---|---|
2nd row | E1 |
3rd row | E1 |
4th row | E1 |
5th row | E1 |
Common Values
Value | Count | Frequency (%) |
E1 | 6677 | 66.8% |
T1 | 3323 | 33.2% |
인계서입력구분 (handover slip entry type)
Categorical
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 2 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
2 | 6256 | 62.6% |
1 | 3686 | 36.9% |
4 | 45 | 0.4% |
5 | 13 | 0.1% |
보관장소 경유여부 (routed via storage site)
Boolean
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 87.9 KiB |
Value | Count | Frequency (%) |
False | 9488 | 94.9% |
True | 512 | 5.1% |
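The 70.9% imbalance figure reported for this variable is consistent with a score of one minus the normalized Shannon entropy of the class shares. That formula is an assumption here: the report does not state it, but it reproduces the reported figure from the counts above. A minimal sketch:

```python
import math

# Class counts copied from the table above.
counts = {"False": 9488, "True": 512}

total = sum(counts.values())
probabilities = [c / total for c in counts.values()]

# Shannon entropy in bits; a perfectly balanced two-class variable has 1 bit.
entropy = -sum(p * math.log2(p) for p in probabilities if p > 0)
max_entropy = math.log2(len(counts))

# Assumed score: 0 means perfectly balanced, 1 means a single class.
imbalance = 1 - entropy / max_entropy
print(f"Imbalance: {imbalance:.1%}")
```

A score near 1 means almost all rows fall in one class, which matches the 9488 to 512 split.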
Variable | 운반업체번호 | 인수일자 | 인수량(톤) | 살포일자 | 살포량(톤) | 마감처리일자 | 인계량입력업체구분 | 인계서입력구분 | 보관장소 경유여부 |
---|---|---|---|---|---|---|---|---|---|
운반업체번호 | 1.000 | 0.128 | 0.076 | 0.128 | 0.082 | 0.225 | 0.318 | 0.198 | 0.215 |
인수일자 | 0.128 | 1.000 | 0.038 | 1.000 | 0.054 | 0.890 | 0.060 | 0.119 | 0.017 |
인수량(톤) | 0.076 | 0.038 | 1.000 | 0.040 | 0.999 | 0.000 | 0.056 | 0.054 | 0.074 |
살포일자 | 0.128 | 1.000 | 0.040 | 1.000 | 0.054 | 0.891 | 0.056 | 0.116 | 0.035 |
살포량(톤) | 0.082 | 0.054 | 0.999 | 0.054 | 1.000 | 0.185 | 0.093 | 0.062 | 0.083 |
마감처리일자 | 0.225 | 0.890 | 0.000 | 0.891 | 0.185 | 1.000 | 0.090 | 0.158 | 0.029 |
인계량입력업체구분 | 0.318 | 0.060 | 0.056 | 0.056 | 0.093 | 0.090 | 1.000 | 0.617 | 0.094 |
인계서입력구분 | 0.198 | 0.119 | 0.054 | 0.116 | 0.062 | 0.158 | 0.617 | 1.000 | 0.033 |
보관장소 경유여부 | 0.215 | 0.017 | 0.074 | 0.035 | 0.083 | 0.029 | 0.094 | 0.033 | 1.000 |
Variable | 보관장소 경유여부 | 인계서입력구분 | 인계량입력업체구분 |
---|---|---|---|
보관장소 경유여부 | 1.000 | 0.022 | 0.060 |
인계서입력구분 | 0.022 | 1.000 | 0.428 |
인계량입력업체구분 | 0.060 | 0.428 | 1.000 |
Variable | 운반업체번호 | 인수량(톤) | 살포량(톤) | 인계량입력업체구분 | 인계서입력구분 | 보관장소 경유여부 |
---|---|---|---|---|---|---|
운반업체번호 | 1.000 | -0.060 | -0.058 | 0.318 | 0.128 | 0.214 |
인수량(톤) | -0.060 | 1.000 | 0.969 | 0.043 | 0.032 | 0.057 |
살포량(톤) | -0.058 | 0.969 | 1.000 | 0.071 | 0.037 | 0.063 |
인계량입력업체구분 | 0.318 | 0.043 | 0.071 | 1.000 | 0.428 | 0.060 |
인계서입력구분 | 0.128 | 0.032 | 0.037 | 0.428 | 1.000 | 0.022 |
보관장소 경유여부 | 0.214 | 0.057 | 0.063 | 0.060 | 0.022 | 1.000 |
Index | 운반업체번호 | 인수일자 | 인수량(톤) | 살포일자 | 살포량(톤) | 마감처리일자 | 인계량입력업체구분 | 인계서입력구분 | 보관장소 경유여부 |
---|---|---|---|---|---|---|---|---|---|
16291 | 2015000443 | 2021-01 | 7.0 | 2021-01 | 7.0 | 2021-02 | E1 | 1 | N |
35652 | 2014000241 | 2021-02 | 16.0 | 2021-02 | 16.0 | 2021-02 | E1 | 1 | N |
29332 | 2013000378 | 2021-02 | 15.0 | 2021-02 | 15.0 | 2021-02 | E1 | 2 | N |
92330 | 2016000826 | 2021-04 | 23.43 | 2021-04 | 23.43 | 2021-05 | E1 | 1 | N |
48370 | 2017000624 | 2021-03 | 15.0 | 2021-03 | 15.0 | 2021-03 | E1 | 1 | N |
35047 | 2016000783 | 2021-02 | 8.0 | 2021-02 | 8.0 | 2021-02 | E1 | 2 | N |
77366 | 2016001702 | 2021-03 | 7.0 | 2021-03 | 7.0 | 2021-04 | E1 | 2 | N |
250 | 2016000744 | 2021-01 | 23.0 | 2021-01 | 23.0 | 2021-01 | E1 | 2 | N |
86415 | 2015000418 | 2021-04 | 16.0 | 2021-04 | 16.0 | 2021-04 | E1 | 1 | N |
94017 | 2013000379 | 2021-04 | 20.0 | 2021-04 | 20.0 | 2021-04 | E1 | 1 | N |
Index | 운반업체번호 | 인수일자 | 인수량(톤) | 살포일자 | 살포량(톤) | 마감처리일자 | 인계량입력업체구분 | 인계서입력구분 | 보관장소 경유여부 |
---|---|---|---|---|---|---|---|---|---|
39248 | 2014000276 | 2021-02 | 22.0 | 2021-02 | 22.0 | 2021-02 | T1 | 2 | Y |
63574 | 2017001023 | 2021-03 | 23.0 | 2021-03 | 23.0 | 2021-03 | T1 | 2 | N |
12893 | 2016002854 | 2021-01 | 15.0 | 2021-01 | 15.0 | 2021-01 | E1 | 1 | N |
46464 | 2017001023 | 2021-02 | 23.0 | 2021-02 | 23.0 | 2021-03 | T1 | 2 | N |
1503 | 2015000366 | 2021-01 | 16.0 | 2021-01 | 16.0 | 2021-01 | T1 | 2 | N |
58170 | 2016000963 | 2021-03 | 15.0 | 2021-03 | 15.0 | 2021-03 | E1 | 1 | N |
27549 | 2016002010 | 2021-02 | 22.24 | 2021-02 | 22.24 | 2021-02 | E1 | 1 | N |
2617 | 2016000829 | 2021-01 | 6.13 | 2021-01 | 6.13 | 2021-01 | E1 | 2 | N |
3291 | 2015000352 | 2021-01 | 24.0 | 2021-01 | 24.0 | 2021-01 | T1 | 2 | N |
94764 | 2016004105 | 2021-04 | 16.0 | 2021-04 | 16.0 | 2021-04 | E1 | 1 | N |
Most frequently occurring
Index | 운반업체번호 | 인수일자 | 인수량(톤) | 살포일자 | 살포량(톤) | 마감처리일자 | 인계량입력업체구분 | 인계서입력구분 | 보관장소 경유여부 | # duplicates |
---|---|---|---|---|---|---|---|---|---|---|
36 | 2013000369 | 2021-03 | 16.0 | 2021-03 | 16.0 | 2021-03 | E1 | 2 | N | 42 |
1152 | 2016003180 | 2021-03 | 7.4 | 2021-03 | 7.4 | 2021-03 | T1 | 2 | N | 41 |
113 | 2013000410 | 2021-02 | 23.0 | 2021-02 | 23.0 | 2021-02 | E1 | 2 | N | 38 |
1355 | 2017000624 | 2021-03 | 15.0 | 2021-03 | 15.0 | 2021-03 | E1 | 1 | N | 34 |
219 | 2014000243 | 2021-03 | 15.0 | 2021-03 | 15.0 | 2021-03 | E1 | 2 | N | 33 |
394 | 2015000311 | 2021-02 | 15.0 | 2021-02 | 15.0 | 2021-02 | T1 | 2 | Y | 33 |
398 | 2015000311 | 2021-03 | 15.0 | 2021-03 | 15.0 | 2021-03 | T1 | 2 | Y | 33 |
39 | 2013000369 | 2021-04 | 16.0 | 2021-04 | 16.0 | 2021-04 | E1 | 2 | N | 32 |
910 | 2016001154 | 2021-03 | 8.0 | 2021-03 | 8.0 | 2021-03 | T1 | 2 | N | 31 |
285 | 2014000278 | 2021-03 | 15.0 | 2021-03 | 15.0 | 2021-03 | E1 | 1 | N | 30 |