Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 1722 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 166 |
Duplicate rows (%) | 9.6% |
Total size in memory | 89.3 KiB |
Average record size in memory | 53.1 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 5 |
Dataset
Description | 농림수산식품교육문화정보원 스마트팜코리아에서 제공하는 스마트축산 양돈분야 포유모돈 정보입니다. |
---|---|
Author | 농림수산식품교육문화정보원 |
URL | https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20210929000000001581 |
Dataset has 166 (9.6%) duplicate rows | Duplicates |
섭취량 has 56 (3.3%) zeros | Zeros |
Reproduction
Analysis started | 2022-08-12 14:47:07.098737 |
---|---|
Analysis finished | 2022-08-12 14:47:13.703052 |
Duration | 6.6 seconds |
Software version | pandas-profiling v3.2.0 |
Download configuration | config.json |
농장아이디
Categorical
Distinct | 7 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 13.6 KiB |
PF_0020440 | |
---|---|
PF_0000347_01 | |
PF_0021299 | |
PF_0020426 | |
PF_0021284 | |
Other values (2) |
Length
Max length | 13 |
---|---|
Median length | 10 |
Mean length | 10.79442509 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | PF_0020239 |
---|---|
2nd row | PF_0020239 |
3rd row | PF_0020239 |
4th row | PF_0020239 |
5th row | PF_0020239 |
Common Values
Value | Count | Frequency (%) |
PF_0020440 | 683 | |
PF_0000347_01 | 456 | |
PF_0021299 | 215 | 12.5% |
PF_0020426 | 138 | 8.0% |
PF_0021284 | 127 | 7.4% |
PF_0020239 | 84 | 4.9% |
PF_0021283 | 19 | 1.1% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
pf_0020440 | 683 | |
pf_0000347_01 | 456 | |
pf_0021299 | 215 | 12.5% |
pf_0020426 | 138 | 8.0% |
pf_0021284 | 127 | 7.4% |
pf_0020239 | 84 | 4.9% |
pf_0021283 | 19 | 1.1% |
개체 구별 번호
Real number (ℝ≥0)
Distinct | 238 |
---|---|
Distinct (%) | 13.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 529.4779326 |
Minimum | 1 |
---|---|
Maximum | 1751 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 15.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 45 |
Q1 | 180 |
median | 386 |
Q3 | 770.5 |
95-th percentile | 1643.85 |
Maximum | 1751 |
Range | 1750 |
Interquartile range (IQR) | 590.5 |
Descriptive statistics
Standard deviation | 455.4949952 |
---|---|
Coefficient of variation (CV) | 0.8602719153 |
Kurtosis | 0.7640417999 |
Mean | 529.4779326 |
Median Absolute Deviation (MAD) | 234 |
Skewness | 1.219057434 |
Sum | 911761 |
Variance | 207475.6907 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
533 | 18 | 1.0% |
576 | 18 | 1.0% |
536 | 18 | 1.0% |
45 | 17 | 1.0% |
47 | 16 | 0.9% |
155 | 15 | 0.9% |
92 | 15 | 0.9% |
41 | 14 | 0.8% |
302 | 14 | 0.8% |
26 | 12 | 0.7% |
Other values (228) | 1565 |
Value | Count | Frequency (%) |
1 | 9 | |
10 | 2 | 0.1% |
13 | 5 | 0.3% |
15 | 10 | |
25 | 7 | |
26 | 12 | |
27 | 1 | 0.1% |
38 | 10 | |
39 | 7 | |
41 | 14 |
Value | Count | Frequency (%) |
1751 | 11 | |
1746 | 8 | |
1740 | 8 | |
1738 | 8 | |
1724 | 10 | |
1711 | 10 | |
1672 | 11 | |
1656 | 10 | |
1645 | 11 | |
1622 | 11 |
교배일
Real number (ℝ≥0)
Distinct | 32 |
---|---|
Distinct (%) | 1.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20210539.6 |
Minimum | 20210503 |
---|---|
Maximum | 20210608 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 15.3 KiB |
Quantile statistics
Minimum | 20210503 |
---|---|
5-th percentile | 20210510 |
Q1 | 20210517 |
median | 20210525 |
Q3 | 20210531 |
95-th percentile | 20210607 |
Maximum | 20210608 |
Range | 105 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 35.45733327 |
---|---|
Coefficient of variation (CV) | 1.754398149 × 10-6 |
Kurtosis | -0.4496197337 |
Mean | 20210539.6 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 1.182470628 |
Sum | 3.48025492 × 1010 |
Variance | 1257.222483 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20210601 | 137 | 8.0% |
20210517 | 122 | 7.1% |
20210510 | 113 | 6.6% |
20210525 | 113 | 6.6% |
20210602 | 109 | 6.3% |
20210526 | 101 | 5.9% |
20210518 | 97 | 5.6% |
20210511 | 89 | 5.2% |
20210524 | 86 | 5.0% |
20210607 | 72 | 4.2% |
Other values (22) | 683 |
Value | Count | Frequency (%) |
20210503 | 2 | 0.1% |
20210508 | 11 | 0.6% |
20210510 | 113 | |
20210511 | 89 | |
20210512 | 23 | 1.3% |
20210513 | 40 | 2.3% |
20210514 | 18 | 1.0% |
20210515 | 15 | 0.9% |
20210516 | 51 | |
20210517 | 122 |
Value | Count | Frequency (%) |
20210608 | 34 | 2.0% |
20210607 | 72 | |
20210606 | 1 | 0.1% |
20210605 | 7 | 0.4% |
20210604 | 5 | 0.3% |
20210603 | 34 | 2.0% |
20210602 | 109 | |
20210601 | 137 | |
20210531 | 68 | |
20210530 | 40 | 2.3% |
분만일
Real number (ℝ≥0)
Distinct | 30 |
---|---|
Distinct (%) | 1.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20210914.95 |
Minimum | 20210901 |
---|---|
Maximum | 20210930 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 15.3 KiB |
Quantile statistics
Minimum | 20210901 |
---|---|
5-th percentile | 20210903 |
Q1 | 20210909 |
median | 20210916 |
Q3 | 20210922 |
95-th percentile | 20210929 |
Maximum | 20210930 |
Range | 29 |
Interquartile range (IQR) | 13 |
Descriptive statistics
Standard deviation | 8.001161208 |
---|---|
Coefficient of variation (CV) | 3.958831765 × 10-7 |
Kurtosis | -1.063423433 |
Mean | 20210914.95 |
Median Absolute Deviation (MAD) | 7 |
Skewness | -0.0210376683 |
Sum | 3.480319554 × 1010 |
Variance | 64.01858068 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20210923 | 131 | 7.6% |
20210924 | 127 | 7.4% |
20210916 | 118 | 6.9% |
20210909 | 118 | 6.9% |
20210917 | 101 | 5.9% |
20210904 | 99 | 5.7% |
20210918 | 94 | 5.5% |
20210903 | 89 | 5.2% |
20210929 | 72 | 4.2% |
20210911 | 69 | 4.0% |
Other values (20) | 704 |
Value | Count | Frequency (%) |
20210901 | 36 | 2.1% |
20210902 | 43 | 2.5% |
20210903 | 89 | |
20210904 | 99 | |
20210905 | 35 | 2.0% |
20210906 | 23 | 1.3% |
20210907 | 47 | 2.7% |
20210908 | 41 | 2.4% |
20210909 | 118 | |
20210910 | 65 |
Value | Count | Frequency (%) |
20210930 | 34 | 2.0% |
20210929 | 72 | |
20210928 | 1 | 0.1% |
20210927 | 7 | 0.4% |
20210926 | 7 | 0.4% |
20210925 | 40 | 2.3% |
20210924 | 127 | |
20210923 | 131 | |
20210922 | 50 | 2.9% |
20210921 | 36 | 2.1% |
설정량
Real number (ℝ≥0)
Distinct | 85 |
---|---|
Distinct (%) | 4.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.616550523 |
Minimum | 0 |
---|---|
Maximum | 12.6 |
Zeros | 10 |
Zeros (%) | 0.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 15.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1.4 |
Q1 | 2.5 |
median | 3.5 |
Q3 | 6.375 |
95-th percentile | 11.1 |
Maximum | 12.6 |
Range | 12.6 |
Interquartile range (IQR) | 3.875 |
Descriptive statistics
Standard deviation | 2.979706523 |
---|---|
Coefficient of variation (CV) | 0.6454400333 |
Kurtosis | -0.2709947362 |
Mean | 4.616550523 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.8504445008 |
Sum | 7949.7 |
Variance | 8.878650965 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1.5 | 129 | 7.5% |
3.2 | 126 | 7.3% |
2.5 | 104 | 6.0% |
2 | 85 | 4.9% |
6 | 77 | 4.5% |
11.1 | 69 | 4.0% |
5 | 69 | 4.0% |
3.5 | 66 | 3.8% |
1 | 66 | 3.8% |
3 | 66 | 3.8% |
Other values (75) | 865 |
Value | Count | Frequency (%) |
0 | 10 | 0.6% |
0.6 | 2 | 0.1% |
0.7 | 2 | 0.1% |
1 | 66 | |
1.2 | 2 | 0.1% |
1.3 | 2 | 0.1% |
1.4 | 66 | |
1.5 | 129 | |
1.6 | 18 | 1.0% |
1.7 | 2 | 0.1% |
Value | Count | Frequency (%) |
12.6 | 1 | 0.1% |
12.2 | 1 | 0.1% |
12 | 19 | 1.1% |
11.8 | 3 | 0.2% |
11.3 | 3 | 0.2% |
11.1 | 69 | |
11 | 5 | 0.3% |
10.8 | 18 | 1.0% |
10.5 | 15 | 0.9% |
10.4 | 2 | 0.1% |
Distinct | 108 |
---|---|
Distinct (%) | 6.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.813704994 |
Minimum | 0 |
---|---|
Maximum | 51 |
Zeros | 56 |
Zeros (%) | 3.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 15.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.8 |
Q1 | 1.9 |
median | 3.2 |
Q3 | 5.3 |
95-th percentile | 8.7 |
Maximum | 51 |
Range | 51 |
Interquartile range (IQR) | 3.4 |
Descriptive statistics
Standard deviation | 2.691650697 |
---|---|
Coefficient of variation (CV) | 0.7057836674 |
Kurtosis | 54.34521503 |
Mean | 3.813704994 |
Median Absolute Deviation (MAD) | 1.6 |
Skewness | 3.764906344 |
Sum | 6567.2 |
Variance | 7.244983476 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1.5 | 99 | 5.7% |
2.5 | 95 | 5.5% |
3.2 | 94 | 5.5% |
1.6 | 84 | 4.9% |
2 | 79 | 4.6% |
6 | 58 | 3.4% |
3.1 | 57 | 3.3% |
5 | 57 | 3.3% |
0 | 56 | 3.3% |
5.5 | 47 | 2.7% |
Other values (98) | 996 |
Value | Count | Frequency (%) |
0 | 56 | |
0.3 | 2 | 0.1% |
0.4 | 4 | 0.2% |
0.5 | 2 | 0.1% |
0.6 | 11 | 0.6% |
0.7 | 7 | 0.4% |
0.8 | 19 | 1.1% |
0.9 | 12 | 0.7% |
1 | 33 | |
1.1 | 2 | 0.1% |
Value | Count | Frequency (%) |
51 | 1 | 0.1% |
12 | 4 | |
11.6 | 1 | 0.1% |
11.1 | 9 | |
11 | 6 | |
10.8 | 4 | |
10.7 | 2 | 0.1% |
10.6 | 1 | 0.1% |
10.5 | 3 | 0.2% |
10.4 | 2 | 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
농장아이디 | 개체 구별 번호 | 교배일 | 분만일 | 설정량 | 섭취량 | |
---|---|---|---|---|---|---|
0 | PF_0020239 | 200 | 20210516 | 20210907 | 5.2 | 4.5 |
1 | PF_0020239 | 200 | 20210516 | 20210907 | 5.3 | 2.1 |
2 | PF_0020239 | 200 | 20210516 | 20210907 | 4.7 | 3.5 |
3 | PF_0020239 | 200 | 20210516 | 20210907 | 5.2 | 3.4 |
4 | PF_0020239 | 284 | 20210528 | 20210919 | 0.6 | 0.6 |
5 | PF_0020239 | 284 | 20210528 | 20210919 | 1.2 | 2.6 |
6 | PF_0020239 | 284 | 20210528 | 20210919 | 1.7 | 1.8 |
7 | PF_0020239 | 284 | 20210528 | 20210919 | 1.2 | 1.6 |
8 | PF_0020239 | 295 | 20210524 | 20210918 | 1.5 | 2.2 |
9 | PF_0020239 | 295 | 20210524 | 20210918 | 2.1 | 2.4 |
Last rows
농장아이디 | 개체 구별 번호 | 교배일 | 분만일 | 설정량 | 섭취량 | |
---|---|---|---|---|---|---|
1712 | PF_0021283 | 1 | 20210521 | 20210912 | 9.0 | 9.0 |
1713 | PF_0021283 | 1 | 20210521 | 20210912 | 9.0 | 9.0 |
1714 | PF_0021283 | 27 | 20210511 | 20210902 | 9.0 | 6.9 |
1715 | PF_0021283 | 399 | 20210511 | 20210902 | 9.0 | 8.4 |
1716 | PF_0021283 | 416 | 20210528 | 20210919 | 3.0 | 4.0 |
1717 | PF_0021283 | 416 | 20210528 | 20210919 | 6.0 | 6.1 |
1718 | PF_0021283 | 422 | 20210524 | 20210915 | 7.0 | 5.7 |
1719 | PF_0021283 | 422 | 20210524 | 20210915 | 9.0 | 6.1 |
1720 | PF_0021283 | 423 | 20210523 | 20210914 | 8.0 | 7.1 |
1721 | PF_0021283 | 423 | 20210528 | 20210919 | 6.0 | 6.1 |
Most frequently occurring
농장아이디 | 개체 구별 번호 | 교배일 | 분만일 | 설정량 | 섭취량 | # duplicates | |
---|---|---|---|---|---|---|---|
73 | PF_0020440 | 397 | 20210608 | 20210930 | 2.5 | 2.5 | 5 |
78 | PF_0020440 | 524 | 20210608 | 20210930 | 2.5 | 2.5 | 5 |
85 | PF_0020440 | 536 | 20210608 | 20210930 | 2.5 | 2.5 | 5 |
115 | PF_0020440 | 871 | 20210608 | 20210930 | 2.5 | 2.5 | 5 |
1 | PF_0000347_01 | 361 | 20210607 | 20210929 | 3.2 | 3.2 | 4 |
3 | PF_0000347_01 | 386 | 20210511 | 20210902 | 11.1 | 11.1 | 4 |
5 | PF_0000347_01 | 487 | 20210607 | 20210929 | 3.2 | 3.2 | 4 |
9 | PF_0000347_01 | 510 | 20210607 | 20210929 | 3.2 | 3.2 | 4 |
15 | PF_0000347_01 | 555 | 20210530 | 20210921 | 3.1 | 3.1 | 4 |
18 | PF_0000347_01 | 567 | 20210510 | 20210904 | 11.1 | 11.1 | 4 |