Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 468.9 KiB |
Average record size in memory | 48.0 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 5 |
Dataset
Description | 2015년 제·개정된 농축수산물 표준코드(품목,시장,단위,포장,크기,등급,산지)와 동일한 의미를 가지는 2013년 농축수산물 표준코드(품목,시장,단위,포장,크기,등급,산지) 나타낸 정보 |
---|---|
Author | 농림수산식품교육문화정보원 |
URL | https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20191011000000001245 |
업데이트일자 has constant value "2015-12-15" | Constant |
크기코드 has a high cardinality: 174 distinct values | High cardinality |
크기명 has a high cardinality: 106 distinct values | High cardinality |
구크기코드 has a high cardinality: 175 distinct values | High cardinality |
구크기명 has a high cardinality: 9539 distinct values | High cardinality |
df_index has unique values | Unique |
Reproduction
Analysis started | 2022-08-12 14:48:32.515738 |
---|---|
Analysis finished | 2022-08-12 14:48:33.547420 |
Duration | 1.03 second |
Software version | pandas-profiling v3.2.0 |
Download configuration | config.json |
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8685.3458 |
Minimum | 0 |
---|---|
Maximum | 17375 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 883.95 |
Q1 | 4299.75 |
median | 8671.5 |
Q3 | 13087 |
95-th percentile | 16502.05 |
Maximum | 17375 |
Range | 17375 |
Interquartile range (IQR) | 8787.25 |
Descriptive statistics
Standard deviation | 5036.75494 |
---|---|
Coefficient of variation (CV) | 0.5799141515 |
Kurtosis | -1.215331695 |
Mean | 8685.3458 |
Median Absolute Deviation (MAD) | 4390 |
Skewness | 0.009336468874 |
Sum | 86853458 |
Variance | 25368900.32 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11918 | 1 | < 0.1% |
15202 | 1 | < 0.1% |
14685 | 1 | < 0.1% |
16728 | 1 | < 0.1% |
9390 | 1 | < 0.1% |
11924 | 1 | < 0.1% |
5936 | 1 | < 0.1% |
4149 | 1 | < 0.1% |
16185 | 1 | < 0.1% |
7309 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
0 | 1 | |
1 | 1 | |
6 | 1 | |
7 | 1 | |
9 | 1 | |
10 | 1 | |
11 | 1 | |
12 | 1 | |
16 | 1 | |
18 | 1 |
Value | Count | Frequency (%) |
17375 | 1 | |
17373 | 1 | |
17372 | 1 | |
17371 | 1 | |
17369 | 1 | |
17368 | 1 | |
17367 | 1 | |
17366 | 1 | |
17363 | 1 | |
17362 | 1 |
Distinct | 174 |
---|---|
Distinct (%) | 1.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
7ZZ | 444 |
---|---|
124 | 242 |
123 | 221 |
125 | 208 |
162 | 146 |
Other values (169) |
Length
Max length | 10 |
---|---|
Median length | 3 |
Mean length | 3.1327 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 782 |
---|---|
2nd row | 162 |
3rd row | 312 |
4th row | 802 |
5th row | 172 |
Common Values
Value | Count | Frequency (%) |
7ZZ | 444 | 4.4% |
124 | 242 | 2.4% |
123 | 221 | 2.2% |
125 | 208 | 2.1% |
162 | 146 | 1.5% |
126 | 140 | 1.4% |
1ZZ | 125 | 1.2% |
3ZZ | 96 | 1.0% |
131 | 86 | 0.9% |
145 | 82 | 0.8% |
Other values (164) | 8210 |
Length
Value | Count | Frequency (%) |
7zz | 444 | 4.4% |
124 | 242 | 2.4% |
123 | 221 | 2.2% |
125 | 208 | 2.1% |
162 | 146 | 1.5% |
126 | 140 | 1.4% |
1zz | 125 | 1.2% |
3zz | 96 | 1.0% |
131 | 86 | 0.9% |
145 | 82 | 0.8% |
Other values (164) | 8210 |
Distinct | 106 |
---|---|
Distinct (%) | 1.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
기타 | 676 |
---|---|
40내 | 284 |
30내 | 271 |
50내 | 255 |
20내 | 199 |
Other values (101) |
Length
Max length | 16 |
---|---|
Median length | 14 |
Mean length | 3.2037 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 대 |
---|---|
2nd row | 20내 |
3rd row | 12cm내외×1.2m내 |
4th row | 2급 |
5th row | 72 |
Common Values
Value | Count | Frequency (%) |
기타 | 676 | 6.8% |
40내 | 284 | 2.8% |
30내 | 271 | 2.7% |
50내 | 255 | 2.5% |
20내 | 199 | 2.0% |
60내 | 183 | 1.8% |
180내 | 137 | 1.4% |
6 | 131 | 1.3% |
110내 | 129 | 1.3% |
15 | 129 | 1.3% |
Other values (96) | 7606 |
Length
Value | Count | Frequency (%) |
기타 | 676 | 6.8% |
40내 | 284 | 2.8% |
30내 | 271 | 2.7% |
50내 | 255 | 2.5% |
20내 | 199 | 2.0% |
60내 | 183 | 1.8% |
180내 | 137 | 1.4% |
6 | 131 | 1.3% |
110내 | 129 | 1.3% |
15 | 129 | 1.3% |
Other values (96) | 7606 |
Distinct | 175 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
700 | 91 |
---|---|
131 | 86 |
124 | 84 |
152 | 83 |
145 | 82 |
Other values (170) |
Length
Max length | 10 |
---|---|
Median length | 3 |
Mean length | 3.1327 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 782 |
---|---|
2nd row | 162 |
3rd row | 312 |
4th row | 802 |
5th row | 172 |
Common Values
Value | Count | Frequency (%) |
700 | 91 | 0.9% |
131 | 86 | 0.9% |
124 | 84 | 0.8% |
152 | 83 | 0.8% |
145 | 82 | 0.8% |
138 | 82 | 0.8% |
115 | 81 | 0.8% |
147 | 81 | 0.8% |
164 | 80 | 0.8% |
151 | 80 | 0.8% |
Other values (165) | 9170 |
Length
Value | Count | Frequency (%) |
700 | 91 | 0.9% |
131 | 86 | 0.9% |
124 | 84 | 0.8% |
152 | 83 | 0.8% |
145 | 82 | 0.8% |
138 | 82 | 0.8% |
115 | 81 | 0.8% |
147 | 81 | 0.8% |
164 | 80 | 0.8% |
151 | 80 | 0.8% |
Other values (165) | 9170 |
Distinct | 9539 |
---|---|
Distinct (%) | 95.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
기타 | 4 |
---|---|
상자 기타 | 3 |
ton 기타 | 3 |
kg PP대 기타 | 3 |
g PP대 | 3 |
Other values (9534) |
Length
Max length | 26 |
---|---|
Median length | 22 |
Mean length | 9.037 |
Min length | 1 |
Unique
Unique | 9089 ? |
---|---|
Unique (%) | 90.9% |
Sample
1st row | 두름 대 |
---|---|
2nd row | ton 트럭 20내(5단위) |
3rd row | l 재 12cm내외×1.2m내 |
4th row | 단 6개 2급 |
5th row | ml PE대 72개 |
Common Values
Value | Count | Frequency (%) |
기타 | 4 | < 0.1% |
상자 기타 | 3 | < 0.1% |
ton 기타 | 3 | < 0.1% |
kg PP대 기타 | 3 | < 0.1% |
g PP대 | 3 | < 0.1% |
kg 기타 | 3 | < 0.1% |
PP대 | 3 | < 0.1% |
kg | 3 | < 0.1% |
ton PP대 | 3 | < 0.1% |
그물망 기타 | 3 | < 0.1% |
Other values (9529) | 9969 |
Length
Value | Count | Frequency (%) |
g | 2032 | 7.4% |
ton | 1987 | 7.3% |
kg | 1975 | 7.2% |
l | 946 | 3.5% |
ml | 818 | 3.0% |
기타 | 665 | 2.4% |
그물망 | 499 | 1.8% |
pp대 | 480 | 1.8% |
속 | 477 | 1.7% |
상자 | 471 | 1.7% |
Other values (175) | 16929 |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
2015-12-15 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2015-12-15 |
---|---|
2nd row | 2015-12-15 |
3rd row | 2015-12-15 |
4th row | 2015-12-15 |
5th row | 2015-12-15 |
Common Values
Value | Count | Frequency (%) |
2015-12-15 | 10000 |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
2015-12-15 | 10000 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
df_index | 크기코드 | 크기명 | 구크기코드 | 구크기명 | 업데이트일자 | |
---|---|---|---|---|---|---|
0 | 11918 | 782 | 대 | 782 | 두름 대 | 2015-12-15 |
1 | 7251 | 162 | 20내 | 162 | ton 트럭 20내(5단위) | 2015-12-15 |
2 | 9464 | 312 | 12cm내외×1.2m내 | 312 | l 재 12cm내외×1.2m내 | 2015-12-15 |
3 | 16993 | 802 | 2급 | 802 | 단 6개 2급 | 2015-12-15 |
4 | 7725 | 172 | 72 | 172 | ml PE대 72개 | 2015-12-15 |
5 | 5943 | 141 | 210내 | 141 | l 210내 | 2015-12-15 |
6 | 14350 | 7D2 | 100내 | 7D2 | kg 쾌 100내 | 2015-12-15 |
7 | 3563 | 125 | 50내 | 125 | kg 접 50내 | 2015-12-15 |
8 | 13388 | 7B8 | 17 | 7B8 | ton 17미 | 2015-12-15 |
9 | 6796 | 147 | 450내 | 147 | g PE대 450내 | 2015-12-15 |
Last rows
df_index | 크기코드 | 크기명 | 구크기코드 | 구크기명 | 업데이트일자 | |
---|---|---|---|---|---|---|
9990 | 13788 | 7C4 | 30내 | 7C4 | ton 30내 | 2015-12-15 |
9991 | 16092 | 7ZZ | 기타 | 731 | 4P | 2015-12-15 |
9992 | 5364 | 136 | 160내 | 136 | kg 개 160내 | 2015-12-15 |
9993 | 2879 | 123 | 30내 | 152 | g 단 25내 | 2015-12-15 |
9994 | 12536 | 7A6 | 6 | 7A6 | g 각 6미 | 2015-12-15 |
9995 | 15800 | 7F2 | 2000내 | 7F2 | kg 쾌 2000내 | 2015-12-15 |
9996 | 14446 | 7D3 | 110내 | 7D3 | kg PAN(펜) 110내 | 2015-12-15 |
9997 | 1451 | 112 | 12 | 112 | kg 포 12개 | 2015-12-15 |
9998 | 10808 | 715 | 6통 | 715 | 6통 | 2015-12-15 |
9999 | 8764 | 303 | 12-18cm×3.6m이상 | 303 | ton 주 12-18cm×3.6m이상 | 2015-12-15 |