Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 276 |
Missing cells | 130 |
Missing cells (%) | 3.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 29.8 KiB |
Average record size in memory | 110.5 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 5 |
Text | 4 |
Dataset
Description | 도매시장에서 실거래가 발생하는 농축수산물 표준 품목 코드 499개를 사전 선정하였으며 499개의 농축수산물 표준 품목코드를 기준으로 동일한 국제표준코드를 나타낸 정보 |
---|---|
Author | 농림수산식품교육문화정보원 |
URL | https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220210000000001774 |
GPC_SEGM_CODE has constant value "" | Constant |
GPC_SEGM_NM has constant value "" | Constant |
UPDT_DE has constant value "" | Constant |
CATGORY_CODE is highly overall correlated with CATGORY_NM and 1 other fields | High correlation |
GPC_FAMY_CODE is highly overall correlated with GPC_CLAS_CODE and 3 other fields | High correlation |
GPC_CLAS_CODE is highly overall correlated with GPC_FAMY_CODE and 3 other fields | High correlation |
GPC_BRIK_CODE is highly overall correlated with GPC_FAMY_CODE and 1 other fields | High correlation |
CATGORY_NM is highly overall correlated with CATGORY_CODE and 3 other fields | High correlation |
GPC_FAMY_NM is highly overall correlated with CATGORY_CODE and 3 other fields | High correlation |
GPC_BRIK_CODE has 65 (23.6%) missing values | Missing |
GPC_BRIK_NM has 65 (23.6%) missing values | Missing |
STD_PRDLST_CODE has unique values | Unique |
Reproduction
Analysis started | 2023-12-11 03:47:21.722744 |
---|---|
Analysis finished | 2023-12-11 03:47:25.122138 |
Duration | 3.4 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
CATGORY_CODE
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 31 |
---|---|
Distinct (%) | 11.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 21.224638 |
Minimum | 1 |
---|---|
Maximum | 91 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 4.75 |
Q1 | 10 |
median | 13 |
Q3 | 19 |
95-th percentile | 72 |
Maximum | 91 |
Range | 90 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 22.204838 |
---|---|
Coefficient of variation (CV) | 1.0461822 |
Kurtosis | 1.5988677 |
Mean | 21.224638 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 1.7430078 |
Sum | 5858 |
Variance | 493.05481 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
6 | 47 | |
10 | 41 | |
13 | 26 | 9.4% |
19 | 25 | 9.1% |
14 | 16 | 5.8% |
17 | 16 | 5.8% |
12 | 12 | 4.3% |
61 | 10 | 3.6% |
71 | 10 | 3.6% |
16 | 7 | 2.5% |
Other values (21) | 66 |
Value | Count | Frequency (%) |
1 | 2 | 0.7% |
2 | 1 | 0.4% |
3 | 6 | 2.2% |
4 | 5 | 1.8% |
5 | 3 | 1.1% |
6 | 47 | |
9 | 4 | 1.4% |
10 | 41 | |
11 | 5 | 1.8% |
12 | 12 | 4.3% |
Value | Count | Frequency (%) |
91 | 3 | 1.1% |
81 | 5 | |
73 | 3 | 1.1% |
72 | 4 | 1.4% |
71 | 10 | |
64 | 3 | 1.1% |
63 | 2 | 0.7% |
62 | 4 | 1.4% |
61 | 10 | |
47 | 2 | 0.7% |
CATGORY_NM
Categorical
HIGH CORRELATION
 
Distinct | 31 |
---|---|
Distinct (%) | 11.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
과실류 | |
---|---|
엽경채류 | |
양채류 | |
약용작물류 | |
산채류 | |
Other values (26) |
Length
Max length | 6 |
---|---|
Median length | 5 |
Mean length | 3.7246377 |
Min length | 2 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 1.1% |
Sample
1st row | 미곡류 |
---|---|
2nd row | 미곡류 |
3rd row | 맥류 |
4th row | 두류 |
5th row | 두류 |
Common Values
Value | Count | Frequency (%) |
과실류 | 47 | |
엽경채류 | 41 | |
양채류 | 26 | 9.4% |
약용작물류 | 25 | 9.1% |
산채류 | 16 | 5.8% |
버섯류 | 16 | 5.8% |
조미채소류 | 12 | 4.3% |
내수면어류 | 10 | 3.6% |
해면어류 | 10 | 3.6% |
특용작물류 | 7 | 2.5% |
Other values (21) | 66 |
Length
Value | Count | Frequency (%) |
과실류 | 47 | |
엽경채류 | 41 | |
양채류 | 26 | 9.4% |
약용작물류 | 25 | 9.1% |
산채류 | 16 | 5.8% |
버섯류 | 16 | 5.8% |
조미채소류 | 12 | 4.3% |
내수면어류 | 10 | 3.6% |
해면어류 | 10 | 3.6% |
특용작물류 | 7 | 2.5% |
Other values (21) | 66 |
STD_PRDLST_CODE
Text
UNIQUE
 
Distinct | 276 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Value | Count | Frequency (%) |
0101 | 1 | 0.4% |
1705 | 1 | 0.4% |
1610 | 1 | 0.4% |
1615 | 1 | 0.4% |
1701 | 1 | 0.4% |
1702 | 1 | 0.4% |
1704 | 1 | 0.4% |
1603 | 1 | 0.4% |
1707 | 1 | 0.4% |
1602 | 1 | 0.4% |
Other values (266) | 266 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 295 | |
0 | 241 | |
6 | 101 | 9.1% |
2 | 99 | 9.0% |
3 | 92 | 8.3% |
4 | 75 | 6.8% |
7 | 58 | 5.3% |
9 | 56 | 5.1% |
5 | 47 | 4.3% |
8 | 33 | 3.0% |
Other values (5) | 7 | 0.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1097 | |
Uppercase Letter | 7 | 0.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 295 | |
0 | 241 | |
6 | 101 | 9.2% |
2 | 99 | 9.0% |
3 | 92 | 8.4% |
4 | 75 | 6.8% |
7 | 58 | 5.3% |
9 | 56 | 5.1% |
5 | 47 | 4.3% |
8 | 33 | 3.0% |
Uppercase Letter
Value | Count | Frequency (%) |
O | 2 | |
B | 2 | |
N | 1 | |
D | 1 | |
V | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1097 | |
Latin | 7 | 0.6% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 295 | |
0 | 241 | |
6 | 101 | 9.2% |
2 | 99 | 9.0% |
3 | 92 | 8.4% |
4 | 75 | 6.8% |
7 | 58 | 5.3% |
9 | 56 | 5.1% |
5 | 47 | 4.3% |
8 | 33 | 3.0% |
Latin
Value | Count | Frequency (%) |
O | 2 | |
B | 2 | |
N | 1 | |
D | 1 | |
V | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1104 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 295 | |
0 | 241 | |
6 | 101 | 9.1% |
2 | 99 | 9.0% |
3 | 92 | 8.3% |
4 | 75 | 6.8% |
7 | 58 | 5.3% |
9 | 56 | 5.1% |
5 | 47 | 4.3% |
8 | 33 | 3.0% |
Other values (5) | 7 | 0.6% |
STD_PRDLST_NM
Text
Distinct | 275 |
---|---|
Distinct (%) | 99.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Value | Count | Frequency (%) |
민들레 | 2 | 0.7% |
팽이버섯 | 1 | 0.4% |
호박씨 | 1 | 0.4% |
수세미 | 1 | 0.4% |
느타리버섯 | 1 | 0.4% |
양송이 | 1 | 0.4% |
표고버섯 | 1 | 0.4% |
땅콩 | 1 | 0.4% |
목이 | 1 | 0.4% |
사보래(사보이양배추 | 1 | 0.4% |
Other values (265) | 265 |
Most occurring characters
Value | Count | Frequency (%) |
류 | 42 | 4.8% |
리 | 26 | 3.0% |
나 | 22 | 2.5% |
이 | 16 | 1.8% |
고 | 16 | 1.8% |
자 | 16 | 1.8% |
추 | 16 | 1.8% |
파 | 15 | 1.7% |
물 | 15 | 1.7% |
( | 14 | 1.6% |
Other values (262) | 671 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 840 | |
Open Punctuation | 14 | 1.6% |
Close Punctuation | 14 | 1.6% |
Other Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
류 | 42 | 5.0% |
리 | 26 | 3.1% |
나 | 22 | 2.6% |
이 | 16 | 1.9% |
고 | 16 | 1.9% |
자 | 16 | 1.9% |
추 | 16 | 1.9% |
파 | 15 | 1.8% |
물 | 15 | 1.8% |
무 | 14 | 1.7% |
Other values (259) | 642 |
Open Punctuation
Value | Count | Frequency (%) |
( | 14 |
Close Punctuation
Value | Count | Frequency (%) |
) | 14 |
Other Punctuation
Value | Count | Frequency (%) |
? | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 840 | |
Common | 29 | 3.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
류 | 42 | 5.0% |
리 | 26 | 3.1% |
나 | 22 | 2.6% |
이 | 16 | 1.9% |
고 | 16 | 1.9% |
자 | 16 | 1.9% |
추 | 16 | 1.9% |
파 | 15 | 1.8% |
물 | 15 | 1.8% |
무 | 14 | 1.7% |
Other values (259) | 642 |
Common
Value | Count | Frequency (%) |
( | 14 | |
) | 14 | |
? | 1 | 3.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 840 | |
ASCII | 28 | 3.2% |
None | 1 | 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
류 | 42 | 5.0% |
리 | 26 | 3.1% |
나 | 22 | 2.6% |
이 | 16 | 1.9% |
고 | 16 | 1.9% |
자 | 16 | 1.9% |
추 | 16 | 1.9% |
파 | 15 | 1.8% |
물 | 15 | 1.8% |
무 | 14 | 1.7% |
Other values (259) | 642 |
ASCII
Value | Count | Frequency (%) |
( | 14 | |
) | 14 |
None
Value | Count | Frequency (%) |
? | 1 |
GPC_SEGM_CODE
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
50000000 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 50000000 |
---|---|
2nd row | 50000000 |
3rd row | 50000000 |
4th row | 50000000 |
5th row | 50000000 |
Common Values
Value | Count | Frequency (%) |
50000000 | 276 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
50000000 | 276 |
GPC_SEGM_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Food/Beverage/Tobacco |
---|
Length
Max length | 21 |
---|---|
Median length | 21 |
Mean length | 21 |
Min length | 21 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Food/Beverage/Tobacco |
---|---|
2nd row | Food/Beverage/Tobacco |
3rd row | Food/Beverage/Tobacco |
4th row | Food/Beverage/Tobacco |
5th row | Food/Beverage/Tobacco |
Common Values
Value | Count | Frequency (%) |
Food/Beverage/Tobacco | 276 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
food/beverage/tobacco | 276 |
GPC_FAMY_CODE
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 5.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50844420 |
Minimum | 50100000 |
---|---|
Maximum | 93030000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 50100000 |
---|---|
5-th percentile | 50100000 |
Q1 | 50120001 |
median | 50260000 |
Q3 | 50260000 |
95-th percentile | 50342500 |
Maximum | 93030000 |
Range | 42930000 |
Interquartile range (IQR) | 139999.25 |
Descriptive statistics
Standard deviation | 5125540.5 |
---|---|
Coefficient of variation (CV) | 0.10080832 |
Kurtosis | 65.18544 |
Mean | 50844420 |
Median Absolute Deviation (MAD) | 10000 |
Skewness | 8.1669724 |
Sum | 1.403306 × 1010 |
Variance | 2.6271166 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
50260000 | 120 | |
50250000 | 46 | 16.7% |
50120000 | 40 | 14.5% |
50100000 | 29 | 10.5% |
50350000 | 10 | 3.6% |
50220000 | 7 | 2.5% |
50340000 | 4 | 1.4% |
93030000 | 4 | 1.4% |
50310000 | 3 | 1.1% |
50320000 | 3 | 1.1% |
Other values (6) | 10 | 3.6% |
Value | Count | Frequency (%) |
50100000 | 29 | 10.5% |
50120000 | 40 | 14.5% |
50120001 | 1 | 0.4% |
50130000 | 1 | 0.4% |
50150000 | 1 | 0.4% |
50190000 | 3 | 1.1% |
50220000 | 7 | 2.5% |
50250000 | 46 | 16.7% |
50260000 | 120 | |
50290000 | 1 | 0.4% |
Value | Count | Frequency (%) |
93030000 | 4 | 1.4% |
50350000 | 10 | 3.6% |
50340000 | 4 | 1.4% |
50330000 | 3 | 1.1% |
50320000 | 3 | 1.1% |
50310000 | 3 | 1.1% |
50290000 | 1 | 0.4% |
50260000 | 120 | |
50250000 | 46 | 16.7% |
50220000 | 7 | 2.5% |
GPC_FAMY_NM
Categorical
HIGH CORRELATION
 
Distinct | 14 |
---|---|
Distinct (%) | 5.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Vegetables (Non Leaf) ? Unprepared/Unprocessed (Fresh) | |
---|---|
Fruits ? Unprepared/Unprocessed (Fresh) | |
Seafood | |
Fruits/Vegetables/Nuts/Seeds Prepared/Processed | |
Leaf Vegetables ? Unprepared/Unprocessed (Fresh) | 10 |
Other values (9) |
Length
Max length | 54 |
---|---|
Median length | 50 |
Mean length | 41.713768 |
Min length | 7 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 0.7% |
Sample
1st row | Cereal/Grain/Pulse Products |
---|---|
2nd row | Cereal/Grain/Pulse Products |
3rd row | Cereal/Grain/Pulse Products |
4th row | Vegetables (Non Leaf) ? Unprepared/Unprocessed (Fresh) |
5th row | Vegetables (Non Leaf) ? Unprepared/Unprocessed (Fresh) |
Common Values
Value | Count | Frequency (%) |
Vegetables (Non Leaf) ? Unprepared/Unprocessed (Fresh) | 121 | |
Fruits ? Unprepared/Unprocessed (Fresh) | 46 | 16.7% |
Seafood | 41 | 14.9% |
Fruits/Vegetables/Nuts/Seeds Prepared/Processed | 29 | 10.5% |
Leaf Vegetables ? Unprepared/Unprocessed (Fresh) | 10 | 3.6% |
Cereal/Grain/Pulse Products | 7 | 2.5% |
Nuts/Seeds ? Unprepared/Unprocessed (Shelf Stable) | 4 | 1.4% |
Live Plants (Genus A thru G) | 4 | 1.4% |
Fruits ? Unprepared/Unprocessed (Shelf Stable) | 3 | 1.1% |
Vegetables ? Unprepared/Unprocessed (Shelf Stable) | 3 | 1.1% |
Other values (4) | 8 | 2.9% |
Length
Value | Count | Frequency (%) |
190 | ||
unprepared/unprocessed | 190 | |
fresh | 180 | |
vegetables | 134 | |
leaf | 131 | |
non | 121 | |
fruits | 49 | 4.2% |
seafood | 41 | 3.5% |
fruits/vegetables/nuts/seeds | 29 | 2.5% |
prepared/processed | 29 | 2.5% |
Other values (16) | 74 | 6.3% |
GPC_CLAS_CODE
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 57 |
---|---|
Distinct (%) | 20.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50845567 |
Minimum | 50101800 |
---|---|
Maximum | 93037100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 50101800 |
---|---|
5-th percentile | 50102100 |
Q1 | 50122250 |
median | 50260100 |
Q3 | 50261300 |
95-th percentile | 50342600 |
Maximum | 93037100 |
Range | 42935300 |
Interquartile range (IQR) | 139050 |
Descriptive statistics
Standard deviation | 5125825.3 |
---|---|
Coefficient of variation (CV) | 0.10081165 |
Kurtosis | 65.185825 |
Mean | 50845567 |
Median Absolute Deviation (MAD) | 9050 |
Skewness | 8.167008 |
Sum | 1.4033377 × 1010 |
Variance | 2.6274085 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
50102100 | 27 | 9.8% |
50260100 | 24 | 8.7% |
50121500 | 23 | 8.3% |
50261300 | 21 | 7.6% |
50261100 | 19 | 6.9% |
50261700 | 16 | 5.8% |
50121700 | 11 | 4.0% |
50250600 | 9 | 3.3% |
50251000 | 9 | 3.3% |
50251900 | 8 | 2.9% |
Other values (47) | 109 |
Value | Count | Frequency (%) |
50101800 | 2 | 0.7% |
50102100 | 27 | |
50121500 | 23 | |
50121501 | 1 | 0.4% |
50121700 | 11 | |
50121900 | 2 | 0.7% |
50122000 | 1 | 0.4% |
50122100 | 2 | 0.7% |
50122300 | 1 | 0.4% |
50132500 | 1 | 0.4% |
Value | Count | Frequency (%) |
93037100 | 1 | 0.4% |
93033300 | 2 | |
93030500 | 1 | 0.4% |
50350500 | 1 | 0.4% |
50350400 | 3 | |
50350200 | 3 | |
50350100 | 3 | |
50340100 | 4 | |
50330100 | 3 | |
50320100 | 3 |
GPC_CLAS_NM
Text
Distinct | 56 |
---|---|
Distinct (%) | 20.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Length
Max length | 54 |
---|---|
Median length | 42 |
Mean length | 19.833333 |
Min length | 5 |
Characters and Unicode
Total characters | 5474 |
---|---|
Distinct characters | 47 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 23 ? |
---|---|
Unique (%) | 8.3% |
Sample
1st row | Grains/Flour |
---|---|
2nd row | Grains/Flour |
3rd row | Grains/Flour |
4th row | Beans (With Pods) |
5th row | Beans (With Pods) |
Value | Count | Frequency (%) |
vegetables | 95 | |
80 | 13.1% | |
unprepared/unprocessed | 50 | 8.2% |
prepared/processed | 34 | 5.5% |
fish | 26 | 4.2% |
fruit | 24 | 3.9% |
root/tuber | 24 | 3.9% |
herbs | 21 | 3.4% |
brassica | 19 | 3.1% |
fungi | 16 | 2.6% |
Other values (65) | 224 |
Most occurring characters
Value | Count | Frequency (%) |
e | 866 | |
s | 494 | 9.0% |
r | 434 | 7.9% |
338 | 6.2% | |
a | 309 | 5.6% |
t | 232 | 4.2% |
l | 213 | 3.9% |
d | 202 | 3.7% |
p | 195 | 3.6% |
o | 187 | 3.4% |
Other values (37) | 2004 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 4207 | |
Uppercase Letter | 668 | 12.2% |
Space Separator | 338 | 6.2% |
Other Punctuation | 215 | 3.9% |
Close Punctuation | 23 | 0.4% |
Open Punctuation | 23 | 0.4% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 866 | |
s | 494 | |
r | 434 | |
a | 309 | 7.3% |
t | 232 | 5.5% |
l | 213 | 5.1% |
d | 202 | 4.8% |
p | 195 | 4.6% |
o | 187 | 4.4% |
i | 183 | 4.3% |
Other values (12) | 892 |
Uppercase Letter
Value | Count | Frequency (%) |
U | 100 | |
V | 96 | |
P | 96 | |
F | 88 | |
S | 76 | |
B | 42 | |
T | 24 | 3.6% |
H | 24 | 3.6% |
R | 24 | 3.6% |
C | 18 | 2.7% |
Other values (10) | 80 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 137 | |
? | 78 |
Space Separator
Value | Count | Frequency (%) |
338 |
Close Punctuation
Value | Count | Frequency (%) |
) | 23 |
Open Punctuation
Value | Count | Frequency (%) |
( | 23 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 4875 | |
Common | 599 | 10.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 866 | |
s | 494 | 10.1% |
r | 434 | 8.9% |
a | 309 | 6.3% |
t | 232 | 4.8% |
l | 213 | 4.4% |
d | 202 | 4.1% |
p | 195 | 4.0% |
o | 187 | 3.8% |
i | 183 | 3.8% |
Other values (32) | 1560 |
Common
Value | Count | Frequency (%) |
338 | ||
/ | 137 | |
? | 78 | 13.0% |
) | 23 | 3.8% |
( | 23 | 3.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 5474 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 866 | |
s | 494 | 9.0% |
r | 434 | 7.9% |
338 | 6.2% | |
a | 309 | 5.6% |
t | 232 | 4.2% |
l | 213 | 3.9% |
d | 202 | 3.7% |
p | 195 | 3.6% |
o | 187 | 3.4% |
Other values (37) | 2004 |
GPC_BRIK_CODE
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 123 |
---|---|
Distinct (%) | 58.3% |
Missing | 65 |
Missing (%) | 23.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10194313 |
Minimum | 10000003 |
---|---|
Maximum | 50261700 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 10000003 |
---|---|
5-th percentile | 10000008 |
Q1 | 10000272 |
median | 10005917 |
Q3 | 10006114 |
95-th percentile | 10006354 |
Maximum | 50261700 |
Range | 40261697 |
Interquartile range (IQR) | 5842 |
Descriptive statistics
Standard deviation | 2771489.2 |
---|---|
Coefficient of variation (CV) | 0.27186621 |
Kurtosis | 210.99952 |
Mean | 10194313 |
Median Absolute Deviation (MAD) | 424 |
Skewness | 14.525814 |
Sum | 2.1510001 × 109 |
Variance | 7.6811526 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10000272 | 25 | 9.1% |
10000282 | 19 | 6.9% |
10000019 | 11 | 4.0% |
10006260 | 7 | 2.5% |
10000211 | 5 | 1.8% |
10000281 | 4 | 1.4% |
10000008 | 4 | 1.4% |
10000007 | 3 | 1.1% |
10000003 | 3 | 1.1% |
10000006 | 3 | 1.1% |
Other values (113) | 127 | |
(Missing) | 65 |
Value | Count | Frequency (%) |
10000003 | 3 | 1.1% |
10000006 | 3 | 1.1% |
10000007 | 3 | 1.1% |
10000008 | 4 | 1.4% |
10000016 | 1 | 0.4% |
10000017 | 1 | 0.4% |
10000019 | 11 | |
10000146 | 1 | 0.4% |
10000149 | 1 | 0.4% |
10000203 | 1 | 0.4% |
Value | Count | Frequency (%) |
50261700 | 1 | |
10006632 | 1 | |
10006594 | 2 | |
10006566 | 1 | |
10006441 | 1 | |
10006417 | 1 | |
10006364 | 1 | |
10006363 | 2 | |
10006362 | 1 | |
10006345 | 1 |
GPC_BRIK_NM
Text
MISSING
 
Distinct | 120 |
---|---|
Distinct (%) | 56.9% |
Missing | 65 |
Missing (%) | 23.6% |
Memory size | 2.3 KiB |
Length
Max length | 60 |
---|---|
Median length | 46 |
Mean length | 26.720379 |
Min length | 4 |
Characters and Unicode
Total characters | 5638 |
---|---|
Distinct characters | 57 |
Distinct categories | 7 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 95 ? |
---|---|
Unique (%) | 45.0% |
Sample
1st row | Grains/Cereal ? Not Ready to Eat ? (Shelf Stable) |
---|---|
2nd row | Grains/Cereal ? Not Ready to Eat ? (Shelf Stable) |
3rd row | Grains/Cereal ? Not Ready to Eat ? (Shelf Stable) |
4th row | Beans (Winged) |
5th row | Peas |
Value | Count | Frequency (%) |
102 | 15.7% | |
unprepared/unprocessed | 49 | 7.5% |
shelf | 45 | 6.9% |
stable | 45 | 6.9% |
perishable | 41 | 6.3% |
prepared/processed | 34 | 5.2% |
vegetables | 30 | 4.6% |
fish | 26 | 4.0% |
shellfish | 13 | 2.0% |
nuts/seeds | 9 | 1.4% |
Other values (157) | 256 |
Most occurring characters
Value | Count | Frequency (%) |
e | 805 | 14.3% |
440 | 7.8% | |
s | 437 | 7.8% |
r | 427 | 7.6% |
a | 342 | 6.1% |
l | 236 | 4.2% |
p | 217 | 3.8% |
d | 207 | 3.7% |
o | 198 | 3.5% |
t | 184 | 3.3% |
Other values (47) | 2145 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 4129 | |
Uppercase Letter | 643 | 11.4% |
Space Separator | 440 | 7.8% |
Other Punctuation | 209 | 3.7% |
Open Punctuation | 108 | 1.9% |
Close Punctuation | 108 | 1.9% |
Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 805 | |
s | 437 | |
r | 427 | |
a | 342 | 8.3% |
l | 236 | 5.7% |
p | 217 | 5.3% |
d | 207 | 5.0% |
o | 198 | 4.8% |
t | 184 | 4.5% |
h | 181 | 4.4% |
Other values (16) | 895 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 143 | |
P | 135 | |
U | 98 | |
F | 43 | 6.7% |
V | 31 | 4.8% |
C | 30 | 4.7% |
R | 20 | 3.1% |
G | 18 | 2.8% |
N | 16 | 2.5% |
M | 15 | 2.3% |
Other values (13) | 94 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 106 | |
? | 101 | |
, | 1 | 0.5% |
' | 1 | 0.5% |
Space Separator
Value | Count | Frequency (%) |
440 |
Open Punctuation
Value | Count | Frequency (%) |
( | 108 |
Close Punctuation
Value | Count | Frequency (%) |
) | 108 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 4772 | |
Common | 866 | 15.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 805 | |
s | 437 | 9.2% |
r | 427 | 8.9% |
a | 342 | 7.2% |
l | 236 | 4.9% |
p | 217 | 4.5% |
d | 207 | 4.3% |
o | 198 | 4.1% |
t | 184 | 3.9% |
h | 181 | 3.8% |
Other values (39) | 1538 |
Common
Value | Count | Frequency (%) |
440 | ||
( | 108 | 12.5% |
) | 108 | 12.5% |
/ | 106 | 12.2% |
? | 101 | 11.7% |
- | 1 | 0.1% |
, | 1 | 0.1% |
' | 1 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 5638 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 805 | 14.3% |
440 | 7.8% | |
s | 437 | 7.8% |
r | 427 | 7.6% |
a | 342 | 6.1% |
l | 236 | 4.2% |
p | 217 | 3.8% |
d | 207 | 3.7% |
o | 198 | 3.5% |
t | 184 | 3.3% |
Other values (47) | 2145 |
UPDT_DE
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
20151203 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20151203 |
---|---|
2nd row | 20151203 |
3rd row | 20151203 |
4th row | 20151203 |
5th row | 20151203 |
Common Values
Value | Count | Frequency (%) |
20151203 | 276 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20151203 | 276 |
CATGORY_CODE | CATGORY_NM | GPC_FAMY_CODE | GPC_FAMY_NM | GPC_CLAS_CODE | GPC_CLAS_NM | GPC_BRIK_CODE | |
---|---|---|---|---|---|---|---|
CATGORY_CODE | 1.000 | 1.000 | 0.496 | 0.894 | 0.181 | 0.946 | 0.000 |
CATGORY_NM | 1.000 | 1.000 | 0.847 | 0.951 | 0.822 | 0.969 | 0.000 |
GPC_FAMY_CODE | 0.496 | 0.847 | 1.000 | 1.000 | 0.980 | 1.000 | 0.000 |
GPC_FAMY_NM | 0.894 | 0.951 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
GPC_CLAS_CODE | 0.181 | 0.822 | 0.980 | 1.000 | 1.000 | 1.000 | 0.000 |
GPC_CLAS_NM | 0.946 | 0.969 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
GPC_BRIK_CODE | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 |
GPC_FAMY_NM | CATGORY_NM | |
---|---|---|
GPC_FAMY_NM | 1.000 | 0.668 |
CATGORY_NM | 0.668 | 1.000 |
CATGORY_CODE | GPC_FAMY_CODE | GPC_CLAS_CODE | GPC_BRIK_CODE | CATGORY_NM | GPC_FAMY_NM | |
---|---|---|---|---|---|---|
CATGORY_CODE | 1.000 | -0.374 | -0.324 | -0.331 | 0.958 | 0.599 |
GPC_FAMY_CODE | -0.374 | 1.000 | 0.953 | 0.557 | 0.693 | 0.978 |
GPC_CLAS_CODE | -0.324 | 0.953 | 1.000 | 0.535 | 0.693 | 0.978 |
GPC_BRIK_CODE | -0.331 | 0.557 | 0.535 | 1.000 | 0.000 | 0.000 |
CATGORY_NM | 0.958 | 0.693 | 0.693 | 0.000 | 1.000 | 0.668 |
GPC_FAMY_NM | 0.599 | 0.978 | 0.978 | 0.000 | 0.668 | 1.000 |
CATGORY_CODE | CATGORY_NM | STD_PRDLST_CODE | STD_PRDLST_NM | GPC_SEGM_CODE | GPC_SEGM_NM | GPC_FAMY_CODE | GPC_FAMY_NM | GPC_CLAS_CODE | GPC_CLAS_NM | GPC_BRIK_CODE | GPC_BRIK_NM | UPDT_DE | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | 미곡류 | 0101 | 벼 | 50000000 | Food/Beverage/Tobacco | 50220000 | Cereal/Grain/Pulse Products | 50221000 | Grains/Flour | 10000211 | Grains/Cereal ? Not Ready to Eat ? (Shelf Stable) | 20151203 |
1 | 1 | 미곡류 | 0104 | 찹쌀 | 50000000 | Food/Beverage/Tobacco | 50220000 | Cereal/Grain/Pulse Products | 50221000 | Grains/Flour | 10000211 | Grains/Cereal ? Not Ready to Eat ? (Shelf Stable) | 20151203 |
2 | 2 | 맥류 | 0201 | 보리 | 50000000 | Food/Beverage/Tobacco | 50220000 | Cereal/Grain/Pulse Products | 50221000 | Grains/Flour | 10000315 | Grains/Cereal ? Not Ready to Eat ? (Shelf Stable) | 20151203 |
3 | 3 | 두류 | 0301 | 콩 | 50000000 | Food/Beverage/Tobacco | 50260000 | Vegetables (Non Leaf) ? Unprepared/Unprocessed (Fresh) | 50261400 | Beans (With Pods) | 10006336 | Beans (Winged) | 20151203 |
4 | 3 | 두류 | 0302 | 팥 | 50000000 | Food/Beverage/Tobacco | 50260000 | Vegetables (Non Leaf) ? Unprepared/Unprocessed (Fresh) | 50261400 | Beans (With Pods) | <NA> | <NA> | 20151203 |
5 | 3 | 두류 | 0303 | 녹두 | 50000000 | Food/Beverage/Tobacco | 50260000 | Vegetables (Non Leaf) ? Unprepared/Unprocessed (Fresh) | 50261400 | Beans (With Pods) | <NA> | <NA> | 20151203 |
6 | 3 | 두류 | 0304 | 완두 | 50000000 | Food/Beverage/Tobacco | 50260000 | Vegetables (Non Leaf) ? Unprepared/Unprocessed (Fresh) | 50261500 | Peas (With Pods) | 10005984 | Peas | 20151203 |
7 | 3 | 두류 | 0305 | 강낭콩 | 50000000 | Food/Beverage/Tobacco | 50290000 | Vegetables (Non Leaf) ? Unprepared/Unprocessed (Fresh) | 50261400 | Beans (With Pods) | <NA> | <NA> | 20151203 |
8 | 3 | 두류 | 0306 | 동부 | 50000000 | Food/Beverage/Tobacco | 50260000 | Vegetables (Non Leaf) ? Unprepared/Unprocessed (Fresh) | 50261400 | Beans (With Pods) | <NA> | <NA> | 20151203 |
9 | 4 | 잡곡류 | 0401 | 옥수수 | 50000000 | Food/Beverage/Tobacco | 50260000 | Vegetables (Non Leaf) ? Unprepared/Unprocessed (Fresh) | 50261000 | Other Vegetables | 10006147 | Sweetcorn | 20151203 |
CATGORY_CODE | CATGORY_NM | STD_PRDLST_CODE | STD_PRDLST_NM | GPC_SEGM_CODE | GPC_SEGM_NM | GPC_FAMY_CODE | GPC_FAMY_NM | GPC_CLAS_CODE | GPC_CLAS_NM | GPC_BRIK_CODE | GPC_BRIK_NM | UPDT_DE | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
266 | 73 | 내수면갑각류 | 7302 | 민물게류 | 50000000 | Food/Beverage/Tobacco | 50120000 | Seafood | 50121700 | Shellfish Unprepared/Unprocessed | 10000019 | Shellfish ? Unprepared/Unprocessed (Perishable) | 20151203 |
267 | 73 | 내수면갑각류 | 7303 | 민물새우류 | 50000000 | Food/Beverage/Tobacco | 50120000 | Seafood | 50121700 | Shellfish Unprepared/Unprocessed | 10000019 | Shellfish ? Unprepared/Unprocessed (Perishable) | 20151203 |
268 | 81 | 해조류 | 8102 | 갈래곰보류 | 50000000 | Food/Beverage/Tobacco | 50120000 | Seafood | 50121500 | Fish ? Unprepared/Unprocessed | 10000281 | Fish ? Unprepared/Unprocessed (Frozen) | 20151203 |
269 | 81 | 해조류 | 8103 | 김류 | 50000000 | Food/Beverage/Tobacco | 50120000 | Seafood | 50121500 | Fish ? Unprepared/Unprocessed | 10000281 | Fish ? Unprepared/Unprocessed (Frozen) | 20151203 |
270 | 81 | 해조류 | 8104 | 꼬시래기류 | 50000000 | Food/Beverage/Tobacco | 50120000 | Seafood | 50121900 | Fish ? Prepared/Processed | 10000017 | Fish ? Prepared/Processed (Frozen) | 20151203 |
271 | 81 | 해조류 | 8106 | 도박류 | 50000000 | Food/Beverage/Tobacco | 50120000 | Seafood | 50121500 | Fish ? Unprepared/Unprocessed | 10000281 | Fish ? Unprepared/Unprocessed (Frozen) | 20151203 |
272 | 81 | 해조류 | 8112 | 청각류 | 50000000 | Food/Beverage/Tobacco | 50120000 | Seafood | 50121500 | Fish ? Unprepared/Unprocessed | 10000281 | Fish ? Unprepared/Unprocessed (Frozen) | 20151203 |
273 | 91 | 농림가공 | 9104 | 절임식품 | 50000000 | Food/Beverage/Tobacco | 50190000 | Prepared/Preserved Foods | 50193100 | Vegetable Based Products / Meals | 10000289 | Vegetable Based Products / Meals ? Ready to Eat (Perishable) | 20151203 |
274 | 91 | 농림가공 | 9105 | 유지 | 50000000 | Food/Beverage/Tobacco | 50150000 | Oils/Fats Edible | 50151500 | Oils Edible | <NA> | <NA> | 20151203 |
275 | 91 | 농림가공 | 9107 | 곡물제조 | 50000000 | Food/Beverage/Tobacco | 50190000 | Prepared/Preserved Foods | 50193200 | Grain Based Products / Meals | <NA> | <NA> | 20151203 |