Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 11 |
Duplicate rows (%) | 0.1% |
Total size in memory | 498.0 KiB |
Average record size in memory | 51.0 B |
Variable types
Numeric | 2 |
---|---|
Categorical | 2 |
Text | 1 |
Dataset
Description | 2015년 제·개정된 농축수산물 표준코드의 단위코드와 동일한 의미를 가지는 2013년 농축수산물 표준코드의 단위코드를 나타낸 정보 |
---|---|
Author | 농림수산식품교육문화정보원 |
URL | https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220209000000001764 |
B2EN_TESTUPDT_DE has constant value "" | Constant |
Dataset has 11 (0.1%) duplicate rows | Duplicates |
STD_UNIT_NEW_CODE is highly overall correlated with STD_UNIT_CODE and 1 other fields | High correlation |
STD_UNIT_CODE is highly overall correlated with STD_UNIT_NEW_CODE and 1 other fields | High correlation |
STD_UNIT_NEW_NM is highly overall correlated with STD_UNIT_NEW_CODE and 1 other fields | High correlation |
Reproduction
Analysis started | 2023-12-11 03:53:19.322267 |
---|---|
Analysis finished | 2023-12-11 03:53:20.292935 |
Duration | 0.97 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
STD_UNIT_NEW_CODE
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 47.4269 |
Minimum | 11 |
---|---|
Maximum | 85 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 11 |
---|---|
5-th percentile | 11 |
Q1 | 13 |
median | 71 |
Q3 | 72 |
95-th percentile | 73 |
Maximum | 85 |
Range | 74 |
Interquartile range (IQR) | 59 |
Descriptive statistics
Standard deviation | 29.161545 |
---|---|
Coefficient of variation (CV) | 0.61487352 |
Kurtosis | -1.8125123 |
Mean | 47.4269 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.27266242 |
Sum | 474269 |
Variance | 850.3957 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
72 | 1712 | |
71 | 1684 | |
73 | 1658 | |
13 | 1236 | |
11 | 1235 | |
12 | 1213 | |
83 | 197 | 2.0% |
85 | 195 | 1.9% |
32 | 193 | 1.9% |
34 | 192 | 1.9% |
Other values (6) | 485 | 4.9% |
Value | Count | Frequency (%) |
11 | 1235 | |
12 | 1213 | |
13 | 1236 | |
31 | 191 | 1.9% |
32 | 193 | 1.9% |
33 | 188 | 1.9% |
34 | 192 | 1.9% |
41 | 3 | < 0.1% |
42 | 4 | < 0.1% |
43 | 4 | < 0.1% |
Value | Count | Frequency (%) |
85 | 195 | 1.9% |
84 | 95 | 0.9% |
83 | 197 | 2.0% |
73 | 1658 | |
72 | 1712 | |
71 | 1684 | |
43 | 4 | < 0.1% |
42 | 4 | < 0.1% |
41 | 3 | < 0.1% |
34 | 192 | 1.9% |
STD_UNIT_NEW_NM
Categorical
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
kg | |
---|---|
g | |
ton | |
속 | 197 |
본 | 195 |
Other values (2) | 287 |
Length
Max length | 3 |
---|---|
Median length | 2 |
Mean length | 1.9294 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | kg |
---|---|
2nd row | 본 |
3rd row | ton |
4th row | kg |
5th row | g |
Common Values
Value | Count | Frequency (%) |
kg | 3122 | |
g | 3113 | |
ton | 3086 | |
속 | 197 | 2.0% |
본 | 195 | 1.9% |
l | 192 | 1.9% |
분 | 95 | 0.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
kg | 3122 | |
g | 3113 | |
ton | 3086 | |
속 | 197 | 2.0% |
본 | 195 | 1.9% |
l | 192 | 1.9% |
분 | 95 | 0.9% |
STD_UNIT_CODE
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 18 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 47.3772 |
Minimum | 11 |
---|---|
Maximum | 85 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 11 |
---|---|
5-th percentile | 11 |
Q1 | 13 |
median | 71 |
Q3 | 72 |
95-th percentile | 73 |
Maximum | 85 |
Range | 74 |
Interquartile range (IQR) | 59 |
Descriptive statistics
Standard deviation | 29.100979 |
---|---|
Coefficient of variation (CV) | 0.61424017 |
Kurtosis | -1.8169112 |
Mean | 47.3772 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.27686512 |
Sum | 473772 |
Variance | 846.86701 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
72 | 1712 | |
71 | 1684 | |
73 | 1658 | |
13 | 1236 | |
11 | 1235 | |
12 | 1213 | |
32 | 193 | 1.9% |
34 | 192 | 1.9% |
31 | 191 | 1.9% |
33 | 188 | 1.9% |
Other values (8) | 498 | 5.0% |
Value | Count | Frequency (%) |
11 | 1235 | |
12 | 1213 | |
13 | 1236 | |
31 | 191 | 1.9% |
32 | 193 | 1.9% |
33 | 188 | 1.9% |
34 | 192 | 1.9% |
41 | 3 | < 0.1% |
42 | 4 | < 0.1% |
43 | 4 | < 0.1% |
Value | Count | Frequency (%) |
85 | 96 | 1.0% |
84 | 95 | 0.9% |
83 | 96 | 1.0% |
82 | 101 | 1.0% |
81 | 99 | 1.0% |
73 | 1658 | |
72 | 1712 | |
71 | 1684 | |
43 | 4 | < 0.1% |
42 | 4 | < 0.1% |
STD_UNIT_NM
Text
Distinct | 9272 |
---|---|
Distinct (%) | 92.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 26 |
---|---|
Median length | 23 |
Mean length | 9.661 |
Min length | 1 |
Characters and Unicode
Total characters | 96610 |
---|---|
Distinct characters | 95 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 8571 ? |
---|---|
Unique (%) | 85.7% |
Sample
1st row | kg 접 100내 |
---|---|
2nd row | 개 12개 3급 |
3rd row | ton PP대 60내 |
4th row | kg 두름 150내 |
5th row | g 코 80내 |
Value | Count | Frequency (%) |
g | 3167 | 10.8% |
kg | 3122 | 10.7% |
ton | 3086 | 10.6% |
기타 | 733 | 2.5% |
속 | 523 | 1.8% |
상자 | 500 | 1.7% |
그물망 | 482 | 1.6% |
pp대 | 477 | 1.6% |
포 | 432 | 1.5% |
쾌 | 431 | 1.5% |
Other values (181) | 16287 |
Most occurring characters
Value | Count | Frequency (%) |
19240 | ||
g | 6235 | 6.5% |
0 | 5290 | 5.5% |
1 | 4163 | 4.3% |
내 | 3673 | 3.8% |
k | 3118 | 3.2% |
n | 3086 | 3.2% |
o | 3086 | 3.2% |
t | 3082 | 3.2% |
2 | 2379 | 2.5% |
Other values (85) | 43258 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 24627 | |
Lowercase Letter | 20872 | |
Decimal Number | 20498 | |
Space Separator | 19240 | |
Uppercase Letter | 6027 | 6.2% |
Other Punctuation | 1889 | 2.0% |
Open Punctuation | 1101 | 1.1% |
Close Punctuation | 1101 | 1.1% |
Math Symbol | 710 | 0.7% |
Dash Punctuation | 545 | 0.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
내 | 3673 | 14.9% |
개 | 2124 | 8.6% |
미 | 1711 | 6.9% |
상 | 1224 | 5.0% |
단 | 955 | 3.9% |
기 | 917 | 3.7% |
대 | 912 | 3.7% |
타 | 733 | 3.0% |
이 | 724 | 2.9% |
지 | 599 | 2.4% |
Other values (44) | 11055 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 2108 | |
T | 522 | 8.7% |
B | 476 | 7.9% |
M | 457 | 7.6% |
S | 450 | 7.5% |
A | 263 | 4.4% |
N | 263 | 4.4% |
C | 259 | 4.3% |
D | 255 | 4.2% |
E | 231 | 3.8% |
Other values (5) | 743 | 12.3% |
Decimal Number
Value | Count | Frequency (%) |
0 | 5290 | |
1 | 4163 | |
2 | 2379 | |
5 | 2068 | 10.1% |
3 | 1911 | 9.3% |
4 | 1406 | 6.9% |
8 | 987 | 4.8% |
7 | 938 | 4.6% |
6 | 733 | 3.6% |
9 | 623 | 3.0% |
Lowercase Letter
Value | Count | Frequency (%) |
g | 6235 | |
k | 3118 | |
n | 3086 | |
o | 3086 | |
t | 3082 | |
m | 1363 | 6.5% |
c | 710 | 3.4% |
l | 192 | 0.9% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1018 | |
. | 819 | |
, | 52 | 2.8% |
Space Separator
Value | Count | Frequency (%) |
19240 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1101 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1101 |
Math Symbol
Value | Count | Frequency (%) |
× | 710 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 545 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 45084 | |
Latin | 26899 | |
Hangul | 24627 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
내 | 3673 | 14.9% |
개 | 2124 | 8.6% |
미 | 1711 | 6.9% |
상 | 1224 | 5.0% |
단 | 955 | 3.9% |
기 | 917 | 3.7% |
대 | 912 | 3.7% |
타 | 733 | 3.0% |
이 | 724 | 2.9% |
지 | 599 | 2.4% |
Other values (44) | 11055 |
Latin
Value | Count | Frequency (%) |
g | 6235 | |
k | 3118 | |
n | 3086 | |
o | 3086 | |
t | 3082 | |
P | 2108 | 7.8% |
m | 1363 | 5.1% |
c | 710 | 2.6% |
T | 522 | 1.9% |
B | 476 | 1.8% |
Other values (13) | 3113 |
Common
Value | Count | Frequency (%) |
19240 | ||
0 | 5290 | 11.7% |
1 | 4163 | 9.2% |
2 | 2379 | 5.3% |
5 | 2068 | 4.6% |
3 | 1911 | 4.2% |
4 | 1406 | 3.1% |
( | 1101 | 2.4% |
) | 1101 | 2.4% |
/ | 1018 | 2.3% |
Other values (8) | 5407 | 12.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 71273 | |
Hangul | 24627 | 25.5% |
None | 710 | 0.7% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
19240 | ||
g | 6235 | 8.7% |
0 | 5290 | 7.4% |
1 | 4163 | 5.8% |
k | 3118 | 4.4% |
n | 3086 | 4.3% |
o | 3086 | 4.3% |
t | 3082 | 4.3% |
2 | 2379 | 3.3% |
P | 2108 | 3.0% |
Other values (30) | 19486 |
Hangul
Value | Count | Frequency (%) |
내 | 3673 | 14.9% |
개 | 2124 | 8.6% |
미 | 1711 | 6.9% |
상 | 1224 | 5.0% |
단 | 955 | 3.9% |
기 | 917 | 3.7% |
대 | 912 | 3.7% |
타 | 733 | 3.0% |
이 | 724 | 2.9% |
지 | 599 | 2.4% |
Other values (44) | 11055 |
None
Value | Count | Frequency (%) |
× | 710 |
B2EN_TESTUPDT_DE
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
20160128 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20160128 |
---|---|
2nd row | 20160128 |
3rd row | 20160128 |
4th row | 20160128 |
5th row | 20160128 |
Common Values
Value | Count | Frequency (%) |
20160128 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20160128 | 10000 |
STD_UNIT_NEW_CODE | STD_UNIT_NEW_NM | STD_UNIT_CODE | |
---|---|---|---|
STD_UNIT_NEW_CODE | 1.000 | 0.798 | 1.000 |
STD_UNIT_NEW_NM | 0.798 | 1.000 | 0.798 |
STD_UNIT_CODE | 1.000 | 0.798 | 1.000 |
STD_UNIT_NEW_CODE | STD_UNIT_CODE | STD_UNIT_NEW_NM | |
---|---|---|---|
STD_UNIT_NEW_CODE | 1.000 | 1.000 | 0.632 |
STD_UNIT_CODE | 1.000 | 1.000 | 0.632 |
STD_UNIT_NEW_NM | 0.632 | 0.632 | 1.000 |
STD_UNIT_NEW_CODE | STD_UNIT_NEW_NM | STD_UNIT_CODE | STD_UNIT_NM | B2EN_TESTUPDT_DE | |
---|---|---|---|---|---|
2342 | 12 | kg | 12 | kg 접 100내 | 20160128 |
11603 | 85 | 본 | 81 | 개 12개 3급 | 20160128 |
3154 | 13 | ton | 13 | ton PP대 60내 | 20160128 |
8212 | 72 | kg | 72 | kg 두름 150내 | 20160128 |
6637 | 71 | g | 71 | g 코 80내 | 20160128 |
8441 | 72 | kg | 72 | kg 깡 6통 | 20160128 |
3629 | 13 | ton | 13 | ton 단 50내 | 20160128 |
2431 | 12 | kg | 12 | kg 채 25내 | 20160128 |
3518 | 13 | ton | 13 | ton 봉지 15내 | 20160128 |
9294 | 73 | ton | 73 | ton 상자 190내 | 20160128 |
STD_UNIT_NEW_CODE | STD_UNIT_NEW_NM | STD_UNIT_CODE | STD_UNIT_NM | B2EN_TESTUPDT_DE | |
---|---|---|---|---|---|
10691 | 73 | ton | 73 | ton 포 3방 | 20160128 |
8829 | 72 | kg | 72 | kg 축 3방 | 20160128 |
5783 | 71 | g | 71 | g C/T(B/T) 5단 | 20160128 |
11226 | 83 | 속 | 82 | 단 16개이상 기타 | 20160128 |
1853 | 12 | kg | 12 | kg 트럭 17개 | 20160128 |
4827 | 33 | ton | 33 | ton 트럭 3.9cm×5.1cm×2.7m | 20160128 |
871 | 11 | g | 11 | g 개 10내(5단위) | 20160128 |
6447 | 71 | g | 71 | g 미(마리) 140내 | 20160128 |
5886 | 71 | g | 71 | g S/P 10단 | 20160128 |
4811 | 33 | ton | 33 | ton 그물망 30cm×3.6m이상 | 20160128 |
Most frequently occurring
STD_UNIT_NEW_CODE | STD_UNIT_NEW_NM | STD_UNIT_CODE | STD_UNIT_NM | B2EN_TESTUPDT_DE | # duplicates | |
---|---|---|---|---|---|---|
0 | 11 | g | 11 | g 기타 | 20160128 | 2 |
1 | 12 | kg | 12 | kg 기타 | 20160128 | 2 |
2 | 13 | ton | 13 | ton 기타 | 20160128 | 2 |
3 | 33 | ton | 33 | ton 기타 | 20160128 | 2 |
4 | 34 | l | 34 | l 기타 | 20160128 | 2 |
5 | 72 | kg | 72 | kg 기타 | 20160128 | 2 |
6 | 73 | ton | 73 | ton 기타 | 20160128 | 2 |
7 | 83 | 속 | 82 | 단 기타 | 20160128 | 2 |
8 | 83 | 속 | 83 | 속 기타 | 20160128 | 2 |
9 | 85 | 본 | 81 | 개 기타 | 20160128 | 2 |