Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 1621 |
Missing cells | 1580 |
Missing cells (%) | 7.5% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 171.1 KiB |
Average record size in memory | 108.1 B |
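The derived figures in the table above can be cross-checked from the raw counts. The sketch below is only a sanity check built from numbers stated elsewhere in this report (the per-column missing counts come from the variable sections for `Unnamed: 9`, `20`, and `13.1`); the variable names are illustrative and not part of the profiling tool.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 13
n_observations = 1621
missing_cells = 1580
total_size_kib = 171.1

# Per-column missing counts as listed in the individual variable sections.
missing_by_column = {"Unnamed: 9": 1575, "20": 1, "13.1": 4}

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells (%): {missing_pct:.1f}")
print(f"average record size (B): {avg_record_bytes:.1f}")
print(f"sum of per-column missing: {sum(missing_by_column.values())}")
```

Note that the `<NA>` entries in the categorical `Unnamed:` columns are reported as a category value rather than as missing cells, which is why those columns show 0 missing.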
Variable types
Categorical | 5 |
---|---|
Numeric | 3 |
Text | 5 |
Dataset
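Description | Data on WTO bound (concession) tariff rates and basic tariff rates for South Korean agricultural and livestock products |
---|---|
Author | Ministry of Agriculture, Food and Rural Affairs (농림축산식품부) |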
URL | https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220217000000002067 |
2014 has constant value "2014" | Constant |
1 is highly overall correlated with Unnamed: 10 | High correlation |
0 is highly overall correlated with Unnamed: 8 and 1 other fields | High correlation |
Unnamed: 8 is highly overall correlated with 0 and 2 other fields | High correlation |
Unnamed: 10 is highly overall correlated with 1 and 2 other fields | High correlation |
Unnamed: 11 is highly overall correlated with Unnamed: 8 and 1 other fields | High correlation |
Unnamed: 12 is highly overall correlated with Unnamed: 11 | High correlation |
Unnamed: 8 is highly imbalanced (71.2%) | Imbalance |
Unnamed: 10 is highly imbalanced (91.7%) | Imbalance |
Unnamed: 11 is highly imbalanced (61.7%) | Imbalance |
Unnamed: 12 is highly imbalanced (65.8%) | Imbalance |
Unnamed: 9 has 1575 (97.2%) missing values | Missing |
1 has unique values | Unique |
0101.21.1000 has unique values | Unique |
0 has 106 (6.5%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-11 03:44:13.747963 |
---|---|
Analysis finished | 2023-12-11 03:44:16.595637 |
Duration | 2.85 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
2014
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.8 KiB |
2014 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2014 |
---|---|
2nd row | 2014 |
3rd row | 2014 |
4th row | 2014 |
5th row | 2014 |
Common Values
Value | Count | Frequency (%) |
2014 | 1621 |
1
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 1621 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 812 |
Minimum | 2 |
---|---|
Maximum | 1622 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 14.4 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 83 |
Q1 | 407 |
median | 812 |
Q3 | 1217 |
95-th percentile | 1541 |
Maximum | 1622 |
Range | 1620 |
Interquartile range (IQR) | 810 |
Descriptive statistics
Standard deviation | 468.08671 |
---|---|
Coefficient of variation (CV) | 0.57646146 |
Kurtosis | -1.2 |
Mean | 812 |
Median Absolute Deviation (MAD) | 405 |
Skewness | 0 |
Sum | 1316252 |
Variance | 219105.17 |
Monotonicity | Strictly increasing |
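The statistics above are what one would expect if column `1` is simply a running row number. As a hedged check, the sketch below assumes the column holds the consecutive integers 2 through 1622 (an inference from 1621 distinct values, minimum 2, maximum 1622, and strict monotonicity) and recomputes the reported figures using the sample variance (divisor n - 1).

```python
import math

# Assumption: the column is exactly the integers 2, 3, ..., 1622.
values = list(range(2, 1623))

n = len(values)
total = sum(values)
mean = total / n

# Sample variance (divisor n - 1), then standard deviation and CV.
variance = sum((v - mean) ** 2 for v in values) / (n - 1)
std_dev = math.sqrt(variance)
cv = std_dev / mean

# n is odd, so the median is the middle element of the sorted data.
median = sorted(values)[n // 2]
# Median absolute deviation: median of |v - median|.
mad = sorted(abs(v - median) for v in values)[n // 2]

print(n, total, mean, median, mad)
print(round(variance, 2), round(std_dev, 5), round(cv, 8))
```

Under that assumption the recomputed sum, mean, variance, standard deviation, CV, and MAD agree with the table, which suggests the column carries no information beyond row order.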
Value | Count | Frequency (%) |
2 | 1 | 0.1% |
1080 | 1 | 0.1% |
1090 | 1 | 0.1% |
1089 | 1 | 0.1% |
1088 | 1 | 0.1% |
1087 | 1 | 0.1% |
1086 | 1 | 0.1% |
1085 | 1 | 0.1% |
1084 | 1 | 0.1% |
1083 | 1 | 0.1% |
Other values (1611) | 1611 |
Value | Count | Frequency (%) |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 | |
11 | 1 |
Value | Count | Frequency (%) |
1622 | 1 | |
1621 | 1 | |
1620 | 1 | |
1619 | 1 | |
1618 | 1 | |
1617 | 1 | |
1616 | 1 | |
1615 | 1 | |
1614 | 1 | |
1613 | 1 |
0101.21.1000
Text
UNIQUE
 
Distinct | 1621 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.8 KiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 12 |
Min length | 12 |
Characters and Unicode
Total characters | 19452 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 1621 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 0101.21.9000 |
---|---|
2nd row | 0101.29.1000 |
3rd row | 0101.29.9000 |
4th row | 0101.30.1000 |
5th row | 0101.30.9000 |
Value | Count | Frequency (%) |
0101.21.9000 | 1 | 0.1% |
1702.30.2000 | 1 | 0.1% |
1802.00.1000 | 1 | 0.1% |
1801.00.2000 | 1 | 0.1% |
1801.00.1000 | 1 | 0.1% |
1704.90.9000 | 1 | 0.1% |
1704.90.2090 | 1 | 0.1% |
1704.90.2020 | 1 | 0.1% |
1704.90.2010 | 1 | 0.1% |
1704.90.1000 | 1 | 0.1% |
Other values (1611) | 1611 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8064 | |
. | 3242 | |
1 | 2514 | 12.9% |
2 | 1622 | 8.3% |
9 | 1508 | 7.8% |
3 | 624 | 3.2% |
4 | 485 | 2.5% |
5 | 466 | 2.4% |
6 | 342 | 1.8% |
7 | 330 | 1.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 16210 | |
Other Punctuation | 3242 | 16.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 8064 | |
1 | 2514 | 15.5% |
2 | 1622 | 10.0% |
9 | 1508 | 9.3% |
3 | 624 | 3.8% |
4 | 485 | 3.0% |
5 | 466 | 2.9% |
6 | 342 | 2.1% |
7 | 330 | 2.0% |
8 | 255 | 1.6% |
Other Punctuation
Value | Count | Frequency (%) |
. | 3242 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 19452 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 8064 | |
. | 3242 | |
1 | 2514 | 12.9% |
2 | 1622 | 8.3% |
9 | 1508 | 7.8% |
3 | 624 | 3.2% |
4 | 485 | 2.5% |
5 | 466 | 2.4% |
6 | 342 | 1.8% |
7 | 330 | 1.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 19452 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8064 | |
. | 3242 | |
1 | 2514 | 12.9% |
2 | 1622 | 8.3% |
9 | 1508 | 7.8% |
3 | 624 | 3.2% |
4 | 485 | 2.5% |
5 | 466 | 2.4% |
6 | 342 | 1.8% |
7 | 330 | 1.7% |
말(번식용/농가사육용)
Text
Distinct | 1600 |
---|---|
Distinct (%) | 98.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.8 KiB |
Length
Max length | 56 |
---|---|
Median length | 48 |
Mean length | 12.58174 |
Min length | 1 |
Characters and Unicode
Total characters | 20395 |
---|---|
Distinct characters | 596 |
Distinct categories | 9 |
Distinct scripts | 4 |
Distinct blocks | 4 |
Unique
Unique | 1586 |
---|---|
Unique (%) | 97.8% |
Sample
1st row | 말(번식용/기타) |
---|---|
2nd row | 말(기타/경주말) |
3rd row | 말(기타/기타) |
4th row | 당나귀(번식용) |
5th row | 당나귀(기타) |
Value | Count | Frequency (%) |
것 | 207 | 5.8% |
또는 | 95 | 2.6% |
기타 | 94 | 2.6% |
및 | 63 | 1.8% |
그 | 34 | 0.9% |
종자 | 29 | 0.8% |
안한 | 28 | 0.8% |
분획물 | 22 | 0.6% |
도메스티쿠스종에 | 21 | 0.6% |
함유한 | 19 | 0.5% |
Other values (2066) | 2986 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 2005 | 9.8% |
( | 1198 | 5.9% |
) | 1196 | 5.9% |
기 | 707 | 3.5% |
타 | 623 | 3.1% |
/ | 504 | 2.5% |
의 | 310 | 1.5% |
조 | 306 | 1.5% |
리 | 303 | 1.5% |
스 | 302 | 1.5% |
Other values (586) | 12941 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 14826 | |
Space Separator | 2005 | 9.8% |
Open Punctuation | 1198 | 5.9% |
Close Punctuation | 1196 | 5.9% |
Other Punctuation | 694 | 3.4% |
Decimal Number | 343 | 1.7% |
Dash Punctuation | 74 | 0.4% |
Lowercase Letter | 57 | 0.3% |
Math Symbol | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 707 | 4.8% |
타 | 623 | 4.2% |
의 | 310 | 2.1% |
조 | 306 | 2.1% |
리 | 303 | 2.0% |
스 | 302 | 2.0% |
유 | 282 | 1.9% |
것 | 264 | 1.8% |
이 | 250 | 1.7% |
제 | 243 | 1.6% |
Other values (560) | 11236 |
Decimal Number
Value | Count | Frequency (%) |
0 | 102 | |
1 | 63 | |
2 | 41 | |
5 | 35 | 10.2% |
6 | 22 | 6.4% |
8 | 20 | 5.8% |
4 | 20 | 5.8% |
9 | 18 | 5.2% |
3 | 17 | 5.0% |
7 | 5 | 1.5% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 504 | |
, | 117 | 16.9% |
· | 31 | 4.5% |
. | 29 | 4.2% |
% | 11 | 1.6% |
: | 1 | 0.1% |
? | 1 | 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
g | 25 | |
k | 16 | |
m | 16 |
Math Symbol
Value | Count | Frequency (%) |
< | 1 | |
> | 1 |
Space Separator
Value | Count | Frequency (%) |
(space) | 2005 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1198 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1196 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 74 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 14816 | |
Common | 5512 | 27.0% |
Latin | 57 | 0.3% |
Han | 10 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 707 | 4.8% |
타 | 623 | 4.2% |
의 | 310 | 2.1% |
조 | 306 | 2.1% |
리 | 303 | 2.0% |
스 | 302 | 2.0% |
유 | 282 | 1.9% |
것 | 264 | 1.8% |
이 | 250 | 1.7% |
제 | 243 | 1.6% |
Other values (550) | 11226 |
Common
Value | Count | Frequency (%) |
(space) | 2005 | |
( | 1198 | |
) | 1196 | |
/ | 504 | 9.1% |
, | 117 | 2.1% |
0 | 102 | 1.9% |
- | 74 | 1.3% |
1 | 63 | 1.1% |
2 | 41 | 0.7% |
5 | 35 | 0.6% |
Other values (13) | 177 | 3.2% |
Han
Value | Count | Frequency (%) |
芎 | 1 | |
黃 | 1 | |
川 | 1 | |
當 | 1 | |
歸 | 1 | |
五 | 1 | |
味 | 1 | |
子 | 1 | |
芍 | 1 | |
藥 | 1 |
Latin
Value | Count | Frequency (%) |
g | 25 | |
k | 16 | |
m | 16 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 14816 | |
ASCII | 5538 | 27.2% |
None | 31 | 0.2% |
CJK | 10 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 2005 | |
( | 1198 | |
) | 1196 | |
/ | 504 | 9.1% |
, | 117 | 2.1% |
0 | 102 | 1.8% |
- | 74 | 1.3% |
1 | 63 | 1.1% |
2 | 41 | 0.7% |
5 | 35 | 0.6% |
Other values (15) | 203 | 3.7% |
Hangul
Value | Count | Frequency (%) |
기 | 707 | 4.8% |
타 | 623 | 4.2% |
의 | 310 | 2.1% |
조 | 306 | 2.1% |
리 | 303 | 2.0% |
스 | 302 | 2.0% |
유 | 282 | 1.9% |
것 | 264 | 1.8% |
이 | 250 | 1.7% |
제 | 243 | 1.6% |
Other values (550) | 11226 |
None
Value | Count | Frequency (%) |
· | 31 |
CJK
Value | Count | Frequency (%) |
芎 | 1 | |
黃 | 1 | |
川 | 1 | |
當 | 1 | |
歸 | 1 | |
五 | 1 | |
味 | 1 | |
子 | 1 | |
芍 | 1 | |
藥 | 1 |
Horses: Pure-bred breeding anmials(For farm breeding)
Text
Distinct | 1590 |
---|---|
Distinct (%) | 98.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.8 KiB |
Length
Max length | 150 |
---|---|
Median length | 82 |
Mean length | 31.969155 |
Min length | 4 |
Characters and Unicode
Total characters | 51822 |
---|---|
Distinct characters | 75 |
Distinct categories | 9 |
Distinct scripts | 2 |
Distinct blocks | 3 |
Unique
Unique | 1576 |
---|---|
Unique (%) | 97.2% |
Sample
1st row | Horses: Pure-bred breeding anmials(Other) |
---|---|
2nd row | Horses: Other(Horses for racing) |
3rd row | Horses: Other(Other) |
4th row | Asses: Pure-bred breeding animals |
5th row | Asses: Other |
Value | Count | Frequency (%) |
or | 367 | 5.5% |
of | 342 | 5.1% |
other | 311 | 4.6% |
and | 242 | 3.6% |
meat | 118 | 1.8% |
chilled | 80 | 1.2% |
preserved | 73 | 1.1% |
the | 63 | 0.9% |
offal | 57 | 0.8% |
by | 56 | 0.8% |
Other values (1890) | 5008 |
Most occurring characters
Value | Count | Frequency (%) |
e | 5735 | 11.1% |
(space) | 5104 | 9.8% |
r | 3916 | 7.6% |
o | 3192 | 6.2% |
a | 3131 | 6.0% |
s | 2962 | 5.7% |
t | 2938 | 5.7% |
i | 2603 | 5.0% |
n | 2417 | 4.7% |
d | 1867 | 3.6% |
Other values (65) | 17957 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 40902 | |
Space Separator | 5104 | 9.8% |
Uppercase Letter | 2545 | 4.9% |
Open Punctuation | 1089 | 2.1% |
Close Punctuation | 1088 | 2.1% |
Other Punctuation | 736 | 1.4% |
Decimal Number | 249 | 0.5% |
Dash Punctuation | 104 | 0.2% |
Other Symbol | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 5735 | |
r | 3916 | 9.6% |
o | 3192 | 7.8% |
a | 3131 | 7.7% |
s | 2962 | 7.2% |
t | 2938 | 7.2% |
i | 2603 | 6.4% |
n | 2417 | 5.9% |
d | 1867 | 4.6% |
l | 1776 | 4.3% |
Other values (16) | 10365 |
Uppercase Letter
Value | Count | Frequency (%) |
O | 717 | |
C | 239 | 9.4% |
S | 212 | 8.3% |
M | 170 | 6.7% |
P | 168 | 6.6% |
F | 126 | 5.0% |
B | 121 | 4.8% |
R | 110 | 4.3% |
G | 106 | 4.2% |
L | 89 | 3.5% |
Other values (16) | 487 |
Decimal Number
Value | Count | Frequency (%) |
0 | 75 | |
5 | 40 | |
1 | 39 | |
2 | 30 | 12.0% |
4 | 18 | 7.2% |
9 | 15 | 6.0% |
8 | 13 | 5.2% |
3 | 10 | 4.0% |
6 | 6 | 2.4% |
7 | 3 | 1.2% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 334 | |
, | 223 | |
. | 79 | 10.7% |
: | 40 | 5.4% |
% | 37 | 5.0% |
' | 16 | 2.2% |
; | 7 | 1.0% |
Other Symbol
Value | Count | Frequency (%) |
° | 4 | |
ⓚ | 1 | 20.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 5104 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1089 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1088 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 104 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 43447 | |
Common | 8375 | 16.2% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 5735 | |
r | 3916 | 9.0% |
o | 3192 | 7.3% |
a | 3131 | 7.2% |
s | 2962 | 6.8% |
t | 2938 | 6.8% |
i | 2603 | 6.0% |
n | 2417 | 5.6% |
d | 1867 | 4.3% |
l | 1776 | 4.1% |
Other values (42) | 12910 |
Common
Value | Count | Frequency (%) |
(space) | 5104 | |
( | 1089 | 13.0% |
) | 1088 | 13.0% |
/ | 334 | 4.0% |
, | 223 | 2.7% |
- | 104 | 1.2% |
. | 79 | 0.9% |
0 | 75 | 0.9% |
5 | 40 | 0.5% |
: | 40 | 0.5% |
Other values (13) | 199 | 2.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 51817 | |
None | 4 | < 0.1% |
Enclosed Alphanum | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 5735 | 11.1% |
(space) | 5104 | 9.9% |
r | 3916 | 7.6% |
o | 3192 | 6.2% |
a | 3131 | 6.0% |
s | 2962 | 5.7% |
t | 2938 | 5.7% |
i | 2603 | 5.0% |
n | 2417 | 4.7% |
d | 1867 | 3.6% |
Other values (63) | 17952 |
None
Value | Count | Frequency (%) |
° | 4 |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓚ | 1 |
0
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 27 |
---|---|
Distinct (%) | 1.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.595928 |
Minimum | 0 |
---|---|
Maximum | 50 |
Zeros | 106 |
Zeros (%) | 6.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 14.4 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 5 |
median | 8 |
Q3 | 27 |
95-th percentile | 45 |
Maximum | 50 |
Range | 50 |
Interquartile range (IQR) | 22 |
Descriptive statistics
Standard deviation | 13.686267 |
---|---|
Coefficient of variation (CV) | 0.82467617 |
Kurtosis | -0.51322946 |
Mean | 16.595928 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 0.74881973 |
Sum | 26902 |
Variance | 187.3139 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
8.0 | 456 | |
30.0 | 203 | |
5.0 | 142 | 8.8% |
20.0 | 128 | 7.9% |
3.0 | 122 | 7.5% |
0.0 | 106 | 6.5% |
27.0 | 102 | 6.3% |
45.0 | 58 | 3.6% |
18.0 | 54 | 3.3% |
40.0 | 46 | 2.8% |
Other values (17) | 204 |
Value | Count | Frequency (%) |
0.0 | 106 | |
1.0 | 2 | 0.1% |
1.8 | 4 | 0.2% |
2.0 | 28 | 1.7% |
3.0 | 122 | |
4.0 | 1 | 0.1% |
4.2 | 3 | 0.2% |
5.0 | 142 | |
5.4 | 6 | 0.4% |
7.0 | 1 | 0.1% |
Value | Count | Frequency (%) |
50.0 | 46 | 2.8% |
45.0 | 58 | 3.6% |
40.0 | 46 | 2.8% |
36.0 | 37 | 2.3% |
32.8 | 1 | 0.1% |
30.0 | 203 | |
27.0 | 102 | |
25.0 | 20 | 1.2% |
24.0 | 2 | 0.1% |
22.5 | 38 | 2.3% |
20
Text
Distinct | 85 |
---|---|
Distinct (%) | 5.2% |
Missing | 1 |
Missing (%) | 0.1% |
Memory size | 12.8 KiB |
Value | Count | Frequency (%) |
30 | 283 | |
20 | 235 | |
10 | 143 | 8.8% |
60 | 98 | 6.0% |
40 | 74 | 4.6% |
50 | 66 | 4.1% |
100 | 59 | 3.6% |
25 | 52 | 3.2% |
59.2 | 48 | 3.0% |
35 | 40 | 2.5% |
Other values (75) | 522 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 1118 | |
2 | 494 | |
3 | 460 | |
5 | 399 | 9.8% |
1 | 339 | 8.4% |
. | 319 | 7.9% |
4 | 256 | 6.3% |
6 | 197 | 4.9% |
9 | 182 | 4.5% |
7 | 144 | 3.6% |
Other values (2) | 143 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 3716 | |
Other Punctuation | 319 | 7.9% |
Dash Punctuation | 16 | 0.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 1118 | |
2 | 494 | |
3 | 460 | |
5 | 399 | 10.7% |
1 | 339 | 9.1% |
4 | 256 | 6.9% |
6 | 197 | 5.3% |
9 | 182 | 4.9% |
7 | 144 | 3.9% |
8 | 127 | 3.4% |
Other Punctuation
Value | Count | Frequency (%) |
. | 319 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 16 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4051 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 1118 | |
2 | 494 | |
3 | 460 | |
5 | 399 | 9.8% |
1 | 339 | 8.4% |
. | 319 | 7.9% |
4 | 256 | 6.3% |
6 | 197 | 4.9% |
9 | 182 | 4.5% |
7 | 144 | 3.6% |
Other values (2) | 143 | 3.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4051 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 1118 | |
2 | 494 | |
3 | 460 | |
5 | 399 | 9.8% |
1 | 339 | 8.4% |
. | 319 | 7.9% |
4 | 256 | 6.3% |
6 | 197 | 4.9% |
9 | 182 | 4.5% |
7 | 144 | 3.6% |
Other values (2) | 143 | 3.5% |
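Column `20` is typed as Text even though its common values look numeric; the character tables show 16 dash characters, which would be enough to prevent numeric inference. A minimal sketch of one way to coerce such cells follows. It assumes, without confirmation from the report, that `-` appears alone as a "no value" placeholder; `parse_rate` is a hypothetical helper, not part of any library.

```python
def parse_rate(text):
    """Return the cell as a float, or None for missing or placeholder cells.

    Assumption: a lone "-" marks "no value"; the report only shows that
    dash characters occur, not how they are used.
    """
    if text is None:
        return None
    cleaned = text.strip()
    if cleaned in ("", "-"):
        return None
    return float(cleaned)


# Sample values taken from the common-values table, plus the placeholder cases.
samples = ["30", "20", "59.2", "-", None]
parsed = [parse_rate(s) for s in samples]
print(parsed)
```

If the dashes turn out to be part of ranges or negative numbers, this helper would need a different rule, so it is worth inspecting the 16 affected rows before applying it.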
13.1
Text
Distinct | 101 |
---|---|
Distinct (%) | 6.2% |
Missing | 4 |
Missing (%) | 0.2% |
Memory size | 12.8 KiB |
Value | Count | Frequency (%) |
27 | 168 | 10.4% |
18 | 125 | 7.7% |
19.7 | 123 | 7.6% |
54 | 121 | 7.5% |
13.1 | 109 | 6.7% |
45 | 93 | 5.8% |
36 | 71 | 4.4% |
6.6 | 57 | 3.5% |
22.5 | 55 | 3.4% |
30 | 54 | 3.3% |
Other values (91) | 641 |
Most occurring characters
Value | Count | Frequency (%) |
. | 632 | |
1 | 607 | |
2 | 512 | |
5 | 501 | |
7 | 398 | |
3 | 391 | |
4 | 384 | |
6 | 285 | |
9 | 233 | 5.3% |
0 | 232 | 5.2% |
Other values (2) | 246 | 5.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 3773 | |
Other Punctuation | 632 | 14.3% |
Dash Punctuation | 16 | 0.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 607 | |
2 | 512 | |
5 | 501 | |
7 | 398 | |
3 | 391 | |
4 | 384 | |
6 | 285 | |
9 | 233 | 6.2% |
0 | 232 | 6.1% |
8 | 230 | 6.1% |
Other Punctuation
Value | Count | Frequency (%) |
. | 632 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 16 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4421 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
. | 632 | |
1 | 607 | |
2 | 512 | |
5 | 501 | |
7 | 398 | |
3 | 391 | |
4 | 384 | |
6 | 285 | |
9 | 233 | 5.3% |
0 | 232 | 5.2% |
Other values (2) | 246 | 5.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4421 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 632 | |
1 | 607 | |
2 | 512 | |
5 | 501 | |
7 | 398 | |
3 | 391 | |
4 | 384 | |
6 | 285 | |
9 | 233 | 5.3% |
0 | 232 | 5.2% |
Other values (2) | 246 | 5.6% |
Unnamed: 8
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 17 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.8 KiB |
<NA> | |
---|---|
20 | 73 |
- | 45 |
5 | 43 |
8 | 24 |
Other values (12) | 87 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.5823566 |
Min length | 1 |
Unique
Unique | 3 |
---|---|
Unique (%) | 0.2% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 1349 | |
20 | 73 | 4.5% |
- | 45 | 2.8% |
5 | 43 | 2.7% |
8 | 24 | 1.5% |
50 | 20 | 1.2% |
40 | 16 | 1.0% |
3 | 14 | 0.9% |
30 | 13 | 0.8% |
0 | 11 | 0.7% |
Other values (7) | 13 | 0.8% |
Length
Value | Count | Frequency (%) |
na | 1349 | |
20 | 73 | 4.5% |
(empty) | 45 | 2.8% |
5 | 43 | 2.7% |
8 | 24 | 1.5% |
50 | 20 | 1.2% |
40 | 16 | 1.0% |
3 | 14 | 0.9% |
30 | 13 | 0.8% |
0 | 11 | 0.7% |
Other values (7) | 13 | 0.8% |
Unnamed: 9
Real number (ℝ)
MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 13.0% |
Missing | 1575 |
Missing (%) | 97.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20.543478 |
Minimum | 5 |
---|---|
Maximum | 40 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 14.4 KiB |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 15 |
Q1 | 15 |
median | 15 |
Q3 | 20 |
95-th percentile | 40 |
Maximum | 40 |
Range | 35 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 9.2031553 |
---|---|
Coefficient of variation (CV) | 0.44798428 |
Kurtosis | 0.10630103 |
Mean | 20.543478 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 1.017925 |
Sum | 945 |
Variance | 84.698068 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
15 | 22 | 1.4% |
20 | 12 | 0.7% |
35 | 5 | 0.3% |
40 | 4 | 0.2% |
5 | 2 | 0.1% |
30 | 1 | 0.1% |
(Missing) | 1575 |
Value | Count | Frequency (%) |
5 | 2 | 0.1% |
15 | 22 | |
20 | 12 | |
30 | 1 | 0.1% |
35 | 5 | 0.3% |
40 | 4 | 0.2% |
Value | Count | Frequency (%) |
40 | 4 | 0.2% |
35 | 5 | 0.3% |
30 | 1 | 0.1% |
20 | 12 | |
15 | 22 | |
5 | 2 | 0.1% |
Unnamed: 10
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 15 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.8 KiB |
<NA> | |
---|---|
할당0 (12월말) | 41 |
조정45% | 5 |
할당25 (6월말) | 4 |
할당1 (6월말) | 3 |
Other values (10) | 11 |
Length
Max length | 15 |
---|---|
Median length | 4 |
Mean length | 4.2264035 |
Min length | 4 |
Unique
Unique | 9 |
---|---|
Unique (%) | 0.6% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 1557 | |
할당0 (12월말) | 41 | 2.5% |
조정45% | 5 | 0.3% |
할당25 (6월말) | 4 | 0.2% |
할당1 (6월말) | 3 | 0.2% |
할당5 (6월말) | 2 | 0.1% |
조정40%,1625원/kg | 1 | 0.1% |
조정40%,1,625원/kg | 1 | 0.1% |
할당10/0 (12월말) | 1 | 0.1% |
할당4 (12월말) | 1 | 0.1% |
Other values (5) | 5 | 0.3% |
Length
Value | Count | Frequency (%) |
na | 1557 | |
12월말 | 44 | 2.6% |
할당0 | 41 | 2.4% |
6월말 | 10 | 0.6% |
조정45 | 5 | 0.3% |
할당25 | 4 | 0.2% |
할당1 | 3 | 0.2% |
할당5 | 3 | 0.2% |
할당4 | 2 | 0.1% |
206원/kg | 1 | 0.1% |
Other values (7) | 7 | 0.4% |
Unnamed: 11
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 8 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.8 KiB |
<NA> | |
---|---|
TC | 72 |
TM | 51 |
BM | 51 |
BC | 49 |
Other values (3) | 68 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.6409624 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 1330 | |
TC | 72 | 4.4% |
TM | 51 | 3.1% |
BM | 51 | 3.1% |
BC | 49 | 3.0% |
BX | 40 | 2.5% |
ST | 16 | 1.0% |
TX | 12 | 0.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 1330 | |
tc | 72 | 4.4% |
tm | 51 | 3.1% |
bm | 51 | 3.1% |
bc | 49 | 3.0% |
bx | 40 | 2.5% |
st | 16 | 1.0% |
tx | 12 | 0.7% |
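The 61.7% imbalance reported for `Unnamed: 11` is consistent with a score defined as one minus the normalized Shannon entropy of the value counts. That this is the exact formula the profiling tool uses is an assumption here; the sketch below only shows that the definition reproduces a value close to the reported one from the counts in the "Common Values" table. The function name `imbalance_score` is illustrative.

```python
import math

# Value counts copied from the "Common Values" table for "Unnamed: 11".
counts = {"<NA>": 1330, "TC": 72, "TM": 51, "BM": 51,
          "BC": 49, "BX": 40, "ST": 16, "TX": 12}


def imbalance_score(value_counts):
    """Return 1 - H / log2(k), with H the Shannon entropy in bits over k classes."""
    total = sum(value_counts)
    k = len(value_counts)
    if k < 2:
        return 0.0  # convention chosen for this sketch only
    entropy = -sum((c / total) * math.log2(c / total)
                   for c in value_counts if c > 0)
    return 1 - entropy / math.log2(k)


score = imbalance_score(list(counts.values()))
print(f"{score:.1%}")
```

A score of 0 would mean all categories are equally frequent and a score near 1 means a single category dominates, which matches the picture here: `<NA>` accounts for 1330 of 1621 rows.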
Unnamed: 12
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 7 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 12.8 KiB |
<NA> | |
---|---|
95.1 | |
97.7 | 39 |
96.7 | 22 |
- | 16 |
Other values (2) | 17 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.961752 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 1330 | |
95.1 | 197 | 12.2% |
97.7 | 39 | 2.4% |
96.7 | 22 | 1.4% |
- | 16 | 1.0% |
1.1 | 14 | 0.9% |
96.1 | 3 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 1330 | |
95.1 | 197 | 12.2% |
97.7 | 39 | 2.4% |
96.7 | 22 | 1.4% |
(empty) | 16 | 1.0% |
1.1 | 14 | 0.9% |
96.1 | 3 | 0.2% |
Variable | 1 | 0 | 20 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 |
---|---|---|---|---|---|---|---|---|
1 | 1.000 | 0.765 | 0.868 | 0.783 | 0.912 | 0.844 | 0.672 | 0.680 |
0 | 0.765 | 1.000 | 0.912 | 0.893 | 0.920 | 0.892 | 0.677 | 0.777 |
20 | 0.868 | 0.912 | 1.000 | 0.984 | 0.832 | 0.856 | 0.980 | 0.974 |
Unnamed: 8 | 0.783 | 0.893 | 0.984 | 1.000 | NaN | 0.758 | 0.792 | 0.725 |
Unnamed: 9 | 0.912 | 0.920 | 0.832 | NaN | 1.000 | NaN | NaN | NaN |
Unnamed: 10 | 0.844 | 0.892 | 0.856 | 0.758 | NaN | 1.000 | 0.266 | 0.000 |
Unnamed: 11 | 0.672 | 0.677 | 0.980 | 0.792 | NaN | 0.266 | 1.000 | 0.730 |
Unnamed: 12 | 0.680 | 0.777 | 0.974 | 0.725 | NaN | 0.000 | 0.730 | 1.000 |
Variable | Unnamed: 11 | Unnamed: 8 | Unnamed: 12 | Unnamed: 10 |
---|---|---|---|---|
Unnamed: 11 | 1.000 | 0.511 | 0.545 | 0.240 |
Unnamed: 8 | 0.511 | 1.000 | 0.465 | 0.567 |
Unnamed: 12 | 0.545 | 0.465 | 1.000 | 0.000 |
Unnamed: 10 | 0.240 | 0.567 | 0.000 | 1.000 |
Variable | 1 | 0 | Unnamed: 9 | Unnamed: 8 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 |
---|---|---|---|---|---|---|---|
1 | 1.000 | -0.219 | -0.174 | 0.446 | 0.540 | 0.421 | 0.441 |
0 | -0.219 | 1.000 | -0.052 | 0.674 | 0.539 | 0.377 | 0.473 |
Unnamed: 9 | -0.174 | -0.052 | 1.000 | 0.000 | NaN | 0.000 | 0.000 |
Unnamed: 8 | 0.446 | 0.674 | 0.000 | 1.000 | 0.567 | 0.511 | 0.465 |
Unnamed: 10 | 0.540 | 0.539 | NaN | 0.567 | 1.000 | 0.240 | 0.000 |
Unnamed: 11 | 0.421 | 0.377 | 0.000 | 0.511 | 0.240 | 1.000 | 0.545 |
Unnamed: 12 | 0.441 | 0.473 | 0.000 | 0.465 | 0.000 | 0.545 | 1.000 |
Index | 2014 | 1 | 0101.21.1000 | 말(번식용/농가사육용) | Horses: Pure-bred breeding anmials(For farm breeding) | 0 | 20 | 13.1 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2014 | 2 | 0101.21.9000 | 말(번식용/기타) | Horses: Pure-bred breeding anmials(Other) | 8.0 | 20 | 13.1 | <NA> | <NA> | <NA> | <NA> | <NA> |
1 | 2014 | 3 | 0101.29.1000 | 말(기타/경주말) | Horses: Other(Horses for racing) | 8.0 | 20 | 13.1 | <NA> | <NA> | <NA> | <NA> | <NA> |
2 | 2014 | 4 | 0101.29.9000 | 말(기타/기타) | Horses: Other(Other) | 8.0 | 20 | 13.1 | <NA> | <NA> | <NA> | <NA> | <NA> |
3 | 2014 | 5 | 0101.30.1000 | 당나귀(번식용) | Asses: Pure-bred breeding animals | 8.0 | 20 | 13.1 | <NA> | <NA> | <NA> | <NA> | <NA> |
4 | 2014 | 6 | 0101.30.9000 | 당나귀(기타) | Asses: Other | 8.0 | 20 | 13.1 | <NA> | <NA> | <NA> | <NA> | <NA> |
5 | 2014 | 7 | 0101.90.0000 | 기타(기타) | Other | 8.0 | 20 | 13.1 | <NA> | <NA> | <NA> | <NA> | <NA> |
6 | 2014 | 8 | 0102.21.1000 | 축우(번식용/젖소) | Cattle: Pure-bred breeding animals(For milk) | 0.0 | 99 | 89.1 | 0 | <NA> | <NA> | TM | 95.1 |
7 | 2014 | 9 | 0102.21.2000 | 축우(번식용/육우) | Cattle: Pure-bred breeding animals(For meat) | 0.0 | 99 | 89.1 | 0 | <NA> | <NA> | TM | 95.1 |
8 | 2014 | 10 | 0102.21.9000 | 축우(번식용/기타) | Cattle: Pure-bred breeding animals(Other) | 0.0 | 99 | 89.1 | 0 | <NA> | <NA> | TM | 95.1 |
9 | 2014 | 11 | 0102.29.1000 | 축우(기타/젖소) | Cattle: Other(For milk) | 20.0 | 44.5 | 40 | - | <NA> | <NA> | BX | 1.1 |
Index | 2014 | 1 | 0101.21.1000 | 말(번식용/농가사육용) | Horses: Pure-bred breeding anmials(For farm breeding) | 0 | 20 | 13.1 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1611 | 2014 | 1613 | 5203.00.0000 | 면(카드 또는 코움한 것) | Cotton, carded or combed | 0.0 | 10 | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |
1612 | 2014 | 1614 | 5301.10.0000 | 생아마 또는 침지아마 | Flax, raw or retted | 2.0 | 10 | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |
1613 | 2014 | 1615 | 5301.21.0000 | 아마(쇄경 또는 타마한 것) | Flax(broken or scutched) | 2.0 | 10 | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |
1614 | 2014 | 1616 | 5301.29.0000 | 아마(기타) | Other flax | 2.0 | 10 | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |
1615 | 2014 | 1617 | 5301.30.1000 | 아마의 토우 | Flax tow | 2.0 | 10 | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |
1616 | 2014 | 1618 | 5301.30.2000 | 아마의 웨이스트 | Flax waste | 2.0 | 10 | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |
1617 | 2014 | 1619 | 5302.10.0000 | 생대마 또는 침지대마 | True hemp, raw or retted | 2.0 | 10 | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |
1618 | 2014 | 1620 | 5302.90.1000 | 쇄경, 탐, 핵클 또는 기타의 방법으로 가공한 대마 | True hemp, broken, scutched, hackled or other wise processed | 2.0 | 10 | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |
1619 | 2014 | 1621 | 5302.90.2010 | 대마의 토우 | Tow of true hemp | 2.0 | 10 | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |
1620 | 2014 | 1622 | 5302.90.2020 | 대마의 웨이스트 | Waste of true hemp | 2.0 | 10 | 2 | <NA> | <NA> | <NA> | <NA> | <NA> |