Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 49 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 3.4 KiB |
Average record size in memory | 71.7 B |
Variable types
Text | 1 |
---|---|
Categorical | 5 |
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | 올시데이터 |
URL | https://www.bigdata-sea.kr/datasearch/base/view.do?prodId=PROD_000466 |
PRFMC has constant value "0" | Constant |
FUEL_CNSMP_QTY has constant value "0" | Constant |
NVGTN_DIST is highly overall correlated with SHIP_CNT | High correlation |
SHIP_CNT is highly overall correlated with NVGTN_DIST | High correlation |
SHIP_CNT is highly imbalanced (59.6%) | Imbalance |
DPTR_HMS is highly imbalanced (51.5%) | Imbalance |
SHIP_OWNER_NM has unique values | Unique |
NVGTN_DIST has unique values | Unique |
RN has unique values | Unique |
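The imbalance percentages in the alerts above are consistent with one minus the normalized Shannon entropy of each column's value counts. The sketch below rests on that assumption (it is not taken from the profiler's source, and the function name `imbalance_score` is hypothetical); it uses the value counts listed for SHIP_CNT and DPTR_HMS in this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the counts
    and k is the number of distinct values (assumed definition)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Value counts as listed in the Common Values tables of this report.
ship_cnt_counts = [41, 3, 3, 1, 1]
dptr_hms_counts = [36, 5, 2, 1, 1, 1, 1, 1, 1]

print(f"SHIP_CNT: {imbalance_score(ship_cnt_counts):.1%}")  # 59.6%
print(f"DPTR_HMS: {imbalance_score(dptr_hms_counts):.1%}")  # 51.5%
```

A score near 100% means one value dominates the column; a score near 0% means the values are evenly spread.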
Reproduction
Analysis started | 2023-12-10 14:32:24.990771 |
---|---|
Analysis finished | 2023-12-10 14:32:27.599712 |
Duration | 2.61 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
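A report of this kind can be regenerated along the following lines. This is a minimal sketch, assuming the sample is available as a CSV file; the file names and the function name `build_profile` are hypothetical, and the settings stored in config.json are not reproduced here.

```python
def build_profile(csv_path, html_path):
    """Profile a CSV file and write the HTML report (paths are hypothetical)."""
    # Imported lazily so this module still loads where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
    return report

# Example call (hypothetical file names):
# build_profile("ship_owner_sample.csv", "report.html")
```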
SHIP_OWNER_NM
Text
UNIQUE
 
Distinct | 49 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 524.0 B |
Length
Max length | 30 |
---|---|
Median length | 25 |
Mean length | 17.55102 |
Min length | 3 |
Characters and Unicode
Total characters | 860 |
---|---|
Distinct characters | 27 |
Distinct categories | 3 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 49 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | FUJIAN YUANYUAN SHIPPING |
---|---|
2nd row | ZHEJIANG CHAOSHENG SHIPPING |
3rd row | DONGHAI SHIPPING |
4th row | ANRUN SHIPPING |
5th row | HAIHUA SHIPPING |
Value | Count | Frequency (%) |
shipping | 18 | 14.8% |
marine | 3 | 2.5% |
tankers | 3 | 2.5% |
shipmanagement | 3 | 2.5% |
maritime | 2 | 1.6% |
management | 2 | 1.6% |
trading | 2 | 1.6% |
nordic | 2 | 1.6% |
lng | 2 | 1.6% |
hong | 2 | 1.6% |
Other values (81) | 83 |
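The word table above can be approximated by lowercasing each owner name and splitting on whitespace. That tokenization is an assumption inferred from the lowercase entries in the table, and `word_frequencies` is a hypothetical helper; the input below is only the five sample rows shown in this report, not the full column.

```python
from collections import Counter

def word_frequencies(names):
    """Return (word, count, share) tuples, most frequent first."""
    counts = Counter(word for name in names for word in name.lower().split())
    total = sum(counts.values())
    return [(word, count, count / total) for word, count in counts.most_common()]

# The five sample rows listed for SHIP_OWNER_NM.
sample_names = [
    "FUJIAN YUANYUAN SHIPPING",
    "ZHEJIANG CHAOSHENG SHIPPING",
    "DONGHAI SHIPPING",
    "ANRUN SHIPPING",
    "HAIHUA SHIPPING",
]

for word, count, share in word_frequencies(sample_names)[:3]:
    print(f"{word} | {count} | {share:.1%}")
```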
Most occurring characters
Value | Count | Frequency (%) |
I | 96 | 11.2% |
N | 93 | 10.8% |
A | 84 | 9.8% |
(space) | 73 | 8.5% |
E | 58 | 6.7% |
S | 55 | 6.4% |
P | 50 | 5.8% |
H | 46 | 5.3% |
G | 43 | 5.0% |
R | 42 | 4.9% |
Other values (17) | 220 | 25.6% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 786 | 91.4% |
Space Separator | 73 | 8.5% |
Other Punctuation | 1 | 0.1% |
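The category names in this table correspond to Unicode general categories (Lu, Zs and Po). The following sketch counts characters per category with the standard library; `category_counts` is a hypothetical helper, and the second input string is an illustrative value rather than a row from the dataset.

```python
import unicodedata
from collections import Counter

# Unicode general-category codes mapped to the labels used in the report.
CATEGORY_LABELS = {
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
}

def category_counts(strings):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in strings:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts

# First string is a sample row; "A & B" is purely illustrative.
example = category_counts(["MAERSK SHIPPING HONG KONG", "A & B"])
for label, count in example.most_common():
    print(f"{label} | {count}")
```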
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
I | 96 | 12.2% |
N | 93 | 11.8% |
A | 84 | 10.7% |
E | 58 | 7.4% |
S | 55 | 7.0% |
P | 50 | 6.4% |
H | 46 | 5.9% |
G | 43 | 5.5% |
R | 42 | 5.3% |
T | 32 | 4.1% |
Other values (15) | 187 | 23.8% |
Space Separator
Value | Count | Frequency (%) |
(space) | 73 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
& | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 786 | 91.4% |
Common | 74 | 8.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
I | 96 | 12.2% |
N | 93 | 11.8% |
A | 84 | 10.7% |
E | 58 | 7.4% |
S | 55 | 7.0% |
P | 50 | 6.4% |
H | 46 | 5.9% |
G | 43 | 5.5% |
R | 42 | 5.3% |
T | 32 | 4.1% |
Other values (15) | 187 | 23.8% |
Common
Value | Count | Frequency (%) |
(space) | 73 | 98.6% |
& | 1 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 860 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
I | 96 | 11.2% |
N | 93 | 10.8% |
A | 84 | 9.8% |
(space) | 73 | 8.5% |
E | 58 | 6.7% |
S | 55 | 6.4% |
P | 50 | 5.8% |
H | 46 | 5.3% |
G | 43 | 5.0% |
R | 42 | 4.9% |
Other values (17) | 220 | 25.6% |
SHIP_CNT
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 10.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 524.0 B |
1 | 41 |
---|---|
3 | 3 |
2 | 3 |
21 | 1 |
5 | 1 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.0204082 |
Min length | 1 |
Unique
Unique | 2 |
---|---|
Unique (%) | 4.1% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 41 | 83.7% |
3 | 3 | 6.1% |
2 | 3 | 6.1% |
21 | 1 | 2.0% |
5 | 1 | 2.0% |
Words
Value | Count | Frequency (%) |
1 | 41 | 83.7% |
3 | 3 | 6.1% |
2 | 3 | 6.1% |
21 | 1 | 2.0% |
5 | 1 | 2.0% |
DPTR_HMS
Categorical
IMBALANCE
 
Distinct | 9 |
---|---|
Distinct (%) | 18.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 524.0 B |
01-Jan-2021 00:00:00 | 36 |
---|---|
02-Jan-2021 00:00:00 | 5 |
05-Jan-2021 00:00:00 | 2 |
10-Jan-2021 00:00:00 | 1 |
11-Jan-2021 00:00:00 | 1 |
Other values (4) | 4 |
Length
Max length | 20 |
---|---|
Median length | 20 |
Mean length | 20 |
Min length | 20 |
Unique
Unique | 6 |
---|---|
Unique (%) | 12.2% |
Sample
1st row | 01-Jan-2021 00:00:00 |
---|---|
2nd row | 01-Jan-2021 00:00:00 |
3rd row | 02-Jan-2021 00:00:00 |
4th row | 01-Jan-2021 00:00:00 |
5th row | 01-Jan-2021 00:00:00 |
Common Values
Value | Count | Frequency (%) |
01-Jan-2021 00:00:00 | 36 | 73.5% |
02-Jan-2021 00:00:00 | 5 | 10.2% |
05-Jan-2021 00:00:00 | 2 | 4.1% |
10-Jan-2021 00:00:00 | 1 | 2.0% |
11-Jan-2021 00:00:00 | 1 | 2.0% |
06-Jan-2021 00:00:00 | 1 | 2.0% |
03-Feb-2021 00:00:00 | 1 | 2.0% |
04-Jan-2021 00:00:00 | 1 | 2.0% |
03-Jan-2021 00:00:00 | 1 | 2.0% |
Words
Value | Count | Frequency (%) |
00:00:00 | 49 | 50.0% |
01-jan-2021 | 36 | 36.7% |
02-jan-2021 | 5 | 5.1% |
05-jan-2021 | 2 | 2.0% |
10-jan-2021 | 1 | 1.0% |
11-jan-2021 | 1 | 1.0% |
06-jan-2021 | 1 | 1.0% |
03-feb-2021 | 1 | 1.0% |
04-jan-2021 | 1 | 1.0% |
03-jan-2021 | 1 | 1.0% |
ARVL_HMS
Categorical
Distinct | 21 |
---|---|
Distinct (%) | 42.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 524.0 B |
13-Oct-2021 18:00:00 | 22 |
---|---|
13-Oct-2021 12:00:00 | 4 |
12-Oct-2021 18:00:00 | 2 |
13-Oct-2021 06:00:00 | 2 |
11-Oct-2021 18:00:00 | 2 |
Other values (16) | 17 |
Length
Max length | 20 |
---|---|
Median length | 20 |
Mean length | 20 |
Min length | 20 |
Unique
Unique | 15 |
---|---|
Unique (%) | 30.6% |
Sample
1st row | 12-Oct-2021 06:00:00 |
---|---|
2nd row | 13-Oct-2021 18:00:00 |
3rd row | 13-Oct-2021 18:00:00 |
4th row | 12-Oct-2021 00:00:00 |
5th row | 13-Oct-2021 12:00:00 |
Common Values
Value | Count | Frequency (%) |
13-Oct-2021 18:00:00 | 22 | 44.9% |
13-Oct-2021 12:00:00 | 4 | 8.2% |
12-Oct-2021 18:00:00 | 2 | 4.1% |
13-Oct-2021 06:00:00 | 2 | 4.1% |
11-Oct-2021 18:00:00 | 2 | 4.1% |
10-Oct-2021 00:00:00 | 2 | 4.1% |
09-Jul-2021 06:00:00 | 1 | 2.0% |
12-Oct-2021 00:00:00 | 1 | 2.0% |
10-Jan-2021 18:00:00 | 1 | 2.0% |
03-Jul-2021 12:00:00 | 1 | 2.0% |
Other values (11) | 11 | 22.4% |
Words
Value | Count | Frequency (%) |
18:00:00 | 29 | 29.6% |
13-oct-2021 | 28 | 28.6% |
06:00:00 | 9 | 9.2% |
12:00:00 | 7 | 7.1% |
12-oct-2021 | 4 | 4.1% |
10-oct-2021 | 4 | 4.1% |
00:00:00 | 4 | 4.1% |
11-oct-2021 | 2 | 2.0% |
04-aug-2021 | 1 | 1.0% |
26-apr-2021 | 1 | 1.0% |
Other values (9) | 9 | 9.2% |
PRFMC
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 524.0 B |
0 | 49 |
---|---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 49 | 100.0% |
Words
Value | Count | Frequency (%) |
0 | 49 | 100.0% |
FUEL_CNSMP_QTY
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 524.0 B |
0 | 49 |
---|---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 49 | 100.0% |
Words
Value | Count | Frequency (%) |
0 | 49 | 100.0% |
NVGTN_DIST
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 49 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 112894.34 |
Minimum | 1332.14 |
---|---|
Maximum | 1711330 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 573.0 B |
Quantile statistics
Minimum | 1332.14 |
---|---|
5-th percentile | 2204.486 |
Q1 | 35396.3 |
median | 61666.9 |
Q3 | 92328.3 |
95-th percentile | 276396.6 |
Maximum | 1711330 |
Range | 1709997.9 |
Interquartile range (IQR) | 56932 |
Descriptive statistics
Standard deviation | 244040.43 |
---|---|
Coefficient of variation (CV) | 2.161671 |
Kurtosis | 40.286714 |
Mean | 112894.34 |
Median Absolute Deviation (MAD) | 26828.9 |
Skewness | 6.1071315 |
Sum | 5531822.9 |
Variance | 5.9555732 × 10^10 |
Monotonicity | Not monotonic |
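Several of the figures above are simple functions of others in the same tables. As a quick consistency check, this sketch recomputes the coefficient of variation, range, interquartile range and variance from the reported mean, standard deviation, extremes and quartiles. The inputs are the rounded values printed in this report, so small rounding differences are expected.

```python
# Reported summary values for NVGTN_DIST (rounded as printed above).
mean = 112894.34
std = 244040.43
minimum, maximum = 1332.14, 1711330.0
q1, q3 = 35396.3, 92328.3

cv = std / mean                  # coefficient of variation
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range
variance = std ** 2              # variance as the squared standard deviation

print(f"CV       = {cv:.6f}")
print(f"Range    = {value_range:.1f}")
print(f"IQR      = {iqr:.1f}")
print(f"Variance = {variance:.7e}")
```

A CV above 2 together with a skewness above 6 points to a heavy right tail, driven largely by the single maximum of 1711330.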
Value | Count | Frequency (%) |
39506.6 | 1 | 2.0% |
6661.0 | 1 | 2.0% |
193437.0 | 1 | 2.0% |
35396.3 | 1 | 2.0% |
36620.8 | 1 | 2.0% |
263697.0 | 1 | 2.0% |
64499.9 | 1 | 2.0% |
71345.5 | 1 | 2.0% |
124641.0 | 1 | 2.0% |
69053.9 | 1 | 2.0% |
Other values (39) | 39 | 79.6% |
Minimum 10 values
Value | Count | Frequency (%) |
1332.14 | 1 | 2.0% |
1424.23 | 1 | 2.0% |
1971.17 | 1 | 2.0% |
2554.46 | 1 | 2.0% |
3854.49 | 1 | 2.0% |
6661.0 | 1 | 2.0% |
10372.1 | 1 | 2.0% |
25623.1 | 1 | 2.0% |
30357.6 | 1 | 2.0% |
30477.7 | 1 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
1711330.0 | 1 | 2.0% |
321102.0 | 1 | 2.0% |
284863.0 | 1 | 2.0% |
263697.0 | 1 | 2.0% |
196415.0 | 1 | 2.0% |
193437.0 | 1 | 2.0% |
171599.0 | 1 | 2.0% |
145686.0 | 1 | 2.0% |
131455.0 | 1 | 2.0% |
131000.0 | 1 | 2.0% |
RN
Real number (ℝ)
UNIQUE
 
Distinct | 49 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26 |
Minimum | 2 |
---|---|
Maximum | 50 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 573.0 B |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 4.4 |
Q1 | 14 |
median | 26 |
Q3 | 38 |
95-th percentile | 47.6 |
Maximum | 50 |
Range | 48 |
Interquartile range (IQR) | 24 |
Descriptive statistics
Standard deviation | 14.28869 |
---|---|
Coefficient of variation (CV) | 0.54956501 |
Kurtosis | -1.2 |
Mean | 26 |
Median Absolute Deviation (MAD) | 12 |
Skewness | 0 |
Sum | 1274 |
Variance | 204.16667 |
Monotonicity | Strictly increasing |
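The RN statistics are what one would expect from a plain row counter. The sketch below assumes RN holds the 49 consecutive integers 2 through 50 (consistent with the reported minimum, maximum, distinct count and sum, but not confirmed by the report) and recomputes the summary with the standard library. The `percentile` helper is hypothetical and uses linear interpolation between neighbouring order statistics.

```python
import statistics

def percentile(sorted_values, q):
    """Linear-interpolation percentile for q in [0, 1]."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

rn = list(range(2, 51))  # assumed contents: 49 consecutive integers, 2..50

mean = statistics.mean(rn)
variance = statistics.variance(rn)  # sample variance (n - 1 denominator)
std = statistics.stdev(rn)          # sample standard deviation
median = statistics.median(rn)
mad = statistics.median(abs(x - median) for x in rn)  # median absolute deviation

print(mean, median, mad, sum(rn))
print(round(variance, 5), round(std, 5), round(std / mean, 8))
print(percentile(rn, 0.05), percentile(rn, 0.25), percentile(rn, 0.75))
```

Under this assumption the sample variance and standard deviation round to the reported 204.16667 and 14.28869, which suggests the report uses the n - 1 denominator.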
Value | Count | Frequency (%) |
2 | 1 | 2.0% |
39 | 1 | 2.0% |
29 | 1 | 2.0% |
30 | 1 | 2.0% |
31 | 1 | 2.0% |
32 | 1 | 2.0% |
33 | 1 | 2.0% |
34 | 1 | 2.0% |
35 | 1 | 2.0% |
36 | 1 | 2.0% |
Other values (39) | 39 | 79.6% |
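Minimum 10 values
Value | Count | Frequency (%) |
2 | 1 | 2.0% |
3 | 1 | 2.0% |
4 | 1 | 2.0% |
5 | 1 | 2.0% |
6 | 1 | 2.0% |
7 | 1 | 2.0% |
8 | 1 | 2.0% |
9 | 1 | 2.0% |
10 | 1 | 2.0% |
11 | 1 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
50 | 1 | 2.0% |
49 | 1 | 2.0% |
48 | 1 | 2.0% |
47 | 1 | 2.0% |
46 | 1 | 2.0% |
45 | 1 | 2.0% |
44 | 1 | 2.0% |
43 | 1 | 2.0% |
42 | 1 | 2.0% |
41 | 1 | 2.0% |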
Variable | SHIP_OWNER_NM | SHIP_CNT | DPTR_HMS | ARVL_HMS | NVGTN_DIST | RN |
---|---|---|---|---|---|---|
SHIP_OWNER_NM | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
SHIP_CNT | 1.000 | 1.000 | 0.000 | 0.528 | 0.890 | 0.319 |
DPTR_HMS | 1.000 | 0.000 | 1.000 | 0.665 | 0.000 | 0.371 |
ARVL_HMS | 1.000 | 0.528 | 0.665 | 1.000 | 0.589 | 0.000 |
NVGTN_DIST | 1.000 | 0.890 | 0.000 | 0.589 | 1.000 | 0.021 |
RN | 1.000 | 0.319 | 0.371 | 0.000 | 0.021 | 1.000 |
Variable | SHIP_CNT | ARVL_HMS | DPTR_HMS |
---|---|---|---|
SHIP_CNT | 1.000 | 0.213 | 0.000 |
ARVL_HMS | 0.213 | 1.000 | 0.258 |
DPTR_HMS | 0.000 | 0.258 | 1.000 |
Variable | NVGTN_DIST | RN | SHIP_CNT | DPTR_HMS | ARVL_HMS |
---|---|---|---|---|---|
NVGTN_DIST | 1.000 | 0.367 | 0.918 | 0.000 | 0.268 |
RN | 0.367 | 1.000 | 0.133 | 0.193 | 0.000 |
SHIP_CNT | 0.918 | 0.133 | 1.000 | 0.000 | 0.213 |
DPTR_HMS | 0.000 | 0.193 | 0.000 | 1.000 | 0.258 |
ARVL_HMS | 0.268 | 0.000 | 0.213 | 0.258 | 1.000 |
First rows
Index | SHIP_OWNER_NM | SHIP_CNT | DPTR_HMS | ARVL_HMS | PRFMC | FUEL_CNSMP_QTY | NVGTN_DIST | RN |
---|---|---|---|---|---|---|---|---|
0 | FUJIAN YUANYUAN SHIPPING | 1 | 01-Jan-2021 00:00:00 | 12-Oct-2021 06:00:00 | 0 | 0 | 39506.6 | 2 |
1 | ZHEJIANG CHAOSHENG SHIPPING | 1 | 01-Jan-2021 00:00:00 | 13-Oct-2021 18:00:00 | 0 | 0 | 40730.5 | 3 |
2 | DONGHAI SHIPPING | 1 | 02-Jan-2021 00:00:00 | 13-Oct-2021 18:00:00 | 0 | 0 | 42353.7 | 4 |
3 | ANRUN SHIPPING | 1 | 01-Jan-2021 00:00:00 | 12-Oct-2021 00:00:00 | 0 | 0 | 35110.6 | 5 |
4 | HAIHUA SHIPPING | 1 | 01-Jan-2021 00:00:00 | 13-Oct-2021 12:00:00 | 0 | 0 | 46297.0 | 6 |
5 | MAERSK SHIPPING HONG KONG | 1 | 02-Jan-2021 00:00:00 | 10-Jan-2021 18:00:00 | 0 | 0 | 1424.23 | 7 |
6 | JINGHAI SHIPPING | 1 | 01-Jan-2021 00:00:00 | 13-Oct-2021 12:00:00 | 0 | 0 | 3854.49 | 8 |
7 | SINO OCEAN SHIPPING | 1 | 02-Jan-2021 00:00:00 | 13-Oct-2021 18:00:00 | 0 | 0 | 72091.7 | 9 |
8 | BAKRI NAVIGATION | 3 | 01-Jan-2021 00:00:00 | 13-Oct-2021 18:00:00 | 0 | 0 | 321102.0 | 10 |
9 | WESTFAL LARSEN | 1 | 02-Jan-2021 00:00:00 | 13-Oct-2021 18:00:00 | 0 | 0 | 78263.0 | 11 |
Last rows
Index | SHIP_OWNER_NM | SHIP_CNT | DPTR_HMS | ARVL_HMS | PRFMC | FUEL_CNSMP_QTY | NVGTN_DIST | RN |
---|---|---|---|---|---|---|---|---|
39 | DAMICO TANKERS | 2 | 01-Jan-2021 00:00:00 | 10-Aug-2021 12:00:00 | 0 | 0 | 80889.3 | 41 |
40 | EUSU SHIPMANAGEMENT | 1 | 01-Jan-2021 00:00:00 | 13-Oct-2021 18:00:00 | 0 | 0 | 171599.0 | 42 |
41 | DAELIM | 2 | 01-Jan-2021 00:00:00 | 08-Jul-2021 18:00:00 | 0 | 0 | 83013.0 | 43 |
42 | HANJIN SHIPMANAGEMENT | 1 | 03-Jan-2021 00:00:00 | 12-Oct-2021 18:00:00 | 0 | 0 | 63886.2 | 44 |
43 | EITZEN CHEMICAL USA | 1 | 01-Jan-2021 00:00:00 | 11-Oct-2021 18:00:00 | 0 | 0 | 61666.9 | 45 |
44 | NORDIC TANKERS TRADING | 5 | 01-Jan-2021 00:00:00 | 10-Oct-2021 00:00:00 | 0 | 0 | 284863.0 | 46 |
45 | CS MARINE | 1 | 01-Jan-2021 00:00:00 | 12-Oct-2021 18:00:00 | 0 | 0 | 196415.0 | 47 |
46 | DAEHO SHIPPING | 1 | 01-Jan-2021 00:00:00 | 24-Sep-2021 00:00:00 | 0 | 0 | 59713.2 | 48 |
47 | GLOBAL MARINE SERVICES | 2 | 01-Jan-2021 00:00:00 | 13-Oct-2021 18:00:00 | 0 | 0 | 131455.0 | 49 |
48 | TOKYO LNG TANKER | 1 | 01-Jan-2021 00:00:00 | 13-Oct-2021 18:00:00 | 0 | 0 | 123946.0 | 50 |
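DPTR_HMS and ARVL_HMS are profiled as categorical strings, although their values look like timestamps. The sketch below parses them with the standard library and computes the elapsed time between the two columns. It assumes the columns hold departure and arrival times (inferred from the column names only) and that English month abbreviations are in effect for `%b`; `voyage_days` is a hypothetical helper.

```python
from datetime import datetime

# Format of values such as "01-Jan-2021 00:00:00".
TIMESTAMP_FORMAT = "%d-%b-%Y %H:%M:%S"

def voyage_days(departure, arrival):
    """Return the elapsed time between two timestamp strings, in days."""
    start = datetime.strptime(departure, TIMESTAMP_FORMAT)
    end = datetime.strptime(arrival, TIMESTAMP_FORMAT)
    return (end - start).total_seconds() / 86400

# Values from the first row of the sample table.
print(voyage_days("01-Jan-2021 00:00:00", "12-Oct-2021 06:00:00"))  # 284.25
```

Parsing the columns before profiling would let them be summarized as dates rather than as categories.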