Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 49 |
Missing cells | 1 |
Missing cells (%) | 0.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.6 KiB |
Average record size in memory | 53.7 B |
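The missing-cell percentage can be reproduced from the counts above. A minimal Python sketch, assuming the percentage is taken over every cell (observations × variables); the variable names are illustrative only:

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 6
n_observations = 49
missing_cells = 1

# Assumption: the denominator is the total number of cells in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(total_cells)            # 294
print(f"{missing_pct:.1f}%")  # 0.3%
```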
Variable types
Text | 1 |
---|---|
Numeric | 3 |
DateTime | 2 |
Dataset
Description | Sample |
---|---|
Author | 올시데이터 |
URL | https://www.bigdata-sea.kr/datasearch/base/view.do?prodId=PROD_000429 |
SHIP_CNT is highly overall correlated with NVGTN_DIST | High correlation |
NVGTN_DIST is highly overall correlated with SHIP_CNT | High correlation |
SHIP_KIND has 1 (2.0%) missing values | Missing |
DPTR_HMS has unique values | Unique |
NVGTN_DIST has unique values | Unique |
RN has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 14:50:31.986510 |
---|---|
Analysis finished | 2023-12-10 14:50:33.722664 |
Duration | 1.74 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
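The reported duration is the difference between the two timestamps above. A short sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2023-12-10 14:50:31.986510")
finished = datetime.fromisoformat("2023-12-10 14:50:33.722664")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")  # 1.74 seconds
```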
SHIP_KIND
Text
MISSING
 
Distinct | 46 |
---|---|
Distinct (%) | 95.8% |
Missing | 1 |
Missing (%) | 2.0% |
Memory size | 524.0 B |
Length
Max length | 46 |
---|---|
Median length | 30.5 |
Mean length | 20.916667 |
Min length | 3 |
Characters and Unicode
Total characters | 1004 |
---|---|
Distinct characters | 53 |
Distinct categories | 8 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 44 |
---|---|
Unique (%) | 91.7% |
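The Distinct (%) and Unique (%) figures for SHIP_KIND only add up if the denominator is the 48 non-missing values rather than all 49 observations. The sketch below checks that reading; it assumes "unique" means values that occur exactly once, which the report itself does not spell out.

```python
# Figures copied from the SHIP_KIND tables.
n_observations = 49
n_missing = 1
n_distinct = 46
n_unique = 44  # assumption: values that occur exactly once

non_missing = n_observations - n_missing  # 48


def pct(part, whole):
    """Format a ratio as a percentage with one decimal place."""
    return f"{100 * part / whole:.1f}%"


print(pct(n_distinct, non_missing))     # 95.8%
print(pct(n_unique, non_missing))       # 91.7%
print(pct(n_missing, n_observations))   # 2.0%
```

Dividing by 49 instead would give 93.9% for the distinct share, which does not match the report.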
Sample
1st row | Tanker(product oil) |
---|---|
2nd row | Tanker(chemical/oil product) |
3rd row | Tanker(oil/chemical) |
4th row | Tanker(chemical) |
5th row | Tanker |
Value | Count | Frequency (%) |
tanker | 27 | 17.3% |
inland | 16 | 10.3% |
cargo | 9 | 5.8% |
oil | 6 | 3.8% |
pushtow | 6 | 3.8% |
barges | 5 | 3.2% |
motor | 5 | 3.2% |
chemical | 4 | 2.6% |
liquid | 4 | 2.6% |
or | 3 | 1.9% |
Other values (54) | 71 | 45.5% |
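The word table above counts lower-cased tokens across all SHIP_KIND values. As a rough illustration, the five sample rows listed earlier can be tokenized with a regular expression. This is only a sketch: the tokenizer the profiler actually uses is not stated in the report and may split punctuation differently.

```python
import re
from collections import Counter

# The five sample rows shown for SHIP_KIND.
sample = [
    "Tanker(product oil)",
    "Tanker(chemical/oil product)",
    "Tanker(oil/chemical)",
    "Tanker(chemical)",
    "Tanker",
]

# Assumption: lower-case each value and treat every run of letters as a word.
words = Counter(
    word
    for value in sample
    for word in re.findall(r"[a-z]+", value.lower())
)

for word, count in words.most_common():
    print(word, count)
```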
Most occurring characters
Value | Count | Frequency (%) |
(space) | 108 | 10.8% |
a | 87 | 8.7% |
n | 77 | 7.7% |
e | 77 | 7.7% |
r | 76 | 7.6% |
o | 51 | 5.1% |
l | 46 | 4.6% |
i | 38 | 3.8% |
t | 37 | 3.7% |
T | 33 | 3.3% |
Other values (43) | 374 | 37.3% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 725 | 72.2% |
Uppercase Letter | 153 | 15.2% |
Space Separator | 108 | 10.8% |
Open Punctuation | 6 | 0.6% |
Close Punctuation | 6 | 0.6% |
Other Punctuation | 3 | 0.3% |
Dash Punctuation | 2 | 0.2% |
Decimal Number | 1 | 0.1% |
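The category names in this table are Unicode general categories. A minimal sketch of how such counts can be obtained with Python's `unicodedata` module follows; the `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative and not part of the profiler.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in the report.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
}


def category_counts(text):
    """Count the characters of `text` per Unicode general category."""
    codes = Counter(unicodedata.category(ch) for ch in text)
    return {CATEGORY_NAMES.get(code, code): n for code, n in codes.items()}


counts = category_counts("Tanker(chemical/oil product)")
print(counts)
```

For this one sample value the helper finds 23 lowercase letters and one character each of the uppercase, space, open, close and other punctuation categories.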
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 87 | 12.0% |
n | 77 | 10.6% |
e | 77 | 10.6% |
r | 76 | 10.5% |
o | 51 | 7.0% |
l | 46 | 6.3% |
i | 38 | 5.2% |
t | 37 | 5.1% |
k | 32 | 4.4% |
s | 32 | 4.4% |
Other values (16) | 172 | 23.7% |
Uppercase Letter
Value | Count | Frequency (%) |
T | 33 | 21.6% |
I | 20 | 13.1% |
P | 16 | 10.5% |
C | 13 | 8.5% |
M | 9 | 5.9% |
O | 8 | 5.2% |
A | 6 | 3.9% |
L | 5 | 3.3% |
U | 5 | 3.3% |
F | 5 | 3.3% |
Other values (11) | 33 | 21.6% |
Space Separator
Value | Count | Frequency (%) |
(space) | 108 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 6 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 6 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 3 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 | 100.0% |
Decimal Number
Value | Count | Frequency (%) |
2 | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 878 | 87.5% |
Common | 126 | 12.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 87 | 9.9% |
n | 77 | 8.8% |
e | 77 | 8.8% |
r | 76 | 8.7% |
o | 51 | 5.8% |
l | 46 | 5.2% |
i | 38 | 4.3% |
t | 37 | 4.2% |
T | 33 | 3.8% |
k | 32 | 3.6% |
Other values (37) | 324 | 36.9% |
Common
Value | Count | Frequency (%) |
(space) | 108 | 85.7% |
( | 6 | 4.8% |
) | 6 | 4.8% |
/ | 3 | 2.4% |
- | 2 | 1.6% |
2 | 1 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1004 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 108 | 10.8% |
a | 87 | 8.7% |
n | 77 | 7.7% |
e | 77 | 7.7% |
r | 76 | 7.6% |
o | 51 | 5.1% |
l | 46 | 4.6% |
i | 38 | 3.8% |
t | 37 | 3.7% |
T | 33 | 3.3% |
Other values (43) | 374 | 37.3% |
SHIP_CNT
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 40 |
---|---|
Distinct (%) | 81.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 531.38776 |
Minimum | 1 |
---|---|
Maximum | 4436 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 573.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 7 |
median | 59 |
Q3 | 369 |
95-th percentile | 3189.8 |
Maximum | 4436 |
Range | 4435 |
Interquartile range (IQR) | 362 |
Descriptive statistics
Standard deviation | 1078.4791 |
---|---|
Coefficient of variation (CV) | 2.029552 |
Kurtosis | 5.5759337 |
Mean | 531.38776 |
Median Absolute Deviation (MAD) | 56 |
Skewness | 2.5226367 |
Sum | 26038 |
Variance | 1163117.2 |
Monotonicity | Not monotonic |
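Several of the SHIP_CNT figures above are derived from one another, so they can be cross-checked. The sketch below recomputes the mean, variance, coefficient of variation, IQR and range from the other reported values, assuming the usual definitions (CV = standard deviation / mean, variance = standard deviation squared).

```python
import math

# Figures copied from the SHIP_CNT tables.
n = 49
total = 26038
mean = 531.38776
std = 1078.4791
variance = 1163117.2
cv = 2.029552
q1, q3 = 7, 369
minimum, maximum = 1, 4436

derived = {
    "mean": total / n,          # sum / number of observations
    "variance": std ** 2,       # square of the standard deviation
    "cv": std / mean,           # coefficient of variation
    "iqr": q3 - q1,             # interquartile range
    "range": maximum - minimum,
}

reported = {"mean": mean, "variance": variance, "cv": cv, "iqr": 362, "range": 4435}

for name, value in derived.items():
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.6g} reported={reported[name]:.6g} match={ok}")
```

A CV above 2 together with a median (59) far below the mean (about 531) is consistent with the strong right skew reported for this column.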
Value | Count | Frequency (%) |
1 | 4 | 8.2% |
3 | 4 | 8.2% |
20 | 2 | 4.1% |
7 | 2 | 4.1% |
110 | 2 | 4.1% |
3471 | 1 | 2.0% |
595 | 1 | 2.0% |
528 | 1 | 2.0% |
28 | 1 | 2.0% |
147 | 1 | 2.0% |
Other values (30) | 30 | 61.2% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 4 | 8.2% |
2 | 1 | 2.0% |
3 | 4 | 8.2% |
4 | 1 | 2.0% |
5 | 1 | 2.0% |
7 | 2 | 4.1% |
8 | 1 | 2.0% |
9 | 1 | 2.0% |
12 | 1 | 2.0% |
18 | 1 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
4436 | 1 | 2.0% |
3991 | 1 | 2.0% |
3471 | 1 | 2.0% |
2768 | 1 | 2.0% |
2749 | 1 | 2.0% |
1358 | 1 | 2.0% |
1325 | 1 | 2.0% |
1205 | 1 | 2.0% |
595 | 1 | 2.0% |
571 | 1 | 2.0% |
DPTR_HMS
Date
UNIQUE
 
Distinct | 49 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 524.0 B |
Minimum | 2021-01-01 00:00:01 |
---|---|
Maximum | 2023-01-03 01:11:39 |
ARVL_HMS
Date
Distinct | 39 |
---|---|
Distinct (%) | 79.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 524.0 B |
Minimum | 2021-01-01 05:25:04 |
---|---|
Maximum | 2023-05-31 23:59:15 |
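The time span covered by each date column follows directly from its minimum and maximum. A small sketch using `datetime`; the dictionary layout is illustrative only:

```python
from datetime import datetime

# Minimum and maximum copied from the DPTR_HMS and ARVL_HMS tables.
ranges = {
    "DPTR_HMS": ("2021-01-01 00:00:01", "2023-01-03 01:11:39"),
    "ARVL_HMS": ("2021-01-01 05:25:04", "2023-05-31 23:59:15"),
}

spans = {
    name: datetime.fromisoformat(latest) - datetime.fromisoformat(earliest)
    for name, (earliest, latest) in ranges.items()
}

for name, span in spans.items():
    print(name, span)
# DPTR_HMS 732 days, 1:11:38
# ARVL_HMS 880 days, 18:34:11
```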
NVGTN_DIST
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 49 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.3191287 × 10^10 |
Minimum | 2372370 |
---|---|
Maximum | 2.93964 × 10^11 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 573.0 B |
Quantile statistics
Minimum | 2372370 |
---|---|
5-th percentile | 55633780 |
Q1 | 1.83412 × 10^8 |
median | 1.25437 × 10^9 |
Q3 | 1.1618 × 10^10 |
95-th percentile | 1.02892 × 10^11 |
Maximum | 2.93964 × 10^11 |
Range | 2.9396163 × 10^11 |
Interquartile range (IQR) | 1.1434588 × 10^10 |
Descriptive statistics
Standard deviation | 5.5534424 × 10^10 |
---|---|
Coefficient of variation (CV) | 2.3946245 |
Kurtosis | 13.098556 |
Mean | 2.3191287 × 10^10 |
Median Absolute Deviation (MAD) | 1.177267 × 10^9 |
Skewness | 3.4352786 |
Sum | 1.136373 × 10^12 |
Variance | 3.0840722 × 10^21 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
102760000000 | 1 | 2.0% |
81064800000 | 1 | 2.0% |
1254370000 | 1 | 2.0% |
11791500000 | 1 | 2.0% |
574511000 | 1 | 2.0% |
86111200 | 1 | 2.0% |
2619400000 | 1 | 2.0% |
557365000 | 1 | 2.0% |
102980000000 | 1 | 2.0% |
2892830000 | 1 | 2.0% |
Other values (39) | 39 | 79.6% |
Minimum 10 values
Value | Count | Frequency (%) |
2372370 | 1 | 2.0% |
13496000 | 1 | 2.0% |
51309500 | 1 | 2.0% |
62120200 | 1 | 2.0% |
77103000 | 1 | 2.0% |
78135500 | 1 | 2.0% |
80181800 | 1 | 2.0% |
86111200 | 1 | 2.0% |
86620900 | 1 | 2.0% |
123551000 | 1 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
293964000000 | 1 | 2.0% |
203049000000 | 1 | 2.0% |
102980000000 | 1 | 2.0% |
102760000000 | 1 | 2.0% |
100792000000 | 1 | 2.0% |
81064800000 | 1 | 2.0% |
73894500000 | 1 | 2.0% |
32416900000 | 1 | 2.0% |
30803400000 | 1 | 2.0% |
24802900000 | 1 | 2.0% |
RN
Real number (ℝ)
UNIQUE
 
Distinct | 49 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26 |
Minimum | 2 |
---|---|
Maximum | 50 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 573.0 B |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 4.4 |
Q1 | 14 |
median | 26 |
Q3 | 38 |
95-th percentile | 47.6 |
Maximum | 50 |
Range | 48 |
Interquartile range (IQR) | 24 |
Descriptive statistics
Standard deviation | 14.28869 |
---|---|
Coefficient of variation (CV) | 0.54956501 |
Kurtosis | -1.2 |
Mean | 26 |
Median Absolute Deviation (MAD) | 12 |
Skewness | 0 |
Sum | 1274 |
Variance | 204.16667 |
Monotonicity | Strictly increasing |
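The RN statistics are what one would expect from a plain row counter. The sketch below assumes RN is the consecutive integers 2 to 50 (an inference from the reported minimum, maximum, distinct count and strictly increasing order, not something the report states) and recomputes the summary figures with the standard library.

```python
import statistics

# Assumption: RN is the run of consecutive integers 2..50 (49 values).
rn = list(range(2, 51))

print(len(rn), sum(rn))                                    # 49 1274
print(statistics.mean(rn))                                 # 26
print(round(statistics.variance(rn), 5))                   # 204.16667
print(round(statistics.stdev(rn), 5))                      # 14.28869

# Quartiles with linear interpolation over the full data range.
quartiles = statistics.quantiles(rn, n=4, method="inclusive")
print(quartiles)                                           # [14.0, 26.0, 38.0]

# 5th and 95th percentiles are the first and last of 19 cut points.
ventiles = statistics.quantiles(rn, n=20, method="inclusive")
print(ventiles[0], ventiles[-1])                           # 4.4 47.6
```

All of these agree with the tables above, which supports reading RN as an index column rather than a measured variable.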
Value | Count | Frequency (%) |
2 | 1 | 2.0% |
39 | 1 | 2.0% |
29 | 1 | 2.0% |
30 | 1 | 2.0% |
31 | 1 | 2.0% |
32 | 1 | 2.0% |
33 | 1 | 2.0% |
34 | 1 | 2.0% |
35 | 1 | 2.0% |
36 | 1 | 2.0% |
Other values (39) | 39 | 79.6% |
Minimum 10 values
Value | Count | Frequency (%) |
2 | 1 | 2.0% |
3 | 1 | 2.0% |
4 | 1 | 2.0% |
5 | 1 | 2.0% |
6 | 1 | 2.0% |
7 | 1 | 2.0% |
8 | 1 | 2.0% |
9 | 1 | 2.0% |
10 | 1 | 2.0% |
11 | 1 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
50 | 1 | 2.0% |
49 | 1 | 2.0% |
48 | 1 | 2.0% |
47 | 1 | 2.0% |
46 | 1 | 2.0% |
45 | 1 | 2.0% |
44 | 1 | 2.0% |
43 | 1 | 2.0% |
42 | 1 | 2.0% |
41 | 1 | 2.0% |
Variable | SHIP_KIND | SHIP_CNT | DPTR_HMS | ARVL_HMS | NVGTN_DIST | RN |
---|---|---|---|---|---|---|
SHIP_KIND | 1.000 | 0.000 | 1.000 | 0.900 | 0.735 | 0.851 |
SHIP_CNT | 0.000 | 1.000 | 1.000 | 0.881 | 0.895 | 0.000 |
DPTR_HMS | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
ARVL_HMS | 0.900 | 0.881 | 1.000 | 1.000 | 0.876 | 0.000 |
NVGTN_DIST | 0.735 | 0.895 | 1.000 | 0.876 | 1.000 | 0.000 |
RN | 0.851 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000 |
Variable | SHIP_CNT | NVGTN_DIST | RN |
---|---|---|---|
SHIP_CNT | 1.000 | 0.982 | 0.046 |
NVGTN_DIST | 0.982 | 1.000 | 0.087 |
RN | 0.046 | 0.087 | 1.000 |
Index | SHIP_KIND | SHIP_CNT | DPTR_HMS | ARVL_HMS | NVGTN_DIST | RN |
---|---|---|---|---|---|---|
0 | Tanker(product oil) | 3471 | 01-Jan-2023 00:23:21 | 31-May-2023 23:51:22 | 102760000000 | 2 |
1 | Tanker(chemical/oil product) | 436 | 01-Jan-2023 17:16:56 | 31-May-2023 23:50:02 | 16609600000 | 3 |
2 | Tanker(oil/chemical) | 1205 | 01-Jan-2023 00:02:58 | 24-Jan-2023 16:33:04 | 32416900000 | 4 |
3 | Tanker(chemical) | 334 | 01-Jan-2023 00:00:29 | 31-May-2023 23:59:15 | 11618000000 | 5 |
4 | Tanker | 39 | 01-Jan-2023 00:05:07 | 31-May-2023 10:06:23 | 946523000 | 6 |
5 | Tanker - Hazard A (Major) | 1 | 01-Jan-2023 00:10:04 | 17-May-2023 23:55:58 | 13496000 | 7 |
6 | Asphalt/Bitumen Tanker | 1 | 03-Jan-2023 01:11:39 | 31-May-2023 23:02:10 | 77103000 | 8 |
7 | Edible Oil Tanker | 3 | 01-Jan-2023 00:09:20 | 31-May-2023 23:55:15 | 51309500 | 9 |
8 | CO2 Tanker | 3 | 01-Jan-2023 00:03:59 | 31-May-2023 23:53:28 | 123551000 | 10 |
9 | FRUIT JUICE Tanker | 5 | 01-Jan-2023 00:10:49 | 31-May-2023 22:38:01 | 418478000 | 11 |
Index | SHIP_KIND | SHIP_CNT | DPTR_HMS | ARVL_HMS | NVGTN_DIST | RN |
---|---|---|---|---|---|---|
39 | Floating Storage or Production | 110 | 01-Apr-2021 15:11:44 | 13-Oct-2021 02:25:00 | 1408310000 | 41 |
40 | Oil Products Tanker | 2749 | 01-Jan-2021 00:04:47 | 13-Oct-2021 23:58:02 | 100792000000 | 42 |
41 | Bunkering Tanker | 333 | 03-Jan-2021 14:37:28 | 13-Oct-2021 23:59:04 | 5554170000 | 43 |
42 | Inland Pushtow six cargo barges | 1 | 01-Jan-2021 00:01:11 | 16-Sep-2021 20:45:04 | 86620900 | 44 |
43 | Inland Pushtow two barges at least one tanker | 7 | 03-Jan-2021 19:31:59 | 16-Aug-2021 20:50:03 | 137903000 | 45 |
44 | Oil or Chemical Tanker | 4436 | 02-Jan-2021 02:58:14 | 13-Oct-2021 23:55:00 | 293964000000 | 46 |
45 | Shuttle Tanker | 65 | 03-Jun-2021 10:46:40 | 13-Oct-2021 23:42:00 | 3262600000 | 47 |
46 | Chemical Tanker | 571 | 06-Jan-2021 13:00:04 | 13-Oct-2021 23:57:03 | 30803400000 | 48 |
47 | CHEMICAL TANKER | 27 | 01-Jan-2021 00:01:47 | 28-Apr-2021 21:40:22 | 652442000 | 49 |
48 | LPG or Chemical Tanker | 7 | 01-Jan-2021 00:02:19 | 13-Oct-2021 23:58:05 | 266748000 | 50 |
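The high-correlation alert between SHIP_CNT and NVGTN_DIST can be sanity-checked on the 20 rows shown in the two sample tables above. The report does not say which coefficient produced the 0.982 in the numeric matrix, so the sketch below uses Spearman rank correlation as an assumption; `ranks`, `pearson` and `spearman` are illustrative helpers written for this example.

```python
def ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sum((a - mean_x) ** 2 for a in x) ** 0.5
    spread_y = sum((b - mean_y) ** 2 for b in y) ** 0.5
    return cov / (spread_x * spread_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# SHIP_CNT and NVGTN_DIST from the 10 first and 10 last rows shown above.
ship_cnt = [3471, 436, 1205, 334, 39, 1, 1, 3, 3, 5,
            110, 2749, 333, 1, 7, 4436, 65, 571, 27, 7]
nvgtn_dist = [102760000000, 16609600000, 32416900000, 11618000000, 946523000,
              13496000, 77103000, 51309500, 123551000, 418478000,
              1408310000, 100792000000, 5554170000, 86620900, 137903000,
              293964000000, 3262600000, 30803400000, 652442000, 266748000]

rho = spearman(ship_cnt, nvgtn_dist)
print(round(rho, 3))
```

On this subset the rank correlation comes out at roughly 0.98, in line with the alert. The figure in the report is computed over all 49 rows, so an exact match is not expected.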