Overview

Dataset statistics

Number of variables7
Number of observations49
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.0 KiB
Average record size in memory62.7 B

Variable types

Numeric4
Text1
DateTime2

Dataset

DescriptionSample
Author올시데이터
URLhttps://www.bigdata-sea.kr/datasearch/base/view.do?prodId=PROD_001075

Alerts

RANK is highly overall correlated with SHIP_CNT and 2 other fieldsHigh correlation
SHIP_CNT is highly overall correlated with RANK and 2 other fieldsHigh correlation
FRGHT_CNVNC_QTY is highly overall correlated with RANK and 2 other fieldsHigh correlation
RN is highly overall correlated with RANK and 2 other fieldsHigh correlation
RANK has unique valuesUnique
DPTR_CN_NM has unique valuesUnique
SHIP_CNT has unique valuesUnique
FRGHT_CNVNC_QTY has unique valuesUnique
RN has unique valuesUnique

Reproduction

Analysis started2023-12-10 14:49:41.989961
Analysis finished2023-12-10 14:49:43.960958
Duration1.97 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

RANK
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26
Minimum2
Maximum50
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T23:49:44.147022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile4.4
Q114
median26
Q338
95-th percentile47.6
Maximum50
Range48
Interquartile range (IQR)24

Descriptive statistics

Standard deviation14.28869
Coefficient of variation (CV)0.54956501
Kurtosis-1.2
Mean26
Median Absolute Deviation (MAD)12
Skewness0
Sum1274
Variance204.16667
MonotonicityStrictly increasing
2023-12-10T23:49:44.295362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
2 1
 
2.0%
39 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
36 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
11 1
2.0%
ValueCountFrequency (%)
50 1
2.0%
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%

DPTR_CN_NM
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-10T23:49:44.534867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length13
Mean length8.2244898
Min length4

Characters and Unicode

Total characters403
Distinct characters48
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st rowBrazil
2nd rowIndonesia
3rd rowSouth Africa
4th rowMalaysia
5th rowChina
ValueCountFrequency (%)
united 3
 
4.8%
south 2
 
3.2%
new 2
 
3.2%
brazil 1
 
1.6%
bahamas 1
 
1.6%
france 1
 
1.6%
turkey 1
 
1.6%
colombia 1
 
1.6%
italy 1
 
1.6%
mauritius 1
 
1.6%
Other values (48) 48
77.4%
2023-12-10T23:49:44.991309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 57
 
14.1%
i 35
 
8.7%
e 35
 
8.7%
n 32
 
7.9%
r 22
 
5.5%
o 18
 
4.5%
t 17
 
4.2%
u 16
 
4.0%
13
 
3.2%
l 12
 
3.0%
Other values (38) 146
36.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 326
80.9%
Uppercase Letter 63
 
15.6%
Space Separator 13
 
3.2%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 57
17.5%
i 35
10.7%
e 35
10.7%
n 32
9.8%
r 22
 
6.7%
o 18
 
5.5%
t 17
 
5.2%
u 16
 
4.9%
l 12
 
3.7%
s 12
 
3.7%
Other values (14) 70
21.5%
Uppercase Letter
ValueCountFrequency (%)
S 8
12.7%
M 6
 
9.5%
P 5
 
7.9%
N 4
 
6.3%
C 4
 
6.3%
U 4
 
6.3%
T 4
 
6.3%
R 3
 
4.8%
G 3
 
4.8%
B 3
 
4.8%
Other values (12) 19
30.2%
Space Separator
ValueCountFrequency (%)
13
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 389
96.5%
Common 14
 
3.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 57
14.7%
i 35
 
9.0%
e 35
 
9.0%
n 32
 
8.2%
r 22
 
5.7%
o 18
 
4.6%
t 17
 
4.4%
u 16
 
4.1%
l 12
 
3.1%
s 12
 
3.1%
Other values (36) 133
34.2%
Common
ValueCountFrequency (%)
13
92.9%
- 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 403
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 57
 
14.1%
i 35
 
8.7%
e 35
 
8.7%
n 32
 
7.9%
r 22
 
5.5%
o 18
 
4.5%
t 17
 
4.2%
u 16
 
4.0%
13
 
3.2%
l 12
 
3.0%
Other values (38) 146
36.2%

SHIP_CNT
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2331.8776
Minimum95
Maximum9083
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T23:49:45.171011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum95
5-th percentile173.8
Q1576
median1977
Q33015
95-th percentile6463.6
Maximum9083
Range8988
Interquartile range (IQR)2439

Descriptive statistics

Standard deviation2057.1807
Coefficient of variation (CV)0.88219929
Kurtosis1.9850183
Mean2331.8776
Median Absolute Deviation (MAD)1295
Skewness1.3697038
Sum114262
Variance4231992.5
MonotonicityNot monotonic
2023-12-10T23:49:45.339818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
3035 1
 
2.0%
507 1
 
2.0%
4712 1
 
2.0%
3015 1
 
2.0%
571 1
 
2.0%
2618 1
 
2.0%
315 1
 
2.0%
1770 1
 
2.0%
312 1
 
2.0%
947 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
95 1
2.0%
109 1
2.0%
167 1
2.0%
184 1
2.0%
247 1
2.0%
252 1
2.0%
312 1
2.0%
315 1
2.0%
409 1
2.0%
440 1
2.0%
ValueCountFrequency (%)
9083 1
2.0%
7703 1
2.0%
6896 1
2.0%
5815 1
2.0%
5682 1
2.0%
4712 1
2.0%
4627 1
2.0%
3906 1
2.0%
3622 1
2.0%
3434 1
2.0%
Distinct42
Distinct (%)85.7%
Missing0
Missing (%)0.0%
Memory size524.0 B
Minimum2022-01-01 00:00:02
Maximum2022-01-01 00:59:26
2023-12-10T23:49:45.805048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:45.964534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
Distinct34
Distinct (%)69.4%
Missing0
Missing (%)0.0%
Memory size524.0 B
Minimum2022-07-17 21:41:47
Maximum2022-07-17 22:00:21
2023-12-10T23:49:46.128502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:46.285066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)

FRGHT_CNVNC_QTY
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.125178 × 1011
Minimum2.04917 × 1010
Maximum1.16206 × 1012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T23:49:46.439537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.04917 × 1010
5-th percentile2.205526 × 1010
Q13.66713 × 1010
median8.34907 × 1010
Q32.07442 × 1011
95-th percentile9.221236 × 1011
Maximum1.16206 × 1012
Range1.1415683 × 1012
Interquartile range (IQR)1.707707 × 1011

Descriptive statistics

Standard deviation2.9237704 × 1011
Coefficient of variation (CV)1.3757767
Kurtosis4.059142
Mean2.125178 × 1011
Median Absolute Deviation (MAD)5.90587 × 1010
Skewness2.1778732
Sum1.0413372 × 1013
Variance8.5484333 × 1022
MonotonicityStrictly decreasing
2023-12-10T23:49:46.604220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
1162060000000 1
 
2.0%
36353400000 1
 
2.0%
74500000000 1
 
2.0%
65476600000 1
 
2.0%
64406800000 1
 
2.0%
64353400000 1
 
2.0%
61430400000 1
 
2.0%
58667400000 1
 
2.0%
52236800000 1
 
2.0%
46307400000 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
20491700000 1
2.0%
20999500000 1
2.0%
21689300000 1
2.0%
22604200000 1
2.0%
23758500000 1
2.0%
24432000000 1
2.0%
26800600000 1
2.0%
27711200000 1
2.0%
31010600000 1
2.0%
34014100000 1
2.0%
ValueCountFrequency (%)
1162060000000 1
2.0%
1155960000000 1
2.0%
939892000000 1
2.0%
895471000000 1
2.0%
791298000000 1
2.0%
463598000000 1
2.0%
451866000000 1
2.0%
400696000000 1
2.0%
347457000000 1
2.0%
327960000000 1
2.0%

RN
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26
Minimum2
Maximum50
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T23:49:46.794593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile4.4
Q114
median26
Q338
95-th percentile47.6
Maximum50
Range48
Interquartile range (IQR)24

Descriptive statistics

Standard deviation14.28869
Coefficient of variation (CV)0.54956501
Kurtosis-1.2
Mean26
Median Absolute Deviation (MAD)12
Skewness0
Sum1274
Variance204.16667
MonotonicityStrictly increasing
2023-12-10T23:49:46.977946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
2 1
 
2.0%
39 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
36 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
11 1
2.0%
ValueCountFrequency (%)
50 1
2.0%
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%

Interactions

2023-12-10T23:49:43.346518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:42.294742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:42.651232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:42.979362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:43.439634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:42.383972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:42.748102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:43.062519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:43.520378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:42.470148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:42.817471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:43.147917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:43.623261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:42.566311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:42.899462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T23:49:43.254471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T23:49:47.093346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
RANKDPTR_CN_NMSHIP_CNTDPTR_HMSARVL_HMSFRGHT_CNVNC_QTYRN
RANK1.0001.0000.5510.5400.7650.7551.000
DPTR_CN_NM1.0001.0001.0001.0001.0001.0001.000
SHIP_CNT0.5511.0001.0000.0000.0000.8620.551
DPTR_HMS0.5401.0000.0001.0000.9230.0000.540
ARVL_HMS0.7651.0000.0000.9231.0000.0000.765
FRGHT_CNVNC_QTY0.7551.0000.8620.0000.0001.0000.755
RN1.0001.0000.5510.5400.7650.7551.000
2023-12-10T23:49:47.230160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
RANKSHIP_CNTFRGHT_CNVNC_QTYRN
RANK1.000-0.778-1.0001.000
SHIP_CNT-0.7781.0000.778-0.778
FRGHT_CNVNC_QTY-1.0000.7781.000-1.000
RN1.000-0.778-1.0001.000

Missing values

2023-12-10T23:49:43.768588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T23:49:43.903004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

RANKDPTR_CN_NMSHIP_CNTDPTR_HMSARVL_HMSFRGHT_CNVNC_QTYRN
02Brazil303501-Jan-2022 00:01:4517-Jul-2022 22:00:1811620600000002
13Indonesia908301-Jan-2022 00:00:2517-Jul-2022 22:00:1811559600000003
24South Africa271001-Jan-2022 00:00:0217-Jul-2022 22:00:199398920000004
35Malaysia689601-Jan-2022 00:01:4317-Jul-2022 21:59:598954710000005
46China770301-Jan-2022 00:04:5117-Jul-2022 22:00:187912980000006
57Singapore343401-Jan-2022 00:02:5517-Jul-2022 22:00:184635980000007
68Japan462701-Jan-2022 00:00:2117-Jul-2022 22:00:114518660000008
79United States568201-Jan-2022 00:00:3317-Jul-2022 22:00:154006960000009
810Taiwan390601-Jan-2022 00:01:1017-Jul-2022 21:59:4434745700000010
911Spain581501-Jan-2022 00:00:1517-Jul-2022 21:59:3932796000000011
RANKDPTR_CN_NMSHIP_CNTDPTR_HMSARVL_HMSFRGHT_CNVNC_QTYRN
3941New Zealand24701-Jan-2022 00:02:5817-Jul-2022 21:59:343401410000041
4042Gibraltar40901-Jan-2022 00:59:2617-Jul-2022 21:59:393101060000042
4143Norway103401-Jan-2022 00:02:4817-Jul-2022 21:55:452771120000043
4244Senegal25201-Jan-2022 00:02:0817-Jul-2022 21:58:222680060000044
4345Mexico75201-Jan-2022 00:03:5817-Jul-2022 21:58:222443200000045
4446Sierra Leone9501-Jan-2022 00:05:3517-Jul-2022 21:55:062375850000046
4547Netherlands197701-Jan-2022 00:00:2017-Jul-2022 21:58:332260420000047
4648Puerto Rico18401-Jan-2022 00:41:2617-Jul-2022 22:00:072168930000048
4749Reunion10901-Jan-2022 00:01:1617-Jul-2022 21:56:022099950000049
4850Guinea-Bissau16701-Jan-2022 00:02:5517-Jul-2022 21:52:502049170000050