Overview

Dataset statistics

Number of variables10
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows332
Duplicate rows (%)3.3%
Total size in memory888.7 KiB
Average record size in memory91.0 B

Variable types

Categorical3
Numeric2
DateTime5

Dataset

Description울산항만공사 선석운영지원시스템의 울산항 선석배정현황 데이터입니다. · 선석: 항내에서 선박을 접안시키는 장소 · 선석배정: 선박 접안장소 지정 · 접안방향: Port(좌현), Star(우현) · 시프팅: 선박 접안장소 이동 ※ 데이터기준일 : 2020년~2023년(최근 3개년)
URLhttps://www.data.go.kr/data/15121258/fileData.do

Alerts

Dataset has 332 (3.3%) duplicate rowsDuplicates
부두코드명 is highly overall correlated with 선석코드명 and 1 other fieldsHigh correlation
항구코드명 is highly overall correlated with 선석코드명 and 1 other fieldsHigh correlation
선석코드명 is highly overall correlated with 항구코드명 and 1 other fieldsHigh correlation
항구코드명 is highly imbalanced (55.9%)Imbalance
접안방향 is highly imbalanced (75.3%)Imbalance

Reproduction

Analysis started2023-12-12 19:37:50.389527
Analysis finished2023-12-12 19:37:51.604801
Duration1.22 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

항구코드명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
820
8121 
823
1878 
821
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row823
2nd row820
3rd row820
4th row820
5th row820

Common Values

ValueCountFrequency (%)
820 8121
81.2%
823 1878
 
18.8%
821 1
 
< 0.1%

Length

2023-12-13T04:37:51.670628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:37:51.847573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
820 8121
81.2%
823 1878
 
18.8%
821 1
 
< 0.1%

부두코드명
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
MB6
1699 
MBN
1167 
MB4
991 
MB3
885 
MB2
814 
Other values (16)
4444 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowMBF
2nd rowMBY
3rd rowMB9
4th rowMB9
5th rowMB3

Common Values

ValueCountFrequency (%)
MB6 1699
17.0%
MBN 1167
11.7%
MB4 991
9.9%
MB3 885
8.8%
MB2 814
8.1%
MBG 666
 
6.7%
MB8 530
 
5.3%
MDY 527
 
5.3%
MBF 423
 
4.2%
MB7 393
 
3.9%
Other values (11) 1905
19.1%

Length

2023-12-13T04:37:51.999058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
mb6 1699
17.0%
mbn 1167
11.7%
mb4 991
9.9%
mb3 885
8.8%
mb2 814
8.1%
mbg 666
 
6.7%
mb8 530
 
5.3%
mdy 527
 
5.3%
mbf 423
 
4.2%
mb7 393
 
3.9%
Other values (11) 1905
19.1%

선석코드명
Real number (ℝ)

HIGH CORRELATION 

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.3878
Minimum1
Maximum41
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T04:37:52.131532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q321
95-th percentile31
Maximum41
Range40
Interquartile range (IQR)20

Descriptive statistics

Standard deviation11.490518
Coefficient of variation (CV)1.1061551
Kurtosis-0.25555181
Mean10.3878
Median Absolute Deviation (MAD)2
Skewness1.0201826
Sum103878
Variance132.03201
MonotonicityNot monotonic
2023-12-13T04:37:52.275454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
1 2510
25.1%
2 2014
20.1%
26 1006
10.1%
11 941
 
9.4%
21 902
 
9.0%
3 597
 
6.0%
31 547
 
5.5%
41 301
 
3.0%
7 292
 
2.9%
5 277
 
2.8%
Other values (4) 613
 
6.1%
ValueCountFrequency (%)
1 2510
25.1%
2 2014
20.1%
3 597
 
6.0%
4 87
 
0.9%
5 277
 
2.8%
6 121
 
1.2%
7 292
 
2.9%
8 244
 
2.4%
11 941
 
9.4%
21 902
 
9.0%
ValueCountFrequency (%)
41 301
 
3.0%
31 547
5.5%
27 161
 
1.6%
26 1006
10.1%
21 902
9.0%
11 941
9.4%
8 244
 
2.4%
7 292
 
2.9%
6 121
 
1.2%
5 277
 
2.8%

접안방향
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Port
9232 
Star
 
767
<NA>
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowPort
2nd rowPort
3rd rowPort
4th rowStar
5th rowPort

Common Values

ValueCountFrequency (%)
Port 9232
92.3%
Star 767
 
7.7%
<NA> 1
 
< 0.1%

Length

2023-12-13T04:37:52.426879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:37:52.551223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
port 9232
92.3%
star 767
 
7.7%
na 1
 
< 0.1%

입항연도
Real number (ℝ)

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2021.6304
Minimum2000
Maximum2032
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T04:37:52.650620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2000
5-th percentile2020
Q12021
median2022
Q32022
95-th percentile2023
Maximum2032
Range32
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.020049
Coefficient of variation (CV)0.00050456748
Kurtosis67.433333
Mean2021.6304
Median Absolute Deviation (MAD)1
Skewness-3.2606803
Sum20216304
Variance1.0404999
MonotonicityNot monotonic
2023-12-13T04:37:52.777002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
2022 3459
34.6%
2021 3398
34.0%
2023 2016
20.2%
2020 1121
 
11.2%
2002 2
 
< 0.1%
2000 2
 
< 0.1%
2024 1
 
< 0.1%
2032 1
 
< 0.1%
ValueCountFrequency (%)
2000 2
 
< 0.1%
2002 2
 
< 0.1%
2020 1121
 
11.2%
2021 3398
34.0%
2022 3459
34.6%
2023 2016
20.2%
2024 1
 
< 0.1%
2032 1
 
< 0.1%
ValueCountFrequency (%)
2032 1
 
< 0.1%
2024 1
 
< 0.1%
2023 2016
20.2%
2022 3459
34.6%
2021 3398
34.0%
2020 1121
 
11.2%
2002 2
 
< 0.1%
2000 2
 
< 0.1%
Distinct1079
Distinct (%)10.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-05-26 00:00:00
Maximum2023-08-27 00:00:00
2023-12-13T04:37:53.014679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:37:53.197304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1082
Distinct (%)10.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-05-27 00:00:00
Maximum2023-09-08 00:00:00
2023-12-13T04:37:53.389672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:37:53.592379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct935
Distinct (%)9.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-05-26 00:00:00
Maximum2023-08-24 00:00:00
2023-12-13T04:37:53.770538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:37:53.966064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct875
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-05-26 00:00:00
Maximum2023-08-24 00:00:00
2023-12-13T04:37:54.131973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:37:54.286286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct935
Distinct (%)9.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-05-26 00:00:00
Maximum2023-08-24 00:00:00
2023-12-13T04:37:54.427225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:37:54.581389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-13T04:37:51.138981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:37:50.887814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:37:51.252993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:37:51.013787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:37:54.676813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
항구코드명부두코드명선석코드명접안방향입항연도
항구코드명1.0001.0000.6450.0110.023
부두코드명1.0001.0000.9540.5560.095
선석코드명0.6450.9541.0000.2170.061
접안방향0.0110.5560.2171.0000.105
입항연도0.0230.0950.0610.1051.000
2023-12-13T04:37:55.107314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
접안방향부두코드명항구코드명
접안방향1.0000.4910.018
부두코드명0.4911.0000.999
항구코드명0.0180.9991.000
2023-12-13T04:37:55.229424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
선석코드명입항연도항구코드명부두코드명접안방향
선석코드명1.0000.0200.5450.7370.221
입항연도0.0201.0000.0220.0530.070
항구코드명0.5450.0221.0000.9990.018
부두코드명0.7370.0530.9991.0000.491
접안방향0.2210.0700.0180.4911.000

Missing values

2023-12-13T04:37:51.380509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:37:51.528634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

항구코드명부두코드명선석코드명접안방향입항연도접안예정일출항예정일선석배정일등록일자수정일자
6667823MBF2Port20212021-12-082021-12-092021-12-072021-12-072021-12-07
6363820MBY1Port20212021-11-202021-11-202021-11-192021-11-192021-11-19
16454820MB91Port20232023-08-122023-08-122023-08-122023-08-112023-08-12
14531820MB91Star20232023-04-142023-04-172023-04-172023-04-132023-04-17
5734820MB32Port20212021-10-122021-10-122021-10-122021-10-122021-10-12
226820MBP1Port20202020-09-282020-10-292020-10-052020-09-252020-10-05
2553820MB62Port20212021-03-112021-03-112021-03-092021-03-092021-03-09
15574820MB41Port20232023-06-152023-06-162023-06-142023-06-142023-06-14
3634820U343Port20212021-05-182021-05-192021-05-182021-05-182021-05-18
1292820MB81Port20202020-12-172020-12-182020-12-172020-12-162020-12-17
항구코드명부두코드명선석코드명접안방향입항연도접안예정일출항예정일선석배정일등록일자수정일자
3913820MB65Port20212021-06-112021-06-112021-06-102021-06-102021-06-10
10514820MBP1Port20222022-08-012022-08-012022-07-292022-07-292022-07-29
12358820MB41Port20222022-12-072022-12-072022-12-072022-12-062022-12-07
12522823MBN26Port20222022-12-152022-12-162022-12-152022-12-152022-12-15
12591823MBN26Port20222022-12-192022-12-202022-12-192022-12-192022-12-19
8002823MBF1Port20222022-02-232022-02-242022-02-222022-02-222022-02-22
16164823MBN26Port20232023-07-212023-07-212023-07-212023-07-212023-07-21
8901820MB631Port20222022-04-162022-04-162022-04-152022-04-152022-04-15
7796820MB41Port20222022-02-112022-02-112022-02-112022-02-112022-02-11
15248823MBN27Port20232023-05-222023-05-222023-05-222023-05-222023-05-22

Duplicate rows

Most frequently occurring

항구코드명부두코드명선석코드명접안방향입항연도접안예정일출항예정일선석배정일등록일자수정일자# duplicates
326823MBN26Port20232023-08-242023-08-252023-08-242023-08-242023-08-246
258823MBN26Port20222022-12-132022-12-142022-12-132022-12-132022-12-134
308823MBN26Port20232023-06-082023-06-092023-06-082023-06-082023-06-084
313823MBN26Port20232023-07-052023-07-062023-07-052023-07-052023-07-054
62820MB62Star20222022-02-192022-02-192022-02-162022-02-162022-02-163
67820MB62Star20222022-06-232022-06-232022-06-222022-06-222022-06-223
70820MB62Star20222022-09-132022-09-132022-09-122022-09-122022-09-123
119820MBP1Port20212021-10-142021-10-152021-10-132021-10-132021-10-133
120820MDY2Port20202020-09-092020-09-102020-09-092020-09-092020-09-093
126820MDY2Port20212021-12-262021-12-272021-12-242021-12-242021-12-243