Overview

Dataset statistics

Number of variables8
Number of observations1958
Missing cells3497
Missing cells (%)22.3%
Duplicate rows11
Duplicate rows (%)0.6%
Total size in memory122.5 KiB
Average record size in memory64.1 B

Variable types

Text6
DateTime2

Dataset

Description해양안전종합정보시스템에 등록되어있는 선사들의 정보(선사명, 우편번호, 선사주소, 선사사업자번호, 등록일, 수정일) 를 제공합니다.
URLhttps://www.data.go.kr/data/15116474/fileData.do

Alerts

Dataset has 11 (0.6%) duplicate rowsDuplicates
선사명_영문 has 1611 (82.3%) missing valuesMissing
선사_우편번호 has 490 (25.0%) missing valuesMissing
선사_주소 has 219 (11.2%) missing valuesMissing
선사_상세주소 has 230 (11.7%) missing valuesMissing
사업자번호 has 610 (31.2%) missing valuesMissing
등록일 has 113 (5.8%) missing valuesMissing
수정일 has 224 (11.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 12:37:45.673201
Analysis finished2023-12-12 12:37:46.751720
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1725
Distinct (%)88.1%
Missing0
Missing (%)0.0%
Memory size15.4 KiB
2023-12-12T21:37:46.942101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length25
Mean length7.4540347
Min length2

Characters and Unicode

Total characters14595
Distinct characters444
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1533 ?
Unique (%)78.3%

Sample

1st row아산상선(주)
2nd row대경해운(주)
3rd row한국선무
4th row(주)에스제이탱커
5th row(주)신한
ValueCountFrequency (%)
주식회사 75
 
3.5%
8
 
0.4%
유한회사 7
 
0.3%
진도운수(주 6
 
0.3%
에스엔글로벌 5
 
0.2%
주)대아고속해운 4
 
0.2%
주)동방 4
 
0.2%
주)서경 4
 
0.2%
평화해운(주 4
 
0.2%
대림예선사 4
 
0.2%
Other values (1755) 1995
94.3%
2023-12-12T21:37:47.359846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1278
 
8.8%
( 1153
 
7.9%
) 1153
 
7.9%
860
 
5.9%
661
 
4.5%
529
 
3.6%
302
 
2.1%
262
 
1.8%
217
 
1.5%
217
 
1.5%
Other values (434) 7963
54.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11108
76.1%
Open Punctuation 1153
 
7.9%
Close Punctuation 1153
 
7.9%
Space Separator 860
 
5.9%
Uppercase Letter 125
 
0.9%
Other Symbol 80
 
0.5%
Decimal Number 80
 
0.5%
Lowercase Letter 26
 
0.2%
Other Punctuation 7
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1278
 
11.5%
661
 
6.0%
529
 
4.8%
302
 
2.7%
262
 
2.4%
217
 
2.0%
217
 
2.0%
204
 
1.8%
197
 
1.8%
186
 
1.7%
Other values (382) 7055
63.5%
Uppercase Letter
ValueCountFrequency (%)
S 22
17.6%
M 12
9.6%
T 12
9.6%
K 12
9.6%
L 8
 
6.4%
C 7
 
5.6%
G 6
 
4.8%
D 6
 
4.8%
A 6
 
4.8%
I 5
 
4.0%
Other values (12) 29
23.2%
Lowercase Letter
ValueCountFrequency (%)
e 4
15.4%
l 4
15.4%
s 3
11.5%
i 3
11.5%
h 2
7.7%
m 2
7.7%
v 2
7.7%
n 2
7.7%
c 1
 
3.8%
r 1
 
3.8%
Other values (2) 2
7.7%
Decimal Number
ValueCountFrequency (%)
1 21
26.2%
2 15
18.8%
0 11
13.8%
3 9
11.2%
8 7
 
8.8%
5 6
 
7.5%
6 5
 
6.2%
7 4
 
5.0%
4 2
 
2.5%
Other Punctuation
ValueCountFrequency (%)
. 3
42.9%
, 3
42.9%
& 1
 
14.3%
Open Punctuation
ValueCountFrequency (%)
( 1153
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1153
100.0%
Space Separator
ValueCountFrequency (%)
860
100.0%
Other Symbol
ValueCountFrequency (%)
80
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11188
76.7%
Common 3256
 
22.3%
Latin 151
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1278
 
11.4%
661
 
5.9%
529
 
4.7%
302
 
2.7%
262
 
2.3%
217
 
1.9%
217
 
1.9%
204
 
1.8%
197
 
1.8%
186
 
1.7%
Other values (383) 7135
63.8%
Latin
ValueCountFrequency (%)
S 22
 
14.6%
M 12
 
7.9%
T 12
 
7.9%
K 12
 
7.9%
L 8
 
5.3%
C 7
 
4.6%
G 6
 
4.0%
D 6
 
4.0%
A 6
 
4.0%
I 5
 
3.3%
Other values (24) 55
36.4%
Common
ValueCountFrequency (%)
( 1153
35.4%
) 1153
35.4%
860
26.4%
1 21
 
0.6%
2 15
 
0.5%
0 11
 
0.3%
3 9
 
0.3%
8 7
 
0.2%
5 6
 
0.2%
6 5
 
0.2%
Other values (7) 16
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11108
76.1%
ASCII 3407
 
23.3%
None 80
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1278
 
11.5%
661
 
6.0%
529
 
4.8%
302
 
2.7%
262
 
2.4%
217
 
2.0%
217
 
2.0%
204
 
1.8%
197
 
1.8%
186
 
1.7%
Other values (382) 7055
63.5%
ASCII
ValueCountFrequency (%)
( 1153
33.8%
) 1153
33.8%
860
25.2%
S 22
 
0.6%
1 21
 
0.6%
2 15
 
0.4%
M 12
 
0.4%
T 12
 
0.4%
K 12
 
0.4%
0 11
 
0.3%
Other values (41) 136
 
4.0%
None
ValueCountFrequency (%)
80
100.0%

선사명_영문
Text

MISSING 

Distinct315
Distinct (%)90.8%
Missing1611
Missing (%)82.3%
Memory size15.4 KiB
2023-12-12T21:37:47.663920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length24
Mean length12.766571
Min length2

Characters and Unicode

Total characters4430
Distinct characters80
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique287 ?
Unique (%)82.7%

Sample

1st rowDAE SUNG FISHERIES
2nd rowcity korea
3rd row(유)동아모래
4th rowTT
5th rowSILLA CO., LTD.
ValueCountFrequency (%)
shipping 39
 
6.0%
co 29
 
4.5%
ltd 25
 
3.9%
korea 16
 
2.5%
marine 12
 
1.9%
co.,ltd 11
 
1.7%
maritime 5
 
0.8%
line 5
 
0.8%
moonsoo 4
 
0.6%
ship 4
 
0.6%
Other values (388) 496
76.8%
2023-12-12T21:37:48.097032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
300
 
6.8%
n 271
 
6.1%
o 220
 
5.0%
a 184
 
4.2%
N 173
 
3.9%
I 164
 
3.7%
g 154
 
3.5%
i 154
 
3.5%
S 149
 
3.4%
e 148
 
3.3%
Other values (70) 2513
56.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2094
47.3%
Uppercase Letter 1804
40.7%
Space Separator 300
 
6.8%
Other Punctuation 105
 
2.4%
Decimal Number 101
 
2.3%
Other Letter 13
 
0.3%
Dash Punctuation 5
 
0.1%
Open Punctuation 4
 
0.1%
Close Punctuation 4
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 271
12.9%
o 220
10.5%
a 184
 
8.8%
g 154
 
7.4%
i 154
 
7.4%
e 148
 
7.1%
s 127
 
6.1%
h 123
 
5.9%
u 106
 
5.1%
p 88
 
4.2%
Other values (14) 519
24.8%
Uppercase Letter
ValueCountFrequency (%)
N 173
 
9.6%
I 164
 
9.1%
S 149
 
8.3%
A 147
 
8.1%
E 131
 
7.3%
O 118
 
6.5%
G 91
 
5.0%
L 91
 
5.0%
H 86
 
4.8%
T 85
 
4.7%
Other values (13) 569
31.5%
Other Letter
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
Decimal Number
ValueCountFrequency (%)
1 21
20.8%
2 17
16.8%
0 16
15.8%
7 10
9.9%
3 10
9.9%
5 9
8.9%
6 8
 
7.9%
8 7
 
6.9%
9 3
 
3.0%
Other Punctuation
ValueCountFrequency (%)
. 61
58.1%
, 33
31.4%
& 5
 
4.8%
" 2
 
1.9%
; 2
 
1.9%
# 1
 
1.0%
@ 1
 
1.0%
Space Separator
ValueCountFrequency (%)
300
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3898
88.0%
Common 519
 
11.7%
Hangul 13
 
0.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 271
 
7.0%
o 220
 
5.6%
a 184
 
4.7%
N 173
 
4.4%
I 164
 
4.2%
g 154
 
4.0%
i 154
 
4.0%
S 149
 
3.8%
e 148
 
3.8%
A 147
 
3.8%
Other values (37) 2134
54.7%
Common
ValueCountFrequency (%)
300
57.8%
. 61
 
11.8%
, 33
 
6.4%
1 21
 
4.0%
2 17
 
3.3%
0 16
 
3.1%
7 10
 
1.9%
3 10
 
1.9%
5 9
 
1.7%
6 8
 
1.5%
Other values (10) 34
 
6.6%
Hangul
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4417
99.7%
Hangul 13
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
300
 
6.8%
n 271
 
6.1%
o 220
 
5.0%
a 184
 
4.2%
N 173
 
3.9%
I 164
 
3.7%
g 154
 
3.5%
i 154
 
3.5%
S 149
 
3.4%
e 148
 
3.4%
Other values (57) 2500
56.6%
Hangul
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%

선사_우편번호
Text

MISSING 

Distinct683
Distinct (%)46.5%
Missing490
Missing (%)25.0%
Memory size15.4 KiB
2023-12-12T21:37:48.581883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.9788828
Min length2

Characters and Unicode

Total characters7309
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique434 ?
Unique (%)29.6%

Sample

1st row27733
2nd row600102
3rd row6738
4th row602053
5th row600022
ValueCountFrequency (%)
602020 30
 
2.0%
602011 24
 
1.6%
27632 19
 
1.3%
601010 17
 
1.2%
600045 17
 
1.2%
600014 17
 
1.2%
48955 16
 
1.1%
22787 16
 
1.1%
48822 16
 
1.1%
44780 15
 
1.0%
Other values (673) 1281
87.3%
2023-12-12T21:37:49.199213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1273
17.4%
2 887
12.1%
4 787
10.8%
1 771
10.5%
6 767
10.5%
3 667
9.1%
8 579
7.9%
7 577
7.9%
5 551
7.5%
9 447
 
6.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7306
> 99.9%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1273
17.4%
2 887
12.1%
4 787
10.8%
1 771
10.6%
6 767
10.5%
3 667
9.1%
8 579
7.9%
7 577
7.9%
5 551
7.5%
9 447
 
6.1%
Uppercase Letter
ValueCountFrequency (%)
T 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7306
> 99.9%
Latin 3
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1273
17.4%
2 887
12.1%
4 787
10.8%
1 771
10.6%
6 767
10.5%
3 667
9.1%
8 579
7.9%
7 577
7.9%
5 551
7.5%
9 447
 
6.1%
Latin
ValueCountFrequency (%)
T 3
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7309
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1273
17.4%
2 887
12.1%
4 787
10.8%
1 771
10.5%
6 767
10.5%
3 667
9.1%
8 579
7.9%
7 577
7.9%
5 551
7.5%
9 447
 
6.1%

선사_주소
Text

MISSING 

Distinct1222
Distinct (%)70.3%
Missing219
Missing (%)11.2%
Memory size15.4 KiB
2023-12-12T21:37:49.588174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length33
Mean length18.278896
Min length2

Characters and Unicode

Total characters31787
Distinct characters376
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique991 ?
Unique (%)57.0%

Sample

1st row부산시 동구 초량1동 1205-1 교직원공제회관빌딩 802호
2nd row부산시 중구 중앙동 2가 50-4 효천빌딩 4층
3rd row부산시 중구 중앙동 4가 84-5 해운빌딩 4층
4th row부산광역시 중구 중앙동4가 78-21 보승빌딩 503호
5th row전남 완도군 완도읍 군내리 1288번지
ValueCountFrequency (%)
중구 473
 
6.5%
부산광역시 356
 
4.9%
부산 320
 
4.4%
서울특별시 238
 
3.3%
동구 182
 
2.5%
전라남도 126
 
1.7%
부산시 115
 
1.6%
영도구 110
 
1.5%
서구 108
 
1.5%
종로구 76
 
1.0%
Other values (1850) 5147
71.0%
2023-12-12T21:37:50.207843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5555
 
17.5%
1 1407
 
4.4%
1324
 
4.2%
1277
 
4.0%
1270
 
4.0%
986
 
3.1%
870
 
2.7%
726
 
2.3%
706
 
2.2%
2 695
 
2.2%
Other values (366) 16971
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20368
64.1%
Space Separator 5555
 
17.5%
Decimal Number 5283
 
16.6%
Dash Punctuation 488
 
1.5%
Math Symbol 36
 
0.1%
Uppercase Letter 23
 
0.1%
Open Punctuation 11
 
< 0.1%
Close Punctuation 11
 
< 0.1%
Other Punctuation 11
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1324
 
6.5%
1277
 
6.3%
1270
 
6.2%
986
 
4.8%
870
 
4.3%
726
 
3.6%
706
 
3.5%
563
 
2.8%
552
 
2.7%
541
 
2.7%
Other values (334) 11553
56.7%
Uppercase Letter
ValueCountFrequency (%)
T 5
21.7%
A 3
13.0%
B 3
13.0%
K 2
 
8.7%
D 2
 
8.7%
S 1
 
4.3%
G 1
 
4.3%
L 1
 
4.3%
X 1
 
4.3%
C 1
 
4.3%
Other values (3) 3
13.0%
Decimal Number
ValueCountFrequency (%)
1 1407
26.6%
2 695
13.2%
3 485
 
9.2%
5 474
 
9.0%
6 436
 
8.3%
4 413
 
7.8%
0 382
 
7.2%
7 348
 
6.6%
8 339
 
6.4%
9 304
 
5.8%
Other Punctuation
ValueCountFrequency (%)
, 6
54.5%
/ 3
27.3%
. 2
 
18.2%
Space Separator
ValueCountFrequency (%)
5555
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 488
100.0%
Math Symbol
ValueCountFrequency (%)
~ 36
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20368
64.1%
Common 11396
35.9%
Latin 23
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1324
 
6.5%
1277
 
6.3%
1270
 
6.2%
986
 
4.8%
870
 
4.3%
726
 
3.6%
706
 
3.5%
563
 
2.8%
552
 
2.7%
541
 
2.7%
Other values (334) 11553
56.7%
Common
ValueCountFrequency (%)
5555
48.7%
1 1407
 
12.3%
2 695
 
6.1%
- 488
 
4.3%
3 485
 
4.3%
5 474
 
4.2%
6 436
 
3.8%
4 413
 
3.6%
0 382
 
3.4%
7 348
 
3.1%
Other values (9) 713
 
6.3%
Latin
ValueCountFrequency (%)
T 5
21.7%
A 3
13.0%
B 3
13.0%
K 2
 
8.7%
D 2
 
8.7%
S 1
 
4.3%
G 1
 
4.3%
L 1
 
4.3%
X 1
 
4.3%
C 1
 
4.3%
Other values (3) 3
13.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20368
64.1%
ASCII 11419
35.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5555
48.6%
1 1407
 
12.3%
2 695
 
6.1%
- 488
 
4.3%
3 485
 
4.2%
5 474
 
4.2%
6 436
 
3.8%
4 413
 
3.6%
0 382
 
3.3%
7 348
 
3.0%
Other values (22) 736
 
6.4%
Hangul
ValueCountFrequency (%)
1324
 
6.5%
1277
 
6.3%
1270
 
6.2%
986
 
4.8%
870
 
4.3%
726
 
3.6%
706
 
3.5%
563
 
2.8%
552
 
2.7%
541
 
2.7%
Other values (334) 11553
56.7%

선사_상세주소
Text

MISSING 

Distinct1586
Distinct (%)91.8%
Missing230
Missing (%)11.7%
Memory size15.4 KiB
2023-12-12T21:37:50.594842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length36
Mean length15.501736
Min length1

Characters and Unicode

Total characters26787
Distinct characters451
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1485 ?
Unique (%)85.9%

Sample

1st row부산광역시 중구 중앙동4가 78-21 보승빌딩 503호
2nd row37-3번지 센트럴 오피스텔 6층
3rd row3층(양재동)
4th row5-1번지
5th row4번지 동양빌딩 803호
ValueCountFrequency (%)
중구 166
 
3.4%
서울특별시 158
 
3.3%
부산광역시 65
 
1.4%
3층 53
 
1.1%
종로구 52
 
1.1%
2층 42
 
0.9%
부산시 39
 
0.8%
동구 38
 
0.8%
전라남도 34
 
0.7%
강남구 30
 
0.6%
Other values (2453) 4137
85.9%
2023-12-12T21:37:51.216068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3119
 
11.6%
1 1595
 
6.0%
1205
 
4.5%
0 942
 
3.5%
2 842
 
3.1%
) 693
 
2.6%
( 693
 
2.6%
- 669
 
2.5%
3 667
 
2.5%
610
 
2.3%
Other values (441) 15752
58.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14553
54.3%
Decimal Number 6668
24.9%
Space Separator 3119
 
11.6%
Close Punctuation 693
 
2.6%
Open Punctuation 693
 
2.6%
Dash Punctuation 669
 
2.5%
Uppercase Letter 191
 
0.7%
Other Punctuation 142
 
0.5%
Lowercase Letter 57
 
0.2%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1205
 
8.3%
610
 
4.2%
604
 
4.2%
580
 
4.0%
501
 
3.4%
406
 
2.8%
396
 
2.7%
372
 
2.6%
367
 
2.5%
344
 
2.4%
Other values (379) 9168
63.0%
Uppercase Letter
ValueCountFrequency (%)
T 22
11.5%
K 19
9.9%
A 18
 
9.4%
B 15
 
7.9%
D 15
 
7.9%
S 14
 
7.3%
G 14
 
7.3%
C 11
 
5.8%
F 8
 
4.2%
E 8
 
4.2%
Other values (13) 47
24.6%
Lowercase Letter
ValueCountFrequency (%)
e 11
19.3%
a 5
8.8%
k 5
8.8%
o 4
 
7.0%
i 4
 
7.0%
r 4
 
7.0%
s 4
 
7.0%
y 4
 
7.0%
t 3
 
5.3%
b 3
 
5.3%
Other values (8) 10
17.5%
Decimal Number
ValueCountFrequency (%)
1 1595
23.9%
0 942
14.1%
2 842
12.6%
3 667
10.0%
4 569
 
8.5%
5 559
 
8.4%
6 487
 
7.3%
7 391
 
5.9%
8 330
 
4.9%
9 286
 
4.3%
Other Punctuation
ValueCountFrequency (%)
, 93
65.5%
: 30
 
21.1%
. 11
 
7.7%
/ 7
 
4.9%
@ 1
 
0.7%
Space Separator
ValueCountFrequency (%)
3119
100.0%
Close Punctuation
ValueCountFrequency (%)
) 693
100.0%
Open Punctuation
ValueCountFrequency (%)
( 693
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 669
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14554
54.3%
Common 11985
44.7%
Latin 248
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1205
 
8.3%
610
 
4.2%
604
 
4.2%
580
 
4.0%
501
 
3.4%
406
 
2.8%
396
 
2.7%
372
 
2.6%
367
 
2.5%
344
 
2.4%
Other values (380) 9169
63.0%
Latin
ValueCountFrequency (%)
T 22
 
8.9%
K 19
 
7.7%
A 18
 
7.3%
B 15
 
6.0%
D 15
 
6.0%
S 14
 
5.6%
G 14
 
5.6%
e 11
 
4.4%
C 11
 
4.4%
F 8
 
3.2%
Other values (31) 101
40.7%
Common
ValueCountFrequency (%)
3119
26.0%
1 1595
13.3%
0 942
 
7.9%
2 842
 
7.0%
) 693
 
5.8%
( 693
 
5.8%
- 669
 
5.6%
3 667
 
5.6%
4 569
 
4.7%
5 559
 
4.7%
Other values (10) 1637
13.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14553
54.3%
ASCII 12233
45.7%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3119
25.5%
1 1595
13.0%
0 942
 
7.7%
2 842
 
6.9%
) 693
 
5.7%
( 693
 
5.7%
- 669
 
5.5%
3 667
 
5.5%
4 569
 
4.7%
5 559
 
4.6%
Other values (51) 1885
15.4%
Hangul
ValueCountFrequency (%)
1205
 
8.3%
610
 
4.2%
604
 
4.2%
580
 
4.0%
501
 
3.4%
406
 
2.8%
396
 
2.7%
372
 
2.6%
367
 
2.5%
344
 
2.4%
Other values (379) 9168
63.0%
None
ValueCountFrequency (%)
1
100.0%

사업자번호
Text

MISSING 

Distinct1151
Distinct (%)85.4%
Missing610
Missing (%)31.2%
Memory size15.4 KiB
2023-12-12T21:37:51.519974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.8894659
Min length1

Characters and Unicode

Total characters13331
Distinct characters16
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1005 ?
Unique (%)74.6%

Sample

1st row1048155903
2nd row6018113909
3rd row6018126632
4th row4158107204
5th row2018144615
ValueCountFrequency (%)
123456789 10
 
0.7%
0 8
 
0.6%
2229082165 4
 
0.3%
1218526982 4
 
0.3%
6168169671 4
 
0.3%
1068197118 4
 
0.3%
4018121231 4
 
0.3%
3138104478 4
 
0.3%
6168100013 4
 
0.3%
6028139275 3
 
0.2%
Other values (1142) 1300
96.4%
2023-12-12T21:37:51.934154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 2596
19.5%
0 1887
14.2%
8 1801
13.5%
6 1505
11.3%
2 1259
9.4%
3 934
 
7.0%
4 897
 
6.7%
5 851
 
6.4%
7 835
 
6.3%
9 720
 
5.4%
Other values (6) 46
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 13285
99.7%
Dash Punctuation 30
 
0.2%
Space Separator 12
 
0.1%
Other Letter 4
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2596
19.5%
0 1887
14.2%
8 1801
13.6%
6 1505
11.3%
2 1259
9.5%
3 934
 
7.0%
4 897
 
6.8%
5 851
 
6.4%
7 835
 
6.3%
9 720
 
5.4%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 13327
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 2596
19.5%
0 1887
14.2%
8 1801
13.5%
6 1505
11.3%
2 1259
9.4%
3 934
 
7.0%
4 897
 
6.7%
5 851
 
6.4%
7 835
 
6.3%
9 720
 
5.4%
Other values (2) 42
 
0.3%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13327
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 2596
19.5%
0 1887
14.2%
8 1801
13.5%
6 1505
11.3%
2 1259
9.4%
3 934
 
7.0%
4 897
 
6.7%
5 851
 
6.4%
7 835
 
6.3%
9 720
 
5.4%
Other values (2) 42
 
0.3%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

등록일
Date

MISSING 

Distinct1133
Distinct (%)61.4%
Missing113
Missing (%)5.8%
Memory size15.4 KiB
Minimum1999-03-04 00:00:00
Maximum2023-07-07 00:00:00
2023-12-12T21:37:52.077613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:37:52.237653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

수정일
Date

MISSING 

Distinct789
Distinct (%)45.5%
Missing224
Missing (%)11.4%
Memory size15.4 KiB
Minimum2005-10-05 00:00:00
Maximum2023-07-07 00:00:00
2023-12-12T21:37:52.368505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:37:52.760072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2023-12-12T21:37:46.430253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:37:46.558240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T21:37:46.670460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

선사명_한글선사명_영문선사_우편번호선사_주소선사_상세주소사업자번호등록일수정일
0아산상선(주)<NA><NA>부산시 동구 초량1동 1205-1 교직원공제회관빌딩 802호<NA>10481559032007-11-082008-01-24
1대경해운(주)<NA><NA>부산시 중구 중앙동 2가 50-4 효천빌딩 4층<NA>60181139092007-11-212008-01-24
2한국선무<NA><NA>부산시 중구 중앙동 4가 84-5 해운빌딩 4층<NA>60181266322007-11-262008-01-24
3(주)에스제이탱커<NA>27733부산광역시 중구 중앙동4가 78-21 보승빌딩 503호부산광역시 중구 중앙동4가 78-21 보승빌딩 503호<NA>2010-05-042022-11-17
4(주)신한<NA><NA>전남 완도군 완도읍 군내리 1288번지<NA>41581072042007-10-302013-04-02
5대호상선<NA>600102부산 중구 대창동2가37-3번지 센트럴 오피스텔 6층20181446152007-11-082020-05-15
6부산지방기상청<NA><NA>부산시 동래구 명륜동 577번지<NA>60283001532008-01-072008-01-24
7삼성해업<NA><NA>부산시 영도구 대교동1가 75-1번지 대창빌딩 7층<NA>60535966042008-02-152008-02-15
8두남해운(주)<NA><NA><NA><NA><NA>2008-02-152008-02-15
9세인해운<NA>6738서울특별시 서초구 바우뫼로41길 873층(양재동)10186166772008-02-152016-04-12
선사명_한글선사명_영문선사_우편번호선사_주소선사_상세주소사업자번호등록일수정일
1948(주)대흥개발Daeheung Development31921충청남도 서산시 팔봉면 팔봉1로 753호리 621-3번지31681216842019-03-222019-03-22
1949범진상운PANSTAR48934부산광역시 중구 충장대로9번길 66901호(중앙동4가)60181110462019-05-10<NA>
1950대림예선사DAELIM YE SUN SA46040부산광역시 기장군 일광면 박영준길 88-22단독주택60126709162019-06-03<NA>
1951양밍한국 주식회사YANGMING KOREA48939부산광역시 중구 충장대로 11부산무역회관 6층(중앙동4가)22087264162019-11-292019-11-29
1952에이치알쉬핑(주)HR SHIPPING48822부산광역시 동구 중앙대로180번길 6-9영동빌딩 503호(초량동)68887004892020-02-28<NA>
1953보성마린<NA>48730부산광역시 동구 중앙대로286번길 3-4대산빌딩 301호(초량동)60522689612021-02-022021-02-04
1954신안군청<NA>58827전라남도 신안군 압해읍 천사로 1004전라남도 신안군 압해읍 천사로 100441183003482021-03-152021-03-18
1955청송해운<NA>49043부산광역시 영도구 대평로 113층(대평동1가)18381014152021-08-112022-01-18
1956(주)올포랜드<NA>8506서울특별시 금천구 가산디지털1로 1451402호(가산동)40281625552022-03-042022-04-04
1957(주)지성쉬핑gicoms10089648821부산광역시 동구 중앙대로196번길 6-5GS GQUARE 6층(초량동)10587823842023-02-212023-03-09

Duplicate rows

Most frequently occurring

선사명_한글선사명_영문선사_우편번호선사_주소선사_상세주소사업자번호등록일수정일# duplicates
2고려고속훼리Korea Express Ferry Co., Ltd22349인천광역시 중구 연안부두로 70203호(항동7가)12185269822022-07-29<NA>3
0(주)케이티서브마린<NA>29039부산광역시 해운대구 송정동 433-1번지부산광역시 해운대구 송정동 433-1번지<NA>2008-05-292022-11-172
1(주)한성선박<NA>17서울특별시 중구 소공동 51번지 해운센터 신관 17층서울특별시 중구 소공동 51번지 해운센터 신관 17층<NA><NA>2022-11-172
3김정숙Kim25752강원도 동해시 평원1길 3주택 (천곡동)22290821652023-05-24<NA>2
4대경해운(주)<NA><NA>부산시 중구 중앙동 2가 50-4 효천빌딩 4층<NA>60181139092007-11-212008-01-242
5대양해운(주)Daeyang Shipping Co., Ltd63283제주특별자치도 제주시 동문로 125-1(건입동)61681000132022-07-13<NA>2
6부산지방기상청<NA><NA>부산시 동래구 명륜동 577번지<NA>60283001532008-01-072008-01-242
7알파해운(주)ALPHA MARITIME CO.,LTD.4526서울특별시 중구 남대문로1길 34범화빌딩 805호(북창동)10481366772023-03-29<NA>2
8판타지호fantasy32123충청남도 태안군 소원면 행금길 147-21층22305526162023-04-27<NA>2
9해양수산부종합상황실haeyangsoosanboo30110세종특별자치시 다솜2로 941111(어진동)11112022-08-23<NA>2