Overview

Dataset statistics

Number of variables3
Number of observations46
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)4.3%
Total size in memory1.2 KiB
Average record size in memory26.9 B

Variable types

Text3

Dataset

Description해양경찰청 해양통신장비 수리업체(사설업체)에 관한 데이터로서 (수리)업체명, 전화번호, 주소 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15012769/fileData.do

Alerts

Dataset has 2 (4.3%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 02:54:57.979256
Analysis finished2023-12-12 02:54:58.495933
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct43
Distinct (%)93.5%
Missing0
Missing (%)0.0%
Memory size500.0 B
2023-12-12T11:54:58.764762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length5.173913
Min length3

Characters and Unicode

Total characters238
Distinct characters92
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)87.0%

Sample

1st row삼영전자무선
2nd row위월드
3rd rowKCC
4th rowKOC전기
5th rowMRC
ValueCountFrequency (%)
위월드 2
 
4.1%
대양정보통신 2
 
4.1%
신아정보통신 2
 
4.1%
우리정보통신 1
 
2.0%
신아종합 1
 
2.0%
정보통신 1
 
2.0%
유)에이탑 1
 
2.0%
삼영전자무선 1
 
2.0%
마리타임레디오 1
 
2.0%
삼영 1
 
2.0%
Other values (36) 36
73.5%
2023-12-12T11:54:59.721807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
9.2%
15
 
6.3%
10
 
4.2%
9
 
3.8%
8
 
3.4%
8
 
3.4%
C 7
 
2.9%
6
 
2.5%
5
 
2.1%
5
 
2.1%
Other values (82) 143
60.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 207
87.0%
Uppercase Letter 21
 
8.8%
Other Symbol 4
 
1.7%
Space Separator 3
 
1.3%
Close Punctuation 1
 
0.4%
Open Punctuation 1
 
0.4%
Lowercase Letter 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
10.6%
15
 
7.2%
10
 
4.8%
9
 
4.3%
8
 
3.9%
8
 
3.9%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
Other values (67) 115
55.6%
Uppercase Letter
ValueCountFrequency (%)
C 7
33.3%
S 3
14.3%
R 2
 
9.5%
N 2
 
9.5%
K 2
 
9.5%
E 1
 
4.8%
O 1
 
4.8%
M 1
 
4.8%
A 1
 
4.8%
P 1
 
4.8%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 211
88.7%
Latin 22
 
9.2%
Common 5
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
10.4%
15
 
7.1%
10
 
4.7%
9
 
4.3%
8
 
3.8%
8
 
3.8%
6
 
2.8%
5
 
2.4%
5
 
2.4%
4
 
1.9%
Other values (68) 119
56.4%
Latin
ValueCountFrequency (%)
C 7
31.8%
S 3
13.6%
R 2
 
9.1%
N 2
 
9.1%
K 2
 
9.1%
E 1
 
4.5%
O 1
 
4.5%
M 1
 
4.5%
A 1
 
4.5%
P 1
 
4.5%
Common
ValueCountFrequency (%)
3
60.0%
) 1
 
20.0%
( 1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 207
87.0%
ASCII 27
 
11.3%
None 4
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
10.6%
15
 
7.2%
10
 
4.8%
9
 
4.3%
8
 
3.9%
8
 
3.9%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
Other values (67) 115
55.6%
ASCII
ValueCountFrequency (%)
C 7
25.9%
S 3
11.1%
3
11.1%
R 2
 
7.4%
N 2
 
7.4%
K 2
 
7.4%
E 1
 
3.7%
) 1
 
3.7%
( 1
 
3.7%
O 1
 
3.7%
Other values (4) 4
14.8%
None
ValueCountFrequency (%)
4
100.0%
Distinct44
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size500.0 B
2023-12-12T11:55:00.051911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.978261
Min length11

Characters and Unicode

Total characters551
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)91.3%

Sample

1st row041-934-7467
2nd row042-630-0670
3rd row051-301-2111
4th row051-832-0550
5th row051-400-2685
ValueCountFrequency (%)
042-630-0670 2
 
4.3%
061-242-2568 2
 
4.3%
051-336-4565 1
 
2.2%
052-234-1555 1
 
2.2%
032-820-1119 1
 
2.2%
051-601-6666 1
 
2.2%
051-417-9500 1
 
2.2%
051-467-5001 1
 
2.2%
051-204-6223 1
 
2.2%
051-241-6151 1
 
2.2%
Other values (34) 34
73.9%
2023-12-12T11:55:00.546038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 92
16.7%
0 89
16.2%
1 62
11.3%
5 59
10.7%
2 51
9.3%
6 46
8.3%
4 43
7.8%
3 38
6.9%
8 28
 
5.1%
7 26
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 459
83.3%
Dash Punctuation 92
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 89
19.4%
1 62
13.5%
5 59
12.9%
2 51
11.1%
6 46
10.0%
4 43
9.4%
3 38
8.3%
8 28
 
6.1%
7 26
 
5.7%
9 17
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 92
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 551
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 92
16.7%
0 89
16.2%
1 62
11.3%
5 59
10.7%
2 51
9.3%
6 46
8.3%
4 43
7.8%
3 38
6.9%
8 28
 
5.1%
7 26
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 551
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 92
16.7%
0 89
16.2%
1 62
11.3%
5 59
10.7%
2 51
9.3%
6 46
8.3%
4 43
7.8%
3 38
6.9%
8 28
 
5.1%
7 26
 
4.7%

주소
Text

Distinct44
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size500.0 B
2023-12-12T11:55:00.928090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length23.5
Mean length20.195652
Min length10

Characters and Unicode

Total characters929
Distinct characters138
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)91.3%

Sample

1st row충남 보령시 대천항로 320
2nd row대전광역시 유성구 용산동 585-1번지 위월드
3rd row부산시 사상구 모라동 655-9
4th row부산시 강서구 녹산산단77로 6
5th row부산광역시 영도구 남항서로 6-49
ValueCountFrequency (%)
부산시 17
 
8.1%
영도구 5
 
2.4%
전남 4
 
1.9%
목포시 4
 
1.9%
인천시 3
 
1.4%
군산시 3
 
1.4%
경기도 3
 
1.4%
중구 3
 
1.4%
경남 2
 
1.0%
남구 2
 
1.0%
Other values (144) 163
78.0%
2023-12-12T11:55:01.441881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
165
 
17.8%
48
 
5.2%
1 42
 
4.5%
32
 
3.4%
31
 
3.3%
5 28
 
3.0%
28
 
3.0%
25
 
2.7%
21
 
2.3%
21
 
2.3%
Other values (128) 488
52.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 550
59.2%
Decimal Number 188
 
20.2%
Space Separator 165
 
17.8%
Dash Punctuation 18
 
1.9%
Other Punctuation 3
 
0.3%
Open Punctuation 2
 
0.2%
Close Punctuation 2
 
0.2%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
48
 
8.7%
32
 
5.8%
31
 
5.6%
28
 
5.1%
25
 
4.5%
21
 
3.8%
21
 
3.8%
15
 
2.7%
15
 
2.7%
13
 
2.4%
Other values (112) 301
54.7%
Decimal Number
ValueCountFrequency (%)
1 42
22.3%
5 28
14.9%
3 20
10.6%
2 19
10.1%
4 17
9.0%
0 16
 
8.5%
7 14
 
7.4%
9 11
 
5.9%
8 11
 
5.9%
6 10
 
5.3%
Space Separator
ValueCountFrequency (%)
165
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 550
59.2%
Common 379
40.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
48
 
8.7%
32
 
5.8%
31
 
5.6%
28
 
5.1%
25
 
4.5%
21
 
3.8%
21
 
3.8%
15
 
2.7%
15
 
2.7%
13
 
2.4%
Other values (112) 301
54.7%
Common
ValueCountFrequency (%)
165
43.5%
1 42
 
11.1%
5 28
 
7.4%
3 20
 
5.3%
2 19
 
5.0%
- 18
 
4.7%
4 17
 
4.5%
0 16
 
4.2%
7 14
 
3.7%
9 11
 
2.9%
Other values (6) 29
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 550
59.2%
ASCII 379
40.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
165
43.5%
1 42
 
11.1%
5 28
 
7.4%
3 20
 
5.3%
2 19
 
5.0%
- 18
 
4.7%
4 17
 
4.5%
0 16
 
4.2%
7 14
 
3.7%
9 11
 
2.9%
Other values (6) 29
 
7.7%
Hangul
ValueCountFrequency (%)
48
 
8.7%
32
 
5.8%
31
 
5.6%
28
 
5.1%
25
 
4.5%
21
 
3.8%
21
 
3.8%
15
 
2.7%
15
 
2.7%
13
 
2.4%
Other values (112) 301
54.7%

Correlations

2023-12-12T11:55:01.580560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명전화번호주소
업체명1.0001.0001.000
전화번호1.0001.0001.000
주소1.0001.0001.000

Missing values

2023-12-12T11:54:58.329938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:54:58.449083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명전화번호주소
0삼영전자무선041-934-7467충남 보령시 대천항로 320
1위월드042-630-0670대전광역시 유성구 용산동 585-1번지 위월드
2KCC051-301-2111부산시 사상구 모라동 655-9
3KOC전기051-832-0550부산시 강서구 녹산산단77로 6
4MRC051-400-2685부산광역시 영도구 남항서로 6-49
5대양정보통신061-242-2568전남 목포시 고하대로 597번길 29
6SRC070-4123-6197부산시 영도구 남항동2가 65-23 대도빌딩 5층
7금호마린테크051-293-8589부산시 남구 지게골로 50
8㈜영신정보통신061-643-0333여수시 봉산남7길 23, 103호(대영빌 1층)
9진호통신061-642-2082여수시 봉산동 소재
업체명전화번호주소
36해성전자055-221-0006경남 마산시 남성동 230-1
37㈜턴온전자055-346-3930경남 김해시 진례면 테크노밸리길 161-43
38동양전자통신064-723-5799제주시 임항로 55-2
39한신전자051-412-5551부산시 영도구 남항남로 45
40새한정보통신032-881-2445인천 중구 연안부두로21번길 13
41에스아이텍032-888-5482인천시 중구 연안부두로 24-1 해양센타
42우리정보통신032-886-7541인천시 중구 연안부두로55, 2층(항동7가)
43(유)에이탑061-287-5100전남 목포시 남악1로 52번가길 17-4
44주신에이브이티070-7018-9729경기도 하남시 조정대로 150 아이테코 542호
45유텔레콤㈜02-304-2114서울특별시 은평구 증산로 325 유니온빌딩 3층

Duplicate rows

Most frequently occurring

업체명전화번호주소# duplicates
0대양정보통신061-242-2568전남 목포시 고하대로 597번길 292
1위월드042-630-0670대전광역시 유성구 용산동 585-1번지 위월드2