Overview

Dataset statistics

Number of variables3
Number of observations50
Missing cells15
Missing cells (%)10.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory26.6 B

Variable types

Text3

Dataset

Description부산광역시_기장군_수산물가공업체현황_20230920
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3072042

Alerts

전화번호 has 15 (30.0%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:32:32.593123
Analysis finished2023-12-10 16:32:32.996325
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct48
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-11T01:32:33.156016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length5.32
Min length3

Characters and Unicode

Total characters266
Distinct characters107
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)92.0%

Sample

1st row(주)기장사람들
2nd row(주)이삭에프앤비
3rd row33식품
4th row99식품
5th row㈜기장명품특산물
ValueCountFrequency (%)
동해식품 2
 
4.0%
기장식품 2
 
4.0%
엄지식품 1
 
2.0%
주)기장사람들 1
 
2.0%
에스씨푸드 1
 
2.0%
블루오션에스㈜ 1
 
2.0%
삼광수산(주 1
 
2.0%
삼기물산 1
 
2.0%
수협1번중매인 1
 
2.0%
신광식품 1
 
2.0%
Other values (38) 38
76.0%
2023-12-11T01:32:33.463194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18
 
6.8%
17
 
6.4%
12
 
4.5%
10
 
3.8%
9
 
3.4%
8
 
3.0%
8
 
3.0%
8
 
3.0%
7
 
2.6%
5
 
1.9%
Other values (97) 164
61.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 228
85.7%
Uppercase Letter 11
 
4.1%
Other Symbol 10
 
3.8%
Open Punctuation 5
 
1.9%
Close Punctuation 5
 
1.9%
Decimal Number 5
 
1.9%
Other Punctuation 1
 
0.4%
Dash Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
7.9%
17
 
7.5%
12
 
5.3%
9
 
3.9%
8
 
3.5%
8
 
3.5%
8
 
3.5%
7
 
3.1%
5
 
2.2%
5
 
2.2%
Other values (82) 131
57.5%
Uppercase Letter
ValueCountFrequency (%)
O 4
36.4%
D 2
18.2%
S 1
 
9.1%
E 1
 
9.1%
A 1
 
9.1%
H 1
 
9.1%
F 1
 
9.1%
Decimal Number
ValueCountFrequency (%)
9 2
40.0%
3 2
40.0%
1 1
20.0%
Other Symbol
ValueCountFrequency (%)
10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 238
89.5%
Common 17
 
6.4%
Latin 11
 
4.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
7.6%
17
 
7.1%
12
 
5.0%
10
 
4.2%
9
 
3.8%
8
 
3.4%
8
 
3.4%
8
 
3.4%
7
 
2.9%
5
 
2.1%
Other values (83) 136
57.1%
Common
ValueCountFrequency (%)
( 5
29.4%
) 5
29.4%
9 2
 
11.8%
3 2
 
11.8%
1 1
 
5.9%
. 1
 
5.9%
- 1
 
5.9%
Latin
ValueCountFrequency (%)
O 4
36.4%
D 2
18.2%
S 1
 
9.1%
E 1
 
9.1%
A 1
 
9.1%
H 1
 
9.1%
F 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 228
85.7%
ASCII 28
 
10.5%
None 10
 
3.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
18
 
7.9%
17
 
7.5%
12
 
5.3%
9
 
3.9%
8
 
3.5%
8
 
3.5%
8
 
3.5%
7
 
3.1%
5
 
2.2%
5
 
2.2%
Other values (82) 131
57.5%
None
ValueCountFrequency (%)
10
100.0%
ASCII
ValueCountFrequency (%)
( 5
17.9%
) 5
17.9%
O 4
14.3%
D 2
 
7.1%
9 2
 
7.1%
3 2
 
7.1%
1 1
 
3.6%
S 1
 
3.6%
E 1
 
3.6%
A 1
 
3.6%
Other values (4) 4
14.3%
Distinct42
Distinct (%)84.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-11T01:32:33.676426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length26
Mean length21.88
Min length19

Characters and Unicode

Total characters1094
Distinct characters59
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)76.0%

Sample

1st row부산광역시 기장군 일광면 일광로 747-2
2nd row부산광역시 기장군 장안읍 오리길 2-12
3rd row부산광역시 기장군 기장읍 대변로 146
4th row부산광역시 기장군 기장읍 기장해안로 593-11
5th row부산광역시 기장군 장안읍 오리길 166
ValueCountFrequency (%)
부산광역시 50
19.9%
기장군 50
19.9%
기장읍 28
 
11.2%
대변로 10
 
4.0%
정관읍 8
 
3.2%
일광면 8
 
3.2%
장안읍 6
 
2.4%
146 6
 
2.4%
두메로 5
 
2.0%
기장해안로 4
 
1.6%
Other values (61) 76
30.3%
2023-12-11T01:32:34.075781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
201
18.4%
89
 
8.1%
83
 
7.6%
61
 
5.6%
59
 
5.4%
51
 
4.7%
50
 
4.6%
50
 
4.6%
50
 
4.6%
42
 
3.8%
Other values (49) 358
32.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 722
66.0%
Space Separator 201
 
18.4%
Decimal Number 158
 
14.4%
Dash Punctuation 13
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
12.3%
83
11.5%
61
8.4%
59
 
8.2%
51
 
7.1%
50
 
6.9%
50
 
6.9%
50
 
6.9%
42
 
5.8%
38
 
5.3%
Other values (37) 149
20.6%
Decimal Number
ValueCountFrequency (%)
1 36
22.8%
6 29
18.4%
4 19
12.0%
5 17
10.8%
3 15
9.5%
7 11
 
7.0%
9 9
 
5.7%
8 8
 
5.1%
2 7
 
4.4%
0 7
 
4.4%
Space Separator
ValueCountFrequency (%)
201
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 722
66.0%
Common 372
34.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
12.3%
83
11.5%
61
8.4%
59
 
8.2%
51
 
7.1%
50
 
6.9%
50
 
6.9%
50
 
6.9%
42
 
5.8%
38
 
5.3%
Other values (37) 149
20.6%
Common
ValueCountFrequency (%)
201
54.0%
1 36
 
9.7%
6 29
 
7.8%
4 19
 
5.1%
5 17
 
4.6%
3 15
 
4.0%
- 13
 
3.5%
7 11
 
3.0%
9 9
 
2.4%
8 8
 
2.2%
Other values (2) 14
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 722
66.0%
ASCII 372
34.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
201
54.0%
1 36
 
9.7%
6 29
 
7.8%
4 19
 
5.1%
5 17
 
4.6%
3 15
 
4.0%
- 13
 
3.5%
7 11
 
3.0%
9 9
 
2.4%
8 8
 
2.2%
Other values (2) 14
 
3.8%
Hangul
ValueCountFrequency (%)
89
12.3%
83
11.5%
61
8.4%
59
 
8.2%
51
 
7.1%
50
 
6.9%
50
 
6.9%
50
 
6.9%
42
 
5.8%
38
 
5.3%
Other values (37) 149
20.6%

전화번호
Text

MISSING 

Distinct33
Distinct (%)94.3%
Missing15
Missing (%)30.0%
Memory size532.0 B
2023-12-11T01:32:34.292645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters420
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)88.6%

Sample

1st row051-722-2248
2nd row051-723-2293
3rd row051-722-5453
4th row051-721-0900
5th row051-721-5666
ValueCountFrequency (%)
051-782-0961 2
 
5.7%
051-722-4381 2
 
5.7%
051-722-2248 1
 
2.9%
051-724-0430 1
 
2.9%
051-266-3636 1
 
2.9%
051-722-2512 1
 
2.9%
051-721-0400 1
 
2.9%
051-727-2073 1
 
2.9%
051-723-6545 1
 
2.9%
051-721-0511 1
 
2.9%
Other values (23) 23
65.7%
2023-12-11T01:32:34.640200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 70
16.7%
0 61
14.5%
2 60
14.3%
1 57
13.6%
7 51
12.1%
5 48
11.4%
6 18
 
4.3%
3 18
 
4.3%
4 16
 
3.8%
8 13
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 350
83.3%
Dash Punctuation 70
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 61
17.4%
2 60
17.1%
1 57
16.3%
7 51
14.6%
5 48
13.7%
6 18
 
5.1%
3 18
 
5.1%
4 16
 
4.6%
8 13
 
3.7%
9 8
 
2.3%
Dash Punctuation
ValueCountFrequency (%)
- 70
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 420
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 70
16.7%
0 61
14.5%
2 60
14.3%
1 57
13.6%
7 51
12.1%
5 48
11.4%
6 18
 
4.3%
3 18
 
4.3%
4 16
 
3.8%
8 13
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 420
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 70
16.7%
0 61
14.5%
2 60
14.3%
1 57
13.6%
7 51
12.1%
5 48
11.4%
6 18
 
4.3%
3 18
 
4.3%
4 16
 
3.8%
8 13
 
3.1%

Correlations

2023-12-11T01:32:34.758521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명소재지(도로명주소)전화번호
업소명1.0000.9780.978
소재지(도로명주소)0.9781.0001.000
전화번호0.9781.0001.000

Missing values

2023-12-11T01:32:32.880361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:32:32.968343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명소재지(도로명주소)전화번호
0(주)기장사람들부산광역시 기장군 일광면 일광로 747-2051-722-2248
1(주)이삭에프앤비부산광역시 기장군 장안읍 오리길 2-12051-723-2293
233식품부산광역시 기장군 기장읍 대변로 146<NA>
399식품부산광역시 기장군 기장읍 기장해안로 593-11<NA>
4㈜기장명품특산물부산광역시 기장군 장안읍 오리길 166<NA>
5㈜마린바이오프로세스부산광역시 기장군 일광면 횡계길 7051-722-5453
6㈜보성상사부산광역시 기장군 정관읍 산단7로 83051-721-0900
7㈜석하부산광역시 기장군 정관읍 산단2로 6-17051-721-5666
8㈜제이엔디부산광역시 기장군 정관읍 산단3로 47051-896-1034
9㈜지이스트냉동부산광역시 기장군 기장읍 기장해안로 640051-723-5570
업소명소재지(도로명주소)전화번호
40은진이네부산광역시 기장군 기장읍 두메로 4051-722-1760
41지은이네부산광역시 기장군 기장읍 대변로 141<NA>
42진양식품부산광역시 기장군 정관읍 산단5로 100-81051-522-0033
43통영수산부산광역시 기장군 기장읍 대변로 146<NA>
44한식품부산광역시 기장군 기장읍 기장대로 475051-724-8872
45해민산업부산광역시 기장군 기장읍 반송로 1565051-722-2117
46해양식품부산광역시 기장군 장안읍 해맞이로 335051-727-4879
47해조나라영어법인부산광역시 기장군 기장읍 두메로 158051-724-2904
48혜인식품부산광역시 기장군 기장읍 대변로 146<NA>
49후푸드(HOO-FOOD)부산광역시 기장군 정관읍 정관상곡1길 18051-727-5884