Overview

Dataset statistics

Number of variables3
Number of observations360
Missing cells148
Missing cells (%)13.7%
Duplicate rows1
Duplicate rows (%)0.3%
Total size in memory8.6 KiB
Average record size in memory24.4 B

Variable types

Text3

Dataset

Description부산광역시연제구_즉석판매제조가공업소현황_20221024
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15047914

Alerts

Dataset has 1 (0.3%) duplicate rowsDuplicates
소재지전화 has 148 (41.1%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:46:38.070402
Analysis finished2023-12-10 16:46:38.579783
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct350
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-11T01:46:38.845854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length5.9194444
Min length2

Characters and Unicode

Total characters2131
Distinct characters384
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique341 ?
Unique (%)94.7%

Sample

1st row경주제분소
2nd row양산상회
3rd row함안상회
4th row신흥상회
5th row벽산상회
ValueCountFrequency (%)
주식회사 5
 
1.1%
반찬 5
 
1.1%
연산점 5
 
1.1%
밀양방앗간 3
 
0.7%
홈플러스(주)아시아드점 2
 
0.4%
빅세일마트 2
 
0.4%
별난 2
 
0.4%
2
 
0.4%
엄마손 2
 
0.4%
연제점 2
 
0.4%
Other values (408) 418
93.3%
2023-12-11T01:46:39.413113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
88
 
4.1%
64
 
3.0%
44
 
2.1%
40
 
1.9%
39
 
1.8%
37
 
1.7%
35
 
1.6%
35
 
1.6%
35
 
1.6%
( 32
 
1.5%
Other values (374) 1682
78.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1923
90.2%
Space Separator 88
 
4.1%
Open Punctuation 32
 
1.5%
Close Punctuation 32
 
1.5%
Lowercase Letter 19
 
0.9%
Decimal Number 15
 
0.7%
Other Punctuation 12
 
0.6%
Uppercase Letter 10
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
64
 
3.3%
44
 
2.3%
40
 
2.1%
39
 
2.0%
37
 
1.9%
35
 
1.8%
35
 
1.8%
35
 
1.8%
32
 
1.7%
30
 
1.6%
Other values (340) 1532
79.7%
Lowercase Letter
ValueCountFrequency (%)
i 4
21.1%
n 3
15.8%
o 2
10.5%
m 2
10.5%
c 1
 
5.3%
k 1
 
5.3%
t 1
 
5.3%
h 1
 
5.3%
y 1
 
5.3%
e 1
 
5.3%
Other values (2) 2
10.5%
Uppercase Letter
ValueCountFrequency (%)
O 2
20.0%
E 2
20.0%
K 1
10.0%
F 1
10.0%
S 1
10.0%
W 1
10.0%
T 1
10.0%
N 1
10.0%
Decimal Number
ValueCountFrequency (%)
3 4
26.7%
1 3
20.0%
5 3
20.0%
0 2
13.3%
6 1
 
6.7%
2 1
 
6.7%
8 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
& 5
41.7%
, 4
33.3%
! 2
 
16.7%
. 1
 
8.3%
Space Separator
ValueCountFrequency (%)
88
100.0%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1923
90.2%
Common 179
 
8.4%
Latin 29
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
64
 
3.3%
44
 
2.3%
40
 
2.1%
39
 
2.0%
37
 
1.9%
35
 
1.8%
35
 
1.8%
35
 
1.8%
32
 
1.7%
30
 
1.6%
Other values (340) 1532
79.7%
Latin
ValueCountFrequency (%)
i 4
 
13.8%
n 3
 
10.3%
o 2
 
6.9%
m 2
 
6.9%
O 2
 
6.9%
E 2
 
6.9%
c 1
 
3.4%
K 1
 
3.4%
F 1
 
3.4%
k 1
 
3.4%
Other values (10) 10
34.5%
Common
ValueCountFrequency (%)
88
49.2%
( 32
 
17.9%
) 32
 
17.9%
& 5
 
2.8%
3 4
 
2.2%
, 4
 
2.2%
1 3
 
1.7%
5 3
 
1.7%
0 2
 
1.1%
! 2
 
1.1%
Other values (4) 4
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1923
90.2%
ASCII 208
 
9.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
88
42.3%
( 32
 
15.4%
) 32
 
15.4%
& 5
 
2.4%
3 4
 
1.9%
, 4
 
1.9%
i 4
 
1.9%
n 3
 
1.4%
1 3
 
1.4%
5 3
 
1.4%
Other values (24) 30
 
14.4%
Hangul
ValueCountFrequency (%)
64
 
3.3%
44
 
2.3%
40
 
2.1%
39
 
2.0%
37
 
1.9%
35
 
1.8%
35
 
1.8%
35
 
1.8%
32
 
1.7%
30
 
1.6%
Other values (340) 1532
79.7%
Distinct330
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-11T01:46:39.705097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length56
Mean length31.052778
Min length18

Characters and Unicode

Total characters11179
Distinct characters190
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique311 ?
Unique (%)86.4%

Sample

1st row부산광역시 연제구 거제천로 140 (연산동)
2nd row부산광역시 연제구 월드컵대로19번길 8 (연산동)
3rd row부산광역시 연제구 거제천로87번길 15-7 (거제동)
4th row부산광역시 연제구 거제천로 103 (거제동,1층 일부)
5th row부산광역시 연제구 월드컵대로3번길 16 (연산동,1층)
ValueCountFrequency (%)
부산광역시 360
 
16.4%
연제구 360
 
16.4%
연산동 273
 
12.4%
1층 138
 
6.3%
거제동 73
 
3.3%
2층 22
 
1.0%
연수로 17
 
0.8%
7 17
 
0.8%
89 13
 
0.6%
거제천로 13
 
0.6%
Other values (430) 907
41.4%
2023-12-11T01:46:40.178049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1839
 
16.5%
722
 
6.5%
668
 
6.0%
1 517
 
4.6%
503
 
4.5%
432
 
3.9%
397
 
3.6%
376
 
3.4%
365
 
3.3%
( 362
 
3.2%
Other values (180) 4998
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6615
59.2%
Space Separator 1839
 
16.5%
Decimal Number 1636
 
14.6%
Open Punctuation 362
 
3.2%
Close Punctuation 362
 
3.2%
Other Punctuation 275
 
2.5%
Uppercase Letter 50
 
0.4%
Dash Punctuation 38
 
0.3%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
722
 
10.9%
668
 
10.1%
503
 
7.6%
432
 
6.5%
397
 
6.0%
376
 
5.7%
365
 
5.5%
362
 
5.5%
360
 
5.4%
359
 
5.4%
Other values (150) 2071
31.3%
Uppercase Letter
ValueCountFrequency (%)
E 11
22.0%
V 5
10.0%
I 5
10.0%
W 5
10.0%
K 5
10.0%
S 5
10.0%
A 4
 
8.0%
B 4
 
8.0%
C 1
 
2.0%
D 1
 
2.0%
Other values (4) 4
 
8.0%
Decimal Number
ValueCountFrequency (%)
1 517
31.6%
2 228
13.9%
3 155
 
9.5%
0 131
 
8.0%
4 125
 
7.6%
8 112
 
6.8%
9 96
 
5.9%
5 96
 
5.9%
7 89
 
5.4%
6 87
 
5.3%
Space Separator
ValueCountFrequency (%)
1839
100.0%
Open Punctuation
ValueCountFrequency (%)
( 362
100.0%
Close Punctuation
ValueCountFrequency (%)
) 362
100.0%
Other Punctuation
ValueCountFrequency (%)
, 275
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6615
59.2%
Common 4514
40.4%
Latin 50
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
722
 
10.9%
668
 
10.1%
503
 
7.6%
432
 
6.5%
397
 
6.0%
376
 
5.7%
365
 
5.5%
362
 
5.5%
360
 
5.4%
359
 
5.4%
Other values (150) 2071
31.3%
Common
ValueCountFrequency (%)
1839
40.7%
1 517
 
11.5%
( 362
 
8.0%
) 362
 
8.0%
, 275
 
6.1%
2 228
 
5.1%
3 155
 
3.4%
0 131
 
2.9%
4 125
 
2.8%
8 112
 
2.5%
Other values (6) 408
 
9.0%
Latin
ValueCountFrequency (%)
E 11
22.0%
V 5
10.0%
I 5
10.0%
W 5
10.0%
K 5
10.0%
S 5
10.0%
A 4
 
8.0%
B 4
 
8.0%
C 1
 
2.0%
D 1
 
2.0%
Other values (4) 4
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6615
59.2%
ASCII 4564
40.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1839
40.3%
1 517
 
11.3%
( 362
 
7.9%
) 362
 
7.9%
, 275
 
6.0%
2 228
 
5.0%
3 155
 
3.4%
0 131
 
2.9%
4 125
 
2.7%
8 112
 
2.5%
Other values (20) 458
 
10.0%
Hangul
ValueCountFrequency (%)
722
 
10.9%
668
 
10.1%
503
 
7.6%
432
 
6.5%
397
 
6.0%
376
 
5.7%
365
 
5.5%
362
 
5.5%
360
 
5.4%
359
 
5.4%
Other values (150) 2071
31.3%

소재지전화
Text

MISSING 

Distinct205
Distinct (%)96.7%
Missing148
Missing (%)41.1%
Memory size2.9 KiB
2023-12-11T01:46:40.449313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12
Min length9

Characters and Unicode

Total characters2544
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique200 ?
Unique (%)94.3%

Sample

1st row051-864-1384
2nd row051-865-5341
3rd row051-861-4621
4th row051-852-9113
5th row051-862-0992
ValueCountFrequency (%)
051-860-1052 3
 
1.4%
051-500-8000 3
 
1.4%
051-757-8876 2
 
0.9%
051-868-8852 2
 
0.9%
051-968-5600 2
 
0.9%
051-863-4559 1
 
0.5%
051-863-2080 1
 
0.5%
051-864-1384 1
 
0.5%
051-868-6898 1
 
0.5%
051-864-2786 1
 
0.5%
Other values (195) 195
92.0%
2023-12-11T01:46:40.903105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 423
16.6%
5 393
15.4%
0 352
13.8%
1 314
12.3%
8 266
10.5%
6 217
8.5%
7 152
 
6.0%
3 123
 
4.8%
2 122
 
4.8%
9 94
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2121
83.4%
Dash Punctuation 423
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 393
18.5%
0 352
16.6%
1 314
14.8%
8 266
12.5%
6 217
10.2%
7 152
 
7.2%
3 123
 
5.8%
2 122
 
5.8%
9 94
 
4.4%
4 88
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 423
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2544
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 423
16.6%
5 393
15.4%
0 352
13.8%
1 314
12.3%
8 266
10.5%
6 217
8.5%
7 152
 
6.0%
3 123
 
4.8%
2 122
 
4.8%
9 94
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2544
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 423
16.6%
5 393
15.4%
0 352
13.8%
1 314
12.3%
8 266
10.5%
6 217
8.5%
7 152
 
6.0%
3 123
 
4.8%
2 122
 
4.8%
9 94
 
3.7%

Missing values

2023-12-11T01:46:38.440932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:46:38.543844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명소재지(도로명)소재지전화
0경주제분소부산광역시 연제구 거제천로 140 (연산동)051-864-1384
1양산상회부산광역시 연제구 월드컵대로19번길 8 (연산동)<NA>
2함안상회부산광역시 연제구 거제천로87번길 15-7 (거제동)051-865-5341
3신흥상회부산광역시 연제구 거제천로 103 (거제동,1층 일부)051-861-4621
4벽산상회부산광역시 연제구 월드컵대로3번길 16 (연산동,1층)051-852-9113
5대명상회부산광역시 연제구 월드컵대로 19 (연산동)051-862-0992
6웰빙떡방앗간부산광역시 연제구 해맞이로 77 (거제동)051-503-0460
7연산방앗간부산광역시 연제구 금련로 12 (연산동)051-865-0639
8울산방앗간부산광역시 연제구 거제천로87번길 15-1 (거제동,지상1층)051-865-1444
9밀양방앗간부산광역시 연제구 쌍미천로151번길 22 (연산동)051-866-1366
업소명소재지(도로명)소재지전화
350궁 잔기지떡 부산시청부산광역시 연제구 쌍미천로 175, 1층 (연산동)<NA>
351부부빵집부산광역시 연제구 명륜로 16, 광일메디칼센터 101호 (거제동)051-506-5030
352진도특산영어조합법인부산광역시 연제구 연수로 89, 신세계연제점E마트 1층 (연산동)031-792-6892
353똑똑! 떡볶이입니다부산광역시 연제구 중앙대로1075번길 30, 1층 (연산동)<NA>
354(주)이노믹스부산광역시 연제구 해맞이로 23, 115동 106호 (거제동, 거제유림아시아드)<NA>
355오에스푸드부산광역시 연제구 종합운동장로 7, 부산아시아드주경기장노외주차장 지하2층 (거제동)055-292-8770
356민유통부산광역시 연제구 반송로 88, 홈플러스 부산연산점 2층 (연산동)<NA>
357(주)마켓인부산광역시 연제구 연수로 89, 신세계연제점E마트 1층 (연산동)<NA>
358농촌사랑(주)부산광역시 연제구 연수로 89, 신세계연제점E마트 1층 (연산동)<NA>
359(주)케이지프레시부산광역시 연제구 연수로 89, 신세계연제점E마트 1층 (연산동)<NA>

Duplicate rows

Most frequently occurring

업소명소재지(도로명)소재지전화# duplicates
0(주)이마트연제점부산광역시 연제구 연수로 89 (연산동)051-860-10522