Overview

Dataset statistics

Number of variables4
Number of observations325
Missing cells85
Missing cells (%)6.5%
Duplicate rows2
Duplicate rows (%)0.6%
Total size in memory10.3 KiB
Average record size in memory32.4 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시_연제구_즉석판매제조가공업소현황_20200613
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15047914

Alerts

업종명 has constant value ""Constant
Dataset has 2 (0.6%) duplicate rowsDuplicates
소재지전화 has 85 (26.2%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:46:32.694696
Analysis finished2023-12-10 16:46:33.181597
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
즉석판매제조가공업
325 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row즉석판매제조가공업
2nd row즉석판매제조가공업
3rd row즉석판매제조가공업
4th row즉석판매제조가공업
5th row즉석판매제조가공업

Common Values

ValueCountFrequency (%)
즉석판매제조가공업 325
100.0%

Length

2023-12-11T01:46:33.247136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:46:33.380908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
즉석판매제조가공업 325
100.0%
Distinct313
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-11T01:46:33.767574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length5.9692308
Min length2

Characters and Unicode

Total characters1940
Distinct characters362
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique303 ?
Unique (%)93.2%

Sample

1st row경주제분소
2nd row양산상회
3rd row함안상회
4th row신흥상회
5th row벽산상회
ValueCountFrequency (%)
주식회사 5
 
1.3%
반찬 4
 
1.0%
주)은하수산 3
 
0.8%
연제점 3
 
0.8%
밀양방앗간 3
 
0.8%
주)부산축산홈플러스아시아드점 2
 
0.5%
건강원 2
 
0.5%
미래식품 2
 
0.5%
잼있는 2
 
0.5%
부엌 2
 
0.5%
Other values (353) 362
92.8%
2023-12-11T01:46:34.459133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
66
 
3.4%
55
 
2.8%
46
 
2.4%
41
 
2.1%
( 40
 
2.1%
) 40
 
2.1%
38
 
2.0%
37
 
1.9%
35
 
1.8%
33
 
1.7%
Other values (352) 1509
77.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1723
88.8%
Space Separator 66
 
3.4%
Open Punctuation 40
 
2.1%
Close Punctuation 40
 
2.1%
Lowercase Letter 30
 
1.5%
Uppercase Letter 28
 
1.4%
Decimal Number 7
 
0.4%
Other Punctuation 4
 
0.2%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
55
 
3.2%
46
 
2.7%
41
 
2.4%
38
 
2.2%
37
 
2.1%
35
 
2.0%
33
 
1.9%
33
 
1.9%
32
 
1.9%
31
 
1.8%
Other values (308) 1342
77.9%
Lowercase Letter
ValueCountFrequency (%)
e 6
20.0%
o 3
10.0%
m 3
10.0%
h 2
 
6.7%
n 2
 
6.7%
r 2
 
6.7%
i 2
 
6.7%
a 2
 
6.7%
u 1
 
3.3%
l 1
 
3.3%
Other values (6) 6
20.0%
Uppercase Letter
ValueCountFrequency (%)
E 4
14.3%
M 3
10.7%
O 2
 
7.1%
N 2
 
7.1%
G 2
 
7.1%
V 2
 
7.1%
J 2
 
7.1%
T 2
 
7.1%
F 2
 
7.1%
Y 1
 
3.6%
Other values (6) 6
21.4%
Decimal Number
ValueCountFrequency (%)
4 2
28.6%
0 2
28.6%
5 1
14.3%
9 1
14.3%
8 1
14.3%
Other Punctuation
ValueCountFrequency (%)
, 2
50.0%
! 1
25.0%
& 1
25.0%
Space Separator
ValueCountFrequency (%)
66
100.0%
Open Punctuation
ValueCountFrequency (%)
( 40
100.0%
Close Punctuation
ValueCountFrequency (%)
) 40
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1723
88.8%
Common 159
 
8.2%
Latin 58
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
55
 
3.2%
46
 
2.7%
41
 
2.4%
38
 
2.2%
37
 
2.1%
35
 
2.0%
33
 
1.9%
33
 
1.9%
32
 
1.9%
31
 
1.8%
Other values (308) 1342
77.9%
Latin
ValueCountFrequency (%)
e 6
 
10.3%
E 4
 
6.9%
M 3
 
5.2%
o 3
 
5.2%
m 3
 
5.2%
O 2
 
3.4%
N 2
 
3.4%
h 2
 
3.4%
G 2
 
3.4%
V 2
 
3.4%
Other values (22) 29
50.0%
Common
ValueCountFrequency (%)
66
41.5%
( 40
25.2%
) 40
25.2%
4 2
 
1.3%
- 2
 
1.3%
0 2
 
1.3%
, 2
 
1.3%
5 1
 
0.6%
9 1
 
0.6%
! 1
 
0.6%
Other values (2) 2
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1722
88.8%
ASCII 217
 
11.2%
Compat Jamo 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
66
30.4%
( 40
18.4%
) 40
18.4%
e 6
 
2.8%
E 4
 
1.8%
M 3
 
1.4%
o 3
 
1.4%
m 3
 
1.4%
O 2
 
0.9%
N 2
 
0.9%
Other values (34) 48
22.1%
Hangul
ValueCountFrequency (%)
55
 
3.2%
46
 
2.7%
41
 
2.4%
38
 
2.2%
37
 
2.1%
35
 
2.0%
33
 
1.9%
33
 
1.9%
32
 
1.9%
31
 
1.8%
Other values (307) 1341
77.9%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct288
Distinct (%)88.6%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-11T01:46:34.803919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length50
Mean length30.418462
Min length19

Characters and Unicode

Total characters9886
Distinct characters171
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique270 ?
Unique (%)83.1%

Sample

1st row부산광역시 연제구 거제천로 140 (연산동)
2nd row부산광역시 연제구 월드컵대로19번길 8 (연산동)
3rd row부산광역시 연제구 거제천로87번길 15-7 (거제동)
4th row부산광역시 연제구 거제천로 103 (거제동,1층 일부)
5th row부산광역시 연제구 월드컵대로3번길 16 (연산동,1층)
ValueCountFrequency (%)
부산광역시 325
16.8%
연제구 325
16.8%
연산동 234
 
12.1%
1층 93
 
4.8%
거제동 72
 
3.7%
연수로 23
 
1.2%
7 20
 
1.0%
89 19
 
1.0%
종합운동장로 15
 
0.8%
연동로8번길 14
 
0.7%
Other values (369) 794
41.1%
2023-12-11T01:46:35.411227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1611
 
16.3%
664
 
6.7%
601
 
6.1%
470
 
4.8%
1 404
 
4.1%
396
 
4.0%
360
 
3.6%
344
 
3.5%
325
 
3.3%
325
 
3.3%
Other values (161) 4386
44.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5991
60.6%
Space Separator 1611
 
16.3%
Decimal Number 1360
 
13.8%
Close Punctuation 322
 
3.3%
Open Punctuation 322
 
3.3%
Other Punctuation 221
 
2.2%
Uppercase Letter 33
 
0.3%
Dash Punctuation 25
 
0.3%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
664
 
11.1%
601
 
10.0%
470
 
7.8%
396
 
6.6%
360
 
6.0%
344
 
5.7%
325
 
5.4%
325
 
5.4%
325
 
5.4%
324
 
5.4%
Other values (132) 1857
31.0%
Uppercase Letter
ValueCountFrequency (%)
E 10
30.3%
A 6
18.2%
B 3
 
9.1%
T 2
 
6.1%
L 2
 
6.1%
W 2
 
6.1%
G 2
 
6.1%
P 2
 
6.1%
S 1
 
3.0%
K 1
 
3.0%
Other values (2) 2
 
6.1%
Decimal Number
ValueCountFrequency (%)
1 404
29.7%
2 180
13.2%
3 127
 
9.3%
8 118
 
8.7%
0 102
 
7.5%
4 101
 
7.4%
5 92
 
6.8%
9 89
 
6.5%
7 81
 
6.0%
6 66
 
4.9%
Other Punctuation
ValueCountFrequency (%)
, 220
99.5%
@ 1
 
0.5%
Space Separator
ValueCountFrequency (%)
1611
100.0%
Close Punctuation
ValueCountFrequency (%)
) 322
100.0%
Open Punctuation
ValueCountFrequency (%)
( 322
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5991
60.6%
Common 3861
39.1%
Latin 34
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
664
 
11.1%
601
 
10.0%
470
 
7.8%
396
 
6.6%
360
 
6.0%
344
 
5.7%
325
 
5.4%
325
 
5.4%
325
 
5.4%
324
 
5.4%
Other values (132) 1857
31.0%
Common
ValueCountFrequency (%)
1611
41.7%
1 404
 
10.5%
) 322
 
8.3%
( 322
 
8.3%
, 220
 
5.7%
2 180
 
4.7%
3 127
 
3.3%
8 118
 
3.1%
0 102
 
2.6%
4 101
 
2.6%
Other values (6) 354
 
9.2%
Latin
ValueCountFrequency (%)
E 10
29.4%
A 6
17.6%
B 3
 
8.8%
T 2
 
5.9%
L 2
 
5.9%
W 2
 
5.9%
G 2
 
5.9%
P 2
 
5.9%
b 1
 
2.9%
S 1
 
2.9%
Other values (3) 3
 
8.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5991
60.6%
ASCII 3895
39.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1611
41.4%
1 404
 
10.4%
) 322
 
8.3%
( 322
 
8.3%
, 220
 
5.6%
2 180
 
4.6%
3 127
 
3.3%
8 118
 
3.0%
0 102
 
2.6%
4 101
 
2.6%
Other values (19) 388
 
10.0%
Hangul
ValueCountFrequency (%)
664
 
11.1%
601
 
10.0%
470
 
7.8%
396
 
6.6%
360
 
6.0%
344
 
5.7%
325
 
5.4%
325
 
5.4%
325
 
5.4%
324
 
5.4%
Other values (132) 1857
31.0%

소재지전화
Text

MISSING 

Distinct220
Distinct (%)91.7%
Missing85
Missing (%)26.2%
Memory size2.7 KiB
2023-12-11T01:46:35.765604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.016667
Min length12

Characters and Unicode

Total characters2884
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique214 ?
Unique (%)89.2%

Sample

1st row051-864-1384
2nd row051-000-0000
3rd row051-865-5341
4th row051-861-4621
5th row051-852-9113
ValueCountFrequency (%)
051-000-0000 11
 
4.6%
051-500-8000 5
 
2.1%
051-860-1052 4
 
1.7%
051-868-8852 2
 
0.8%
051-757-8876 2
 
0.8%
031-792-4950 2
 
0.8%
051-866-0866 1
 
0.4%
051-867-4472 1
 
0.4%
051-867-1667 1
 
0.4%
051-868-3068 1
 
0.4%
Other values (210) 210
87.5%
2023-12-11T01:46:36.252559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 481
16.7%
- 480
16.6%
5 434
15.0%
1 359
12.4%
8 289
10.0%
6 224
7.8%
7 157
 
5.4%
3 138
 
4.8%
2 131
 
4.5%
4 101
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2404
83.4%
Dash Punctuation 480
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 481
20.0%
5 434
18.1%
1 359
14.9%
8 289
12.0%
6 224
9.3%
7 157
 
6.5%
3 138
 
5.7%
2 131
 
5.4%
4 101
 
4.2%
9 90
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 480
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2884
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 481
16.7%
- 480
16.6%
5 434
15.0%
1 359
12.4%
8 289
10.0%
6 224
7.8%
7 157
 
5.4%
3 138
 
4.8%
2 131
 
4.5%
4 101
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2884
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 481
16.7%
- 480
16.6%
5 434
15.0%
1 359
12.4%
8 289
10.0%
6 224
7.8%
7 157
 
5.4%
3 138
 
4.8%
2 131
 
4.5%
4 101
 
3.5%

Missing values

2023-12-11T01:46:33.052185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:46:33.147567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(도로명)소재지전화
0즉석판매제조가공업경주제분소부산광역시 연제구 거제천로 140 (연산동)051-864-1384
1즉석판매제조가공업양산상회부산광역시 연제구 월드컵대로19번길 8 (연산동)051-000-0000
2즉석판매제조가공업함안상회부산광역시 연제구 거제천로87번길 15-7 (거제동)051-865-5341
3즉석판매제조가공업신흥상회부산광역시 연제구 거제천로 103 (거제동,1층 일부)051-861-4621
4즉석판매제조가공업벽산상회부산광역시 연제구 월드컵대로3번길 16 (연산동,1층)051-852-9113
5즉석판매제조가공업대명상회부산광역시 연제구 월드컵대로 19 (연산동)051-862-0992
6즉석판매제조가공업웰빙떡방앗간부산광역시 연제구 해맞이로 77 (거제동)051-503-0460
7즉석판매제조가공업연산방앗간부산광역시 연제구 금련로 12 (연산동)051-865-0639
8즉석판매제조가공업울산방앗간부산광역시 연제구 거제천로87번길 15-1 (거제동,지상1층)051-865-1444
9즉석판매제조가공업밀양방앗간부산광역시 연제구 쌍미천로151번길 22 (연산동)051-866-1366
업종명업소명소재지(도로명)소재지전화
315즉석판매제조가공업(주)부광 농협하나로마트판부점부산광역시 연제구 반송로 88, 홈플러스 부산연산점 (연산동)031-792-4950
316즉석판매제조가공업미래식품부산광역시 연제구 연수로 89, 신세계연제점E마트 1층 (연산동)<NA>
317즉석판매제조가공업해콩 부산연산점부산광역시 연제구 세병로 11, 1층 일부호 (연산동)<NA>
318즉석판매제조가공업알앤알코리아부산광역시 연제구 종합운동장로 7, 홈플러스 아시아드점 1층 (거제동)054-432-8808
319즉석판매제조가공업주식회사 남선푸드부산광역시 연제구 연수로 89, 신세계연제점E마트 1층 (연산동)<NA>
320즉석판매제조가공업주식회사 다미원부산광역시 연제구 연수로 89, 신세계연제점E마트 1층 (연산동)<NA>
321즉석판매제조가공업강가네 돈대박부산광역시 연제구 쌍미천로47번길 52, 1층 (연산동)051-865-5552
322즉석판매제조가공업(주)부광ㅇ부산광역시 연제구 반송로 88, 홈플러스 부산연산점 (연산동)031-792-4950
323즉석판매제조가공업수라원부산광역시 연제구 연수로 89, 신세계연제점E마트 1층 (연산동)<NA>
324즉석판매제조가공업부산축산 홈플러스 연산점부산광역시 연제구 반송로 88, 홈플러스 부산연산점 2층 (연산동)051-831-7511

Duplicate rows

Most frequently occurring

업종명업소명소재지(도로명)소재지전화# duplicates
0즉석판매제조가공업(주)이마트연제점부산광역시 연제구 연수로 89 (연산동)051-860-10522
1즉석판매제조가공업미래식품부산광역시 연제구 연수로 89, 신세계연제점E마트 1층 (연산동)<NA>2