Overview

Dataset statistics

Number of variables4
Number of observations3285
Missing cells1635
Missing cells (%)12.4%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory102.8 KiB
Average record size in memory32.0 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시_사상구_일반및휴게음식점현황_20230619
Author부산광역시 사상구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3078756

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
소재지전화 has 1635 (49.8%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:20:16.019116
Analysis finished2023-12-10 16:20:16.957373
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size25.8 KiB
일반음식점
2616 
휴게음식점
669 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 2616
79.6%
휴게음식점 669
 
20.4%

Length

2023-12-11T01:20:17.018023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:20:17.113857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 2616
79.6%
휴게음식점 669
 
20.4%
Distinct3155
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size25.8 KiB
2023-12-11T01:20:17.358599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length24
Mean length6.5305936
Min length1

Characters and Unicode

Total characters21453
Distinct characters856
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3047 ?
Unique (%)92.8%

Sample

1st row섹션
2nd row목포녹동세발낚지
3rd row조은데이
4th row통큰아재
5th row낙원각
ValueCountFrequency (%)
사상점 104
 
2.3%
주례점 34
 
0.8%
모라점 26
 
0.6%
세븐일레븐 24
 
0.5%
학장점 24
 
0.5%
씨유 24
 
0.5%
엄궁점 21
 
0.5%
gs25 13
 
0.3%
카페 12
 
0.3%
컴포즈커피 12
 
0.3%
Other values (3471) 4163
93.4%
2023-12-11T01:20:17.809171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1185
 
5.5%
739
 
3.4%
395
 
1.8%
325
 
1.5%
308
 
1.4%
297
 
1.4%
277
 
1.3%
275
 
1.3%
274
 
1.3%
251
 
1.2%
Other values (846) 17127
79.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18395
85.7%
Space Separator 1185
 
5.5%
Uppercase Letter 578
 
2.7%
Decimal Number 372
 
1.7%
Lowercase Letter 363
 
1.7%
Open Punctuation 230
 
1.1%
Close Punctuation 230
 
1.1%
Other Punctuation 93
 
0.4%
Dash Punctuation 4
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
739
 
4.0%
395
 
2.1%
325
 
1.8%
308
 
1.7%
297
 
1.6%
277
 
1.5%
275
 
1.5%
274
 
1.5%
251
 
1.4%
243
 
1.3%
Other values (774) 15011
81.6%
Uppercase Letter
ValueCountFrequency (%)
C 66
 
11.4%
S 45
 
7.8%
E 44
 
7.6%
G 43
 
7.4%
A 35
 
6.1%
O 35
 
6.1%
P 30
 
5.2%
U 28
 
4.8%
F 27
 
4.7%
B 27
 
4.7%
Other values (15) 198
34.3%
Lowercase Letter
ValueCountFrequency (%)
o 54
14.9%
e 51
14.0%
a 29
 
8.0%
n 21
 
5.8%
l 20
 
5.5%
i 19
 
5.2%
c 19
 
5.2%
r 17
 
4.7%
m 15
 
4.1%
d 15
 
4.1%
Other values (13) 103
28.4%
Decimal Number
ValueCountFrequency (%)
2 99
26.6%
5 63
16.9%
1 46
12.4%
0 46
12.4%
4 27
 
7.3%
3 21
 
5.6%
9 20
 
5.4%
7 19
 
5.1%
6 18
 
4.8%
8 13
 
3.5%
Other Punctuation
ValueCountFrequency (%)
& 51
54.8%
. 19
 
20.4%
, 12
 
12.9%
· 4
 
4.3%
' 3
 
3.2%
! 3
 
3.2%
: 1
 
1.1%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Space Separator
ValueCountFrequency (%)
1185
100.0%
Open Punctuation
ValueCountFrequency (%)
( 230
100.0%
Close Punctuation
ValueCountFrequency (%)
) 230
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18382
85.7%
Common 2117
 
9.9%
Latin 941
 
4.4%
Han 13
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
739
 
4.0%
395
 
2.1%
325
 
1.8%
308
 
1.7%
297
 
1.6%
277
 
1.5%
275
 
1.5%
274
 
1.5%
251
 
1.4%
243
 
1.3%
Other values (762) 14998
81.6%
Latin
ValueCountFrequency (%)
C 66
 
7.0%
o 54
 
5.7%
e 51
 
5.4%
S 45
 
4.8%
E 44
 
4.7%
G 43
 
4.6%
A 35
 
3.7%
O 35
 
3.7%
P 30
 
3.2%
a 29
 
3.1%
Other values (38) 509
54.1%
Common
ValueCountFrequency (%)
1185
56.0%
( 230
 
10.9%
) 230
 
10.9%
2 99
 
4.7%
5 63
 
3.0%
& 51
 
2.4%
1 46
 
2.2%
0 46
 
2.2%
4 27
 
1.3%
3 21
 
1.0%
Other values (14) 119
 
5.6%
Han
ValueCountFrequency (%)
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
Other values (2) 2
15.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18382
85.7%
ASCII 3053
 
14.2%
CJK 13
 
0.1%
None 4
 
< 0.1%
Geometric Shapes 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1185
38.8%
( 230
 
7.5%
) 230
 
7.5%
2 99
 
3.2%
C 66
 
2.2%
5 63
 
2.1%
o 54
 
1.8%
e 51
 
1.7%
& 51
 
1.7%
1 46
 
1.5%
Other values (60) 978
32.0%
Hangul
ValueCountFrequency (%)
739
 
4.0%
395
 
2.1%
325
 
1.8%
308
 
1.7%
297
 
1.6%
277
 
1.5%
275
 
1.5%
274
 
1.5%
251
 
1.4%
243
 
1.3%
Other values (762) 14998
81.6%
None
ValueCountFrequency (%)
· 4
100.0%
CJK
ValueCountFrequency (%)
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
Other values (2) 2
15.4%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Distinct2850
Distinct (%)86.8%
Missing0
Missing (%)0.0%
Memory size25.8 KiB
2023-12-11T01:20:18.148273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length54
Mean length29.631659
Min length9

Characters and Unicode

Total characters97340
Distinct characters324
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2517 ?
Unique (%)76.6%

Sample

1st row부산광역시 사상구 사상로309번길 14 (덕포동)
2nd row부산광역시 사상구 사상로277번길 27, 1층 (덕포동)
3rd row부산광역시 사상구 운산로 25 (삼락동)
4th row부산광역시 사상구 학장로 157, 1층 (학장동)
5th row부산광역시 사상구 사상로 7 (주례동)
ValueCountFrequency (%)
부산광역시 3282
 
16.9%
사상구 3282
 
16.9%
1층 1240
 
6.4%
괘법동 794
 
4.1%
주례동 661
 
3.4%
모라동 460
 
2.4%
덕포동 375
 
1.9%
감전동 367
 
1.9%
학장동 265
 
1.4%
백양대로 258
 
1.3%
Other values (1580) 8420
43.4%
2023-12-11T01:20:18.654256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16121
 
16.6%
4314
 
4.4%
4102
 
4.2%
1 4084
 
4.2%
3959
 
4.1%
3591
 
3.7%
3419
 
3.5%
3375
 
3.5%
3313
 
3.4%
3302
 
3.4%
Other values (314) 47760
49.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56120
57.7%
Space Separator 16121
 
16.6%
Decimal Number 15644
 
16.1%
Open Punctuation 3290
 
3.4%
Close Punctuation 3290
 
3.4%
Other Punctuation 2253
 
2.3%
Dash Punctuation 419
 
0.4%
Uppercase Letter 169
 
0.2%
Math Symbol 34
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4314
 
7.7%
4102
 
7.3%
3959
 
7.1%
3591
 
6.4%
3419
 
6.1%
3375
 
6.0%
3313
 
5.9%
3302
 
5.9%
3291
 
5.9%
3287
 
5.9%
Other values (275) 20167
35.9%
Uppercase Letter
ValueCountFrequency (%)
A 60
35.5%
B 25
14.8%
P 18
 
10.7%
T 17
 
10.1%
E 9
 
5.3%
G 8
 
4.7%
C 7
 
4.1%
L 5
 
3.0%
S 4
 
2.4%
R 3
 
1.8%
Other values (9) 13
 
7.7%
Decimal Number
ValueCountFrequency (%)
1 4084
26.1%
2 2384
15.2%
3 1537
 
9.8%
0 1432
 
9.2%
6 1182
 
7.6%
7 1175
 
7.5%
4 1139
 
7.3%
9 927
 
5.9%
5 902
 
5.8%
8 882
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 2246
99.7%
. 4
 
0.2%
@ 3
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 28
82.4%
< 3
 
8.8%
> 3
 
8.8%
Space Separator
ValueCountFrequency (%)
16121
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3290
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3290
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 419
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56120
57.7%
Common 41051
42.2%
Latin 169
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4314
 
7.7%
4102
 
7.3%
3959
 
7.1%
3591
 
6.4%
3419
 
6.1%
3375
 
6.0%
3313
 
5.9%
3302
 
5.9%
3291
 
5.9%
3287
 
5.9%
Other values (275) 20167
35.9%
Common
ValueCountFrequency (%)
16121
39.3%
1 4084
 
9.9%
( 3290
 
8.0%
) 3290
 
8.0%
2 2384
 
5.8%
, 2246
 
5.5%
3 1537
 
3.7%
0 1432
 
3.5%
6 1182
 
2.9%
7 1175
 
2.9%
Other values (10) 4310
 
10.5%
Latin
ValueCountFrequency (%)
A 60
35.5%
B 25
14.8%
P 18
 
10.7%
T 17
 
10.1%
E 9
 
5.3%
G 8
 
4.7%
C 7
 
4.1%
L 5
 
3.0%
S 4
 
2.4%
R 3
 
1.8%
Other values (9) 13
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56120
57.7%
ASCII 41220
42.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16121
39.1%
1 4084
 
9.9%
( 3290
 
8.0%
) 3290
 
8.0%
2 2384
 
5.8%
, 2246
 
5.4%
3 1537
 
3.7%
0 1432
 
3.5%
6 1182
 
2.9%
7 1175
 
2.9%
Other values (29) 4479
 
10.9%
Hangul
ValueCountFrequency (%)
4314
 
7.7%
4102
 
7.3%
3959
 
7.1%
3591
 
6.4%
3419
 
6.1%
3375
 
6.0%
3313
 
5.9%
3302
 
5.9%
3291
 
5.9%
3287
 
5.9%
Other values (275) 20167
35.9%

소재지전화
Text

MISSING 

Distinct1625
Distinct (%)98.5%
Missing1635
Missing (%)49.8%
Memory size25.8 KiB
2023-12-11T01:20:18.924477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.002424
Min length12

Characters and Unicode

Total characters19804
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1608 ?
Unique (%)97.5%

Sample

1st row051-302-1735
2nd row051-314-3784
3rd row051-313-5819
4th row051-302-0973
5th row051-301-1541
ValueCountFrequency (%)
051-329-2500 7
 
0.4%
051-329-1234 4
 
0.2%
051-313-7777 3
 
0.2%
051-316-7570 2
 
0.1%
051-312-7776 2
 
0.1%
051-314-5022 2
 
0.1%
051-328-7888 2
 
0.1%
051-303-0290 2
 
0.1%
051-325-8727 2
 
0.1%
051-327-9463 2
 
0.1%
Other values (1615) 1622
98.3%
2023-12-11T01:20:19.365661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 3301
16.7%
1 3145
15.9%
0 2820
14.2%
3 2590
13.1%
5 2578
13.0%
2 1569
7.9%
7 832
 
4.2%
9 763
 
3.9%
8 759
 
3.8%
4 746
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 16503
83.3%
Dash Punctuation 3301
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 3145
19.1%
0 2820
17.1%
3 2590
15.7%
5 2578
15.6%
2 1569
9.5%
7 832
 
5.0%
9 763
 
4.6%
8 759
 
4.6%
4 746
 
4.5%
6 701
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 3301
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 19804
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 3301
16.7%
1 3145
15.9%
0 2820
14.2%
3 2590
13.1%
5 2578
13.0%
2 1569
7.9%
7 832
 
4.2%
9 763
 
3.9%
8 759
 
3.8%
4 746
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 19804
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 3301
16.7%
1 3145
15.9%
0 2820
14.2%
3 2590
13.1%
5 2578
13.0%
2 1569
7.9%
7 832
 
4.2%
9 763
 
3.9%
8 759
 
3.8%
4 746
 
3.8%

Missing values

2023-12-11T01:20:16.825539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:20:16.919712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(도로명)소재지전화
0일반음식점섹션부산광역시 사상구 사상로309번길 14 (덕포동)051-302-1735
1일반음식점목포녹동세발낚지부산광역시 사상구 사상로277번길 27, 1층 (덕포동)<NA>
2일반음식점조은데이부산광역시 사상구 운산로 25 (삼락동)<NA>
3일반음식점통큰아재부산광역시 사상구 학장로 157, 1층 (학장동)<NA>
4일반음식점낙원각부산광역시 사상구 사상로 7 (주례동)051-314-3784
5일반음식점목마포차부산광역시 사상구 사상로 148 (괘법동)051-313-5819
6일반음식점밀양손칼국수부산광역시 사상구 사상로 493 (모라동)051-302-0973
7일반음식점구포집부산광역시 사상구 사상로 394 (덕포동)051-301-1541
8일반음식점현대식당부산광역시 사상구 사상로 76-6 (주례동)051-316-5725
9일반음식점야간매점부산광역시 사상구 사상로342번길 10, 1층 (덕포동)051-303-1207
업종명업소명소재지(도로명)소재지전화
3275휴게음식점랑이랑(사상점)부산광역시 사상구 사상로223번길 23, 304동 110호 (괘법동, 센트럴 스타힐스)<NA>
3276휴게음식점스탠다드부산광역시 사상구 사상로 312, 1층 (덕포동)<NA>
3277휴게음식점뉴욕버거 학장점부산광역시 사상구 대동로 133-2, 1층 (학장동)<NA>
3278휴게음식점원커피숍부산광역시 사상구 사상로 179, 2층 (괘법동)<NA>
3279휴게음식점앤티앤스 부산서부시외버스터미널점부산광역시 사상구 사상로 201, 서부시외버스터미널 본관동 1층 (괘법동)<NA>
3280휴게음식점얌얌 푸르츠부산광역시 사상구 광장로93번길 53, 1층 (괘법동)<NA>
3281휴게음식점통통성맘부산광역시 사상구 엄궁남로 20, 1층 (엄궁동)<NA>
3282휴게음식점판크로스부산광역시 사상구 광장로 17, 이마트 사상점 (괘법동)<NA>
3283휴게음식점커피스토어 주례럭키점부산광역시 사상구 동주로 14, 1층 (주례동)<NA>
3284휴게음식점팥을 직접 삶는 집부산광역시 사상구 엄궁북로 37, 1층 (엄궁동)<NA>

Duplicate rows

Most frequently occurring

업종명업소명소재지(도로명)소재지전화# duplicates
0휴게음식점롯데쇼핑(주)롯데마트사상부산광역시 사상구 낙동대로 733 (엄궁동)051-329-25003