Overview

Dataset statistics

Number of variables: 4
Number of observations: 771
Missing cells: 420
Missing cells (%): 13.6%
Duplicate rows: 5
Duplicate rows (%): 0.6%
Total size in memory: 24.2 KiB
Average record size in memory: 32.2 B
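
The two percentages above follow from the raw counts. A minimal sketch of the arithmetic, assuming the missing-cell share is taken over all cells (rows times columns) and the duplicate share over all rows; the variable names are illustrative and not part of the report:

```python
# Counts copied from the dataset statistics above.
n_variables = 4
n_observations = 771
missing_cells = 420
duplicate_rows = 5

# Assumption: the missing-cell share is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: the duplicate share is taken over all rows.
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```

Under these assumptions the results round to the 13.6% and 0.6% reported above.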

Variable types

Categorical: 1
Text: 3

Dataset

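Description: Barber and beauty shop status, Yeonje-gu, Busan Metropolitan City, 2019-06-13 (부산광역시연제구_이미용업현황_20190613)
Author: Yeonje-gu, Busan Metropolitan City (부산광역시 연제구)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15051416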

Alerts

Dataset has 5 (0.6%) duplicate rows [Duplicates]
소재지전화 has 420 (54.5%) missing values [Missing]

Reproduction

Analysis started: 2023-12-10 16:55:03.428993
Analysis finished: 2023-12-10 16:55:04.143861
Duration: 0.71 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
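
The reported duration can be checked against the two timestamps. A small sketch using only the Python standard library; the variable names are illustrative:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2023-12-10 16:55:03.428993")
finished = datetime.fromisoformat("2023-12-10 16:55:04.143861")

# Elapsed wall-clock time between start and finish, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```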

Variables

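업종명 (business type)
Categorical

Distinct: 16
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.2 KiB

Value | Count
미용업(일반) [beauty business, general] | 265
미용업 [beauty business] | 201
미용업(피부) [beauty business, skin care] | 80
이용업 [barber business] | 72
미용업(손톱ㆍ발톱) [beauty business, nails] | 53
Other values (11) | 100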

Length

Max length: 31
Median length: 28
Mean length: 6.9455253
Min length: 3

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 이용업
2nd row: 이용업
3rd row: 이용업
4th row: 이용업
5th row: 이용업

Common Values

Value | Count | Frequency (%)
미용업(일반) | 265 | 34.4%
미용업 | 201 | 26.1%
미용업(피부) | 80 | 10.4%
이용업 | 72 | 9.3%
미용업(손톱ㆍ발톱) | 53 | 6.9%
미용업(종합) | 31 | 4.0%
미용업(일반), 미용업(화장ㆍ분장) | 15 | 1.9%
미용업(일반), 미용업(손톱ㆍ발톱) | 12 | 1.6%
미용업(손톱ㆍ발톱), 미용업(화장ㆍ분장) | 12 | 1.6%
미용업(화장ㆍ분장) | 7 | 0.9%
Other values (6) | 23 | 3.0%

Length

Figure: Histogram of lengths of the category

Value | Count | Frequency (%)
미용업(일반 | 302 | 35.9%
미용업 | 201 | 23.9%
미용업(피부 | 99 | 11.8%
미용업(손톱ㆍ발톱 | 89 | 10.6%
이용업 | 72 | 8.6%
미용업(화장ㆍ분장 | 48 | 5.7%
미용업(종합 | 31 | 3.7%

업소명 (business name)
Text

Distinct: 746
Distinct (%): 96.8%
Missing: 0
Missing (%): 0.0%
Memory size: 6.2 KiB

Length

Max length: 32
Median length: 26
Mean length: 5.9636835
Min length: 1

Characters and Unicode

Total characters: 4598
Distinct characters: 481
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
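
As an illustration of how such per-character properties can be read, here is a minimal sketch that counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own implementation; the helper `category_counts` and the `CATEGORY_NAMES` mapping are illustrative, and the three input strings are business names taken from the sample rows of this report:

```python
import unicodedata
from collections import Counter

# Readable names for the two-letter general-category codes seen in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
    "Sm": "Math Symbol",
    "Sk": "Modifier Symbol",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for char in value:
            code = unicodedata.category(char)
            # Fall back to the raw code for categories not listed above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Three business names taken from the sample rows of this report.
names = ["중앙", "네일(Nail) 빛", "Bien poeme(빈포엠)"]
counts = category_counts(names)
for name, count in counts.most_common():
    print(name, count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables for the Korean text columns.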

Unique

Unique: 722
Unique (%): 93.6%

Sample

1st row: 중앙
2nd row: 연산초등교구내이용
3rd row: 일신이용
4th row: 평화이용
5th row: 진주이용

Value | Count | Frequency (%)
헤어 | 16 | 1.7%
네일 | 8 | 0.8%
이용원 | 8 | 0.8%
미용실 | 8 | 0.8%
에스테틱 | 6 | 0.6%
hair | 6 | 0.6%
nail | 6 | 0.6%
by | 5 | 0.5%
머리사랑 | 3 | 0.3%
메이크업 | 3 | 0.3%
Other values (843) | 898 | 92.9%

Most occurring characters

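Value | Count | Frequency (%)
(lost) | 299 | 6.5%
(lost) | 296 | 6.4%
(space) | 196 | 4.3%
(lost) | 124 | 2.7%
(lost) | 106 | 2.3%
(lost) | 92 | 2.0%
(lost) | 87 | 1.9%
(lost) | 83 | 1.8%
) | 82 | 1.8%
( | 82 | 1.8%
Other values (471) | 3151 | 68.5%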

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3650 | 79.4%
Uppercase Letter | 282 | 6.1%
Lowercase Letter | 231 | 5.0%
Space Separator | 196 | 4.3%
Close Punctuation | 82 | 1.8%
Open Punctuation | 82 | 1.8%
Other Punctuation | 43 | 0.9%
Decimal Number | 20 | 0.4%
Dash Punctuation | 7 | 0.2%
Math Symbol | 4 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(lost) | 299 | 8.2%
(lost) | 296 | 8.1%
(lost) | 124 | 3.4%
(lost) | 106 | 2.9%
(lost) | 92 | 2.5%
(lost) | 87 | 2.4%
(lost) | 83 | 2.3%
(lost) | 80 | 2.2%
(lost) | 73 | 2.0%
(lost) | 46 | 1.3%
Other values (407) | 2364 | 64.8%

Uppercase Letter
Value | Count | Frequency (%)
A | 27 | 9.6%
N | 26 | 9.2%
R | 21 | 7.4%
E | 21 | 7.4%
I | 21 | 7.4%
B | 19 | 6.7%
L | 17 | 6.0%
Y | 14 | 5.0%
M | 14 | 5.0%
H | 14 | 5.0%
Other values (14) | 88 | 31.2%

Lowercase Letter
Value | Count | Frequency (%)
a | 35 | 15.2%
i | 34 | 14.7%
e | 22 | 9.5%
n | 18 | 7.8%
y | 16 | 6.9%
l | 16 | 6.9%
r | 15 | 6.5%
h | 14 | 6.1%
o | 12 | 5.2%
b | 9 | 3.9%
Other values (9) | 40 | 17.3%

Other Punctuation
Value | Count | Frequency (%)
& | 15 | 34.9%
. | 8 | 18.6%
? | 7 | 16.3%
, | 5 | 11.6%
# | 5 | 11.6%
· | 1 | 2.3%
(lost) | 1 | 2.3%
' | 1 | 2.3%

Decimal Number
Value | Count | Frequency (%)
1 | 7 | 35.0%
2 | 4 | 20.0%
5 | 3 | 15.0%
0 | 3 | 15.0%
3 | 2 | 10.0%
4 | 1 | 5.0%

Math Symbol
Value | Count | Frequency (%)
< | 2 | 50.0%
> | 2 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 196 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 82 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 82 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 7 | 100.0%

Modifier Symbol
Value | Count | Frequency (%)
` | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3640 | 79.2%
Latin | 513 | 11.2%
Common | 435 | 9.5%
Han | 10 | 0.2%

Most frequent character per script

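Hangul
Value | Count | Frequency (%)
(lost) | 299 | 8.2%
(lost) | 296 | 8.1%
(lost) | 124 | 3.4%
(lost) | 106 | 2.9%
(lost) | 92 | 2.5%
(lost) | 87 | 2.4%
(lost) | 83 | 2.3%
(lost) | 80 | 2.2%
(lost) | 73 | 2.0%
(lost) | 46 | 1.3%
Other values (399) | 2354 | 64.7%

Latin
Value | Count | Frequency (%)
a | 35 | 6.8%
i | 34 | 6.6%
A | 27 | 5.3%
N | 26 | 5.1%
e | 22 | 4.3%
R | 21 | 4.1%
E | 21 | 4.1%
I | 21 | 4.1%
B | 19 | 3.7%
n | 18 | 3.5%
Other values (33) | 269 | 52.4%

Common
Value | Count | Frequency (%)
(space) | 196 | 45.1%
) | 82 | 18.9%
( | 82 | 18.9%
& | 15 | 3.4%
. | 8 | 1.8%
? | 7 | 1.6%
- | 7 | 1.6%
1 | 7 | 1.6%
, | 5 | 1.1%
# | 5 | 1.1%
Other values (11) | 21 | 4.8%

Han
Value | Count | Frequency (%)
(lost) | 3 | 30.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%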

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3640 | 79.2%
ASCII | 946 | 20.6%
CJK | 10 | 0.2%
None | 2 | < 0.1%

Most frequent character per block

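Hangul
Value | Count | Frequency (%)
(lost) | 299 | 8.2%
(lost) | 296 | 8.1%
(lost) | 124 | 3.4%
(lost) | 106 | 2.9%
(lost) | 92 | 2.5%
(lost) | 87 | 2.4%
(lost) | 83 | 2.3%
(lost) | 80 | 2.2%
(lost) | 73 | 2.0%
(lost) | 46 | 1.3%
Other values (399) | 2354 | 64.7%

ASCII
Value | Count | Frequency (%)
(space) | 196 | 20.7%
) | 82 | 8.7%
( | 82 | 8.7%
a | 35 | 3.7%
i | 34 | 3.6%
A | 27 | 2.9%
N | 26 | 2.7%
e | 22 | 2.3%
R | 21 | 2.2%
E | 21 | 2.2%
Other values (52) | 400 | 42.3%

CJK
Value | Count | Frequency (%)
(lost) | 3 | 30.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%
(lost) | 1 | 10.0%

None
Value | Count | Frequency (%)
· | 1 | 50.0%
(lost) | 1 | 50.0%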

업소소재지(도로명) (business address, road name)
Text

Distinct: 747
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Memory size: 6.2 KiB

Length

Max length: 57
Median length: 48
Mean length: 29.722438
Min length: 20

Characters and Unicode

Total characters: 22916
Distinct characters: 223
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 725
Unique (%): 94.0%

Sample

1st row: 부산광역시 연제구 아시아드대로64번길 4, 1층 (거제동)
2nd row: 부산광역시 연제구 월드컵대로 41 (연산동)
3rd row: 부산광역시 연제구 거제대로74번길 80 (거제동)
4th row: 부산광역시 연제구 연산동 1807번지 1호 (T/B)
5th row: 부산광역시 연제구 월드컵대로 2-13 (연산동)

Value | Count | Frequency (%)
부산광역시 | 771 | 17.2%
연제구 | 771 | 17.2%
연산동 | 617 | 13.8%
1층 | 177 | 4.0%
거제동 | 119 | 2.7%
2층 | 72 | 1.6%
과정로 | 33 | 0.7%
연수로 | 32 | 0.7%
고분로 | 28 | 0.6%
중앙대로 | 24 | 0.5%
Other values (651) | 1834 | 41.0%

Most occurring characters

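Value | Count | Frequency (%)
(space) | 3719 | 16.2%
(lost) | 1549 | 6.8%
(lost) | 1454 | 6.3%
1 | 983 | 4.3%
(lost) | 982 | 4.3%
(lost) | 876 | 3.8%
(lost) | 818 | 3.6%
(lost) | 783 | 3.4%
( | 778 | 3.4%
) | 777 | 3.4%
Other values (213) | 10197 | 44.5%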

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 13425 | 58.6%
Space Separator | 3719 | 16.2%
Decimal Number | 3573 | 15.6%
Open Punctuation | 778 | 3.4%
Close Punctuation | 777 | 3.4%
Other Punctuation | 478 | 2.1%
Dash Punctuation | 84 | 0.4%
Uppercase Letter | 80 | 0.3%
Lowercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(lost) | 1549 | 11.5%
(lost) | 1454 | 10.8%
(lost) | 982 | 7.3%
(lost) | 876 | 6.5%
(lost) | 818 | 6.1%
(lost) | 783 | 5.8%
(lost) | 775 | 5.8%
(lost) | 771 | 5.7%
(lost) | 771 | 5.7%
(lost) | 769 | 5.7%
Other values (178) | 3877 | 28.9%

Uppercase Letter
Value | Count | Frequency (%)
B | 14 | 17.5%
A | 11 | 13.8%
S | 7 | 8.8%
K | 7 | 8.8%
I | 7 | 8.8%
E | 5 | 6.2%
G | 5 | 6.2%
C | 4 | 5.0%
W | 4 | 5.0%
V | 4 | 5.0%
Other values (5) | 12 | 15.0%

Decimal Number
Value | Count | Frequency (%)
1 | 983 | 27.5%
2 | 606 | 17.0%
3 | 429 | 12.0%
0 | 337 | 9.4%
4 | 258 | 7.2%
5 | 236 | 6.6%
8 | 214 | 6.0%
6 | 185 | 5.2%
7 | 169 | 4.7%
9 | 156 | 4.4%

Other Punctuation
Value | Count | Frequency (%)
, | 470 | 98.3%
/ | 5 | 1.0%
@ | 2 | 0.4%
& | 1 | 0.2%

Lowercase Letter
Value | Count | Frequency (%)
c | 1 | 50.0%
e | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 3719 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 778 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 777 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 84 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 13425 | 58.6%
Common | 9409 | 41.1%
Latin | 82 | 0.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(lost) | 1549 | 11.5%
(lost) | 1454 | 10.8%
(lost) | 982 | 7.3%
(lost) | 876 | 6.5%
(lost) | 818 | 6.1%
(lost) | 783 | 5.8%
(lost) | 775 | 5.8%
(lost) | 771 | 5.7%
(lost) | 771 | 5.7%
(lost) | 769 | 5.7%
Other values (178) | 3877 | 28.9%

Common
Value | Count | Frequency (%)
(space) | 3719 | 39.5%
1 | 983 | 10.4%
( | 778 | 8.3%
) | 777 | 8.3%
2 | 606 | 6.4%
, | 470 | 5.0%
3 | 429 | 4.6%
0 | 337 | 3.6%
4 | 258 | 2.7%
5 | 236 | 2.5%
Other values (8) | 816 | 8.7%

Latin
Value | Count | Frequency (%)
B | 14 | 17.1%
A | 11 | 13.4%
S | 7 | 8.5%
K | 7 | 8.5%
I | 7 | 8.5%
E | 5 | 6.1%
G | 5 | 6.1%
C | 4 | 4.9%
W | 4 | 4.9%
V | 4 | 4.9%
Other values (7) | 14 | 17.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 13425 | 58.6%
ASCII | 9491 | 41.4%

Most frequent character per block

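ASCII
Value | Count | Frequency (%)
(space) | 3719 | 39.2%
1 | 983 | 10.4%
( | 778 | 8.2%
) | 777 | 8.2%
2 | 606 | 6.4%
, | 470 | 5.0%
3 | 429 | 4.5%
0 | 337 | 3.6%
4 | 258 | 2.7%
5 | 236 | 2.5%
Other values (25) | 898 | 9.5%

Hangul
Value | Count | Frequency (%)
(lost) | 1549 | 11.5%
(lost) | 1454 | 10.8%
(lost) | 982 | 7.3%
(lost) | 876 | 6.5%
(lost) | 818 | 6.1%
(lost) | 783 | 5.8%
(lost) | 775 | 5.8%
(lost) | 771 | 5.7%
(lost) | 771 | 5.7%
(lost) | 769 | 5.7%
Other values (178) | 3877 | 28.9%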

소재지전화 (location phone number)
Text

MISSING

Distinct: 348
Distinct (%): 99.1%
Missing: 420
Missing (%): 54.5%
Memory size: 6.2 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.011396
Min length: 12

Characters and Unicode

Total characters: 4216
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
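
The length and character figures for this column are mutually consistent, which a short calculation can confirm. A minimal sketch, assuming lengths are measured over non-missing values only; the variable names are illustrative:

```python
# Figures copied from the 소재지전화 summary in this report.
n_observations = 771
n_missing = 420
total_characters = 4216
dash_count = 702  # count of "-" in the character tables for this column

# Assumption: lengths are measured over non-missing values only.
n_present = n_observations - n_missing
mean_length = total_characters / n_present
dashes_per_value = dash_count / n_present

# Characters beyond the 12-character minimum, summed over all values.
extra_characters = total_characters - 12 * n_present

print(n_present, f"{mean_length:.6f}", dashes_per_value, extra_characters)
```

With a minimum length of 12 and a maximum of 13, the four extra characters correspond to four 13-character phone numbers; every value contains exactly two dashes on average, matching the `051-865-9173` pattern in the sample.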

Unique

Unique: 345
Unique (%): 98.3%

Sample

1st row: 051-865-9173
2nd row: 051-862-0178
3rd row: 051-865-2863
4th row: 051-852-4534
5th row: 051-866-3862

Value | Count | Frequency (%)
051-853-3162 | 2 | 0.6%
051-852-3313 | 2 | 0.6%
051-757-2844 | 2 | 0.6%
051-868-8217 | 1 | 0.3%
051-501-3448 | 1 | 0.3%
051-868-0512 | 1 | 0.3%
051-863-7004 | 1 | 0.3%
051-853-6730 | 1 | 0.3%
051-751-1002 | 1 | 0.3%
051-755-8385 | 1 | 0.3%
Other values (338) | 338 | 96.3%

Most occurring characters

Value | Count | Frequency (%)
5 | 714 | 16.9%
- | 702 | 16.7%
0 | 553 | 13.1%
1 | 550 | 13.0%
8 | 397 | 9.4%
6 | 298 | 7.1%
7 | 264 | 6.3%
2 | 245 | 5.8%
3 | 204 | 4.8%
9 | 148 | 3.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 3514 | 83.3%
Dash Punctuation | 702 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 714 | 20.3%
0 | 553 | 15.7%
1 | 550 | 15.7%
8 | 397 | 11.3%
6 | 298 | 8.5%
7 | 264 | 7.5%
2 | 245 | 7.0%
3 | 204 | 5.8%
9 | 148 | 4.2%
4 | 141 | 4.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 702 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 4216 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
5 | 714 | 16.9%
- | 702 | 16.7%
0 | 553 | 13.1%
1 | 550 | 13.0%
8 | 397 | 9.4%
6 | 298 | 7.1%
7 | 264 | 6.3%
2 | 245 | 5.8%
3 | 204 | 4.8%
9 | 148 | 3.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 4216 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
5 | 714 | 16.9%
- | 702 | 16.7%
0 | 553 | 13.1%
1 | 550 | 13.0%
8 | 397 | 9.4%
6 | 298 | 7.1%
7 | 264 | 6.3%
2 | 245 | 5.8%
3 | 204 | 4.8%
9 | 148 | 3.5%

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.
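
To make the idea of a nullity matrix concrete, here is a minimal text-only sketch. It is not how the report's figures were drawn; the helpers `nullity_matrix` and `missing_per_column` are illustrative, and the data are the 업소명 and 소재지전화 values of the first ten sample rows in this report, with `None` standing in for `<NA>`:

```python
# First ten sample rows of this report as (업소명, 소재지전화) pairs.
# None stands in for the <NA> marker.
rows = [
    ("중앙", None),
    ("연산초등교구내이용", None),
    ("일신이용", None),
    ("평화이용", None),
    ("진주이용", None),
    ("신성", "051-865-9173"),
    ("연우이용", None),
    ("신광이용", None),
    ("보성이용", None),
    ("목화이용", None),
]


def nullity_matrix(records):
    """Return one text line per row: '#' for a present cell, '.' for a missing one."""
    return ["".join("." if cell is None else "#" for cell in record)
            for record in records]


def missing_per_column(records):
    """Count missing cells in each column."""
    return [sum(cell is None for cell in column) for column in zip(*records)]


matrix = nullity_matrix(rows)
print("\n".join(matrix))
print(missing_per_column(rows))
```

Each printed line is one row of the matrix; a column of dots makes a sparsely filled field such as 소재지전화 visible at a glance.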

Sample

First rows

Index | 업종명 | 업소명 | 업소소재지(도로명) | 소재지전화
0 | 이용업 | 중앙 | 부산광역시 연제구 아시아드대로64번길 4, 1층 (거제동) | <NA>
1 | 이용업 | 연산초등교구내이용 | 부산광역시 연제구 월드컵대로 41 (연산동) | <NA>
2 | 이용업 | 일신이용 | 부산광역시 연제구 거제대로74번길 80 (거제동) | <NA>
3 | 이용업 | 평화이용 | 부산광역시 연제구 연산동 1807번지 1호 (T/B) | <NA>
4 | 이용업 | 진주이용 | 부산광역시 연제구 월드컵대로 2-13 (연산동) | <NA>
5 | 이용업 | 신성 | 부산광역시 연제구 연수로87번길 59 (연산동) | 051-865-9173
6 | 이용업 | 연우이용 | 부산광역시 연제구 연산동 1146번지 1호 (T/B) | <NA>
7 | 이용업 | 신광이용 | 부산광역시 연제구 연수로218번길 27 (연산동) | <NA>
8 | 이용업 | 보성이용 | 부산광역시 연제구 연수로 173 (연산동,(1층)) | <NA>
9 | 이용업 | 목화이용 | 부산광역시 연제구 고분로20번길 21 (연산동) | <NA>

Last rows

Index | 업종명 | 업소명 | 업소소재지(도로명) | 소재지전화
761 | 미용업(손톱ㆍ발톱), 미용업(화장ㆍ분장) | 네일(Nail) 빛 | 부산광역시 연제구 반송로 80, 108동 1층 105호 (연산동, 연산동 일동 미라주 더 스타) | <NA>
762 | 미용업(일반), 미용업(피부), 미용업(화장ㆍ분장) | 연재헤어 | 부산광역시 연제구 금련로18번길 24, 1층 (연산동) | <NA>
763 | 미용업(일반), 미용업(손톱ㆍ발톱), 미용업(화장ㆍ분장) | 디아이엠(DEIM) | 부산광역시 연제구 여고로52번길 45 (거제동) | <NA>
764 | 미용업(일반), 미용업(손톱ㆍ발톱), 미용업(화장ㆍ분장) | 팝콘 | 부산광역시 연제구 고분로236번길 75, 1층 (연산동) | <NA>
765 | 미용업(일반), 미용업(손톱ㆍ발톱), 미용업(화장ㆍ분장) | Bien poeme(빈포엠) | 부산광역시 연제구 안연로 33, 상가B동 103호 (연산동) | <NA>
766 | 미용업(일반), 미용업(손톱ㆍ발톱), 미용업(화장ㆍ분장) | 이가자헤어비스(연산더샵) | 부산광역시 연제구 연수로 130, 2층 201, 202호 (연산동, 연산더샵) | <NA>
767 | 미용업(피부), 미용업(손톱ㆍ발톱), 미용업(화장ㆍ분장) | 제시속눈썹 | 부산광역시 연제구 안연로23번길 53, 1층 (연산동) | <NA>
768 | 미용업(피부), 미용업(손톱ㆍ발톱), 미용업(화장ㆍ분장) | 윤네일 | 부산광역시 연제구 해맞이로31번길 58, GIB메네스빌딩 2층 (거제동) | <NA>
769 | 미용업(피부), 미용업(손톱ㆍ발톱), 미용업(화장ㆍ분장) | 네일은. 설렘 | 부산광역시 연제구 중앙천로 7, 1층 (연산동) | <NA>
770 | 미용업(피부), 미용업(손톱ㆍ발톱), 미용업(화장ㆍ분장) | 지안뷰티(JI AN BEAUTY) | 부산광역시 연제구 중앙대로 1130, 103동 4층 404호 (연산동, 연산동 SK VIEW(2단지)) | <NA>

Duplicate rows

Most frequently occurring

Index | 업종명 | 업소명 | 업소소재지(도로명) | 소재지전화 | # duplicates
0 | 미용업 | 날마다예쁜집 | 부산광역시 연제구 토곡로 50-1 (연산동) | <NA> | 2
1 | 미용업 | 정 미용실 | 부산광역시 연제구 고분로 105 (연산동) | <NA> | 2
2 | 미용업(일반) | 똥머리 | 부산광역시 연제구 거제천로87번길 40 (거제동) | 051-853-3162 | 2
3 | 미용업(일반) | 지붕개량헤어샵 | 부산광역시 연제구 교대로 3 (거제동) | <NA> | 2
4 | 미용업(종합) | 스피나 피부&네일 | 부산광역시 연제구 과정로 185, 2층 (연산동) | <NA> | 2
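
Duplicate rows of this kind can be found by counting identical records. A minimal sketch with the standard library, assuming a row counts as a duplicate when every field matches and that two missing phone numbers compare as equal (which is how the table above appears to treat `<NA>`); the address column is omitted for brevity and the variable names are illustrative:

```python
from collections import Counter

# A few rows from this report as (업종명, 업소명, 소재지전화) tuples.
# None stands in for <NA>; the address column is left out for brevity.
rows = [
    ("미용업", "정 미용실", None),
    ("미용업(일반)", "똥머리", "051-853-3162"),
    ("미용업", "정 미용실", None),
    ("이용업", "신성", "051-865-9173"),
    ("미용업(일반)", "똥머리", "051-853-3162"),
]

# Count how often each complete row occurs, then keep those seen more than once.
row_counts = Counter(rows)
duplicates = {row: n for row, n in row_counts.items() if n > 1}

for row, n in duplicates.items():
    print(row, n)
```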