Overview

Dataset statistics

Number of variables4
Number of observations801
Missing cells242
Missing cells (%)7.6%
Duplicate rows2
Duplicate rows (%)0.2%
Total size in memory25.2 KiB
Average record size in memory32.2 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시연제구_이미용업현황_20200925
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15051416

Alerts

Dataset has 2 (0.2%) duplicate rowsDuplicates
소재지전화 has 242 (30.2%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:54:57.090416
Analysis finished2023-12-10 16:54:57.939560
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct16
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
일반미용업
271 
미용업
195 
피부미용업
83 
이용업
74 
네일미용업
63 
Other values (11)
115 

Length

Max length23
Median length5
Mean length5.4531835
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row이용업
2nd row이용업
3rd row이용업
4th row이용업
5th row이용업

Common Values

ValueCountFrequency (%)
일반미용업 271
33.8%
미용업 195
24.3%
피부미용업 83
 
10.4%
이용업 74
 
9.2%
네일미용업 63
 
7.9%
종합미용업 30
 
3.7%
일반미용업, 화장ㆍ분장 미용업 16
 
2.0%
피부미용업, 화장ㆍ분장 미용업 13
 
1.6%
네일미용업, 화장ㆍ분장 미용업 13
 
1.6%
일반미용업, 네일미용업 10
 
1.2%
Other values (6) 33
 
4.1%

Length

2023-12-11T01:54:58.049798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반미용업 309
32.3%
미용업 261
27.2%
피부미용업 114
 
11.9%
네일미용업 104
 
10.9%
이용업 74
 
7.7%
화장ㆍ분장 66
 
6.9%
종합미용업 30
 
3.1%
Distinct779
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
2023-12-11T01:54:58.531084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length28
Mean length5.9837703
Min length1

Characters and Unicode

Total characters4793
Distinct characters493
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique758 ?
Unique (%)94.6%

Sample

1st row중앙
2nd row일신이용
3rd row평화이용
4th row진주이용
5th row신성
ValueCountFrequency (%)
헤어 19
 
1.9%
네일 10
 
1.0%
이용원 8
 
0.8%
미용실 7
 
0.7%
nail 6
 
0.6%
by 6
 
0.6%
에스테틱 6
 
0.6%
hair 6
 
0.6%
연산점 5
 
0.5%
바이 4
 
0.4%
Other values (892) 950
92.5%
2023-12-11T01:54:59.639869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
305
 
6.4%
303
 
6.3%
226
 
4.7%
124
 
2.6%
102
 
2.1%
100
 
2.1%
90
 
1.9%
86
 
1.8%
) 85
 
1.8%
( 85
 
1.8%
Other values (483) 3287
68.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3766
78.6%
Uppercase Letter 292
 
6.1%
Lowercase Letter 254
 
5.3%
Space Separator 226
 
4.7%
Close Punctuation 85
 
1.8%
Open Punctuation 85
 
1.8%
Other Punctuation 44
 
0.9%
Decimal Number 29
 
0.6%
Dash Punctuation 7
 
0.1%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
305
 
8.1%
303
 
8.0%
124
 
3.3%
102
 
2.7%
100
 
2.7%
90
 
2.4%
86
 
2.3%
81
 
2.2%
77
 
2.0%
49
 
1.3%
Other values (416) 2449
65.0%
Uppercase Letter
ValueCountFrequency (%)
A 28
 
9.6%
N 24
 
8.2%
R 24
 
8.2%
I 23
 
7.9%
E 21
 
7.2%
L 18
 
6.2%
H 18
 
6.2%
B 16
 
5.5%
S 14
 
4.8%
M 13
 
4.5%
Other values (14) 93
31.8%
Lowercase Letter
ValueCountFrequency (%)
a 35
13.8%
i 33
13.0%
e 23
9.1%
n 21
8.3%
o 20
7.9%
y 18
 
7.1%
l 16
 
6.3%
r 15
 
5.9%
h 15
 
5.9%
t 9
 
3.5%
Other values (10) 49
19.3%
Other Punctuation
ValueCountFrequency (%)
& 14
31.8%
, 8
18.2%
. 7
15.9%
# 6
13.6%
' 3
 
6.8%
: 2
 
4.5%
· 2
 
4.5%
% 1
 
2.3%
1
 
2.3%
Decimal Number
ValueCountFrequency (%)
1 9
31.0%
5 5
17.2%
2 4
13.8%
0 4
13.8%
3 3
 
10.3%
4 2
 
6.9%
9 2
 
6.9%
Math Symbol
ValueCountFrequency (%)
> 2
50.0%
< 2
50.0%
Space Separator
ValueCountFrequency (%)
226
100.0%
Close Punctuation
ValueCountFrequency (%)
) 85
100.0%
Open Punctuation
ValueCountFrequency (%)
( 85
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3756
78.4%
Latin 546
 
11.4%
Common 481
 
10.0%
Han 10
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
305
 
8.1%
303
 
8.1%
124
 
3.3%
102
 
2.7%
100
 
2.7%
90
 
2.4%
86
 
2.3%
81
 
2.2%
77
 
2.1%
49
 
1.3%
Other values (408) 2439
64.9%
Latin
ValueCountFrequency (%)
a 35
 
6.4%
i 33
 
6.0%
A 28
 
5.1%
N 24
 
4.4%
R 24
 
4.4%
e 23
 
4.2%
I 23
 
4.2%
n 21
 
3.8%
E 21
 
3.8%
o 20
 
3.7%
Other values (34) 294
53.8%
Common
ValueCountFrequency (%)
226
47.0%
) 85
 
17.7%
( 85
 
17.7%
& 14
 
2.9%
1 9
 
1.9%
, 8
 
1.7%
. 7
 
1.5%
- 7
 
1.5%
# 6
 
1.2%
5 5
 
1.0%
Other values (13) 29
 
6.0%
Han
ValueCountFrequency (%)
3
30.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3756
78.4%
ASCII 1024
 
21.4%
CJK 10
 
0.2%
None 3
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
305
 
8.1%
303
 
8.1%
124
 
3.3%
102
 
2.7%
100
 
2.7%
90
 
2.4%
86
 
2.3%
81
 
2.2%
77
 
2.1%
49
 
1.3%
Other values (408) 2439
64.9%
ASCII
ValueCountFrequency (%)
226
22.1%
) 85
 
8.3%
( 85
 
8.3%
a 35
 
3.4%
i 33
 
3.2%
A 28
 
2.7%
N 24
 
2.3%
R 24
 
2.3%
e 23
 
2.2%
I 23
 
2.2%
Other values (55) 438
42.8%
CJK
ValueCountFrequency (%)
3
30.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
Distinct782
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size6.4 KiB
2023-12-11T01:55:00.125494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length50
Mean length30.242197
Min length20

Characters and Unicode

Total characters24224
Distinct characters233
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique765 ?
Unique (%)95.5%

Sample

1st row부산광역시 연제구 아시아드대로64번길 4, 1층 (거제동)
2nd row부산광역시 연제구 거제대로74번길 80 (거제동)
3rd row부산광역시 연제구 연산동 1807-1 T통B반
4th row부산광역시 연제구 월드컵대로 2-13 (연산동)
5th row부산광역시 연제구 연수로87번길 59 (연산동)
ValueCountFrequency (%)
부산광역시 801
16.9%
연제구 801
16.9%
연산동 646
 
13.7%
1층 194
 
4.1%
거제동 124
 
2.6%
2층 81
 
1.7%
연수로 38
 
0.8%
과정로 34
 
0.7%
중앙대로 28
 
0.6%
고분로 28
 
0.6%
Other values (688) 1956
41.3%
2023-12-11T01:55:00.917265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3930
 
16.2%
1614
 
6.7%
1517
 
6.3%
1 1072
 
4.4%
1026
 
4.2%
919
 
3.8%
858
 
3.5%
817
 
3.4%
( 808
 
3.3%
) 807
 
3.3%
Other values (223) 10856
44.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14113
58.3%
Space Separator 3930
 
16.2%
Decimal Number 3818
 
15.8%
Open Punctuation 808
 
3.3%
Close Punctuation 807
 
3.3%
Other Punctuation 535
 
2.2%
Uppercase Letter 120
 
0.5%
Dash Punctuation 90
 
0.4%
Lowercase Letter 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1614
 
11.4%
1517
 
10.7%
1026
 
7.3%
919
 
6.5%
858
 
6.1%
817
 
5.8%
806
 
5.7%
805
 
5.7%
801
 
5.7%
799
 
5.7%
Other values (187) 4151
29.4%
Uppercase Letter
ValueCountFrequency (%)
S 14
11.7%
K 14
11.7%
I 14
11.7%
E 13
10.8%
V 12
10.0%
W 12
10.0%
A 11
9.2%
B 11
9.2%
G 4
 
3.3%
C 4
 
3.3%
Other values (5) 11
9.2%
Decimal Number
ValueCountFrequency (%)
1 1072
28.1%
2 646
16.9%
3 478
12.5%
0 358
 
9.4%
4 284
 
7.4%
5 246
 
6.4%
8 216
 
5.7%
6 181
 
4.7%
7 178
 
4.7%
9 159
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 530
99.1%
& 2
 
0.4%
@ 2
 
0.4%
/ 1
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
c 1
50.0%
e 1
50.0%
Space Separator
ValueCountFrequency (%)
3930
100.0%
Open Punctuation
ValueCountFrequency (%)
( 808
100.0%
Close Punctuation
ValueCountFrequency (%)
) 807
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 90
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14113
58.3%
Common 9989
41.2%
Latin 122
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1614
 
11.4%
1517
 
10.7%
1026
 
7.3%
919
 
6.5%
858
 
6.1%
817
 
5.8%
806
 
5.7%
805
 
5.7%
801
 
5.7%
799
 
5.7%
Other values (187) 4151
29.4%
Common
ValueCountFrequency (%)
3930
39.3%
1 1072
 
10.7%
( 808
 
8.1%
) 807
 
8.1%
2 646
 
6.5%
, 530
 
5.3%
3 478
 
4.8%
0 358
 
3.6%
4 284
 
2.8%
5 246
 
2.5%
Other values (9) 830
 
8.3%
Latin
ValueCountFrequency (%)
S 14
11.5%
K 14
11.5%
I 14
11.5%
E 13
10.7%
V 12
9.8%
W 12
9.8%
A 11
9.0%
B 11
9.0%
G 4
 
3.3%
C 4
 
3.3%
Other values (7) 13
10.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14113
58.3%
ASCII 10111
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3930
38.9%
1 1072
 
10.6%
( 808
 
8.0%
) 807
 
8.0%
2 646
 
6.4%
, 530
 
5.2%
3 478
 
4.7%
0 358
 
3.5%
4 284
 
2.8%
5 246
 
2.4%
Other values (26) 952
 
9.4%
Hangul
ValueCountFrequency (%)
1614
 
11.4%
1517
 
10.7%
1026
 
7.3%
919
 
6.5%
858
 
6.1%
817
 
5.8%
806
 
5.7%
805
 
5.7%
801
 
5.7%
799
 
5.7%
Other values (187) 4151
29.4%

소재지전화
Text

MISSING 

Distinct556
Distinct (%)99.5%
Missing242
Missing (%)30.2%
Memory size6.4 KiB
2023-12-11T01:55:01.319104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.041145
Min length9

Characters and Unicode

Total characters6731
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique553 ?
Unique (%)98.9%

Sample

1st row051-504-5378
2nd row051-865-9173
3rd row051-851-4933
4th row051-502-0298
5th row051-862-0178
ValueCountFrequency (%)
051-852-3313 2
 
0.4%
051-864-3682 2
 
0.4%
051-757-2844 2
 
0.4%
051-868-1210 1
 
0.2%
051-911-6944 1
 
0.2%
051-946-0034 1
 
0.2%
051-759-4469 1
 
0.2%
051-862-2200 1
 
0.2%
051-852-4327 1
 
0.2%
051-867-3315 1
 
0.2%
Other values (546) 546
97.7%
2023-12-11T01:55:01.919949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1117
16.6%
5 1114
16.6%
0 915
13.6%
1 859
12.8%
8 646
9.6%
6 473
7.0%
7 422
 
6.3%
2 373
 
5.5%
3 329
 
4.9%
4 249
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5614
83.4%
Dash Punctuation 1117
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1114
19.8%
0 915
16.3%
1 859
15.3%
8 646
11.5%
6 473
8.4%
7 422
 
7.5%
2 373
 
6.6%
3 329
 
5.9%
4 249
 
4.4%
9 234
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 1117
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6731
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1117
16.6%
5 1114
16.6%
0 915
13.6%
1 859
12.8%
8 646
9.6%
6 473
7.0%
7 422
 
6.3%
2 373
 
5.5%
3 329
 
4.9%
4 249
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6731
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1117
16.6%
5 1114
16.6%
0 915
13.6%
1 859
12.8%
8 646
9.6%
6 473
7.0%
7 422
 
6.3%
2 373
 
5.5%
3 329
 
4.9%
4 249
 
3.7%

Missing values

2023-12-11T01:54:57.629640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:54:57.800996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)소재지전화
0이용업중앙부산광역시 연제구 아시아드대로64번길 4, 1층 (거제동)051-504-5378
1이용업일신이용부산광역시 연제구 거제대로74번길 80 (거제동)<NA>
2이용업평화이용부산광역시 연제구 연산동 1807-1 T통B반<NA>
3이용업진주이용부산광역시 연제구 월드컵대로 2-13 (연산동)<NA>
4이용업신성부산광역시 연제구 연수로87번길 59 (연산동)051-865-9173
5이용업연우이용부산광역시 연제구 연산동 1146-1 T통B반<NA>
6이용업신광이용부산광역시 연제구 연수로218번길 27 (연산동)<NA>
7이용업보성이용부산광역시 연제구 연수로 173 (연산동,(1층))<NA>
8이용업목화이용부산광역시 연제구 고분로20번길 21 (연산동)051-851-4933
9이용업반도부산광역시 연제구 월드컵대로235번길 33 (거제동)051-502-0298
업종명업소명영업소 주소(도로명)소재지전화
791일반미용업, 네일미용업, 화장ㆍ분장 미용업Bien poeme(빈포엠)부산광역시 연제구 안연로 33, 상가B동 103호 (연산동)<NA>
792일반미용업, 네일미용업, 화장ㆍ분장 미용업이가자헤어비스(연산더샵)부산광역시 연제구 연수로 130, 2층 201, 202호 (연산동, 연산더샵)051-853-8324
793일반미용업, 네일미용업, 화장ㆍ분장 미용업드라포레 연산점부산광역시 연제구 연수로 113, 1~2층 (연산동)051-867-1012
794피부미용업, 네일미용업, 화장ㆍ분장 미용업제시속눈썹부산광역시 연제구 안연로23번길 53, 1층 (연산동)070-8108-0609
795피부미용업, 네일미용업, 화장ㆍ분장 미용업윤네일부산광역시 연제구 해맞이로31번길 58, GIB메네스빌딩 2층 (거제동)<NA>
796피부미용업, 네일미용업, 화장ㆍ분장 미용업네일은. 설렘부산광역시 연제구 중앙천로 7, 1층 (연산동)<NA>
797피부미용업, 네일미용업, 화장ㆍ분장 미용업티나뷰티부산광역시 연제구 신촌로 30, 2층 (연산동)051-852-3777
798피부미용업, 네일미용업, 화장ㆍ분장 미용업라즈뷰티부산광역시 연제구 연수로 130, 124동 101호 (연산동, 연산더샵)<NA>
799피부미용업, 네일미용업, 화장ㆍ분장 미용업네일맑음부산광역시 연제구 연수로 204-1, 1층 (연산동)051-851-9656
800피부미용업, 네일미용업, 화장ㆍ분장 미용업뷰티믈리에부산광역시 연제구 거제천로124번길 16, 1층 (연산동)<NA>

Duplicate rows

Most frequently occurring

업종명업소명영업소 주소(도로명)소재지전화# duplicates
0미용업정 미용실부산광역시 연제구 고분로 105 (연산동)051-864-36822
1종합미용업스피나 피부&네일부산광역시 연제구 과정로 185, 2층 (연산동)<NA>2