Overview

Dataset statistics

Number of variables4
Number of observations900
Missing cells404
Missing cells (%)11.2%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory28.3 KiB
Average record size in memory32.1 B

Variable types

Categorical1
Text3

Dataset

Description부산시 연제구 환경위생과에 영업신고된 이미용업 업소 현황입니다(이용업, 미용업), 업종, 업소명, 소재지, 전화번호
Author부산광역시 연제구
URLhttps://www.data.go.kr/data/15051416/fileData.do

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates
소재지전화 has 404 (44.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 04:01:48.685761
Analysis finished2023-12-12 04:01:49.281430
Duration0.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct17
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size7.2 KiB
일반미용업
313 
미용업
171 
피부미용업
104 
네일미용업
90 
이용업
66 
Other values (12)
156 

Length

Max length23
Median length5
Mean length5.9333333
Min length3

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row이용업
2nd row이용업
3rd row이용업
4th row이용업
5th row이용업

Common Values

ValueCountFrequency (%)
일반미용업 313
34.8%
미용업 171
19.0%
피부미용업 104
 
11.6%
네일미용업 90
 
10.0%
이용업 66
 
7.3%
종합미용업 35
 
3.9%
피부미용업, 화장ㆍ분장 미용업 23
 
2.6%
화장ㆍ분장 미용업 18
 
2.0%
네일미용업, 화장ㆍ분장 미용업 17
 
1.9%
일반미용업, 화장ㆍ분장 미용업 17
 
1.9%
Other values (7) 46
 
5.1%

Length

2023-12-12T13:01:49.371585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반미용업 356
31.5%
미용업 272
24.0%
피부미용업 154
13.6%
네일미용업 147
13.0%
화장ㆍ분장 101
 
8.9%
이용업 66
 
5.8%
종합미용업 35
 
3.1%
Distinct881
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size7.2 KiB
2023-12-12T13:01:49.696492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length26
Mean length6.1933333
Min length1

Characters and Unicode

Total characters5574
Distinct characters514
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique863 ?
Unique (%)95.9%

Sample

1st row중앙
2nd row일신이용
3rd row평화이용
4th row진주이용
5th row신성
ValueCountFrequency (%)
헤어 27
 
2.3%
네일 16
 
1.3%
뷰티 11
 
0.9%
nail 9
 
0.8%
연산점 9
 
0.8%
미장원 7
 
0.6%
이용원 7
 
0.6%
by 7
 
0.6%
에스테틱 6
 
0.5%
6
 
0.5%
Other values (1002) 1088
91.2%
2023-12-12T13:01:50.277424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
324
 
5.8%
316
 
5.7%
294
 
5.3%
144
 
2.6%
118
 
2.1%
106
 
1.9%
105
 
1.9%
( 98
 
1.8%
) 98
 
1.8%
93
 
1.7%
Other values (504) 3878
69.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4277
76.7%
Lowercase Letter 390
 
7.0%
Uppercase Letter 302
 
5.4%
Space Separator 294
 
5.3%
Open Punctuation 98
 
1.8%
Close Punctuation 98
 
1.8%
Other Punctuation 59
 
1.1%
Decimal Number 44
 
0.8%
Dash Punctuation 7
 
0.1%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
324
 
7.6%
316
 
7.4%
144
 
3.4%
118
 
2.8%
106
 
2.5%
105
 
2.5%
93
 
2.2%
89
 
2.1%
74
 
1.7%
67
 
1.6%
Other values (433) 2841
66.4%
Lowercase Letter
ValueCountFrequency (%)
a 56
14.4%
i 43
11.0%
e 39
10.0%
r 29
 
7.4%
n 29
 
7.4%
o 29
 
7.4%
l 26
 
6.7%
y 24
 
6.2%
h 19
 
4.9%
b 17
 
4.4%
Other values (14) 79
20.3%
Uppercase Letter
ValueCountFrequency (%)
N 33
 
10.9%
A 25
 
8.3%
I 23
 
7.6%
R 22
 
7.3%
B 20
 
6.6%
H 20
 
6.6%
L 18
 
6.0%
O 18
 
6.0%
Y 18
 
6.0%
S 16
 
5.3%
Other values (14) 89
29.5%
Other Punctuation
ValueCountFrequency (%)
& 15
25.4%
, 11
18.6%
. 10
16.9%
# 8
13.6%
' 7
11.9%
: 6
 
10.2%
% 1
 
1.7%
1
 
1.7%
Decimal Number
ValueCountFrequency (%)
1 12
27.3%
9 8
18.2%
2 5
11.4%
5 5
11.4%
0 4
 
9.1%
3 4
 
9.1%
4 3
 
6.8%
7 3
 
6.8%
Math Symbol
ValueCountFrequency (%)
> 2
50.0%
< 2
50.0%
Space Separator
ValueCountFrequency (%)
294
100.0%
Open Punctuation
ValueCountFrequency (%)
( 98
100.0%
Close Punctuation
ValueCountFrequency (%)
) 98
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4268
76.6%
Latin 692
 
12.4%
Common 605
 
10.9%
Han 9
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
324
 
7.6%
316
 
7.4%
144
 
3.4%
118
 
2.8%
106
 
2.5%
105
 
2.5%
93
 
2.2%
89
 
2.1%
74
 
1.7%
67
 
1.6%
Other values (426) 2832
66.4%
Latin
ValueCountFrequency (%)
a 56
 
8.1%
i 43
 
6.2%
e 39
 
5.6%
N 33
 
4.8%
r 29
 
4.2%
n 29
 
4.2%
o 29
 
4.2%
l 26
 
3.8%
A 25
 
3.6%
y 24
 
3.5%
Other values (38) 359
51.9%
Common
ValueCountFrequency (%)
294
48.6%
( 98
 
16.2%
) 98
 
16.2%
& 15
 
2.5%
1 12
 
2.0%
, 11
 
1.8%
. 10
 
1.7%
# 8
 
1.3%
9 8
 
1.3%
' 7
 
1.2%
Other values (13) 44
 
7.3%
Han
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4268
76.6%
ASCII 1296
 
23.3%
CJK 9
 
0.2%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
324
 
7.6%
316
 
7.4%
144
 
3.4%
118
 
2.8%
106
 
2.5%
105
 
2.5%
93
 
2.2%
89
 
2.1%
74
 
1.7%
67
 
1.6%
Other values (426) 2832
66.4%
ASCII
ValueCountFrequency (%)
294
22.7%
( 98
 
7.6%
) 98
 
7.6%
a 56
 
4.3%
i 43
 
3.3%
e 39
 
3.0%
N 33
 
2.5%
r 29
 
2.2%
n 29
 
2.2%
o 29
 
2.2%
Other values (60) 548
42.3%
CJK
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
None
ValueCountFrequency (%)
1
100.0%
Distinct867
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size7.2 KiB
2023-12-12T13:01:50.643582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length52
Mean length31.576667
Min length21

Characters and Unicode

Total characters28419
Distinct characters244
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique836 ?
Unique (%)92.9%

Sample

1st row부산광역시 연제구 아시아드대로64번길 4, 1층 (거제동)
2nd row부산광역시 연제구 거제대로74번길 80 (거제동)
3rd row부산광역시 연제구 금련로 7, 2층 (연산동)
4th row부산광역시 연제구 월드컵대로 2-13 (연산동)
5th row부산광역시 연제구 연수로87번길 59 (연산동)
ValueCountFrequency (%)
부산광역시 900
 
16.1%
연제구 900
 
16.1%
연산동 745
 
13.3%
1층 261
 
4.7%
거제동 140
 
2.5%
2층 135
 
2.4%
연수로 41
 
0.7%
중앙대로 41
 
0.7%
월드컵대로 35
 
0.6%
과정로 33
 
0.6%
Other values (751) 2367
42.3%
2023-12-12T13:01:51.126073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4698
 
16.5%
1830
 
6.4%
1721
 
6.1%
1 1261
 
4.4%
1156
 
4.1%
1068
 
3.8%
986
 
3.5%
942
 
3.3%
( 917
 
3.2%
) 917
 
3.2%
Other values (234) 12923
45.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16379
57.6%
Space Separator 4698
 
16.5%
Decimal Number 4469
 
15.7%
Open Punctuation 917
 
3.2%
Close Punctuation 917
 
3.2%
Other Punctuation 745
 
2.6%
Uppercase Letter 196
 
0.7%
Dash Punctuation 93
 
0.3%
Lowercase Letter 4
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1830
 
11.2%
1721
 
10.5%
1156
 
7.1%
1068
 
6.5%
986
 
6.0%
942
 
5.8%
914
 
5.6%
904
 
5.5%
902
 
5.5%
901
 
5.5%
Other values (200) 5055
30.9%
Uppercase Letter
ValueCountFrequency (%)
S 30
15.3%
I 27
13.8%
K 27
13.8%
E 27
13.8%
W 26
13.3%
V 26
13.3%
A 12
 
6.1%
B 10
 
5.1%
C 3
 
1.5%
D 3
 
1.5%
Other values (3) 5
 
2.6%
Decimal Number
ValueCountFrequency (%)
1 1261
28.2%
2 824
18.4%
3 543
12.2%
0 437
 
9.8%
4 334
 
7.5%
5 274
 
6.1%
6 207
 
4.6%
8 204
 
4.6%
7 202
 
4.5%
9 183
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 741
99.5%
& 2
 
0.3%
@ 1
 
0.1%
/ 1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
c 2
50.0%
e 2
50.0%
Space Separator
ValueCountFrequency (%)
4698
100.0%
Open Punctuation
ValueCountFrequency (%)
( 917
100.0%
Close Punctuation
ValueCountFrequency (%)
) 917
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 93
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16379
57.6%
Common 11840
41.7%
Latin 200
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1830
 
11.2%
1721
 
10.5%
1156
 
7.1%
1068
 
6.5%
986
 
6.0%
942
 
5.8%
914
 
5.6%
904
 
5.5%
902
 
5.5%
901
 
5.5%
Other values (200) 5055
30.9%
Common
ValueCountFrequency (%)
4698
39.7%
1 1261
 
10.7%
( 917
 
7.7%
) 917
 
7.7%
2 824
 
7.0%
, 741
 
6.3%
3 543
 
4.6%
0 437
 
3.7%
4 334
 
2.8%
5 274
 
2.3%
Other values (9) 894
 
7.6%
Latin
ValueCountFrequency (%)
S 30
15.0%
I 27
13.5%
K 27
13.5%
E 27
13.5%
W 26
13.0%
V 26
13.0%
A 12
 
6.0%
B 10
 
5.0%
C 3
 
1.5%
D 3
 
1.5%
Other values (5) 9
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16379
57.6%
ASCII 12040
42.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4698
39.0%
1 1261
 
10.5%
( 917
 
7.6%
) 917
 
7.6%
2 824
 
6.8%
, 741
 
6.2%
3 543
 
4.5%
0 437
 
3.6%
4 334
 
2.8%
5 274
 
2.3%
Other values (24) 1094
 
9.1%
Hangul
ValueCountFrequency (%)
1830
 
11.2%
1721
 
10.5%
1156
 
7.1%
1068
 
6.5%
986
 
6.0%
942
 
5.8%
914
 
5.6%
904
 
5.5%
902
 
5.5%
901
 
5.5%
Other values (200) 5055
30.9%

소재지전화
Text

MISSING 

Distinct493
Distinct (%)99.4%
Missing404
Missing (%)44.9%
Memory size7.2 KiB
2023-12-12T13:01:51.522220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.947581
Min length7

Characters and Unicode

Total characters6918
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique490 ?
Unique (%)98.8%

Sample

1st row051 -504 -5378
2nd row 051- 865-9173
3rd row051 -851 -4933
4th row 051- 862-0178
5th row 051- 865-2863
ValueCountFrequency (%)
051 464
34.7%
868 39
 
2.9%
852 35
 
2.6%
853 27
 
2.0%
851 22
 
1.6%
070 20
 
1.5%
867 20
 
1.5%
866 15
 
1.1%
864 13
 
1.0%
861 13
 
1.0%
Other values (543) 669
50.0%
2023-12-12T13:01:52.169749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 1000
14.5%
- 988
14.3%
958
13.8%
0 806
11.7%
1 758
11.0%
8 566
8.2%
6 412
6.0%
7 380
 
5.5%
2 332
 
4.8%
3 306
 
4.4%
Other values (2) 412
6.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4972
71.9%
Dash Punctuation 988
 
14.3%
Space Separator 958
 
13.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1000
20.1%
0 806
16.2%
1 758
15.2%
8 566
11.4%
6 412
8.3%
7 380
 
7.6%
2 332
 
6.7%
3 306
 
6.2%
4 210
 
4.2%
9 202
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 988
100.0%
Space Separator
ValueCountFrequency (%)
958
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6918
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 1000
14.5%
- 988
14.3%
958
13.8%
0 806
11.7%
1 758
11.0%
8 566
8.2%
6 412
6.0%
7 380
 
5.5%
2 332
 
4.8%
3 306
 
4.4%
Other values (2) 412
6.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6918
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 1000
14.5%
- 988
14.3%
958
13.8%
0 806
11.7%
1 758
11.0%
8 566
8.2%
6 412
6.0%
7 380
 
5.5%
2 332
 
4.8%
3 306
 
4.4%
Other values (2) 412
6.0%

Missing values

2023-12-12T13:01:49.146522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:01:49.241591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)소재지전화
0이용업중앙부산광역시 연제구 아시아드대로64번길 4, 1층 (거제동)051 -504 -5378
1이용업일신이용부산광역시 연제구 거제대로74번길 80 (거제동)<NA>
2이용업평화이용부산광역시 연제구 금련로 7, 2층 (연산동)<NA>
3이용업진주이용부산광역시 연제구 월드컵대로 2-13 (연산동)<NA>
4이용업신성부산광역시 연제구 연수로87번길 59 (연산동)051- 865-9173
5이용업연우이용부산광역시 연제구 대리로19번길 3, 1층 (연산동)<NA>
6이용업신광이용부산광역시 연제구 연수로218번길 27 (연산동)<NA>
7이용업보성이용부산광역시 연제구 연수로 173 (연산동,(1층))<NA>
8이용업목화이용부산광역시 연제구 고분로20번길 21 (연산동)051 -851 -4933
9이용업강남부산광역시 연제구 고분로 50-1 (연산동)<NA>
업종명업소명영업소 주소(도로명)소재지전화
890피부미용업, 네일미용업, 화장ㆍ분장 미용업라즈뷰티부산광역시 연제구 연수로 130, 124동 101호 (연산동, 연산더샵)<NA>
891피부미용업, 네일미용업, 화장ㆍ분장 미용업뷰티믈리에부산광역시 연제구 거제천로124번길 16, 1층 (연산동)<NA>
892피부미용업, 네일미용업, 화장ㆍ분장 미용업윤s' beauty academy부산광역시 연제구 신촌로 14, 3층 (연산동)<NA>
893피부미용업, 네일미용업, 화장ㆍ분장 미용업네일. 별부산광역시 연제구 중앙천로19번길 46, 2층 (연산동)<NA>
894피부미용업, 네일미용업, 화장ㆍ분장 미용업블랑코뷰티부산광역시 연제구 과정로 166, 1층 (연산동)<NA>
895피부미용업, 네일미용업, 화장ㆍ분장 미용업Beauty4(뷰티4)부산광역시 연제구 중앙대로 1049, 3층 (연산동)<NA>
896피부미용업, 네일미용업, 화장ㆍ분장 미용업미드나잇네일(midnight Nail)부산광역시 연제구 월드컵대로73번길 4, 1층 (연산동)<NA>
897피부미용업, 네일미용업, 화장ㆍ분장 미용업뷰티빛담부산광역시 연제구 봉수로 25, 231동 2층 211호 (연산동)<NA>
898피부미용업, 네일미용업, 화장ㆍ분장 미용업홍브로우 토탈뷰티샵부산광역시 연제구 월드컵대로 55, 302동 2층 201호 (연산동, 연제롯데캐슬&데시앙)<NA>
899피부미용업, 네일미용업, 화장ㆍ분장 미용업까모르살롱 속눈썹 왁싱부산광역시 연제구 반송로 14, 3층 (연산동)<NA>

Duplicate rows

Most frequently occurring

업종명업소명영업소 주소(도로명)소재지전화# duplicates
0종합미용업순뷰티부산광역시 연제구 월드컵대로145번길 103, 2층 (연산동)051 -851 -67392