Overview

Dataset statistics

Number of variables4
Number of observations1172
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory36.8 KiB
Average record size in memory32.1 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시 연제구 환경위생과 영업신고된 공중위생업소 관리 현황입니다(이묭업, 미용업, 세탁업, 목욕장업, 숙박업, 건물관리위생업)
Author부산광역시 연제구
URLhttps://www.data.go.kr/data/15051414/fileData.do

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 07:29:02.821622
Analysis finished2023-12-12 07:29:03.432263
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct22
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size9.3 KiB
일반미용업
317 
미용업
168 
피부미용업
102 
네일미용업
93 
세탁업
84 
Other values (17)
408 

Length

Max length23
Median length19
Mean length5.7730375
Min length3

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
일반미용업 317
27.0%
미용업 168
14.3%
피부미용업 102
 
8.7%
네일미용업 93
 
7.9%
세탁업 84
 
7.2%
건물위생관리업 78
 
6.7%
이용업 65
 
5.5%
숙박업(일반) 62
 
5.3%
목욕장업 41
 
3.5%
종합미용업 35
 
3.0%
Other values (12) 127
10.8%

Length

2023-12-12T16:29:03.516198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반미용업 361
25.8%
미용업 267
19.1%
피부미용업 150
10.7%
네일미용업 148
10.6%
화장ㆍ분장 99
 
7.1%
세탁업 84
 
6.0%
건물위생관리업 78
 
5.6%
이용업 65
 
4.6%
숙박업(일반 62
 
4.4%
목욕장업 41
 
2.9%
Other values (2) 43
 
3.1%
Distinct1148
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size9.3 KiB
2023-12-12T16:29:03.801388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length26
Mean length6.1313993
Min length1

Characters and Unicode

Total characters7186
Distinct characters572
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1126 ?
Unique (%)96.1%

Sample

1st row궁락모텔
2nd row미진장여관
3rd row제일여관
4th row대원장여관
5th row에그(egg)모텔
ValueCountFrequency (%)
헤어 27
 
1.8%
네일 17
 
1.1%
주식회사 17
 
1.1%
뷰티 11
 
0.7%
연산점 10
 
0.7%
nail 8
 
0.5%
by 7
 
0.5%
이용원 7
 
0.5%
7
 
0.5%
hair 6
 
0.4%
Other values (1296) 1405
92.3%
2023-12-12T16:29:04.263880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
352
 
4.9%
331
 
4.6%
316
 
4.4%
181
 
2.5%
( 145
 
2.0%
) 145
 
2.0%
135
 
1.9%
135
 
1.9%
127
 
1.8%
112
 
1.6%
Other values (562) 5207
72.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5678
79.0%
Lowercase Letter 399
 
5.6%
Space Separator 352
 
4.9%
Uppercase Letter 336
 
4.7%
Open Punctuation 145
 
2.0%
Close Punctuation 145
 
2.0%
Other Punctuation 67
 
0.9%
Decimal Number 55
 
0.8%
Dash Punctuation 8
 
0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
331
 
5.8%
316
 
5.6%
181
 
3.2%
135
 
2.4%
135
 
2.4%
127
 
2.2%
112
 
2.0%
94
 
1.7%
77
 
1.4%
71
 
1.3%
Other values (492) 4099
72.2%
Lowercase Letter
ValueCountFrequency (%)
a 52
13.0%
i 43
10.8%
e 42
10.5%
o 31
 
7.8%
n 30
 
7.5%
r 26
 
6.5%
l 26
 
6.5%
y 25
 
6.3%
h 21
 
5.3%
t 16
 
4.0%
Other values (14) 87
21.8%
Uppercase Letter
ValueCountFrequency (%)
N 38
 
11.3%
I 27
 
8.0%
A 27
 
8.0%
O 23
 
6.8%
B 22
 
6.5%
H 22
 
6.5%
Y 21
 
6.2%
R 20
 
6.0%
L 19
 
5.7%
S 16
 
4.8%
Other values (14) 101
30.1%
Decimal Number
ValueCountFrequency (%)
1 15
27.3%
9 9
16.4%
2 7
12.7%
5 6
 
10.9%
7 5
 
9.1%
4 4
 
7.3%
0 4
 
7.3%
3 4
 
7.3%
8 1
 
1.8%
Other Punctuation
ValueCountFrequency (%)
& 22
32.8%
. 12
17.9%
, 12
17.9%
# 8
 
11.9%
: 6
 
9.0%
' 5
 
7.5%
% 1
 
1.5%
1
 
1.5%
Space Separator
ValueCountFrequency (%)
352
100.0%
Open Punctuation
ValueCountFrequency (%)
( 145
100.0%
Close Punctuation
ValueCountFrequency (%)
) 145
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5669
78.9%
Common 773
 
10.8%
Latin 735
 
10.2%
Han 9
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
331
 
5.8%
316
 
5.6%
181
 
3.2%
135
 
2.4%
135
 
2.4%
127
 
2.2%
112
 
2.0%
94
 
1.7%
77
 
1.4%
71
 
1.3%
Other values (485) 4090
72.1%
Latin
ValueCountFrequency (%)
a 52
 
7.1%
i 43
 
5.9%
e 42
 
5.7%
N 38
 
5.2%
o 31
 
4.2%
n 30
 
4.1%
I 27
 
3.7%
A 27
 
3.7%
r 26
 
3.5%
l 26
 
3.5%
Other values (38) 393
53.5%
Common
ValueCountFrequency (%)
352
45.5%
( 145
18.8%
) 145
18.8%
& 22
 
2.8%
1 15
 
1.9%
. 12
 
1.6%
, 12
 
1.6%
9 9
 
1.2%
- 8
 
1.0%
# 8
 
1.0%
Other values (12) 45
 
5.8%
Han
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5669
78.9%
ASCII 1507
 
21.0%
CJK 9
 
0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
352
23.4%
( 145
 
9.6%
) 145
 
9.6%
a 52
 
3.5%
i 43
 
2.9%
e 42
 
2.8%
N 38
 
2.5%
o 31
 
2.1%
n 30
 
2.0%
I 27
 
1.8%
Other values (59) 602
39.9%
Hangul
ValueCountFrequency (%)
331
 
5.8%
316
 
5.6%
181
 
3.2%
135
 
2.4%
135
 
2.4%
127
 
2.2%
112
 
2.0%
94
 
1.7%
77
 
1.4%
71
 
1.3%
Other values (485) 4090
72.1%
CJK
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
None
ValueCountFrequency (%)
1
100.0%
Distinct1118
Distinct (%)95.4%
Missing0
Missing (%)0.0%
Memory size9.3 KiB
2023-12-12T16:29:04.608137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length54
Mean length31.285836
Min length21

Characters and Unicode

Total characters36667
Distinct characters255
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1070 ?
Unique (%)91.3%

Sample

1st row부산광역시 연제구 중앙대로1120번길 17 (연산동)
2nd row부산광역시 연제구 거제시장로 24 (거제동)
3rd row부산광역시 연제구 월드컵대로 217-1 (거제동)
4th row부산광역시 연제구 거제천로230번길 98 (연산동)
5th row부산광역시 연제구 고분로13번길 13 (연산동)
ValueCountFrequency (%)
부산광역시 1172
 
16.4%
연제구 1172
 
16.4%
연산동 953
 
13.3%
1층 301
 
4.2%
거제동 183
 
2.6%
2층 149
 
2.1%
중앙대로 49
 
0.7%
일부호 48
 
0.7%
과정로 47
 
0.7%
월드컵대로 46
 
0.6%
Other values (896) 3029
42.4%
2023-12-12T16:29:05.150744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5979
 
16.3%
2349
 
6.4%
2225
 
6.1%
1 1644
 
4.5%
1520
 
4.1%
1354
 
3.7%
1270
 
3.5%
1237
 
3.4%
( 1191
 
3.2%
) 1191
 
3.2%
Other values (245) 16707
45.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21264
58.0%
Space Separator 5979
 
16.3%
Decimal Number 5793
 
15.8%
Open Punctuation 1191
 
3.2%
Close Punctuation 1191
 
3.2%
Other Punctuation 898
 
2.4%
Uppercase Letter 211
 
0.6%
Dash Punctuation 126
 
0.3%
Math Symbol 10
 
< 0.1%
Lowercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2349
 
11.0%
2225
 
10.5%
1520
 
7.1%
1354
 
6.4%
1270
 
6.0%
1237
 
5.8%
1187
 
5.6%
1176
 
5.5%
1173
 
5.5%
1173
 
5.5%
Other values (208) 6600
31.0%
Uppercase Letter
ValueCountFrequency (%)
S 31
14.7%
E 28
13.3%
I 28
13.3%
K 28
13.3%
V 27
12.8%
W 27
12.8%
B 14
6.6%
A 13
6.2%
C 5
 
2.4%
D 3
 
1.4%
Other values (5) 7
 
3.3%
Decimal Number
ValueCountFrequency (%)
1 1644
28.4%
2 1037
17.9%
3 672
11.6%
0 564
 
9.7%
4 443
 
7.6%
5 370
 
6.4%
8 287
 
5.0%
6 274
 
4.7%
7 263
 
4.5%
9 239
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 893
99.4%
& 2
 
0.2%
@ 1
 
0.1%
/ 1
 
0.1%
. 1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
c 2
50.0%
e 2
50.0%
Space Separator
ValueCountFrequency (%)
5979
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1191
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1191
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 126
100.0%
Math Symbol
ValueCountFrequency (%)
~ 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21264
58.0%
Common 15188
41.4%
Latin 215
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2349
 
11.0%
2225
 
10.5%
1520
 
7.1%
1354
 
6.4%
1270
 
6.0%
1237
 
5.8%
1187
 
5.6%
1176
 
5.5%
1173
 
5.5%
1173
 
5.5%
Other values (208) 6600
31.0%
Common
ValueCountFrequency (%)
5979
39.4%
1 1644
 
10.8%
( 1191
 
7.8%
) 1191
 
7.8%
2 1037
 
6.8%
, 893
 
5.9%
3 672
 
4.4%
0 564
 
3.7%
4 443
 
2.9%
5 370
 
2.4%
Other values (10) 1204
 
7.9%
Latin
ValueCountFrequency (%)
S 31
14.4%
E 28
13.0%
I 28
13.0%
K 28
13.0%
V 27
12.6%
W 27
12.6%
B 14
6.5%
A 13
6.0%
C 5
 
2.3%
D 3
 
1.4%
Other values (7) 11
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21264
58.0%
ASCII 15403
42.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5979
38.8%
1 1644
 
10.7%
( 1191
 
7.7%
) 1191
 
7.7%
2 1037
 
6.7%
, 893
 
5.8%
3 672
 
4.4%
0 564
 
3.7%
4 443
 
2.9%
5 370
 
2.4%
Other values (27) 1419
 
9.2%
Hangul
ValueCountFrequency (%)
2349
 
11.0%
2225
 
10.5%
1520
 
7.1%
1354
 
6.4%
1270
 
6.0%
1237
 
5.8%
1187
 
5.6%
1176
 
5.5%
1173
 
5.5%
1173
 
5.5%
Other values (208) 6600
31.0%
Distinct721
Distinct (%)61.5%
Missing0
Missing (%)0.0%
Memory size9.3 KiB
2023-12-12T16:29:05.515444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length9.7551195
Min length6

Characters and Unicode

Total characters11433
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique710 ?
Unique (%)60.6%

Sample

1st row051-861-6727
2nd row051-852-5927
3rd row051-503-5639
4th row051-865-7560
5th row051-863-9110
ValueCountFrequency (%)
전화번호없음 441
37.6%
051-757-0101 3
 
0.3%
051-867-7401 2
 
0.2%
051-851-6739 2
 
0.2%
051-862-1863 2
 
0.2%
051-753-5894 2
 
0.2%
051-862-9999 2
 
0.2%
051-852-3313 2
 
0.2%
051-757-2844 2
 
0.2%
051-504-9789 2
 
0.2%
Other values (711) 712
60.8%
2023-12-12T16:29:06.051592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1460
12.8%
5 1456
12.7%
0 1192
10.4%
1 1132
 
9.9%
8 815
 
7.1%
6 608
 
5.3%
7 551
 
4.8%
2 494
 
4.3%
3 447
 
3.9%
441
 
3.9%
Other values (7) 2837
24.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7327
64.1%
Other Letter 2646
 
23.1%
Dash Punctuation 1460
 
12.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1456
19.9%
0 1192
16.3%
1 1132
15.4%
8 815
11.1%
6 608
8.3%
7 551
 
7.5%
2 494
 
6.7%
3 447
 
6.1%
4 321
 
4.4%
9 311
 
4.2%
Other Letter
ValueCountFrequency (%)
441
16.7%
441
16.7%
441
16.7%
441
16.7%
441
16.7%
441
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 1460
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8787
76.9%
Hangul 2646
 
23.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1460
16.6%
5 1456
16.6%
0 1192
13.6%
1 1132
12.9%
8 815
9.3%
6 608
6.9%
7 551
 
6.3%
2 494
 
5.6%
3 447
 
5.1%
4 321
 
3.7%
Hangul
ValueCountFrequency (%)
441
16.7%
441
16.7%
441
16.7%
441
16.7%
441
16.7%
441
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8787
76.9%
Hangul 2646
 
23.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1460
16.6%
5 1456
16.6%
0 1192
13.6%
1 1132
12.9%
8 815
9.3%
6 608
6.9%
7 551
 
6.3%
2 494
 
5.6%
3 447
 
5.1%
4 321
 
3.7%
Hangul
ValueCountFrequency (%)
441
16.7%
441
16.7%
441
16.7%
441
16.7%
441
16.7%
441
16.7%

Missing values

2023-12-12T16:29:03.307676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:29:03.395100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)소재지전화
0숙박업(일반)궁락모텔부산광역시 연제구 중앙대로1120번길 17 (연산동)051-861-6727
1숙박업(일반)미진장여관부산광역시 연제구 거제시장로 24 (거제동)051-852-5927
2숙박업(일반)제일여관부산광역시 연제구 월드컵대로 217-1 (거제동)051-503-5639
3숙박업(일반)대원장여관부산광역시 연제구 거제천로230번길 98 (연산동)051-865-7560
4숙박업(일반)에그(egg)모텔부산광역시 연제구 고분로13번길 13 (연산동)051-863-9110
5숙박업(일반)토곡부산광역시 연제구 과정로 187-1 (연산동)051-759-2040
6숙박업(일반)BNB(비앤비)부산광역시 연제구 월드컵대로114번길 15 (연산동)051-866-8277
7숙박업(일반)샤이어호텔부산광역시 연제구 반송로 18-6 (연산동)051-866-4645
8숙박업(일반)오모텔부산광역시 연제구 과정로 165-2 (연산동)051-758-7583
9숙박업(일반)더 제니스 호텔부산광역시 연제구 거제천로152번길 66 (연산동)051-868-3335
업종명업소명영업소 주소(도로명)소재지전화
1162피부미용업, 네일미용업, 화장ㆍ분장 미용업네일드네쥬(NAIL DE NEJUE)부산광역시 연제구 토곡로 37, 1층 (연산동)전화번호없음
1163피부미용업, 네일미용업, 화장ㆍ분장 미용업티나뷰티부산광역시 연제구 신촌로 30, 2층 (연산동)051-852-3777
1164피부미용업, 네일미용업, 화장ㆍ분장 미용업뷰티믈리에부산광역시 연제구 거제천로124번길 16, 1층 (연산동)전화번호없음
1165피부미용업, 네일미용업, 화장ㆍ분장 미용업윤s' beauty academy부산광역시 연제구 신촌로 14, 3층 (연산동)전화번호없음
1166피부미용업, 네일미용업, 화장ㆍ분장 미용업네일. 별부산광역시 연제구 중앙천로19번길 46, 2층 (연산동)전화번호없음
1167피부미용업, 네일미용업, 화장ㆍ분장 미용업블랑코뷰티부산광역시 연제구 과정로 166, 1층 (연산동)전화번호없음
1168피부미용업, 네일미용업, 화장ㆍ분장 미용업Beauty4(뷰티4)부산광역시 연제구 중앙대로 1049, 3층 (연산동)전화번호없음
1169피부미용업, 네일미용업, 화장ㆍ분장 미용업미드나잇네일(midnight Nail)부산광역시 연제구 월드컵대로73번길 4, 1층 (연산동)전화번호없음
1170피부미용업, 네일미용업, 화장ㆍ분장 미용업뷰티빛담부산광역시 연제구 봉수로 25, 231동 2층 211호 (연산동)전화번호없음
1171피부미용업, 네일미용업, 화장ㆍ분장 미용업홍브로우 토탈뷰티샵부산광역시 연제구 월드컵대로 55, 302동 2층 201호 (연산동, 연제롯데캐슬&데시앙)전화번호없음

Duplicate rows

Most frequently occurring

업종명업소명영업소 주소(도로명)소재지전화# duplicates
0종합미용업순뷰티부산광역시 연제구 월드컵대로145번길 103, 2층 (연산동)051-851-67392