Overview

Dataset statistics

Number of variables4
Number of observations1096
Missing cells267
Missing cells (%)6.1%
Duplicate rows2
Duplicate rows (%)0.2%
Total size in memory34.4 KiB
Average record size in memory32.1 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시_연제구_공중위생업소관리현황_20200908
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15051414

Alerts

Dataset has 2 (0.2%) duplicate rowsDuplicates
소재지전화 has 267 (24.4%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:03:14.855637
Analysis finished2023-12-10 16:03:15.370975
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct21
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
일반미용업
271 
미용업
195 
세탁업
95 
피부미용업
83 
이용업
74 
Other values (16)
378 

Length

Max length23
Median length16
Mean length5.3804745
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
일반미용업 271
24.7%
미용업 195
17.8%
세탁업 95
 
8.7%
피부미용업 83
 
7.6%
이용업 74
 
6.8%
숙박업(일반) 73
 
6.7%
건물위생관리업 67
 
6.1%
네일미용업 63
 
5.7%
목욕장업 52
 
4.7%
종합미용업 30
 
2.7%
Other values (11) 93
 
8.5%

Length

2023-12-11T01:03:15.443768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반미용업 309
24.7%
미용업 261
20.8%
피부미용업 114
 
9.1%
네일미용업 104
 
8.3%
세탁업 95
 
7.6%
이용업 74
 
5.9%
숙박업(일반 73
 
5.8%
건물위생관리업 67
 
5.3%
화장ㆍ분장 66
 
5.3%
목욕장업 52
 
4.2%
Other values (2) 38
 
3.0%
Distinct1068
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
2023-12-11T01:03:15.714541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length28
Mean length5.8658759
Min length1

Characters and Unicode

Total characters6429
Distinct characters555
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1043 ?
Unique (%)95.2%

Sample

1st row궁락모텔
2nd row미진장여관
3rd row제일여관
4th row세림장
5th row1RUA(일루아)
ValueCountFrequency (%)
헤어 19
 
1.4%
주식회사 12
 
0.9%
네일 10
 
0.7%
이용원 8
 
0.6%
미용실 7
 
0.5%
연산점 6
 
0.4%
nail 6
 
0.4%
by 6
 
0.4%
hair 6
 
0.4%
에스테틱 6
 
0.4%
Other values (1204) 1285
93.7%
2023-12-11T01:03:16.116769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
306
 
4.8%
306
 
4.8%
276
 
4.3%
154
 
2.4%
141
 
2.2%
) 134
 
2.1%
( 134
 
2.1%
121
 
1.9%
102
 
1.6%
90
 
1.4%
Other values (545) 4665
72.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5172
80.4%
Uppercase Letter 340
 
5.3%
Space Separator 276
 
4.3%
Lowercase Letter 271
 
4.2%
Close Punctuation 134
 
2.1%
Open Punctuation 134
 
2.1%
Other Punctuation 47
 
0.7%
Decimal Number 42
 
0.7%
Dash Punctuation 8
 
0.1%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
306
 
5.9%
306
 
5.9%
154
 
3.0%
141
 
2.7%
121
 
2.3%
102
 
2.0%
90
 
1.7%
82
 
1.6%
81
 
1.6%
70
 
1.4%
Other values (476) 3719
71.9%
Uppercase Letter
ValueCountFrequency (%)
A 31
 
9.1%
R 27
 
7.9%
N 27
 
7.9%
I 26
 
7.6%
E 22
 
6.5%
B 22
 
6.5%
H 20
 
5.9%
L 19
 
5.6%
M 17
 
5.0%
S 16
 
4.7%
Other values (14) 113
33.2%
Lowercase Letter
ValueCountFrequency (%)
a 35
12.9%
i 33
12.2%
e 26
9.6%
o 23
8.5%
n 22
8.1%
y 18
 
6.6%
l 17
 
6.3%
h 16
 
5.9%
r 16
 
5.9%
t 12
 
4.4%
Other values (10) 53
19.6%
Other Punctuation
ValueCountFrequency (%)
& 15
31.9%
, 9
19.1%
. 8
17.0%
# 6
 
12.8%
' 3
 
6.4%
: 2
 
4.3%
· 2
 
4.3%
1
 
2.1%
% 1
 
2.1%
Decimal Number
ValueCountFrequency (%)
1 13
31.0%
2 6
14.3%
5 6
14.3%
9 4
 
9.5%
3 4
 
9.5%
0 4
 
9.5%
4 3
 
7.1%
6 1
 
2.4%
7 1
 
2.4%
Math Symbol
ValueCountFrequency (%)
< 2
50.0%
> 2
50.0%
Space Separator
ValueCountFrequency (%)
276
100.0%
Close Punctuation
ValueCountFrequency (%)
) 134
100.0%
Open Punctuation
ValueCountFrequency (%)
( 134
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5162
80.3%
Common 646
 
10.0%
Latin 611
 
9.5%
Han 10
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
306
 
5.9%
306
 
5.9%
154
 
3.0%
141
 
2.7%
121
 
2.3%
102
 
2.0%
90
 
1.7%
82
 
1.6%
81
 
1.6%
70
 
1.4%
Other values (468) 3709
71.9%
Latin
ValueCountFrequency (%)
a 35
 
5.7%
i 33
 
5.4%
A 31
 
5.1%
R 27
 
4.4%
N 27
 
4.4%
e 26
 
4.3%
I 26
 
4.3%
o 23
 
3.8%
E 22
 
3.6%
B 22
 
3.6%
Other values (34) 339
55.5%
Common
ValueCountFrequency (%)
276
42.7%
) 134
20.7%
( 134
20.7%
& 15
 
2.3%
1 13
 
2.0%
, 9
 
1.4%
. 8
 
1.2%
- 8
 
1.2%
2 6
 
0.9%
# 6
 
0.9%
Other values (15) 37
 
5.7%
Han
ValueCountFrequency (%)
3
30.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5162
80.3%
ASCII 1254
 
19.5%
CJK 10
 
0.2%
None 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
306
 
5.9%
306
 
5.9%
154
 
3.0%
141
 
2.7%
121
 
2.3%
102
 
2.0%
90
 
1.7%
82
 
1.6%
81
 
1.6%
70
 
1.4%
Other values (468) 3709
71.9%
ASCII
ValueCountFrequency (%)
276
22.0%
) 134
 
10.7%
( 134
 
10.7%
a 35
 
2.8%
i 33
 
2.6%
A 31
 
2.5%
R 27
 
2.2%
N 27
 
2.2%
e 26
 
2.1%
I 26
 
2.1%
Other values (57) 505
40.3%
CJK
ValueCountFrequency (%)
3
30.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
Distinct1050
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
2023-12-11T01:03:16.440134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length50
Mean length29.974453
Min length20

Characters and Unicode

Total characters32852
Distinct characters249
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1010 ?
Unique (%)92.2%

Sample

1st row부산광역시 연제구 중앙대로1120번길 17 (연산동)
2nd row부산광역시 연제구 거제시장로 24 (거제동)
3rd row부산광역시 연제구 월드컵대로 217-1 (거제동)
4th row부산광역시 연제구 과정로191번길 3 (연산동)
5th row부산광역시 연제구 반송로 18-14 (연산동)
ValueCountFrequency (%)
부산광역시 1096
17.3%
연제구 1096
17.3%
연산동 861
 
13.6%
1층 221
 
3.5%
거제동 173
 
2.7%
2층 91
 
1.4%
과정로 45
 
0.7%
연수로 40
 
0.6%
고분로 36
 
0.6%
월드컵대로 35
 
0.6%
Other values (846) 2648
41.8%
2023-12-11T01:03:16.929106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5246
 
16.0%
2174
 
6.6%
2057
 
6.3%
1 1447
 
4.4%
1430
 
4.4%
1235
 
3.8%
1167
 
3.6%
1114
 
3.4%
( 1107
 
3.4%
) 1106
 
3.4%
Other values (239) 14769
45.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19268
58.7%
Space Separator 5246
 
16.0%
Decimal Number 5191
 
15.8%
Open Punctuation 1107
 
3.4%
Close Punctuation 1106
 
3.4%
Other Punctuation 665
 
2.0%
Uppercase Letter 130
 
0.4%
Dash Punctuation 129
 
0.4%
Math Symbol 8
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2174
 
11.3%
2057
 
10.7%
1430
 
7.4%
1235
 
6.4%
1167
 
6.1%
1114
 
5.8%
1101
 
5.7%
1100
 
5.7%
1096
 
5.7%
1094
 
5.7%
Other values (202) 5700
29.6%
Uppercase Letter
ValueCountFrequency (%)
I 15
11.5%
S 14
10.8%
K 14
10.8%
E 13
10.0%
B 13
10.0%
A 12
9.2%
V 12
9.2%
W 12
9.2%
C 6
 
4.6%
G 5
 
3.8%
Other values (5) 14
10.8%
Decimal Number
ValueCountFrequency (%)
1 1447
27.9%
2 867
16.7%
3 629
12.1%
0 486
 
9.4%
4 398
 
7.7%
5 358
 
6.9%
8 297
 
5.7%
6 254
 
4.9%
7 239
 
4.6%
9 216
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 659
99.1%
@ 2
 
0.3%
& 2
 
0.3%
. 1
 
0.2%
/ 1
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
c 1
50.0%
e 1
50.0%
Space Separator
ValueCountFrequency (%)
5246
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1107
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1106
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 129
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19268
58.7%
Common 13452
40.9%
Latin 132
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2174
 
11.3%
2057
 
10.7%
1430
 
7.4%
1235
 
6.4%
1167
 
6.1%
1114
 
5.8%
1101
 
5.7%
1100
 
5.7%
1096
 
5.7%
1094
 
5.7%
Other values (202) 5700
29.6%
Common
ValueCountFrequency (%)
5246
39.0%
1 1447
 
10.8%
( 1107
 
8.2%
) 1106
 
8.2%
2 867
 
6.4%
, 659
 
4.9%
3 629
 
4.7%
0 486
 
3.6%
4 398
 
3.0%
5 358
 
2.7%
Other values (10) 1149
 
8.5%
Latin
ValueCountFrequency (%)
I 15
11.4%
S 14
10.6%
K 14
10.6%
E 13
9.8%
B 13
9.8%
A 12
9.1%
V 12
9.1%
W 12
9.1%
C 6
 
4.5%
G 5
 
3.8%
Other values (7) 16
12.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19268
58.7%
ASCII 13584
41.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5246
38.6%
1 1447
 
10.7%
( 1107
 
8.1%
) 1106
 
8.1%
2 867
 
6.4%
, 659
 
4.9%
3 629
 
4.6%
0 486
 
3.6%
4 398
 
2.9%
5 358
 
2.6%
Other values (27) 1281
 
9.4%
Hangul
ValueCountFrequency (%)
2174
 
11.3%
2057
 
10.7%
1430
 
7.4%
1235
 
6.4%
1167
 
6.1%
1114
 
5.8%
1101
 
5.7%
1100
 
5.7%
1096
 
5.7%
1094
 
5.7%
Other values (202) 5700
29.6%

소재지전화
Text

MISSING 

Distinct816
Distinct (%)98.4%
Missing267
Missing (%)24.4%
Memory size8.7 KiB
2023-12-11T01:03:17.224704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.021713
Min length9

Characters and Unicode

Total characters9966
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique804 ?
Unique (%)97.0%

Sample

1st row051-861-6727
2nd row051-852-5927
3rd row051-503-5639
4th row051-753-0316
5th row051-865-7560
ValueCountFrequency (%)
051-757-0101 3
 
0.4%
051-757-2844 2
 
0.2%
051-862-1863 2
 
0.2%
051-863-1515 2
 
0.2%
051-862-9999 2
 
0.2%
051-753-5894 2
 
0.2%
051-867-7401 2
 
0.2%
051-862-7345 2
 
0.2%
051-504-9789 2
 
0.2%
051-852-3313 2
 
0.2%
Other values (806) 808
97.5%
2023-12-11T01:03:17.673199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1655
16.6%
5 1631
16.4%
0 1360
13.6%
1 1289
12.9%
8 926
9.3%
6 718
7.2%
7 613
 
6.2%
2 546
 
5.5%
3 486
 
4.9%
4 378
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8311
83.4%
Dash Punctuation 1655
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1631
19.6%
0 1360
16.4%
1 1289
15.5%
8 926
11.1%
6 718
8.6%
7 613
 
7.4%
2 546
 
6.6%
3 486
 
5.8%
4 378
 
4.5%
9 364
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 1655
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9966
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1655
16.6%
5 1631
16.4%
0 1360
13.6%
1 1289
12.9%
8 926
9.3%
6 718
7.2%
7 613
 
6.2%
2 546
 
5.5%
3 486
 
4.9%
4 378
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9966
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1655
16.6%
5 1631
16.4%
0 1360
13.6%
1 1289
12.9%
8 926
9.3%
6 718
7.2%
7 613
 
6.2%
2 546
 
5.5%
3 486
 
4.9%
4 378
 
3.8%

Missing values

2023-12-11T01:03:15.265551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:03:15.339773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)소재지전화
0숙박업(일반)궁락모텔부산광역시 연제구 중앙대로1120번길 17 (연산동)051-861-6727
1숙박업(일반)미진장여관부산광역시 연제구 거제시장로 24 (거제동)051-852-5927
2숙박업(일반)제일여관부산광역시 연제구 월드컵대로 217-1 (거제동)051-503-5639
3숙박업(일반)세림장부산광역시 연제구 과정로191번길 3 (연산동)051-753-0316
4숙박업(일반)1RUA(일루아)부산광역시 연제구 반송로 18-14 (연산동)<NA>
5숙박업(일반)대원장여관부산광역시 연제구 거제천로230번길 98 (연산동)051-865-7560
6숙박업(일반)에그(egg)모텔부산광역시 연제구 고분로13번길 13 (연산동)051-863-9110
7숙박업(일반)토곡부산광역시 연제구 과정로 187-1 (연산동)051-759-2040
8숙박업(일반)BNB(비앤비)부산광역시 연제구 월드컵대로114번길 15 (연산동)051-866-8277
9숙박업(일반)샤이어호텔부산광역시 연제구 반송로 18-6 (연산동)051-866-4645
업종명업소명영업소 주소(도로명)소재지전화
1086일반미용업, 네일미용업, 화장ㆍ분장 미용업Bien poeme(빈포엠)부산광역시 연제구 안연로 33, 상가B동 103호 (연산동)<NA>
1087일반미용업, 네일미용업, 화장ㆍ분장 미용업이가자헤어비스(연산더샵)부산광역시 연제구 연수로 130, 2층 201, 202호 (연산동, 연산더샵)051-853-8324
1088일반미용업, 네일미용업, 화장ㆍ분장 미용업드라포레 연산점부산광역시 연제구 연수로 113, 1~2층 (연산동)051-867-1012
1089피부미용업, 네일미용업, 화장ㆍ분장 미용업제시속눈썹부산광역시 연제구 안연로23번길 53, 1층 (연산동)070-8108-0609
1090피부미용업, 네일미용업, 화장ㆍ분장 미용업윤네일부산광역시 연제구 해맞이로31번길 58, GIB메네스빌딩 2층 (거제동)<NA>
1091피부미용업, 네일미용업, 화장ㆍ분장 미용업네일은. 설렘부산광역시 연제구 중앙천로 7, 1층 (연산동)<NA>
1092피부미용업, 네일미용업, 화장ㆍ분장 미용업티나뷰티부산광역시 연제구 신촌로 30, 2층 (연산동)051-852-3777
1093피부미용업, 네일미용업, 화장ㆍ분장 미용업라즈뷰티부산광역시 연제구 연수로 130, 124동 101호 (연산동, 연산더샵)<NA>
1094피부미용업, 네일미용업, 화장ㆍ분장 미용업네일맑음부산광역시 연제구 연수로 204-1, 1층 (연산동)051-851-9656
1095피부미용업, 네일미용업, 화장ㆍ분장 미용업뷰티믈리에부산광역시 연제구 거제천로124번길 16, 1층 (연산동)<NA>

Duplicate rows

Most frequently occurring

업종명업소명영업소 주소(도로명)소재지전화# duplicates
0미용업정 미용실부산광역시 연제구 고분로 105 (연산동)051-864-36822
1종합미용업스피나 피부&네일부산광역시 연제구 과정로 185, 2층 (연산동)<NA>2