Overview

Dataset statistics

Number of variables5
Number of observations1141
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory44.7 KiB
Average record size in memory40.1 B

Variable types

Categorical2
Text3

Dataset

Description부산광역시_연제구_공중위생업소관리현황_20221021
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15051414

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates
기타유의사항 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 기타유의사항High correlation

Reproduction

Analysis started2023-12-10 16:03:09.254970
Analysis finished2023-12-10 16:03:09.867000
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
일반미용업
299 
미용업
181 
피부미용업
100 
세탁업
86 
네일미용업
81 
Other values (16)
394 

Length

Max length23
Median length16
Mean length5.652936
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
일반미용업 299
26.2%
미용업 181
15.9%
피부미용업 100
 
8.8%
세탁업 86
 
7.5%
네일미용업 81
 
7.1%
건물위생관리업 75
 
6.6%
이용업 67
 
5.9%
숙박업(일반) 65
 
5.7%
목욕장업 42
 
3.7%
종합미용업 32
 
2.8%
Other values (11) 113
 
9.9%

Length

2023-12-11T01:03:09.949805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반미용업 340
25.3%
미용업 266
19.8%
피부미용업 148
11.0%
네일미용업 129
 
9.6%
세탁업 86
 
6.4%
화장ㆍ분장 85
 
6.3%
건물위생관리업 75
 
5.6%
이용업 67
 
5.0%
숙박업(일반 65
 
4.8%
목욕장업 42
 
3.1%
Other values (2) 40
 
3.0%
Distinct1117
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2023-12-11T01:03:10.185015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length28
Mean length6.1139351
Min length1

Characters and Unicode

Total characters6976
Distinct characters562
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1094 ?
Unique (%)95.9%

Sample

1st row궁락모텔
2nd row미진장여관
3rd row제일여관
4th row세림장
5th row대원장여관
ValueCountFrequency (%)
헤어 25
 
1.7%
주식회사 15
 
1.0%
네일 12
 
0.8%
nail 10
 
0.7%
연산점 7
 
0.5%
에스테틱 7
 
0.5%
이용원 7
 
0.5%
by 7
 
0.5%
6
 
0.4%
호텔 6
 
0.4%
Other values (1269) 1373
93.1%
2023-12-11T01:03:10.579878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
337
 
4.8%
322
 
4.6%
311
 
4.5%
176
 
2.5%
) 143
 
2.0%
( 143
 
2.0%
139
 
2.0%
122
 
1.7%
117
 
1.7%
95
 
1.4%
Other values (552) 5071
72.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5462
78.3%
Lowercase Letter 409
 
5.9%
Uppercase Letter 367
 
5.3%
Space Separator 337
 
4.8%
Close Punctuation 143
 
2.0%
Open Punctuation 143
 
2.0%
Other Punctuation 57
 
0.8%
Decimal Number 44
 
0.6%
Dash Punctuation 9
 
0.1%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
322
 
5.9%
311
 
5.7%
176
 
3.2%
139
 
2.5%
122
 
2.2%
117
 
2.1%
95
 
1.7%
94
 
1.7%
71
 
1.3%
71
 
1.3%
Other values (482) 3944
72.2%
Uppercase Letter
ValueCountFrequency (%)
N 38
 
10.4%
A 28
 
7.6%
I 28
 
7.6%
R 27
 
7.4%
O 26
 
7.1%
H 25
 
6.8%
B 24
 
6.5%
Y 21
 
5.7%
L 19
 
5.2%
E 18
 
4.9%
Other values (14) 113
30.8%
Lowercase Letter
ValueCountFrequency (%)
a 56
13.7%
i 42
10.3%
e 41
10.0%
o 35
8.6%
n 31
 
7.6%
r 27
 
6.6%
l 27
 
6.6%
y 25
 
6.1%
h 22
 
5.4%
b 17
 
4.2%
Other values (12) 86
21.0%
Other Punctuation
ValueCountFrequency (%)
& 17
29.8%
. 11
19.3%
# 8
14.0%
' 7
12.3%
, 6
 
10.5%
: 5
 
8.8%
% 1
 
1.8%
· 1
 
1.8%
1
 
1.8%
Decimal Number
ValueCountFrequency (%)
1 13
29.5%
9 8
18.2%
5 6
13.6%
2 6
13.6%
0 4
 
9.1%
4 3
 
6.8%
3 2
 
4.5%
7 2
 
4.5%
Math Symbol
ValueCountFrequency (%)
> 2
50.0%
< 2
50.0%
Space Separator
ValueCountFrequency (%)
337
100.0%
Close Punctuation
ValueCountFrequency (%)
) 143
100.0%
Open Punctuation
ValueCountFrequency (%)
( 143
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5453
78.2%
Latin 776
 
11.1%
Common 738
 
10.6%
Han 9
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
322
 
5.9%
311
 
5.7%
176
 
3.2%
139
 
2.5%
122
 
2.2%
117
 
2.1%
95
 
1.7%
94
 
1.7%
71
 
1.3%
71
 
1.3%
Other values (475) 3935
72.2%
Latin
ValueCountFrequency (%)
a 56
 
7.2%
i 42
 
5.4%
e 41
 
5.3%
N 38
 
4.9%
o 35
 
4.5%
n 31
 
4.0%
A 28
 
3.6%
I 28
 
3.6%
r 27
 
3.5%
R 27
 
3.5%
Other values (36) 423
54.5%
Common
ValueCountFrequency (%)
337
45.7%
) 143
19.4%
( 143
19.4%
& 17
 
2.3%
1 13
 
1.8%
. 11
 
1.5%
- 9
 
1.2%
# 8
 
1.1%
9 8
 
1.1%
' 7
 
0.9%
Other values (14) 42
 
5.7%
Han
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5453
78.2%
ASCII 1512
 
21.7%
CJK 9
 
0.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
337
22.3%
) 143
 
9.5%
( 143
 
9.5%
a 56
 
3.7%
i 42
 
2.8%
e 41
 
2.7%
N 38
 
2.5%
o 35
 
2.3%
n 31
 
2.1%
A 28
 
1.9%
Other values (58) 618
40.9%
Hangul
ValueCountFrequency (%)
322
 
5.9%
311
 
5.7%
176
 
3.2%
139
 
2.5%
122
 
2.2%
117
 
2.1%
95
 
1.7%
94
 
1.7%
71
 
1.3%
71
 
1.3%
Other values (475) 3935
72.2%
CJK
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
None
ValueCountFrequency (%)
· 1
50.0%
1
50.0%
Distinct1087
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2023-12-11T01:03:10.858654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length51
Mean length30.83085
Min length9

Characters and Unicode

Total characters35178
Distinct characters255
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1042 ?
Unique (%)91.3%

Sample

1st row부산광역시 연제구 중앙대로1120번길 17 (연산동)
2nd row부산광역시 연제구 거제시장로 24 (거제동)
3rd row부산광역시 연제구 월드컵대로 217-1 (거제동)
4th row부산광역시 연제구 과정로191번길 3 (연산동)
5th row부산광역시 연제구 거제천로230번길 98 (연산동)
ValueCountFrequency (%)
부산광역시 1141
 
16.7%
연제구 1141
 
16.7%
연산동 922
 
13.5%
1층 267
 
3.9%
거제동 178
 
2.6%
2층 122
 
1.8%
과정로 49
 
0.7%
중앙대로 47
 
0.7%
연수로 41
 
0.6%
월드컵대로 40
 
0.6%
Other values (880) 2890
42.3%
2023-12-11T01:03:11.314916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5697
 
16.2%
2282
 
6.5%
2162
 
6.1%
1 1576
 
4.5%
1477
 
4.2%
1316
 
3.7%
1227
 
3.5%
1180
 
3.4%
) 1159
 
3.3%
( 1159
 
3.3%
Other values (245) 15943
45.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20472
58.2%
Space Separator 5697
 
16.2%
Decimal Number 5537
 
15.7%
Close Punctuation 1159
 
3.3%
Open Punctuation 1159
 
3.3%
Other Punctuation 826
 
2.3%
Uppercase Letter 193
 
0.5%
Dash Punctuation 124
 
0.4%
Math Symbol 8
 
< 0.1%
Lowercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2282
 
11.1%
2162
 
10.6%
1477
 
7.2%
1316
 
6.4%
1227
 
6.0%
1180
 
5.8%
1153
 
5.6%
1145
 
5.6%
1141
 
5.6%
1140
 
5.6%
Other values (209) 6249
30.5%
Uppercase Letter
ValueCountFrequency (%)
E 26
13.5%
S 26
13.5%
K 26
13.5%
I 26
13.5%
W 25
13.0%
V 25
13.0%
A 13
6.7%
B 10
 
5.2%
C 5
 
2.6%
G 3
 
1.6%
Other values (4) 8
 
4.1%
Decimal Number
ValueCountFrequency (%)
1 1576
28.5%
2 975
17.6%
3 663
12.0%
0 522
 
9.4%
4 417
 
7.5%
5 363
 
6.6%
8 270
 
4.9%
6 264
 
4.8%
7 255
 
4.6%
9 232
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 821
99.4%
& 2
 
0.2%
/ 1
 
0.1%
@ 1
 
0.1%
. 1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
c 2
66.7%
e 1
33.3%
Space Separator
ValueCountFrequency (%)
5697
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1159
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1159
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 124
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20472
58.2%
Common 14510
41.2%
Latin 196
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2282
 
11.1%
2162
 
10.6%
1477
 
7.2%
1316
 
6.4%
1227
 
6.0%
1180
 
5.8%
1153
 
5.6%
1145
 
5.6%
1141
 
5.6%
1140
 
5.6%
Other values (209) 6249
30.5%
Common
ValueCountFrequency (%)
5697
39.3%
1 1576
 
10.9%
) 1159
 
8.0%
( 1159
 
8.0%
2 975
 
6.7%
, 821
 
5.7%
3 663
 
4.6%
0 522
 
3.6%
4 417
 
2.9%
5 363
 
2.5%
Other values (10) 1158
 
8.0%
Latin
ValueCountFrequency (%)
E 26
13.3%
S 26
13.3%
K 26
13.3%
I 26
13.3%
W 25
12.8%
V 25
12.8%
A 13
6.6%
B 10
 
5.1%
C 5
 
2.6%
G 3
 
1.5%
Other values (6) 11
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20472
58.2%
ASCII 14706
41.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5697
38.7%
1 1576
 
10.7%
) 1159
 
7.9%
( 1159
 
7.9%
2 975
 
6.6%
, 821
 
5.6%
3 663
 
4.5%
0 522
 
3.5%
4 417
 
2.8%
5 363
 
2.5%
Other values (26) 1354
 
9.2%
Hangul
ValueCountFrequency (%)
2282
 
11.1%
2162
 
10.6%
1477
 
7.2%
1316
 
6.4%
1227
 
6.0%
1180
 
5.8%
1153
 
5.6%
1145
 
5.6%
1141
 
5.6%
1140
 
5.6%
Other values (209) 6249
30.5%
Distinct747
Distinct (%)65.5%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2023-12-11T01:03:11.582495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length10.004382
Min length6

Characters and Unicode

Total characters11415
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique735 ?
Unique (%)64.4%

Sample

1st row051-861-6727
2nd row051-852-5927
3rd row051-503-5639
4th row051-751-1915
5th row051-865-7560
ValueCountFrequency (%)
전화번호없음 383
33.6%
051-757-0101 3
 
0.3%
051-862-7345 2
 
0.2%
051-753-5894 2
 
0.2%
051-852-8219 2
 
0.2%
051-852-3313 2
 
0.2%
051-757-2844 2
 
0.2%
051-504-9789 2
 
0.2%
051-862-9999 2
 
0.2%
051-867-7401 2
 
0.2%
Other values (737) 739
64.8%
2023-12-11T01:03:12.002200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1516
13.3%
5 1505
13.2%
0 1241
10.9%
1 1163
10.2%
8 845
 
7.4%
6 634
 
5.6%
7 573
 
5.0%
2 512
 
4.5%
3 471
 
4.1%
383
 
3.4%
Other values (7) 2572
22.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7601
66.6%
Other Letter 2298
 
20.1%
Dash Punctuation 1516
 
13.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1505
19.8%
0 1241
16.3%
1 1163
15.3%
8 845
11.1%
6 634
8.3%
7 573
 
7.5%
2 512
 
6.7%
3 471
 
6.2%
9 331
 
4.4%
4 326
 
4.3%
Other Letter
ValueCountFrequency (%)
383
16.7%
383
16.7%
383
16.7%
383
16.7%
383
16.7%
383
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 1516
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9117
79.9%
Hangul 2298
 
20.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1516
16.6%
5 1505
16.5%
0 1241
13.6%
1 1163
12.8%
8 845
9.3%
6 634
7.0%
7 573
 
6.3%
2 512
 
5.6%
3 471
 
5.2%
9 331
 
3.6%
Hangul
ValueCountFrequency (%)
383
16.7%
383
16.7%
383
16.7%
383
16.7%
383
16.7%
383
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9117
79.9%
Hangul 2298
 
20.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1516
16.6%
5 1505
16.5%
0 1241
13.6%
1 1163
12.8%
8 845
9.3%
6 634
7.0%
7 573
 
6.3%
2 512
 
5.6%
3 471
 
5.2%
9 331
 
3.6%
Hangul
ValueCountFrequency (%)
383
16.7%
383
16.7%
383
16.7%
383
16.7%
383
16.7%
383
16.7%

기타유의사항
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
<NA>
758 
개인정보 포함
383 

Length

Max length7
Median length4
Mean length5.0070114
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 758
66.4%
개인정보 포함 383
33.6%

Length

2023-12-11T01:03:12.162996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:03:12.276849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 758
49.7%
개인정보 383
25.1%
포함 383
25.1%

Correlations

2023-12-11T01:03:12.343186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명
업종명1.000
2023-12-11T01:03:12.412767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기타유의사항업종명
기타유의사항1.0001.000
업종명1.0001.000
2023-12-11T01:03:12.488869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명기타유의사항
업종명1.0001.000
기타유의사항1.0001.000

Missing values

2023-12-11T01:03:09.728253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:03:09.827023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)소재지전화기타유의사항
0숙박업(일반)궁락모텔부산광역시 연제구 중앙대로1120번길 17 (연산동)051-861-6727<NA>
1숙박업(일반)미진장여관부산광역시 연제구 거제시장로 24 (거제동)051-852-5927<NA>
2숙박업(일반)제일여관부산광역시 연제구 월드컵대로 217-1 (거제동)051-503-5639<NA>
3숙박업(일반)세림장부산광역시 연제구 과정로191번길 3 (연산동)051-751-1915<NA>
4숙박업(일반)대원장여관부산광역시 연제구 거제천로230번길 98 (연산동)051-865-7560<NA>
5숙박업(일반)에그(egg)모텔부산광역시 연제구 고분로13번길 13 (연산동)051-863-9110<NA>
6숙박업(일반)토곡부산광역시 연제구 과정로 187-1 (연산동)051-759-2040<NA>
7숙박업(일반)BNB(비앤비)부산광역시 연제구 월드컵대로114번길 15 (연산동)051-866-8277<NA>
8숙박업(일반)샤이어호텔부산광역시 연제구 반송로 18-6 (연산동)051-866-4645<NA>
9숙박업(일반)프린스장부산광역시 연제구 반송로 17 (연산동)051-865-7595<NA>
업종명업소명영업소 주소(도로명)소재지전화기타유의사항
1131피부미용업, 네일미용업, 화장ㆍ분장 미용업제시속눈썹부산광역시 연제구 안연로23번길 53, 1층 (연산동)070-8108-0609<NA>
1132피부미용업, 네일미용업, 화장ㆍ분장 미용업아니스 속눈썹부산광역시 연제구 과정로251번길 45, 1층 103호 (연산동)전화번호없음개인정보 포함
1133피부미용업, 네일미용업, 화장ㆍ분장 미용업네일노마부산광역시 연제구 월드컵대로91번길 20, 101호 (연산동, 유림스카이)전화번호없음개인정보 포함
1134피부미용업, 네일미용업, 화장ㆍ분장 미용업티나뷰티부산광역시 연제구 신촌로 30, 2층 (연산동)051-852-3777<NA>
1135피부미용업, 네일미용업, 화장ㆍ분장 미용업라즈뷰티부산광역시 연제구 연수로 130, 124동 101호 (연산동, 연산더샵)전화번호없음개인정보 포함
1136피부미용업, 네일미용업, 화장ㆍ분장 미용업뷰티믈리에부산광역시 연제구 거제천로124번길 16, 1층 (연산동)전화번호없음개인정보 포함
1137피부미용업, 네일미용업, 화장ㆍ분장 미용업윤s' beauty academy부산광역시 연제구 신촌로 14, 3층 (연산동)전화번호없음개인정보 포함
1138피부미용업, 네일미용업, 화장ㆍ분장 미용업네일. 별부산광역시 연제구 중앙천로19번길 46, 2층 (연산동)전화번호없음개인정보 포함
1139피부미용업, 네일미용업, 화장ㆍ분장 미용업블랑코뷰티부산광역시 연제구 과정로 166, 1층 (연산동)전화번호없음개인정보 포함
1140피부미용업, 네일미용업, 화장ㆍ분장 미용업뷰티빛담부산광역시 연제구 봉수로 25, 231동 2층 211호 (연산동)전화번호없음개인정보 포함

Duplicate rows

Most frequently occurring

업종명업소명영업소 주소(도로명)소재지전화기타유의사항# duplicates
0종합미용업순뷰티부산광역시 연제구 월드컵대로145번길 103, 2층 (연산동)051-851-6739<NA>2