Overview

Dataset statistics

Number of variables3
Number of observations232
Missing cells1
Missing cells (%)0.1%
Duplicate rows1
Duplicate rows (%)0.4%
Total size in memory5.6 KiB
Average record size in memory24.6 B

Variable types

Categorical1
Text2

Dataset

Description부산광역시_사상구_체육시설업현황_20230328
Author부산광역시 사상구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3078734

Alerts

Dataset has 1 (0.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-10 17:16:12.775791
Analysis finished2023-12-10 17:16:13.733908
Duration0.96 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct10
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
당구장업
68 
체육도장업
55 
체력단련장업
45 
골프연습장업
37 
가상체험 체육시설업
13 
Other values (5)
14 

Length

Max length10
Median length7
Mean length5.3534483
Min length4

Unique

Unique2 ?
Unique (%)0.9%

Sample

1st row수영장업
2nd row수영장업
3rd row체육도장업
4th row체육도장업
5th row체육도장업

Common Values

ValueCountFrequency (%)
당구장업 68
29.3%
체육도장업 55
23.7%
체력단련장업 45
19.4%
골프연습장업 37
15.9%
가상체험 체육시설업 13
 
5.6%
체육교습업 8
 
3.4%
수영장업 2
 
0.9%
종합체육시설업 2
 
0.9%
무도학원업 1
 
0.4%
인공암벽장업 1
 
0.4%

Length

2023-12-11T02:16:13.919730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:16:14.179747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
당구장업 68
27.8%
체육도장업 55
22.4%
체력단련장업 45
18.4%
골프연습장업 37
15.1%
가상체험 13
 
5.3%
체육시설업 13
 
5.3%
체육교습업 8
 
3.3%
수영장업 2
 
0.8%
종합체육시설업 2
 
0.8%
무도학원업 1
 
0.4%

상호
Text

Distinct229
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T02:16:14.692394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length7.2284483
Min length2

Characters and Unicode

Total characters1677
Distinct characters297
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique226 ?
Unique (%)97.4%

Sample

1st row블루25 워터파크 (삼락야외수영장)
2nd row엄궁스포츠센터
3rd row엄궁체육관
4th row사상태권도
5th row구학 태권도장
ValueCountFrequency (%)
당구클럽 15
 
4.5%
태권도 5
 
1.5%
당구장 4
 
1.2%
스크린골프 4
 
1.2%
휘트니스 4
 
1.2%
차오름 3
 
0.9%
gym 3
 
0.9%
태권도장 3
 
0.9%
체육관 3
 
0.9%
스포츠 3
 
0.9%
Other values (274) 285
85.8%
2023-12-11T02:16:15.492282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
100
 
6.0%
77
 
4.6%
68
 
4.1%
64
 
3.8%
54
 
3.2%
50
 
3.0%
48
 
2.9%
46
 
2.7%
44
 
2.6%
36
 
2.1%
Other values (287) 1090
65.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1440
85.9%
Space Separator 100
 
6.0%
Uppercase Letter 85
 
5.1%
Decimal Number 19
 
1.1%
Open Punctuation 11
 
0.7%
Close Punctuation 11
 
0.7%
Lowercase Letter 6
 
0.4%
Other Punctuation 4
 
0.2%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
77
 
5.3%
68
 
4.7%
64
 
4.4%
54
 
3.8%
50
 
3.5%
48
 
3.3%
46
 
3.2%
44
 
3.1%
36
 
2.5%
28
 
1.9%
Other values (244) 925
64.2%
Uppercase Letter
ValueCountFrequency (%)
S 9
 
10.6%
M 8
 
9.4%
K 7
 
8.2%
Y 6
 
7.1%
G 6
 
7.1%
P 6
 
7.1%
T 5
 
5.9%
C 5
 
5.9%
O 4
 
4.7%
E 4
 
4.7%
Other values (15) 25
29.4%
Decimal Number
ValueCountFrequency (%)
2 10
52.6%
1 3
 
15.8%
5 2
 
10.5%
6 1
 
5.3%
3 1
 
5.3%
9 1
 
5.3%
4 1
 
5.3%
Lowercase Letter
ValueCountFrequency (%)
l 2
33.3%
i 1
16.7%
e 1
16.7%
n 1
16.7%
o 1
16.7%
Other Punctuation
ValueCountFrequency (%)
& 3
75.0%
· 1
 
25.0%
Space Separator
ValueCountFrequency (%)
100
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1440
85.9%
Common 146
 
8.7%
Latin 91
 
5.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
77
 
5.3%
68
 
4.7%
64
 
4.4%
54
 
3.8%
50
 
3.5%
48
 
3.3%
46
 
3.2%
44
 
3.1%
36
 
2.5%
28
 
1.9%
Other values (244) 925
64.2%
Latin
ValueCountFrequency (%)
S 9
 
9.9%
M 8
 
8.8%
K 7
 
7.7%
Y 6
 
6.6%
G 6
 
6.6%
P 6
 
6.6%
T 5
 
5.5%
C 5
 
5.5%
O 4
 
4.4%
E 4
 
4.4%
Other values (20) 31
34.1%
Common
ValueCountFrequency (%)
100
68.5%
( 11
 
7.5%
) 11
 
7.5%
2 10
 
6.8%
& 3
 
2.1%
1 3
 
2.1%
5 2
 
1.4%
6 1
 
0.7%
3 1
 
0.7%
9 1
 
0.7%
Other values (3) 3
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1440
85.9%
ASCII 236
 
14.1%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
100
42.4%
( 11
 
4.7%
) 11
 
4.7%
2 10
 
4.2%
S 9
 
3.8%
M 8
 
3.4%
K 7
 
3.0%
Y 6
 
2.5%
G 6
 
2.5%
P 6
 
2.5%
Other values (32) 62
26.3%
Hangul
ValueCountFrequency (%)
77
 
5.3%
68
 
4.7%
64
 
4.4%
54
 
3.8%
50
 
3.5%
48
 
3.3%
46
 
3.2%
44
 
3.1%
36
 
2.5%
28
 
1.9%
Other values (244) 925
64.2%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct227
Distinct (%)98.3%
Missing1
Missing (%)0.4%
Memory size1.9 KiB
2023-12-11T02:16:16.058355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length40
Mean length31
Min length22

Characters and Unicode

Total characters7161
Distinct characters199
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique223 ?
Unique (%)96.5%

Sample

1st row부산광역시 사상구 엄궁로191번길 23 (엄궁동, 엄궁초등학교)
2nd row부산광역시 사상구 낙동대로776번길 40 (엄궁동)
3rd row부산광역시 사상구 사상로250번길 25, 2층 (괘법동, 칠성불교사)
4th row부산광역시 사상구 학감대로49번길 34, 구학마을상가 111,112호 (학장동)
5th row부산광역시 사상구 엄궁로191번길 41 (엄궁동,3층)
ValueCountFrequency (%)
사상구 232
 
16.6%
부산광역시 231
 
16.5%
주례동 43
 
3.1%
모라동 38
 
2.7%
백양대로 31
 
2.2%
괘법동 28
 
2.0%
2층 28
 
2.0%
학장동 28
 
2.0%
엄궁동 28
 
2.0%
3층 23
 
1.6%
Other values (356) 688
49.2%
2023-12-11T02:16:16.993604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1196
 
16.7%
299
 
4.2%
291
 
4.1%
274
 
3.8%
242
 
3.4%
242
 
3.4%
240
 
3.4%
) 237
 
3.3%
( 237
 
3.3%
235
 
3.3%
Other values (189) 3668
51.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4243
59.3%
Space Separator 1196
 
16.7%
Decimal Number 1026
 
14.3%
Close Punctuation 237
 
3.3%
Open Punctuation 237
 
3.3%
Other Punctuation 197
 
2.8%
Dash Punctuation 19
 
0.3%
Uppercase Letter 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
299
 
7.0%
291
 
6.9%
274
 
6.5%
242
 
5.7%
242
 
5.7%
240
 
5.7%
235
 
5.5%
234
 
5.5%
232
 
5.5%
231
 
5.4%
Other values (170) 1723
40.6%
Decimal Number
ValueCountFrequency (%)
1 168
16.4%
2 161
15.7%
3 135
13.2%
4 133
13.0%
0 106
10.3%
7 75
7.3%
9 68
6.6%
5 67
 
6.5%
6 64
 
6.2%
8 49
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
B 2
33.3%
A 2
33.3%
S 1
16.7%
L 1
16.7%
Space Separator
ValueCountFrequency (%)
1196
100.0%
Close Punctuation
ValueCountFrequency (%)
) 237
100.0%
Open Punctuation
ValueCountFrequency (%)
( 237
100.0%
Other Punctuation
ValueCountFrequency (%)
, 197
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4243
59.3%
Common 2912
40.7%
Latin 6
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
299
 
7.0%
291
 
6.9%
274
 
6.5%
242
 
5.7%
242
 
5.7%
240
 
5.7%
235
 
5.5%
234
 
5.5%
232
 
5.5%
231
 
5.4%
Other values (170) 1723
40.6%
Common
ValueCountFrequency (%)
1196
41.1%
) 237
 
8.1%
( 237
 
8.1%
, 197
 
6.8%
1 168
 
5.8%
2 161
 
5.5%
3 135
 
4.6%
4 133
 
4.6%
0 106
 
3.6%
7 75
 
2.6%
Other values (5) 267
 
9.2%
Latin
ValueCountFrequency (%)
B 2
33.3%
A 2
33.3%
S 1
16.7%
L 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4243
59.3%
ASCII 2918
40.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1196
41.0%
) 237
 
8.1%
( 237
 
8.1%
, 197
 
6.8%
1 168
 
5.8%
2 161
 
5.5%
3 135
 
4.6%
4 133
 
4.6%
0 106
 
3.6%
7 75
 
2.6%
Other values (9) 273
 
9.4%
Hangul
ValueCountFrequency (%)
299
 
7.0%
291
 
6.9%
274
 
6.5%
242
 
5.7%
242
 
5.7%
240
 
5.7%
235
 
5.5%
234
 
5.5%
232
 
5.5%
231
 
5.4%
Other values (170) 1723
40.6%

Missing values

2023-12-11T02:16:13.352169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:16:13.632673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호시설주소(도로명)
0수영장업블루25 워터파크 (삼락야외수영장)<NA>
1수영장업엄궁스포츠센터부산광역시 사상구 엄궁로191번길 23 (엄궁동, 엄궁초등학교)
2체육도장업엄궁체육관부산광역시 사상구 낙동대로776번길 40 (엄궁동)
3체육도장업사상태권도부산광역시 사상구 사상로250번길 25, 2층 (괘법동, 칠성불교사)
4체육도장업구학 태권도장부산광역시 사상구 학감대로49번길 34, 구학마을상가 111,112호 (학장동)
5체육도장업승학2체육관부산광역시 사상구 엄궁로191번길 41 (엄궁동,3층)
6체육도장업모동체육관(태권도)부산광역시 사상구 모라로192번길 32 (모라동)
7체육도장업유강체육도장부산광역시 사상구 양지로30번길 4 (주례동)
8체육도장업무덕체육관(태권도)부산광역시 사상구 백양대로 766, 3층 (덕포동)
9체육도장업감전 태권도부산광역시 사상구 새벽로167번길 36, 감전한겨례태권도 2층 (감전동)
업종상호시설주소(도로명)
222가상체험 체육시설업국제스크린골프부산광역시 사상구 백양대로 887-26, 3층 (모라동)
223체육교습업줄친구 점프점프 엄궁부산광역시 사상구 엄궁로 95, 5층 (엄궁동)
224체육교습업킨더즈 풋볼 아카데미부산광역시 사상구 가야대로 295, 4층 (주례동)
225체육교습업J 스포츠 스쿨부산광역시 사상구 엄궁로 95, 4층 (엄궁동)
226체육교습업J 스포츠 스쿨 2호점부산광역시 사상구 엄궁로 11, 1층 (엄궁동)
227체육교습업황쌤 점프파이어 줄넘기 클럽부산광역시 사상구 엄궁로 97, 한아름빌딩 4층 (엄궁동)
228체육교습업아티 스포츠 클럽부산광역시 사상구 주례로 47, 동서대학교, 산학협력관 4층 5407호 (주례동)
229체육교습업한상운 풋볼 스튜디오부산광역시 사상구 대동로 21, 티에이치빌딩, 봉팔이가 2층 (엄궁동)
230체육교습업본트 인라인클럽부산광역시 사상구 사상로 78, 사랑의교회 4층 (감전동)
231인공암벽장업포고 클라임부산광역시 사상구 사상로 415 (주)제이드엠 2층 (모라동)

Duplicate rows

Most frequently occurring

업종상호시설주소(도로명)# duplicates
0체력단련장업알통클럽부산광역시 사상구 낙동대로 751, 7층 (엄궁동, 동일메가타워)2