Overview

Dataset statistics

Number of variables: 3
Number of observations: 227
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.4 KiB
Average record size in memory: 24.6 B

Variable types

Categorical: 1
Text: 2

Dataset

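Description: 부산광역시연제구_체육시설업현황_20230828 (Busan Metropolitan City, Yeonje-gu: status of sports facility businesses, 20230828)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3040712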

Reproduction

Analysis started: 2023-12-10 16:59:44.058708
Analysis finished: 2023-12-10 16:59:44.768830
Duration: 0.71 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
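
The reported duration can be checked against the two timestamps in this section. A minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of this report.
started = datetime.fromisoformat("2023-12-10 16:59:44.058708")
finished = datetime.fromisoformat("2023-12-10 16:59:44.768830")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

Rounded to two decimals, the difference agrees with the duration shown above.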

Variables

업종 (business type)
Categorical

Distinct: 8
Distinct (%): 3.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

체력단련장업: 67
체육도장업: 56
당구장업: 45
골프연습장업: 38
가상체험 체육시설업: 10
Other values (3): 11

Length

Max length: 10
Median length: 6
Mean length: 5.4757709
Min length: 4
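
These length statistics can be reproduced from the category counts listed in this report's Common Values table. The sketch below assumes that length means the number of Unicode code points, which is what Python's `len` returns for precomposed Hangul. That definition is an assumption, not something the report states.

```python
from statistics import mean, median

# Category counts copied from the Common Values table of this report.
counts = {
    "체력단련장업": 67,
    "체육도장업": 56,
    "당구장업": 45,
    "골프연습장업": 38,
    "가상체험 체육시설업": 10,
    "체육교습업": 7,
    "수영장업": 2,
    "무도학원업": 2,
}

# Expand to one length per observation (227 values in total).
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 7),
    "min": min(lengths),
}
print(length_stats)
```

Under that assumption the four values agree with the figures reported above.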

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수영장업
2nd row: 수영장업
3rd row: 체육도장업
4th row: 체육도장업
5th row: 체육도장업

Common Values

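Value | Count | Frequency (%)
체력단련장업 (fitness center) | 67 | 29.5%
체육도장업 (martial arts studio) | 56 | 24.7%
당구장업 (billiard hall) | 45 | 19.8%
골프연습장업 (golf practice range) | 38 | 16.7%
가상체험 체육시설업 (virtual-experience sports facility) | 10 | 4.4%
체육교습업 (sports instruction) | 7 | 3.1%
수영장업 (swimming pool) | 2 | 0.9%
무도학원업 (dance academy) | 2 | 0.9%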
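
The frequency column is each count divided by the 227 observations, and Distinct (%) is the number of distinct values divided by the same total. A short sketch of that arithmetic, with illustrative variable names:

```python
# Category counts copied from the Common Values table above.
counts = {
    "체력단련장업": 67,
    "체육도장업": 56,
    "당구장업": 45,
    "골프연습장업": 38,
    "가상체험 체육시설업": 10,
    "체육교습업": 7,
    "수영장업": 2,
    "무도학원업": 2,
}

total = sum(counts.values())  # number of observations

# Share of each category, rounded to one decimal as in the report.
frequency_pct = {value: round(100 * n / total, 1) for value, n in counts.items()}

# Share of distinct values among all observations.
distinct_pct = round(100 * len(counts) / total, 1)

for value, pct in frequency_pct.items():
    print(f"{value}: {pct}%")
print(f"Distinct (%): {distinct_pct}%")
```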

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
체력단련장업 | 67 | 28.3%
체육도장업 | 56 | 23.6%
당구장업 | 45 | 19.0%
골프연습장업 | 38 | 16.0%
가상체험 | 10 | 4.2%
체육시설업 | 10 | 4.2%
체육교습업 | 7 | 3.0%
수영장업 | 2 | 0.8%
무도학원업 | 2 | 0.8%
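
The percentages in this table differ from those under Common Values. They are consistent with counting whitespace-separated tokens instead of whole values, so that 가상체험 체육시설업 contributes two tokens per row and the denominator becomes 237. That tokenization is inferred from the numbers and is an assumption, not something the report states. A sketch:

```python
from collections import Counter

# Category counts copied from the Common Values table of this report.
counts = {
    "체력단련장업": 67,
    "체육도장업": 56,
    "당구장업": 45,
    "골프연습장업": 38,
    "가상체험 체육시설업": 10,
    "체육교습업": 7,
    "수영장업": 2,
    "무도학원업": 2,
}

# Split each value on whitespace and weight every token by the row count.
words = Counter()
for value, n in counts.items():
    for token in value.split():
        words[token] += n

total_tokens = sum(words.values())
word_pct = {w: round(100 * n / total_tokens, 1) for w, n in words.items()}

for w, n in words.most_common():
    print(f"{w}: {n} ({word_pct[w]}%)")
```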

상호 (business name)
Text

Distinct: 224
Distinct (%): 98.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 23
Median length: 20
Mean length: 7.4361233
Min length: 1

Characters and Unicode

Total characters: 1688
Distinct characters: 302
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
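
As an illustration of one such property, the general category of each character can be looked up with Python's standard `unicodedata` module. The sketch below counts categories over the five sample names shown for this variable. It is not the profiler's own implementation, and `unicodedata.category` returns two-letter codes (`Lo` for Other Letter, `Zs` for Space Separator) instead of the long names used in this report.

```python
import unicodedata
from collections import Counter

# The five sample values listed for this variable in the report.
names = [
    "대영 스포렉스",
    "더 퍼스트 수영장",
    "송무인 동명태권도",
    "경동태권도",
    "새화랑체육도장",
]

# Tally the Unicode general category of every character.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

for code, n in category_counts.most_common():
    print(code, n)
```

Script and block lookups are not available in `unicodedata`, so they are left out of this sketch.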

Unique

Unique: 222
Unique (%): 97.8%

Sample

1st row: 대영 스포렉스
2nd row: 더 퍼스트 수영장
3rd row: 송무인 동명태권도
4th row: 경동태권도
5th row: 새화랑체육도장

Value | Count | Frequency (%)
(not preserved) | 6 | 1.8%
당구클럽 | 6 | 1.8%
연산점 | 5 | 1.5%
당구장 | 4 | 1.2%
피트니스 | 4 | 1.2%
스포렉스 | 3 | 0.9%
휘트니스 | 3 | 0.9%
골프 | 3 | 0.9%
합기도 | 3 | 0.9%
대영 | 3 | 0.9%
Other values (274) | 292 | 88.0%

Most occurring characters

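Value | Count | Frequency (%)
(space) | 105 | 6.2%
(not preserved) | 92 | 5.5%
(not preserved) | 48 | 2.8%
(not preserved) | 48 | 2.8%
(not preserved) | 43 | 2.5%
(not preserved) | 42 | 2.5%
(not preserved) | 42 | 2.5%
(not preserved) | 37 | 2.2%
(not preserved) | 35 | 2.1%
(not preserved) | 29 | 1.7%
Other values (292) | 1167 | 69.1%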

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1439 | 85.2%
Space Separator | 105 | 6.2%
Uppercase Letter | 81 | 4.8%
Lowercase Letter | 15 | 0.9%
Close Punctuation | 14 | 0.8%
Open Punctuation | 14 | 0.8%
Other Punctuation | 9 | 0.5%
Decimal Number | 9 | 0.5%
Letter Number | 1 | 0.1%
Dash Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 92 | 6.4%
(not preserved) | 48 | 3.3%
(not preserved) | 48 | 3.3%
(not preserved) | 43 | 3.0%
(not preserved) | 42 | 2.9%
(not preserved) | 42 | 2.9%
(not preserved) | 37 | 2.6%
(not preserved) | 35 | 2.4%
(not preserved) | 29 | 2.0%
(not preserved) | 26 | 1.8%
Other values (247) | 997 | 69.3%

Uppercase Letter
Value | Count | Frequency (%)
M | 10 | 12.3%
A | 10 | 12.3%
T | 6 | 7.4%
S | 6 | 7.4%
K | 6 | 7.4%
B | 6 | 7.4%
G | 6 | 7.4%
Y | 5 | 6.2%
P | 5 | 6.2%
N | 3 | 3.7%
Other values (9) | 18 | 22.2%

Lowercase Letter
Value | Count | Frequency (%)
i | 3 | 20.0%
r | 2 | 13.3%
a | 2 | 13.3%
l | 2 | 13.3%
b | 1 | 6.7%
s | 1 | 6.7%
d | 1 | 6.7%
t | 1 | 6.7%
f | 1 | 6.7%
k | 1 | 6.7%

Decimal Number
Value | Count | Frequency (%)
2 | 2 | 22.2%
8 | 1 | 11.1%
1 | 1 | 11.1%
9 | 1 | 11.1%
0 | 1 | 11.1%
3 | 1 | 11.1%
7 | 1 | 11.1%
4 | 1 | 11.1%

Other Punctuation
Value | Count | Frequency (%)
. | 6 | 66.7%
& | 2 | 22.2%
: | 1 | 11.1%

Space Separator
Value | Count | Frequency (%)
(space) | 105 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 14 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 14 | 100.0%

Letter Number
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1439 | 85.2%
Common | 152 | 9.0%
Latin | 97 | 5.7%

Most frequent character per script

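Hangul
Value | Count | Frequency (%)
(not preserved) | 92 | 6.4%
(not preserved) | 48 | 3.3%
(not preserved) | 48 | 3.3%
(not preserved) | 43 | 3.0%
(not preserved) | 42 | 2.9%
(not preserved) | 42 | 2.9%
(not preserved) | 37 | 2.6%
(not preserved) | 35 | 2.4%
(not preserved) | 29 | 2.0%
(not preserved) | 26 | 1.8%
Other values (247) | 997 | 69.3%

Latin
Value | Count | Frequency (%)
M | 10 | 10.3%
A | 10 | 10.3%
T | 6 | 6.2%
S | 6 | 6.2%
K | 6 | 6.2%
B | 6 | 6.2%
G | 6 | 6.2%
Y | 5 | 5.2%
P | 5 | 5.2%
i | 3 | 3.1%
Other values (20) | 34 | 35.1%

Common
Value | Count | Frequency (%)
(space) | 105 | 69.1%
) | 14 | 9.2%
( | 14 | 9.2%
. | 6 | 3.9%
2 | 2 | 1.3%
& | 2 | 1.3%
8 | 1 | 0.7%
1 | 1 | 0.7%
9 | 1 | 0.7%
0 | 1 | 0.7%
Other values (5) | 5 | 3.3%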

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1439 | 85.2%
ASCII | 248 | 14.7%
Number Forms | 1 | 0.1%

Most frequent character per block

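ASCII
Value | Count | Frequency (%)
(space) | 105 | 42.3%
) | 14 | 5.6%
( | 14 | 5.6%
M | 10 | 4.0%
A | 10 | 4.0%
T | 6 | 2.4%
S | 6 | 2.4%
. | 6 | 2.4%
K | 6 | 2.4%
B | 6 | 2.4%
Other values (34) | 65 | 26.2%

Hangul
Value | Count | Frequency (%)
(not preserved) | 92 | 6.4%
(not preserved) | 48 | 3.3%
(not preserved) | 48 | 3.3%
(not preserved) | 43 | 3.0%
(not preserved) | 42 | 2.9%
(not preserved) | 42 | 2.9%
(not preserved) | 37 | 2.6%
(not preserved) | 35 | 2.4%
(not preserved) | 29 | 2.0%
(not preserved) | 26 | 1.8%
Other values (247) | 997 | 69.3%

Number Forms
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%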

시설주소(도로명) (facility address, road name)
Text

Distinct: 222
Distinct (%): 97.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 49
Median length: 43
Mean length: 29.903084
Min length: 21

Characters and Unicode

Total characters: 6788
Distinct characters: 181
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 217
Unique (%): 95.6%

Sample

1st row: 부산광역시 연제구 연안로 25 (연산동)
2nd row: 부산광역시 연제구 거제천로87번길 30, 지하1층 (거제동, 연제그린타워)
3rd row: 부산광역시 연제구 연제로8번길 54 (연산동)
4th row: 부산광역시 연제구 세병로 16 (연산동)
5th row: 부산광역시 연제구 쌍미천로 11 (연산동)

Value | Count | Frequency (%)
부산광역시 | 227 | 16.7%
연제구 | 227 | 16.7%
연산동 | 172 | 12.6%
거제동 | 57 | 4.2%
2층 | 29 | 2.1%
과정로 | 26 | 1.9%
3층 | 26 | 1.9%
4층 | 17 | 1.2%
중앙대로 | 15 | 1.1%
지하1층 | 15 | 1.1%
Other values (320) | 549 | 40.4%

Most occurring characters

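Value | Count | Frequency (%)
(space) | 1133 | 16.7%
(not preserved) | 422 | 6.2%
(not preserved) | 409 | 6.0%
(not preserved) | 316 | 4.7%
(not preserved) | 246 | 3.6%
(not preserved) | 242 | 3.6%
(not preserved) | 230 | 3.4%
(not preserved) | 228 | 3.4%
(not preserved) | 228 | 3.4%
(not preserved) | 227 | 3.3%
Other values (171) | 3107 | 45.8%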

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4046 | 59.6%
Space Separator | 1133 | 16.7%
Decimal Number | 944 | 13.9%
Close Punctuation | 227 | 3.3%
Open Punctuation | 227 | 3.3%
Other Punctuation | 172 | 2.5%
Dash Punctuation | 20 | 0.3%
Uppercase Letter | 19 | 0.3%

Most frequent character per category

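Other Letter
Value | Count | Frequency (%)
(not preserved) | 422 | 10.4%
(not preserved) | 409 | 10.1%
(not preserved) | 316 | 7.8%
(not preserved) | 246 | 6.1%
(not preserved) | 242 | 6.0%
(not preserved) | 230 | 5.7%
(not preserved) | 228 | 5.6%
(not preserved) | 228 | 5.6%
(not preserved) | 227 | 5.6%
(not preserved) | 227 | 5.6%
Other values (147) | 1271 | 31.4%

Decimal Number
Value | Count | Frequency (%)
1 | 206 | 21.8%
2 | 161 | 17.1%
3 | 147 | 15.6%
4 | 92 | 9.7%
0 | 80 | 8.5%
5 | 73 | 7.7%
6 | 55 | 5.8%
7 | 49 | 5.2%
8 | 46 | 4.9%
9 | 35 | 3.7%

Uppercase Letter
Value | Count | Frequency (%)
B | 4 | 21.1%
I | 3 | 15.8%
E | 2 | 10.5%
S | 2 | 10.5%
K | 2 | 10.5%
V | 2 | 10.5%
W | 2 | 10.5%
G | 1 | 5.3%
A | 1 | 5.3%

Space Separator
Value | Count | Frequency (%)
(space) | 1133 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 227 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 227 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 172 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 20 | 100.0%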

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4046 | 59.6%
Common | 2723 | 40.1%
Latin | 19 | 0.3%

Most frequent character per script

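Hangul
Value | Count | Frequency (%)
(not preserved) | 422 | 10.4%
(not preserved) | 409 | 10.1%
(not preserved) | 316 | 7.8%
(not preserved) | 246 | 6.1%
(not preserved) | 242 | 6.0%
(not preserved) | 230 | 5.7%
(not preserved) | 228 | 5.6%
(not preserved) | 228 | 5.6%
(not preserved) | 227 | 5.6%
(not preserved) | 227 | 5.6%
Other values (147) | 1271 | 31.4%

Common
Value | Count | Frequency (%)
(space) | 1133 | 41.6%
) | 227 | 8.3%
( | 227 | 8.3%
1 | 206 | 7.6%
, | 172 | 6.3%
2 | 161 | 5.9%
3 | 147 | 5.4%
4 | 92 | 3.4%
0 | 80 | 2.9%
5 | 73 | 2.7%
Other values (5) | 205 | 7.5%

Latin
Value | Count | Frequency (%)
B | 4 | 21.1%
I | 3 | 15.8%
E | 2 | 10.5%
S | 2 | 10.5%
K | 2 | 10.5%
V | 2 | 10.5%
W | 2 | 10.5%
G | 1 | 5.3%
A | 1 | 5.3%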

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4046 | 59.6%
ASCII | 2742 | 40.4%

Most frequent character per block

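ASCII
Value | Count | Frequency (%)
(space) | 1133 | 41.3%
) | 227 | 8.3%
( | 227 | 8.3%
1 | 206 | 7.5%
, | 172 | 6.3%
2 | 161 | 5.9%
3 | 147 | 5.4%
4 | 92 | 3.4%
0 | 80 | 2.9%
5 | 73 | 2.7%
Other values (14) | 224 | 8.2%

Hangul
Value | Count | Frequency (%)
(not preserved) | 422 | 10.4%
(not preserved) | 409 | 10.1%
(not preserved) | 316 | 7.8%
(not preserved) | 246 | 6.1%
(not preserved) | 242 | 6.0%
(not preserved) | 230 | 5.7%
(not preserved) | 228 | 5.6%
(not preserved) | 228 | 5.6%
(not preserved) | 227 | 5.6%
(not preserved) | 227 | 5.6%
Other values (147) | 1271 | 31.4%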

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 업종 | 상호 | 시설주소(도로명)
0 | 수영장업 | 대영 스포렉스 | 부산광역시 연제구 연안로 25 (연산동)
1 | 수영장업 | 더 퍼스트 수영장 | 부산광역시 연제구 거제천로87번길 30, 지하1층 (거제동, 연제그린타워)
2 | 체육도장업 | 송무인 동명태권도 | 부산광역시 연제구 연제로8번길 54 (연산동)
3 | 체육도장업 | 경동태권도 | 부산광역시 연제구 세병로 16 (연산동)
4 | 체육도장업 | 새화랑체육도장 | 부산광역시 연제구 쌍미천로 11 (연산동)
5 | 체육도장업 | 태극체육관 | 부산광역시 연제구 연안로13번길 65 (연산동)
6 | 체육도장업 | 대세 태권도장 | 부산광역시 연제구 거제천로 112 (연산동)
7 | 체육도장업 | 거성체육관 | 부산광역시 연제구 해맞이로 23, 115동 305호 (거제동, 거제유림아시아드)
8 | 체육도장업 | 경원체육관 | 부산광역시 연제구 중앙천로 38, 3층 (연산동)
9 | 체육도장업 | 연산체육관 | 부산광역시 연제구 중앙천로39번길 13, 지하1층 (연산동)

Last rows

Index | 업종 | 상호 | 시설주소(도로명)
217 | 가상체험 체육시설업 | 프렌즈아카데미 연산교차로 | 부산광역시 연제구 반송로 33, 803호, 805호 (연산동)
218 | 가상체험 체육시설업 | 엑스클루시 골프아카데미 | 부산광역시 연제구 고분로13번길 25, 405호 (연산동, 연산동 쌍용아파트)
219 | 가상체험 체육시설업 | 프렌즈스크린 연산교차로점 | 부산광역시 연제구 반송로 33, 801호, 804호 (연산동)
220 | 체육교습업 | 지니어스 음악줄넘기센터 | 부산광역시 연제구 과정로 207, 3층 (연산동)
221 | 체육교습업 | SM SSAKA(에쓰엠싸카) | 부산광역시 연제구 좌수영로 295, 2층 (연산동)
222 | 체육교습업 | 루키즈 | 부산광역시 연제구 월드컵대로 54, 4층 (연산동)
223 | 체육교습업 | (주)모션스포츠 | 부산광역시 연제구 쌍미천로 160, 행복한교회 3층 (연산동)
224 | 체육교습업 | FC BS89 축구클럽 | 부산광역시 연제구 반송로 89, 6층 (연산동)
225 | 체육교습업 | 지니어스 음악줄넘기 연일점 | 부산광역시 연제구 쌍미천로 106, 2층 (연산동)
226 | 체육교습업 | 더퍼스트 FC | 부산광역시 연제구 과정로 314, A동 4층 (연산동)