Overview

Dataset statistics

Number of variables3
Number of observations216
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory24.6 B

Variable types

Categorical1
Text2

Dataset

Description부산광역시 연제구 관내 체육시설업 신고 현황에 관한 데이터로 체육시설의 업종, 상호, 시설주소(도로명주소)를 제공합니다.
Author부산광역시 연제구
URLhttps://www.data.go.kr/data/15111781/fileData.do

Reproduction

Analysis started2023-12-12 23:06:39.231962
Analysis finished2023-12-12 23:06:39.663044
Duration0.43 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct8
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
체력단련장업
62 
체육도장업
54 
당구장업
45 
골프연습장업
36 
가상체험 체육시설업
10 
Other values (3)

Length

Max length10
Median length6
Mean length5.4675926
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수영장업
2nd row수영장업
3rd row체육도장업
4th row체육도장업
5th row체육도장업

Common Values

ValueCountFrequency (%)
체력단련장업 62
28.7%
체육도장업 54
25.0%
당구장업 45
20.8%
골프연습장업 36
16.7%
가상체험 체육시설업 10
 
4.6%
체육교습업 5
 
2.3%
수영장업 2
 
0.9%
무도학원업 2
 
0.9%

Length

2023-12-13T08:06:39.745840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:06:39.880319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
체력단련장업 62
27.4%
체육도장업 54
23.9%
당구장업 45
19.9%
골프연습장업 36
15.9%
가상체험 10
 
4.4%
체육시설업 10
 
4.4%
체육교습업 5
 
2.2%
수영장업 2
 
0.9%
무도학원업 2
 
0.9%

상호
Text

Distinct213
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T08:06:40.241398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length21
Mean length7.5092593
Min length1

Characters and Unicode

Total characters1622
Distinct characters305
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique211 ?
Unique (%)97.7%

Sample

1st row대영 스포렉스
2nd row더 퍼스트 수영장
3rd row송무인 동명태권도
4th row경동태권도
5th row새화랑체육도장
ValueCountFrequency (%)
당구클럽 6
 
1.9%
6
 
1.9%
당구장 4
 
1.3%
대영 3
 
1.0%
합기도 3
 
1.0%
골프 3
 
1.0%
휘트니스 3
 
1.0%
피트니스 3
 
1.0%
스포렉스 3
 
1.0%
코오롱글로벌(주)스포렉스 3
 
1.0%
Other values (261) 273
88.1%
2023-12-13T08:06:40.833558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
94
 
5.8%
88
 
5.4%
48
 
3.0%
47
 
2.9%
43
 
2.7%
42
 
2.6%
40
 
2.5%
36
 
2.2%
31
 
1.9%
30
 
1.8%
Other values (295) 1123
69.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1368
84.3%
Space Separator 94
 
5.8%
Uppercase Letter 79
 
4.9%
Lowercase Letter 31
 
1.9%
Close Punctuation 15
 
0.9%
Open Punctuation 15
 
0.9%
Other Punctuation 9
 
0.6%
Decimal Number 9
 
0.6%
Letter Number 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
88
 
6.4%
48
 
3.5%
47
 
3.4%
43
 
3.1%
42
 
3.1%
40
 
2.9%
36
 
2.6%
31
 
2.3%
30
 
2.2%
25
 
1.8%
Other values (245) 938
68.6%
Uppercase Letter
ValueCountFrequency (%)
M 10
12.7%
A 9
11.4%
G 7
8.9%
Y 6
 
7.6%
S 6
 
7.6%
B 6
 
7.6%
K 6
 
7.6%
T 5
 
6.3%
P 4
 
5.1%
F 3
 
3.8%
Other values (9) 17
21.5%
Lowercase Letter
ValueCountFrequency (%)
i 5
16.1%
s 4
12.9%
t 3
9.7%
d 2
 
6.5%
n 2
 
6.5%
a 2
 
6.5%
e 2
 
6.5%
o 2
 
6.5%
r 2
 
6.5%
l 2
 
6.5%
Other values (5) 5
16.1%
Decimal Number
ValueCountFrequency (%)
2 2
22.2%
8 1
11.1%
9 1
11.1%
0 1
11.1%
3 1
11.1%
1 1
11.1%
7 1
11.1%
4 1
11.1%
Other Punctuation
ValueCountFrequency (%)
. 6
66.7%
& 2
 
22.2%
: 1
 
11.1%
Space Separator
ValueCountFrequency (%)
94
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1368
84.3%
Common 143
 
8.8%
Latin 111
 
6.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
88
 
6.4%
48
 
3.5%
47
 
3.4%
43
 
3.1%
42
 
3.1%
40
 
2.9%
36
 
2.6%
31
 
2.3%
30
 
2.2%
25
 
1.8%
Other values (245) 938
68.6%
Latin
ValueCountFrequency (%)
M 10
 
9.0%
A 9
 
8.1%
G 7
 
6.3%
Y 6
 
5.4%
S 6
 
5.4%
B 6
 
5.4%
K 6
 
5.4%
T 5
 
4.5%
i 5
 
4.5%
s 4
 
3.6%
Other values (25) 47
42.3%
Common
ValueCountFrequency (%)
94
65.7%
) 15
 
10.5%
( 15
 
10.5%
. 6
 
4.2%
2 2
 
1.4%
& 2
 
1.4%
8 1
 
0.7%
9 1
 
0.7%
0 1
 
0.7%
3 1
 
0.7%
Other values (5) 5
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1368
84.3%
ASCII 253
 
15.6%
Number Forms 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
94
37.2%
) 15
 
5.9%
( 15
 
5.9%
M 10
 
4.0%
A 9
 
3.6%
G 7
 
2.8%
Y 6
 
2.4%
S 6
 
2.4%
B 6
 
2.4%
. 6
 
2.4%
Other values (39) 79
31.2%
Hangul
ValueCountFrequency (%)
88
 
6.4%
48
 
3.5%
47
 
3.4%
43
 
3.1%
42
 
3.1%
40
 
2.9%
36
 
2.6%
31
 
2.3%
30
 
2.2%
25
 
1.8%
Other values (245) 938
68.6%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct211
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T08:06:41.112524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length43
Mean length29.902778
Min length21

Characters and Unicode

Total characters6459
Distinct characters177
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique206 ?
Unique (%)95.4%

Sample

1st row부산광역시 연제구 연안로 25 (연산동)
2nd row부산광역시 연제구 거제천로87번길 30, 지하1층 (거제동, 연제그린타워)
3rd row부산광역시 연제구 연제로8번길 54 (연산동)
4th row부산광역시 연제구 세병로 16 (연산동)
5th row부산광역시 연제구 쌍미천로 11 (연산동)
ValueCountFrequency (%)
부산광역시 216
16.7%
연제구 216
16.7%
연산동 162
 
12.5%
거제동 56
 
4.3%
2층 27
 
2.1%
3층 25
 
1.9%
과정로 22
 
1.7%
4층 16
 
1.2%
지하1층 14
 
1.1%
25 13
 
1.0%
Other values (303) 525
40.6%
2023-12-13T08:06:41.592456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1076
 
16.7%
400
 
6.2%
388
 
6.0%
303
 
4.7%
236
 
3.7%
229
 
3.5%
219
 
3.4%
217
 
3.4%
217
 
3.4%
( 216
 
3.3%
Other values (167) 2958
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3855
59.7%
Space Separator 1076
 
16.7%
Decimal Number 897
 
13.9%
Open Punctuation 216
 
3.3%
Close Punctuation 216
 
3.3%
Other Punctuation 160
 
2.5%
Dash Punctuation 21
 
0.3%
Uppercase Letter 18
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
400
 
10.4%
388
 
10.1%
303
 
7.9%
236
 
6.1%
229
 
5.9%
219
 
5.7%
217
 
5.6%
217
 
5.6%
216
 
5.6%
216
 
5.6%
Other values (144) 1214
31.5%
Decimal Number
ValueCountFrequency (%)
1 192
21.4%
2 159
17.7%
3 141
15.7%
4 87
9.7%
0 74
 
8.2%
5 67
 
7.5%
6 53
 
5.9%
7 46
 
5.1%
8 45
 
5.0%
9 33
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
B 4
22.2%
I 3
16.7%
V 2
11.1%
W 2
11.1%
E 2
11.1%
S 2
11.1%
K 2
11.1%
G 1
 
5.6%
Space Separator
ValueCountFrequency (%)
1076
100.0%
Open Punctuation
ValueCountFrequency (%)
( 216
100.0%
Close Punctuation
ValueCountFrequency (%)
) 216
100.0%
Other Punctuation
ValueCountFrequency (%)
, 160
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3855
59.7%
Common 2586
40.0%
Latin 18
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
400
 
10.4%
388
 
10.1%
303
 
7.9%
236
 
6.1%
229
 
5.9%
219
 
5.7%
217
 
5.6%
217
 
5.6%
216
 
5.6%
216
 
5.6%
Other values (144) 1214
31.5%
Common
ValueCountFrequency (%)
1076
41.6%
( 216
 
8.4%
) 216
 
8.4%
1 192
 
7.4%
, 160
 
6.2%
2 159
 
6.1%
3 141
 
5.5%
4 87
 
3.4%
0 74
 
2.9%
5 67
 
2.6%
Other values (5) 198
 
7.7%
Latin
ValueCountFrequency (%)
B 4
22.2%
I 3
16.7%
V 2
11.1%
W 2
11.1%
E 2
11.1%
S 2
11.1%
K 2
11.1%
G 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3855
59.7%
ASCII 2604
40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1076
41.3%
( 216
 
8.3%
) 216
 
8.3%
1 192
 
7.4%
, 160
 
6.1%
2 159
 
6.1%
3 141
 
5.4%
4 87
 
3.3%
0 74
 
2.8%
5 67
 
2.6%
Other values (13) 216
 
8.3%
Hangul
ValueCountFrequency (%)
400
 
10.4%
388
 
10.1%
303
 
7.9%
236
 
6.1%
229
 
5.9%
219
 
5.7%
217
 
5.6%
217
 
5.6%
216
 
5.6%
216
 
5.6%
Other values (144) 1214
31.5%

Missing values

2023-12-13T08:06:39.536855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:06:39.628370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호시설주소(도로명)
0수영장업대영 스포렉스부산광역시 연제구 연안로 25 (연산동)
1수영장업더 퍼스트 수영장부산광역시 연제구 거제천로87번길 30, 지하1층 (거제동, 연제그린타워)
2체육도장업송무인 동명태권도부산광역시 연제구 연제로8번길 54 (연산동)
3체육도장업경동태권도부산광역시 연제구 세병로 16 (연산동)
4체육도장업새화랑체육도장부산광역시 연제구 쌍미천로 11 (연산동)
5체육도장업태극체육관부산광역시 연제구 연안로13번길 65 (연산동)
6체육도장업대세 태권도장부산광역시 연제구 거제천로 112 (연산동)
7체육도장업거성체육관부산광역시 연제구 해맞이로 23, 115동 305호 (거제동, 거제유림아시아드)
8체육도장업경원체육관부산광역시 연제구 중앙천로 38, 3층 (연산동)
9체육도장업연산체육관부산광역시 연제구 중앙천로39번길 13, 지하1층 (연산동)
업종상호시설주소(도로명)
206가상체험 체육시설업코스앤 골프스튜디오부산광역시 연제구 거제대로 274, 2층 2호 (거제동)
207가상체험 체육시설업프렌즈아카데미(연산테슬라점)부산광역시 연제구 좌수영로 290 (연산동)
208가상체험 체육시설업프렌즈아카데미 연산교차로부산광역시 연제구 반송로 33, 803호, 805호 (연산동)
209가상체험 체육시설업엑스클루시 골프아카데미부산광역시 연제구 고분로13번길 25, 405호 (연산동, 연산동 쌍용아파트)
210가상체험 체육시설업프렌즈스크린 연산교차로점부산광역시 연제구 반송로 33, 801호, 804호 (연산동)
211체육교습업지니어스 음악줄넘기센터부산광역시 연제구 과정로 207, 3층 (연산동)
212체육교습업SM SSAKA(에쓰엠싸카)부산광역시 연제구 좌수영로 295, 2층 (연산동)
213체육교습업펀펀짐부산광역시 연제구 월드컵대로 54, 4층 (연산동)
214체육교습업(주)모션스포츠부산광역시 연제구 쌍미천로 160, 행복한교회 3층 (연산동)
215체육교습업FC BS89 축구클럽부산광역시 연제구 반송로 89, 6층 (연산동)