Overview

Dataset statistics

Number of variables4
Number of observations233
Missing cells84
Missing cells (%)9.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.4 KiB
Average record size in memory32.6 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시남구_체육시설업현황_20200701
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15055499

Alerts

전화번호 has 84 (36.1%) missing valuesMissing

Reproduction

Analysis started2023-12-10 17:02:12.337338
Analysis finished2023-12-10 17:02:12.992861
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct7
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
체육도장업
73 
체력단련장업
64 
당구장업
62 
골프연습장업
28 
수영장업
 
4
Other values (2)
 
2

Length

Max length10
Median length6
Mean length5.1287554
Min length4

Unique

Unique2 ?
Unique (%)0.9%

Sample

1st row수영장업
2nd row수영장업
3rd row수영장업
4th row수영장업
5th row체육도장업

Common Values

ValueCountFrequency (%)
체육도장업 73
31.3%
체력단련장업 64
27.5%
당구장업 62
26.6%
골프연습장업 28
 
12.0%
수영장업 4
 
1.7%
빙상장업 1
 
0.4%
가상체험 체육시설업 1
 
0.4%

Length

2023-12-11T02:02:13.089895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:02:13.279587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
체육도장업 73
31.2%
체력단련장업 64
27.4%
당구장업 62
26.5%
골프연습장업 28
 
12.0%
수영장업 4
 
1.7%
빙상장업 1
 
0.4%
가상체험 1
 
0.4%
체육시설업 1
 
0.4%

상호
Text

Distinct230
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T02:02:13.651806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length20
Mean length7.4248927
Min length2

Characters and Unicode

Total characters1730
Distinct characters291
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique227 ?
Unique (%)97.4%

Sample

1st row용호레포츠수영장
2nd row대호 키즈수영장
3rd row주식회사 센츄리 스포렉스
4th row키즈올림픽 수영장
5th row청운체육관
ValueCountFrequency (%)
당구클럽 22
 
6.0%
당구장 14
 
3.8%
휘트니스 10
 
2.7%
태권도 9
 
2.5%
4
 
1.1%
스크린골프 3
 
0.8%
3
 
0.8%
주)동진스포렉스 2
 
0.5%
태권도장 2
 
0.5%
s 2
 
0.5%
Other values (283) 296
80.7%
2023-12-11T02:02:14.284044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
134
 
7.7%
79
 
4.6%
61
 
3.5%
60
 
3.5%
57
 
3.3%
53
 
3.1%
43
 
2.5%
42
 
2.4%
38
 
2.2%
37
 
2.1%
Other values (281) 1126
65.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1422
82.2%
Space Separator 134
 
7.7%
Uppercase Letter 110
 
6.4%
Lowercase Letter 17
 
1.0%
Open Punctuation 14
 
0.8%
Close Punctuation 14
 
0.8%
Decimal Number 14
 
0.8%
Other Punctuation 3
 
0.2%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
5.6%
61
 
4.3%
60
 
4.2%
57
 
4.0%
53
 
3.7%
43
 
3.0%
42
 
3.0%
38
 
2.7%
37
 
2.6%
33
 
2.3%
Other values (239) 919
64.6%
Uppercase Letter
ValueCountFrequency (%)
M 12
 
10.9%
I 11
 
10.0%
A 9
 
8.2%
G 7
 
6.4%
N 7
 
6.4%
P 6
 
5.5%
L 6
 
5.5%
T 5
 
4.5%
F 5
 
4.5%
B 5
 
4.5%
Other values (12) 37
33.6%
Lowercase Letter
ValueCountFrequency (%)
n 4
23.5%
o 3
17.6%
t 2
11.8%
i 2
11.8%
s 2
11.8%
e 2
11.8%
c 1
 
5.9%
f 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
2 6
42.9%
4 2
 
14.3%
7 2
 
14.3%
0 2
 
14.3%
3 1
 
7.1%
1 1
 
7.1%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
& 1
33.3%
Space Separator
ValueCountFrequency (%)
134
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1422
82.2%
Common 181
 
10.5%
Latin 127
 
7.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
5.6%
61
 
4.3%
60
 
4.2%
57
 
4.0%
53
 
3.7%
43
 
3.0%
42
 
3.0%
38
 
2.7%
37
 
2.6%
33
 
2.3%
Other values (239) 919
64.6%
Latin
ValueCountFrequency (%)
M 12
 
9.4%
I 11
 
8.7%
A 9
 
7.1%
G 7
 
5.5%
N 7
 
5.5%
P 6
 
4.7%
L 6
 
4.7%
T 5
 
3.9%
F 5
 
3.9%
B 5
 
3.9%
Other values (20) 54
42.5%
Common
ValueCountFrequency (%)
134
74.0%
( 14
 
7.7%
) 14
 
7.7%
2 6
 
3.3%
. 2
 
1.1%
4 2
 
1.1%
7 2
 
1.1%
0 2
 
1.1%
- 2
 
1.1%
3 1
 
0.6%
Other values (2) 2
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1422
82.2%
ASCII 308
 
17.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
134
43.5%
( 14
 
4.5%
) 14
 
4.5%
M 12
 
3.9%
I 11
 
3.6%
A 9
 
2.9%
G 7
 
2.3%
N 7
 
2.3%
P 6
 
1.9%
L 6
 
1.9%
Other values (32) 88
28.6%
Hangul
ValueCountFrequency (%)
79
 
5.6%
61
 
4.3%
60
 
4.2%
57
 
4.0%
53
 
3.7%
43
 
3.0%
42
 
3.0%
38
 
2.7%
37
 
2.6%
33
 
2.3%
Other values (239) 919
64.6%
Distinct231
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T02:02:15.051589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length74
Median length49
Mean length31.918455
Min length19

Characters and Unicode

Total characters7437
Distinct characters201
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique229 ?
Unique (%)98.3%

Sample

1st row부산광역시 남구 동명로152번길 27 (용호동)
2nd row부산광역시 남구 석포로 119, 지하1층 (대연동)
3rd row부산광역시 남구 수영로 312, 지하2층 14호 (대연동, 센츄리빌딩)
4th row부산광역시 남구 용호로 132, 지하1층 (용호동)
5th row부산광역시 남구 우암로 40-1, 2층 (감만동)
ValueCountFrequency (%)
부산광역시 233
 
15.4%
남구 233
 
15.4%
대연동 113
 
7.5%
용호동 57
 
3.8%
3층 44
 
2.9%
2층 34
 
2.3%
문현동 33
 
2.2%
수영로 33
 
2.2%
4층 20
 
1.3%
감만동 18
 
1.2%
Other values (359) 692
45.8%
2023-12-11T02:02:15.790614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1280
 
17.2%
293
 
3.9%
1 287
 
3.9%
, 287
 
3.9%
247
 
3.3%
240
 
3.2%
239
 
3.2%
238
 
3.2%
236
 
3.2%
234
 
3.1%
Other values (191) 3856
51.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4134
55.6%
Space Separator 1280
 
17.2%
Decimal Number 1210
 
16.3%
Other Punctuation 287
 
3.9%
Open Punctuation 231
 
3.1%
Close Punctuation 231
 
3.1%
Uppercase Letter 36
 
0.5%
Dash Punctuation 27
 
0.4%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
293
 
7.1%
247
 
6.0%
240
 
5.8%
239
 
5.8%
238
 
5.8%
236
 
5.7%
234
 
5.7%
233
 
5.6%
233
 
5.6%
162
 
3.9%
Other values (162) 1779
43.0%
Uppercase Letter
ValueCountFrequency (%)
B 15
41.7%
A 6
 
16.7%
S 3
 
8.3%
I 2
 
5.6%
P 2
 
5.6%
L 1
 
2.8%
G 1
 
2.8%
K 1
 
2.8%
V 1
 
2.8%
Z 1
 
2.8%
Other values (3) 3
 
8.3%
Decimal Number
ValueCountFrequency (%)
1 287
23.7%
2 191
15.8%
3 163
13.5%
0 114
 
9.4%
4 109
 
9.0%
5 90
 
7.4%
6 83
 
6.9%
9 61
 
5.0%
8 58
 
4.8%
7 54
 
4.5%
Space Separator
ValueCountFrequency (%)
1280
100.0%
Other Punctuation
ValueCountFrequency (%)
, 287
100.0%
Open Punctuation
ValueCountFrequency (%)
( 231
100.0%
Close Punctuation
ValueCountFrequency (%)
) 231
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4134
55.6%
Common 3266
43.9%
Latin 37
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
293
 
7.1%
247
 
6.0%
240
 
5.8%
239
 
5.8%
238
 
5.8%
236
 
5.7%
234
 
5.7%
233
 
5.6%
233
 
5.6%
162
 
3.9%
Other values (162) 1779
43.0%
Common
ValueCountFrequency (%)
1280
39.2%
1 287
 
8.8%
, 287
 
8.8%
( 231
 
7.1%
) 231
 
7.1%
2 191
 
5.8%
3 163
 
5.0%
0 114
 
3.5%
4 109
 
3.3%
5 90
 
2.8%
Other values (5) 283
 
8.7%
Latin
ValueCountFrequency (%)
B 15
40.5%
A 6
 
16.2%
S 3
 
8.1%
I 2
 
5.4%
P 2
 
5.4%
L 1
 
2.7%
G 1
 
2.7%
K 1
 
2.7%
V 1
 
2.7%
Z 1
 
2.7%
Other values (4) 4
 
10.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4134
55.6%
ASCII 3303
44.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1280
38.8%
1 287
 
8.7%
, 287
 
8.7%
( 231
 
7.0%
) 231
 
7.0%
2 191
 
5.8%
3 163
 
4.9%
0 114
 
3.5%
4 109
 
3.3%
5 90
 
2.7%
Other values (19) 320
 
9.7%
Hangul
ValueCountFrequency (%)
293
 
7.1%
247
 
6.0%
240
 
5.8%
239
 
5.8%
238
 
5.8%
236
 
5.7%
234
 
5.7%
233
 
5.6%
233
 
5.6%
162
 
3.9%
Other values (162) 1779
43.0%

전화번호
Text

MISSING 

Distinct149
Distinct (%)100.0%
Missing84
Missing (%)36.1%
Memory size1.9 KiB
2023-12-11T02:02:16.311999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length8
Mean length8.9194631
Min length8

Characters and Unicode

Total characters1329
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique149 ?
Unique (%)100.0%

Sample

1st row627-7373
2nd row611-6227
3rd row610-1111
4th row626-7783
5th row635-8961
ValueCountFrequency (%)
612-0070 1
 
0.7%
051-622-2868 1
 
0.7%
623-3119 1
 
0.7%
622-9679 1
 
0.7%
633-2147 1
 
0.7%
051-637-9682 1
 
0.7%
051-623-9696 1
 
0.7%
051-611-0090 1
 
0.7%
051-464-2088 1
 
0.7%
051-633-6653 1
 
0.7%
Other values (139) 139
93.3%
2023-12-11T02:02:17.081774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 208
15.7%
- 183
13.8%
2 156
11.7%
1 135
10.2%
0 130
9.8%
3 110
8.3%
7 97
7.3%
5 97
7.3%
8 76
 
5.7%
9 72
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1146
86.2%
Dash Punctuation 183
 
13.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 208
18.2%
2 156
13.6%
1 135
11.8%
0 130
11.3%
3 110
9.6%
7 97
8.5%
5 97
8.5%
8 76
 
6.6%
9 72
 
6.3%
4 65
 
5.7%
Dash Punctuation
ValueCountFrequency (%)
- 183
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1329
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
6 208
15.7%
- 183
13.8%
2 156
11.7%
1 135
10.2%
0 130
9.8%
3 110
8.3%
7 97
7.3%
5 97
7.3%
8 76
 
5.7%
9 72
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1329
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 208
15.7%
- 183
13.8%
2 156
11.7%
1 135
10.2%
0 130
9.8%
3 110
8.3%
7 97
7.3%
5 97
7.3%
8 76
 
5.7%
9 72
 
5.4%

Missing values

2023-12-11T02:02:12.819776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:02:12.946800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호도로명주소전화번호
0수영장업용호레포츠수영장부산광역시 남구 동명로152번길 27 (용호동)627-7373
1수영장업대호 키즈수영장부산광역시 남구 석포로 119, 지하1층 (대연동)611-6227
2수영장업주식회사 센츄리 스포렉스부산광역시 남구 수영로 312, 지하2층 14호 (대연동, 센츄리빌딩)610-1111
3수영장업키즈올림픽 수영장부산광역시 남구 용호로 132, 지하1층 (용호동)626-7783
4체육도장업청운체육관부산광역시 남구 우암로 40-1, 2층 (감만동)635-8961
5체육도장업동양복싱체육관부산광역시 남구 수영로 190, 5층 (대연동)635-1591
6체육도장업국가대표태권스쿨부산광역시 남구 유엔로120번길 32 (대연동)626-8240
7체육도장업자의누리감만도장부산광역시 남구 홍곡로 18 (감만동)636-0641
8체육도장업화신태권도부산광역시 남구 고동골로78번길 84, 2층 (문현동)631-7585
9체육도장업남부태권도장부산광역시 남구 신정번영로 7, 2층 (대연동)634-2362
업종상호도로명주소전화번호
223당구장업판테라 당구클럽부산광역시 남구 무민사로 13, 지하1층 B101호 (감만동, 판테라오피스텔)635-3023
224당구장업땡큐 당구클럽부산광역시 남구 동명로 136, 3층 (용호동)<NA>
225당구장업일레븐 당구장부산광역시 남구 우암로 75-1, 3층 (감만동)<NA>
226당구장업문현당구클럽부산광역시 남구 수영로 33, 지하1층 (문현동)<NA>
227당구장업메트로 당구클럽부산광역시 남구 용호로 64, 중앙해수월드 지하1층 (용호동)<NA>
228당구장업스타당구장부산광역시 남구 석포로 137, 3층 (대연동)<NA>
229당구장업봉당구클럽부산광역시 남구 용소로13번길 30, 대명빌딩 3층 (대연동)612-1123
230당구장업그린당구클럽(GREEN BILLIARDS CLUB)부산광역시 남구 황령대로319번가길 190-6, 대우그린아파트 (대연동)<NA>
231빙상장업부산 스노우파크부산광역시 남구 분포로 145, 지하층 B105호 (용호동, 더블유)628-5200
232가상체험 체육시설업AVANI GOLF(아바니 실내골프연습장)부산광역시 남구 전포대로 133, 아바니센트럴부산 6층 6층 (문현동)051-791-5890