Overview

Dataset statistics

Number of variables3
Number of observations233
Missing cells1
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.6 KiB
Average record size in memory24.6 B

Variable types

Categorical1
Text2

Dataset

Description부산광역시강서구_체육시설업현황_20230516
Author부산광역시 강서구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3045862

Reproduction

Analysis started2023-12-10 17:42:38.301632
Analysis finished2023-12-10 17:42:39.209619
Duration0.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct9
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
체육도장업
67 
체력단련장업
44 
골프연습장업
38 
가상체험 체육시설업
36 
당구장업
23 
Other values (4)
25 

Length

Max length10
Median length7
Mean length6.0343348
Min length4

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row수영장업
2nd row수영장업
3rd row수영장업
4th row체육도장업
5th row체육도장업

Common Values

ValueCountFrequency (%)
체육도장업 67
28.8%
체력단련장업 44
18.9%
골프연습장업 38
16.3%
가상체험 체육시설업 36
15.5%
당구장업 23
 
9.9%
체육교습업 18
 
7.7%
수영장업 3
 
1.3%
종합체육시설업 3
 
1.3%
승마장업 1
 
0.4%

Length

2023-12-11T02:42:39.367345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:42:39.606807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
체육도장업 67
24.9%
체력단련장업 44
16.4%
골프연습장업 38
14.1%
가상체험 36
13.4%
체육시설업 36
13.4%
당구장업 23
 
8.6%
체육교습업 18
 
6.7%
수영장업 3
 
1.1%
종합체육시설업 3
 
1.1%
승마장업 1
 
0.4%

상호
Text

Distinct227
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T02:42:40.144608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length8.4206009
Min length3

Characters and Unicode

Total characters1962
Distinct characters303
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique221 ?
Unique (%)94.8%

Sample

1st row아이올림픽 수영장
2nd row이야 오르카 명지점
3rd rowBS스포츠인재교육원
4th row고당산체육관
5th row공항체육관
ValueCountFrequency (%)
태권도 9
 
2.4%
골프 7
 
1.8%
gym 6
 
1.6%
명지점 6
 
1.6%
부산명지점 6
 
1.6%
명지국제신도시점 5
 
1.3%
아카데미 5
 
1.3%
태권도장 5
 
1.3%
명지 5
 
1.3%
스크린골프 5
 
1.3%
Other values (284) 320
84.4%
2023-12-11T02:42:41.054770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
146
 
7.4%
77
 
3.9%
69
 
3.5%
62
 
3.2%
60
 
3.1%
46
 
2.3%
42
 
2.1%
42
 
2.1%
41
 
2.1%
38
 
1.9%
Other values (293) 1339
68.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1606
81.9%
Space Separator 146
 
7.4%
Uppercase Letter 118
 
6.0%
Lowercase Letter 44
 
2.2%
Decimal Number 21
 
1.1%
Open Punctuation 12
 
0.6%
Close Punctuation 12
 
0.6%
Other Punctuation 2
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
77
 
4.8%
69
 
4.3%
62
 
3.9%
60
 
3.7%
46
 
2.9%
42
 
2.6%
42
 
2.6%
41
 
2.6%
38
 
2.4%
34
 
2.1%
Other values (239) 1095
68.2%
Uppercase Letter
ValueCountFrequency (%)
G 15
12.7%
M 11
 
9.3%
S 9
 
7.6%
C 8
 
6.8%
D 8
 
6.8%
E 6
 
5.1%
Y 6
 
5.1%
J 6
 
5.1%
K 6
 
5.1%
A 6
 
5.1%
Other values (13) 37
31.4%
Lowercase Letter
ValueCountFrequency (%)
y 7
15.9%
m 5
11.4%
i 4
9.1%
e 4
9.1%
l 3
 
6.8%
g 3
 
6.8%
t 3
 
6.8%
n 2
 
4.5%
v 2
 
4.5%
r 2
 
4.5%
Other values (8) 9
20.5%
Decimal Number
ValueCountFrequency (%)
4 7
33.3%
1 4
19.0%
2 4
19.0%
0 3
14.3%
3 1
 
4.8%
6 1
 
4.8%
5 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
: 1
50.0%
Space Separator
ValueCountFrequency (%)
146
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1606
81.9%
Common 194
 
9.9%
Latin 162
 
8.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
77
 
4.8%
69
 
4.3%
62
 
3.9%
60
 
3.7%
46
 
2.9%
42
 
2.6%
42
 
2.6%
41
 
2.6%
38
 
2.4%
34
 
2.1%
Other values (239) 1095
68.2%
Latin
ValueCountFrequency (%)
G 15
 
9.3%
M 11
 
6.8%
S 9
 
5.6%
C 8
 
4.9%
D 8
 
4.9%
y 7
 
4.3%
E 6
 
3.7%
Y 6
 
3.7%
J 6
 
3.7%
K 6
 
3.7%
Other values (31) 80
49.4%
Common
ValueCountFrequency (%)
146
75.3%
( 12
 
6.2%
) 12
 
6.2%
4 7
 
3.6%
1 4
 
2.1%
2 4
 
2.1%
0 3
 
1.5%
. 1
 
0.5%
3 1
 
0.5%
6 1
 
0.5%
Other values (3) 3
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1606
81.9%
ASCII 356
 
18.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
146
41.0%
G 15
 
4.2%
( 12
 
3.4%
) 12
 
3.4%
M 11
 
3.1%
S 9
 
2.5%
C 8
 
2.2%
D 8
 
2.2%
4 7
 
2.0%
y 7
 
2.0%
Other values (44) 121
34.0%
Hangul
ValueCountFrequency (%)
77
 
4.8%
69
 
4.3%
62
 
3.9%
60
 
3.7%
46
 
2.9%
42
 
2.6%
42
 
2.6%
41
 
2.6%
38
 
2.4%
34
 
2.1%
Other values (239) 1095
68.2%
Distinct228
Distinct (%)98.3%
Missing1
Missing (%)0.4%
Memory size1.9 KiB
2023-12-11T02:42:41.559573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length48
Mean length38.568966
Min length23

Characters and Unicode

Total characters8948
Distinct characters206
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique224 ?
Unique (%)96.6%

Sample

1st row부산광역시 강서구 명지국제8로 293, 1층 (명지동)
2nd row부산광역시 강서구 영강길 87, 2층 (명지동)
3rd row부산광역시 강서구 영강길81번길 24, 부산학생안전체험관 1층 (명지동)
4th row부산광역시 강서구 공항로1309번길 162-1 (대저1동)
5th row부산광역시 강서구 공항로811번다길 23-20 (대저2동)
ValueCountFrequency (%)
부산광역시 232
 
14.2%
강서구 232
 
14.2%
명지동 165
 
10.1%
명지국제8로 43
 
2.6%
신호동 28
 
1.7%
명지오션시티4로 27
 
1.7%
2층 26
 
1.6%
3층 21
 
1.3%
5층 17
 
1.0%
4층 16
 
1.0%
Other values (426) 828
50.6%
2023-12-11T02:42:42.367449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1410
 
15.8%
357
 
4.0%
340
 
3.8%
1 303
 
3.4%
2 297
 
3.3%
288
 
3.2%
282
 
3.2%
, 272
 
3.0%
268
 
3.0%
0 239
 
2.7%
Other values (196) 4892
54.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4971
55.6%
Decimal Number 1728
 
19.3%
Space Separator 1410
 
15.8%
Other Punctuation 277
 
3.1%
Open Punctuation 234
 
2.6%
Close Punctuation 234
 
2.6%
Uppercase Letter 41
 
0.5%
Dash Punctuation 28
 
0.3%
Math Symbol 22
 
0.2%
Lowercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
357
 
7.2%
340
 
6.8%
288
 
5.8%
282
 
5.7%
268
 
5.4%
237
 
4.8%
236
 
4.7%
234
 
4.7%
233
 
4.7%
232
 
4.7%
Other values (161) 2264
45.5%
Uppercase Letter
ValueCountFrequency (%)
B 9
22.0%
A 7
17.1%
S 6
14.6%
R 4
9.8%
K 3
 
7.3%
F 2
 
4.9%
O 2
 
4.9%
M 2
 
4.9%
P 1
 
2.4%
E 1
 
2.4%
Other values (4) 4
9.8%
Decimal Number
ValueCountFrequency (%)
1 303
17.5%
2 297
17.2%
0 239
13.8%
3 187
10.8%
4 155
9.0%
8 142
8.2%
6 134
7.8%
5 112
 
6.5%
7 91
 
5.3%
9 68
 
3.9%
Other Punctuation
ValueCountFrequency (%)
, 272
98.2%
· 3
 
1.1%
& 2
 
0.7%
Lowercase Letter
ValueCountFrequency (%)
e 1
33.3%
m 1
33.3%
s 1
33.3%
Space Separator
ValueCountFrequency (%)
1410
100.0%
Open Punctuation
ValueCountFrequency (%)
( 234
100.0%
Close Punctuation
ValueCountFrequency (%)
) 234
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Math Symbol
ValueCountFrequency (%)
~ 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4971
55.6%
Common 3933
44.0%
Latin 44
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
357
 
7.2%
340
 
6.8%
288
 
5.8%
282
 
5.7%
268
 
5.4%
237
 
4.8%
236
 
4.7%
234
 
4.7%
233
 
4.7%
232
 
4.7%
Other values (161) 2264
45.5%
Common
ValueCountFrequency (%)
1410
35.9%
1 303
 
7.7%
2 297
 
7.6%
, 272
 
6.9%
0 239
 
6.1%
( 234
 
5.9%
) 234
 
5.9%
3 187
 
4.8%
4 155
 
3.9%
8 142
 
3.6%
Other values (8) 460
 
11.7%
Latin
ValueCountFrequency (%)
B 9
20.5%
A 7
15.9%
S 6
13.6%
R 4
9.1%
K 3
 
6.8%
F 2
 
4.5%
O 2
 
4.5%
M 2
 
4.5%
P 1
 
2.3%
E 1
 
2.3%
Other values (7) 7
15.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4971
55.6%
ASCII 3974
44.4%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1410
35.5%
1 303
 
7.6%
2 297
 
7.5%
, 272
 
6.8%
0 239
 
6.0%
( 234
 
5.9%
) 234
 
5.9%
3 187
 
4.7%
4 155
 
3.9%
8 142
 
3.6%
Other values (24) 501
 
12.6%
Hangul
ValueCountFrequency (%)
357
 
7.2%
340
 
6.8%
288
 
5.8%
282
 
5.7%
268
 
5.4%
237
 
4.8%
236
 
4.7%
234
 
4.7%
233
 
4.7%
232
 
4.7%
Other values (161) 2264
45.5%
None
ValueCountFrequency (%)
· 3
100.0%

Missing values

2023-12-11T02:42:38.877081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:42:39.135142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호시설주소(도로명)
0수영장업아이올림픽 수영장부산광역시 강서구 명지국제8로 293, 1층 (명지동)
1수영장업이야 오르카 명지점부산광역시 강서구 영강길 87, 2층 (명지동)
2수영장업BS스포츠인재교육원부산광역시 강서구 영강길81번길 24, 부산학생안전체험관 1층 (명지동)
3체육도장업고당산체육관부산광역시 강서구 공항로1309번길 162-1 (대저1동)
4체육도장업공항체육관부산광역시 강서구 공항로811번다길 23-20 (대저2동)
5체육도장업힘찬나래 태권도부산광역시 강서구 명지오션시티10로 16, 222동 246호 (명지동)
6체육도장업창조태권도부산광역시 강서구 명지오션시티4로 62, 702호 (명지동,파빌리온빌딩)
7체육도장업죽선재검도장부산광역시 강서구 명지국제8로 245, 802호 (명지동)
8체육도장업더 마스터 태권도부산광역시 강서구 명지오션시티5로 10 (명지동)
9체육도장업화인태권도부산광역시 강서구 명지오션시티4로 82 (명지동,501호)
업종상호시설주소(도로명)
223체육교습업점핑키즈줄넘기클럽(1호점)부산광역시 강서구 명지국제5로 12, 3,4층 (명지동)
224체육교습업점핑키즈줄넘기클럽(포스코점)부산광역시 강서구 명지국제2로 27, 대산골든스퀘어 7층 705,706호 (명지동)
225체육교습업한문연 야구아카데미부산광역시 강서구 대저로63번길 70-1 (대저1동)
226체육교습업주식회사 에프씨엠제이 풋볼아카데미부산광역시 강서구 영강길 140 (명지동)
227체육교습업FCMJ 축구클럽부산광역시 강서구 명지국제9로 38, 기유타워 7층 702-703호 (명지동)
228체육교습업오션아레나부산광역시 강서구 명지오션시티6로 69, 가온유치원(가온꿈놀이터) 1층 (명지동)
229체육교습업투핸즈줄넘기 오션본점부산광역시 강서구 명지오션시티4로 74, 2층 (명지동)
230체육교습업명지점프파이어부산광역시 강서구 명지국제8로 245, 명지뉴타워복합상가 7층 703호 (명지동)
231체육교습업발리 스포츠 센터부산광역시 강서구 명지오션시티13로 12-26 (명지동)
232체육교습업아이파크 풋볼 아카데미 명지부산광역시 강서구 명지국제8로 265, SM빌딩 3층 301호 (명지동)