Overview

Dataset statistics

Number of variables5
Number of observations261
Missing cells124
Missing cells (%)9.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.3 KiB
Average record size in memory40.5 B

Variable types

Categorical2
Text3

Dataset

Description부산광역시북구체육시설현황_20230921
Author부산광역시 북구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3069306

Alerts

공영민영구분 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 공영민영구분High correlation
공영민영구분 is highly imbalanced (90.9%)Imbalance
시설전화번호 has 124 (47.5%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:03:00.884744
Analysis finished2023-12-10 16:03:01.356524
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
체육도장업
84 
체력단련장업
59 
당구장업
51 
골프연습장업
28 
가상체험 체육시설업
15 
Other values (6)
24 

Length

Max length10
Median length7
Mean length5.4252874
Min length3

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row체육관
2nd row체육관
3rd row체육관
4th row수영장업
5th row체육도장업

Common Values

ValueCountFrequency (%)
체육도장업 84
32.2%
체력단련장업 59
22.6%
당구장업 51
19.5%
골프연습장업 28
 
10.7%
가상체험 체육시설업 15
 
5.7%
체육교습업 11
 
4.2%
무도학원업 4
 
1.5%
체육관 3
 
1.1%
인공암벽장업 3
 
1.1%
종합체육시설업 2
 
0.8%

Length

2023-12-11T01:03:01.422574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
체육도장업 84
30.4%
체력단련장업 59
21.4%
당구장업 51
18.5%
골프연습장업 28
 
10.1%
가상체험 15
 
5.4%
체육시설업 15
 
5.4%
체육교습업 11
 
4.0%
무도학원업 4
 
1.4%
체육관 3
 
1.1%
인공암벽장업 3
 
1.1%
Other values (2) 3
 
1.1%

상호
Text

Distinct257
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-11T01:03:01.950340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length15
Mean length7.7394636
Min length2

Characters and Unicode

Total characters2020
Distinct characters315
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique253 ?
Unique (%)96.9%

Sample

1st row백양생활체육관
2nd row덕천생활체육공원
3rd row북구국민체육센터
4th row망고키즈수영장(화명본점)
5th row차오름 대승태권도
ValueCountFrequency (%)
당구클럽 13
 
3.1%
태권도장 9
 
2.1%
골프 8
 
1.9%
태권도 7
 
1.7%
피트니스 7
 
1.7%
스크린 6
 
1.4%
당구장 6
 
1.4%
화명점 6
 
1.4%
휘트니스 5
 
1.2%
화명 5
 
1.2%
Other values (316) 349
82.9%
2023-12-11T01:03:02.362553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
160
 
7.9%
93
 
4.6%
58
 
2.9%
55
 
2.7%
50
 
2.5%
49
 
2.4%
48
 
2.4%
46
 
2.3%
43
 
2.1%
40
 
2.0%
Other values (305) 1378
68.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1691
83.7%
Space Separator 160
 
7.9%
Uppercase Letter 105
 
5.2%
Lowercase Letter 25
 
1.2%
Open Punctuation 14
 
0.7%
Close Punctuation 14
 
0.7%
Other Punctuation 8
 
0.4%
Decimal Number 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
93
 
5.5%
58
 
3.4%
55
 
3.3%
50
 
3.0%
49
 
2.9%
48
 
2.8%
46
 
2.7%
43
 
2.5%
40
 
2.4%
40
 
2.4%
Other values (262) 1169
69.1%
Uppercase Letter
ValueCountFrequency (%)
G 14
13.3%
S 13
12.4%
P 10
 
9.5%
T 10
 
9.5%
M 7
 
6.7%
O 7
 
6.7%
K 6
 
5.7%
W 5
 
4.8%
R 4
 
3.8%
I 4
 
3.8%
Other values (12) 25
23.8%
Lowercase Letter
ValueCountFrequency (%)
l 3
12.0%
i 3
12.0%
e 3
12.0%
t 3
12.0%
u 2
8.0%
f 2
8.0%
o 2
8.0%
h 2
8.0%
a 1
 
4.0%
n 1
 
4.0%
Other values (3) 3
12.0%
Other Punctuation
ValueCountFrequency (%)
& 5
62.5%
' 2
 
25.0%
. 1
 
12.5%
Decimal Number
ValueCountFrequency (%)
2 2
66.7%
3 1
33.3%
Space Separator
ValueCountFrequency (%)
160
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1691
83.7%
Common 199
 
9.9%
Latin 129
 
6.4%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
93
 
5.5%
58
 
3.4%
55
 
3.3%
50
 
3.0%
49
 
2.9%
48
 
2.8%
46
 
2.7%
43
 
2.5%
40
 
2.4%
40
 
2.4%
Other values (262) 1169
69.1%
Latin
ValueCountFrequency (%)
G 14
 
10.9%
S 13
 
10.1%
P 10
 
7.8%
T 10
 
7.8%
M 7
 
5.4%
O 7
 
5.4%
K 6
 
4.7%
W 5
 
3.9%
R 4
 
3.1%
I 4
 
3.1%
Other values (24) 49
38.0%
Common
ValueCountFrequency (%)
160
80.4%
( 14
 
7.0%
) 14
 
7.0%
& 5
 
2.5%
2 2
 
1.0%
' 2
 
1.0%
. 1
 
0.5%
3 1
 
0.5%
Greek
ValueCountFrequency (%)
α 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1691
83.7%
ASCII 328
 
16.2%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
160
48.8%
G 14
 
4.3%
( 14
 
4.3%
) 14
 
4.3%
S 13
 
4.0%
P 10
 
3.0%
T 10
 
3.0%
M 7
 
2.1%
O 7
 
2.1%
K 6
 
1.8%
Other values (32) 73
22.3%
Hangul
ValueCountFrequency (%)
93
 
5.5%
58
 
3.4%
55
 
3.3%
50
 
3.0%
49
 
2.9%
48
 
2.8%
46
 
2.7%
43
 
2.5%
40
 
2.4%
40
 
2.4%
Other values (262) 1169
69.1%
None
ValueCountFrequency (%)
α 1
100.0%
Distinct256
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-11T01:03:02.619486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length43
Mean length31.666667
Min length17

Characters and Unicode

Total characters8265
Distinct characters188
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique251 ?
Unique (%)96.2%

Sample

1st row부산광역시 북구 금곡대로303번길 2, 비101,102호 (화명동, 시티타워)
2nd row부산광역시 북구 백양대로 1003, 상가동 지하2층 1호 (구포동, 현대아파트)
3rd row부산광역시 북구 만덕대로39번길 34, 4층 (덕천동)
4th row부산광역시 북구 금곡대로303번길 2, 비101,102호 (화명동, 시티타워)
5th row부산광역시 북구 만덕대로39번길 34, 4층 (덕천동)
ValueCountFrequency (%)
부산광역시 261
 
16.0%
북구 261
 
16.0%
화명동 85
 
5.2%
구포동 54
 
3.3%
만덕동 42
 
2.6%
3층 35
 
2.1%
덕천동 29
 
1.8%
2층 26
 
1.6%
금곡대로 25
 
1.5%
4층 25
 
1.5%
Other values (386) 790
48.4%
2023-12-11T01:03:03.033739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1383
 
16.7%
324
 
3.9%
309
 
3.7%
, 306
 
3.7%
291
 
3.5%
275
 
3.3%
) 267
 
3.2%
( 267
 
3.2%
1 266
 
3.2%
265
 
3.2%
Other values (178) 4312
52.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4590
55.5%
Decimal Number 1427
 
17.3%
Space Separator 1383
 
16.7%
Other Punctuation 307
 
3.7%
Close Punctuation 267
 
3.2%
Open Punctuation 267
 
3.2%
Dash Punctuation 18
 
0.2%
Math Symbol 3
 
< 0.1%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
324
 
7.1%
309
 
6.7%
291
 
6.3%
275
 
6.0%
265
 
5.8%
265
 
5.8%
262
 
5.7%
261
 
5.7%
256
 
5.6%
166
 
3.6%
Other values (158) 1916
41.7%
Decimal Number
ValueCountFrequency (%)
1 266
18.6%
2 236
16.5%
3 191
13.4%
0 180
12.6%
4 121
8.5%
5 103
 
7.2%
6 95
 
6.7%
7 92
 
6.4%
8 82
 
5.7%
9 61
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
A 1
33.3%
B 1
33.3%
J 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 306
99.7%
. 1
 
0.3%
Space Separator
ValueCountFrequency (%)
1383
100.0%
Close Punctuation
ValueCountFrequency (%)
) 267
100.0%
Open Punctuation
ValueCountFrequency (%)
( 267
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4590
55.5%
Common 3672
44.4%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
324
 
7.1%
309
 
6.7%
291
 
6.3%
275
 
6.0%
265
 
5.8%
265
 
5.8%
262
 
5.7%
261
 
5.7%
256
 
5.6%
166
 
3.6%
Other values (158) 1916
41.7%
Common
ValueCountFrequency (%)
1383
37.7%
, 306
 
8.3%
) 267
 
7.3%
( 267
 
7.3%
1 266
 
7.2%
2 236
 
6.4%
3 191
 
5.2%
0 180
 
4.9%
4 121
 
3.3%
5 103
 
2.8%
Other values (7) 352
 
9.6%
Latin
ValueCountFrequency (%)
A 1
33.3%
B 1
33.3%
J 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4590
55.5%
ASCII 3675
44.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1383
37.6%
, 306
 
8.3%
) 267
 
7.3%
( 267
 
7.3%
1 266
 
7.2%
2 236
 
6.4%
3 191
 
5.2%
0 180
 
4.9%
4 121
 
3.3%
5 103
 
2.8%
Other values (10) 355
 
9.7%
Hangul
ValueCountFrequency (%)
324
 
7.1%
309
 
6.7%
291
 
6.3%
275
 
6.0%
265
 
5.8%
265
 
5.8%
262
 
5.7%
261
 
5.7%
256
 
5.6%
166
 
3.6%
Other values (158) 1916
41.7%

시설전화번호
Text

MISSING 

Distinct133
Distinct (%)97.1%
Missing124
Missing (%)47.5%
Memory size2.2 KiB
2023-12-11T01:03:03.298412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1644
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique129 ?
Unique (%)94.2%

Sample

1st row051-309-4129
2nd row051-309-4885
3rd row051-365-7070
4th row051-342-3689
5th row051-335-5675
ValueCountFrequency (%)
051-342-0707 2
 
1.5%
051-336-0707 2
 
1.5%
051-365-7070 2
 
1.5%
051-362-0222 2
 
1.5%
051-337-9682 1
 
0.7%
051-365-7581 1
 
0.7%
051-342-8296 1
 
0.7%
051-337-8296 1
 
0.7%
051-333-5151 1
 
0.7%
051-362-5775 1
 
0.7%
Other values (123) 123
89.8%
2023-12-11T01:03:03.780099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 274
16.7%
3 273
16.6%
0 226
13.7%
1 214
13.0%
5 205
12.5%
6 98
 
6.0%
4 78
 
4.7%
2 75
 
4.6%
9 71
 
4.3%
8 68
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1370
83.3%
Dash Punctuation 274
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 273
19.9%
0 226
16.5%
1 214
15.6%
5 205
15.0%
6 98
 
7.2%
4 78
 
5.7%
2 75
 
5.5%
9 71
 
5.2%
8 68
 
5.0%
7 62
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 274
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1644
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 274
16.7%
3 273
16.6%
0 226
13.7%
1 214
13.0%
5 205
12.5%
6 98
 
6.0%
4 78
 
4.7%
2 75
 
4.6%
9 71
 
4.3%
8 68
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1644
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 274
16.7%
3 273
16.6%
0 226
13.7%
1 214
13.0%
5 205
12.5%
6 98
 
6.0%
4 78
 
4.7%
2 75
 
4.6%
9 71
 
4.3%
8 68
 
4.1%

공영민영구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
민간
258 
공공
 
3

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공
2nd row공공
3rd row공공
4th row민간
5th row민간

Common Values

ValueCountFrequency (%)
민간 258
98.9%
공공 3
 
1.1%

Length

2023-12-11T01:03:03.932817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:03:04.037241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민간 258
98.9%
공공 3
 
1.1%

Correlations

2023-12-11T01:03:04.115914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종공영민영구분
업종1.0001.000
공영민영구분1.0001.000
2023-12-11T01:03:04.208660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공영민영구분업종
공영민영구분1.0000.982
업종0.9821.000
2023-12-11T01:03:04.305443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종공영민영구분
업종1.0000.982
공영민영구분0.9821.000

Missing values

2023-12-11T01:03:01.229820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:03:01.320459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호시설주소(도로명)시설전화번호공영민영구분
0체육관백양생활체육관부산광역시 북구 금곡대로303번길 2, 비101,102호 (화명동, 시티타워)051-309-4129공공
1체육관덕천생활체육공원부산광역시 북구 백양대로 1003, 상가동 지하2층 1호 (구포동, 현대아파트)051-309-4885공공
2체육관북구국민체육센터부산광역시 북구 만덕대로39번길 34, 4층 (덕천동)051-365-7070공공
3수영장업망고키즈수영장(화명본점)부산광역시 북구 금곡대로303번길 2, 비101,102호 (화명동, 시티타워)<NA>민간
4체육도장업차오름 대승태권도부산광역시 북구 만덕대로39번길 34, 4층 (덕천동)051-342-3689민간
5체육도장업청운태권도부산광역시 북구 만덕대로39번길 21, 2층 (덕천동)051-335-5675민간
6체육도장업화랑체육도장부산광역시 북구 양달로4번길 11 (화명동,3층)051-332-3376민간
7체육도장업한성체육관부산광역시 북구 만덕2로44번길 79, 2층 (만덕동)051-334-0398민간
8체육도장업태백체육관부산광역시 북구 팽나무로 52 (구포동)051-338-2175민간
9체육도장업차오름 신천태권도부산광역시 북구 시랑로 84-1<NA>민간
업종상호시설주소(도로명)시설전화번호공영민영구분
251체육교습업점프윙스 줄넘기클럽부산광역시 북구 화명신도시로 129, 골든프라자 803호 (화명동)<NA>민간
252체육교습업일레븐 스포츠부산광역시 북구 만덕3로 58, 6층 (만덕동)<NA>민간
253체육교습업점프윙스(구포점)부산광역시 북구 시랑로 133, 원진빌딩 3층 (구포동)<NA>민간
254체육교습업트루스포츠부산광역시 북구 시랑로 24, 4층 (구포동)<NA>민간
255체육교습업FC 리틀슛부산광역시 북구 화명대로 20, 대성빌딩 602-2호 (화명동)<NA>민간
256체육교습업발렌시아씨에프아카데미(주)부산광역시 북구 화명신도시로 127, 1202호 (화명동)<NA>민간
257체육교습업줄넘기세상 화명점부산광역시 북구 화명대로 101, 118동 (상가) 701,702호 (화명동, 삼한힐파크)<NA>민간
258인공암벽장업락 클라이밍부산광역시 북구 의성로 79, 지하1층 (덕천동)051-331-8382민간
259인공암벽장업SO클라이밍짐부산광역시 북구 금곡대로303번길 61, 2층 201호 (화명동)051-361-4443민간
260인공암벽장업화명 자바클라이밍센터부산광역시 북구 양달로 40, 101호 (화명동)<NA>민간