Overview

Dataset statistics

Number of variables: 6
Number of observations: 392
Missing cells: 148
Missing cells (%): 6.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 18.5 KiB
Average record size in memory: 48.3 B
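
The derived figures in this block follow from the raw counts: 392 rows by 6 columns gives 2,352 cells, of which 148 are missing. The snippet below is a minimal sketch of that arithmetic. The variable names are illustrative and not part of the report, and the record-size line assumes 1 KiB = 1024 B.

```python
# Figures copied from the dataset statistics in the report.
n_variables = 6
n_observations = 392
missing_cells = 148
total_size_kib = 18.5

total_cells = n_variables * n_observations             # every cell in the table
missing_pct = 100 * missing_cells / total_cells        # share of empty cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"{total_cells} cells, {missing_pct:.1f}% missing, "
      f"{avg_record_bytes:.1f} B per record")
```

Rounded to one decimal place, both results agree with the values shown in the report.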

Variable types

Categorical: 1
Text: 3
DateTime: 2

Dataset

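Description: 부산광역시_사하구_체육시설업신고현황_20221124 (Busan Metropolitan City, Saha-gu: registered sports facility businesses, as of 2022-11-24)
Author: 부산광역시 사하구 (Saha-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3045764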

Alerts

데이터기준일자 (data reference date) has constant value "2022-11-24" (Constant)
시설전화번호 (facility phone number) has 148 (37.8%) missing values (Missing)

Reproduction

Analysis started: 2024-04-21 08:19:42.776172
Analysis finished: 2024-04-21 08:19:44.084753
Duration: 1.31 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
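
The reported duration can be checked against the two timestamps. This is a small sketch using only the Python standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of the report.
started = datetime.fromisoformat("2024-04-21 08:19:42.776172")
finished = datetime.fromisoformat("2024-04-21 08:19:44.084753")

# Elapsed wall-clock time between the two timestamps, in seconds.
duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```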

Variables

업종 (business category)
Categorical

Distinct: 10
Distinct (%): 2.6%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Value | Count
당구장업 | 114
체육도장업 | 108
체력단련장업 | 83
골프연습장업 | 53
가상체험 체육시설업 | 12
Other values (5) | 22

Length

Max length: 10
Median length: 7
Mean length: 5.2168367
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수영장업
2nd row: 수영장업
3rd row: 수영장업
4th row: 체육도장업
5th row: 체육도장업

Common Values

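Value | Count | Frequency (%)
당구장업 (billiard hall) | 114 | 29.1%
체육도장업 (martial arts gym) | 108 | 27.6%
체력단련장업 (fitness center) | 83 | 21.2%
골프연습장업 (golf practice range) | 53 | 13.5%
가상체험 체육시설업 (virtual-experience sports facility) | 12 | 3.1%
체육교습업 (sports instruction) | 10 | 2.6%
무도학원업 (dance academy) | 5 | 1.3%
수영장업 (swimming pool) | 3 | 0.8%
종합체육시설업 (multi-sport complex) | 2 | 0.5%
인공암벽장업 (artificial climbing wall) | 2 | 0.5%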
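
The summary figures for this variable can be re-derived from the counts in the Common Values table. The sketch below rebuilds the column from those counts and recomputes them. It assumes "Distinct" means the number of different values and "Unique" means the number of values that occur exactly once; the variable names are illustrative.

```python
from collections import Counter

# Counts copied from the Common Values table for the 업종 column.
counts = {
    "당구장업": 114, "체육도장업": 108, "체력단련장업": 83, "골프연습장업": 53,
    "가상체험 체육시설업": 12, "체육교습업": 10, "무도학원업": 5,
    "수영장업": 3, "종합체육시설업": 2, "인공암벽장업": 2,
}

# Rebuild the column (row order does not matter for these statistics).
column = [value for value, n in counts.items() for _ in range(n)]
freq = Counter(column)
n_rows = len(column)

distinct = len(freq)                                   # number of different values
unique = sum(1 for n in freq.values() if n == 1)       # values occurring exactly once
share = {v: round(100 * n / n_rows, 1) for v, n in freq.items()}
mean_length = sum(len(v) for v in column) / n_rows     # average string length

print(distinct, unique, share["당구장업"], round(mean_length, 7))
```

Because every category occurs at least twice, the unique count is 0 even though there are 10 distinct values.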

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the most common values]

Words

Value | Count | Frequency (%)
당구장업 | 114 | 28.2%
체육도장업 | 108 | 26.7%
체력단련장업 | 83 | 20.5%
골프연습장업 | 53 | 13.1%
가상체험 | 12 | 3.0%
체육시설업 | 12 | 3.0%
체육교습업 | 10 | 2.5%
무도학원업 | 5 | 1.2%
수영장업 | 3 | 0.7%
종합체육시설업 | 2 | 0.5%

상호 (business name)
Text

Distinct: 378
Distinct (%): 96.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 24
Median length: 18
Mean length: 6.8903061
Min length: 2

Characters and Unicode

Total characters: 2701
Distinct characters: 355
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
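
As an illustration of how such per-character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own implementation. The `CATEGORY_NAMES` mapping and the `category_counts` helper are hypothetical names introduced here, and the mapping covers only the categories that appear in this report.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Nd": "Decimal Number", "Zs": "Space Separator", "Po": "Other Punctuation",
    "Ps": "Open Punctuation", "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation", "Sm": "Math Symbol",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in values:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# The five sample values the report lists for the 상호 column.
sample = ["(주)승학스포츠센터", "망고키즈수영장", "서부안심생존센터", "정심검도장", "신평태권도장"]
print(category_counts(sample).most_common())
```

Hangul syllables fall under "Other Letter" (Lo), which is why that category dominates the text columns in this report.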

Unique

Unique: 365
Unique (%): 93.1%

Sample

1st row: (주)승학스포츠센터
2nd row: 망고키즈수영장
3rd row: 서부안심생존센터
4th row: 정심검도장
5th row: 신평태권도장

Words

Value | Count | Frequency (%)
당구클럽 | 10 | 1.9%
태권도 | 9 | 1.7%
당구장 | 8 | 1.6%
사하 | 4 | 0.8%
아카데미 | 4 | 0.8%
골프 | 4 | 0.8%
휘트니스 | 4 | 0.8%
헬스 | 3 | 0.6%
태권도장 | 3 | 0.6%
용인대 | 3 | 0.6%
Other values (428) | 463 | 89.9%

Most occurring characters

ValueCountFrequency (%)
133
 
4.9%
123
 
4.6%
115
 
4.3%
114
 
4.2%
99
 
3.7%
78
 
2.9%
67
 
2.5%
66
 
2.4%
66
 
2.4%
58
 
2.1%
Other values (345) 1782
66.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2405 | 89.0%
Space Separator | 123 | 4.6%
Uppercase Letter | 117 | 4.3%
Decimal Number | 24 | 0.9%
Lowercase Letter | 12 | 0.4%
Other Punctuation | 8 | 0.3%
Open Punctuation | 6 | 0.2%
Close Punctuation | 6 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
133
 
5.5%
115
 
4.8%
114
 
4.7%
99
 
4.1%
78
 
3.2%
67
 
2.8%
66
 
2.7%
66
 
2.7%
58
 
2.4%
56
 
2.3%
Other values (293) 1553
64.6%

Uppercase Letter
Value | Count | Frequency (%)
O | 12 | 10.3%
T | 11 | 9.4%
G | 10 | 8.5%
P | 9 | 7.7%
A | 8 | 6.8%
K | 8 | 6.8%
S | 8 | 6.8%
M | 7 | 6.0%
J | 7 | 6.0%
E | 6 | 5.1%
Other values (14) | 31 | 26.5%

Lowercase Letter
Value | Count | Frequency (%)
e | 2 | 16.7%
l | 2 | 16.7%
h | 1 | 8.3%
a | 1 | 8.3%
i | 1 | 8.3%
t | 1 | 8.3%
f | 1 | 8.3%
g | 1 | 8.3%
y | 1 | 8.3%
m | 1 | 8.3%

Decimal Number
Value | Count | Frequency (%)
2 | 7 | 29.2%
5 | 4 | 16.7%
4 | 3 | 12.5%
0 | 2 | 8.3%
9 | 2 | 8.3%
3 | 2 | 8.3%
7 | 2 | 8.3%
1 | 1 | 4.2%
8 | 1 | 4.2%

Other Punctuation
Value | Count | Frequency (%)
& | 3 | 37.5%
· | 1 | 12.5%
. | 1 | 12.5%
# | 1 | 12.5%
' | 1 | 12.5%
, | 1 | 12.5%

Space Separator
Value | Count | Frequency (%)
(space) | 123 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 6 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 6 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2405 | 89.0%
Common | 167 | 6.2%
Latin | 129 | 4.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
133
 
5.5%
115
 
4.8%
114
 
4.7%
99
 
4.1%
78
 
3.2%
67
 
2.8%
66
 
2.7%
66
 
2.7%
58
 
2.4%
56
 
2.3%
Other values (293) 1553
64.6%

Latin
Value | Count | Frequency (%)
O | 12 | 9.3%
T | 11 | 8.5%
G | 10 | 7.8%
P | 9 | 7.0%
A | 8 | 6.2%
K | 8 | 6.2%
S | 8 | 6.2%
M | 7 | 5.4%
J | 7 | 5.4%
E | 6 | 4.7%
Other values (24) | 43 | 33.3%

Common
Value | Count | Frequency (%)
(space) | 123 | 73.7%
2 | 7 | 4.2%
( | 6 | 3.6%
) | 6 | 3.6%
5 | 4 | 2.4%
& | 3 | 1.8%
4 | 3 | 1.8%
0 | 2 | 1.2%
9 | 2 | 1.2%
3 | 2 | 1.2%
Other values (8) | 9 | 5.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2405 | 89.0%
ASCII | 295 | 10.9%
None | 1 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
133
 
5.5%
115
 
4.8%
114
 
4.7%
99
 
4.1%
78
 
3.2%
67
 
2.8%
66
 
2.7%
66
 
2.7%
58
 
2.4%
56
 
2.3%
Other values (293) 1553
64.6%

ASCII
Value | Count | Frequency (%)
(space) | 123 | 41.7%
O | 12 | 4.1%
T | 11 | 3.7%
G | 10 | 3.4%
P | 9 | 3.1%
A | 8 | 2.7%
K | 8 | 2.7%
S | 8 | 2.7%
M | 7 | 2.4%
2 | 7 | 2.4%
Other values (41) | 92 | 31.2%

None
Value | Count | Frequency (%)
· | 1 | 100.0%

시설주소(도로명) (facility address, road name)
Text

Distinct: 389
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 56
Median length: 48
Mean length: 31.303571
Min length: 21

Characters and Unicode

Total characters: 12271
Distinct characters: 197
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 386
Unique (%): 98.5%

Sample

1st row: 부산광역시 사하구 마하로48번길 26 (괴정동)
2nd row: 부산광역시 사하구 다대로 317, 지하1층 (장림동)
3rd row: 부산광역시 사하구 원양로 379 (감천동)
4th row: 부산광역시 사하구 낙동대로234번길 38 (괴정동)
5th row: 부산광역시 사하구 장평로 307, 3층 (신평동)

Words

Value | Count | Frequency (%)
부산광역시 | 392 | 16.8%
사하구 | 392 | 16.8%
다대동 | 52 | 2.2%
하단동 | 49 | 2.1%
다대로 | 48 | 2.1%
2층 | 44 | 1.9%
괴정동 | 43 | 1.8%
장림동 | 42 | 1.8%
낙동대로 | 42 | 1.8%
3층 | 39 | 1.7%
Other values (546) | 1185 | 50.9%

Most occurring characters

ValueCountFrequency (%)
2058
 
16.8%
580
 
4.7%
523
 
4.3%
, 423
 
3.4%
406
 
3.3%
403
 
3.3%
400
 
3.3%
398
 
3.2%
( 396
 
3.2%
) 396
 
3.2%
Other values (187) 6288
51.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 7069 | 57.6%
Space Separator | 2058 | 16.8%
Decimal Number | 1890 | 15.4%
Other Punctuation | 423 | 3.4%
Open Punctuation | 396 | 3.2%
Close Punctuation | 396 | 3.2%
Dash Punctuation | 20 | 0.2%
Uppercase Letter | 15 | 0.1%
Math Symbol | 3 | < 0.1%
Lowercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
580
 
8.2%
523
 
7.4%
406
 
5.7%
403
 
5.7%
400
 
5.7%
398
 
5.6%
395
 
5.6%
393
 
5.6%
392
 
5.5%
380
 
5.4%
Other values (167) 2799
39.6%

Decimal Number
Value | Count | Frequency (%)
2 | 325 | 17.2%
1 | 318 | 16.8%
3 | 243 | 12.9%
0 | 199 | 10.5%
4 | 198 | 10.5%
5 | 184 | 9.7%
7 | 132 | 7.0%
6 | 122 | 6.5%
8 | 88 | 4.7%
9 | 81 | 4.3%

Uppercase Letter
Value | Count | Frequency (%)
B | 11 | 73.3%
W | 2 | 13.3%
A | 2 | 13.3%

Space Separator
Value | Count | Frequency (%)
(space) | 2058 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 423 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 396 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 396 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 20 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 3 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 7069 | 57.6%
Common | 5186 | 42.3%
Latin | 16 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
580
 
8.2%
523
 
7.4%
406
 
5.7%
403
 
5.7%
400
 
5.7%
398
 
5.6%
395
 
5.6%
393
 
5.6%
392
 
5.5%
380
 
5.4%
Other values (167) 2799
39.6%

Common
Value | Count | Frequency (%)
(space) | 2058 | 39.7%
, | 423 | 8.2%
( | 396 | 7.6%
) | 396 | 7.6%
2 | 325 | 6.3%
1 | 318 | 6.1%
3 | 243 | 4.7%
0 | 199 | 3.8%
4 | 198 | 3.8%
5 | 184 | 3.5%
Other values (6) | 446 | 8.6%

Latin
Value | Count | Frequency (%)
B | 11 | 68.8%
W | 2 | 12.5%
A | 2 | 12.5%
e | 1 | 6.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 7069 | 57.6%
ASCII | 5202 | 42.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 2058 | 39.6%
, | 423 | 8.1%
( | 396 | 7.6%
) | 396 | 7.6%
2 | 325 | 6.2%
1 | 318 | 6.1%
3 | 243 | 4.7%
0 | 199 | 3.8%
4 | 198 | 3.8%
5 | 184 | 3.5%
Other values (10) | 462 | 8.9%

Hangul
ValueCountFrequency (%)
580
 
8.2%
523
 
7.4%
406
 
5.7%
403
 
5.7%
400
 
5.7%
398
 
5.6%
395
 
5.6%
393
 
5.6%
392
 
5.5%
380
 
5.4%
Other values (167) 2799
39.6%

시설전화번호 (facility phone number)
Text

MISSING 

Distinct: 239
Distinct (%): 98.0%
Missing: 148
Missing (%): 37.8%
Memory size: 3.2 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 2928
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 234
Unique (%): 95.9%

Sample

1st row: 051-207-0885
2nd row: 051-265-6065
3rd row: 051-203-1304
4th row: 051-291-8873
5th row: 051-291-2285
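
Every non-missing phone number is exactly 12 characters long, which is consistent with a fixed `051-NNN-NNNN` layout. The sketch below checks the listed sample values against that pattern and recomputes the character totals it would imply. The pattern is an assumption inferred from the samples: the report itself only confirms the length and the character set.

```python
import re

# Assumed pattern: Busan area code 051, then 3 digits, then 4 digits.
PHONE_RE = re.compile(r"051-\d{3}-\d{4}")

# Sample values listed in the report for 시설전화번호.
sample = ["051-207-0885", "051-265-6065", "051-203-1304",
          "051-291-8873", "051-291-2285"]
all_match = all(PHONE_RE.fullmatch(s) is not None for s in sample)

# Totals implied by a fixed 12-character format with two dashes per value.
n_present = 392 - 148                  # observations minus missing values
total_chars = n_present * 12
total_dashes = n_present * 2
total_digits = total_chars - total_dashes

print(all_match, n_present, total_chars, total_dashes, total_digits)
```

The computed totals (2928 characters, 488 dashes, 2440 digits) agree with the character statistics reported for this variable.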

Words

Value | Count | Frequency (%)
051-292-3892 | 2 | 0.8%
051-291-8869 | 2 | 0.8%
051-205-7272 | 2 | 0.8%
051-207-0885 | 2 | 0.8%
051-264-1818 | 2 | 0.8%
051-204-6565 | 1 | 0.4%
051-262-9682 | 1 | 0.4%
051-292-1322 | 1 | 0.4%
051-206-9996 | 1 | 0.4%
051-203-1018 | 1 | 0.4%
Other values (229) | 229 | 93.9%

Most occurring characters

Value | Count | Frequency (%)
- | 488 | 16.7%
0 | 485 | 16.6%
5 | 393 | 13.4%
1 | 383 | 13.1%
2 | 380 | 13.0%
6 | 196 | 6.7%
9 | 134 | 4.6%
3 | 131 | 4.5%
8 | 127 | 4.3%
7 | 116 | 4.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2440 | 83.3%
Dash Punctuation | 488 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 485 | 19.9%
5 | 393 | 16.1%
1 | 383 | 15.7%
2 | 380 | 15.6%
6 | 196 | 8.0%
9 | 134 | 5.5%
3 | 131 | 5.4%
8 | 127 | 5.2%
7 | 116 | 4.8%
4 | 95 | 3.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 488 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2928 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 488 | 16.7%
0 | 485 | 16.6%
5 | 393 | 13.4%
1 | 383 | 13.1%
2 | 380 | 13.0%
6 | 196 | 6.7%
9 | 134 | 4.6%
3 | 131 | 4.5%
8 | 127 | 4.3%
7 | 116 | 4.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2928 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 488 | 16.7%
0 | 485 | 16.6%
5 | 393 | 13.4%
1 | 383 | 13.1%
2 | 380 | 13.0%
6 | 196 | 6.7%
9 | 134 | 4.6%
3 | 131 | 4.5%
8 | 127 | 4.3%
7 | 116 | 4.0%

최초등록일자 (initial registration date)
Date

Distinct: 362
Distinct (%): 92.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB
Minimum: 1988-11-26 00:00:00
Maximum: 2022-11-22 00:00:00
Histogram with fixed size bins (bins=50)
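
To give a sense of what 50 fixed-size bins mean for this date range, the sketch below splits the span between the reported minimum and maximum into equal-width bins and assigns a few dates to them. The report does not state how the profiler computes bin edges, so the equal-width scheme and the `bin_index` helper are assumptions made for illustration.

```python
from datetime import date

# Range reported for the date column with minimum 1988-11-26.
minimum = date(1988, 11, 26)
maximum = date(2022, 11, 22)
n_bins = 50

span_days = (maximum - minimum).days
bin_width_days = span_days / n_bins        # each bin covers this many days

def bin_index(d: date) -> int:
    """Return the 0-based bin for a date; the maximum falls in the last bin."""
    offset = (d - minimum).days
    return min(int(offset / bin_width_days), n_bins - 1)

# Dates taken from the sample rows of the report.
for d in (date(1988, 11, 26), date(2000, 7, 3), date(2022, 6, 10)):
    print(d, "-> bin", bin_index(d))
```

With this scheme each bin spans roughly 248 days, a little over eight months.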

데이터기준일자 (data reference date)
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB
Minimum: 2022-11-24 00:00:00
Maximum: 2022-11-24 00:00:00
Histogram with fixed size bins (bins=1)

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
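
Both views can be approximated in plain text. The sketch below takes the first ten sample rows of the report, reduced to two columns, and computes the per-column count of present values plus a row-by-row matrix. The `#` and `.` symbols are an illustrative choice, not the profiler's rendering.

```python
# First ten rows of the report's sample, reduced to two columns;
# None stands for the <NA> entries shown in the sample table.
head = [
    ("(주)승학스포츠센터", "051-207-0885"),
    ("망고키즈수영장", "051-265-6065"),
    ("서부안심생존센터", None),
    ("정심검도장", "051-203-1304"),
    ("신평태권도장", None),
    ("감정체육관", "051-291-8873"),
    ("하남체육관", None),
    ("낭만 태권도", "051-291-2285"),
    ("MAD태권도(꿈을 만드는 태권도)", "051-203-5205"),
    ("이든태권도장", "051-265-3356"),
]
columns = ("상호", "시설전화번호")

# Nullity by column: how many values are present in each column.
present = {name: sum(row[i] is not None for row in head)
           for i, name in enumerate(columns)}

# Nullity matrix: one line per row, '#' for a value and '.' for a gap.
matrix = ["".join("#" if v is not None else "." for v in row) for row in head]

print(present)
print("\n".join(matrix))
```

In this ten-row excerpt only the phone column has gaps, in line with the Missing alert raised for 시설전화번호.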

Sample

First rows

Index | 업종 | 상호 | 시설주소(도로명) | 시설전화번호 | 최초등록일자 | 데이터기준일자
0 | 수영장업 | (주)승학스포츠센터 | 부산광역시 사하구 마하로48번길 26 (괴정동) | 051-207-0885 | 2000-07-03 | 2022-11-24
1 | 수영장업 | 망고키즈수영장 | 부산광역시 사하구 다대로 317, 지하1층 (장림동) | 051-265-6065 | 2015-10-15 | 2022-11-24
2 | 수영장업 | 서부안심생존센터 | 부산광역시 사하구 원양로 379 (감천동) | <NA> | 2020-10-23 | 2022-11-24
3 | 체육도장업 | 정심검도장 | 부산광역시 사하구 낙동대로234번길 38 (괴정동) | 051-203-1304 | 1988-11-26 | 2022-11-24
4 | 체육도장업 | 신평태권도장 | 부산광역시 사하구 장평로 307, 3층 (신평동) | <NA> | 1989-12-18 | 2022-11-24
5 | 체육도장업 | 감정체육관 | 부산광역시 사하구 옥천로 80-1 (감천동,3층) | 051-291-8873 | 1993-03-30 | 2022-11-24
6 | 체육도장업 | 하남체육관 | 부산광역시 사하구 하신번영로207번길 2 (하단동,5층) | <NA> | 1994-11-25 | 2022-11-24
7 | 체육도장업 | 낭만 태권도 | 부산광역시 사하구 다대로 24 (당리동) | 051-291-2285 | 1995-07-12 | 2022-11-24
8 | 체육도장업 | MAD태권도(꿈을 만드는 태권도) | 부산광역시 사하구 승학로 132 (당리동) | 051-203-5205 | 1996-03-04 | 2022-11-24
9 | 체육도장업 | 이든태권도장 | 부산광역시 사하구 다대로429번길 20, 상가동 301호 (다대동, 삼환아파트) | 051-265-3356 | 1996-09-25 | 2022-11-24

Last rows

Index | 업종 | 상호 | 시설주소(도로명) | 시설전화번호 | 최초등록일자 | 데이터기준일자
382 | 체육교습업 | 점프윙스줄넘기클럽 | 부산광역시 사하구 윤공단로75번길 43, 301, 302호 (다대동) | 051-264-9979 | 2021-10-26 | 2022-11-24
383 | 체육교습업 | J스포츠스쿨 사하점 | 부산광역시 사하구 다대로 240, 지하1층 (장림동) | 051-262-6915 | 2021-10-27 | 2022-11-24
384 | 체육교습업 | 아르테 유소년 축구교실 | 부산광역시 사하구 다송로 71, 306, 307, 308호 (다대동) | <NA> | 2021-11-04 | 2022-11-24
385 | 체육교습업 | 점프윙스줄넘기클럽 | 부산광역시 사하구 다대로 473, 맘모스상가 1동 3층 23, 25호 (다대동, 다대포현대아파트) | <NA> | 2021-11-08 | 2022-11-24
386 | 체육교습업 | 한상운풋볼스튜디오 | 부산광역시 사하구 하신중앙로 164, 1층 (신평동) | <NA> | 2021-11-18 | 2022-11-24
387 | 체육교습업 | 디오스포츠 | 부산광역시 사하구 하신번영로 324, 3층 (하단동) | <NA> | 2022-02-11 | 2022-11-24
388 | 체육교습업 | 인피니트스포츠 사하점 | 부산광역시 사하구 마하로48번길 26, 401, 501호 (괴정동) | <NA> | 2022-03-24 | 2022-11-24
389 | 체육교습업 | 점프윙스줄넘기클럽 | 부산광역시 사하구 하신번영로207번길 17, 4층 (하단동) | <NA> | 2022-03-24 | 2022-11-24
390 | 인공암벽장업 | 짱클라이밍 | 부산광역시 사하구 다대로429번길 5, 101호 (다대동) | <NA> | 2021-09-28 | 2022-11-24
391 | 인공암벽장업 | 락오디세이 하단 | 부산광역시 사하구 낙동대로 498, 2층 (하단동) | <NA> | 2022-06-10 | 2022-11-24