Overview

Dataset statistics

Number of variables: 4
Number of observations: 156
Missing cells: 39
Missing cells (%): 6.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.0 KiB
Average record size in memory: 32.8 B
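
The derived figures in this block follow from the raw counts. The sketch below reproduces that arithmetic, assuming the missing-cell percentage is taken over all cells (rows × columns) and the average record size is total memory divided by the row count. The variable names are illustrative and are not part of the report.

```python
# Counts copied from the dataset statistics in this report.
n_variables = 4
n_observations = 156
missing_cells = 39
total_size_bytes = 5.0 * 1024  # 5.0 KiB, itself a rounded figure

# Assumption: the missing-cell percentage is taken over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: average record size is total memory divided by the row count.
avg_record_bytes = total_size_bytes / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.2f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

The unrounded missing-cell share is 39 / 624 = 6.25%, which the report displays to one decimal place as 6.2%.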

Variable types

Categorical: 1
Text: 3

Dataset

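Description: Provides status information on physical fitness facility businesses (gyms, fitness centers, etc.) in Gangseo-gu, Seoul. Fields include business type (업종), business name (상호), facility address (시설주소), and facility phone number (시설전화번호).
Author: Gangseo-gu, Seoul (서울특별시 강서구)
URL: https://www.data.go.kr/data/15074340/fileData.do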

Alerts

업종 has constant value "체력단련장업" (Constant)
시설전화번호 has 39 (25.0%) missing values (Missing)

Reproduction

Analysis started: 2023-12-12 03:30:40.673562
Analysis finished: 2023-12-12 03:30:41.233430
Duration: 0.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB
체력단련장업: 156

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 체력단련장업
2nd row: 체력단련장업
3rd row: 체력단련장업
4th row: 체력단련장업
5th row: 체력단련장업

Common Values

Value | Count | Frequency (%)
체력단련장업 | 156 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
체력단련장업 | 156 | 100.0%

상호
Text

Distinct: 154
Distinct (%): 98.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 20
Median length: 16
Mean length: 7.7115385
Min length: 3

Characters and Unicode

Total characters: 1203
Distinct characters: 246
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
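
As a rough illustration of how such character properties can be read off a text value, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the report's own implementation: the helper name `category_counts` is hypothetical, and the standard library returns two-letter codes (`Lo`, `Lu`, `Zs`) rather than the long names shown in this report (Other Letter, Uppercase Letter, Space Separator). The standard library does not expose script or block lookups, so only categories are covered.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count Unicode general-category codes (e.g. 'Lo', 'Lu', 'Zs') in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# Business name taken from the sample rows of this report.
counts = category_counts("GOTO 방화점")
for code, n in counts.most_common():
    print(code, n)
```

Hangul syllables fall under `Lo` (Other Letter), which is why that category dominates the counts for the Korean text columns.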

Unique

Unique: 152
Unique (%): 97.4%
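
The distinct and unique counts measure different things. The sketch below assumes that "distinct" is the number of different non-missing values and "unique" is the number of values that occur exactly once, which is consistent with the figures reported here. The helper `distinct_and_unique` and the placeholder names are hypothetical, not taken from the dataset.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique): different values, and values seen exactly once."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Hypothetical column shaped like 상호: 152 one-off names plus 2 names used twice.
names = [f"gym-{i}" for i in range(152)] + ["dup-a", "dup-a", "dup-b", "dup-b"]
print(len(names), distinct_and_unique(names))
```

Under that reading, 156 rows with 154 distinct and 152 unique values means two business names each appear twice. A constant column such as 업종 has 1 distinct value and 0 unique values.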

Sample

1st row: 백제헬스크럽
2nd row: 제일휘트니스
3rd row: 그린헬스크럽
4th row: 헬스라인
5th row: 하이짐 까치산점
Value | Count | Frequency (%)
휘트니스 | 18 | 6.6%
(no value shown) | 8 | 2.9%
gym | 4 | 1.5%
커브스 | 4 | 1.5%
피트니스 | 4 | 1.5%
goto | 4 | 1.5%
화곡점 | 3 | 1.1%
에이블짐 | 3 | 1.1%
크로스핏 | 3 | 1.1%
스포츠 | 3 | 1.1%
Other values (197) | 218 | 80.1%

Most occurring characters

ValueCountFrequency (%)
116
 
9.6%
96
 
8.0%
46
 
3.8%
39
 
3.2%
37
 
3.1%
37
 
3.1%
27
 
2.2%
20
 
1.7%
18
 
1.5%
15
 
1.2%
Other values (236) 752
62.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 881 | 73.2%
Uppercase Letter | 134 | 11.1%
Space Separator | 116 | 9.6%
Lowercase Letter | 39 | 3.2%
Decimal Number | 14 | 1.2%
Close Punctuation | 7 | 0.6%
Other Punctuation | 5 | 0.4%
Open Punctuation | 4 | 0.3%
Dash Punctuation | 3 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
96
 
10.9%
46
 
5.2%
39
 
4.4%
37
 
4.2%
37
 
4.2%
27
 
3.1%
20
 
2.3%
18
 
2.0%
15
 
1.7%
15
 
1.7%
Other values (181) 531
60.3%
Uppercase Letter
ValueCountFrequency (%)
T 14
 
10.4%
G 13
 
9.7%
M 12
 
9.0%
O 12
 
9.0%
E 11
 
8.2%
Y 9
 
6.7%
S 8
 
6.0%
P 7
 
5.2%
K 7
 
5.2%
I 6
 
4.5%
Other values (13) 35
26.1%
Lowercase Letter
ValueCountFrequency (%)
s 4
10.3%
a 4
10.3%
u 3
 
7.7%
o 3
 
7.7%
k 3
 
7.7%
l 3
 
7.7%
e 3
 
7.7%
c 3
 
7.7%
r 2
 
5.1%
h 2
 
5.1%
Other values (7) 9
23.1%
Decimal Number
ValueCountFrequency (%)
2 5
35.7%
4 2
 
14.3%
1 2
 
14.3%
3 1
 
7.1%
0 1
 
7.1%
8 1
 
7.1%
9 1
 
7.1%
6 1
 
7.1%
Other Punctuation
ValueCountFrequency (%)
& 3
60.0%
. 1
 
20.0%
1
 
20.0%
Space Separator
ValueCountFrequency (%)
116
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 881 | 73.2%
Latin | 173 | 14.4%
Common | 149 | 12.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
96
 
10.9%
46
 
5.2%
39
 
4.4%
37
 
4.2%
37
 
4.2%
27
 
3.1%
20
 
2.3%
18
 
2.0%
15
 
1.7%
15
 
1.7%
Other values (181) 531
60.3%
Latin
ValueCountFrequency (%)
T 14
 
8.1%
G 13
 
7.5%
M 12
 
6.9%
O 12
 
6.9%
E 11
 
6.4%
Y 9
 
5.2%
S 8
 
4.6%
P 7
 
4.0%
K 7
 
4.0%
I 6
 
3.5%
Other values (30) 74
42.8%
Common
ValueCountFrequency (%)
116
77.9%
) 7
 
4.7%
2 5
 
3.4%
( 4
 
2.7%
- 3
 
2.0%
& 3
 
2.0%
4 2
 
1.3%
1 2
 
1.3%
3 1
 
0.7%
0 1
 
0.7%
Other values (5) 5
 
3.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 881 | 73.2%
ASCII | 321 | 26.7%
None | 1 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
116
36.1%
T 14
 
4.4%
G 13
 
4.0%
M 12
 
3.7%
O 12
 
3.7%
E 11
 
3.4%
Y 9
 
2.8%
S 8
 
2.5%
) 7
 
2.2%
P 7
 
2.2%
Other values (44) 112
34.9%
Hangul
ValueCountFrequency (%)
96
 
10.9%
46
 
5.2%
39
 
4.4%
37
 
4.2%
37
 
4.2%
27
 
3.1%
20
 
2.3%
18
 
2.0%
15
 
1.7%
15
 
1.7%
Other values (181) 531
60.3%
None
ValueCountFrequency (%)
1
100.0%

시설주소
Text
Distinct: 155
Distinct (%): 99.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 61
Median length: 46
Mean length: 37.346154
Min length: 22

Characters and Unicode

Total characters: 5826
Distinct characters: 229
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 154
Unique (%): 98.7%

Sample

1st row: 서울특별시 강서구 가로공원로76길 100 (화곡동)
2nd row: 서울특별시 강서구 화곡로 206, 3층 (화곡동)
3rd row: 서울특별시 강서구 방화동로 120 (방화동,63,158호)
4th row: 서울특별시 강서구 강서로17길 29 (화곡동)
5th row: 서울특별시 강서구 강서로 61, 5층 (화곡동, 영상빌딩)
Value | Count | Frequency (%)
서울특별시 | 156 | 14.5%
강서구 | 156 | 14.5%
마곡동 | 46 | 4.3%
화곡동 | 36 | 3.3%
강서로 | 21 | 1.9%
공항대로 | 20 | 1.9%
등촌동 | 20 | 1.9%
양천로 | 16 | 1.5%
3층 | 13 | 1.2%
2층 | 13 | 1.2%
Other values (393) | 580 | 53.9%

Most occurring characters

ValueCountFrequency (%)
927
 
15.9%
354
 
6.1%
, 238
 
4.1%
1 206
 
3.5%
198
 
3.4%
181
 
3.1%
161
 
2.8%
160
 
2.7%
) 157
 
2.7%
( 157
 
2.7%
Other values (219) 3087
53.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3276 | 56.2%
Decimal Number | 981 | 16.8%
Space Separator | 927 | 15.9%
Other Punctuation | 238 | 4.1%
Close Punctuation | 157 | 2.7%
Open Punctuation | 157 | 2.7%
Uppercase Letter | 46 | 0.8%
Dash Punctuation | 21 | 0.4%
Math Symbol | 20 | 0.3%
Letter Number | 3 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
354
 
10.8%
198
 
6.0%
181
 
5.5%
161
 
4.9%
160
 
4.9%
157
 
4.8%
156
 
4.8%
156
 
4.8%
156
 
4.8%
149
 
4.5%
Other values (191) 1448
44.2%
Decimal Number
ValueCountFrequency (%)
1 206
21.0%
0 137
14.0%
3 130
13.3%
2 128
13.0%
5 88
9.0%
4 76
 
7.7%
6 67
 
6.8%
7 60
 
6.1%
9 45
 
4.6%
8 44
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
B 29
63.0%
A 7
 
15.2%
K 2
 
4.3%
I 2
 
4.3%
J 1
 
2.2%
W 1
 
2.2%
S 1
 
2.2%
C 1
 
2.2%
G 1
 
2.2%
H 1
 
2.2%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
927
100.0%
Other Punctuation
ValueCountFrequency (%)
, 238
100.0%
Close Punctuation
ValueCountFrequency (%)
) 157
100.0%
Open Punctuation
ValueCountFrequency (%)
( 157
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Math Symbol
ValueCountFrequency (%)
~ 20
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3276 | 56.2%
Common | 2501 | 42.9%
Latin | 49 | 0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
354
 
10.8%
198
 
6.0%
181
 
5.5%
161
 
4.9%
160
 
4.9%
157
 
4.8%
156
 
4.8%
156
 
4.8%
156
 
4.8%
149
 
4.5%
Other values (191) 1448
44.2%
Common
ValueCountFrequency (%)
927
37.1%
, 238
 
9.5%
1 206
 
8.2%
) 157
 
6.3%
( 157
 
6.3%
0 137
 
5.5%
3 130
 
5.2%
2 128
 
5.1%
5 88
 
3.5%
4 76
 
3.0%
Other values (6) 257
 
10.3%
Latin
ValueCountFrequency (%)
B 29
59.2%
A 7
 
14.3%
K 2
 
4.1%
2
 
4.1%
I 2
 
4.1%
1
 
2.0%
J 1
 
2.0%
W 1
 
2.0%
S 1
 
2.0%
C 1
 
2.0%
Other values (2) 2
 
4.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3276 | 56.2%
ASCII | 2547 | 43.7%
Number Forms | 3 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
927
36.4%
, 238
 
9.3%
1 206
 
8.1%
) 157
 
6.2%
( 157
 
6.2%
0 137
 
5.4%
3 130
 
5.1%
2 128
 
5.0%
5 88
 
3.5%
4 76
 
3.0%
Other values (16) 303
 
11.9%
Hangul
ValueCountFrequency (%)
354
 
10.8%
198
 
6.0%
181
 
5.5%
161
 
4.9%
160
 
4.9%
157
 
4.8%
156
 
4.8%
156
 
4.8%
156
 
4.8%
149
 
4.5%
Other values (191) 1448
44.2%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%

시설전화번호
Text

MISSING 

Distinct: 116
Distinct (%): 99.1%
Missing: 39
Missing (%): 25.0%
Memory size: 1.3 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.982906
Min length: 11

Characters and Unicode

Total characters: 1402
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 115
Unique (%): 98.3%

Sample

1st row: 02-2602-9184
2nd row: 02-664-4172
3rd row: 02-699-5521
4th row: 02-2691-9981
5th row: 02-2666-5454
Value | Count | Frequency (%)
02-2661-3500 | 2 | 1.7%
02-2658-9678 | 1 | 0.9%
02-3663-9666 | 1 | 0.9%
02-6373-4545 | 1 | 0.9%
02-2603-1778 | 1 | 0.9%
070-8844-9666 | 1 | 0.9%
02-2658-9620 | 1 | 0.9%
02-2658-7781 | 1 | 0.9%
02-2661-9430 | 1 | 0.9%
02-702-7307 | 1 | 0.9%
Other values (106) | 106 | 90.6%

Most occurring characters

ValueCountFrequency (%)
2 241
17.2%
- 234
16.7%
0 214
15.3%
6 212
15.1%
3 89
 
6.3%
8 79
 
5.6%
7 75
 
5.3%
1 74
 
5.3%
5 69
 
4.9%
9 61
 
4.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1168 | 83.3%
Dash Punctuation | 234 | 16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 241
20.6%
0 214
18.3%
6 212
18.2%
3 89
 
7.6%
8 79
 
6.8%
7 75
 
6.4%
1 74
 
6.3%
5 69
 
5.9%
9 61
 
5.2%
4 54
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 234
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1402
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 241
17.2%
- 234
16.7%
0 214
15.3%
6 212
15.1%
3 89
 
6.3%
8 79
 
5.6%
7 75
 
5.3%
1 74
 
5.3%
5 69
 
4.9%
9 61
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1402
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 241
17.2%
- 234
16.7%
0 214
15.3%
6 212
15.1%
3 89
 
6.3%
8 79
 
5.6%
7 75
 
5.3%
1 74
 
5.3%
5 69
 
4.9%
9 61
 
4.4%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
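
Both views can be derived from per-cell missingness flags. The sketch below uses the first five rows of this dataset, reduced to two columns, with `None` standing in for `<NA>`. The helper names `missing_by_column` and `nullity_matrix` are hypothetical, and the sketch computes only the underlying numbers, not the plots.

```python
# First five rows of the dataset, reduced to two columns; None marks <NA>.
rows = [
    {"상호": "백제헬스크럽", "시설전화번호": None},
    {"상호": "제일휘트니스", "시설전화번호": "02-2602-9184"},
    {"상호": "그린헬스크럽", "시설전화번호": "02-664-4172"},
    {"상호": "헬스라인", "시설전화번호": "02-699-5521"},
    {"상호": "하이짐 까치산점", "시설전화번호": None},
]


def missing_by_column(records):
    """Number of missing (None) cells per column."""
    columns = list(records[0].keys())
    return {col: sum(r[col] is None for r in records) for col in columns}


def nullity_matrix(records):
    """One list of flags per record: 1 = value present, 0 = missing."""
    columns = list(records[0].keys())
    return [[0 if r[col] is None else 1 for col in columns] for r in records]


print(missing_by_column(rows))
for flags in nullity_matrix(rows):
    print(flags)
```

Applied to all 156 rows, the per-column count for 시설전화번호 would be the 39 missing values reported in the alerts.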

Sample

Index | 업종 | 상호 | 시설주소 | 시설전화번호
0 | 체력단련장업 | 백제헬스크럽 | 서울특별시 강서구 가로공원로76길 100 (화곡동) | <NA>
1 | 체력단련장업 | 제일휘트니스 | 서울특별시 강서구 화곡로 206, 3층 (화곡동) | 02-2602-9184
2 | 체력단련장업 | 그린헬스크럽 | 서울특별시 강서구 방화동로 120 (방화동,63,158호) | 02-664-4172
3 | 체력단련장업 | 헬스라인 | 서울특별시 강서구 강서로17길 29 (화곡동) | 02-699-5521
4 | 체력단련장업 | 하이짐 까치산점 | 서울특별시 강서구 강서로 61, 5층 (화곡동, 영상빌딩) | <NA>
5 | 체력단련장업 | 화곡역점 스포애니 주)케이디스포츠 | 서울특별시 강서구 화곡로 191, 1층로비,B1~B2층 (화곡동) | 02-2691-9981
6 | 체력단련장업 | GOTO 방화점 | 서울특별시 강서구 금낭화로 135, 금강프라자빌딩 10층 (방화동) | 02-2666-5454
7 | 체력단련장업 | GYM the Classic | 서울특별시 강서구 강서로56가길 47, 5층 (등촌동, 동진빌딩) | 02-3663-3883
8 | 체력단련장업 | KBS 스포츠월드 헬스클럽 | 서울특별시 강서구 공항대로 376 (화곡동) | 02-2600-8855
9 | 체력단련장업 | 영헬스피아 | 서울특별시 강서구 등촌로 47 (화곡동) | 02-2647-1175

Index | 업종 | 상호 | 시설주소 | 시설전화번호
146 | 체력단련장업 | 껀바디(kkunbody) | 서울특별시 강서구 공항대로 247, 퀸즈파크나인 331,332,333호 (마곡동) | <NA>
147 | 체력단련장업 | 가벼워GYM | 서울특별시 강서구 양천로 489, 상가동 2층 203호 (가양동, 가양우성아파트) | <NA>
148 | 체력단련장업 | 레오핏 | 서울특별시 강서구 마곡중앙6로 42, 사이언스타 409호 (마곡동) | <NA>
149 | 체력단련장업 | 이바디 | 서울특별시 강서구 양천로 73, 멤브로메디컬센터 304호 (방화동) | 02-2661-9679
150 | 체력단련장업 | 제이뷰짐 | 서울특별시 강서구 마곡중앙로 59-17, 류마타워2 301~303호 (마곡동) | 02-2135-5632
151 | 체력단련장업 | 라인핏 | 서울특별시 강서구 공항대로 227, 마곡센트럴타워Ⅰ 217호 (마곡동) | <NA>
152 | 체력단련장업 | 행복해GYM | 서울특별시 강서구 마곡서1로 115-1, 마곡헤리움1차 212호 (마곡동) | <NA>
153 | 체력단련장업 | 어뉴바디 PT&Pilates | 서울특별시 강서구 강서로 153, 3층 (화곡동) | <NA>
154 | 체력단련장업 | MK짐 | 서울특별시 강서구 양천로 658, 201호 (염창동) | <NA>
155 | 체력단련장업 | 원필라테스 | 서울특별시 강서구 마곡동로 61, 에이스프라자 504~506호 (마곡동) | <NA>