Overview

Dataset statistics

Number of variables: 4
Number of observations: 806
Missing cells: 421
Missing cells (%): 13.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 25.3 KiB
Average record size in memory: 32.2 B
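
The missing-cell percentage can be checked by hand from the other figures in this block. The short sketch below assumes the percentage is the missing-cell count divided by the total number of cells (variables times observations); the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 4
n_observations = 806
missing_cells = 421

# Assumption: the percentage is taken over all cells in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

All 421 missing cells come from the 시설전화번호 column, which is why the per-column figure in the Alerts section (52.2%) is four times the dataset-wide one.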

Variable types

Categorical: 1
Text: 3

Dataset

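Description: Status of private sports facilities within Bucheon City, providing information such as business type, business name, location address (road name), and location telephone number.
URL: https://www.data.go.kr/data/3078656/fileData.do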

Alerts

시설전화번호 has 421 (52.2%) missing values [Missing]

Reproduction

Analysis started: 2023-12-12 09:13:11.993677
Analysis finished: 2023-12-12 09:13:12.681971
Duration: 0.69 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종
Categorical

Distinct: 11
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

체육도장업: 230
당구장업: 218
체력단련장업: 191
골프연습장업: 81
가상체험 체육시설업: 46
Other values (6): 40

Length

Max length: 10
Median length: 7
Mean length: 5.3498759
Min length: 4

Unique

Unique: 2
Unique (%): 0.2%

Sample

1st row: 수영장업
2nd row: 체육도장업
3rd row: 체육도장업
4th row: 체육도장업
5th row: 체육도장업

Common Values

Value | Count | Frequency (%)
체육도장업 | 230 | 28.5%
당구장업 | 218 | 27.0%
체력단련장업 | 191 | 23.7%
골프연습장업 | 81 | 10.0%
가상체험 체육시설업 | 46 | 5.7%
체육교습업 | 20 | 2.5%
수영장업 | 11 | 1.4%
인공암벽장업 | 4 | 0.5%
종합체육시설업 | 3 | 0.4%
무도학원업 | 1 | 0.1%
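
The frequency column can be reproduced from the counts. The sketch below assumes each percentage is the count divided by the 806 observations, rounded to one decimal place; the dictionary and variable names are illustrative. The ten listed counts sum to 805, so the one remaining observation presumably belongs to the eleventh distinct value, which this table does not show.

```python
# Counts copied from the "Common Values" table for 업종.
counts = {
    "체육도장업": 230,
    "당구장업": 218,
    "체력단련장업": 191,
    "골프연습장업": 81,
    "가상체험 체육시설업": 46,
    "체육교습업": 20,
    "수영장업": 11,
    "인공암벽장업": 4,
    "종합체육시설업": 3,
    "무도학원업": 1,
}
n_observations = 806  # from "Dataset statistics"

# Assumption: frequency = count / number of observations.
frequencies = {
    value: f"{100 * count / n_observations:.1f}%"
    for value, count in counts.items()
}

for value, count in counts.items():
    print(f"{value} | {count} | {frequencies[value]}")

# Observations not accounted for by the ten listed values.
unlisted = n_observations - sum(counts.values())
print(f"unlisted observations: {unlisted}")
```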

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
체육도장업 | 230 | 27.0%
당구장업 | 218 | 25.6%
체력단련장업 | 191 | 22.4%
골프연습장업 | 81 | 9.5%
가상체험 | 46 | 5.4%
체육시설업 | 46 | 5.4%
체육교습업 | 20 | 2.3%
수영장업 | 11 | 1.3%
인공암벽장업 | 4 | 0.5%
종합체육시설업 | 3 | 0.4%
Other values (2) | 2 | 0.2%

상호
Text

Distinct: 773
Distinct (%): 95.9%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Length

Max length: 26
Median length: 20
Mean length: 7.7866005
Min length: 2

Characters and Unicode

Total characters: 6276
Distinct characters: 458
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
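
As a rough illustration of how such per-character properties can be looked up, the sketch below uses Python's standard `unicodedata` module to count general categories in one value from the sample. The label mapping and the `category_counts` helper are hypothetical names chosen here to match the category names in this report; the profiling tool's own implementation may differ.

```python
import unicodedata
from collections import Counter

# Illustrative mapping from Unicode general-category codes to the
# labels used in this report (only the categories that appear in it).
CATEGORY_LABELS = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Nl": "Letter Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(text):
    """Count characters per general category (hypothetical helper)."""
    # Normalise so Hangul syllables are single precomposed code points.
    text = unicodedata.normalize("NFC", text)
    counts = Counter()
    for ch in text:
        code = unicodedata.category(ch)
        counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts


counts = category_counts("JS 복싱휘트니스클럽")
for label, n in counts.most_common():
    print(f"{label} | {n}")
```

Hangul syllables fall under Other Letter, which is consistent with that category dominating the 상호 column.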

Unique

Unique: 745
Unique (%): 92.4%

Sample

1st row: 룸비니수영장
2nd row: 153태권도장
3rd row: 부천화랑태권도
4th row: 경희대대웅태권도체육관
5th row: 한국체대 참태권도장
Value | Count | Frequency (%)
태권도장 | 41 | 3.4%
당구장 | 26 | 2.2%
당구클럽 | 17 | 1.4%
용인대 | 14 | 1.2%
경희대 | 11 | 0.9%
gym | 10 | 0.8%
상동점 | 10 | 0.8%
휘트니스 | 9 | 0.7%
스크린골프 | 7 | 0.6%
신중동점 | 6 | 0.5%
Other values (878) | 1057 | 87.5%

Most occurring characters

ValueCountFrequency (%)
403
 
6.4%
250
 
4.0%
238
 
3.8%
213
 
3.4%
204
 
3.3%
185
 
2.9%
151
 
2.4%
149
 
2.4%
135
 
2.2%
115
 
1.8%
Other values (448) 4233
67.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5168 | 82.3%
Uppercase Letter | 434 | 6.9%
Space Separator | 403 | 6.4%
Lowercase Letter | 127 | 2.0%
Other Punctuation | 39 | 0.6%
Decimal Number | 37 | 0.6%
Close Punctuation | 34 | 0.5%
Open Punctuation | 30 | 0.5%
Dash Punctuation | 3 | < 0.1%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
250
 
4.8%
238
 
4.6%
213
 
4.1%
204
 
3.9%
185
 
3.6%
151
 
2.9%
149
 
2.9%
135
 
2.6%
115
 
2.2%
111
 
2.1%
Other values (385) 3417
66.1%
Uppercase Letter
ValueCountFrequency (%)
S 46
 
10.6%
G 41
 
9.4%
M 36
 
8.3%
T 35
 
8.1%
B 32
 
7.4%
P 28
 
6.5%
Y 27
 
6.2%
K 21
 
4.8%
A 20
 
4.6%
C 19
 
4.4%
Other values (14) 129
29.7%
Lowercase Letter
ValueCountFrequency (%)
i 14
11.0%
l 13
10.2%
s 13
10.2%
a 11
8.7%
o 11
8.7%
e 10
 
7.9%
n 10
 
7.9%
r 7
 
5.5%
t 6
 
4.7%
b 6
 
4.7%
Other values (11) 26
20.5%
Decimal Number
ValueCountFrequency (%)
2 18
48.6%
0 6
 
16.2%
1 5
 
13.5%
4 2
 
5.4%
6 1
 
2.7%
9 1
 
2.7%
8 1
 
2.7%
7 1
 
2.7%
3 1
 
2.7%
5 1
 
2.7%
Other Punctuation
ValueCountFrequency (%)
& 20
51.3%
. 17
43.6%
' 2
 
5.1%
Space Separator
ValueCountFrequency (%)
403
100.0%
Close Punctuation
ValueCountFrequency (%)
) 34
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Math Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5167 | 82.3%
Latin | 561 | 8.9%
Common | 547 | 8.7%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
250
 
4.8%
238
 
4.6%
213
 
4.1%
204
 
3.9%
185
 
3.6%
151
 
2.9%
149
 
2.9%
135
 
2.6%
115
 
2.2%
111
 
2.1%
Other values (384) 3416
66.1%
Latin
ValueCountFrequency (%)
S 46
 
8.2%
G 41
 
7.3%
M 36
 
6.4%
T 35
 
6.2%
B 32
 
5.7%
P 28
 
5.0%
Y 27
 
4.8%
K 21
 
3.7%
A 20
 
3.6%
C 19
 
3.4%
Other values (35) 256
45.6%
Common
ValueCountFrequency (%)
403
73.7%
) 34
 
6.2%
( 30
 
5.5%
& 20
 
3.7%
2 18
 
3.3%
. 17
 
3.1%
0 6
 
1.1%
1 5
 
0.9%
- 3
 
0.5%
' 2
 
0.4%
Other values (8) 9
 
1.6%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5167 | 82.3%
ASCII | 1107 | 17.6%
Math Operators | 1 | < 0.1%
CJK | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
403
36.4%
S 46
 
4.2%
G 41
 
3.7%
M 36
 
3.3%
T 35
 
3.2%
) 34
 
3.1%
B 32
 
2.9%
( 30
 
2.7%
P 28
 
2.5%
Y 27
 
2.4%
Other values (52) 395
35.7%
Hangul
ValueCountFrequency (%)
250
 
4.8%
238
 
4.6%
213
 
4.1%
204
 
3.9%
185
 
3.6%
151
 
2.9%
149
 
2.9%
135
 
2.6%
115
 
2.2%
111
 
2.1%
Other values (384) 3416
66.1%
Math Operators
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

시설주소(도로명)
Text

Distinct: 793
Distinct (%): 98.4%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Length

Max length: 80
Median length: 45
Mean length: 31.384615
Min length: 19

Characters and Unicode

Total characters: 25296
Distinct characters: 316
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 781
Unique (%): 96.9%

Sample

1st row: 경기도 부천시 소사로 367 (원미동 석왕사)
2nd row: 경기도 부천시 부일로 497 3층 (심곡동)
3rd row: 경기도 부천시 장말로 310 (심곡동)
4th row: 경기도 부천시 부흥로 431 2층3층 (심곡동)
5th row: 경기도 부천시 조종로 34 3~4층 (원미동)
Value | Count | Frequency (%)
경기도 | 806 | 14.9%
부천시 | 806 | 14.9%
중동 | 177 | 3.3%
상동 | 168 | 3.1%
2층 | 112 | 2.1%
3층 | 108 | 2.0%
심곡동 | 62 | 1.1%
길주로 | 52 | 1.0%
송내동 | 46 | 0.9%
소사본동 | 45 | 0.8%
Other values (1087) | 3016 | 55.9%

Most occurring characters

ValueCountFrequency (%)
5490
21.7%
976
 
3.9%
976
 
3.9%
897
 
3.5%
1 875
 
3.5%
870
 
3.4%
863
 
3.4%
( 820
 
3.2%
) 820
 
3.2%
819
 
3.2%
Other values (306) 11890
47.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 13290 | 52.5%
Space Separator | 5490 | 21.7%
Decimal Number | 4662 | 18.4%
Open Punctuation | 820 | 3.2%
Close Punctuation | 820 | 3.2%
Dash Punctuation | 75 | 0.3%
Uppercase Letter | 72 | 0.3%
Math Symbol | 55 | 0.2%
Lowercase Letter | 7 | < 0.1%
Other Punctuation | 4 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
976
 
7.3%
976
 
7.3%
897
 
6.7%
870
 
6.5%
863
 
6.5%
819
 
6.2%
812
 
6.1%
807
 
6.1%
421
 
3.2%
399
 
3.0%
Other values (265) 5450
41.0%
Uppercase Letter
ValueCountFrequency (%)
B 32
44.4%
A 7
 
9.7%
C 6
 
8.3%
S 5
 
6.9%
G 4
 
5.6%
F 4
 
5.6%
R 2
 
2.8%
M 2
 
2.8%
I 2
 
2.8%
E 1
 
1.4%
Other values (7) 7
 
9.7%
Decimal Number
ValueCountFrequency (%)
1 875
18.8%
2 763
16.4%
0 671
14.4%
3 568
12.2%
4 415
8.9%
5 323
 
6.9%
7 305
 
6.5%
6 280
 
6.0%
8 235
 
5.0%
9 227
 
4.9%
Lowercase Letter
ValueCountFrequency (%)
e 2
28.6%
c 1
14.3%
t 1
14.3%
n 1
14.3%
y 1
14.3%
r 1
14.3%
Other Punctuation
ValueCountFrequency (%)
. 3
75.0%
& 1
 
25.0%
Space Separator
ValueCountFrequency (%)
5490
100.0%
Open Punctuation
ValueCountFrequency (%)
( 820
100.0%
Close Punctuation
ValueCountFrequency (%)
) 820
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 75
100.0%
Math Symbol
ValueCountFrequency (%)
~ 55
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 13290 | 52.5%
Common | 11926 | 47.1%
Latin | 80 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
976
 
7.3%
976
 
7.3%
897
 
6.7%
870
 
6.5%
863
 
6.5%
819
 
6.2%
812
 
6.1%
807
 
6.1%
421
 
3.2%
399
 
3.0%
Other values (265) 5450
41.0%
Latin
ValueCountFrequency (%)
B 32
40.0%
A 7
 
8.8%
C 6
 
7.5%
S 5
 
6.2%
G 4
 
5.0%
F 4
 
5.0%
R 2
 
2.5%
M 2
 
2.5%
e 2
 
2.5%
I 2
 
2.5%
Other values (14) 14
17.5%
Common
ValueCountFrequency (%)
5490
46.0%
1 875
 
7.3%
( 820
 
6.9%
) 820
 
6.9%
2 763
 
6.4%
0 671
 
5.6%
3 568
 
4.8%
4 415
 
3.5%
5 323
 
2.7%
7 305
 
2.6%
Other values (7) 876
 
7.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 13290 | 52.5%
ASCII | 12005 | 47.5%
Number Forms | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5490
45.7%
1 875
 
7.3%
( 820
 
6.8%
) 820
 
6.8%
2 763
 
6.4%
0 671
 
5.6%
3 568
 
4.7%
4 415
 
3.5%
5 323
 
2.7%
7 305
 
2.5%
Other values (30) 955
 
8.0%
Hangul
ValueCountFrequency (%)
976
 
7.3%
976
 
7.3%
897
 
6.7%
870
 
6.5%
863
 
6.5%
819
 
6.2%
812
 
6.1%
807
 
6.1%
421
 
3.2%
399
 
3.0%
Other values (265) 5450
41.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

시설전화번호
Text

MISSING 

Distinct: 377
Distinct (%): 97.9%
Missing: 421
Missing (%): 52.2%
Memory size: 6.4 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.038961
Min length: 9

Characters and Unicode

Total characters: 4635
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 369
Unique (%): 95.8%

Sample

1st row: 032-663-7776
2nd row: 032-664-2007
3rd row: 032-213-9909
4th row: 032-208-1213
5th row: 032-654-1010
Value | Count | Frequency (%)
032-677-8207 | 2 | 0.5%
032-323-2889 | 2 | 0.5%
032-614-9300 | 2 | 0.5%
032-665-6989 | 2 | 0.5%
032-677-7700 | 2 | 0.5%
032-324-4799 | 2 | 0.5%
032-327-6655 | 2 | 0.5%
032-321-8820 | 2 | 0.5%
032-655-5959 | 1 | 0.3%
032-657-8855 | 1 | 0.3%
Other values (367) | 367 | 95.3%

Most occurring characters

ValueCountFrequency (%)
3 783
16.9%
- 769
16.6%
2 735
15.9%
0 626
13.5%
6 315
6.8%
7 298
 
6.4%
5 257
 
5.5%
4 228
 
4.9%
8 226
 
4.9%
1 221
 
4.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 3866 | 83.4%
Dash Punctuation | 769 | 16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 783
20.3%
2 735
19.0%
0 626
16.2%
6 315
8.1%
7 298
 
7.7%
5 257
 
6.6%
4 228
 
5.9%
8 226
 
5.8%
1 221
 
5.7%
9 177
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 769
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4635
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 783
16.9%
- 769
16.6%
2 735
15.9%
0 626
13.5%
6 315
6.8%
7 298
 
6.4%
5 257
 
5.5%
4 228
 
4.9%
8 226
 
4.9%
1 221
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4635
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 783
16.9%
- 769
16.6%
2 735
15.9%
0 626
13.5%
6 315
6.8%
7 298
 
6.4%
5 257
 
5.5%
4 228
 
4.9%
8 226
 
4.9%
1 221
 
4.8%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
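
The per-column nullity count behind these plots can be sketched in a few lines. The example below uses three rows taken from the sample table, with `None` standing in for `<NA>`; the `missing_per_column` helper is a hypothetical name and is not how the report itself computes the figure.

```python
# Column names as they appear in the sample table.
columns = ["업종", "상호", "시설주소(도로명)", "시설전화번호"]

# Three rows from the sample; None stands in for <NA>.
rows = [
    ("수영장업", "룸비니수영장", "경기도 부천시 소사로 367 (원미동 석왕사)", "032-663-7776"),
    ("체육도장업", "153태권도장", "경기도 부천시 부일로 497 3층 (심곡동)", None),
    ("체육도장업", "부천화랑태권도", "경기도 부천시 장말로 310 (심곡동)", None),
]


def missing_per_column(columns, rows):
    """Return the number of None cells in each column (hypothetical helper)."""
    return {
        name: sum(row[i] is None for row in rows)
        for i, name in enumerate(columns)
    }


missing = missing_per_column(columns, rows)
for name, n in missing.items():
    print(f"{name}: {n} missing of {len(rows)}")
```

Applied to the full dataset, the same count would give 421 for 시설전화번호 and 0 for the other three columns, according to the figures reported above.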

Sample

# | 업종 | 상호 | 시설주소(도로명) | 시설전화번호
0 | 수영장업 | 룸비니수영장 | 경기도 부천시 소사로 367 (원미동 석왕사) | 032-663-7776
1 | 체육도장업 | 153태권도장 | 경기도 부천시 부일로 497 3층 (심곡동) | <NA>
2 | 체육도장업 | 부천화랑태권도 | 경기도 부천시 장말로 310 (심곡동) | <NA>
3 | 체육도장업 | 경희대대웅태권도체육관 | 경기도 부천시 부흥로 431 2층3층 (심곡동) | <NA>
4 | 체육도장업 | 한국체대 참태권도장 | 경기도 부천시 조종로 34 3~4층 (원미동) | <NA>
5 | 체육도장업 | JS 복싱휘트니스클럽 | 경기도 부천시 부천로 24 2층 (심곡동) | <NA>
6 | 체육도장업 | 고려대태권스쿨 | 경기도 부천시 부천로66번길 57 2층 (심곡동) | <NA>
7 | 체육도장업 | 용인대 대동 태권도 | 경기도 부천시 조마루로366번길 67 2층 (심곡동) | <NA>
8 | 체육도장업 | 하나 태권도장 | 경기도 부천시 장말로294번길 32-3 명성빌딩 2층 (심곡동) | <NA>
9 | 체육도장업 | 원미창조태권도 | 경기도 부천시 부흥로 424 상가층 201호 (심곡동 하나리아벨주상복합아파트) | 032-664-2007

# | 업종 | 상호 | 시설주소(도로명) | 시설전화번호
796 | 체육도장업 | 행복한 동행 삼성태권도장 | 경기도 부천시 오정로252번길 16 3층 (오정동) | <NA>
797 | 체육도장업 | 경희대정훈태권도 | 경기도 부천시 오정로 250 401호 (오정동 신오빌딩) | 032-677-1484
798 | 체육도장업 | 용인대 TOP 복싱 | 경기도 부천시 소사로 745 3층 (원종동 장우빌딩) | 032-219-7989
799 | 체육도장업 | 부천 라온태권도 | 경기도 부천시 수도로 65 2층 (삼정동) | 032-674-2106
800 | 체육도장업 | C.M. 복싱 | 경기도 부천시 오정로 209-4 5층 (오정동) | 032-682-8872
801 | 체육도장업 | 힘찬태권도장 | 경기도 부천시 평천로721번길 21 2층 (삼정동) | 032-675-9793
802 | 체육도장업 | 숭무관 | 경기도 부천시 원종로9번길 59 4층 (원종동) | 032-671-7979
803 | 체육도장업 | 으라차차 복싱부 | 경기도 부천시 중동로 405 4층 (삼정동) | 032-676-1702
804 | 체육도장업 | 파인태권도장 | 경기도 부천시 소사로862번길 89 2층 (원종동) | 032-672-4561
805 | 체육도장업 | 충호태권도장 | 경기도 부천시 부천로476번길 64 2층 (오정동) | 032-677-7145