Overview

Dataset statistics

Number of variables4
Number of observations356
Missing cells146
Missing cells (%)10.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.3 KiB
Average record size in memory32.4 B

Variable types

Categorical1
Text3

Dataset

Description대구광역시 동구_체육시설업정보_20220422
Author대구광역시 동구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=3057613&dataSetDetailId=305761319feea53b72d1&provdMethod=FILE

Alerts

시설전화번호 has 146 (41.0%) missing valuesMissing

Reproduction

Analysis started2023-12-10 18:33:14.186751
Analysis finished2023-12-10 18:33:15.578808
Duration1.39 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct10
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
체육도장업
107 
체력단련장업
62 
당구장업
59 
골프연습장업
51 
가상체험 체육시설업
37 
Other values (5)
40 

Length

Max length10
Median length7
Mean length5.6769663
Min length4

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row수영장업
2nd row수영장업
3rd row수영장업
4th row체육도장업
5th row체육도장업

Common Values

ValueCountFrequency (%)
체육도장업 107
30.1%
체력단련장업 62
17.4%
당구장업 59
16.6%
골프연습장업 51
14.3%
가상체험 체육시설업 37
 
10.4%
무도학원업 18
 
5.1%
체육교습업 15
 
4.2%
수영장업 3
 
0.8%
종합체육시설업 3
 
0.8%
빙상장업 1
 
0.3%

Length

2023-12-11T03:33:15.741749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T03:33:16.001510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
체육도장업 107
27.2%
체력단련장업 62
15.8%
당구장업 59
15.0%
골프연습장업 51
13.0%
가상체험 37
 
9.4%
체육시설업 37
 
9.4%
무도학원업 18
 
4.6%
체육교습업 15
 
3.8%
수영장업 3
 
0.8%
종합체육시설업 3
 
0.8%

상호
Text

Distinct345
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-11T03:33:16.584477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length19
Mean length8.0449438
Min length3

Characters and Unicode

Total characters2864
Distinct characters352
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique334 ?
Unique (%)93.8%

Sample

1st row동촌아쿠아수영장
2nd row월드스포츠스쿨 아쿠아센터
3rd row대구메리어트호텔 '인피니티풀'
4th row동성체육관
5th row효신태권도장
ValueCountFrequency (%)
태권도장 15
 
2.8%
스크린골프 9
 
1.7%
아카데미 9
 
1.7%
당구클럽 8
 
1.5%
합기도 6
 
1.1%
당구장 5
 
0.9%
피트니스 5
 
0.9%
골프 5
 
0.9%
휘트니스 5
 
0.9%
계명대 4
 
0.7%
Other values (419) 464
86.7%
2023-12-11T03:33:17.382429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
179
 
6.2%
152
 
5.3%
102
 
3.6%
89
 
3.1%
88
 
3.1%
83
 
2.9%
82
 
2.9%
76
 
2.7%
74
 
2.6%
54
 
1.9%
Other values (342) 1885
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2505
87.5%
Space Separator 179
 
6.2%
Uppercase Letter 106
 
3.7%
Lowercase Letter 23
 
0.8%
Close Punctuation 15
 
0.5%
Open Punctuation 15
 
0.5%
Decimal Number 9
 
0.3%
Other Punctuation 9
 
0.3%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
152
 
6.1%
102
 
4.1%
89
 
3.6%
88
 
3.5%
83
 
3.3%
82
 
3.3%
76
 
3.0%
74
 
3.0%
54
 
2.2%
52
 
2.1%
Other values (296) 1653
66.0%
Uppercase Letter
ValueCountFrequency (%)
G 20
18.9%
M 11
10.4%
S 10
 
9.4%
Y 10
 
9.4%
T 6
 
5.7%
B 5
 
4.7%
R 5
 
4.7%
F 4
 
3.8%
E 4
 
3.8%
I 4
 
3.8%
Other values (12) 27
25.5%
Lowercase Letter
ValueCountFrequency (%)
l 4
17.4%
s 4
17.4%
e 3
13.0%
n 3
13.0%
i 2
8.7%
b 1
 
4.3%
g 1
 
4.3%
r 1
 
4.3%
a 1
 
4.3%
o 1
 
4.3%
Other values (2) 2
8.7%
Decimal Number
ValueCountFrequency (%)
2 5
55.6%
3 2
 
22.2%
8 1
 
11.1%
7 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
' 3
33.3%
& 3
33.3%
. 2
22.2%
, 1
 
11.1%
Space Separator
ValueCountFrequency (%)
179
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2505
87.5%
Common 230
 
8.0%
Latin 129
 
4.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
152
 
6.1%
102
 
4.1%
89
 
3.6%
88
 
3.5%
83
 
3.3%
82
 
3.3%
76
 
3.0%
74
 
3.0%
54
 
2.2%
52
 
2.1%
Other values (296) 1653
66.0%
Latin
ValueCountFrequency (%)
G 20
 
15.5%
M 11
 
8.5%
S 10
 
7.8%
Y 10
 
7.8%
T 6
 
4.7%
B 5
 
3.9%
R 5
 
3.9%
l 4
 
3.1%
F 4
 
3.1%
E 4
 
3.1%
Other values (24) 50
38.8%
Common
ValueCountFrequency (%)
179
77.8%
) 15
 
6.5%
( 15
 
6.5%
2 5
 
2.2%
' 3
 
1.3%
& 3
 
1.3%
- 3
 
1.3%
3 2
 
0.9%
. 2
 
0.9%
8 1
 
0.4%
Other values (2) 2
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2505
87.5%
ASCII 359
 
12.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
179
49.9%
G 20
 
5.6%
) 15
 
4.2%
( 15
 
4.2%
M 11
 
3.1%
S 10
 
2.8%
Y 10
 
2.8%
T 6
 
1.7%
B 5
 
1.4%
R 5
 
1.4%
Other values (36) 83
23.1%
Hangul
ValueCountFrequency (%)
152
 
6.1%
102
 
4.1%
89
 
3.6%
88
 
3.5%
83
 
3.3%
82
 
3.3%
76
 
3.0%
74
 
3.0%
54
 
2.2%
52
 
2.1%
Other values (296) 1653
66.0%
Distinct346
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-11T03:33:17.976562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length45
Mean length28.988764
Min length20

Characters and Unicode

Total characters10320
Distinct characters202
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique336 ?
Unique (%)94.4%

Sample

1st row대구광역시 동구 동촌로 168 (방촌동, 대구동촌초등학교)
2nd row대구광역시 동구 경안로 722, B1층 (동호동)
3rd row대구광역시 동구 동부로26길 6, 대구 메리어트 호텔 및 서비스드 레지던스 옥상층 (신천동)
4th row대구광역시 동구 아양로11길 39-5 (신암동)
5th row대구광역시 동구 화랑로11길 26, 상가동 1층 (신천동, 코스모스아파트)
ValueCountFrequency (%)
대구광역시 356
 
16.4%
동구 356
 
16.4%
2층 54
 
2.5%
3층 45
 
2.1%
신천동 44
 
2.0%
율하동 37
 
1.7%
신암동 37
 
1.7%
방촌동 33
 
1.5%
신서동 31
 
1.4%
동촌로 28
 
1.3%
Other values (463) 1147
52.9%
2023-12-11T03:33:18.935639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1820
17.6%
869
 
8.4%
725
 
7.0%
380
 
3.7%
365
 
3.5%
360
 
3.5%
360
 
3.5%
358
 
3.5%
( 356
 
3.4%
) 356
 
3.4%
Other values (192) 4371
42.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5776
56.0%
Space Separator 1820
 
17.6%
Decimal Number 1649
 
16.0%
Open Punctuation 356
 
3.4%
Close Punctuation 356
 
3.4%
Other Punctuation 314
 
3.0%
Dash Punctuation 41
 
0.4%
Uppercase Letter 5
 
< 0.1%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
869
15.0%
725
 
12.6%
380
 
6.6%
365
 
6.3%
360
 
6.2%
360
 
6.2%
358
 
6.2%
202
 
3.5%
153
 
2.6%
147
 
2.5%
Other values (172) 1857
32.2%
Decimal Number
ValueCountFrequency (%)
1 307
18.6%
2 289
17.5%
3 208
12.6%
0 196
11.9%
5 170
10.3%
4 164
9.9%
6 113
 
6.9%
7 75
 
4.5%
9 70
 
4.2%
8 57
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
B 3
60.0%
M 1
 
20.0%
J 1
 
20.0%
Other Punctuation
ValueCountFrequency (%)
, 310
98.7%
. 4
 
1.3%
Space Separator
ValueCountFrequency (%)
1820
100.0%
Open Punctuation
ValueCountFrequency (%)
( 356
100.0%
Close Punctuation
ValueCountFrequency (%)
) 356
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5776
56.0%
Common 4539
44.0%
Latin 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
869
15.0%
725
 
12.6%
380
 
6.6%
365
 
6.3%
360
 
6.2%
360
 
6.2%
358
 
6.2%
202
 
3.5%
153
 
2.6%
147
 
2.5%
Other values (172) 1857
32.2%
Common
ValueCountFrequency (%)
1820
40.1%
( 356
 
7.8%
) 356
 
7.8%
, 310
 
6.8%
1 307
 
6.8%
2 289
 
6.4%
3 208
 
4.6%
0 196
 
4.3%
5 170
 
3.7%
4 164
 
3.6%
Other values (7) 363
 
8.0%
Latin
ValueCountFrequency (%)
B 3
60.0%
M 1
 
20.0%
J 1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5776
56.0%
ASCII 4544
44.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1820
40.1%
( 356
 
7.8%
) 356
 
7.8%
, 310
 
6.8%
1 307
 
6.8%
2 289
 
6.4%
3 208
 
4.6%
0 196
 
4.3%
5 170
 
3.7%
4 164
 
3.6%
Other values (10) 368
 
8.1%
Hangul
ValueCountFrequency (%)
869
15.0%
725
 
12.6%
380
 
6.6%
365
 
6.3%
360
 
6.2%
360
 
6.2%
358
 
6.2%
202
 
3.5%
153
 
2.6%
147
 
2.5%
Other values (172) 1857
32.2%

시설전화번호
Text

MISSING 

Distinct206
Distinct (%)98.1%
Missing146
Missing (%)41.0%
Memory size2.9 KiB
2023-12-11T03:33:19.450092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.004762
Min length12

Characters and Unicode

Total characters2521
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique202 ?
Unique (%)96.2%

Sample

1st row053-986-5012
2nd row053-793-6452
3rd row053-327-7000
4th row053-942-6601
5th row053-756-1711
ValueCountFrequency (%)
053-965-9994 2
 
1.0%
053-986-0050 2
 
1.0%
053-952-3000 2
 
1.0%
053-965-0755 2
 
1.0%
053-986-1902 1
 
0.5%
053-212-8899 1
 
0.5%
053-964-0331 1
 
0.5%
053-986-5012 1
 
0.5%
053-952-9777 1
 
0.5%
053-943-7755 1
 
0.5%
Other values (196) 196
93.3%
2023-12-11T03:33:20.167365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 420
16.7%
0 383
15.2%
5 365
14.5%
3 316
12.5%
9 246
9.8%
6 167
 
6.6%
8 158
 
6.3%
7 127
 
5.0%
1 119
 
4.7%
2 110
 
4.4%
Other values (2) 110
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2100
83.3%
Dash Punctuation 420
 
16.7%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 383
18.2%
5 365
17.4%
3 316
15.0%
9 246
11.7%
6 167
8.0%
8 158
7.5%
7 127
 
6.0%
1 119
 
5.7%
2 110
 
5.2%
4 109
 
5.2%
Dash Punctuation
ValueCountFrequency (%)
- 420
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2521
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 420
16.7%
0 383
15.2%
5 365
14.5%
3 316
12.5%
9 246
9.8%
6 167
 
6.6%
8 158
 
6.3%
7 127
 
5.0%
1 119
 
4.7%
2 110
 
4.4%
Other values (2) 110
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2521
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 420
16.7%
0 383
15.2%
5 365
14.5%
3 316
12.5%
9 246
9.8%
6 167
 
6.6%
8 158
 
6.3%
7 127
 
5.0%
1 119
 
4.7%
2 110
 
4.4%
Other values (2) 110
 
4.4%

Missing values

2023-12-11T03:33:15.337057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T03:33:15.506979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호시설주소(도로명)시설전화번호
0수영장업동촌아쿠아수영장대구광역시 동구 동촌로 168 (방촌동, 대구동촌초등학교)053-986-5012
1수영장업월드스포츠스쿨 아쿠아센터대구광역시 동구 경안로 722, B1층 (동호동)053-793-6452
2수영장업대구메리어트호텔 '인피니티풀'대구광역시 동구 동부로26길 6, 대구 메리어트 호텔 및 서비스드 레지던스 옥상층 (신천동)053-327-7000
3체육도장업동성체육관대구광역시 동구 아양로11길 39-5 (신암동)053-942-6601
4체육도장업효신태권도장대구광역시 동구 화랑로11길 26, 상가동 1층 (신천동, 코스모스아파트)053-756-1711
5체육도장업반야월태권도장대구광역시 동구 반야월로14길 13 (율하동)053-963-1102
6체육도장업힘찬태권도대구광역시 동구 동호로3길 3, 3.4층 (동호동)<NA>
7체육도장업지묘경희도장대구광역시 동구 팔공로101길 55, 상가동 301호 (지묘동, 팔공보성2차아파트)053-982-9924
8체육도장업최강키즈태권도장대구광역시 동구 팔공로31길 1, 3층 (불로동)<NA>
9체육도장업보람체육관대구광역시 동구 율하동로24길 76, 2층 (서호동)053-963-6520
업종상호시설주소(도로명)시설전화번호
346체육교습업런투유줄넘기클럽(신천점)대구광역시 동구 송라로10길 25, 2층 (신천동)<NA>
347체육교습업임진미 줌바스튜디오, 점프윙스 줄넘기클럽 동호점대구광역시 동구 동호로 75, 4층 403호 (신서동)<NA>
348체육교습업온 배드민턴대구광역시 동구 둔산로40길 17, 가동 (방촌동)053-981-0754
349체육교습업레인보우음악줄넘기대구광역시 동구 안심로16길 47, 율하동 타임스퀘어 2층 210호 (율하동)<NA>
350체육교습업제제(ZEZE)스포츠스쿨대구광역시 동구 팔공로51길 15-13, 4층 (봉무동)<NA>
351체육교습업레인보우 음악줄넘기대구광역시 동구 동호로9길 75, 2층 (신서동)<NA>
352체육교습업아이비 키즈 스포츠 율하점대구광역시 동구 안심로22길 60, 동흥메디칼 602호 (율하동)053-965-8288
353체육교습업위드스포츠 혁신점대구광역시 동구 경안로 938, 203-1호 (각산동)<NA>
354체육교습업유니온축구클럽대구광역시 동구 메디밸리로 5-21, 3층 301,302호 (대림동)<NA>
355체육교습업윤창열 축구교실대구광역시 동구 첨단로8길 8, 4층 402호, 404호 (신서동)053-965-2242