Overview

Dataset statistics

Number of variables: 4
Number of observations: 379
Missing cells: 141
Missing cells (%): 9.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.0 KiB
Average record size in memory: 32.3 B
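
The missing-cell percentage follows from the other figures in this block. A minimal sketch of the arithmetic, assuming the percentage is taken over all cells (variables times observations); the variable names are illustrative:

```python
# Figures copied from the "Dataset statistics" block of the report.
n_variables = 4
n_observations = 379
missing_cells = 141

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```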

Variable types

Categorical: 1
Text: 3

Dataset

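Description: Daegu Metropolitan City Dong-gu sports facility business information, 20230324 (대구광역시 동구_체육시설업정보_20230324)
Author: Daegu Metropolitan City Dong-gu (대구광역시 동구)
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=3057613&dataSetDetailId=30576131bdcdc70598ff&provdMethod=FILE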

Alerts

시설전화번호 has 141 (37.2%) missing values (alert type: Missing)

Reproduction

Analysis started: 2023-12-10 19:19:21.527242
Analysis finished: 2023-12-10 19:19:22.177604
Duration: 0.65 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
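
The reported duration is the difference between the two timestamps. A small sketch that recomputes it with the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2023-12-10 19:19:21.527242")
finished = datetime.fromisoformat("2023-12-10 19:19:22.177604")

# Subtracting two datetimes yields a timedelta.
duration_s = (finished - started).total_seconds()

print(f"{duration_s:.2f} seconds")
```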

Variables

업종 (business type)
Categorical

Distinct: 11
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

체육도장업: 107
체력단련장업: 73
가상체험 체육시설업: 60
당구장업: 56
골프연습장업: 40
Other values (6): 43

Length

Max length: 10
Median length: 7
Mean length: 5.9525066
Min length: 4

Unique

Unique: 1
Unique (%): 0.3%

Sample

1st row: 수영장업
2nd row: 수영장업
3rd row: 수영장업
4th row: 체육도장업
5th row: 체육도장업

Common Values

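Value | Count | Frequency (%)
체육도장업 (martial arts gym) | 107 | 28.2%
체력단련장업 (fitness center) | 73 | 19.3%
가상체험 체육시설업 (virtual-experience sports facility) | 60 | 15.8%
당구장업 (billiard hall) | 56 | 14.8%
골프연습장업 (golf practice range) | 40 | 10.6%
무도학원업 (dance academy) | 17 | 4.5%
체육교습업 (sports instruction) | 17 | 4.5%
수영장업 (swimming pool) | 3 | 0.8%
종합체육시설업 (comprehensive sports facility) | 3 | 0.8%
인공암벽장업 (artificial climbing wall) | 2 | 0.5%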

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
체육도장업 | 107 | 24.4%
체력단련장업 | 73 | 16.6%
가상체험 | 60 | 13.7%
체육시설업 | 60 | 13.7%
당구장업 | 56 | 12.8%
골프연습장업 | 40 | 9.1%
무도학원업 | 17 | 3.9%
체육교습업 | 17 | 3.9%
수영장업 | 3 | 0.7%
종합체육시설업 | 3 | 0.7%
Other values (2) | 3 | 0.7%
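
The percentages in the word-level table differ from those under Common Values because the denominator is the number of words rather than the number of rows: the 60 rows of 가상체험 체육시설업 contribute two words each. The sketch below reproduces the figures under two assumptions that the report does not state: that words are obtained by splitting on whitespace, and that the one category not listed by name (count 1) is a single word, represented here by the placeholder `<unlisted>`.

```python
from collections import Counter

# Category counts from the report's "Common Values" table.
category_counts = {
    "체육도장업": 107,
    "체력단련장업": 73,
    "가상체험 체육시설업": 60,
    "당구장업": 56,
    "골프연습장업": 40,
    "무도학원업": 17,
    "체육교습업": 17,
    "수영장업": 3,
    "종합체육시설업": 3,
    "인공암벽장업": 2,
    "<unlisted>": 1,  # placeholder: the 11th category is not named in the report
}

# Split each category on whitespace and weight each word by the row count.
word_counts = Counter()
for category, count in category_counts.items():
    for word in category.split():
        word_counts[word] += count

total_words = sum(word_counts.values())


def word_pct(word: str) -> float:
    """Share of all words, in percent, rounded to one decimal."""
    return round(100 * word_counts[word] / total_words, 1)


print(total_words, word_pct("체육도장업"), word_pct("가상체험"))
```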

상호 (business name)
Text

Distinct: 369
Distinct (%): 97.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Length

Max length: 26
Median length: 19
Mean length: 8.298153
Min length: 3

Characters and Unicode

Total characters: 3145
Distinct characters: 376
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
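
As an illustration of these character properties, the sketch below counts Unicode general categories for one business name taken from the sample rows of this report. It uses Python's standard `unicodedata` module, which exposes the general category (for example `Lo` for Other Letter, `Zs` for Space Separator, `Po` for Other Punctuation) but not script or block, so those two breakdowns are not covered here. This is not necessarily how the profiling tool computes its tables.

```python
import unicodedata
from collections import Counter

# One value of the 상호 column, taken from the report's sample rows.
name = "대구메리어트호텔 '인피니티풀'"

# Map every character to its two-letter general category code.
category_counts = Counter(unicodedata.category(ch) for ch in name)

for code, count in category_counts.most_common():
    print(code, count)
```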

Unique

Unique: 359
Unique (%): 94.7%

Sample

1st row: 동촌아쿠아수영장
2nd row: 에이스 어린이 수영장
3rd row: 대구메리어트호텔 '인피니티풀'
4th row: 동성체육관
5th row: 효신태권도장

Value | Count | Frequency (%)
태권도장 | 18 | 3.0%
스크린골프 | 11 | 1.9%
아카데미 | 10 | 1.7%
당구클럽 | 8 | 1.3%
gym | 7 | 1.2%
피트니스 | 5 | 0.8%
합기도 | 5 | 0.8%
골프 | 5 | 0.8%
휘트니스 | 4 | 0.7%
당구장 | 4 | 0.7%
Other values (461) | 516 | 87.0%

Most occurring characters

ValueCountFrequency (%)
214
 
6.8%
160
 
5.1%
101
 
3.2%
100
 
3.2%
93
 
3.0%
88
 
2.8%
84
 
2.7%
74
 
2.4%
73
 
2.3%
63
 
2.0%
Other values (366) 2095
66.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2691
85.6%
Space Separator 214
 
6.8%
Uppercase Letter 146
 
4.6%
Lowercase Letter 29
 
0.9%
Open Punctuation 19
 
0.6%
Close Punctuation 19
 
0.6%
Decimal Number 13
 
0.4%
Other Punctuation 11
 
0.3%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
160
 
5.9%
101
 
3.8%
100
 
3.7%
93
 
3.5%
88
 
3.3%
84
 
3.1%
74
 
2.7%
73
 
2.7%
63
 
2.3%
53
 
2.0%
Other values (312) 1802
67.0%
Uppercase Letter
ValueCountFrequency (%)
G 22
15.1%
M 15
 
10.3%
Y 13
 
8.9%
S 12
 
8.2%
T 12
 
8.2%
E 8
 
5.5%
I 6
 
4.1%
B 5
 
3.4%
P 5
 
3.4%
R 5
 
3.4%
Other values (14) 43
29.5%
Lowercase Letter
ValueCountFrequency (%)
s 5
17.2%
e 4
13.8%
l 4
13.8%
n 3
10.3%
g 2
 
6.9%
i 2
 
6.9%
k 1
 
3.4%
a 1
 
3.4%
b 1
 
3.4%
r 1
 
3.4%
Other values (5) 5
17.2%
Decimal Number
ValueCountFrequency (%)
2 5
38.5%
4 3
23.1%
3 2
 
15.4%
5 1
 
7.7%
6 1
 
7.7%
1 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
' 4
36.4%
& 3
27.3%
. 2
18.2%
, 1
 
9.1%
: 1
 
9.1%
Space Separator
ValueCountFrequency (%)
214
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2691
85.6%
Common 279
 
8.9%
Latin 175
 
5.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
160
 
5.9%
101
 
3.8%
100
 
3.7%
93
 
3.5%
88
 
3.3%
84
 
3.1%
74
 
2.7%
73
 
2.7%
63
 
2.3%
53
 
2.0%
Other values (312) 1802
67.0%
Latin
ValueCountFrequency (%)
G 22
 
12.6%
M 15
 
8.6%
Y 13
 
7.4%
S 12
 
6.9%
T 12
 
6.9%
E 8
 
4.6%
I 6
 
3.4%
B 5
 
2.9%
P 5
 
2.9%
R 5
 
2.9%
Other values (29) 72
41.1%
Common
ValueCountFrequency (%)
214
76.7%
( 19
 
6.8%
) 19
 
6.8%
2 5
 
1.8%
' 4
 
1.4%
4 3
 
1.1%
- 3
 
1.1%
& 3
 
1.1%
. 2
 
0.7%
3 2
 
0.7%
Other values (5) 5
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2691
85.6%
ASCII 454
 
14.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
214
47.1%
G 22
 
4.8%
( 19
 
4.2%
) 19
 
4.2%
M 15
 
3.3%
Y 13
 
2.9%
S 12
 
2.6%
T 12
 
2.6%
E 8
 
1.8%
I 6
 
1.3%
Other values (44) 114
25.1%
Hangul
ValueCountFrequency (%)
160
 
5.9%
101
 
3.8%
100
 
3.7%
93
 
3.5%
88
 
3.3%
84
 
3.1%
74
 
2.7%
73
 
2.7%
63
 
2.3%
53
 
2.0%
Other values (312) 1802
67.0%

시설주소(도로명) (facility address, road name)
Text

Distinct: 370
Distinct (%): 97.6%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Length

Max length: 56
Median length: 47
Mean length: 29.662269
Min length: 20

Characters and Unicode

Total characters: 11242
Distinct characters: 217
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 361
Unique (%): 95.3%

Sample

1st row: 대구광역시 동구 동촌로 168 (방촌동, 대구동촌초등학교)
2nd row: 대구광역시 동구 경안로 722, 지하1층 (동호동)
3rd row: 대구광역시 동구 동부로26길 6, 대구 메리어트 호텔 및 서비스드 레지던스 옥상층 (신천동)
4th row: 대구광역시 동구 아양로11길 39-5 (신암동)
5th row: 대구광역시 동구 화랑로11길 26, 상가동 1층 (신천동, 코스모스아파트)

Value | Count | Frequency (%)
대구광역시 | 379 | 16.2%
동구 | 379 | 16.2%
2층 | 60 | 2.6%
3층 | 48 | 2.0%
신천동 | 46 | 2.0%
율하동 | 42 | 1.8%
신암동 | 38 | 1.6%
신서동 | 35 | 1.5%
방촌동 | 34 | 1.4%
봉무동 | 31 | 1.3%
Other values (509) | 1254 | 53.5%

Most occurring characters

ValueCountFrequency (%)
1973
17.6%
932
 
8.3%
775
 
6.9%
407
 
3.6%
389
 
3.5%
384
 
3.4%
384
 
3.4%
382
 
3.4%
( 379
 
3.4%
) 379
 
3.4%
Other values (207) 4858
43.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6214
55.3%
Space Separator 1973
 
17.6%
Decimal Number 1870
 
16.6%
Open Punctuation 379
 
3.4%
Close Punctuation 379
 
3.4%
Other Punctuation 366
 
3.3%
Dash Punctuation 52
 
0.5%
Uppercase Letter 6
 
0.1%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
932
15.0%
775
 
12.5%
407
 
6.5%
389
 
6.3%
384
 
6.2%
384
 
6.2%
382
 
6.1%
229
 
3.7%
163
 
2.6%
155
 
2.5%
Other values (185) 2014
32.4%
Decimal Number
ValueCountFrequency (%)
1 344
18.4%
2 334
17.9%
3 243
13.0%
0 235
12.6%
5 197
10.5%
4 177
9.5%
6 119
 
6.4%
7 80
 
4.3%
9 76
 
4.1%
8 65
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
B 2
33.3%
H 1
16.7%
G 1
16.7%
J 1
16.7%
M 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 361
98.6%
. 5
 
1.4%
Space Separator
ValueCountFrequency (%)
1973
100.0%
Open Punctuation
ValueCountFrequency (%)
( 379
100.0%
Close Punctuation
ValueCountFrequency (%)
) 379
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6214
55.3%
Common 5022
44.7%
Latin 6
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
932
15.0%
775
 
12.5%
407
 
6.5%
389
 
6.3%
384
 
6.2%
384
 
6.2%
382
 
6.1%
229
 
3.7%
163
 
2.6%
155
 
2.5%
Other values (185) 2014
32.4%
Common
ValueCountFrequency (%)
1973
39.3%
( 379
 
7.5%
) 379
 
7.5%
, 361
 
7.2%
1 344
 
6.8%
2 334
 
6.7%
3 243
 
4.8%
0 235
 
4.7%
5 197
 
3.9%
4 177
 
3.5%
Other values (7) 400
 
8.0%
Latin
ValueCountFrequency (%)
B 2
33.3%
H 1
16.7%
G 1
16.7%
J 1
16.7%
M 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6214
55.3%
ASCII 5028
44.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1973
39.2%
( 379
 
7.5%
) 379
 
7.5%
, 361
 
7.2%
1 344
 
6.8%
2 334
 
6.6%
3 243
 
4.8%
0 235
 
4.7%
5 197
 
3.9%
4 177
 
3.5%
Other values (12) 406
 
8.1%
Hangul
ValueCountFrequency (%)
932
15.0%
775
 
12.5%
407
 
6.5%
389
 
6.3%
384
 
6.2%
384
 
6.2%
382
 
6.1%
229
 
3.7%
163
 
2.6%
155
 
2.5%
Other values (185) 2014
32.4%

시설전화번호 (facility phone number)
Text

Distinct: 234
Distinct (%): 98.3%
Missing: 141
Missing (%): 37.2%
Memory size: 3.1 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 2856
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 230
Unique (%): 96.6%

Sample

1st row: 053-986-5012
2nd row: 053-793-6452
3rd row: 053-327-7000
4th row: 053-942-6601
5th row: 053-756-1711
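
Every non-missing phone number has length 12, and the sample rows all follow a three-three-four digit layout separated by hyphens. The sketch below checks the sample values against that layout and shows that the character totals reported for this variable are consistent with it. The regular expression is an assumption inferred from the sample rows, not something the report states.

```python
import re

# Assumed layout: 3 digits, hyphen, 3 digits, hyphen, 4 digits (12 characters).
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")

# Sample values copied from the report.
samples = [
    "053-986-5012",
    "053-793-6452",
    "053-327-7000",
    "053-942-6601",
    "053-756-1711",
]
all_match = all(PHONE_PATTERN.fullmatch(s) is not None for s in samples)

# Character totals implied by the layout for the non-missing rows.
non_missing = 379 - 141           # observations minus missing values
total_chars = non_missing * 12    # report: Total characters 2856
total_dashes = non_missing * 2    # report: Dash Punctuation 476
total_digits = non_missing * 10   # report: Decimal Number 2380

print(all_match, non_missing, total_chars, total_dashes, total_digits)
```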

Value | Count | Frequency (%)
053-942-0002 | 2 | 0.8%
053-952-3000 | 2 | 0.8%
053-965-9994 | 2 | 0.8%
053-965-0755 | 2 | 0.8%
053-964-8368 | 1 | 0.4%
053-741-8860 | 1 | 0.4%
053-986-5012 | 1 | 0.4%
053-953-6809 | 1 | 0.4%
053-986-1902 | 1 | 0.4%
053-952-9777 | 1 | 0.4%
Other values (224) | 224 | 94.1%

Most occurring characters

ValueCountFrequency (%)
- 476
16.7%
0 432
15.1%
5 400
14.0%
3 355
12.4%
9 281
9.8%
6 191
6.7%
8 178
 
6.2%
7 148
 
5.2%
1 138
 
4.8%
4 133
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2380
83.3%
Dash Punctuation 476
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 432
18.2%
5 400
16.8%
3 355
14.9%
9 281
11.8%
6 191
8.0%
8 178
7.5%
7 148
 
6.2%
1 138
 
5.8%
4 133
 
5.6%
2 124
 
5.2%
Dash Punctuation
ValueCountFrequency (%)
- 476
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2856
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 476
16.7%
0 432
15.1%
5 400
14.0%
3 355
12.4%
9 281
9.8%
6 191
6.7%
8 178
 
6.2%
7 148
 
5.2%
1 138
 
4.8%
4 133
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2856
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 476
16.7%
0 432
15.1%
5 400
14.0%
3 355
12.4%
9 281
9.8%
6 191
6.7%
8 178
 
6.2%
7 148
 
5.2%
1 138
 
4.8%
4 133
 
4.7%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
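
Nullity by column is simply the count of missing entries per column. A minimal standard-library sketch, using two columns from the first ten sample rows of this report with `None` standing in for `<NA>`; the dictionary layout is illustrative, not the tool's internal representation:

```python
# Two columns from the first ten sample rows; None marks a missing value.
columns = {
    "업종": [
        "수영장업", "수영장업", "수영장업", "체육도장업", "체육도장업",
        "체육도장업", "체육도장업", "체육도장업", "체육도장업", "체육도장업",
    ],
    "시설전화번호": [
        "053-986-5012", "053-793-6452", "053-327-7000", "053-942-6601",
        "053-756-1711", "053-963-1102", None, "053-982-9924", None,
        "053-963-6520",
    ],
}

# Count missing entries per column.
nullity = {
    name: sum(value is None for value in values)
    for name, values in columns.items()
}

for name, missing in nullity.items():
    print(f"{name}: {missing} of {len(columns[name])} missing")
```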

Sample

Index | 업종 | 상호 | 시설주소(도로명) | 시설전화번호
0 | 수영장업 | 동촌아쿠아수영장 | 대구광역시 동구 동촌로 168 (방촌동, 대구동촌초등학교) | 053-986-5012
1 | 수영장업 | 에이스 어린이 수영장 | 대구광역시 동구 경안로 722, 지하1층 (동호동) | 053-793-6452
2 | 수영장업 | 대구메리어트호텔 '인피니티풀' | 대구광역시 동구 동부로26길 6, 대구 메리어트 호텔 및 서비스드 레지던스 옥상층 (신천동) | 053-327-7000
3 | 체육도장업 | 동성체육관 | 대구광역시 동구 아양로11길 39-5 (신암동) | 053-942-6601
4 | 체육도장업 | 효신태권도장 | 대구광역시 동구 화랑로11길 26, 상가동 1층 (신천동, 코스모스아파트) | 053-756-1711
5 | 체육도장업 | 반야월태권도장 | 대구광역시 동구 반야월로14길 13 (율하동) | 053-963-1102
6 | 체육도장업 | 힘찬태권도 | 대구광역시 동구 동호로3길 3, 3.4층 (동호동) | <NA>
7 | 체육도장업 | 지묘경희도장 | 대구광역시 동구 팔공로101길 55, 상가동 301호 (지묘동, 팔공보성2차아파트) | 053-982-9924
8 | 체육도장업 | 최강키즈태권도장 | 대구광역시 동구 팔공로31길 1, 3층 (불로동) | <NA>
9 | 체육도장업 | 보람체육관 | 대구광역시 동구 율하동로24길 76, 2층 (서호동) | 053-963-6520

Index | 업종 | 상호 | 시설주소(도로명) | 시설전화번호
369 | 체육교습업 | 제제(ZEZE)스포츠스쿨 | 대구광역시 동구 팔공로51길 15-13, 4층 (봉무동) | <NA>
370 | 체육교습업 | 레인보우 음악줄넘기 | 대구광역시 동구 동호로9길 75, 2층 (신서동) | <NA>
371 | 체육교습업 | 위너키즈스포츠 율하신서점 | 대구광역시 동구 안심로22길 60, 동흥메디칼 602호 (율하동) | 053-965-8288
372 | 체육교습업 | 위드스포츠 혁신점 | 대구광역시 동구 경안로 938, 203-1호 (각산동) | <NA>
373 | 체육교습업 | 유니온축구클럽 | 대구광역시 동구 메디밸리로 5-21, 3층 301,302호 (대림동) | <NA>
374 | 체육교습업 | 윤창열 축구교실 | 대구광역시 동구 첨단로8길 8, 4층 402호, 404호 (신서동) | 053-965-2242
375 | 체육교습업 | 스카이 스포츠아카데미지점 | 대구광역시 동구 안심로 80, 롯데쇼핑프라자 3층 (율하동) | <NA>
376 | 체육교습업 | 줄친구 점프점프 | 대구광역시 동구 경안로 820, 2층 201호 (각산동) | 053-961-9799
377 | 인공암벽장업 | 다이노캣 클라이밍 짐 | 대구광역시 동구 안심로 52, 5층 (율하동) | 053-962-5331
378 | 인공암벽장업 | 동구 펀앤펀클라이밍센터 | 대구광역시 동구 안심로 366, 지하1층 (신서동) | <NA>