Overview

Dataset statistics

Number of variables: 6
Number of observations: 112
Missing cells: 447
Missing cells (%): 66.5%
Duplicate rows: 3
Duplicate rows (%): 2.7%
Total size in memory: 5.5 KiB
Average record size in memory: 50.2 B

Variable types

Unsupported: 3
Text: 3

Dataset

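Description: Status of sports facilities managed by 구미시설공단 (address, area, usage fees, reservation information, etc.): 1. 금오테니스장 (tennis), 2. 올림픽기념관 (basketball, badminton), 3. 승마장 (horse riding), 4. 근로자종합복지회관 (table tennis)
Author: 구미시설공단
URL: https://www.data.go.kr/data/15002589/fileData.do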

Alerts

Dataset has 3 (2.7%) duplicate rows [Duplicates]
Unnamed: 0 has 112 (100.0%) missing values [Missing]
금오테니스장 시설현황 has 52 (46.4%) missing values [Missing]
Unnamed: 2 has 65 (58.0%) missing values [Missing]
Unnamed: 3 has 57 (50.9%) missing values [Missing]
Unnamed: 4 has 58 (51.8%) missing values [Missing]
Unnamed: 5 has 103 (92.0%) missing values [Missing]
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
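
The missing-value percentages in the alerts can be cross-checked against the dataset statistics with simple arithmetic. The sketch below copies the per-column counts from the alerts and the row count from the overview; the helper name `missing_percent` is illustrative and is not part of the profiling tool.

```python
# Missing-value counts per column, copied from the alerts in this report.
missing = {
    "Unnamed: 0": 112,
    "금오테니스장 시설현황": 52,
    "Unnamed: 2": 65,
    "Unnamed: 3": 57,
    "Unnamed: 4": 58,
    "Unnamed: 5": 103,
}
n_rows = 112  # "Number of observations" from the overview


def missing_percent(count, total):
    """Share of missing cells as a percentage, rounded to one decimal."""
    return round(100 * count / total, 1)


# Per-column share of missing values.
per_column = {name: missing_percent(c, n_rows) for name, c in missing.items()}

# Dataset-wide totals: all missing cells over rows x columns.
total_missing = sum(missing.values())
overall = missing_percent(total_missing, n_rows * len(missing))

print(per_column)
print(total_missing, overall)
```

The per-column counts sum to the 447 missing cells reported in the overview, and 447 out of 112 x 6 = 672 cells gives the reported 66.5%.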

Reproduction

Analysis started: 2023-12-12 13:42:21.820414
Analysis finished: 2023-12-12 13:42:22.544307
Duration: 0.72 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

Unnamed: 0
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 112
Missing (%): 100.0%
Memory size: 1.1 KiB

금오테니스장 시설현황
Text

MISSING 

Distinct: 55
Distinct (%): 91.7%
Missing: 52
Missing (%): 46.4%
Memory size: 1.0 KiB

Length

Max length: 66
Median length: 42.5
Mean length: 21.833333
Min length: 2

Characters and Unicode

Total characters: 1310
Distinct characters: 190
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
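
As a rough illustration of how such character properties can be read, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The sample string is one of the rows shown in this report; the function name `category_counts` is hypothetical, and this is not necessarily how the profiling tool computes its tables.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category code (e.g. 'Lo', 'Nd')."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "▪ 총사업비 : 12,500백만원"
counts = category_counts(sample)
print(counts)
```

The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter (Hangul syllables), `Nd` is Decimal Number, `Zs` is Space Separator, `Po` is Other Punctuation, and `So` is Other Symbol (the leading bullet).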

Unique

Unique: 52
Unique (%): 86.7%

Sample

1st row▪ 위치 : 구미시 남통동 136-1번지 일원 (산책길 105)
2nd row▪ 총사업비 : 12,500백만원
3rd row▪ 규모 : 건축면적(4,865.51㎡), 연면적 (4,997.74㎡), 지하1층/지상2층
4th row▪ 주요시설 : 테니스장 15면(실내-4면, 실외-11면) 및 부대시설
5th row구 분
ValueCountFrequency (%)
22
 
8.8%
17
 
6.8%
구미시 5
 
2.0%
위치 4
 
1.6%
산책길 4
 
1.6%
매표소 4
 
1.6%
이용접수 4
 
1.6%
오프라인 4
 
1.6%
3
 
1.2%
올림픽기념관 3
 
1.2%
Other values (137) 181
72.1%

Most occurring characters

ValueCountFrequency (%)
245
 
18.7%
0 45
 
3.4%
1 35
 
2.7%
: 30
 
2.3%
26
 
2.0%
, 24
 
1.8%
. 24
 
1.8%
) 23
 
1.8%
( 23
 
1.8%
- 19
 
1.5%
Other values (180) 816
62.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 622 | 47.5%
Space Separator | 245 | 18.7%
Decimal Number | 192 | 14.7%
Other Punctuation | 91 | 6.9%
Lowercase Letter | 60 | 4.6%
Other Symbol | 27 | 2.1%
Close Punctuation | 23 | 1.8%
Open Punctuation | 23 | 1.8%
Dash Punctuation | 19 | 1.5%
Control | 5 | 0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
4.2%
18
 
2.9%
17
 
2.7%
16
 
2.6%
16
 
2.6%
15
 
2.4%
14
 
2.3%
13
 
2.1%
12
 
1.9%
12
 
1.9%
Other values (146) 463
74.4%
Lowercase Letter
ValueCountFrequency (%)
r 9
15.0%
o 9
15.0%
s 9
15.0%
t 9
15.0%
p 6
10.0%
n 3
 
5.0%
g 3
 
5.0%
i 3
 
5.0%
c 3
 
5.0%
k 3
 
5.0%
Decimal Number
ValueCountFrequency (%)
0 45
23.4%
1 35
18.2%
2 19
9.9%
5 19
9.9%
4 18
 
9.4%
3 15
 
7.8%
7 13
 
6.8%
8 13
 
6.8%
9 8
 
4.2%
6 7
 
3.6%
Other Punctuation
ValueCountFrequency (%)
: 30
33.0%
, 24
26.4%
. 24
26.4%
/ 9
 
9.9%
% 4
 
4.4%
Other Symbol
ValueCountFrequency (%)
17
63.0%
10
37.0%
Space Separator
ValueCountFrequency (%)
245
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%
Control
ValueCountFrequency (%)
5
100.0%
Math Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 628 | 47.9%
Hangul | 622 | 47.5%
Latin | 60 | 4.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
4.2%
18
 
2.9%
17
 
2.7%
16
 
2.6%
16
 
2.6%
15
 
2.4%
14
 
2.3%
13
 
2.1%
12
 
1.9%
12
 
1.9%
Other values (146) 463
74.4%
Common
ValueCountFrequency (%)
245
39.0%
0 45
 
7.2%
1 35
 
5.6%
: 30
 
4.8%
, 24
 
3.8%
. 24
 
3.8%
) 23
 
3.7%
( 23
 
3.7%
- 19
 
3.0%
2 19
 
3.0%
Other values (13) 141
22.5%
Latin
ValueCountFrequency (%)
r 9
15.0%
o 9
15.0%
s 9
15.0%
t 9
15.0%
p 6
10.0%
n 3
 
5.0%
g 3
 
5.0%
i 3
 
5.0%
c 3
 
5.0%
k 3
 
5.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 658 | 50.2%
Hangul | 620 | 47.3%
Geometric Shapes | 17 | 1.3%
CJK Compat | 10 | 0.8%
Math Operators | 3 | 0.2%
Compat Jamo | 2 | 0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
245
37.2%
0 45
 
6.8%
1 35
 
5.3%
: 30
 
4.6%
, 24
 
3.6%
. 24
 
3.6%
) 23
 
3.5%
( 23
 
3.5%
- 19
 
2.9%
2 19
 
2.9%
Other values (21) 171
26.0%
Hangul
ValueCountFrequency (%)
26
 
4.2%
18
 
2.9%
17
 
2.7%
16
 
2.6%
16
 
2.6%
15
 
2.4%
14
 
2.3%
13
 
2.1%
12
 
1.9%
12
 
1.9%
Other values (145) 461
74.4%
Geometric Shapes
ValueCountFrequency (%)
17
100.0%
CJK Compat
ValueCountFrequency (%)
10
100.0%
Math Operators
ValueCountFrequency (%)
3
100.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%

Unnamed: 2
Text

MISSING 

Distinct: 44
Distinct (%): 93.6%
Missing: 65
Missing (%): 58.0%
Memory size: 1.0 KiB

Length

Max length: 17
Median length: 13
Mean length: 8.2340426
Min length: 2

Characters and Unicode

Total characters: 387
Distinct characters: 82
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 42
Unique (%): 89.4%

Sample

1st row시 설 개 요
2nd row지하1층/지상2층
3rd row(연면적 : 4,402.96㎡)
4th row연면적 : 269.79㎡
5th row실외테니스장(10면)
ValueCountFrequency (%)
15
 
15.3%
어린이 4
 
4.1%
청소년 4
 
4.1%
4
 
4.1%
4
 
4.1%
연면적 3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (48) 52
53.1%

Most occurring characters

ValueCountFrequency (%)
0 55
 
14.2%
53
 
13.7%
: 15
 
3.9%
14
 
3.6%
1 14
 
3.6%
, 14
 
3.6%
( 9
 
2.3%
) 9
 
2.3%
2 8
 
2.1%
5 7
 
1.8%
Other values (72) 189
48.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 170 | 43.9%
Decimal Number | 106 | 27.4%
Space Separator | 53 | 13.7%
Other Punctuation | 37 | 9.6%
Open Punctuation | 9 | 2.3%
Close Punctuation | 9 | 2.3%
Other Symbol | 3 | 0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
8.2%
7
 
4.1%
6
 
3.5%
6
 
3.5%
6
 
3.5%
6
 
3.5%
6
 
3.5%
5
 
2.9%
4
 
2.4%
4
 
2.4%
Other values (53) 106
62.4%
Decimal Number
ValueCountFrequency (%)
0 55
51.9%
1 14
 
13.2%
2 8
 
7.5%
5 7
 
6.6%
4 7
 
6.6%
9 4
 
3.8%
3 4
 
3.8%
7 3
 
2.8%
6 3
 
2.8%
8 1
 
0.9%
Other Punctuation
ValueCountFrequency (%)
: 15
40.5%
, 14
37.8%
. 4
 
10.8%
* 2
 
5.4%
/ 2
 
5.4%
Space Separator
ValueCountFrequency (%)
53
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 217 | 56.1%
Hangul | 170 | 43.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
8.2%
7
 
4.1%
6
 
3.5%
6
 
3.5%
6
 
3.5%
6
 
3.5%
6
 
3.5%
5
 
2.9%
4
 
2.4%
4
 
2.4%
Other values (53) 106
62.4%
Common
ValueCountFrequency (%)
0 55
25.3%
53
24.4%
: 15
 
6.9%
1 14
 
6.5%
, 14
 
6.5%
( 9
 
4.1%
) 9
 
4.1%
2 8
 
3.7%
5 7
 
3.2%
4 7
 
3.2%
Other values (9) 26
12.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 214 | 55.3%
Hangul | 167 | 43.2%
Compat Jamo | 3 | 0.8%
CJK Compat | 3 | 0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 55
25.7%
53
24.8%
: 15
 
7.0%
1 14
 
6.5%
, 14
 
6.5%
( 9
 
4.2%
) 9
 
4.2%
2 8
 
3.7%
5 7
 
3.3%
4 7
 
3.3%
Other values (8) 23
10.7%
Hangul
ValueCountFrequency (%)
14
 
8.4%
7
 
4.2%
6
 
3.6%
6
 
3.6%
6
 
3.6%
6
 
3.6%
6
 
3.6%
5
 
3.0%
4
 
2.4%
4
 
2.4%
Other values (52) 103
61.7%
Compat Jamo
ValueCountFrequency (%)
3
100.0%
CJK Compat
ValueCountFrequency (%)
3
100.0%

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 57
Missing (%): 50.9%
Memory size: 1.0 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 58
Missing (%): 51.8%
Memory size: 1.0 KiB

Unnamed: 5
Text

MISSING 

Distinct: 8
Distinct (%): 88.9%
Missing: 103
Missing (%): 92.0%
Memory size: 1.0 KiB

Length

Max length: 7
Median length: 6
Mean length: 3.8888889
Min length: 2

Characters and Unicode

Total characters: 35
Distinct characters: 26
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7
Unique (%): 77.8%
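
The difference between the "Distinct" and "Unique" figures can be sketched as follows, assuming "distinct" counts the different non-missing values and "unique" counts the values that occur exactly once. That reading matches the numbers for this column (9 non-missing values, 8 distinct, 7 unique, so one value appears twice), but it is an assumption about the tool's definitions. The helper `distinct_and_unique` is hypothetical, and the input list is the five sample rows shown for this column, not the full column.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique) counts, ignoring missing (None) values.

    distinct: number of different values present
    unique:   number of values that occur exactly once
    """
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# The five sample rows listed for Unnamed: 5, plus one missing cell.
sample_rows = ["비고", "배드민턴,농구", "헬스", "비고", "비 고", None]
distinct, unique = distinct_and_unique(sample_rows)
print(distinct, unique)
```

Note that "비고" and "비 고" count as different values because of the embedded space, which is one reason near-duplicate headers inflate the distinct count in this dataset.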

Sample

1st row비고
2nd row배드민턴,농구
3rd row헬스
4th row비고
5th row비 고
ValueCountFrequency (%)
비고 2
20.0%
배드민턴,농구 1
10.0%
헬스 1
10.0%
1
10.0%
1
10.0%
30*86 1
10.0%
33*71 1
10.0%
468.2㎡ 1
10.0%
시설물 1
10.0%

Most occurring characters

ValueCountFrequency (%)
3
 
8.6%
3 3
 
8.6%
3
 
8.6%
* 2
 
5.7%
6 2
 
5.7%
8 2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
2 1
 
2.9%
Other values (16) 16
45.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 17 | 48.6%
Decimal Number | 12 | 34.3%
Other Punctuation | 4 | 11.4%
Other Symbol | 1 | 2.9%
Space Separator | 1 | 2.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
17.6%
3
17.6%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (3) 3
17.6%
Decimal Number
ValueCountFrequency (%)
3 3
25.0%
6 2
16.7%
8 2
16.7%
2 1
 
8.3%
4 1
 
8.3%
1 1
 
8.3%
7 1
 
8.3%
0 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
* 2
50.0%
. 1
25.0%
, 1
25.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 18 | 51.4%
Hangul | 17 | 48.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
17.6%
3
17.6%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (3) 3
17.6%
Common
ValueCountFrequency (%)
3 3
16.7%
* 2
11.1%
6 2
11.1%
8 2
11.1%
1
 
5.6%
2 1
 
5.6%
. 1
 
5.6%
4 1
 
5.6%
1 1
 
5.6%
7 1
 
5.6%
Other values (3) 3
16.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 17 | 48.6%
ASCII | 17 | 48.6%
CJK Compat | 1 | 2.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3
17.6%
3
17.6%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
1
 
5.9%
Other values (3) 3
17.6%
ASCII
ValueCountFrequency (%)
3 3
17.6%
* 2
11.8%
6 2
11.8%
8 2
11.8%
2 1
 
5.9%
. 1
 
5.9%
4 1
 
5.9%
1 1
 
5.9%
7 1
 
5.9%
0 1
 
5.9%
Other values (2) 2
11.8%
CJK Compat
ValueCountFrequency (%)
1
100.0%

Correlations

Variable | 금오테니스장 시설현황 | Unnamed: 2 | Unnamed: 5
금오테니스장 시설현황 | 1.000 | 0.997 | 1.000
Unnamed: 2 | 0.997 | 1.000 | 1.000
Unnamed: 5 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
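
Nullity correlation can be sketched as the Pearson correlation between per-column presence indicators (1 where a cell has a value, 0 where it is missing). The example below is a minimal standard-library version under that assumption; the function name and the short columns are hypothetical, and the report's own heatmap may be computed differently.

```python
from math import sqrt, isnan


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns.

    None marks a missing cell. Returns NaN when either column is entirely
    present or entirely missing, because its indicator has zero variance.
    """
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")
    return cov / sqrt(var_a * var_b)


# Hypothetical columns: two that are filled on the same rows, one filled on
# the opposite rows, and one that is empty throughout.
name = ["실내테니스장", None, "센터코트", None]
detail = ["지하1층/지상2층", None, "연면적 : 269.79㎡", None]
opposite = [None, "x", None, "y"]
empty = [None, None, None, None]

same = nullity_correlation(name, detail)
inverse = nullity_correlation(name, opposite)
undefined = nullity_correlation(name, empty)
print(same, inverse, isnan(undefined))
```

A value near 1 means two columns tend to be filled on the same rows, a value near -1 means one is filled where the other is empty, and a column that is missing everywhere (as Unnamed: 0 is here) has no defined nullity correlation. With pandas, a comparable matrix can be obtained from `df.isna().corr()`.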

Sample

First rows

Row | Unnamed: 0 | 금오테니스장 시설현황 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5
0 | <NA> | ▪ 위치 : 구미시 남통동 136-1번지 일원 (산책길 105) | <NA> | NaN | NaN | <NA>
1 | <NA> | ▪ 총사업비 : 12,500백만원 | <NA> | NaN | NaN | <NA>
2 | <NA> | ▪ 규모 : 건축면적(4,865.51㎡), 연면적 (4,997.74㎡), 지하1층/지상2층 | <NA> | NaN | NaN | <NA>
3 | <NA> | ▪ 주요시설 : 테니스장 15면(실내-4면, 실외-11면) 및 부대시설 | <NA> | NaN | NaN | <NA>
4 | <NA> | <NA> | <NA> | NaN | NaN | <NA>
5 | <NA> | 구 분 | 시 설 개 요 | 시 설 내 용 | 비 고 | <NA>
6 | <NA> | 실내테니스장 | 지하1층/지상2층 | 실내테니스장 4면 및 부대시설 | 관람석(476석) | <NA>
7 | <NA> | <NA> | (연면적 : 4,402.96㎡) | NaN | NaN | <NA>
8 | <NA> | 센터코트 | 연면적 : 269.79㎡ | 테니스장 1면, 방송실, | 관람석(1,235석) | <NA>
9 | <NA> | <NA> | <NA> | 운영실 및 부대시설 | NaN | <NA>

Last rows

Row | Unnamed: 0 | 금오테니스장 시설현황 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5
102 | <NA> | (10장기준) | 청소년 : 100,000원 | 및 승마교관 강습 | 대여 포함 | <NA>
103 | <NA> | <NA> | 어린이 : 50,000원 | NaN | ·상해보험 가입 | <NA>
104 | <NA> | 승마체험 | 성 인 : 5,000원 | 실내마장 | ·승마모자, 안전복, 챕 | <NA>
105 | <NA> | (1인당) | 청소년 : 4,000원 | NaN | 대여 포함 | <NA>
106 | <NA> | <NA> | 어린이 : 3,000원 | NaN | ·상해보험 가입 | <NA>
107 | <NA> | 1. 이용시간 : 자마회원을 제외한 기승자는 1일 1시간 이내로 기승시간을 정한다. | <NA> | NaN | NaN | <NA>
108 | <NA> | 2. 월 회원 : 정기적으로 주 3일 이상 이용하는 자를 말한다. | <NA> | NaN | NaN | <NA>
109 | <NA> | 3. 쿠폰제 유휴기간은 발급일로부터 3개월로 한다. | <NA> | NaN | NaN | <NA>
110 | <NA> | 4. 단체입장 시 1일 기승 및 마차 체험 이용료를 50% 할인하며 시간은 조정할 수 있다. | <NA> | NaN | NaN | <NA>
111 | <NA> | 5. 국가유공자, 기초수급대상자, 등록장애인(1-3급은 보호자 1명 포함)에게는 50%를 할인 할 수 있다. | <NA> | NaN | NaN | <NA>

Duplicate rows

Most frequently occurring

Row | 금오테니스장 시설현황 | Unnamed: 2 | Unnamed: 5 | # duplicates
2 | <NA> | <NA> | <NA> | 30
0 | ▪ 온라인 예약접수 : https://sports.ginco.or.kr | <NA> | <NA> | 3
1 | 구 분 | 시 설 개 요 | 비고 | 2
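
A table like this can be approximated by counting identical rows. The sketch below uses only the standard library; `duplicate_groups` and the short `rows` list are hypothetical illustrations, not the report's actual data or the profiling tool's implementation.

```python
from collections import Counter


def duplicate_groups(rows):
    """Return (row, count) pairs for rows occurring more than once,
    ordered from most to least frequent."""
    counts = Counter(tuple(row) for row in rows)
    return [(row, n) for row, n in counts.most_common() if n > 1]


# Hypothetical rows over the three supported columns; None marks a missing cell.
rows = [
    (None, None, None),
    ("구 분", "시 설 개 요", "비고"),
    (None, None, None),
    ("구 분", "시 설 개 요", "비고"),
    (None, None, None),
    ("승마체험", "성 인 : 5,000원", None),
]
groups = duplicate_groups(rows)
for row, n in groups:
    print(n, row)
```

In this dataset the largest duplicate group is the fully empty row, so dropping rows with no values at all before de-duplicating would likely remove most of the reported duplicates. With pandas, `DataFrame.duplicated` and `DataFrame.drop_duplicates` cover the same ground.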