Overview

Dataset statistics

Number of variables7
Number of observations716
Missing cells199
Missing cells (%)4.0%
Duplicate rows61
Duplicate rows (%)8.5%
Total size in memory40.0 KiB
Average record size in memory57.2 B

Variable types

Text3
Categorical3
DateTime1

Dataset

Description구로구에 설최되어있는 실외운동기구와 관련된 정보입니다. 실내운동기구의 종류, 대수, 설치위치와 상세위치 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15078910/fileData.do

Alerts

기준일자 has constant value ""Constant
Dataset has 61 (8.5%) duplicate rowsDuplicates
주소 is highly overall correlated with 수량 and 1 other fieldsHigh correlation
수량 is highly overall correlated with 주소High correlation
관리부서 is highly overall correlated with 주소High correlation
수량 is highly imbalanced (89.8%)Imbalance
상세위치 has 199 (27.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 21:40:22.172471
Analysis finished2023-12-12 21:40:22.695182
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct74
Distinct (%)10.3%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
2023-12-13T06:40:22.882797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length8.5418994
Min length2

Characters and Unicode

Total characters6116
Distinct characters151
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row고척근린공원
2nd row고척근린공원
3rd row고척근린공원
4th row고척근린공원
5th row고척근린공원
ValueCountFrequency (%)
온수도시자연공원 100
 
9.0%
개웅산근린공원 67
 
6.0%
하부 55
 
5.0%
고척근린공원 51
 
4.6%
45
 
4.1%
43
 
3.9%
계남근린공원 36
 
3.2%
안양천 27
 
2.4%
천왕도시자연공원 24
 
2.2%
오금교 22
 
2.0%
Other values (90) 640
57.7%
2023-12-13T06:40:23.257821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
417
 
6.8%
411
 
6.7%
402
 
6.6%
248
 
4.1%
171
 
2.8%
154
 
2.5%
141
 
2.3%
138
 
2.3%
124
 
2.0%
124
 
2.0%
Other values (141) 3786
61.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5184
84.8%
Space Separator 402
 
6.6%
Uppercase Letter 127
 
2.1%
Decimal Number 123
 
2.0%
Close Punctuation 118
 
1.9%
Open Punctuation 118
 
1.9%
Math Symbol 23
 
0.4%
Dash Punctuation 21
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
417
 
8.0%
411
 
7.9%
248
 
4.8%
171
 
3.3%
154
 
3.0%
141
 
2.7%
138
 
2.7%
124
 
2.4%
124
 
2.4%
121
 
2.3%
Other values (125) 3135
60.5%
Uppercase Letter
ValueCountFrequency (%)
I 28
22.0%
P 21
16.5%
R 21
16.5%
A 21
16.5%
K 21
16.5%
C 15
11.8%
Decimal Number
ValueCountFrequency (%)
1 46
37.4%
2 29
23.6%
0 21
17.1%
9 21
17.1%
3 6
 
4.9%
Space Separator
ValueCountFrequency (%)
402
100.0%
Close Punctuation
ValueCountFrequency (%)
) 118
100.0%
Open Punctuation
ValueCountFrequency (%)
( 118
100.0%
Math Symbol
ValueCountFrequency (%)
~ 23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5184
84.8%
Common 805
 
13.2%
Latin 127
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
417
 
8.0%
411
 
7.9%
248
 
4.8%
171
 
3.3%
154
 
3.0%
141
 
2.7%
138
 
2.7%
124
 
2.4%
124
 
2.4%
121
 
2.3%
Other values (125) 3135
60.5%
Common
ValueCountFrequency (%)
402
49.9%
) 118
 
14.7%
( 118
 
14.7%
1 46
 
5.7%
2 29
 
3.6%
~ 23
 
2.9%
- 21
 
2.6%
0 21
 
2.6%
9 21
 
2.6%
3 6
 
0.7%
Latin
ValueCountFrequency (%)
I 28
22.0%
P 21
16.5%
R 21
16.5%
A 21
16.5%
K 21
16.5%
C 15
11.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5184
84.8%
ASCII 932
 
15.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
417
 
8.0%
411
 
7.9%
248
 
4.8%
171
 
3.3%
154
 
3.0%
141
 
2.7%
138
 
2.7%
124
 
2.4%
124
 
2.4%
121
 
2.3%
Other values (125) 3135
60.5%
ASCII
ValueCountFrequency (%)
402
43.1%
) 118
 
12.7%
( 118
 
12.7%
1 46
 
4.9%
2 29
 
3.1%
I 28
 
3.0%
~ 23
 
2.5%
- 21
 
2.3%
P 21
 
2.3%
R 21
 
2.3%
Other values (6) 105
 
11.3%
Distinct104
Distinct (%)14.5%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
2023-12-13T06:40:23.509766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length5.3743017
Min length2

Characters and Unicode

Total characters3848
Distinct characters119
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)7.0%

Sample

1st row철봉
2nd row거꾸로매달리기
3rd row다리뻗치기
4th row양팔줄당기기
5th row온몸근육풀기
ValueCountFrequency (%)
허리돌리기 70
 
9.7%
하늘걷기 42
 
5.8%
온몸근육풀기 40
 
5.5%
윗몸일으키기 40
 
5.5%
파도타기 34
 
4.7%
마라톤운동 32
 
4.4%
역기내리기 24
 
3.3%
등허리지압기 22
 
3.0%
철봉 21
 
2.9%
거꾸로매달리기 20
 
2.8%
Other values (93) 379
52.3%
2023-12-13T06:40:23.862835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
775
20.1%
396
 
10.3%
133
 
3.5%
119
 
3.1%
107
 
2.8%
101
 
2.6%
95
 
2.5%
95
 
2.5%
91
 
2.4%
90
 
2.3%
Other values (109) 1846
48.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3780
98.2%
Math Symbol 57
 
1.5%
Space Separator 9
 
0.2%
Decimal Number 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
775
20.5%
396
 
10.5%
133
 
3.5%
119
 
3.1%
107
 
2.8%
101
 
2.7%
95
 
2.5%
95
 
2.5%
91
 
2.4%
90
 
2.4%
Other values (105) 1778
47.0%
Math Symbol
ValueCountFrequency (%)
+ 57
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%
Other Punctuation
ValueCountFrequency (%)
· 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3780
98.2%
Common 68
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
775
20.5%
396
 
10.5%
133
 
3.5%
119
 
3.1%
107
 
2.8%
101
 
2.7%
95
 
2.5%
95
 
2.5%
91
 
2.4%
90
 
2.4%
Other values (105) 1778
47.0%
Common
ValueCountFrequency (%)
+ 57
83.8%
9
 
13.2%
3 1
 
1.5%
· 1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3780
98.2%
ASCII 67
 
1.7%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
775
20.5%
396
 
10.5%
133
 
3.5%
119
 
3.1%
107
 
2.8%
101
 
2.7%
95
 
2.5%
95
 
2.5%
91
 
2.4%
90
 
2.4%
Other values (105) 1778
47.0%
ASCII
ValueCountFrequency (%)
+ 57
85.1%
9
 
13.4%
3 1
 
1.5%
None
ValueCountFrequency (%)
· 1
100.0%

수량
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
1
694 
2
 
16
3
 
3
4
 
2
5
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 694
96.9%
2 16
 
2.2%
3 3
 
0.4%
4 2
 
0.3%
5 1
 
0.1%

Length

2023-12-13T06:40:24.005518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:40:24.133055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 694
96.9%
2 16
 
2.2%
3 3
 
0.4%
4 2
 
0.3%
5 1
 
0.1%

주소
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
<NA>
438 
서울특별시 구로구 온수동 9-44
100 
서울특별시 구로구 개봉동 산53-5
67 
서울특별시 구로구 고척로45길 39
51 
서울특별시 구로구 중앙로15길 100-30
 
36

Length

Max length23
Median length4
Mean length9.9189944
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시 구로구 고척로45길 39
2nd row서울특별시 구로구 고척로45길 39
3rd row서울특별시 구로구 고척로45길 39
4th row서울특별시 구로구 고척로45길 39
5th row서울특별시 구로구 고척로45길 39

Common Values

ValueCountFrequency (%)
<NA> 438
61.2%
서울특별시 구로구 온수동 9-44 100
 
14.0%
서울특별시 구로구 개봉동 산53-5 67
 
9.4%
서울특별시 구로구 고척로45길 39 51
 
7.1%
서울특별시 구로구 중앙로15길 100-30 36
 
5.0%
서울특별시 구로구 연동로12길 149 24
 
3.4%

Length

2023-12-13T06:40:24.264179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:40:24.397581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 438
28.3%
서울특별시 278
17.9%
구로구 278
17.9%
온수동 100
 
6.5%
9-44 100
 
6.5%
개봉동 67
 
4.3%
산53-5 67
 
4.3%
고척로45길 51
 
3.3%
39 51
 
3.3%
중앙로15길 36
 
2.3%
Other values (3) 84
 
5.4%

상세위치
Text

MISSING 

Distinct54
Distinct (%)10.4%
Missing199
Missing (%)27.8%
Memory size5.7 KiB
2023-12-13T06:40:24.653070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length10
Mean length7.3133462
Min length2

Characters and Unicode

Total characters3781
Distinct characters98
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광장
2nd row광장
3rd row광장
4th row광장
5th row광장
ValueCountFrequency (%)
개봉동 72
 
8.4%
잣절공원 65
 
7.6%
신도림동 60
 
7.0%
구로동 53
 
6.2%
372-1 43
 
5.0%
정상 26
 
3.0%
871-10 22
 
2.6%
신정동 22
 
2.6%
뒷산2 20
 
2.3%
온수체육공원 19
 
2.2%
Other values (62) 456
53.1%
2023-12-13T06:40:25.109297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
341
 
9.0%
241
 
6.4%
- 234
 
6.2%
1 227
 
6.0%
2 197
 
5.2%
3 183
 
4.8%
7 166
 
4.4%
107
 
2.8%
107
 
2.8%
6 107
 
2.8%
Other values (88) 1871
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2038
53.9%
Decimal Number 1117
29.5%
Space Separator 341
 
9.0%
Dash Punctuation 234
 
6.2%
Open Punctuation 20
 
0.5%
Close Punctuation 20
 
0.5%
Math Symbol 11
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
241
 
11.8%
107
 
5.3%
107
 
5.3%
97
 
4.8%
93
 
4.6%
84
 
4.1%
83
 
4.1%
77
 
3.8%
72
 
3.5%
65
 
3.2%
Other values (73) 1012
49.7%
Decimal Number
ValueCountFrequency (%)
1 227
20.3%
2 197
17.6%
3 183
16.4%
7 166
14.9%
6 107
9.6%
8 79
 
7.1%
4 59
 
5.3%
0 52
 
4.7%
5 29
 
2.6%
9 18
 
1.6%
Space Separator
ValueCountFrequency (%)
341
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 234
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2038
53.9%
Common 1743
46.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
241
 
11.8%
107
 
5.3%
107
 
5.3%
97
 
4.8%
93
 
4.6%
84
 
4.1%
83
 
4.1%
77
 
3.8%
72
 
3.5%
65
 
3.2%
Other values (73) 1012
49.7%
Common
ValueCountFrequency (%)
341
19.6%
- 234
13.4%
1 227
13.0%
2 197
11.3%
3 183
10.5%
7 166
9.5%
6 107
 
6.1%
8 79
 
4.5%
4 59
 
3.4%
0 52
 
3.0%
Other values (5) 98
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2038
53.9%
ASCII 1743
46.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
341
19.6%
- 234
13.4%
1 227
13.0%
2 197
11.3%
3 183
10.5%
7 166
9.5%
6 107
 
6.1%
8 79
 
4.5%
4 59
 
3.4%
0 52
 
3.0%
Other values (5) 98
 
5.6%
Hangul
ValueCountFrequency (%)
241
 
11.8%
107
 
5.3%
107
 
5.3%
97
 
4.8%
93
 
4.6%
84
 
4.1%
83
 
4.1%
77
 
3.8%
72
 
3.5%
65
 
3.2%
Other values (73) 1012
49.7%

관리부서
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
녹색도시과
477 
체육진흥과
239 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row녹색도시과
2nd row녹색도시과
3rd row녹색도시과
4th row녹색도시과
5th row녹색도시과

Common Values

ValueCountFrequency (%)
녹색도시과 477
66.6%
체육진흥과 239
33.4%

Length

2023-12-13T06:40:25.275219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:40:25.406320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
녹색도시과 477
66.6%
체육진흥과 239
33.4%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
Minimum2023-03-31 00:00:00
Maximum2023-03-31 00:00:00
2023-12-13T06:40:25.529019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:40:25.652107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-13T06:40:25.783966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
장소명수량주소상세위치관리부서
장소명1.0000.7511.0000.9971.000
수량0.7511.000NaNNaN0.083
주소1.000NaN1.0000.999NaN
상세위치0.997NaN0.9991.0001.000
관리부서1.0000.083NaN1.0001.000
2023-12-13T06:40:25.921954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주소수량관리부서
주소1.0001.0001.000
수량1.0001.0000.102
관리부서1.0000.1021.000
2023-12-13T06:40:26.057244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수량주소관리부서
수량1.0001.0000.102
주소1.0001.0001.000
관리부서0.1021.0001.000

Missing values

2023-12-13T06:40:22.557788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:40:22.657088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

장소명운동기구명수량주소상세위치관리부서기준일자
0고척근린공원철봉1서울특별시 구로구 고척로45길 39광장녹색도시과2023-03-31
1고척근린공원거꾸로매달리기1서울특별시 구로구 고척로45길 39광장녹색도시과2023-03-31
2고척근린공원다리뻗치기1서울특별시 구로구 고척로45길 39광장녹색도시과2023-03-31
3고척근린공원양팔줄당기기1서울특별시 구로구 고척로45길 39광장녹색도시과2023-03-31
4고척근린공원온몸근육풀기1서울특별시 구로구 고척로45길 39광장녹색도시과2023-03-31
5고척근린공원역기내리기1서울특별시 구로구 고척로45길 39광장녹색도시과2023-03-31
6고척근린공원허리돌리기1서울특별시 구로구 고척로45길 39광장녹색도시과2023-03-31
7고척근린공원온몸노젓기1서울특별시 구로구 고척로45길 39광장녹색도시과2023-03-31
8고척근린공원하늘걷기1서울특별시 구로구 고척로45길 39광장녹색도시과2023-03-31
9고척근린공원파도타기1서울특별시 구로구 고척로45길 39등산로입구녹색도시과2023-03-31
장소명운동기구명수량주소상세위치관리부서기준일자
706오류역광장달리기1<NA>오류동 65-6체육진흥과2023-03-31
707오류역광장역기올리기1<NA>오류동 65-6체육진흥과2023-03-31
708오류역광장다리로밀기1<NA>오류동 65-6체육진흥과2023-03-31
709궁동 조광빌라 뒤허리돌리기1<NA>궁동 77-3체육진흥과2023-03-31
710궁동 조광빌라 뒤등허리지압기1<NA>궁동 77-3체육진흥과2023-03-31
711궁동 조광빌라 뒤다리로밀기1<NA>궁동 77-3체육진흥과2023-03-31
712궁동 조광빌라 뒤역기내리기1<NA>궁동 77-3체육진흥과2023-03-31
713궁동 조광빌라 뒤상체근육풀기1<NA>궁동 77-3체육진흥과2023-03-31
714궁동 조광빌라 뒤달리기1<NA>궁동 77-3체육진흥과2023-03-31
715궁동 조광빌라 뒤파도타기1<NA>궁동 77-3체육진흥과2023-03-31

Duplicate rows

Most frequently occurring

장소명운동기구명수량주소상세위치관리부서기준일자# duplicates
59천왕도시자연공원역기1서울특별시 구로구 연동로12길 149정상(그늘막)녹색도시과2023-03-319
37온수도시자연공원마라톤운동1서울특별시 구로구 온수동 9-44잣절공원녹색도시과2023-03-316
36온수도시자연공원링잡고오르기1서울특별시 구로구 온수동 9-44잣절공원녹색도시과2023-03-315
39온수도시자연공원역기1서울특별시 구로구 온수동 9-44잣절공원녹색도시과2023-03-315
50온수도시자연공원파도타기1서울특별시 구로구 온수동 9-44잣절공원녹색도시과2023-03-315
51온수도시자연공원팔굽혀펴기1서울특별시 구로구 온수동 9-44잣절공원녹색도시과2023-03-315
56온수도시자연공원허리돌리기1서울특별시 구로구 온수동 9-44잣절공원녹색도시과2023-03-315
0I-PARK(109동) 앞거꾸로매달리기1<NA>개봉동 372-1체육진흥과2023-03-314
40온수도시자연공원역기내리기1서울특별시 구로구 온수동 9-44잣절공원녹색도시과2023-03-314
41온수도시자연공원역기올리기1서울특별시 구로구 온수동 9-44잣절공원녹색도시과2023-03-314