Overview

Dataset statistics

Number of variables: 3
Number of observations: 238
Missing cells: 8
Missing cells (%): 1.1%
Duplicate rows: 1
Duplicate rows (%): 0.4%
Total size in memory: 5.7 KiB
Average record size in memory: 24.6 B
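
The two percentages above can be re-derived from the counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables times observations) and the duplicate percentage over all rows; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 3
n_observations = 238
missing_cells = 8
duplicate_rows = 1

# Assumption: missing cells are measured against every cell in the table.
total_cells = n_variables * n_observations

missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Total cells: {total_cells}")              # Total cells: 714
print(f"Missing cells: {missing_pct:.1f}%")       # Missing cells: 1.1%
print(f"Duplicate rows: {duplicate_pct:.1f}%")    # Duplicate rows: 0.4%
```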

Variable types

Text: 2
Categorical: 1

Dataset

Description: 시설명, 전화번호, 동 (facility name, phone number, dong)
Author: 강동구 (Gangdong-gu)
URL: https://data.seoul.go.kr/dataList/OA-12640/S/1/datasetView.do

Alerts

Dataset has 1 (0.4%) duplicate rows [Duplicates]
전화번호 has 8 (3.4%) missing values [Missing]

Reproduction

Analysis started: 2024-05-04 01:09:38.442219
Analysis finished: 2024-05-04 01:09:39.148298
Duration: 0.71 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
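
The reported duration follows from the two timestamps. A minimal check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2024-05-04 01:09:38.442219")
finished = datetime.fromisoformat("2024-05-04 01:09:39.148298")

# Subtracting two datetimes yields a timedelta.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")  # Duration: 0.71 seconds
```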

Variables

시설명 (facility name)
Text

Distinct: 236
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

Length

Max length: 20
Median length: 15
Mean length: 9.2226891
Min length: 4

Characters and Unicode

Total characters: 2195
Distinct characters: 251
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 234
Unique (%): 98.3%

Sample

1st row: 한사랑실버센터 1호점
2nd row: 한사랑실버센터 2호점
3rd row: 시립강동노인종합복지관주간보호센터
4th row: 흰돌주간보호센터(2호점)
5th row: 반석햇빛마을주간보호센터

Value | Count | Frequency (%)
경로당 | 29 | 9.4%
a경로당 | 10 | 3.2%
어르신사랑방 | 6 | 1.9%
a | 4 | 1.3%
암사1동 | 3 | 1.0%
한사랑실버센터 | 3 | 1.0%
둔촌현대a | 2 | 0.6%
제2경로당 | 2 | 0.6%
시립강동노인종합복지관 | 2 | 0.6%
둔촌2동 | 2 | 0.6%
Other values (244) | 246 | 79.6%

Most occurring characters

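Value | Count | Frequency (%)
[not preserved] | 114 | 5.2%
[not preserved] | 113 | 5.1%
[not preserved] | 113 | 5.1%
(space) | 71 | 3.2%
[not preserved] | 61 | 2.8%
[not preserved] | 60 | 2.7%
[not preserved] | 59 | 2.7%
[not preserved] | 49 | 2.2%
[not preserved] | 45 | 2.1%
[not preserved] | 44 | 2.0%
Other values (241) | 1466 | 66.8%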

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1979 | 90.2%
Space Separator | 71 | 3.2%
Decimal Number | 63 | 2.9%
Uppercase Letter | 47 | 2.1%
Open Punctuation | 15 | 0.7%
Close Punctuation | 15 | 0.7%
Other Punctuation | 2 | 0.1%
Math Symbol | 1 | < 0.1%
Other Symbol | 1 | < 0.1%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

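Other Letter
Value | Count | Frequency (%)
[not preserved] | 114 | 5.8%
[not preserved] | 113 | 5.7%
[not preserved] | 113 | 5.7%
[not preserved] | 61 | 3.1%
[not preserved] | 60 | 3.0%
[not preserved] | 59 | 3.0%
[not preserved] | 49 | 2.5%
[not preserved] | 45 | 2.3%
[not preserved] | 44 | 2.2%
[not preserved] | 42 | 2.1%
Other values (217) | 1279 | 64.6%

Decimal Number
Value | Count | Frequency (%)
2 | 22 | 34.9%
1 | 19 | 30.2%
3 | 9 | 14.3%
0 | 3 | 4.8%
5 | 2 | 3.2%
6 | 2 | 3.2%
4 | 2 | 3.2%
9 | 2 | 3.2%
8 | 1 | 1.6%
7 | 1 | 1.6%

Uppercase Letter
Value | Count | Frequency (%)
A | 42 | 89.4%
G | 1 | 2.1%
S | 1 | 2.1%
E | 1 | 2.1%
T | 1 | 2.1%
H | 1 | 2.1%

Other Punctuation
Value | Count | Frequency (%)
/ | 1 | 50.0%
? | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 71 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 15 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 15 | 100.0%

Math Symbol
Value | Count | Frequency (%)
+ | 1 | 100.0%

Other Symbol
Value | Count | Frequency (%)
[not preserved] | 1 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%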

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1980 | 90.2%
Common | 168 | 7.7%
Latin | 47 | 2.1%

Most frequent character per script

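Hangul
Value | Count | Frequency (%)
[not preserved] | 114 | 5.8%
[not preserved] | 113 | 5.7%
[not preserved] | 113 | 5.7%
[not preserved] | 61 | 3.1%
[not preserved] | 60 | 3.0%
[not preserved] | 59 | 3.0%
[not preserved] | 49 | 2.5%
[not preserved] | 45 | 2.3%
[not preserved] | 44 | 2.2%
[not preserved] | 42 | 2.1%
Other values (218) | 1280 | 64.6%

Common
Value | Count | Frequency (%)
(space) | 71 | 42.3%
2 | 22 | 13.1%
1 | 19 | 11.3%
( | 15 | 8.9%
) | 15 | 8.9%
3 | 9 | 5.4%
0 | 3 | 1.8%
5 | 2 | 1.2%
6 | 2 | 1.2%
4 | 2 | 1.2%
Other values (7) | 8 | 4.8%

Latin
Value | Count | Frequency (%)
A | 42 | 89.4%
G | 1 | 2.1%
S | 1 | 2.1%
E | 1 | 2.1%
T | 1 | 2.1%
H | 1 | 2.1%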

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1979 | 90.2%
ASCII | 215 | 9.8%
None | 1 | < 0.1%

Most frequent character per block

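Hangul
Value | Count | Frequency (%)
[not preserved] | 114 | 5.8%
[not preserved] | 113 | 5.7%
[not preserved] | 113 | 5.7%
[not preserved] | 61 | 3.1%
[not preserved] | 60 | 3.0%
[not preserved] | 59 | 3.0%
[not preserved] | 49 | 2.5%
[not preserved] | 45 | 2.3%
[not preserved] | 44 | 2.2%
[not preserved] | 42 | 2.1%
Other values (217) | 1279 | 64.6%

ASCII
Value | Count | Frequency (%)
(space) | 71 | 33.0%
A | 42 | 19.5%
2 | 22 | 10.2%
1 | 19 | 8.8%
( | 15 | 7.0%
) | 15 | 7.0%
3 | 9 | 4.2%
0 | 3 | 1.4%
5 | 2 | 0.9%
6 | 2 | 0.9%
Other values (13) | 15 | 7.0%

None
Value | Count | Frequency (%)
[not preserved] | 1 | 100.0%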

전화번호 (phone number)
Text

Alert: Missing

Distinct: 210
Distinct (%): 91.3%
Missing: 8
Missing (%): 3.4%
Memory size: 2.0 KiB

Length

Max length: 13
Median length: 11
Mean length: 11.1
Min length: 2

Characters and Unicode

Total characters: 2553
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 197
Unique (%): 85.7%
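
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. The sketch below illustrates the difference on the five sample rows of this column, using a hypothetical helper `distinct_and_unique`. The reported percentages appear to be relative to the 230 non-missing values (238 observations minus 8 missing); that denominator is inferred from the figures, not stated in the report.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique) counts, ignoring missing (None) values."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)                                # different values
    unique = sum(1 for n in counts.values() if n == 1)    # values seen exactly once
    return distinct, unique

# The five sample rows of the 전화번호 column; the first number repeats.
sample = ["02-482-6400", "02-482-6400", "02-426-2048", "02-478-8400", "02-477-4268"]
print(distinct_and_unique(sample))  # (4, 3)

# Inferred denominator: non-missing values only.
non_missing = 238 - 8
print(f"{100 * 210 / non_missing:.1f}%")  # 91.3%
print(f"{100 * 197 / non_missing:.1f}%")  # 85.7%
```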

Sample

1st row: 02-482-6400
2nd row: 02-482-6400
3rd row: 02-426-2048
4th row: 02-478-8400
5th row: 02-477-4268

Value | Count | Frequency (%)
02-479-1199 | 5 | 2.2%
02-481-6562 | 4 | 1.7%
02-482-6400 | 3 | 1.3%
02-400-0126 | 3 | 1.3%
02-478-8400 | 2 | 0.9%
02-481-2217 | 2 | 0.9%
02-478-2555 | 2 | 0.9%
02-442-1026 | 2 | 0.9%
02-470-5926 | 2 | 0.9%
02-427-6888 | 2 | 0.9%
Other values (200) | 203 | 88.3%

Most occurring characters

Value | Count | Frequency (%)
- | 460 | 18.0%
2 | 419 | 16.4%
0 | 378 | 14.8%
4 | 323 | 12.7%
7 | 173 | 6.8%
8 | 161 | 6.3%
1 | 147 | 5.8%
6 | 146 | 5.7%
3 | 136 | 5.3%
9 | 108 | 4.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2093 | 82.0%
Dash Punctuation | 460 | 18.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 419 | 20.0%
0 | 378 | 18.1%
4 | 323 | 15.4%
7 | 173 | 8.3%
8 | 161 | 7.7%
1 | 147 | 7.0%
6 | 146 | 7.0%
3 | 136 | 6.5%
9 | 108 | 5.2%
5 | 102 | 4.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 460 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2553 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 460 | 18.0%
2 | 419 | 16.4%
0 | 378 | 14.8%
4 | 323 | 12.7%
7 | 173 | 6.8%
8 | 161 | 6.3%
1 | 147 | 5.8%
6 | 146 | 5.7%
3 | 136 | 5.3%
9 | 108 | 4.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2553 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 460 | 18.0%
2 | 419 | 16.4%
0 | 378 | 14.8%
4 | 323 | 12.7%
7 | 173 | 6.8%
8 | 161 | 6.3%
1 | 147 | 5.8%
6 | 146 | 5.7%
3 | 136 | 5.3%
9 | 108 | 4.2%


동 (dong)
Categorical

Distinct: 18
Distinct (%): 7.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

길동: 38
천호2동: 21
암사1동: 20
둔촌2동: 19
강일동: 18
Other values (13): 122

Length

Max length: 4
Median length: 4
Mean length: 3.5672269
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 길동
2nd row: 길동
3rd row: 명일2동
4th row: 성내3동
5th row: 길동

Common Values

Value | Count | Frequency (%)
길동 | 38 | 16.0%
천호2동 | 21 | 8.8%
암사1동 | 20 | 8.4%
둔촌2동 | 19 | 8.0%
강일동 | 18 | 7.6%
천호3동 | 13 | 5.5%
성내3동 | 13 | 5.5%
천호1동 | 13 | 5.5%
고덕1동 | 12 | 5.0%
명일2동 | 12 | 5.0%
Other values (8) | 59 | 24.8%
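
A table of this shape can be approximated with `collections.Counter`. The sketch below is illustrative only: `common_values` is a hypothetical helper, and it runs on the five sample rows of this column rather than the full dataset. The last line checks one reported frequency (38 of 238 rows).

```python
from collections import Counter

def common_values(values, top=10):
    """Return (value, count, percent) rows for the most frequent values."""
    present = [v for v in values if v is not None]
    counts = Counter(present)
    return [
        (value, n, round(100 * n / len(present), 1))
        for value, n in counts.most_common(top)
    ]

# The five sample rows of this column.
sample = ["길동", "길동", "명일2동", "성내3동", "길동"]
rows = common_values(sample)
for value, n, pct in rows:
    print(f"{value} | {n} | {pct}%")

# Reported frequency of the top value: 38 of 238 rows.
print(f"{100 * 38 / 238:.1f}%")  # 16.0%
```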

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
길동 | 38 | 16.0%
천호2동 | 21 | 8.8%
암사1동 | 20 | 8.4%
둔촌2동 | 19 | 8.0%
강일동 | 18 | 7.6%
천호3동 | 13 | 5.5%
성내3동 | 13 | 5.5%
천호1동 | 13 | 5.5%
고덕1동 | 12 | 5.0%
명일2동 | 12 | 5.0%
Other values (8) | 59 | 24.8%

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
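
The nullity counts behind these plots can be sketched in a few lines. The example below uses three rows taken from the Sample section (indices 6, 7 and 234), with `<NA>` represented as `None`. It also flags values that are present but do not look like phone numbers, such as the `--` entry in row 234, which the report does not count as missing. The pattern `PHONE_PATTERN` is an assumption about the expected format, not something the report defines, and both helper names are hypothetical.

```python
import re

COLUMNS = ("시설명", "전화번호", "동")

# Three rows from the Sample section; None stands for <NA>.
rows = [
    ("큰나무실버센터", "02-482-8787", "길동"),
    ("그랜드너싱홈", None, "길동"),
    ("암사현대홈타운 어르신사랑방", "--", "암사3동"),
]

# Assumed phone format: area code, 3-4 digits, 4 digits, separated by hyphens.
PHONE_PATTERN = re.compile(r"0\d{1,2}-\d{3,4}-\d{4}")

def missing_per_column(rows, columns):
    """Count None values in each column."""
    return {
        col: sum(1 for row in rows if row[i] is None)
        for i, col in enumerate(columns)
    }

def placeholder_phones(rows, phone_index=1):
    """Return phone values that are present but do not match the assumed format."""
    return [
        row[phone_index]
        for row in rows
        if row[phone_index] is not None
        and not PHONE_PATTERN.fullmatch(row[phone_index])
    ]

missing = missing_per_column(rows, COLUMNS)
placeholders = placeholder_phones(rows)
print(missing)
print(placeholders)  # ['--']
```

Treating such placeholders as missing before profiling would raise the missing count for 전화번호 above the 8 reported here.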

Sample

First rows

Index | 시설명 | 전화번호 | 동
0 | 한사랑실버센터 1호점 | 02-482-6400 | 길동
1 | 한사랑실버센터 2호점 | 02-482-6400 | 길동
2 | 시립강동노인종합복지관주간보호센터 | 02-426-2048 | 명일2동
3 | 흰돌주간보호센터(2호점) | 02-478-8400 | 성내3동
4 | 반석햇빛마을주간보호센터 | 02-477-4268 | 길동
5 | 흰돌재가장기요양센터 | 02-481-6562 | 암사1동
6 | 큰나무실버센터 | 02-482-8787 | 길동
7 | 그랜드너싱홈 | <NA> | 길동
8 | 마추미강동실버케어 | 02-429-3434 | 암사2동
9 | 언약노인요양전문기관 | 02-476-5319 | 둔촌2동

Last rows

Index | 시설명 | 전화번호 | 동
228 | 고덕아이파크A 경로당 | 02-442-5580 | 고덕1동
229 | 강동롯데캐슬A 경로당 | 02-426-1105 | 암사3동
230 | 해공경로당 | 02-486-7610 | 천호2동
231 | 구사거리 경로당 | 02-488-0350 | 천호2동
232 | 레미안힐스테이트고덕(남) 어르신사랑방 | 02-481-5575 | 고덕1동
233 | 레미안힐스테이트고덕(여) 어르신사랑방 | 02-481-6675 | 고덕1동
234 | 암사현대홈타운 어르신사랑방 | -- | 암사3동
235 | 올림픽파크 한양수자인 어르신사랑방 | 02-471-9630 | 성내1동
236 | 선린공원 어르신사랑방 | 02-470-1716 | 성내3동
237 | 둔촌2동 어르신사랑방 | 02-478-3328 | 둔촌2동

Duplicate rows

Most frequently occurring

Index | 시설명 | 전화번호 | 동 | # duplicates
0 | 소망요양원 | 02-400-0126 | 고덕1동 | 2
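
Fully duplicated rows can be found by counting whole rows as tuples. The sketch below is a minimal illustration with a hypothetical helper `duplicate_rows`. It uses the duplicated row above plus one other row from the Sample section, and it assumes that "# duplicates" is the number of times the row occurs.

```python
from collections import Counter

def duplicate_rows(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}

rows = [
    ("소망요양원", "02-400-0126", "고덕1동"),
    ("해공경로당", "02-486-7610", "천호2동"),
    ("소망요양원", "02-400-0126", "고덕1동"),
]
dupes = duplicate_rows(rows)
print(dupes)

# Extra copies beyond the first occurrence, as a share of all 238 rows.
print(f"{100 * 1 / 238:.1f}%")  # 0.4%
```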