Overview

Dataset statistics

Number of variables3
Number of observations44
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory27.0 B

Variable types

Text2
Categorical1

Dataset

Description시설명,전화번호,동
Author강동구
URLhttps://data.seoul.go.kr/dataList/OA-12641/S/1/datasetView.do

Reproduction

Analysis started2023-12-11 07:13:02.660407
Analysis finished2023-12-11 07:13:03.007278
Duration0.35 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct43
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size484.0 B
2023-12-11T16:13:03.210050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length9.75
Min length5

Characters and Unicode

Total characters429
Distinct characters97
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)95.5%

Sample

1st row구립성내지역아동센터
2nd row강동꿈나무지역아동센터
3rd row명성지역아동센터
4th row소리마을지역아동센터
5th row들꽃청소년지역아동센터
ValueCountFrequency (%)
꿈미소 7
 
11.5%
강동구 3
 
4.9%
청소년 2
 
3.3%
구립함께하는지역아동센터 2
 
3.3%
7호점 1
 
1.6%
구립길리청소년지역아동센터 1
 
1.6%
암사지역아동센터 1
 
1.6%
마을학교지역아동센터 1
 
1.6%
구립강동청소년누리터 1
 
1.6%
아동청소년 1
 
1.6%
Other values (41) 41
67.2%
2023-12-11T16:13:03.655044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41
 
9.6%
30
 
7.0%
30
 
7.0%
29
 
6.8%
28
 
6.5%
25
 
5.8%
20
 
4.7%
15
 
3.5%
14
 
3.3%
12
 
2.8%
Other values (87) 185
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 397
92.5%
Space Separator 20
 
4.7%
Decimal Number 10
 
2.3%
Dash Punctuation 1
 
0.2%
Uppercase Letter 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
10.3%
30
 
7.6%
30
 
7.6%
29
 
7.3%
28
 
7.1%
25
 
6.3%
15
 
3.8%
14
 
3.5%
12
 
3.0%
10
 
2.5%
Other values (76) 163
41.1%
Decimal Number
ValueCountFrequency (%)
3 2
20.0%
4 2
20.0%
7 1
10.0%
6 1
10.0%
5 1
10.0%
2 1
10.0%
1 1
10.0%
8 1
10.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
H 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 397
92.5%
Common 31
 
7.2%
Latin 1
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
10.3%
30
 
7.6%
30
 
7.6%
29
 
7.3%
28
 
7.1%
25
 
6.3%
15
 
3.8%
14
 
3.5%
12
 
3.0%
10
 
2.5%
Other values (76) 163
41.1%
Common
ValueCountFrequency (%)
20
64.5%
3 2
 
6.5%
4 2
 
6.5%
7 1
 
3.2%
6 1
 
3.2%
5 1
 
3.2%
2 1
 
3.2%
1 1
 
3.2%
- 1
 
3.2%
8 1
 
3.2%
Latin
ValueCountFrequency (%)
H 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 397
92.5%
ASCII 32
 
7.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
41
 
10.3%
30
 
7.6%
30
 
7.6%
29
 
7.3%
28
 
7.1%
25
 
6.3%
15
 
3.8%
14
 
3.5%
12
 
3.0%
10
 
2.5%
Other values (76) 163
41.1%
ASCII
ValueCountFrequency (%)
20
62.5%
3 2
 
6.2%
4 2
 
6.2%
7 1
 
3.1%
6 1
 
3.1%
5 1
 
3.1%
2 1
 
3.1%
1 1
 
3.1%
- 1
 
3.1%
H 1
 
3.1%
Distinct43
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size484.0 B
2023-12-11T16:13:03.937452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length11.5
Min length11

Characters and Unicode

Total characters506
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)95.5%

Sample

1st row070-4164-1200
2nd row02-478-7200
3rd row02-474-6940
4th row02-472-3557
5th row02-478-0504
ValueCountFrequency (%)
02-429-6900 2
 
4.5%
070-4100-4323 1
 
2.3%
070-7708-0308 1
 
2.3%
02-478-6336 1
 
2.3%
02-481-9711 1
 
2.3%
02-488-0816 1
 
2.3%
02-429-4171 1
 
2.3%
02-6252-1388 1
 
2.3%
02-6252-1329 1
 
2.3%
02-483-1318 1
 
2.3%
Other values (33) 33
75.0%
2023-12-11T16:13:04.444589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 88
17.4%
0 85
16.8%
2 72
14.2%
4 58
11.5%
8 42
8.3%
7 36
7.1%
1 33
 
6.5%
3 27
 
5.3%
6 25
 
4.9%
5 21
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 418
82.6%
Dash Punctuation 88
 
17.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 85
20.3%
2 72
17.2%
4 58
13.9%
8 42
10.0%
7 36
8.6%
1 33
 
7.9%
3 27
 
6.5%
6 25
 
6.0%
5 21
 
5.0%
9 19
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 88
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 506
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 88
17.4%
0 85
16.8%
2 72
14.2%
4 58
11.5%
8 42
8.3%
7 36
7.1%
1 33
 
6.5%
3 27
 
5.3%
6 25
 
4.9%
5 21
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 506
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 88
17.4%
0 85
16.8%
2 72
14.2%
4 58
11.5%
8 42
8.3%
7 36
7.1%
1 33
 
6.5%
3 27
 
5.3%
6 25
 
4.9%
5 21
 
4.2%


Categorical

Distinct18
Distinct (%)40.9%
Missing0
Missing (%)0.0%
Memory size484.0 B
길동
암사1동
천호2동
강일동
천호1동
Other values (13)
21 

Length

Max length4
Median length4
Mean length3.5681818
Min length2

Unique

Unique6 ?
Unique (%)13.6%

Sample

1st row성내2동
2nd row천호2동
3rd row천호1동
4th row천호2동
5th row천호2동

Common Values

ValueCountFrequency (%)
길동 7
15.9%
암사1동 5
11.4%
천호2동 4
 
9.1%
강일동 4
 
9.1%
천호1동 3
 
6.8%
둔촌2동 3
 
6.8%
성내3동 2
 
4.5%
명일2동 2
 
4.5%
성내1동 2
 
4.5%
상일2동 2
 
4.5%
Other values (8) 10
22.7%

Length

2023-12-11T16:13:04.678420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
길동 7
15.9%
암사1동 5
11.4%
천호2동 4
 
9.1%
강일동 4
 
9.1%
천호1동 3
 
6.8%
둔촌2동 3
 
6.8%
성내2동 2
 
4.5%
명일1동 2
 
4.5%
상일2동 2
 
4.5%
성내1동 2
 
4.5%
Other values (8) 10
22.7%

Correlations

2023-12-11T16:13:04.804147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설명전화번호
시설명1.0001.0000.547
전화번호1.0001.0000.547
0.5470.5471.000

Missing values

2023-12-11T16:13:02.886934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T16:13:02.971051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시설명전화번호
0구립성내지역아동센터070-4164-1200성내2동
1강동꿈나무지역아동센터02-478-7200천호2동
2명성지역아동센터02-474-6940천호1동
3소리마을지역아동센터02-472-3557천호2동
4들꽃청소년지역아동센터02-478-0504천호2동
5숲과나무지역아동센터02-481-7179암사1동
6아름다운행복한홈스쿨지역아동센터02-3427-4415명일1동
7돋움지역아동센터02-6081-5252고덕1동
8서울중심지역아동센터02-488-1067성내2동
9동서울지역아동복지센터02-478-3673둔촌2동
시설명전화번호
34구립함께하는지역아동센터02-429-6900상일2동
35꿈미소 1호점02-482-9888길동
36꿈미호 2호점070-4236-2583암사1동
37꿈미소 3호점02-484-4663천호2동
38꿈미소 4호점070-4244-0809명일2동
39꿈미소 5호점02-473-8228길동
40꿈미소 6호점070-4251-9264암사1동
41꿈미소 7호점02-489-6536둔촌2동
42꿈미소 8호점02-428-1318상일1동
43강동구 아동청소년 미래본부02-470-4415성내1동