Overview

Dataset statistics

Number of variables4
Number of observations111
Missing cells58
Missing cells (%)13.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.6 KiB
Average record size in memory33.2 B

Variable types

Categorical2
Text2

Dataset

Description서울특별시 성동구 성동 푸르미 재활용정거장 현황 정보입니다. 관할 동주민센터, 주소, 상세위치, 통 수, 동주민센터 전화번호 등의 정보를 포함하고 있습니다.
Author서울특별시 성동구
URLhttps://www.data.go.kr/data/15089646/fileData.do

Alerts

동명 is highly overall correlated with 동주민센터전화번호High correlation
동주민센터전화번호 is highly overall correlated with 동명High correlation
상세위치 has 58 (52.3%) missing valuesMissing
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-13 00:25:59.916567
Analysis finished2023-12-13 00:26:00.239513
Duration0.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

동명
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)15.3%
Missing0
Missing (%)0.0%
Memory size1020.0 B
송정동
15 
용답동
12 
사근동
10 
금호2,3가동
10 
마장동
10 
Other values (12)
54 

Length

Max length7
Median length6
Mean length4.7117117
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row왕십리도선동
2nd row왕십리도선동
3rd row왕십리도선동
4th row왕십리도선동
5th row왕십리도선동

Common Values

ValueCountFrequency (%)
송정동 15
13.5%
용답동 12
10.8%
사근동 10
9.0%
금호2,3가동 10
9.0%
마장동 10
9.0%
성수2가제1동 9
 
8.1%
성수1가제2동 6
 
5.4%
성수2가3동 5
 
4.5%
왕십리도선동 5
 
4.5%
성수1가1동 5
 
4.5%
Other values (7) 24
21.6%

Length

2023-12-13T09:26:00.297162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
송정동 15
13.5%
용답동 12
10.8%
사근동 10
9.0%
금호2,3가동 10
9.0%
마장동 10
9.0%
성수2가제1동 9
 
8.1%
성수1가제2동 6
 
5.4%
성수1가1동 5
 
4.5%
왕십리제2동 5
 
4.5%
왕십리도선동 5
 
4.5%
Other values (7) 24
21.6%

주소
Text

UNIQUE 

Distinct111
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1020.0 B
2023-12-13T09:26:00.548321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length18.927928
Min length16

Characters and Unicode

Total characters2101
Distinct characters77
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)100.0%

Sample

1st row서울특별시 성동구 마장로23길 12
2nd row서울특별시 성동구 무학로8길 8
3rd row서울특별시 성동구 무학로 10길 33
4th row서울특별시 성동구 무학로6길 29-2
5th row서울특별시 성동구 무학로4길 26-1
ValueCountFrequency (%)
서울특별시 111
25.0%
성동구 111
25.0%
7 5
 
1.1%
1 5
 
1.1%
15 4
 
0.9%
6 3
 
0.7%
송정동 3
 
0.7%
14 3
 
0.7%
8 3
 
0.7%
용답29길 3
 
0.7%
Other values (165) 193
43.5%
2023-12-13T09:26:00.911578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
338
16.1%
130
 
6.2%
122
 
5.8%
116
 
5.5%
112
 
5.3%
111
 
5.3%
111
 
5.3%
111
 
5.3%
111
 
5.3%
1 110
 
5.2%
Other values (67) 729
34.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1338
63.7%
Decimal Number 393
 
18.7%
Space Separator 338
 
16.1%
Dash Punctuation 32
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
130
9.7%
122
9.1%
116
8.7%
112
8.4%
111
8.3%
111
8.3%
111
8.3%
111
8.3%
97
 
7.2%
51
 
3.8%
Other values (55) 266
19.9%
Decimal Number
ValueCountFrequency (%)
1 110
28.0%
2 76
19.3%
5 36
 
9.2%
3 31
 
7.9%
4 29
 
7.4%
9 25
 
6.4%
6 24
 
6.1%
7 23
 
5.9%
0 23
 
5.9%
8 16
 
4.1%
Space Separator
ValueCountFrequency (%)
338
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1338
63.7%
Common 763
36.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
130
9.7%
122
9.1%
116
8.7%
112
8.4%
111
8.3%
111
8.3%
111
8.3%
111
8.3%
97
 
7.2%
51
 
3.8%
Other values (55) 266
19.9%
Common
ValueCountFrequency (%)
338
44.3%
1 110
 
14.4%
2 76
 
10.0%
5 36
 
4.7%
- 32
 
4.2%
3 31
 
4.1%
4 29
 
3.8%
9 25
 
3.3%
6 24
 
3.1%
7 23
 
3.0%
Other values (2) 39
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1338
63.7%
ASCII 763
36.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
338
44.3%
1 110
 
14.4%
2 76
 
10.0%
5 36
 
4.7%
- 32
 
4.2%
3 31
 
4.1%
4 29
 
3.8%
9 25
 
3.3%
6 24
 
3.1%
7 23
 
3.0%
Other values (2) 39
 
5.1%
Hangul
ValueCountFrequency (%)
130
9.7%
122
9.1%
116
8.7%
112
8.4%
111
8.3%
111
8.3%
111
8.3%
111
8.3%
97
 
7.2%
51
 
3.8%
Other values (55) 266
19.9%

상세위치
Text

MISSING 

Distinct48
Distinct (%)90.6%
Missing58
Missing (%)52.3%
Memory size1020.0 B
2023-12-13T09:26:01.089168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length7.3207547
Min length2

Characters and Unicode

Total characters388
Distinct characters128
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)84.9%

Sample

1st row보건소후문
2nd row도선동공영주차장 입구 옆
3rd row아롱다롱 앞
4th row서주우유 앞
5th row팔각심인당 주차장 후문
ValueCountFrequency (%)
19
 
17.9%
맞은편 11
 
10.4%
건너편 6
 
5.7%
주차장 5
 
4.7%
인도 3
 
2.8%
공원 3
 
2.8%
2
 
1.9%
2
 
1.9%
부분 2
 
1.9%
계단 2
 
1.9%
Other values (49) 51
48.1%
2023-12-13T09:26:01.376767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
65
 
16.8%
21
 
5.4%
18
 
4.6%
15
 
3.9%
11
 
2.8%
11
 
2.8%
11
 
2.8%
10
 
2.6%
9
 
2.3%
7
 
1.8%
Other values (118) 210
54.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 320
82.5%
Space Separator 65
 
16.8%
Decimal Number 2
 
0.5%
Dash Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
6.6%
18
 
5.6%
15
 
4.7%
11
 
3.4%
11
 
3.4%
11
 
3.4%
10
 
3.1%
9
 
2.8%
7
 
2.2%
7
 
2.2%
Other values (114) 200
62.5%
Decimal Number
ValueCountFrequency (%)
3 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
65
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 320
82.5%
Common 68
 
17.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
6.6%
18
 
5.6%
15
 
4.7%
11
 
3.4%
11
 
3.4%
11
 
3.4%
10
 
3.1%
9
 
2.8%
7
 
2.2%
7
 
2.2%
Other values (114) 200
62.5%
Common
ValueCountFrequency (%)
65
95.6%
3 1
 
1.5%
- 1
 
1.5%
2 1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 320
82.5%
ASCII 68
 
17.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
65
95.6%
3 1
 
1.5%
- 1
 
1.5%
2 1
 
1.5%
Hangul
ValueCountFrequency (%)
21
 
6.6%
18
 
5.6%
15
 
4.7%
11
 
3.4%
11
 
3.4%
11
 
3.4%
10
 
3.1%
9
 
2.8%
7
 
2.2%
7
 
2.2%
Other values (114) 200
62.5%

동주민센터전화번호
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)15.3%
Missing0
Missing (%)0.0%
Memory size1020.0 B
02-2286-7509
15 
02-2286-7535
12 
02-2286-7271
10 
02-2286-7371
10 
02-2286-7560
10 
Other values (12)
54 

Length

Max length13
Median length12
Mean length12.045045
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row02-2286-7204
2nd row02-2286-7204
3rd row02-2286-7204
4th row02-2286-7204
5th row02-2286-7204

Common Values

ValueCountFrequency (%)
02-2286-7509 15
13.5%
02-2286-7535 12
10.8%
02-2286-7271 10
9.0%
02-2286-7371 10
9.0%
02-2286-7560 10
9.0%
02-2286-7741 9
 
8.1%
02-2286-7459 6
 
5.4%
02-2286-7494 5
 
4.5%
02-2286-7204 5
 
4.5%
02-2286-7433 5
 
4.5%
Other values (7) 24
21.6%

Length

2023-12-13T09:26:01.479534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
02-2286-7509 15
13.5%
02-2286-7535 12
10.8%
02-2286-7271 10
9.0%
02-2286-7371 10
9.0%
02-2286-7560 10
9.0%
02-2286-7741 9
 
8.1%
02-2286-7459 6
 
5.4%
02-2286-7433 5
 
4.5%
02-2286-7644 5
 
4.5%
02-2286-7204 5
 
4.5%
Other values (7) 24
21.6%

Correlations

2023-12-13T09:26:01.538389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동명상세위치동주민센터전화번호
동명1.0000.3181.000
상세위치0.3181.0000.318
동주민센터전화번호1.0000.3181.000
2023-12-13T09:26:01.604617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동명동주민센터전화번호
동명1.0001.000
동주민센터전화번호1.0001.000
2023-12-13T09:26:01.666968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동명동주민센터전화번호
동명1.0001.000
동주민센터전화번호1.0001.000

Missing values

2023-12-13T09:26:00.143437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T09:26:00.214555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

동명주소상세위치동주민센터전화번호
0왕십리도선동서울특별시 성동구 마장로23길 12보건소후문02-2286-7204
1왕십리도선동서울특별시 성동구 무학로8길 8도선동공영주차장 입구 옆02-2286-7204
2왕십리도선동서울특별시 성동구 무학로 10길 33아롱다롱 앞02-2286-7204
3왕십리도선동서울특별시 성동구 무학로6길 29-2서주우유 앞02-2286-7204
4왕십리도선동서울특별시 성동구 무학로4길 26-1팔각심인당 주차장 후문02-2286-7204
5왕십리제2동서울특별시 성동구 왕십리로 387<NA>02-2286-7644
6왕십리제2동서울특별시 성동구 무학봉15길 22<NA>02-2286-7644
7왕십리제2동서울특별시 성동구 무학봉25길 6<NA>02-2286-7644
8왕십리제2동서울특별시 성동구 무학봉길 69<NA>02-2286-7644
9왕십리제2동서울특별시 성동구 무학봉11길 14<NA>02-2286-7644
동명주소상세위치동주민센터전화번호
101용답동서울특별시 성동구 용답길 133건너편02-2286-7535
102용답동서울특별시 성동구 용답15길 3<NA>02-2286-7535
103용답동서울특별시 성동구 용답25길 18-1<NA>02-2286-7535
104용답동서울특별시 성동구 용답중앙길 42서윤피부샵 건너편02-2286-7535
105용답동서울특별시 성동구 용답15가길어울림공영주차장02-2286-7535
106용답동서울특별시 성동구 용답29길 28<NA>02-2286-7535
107용답동서울특별시 성동구 용답23길 1건너편 전봇대02-2286-7535
108용답동서울특별시 성동구 용답1길 1<NA>02-2286-7535
109용답동서울특별시 성동구 용답중앙11길 17-1<NA>02-2286-7535
110용답동서울특별시 성동구 용답25길 2<NA>02-2286-7535