Overview

Dataset statistics

Number of variables: 6
Number of observations: 172
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 2
Duplicate rows (%): 1.2%
Total size in memory: 8.2 KiB
Average record size in memory: 48.8 B
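
The percentages and the average record size above follow from the raw counts. A minimal sketch of that arithmetic, using only figures quoted in this overview; the variable names are illustrative, and the total size is the rounded 8.2 KiB shown in the report, so the result is approximate:

```python
# Figures copied from the "Dataset statistics" block of this report.
n_rows = 172          # number of observations
n_cols = 6            # number of variables
n_missing = 0         # missing cells
n_duplicates = 2      # duplicate rows
total_kib = 8.2       # total size in memory (rounded value from the report)

# Share of missing cells over all cells (rows x columns).
missing_pct = round(100 * n_missing / (n_rows * n_cols), 1)

# Share of duplicate rows over all rows.
duplicate_pct = round(100 * n_duplicates / n_rows, 1)

# Average record size in bytes: total bytes divided by number of rows.
avg_record_bytes = round(total_kib * 1024 / n_rows, 1)

print(missing_pct, duplicate_pct, avg_record_bytes)
```

The three results round to the 0.0%, 1.2% and 48.8 B reported above.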

Variable types

Categorical: 4
Text: 2

Dataset

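Description: Status information on street litter bins (가로휴지통) in Seongdong-gu, Seoul. It includes the street (road) name, address, detailed location, type of waste collected, bin type, collection staff, and related fields.
URL: https://www.data.go.kr/data/15038230/fileData.do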

Alerts

수거 담당자 has constant value "가로환경미화원" (Constant)
Dataset has 2 (1.2%) duplicate rows (Duplicates)
수거쓰레기종류 is highly overall correlated with 형태 (High correlation)
형태 is highly overall correlated with 수거쓰레기종류 (High correlation)

Reproduction

Analysis started: 2023-12-12 18:35:16.942987
Analysis finished: 2023-12-12 18:35:17.471690
Duration: 0.53 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

가로명
Categorical

Distinct: 24
Distinct (%): 14.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

왕십리로: 34
독서당로: 23
고산자로: 20
동일로: 18
광나루로: 12
Other values (19): 65

Length

Max length: 8
Median length: 4
Mean length: 3.744186
Min length: 3

Unique

Unique: 9
Unique (%): 5.2%

Sample

1st row고산자로
2nd row고산자로
3rd row마장로
4th row사근동길
5th row사근동길

Common Values

Value | Count | Frequency (%)
왕십리로 | 34 | 19.8%
독서당로 | 23 | 13.4%
고산자로 | 20 | 11.6%
동일로 | 18 | 10.5%
광나루로 | 12 | 7.0%
금호로 | 12 | 7.0%
아차산로 | 10 | 5.8%
천호대로 | 8 | 4.7%
마조로 | 5 | 2.9%
행당로 | 5 | 2.9%
Other values (14) | 25 | 14.5%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
왕십리로 | 34 | 19.8%
독서당로 | 23 | 13.4%
고산자로 | 20 | 11.6%
동일로 | 18 | 10.5%
금호로 | 13 | 7.6%
광나루로 | 12 | 7.0%
아차산로 | 10 | 5.8%
천호대로 | 8 | 4.7%
마조로 | 5 | 2.9%
행당로 | 5 | 2.9%
Other values (13) | 24 | 14.0%

주소
Text

Distinct: 158
Distinct (%): 91.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

Length

Max length: 24
Median length: 22
Mean length: 18.912791
Min length: 12

Characters and Unicode

Total characters: 3253
Distinct characters: 71
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 144
Unique (%): 83.7%

Sample

1st row서울특별시 성동구 도선동 15-3
2nd row서울특별시 성동구 도선동 409
3rd row서울특별시 성동구 마장동 792
4th row서울특별시 성동구 사근동 33-10
5th row서울특별시 성동구 행당동 1-13
Value | Count | Frequency (%)
서울특별시 | 172 | 26.2%
성동구 | 172 | 26.2%
행당동 | 14 | 2.1%
성수동2가 | 13 | 2.0%
성수동1가 | 10 | 1.5%
광나루로 | 9 | 1.4%
왕십리로 | 9 | 1.4%
금호로 | 7 | 1.1%
독서당로 | 5 | 0.8%
송정동 | 5 | 0.8%
Other values (187) | 241 | 36.7%

Most occurring characters

ValueCountFrequency (%)
516
15.9%
280
 
8.6%
205
 
6.3%
177
 
5.4%
172
 
5.3%
172
 
5.3%
172
 
5.3%
172
 
5.3%
172
 
5.3%
1 135
 
4.2%
Other values (61) 1080
33.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1978 | 60.8%
Decimal Number | 667 | 20.5%
Space Separator | 516 | 15.9%
Dash Punctuation | 92 | 2.8%
2.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
280
14.2%
205
10.4%
177
8.9%
172
8.7%
172
8.7%
172
8.7%
172
8.7%
172
8.7%
57
 
2.9%
48
 
2.4%
Other values (49) 351
17.7%
Decimal Number
ValueCountFrequency (%)
1 135
20.2%
2 115
17.2%
3 74
11.1%
5 64
9.6%
6 57
8.5%
7 49
 
7.3%
0 48
 
7.2%
9 45
 
6.7%
4 41
 
6.1%
8 39
 
5.8%
Space Separator
ValueCountFrequency (%)
516
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 92
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1978
60.8%
Common 1275
39.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
280
14.2%
205
10.4%
177
8.9%
172
8.7%
172
8.7%
172
8.7%
172
8.7%
172
8.7%
57
 
2.9%
48
 
2.4%
Other values (49) 351
17.7%
Common
ValueCountFrequency (%)
516
40.5%
1 135
 
10.6%
2 115
 
9.0%
- 92
 
7.2%
3 74
 
5.8%
5 64
 
5.0%
6 57
 
4.5%
7 49
 
3.8%
0 48
 
3.8%
9 45
 
3.5%
Other values (2) 80
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1978
60.8%
ASCII 1275
39.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
516
40.5%
1 135
 
10.6%
2 115
 
9.0%
- 92
 
7.2%
3 74
 
5.8%
5 64
 
5.0%
6 57
 
4.5%
7 49
 
3.8%
0 48
 
3.8%
9 45
 
3.5%
Other values (2) 80
 
6.3%
Hangul
ValueCountFrequency (%)
280
14.2%
205
10.4%
177
8.9%
172
8.7%
172
8.7%
172
8.7%
172
8.7%
172
8.7%
57
 
2.9%
48
 
2.4%
Other values (49) 351
17.7%

세부위치
Text

Distinct: 97
Distinct (%): 56.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

Length

Max length: 23
Median length: 16
Mean length: 8.4069767
Min length: 3

Characters and Unicode

Total characters: 1446
Distinct characters: 216
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 79
Unique (%): 45.9%

Sample

1st row용호공사 옆 거창식당 앞
2nd row세븐일레븐 앞
3rd row마장갈비 앞
4th row모닝글로리 서점 앞
5th row왕십리역 6번출구 위 공원
ValueCountFrequency (%)
84
21.7%
버스정류장 31
 
8.0%
28
 
7.2%
횡단보도 27
 
7.0%
정류장 19
 
4.9%
버스 16
 
4.1%
편의점 9
 
2.3%
주변 8
 
2.1%
도로(가로)변 4
 
1.0%
3번출구 4
 
1.0%
Other values (120) 157
40.6%

Most occurring characters

ValueCountFrequency (%)
224
 
15.5%
89
 
6.2%
59
 
4.1%
54
 
3.7%
54
 
3.7%
53
 
3.7%
51
 
3.5%
34
 
2.4%
30
 
2.1%
30
 
2.1%
Other values (206) 768
53.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1135
78.5%
Space Separator 224
 
15.5%
Decimal Number 57
 
3.9%
Uppercase Letter 11
 
0.8%
Open Punctuation 7
 
0.5%
Close Punctuation 7
 
0.5%
Other Punctuation 4
 
0.3%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
 
7.8%
59
 
5.2%
54
 
4.8%
54
 
4.8%
53
 
4.7%
51
 
4.5%
34
 
3.0%
30
 
2.6%
30
 
2.6%
28
 
2.5%
Other values (184) 653
57.5%
Decimal Number
ValueCountFrequency (%)
3 18
31.6%
1 14
24.6%
2 9
15.8%
0 4
 
7.0%
6 4
 
7.0%
7 3
 
5.3%
5 3
 
5.3%
8 1
 
1.8%
4 1
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
C 3
27.3%
U 2
18.2%
S 1
 
9.1%
H 1
 
9.1%
N 1
 
9.1%
I 1
 
9.1%
K 1
 
9.1%
B 1
 
9.1%
Space Separator
ValueCountFrequency (%)
224
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1135
78.5%
Common 300
 
20.7%
Latin 11
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
 
7.8%
59
 
5.2%
54
 
4.8%
54
 
4.8%
53
 
4.7%
51
 
4.5%
34
 
3.0%
30
 
2.6%
30
 
2.6%
28
 
2.5%
Other values (184) 653
57.5%
Common
ValueCountFrequency (%)
224
74.7%
3 18
 
6.0%
1 14
 
4.7%
2 9
 
3.0%
( 7
 
2.3%
) 7
 
2.3%
, 4
 
1.3%
0 4
 
1.3%
6 4
 
1.3%
7 3
 
1.0%
Other values (4) 6
 
2.0%
Latin
ValueCountFrequency (%)
C 3
27.3%
U 2
18.2%
S 1
 
9.1%
H 1
 
9.1%
N 1
 
9.1%
I 1
 
9.1%
K 1
 
9.1%
B 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1135
78.5%
ASCII 311
 
21.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
224
72.0%
3 18
 
5.8%
1 14
 
4.5%
2 9
 
2.9%
( 7
 
2.3%
) 7
 
2.3%
, 4
 
1.3%
0 4
 
1.3%
6 4
 
1.3%
C 3
 
1.0%
Other values (12) 17
 
5.5%
Hangul
ValueCountFrequency (%)
89
 
7.8%
59
 
5.2%
54
 
4.8%
54
 
4.8%
53
 
4.7%
51
 
4.5%
34
 
3.0%
30
 
2.6%
30
 
2.6%
28
 
2.5%
Other values (184) 653
57.5%

수거쓰레기종류
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 4.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

일반쓰레기: 74
담배꽁초 수거용: 36
담배꽁초, 음료컵: 35
일반+담배꽁초: 20
재활용쓰레기 수거용: 3
Other values (3): 4

Length

Max length: 11
Median length: 10
Mean length: 6.9709302
Min length: 5

Unique

Unique: 2
Unique (%): 1.2%

Sample

1st row일반쓰레기
2nd row일반+담배꽁초
3rd row일반쓰레기
4th row일반쓰레기
5th row일반쓰레기

Common Values

Value | Count | Frequency (%)
일반쓰레기 | 74 | 43.0%
담배꽁초 수거용 | 36 | 20.9%
담배꽁초, 음료컵 | 35 | 20.3%
일반+담배꽁초 | 20 | 11.6%
재활용쓰레기 수거용 | 3 | 1.7%
재활용쓰레기 수거용 | 2 | 1.2%
일반쓰레기 | 1 | 0.6%
일반+담배꽁초 | 1 | 0.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
일반쓰레기 | 75 | 30.2%
담배꽁초 | 71 | 28.6%
수거용 | 41 | 16.5%
음료컵 | 35 | 14.1%
일반+담배꽁초 | 21 | 8.5%
재활용쓰레기 | 5 | 2.0%
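
The word counts above can be derived from the common values of this column. The sketch below is one plausible reading, not ydata-profiling's actual tokenizer: it assumes each value is split on whitespace and that punctuation is stripped from the ends of each token. It also assumes the visually identical entries in the common-values table (for example the two 재활용쓰레기 수거용 rows) differ only in whitespace, so their counts are merged. The function name `word_counts` is illustrative.

```python
import string
from collections import Counter

# Counts from the Common Values table, with visually identical
# entries merged (assumption: they differ only in whitespace).
value_counts = {
    "일반쓰레기": 74 + 1,
    "담배꽁초 수거용": 36,
    "담배꽁초, 음료컵": 35,
    "일반+담배꽁초": 20 + 1,
    "재활용쓰레기 수거용": 3 + 2,
}

def word_counts(counts_by_value):
    """Split each value on whitespace and tally tokens, weighted by count."""
    words = Counter()
    for value, count in counts_by_value.items():
        for token in value.split():
            # Strip punctuation only at the token edges, so "담배꽁초," becomes
            # "담배꽁초" while "일반+담배꽁초" stays intact.
            token = token.strip(string.punctuation)
            if token:
                words[token] += count
    return words

words = word_counts(value_counts)
total = sum(words.values())
for word, n in words.most_common():
    print(word, n, f"{100 * n / total:.1f}%")
```

Under these assumptions the tallies match the table: 담배꽁초 collects 36 + 35 = 71 occurrences and 수거용 collects 36 + 5 = 41, out of 248 tokens in total.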

형태
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

일반 사각 쓰레기통: 96
원형 쓰레기통: 71
분리수거 사각쓰레기통: 5

Length

Max length: 11
Median length: 10
Mean length: 8.7906977
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row일반 사각 쓰레기통
2nd row일반 사각 쓰레기통
3rd row일반 사각 쓰레기통
4th row일반 사각 쓰레기통
5th row일반 사각 쓰레기통

Common Values

Value | Count | Frequency (%)
일반 사각 쓰레기통 | 96 | 55.8%
원형 쓰레기통 | 71 | 41.3%
분리수거 사각쓰레기통 | 5 | 2.9%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
쓰레기통 | 167 | 38.0%
일반 | 96 | 21.8%
사각 | 96 | 21.8%
원형 | 71 | 16.1%
분리수거 | 5 | 1.1%
사각쓰레기통 | 5 | 1.1%

수거 담당자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.5 KiB

가로환경미화원: 172

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row가로환경미화원
2nd row가로환경미화원
3rd row가로환경미화원
4th row가로환경미화원
5th row가로환경미화원

Common Values

Value | Count | Frequency (%)
가로환경미화원 | 172 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
가로환경미화원 | 172 | 100.0%

Correlations

 | 가로명 | 세부위치 | 수거쓰레기종류 | 형태
가로명 | 1.000 | 0.000 | 0.591 | 0.419
세부위치 | 0.000 | 1.000 | 0.940 | 0.830
수거쓰레기종류 | 0.591 | 0.940 | 1.000 | 1.000
형태 | 0.419 | 0.830 | 1.000 | 1.000

 | 가로명 | 수거쓰레기종류 | 형태
가로명 | 1.000 | 0.236 | 0.199
수거쓰레기종류 | 0.236 | 1.000 | 0.985
형태 | 0.199 | 0.985 | 1.000
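
The report does not label which association measure these matrices use. ydata-profiling documents Cramér's V for pairs of categorical variables in its default "auto" correlation, so, assuming that is the measure behind the 수거쓰레기종류 and 형태 values, the sketch below shows the plain (uncorrected) statistic. It is an illustration only: the function name `cramers_v` and the toy inputs are hypothetical, and it is not expected to reproduce the exact figures above.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed counts per (x, y) pair
    row = Counter(xs)              # marginal counts of x
    col = Counter(ys)              # marginal counts of y

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row), len(col)) - 1
    if k == 0:
        # A constant variable carries no association information.
        return 0.0
    return sqrt(chi2 / (n * k))

# Toy data: bin shape fully determines waste type in the first case.
perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(perfect, independent)
```

A value near 1, like the 0.985 reported for 수거쓰레기종류 and 형태, means that knowing one variable almost determines the other, which fits the data: round bins are used for cigarette butts and drink cups, square bins for general waste.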

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

 | 가로명 | 주소 | 세부위치 | 수거쓰레기종류 | 형태 | 수거 담당자
0 | 고산자로 | 서울특별시 성동구 도선동 15-3 | 용호공사 옆 거창식당 앞 | 일반쓰레기 | 일반 사각 쓰레기통 | 가로환경미화원
1 | 고산자로 | 서울특별시 성동구 도선동 409 | 세븐일레븐 앞 | 일반+담배꽁초 | 일반 사각 쓰레기통 | 가로환경미화원
2 | 마장로 | 서울특별시 성동구 마장동 792 | 마장갈비 앞 | 일반쓰레기 | 일반 사각 쓰레기통 | 가로환경미화원
3 | 사근동길 | 서울특별시 성동구 사근동 33-10 | 모닝글로리 서점 앞 | 일반쓰레기 | 일반 사각 쓰레기통 | 가로환경미화원
4 | 사근동길 | 서울특별시 성동구 행당동 1-13 | 왕십리역 6번출구 위 공원 | 일반쓰레기 | 일반 사각 쓰레기통 | 가로환경미화원
5 | 사근동길 | 서울특별시 성동구 행당동 1-13 | 왕십리역 6번출구 위 공원 | 일반쓰레기 | 일반 사각 쓰레기통 | 가로환경미화원
6 | 왕십리로 | 서울특별시 성동구 하왕십리동 966-13 | 왕십리로327 국민은행 앞 | 일반+담배꽁초 | 일반 사각 쓰레기통 | 가로환경미화원
7 | 왕십리로 | 서울특별시 성동구 도선동 256-1 | 왕십리로 332-1 유노헤어샾 | 일반+담배꽁초 | 일반 사각 쓰레기통 | 가로환경미화원
8 | 왕십리로 | 서울특별시 성동구 행당동 284 | 왕십리로 303 성동우체국앞 | 일반쓰레기 | 일반 사각 쓰레기통 | 가로환경미화원
9 | 왕십리로 | 서울특별시 성동구 행당동 284 | 왕십리로 303 성동우체국앞 | 재활용쓰레기 수거용 | 분리수거 사각쓰레기통 | 가로환경미화원

 | 가로명 | 주소 | 세부위치 | 수거쓰레기종류 | 형태 | 수거 담당자
162 | 뚝섬로 | 서울특별시 성동구 뚝섬로 452 | 버스정류장 | 담배꽁초, 음료컵 | 원형 쓰레기통 | 가로환경미화원
163 | 아차산로 | 서울특별시 성동구 아차산로 120 | 전철역 | 담배꽁초, 음료컵 | 원형 쓰레기통 | 가로환경미화원
164 | 광나루로 | 서울특별시 성동구 광나루로 328 | 버스정류장 | 담배꽁초, 음료컵 | 원형 쓰레기통 | 가로환경미화원
165 | 광나루로 | 서울특별시 성동구 광나루로 297 | 버스정류장 | 담배꽁초, 음료컵 | 원형 쓰레기통 | 가로환경미화원
166 | 광나루로 | 서울특별시 성동구 광나루로 319 | 횡단보도 | 담배꽁초, 음료컵 | 원형 쓰레기통 | 가로환경미화원
167 | 광나루로 | 서울특별시 성동구 광나루로 190 | 버스정류장 | 담배꽁초, 음료컵 | 원형 쓰레기통 | 가로환경미화원
168 | 광나루로 | 서울특별시 성동구 광나루로 142-1 | 버스정류장 | 담배꽁초, 음료컵 | 원형 쓰레기통 | 가로환경미화원
169 | 광나루로 | 서울특별시 성동구 광나루로 184 | 버스정류장 | 담배꽁초, 음료컵 | 원형 쓰레기통 | 가로환경미화원
170 | 살곶이 | 서울특별시 성동구 살곶이 8길 22 | 버스정류장 | 담배꽁초, 음료컵 | 원형 쓰레기통 | 가로환경미화원
171 | 광나루로 | 서울특별시 성동구 광나루로 297 | 버스정류장 | 담배꽁초, 음료컵 | 원형 쓰레기통 | 가로환경미화원

Duplicate rows

Most frequently occurring

 | 가로명 | 주소 | 세부위치 | 수거쓰레기종류 | 형태 | 수거 담당자 | # duplicates
0 | 광나루로 | 서울특별시 성동구 광나루로 297 | 버스정류장 | 담배꽁초, 음료컵 | 원형 쓰레기통 | 가로환경미화원 | 2
1 | 사근동길 | 서울특별시 성동구 행당동 1-13 | 왕십리역 6번출구 위 공원 | 일반쓰레기 | 일반 사각 쓰레기통 | 가로환경미화원 | 2
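
The two duplicated records can be found by counting identical rows. The following is a minimal standard-library sketch, not the profiler's own implementation: it uses five rows taken from the Sample section (indices 3, 4, 5, 165 and 171), and the names `rows` and `duplicated_groups` are chosen for this example.

```python
from collections import Counter

# Rows copied from the Sample section; each tuple is
# (가로명, 주소, 세부위치, 수거쓰레기종류, 형태, 수거 담당자).
rows = [
    ("사근동길", "서울특별시 성동구 사근동 33-10", "모닝글로리 서점 앞",
     "일반쓰레기", "일반 사각 쓰레기통", "가로환경미화원"),
    ("사근동길", "서울특별시 성동구 행당동 1-13", "왕십리역 6번출구 위 공원",
     "일반쓰레기", "일반 사각 쓰레기통", "가로환경미화원"),
    ("사근동길", "서울특별시 성동구 행당동 1-13", "왕십리역 6번출구 위 공원",
     "일반쓰레기", "일반 사각 쓰레기통", "가로환경미화원"),
    ("광나루로", "서울특별시 성동구 광나루로 297", "버스정류장",
     "담배꽁초, 음료컵", "원형 쓰레기통", "가로환경미화원"),
    ("광나루로", "서울특별시 성동구 광나루로 297", "버스정류장",
     "담배꽁초, 음료컵", "원형 쓰레기통", "가로환경미화원"),
]

# Count how often each complete row occurs.
row_counts = Counter(rows)

# Keep only rows that occur more than once, with their occurrence count.
duplicated_groups = {row: n for row, n in row_counts.items() if n > 1}

# Number of distinct duplicated rows, and number of redundant copies.
n_groups = len(duplicated_groups)
n_extra = sum(n - 1 for n in duplicated_groups.values())

for row, n in duplicated_groups.items():
    print(row[0], row[1], n)
print(n_groups, n_extra)
```

For this dataset both ways of counting give 2, which matches the "2 (1.2%) duplicate rows" alert. Before further analysis it is worth checking whether these are genuine double entries or two physical bins at the same spot.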