Overview

Dataset statistics

Number of variables5
Number of observations706
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory28.4 KiB
Average record size in memory41.2 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description종량제쓰레기를 배출하기 위해 사용하는 종량제봉투를 구로구 관내에서 판매하고 있는 판매소의 위치 목록입니다. 구로구 청소행정과(02-860-2904)에서 관리하고 있는 자료입니다.
Author서울특별시 구로구
URLhttps://www.data.go.kr/data/15113047/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 행정동High correlation
행정동 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-06 08:51:32.347086
Analysis finished2024-04-06 08:51:34.103294
Duration1.76 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct706
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean353.5
Minimum1
Maximum706
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.3 KiB
2024-04-06T17:51:34.338512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile36.25
Q1177.25
median353.5
Q3529.75
95-th percentile670.75
Maximum706
Range705
Interquartile range (IQR)352.5

Descriptive statistics

Standard deviation203.94893
Coefficient of variation (CV)0.57694181
Kurtosis-1.2
Mean353.5
Median Absolute Deviation (MAD)176.5
Skewness0
Sum249571
Variance41595.167
MonotonicityStrictly increasing
2024-04-06T17:51:34.942878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
531 1
 
0.1%
467 1
 
0.1%
468 1
 
0.1%
469 1
 
0.1%
470 1
 
0.1%
471 1
 
0.1%
472 1
 
0.1%
473 1
 
0.1%
474 1
 
0.1%
Other values (696) 696
98.6%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
706 1
0.1%
705 1
0.1%
704 1
0.1%
703 1
0.1%
702 1
0.1%
701 1
0.1%
700 1
0.1%
699 1
0.1%
698 1
0.1%
697 1
0.1%

행정동
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size5.6 KiB
구로2동
91 
구로3동
84 
구로4동
63 
개봉1동
56 
구로5동
53 
Other values (11)
359 

Length

Max length4
Median length4
Mean length3.9107649
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가리봉동
2nd row가리봉동
3rd row가리봉동
4th row가리봉동
5th row가리봉동

Common Values

ValueCountFrequency (%)
구로2동 91
12.9%
구로3동 84
11.9%
구로4동 63
8.9%
개봉1동 56
 
7.9%
구로5동 53
 
7.5%
가리봉동 45
 
6.4%
고척2동 42
 
5.9%
오류2동 40
 
5.7%
신도림동 39
 
5.5%
고척1동 38
 
5.4%
Other values (6) 155
22.0%

Length

2024-04-06T17:51:35.471899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
구로2동 91
12.9%
구로3동 84
11.9%
구로4동 63
8.9%
개봉1동 56
 
7.9%
구로5동 53
 
7.5%
가리봉동 45
 
6.4%
고척2동 42
 
5.9%
오류2동 40
 
5.7%
신도림동 39
 
5.5%
고척1동 38
 
5.4%
Other values (6) 155
22.0%
Distinct678
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size5.6 KiB
2024-04-06T17:51:36.230451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length8.3002833
Min length3

Characters and Unicode

Total characters5860
Distinct characters374
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique653 ?
Unique (%)92.5%

Sample

1st row남구로슈퍼
2nd row영등포농협 하나로마트 남구로역점
3rd row지에스25 디지털오거리점
4th row씨유 디지털오거리점
5th row고향마트
ValueCountFrequency (%)
씨유 101
 
9.0%
세븐일레븐 86
 
7.7%
gs25 77
 
6.9%
지에스25 44
 
3.9%
이마트24 30
 
2.7%
구로점 9
 
0.8%
cu 7
 
0.6%
개봉점 7
 
0.6%
스토리웨이 6
 
0.5%
중국식품 5
 
0.4%
Other values (668) 752
66.9%
2024-04-06T17:51:38.263541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
421
 
7.2%
408
 
7.0%
232
 
4.0%
224
 
3.8%
214
 
3.7%
207
 
3.5%
173
 
3.0%
2 170
 
2.9%
5 126
 
2.2%
123
 
2.1%
Other values (364) 3562
60.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4826
82.4%
Space Separator 421
 
7.2%
Decimal Number 345
 
5.9%
Uppercase Letter 242
 
4.1%
Close Punctuation 11
 
0.2%
Open Punctuation 11
 
0.2%
Lowercase Letter 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%
Control 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
408
 
8.5%
232
 
4.8%
224
 
4.6%
214
 
4.4%
207
 
4.3%
173
 
3.6%
123
 
2.5%
117
 
2.4%
114
 
2.4%
108
 
2.2%
Other values (330) 2906
60.2%
Uppercase Letter
ValueCountFrequency (%)
S 85
35.1%
G 82
33.9%
C 19
 
7.9%
U 9
 
3.7%
D 8
 
3.3%
R 7
 
2.9%
N 5
 
2.1%
K 5
 
2.1%
M 4
 
1.7%
L 3
 
1.2%
Other values (9) 15
 
6.2%
Decimal Number
ValueCountFrequency (%)
2 170
49.3%
5 126
36.5%
4 34
 
9.9%
1 10
 
2.9%
7 2
 
0.6%
3 1
 
0.3%
0 1
 
0.3%
8 1
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
k 1
50.0%
s 1
50.0%
Space Separator
ValueCountFrequency (%)
421
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4826
82.4%
Common 790
 
13.5%
Latin 244
 
4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
408
 
8.5%
232
 
4.8%
224
 
4.6%
214
 
4.4%
207
 
4.3%
173
 
3.6%
123
 
2.5%
117
 
2.4%
114
 
2.4%
108
 
2.2%
Other values (330) 2906
60.2%
Latin
ValueCountFrequency (%)
S 85
34.8%
G 82
33.6%
C 19
 
7.8%
U 9
 
3.7%
D 8
 
3.3%
R 7
 
2.9%
N 5
 
2.0%
K 5
 
2.0%
M 4
 
1.6%
L 3
 
1.2%
Other values (11) 17
 
7.0%
Common
ValueCountFrequency (%)
421
53.3%
2 170
21.5%
5 126
 
15.9%
4 34
 
4.3%
) 11
 
1.4%
( 11
 
1.4%
1 10
 
1.3%
7 2
 
0.3%
. 1
 
0.1%
3 1
 
0.1%
Other values (3) 3
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4826
82.4%
ASCII 1034
 
17.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
421
40.7%
2 170
16.4%
5 126
 
12.2%
S 85
 
8.2%
G 82
 
7.9%
4 34
 
3.3%
C 19
 
1.8%
) 11
 
1.1%
( 11
 
1.1%
1 10
 
1.0%
Other values (24) 65
 
6.3%
Hangul
ValueCountFrequency (%)
408
 
8.5%
232
 
4.8%
224
 
4.6%
214
 
4.4%
207
 
4.3%
173
 
3.6%
123
 
2.5%
117
 
2.4%
114
 
2.4%
108
 
2.2%
Other values (330) 2906
60.2%

주소
Text

Distinct663
Distinct (%)93.9%
Missing0
Missing (%)0.0%
Memory size5.6 KiB
2024-04-06T17:51:39.338540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length40
Mean length21.655807
Min length12

Characters and Unicode

Total characters15289
Distinct characters176
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique623 ?
Unique (%)88.2%

Sample

1st row서울 구로구 구로동로7길 37
2nd row서울 구로구 구로동로 34
3rd row서울특별시 구로구 남부순환로105길 58 (가리봉동)
4th row서울특별시 구로구 디지털로 221 (가리봉동)
5th row서울특별시 구로구 구로동로5길 11 (가리봉동)
ValueCountFrequency (%)
구로구 706
21.3%
서울 360
 
10.9%
서울특별시 346
 
10.5%
구로동 138
 
4.2%
개봉동 53
 
1.6%
고척동 47
 
1.4%
경인로 40
 
1.2%
오류동 37
 
1.1%
구로동로 25
 
0.8%
가리봉동 21
 
0.6%
Other values (611) 1535
46.4%
2024-04-06T17:51:40.953711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2656
17.4%
1718
 
11.2%
1692
 
11.1%
722
 
4.7%
706
 
4.6%
1 596
 
3.9%
483
 
3.2%
474
 
3.1%
2 424
 
2.8%
351
 
2.3%
Other values (166) 5467
35.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9144
59.8%
Space Separator 2656
 
17.4%
Decimal Number 2645
 
17.3%
Close Punctuation 346
 
2.3%
Open Punctuation 346
 
2.3%
Dash Punctuation 88
 
0.6%
Other Punctuation 55
 
0.4%
Uppercase Letter 7
 
< 0.1%
Lowercase Letter 1
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1718
18.8%
1692
18.5%
722
 
7.9%
706
 
7.7%
483
 
5.3%
474
 
5.2%
351
 
3.8%
347
 
3.8%
346
 
3.8%
127
 
1.4%
Other values (144) 2178
23.8%
Decimal Number
ValueCountFrequency (%)
1 596
22.5%
2 424
16.0%
3 305
11.5%
5 227
 
8.6%
8 209
 
7.9%
4 205
 
7.8%
0 182
 
6.9%
6 182
 
6.9%
7 180
 
6.8%
9 135
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
G 2
28.6%
L 2
28.6%
I 1
14.3%
B 1
14.3%
J 1
14.3%
Space Separator
ValueCountFrequency (%)
2656
100.0%
Close Punctuation
ValueCountFrequency (%)
) 346
100.0%
Open Punctuation
ValueCountFrequency (%)
( 346
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 88
100.0%
Other Punctuation
ValueCountFrequency (%)
, 55
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9144
59.8%
Common 6137
40.1%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1718
18.8%
1692
18.5%
722
 
7.9%
706
 
7.7%
483
 
5.3%
474
 
5.2%
351
 
3.8%
347
 
3.8%
346
 
3.8%
127
 
1.4%
Other values (144) 2178
23.8%
Common
ValueCountFrequency (%)
2656
43.3%
1 596
 
9.7%
2 424
 
6.9%
) 346
 
5.6%
( 346
 
5.6%
3 305
 
5.0%
5 227
 
3.7%
8 209
 
3.4%
4 205
 
3.3%
0 182
 
3.0%
Other values (6) 641
 
10.4%
Latin
ValueCountFrequency (%)
G 2
25.0%
L 2
25.0%
e 1
12.5%
I 1
12.5%
B 1
12.5%
J 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9144
59.8%
ASCII 6145
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2656
43.2%
1 596
 
9.7%
2 424
 
6.9%
) 346
 
5.6%
( 346
 
5.6%
3 305
 
5.0%
5 227
 
3.7%
8 209
 
3.4%
4 205
 
3.3%
0 182
 
3.0%
Other values (12) 649
 
10.6%
Hangul
ValueCountFrequency (%)
1718
18.8%
1692
18.5%
722
 
7.9%
706
 
7.7%
483
 
5.3%
474
 
5.2%
351
 
3.8%
347
 
3.8%
346
 
3.8%
127
 
1.4%
Other values (144) 2178
23.8%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size5.6 KiB
2024-03-27
706 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-03-27
2nd row2024-03-27
3rd row2024-03-27
4th row2024-03-27
5th row2024-03-27

Common Values

ValueCountFrequency (%)
2024-03-27 706
100.0%

Length

2024-04-06T17:51:41.331975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:51:41.628114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-03-27 706
100.0%

Interactions

2024-04-06T17:51:33.363640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:51:41.822882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번행정동
연번1.0000.969
행정동0.9691.000
2024-04-06T17:51:42.042284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번행정동
연번1.0000.854
행정동0.8541.000

Missing values

2024-04-06T17:51:33.731666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:51:34.006654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번행정동상점명주소데이터기준일자
01가리봉동남구로슈퍼서울 구로구 구로동로7길 372024-03-27
12가리봉동영등포농협 하나로마트 남구로역점서울 구로구 구로동로 342024-03-27
23가리봉동지에스25 디지털오거리점서울특별시 구로구 남부순환로105길 58 (가리봉동)2024-03-27
34가리봉동씨유 디지털오거리점서울특별시 구로구 디지털로 221 (가리봉동)2024-03-27
45가리봉동고향마트서울특별시 구로구 구로동로5길 11 (가리봉동)2024-03-27
56가리봉동세븐일레븐 가리봉중앙점서울특별시 구로구 구로동로 16 (가리봉동)2024-03-27
67가리봉동씨유 공단오거리점서울특별시 구로구 남부순환로105길 52 (가리봉동)2024-03-27
78가리봉동이마트24 구로스카이점서울특별시 구로구 구로동로7길 100 (가리봉동)2024-03-27
89가리봉동까르푸 중한식품서울특별시 구로구 우마길 10 (가리봉동)2024-03-27
910가리봉동희용슈퍼서울특별시 구로구 구로동로2길 6 (가리봉동)2024-03-27
연번행정동상점명주소데이터기준일자
696697항동세븐일레븐 항동SK점서울특별시 구로구 부광로 88 (항동)2024-03-27
697698항동이마트에브리데이 구로항동점서울특별시 구로구 서해안로 2124 (항동)2024-03-27
698699항동하버그린마켓서울특별시 구로구 서해안로 2036 (항동, 하버라인8단지)2024-03-27
699700항동세븐일레븐 항동하버라인점서울특별시 구로구 연동로11길 26 (항동)2024-03-27
700701항동씨유항동7단지점서울특별시 구로구 부광로 96-16 (항동, 항동 제일풍경채 포레스트)2024-03-27
701702항동CU항동10단지점서울특별시 구로구 항동로 60 (항동, 하버라인 10단지)2024-03-27
702703항동씨유항동3단지점서울특별시 구로구 항동로3길 6 (항동)2024-03-27
703704항동지에스25 구로솔보점서울특별시 구로구 서해안로 2102 (항동)2024-03-27
704705항동GS THE FRESH 항동에듀힐즈점서울특별시 구로구 항동로 42 (항동, 한양수자인 에듀힐즈)2024-03-27
705706항동씨유 항동2단지점서울특별시 구로구 연동로 234 (항동, 하버라인 2단지)2024-03-27