Overview

Dataset statistics

Number of variables: 6
Number of observations: 349
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 17.2 KiB
Average record size in memory: 50.4 B

Variable types

Numeric: 2
Categorical: 2
Text: 2

Dataset

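Description: Status of the waste fluorescent lamp and waste battery collection boxes in Songpa-gu, including the administrative dong, address, location, and whether each site has a waste fluorescent lamp box and a waste battery box.
URL: https://www.data.go.kr/data/15038206/fileData.do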

Alerts

연번 is highly overall correlated with 동명 (High correlation)
박스수량 is highly overall correlated with 동명 (High correlation)
동명 is highly overall correlated with 연번 and 1 other field (High correlation)
구분 is highly imbalanced (67.8%) (Imbalance)
연번 has unique values (Unique)
세부위치(건물명 또는 상호) has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 20:55:42.142366
Analysis finished: 2023-12-12 20:55:43.102892
Duration: 0.96 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
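
The reported duration is the difference between the two analysis timestamps. A minimal Python sketch of that arithmetic follows; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-12 20:55:42.142366")
finished = datetime.fromisoformat("2023-12-12 20:55:43.102892")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```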

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 349
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 175
Minimum: 1
Maximum: 349
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 18.4
Q1: 88
Median: 175
Q3: 262
95-th percentile: 331.6
Maximum: 349
Range: 348
Interquartile range (IQR): 174
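
These quantiles are consistent with linear interpolation between closest ranks. The sketch below assumes that 연번 is exactly the sequence 1..349 (suggested by the unique, strictly increasing values with minimum 1 and maximum 349); the `quantile` helper is hypothetical and written only for this illustration.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two closest ranks."""
    pos = (len(sorted_values) - 1) * q
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * frac

# Assumption: 연번 takes every integer from 1 to 349 exactly once.
serial = list(range(1, 350))

p05 = quantile(serial, 0.05)
q1 = quantile(serial, 0.25)
median = quantile(serial, 0.50)
q3 = quantile(serial, 0.75)
p95 = quantile(serial, 0.95)
iqr = q3 - q1
value_range = serial[-1] - serial[0]
```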

Descriptive statistics

Standard deviation: 100.89186
Coefficient of variation (CV): 0.57652489
Kurtosis: -1.2
Mean: 175
Median Absolute Deviation (MAD): 87
Skewness: 0
Sum: 61075
Variance: 10179.167
Monotonicity: Strictly increasing
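
For a serial number running 1..n, the sum is n(n+1)/2 and the sample variance is n(n+1)/12, which gives 61075 and about 10179.167 for n = 349. The sketch below recomputes the figures with Python's `statistics` module, assuming that 연번 is exactly the sequence 1..349.

```python
import statistics

# Assumption: 연번 is the sequence 1..349.
serial = range(1, 350)

total = sum(serial)
mean = statistics.mean(serial)
variance = statistics.variance(serial)  # sample variance (n - 1 denominator)
std = statistics.stdev(serial)
cv = std / mean                         # coefficient of variation

# Median absolute deviation: median of |x - median|.
median = statistics.median(serial)
mad = statistics.median(abs(x - median) for x in serial)
```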
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
1                   1      0.3%
231                 1      0.3%
239                 1      0.3%
238                 1      0.3%
237                 1      0.3%
236                 1      0.3%
235                 1      0.3%
234                 1      0.3%
233                 1      0.3%
232                 1      0.3%
Other values (339)  339    97.1%

Value  Count  Frequency (%)
1      1      0.3%
2      1      0.3%
3      1      0.3%
4      1      0.3%
5      1      0.3%
6      1      0.3%
7      1      0.3%
8      1      0.3%
9      1      0.3%
10     1      0.3%

Value  Count  Frequency (%)
349    1      0.3%
348    1      0.3%
347    1      0.3%
346    1      0.3%
345    1      0.3%
344    1      0.3%
343    1      0.3%
342    1      0.3%
341    1      0.3%
340    1      0.3%

동명
Categorical

HIGH CORRELATION 

Distinct: 27
Distinct (%): 7.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

위례동: 60
가락1동: 50
장지동: 50
잠실3동: 37
가락2동: 14
Other values (22): 138

Length

Max length: 4
Median length: 4
Mean length: 3.6389685
Min length: 3

Unique

Unique: 4
Unique (%): 1.1%

Sample

1st row: 가락1동
2nd row: 가락1동
3rd row: 가락1동
4th row: 가락1동
5th row: 가락1동

Common Values

Value              Count  Frequency (%)
위례동             60     17.2%
가락1동            50     14.3%
장지동             50     14.3%
잠실3동            37     10.6%
가락2동            14     4.0%
송파2동            14     4.0%
가락본동           13     3.7%
오금동             12     3.4%
풍납2동            12     3.4%
문정2동            10     2.9%
Other values (17)  77     22.1%
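
Each Frequency (%) value is the row count divided by the 349 observations. A small Python sketch that recomputes the column from the counts listed for 동명 (the dictionary and variable names are illustrative):

```python
TOTAL_ROWS = 349  # number of observations from the overview

# Counts copied from the Common Values table for 동명.
counts = {
    "위례동": 60, "가락1동": 50, "장지동": 50, "잠실3동": 37,
    "가락2동": 14, "송파2동": 14, "가락본동": 13, "오금동": 12,
    "풍납2동": 12, "문정2동": 10, "Other values (17)": 77,
}

# The listed counts should account for every row.
covered_rows = sum(counts.values())

# Share of all rows per value, rounded to one decimal as in the report.
shares = {name: round(100 * c / TOTAL_ROWS, 1) for name, c in counts.items()}
```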

Length

Histogram of lengths of the category

주소
Text

Distinct: 169
Distinct (%): 48.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

Length

Max length: 22
Median length: 21
Mean length: 17.272206
Min length: 12

Characters and Unicode

Total characters: 6028
Distinct characters: 72
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 140
Unique (%): 40.1%

Sample

1st row: 서울특별시 송파구 송파대로345
2nd row: 서울특별시 송파구 송파대로345
3rd row: 서울특별시 송파구 송파대로345
4th row: 서울특별시 송파구 송파대로345
5th row: 서울특별시 송파구 송파대로345

Value               Count  Frequency (%)
송파구              347    26.5%
서울특별시          260    19.9%
위례광장로          60     4.6%
송파대로345         49     3.7%
서울시              47     3.6%
서울                41     3.1%
잠실로62            36     2.8%
121                 18     1.4%
충민로4길           14     1.1%
215                 14     1.1%
Other values (194)  423    32.3%

Most occurring characters

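Value              Count  Frequency (%)
(space)            965    16.0%
[unreadable]       443    7.3%
[unreadable]       428    7.1%
[unreadable]       348    5.8%
[unreadable]       348    5.8%
[unreadable]       347    5.8%
[unreadable]       346    5.7%
[unreadable]       307    5.1%
[unreadable]       260    4.3%
[unreadable]       260    4.3%
Other values (62)  1976   32.8%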
32.8%

Most occurring categories

Value             Count  Frequency (%)
Other Letter      4002   66.4%
Decimal Number    1054   17.5%
Space Separator   965    16.0%
Dash Punctuation  7      0.1%
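
The category names above are Unicode general categories (Lo, Nd, Zs, Pd). The sketch below tallies them for the first sample row of 주소 with Python's standard `unicodedata` module; the report itself may rely on a different library, and the `CATEGORY_NAMES` mapping is an illustrative helper, not part of the report.

```python
import unicodedata
from collections import Counter

# General-category codes mapped to the long names shown in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
}

address = "서울특별시 송파구 송파대로345"  # first sample row of 주소

# Count characters per category, falling back to the raw code if unmapped.
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for ch in address
)
print(category_counts)
```

Hangul syllables fall under Other Letter, which is why that category dominates an address column written in Korean.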

Most frequent character per category

Other Letter
Value              Count  Frequency (%)
[unreadable]       443    11.1%
[unreadable]       428    10.7%
[unreadable]       348    8.7%
[unreadable]       348    8.7%
[unreadable]       347    8.7%
[unreadable]       346    8.6%
[unreadable]       307    7.7%
[unreadable]       260    6.5%
[unreadable]       260    6.5%
[unreadable]       127    3.2%
Other values (50)  788    19.7%

Decimal Number
Value  Count  Frequency (%)
1      201    19.1%
2      199    18.9%
4      136    12.9%
3      129    12.2%
5      118    11.2%
6      97     9.2%
8      64     6.1%
0      44     4.2%
7      38     3.6%
9      28     2.7%

Space Separator
Value    Count  Frequency (%)
(space)  965    100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      7      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  4002   66.4%
Common  2026   33.6%

Most frequent character per script

Hangul
Value              Count  Frequency (%)
[unreadable]       443    11.1%
[unreadable]       428    10.7%
[unreadable]       348    8.7%
[unreadable]       348    8.7%
[unreadable]       347    8.7%
[unreadable]       346    8.6%
[unreadable]       307    7.7%
[unreadable]       260    6.5%
[unreadable]       260    6.5%
[unreadable]       127    3.2%
Other values (50)  788    19.7%

Common
Value             Count  Frequency (%)
(space)           965    47.6%
1                 201    9.9%
2                 199    9.8%
4                 136    6.7%
3                 129    6.4%
5                 118    5.8%
6                 97     4.8%
8                 64     3.2%
0                 44     2.2%
7                 38     1.9%
Other values (2)  35     1.7%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  4002   66.4%
ASCII   2026   33.6%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
(space)           965    47.6%
1                 201    9.9%
2                 199    9.8%
4                 136    6.7%
3                 129    6.4%
5                 118    5.8%
6                 97     4.8%
8                 64     3.2%
0                 44     2.2%
7                 38     1.9%
Other values (2)  35     1.7%

Hangul
Value              Count  Frequency (%)
[unreadable]       443    11.1%
[unreadable]       428    10.7%
[unreadable]       348    8.7%
[unreadable]       348    8.7%
[unreadable]       347    8.7%
[unreadable]       346    8.6%
[unreadable]       307    7.7%
[unreadable]       260    6.5%
[unreadable]       260    6.5%
[unreadable]       127    3.2%
Other values (50)  788    19.7%

구분
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

공동주택: 318
주민센터: 22
주택가: 9

Length

Max length: 4
Median length: 4
Mean length: 3.974212
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공동주택
2nd row: 공동주택
3rd row: 공동주택
4th row: 공동주택
5th row: 공동주택

Common Values

Value     Count  Frequency (%)
공동주택  318    91.1%
주민센터  22     6.3%
주택가    9      2.6%
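
The 67.8% imbalance figure in the alerts can be reproduced from these three counts as one minus the Shannon entropy normalized by its maximum, log(k) for k classes. The sketch below assumes that definition; the function name `imbalance_score` is illustrative and is not claimed to be the library's own API.

```python
import math

# Counts copied from the Common Values table for 구분.
counts = {"공동주택": 318, "주민센터": 22, "주택가": 9}

def imbalance_score(class_counts):
    """Return 1 - H/log(k): 0 for a uniform split, near 1 when one class dominates."""
    total = sum(class_counts)
    k = len(class_counts)
    if k < 2:
        return 0.0  # convention chosen for this sketch: one class, nothing to compare
    entropy = -sum((c / total) * math.log(c / total) for c in class_counts if c > 0)
    return 1 - entropy / math.log(k)

score = imbalance_score(list(counts.values()))
print(f"{score:.1%}")
```

Because the entropy is divided by log(k), the result does not depend on the logarithm base.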

Length

Histogram of lengths of the category

Common Values (Plot)


세부위치(건물명 또는 상호)
Text

UNIQUE

Distinct: 349
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

Length

Max length: 53
Median length: 27
Mean length: 15.773639
Min length: 5

Characters and Unicode

Total characters: 5505
Distinct characters: 215
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 349
Unique (%): 100.0%

Sample

1st row: 송파헬리오시티 101동 재활용장
2nd row: 송파헬리오시티 105동 재활용장
3rd row: 송파헬리오시티 108동 재활용장
4th row: 송파헬리오시티 110 재활용장
5th row: 송파헬리오시티 112동 재활용장

Value                 Count  Frequency (%)
[unreadable]          85     9.2%
송파헬리오시티        49     5.3%
재활용장              49     5.3%
재활용품              37     4.0%
처리장                35     3.8%
3단지                 26     2.8%
관리사무소            19     2.1%
송파꿈에그린(24단지   17     1.8%
[unreadable]          17     1.8%
위례스타힐스          14     1.5%
Other values (428)    574    62.3%

Most occurring characters

Value               Count  Frequency (%)
(space)             583    10.6%
1                   252    4.6%
[unreadable]        250    4.5%
[unreadable]        224    4.1%
[unreadable]        203    3.7%
2                   192    3.5%
0                   171    3.1%
[unreadable]        146    2.7%
[unreadable]        139    2.5%
[unreadable]        135    2.5%
Other values (205)  3210   58.3%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       3723   67.6%
Decimal Number     975    17.7%
Space Separator    583    10.6%
Open Punctuation   85     1.5%
Close Punctuation  85     1.5%
Other Punctuation  29     0.5%
Math Symbol        9      0.2%
Dash Punctuation   8      0.1%
Uppercase Letter   8      0.1%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
[unreadable]        250    6.7%
[unreadable]        224    6.0%
[unreadable]        203    5.5%
[unreadable]        146    3.9%
[unreadable]        139    3.7%
[unreadable]        135    3.6%
[unreadable]        122    3.3%
[unreadable]        107    2.9%
[unreadable]        98     2.6%
[unreadable]        95     2.6%
Other values (182)  2204   59.2%

Decimal Number
Value  Count  Frequency (%)
1      252    25.8%
2      192    19.7%
0      171    17.5%
4      98     10.1%
3      89     9.1%
5      70     7.2%
6      29     3.0%
8      28     2.9%
7      26     2.7%
9      20     2.1%

Uppercase Letter
Value  Count  Frequency (%)
K      3      37.5%
C      2      25.0%
N      1      12.5%
P      1      12.5%
S      1      12.5%

Other Punctuation
Value  Count  Frequency (%)
,      28     96.6%
.      1      3.4%

Math Symbol
Value  Count  Frequency (%)
~      7      77.8%
>      2      22.2%

Space Separator
Value    Count  Frequency (%)
(space)  583    100.0%

Open Punctuation
Value  Count  Frequency (%)
(      85     100.0%

Close Punctuation
Value  Count  Frequency (%)
)      85     100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      8      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  3723   67.6%
Common  1774   32.2%
Latin   8      0.1%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
[unreadable]        250    6.7%
[unreadable]        224    6.0%
[unreadable]        203    5.5%
[unreadable]        146    3.9%
[unreadable]        139    3.7%
[unreadable]        135    3.6%
[unreadable]        122    3.3%
[unreadable]        107    2.9%
[unreadable]        98     2.6%
[unreadable]        95     2.6%
Other values (182)  2204   59.2%

Common
Value             Count  Frequency (%)
(space)           583    32.9%
1                 252    14.2%
2                 192    10.8%
0                 171    9.6%
4                 98     5.5%
3                 89     5.0%
(                 85     4.8%
)                 85     4.8%
5                 70     3.9%
6                 29     1.6%
Other values (8)  120    6.8%

Latin
Value  Count  Frequency (%)
K      3      37.5%
C      2      25.0%
N      1      12.5%
P      1      12.5%
S      1      12.5%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  3723   67.6%
ASCII   1782   32.4%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
(space)            583    32.7%
1                  252    14.1%
2                  192    10.8%
0                  171    9.6%
4                  98     5.5%
3                  89     5.0%
(                  85     4.8%
)                  85     4.8%
5                  70     3.9%
6                  29     1.6%
Other values (13)  128    7.2%

Hangul
Value               Count  Frequency (%)
[unreadable]        250    6.7%
[unreadable]        224    6.0%
[unreadable]        203    5.5%
[unreadable]        146    3.9%
[unreadable]        139    3.7%
[unreadable]        135    3.6%
[unreadable]        122    3.3%
[unreadable]        107    2.9%
[unreadable]        98     2.6%
[unreadable]        95     2.6%
Other values (182)  2204   59.2%

박스수량
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9
Distinct (%): 2.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.2836676
Minimum: 1
Maximum: 30
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 1
95-th percentile: 2
Maximum: 30
Range: 29
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.8699967
Coefficient of variation (CV): 1.4567608
Kurtosis: 171.9559
Mean: 1.2836676
Median Absolute Deviation (MAD): 0
Skewness: 12.21584
Sum: 448
Variance: 3.4968877
Monotonicity: Not monotonic
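
Because 박스수량 has only nine distinct values, these statistics can be rebuilt from the value counts the report lists for this variable. The Python sketch below expands those counts into one value per observation and recomputes the main figures; the variable names are illustrative.

```python
import statistics

# Value -> count, copied from the frequency table reported for 박스수량.
frequency = {1: 318, 2: 17, 3: 7, 4: 2, 5: 1, 7: 1, 9: 1, 16: 1, 30: 1}

# Expand the table back into one value per observation.
boxes = [value for value, count in frequency.items() for _ in range(count)]

n = len(boxes)
total = sum(boxes)
mean = statistics.mean(boxes)
variance = statistics.variance(boxes)  # sample variance (n - 1 denominator)
std = statistics.stdev(boxes)
cv = std / mean                        # coefficient of variation
median = statistics.median(boxes)
```

With 318 of 349 rows equal to 1, the median and IQR collapse to 1 and 0, while the single site with 30 boxes drives the large skewness and kurtosis.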
Histogram with fixed size bins (bins=9)
Value  Count  Frequency (%)
1      318    91.1%
2      17     4.9%
3      7      2.0%
4      2      0.6%
9      1      0.3%
7      1      0.3%
30     1      0.3%
16     1      0.3%
5      1      0.3%

Value  Count  Frequency (%)
1      318    91.1%
2      17     4.9%
3      7      2.0%
4      2      0.6%
5      1      0.3%
7      1      0.3%
9      1      0.3%
16     1      0.3%
30     1      0.3%

Value  Count  Frequency (%)
30     1      0.3%
16     1      0.3%
9      1      0.3%
7      1      0.3%
5      1      0.3%
4      2      0.6%
3      7      2.0%
2      17     4.9%
1      318    91.1%

Interactions


Correlations

          연번    동명    구분    박스수량
연번      1.000  0.961  0.280  0.000
동명      0.961  1.000  0.516  0.905
구분      0.280  0.516  1.000  0.000
박스수량  0.000  0.905  0.000  1.000

        구분    동명
구분    1.000  0.270
동명    0.270  1.000

          연번    박스수량  동명    구분
연번      1.000  0.004    0.772  0.172
박스수량  0.004  1.000    0.690  0.000
동명      0.772  0.690    1.000  0.270
구분      0.172  0.000    0.270  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

     연번  동명     주소                             구분      세부위치(건물명 또는 상호)     박스수량
0    1     가락1동  서울특별시 송파구 송파대로345    공동주택  송파헬리오시티 101동 재활용장  1
1    2     가락1동  서울특별시 송파구 송파대로345    공동주택  송파헬리오시티 105동 재활용장  1
2    3     가락1동  서울특별시 송파구 송파대로345    공동주택  송파헬리오시티 108동 재활용장  1
3    4     가락1동  서울특별시 송파구 송파대로345    공동주택  송파헬리오시티 110 재활용장    1
4    5     가락1동  서울특별시 송파구 송파대로345    공동주택  송파헬리오시티 112동 재활용장  1
5    6     가락1동  서울특별시 송파구 송파대로345    공동주택  송파헬리오시티 201동 재활용장  1
6    7     가락1동  서울특별시 송파구 송파대로345    공동주택  송파헬리오시티 205동 재활용장  1
7    8     가락1동  서울특별시 송파구 송파대로345    공동주택  송파헬리오시티 206동 재활용장  1
8    9     가락1동  서울특별시 송파구 송파대로345    공동주택  송파헬리오시티 207동 재활용장  1
9    10    가락1동  서울특별시 송파구 송파대로345    공동주택  송파헬리오시티 208동 재활용장  1

     연번  동명     주소                             구분      세부위치(건물명 또는 상호)     박스수량
339  340   풍납2동  서울특별시 송파구 올림픽로 525   공동주택  풍납현대아파트                 3
340  341   풍납2동  서울특별시 송파구 풍성로24길 26  공동주택  현대리버빌1차                  1
341  342   풍납2동  서울특별시 송파구 풍성로24길 38  공동주택  현대리버빌2차                  1
342  343   풍납2동  서울특별시 송파구 풍성로6가길 7   공동주택  KNP상상아파트                  1
343  344   풍납2동  서울특별시 송파구 올림픽로47길 9  공동주택  쌍용아파트                     1
344  345   풍납2동  서울특별시 송파구 강동대로9길 8   공동주택  극동아파트                     1
345  346   풍납2동  서울특별시 송파구 풍성로6길 10   공동주택  풍납미성아파트                 1
346  347   풍납2동  서울특별시 송파구 풍성로26길 31  공동주택  송파힐스테이트아파프           1
347  348   풍납2동  서울특별시 송파구 한가람로 414   공동주택  갑을아파트                     1
348  349   풍납2동  서울특별시 송파구 풍성로14길 19  공동주택  토성현대아파트                 1