Overview

Dataset statistics

Number of variables4
Number of observations425
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.8 KiB
Average record size in memory33.3 B

Variable types

Numeric1
Categorical2
Text1

Dataset

Description이 데이터는 서울특별시 동작구 관내에 있는 의류수거함 위치 데이터입니다. 이 데이터에는 의류수거함이 있는 주소 등이 포함되어 있습니다.
URLhttps://www.data.go.kr/data/15068021/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 행정동High correlation
행정동 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:44:11.350027
Analysis finished2023-12-12 12:44:11.700841
Duration0.35 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct425
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean213
Minimum1
Maximum425
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.9 KiB
2023-12-12T21:44:11.769065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile22.2
Q1107
median213
Q3319
95-th percentile403.8
Maximum425
Range424
Interquartile range (IQR)212

Descriptive statistics

Standard deviation122.83118
Coefficient of variation (CV)0.57667223
Kurtosis-1.2
Mean213
Median Absolute Deviation (MAD)106
Skewness0
Sum90525
Variance15087.5
MonotonicityStrictly increasing
2023-12-12T21:44:11.904313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
293 1
 
0.2%
291 1
 
0.2%
290 1
 
0.2%
289 1
 
0.2%
288 1
 
0.2%
287 1
 
0.2%
286 1
 
0.2%
285 1
 
0.2%
284 1
 
0.2%
Other values (415) 415
97.6%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
425 1
0.2%
424 1
0.2%
423 1
0.2%
422 1
0.2%
421 1
0.2%
420 1
0.2%
419 1
0.2%
418 1
0.2%
417 1
0.2%
416 1
0.2%

행정동
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
상도3동
54 
상도4동
50 
상도1동
45 
대방동
38 
노량진1동
37 
Other values (10)
201 

Length

Max length5
Median length4
Mean length4.0588235
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신대방1동
2nd row신대방1동
3rd row신대방1동
4th row신대방1동
5th row신대방1동

Common Values

ValueCountFrequency (%)
상도3동 54
12.7%
상도4동 50
11.8%
상도1동 45
10.6%
대방동 38
8.9%
노량진1동 37
8.7%
신대방1동 27
 
6.4%
흑석동 27
 
6.4%
상도2동 25
 
5.9%
사당3동 23
 
5.4%
사당4동 21
 
4.9%
Other values (5) 78
18.4%

Length

2023-12-12T21:44:12.051825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
상도3동 54
12.7%
상도4동 50
11.8%
상도1동 45
10.6%
대방동 38
8.9%
노량진1동 37
8.7%
신대방1동 27
 
6.4%
흑석동 27
 
6.4%
상도2동 25
 
5.9%
사당3동 23
 
5.4%
사당4동 21
 
4.9%
Other values (5) 78
18.4%

주소
Text

Distinct419
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
2023-12-12T21:44:12.296276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length34
Mean length20.442353
Min length15

Characters and Unicode

Total characters8688
Distinct characters99
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique413 ?
Unique (%)97.2%

Sample

1st row서울특별시 동작구 신대방 1다길 신대방지구대옆
2nd row서울특별시 동작구 신대방 1길 116-1
3rd row서울특별시 동작구 신대방 1길 96
4th row서울특별시 동작구 신대방 1길 72
5th row서울특별시 동작구 신대방 1길 40
ValueCountFrequency (%)
동작구 425
22.1%
서울특별시 388
20.2%
사당로 66
 
3.4%
상도로 60
 
3.1%
대방동 40
 
2.1%
특별시 37
 
1.9%
서울 37
 
1.9%
양녕로 27
 
1.4%
신대방 21
 
1.1%
성대로 21
 
1.1%
Other values (416) 797
41.5%
2023-12-12T21:44:12.717562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1534
17.7%
504
 
5.8%
441
 
5.1%
427
 
4.9%
427
 
4.9%
425
 
4.9%
425
 
4.9%
425
 
4.9%
425
 
4.9%
351
 
4.0%
Other values (89) 3304
38.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5351
61.6%
Decimal Number 1660
 
19.1%
Space Separator 1534
 
17.7%
Dash Punctuation 97
 
1.1%
Open Punctuation 20
 
0.2%
Close Punctuation 20
 
0.2%
Lowercase Letter 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
504
9.4%
441
 
8.2%
427
 
8.0%
427
 
8.0%
425
 
7.9%
425
 
7.9%
425
 
7.9%
425
 
7.9%
351
 
6.6%
309
 
5.8%
Other values (72) 1192
22.3%
Decimal Number
ValueCountFrequency (%)
1 338
20.4%
2 259
15.6%
3 224
13.5%
6 162
9.8%
5 141
8.5%
4 138
8.3%
9 102
 
6.1%
8 101
 
6.1%
7 100
 
6.0%
0 95
 
5.7%
Lowercase Letter
ValueCountFrequency (%)
s 2
33.3%
e 2
33.3%
t 2
33.3%
Space Separator
ValueCountFrequency (%)
1534
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 97
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5351
61.6%
Common 3331
38.3%
Latin 6
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
504
9.4%
441
 
8.2%
427
 
8.0%
427
 
8.0%
425
 
7.9%
425
 
7.9%
425
 
7.9%
425
 
7.9%
351
 
6.6%
309
 
5.8%
Other values (72) 1192
22.3%
Common
ValueCountFrequency (%)
1534
46.1%
1 338
 
10.1%
2 259
 
7.8%
3 224
 
6.7%
6 162
 
4.9%
5 141
 
4.2%
4 138
 
4.1%
9 102
 
3.1%
8 101
 
3.0%
7 100
 
3.0%
Other values (4) 232
 
7.0%
Latin
ValueCountFrequency (%)
s 2
33.3%
e 2
33.3%
t 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5351
61.6%
ASCII 3337
38.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1534
46.0%
1 338
 
10.1%
2 259
 
7.8%
3 224
 
6.7%
6 162
 
4.9%
5 141
 
4.2%
4 138
 
4.1%
9 102
 
3.1%
8 101
 
3.0%
7 100
 
3.0%
Other values (7) 238
 
7.1%
Hangul
ValueCountFrequency (%)
504
9.4%
441
 
8.2%
427
 
8.0%
427
 
8.0%
425
 
7.9%
425
 
7.9%
425
 
7.9%
425
 
7.9%
351
 
6.6%
309
 
5.8%
Other values (72) 1192
22.3%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.4 KiB
2023-08-22
425 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-22
2nd row2023-08-22
3rd row2023-08-22
4th row2023-08-22
5th row2023-08-22

Common Values

ValueCountFrequency (%)
2023-08-22 425
100.0%

Length

2023-12-12T21:44:13.201865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:44:13.328119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-22 425
100.0%

Interactions

2023-12-12T21:44:11.485932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:44:13.405431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번행정동
연번1.0000.992
행정동0.9921.000
2023-12-12T21:44:13.494247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번행정동
연번1.0000.903
행정동0.9031.000

Missing values

2023-12-12T21:44:11.586202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:44:11.668638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번행정동주소데이터기준일자
01신대방1동서울특별시 동작구 신대방 1다길 신대방지구대옆2023-08-22
12신대방1동서울특별시 동작구 신대방 1길 116-12023-08-22
23신대방1동서울특별시 동작구 신대방 1길 962023-08-22
34신대방1동서울특별시 동작구 신대방 1길 722023-08-22
45신대방1동서울특별시 동작구 신대방 1길 402023-08-22
56신대방1동서울특별시 동작구 신대방 1가길 42023-08-22
67신대방1동서울특별시 동작구 신대방 1길 242023-08-22
78신대방1동서울특별시 동작구 대림로 44 서광전기 앞2023-08-22
89신대방1동서울특별시 동작구 신대방 11길 232023-08-22
910신대방1동서울특별시 동작구 신대방 7길 42023-08-22
연번행정동주소데이터기준일자
415416사당5동서울특별시 동작구 사당로 2라길 832023-08-22
416417사당5동서울특별시 동작구 사당로 2자길 182023-08-22
417418사당5동서울특별시 동작구 사당로 2자길 139(2set)2023-08-22
418419사당5동서울특별시 동작구 사당로 2자길 492023-08-22
419420사당5동서울특별시 동작구 사당로 8나길 242023-08-22
420421사당5동서울특별시 동작구 사당로 8나길 122023-08-22
421422사당5동서울특별시 동작구 사당로 8길 82023-08-22
422423사당5동서울특별시 동작구 사당로 10길 182023-08-22
423424사당5동서울특별시 동작구 사당로 16가길 712023-08-22
424425사당5동서울특별시 동작구 사당로 2차길 22023-08-22