Overview

Dataset statistics

Number of variables: 7
Number of observations: 2998
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 172
Duplicate rows (%): 5.7%
Total size in memory: 169.9 KiB
Average record size in memory: 58.0 B
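
The two derived figures in this table follow from the raw counts. The sketch below is a quick sanity check, not the profiler's own code; it assumes that percentages are taken relative to the number of observations and that 1 KiB equals 1024 bytes.

```python
# Raw figures copied from the Dataset statistics table.
n_observations = 2998
n_duplicate_rows = 172
total_size_kib = 169.9

# Assumption: the percentage is relative to the number of observations.
duplicate_pct = 100 * n_duplicate_rows / n_observations

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both values round to the figures reported above under these assumptions.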

Variable types

DateTime: 2
Numeric: 2
Text: 1
Categorical: 2

Dataset

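Description: Data on bulky waste collection in Siheung-si, Gyeonggi-do (경기도 시흥시). It provides fields such as collection date, item count, road-name address of the collection site, administrative district, managing agency, and phone number.
Author: 경기도 시흥시 (Siheung-si, Gyeonggi-do)
URL: https://www.data.go.kr/data/15096473/fileData.do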

Alerts

관리기관 has constant value "경기도 시흥시청" (Constant)
전화번호 has constant value "031-310-2254" (Constant)
데이터기준일자 has constant value "2023-12-08" (Constant)
Dataset has 172 (5.7%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2023-12-23 07:57:29.482816
Analysis finished: 2023-12-23 07:57:33.622398
Duration: 4.14 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

수거일자
Date

Distinct: 123
Distinct (%): 4.1%
Missing: 0
Missing (%): 0.0%
Memory size: 23.6 KiB
Minimum: 2022-08-31 00:00:00
Maximum: 2022-12-31 00:00:00
Histogram with fixed size bins (bins=50)

개수
Real number (ℝ)

Distinct: 16
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.5890594
Minimum: 1
Maximum: 19
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 26.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 1
Q3: 2
95-th percentile: 4
Maximum: 19
Range: 18
Interquartile range (IQR): 1
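
These quantiles can be recovered from the value counts listed for 개수 in this report. The sketch below assumes linear interpolation between neighbouring ranks (the report does not state which quantile method it uses); the `quantile` helper is a hypothetical name introduced for illustration.

```python
# Value -> count pairs for 개수, copied from the frequency tables in this report.
counts = {1: 2087, 2: 566, 3: 174, 4: 70, 5: 40, 6: 12, 7: 13, 8: 8,
          9: 8, 10: 3, 11: 7, 12: 4, 13: 3, 15: 1, 16: 1, 19: 1}

# Expand the counts into the sorted list of all observations.
values = sorted(v for v, c in counts.items() for _ in range(c))

def quantile(sorted_values, q):
    """Quantile with linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])

p5, q1, median, q3, p95 = (quantile(values, q) for q in (0.05, 0.25, 0.50, 0.75, 0.95))
value_range = values[-1] - values[0]
iqr = q3 - q1

print(p5, q1, median, q3, p95)
print("Range:", value_range, "IQR:", iqr)
```

Because about 70% of the observations equal 1, the 5th percentile, Q1, and the median all coincide with the minimum.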

Descriptive statistics

Standard deviation: 1.3872628
Coefficient of variation (CV): 0.87300882
Kurtosis: 32.853978
Mean: 1.5890594
Median Absolute Deviation (MAD): 0
Skewness: 4.8019744
Sum: 4764
Variance: 1.9244982
Monotonicity: Not monotonic
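
The sum, mean, variance, standard deviation, and coefficient of variation can be reproduced from the value counts listed for 개수 in this report. This is a minimal sketch; it assumes the sample variance (dividing by n - 1), which is the convention that matches the reported figure.

```python
import math

# Value -> count pairs for 개수, copied from the frequency tables in this report.
counts = {1: 2087, 2: 566, 3: 174, 4: 70, 5: 40, 6: 12, 7: 13, 8: 8,
          9: 8, 10: 3, 11: 7, 12: 4, 13: 3, 15: 1, 16: 1, 19: 1}

n = sum(counts.values())                         # number of observations
total = sum(v * c for v, c in counts.items())    # Sum
mean = total / n

# Assumption: sample variance with n - 1 in the denominator.
variance = sum(c * (v - mean) ** 2 for v, c in counts.items()) / (n - 1)
std = math.sqrt(variance)

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean

print(n, total)
print(f"mean={mean:.7f} variance={variance:.7f} std={std:.7f} cv={cv:.8f}")
```

A CV close to 0.87 reflects the long right tail: most collections involve a single item, while a handful involve ten or more.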
Histogram with fixed size bins (bins=16)

Common values

Value             Count  Frequency (%)
1                 2087   69.6%
2                 566    18.9%
3                 174    5.8%
4                 70     2.3%
5                 40     1.3%
7                 13     0.4%
6                 12     0.4%
9                 8      0.3%
8                 8      0.3%
11                7      0.2%
Other values (6)  13     0.4%

Minimum 10 values

Value  Count  Frequency (%)
1      2087   69.6%
2      566    18.9%
3      174    5.8%
4      70     2.3%
5      40     1.3%
6      12     0.4%
7      13     0.4%
8      8      0.3%
9      8      0.3%
10     3      0.1%

Maximum 10 values

Value  Count  Frequency (%)
19     1      < 0.1%
16     1      < 0.1%
15     1      < 0.1%
13     3      0.1%
12     4      0.1%
11     7      0.2%
10     3      0.1%
9      8      0.3%
8      8      0.3%
7      13     0.4%

수거장소(도로명주소)
Text

Distinct: 889
Distinct (%): 29.7%
Missing: 0
Missing (%): 0.0%
Memory size: 23.6 KiB

Length

Max length: 24
Median length: 22
Mean length: 17.37992
Min length: 13

Characters and Unicode

Total characters: 52105
Distinct characters: 156
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
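
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories over the five sample addresses shown in this report; it is not the profiler's implementation, and it covers categories only, since `unicodedata` does not expose scripts or blocks.

```python
import unicodedata
from collections import Counter

# First five sample rows of the address column, copied from this report.
rows = [
    "경기도 시흥시 시청로 100",
    "경기도 시흥시 시화호수전원2길 82-14",
    "경기도 시흥시 은계중앙로 151",
    "경기도 시흥시 서울대학로 172-20",
    "경기도 시흥시 서울대학로 172-20",
]

# unicodedata.category returns two-letter general category codes:
# Lo = Other Letter, Nd = Decimal Number, Zs = Space Separator, Pd = Dash Punctuation.
category_counts = Counter(unicodedata.category(ch) for row in rows for ch in row)

for category, count in category_counts.most_common():
    print(category, count)
```

The same four categories appear here as in the full column: Hangul syllables fall under Other Letter, digits under Decimal Number, and the hyphen in lot numbers under Dash Punctuation.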

Unique

Unique: 546
Unique (%): 18.2%

Sample

1st row: 경기도 시흥시 시청로 100
2nd row: 경기도 시흥시 시화호수전원2길 82-14
3rd row: 경기도 시흥시 은계중앙로 151
4th row: 경기도 시흥시 서울대학로 172-20
5th row: 경기도 시흥시 서울대학로 172-20

Value               Count  Frequency (%)
경기도              2998   25.0%
시흥시              2998   25.0%
은계중앙로          395    3.3%
장현순환로          169    1.4%
65                  108    0.9%
서울대학로          90     0.8%
11                  86     0.7%
시청로              82     0.7%
은계남로            80     0.7%
배곧4로             78     0.7%
Other values (825)  4908   40.9%
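
The table above counts individual words rather than whole addresses. A minimal sketch of that kind of tally, assuming tokens are obtained by splitting on whitespace (the exact tokenization used by the profiler is not stated in this report), applied to the five sample rows:

```python
from collections import Counter

# First five sample rows of the address column, copied from this report.
rows = [
    "경기도 시흥시 시청로 100",
    "경기도 시흥시 시화호수전원2길 82-14",
    "경기도 시흥시 은계중앙로 151",
    "경기도 시흥시 서울대학로 172-20",
    "경기도 시흥시 서울대학로 172-20",
]

# Assumption: words are whitespace-separated tokens.
tokens = [token for row in rows for token in row.split()]
word_counts = Counter(tokens)
total_tokens = len(tokens)

for word, count in word_counts.most_common(4):
    print(word, count, f"{100 * count / total_tokens:.1f}%")
```

Each address here has four tokens, so the province and city names each account for a quarter of all tokens, which is consistent with the 25.0% shares reported for 경기도 and 시흥시.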

Most occurring characters

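Value               Count  Frequency (%)
(space)             8994   17.3%
[not preserved]     6226   11.9%
[not preserved]     3059   5.9%
[not preserved]     3056   5.9%
[not preserved]     3052   5.9%
[not preserved]     3001   5.8%
[not preserved]     2627   5.0%
1                   2466   4.7%
2                   1461   2.8%
[not preserved]     1201   2.3%
Other values (146)  16962  32.6%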

Most occurring categories

Value             Count  Frequency (%)
Other Letter      32014  61.4%
Decimal Number    10403  20.0%
Space Separator   8994   17.3%
Dash Punctuation  694    1.3%
1.3%

Most frequent character per category

Other Letter

Value               Count  Frequency (%)
[not preserved]     6226   19.4%
[not preserved]     3059   9.6%
[not preserved]     3056   9.5%
[not preserved]     3052   9.5%
[not preserved]     3001   9.4%
[not preserved]     2627   8.2%
[not preserved]     1201   3.8%
[not preserved]     823    2.6%
[not preserved]     683    2.1%
[not preserved]     562    1.8%
Other values (134)  7724   24.1%

Decimal Number

Value  Count  Frequency (%)
1      2466   23.7%
2      1461   14.0%
3      1134   10.9%
4      934    9.0%
5      916    8.8%
0      778    7.5%
6      776    7.5%
7      740    7.1%
8      665    6.4%
9      533    5.1%

Space Separator

Value    Count  Frequency (%)
(space)  8994   100.0%

Dash Punctuation

Value  Count  Frequency (%)
-      694    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  32014  61.4%
Common  20091  38.6%

Most frequent character per script

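Hangul

Value               Count  Frequency (%)
[not preserved]     6226   19.4%
[not preserved]     3059   9.6%
[not preserved]     3056   9.5%
[not preserved]     3052   9.5%
[not preserved]     3001   9.4%
[not preserved]     2627   8.2%
[not preserved]     1201   3.8%
[not preserved]     823    2.6%
[not preserved]     683    2.1%
[not preserved]     562    1.8%
Other values (134)  7724   24.1%

Common

Value             Count  Frequency (%)
(space)           8994   44.8%
1                 2466   12.3%
2                 1461   7.3%
3                 1134   5.6%
4                 934    4.6%
5                 916    4.6%
0                 778    3.9%
6                 776    3.9%
7                 740    3.7%
-                 694    3.5%
Other values (2)  1198   6.0%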

Most occurring blocks

Value   Count  Frequency (%)
Hangul  32014  61.4%
ASCII   20091  38.6%

Most frequent character per block

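ASCII

Value             Count  Frequency (%)
(space)           8994   44.8%
1                 2466   12.3%
2                 1461   7.3%
3                 1134   5.6%
4                 934    4.6%
5                 916    4.6%
0                 778    3.9%
6                 776    3.9%
7                 740    3.7%
-                 694    3.5%
Other values (2)  1198   6.0%

Hangul

Value               Count  Frequency (%)
[not preserved]     6226   19.4%
[not preserved]     3059   9.6%
[not preserved]     3056   9.5%
[not preserved]     3052   9.5%
[not preserved]     3001   9.4%
[not preserved]     2627   8.2%
[not preserved]     1201   3.8%
[not preserved]     823    2.6%
[not preserved]     683    2.1%
[not preserved]     562    1.8%
Other values (134)  7724   24.1%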

우편번호
Real number (ℝ)

Distinct: 172
Distinct (%): 5.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14982.245
Minimum: 14900
Maximum: 15121
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 26.5 KiB

Quantile statistics

Minimum: 14900
5-th percentile: 14904
Q1: 14922
Median: 14996
Q3: 15011
95-th percentile: 15118
Maximum: 15121
Range: 221
Interquartile range (IQR): 89

Descriptive statistics

Standard deviation: 61.323706
Coefficient of variation (CV): 0.0040930918
Kurtosis: -0.36909141
Mean: 14982.245
Median Absolute Deviation (MAD): 52
Skewness: 0.59200716
Sum: 44916771
Variance: 3760.5969
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
14996               322    10.7%
14922               255    8.5%
15010               201    6.7%
14923               198    6.6%
15011               168    5.6%
15120               107    3.6%
15118               73     2.4%
14902               70     2.3%
15002               67     2.2%
14911               65     2.2%
Other values (162)  1472   49.1%

Minimum 10 values

Value  Count  Frequency (%)
14900  22     0.7%
14901  46     1.5%
14902  70     2.3%
14903  2      0.1%
14904  28     0.9%
14905  19     0.6%
14906  8      0.3%
14907  16     0.5%
14908  34     1.1%
14909  13     0.4%

Maximum 10 values

Value  Count  Frequency (%)
15121  12     0.4%
15120  107    3.6%
15119  8      0.3%
15118  73     2.4%
15117  3      0.1%
15116  1      < 0.1%
15115  2      0.1%
15114  2      0.1%
15111  3      0.1%
15110  3      0.1%

관리기관
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 23.6 KiB
경기도 시흥시청: 2998

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도 시흥시청
2nd row: 경기도 시흥시청
3rd row: 경기도 시흥시청
4th row: 경기도 시흥시청
5th row: 경기도 시흥시청

Common Values

Value            Count  Frequency (%)
경기도 시흥시청  2998   100.0%

Length

Histogram of lengths of the category

Value     Count  Frequency (%)
경기도    2998   50.0%
시흥시청  2998   50.0%

전화번호
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 23.6 KiB
031-310-2254: 2998

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 031-310-2254
2nd row: 031-310-2254
3rd row: 031-310-2254
4th row: 031-310-2254
5th row: 031-310-2254

Common Values

Value         Count  Frequency (%)
031-310-2254  2998   100.0%

Length

Histogram of lengths of the category

Value         Count  Frequency (%)
031-310-2254  2998   100.0%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 23.6 KiB
Minimum: 2023-12-08 00:00:00
Maximum: 2023-12-08 00:00:00
Histogram with fixed size bins (bins=1)

Correlations

          개수   우편번호
개수      1.000  0.022
우편번호  0.022  1.000

          개수   우편번호
개수      1.000  0.034
우편번호  0.034  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  수거일자    개수  수거장소(도로명주소)                우편번호  관리기관         전화번호      데이터기준일자
0      2022-12-31  3     경기도 시흥시 시청로 100             14996     경기도 시흥시청  031-310-2254  2023-12-08
1      2022-12-31  2     경기도 시흥시 시화호수전원2길 82-14  15118     경기도 시흥시청  031-310-2254  2023-12-08
2      2022-12-31  1     경기도 시흥시 은계중앙로 151         14923     경기도 시흥시청  031-310-2254  2023-12-08
3      2022-12-31  2     경기도 시흥시 서울대학로 172-20      15011     경기도 시흥시청  031-310-2254  2023-12-08
4      2022-12-31  1     경기도 시흥시 서울대학로 172-20      15011     경기도 시흥시청  031-310-2254  2023-12-08
5      2022-12-31  3     경기도 시흥시 월곶중앙로 90          14964     경기도 시흥시청  031-310-2254  2023-12-08
6      2022-12-31  1     경기도 시흥시 새재로 3               14998     경기도 시흥시청  031-310-2254  2023-12-08
7      2022-12-31  1     경기도 시흥시 마유로423번길 20-2     15024     경기도 시흥시청  031-310-2254  2023-12-08
8      2022-12-31  1     경기도 시흥시 배곧1로 27-16          15011     경기도 시흥시청  031-310-2254  2023-12-08
9      2022-12-31  1     경기도 시흥시 은계중앙로 118         14923     경기도 시흥시청  031-310-2254  2023-12-08

Last rows

Index  수거일자    개수  수거장소(도로명주소)               우편번호  관리기관         전화번호      데이터기준일자
2988   2022-08-31  1     경기도 시흥시 수인로3312번길 16     14911     경기도 시흥시청  031-310-2254  2023-12-08
2989   2022-08-31  1     경기도 시흥시 군자천로131번길 31   15091     경기도 시흥시청  031-310-2254  2023-12-08
2990   2022-08-31  1     경기도 시흥시 수인로3312번길 16     14911     경기도 시흥시청  031-310-2254  2023-12-08
2991   2022-08-31  4     경기도 시흥시 군자로534번길 40      15069     경기도 시흥시청  031-310-2254  2023-12-08
2992   2022-08-31  2     경기도 시흥시 수인로3312번길 16     14911     경기도 시흥시청  031-310-2254  2023-12-08
2993   2022-08-31  2     경기도 시흥시 은계중앙로 65         14922     경기도 시흥시청  031-310-2254  2023-12-08
2994   2022-08-31  1     경기도 시흥시 소래산길 65           14902     경기도 시흥시청  031-310-2254  2023-12-08
2995   2022-08-31  2     경기도 시흥시 군자천로131번길 31   15091     경기도 시흥시청  031-310-2254  2023-12-08
2996   2022-08-31  4     경기도 시흥시 정왕대로143번길 9     15032     경기도 시흥시청  031-310-2254  2023-12-08
2997   2022-08-31  1     경기도 시흥시 서울대학로 190         15011     경기도 시흥시청  031-310-2254  2023-12-08

Duplicate rows

Most frequently occurring

Index  수거일자    개수  수거장소(도로명주소)                우편번호  관리기관         전화번호      데이터기준일자  # duplicates
168    2022-12-29  1     경기도 시흥시 은행고길 85            14916     경기도 시흥시청  031-310-2254  2023-12-08      6
2      2022-09-01  1     경기도 시흥시 은계중앙로 65          14922     경기도 시흥시청  031-310-2254  2023-12-08      4
55     2022-10-04  1     경기도 시흥시 은계중앙로 140         14923     경기도 시흥시청  031-310-2254  2023-12-08      4
69     2022-10-14  1     경기도 시흥시 공단1대로322번길 17   15106     경기도 시흥시청  031-310-2254  2023-12-08      4
102    2022-11-07  1     경기도 시흥시 은계중앙로 65          14922     경기도 시흥시청  031-310-2254  2023-12-08      4
108    2022-11-09  1     경기도 시흥시 장현순환로 81          14996     경기도 시흥시청  031-310-2254  2023-12-08      4
0      2022-08-31  1     경기도 시흥시 수인로3312번길 16      14911     경기도 시흥시청  031-310-2254  2023-12-08      3
1      2022-09-01  1     경기도 시흥시 은계중앙로 140         14923     경기도 시흥시청  031-310-2254  2023-12-08      3
3      2022-09-02  1     경기도 시흥시 은계중앙로 325         15120     경기도 시흥시청  031-310-2254  2023-12-08      3
12     2022-09-08  1     경기도 시흥시 장곡북로 43            14996     경기도 시흥시청  031-310-2254  2023-12-08      3
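
A table like this can be built by counting how often each full row occurs. The sketch below is a minimal illustration, not the profiler's code. It uses the last ten rows from the Sample section, reduced to the four non-constant columns; the three constant columns are dropped because they cannot affect whether two rows match.

```python
from collections import Counter

# Rows 2988-2997 from the Sample section, as
# (수거일자, 개수, 수거장소(도로명주소), 우편번호).
rows = [
    ("2022-08-31", 1, "경기도 시흥시 수인로3312번길 16", 14911),
    ("2022-08-31", 1, "경기도 시흥시 군자천로131번길 31", 15091),
    ("2022-08-31", 1, "경기도 시흥시 수인로3312번길 16", 14911),
    ("2022-08-31", 4, "경기도 시흥시 군자로534번길 40", 15069),
    ("2022-08-31", 2, "경기도 시흥시 수인로3312번길 16", 14911),
    ("2022-08-31", 2, "경기도 시흥시 은계중앙로 65", 14922),
    ("2022-08-31", 1, "경기도 시흥시 소래산길 65", 14902),
    ("2022-08-31", 2, "경기도 시흥시 군자천로131번길 31", 15091),
    ("2022-08-31", 4, "경기도 시흥시 정왕대로143번길 9", 15032),
    ("2022-08-31", 1, "경기도 시흥시 서울대학로 190", 15011),
]

# Count occurrences of each distinct row and keep those seen more than once.
row_counts = Counter(rows)
duplicate_groups = {row: n for row, n in row_counts.items() if n > 1}

# Two possible ways to summarise duplication:
n_groups = len(duplicate_groups)                            # distinct repeated rows
n_extra_copies = sum(n - 1 for n in duplicate_groups.values())  # surplus copies

for row, n in sorted(duplicate_groups.items(), key=lambda item: -item[1]):
    print(row, n)
```

Rows 2988 and 2990 match exactly, while row 2992 shares the date and address but has a different 개수, so it is not a duplicate. Whether a headline duplicate count refers to distinct repeated rows or to surplus copies depends on the tool's definition, which this report does not state.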