Overview

Dataset statistics

Number of variables4
Number of observations339
Missing cells0
Missing cells (%)0.0%
Duplicate rows45
Duplicate rows (%)13.3%
Total size in memory10.7 KiB
Average record size in memory32.4 B

Variable types

Categorical3
Text1

Dataset

Description수도권매립지 폐기물 반입허용시간 정보입니다.개방항목 : 광역지자체명, 폐기물명, 시작시간, 종료시간의 항목을 제공합니다.
Author수도권매립지관리공사
URLhttps://www.data.go.kr/data/15064395/fileData.do

Alerts

Dataset has 45 (13.3%) duplicate rowsDuplicates
종료시간 is highly imbalanced (92.7%)Imbalance

Reproduction

Analysis started2024-04-13 12:54:30.933772
Analysis finished2024-04-13 12:54:33.840463
Duration2.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
서울시
113 
인천시
113 
경기도
113 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울시
2nd row서울시
3rd row서울시
4th row서울시
5th row서울시

Common Values

ValueCountFrequency (%)
서울시 113
33.3%
인천시 113
33.3%
경기도 113
33.3%

Length

2024-04-13T21:54:34.073231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-13T21:54:34.408977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울시 113
33.3%
인천시 113
33.3%
경기도 113
33.3%
Distinct97
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2024-04-13T21:54:35.504936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length19
Mean length6.3539823
Min length2

Characters and Unicode

Total characters2154
Distinct characters163
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐아스팔트콘크리트
2nd row폐벽돌
3rd row폐블럭
4th row폐기와
5th row폐목재
ValueCountFrequency (%)
처리물 9
 
2.1%
무기성 9
 
2.1%
밖의 9
 
2.1%
9
 
2.1%
폐아스팔트콘크리트 6
 
1.4%
건설폐토석 6
 
1.4%
하수오니 6
 
1.4%
정수오니 6
 
1.4%
폐벽돌 6
 
1.4%
폐기물 6
 
1.4%
Other values (105) 366
83.6%
2024-04-13T21:54:37.063799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
171
 
7.9%
99
 
4.6%
75
 
3.5%
57
 
2.6%
57
 
2.6%
54
 
2.5%
48
 
2.2%
45
 
2.1%
( 42
 
1.9%
) 42
 
1.9%
Other values (153) 1464
68.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1914
88.9%
Space Separator 99
 
4.6%
Open Punctuation 42
 
1.9%
Close Punctuation 42
 
1.9%
Decimal Number 39
 
1.8%
Other Punctuation 15
 
0.7%
Connector Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
171
 
8.9%
75
 
3.9%
57
 
3.0%
57
 
3.0%
54
 
2.8%
48
 
2.5%
45
 
2.4%
42
 
2.2%
39
 
2.0%
30
 
1.6%
Other values (141) 1296
67.7%
Decimal Number
ValueCountFrequency (%)
1 12
30.8%
0 6
15.4%
5 6
15.4%
2 6
15.4%
4 3
 
7.7%
7 3
 
7.7%
3 3
 
7.7%
Space Separator
ValueCountFrequency (%)
99
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Other Punctuation
ValueCountFrequency (%)
% 15
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1914
88.9%
Common 240
 
11.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
171
 
8.9%
75
 
3.9%
57
 
3.0%
57
 
3.0%
54
 
2.8%
48
 
2.5%
45
 
2.4%
42
 
2.2%
39
 
2.0%
30
 
1.6%
Other values (141) 1296
67.7%
Common
ValueCountFrequency (%)
99
41.2%
( 42
17.5%
) 42
17.5%
% 15
 
6.2%
1 12
 
5.0%
0 6
 
2.5%
5 6
 
2.5%
2 6
 
2.5%
4 3
 
1.2%
7 3
 
1.2%
Other values (2) 6
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1911
88.7%
ASCII 240
 
11.1%
Compat Jamo 3
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
171
 
8.9%
75
 
3.9%
57
 
3.0%
57
 
3.0%
54
 
2.8%
48
 
2.5%
45
 
2.4%
42
 
2.2%
39
 
2.0%
30
 
1.6%
Other values (140) 1293
67.7%
ASCII
ValueCountFrequency (%)
99
41.2%
( 42
17.5%
) 42
17.5%
% 15
 
6.2%
1 12
 
5.0%
0 6
 
2.5%
5 6
 
2.5%
2 6
 
2.5%
4 3
 
1.2%
7 3
 
1.2%
Other values (2) 6
 
2.5%
Compat Jamo
ValueCountFrequency (%)
3
100.0%

시작시간
Categorical

Distinct4
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
06:30
120 
06:00
108 
07:30
96 
07:00
15 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row06:30
2nd row06:30
3rd row06:30
4th row06:30
5th row06:30

Common Values

ValueCountFrequency (%)
06:30 120
35.4%
06:00 108
31.9%
07:30 96
28.3%
07:00 15
 
4.4%

Length

2024-04-13T21:54:37.483765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-13T21:54:37.811049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
06:30 120
35.4%
06:00 108
31.9%
07:30 96
28.3%
07:00 15
 
4.4%

종료시간
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
16:00
336 
17:00
 
3

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row16:00
2nd row16:00
3rd row16:00
4th row16:00
5th row16:00

Common Values

ValueCountFrequency (%)
16:00 336
99.1%
17:00 3
 
0.9%

Length

2024-04-13T21:54:38.182528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-13T21:54:38.501638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
16:00 336
99.1%
17:00 3
 
0.9%

Correlations

2024-04-13T21:54:38.689345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
광역지자체명폐기물명시작시간종료시간
광역지자체명1.0000.0000.0000.000
폐기물명0.0001.0000.9991.000
시작시간0.0000.9991.0000.153
종료시간0.0001.0000.1531.000
2024-04-13T21:54:38.950139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
광역지자체명종료시간시작시간
광역지자체명1.0000.0000.000
종료시간0.0001.0000.101
시작시간0.0000.1011.000
2024-04-13T21:54:39.194922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
광역지자체명시작시간종료시간
광역지자체명1.0000.0000.000
시작시간0.0001.0000.101
종료시간0.0000.1011.000

Missing values

2024-04-13T21:54:33.409664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-13T21:54:33.710522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

광역지자체명폐기물명시작시간종료시간
0서울시폐아스팔트콘크리트06:3016:00
1서울시폐벽돌06:3016:00
2서울시폐블럭06:3016:00
3서울시폐기와06:3016:00
4서울시폐목재06:3016:00
5서울시폐합성수지06:3016:00
6서울시폐섬유06:3016:00
7서울시폐벽지06:3016:00
8서울시건설오니06:3016:00
9서울시폐금속류06:3016:00
광역지자체명폐기물명시작시간종료시간
329경기도사토준설토06:0016:00
330경기도골재(매립관리)06:0016:00
331경기도탈취약품(매립관리)06:0016:00
332경기도건설폐재06:0016:00
333경기도고철등매각06:0016:00
334경기도낙엽06:0016:00
335경기도보증금06:0016:00
336경기도탈리액유분06:0016:00
337경기도탈수슬러지06:0017:00
338경기도음폐수06:0016:00

Duplicate rows

Most frequently occurring

광역지자체명폐기물명시작시간종료시간# duplicates
0경기도건설공사로 인하여 발생되는 그 밖의 폐기물06:3016:002
1경기도건설오니06:3016:002
2경기도건설폐토석06:3016:002
3경기도폐기와06:3016:002
4경기도폐목재06:3016:002
5경기도폐벽돌06:3016:002
6경기도폐보드류06:3016:002
7경기도폐블럭06:3016:002
8경기도폐섬유06:3016:002
9경기도폐아스팔트콘크리트06:3016:002