Overview

Dataset statistics

Number of variables17
Number of observations76
Missing cells141
Missing cells (%)10.9%
Duplicate rows5
Duplicate rows (%)6.6%
Total size in memory10.2 KiB
Average record size in memory137.7 B

Variable types

Text2
Categorical2
Unsupported13

Dataset

Description관악구시설관리공단에서 관리운영하는 사업장의 생활 폐기물 및 재활용에 대해 종량제 봉투사용량과 재활용봉투 사용량의 집계 실적
URLhttps://www.data.go.kr/data/15121994/fileData.do

Alerts

Dataset has 5 (6.6%) duplicate rowsDuplicates
Unnamed: 2 is highly overall correlated with Unnamed: 3High correlation
Unnamed: 3 is highly overall correlated with Unnamed: 2High correlation
2023년 관악구시설관리공단 재활용 및 종량제봉투 사용실적 has 70 (92.1%) missing valuesMissing
Unnamed: 1 has 58 (76.3%) missing valuesMissing
Unnamed: 4 has 1 (1.3%) missing valuesMissing
Unnamed: 5 has 1 (1.3%) missing valuesMissing
Unnamed: 6 has 1 (1.3%) missing valuesMissing
Unnamed: 7 has 1 (1.3%) missing valuesMissing
Unnamed: 8 has 1 (1.3%) missing valuesMissing
Unnamed: 9 has 1 (1.3%) missing valuesMissing
Unnamed: 10 has 1 (1.3%) missing valuesMissing
Unnamed: 11 has 1 (1.3%) missing valuesMissing
Unnamed: 12 has 1 (1.3%) missing valuesMissing
Unnamed: 13 has 1 (1.3%) missing valuesMissing
Unnamed: 14 has 1 (1.3%) missing valuesMissing
Unnamed: 15 has 1 (1.3%) missing valuesMissing
Unnamed: 16 has 1 (1.3%) missing valuesMissing
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 02:59:15.705179
Analysis finished2023-12-12 02:59:17.004552
Duration1.3 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct6
Distinct (%)100.0%
Missing70
Missing (%)92.1%
Memory size740.0 B
2023-12-12T11:59:17.149456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.3333333
Min length2

Characters and Unicode

Total characters26
Distinct characters20
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)100.0%

Sample

1st row구 분
2nd row문화체육팀
3rd row생활체육팀
4th row주차사업팀
5th row환경시설팀
ValueCountFrequency (%)
1
14.3%
1
14.3%
문화체육팀 1
14.3%
생활체육팀 1
14.3%
주차사업팀 1
14.3%
환경시설팀 1
14.3%
합계 1
14.3%
2023-12-12T11:59:17.593163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
 
15.4%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (10) 10
38.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24
92.3%
Space Separator 2
 
7.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
16.7%
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (9) 9
37.5%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24
92.3%
Common 2
 
7.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
16.7%
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (9) 9
37.5%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24
92.3%
ASCII 2
 
7.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
16.7%
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (9) 9
37.5%
ASCII
ValueCountFrequency (%)
2
100.0%

Unnamed: 1
Text

MISSING 

Distinct18
Distinct (%)100.0%
Missing58
Missing (%)76.3%
Memory size740.0 B
2023-12-12T11:59:17.875038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7.5
Mean length5.6111111
Min length4

Characters and Unicode

Total characters101
Distinct characters54
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)100.0%

Sample

1st row시 설 명
2nd row관악구민체육센터
3rd row신림체육센터
4th row까치산체육센터
5th row국사봉체육관
ValueCountFrequency (%)
1
 
5.0%
1
 
5.0%
재활용봉투 1
 
5.0%
관악구청사 1
 
5.0%
가족행복센터 1
 
5.0%
보훈회관 1
 
5.0%
별빛내린천 1
 
5.0%
주차사업팀 1
 
5.0%
선우체육관 1
 
5.0%
장군봉체육관 1
 
5.0%
Other values (10) 10
50.0%
2023-12-12T11:59:18.438449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8
 
7.9%
8
 
7.9%
8
 
7.9%
4
 
4.0%
4
 
4.0%
4
 
4.0%
4
 
4.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
Other values (44) 52
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 98
97.0%
Space Separator 2
 
2.0%
Decimal Number 1
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
8.2%
8
 
8.2%
8
 
8.2%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (42) 49
50.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 98
97.0%
Common 3
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
8.2%
8
 
8.2%
8
 
8.2%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (42) 49
50.0%
Common
ValueCountFrequency (%)
2
66.7%
2 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 98
97.0%
ASCII 3
 
3.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8
 
8.2%
8
 
8.2%
8
 
8.2%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (42) 49
50.0%
ASCII
ValueCountFrequency (%)
2
66.7%
2 1
33.3%

Unnamed: 2
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size740.0 B
<NA>
45 
재활용봉투
15 
종량제봉투
15 
종 류
 
1

Length

Max length5
Median length4
Mean length4.3947368
Min length4

Unique

Unique1 ?
Unique (%)1.3%

Sample

1st row<NA>
2nd row종 류
3rd row재활용봉투
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 45
59.2%
재활용봉투 15
 
19.7%
종량제봉투 15
 
19.7%
종 류 1
 
1.3%

Length

2023-12-12T11:59:18.666677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:59:18.829641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 45
58.4%
재활용봉투 15
 
19.5%
종량제봉투 15
 
19.5%
1
 
1.3%
1
 
1.3%

Unnamed: 3
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)13.2%
Missing0
Missing (%)0.0%
Memory size740.0 B
절감액[원]
35 
월별 배출량[재-100ℓ]
15 
월별 배출량[종-75ℓ]
13 
월별 배출량[종-50ℓ]
월별 배출량[재-50ℓ]
 
3
Other values (5)

Length

Max length14
Median length13
Mean length9.3552632
Min length4

Unique

Unique4 ?
Unique (%)5.3%

Sample

1st row<NA>
2nd row구 분
3rd row월별 배출량[재-50ℓ]
4th row절감액[원]
5th row월별 배출량[재-100ℓ]

Common Values

ValueCountFrequency (%)
절감액[원] 35
46.1%
월별 배출량[재-100ℓ] 15
19.7%
월별 배출량[종-75ℓ] 13
 
17.1%
월별 배출량[종-50ℓ] 4
 
5.3%
월별 배출량[재-50ℓ] 3
 
3.9%
절감액 계 2
 
2.6%
<NA> 1
 
1.3%
구 분 1
 
1.3%
배출량 계 1
 
1.3%
배출량[ℓ] 계 1
 
1.3%

Length

2023-12-12T11:59:19.025028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:59:19.171170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
절감액[원 35
30.2%
월별 35
30.2%
배출량[재-100ℓ 15
12.9%
배출량[종-75ℓ 13
 
11.2%
배출량[종-50ℓ 4
 
3.4%
4
 
3.4%
배출량[재-50ℓ 3
 
2.6%
절감액 2
 
1.7%
na 1
 
0.9%
1
 
0.9%
Other values (3) 3
 
2.6%

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.3%
Memory size740.0 B

Correlations

2023-12-12T11:59:19.302443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023년 관악구시설관리공단 재활용 및 종량제봉투 사용실적Unnamed: 1Unnamed: 2Unnamed: 3
2023년 관악구시설관리공단 재활용 및 종량제봉투 사용실적1.0001.0001.0001.000
Unnamed: 11.0001.0001.0001.000
Unnamed: 21.0001.0001.0001.000
Unnamed: 31.0001.0001.0001.000
2023-12-12T11:59:19.441734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 2Unnamed: 3
Unnamed: 21.0000.964
Unnamed: 30.9641.000
2023-12-12T11:59:19.562456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 2Unnamed: 3
Unnamed: 21.0000.964
Unnamed: 30.9641.000

Missing values

2023-12-12T11:59:16.052378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:59:16.390905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T11:59:16.726957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

2023년 관악구시설관리공단 재활용 및 종량제봉투 사용실적Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16
0<NA><NA><NA><NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
1구 분시 설 명종 류구 분소 계1월2월3월4월5월6월7월8월9월10월11월12월
2문화체육팀관악구민체육센터재활용봉투월별 배출량[재-50ℓ]0000000000000
3<NA><NA><NA>절감액[원]-11547.8-1212.2-510.4-574.2-638-1020.8-1084.6-1467.4-1276-1212.2-1212.2-638-701.8
4<NA><NA><NA>월별 배출량[재-100ℓ]0000000000000
5<NA><NA><NA>절감액[원]-34815-4950-1815-1815-2145-3135-2970-4125-3795-3630-3300-1485-1650
6<NA><NA>종량제봉투월별 배출량[종-75ℓ]0000000000000
7<NA><NA><NA>절감액[원]-389160-15040-22560-18800-26320-37600-45120-47000-43240-47000-47000-20680-18800
8<NA>신림체육센터재활용봉투월별 배출량[재-100ℓ]0000000000000
9<NA><NA><NA>절감액[원]-27555-3300-3300-3135-1815-2475-2640-1815-1980-1650-1815-1980-1650
2023년 관악구시설관리공단 재활용 및 종량제봉투 사용실적Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16
66<NA><NA><NA>월별 배출량[재-100ℓ]12512500000000000
67<NA><NA><NA>절감액[원]-522225-25740-40920-43230-42405-45210-47850-49170-50325-47355-48675-41250-40095
68<NA><NA><NA>배출량 계125001250000000000000
69<NA><NA><NA>절감액 계-550871.2-28228.2-42897.8-45654.4-44638-47762-50912.4-51594.4-52877-49843.2-50780.4-43100.2-42583.2
70<NA>종량제봉투<NA>월별 배출량[종-50ℓ]0000000000000
71<NA><NA><NA>절감액[원]-10041250-711250-535000-718750-916250-1090000-1107500-1175000-770000-731250-716250-791250-778750
72<NA><NA><NA>월별 배출량[종-75ℓ]12712700000000000
73<NA><NA><NA>절감액[원]-8100920-426760-673040-682440-644840-680560-706880-663640-705000-714400-716280-691840-795240
74<NA><NA><NA>배출량[ℓ] 계9525952500000000000
75<NA><NA><NA>절감액 계-18142170-1138010-1208040-1401190-1561090-1770560-1814380-1838640-1475000-1445650-1432530-1483090-1573990

Duplicate rows

Most frequently occurring

2023년 관악구시설관리공단 재활용 및 종량제봉투 사용실적Unnamed: 1Unnamed: 2Unnamed: 3# duplicates
4<NA><NA><NA>절감액[원]35
1<NA><NA>종량제봉투월별 배출량[종-75ℓ]12
0<NA><NA>종량제봉투월별 배출량[종-50ℓ]3
2<NA><NA><NA>월별 배출량[재-100ℓ]2
3<NA><NA><NA>절감액 계2