Overview

Dataset statistics

Number of variables21
Number of observations23
Missing cells93
Missing cells (%)19.3%
Duplicate rows1
Duplicate rows (%)4.3%
Total size in memory3.9 KiB
Average record size in memory173.7 B

Variable types

Text3
Categorical1
Unsupported17

Dataset

Description인천광역시 건설폐기물 처리현황에 관한 데이터로 시도 건설폐기물 발생량과 시군구 건설폐기물 발생량을 제공합니다.
URLhttps://www.data.go.kr/data/15121824/fileData.do

Alerts

Dataset has 1 (4.3%) duplicate rowsDuplicates
1. 건설폐기물 발생 및 처리현황 has 20 (87.0%) missing valuesMissing
Unnamed: 1 has 17 (73.9%) missing valuesMissing
Unnamed: 2 has 10 (43.5%) missing valuesMissing
Unnamed: 4 has 3 (13.0%) missing valuesMissing
Unnamed: 5 has 2 (8.7%) missing valuesMissing
Unnamed: 6 has 3 (13.0%) missing valuesMissing
Unnamed: 7 has 3 (13.0%) missing valuesMissing
Unnamed: 8 has 3 (13.0%) missing valuesMissing
Unnamed: 9 has 2 (8.7%) missing valuesMissing
Unnamed: 10 has 3 (13.0%) missing valuesMissing
Unnamed: 11 has 3 (13.0%) missing valuesMissing
Unnamed: 12 has 3 (13.0%) missing valuesMissing
Unnamed: 13 has 2 (8.7%) missing valuesMissing
Unnamed: 14 has 3 (13.0%) missing valuesMissing
Unnamed: 15 has 3 (13.0%) missing valuesMissing
Unnamed: 16 has 3 (13.0%) missing valuesMissing
Unnamed: 17 has 2 (8.7%) missing valuesMissing
Unnamed: 18 has 3 (13.0%) missing valuesMissing
Unnamed: 19 has 3 (13.0%) missing valuesMissing
Unnamed: 20 has 2 (8.7%) missing valuesMissing
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 20 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 08:38:05.510683
Analysis finished2023-12-12 08:38:05.681899
Duration0.17 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3
Distinct (%)100.0%
Missing20
Missing (%)87.0%
Memory size316.0 B
2023-12-12T17:38:05.783895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length2
Mean length8.6666667
Min length2

Characters and Unicode

Total characters26
Distinct characters20
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)100.0%

Sample

1st row가. 시·도 건설폐기물 발생 및 처리현황
2nd row시도
3rd row인천
ValueCountFrequency (%)
1
12.5%
시·도 1
12.5%
건설폐기물 1
12.5%
발생 1
12.5%
1
12.5%
처리현황 1
12.5%
시도 1
12.5%
인천 1
12.5%
2023-12-12T17:38:06.114950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
19.2%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (10) 10
38.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19
73.1%
Space Separator 5
 
19.2%
Other Punctuation 2
 
7.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
10.5%
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (7) 7
36.8%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
· 1
50.0%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19
73.1%
Common 7
 
26.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
 
10.5%
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (7) 7
36.8%
Common
ValueCountFrequency (%)
5
71.4%
. 1
 
14.3%
· 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19
73.1%
ASCII 6
 
23.1%
None 1
 
3.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5
83.3%
. 1
 
16.7%
Hangul
ValueCountFrequency (%)
2
 
10.5%
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (7) 7
36.8%
None
ValueCountFrequency (%)
· 1
100.0%

Unnamed: 1
Text

MISSING 

Distinct6
Distinct (%)100.0%
Missing17
Missing (%)73.9%
Memory size316.0 B
2023-12-12T17:38:06.304566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length6
Mean length4.3333333
Min length2

Characters and Unicode

Total characters26
Distinct characters16
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)100.0%

Sample

1st row폐기물 종류
2nd row총계
3rd row가연성
4th row불연성
5th row가연성ㆍ불연성 혼합
ValueCountFrequency (%)
폐기물 1
12.5%
종류 1
12.5%
총계 1
12.5%
가연성 1
12.5%
불연성 1
12.5%
가연성ㆍ불연성 1
12.5%
혼합 1
12.5%
기타 1
12.5%
2023-12-12T17:38:06.692922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
15.4%
4
15.4%
2
 
7.7%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (6) 6
23.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24
92.3%
Space Separator 2
 
7.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
16.7%
4
16.7%
2
 
8.3%
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (5) 5
20.8%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24
92.3%
Common 2
 
7.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
16.7%
4
16.7%
2
 
8.3%
2
 
8.3%
2
 
8.3%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Other values (5) 5
20.8%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 23
88.5%
ASCII 2
 
7.7%
Compat Jamo 1
 
3.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
17.4%
4
17.4%
2
8.7%
2
8.7%
2
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (4) 4
17.4%
ASCII
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Unnamed: 2
Text

MISSING 

Distinct13
Distinct (%)100.0%
Missing10
Missing (%)43.5%
Memory size316.0 B
2023-12-12T17:38:06.918134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length7
Mean length4.6153846
Min length3

Characters and Unicode

Total characters60
Distinct characters36
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)100.0%

Sample

1st row폐목재
2nd row폐합성수지류
3rd row폐섬유
4th row폐벽지
5th row건설폐재류
ValueCountFrequency (%)
폐목재 1
 
6.7%
폐합성수지류 1
 
6.7%
폐섬유 1
 
6.7%
폐벽지 1
 
6.7%
건설폐재류 1
 
6.7%
건설오니 1
 
6.7%
폐금속류 1
 
6.7%
폐유리 1
 
6.7%
폐타일 1
 
6.7%
1
 
6.7%
Other values (5) 5
33.3%
2023-12-12T17:38:07.312957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12
20.0%
4
 
6.7%
3
 
5.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
Other values (26) 26
43.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53
88.3%
Uppercase Letter 5
 
8.3%
Space Separator 2
 
3.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
22.6%
4
 
7.5%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
1
 
1.9%
Other values (20) 20
37.7%
Uppercase Letter
ValueCountFrequency (%)
T 1
20.0%
P 1
20.0%
E 1
20.0%
M 1
20.0%
Y 1
20.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53
88.3%
Latin 5
 
8.3%
Common 2
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
22.6%
4
 
7.5%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
1
 
1.9%
Other values (20) 20
37.7%
Latin
ValueCountFrequency (%)
T 1
20.0%
P 1
20.0%
E 1
20.0%
M 1
20.0%
Y 1
20.0%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53
88.3%
ASCII 7
 
11.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12
22.6%
4
 
7.5%
3
 
5.7%
3
 
5.7%
2
 
3.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
2
 
3.8%
1
 
1.9%
Other values (20) 20
37.7%
ASCII
ValueCountFrequency (%)
2
28.6%
T 1
14.3%
P 1
14.3%
E 1
14.3%
M 1
14.3%
Y 1
14.3%

Unnamed: 3
Categorical

Distinct8
Distinct (%)34.8%
Missing0
Missing (%)0.0%
Memory size316.0 B
EMPTY
13 
<NA>
폐콘크리트
 
1
폐아스팔트콘크리트
 
1
폐벽돌
 
1
Other values (3)

Length

Max length9
Median length5
Mean length4.7391304
Min length3

Unique

Unique6 ?
Unique (%)26.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th rowEMPTY

Common Values

ValueCountFrequency (%)
EMPTY 13
56.5%
<NA> 4
 
17.4%
폐콘크리트 1
 
4.3%
폐아스팔트콘크리트 1
 
4.3%
폐벽돌 1
 
4.3%
폐블럭 1
 
4.3%
폐기와 1
 
4.3%
건설폐토석 1
 
4.3%

Length

2023-12-12T17:38:07.464018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:38:07.575517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
empty 13
56.5%
na 4
 
17.4%
폐콘크리트 1
 
4.3%
폐아스팔트콘크리트 1
 
4.3%
폐벽돌 1
 
4.3%
폐블럭 1
 
4.3%
폐기와 1
 
4.3%
건설폐토석 1
 
4.3%

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)8.7%
Memory size316.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)8.7%
Memory size316.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)8.7%
Memory size316.0 B

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)8.7%
Memory size316.0 B

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)13.0%
Memory size316.0 B

Unnamed: 20
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)8.7%
Memory size316.0 B

Sample

1. 건설폐기물 발생 및 처리현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20
0가. 시·도 건설폐기물 발생 및 처리현황<NA><NA><NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
1<NA><NA><NA><NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN( 단위 : 톤/년 )
2시도폐기물 종류<NA><NA>2021년\n발생량총계NaNNaNNaN공공처리NaNNaNNaN자가처리NaNNaNNaN위탁처리NaNNaNNaN
3<NA><NA><NA><NA>NaN재활용소각매립기타재활용소각매립기타재활용소각매립기타재활용소각매립기타
4인천총계<NA>EMPTY5604383.55510928.614979.278475.700078397.20005305510928.614979.225.50
5<NA>가연성폐목재EMPTY24805.924298507.9000000000024298507.900
6<NA><NA>폐합성수지류EMPTY27786.714837.812948.9000000000014837.812948.900
7<NA><NA>폐섬유EMPTY837.2262575.20000000000262575.200
8<NA><NA>폐벽지EMPTY00000000000000000
9<NA>불연성건설폐재류폐콘크리트2779990.92779965.4025.50000000002779965.4025.50
1. 건설폐기물 발생 및 처리현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20
13<NA><NA><NA>폐기와66.166.10000000000066.1000
14<NA><NA><NA>건설폐토석1072391.91072391.9000000000001072391.9000
15<NA><NA>건설오니EMPTY218629.1218629.100000000000218629.1000
16<NA><NA>폐금속류EMPTY00000000000000000
17<NA><NA>폐유리EMPTY00000000000000000
18<NA><NA>폐타일 및 폐도자기EMPTY94.494.40000000000094.4000
19<NA>가연성ㆍ불연성 혼합폐보드류EMPTY6231.76231.7000000000006231.7000
20<NA><NA>폐판넬EMPTY49.849.80000000000049.8000
21<NA><NA>혼합건설폐기물EMPTY678585.1600134.9078450.200078397.2000530600134.9000
22<NA>기타EMPTYEMPTY96618.8947.2000000000018.8947.200

Duplicate rows

Most frequently occurring

1. 건설폐기물 발생 및 처리현황Unnamed: 1Unnamed: 2Unnamed: 3# duplicates
0<NA><NA><NA><NA>2