Overview

Dataset statistics

Number of variables3
Number of observations25
Missing cells17
Missing cells (%)22.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory732.0 B
Average record size in memory29.3 B

Variable types

Text3

Dataset

Description대구광역시_재활용가능 자원(용품)정보_20230719
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15088187&dataSetDetailId=150881871e7a58673972e&provdMethod=FILE

Alerts

비해당품목 has 17 (68.0%) missing valuesMissing
세부품목 has unique valuesUnique

Reproduction

Analysis started2024-04-19 05:46:22.929117
Analysis finished2024-04-19 05:46:23.293822
Duration0.36 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

품목
Text

Distinct19
Distinct (%)76.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2024-04-19T14:46:23.411267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length14
Mean length7.04
Min length3

Characters and Unicode

Total characters176
Distinct characters67
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)64.0%

Sample

1st row골판지류
2nd row골판지 외 종이류
3rd row골판지 외 종이류
4th row골판지 외 종이류
5th row골판지 외 종이류
ValueCountFrequency (%)
골판지 5
 
10.9%
종이류 5
 
10.9%
5
 
10.9%
3
 
6.5%
소형가전제품 2
 
4.3%
이차전지류 2
 
4.3%
금속캔 2
 
4.3%
합성수지 2
 
4.3%
타이어 1
 
2.2%
전지류 1
 
2.2%
Other values (18) 18
39.1%
2024-04-19T14:46:23.729604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21
 
11.9%
15
 
8.5%
12
 
6.8%
10
 
5.7%
6
 
3.4%
6
 
3.4%
6
 
3.4%
5
 
2.8%
5
 
2.8%
4
 
2.3%
Other values (57) 86
48.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 149
84.7%
Space Separator 21
 
11.9%
Uppercase Letter 3
 
1.7%
Open Punctuation 1
 
0.6%
Close Punctuation 1
 
0.6%
Other Punctuation 1
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
10.1%
12
 
8.1%
10
 
6.7%
6
 
4.0%
6
 
4.0%
6
 
4.0%
5
 
3.4%
5
 
3.4%
4
 
2.7%
3
 
2.0%
Other values (50) 77
51.7%
Uppercase Letter
ValueCountFrequency (%)
P 1
33.3%
E 1
33.3%
T 1
33.3%
Space Separator
ValueCountFrequency (%)
21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Other Punctuation
ValueCountFrequency (%)
· 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 149
84.7%
Common 24
 
13.6%
Latin 3
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
10.1%
12
 
8.1%
10
 
6.7%
6
 
4.0%
6
 
4.0%
6
 
4.0%
5
 
3.4%
5
 
3.4%
4
 
2.7%
3
 
2.0%
Other values (50) 77
51.7%
Common
ValueCountFrequency (%)
21
87.5%
( 1
 
4.2%
) 1
 
4.2%
· 1
 
4.2%
Latin
ValueCountFrequency (%)
P 1
33.3%
E 1
33.3%
T 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 149
84.7%
ASCII 26
 
14.8%
None 1
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21
80.8%
( 1
 
3.8%
P 1
 
3.8%
E 1
 
3.8%
T 1
 
3.8%
) 1
 
3.8%
Hangul
ValueCountFrequency (%)
15
 
10.1%
12
 
8.1%
10
 
6.7%
6
 
4.0%
6
 
4.0%
6
 
4.0%
5
 
3.4%
5
 
3.4%
4
 
2.7%
3
 
2.0%
Other values (50) 77
51.7%
None
ValueCountFrequency (%)
· 1
100.0%

세부품목
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2024-04-19T14:46:23.895021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length133
Median length45
Mean length30.8
Min length3

Characters and Unicode

Total characters770
Distinct characters213
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row골판지상자 등
2nd row종이팩(살균팩+멸균팩)
3rd row신문지
4th row책자+노트+전단지 등
5th row종이컵
ValueCountFrequency (%)
5
 
6.4%
무색 2
 
2.6%
먹는샘물+음료 2
 
2.6%
리튬이차전지+보조배터리 2
 
2.6%
전자제품내 2
 
2.6%
투명한 2
 
2.6%
골판지상자 1
 
1.3%
농약용기+농촌폐비닐 1
 
1.3%
등)+비철금속(알미늄+스텐류 1
 
1.3%
고철(공기구+철사+못 1
 
1.3%
Other values (59) 59
75.6%
2024-04-19T14:46:24.232063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
+ 77
 
10.0%
54
 
7.0%
37
 
4.8%
22
 
2.9%
) 17
 
2.2%
( 17
 
2.2%
16
 
2.1%
15
 
1.9%
P 15
 
1.9%
14
 
1.8%
Other values (203) 486
63.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 555
72.1%
Math Symbol 77
 
10.0%
Space Separator 54
 
7.0%
Uppercase Letter 42
 
5.5%
Close Punctuation 18
 
2.3%
Open Punctuation 18
 
2.3%
Other Punctuation 3
 
0.4%
Decimal Number 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
 
6.7%
22
 
4.0%
16
 
2.9%
15
 
2.7%
14
 
2.5%
12
 
2.2%
11
 
2.0%
10
 
1.8%
8
 
1.4%
8
 
1.4%
Other values (184) 402
72.4%
Uppercase Letter
ValueCountFrequency (%)
P 15
35.7%
E 5
 
11.9%
T 5
 
11.9%
F 4
 
9.5%
L 4
 
9.5%
C 3
 
7.1%
M 2
 
4.8%
V 2
 
4.8%
S 2
 
4.8%
Close Punctuation
ValueCountFrequency (%)
) 17
94.4%
1
 
5.6%
Open Punctuation
ValueCountFrequency (%)
( 17
94.4%
1
 
5.6%
Other Punctuation
ValueCountFrequency (%)
/ 2
66.7%
· 1
33.3%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
3 1
33.3%
Math Symbol
ValueCountFrequency (%)
+ 77
100.0%
Space Separator
ValueCountFrequency (%)
54
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 555
72.1%
Common 173
 
22.5%
Latin 42
 
5.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
 
6.7%
22
 
4.0%
16
 
2.9%
15
 
2.7%
14
 
2.5%
12
 
2.2%
11
 
2.0%
10
 
1.8%
8
 
1.4%
8
 
1.4%
Other values (184) 402
72.4%
Common
ValueCountFrequency (%)
+ 77
44.5%
54
31.2%
) 17
 
9.8%
( 17
 
9.8%
/ 2
 
1.2%
1 2
 
1.2%
1
 
0.6%
1
 
0.6%
· 1
 
0.6%
3 1
 
0.6%
Latin
ValueCountFrequency (%)
P 15
35.7%
E 5
 
11.9%
T 5
 
11.9%
F 4
 
9.5%
L 4
 
9.5%
C 3
 
7.1%
M 2
 
4.8%
V 2
 
4.8%
S 2
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 554
71.9%
ASCII 212
 
27.5%
None 3
 
0.4%
Compat Jamo 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
+ 77
36.3%
54
25.5%
) 17
 
8.0%
( 17
 
8.0%
P 15
 
7.1%
E 5
 
2.4%
T 5
 
2.4%
F 4
 
1.9%
L 4
 
1.9%
C 3
 
1.4%
Other values (6) 11
 
5.2%
Hangul
ValueCountFrequency (%)
37
 
6.7%
22
 
4.0%
16
 
2.9%
15
 
2.7%
14
 
2.5%
12
 
2.2%
11
 
2.0%
10
 
1.8%
8
 
1.4%
8
 
1.4%
Other values (183) 401
72.4%
None
ValueCountFrequency (%)
1
33.3%
1
33.3%
· 1
33.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

비해당품목
Text

MISSING 

Distinct8
Distinct (%)100.0%
Missing17
Missing (%)68.0%
Memory size332.0 B
2024-04-19T14:46:24.411454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length124
Median length85
Mean length77.25
Min length11

Characters and Unicode

Total characters618
Distinct characters178
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)100.0%

Sample

1st row택배용 보냉 상자류 등 내부에 알루미늄박+비닐 등이 부착되어 종이와 분리되지 않는 상자류
2nd row양면이 코팅된 종이컵
3rd row알루미늄 등 금속이 박힌 복합소재 종이+택배전표+영수증 감열지+사진용지+종이호일+색지+사용한 화장지+방수 가공 포스터 등
4th row깨진 유리제품(신문지 등에 싸서종량제 봉투에 배출)+코팅 및 다양한 색상이들어간유리제품+내열유리제품+크리스탈 유리제품+판유리+조명기구용 유리류+사기·도자기류 등(특수규격마대 또는 대형폐기물 처리 등지자체 조례에 따라 배출)
5th row내용물이 남아있는 캔류(락카+페인트통 등)는 특수규격마대 등 지자체 조례에 따라 배출
ValueCountFrequency (%)
6
 
5.7%
배출 5
 
4.7%
따라 5
 
4.7%
조례에 5
 
4.7%
또는 5
 
4.7%
지자체 4
 
3.8%
대형폐기물 3
 
2.8%
상자류 2
 
1.9%
처리 2
 
1.9%
종량제봉투+특수규격마대 2
 
1.9%
Other values (67) 67
63.2%
2024-04-19T14:46:24.692586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
98
 
15.9%
+ 37
 
6.0%
16
 
2.6%
15
 
2.4%
13
 
2.1%
12
 
1.9%
10
 
1.6%
10
 
1.6%
10
 
1.6%
9
 
1.5%
Other values (168) 388
62.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 465
75.2%
Space Separator 98
 
15.9%
Math Symbol 37
 
6.0%
Other Punctuation 5
 
0.8%
Uppercase Letter 5
 
0.8%
Close Punctuation 4
 
0.6%
Open Punctuation 4
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
3.4%
15
 
3.2%
13
 
2.8%
12
 
2.6%
10
 
2.2%
10
 
2.2%
10
 
2.2%
9
 
1.9%
9
 
1.9%
8
 
1.7%
Other values (160) 353
75.9%
Uppercase Letter
ValueCountFrequency (%)
D 3
60.0%
V 1
 
20.0%
C 1
 
20.0%
Space Separator
ValueCountFrequency (%)
98
100.0%
Math Symbol
ValueCountFrequency (%)
+ 37
100.0%
Other Punctuation
ValueCountFrequency (%)
· 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 465
75.2%
Common 148
 
23.9%
Latin 5
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
3.4%
15
 
3.2%
13
 
2.8%
12
 
2.6%
10
 
2.2%
10
 
2.2%
10
 
2.2%
9
 
1.9%
9
 
1.9%
8
 
1.7%
Other values (160) 353
75.9%
Common
ValueCountFrequency (%)
98
66.2%
+ 37
 
25.0%
· 5
 
3.4%
) 4
 
2.7%
( 4
 
2.7%
Latin
ValueCountFrequency (%)
D 3
60.0%
V 1
 
20.0%
C 1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 465
75.2%
ASCII 148
 
23.9%
None 5
 
0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
98
66.2%
+ 37
 
25.0%
) 4
 
2.7%
( 4
 
2.7%
D 3
 
2.0%
V 1
 
0.7%
C 1
 
0.7%
Hangul
ValueCountFrequency (%)
16
 
3.4%
15
 
3.2%
13
 
2.8%
12
 
2.6%
10
 
2.2%
10
 
2.2%
10
 
2.2%
9
 
1.9%
9
 
1.9%
8
 
1.7%
Other values (160) 353
75.9%
None
ValueCountFrequency (%)
· 5
100.0%

Correlations

2024-04-19T14:46:24.773169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목세부품목비해당품목
품목1.0001.0001.000
세부품목1.0001.0001.000
비해당품목1.0001.0001.000

Missing values

2024-04-19T14:46:23.193017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T14:46:23.263273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

품목세부품목비해당품목
0골판지류골판지상자 등택배용 보냉 상자류 등 내부에 알루미늄박+비닐 등이 부착되어 종이와 분리되지 않는 상자류
1골판지 외 종이류종이팩(살균팩+멸균팩)<NA>
2골판지 외 종이류신문지<NA>
3골판지 외 종이류책자+노트+전단지 등<NA>
4골판지 외 종이류종이컵양면이 코팅된 종이컵
5골판지 외 종이류기타 종이류알루미늄 등 금속이 박힌 복합소재 종이+택배전표+영수증 감열지+사진용지+종이호일+색지+사용한 화장지+방수 가공 포스터 등
6유리병음료수병+기타 병류깨진 유리제품(신문지 등에 싸서종량제 봉투에 배출)+코팅 및 다양한 색상이들어간유리제품+내열유리제품+크리스탈 유리제품+판유리+조명기구용 유리류+사기·도자기류 등(특수규격마대 또는 대형폐기물 처리 등지자체 조례에 따라 배출)
7금속캔음료+주류캔+식료품캔<NA>
8금속캔기타캔류(부탄가스+살충제용기 등)내용물이 남아있는 캔류(락카+페인트통 등)는 특수규격마대 등 지자체 조례에 따라 배출
9무색 폴리에틸렌테레프탈레이트 (PET)병무색 투명한 먹는샘물+음료 폴리에틸렌테레프탈레이트(PET)병(나머지 폴리에틸렌테레프탈레이트(PET)병은 플라스틱류와 함께 배출)<NA>
품목세부품목비해당품목
15형광등직관형(FL)+환형(FCL)+안정기 내장형(CFL)+콤팩트형(FPL)+기타 수은을 함유한 조명제품<NA>
16고철류고철(공기구+철사+못 등)+비철금속(알미늄+스텐류 등)<NA>
17영농 폐기물류농약용기+농촌폐비닐<NA>
18소형가전제품 및 이차전지류휴대폰+카메라+MP3+PMP+게임기+전자사전+믹서기+네비게이션+스탠드+헤어드라이 등 「쓰레기수수료 종량제 시행지침」에 따른 대형폐기물에 해당하지 않는 제품 / 전자제품내 리튬이차전지+보조배터리<NA>
19소형가전제품 및 이차전지류전자제품내 리튬이차전지+보조배터리<NA>
20전자제품TV+냉장고+세탁기+에어컨+자동판매기+컴퓨터+프린터+복사기+팩시밀리+전기정수기+전기오븐+전자레인지+음식물처리기+식기건조기+전기비데+공기청정기+전기히터+오디오+전기밥솥+연수기+가습기+전기다리미+선풍기+믹서+청소기+비디오플레이어+이동전화단말기<NA>
21타이어소형+중형+대형<NA>
22자동차부품폐납산배터리<NA>
23식용유폐식용유<NA>
24윤활유윤활유(윤활유 용기 포함)<NA>