Overview

Dataset statistics

Number of variables5
Number of observations284
Missing cells35
Missing cells (%)2.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.2 KiB
Average record size in memory40.5 B

Variable types

Categorical1
Text4

Dataset

Description대구환경공단 환경자원사업소 폐기물 반입업체 현황 관련 데이터로 지역명, 회사명, 우편번호, 주소, 전화번호 정보를 제공합니다.
Author대구공공시설관리공단
URLhttps://www.data.go.kr/data/15113566/fileData.do

Alerts

우편번호 has 5 (1.8%) missing valuesMissing
주소 has 5 (1.8%) missing valuesMissing
전화번호 has 25 (8.8%) missing valuesMissing

Reproduction

Analysis started2024-04-29 23:09:11.261171
Analysis finished2024-04-29 23:09:12.090393
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역명
Categorical

Distinct9
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
달성군
63 
동구
61 
달서구
34 
북구
30 
서구
27 
Other values (4)
69 

Length

Max length3
Median length2
Mean length2.4366197
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남구
2nd row남구
3rd row남구
4th row남구
5th row남구

Common Values

ValueCountFrequency (%)
달성군 63
22.2%
동구 61
21.5%
달서구 34
12.0%
북구 30
10.6%
서구 27
9.5%
수성구 27
9.5%
남구 21
 
7.4%
중구 17
 
6.0%
기타 4
 
1.4%

Length

2024-04-30T08:09:12.153035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T08:09:12.274420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
달성군 63
22.2%
동구 61
21.5%
달서구 34
12.0%
북구 30
10.6%
서구 27
9.5%
수성구 27
9.5%
남구 21
 
7.4%
중구 17
 
6.0%
기타 4
 
1.4%
Distinct277
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-04-30T08:09:12.509569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length13
Mean length7.778169
Min length2

Characters and Unicode

Total characters2209
Distinct characters212
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique270 ?
Unique (%)95.1%

Sample

1st row대한환경(주)
2nd row(주)청기와환경-남구
3rd row더원생활환경(주)
4th row(주)크린시티(남구)
5th row화성에코 주식회사(남구)
ValueCountFrequency (%)
주식회사 9
 
2.9%
대구공공시설관리공단 7
 
2.3%
주)유창알앤씨 4
 
1.3%
태성환경 2
 
0.6%
서부사업소 2
 
0.6%
계명대학교 2
 
0.6%
주)현대환경 2
 
0.6%
주)코머스건설 2
 
0.6%
상수도사업본부 2
 
0.6%
주)일촌 2
 
0.6%
Other values (271) 275
89.0%
2024-04-30T08:09:12.920815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 189
 
8.6%
) 189
 
8.6%
172
 
7.8%
85
 
3.8%
81
 
3.7%
74
 
3.3%
63
 
2.9%
57
 
2.6%
51
 
2.3%
43
 
1.9%
Other values (202) 1205
54.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1754
79.4%
Open Punctuation 189
 
8.6%
Close Punctuation 189
 
8.6%
Decimal Number 27
 
1.2%
Space Separator 25
 
1.1%
Dash Punctuation 19
 
0.9%
Other Symbol 3
 
0.1%
Uppercase Letter 2
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
172
 
9.8%
85
 
4.8%
81
 
4.6%
74
 
4.2%
63
 
3.6%
57
 
3.2%
51
 
2.9%
43
 
2.5%
40
 
2.3%
39
 
2.2%
Other values (186) 1049
59.8%
Decimal Number
ValueCountFrequency (%)
1 6
22.2%
3 5
18.5%
6 4
14.8%
5 4
14.8%
7 3
11.1%
9 2
 
7.4%
2 2
 
7.4%
0 1
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
B 1
50.0%
T 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 189
100.0%
Close Punctuation
ValueCountFrequency (%)
) 189
100.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1757
79.5%
Common 450
 
20.4%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
172
 
9.8%
85
 
4.8%
81
 
4.6%
74
 
4.2%
63
 
3.6%
57
 
3.2%
51
 
2.9%
43
 
2.4%
40
 
2.3%
39
 
2.2%
Other values (187) 1052
59.9%
Common
ValueCountFrequency (%)
( 189
42.0%
) 189
42.0%
25
 
5.6%
- 19
 
4.2%
1 6
 
1.3%
3 5
 
1.1%
6 4
 
0.9%
5 4
 
0.9%
7 3
 
0.7%
9 2
 
0.4%
Other values (3) 4
 
0.9%
Latin
ValueCountFrequency (%)
B 1
50.0%
T 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1754
79.4%
ASCII 452
 
20.5%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 189
41.8%
) 189
41.8%
25
 
5.5%
- 19
 
4.2%
1 6
 
1.3%
3 5
 
1.1%
6 4
 
0.9%
5 4
 
0.9%
7 3
 
0.7%
9 2
 
0.4%
Other values (5) 6
 
1.3%
Hangul
ValueCountFrequency (%)
172
 
9.8%
85
 
4.8%
81
 
4.6%
74
 
4.2%
63
 
3.6%
57
 
3.2%
51
 
2.9%
43
 
2.5%
40
 
2.3%
39
 
2.2%
Other values (186) 1049
59.8%
None
ValueCountFrequency (%)
3
100.0%

우편번호
Text

MISSING 

Distinct158
Distinct (%)56.6%
Missing5
Missing (%)1.8%
Memory size2.3 KiB
2024-04-30T08:09:13.233577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.0645161
Min length5

Characters and Unicode

Total characters1413
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)35.1%

Sample

1st row42412
2nd row41126
3rd row42428
4th row42412
5th row42412
ValueCountFrequency (%)
41039 9
 
3.2%
42929 9
 
3.2%
42697 6
 
2.2%
42965 6
 
2.2%
42699 5
 
1.8%
42904 5
 
1.8%
42926 5
 
1.8%
42972 5
 
1.8%
42716 5
 
1.8%
41041 4
 
1.4%
Other values (148) 220
78.9%
2024-04-30T08:09:13.670148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 345
24.4%
2 234
16.6%
1 209
14.8%
9 140
9.9%
0 116
 
8.2%
7 93
 
6.6%
6 78
 
5.5%
8 70
 
5.0%
3 61
 
4.3%
5 58
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1404
99.4%
Dash Punctuation 9
 
0.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 345
24.6%
2 234
16.7%
1 209
14.9%
9 140
10.0%
0 116
 
8.3%
7 93
 
6.6%
6 78
 
5.6%
8 70
 
5.0%
3 61
 
4.3%
5 58
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1413
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 345
24.4%
2 234
16.6%
1 209
14.8%
9 140
9.9%
0 116
 
8.2%
7 93
 
6.6%
6 78
 
5.5%
8 70
 
5.0%
3 61
 
4.3%
5 58
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1413
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 345
24.4%
2 234
16.6%
1 209
14.8%
9 140
9.9%
0 116
 
8.2%
7 93
 
6.6%
6 78
 
5.5%
8 70
 
5.0%
3 61
 
4.3%
5 58
 
4.1%

주소
Text

MISSING 

Distinct221
Distinct (%)79.2%
Missing5
Missing (%)1.8%
Memory size2.3 KiB
2024-04-30T08:09:14.004974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length42
Mean length21.928315
Min length12

Characters and Unicode

Total characters6118
Distinct characters200
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique181 ?
Unique (%)64.9%

Sample

1st row대구광역시 남구 대명동 1897-4
2nd row대구광역시 동구 각산동 377-10 2
3rd row대구광역시 남구 봉덕동 503-2
4th row대구광역시 남구 대명동 1899-38
5th row대구광역시 남구 대명동 1899-38
ValueCountFrequency (%)
대구광역시 199
 
15.1%
대구 79
 
6.0%
달성군 61
 
4.6%
동구 60
 
4.6%
달서구 44
 
3.3%
서구 28
 
2.1%
북구 27
 
2.1%
수성구 26
 
2.0%
다사읍 19
 
1.4%
남구 16
 
1.2%
Other values (408) 755
57.5%
2024-04-30T08:09:14.553457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1261
20.6%
515
 
8.4%
315
 
5.1%
302
 
4.9%
1 255
 
4.2%
209
 
3.4%
- 207
 
3.4%
199
 
3.3%
199
 
3.3%
2 161
 
2.6%
Other values (190) 2495
40.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3340
54.6%
Decimal Number 1287
 
21.0%
Space Separator 1261
 
20.6%
Dash Punctuation 207
 
3.4%
Open Punctuation 9
 
0.1%
Close Punctuation 9
 
0.1%
Other Punctuation 3
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
515
15.4%
315
 
9.4%
302
 
9.0%
209
 
6.3%
199
 
6.0%
199
 
6.0%
112
 
3.4%
106
 
3.2%
103
 
3.1%
93
 
2.8%
Other values (171) 1187
35.5%
Decimal Number
ValueCountFrequency (%)
1 255
19.8%
2 161
12.5%
3 155
12.0%
0 146
11.3%
6 109
8.5%
5 104
8.1%
8 97
 
7.5%
7 95
 
7.4%
4 88
 
6.8%
9 77
 
6.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
33.3%
, 1
33.3%
. 1
33.3%
Uppercase Letter
ValueCountFrequency (%)
C 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
1261
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 207
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3340
54.6%
Common 2776
45.4%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
515
15.4%
315
 
9.4%
302
 
9.0%
209
 
6.3%
199
 
6.0%
199
 
6.0%
112
 
3.4%
106
 
3.2%
103
 
3.1%
93
 
2.8%
Other values (171) 1187
35.5%
Common
ValueCountFrequency (%)
1261
45.4%
1 255
 
9.2%
- 207
 
7.5%
2 161
 
5.8%
3 155
 
5.6%
0 146
 
5.3%
6 109
 
3.9%
5 104
 
3.7%
8 97
 
3.5%
7 95
 
3.4%
Other values (7) 186
 
6.7%
Latin
ValueCountFrequency (%)
C 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3340
54.6%
ASCII 2778
45.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1261
45.4%
1 255
 
9.2%
- 207
 
7.5%
2 161
 
5.8%
3 155
 
5.6%
0 146
 
5.3%
6 109
 
3.9%
5 104
 
3.7%
8 97
 
3.5%
7 95
 
3.4%
Other values (9) 188
 
6.8%
Hangul
ValueCountFrequency (%)
515
15.4%
315
 
9.4%
302
 
9.0%
209
 
6.3%
199
 
6.0%
199
 
6.0%
112
 
3.4%
106
 
3.2%
103
 
3.1%
93
 
2.8%
Other values (171) 1187
35.5%

전화번호
Text

MISSING 

Distinct202
Distinct (%)78.0%
Missing25
Missing (%)8.8%
Memory size2.3 KiB
2024-04-30T08:09:14.800786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.007722
Min length12

Characters and Unicode

Total characters3110
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique170 ?
Unique (%)65.6%

Sample

1st row053-656-4146
2nd row053-955-2273
3rd row053-754-8828
4th row053-959-7270
5th row053-424-8170
ValueCountFrequency (%)
053-355-7300 7
 
2.7%
053-526-4377 7
 
2.7%
053-290-3136 5
 
1.9%
053-984-7323 5
 
1.9%
053-593-3451 4
 
1.5%
053-765-4602 4
 
1.5%
053-592-1714 3
 
1.2%
053-959-7270 3
 
1.2%
053-635-2585 3
 
1.2%
053-592-9393 3
 
1.2%
Other values (192) 215
83.0%
2024-04-30T08:09:15.168809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 518
16.7%
5 512
16.5%
3 451
14.5%
0 426
13.7%
6 226
7.3%
2 206
 
6.6%
7 177
 
5.7%
4 163
 
5.2%
1 149
 
4.8%
9 144
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2592
83.3%
Dash Punctuation 518
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 512
19.8%
3 451
17.4%
0 426
16.4%
6 226
8.7%
2 206
7.9%
7 177
 
6.8%
4 163
 
6.3%
1 149
 
5.7%
9 144
 
5.6%
8 138
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 518
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3110
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 518
16.7%
5 512
16.5%
3 451
14.5%
0 426
13.7%
6 226
7.3%
2 206
 
6.6%
7 177
 
5.7%
4 163
 
5.2%
1 149
 
4.8%
9 144
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3110
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 518
16.7%
5 512
16.5%
3 451
14.5%
0 426
13.7%
6 226
7.3%
2 206
 
6.6%
7 177
 
5.7%
4 163
 
5.2%
1 149
 
4.8%
9 144
 
4.6%

Missing values

2024-04-30T08:09:11.874031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T08:09:11.960617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-30T08:09:12.042343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

지역명회사명우편번호주소전화번호
0남구대한환경(주)42412대구광역시 남구 대명동 1897-4053-656-4146
1남구(주)청기와환경-남구41126대구광역시 동구 각산동 377-10 2053-955-2273
2남구더원생활환경(주)42428대구광역시 남구 봉덕동 503-2053-754-8828
3남구(주)크린시티(남구)42412대구광역시 남구 대명동 1899-38053-959-7270
4남구화성에코 주식회사(남구)42412대구광역시 남구 대명동 1899-38053-424-8170
5남구(주)현대환경 (남구)41039대구광역시 동구 불로동 966053-959-7270
6남구(주)대한실업-남구42697대구광역시 달서구 갈산동 7053-593-3451
7남구관문상가42491대구 남구 대명동 1160번지 (5열16호)053-656-4011
8남구주식회사 빅토리(남구)42970대구광역시 달성군 옥포면 본리리 2375053-290-3136
9남구(자)이화산업41224대구광역시 동구 신암동 329-29053-952-5350
지역명회사명우편번호주소전화번호
274중구동아쇼핑41936대구광역시 중구 덕산동 53-3 동아쇼핑053-251-3212
275중구계명대학교 동산의료원41931대구광역시 중구 동산동 194 동산의료원053-250-7883
276중구대일주택관리41903대구광역시 중구 태평로1가 1-187 태평라이프 2층 250호053-423-5552
277중구덕수개발41918대구광역시 중구 포정동 16 5층 502호<NA>
278중구동아백화점쇼핑점41936대구 중구 덕산동 동아쇼핑센터건물 53-3053-251-3212
279중구중구청(환경과)41908대구광역시 중구 동인동2가 177-4 중구청053-661-2714
280기타지에스건설(주)<NA><NA><NA>
281기타(주)오케이산업<NA><NA>054-954-0575
282기타소백환경<NA><NA>054-635-2525
283기타주식회사 지구환경<NA><NA>055-264-6663