Overview

Dataset statistics

Number of variables11
Number of observations516
Missing cells36
Missing cells (%)0.6%
Duplicate rows13
Duplicate rows (%)2.5%
Total size in memory44.5 KiB
Average record size in memory88.3 B

Variable types

Categorical3
Text8

Dataset

Description"지자체에 신고된 사업장폐기물배출자 신고현황
Author강원도 속초시
URLhttps://www.data.go.kr/data/15081057/fileData.do

Alerts

데이터기준일 has constant value ""Constant
Dataset has 13 (2.5%) duplicate rowsDuplicates
구분 is highly overall correlated with 처리방법High correlation
처리방법 is highly overall correlated with 구분High correlation
처리방법 is highly imbalanced (51.1%)Imbalance
전화번호 has 36 (7.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 00:03:24.908630
Analysis finished2023-12-12 00:03:25.988619
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
지정
405 
사업장
111 

Length

Max length3
Median length2
Mean length2.2151163
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장
2nd row사업장
3rd row사업장
4th row사업장
5th row사업장

Common Values

ValueCountFrequency (%)
지정 405
78.5%
사업장 111
 
21.5%

Length

2023-12-12T09:03:26.046992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:03:26.122001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지정 405
78.5%
사업장 111
 
21.5%

상호
Text

Distinct211
Distinct (%)40.9%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T09:03:26.346688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length8.0562016
Min length3

Characters and Unicode

Total characters4157
Distinct characters265
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)8.7%

Sample

1st row속초시수산업협동조합
2nd row속초시 시설관리공단
3rd row(주)대양환경기술
4th row(주)대양환경기술
5th row(주)대양환경기술
ValueCountFrequency (%)
속초시 16
 
2.8%
한화호텔앤드리조트(주 15
 
2.6%
하수도사업소 11
 
1.9%
강원도자동차전문정비사업조합속초지회 10
 
1.7%
한솔이엠이주식회사 9
 
1.6%
의료법인 9
 
1.6%
신의의료재단(속초우리요양병원 9
 
1.6%
속초정요양병원 9
 
1.6%
영동자동차폐차장 8
 
1.4%
주식회사한국동물혈액은행 7
 
1.2%
Other values (213) 472
82.1%
2023-12-12T09:03:26.723749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
229
 
5.5%
193
 
4.6%
169
 
4.1%
112
 
2.7%
112
 
2.7%
94
 
2.3%
) 94
 
2.3%
( 94
 
2.3%
94
 
2.3%
89
 
2.1%
Other values (255) 2877
69.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3891
93.6%
Close Punctuation 94
 
2.3%
Open Punctuation 94
 
2.3%
Space Separator 59
 
1.4%
Lowercase Letter 10
 
0.2%
Uppercase Letter 6
 
0.1%
Decimal Number 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
229
 
5.9%
193
 
5.0%
169
 
4.3%
112
 
2.9%
112
 
2.9%
94
 
2.4%
94
 
2.4%
89
 
2.3%
62
 
1.6%
62
 
1.6%
Other values (247) 2675
68.7%
Lowercase Letter
ValueCountFrequency (%)
b 5
50.0%
j 5
50.0%
Uppercase Letter
ValueCountFrequency (%)
M 3
50.0%
D 3
50.0%
Close Punctuation
ValueCountFrequency (%)
) 94
100.0%
Open Punctuation
ValueCountFrequency (%)
( 94
100.0%
Space Separator
ValueCountFrequency (%)
59
100.0%
Decimal Number
ValueCountFrequency (%)
2 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3891
93.6%
Common 250
 
6.0%
Latin 16
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
229
 
5.9%
193
 
5.0%
169
 
4.3%
112
 
2.9%
112
 
2.9%
94
 
2.4%
94
 
2.4%
89
 
2.3%
62
 
1.6%
62
 
1.6%
Other values (247) 2675
68.7%
Common
ValueCountFrequency (%)
) 94
37.6%
( 94
37.6%
59
23.6%
2 3
 
1.2%
Latin
ValueCountFrequency (%)
b 5
31.2%
j 5
31.2%
M 3
18.8%
D 3
18.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3891
93.6%
ASCII 266
 
6.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
229
 
5.9%
193
 
5.0%
169
 
4.3%
112
 
2.9%
112
 
2.9%
94
 
2.4%
94
 
2.4%
89
 
2.3%
62
 
1.6%
62
 
1.6%
Other values (247) 2675
68.7%
ASCII
ValueCountFrequency (%)
) 94
35.3%
( 94
35.3%
59
22.2%
b 5
 
1.9%
j 5
 
1.9%
M 3
 
1.1%
2 3
 
1.1%
D 3
 
1.1%
Distinct64
Distinct (%)12.4%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T09:03:26.944619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length81
Mean length11.755814
Min length2

Characters and Unicode

Total characters6066
Distinct characters201
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)5.0%

Sample

1st row폐어망
2nd row생활폐기물 소각시설 바닥재
3rd row그 밖의 폐수처리오니
4th row그 밖의 폐기물
5th row하수처리오니
ValueCountFrequency (%)
일반의료폐기물 135
 
12.3%
손상성폐기물 132
 
12.0%
64
 
5.8%
밖의 64
 
5.8%
제외한다 31
 
2.8%
말한다 27
 
2.5%
조직물류폐기물(태반을 23
 
2.1%
재활용하는 23
 
2.1%
경우는 23
 
2.1%
20
 
1.8%
Other values (137) 560
50.8%
2023-12-12T09:03:27.546232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
586
 
9.7%
517
 
8.5%
483
 
8.0%
386
 
6.4%
242
 
4.0%
165
 
2.7%
161
 
2.7%
158
 
2.6%
144
 
2.4%
136
 
2.2%
Other values (191) 3088
50.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5190
85.6%
Space Separator 586
 
9.7%
Close Punctuation 79
 
1.3%
Open Punctuation 79
 
1.3%
Lowercase Letter 60
 
1.0%
Decimal Number 42
 
0.7%
Connector Punctuation 30
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
517
 
10.0%
483
 
9.3%
386
 
7.4%
242
 
4.7%
165
 
3.2%
161
 
3.1%
158
 
3.0%
144
 
2.8%
136
 
2.6%
132
 
2.5%
Other values (174) 2666
51.4%
Lowercase Letter
ValueCountFrequency (%)
e 20
33.3%
s 10
16.7%
r 10
16.7%
g 10
16.7%
a 10
16.7%
Decimal Number
ValueCountFrequency (%)
2 20
47.6%
0 10
23.8%
1 6
 
14.3%
8 6
 
14.3%
Close Punctuation
ValueCountFrequency (%)
) 63
79.7%
] 10
 
12.7%
6
 
7.6%
Open Punctuation
ValueCountFrequency (%)
( 63
79.7%
[ 10
 
12.7%
6
 
7.6%
Space Separator
ValueCountFrequency (%)
586
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5190
85.6%
Common 816
 
13.5%
Latin 60
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
517
 
10.0%
483
 
9.3%
386
 
7.4%
242
 
4.7%
165
 
3.2%
161
 
3.1%
158
 
3.0%
144
 
2.8%
136
 
2.6%
132
 
2.5%
Other values (174) 2666
51.4%
Common
ValueCountFrequency (%)
586
71.8%
) 63
 
7.7%
( 63
 
7.7%
_ 30
 
3.7%
2 20
 
2.5%
] 10
 
1.2%
[ 10
 
1.2%
0 10
 
1.2%
6
 
0.7%
1 6
 
0.7%
Other values (2) 12
 
1.5%
Latin
ValueCountFrequency (%)
e 20
33.3%
s 10
16.7%
r 10
16.7%
g 10
16.7%
a 10
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5144
84.8%
ASCII 864
 
14.2%
Compat Jamo 46
 
0.8%
None 12
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
586
67.8%
) 63
 
7.3%
( 63
 
7.3%
_ 30
 
3.5%
2 20
 
2.3%
e 20
 
2.3%
s 10
 
1.2%
] 10
 
1.2%
[ 10
 
1.2%
r 10
 
1.2%
Other values (5) 42
 
4.9%
Hangul
ValueCountFrequency (%)
517
 
10.1%
483
 
9.4%
386
 
7.5%
242
 
4.7%
165
 
3.2%
161
 
3.1%
158
 
3.1%
144
 
2.8%
136
 
2.6%
132
 
2.6%
Other values (173) 2620
50.9%
Compat Jamo
ValueCountFrequency (%)
46
100.0%
None
ValueCountFrequency (%)
6
50.0%
6
50.0%
Distinct206
Distinct (%)39.9%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T09:03:27.754095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters6192
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)7.4%

Sample

1st row227-82-00028
2nd row227-82-06089
3rd row611-81-16543
4th row611-81-16543
5th row611-81-16543
ValueCountFrequency (%)
101-81-30747 15
 
2.9%
227-83-00724 11
 
2.1%
227-04-40527 10
 
1.9%
227-90-21386 9
 
1.7%
104-82-13384 9
 
1.7%
314-81-44231 9
 
1.7%
227-81-04065 7
 
1.4%
119-87-07875 7
 
1.4%
206-86-50913 7
 
1.4%
227-81-14073 7
 
1.4%
Other values (196) 425
82.4%
2023-12-12T09:03:28.060613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1032
16.7%
2 966
15.6%
0 838
13.5%
1 616
9.9%
7 597
9.6%
8 466
7.5%
9 429
6.9%
3 336
 
5.4%
6 328
 
5.3%
4 309
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5160
83.3%
Dash Punctuation 1032
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 966
18.7%
0 838
16.2%
1 616
11.9%
7 597
11.6%
8 466
9.0%
9 429
8.3%
3 336
 
6.5%
6 328
 
6.4%
4 309
 
6.0%
5 275
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 1032
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6192
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1032
16.7%
2 966
15.6%
0 838
13.5%
1 616
9.9%
7 597
9.6%
8 466
7.5%
9 429
6.9%
3 336
 
5.4%
6 328
 
5.3%
4 309
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6192
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1032
16.7%
2 966
15.6%
0 838
13.5%
1 616
9.9%
7 597
9.6%
8 466
7.5%
9 429
6.9%
3 336
 
5.4%
6 328
 
5.3%
4 309
 
5.0%

전화번호
Text

MISSING 

Distinct183
Distinct (%)38.1%
Missing36
Missing (%)7.0%
Memory size4.2 KiB
2023-12-12T09:03:28.281222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length10.529167
Min length2

Characters and Unicode

Total characters5054
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)9.0%

Sample

1st row'033-630-7711
2nd row'0336306254
3rd row'033-632-9366
4th row'033-630-7711
5th row'0336306037
ValueCountFrequency (%)
43
 
8.6%
0336305556 15
 
3.0%
033635 11
 
2.2%
2562 11
 
2.2%
033 11
 
2.2%
0336319674 10
 
2.0%
033-637-0757 9
 
1.8%
033-638-0060 9
 
1.8%
0336336271 9
 
1.8%
033-631-8575 7
 
1.4%
Other values (175) 367
73.1%
2023-12-12T09:03:28.631798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 1430
28.3%
0 701
13.9%
6 650
12.9%
' 480
 
9.5%
5 319
 
6.3%
7 270
 
5.3%
2 242
 
4.8%
1 225
 
4.5%
8 219
 
4.3%
- 206
 
4.1%
Other values (3) 312
 
6.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4303
85.1%
Other Punctuation 480
 
9.5%
Dash Punctuation 206
 
4.1%
Space Separator 65
 
1.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 1430
33.2%
0 701
16.3%
6 650
15.1%
5 319
 
7.4%
7 270
 
6.3%
2 242
 
5.6%
1 225
 
5.2%
8 219
 
5.1%
9 135
 
3.1%
4 112
 
2.6%
Other Punctuation
ValueCountFrequency (%)
' 480
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 206
100.0%
Space Separator
ValueCountFrequency (%)
65
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5054
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 1430
28.3%
0 701
13.9%
6 650
12.9%
' 480
 
9.5%
5 319
 
6.3%
7 270
 
5.3%
2 242
 
4.8%
1 225
 
4.5%
8 219
 
4.3%
- 206
 
4.1%
Other values (3) 312
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5054
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 1430
28.3%
0 701
13.9%
6 650
12.9%
' 480
 
9.5%
5 319
 
6.3%
7 270
 
5.3%
2 242
 
4.8%
1 225
 
4.5%
8 219
 
4.3%
- 206
 
4.1%
Other values (3) 312
 
6.2%
Distinct92
Distinct (%)17.8%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T09:03:28.856107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length7
Mean length6.5736434
Min length2

Characters and Unicode

Total characters3392
Distinct characters163
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)10.5%

Sample

1st row지에스그린텍
2nd row자가운반
3rd row대게환경피엔씨
4th row청도환경주식회사
5th row금강물류주식회사
ValueCountFrequency (%)
태광실업(주 174
33.3%
태광실업 111
21.3%
주)도시환경 24
 
4.6%
주)제아이씨 23
 
4.4%
주)한국라이신화공사 17
 
3.3%
중부신대한정유산업(주 11
 
2.1%
합)부흥환경 7
 
1.3%
하나그린(주 6
 
1.1%
자가운반 6
 
1.1%
현대특수사료(주 5
 
1.0%
Other values (85) 138
26.4%
2023-12-12T09:03:29.201027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 347
 
10.2%
) 347
 
10.2%
337
 
9.9%
315
 
9.3%
285
 
8.4%
285
 
8.4%
285
 
8.4%
61
 
1.8%
57
 
1.7%
56
 
1.7%
Other values (153) 1017
30.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2672
78.8%
Open Punctuation 350
 
10.3%
Close Punctuation 350
 
10.3%
Space Separator 8
 
0.2%
Decimal Number 6
 
0.2%
Other Punctuation 5
 
0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
337
 
12.6%
315
 
11.8%
285
 
10.7%
285
 
10.7%
285
 
10.7%
61
 
2.3%
57
 
2.1%
56
 
2.1%
37
 
1.4%
36
 
1.3%
Other values (140) 918
34.4%
Decimal Number
ValueCountFrequency (%)
2 1
16.7%
6 1
16.7%
0 1
16.7%
5 1
16.7%
3 1
16.7%
9 1
16.7%
Open Punctuation
ValueCountFrequency (%)
( 347
99.1%
[ 3
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 347
99.1%
] 3
 
0.9%
Space Separator
ValueCountFrequency (%)
8
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2672
78.8%
Common 720
 
21.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
337
 
12.6%
315
 
11.8%
285
 
10.7%
285
 
10.7%
285
 
10.7%
61
 
2.3%
57
 
2.1%
56
 
2.1%
37
 
1.4%
36
 
1.3%
Other values (140) 918
34.4%
Common
ValueCountFrequency (%)
( 347
48.2%
) 347
48.2%
8
 
1.1%
/ 5
 
0.7%
[ 3
 
0.4%
] 3
 
0.4%
2 1
 
0.1%
6 1
 
0.1%
0 1
 
0.1%
5 1
 
0.1%
Other values (3) 3
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2672
78.8%
ASCII 720
 
21.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 347
48.2%
) 347
48.2%
8
 
1.1%
/ 5
 
0.7%
[ 3
 
0.4%
] 3
 
0.4%
2 1
 
0.1%
6 1
 
0.1%
0 1
 
0.1%
5 1
 
0.1%
Other values (3) 3
 
0.4%
Hangul
ValueCountFrequency (%)
337
 
12.6%
315
 
11.8%
285
 
10.7%
285
 
10.7%
285
 
10.7%
61
 
2.3%
57
 
2.1%
56
 
2.1%
37
 
1.4%
36
 
1.3%
Other values (140) 918
34.4%
Distinct97
Distinct (%)18.8%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T09:03:29.488872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length7
Mean length7.4496124
Min length2

Characters and Unicode

Total characters3844
Distinct characters157
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)11.0%

Sample

1st row지에스그린텍
2nd row속초시 환경자원사업소
3rd row쌍용양회공업(주)동해공장
4th row속초시 환경자원사업소
5th row주식회사와이에스텍
ValueCountFrequency (%)
주)삼우그린 221
42.0%
주)이에스지 59
 
11.2%
도시환경(주 41
 
7.8%
속초시환경자원사업소 18
 
3.4%
주)한국라이신화공사 17
 
3.2%
환경자원사업소 10
 
1.9%
속초시 9
 
1.7%
성림유화(주 8
 
1.5%
에코시스템(주)구미 7
 
1.3%
현대특수사료(주 5
 
1.0%
Other values (88) 131
24.9%
2023-12-12T09:03:29.925635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
441
 
11.5%
( 440
 
11.4%
) 438
 
11.4%
234
 
6.1%
231
 
6.0%
228
 
5.9%
228
 
5.9%
102
 
2.7%
82
 
2.1%
80
 
2.1%
Other values (147) 1340
34.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2956
76.9%
Open Punctuation 440
 
11.4%
Close Punctuation 438
 
11.4%
Space Separator 10
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
441
 
14.9%
234
 
7.9%
231
 
7.8%
228
 
7.7%
228
 
7.7%
102
 
3.5%
82
 
2.8%
80
 
2.7%
79
 
2.7%
77
 
2.6%
Other values (144) 1174
39.7%
Open Punctuation
ValueCountFrequency (%)
( 440
100.0%
Close Punctuation
ValueCountFrequency (%)
) 438
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2956
76.9%
Common 888
 
23.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
441
 
14.9%
234
 
7.9%
231
 
7.8%
228
 
7.7%
228
 
7.7%
102
 
3.5%
82
 
2.8%
80
 
2.7%
79
 
2.7%
77
 
2.6%
Other values (144) 1174
39.7%
Common
ValueCountFrequency (%)
( 440
49.5%
) 438
49.3%
10
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2956
76.9%
ASCII 888
 
23.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
441
 
14.9%
234
 
7.9%
231
 
7.8%
228
 
7.7%
228
 
7.7%
102
 
3.5%
82
 
2.8%
80
 
2.7%
79
 
2.7%
77
 
2.6%
Other values (144) 1174
39.7%
ASCII
ValueCountFrequency (%)
( 440
49.5%
) 438
49.3%
10
 
1.1%

처리방법
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct27
Distinct (%)5.2%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
중간처분(일반소각)
333 
재활용(원료 제조)
38 
재활용(중간가공폐기물 제조)
 
19
매립(민간관리형매립시설)
 
18
매립(지방자치단체매립시설)
 
17
Other values (22)
91 

Length

Max length17
Median length10
Mean length10.744186
Min length7

Unique

Unique5 ?
Unique (%)1.0%

Sample

1st row재활용(중간가공폐기물 제조)
2nd row매립(지방자치단체매립시설)
3rd row재활용(직접 제품제조)
4th row매립(지방자치단체매립시설)
5th row매립(민간관리형매립시설)

Common Values

ValueCountFrequency (%)
중간처분(일반소각) 333
64.5%
재활용(원료 제조) 38
 
7.4%
재활용(중간가공폐기물 제조) 19
 
3.7%
매립(민간관리형매립시설) 18
 
3.5%
매립(지방자치단체매립시설) 17
 
3.3%
중간처분(지방자치단체소각) 16
 
3.1%
재활용(연료·고형연료제품 제조) 13
 
2.5%
재활용(직접 제품제조) 10
 
1.9%
재활용(기타) 7
 
1.4%
중간처분(고온소각) 6
 
1.2%
Other values (17) 39
 
7.6%

Length

2023-12-12T09:03:30.049184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중간처분(일반소각 333
54.8%
제조 70
 
11.5%
재활용(원료 38
 
6.2%
재활용(중간가공폐기물 19
 
3.1%
매립(민간관리형매립시설 18
 
3.0%
매립(지방자치단체매립시설 17
 
2.8%
중간처분(지방자치단체소각 16
 
2.6%
재활용(연료·고형연료제품 13
 
2.1%
재활용(직접 12
 
2.0%
제품제조 10
 
1.6%
Other values (21) 62
 
10.2%
Distinct191
Distinct (%)37.0%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T09:03:30.331722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length36
Mean length24.193798
Min length1

Characters and Unicode

Total characters12484
Distinct characters154
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)7.8%

Sample

1st row강원도 속초시 설악금강대교로 67_ 속초시수협 산지거점유통센터 (청호동)
2nd row강원도 속초시 방축길 60_ 속초시환경자원사업소 (대포동)
3rd row강원도 속초시 농공단지2길 5 (대포동)
4th row강원도 속초시 농공단지2길 5 (대포동)
5th row강원도 속초시 농공단지2길 5 (대포동)
ValueCountFrequency (%)
강원도 509
18.4%
속초시 503
18.1%
교동 146
 
5.3%
중앙로 115
 
4.1%
동해대로 107
 
3.9%
조양동 100
 
3.6%
대포동 63
 
2.3%
청학동 41
 
1.5%
금호동 31
 
1.1%
장사동 26
 
0.9%
Other values (272) 1132
40.8%
2023-12-12T09:03:30.787205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2269
18.2%
676
 
5.4%
554
 
4.4%
526
 
4.2%
522
 
4.2%
( 521
 
4.2%
) 521
 
4.2%
520
 
4.2%
520
 
4.2%
510
 
4.1%
Other values (144) 5345
42.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7036
56.4%
Space Separator 2269
 
18.2%
Decimal Number 1908
 
15.3%
Open Punctuation 521
 
4.2%
Close Punctuation 521
 
4.2%
Connector Punctuation 155
 
1.2%
Dash Punctuation 54
 
0.4%
Uppercase Letter 18
 
0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
676
 
9.6%
554
 
7.9%
526
 
7.5%
522
 
7.4%
520
 
7.4%
520
 
7.4%
510
 
7.2%
406
 
5.8%
220
 
3.1%
171
 
2.4%
Other values (125) 2411
34.3%
Decimal Number
ValueCountFrequency (%)
1 407
21.3%
2 266
13.9%
4 253
13.3%
3 233
12.2%
0 166
8.7%
6 145
 
7.6%
7 126
 
6.6%
9 120
 
6.3%
5 109
 
5.7%
8 83
 
4.4%
Uppercase Letter
ValueCountFrequency (%)
A 9
50.0%
E 7
38.9%
B 2
 
11.1%
Space Separator
ValueCountFrequency (%)
2269
100.0%
Open Punctuation
ValueCountFrequency (%)
( 521
100.0%
Close Punctuation
ValueCountFrequency (%)
) 521
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 155
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 54
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7036
56.4%
Common 5428
43.5%
Latin 20
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
676
 
9.6%
554
 
7.9%
526
 
7.5%
522
 
7.4%
520
 
7.4%
520
 
7.4%
510
 
7.2%
406
 
5.8%
220
 
3.1%
171
 
2.4%
Other values (125) 2411
34.3%
Common
ValueCountFrequency (%)
2269
41.8%
( 521
 
9.6%
) 521
 
9.6%
1 407
 
7.5%
2 266
 
4.9%
4 253
 
4.7%
3 233
 
4.3%
0 166
 
3.1%
_ 155
 
2.9%
6 145
 
2.7%
Other values (5) 492
 
9.1%
Latin
ValueCountFrequency (%)
A 9
45.0%
E 7
35.0%
b 2
 
10.0%
B 2
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7036
56.4%
ASCII 5448
43.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2269
41.6%
( 521
 
9.6%
) 521
 
9.6%
1 407
 
7.5%
2 266
 
4.9%
4 253
 
4.6%
3 233
 
4.3%
0 166
 
3.0%
_ 155
 
2.8%
6 145
 
2.7%
Other values (9) 512
 
9.4%
Hangul
ValueCountFrequency (%)
676
 
9.6%
554
 
7.9%
526
 
7.5%
522
 
7.4%
520
 
7.4%
520
 
7.4%
510
 
7.2%
406
 
5.8%
220
 
3.1%
171
 
2.4%
Other values (125) 2411
34.3%
Distinct152
Distinct (%)29.5%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T09:03:30.982880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length14
Min length14

Characters and Unicode

Total characters7224
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)10.9%

Sample

1st row2020 년04 월27 일
2nd row2020 년04 월17 일
3rd row2019 년11 월18 일
4th row2019 년05 월31 일
5th row2019 년05 월31 일
ValueCountFrequency (%)
516
25.0%
년02 168
 
8.1%
2006 133
 
6.4%
월28 70
 
3.4%
년01 64
 
3.1%
월29 61
 
3.0%
2013 45
 
2.2%
월24 44
 
2.1%
2015 42
 
2.0%
년07 41
 
2.0%
Other values (55) 880
42.6%
2023-12-12T09:03:31.323230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1548
21.4%
0 1324
18.3%
2 1059
14.7%
1 637
8.8%
516
 
7.1%
516
 
7.1%
516
 
7.1%
6 230
 
3.2%
8 167
 
2.3%
9 159
 
2.2%
Other values (4) 552
 
7.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4128
57.1%
Space Separator 1548
 
21.4%
Other Letter 1548
 
21.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1324
32.1%
2 1059
25.7%
1 637
15.4%
6 230
 
5.6%
8 167
 
4.0%
9 159
 
3.9%
4 150
 
3.6%
7 144
 
3.5%
5 130
 
3.1%
3 128
 
3.1%
Other Letter
ValueCountFrequency (%)
516
33.3%
516
33.3%
516
33.3%
Space Separator
ValueCountFrequency (%)
1548
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5676
78.6%
Hangul 1548
 
21.4%

Most frequent character per script

Common
ValueCountFrequency (%)
1548
27.3%
0 1324
23.3%
2 1059
18.7%
1 637
11.2%
6 230
 
4.1%
8 167
 
2.9%
9 159
 
2.8%
4 150
 
2.6%
7 144
 
2.5%
5 130
 
2.3%
Hangul
ValueCountFrequency (%)
516
33.3%
516
33.3%
516
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5676
78.6%
Hangul 1548
 
21.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1548
27.3%
0 1324
23.3%
2 1059
18.7%
1 637
11.2%
6 230
 
4.1%
8 167
 
2.9%
9 159
 
2.8%
4 150
 
2.6%
7 144
 
2.5%
5 130
 
2.3%
Hangul
ValueCountFrequency (%)
516
33.3%
516
33.3%
516
33.3%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2021-05-03
516 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-05-03
2nd row2021-05-03
3rd row2021-05-03
4th row2021-05-03
5th row2021-05-03

Common Values

ValueCountFrequency (%)
2021-05-03 516
100.0%

Length

2023-12-12T09:03:31.446654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:03:31.523314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-05-03 516
100.0%

Correlations

2023-12-12T09:03:31.581329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분폐기물 종류운반자처리업소명처리방법
구분1.0001.0000.9991.0000.934
폐기물 종류1.0001.0000.9970.9960.981
운반자0.9990.9971.0000.9990.994
처리업소명1.0000.9960.9991.0000.997
처리방법0.9340.9810.9940.9971.000
2023-12-12T09:03:31.667287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리방법구분
처리방법1.0000.859
구분0.8591.000
2023-12-12T09:03:31.740820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분처리방법
구분1.0000.859
처리방법0.8591.000

Missing values

2023-12-12T09:03:25.821718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:03:25.940962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분상호폐기물 종류사업자등록번호전화번호운반자처리업소명처리방법사업장도로명주소신고일데이터기준일
0사업장속초시수산업협동조합폐어망227-82-00028'033-630-7711지에스그린텍지에스그린텍재활용(중간가공폐기물 제조)강원도 속초시 설악금강대교로 67_ 속초시수협 산지거점유통센터 (청호동)2020 년04 월27 일2021-05-03
1사업장속초시 시설관리공단생활폐기물 소각시설 바닥재227-82-06089'0336306254자가운반속초시 환경자원사업소매립(지방자치단체매립시설)강원도 속초시 방축길 60_ 속초시환경자원사업소 (대포동)2020 년04 월17 일2021-05-03
2사업장(주)대양환경기술그 밖의 폐수처리오니611-81-16543<NA>대게환경피엔씨쌍용양회공업(주)동해공장재활용(직접 제품제조)강원도 속초시 농공단지2길 5 (대포동)2019 년11 월18 일2021-05-03
3사업장(주)대양환경기술그 밖의 폐기물611-81-16543<NA>청도환경주식회사속초시 환경자원사업소매립(지방자치단체매립시설)강원도 속초시 농공단지2길 5 (대포동)2019 년05 월31 일2021-05-03
4사업장(주)대양환경기술하수처리오니611-81-16543<NA>금강물류주식회사주식회사와이에스텍매립(민간관리형매립시설)강원도 속초시 농공단지2길 5 (대포동)2019 년05 월31 일2021-05-03
5사업장(주)대양환경기술하수처리오니611-81-16543<NA>대안환경(주)한라시멘트주식회사재활용(직접 제품제조)강원도 속초시 농공단지2길 5 (대포동)2019 년05 월31 일2021-05-03
6사업장(주)대양환경기술하수처리오니611-81-16543<NA>금강물류주식회사(주)케이엠그린구미지점매립(민간관리형매립시설)강원도 속초시 농공단지2길 5 (대포동)2019 년05 월31 일2021-05-03
7사업장성진상사(주)수산물가공잔재물227-81-11173'033-632-9366속초물산속초물산중간처분(증발.농축)강원도 속초시 농공단지길 107 (대포동)2019 년05 월23 일2021-05-03
8사업장속초시수산업협동조합폐어망227-82-00028'033-630-7711거성거성중간처분(압축)강원도 속초시 설악금강대교로 67_ 속초시수협 산지거점유통센터 (청호동)2019 년05 월14 일2021-05-03
9사업장(주)봉포머구리집그 밖의 폐기물227-81-21031<NA>청도환경환경자원사업소중간처분(지방자치단체소각)강원도 속초시 영랑해안길 223 (영랑동)2018 년09 월13 일2021-05-03
구분상호폐기물 종류사업자등록번호전화번호운반자처리업소명처리방법사업장도로명주소신고일데이터기준일
506지정김봉수외과의원일반의료폐기물227-96-01043'0336336633태광실업(주)(주)삼우그린중간처분(일반소각)강원도 속초시 중앙로 108-1 (청학동)2006 년02 월28 일2021-05-03
507지정김봉수외과의원손상성폐기물227-96-01043'0336336633태광실업(주)(주)삼우그린중간처분(일반소각)강원도 속초시 중앙로 108-1 (청학동)2006 년02 월28 일2021-05-03
508지정이좋은세상 치과일반의료폐기물227-90-28550'0336382822태광실업(주)삼우그린중간처분(일반소각)강원도 속초시 중앙로 20_ b동 2층 (교동)2006 년02 월24 일2021-05-03
509지정이좋은세상 치과손상성폐기물227-90-28550'0336382822태광실업(주)삼우그린중간처분(일반소각)강원도 속초시 중앙로 20_ b동 2층 (교동)2006 년02 월24 일2021-05-03
510지정서울피부과일반의료폐기물227-90-38437'태광실업(주)(주)삼우그린중간처분(일반소각)강원도 속초시 중앙로 125-1 (금호동)2006 년02 월23 일2021-05-03
511지정서울피부과손상성폐기물227-90-38437'태광실업(주)(주)삼우그린중간처분(일반소각)강원도 속초시 중앙로 125-1 (금호동)2006 년02 월23 일2021-05-03
512지정세란의원손상성폐기물227-90-16020'0336368114태광실업(주)(주)삼우그린중간처분(일반소각)강원도 속초시 동해대로 4024_ 307호 (조양동)2006 년02 월28 일2021-05-03
513지정세란의원일반의료폐기물227-90-16020'0336368114태광실업(주)(주)삼우그린중간처분(일반소각)강원도 속초시 동해대로 4024_ 307호 (조양동)2006 년02 월28 일2021-05-03
514지정미르이비인후과일반의료폐기물227-90-42697'0336337573태광실업(주)(주)삼우그린중간처분(일반소각)강원도 속초시 중앙로 20_ B동 203호 (교동_ 르네상스 속초빌딩)2004 년06 월08 일2021-05-03
515지정미르이비인후과손상성폐기물227-90-42697'0336337573태광실업(주)(주)삼우그린중간처분(일반소각)강원도 속초시 중앙로 20_ B동 203호 (교동_ 르네상스 속초빌딩)2004 년06 월08 일2021-05-03

Duplicate rows

Most frequently occurring

구분상호폐기물 종류사업자등록번호전화번호운반자처리업소명처리방법사업장도로명주소신고일데이터기준일# duplicates
0사업장속초시 하수도사업소하수처리오니227-83-00724'033635 2562자가운반속초시환경자원사업소중간처분(지방자치단체소각)강원도 속초시 해오름로 99 (대포동_(해오름길 86호))1999 년12 월17 일2021-05-032
1지정강원도동물위생시험소북부지소병리계폐기물227-83-03167'0336348534태광실업(주)(주)삼우그린중간처분(일반소각)강원도 속초시 장성천길 15-14 (장사동)2006 년02 월23 일2021-05-032
2지정강원도속초양양교육지원청연구ㆍ검사용 폐시약227-83-00011'033-639-6025(주)드림에코성림유화(주)중간처분(고온소각)강원도 속초시 미시령로 3336 (교동)2017 년11 월08 일2021-05-032
3지정바른정형외과조직물류폐기물(태반을 재활용하는 경우는 제외한다)227-90-59562'0336336119(주)도시환경도시환경(주)중간처분(일반소각)강원도 속초시 청대로 347 (교동)2008 년06 월04 일2021-05-032
4지정서울노앤기내과혈액오염폐기물110-99-86221<NA>(주)도시환경도시환경(주)중간처분(일반소각)강원도 속초시 중앙로 20_ A동 (교동_ 르네상스빌딩)2016 년08 월01 일2021-05-032
5지정속초시보건소병리계폐기물227-83-00419'033-639-2550태광실업(주)삼우그린중간처분(일반소각)강원도 속초시 수복로 36 (교동)2006 년02 월23 일2021-05-032
6지정속초정요양병원격리의료폐기물227-90-21386'033-637-0757(주)제아이씨도시환경(주)중간처분(일반소각)강원도 속초시 동해대로 4084 (조양동)2015 년08 월31 일2021-05-032
7지정속초정요양병원조직물류폐기물(태반을 재활용하는 경우는 제외한다)227-90-21386'033-637-0757(주)제아이씨도시환경(주)중간처분(일반소각)강원도 속초시 동해대로 4084 (조양동)2015 년08 월31 일2021-05-032
8지정속초정요양병원혈액오염폐기물227-90-21386'033-637-0757(주)제아이씨도시환경(주)중간처분(일반소각)강원도 속초시 동해대로 4084 (조양동)2015 년08 월31 일2021-05-032
9지정의료법인 신의의료재단(속초우리요양병원)격리의료폐기물104-82-13384'033-638-0060(주)제아이씨도시환경(주)중간처분(일반소각)강원도 속초시 온천로 291 (교동)2018 년02 월05 일2021-05-032