Overview

Dataset statistics

Number of variables9
Number of observations1303
Missing cells0
Missing cells (%)0.0%
Duplicate rows107
Duplicate rows (%)8.2%
Total size in memory93.0 KiB
Average record size in memory73.1 B

Variable types

Categorical3
Numeric1
Text5

Dataset

Description통영시 사업장폐기물배출자신고현황에 대한 폐기물 구분,신고기준년도,상호명,사업장도로명주소,폐기물 종류,운반자,처리업소명,처리방법,데이터기준일자 대한 정보를 제공합니다.
Author경상남도 통영시
URLhttps://www.data.go.kr/data/15060235/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 107 (8.2%) duplicate rowsDuplicates
폐기물 구분 is highly overall correlated with 처리방법High correlation
처리방법 is highly overall correlated with 폐기물 구분High correlation

Reproduction

Analysis started2024-05-04 07:14:08.319102
Analysis finished2024-05-04 07:14:11.490414
Duration3.17 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

폐기물 구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
지정폐기물배출자
802 
사업장폐기물배출자
501 

Length

Max length9
Median length8
Mean length8.3844973
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장폐기물배출자
2nd row사업장폐기물배출자
3rd row사업장폐기물배출자
4th row사업장폐기물배출자
5th row사업장폐기물배출자

Common Values

ValueCountFrequency (%)
지정폐기물배출자 802
61.6%
사업장폐기물배출자 501
38.4%

Length

2024-05-04T07:14:11.699898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T07:14:12.019113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지정폐기물배출자 802
61.6%
사업장폐기물배출자 501
38.4%

신고기준년도
Real number (ℝ)

Distinct26
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2010.0414
Minimum1999
Maximum2024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.6 KiB
2024-05-04T07:14:12.369920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1999
5-th percentile2000
Q12003
median2009
Q32016
95-th percentile2022
Maximum2024
Range25
Interquartile range (IQR)13

Descriptive statistics

Standard deviation7.5889888
Coefficient of variation (CV)0.0037755385
Kurtosis-1.1887441
Mean2010.0414
Median Absolute Deviation (MAD)7
Skewness0.20468242
Sum2619084
Variance57.592751
MonotonicityNot monotonic
2024-05-04T07:14:12.803638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
2000 215
16.5%
2022 153
 
11.7%
2009 103
 
7.9%
2010 85
 
6.5%
2015 84
 
6.4%
2011 61
 
4.7%
2005 61
 
4.7%
2008 57
 
4.4%
2002 51
 
3.9%
2016 43
 
3.3%
Other values (16) 390
29.9%
ValueCountFrequency (%)
1999 12
 
0.9%
2000 215
16.5%
2001 35
 
2.7%
2002 51
 
3.9%
2003 36
 
2.8%
2004 15
 
1.2%
2005 61
 
4.7%
2006 42
 
3.2%
2007 40
 
3.1%
2008 57
 
4.4%
ValueCountFrequency (%)
2024 7
 
0.5%
2023 19
 
1.5%
2022 153
11.7%
2021 13
 
1.0%
2020 16
 
1.2%
2019 41
 
3.1%
2018 33
 
2.5%
2017 24
 
1.8%
2016 43
 
3.3%
2015 84
6.4%

상호
Text

Distinct499
Distinct (%)38.3%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
2024-05-04T07:14:13.263399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length18
Mean length8.2847276
Min length1

Characters and Unicode

Total characters10795
Distinct characters341
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique222 ?
Unique (%)17.0%

Sample

1st row(주)글로벌중공업 통영지점
2nd row통영원 어업회사법인주식회사
3rd row고성군청(통영시 환경자원화센터)
4th row고성군청(통영시 환경자원화센터)
5th row태양수산
ValueCountFrequency (%)
굴박신장 107
 
6.6%
주식회사 49
 
3.0%
통영시(환경자원화센터 17
 
1.1%
국립수산과학원 17
 
1.1%
에이치에스지성동조선 16
 
1.0%
주)대우환경 16
 
1.0%
spp조선(주 15
 
0.9%
경남도립통영노인전문병원 15
 
0.9%
통영서울병원 15
 
0.9%
13
 
0.8%
Other values (522) 1337
82.7%
2024-05-04T07:14:14.322620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
520
 
4.8%
352
 
3.3%
338
 
3.1%
338
 
3.1%
326
 
3.0%
316
 
2.9%
) 267
 
2.5%
( 267
 
2.5%
253
 
2.3%
225
 
2.1%
Other values (331) 7593
70.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9789
90.7%
Space Separator 316
 
2.9%
Close Punctuation 267
 
2.5%
Open Punctuation 267
 
2.5%
Uppercase Letter 77
 
0.7%
Decimal Number 57
 
0.5%
Lowercase Letter 20
 
0.2%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
520
 
5.3%
352
 
3.6%
338
 
3.5%
338
 
3.5%
326
 
3.3%
253
 
2.6%
225
 
2.3%
206
 
2.1%
202
 
2.1%
186
 
1.9%
Other values (305) 6843
69.9%
Decimal Number
ValueCountFrequency (%)
1 21
36.8%
2 10
17.5%
3 8
 
14.0%
5 7
 
12.3%
4 3
 
5.3%
9 2
 
3.5%
0 2
 
3.5%
6 2
 
3.5%
7 2
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
P 30
39.0%
S 22
28.6%
H 7
 
9.1%
C 7
 
9.1%
D 5
 
6.5%
L 2
 
2.6%
N 2
 
2.6%
G 2
 
2.6%
Lowercase Letter
ValueCountFrequency (%)
o 11
55.0%
n 6
30.0%
f 1
 
5.0%
r 1
 
5.0%
p 1
 
5.0%
Space Separator
ValueCountFrequency (%)
316
100.0%
Close Punctuation
ValueCountFrequency (%)
) 267
100.0%
Open Punctuation
ValueCountFrequency (%)
( 267
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9789
90.7%
Common 909
 
8.4%
Latin 97
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
520
 
5.3%
352
 
3.6%
338
 
3.5%
338
 
3.5%
326
 
3.3%
253
 
2.6%
225
 
2.3%
206
 
2.1%
202
 
2.1%
186
 
1.9%
Other values (305) 6843
69.9%
Common
ValueCountFrequency (%)
316
34.8%
) 267
29.4%
( 267
29.4%
1 21
 
2.3%
2 10
 
1.1%
3 8
 
0.9%
5 7
 
0.8%
4 3
 
0.3%
_ 2
 
0.2%
9 2
 
0.2%
Other values (3) 6
 
0.7%
Latin
ValueCountFrequency (%)
P 30
30.9%
S 22
22.7%
o 11
 
11.3%
H 7
 
7.2%
C 7
 
7.2%
n 6
 
6.2%
D 5
 
5.2%
L 2
 
2.1%
N 2
 
2.1%
G 2
 
2.1%
Other values (3) 3
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9789
90.7%
ASCII 1006
 
9.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
520
 
5.3%
352
 
3.6%
338
 
3.5%
338
 
3.5%
326
 
3.3%
253
 
2.6%
225
 
2.3%
206
 
2.1%
202
 
2.1%
186
 
1.9%
Other values (305) 6843
69.9%
ASCII
ValueCountFrequency (%)
316
31.4%
) 267
26.5%
( 267
26.5%
P 30
 
3.0%
S 22
 
2.2%
1 21
 
2.1%
o 11
 
1.1%
2 10
 
1.0%
3 8
 
0.8%
H 7
 
0.7%
Other values (16) 47
 
4.7%
Distinct390
Distinct (%)29.9%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
2024-05-04T07:14:15.088178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length42
Mean length25.307751
Min length1

Characters and Unicode

Total characters32976
Distinct characters266
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique109 ?
Unique (%)8.4%

Sample

1st row경상남도 통영시 서송정길 110 (봉평동)
2nd row경상남도 통영시 도산면 남해안대로 1729-37
3rd row경상남도 고성군 고성읍 성내로 130_ 고성군청
4th row경상남도 고성군 고성읍 성내로 130_ 고성군청
5th row경상남도 통영시 용남면 밤개길 128-44
ValueCountFrequency (%)
경상남도 1244
 
17.6%
통영시 1242
 
17.6%
광도면 326
 
4.6%
중앙로 255
 
3.6%
북신동 157
 
2.2%
무전동 135
 
1.9%
용남면 111
 
1.6%
도산면 110
 
1.6%
남해안대로 85
 
1.2%
2층 84
 
1.2%
Other values (603) 3301
46.8%
2024-05-04T07:14:16.412839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5866
 
17.8%
1795
 
5.4%
1537
 
4.7%
1338
 
4.1%
1335
 
4.0%
1276
 
3.9%
1267
 
3.8%
1263
 
3.8%
1 1039
 
3.2%
908
 
2.8%
Other values (256) 15352
46.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19833
60.1%
Space Separator 5866
 
17.8%
Decimal Number 4967
 
15.1%
Close Punctuation 641
 
1.9%
Open Punctuation 641
 
1.9%
Connector Punctuation 602
 
1.8%
Dash Punctuation 388
 
1.2%
Uppercase Letter 20
 
0.1%
Math Symbol 7
 
< 0.1%
Other Punctuation 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1795
 
9.1%
1537
 
7.7%
1338
 
6.7%
1335
 
6.7%
1276
 
6.4%
1267
 
6.4%
1263
 
6.4%
908
 
4.6%
708
 
3.6%
598
 
3.0%
Other values (233) 7808
39.4%
Decimal Number
ValueCountFrequency (%)
1 1039
20.9%
2 709
14.3%
0 556
11.2%
4 523
10.5%
3 511
10.3%
9 383
 
7.7%
5 370
 
7.4%
7 348
 
7.0%
8 278
 
5.6%
6 250
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
D 9
45.0%
K 9
45.0%
A 2
 
10.0%
Other Punctuation
ValueCountFrequency (%)
& 5
71.4%
· 2
 
28.6%
Lowercase Letter
ValueCountFrequency (%)
k 2
50.0%
t 2
50.0%
Space Separator
ValueCountFrequency (%)
5866
100.0%
Close Punctuation
ValueCountFrequency (%)
) 641
100.0%
Open Punctuation
ValueCountFrequency (%)
( 641
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 602
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 388
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19833
60.1%
Common 13119
39.8%
Latin 24
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1795
 
9.1%
1537
 
7.7%
1338
 
6.7%
1335
 
6.7%
1276
 
6.4%
1267
 
6.4%
1263
 
6.4%
908
 
4.6%
708
 
3.6%
598
 
3.0%
Other values (233) 7808
39.4%
Common
ValueCountFrequency (%)
5866
44.7%
1 1039
 
7.9%
2 709
 
5.4%
) 641
 
4.9%
( 641
 
4.9%
_ 602
 
4.6%
0 556
 
4.2%
4 523
 
4.0%
3 511
 
3.9%
- 388
 
3.0%
Other values (8) 1643
 
12.5%
Latin
ValueCountFrequency (%)
D 9
37.5%
K 9
37.5%
k 2
 
8.3%
A 2
 
8.3%
t 2
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19833
60.1%
ASCII 13141
39.9%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5866
44.6%
1 1039
 
7.9%
2 709
 
5.4%
) 641
 
4.9%
( 641
 
4.9%
_ 602
 
4.6%
0 556
 
4.2%
4 523
 
4.0%
3 511
 
3.9%
- 388
 
3.0%
Other values (12) 1665
 
12.7%
Hangul
ValueCountFrequency (%)
1795
 
9.1%
1537
 
7.7%
1338
 
6.7%
1335
 
6.7%
1276
 
6.4%
1267
 
6.4%
1263
 
6.4%
908
 
4.6%
708
 
3.6%
598
 
3.0%
Other values (233) 7808
39.4%
None
ValueCountFrequency (%)
· 2
100.0%
Distinct89
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
2024-05-04T07:14:17.161539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length113
Median length84
Mean length12.684574
Min length1

Characters and Unicode

Total characters16528
Distinct characters238
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)2.5%

Sample

1st row폐합성수지류(폐염화비닐수지류는 제외한다)
2nd row수산물가공잔재물
3rd row생활폐기물 소각시설 비산재
4th row생활폐기물 소각시설 바닥재
5th row폐합성수지류(폐염화비닐수지류는 제외한다)
ValueCountFrequency (%)
제외한다 229
 
8.2%
손상성폐기물 188
 
6.7%
일반의료폐기물 188
 
6.7%
폐패각 143
 
5.1%
조직물류폐기물(태반을 136
 
4.9%
재활용하는 136
 
4.9%
경우는 136
 
4.9%
128
 
4.6%
밖의 128
 
4.6%
폐합성수지류(폐염화비닐수지류는 84
 
3.0%
Other values (204) 1305
46.6%
2024-05-04T07:14:18.312938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1516
 
9.2%
1360
 
8.2%
1183
 
7.2%
818
 
4.9%
416
 
2.5%
396
 
2.4%
387
 
2.3%
385
 
2.3%
351
 
2.1%
326
 
2.0%
Other values (228) 9390
56.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14085
85.2%
Space Separator 1516
 
9.2%
Open Punctuation 313
 
1.9%
Close Punctuation 313
 
1.9%
Lowercase Letter 120
 
0.7%
Connector Punctuation 103
 
0.6%
Decimal Number 75
 
0.5%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1360
 
9.7%
1183
 
8.4%
818
 
5.8%
416
 
3.0%
396
 
2.8%
387
 
2.7%
385
 
2.7%
351
 
2.5%
326
 
2.3%
310
 
2.2%
Other values (207) 8153
57.9%
Decimal Number
ValueCountFrequency (%)
2 30
40.0%
0 20
26.7%
1 14
18.7%
8 9
 
12.0%
3 1
 
1.3%
4 1
 
1.3%
Lowercase Letter
ValueCountFrequency (%)
e 40
33.3%
r 20
16.7%
a 20
16.7%
g 20
16.7%
s 20
16.7%
Open Punctuation
ValueCountFrequency (%)
( 284
90.7%
[ 20
 
6.4%
9
 
2.9%
Close Punctuation
ValueCountFrequency (%)
) 284
90.7%
] 20
 
6.4%
9
 
2.9%
Other Punctuation
ValueCountFrequency (%)
· 2
66.7%
. 1
33.3%
Space Separator
ValueCountFrequency (%)
1516
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 103
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14085
85.2%
Common 2323
 
14.1%
Latin 120
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1360
 
9.7%
1183
 
8.4%
818
 
5.8%
416
 
3.0%
396
 
2.8%
387
 
2.7%
385
 
2.7%
351
 
2.5%
326
 
2.3%
310
 
2.2%
Other values (207) 8153
57.9%
Common
ValueCountFrequency (%)
1516
65.3%
( 284
 
12.2%
) 284
 
12.2%
_ 103
 
4.4%
2 30
 
1.3%
0 20
 
0.9%
[ 20
 
0.9%
] 20
 
0.9%
1 14
 
0.6%
9
 
0.4%
Other values (6) 23
 
1.0%
Latin
ValueCountFrequency (%)
e 40
33.3%
r 20
16.7%
a 20
16.7%
g 20
16.7%
s 20
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13919
84.2%
ASCII 2423
 
14.7%
Compat Jamo 166
 
1.0%
None 20
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1516
62.6%
( 284
 
11.7%
) 284
 
11.7%
_ 103
 
4.3%
e 40
 
1.7%
2 30
 
1.2%
r 20
 
0.8%
0 20
 
0.8%
[ 20
 
0.8%
a 20
 
0.8%
Other values (8) 86
 
3.5%
Hangul
ValueCountFrequency (%)
1360
 
9.8%
1183
 
8.5%
818
 
5.9%
416
 
3.0%
396
 
2.8%
387
 
2.8%
385
 
2.8%
351
 
2.5%
326
 
2.3%
310
 
2.2%
Other values (206) 7987
57.4%
Compat Jamo
ValueCountFrequency (%)
166
100.0%
None
ValueCountFrequency (%)
9
45.0%
9
45.0%
· 2
 
10.0%
Distinct252
Distinct (%)19.3%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
2024-05-04T07:14:19.088597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length42
Mean length8.4105909
Min length1

Characters and Unicode

Total characters10959
Distinct characters190
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique140 ?
Unique (%)10.7%

Sample

1st row성진철재
2nd row동원피드
3rd row고성군청
4th row고성군청
5th row(주)대우환경
ValueCountFrequency (%)
소망위생개발 385
27.6%
동원위생 267
19.2%
주)대우환경 38
 
2.7%
동남환경 37
 
2.7%
주)동진개발_지산환경_(주)해양산업개발 22
 
1.6%
대성기업 19
 
1.4%
고성중기환경 14
 
1.0%
우봉이엔티 13
 
0.9%
주)동진건설_지산환경 12
 
0.9%
주)해양산업개발_지산환경 12
 
0.9%
Other values (254) 574
41.2%
2024-05-04T07:14:20.444829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
711
 
6.5%
711
 
6.5%
654
 
6.0%
654
 
6.0%
591
 
5.4%
( 589
 
5.4%
) 589
 
5.4%
486
 
4.4%
400
 
3.6%
392
 
3.6%
Other values (180) 5182
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9066
82.7%
Open Punctuation 589
 
5.4%
Close Punctuation 589
 
5.4%
Connector Punctuation 381
 
3.5%
Space Separator 100
 
0.9%
Other Punctuation 95
 
0.9%
Dash Punctuation 94
 
0.9%
Other Symbol 18
 
0.2%
Lowercase Letter 18
 
0.2%
Uppercase Letter 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
711
 
7.8%
711
 
7.8%
654
 
7.2%
654
 
7.2%
591
 
6.5%
486
 
5.4%
400
 
4.4%
392
 
4.3%
387
 
4.3%
387
 
4.3%
Other values (163) 3693
40.7%
Lowercase Letter
ValueCountFrequency (%)
e 6
33.3%
r 3
16.7%
g 3
16.7%
y 3
16.7%
n 3
16.7%
Other Punctuation
ValueCountFrequency (%)
/ 81
85.3%
: 13
 
13.7%
. 1
 
1.1%
Uppercase Letter
ValueCountFrequency (%)
H 3
50.0%
S 3
50.0%
Open Punctuation
ValueCountFrequency (%)
( 589
100.0%
Close Punctuation
ValueCountFrequency (%)
) 589
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 381
100.0%
Space Separator
ValueCountFrequency (%)
100
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 94
100.0%
Other Symbol
ValueCountFrequency (%)
18
100.0%
Decimal Number
ValueCountFrequency (%)
2 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9084
82.9%
Common 1851
 
16.9%
Latin 24
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
711
 
7.8%
711
 
7.8%
654
 
7.2%
654
 
7.2%
591
 
6.5%
486
 
5.4%
400
 
4.4%
392
 
4.3%
387
 
4.3%
387
 
4.3%
Other values (164) 3711
40.9%
Common
ValueCountFrequency (%)
( 589
31.8%
) 589
31.8%
_ 381
20.6%
100
 
5.4%
- 94
 
5.1%
/ 81
 
4.4%
: 13
 
0.7%
2 3
 
0.2%
. 1
 
0.1%
Latin
ValueCountFrequency (%)
e 6
25.0%
r 3
12.5%
g 3
12.5%
y 3
12.5%
H 3
12.5%
S 3
12.5%
n 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9066
82.7%
ASCII 1875
 
17.1%
None 18
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
711
 
7.8%
711
 
7.8%
654
 
7.2%
654
 
7.2%
591
 
6.5%
486
 
5.4%
400
 
4.4%
392
 
4.3%
387
 
4.3%
387
 
4.3%
Other values (163) 3693
40.7%
ASCII
ValueCountFrequency (%)
( 589
31.4%
) 589
31.4%
_ 381
20.3%
100
 
5.3%
- 94
 
5.0%
/ 81
 
4.3%
: 13
 
0.7%
e 6
 
0.3%
r 3
 
0.2%
g 3
 
0.2%
Other values (6) 16
 
0.9%
None
ValueCountFrequency (%)
18
100.0%
Distinct338
Distinct (%)25.9%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
2024-05-04T07:14:21.027935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length85
Median length81
Mean length14.219493
Min length1

Characters and Unicode

Total characters18528
Distinct characters222
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique238 ?
Unique (%)18.3%

Sample

1st row(주)마루에너지
2nd row동원피드
3rd row고성군청
4th row고성군청
5th row(주)대우환경
ValueCountFrequency (%)
주)경서 423
26.7%
주)아림환경 243
15.4%
주)에코비트에너지경산 242
15.3%
주)대우환경 30
 
1.9%
주)창원에너텍 15
 
0.9%
주)아림환경(주)에코비트에너지경산 12
 
0.8%
주)에스씨이노베이션 12
 
0.8%
주)베스트_(주)삼보패화석_(주)청해광업(고흥)_(주)청해광업(해남 10
 
0.6%
주)아림환경_(주)에코비트에너지경산 9
 
0.6%
통영시 9
 
0.6%
Other values (341) 577
36.5%
2024-05-04T07:14:22.089424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 2207
 
11.9%
( 2200
 
11.9%
2090
 
11.3%
1042
 
5.6%
_ 898
 
4.8%
734
 
4.0%
655
 
3.5%
480
 
2.6%
449
 
2.4%
440
 
2.4%
Other values (212) 7333
39.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12787
69.0%
Close Punctuation 2207
 
11.9%
Open Punctuation 2200
 
11.9%
Connector Punctuation 898
 
4.8%
Space Separator 285
 
1.5%
Dash Punctuation 74
 
0.4%
Uppercase Letter 43
 
0.2%
Other Punctuation 20
 
0.1%
Other Symbol 12
 
0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2090
 
16.3%
1042
 
8.1%
734
 
5.7%
655
 
5.1%
480
 
3.8%
449
 
3.5%
440
 
3.4%
405
 
3.2%
329
 
2.6%
304
 
2.4%
Other values (189) 5859
45.8%
Uppercase Letter
ValueCountFrequency (%)
C 14
32.6%
K 10
23.3%
N 5
 
11.6%
H 3
 
7.0%
S 3
 
7.0%
G 2
 
4.7%
D 1
 
2.3%
T 1
 
2.3%
M 1
 
2.3%
O 1
 
2.3%
Other values (2) 2
 
4.7%
Other Punctuation
ValueCountFrequency (%)
/ 18
90.0%
& 1
 
5.0%
. 1
 
5.0%
Lowercase Letter
ValueCountFrequency (%)
r 1
50.0%
a 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 2207
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2200
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 898
100.0%
Space Separator
ValueCountFrequency (%)
285
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 74
100.0%
Other Symbol
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12799
69.1%
Common 5684
30.7%
Latin 45
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2090
 
16.3%
1042
 
8.1%
734
 
5.7%
655
 
5.1%
480
 
3.8%
449
 
3.5%
440
 
3.4%
405
 
3.2%
329
 
2.6%
304
 
2.4%
Other values (190) 5871
45.9%
Latin
ValueCountFrequency (%)
C 14
31.1%
K 10
22.2%
N 5
 
11.1%
H 3
 
6.7%
S 3
 
6.7%
G 2
 
4.4%
D 1
 
2.2%
T 1
 
2.2%
r 1
 
2.2%
a 1
 
2.2%
Other values (4) 4
 
8.9%
Common
ValueCountFrequency (%)
) 2207
38.8%
( 2200
38.7%
_ 898
15.8%
285
 
5.0%
- 74
 
1.3%
/ 18
 
0.3%
& 1
 
< 0.1%
. 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12787
69.0%
ASCII 5729
30.9%
None 12
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 2207
38.5%
( 2200
38.4%
_ 898
15.7%
285
 
5.0%
- 74
 
1.3%
/ 18
 
0.3%
C 14
 
0.2%
K 10
 
0.2%
N 5
 
0.1%
H 3
 
0.1%
Other values (12) 15
 
0.3%
Hangul
ValueCountFrequency (%)
2090
 
16.3%
1042
 
8.1%
734
 
5.7%
655
 
5.1%
480
 
3.8%
449
 
3.5%
440
 
3.4%
405
 
3.2%
329
 
2.6%
304
 
2.4%
Other values (189) 5859
45.8%
None
ValueCountFrequency (%)
12
100.0%

처리방법
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
중간처분(일반소각)
745 
재활용(중간가공폐기물 제조)
164 
재활용(파쇄.분쇄)
82 
재활용(직접 제품제조)
 
57
재활용(연료·고형연료제품 제조)
 
41
Other values (25)
214 

Length

Max length19
Median length10
Mean length11.09056
Min length1

Unique

Unique7 ?
Unique (%)0.5%

Sample

1st row재활용(중간가공폐기물 제조)
2nd row재활용(원료 제조)
3rd row매립(관리형매립시설)
4th row매립(관리형매립시설)
5th row재활용(중간가공폐기물 제조)

Common Values

ValueCountFrequency (%)
중간처분(일반소각) 745
57.2%
재활용(중간가공폐기물 제조) 164
 
12.6%
재활용(파쇄.분쇄) 82
 
6.3%
재활용(직접 제품제조) 57
 
4.4%
재활용(연료·고형연료제품 제조) 41
 
3.1%
재활용(원료 제조) 41
 
3.1%
중간처분(고온소각) 36
 
2.8%
매립(민간관리형매립시설) 33
 
2.5%
재활용(토질개선에 사용) 22
 
1.7%
재활용(농업생산활동에 사용) 15
 
1.2%
Other values (20) 67
 
5.1%

Length

2024-05-04T07:14:22.629088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중간처분(일반소각 745
45.0%
제조 246
 
14.9%
재활용(중간가공폐기물 164
 
9.9%
재활용(파쇄.분쇄 82
 
5.0%
재활용(직접 60
 
3.6%
제품제조 57
 
3.4%
사용 41
 
2.5%
재활용(연료·고형연료제품 41
 
2.5%
재활용(원료 41
 
2.5%
중간처분(고온소각 36
 
2.2%
Other values (24) 142
 
8.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.3 KiB
2024-04-24
1303 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-04-24
2nd row2024-04-24
3rd row2024-04-24
4th row2024-04-24
5th row2024-04-24

Common Values

ValueCountFrequency (%)
2024-04-24 1303
100.0%

Length

2024-05-04T07:14:23.101862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T07:14:23.528631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-04-24 1303
100.0%

Interactions

2024-05-04T07:14:10.425344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-04T07:14:23.797309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물 구분신고기준년도폐기물 종류처리방법
폐기물 구분1.0000.4870.9990.962
신고기준년도0.4871.0000.6970.599
폐기물 종류0.9990.6971.0000.977
처리방법0.9620.5990.9771.000
2024-05-04T07:14:24.131587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물 구분처리방법
폐기물 구분1.0000.850
처리방법0.8501.000
2024-05-04T07:14:24.422460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신고기준년도폐기물 구분처리방법
신고기준년도1.0000.3710.228
폐기물 구분0.3711.0000.850
처리방법0.2280.8501.000

Missing values

2024-05-04T07:14:10.800337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-04T07:14:11.290358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

폐기물 구분신고기준년도상호사업장도로명주소폐기물 종류운반자처리업소명처리방법데이터기준일자
0사업장폐기물배출자2023(주)글로벌중공업 통영지점경상남도 통영시 서송정길 110 (봉평동)폐합성수지류(폐염화비닐수지류는 제외한다)성진철재(주)마루에너지재활용(중간가공폐기물 제조)2024-04-24
1사업장폐기물배출자2023통영원 어업회사법인주식회사경상남도 통영시 도산면 남해안대로 1729-37수산물가공잔재물동원피드동원피드재활용(원료 제조)2024-04-24
2사업장폐기물배출자2022고성군청(통영시 환경자원화센터)경상남도 고성군 고성읍 성내로 130_ 고성군청생활폐기물 소각시설 비산재고성군청고성군청매립(관리형매립시설)2024-04-24
3사업장폐기물배출자2022고성군청(통영시 환경자원화센터)경상남도 고성군 고성읍 성내로 130_ 고성군청생활폐기물 소각시설 바닥재고성군청고성군청매립(관리형매립시설)2024-04-24
4사업장폐기물배출자2022태양수산경상남도 통영시 용남면 밤개길 128-44폐합성수지류(폐염화비닐수지류는 제외한다)(주)대우환경(주)대우환경재활용(중간가공폐기물 제조)2024-04-24
5사업장폐기물배출자2022주식회사 통영허브수산경상남도 통영시 도산면 남해안대로 1729-45수산물가공잔재물(주)HS아쿠아피드 고성공장(주)HS아쿠아피드 고성공장재활용(원료 제조)2024-04-24
6사업장폐기물배출자2022여명수산(굴양식 제1571호)경상남도 통영시 광도면 용호로 325_ 용호리 327-2 여명수산폐합성수지류(폐염화비닐수지류는 제외한다)(주)예림산업(주)에너지네트웍중간처분(일반소각)2024-04-24
7사업장폐기물배출자2022덕진수산경상남도 통영시 산양읍 풍화일주로 808그 밖의 폐수처리오니(주)조은환경농업회사법인(주)송암바이오재활용(토질개선에 사용)2024-04-24
8사업장폐기물배출자2022(주)삼성산업경상남도 통영시 광도면 은황길 386-38폐아스팔트콘크리트코데코(주)코데코(주)중간처분(파쇄.분쇄)2024-04-24
9사업장폐기물배출자2022(주)삼성산업경상남도 통영시 광도면 은황길 386-38폐콘크리트코데코(주)코데코(주)중간처분(파쇄.분쇄)2024-04-24
폐기물 구분신고기준년도상호사업장도로명주소폐기물 종류운반자처리업소명처리방법데이터기준일자
1293지정폐기물배출자2000통영적십자병원경상남도 통영시 중앙로 97 (서호동)병리계폐기물동원위생(주)아림환경(주)에코비트에너지경산중간처분(일반소각)2024-04-24
1294지정폐기물배출자2000통영적십자병원경상남도 통영시 중앙로 97 (서호동)조직물류폐기물(태반을 재활용하는 경우는 제외한다)동원위생(주)아림환경(주)에코비트에너지경산중간처분(일반소각)2024-04-24
1295지정폐기물배출자2000통영적십자병원경상남도 통영시 중앙로 97 (서호동)조직물류폐기물(태반을 재활용하는 경우는 제외한다)동원위생(주)아림환경(주)에코비트에너지경산중간처분(일반소각)2024-04-24
1296지정폐기물배출자2000통영적십자병원경상남도 통영시 중앙로 97 (서호동)격리의료폐기물동원위생(주)아림환경(주)에코비트에너지경산중간처분(일반소각)2024-04-24
1297지정폐기물배출자2000통영적십자병원경상남도 통영시 중앙로 97 (서호동)격리의료폐기물동원위생(주)아림환경(주)에코비트에너지경산중간처분(일반소각)2024-04-24
1298지정폐기물배출자2000통영적십자병원경상남도 통영시 중앙로 97 (서호동)일반의료폐기물동원위생(주)아림환경(주)에코비트에너지경산중간처분(일반소각)2024-04-24
1299지정폐기물배출자2000통영적십자병원경상남도 통영시 중앙로 97 (서호동)혈액오염폐기물동원위생(주)아림환경(주)에코비트에너지경산중간처분(일반소각)2024-04-24
1300지정폐기물배출자2000통영적십자병원경상남도 통영시 중앙로 97 (서호동)혈액오염폐기물동원위생(주)아림환경(주)에코비트에너지경산중간처분(일반소각)2024-04-24
1301지정폐기물배출자2000통영적십자병원경상남도 통영시 중앙로 97 (서호동)생물ㆍ화학폐기물동원위생(주)아림환경(주)에코비트에너지경산중간처분(일반소각)2024-04-24
1302지정폐기물배출자2000통영적십자병원경상남도 통영시 중앙로 97 (서호동)생물ㆍ화학폐기물동원위생(주)아림환경(주)에코비트에너지경산중간처분(일반소각)2024-04-24

Duplicate rows

Most frequently occurring

폐기물 구분신고기준년도상호사업장도로명주소폐기물 종류운반자처리업소명처리방법데이터기준일자# duplicates
59지정폐기물배출자2011통영서울병원경상남도 통영시 광도면 남해안대로 857격리의료폐기물동남환경(주)경서중간처분(일반소각)2024-04-243
0사업장폐기물배출자2013(주)에스씨이노베이션 리사이클링경상남도 통영시 광도면 춘원1로 107폐금속류(주)신진스틸(주)신진스틸재활용(기타)2024-04-242
1지정폐기물배출자2000경상남도동물위생시험소 남부지소경상남도 통영시 도산면 남해안대로 2018-23_ 경상남도동물위생시험소 남부지소조직물류폐기물(태반을 재활용하는 경우는 제외한다)소망위생개발(주)경서중간처분(일반소각)2024-04-242
2지정폐기물배출자2000김영호내과경상남도 통영시 중앙로 287 (북신동_2층)병리계폐기물동원위생(주)아림환경_ (주)에코비트에너지경산중간처분(일반소각)2024-04-242
3지정폐기물배출자2000김영호내과경상남도 통영시 중앙로 287 (북신동_2층)조직물류폐기물(태반을 재활용하는 경우는 제외한다)동원위생(주)아림환경_ (주)에코비트에너지경산중간처분(일반소각)2024-04-242
4지정폐기물배출자2000김우신성형외과경상남도 통영시 중앙로 307 (무전동)조직물류폐기물(태반을 재활용하는 경우는 제외한다)소망위생개발(주)경서중간처분(일반소각)2024-04-242
5지정폐기물배출자2000미래산부인과의원경상남도 통영시 중앙로 311_ 3층 (무전동)조직물류폐기물(태반을 재활용하는 경우는 제외한다)소망위생개발(주)경서중간처분(일반소각)2024-04-242
6지정폐기물배출자2000삼성늘푸른정형외과경상남도 통영시 중앙로 309 (무전동)병리계폐기물동원위생(주)아림환경_ (주)에코비트에너지경산중간처분(일반소각)2024-04-242
7지정폐기물배출자2000삼성늘푸른정형외과경상남도 통영시 중앙로 309 (무전동)조직물류폐기물(태반을 재활용하는 경우는 제외한다)동원위생(주)아림환경_ (주)에코비트에너지경산중간처분(일반소각)2024-04-242
8지정폐기물배출자2000삼성정형외과의원경상남도 통영시 중앙로 309 (무전동)병리계폐기물동원위생(주)아림환경_ (주)에코비트에너지경산중간처분(일반소각)2024-04-242