Overview

Dataset statistics

Number of variables: 9
Number of observations: 5954
Missing cells: 1566
Missing cells (%): 2.9%
Duplicate rows: 226
Duplicate rows (%): 3.8%
Total size in memory: 424.6 KiB
Average record size in memory: 73.0 B
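
The derived figures above can be recomputed from the raw counts. The sketch below assumes that the missing-cell share is taken over all cells (observations times variables) and that 1 KiB is 1024 B; under those assumptions the results round to the 2.9%, 3.8% and 73.0 B shown in the report.

```python
# Recompute the derived overview figures from the raw counts in the report.
n_vars = 9
n_rows = 5954
missing_cells = 1566
duplicate_rows = 226
total_kib = 424.6  # "Total size in memory", in KiB

# Assumption: the missing-cell share is taken over all rows x variables cells.
missing_pct = 100 * missing_cells / (n_rows * n_vars)
duplicate_pct = 100 * duplicate_rows / n_rows
# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_kib * 1024 / n_rows

print(f"{missing_pct:.1f}% {duplicate_pct:.1f}% {avg_record_bytes:.1f} B")
```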

Variable types

Categorical: 3
Text: 5
Numeric: 1

Dataset

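Description: The public dataset "사업장폐기물배출자 신고현황" (registration status of business-site waste generators) provides information on waste discharged by business sites in the Gurye area. The data mainly covers the name, industry and location of each business site, the type and quantity of waste, and the discharge date. It can be used to assess the state of local waste management and to monitor and manage environmental-protection and recycling policies efficiently. The data is also used when drawing up waste-management regulations and policies.
Author: 전라남도 구례군
URL: https://www.data.go.kr/data/15066683/fileData.do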

Alerts

데이터기준일자 has constant value "2024-03-05" (Constant)
Dataset has 226 (3.8%) duplicate rows (Duplicates)
폐기물구분 is highly overall correlated with 처리방법 (High correlation)
처리방법 is highly overall correlated with 폐기물구분 (High correlation)
폐기물구분 is highly imbalanced (70.8%) (Imbalance)
처리방법 is highly imbalanced (71.6%) (Imbalance)
사업자등록번호 has 1566 (26.3%) missing values (Missing)
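
The report does not say how the imbalance percentage is defined. One reading that reproduces the 70.8% reported for 폐기물구분 is one minus the normalized Shannon entropy of the class counts. The sketch below assumes that definition; the function name `imbalance_score` is hypothetical. The figure for 처리방법 cannot be checked the same way because only part of its 22 class counts is listed.

```python
import math

def imbalance_score(counts):
    """1 - normalized Shannon entropy: 0 = perfectly balanced, near 1 = one dominant class."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # assumption: a single class is reported as no measurable imbalance
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

# Class counts of 폐기물구분 taken from the report: 지정폐기물 5649, 사업장일반폐기물 305.
score = imbalance_score([5649, 305])
print(f"{score:.1%}")
```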

Reproduction

Analysis started: 2024-03-30 07:39:09.229504
Analysis finished: 2024-03-30 07:39:13.280980
Duration: 4.05 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

폐기물구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 46.6 KiB

지정폐기물: 5649
사업장일반폐기물: 305

Length

Max length: 8
Median length: 5
Mean length: 5.1536782
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사업장일반폐기물
2nd row: 사업장일반폐기물
3rd row: 사업장일반폐기물
4th row: 사업장일반폐기물
5th row: 사업장일반폐기물

Common Values

Value | Count | Frequency (%)
지정폐기물 | 5649 | 94.9%
사업장일반폐기물 | 305 | 5.1%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

업체명
Text

Distinct: 475
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Memory size: 46.6 KiB

Length

Max length: 26
Median length: 25
Mean length: 5.7670474
Min length: 2

Characters and Unicode

Total characters: 34337
Distinct characters: 302
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
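
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one sample value of this variable; the helper name `category_counts` is hypothetical, and the two-letter codes map to the names used in the tables (`Lo` = Other Letter, `Ps` = Open Punctuation, `Pe` = Close Punctuation, `Zs` = Space Separator).

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of `text` by Unicode general category (e.g. 'Lo', 'Ps')."""
    # Normalize to NFC so that each Hangul syllable is a single code point.
    text = unicodedata.normalize("NFC", text)
    return Counter(unicodedata.category(ch) for ch in text)

# One of the sample values shown for this variable.
counts = category_counts("(주)쿱농산 과채가공센터")
print(counts)
```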

Unique

Unique: 176
Unique (%): 3.0%

Sample

1st row: 푸른농원
2nd row: (주)쿱농산 과채가공센터
3rd row: (주)쿱농산 과채가공센터
4th row: 구례군상하수도사업소
5th row: 농업회사법인 주식회사 구례양조 비어락하우스

Value | Count | Frequency (%)
주)황룡 | 1573 | 24.2%
개인 | 1015 | 15.6%
유)대신건설 | 774 | 11.9%
주식회사 | 371 | 5.7%
황룡 | 336 | 5.2%
주)보림건설 | 222 | 3.4%
보광토건(주 | 206 | 3.2%
구례군청 | 197 | 3.0%
주)다온산업개발 | 178 | 2.7%
주)한울산업개발 | 166 | 2.6%
Other values (482) | 1449 | 22.3%

Most occurring characters

ValueCountFrequency (%)
( 3425
 
10.0%
) 3425
 
10.0%
2966
 
8.6%
1916
 
5.6%
1909
 
5.6%
1401
 
4.1%
1395
 
4.1%
1175
 
3.4%
1088
 
3.2%
816
 
2.4%
Other values (292) 14821
43.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 26862
78.2%
Open Punctuation 3425
 
10.0%
Close Punctuation 3425
 
10.0%
Space Separator 533
 
1.6%
Uppercase Letter 52
 
0.2%
Decimal Number 27
 
0.1%
Lowercase Letter 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2966
 
11.0%
1916
 
7.1%
1909
 
7.1%
1401
 
5.2%
1395
 
5.2%
1175
 
4.4%
1088
 
4.1%
816
 
3.0%
807
 
3.0%
802
 
3.0%
Other values (281) 12587
46.9%
Decimal Number
ValueCountFrequency (%)
9 12
44.4%
1 6
22.2%
8 6
22.2%
2 3
 
11.1%
Uppercase Letter
ValueCountFrequency (%)
O 26
50.0%
C 13
25.0%
P 13
25.0%
Open Punctuation
ValueCountFrequency (%)
( 3425
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3425
100.0%
Space Separator
ValueCountFrequency (%)
533
100.0%
Lowercase Letter
ValueCountFrequency (%)
i 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 26862
78.2%
Common 7410
 
21.6%
Latin 65
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2966
 
11.0%
1916
 
7.1%
1909
 
7.1%
1401
 
5.2%
1395
 
5.2%
1175
 
4.4%
1088
 
4.1%
816
 
3.0%
807
 
3.0%
802
 
3.0%
Other values (281) 12587
46.9%
Common
ValueCountFrequency (%)
( 3425
46.2%
) 3425
46.2%
533
 
7.2%
9 12
 
0.2%
1 6
 
0.1%
8 6
 
0.1%
2 3
 
< 0.1%
Latin
ValueCountFrequency (%)
O 26
40.0%
C 13
20.0%
P 13
20.0%
i 13
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 26862
78.2%
ASCII 7475
 
21.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 3425
45.8%
) 3425
45.8%
533
 
7.1%
O 26
 
0.3%
C 13
 
0.2%
P 13
 
0.2%
i 13
 
0.2%
9 12
 
0.2%
1 6
 
0.1%
8 6
 
0.1%
Hangul
ValueCountFrequency (%)
2966
 
11.0%
1916
 
7.1%
1909
 
7.1%
1401
 
5.2%
1395
 
5.2%
1175
 
4.4%
1088
 
4.1%
816
 
3.0%
807
 
3.0%
802
 
3.0%
Other values (281) 12587
46.9%

폐기물종류
Text

Distinct: 80
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 46.6 KiB

Length

Max length: 113
Median length: 81
Mean length: 24.813067
Min length: 1

Characters and Unicode

Total characters: 147737
Distinct characters: 242
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 27
Unique (%): 0.5%

Sample

1st row: 폐합성수지류(폐염화비닐수지류는 제외한다)
2nd row: 그 밖의 식물성잔재물
3rd row: 폐합성수지류(폐염화비닐수지류는 제외한다)
4th row: 그 밖의 공정오니
5th row: 그 밖의 식물성잔재물

Value | Count | Frequency (%)
흩날릴 | 3134 | 10.3%
우려가 | 3134 | 10.3%
 | 2811 | 9.2%
폐석면 | 2569 | 8.4%
없는 | 2382 | 7.8%
사용된 | 2277 | 7.5%
석면의 | 2227 | 7.3%
제거작업에 | 2227 | 7.3%
모든 | 1888 | 6.2%
비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 | 1888 | 6.2%
Other values (182) | 5892 | 19.4%

Most occurring characters

ValueCountFrequency (%)
24477
 
16.6%
6533
 
4.4%
4848
 
3.3%
4845
 
3.3%
4458
 
3.0%
4456
 
3.0%
4155
 
2.8%
3287
 
2.2%
3221
 
2.2%
3153
 
2.1%
Other values (232) 84304
57.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 122383
82.8%
Space Separator 24477
 
16.6%
Connector Punctuation 245
 
0.2%
Open Punctuation 244
 
0.2%
Close Punctuation 244
 
0.2%
Decimal Number 112
 
0.1%
Lowercase Letter 30
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6533
 
5.3%
4848
 
4.0%
4845
 
4.0%
4458
 
3.6%
4456
 
3.6%
4155
 
3.4%
3287
 
2.7%
3221
 
2.6%
3153
 
2.6%
3144
 
2.6%
Other values (212) 80283
65.6%
Decimal Number
ValueCountFrequency (%)
1 90
80.4%
2 12
 
10.7%
0 5
 
4.5%
3 2
 
1.8%
4 2
 
1.8%
8 1
 
0.9%
Lowercase Letter
ValueCountFrequency (%)
e 10
33.3%
g 5
16.7%
r 5
16.7%
a 5
16.7%
s 5
16.7%
Open Punctuation
ValueCountFrequency (%)
( 238
97.5%
[ 5
 
2.0%
1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 238
97.5%
] 5
 
2.0%
1
 
0.4%
Space Separator
ValueCountFrequency (%)
24477
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 245
100.0%
Other Punctuation
ValueCountFrequency (%)
· 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 122383
82.8%
Common 25324
 
17.1%
Latin 30
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6533
 
5.3%
4848
 
4.0%
4845
 
4.0%
4458
 
3.6%
4456
 
3.6%
4155
 
3.4%
3287
 
2.7%
3221
 
2.6%
3153
 
2.6%
3144
 
2.6%
Other values (212) 80283
65.6%
Common
ValueCountFrequency (%)
24477
96.7%
_ 245
 
1.0%
( 238
 
0.9%
) 238
 
0.9%
1 90
 
0.4%
2 12
 
< 0.1%
0 5
 
< 0.1%
[ 5
 
< 0.1%
] 5
 
< 0.1%
· 2
 
< 0.1%
Other values (5) 7
 
< 0.1%
Latin
ValueCountFrequency (%)
e 10
33.3%
g 5
16.7%
r 5
16.7%
a 5
16.7%
s 5
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 115850
78.4%
ASCII 25350
 
17.2%
Compat Jamo 6533
 
4.4%
None 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24477
96.6%
_ 245
 
1.0%
( 238
 
0.9%
) 238
 
0.9%
1 90
 
0.4%
2 12
 
< 0.1%
e 10
 
< 0.1%
0 5
 
< 0.1%
[ 5
 
< 0.1%
g 5
 
< 0.1%
Other values (7) 25
 
0.1%
Compat Jamo
ValueCountFrequency (%)
6533
100.0%
Hangul
ValueCountFrequency (%)
4848
 
4.2%
4845
 
4.2%
4458
 
3.8%
4456
 
3.8%
4155
 
3.6%
3287
 
2.8%
3221
 
2.8%
3153
 
2.7%
3144
 
2.7%
3134
 
2.7%
Other values (211) 77149
66.6%
None
ValueCountFrequency (%)
· 2
50.0%
1
25.0%
1
25.0%

사업자등록번호
Text

MISSING 

Distinct: 243
Distinct (%): 5.5%
Missing: 1566
Missing (%): 26.3%
Memory size: 46.6 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 52656
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 114
Unique (%): 2.6%

Sample

1st row: 402-02-17472
2nd row: 855-85-01750
3rd row: 855-85-01750
4th row: 416-83-06196
5th row: 173-85-01056
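
The fixed length of 12 and the character totals are consistent with every populated value having the layout NNN-NN-NNNNN seen in the samples. The sketch below treats that layout as an assumption (it is inferred from the samples, not stated by the report) and checks that the totals follow from the number of non-missing rows; the name `BRN_PATTERN` is hypothetical.

```python
import re

# Assumption: every non-missing value follows the NNN-NN-NNNNN layout of the samples.
BRN_PATTERN = re.compile(r"\d{3}-\d{2}-\d{5}")

samples = ["402-02-17472", "855-85-01750", "416-83-06196", "173-85-01056"]
all_match = all(BRN_PATTERN.fullmatch(s) is not None for s in samples)

n_rows, n_missing = 5954, 1566
n_present = n_rows - n_missing     # populated values
total_chars = n_present * 12       # every value is 12 characters long
total_dashes = n_present * 2       # two hyphens per value
total_digits = n_present * 10      # ten digits per value

print(all_match, n_present, total_chars, total_dashes, total_digits)
```

The three totals agree with the report's 52656 total characters, 8776 Dash Punctuation characters and 43880 Decimal Number characters.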

Value | Count | Frequency (%)
416-81-97747 | 1909 | 43.5%
411-81-15959 | 774 | 17.6%
415-81-05171 | 206 | 4.7%
416-83-00945 | 176 | 4.0%
373-81-01401 | 166 | 3.8%
416-83-00979 | 143 | 3.3%
236-86-01465 | 130 | 3.0%
507-86-02691 | 80 | 1.8%
416-83-01022 | 55 | 1.3%
236-86-01456 | 52 | 1.2%
Other values (233) | 697 | 15.9%

Most occurring characters

ValueCountFrequency (%)
1 10108
19.2%
- 8776
16.7%
7 6627
12.6%
4 6563
12.5%
8 4576
8.7%
9 4385
8.3%
6 3789
 
7.2%
5 2737
 
5.2%
0 2693
 
5.1%
3 1392
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 43880
83.3%
Dash Punctuation 8776
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 10108
23.0%
7 6627
15.1%
4 6563
15.0%
8 4576
10.4%
9 4385
10.0%
6 3789
 
8.6%
5 2737
 
6.2%
0 2693
 
6.1%
3 1392
 
3.2%
2 1010
 
2.3%
Dash Punctuation
ValueCountFrequency (%)
- 8776
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 52656
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 10108
19.2%
- 8776
16.7%
7 6627
12.6%
4 6563
12.5%
8 4576
8.7%
9 4385
8.3%
6 3789
 
7.2%
5 2737
 
5.2%
0 2693
 
5.1%
3 1392
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 52656
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 10108
19.2%
- 8776
16.7%
7 6627
12.6%
4 6563
12.5%
8 4576
8.7%
9 4385
8.3%
6 3789
 
7.2%
5 2737
 
5.2%
0 2693
 
5.1%
3 1392
 
2.6%

처리업소명
Text

Distinct: 244
Distinct (%): 4.1%
Missing: 0
Missing (%): 0.0%
Memory size: 46.6 KiB

Length

Max length: 28
Median length: 19
Mean length: 7.9548203
Min length: 1

Characters and Unicode

Total characters: 47363
Distinct characters: 225
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 118
Unique (%): 2.0%

Sample

1st row: 리뉴에너지새한(주)
2nd row: 농업회사법인(주)쿱농산
3rd row: 광양환경
4th row: 농업회사법인유한회사 농부
5th row: 농업회사법인(주)쿱농산

Value | Count | Frequency (%)
주)와이엔텍 | 1818 | 30.4%
한맥테코산업(주 | 743 | 12.4%
㈜와이엔텍 | 668 | 11.2%
승우산업개발(주 | 608 | 10.2%
에코시스템㈜ | 402 | 6.7%
한맥테코산업(주)율촌사업소 | 381 | 6.4%
에코시스템(주 | 240 | 4.0%
주)하나이앤에스 | 207 | 3.5%
인선이엔티(주)광양 | 167 | 2.8%
주)이메디원 | 65 | 1.1%
Other values (234) | 685 | 11.4%

Most occurring characters

ValueCountFrequency (%)
4811
 
10.2%
) 4776
 
10.1%
( 4775
 
10.1%
3071
 
6.5%
2713
 
5.7%
2516
 
5.3%
2503
 
5.3%
2347
 
5.0%
1927
 
4.1%
1917
 
4.0%
Other values (215) 16007
33.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 36655
77.4%
Close Punctuation 4777
 
10.1%
Open Punctuation 4775
 
10.1%
Other Symbol 1070
 
2.3%
Space Separator 55
 
0.1%
Uppercase Letter 13
 
< 0.1%
Decimal Number 12
 
< 0.1%
Dash Punctuation 3
 
< 0.1%
Connector Punctuation 2
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4811
 
13.1%
3071
 
8.4%
2713
 
7.4%
2516
 
6.9%
2503
 
6.8%
2347
 
6.4%
1927
 
5.3%
1917
 
5.2%
1209
 
3.3%
1199
 
3.3%
Other values (200) 12442
33.9%
Uppercase Letter
ValueCountFrequency (%)
N 3
23.1%
T 3
23.1%
E 3
23.1%
C 2
15.4%
K 2
15.4%
Close Punctuation
ValueCountFrequency (%)
) 4776
> 99.9%
} 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 7
58.3%
1 5
41.7%
Open Punctuation
ValueCountFrequency (%)
( 4775
100.0%
Other Symbol
ValueCountFrequency (%)
1070
100.0%
Space Separator
ValueCountFrequency (%)
55
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37725
79.7%
Common 9624
 
20.3%
Latin 14
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4811
 
12.8%
3071
 
8.1%
2713
 
7.2%
2516
 
6.7%
2503
 
6.6%
2347
 
6.2%
1927
 
5.1%
1917
 
5.1%
1209
 
3.2%
1199
 
3.2%
Other values (201) 13512
35.8%
Common
ValueCountFrequency (%)
) 4776
49.6%
( 4775
49.6%
55
 
0.6%
2 7
 
0.1%
1 5
 
0.1%
- 3
 
< 0.1%
_ 2
 
< 0.1%
} 1
 
< 0.1%
Latin
ValueCountFrequency (%)
N 3
21.4%
T 3
21.4%
E 3
21.4%
C 2
14.3%
K 2
14.3%
e 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 36654
77.4%
ASCII 9638
 
20.3%
None 1070
 
2.3%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4811
 
13.1%
3071
 
8.4%
2713
 
7.4%
2516
 
6.9%
2503
 
6.8%
2347
 
6.4%
1927
 
5.3%
1917
 
5.2%
1209
 
3.3%
1199
 
3.3%
Other values (199) 12441
33.9%
ASCII
ValueCountFrequency (%)
) 4776
49.6%
( 4775
49.5%
55
 
0.6%
2 7
 
0.1%
1 5
 
0.1%
- 3
 
< 0.1%
N 3
 
< 0.1%
T 3
 
< 0.1%
E 3
 
< 0.1%
_ 2
 
< 0.1%
Other values (4) 6
 
0.1%
None
ValueCountFrequency (%)
1070
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

처리방법
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 22
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 46.6 KiB

매립(민간관리형매립시설): 4591
중간처분(고형화): 855
중간처분(일반소각): 161
재활용(중간가공폐기물 제조): 107
재활용(직접 제품제조): 34
Other values (17): 206

Length

Max length: 19
Median length: 13
Mean length: 12.332382
Min length: 1

Unique

Unique: 3
Unique (%): 0.1%

Sample

1st row: 중간처분(일반소각)
2nd row: 재활용(토질개선에 사용)
3rd row: 재활용(중간가공폐기물 제조)
4th row: 재활용(토질개선에 사용)
5th row: 재활용(토질개선에 사용)

Common Values

Value | Count | Frequency (%)
매립(민간관리형매립시설) | 4591 | 77.1%
중간처분(고형화) | 855 | 14.4%
중간처분(일반소각) | 161 | 2.7%
재활용(중간가공폐기물 제조) | 107 | 1.8%
재활용(직접 제품제조) | 34 | 0.6%
재활용(파쇄.분쇄) | 34 | 0.6%
중간처분(고온소각) | 32 | 0.5%
중간처분(파쇄.분쇄) | 27 | 0.5%
재활용(성토재·복토재 등으로 사용) | 23 | 0.4%
재활용(연료·고형연료제품 제조) | 19 | 0.3%
Other values (12) | 71 | 1.2%

Length

[Figure: histogram of lengths of the category]

Value | Count | Frequency (%)
매립(민간관리형매립시설 | 4591 | 74.1%
중간처분(고형화 | 855 | 13.8%
중간처분(일반소각 | 161 | 2.6%
제조 | 141 | 2.3%
재활용(중간가공폐기물 | 107 | 1.7%
사용 | 46 | 0.7%
재활용(직접 | 34 | 0.5%
제품제조 | 34 | 0.5%
재활용(파쇄.분쇄 | 34 | 0.5%
중간처분(고온소각 | 32 | 0.5%
Other values (16) | 161 | 2.6%

사업장도로명주소
Text

Distinct: 944
Distinct (%): 15.9%
Missing: 0
Missing (%): 0.0%
Memory size: 46.6 KiB

Length

Max length: 52
Median length: 46
Mean length: 23.843299
Min length: 1

Characters and Unicode

Total characters: 141963
Distinct characters: 307
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 289
Unique (%): 4.9%

Sample

1st row:
2nd row: 전라남도 구례군 용방면 용산로 107-17
3rd row: 전라남도 구례군 용방면 용산로 107-17
4th row: 전라남도 구례군 마산면 섬진강대로 5363_ 구례군상하수도사업소
5th row: 전라남도 구례군 용방면 용산로 107-59

Value | Count | Frequency (%)
전라남도 | 5696 | 18.1%
구례군 | 5141 | 16.3%
구례읍 | 3772 | 12.0%
용방로 | 2017 | 6.4%
41-19 | 1167 | 3.7%
제일토건(주 | 1167 | 3.7%
8 | 787 | 2.5%
양정3길 | 776 | 2.5%
21-17 | 752 | 2.4%
산동면 | 363 | 1.2%
Other values (1140) | 9842 | 31.3%

Most occurring characters

ValueCountFrequency (%)
25906
18.2%
9529
 
6.7%
9413
 
6.6%
1 7099
 
5.0%
6320
 
4.5%
6011
 
4.2%
5875
 
4.1%
5815
 
4.1%
5572
 
3.9%
4055
 
2.9%
Other values (297) 56368
39.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 87976
62.0%
Space Separator 25906
 
18.2%
Decimal Number 19526
 
13.8%
Dash Punctuation 3071
 
2.2%
Connector Punctuation 1901
 
1.3%
Close Punctuation 1781
 
1.3%
Open Punctuation 1781
 
1.3%
Uppercase Letter 15
 
< 0.1%
Lowercase Letter 4
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9529
 
10.8%
9413
 
10.7%
6320
 
7.2%
6011
 
6.8%
5875
 
6.7%
5815
 
6.6%
5572
 
6.3%
4055
 
4.6%
3207
 
3.6%
2851
 
3.2%
Other values (271) 29328
33.3%
Decimal Number
ValueCountFrequency (%)
1 7099
36.4%
2 2729
 
14.0%
3 1803
 
9.2%
9 1616
 
8.3%
4 1596
 
8.2%
8 1524
 
7.8%
7 1350
 
6.9%
0 710
 
3.6%
6 575
 
2.9%
5 524
 
2.7%
Uppercase Letter
ValueCountFrequency (%)
T 2
13.3%
K 2
13.3%
D 2
13.3%
Y 2
13.3%
G 2
13.3%
L 2
13.3%
E 2
13.3%
B 1
6.7%
Space Separator
ValueCountFrequency (%)
25906
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3071
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1901
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1781
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1781
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 4
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 87977
62.0%
Common 53967
38.0%
Latin 19
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9529
 
10.8%
9413
 
10.7%
6320
 
7.2%
6011
 
6.8%
5875
 
6.7%
5815
 
6.6%
5572
 
6.3%
4055
 
4.6%
3207
 
3.6%
2851
 
3.2%
Other values (272) 29329
33.3%
Common
ValueCountFrequency (%)
25906
48.0%
1 7099
 
13.2%
- 3071
 
5.7%
2 2729
 
5.1%
_ 1901
 
3.5%
3 1803
 
3.3%
) 1781
 
3.3%
( 1781
 
3.3%
9 1616
 
3.0%
4 1596
 
3.0%
Other values (6) 4684
 
8.7%
Latin
ValueCountFrequency (%)
e 4
21.1%
T 2
10.5%
K 2
10.5%
D 2
10.5%
Y 2
10.5%
G 2
10.5%
L 2
10.5%
E 2
10.5%
B 1
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 87976
62.0%
ASCII 53986
38.0%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
25906
48.0%
1 7099
 
13.1%
- 3071
 
5.7%
2 2729
 
5.1%
_ 1901
 
3.5%
3 1803
 
3.3%
) 1781
 
3.3%
( 1781
 
3.3%
9 1616
 
3.0%
4 1596
 
3.0%
Other values (15) 4703
 
8.7%
Hangul
ValueCountFrequency (%)
9529
 
10.8%
9413
 
10.7%
6320
 
7.2%
6011
 
6.8%
5875
 
6.7%
5815
 
6.6%
5572
 
6.3%
4055
 
4.6%
3207
 
3.6%
2851
 
3.2%
Other values (271) 29328
33.3%
None
ValueCountFrequency (%)
1
100.0%

신고기준년도
Real number (ℝ)

Distinct: 15
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2019.1001
Minimum: 2010
Maximum: 2024
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 52.5 KiB

Quantile statistics

Minimum: 2010
5-th percentile: 2012
Q1: 2017
Median: 2020
Q3: 2022
95-th percentile: 2023
Maximum: 2024
Range: 14
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 3.3267729
Coefficient of variation (CV): 0.0016476513
Kurtosis: -0.29181712
Mean: 2019.1001
Median Absolute Deviation (MAD): 2
Skewness: -0.83794603
Sum: 12021722
Variance: 11.067418
Monotonicity: Not monotonic
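
Several of these figures are simple functions of the others. A minimal check, using only the rounded values printed in the report (so agreement is expected only up to rounding):

```python
# Values copied from the report for 신고기준년도.
mean, std = 2019.1001, 3.3267729
q1, q3 = 2017, 2022
minimum, maximum = 2010, 2024

cv = std / mean                    # coefficient of variation
variance = std ** 2                # variance is the squared standard deviation
value_range = maximum - minimum    # "Range"
iqr = q3 - q1                      # interquartile range

print(cv, variance, value_range, iqr)
```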
[Figure: histogram with fixed size bins (bins=15)]

Common values

Value | Count | Frequency (%)
2021 | 1119 | 18.8%
2022 | 792 | 13.3%
2020 | 774 | 13.0%
2023 | 774 | 13.0%
2019 | 438 | 7.4%
2018 | 352 | 5.9%
2016 | 352 | 5.9%
2017 | 326 | 5.5%
2013 | 282 | 4.7%
2015 | 254 | 4.3%
Other values (5) | 491 | 8.2%

Minimum 10 values

Value | Count | Frequency (%)
2010 | 48 | 0.8%
2011 | 61 | 1.0%
2012 | 200 | 3.4%
2013 | 282 | 4.7%
2014 | 171 | 2.9%
2015 | 254 | 4.3%
2016 | 352 | 5.9%
2017 | 326 | 5.5%
2018 | 352 | 5.9%
2019 | 438 | 7.4%

Maximum 10 values

Value | Count | Frequency (%)
2024 | 11 | 0.2%
2023 | 774 | 13.0%
2022 | 792 | 13.3%
2021 | 1119 | 18.8%
2020 | 774 | 13.0%
2019 | 438 | 7.4%
2018 | 352 | 5.9%
2017 | 326 | 5.5%
2016 | 352 | 5.9%
2015 | 254 | 4.3%
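
Taken together, the lowest and highest value tables list all 15 distinct years, so the summary statistics can be rebuilt from the counts alone. The sketch below does this with Python's `statistics` module. It assumes the reported variance uses the sample (n - 1) denominator and that quartiles are interpolated inclusively; the report does not state either choice, but both reproduce its numbers.

```python
import statistics

# Year -> count, merged from the lowest-values and highest-values tables.
year_counts = {
    2010: 48, 2011: 61, 2012: 200, 2013: 282, 2014: 171,
    2015: 254, 2016: 352, 2017: 326, 2018: 352, 2019: 438,
    2020: 774, 2021: 1119, 2022: 792, 2023: 774, 2024: 11,
}

# Expand the table back into one value per observation, in sorted order.
values = [year for year, n in sorted(year_counts.items()) for _ in range(n)]

n_obs = len(values)
total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)  # assumption: sample variance (n - 1)
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(n_obs, total, round(mean, 4), round(variance, 6), q1, median, q3)
```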

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 46.6 KiB

2024-03-05: 5954

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024-03-05
2nd row: 2024-03-05
3rd row: 2024-03-05
4th row: 2024-03-05
5th row: 2024-03-05

Common Values

Value | Count | Frequency (%)
2024-03-05 | 5954 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]
 | 폐기물구분 | 폐기물종류 | 처리방법 | 신고기준년도
폐기물구분 | 1.000 | 1.000 | 0.975 | 0.085
폐기물종류 | 1.000 | 1.000 | 0.986 | 0.797
처리방법 | 0.975 | 0.986 | 1.000 | 0.338
신고기준년도 | 0.085 | 0.797 | 0.338 | 1.000

[Figure: correlation heatmap]
 | 처리방법 | 폐기물구분
처리방법 | 1.000 | 0.876
폐기물구분 | 0.876 | 1.000

[Figure: correlation heatmap]
 | 신고기준년도 | 폐기물구분 | 처리방법
신고기준년도 | 1.000 | 0.088 | 0.143
폐기물구분 | 0.088 | 1.000 | 0.876
처리방법 | 0.143 | 0.876 | 1.000
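
The extracted text does not say which association measure produced each matrix. For two categorical variables such as 폐기물구분 and 처리방법, one common choice is Cramér's V. The sketch below implements the plain, uncorrected form on hypothetical toy data and is not claimed to be the computation behind the values above; the function name `cramers_v` is hypothetical.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of labels."""
    n = len(xs)
    pair_counts = Counter(zip(xs, ys))
    x_counts, y_counts = Counter(xs), Counter(ys)
    chi2 = 0.0
    for x, nx in x_counts.items():
        for y, ny in y_counts.items():
            expected = nx * ny / n                 # count expected under independence
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_counts), len(y_counts)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Hypothetical toy rows: the waste class fully determines the treatment method.
kinds = ["지정폐기물"] * 6 + ["사업장일반폐기물"] * 4
methods = ["매립(민간관리형매립시설)"] * 6 + ["재활용(토질개선에 사용)"] * 4
v_dependent = cramers_v(kinds, methods)

# Hypothetical toy rows with no association at all.
v_independent = cramers_v(["a", "a", "b", "b"], ["m", "r", "m", "r"])
```

A value near 1 means one variable almost determines the other, and a value near 0 means no association.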

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

 | 폐기물구분 | 업체명 | 폐기물종류 | 사업자등록번호 | 처리업소명 | 처리방법 | 사업장도로명주소 | 신고기준년도 | 데이터기준일자
0 | 사업장일반폐기물 | 푸른농원 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 402-02-17472 | 리뉴에너지새한(주) | 중간처분(일반소각) |  | 2024 | 2024-03-05
1 | 사업장일반폐기물 | (주)쿱농산 과채가공센터 | 그 밖의 식물성잔재물 | 855-85-01750 | 농업회사법인(주)쿱농산 | 재활용(토질개선에 사용) | 전라남도 구례군 용방면 용산로 107-17 | 2022 | 2024-03-05
2 | 사업장일반폐기물 | (주)쿱농산 과채가공센터 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 855-85-01750 | 광양환경 | 재활용(중간가공폐기물 제조) | 전라남도 구례군 용방면 용산로 107-17 | 2022 | 2024-03-05
3 | 사업장일반폐기물 | 구례군상하수도사업소 | 그 밖의 공정오니 | 416-83-06196 | 농업회사법인유한회사 농부 | 재활용(토질개선에 사용) | 전라남도 구례군 마산면 섬진강대로 5363_ 구례군상하수도사업소 | 2022 | 2024-03-05
4 | 사업장일반폐기물 | 농업회사법인 주식회사 구례양조 비어락하우스 | 그 밖의 식물성잔재물 | 173-85-01056 | 농업회사법인(주)쿱농산 | 재활용(토질개선에 사용) | 전라남도 구례군 용방면 용산로 107-59 | 2022 | 2024-03-05
5 | 사업장일반폐기물 | 농업회사법인 주식회사 구례양조 비어락하우스 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 173-85-01056 | 광양환경 | 재활용(중간가공폐기물 제조) | 전라남도 구례군 용방면 용산로 107-59 | 2022 | 2024-03-05
6 | 사업장일반폐기물 | 김태경 우리밀베이커리 주식회사 | 그 밖의 식물성잔재물 | 138-81-60205 | 농업회사법인(주)쿱농산 | 재활용(토질개선에 사용) | 전라남도 구례군 용방면 용산로 107-54_ 3층 | 2022 | 2024-03-05
7 | 사업장일반폐기물 | 김태경 우리밀베이커리 주식회사 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 138-81-60205 | 광양환경 | 재활용(중간가공폐기물 제조) | 전라남도 구례군 용방면 용산로 107-54_ 3층 | 2022 | 2024-03-05
8 | 사업장일반폐기물 | (주)올곧은 | 폐합성수지류(폐염화비닐수지류는 제외한다) | 123-86-24779 | 광양환경 | 재활용(중간가공폐기물 제조) | 전라남도 구례군 용방면 용산로 107-98 | 2022 | 2024-03-05
9 | 사업장일반폐기물 | (주)올곧은 | 그 밖의 식물성잔재물 | 123-86-24779 | 꿈특이농원 | 재활용(토질개선에 사용) | 전라남도 구례군 용방면 용산로 107-98 | 2022 | 2024-03-05

Last rows

 | 폐기물구분 | 업체명 | 폐기물종류 | 사업자등록번호 | 처리업소명 | 처리방법 | 사업장도로명주소 | 신고기준년도 | 데이터기준일자
5944 | 지정폐기물 | 김무일 | 건조고형물의 함량을 기준으로 하여 석면이 1퍼센트 이상 함유된 제품ㆍ설비(뿜칠로 사용된 것을 포함한다) 등의 해체ㆍ제거 시 발생되는 것 | <NA> | 한맥테코산업(주)율촌사업소 | 매립(민간관리형매립시설) |  | 2010 | 2024-03-05
5945 | 지정폐기물 | 김용섭 | 건조고형물의 함량을 기준으로 하여 석면이 1퍼센트 이상 함유된 제품ㆍ설비(뿜칠로 사용된 것을 포함한다) 등의 해체ㆍ제거 시 발생되는 것 | <NA> | 한맥테코산업(주)율촌사업소 | 매립(민간관리형매립시설) |  | 2010 | 2024-03-05
5946 | 지정폐기물 | 김용섭 | 건조고형물의 함량을 기준으로 하여 석면이 1퍼센트 이상 함유된 제품ㆍ설비(뿜칠로 사용된 것을 포함한다) 등의 해체ㆍ제거 시 발생되는 것 | <NA> | 한맥테코산업(주)율촌사업소 | 매립(민간관리형매립시설) |  | 2010 | 2024-03-05
5947 | 지정폐기물 | 김용묵 | 폐석면 | <NA> | 한맥테코산업(주) | 매립(민간관리형매립시설) |  | 2010 | 2024-03-05
5948 | 지정폐기물 | 부산시 수영구청 | 건조고형물의 함량을 기준으로 하여 석면이 1퍼센트 이상 함유된 제품ㆍ설비(뿜칠로 사용된 것을 포함한다) 등의 해체ㆍ제거 시 발생되는 것 | 617-83-01829 | (주)유니콘 | 매립(민간관리형매립시설) |  | 2010 | 2024-03-05
5949 | 지정폐기물 | 구례교육청 | 폐석면 | 416-83-00979 | 한맥테코산업(주) | 매립(민간관리형매립시설) |  | 2010 | 2024-03-05
5950 | 지정폐기물 | 전라남도구례교육청 | 건조고형물의 함량을 기준으로 하여 석면이 1퍼센트 이상 함유된 제품ㆍ설비(뿜칠로 사용된 것을 포함한다) 등의 해체ㆍ제거 시 발생되는 것 | 416-83-00979 | 한맥테코산업(주)율촌사업소 | 매립(민간관리형매립시설) |  | 2010 | 2024-03-05
5951 | 지정폐기물 | 황의대 | 건조고형물의 함량을 기준으로 하여 석면이 1퍼센트 이상 함유된 제품ㆍ설비(뿜칠로 사용된 것을 포함한다) 등의 해체ㆍ제거 시 발생되는 것 | <NA> | 한맥테코산업(주)율촌사업소 | 매립(민간관리형매립시설) | 전라남도 구례군 광의면 지하1길 39 | 2010 | 2024-03-05
5952 | 지정폐기물 | 구례군청 | 건조고형물의 함량을 기준으로 하여 석면이 1퍼센트 이상 함유된 제품ㆍ설비(뿜칠로 사용된 것을 포함한다) 등의 해체ㆍ제거 시 발생되는 것 | 416-83-00945 | 한맥테코산업(주)율촌사업소 | 매립(민간관리형매립시설) | 전라남도 구례군 구례읍 봉성로 1 | 2010 | 2024-03-05
5953 | 지정폐기물 | 구례군청 | 폐유 | 416-83-00945 | 부여국제전기(주) | 재활용(기타) | 전라남도 구례군 구례읍 봉성로 1 | 2010 | 2024-03-05

Duplicate rows

Most frequently occurring

 | 폐기물구분 | 업체명 | 폐기물종류 | 사업자등록번호 | 처리업소명 | 처리방법 | 사업장도로명주소 | 신고기준년도 | 데이터기준일자 | # duplicates
66 | 지정폐기물 | (주)황룡 | 석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등 | 416-81-97747 | ㈜와이엔텍 | 매립(민간관리형매립시설) | 전라남도 구례군 구례읍 용방로 41-19_ 제일토건(주) | 2020 | 2024-03-05 | 210
75 | 지정폐기물 | (주)황룡 | 흩날릴 우려가 없는 폐석면 | 416-81-97747 | ㈜와이엔텍 | 매립(민간관리형매립시설) | 전라남도 구례군 구례읍 용방로 41-19_ 제일토건(주) | 2020 | 2024-03-05 | 210
79 | 지정폐기물 | (주)황룡 | 흩날릴 우려가 있는 폐석면 | 416-81-97747 | (주)하나이앤에스 | 중간처분(고형화) | 전라남도 구례군 구례읍 용방로 41-19_ 제일토건(주) | 2021 | 2024-03-05 | 136
30 | 지정폐기물 | (유)대신건설 | 흩날릴 우려가 있는 폐석면 | 411-81-15959 | 승우산업개발(주) | 중간처분(고형화) | 전라남도 구례군 구례읍 양정3길 8 | 2021 | 2024-03-05 | 131
68 | 지정폐기물 | (주)황룡 | 석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등 | 416-81-97747 | 에코시스템㈜ | 매립(민간관리형매립시설) | 전라남도 구례군 구례읍 용방로 41-19_ 제일토건(주) | 2021 | 2024-03-05 | 131
77 | 지정폐기물 | (주)황룡 | 흩날릴 우려가 없는 폐석면 | 416-81-97747 | 에코시스템㈜ | 매립(민간관리형매립시설) | 전라남도 구례군 구례읍 용방로 41-19_ 제일토건(주) | 2021 | 2024-03-05 | 131
22 | 지정폐기물 | (유)대신건설 | 석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등 | 411-81-15959 | ㈜와이엔텍 | 매립(민간관리형매립시설) | 전라남도 구례군 구례읍 양정3길 8 | 2021 | 2024-03-05 | 123
26 | 지정폐기물 | (유)대신건설 | 흩날릴 우려가 없는 폐석면 | 411-81-15959 | ㈜와이엔텍 | 매립(민간관리형매립시설) | 전라남도 구례군 구례읍 양정3길 8 | 2021 | 2024-03-05 | 123
62 | 지정폐기물 | (주)황룡 | 석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등 | 416-81-97747 | (주)와이엔텍 | 매립(민간관리형매립시설) | 전라남도 구례군 구례읍 용방로 21-17 | 2023 | 2024-03-05 | 112
71 | 지정폐기물 | (주)황룡 | 흩날릴 우려가 없는 폐석면 | 416-81-97747 | (주)와이엔텍 | 매립(민간관리형매립시설) | 전라남도 구례군 구례읍 용방로 21-17 | 2023 | 2024-03-05 | 112
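
The "# duplicates" column gives how many times each distinct row occurs. The most frequent row alone occurs 210 times, so the 226 in the overview presumably counts distinct repeated rows rather than surplus copies. That reading is an inference from this table, not something the report states. The sketch below computes both quantities on hypothetical miniature rows (not the real file); the function name `duplicate_summary` is also hypothetical.

```python
from collections import Counter

def duplicate_summary(rows):
    """Return (number of distinct rows that repeat, surplus copies beyond the first)."""
    counts = Counter(rows)
    repeated = {row: n for row, n in counts.items() if n > 1}
    surplus = sum(n - 1 for n in repeated.values())
    return len(repeated), surplus

# Hypothetical miniature rows of (업체명, 폐기물종류, 신고기준년도).
rows = [
    ("(주)황룡", "흩날릴 우려가 없는 폐석면", 2020),
    ("(주)황룡", "흩날릴 우려가 없는 폐석면", 2020),
    ("(주)황룡", "흩날릴 우려가 없는 폐석면", 2020),
    ("(유)대신건설", "흩날릴 우려가 있는 폐석면", 2021),
    ("(유)대신건설", "흩날릴 우려가 있는 폐석면", 2021),
    ("구례군청", "폐유", 2010),
]
n_repeated, n_surplus = duplicate_summary(rows)
print(n_repeated, n_surplus)
```

The two numbers answer different questions: the first is how many row patterns repeat, and the second is how many rows a de-duplication step would drop.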