Overview

Dataset statistics

Number of variables9
Number of observations2313
Missing cells0
Missing cells (%)0.0%
Duplicate rows79
Duplicate rows (%)3.4%
Total size in memory165.0 KiB
Average record size in memory73.1 B

Variable types

Categorical4
Text5

Dataset

Description경상남도_남해군 사업장폐기물 신고현황 데이터로 상호명, 폐기물 종류, 처리업소명, 처리방법, 사업장주소 등 항목별로 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15062072/fileData.do

Alerts

Dataset has 79 (3.4%) duplicate rowsDuplicates
구분 is highly overall correlated with 폐기물 종류 and 1 other fieldsHigh correlation
처리방법 is highly overall correlated with 구분 and 1 other fieldsHigh correlation
폐기물 종류 is highly overall correlated with 구분 and 1 other fieldsHigh correlation
구분 is highly imbalanced (52.7%)Imbalance
폐기물 종류 is highly imbalanced (63.3%)Imbalance
처리방법 is highly imbalanced (76.3%)Imbalance

Reproduction

Analysis started2023-12-12 08:54:16.415576
Analysis finished2023-12-12 08:54:17.734013
Duration1.32 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
지정폐기물배출자관리
2079 
사업장일반폐기물
234 

Length

Max length10
Median length10
Mean length9.7976654
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장일반폐기물
2nd row사업장일반폐기물
3rd row사업장일반폐기물
4th row사업장일반폐기물
5th row사업장일반폐기물

Common Values

ValueCountFrequency (%)
지정폐기물배출자관리 2079
89.9%
사업장일반폐기물 234
 
10.1%

Length

2023-12-12T17:54:17.811096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:54:17.907247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지정폐기물배출자관리 2079
89.9%
사업장일반폐기물 234
 
10.1%

상호
Text

Distinct259
Distinct (%)11.2%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
2023-12-12T17:54:18.073781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length7.2957198
Min length2

Characters and Unicode

Total characters16875
Distinct characters241
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)4.4%

Sample

1st row남해양식어업 470호
2nd row개인
3rd row414호 양식
4th row남해양식 제493호
5th row남해양식 제522호
ValueCountFrequency (%)
주)금당산업 549
22.9%
주)초원환경 308
12.9%
주)삼화산업개발 270
11.3%
개인 167
 
7.0%
주)원일 165
 
6.9%
주)가온석면환경 148
 
6.2%
주)두남산업개발 130
 
5.4%
경상남도남해교육지원청 31
 
1.3%
주식회사 19
 
0.8%
남해군청 15
 
0.6%
Other values (270) 592
24.7%
2023-12-12T17:54:18.510695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 1874
 
11.1%
) 1874
 
11.1%
1689
 
10.0%
1019
 
6.0%
997
 
5.9%
693
 
4.1%
563
 
3.3%
559
 
3.3%
551
 
3.3%
547
 
3.2%
Other values (231) 6509
38.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12955
76.8%
Open Punctuation 1874
 
11.1%
Close Punctuation 1874
 
11.1%
Decimal Number 85
 
0.5%
Space Separator 81
 
0.5%
Uppercase Letter 4
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1689
 
13.0%
1019
 
7.9%
997
 
7.7%
693
 
5.3%
563
 
4.3%
559
 
4.3%
551
 
4.3%
547
 
4.2%
487
 
3.8%
411
 
3.2%
Other values (215) 5439
42.0%
Decimal Number
ValueCountFrequency (%)
4 22
25.9%
1 15
17.6%
3 12
14.1%
9 7
 
8.2%
5 7
 
8.2%
6 6
 
7.1%
8 5
 
5.9%
0 5
 
5.9%
2 4
 
4.7%
7 2
 
2.4%
Uppercase Letter
ValueCountFrequency (%)
K 2
50.0%
S 2
50.0%
Open Punctuation
ValueCountFrequency (%)
( 1874
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1874
100.0%
Space Separator
ValueCountFrequency (%)
81
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12955
76.8%
Common 3916
 
23.2%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1689
 
13.0%
1019
 
7.9%
997
 
7.7%
693
 
5.3%
563
 
4.3%
559
 
4.3%
551
 
4.3%
547
 
4.2%
487
 
3.8%
411
 
3.2%
Other values (215) 5439
42.0%
Common
ValueCountFrequency (%)
( 1874
47.9%
) 1874
47.9%
81
 
2.1%
4 22
 
0.6%
1 15
 
0.4%
3 12
 
0.3%
9 7
 
0.2%
5 7
 
0.2%
6 6
 
0.2%
8 5
 
0.1%
Other values (4) 13
 
0.3%
Latin
ValueCountFrequency (%)
K 2
50.0%
S 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12955
76.8%
ASCII 3920
 
23.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 1874
47.8%
) 1874
47.8%
81
 
2.1%
4 22
 
0.6%
1 15
 
0.4%
3 12
 
0.3%
9 7
 
0.2%
5 7
 
0.2%
6 6
 
0.2%
8 5
 
0.1%
Other values (6) 17
 
0.4%
Hangul
ValueCountFrequency (%)
1689
 
13.0%
1019
 
7.9%
997
 
7.7%
693
 
5.3%
563
 
4.3%
559
 
4.3%
551
 
4.3%
547
 
4.2%
487
 
3.8%
411
 
3.2%
Other values (215) 5439
42.0%

폐기물 종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct28
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등
1017 
흩날릴 우려가 없는 폐석면
1015 
폐패각
 
94
폐합성수지류(폐염화비닐수지류는 제외한다)
 
77
임목폐목재(건설공사_ 산지개간 등의 과정에서 발생된 나무뿌리_ 가지_ 줄기 등을 말한다)
 
31
Other values (23)
 
79

Length

Max length81
Median length66
Mean length24.871163
Min length3

Unique

Unique14 ?
Unique (%)0.6%

Sample

1st row폐패각
2nd row폐패각
3rd row폐패각
4th row폐패각
5th row폐패각

Common Values

ValueCountFrequency (%)
석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등 1017
44.0%
흩날릴 우려가 없는 폐석면 1015
43.9%
폐패각 94
 
4.1%
폐합성수지류(폐염화비닐수지류는 제외한다) 77
 
3.3%
임목폐목재(건설공사_ 산지개간 등의 과정에서 발생된 나무뿌리_ 가지_ 줄기 등을 말한다) 31
 
1.3%
흩날릴 우려가 있는 폐석면 23
 
1.0%
그 밖의 폐목재류 13
 
0.6%
폐절연유(폴리클로리네이티드비페닐 함유 폐기물을 제외한다) 8
 
0.3%
그 밖의 폐농약 7
 
0.3%
하수준설토 4
 
0.2%
Other values (18) 24
 
1.0%

Length

2023-12-12T17:54:18.699656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
폐석면 1038
9.5%
우려가 1038
9.5%
흩날릴 1038
9.5%
1017
9.3%
제거작업에 1017
9.3%
석면의 1017
9.3%
비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 1017
9.3%
모든 1017
9.3%
사용된 1017
9.3%
없는 1015
9.3%
Other values (54) 740
6.7%
Distinct174
Distinct (%)7.5%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
2023-12-12T17:54:18.928668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.472546
Min length2

Characters and Unicode

Total characters26536
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)4.0%

Sample

1st row292-98-00703
2nd row166-90-01751
3rd row613-92-16303
4th row613-90-89495
5th row129-97-79905
ValueCountFrequency (%)
613-81-59698 551
23.8%
613-81-33683 308
13.3%
721-86-00950 270
11.7%
000-00-00000 171
 
7.4%
489-87-00465 165
 
7.1%
613-81-63625 148
 
6.4%
407-81-19150 130
 
5.6%
122
 
5.3%
614-83-00743 95
 
4.1%
614-83-00762 31
 
1.3%
Other values (164) 322
13.9%
2023-12-12T17:54:19.379679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 4626
17.4%
0 3821
14.4%
1 3340
12.6%
8 3123
11.8%
6 3090
11.6%
3 2693
10.1%
9 1883
7.1%
5 1439
 
5.4%
4 946
 
3.6%
7 842
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 21910
82.6%
Dash Punctuation 4626
 
17.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3821
17.4%
1 3340
15.2%
8 3123
14.3%
6 3090
14.1%
3 2693
12.3%
9 1883
8.6%
5 1439
 
6.6%
4 946
 
4.3%
7 842
 
3.8%
2 733
 
3.3%
Dash Punctuation
ValueCountFrequency (%)
- 4626
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 26536
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 4626
17.4%
0 3821
14.4%
1 3340
12.6%
8 3123
11.8%
6 3090
11.6%
3 2693
10.1%
9 1883
7.1%
5 1439
 
5.4%
4 946
 
3.6%
7 842
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 26536
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 4626
17.4%
0 3821
14.4%
1 3340
12.6%
8 3123
11.8%
6 3090
11.6%
3 2693
10.1%
9 1883
7.1%
5 1439
 
5.4%
4 946
 
3.6%
7 842
 
3.2%
Distinct84
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
2023-12-12T17:54:19.689381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length17
Mean length7.3549503
Min length4

Characters and Unicode

Total characters17012
Distinct characters135
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)1.4%

Sample

1st row여수바이오(주)지점
2nd row여수바이오(주)지점
3rd row여수바이오(주)지점
4th row여수바이오(주)지점
5th row여수바이오(주)지점
ValueCountFrequency (%)
㈜태창크린텍 587
25.3%
주)아시아환경 399
17.2%
주)우리환경 250
10.8%
주)태경산업환경 213
 
9.2%
주)금화로지스 160
 
6.9%
㈜태경산업환경 150
 
6.5%
주)태창크린텍 138
 
5.9%
여수바이오(주)지점 64
 
2.8%
남문개발(주 35
 
1.5%
해양바이오(주 29
 
1.2%
Other values (77) 299
12.9%
2023-12-12T17:54:20.110002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1537
 
9.0%
1515
 
8.9%
( 1509
 
8.9%
) 1508
 
8.9%
1168
 
6.9%
1094
 
6.4%
802
 
4.7%
749
 
4.4%
742
 
4.4%
739
 
4.3%
Other values (125) 5649
33.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13244
77.9%
Open Punctuation 1509
 
8.9%
Close Punctuation 1508
 
8.9%
Other Symbol 739
 
4.3%
Space Separator 11
 
0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1537
11.6%
1515
11.4%
1168
 
8.8%
1094
 
8.3%
802
 
6.1%
749
 
5.7%
742
 
5.6%
728
 
5.5%
727
 
5.5%
476
 
3.6%
Other values (120) 3706
28.0%
Open Punctuation
ValueCountFrequency (%)
( 1509
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1508
100.0%
Other Symbol
ValueCountFrequency (%)
739
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13983
82.2%
Common 3029
 
17.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1537
 
11.0%
1515
 
10.8%
1168
 
8.4%
1094
 
7.8%
802
 
5.7%
749
 
5.4%
742
 
5.3%
739
 
5.3%
728
 
5.2%
727
 
5.2%
Other values (121) 4182
29.9%
Common
ValueCountFrequency (%)
( 1509
49.8%
) 1508
49.8%
11
 
0.4%
_ 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13244
77.9%
ASCII 3029
 
17.8%
None 739
 
4.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1537
11.6%
1515
11.4%
1168
 
8.8%
1094
 
8.3%
802
 
6.1%
749
 
5.7%
742
 
5.6%
728
 
5.5%
727
 
5.5%
476
 
3.6%
Other values (120) 3706
28.0%
ASCII
ValueCountFrequency (%)
( 1509
49.8%
) 1508
49.8%
11
 
0.4%
_ 1
 
< 0.1%
None
ValueCountFrequency (%)
739
100.0%
Distinct81
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
2023-12-12T17:54:20.378298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length18
Mean length8.8677043
Min length4

Characters and Unicode

Total characters20511
Distinct characters143
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)1.6%

Sample

1st row여수바이오(주)
2nd row여수바이오(주)
3rd row여수바이오(주)
4th row여수바이오(주)
5th row여수바이오(주)
ValueCountFrequency (%)
에코시스템(주 824
32.7%
㈜이앤컴퍼니구미지점 499
19.8%
구미지점 194
 
7.7%
주)에코비트그린 156
 
6.2%
㈜에코비트그린 152
 
6.0%
㈜이앤컴퍼니 98
 
3.9%
주)이앤컴퍼니 97
 
3.8%
여수바이오(주 65
 
2.6%
한맥테코산업(주 60
 
2.4%
인선이엔티(주)광양 46
 
1.8%
Other values (77) 332
13.2%
2023-12-12T17:54:20.830592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1516
 
7.4%
( 1508
 
7.4%
) 1508
 
7.4%
1209
 
5.9%
1151
 
5.6%
941
 
4.6%
834
 
4.1%
831
 
4.1%
825
 
4.0%
807
 
3.9%
Other values (133) 9381
45.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16531
80.6%
Open Punctuation 1508
 
7.4%
Close Punctuation 1508
 
7.4%
Other Symbol 752
 
3.7%
Space Separator 210
 
1.0%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1516
 
9.2%
1209
 
7.3%
1151
 
7.0%
941
 
5.7%
834
 
5.0%
831
 
5.0%
825
 
5.0%
807
 
4.9%
728
 
4.4%
726
 
4.4%
Other values (127) 6963
42.1%
Uppercase Letter
ValueCountFrequency (%)
B 1
50.0%
H 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 1508
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1508
100.0%
Other Symbol
ValueCountFrequency (%)
752
100.0%
Space Separator
ValueCountFrequency (%)
210
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17283
84.3%
Common 3226
 
15.7%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1516
 
8.8%
1209
 
7.0%
1151
 
6.7%
941
 
5.4%
834
 
4.8%
831
 
4.8%
825
 
4.8%
807
 
4.7%
752
 
4.4%
728
 
4.2%
Other values (128) 7689
44.5%
Common
ValueCountFrequency (%)
( 1508
46.7%
) 1508
46.7%
210
 
6.5%
Latin
ValueCountFrequency (%)
B 1
50.0%
H 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16531
80.6%
ASCII 3228
 
15.7%
None 752
 
3.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1516
 
9.2%
1209
 
7.3%
1151
 
7.0%
941
 
5.7%
834
 
5.0%
831
 
5.0%
825
 
5.0%
807
 
4.9%
728
 
4.4%
726
 
4.4%
Other values (127) 6963
42.1%
ASCII
ValueCountFrequency (%)
( 1508
46.7%
) 1508
46.7%
210
 
6.5%
B 1
 
< 0.1%
H 1
 
< 0.1%
None
ValueCountFrequency (%)
752
100.0%

처리방법
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct13
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
매립(민간관리형매립시설)
2021 
재활용(중간가공폐기물 제조)
 
107
재활용(원료 제조)
 
52
재활용(직접 제품제조)
 
45
재활용(연료·고형연료제품 제조)
 
29
Other values (8)
 
59

Length

Max length17
Median length13
Mean length12.982706
Min length9

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row재활용(원료 제조)
2nd row재활용(원형 재사용)
3rd row재활용(원료 제조)
4th row재활용(원료 제조)
5th row재활용(원료 제조)

Common Values

ValueCountFrequency (%)
매립(민간관리형매립시설) 2021
87.4%
재활용(중간가공폐기물 제조) 107
 
4.6%
재활용(원료 제조) 52
 
2.2%
재활용(직접 제품제조) 45
 
1.9%
재활용(연료·고형연료제품 제조) 29
 
1.3%
중간처분(고형화) 29
 
1.3%
중간처분(파쇄.분쇄) 13
 
0.6%
중간처분(고온소각) 7
 
0.3%
재활용(토질개선에 사용) 5
 
0.2%
중간처분(일반소각) 2
 
0.1%
Other values (3) 3
 
0.1%

Length

2023-12-12T17:54:21.028702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
매립(민간관리형매립시설 2021
79.2%
제조 188
 
7.4%
재활용(중간가공폐기물 107
 
4.2%
재활용(원료 52
 
2.0%
재활용(직접 45
 
1.8%
제품제조 45
 
1.8%
재활용(연료·고형연료제품 29
 
1.1%
중간처분(고형화 29
 
1.1%
중간처분(파쇄.분쇄 13
 
0.5%
중간처분(고온소각 7
 
0.3%
Other values (7) 17
 
0.7%
Distinct337
Distinct (%)14.6%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
2023-12-12T17:54:21.362879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length50
Mean length26.324687
Min length17

Characters and Unicode

Total characters60889
Distinct characters283
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique117 ?
Unique (%)5.1%

Sample

1st row경상남도 남해군 이동면 초음리지선 470호
2nd row경상남도 남해군 남해읍 선소리지선 590호
3rd row경상남도 남해군 설천면 진목리 지선
4th row경상남도 남해군 설천면 진목리 지선
5th row경상남도 남해군 설천면 진목리 진목지선
ValueCountFrequency (%)
경상남도 2239
17.5%
남해군 2126
16.7%
남해읍 915
 
7.2%
고현면 512
 
4.0%
창선면 475
 
3.7%
화전로78번길 354
 
2.8%
17-30 339
 
2.7%
남해대로 336
 
2.6%
3243-26 308
 
2.4%
2층 304
 
2.4%
Other values (759) 4850
38.0%
2023-12-12T17:54:21.859278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10474
17.2%
5867
 
9.6%
3508
 
5.8%
2440
 
4.0%
2278
 
3.7%
2256
 
3.7%
2211
 
3.6%
2201
 
3.6%
1 2156
 
3.5%
2 1731
 
2.8%
Other values (273) 25767
42.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 36581
60.1%
Decimal Number 11248
 
18.5%
Space Separator 10474
 
17.2%
Dash Punctuation 1341
 
2.2%
Connector Punctuation 590
 
1.0%
Close Punctuation 323
 
0.5%
Open Punctuation 323
 
0.5%
Uppercase Letter 5
 
< 0.1%
Other Punctuation 2
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5867
16.0%
3508
 
9.6%
2440
 
6.7%
2278
 
6.2%
2256
 
6.2%
2211
 
6.0%
2201
 
6.0%
1399
 
3.8%
1318
 
3.6%
1268
 
3.5%
Other values (252) 11835
32.4%
Decimal Number
ValueCountFrequency (%)
1 2156
19.2%
2 1731
15.4%
3 1596
14.2%
6 1078
9.6%
7 1003
8.9%
0 947
8.4%
4 936
8.3%
5 700
 
6.2%
8 572
 
5.1%
9 529
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
B 2
40.0%
P 1
20.0%
R 1
20.0%
F 1
20.0%
Space Separator
ValueCountFrequency (%)
10474
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1341
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 590
100.0%
Close Punctuation
ValueCountFrequency (%)
) 323
100.0%
Open Punctuation
ValueCountFrequency (%)
( 323
100.0%
Other Punctuation
ValueCountFrequency (%)
· 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 36581
60.1%
Common 24301
39.9%
Latin 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5867
16.0%
3508
 
9.6%
2440
 
6.7%
2278
 
6.2%
2256
 
6.2%
2211
 
6.0%
2201
 
6.0%
1399
 
3.8%
1318
 
3.6%
1268
 
3.5%
Other values (252) 11835
32.4%
Common
ValueCountFrequency (%)
10474
43.1%
1 2156
 
8.9%
2 1731
 
7.1%
3 1596
 
6.6%
- 1341
 
5.5%
6 1078
 
4.4%
7 1003
 
4.1%
0 947
 
3.9%
4 936
 
3.9%
5 700
 
2.9%
Other values (6) 2339
 
9.6%
Latin
ValueCountFrequency (%)
B 2
28.6%
e 2
28.6%
P 1
14.3%
R 1
14.3%
F 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 36581
60.1%
ASCII 24306
39.9%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10474
43.1%
1 2156
 
8.9%
2 1731
 
7.1%
3 1596
 
6.6%
- 1341
 
5.5%
6 1078
 
4.4%
7 1003
 
4.1%
0 947
 
3.9%
4 936
 
3.9%
5 700
 
2.9%
Other values (10) 2344
 
9.6%
Hangul
ValueCountFrequency (%)
5867
16.0%
3508
 
9.6%
2440
 
6.7%
2278
 
6.2%
2256
 
6.2%
2211
 
6.0%
2201
 
6.0%
1399
 
3.8%
1318
 
3.6%
1268
 
3.5%
Other values (252) 11835
32.4%
None
ValueCountFrequency (%)
· 2
100.0%

신고일
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
2022
1050 
2021
1026 
2023
237 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2022 1050
45.4%
2021 1026
44.4%
2023 237
 
10.2%

Length

2023-12-12T17:54:22.035994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:54:22.128864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 1050
45.4%
2021 1026
44.4%
2023 237
 
10.2%

Correlations

2023-12-12T17:54:22.208458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분폐기물 종류운반자처리업소명처리방법신고일
구분1.0001.0000.9991.0000.9420.051
폐기물 종류1.0001.0000.9970.9960.9680.138
운반자0.9990.9971.0000.9990.9800.818
처리업소명1.0000.9960.9991.0000.9870.884
처리방법0.9420.9680.9800.9871.0000.170
신고일0.0510.1380.8180.8840.1701.000
2023-12-12T17:54:22.322555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신고일구분처리방법폐기물 종류
신고일1.0000.0850.0960.069
구분0.0851.0000.9490.994
처리방법0.0960.9491.0000.780
폐기물 종류0.0690.9940.7801.000
2023-12-12T17:54:22.419285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분폐기물 종류처리방법신고일
구분1.0000.9940.9490.085
폐기물 종류0.9941.0000.7800.069
처리방법0.9490.7801.0000.096
신고일0.0850.0690.0961.000

Missing values

2023-12-12T17:54:17.559183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:54:17.676148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분상호폐기물 종류사업자등록번호운반자처리업소명처리방법사업장도로명주소신고일
0사업장일반폐기물남해양식어업 470호폐패각292-98-00703여수바이오(주)지점여수바이오(주)재활용(원료 제조)경상남도 남해군 이동면 초음리지선 470호2021
1사업장일반폐기물개인폐패각166-90-01751여수바이오(주)지점여수바이오(주)재활용(원형 재사용)경상남도 남해군 남해읍 선소리지선 590호2021
2사업장일반폐기물414호 양식폐패각613-92-16303여수바이오(주)지점여수바이오(주)재활용(원료 제조)경상남도 남해군 설천면 진목리 지선2021
3사업장일반폐기물남해양식 제493호폐패각613-90-89495여수바이오(주)지점여수바이오(주)재활용(원료 제조)경상남도 남해군 설천면 진목리 지선2021
4사업장일반폐기물남해양식 제522호폐패각129-97-79905여수바이오(주)지점여수바이오(주)재활용(원료 제조)경상남도 남해군 설천면 진목리 진목지선2021
5사업장일반폐기물주식회사 덕산건설폐합성수지류(폐염화비닐수지류는 제외한다)110-81-82045보물섬환경보물섬환경재활용(중간가공폐기물 제조)경상남도 창원시 진해구 충장로511번길 7-1 (풍호동)2021
6사업장일반폐기물태영상사폐패각191-96-01439여수바이오(주)지점여수바이오(주)재활용(원료 제조)경상남도 남해군 남해읍 선소로 1502021
7사업장일반폐기물남해군청(도시건축과)하수준설토614-83-00743(주)장산환경인선이엔티(주)사천지점매립(민간관리형매립시설)경상남도 남해군 남해읍 망운로9번길 12_ 남해군청2021
8사업장일반폐기물남해종합환경(주)그 밖의 폐목재류613-81-41811남해종합환경(주)(주)HB에너지재활용(원료 제조)경상남도 남해군 남해읍 에코파크길 81-232021
9사업장일반폐기물고현면사무소하수준설토614-83-00928(주)장산환경인선이엔티(주)사천지점매립(민간관리형매립시설)경상남도 남해군 고현면 탑동로 422021
구분상호폐기물 종류사업자등록번호운반자처리업소명처리방법사업장도로명주소신고일
2303지정폐기물배출자관리경남해양과학고등학교석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등614-83-00971남문개발(주)(주)양원이엔지매립(민간관리형매립시설)경상남도 남해군 삼동면 동부대로 1810_ 경남해양과학고등학교2023
2304지정폐기물배출자관리경남해양과학고등학교흩날릴 우려가 없는 폐석면614-83-00971남문개발(주)(주)양원이엔지매립(민간관리형매립시설)경상남도 남해군 삼동면 동부대로 1810_ 경남해양과학고등학교2023
2305지정폐기물배출자관리경남해양과학고등학교흩날릴 우려가 있는 폐석면614-83-00971남문개발(주)(주)디와이솔루션중간처분(고형화)경상남도 남해군 삼동면 동부대로 1810_ 경남해양과학고등학교2023
2306지정폐기물배출자관리경남도립남해대학흩날릴 우려가 있는 폐석면614-83-02848(주)태경산업환경(주)디와이솔루션중간처분(고형화)경상남도 남해군 남해읍 화전로78번길 30_ 경남도립남해대학2023
2307지정폐기물배출자관리경남도립남해대학흩날릴 우려가 없는 폐석면614-83-02848(주)태경산업환경(주)에코비트그린매립(민간관리형매립시설)경상남도 남해군 남해읍 화전로78번길 30_ 경남도립남해대학2023
2308지정폐기물배출자관리경남도립남해대학석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등614-83-02848(주)태경산업환경(주)에코비트그린매립(민간관리형매립시설)경상남도 남해군 남해읍 화전로78번길 30_ 경남도립남해대학2023
2309지정폐기물배출자관리동남해농업협동조합석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등614-82-00203(주)우리환경(주)에코비트그린매립(민간관리형매립시설)경상남도 남해군 이동면 무림로 82_ 동남해농협2023
2310지정폐기물배출자관리동남해농업협동조합흩날릴 우려가 없는 폐석면614-82-00203(주)우리환경(주)에코비트그린매립(민간관리형매립시설)경상남도 남해군 이동면 무림로 82_ 동남해농협2023
2311지정폐기물배출자관리개인(송가은)석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등--장산자원개발(주)한맥테코(주)매립(민간관리형매립시설)경상남도 남해군 창선면 흥선로1202번길 352023
2312지정폐기물배출자관리개인(송가은)흩날릴 우려가 없는 폐석면--장산자원개발(주)한맥테코(주)매립(민간관리형매립시설)경상남도 남해군 창선면 흥선로1202번길 352023

Duplicate rows

Most frequently occurring

구분상호폐기물 종류사업자등록번호운반자처리업소명처리방법사업장도로명주소신고일# duplicates
19지정폐기물배출자관리(주)금당산업석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등613-81-59698㈜태창크린텍㈜이앤컴퍼니구미지점매립(민간관리형매립시설)경상남도 남해군 남해읍 화전로78번길 17-30_ 2층 1호2022109
26지정폐기물배출자관리(주)금당산업흩날릴 우려가 없는 폐석면613-81-59698㈜태창크린텍㈜이앤컴퍼니구미지점매립(민간관리형매립시설)경상남도 남해군 남해읍 화전로78번길 17-30_ 2층 1호2022109
14지정폐기물배출자관리(주)금당산업석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등613-81-59698(주)아시아환경에코시스템(주)매립(민간관리형매립시설)경상남도 남해군 남해읍 망운로9번길 14-102021102
21지정폐기물배출자관리(주)금당산업흩날릴 우려가 없는 폐석면613-81-59698(주)아시아환경에코시스템(주)매립(민간관리형매립시설)경상남도 남해군 남해읍 망운로9번길 14-102021102
12지정폐기물배출자관리(주)가온석면환경흩날릴 우려가 없는 폐석면613-81-63625(주)아시아환경에코시스템(주)매립(민간관리형매립시설)경상남도 남해군 창선면 흥선로 34-1202172
48지정폐기물배출자관리(주)초원환경석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등613-81-33683(주)태경산업환경에코시스템(주)매립(민간관리형매립시설)경상남도 남해군 고현면 남해대로 3243-26202171
52지정폐기물배출자관리(주)초원환경흩날릴 우려가 없는 폐석면613-81-33683(주)태경산업환경에코시스템(주)매립(민간관리형매립시설)경상남도 남해군 고현면 남해대로 3243-26202171
11지정폐기물배출자관리(주)가온석면환경석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등613-81-63625(주)아시아환경에코시스템(주)매립(민간관리형매립시설)경상남도 남해군 창선면 흥선로 34-1202170
34지정폐기물배출자관리(주)삼화산업개발석면의 제거작업에 사용된 모든 비닐시트ㆍ방진마스크ㆍ작업복ㆍ집진필터 등721-86-00950㈜태창크린텍㈜이앤컴퍼니구미지점매립(민간관리형매립시설)경상남도 남해군 창선면 창선로66번길 5202265
39지정폐기물배출자관리(주)삼화산업개발흩날릴 우려가 없는 폐석면721-86-00950㈜태창크린텍㈜이앤컴퍼니구미지점매립(민간관리형매립시설)경상남도 남해군 창선면 창선로66번길 5202265