Overview

Dataset statistics

Number of variables11
Number of observations393
Missing cells0
Missing cells (%)0.0%
Duplicate rows13
Duplicate rows (%)3.3%
Total size in memory34.3 KiB
Average record size in memory89.3 B

Variable types

Categorical3
Text7
Numeric1

Dataset

Description옥천군 사업장폐기물배출자 현황으로(폐기물구분, 상호명, 폐기물종류, 사업자등록번호, 연락처, 운반자명, 처리업소명, 처리방법, 사업장도로명주소, 신고기준년)으로 구성
Author충청북도 옥천군
URLhttps://www.data.go.kr/data/15081071/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 13 (3.3%) duplicate rowsDuplicates
신고기준년도 is highly overall correlated with 폐기물구분High correlation
폐기물구분 is highly overall correlated with 신고기준년도 and 1 other fieldsHigh correlation
처리방법 is highly overall correlated with 폐기물구분High correlation

Reproduction

Analysis started2023-12-12 12:28:26.140330
Analysis finished2023-12-12 12:28:27.393208
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

폐기물구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
사업장폐기물(7호)
258 
사업장폐기물(6호)
135 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장폐기물(6호)
2nd row사업장폐기물(6호)
3rd row사업장폐기물(6호)
4th row사업장폐기물(6호)
5th row사업장폐기물(6호)

Common Values

ValueCountFrequency (%)
사업장폐기물(7호) 258
65.6%
사업장폐기물(6호) 135
34.4%

Length

2023-12-12T21:28:27.473794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:28:27.611664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장폐기물(7호 258
65.6%
사업장폐기물(6호 135
34.4%
Distinct193
Distinct (%)49.1%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-12T21:28:27.908560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length14
Mean length8.2620865
Min length2

Characters and Unicode

Total characters3247
Distinct characters214
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique117 ?
Unique (%)29.8%

Sample

1st row(주)하나그린
2nd row향미김치
3rd row환경시설관리주식회사
4th row반석산업
5th row옥천아스콘(주)
ValueCountFrequency (%)
대신이앤씨(주 32
 
7.5%
한국수자원공사 13
 
3.0%
국제종합기계(주 11
 
2.6%
환경시설관리주식회사 9
 
2.1%
케이비아이코스모링크(주 8
 
1.9%
한국수자원공사대청댐관리단 8
 
1.9%
대전금속 8
 
1.9%
관성건설(주 8
 
1.9%
현대건설(주 7
 
1.6%
대청댐관리단 6
 
1.4%
Other values (192) 317
74.2%
2023-12-12T21:28:28.398546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
280
 
8.6%
( 262
 
8.1%
) 262
 
8.1%
106
 
3.3%
98
 
3.0%
94
 
2.9%
92
 
2.8%
68
 
2.1%
64
 
2.0%
53
 
1.6%
Other values (204) 1868
57.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2683
82.6%
Open Punctuation 262
 
8.1%
Close Punctuation 262
 
8.1%
Space Separator 34
 
1.0%
Decimal Number 5
 
0.2%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
280
 
10.4%
106
 
4.0%
98
 
3.7%
94
 
3.5%
92
 
3.4%
68
 
2.5%
64
 
2.4%
53
 
2.0%
50
 
1.9%
50
 
1.9%
Other values (198) 1728
64.4%
Decimal Number
ValueCountFrequency (%)
2 3
60.0%
4 2
40.0%
Open Punctuation
ValueCountFrequency (%)
( 262
100.0%
Close Punctuation
ValueCountFrequency (%)
) 262
100.0%
Space Separator
ValueCountFrequency (%)
34
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2684
82.7%
Common 563
 
17.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
280
 
10.4%
106
 
3.9%
98
 
3.7%
94
 
3.5%
92
 
3.4%
68
 
2.5%
64
 
2.4%
53
 
2.0%
50
 
1.9%
50
 
1.9%
Other values (199) 1729
64.4%
Common
ValueCountFrequency (%)
( 262
46.5%
) 262
46.5%
34
 
6.0%
2 3
 
0.5%
4 2
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2683
82.6%
ASCII 563
 
17.3%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
280
 
10.4%
106
 
4.0%
98
 
3.7%
94
 
3.5%
92
 
3.4%
68
 
2.5%
64
 
2.4%
53
 
2.0%
50
 
1.9%
50
 
1.9%
Other values (198) 1728
64.4%
ASCII
ValueCountFrequency (%)
( 262
46.5%
) 262
46.5%
34
 
6.0%
2 3
 
0.5%
4 2
 
0.4%
None
ValueCountFrequency (%)
1
100.0%
Distinct67
Distinct (%)17.0%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-12T21:28:28.706100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length49
Mean length13.643766
Min length2

Characters and Unicode

Total characters5362
Distinct characters149
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)7.1%

Sample

1st row폐합성수지류(폐염화비닐수지류는 제외한다)
2nd row음식물류폐기물
3rd row하수처리오니
4th row폐합성수지류(폐염화비닐수지류는 제외한다)
5th row폐아스팔트콘크리트
ValueCountFrequency (%)
폐콘크리트 64
 
6.4%
제외한다 53
 
5.3%
폐합성수지류(폐염화비닐수지류는 46
 
4.6%
등을 46
 
4.6%
하수준설토 46
 
4.6%
말한다 45
 
4.5%
등의 42
 
4.2%
과정에서 39
 
3.9%
임목폐목재(건설공사 38
 
3.8%
산지개간 38
 
3.8%
Other values (103) 545
54.4%
2023-12-12T21:28:29.182594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
610
 
11.4%
375
 
7.0%
204
 
3.8%
186
 
3.5%
181
 
3.4%
140
 
2.6%
_ 140
 
2.6%
127
 
2.4%
108
 
2.0%
102
 
1.9%
Other values (139) 3189
59.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4396
82.0%
Space Separator 610
 
11.4%
Connector Punctuation 140
 
2.6%
Close Punctuation 100
 
1.9%
Open Punctuation 100
 
1.9%
Decimal Number 10
 
0.2%
Other Punctuation 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
375
 
8.5%
204
 
4.6%
186
 
4.2%
181
 
4.1%
140
 
3.2%
127
 
2.9%
108
 
2.5%
102
 
2.3%
102
 
2.3%
100
 
2.3%
Other values (132) 2771
63.0%
Decimal Number
ValueCountFrequency (%)
1 9
90.0%
2 1
 
10.0%
Space Separator
ValueCountFrequency (%)
610
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 140
100.0%
Close Punctuation
ValueCountFrequency (%)
) 100
100.0%
Open Punctuation
ValueCountFrequency (%)
( 100
100.0%
Other Punctuation
ValueCountFrequency (%)
. 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4396
82.0%
Common 966
 
18.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
375
 
8.5%
204
 
4.6%
186
 
4.2%
181
 
4.1%
140
 
3.2%
127
 
2.9%
108
 
2.5%
102
 
2.3%
102
 
2.3%
100
 
2.3%
Other values (132) 2771
63.0%
Common
ValueCountFrequency (%)
610
63.1%
_ 140
 
14.5%
) 100
 
10.4%
( 100
 
10.4%
1 9
 
0.9%
. 6
 
0.6%
2 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4386
81.8%
ASCII 966
 
18.0%
Compat Jamo 10
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
610
63.1%
_ 140
 
14.5%
) 100
 
10.4%
( 100
 
10.4%
1 9
 
0.9%
. 6
 
0.6%
2 1
 
0.1%
Hangul
ValueCountFrequency (%)
375
 
8.5%
204
 
4.7%
186
 
4.2%
181
 
4.1%
140
 
3.2%
127
 
2.9%
108
 
2.5%
102
 
2.3%
102
 
2.3%
100
 
2.3%
Other values (131) 2761
63.0%
Compat Jamo
ValueCountFrequency (%)
10
100.0%
Distinct175
Distinct (%)44.5%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-12T21:28:29.512873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters4716
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)25.4%

Sample

1st row302-81-22160
2nd row118-90-22776
3rd row339-81-01041
4th row314-14-31414
5th row619-87-00197
ValueCountFrequency (%)
250-88-00001 32
 
8.1%
306-82-00471 21
 
5.3%
302-83-00836 11
 
2.8%
129-81-00751 11
 
2.8%
339-81-01041 9
 
2.3%
314-01-60544 8
 
2.0%
124-81-34993 8
 
2.0%
101-81-16293 7
 
1.8%
120-82-00052 6
 
1.5%
302-81-03125 6
 
1.5%
Other values (165) 274
69.7%
2023-12-12T21:28:30.020663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 854
18.1%
- 786
16.7%
1 652
13.8%
8 524
11.1%
3 524
11.1%
2 440
9.3%
4 224
 
4.7%
5 217
 
4.6%
6 194
 
4.1%
7 154
 
3.3%
Other values (2) 147
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3928
83.3%
Dash Punctuation 786
 
16.7%
Space Separator 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 854
21.7%
1 652
16.6%
8 524
13.3%
3 524
13.3%
2 440
11.2%
4 224
 
5.7%
5 217
 
5.5%
6 194
 
4.9%
7 154
 
3.9%
9 145
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 786
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4716
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 854
18.1%
- 786
16.7%
1 652
13.8%
8 524
11.1%
3 524
11.1%
2 440
9.3%
4 224
 
4.7%
5 217
 
4.6%
6 194
 
4.1%
7 154
 
3.3%
Other values (2) 147
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4716
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 854
18.1%
- 786
16.7%
1 652
13.8%
8 524
11.1%
3 524
11.1%
2 440
9.3%
4 224
 
4.7%
5 217
 
4.6%
6 194
 
4.1%
7 154
 
3.3%
Other values (2) 147
 
3.1%
Distinct196
Distinct (%)49.9%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-12T21:28:30.325386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.007634
Min length12

Characters and Unicode

Total characters4719
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique121 ?
Unique (%)30.8%

Sample

1st row043-731-8859
2nd row043-733-9745
3rd row043-733-9982
4th row043-731-7397
5th row043-733-8823
ValueCountFrequency (%)
043-733-6015 36
 
9.2%
042-930-7216 10
 
2.5%
043-730-1340 10
 
2.5%
043-733-9982 9
 
2.3%
043-730-5114 8
 
2.0%
043-732-8528 8
 
2.0%
043-731-1902 7
 
1.8%
043-731-5965 6
 
1.5%
043-733-8870 5
 
1.3%
043-733-8959 5
 
1.3%
Other values (186) 289
73.5%
2023-12-12T21:28:30.764261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 941
19.9%
- 786
16.7%
0 675
14.3%
4 528
11.2%
7 439
9.3%
1 331
 
7.0%
2 303
 
6.4%
5 255
 
5.4%
8 167
 
3.5%
9 157
 
3.3%
Other values (2) 137
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3930
83.3%
Dash Punctuation 786
 
16.7%
Space Separator 3
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 941
23.9%
0 675
17.2%
4 528
13.4%
7 439
11.2%
1 331
 
8.4%
2 303
 
7.7%
5 255
 
6.5%
8 167
 
4.2%
9 157
 
4.0%
6 134
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 786
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4719
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 941
19.9%
- 786
16.7%
0 675
14.3%
4 528
11.2%
7 439
9.3%
1 331
 
7.0%
2 303
 
6.4%
5 255
 
5.4%
8 167
 
3.5%
9 157
 
3.3%
Other values (2) 137
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4719
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 941
19.9%
- 786
16.7%
0 675
14.3%
4 528
11.2%
7 439
9.3%
1 331
 
7.0%
2 303
 
6.4%
5 255
 
5.4%
8 167
 
3.5%
9 157
 
3.3%
Other values (2) 137
 
2.9%
Distinct145
Distinct (%)36.9%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-12T21:28:31.012873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length7
Mean length6.7989822
Min length2

Characters and Unicode

Total characters2672
Distinct characters167
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)21.1%

Sample

1st row우진환경개발(주)
2nd row(주)푸른환경산업
3rd row청지환경(주)
4th row(주)대림환경산업
5th row이레환경산업(주)
ValueCountFrequency (%)
유림환경(합 22
 
5.5%
참웰빙임업 18
 
4.5%
대신이엔씨(주 17
 
4.3%
국보환경(주 16
 
4.0%
주)경성건설 16
 
4.0%
성원산업(주 13
 
3.3%
청지환경(주 11
 
2.8%
자가처리 11
 
2.8%
관성건설(주 10
 
2.5%
주)우림 9
 
2.3%
Other values (138) 254
64.0%
2023-12-12T21:28:31.451357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 312
 
11.7%
) 312
 
11.7%
291
 
10.9%
156
 
5.8%
137
 
5.1%
83
 
3.1%
79
 
3.0%
65
 
2.4%
61
 
2.3%
59
 
2.2%
Other values (157) 1117
41.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2027
75.9%
Open Punctuation 312
 
11.7%
Close Punctuation 312
 
11.7%
Uppercase Letter 12
 
0.4%
Space Separator 4
 
0.1%
Other Punctuation 3
 
0.1%
Other Symbol 1
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
291
 
14.4%
156
 
7.7%
137
 
6.8%
83
 
4.1%
79
 
3.9%
65
 
3.2%
61
 
3.0%
59
 
2.9%
41
 
2.0%
38
 
1.9%
Other values (148) 1017
50.2%
Uppercase Letter
ValueCountFrequency (%)
R 6
50.0%
F 4
33.3%
G 2
 
16.7%
Open Punctuation
ValueCountFrequency (%)
( 312
100.0%
Close Punctuation
ValueCountFrequency (%)
) 312
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2028
75.9%
Common 632
 
23.7%
Latin 12
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
291
 
14.3%
156
 
7.7%
137
 
6.8%
83
 
4.1%
79
 
3.9%
65
 
3.2%
61
 
3.0%
59
 
2.9%
41
 
2.0%
38
 
1.9%
Other values (149) 1018
50.2%
Common
ValueCountFrequency (%)
( 312
49.4%
) 312
49.4%
4
 
0.6%
. 3
 
0.5%
_ 1
 
0.2%
Latin
ValueCountFrequency (%)
R 6
50.0%
F 4
33.3%
G 2
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2027
75.9%
ASCII 644
 
24.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 312
48.4%
) 312
48.4%
R 6
 
0.9%
4
 
0.6%
F 4
 
0.6%
. 3
 
0.5%
G 2
 
0.3%
_ 1
 
0.2%
Hangul
ValueCountFrequency (%)
291
 
14.4%
156
 
7.7%
137
 
6.8%
83
 
4.1%
79
 
3.9%
65
 
3.2%
61
 
3.0%
59
 
2.9%
41
 
2.0%
38
 
1.9%
Other values (148) 1017
50.2%
None
ValueCountFrequency (%)
1
100.0%
Distinct154
Distinct (%)39.2%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-12T21:28:31.752247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length6.8575064
Min length2

Characters and Unicode

Total characters2695
Distinct characters172
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)24.2%

Sample

1st row우진환경개발(주)
2nd row그린웨이
3rd row화산한일농장
4th row(주)알엔이옥산공장
5th row인선기업(주)
ValueCountFrequency (%)
유)천만금 39
 
9.9%
국보환경(주 26
 
6.6%
참웰빙임업 18
 
4.6%
대금환경개발(주 17
 
4.3%
주)우림 13
 
3.3%
두제에너지산업(주 11
 
2.8%
자가 11
 
2.8%
주)클렌코 10
 
2.5%
한세이프(주 8
 
2.0%
주)화성그린 7
 
1.8%
Other values (144) 234
59.4%
2023-12-12T21:28:32.243471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 305
 
11.3%
) 305
 
11.3%
273
 
10.1%
105
 
3.9%
101
 
3.7%
84
 
3.1%
80
 
3.0%
72
 
2.7%
64
 
2.4%
48
 
1.8%
Other values (162) 1258
46.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2070
76.8%
Open Punctuation 305
 
11.3%
Close Punctuation 305
 
11.3%
Uppercase Letter 12
 
0.4%
Space Separator 2
 
0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
273
 
13.2%
105
 
5.1%
101
 
4.9%
84
 
4.1%
80
 
3.9%
72
 
3.5%
64
 
3.1%
48
 
2.3%
42
 
2.0%
42
 
2.0%
Other values (155) 1159
56.0%
Uppercase Letter
ValueCountFrequency (%)
R 6
50.0%
F 4
33.3%
G 2
 
16.7%
Open Punctuation
ValueCountFrequency (%)
( 305
100.0%
Close Punctuation
ValueCountFrequency (%)
) 305
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2071
76.8%
Common 612
 
22.7%
Latin 12
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
273
 
13.2%
105
 
5.1%
101
 
4.9%
84
 
4.1%
80
 
3.9%
72
 
3.5%
64
 
3.1%
48
 
2.3%
42
 
2.0%
42
 
2.0%
Other values (156) 1160
56.0%
Common
ValueCountFrequency (%)
( 305
49.8%
) 305
49.8%
2
 
0.3%
Latin
ValueCountFrequency (%)
R 6
50.0%
F 4
33.3%
G 2
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2070
76.8%
ASCII 624
 
23.2%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 305
48.9%
) 305
48.9%
R 6
 
1.0%
F 4
 
0.6%
2
 
0.3%
G 2
 
0.3%
Hangul
ValueCountFrequency (%)
273
 
13.2%
105
 
5.1%
101
 
4.9%
84
 
4.1%
80
 
3.9%
72
 
3.5%
64
 
3.1%
48
 
2.3%
42
 
2.0%
42
 
2.0%
Other values (155) 1159
56.0%
None
ValueCountFrequency (%)
1
100.0%

처리방법
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
재활용(중간가공폐기물 제조)
72 
재활용(파쇄.분쇄)
68 
중간처분(일반소각)
45 
재활용(농업생산활동에 사용)
31 
중간처분(파쇄.분쇄)
22 
Other values (19)
155 

Length

Max length19
Median length17
Mean length11.676845
Min length2

Unique

Unique4 ?
Unique (%)1.0%

Sample

1st row중간처분(일반소각)
2nd row재활용(농업생산활동에 사용)
3rd row재활용(원료 제조)
4th row재활용(원료 제조)
5th row중간처분(파쇄.분쇄)

Common Values

ValueCountFrequency (%)
재활용(중간가공폐기물 제조) 72
18.3%
재활용(파쇄.분쇄) 68
17.3%
중간처분(일반소각) 45
11.5%
재활용(농업생산활동에 사용) 31
7.9%
중간처분(파쇄.분쇄) 22
 
5.6%
재활용(기타) 22
 
5.6%
재활용(연료·고형연료제품 제조) 20
 
5.1%
재활용(직접 제품제조) 19
 
4.8%
매립(민간관리형매립시설) 17
 
4.3%
기타재활용 16
 
4.1%
Other values (14) 61
15.5%

Length

2023-12-12T21:28:32.434484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제조 104
18.2%
재활용(중간가공폐기물 72
12.6%
재활용(파쇄.분쇄 68
11.9%
중간처분(일반소각 45
 
7.9%
사용 45
 
7.9%
재활용(농업생산활동에 31
 
5.4%
중간처분(파쇄.분쇄 22
 
3.8%
재활용(기타 22
 
3.8%
재활용(연료·고형연료제품 20
 
3.5%
재활용(직접 19
 
3.3%
Other values (19) 125
21.8%
Distinct193
Distinct (%)49.1%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-12T21:28:32.770407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length56
Mean length26.07888
Min length18

Characters and Unicode

Total characters10249
Distinct characters239
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique112 ?
Unique (%)28.5%

Sample

1st row충청북도 옥천군 청산면 판수길 210
2nd row충청북도 옥천군 동이면 동이농공길 73-10
3rd row충청북도 옥천군 군북면 이백6길 65 (옥천공공하수처리시설)
4th row충청북도 옥천군 군서면 동평3길 45-61
5th row충청북도 옥천군 이원면 건진2길 49
ValueCountFrequency (%)
충청북도 365
 
16.4%
옥천군 313
 
14.0%
옥천읍 175
 
7.9%
동이면 44
 
2.0%
청산면 33
 
1.5%
이원면 30
 
1.3%
동이농공길 27
 
1.2%
중앙로 24
 
1.1%
청주시 24
 
1.1%
75 21
 
0.9%
Other values (446) 1173
52.6%
2023-12-12T21:28:33.378991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1890
18.4%
567
 
5.5%
559
 
5.5%
472
 
4.6%
406
 
4.0%
393
 
3.8%
378
 
3.7%
373
 
3.6%
1 271
 
2.6%
249
 
2.4%
Other values (229) 4691
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6488
63.3%
Space Separator 1890
 
18.4%
Decimal Number 1354
 
13.2%
Open Punctuation 149
 
1.5%
Close Punctuation 148
 
1.4%
Dash Punctuation 113
 
1.1%
Connector Punctuation 90
 
0.9%
Other Punctuation 7
 
0.1%
Uppercase Letter 7
 
0.1%
Lowercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
567
 
8.7%
559
 
8.6%
472
 
7.3%
406
 
6.3%
393
 
6.1%
378
 
5.8%
373
 
5.7%
249
 
3.8%
190
 
2.9%
184
 
2.8%
Other values (209) 2717
41.9%
Decimal Number
ValueCountFrequency (%)
1 271
20.0%
2 197
14.5%
5 144
10.6%
3 121
8.9%
4 118
8.7%
9 116
8.6%
6 104
 
7.7%
7 104
 
7.7%
0 94
 
6.9%
8 85
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
K 3
42.9%
D 2
28.6%
Y 2
28.6%
Space Separator
ValueCountFrequency (%)
1890
100.0%
Open Punctuation
ValueCountFrequency (%)
( 149
100.0%
Close Punctuation
ValueCountFrequency (%)
) 148
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 113
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 90
100.0%
Other Punctuation
ValueCountFrequency (%)
. 7
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6488
63.3%
Common 3751
36.6%
Latin 10
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
567
 
8.7%
559
 
8.6%
472
 
7.3%
406
 
6.3%
393
 
6.1%
378
 
5.8%
373
 
5.7%
249
 
3.8%
190
 
2.9%
184
 
2.8%
Other values (209) 2717
41.9%
Common
ValueCountFrequency (%)
1890
50.4%
1 271
 
7.2%
2 197
 
5.3%
( 149
 
4.0%
) 148
 
3.9%
5 144
 
3.8%
3 121
 
3.2%
4 118
 
3.1%
9 116
 
3.1%
- 113
 
3.0%
Other values (6) 484
 
12.9%
Latin
ValueCountFrequency (%)
m 3
30.0%
K 3
30.0%
D 2
20.0%
Y 2
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6488
63.3%
ASCII 3761
36.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1890
50.3%
1 271
 
7.2%
2 197
 
5.2%
( 149
 
4.0%
) 148
 
3.9%
5 144
 
3.8%
3 121
 
3.2%
4 118
 
3.1%
9 116
 
3.1%
- 113
 
3.0%
Other values (10) 494
 
13.1%
Hangul
ValueCountFrequency (%)
567
 
8.7%
559
 
8.6%
472
 
7.3%
406
 
6.3%
393
 
6.1%
378
 
5.8%
373
 
5.7%
249
 
3.8%
190
 
2.9%
184
 
2.8%
Other values (209) 2717
41.9%

신고기준년도
Real number (ℝ)

HIGH CORRELATION 

Distinct24
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2010.0178
Minimum1998
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-12T21:28:33.580966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1998
5-th percentile1999
Q12002
median2009
Q32019
95-th percentile2020
Maximum2022
Range24
Interquartile range (IQR)17

Descriptive statistics

Standard deviation7.9902484
Coefficient of variation (CV)0.0039752127
Kurtosis-1.6738209
Mean2010.0178
Median Absolute Deviation (MAD)8
Skewness0.079172775
Sum789937
Variance63.84407
MonotonicityNot monotonic
2023-12-12T21:28:33.752990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
2002 54
13.7%
2020 50
12.7%
2019 42
10.7%
2004 37
9.4%
2003 29
 
7.4%
1999 27
 
6.9%
2018 27
 
6.9%
2017 18
 
4.6%
2012 14
 
3.6%
2000 13
 
3.3%
Other values (14) 82
20.9%
ValueCountFrequency (%)
1998 2
 
0.5%
1999 27
6.9%
2000 13
 
3.3%
2001 8
 
2.0%
2002 54
13.7%
2003 29
7.4%
2004 37
9.4%
2005 1
 
0.3%
2006 8
 
2.0%
2007 9
 
2.3%
ValueCountFrequency (%)
2022 4
 
1.0%
2021 10
 
2.5%
2020 50
12.7%
2019 42
10.7%
2018 27
6.9%
2017 18
 
4.6%
2015 5
 
1.3%
2014 1
 
0.3%
2013 6
 
1.5%
2012 14
 
3.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2022-09-15
393 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-09-15
2nd row2022-09-15
3rd row2022-09-15
4th row2022-09-15
5th row2022-09-15

Common Values

ValueCountFrequency (%)
2022-09-15 393
100.0%

Length

2023-12-12T21:28:33.930775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:28:34.065918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-09-15 393
100.0%

Interactions

2023-12-12T21:28:26.935996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:28:34.177524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물구분폐기물종류처리방법신고기준년도
폐기물구분1.0000.8980.6570.699
폐기물종류0.8981.0000.9680.842
처리방법0.6570.9681.0000.729
신고기준년도0.6990.8420.7291.000
2023-12-12T21:28:34.318953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물구분처리방법
폐기물구분1.0000.514
처리방법0.5141.000
2023-12-12T21:28:34.455149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신고기준년도폐기물구분처리방법
신고기준년도1.0000.5460.362
폐기물구분0.5461.0000.514
처리방법0.3620.5141.000

Missing values

2023-12-12T21:28:27.103193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:28:27.315747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

폐기물구분상호명폐기물종류사업자등록번호연락처운반자명처리업소명처리방법사업장도로명주소신고기준년도데이터기준일자
0사업장폐기물(6호)(주)하나그린폐합성수지류(폐염화비닐수지류는 제외한다)302-81-22160043-731-8859우진환경개발(주)우진환경개발(주)중간처분(일반소각)충청북도 옥천군 청산면 판수길 21020222022-09-15
1사업장폐기물(6호)향미김치음식물류폐기물118-90-22776043-733-9745(주)푸른환경산업그린웨이재활용(농업생산활동에 사용)충청북도 옥천군 동이면 동이농공길 73-1020222022-09-15
2사업장폐기물(6호)환경시설관리주식회사하수처리오니339-81-01041043-733-9982청지환경(주)화산한일농장재활용(원료 제조)충청북도 옥천군 군북면 이백6길 65 (옥천공공하수처리시설)20222022-09-15
3사업장폐기물(6호)반석산업폐합성수지류(폐염화비닐수지류는 제외한다)314-14-31414043-731-7397(주)대림환경산업(주)알엔이옥산공장재활용(원료 제조)충청북도 옥천군 군서면 동평3길 45-6120222022-09-15
4사업장폐기물(6호)옥천아스콘(주)폐아스팔트콘크리트619-87-00197043-733-8823이레환경산업(주)인선기업(주)중간처분(파쇄.분쇄)충청북도 옥천군 이원면 건진2길 4920212022-09-15
5사업장폐기물(6호)대진아스콘(주)폐아스팔트콘크리트319-81-00427043-733-8823이레환경산업(주)인선기업(주)중간처분(파쇄.분쇄)충청북도 옥천군 이원면 건진2길 5320212022-09-15
6사업장폐기물(6호)용진환경 주식회사(청산산단)그 밖의 폐수처리오니302-81-11227043-745-1550(주)다온대산재활용(토질개선에 사용)충청북도 옥천군 청산면 인정1길 2520202022-09-15
7사업장폐기물(6호)용진환경 주식회사(청산산단)그 밖의 폐수처리오니302-81-11227043-745-1550(주)다온그린용산농원재활용(토질개선에 사용)충청북도 옥천군 청산면 인정1길 2520202022-09-15
8사업장폐기물(6호)옥천군청(환경과)폐폴리염화비닐수지류302-83-00836043-730-3452럭키소재럭키소재재활용(원료 제조)충청북도 옥천군 옥천읍 중앙로 9920202022-09-15
9사업장폐기물(6호)옥천군청(환경과)폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(접착제_ 페인트_ 기름_ 콘크리트 등의 물질이 사용된 목재를 말한다)302-83-00836043-730-3452(주)우림(주)우림재활용(연료·고형연료제품 제조)충청북도 옥천군 옥천읍 중앙로 9920202022-09-15
폐기물구분상호명폐기물종류사업자등록번호연락처운반자명처리업소명처리방법사업장도로명주소신고기준년도데이터기준일자
383사업장폐기물(7호)현대건설(주)폐합성수지101-81-16293043-731-1902우진공영(주)우진환경개발(주)중간처분(기타)충청북도 옥천군 옥천읍 옥각길 920022022-09-15
384사업장폐기물(7호)현대건설(주)건설폐재류101-81-16293043-731-1902(주)세명산업개발(주)세명산업개발중간처분(기타)충청북도 옥천군 옥천읍 옥각길 920022022-09-15
385사업장폐기물(7호)현대건설(주)폐콘크리트101-81-16293043-731-1902(주)세명산업개발(주)세명산업개발중간처분(기타)충청북도 옥천군 옥천읍 옥각길 920022022-09-15
386사업장폐기물(7호)현대건설(주)폐콘크리트101-81-16293043-731-1902자가처리자가재활용(파쇄.분쇄)충청북도 옥천군 옥천읍 옥각길 920022022-09-15
387사업장폐기물(7호)현대건설(주)폐아스팔트콘크리트101-81-16293043-731-1902(주)세명산업개발(주)세명산업개발중간처분(기타)충청북도 옥천군 옥천읍 옥각길 920022022-09-15
388사업장폐기물(7호)현대건설(주)폐아스팔트콘크리트101-81-16293043-731-1902자가처리자가재활용(파쇄.분쇄)충청북도 옥천군 옥천읍 옥각길 920022022-09-15
389사업장폐기물(7호)현대건설(주)폐목재류101-81-16293043-731-1902(주)세림산업(주)진웅산업중간처분(기타)충청북도 옥천군 옥천읍 옥각길 920022022-09-15
390사업장폐기물(7호)청봉건설(주)폐콘크리트302-81-00021043-732-7743자가처리자가재활용(원료가공)충청북도 영동군 영동읍 계산로2길 16 ((안내면 담양리 담양1_2교 가설공사))20022022-09-15
391사업장폐기물(7호)(주)광신건설폐콘크리트302-81-04901043-872-5020자가처리자가재활용(원료가공)충청북도 옥천군 옥천읍 삼양로5길 21-720022022-09-15
392사업장폐기물(7호)정토건설(주)폐콘크리트303-81-17321043-872-5020자가처리자가재활용(기타)충청북도 음성군 음성읍 중앙로 174 ((안남면 도덕리 서모선농어촌도로 확.포장공사))20022022-09-15

Duplicate rows

Most frequently occurring

폐기물구분상호명폐기물종류사업자등록번호연락처운반자명처리업소명처리방법사업장도로명주소신고기준년도데이터기준일자# duplicates
7사업장폐기물(7호)대신이앤씨(주)하수준설토250-88-00001043-733-6015대신이엔씨(주)(유)천만금재활용(중간가공폐기물 제조)충청북도 옥천군 옥천읍 향수3길 7520202022-09-158
3사업장폐기물(7호)대신이앤씨(주)하수준설토250-88-00001043-733-6015(주)경성건설(유)천만금재활용(중간가공폐기물 제조)충청북도 옥천군 옥천읍 동부로 28_ 1층20192022-09-157
4사업장폐기물(7호)대신이앤씨(주)하수준설토250-88-00001043-733-6015(주)경성건설(유)천만금재활용(중간가공폐기물 제조)충청북도 옥천군 옥천읍 향수3길 7520202022-09-155
6사업장폐기물(7호)대신이앤씨(주)하수준설토250-88-00001043-733-6015대신이엔씨(주)(유)천만금재활용(중간가공폐기물 제조)충청북도 옥천군 옥천읍 동부로 28_ 1층20192022-09-153
9사업장폐기물(7호)대주건설주식회사하수준설토210-81-26867043-731-5965대주건설(주)(유)천만금재활용(성토재·복토재 등으로 사용)충청북도 옥천군 옥천읍 마장로 74_ 사무소20202022-09-153
0사업장폐기물(6호)국제종합기계(주)폐합성수지류(폐염화비닐수지류는 제외한다)129-81-00751043-730-1340유림환경(합)두제에너지산업(주)재활용(중간가공폐기물 제조)충청북도 옥천군 옥천읍 서부로 4919992022-09-152
1사업장폐기물(6호)청산양곡가공공장사업장폐기물302-02-54575043-733-9385농가농가재활용(퇴비화)충청북도 옥천군 청산면 남부로 227320062022-09-152
2사업장폐기물(7호)(주)송암조경폐목재류(원목의 용도 그대로 사용하는 나무뿌리ㆍ가지 등을 제거한 원줄기는 제외한다.)302-81-19665043-733-6845(주)우림(주)우림재활용(파쇄.분쇄)충청북도 옥천군 옥천읍 삼청2길 156-620122022-09-152
5사업장폐기물(7호)대신이앤씨(주)하수준설토250-88-00001043-733-6015(주)경성건설(유)천만금재활용(중간가공폐기물 제조)충청북도 옥천군 옥천읍 향수3길 7520212022-09-152
8사업장폐기물(7호)대신이앤씨주식회사하수준설토302-81-20653043-733-6015대신이앤씨주식회사충청환경산업재활용(기타)충청북도 옥천군 옥천읍 삼양로 5220112022-09-152