Overview

Dataset statistics

Number of variables11
Number of observations691
Missing cells47
Missing cells (%)0.6%
Duplicate rows16
Duplicate rows (%)2.3%
Total size in memory59.5 KiB
Average record size in memory88.2 B

Variable types

Categorical3
Text8

Dataset

Description진주시 사업장폐기물 및 지정폐기물 배출자 현황
Author경상남도 진주시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15081147

Alerts

Dataset has 16 (2.3%) duplicate rowsDuplicates
구분 is highly overall correlated with 처리방법High correlation
처리방법 is highly overall correlated with 구분High correlation
처리구분 is highly imbalanced (81.8%)Imbalance
전화번호 has 44 (6.4%) missing valuesMissing

Reproduction

Analysis started2023-12-10 23:12:19.312658
Analysis finished2023-12-10 23:12:20.388999
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
일반
454 
지정
237 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 454
65.7%
지정 237
34.3%

Length

2023-12-11T08:12:20.445934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:12:20.539529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 454
65.7%
지정 237
34.3%

상호
Text

Distinct220
Distinct (%)31.8%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-11T08:12:20.732116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length19
Mean length9.4023155
Min length4

Characters and Unicode

Total characters6497
Distinct characters259
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique81 ?
Unique (%)11.7%

Sample

1st row동일팩키지(주)진주공장
2nd row동일팩키지(주)진주공장
3rd row동일팩키지(주)진주공장
4th row우성정공(주)
5th row우성정공(주)
ValueCountFrequency (%)
경상국립대학교 28
 
3.2%
주)동경산업 21
 
2.4%
주)경동산업 21
 
2.4%
환경시설관리 20
 
2.3%
무림페이퍼(주 18
 
2.1%
공군교육사령부 17
 
2.0%
사)한국자동차세정협회경상남도지회 16
 
1.8%
진주점 16
 
1.8%
시민환경 15
 
1.7%
주식회사 15
 
1.7%
Other values (235) 682
78.5%
2023-12-11T08:12:21.078699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
554
 
8.5%
( 407
 
6.3%
) 407
 
6.3%
186
 
2.9%
178
 
2.7%
178
 
2.7%
150
 
2.3%
142
 
2.2%
136
 
2.1%
135
 
2.1%
Other values (249) 4024
61.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5461
84.1%
Open Punctuation 407
 
6.3%
Close Punctuation 407
 
6.3%
Space Separator 178
 
2.7%
Decimal Number 14
 
0.2%
Lowercase Letter 12
 
0.2%
Uppercase Letter 8
 
0.1%
Dash Punctuation 6
 
0.1%
Other Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
554
 
10.1%
186
 
3.4%
178
 
3.3%
150
 
2.7%
142
 
2.6%
136
 
2.5%
135
 
2.5%
121
 
2.2%
97
 
1.8%
89
 
1.6%
Other values (233) 3673
67.3%
Lowercase Letter
ValueCountFrequency (%)
e 4
33.3%
n 2
16.7%
t 2
16.7%
o 2
16.7%
h 2
16.7%
Decimal Number
ValueCountFrequency (%)
2 8
57.1%
3 5
35.7%
4 1
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
J 4
50.0%
T 2
25.0%
S 2
25.0%
Open Punctuation
ValueCountFrequency (%)
( 407
100.0%
Close Punctuation
ValueCountFrequency (%)
) 407
100.0%
Space Separator
ValueCountFrequency (%)
178
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5465
84.1%
Common 1012
 
15.6%
Latin 20
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
554
 
10.1%
186
 
3.4%
178
 
3.3%
150
 
2.7%
142
 
2.6%
136
 
2.5%
135
 
2.5%
121
 
2.2%
97
 
1.8%
89
 
1.6%
Other values (234) 3677
67.3%
Latin
ValueCountFrequency (%)
e 4
20.0%
J 4
20.0%
T 2
10.0%
n 2
10.0%
S 2
10.0%
t 2
10.0%
o 2
10.0%
h 2
10.0%
Common
ValueCountFrequency (%)
( 407
40.2%
) 407
40.2%
178
17.6%
2 8
 
0.8%
- 6
 
0.6%
3 5
 
0.5%
4 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5461
84.1%
ASCII 1032
 
15.9%
None 4
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
554
 
10.1%
186
 
3.4%
178
 
3.3%
150
 
2.7%
142
 
2.6%
136
 
2.5%
135
 
2.5%
121
 
2.2%
97
 
1.8%
89
 
1.6%
Other values (233) 3673
67.3%
ASCII
ValueCountFrequency (%)
( 407
39.4%
) 407
39.4%
178
17.2%
2 8
 
0.8%
- 6
 
0.6%
3 5
 
0.5%
e 4
 
0.4%
J 4
 
0.4%
T 2
 
0.2%
n 2
 
0.2%
Other values (5) 9
 
0.9%
None
ValueCountFrequency (%)
4
100.0%
Distinct227
Distinct (%)32.9%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-11T08:12:21.338319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length32
Mean length16.882779
Min length10

Characters and Unicode

Total characters11666
Distinct characters168
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique84 ?
Unique (%)12.2%

Sample

1st row남강로 1303 (상평동)
2nd row남강로 1303 (상평동)
3rd row남강로 1303 (상평동)
4th row진성면 동부로1259번길 54
5th row진성면 동부로1259번길 54
ValueCountFrequency (%)
진주시 252
 
10.3%
대곡면 99
 
4.0%
상평동 88
 
3.6%
남강로 78
 
3.2%
사봉면 49
 
2.0%
정촌면 49
 
2.0%
상대동 42
 
1.7%
진성면 39
 
1.6%
진주대로 39
 
1.6%
경상남도 37
 
1.5%
Other values (315) 1676
68.5%
2023-12-11T08:12:21.743063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1770
 
15.2%
640
 
5.5%
1 569
 
4.9%
445
 
3.8%
426
 
3.7%
3 377
 
3.2%
4 354
 
3.0%
352
 
3.0%
319
 
2.7%
( 314
 
2.7%
Other values (158) 6100
52.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6206
53.2%
Decimal Number 2854
24.5%
Space Separator 1770
 
15.2%
Open Punctuation 314
 
2.7%
Close Punctuation 310
 
2.7%
Dash Punctuation 136
 
1.2%
Connector Punctuation 63
 
0.5%
Uppercase Letter 13
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
640
 
10.3%
445
 
7.2%
426
 
6.9%
352
 
5.7%
319
 
5.1%
285
 
4.6%
280
 
4.5%
253
 
4.1%
246
 
4.0%
179
 
2.9%
Other values (139) 2781
44.8%
Decimal Number
ValueCountFrequency (%)
1 569
19.9%
3 377
13.2%
4 354
12.4%
2 311
10.9%
0 261
9.1%
6 228
8.0%
7 216
 
7.6%
8 214
 
7.5%
5 168
 
5.9%
9 156
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
B 5
38.5%
F 3
23.1%
G 3
23.1%
L 2
 
15.4%
Space Separator
ValueCountFrequency (%)
1770
100.0%
Open Punctuation
ValueCountFrequency (%)
( 314
100.0%
Close Punctuation
ValueCountFrequency (%)
) 310
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 136
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6206
53.2%
Common 5447
46.7%
Latin 13
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
640
 
10.3%
445
 
7.2%
426
 
6.9%
352
 
5.7%
319
 
5.1%
285
 
4.6%
280
 
4.5%
253
 
4.1%
246
 
4.0%
179
 
2.9%
Other values (139) 2781
44.8%
Common
ValueCountFrequency (%)
1770
32.5%
1 569
 
10.4%
3 377
 
6.9%
4 354
 
6.5%
( 314
 
5.8%
2 311
 
5.7%
) 310
 
5.7%
0 261
 
4.8%
6 228
 
4.2%
7 216
 
4.0%
Other values (5) 737
13.5%
Latin
ValueCountFrequency (%)
B 5
38.5%
F 3
23.1%
G 3
23.1%
L 2
 
15.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6206
53.2%
ASCII 5460
46.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1770
32.4%
1 569
 
10.4%
3 377
 
6.9%
4 354
 
6.5%
( 314
 
5.8%
2 311
 
5.7%
) 310
 
5.7%
0 261
 
4.8%
6 228
 
4.2%
7 216
 
4.0%
Other values (9) 750
13.7%
Hangul
ValueCountFrequency (%)
640
 
10.3%
445
 
7.2%
426
 
6.9%
352
 
5.7%
319
 
5.1%
285
 
4.6%
280
 
4.5%
253
 
4.1%
246
 
4.0%
179
 
2.9%
Other values (139) 2781
44.8%
Distinct198
Distinct (%)28.8%
Missing3
Missing (%)0.4%
Memory size5.5 KiB
2023-12-11T08:12:21.962646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters8256
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)10.8%

Sample

1st row613-85-13308
2nd row613-85-13308
3rd row613-85-13308
4th row613-81-10088
5th row613-81-10088
ValueCountFrequency (%)
613-83-00570 29
 
4.2%
384-83-00013 25
 
3.6%
613-81-58592 21
 
3.1%
613-81-49924 21
 
3.1%
339-81-01041 20
 
2.9%
613-81-00289 18
 
2.6%
613-83-03617 17
 
2.5%
609-82-05832 16
 
2.3%
613-16-56655 15
 
2.2%
114-81-65558 13
 
1.9%
Other values (188) 493
71.7%
2023-12-11T08:12:22.282001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1376
16.7%
1 1275
15.4%
3 1011
12.2%
0 932
11.3%
8 916
11.1%
6 806
9.8%
2 476
 
5.8%
5 473
 
5.7%
9 383
 
4.6%
4 370
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6880
83.3%
Dash Punctuation 1376
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1275
18.5%
3 1011
14.7%
0 932
13.5%
8 916
13.3%
6 806
11.7%
2 476
 
6.9%
5 473
 
6.9%
9 383
 
5.6%
4 370
 
5.4%
7 238
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 1376
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8256
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1376
16.7%
1 1275
15.4%
3 1011
12.2%
0 932
11.3%
8 916
11.1%
6 806
9.8%
2 476
 
5.8%
5 473
 
5.7%
9 383
 
4.6%
4 370
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8256
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1376
16.7%
1 1275
15.4%
3 1011
12.2%
0 932
11.3%
8 916
11.1%
6 806
9.8%
2 476
 
5.8%
5 473
 
5.7%
9 383
 
4.6%
4 370
 
4.5%

전화번호
Text

MISSING 

Distinct190
Distinct (%)29.4%
Missing44
Missing (%)6.4%
Memory size5.5 KiB
2023-12-11T08:12:22.521174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.973725
Min length2

Characters and Unicode

Total characters7747
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)10.4%

Sample

1st row055-755-1211
2nd row055-755-1211
3rd row055-755-1211
4th row055-758-1256
5th row055-758-1256
ValueCountFrequency (%)
055-772-1004 21
 
3.2%
055-744-6678 21
 
3.2%
055-745-6698 21
 
3.2%
055-751-1431 18
 
2.8%
055-750-4121 17
 
2.6%
055-743-3469 16
 
2.5%
055-758-8900 15
 
2.3%
055-741-3550 12
 
1.9%
055-757-0963 9
 
1.4%
055-750-3791 9
 
1.4%
Other values (178) 488
75.4%
2023-12-11T08:12:22.895548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 1804
23.3%
- 1261
16.3%
0 1234
15.9%
7 859
11.1%
1 523
 
6.8%
4 480
 
6.2%
2 358
 
4.6%
6 348
 
4.5%
9 318
 
4.1%
3 269
 
3.5%
Other values (3) 293
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6459
83.4%
Dash Punctuation 1261
 
16.3%
Other Punctuation 26
 
0.3%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1804
27.9%
0 1234
19.1%
7 859
13.3%
1 523
 
8.1%
4 480
 
7.4%
2 358
 
5.5%
6 348
 
5.4%
9 318
 
4.9%
3 269
 
4.2%
8 266
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 1261
100.0%
Other Punctuation
ValueCountFrequency (%)
' 26
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7747
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 1804
23.3%
- 1261
16.3%
0 1234
15.9%
7 859
11.1%
1 523
 
6.8%
4 480
 
6.2%
2 358
 
4.6%
6 348
 
4.5%
9 318
 
4.1%
3 269
 
3.5%
Other values (3) 293
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7747
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 1804
23.3%
- 1261
16.3%
0 1234
15.9%
7 859
11.1%
1 523
 
6.8%
4 480
 
6.2%
2 358
 
4.6%
6 348
 
4.5%
9 318
 
4.1%
3 269
 
3.5%
Other values (3) 293
 
3.8%
Distinct218
Distinct (%)31.5%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-11T08:12:23.147517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length12.085384
Min length11

Characters and Unicode

Total characters8351
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79 ?
Unique (%)11.4%

Sample

1st row2007년08월08일
2nd row2007년08월08일
3rd row2007년08월08일
4th row2005년03월04일
5th row2005년03월04일
ValueCountFrequency (%)
37
 
3.0%
12월 34
 
2.8%
03월 33
 
2.7%
2013년 31
 
2.5%
2008년 28
 
2.3%
29일 28
 
2.3%
2011년 26
 
2.1%
02일 24
 
1.9%
02월 23
 
1.9%
2015년 23
 
1.9%
Other values (199) 945
76.7%
2023-12-11T08:12:23.586277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1877
22.5%
2 1217
14.6%
1 991
11.9%
750
 
9.0%
691
 
8.3%
691
 
8.3%
691
 
8.3%
3 281
 
3.4%
8 250
 
3.0%
9 217
 
2.6%
Other values (4) 695
 
8.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5528
66.2%
Other Letter 2073
 
24.8%
Space Separator 750
 
9.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1877
34.0%
2 1217
22.0%
1 991
17.9%
3 281
 
5.1%
8 250
 
4.5%
9 217
 
3.9%
7 215
 
3.9%
6 196
 
3.5%
5 162
 
2.9%
4 122
 
2.2%
Other Letter
ValueCountFrequency (%)
691
33.3%
691
33.3%
691
33.3%
Space Separator
ValueCountFrequency (%)
750
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6278
75.2%
Hangul 2073
 
24.8%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1877
29.9%
2 1217
19.4%
1 991
15.8%
750
 
11.9%
3 281
 
4.5%
8 250
 
4.0%
9 217
 
3.5%
7 215
 
3.4%
6 196
 
3.1%
5 162
 
2.6%
Hangul
ValueCountFrequency (%)
691
33.3%
691
33.3%
691
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6278
75.2%
Hangul 2073
 
24.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1877
29.9%
2 1217
19.4%
1 991
15.8%
750
 
11.9%
3 281
 
4.5%
8 250
 
4.0%
9 217
 
3.5%
7 215
 
3.4%
6 196
 
3.1%
5 162
 
2.6%
Hangul
ValueCountFrequency (%)
691
33.3%
691
33.3%
691
33.3%
Distinct84
Distinct (%)12.2%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-11T08:12:23.897251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length64
Mean length21.342981
Min length2

Characters and Unicode

Total characters14748
Distinct characters234
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)3.3%

Sample

1st row사업장폐기물 소각시설 바닥재
2nd row폐토사
3rd row그 밖의 폐기물
4th row그 밖의 폐목재류
5th row그 밖의 공정오니
ValueCountFrequency (%)
262
 
9.8%
밖의 262
 
9.8%
제외한다 119
 
4.5%
폐유 117
 
4.4%
폐합성수지류(폐염화비닐수지류는 112
 
4.2%
말한다 107
 
4.0%
82
 
3.1%
등을 66
 
2.5%
20퍼센트 54
 
2.0%
이상의 54
 
2.0%
Other values (167) 1434
53.7%
2023-12-11T08:12:24.301071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1988
 
13.5%
937
 
6.4%
624
 
4.2%
426
 
2.9%
376
 
2.5%
321
 
2.2%
316
 
2.1%
309
 
2.1%
283
 
1.9%
265
 
1.8%
Other values (224) 8903
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11443
77.6%
Space Separator 1988
 
13.5%
Lowercase Letter 324
 
2.2%
Open Punctuation 305
 
2.1%
Close Punctuation 305
 
2.1%
Connector Punctuation 196
 
1.3%
Decimal Number 187
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
937
 
8.2%
624
 
5.5%
426
 
3.7%
376
 
3.3%
321
 
2.8%
316
 
2.8%
309
 
2.7%
283
 
2.5%
265
 
2.3%
262
 
2.3%
Other values (207) 7324
64.0%
Lowercase Letter
ValueCountFrequency (%)
e 108
33.3%
g 54
16.7%
a 54
16.7%
r 54
16.7%
s 54
16.7%
Decimal Number
ValueCountFrequency (%)
2 97
51.9%
0 54
28.9%
8 18
 
9.6%
1 18
 
9.6%
Open Punctuation
ValueCountFrequency (%)
( 233
76.4%
[ 54
 
17.7%
18
 
5.9%
Close Punctuation
ValueCountFrequency (%)
) 233
76.4%
] 54
 
17.7%
18
 
5.9%
Space Separator
ValueCountFrequency (%)
1988
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 196
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11443
77.6%
Common 2981
 
20.2%
Latin 324
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
937
 
8.2%
624
 
5.5%
426
 
3.7%
376
 
3.3%
321
 
2.8%
316
 
2.8%
309
 
2.7%
283
 
2.5%
265
 
2.3%
262
 
2.3%
Other values (207) 7324
64.0%
Common
ValueCountFrequency (%)
1988
66.7%
( 233
 
7.8%
) 233
 
7.8%
_ 196
 
6.6%
2 97
 
3.3%
[ 54
 
1.8%
] 54
 
1.8%
0 54
 
1.8%
18
 
0.6%
18
 
0.6%
Other values (2) 36
 
1.2%
Latin
ValueCountFrequency (%)
e 108
33.3%
g 54
16.7%
a 54
16.7%
r 54
16.7%
s 54
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11212
76.0%
ASCII 3269
 
22.2%
Compat Jamo 231
 
1.6%
None 36
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1988
60.8%
( 233
 
7.1%
) 233
 
7.1%
_ 196
 
6.0%
e 108
 
3.3%
2 97
 
3.0%
g 54
 
1.7%
[ 54
 
1.7%
a 54
 
1.7%
r 54
 
1.7%
Other values (5) 198
 
6.1%
Hangul
ValueCountFrequency (%)
937
 
8.4%
624
 
5.6%
426
 
3.8%
376
 
3.4%
321
 
2.9%
316
 
2.8%
309
 
2.8%
283
 
2.5%
265
 
2.4%
262
 
2.3%
Other values (206) 7093
63.3%
Compat Jamo
ValueCountFrequency (%)
231
100.0%
None
ValueCountFrequency (%)
18
50.0%
18
50.0%

처리구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
위탁
672 
자가
 
19

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row위탁
2nd row위탁
3rd row자가
4th row위탁
5th row위탁

Common Values

ValueCountFrequency (%)
위탁 672
97.3%
자가 19
 
2.7%

Length

2023-12-11T08:12:24.421971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:12:24.517926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
위탁 672
97.3%
자가 19
 
2.7%
Distinct198
Distinct (%)28.7%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-11T08:12:24.714053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length30
Mean length7.0709117
Min length2

Characters and Unicode

Total characters4886
Distinct characters202
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)14.9%

Sample

1st row그린자원
2nd row그린자원
3rd row자가
4th row(주)아시아환경
5th row(주)아시아환경
ValueCountFrequency (%)
주)청호산업 73
 
10.0%
자가 39
 
5.4%
주)유니환경 36
 
4.9%
주)동남정유 29
 
4.0%
주)메탈링크 24
 
3.3%
주)동남정유제2공장 23
 
3.2%
자가운반 21
 
2.9%
일신리사이클링 16
 
2.2%
주)오엔이 14
 
1.9%
주)아시아환경 14
 
1.9%
Other values (186) 439
60.3%
2023-12-11T08:12:25.066210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 494
 
10.1%
( 492
 
10.1%
480
 
9.8%
236
 
4.8%
199
 
4.1%
171
 
3.5%
162
 
3.3%
130
 
2.7%
100
 
2.0%
95
 
1.9%
Other values (192) 2327
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3739
76.5%
Close Punctuation 494
 
10.1%
Open Punctuation 492
 
10.1%
Decimal Number 41
 
0.8%
Space Separator 38
 
0.8%
Lowercase Letter 38
 
0.8%
Connector Punctuation 24
 
0.5%
Other Symbol 12
 
0.2%
Uppercase Letter 7
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
480
 
12.8%
236
 
6.3%
199
 
5.3%
171
 
4.6%
162
 
4.3%
130
 
3.5%
100
 
2.7%
95
 
2.5%
93
 
2.5%
90
 
2.4%
Other values (170) 1983
53.0%
Lowercase Letter
ValueCountFrequency (%)
e 11
28.9%
r 7
18.4%
g 6
15.8%
n 6
15.8%
y 6
15.8%
m 1
 
2.6%
a 1
 
2.6%
Uppercase Letter
ValueCountFrequency (%)
E 1
14.3%
F 1
14.3%
K 1
14.3%
C 1
14.3%
M 1
14.3%
O 1
14.3%
A 1
14.3%
Decimal Number
ValueCountFrequency (%)
2 40
97.6%
9 1
 
2.4%
Close Punctuation
ValueCountFrequency (%)
) 494
100.0%
Open Punctuation
ValueCountFrequency (%)
( 492
100.0%
Space Separator
ValueCountFrequency (%)
38
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 24
100.0%
Other Symbol
ValueCountFrequency (%)
12
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3751
76.8%
Common 1090
 
22.3%
Latin 45
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
480
 
12.8%
236
 
6.3%
199
 
5.3%
171
 
4.6%
162
 
4.3%
130
 
3.5%
100
 
2.7%
95
 
2.5%
93
 
2.5%
90
 
2.4%
Other values (171) 1995
53.2%
Latin
ValueCountFrequency (%)
e 11
24.4%
r 7
15.6%
g 6
13.3%
n 6
13.3%
y 6
13.3%
E 1
 
2.2%
F 1
 
2.2%
m 1
 
2.2%
a 1
 
2.2%
K 1
 
2.2%
Other values (4) 4
 
8.9%
Common
ValueCountFrequency (%)
) 494
45.3%
( 492
45.1%
2 40
 
3.7%
38
 
3.5%
_ 24
 
2.2%
9 1
 
0.1%
& 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3739
76.5%
ASCII 1135
 
23.2%
None 12
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 494
43.5%
( 492
43.3%
2 40
 
3.5%
38
 
3.3%
_ 24
 
2.1%
e 11
 
1.0%
r 7
 
0.6%
g 6
 
0.5%
n 6
 
0.5%
y 6
 
0.5%
Other values (11) 11
 
1.0%
Hangul
ValueCountFrequency (%)
480
 
12.8%
236
 
6.3%
199
 
5.3%
171
 
4.6%
162
 
4.3%
130
 
3.5%
100
 
2.7%
95
 
2.5%
93
 
2.5%
90
 
2.4%
Other values (170) 1983
53.0%
None
ValueCountFrequency (%)
12
100.0%
Distinct280
Distinct (%)40.5%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-11T08:12:25.278556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length21
Mean length8.4167873
Min length2

Characters and Unicode

Total characters5816
Distinct characters238
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique174 ?
Unique (%)25.2%

Sample

1st row여수환경
2nd row여수환경
3rd row동일팩키지(주)진주공장
4th row늘푸른자원(주)
5th row인선이엔티(주) 사천지점
ValueCountFrequency (%)
케이지이티에스(주 36
 
4.8%
주)에너지네트웍 23
 
3.0%
nc양산(주 20
 
2.6%
주)서진인바이러테크 19
 
2.5%
엠함안(주 14
 
1.9%
kc환경서비스(주)창원사업부 14
 
1.9%
지산우드텍(주 12
 
1.6%
주)단석산업군산1공장 11
 
1.5%
에코시스템(주 11
 
1.5%
주)메탈링크 11
 
1.5%
Other values (278) 585
77.4%
2023-12-11T08:12:25.653120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
612
 
10.5%
) 562
 
9.7%
( 561
 
9.6%
203
 
3.5%
170
 
2.9%
156
 
2.7%
144
 
2.5%
135
 
2.3%
120
 
2.1%
97
 
1.7%
Other values (228) 3056
52.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4408
75.8%
Close Punctuation 562
 
9.7%
Open Punctuation 561
 
9.6%
Uppercase Letter 158
 
2.7%
Space Separator 66
 
1.1%
Connector Punctuation 28
 
0.5%
Decimal Number 16
 
0.3%
Other Symbol 10
 
0.2%
Lowercase Letter 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
612
 
13.9%
203
 
4.6%
170
 
3.9%
156
 
3.5%
144
 
3.3%
135
 
3.1%
120
 
2.7%
97
 
2.2%
94
 
2.1%
79
 
1.8%
Other values (204) 2598
58.9%
Uppercase Letter
ValueCountFrequency (%)
C 51
32.3%
N 29
18.4%
K 25
15.8%
M 10
 
6.3%
A 10
 
6.3%
O 10
 
6.3%
S 6
 
3.8%
T 5
 
3.2%
E 5
 
3.2%
G 5
 
3.2%
Other values (2) 2
 
1.3%
Lowercase Letter
ValueCountFrequency (%)
k 2
28.6%
c 2
28.6%
a 1
14.3%
r 1
14.3%
m 1
14.3%
Decimal Number
ValueCountFrequency (%)
1 13
81.2%
2 3
 
18.8%
Close Punctuation
ValueCountFrequency (%)
) 562
100.0%
Open Punctuation
ValueCountFrequency (%)
( 561
100.0%
Space Separator
ValueCountFrequency (%)
66
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 28
100.0%
Other Symbol
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4418
76.0%
Common 1233
 
21.2%
Latin 165
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
612
 
13.9%
203
 
4.6%
170
 
3.8%
156
 
3.5%
144
 
3.3%
135
 
3.1%
120
 
2.7%
97
 
2.2%
94
 
2.1%
79
 
1.8%
Other values (205) 2608
59.0%
Latin
ValueCountFrequency (%)
C 51
30.9%
N 29
17.6%
K 25
15.2%
M 10
 
6.1%
A 10
 
6.1%
O 10
 
6.1%
S 6
 
3.6%
T 5
 
3.0%
E 5
 
3.0%
G 5
 
3.0%
Other values (7) 9
 
5.5%
Common
ValueCountFrequency (%)
) 562
45.6%
( 561
45.5%
66
 
5.4%
_ 28
 
2.3%
1 13
 
1.1%
2 3
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4408
75.8%
ASCII 1398
 
24.0%
None 10
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
612
 
13.9%
203
 
4.6%
170
 
3.9%
156
 
3.5%
144
 
3.3%
135
 
3.1%
120
 
2.7%
97
 
2.2%
94
 
2.1%
79
 
1.8%
Other values (204) 2598
58.9%
ASCII
ValueCountFrequency (%)
) 562
40.2%
( 561
40.1%
66
 
4.7%
C 51
 
3.6%
N 29
 
2.1%
_ 28
 
2.0%
K 25
 
1.8%
1 13
 
0.9%
M 10
 
0.7%
A 10
 
0.7%
Other values (13) 43
 
3.1%
None
ValueCountFrequency (%)
10
100.0%

처리방법
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
재활용(중간가공폐기물 제조)
147 
재활용(연료·고형연료제품 제조)
85 
재활용(원료 제조)
79 
중간처분(일반소각)
79 
매립(민간관리형매립시설)
69 
Other values (21)
232 

Length

Max length19
Median length17
Mean length13.062229
Min length4

Unique

Unique9 ?
Unique (%)1.3%

Sample

1st row매립(민간관리형매립시설)
2nd row매립(민간관리형매립시설)
3rd row중간처분(고온소각)
4th row중간처분(파쇄.분쇄)
5th row매립(민간관리형매립시설)

Common Values

ValueCountFrequency (%)
재활용(중간가공폐기물 제조) 147
21.3%
재활용(연료·고형연료제품 제조) 85
12.3%
재활용(원료 제조) 79
11.4%
중간처분(일반소각) 79
11.4%
매립(민간관리형매립시설) 69
10.0%
중간처분(고온소각) 56
 
8.1%
재활용(농업생산활동에 사용) 29
 
4.2%
재활용(성토재·복토재 등으로 사용) 25
 
3.6%
재활용(직접 제품제조) 24
 
3.5%
재활용(토질개선에 사용) 20
 
2.9%
Other values (16) 78
11.3%

Length

2023-12-11T08:12:25.780353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제조 311
27.3%
재활용(중간가공폐기물 147
12.9%
재활용(연료·고형연료제품 85
 
7.4%
재활용(원료 79
 
6.9%
중간처분(일반소각 79
 
6.9%
사용 74
 
6.5%
매립(민간관리형매립시설 69
 
6.0%
중간처분(고온소각 56
 
4.9%
재활용(직접 40
 
3.5%
재활용(농업생산활동에 29
 
2.5%
Other values (20) 172
15.1%

Correlations

2023-12-11T08:12:25.852539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분폐기물 종류처리구분처리방법
구분1.0001.0000.1650.744
폐기물 종류1.0001.0000.5940.974
처리구분0.1650.5941.0000.413
처리방법0.7440.9740.4131.000
2023-12-11T08:12:25.935760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리방법구분처리구분
처리방법1.0000.5960.323
구분0.5961.0000.106
처리구분0.3230.1061.000
2023-12-11T08:12:26.006374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분처리구분처리방법
구분1.0000.1060.596
처리구분0.1061.0000.323
처리방법0.5960.3231.000

Missing values

2023-12-11T08:12:20.074074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:12:20.246627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T08:12:20.342978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분상호사업장주소사업자등록번호전화번호신고일폐기물 종류처리구분운반자처리업소명처리방법
0일반동일팩키지(주)진주공장남강로 1303 (상평동)613-85-13308055-755-12112007년08월08일사업장폐기물 소각시설 바닥재위탁그린자원여수환경매립(민간관리형매립시설)
1일반동일팩키지(주)진주공장남강로 1303 (상평동)613-85-13308055-755-12112007년08월08일폐토사위탁그린자원여수환경매립(민간관리형매립시설)
2일반동일팩키지(주)진주공장남강로 1303 (상평동)613-85-13308055-755-12112007년08월08일그 밖의 폐기물자가자가동일팩키지(주)진주공장중간처분(고온소각)
3일반우성정공(주)진성면 동부로1259번길 54613-81-10088055-758-12562005년03월04일그 밖의 폐목재류위탁(주)아시아환경늘푸른자원(주)중간처분(파쇄.분쇄)
4일반우성정공(주)진성면 동부로1259번길 54613-81-10088055-758-12562005년03월04일그 밖의 공정오니위탁(주)아시아환경인선이엔티(주) 사천지점매립(민간관리형매립시설)
5일반우성정공(주)진성면 동부로1259번길 54613-81-10088055-758-12562005년03월04일폐합성수지류(폐염화비닐수지류는 제외한다)위탁(주)아시아환경(주)아시아그린재활용(중간가공폐기물 제조)
6일반남강제지(주)남강로1367번길 36 (상대동)613-81-01876055-752-17172002년02월19일펄프ㆍ제지폐수처리오니위탁(주)청호산업_케이엠그린산업(주)(주)케이엠그린구미점매립(민간관리형매립시설)
7일반남강제지(주)남강로1367번길 36 (상대동)613-81-01876055-752-17172002년02월19일사업장폐기물 소각시설 바닥재위탁전진환경_이에프실업(주)지엠이앤씨 (주)여수환경산업매립(민간관리형매립시설)
8일반남강제지(주)남강로1367번길 36 (상대동)613-81-01876055-752-17172002년02월19일펄프ㆍ제지폐수처리오니위탁청호산업구양실업재활용(원료 제조)
9일반남강제지(주)남강로1367번길 36 (상대동)613-81-01876055-752-17172002년02월19일펄프ㆍ제지폐수처리오니자가자가남강제지(주)재활용(직접 에너지회수)
구분상호사업장주소사업자등록번호전화번호신고일폐기물 종류처리구분운반자처리업소명처리방법
681지정청원모터스경상남도 진주시 모덕로 72-4 (상대동)613-05-73319<NA>2022 년01 월10 일폐황산이 포함된 2차폐축전지위탁(주)메탈링크(주)단석산업군산1공장재활용(원료 제조)
682지정경상남도산림환경연구원경상남도 진주시 이반성면 수목원로 386613-83-01036'05525438112021 년12 월13 일연구ㆍ검사용 폐시약위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)
683지정경상남도산림환경연구원경상남도 진주시 이반성면 수목원로 386613-83-01036'05525438112021 년12 월13 일그 밖의 폐유기용제위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)
684지정경상남도산림환경연구원경상남도 진주시 이반성면 수목원로 386613-83-01036'05525438112021 년12 월13 일그 밖의 폐유기용제위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)
685지정경상남도산림환경연구원경상남도 진주시 이반성면 수목원로 386613-83-01036'05525438112021 년12 월13 일연구ㆍ검사용 폐시약위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)
686지정(주)국민렌탈경상남도 진주시 사봉면 산업단지로44번길 41-60611-88-00109<NA>2021 년11 월02 일폐기계유ㆍ폐작동유(공업용 기계유ㆍ냉동기유ㆍ터어빈유ㆍ베어링윤활유ㆍ압축기유ㆍ유압작동유ㆍ열매체유 및 프로세스유 등을 말한다)위탁(주)동남정유제2공장주식회사 동남정유재활용(연료·고형연료제품 제조)
687지정(주)국민렌탈경상남도 진주시 사봉면 산업단지로44번길 41-60611-88-00109<NA>2021 년11 월02 일폐황산이 포함된 2차폐축전지위탁(주)퍼스트(주)상신금속재활용(원료 제조)
688지정(주)범우에이텍경상남도 진주시 정촌면 뿌리산단로15번길 81478-86-01389'05575602022021 년07 월08 일그 밖의 폐광물유[아스팔트유ㆍ그리스(grease)ㆍ방청유 및 수용성절삭유_ 20퍼센트 이상의 이물질이 함유된 폐유_ 고체상태의 폐유 등을 말한다]위탁(주)동남정유제2공장KC환경서비스(주)창원사업부재활용(중간가공폐기물 제조)
689지정진주 사랑의 밧데리경상남도 진주시 대신로106번길 9 (상평동)309-03-70248<NA>2021 년06 월02 일폐황산이 포함된 2차폐축전지위탁(주)메탈링크(주)메탈링크재활용(중간가공폐기물 제조)
690지정진주 사랑의 밧데리경상남도 진주시 대신로106번길 9 (상평동)309-03-70248<NA>2021 년06 월02 일폐황산이 포함된 2차폐축전지위탁(주)메탈링크(주)단석산업군산1공장재활용(원료 제조)

Duplicate rows

Most frequently occurring

구분상호사업장주소사업자등록번호전화번호신고일폐기물 종류처리구분운반자처리업소명처리방법# duplicates
0지정(주)성일엔텍진주시 범골로54번길 30-9_ B동 313호 (충무공동)613-81-18057055-760-62002020년 02월 03일연구ㆍ검사용 폐시약위탁이가환경케이지이티에스(주)중간처분(고온소각)2
1지정(주)웅전공업진주시 정촌면 연꽃로145번길 40-20613-81-26687055-757-31152013년 11월 20일그 밖의 폐광물유[아스팔트유ㆍ그리스(grease)ㆍ방청유 및 수용성절삭유_ 20퍼센트 이상의 이물질이 함유된 폐유_ 고체상태의 폐유 등을 말한다]위탁(주)오엔이(주)에너지네트웍중간처분(일반소각)2
2지정경상국립대학교진주시 내동면 내동로139번길 6613-83-00570055-772-10042020년 06월 02일그 밖의 폐유기용제위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)2
3지정경상국립대학교진주시 동진로 33 (칠암동)613-83-00570055-751-31452011년 05월 30일그 밖의 폐유기용제위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)2
4지정경상국립대학교진주시 진주대로 501 (가좌동)613-83-00570055-772-10042010년 03월 02일그 밖의 폐유기용제위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)2
5지정경상국립대학교 의과대학진주시 진주대로816번길 15 (주약동)613-83-00570055-772-10042010년 03월 02일그 밖의 폐유기용제위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)2
6지정경상남도농업기술원진주시 대신로 570 (초전동)613-83-03164055-254-12142015년 02월 16일그 밖의 폐유기용제위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)2
7지정경상남도농업기술원진주시 대신로 570 (초전동)613-83-03164055-254-12142015년 02월 16일그 밖의 폐유독물질위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)2
8지정경상남도산림환경연구원경상남도 진주시 이반성면 수목원로 386613-83-01036'05525438112021 년12 월13 일그 밖의 폐유기용제위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)2
9지정경상남도산림환경연구원경상남도 진주시 이반성면 수목원로 386613-83-01036'05525438112021 년12 월13 일연구ㆍ검사용 폐시약위탁(주)유니환경케이지이티에스(주)중간처분(고온소각)2