Overview

Dataset statistics

Number of variables: 9
Number of observations: 850
Missing cells: 910
Missing cells (%): 11.9%
Duplicate rows: 5
Duplicate rows (%): 0.6%
Total size in memory: 59.9 KiB
Average record size in memory: 72.2 B
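
The derived figures in this table can be checked from the raw counts. The sketch below is only an arithmetic check; the variable names are illustrative, and it assumes the percentages are taken over all cells (variables times observations) and all rows.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 9
n_observations = 850
missing_cells = 910
duplicate_rows = 5
total_size_kib = 59.9

# Assumption: "Missing cells (%)" is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = round(100 * missing_cells / total_cells, 1)

# Assumption: "Duplicate rows (%)" is relative to the number of observations.
duplicate_pct = round(100 * duplicate_rows / n_observations, 1)

# Average record size in bytes (1 KiB = 1024 B).
avg_record_bytes = round(total_size_kib * 1024 / n_observations, 1)

print(total_cells, missing_pct, duplicate_pct, avg_record_bytes)
```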

Variable types

Categorical: 4
Text: 5

Dataset

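Description: Provides information on businesses in Ulsan Metropolitan City (울산광역시) that have filed business waste discharge reports, including business name, contact number, waste name, lot-number address (지번주소), and road-name address (도로명주소).
Author: 울산광역시 (Ulsan Metropolitan City)
URL: https://www.data.go.kr/data/15068261/fileData.do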

Alerts

시_도 has constant value "울산광역시" [Constant]
데이터기준일자 has constant value "2021-09-24" [Constant]
Dataset has 5 (0.6%) duplicate rows [Duplicates]
시_군_구 is highly overall correlated with 폐기물구분 [High correlation]
폐기물구분 is highly overall correlated with 시_군_구 [High correlation]
폐기물구분 is highly imbalanced (60.6%) [Imbalance]
연락처 has 133 (15.6%) missing values [Missing]
폐기물명 has 655 (77.1%) missing values [Missing]
지번주소 has 64 (7.5%) missing values [Missing]
도로명주소 has 58 (6.8%) missing values [Missing]
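
The missing-value alerts are consistent with the dataset totals. A minimal check, assuming each percentage is the missing count divided by the 850 observations:

```python
# Missing counts per column, copied from the alerts.
n_rows = 850
missing = {"연락처": 133, "폐기물명": 655, "지번주소": 64, "도로명주소": 58}

# Share of rows with a missing value, per column.
missing_pct = {col: round(100 * n / n_rows, 1) for col, n in missing.items()}

# The per-column counts should add up to the "Missing cells" total (910).
total_missing = sum(missing.values())

print(missing_pct, total_missing)
```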

Reproduction

Analysis started: 2023-12-12 02:52:29.236574
Analysis finished: 2023-12-12 02:52:30.464403
Duration: 1.23 seconds
Software version: ydata-profiling v4.5.1
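
The reported duration follows from the two timestamps. A small sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2023-12-12 02:52:29.236574")
finished = datetime.fromisoformat("2023-12-12 02:52:30.464403")

# Elapsed wall-clock time in seconds, rounded as in the report.
duration_seconds = round((finished - started).total_seconds(), 2)
print(duration_seconds)
```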

Variables

시_도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.8 KiB
울산광역시 850

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 울산광역시
2nd row: 울산광역시
3rd row: 울산광역시
4th row: 울산광역시
5th row: 울산광역시

Common Values

Value | Count | Frequency (%)
울산광역시 | 850 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

시_군_구
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 6.8 KiB
울주군 424
남구 221
동구 106
북구 89
중구 10

Length

Max length: 3
Median length: 2
Mean length: 2.4988235
Min length: 2
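
The mean length can be reproduced from the value counts of this variable. The sketch assumes length is counted in Unicode code points of the precomposed Hangul strings, which is what Python's `len` returns here.

```python
# Value counts for 시_군_구, copied from this section.
counts = {"울주군": 424, "남구": 221, "동구": 106, "북구": 89, "중구": 10}

total = sum(counts.values())

# Weighted mean of string lengths: one 3-character name, four 2-character names.
mean_length = sum(len(name) * n for name, n in counts.items()) / total

print(total, round(mean_length, 7))
```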

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 동구
2nd row: 동구
3rd row: 동구
4th row: 동구
5th row: 동구

Common Values

Value | Count | Frequency (%)
울주군 | 424 | 49.9%
남구 | 221 | 26.0%
동구 | 106 | 12.5%
북구 | 89 | 10.5%
중구 | 10 | 1.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

업체명
Text

Distinct: 747
Distinct (%): 87.9%
Missing: 0
Missing (%): 0.0%
Memory size: 6.8 KiB

Length

Max length: 72
Median length: 19
Mean length: 7.8541176
Min length: 3

Characters and Unicode

Total characters: 6676
Distinct characters: 410
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
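
As an illustration of these character properties, Python's standard `unicodedata` module exposes the general category of each code point. This is a sketch of the idea, not the report's own implementation; the helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Tally Unicode general categories (two-letter codes) in a string."""
    return Counter(unicodedata.category(ch) for ch in text)

# "Lo" is Other Letter (Hangul syllables), "So" is Other Symbol (㈜).
counts = category_counts("현대중공업㈜")
print(counts)

# "Ps" and "Pe" are Open and Close Punctuation.
bracket_counts = category_counts("(주)")
print(bracket_counts)
```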

Unique

Unique: 723
Unique (%): 85.1%
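
"Distinct" and "Unique" measure different things: distinct counts the different values, while unique counts the values that occur exactly once. A minimal sketch on the five sample names from this section (the helper `distinct_and_unique` is hypothetical):

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

names = [
    "(주)현대미포조선",
    "(주)현대미포조선",
    "한국프랜지공업(주)제3공장",
    "현대중공업㈜",
    "(주)케이씨씨울산공장",
]
distinct, unique = distinct_and_unique(names)
print(distinct, unique)

# The reported percentages for 업체명 match counts over 850 rows.
distinct_pct = round(100 * 747 / 850, 1)
unique_pct = round(100 * 723 / 850, 1)
print(distinct_pct, unique_pct)
```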

Sample

1st row: (주)현대미포조선
2nd row: (주)현대미포조선
3rd row: 한국프랜지공업(주)제3공장
4th row: 현대중공업㈜
5th row: (주)케이씨씨울산공장

Value | Count | Frequency (%)
현대중공업(주 | 26 | 2.6%
주)케이씨씨울산공장 | 25 | 2.5%
주)현대미포조선 | 21 | 2.1%
울산공장 | 12 | 1.2%
주식회사 | 11 | 1.1%
울산광역시 | 8 | 0.8%
현대건설기계(주)울산공장 | 6 | 0.6%
주식회사현대백화점동구점 | 6 | 0.6%
한국프랜지공업(주)제3공장 | 5 | 0.5%
㈜지에스엔텍 | 4 | 0.4%
Other values (793) | 859 | 87.4%

Most occurring characters

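Value | Count | Frequency (%)
(glyph not shown) | 553 | 8.3%
(glyph not shown) | 290 | 4.3%
(glyph not shown) | 224 | 3.4%
(glyph not shown) | 190 | 2.8%
(glyph not shown) | 173 | 2.6%
(glyph not shown) | 164 | 2.5%
(glyph not shown) | 160 | 2.4%
(glyph not shown) | 160 | 2.4%
(glyph not shown) | 154 | 2.3%
(glyph not shown) | 136 | 2.0%
Other values (400) | 4472 | 67.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5570 | 83.4%
Other Symbol | 553 | 8.3%
Space Separator | 136 | 2.0%
Open Punctuation | 129 | 1.9%
Close Punctuation | 129 | 1.9%
Uppercase Letter | 81 | 1.2%
Decimal Number | 53 | 0.8%
Other Punctuation | 12 | 0.2%
Lowercase Letter | 9 | 0.1%
Dash Punctuation | 3 | < 0.1%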

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph not shown) | 290 | 5.2%
(glyph not shown) | 224 | 4.0%
(glyph not shown) | 190 | 3.4%
(glyph not shown) | 173 | 3.1%
(glyph not shown) | 164 | 2.9%
(glyph not shown) | 160 | 2.9%
(glyph not shown) | 160 | 2.9%
(glyph not shown) | 154 | 2.8%
(glyph not shown) | 116 | 2.1%
(glyph not shown) | 111 | 2.0%
Other values (354) | 3828 | 68.7%

Uppercase Letter
Value | Count | Frequency (%)
S | 11 | 13.6%
K | 11 | 13.6%
G | 7 | 8.6%
C | 6 | 7.4%
T | 6 | 7.4%
N | 5 | 6.2%
E | 4 | 4.9%
R | 4 | 4.9%
I | 4 | 4.9%
M | 3 | 3.7%
Other values (10) | 20 | 24.7%

Lowercase Letter
Value | Count | Frequency (%)
l | 2 | 22.2%
g | 1 | 11.1%
a | 1 | 11.1%
b | 1 | 11.1%
o | 1 | 11.1%
c | 1 | 11.1%
i | 1 | 11.1%
p | 1 | 11.1%

Decimal Number
Value | Count | Frequency (%)
2 | 25 | 47.2%
3 | 12 | 22.6%
1 | 10 | 18.9%
5 | 2 | 3.8%
4 | 2 | 3.8%
7 | 1 | 1.9%
9 | 1 | 1.9%

Other Punctuation
Value | Count | Frequency (%)
. | 5 | 41.7%
& | 3 | 25.0%
· | 2 | 16.7%
/ | 1 | 8.3%
: | 1 | 8.3%

Other Symbol
Value | Count | Frequency (%)
(glyph not shown) | 553 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 136 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 129 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 129 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3 | 100.0%

Math Symbol
Value | Count | Frequency (%)
> | 1 | 100.0%

Most occurring scripts

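Value | Count | Frequency (%)
Hangul | 6123 | 91.7%
Common | 463 | 6.9%
Latin | 90 | 1.3%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(glyph not shown) | 553 | 9.0%
(glyph not shown) | 290 | 4.7%
(glyph not shown) | 224 | 3.7%
(glyph not shown) | 190 | 3.1%
(glyph not shown) | 173 | 2.8%
(glyph not shown) | 164 | 2.7%
(glyph not shown) | 160 | 2.6%
(glyph not shown) | 160 | 2.6%
(glyph not shown) | 154 | 2.5%
(glyph not shown) | 116 | 1.9%
Other values (355) | 3939 | 64.3%

Latin
Value | Count | Frequency (%)
S | 11 | 12.2%
K | 11 | 12.2%
G | 7 | 7.8%
C | 6 | 6.7%
T | 6 | 6.7%
N | 5 | 5.6%
E | 4 | 4.4%
R | 4 | 4.4%
I | 4 | 4.4%
M | 3 | 3.3%
Other values (18) | 29 | 32.2%

Common
Value | Count | Frequency (%)
(glyph not shown) | 136 | 29.4%
( | 129 | 27.9%
) | 129 | 27.9%
2 | 25 | 5.4%
3 | 12 | 2.6%
1 | 10 | 2.2%
. | 5 | 1.1%
- | 3 | 0.6%
& | 3 | 0.6%
· | 2 | 0.4%
Other values (7) | 9 | 1.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5570 | 83.4%
None | 555 | 8.3%
ASCII | 551 | 8.3%

Most frequent character per block

None
Value | Count | Frequency (%)
(glyph not shown) | 553 | 99.6%
· | 2 | 0.4%

Hangul
Value | Count | Frequency (%)
(glyph not shown) | 290 | 5.2%
(glyph not shown) | 224 | 4.0%
(glyph not shown) | 190 | 3.4%
(glyph not shown) | 173 | 3.1%
(glyph not shown) | 164 | 2.9%
(glyph not shown) | 160 | 2.9%
(glyph not shown) | 160 | 2.9%
(glyph not shown) | 154 | 2.8%
(glyph not shown) | 116 | 2.1%
(glyph not shown) | 111 | 2.0%
Other values (354) | 3828 | 68.7%

ASCII
Value | Count | Frequency (%)
(glyph not shown) | 136 | 24.7%
( | 129 | 23.4%
) | 129 | 23.4%
2 | 25 | 4.5%
3 | 12 | 2.2%
S | 11 | 2.0%
K | 11 | 2.0%
1 | 10 | 1.8%
G | 7 | 1.3%
C | 6 | 1.1%
Other values (34) | 75 | 13.6%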

연락처
Text

MISSING 

Distinct: 601
Distinct (%): 83.8%
Missing: 133
Missing (%): 15.6%
Memory size: 6.8 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.019526
Min length: 12

Characters and Unicode

Total characters: 8618
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 569
Unique (%): 79.4%
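
For a column with missing values, the distinct and unique percentages reported here match a denominator of non-missing rows rather than all 850 rows. The check below reproduces the figures under that assumption; it is inferred from the numbers, not stated by the report.

```python
# Figures for 연락처, copied from this section.
n_rows = 850
n_missing = 133
n_present = n_rows - n_missing  # rows that actually contain a phone number

# Assumption: percentages are relative to non-missing rows.
distinct_pct = round(100 * 601 / n_present, 1)
unique_pct = round(100 * 569 / n_present, 1)

# Mean length is total characters divided by non-missing rows.
mean_length = round(8618 / n_present, 6)

print(n_present, distinct_pct, unique_pct, mean_length)
```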

Sample

1st row: 052-250-3551
2nd row: 052-250-3551
3rd row: 052-250-5073
4th row: 052-202-5787
5th row: 052-280-1447

Value | Count | Frequency (%)
052-202-5787 | 27 | 3.8%
052-280-1447 | 25 | 3.5%
052-250-3551 | 22 | 3.1%
052-250-4827 | 6 | 0.8%
052-202-8897 | 6 | 0.8%
052-250-5073 | 5 | 0.7%
052-250-6206 | 3 | 0.4%
052-256-0994 | 3 | 0.4%
052-202-5471 | 3 | 0.4%
052-225-5590 | 3 | 0.4%
Other values (591) | 614 | 85.6%

Most occurring characters

Value | Count | Frequency (%)
2 | 1693 | 19.6%
- | 1434 | 16.6%
0 | 1381 | 16.0%
5 | 1236 | 14.3%
7 | 542 | 6.3%
1 | 477 | 5.5%
3 | 419 | 4.9%
8 | 394 | 4.6%
6 | 388 | 4.5%
4 | 365 | 4.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 7184 | 83.4%
Dash Punctuation | 1434 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 1693 | 23.6%
0 | 1381 | 19.2%
5 | 1236 | 17.2%
7 | 542 | 7.5%
1 | 477 | 6.6%
3 | 419 | 5.8%
8 | 394 | 5.5%
6 | 388 | 5.4%
4 | 365 | 5.1%
9 | 289 | 4.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1434 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 8618 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
2 | 1693 | 19.6%
- | 1434 | 16.6%
0 | 1381 | 16.0%
5 | 1236 | 14.3%
7 | 542 | 6.3%
1 | 477 | 5.5%
3 | 419 | 4.9%
8 | 394 | 4.6%
6 | 388 | 4.5%
4 | 365 | 4.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 8618 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
2 | 1693 | 19.6%
- | 1434 | 16.6%
0 | 1381 | 16.0%
5 | 1236 | 14.3%
7 | 542 | 6.3%
1 | 477 | 5.5%
3 | 419 | 4.9%
8 | 394 | 4.6%
6 | 388 | 4.5%
4 | 365 | 4.2%

폐기물구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 6.8 KiB
<NA> 744
사업장배출시설계폐기물 88
사업장생활계폐기물 18

Length

Max length: 11
Median length: 4
Mean length: 4.8305882
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사업장배출시설계폐기물
2nd row: 사업장배출시설계폐기물
3rd row: 사업장배출시설계폐기물
4th row: 사업장배출시설계폐기물
5th row: 사업장배출시설계폐기물

Common Values

Value | Count | Frequency (%)
<NA> | 744 | 87.5%
사업장배출시설계폐기물 | 88 | 10.4%
사업장생활계폐기물 | 18 | 2.1%
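
The report does not state how the 60.6% imbalance score is computed. One definition that reproduces it from the counts in the table above is one minus the Shannon entropy of the class shares, normalized by the logarithm of the number of classes. The sketch below assumes that definition; the function name `imbalance` is hypothetical.

```python
from math import log

def imbalance(counts):
    """1 - normalized Shannon entropy: 0 for a uniform split, near 1 when one class dominates."""
    total = sum(counts)
    entropy = -sum((c / total) * log(c / total) for c in counts if c > 0)
    return 1 - entropy / log(len(counts))

# Class counts for 폐기물구분: <NA>, 사업장배출시설계폐기물, 사업장생활계폐기물.
score = imbalance([744, 88, 18])
score_pct = round(100 * score, 1)
print(score_pct)
```

Note that the literal string "<NA>" is treated as a class here, as it is in the value table for this variable.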

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

폐기물명
Text

MISSING 

Distinct: 83
Distinct (%): 42.6%
Missing: 655
Missing (%): 77.1%
Memory size: 6.8 KiB

Length

Max length: 47
Median length: 42
Mean length: 9.8153846
Min length: 3

Characters and Unicode

Total characters: 1914
Distinct characters: 141
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 53
Unique (%): 27.2%

Sample

1st row: (구)폐타이어
2nd row: 폐가구류폐도장목폐목재포장재폐전선드럼(원목상태의깨끗한목재를말한다)
3rd row: (구)1등급
4th row: (구)폐토사
5th row: (구)폐유리

Value | Count | Frequency (%)
폐합성수지 | 49 | 20.9%
(value not shown) | 30 | 12.8%
구)그밖의폐기물 | 11 | 4.7%
폐수처리오니 | 10 | 4.3%
음식물류폐기물 | 9 | 3.8%
구)폐합성수지류 | 7 | 3.0%
폐합성수지류(폐염화비닐수지류는제외한다 | 5 | 2.1%
구)분진(대기오염방지시설에서포집된것에한정하되소각시설에서발생되는것은제외한다 | 4 | 1.7%
그밖의분진 | 4 | 1.7%
밖의 | 4 | 1.7%
Other values (70) | 102 | 43.4%

Most occurring characters

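Value | Count | Frequency (%)
(glyph not shown) | 181 | 9.5%
(glyph not shown) | 94 | 4.9%
(glyph not shown) | 84 | 4.4%
( | 84 | 4.4%
) | 84 | 4.4%
(glyph not shown) | 81 | 4.2%
(glyph not shown) | 77 | 4.0%
(glyph not shown) | 63 | 3.3%
(glyph not shown) | 59 | 3.1%
(glyph not shown) | 50 | 2.6%
Other values (131) | 1057 | 55.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1701 | 88.9%
Open Punctuation | 84 | 4.4%
Close Punctuation | 84 | 4.4%
Space Separator | 40 | 2.1%
Decimal Number | 5 | 0.3%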

Most frequent character per category

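Other Letter
Value | Count | Frequency (%)
(glyph not shown) | 181 | 10.6%
(glyph not shown) | 94 | 5.5%
(glyph not shown) | 84 | 4.9%
(glyph not shown) | 81 | 4.8%
(glyph not shown) | 77 | 4.5%
(glyph not shown) | 63 | 3.7%
(glyph not shown) | 59 | 3.5%
(glyph not shown) | 50 | 2.9%
(glyph not shown) | 46 | 2.7%
(glyph not shown) | 40 | 2.4%
Other values (125) | 926 | 54.4%

Decimal Number
Value | Count | Frequency (%)
1 | 3 | 60.0%
3 | 1 | 20.0%
2 | 1 | 20.0%

Open Punctuation
Value | Count | Frequency (%)
( | 84 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 84 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 40 | 100.0%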

Most occurring scripts

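Value | Count | Frequency (%)
Hangul | 1701 | 88.9%
Common | 213 | 11.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(glyph not shown) | 181 | 10.6%
(glyph not shown) | 94 | 5.5%
(glyph not shown) | 84 | 4.9%
(glyph not shown) | 81 | 4.8%
(glyph not shown) | 77 | 4.5%
(glyph not shown) | 63 | 3.7%
(glyph not shown) | 59 | 3.5%
(glyph not shown) | 50 | 2.9%
(glyph not shown) | 46 | 2.7%
(glyph not shown) | 40 | 2.4%
Other values (125) | 926 | 54.4%

Common
Value | Count | Frequency (%)
( | 84 | 39.4%
) | 84 | 39.4%
(glyph not shown) | 40 | 18.8%
1 | 3 | 1.4%
3 | 1 | 0.5%
2 | 1 | 0.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1700 | 88.8%
ASCII | 213 | 11.1%
Compat Jamo | 1 | 0.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(glyph not shown) | 181 | 10.6%
(glyph not shown) | 94 | 5.5%
(glyph not shown) | 84 | 4.9%
(glyph not shown) | 81 | 4.8%
(glyph not shown) | 77 | 4.5%
(glyph not shown) | 63 | 3.7%
(glyph not shown) | 59 | 3.5%
(glyph not shown) | 50 | 2.9%
(glyph not shown) | 46 | 2.7%
(glyph not shown) | 40 | 2.4%
Other values (124) | 925 | 54.4%

ASCII
Value | Count | Frequency (%)
( | 84 | 39.4%
) | 84 | 39.4%
(glyph not shown) | 40 | 18.8%
1 | 3 | 1.4%
3 | 1 | 0.5%
2 | 1 | 0.5%

Compat Jamo
Value | Count | Frequency (%)
(glyph not shown) | 1 | 100.0%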

지번주소
Text

MISSING 

Distinct: 632
Distinct (%): 80.4%
Missing: 64
Missing (%): 7.5%
Memory size: 6.8 KiB

Length

Max length: 38
Median length: 34
Mean length: 20.754453
Min length: 14

Characters and Unicode

Total characters: 16313
Distinct characters: 212
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 571
Unique (%): 72.6%

Sample

1st row: 울산광역시 동구 방어동 1381 현대미포조선
2nd row: 울산광역시 동구 방어동 1381 현대미포조선
3rd row: 울산광역시 동구 동부동 167-1 한국프랜지공업(주)3공장
4th row: 울산광역시 동구 전하동 1 현대중공업
5th row: 울산광역시 동구 방어동 1234

Value | Count | Frequency (%)
울산광역시 | 785 | 21.6%
울주군 | 364 | 10.0%
남구 | 219 | 6.0%
온산읍 | 138 | 3.8%
동구 | 104 | 2.9%
북구 | 89 | 2.4%
웅촌면 | 70 | 1.9%
화산리 | 52 | 1.4%
방어동 | 51 | 1.4%
여천동 | 43 | 1.2%
Other values (774) | 1726 | 47.4%

Most occurring characters

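Value | Count | Frequency (%)
(glyph not shown) | 2867 | 17.6%
(glyph not shown) | 1159 | 7.1%
(glyph not shown) | 1061 | 6.5%
(glyph not shown) | 804 | 4.9%
(glyph not shown) | 786 | 4.8%
(glyph not shown) | 785 | 4.8%
1 | 621 | 3.8%
(glyph not shown) | 574 | 3.5%
- | 443 | 2.7%
(glyph not shown) | 431 | 2.6%
Other values (202) | 6782 | 41.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 10029 | 61.5%
Decimal Number | 2896 | 17.8%
Space Separator | 2867 | 17.6%
Dash Punctuation | 443 | 2.7%
Open Punctuation | 35 | 0.2%
Close Punctuation | 35 | 0.2%
Other Symbol | 5 | < 0.1%
Uppercase Letter | 3 | < 0.1%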

Most frequent character per category

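Other Letter
Value | Count | Frequency (%)
(glyph not shown) | 1159 | 11.6%
(glyph not shown) | 1061 | 10.6%
(glyph not shown) | 804 | 8.0%
(glyph not shown) | 786 | 7.8%
(glyph not shown) | 785 | 7.8%
(glyph not shown) | 574 | 5.7%
(glyph not shown) | 431 | 4.3%
(glyph not shown) | 405 | 4.0%
(glyph not shown) | 364 | 3.6%
(glyph not shown) | 364 | 3.6%
Other values (184) | 3296 | 32.9%

Decimal Number
Value | Count | Frequency (%)
1 | 621 | 21.4%
3 | 359 | 12.4%
2 | 337 | 11.6%
4 | 263 | 9.1%
8 | 241 | 8.3%
6 | 232 | 8.0%
0 | 223 | 7.7%
5 | 221 | 7.6%
7 | 203 | 7.0%
9 | 196 | 6.8%

Uppercase Letter
Value | Count | Frequency (%)
F | 1 | 33.3%
N | 1 | 33.3%
B | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 2867 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 443 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 35 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 35 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(glyph not shown) | 5 | 100.0%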

Most occurring scripts

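Value | Count | Frequency (%)
Hangul | 10034 | 61.5%
Common | 6276 | 38.5%
Latin | 3 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(glyph not shown) | 1159 | 11.6%
(glyph not shown) | 1061 | 10.6%
(glyph not shown) | 804 | 8.0%
(glyph not shown) | 786 | 7.8%
(glyph not shown) | 785 | 7.8%
(glyph not shown) | 574 | 5.7%
(glyph not shown) | 431 | 4.3%
(glyph not shown) | 405 | 4.0%
(glyph not shown) | 364 | 3.6%
(glyph not shown) | 364 | 3.6%
Other values (185) | 3301 | 32.9%

Common
Value | Count | Frequency (%)
(glyph not shown) | 2867 | 45.7%
1 | 621 | 9.9%
- | 443 | 7.1%
3 | 359 | 5.7%
2 | 337 | 5.4%
4 | 263 | 4.2%
8 | 241 | 3.8%
6 | 232 | 3.7%
0 | 223 | 3.6%
5 | 221 | 3.5%
Other values (4) | 469 | 7.5%

Latin
Value | Count | Frequency (%)
F | 1 | 33.3%
N | 1 | 33.3%
B | 1 | 33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 10029 | 61.5%
ASCII | 6279 | 38.5%
None | 5 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(glyph not shown) | 2867 | 45.7%
1 | 621 | 9.9%
- | 443 | 7.1%
3 | 359 | 5.7%
2 | 337 | 5.4%
4 | 263 | 4.2%
8 | 241 | 3.8%
6 | 232 | 3.7%
0 | 223 | 3.6%
5 | 221 | 3.5%
Other values (7) | 472 | 7.5%

Hangul
Value | Count | Frequency (%)
(glyph not shown) | 1159 | 11.6%
(glyph not shown) | 1061 | 10.6%
(glyph not shown) | 804 | 8.0%
(glyph not shown) | 786 | 7.8%
(glyph not shown) | 785 | 7.8%
(glyph not shown) | 574 | 5.7%
(glyph not shown) | 431 | 4.3%
(glyph not shown) | 405 | 4.0%
(glyph not shown) | 364 | 3.6%
(glyph not shown) | 364 | 3.6%
Other values (184) | 3296 | 32.9%

None
Value | Count | Frequency (%)
(glyph not shown) | 5 | 100.0%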

도로명주소
Text

MISSING 

Distinct: 642
Distinct (%): 81.1%
Missing: 58
Missing (%): 6.8%
Memory size: 6.8 KiB

Length

Max length: 32
Median length: 28
Mean length: 23.108586
Min length: 9

Characters and Unicode

Total characters: 18302
Distinct characters: 183
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 583
Unique (%): 73.6%

Sample

1st row: 울산광역시 동구 방어진순환도로 100 (방어동)
2nd row: 울산광역시 동구 방어진순환도로 100 (방어동)
3rd row: 울산광역시 동구 방어진순환도로 1100 (동부동)
4th row: 울산광역시 동구 방어진순환도로 1000 (전하1동)
5th row: 울산광역시 동구 방어진순환도로 30 (방어동)

Value | Count | Frequency (%)
울산광역시 | 791 | 19.9%
울주군 | 371 | 9.4%
남구 | 224 | 5.6%
온산읍 | 143 | 3.6%
동구 | 104 | 2.6%
방어진순환도로 | 95 | 2.4%
북구 | 89 | 2.2%
웅촌면 | 66 | 1.7%
방어동 | 50 | 1.3%
여천동 | 41 | 1.0%
Other values (722) | 1991 | 50.2%

Most occurring characters

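Value | Count | Frequency (%)
(glyph not shown) | 3180 | 17.4%
(glyph not shown) | 1163 | 6.4%
(glyph not shown) | 1111 | 6.1%
(glyph not shown) | 803 | 4.4%
(glyph not shown) | 793 | 4.3%
(glyph not shown) | 792 | 4.3%
(glyph not shown) | 577 | 3.2%
1 | 574 | 3.1%
(glyph not shown) | 573 | 3.1%
(glyph not shown) | 444 | 2.4%
Other values (173) | 8292 | 45.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 11532 | 63.0%
Space Separator | 3180 | 17.4%
Decimal Number | 2576 | 14.1%
Open Punctuation | 417 | 2.3%
Close Punctuation | 417 | 2.3%
Dash Punctuation | 180 | 1.0%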

Most frequent character per category

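Other Letter
Value | Count | Frequency (%)
(glyph not shown) | 1163 | 10.1%
(glyph not shown) | 1111 | 9.6%
(glyph not shown) | 803 | 7.0%
(glyph not shown) | 793 | 6.9%
(glyph not shown) | 792 | 6.9%
(glyph not shown) | 577 | 5.0%
(glyph not shown) | 573 | 5.0%
(glyph not shown) | 444 | 3.9%
(glyph not shown) | 372 | 3.2%
(glyph not shown) | 371 | 3.2%
Other values (159) | 4533 | 39.3%

Decimal Number
Value | Count | Frequency (%)
1 | 574 | 22.3%
0 | 322 | 12.5%
2 | 316 | 12.3%
3 | 312 | 12.1%
4 | 205 | 8.0%
5 | 184 | 7.1%
8 | 180 | 7.0%
7 | 175 | 6.8%
6 | 173 | 6.7%
9 | 135 | 5.2%

Space Separator
Value | Count | Frequency (%)
(space) | 3180 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 417 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 417 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 180 | 100.0%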

Most occurring scripts

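Value | Count | Frequency (%)
Hangul | 11532 | 63.0%
Common | 6770 | 37.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(glyph not shown) | 1163 | 10.1%
(glyph not shown) | 1111 | 9.6%
(glyph not shown) | 803 | 7.0%
(glyph not shown) | 793 | 6.9%
(glyph not shown) | 792 | 6.9%
(glyph not shown) | 577 | 5.0%
(glyph not shown) | 573 | 5.0%
(glyph not shown) | 444 | 3.9%
(glyph not shown) | 372 | 3.2%
(glyph not shown) | 371 | 3.2%
Other values (159) | 4533 | 39.3%

Common
Value | Count | Frequency (%)
(glyph not shown) | 3180 | 47.0%
1 | 574 | 8.5%
( | 417 | 6.2%
) | 417 | 6.2%
0 | 322 | 4.8%
2 | 316 | 4.7%
3 | 312 | 4.6%
4 | 205 | 3.0%
5 | 184 | 2.7%
8 | 180 | 2.7%
Other values (4) | 663 | 9.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 11532 | 63.0%
ASCII | 6770 | 37.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(glyph not shown) | 3180 | 47.0%
1 | 574 | 8.5%
( | 417 | 6.2%
) | 417 | 6.2%
0 | 322 | 4.8%
2 | 316 | 4.7%
3 | 312 | 4.6%
4 | 205 | 3.0%
5 | 184 | 2.7%
8 | 180 | 2.7%
Other values (4) | 663 | 9.8%

Hangul
Value | Count | Frequency (%)
(glyph not shown) | 1163 | 10.1%
(glyph not shown) | 1111 | 9.6%
(glyph not shown) | 803 | 7.0%
(glyph not shown) | 793 | 6.9%
(glyph not shown) | 792 | 6.9%
(glyph not shown) | 577 | 5.0%
(glyph not shown) | 573 | 5.0%
(glyph not shown) | 444 | 3.9%
(glyph not shown) | 372 | 3.2%
(glyph not shown) | 371 | 3.2%
Other values (159) | 4533 | 39.3%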

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.8 KiB
2021-09-24 850

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-09-24
2nd row: 2021-09-24
3rd row: 2021-09-24
4th row: 2021-09-24
5th row: 2021-09-24

Common Values

Value | Count | Frequency (%)
2021-09-24 | 850 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

Correlations

[Figure: correlation heatmap]

 | 시_군_구 | 폐기물구분 | 폐기물명
시_군_구 | 1.000 | NaN | 0.983
폐기물구분 | NaN | 1.000 | 0.587
폐기물명 | 0.983 | 0.587 | 1.000

[Figure: correlation heatmap]

 | 시_군_구 | 폐기물구분
시_군_구 | 1.000 | 1.000
폐기물구분 | 1.000 | 1.000

[Figure: correlation heatmap]

 | 시_군_구 | 폐기물구분
시_군_구 | 1.000 | 1.000
폐기물구분 | 1.000 | 1.000
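
The report does not say which association measure produced these matrices. For pairs of categorical variables, Cramér's V is a common choice, and the sketch below assumes it purely for illustration. The function `cramers_v` and the toy labels are hypothetical; no bias correction is applied.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length lists of category labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for r, r_n in row_totals.items():
        for c, c_n in col_totals.items():
            expected = r_n * c_n / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return float("nan")  # undefined when either variable is constant
    return sqrt(chi2 / (n * k))

# Toy data: the district fully determines the label, so V is 1.
perfect = cramers_v(["동구", "동구", "남구", "남구"], ["A", "A", "B", "B"])
# Toy data: the label is independent of the district, so V is 0.
independent = cramers_v(["동구", "동구", "남구", "남구"], ["A", "B", "A", "B"])
print(perfect, independent)
```

A value of 1.000 between 시_군_구 and 폐기물구분 would, under this assumption, mean that each district maps to a single category value.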

Missing values

[Figure: bar chart] A simple visualization of nullity by column.
[Figure: matrix] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
[Figure: heatmap] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
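
Nullity correlation can be sketched as the Pearson correlation between two presence indicators (1 if a value is present, 0 if missing). This is an illustrative implementation under that assumption, with hypothetical toy columns; it is not taken from the report's code.

```python
from math import sqrt

def nullity_correlation(col_a, col_b):
    """Pearson correlation of presence indicators; None marks a missing value."""
    a = [0 if v is None else 1 for v in col_a]
    b = [0 if v is None else 1 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")  # undefined for fully present or fully missing columns
    return cov / sqrt(var_a * var_b)

# Toy columns: both addresses are missing in exactly the same rows.
lot_address = ["a", None, "b", None]
road_address = ["x", None, "y", None]
together = nullity_correlation(lot_address, road_address)

# Toy columns: one is present exactly when the other is missing.
opposite = nullity_correlation(lot_address, [None, "x", None, "y"])
print(together, opposite)
```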

Sample

First rows

 | 시_도 | 시_군_구 | 업체명 | 연락처 | 폐기물구분 | 폐기물명 | 지번주소 | 도로명주소 | 데이터기준일자
0 | 울산광역시 | 동구 | (주)현대미포조선 | 052-250-3551 | 사업장배출시설계폐기물 | (구)폐타이어 | 울산광역시 동구 방어동 1381 현대미포조선 | 울산광역시 동구 방어진순환도로 100 (방어동) | 2021-09-24
1 | 울산광역시 | 동구 | (주)현대미포조선 | 052-250-3551 | 사업장배출시설계폐기물 | 폐가구류폐도장목폐목재포장재폐전선드럼(원목상태의깨끗한목재를말한다) | 울산광역시 동구 방어동 1381 현대미포조선 | 울산광역시 동구 방어진순환도로 100 (방어동) | 2021-09-24
2 | 울산광역시 | 동구 | 한국프랜지공업(주)제3공장 | 052-250-5073 | 사업장배출시설계폐기물 | (구)1등급 | 울산광역시 동구 동부동 167-1 한국프랜지공업(주)3공장 | 울산광역시 동구 방어진순환도로 1100 (동부동) | 2021-09-24
3 | 울산광역시 | 동구 | 현대중공업㈜ | 052-202-5787 | 사업장배출시설계폐기물 | (구)폐토사 | 울산광역시 동구 전하동 1 현대중공업 | 울산광역시 동구 방어진순환도로 1000 (전하1동) | 2021-09-24
4 | 울산광역시 | 동구 | (주)케이씨씨울산공장 | 052-280-1447 | 사업장배출시설계폐기물 | (구)폐유리 | 울산광역시 동구 방어동 1234 | 울산광역시 동구 방어진순환도로 30 (방어동) | 2021-09-24
5 | 울산광역시 | 동구 | (주)케이씨씨울산공장 | 052-280-1447 | 사업장배출시설계폐기물 | (구)그밖의폐기물 | 울산광역시 동구 방어동 1234 | 울산광역시 동구 방어진순환도로 30 (방어동) | 2021-09-24
6 | 울산광역시 | 동구 | 주식회사현대백화점동구점 | 052-250-4827 | 사업장생활계폐기물 | (구)폐합성수지류 | 울산광역시 동구 서부동 105-3 현대백화점 | 울산광역시 동구 방어진순환도로 899 (서부동) | 2021-09-24
7 | 울산광역시 | 동구 | (주)현대미포조선 | 052-250-3551 | 사업장배출시설계폐기물 | (구)폐사(샌드블라스트폐사) | 울산광역시 동구 방어동 1381 현대미포조선 | 울산광역시 동구 방어진순환도로 100 (방어동) | 2021-09-24
8 | 울산광역시 | 동구 | (주)현대미포조선 | 052-250-3551 | 사업장배출시설계폐기물 | 폐합성수지류(폐염화비닐수지류는제외한다) | 울산광역시 동구 방어동 1381 현대미포조선 | 울산광역시 동구 방어진순환도로 100 (방어동) | 2021-09-24
9 | 울산광역시 | 동구 | (주)현대미포조선 | 052-250-3551 | 사업장배출시설계폐기물 | (구)1등급 | 울산광역시 동구 방어동 1381 현대미포조선 | 울산광역시 동구 방어진순환도로 100 (방어동) | 2021-09-24

Last rows

 | 시_도 | 시_군_구 | 업체명 | 연락처 | 폐기물구분 | 폐기물명 | 지번주소 | 도로명주소 | 데이터기준일자
840 | 울산광역시 | 중구 | 뉴코아아울렛 성남점 | 052-210-5004 | <NA> | <NA> | 울산광역시 중구 성남동 249-1 | 울산광역시 중구 시계탑거리 20 (성남동) | 2021-09-24
841 | 울산광역시 | 중구 | 대진실업 | 052-245-9703 | <NA> | <NA> | 울산광역시 중구 성안동 477-4번지 | 울산광역시 중구 성안13길 17 (성안동) | 2021-09-24
842 | 울산광역시 | 중구 | 동강병원 | 052-241-3712 | <NA> | <NA> | 울산광역시 중구 태화동 123-3 동강병원 | 울산광역시 중구 태화로 239 (태화동) | 2021-09-24
843 | 울산광역시 | 중구 | 동천컨벤션 | 052-282-3000 | <NA> | <NA> | 울산광역시 중구 남외동 865 울산종합운동장 | 울산광역시 중구 염포로 55 (남외동) | 2021-09-24
844 | 울산광역시 | 중구 | 롯데컬처웍스㈜ | 070-7493-2913 | <NA> | <NA> | 울산광역시 중구 성남동 256-24 | 울산광역시 중구 젊음의2거리 33 (성남동) | 2021-09-24
845 | 울산광역시 | 중구 | 세민에스요양병원 | 052-920-1188 | <NA> | <NA> | 울산광역시 중구 반구동 777-5 | 울산광역시 중구 내황4길 11 (반구동) | 2021-09-24
846 | 울산광역시 | 중구 | ㈜맥서브(울산메가박스성남점) | <NA> | <NA> | <NA> | 서울특별시 강남구 영동대로85길 88 | <NA> | 2021-09-24
847 | 울산광역시 | 중구 | 전진산업 | 052-246-1056 | <NA> | <NA> | 울산광역시 중구 성안동 471-1 | 울산광역시 중구 성안1길 155-3 (성안동) | 2021-09-24
848 | 울산광역시 | 중구 | 한국화학융합시험연구원영남본부 | 052-220-3012 | <NA> | <NA> | 울산광역시 중구 다운동 936-3 울산테크노파크 | 울산광역시 중구 종가로 15 (다운동) | 2021-09-24
849 | 울산광역시 | 중구 | 홈플러스 | 052-290-8000 | <NA> | <NA> | 울산광역시 중구 복산동 100 홈플러스울산점 | 울산광역시 중구 번영로 475 (복산동) | 2021-09-24

Duplicate rows

Most frequently occurring

 | 시_도 | 시_군_구 | 업체명 | 연락처 | 폐기물구분 | 폐기물명 | 지번주소 | 도로명주소 | 데이터기준일자 | # duplicates
0 | 울산광역시 | 동구 | (주)케이씨씨울산공장 | 052-280-1447 | 사업장배출시설계폐기물 | (구)그밖의폐기물 | 울산광역시 동구 방어동 1234 | 울산광역시 동구 방어진순환도로 30 (방어동) | 2021-09-24 | 2
1 | 울산광역시 | 동구 | 현대중공업(주) | 052-202-5787 | 사업장배출시설계폐기물 | (구)그밖의폐기물 | 울산광역시 동구 전하동 1 현대중공업 | 울산광역시 동구 방어진순환도로 1000 (전하1동) | 2021-09-24 | 2
2 | 울산광역시 | 울주군 | 한국도로공사 | <NA> | <NA> | <NA> | <NA> | <NA> | 2021-09-24 | 2
3 | 울산광역시 | 울주군 | 한국석유공사 | <NA> | <NA> | <NA> | <NA> | <NA> | 2021-09-24 | 2
4 | 울산광역시 | 울주군 | 한진레미콘㈜ | 052-225-5590 | <NA> | <NA> | 울산광역시 울주군 웅촌면 곡천리 43-1 | 울산광역시 울주군 웅촌면 원당골길 72 | 2021-09-24 | 2
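
A duplicate-row table like this one can be sketched by counting identical rows. The example uses a hypothetical helper, `duplicate_groups`, and a few toy rows with only three columns; `None` stands in for `<NA>`.

```python
from collections import Counter

def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}

# Toy rows (시_군_구, 업체명, 연락처); values are illustrative only.
rows = [
    ("울주군", "한국도로공사", None),
    ("울주군", "한국도로공사", None),
    ("울주군", "한국석유공사", None),
    ("중구", "홈플러스", "052-290-8000"),
]
groups = duplicate_groups(rows)

# Rows beyond the first occurrence in each group.
extra_rows = sum(n - 1 for n in groups.values())
print(groups, extra_rows)
```

In the table above, five groups each occur twice, which gives five extra rows; that reading is consistent with the reported 5 (0.6%) duplicate rows out of 850.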