Overview

Dataset statistics

Number of variables5
Number of observations99
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory41.3 B

Variable types

Categorical2
Text2
DateTime1

Dataset

Description부산광역시강서구_폐기물업체_20221020
Author부산광역시 강서구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3045901

Alerts

처분업 is highly overall correlated with 영업대상High correlation
영업대상 is highly overall correlated with 처분업High correlation
처분업 is highly imbalanced (51.4%)Imbalance

Reproduction

Analysis started2023-12-10 16:37:22.548890
Analysis finished2023-12-10 16:37:23.192494
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

처분업
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
수집·운반업
77 
종합재활용업
17 
중간재활용업
 
4
건설폐기물 중간처리업
 
1

Length

Max length11
Median length6
Mean length6.0505051
Min length6

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row건설폐기물 중간처리업
2nd row수집·운반업
3rd row수집·운반업
4th row수집·운반업
5th row수집·운반업

Common Values

ValueCountFrequency (%)
수집·운반업 77
77.8%
종합재활용업 17
 
17.2%
중간재활용업 4
 
4.0%
건설폐기물 중간처리업 1
 
1.0%

Length

2023-12-11T01:37:23.246483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:37:23.322576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수집·운반업 77
77.0%
종합재활용업 17
 
17.0%
중간재활용업 4
 
4.0%
건설폐기물 1
 
1.0%
중간처리업 1
 
1.0%
Distinct86
Distinct (%)86.9%
Missing0
Missing (%)0.0%
Memory size924.0 B
2023-12-11T01:37:23.537384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length5.2323232
Min length2

Characters and Unicode

Total characters518
Distinct characters140
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique78 ?
Unique (%)78.8%

Sample

1st row대도이앤알㈜
2nd row대도환경㈜
3rd row㈜강서환경
4th row㈜이레환경
5th row㈜세환건영
ValueCountFrequency (%)
㈜그린포트 4
 
4.0%
㈜은성이엔씨 3
 
3.0%
대도이앤알㈜ 3
 
3.0%
㈜세환건영 3
 
3.0%
대호수지 2
 
2.0%
동인 2
 
2.0%
해중산업(주 2
 
2.0%
㈜한맥개발 2
 
2.0%
동양기연㈜ 1
 
1.0%
대성환경산업㈜ 1
 
1.0%
Other values (77) 77
77.0%
2023-12-11T01:37:23.858167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
8.1%
30
 
5.8%
29
 
5.6%
23
 
4.4%
18
 
3.5%
16
 
3.1%
13
 
2.5%
13
 
2.5%
) 11
 
2.1%
( 11
 
2.1%
Other values (130) 312
60.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 450
86.9%
Other Symbol 42
 
8.1%
Close Punctuation 11
 
2.1%
Open Punctuation 11
 
2.1%
Uppercase Letter 2
 
0.4%
Space Separator 1
 
0.2%
Decimal Number 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
6.7%
29
 
6.4%
23
 
5.1%
18
 
4.0%
16
 
3.6%
13
 
2.9%
13
 
2.9%
11
 
2.4%
10
 
2.2%
9
 
2.0%
Other values (123) 278
61.8%
Uppercase Letter
ValueCountFrequency (%)
R 1
50.0%
C 1
50.0%
Other Symbol
ValueCountFrequency (%)
42
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 492
95.0%
Common 24
 
4.6%
Latin 2
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
8.5%
30
 
6.1%
29
 
5.9%
23
 
4.7%
18
 
3.7%
16
 
3.3%
13
 
2.6%
13
 
2.6%
11
 
2.2%
10
 
2.0%
Other values (124) 287
58.3%
Common
ValueCountFrequency (%)
) 11
45.8%
( 11
45.8%
1
 
4.2%
2 1
 
4.2%
Latin
ValueCountFrequency (%)
R 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 450
86.9%
None 42
 
8.1%
ASCII 26
 
5.0%

Most frequent character per block

None
ValueCountFrequency (%)
42
100.0%
Hangul
ValueCountFrequency (%)
30
 
6.7%
29
 
6.4%
23
 
5.1%
18
 
4.0%
16
 
3.6%
13
 
2.9%
13
 
2.9%
11
 
2.4%
10
 
2.2%
9
 
2.0%
Other values (123) 278
61.8%
ASCII
ValueCountFrequency (%)
) 11
42.3%
( 11
42.3%
R 1
 
3.8%
C 1
 
3.8%
1
 
3.8%
2 1
 
3.8%

영업대상
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)20.2%
Missing0
Missing (%)0.0%
Memory size924.0 B
사업장배출시설계폐기물
34 
건설폐기물
22 
사업장비배출시설계폐기물
19 
폐합성수지
생활폐기물, 대형폐기물, 사업장비배출시설계
 
2
Other values (15)
17 

Length

Max length24
Median length22
Mean length9.8080808
Min length4

Unique

Unique13 ?
Unique (%)13.1%

Sample

1st row건설폐기물
2nd row생활폐기물, 대형폐기물, 사업장비배출시설계
3rd row생활폐기물, 대형폐기물, 사업장비배출시설계
4th row사업장비배출시설계폐기물
5th row사업장비배출시설계폐기물

Common Values

ValueCountFrequency (%)
사업장배출시설계폐기물 34
34.3%
건설폐기물 22
22.2%
사업장비배출시설계폐기물 19
19.2%
폐합성수지 5
 
5.1%
생활폐기물, 대형폐기물, 사업장비배출시설계 2
 
2.0%
스케일, 분진, 분철 2
 
2.0%
음식물류폐기물 2
 
2.0%
동물성잔재물 1
 
1.0%
사업장비배출시설계폐기물(폐섬유류) 1
 
1.0%
폐플라스틱류 1
 
1.0%
Other values (10) 10
 
10.1%

Length

2023-12-11T01:37:23.993814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
사업장배출시설계폐기물 34
28.8%
건설폐기물 22
18.6%
사업장비배출시설계폐기물 19
16.1%
폐합성수지 6
 
5.1%
생활폐기물 2
 
1.7%
대형폐기물 2
 
1.7%
사업장비배출시설계 2
 
1.7%
스케일 2
 
1.7%
분진 2
 
1.7%
분철 2
 
1.7%
Other values (23) 25
21.2%
Distinct87
Distinct (%)87.9%
Missing0
Missing (%)0.0%
Memory size924.0 B
2023-12-11T01:37:24.189647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length39
Mean length27.393939
Min length18

Characters and Unicode

Total characters2712
Distinct characters85
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79 ?
Unique (%)79.8%

Sample

1st row부산광역시 강서구 생곡산단로52번길 29(생곡동)
2nd row부산광역시 강서구 생곡산단로52번길 29(생곡동)
3rd row부산광역시 강서구 대저로 289-1(대저1동)
4th row부산광역시 강서구 공항앞길221번길 56(대저2동)
5th row부산광역시 강서구 낙동남로 1042(명지동)
ValueCountFrequency (%)
부산광역시 99
23.1%
강서구 99
23.1%
가락대로 9
 
2.1%
녹산산단381로12번길 7
 
1.6%
녹산산업중로 6
 
1.4%
낙동북로 5
 
1.2%
대저로 4
 
0.9%
55 4
 
0.9%
맥도강변길 4
 
0.9%
명지오션시티9로 4
 
0.9%
Other values (149) 188
43.8%
2023-12-11T01:37:24.500545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
334
 
12.3%
148
 
5.5%
111
 
4.1%
2 110
 
4.1%
1 107
 
3.9%
106
 
3.9%
103
 
3.8%
102
 
3.8%
101
 
3.7%
99
 
3.7%
Other values (75) 1391
51.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1623
59.8%
Decimal Number 541
 
19.9%
Space Separator 334
 
12.3%
Open Punctuation 84
 
3.1%
Close Punctuation 84
 
3.1%
Dash Punctuation 25
 
0.9%
Other Punctuation 21
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
148
 
9.1%
111
 
6.8%
106
 
6.5%
103
 
6.3%
102
 
6.3%
101
 
6.2%
99
 
6.1%
99
 
6.1%
99
 
6.1%
84
 
5.2%
Other values (60) 571
35.2%
Decimal Number
ValueCountFrequency (%)
2 110
20.3%
1 107
19.8%
3 72
13.3%
4 49
9.1%
5 47
8.7%
9 45
8.3%
0 32
 
5.9%
6 31
 
5.7%
7 24
 
4.4%
8 24
 
4.4%
Space Separator
ValueCountFrequency (%)
334
100.0%
Open Punctuation
ValueCountFrequency (%)
( 84
100.0%
Close Punctuation
ValueCountFrequency (%)
) 84
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%
Other Punctuation
ValueCountFrequency (%)
, 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1623
59.8%
Common 1089
40.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
148
 
9.1%
111
 
6.8%
106
 
6.5%
103
 
6.3%
102
 
6.3%
101
 
6.2%
99
 
6.1%
99
 
6.1%
99
 
6.1%
84
 
5.2%
Other values (60) 571
35.2%
Common
ValueCountFrequency (%)
334
30.7%
2 110
 
10.1%
1 107
 
9.8%
( 84
 
7.7%
) 84
 
7.7%
3 72
 
6.6%
4 49
 
4.5%
5 47
 
4.3%
9 45
 
4.1%
0 32
 
2.9%
Other values (5) 125
 
11.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1623
59.8%
ASCII 1089
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
334
30.7%
2 110
 
10.1%
1 107
 
9.8%
( 84
 
7.7%
) 84
 
7.7%
3 72
 
6.6%
4 49
 
4.5%
5 47
 
4.3%
9 45
 
4.1%
0 32
 
2.9%
Other values (5) 125
 
11.5%
Hangul
ValueCountFrequency (%)
148
 
9.1%
111
 
6.8%
106
 
6.5%
103
 
6.3%
102
 
6.3%
101
 
6.2%
99
 
6.1%
99
 
6.1%
99
 
6.1%
84
 
5.2%
Other values (60) 571
35.2%
Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
Minimum2022-10-20 00:00:00
Maximum2022-10-21 00:00:00
2023-12-11T01:37:24.592998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:37:24.666863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)

Correlations

2023-12-11T01:37:24.726883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처분업상호명영업대상소재지데이터 기준일자
처분업1.0000.0000.9340.0000.000
상호명0.0001.0000.0000.9981.000
영업대상0.9340.0001.0000.0000.000
소재지0.0000.9980.0001.0001.000
데이터 기준일자0.0001.0000.0001.0001.000
2023-12-11T01:37:24.807829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처분업영업대상
처분업1.0000.644
영업대상0.6441.000
2023-12-11T01:37:24.873607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처분업영업대상
처분업1.0000.644
영업대상0.6441.000

Missing values

2023-12-11T01:37:23.087176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:37:23.160543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

처분업상호명영업대상소재지데이터 기준일자
0건설폐기물 중간처리업대도이앤알㈜건설폐기물부산광역시 강서구 생곡산단로52번길 29(생곡동)2022-10-20
1수집·운반업대도환경㈜생활폐기물, 대형폐기물, 사업장비배출시설계부산광역시 강서구 생곡산단로52번길 29(생곡동)2022-10-20
2수집·운반업㈜강서환경생활폐기물, 대형폐기물, 사업장비배출시설계부산광역시 강서구 대저로 289-1(대저1동)2022-10-20
3수집·운반업㈜이레환경사업장비배출시설계폐기물부산광역시 강서구 공항앞길221번길 56(대저2동)2022-10-20
4수집·운반업㈜세환건영사업장비배출시설계폐기물부산광역시 강서구 낙동남로 1042(명지동)2022-10-20
5수집·운반업매일환경사업장비배출시설계폐기물부산광역시 강서구 낙동남로582번길 6-1(녹산동)2022-10-20
6수집·운반업칠우무역사업장비배출시설계폐기물부산광역시 강서구 대저중앙로 336(대저1동)2022-10-20
7수집·운반업부산광역시자원재활용센타사업장비배출시설계폐기물부산광역시 강서구 생곡산단로 76(생곡동)2022-10-20
8수집·운반업미소산업㈜사업장비배출시설계폐기물부산광역시 강서구 체육공원로26번길 148, 4층(대저1동)2022-10-20
9수집·운반업㈜그린포트사업장비배출시설계폐기물부산광역시 강서구 가락대로 320(송정동)2022-10-20
처분업상호명영업대상소재지데이터 기준일자
89종합재활용업성우케미칼폐합성수지부산광역시 강서구 녹산산단381로12번길 43-9(송정동)2022-10-20
90종합재활용업삼득산업(주)음식물류폐기물부산광역시 강서구 녹산산단381로12번길 8(송정동)2022-10-20
91종합재활용업준성화학폐합성수지부산광역시 강서구 녹산산단442로 52(송정동)2022-10-20
92종합재활용업천세철강(주)스케일, 분진, 분철부산광역시 강서구 녹산산단381로12번길 77-25(송정동)2022-10-20
93종합재활용업현대수산사료㈜수산물 가공 잔재물부산광역시 강서구 녹산산업중로 432(송정동)2022-10-20
94종합재활용업나성폴리머폐합성수지부산광역시 강서구 녹산산단381로12번길 43-19(송정동)2022-10-20
95종합재활용업㈜이앤알피혁스크랩부산광역시 강서구 녹산산단262로59번길 43(송정동)2022-10-20
96종합재활용업㈜대동씨엠폐주물사부산광역시 강서구 녹산산단382로59번길 33(송정동)2022-10-20
97종합재활용업동아인텍페수처리오니 등부산광역시 강서구 녹산산단165로 86-19(송정동)2022-10-20
98종합재활용업㈜쓰리알티2차폐전지, 그 밖의 공정오니부산광역시 강서구 녹산산단262로50번길 29(송정동)2022-10-20