Overview

Dataset statistics

Number of variables: 8
Number of observations: 23
Missing cells: 1
Missing cells (%): 0.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 KiB
Average record size in memory: 70.7 B
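
The missing-cell percentages in this report can be checked with simple arithmetic. The sketch below assumes the percentage is the number of missing cells divided by rows times columns; the helper name `missing_cells_pct` is made up for illustration and is not part of ydata-profiling.

```python
def missing_cells_pct(n_missing: int, n_rows: int, n_cols: int) -> float:
    """Share of missing cells, in percent, over an n_rows x n_cols table."""
    return 100 * n_missing / (n_rows * n_cols)

# Whole dataset: 1 missing cell out of 23 rows x 8 variables
overall = missing_cells_pct(n_missing=1, n_rows=23, n_cols=8)
# Single column (전화번호): 1 missing value out of 23 rows
per_column = missing_cells_pct(n_missing=1, n_rows=23, n_cols=1)

print(f"{overall:.1f}% {per_column:.1f}%")
```

Under that assumption the two values round to 0.5% for the dataset and 4.3% for 전화번호, which agrees with the figures reported here.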

Variable types

Numeric: 1
Categorical: 3
Text: 3
DateTime: 1

Dataset

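Description: 부산광역시남구폐기물처리업체현황_20210614 (Status of waste treatment businesses in Nam-gu, Busan Metropolitan City, 20210614)
Author: 부산광역시 남구 (Nam-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3081518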

Alerts

처리업종류 is highly overall correlated with 영업대상폐기물 (High correlation)
영업대상폐기물 is highly overall correlated with 처리업종류 and 1 other field (High correlation)
영업구역 is highly overall correlated with 영업대상폐기물 (High correlation)
처리업종류 is highly imbalanced (53.6%) (Imbalance)
전화번호 has 1 (4.3%) missing values (Missing)
연번 has unique values (Unique)
허가일시 has unique values (Unique)
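
The 53.6% imbalance figure for 처리업종류 can be reproduced from its category counts (19, 2, 1, 1). The sketch below assumes the score is one minus the Shannon entropy normalized by the log of the number of classes. The report does not state this formula, so treat it as an assumption that happens to match the reported number; `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """1 - normalized Shannon entropy: 0 means perfectly balanced classes,
    values close to 1 mean one class dominates."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # convention chosen for this sketch when only one class exists
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

# Category counts for 처리업종류 as listed in its Common Values table
counts = [19, 2, 1, 1]
print(f"{imbalance_score(counts):.1%}")
```

A perfectly balanced variable such as `[5, 5, 5, 5]` would score 0 under the same definition.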

Reproduction

Analysis started: 2023-12-10 17:01:55.511165
Analysis finished: 2023-12-10 17:01:56.615234
Duration: 1.1 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 23
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12
Minimum: 1
Maximum: 23
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 339.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.1
Q1: 6.5
Median: 12
Q3: 17.5
95th percentile: 21.9
Maximum: 23
Range: 22
Interquartile range (IQR): 11
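
The quantile statistics above are consistent with linear interpolation between neighbouring sorted values. The following is a minimal sketch under that assumption, using the fact that 연번 takes the values 1 through 23; `percentile` is an illustrative helper, not a function from the profiling library.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100      # fractional index into the sorted data
    lower = int(pos)                        # index at or just below pos
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac

serial = list(range(1, 24))  # 연번 takes the values 1..23

p05, q1, q3, p95 = (percentile(serial, q) for q in (5, 25, 75, 95))
iqr = q3 - q1
value_range = max(serial) - min(serial)

print(round(p05, 1), q1, q3, round(p95, 1), iqr, value_range)
```

Rounded to one decimal, the results agree with the 5th percentile, Q1, Q3, 95th percentile, IQR and range listed above.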

Descriptive statistics

Standard deviation: 6.78233
Coefficient of variation (CV): 0.56519417
Kurtosis: -1.2
Mean: 12
Median Absolute Deviation (MAD): 6
Skewness: 0
Sum: 276
Variance: 46
Monotonicity: Strictly increasing

Figure: Histogram with fixed size bins (bins=23)
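
Most of the descriptive statistics for 연번 can be re-derived with the Python standard library. This sketch assumes the reported variance and standard deviation use the sample (n - 1) denominator, which is what the reported numbers suggest.

```python
import math
import statistics

serial = list(range(1, 24))  # 연번 takes the values 1..23

mean = statistics.mean(serial)
variance = statistics.variance(serial)    # sample variance (n - 1 denominator)
std_dev = math.sqrt(variance)
cv = std_dev / mean                       # coefficient of variation
median = statistics.median(serial)
mad = statistics.median(abs(x - median) for x in serial)  # median absolute deviation
total = sum(serial)
strictly_increasing = all(a < b for a, b in zip(serial, serial[1:]))

print(mean, variance, round(std_dev, 5), round(cv, 8), mad, total, strictly_increasing)
```

With these definitions the mean is 12, the variance 46, the standard deviation about 6.78233, the CV about 0.56519417, the MAD 6 and the sum 276, matching the report.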
Common values

Value | Count | Frequency (%)
1 | 1 | 4.3%
2 | 1 | 4.3%
23 | 1 | 4.3%
22 | 1 | 4.3%
21 | 1 | 4.3%
20 | 1 | 4.3%
19 | 1 | 4.3%
18 | 1 | 4.3%
17 | 1 | 4.3%
16 | 1 | 4.3%
Other values (13) | 13 | 56.5%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 4.3%
2 | 1 | 4.3%
3 | 1 | 4.3%
4 | 1 | 4.3%
5 | 1 | 4.3%
6 | 1 | 4.3%
7 | 1 | 4.3%
8 | 1 | 4.3%
9 | 1 | 4.3%
10 | 1 | 4.3%

Maximum 10 values

Value | Count | Frequency (%)
23 | 1 | 4.3%
22 | 1 | 4.3%
21 | 1 | 4.3%
20 | 1 | 4.3%
19 | 1 | 4.3%
18 | 1 | 4.3%
17 | 1 | 4.3%
16 | 1 | 4.3%
15 | 1 | 4.3%
14 | 1 | 4.3%

처리업종류 (type of treatment business)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 4
Distinct (%): 17.4%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 13
Median length: 4
Mean length: 4.6521739
Min length: 4
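
The length statistics for 처리업종류 follow directly from the category values and their counts. The sketch below assumes length is measured in Unicode code points (Python's `len`) on precomposed Hangul strings; the counts are the ones listed for this variable in the report.

```python
import statistics

# Category values and their counts for 처리업종류, taken from the report
value_counts = {
    "수집운반": 19,
    "중간재활용업": 2,
    "수집운반 및 중간재활용업": 1,
    "최종재활용업": 1,
}

# One length per observation (23 in total)
lengths = [len(value) for value, n in value_counts.items() for _ in range(n)]

max_len = max(lengths)
median_len = statistics.median(lengths)
mean_len = statistics.mean(lengths)
min_len = min(lengths)

print(max_len, median_len, round(mean_len, 7), min_len)
```

The mean works out to 107 / 23, which rounds to the 4.6521739 shown above.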

Unique

Unique: 2
Unique (%): 8.7%

Sample

1st row: 수집운반
2nd row: 수집운반
3rd row: 수집운반
4th row: 수집운반
5th row: 수집운반

Common Values

Value | Count | Frequency (%)
수집운반 | 19 | 82.6%
중간재활용업 | 2 | 8.7%
수집운반 및 중간재활용업 | 1 | 4.3%
최종재활용업 | 1 | 4.3%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
수집운반 | 20 | 80.0%
중간재활용업 | 3 | 12.0%
(value not preserved) | 1 | 4.0%
최종재활용업 | 1 | 4.0%

영업대상폐기물 (waste types handled)
Categorical

HIGH CORRELATION

Distinct: 10
Distinct (%): 43.5%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 18
Median length: 17
Mean length: 7.5217391
Min length: 3

Unique

Unique: 6
Unique (%): 26.1%

Sample

1st row: 생활폐기물 및 사업장생활계폐기물
2nd row: 생활폐기물
3rd row: 사업장생활계폐기물
4th row: 사업장생활계폐기물
5th row: 대형폐기물

Common Values

Value | Count | Frequency (%)
사업장생활계폐기물 | 6 | 26.1%
건설폐기물 | 5 | 21.7%
사업장배출시설계 | 4 | 17.4%
대형폐기물 | 2 | 8.7%
생활폐기물 및 사업장생활계폐기물 | 1 | 4.3%
생활폐기물 | 1 | 4.3%
건설폐기물 및 폐기물처리 재활용업 | 1 | 4.3%
폐목재 | 1 | 4.3%
폐합성수지 | 1 | 4.3%
폐타이어 | 1 | 4.3%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
사업장생활계폐기물 | 7 | 25.0%
건설폐기물 | 6 | 21.4%
사업장배출시설계 | 4 | 14.3%
대형폐기물 | 2 | 7.1%
생활폐기물 | 2 | 7.1%
(value not preserved) | 2 | 7.1%
폐기물처리 | 1 | 3.6%
재활용업 | 1 | 3.6%
폐목재 | 1 | 3.6%
폐합성수지 | 1 | 3.6%

상호 (business name)
Text

Distinct: 21
Distinct (%): 91.3%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 7
Median length: 4
Mean length: 4.5652174
Min length: 3

Characters and Unicode

Total characters: 105
Distinct characters: 54
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
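
As an illustration of how characters map to Unicode general categories, the sketch below classifies two business names from the Sample rows of 상호 with the standard `unicodedata` module. The two-letter codes it returns correspond to the category names used in this report: `Lo` is Other Letter, `So` is Other Symbol, `Ps` is Open Punctuation and `Pe` is Close Punctuation. Only two names are used here, so the counts are not the column totals.

```python
import unicodedata
from collections import Counter

# Two business names taken from the Sample rows of 상호
names = ["대방환경㈜", "(합)보수산업"]

# Count the Unicode general category of every character
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

print(category_counts)
```

The standard library exposes general categories but has no direct lookup for scripts or blocks, so those breakdowns would need a third-party package.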

Unique

Unique: 19
Unique (%): 82.6%

Sample

1st row: 대방환경㈜
2nd row: ㈜선도산업
3rd row: (합)보수산업
4th row: 우리환경
5th row: 경인산업

Value | Count | Frequency (%)
황령기업 | 2 | 8.7%
원일상사 | 2 | 8.7%
세방㈜ | 1 | 4.3%
대방환경㈜ | 1 | 4.3%
㈜동양통운 | 1 | 4.3%
성진환경 | 1 | 4.3%
회우표국㈜ | 1 | 4.3%
원크린㈜ | 1 | 4.3%
청휘환경 | 1 | 4.3%
감만타이어 | 1 | 4.3%
Other values (11) | 11 | 47.8%

Most occurring characters

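Value | Count | Frequency (%)
(glyph not preserved) | 10 | 9.5%
(glyph not preserved) | 10 | 9.5%
(glyph not preserved) | 8 | 7.6%
(glyph not preserved) | 7 | 6.7%
(glyph not preserved) | 6 | 5.7%
(glyph not preserved) | 4 | 3.8%
(glyph not preserved) | 3 | 2.9%
(glyph not preserved) | 2 | 1.9%
(glyph not preserved) | 2 | 1.9%
(glyph not preserved) | 2 | 1.9%
Other values (44) | 51 | 48.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 93 | 88.6%
Other Symbol | 10 | 9.5%
Open Punctuation | 1 | 1.0%
Close Punctuation | 1 | 1.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph not preserved) | 10 | 10.8%
(glyph not preserved) | 8 | 8.6%
(glyph not preserved) | 7 | 7.5%
(glyph not preserved) | 6 | 6.5%
(glyph not preserved) | 4 | 4.3%
(glyph not preserved) | 3 | 3.2%
(glyph not preserved) | 2 | 2.2%
(glyph not preserved) | 2 | 2.2%
(glyph not preserved) | 2 | 2.2%
(glyph not preserved) | 2 | 2.2%
Other values (41) | 47 | 50.5%

Other Symbol
Value | Count | Frequency (%)
(glyph not preserved) | 10 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 103 | 98.1%
Common | 2 | 1.9%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(glyph not preserved) | 10 | 9.7%
(glyph not preserved) | 10 | 9.7%
(glyph not preserved) | 8 | 7.8%
(glyph not preserved) | 7 | 6.8%
(glyph not preserved) | 6 | 5.8%
(glyph not preserved) | 4 | 3.9%
(glyph not preserved) | 3 | 2.9%
(glyph not preserved) | 2 | 1.9%
(glyph not preserved) | 2 | 1.9%
(glyph not preserved) | 2 | 1.9%
Other values (42) | 49 | 47.6%

Common
Value | Count | Frequency (%)
( | 1 | 50.0%
) | 1 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 93 | 88.6%
None | 10 | 9.5%
ASCII | 2 | 1.9%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(glyph not preserved) | 10 | 10.8%
(glyph not preserved) | 8 | 8.6%
(glyph not preserved) | 7 | 7.5%
(glyph not preserved) | 6 | 6.5%
(glyph not preserved) | 4 | 4.3%
(glyph not preserved) | 3 | 3.2%
(glyph not preserved) | 2 | 2.2%
(glyph not preserved) | 2 | 2.2%
(glyph not preserved) | 2 | 2.2%
(glyph not preserved) | 2 | 2.2%
Other values (41) | 47 | 50.5%

None
Value | Count | Frequency (%)
(glyph not preserved) | 10 | 100.0%

ASCII
Value | Count | Frequency (%)
( | 1 | 50.0%
) | 1 | 50.0%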

소 재 지 (address)
Text

Distinct: 21
Distinct (%): 91.3%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 37
Median length: 36
Mean length: 25.173913
Min length: 22

Characters and Unicode

Total characters: 579
Distinct characters: 64
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 19
Unique (%): 82.6%

Sample

1st row: 부산광역시 남구 신선로 231-1 (용당동)
2nd row: 부산광역시 남구 신선대산복로 42 (용당동)
3rd row: 부산광역시 남구 백운포로 36 (용호동)
4th row: 부산광역시 남구 고동골로 119 (문현동)
5th row: 부산광역시 남구 황령대로 329-67 (대연동)

Value | Count | Frequency (%)
부산광역시 | 23 | 19.0%
남구 | 23 | 19.0%
대연동 | 8 | 6.6%
신선로 | 5 | 4.1%
용당동 | 5 | 4.1%
황령대로 | 4 | 3.3%
감만동 | 4 | 3.3%
문현동 | 3 | 2.5%
329-23 | 2 | 1.7%
211 | 2 | 1.7%
Other values (38) | 42 | 34.7%

Most occurring characters

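Value | Count | Frequency (%)
(space) | 99 | 17.1%
(glyph not preserved) | 26 | 4.5%
(glyph not preserved) | 25 | 4.3%
(glyph not preserved) | 24 | 4.1%
(glyph not preserved) | 23 | 4.0%
) | 23 | 4.0%
(glyph not preserved) | 23 | 4.0%
(glyph not preserved) | 23 | 4.0%
(glyph not preserved) | 23 | 4.0%
(glyph not preserved) | 23 | 4.0%
Other values (54) | 267 | 46.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 330 | 57.0%
Space Separator | 99 | 17.1%
Decimal Number | 91 | 15.7%
Close Punctuation | 23 | 4.0%
Open Punctuation | 23 | 4.0%
Dash Punctuation | 7 | 1.2%
Other Punctuation | 5 | 0.9%
Uppercase Letter | 1 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(glyph not preserved) | 26 | 7.9%
(glyph not preserved) | 25 | 7.6%
(glyph not preserved) | 24 | 7.3%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 14 | 4.2%
Other values (38) | 103 | 31.2%

Decimal Number
Value | Count | Frequency (%)
1 | 22 | 24.2%
2 | 18 | 19.8%
3 | 17 | 18.7%
9 | 7 | 7.7%
5 | 6 | 6.6%
4 | 6 | 6.6%
8 | 5 | 5.5%
6 | 4 | 4.4%
7 | 3 | 3.3%
0 | 3 | 3.3%

Space Separator
Value | Count | Frequency (%)
(space) | 99 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 23 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 23 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 7 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 5 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
A | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 330 | 57.0%
Common | 248 | 42.8%
Latin | 1 | 0.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(glyph not preserved) | 26 | 7.9%
(glyph not preserved) | 25 | 7.6%
(glyph not preserved) | 24 | 7.3%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 14 | 4.2%
Other values (38) | 103 | 31.2%

Common
Value | Count | Frequency (%)
(space) | 99 | 39.9%
) | 23 | 9.3%
( | 23 | 9.3%
1 | 22 | 8.9%
2 | 18 | 7.3%
3 | 17 | 6.9%
- | 7 | 2.8%
9 | 7 | 2.8%
5 | 6 | 2.4%
4 | 6 | 2.4%
Other values (5) | 20 | 8.1%

Latin
Value | Count | Frequency (%)
A | 1 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 330 | 57.0%
ASCII | 249 | 43.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 99 | 39.8%
) | 23 | 9.2%
( | 23 | 9.2%
1 | 22 | 8.8%
2 | 18 | 7.2%
3 | 17 | 6.8%
- | 7 | 2.8%
9 | 7 | 2.8%
5 | 6 | 2.4%
4 | 6 | 2.4%
Other values (6) | 21 | 8.4%

Hangul
Value | Count | Frequency (%)
(glyph not preserved) | 26 | 7.9%
(glyph not preserved) | 25 | 7.6%
(glyph not preserved) | 24 | 7.3%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 23 | 7.0%
(glyph not preserved) | 14 | 4.2%
Other values (38) | 103 | 31.2%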

허가일시 (permit date)
Date

UNIQUE

Distinct: 23
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B
Minimum: 1993-03-08 00:00:00
Maximum: 2021-04-27 00:00:00

Figure: Histogram with fixed size bins (bins=23)
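
The minimum and maximum for 허가일시 can be cross-checked against the rows printed in the Sample section. The sketch below parses the 20 dates visible there. Three observations are not listed in the report, so this is only a partial check, on the assumption that the dates are ISO-formatted strings.

```python
from datetime import date

# 허가일시 values copied from the first and last rows shown in the Sample section
# (20 of the 23 observations; the remaining three are not listed in the report).
permit_dates = [date.fromisoformat(s) for s in (
    "1996-03-11", "2014-03-27", "1993-03-08", "2014-07-31", "2000-08-22",
    "1996-06-19", "2002-11-18", "2007-12-05", "2015-04-16", "2016-12-05",
    "2014-04-07", "2016-11-14", "2013-07-25", "2015-12-03", "2013-09-13",
    "2019-12-19", "2019-12-18", "2020-01-06", "2021-04-23", "2021-04-27",
)]

earliest, latest = min(permit_dates), max(permit_dates)
all_distinct = len(set(permit_dates)) == len(permit_dates)

print(earliest, latest, all_distinct)
```

Both extremes of the visible rows coincide with the reported minimum and maximum, and the visible dates are all distinct, in line with the UNIQUE flag.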

영업구역 (service area)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 8.7%
Missing: 0
Missing (%): 0.0%
Memory size: 316.0 B

Length

Max length: 8
Median length: 2
Mean length: 2.7826087
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전국
2nd row: 부산광역시 남구
3rd row: 전국
4th row: 전국
5th row: 부산광역시 남구

Common Values

Value | Count | Frequency (%)
전국 | 20 | 87.0%
부산광역시 남구 | 3 | 13.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
전국 | 20 | 76.9%
부산광역시 | 3 | 11.5%
남구 | 3 | 11.5%

전화번호 (phone number)
Text

MISSING

Distinct: 20
Distinct (%): 90.9%
Missing: 1
Missing (%): 4.3%
Memory size: 316.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.045455
Min length: 12

Characters and Unicode

Total characters: 265
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 18
Unique (%): 81.8%

Sample

1st row: 051-624-8282
2nd row: 051-626-8131
3rd row: 051-628-1236
4th row: 051-638-6792
5th row: 051-631-7868

Value | Count | Frequency (%)
051-628-5040 | 2 | 9.1%
051-627-3335 | 2 | 9.1%
051-624-8282 | 1 | 4.5%
051-625-5104 | 1 | 4.5%
051-625-1119 | 1 | 4.5%
070-4963-3752 | 1 | 4.5%
051-610-0063 | 1 | 4.5%
051-625-3423 | 1 | 4.5%
051-628-5611 | 1 | 4.5%
051-630-5300 | 1 | 4.5%
Other values (10) | 10 | 45.5%

Most occurring characters

Value | Count | Frequency (%)
- | 44 | 16.6%
5 | 36 | 13.6%
0 | 34 | 12.8%
1 | 33 | 12.5%
6 | 32 | 12.1%
3 | 23 | 8.7%
2 | 20 | 7.5%
8 | 15 | 5.7%
4 | 13 | 4.9%
7 | 10 | 3.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 221 | 83.4%
Dash Punctuation | 44 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 36 | 16.3%
0 | 34 | 15.4%
1 | 33 | 14.9%
6 | 32 | 14.5%
3 | 23 | 10.4%
2 | 20 | 9.0%
8 | 15 | 6.8%
4 | 13 | 5.9%
7 | 10 | 4.5%
9 | 5 | 2.3%

Dash Punctuation
Value | Count | Frequency (%)
- | 44 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 265 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 44 | 16.6%
5 | 36 | 13.6%
0 | 34 | 12.8%
1 | 33 | 12.5%
6 | 32 | 12.1%
3 | 23 | 8.7%
2 | 20 | 7.5%
8 | 15 | 5.7%
4 | 13 | 4.9%
7 | 10 | 3.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 265 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 44 | 16.6%
5 | 36 | 13.6%
0 | 34 | 12.8%
1 | 33 | 12.5%
6 | 32 | 12.1%
3 | 23 | 8.7%
2 | 20 | 7.5%
8 | 15 | 5.7%
4 | 13 | 4.9%
7 | 10 | 3.8%

Interactions


Correlations

Variable | 연번 | 처리업종류 | 영업대상폐기물 | 상호 | 소 재 지 | 허가일시 | 영업구역 | 전화번호
연번 | 1.000 | 0.580 | 0.859 | 0.814 | 0.814 | 1.000 | 0.000 | 0.884
처리업종류 | 0.580 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000
영업대상폐기물 | 0.859 | 1.000 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.135
상호 | 0.814 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소 재 지 | 0.814 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
허가일시 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
영업구역 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 0.884 | 0.000 | 0.135 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 영업대상폐기물 | 처리업종류 | 영업구역
영업대상폐기물 | 1.000 | 0.827 | 0.787
처리업종류 | 0.827 | 1.000 | 0.000
영업구역 | 0.787 | 0.000 | 1.000

Variable | 연번 | 처리업종류 | 영업대상폐기물 | 영업구역
연번 | 1.000 | 0.291 | 0.328 | 0.000
처리업종류 | 0.291 | 1.000 | 0.827 | 0.000
영업대상폐기물 | 0.328 | 0.827 | 1.000 | 0.787
영업구역 | 0.000 | 0.000 | 0.787 | 1.000
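
The report does not say which coefficient each matrix uses. Cramér's V is a common choice for pairs of categorical variables, and the sketch below shows the plain, uncorrected version on hypothetical toy data, assuming that is roughly what the categorical-only matrix measures. A profiler may apply a bias correction, so values from this sketch need not match the matrices above. `cramers_v` is an illustrative helper name.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Plain (not bias-corrected) Cramér's V between two categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))     # observed counts per (x, y) cell
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            chi2 += (joint.get((r, c), 0) - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Hypothetical toy data, not taken from the dataset
kind = ["a", "a", "b", "b"]
perfect = cramers_v(kind, ["p", "p", "q", "q"])      # perfectly associated
independent = cramers_v(kind, ["p", "q", "p", "q"])  # no association

print(perfect, independent)
```

Values near 1 indicate that one variable almost determines the other, which is the situation the High correlation alerts flag for 처리업종류, 영업대상폐기물 and 영업구역.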

Missing values

Figure: A simple visualization of nullity by column.
Figure: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index | 연번 | 처리업종류 | 영업대상폐기물 | 상호 | 소 재 지 | 허가일시 | 영업구역 | 전화번호
0 | 1 | 수집운반 | 생활폐기물 및 사업장생활계폐기물 | 대방환경㈜ | 부산광역시 남구 신선로 231-1 (용당동) | 1996-03-11 | 전국 | 051-624-8282
1 | 2 | 수집운반 | 생활폐기물 | ㈜선도산업 | 부산광역시 남구 신선대산복로 42 (용당동) | 2014-03-27 | 부산광역시 남구 | 051-626-8131
2 | 3 | 수집운반 | 사업장생활계폐기물 | (합)보수산업 | 부산광역시 남구 백운포로 36 (용호동) | 1993-03-08 | 전국 | 051-628-1236
3 | 4 | 수집운반 | 사업장생활계폐기물 | 우리환경 | 부산광역시 남구 고동골로 119 (문현동) | 2014-07-31 | 전국 | 051-638-6792
4 | 5 | 수집운반 | 대형폐기물 | 경인산업 | 부산광역시 남구 황령대로 329-67 (대연동) | 2000-08-22 | 부산광역시 남구 | 051-631-7868
5 | 6 | 수집운반 | 건설폐기물 | 황령기업 | 부산광역시 남구 황령대로 329-23 (대연동) | 1996-06-19 | 전국 | 051-628-5040
6 | 7 | 수집운반 | 건설폐기물 | ㈜청룡산업 | 부산광역시 남구 황령대로 329-45 (대연동) | 2002-11-18 | 전국 | 051-643-5577
7 | 8 | 수집운반 | 건설폐기물 | 거창기업 | 부산광역시 남구 우암로 286 (문현동) | 2007-12-05 | 전국 | 051-645-8866
8 | 9 | 수집운반 및 중간재활용업 | 건설폐기물 및 폐기물처리 재활용업 | ㈜부경환경산업 | 부산광역시 남구 신선로 213-1 (감만동) | 2015-04-16 | 전국 | 051-634-6699
9 | 10 | 수집운반 | 건설폐기물 | ㈜석천개발 | 부산광역시 남구 신선로 351 (용당동) | 2016-12-05 | 전국 | 051-628-3474

Last rows

Index | 연번 | 처리업종류 | 영업대상폐기물 | 상호 | 소 재 지 | 허가일시 | 영업구역 | 전화번호
13 | 14 | 수집운반 | 사업장배출시설계 | 세방㈜ | 부산광역시 남구 북항로 141 (감만동) | 2014-04-07 | 전국 | 051-630-5300
14 | 15 | 수집운반 | 사업장배출시설계 | 성원환경 | 부산광역시 남구 못골로 91-30 (대연동) | 2016-11-14 | 전국 | 051-628-5611
15 | 16 | 중간재활용업 | 폐목재 | 황령기업 | 부산광역시 남구 황령대로 329-23 (대연동) | 2013-07-25 | 전국 | 051-628-5040
16 | 17 | 중간재활용업 | 폐합성수지 | 원일상사 | 부산광역시 남구 신선로 211 (감만동) | 2015-12-03 | 전국 | 051-627-3335
17 | 18 | 최종재활용업 | 폐타이어 | 감만타이어 | 부산광역시 남구 홍곡로 231 (용당동) | 2013-09-13 | 전국 | 051-625-3423
18 | 19 | 수집운반 | 사업장생활계폐기물 | 청휘환경 | 부산광역시 남구 고동골로78번길 8, 2층 (문현동) | 2019-12-19 | 전국 | <NA>
19 | 20 | 수집운반 | 사업장생활계폐기물 | 원크린㈜ | 부산광역시 남구 수영로 312, 634호 (대연동) | 2019-12-18 | 전국 | 051-610-0063
20 | 21 | 수집운반 | 사업장생활계폐기물 | 회우표국㈜ | 부산광역시 남구 용호로231번길 53 (용호동) | 2020-01-06 | 전국 | 070-4963-3752
21 | 22 | 수집운반 | 사업장생활계폐기물 | 성진환경 | 부산광역시 남구 수영로 248, 1410호 (대연동, 메트로타워) | 2021-04-23 | 전국 | 051-625-1119
22 | 23 | 수집운반 | 대형폐기물 | 고려산업 | 부산광역시 남구 유엔평화로17번길 55, A동 101호 (대연동) | 2021-04-27 | 부산광역시 남구 | 051-628-4373