Overview

Dataset statistics

Number of variables6
Number of observations41
Missing cells4
Missing cells (%)1.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 KiB
Average record size in memory52.2 B

Variable types

Numeric1
Text4
Categorical1

Dataset

Description금정구 관내 대형 폐기물 쓰레기 업체 현황에 대한 데이터로 상호, 대표자, 사무실주소, 전화번호 등의 항목을 제공합니다
Author부산광역시 금정구
URLhttps://www.data.go.kr/data/3070491/fileData.do

Alerts

전화번호 has 4 (9.8%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:59:33.848485
Analysis finished2023-12-12 21:59:34.485493
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21
Minimum1
Maximum41
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size501.0 B
2023-12-13T06:59:34.552238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q111
median21
Q331
95-th percentile39
Maximum41
Range40
Interquartile range (IQR)20

Descriptive statistics

Standard deviation11.979149
Coefficient of variation (CV)0.57043565
Kurtosis-1.2
Mean21
Median Absolute Deviation (MAD)10
Skewness0
Sum861
Variance143.5
MonotonicityStrictly increasing
2023-12-13T06:59:34.717396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
1 1
 
2.4%
32 1
 
2.4%
24 1
 
2.4%
25 1
 
2.4%
26 1
 
2.4%
27 1
 
2.4%
28 1
 
2.4%
29 1
 
2.4%
30 1
 
2.4%
31 1
 
2.4%
Other values (31) 31
75.6%
ValueCountFrequency (%)
1 1
2.4%
2 1
2.4%
3 1
2.4%
4 1
2.4%
5 1
2.4%
6 1
2.4%
7 1
2.4%
8 1
2.4%
9 1
2.4%
10 1
2.4%
ValueCountFrequency (%)
41 1
2.4%
40 1
2.4%
39 1
2.4%
38 1
2.4%
37 1
2.4%
36 1
2.4%
35 1
2.4%
34 1
2.4%
33 1
2.4%
32 1
2.4%
Distinct36
Distinct (%)87.8%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-13T06:59:34.984368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length6.195122
Min length3

Characters and Unicode

Total characters254
Distinct characters87
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)75.6%

Sample

1st row㈜세명기업사
2nd row현대실업
3rd row부광자원 주식회사
4th row(유)우리환경
5th row㈜도호네트웍스 금정지점
ValueCountFrequency (%)
유승건기산업㈜ 2
 
4.3%
웅상자원 2
 
4.3%
동건환경 2
 
4.3%
부광자원 2
 
4.3%
주식회사 2
 
4.3%
금정지점 2
 
4.3%
현대실업 2
 
4.3%
거성환경㈜ 1
 
2.2%
㈜우리환경산업 1
 
2.2%
늘푸른환경 1
 
2.2%
Other values (29) 29
63.0%
2023-12-13T06:59:35.341471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
6.7%
14
 
5.5%
14
 
5.5%
9
 
3.5%
9
 
3.5%
8
 
3.1%
8
 
3.1%
) 7
 
2.8%
7
 
2.8%
( 7
 
2.8%
Other values (77) 154
60.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 218
85.8%
Other Symbol 17
 
6.7%
Close Punctuation 7
 
2.8%
Open Punctuation 7
 
2.8%
Space Separator 5
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
6.4%
14
 
6.4%
9
 
4.1%
9
 
4.1%
8
 
3.7%
8
 
3.7%
7
 
3.2%
5
 
2.3%
5
 
2.3%
5
 
2.3%
Other values (73) 134
61.5%
Other Symbol
ValueCountFrequency (%)
17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 235
92.5%
Common 19
 
7.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
7.2%
14
 
6.0%
14
 
6.0%
9
 
3.8%
9
 
3.8%
8
 
3.4%
8
 
3.4%
7
 
3.0%
5
 
2.1%
5
 
2.1%
Other values (74) 139
59.1%
Common
ValueCountFrequency (%)
) 7
36.8%
( 7
36.8%
5
26.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 218
85.8%
ASCII 19
 
7.5%
None 17
 
6.7%

Most frequent character per block

None
ValueCountFrequency (%)
17
100.0%
Hangul
ValueCountFrequency (%)
14
 
6.4%
14
 
6.4%
9
 
4.1%
9
 
4.1%
8
 
3.7%
8
 
3.7%
7
 
3.2%
5
 
2.3%
5
 
2.3%
5
 
2.3%
Other values (73) 134
61.5%
ASCII
ValueCountFrequency (%)
) 7
36.8%
( 7
36.8%
5
26.3%
Distinct35
Distinct (%)85.4%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-13T06:59:35.793586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9756098
Min length2

Characters and Unicode

Total characters122
Distinct characters53
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)73.2%

Sample

1st row박규미
2nd row감은근
3rd row홍성표
4th row최준영
5th row이현철
ValueCountFrequency (%)
감은근 3
 
7.3%
주미향 2
 
4.9%
김진옥 2
 
4.9%
서영규 2
 
4.9%
홍성표 2
 
4.9%
박규미 1
 
2.4%
정도용 1
 
2.4%
이화자 1
 
2.4%
윤선미 1
 
2.4%
박훈일 1
 
2.4%
Other values (25) 25
61.0%
2023-12-13T06:59:36.086040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
7.4%
7
 
5.7%
6
 
4.9%
5
 
4.1%
5
 
4.1%
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.5%
3
 
2.5%
Other values (43) 72
59.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 122
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
7.4%
7
 
5.7%
6
 
4.9%
5
 
4.1%
5
 
4.1%
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.5%
3
 
2.5%
Other values (43) 72
59.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 122
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
7.4%
7
 
5.7%
6
 
4.9%
5
 
4.1%
5
 
4.1%
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.5%
3
 
2.5%
Other values (43) 72
59.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 122
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
7.4%
7
 
5.7%
6
 
4.9%
5
 
4.1%
5
 
4.1%
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.5%
3
 
2.5%
Other values (43) 72
59.0%
Distinct8
Distinct (%)19.5%
Missing0
Missing (%)0.0%
Memory size460.0 B
사업장배출시설계폐기물
21 
건설폐기물
10 
사업장비배출시설계폐기물(생활)
생활폐기물
 
2
생활폐기물 및 사업장생활계폐기물
 
1
Other values (3)

Length

Max length24
Median length11
Mean length10.414634
Min length5

Unique

Unique4 ?
Unique (%)9.8%

Sample

1st row생활폐기물 및 사업장생활계폐기물
2nd row생활폐기물
3rd row생활폐기물
4th row생활폐기물 (대형폐기물)
5th row사업장비배출시설계폐기물(생활)

Common Values

ValueCountFrequency (%)
사업장배출시설계폐기물 21
51.2%
건설폐기물 10
24.4%
사업장비배출시설계폐기물(생활) 4
 
9.8%
생활폐기물 2
 
4.9%
생활폐기물 및 사업장생활계폐기물 1
 
2.4%
생활폐기물 (대형폐기물) 1
 
2.4%
사업장비배출시설계폐기물(의료기관일회용기저귀) 1
 
2.4%
사업장비배출시설계폐기물(폐식용유) 1
 
2.4%

Length

2023-12-13T06:59:36.226976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:59:36.339728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장배출시설계폐기물 21
47.7%
건설폐기물 10
22.7%
사업장비배출시설계폐기물(생활 4
 
9.1%
생활폐기물 4
 
9.1%
1
 
2.3%
사업장생활계폐기물 1
 
2.3%
대형폐기물 1
 
2.3%
사업장비배출시설계폐기물(의료기관일회용기저귀 1
 
2.3%
사업장비배출시설계폐기물(폐식용유 1
 
2.3%
Distinct36
Distinct (%)87.8%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-13T06:59:36.612427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length36
Mean length27.926829
Min length20

Characters and Unicode

Total characters1145
Distinct characters81
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)75.6%

Sample

1st row부산광역시 금정구 금샘로203 (장전동)
2nd row부산광역시 금정구 공단로8번길 15 (금사동)
3rd row부산광역시 금정구 체육공원로126 (구서동)
4th row부산광역시 금정구 부산대학로 10, 상가1동 비7-2호 (부곡동, 대우아파트)
5th row부산광역시 금정구 금정도서관로3,101호 (청룡동, 삼부아파트)
ValueCountFrequency (%)
부산광역시 41
21.6%
금정구 41
21.6%
금사동 7
 
3.7%
구서동 7
 
3.7%
부곡동 5
 
2.6%
구서동,유림노르웨이 3
 
1.6%
회동동 3
 
1.6%
중앙대로 3
 
1.6%
중앙대로1799,306호 2
 
1.1%
장전동 2
 
1.1%
Other values (69) 76
40.0%
2023-12-13T06:59:36.997979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
150
 
13.1%
57
 
5.0%
54
 
4.7%
53
 
4.6%
48
 
4.2%
46
 
4.0%
43
 
3.8%
41
 
3.6%
41
 
3.6%
41
 
3.6%
Other values (71) 571
49.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 707
61.7%
Decimal Number 181
 
15.8%
Space Separator 150
 
13.1%
Open Punctuation 41
 
3.6%
Close Punctuation 41
 
3.6%
Other Punctuation 19
 
1.7%
Dash Punctuation 6
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
8.1%
54
 
7.6%
53
 
7.5%
48
 
6.8%
46
 
6.5%
43
 
6.1%
41
 
5.8%
41
 
5.8%
41
 
5.8%
41
 
5.8%
Other values (56) 242
34.2%
Decimal Number
ValueCountFrequency (%)
1 34
18.8%
2 30
16.6%
3 19
10.5%
6 17
9.4%
7 16
8.8%
9 16
8.8%
5 14
7.7%
4 13
 
7.2%
0 12
 
6.6%
8 10
 
5.5%
Space Separator
ValueCountFrequency (%)
150
100.0%
Open Punctuation
ValueCountFrequency (%)
( 41
100.0%
Close Punctuation
ValueCountFrequency (%)
) 41
100.0%
Other Punctuation
ValueCountFrequency (%)
, 19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 707
61.7%
Common 438
38.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
8.1%
54
 
7.6%
53
 
7.5%
48
 
6.8%
46
 
6.5%
43
 
6.1%
41
 
5.8%
41
 
5.8%
41
 
5.8%
41
 
5.8%
Other values (56) 242
34.2%
Common
ValueCountFrequency (%)
150
34.2%
( 41
 
9.4%
) 41
 
9.4%
1 34
 
7.8%
2 30
 
6.8%
, 19
 
4.3%
3 19
 
4.3%
6 17
 
3.9%
7 16
 
3.7%
9 16
 
3.7%
Other values (5) 55
 
12.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 707
61.7%
ASCII 438
38.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
150
34.2%
( 41
 
9.4%
) 41
 
9.4%
1 34
 
7.8%
2 30
 
6.8%
, 19
 
4.3%
3 19
 
4.3%
6 17
 
3.9%
7 16
 
3.7%
9 16
 
3.7%
Other values (5) 55
 
12.6%
Hangul
ValueCountFrequency (%)
57
 
8.1%
54
 
7.6%
53
 
7.5%
48
 
6.8%
46
 
6.5%
43
 
6.1%
41
 
5.8%
41
 
5.8%
41
 
5.8%
41
 
5.8%
Other values (56) 242
34.2%

전화번호
Text

MISSING 

Distinct30
Distinct (%)81.1%
Missing4
Missing (%)9.8%
Memory size460.0 B
2023-12-13T06:59:37.208196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.027027
Min length12

Characters and Unicode

Total characters445
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)62.2%

Sample

1st row051-514-2212
2nd row051-521-1003
3rd row051-582-8572
4th row051-526-9787
5th row051-508-5585
ValueCountFrequency (%)
051-517-6420 2
 
5.4%
051-521-1003 2
 
5.4%
051-516-4998 2
 
5.4%
051-514-0220 2
 
5.4%
051-525-3205 2
 
5.4%
051-513-0113 2
 
5.4%
051-582-8572 2
 
5.4%
051-582-7064 1
 
2.7%
051-512-9751 1
 
2.7%
051-704-2003 1
 
2.7%
Other values (20) 20
54.1%
2023-12-13T06:59:37.538764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 91
20.4%
- 74
16.6%
0 70
15.7%
1 70
15.7%
2 37
8.3%
3 24
 
5.4%
8 23
 
5.2%
7 19
 
4.3%
4 15
 
3.4%
9 12
 
2.7%
Other values (2) 10
 
2.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 370
83.1%
Dash Punctuation 74
 
16.6%
Space Separator 1
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 91
24.6%
0 70
18.9%
1 70
18.9%
2 37
10.0%
3 24
 
6.5%
8 23
 
6.2%
7 19
 
5.1%
4 15
 
4.1%
9 12
 
3.2%
6 9
 
2.4%
Dash Punctuation
ValueCountFrequency (%)
- 74
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 445
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 91
20.4%
- 74
16.6%
0 70
15.7%
1 70
15.7%
2 37
8.3%
3 24
 
5.4%
8 23
 
5.2%
7 19
 
4.3%
4 15
 
3.4%
9 12
 
2.7%
Other values (2) 10
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 445
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 91
20.4%
- 74
16.6%
0 70
15.7%
1 70
15.7%
2 37
8.3%
3 24
 
5.4%
8 23
 
5.2%
7 19
 
4.3%
4 15
 
3.4%
9 12
 
2.7%
Other values (2) 10
 
2.2%

Interactions

2023-12-13T06:59:34.207445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:59:37.631335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상 호대표자영업대상 폐기물사무실전화번호
연번1.0000.5500.4140.8100.5500.361
상 호0.5501.0001.0000.6040.9990.996
대표자0.4141.0001.0000.5940.9960.996
영업대상 폐기물0.8100.6040.5941.0000.7370.876
사무실0.5500.9990.9960.7371.0000.996
전화번호0.3610.9960.9960.8760.9961.000
2023-12-13T06:59:37.727685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번영업대상 폐기물
연번1.0000.447
영업대상 폐기물0.4471.000

Missing values

2023-12-13T06:59:34.332595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:59:34.438781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상 호대표자영업대상 폐기물사무실전화번호
01㈜세명기업사박규미생활폐기물 및 사업장생활계폐기물부산광역시 금정구 금샘로203 (장전동)051-514-2212
12현대실업감은근생활폐기물부산광역시 금정구 공단로8번길 15 (금사동)051-521-1003
23부광자원 주식회사홍성표생활폐기물부산광역시 금정구 체육공원로126 (구서동)051-582-8572
34(유)우리환경최준영생활폐기물 (대형폐기물)부산광역시 금정구 부산대학로 10, 상가1동 비7-2호 (부곡동, 대우아파트)051-526-9787
45㈜도호네트웍스 금정지점이현철사업장비배출시설계폐기물(생활)부산광역시 금정구 금정도서관로3,101호 (청룡동, 삼부아파트)051-508-5585
56웅상자원주미향사업장비배출시설계폐기물(생활)부산광역시 금정구 수림로62번길 49, 2층(장전동)051-516-4998
67㈜삼원건업이귀자사업장비배출시설계폐기물(생활)부산광역시 금정구 중앙대로1944번길 29(구서동)051-553-8100
78부광자원 주식회사홍성표사업장비배출시설계폐기물(생활)부산광역시 금정구 체육공원로126 (구서동)051-582-8572
89에코로지스박효성사업장비배출시설계폐기물(의료기관일회용기저귀)부산광역시 금정구 금강로 502, 204동 201호 (구서동)051-583-0169
910㈜나르고환경안주현사업장비배출시설계폐기물(폐식용유)부산광역시 금정구 중앙대로1778번길 26-11, 2층 (부곡동)051-951-7800
연번상 호대표자영업대상 폐기물사무실전화번호
3132㈜선영테크박훈일건설폐기물부산광역시 금정구 중앙대로 1799,327호 (구서동,유림노르웨이)051-512-9751
3233동서자원윤선미건설폐기물부산광역시 금정구 체육공원로135 (구서동)051-516-8888
3334유승건기산업㈜서영규건설폐기물부산광역시 금정구 무학송로106 (부곡동)051-517-6420
3435이화환경개발㈜이화자건설폐기물부산광역시 금정구 금강로565번길65 (구서동)051-582-7064
3536동건환경김진옥건설폐기물부산광역시 금정구 수원지로22번길38 (회동동)051-525-3205
3637현대실업(주)감은근건설폐기물부산광역시 금정구 공단로8번길15 (금사동)051-521-1003
3738늘푸른환경정도용건설폐기물부산광역시 금정구 공단동로29번길35 (금사동)051-532-2323
3839㈜우리환경산업김순경건설폐기물부산광역시 금정구 동천로67(회동동)051-555-4401
3940마린환경산업이화열건설폐기물부산광역시 금정구 공단서로18번길89 (금사동)051-704-2003
4041㈜금사환경김형준건설폐기물부산광역시 금정구 공단서로 18번길 72 (금사동)051-525-9091