Overview

Dataset statistics

Number of variables8
Number of observations43
Missing cells29
Missing cells (%)8.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.9 KiB
Average record size in memory68.1 B

Variable types

Numeric1
Categorical2
Text4
DateTime1

Dataset

Description경기도 의정부시 사업장폐기물 수집운반업체 현황 데이터로 번호, 폐기물처리업구분명, 관리번호, 사업장명, 사업장전화번호, 사업장주소, 인허가일자, 영업상태의 항목으로 구성되어 있습니다.
Author경기도 의정부시
URLhttps://www.data.go.kr/data/15039968/fileData.do

Alerts

폐기물처리업구분명 has constant value ""Constant
영업상태 has constant value ""Constant
사업장전화번호 has 29 (67.4%) missing valuesMissing
번호 has unique valuesUnique
관리번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 11:55:39.678635
Analysis finished2023-12-12 11:55:40.537135
Duration0.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22
Minimum1
Maximum43
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-12T20:55:40.631274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.1
Q111.5
median22
Q332.5
95-th percentile40.9
Maximum43
Range42
Interquartile range (IQR)21

Descriptive statistics

Standard deviation12.556539
Coefficient of variation (CV)0.57075176
Kurtosis-1.2
Mean22
Median Absolute Deviation (MAD)11
Skewness0
Sum946
Variance157.66667
MonotonicityStrictly increasing
2023-12-12T20:55:40.780266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
1 1
 
2.3%
2 1
 
2.3%
25 1
 
2.3%
26 1
 
2.3%
27 1
 
2.3%
28 1
 
2.3%
29 1
 
2.3%
30 1
 
2.3%
31 1
 
2.3%
32 1
 
2.3%
Other values (33) 33
76.7%
ValueCountFrequency (%)
1 1
2.3%
2 1
2.3%
3 1
2.3%
4 1
2.3%
5 1
2.3%
6 1
2.3%
7 1
2.3%
8 1
2.3%
9 1
2.3%
10 1
2.3%
ValueCountFrequency (%)
43 1
2.3%
42 1
2.3%
41 1
2.3%
40 1
2.3%
39 1
2.3%
38 1
2.3%
37 1
2.3%
36 1
2.3%
35 1
2.3%
34 1
2.3%

폐기물처리업구분명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
수집운반업(사업장폐기물)
43 

Length

Max length13
Median length13
Mean length13
Min length13

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수집운반업(사업장폐기물)
2nd row수집운반업(사업장폐기물)
3rd row수집운반업(사업장폐기물)
4th row수집운반업(사업장폐기물)
5th row수집운반업(사업장폐기물)

Common Values

ValueCountFrequency (%)
수집운반업(사업장폐기물) 43
100.0%

Length

2023-12-12T20:55:40.959845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:55:41.091600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수집운반업(사업장폐기물 43
100.0%

관리번호
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-12T20:55:41.360620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters430
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row2016-00005
2nd row2000-00001
3rd row2007-00002
4th row2007-00011
5th row2010-00004
ValueCountFrequency (%)
2016-00005 1
 
2.3%
2018-00002 1
 
2.3%
2019-00001 1
 
2.3%
2019-00003 1
 
2.3%
2019-00004 1
 
2.3%
2019-00005 1
 
2.3%
2019-00007 1
 
2.3%
2019-00008 1
 
2.3%
2019-00009 1
 
2.3%
2020-00002 1
 
2.3%
Other values (33) 33
76.7%
2023-12-12T20:55:41.815645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 224
52.1%
2 70
 
16.3%
1 45
 
10.5%
- 43
 
10.0%
9 13
 
3.0%
6 7
 
1.6%
7 7
 
1.6%
8 6
 
1.4%
5 5
 
1.2%
4 5
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 387
90.0%
Dash Punctuation 43
 
10.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 224
57.9%
2 70
 
18.1%
1 45
 
11.6%
9 13
 
3.4%
6 7
 
1.8%
7 7
 
1.8%
8 6
 
1.6%
5 5
 
1.3%
4 5
 
1.3%
3 5
 
1.3%
Dash Punctuation
ValueCountFrequency (%)
- 43
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 430
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 224
52.1%
2 70
 
16.3%
1 45
 
10.5%
- 43
 
10.0%
9 13
 
3.0%
6 7
 
1.6%
7 7
 
1.6%
8 6
 
1.4%
5 5
 
1.2%
4 5
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 430
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 224
52.1%
2 70
 
16.3%
1 45
 
10.5%
- 43
 
10.0%
9 13
 
3.0%
6 7
 
1.6%
7 7
 
1.6%
8 6
 
1.4%
5 5
 
1.2%
4 5
 
1.2%
Distinct41
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-12T20:55:42.090767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length8
Mean length4.9069767
Min length2

Characters and Unicode

Total characters211
Distinct characters98
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)90.7%

Sample

1st row㈜케이비티환경
2nd row㈜미래환경
3rd row사계절환경
4th row(주)이레상사
5th row대풍산업
ValueCountFrequency (%)
주식회사 3
 
6.5%
진환경 2
 
4.3%
복지스크랩 2
 
4.3%
㈜나무야 1
 
2.2%
개별화물 1
 
2.2%
대덕철재(자원 1
 
2.2%
용진산업 1
 
2.2%
도솔산업 1
 
2.2%
갈현자원 1
 
2.2%
대한유지 1
 
2.2%
Other values (32) 32
69.6%
2023-12-12T20:55:42.538896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
6.2%
13
 
6.2%
10
 
4.7%
9
 
4.3%
7
 
3.3%
7
 
3.3%
6
 
2.8%
5
 
2.4%
5
 
2.4%
4
 
1.9%
Other values (88) 132
62.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 193
91.5%
Other Symbol 7
 
3.3%
Space Separator 3
 
1.4%
Other Punctuation 2
 
0.9%
Close Punctuation 2
 
0.9%
Open Punctuation 2
 
0.9%
Uppercase Letter 2
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
6.7%
13
 
6.7%
10
 
5.2%
9
 
4.7%
7
 
3.6%
6
 
3.1%
5
 
2.6%
5
 
2.6%
4
 
2.1%
4
 
2.1%
Other values (81) 117
60.6%
Uppercase Letter
ValueCountFrequency (%)
R 1
50.0%
C 1
50.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 200
94.8%
Common 9
 
4.3%
Latin 2
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
6.5%
13
 
6.5%
10
 
5.0%
9
 
4.5%
7
 
3.5%
7
 
3.5%
6
 
3.0%
5
 
2.5%
5
 
2.5%
4
 
2.0%
Other values (82) 121
60.5%
Common
ValueCountFrequency (%)
3
33.3%
. 2
22.2%
) 2
22.2%
( 2
22.2%
Latin
ValueCountFrequency (%)
R 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 193
91.5%
ASCII 11
 
5.2%
None 7
 
3.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
13
 
6.7%
13
 
6.7%
10
 
5.2%
9
 
4.7%
7
 
3.6%
6
 
3.1%
5
 
2.6%
5
 
2.6%
4
 
2.1%
4
 
2.1%
Other values (81) 117
60.6%
None
ValueCountFrequency (%)
7
100.0%
ASCII
ValueCountFrequency (%)
3
27.3%
. 2
18.2%
) 2
18.2%
( 2
18.2%
R 1
 
9.1%
C 1
 
9.1%

사업장전화번호
Text

MISSING 

Distinct12
Distinct (%)85.7%
Missing29
Missing (%)67.4%
Memory size476.0 B
2023-12-12T20:55:42.735425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters168
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)71.4%

Sample

1st row031-855-0851
2nd row031-877-7000
3rd row031-855-6573
4th row031-848-1988
5th row031-821-4763
ValueCountFrequency (%)
031-821-4763 2
14.3%
031-647-6311 2
14.3%
031-855-0851 1
7.1%
031-877-7000 1
7.1%
031-855-6573 1
7.1%
031-848-1988 1
7.1%
031-851-8311 1
7.1%
031-846-1420 1
7.1%
031-811-0016 1
7.1%
031-878-6780 1
7.1%
Other values (2) 2
14.3%
2023-12-12T20:55:43.084070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 30
17.9%
- 28
16.7%
0 22
13.1%
3 22
13.1%
8 21
12.5%
7 12
 
7.1%
6 11
 
6.5%
5 10
 
6.0%
4 7
 
4.2%
2 3
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 140
83.3%
Dash Punctuation 28
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 30
21.4%
0 22
15.7%
3 22
15.7%
8 21
15.0%
7 12
 
8.6%
6 11
 
7.9%
5 10
 
7.1%
4 7
 
5.0%
2 3
 
2.1%
9 2
 
1.4%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 168
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 30
17.9%
- 28
16.7%
0 22
13.1%
3 22
13.1%
8 21
12.5%
7 12
 
7.1%
6 11
 
6.5%
5 10
 
6.0%
4 7
 
4.2%
2 3
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 168
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 30
17.9%
- 28
16.7%
0 22
13.1%
3 22
13.1%
8 21
12.5%
7 12
 
7.1%
6 11
 
6.5%
5 10
 
6.0%
4 7
 
4.2%
2 3
 
1.8%
Distinct42
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-12T20:55:43.432539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length38
Mean length33.093023
Min length20

Characters and Unicode

Total characters1423
Distinct characters106
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)95.3%

Sample

1st row경기도 의정부시 체육로 298-33, 8층 808호
2nd row경기도 의정부시 경의로 41, 402호 (의정부동)
3rd row경기도 의정부시 둔야로 36 (가능동)
4th row경기도 의정부시 범골로158번길 46, 302호 (의정부동)
5th row경기도 의정부시 태평로204번길 16, 1층 (가능동)
ValueCountFrequency (%)
경기도 43
 
15.0%
의정부시 43
 
15.0%
의정부동 21
 
7.3%
경의로 6
 
2.1%
1층 5
 
1.7%
신곡동 4
 
1.4%
70 4
 
1.4%
8층 4
 
1.4%
해태프라자 3
 
1.0%
가능동 3
 
1.0%
Other values (114) 150
52.4%
2023-12-12T20:55:43.965037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
243
 
17.1%
72
 
5.1%
69
 
4.8%
67
 
4.7%
1 50
 
3.5%
49
 
3.4%
48
 
3.4%
45
 
3.2%
43
 
3.0%
43
 
3.0%
Other values (96) 694
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 769
54.0%
Decimal Number 273
 
19.2%
Space Separator 243
 
17.1%
Close Punctuation 42
 
3.0%
Open Punctuation 42
 
3.0%
Other Punctuation 38
 
2.7%
Dash Punctuation 9
 
0.6%
Uppercase Letter 7
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
72
 
9.4%
69
 
9.0%
67
 
8.7%
49
 
6.4%
48
 
6.2%
45
 
5.9%
43
 
5.6%
43
 
5.6%
43
 
5.6%
38
 
4.9%
Other values (79) 252
32.8%
Decimal Number
ValueCountFrequency (%)
1 50
18.3%
0 41
15.0%
4 38
13.9%
2 38
13.9%
5 26
9.5%
3 21
7.7%
7 19
 
7.0%
8 19
 
7.0%
6 11
 
4.0%
9 10
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
A 6
85.7%
B 1
 
14.3%
Space Separator
ValueCountFrequency (%)
243
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Other Punctuation
ValueCountFrequency (%)
, 38
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 769
54.0%
Common 647
45.5%
Latin 7
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
72
 
9.4%
69
 
9.0%
67
 
8.7%
49
 
6.4%
48
 
6.2%
45
 
5.9%
43
 
5.6%
43
 
5.6%
43
 
5.6%
38
 
4.9%
Other values (79) 252
32.8%
Common
ValueCountFrequency (%)
243
37.6%
1 50
 
7.7%
) 42
 
6.5%
( 42
 
6.5%
0 41
 
6.3%
4 38
 
5.9%
2 38
 
5.9%
, 38
 
5.9%
5 26
 
4.0%
3 21
 
3.2%
Other values (5) 68
 
10.5%
Latin
ValueCountFrequency (%)
A 6
85.7%
B 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 769
54.0%
ASCII 654
46.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
243
37.2%
1 50
 
7.6%
) 42
 
6.4%
( 42
 
6.4%
0 41
 
6.3%
4 38
 
5.8%
2 38
 
5.8%
, 38
 
5.8%
5 26
 
4.0%
3 21
 
3.2%
Other values (7) 75
 
11.5%
Hangul
ValueCountFrequency (%)
72
 
9.4%
69
 
9.0%
67
 
8.7%
49
 
6.4%
48
 
6.2%
45
 
5.9%
43
 
5.6%
43
 
5.6%
43
 
5.6%
38
 
4.9%
Other values (79) 252
32.8%
Distinct42
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size476.0 B
Minimum1999-06-08 00:00:00
Maximum2021-02-01 00:00:00
2023-12-12T20:55:44.119842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:55:44.303665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)

영업상태
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
영업
43 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
영업 43
100.0%

Length

2023-12-12T20:55:44.786819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:55:44.913238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 43
100.0%

Interactions

2023-12-12T20:55:40.159855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:55:45.015123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호관리번호사업장명사업장전화번호사업장주소인허가일자
번호1.0001.0000.8730.8800.9361.000
관리번호1.0001.0001.0001.0001.0001.000
사업장명0.8731.0001.0001.0001.0000.985
사업장전화번호0.8801.0001.0001.0001.0001.000
사업장주소0.9361.0001.0001.0001.0000.995
인허가일자1.0001.0000.9851.0000.9951.000

Missing values

2023-12-12T20:55:40.300066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:55:40.468717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호폐기물처리업구분명관리번호사업장명사업장전화번호사업장주소인허가일자영업상태
01수집운반업(사업장폐기물)2016-00005㈜케이비티환경031-855-0851경기도 의정부시 체육로 298-33, 8층 808호2017-01-05영업
12수집운반업(사업장폐기물)2000-00001㈜미래환경031-877-7000경기도 의정부시 경의로 41, 402호 (의정부동)2001-09-11영업
23수집운반업(사업장폐기물)2007-00002사계절환경031-855-6573경기도 의정부시 둔야로 36 (가능동)2007-06-20영업
34수집운반업(사업장폐기물)2007-00011(주)이레상사<NA>경기도 의정부시 범골로158번길 46, 302호 (의정부동)2008-01-04영업
45수집운반업(사업장폐기물)2010-00004대풍산업<NA>경기도 의정부시 태평로204번길 16, 1층 (가능동)1999-06-08영업
56수집운반업(사업장폐기물)2012-00004고산환경산업 주식회사031-848-1988경기도 의정부시 동일로794번길 3-8 (녹양동)2007-05-17영업
67수집운반업(사업장폐기물)2016-00002장암자원<NA>경기도 의정부시 동일로 114-24 (장암동)2007-02-05영업
78수집운반업(사업장폐기물)2017-00001에코환경<NA>경기도 의정부시 둔야로45번길 11, 1층 A145호 (의정부동)2007-02-09영업
89수집운반업(사업장폐기물)2018-00003복지스크랩031-821-4763경기도 의정부시 추동로108번길 76 (신곡동)2007-06-25영업
910수집운반업(사업장폐기물)2019-00002철원자원031-851-8311경기도 의정부시 승지로 9 (민락동)2007-06-27영업
번호폐기물처리업구분명관리번호사업장명사업장전화번호사업장주소인허가일자영업상태
3334수집운반업(사업장폐기물)2020-00008부영인더스트리<NA>경기도 의정부시 둔야로45번길 11, 1층 A105호 (의정부동)2015-03-03영업
3435수집운반업(사업장폐기물)2020-00009도솔산업<NA>경기도 의정부시 경의로 13, 4층 401호 (의정부동)2015-02-12영업
3536수집운반업(사업장폐기물)2020-00013주식회사 부창<NA>경기도 의정부시 청사로 45, 플래티넘프라자 7층 702-26호 (금오동)2012-08-08영업
3637수집운반업(사업장폐기물)2020-00014㈜시화<NA>경기도 의정부시 경의로 70, 4층 A408호 (의정부동)2007-03-16영업
3738수집운반업(사업장폐기물)2020-00015만물자원<NA>경기도 의정부시 둔야로45번길 11, 1층 A101호 (의정부동)2007-01-17영업
3839수집운반업(사업장폐기물)2020-00018㈜나무야<NA>경기도 의정부시 시민로 39, 의정부 대정프라자 501호 씨-06호 (의정부동)2016-10-20영업
3940수집운반업(사업장폐기물)2021-00002한국RC<NA>경기도 의정부시 경의로 70, 5층 A507호 (의정부동)2016-05-23영업
4041수집운반업(사업장폐기물)2021-00003대륙철재<NA>경기도 의정부시 경의로 70, 5층 A501호 (의정부동)2016-05-10영업
4142수집운반업(사업장폐기물)2021-00006대덕철재(자원)<NA>경기도 의정부시 가능로74번길 20, 1층 (의정부동)2007-05-04영업
4243수집운반업(사업장폐기물)2021-00007대영환경㈜<NA>경기도 의정부시 경의로 70, 403호 (의정부동)2021-02-01영업