Overview

Dataset statistics

Number of variables6
Number of observations40
Missing cells20
Missing cells (%)8.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory52.3 B

Variable types

Numeric1
Categorical1
Text4

Dataset

Description경상북도 구미시 재생플라스틱업체 현황으로 폐기물재활용업 중 최종재활용업, 종합재활용업 등록 시설 중 합성수지를 영업대상 폐기물로 포함한 업체의 사업자명칭, 주소, 전화번호 데이터 파일입니다.
Author경상북도 구미시
URLhttps://www.data.go.kr/data/15035303/fileData.do

Alerts

업종 has constant value ""Constant
사업장주소(도로명) has 1 (2.5%) missing valuesMissing
사업장주소(지번) has 4 (10.0%) missing valuesMissing
사업장전화번호 has 15 (37.5%) missing valuesMissing
연번 has unique valuesUnique
사업장명칭 has unique valuesUnique

Reproduction

Analysis started2024-03-23 05:19:43.057668
Analysis finished2024-03-23 05:19:49.831329
Duration6.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct40
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.5
Minimum1
Maximum40
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size492.0 B
2024-03-23T05:19:50.220196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.95
Q110.75
median20.5
Q330.25
95-th percentile38.05
Maximum40
Range39
Interquartile range (IQR)19.5

Descriptive statistics

Standard deviation11.690452
Coefficient of variation (CV)0.57026595
Kurtosis-1.2
Mean20.5
Median Absolute Deviation (MAD)10
Skewness0
Sum820
Variance136.66667
MonotonicityStrictly increasing
2024-03-23T05:19:50.868193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
1 1
 
2.5%
22 1
 
2.5%
24 1
 
2.5%
25 1
 
2.5%
26 1
 
2.5%
27 1
 
2.5%
28 1
 
2.5%
29 1
 
2.5%
30 1
 
2.5%
31 1
 
2.5%
Other values (30) 30
75.0%
ValueCountFrequency (%)
1 1
2.5%
2 1
2.5%
3 1
2.5%
4 1
2.5%
5 1
2.5%
6 1
2.5%
7 1
2.5%
8 1
2.5%
9 1
2.5%
10 1
2.5%
ValueCountFrequency (%)
40 1
2.5%
39 1
2.5%
38 1
2.5%
37 1
2.5%
36 1
2.5%
35 1
2.5%
34 1
2.5%
33 1
2.5%
32 1
2.5%
31 1
2.5%

업종
Categorical

CONSTANT 

Distinct1
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
종합재활용업
40 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종합재활용업
2nd row종합재활용업
3rd row종합재활용업
4th row종합재활용업
5th row종합재활용업

Common Values

ValueCountFrequency (%)
종합재활용업 40
100.0%

Length

2024-03-23T05:19:51.294787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T05:19:51.961646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종합재활용업 40
100.0%

사업장명칭
Text

UNIQUE 

Distinct40
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
2024-03-23T05:19:52.528722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length7.05
Min length4

Characters and Unicode

Total characters282
Distinct characters90
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)100.0%

Sample

1st row(주)기신산업구미공장
2nd row(주)대성산업개발
3rd row(주)대성산업개발1공장
4th row(주)대성산업개발2공장
5th row(주)대현이엔지
ValueCountFrequency (%)
주식회사 5
 
10.9%
신원환경 1
 
2.2%
만석금속 1
 
2.2%
대영산업 1
 
2.2%
동천수지 1
 
2.2%
부성섬유(주 1
 
2.2%
송학산업 1
 
2.2%
에이스텍 1
 
2.2%
영남수지공업사 1
 
2.2%
금장케미칼 1
 
2.2%
Other values (32) 32
69.6%
2024-03-23T05:19:53.893102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29
 
10.3%
( 21
 
7.4%
) 21
 
7.4%
12
 
4.3%
11
 
3.9%
10
 
3.5%
8
 
2.8%
7
 
2.5%
6
 
2.1%
6
 
2.1%
Other values (80) 151
53.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 232
82.3%
Open Punctuation 21
 
7.4%
Close Punctuation 21
 
7.4%
Space Separator 6
 
2.1%
Decimal Number 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
12.5%
12
 
5.2%
11
 
4.7%
10
 
4.3%
8
 
3.4%
7
 
3.0%
6
 
2.6%
6
 
2.6%
5
 
2.2%
5
 
2.2%
Other values (75) 133
57.3%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 232
82.3%
Common 50
 
17.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
12.5%
12
 
5.2%
11
 
4.7%
10
 
4.3%
8
 
3.4%
7
 
3.0%
6
 
2.6%
6
 
2.6%
5
 
2.2%
5
 
2.2%
Other values (75) 133
57.3%
Common
ValueCountFrequency (%)
( 21
42.0%
) 21
42.0%
6
 
12.0%
2 1
 
2.0%
1 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 232
82.3%
ASCII 50
 
17.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
29
 
12.5%
12
 
5.2%
11
 
4.7%
10
 
4.3%
8
 
3.4%
7
 
3.0%
6
 
2.6%
6
 
2.6%
5
 
2.2%
5
 
2.2%
Other values (75) 133
57.3%
ASCII
ValueCountFrequency (%)
( 21
42.0%
) 21
42.0%
6
 
12.0%
2 1
 
2.0%
1 1
 
2.0%
Distinct37
Distinct (%)94.9%
Missing1
Missing (%)2.5%
Memory size452.0 B
2024-03-23T05:19:54.586468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length27
Mean length22.641026
Min length19

Characters and Unicode

Total characters883
Distinct characters63
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)89.7%

Sample

1st row경상북도 구미시 장천면 장군로 216-64
2nd row경상북도 구미시 장천면 장천상림3길 89
3rd row경상북도 구미시 산동읍 동백로 341-6
4th row경상북도 구미시 산동읍 동백로 341
5th row경상북도 구미시 장천면 장천상림3길 83-1
ValueCountFrequency (%)
경상북도 38
19.3%
구미시 38
19.3%
장천면 24
 
12.2%
산동읍 6
 
3.0%
장천상림3길 5
 
2.5%
장군로 5
 
2.5%
학신로 4
 
2.0%
하장2길 4
 
2.0%
공단동 3
 
1.5%
동백로 3
 
1.5%
Other values (60) 67
34.0%
2024-03-23T05:19:55.734525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
158
17.9%
43
 
4.9%
43
 
4.9%
41
 
4.6%
40
 
4.5%
39
 
4.4%
39
 
4.4%
38
 
4.3%
38
 
4.3%
1 30
 
3.4%
Other values (53) 374
42.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 529
59.9%
Decimal Number 160
 
18.1%
Space Separator 158
 
17.9%
Dash Punctuation 25
 
2.8%
Close Punctuation 4
 
0.5%
Open Punctuation 4
 
0.5%
Other Punctuation 2
 
0.2%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
43
 
8.1%
43
 
8.1%
41
 
7.8%
40
 
7.6%
39
 
7.4%
39
 
7.4%
38
 
7.2%
38
 
7.2%
29
 
5.5%
26
 
4.9%
Other values (37) 153
28.9%
Decimal Number
ValueCountFrequency (%)
1 30
18.8%
2 23
14.4%
3 21
13.1%
7 20
12.5%
4 15
9.4%
8 13
8.1%
9 12
 
7.5%
6 11
 
6.9%
5 10
 
6.2%
0 5
 
3.1%
Space Separator
ValueCountFrequency (%)
158
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 529
59.9%
Common 353
40.0%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
43
 
8.1%
43
 
8.1%
41
 
7.8%
40
 
7.6%
39
 
7.4%
39
 
7.4%
38
 
7.2%
38
 
7.2%
29
 
5.5%
26
 
4.9%
Other values (37) 153
28.9%
Common
ValueCountFrequency (%)
158
44.8%
1 30
 
8.5%
- 25
 
7.1%
2 23
 
6.5%
3 21
 
5.9%
7 20
 
5.7%
4 15
 
4.2%
8 13
 
3.7%
9 12
 
3.4%
6 11
 
3.1%
Other values (5) 25
 
7.1%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 529
59.9%
ASCII 354
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
158
44.6%
1 30
 
8.5%
- 25
 
7.1%
2 23
 
6.5%
3 21
 
5.9%
7 20
 
5.6%
4 15
 
4.2%
8 13
 
3.7%
9 12
 
3.4%
6 11
 
3.1%
Other values (6) 26
 
7.3%
Hangul
ValueCountFrequency (%)
43
 
8.1%
43
 
8.1%
41
 
7.8%
40
 
7.6%
39
 
7.4%
39
 
7.4%
38
 
7.2%
38
 
7.2%
29
 
5.5%
26
 
4.9%
Other values (37) 153
28.9%
Distinct34
Distinct (%)94.4%
Missing4
Missing (%)10.0%
Memory size452.0 B
2024-03-23T05:19:56.389847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length24
Mean length21.694444
Min length17

Characters and Unicode

Total characters781
Distinct characters56
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)88.9%

Sample

1st row경상북도 구미시 장천면 여남리 982
2nd row경상북도 구미시 장천면 상림리 8
3rd row경상북도 구미시 산동읍 백현리 1114
4th row경상북도 구미시 산동읍 백현리 1114-5
5th row경상북도 구미시 장천면 상림리 9
ValueCountFrequency (%)
경상북도 35
19.7%
구미시 35
19.7%
장천면 22
12.4%
신장리 6
 
3.4%
산동읍 6
 
3.4%
하장리 5
 
2.8%
여남리 5
 
2.8%
상림리 4
 
2.2%
적림리 3
 
1.7%
백현리 3
 
1.7%
Other values (50) 54
30.3%
2024-03-23T05:19:57.586766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
178
22.8%
40
 
5.1%
38
 
4.9%
37
 
4.7%
36
 
4.6%
36
 
4.6%
35
 
4.5%
35
 
4.5%
35
 
4.5%
31
 
4.0%
Other values (46) 280
35.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 462
59.2%
Space Separator 178
 
22.8%
Decimal Number 122
 
15.6%
Dash Punctuation 19
 
2.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
8.7%
38
 
8.2%
37
 
8.0%
36
 
7.8%
36
 
7.8%
35
 
7.6%
35
 
7.6%
35
 
7.6%
31
 
6.7%
23
 
5.0%
Other values (34) 116
25.1%
Decimal Number
ValueCountFrequency (%)
1 27
22.1%
5 16
13.1%
8 13
10.7%
3 12
9.8%
6 12
9.8%
4 11
9.0%
7 9
 
7.4%
2 9
 
7.4%
9 7
 
5.7%
0 6
 
4.9%
Space Separator
ValueCountFrequency (%)
178
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 462
59.2%
Common 319
40.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
 
8.7%
38
 
8.2%
37
 
8.0%
36
 
7.8%
36
 
7.8%
35
 
7.6%
35
 
7.6%
35
 
7.6%
31
 
6.7%
23
 
5.0%
Other values (34) 116
25.1%
Common
ValueCountFrequency (%)
178
55.8%
1 27
 
8.5%
- 19
 
6.0%
5 16
 
5.0%
8 13
 
4.1%
3 12
 
3.8%
6 12
 
3.8%
4 11
 
3.4%
7 9
 
2.8%
2 9
 
2.8%
Other values (2) 13
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 462
59.2%
ASCII 319
40.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
178
55.8%
1 27
 
8.5%
- 19
 
6.0%
5 16
 
5.0%
8 13
 
4.1%
3 12
 
3.8%
6 12
 
3.8%
4 11
 
3.4%
7 9
 
2.8%
2 9
 
2.8%
Other values (2) 13
 
4.1%
Hangul
ValueCountFrequency (%)
40
 
8.7%
38
 
8.2%
37
 
8.0%
36
 
7.8%
36
 
7.8%
35
 
7.6%
35
 
7.6%
35
 
7.6%
31
 
6.7%
23
 
5.0%
Other values (34) 116
25.1%

사업장전화번호
Text

MISSING 

Distinct23
Distinct (%)92.0%
Missing15
Missing (%)37.5%
Memory size452.0 B
2024-03-23T05:19:58.135638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters300
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)84.0%

Sample

1st row054-472-3355
2nd row054-472-3355
3rd row054-462-2211
4th row054-472-7712
5th row054-473-9512
ValueCountFrequency (%)
054-472-3355 2
 
8.0%
054-472-6145 2
 
8.0%
054-471-7718 1
 
4.0%
054-482-7380 1
 
4.0%
054-473-9490 1
 
4.0%
054-482-9240 1
 
4.0%
054-482-8505 1
 
4.0%
054-473-2872 1
 
4.0%
054-475-2267 1
 
4.0%
054-475-3634 1
 
4.0%
Other values (13) 13
52.0%
2024-03-23T05:19:59.517868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 57
19.0%
- 50
16.7%
5 40
13.3%
0 32
10.7%
7 24
8.0%
2 24
8.0%
3 24
8.0%
1 18
 
6.0%
6 14
 
4.7%
8 9
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 250
83.3%
Dash Punctuation 50
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 57
22.8%
5 40
16.0%
0 32
12.8%
7 24
9.6%
2 24
9.6%
3 24
9.6%
1 18
 
7.2%
6 14
 
5.6%
8 9
 
3.6%
9 8
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 300
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 57
19.0%
- 50
16.7%
5 40
13.3%
0 32
10.7%
7 24
8.0%
2 24
8.0%
3 24
8.0%
1 18
 
6.0%
6 14
 
4.7%
8 9
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 300
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 57
19.0%
- 50
16.7%
5 40
13.3%
0 32
10.7%
7 24
8.0%
2 24
8.0%
3 24
8.0%
1 18
 
6.0%
6 14
 
4.7%
8 9
 
3.0%

Interactions

2024-03-23T05:19:47.913226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T05:19:59.954234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업장명칭사업장주소(도로명)사업장주소(지번)사업장전화번호
연번1.0001.0000.7710.7800.966
사업장명칭1.0001.0001.0001.0001.000
사업장주소(도로명)0.7711.0001.0000.9990.976
사업장주소(지번)0.7801.0000.9991.0000.974
사업장전화번호0.9661.0000.9760.9741.000

Missing values

2024-03-23T05:19:48.473370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T05:19:49.087566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-23T05:19:49.615224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번업종사업장명칭사업장주소(도로명)사업장주소(지번)사업장전화번호
01종합재활용업(주)기신산업구미공장경상북도 구미시 장천면 장군로 216-64경상북도 구미시 장천면 여남리 982<NA>
12종합재활용업(주)대성산업개발경상북도 구미시 장천면 장천상림3길 89경상북도 구미시 장천면 상림리 8<NA>
23종합재활용업(주)대성산업개발1공장경상북도 구미시 산동읍 동백로 341-6경상북도 구미시 산동읍 백현리 1114054-472-3355
34종합재활용업(주)대성산업개발2공장경상북도 구미시 산동읍 동백로 341경상북도 구미시 산동읍 백현리 1114-5054-472-3355
45종합재활용업(주)대현이엔지경상북도 구미시 장천면 장천상림3길 83-1경상북도 구미시 장천면 상림리 9054-462-2211
56종합재활용업(주)두다모경상북도 구미시 장천면 하장2길 172-7경상북도 구미시 장천면 하장리 288054-472-7712
67종합재활용업(주)보성산업환경경상북도 구미시 장천면 신장1길 75, 화인테크공장경상북도 구미시 장천면 신장리 400 화인테크공장<NA>
78종합재활용업(주)부원에코베라경상북도 구미시 장천면 하장2길 274-25경상북도 구미시 장천면 하장리 23-3054-473-9512
89종합재활용업(주)성림경상북도 구미시 산호대로 104-104 (공단동)경상북도 구미시 공단동 297-15054-463-2183
910종합재활용업(주)우성알씨경상북도 구미시 선산읍 북산3길 48경상북도 구미시 선산읍 북산리 648-1054-451-1133
연번업종사업장명칭사업장주소(도로명)사업장주소(지번)사업장전화번호
3031종합재활용업주식회사 신원환경대구광역시 북구 동북로 117, 1301-A호 (산격동)대구광역시 북구 산격동 505-7<NA>
3132종합재활용업주식회사 호성크린텍경상북도 구미시 장천면 하장3길 113-15경상북도 구미시 장천면 하장리 754-5<NA>
3233종합재활용업주식회사 화원경상북도 구미시 산동읍 성림길 47경상북도 구미시 산동읍 적림리 480-6<NA>
3334종합재활용업주은산업경상북도 구미시 장천면 장천상림3길 77-3<NA><NA>
3435종합재활용업준엔프라경상북도 구미시 장천면 장군로 216-62경상북도 구미시 장천면 여남리 881-1054-482-8505
3536종합재활용업지산수지경상북도 구미시 고아읍 파산2길 57<NA>054-482-9240
3637종합재활용업진한 리싸이클링경상북도 구미시 산동읍 동백로 349경상북도 구미시 산동읍 백현리 1115<NA>
3738종합재활용업태경플라스틱경상북도 구미시 장천면 장군로 216-60경상북도 구미시 장천면 여남리 981 외 1필지<NA>
3839종합재활용업평화케미칼경상북도 구미시 장천면 장천상림3길 59경상북도 구미시 장천면 상림리 15054-473-9490
3940종합재활용업효성수지경상북도 구미시 장천면 학신로 278-8<NA>054-471-9118