Overview

Dataset statistics

Number of variables5
Number of observations26
Missing cells26
Missing cells (%)20.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory46.1 B

Variable types

Text4
Unsupported1

Dataset

Description울산광역시의 분뇨를 처리하는 수집운반업체 정보(업체명, 대표자, 수집업체, 전화번호, 비고 등) 를 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15020210/fileData.do

Alerts

비고 has 26 (100.0%) missing valuesMissing
대표자 has unique valuesUnique
전화번호 has unique valuesUnique
비고 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 12:14:34.946843
Analysis finished2023-12-12 12:14:35.357696
Duration0.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct25
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-12T21:14:35.489307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length4
Mean length4.2307692
Min length4

Characters and Unicode

Total characters110
Distinct characters43
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)92.3%

Sample

1st row청록위생
2nd row중구환경
3rd row서진환경
4th row이수환경
5th row울산환경
ValueCountFrequency (%)
울산환경 2
 
7.7%
청록위생 1
 
3.8%
무룡산업 1
 
3.8%
한일위생 1
 
3.8%
길평물류 1
 
3.8%
울주위생사 1
 
3.8%
해정그린 1
 
3.8%
동해개발 1
 
3.8%
무룡위생 1
 
3.8%
동국정화 1
 
3.8%
Other values (15) 15
57.7%
2023-12-12T21:14:35.816445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
8.2%
9
 
8.2%
8
 
7.3%
8
 
7.3%
6
 
5.5%
6
 
5.5%
5
 
4.5%
5
 
4.5%
4
 
3.6%
3
 
2.7%
Other values (33) 47
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 109
99.1%
Other Symbol 1
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
8.3%
9
 
8.3%
8
 
7.3%
8
 
7.3%
6
 
5.5%
6
 
5.5%
5
 
4.6%
5
 
4.6%
4
 
3.7%
3
 
2.8%
Other values (32) 46
42.2%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 110
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
8.2%
9
 
8.2%
8
 
7.3%
8
 
7.3%
6
 
5.5%
6
 
5.5%
5
 
4.5%
5
 
4.5%
4
 
3.6%
3
 
2.7%
Other values (33) 47
42.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 109
99.1%
None 1
 
0.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
8.3%
9
 
8.3%
8
 
7.3%
8
 
7.3%
6
 
5.5%
6
 
5.5%
5
 
4.6%
5
 
4.6%
4
 
3.7%
3
 
2.8%
Other values (32) 46
42.2%
None
ValueCountFrequency (%)
1
100.0%

대표자
Text

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-12T21:14:36.037352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9615385
Min length2

Characters and Unicode

Total characters77
Distinct characters46
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row정효애
2nd row이명래
3rd row정영옥
4th row이승범
5th row이윤정
ValueCountFrequency (%)
정효애 1
 
3.8%
이명래 1
 
3.8%
오일희 1
 
3.8%
지대율 1
 
3.8%
노영수 1
 
3.8%
최종훈 1
 
3.8%
노천순 1
 
3.8%
백승호 1
 
3.8%
김익락 1
 
3.8%
박근 1
 
3.8%
Other values (16) 16
61.5%
2023-12-12T21:14:36.423146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6
 
7.8%
5
 
6.5%
4
 
5.2%
3
 
3.9%
3
 
3.9%
3
 
3.9%
3
 
3.9%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (36) 44
57.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 77
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
7.8%
5
 
6.5%
4
 
5.2%
3
 
3.9%
3
 
3.9%
3
 
3.9%
3
 
3.9%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (36) 44
57.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 77
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
7.8%
5
 
6.5%
4
 
5.2%
3
 
3.9%
3
 
3.9%
3
 
3.9%
3
 
3.9%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (36) 44
57.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 77
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6
 
7.8%
5
 
6.5%
4
 
5.2%
3
 
3.9%
3
 
3.9%
3
 
3.9%
3
 
3.9%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (36) 44
57.1%
Distinct20
Distinct (%)76.9%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-12T21:14:36.717256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length35
Mean length23.269231
Min length13

Characters and Unicode

Total characters605
Distinct characters82
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)69.2%

Sample

1st row울산 중구 서원11길 85 (복산동)
2nd row울산 중구 서원11길 85 (복산동)
3rd row울산 중구 서원11길 85 (복산동)
4th row울산 중구 서원11길 85 (복산동)
5th row울산 중구 서원11길 85 (복산동)
ValueCountFrequency (%)
울산 26
 
18.6%
남구 12
 
8.6%
서원11길 5
 
3.6%
85 5
 
3.6%
복산동 5
 
3.6%
북구 5
 
3.6%
중구 5
 
3.6%
울주군 4
 
2.9%
무룡1로 3
 
2.1%
110 3
 
2.1%
Other values (58) 67
47.9%
2023-12-12T21:14:37.151904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
114
18.8%
1 48
 
7.9%
37
 
6.1%
30
 
5.0%
27
 
4.5%
22
 
3.6%
( 20
 
3.3%
0 20
 
3.3%
) 20
 
3.3%
17
 
2.8%
Other values (72) 250
41.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 304
50.2%
Decimal Number 130
21.5%
Space Separator 114
 
18.8%
Open Punctuation 20
 
3.3%
Close Punctuation 20
 
3.3%
Other Punctuation 12
 
2.0%
Dash Punctuation 5
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
 
12.2%
30
 
9.9%
27
 
8.9%
22
 
7.2%
17
 
5.6%
14
 
4.6%
13
 
4.3%
10
 
3.3%
6
 
2.0%
6
 
2.0%
Other values (57) 122
40.1%
Decimal Number
ValueCountFrequency (%)
1 48
36.9%
0 20
15.4%
5 13
 
10.0%
2 13
 
10.0%
3 9
 
6.9%
8 7
 
5.4%
6 7
 
5.4%
4 6
 
4.6%
7 4
 
3.1%
9 3
 
2.3%
Space Separator
ValueCountFrequency (%)
114
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Other Punctuation
ValueCountFrequency (%)
, 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 304
50.2%
Common 301
49.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
 
12.2%
30
 
9.9%
27
 
8.9%
22
 
7.2%
17
 
5.6%
14
 
4.6%
13
 
4.3%
10
 
3.3%
6
 
2.0%
6
 
2.0%
Other values (57) 122
40.1%
Common
ValueCountFrequency (%)
114
37.9%
1 48
15.9%
( 20
 
6.6%
0 20
 
6.6%
) 20
 
6.6%
5 13
 
4.3%
2 13
 
4.3%
, 12
 
4.0%
3 9
 
3.0%
8 7
 
2.3%
Other values (5) 25
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 304
50.2%
ASCII 301
49.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
114
37.9%
1 48
15.9%
( 20
 
6.6%
0 20
 
6.6%
) 20
 
6.6%
5 13
 
4.3%
2 13
 
4.3%
, 12
 
4.0%
3 9
 
3.0%
8 7
 
2.3%
Other values (5) 25
 
8.3%
Hangul
ValueCountFrequency (%)
37
 
12.2%
30
 
9.9%
27
 
8.9%
22
 
7.2%
17
 
5.6%
14
 
4.6%
13
 
4.3%
10
 
3.3%
6
 
2.0%
6
 
2.0%
Other values (57) 122
40.1%

전화번호
Text

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-12T21:14:37.413328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters312
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row052-277-1119
2nd row052-281-0408
3rd row052-296-9629
4th row052-294-2626
5th row052-224-5555
ValueCountFrequency (%)
052-277-1119 1
 
3.8%
052-281-0408 1
 
3.8%
052-264-3448 1
 
3.8%
052-239-6967 1
 
3.8%
052-269-0808 1
 
3.8%
052-288-2707 1
 
3.8%
052-289-2777 1
 
3.8%
052-289-8585 1
 
3.8%
052-251-3900 1
 
3.8%
052-234-7200 1
 
3.8%
Other values (16) 16
61.5%
2023-12-12T21:14:37.816591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 69
22.1%
- 52
16.7%
0 44
14.1%
5 41
13.1%
8 24
 
7.7%
7 19
 
6.1%
9 15
 
4.8%
6 15
 
4.8%
1 12
 
3.8%
4 11
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 260
83.3%
Dash Punctuation 52
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 69
26.5%
0 44
16.9%
5 41
15.8%
8 24
 
9.2%
7 19
 
7.3%
9 15
 
5.8%
6 15
 
5.8%
1 12
 
4.6%
4 11
 
4.2%
3 10
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 312
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 69
22.1%
- 52
16.7%
0 44
14.1%
5 41
13.1%
8 24
 
7.7%
7 19
 
6.1%
9 15
 
4.8%
6 15
 
4.8%
1 12
 
3.8%
4 11
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 312
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 69
22.1%
- 52
16.7%
0 44
14.1%
5 41
13.1%
8 24
 
7.7%
7 19
 
6.1%
9 15
 
4.8%
6 15
 
4.8%
1 12
 
3.8%
4 11
 
3.5%

비고
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing26
Missing (%)100.0%
Memory size366.0 B

Correlations

2023-12-12T21:14:37.917690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명대표자소재지전화번호
업체명1.0001.0000.9771.000
대표자1.0001.0001.0001.000
소재지0.9771.0001.0001.000
전화번호1.0001.0001.0001.000

Missing values

2023-12-12T21:14:35.209013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:14:35.313053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명대표자소재지전화번호비고
0청록위생정효애울산 중구 서원11길 85 (복산동)052-277-1119<NA>
1중구환경이명래울산 중구 서원11길 85 (복산동)052-281-0408<NA>
2서진환경정영옥울산 중구 서원11길 85 (복산동)052-296-9629<NA>
3이수환경이승범울산 중구 서원11길 85 (복산동)052-294-2626<NA>
4울산환경이윤정울산 중구 서원11길 85 (복산동)052-224-5555<NA>
5달동위생김승철울산 남구 신복로35번길 6, 101동 1201호(쌍용스윗닷홈)052-282-8989<NA>
6청록환경서경자울산 남구 남산로 102(무거동)052-223-3888<NA>
7남울산위생오정연울산 남구 삼산로9번길 6-3, 403호 (신정동)052-268-2747<NA>
8삼산위생박두근울산 남구 월평로129번길 26-2, 101호 (신정동)052-266-7822<NA>
9울산환경박순병울산 남구 월평로 253, 107동 1001호 (삼산동, 삼산현대아파트)052-277-0033<NA>
업체명대표자소재지전화번호비고
16북일산업김종원울산 남구 꽃대나리로 15, 1층 2호 (달동)052-261-7157<NA>
17동해정화박근울산 북구 호계동 350052-234-7200<NA>
18동국정화김익락울산 북구 호계동 701-1052-251-3900<NA>
19무룡위생백승호울산 북구 무룡1로 110 (연암동)052-289-8585<NA>
20동해개발노천순울산 북구 무룡1로 110 (연암동)052-289-2777<NA>
21해정그린최종훈울산 북구 무룡1로 110 (연암동)052-288-2707<NA>
22울주위생사노영수울산 울주군 청량읍 온산로 606052-269-0808<NA>
23길평물류지대율울산 울주군 온양읍 남창로 439052-239-6967<NA>
24한일위생오일희울산 울주군 언양읍 어음길 13052-264-3448<NA>
25서울주정화㈜김태혁울산 울주군 언양읍 동부3길 17-6052-254-2002<NA>