Overview

Dataset statistics

Number of variables3
Number of observations69
Missing cells30
Missing cells (%)14.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory25.9 B

Variable types

Text3

Dataset

Description경상남도 양산시 관내 청소업체 현황에 대한 데이터로 업체명, 영업소 주소(도로명), 전화번호 항목을 제공합니다.
Author경상남도 양산시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15103405

Alerts

전화번호 has 30 (43.5%) missing valuesMissing
업체명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 23:22:31.627743
Analysis finished2023-12-10 23:22:32.042789
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

UNIQUE 

Distinct69
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size684.0 B
2023-12-11T08:22:32.241946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length10
Mean length6.9275362
Min length3

Characters and Unicode

Total characters478
Distinct characters134
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique69 ?
Unique (%)100.0%

Sample

1st row(주)그린에코산업
2nd row(주)금강산업개발
3rd row(주)대궁
4th row(주)대아
5th row(주)동성
ValueCountFrequency (%)
주식회사 10
 
12.0%
한스클린영남 1
 
1.2%
정인티비엠 1
 
1.2%
양산지사 1
 
1.2%
인덱스코리아 1
 
1.2%
위드 1
 
1.2%
시우 1
 
1.2%
웅상청소박사 1
 
1.2%
강한 1
 
1.2%
좋은환경제로스 1
 
1.2%
Other values (64) 64
77.1%
2023-12-11T08:22:32.655573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
 
7.9%
( 28
 
5.9%
) 28
 
5.9%
15
 
3.1%
14
 
2.9%
13
 
2.7%
12
 
2.5%
11
 
2.3%
11
 
2.3%
10
 
2.1%
Other values (124) 298
62.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 393
82.2%
Open Punctuation 28
 
5.9%
Close Punctuation 28
 
5.9%
Space Separator 14
 
2.9%
Lowercase Letter 9
 
1.9%
Uppercase Letter 3
 
0.6%
Decimal Number 2
 
0.4%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
9.7%
15
 
3.8%
13
 
3.3%
12
 
3.1%
11
 
2.8%
11
 
2.8%
10
 
2.5%
9
 
2.3%
8
 
2.0%
8
 
2.0%
Other values (107) 258
65.6%
Lowercase Letter
ValueCountFrequency (%)
r 2
22.2%
o 1
11.1%
e 1
11.1%
z 1
11.1%
s 1
11.1%
u 1
11.1%
i 1
11.1%
v 1
11.1%
Uppercase Letter
ValueCountFrequency (%)
M 1
33.3%
B 1
33.3%
K 1
33.3%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Space Separator
ValueCountFrequency (%)
14
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 393
82.2%
Common 73
 
15.3%
Latin 12
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
9.7%
15
 
3.8%
13
 
3.3%
12
 
3.1%
11
 
2.8%
11
 
2.8%
10
 
2.5%
9
 
2.3%
8
 
2.0%
8
 
2.0%
Other values (107) 258
65.6%
Latin
ValueCountFrequency (%)
r 2
16.7%
M 1
8.3%
B 1
8.3%
K 1
8.3%
o 1
8.3%
e 1
8.3%
z 1
8.3%
s 1
8.3%
u 1
8.3%
i 1
8.3%
Common
ValueCountFrequency (%)
( 28
38.4%
) 28
38.4%
14
19.2%
2 1
 
1.4%
1 1
 
1.4%
- 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 393
82.2%
ASCII 85
 
17.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
38
 
9.7%
15
 
3.8%
13
 
3.3%
12
 
3.1%
11
 
2.8%
11
 
2.8%
10
 
2.5%
9
 
2.3%
8
 
2.0%
8
 
2.0%
Other values (107) 258
65.6%
ASCII
ValueCountFrequency (%)
( 28
32.9%
) 28
32.9%
14
16.5%
r 2
 
2.4%
2 1
 
1.2%
1 1
 
1.2%
M 1
 
1.2%
B 1
 
1.2%
K 1
 
1.2%
- 1
 
1.2%
Other values (7) 7
 
8.2%
Distinct67
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size684.0 B
2023-12-11T08:22:32.974878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length31
Mean length23.333333
Min length11

Characters and Unicode

Total characters1610
Distinct characters135
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)95.7%

Sample

1st row 양산시 북안남3길 10, 3층
2nd row 양산시 상북면 구소석2길 15, 1층
3rd row 양산시 물금읍 오봉로 169, 양산범어대동타운 상가동 4층 404호
4th row 양산시 동면 석금산로 10, 1동 1층 103호
5th row 양산시 물금읍 화산길 26-1, 2층
ValueCountFrequency (%)
양산시 69
 
18.9%
1층 19
 
5.2%
2층 18
 
4.9%
물금읍 15
 
4.1%
동면 14
 
3.8%
3층 7
 
1.9%
일부 6
 
1.6%
상북면 6
 
1.6%
4층 4
 
1.1%
202호 4
 
1.1%
Other values (155) 204
55.7%
2023-12-11T08:22:33.403178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
366
22.7%
1 84
 
5.2%
79
 
4.9%
77
 
4.8%
73
 
4.5%
2 71
 
4.4%
, 66
 
4.1%
51
 
3.2%
0 41
 
2.5%
41
 
2.5%
Other values (125) 661
41.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 801
49.8%
Space Separator 366
22.7%
Decimal Number 354
22.0%
Other Punctuation 66
 
4.1%
Dash Punctuation 15
 
0.9%
Close Punctuation 3
 
0.2%
Open Punctuation 3
 
0.2%
Uppercase Letter 1
 
0.1%
Lowercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
9.9%
77
 
9.6%
73
 
9.1%
51
 
6.4%
41
 
5.1%
40
 
5.0%
31
 
3.9%
29
 
3.6%
28
 
3.5%
21
 
2.6%
Other values (108) 331
41.3%
Decimal Number
ValueCountFrequency (%)
1 84
23.7%
2 71
20.1%
0 41
11.6%
3 39
11.0%
4 34
9.6%
5 25
 
7.1%
6 20
 
5.6%
9 19
 
5.4%
7 13
 
3.7%
8 8
 
2.3%
Space Separator
ValueCountFrequency (%)
366
100.0%
Other Punctuation
ValueCountFrequency (%)
, 66
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 807
50.1%
Hangul 801
49.8%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
9.9%
77
 
9.6%
73
 
9.1%
51
 
6.4%
41
 
5.1%
40
 
5.0%
31
 
3.9%
29
 
3.6%
28
 
3.5%
21
 
2.6%
Other values (108) 331
41.3%
Common
ValueCountFrequency (%)
366
45.4%
1 84
 
10.4%
2 71
 
8.8%
, 66
 
8.2%
0 41
 
5.1%
3 39
 
4.8%
4 34
 
4.2%
5 25
 
3.1%
6 20
 
2.5%
9 19
 
2.4%
Other values (5) 42
 
5.2%
Latin
ValueCountFrequency (%)
A 1
50.0%
e 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 809
50.2%
Hangul 801
49.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
366
45.2%
1 84
 
10.4%
2 71
 
8.8%
, 66
 
8.2%
0 41
 
5.1%
3 39
 
4.8%
4 34
 
4.2%
5 25
 
3.1%
6 20
 
2.5%
9 19
 
2.3%
Other values (7) 44
 
5.4%
Hangul
ValueCountFrequency (%)
79
 
9.9%
77
 
9.6%
73
 
9.1%
51
 
6.4%
41
 
5.1%
40
 
5.0%
31
 
3.9%
29
 
3.6%
28
 
3.5%
21
 
2.6%
Other values (108) 331
41.3%

전화번호
Text

MISSING 

Distinct37
Distinct (%)94.9%
Missing30
Missing (%)43.5%
Memory size684.0 B
2023-12-11T08:22:33.635289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.948718
Min length9

Characters and Unicode

Total characters466
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)92.3%

Sample

1st row055-374-7771
2nd row055-384-8822
3rd row055-364-7093
4th row055-381-2240
5th row055-781-1014
ValueCountFrequency (%)
055-375-1966 3
 
7.7%
0507-1306-5834 1
 
2.6%
055-367-3340 1
 
2.6%
1661-6264 1
 
2.6%
055-367-9212 1
 
2.6%
055-385-0500 1
 
2.6%
1588-5031 1
 
2.6%
055-366-6253 1
 
2.6%
055-387-4007 1
 
2.6%
055-382-5792 1
 
2.6%
Other values (27) 27
69.2%
2023-12-11T08:22:33.980201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 93
20.0%
- 76
16.3%
0 63
13.5%
3 55
11.8%
6 34
 
7.3%
8 33
 
7.1%
1 30
 
6.4%
7 26
 
5.6%
4 25
 
5.4%
2 18
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 390
83.7%
Dash Punctuation 76
 
16.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 93
23.8%
0 63
16.2%
3 55
14.1%
6 34
 
8.7%
8 33
 
8.5%
1 30
 
7.7%
7 26
 
6.7%
4 25
 
6.4%
2 18
 
4.6%
9 13
 
3.3%
Dash Punctuation
ValueCountFrequency (%)
- 76
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 466
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 93
20.0%
- 76
16.3%
0 63
13.5%
3 55
11.8%
6 34
 
7.3%
8 33
 
7.1%
1 30
 
6.4%
7 26
 
5.6%
4 25
 
5.4%
2 18
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 466
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 93
20.0%
- 76
16.3%
0 63
13.5%
3 55
11.8%
6 34
 
7.3%
8 33
 
7.1%
1 30
 
6.4%
7 26
 
5.6%
4 25
 
5.4%
2 18
 
3.9%

Correlations

2023-12-11T08:22:34.075129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명영업소 주소(도로명)전화번호
업체명1.0001.0001.000
영업소 주소(도로명)1.0001.0001.000
전화번호1.0001.0001.000

Missing values

2023-12-11T08:22:31.906208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:22:32.005025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명영업소 주소(도로명)전화번호
0(주)그린에코산업양산시 북안남3길 10, 3층<NA>
1(주)금강산업개발양산시 상북면 구소석2길 15, 1층055-374-7771
2(주)대궁양산시 물금읍 오봉로 169, 양산범어대동타운 상가동 4층 404호<NA>
3(주)대아양산시 동면 석금산로 10, 1동 1층 103호055-384-8822
4(주)동성양산시 물금읍 화산길 26-1, 2층055-364-7093
5(주)바른종합관리양산시 물금읍 백호로 76, 인정에코타운 4층 405호055-381-2240
6(주)바지런종합환경양산시 동면 석산6길 6-19, 103호055-781-1014
7(주)배성양산시 상북면 충렬로 955, 1층<NA>
8(주)삼우양산시 옥곡4길 16, (주)경남종합조경 2층 202호<NA>
9(주)서정이엔씨양산시 물금읍 신주로 35-1, 양산2차e편한세상아파트 상가동 103호<NA>
업체명영업소 주소(도로명)전화번호
59케이비엠(KBM)양산시 중앙로 232, 북부시장 A동 51호055-388-6093
60크린21양산시 동면 계석3길 29, 대정그린파크 상가동055-383-9248
61클린산업양산시 남부6길 6, 2층055-383-4954
62클린엔케어양산시 물금읍 목화로 21, 1층 일부055-388-4952
63태명공조양산시 동면 금오4길 59-15, 101호<NA>
64태창그린양산시 신기6길 8, 상신기마을회관 2층<NA>
65하이종합환경양산시 두전길 40, 삼성파크빌아파트 6동(상가) 2층 203호055-362-0780
66한가람양산시 양주1길 11, 재능교육 5층 508호<NA>
67한국선물넷(주)양산시 하북면 월평로 5, 202호055-364-4008
68한국종합관리양산시 북안남5길 30, 세종빌딩 411호055-388-0492