Overview

Dataset statistics

Number of variables: 4
Number of observations: 62
Missing cells: 27
Missing cells (%): 10.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.1 KiB
Average record size in memory: 34.1 B
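
The missing-cell percentage follows from the counts above. A minimal sketch of the arithmetic, assuming the percentage is taken over every cell (variables times observations); the variable names are illustrative and not part of the report:

```python
# Counts taken from the dataset statistics in this report.
n_variables = 4
n_observations = 62
missing_cells = 27

# Assumption: the percentage is computed over all cells in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{missing_cells} of {total_cells} cells missing = {missing_pct:.1f}%")
```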

Variable types

Categorical: 1
Text: 3

Dataset

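Description: 부산광역시_연제구_건물위생관리업현황_20190613 (Busan Metropolitan City, Yeonje-gu, status of building sanitation management businesses, 2019-06-13)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3082203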

Alerts

업종명 (business type) has constant value "건물위생관리업" (Constant)
소재지전화 (location phone number) has 27 (43.5%) missing values (Missing)
업소명 (business name) has unique values (Unique)

Reproduction

Analysis started: 2024-04-21 07:13:14.007442
Analysis finished: 2024-04-21 07:13:14.850325
Duration: 0.84 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
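
The reported duration can be checked against the two timestamps. A small sketch, assuming both timestamps are in the same time zone; the format string and variable names are illustrative:

```python
from datetime import datetime

# Timestamp layout used by the "Analysis started/finished" fields.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-04-21 07:13:14.007442", FMT)
finished = datetime.strptime("2024-04-21 07:13:14.850325", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```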

Variables

업종명 (business type)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 624.0 B
건물위생관리업 (building sanitation management business): 62

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%
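
The gap between Distinct (1) and Unique (0) is easier to see in code. The sketch below assumes that "Distinct" counts the different non-missing values and "Unique" counts the values that occur exactly once, which is consistent with the figures shown here; `distinct_and_unique` is a hypothetical helper, not a ydata-profiling function:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique) for a column, ignoring None entries."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)                                # different values
    unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once
    return distinct, unique

# A constant column: one distinct value, none of them occurring only once.
constant_column = ["건물위생관리업"] * 62
print(distinct_and_unique(constant_column))

# Illustrative column of 62 values where a single value appears twice.
one_repeat = [f"value-{i}" for i in range(61)] + ["value-0"]
print(distinct_and_unique(one_repeat))
```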

Sample

1st row: 건물위생관리업
2nd row: 건물위생관리업
3rd row: 건물위생관리업
4th row: 건물위생관리업
5th row: 건물위생관리업

Common Values

Value | Count | Frequency (%)
건물위생관리업 | 62 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values chart; 건물위생관리업 accounts for 62 values (100.0%)]

업소명 (business name)
Text

UNIQUE

Distinct: 62
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 624.0 B

Length

Max length: 15
Median length: 12
Mean length: 7.1774194
Min length: 2

Characters and Unicode

Total characters: 445
Distinct characters: 136
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
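
As a rough illustration of this kind of analysis, the sketch below tallies Unicode general categories for the five sample values of 업소명 using Python's standard `unicodedata` module. It is not the profiler's own implementation; `is_hangul` is a hypothetical helper that approximates script detection from the character name, because `unicodedata` exposes no script property:

```python
import unicodedata
from collections import Counter

# The five sample rows shown for this variable.
sample_names = [
    "(주)명건",
    "부일정보링크(주)",
    "(주)목평인력개발",
    "(주)항도시스템",
    "바지런토탈크리닝시스템",
]

# General category of every character, e.g. Lo = Other Letter,
# Ps = Open Punctuation, Pe = Close Punctuation, Zs = Space Separator.
category_counts = Counter(
    unicodedata.category(ch) for name in sample_names for ch in name
)

def is_hangul(ch):
    """Approximate script check based on the Unicode character name."""
    return unicodedata.name(ch, "").startswith("HANGUL")

hangul_chars = sum(is_hangul(ch) for name in sample_names for ch in name)

print(category_counts)
print("Hangul characters:", hangul_chars)
```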

Unique

Unique: 62
Unique (%): 100.0%

Sample

1st row: (주)명건
2nd row: 부일정보링크(주)
3rd row: (주)목평인력개발
4th row: (주)항도시스템
5th row: 바지런토탈크리닝시스템

Value | Count | Frequency (%)
주식회사 | 10 | 13.3%
주)명건 | 1 | 1.3%
주)로텍스부산 | 1 | 1.3%
주)영린텍 | 1 | 1.3%
주)포시즌에스앤씨건설 | 1 | 1.3%
주)두리컴 | 1 | 1.3%
주)노은 | 1 | 1.3%
신삼성 | 1 | 1.3%
대명종합관리시스템 | 1 | 1.3%
모든청소 | 1 | 1.3%
Other values (56) | 56 | 74.7%

Most occurring characters

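Value | Count | Frequency (%)
[Hangul character not recovered] | 40 | 9.0%
( | 28 | 6.3%
) | 28 | 6.3%
[Hangul character not recovered] | 17 | 3.8%
[Hangul character not recovered] | 16 | 3.6%
(space) | 14 | 3.1%
[Hangul character not recovered] | 13 | 2.9%
[Hangul character not recovered] | 13 | 2.9%
[Hangul character not recovered] | 9 | 2.0%
[Hangul character not recovered] | 8 | 1.8%
Other values (126) | 259 | 58.2%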

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 373 | 83.8%
Open Punctuation | 28 | 6.3%
Close Punctuation | 28 | 6.3%
Space Separator | 14 | 3.1%
Other Punctuation | 2 | 0.4%

Most frequent character per category

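Other Letter
Value | Count | Frequency (%)
[Hangul character not recovered] | 40 | 10.7%
[Hangul character not recovered] | 17 | 4.6%
[Hangul character not recovered] | 16 | 4.3%
[Hangul character not recovered] | 13 | 3.5%
[Hangul character not recovered] | 13 | 3.5%
[Hangul character not recovered] | 9 | 2.4%
[Hangul character not recovered] | 8 | 2.1%
[Hangul character not recovered] | 8 | 2.1%
[Hangul character not recovered] | 7 | 1.9%
[Hangul character not recovered] | 6 | 1.6%
Other values (121) | 236 | 63.3%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 50.0%
, | 1 | 50.0%

Open Punctuation
Value | Count | Frequency (%)
( | 28 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 28 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 14 | 100.0%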

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 373 | 83.8%
Common | 72 | 16.2%

Most frequent character per script

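Hangul
Value | Count | Frequency (%)
[Hangul character not recovered] | 40 | 10.7%
[Hangul character not recovered] | 17 | 4.6%
[Hangul character not recovered] | 16 | 4.3%
[Hangul character not recovered] | 13 | 3.5%
[Hangul character not recovered] | 13 | 3.5%
[Hangul character not recovered] | 9 | 2.4%
[Hangul character not recovered] | 8 | 2.1%
[Hangul character not recovered] | 8 | 2.1%
[Hangul character not recovered] | 7 | 1.9%
[Hangul character not recovered] | 6 | 1.6%
Other values (121) | 236 | 63.3%

Common
Value | Count | Frequency (%)
( | 28 | 38.9%
) | 28 | 38.9%
(space) | 14 | 19.4%
. | 1 | 1.4%
, | 1 | 1.4%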

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 373 | 83.8%
ASCII | 72 | 16.2%

Most frequent character per block

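Hangul
Value | Count | Frequency (%)
[Hangul character not recovered] | 40 | 10.7%
[Hangul character not recovered] | 17 | 4.6%
[Hangul character not recovered] | 16 | 4.3%
[Hangul character not recovered] | 13 | 3.5%
[Hangul character not recovered] | 13 | 3.5%
[Hangul character not recovered] | 9 | 2.4%
[Hangul character not recovered] | 8 | 2.1%
[Hangul character not recovered] | 8 | 2.1%
[Hangul character not recovered] | 7 | 1.9%
[Hangul character not recovered] | 6 | 1.6%
Other values (121) | 236 | 63.3%

ASCII
Value | Count | Frequency (%)
( | 28 | 38.9%
) | 28 | 38.9%
(space) | 14 | 19.4%
. | 1 | 1.4%
, | 1 | 1.4%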

업소소재지(도로명) (business address, road name)
Text

Distinct: 61
Distinct (%): 98.4%
Missing: 0
Missing (%): 0.0%
Memory size: 624.0 B

Length

Max length: 50
Median length: 38
Mean length: 30
Min length: 22
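
Length statistics of this kind can be reproduced with the standard library. The sketch below uses only the five sample addresses shown for this variable, so its results describe that sample and not the full column; it assumes length is counted in Unicode code points, as Python's `len` does:

```python
import statistics

# The five sample rows shown for this variable.
sample_addresses = [
    "부산광역시 연제구 과정로 84, 4층 (연산동)",
    "부산광역시 연제구 거제대로 295 (거제동)",
    "부산광역시 연제구 고분로32번길 43 (연산동)",
    "부산광역시 연제구 월드컵대로 20 (연산동)",
    "부산광역시 연제구 신촌로 35-6 (연산동)",
]

# One length per value, counted in code points.
lengths = [len(address) for address in sample_addresses]

summary = {
    "max": max(lengths),
    "median": statistics.median(lengths),
    "mean": statistics.mean(lengths),
    "min": min(lengths),
}
print(summary)
```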

Characters and Unicode

Total characters: 1860
Distinct characters: 93
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 60
Unique (%): 96.8%

Sample

1st row: 부산광역시 연제구 과정로 84, 4층 (연산동)
2nd row: 부산광역시 연제구 거제대로 295 (거제동)
3rd row: 부산광역시 연제구 고분로32번길 43 (연산동)
4th row: 부산광역시 연제구 월드컵대로 20 (연산동)
5th row: 부산광역시 연제구 신촌로 35-6 (연산동)

Value | Count | Frequency (%)
부산광역시 | 62 | 17.0%
연제구 | 62 | 17.0%
연산동 | 41 | 11.3%
거제동 | 15 | 4.1%
1층 | 10 | 2.7%
2층 | 5 | 1.4%
쌍미천로 | 5 | 1.4%
거제대로 | 4 | 1.1%
3층 | 4 | 1.1%
지하1층 | 4 | 1.1%
Other values (115) | 152 | 41.8%

Most occurring characters

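Value | Count | Frequency (%)
(space) | 302 | 16.2%
[Hangul character not recovered] | 111 | 6.0%
[Hangul character not recovered] | 108 | 5.8%
[Hangul character not recovered] | 88 | 4.7%
1 | 69 | 3.7%
( | 65 | 3.5%
) | 65 | 3.5%
[Hangul character not recovered] | 65 | 3.5%
[Hangul character not recovered] | 64 | 3.4%
[Hangul character not recovered] | 63 | 3.4%
Other values (83) | 860 | 46.2%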

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1086 | 58.4%
Space Separator | 302 | 16.2%
Decimal Number | 284 | 15.3%
Open Punctuation | 65 | 3.5%
Close Punctuation | 65 | 3.5%
Other Punctuation | 52 | 2.8%
Dash Punctuation | 6 | 0.3%

Most frequent character per category

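Other Letter
Value | Count | Frequency (%)
[Hangul character not recovered] | 111 | 10.2%
[Hangul character not recovered] | 108 | 9.9%
[Hangul character not recovered] | 88 | 8.1%
[Hangul character not recovered] | 65 | 6.0%
[Hangul character not recovered] | 64 | 5.9%
[Hangul character not recovered] | 63 | 5.8%
[Hangul character not recovered] | 62 | 5.7%
[Hangul character not recovered] | 62 | 5.7%
[Hangul character not recovered] | 62 | 5.7%
[Hangul character not recovered] | 62 | 5.7%
Other values (67) | 339 | 31.2%

Decimal Number
Value | Count | Frequency (%)
1 | 69 | 24.3%
2 | 50 | 17.6%
3 | 31 | 10.9%
5 | 27 | 9.5%
0 | 26 | 9.2%
4 | 25 | 8.8%
8 | 24 | 8.5%
7 | 12 | 4.2%
6 | 11 | 3.9%
9 | 9 | 3.2%

Other Punctuation
Value | Count | Frequency (%)
, | 51 | 98.1%
. | 1 | 1.9%

Space Separator
Value | Count | Frequency (%)
(space) | 302 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 65 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 65 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 6 | 100.0%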

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1086 | 58.4%
Common | 774 | 41.6%

Most frequent character per script

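Hangul
Value | Count | Frequency (%)
[Hangul character not recovered] | 111 | 10.2%
[Hangul character not recovered] | 108 | 9.9%
[Hangul character not recovered] | 88 | 8.1%
[Hangul character not recovered] | 65 | 6.0%
[Hangul character not recovered] | 64 | 5.9%
[Hangul character not recovered] | 63 | 5.8%
[Hangul character not recovered] | 62 | 5.7%
[Hangul character not recovered] | 62 | 5.7%
[Hangul character not recovered] | 62 | 5.7%
[Hangul character not recovered] | 62 | 5.7%
Other values (67) | 339 | 31.2%

Common
Value | Count | Frequency (%)
(space) | 302 | 39.0%
1 | 69 | 8.9%
( | 65 | 8.4%
) | 65 | 8.4%
, | 51 | 6.6%
2 | 50 | 6.5%
3 | 31 | 4.0%
5 | 27 | 3.5%
0 | 26 | 3.4%
4 | 25 | 3.2%
Other values (6) | 63 | 8.1%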

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1086 | 58.4%
ASCII | 774 | 41.6%

Most frequent character per block

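ASCII
Value | Count | Frequency (%)
(space) | 302 | 39.0%
1 | 69 | 8.9%
( | 65 | 8.4%
) | 65 | 8.4%
, | 51 | 6.6%
2 | 50 | 6.5%
3 | 31 | 4.0%
5 | 27 | 3.5%
0 | 26 | 3.4%
4 | 25 | 3.2%
Other values (6) | 63 | 8.1%

Hangul
Value | Count | Frequency (%)
[Hangul character not recovered] | 111 | 10.2%
[Hangul character not recovered] | 108 | 9.9%
[Hangul character not recovered] | 88 | 8.1%
[Hangul character not recovered] | 65 | 6.0%
[Hangul character not recovered] | 64 | 5.9%
[Hangul character not recovered] | 63 | 5.8%
[Hangul character not recovered] | 62 | 5.7%
[Hangul character not recovered] | 62 | 5.7%
[Hangul character not recovered] | 62 | 5.7%
[Hangul character not recovered] | 62 | 5.7%
Other values (67) | 339 | 31.2%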

소재지전화 (location phone number)
Text

MISSING

Distinct: 35
Distinct (%): 100.0%
Missing: 27
Missing (%): 43.5%
Memory size: 624.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 420
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 35
Unique (%): 100.0%

Sample

1st row: 051-505-2070
2nd row: 051-850-2022
3rd row: 051-852-5571
4th row: 051-861-1525
5th row: 051-867-6469

Value | Count | Frequency (%)
051-505-2070 | 1 | 2.9%
051-853-0412 | 1 | 2.9%
051-558-1685 | 1 | 2.9%
051-809-8090 | 1 | 2.9%
051-528-1152 | 1 | 2.9%
051-868-4929 | 1 | 2.9%
051-929-0166 | 1 | 2.9%
051-507-9053 | 1 | 2.9%
051-728-7740 | 1 | 2.9%
051-746-5273 | 1 | 2.9%
Other values (25) | 25 | 71.4%
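
Every listed phone number is 12 characters long and follows a three-three-four digit layout separated by hyphens. A hedged sketch of a format check built on that observation; the pattern and the helper name `is_well_formed` are assumptions for illustration, not rules published with the dataset:

```python
import re

# Assumed layout: 3 digits, hyphen, 3 digits, hyphen, 4 digits (12 characters).
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")

def is_well_formed(phone):
    """Return True when a non-missing value matches the assumed layout."""
    return phone is not None and PHONE_PATTERN.fullmatch(phone) is not None

samples = [
    "051-505-2070",
    "051-850-2022",
    "051-852-5571",
    "051-861-1525",
    "051-867-6469",
    None,  # a missing value, shown as <NA> in the report
]

for value in samples:
    print(value, is_well_formed(value))
```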

Most occurring characters

Value | Count | Frequency (%)
5 | 72 | 17.1%
- | 70 | 16.7%
0 | 64 | 15.2%
1 | 60 | 14.3%
2 | 30 | 7.1%
8 | 27 | 6.4%
7 | 24 | 5.7%
6 | 24 | 5.7%
9 | 18 | 4.3%
3 | 17 | 4.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 350 | 83.3%
Dash Punctuation | 70 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 72 | 20.6%
0 | 64 | 18.3%
1 | 60 | 17.1%
2 | 30 | 8.6%
8 | 27 | 7.7%
7 | 24 | 6.9%
6 | 24 | 6.9%
9 | 18 | 5.1%
3 | 17 | 4.9%
4 | 14 | 4.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 70 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 420 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
5 | 72 | 17.1%
- | 70 | 16.7%
0 | 64 | 15.2%
1 | 60 | 14.3%
2 | 30 | 7.1%
8 | 27 | 6.4%
7 | 24 | 5.7%
6 | 24 | 5.7%
9 | 18 | 4.3%
3 | 17 | 4.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 420 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
5 | 72 | 17.1%
- | 70 | 16.7%
0 | 64 | 15.2%
1 | 60 | 14.3%
2 | 30 | 7.1%
8 | 27 | 6.4%
7 | 24 | 5.7%
6 | 24 | 5.7%
9 | 18 | 4.3%
3 | 17 | 4.0%

Correlations

[Figure: correlation heatmap]

Variable | 업소명 | 업소소재지(도로명) | 소재지전화
업소명 | 1.000 | 1.000 | 1.000
업소소재지(도로명) | 1.000 | 1.000 | 1.000
소재지전화 | 1.000 | 1.000 | 1.000

Missing values

[Figure: nullity count plot] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
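
The counts behind a nullity plot can be sketched without any plotting library. The example below uses the 업소명 and 소재지전화 values of the last ten rows of this dataset (index 52 to 61), with `None` standing in for `<NA>`; `missing_per_column` is a hypothetical helper, not part of ydata-profiling:

```python
def missing_per_column(rows, columns):
    """Count None entries per column across a list of row dictionaries."""
    return {col: sum(1 for row in rows if row.get(col) is None) for col in columns}

# Rows 52-61 of the dataset, reduced to two columns; None marks <NA>.
last_rows = [
    {"업소명": "그린행복주식회사", "소재지전화": None},
    {"업소명": "베스트크린", "소재지전화": None},
    {"업소명": "크린월드", "소재지전화": None},
    {"업소명": "하얀나라", "소재지전화": "051-900-0602"},
    {"업소명": "토탈그린서비스", "소재지전화": None},
    {"업소명": "(주)화이트네스트", "소재지전화": "051-728-7740"},
    {"업소명": "대성환경", "소재지전화": None},
    {"업소명": "토탈케어", "소재지전화": None},
    {"업소명": "부산크린", "소재지전화": None},
    {"업소명": "엔오원크린서비스", "소재지전화": "051-335-0816"},
]

nullity = missing_per_column(last_rows, ["업소명", "소재지전화"])
print(nullity)

# Column-level share for the full dataset: 27 missing phone numbers in 62 rows.
phone_missing_pct = 100 * 27 / 62
print(f"{phone_missing_pct:.1f}%")
```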

Sample

First rows

Index | 업종명 | 업소명 | 업소소재지(도로명) | 소재지전화
0 | 건물위생관리업 | (주)명건 | 부산광역시 연제구 과정로 84, 4층 (연산동) | 051-505-2070
1 | 건물위생관리업 | 부일정보링크(주) | 부산광역시 연제구 거제대로 295 (거제동) | 051-850-2022
2 | 건물위생관리업 | (주)목평인력개발 | 부산광역시 연제구 고분로32번길 43 (연산동) | 051-852-5571
3 | 건물위생관리업 | (주)항도시스템 | 부산광역시 연제구 월드컵대로 20 (연산동) | 051-861-1525
4 | 건물위생관리업 | 바지런토탈크리닝시스템 | 부산광역시 연제구 신촌로 35-6 (연산동) | 051-867-6469
5 | 건물위생관리업 | (주)현대용역 | 부산광역시 연제구 중앙천로 65 (연산동) | 051-851-0950
6 | 건물위생관리업 | 롯데환경 | 부산광역시 연제구 해맞이로 71 (거제동) | 051-501-0118
7 | 건물위생관리업 | (주)다명 | 부산광역시 연제구 법원로 20 (거제동) | 051-948-2297
8 | 건물위생관리업 | 연제지역 자활센터, 청솔환경 | 부산광역시 연제구 쌍미천로 58, 상가동 203,205,206호 (연산동, 연산훼미리타운) | 051-852-8219
9 | 건물위생관리업 | (주)대한시스템 | 부산광역시 연제구 신촌로 18 (연산동,(4층)) | 051-867-6592

Last rows

Index | 업종명 | 업소명 | 업소소재지(도로명) | 소재지전화
52 | 건물위생관리업 | 그린행복주식회사 | 부산광역시 연제구 중앙대로1039번길 4, 3층 301호 (연산동) | <NA>
53 | 건물위생관리업 | 베스트크린 | 부산광역시 연제구 월드컵대로20번길 14, 201호 (연산동, 성진빌라) | <NA>
54 | 건물위생관리업 | 크린월드 | 부산광역시 연제구 연미로13번길 28, 1층 (연산동) | <NA>
55 | 건물위생관리업 | 하얀나라 | 부산광역시 연제구 과정로278번길 12, 1층 (연산동) | 051-900-0602
56 | 건물위생관리업 | 토탈그린서비스 | 부산광역시 연제구 과정로278번길 12 (연산동) | <NA>
57 | 건물위생관리업 | (주)화이트네스트 | 부산광역시 연제구 마곡천로 18, 1층 (연산동) | 051-728-7740
58 | 건물위생관리업 | 대성환경 | 부산광역시 연제구 쌍미천로 28, 1.2층 (연산동) | <NA>
59 | 건물위생관리업 | 토탈케어 | 부산광역시 연제구 안연로7번길 10, 1층 (연산동) | <NA>
60 | 건물위생관리업 | 부산크린 | 부산광역시 연제구 배산북로 38-1, 1층 (연산동) | <NA>
61 | 건물위생관리업 | 엔오원크린서비스 | 부산광역시 연제구 과정로191번가길 45, 1층 13호 (연산동) | 051-335-0816