Overview

Dataset statistics

Number of variables: 4
Number of observations: 64
Missing cells: 12
Missing cells (%): 4.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.1 KiB
Average record size in memory: 34.1 B
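
The missing-cell percentage can be checked from the other figures in this table. The short sketch below assumes the percentage is taken over all cells (variables times observations); the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 4
n_observations = 64
missing_cells = 12

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

All 12 missing cells sit in a single column (소재지전화), which is why the per-column rate reported for that variable is higher than this table-wide figure.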

Variable types

Categorical: 1
Text: 3

Dataset

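Description: 부산광역시_동구_건물위생관리업현황_20200121 (Busan Metropolitan City, Dong-gu, status of building sanitation management businesses, 2020-01-21)
Author: 부산광역시 동구 (Dong-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15028640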

Alerts

업종명 has constant value "건물위생관리업" (Constant)
소재지전화 has 12 (18.8%) missing values (Missing)
업소명 has unique values (Unique)
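
As a rough illustration of what the three alerts mean, the sketch below classifies a column from its values. The function `column_alerts` and its rules are hypothetical simplifications written for this example; they are not ydata-profiling's own implementation, and the name and phone lists are placeholders shaped to match the counts in this report.

```python
def column_alerts(values):
    """Return illustrative alert labels for one column; None marks a missing cell."""
    present = [v for v in values if v is not None]
    alerts = []
    if len(set(present)) == 1:
        alerts.append("Constant")   # every non-missing value is identical
    if len(present) < len(values):
        alerts.append("Missing")    # at least one cell is empty
    if len(present) == len(values) > 1 and len(set(present)) == len(values):
        alerts.append("Unique")     # no missing cells and no repeated value
    return alerts

# Placeholder columns with the same shape as the report's figures.
business_type = ["건물위생관리업"] * 64
names = [f"business-{i}" for i in range(64)]
phones = ["051-469-8866"] * 2 + [f"051-000-{i:04d}" for i in range(50)] + [None] * 12

for label, column in [("업종명", business_type), ("업소명", names), ("소재지전화", phones)]:
    print(label, column_alerts(column))
```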

Reproduction

Analysis started: 2023-12-10 16:23:48.821711
Analysis finished: 2023-12-10 16:23:49.340046
Duration: 0.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
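
The reported duration follows from the two timestamps. A minimal check using only the standard library, with the timestamps copied from above:

```python
from datetime import datetime

started = datetime.fromisoformat("2023-12-10 16:23:48.821711")
finished = datetime.fromisoformat("2023-12-10 16:23:49.340046")

# Elapsed wall-clock time between the two timestamps, in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```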

Variables

업종명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B

Value  Count
건물위생관리업  64

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%
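
"Distinct" and "Unique" measure different things, which is why this column shows 1 distinct value but 0 unique ones. The sketch below assumes that distinct counts the different values present, while unique counts the values that occur exactly once; the variable names are illustrative.

```python
from collections import Counter

# The column holds the same value in all 64 rows.
values = ["건물위생관리업"] * 64
counts = Counter(values)

n_distinct = len(counts)                              # different values present
n_unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once
distinct_pct = 100 * n_distinct / len(values)

print(n_distinct, n_unique, f"{distinct_pct:.1f}%")
```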

Sample

1st row: 건물위생관리업
2nd row: 건물위생관리업
3rd row: 건물위생관리업
4th row: 건물위생관리업
5th row: 건물위생관리업

Common Values

Value  Count  Frequency (%)
건물위생관리업  64  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
건물위생관리업  64  100.0%

업소명
Text

UNIQUE 

Distinct: 64
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B

Length

Max length: 18
Median length: 13
Mean length: 8.546875
Min length: 4

Characters and Unicode

Total characters: 547
Distinct characters: 142
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
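
To make the category counts concrete, the sketch below tallies Unicode general categories for the first sample value of this column using Python's standard `unicodedata` module. The two-letter codes correspond to the labels used in this report: `Lo` is Other Letter, `Ps` is Open Punctuation, and `Pe` is Close Punctuation. This is only an illustration of the idea, not the profiler's own code.

```python
import unicodedata
from collections import Counter

name = "삼부종합개발(주)"  # first sample row of 업소명

# Count how many characters fall into each Unicode general category.
categories = Counter(unicodedata.category(ch) for ch in name)

for code, count in categories.most_common():
    print(code, count)
```

Summing such tallies over all 64 names would give a per-category table like the one reported for this variable.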

Unique

Unique: 64
Unique (%): 100.0%

Sample

1st row: 삼부종합개발(주)
2nd row: 협성개발주식회사
3rd row: (주)창성개발
4th row: 한미환경공사
5th row: (주)미성기업

Value  Count  Frequency (%)
주식회사  5  6.8%
삼부종합개발(주  1  1.4%
복지사업단  1  1.4%
주)실버종합물류  1  1.4%
주)정오시스템  1  1.4%
한국노인생활지원재단  1  1.4%
사회복지법인  1  1.4%
국제크린기술(주  1  1.4%
부산동구시니어클럽  1  1.4%
천혜기업  1  1.4%
Other values (59)  59  80.8%

Most occurring characters

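Value  Count  Frequency (%)
(glyph lost)  45  8.2%
)  40  7.3%
(  38  6.9%
(glyph lost)  20  3.7%
(glyph lost)  19  3.5%
(glyph lost)  15  2.7%
(glyph lost)  12  2.2%
(glyph lost)  12  2.2%
(glyph lost)  10  1.8%
(glyph lost)  10  1.8%
Other values (132)  326  59.6%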

Most occurring categories

Value  Count  Frequency (%)
Other Letter  458  83.7%
Close Punctuation  40  7.3%
Open Punctuation  38  6.9%
Space Separator  9  1.6%
Uppercase Letter  2  0.4%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
(glyph lost)  45  9.8%
(glyph lost)  20  4.4%
(glyph lost)  19  4.1%
(glyph lost)  15  3.3%
(glyph lost)  12  2.6%
(glyph lost)  12  2.6%
(glyph lost)  10  2.2%
(glyph lost)  10  2.2%
(glyph lost)  10  2.2%
(glyph lost)  9  2.0%
Other values (127)  296  64.6%

Uppercase Letter
Value  Count  Frequency (%)
T  1  50.0%
S  1  50.0%

Close Punctuation
Value  Count  Frequency (%)
)  40  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  38  100.0%

Space Separator
Value  Count  Frequency (%)
(space)  9  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  458  83.7%
Common  87  15.9%
Latin  2  0.4%

Most frequent character per script

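Hangul
Value  Count  Frequency (%)
(glyph lost)  45  9.8%
(glyph lost)  20  4.4%
(glyph lost)  19  4.1%
(glyph lost)  15  3.3%
(glyph lost)  12  2.6%
(glyph lost)  12  2.6%
(glyph lost)  10  2.2%
(glyph lost)  10  2.2%
(glyph lost)  10  2.2%
(glyph lost)  9  2.0%
Other values (127)  296  64.6%

Common
Value  Count  Frequency (%)
)  40  46.0%
(  38  43.7%
(space)  9  10.3%

Latin
Value  Count  Frequency (%)
T  1  50.0%
S  1  50.0%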

Most occurring blocks

Value  Count  Frequency (%)
Hangul  458  83.7%
ASCII  89  16.3%

Most frequent character per block

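Hangul
Value  Count  Frequency (%)
(glyph lost)  45  9.8%
(glyph lost)  20  4.4%
(glyph lost)  19  4.1%
(glyph lost)  15  3.3%
(glyph lost)  12  2.6%
(glyph lost)  12  2.6%
(glyph lost)  10  2.2%
(glyph lost)  10  2.2%
(glyph lost)  10  2.2%
(glyph lost)  9  2.0%
Other values (127)  296  64.6%

ASCII
Value  Count  Frequency (%)
)  40  44.9%
(  38  42.7%
(space)  9  10.1%
T  1  1.1%
S  1  1.1%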

업소소재지(도로명)
Text

Distinct: 61
Distinct (%): 95.3%
Missing: 0
Missing (%): 0.0%
Memory size: 644.0 B

Length

Max length: 45
Median length: 38
Mean length: 31.78125
Min length: 20

Characters and Unicode

Total characters: 2034
Distinct characters: 103
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
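
The mean lengths reported for the three text variables are consistent with their total character counts divided by the number of non-missing values. The sketch below reproduces that arithmetic from figures in this report; the dictionary layout is illustrative, and the non-missing count for 소재지전화 (64 minus 12) is derived here rather than quoted.

```python
# (total characters, non-missing values) per text variable, from this report.
stats = {
    "업소명": (547, 64),
    "업소소재지(도로명)": (2034, 64),
    "소재지전화": (624, 64 - 12),  # 12 of the 64 phone numbers are missing
}

# Mean length = total characters / number of non-missing values.
mean_length = {name: total / n for name, (total, n) in stats.items()}

for name, value in mean_length.items():
    print(name, value)
```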

Unique

Unique: 59
Unique (%): 92.2%

Sample

1st row: 부산광역시 동구 조방로49번길 18 (범일동,대흥빌딩 402호)
2nd row: 부산광역시 동구 조방로 39 (범일동,썬오피스텔14층1403호)
3rd row: 부산광역시 동구 중앙대로349번길 38, 5층 (수정동)
4th row: 부산광역시 동구 고관로 37 (수정동)
5th row: 부산광역시 동구 자성로 129 (범일동,대우증권B/D 지하1층)

Value  Count  Frequency (%)
부산광역시  64  16.4%
동구  64  16.4%
초량동  36  9.2%
중앙대로  14  3.6%
범일동  10  2.6%
수정동  9  2.3%
3층  6  1.5%
2층  5  1.3%
5층  5  1.3%
고관로  5  1.3%
Other values (123)  173  44.2%

Most occurring characters

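Value  Count  Frequency (%)
(space)  328  16.1%
(glyph lost)  131  6.4%
)  69  3.4%
(  69  3.4%
(glyph lost)  67  3.3%
(glyph lost)  66  3.2%
(glyph lost)  66  3.2%
(glyph lost)  65  3.2%
(glyph lost)  64  3.1%
(glyph lost)  64  3.1%
Other values (93)  1045  51.4%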

Most occurring categories

Value  Count  Frequency (%)
Other Letter  1133  55.7%
Decimal Number  346  17.0%
Space Separator  328  16.1%
Close Punctuation  69  3.4%
Open Punctuation  69  3.4%
Other Punctuation  58  2.9%
Dash Punctuation  24  1.2%
Uppercase Letter  7  0.3%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
(glyph lost)  131  11.6%
(glyph lost)  67  5.9%
(glyph lost)  66  5.8%
(glyph lost)  66  5.8%
(glyph lost)  65  5.7%
(glyph lost)  64  5.6%
(glyph lost)  64  5.6%
(glyph lost)  64  5.6%
(glyph lost)  42  3.7%
(glyph lost)  42  3.7%
Other values (72)  462  40.8%

Decimal Number
Value  Count  Frequency (%)
3  62  17.9%
1  61  17.6%
2  51  14.7%
6  31  9.0%
0  29  8.4%
8  25  7.2%
4  25  7.2%
7  23  6.6%
9  21  6.1%
5  18  5.2%

Uppercase Letter
Value  Count  Frequency (%)
B  3  42.9%
D  1  14.3%
O  1  14.3%
T  1  14.3%
A  1  14.3%

Other Punctuation
Value  Count  Frequency (%)
,  56  96.6%
/  2  3.4%

Space Separator
Value  Count  Frequency (%)
(space)  328  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  69  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  69  100.0%

Dash Punctuation
Value  Count  Frequency (%)
-  24  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  1133  55.7%
Common  894  44.0%
Latin  7  0.3%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
(glyph lost)  131  11.6%
(glyph lost)  67  5.9%
(glyph lost)  66  5.8%
(glyph lost)  66  5.8%
(glyph lost)  65  5.7%
(glyph lost)  64  5.6%
(glyph lost)  64  5.6%
(glyph lost)  64  5.6%
(glyph lost)  42  3.7%
(glyph lost)  42  3.7%
Other values (72)  462  40.8%

Common
Value  Count  Frequency (%)
(space)  328  36.7%
)  69  7.7%
(  69  7.7%
3  62  6.9%
1  61  6.8%
,  56  6.3%
2  51  5.7%
6  31  3.5%
0  29  3.2%
8  25  2.8%
Other values (6)  113  12.6%

Latin
Value  Count  Frequency (%)
B  3  42.9%
D  1  14.3%
O  1  14.3%
T  1  14.3%
A  1  14.3%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  1133  55.7%
ASCII  901  44.3%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
(space)  328  36.4%
)  69  7.7%
(  69  7.7%
3  62  6.9%
1  61  6.8%
,  56  6.2%
2  51  5.7%
6  31  3.4%
0  29  3.2%
8  25  2.8%
Other values (11)  120  13.3%

Hangul
Value  Count  Frequency (%)
(glyph lost)  131  11.6%
(glyph lost)  67  5.9%
(glyph lost)  66  5.8%
(glyph lost)  66  5.8%
(glyph lost)  65  5.7%
(glyph lost)  64  5.6%
(glyph lost)  64  5.6%
(glyph lost)  64  5.6%
(glyph lost)  42  3.7%
(glyph lost)  42  3.7%
Other values (72)  462  40.8%

소재지전화
Text

MISSING 

Distinct: 51
Distinct (%): 98.1%
Missing: 12
Missing (%): 18.8%
Memory size: 644.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 624
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
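
These figures suggest that every phone number shares one fixed layout: each value is 12 characters long and only digits and dashes occur. The sketch below checks the five sample values against an assumed pattern of three digits, three digits, and four digits separated by dashes, then derives the character totals that pattern would imply for the 52 non-missing values. The pattern is inferred from the samples, not stated by the report.

```python
import re

# Assumed layout inferred from the samples: NNN-NNN-NNNN (12 characters).
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")

samples = ["051-632-6182", "051-643-1101", "051-468-8434",
           "051-463-8636", "051-634-0134"]
all_match = all(PHONE_PATTERN.fullmatch(s) for s in samples)

non_missing = 64 - 12                 # observations minus missing values
expected_dashes = 2 * non_missing     # two dashes per number
expected_digits = 10 * non_missing    # ten digits per number
expected_total = expected_dashes + expected_digits

print(all_match, expected_dashes, expected_digits, expected_total)
```

The implied totals (104 dashes, 520 digits, 624 characters) agree with the character and category counts reported for this variable.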

Unique

Unique: 50
Unique (%): 96.2%

Sample

1st row: 051-632-6182
2nd row: 051-643-1101
3rd row: 051-468-8434
4th row: 051-463-8636
5th row: 051-634-0134

Value  Count  Frequency (%)
051-469-8866  2  3.8%
051-463-2721  1  1.9%
051-632-6182  1  1.9%
051-503-2470  1  1.9%
051-441-6660  1  1.9%
051-638-9111  1  1.9%
051-851-8633  1  1.9%
051-328-6488  1  1.9%
051-442-3469  1  1.9%
051-469-0900  1  1.9%
Other values (41)  41  78.8%

Most occurring characters

Value  Count  Frequency (%)
-  104  16.7%
1  98  15.7%
0  85  13.6%
5  82  13.1%
4  68  10.9%
6  61  9.8%
8  33  5.3%
3  32  5.1%
2  29  4.6%
9  20  3.2%

Most occurring categories

Value  Count  Frequency (%)
Decimal Number  520  83.3%
Dash Punctuation  104  16.7%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
1  98  18.8%
0  85  16.3%
5  82  15.8%
4  68  13.1%
6  61  11.7%
8  33  6.3%
3  32  6.2%
2  29  5.6%
9  20  3.8%
7  12  2.3%

Dash Punctuation
Value  Count  Frequency (%)
-  104  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Common  624  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
-  104  16.7%
1  98  15.7%
0  85  13.6%
5  82  13.1%
4  68  10.9%
6  61  9.8%
8  33  5.3%
3  32  5.1%
2  29  4.6%
9  20  3.2%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  624  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
-  104  16.7%
1  98  15.7%
0  85  13.6%
5  82  13.1%
4  68  10.9%
6  61  9.8%
8  33  5.3%
3  32  5.1%
2  29  4.6%
9  20  3.2%

Correlations

                    업소명  업소소재지(도로명)  소재지전화
업소명              1.000  1.000  1.000
업소소재지(도로명)  1.000  1.000  1.000
소재지전화          1.000  1.000  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
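
Since the plots themselves are not reproduced here, the sketch below shows the idea behind both views on a few rows taken from the end of the dataset: a per-column count of missing cells, and a row-by-row nullity pattern in which `#` marks a present cell and `.` a missing one. The text rendering is an illustrative stand-in for the report's charts, and only two of the four columns are included for brevity.

```python
# Rows 54 to 59 of the dataset, restricted to (업소명, 소재지전화); None marks <NA>.
rows = [
    ("(주)더청화", "051-995-8140"),
    ("주식회사 코스포", None),
    ("녹색환경", "051-461-0315"),
    ("에스지푸드앤펀주식회사", "051-469-9248"),
    ("(주)정인시스템", "051-637-0330"),
    ("(주)다온코스믹", None),
]
columns = ("업소명", "소재지전화")

# Nullity by column: number of missing cells in each column.
missing_per_column = {
    col: sum(1 for row in rows if row[i] is None)
    for i, col in enumerate(columns)
}

# Nullity matrix: one string per row, '#' = present, '.' = missing.
nullity_matrix = ["".join("#" if cell is not None else "." for cell in row)
                  for row in rows]

print(missing_per_column)
print("\n".join(nullity_matrix))
```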

Sample

    업종명  업소명  업소소재지(도로명)  소재지전화
0  건물위생관리업  삼부종합개발(주)  부산광역시 동구 조방로49번길 18 (범일동,대흥빌딩 402호)  051-632-6182
1  건물위생관리업  협성개발주식회사  부산광역시 동구 조방로 39 (범일동,썬오피스텔14층1403호)  051-643-1101
2  건물위생관리업  (주)창성개발  부산광역시 동구 중앙대로349번길 38, 5층 (수정동)  051-468-8434
3  건물위생관리업  한미환경공사  부산광역시 동구 고관로 37 (수정동)  051-463-8636
4  건물위생관리업  (주)미성기업  부산광역시 동구 자성로 129 (범일동,대우증권B/D 지하1층)  051-634-0134
5  건물위생관리업  (주)한국시엔드지  부산광역시 동구 중앙대로 263, 국제오피스텔 1811,1812호 (초량동)  051-441-1235
6  건물위생관리업  (주)국제보안공사  부산광역시 동구 조방로49번길 18-1, 5층 (범일동)  051-441-7171
7  건물위생관리업  (주)현대오에스시스템  부산광역시 동구 중앙대로236번길 3-6 (초량동)  051-441-6100
8  건물위생관리업  (주)고태  부산광역시 동구 중앙대로 338 (초량동,연합뉴스빌딩 6층)  051-462-5191
9  건물위생관리업  타워크리닝시스템  부산광역시 동구 망양로875번길 39-21 (범일동)  051-462-1482

    업종명  업소명  업소소재지(도로명)  소재지전화
54  건물위생관리업  (주)더청화  부산광역시 동구 중앙대로236번길 7-6, 302호 (초량동)  051-995-8140
55  건물위생관리업  주식회사 코스포  부산광역시 동구 중앙대로 181, 3층 (초량동)  <NA>
56  건물위생관리업  녹색환경  부산광역시 동구 망양로 683, 지하1층 (수정동)  051-461-0315
57  건물위생관리업  에스지푸드앤펀주식회사  부산광역시 동구 중앙대로 297 (초량동)  051-469-9248
58  건물위생관리업  (주)정인시스템  부산광역시 동구 중앙대로 514, 지하1층 135호 (범일동)  051-637-0330
59  건물위생관리업  (주)다온코스믹  부산광역시 동구 중앙대로 270, 721호 (초량동)  <NA>
60  건물위생관리업  부산동구지역자활센터  부산광역시 동구 중앙대로 284, 3층 (초량동)  051-462-1466
61  건물위생관리업  (주)스카이씨마린  부산광역시 동구 중앙대로286번길 7, 10층 1003호 (초량동)  051-463-2625
62  건물위생관리업  코스포서비스 주식회사  부산광역시 동구 중앙대로 216, 교원빌딩 9층 (초량동)  051-469-9360
63  건물위생관리업  청소나라  부산광역시 동구 중앙대로214번길 7-5, 지하1층 (초량동)  051-464-3515