Overview

Dataset statistics

Number of variables: 4
Number of observations: 65
Missing cells: 15
Missing cells (%): 5.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.2 KiB
Average record size in memory: 34.0 B
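
The missing-cell percentage follows from the counts above: 15 missing cells out of 4 × 65 = 260 cells in total. A minimal sketch of that arithmetic is shown here; the function name is illustrative and is not part of ydata-profiling.

```python
def missing_cell_pct(missing_cells: int, n_variables: int, n_observations: int) -> float:
    """Return missing cells as a percentage of all cells in the table."""
    total_cells = n_variables * n_observations
    return 100.0 * missing_cells / total_cells


pct = missing_cell_pct(15, 4, 65)
print(f"{pct:.1f}%")  # 15 / 260 is about 5.8%
```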

Variable types

Categorical: 1
Text: 3

Dataset

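Description: 부산광역시_동구_건물위생관리업현황_20210117 (Busan Metropolitan City, Dong-gu: status of building sanitation management businesses, 20210117)
Author: 부산광역시 동구 (Dong-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15028640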

Alerts

업종명 has constant value "건물위생관리업" (Constant)
소재지전화 has 15 (23.1%) missing values (Missing)
업소명 has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 16:23:44.565010
Analysis finished: 2023-12-10 16:23:44.983017
Duration: 0.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B
건물위생관리업: 65

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 건물위생관리업
2nd row: 건물위생관리업
3rd row: 건물위생관리업
4th row: 건물위생관리업
5th row: 건물위생관리업

Common Values

Value | Count | Frequency (%)
건물위생관리업 | 65 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; 건물위생관리업 accounts for all 65 rows (100.0%)]

업소명
Text

UNIQUE 

Distinct: 65
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B

Length

Max length: 18
Median length: 13
Mean length: 8.3846154
Min length: 2

Characters and Unicode

Total characters: 545
Distinct characters: 144
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
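
As a rough illustration of what these character properties look like in practice, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and this is not necessarily how ydata-profiling computes its tables. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Ps` is Open Punctuation, `Pe` is Close Punctuation, `Zs` is Space Separator, and `Lu` is Uppercase Letter.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters in `text` by Unicode general category (e.g. 'Lo', 'Ps')."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the sample values of 업소명 shown in this report.
counts = category_counts("(주)더청화")
print(counts)
```

Applied to every value in a column and summed, counts of this kind give the "Most occurring categories" tables.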

Unique

Unique: 65
Unique (%): 100.0%

Sample

1st row: 클린에어존 부산
2nd row: (주)실버종합물류
3rd row: (주)대홍관리
4th row: (주)다온코스믹
5th row: (주)더청화

Value | Count | Frequency (%)
주식회사 | 7 | 9.1%
주)보광비에스 | 1 | 1.3%
주)정오시스템 | 1 | 1.3%
전문환경공사 | 1 | 1.3%
창성환경 | 1 | 1.3%
주)휴먼존 | 1 | 1.3%
주)씨앤에스코리아 | 1 | 1.3%
우림 | 1 | 1.3%
주)한미안전공사 | 1 | 1.3%
클린에어존 | 1 | 1.3%
Other values (61) | 61 | 79.2%

Most occurring characters

ValueCountFrequency (%)
44
 
8.1%
) 38
 
7.0%
( 36
 
6.6%
20
 
3.7%
19
 
3.5%
15
 
2.8%
12
 
2.2%
12
 
2.2%
12
 
2.2%
11
 
2.0%
Other values (134) 326
59.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 457
83.9%
Close Punctuation 38
 
7.0%
Open Punctuation 36
 
6.6%
Space Separator 12
 
2.2%
Uppercase Letter 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
9.6%
20
 
4.4%
19
 
4.2%
15
 
3.3%
12
 
2.6%
12
 
2.6%
11
 
2.4%
10
 
2.2%
9
 
2.0%
9
 
2.0%
Other values (129) 296
64.8%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
T 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 457
83.9%
Common 86
 
15.8%
Latin 2
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
9.6%
20
 
4.4%
19
 
4.2%
15
 
3.3%
12
 
2.6%
12
 
2.6%
11
 
2.4%
10
 
2.2%
9
 
2.0%
9
 
2.0%
Other values (129) 296
64.8%
Common
ValueCountFrequency (%)
) 38
44.2%
( 36
41.9%
12
 
14.0%
Latin
ValueCountFrequency (%)
S 1
50.0%
T 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 457
83.9%
ASCII 88
 
16.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
44
 
9.6%
20
 
4.4%
19
 
4.2%
15
 
3.3%
12
 
2.6%
12
 
2.6%
11
 
2.4%
10
 
2.2%
9
 
2.0%
9
 
2.0%
Other values (129) 296
64.8%
ASCII
ValueCountFrequency (%)
) 38
43.2%
( 36
40.9%
12
 
13.6%
S 1
 
1.1%
T 1
 
1.1%

업소소재지(도로명)
Text

Distinct: 62
Distinct (%): 95.4%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B

Length

Max length: 49
Median length: 37
Mean length: 32.323077
Min length: 20

Characters and Unicode

Total characters: 2101
Distinct characters: 108
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
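
The block breakdown for this variable can be approximated from code point ranges alone. The sketch below assumes that the report's "Hangul" block corresponds to the Hangul Syllables block (U+AC00 to U+D7AF) and that "ASCII" means U+0000 to U+007F. The helper `unicode_block` is hypothetical and covers only these two blocks.

```python
def unicode_block(ch: str) -> str:
    """Classify a character into one of the two blocks seen in this report."""
    cp = ord(ch)
    if cp <= 0x7F:
        return "ASCII"  # Basic Latin, U+0000..U+007F
    if 0xAC00 <= cp <= 0xD7AF:
        return "Hangul"  # Hangul Syllables block, U+AC00..U+D7AF (assumed mapping)
    return "Other"


# One of the sample addresses shown in this report.
address = "부산광역시 동구 중앙대로 270, 721호 (초량동)"
blocks = {}
for ch in address:
    name = unicode_block(ch)
    blocks[name] = blocks.get(name, 0) + 1
print(blocks)
```

Digits, spaces, commas, and parentheses all fall into ASCII, which is why that block makes up a large share of an address column.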

Unique

Unique: 60
Unique (%): 92.3%

Sample

1st row: 부산광역시 동구 초량로 99, 지하1층 B101호 (초량동, 도형빌라)
2nd row: 부산광역시 동구 중앙대로248번길 7-6, 4층 (초량동)
3rd row: 부산광역시 동구 중앙대로286번길 9, 3층 306호 (초량동, 녹원빌딩)
4th row: 부산광역시 동구 중앙대로 270, 721호 (초량동)
5th row: 부산광역시 동구 중앙대로236번길 7-6, 302호 (초량동)

Value | Count | Frequency (%)
부산광역시 | 65 | 16.1%
동구 | 65 | 16.1%
초량동 | 36 | 8.9%
중앙대로 | 12 | 3.0%
수정동 | 10 | 2.5%
범일동 | 9 | 2.2%
지하1층 | 7 | 1.7%
2층 | 6 | 1.5%
고관로 | 5 | 1.2%
3층 | 5 | 1.2%
Other values (131) | 184 | 45.5%

Most occurring characters

ValueCountFrequency (%)
340
 
16.2%
133
 
6.3%
) 70
 
3.3%
( 70
 
3.3%
68
 
3.2%
67
 
3.2%
66
 
3.1%
66
 
3.1%
66
 
3.1%
65
 
3.1%
Other values (98) 1090
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1168
55.6%
Decimal Number 363
 
17.3%
Space Separator 340
 
16.2%
Close Punctuation 70
 
3.3%
Open Punctuation 70
 
3.3%
Other Punctuation 61
 
2.9%
Dash Punctuation 21
 
1.0%
Uppercase Letter 8
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
133
 
11.4%
68
 
5.8%
67
 
5.7%
66
 
5.7%
66
 
5.7%
66
 
5.7%
65
 
5.6%
65
 
5.6%
42
 
3.6%
42
 
3.6%
Other values (77) 488
41.8%
Decimal Number
ValueCountFrequency (%)
1 65
17.9%
3 64
17.6%
2 54
14.9%
0 34
9.4%
6 32
8.8%
7 25
 
6.9%
8 25
 
6.9%
4 23
 
6.3%
5 21
 
5.8%
9 20
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
B 4
50.0%
D 1
 
12.5%
A 1
 
12.5%
O 1
 
12.5%
T 1
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 59
96.7%
/ 2
 
3.3%
Space Separator
ValueCountFrequency (%)
340
100.0%
Close Punctuation
ValueCountFrequency (%)
) 70
100.0%
Open Punctuation
ValueCountFrequency (%)
( 70
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1168
55.6%
Common 925
44.0%
Latin 8
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
133
 
11.4%
68
 
5.8%
67
 
5.7%
66
 
5.7%
66
 
5.7%
66
 
5.7%
65
 
5.6%
65
 
5.6%
42
 
3.6%
42
 
3.6%
Other values (77) 488
41.8%
Common
ValueCountFrequency (%)
340
36.8%
) 70
 
7.6%
( 70
 
7.6%
1 65
 
7.0%
3 64
 
6.9%
, 59
 
6.4%
2 54
 
5.8%
0 34
 
3.7%
6 32
 
3.5%
7 25
 
2.7%
Other values (6) 112
 
12.1%
Latin
ValueCountFrequency (%)
B 4
50.0%
D 1
 
12.5%
A 1
 
12.5%
O 1
 
12.5%
T 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1168
55.6%
ASCII 933
44.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
340
36.4%
) 70
 
7.5%
( 70
 
7.5%
1 65
 
7.0%
3 64
 
6.9%
, 59
 
6.3%
2 54
 
5.8%
0 34
 
3.6%
6 32
 
3.4%
7 25
 
2.7%
Other values (11) 120
 
12.9%
Hangul
ValueCountFrequency (%)
133
 
11.4%
68
 
5.8%
67
 
5.7%
66
 
5.7%
66
 
5.7%
66
 
5.7%
65
 
5.6%
65
 
5.6%
42
 
3.6%
42
 
3.6%
Other values (77) 488
41.8%

소재지전화
Text

MISSING 

Distinct: 48
Distinct (%): 96.0%
Missing: 15
Missing (%): 23.1%
Memory size: 652.0 B

Length

Max length: 12
Median length: 12
Mean length: 11.94
Min length: 9

Characters and Unicode

Total characters: 597
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 46
Unique (%): 92.0%
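
The percentages for this variable appear to be taken over the 50 non-missing values (65 observations minus 15 missing) rather than over all 65 rows: 48 / 50 = 96.0% distinct and 46 / 50 = 92.0% unique. The mean length is consistent with the same reading, since 597 characters / 50 values = 11.94. A small check of that interpretation, using a hypothetical helper name:

```python
def pct_of_present(count: int, n_observations: int, n_missing: int) -> float:
    """Percentage of `count` relative to the non-missing values only."""
    present = n_observations - n_missing
    return 100.0 * count / present


distinct_pct = pct_of_present(48, 65, 15)  # distinct values among present rows
unique_pct = pct_of_present(46, 65, 15)    # values that occur exactly once
mean_length = 597 / (65 - 15)              # total characters per present value
print(distinct_pct, unique_pct, mean_length)
```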

Sample

1st row: 1644-1965
2nd row: 051-995-8140
3rd row: 051-863-0605
4th row: 051-851-8633
5th row: 051-714-6099

Value | Count | Frequency (%)
051-469-8866 | 2 | 4.0%
051-610-1510 | 2 | 4.0%
051-851-9782 | 1 | 2.0%
1644-1965 | 1 | 2.0%
051-643-1101 | 1 | 2.0%
051-442-5059 | 1 | 2.0%
051-442-3469 | 1 | 2.0%
051-441-3361 | 1 | 2.0%
051-441-2244 | 1 | 2.0%
051-328-6488 | 1 | 2.0%
Other values (38) | 38 | 76.0%

Most occurring characters

Value | Count | Frequency (%)
- | 99 | 16.6%
1 | 93 | 15.6%
0 | 81 | 13.6%
5 | 80 | 13.4%
4 | 63 | 10.6%
6 | 58 | 9.7%
8 | 32 | 5.4%
3 | 31 | 5.2%
2 | 29 | 4.9%
9 | 19 | 3.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 498 | 83.4%
Dash Punctuation | 99 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 93 | 18.7%
0 | 81 | 16.3%
5 | 80 | 16.1%
4 | 63 | 12.7%
6 | 58 | 11.6%
8 | 32 | 6.4%
3 | 31 | 6.2%
2 | 29 | 5.8%
9 | 19 | 3.8%
7 | 12 | 2.4%
Dash Punctuation
Value | Count | Frequency (%)
- | 99 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 597 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 99 | 16.6%
1 | 93 | 15.6%
0 | 81 | 13.6%
5 | 80 | 13.4%
4 | 63 | 10.6%
6 | 58 | 9.7%
8 | 32 | 5.4%
3 | 31 | 5.2%
2 | 29 | 4.9%
9 | 19 | 3.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 597 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 99 | 16.6%
1 | 93 | 15.6%
0 | 81 | 13.6%
5 | 80 | 13.4%
4 | 63 | 10.6%
6 | 58 | 9.7%
8 | 32 | 5.4%
3 | 31 | 5.2%
2 | 29 | 4.9%
9 | 19 | 3.2%

Correlations

Variable | 업소명 | 업소소재지(도로명) | 소재지전화
업소명 | 1.000 | 1.000 | 1.000
업소소재지(도로명) | 1.000 | 1.000 | 1.000
소재지전화 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
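
To make the idea of a nullity display concrete, the sketch below builds a missing-value mask for the first five 소재지전화 values from this report's sample rows, with `None` standing in for the `<NA>` marker. It is only a plain-Python illustration of the concept, not the plotting code the report itself uses.

```python
# First five 소재지전화 values from the report's sample rows (index 0 to 4);
# None stands in for the <NA> marker.
phones = ["1644-1965", None, None, None, "051-995-8140"]

# Nullity mask: True where a value is missing.
mask = [value is None for value in phones]
missing = sum(mask)
print(mask, missing, f"{100.0 * missing / len(phones):.1f}%")

# A one-line text rendering of the column: '#' = present, '.' = missing.
row_pattern = "".join("." if is_missing else "#" for is_missing in mask)
print(row_pattern)
```

Stacking one such pattern per column, with rows running down the page, gives the kind of matrix the figure describes.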

Sample

Index | 업종명 | 업소명 | 업소소재지(도로명) | 소재지전화
0 | 건물위생관리업 | 클린에어존 부산 | 부산광역시 동구 초량로 99, 지하1층 B101호 (초량동, 도형빌라) | 1644-1965
1 | 건물위생관리업 | (주)실버종합물류 | 부산광역시 동구 중앙대로248번길 7-6, 4층 (초량동) | <NA>
2 | 건물위생관리업 | (주)대홍관리 | 부산광역시 동구 중앙대로286번길 9, 3층 306호 (초량동, 녹원빌딩) | <NA>
3 | 건물위생관리업 | (주)다온코스믹 | 부산광역시 동구 중앙대로 270, 721호 (초량동) | <NA>
4 | 건물위생관리업 | (주)더청화 | 부산광역시 동구 중앙대로236번길 7-6, 302호 (초량동) | 051-995-8140
5 | 건물위생관리업 | (사)부산장애인총연합회 복지사업단 | 부산광역시 동구 중앙대로196번길 12-3, 9층 (초량동) | 051-863-0605
6 | 건물위생관리업 | 대한민국고엽제전우회부산지부 | 부산광역시 동구 고관로 5 (초량동, 부산보훈회관 3층) | 051-851-8633
7 | 건물위생관리업 | 부산동구시니어클럽 | 부산광역시 동구 초량상로 161 (수정동, 에스엔시빌딩 201호) | 051-714-6099
8 | 건물위생관리업 | 에스엔시(주) | 부산광역시 동구 자성공원로 23 (범일동,자성공원로 23 (범일동)) | 051-638-9111
9 | 건물위생관리업 | (주)정인시스템 | 부산광역시 동구 중앙대로 514, 지하1층 135호 (범일동) | 051-637-0330

Index | 업종명 | 업소명 | 업소소재지(도로명) | 소재지전화
55 | 건물위생관리업 | (주)제이에이치관리 | 부산광역시 동구 조방로 39, 썬오피스텔 지상4층 401호 (범일동) | 051-631-8066
56 | 건물위생관리업 | (주)정오개발 | 부산광역시 동구 중앙대로320번길 7-3 (초량동) | 051-469-8866
57 | 건물위생관리업 | 한미환경공사 | 부산광역시 동구 고관로 37 (수정동) | 051-463-8636
58 | 건물위생관리업 | (주)고태 | 부산광역시 동구 중앙대로 338 (초량동,연합뉴스빌딩 6층) | 051-462-5191
59 | 건물위생관리업 | 타워크리닝시스템 | 부산광역시 동구 망양로875번길 39-21 (범일동) | 051-462-1482
60 | 건물위생관리업 | 주식회사 금조개발 | 부산광역시 동구 중앙대로308번길 3-5, 지상3층 (초량동) | 051-462-0881
61 | 건물위생관리업 | (주)국제보안공사 | 부산광역시 동구 조방로49번길 18-1, 5층 (범일동) | 051-441-7171
62 | 건물위생관리업 | (주)현대오에스시스템 | 부산광역시 동구 중앙대로236번길 3-6 (초량동) | 051-441-6100
63 | 건물위생관리업 | (주)한국시엔드지 | 부산광역시 동구 중앙대로 263, 국제오피스텔 20층 2003호 (초량동) | 051-441-1235
64 | 건물위생관리업 | (주)보훈기업 | 부산광역시 동구 중앙대로214번길 7-11 (초량동) | 051-441-0747