Overview

Dataset statistics

Number of variables4
Number of observations77
Missing cells20
Missing cells (%)6.5%
Duplicate rows3
Duplicate rows (%)3.9%
Total size in memory2.5 KiB
Average record size in memory33.7 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시연제구_건물위생관리업현황_20230620
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3082203

Alerts

업종명 has constant value ""Constant
Dataset has 3 (3.9%) duplicate rowsDuplicates
소재지전화 has 20 (26.0%) missing valuesMissing

Reproduction

Analysis started2024-04-21 07:13:03.886967
Analysis finished2024-04-21 07:13:04.803085
Duration0.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size744.0 B
건물위생관리업
77 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건물위생관리업
2nd row건물위생관리업
3rd row건물위생관리업
4th row건물위생관리업
5th row건물위생관리업

Common Values

ValueCountFrequency (%)
건물위생관리업 77
100.0%

Length

2024-04-21T16:13:05.007773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T16:13:05.312573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건물위생관리업 77
100.0%
Distinct74
Distinct (%)96.1%
Missing0
Missing (%)0.0%
Memory size744.0 B
2024-04-21T16:13:06.101237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length12
Mean length7.6103896
Min length2

Characters and Unicode

Total characters586
Distinct characters165
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)92.2%

Sample

1st row(주)명건
2nd row(주)휴넥트
3rd row(주)목평인력개발
4th row(주)항도시스템
5th row바지런토탈크리닝시스템
ValueCountFrequency (%)
주식회사 14
 
14.1%
내츄럴사이언스 2
 
2.0%
영식 2
 
2.0%
건물주택관리 2
 
2.0%
주)팜스 2
 
2.0%
일호 1
 
1.0%
주)금길 1
 
1.0%
엔오원크린서비스 1
 
1.0%
주)성진산업개발 1
 
1.0%
주)명건 1
 
1.0%
Other values (72) 72
72.7%
2024-04-21T16:13:07.181387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
49
 
8.4%
( 33
 
5.6%
) 33
 
5.6%
25
 
4.3%
23
 
3.9%
19
 
3.2%
18
 
3.1%
16
 
2.7%
16
 
2.7%
12
 
2.0%
Other values (155) 342
58.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 478
81.6%
Open Punctuation 33
 
5.6%
Close Punctuation 33
 
5.6%
Space Separator 23
 
3.9%
Lowercase Letter 9
 
1.5%
Uppercase Letter 7
 
1.2%
Other Punctuation 3
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
10.3%
25
 
5.2%
19
 
4.0%
18
 
3.8%
16
 
3.3%
16
 
3.3%
12
 
2.5%
9
 
1.9%
8
 
1.7%
7
 
1.5%
Other values (138) 299
62.6%
Lowercase Letter
ValueCountFrequency (%)
t 2
22.2%
o 2
22.2%
s 1
11.1%
l 1
11.1%
r 1
11.1%
e 1
11.1%
n 1
11.1%
Uppercase Letter
ValueCountFrequency (%)
C 2
28.6%
B 2
28.6%
M 1
14.3%
E 1
14.3%
P 1
14.3%
Other Punctuation
ValueCountFrequency (%)
& 2
66.7%
. 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 33
100.0%
Close Punctuation
ValueCountFrequency (%)
) 33
100.0%
Space Separator
ValueCountFrequency (%)
23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 478
81.6%
Common 92
 
15.7%
Latin 16
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
10.3%
25
 
5.2%
19
 
4.0%
18
 
3.8%
16
 
3.3%
16
 
3.3%
12
 
2.5%
9
 
1.9%
8
 
1.7%
7
 
1.5%
Other values (138) 299
62.6%
Latin
ValueCountFrequency (%)
C 2
12.5%
B 2
12.5%
t 2
12.5%
o 2
12.5%
M 1
6.2%
E 1
6.2%
s 1
6.2%
P 1
6.2%
l 1
6.2%
r 1
6.2%
Other values (2) 2
12.5%
Common
ValueCountFrequency (%)
( 33
35.9%
) 33
35.9%
23
25.0%
& 2
 
2.2%
. 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 478
81.6%
ASCII 108
 
18.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
49
 
10.3%
25
 
5.2%
19
 
4.0%
18
 
3.8%
16
 
3.3%
16
 
3.3%
12
 
2.5%
9
 
1.9%
8
 
1.7%
7
 
1.5%
Other values (138) 299
62.6%
ASCII
ValueCountFrequency (%)
( 33
30.6%
) 33
30.6%
23
21.3%
C 2
 
1.9%
& 2
 
1.9%
B 2
 
1.9%
t 2
 
1.9%
o 2
 
1.9%
M 1
 
0.9%
E 1
 
0.9%
Other values (7) 7
 
6.5%
Distinct70
Distinct (%)90.9%
Missing0
Missing (%)0.0%
Memory size744.0 B
2024-04-21T16:13:08.230669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length41
Mean length32.012987
Min length22

Characters and Unicode

Total characters2465
Distinct characters116
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique64 ?
Unique (%)83.1%

Sample

1st row부산광역시 연제구 과정로 84, 4층 (연산동)
2nd row부산광역시 연제구 중앙대로 1217, 국제문화센터 17층 일부호 (거제동)
3rd row부산광역시 연제구 고분로32번길 43 (연산동)
4th row부산광역시 연제구 월드컵대로 20 (연산동)
5th row부산광역시 연제구 신촌로 35-6 (연산동)
ValueCountFrequency (%)
부산광역시 77
 
15.9%
연제구 77
 
15.9%
연산동 60
 
12.4%
거제동 16
 
3.3%
1층 13
 
2.7%
2층 12
 
2.5%
지하1층 12
 
2.5%
과정로 8
 
1.6%
중앙대로 6
 
1.2%
쌍미천로 6
 
1.2%
Other values (135) 198
40.8%
2024-04-21T16:13:09.728538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
408
 
16.6%
141
 
5.7%
137
 
5.6%
102
 
4.1%
1 101
 
4.1%
81
 
3.3%
80
 
3.2%
80
 
3.2%
79
 
3.2%
) 78
 
3.2%
Other values (106) 1178
47.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1438
58.3%
Space Separator 408
 
16.6%
Decimal Number 379
 
15.4%
Close Punctuation 78
 
3.2%
Open Punctuation 78
 
3.2%
Other Punctuation 72
 
2.9%
Dash Punctuation 6
 
0.2%
Uppercase Letter 6
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
141
 
9.8%
137
 
9.5%
102
 
7.1%
81
 
5.6%
80
 
5.6%
80
 
5.6%
79
 
5.5%
77
 
5.4%
77
 
5.4%
77
 
5.4%
Other values (84) 507
35.3%
Decimal Number
ValueCountFrequency (%)
1 101
26.6%
2 70
18.5%
0 41
10.8%
3 34
 
9.0%
5 31
 
8.2%
4 28
 
7.4%
8 24
 
6.3%
7 21
 
5.5%
9 16
 
4.2%
6 13
 
3.4%
Uppercase Letter
ValueCountFrequency (%)
S 1
16.7%
K 1
16.7%
V 1
16.7%
I 1
16.7%
E 1
16.7%
W 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 71
98.6%
. 1
 
1.4%
Space Separator
ValueCountFrequency (%)
408
100.0%
Close Punctuation
ValueCountFrequency (%)
) 78
100.0%
Open Punctuation
ValueCountFrequency (%)
( 78
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1438
58.3%
Common 1021
41.4%
Latin 6
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
141
 
9.8%
137
 
9.5%
102
 
7.1%
81
 
5.6%
80
 
5.6%
80
 
5.6%
79
 
5.5%
77
 
5.4%
77
 
5.4%
77
 
5.4%
Other values (84) 507
35.3%
Common
ValueCountFrequency (%)
408
40.0%
1 101
 
9.9%
) 78
 
7.6%
( 78
 
7.6%
, 71
 
7.0%
2 70
 
6.9%
0 41
 
4.0%
3 34
 
3.3%
5 31
 
3.0%
4 28
 
2.7%
Other values (6) 81
 
7.9%
Latin
ValueCountFrequency (%)
S 1
16.7%
K 1
16.7%
V 1
16.7%
I 1
16.7%
E 1
16.7%
W 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1438
58.3%
ASCII 1027
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
408
39.7%
1 101
 
9.8%
) 78
 
7.6%
( 78
 
7.6%
, 71
 
6.9%
2 70
 
6.8%
0 41
 
4.0%
3 34
 
3.3%
5 31
 
3.0%
4 28
 
2.7%
Other values (12) 87
 
8.5%
Hangul
ValueCountFrequency (%)
141
 
9.8%
137
 
9.5%
102
 
7.1%
81
 
5.6%
80
 
5.6%
80
 
5.6%
79
 
5.5%
77
 
5.4%
77
 
5.4%
77
 
5.4%
Other values (84) 507
35.3%

소재지전화
Text

MISSING 

Distinct56
Distinct (%)98.2%
Missing20
Missing (%)26.0%
Memory size744.0 B
2024-04-21T16:13:10.630462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.859649
Min length9

Characters and Unicode

Total characters676
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)96.5%

Sample

1st row051-505-2070
2nd row051-850-2036
3rd row051-852-5571
4th row051-861-1525
5th row051-867-6469
ValueCountFrequency (%)
051-865-7373 2
 
3.5%
051-809-8090 1
 
1.8%
051-728-7740 1
 
1.8%
051-505-2070 1
 
1.8%
1661-1714 1
 
1.8%
051-853-3372 1
 
1.8%
051-853-0412 1
 
1.8%
051-532-5200 1
 
1.8%
051-757-3753 1
 
1.8%
051-465-9428 1
 
1.8%
Other values (46) 46
80.7%
2024-04-21T16:13:11.924133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 111
16.4%
0 103
15.2%
5 102
15.1%
1 91
13.5%
6 48
7.1%
2 46
6.8%
8 44
 
6.5%
7 42
 
6.2%
3 34
 
5.0%
9 28
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 565
83.6%
Dash Punctuation 111
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 103
18.2%
5 102
18.1%
1 91
16.1%
6 48
8.5%
2 46
8.1%
8 44
7.8%
7 42
7.4%
3 34
 
6.0%
9 28
 
5.0%
4 27
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 111
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 676
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 111
16.4%
0 103
15.2%
5 102
15.1%
1 91
13.5%
6 48
7.1%
2 46
6.8%
8 44
 
6.5%
7 42
 
6.2%
3 34
 
5.0%
9 28
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 676
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 111
16.4%
0 103
15.2%
5 102
15.1%
1 91
13.5%
6 48
7.1%
2 46
6.8%
8 44
 
6.5%
7 42
 
6.2%
3 34
 
5.0%
9 28
 
4.1%

Correlations

2024-04-21T16:13:12.300695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명영업소 주소(도로명)소재지전화
업소명1.0001.0001.000
영업소 주소(도로명)1.0001.0001.000
소재지전화1.0001.0001.000

Missing values

2024-04-21T16:13:04.404176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T16:13:04.691839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)소재지전화
0건물위생관리업(주)명건부산광역시 연제구 과정로 84, 4층 (연산동)051-505-2070
1건물위생관리업(주)휴넥트부산광역시 연제구 중앙대로 1217, 국제문화센터 17층 일부호 (거제동)051-850-2036
2건물위생관리업(주)목평인력개발부산광역시 연제구 고분로32번길 43 (연산동)051-852-5571
3건물위생관리업(주)항도시스템부산광역시 연제구 월드컵대로 20 (연산동)051-861-1525
4건물위생관리업바지런토탈크리닝시스템부산광역시 연제구 신촌로 35-6 (연산동)051-867-6469
5건물위생관리업주식회사 금조개발부산광역시 연제구 중앙천로14번길 1, 지하1층 (연산동)051-462-0881
6건물위생관리업롯데환경부산광역시 연제구 해맞이로 71 (거제동)051-501-0118
7건물위생관리업(주)다명부산광역시 연제구 법원로 20, 로제스티빌딩 601-1호 (거제동)051-948-2297
8건물위생관리업부산연제지역자활센터부산광역시 연제구 토곡로9번길 16, 1층 101호 (연산동)051-852-8219
9건물위생관리업(주)대한시스템부산광역시 연제구 마곡천로30번길 2, 2층 (연산동)051-867-6592
업종명업소명영업소 주소(도로명)소재지전화
67건물위생관리업(주)팜스부산광역시 연제구 법원남로15번길 22, 파라존빌딩 5층 503호 (거제동)051-865-7373
68건물위생관리업영식 건물주택관리부산광역시 연제구 쌍미천로 39, 1층 (연산동)<NA>
69건물위생관리업내츄럴사이언스부산광역시 연제구 과정로 340, 지하1층 (연산동)<NA>
70건물위생관리업에스클린부산광역시 연제구 중앙천로70번길 20, 1층 (연산동)1600-7458
71건물위생관리업오엔케이 크리닝부산광역시 연제구 대리로12번길 21, 2층 (연산동)<NA>
72건물위생관리업(주)금길부산광역시 연제구 법원남로 9, 보은빌딩 지하1층 (거제동)051-506-6520
73건물위생관리업창하토탈관리부산광역시 연제구 중앙대로 1080, 디오빌 905호 (연산동)<NA>
74건물위생관리업(주)팜스부산광역시 연제구 법원남로15번길 22, 파라존빌딩 5층 503호 (거제동)051-865-7373
75건물위생관리업영식 건물주택관리부산광역시 연제구 쌍미천로 39, 1층 (연산동)<NA>
76건물위생관리업내츄럴사이언스부산광역시 연제구 과정로 340, 지하1층 (연산동)<NA>

Duplicate rows

Most frequently occurring

업종명업소명영업소 주소(도로명)소재지전화# duplicates
0건물위생관리업(주)팜스부산광역시 연제구 법원남로15번길 22, 파라존빌딩 5층 503호 (거제동)051-865-73732
1건물위생관리업내츄럴사이언스부산광역시 연제구 과정로 340, 지하1층 (연산동)<NA>2
2건물위생관리업영식 건물주택관리부산광역시 연제구 쌍미천로 39, 1층 (연산동)<NA>2