Overview

Dataset statistics

Number of variables3
Number of observations67
Missing cells15
Missing cells (%)7.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory25.9 B

Variable types

Text3

Dataset

Description부산광역시연제구_건물위생관리업현황_20200624
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3082203

Alerts

소재지전화 has 15 (22.4%) missing valuesMissing
업소명 has unique valuesUnique

Reproduction

Analysis started2024-04-21 07:13:23.597075
Analysis finished2024-04-21 07:13:24.369053
Duration0.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소명
Text

UNIQUE 

Distinct67
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size664.0 B
2024-04-21T16:13:25.124684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length14
Mean length7.8208955
Min length2

Characters and Unicode

Total characters524
Distinct characters154
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique67 ?
Unique (%)100.0%

Sample

1st row(주)명건
2nd row부일정보링크(주)
3rd row(주)목평인력개발
4th row(주)항도시스템
5th row바지런토탈크리닝시스템
ValueCountFrequency (%)
주식회사 12
 
14.0%
주)명건 1
 
1.2%
부일환경(주 1
 
1.2%
주)이화환경 1
 
1.2%
신화종합환경 1
 
1.2%
한결같이 1
 
1.2%
모든청소 1
 
1.2%
주)성진산업개발 1
 
1.2%
주)케이제이씨에스 1
 
1.2%
주)영린텍 1
 
1.2%
Other values (65) 65
75.6%
2024-04-21T16:13:26.442154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
44
 
8.4%
( 32
 
6.1%
) 32
 
6.1%
20
 
3.8%
18
 
3.4%
17
 
3.2%
14
 
2.7%
14
 
2.7%
12
 
2.3%
9
 
1.7%
Other values (144) 312
59.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 423
80.7%
Open Punctuation 32
 
6.1%
Close Punctuation 32
 
6.1%
Space Separator 20
 
3.8%
Lowercase Letter 9
 
1.7%
Uppercase Letter 5
 
1.0%
Other Punctuation 3
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
10.4%
18
 
4.3%
17
 
4.0%
14
 
3.3%
14
 
3.3%
12
 
2.8%
9
 
2.1%
8
 
1.9%
8
 
1.9%
7
 
1.7%
Other values (127) 272
64.3%
Lowercase Letter
ValueCountFrequency (%)
t 2
22.2%
o 2
22.2%
l 1
11.1%
r 1
11.1%
n 1
11.1%
s 1
11.1%
e 1
11.1%
Uppercase Letter
ValueCountFrequency (%)
B 2
40.0%
C 1
20.0%
P 1
20.0%
M 1
20.0%
Other Punctuation
ValueCountFrequency (%)
& 1
33.3%
. 1
33.3%
, 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Space Separator
ValueCountFrequency (%)
20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 423
80.7%
Common 87
 
16.6%
Latin 14
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
10.4%
18
 
4.3%
17
 
4.0%
14
 
3.3%
14
 
3.3%
12
 
2.8%
9
 
2.1%
8
 
1.9%
8
 
1.9%
7
 
1.7%
Other values (127) 272
64.3%
Latin
ValueCountFrequency (%)
t 2
14.3%
o 2
14.3%
B 2
14.3%
l 1
7.1%
r 1
7.1%
n 1
7.1%
C 1
7.1%
s 1
7.1%
e 1
7.1%
P 1
7.1%
Common
ValueCountFrequency (%)
( 32
36.8%
) 32
36.8%
20
23.0%
& 1
 
1.1%
. 1
 
1.1%
, 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 423
80.7%
ASCII 101
 
19.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
44
 
10.4%
18
 
4.3%
17
 
4.0%
14
 
3.3%
14
 
3.3%
12
 
2.8%
9
 
2.1%
8
 
1.9%
8
 
1.9%
7
 
1.7%
Other values (127) 272
64.3%
ASCII
ValueCountFrequency (%)
( 32
31.7%
) 32
31.7%
20
19.8%
t 2
 
2.0%
o 2
 
2.0%
B 2
 
2.0%
& 1
 
1.0%
l 1
 
1.0%
r 1
 
1.0%
n 1
 
1.0%
Other values (7) 7
 
6.9%
Distinct64
Distinct (%)95.5%
Missing0
Missing (%)0.0%
Memory size664.0 B
2024-04-21T16:13:27.492290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length40
Mean length30.313433
Min length22

Characters and Unicode

Total characters2031
Distinct characters96
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique61 ?
Unique (%)91.0%

Sample

1st row부산광역시 연제구 과정로 84, 4층 (연산동)
2nd row부산광역시 연제구 거제대로 295 (거제동)
3rd row부산광역시 연제구 고분로32번길 43 (연산동)
4th row부산광역시 연제구 월드컵대로 20 (연산동)
5th row부산광역시 연제구 신촌로 35-6 (연산동)
ValueCountFrequency (%)
부산광역시 67
16.8%
연제구 67
16.8%
연산동 46
 
11.5%
거제동 15
 
3.8%
1층 11
 
2.8%
2층 9
 
2.3%
지하1층 6
 
1.5%
쌍미천로 5
 
1.3%
거제대로 4
 
1.0%
과정로 4
 
1.0%
Other values (122) 165
41.4%
2024-04-21T16:13:28.962091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
332
 
16.3%
122
 
6.0%
117
 
5.8%
94
 
4.6%
1 74
 
3.6%
70
 
3.4%
) 70
 
3.4%
( 70
 
3.4%
69
 
3.4%
68
 
3.3%
Other values (86) 945
46.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1189
58.5%
Space Separator 332
 
16.3%
Decimal Number 307
 
15.1%
Close Punctuation 70
 
3.4%
Open Punctuation 70
 
3.4%
Other Punctuation 59
 
2.9%
Dash Punctuation 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
122
 
10.3%
117
 
9.8%
94
 
7.9%
70
 
5.9%
69
 
5.8%
68
 
5.7%
68
 
5.7%
67
 
5.6%
67
 
5.6%
67
 
5.6%
Other values (70) 380
32.0%
Decimal Number
ValueCountFrequency (%)
1 74
24.1%
2 55
17.9%
3 33
10.7%
5 29
 
9.4%
4 26
 
8.5%
0 26
 
8.5%
8 24
 
7.8%
7 15
 
4.9%
9 13
 
4.2%
6 12
 
3.9%
Other Punctuation
ValueCountFrequency (%)
, 58
98.3%
. 1
 
1.7%
Space Separator
ValueCountFrequency (%)
332
100.0%
Close Punctuation
ValueCountFrequency (%)
) 70
100.0%
Open Punctuation
ValueCountFrequency (%)
( 70
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1189
58.5%
Common 842
41.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
122
 
10.3%
117
 
9.8%
94
 
7.9%
70
 
5.9%
69
 
5.8%
68
 
5.7%
68
 
5.7%
67
 
5.6%
67
 
5.6%
67
 
5.6%
Other values (70) 380
32.0%
Common
ValueCountFrequency (%)
332
39.4%
1 74
 
8.8%
) 70
 
8.3%
( 70
 
8.3%
, 58
 
6.9%
2 55
 
6.5%
3 33
 
3.9%
5 29
 
3.4%
4 26
 
3.1%
0 26
 
3.1%
Other values (6) 69
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1189
58.5%
ASCII 842
41.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
332
39.4%
1 74
 
8.8%
) 70
 
8.3%
( 70
 
8.3%
, 58
 
6.9%
2 55
 
6.5%
3 33
 
3.9%
5 29
 
3.4%
4 26
 
3.1%
0 26
 
3.1%
Other values (6) 69
 
8.2%
Hangul
ValueCountFrequency (%)
122
 
10.3%
117
 
9.8%
94
 
7.9%
70
 
5.9%
69
 
5.8%
68
 
5.7%
68
 
5.7%
67
 
5.6%
67
 
5.6%
67
 
5.6%
Other values (70) 380
32.0%

소재지전화
Text

MISSING 

Distinct52
Distinct (%)100.0%
Missing15
Missing (%)22.4%
Memory size664.0 B
2024-04-21T16:13:29.798944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.903846
Min length9

Characters and Unicode

Total characters619
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)100.0%

Sample

1st row051-505-2070
2nd row051-850-2022
3rd row051-852-5571
4th row051-861-1525
5th row051-867-6469
ValueCountFrequency (%)
051-809-8090 1
 
1.9%
051-853-7437 1
 
1.9%
051-465-9428 1
 
1.9%
051-929-0166 1
 
1.9%
051-507-9053 1
 
1.9%
051-853-3372 1
 
1.9%
051-853-0412 1
 
1.9%
051-867-2010 1
 
1.9%
051-996-1006 1
 
1.9%
051-861-6822 1
 
1.9%
Other values (42) 42
80.8%
2024-04-21T16:13:30.962126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 102
16.5%
0 94
15.2%
5 93
15.0%
1 91
14.7%
6 42
6.8%
8 40
 
6.5%
7 37
 
6.0%
2 37
 
6.0%
9 32
 
5.2%
4 26
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 517
83.5%
Dash Punctuation 102
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 94
18.2%
5 93
18.0%
1 91
17.6%
6 42
8.1%
8 40
7.7%
7 37
 
7.2%
2 37
 
7.2%
9 32
 
6.2%
4 26
 
5.0%
3 25
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 102
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 619
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 102
16.5%
0 94
15.2%
5 93
15.0%
1 91
14.7%
6 42
6.8%
8 40
 
6.5%
7 37
 
6.0%
2 37
 
6.0%
9 32
 
5.2%
4 26
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 619
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 102
16.5%
0 94
15.2%
5 93
15.0%
1 91
14.7%
6 42
6.8%
8 40
 
6.5%
7 37
 
6.0%
2 37
 
6.0%
9 32
 
5.2%
4 26
 
4.2%

Correlations

2024-04-21T16:13:31.137878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명업소소재지(도로명)소재지전화
업소명1.0001.0001.000
업소소재지(도로명)1.0001.0001.000
소재지전화1.0001.0001.000

Missing values

2024-04-21T16:13:24.025397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T16:13:24.271126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명업소소재지(도로명)소재지전화
0(주)명건부산광역시 연제구 과정로 84, 4층 (연산동)051-505-2070
1부일정보링크(주)부산광역시 연제구 거제대로 295 (거제동)051-850-2022
2(주)목평인력개발부산광역시 연제구 고분로32번길 43 (연산동)051-852-5571
3(주)항도시스템부산광역시 연제구 월드컵대로 20 (연산동)051-861-1525
4바지런토탈크리닝시스템부산광역시 연제구 신촌로 35-6 (연산동)051-867-6469
5롯데환경부산광역시 연제구 해맞이로 71 (거제동)051-501-0118
6(주)다명부산광역시 연제구 법원로 20 (거제동)051-948-2297
7연제지역 자활센터, 청솔환경부산광역시 연제구 쌍미천로 58, 상가동 203,205,206호 (연산동, 연산훼미리타운)051-852-8219
8(주)대한시스템부산광역시 연제구 신촌로 18 (연산동,(4층))051-867-6592
9주식회사 토.요부산광역시 연제구 쌍미천로135번길 22, 2층 일부호 (연산동)051-865-0116
업소명업소소재지(도로명)소재지전화
57부산크린부산광역시 연제구 중앙천로39번길 9, 1층 (연산동)<NA>
58엔오원크린서비스부산광역시 연제구 과정로191번가길 45, 1층 13호 (연산동)051-335-0816
59주식회사 에이치엔피부산광역시 연제구 거제천로255번가길 56, 2층 (거제동)055-316-0877
60(주)정림임업부산광역시 연제구 연미로13번길 20, 3층 (연산동)051-464-5100
61주식회사 에어솔루션(부산지점)부산광역시 연제구 월드컵대로 160, 6층 609호 (연산동)<NA>
62제일청소용역부산광역시 연제구 쌍미천로 39, 2층 (연산동)1600-3735
63모든환경방역 Pest Control & 모든청소용역부산광역시 연제구 마곡천로 18, 1층 (연산동)051-863-1109
64에어굿맨부산광역시 연제구 연수로213번길 17, 1층 (연산동)051-867-3552
65주식회사 씨엠씨종합관리부산광역시 연제구 과정로 137, 삼호라이프레포츠텔 7층 717호 (연산동)051-927-2978
66BBM코리아부산광역시 연제구 중앙대로 1038, 2층 (연산동)<NA>