Overview

Dataset statistics

Number of variables5
Number of observations62
Missing cells25
Missing cells (%)8.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory42.1 B

Variable types

Categorical2
Text3

Dataset

Description부산광역시_사하구_건물위생관리업현황_20230809
Author부산광역시 사하구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3079297

Alerts

업종명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
소재지전화 has 25 (40.3%) missing valuesMissing
업소명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:39:28.770614
Analysis finished2023-12-10 16:39:29.435493
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size628.0 B
건물위생관리업
62 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건물위생관리업
2nd row건물위생관리업
3rd row건물위생관리업
4th row건물위생관리업
5th row건물위생관리업

Common Values

ValueCountFrequency (%)
건물위생관리업 62
100.0%

Length

2023-12-11T01:39:29.580151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:39:29.734705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건물위생관리업 62
100.0%

업소명
Text

UNIQUE 

Distinct62
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size628.0 B
2023-12-11T01:39:29.978818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length11
Mean length6.7580645
Min length2

Characters and Unicode

Total characters419
Distinct characters145
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique62 ?
Unique (%)100.0%

Sample

1st row(주)동아에스엔씨
2nd row(주)대한크린에어
3rd row(주)대산기업
4th row(주)대성공사
5th row(주)명가종합관리
ValueCountFrequency (%)
주식회사 4
 
5.1%
그린 2
 
2.5%
주)동아에스엔씨 1
 
1.3%
주)도시농부 1
 
1.3%
주)대한크린에어 1
 
1.3%
주)내맘같이 1
 
1.3%
한국시스템에듀 1
 
1.3%
동아종합관리 1
 
1.3%
서부산방역 1
 
1.3%
조은관리시스템 1
 
1.3%
Other values (65) 65
82.3%
2023-12-11T01:39:30.418710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23
 
5.5%
( 21
 
5.0%
) 21
 
5.0%
17
 
4.1%
15
 
3.6%
12
 
2.9%
11
 
2.6%
10
 
2.4%
8
 
1.9%
8
 
1.9%
Other values (135) 273
65.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 351
83.8%
Open Punctuation 21
 
5.0%
Close Punctuation 21
 
5.0%
Space Separator 17
 
4.1%
Uppercase Letter 6
 
1.4%
Other Punctuation 2
 
0.5%
Decimal Number 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
6.6%
15
 
4.3%
12
 
3.4%
11
 
3.1%
10
 
2.8%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
7
 
2.0%
Other values (124) 242
68.9%
Uppercase Letter
ValueCountFrequency (%)
K 1
16.7%
H 1
16.7%
D 1
16.7%
M 1
16.7%
B 1
16.7%
C 1
16.7%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Space Separator
ValueCountFrequency (%)
17
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Decimal Number
ValueCountFrequency (%)
9 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 351
83.8%
Common 62
 
14.8%
Latin 6
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
6.6%
15
 
4.3%
12
 
3.4%
11
 
3.1%
10
 
2.8%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
7
 
2.0%
Other values (124) 242
68.9%
Latin
ValueCountFrequency (%)
K 1
16.7%
H 1
16.7%
D 1
16.7%
M 1
16.7%
B 1
16.7%
C 1
16.7%
Common
ValueCountFrequency (%)
( 21
33.9%
) 21
33.9%
17
27.4%
. 2
 
3.2%
9 1
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 351
83.8%
ASCII 68
 
16.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
23
 
6.6%
15
 
4.3%
12
 
3.4%
11
 
3.1%
10
 
2.8%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
7
 
2.0%
Other values (124) 242
68.9%
ASCII
ValueCountFrequency (%)
( 21
30.9%
) 21
30.9%
17
25.0%
. 2
 
2.9%
9 1
 
1.5%
K 1
 
1.5%
H 1
 
1.5%
D 1
 
1.5%
M 1
 
1.5%
B 1
 
1.5%
Distinct61
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size628.0 B
2023-12-11T01:39:30.752571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length45
Mean length35.967742
Min length22

Characters and Unicode

Total characters2230
Distinct characters116
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique60 ?
Unique (%)96.8%

Sample

1st row부산광역시 사하구 다대로 109 (신평동,한성기린A상가304호)
2nd row부산광역시 사하구 승학로 123-4, 6동 2층 202호 (당리동, 신익아파트 상가동)
3rd row부산광역시 사하구 동매로 23, 3층 (하단동)
4th row부산광역시 사하구 장평로 286-1 (신평동)
5th row부산광역시 사하구 낙동남로 1348-5 (하단동,1,2층)
ValueCountFrequency (%)
부산광역시 62
 
14.6%
사하구 62
 
14.6%
하단동 17
 
4.0%
2층 12
 
2.8%
당리동 11
 
2.6%
상가동 9
 
2.1%
낙동대로 8
 
1.9%
3층 8
 
1.9%
괴정동 7
 
1.6%
신평동 7
 
1.6%
Other values (147) 223
52.3%
2023-12-11T01:39:31.312584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
364
 
16.3%
105
 
4.7%
100
 
4.5%
1 80
 
3.6%
, 75
 
3.4%
2 67
 
3.0%
65
 
2.9%
65
 
2.9%
64
 
2.9%
63
 
2.8%
Other values (106) 1182
53.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1295
58.1%
Space Separator 364
 
16.3%
Decimal Number 359
 
16.1%
Other Punctuation 75
 
3.4%
Close Punctuation 63
 
2.8%
Open Punctuation 63
 
2.8%
Dash Punctuation 7
 
0.3%
Uppercase Letter 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
105
 
8.1%
100
 
7.7%
65
 
5.0%
65
 
5.0%
64
 
4.9%
63
 
4.9%
62
 
4.8%
62
 
4.8%
62
 
4.8%
58
 
4.5%
Other values (89) 589
45.5%
Decimal Number
ValueCountFrequency (%)
1 80
22.3%
2 67
18.7%
3 51
14.2%
0 47
13.1%
5 28
 
7.8%
4 28
 
7.8%
6 16
 
4.5%
9 15
 
4.2%
7 14
 
3.9%
8 13
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
A 3
75.0%
B 1
 
25.0%
Space Separator
ValueCountFrequency (%)
364
100.0%
Other Punctuation
ValueCountFrequency (%)
, 75
100.0%
Close Punctuation
ValueCountFrequency (%)
) 63
100.0%
Open Punctuation
ValueCountFrequency (%)
( 63
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1295
58.1%
Common 931
41.7%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
105
 
8.1%
100
 
7.7%
65
 
5.0%
65
 
5.0%
64
 
4.9%
63
 
4.9%
62
 
4.8%
62
 
4.8%
62
 
4.8%
58
 
4.5%
Other values (89) 589
45.5%
Common
ValueCountFrequency (%)
364
39.1%
1 80
 
8.6%
, 75
 
8.1%
2 67
 
7.2%
) 63
 
6.8%
( 63
 
6.8%
3 51
 
5.5%
0 47
 
5.0%
5 28
 
3.0%
4 28
 
3.0%
Other values (5) 65
 
7.0%
Latin
ValueCountFrequency (%)
A 3
75.0%
B 1
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1295
58.1%
ASCII 935
41.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
364
38.9%
1 80
 
8.6%
, 75
 
8.0%
2 67
 
7.2%
) 63
 
6.7%
( 63
 
6.7%
3 51
 
5.5%
0 47
 
5.0%
5 28
 
3.0%
4 28
 
3.0%
Other values (7) 69
 
7.4%
Hangul
ValueCountFrequency (%)
105
 
8.1%
100
 
7.7%
65
 
5.0%
65
 
5.0%
64
 
4.9%
63
 
4.9%
62
 
4.8%
62
 
4.8%
62
 
4.8%
58
 
4.5%
Other values (89) 589
45.5%

소재지전화
Text

MISSING 

Distinct35
Distinct (%)94.6%
Missing25
Missing (%)40.3%
Memory size628.0 B
2023-12-11T01:39:31.567005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.864865
Min length9

Characters and Unicode

Total characters439
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)91.9%

Sample

1st row051-203-7983
2nd row051-292-1101
3rd row051-203-2100
4th row051-241-5858
5th row051-207-8953
ValueCountFrequency (%)
051-263-6697 3
 
8.1%
051-205-2411 1
 
2.7%
051-203-7983 1
 
2.7%
051-202-9336 1
 
2.7%
051-208-3288 1
 
2.7%
051-255-3280 1
 
2.7%
051-989-7606 1
 
2.7%
1688-4749 1
 
2.7%
051-710-7062 1
 
2.7%
051-206-1069 1
 
2.7%
Other values (25) 25
67.6%
2023-12-11T01:39:32.053602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 72
16.4%
0 70
15.9%
1 63
14.4%
5 60
13.7%
2 48
10.9%
6 35
8.0%
9 23
 
5.2%
8 22
 
5.0%
3 17
 
3.9%
7 16
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 367
83.6%
Dash Punctuation 72
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 70
19.1%
1 63
17.2%
5 60
16.3%
2 48
13.1%
6 35
9.5%
9 23
 
6.3%
8 22
 
6.0%
3 17
 
4.6%
7 16
 
4.4%
4 13
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 72
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 439
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 72
16.4%
0 70
15.9%
1 63
14.4%
5 60
13.7%
2 48
10.9%
6 35
8.0%
9 23
 
5.2%
8 22
 
5.0%
3 17
 
3.9%
7 16
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 439
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 72
16.4%
0 70
15.9%
1 63
14.4%
5 60
13.7%
2 48
10.9%
6 35
8.0%
9 23
 
5.2%
8 22
 
5.0%
3 17
 
3.9%
7 16
 
3.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size628.0 B
2023-08-09
62 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-09
2nd row2023-08-09
3rd row2023-08-09
4th row2023-08-09
5th row2023-08-09

Common Values

ValueCountFrequency (%)
2023-08-09 62
100.0%

Length

2023-12-11T01:39:32.266525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:39:32.428558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-09 62
100.0%

Correlations

2023-12-11T01:39:32.517448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명영업소 주소(도로명)소재지전화
업소명1.0001.0001.000
영업소 주소(도로명)1.0001.0001.000
소재지전화1.0001.0001.000

Missing values

2023-12-11T01:39:29.195318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:39:29.375575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)소재지전화데이터기준일자
0건물위생관리업(주)동아에스엔씨부산광역시 사하구 다대로 109 (신평동,한성기린A상가304호)051-203-79832023-08-09
1건물위생관리업(주)대한크린에어부산광역시 사하구 승학로 123-4, 6동 2층 202호 (당리동, 신익아파트 상가동)051-292-11012023-08-09
2건물위생관리업(주)대산기업부산광역시 사하구 동매로 23, 3층 (하단동)051-203-21002023-08-09
3건물위생관리업(주)대성공사부산광역시 사하구 장평로 286-1 (신평동)051-241-58582023-08-09
4건물위생관리업(주)명가종합관리부산광역시 사하구 낙동남로 1348-5 (하단동,1,2층)051-207-89532023-08-09
5건물위생관리업부일환경산업부산광역시 사하구 낙동대로 106 (괴정동,현대상가 2층)051-206-65442023-08-09
6건물위생관리업(주)경성종합관리부산광역시 사하구 괴정로 111-1 (당리동)051-205-59112023-08-09
7건물위생관리업주식회사승학환경부산광역시 사하구 낙동대로 542, 2층 209-A호 (하단동, 대우에덴프라자)051-265-45762023-08-09
8건물위생관리업그린산업부산광역시 사하구 장평로299번길 7, 2층 (신평동)<NA>2023-08-09
9건물위생관리업(주)신성부산광역시 사하구 다대로 109, 상가동 202호 (신평동,한성기린임호아파트)051-206-89002023-08-09
업종명업소명영업소 주소(도로명)소재지전화데이터기준일자
52건물위생관리업우리집 케어부산광역시 사하구 승학로 123-4, 6동 203호 (당리동, 당리신익아파트)<NA>2023-08-09
53건물위생관리업주식회사 프로텍가드부산광역시 사하구 사하로 107, 2층 (감천동)<NA>2023-08-09
54건물위생관리업9K 클린부산광역시 사하구 승학로131번길 1, 당리동 1층 (당리동)<NA>2023-08-09
55건물위생관리업녹색환경 클린케어부산광역시 사하구 감내1로175번길 27, 1층 (감천동)<NA>2023-08-09
56건물위생관리업나일상사부산광역시 사하구 사하로 47, 상가동 3층 305호 (구평동, 구평화신아파트)<NA>2023-08-09
57건물위생관리업유크린부산광역시 사하구 하신중앙로 337, 5층 504호 (하단동, 충우드림라이프)<NA>2023-08-09
58건물위생관리업바른클린솔루션부산광역시 사하구 감천로139번길 18, 지하1층 (감천동)1544-04932023-08-09
59건물위생관리업엣지 컴퍼니부산광역시 사하구 다대로130번길 76, 1층 (신평동)<NA>2023-08-09
60건물위생관리업클린업부산광역시 사하구 다대낙조2길 15, 2층 202호 (다대동)051-263-66972023-08-09
61건물위생관리업사단법인 누리복지회 사업부부산광역시 사하구 다대로 256, 1층 (장림동)<NA>2023-08-09