Overview

Dataset statistics

Number of variables3
Number of observations329
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)0.6%
Total size in memory7.8 KiB
Average record size in memory24.4 B

Variable types

Text2
Categorical1

Dataset

Description감염병의 예반 및 관리에 관한 법률에 의거하여 주기적인 소독을 실시해햐하는 부산광역시 서구 소독의무시설 현황입니다.
Author부산광역시 서구
URLhttps://www.data.go.kr/data/15127170/fileData.do

Alerts

Dataset has 2 (0.6%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-23 05:52:04.313734
Analysis finished2024-03-23 05:52:04.769085
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct301
Distinct (%)91.5%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2024-03-23T14:52:05.052465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length22
Mean length8.0455927
Min length3

Characters and Unicode

Total characters2647
Distinct characters359
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique273 ?
Unique (%)83.0%

Sample

1st row부산비치관광호텔
2nd row유엔송도호텔(뉴엔송도호텔)
3rd row페어필드 바이 메리어트 부산 송도비치호텔
4th row윈덤그랜드부산
5th row이터널블루
ValueCountFrequency (%)
어린이집 12
 
2.6%
집단급식소 9
 
2.0%
호텔 7
 
1.5%
부산대학교병원 5
 
1.1%
지하철역사 3
 
0.7%
주)풀무원푸드앤컬처 3
 
0.7%
건물 3
 
0.7%
푸디스트(주 3
 
0.7%
모텔 3
 
0.7%
동아대학교병원 3
 
0.7%
Other values (350) 402
88.7%
2024-03-23T14:52:05.590209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
124
 
4.7%
76
 
2.9%
70
 
2.6%
67
 
2.5%
63
 
2.4%
52
 
2.0%
51
 
1.9%
50
 
1.9%
49
 
1.9%
47
 
1.8%
Other values (349) 1998
75.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2397
90.6%
Space Separator 124
 
4.7%
Uppercase Letter 30
 
1.1%
Open Punctuation 27
 
1.0%
Close Punctuation 27
 
1.0%
Decimal Number 26
 
1.0%
Lowercase Letter 9
 
0.3%
Other Punctuation 5
 
0.2%
Other Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
76
 
3.2%
70
 
2.9%
67
 
2.8%
63
 
2.6%
52
 
2.2%
51
 
2.1%
50
 
2.1%
49
 
2.0%
47
 
2.0%
45
 
1.9%
Other values (310) 1827
76.2%
Uppercase Letter
ValueCountFrequency (%)
B 5
16.7%
A 3
10.0%
S 3
10.0%
G 3
10.0%
K 2
 
6.7%
H 2
 
6.7%
M 2
 
6.7%
T 2
 
6.7%
C 1
 
3.3%
J 1
 
3.3%
Other values (6) 6
20.0%
Decimal Number
ValueCountFrequency (%)
2 6
23.1%
1 5
19.2%
9 4
15.4%
5 3
11.5%
3 3
11.5%
6 2
 
7.7%
7 1
 
3.8%
4 1
 
3.8%
0 1
 
3.8%
Lowercase Letter
ValueCountFrequency (%)
e 2
22.2%
i 1
11.1%
o 1
11.1%
x 1
11.1%
h 1
11.1%
r 1
11.1%
g 1
11.1%
d 1
11.1%
Other Punctuation
ValueCountFrequency (%)
. 3
60.0%
, 2
40.0%
Space Separator
ValueCountFrequency (%)
124
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2399
90.6%
Common 209
 
7.9%
Latin 39
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
76
 
3.2%
70
 
2.9%
67
 
2.8%
63
 
2.6%
52
 
2.2%
51
 
2.1%
50
 
2.1%
49
 
2.0%
47
 
2.0%
45
 
1.9%
Other values (311) 1829
76.2%
Latin
ValueCountFrequency (%)
B 5
 
12.8%
A 3
 
7.7%
S 3
 
7.7%
G 3
 
7.7%
K 2
 
5.1%
H 2
 
5.1%
M 2
 
5.1%
T 2
 
5.1%
e 2
 
5.1%
i 1
 
2.6%
Other values (14) 14
35.9%
Common
ValueCountFrequency (%)
124
59.3%
( 27
 
12.9%
) 27
 
12.9%
2 6
 
2.9%
1 5
 
2.4%
9 4
 
1.9%
5 3
 
1.4%
3 3
 
1.4%
. 3
 
1.4%
6 2
 
1.0%
Other values (4) 5
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2397
90.6%
ASCII 248
 
9.4%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
124
50.0%
( 27
 
10.9%
) 27
 
10.9%
2 6
 
2.4%
B 5
 
2.0%
1 5
 
2.0%
9 4
 
1.6%
A 3
 
1.2%
5 3
 
1.2%
S 3
 
1.2%
Other values (28) 41
 
16.5%
Hangul
ValueCountFrequency (%)
76
 
3.2%
70
 
2.9%
67
 
2.8%
63
 
2.6%
52
 
2.2%
51
 
2.1%
50
 
2.1%
49
 
2.0%
47
 
2.0%
45
 
1.9%
Other values (310) 1827
76.2%
None
ValueCountFrequency (%)
2
100.0%

주소
Text

Distinct322
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2024-03-23T14:52:05.986592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length40
Mean length23.179331
Min length6

Characters and Unicode

Total characters7626
Distinct characters159
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique318 ?
Unique (%)96.7%

Sample

1st row부산광역시 서구 등대로 113
2nd row부산광역시 서구 충무대로 16
3rd row부산광역시 서구 송도해변로 113(암남동)
4th row부산광역시 서구 등대로 27
5th row부산광역시 서구 송도해변로 27, 4층
ValueCountFrequency (%)
서구 284
 
19.3%
부산광역시 255
 
17.3%
암남동 85
 
5.8%
송도해변로 25
 
1.7%
서대신동3가 22
 
1.5%
남부민동 20
 
1.4%
암남공원로 17
 
1.2%
충무동1가 17
 
1.2%
구덕로 14
 
1.0%
외1필지 12
 
0.8%
Other values (431) 719
48.9%
2024-03-23T14:52:06.566309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1154
 
15.1%
364
 
4.8%
339
 
4.4%
335
 
4.4%
1 333
 
4.4%
315
 
4.1%
272
 
3.6%
270
 
3.5%
262
 
3.4%
262
 
3.4%
Other values (149) 3720
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4519
59.3%
Decimal Number 1345
 
17.6%
Space Separator 1154
 
15.1%
Open Punctuation 211
 
2.8%
Close Punctuation 210
 
2.8%
Dash Punctuation 112
 
1.5%
Other Punctuation 67
 
0.9%
Math Symbol 4
 
0.1%
Uppercase Letter 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
364
 
8.1%
339
 
7.5%
335
 
7.4%
315
 
7.0%
272
 
6.0%
270
 
6.0%
262
 
5.8%
262
 
5.8%
223
 
4.9%
180
 
4.0%
Other values (131) 1697
37.6%
Decimal Number
ValueCountFrequency (%)
1 333
24.8%
2 247
18.4%
3 226
16.8%
4 95
 
7.1%
5 91
 
6.8%
7 81
 
6.0%
6 79
 
5.9%
0 66
 
4.9%
9 64
 
4.8%
8 63
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
B 3
75.0%
A 1
 
25.0%
Space Separator
ValueCountFrequency (%)
1154
100.0%
Open Punctuation
ValueCountFrequency (%)
( 211
100.0%
Close Punctuation
ValueCountFrequency (%)
) 210
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 112
100.0%
Other Punctuation
ValueCountFrequency (%)
, 67
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4519
59.3%
Common 3103
40.7%
Latin 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
364
 
8.1%
339
 
7.5%
335
 
7.4%
315
 
7.0%
272
 
6.0%
270
 
6.0%
262
 
5.8%
262
 
5.8%
223
 
4.9%
180
 
4.0%
Other values (131) 1697
37.6%
Common
ValueCountFrequency (%)
1154
37.2%
1 333
 
10.7%
2 247
 
8.0%
3 226
 
7.3%
( 211
 
6.8%
) 210
 
6.8%
- 112
 
3.6%
4 95
 
3.1%
5 91
 
2.9%
7 81
 
2.6%
Other values (6) 343
 
11.1%
Latin
ValueCountFrequency (%)
B 3
75.0%
A 1
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4519
59.3%
ASCII 3107
40.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1154
37.1%
1 333
 
10.7%
2 247
 
7.9%
3 226
 
7.3%
( 211
 
6.8%
) 210
 
6.8%
- 112
 
3.6%
4 95
 
3.1%
5 91
 
2.9%
7 81
 
2.6%
Other values (8) 347
 
11.2%
Hangul
ValueCountFrequency (%)
364
 
8.1%
339
 
7.5%
335
 
7.4%
315
 
7.0%
272
 
6.0%
270
 
6.0%
262
 
5.8%
262
 
5.8%
223
 
4.9%
180
 
4.0%
Other values (131) 1697
37.6%

유형
Categorical

Distinct13
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
사무실용 및 복합용도 건축물
101 
집단급식소
52 
숙박업
42 
식품접객업
34 
학교
24 
Other values (8)
76 

Length

Max length15
Median length10
Mean length7.6595745
Min length2

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row관광숙박업
2nd row관광숙박업
3rd row관광숙박업
4th row관광숙박업
5th row관광숙박업

Common Values

ValueCountFrequency (%)
사무실용 및 복합용도 건축물 101
30.7%
집단급식소 52
15.8%
숙박업 42
12.8%
식품접객업 34
 
10.3%
학교 24
 
7.3%
어린이집및 유치원 19
 
5.8%
공동주택 16
 
4.9%
병원 14
 
4.3%
전통시장 10
 
3.0%
위탁급식소 7
 
2.1%
Other values (3) 10
 
3.0%

Length

2024-03-23T14:52:06.745366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
사무실용 101
15.5%
복합용도 101
15.5%
건축물 101
15.5%
101
15.5%
집단급식소 52
8.0%
숙박업 42
6.5%
식품접객업 34
 
5.2%
학교 24
 
3.7%
유치원 19
 
2.9%
어린이집및 19
 
2.9%
Other values (7) 57
8.8%

Missing values

2024-03-23T14:52:04.637634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T14:52:04.726552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

대상시설주소유형
0부산비치관광호텔부산광역시 서구 등대로 113관광숙박업
1유엔송도호텔(뉴엔송도호텔)부산광역시 서구 충무대로 16관광숙박업
2페어필드 바이 메리어트 부산 송도비치호텔부산광역시 서구 송도해변로 113(암남동)관광숙박업
3윈덤그랜드부산부산광역시 서구 등대로 27관광숙박업
4이터널블루부산광역시 서구 송도해변로 27, 4층관광숙박업
5부산장모텔부산광역시 서구 충무대로255번길 10 (충무동3가)숙박업
6S모텔부산광역시 서구 송도해변로 123 (암남동)숙박업
7호텔더메이부산광역시 서구 충무대로 14-13 (암남동)숙박업
8부산장부산광역시 서구 충무대로241번길 7 (남부민동)숙박업
9금강모텔부산광역시 서구 충무시장길 24 (충무동1가)숙박업
대상시설주소유형
319풀리페아파트해돋이로23공동주택
320대신푸르지오2차고운들로 181공동주택
321대신푸르지오대영로 38번길11공동주택
322현대힐스테이트 이진베이시티송도해변로 192공동주택
323대림비치아파트충무대로 133공동주택
324대신더샵꽃마을로 48공동주택
325토성동 경동리인타워보수대로 27공동주택
326대신공원한신휴플러스보수대로 284공동주택
327협성르네상스대티로 159공동주택
328경동 포레스트힐 행복주택 아미부산 서구 옥천로173번길 29(아미동2가)공동주택

Duplicate rows

Most frequently occurring

대상시설주소유형# duplicates
0그랩 디 오션 송도부산광역시 서구 송도해변로 97 (암남동)숙박업2
1삼성보안공사 건물부산광역시 서구 부민동1가 17-1 외1필지사무실용 및 복합용도 건축물2