Overview

Dataset statistics

Number of variables4
Number of observations520
Missing cells276
Missing cells (%)13.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.4 KiB
Average record size in memory32.3 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시 강서구 공중위생업 현황에 대한 데이터로 업종명, 업소명, 업소소재지(도로명) 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15007007/fileData.do

Alerts

소재지전화 has 276 (53.1%) missing valuesMissing

Reproduction

Analysis started2023-12-12 12:54:41.721807
Analysis finished2023-12-12 12:54:42.235568
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct19
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
일반미용업
192 
피부미용업
56 
네일미용업
49 
숙박업(일반)
42 
건물위생관리업
40 
Other values (14)
141 

Length

Max length23
Median length5
Mean length6.3211538
Min length3

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
일반미용업 192
36.9%
피부미용업 56
 
10.8%
네일미용업 49
 
9.4%
숙박업(일반) 42
 
8.1%
건물위생관리업 40
 
7.7%
이용업 31
 
6.0%
세탁업 19
 
3.7%
화장ㆍ분장 미용업 16
 
3.1%
목욕장업 14
 
2.7%
네일미용업, 화장ㆍ분장 미용업 11
 
2.1%
Other values (9) 50
 
9.6%

Length

2023-12-12T21:54:42.322377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반미용업 206
32.6%
피부미용업 88
13.9%
네일미용업 83
13.1%
화장ㆍ분장 49
 
7.8%
미용업 49
 
7.8%
숙박업(일반 42
 
6.6%
건물위생관리업 40
 
6.3%
이용업 31
 
4.9%
세탁업 19
 
3.0%
목욕장업 14
 
2.2%
Distinct511
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T21:54:42.626587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length6.6192308
Min length2

Characters and Unicode

Total characters3442
Distinct characters443
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique503 ?
Unique (%)96.7%

Sample

1st row명지오션시티호텔
2nd row비엠호텔
3rd row제타
4th row호텔프렌치코드
5th row마틴모텔
ValueCountFrequency (%)
명지점 8
 
1.3%
주식회사 8
 
1.3%
에스테틱 6
 
1.0%
헤어 5
 
0.8%
nail 4
 
0.6%
퀸즈헤나 3
 
0.5%
명지국제신도시점 3
 
0.5%
호텔 3
 
0.5%
넘버25호텔 2
 
0.3%
블루클럽 2
 
0.3%
Other values (567) 584
93.0%
2023-12-12T21:54:43.102871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
140
 
4.1%
136
 
4.0%
119
 
3.5%
108
 
3.1%
85
 
2.5%
67
 
1.9%
61
 
1.8%
) 60
 
1.7%
( 60
 
1.7%
55
 
1.6%
Other values (433) 2551
74.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2837
82.4%
Lowercase Letter 167
 
4.9%
Uppercase Letter 144
 
4.2%
Space Separator 108
 
3.1%
Close Punctuation 61
 
1.8%
Open Punctuation 61
 
1.8%
Decimal Number 40
 
1.2%
Other Punctuation 24
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
140
 
4.9%
136
 
4.8%
119
 
4.2%
85
 
3.0%
67
 
2.4%
61
 
2.2%
55
 
1.9%
53
 
1.9%
42
 
1.5%
41
 
1.4%
Other values (372) 2038
71.8%
Uppercase Letter
ValueCountFrequency (%)
A 22
15.3%
H 15
 
10.4%
I 12
 
8.3%
N 12
 
8.3%
R 9
 
6.2%
M 8
 
5.6%
L 8
 
5.6%
E 7
 
4.9%
S 6
 
4.2%
O 6
 
4.2%
Other values (12) 39
27.1%
Lowercase Letter
ValueCountFrequency (%)
o 27
16.2%
a 19
11.4%
l 19
11.4%
n 18
10.8%
i 18
10.8%
e 11
6.6%
s 11
6.6%
r 8
 
4.8%
m 6
 
3.6%
d 5
 
3.0%
Other values (10) 25
15.0%
Decimal Number
ValueCountFrequency (%)
2 10
25.0%
1 9
22.5%
0 6
15.0%
3 5
12.5%
7 3
 
7.5%
5 3
 
7.5%
4 2
 
5.0%
6 1
 
2.5%
8 1
 
2.5%
Other Punctuation
ValueCountFrequency (%)
. 8
33.3%
, 6
25.0%
# 5
20.8%
& 4
16.7%
' 1
 
4.2%
Close Punctuation
ValueCountFrequency (%)
) 60
98.4%
] 1
 
1.6%
Open Punctuation
ValueCountFrequency (%)
( 60
98.4%
[ 1
 
1.6%
Space Separator
ValueCountFrequency (%)
108
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2837
82.4%
Latin 311
 
9.0%
Common 294
 
8.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
140
 
4.9%
136
 
4.8%
119
 
4.2%
85
 
3.0%
67
 
2.4%
61
 
2.2%
55
 
1.9%
53
 
1.9%
42
 
1.5%
41
 
1.4%
Other values (372) 2038
71.8%
Latin
ValueCountFrequency (%)
o 27
 
8.7%
A 22
 
7.1%
a 19
 
6.1%
l 19
 
6.1%
n 18
 
5.8%
i 18
 
5.8%
H 15
 
4.8%
I 12
 
3.9%
N 12
 
3.9%
e 11
 
3.5%
Other values (32) 138
44.4%
Common
ValueCountFrequency (%)
108
36.7%
) 60
20.4%
( 60
20.4%
2 10
 
3.4%
1 9
 
3.1%
. 8
 
2.7%
, 6
 
2.0%
0 6
 
2.0%
3 5
 
1.7%
# 5
 
1.7%
Other values (9) 17
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2837
82.4%
ASCII 605
 
17.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
140
 
4.9%
136
 
4.8%
119
 
4.2%
85
 
3.0%
67
 
2.4%
61
 
2.2%
55
 
1.9%
53
 
1.9%
42
 
1.5%
41
 
1.4%
Other values (372) 2038
71.8%
ASCII
ValueCountFrequency (%)
108
17.9%
) 60
 
9.9%
( 60
 
9.9%
o 27
 
4.5%
A 22
 
3.6%
a 19
 
3.1%
l 19
 
3.1%
n 18
 
3.0%
i 18
 
3.0%
H 15
 
2.5%
Other values (51) 239
39.5%
Distinct510
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T21:54:43.413627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length50
Mean length38.534615
Min length18

Characters and Unicode

Total characters20038
Distinct characters246
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique501 ?
Unique (%)96.3%

Sample

1st row부산광역시 강서구 명지오션시티3로 37 (명지동)
2nd row부산광역시 강서구 르노삼성대로 576-2 (명지동)
3rd row부산광역시 강서구 르노삼성대로 634, A동 (명지동, 러브스토리)
4th row부산광역시 강서구 명지오션시티5로 8 (명지동)
5th row부산광역시 강서구 르노삼성대로 600 (명지동)
ValueCountFrequency (%)
부산광역시 520
 
14.0%
강서구 520
 
14.0%
명지동 330
 
8.9%
1층 136
 
3.7%
일부호 66
 
1.8%
명지국제8로 59
 
1.6%
신호동 52
 
1.4%
일부 42
 
1.1%
명지국제2로 36
 
1.0%
2층 35
 
0.9%
Other values (697) 1911
51.6%
2023-12-12T21:54:43.862316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3187
 
15.9%
1 923
 
4.6%
794
 
4.0%
729
 
3.6%
696
 
3.5%
677
 
3.4%
2 665
 
3.3%
661
 
3.3%
652
 
3.3%
, 574
 
2.9%
Other values (236) 10480
52.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11538
57.6%
Decimal Number 3506
 
17.5%
Space Separator 3187
 
15.9%
Other Punctuation 576
 
2.9%
Close Punctuation 526
 
2.6%
Open Punctuation 526
 
2.6%
Dash Punctuation 97
 
0.5%
Uppercase Letter 63
 
0.3%
Lowercase Letter 19
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
794
 
6.9%
729
 
6.3%
696
 
6.0%
677
 
5.9%
661
 
5.7%
652
 
5.7%
542
 
4.7%
526
 
4.6%
524
 
4.5%
522
 
4.5%
Other values (212) 5215
45.2%
Decimal Number
ValueCountFrequency (%)
1 923
26.3%
2 665
19.0%
3 370
10.6%
0 353
 
10.1%
4 244
 
7.0%
8 222
 
6.3%
5 216
 
6.2%
6 200
 
5.7%
7 186
 
5.3%
9 127
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
B 26
41.3%
C 11
17.5%
S 9
 
14.3%
A 8
 
12.7%
M 4
 
6.3%
D 4
 
6.3%
G 1
 
1.6%
Other Punctuation
ValueCountFrequency (%)
, 574
99.7%
@ 2
 
0.3%
Space Separator
ValueCountFrequency (%)
3187
100.0%
Close Punctuation
ValueCountFrequency (%)
) 526
100.0%
Open Punctuation
ValueCountFrequency (%)
( 526
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 97
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11538
57.6%
Common 8418
42.0%
Latin 82
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
794
 
6.9%
729
 
6.3%
696
 
6.0%
677
 
5.9%
661
 
5.7%
652
 
5.7%
542
 
4.7%
526
 
4.6%
524
 
4.5%
522
 
4.5%
Other values (212) 5215
45.2%
Common
ValueCountFrequency (%)
3187
37.9%
1 923
 
11.0%
2 665
 
7.9%
, 574
 
6.8%
) 526
 
6.2%
( 526
 
6.2%
3 370
 
4.4%
0 353
 
4.2%
4 244
 
2.9%
8 222
 
2.6%
Other values (6) 828
 
9.8%
Latin
ValueCountFrequency (%)
B 26
31.7%
e 19
23.2%
C 11
13.4%
S 9
 
11.0%
A 8
 
9.8%
M 4
 
4.9%
D 4
 
4.9%
G 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11538
57.6%
ASCII 8500
42.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3187
37.5%
1 923
 
10.9%
2 665
 
7.8%
, 574
 
6.8%
) 526
 
6.2%
( 526
 
6.2%
3 370
 
4.4%
0 353
 
4.2%
4 244
 
2.9%
8 222
 
2.6%
Other values (14) 910
 
10.7%
Hangul
ValueCountFrequency (%)
794
 
6.9%
729
 
6.3%
696
 
6.0%
677
 
5.9%
661
 
5.7%
652
 
5.7%
542
 
4.7%
526
 
4.6%
524
 
4.5%
522
 
4.5%
Other values (212) 5215
45.2%

소재지전화
Text

MISSING 

Distinct242
Distinct (%)99.2%
Missing276
Missing (%)53.1%
Memory size4.2 KiB
2023-12-12T21:54:44.207635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.02459
Min length12

Characters and Unicode

Total characters2934
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique240 ?
Unique (%)98.4%

Sample

1st row051-271-0023
2nd row051-271-1511
3rd row051-271-3677
4th row051-271-2563
5th row051-717-3872
ValueCountFrequency (%)
051-971-0194 2
 
0.8%
051-271-2696 2
 
0.8%
051-978-0607 1
 
0.4%
051-972-1713 1
 
0.4%
051-271-0023 1
 
0.4%
051-208-7979 1
 
0.4%
070-7787-2020 1
 
0.4%
051-972-5896 1
 
0.4%
0704-6449-064 1
 
0.4%
051-913-2553 1
 
0.4%
Other values (232) 232
95.1%
2023-12-12T21:54:44.651258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 488
16.6%
1 463
15.8%
0 441
15.0%
5 355
12.1%
7 248
8.5%
2 244
8.3%
9 190
 
6.5%
3 160
 
5.5%
8 129
 
4.4%
6 121
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2446
83.4%
Dash Punctuation 488
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 463
18.9%
0 441
18.0%
5 355
14.5%
7 248
10.1%
2 244
10.0%
9 190
7.8%
3 160
 
6.5%
8 129
 
5.3%
6 121
 
4.9%
4 95
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 488
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2934
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 488
16.6%
1 463
15.8%
0 441
15.0%
5 355
12.1%
7 248
8.5%
2 244
8.3%
9 190
 
6.5%
3 160
 
5.5%
8 129
 
4.4%
6 121
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2934
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 488
16.6%
1 463
15.8%
0 441
15.0%
5 355
12.1%
7 248
8.5%
2 244
8.3%
9 190
 
6.5%
3 160
 
5.5%
8 129
 
4.4%
6 121
 
4.1%

Missing values

2023-12-12T21:54:42.109530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:54:42.202790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)소재지전화
0숙박업(일반)명지오션시티호텔부산광역시 강서구 명지오션시티3로 37 (명지동)051-271-0023
1숙박업(일반)비엠호텔부산광역시 강서구 르노삼성대로 576-2 (명지동)051-271-1511
2숙박업(일반)제타부산광역시 강서구 르노삼성대로 634, A동 (명지동, 러브스토리)051-271-3677
3숙박업(일반)호텔프렌치코드부산광역시 강서구 명지오션시티5로 8 (명지동)051-271-2563
4숙박업(일반)마틴모텔부산광역시 강서구 르노삼성대로 600 (명지동)051-717-3872
5숙박업(일반)김해공항 브라운도트 명지점부산광역시 강서구 르노삼성대로 576-6 (명지동)051-271-3777
6숙박업(일반)디자인팝호텔부산광역시 강서구 르노삼성대로 576-4 (명지동)051-271-2001
7숙박업(일반)사이버모텔부산광역시 강서구 명지오션시티5로 18 (명지동)051-271-4411
8숙박업(일반)마르부산광역시 강서구 르노삼성대로 634, B동 (명지동)051-271-1907
9숙박업(일반)더블유(W)모텔부산광역시 강서구 르노삼성대로 598 (명지동)051-271-1119
업종명업소명영업소 주소(도로명)소재지전화
510피부미용업, 네일미용업, 화장ㆍ분장 미용업무드네일(mood nail)부산광역시 강서구 명지국제7로 140, 상가3동 107호 (명지동, 더 힐 시그니처)<NA>
511피부미용업, 네일미용업, 화장ㆍ분장 미용업까사벨르부산광역시 강서구 공항진입로 108, 2층 (대저2동, 국제선청사)051-972-3331
512피부미용업, 네일미용업, 화장ㆍ분장 미용업더끌림부산광역시 강서구 명지오션시티12로 10, 상가동 111호 (명지동, 엘크루솔마레)<NA>
513피부미용업, 네일미용업, 화장ㆍ분장 미용업디디네일(DD네일)부산광역시 강서구 명지오션시티12로 120, C4상가동 106호 (명지동)051-203-0930
514피부미용업, 네일미용업, 화장ㆍ분장 미용업뷰티인졸리부산광역시 강서구 명지국제8로 229, 부산명지지구아이메디컬 305호 (명지동)<NA>
515피부미용업, 네일미용업, 화장ㆍ분장 미용업아띠[네일,속눈썹,왁싱]뷰티부산광역시 강서구 명지국제6로318번길 14-4, 1층 일부 (명지동)<NA>
516피부미용업, 네일미용업, 화장ㆍ분장 미용업네일묘안부산광역시 강서구 명지국제2로 41, 203호 (명지동, 더샵 명지퍼스트월드 3단지)<NA>
517피부미용업, 네일미용업, 화장ㆍ분장 미용업네일케이부산광역시 강서구 명지국제11로 47, 1층 일부 (명지동)<NA>
518피부미용업, 네일미용업, 화장ㆍ분장 미용업프롬마레뷰티부산광역시 강서구 명지국제8로10번길 38, 7층 714호 (명지동)<NA>
519피부미용업, 네일미용업, 화장ㆍ분장 미용업유네일부산광역시 강서구 명지국제5로148번길 2, 1층 일부 (명지동)<NA>