Overview

Dataset statistics

Number of variables4
Number of observations2784
Missing cells1425
Missing cells (%)12.8%
Duplicate rows2
Duplicate rows (%)0.1%
Total size in memory87.1 KiB
Average record size in memory32.0 B

Variable types

Text3
Categorical1

Dataset

Description제주특별자치도에 소재하고 있는 미용업과 관련한 데이터로 업종명(네일미용, 피부미용,일반미용), 업소명, 소재지, 전화번호 등의 정보를 제공합니다.
Author제주특별자치도
URLhttps://www.data.go.kr/data/15056146/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 2 (0.1%) duplicate rowsDuplicates
전화번호 has 1421 (51.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 15:43:56.129966
Analysis finished2023-12-12 15:43:57.170177
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

이름
Text

Distinct2746
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size21.9 KiB
2023-12-13T00:43:57.562941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length23
Mean length5.3383621
Min length1

Characters and Unicode

Total characters14862
Distinct characters680
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2710 ?
Unique (%)97.3%

Sample

1st row채안나뷰티살롱
2nd row우리
3rd row국제미용실
4th row금천
5th row화신
ValueCountFrequency (%)
헤어 8
 
0.3%
스킨케어 7
 
0.2%
아라점 5
 
0.2%
미용실 5
 
0.2%
아로마 5
 
0.2%
네일 5
 
0.2%
beauty 4
 
0.1%
태후사랑 4
 
0.1%
중앙미용실 3
 
0.1%
피부관리 3
 
0.1%
Other values (2813) 2882
98.3%
2023-12-13T00:43:58.194916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
917
 
6.2%
777
 
5.2%
442
 
3.0%
415
 
2.8%
412
 
2.8%
354
 
2.4%
330
 
2.2%
315
 
2.1%
314
 
2.1%
237
 
1.6%
Other values (670) 10349
69.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14212
95.6%
Lowercase Letter 187
 
1.3%
Space Separator 161
 
1.1%
Uppercase Letter 112
 
0.8%
Decimal Number 77
 
0.5%
Close Punctuation 50
 
0.3%
Open Punctuation 50
 
0.3%
Other Punctuation 11
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
917
 
6.5%
777
 
5.5%
442
 
3.1%
415
 
2.9%
412
 
2.9%
354
 
2.5%
330
 
2.3%
315
 
2.2%
314
 
2.2%
237
 
1.7%
Other values (608) 9699
68.2%
Uppercase Letter
ValueCountFrequency (%)
A 14
 
12.5%
S 9
 
8.0%
O 8
 
7.1%
U 8
 
7.1%
N 7
 
6.2%
R 6
 
5.4%
J 6
 
5.4%
Y 6
 
5.4%
P 6
 
5.4%
E 6
 
5.4%
Other values (12) 36
32.1%
Lowercase Letter
ValueCountFrequency (%)
e 25
13.4%
a 22
11.8%
s 17
 
9.1%
l 17
 
9.1%
i 11
 
5.9%
o 11
 
5.9%
n 10
 
5.3%
t 10
 
5.3%
y 9
 
4.8%
r 9
 
4.8%
Other values (11) 46
24.6%
Decimal Number
ValueCountFrequency (%)
2 22
28.6%
1 13
16.9%
0 12
15.6%
9 10
13.0%
4 7
 
9.1%
8 4
 
5.2%
5 4
 
5.2%
3 3
 
3.9%
7 1
 
1.3%
6 1
 
1.3%
Other Punctuation
ValueCountFrequency (%)
& 5
45.5%
# 2
 
18.2%
: 2
 
18.2%
' 1
 
9.1%
. 1
 
9.1%
Space Separator
ValueCountFrequency (%)
161
100.0%
Close Punctuation
ValueCountFrequency (%)
) 50
100.0%
Open Punctuation
ValueCountFrequency (%)
( 50
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14212
95.6%
Common 351
 
2.4%
Latin 299
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
917
 
6.5%
777
 
5.5%
442
 
3.1%
415
 
2.9%
412
 
2.9%
354
 
2.5%
330
 
2.3%
315
 
2.2%
314
 
2.2%
237
 
1.7%
Other values (608) 9699
68.2%
Latin
ValueCountFrequency (%)
e 25
 
8.4%
a 22
 
7.4%
s 17
 
5.7%
l 17
 
5.7%
A 14
 
4.7%
i 11
 
3.7%
o 11
 
3.7%
n 10
 
3.3%
t 10
 
3.3%
S 9
 
3.0%
Other values (33) 153
51.2%
Common
ValueCountFrequency (%)
161
45.9%
) 50
 
14.2%
( 50
 
14.2%
2 22
 
6.3%
1 13
 
3.7%
0 12
 
3.4%
9 10
 
2.8%
4 7
 
2.0%
& 5
 
1.4%
8 4
 
1.1%
Other values (9) 17
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14212
95.6%
ASCII 650
 
4.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
917
 
6.5%
777
 
5.5%
442
 
3.1%
415
 
2.9%
412
 
2.9%
354
 
2.5%
330
 
2.3%
315
 
2.2%
314
 
2.2%
237
 
1.7%
Other values (608) 9699
68.2%
ASCII
ValueCountFrequency (%)
161
24.8%
) 50
 
7.7%
( 50
 
7.7%
e 25
 
3.8%
a 22
 
3.4%
2 22
 
3.4%
s 17
 
2.6%
l 17
 
2.6%
A 14
 
2.2%
1 13
 
2.0%
Other values (52) 259
39.8%

전화번호
Text

MISSING 

Distinct1340
Distinct (%)98.3%
Missing1421
Missing (%)51.0%
Memory size21.9 KiB
2023-12-13T00:43:58.529333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.032282
Min length11

Characters and Unicode

Total characters16400
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1317 ?
Unique (%)96.6%

Sample

1st row064-744-0256
2nd row064-722-4044
3rd row064-752-6495
4th row064-758-9254
5th row064-753-3450
ValueCountFrequency (%)
064-721-7474 2
 
0.1%
064-727-8838 2
 
0.1%
064-702-1436 2
 
0.1%
064-763-1428 2
 
0.1%
064-748-7366 2
 
0.1%
064-724-6819 2
 
0.1%
064-712-2245 2
 
0.1%
070-8200-8478 2
 
0.1%
064-758-9319 2
 
0.1%
064-756-2226 2
 
0.1%
Other values (1330) 1343
98.5%
2023-12-13T00:43:59.084749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2726
16.6%
4 2227
13.6%
0 2149
13.1%
6 2086
12.7%
7 2035
12.4%
2 1142
7.0%
5 972
 
5.9%
3 891
 
5.4%
1 763
 
4.7%
8 746
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 13674
83.4%
Dash Punctuation 2726
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 2227
16.3%
0 2149
15.7%
6 2086
15.3%
7 2035
14.9%
2 1142
8.4%
5 972
7.1%
3 891
6.5%
1 763
 
5.6%
8 746
 
5.5%
9 663
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 2726
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 16400
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 2726
16.6%
4 2227
13.6%
0 2149
13.1%
6 2086
12.7%
7 2035
12.4%
2 1142
7.0%
5 972
 
5.9%
3 891
 
5.4%
1 763
 
4.7%
8 746
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 16400
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2726
16.6%
4 2227
13.6%
0 2149
13.1%
6 2086
12.7%
7 2035
12.4%
2 1142
7.0%
5 972
 
5.9%
3 891
 
5.4%
1 763
 
4.7%
8 746
 
4.5%

주소
Text

Distinct2635
Distinct (%)94.8%
Missing4
Missing (%)0.1%
Memory size21.9 KiB
2023-12-13T00:43:59.634807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length43
Mean length23.722662
Min length17

Characters and Unicode

Total characters65949
Distinct characters332
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2502 ?
Unique (%)90.0%

Sample

1st row제주특별자치도 제주시 도령북길 22 102호
2nd row제주특별자치도 제주시 무근성7길 14
3rd row제주특별자치도 제주시 동문로4길 17-1
4th row제주특별자치도 제주시 중앙로23길 6
5th row제주특별자치도 제주시 용담로18길 9-1
ValueCountFrequency (%)
제주특별자치도 2780
20.1%
제주시 2096
 
15.1%
1층 1073
 
7.8%
서귀포시 684
 
4.9%
2층 352
 
2.5%
중앙로 90
 
0.7%
102호 89
 
0.6%
101호 89
 
0.6%
3층 89
 
0.6%
대정읍 88
 
0.6%
Other values (1807) 6415
46.3%
2023-12-13T00:44:00.398293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11065
16.8%
4982
 
7.6%
4935
 
7.5%
1 3471
 
5.3%
2860
 
4.3%
2802
 
4.2%
2789
 
4.2%
2782
 
4.2%
2780
 
4.2%
2780
 
4.2%
Other values (322) 24703
37.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43185
65.5%
Decimal Number 11155
 
16.9%
Space Separator 11065
 
16.8%
Dash Punctuation 501
 
0.8%
Uppercase Letter 38
 
0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4982
 
11.5%
4935
 
11.4%
2860
 
6.6%
2802
 
6.5%
2789
 
6.5%
2782
 
6.4%
2780
 
6.4%
2780
 
6.4%
2091
 
4.8%
1616
 
3.7%
Other values (298) 12768
29.6%
Uppercase Letter
ValueCountFrequency (%)
B 13
34.2%
A 10
26.3%
E 4
 
10.5%
C 2
 
5.3%
L 2
 
5.3%
F 2
 
5.3%
T 1
 
2.6%
S 1
 
2.6%
W 1
 
2.6%
I 1
 
2.6%
Decimal Number
ValueCountFrequency (%)
1 3471
31.1%
2 1844
16.5%
3 1116
 
10.0%
0 845
 
7.6%
4 833
 
7.5%
5 773
 
6.9%
6 617
 
5.5%
8 565
 
5.1%
7 557
 
5.0%
9 534
 
4.8%
Space Separator
ValueCountFrequency (%)
11065
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 501
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43185
65.5%
Common 22726
34.5%
Latin 38
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4982
 
11.5%
4935
 
11.4%
2860
 
6.6%
2802
 
6.5%
2789
 
6.5%
2782
 
6.4%
2780
 
6.4%
2780
 
6.4%
2091
 
4.8%
1616
 
3.7%
Other values (298) 12768
29.6%
Common
ValueCountFrequency (%)
11065
48.7%
1 3471
 
15.3%
2 1844
 
8.1%
3 1116
 
4.9%
0 845
 
3.7%
4 833
 
3.7%
5 773
 
3.4%
6 617
 
2.7%
8 565
 
2.5%
7 557
 
2.5%
Other values (3) 1040
 
4.6%
Latin
ValueCountFrequency (%)
B 13
34.2%
A 10
26.3%
E 4
 
10.5%
C 2
 
5.3%
L 2
 
5.3%
F 2
 
5.3%
T 1
 
2.6%
S 1
 
2.6%
W 1
 
2.6%
I 1
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43185
65.5%
ASCII 22764
34.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11065
48.6%
1 3471
 
15.2%
2 1844
 
8.1%
3 1116
 
4.9%
0 845
 
3.7%
4 833
 
3.7%
5 773
 
3.4%
6 617
 
2.7%
8 565
 
2.5%
7 557
 
2.4%
Other values (14) 1078
 
4.7%
Hangul
ValueCountFrequency (%)
4982
 
11.5%
4935
 
11.4%
2860
 
6.6%
2802
 
6.5%
2789
 
6.5%
2782
 
6.4%
2780
 
6.4%
2780
 
6.4%
2091
 
4.8%
1616
 
3.7%
Other values (298) 12768
29.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size21.9 KiB
2023-10-30
2784 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-30
2nd row2023-10-30
3rd row2023-10-30
4th row2023-10-30
5th row2023-10-30

Common Values

ValueCountFrequency (%)
2023-10-30 2784
100.0%

Length

2023-12-13T00:44:00.549211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:44:00.675625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-30 2784
100.0%

Missing values

2023-12-13T00:43:56.819266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:43:56.973235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T00:43:57.100225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

이름전화번호주소데이터기준일자
0채안나뷰티살롱064-744-0256제주특별자치도 제주시 도령북길 22 102호2023-10-30
1우리064-722-4044제주특별자치도 제주시 무근성7길 142023-10-30
2국제미용실064-752-6495제주특별자치도 제주시 동문로4길 17-12023-10-30
3금천064-758-9254제주특별자치도 제주시 중앙로23길 62023-10-30
4화신064-753-3450제주특별자치도 제주시 용담로18길 9-12023-10-30
5경양064-758-9248제주특별자치도 제주시 전농로7길 34 1층2023-10-30
6보경064-753-6039제주특별자치도 제주시 관덕로17길 37-42023-10-30
7전원064-751-5682제주특별자치도 제주시 임항로 2612023-10-30
8보라064-743-1122제주특별자치도 제주시 신광로6길 132023-10-30
9064-751-0942제주특별자치도 제주시 관덕로6길 52023-10-30
이름전화번호주소데이터기준일자
2774이트네일 스튜디오<NA>제주특별자치도 서귀포시 안덕면 산방로 3 1층2023-10-30
2775스파바이제이더블유(SPA BY JW)<NA>제주특별자치도 서귀포시 태평로 152 지하2층2023-10-30
2776앤제이살롱064-712-8733제주특별자치도 서귀포시 대정읍 일주서로 25772023-10-30
2777라이티티아<NA>제주특별자치도 서귀포시 중앙로72번길 3 2층2023-10-30
2778로고스헤어064-739-0055제주특별자치도 서귀포시 김정문화로 522023-10-30
2779블루밍(Blooming)<NA>제주특별자치도 서귀포시 성산읍 일출로 12 1층2023-10-30
2780네일예쁜날<NA>제주특별자치도 서귀포시 남원읍 태위로 673 2층2023-10-30
2781이미호아이래쉬서귀포점064-794-0426제주특별자치도 서귀포시 대정읍 보성구억로 209 B동 1층 104호2023-10-30
2782닥터아이티엔서귀포점<NA>제주특별자치도 서귀포시 태평로 5342023-10-30
2783제이제이엠에스뷰티(JJ MS BEAUTY)<NA>제주특별자치도 서귀포시 동문로 34 1층2023-10-30

Duplicate rows

Most frequently occurring

이름전화번호주소데이터기준일자# duplicates
0뷰티타임<NA>제주특별자치도 제주시 청사로 44 2층2023-10-302
1살롱드메이드<NA>제주특별자치도 제주시 신설로5길 8 1층 101호2023-10-302