Overview

Dataset statistics

Number of variables5
Number of observations556
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory21.8 KiB
Average record size in memory40.2 B

Variable types

Categorical2
Text3

Dataset

Description울산광역시 울주군의 공중 미용업 현황에 대한 데이터로 업종명, 업소명, 영업소 주소(도로명), 소재지 전화 등의 정보를 제공합니다.
Author울산광역시 울주군
URLhttps://www.data.go.kr/data/15007188/fileData.do

Alerts

기준일자 has constant value ""Constant
업종명 is highly imbalanced (59.1%)Imbalance

Reproduction

Analysis started2023-12-12 08:31:33.174587
Analysis finished2023-12-12 08:31:33.795646
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct12
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
일반미용업
405 
피부미용업
62 
네일미용업
51 
일반미용업, 화장ㆍ분장 미용업
 
10
화장ㆍ분장 미용업
 
7
Other values (7)
 
21

Length

Max length23
Median length5
Mean length5.5719424
Min length5

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row일반미용업
2nd row일반미용업
3rd row일반미용업
4th row일반미용업
5th row일반미용업

Common Values

ValueCountFrequency (%)
일반미용업 405
72.8%
피부미용업 62
 
11.2%
네일미용업 51
 
9.2%
일반미용업, 화장ㆍ분장 미용업 10
 
1.8%
화장ㆍ분장 미용업 7
 
1.3%
종합미용업 5
 
0.9%
일반미용업, 네일미용업 3
 
0.5%
피부미용업, 네일미용업 3
 
0.5%
피부미용업, 화장ㆍ분장 미용업 3
 
0.5%
네일미용업, 화장ㆍ분장 미용업 3
 
0.5%
Other values (2) 4
 
0.7%

Length

2023-12-12T17:31:33.889888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반미용업 421
68.7%
피부미용업 69
 
11.3%
네일미용업 64
 
10.4%
화장ㆍ분장 27
 
4.4%
미용업 27
 
4.4%
종합미용업 5
 
0.8%
Distinct539
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
2023-12-12T17:31:34.166256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length15
Mean length5.6384892
Min length1

Characters and Unicode

Total characters3135
Distinct characters408
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique525 ?
Unique (%)94.4%

Sample

1st row오고파미장원
2nd row날개미용실
3rd row덕신미용실
4th row로얄미용실
5th row성림미용실
ValueCountFrequency (%)
헤어 9
 
1.4%
hair 7
 
1.1%
남성컷트전문점 6
 
0.9%
포맨 3
 
0.5%
뷰티 3
 
0.5%
nail 3
 
0.5%
스킨케어 3
 
0.5%
네일 3
 
0.5%
남성컷트 3
 
0.5%
살롱 3
 
0.5%
Other values (572) 592
93.2%
2023-12-12T17:31:34.621167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
268
 
8.5%
251
 
8.0%
107
 
3.4%
84
 
2.7%
80
 
2.6%
77
 
2.5%
65
 
2.1%
60
 
1.9%
49
 
1.6%
48
 
1.5%
Other values (398) 2046
65.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2805
89.5%
Lowercase Letter 129
 
4.1%
Space Separator 80
 
2.6%
Uppercase Letter 73
 
2.3%
Other Punctuation 16
 
0.5%
Open Punctuation 12
 
0.4%
Close Punctuation 12
 
0.4%
Decimal Number 6
 
0.2%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
268
 
9.6%
251
 
8.9%
107
 
3.8%
84
 
3.0%
77
 
2.7%
65
 
2.3%
60
 
2.1%
49
 
1.7%
48
 
1.7%
48
 
1.7%
Other values (341) 1748
62.3%
Lowercase Letter
ValueCountFrequency (%)
a 19
14.7%
i 17
13.2%
e 13
10.1%
r 13
10.1%
o 11
8.5%
h 9
7.0%
l 8
 
6.2%
t 7
 
5.4%
y 5
 
3.9%
n 5
 
3.9%
Other values (12) 22
17.1%
Uppercase Letter
ValueCountFrequency (%)
H 15
20.5%
M 6
 
8.2%
I 6
 
8.2%
A 6
 
8.2%
R 5
 
6.8%
N 4
 
5.5%
S 4
 
5.5%
Y 3
 
4.1%
O 3
 
4.1%
B 3
 
4.1%
Other values (10) 18
24.7%
Other Punctuation
ValueCountFrequency (%)
. 5
31.2%
& 4
25.0%
' 2
 
12.5%
, 1
 
6.2%
· 1
 
6.2%
1
 
6.2%
# 1
 
6.2%
: 1
 
6.2%
Decimal Number
ValueCountFrequency (%)
1 3
50.0%
9 2
33.3%
0 1
 
16.7%
Space Separator
ValueCountFrequency (%)
80
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2804
89.4%
Latin 202
 
6.4%
Common 128
 
4.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
268
 
9.6%
251
 
9.0%
107
 
3.8%
84
 
3.0%
77
 
2.7%
65
 
2.3%
60
 
2.1%
49
 
1.7%
48
 
1.7%
48
 
1.7%
Other values (340) 1747
62.3%
Latin
ValueCountFrequency (%)
a 19
 
9.4%
i 17
 
8.4%
H 15
 
7.4%
e 13
 
6.4%
r 13
 
6.4%
o 11
 
5.4%
h 9
 
4.5%
l 8
 
4.0%
t 7
 
3.5%
M 6
 
3.0%
Other values (32) 84
41.6%
Common
ValueCountFrequency (%)
80
62.5%
( 12
 
9.4%
) 12
 
9.4%
. 5
 
3.9%
& 4
 
3.1%
1 3
 
2.3%
' 2
 
1.6%
9 2
 
1.6%
- 2
 
1.6%
, 1
 
0.8%
Other values (5) 5
 
3.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2804
89.4%
ASCII 328
 
10.5%
None 2
 
0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
268
 
9.6%
251
 
9.0%
107
 
3.8%
84
 
3.0%
77
 
2.7%
65
 
2.3%
60
 
2.1%
49
 
1.7%
48
 
1.7%
48
 
1.7%
Other values (340) 1747
62.3%
ASCII
ValueCountFrequency (%)
80
24.4%
a 19
 
5.8%
i 17
 
5.2%
H 15
 
4.6%
e 13
 
4.0%
r 13
 
4.0%
( 12
 
3.7%
) 12
 
3.7%
o 11
 
3.4%
h 9
 
2.7%
Other values (45) 127
38.7%
None
ValueCountFrequency (%)
· 1
50.0%
1
50.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct545
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
2023-12-12T17:31:35.050465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length45
Mean length27.991007
Min length9

Characters and Unicode

Total characters15563
Distinct characters234
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique535 ?
Unique (%)96.2%

Sample

1st row울산광역시 울주군 두서면 노동길 30
2nd row울산광역시 울주군 청량읍 대등길 29
3rd row울산광역시 울주군 온산읍 온덕1길 25
4th row울산광역시 울주군 언양읍 동문길 90
5th row울산광역시 울주군 언양읍 헌양길 54
ValueCountFrequency (%)
울산광역시 554
 
16.1%
울주군 554
 
16.1%
1층 185
 
5.4%
범서읍 178
 
5.2%
언양읍 113
 
3.3%
온양읍 94
 
2.7%
온산읍 83
 
2.4%
2층 48
 
1.4%
삼남읍 37
 
1.1%
상가동 30
 
0.9%
Other values (611) 1568
45.5%
2023-12-12T17:31:35.642868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2888
18.6%
1122
 
7.2%
1 768
 
4.9%
655
 
4.2%
567
 
3.6%
561
 
3.6%
558
 
3.6%
555
 
3.6%
554
 
3.6%
537
 
3.5%
Other values (224) 6798
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9417
60.5%
Space Separator 2888
 
18.6%
Decimal Number 2456
 
15.8%
Other Punctuation 344
 
2.2%
Close Punctuation 147
 
0.9%
Open Punctuation 146
 
0.9%
Dash Punctuation 135
 
0.9%
Uppercase Letter 25
 
0.2%
Math Symbol 4
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1122
 
11.9%
655
 
7.0%
567
 
6.0%
561
 
6.0%
558
 
5.9%
555
 
5.9%
554
 
5.9%
537
 
5.7%
454
 
4.8%
253
 
2.7%
Other values (191) 3601
38.2%
Decimal Number
ValueCountFrequency (%)
1 768
31.3%
2 379
15.4%
3 246
 
10.0%
0 242
 
9.9%
4 196
 
8.0%
5 168
 
6.8%
6 126
 
5.1%
7 114
 
4.6%
9 113
 
4.6%
8 104
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
A 8
32.0%
E 4
16.0%
C 3
 
12.0%
B 2
 
8.0%
P 2
 
8.0%
L 2
 
8.0%
R 2
 
8.0%
G 1
 
4.0%
M 1
 
4.0%
Other Punctuation
ValueCountFrequency (%)
, 339
98.5%
@ 2
 
0.6%
* 1
 
0.3%
/ 1
 
0.3%
! 1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 146
99.3%
] 1
 
0.7%
Open Punctuation
ValueCountFrequency (%)
( 145
99.3%
[ 1
 
0.7%
Math Symbol
ValueCountFrequency (%)
> 2
50.0%
< 2
50.0%
Space Separator
ValueCountFrequency (%)
2888
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 135
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9417
60.5%
Common 6120
39.3%
Latin 26
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1122
 
11.9%
655
 
7.0%
567
 
6.0%
561
 
6.0%
558
 
5.9%
555
 
5.9%
554
 
5.9%
537
 
5.7%
454
 
4.8%
253
 
2.7%
Other values (191) 3601
38.2%
Common
ValueCountFrequency (%)
2888
47.2%
1 768
 
12.5%
2 379
 
6.2%
, 339
 
5.5%
3 246
 
4.0%
0 242
 
4.0%
4 196
 
3.2%
5 168
 
2.7%
) 146
 
2.4%
( 145
 
2.4%
Other values (13) 603
 
9.9%
Latin
ValueCountFrequency (%)
A 8
30.8%
E 4
15.4%
C 3
 
11.5%
B 2
 
7.7%
P 2
 
7.7%
L 2
 
7.7%
R 2
 
7.7%
e 1
 
3.8%
G 1
 
3.8%
M 1
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9417
60.5%
ASCII 6146
39.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2888
47.0%
1 768
 
12.5%
2 379
 
6.2%
, 339
 
5.5%
3 246
 
4.0%
0 242
 
3.9%
4 196
 
3.2%
5 168
 
2.7%
) 146
 
2.4%
( 145
 
2.4%
Other values (23) 629
 
10.2%
Hangul
ValueCountFrequency (%)
1122
 
11.9%
655
 
7.0%
567
 
6.0%
561
 
6.0%
558
 
5.9%
555
 
5.9%
554
 
5.9%
537
 
5.7%
454
 
4.8%
253
 
2.7%
Other values (191) 3601
38.2%
Distinct314
Distinct (%)56.5%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
2023-12-12T17:31:35.923428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length7.2302158
Min length1

Characters and Unicode

Total characters4020
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique312 ?
Unique (%)56.1%

Sample

1st row052-262-6200
2nd row052-269-8144
3rd row052-238-2432
4th row052-262-2872
5th row052-262-3743
ValueCountFrequency (%)
052-244-2555 2
 
0.6%
052-237-4227 1
 
0.3%
052-264-6721 1
 
0.3%
052-264-0517 1
 
0.3%
052-244-0770 1
 
0.3%
070-4062-8225 1
 
0.3%
052-969-8096 1
 
0.3%
052-239-7907 1
 
0.3%
070-8821-2904 1
 
0.3%
052-239-2766 1
 
0.3%
Other values (303) 303
96.5%
2023-12-12T17:31:36.410016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 792
19.7%
- 628
15.6%
0 497
12.4%
5 484
12.0%
242
 
6.0%
3 236
 
5.9%
7 224
 
5.6%
4 206
 
5.1%
1 205
 
5.1%
8 186
 
4.6%
Other values (2) 320
8.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3150
78.4%
Dash Punctuation 628
 
15.6%
Space Separator 242
 
6.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 792
25.1%
0 497
15.8%
5 484
15.4%
3 236
 
7.5%
7 224
 
7.1%
4 206
 
6.5%
1 205
 
6.5%
8 186
 
5.9%
6 179
 
5.7%
9 141
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 628
100.0%
Space Separator
ValueCountFrequency (%)
242
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4020
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 792
19.7%
- 628
15.6%
0 497
12.4%
5 484
12.0%
242
 
6.0%
3 236
 
5.9%
7 224
 
5.6%
4 206
 
5.1%
1 205
 
5.1%
8 186
 
4.6%
Other values (2) 320
8.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4020
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 792
19.7%
- 628
15.6%
0 497
12.4%
5 484
12.0%
242
 
6.0%
3 236
 
5.9%
7 224
 
5.6%
4 206
 
5.1%
1 205
 
5.1%
8 186
 
4.6%
Other values (2) 320
8.0%

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
2021-09-14
556 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-09-14
2nd row2021-09-14
3rd row2021-09-14
4th row2021-09-14
5th row2021-09-14

Common Values

ValueCountFrequency (%)
2021-09-14 556
100.0%

Length

2023-12-12T17:31:36.593204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:31:36.743299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-09-14 556
100.0%

Missing values

2023-12-12T17:31:33.618783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:31:33.740885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업소 주소(도로명)소재지전화기준일자
0일반미용업오고파미장원울산광역시 울주군 두서면 노동길 30052-262-62002021-09-14
1일반미용업날개미용실울산광역시 울주군 청량읍 대등길 29052-269-81442021-09-14
2일반미용업덕신미용실울산광역시 울주군 온산읍 온덕1길 25052-238-24322021-09-14
3일반미용업로얄미용실울산광역시 울주군 언양읍 동문길 90052-262-28722021-09-14
4일반미용업성림미용실울산광역시 울주군 언양읍 헌양길 54052-262-37432021-09-14
5일반미용업목화미용실울산광역시 울주군 언양읍 장터1길 22, 2층 202호052-263-02022021-09-14
6일반미용업주희미용실울산광역시 울주군 언양읍 동문길 53052-262-30972021-09-14
7일반미용업여왕미용실울산광역시 울주군 온양읍 남창장터길 5 ((1층))052-238-69282021-09-14
8일반미용업영남미용실울산광역시 울주군 두서면 노동1길 31052-262-61002021-09-14
9일반미용업엘레강스미용실울산광역시 울주군 청량읍 덕하장터길 1052-268-28932021-09-14
업종명업소명영업소 주소(도로명)소재지전화기준일자
546피부미용업, 화장ㆍ분장 미용업나인뷰티울산광역시 울주군 언양읍 헌양길 61, 1층2021-09-14
547피부미용업, 화장ㆍ분장 미용업the 예쁜가시나울산광역시 울주군 온산읍 영남길 12, 101호2021-09-14
548피부미용업, 화장ㆍ분장 미용업뷰티샵눈썹가게울산광역시 울주군 온산읍 영남7길 42021-09-14
549네일미용업, 화장ㆍ분장 미용업네일또와울산광역시 울주군 범서읍 천상중앙길 70, 3동 103호 (천상중앙시장형상가)2021-09-14
550네일미용업, 화장ㆍ분장 미용업로즈네일울산광역시 울주군 삼남읍 봉화로 5-12021-09-14
551네일미용업, 화장ㆍ분장 미용업레푸스울산범서점울산광역시 울주군 범서읍 울밀로 2816, 2층052-9115-6562021-09-14
552일반미용업, 네일미용업, 화장ㆍ분장 미용업샤인헤어울산광역시 울주군 온산읍 영남7길 16052-237-24532021-09-14
553일반미용업, 네일미용업, 화장ㆍ분장 미용업준헤어커커울산광역시 울주군 온양읍 온양로 157, 102호052-238-08262021-09-14
554일반미용업, 네일미용업, 화장ㆍ분장 미용업금단비헤어샵울산광역시 울주군 온양읍 연안8길 3, 1층2021-09-14
555피부미용업, 네일미용업, 화장ㆍ분장 미용업새기다 뷰티울산광역시 울주군 온산읍 신온7길 30, 1층2021-09-14