Overview

Dataset statistics

Number of variables5
Number of observations1757
Missing cells990
Missing cells (%)11.3%
Duplicate rows3
Duplicate rows (%)0.2%
Total size in memory68.8 KiB
Average record size in memory40.1 B

Variable types

Categorical2
Text3

Dataset

Description인천광역시 서구에 등록된 미용업 현황(업종명, 업소명, 업소소재지, 소재지전화번호, 데이터 기준일 등 )입니다.
Author인천광역시 서구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15039942&srcSe=7661IVAWM27C61E190

Alerts

업종명 has constant value ""Constant
데이터기준일 has constant value ""Constant
Dataset has 3 (0.2%) duplicate rowsDuplicates
소재지전화 has 990 (56.3%) missing valuesMissing

Reproduction

Analysis started2024-01-28 05:27:16.414015
Analysis finished2024-01-28 05:27:17.109521
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
미용업
1757 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미용업
2nd row미용업
3rd row미용업
4th row미용업
5th row미용업

Common Values

ValueCountFrequency (%)
미용업 1757
100.0%

Length

2024-01-28T14:27:17.163418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T14:27:17.248038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미용업 1757
100.0%
Distinct1657
Distinct (%)94.3%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
2024-01-28T14:27:17.490965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length32
Mean length6.430848
Min length2

Characters and Unicode

Total characters11299
Distinct characters625
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1587 ?
Unique (%)90.3%

Sample

1st row리헤어시티
2nd row이지숙머리방
3rd row배해영헤어모드
4th row정화헤어
5th row컷앤컷헤어클럽
ValueCountFrequency (%)
헤어 32
 
1.5%
hair 19
 
0.9%
nail 18
 
0.8%
에스테틱 14
 
0.6%
네일 14
 
0.6%
청라점 12
 
0.6%
de 10
 
0.5%
헤어샵 9
 
0.4%
블루클럽 8
 
0.4%
검단신도시점 8
 
0.4%
Other values (1802) 2014
93.3%
2024-01-28T14:27:17.914147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
772
 
6.8%
736
 
6.5%
401
 
3.5%
282
 
2.5%
250
 
2.2%
245
 
2.2%
235
 
2.1%
219
 
1.9%
212
 
1.9%
) 182
 
1.6%
Other values (615) 7765
68.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8902
78.8%
Uppercase Letter 698
 
6.2%
Lowercase Letter 693
 
6.1%
Space Separator 401
 
3.5%
Close Punctuation 182
 
1.6%
Open Punctuation 181
 
1.6%
Decimal Number 118
 
1.0%
Other Punctuation 111
 
1.0%
Dash Punctuation 7
 
0.1%
Connector Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
772
 
8.7%
736
 
8.3%
282
 
3.2%
250
 
2.8%
245
 
2.8%
235
 
2.6%
219
 
2.5%
212
 
2.4%
168
 
1.9%
144
 
1.6%
Other values (538) 5639
63.3%
Uppercase Letter
ValueCountFrequency (%)
A 78
 
11.2%
H 56
 
8.0%
I 51
 
7.3%
E 51
 
7.3%
N 50
 
7.2%
S 49
 
7.0%
L 45
 
6.4%
R 42
 
6.0%
O 32
 
4.6%
M 29
 
4.2%
Other values (15) 215
30.8%
Lowercase Letter
ValueCountFrequency (%)
a 99
14.3%
e 71
10.2%
i 70
10.1%
o 62
 
8.9%
n 62
 
8.9%
l 56
 
8.1%
r 33
 
4.8%
y 31
 
4.5%
u 26
 
3.8%
h 26
 
3.8%
Other values (14) 157
22.7%
Other Punctuation
ValueCountFrequency (%)
& 29
26.1%
# 25
22.5%
, 22
19.8%
. 13
11.7%
: 10
 
9.0%
' 5
 
4.5%
; 4
 
3.6%
? 1
 
0.9%
! 1
 
0.9%
1
 
0.9%
Decimal Number
ValueCountFrequency (%)
1 26
22.0%
2 26
22.0%
0 22
18.6%
3 11
9.3%
6 7
 
5.9%
4 6
 
5.1%
9 6
 
5.1%
5 5
 
4.2%
8 5
 
4.2%
7 4
 
3.4%
Math Symbol
ValueCountFrequency (%)
= 1
33.3%
> 1
33.3%
< 1
33.3%
Space Separator
ValueCountFrequency (%)
401
100.0%
Close Punctuation
ValueCountFrequency (%)
) 182
100.0%
Open Punctuation
ValueCountFrequency (%)
( 181
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8896
78.7%
Latin 1391
 
12.3%
Common 1006
 
8.9%
Han 6
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
772
 
8.7%
736
 
8.3%
282
 
3.2%
250
 
2.8%
245
 
2.8%
235
 
2.6%
219
 
2.5%
212
 
2.4%
168
 
1.9%
144
 
1.6%
Other values (534) 5633
63.3%
Latin
ValueCountFrequency (%)
a 99
 
7.1%
A 78
 
5.6%
e 71
 
5.1%
i 70
 
5.0%
o 62
 
4.5%
n 62
 
4.5%
H 56
 
4.0%
l 56
 
4.0%
I 51
 
3.7%
E 51
 
3.7%
Other values (39) 735
52.8%
Common
ValueCountFrequency (%)
401
39.9%
) 182
18.1%
( 181
18.0%
& 29
 
2.9%
1 26
 
2.6%
2 26
 
2.6%
# 25
 
2.5%
, 22
 
2.2%
0 22
 
2.2%
. 13
 
1.3%
Other values (18) 79
 
7.9%
Han
ValueCountFrequency (%)
3
50.0%
1
 
16.7%
1
 
16.7%
1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8896
78.7%
ASCII 2396
 
21.2%
CJK 6
 
0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
772
 
8.7%
736
 
8.3%
282
 
3.2%
250
 
2.8%
245
 
2.8%
235
 
2.6%
219
 
2.5%
212
 
2.4%
168
 
1.9%
144
 
1.6%
Other values (534) 5633
63.3%
ASCII
ValueCountFrequency (%)
401
 
16.7%
) 182
 
7.6%
( 181
 
7.6%
a 99
 
4.1%
A 78
 
3.3%
e 71
 
3.0%
i 70
 
2.9%
o 62
 
2.6%
n 62
 
2.6%
H 56
 
2.3%
Other values (66) 1134
47.3%
CJK
ValueCountFrequency (%)
3
50.0%
1
 
16.7%
1
 
16.7%
1
 
16.7%
None
ValueCountFrequency (%)
1
100.0%
Distinct1737
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
2024-01-28T14:27:18.171039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length48
Mean length35.75868
Min length9

Characters and Unicode

Total characters62828
Distinct characters374
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1718 ?
Unique (%)97.8%

Sample

1st row인천광역시 서구 검단로 489 (마전동)
2nd row인천광역시 서구 원창로 234 (가정동)
3rd row인천광역시 서구 간촌로31번길 2 (연희동)
4th row인천광역시 서구 봉수대로1440번길 4, 검단풍림아이원아파트 상가동 102호 (왕길동)
5th row인천광역시 서구 완정로34번길 53, 현대아파트 상가동 304호 (마전동)
ValueCountFrequency (%)
인천광역시 1756
 
14.4%
서구 1756
 
14.4%
청라동 353
 
2.9%
1층 347
 
2.8%
석남동 180
 
1.5%
가정동 178
 
1.5%
일부호 157
 
1.3%
가좌동 144
 
1.2%
원당동 137
 
1.1%
마전동 137
 
1.1%
Other values (1901) 7046
57.8%
2024-01-28T14:27:18.529195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10437
 
16.6%
1 2701
 
4.3%
2099
 
3.3%
1932
 
3.1%
1865
 
3.0%
, 1855
 
3.0%
1833
 
2.9%
1787
 
2.8%
) 1786
 
2.8%
( 1785
 
2.8%
Other values (364) 34748
55.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35241
56.1%
Decimal Number 10970
 
17.5%
Space Separator 10437
 
16.6%
Other Punctuation 1876
 
3.0%
Close Punctuation 1786
 
2.8%
Open Punctuation 1785
 
2.8%
Uppercase Letter 318
 
0.5%
Dash Punctuation 310
 
0.5%
Lowercase Letter 92
 
0.1%
Letter Number 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2099
 
6.0%
1932
 
5.5%
1865
 
5.3%
1833
 
5.2%
1787
 
5.1%
1769
 
5.0%
1764
 
5.0%
1761
 
5.0%
1759
 
5.0%
1347
 
3.8%
Other values (317) 17325
49.2%
Uppercase Letter
ValueCountFrequency (%)
B 77
24.2%
A 54
17.0%
K 28
 
8.8%
I 23
 
7.2%
E 20
 
6.3%
S 20
 
6.3%
W 18
 
5.7%
L 18
 
5.7%
M 17
 
5.3%
V 17
 
5.3%
Other values (8) 26
 
8.2%
Decimal Number
ValueCountFrequency (%)
1 2701
24.6%
2 1759
16.0%
0 1456
13.3%
3 1040
 
9.5%
4 846
 
7.7%
8 687
 
6.3%
5 676
 
6.2%
6 652
 
5.9%
7 632
 
5.8%
9 521
 
4.7%
Lowercase Letter
ValueCountFrequency (%)
e 34
37.0%
r 15
16.3%
s 14
15.2%
d 14
15.2%
a 14
15.2%
y 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
, 1855
98.9%
' 14
 
0.7%
@ 4
 
0.2%
. 2
 
0.1%
/ 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 3
60.0%
> 1
 
20.0%
< 1
 
20.0%
Space Separator
ValueCountFrequency (%)
10437
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1786
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1785
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 310
100.0%
Letter Number
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35241
56.1%
Common 27169
43.2%
Latin 418
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2099
 
6.0%
1932
 
5.5%
1865
 
5.3%
1833
 
5.2%
1787
 
5.1%
1769
 
5.0%
1764
 
5.0%
1761
 
5.0%
1759
 
5.0%
1347
 
3.8%
Other values (317) 17325
49.2%
Latin
ValueCountFrequency (%)
B 77
18.4%
A 54
12.9%
e 34
 
8.1%
K 28
 
6.7%
I 23
 
5.5%
E 20
 
4.8%
S 20
 
4.8%
W 18
 
4.3%
L 18
 
4.3%
M 17
 
4.1%
Other values (15) 109
26.1%
Common
ValueCountFrequency (%)
10437
38.4%
1 2701
 
9.9%
, 1855
 
6.8%
) 1786
 
6.6%
( 1785
 
6.6%
2 1759
 
6.5%
0 1456
 
5.4%
3 1040
 
3.8%
4 846
 
3.1%
8 687
 
2.5%
Other values (12) 2817
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35241
56.1%
ASCII 27579
43.9%
Number Forms 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10437
37.8%
1 2701
 
9.8%
, 1855
 
6.7%
) 1786
 
6.5%
( 1785
 
6.5%
2 1759
 
6.4%
0 1456
 
5.3%
3 1040
 
3.8%
4 846
 
3.1%
8 687
 
2.5%
Other values (36) 3227
 
11.7%
Hangul
ValueCountFrequency (%)
2099
 
6.0%
1932
 
5.5%
1865
 
5.3%
1833
 
5.2%
1787
 
5.1%
1769
 
5.0%
1764
 
5.0%
1761
 
5.0%
1759
 
5.0%
1347
 
3.8%
Other values (317) 17325
49.2%
Number Forms
ValueCountFrequency (%)
8
100.0%

소재지전화
Text

MISSING 

Distinct757
Distinct (%)98.7%
Missing990
Missing (%)56.3%
Memory size13.9 KiB
2024-01-28T14:27:18.756215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.043025
Min length11

Characters and Unicode

Total characters9237
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique747 ?
Unique (%)97.4%

Sample

1st row032-563-0346
2nd row032-581-0272
3rd row032-564-4008
4th row032-566-8001
5th row032-565-1328
ValueCountFrequency (%)
032-564-1489 2
 
0.3%
032-563-2530 2
 
0.3%
032-578-3220 2
 
0.3%
032-564-1109 2
 
0.3%
032-568-0199 2
 
0.3%
032-562-8520 2
 
0.3%
032-561-1172 2
 
0.3%
032-575-2361 2
 
0.3%
032-577-2924 2
 
0.3%
032-564-3461 2
 
0.3%
Other values (747) 747
97.4%
2024-01-28T14:27:19.101840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1534
16.6%
2 1209
13.1%
3 1199
13.0%
0 1169
12.7%
5 1080
11.7%
6 726
7.9%
7 708
7.7%
8 452
 
4.9%
1 445
 
4.8%
4 388
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7703
83.4%
Dash Punctuation 1534
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 1209
15.7%
3 1199
15.6%
0 1169
15.2%
5 1080
14.0%
6 726
9.4%
7 708
9.2%
8 452
 
5.9%
1 445
 
5.8%
4 388
 
5.0%
9 327
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 1534
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9237
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1534
16.6%
2 1209
13.1%
3 1199
13.0%
0 1169
12.7%
5 1080
11.7%
6 726
7.9%
7 708
7.7%
8 452
 
4.9%
1 445
 
4.8%
4 388
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9237
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1534
16.6%
2 1209
13.1%
3 1199
13.0%
0 1169
12.7%
5 1080
11.7%
6 726
7.9%
7 708
7.7%
8 452
 
4.9%
1 445
 
4.8%
4 388
 
4.2%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.9 KiB
2023-03-30
1757 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-03-30
2nd row2023-03-30
3rd row2023-03-30
4th row2023-03-30
5th row2023-03-30

Common Values

ValueCountFrequency (%)
2023-03-30 1757
100.0%

Length

2024-01-28T14:27:19.233608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T14:27:19.331114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-03-30 1757
100.0%

Missing values

2024-01-28T14:27:16.996994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T14:27:17.075919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명업소 소재지소재지전화데이터기준일
0미용업리헤어시티인천광역시 서구 검단로 489 (마전동)032-563-03462023-03-30
1미용업이지숙머리방인천광역시 서구 원창로 234 (가정동)032-581-02722023-03-30
2미용업배해영헤어모드인천광역시 서구 간촌로31번길 2 (연희동)032-564-40082023-03-30
3미용업정화헤어인천광역시 서구 봉수대로1440번길 4, 검단풍림아이원아파트 상가동 102호 (왕길동)032-566-80012023-03-30
4미용업컷앤컷헤어클럽인천광역시 서구 완정로34번길 53, 현대아파트 상가동 304호 (마전동)032-565-13282023-03-30
5미용업동부헤어타운인천광역시 서구 검단로 853 (불로동, 동부상가2동203호)032-277-75302023-03-30
6미용업비녀헤어인천광역시 서구 검단로 842 (불로동, 월드주상가1층23호)032-568-11652023-03-30
7미용업나니미용실인천광역시 서구 가정로 124 (가좌동)032-575-87012023-03-30
8미용업세느미용실인천광역시 서구 가정로 124 (가좌동)032-576-08672023-03-30
9미용업블랙정인천광역시 서구 건지로284번길 28 (가좌동)032-583-14032023-03-30
업종명업소명업소 소재지소재지전화데이터기준일
1747미용업블링제이뷰티인천광역시 서구 청중로478번안길 6, 아트프라자 2층 일부호 (가정동)<NA>2023-03-30
1748미용업청라한네일인천광역시 서구 청라에메랄드로 99, 지젤엠청라 1층 69호 (청라동)<NA>2023-03-30
1749미용업라라뷰티인천광역시 서구 승학로 481, 한명빌딩 3층 일부호 (검암동)<NA>2023-03-30
1750미용업리블로셀검단아라점인천광역시 서구 이음5로 39, 3동 2층 204호 (원당동, 우미린 더 시그니처)<NA>2023-03-30
1751미용업네일루인천광역시 서구 검단로768번1길 11-6, 102호 (불로동)<NA>2023-03-30
1752미용업모드니네일인천광역시 서구 청라에메랄드로 99, 지젤엠청라 1층 38호 (청라동)<NA>2023-03-30
1753미용업오늘네일인천광역시 서구 서곶로 16, 한신그랜드힐빌리지 상가1동 227호 (가정동)<NA>2023-03-30
1754미용업라움(RAUM)인천광역시 서구 이음대로 388, ABM타워 518호 (원당동)<NA>2023-03-30
1755미용업아엘리뷰티 검단신도시 피부관리 네일 왁싱 속눈썹인천광역시 서구 바리미로5번길 12, 다승프라자Ⅱ 602호 (원당동)<NA>2023-03-30
1756미용업도르뷰티인천광역시 서구 청라에메랄드로102번길 8-22, 미라클프라자 104호 (청라동)<NA>2023-03-30

Duplicate rows

Most frequently occurring

업종명업소명업소 소재지소재지전화데이터기준일# duplicates
0미용업라라뷰티인천광역시 서구 승학로 481, 한명빌딩 3층 일부호 (검암동)<NA>2023-03-302
1미용업모드니인천광역시 서구 완정로178번길 44, 103일부호 (마전동, 현승빌딩)<NA>2023-03-302
2미용업이쁨더하기인천광역시 서구 검단로 853, 204호 일부 (불로동)032-563-25302023-03-302