Overview

Dataset statistics

Number of variables5
Number of observations37
Missing cells7
Missing cells (%)3.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory43.6 B

Variable types

Categorical1
Text4

Dataset

Description경상북도 군위군의 미용업현황에 대한 데이터로 업종, 업소명, 업소소재지(도로명주소, 지번), 전화번호에 대한 항목을 제공합니다.
Author경상북도 군위군
URLhttps://www.data.go.kr/data/15033534/fileData.do

Alerts

업종명 is highly imbalanced (61.1%)Imbalance
전화번호 has 7 (18.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 15:03:26.623650
Analysis finished2023-12-12 15:03:27.110962
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct6
Distinct (%)16.2%
Missing0
Missing (%)0.0%
Memory size428.0 B
일반미용업
31 
피부미용업
 
2
종합미용업
 
1
네일미용업
 
1
일반미용업, 네일미용업
 
1

Length

Max length16
Median length5
Mean length5.4864865
Min length5

Unique

Unique4 ?
Unique (%)10.8%

Sample

1st row일반미용업
2nd row일반미용업
3rd row일반미용업
4th row일반미용업
5th row일반미용업

Common Values

ValueCountFrequency (%)
일반미용업 31
83.8%
피부미용업 2
 
5.4%
종합미용업 1
 
2.7%
네일미용업 1
 
2.7%
일반미용업, 네일미용업 1
 
2.7%
네일미용업, 화장ㆍ분장 미용업 1
 
2.7%

Length

2023-12-13T00:03:27.192391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:03:27.324548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반미용업 32
80.0%
네일미용업 3
 
7.5%
피부미용업 2
 
5.0%
종합미용업 1
 
2.5%
화장ㆍ분장 1
 
2.5%
미용업 1
 
2.5%
Distinct36
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-13T00:03:27.591128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length5.1891892
Min length4

Characters and Unicode

Total characters192
Distinct characters83
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)94.6%

Sample

1st row화본미용소
2nd row흥신미용소
3rd row현대미용실
4th row삼화미용소
5th row의흥미용실
ValueCountFrequency (%)
대구미용실 2
 
4.5%
헤어 2
 
4.5%
은미용실 1
 
2.3%
현대미용실 1
 
2.3%
헤어샵 1
 
2.3%
서현희헤어샵 1
 
2.3%
청미용실 1
 
2.3%
한밤미용실 1
 
2.3%
부산미용실 1
 
2.3%
빨간머리앤 1
 
2.3%
Other values (32) 32
72.7%
2023-12-13T00:03:28.003633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21
 
10.9%
18
 
9.4%
14
 
7.3%
12
 
6.2%
12
 
6.2%
7
 
3.6%
7
 
3.6%
4
 
2.1%
3
 
1.6%
3
 
1.6%
Other values (73) 91
47.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 176
91.7%
Space Separator 7
 
3.6%
Uppercase Letter 6
 
3.1%
Other Punctuation 3
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
 
11.9%
18
 
10.2%
14
 
8.0%
12
 
6.8%
12
 
6.8%
7
 
4.0%
4
 
2.3%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (64) 79
44.9%
Uppercase Letter
ValueCountFrequency (%)
H 1
16.7%
A 1
16.7%
S 1
16.7%
K 1
16.7%
I 1
16.7%
N 1
16.7%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
# 1
33.3%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 176
91.7%
Common 10
 
5.2%
Latin 6
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
 
11.9%
18
 
10.2%
14
 
8.0%
12
 
6.8%
12
 
6.8%
7
 
4.0%
4
 
2.3%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (64) 79
44.9%
Latin
ValueCountFrequency (%)
H 1
16.7%
A 1
16.7%
S 1
16.7%
K 1
16.7%
I 1
16.7%
N 1
16.7%
Common
ValueCountFrequency (%)
7
70.0%
. 2
 
20.0%
# 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 176
91.7%
ASCII 16
 
8.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
21
 
11.9%
18
 
10.2%
14
 
8.0%
12
 
6.8%
12
 
6.8%
7
 
4.0%
4
 
2.3%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (64) 79
44.9%
ASCII
ValueCountFrequency (%)
7
43.8%
. 2
 
12.5%
H 1
 
6.2%
A 1
 
6.2%
S 1
 
6.2%
K 1
 
6.2%
I 1
 
6.2%
N 1
 
6.2%
# 1
 
6.2%
Distinct36
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-13T00:03:28.284710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length20.918919
Min length19

Characters and Unicode

Total characters774
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)94.6%

Sample

1st row경상북도 군위군 산성면 부흥로 437
2nd row경상북도 군위군 소보면 송원3길 27-1
3rd row경상북도 군위군 군위읍 중앙길 64
4th row경상북도 군위군 우보면 이화3길 1-1
5th row경상북도 군위군 의흥면 읍내길 59-1
ValueCountFrequency (%)
경상북도 37
20.0%
군위군 37
20.0%
군위읍 21
 
11.4%
중앙길 10
 
5.4%
의흥면 4
 
2.2%
우보면 3
 
1.6%
부계면 3
 
1.6%
효령면 3
 
1.6%
읍내길 3
 
1.6%
이화3길 3
 
1.6%
Other values (52) 61
33.0%
2023-12-13T00:03:28.694608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
148
19.1%
96
12.4%
58
 
7.5%
37
 
4.8%
37
 
4.8%
37
 
4.8%
1 37
 
4.8%
37
 
4.8%
32
 
4.1%
25
 
3.2%
Other values (41) 230
29.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 485
62.7%
Space Separator 148
 
19.1%
Decimal Number 122
 
15.8%
Dash Punctuation 19
 
2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
96
19.8%
58
12.0%
37
 
7.6%
37
 
7.6%
37
 
7.6%
37
 
7.6%
32
 
6.6%
25
 
5.2%
16
 
3.3%
16
 
3.3%
Other values (29) 94
19.4%
Decimal Number
ValueCountFrequency (%)
1 37
30.3%
7 13
 
10.7%
3 13
 
10.7%
2 13
 
10.7%
6 12
 
9.8%
4 10
 
8.2%
5 10
 
8.2%
8 6
 
4.9%
9 4
 
3.3%
0 4
 
3.3%
Space Separator
ValueCountFrequency (%)
148
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 485
62.7%
Common 289
37.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
96
19.8%
58
12.0%
37
 
7.6%
37
 
7.6%
37
 
7.6%
37
 
7.6%
32
 
6.6%
25
 
5.2%
16
 
3.3%
16
 
3.3%
Other values (29) 94
19.4%
Common
ValueCountFrequency (%)
148
51.2%
1 37
 
12.8%
- 19
 
6.6%
7 13
 
4.5%
3 13
 
4.5%
2 13
 
4.5%
6 12
 
4.2%
4 10
 
3.5%
5 10
 
3.5%
8 6
 
2.1%
Other values (2) 8
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 485
62.7%
ASCII 289
37.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
148
51.2%
1 37
 
12.8%
- 19
 
6.6%
7 13
 
4.5%
3 13
 
4.5%
2 13
 
4.5%
6 12
 
4.2%
4 10
 
3.5%
5 10
 
3.5%
8 6
 
2.1%
Other values (2) 8
 
2.8%
Hangul
ValueCountFrequency (%)
96
19.8%
58
12.0%
37
 
7.6%
37
 
7.6%
37
 
7.6%
37
 
7.6%
32
 
6.6%
25
 
5.2%
16
 
3.3%
16
 
3.3%
Other values (29) 94
19.4%
Distinct36
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-13T00:03:28.951371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23
Mean length22.594595
Min length20

Characters and Unicode

Total characters836
Distinct characters48
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)94.6%

Sample

1st row경상북도 군위군 산성면 화본리 1167
2nd row경상북도 군위군 소보면 송원리 672
3rd row경상북도 군위군 군위읍 서부리 286
4th row경상북도 군위군 우보면 이화리 1292-1
5th row경상북도 군위군 의흥면 읍내리 593
ValueCountFrequency (%)
경상북도 37
20.0%
군위군 37
20.0%
군위읍 21
 
11.4%
서부리 14
 
7.6%
동부리 6
 
3.2%
읍내리 4
 
2.2%
의흥면 4
 
2.2%
효령면 3
 
1.6%
이화리 3
 
1.6%
우보면 3
 
1.6%
Other values (46) 53
28.6%
2023-12-13T00:03:29.362634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
185
22.1%
96
 
11.5%
58
 
6.9%
37
 
4.4%
37
 
4.4%
37
 
4.4%
37
 
4.4%
37
 
4.4%
- 28
 
3.3%
1 26
 
3.1%
Other values (38) 258
30.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 480
57.4%
Space Separator 185
 
22.1%
Decimal Number 143
 
17.1%
Dash Punctuation 28
 
3.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
96
20.0%
58
12.1%
37
 
7.7%
37
 
7.7%
37
 
7.7%
37
 
7.7%
37
 
7.7%
25
 
5.2%
23
 
4.8%
16
 
3.3%
Other values (26) 77
16.0%
Decimal Number
ValueCountFrequency (%)
1 26
18.2%
2 20
14.0%
7 18
12.6%
6 15
10.5%
4 14
9.8%
5 13
9.1%
3 12
8.4%
0 11
7.7%
9 9
 
6.3%
8 5
 
3.5%
Space Separator
ValueCountFrequency (%)
185
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 480
57.4%
Common 356
42.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
96
20.0%
58
12.1%
37
 
7.7%
37
 
7.7%
37
 
7.7%
37
 
7.7%
37
 
7.7%
25
 
5.2%
23
 
4.8%
16
 
3.3%
Other values (26) 77
16.0%
Common
ValueCountFrequency (%)
185
52.0%
- 28
 
7.9%
1 26
 
7.3%
2 20
 
5.6%
7 18
 
5.1%
6 15
 
4.2%
4 14
 
3.9%
5 13
 
3.7%
3 12
 
3.4%
0 11
 
3.1%
Other values (2) 14
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 480
57.4%
ASCII 356
42.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
185
52.0%
- 28
 
7.9%
1 26
 
7.3%
2 20
 
5.6%
7 18
 
5.1%
6 15
 
4.2%
4 14
 
3.9%
5 13
 
3.7%
3 12
 
3.4%
0 11
 
3.1%
Other values (2) 14
 
3.9%
Hangul
ValueCountFrequency (%)
96
20.0%
58
12.1%
37
 
7.7%
37
 
7.7%
37
 
7.7%
37
 
7.7%
37
 
7.7%
25
 
5.2%
23
 
4.8%
16
 
3.3%
Other values (26) 77
16.0%

전화번호
Text

MISSING 

Distinct30
Distinct (%)100.0%
Missing7
Missing (%)18.9%
Memory size428.0 B
2023-12-13T00:03:29.622315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters360
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row054-382-3259
2nd row054-382-4351
3rd row054-382-0417
4th row054-382-5800
5th row054-382-7055
ValueCountFrequency (%)
054-382-4351 1
 
3.3%
054-382-0417 1
 
3.3%
054-605-2400 1
 
3.3%
054-382-2238 1
 
3.3%
054-383-4224 1
 
3.3%
054-382-2406 1
 
3.3%
054-382-1311 1
 
3.3%
054-383-3556 1
 
3.3%
054-383-8810 1
 
3.3%
054-382-9888 1
 
3.3%
Other values (20) 20
66.7%
2023-12-13T00:03:30.360020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 60
16.7%
3 55
15.3%
5 45
12.5%
8 45
12.5%
0 43
11.9%
4 39
10.8%
2 31
8.6%
6 12
 
3.3%
7 11
 
3.1%
1 10
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 300
83.3%
Dash Punctuation 60
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 55
18.3%
5 45
15.0%
8 45
15.0%
0 43
14.3%
4 39
13.0%
2 31
10.3%
6 12
 
4.0%
7 11
 
3.7%
1 10
 
3.3%
9 9
 
3.0%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 360
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 60
16.7%
3 55
15.3%
5 45
12.5%
8 45
12.5%
0 43
11.9%
4 39
10.8%
2 31
8.6%
6 12
 
3.3%
7 11
 
3.1%
1 10
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 360
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 60
16.7%
3 55
15.3%
5 45
12.5%
8 45
12.5%
0 43
11.9%
4 39
10.8%
2 31
8.6%
6 12
 
3.3%
7 11
 
3.1%
1 10
 
2.8%

Correlations

2023-12-13T00:03:30.463723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명업소명업소소재지(도로명)업소소재지(지번)전화번호
업종명1.0001.0000.2970.2971.000
업소명1.0001.0000.9940.9941.000
업소소재지(도로명)0.2970.9941.0001.0001.000
업소소재지(지번)0.2970.9941.0001.0001.000
전화번호1.0001.0001.0001.0001.000

Missing values

2023-12-13T00:03:26.974009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:03:27.069893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명업소소재지(도로명)업소소재지(지번)전화번호
0일반미용업화본미용소경상북도 군위군 산성면 부흥로 437경상북도 군위군 산성면 화본리 1167054-382-3259
1일반미용업흥신미용소경상북도 군위군 소보면 송원3길 27-1경상북도 군위군 소보면 송원리 672054-382-4351
2일반미용업현대미용실경상북도 군위군 군위읍 중앙길 64경상북도 군위군 군위읍 서부리 286054-382-0417
3일반미용업삼화미용소경상북도 군위군 우보면 이화3길 1-1경상북도 군위군 우보면 이화리 1292-1054-382-5800
4일반미용업의흥미용실경상북도 군위군 의흥면 읍내길 59-1경상북도 군위군 의흥면 읍내리 593054-382-7055
5일반미용업박현주미용실경상북도 군위군 군위읍 중앙4길 16-6경상북도 군위군 군위읍 서부리 427-2054-383-6777
6일반미용업우아미미용실경상북도 군위군 군위읍 중앙길 112-2경상북도 군위군 군위읍 서부리 26-11054-382-5158
7일반미용업그린미용실경상북도 군위군 효령면 중구2길 6-11경상북도 군위군 효령면 중구리 169-4054-382-8527
8일반미용업은미용실경상북도 군위군 군위읍 중앙길 118경상북도 군위군 군위읍 동부리 396-10054-383-7980
9일반미용업계절의 여왕경상북도 군위군 의흥면 읍내길 55-1경상북도 군위군 의흥면 읍내리 591-27054-382-6658
업종명업소명업소소재지(도로명)업소소재지(지번)전화번호
27일반미용업헤어다빈샵경상북도 군위군 군위읍 동서1길 17-6경상북도 군위군 군위읍 서부리 27-14054-383-4224
28일반미용업미애헤어경상북도 군위군 부계면 창평1길 27경상북도 군위군 부계면 창평리 1025-4<NA>
29일반미용업은H.헤어샵경상북도 군위군 군위읍 동서5길 10경상북도 군위군 군위읍 동부리 405-1<NA>
30일반미용업로뎀헤어경상북도 군위군 군위읍 동서1길 24-1경상북도 군위군 군위읍 서부리 28-3<NA>
31피부미용업다담다 피부관리샵경상북도 군위군 군위읍 중앙6길 4경상북도 군위군 군위읍 서부리 17-13054-382-2238
32피부미용업A.SKIN #경상북도 군위군 군위읍 동서5길 10경상북도 군위군 군위읍 동부리 405-1<NA>
33종합미용업블리스 헤어경상북도 군위군 군위읍 중앙4길 20경상북도 군위군 군위읍 서부리 420-2054-605-2400
34네일미용업네일우디경상북도 군위군 군위읍 동서6길 28-14경상북도 군위군 군위읍 동부리 575<NA>
35일반미용업, 네일미용업도화뜨락경상북도 군위군 군위읍 중앙길 138-16경상북도 군위군 군위읍 동부리 325-6054-383-3358
36네일미용업, 화장ㆍ분장 미용업네일 숲경상북도 군위군 군위읍 중앙6길 13-57경상북도 군위군 군위읍 서부리 493-2<NA>