Overview

Dataset statistics

Number of variables6
Number of observations148
Missing cells72
Missing cells (%)8.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.1 KiB
Average record size in memory48.9 B

Variable types

Categorical2
Text4

Dataset

Description업종명(이용업, 미용업), 업소명, 업소소재지(도로명), 업소소재지(지번), 소재지전화, 데이터기준일자 포함
Author경상북도 의성군
URLhttps://www.data.go.kr/data/15032130/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
업소소재지(도로명) has 30 (20.3%) missing valuesMissing
업소소재지(지번) has 4 (2.7%) missing valuesMissing
소재지전화 has 38 (25.7%) missing valuesMissing

Reproduction

Analysis started2023-12-12 19:41:12.362782
Analysis finished2023-12-12 19:41:13.019794
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
미용업
110 
이용업
38 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row이용업
2nd row이용업
3rd row이용업
4th row이용업
5th row이용업

Common Values

ValueCountFrequency (%)
미용업 110
74.3%
이용업 38
 
25.7%

Length

2023-12-13T04:41:13.105509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:41:13.246532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미용업 110
74.3%
이용업 38
 
25.7%
Distinct139
Distinct (%)93.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-13T04:41:13.573814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length3.7702703
Min length1

Characters and Unicode

Total characters558
Distinct characters182
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique130 ?
Unique (%)87.8%

Sample

1st row신성
2nd row서울
3rd row안평이용소
4th row최신
5th row동산
ValueCountFrequency (%)
은혜미용실 2
 
1.3%
정화미용실 2
 
1.3%
제일 2
 
1.3%
서울 2
 
1.3%
부산 2
 
1.3%
hair 2
 
1.3%
가위손 2
 
1.3%
2
 
1.3%
협동 2
 
1.3%
사곡 2
 
1.3%
Other values (139) 139
87.4%
2023-12-13T04:41:14.184798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
31
 
5.6%
30
 
5.4%
30
 
5.4%
27
 
4.8%
23
 
4.1%
17
 
3.0%
13
 
2.3%
11
 
2.0%
11
 
2.0%
10
 
1.8%
Other values (172) 355
63.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 523
93.7%
Space Separator 11
 
2.0%
Uppercase Letter 10
 
1.8%
Lowercase Letter 9
 
1.6%
Close Punctuation 2
 
0.4%
Open Punctuation 2
 
0.4%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
5.9%
30
 
5.7%
30
 
5.7%
27
 
5.2%
23
 
4.4%
17
 
3.3%
13
 
2.5%
11
 
2.1%
10
 
1.9%
8
 
1.5%
Other values (154) 323
61.8%
Uppercase Letter
ValueCountFrequency (%)
H 2
20.0%
A 2
20.0%
I 1
10.0%
L 1
10.0%
U 1
10.0%
N 1
10.0%
J 1
10.0%
R 1
10.0%
Lowercase Letter
ValueCountFrequency (%)
a 3
33.3%
i 2
22.2%
n 1
 
11.1%
h 1
 
11.1%
r 1
 
11.1%
l 1
 
11.1%
Space Separator
ValueCountFrequency (%)
11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 523
93.7%
Latin 19
 
3.4%
Common 16
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
5.9%
30
 
5.7%
30
 
5.7%
27
 
5.2%
23
 
4.4%
17
 
3.3%
13
 
2.5%
11
 
2.1%
10
 
1.9%
8
 
1.5%
Other values (154) 323
61.8%
Latin
ValueCountFrequency (%)
a 3
15.8%
H 2
10.5%
i 2
10.5%
A 2
10.5%
I 1
 
5.3%
L 1
 
5.3%
U 1
 
5.3%
N 1
 
5.3%
n 1
 
5.3%
J 1
 
5.3%
Other values (4) 4
21.1%
Common
ValueCountFrequency (%)
11
68.8%
) 2
 
12.5%
( 2
 
12.5%
. 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 523
93.7%
ASCII 35
 
6.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
31
 
5.9%
30
 
5.7%
30
 
5.7%
27
 
5.2%
23
 
4.4%
17
 
3.3%
13
 
2.5%
11
 
2.1%
10
 
1.9%
8
 
1.5%
Other values (154) 323
61.8%
ASCII
ValueCountFrequency (%)
11
31.4%
a 3
 
8.6%
H 2
 
5.7%
i 2
 
5.7%
) 2
 
5.7%
A 2
 
5.7%
( 2
 
5.7%
I 1
 
2.9%
L 1
 
2.9%
U 1
 
2.9%
Other values (8) 8
22.9%
Distinct116
Distinct (%)98.3%
Missing30
Missing (%)20.3%
Memory size1.3 KiB
2023-12-13T04:41:14.611221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length29
Mean length20.788136
Min length18

Characters and Unicode

Total characters2453
Distinct characters95
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique114 ?
Unique (%)96.6%

Sample

1st row경상북도 의성군 옥산면 입암1길 34
2nd row경상북도 의성군 금성면 탑리길 32-1
3rd row경상북도 의성군 안평면 봉호로 943
4th row경상북도 의성군 안계면 안계길 138
5th row경상북도 의성군 의성읍 군청길 61
ValueCountFrequency (%)
경상북도 118
19.8%
의성군 118
19.8%
의성읍 53
 
8.9%
안계면 25
 
4.2%
안계길 15
 
2.5%
중앙길 13
 
2.2%
군청길 10
 
1.7%
금성면 8
 
1.3%
봉양면 7
 
1.2%
안계시장길 6
 
1.0%
Other values (146) 222
37.3%
2023-12-13T04:41:15.174887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
477
19.4%
184
 
7.5%
173
 
7.1%
131
 
5.3%
128
 
5.2%
121
 
4.9%
119
 
4.9%
118
 
4.8%
107
 
4.4%
1 90
 
3.7%
Other values (85) 805
32.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1596
65.1%
Space Separator 477
 
19.4%
Decimal Number 337
 
13.7%
Dash Punctuation 38
 
1.5%
Other Punctuation 3
 
0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
184
11.5%
173
10.8%
131
 
8.2%
128
 
8.0%
121
 
7.6%
119
 
7.5%
118
 
7.4%
107
 
6.7%
65
 
4.1%
53
 
3.3%
Other values (70) 397
24.9%
Decimal Number
ValueCountFrequency (%)
1 90
26.7%
2 45
13.4%
3 37
11.0%
5 32
 
9.5%
4 32
 
9.5%
6 26
 
7.7%
7 26
 
7.7%
8 21
 
6.2%
9 16
 
4.7%
0 12
 
3.6%
Space Separator
ValueCountFrequency (%)
477
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1596
65.1%
Common 857
34.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
184
11.5%
173
10.8%
131
 
8.2%
128
 
8.0%
121
 
7.6%
119
 
7.5%
118
 
7.4%
107
 
6.7%
65
 
4.1%
53
 
3.3%
Other values (70) 397
24.9%
Common
ValueCountFrequency (%)
477
55.7%
1 90
 
10.5%
2 45
 
5.3%
- 38
 
4.4%
3 37
 
4.3%
5 32
 
3.7%
4 32
 
3.7%
6 26
 
3.0%
7 26
 
3.0%
8 21
 
2.5%
Other values (5) 33
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1596
65.1%
ASCII 857
34.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
477
55.7%
1 90
 
10.5%
2 45
 
5.3%
- 38
 
4.4%
3 37
 
4.3%
5 32
 
3.7%
4 32
 
3.7%
6 26
 
3.0%
7 26
 
3.0%
8 21
 
2.5%
Other values (5) 33
 
3.9%
Hangul
ValueCountFrequency (%)
184
11.5%
173
10.8%
131
 
8.2%
128
 
8.0%
121
 
7.6%
119
 
7.5%
118
 
7.4%
107
 
6.7%
65
 
4.1%
53
 
3.3%
Other values (70) 397
24.9%
Distinct141
Distinct (%)97.9%
Missing4
Missing (%)2.7%
Memory size1.3 KiB
2023-12-13T04:41:15.553070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length25
Mean length24.979167
Min length21

Characters and Unicode

Total characters3597
Distinct characters81
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique138 ?
Unique (%)95.8%

Sample

1st row경상북도 의성군 옥산면 입암리 1105-29번지
2nd row경상북도 의성군 금성면 대리리 19-12번지
3rd row경상북도 의성군 안평면 박곡리 902-7번지
4th row경상북도 의성군 안계면 용기리 468-10번지
5th row경상북도 의성군 의성읍 중리리 751-1번지
ValueCountFrequency (%)
경상북도 144
19.9%
의성군 144
19.9%
의성읍 59
 
8.2%
안계면 28
 
3.9%
용기리 27
 
3.7%
후죽리 25
 
3.5%
도동리 17
 
2.4%
금성면 13
 
1.8%
중리리 12
 
1.7%
대리리 11
 
1.5%
Other values (179) 243
33.6%
2023-12-13T04:41:16.574364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
722
20.1%
218
 
6.1%
204
 
5.7%
170
 
4.7%
163
 
4.5%
147
 
4.1%
146
 
4.1%
144
 
4.0%
144
 
4.0%
134
 
3.7%
Other values (71) 1405
39.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2142
59.5%
Space Separator 722
 
20.1%
Decimal Number 605
 
16.8%
Dash Punctuation 128
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
218
10.2%
204
 
9.5%
170
 
7.9%
163
 
7.6%
147
 
6.9%
146
 
6.8%
144
 
6.7%
144
 
6.7%
134
 
6.3%
130
 
6.1%
Other values (59) 542
25.3%
Decimal Number
ValueCountFrequency (%)
1 122
20.2%
4 67
11.1%
8 67
11.1%
2 66
10.9%
5 65
10.7%
6 57
9.4%
9 45
 
7.4%
3 40
 
6.6%
7 39
 
6.4%
0 37
 
6.1%
Space Separator
ValueCountFrequency (%)
722
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 128
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2142
59.5%
Common 1455
40.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
218
10.2%
204
 
9.5%
170
 
7.9%
163
 
7.6%
147
 
6.9%
146
 
6.8%
144
 
6.7%
144
 
6.7%
134
 
6.3%
130
 
6.1%
Other values (59) 542
25.3%
Common
ValueCountFrequency (%)
722
49.6%
- 128
 
8.8%
1 122
 
8.4%
4 67
 
4.6%
8 67
 
4.6%
2 66
 
4.5%
5 65
 
4.5%
6 57
 
3.9%
9 45
 
3.1%
3 40
 
2.7%
Other values (2) 76
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2142
59.5%
ASCII 1455
40.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
722
49.6%
- 128
 
8.8%
1 122
 
8.4%
4 67
 
4.6%
8 67
 
4.6%
2 66
 
4.5%
5 65
 
4.5%
6 57
 
3.9%
9 45
 
3.1%
3 40
 
2.7%
Other values (2) 76
 
5.2%
Hangul
ValueCountFrequency (%)
218
10.2%
204
 
9.5%
170
 
7.9%
163
 
7.6%
147
 
6.9%
146
 
6.8%
144
 
6.7%
144
 
6.7%
134
 
6.3%
130
 
6.1%
Other values (59) 542
25.3%

소재지전화
Text

MISSING 

Distinct110
Distinct (%)100.0%
Missing38
Missing (%)25.7%
Memory size1.3 KiB
2023-12-13T04:41:16.873111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length11.009091
Min length11

Characters and Unicode

Total characters1211
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique110 ?
Unique (%)100.0%

Sample

1st row054 8340999
2nd row054 8338565
3rd row054 8328411
4th row054 8326531
5th row054 833 9015
ValueCountFrequency (%)
054 108
48.9%
053 2
 
0.9%
8610448 1
 
0.5%
8611386 1
 
0.5%
8337358 1
 
0.5%
8618118 1
 
0.5%
8347877 1
 
0.5%
8340136 1
 
0.5%
8338916 1
 
0.5%
8623632 1
 
0.5%
Other values (103) 103
46.6%
2023-12-13T04:41:17.294970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 174
14.4%
0 170
14.0%
5 154
12.7%
8 147
12.1%
3 147
12.1%
111
9.2%
1 79
6.5%
2 78
6.4%
6 67
 
5.5%
7 47
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1100
90.8%
Space Separator 111
 
9.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 174
15.8%
0 170
15.5%
5 154
14.0%
8 147
13.4%
3 147
13.4%
1 79
7.2%
2 78
7.1%
6 67
 
6.1%
7 47
 
4.3%
9 37
 
3.4%
Space Separator
ValueCountFrequency (%)
111
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1211
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 174
14.4%
0 170
14.0%
5 154
12.7%
8 147
12.1%
3 147
12.1%
111
9.2%
1 79
6.5%
2 78
6.4%
6 67
 
5.5%
7 47
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1211
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 174
14.4%
0 170
14.0%
5 154
12.7%
8 147
12.1%
3 147
12.1%
111
9.2%
1 79
6.5%
2 78
6.4%
6 67
 
5.5%
7 47
 
3.9%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2021-07-31
148 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-07-31
2nd row2021-07-31
3rd row2021-07-31
4th row2021-07-31
5th row2021-07-31

Common Values

ValueCountFrequency (%)
2021-07-31 148
100.0%

Length

2023-12-13T04:41:17.474623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:41:17.609174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-07-31 148
100.0%

Missing values

2023-12-13T04:41:12.677274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:41:12.808382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T04:41:12.935162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업종명업소명업소소재지(도로명)업소소재지(지번)소재지전화데이터기준일자
0이용업신성경상북도 의성군 옥산면 입암1길 34경상북도 의성군 옥산면 입암리 1105-29번지<NA>2021-07-31
1이용업서울경상북도 의성군 금성면 탑리길 32-1경상북도 의성군 금성면 대리리 19-12번지<NA>2021-07-31
2이용업안평이용소경상북도 의성군 안평면 봉호로 943경상북도 의성군 안평면 박곡리 902-7번지054 83409992021-07-31
3이용업최신경상북도 의성군 안계면 안계길 138경상북도 의성군 안계면 용기리 468-10번지<NA>2021-07-31
4이용업동산경상북도 의성군 의성읍 군청길 61경상북도 의성군 의성읍 중리리 751-1번지054 83385652021-07-31
5이용업제일경상북도 의성군 안계면 용기3길 3경상북도 의성군 안계면 용기리 853번지<NA>2021-07-31
6이용업대구<NA>경상북도 의성군 봉양면 화전리 91번지<NA>2021-07-31
7이용업문화경상북도 의성군 금성면 탑리1길 6경상북도 의성군 금성면 대리리 16-1번지054 83284112021-07-31
8이용업봉농이용원경상북도 의성군 봉양면 도리원1길 57<NA><NA>2021-07-31
9이용업성인이용소경상북도 의성군 다인면 서릉길 58경상북도 의성군 다인면 서릉리 110-7번지<NA>2021-07-31
업종명업소명업소소재지(도로명)업소소재지(지번)소재지전화데이터기준일자
138미용업근애미용실경상북도 의성군 옥산면 입암1길 42-9경상북도 의성군 옥산면 입암리 1105-90번지054 83310332021-07-31
139미용업무지개<NA>경상북도 의성군 금성면 산운리 636번지054 83401052021-07-31
140미용업이례<NA>경상북도 의성군 금성면 산운리 636-26번지054 83404832021-07-31
141미용업빛나경상북도 의성군 금성면 탑리1길 1경상북도 의성군 금성면 대리리 17-4번지054 83408432021-07-31
142미용업라라코리아경상북도 의성군 봉양면 도리원3길 24경상북도 의성군 봉양면 화전리 522-11054 83470122021-07-31
143미용업우리미용실경상북도 의성군 옥산면 입암1길 39경상북도 의성군 옥산면 입암리 1105-34번지054 83377782021-07-31
144미용업손길헤어살롱경상북도 의성군 의성읍 군청길 32-1경상북도 의성군 의성읍 후죽리 554<NA>2021-07-31
145미용업지나헤어(J.a hair)경상북도 의성군 안계면 용기9길 11경상북도 의성군 안계면 용기리 824-12<NA>2021-07-31
146미용업H nail경상북도 의성군 의성읍 충효로 75 (의성청구제네스1단지)경상북도 의성군 의성읍 중리리 735-1 의성청구제네스1단지 103동 102호<NA>2021-07-31
147미용업루나헤어(LUNA HAIR)경상북도 의성군 의성읍 군청길 12-1경상북도 의성군 의성읍 후죽리 484-21<NA>2021-07-31