Overview

Dataset statistics

Number of variables: 4
Number of observations: 299
Missing cells: 21
Missing cells (%): 1.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.5 KiB
Average record size in memory: 32.4 B
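
The overview percentage and the per-variable missing percentage in this report use different denominators: the former divides by every cell in the table, the latter by the rows of one column. A minimal sketch of that arithmetic, using only the counts reported above (variable names are illustrative, and placing all 21 missing cells in one column is an assumption based on the Alerts section):

```python
# Counts copied from the dataset statistics above.
n_variables = 4
n_observations = 299
missing_cells = 21

# Overview figure: missing cells relative to every cell in the table.
total_cells = n_variables * n_observations
missing_cells_pct = 100 * missing_cells / total_cells

# Per-column figure: the same 21 cells relative to the rows of one column,
# assuming all of them fall in a single column.
missing_column_pct = 100 * missing_cells / n_observations

print(f"{total_cells} cells, {missing_cells_pct:.1f}% missing overall")
print(f"{missing_column_pct:.1f}% missing within the affected column")
```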

Variable types

Categorical: 1
Text: 3

Dataset

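Description: Lodging business information registered in Gumi-si, Gyeongsangbuk-do. The data provides the business type, business name, address, and phone number of each lodging establishment.
Author: Gumi-si, Gyeongsangbuk-do (경상북도 구미시)
URL: https://www.data.go.kr/data/3071381/fileData.do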

Alerts

업종명 is highly imbalanced (64.3%) [Imbalance]
소재지전화 has 21 (7.0%) missing values [Missing]
업소명 has unique values [Unique]
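
The 64.3% figure is consistent with an entropy-based imbalance score, 1 - H / log2(k), where H is the Shannon entropy of the class frequencies and k is the number of classes. The sketch below recomputes it from the category counts reported for 업종명 under Common Values in this report. The function name is illustrative, and the formula is an assumption checked against the reported value rather than something the report states.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/log2(k): 0 for a uniform distribution, approaching 1 as one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k <= 1:
        return 0.0
    # Shannon entropy in bits of the observed class frequencies.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Counts for the six 업종명 categories, as listed under Common Values.
counts = [255, 16, 9, 8, 7, 4]
score = imbalance_score(counts)
print(f"{100 * score:.1f}%")
```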

Reproduction

Analysis started: 2023-12-12 06:58:29.154691
Analysis finished: 2023-12-12 06:58:29.569156
Duration: 0.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종명
Categorical

IMBALANCE 

Distinct: 6
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Length

Max length: 7
Median length: 3
Mean length: 3.2842809
Min length: 3
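
These length statistics can be recomputed from the category counts alone, since every observation takes one of six values. A minimal sketch, assuming length means the number of Unicode code points (which is what Python's len() returns for a str):

```python
from statistics import mean, median

# Category counts as listed under Common Values in this report.
counts = {"여관업": 255, "일반호텔": 16, "숙박업(생활)": 9,
          "관광호텔": 8, "숙박업 기타": 7, "여인숙업": 4}

# One length per observation; each Hangul syllable, space, and
# parenthesis counts as a single character.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

print(max(lengths), median(lengths), round(mean(lengths), 7), min(lengths))
```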

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 여관업
2nd row: 여관업
3rd row: 숙박업(생활)
4th row: 여관업
5th row: 일반호텔

Common Values

Value | Count | Frequency (%)
여관업 | 255 | 85.3%
일반호텔 | 16 | 5.4%
숙박업(생활) | 9 | 3.0%
관광호텔 | 8 | 2.7%
숙박업 기타 | 7 | 2.3%
여인숙업 | 4 | 1.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
여관업 | 255 | 83.3%
일반호텔 | 16 | 5.2%
숙박업(생활 | 9 | 2.9%
관광호텔 | 8 | 2.6%
숙박업 | 7 | 2.3%
기타 | 7 | 2.3%
여인숙업 | 4 | 1.3%
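
The counts in this table sum to 306 rather than 299 and include the truncated token 숙박업(생활, which suggests the values were split on whitespace and stripped of leading and trailing punctuation before counting. The following sketch of that assumed procedure (not confirmed by the report) reproduces the figures:

```python
import string
from collections import Counter

# Category counts as listed under Common Values in this report.
counts = {"여관업": 255, "일반호텔": 16, "숙박업(생활)": 9,
          "관광호텔": 8, "숙박업 기타": 7, "여인숙업": 4}

words = Counter()
for value, n in counts.items():
    for token in value.split():
        # Strip ASCII punctuation from both ends only; inner "(" survives.
        words[token.strip(string.punctuation)] += n

total = sum(words.values())
for word, n in words.most_common():
    print(f"{word} | {n} | {100 * n / total:.1f}%")
```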

업소명
Text

UNIQUE 

Distinct: 299
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Length

Max length: 14
Median length: 13
Mean length: 4.8160535
Min length: 1

Characters and Unicode

Total characters: 1440
Distinct characters: 290
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
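
As an illustration of those character properties, Python's standard unicodedata module exposes the general category of each code point. A minimal sketch, using two values that appear in the Sample sections of this report (the helper name is illustrative):

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of text by Unicode general category (e.g. 'Lo', 'Nd')."""
    return Counter(unicodedata.category(ch) for ch in text)

# Hangul syllables fall under 'Lo' (Other Letter).
print(category_counts("포유모텔"))
# Digits are 'Nd' (Decimal Number); the hyphen-minus is 'Pd' (Dash Punctuation).
print(category_counts("054-456-2240"))
```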

Unique

Unique: 299
Unique (%): 100.0%

Sample

1st row: 에이치에비뉴
2nd row: 포유모텔
3rd row: 태양궁
4th row: 힐탑모텔
5th row: 라마다바이윈덤구미호텔
Value | Count | Frequency (%)
에이치에비뉴 | 1 | 0.3%
로망스모텔 | 1 | 0.3%
힐파크모텔 | 1 | 0.3%
호텔칸 | 1 | 0.3%
에스모텔 | 1 | 0.3%
힐사이드장 | 1 | 0.3%
금계모텔 | 1 | 0.3%
삼정여인숙 | 1 | 0.3%
브이투모텔 | 1 | 0.3%
에덴빌 | 1 | 0.3%
Other values (290) | 290 | 96.7%

Most occurring characters

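Value | Count | Frequency (%)
(not recovered) | 214 | 14.9%
(not recovered) | 157 | 10.9%
(not recovered) | 61 | 4.2%
(not recovered) | 51 | 3.5%
(not recovered) | 44 | 3.1%
(not recovered) | 35 | 2.4%
(not recovered) | 22 | 1.5%
(not recovered) | 21 | 1.5%
(not recovered) | 20 | 1.4%
(not recovered) | 19 | 1.3%
Other values (280) | 796 | 55.3%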

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1418 | 98.5%
Decimal Number | 10 | 0.7%
Uppercase Letter | 4 | 0.3%
Open Punctuation | 3 | 0.2%
Close Punctuation | 3 | 0.2%
Space Separator | 1 | 0.1%
Lowercase Letter | 1 | 0.1%

Most frequent character per category

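Other Letter
Value | Count | Frequency (%)
(not recovered) | 214 | 15.1%
(not recovered) | 157 | 11.1%
(not recovered) | 61 | 4.3%
(not recovered) | 51 | 3.6%
(not recovered) | 44 | 3.1%
(not recovered) | 35 | 2.5%
(not recovered) | 22 | 1.6%
(not recovered) | 21 | 1.5%
(not recovered) | 20 | 1.4%
(not recovered) | 19 | 1.3%
Other values (268) | 774 | 54.6%

Decimal Number
Value | Count | Frequency (%)
2 | 4 | 40.0%
9 | 3 | 30.0%
5 | 2 | 20.0%
1 | 1 | 10.0%

Uppercase Letter
Value | Count | Frequency (%)
Q | 1 | 25.0%
N | 1 | 25.0%
U | 1 | 25.0%
F | 1 | 25.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 1 | 100.0%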

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1418 | 98.5%
Common | 17 | 1.2%
Latin | 5 | 0.3%

Most frequent character per script

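Hangul
Value | Count | Frequency (%)
(not recovered) | 214 | 15.1%
(not recovered) | 157 | 11.1%
(not recovered) | 61 | 4.3%
(not recovered) | 51 | 3.6%
(not recovered) | 44 | 3.1%
(not recovered) | 35 | 2.5%
(not recovered) | 22 | 1.6%
(not recovered) | 21 | 1.5%
(not recovered) | 20 | 1.4%
(not recovered) | 19 | 1.3%
Other values (268) | 774 | 54.6%

Common
Value | Count | Frequency (%)
2 | 4 | 23.5%
9 | 3 | 17.6%
( | 3 | 17.6%
) | 3 | 17.6%
5 | 2 | 11.8%
1 | 1 | 5.9%
(space) | 1 | 5.9%

Latin
Value | Count | Frequency (%)
Q | 1 | 20.0%
e | 1 | 20.0%
N | 1 | 20.0%
U | 1 | 20.0%
F | 1 | 20.0%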

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1418 | 98.5%
ASCII | 22 | 1.5%

Most frequent character per block

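Hangul
Value | Count | Frequency (%)
(not recovered) | 214 | 15.1%
(not recovered) | 157 | 11.1%
(not recovered) | 61 | 4.3%
(not recovered) | 51 | 3.6%
(not recovered) | 44 | 3.1%
(not recovered) | 35 | 2.5%
(not recovered) | 22 | 1.6%
(not recovered) | 21 | 1.5%
(not recovered) | 20 | 1.4%
(not recovered) | 19 | 1.3%
Other values (268) | 774 | 54.6%

ASCII
Value | Count | Frequency (%)
2 | 4 | 18.2%
9 | 3 | 13.6%
( | 3 | 13.6%
) | 3 | 13.6%
5 | 2 | 9.1%
1 | 1 | 4.5%
(space) | 1 | 4.5%
Q | 1 | 4.5%
e | 1 | 4.5%
N | 1 | 4.5%
Other values (2) | 2 | 9.1%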

업소소재지(도로명)
Text

Distinct: 298
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Length

Max length: 38
Median length: 36
Mean length: 25.618729
Min length: 20

Characters and Unicode

Total characters: 7660
Distinct characters: 91
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 297
Unique (%): 99.3%

Sample

1st row: 경상북도 구미시 산호대로29길 13-19, 4,5,6층 (옥계동)
2nd row: 경상북도 구미시 송원서로6길 58 (원평동)
3rd row: 경상북도 구미시 산업로28길 17 (원평동)
4th row: 경상북도 구미시 구미중앙로33길 14, 1, 2, 3층 (원평동)
5th row: 경상북도 구미시 인동중앙로3길 41, 라마다 구미 호텔 (황상동)
Value | Count | Frequency (%)
경상북도 | 299 | 19.7%
구미시 | 299 | 19.7%
원평동 | 154 | 10.1%
황상동 | 28 | 1.8%
송원서로6길 | 25 | 1.6%
옥계동 | 20 | 1.3%
인동중앙로3길 | 15 | 1.0%
인의동 | 13 | 0.9%
봉곡동 | 11 | 0.7%
송원서로8길 | 11 | 0.7%
Other values (331) | 645 | 42.4%

Most occurring characters

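Value | Count | Frequency (%)
(space) | 1221 | 15.9%
(not recovered) | 364 | 4.8%
(not recovered) | 362 | 4.7%
(not recovered) | 342 | 4.5%
(not recovered) | 330 | 4.3%
(not recovered) | 330 | 4.3%
(not recovered) | 302 | 3.9%
(not recovered) | 301 | 3.9%
(not recovered) | 299 | 3.9%
) | 289 | 3.8%
Other values (81) | 3520 | 46.0%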

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4515 | 58.9%
Space Separator | 1221 | 15.9%
Decimal Number | 1143 | 14.9%
Close Punctuation | 289 | 3.8%
Open Punctuation | 289 | 3.8%
Dash Punctuation | 140 | 1.8%
Other Punctuation | 60 | 0.8%
Math Symbol | 3 | < 0.1%

Most frequent character per category

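Other Letter
Value | Count | Frequency (%)
(not recovered) | 364 | 8.1%
(not recovered) | 362 | 8.0%
(not recovered) | 342 | 7.6%
(not recovered) | 330 | 7.3%
(not recovered) | 330 | 7.3%
(not recovered) | 302 | 6.7%
(not recovered) | 301 | 6.7%
(not recovered) | 299 | 6.6%
(not recovered) | 280 | 6.2%
(not recovered) | 230 | 5.1%
Other values (65) | 1375 | 30.5%

Decimal Number
Value | Count | Frequency (%)
1 | 244 | 21.3%
3 | 180 | 15.7%
2 | 172 | 15.0%
4 | 101 | 8.8%
5 | 98 | 8.6%
6 | 90 | 7.9%
9 | 76 | 6.6%
7 | 70 | 6.1%
8 | 62 | 5.4%
0 | 50 | 4.4%

Space Separator
Value | Count | Frequency (%)
(space) | 1221 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 289 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 289 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 140 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 60 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 3 | 100.0%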

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4515 | 58.9%
Common | 3145 | 41.1%

Most frequent character per script

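Hangul
Value | Count | Frequency (%)
(not recovered) | 364 | 8.1%
(not recovered) | 362 | 8.0%
(not recovered) | 342 | 7.6%
(not recovered) | 330 | 7.3%
(not recovered) | 330 | 7.3%
(not recovered) | 302 | 6.7%
(not recovered) | 301 | 6.7%
(not recovered) | 299 | 6.6%
(not recovered) | 280 | 6.2%
(not recovered) | 230 | 5.1%
Other values (65) | 1375 | 30.5%

Common
Value | Count | Frequency (%)
(space) | 1221 | 38.8%
) | 289 | 9.2%
( | 289 | 9.2%
1 | 244 | 7.8%
3 | 180 | 5.7%
2 | 172 | 5.5%
- | 140 | 4.5%
4 | 101 | 3.2%
5 | 98 | 3.1%
6 | 90 | 2.9%
Other values (6) | 321 | 10.2%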

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4515 | 58.9%
ASCII | 3145 | 41.1%

Most frequent character per block

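ASCII
Value | Count | Frequency (%)
(space) | 1221 | 38.8%
) | 289 | 9.2%
( | 289 | 9.2%
1 | 244 | 7.8%
3 | 180 | 5.7%
2 | 172 | 5.5%
- | 140 | 4.5%
4 | 101 | 3.2%
5 | 98 | 3.1%
6 | 90 | 2.9%
Other values (6) | 321 | 10.2%

Hangul
Value | Count | Frequency (%)
(not recovered) | 364 | 8.1%
(not recovered) | 362 | 8.0%
(not recovered) | 342 | 7.6%
(not recovered) | 330 | 7.3%
(not recovered) | 330 | 7.3%
(not recovered) | 302 | 6.7%
(not recovered) | 301 | 6.7%
(not recovered) | 299 | 6.6%
(not recovered) | 280 | 6.2%
(not recovered) | 230 | 5.1%
Other values (65) | 1375 | 30.5%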

소재지전화
Text

MISSING 

Distinct: 277
Distinct (%): 99.6%
Missing: 21
Missing (%): 7.0%
Memory size: 2.5 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12
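
Every non-missing value is exactly 12 characters long, which is what a fixed NNN-NNN-NNNN layout would produce. The sketch below checks that layout against the five sample values shown in this report. The regular expression is an assumption inferred from those samples, not a format documented by the data provider.

```python
import re

# Assumed layout: three digits, hyphen, three digits, hyphen, four digits.
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")

# Sample values listed for 소재지전화 in this report.
samples = ["054-456-2240", "054-479-9000", "054-472-1460",
           "054-716-0200", "054-475-6900"]

# Collect any value that does not match the whole pattern.
invalid = [s for s in samples if PHONE_PATTERN.fullmatch(s) is None]
print(invalid)
```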

Characters and Unicode

Total characters: 3336
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 276
Unique (%): 99.3%
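
Distinct and Unique differ for this variable because one phone number (054-451-0940) occurs twice: Distinct counts the different values present, while Unique counts the values that occur exactly once. Both percentages appear to be taken over the 278 non-missing rows, since 277/278 and 276/278 match the reported figures. A small sketch on a subset of values from this report, with None standing in for a missing entry:

```python
from collections import Counter

# A few values from this report; the first one is the duplicated number.
values = ["054-451-0940", "054-451-0940", "054-478-0100", "054-458-4568", None]

present = [v for v in values if v is not None]
freq = Counter(present)

n_distinct = len(freq)                                 # different values
n_unique = sum(1 for c in freq.values() if c == 1)     # values seen exactly once
print(n_distinct, n_unique, len(present))

# The same ratios applied to the reported counts.
print(f"{100 * 277 / 278:.1f}% distinct, {100 * 276 / 278:.1f}% unique")
```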

Sample

1st row: 054-456-2240
2nd row: 054-479-9000
3rd row: 054-472-1460
4th row: 054-716-0200
5th row: 054-475-6900
Value | Count | Frequency (%)
054-451-0940 | 2 | 0.7%
054-478-0100 | 1 | 0.4%
054-458-4568 | 1 | 0.4%
054-441-0725 | 1 | 0.4%
054-452-9451 | 1 | 0.4%
054-453-4491 | 1 | 0.4%
054-441-2604 | 1 | 0.4%
054-465-3696 | 1 | 0.4%
054-455-6611 | 1 | 0.4%
054-473-0505 | 1 | 0.4%
Other values (267) | 267 | 96.0%

Most occurring characters

Value | Count | Frequency (%)
4 | 720 | 21.6%
- | 556 | 16.7%
5 | 542 | 16.2%
0 | 512 | 15.3%
1 | 192 | 5.8%
7 | 170 | 5.1%
2 | 147 | 4.4%
6 | 145 | 4.3%
8 | 136 | 4.1%
3 | 128 | 3.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2780 | 83.3%
Dash Punctuation | 556 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
4 | 720 | 25.9%
5 | 542 | 19.5%
0 | 512 | 18.4%
1 | 192 | 6.9%
7 | 170 | 6.1%
2 | 147 | 5.3%
6 | 145 | 5.2%
8 | 136 | 4.9%
3 | 128 | 4.6%
9 | 88 | 3.2%

Dash Punctuation
Value | Count | Frequency (%)
- | 556 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 3336 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
4 | 720 | 21.6%
- | 556 | 16.7%
5 | 542 | 16.2%
0 | 512 | 15.3%
1 | 192 | 5.8%
7 | 170 | 5.1%
2 | 147 | 4.4%
6 | 145 | 4.3%
8 | 136 | 4.1%
3 | 128 | 3.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 3336 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
4 | 720 | 21.6%
- | 556 | 16.7%
5 | 542 | 16.2%
0 | 512 | 15.3%
1 | 192 | 5.8%
7 | 170 | 5.1%
2 | 147 | 4.4%
6 | 145 | 4.3%
8 | 136 | 4.1%
3 | 128 | 3.8%

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]
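
The per-column nullity these plots summarize can be tallied directly. A minimal sketch over the first ten rows shown in the Sample section of this report, with None standing in for <NA> and the address column left out for brevity:

```python
columns = ["업종명", "업소명", "소재지전화"]

# Rows 0-9 of the Sample table; None marks a missing phone number.
rows = [
    ("여관업", "에이치에비뉴", None),
    ("여관업", "포유모텔", "054-456-2240"),
    ("숙박업(생활)", "태양궁", None),
    ("여관업", "힐탑모텔", None),
    ("일반호텔", "라마다바이윈덤구미호텔", "054-479-9000"),
    ("일반호텔", "호텔선용", "054-472-1460"),
    ("여관업", "호텔여기어때구미1호점", "054-716-0200"),
    ("여관업", "핑크모텔", None),
    ("여관업", "광명모텔", None),
    ("여관업", "까사루시도", "054-475-6900"),
]

# Count missing entries per column.
missing = {name: sum(row[i] is None for row in rows)
           for i, name in enumerate(columns)}
print(missing)
```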

Sample

Index | 업종명 | 업소명 | 업소소재지(도로명) | 소재지전화
0 | 여관업 | 에이치에비뉴 | 경상북도 구미시 산호대로29길 13-19, 4,5,6층 (옥계동) | <NA>
1 | 여관업 | 포유모텔 | 경상북도 구미시 송원서로6길 58 (원평동) | 054-456-2240
2 | 숙박업(생활) | 태양궁 | 경상북도 구미시 산업로28길 17 (원평동) | <NA>
3 | 여관업 | 힐탑모텔 | 경상북도 구미시 구미중앙로33길 14, 1, 2, 3층 (원평동) | <NA>
4 | 일반호텔 | 라마다바이윈덤구미호텔 | 경상북도 구미시 인동중앙로3길 41, 라마다 구미 호텔 (황상동) | 054-479-9000
5 | 일반호텔 | 호텔선용 | 경상북도 구미시 산동읍 신당1로 18, 1동 6,7층 601,701호 | 054-472-1460
6 | 여관업 | 호텔여기어때구미1호점 | 경상북도 구미시 구미중앙로33길 26 (원평동) | 054-716-0200
7 | 여관업 | 핑크모텔 | 경상북도 구미시 금오시장로6길 5-4, 2,3,4층 (원평동) | <NA>
8 | 여관업 | 광명모텔 | 경상북도 구미시 금오시장로9길 3, 2,3,4층 (원평동) | <NA>
9 | 여관업 | 까사루시도 | 경상북도 구미시 인동32길 13-8 (진평동) | 054-475-6900

Index | 업종명 | 업소명 | 업소소재지(도로명) | 소재지전화
289 | 여관업 | 호수장 | 경상북도 구미시 구미중앙로21길 16-1 (원평동) | 054-452-1220
290 | 여관업 | 스위스모텔 | 경상북도 구미시 송원서로6길 53-3 (원평동) | 054-458-2468
291 | 여관업 | 윈저모텔 | 경상북도 구미시 송정대로 82 (송정동) | 054-457-7000
292 | 여관업 | 이코노미호텔구미 | 경상북도 구미시 구미중앙로17길 6 (원평동) | 054-451-1131
293 | 여관업 | 홈스위트홈모텔 | 경상북도 구미시 1공단로 196 (공단동) | 054-461-5858
294 | 여관업 | 부산모텔 | 경상북도 구미시 1공단로 186-19 (공단동) | 054-463-8103
295 | 여관업 | 은하장 | 경상북도 구미시 구미중앙로5길 7 (원평동) | 054-452-1477
296 | 여관업 | 모텔썸투 | 경상북도 구미시 인동5길 10 (인의동,(지번확정)) | 054-473-4949
297 | 여관업 | 쉴레 | 경상북도 구미시 산업로22길 11 (원평동) | 054-455-2726
298 | 여관업 | 씨에프모텔 | 경상북도 구미시 3공단1로 312-24 (임수동) | 054-471-1994