Overview

Dataset statistics

Number of variables5
Number of observations110
Missing cells8
Missing cells (%)1.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.6 KiB
Average record size in memory43.2 B

Variable types

Numeric2
Text3

Dataset

Description대구광역시 서구_숙박업소 현황_20240208
Author대구광역시 서구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=3081453&dataSetDetailId=3081453184a79f46e37f_201711281630&provdMethod=FILE

Alerts

소재지전화 has 8 (7.3%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-13 14:18:26.512340
Analysis finished2024-03-13 14:18:27.288329
Duration0.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct110
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean55.5
Minimum1
Maximum110
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-03-13T23:18:27.376629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.45
Q128.25
median55.5
Q382.75
95-th percentile104.55
Maximum110
Range109
Interquartile range (IQR)54.5

Descriptive statistics

Standard deviation31.898276
Coefficient of variation (CV)0.57474371
Kurtosis-1.2
Mean55.5
Median Absolute Deviation (MAD)27.5
Skewness0
Sum6105
Variance1017.5
MonotonicityStrictly increasing
2024-03-13T23:18:27.517436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
71 1
 
0.9%
82 1
 
0.9%
81 1
 
0.9%
80 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
77 1
 
0.9%
76 1
 
0.9%
75 1
 
0.9%
Other values (100) 100
90.9%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
110 1
0.9%
109 1
0.9%
108 1
0.9%
107 1
0.9%
106 1
0.9%
105 1
0.9%
104 1
0.9%
103 1
0.9%
102 1
0.9%
101 1
0.9%
Distinct109
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size1012.0 B
2024-03-13T23:18:27.815087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length4.9090909
Min length2

Characters and Unicode

Total characters540
Distinct characters178
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)98.2%

Sample

1st row제니스호텔
2nd row용성여인숙
3rd row솔로몬여관
4th row호텔센텀
5th row동백장
ValueCountFrequency (%)
여관 6
 
4.9%
모텔 4
 
3.3%
호텔센텀 2
 
1.6%
그린장 2
 
1.6%
워싱턴모텔 1
 
0.8%
뷰티모텔 1
 
0.8%
별장모텔 1
 
0.8%
세기장여관 1
 
0.8%
비룡장여관 1
 
0.8%
샤네모텔 1
 
0.8%
Other values (102) 102
83.6%
2024-03-13T23:18:28.314260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
62
 
11.5%
50
 
9.3%
40
 
7.4%
33
 
6.1%
25
 
4.6%
13
 
2.4%
13
 
2.4%
10
 
1.9%
8
 
1.5%
7
 
1.3%
Other values (168) 279
51.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 477
88.3%
Uppercase Letter 25
 
4.6%
Space Separator 13
 
2.4%
Lowercase Letter 7
 
1.3%
Open Punctuation 6
 
1.1%
Close Punctuation 6
 
1.1%
Decimal Number 6
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
13.0%
50
 
10.5%
40
 
8.4%
33
 
6.9%
25
 
5.2%
13
 
2.7%
10
 
2.1%
8
 
1.7%
7
 
1.5%
6
 
1.3%
Other values (140) 223
46.8%
Uppercase Letter
ValueCountFrequency (%)
T 4
16.0%
H 3
12.0%
O 3
12.0%
E 3
12.0%
L 3
12.0%
M 2
8.0%
Y 1
 
4.0%
A 1
 
4.0%
P 1
 
4.0%
U 1
 
4.0%
Other values (3) 3
12.0%
Lowercase Letter
ValueCountFrequency (%)
y 1
14.3%
k 1
14.3%
i 1
14.3%
t 1
14.3%
s 1
14.3%
n 1
14.3%
e 1
14.3%
Decimal Number
ValueCountFrequency (%)
2 2
33.3%
8 1
16.7%
5 1
16.7%
6 1
16.7%
3 1
16.7%
Space Separator
ValueCountFrequency (%)
13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 477
88.3%
Latin 32
 
5.9%
Common 31
 
5.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
13.0%
50
 
10.5%
40
 
8.4%
33
 
6.9%
25
 
5.2%
13
 
2.7%
10
 
2.1%
8
 
1.7%
7
 
1.5%
6
 
1.3%
Other values (140) 223
46.8%
Latin
ValueCountFrequency (%)
T 4
 
12.5%
H 3
 
9.4%
O 3
 
9.4%
E 3
 
9.4%
L 3
 
9.4%
M 2
 
6.2%
Y 1
 
3.1%
A 1
 
3.1%
P 1
 
3.1%
U 1
 
3.1%
Other values (10) 10
31.2%
Common
ValueCountFrequency (%)
13
41.9%
( 6
19.4%
) 6
19.4%
2 2
 
6.5%
8 1
 
3.2%
5 1
 
3.2%
6 1
 
3.2%
3 1
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 477
88.3%
ASCII 63
 
11.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
62
 
13.0%
50
 
10.5%
40
 
8.4%
33
 
6.9%
25
 
5.2%
13
 
2.7%
10
 
2.1%
8
 
1.7%
7
 
1.5%
6
 
1.3%
Other values (140) 223
46.8%
ASCII
ValueCountFrequency (%)
13
20.6%
( 6
 
9.5%
) 6
 
9.5%
T 4
 
6.3%
H 3
 
4.8%
O 3
 
4.8%
E 3
 
4.8%
L 3
 
4.8%
M 2
 
3.2%
2 2
 
3.2%
Other values (18) 18
28.6%
Distinct109
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size1012.0 B
2024-03-13T23:18:28.631004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length29
Mean length24.645455
Min length20

Characters and Unicode

Total characters2711
Distinct characters52
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)98.2%

Sample

1st row대구광역시 서구 국채보상로42길 17 (평리동)
2nd row대구광역시 서구 서대구로 360-7 (비산동)
3rd row대구광역시 서구 평리로62길 14 (내당동)
4th row대구광역시 서구 국채보상로42길 20 (평리동)
5th row대구광역시 서구 서대구로61길 10 (비산동)
ValueCountFrequency (%)
대구광역시 110
19.9%
서구 110
19.9%
비산동 48
 
8.7%
평리동 41
 
7.4%
서대구로 29
 
5.3%
내당동 15
 
2.7%
국채보상로46길 14
 
2.5%
국채보상로42길 10
 
1.8%
달서로 6
 
1.1%
달서천로41길 5
 
0.9%
Other values (124) 164
29.7%
2024-03-13T23:18:29.118374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
442
16.3%
263
 
9.7%
164
 
6.0%
156
 
5.8%
110
 
4.1%
110
 
4.1%
110
 
4.1%
110
 
4.1%
( 110
 
4.1%
110
 
4.1%
Other values (42) 1026
37.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1614
59.5%
Space Separator 442
 
16.3%
Decimal Number 397
 
14.6%
Open Punctuation 110
 
4.1%
Close Punctuation 110
 
4.1%
Dash Punctuation 35
 
1.3%
Other Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
263
16.3%
164
10.2%
156
9.7%
110
 
6.8%
110
 
6.8%
110
 
6.8%
110
 
6.8%
110
 
6.8%
56
 
3.5%
52
 
3.2%
Other values (27) 373
23.1%
Decimal Number
ValueCountFrequency (%)
1 67
16.9%
2 59
14.9%
4 53
13.4%
6 50
12.6%
3 46
11.6%
5 33
8.3%
7 30
7.6%
0 22
 
5.5%
8 21
 
5.3%
9 16
 
4.0%
Space Separator
ValueCountFrequency (%)
442
100.0%
Open Punctuation
ValueCountFrequency (%)
( 110
100.0%
Close Punctuation
ValueCountFrequency (%)
) 110
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 35
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1614
59.5%
Common 1097
40.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
263
16.3%
164
10.2%
156
9.7%
110
 
6.8%
110
 
6.8%
110
 
6.8%
110
 
6.8%
110
 
6.8%
56
 
3.5%
52
 
3.2%
Other values (27) 373
23.1%
Common
ValueCountFrequency (%)
442
40.3%
( 110
 
10.0%
) 110
 
10.0%
1 67
 
6.1%
2 59
 
5.4%
4 53
 
4.8%
6 50
 
4.6%
3 46
 
4.2%
- 35
 
3.2%
5 33
 
3.0%
Other values (5) 92
 
8.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1614
59.5%
ASCII 1097
40.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
442
40.3%
( 110
 
10.0%
) 110
 
10.0%
1 67
 
6.1%
2 59
 
5.4%
4 53
 
4.8%
6 50
 
4.6%
3 46
 
4.2%
- 35
 
3.2%
5 33
 
3.0%
Other values (5) 92
 
8.4%
Hangul
ValueCountFrequency (%)
263
16.3%
164
10.2%
156
9.7%
110
 
6.8%
110
 
6.8%
110
 
6.8%
110
 
6.8%
110
 
6.8%
56
 
3.5%
52
 
3.2%
Other values (27) 373
23.1%

소재지전화
Text

MISSING 

Distinct102
Distinct (%)100.0%
Missing8
Missing (%)7.3%
Memory size1012.0 B
2024-03-13T23:18:29.414175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1224
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique102 ?
Unique (%)100.0%

Sample

1st row053-571-3082
2nd row053-355-1828
3rd row053-567-1325
4th row053-358-4876
5th row053-355-2295
ValueCountFrequency (%)
053-527-3310 1
 
1.0%
053-355-6951 1
 
1.0%
053-557-3621 1
 
1.0%
053-213-1333 1
 
1.0%
053-556-3131 1
 
1.0%
053-561-3211 1
 
1.0%
053-562-7115 1
 
1.0%
053-359-1418 1
 
1.0%
053-353-9222 1
 
1.0%
053-551-0332 1
 
1.0%
Other values (92) 92
90.2%
2024-03-13T23:18:29.852501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 282
23.0%
- 204
16.7%
3 192
15.7%
0 157
12.8%
2 77
 
6.3%
1 67
 
5.5%
6 65
 
5.3%
7 64
 
5.2%
9 45
 
3.7%
8 38
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1020
83.3%
Dash Punctuation 204
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 282
27.6%
3 192
18.8%
0 157
15.4%
2 77
 
7.5%
1 67
 
6.6%
6 65
 
6.4%
7 64
 
6.3%
9 45
 
4.4%
8 38
 
3.7%
4 33
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 204
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1224
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 282
23.0%
- 204
16.7%
3 192
15.7%
0 157
12.8%
2 77
 
6.3%
1 67
 
5.5%
6 65
 
5.3%
7 64
 
5.2%
9 45
 
3.7%
8 38
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1224
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 282
23.0%
- 204
16.7%
3 192
15.7%
0 157
12.8%
2 77
 
6.3%
1 67
 
5.5%
6 65
 
5.3%
7 64
 
5.2%
9 45
 
3.7%
8 38
 
3.1%

객실수
Real number (ℝ)

Distinct36
Distinct (%)32.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.581818
Minimum5
Maximum50
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-03-13T23:18:30.007133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile7.45
Q112
median18.5
Q325
95-th percentile36.55
Maximum50
Range45
Interquartile range (IQR)13

Descriptive statistics

Standard deviation9.2670689
Coefficient of variation (CV)0.47324864
Kurtosis0.35617933
Mean19.581818
Median Absolute Deviation (MAD)6.5
Skewness0.81028356
Sum2154
Variance85.878565
MonotonicityNot monotonic
2024-03-13T23:18:30.177544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
10 9
 
8.2%
19 9
 
8.2%
20 6
 
5.5%
16 6
 
5.5%
17 6
 
5.5%
12 6
 
5.5%
11 5
 
4.5%
32 4
 
3.6%
13 4
 
3.6%
25 4
 
3.6%
Other values (26) 51
46.4%
ValueCountFrequency (%)
5 1
 
0.9%
6 3
 
2.7%
7 2
 
1.8%
8 2
 
1.8%
9 2
 
1.8%
10 9
8.2%
11 5
4.5%
12 6
5.5%
13 4
3.6%
14 3
 
2.7%
ValueCountFrequency (%)
50 1
 
0.9%
44 1
 
0.9%
40 3
2.7%
37 1
 
0.9%
36 1
 
0.9%
35 1
 
0.9%
34 1
 
0.9%
33 1
 
0.9%
32 4
3.6%
31 2
1.8%

Interactions

2024-03-13T23:18:26.931792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T23:18:26.722572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T23:18:27.026357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T23:18:26.821190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-13T23:18:30.283810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번객실수
연번1.0000.361
객실수0.3611.000
2024-03-13T23:18:30.368862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번객실수
연번1.0000.097
객실수0.0971.000

Missing values

2024-03-13T23:18:27.149116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T23:18:27.253103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명영업소 주소(도로명)소재지전화객실수
01제니스호텔대구광역시 서구 국채보상로42길 17 (평리동)053-571-308226
12용성여인숙대구광역시 서구 서대구로 360-7 (비산동)053-355-18286
23솔로몬여관대구광역시 서구 평리로62길 14 (내당동)053-567-132515
34호텔센텀대구광역시 서구 국채보상로42길 20 (평리동)<NA>20
45동백장대구광역시 서구 서대구로61길 10 (비산동)053-358-487611
56삼화여인숙대구광역시 서구 달서천로41길 9 (비산동)053-355-229510
67션모텔대구광역시 서구 서대구로 96 (평리동)053-557-002119
78동원모텔대구광역시 서구 서대구로 322 (비산동)053-352-922520
89하와이모텔대구광역시 서구 서대구로 356 (비산동)053-341-886420
910기키모텔대구광역시 서구 국채보상로42길 41 (평리동)053-557-669921
연번업소명영업소 주소(도로명)소재지전화객실수
100101해피모텔대구광역시 서구 평리로 321 (평리동)053-562-497522
101102키스미모텔대구광역시 서구 국채보상로42길 19 (평리동)053-522-013121
102103실비여인숙대구광역시 서구 서대구로 334-3 (비산동)<NA>6
103104모텔제이대구광역시 서구 국채보상로42길 4 (평리동)053-557-171731
104105오투(O2)대구광역시 서구 국채보상로42길 8 (평리동)053-523-991844
105106호텔오늘대구광역시 서구 서대구로63안길 13, 2,3층 (비산동)053-355-444524
106107이사벨호텔대구광역시 서구 서대구로63안길 15, 2층 (비산동)053-341-112310
107108가나다모텔대구광역시 서구 팔달로 96 (비산동)053-359-202040
108109꿈의궁전장대구광역시 서구 서대구로 57-20 (내당동)<NA>14
109110장미장여관대구광역시 서구 서대구로11길 3 (내당동)<NA>13