Overview

Dataset statistics

Number of variables: 5
Number of observations: 297
Missing cells: 76
Missing cells (%): 5.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.0 KiB
Average record size in memory: 41.4 B
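
The percentage and per-record figures above follow from the raw counts. A minimal sketch of that arithmetic, assuming the reported 12.0 KiB is itself a rounded value (so the per-record result only approximates the report's figure):

```python
# Recompute the derived overview figures from the raw counts in the report.
n_variables = 5
n_observations = 297
missing_cells = 76
total_size_kib = 12.0  # rounded value as displayed in the report

total_cells = n_variables * n_observations        # every cell in the table
missing_pct = 100 * missing_cells / total_cells   # share of empty cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Note that the dataset-level missing share (76 of 1485 cells) is much smaller than the column-level share for 소재지전화 (76 of 297 rows), because all missing cells sit in that one column.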

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

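Description: 대구광역시 동구_유흥단란주점현황_20230515 (Dong-gu, Daegu Metropolitan City: status of entertainment bars and karaoke bars, as of 2023-05-15)
Author: 대구광역시 동구 (Dong-gu, Daegu Metropolitan City)
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=15061564&dataSetDetailId=150615641ccd236dc9eb9&provdMethod=FILE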

Alerts

연번 is highly overall correlated with 업종명 (High correlation)
업종명 is highly overall correlated with 연번 (High correlation)
소재지전화 has 76 (25.6%) missing values (Missing)
연번 has unique values (Unique)

Reproduction

Analysis started: 2024-04-19 06:22:19.308416
Analysis finished: 2024-04-19 06:22:19.817014
Duration: 0.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 297
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 149
Minimum: 1
Maximum: 297
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 15.8
Q1: 75
Median: 149
Q3: 223
95-th percentile: 282.2
Maximum: 297
Range: 296
Interquartile range (IQR): 148
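
The quantiles above are consistent with 연번 being the consecutive integers 1 to 297 and with linearly interpolated percentiles. The sketch below checks this under both assumptions; the `percentile` helper is hypothetical and is not the profiler's own code.

```python
def percentile(sorted_values, q):
    """Linearly interpolated percentile (q in [0, 100]) of pre-sorted data."""
    position = (len(sorted_values) - 1) * q / 100
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

# Assumption: 연번 is a plain running number 1..297.
values = list(range(1, 298))

p5 = percentile(values, 5)
q1 = percentile(values, 25)
median = percentile(values, 50)
q3 = percentile(values, 75)
p95 = percentile(values, 95)
iqr = q3 - q1

print(p5, q1, median, q3, p95, iqr)
```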

Descriptive statistics

Standard deviation: 85.880731
Coefficient of variation (CV): 0.57638075
Kurtosis: -1.2
Mean: 149
Median Absolute Deviation (MAD): 74
Skewness: 0
Sum: 44253
Variance: 7375.5
Monotonicity: Strictly increasing
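
These statistics are what one would expect from a running row number. A minimal check using only the Python standard library, assuming 연번 is exactly 1..297 and that the reported variance is the sample variance (n - 1 denominator):

```python
import statistics

# Assumption: 연번 is a plain running number 1..297.
values = list(range(1, 298))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation
med = statistics.median(values)
mad = statistics.median(abs(v - med) for v in values)  # median absolute deviation

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```

Because the column carries no information beyond row order, it is usually treated as an identifier rather than a feature.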
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.3%
205 | 1 | 0.3%
203 | 1 | 0.3%
202 | 1 | 0.3%
201 | 1 | 0.3%
200 | 1 | 0.3%
199 | 1 | 0.3%
198 | 1 | 0.3%
197 | 1 | 0.3%
196 | 1 | 0.3%
Other values (287) | 287 | 96.6%

Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Value | Count | Frequency (%)
297 | 1 | 0.3%
296 | 1 | 0.3%
295 | 1 | 0.3%
294 | 1 | 0.3%
293 | 1 | 0.3%
292 | 1 | 0.3%
291 | 1 | 0.3%
290 | 1 | 0.3%
289 | 1 | 0.3%
288 | 1 | 0.3%

업종명
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB
유흥주점영업: 201
단란주점: 96

Length

Max length: 6
Median length: 6
Mean length: 5.3535354
Min length: 4
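
The mean length is the count-weighted average of the two category label lengths. A short sketch of that calculation, assuming the labels are stored as precomposed Hangul syllables (one code point per syllable):

```python
# Category counts as reported for 업종명.
counts = {"유흥주점영업": 201, "단란주점": 96}

# Weight each label's character length by how often it occurs.
total_chars = sum(len(label) * n for label, n in counts.items())
mean_length = total_chars / sum(counts.values())

print(total_chars, round(mean_length, 7))
```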

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유흥주점영업
2nd row: 유흥주점영업
3rd row: 유흥주점영업
4th row: 유흥주점영업
5th row: 유흥주점영업

Common Values

Value | Count | Frequency (%)
유흥주점영업 | 201 | 67.7%
단란주점 | 96 | 32.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
유흥주점영업 | 201 | 67.7%
단란주점 | 96 | 32.3%

업소명
Text

Distinct: 264
Distinct (%): 88.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 13
Median length: 9
Mean length: 4.1144781
Min length: 1

Characters and Unicode

Total characters: 1222
Distinct characters: 293
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
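
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The sample string is one business name taken from the report's sample rows, and the `category_counts` helper is hypothetical, not part of the profiler. The two-letter codes map to the report's labels as follows: `Lo` is Other Letter (Hangul syllables), `Lu` Uppercase Letter, `Ll` Lowercase Letter, `Nd` Decimal Number, `Zs` Space Separator, `Ps` and `Pe` Open and Close Punctuation, `Pd` Dash Punctuation, and `Po` Other Punctuation.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "써프 (SURF)"  # one 업소명 value from the sample rows
counts = category_counts(sample)

for code, n in counts.most_common():
    print(code, n)
```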

Unique

Unique: 236
Unique (%): 79.5%

Sample

1st row: 큐브
2nd row: 평화가요주점
3rd row: 베르사체
4th row: 희로가요궁
5th row: 프린스
ValueCountFrequency (%)
5
 
1.6%
동촌라이브 3
 
1.0%
가요주점 3
 
1.0%
3
 
1.0%
연예인 3
 
1.0%
surf 2
 
0.6%
애플 2
 
0.6%
세븐 2
 
0.6%
초이스 2
 
0.6%
블랙 2
 
0.6%
Other values (260) 282
91.3%

Most occurring characters

ValueCountFrequency (%)
75
 
6.1%
74
 
6.1%
37
 
3.0%
29
 
2.4%
27
 
2.2%
26
 
2.1%
25
 
2.0%
25
 
2.0%
22
 
1.8%
22
 
1.8%
Other values (283) 860
70.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1099 | 89.9%
Uppercase Letter | 46 | 3.8%
Open Punctuation | 18 | 1.5%
Close Punctuation | 18 | 1.5%
Decimal Number | 17 | 1.4%
Space Separator | 12 | 1.0%
Lowercase Letter | 9 | 0.7%
Other Punctuation | 3 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
75
 
6.8%
74
 
6.7%
37
 
3.4%
29
 
2.6%
27
 
2.5%
26
 
2.4%
25
 
2.3%
25
 
2.3%
22
 
2.0%
22
 
2.0%
Other values (247) 737
67.1%
Uppercase Letter
ValueCountFrequency (%)
B 8
17.4%
O 6
13.0%
S 4
8.7%
M 4
8.7%
C 3
 
6.5%
L 3
 
6.5%
K 3
 
6.5%
A 2
 
4.3%
H 2
 
4.3%
F 2
 
4.3%
Other values (6) 9
19.6%
Decimal Number
ValueCountFrequency (%)
7 4
23.5%
2 4
23.5%
1 3
17.6%
0 2
11.8%
3 2
11.8%
9 1
 
5.9%
8 1
 
5.9%
Lowercase Letter
ValueCountFrequency (%)
a 3
33.3%
h 1
 
11.1%
r 1
 
11.1%
n 1
 
11.1%
e 1
 
11.1%
c 1
 
11.1%
o 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 1
33.3%
& 1
33.3%
. 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1099 | 89.9%
Common | 68 | 5.6%
Latin | 55 | 4.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
75
 
6.8%
74
 
6.7%
37
 
3.4%
29
 
2.6%
27
 
2.5%
26
 
2.4%
25
 
2.3%
25
 
2.3%
22
 
2.0%
22
 
2.0%
Other values (247) 737
67.1%
Latin
ValueCountFrequency (%)
B 8
14.5%
O 6
 
10.9%
S 4
 
7.3%
M 4
 
7.3%
a 3
 
5.5%
C 3
 
5.5%
L 3
 
5.5%
K 3
 
5.5%
A 2
 
3.6%
H 2
 
3.6%
Other values (13) 17
30.9%
Common
ValueCountFrequency (%)
( 18
26.5%
) 18
26.5%
12
17.6%
7 4
 
5.9%
2 4
 
5.9%
1 3
 
4.4%
0 2
 
2.9%
3 2
 
2.9%
, 1
 
1.5%
9 1
 
1.5%
Other values (3) 3
 
4.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1099 | 89.9%
ASCII | 123 | 10.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
75
 
6.8%
74
 
6.7%
37
 
3.4%
29
 
2.6%
27
 
2.5%
26
 
2.4%
25
 
2.3%
25
 
2.3%
22
 
2.0%
22
 
2.0%
Other values (247) 737
67.1%
ASCII
ValueCountFrequency (%)
( 18
14.6%
) 18
14.6%
12
 
9.8%
B 8
 
6.5%
O 6
 
4.9%
7 4
 
3.3%
2 4
 
3.3%
S 4
 
3.3%
M 4
 
3.3%
a 3
 
2.4%
Other values (26) 42
34.1%

소재지(도로명)
Text

Distinct: 261
Distinct (%): 87.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 42
Median length: 37
Mean length: 26.949495
Min length: 21

Characters and Unicode

Total characters: 8004
Distinct characters: 93
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 228
Unique (%): 76.8%

Sample

1st row: 대구광역시 동구 동대구로 462-3, 지하1층 (신천동)
2nd row: 대구광역시 동구 아양로 65 (신암동)
3rd row: 대구광역시 동구 동부로26길 60 (신천동)
4th row: 대구광역시 동구 효목로 20 (효목동)
5th row: 대구광역시 동구 아양로 30 (신암동)
Value | Count | Frequency (%)
대구광역시 | 297 | 17.8%
동구 | 297 | 17.8%
신천동 | 141 | 8.4%
지하1층 | 83 | 5.0%
신암동 | 53 | 3.2%
동부로30길 | 52 | 3.1%
효목동 | 35 | 2.1%
동부로22길 | 31 | 1.9%
아양로 | 31 | 1.9%
2층 | 28 | 1.7%
Other values (253) | 622 | 37.2%

Most occurring characters

ValueCountFrequency (%)
1373
17.2%
794
 
9.9%
607
 
7.6%
312
 
3.9%
303
 
3.8%
) 297
 
3.7%
( 297
 
3.7%
297
 
3.7%
297
 
3.7%
297
 
3.7%
Other values (83) 3130
39.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4515 | 56.4%
Space Separator | 1373 | 17.2%
Decimal Number | 1287 | 16.1%
Close Punctuation | 297 | 3.7%
Open Punctuation | 297 | 3.7%
Other Punctuation | 181 | 2.3%
Dash Punctuation | 51 | 0.6%
Uppercase Letter | 3 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
794
17.6%
607
13.4%
312
 
6.9%
303
 
6.7%
297
 
6.6%
297
 
6.6%
297
 
6.6%
239
 
5.3%
187
 
4.1%
157
 
3.5%
Other values (65) 1025
22.7%
Decimal Number
ValueCountFrequency (%)
2 260
20.2%
1 258
20.0%
3 170
13.2%
0 122
9.5%
6 110
8.5%
4 106
8.2%
8 76
 
5.9%
5 75
 
5.8%
7 63
 
4.9%
9 47
 
3.7%
Other Punctuation
ValueCountFrequency (%)
, 178
98.3%
. 3
 
1.7%
Uppercase Letter
ValueCountFrequency (%)
A 2
66.7%
B 1
33.3%
Space Separator
ValueCountFrequency (%)
1373
100.0%
Close Punctuation
ValueCountFrequency (%)
) 297
100.0%
Open Punctuation
ValueCountFrequency (%)
( 297
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 51
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4515 | 56.4%
Common | 3486 | 43.6%
Latin | 3 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
794
17.6%
607
13.4%
312
 
6.9%
303
 
6.7%
297
 
6.6%
297
 
6.6%
297
 
6.6%
239
 
5.3%
187
 
4.1%
157
 
3.5%
Other values (65) 1025
22.7%
Common
ValueCountFrequency (%)
1373
39.4%
) 297
 
8.5%
( 297
 
8.5%
2 260
 
7.5%
1 258
 
7.4%
, 178
 
5.1%
3 170
 
4.9%
0 122
 
3.5%
6 110
 
3.2%
4 106
 
3.0%
Other values (6) 315
 
9.0%
Latin
ValueCountFrequency (%)
A 2
66.7%
B 1
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4515 | 56.4%
ASCII | 3489 | 43.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1373
39.4%
) 297
 
8.5%
( 297
 
8.5%
2 260
 
7.5%
1 258
 
7.4%
, 178
 
5.1%
3 170
 
4.9%
0 122
 
3.5%
6 110
 
3.2%
4 106
 
3.0%
Other values (8) 318
 
9.1%
Hangul
ValueCountFrequency (%)
794
17.6%
607
13.4%
312
 
6.9%
303
 
6.7%
297
 
6.6%
297
 
6.6%
297
 
6.6%
239
 
5.3%
187
 
4.1%
157
 
3.5%
Other values (65) 1025
22.7%

소재지전화
Text

MISSING 

Distinct: 216
Distinct (%): 97.7%
Missing: 76
Missing (%): 25.6%
Memory size: 2.4 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.0181
Min length: 12

Characters and Unicode

Total characters: 2656
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 211
Unique (%): 95.5%

Sample

1st row: 053-755-4040
2nd row: 053-956-3222
3rd row: 053-744-8144
4th row: 053-743-1834
5th row: 053-955-5007
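
The sample values share one hyphenated shape (area code, exchange, line number), and the character statistics show only digits and dashes. A minimal validation sketch, assuming a pattern of a leading 0, two hyphens, and a 3- or 4-digit middle group; the pattern and the `is_valid_phone` helper are illustrative guesses, not a rule taken from the dataset's documentation:

```python
import re

# Assumed shape: 0NN-NNN-NNNN or 0NN-NNNN-NNNN (12 or 13 characters).
PHONE_PATTERN = re.compile(r"0\d{1,2}-\d{3,4}-\d{4}")

def is_valid_phone(value):
    """Return True when `value` is a string matching the assumed phone shape."""
    return isinstance(value, str) and PHONE_PATTERN.fullmatch(value) is not None

samples = ["053-755-4040", "053-956-3222", None, "0537554040"]
for value in samples:
    print(value, is_valid_phone(value))
```

Missing entries (shown as <NA> in the report) are treated as invalid here; a cleaning step might instead keep them as a separate "unknown" state.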
Value | Count | Frequency (%)
053-959-3001 | 2 | 0.9%
053-751-6969 | 2 | 0.9%
053-754-6206 | 2 | 0.9%
053-985-1204 | 2 | 0.9%
053-985-3001 | 2 | 0.9%
053-741-1441 | 1 | 0.5%
053-955-0110 | 1 | 0.5%
053-954-2080 | 1 | 0.5%
053-746-3111 | 1 | 0.5%
053-755-4040 | 1 | 0.5%
Other values (206) | 206 | 93.2%

Most occurring characters

ValueCountFrequency (%)
5 442
16.6%
- 442
16.6%
0 347
13.1%
3 346
13.0%
7 210
7.9%
9 198
7.5%
4 187
7.0%
1 139
 
5.2%
2 127
 
4.8%
6 111
 
4.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2214 | 83.4%
Dash Punctuation | 442 | 16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 442
20.0%
0 347
15.7%
3 346
15.6%
7 210
9.5%
9 198
8.9%
4 187
8.4%
1 139
 
6.3%
2 127
 
5.7%
6 111
 
5.0%
8 107
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 442
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2656
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 442
16.6%
- 442
16.6%
0 347
13.1%
3 346
13.0%
7 210
7.9%
9 198
7.5%
4 187
7.0%
1 139
 
5.2%
2 127
 
4.8%
6 111
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2656
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 442
16.6%
- 442
16.6%
0 347
13.1%
3 346
13.0%
7 210
7.9%
9 198
7.5%
4 187
7.0%
1 139
 
5.2%
2 127
 
4.8%
6 111
 
4.2%

Interactions


Correlations

 | 연번 | 업종명
연번 | 1.000 | 0.998
업종명 | 0.998 | 1.000

 | 연번 | 업종명
연번 | 1.000 | 0.943
업종명 | 0.943 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 업종명 | 업소명 | 소재지(도로명) | 소재지전화
0 | 1 | 유흥주점영업 | 큐브 | 대구광역시 동구 동대구로 462-3, 지하1층 (신천동) | 053-755-4040
1 | 2 | 유흥주점영업 | 평화가요주점 | 대구광역시 동구 아양로 65 (신암동) | 053-956-3222
2 | 3 | 유흥주점영업 | 베르사체 | 대구광역시 동구 동부로26길 60 (신천동) | 053-744-8144
3 | 4 | 유흥주점영업 | 희로가요궁 | 대구광역시 동구 효목로 20 (효목동) | 053-743-1834
4 | 5 | 유흥주점영업 | 프린스 | 대구광역시 동구 아양로 30 (신암동) | 053-955-5007
5 | 6 | 유흥주점영업 | 위너 | 대구광역시 동구 동부로26길 25 (신천동) | 053-742-1250
6 | 7 | 유흥주점영업 | SBS | 대구광역시 동구 동부로22길 36 (신천동) | 053-754-0500
7 | 8 | 유흥주점영업 | 귀빈회관 | 대구광역시 동구 화랑로25길 5 (효목동) | 053-755-5057
8 | 9 | 유흥주점영업 | 하야트 | 대구광역시 동구 화랑로25길 5 (효목동) | 053-756-4939
9 | 10 | 유흥주점영업 | 신세계 | 대구광역시 동구 장등로 20 (신천동) | 053-742-7978

(index) | 연번 | 업종명 | 업소명 | 소재지(도로명) | 소재지전화
287 | 288 | 단란주점 | 탑단란주점 | 대구광역시 동구 이노밸리로26길 14, 4층 401호 (각산동) | <NA>
288 | 289 | 단란주점 | 아모르파티 | 대구광역시 동구 이노밸리로 322, 408,409호 (신서동, 비젼스퀘어2) | <NA>
289 | 290 | 단란주점 | 써프 (SURF) | 대구광역시 동구 동부로28길 40, 지하1층 (신천동) | <NA>
290 | 291 | 단란주점 | 홀릭(HOLIC) | 대구광역시 동구 동부로28길 39-1, 1층 (신천동) | <NA>
291 | 292 | 단란주점 | 동촌라이브 | 대구광역시 동구 효동로2길 57-11, 2층 (효목동) | 053-985-3001
292 | 293 | 단란주점 | 신천별곡 | 대구광역시 동구 동부로30길 43, 2층 (신천동) | <NA>
293 | 294 | 단란주점 | 로타리라이브 | 대구광역시 동구 효동로2길 13-3, 지하1층 (효목동) | <NA>
294 | 295 | 단란주점 |  | 대구광역시 동구 동촌로 224, 지하1층 (방촌동) | 053-981-7228
295 | 296 | 단란주점 | 필라이브카페 | 대구광역시 동구 효동로6길 88, 3층 (효목동) | <NA>
296 | 297 | 단란주점 | 동촌라이브 3층 | 대구광역시 동구 효동로2길 57-11, 3층 (효목동) | 053-959-3001