Overview

Dataset statistics

Number of variables4
Number of observations261
Missing cells53
Missing cells (%)5.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.3 KiB
Average record size in memory32.5 B

Variable types

Categorical1
Text3

Dataset

Description공공데이터 목록에는 부산광역시 연제구 유흥주점 단란주점 현황이 있습니다.유흥주점 단란주점 소재지 전화번호, 업종명, 업소명 등이 기록되어 있습니다.
Author부산광역시 연제구
URLhttps://www.data.go.kr/data/3082714/fileData.do

Alerts

소재지전화 has 53 (20.3%) missing valuesMissing

Reproduction

Analysis started2023-12-12 11:12:23.233369
Analysis finished2023-12-12 11:12:23.772102
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
유흥주점영업
178 
단란주점
83 

Length

Max length6
Median length6
Mean length5.3639847
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유흥주점영업
2nd row유흥주점영업
3rd row유흥주점영업
4th row유흥주점영업
5th row유흥주점영업

Common Values

ValueCountFrequency (%)
유흥주점영업 178
68.2%
단란주점 83
31.8%

Length

2023-12-12T20:12:23.907537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:12:24.092292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유흥주점영업 178
68.2%
단란주점 83
31.8%
Distinct258
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T20:12:24.459685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length5.467433
Min length1

Characters and Unicode

Total characters1427
Distinct characters296
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique255 ?
Unique (%)97.7%

Sample

1st row수정성인룸크럽
2nd row술마시는 싱싱노래방
3rd row7080태평양
4th row여궁 노래주점
5th row조아노래주점
ValueCountFrequency (%)
노래방 10
 
3.2%
노래주점 8
 
2.5%
술마시는 6
 
1.9%
라이브 5
 
1.6%
노래타운 3
 
0.9%
7080 3
 
0.9%
비타민 2
 
0.6%
오픈식 2
 
0.6%
2
 
0.6%
주식회사 2
 
0.6%
Other values (268) 273
86.4%
2023-12-12T20:12:25.140505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
93
 
6.5%
89
 
6.2%
65
 
4.6%
62
 
4.3%
55
 
3.9%
33
 
2.3%
31
 
2.2%
29
 
2.0%
29
 
2.0%
28
 
2.0%
Other values (286) 913
64.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1296
90.8%
Space Separator 55
 
3.9%
Decimal Number 35
 
2.5%
Uppercase Letter 21
 
1.5%
Close Punctuation 8
 
0.6%
Open Punctuation 8
 
0.6%
Lowercase Letter 3
 
0.2%
Letter Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
93
 
7.2%
89
 
6.9%
65
 
5.0%
62
 
4.8%
33
 
2.5%
31
 
2.4%
29
 
2.2%
29
 
2.2%
28
 
2.2%
27
 
2.1%
Other values (261) 810
62.5%
Uppercase Letter
ValueCountFrequency (%)
N 3
14.3%
B 3
14.3%
O 3
14.3%
K 2
9.5%
J 2
9.5%
E 2
9.5%
S 2
9.5%
V 1
 
4.8%
U 1
 
4.8%
G 1
 
4.8%
Decimal Number
ValueCountFrequency (%)
0 13
37.1%
7 7
20.0%
8 7
20.0%
2 5
 
14.3%
9 1
 
2.9%
4 1
 
2.9%
1 1
 
2.9%
Lowercase Letter
ValueCountFrequency (%)
m 1
33.3%
i 1
33.3%
k 1
33.3%
Space Separator
ValueCountFrequency (%)
55
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1295
90.7%
Common 106
 
7.4%
Latin 25
 
1.8%
Han 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
93
 
7.2%
89
 
6.9%
65
 
5.0%
62
 
4.8%
33
 
2.5%
31
 
2.4%
29
 
2.2%
29
 
2.2%
28
 
2.2%
27
 
2.1%
Other values (260) 809
62.5%
Latin
ValueCountFrequency (%)
N 3
12.0%
B 3
12.0%
O 3
12.0%
K 2
 
8.0%
J 2
 
8.0%
E 2
 
8.0%
S 2
 
8.0%
V 1
 
4.0%
m 1
 
4.0%
U 1
 
4.0%
Other values (5) 5
20.0%
Common
ValueCountFrequency (%)
55
51.9%
0 13
 
12.3%
) 8
 
7.5%
( 8
 
7.5%
7 7
 
6.6%
8 7
 
6.6%
2 5
 
4.7%
9 1
 
0.9%
4 1
 
0.9%
1 1
 
0.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1295
90.7%
ASCII 130
 
9.1%
Number Forms 1
 
0.1%
CJK 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
93
 
7.2%
89
 
6.9%
65
 
5.0%
62
 
4.8%
33
 
2.5%
31
 
2.4%
29
 
2.2%
29
 
2.2%
28
 
2.2%
27
 
2.1%
Other values (260) 809
62.5%
ASCII
ValueCountFrequency (%)
55
42.3%
0 13
 
10.0%
) 8
 
6.2%
( 8
 
6.2%
7 7
 
5.4%
8 7
 
5.4%
2 5
 
3.8%
N 3
 
2.3%
B 3
 
2.3%
O 3
 
2.3%
Other values (14) 18
 
13.8%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct210
Distinct (%)80.5%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T20:12:25.487882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length35
Mean length27.111111
Min length21

Characters and Unicode

Total characters7076
Distinct characters65
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)70.1%

Sample

1st row부산광역시 연제구 중앙대로 1116-9 (연산동)
2nd row부산광역시 연제구 반송로 13-10 (연산동)
3rd row부산광역시 연제구 반송로 13-8 (연산동)
4th row부산광역시 연제구 거제천로230번길 98 (연산동)
5th row부산광역시 연제구 중앙대로 1116-11 (연산동,연산4동)
ValueCountFrequency (%)
부산광역시 261
19.1%
연제구 261
19.1%
연산동 224
16.4%
반송로 51
 
3.7%
월드컵대로 40
 
2.9%
과정로 25
 
1.8%
고분로 24
 
1.8%
중앙대로1120번길 23
 
1.7%
고분로13번길 22
 
1.6%
중앙대로 20
 
1.5%
Other values (199) 415
30.4%
2023-12-12T20:12:26.209345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1105
 
15.6%
526
 
7.4%
521
 
7.4%
1 375
 
5.3%
) 277
 
3.9%
( 277
 
3.9%
277
 
3.9%
270
 
3.8%
267
 
3.8%
263
 
3.7%
Other values (55) 2918
41.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4214
59.6%
Space Separator 1105
 
15.6%
Decimal Number 1040
 
14.7%
Close Punctuation 277
 
3.9%
Open Punctuation 277
 
3.9%
Dash Punctuation 83
 
1.2%
Other Punctuation 74
 
1.0%
Uppercase Letter 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
526
12.5%
521
12.4%
277
 
6.6%
270
 
6.4%
267
 
6.3%
263
 
6.2%
261
 
6.2%
261
 
6.2%
261
 
6.2%
261
 
6.2%
Other values (37) 1046
24.8%
Decimal Number
ValueCountFrequency (%)
1 375
36.1%
2 139
 
13.4%
3 95
 
9.1%
4 94
 
9.0%
6 72
 
6.9%
5 65
 
6.2%
0 62
 
6.0%
8 56
 
5.4%
7 41
 
3.9%
9 41
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
C 2
33.3%
T 2
33.3%
B 2
33.3%
Space Separator
ValueCountFrequency (%)
1105
100.0%
Close Punctuation
ValueCountFrequency (%)
) 277
100.0%
Open Punctuation
ValueCountFrequency (%)
( 277
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 83
100.0%
Other Punctuation
ValueCountFrequency (%)
, 74
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4214
59.6%
Common 2856
40.4%
Latin 6
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
526
12.5%
521
12.4%
277
 
6.6%
270
 
6.4%
267
 
6.3%
263
 
6.2%
261
 
6.2%
261
 
6.2%
261
 
6.2%
261
 
6.2%
Other values (37) 1046
24.8%
Common
ValueCountFrequency (%)
1105
38.7%
1 375
 
13.1%
) 277
 
9.7%
( 277
 
9.7%
2 139
 
4.9%
3 95
 
3.3%
4 94
 
3.3%
- 83
 
2.9%
, 74
 
2.6%
6 72
 
2.5%
Other values (5) 265
 
9.3%
Latin
ValueCountFrequency (%)
C 2
33.3%
T 2
33.3%
B 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4214
59.6%
ASCII 2862
40.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1105
38.6%
1 375
 
13.1%
) 277
 
9.7%
( 277
 
9.7%
2 139
 
4.9%
3 95
 
3.3%
4 94
 
3.3%
- 83
 
2.9%
, 74
 
2.6%
6 72
 
2.5%
Other values (8) 271
 
9.5%
Hangul
ValueCountFrequency (%)
526
12.5%
521
12.4%
277
 
6.6%
270
 
6.4%
267
 
6.3%
263
 
6.2%
261
 
6.2%
261
 
6.2%
261
 
6.2%
261
 
6.2%
Other values (37) 1046
24.8%

소재지전화
Text

MISSING 

Distinct205
Distinct (%)98.6%
Missing53
Missing (%)20.3%
Memory size2.2 KiB
2023-12-12T20:12:26.786417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.990385
Min length12

Characters and Unicode

Total characters2910
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique202 ?
Unique (%)97.1%

Sample

1st row051 -853 -0666
2nd row 051- 868-5587
3rd row 051- 868-6466
4th row 051- 867-8877
5th row051 -868 -6585
ValueCountFrequency (%)
051 203
41.7%
852 12
 
2.5%
867 10
 
2.1%
853 8
 
1.6%
865 7
 
1.4%
868 6
 
1.2%
851 6
 
1.2%
864 5
 
1.0%
863 4
 
0.8%
861 4
 
0.8%
Other values (215) 222
45.6%
2023-12-12T20:12:27.827543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 416
14.3%
413
14.2%
5 386
13.3%
0 318
10.9%
1 314
10.8%
8 311
10.7%
6 222
7.6%
7 134
 
4.6%
2 114
 
3.9%
3 111
 
3.8%
Other values (2) 171
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2081
71.5%
Dash Punctuation 416
 
14.3%
Space Separator 413
 
14.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 386
18.5%
0 318
15.3%
1 314
15.1%
8 311
14.9%
6 222
10.7%
7 134
 
6.4%
2 114
 
5.5%
3 111
 
5.3%
4 89
 
4.3%
9 82
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 416
100.0%
Space Separator
ValueCountFrequency (%)
413
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2910
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 416
14.3%
413
14.2%
5 386
13.3%
0 318
10.9%
1 314
10.8%
8 311
10.7%
6 222
7.6%
7 134
 
4.6%
2 114
 
3.9%
3 111
 
3.8%
Other values (2) 171
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2910
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 416
14.3%
413
14.2%
5 386
13.3%
0 318
10.9%
1 314
10.8%
8 311
10.7%
6 222
7.6%
7 134
 
4.6%
2 114
 
3.9%
3 111
 
3.8%
Other values (2) 171
5.9%

Missing values

2023-12-12T20:12:23.587979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:12:23.713873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(도로명)소재지전화
0유흥주점영업수정성인룸크럽부산광역시 연제구 중앙대로 1116-9 (연산동)051 -853 -0666
1유흥주점영업술마시는 싱싱노래방부산광역시 연제구 반송로 13-10 (연산동)051- 868-5587
2유흥주점영업7080태평양부산광역시 연제구 반송로 13-8 (연산동)051- 868-6466
3유흥주점영업여궁 노래주점부산광역시 연제구 거제천로230번길 98 (연산동)051- 867-8877
4유흥주점영업조아노래주점부산광역시 연제구 중앙대로 1116-11 (연산동,연산4동)051 -868 -6585
5유흥주점영업메리트부산광역시 연제구 중앙대로1120번길 13 (연산동)051- 852-1254
6유흥주점영업술마시는 도화 노래방부산광역시 연제구 반송로 16 (연산동)051- 864-4375
7유흥주점영업카네기 실내포장부산광역시 연제구 중앙대로1120번길 14-6 (연산동)<NA>
8유흥주점영업올리브부산광역시 연제구 과정로 156 (연산동)051 -758 -9491
9유흥주점영업초콜릿부산광역시 연제구 고분로13번길 5-20 (연산동)051 -865 -5200
업종명업소명소재지(도로명)소재지전화
251단란주점발리노래방 단란주점부산광역시 연제구 고분로13번길 43, 4층 (연산동)051 -862 -1900
252단란주점샤인소맥클럽 단란주점부산광역시 연제구 반송로 32-15, 2층 (연산동)<NA>
253단란주점나도가수다부산광역시 연제구 반송로 9-1, 3층 (연산동)051 -852 -5789
254단란주점캡틴원탁가라오케부산광역시 연제구 중앙대로1120번길 8, 5층 (연산동)<NA>
255단란주점신데렐라부산광역시 연제구 고분로13번길 43, 3층 일부호 (연산동)<NA>
256단란주점U턴 원탁가라오케부산광역시 연제구 고분로13번길 11, 2층 (연산동)<NA>
257단란주점소금창고부산광역시 연제구 과정로 166, 지하1층 (연산동)<NA>
258단란주점파티야부산광역시 연제구 고분로 5-1, 2층 일부호 (연산동)051- 853-8091
259단란주점영라이브바부산광역시 연제구 고분로13번길 5-16, 2층 (연산동)<NA>
260단란주점올레원탁가라오케부산광역시 연제구 거제천로182번길 42, 3층 (연산동)<NA>