Overview

Dataset statistics

Number of variables: 5
Number of observations: 108
Missing cells: 23
Missing cells (%): 4.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.3 KiB
Average record size in memory: 41.2 B
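As a quick check on the figures above, the missing-cell percentage is the number of missing cells divided by rows times columns. The sketch below is illustrative only: the function name `missing_cell_share` is hypothetical and is not part of ydata-profiling.

```python
def missing_cell_share(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Return missing cells as a percentage of all cells in the table."""
    total_cells = n_rows * n_cols
    return 100.0 * missing_cells / total_cells


# Whole table: 23 missing cells across 108 rows and 5 columns.
overall = missing_cell_share(23, 108, 5)

# Single column: all 23 missing cells sit in one column of 108 rows.
single_column = missing_cell_share(23, 108, 1)

print(f"{overall:.1f}%")
print(f"{single_column:.1f}%")
```

The second figure corresponds to the per-column percentage reported in the alert for 소재지전화.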

Variable types

Categorical: 1
Text: 4

Dataset

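Description: Provides detailed information on the business name, road-name address, telephone number, and business type of entertainment bars (유흥주점) and karaoke bars (단란주점) located in Nam-gu, Busan Metropolitan City.
URL: https://www.data.go.kr/data/3081503/fileData.do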

Alerts

소재지전화 has 23 (21.3%) missing values (Missing)

Reproduction

Analysis started: 2023-12-12 08:02:42.541083
Analysis finished: 2023-12-12 08:02:43.056653
Duration: 0.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종명
Categorical

Distinct: 2
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 996.0 B
단란주점: 67
유흥주점영업: 41

Length

Max length: 6
Median length: 4
Mean length: 4.7592593
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유흥주점영업
2nd row: 유흥주점영업
3rd row: 유흥주점영업
4th row: 유흥주점영업
5th row: 유흥주점영업

Common Values

Value | Count | Frequency (%)
단란주점 | 67 | 62.0%
유흥주점영업 | 41 | 38.0%
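The frequencies and length statistics for this variable follow directly from the two category counts. A minimal sketch, assuming lengths are counted in Unicode code points of the precomposed Hangul strings; the variable names are illustrative and do not come from the report.

```python
import statistics
from collections import Counter

# Category counts as reported for 업종명.
counts = Counter({"단란주점": 67, "유흥주점영업": 41})
total = sum(counts.values())

# Share of each category, in percent, rounded to one decimal.
frequency = {value: round(100.0 * n / total, 1) for value, n in counts.items()}

# One length entry per observation, so the statistics are per row.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]
mean_length = statistics.mean(lengths)
median_length = statistics.median(lengths)

print(frequency)
print(round(mean_length, 7), median_length, min(lengths), max(lengths))
```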

Length

Histogram of lengths of the category

Common Values (Plot)

[Plot omitted: counts per category, matching the Common Values table.]

업소명
Text

Distinct: 107
Distinct (%): 99.1%
Missing: 0
Missing (%): 0.0%
Memory size: 996.0 B

Length

Max length: 14
Median length: 12
Mean length: 5.0925926
Min length: 1

Characters and Unicode

Total characters: 550
Distinct characters: 187
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
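To illustrate the kind of character property meant here, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is an illustration only: whether the report derives its category counts in exactly this way is an assumption, and the helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this example.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
}


def category_counts(text: str) -> Counter:
    """Count characters in text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# One of the sample business names from this variable.
sample = "준코 뮤직타운 경성대2호점"
counts = category_counts(sample)

for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

Hangul syllables fall under Other Letter, which is why that category dominates the tables for the name and address columns.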

Unique

Unique: 106
Unique (%): 98.1%

Sample

1st row: 대연각포장센타
2nd row: 블루 노래주점
3rd row: 골든벨노래주점
4th row: 제시 노래주점
5th row: 준코 뮤직타운 경성대2호점
Value | Count | Frequency (%)
노래주점 | 5 | 3.9%
단란주점 | 3 | 2.3%
발리노래자랑 | 2 | 1.6%
에이스 | 2 | 1.6%
노래방 | 2 | 1.6%
힐링 | 2 | 1.6%
스마트노래자랑 | 1 | 0.8%
모래알 | 1 | 0.8%
이벤트가요노래주점 | 1 | 0.8%
(blank) | 1 | 0.8%
Other values (109) | 109 | 84.5%

Most occurring characters

ValueCountFrequency (%)
30
 
5.5%
29
 
5.3%
29
 
5.3%
29
 
5.3%
21
 
3.8%
14
 
2.5%
14
 
2.5%
13
 
2.4%
11
 
2.0%
11
 
2.0%
Other values (177) 349
63.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 507 | 92.2%
Space Separator | 21 | 3.8%
Decimal Number | 14 | 2.5%
Other Punctuation | 3 | 0.5%
Uppercase Letter | 3 | 0.5%
Close Punctuation | 1 | 0.2%
Open Punctuation | 1 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
5.9%
29
 
5.7%
29
 
5.7%
29
 
5.7%
14
 
2.8%
14
 
2.8%
13
 
2.6%
11
 
2.2%
11
 
2.2%
10
 
2.0%
Other values (165) 317
62.5%
Decimal Number
ValueCountFrequency (%)
0 6
42.9%
7 3
21.4%
8 3
21.4%
2 1
 
7.1%
1 1
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
P 1
33.3%
I 1
33.3%
V 1
33.3%
Space Separator
ValueCountFrequency (%)
21
100.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 507 | 92.2%
Common | 40 | 7.3%
Latin | 3 | 0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
5.9%
29
 
5.7%
29
 
5.7%
29
 
5.7%
14
 
2.8%
14
 
2.8%
13
 
2.6%
11
 
2.2%
11
 
2.2%
10
 
2.0%
Other values (165) 317
62.5%
Common
ValueCountFrequency (%)
21
52.5%
0 6
 
15.0%
. 3
 
7.5%
7 3
 
7.5%
8 3
 
7.5%
) 1
 
2.5%
( 1
 
2.5%
2 1
 
2.5%
1 1
 
2.5%
Latin
ValueCountFrequency (%)
P 1
33.3%
I 1
33.3%
V 1
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 507 | 92.2%
ASCII | 43 | 7.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
30
 
5.9%
29
 
5.7%
29
 
5.7%
29
 
5.7%
14
 
2.8%
14
 
2.8%
13
 
2.6%
11
 
2.2%
11
 
2.2%
10
 
2.0%
Other values (165) 317
62.5%
ASCII
ValueCountFrequency (%)
21
48.8%
0 6
 
14.0%
. 3
 
7.0%
7 3
 
7.0%
8 3
 
7.0%
P 1
 
2.3%
) 1
 
2.3%
I 1
 
2.3%
V 1
 
2.3%
( 1
 
2.3%
Other values (2) 2
 
4.7%

소재지(도로명)
Text

Distinct: 105
Distinct (%): 97.2%
Missing: 0
Missing (%): 0.0%
Memory size: 996.0 B

Length

Max length: 32
Median length: 30
Mean length: 24.638889
Min length: 20

Characters and Unicode

Total characters: 2661
Distinct characters: 60
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 102
Unique (%): 94.4%

Sample

1st row: 부산광역시 남구 수영로266번길 5 (대연동)
2nd row: 부산광역시 남구 수영로266번길 13 (대연동)
3rd row: 부산광역시 남구 진남로 9 (대연동)
4th row: 부산광역시 남구 수영로219번길 12-4 (대연동)
5th row: 부산광역시 남구 용소로7번길 9 (대연동)
Value | Count | Frequency (%)
부산광역시 | 108 | 19.6%
남구 | 108 | 19.6%
용호동 | 48 | 8.7%
대연동 | 44 | 8.0%
용호로 | 21 | 3.8%
수영로 | 13 | 2.4%
동명로 | 11 | 2.0%
문현동 | 10 | 1.8%
동명로132번길 | 6 | 1.1%
지하1층 | 6 | 1.1%
Other values (121) | 177 | 32.1%
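The word counts above look like the result of splitting each address on whitespace and dropping surrounding punctuation (for example, 대연동 rather than (대연동)). The sketch below reproduces that idea on the five sample addresses. The tokenisation rule is an assumption inferred from the table, not a documented ydata-profiling behaviour, and the helper name `tokenize` is hypothetical.

```python
from collections import Counter
from typing import List

# The five sample rows shown for this variable.
addresses = [
    "부산광역시 남구 수영로266번길 5 (대연동)",
    "부산광역시 남구 수영로266번길 13 (대연동)",
    "부산광역시 남구 진남로 9 (대연동)",
    "부산광역시 남구 수영로219번길 12-4 (대연동)",
    "부산광역시 남구 용소로7번길 9 (대연동)",
]


def tokenize(address: str) -> List[str]:
    """Split on whitespace and strip brackets and commas (assumed rule)."""
    tokens = (part.strip("(),") for part in address.split())
    return [token for token in tokens if token]


word_counts = Counter(token for address in addresses for token in tokenize(address))

for word, n in word_counts.most_common(5):
    print(word, n)
```

On the full column the same approach would be expected to put 부산광역시 and 남구 at the top, since every address begins with them.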

Most occurring characters

ValueCountFrequency (%)
444
 
16.7%
134
 
5.0%
1 130
 
4.9%
110
 
4.1%
) 109
 
4.1%
( 109
 
4.1%
108
 
4.1%
108
 
4.1%
108
 
4.1%
108
 
4.1%
Other values (50) 1193
44.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1551 | 58.3%
Space Separator | 444 | 16.7%
Decimal Number | 402 | 15.1%
Close Punctuation | 109 | 4.1%
Open Punctuation | 109 | 4.1%
Dash Punctuation | 30 | 1.1%
Other Punctuation | 16 | 0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
134
 
8.6%
110
 
7.1%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
77
 
5.0%
Other values (35) 474
30.6%
Decimal Number
ValueCountFrequency (%)
1 130
32.3%
2 61
15.2%
3 47
 
11.7%
5 33
 
8.2%
4 33
 
8.2%
6 27
 
6.7%
9 23
 
5.7%
7 18
 
4.5%
0 16
 
4.0%
8 14
 
3.5%
Space Separator
ValueCountFrequency (%)
444
100.0%
Close Punctuation
ValueCountFrequency (%)
) 109
100.0%
Open Punctuation
ValueCountFrequency (%)
( 109
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%
Other Punctuation
ValueCountFrequency (%)
, 16
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1551 | 58.3%
Common | 1110 | 41.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
134
 
8.6%
110
 
7.1%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
77
 
5.0%
Other values (35) 474
30.6%
Common
ValueCountFrequency (%)
444
40.0%
1 130
 
11.7%
) 109
 
9.8%
( 109
 
9.8%
2 61
 
5.5%
3 47
 
4.2%
5 33
 
3.0%
4 33
 
3.0%
- 30
 
2.7%
6 27
 
2.4%
Other values (5) 87
 
7.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1551 | 58.3%
ASCII | 1110 | 41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
444
40.0%
1 130
 
11.7%
) 109
 
9.8%
( 109
 
9.8%
2 61
 
5.5%
3 47
 
4.2%
5 33
 
3.0%
4 33
 
3.0%
- 30
 
2.7%
6 27
 
2.4%
Other values (5) 87
 
7.8%
Hangul
ValueCountFrequency (%)
134
 
8.6%
110
 
7.1%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
108
 
7.0%
77
 
5.0%
Other values (35) 474
30.6%

소재지(지번)
Text

Distinct: 104
Distinct (%): 96.3%
Missing: 0
Missing (%): 0.0%
Memory size: 996.0 B

Length

Max length: 34
Median length: 27
Mean length: 20.416667
Min length: 18

Characters and Unicode

Total characters: 2205
Distinct characters: 49
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 100
Unique (%): 92.6%

Sample

1st row: 부산광역시 남구 대연동 324-18
2nd row: 부산광역시 남구 대연동 376-8
3rd row: 부산광역시 남구 대연동 1742-1
4th row: 부산광역시 남구 대연동 1745-19
5th row: 부산광역시 남구 대연동 55-18
Value | Count | Frequency (%)
부산광역시 | 108 | 24.4%
남구 | 108 | 24.4%
대연동 | 49 | 11.1%
용호동 | 48 | 10.8%
문현동 | 10 | 2.3%
325-10 | 2 | 0.5%
395-1 | 2 | 0.5%
1742-10 | 2 | 0.5%
370-14 | 2 | 0.5%
406-16 | 2 | 0.5%
Other values (110) | 110 | 24.8%

Most occurring characters

ValueCountFrequency (%)
443
20.1%
- 111
 
5.0%
109
 
4.9%
108
 
4.9%
108
 
4.9%
108
 
4.9%
108
 
4.9%
108
 
4.9%
108
 
4.9%
108
 
4.9%
Other values (39) 786
35.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1097 | 49.8%
Decimal Number | 541 | 24.5%
Space Separator | 443 | 20.1%
Dash Punctuation | 111 | 5.0%
Uppercase Letter | 11 | 0.5%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
109
9.9%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
50
 
4.6%
50
 
4.6%
Other values (15) 132
12.0%
Decimal Number
ValueCountFrequency (%)
1 102
18.9%
3 78
14.4%
7 54
10.0%
5 54
10.0%
4 51
9.4%
9 48
8.9%
2 46
8.5%
6 38
 
7.0%
8 36
 
6.7%
0 34
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
E 2
18.2%
T 1
9.1%
L 1
9.1%
H 1
9.1%
O 1
9.1%
R 1
9.1%
B 1
9.1%
M 1
9.1%
U 1
9.1%
N 1
9.1%
Space Separator
ValueCountFrequency (%)
443
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 111
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1097 | 49.8%
Hangul | 1097 | 49.8%
Latin | 11 | 0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
109
9.9%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
50
 
4.6%
50
 
4.6%
Other values (15) 132
12.0%
Common
ValueCountFrequency (%)
443
40.4%
- 111
 
10.1%
1 102
 
9.3%
3 78
 
7.1%
7 54
 
4.9%
5 54
 
4.9%
4 51
 
4.6%
9 48
 
4.4%
2 46
 
4.2%
6 38
 
3.5%
Other values (4) 72
 
6.6%
Latin
ValueCountFrequency (%)
E 2
18.2%
T 1
9.1%
L 1
9.1%
H 1
9.1%
O 1
9.1%
R 1
9.1%
B 1
9.1%
M 1
9.1%
U 1
9.1%
N 1
9.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1108 | 50.2%
Hangul | 1097 | 49.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
443
40.0%
- 111
 
10.0%
1 102
 
9.2%
3 78
 
7.0%
7 54
 
4.9%
5 54
 
4.9%
4 51
 
4.6%
9 48
 
4.3%
2 46
 
4.2%
6 38
 
3.4%
Other values (14) 83
 
7.5%
Hangul
ValueCountFrequency (%)
109
9.9%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
108
9.8%
50
 
4.6%
50
 
4.6%
Other values (15) 132
12.0%

소재지전화
Text

MISSING 

Distinct: 85
Distinct (%): 100.0%
Missing: 23
Missing (%): 21.3%
Memory size: 996.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 1020
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 85
Unique (%): 100.0%

Sample

1st row: 051-628-0220
2nd row: 051-628-4447
3rd row: 051-618-3133
4th row: 051-611-1911
5th row: 051-610-0496
Value | Count | Frequency (%)
051-621-5232 | 1 | 1.2%
051-612-4416 | 1 | 1.2%
051-626-6440 | 1 | 1.2%
051-761-0201 | 1 | 1.2%
051-642-6977 | 1 | 1.2%
051-611-2870 | 1 | 1.2%
051-624-3800 | 1 | 1.2%
051-625-3122 | 1 | 1.2%
051-628-8815 | 1 | 1.2%
051-647-8331 | 1 | 1.2%
Other values (75) | 75 | 88.2%

Most occurring characters

ValueCountFrequency (%)
- 170
16.7%
1 144
14.1%
0 141
13.8%
6 124
12.2%
5 121
11.9%
2 103
10.1%
3 57
 
5.6%
7 49
 
4.8%
4 44
 
4.3%
8 42
 
4.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 850 | 83.3%
Dash Punctuation | 170 | 16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 144
16.9%
0 141
16.6%
6 124
14.6%
5 121
14.2%
2 103
12.1%
3 57
 
6.7%
7 49
 
5.8%
4 44
 
5.2%
8 42
 
4.9%
9 25
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 170
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1020 | 100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 170
16.7%
1 144
14.1%
0 141
13.8%
6 124
12.2%
5 121
11.9%
2 103
10.1%
3 57
 
5.6%
7 49
 
4.8%
4 44
 
4.3%
8 42
 
4.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1020 | 100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 170
16.7%
1 144
14.1%
0 141
13.8%
6 124
12.2%
5 121
11.9%
2 103
10.1%
3 57
 
5.6%
7 49
 
4.8%
4 44
 
4.3%
8 42
 
4.1%
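The character totals for 소재지전화 are consistent with every non-missing value having the shape of three digits, a hyphen, three digits, a hyphen, and four digits. The sketch below checks that shape on the five sample rows and recomputes the expected totals. The regular expression is an assumption inferred from the samples and the fixed length of 12, not a rule stated in the report.

```python
import re

# Assumed shape of a phone number in this column: ddd-ddd-dddd.
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")

sample_numbers = [
    "051-628-0220",
    "051-628-4447",
    "051-618-3133",
    "051-611-1911",
    "051-610-0496",
]

all_valid = all(PHONE_PATTERN.fullmatch(n) is not None for n in sample_numbers)

# 108 observations minus 23 missing values leaves 85 phone numbers.
non_missing = 108 - 23
expected_total_chars = non_missing * 12  # every value has length 12
expected_dashes = non_missing * 2        # two hyphens per number
expected_digits = non_missing * 10       # ten digits per number

print(all_valid, expected_total_chars, expected_dashes, expected_digits)
```

The three expected totals agree with the reported 1020 characters, 170 Dash Punctuation characters, and 850 Decimal Number characters.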

Correlations

[Figure omitted: correlation heatmap]
 | 업종명 | 소재지전화
업종명 | 1.000 | 1.000
소재지전화 | 1.000 | 1.000

Missing values

[Figure omitted: bar chart] A simple visualization of nullity by column.
[Figure omitted: matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
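Nullity by column is simply a count of missing entries per column. The sketch below computes it in plain Python for two columns of the first ten rows of this dataset, with `None` standing in for the `<NA>` marker. The helper name `missing_by_column` is hypothetical, and the report itself may compute these figures differently.

```python
from typing import Dict, List, Optional

Record = Dict[str, Optional[str]]

# 업소명 and 소재지전화 for the first ten rows; None marks a missing phone.
records: List[Record] = [
    {"업소명": "대연각포장센타", "소재지전화": "051-628-0220"},
    {"업소명": "블루 노래주점", "소재지전화": "051-628-4447"},
    {"업소명": "골든벨노래주점", "소재지전화": "051-618-3133"},
    {"업소명": "제시 노래주점", "소재지전화": "051-611-1911"},
    {"업소명": "준코 뮤직타운 경성대2호점", "소재지전화": "051-610-0496"},
    {"업소명": "주 노래주점", "소재지전화": "051-627-6700"},
    {"업소명": "스타", "소재지전화": None},
    {"업소명": "베니스", "소재지전화": "051-624-8822"},
    {"업소명": "술익는노래방", "소재지전화": None},
    {"업소명": "자전거", "소재지전화": "051-627-0633"},
]


def missing_by_column(rows: List[Record]) -> Dict[str, int]:
    """Count None entries per column, using the first row's keys as columns."""
    columns = list(rows[0].keys())
    return {col: sum(1 for row in rows if row[col] is None) for col in columns}


missing = missing_by_column(records)
for column, count in missing.items():
    share = 100.0 * count / len(records)
    print(f"{column}: {count} missing ({share:.1f}%)")
```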

Sample

First rows

 | 업종명 | 업소명 | 소재지(도로명) | 소재지(지번) | 소재지전화
0 | 유흥주점영업 | 대연각포장센타 | 부산광역시 남구 수영로266번길 5 (대연동) | 부산광역시 남구 대연동 324-18 | 051-628-0220
1 | 유흥주점영업 | 블루 노래주점 | 부산광역시 남구 수영로266번길 13 (대연동) | 부산광역시 남구 대연동 376-8 | 051-628-4447
2 | 유흥주점영업 | 골든벨노래주점 | 부산광역시 남구 진남로 9 (대연동) | 부산광역시 남구 대연동 1742-1 | 051-618-3133
3 | 유흥주점영업 | 제시 노래주점 | 부산광역시 남구 수영로219번길 12-4 (대연동) | 부산광역시 남구 대연동 1745-19 | 051-611-1911
4 | 유흥주점영업 | 준코 뮤직타운 경성대2호점 | 부산광역시 남구 용소로7번길 9 (대연동) | 부산광역시 남구 대연동 55-18 | 051-610-0496
5 | 유흥주점영업 | 주 노래주점 | 부산광역시 남구 수영로 246 (대연동) | 부산광역시 남구 대연동 1729-4 | 051-627-6700
6 | 유흥주점영업 | 스타 | 부산광역시 남구 수영로 19, 2층 (문현동) | 부산광역시 남구 문현동 405-6 | <NA>
7 | 유흥주점영업 | 베니스 | 부산광역시 남구 수영로219번길 12-1 (대연동) | 부산광역시 남구 대연동 1742-10 | 051-624-8822
8 | 유흥주점영업 | 술익는노래방 | 부산광역시 남구 수영로 205-10 (대연동) | 부산광역시 남구 대연동 1379-14 | <NA>
9 | 유흥주점영업 | 자전거 | 부산광역시 남구 수영로219번길 12-1 (대연동) | 부산광역시 남구 대연동 1742-10 | 051-627-0633

Last rows

 | 업종명 | 업소명 | 소재지(도로명) | 소재지(지번) | 소재지전화
98 | 단란주점 | 아리수 | 부산광역시 남구 용호로 145 (용호동) | 부산광역시 남구 용호동 394-54 | 051-627-7716
99 | 단란주점 | 헤라 | 부산광역시 남구 동명로 135 (용호동) | 부산광역시 남구 용호동 370-16 | 051-628-2811
100 | 단란주점 | 돈비치 | 부산광역시 남구 지게골로 4 (문현동) | 부산광역시 남구 문현동 846-1 | 051-632-0441
101 | 단란주점 | 샵노래주점 | 부산광역시 남구 수영로346번길 12 (대연동,(3층)) | 부산광역시 남구 대연동 39-28 (3층) | <NA>
102 | 단란주점 | 백악관 | 부산광역시 남구 동명로132번길 31 (용호동) | 부산광역시 남구 용호동 395-53 | <NA>
103 | 단란주점 | 엑스노래주점 | 부산광역시 남구 유엔평화로 16-1 (대연동) | 부산광역시 남구 대연동 986-5 | <NA>
104 | 단란주점 | 고고 7080 | 부산광역시 남구 동명로 135-1 (용호동) | 부산광역시 남구 용호동 370-43 | <NA>
105 | 단란주점 | 투다리앤뮤직타운 | 부산광역시 남구 동명로 131, 2층 (용호동) | 부산광역시 남구 용호동 370-14 | 051-612-3390
106 | 단란주점 | 온라인 가라오케 | 부산광역시 남구 유엔평화로13번길 2, 2층 (대연동) | 부산광역시 남구 대연동 887-1 | <NA>
107 | 단란주점 | 땡큐7080라이브 | 부산광역시 남구 용호로 161, 지하1층 (용호동) | 부산광역시 남구 용호동 494-1 | <NA>