Overview

Dataset statistics

Number of variables5
Number of observations35
Missing cells33
Missing cells (%)18.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory43.7 B

Variable types

Text3
Categorical2

Dataset

Description부산광역시남구_게임제공업현황_20220926
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15045445

Alerts

업종명 is highly overall correlated with 기타유의사항High correlation
기타유의사항 is highly overall correlated with 업종명High correlation
업종명 is highly imbalanced (68.4%)Imbalance
기타유의사항 is highly imbalanced (68.4%)Imbalance
전화번호 has 33 (94.3%) missing valuesMissing
영업소소재지(도로명) has unique valuesUnique

Reproduction

Analysis started2024-04-21 08:11:39.286703
Analysis finished2024-04-21 08:11:39.905330
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct34
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size408.0 B
2024-04-21T17:11:40.460620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length9
Mean length6.2
Min length2

Characters and Unicode

Total characters217
Distinct characters100
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)94.3%

Sample

1st row대연 게임랜드
2nd row황금게임랜드
3rd row투아트플스방
4th row갤러리
5th row평화게임랜드
ValueCountFrequency (%)
뽀끼노리 2
 
4.5%
게임랜드 1
 
2.3%
인형뽑기놀이(부산문현점 1
 
2.3%
장난감 1
 
2.3%
세상 1
 
2.3%
퍼니2 1
 
2.3%
인형샵 1
 
2.3%
모찌게임장 1
 
2.3%
황금게임랜드 1
 
2.3%
대연 1
 
2.3%
Other values (33) 33
75.0%
2024-04-21T17:11:41.370861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12
 
5.5%
12
 
5.5%
12
 
5.5%
11
 
5.1%
9
 
4.1%
7
 
3.2%
7
 
3.2%
7
 
3.2%
6
 
2.8%
4
 
1.8%
Other values (90) 130
59.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 190
87.6%
Uppercase Letter 11
 
5.1%
Space Separator 9
 
4.1%
Open Punctuation 3
 
1.4%
Close Punctuation 3
 
1.4%
Decimal Number 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
6.3%
12
 
6.3%
12
 
6.3%
11
 
5.8%
7
 
3.7%
7
 
3.7%
7
 
3.7%
6
 
3.2%
4
 
2.1%
4
 
2.1%
Other values (76) 108
56.8%
Uppercase Letter
ValueCountFrequency (%)
A 2
18.2%
D 1
9.1%
H 1
9.1%
C 1
9.1%
G 1
9.1%
N 1
9.1%
U 1
9.1%
F 1
9.1%
O 1
9.1%
M 1
9.1%
Space Separator
ValueCountFrequency (%)
9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 190
87.6%
Common 16
 
7.4%
Latin 11
 
5.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
6.3%
12
 
6.3%
12
 
6.3%
11
 
5.8%
7
 
3.7%
7
 
3.7%
7
 
3.7%
6
 
3.2%
4
 
2.1%
4
 
2.1%
Other values (76) 108
56.8%
Latin
ValueCountFrequency (%)
A 2
18.2%
D 1
9.1%
H 1
9.1%
C 1
9.1%
G 1
9.1%
N 1
9.1%
U 1
9.1%
F 1
9.1%
O 1
9.1%
M 1
9.1%
Common
ValueCountFrequency (%)
9
56.2%
( 3
 
18.8%
) 3
 
18.8%
2 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 190
87.6%
ASCII 27
 
12.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12
 
6.3%
12
 
6.3%
12
 
6.3%
11
 
5.8%
7
 
3.7%
7
 
3.7%
7
 
3.7%
6
 
3.2%
4
 
2.1%
4
 
2.1%
Other values (76) 108
56.8%
ASCII
ValueCountFrequency (%)
9
33.3%
( 3
 
11.1%
) 3
 
11.1%
A 2
 
7.4%
D 1
 
3.7%
2 1
 
3.7%
H 1
 
3.7%
C 1
 
3.7%
G 1
 
3.7%
N 1
 
3.7%
Other values (4) 4
14.8%
Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size408.0 B
2024-04-21T17:11:42.165617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length34
Mean length28.628571
Min length20

Characters and Unicode

Total characters1002
Distinct characters80
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row부산광역시 남구 수영로196번길 8, B동 2층 (대연동)
2nd row부산광역시 남구 용호로 149, 전용태한의원 2층 (용호동)
3rd row부산광역시 남구 용소로8번길 7 (대연동)
4th row부산광역시 남구 수영로 313-1 (대연동)
5th row부산광역시 남구 황령대로90번다길 8 (문현동)
ValueCountFrequency (%)
부산광역시 35
16.7%
남구 35
16.7%
대연동 17
 
8.1%
1층 11
 
5.2%
용호동 9
 
4.3%
문현동 7
 
3.3%
6 4
 
1.9%
수영로 4
 
1.9%
용소로7번길 4
 
1.9%
용호로 3
 
1.4%
Other values (67) 81
38.6%
2024-04-21T17:11:43.440665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
175
 
17.5%
1 57
 
5.7%
43
 
4.3%
36
 
3.6%
36
 
3.6%
35
 
3.5%
35
 
3.5%
35
 
3.5%
35
 
3.5%
35
 
3.5%
Other values (70) 480
47.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 564
56.3%
Space Separator 175
 
17.5%
Decimal Number 158
 
15.8%
Close Punctuation 35
 
3.5%
Open Punctuation 35
 
3.5%
Other Punctuation 28
 
2.8%
Dash Punctuation 5
 
0.5%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
43
 
7.6%
36
 
6.4%
36
 
6.4%
35
 
6.2%
35
 
6.2%
35
 
6.2%
35
 
6.2%
35
 
6.2%
35
 
6.2%
23
 
4.1%
Other values (54) 216
38.3%
Decimal Number
ValueCountFrequency (%)
1 57
36.1%
3 18
 
11.4%
2 17
 
10.8%
0 12
 
7.6%
9 10
 
6.3%
6 10
 
6.3%
8 10
 
6.3%
7 9
 
5.7%
5 9
 
5.7%
4 6
 
3.8%
Space Separator
ValueCountFrequency (%)
175
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 35
100.0%
Other Punctuation
ValueCountFrequency (%)
, 28
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 564
56.3%
Common 436
43.5%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
43
 
7.6%
36
 
6.4%
36
 
6.4%
35
 
6.2%
35
 
6.2%
35
 
6.2%
35
 
6.2%
35
 
6.2%
35
 
6.2%
23
 
4.1%
Other values (54) 216
38.3%
Common
ValueCountFrequency (%)
175
40.1%
1 57
 
13.1%
) 35
 
8.0%
( 35
 
8.0%
, 28
 
6.4%
3 18
 
4.1%
2 17
 
3.9%
0 12
 
2.8%
9 10
 
2.3%
6 10
 
2.3%
Other values (5) 39
 
8.9%
Latin
ValueCountFrequency (%)
B 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 564
56.3%
ASCII 438
43.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
175
40.0%
1 57
 
13.0%
) 35
 
8.0%
( 35
 
8.0%
, 28
 
6.4%
3 18
 
4.1%
2 17
 
3.9%
0 12
 
2.7%
9 10
 
2.3%
6 10
 
2.3%
Other values (6) 41
 
9.4%
Hangul
ValueCountFrequency (%)
43
 
7.6%
36
 
6.4%
36
 
6.4%
35
 
6.2%
35
 
6.2%
35
 
6.2%
35
 
6.2%
35
 
6.2%
35
 
6.2%
23
 
4.1%
Other values (54) 216
38.3%

전화번호
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing33
Missing (%)94.3%
Memory size408.0 B
2024-04-21T17:11:43.886609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length10
Min length8

Characters and Unicode

Total characters20
Distinct characters9
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row625-3927
2nd row051-612-9223
ValueCountFrequency (%)
625-3927 1
50.0%
051-612-9223 1
50.0%
2024-04-21T17:11:44.559828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 5
25.0%
- 3
15.0%
6 2
 
10.0%
5 2
 
10.0%
3 2
 
10.0%
9 2
 
10.0%
1 2
 
10.0%
7 1
 
5.0%
0 1
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 17
85.0%
Dash Punctuation 3
 
15.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 5
29.4%
6 2
 
11.8%
5 2
 
11.8%
3 2
 
11.8%
9 2
 
11.8%
1 2
 
11.8%
7 1
 
5.9%
0 1
 
5.9%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 20
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 5
25.0%
- 3
15.0%
6 2
 
10.0%
5 2
 
10.0%
3 2
 
10.0%
9 2
 
10.0%
1 2
 
10.0%
7 1
 
5.0%
0 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 5
25.0%
- 3
15.0%
6 2
 
10.0%
5 2
 
10.0%
3 2
 
10.0%
9 2
 
10.0%
1 2
 
10.0%
7 1
 
5.0%
0 1
 
5.0%

업종명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size408.0 B
청소년게임제공업
33 
일반게임제공업
 
2

Length

Max length8
Median length8
Mean length7.9428571
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반게임제공업
2nd row일반게임제공업
3rd row청소년게임제공업
4th row청소년게임제공업
5th row청소년게임제공업

Common Values

ValueCountFrequency (%)
청소년게임제공업 33
94.3%
일반게임제공업 2
 
5.7%

Length

2024-04-21T17:11:44.778597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T17:11:45.098920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
청소년게임제공업 33
94.3%
일반게임제공업 2
 
5.7%

기타유의사항
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size408.0 B
개인정보 포함
33 
<NA>
 
2

Length

Max length7
Median length7
Mean length6.8285714
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인정보 포함
2nd row개인정보 포함
3rd row개인정보 포함
4th row<NA>
5th row개인정보 포함

Common Values

ValueCountFrequency (%)
개인정보 포함 33
94.3%
<NA> 2
 
5.7%

Length

2024-04-21T17:11:45.458396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T17:11:45.787036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인정보 33
48.5%
포함 33
48.5%
na 2
 
2.9%

Correlations

2024-04-21T17:11:45.980681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상호영업소소재지(도로명)전화번호업종명
상호1.0001.0000.0001.000
영업소소재지(도로명)1.0001.0000.0001.000
전화번호0.0000.0001.000NaN
업종명1.0001.000NaN1.000
2024-04-21T17:11:46.241270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명기타유의사항
업종명1.0001.000
기타유의사항1.0001.000
2024-04-21T17:11:46.470613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명기타유의사항
업종명1.0001.000
기타유의사항1.0001.000

Missing values

2024-04-21T17:11:39.668141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T17:11:39.840856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호영업소소재지(도로명)전화번호업종명기타유의사항
0대연 게임랜드부산광역시 남구 수영로196번길 8, B동 2층 (대연동)<NA>일반게임제공업개인정보 포함
1황금게임랜드부산광역시 남구 용호로 149, 전용태한의원 2층 (용호동)<NA>일반게임제공업개인정보 포함
2투아트플스방부산광역시 남구 용소로8번길 7 (대연동)<NA>청소년게임제공업개인정보 포함
3갤러리부산광역시 남구 수영로 313-1 (대연동)625-3927청소년게임제공업<NA>
4평화게임랜드부산광역시 남구 황령대로90번다길 8 (문현동)<NA>청소년게임제공업개인정보 포함
5히트게임랜드부산광역시 남구 용호로198번길 5 (용호동)<NA>청소년게임제공업개인정보 포함
6대박게임랜드부산광역시 남구 용호로 191-1 (용호동)<NA>청소년게임제공업개인정보 포함
7유엔게임랜드부산광역시 남구 석포로 132-1 (대연동)<NA>청소년게임제공업개인정보 포함
8시티게임랜드부산광역시 남구 수영로 26, 101동 1층 102호 (문현동, 대림시티프라자)<NA>청소년게임제공업개인정보 포함
9모펀(MOFUN)부산광역시 남구 용소로13번길 13, 4층 (대연동)<NA>청소년게임제공업개인정보 포함
상호영업소소재지(도로명)전화번호업종명기타유의사항
25신대왕게임센터부산광역시 남구 용소로13번길 6, 도성빌딩 지하1층 (대연동)<NA>청소년게임제공업개인정보 포함
26D오락실부산광역시 남구 수영로298번길 10, 1층2층3층 (대연동)<NA>청소년게임제공업개인정보 포함
27모찌게임장부산광역시 남구 수영로250번길 22, 1층 (대연동)<NA>청소년게임제공업개인정보 포함
28퍼니2 인형샵부산광역시 남구 동명로158번길 104, 104호 (용호동)<NA>청소년게임제공업개인정보 포함
29장난감 세상부산광역시 남구 동명로 129, 1층 (용호동)<NA>청소년게임제공업개인정보 포함
30인형뽑기놀이(부산문현점)부산광역시 남구 지게골로 31, 137,144호 (문현동, 문현상가)<NA>청소년게임제공업개인정보 포함
31문현뽑기방부산광역시 남구 고동골로 6, 1층 (문현동)<NA>청소년게임제공업개인정보 포함
32뽀끼노리부산광역시 남구 동명로 113 (용호동)<NA>청소년게임제공업개인정보 포함
33루피 인형뽑기방부산광역시 남구 진남로 196, 1층 (문현동)<NA>청소년게임제공업개인정보 포함
34뽑기뽑기 매니아부산광역시 남구 용소로7번길 10, 1층 (대연동)<NA>청소년게임제공업개인정보 포함