Overview

Dataset statistics

Number of variables: 4
Number of observations: 66
Missing cells: 16
Missing cells (%): 6.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.2 KiB
Average record size in memory: 34.0 B
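
The percentages in these statistics follow from the raw counts. The sketch below redoes the arithmetic in Python. It assumes that missing cells are divided by rows × columns and that total memory is roughly the average record size times the row count; neither formula is stated in the report, and the function name is illustrative.

```python
def missing_cell_pct(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Share of missing cells among all cells, in percent (assumed formula)."""
    return 100.0 * missing_cells / (n_rows * n_cols)


# Figures taken from the dataset statistics: 16 missing cells, 66 rows, 4 columns.
pct = missing_cell_pct(16, 66, 4)

# 34.0 B per record times 66 records, converted to KiB.
total_kib = 34.0 * 66 / 1024

print(f"Missing cells: {pct:.1f}%")
print(f"Total size: {total_kib:.1f} KiB")
```

Both results round to the values shown in the report (6.1% and 2.2 KiB).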

Variable types

Categorical: 1
Text: 3

Dataset

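Description: 부산광역시남구여행업현황_20210701 (Travel agency status, Nam-gu, Busan Metropolitan City, 20210701)
Author: 부산광역시 남구 (Nam-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3034664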

Alerts

전화번호 (phone number) has 16 (24.2%) missing values [Missing]
상호 (business name) has unique values [Unique]

Reproduction

Analysis started: 2023-12-10 16:16:05.919303
Analysis finished: 2023-12-10 16:16:06.658674
Duration: 0.74 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

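업종 (business type)
Categorical

Distinct: 3
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Memory size: 660.0 B

국내여행업 (domestic travel business): 27
국외여행업 (overseas travel business): 27
일반여행업 (general travel business): 12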

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국내여행업
2nd row: 국내여행업
3rd row: 국내여행업
4th row: 국내여행업
5th row: 국내여행업

Common Values

Value | Count | Frequency (%)
국내여행업 | 27 | 40.9%
국외여행업 | 27 | 40.9%
일반여행업 | 12 | 18.2%
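
The counts and frequencies for this variable can be recomputed from the category counts alone. The sketch below rebuilds the column from those counts and derives the frequency, distinct, and unique figures with `collections.Counter`. It treats "unique" as "values that occur exactly once", which is an assumption consistent with the reported 0.

```python
from collections import Counter

# Rebuild the 66-row column from the reported category counts.
values = ["국내여행업"] * 27 + ["국외여행업"] * 27 + ["일반여행업"] * 12

counts = Counter(values)
n = len(values)

# Frequency of each category as a percentage of all rows.
freq = {value: round(100 * count / n, 1) for value, count in counts.items()}

distinct = len(counts)                                # number of different values
distinct_pct = round(100 * distinct / n, 1)
unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once

for value, count in counts.most_common():
    print(value, count, f"{freq[value]}%")
print("distinct:", distinct, f"({distinct_pct}%)", "unique:", unique)
```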

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values 국내여행업 (27, 40.9%), 국외여행업 (27, 40.9%), and 일반여행업 (12, 18.2%)]

상호 (business name)
Text

UNIQUE

Distinct: 66
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 660.0 B

Length

Max length: 20
Median length: 11
Mean length: 7.969697
Min length: 3

Characters and Unicode

Total characters: 526
Distinct characters: 161
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
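
As an illustration of those character properties, the sketch below uses Python's standard `unicodedata` module to count general categories in one business name taken from the report's sample rows. The mapping from two-letter category codes to the long names used in this report is written out by hand here and covers only the categories that occur in this variable; the function name is illustrative.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this variable.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "So": "Other Symbol",
}


def category_counts(text: str) -> Counter:
    """Count characters per Unicode general category, using long names."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


name = "주식회사 디와이투어 (DY tour)"
result = category_counts(name)

for category, count in result.most_common():
    print(category, count)
```

Hangul syllables fall under Other Letter, which is why that category dominates the totals for this variable.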

Unique

Unique: 66
Unique (%): 100.0%

Sample

1st row: (주)경성투어
2nd row: (주)라라여행
3rd row: (주)로터스투어
4th row: (주)스마트투어
5th row: (주)써니투어

Words

Value | Count | Frequency (%)
주식회사 | 7 | 8.5%
tour | 4 | 4.9%
락투어 | 1 | 1.2%
유니버스 | 1 | 1.2%
스쿨옥션 | 1 | 1.2%
여행사 | 1 | 1.2%
세미항공 | 1 | 1.2%
비엔투어 | 1 | 1.2%
레알투어(real | 1 | 1.2%
gd | 1 | 1.2%
Other values (63) | 63 | 76.8%

Most occurring characters

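Value | Count | Frequency (%)
[not recovered] | 41 | 7.8%
( | 37 | 7.0%
) | 37 | 7.0%
[not recovered] | 25 | 4.8%
[not recovered] | 25 | 4.8%
[not recovered] | 22 | 4.2%
[not recovered] | 22 | 4.2%
[not recovered] | 20 | 3.8%
[space] | 16 | 3.0%
[not recovered] | 14 | 2.7%
Other values (151) | 267 | 50.8%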

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 400 | 76.0%
Open Punctuation | 37 | 7.0%
Close Punctuation | 37 | 7.0%
Uppercase Letter | 31 | 5.9%
Space Separator | 16 | 3.0%
Lowercase Letter | 4 | 0.8%
Other Symbol | 1 | 0.2%

Most frequent character per category

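Other Letter
Value | Count | Frequency (%)
[not recovered] | 41 | 10.2%
[not recovered] | 25 | 6.2%
[not recovered] | 25 | 6.2%
[not recovered] | 22 | 5.5%
[not recovered] | 22 | 5.5%
[not recovered] | 20 | 5.0%
[not recovered] | 14 | 3.5%
[not recovered] | 12 | 3.0%
[not recovered] | 8 | 2.0%
[not recovered] | 7 | 1.8%
Other values (128) | 204 | 51.0%

Uppercase Letter
Value | Count | Frequency (%)
O | 5 | 16.1%
R | 4 | 12.9%
U | 3 | 9.7%
T | 3 | 9.7%
D | 2 | 6.5%
A | 2 | 6.5%
P | 2 | 6.5%
L | 2 | 6.5%
Y | 2 | 6.5%
G | 1 | 3.2%
Other values (5) | 5 | 16.1%

Lowercase Letter
Value | Count | Frequency (%)
o | 1 | 25.0%
u | 1 | 25.0%
t | 1 | 25.0%
r | 1 | 25.0%

Open Punctuation
Value | Count | Frequency (%)
( | 37 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 37 | 100.0%

Space Separator
Value | Count | Frequency (%)
[space] | 16 | 100.0%

Other Symbol
Value | Count | Frequency (%)
[not recovered] | 1 | 100.0%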

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 401 | 76.2%
Common | 90 | 17.1%
Latin | 35 | 6.7%

Most frequent character per script

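Hangul
Value | Count | Frequency (%)
[not recovered] | 41 | 10.2%
[not recovered] | 25 | 6.2%
[not recovered] | 25 | 6.2%
[not recovered] | 22 | 5.5%
[not recovered] | 22 | 5.5%
[not recovered] | 20 | 5.0%
[not recovered] | 14 | 3.5%
[not recovered] | 12 | 3.0%
[not recovered] | 8 | 2.0%
[not recovered] | 7 | 1.7%
Other values (129) | 205 | 51.1%

Latin
Value | Count | Frequency (%)
O | 5 | 14.3%
R | 4 | 11.4%
U | 3 | 8.6%
T | 3 | 8.6%
D | 2 | 5.7%
A | 2 | 5.7%
P | 2 | 5.7%
L | 2 | 5.7%
Y | 2 | 5.7%
G | 1 | 2.9%
Other values (9) | 9 | 25.7%

Common
Value | Count | Frequency (%)
( | 37 | 41.1%
) | 37 | 41.1%
[space] | 16 | 17.8%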

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 400 | 76.0%
ASCII | 125 | 23.8%
None | 1 | 0.2%

Most frequent character per block

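Hangul
Value | Count | Frequency (%)
[not recovered] | 41 | 10.2%
[not recovered] | 25 | 6.2%
[not recovered] | 25 | 6.2%
[not recovered] | 22 | 5.5%
[not recovered] | 22 | 5.5%
[not recovered] | 20 | 5.0%
[not recovered] | 14 | 3.5%
[not recovered] | 12 | 3.0%
[not recovered] | 8 | 2.0%
[not recovered] | 7 | 1.8%
Other values (128) | 204 | 51.0%

ASCII
Value | Count | Frequency (%)
( | 37 | 29.6%
) | 37 | 29.6%
[space] | 16 | 12.8%
O | 5 | 4.0%
R | 4 | 3.2%
U | 3 | 2.4%
T | 3 | 2.4%
D | 2 | 1.6%
A | 2 | 1.6%
P | 2 | 1.6%
Other values (12) | 14 | 11.2%

None
Value | Count | Frequency (%)
[not recovered] | 1 | 100.0%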

소재지(도로명) (location, road-name address)
Text

Distinct: 65
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 660.0 B

Length

Max length: 51
Median length: 45
Mean length: 38.257576
Min length: 22

Characters and Unicode

Total characters: 2525
Distinct characters: 162
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 64
Unique (%): 97.0%

Sample

1st row: 부산광역시 남구 수영로 312, 21 센츄리시티 오피스텔 B1층 제18-2호 (대연동)
2nd row: 부산광역시 남구 분포로 145, 스퀘어동 1층 1089호 (용호동, 더블유)
3rd row: 부산광역시 남구 수영로 312, 16층 1608호 (대연동, 21세기 센츄리시티 )
4th row: 부산광역시 남구 수영로 312, 302호 (대연동, 21센츄리시티오피스텔)
5th row: 부산광역시 남구 전포대로91번길 47, 3층 (문현동, 이마트)

Words

Value | Count | Frequency (%)
부산광역시 | 66 | 13.0%
남구 | 66 | 13.0%
대연동 | 32 | 6.3%
수영로 | 29 | 5.7%
312 | 16 | 3.2%
문현동 | 16 | 3.2%
센츄리시티 | 12 | 2.4%
오피스텔 | 11 | 2.2%
21 | 11 | 2.2%
용호동 | 10 | 2.0%
Other values (171) | 238 | 46.9%

Most occurring characters

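Value | Count | Frequency (%)
[space] | 441 | 17.5%
1 | 136 | 5.4%
2 | 101 | 4.0%
, | 92 | 3.6%
[not recovered] | 90 | 3.6%
[not recovered] | 83 | 3.3%
[not recovered] | 70 | 2.8%
[not recovered] | 69 | 2.7%
( | 66 | 2.6%
[not recovered] | 66 | 2.6%
Other values (152) | 1311 | 51.9%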

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1373 | 54.4%
Decimal Number | 455 | 18.0%
Space Separator | 441 | 17.5%
Other Punctuation | 93 | 3.7%
Open Punctuation | 66 | 2.6%
Close Punctuation | 66 | 2.6%
Uppercase Letter | 22 | 0.9%
Dash Punctuation | 4 | 0.2%
Lowercase Letter | 4 | 0.2%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[not recovered] | 90 | 6.6%
[not recovered] | 83 | 6.0%
[not recovered] | 70 | 5.1%
[not recovered] | 69 | 5.0%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 64 | 4.7%
Other values (120) | 667 | 48.6%

Uppercase Letter
Value | Count | Frequency (%)
B | 6 | 27.3%
I | 3 | 13.6%
C | 3 | 13.6%
F | 2 | 9.1%
T | 1 | 4.5%
S | 1 | 4.5%
K | 1 | 4.5%
V | 1 | 4.5%
E | 1 | 4.5%
W | 1 | 4.5%
Other values (2) | 2 | 9.1%

Decimal Number
Value | Count | Frequency (%)
1 | 136 | 29.9%
2 | 101 | 22.2%
3 | 55 | 12.1%
0 | 44 | 9.7%
5 | 28 | 6.2%
4 | 23 | 5.1%
6 | 21 | 4.6%
8 | 21 | 4.6%
9 | 17 | 3.7%
7 | 9 | 2.0%

Lowercase Letter
Value | Count | Frequency (%)
l | 2 | 50.0%
i | 1 | 25.0%
s | 1 | 25.0%

Other Punctuation
Value | Count | Frequency (%)
, | 92 | 98.9%
/ | 1 | 1.1%

Space Separator
Value | Count | Frequency (%)
[space] | 441 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 66 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 66 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 4 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1373 | 54.4%
Common | 1126 | 44.6%
Latin | 26 | 1.0%

Most frequent character per script

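Hangul
Value | Count | Frequency (%)
[not recovered] | 90 | 6.6%
[not recovered] | 83 | 6.0%
[not recovered] | 70 | 5.1%
[not recovered] | 69 | 5.0%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 64 | 4.7%
Other values (120) | 667 | 48.6%

Common
Value | Count | Frequency (%)
[space] | 441 | 39.2%
1 | 136 | 12.1%
2 | 101 | 9.0%
, | 92 | 8.2%
( | 66 | 5.9%
) | 66 | 5.9%
3 | 55 | 4.9%
0 | 44 | 3.9%
5 | 28 | 2.5%
4 | 23 | 2.0%
Other values (7) | 74 | 6.6%

Latin
Value | Count | Frequency (%)
B | 6 | 23.1%
I | 3 | 11.5%
C | 3 | 11.5%
F | 2 | 7.7%
l | 2 | 7.7%
T | 1 | 3.8%
S | 1 | 3.8%
K | 1 | 3.8%
V | 1 | 3.8%
E | 1 | 3.8%
Other values (5) | 5 | 19.2%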

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1373 | 54.4%
ASCII | 1152 | 45.6%

Most frequent character per block

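ASCII
Value | Count | Frequency (%)
[space] | 441 | 38.3%
1 | 136 | 11.8%
2 | 101 | 8.8%
, | 92 | 8.0%
( | 66 | 5.7%
) | 66 | 5.7%
3 | 55 | 4.8%
0 | 44 | 3.8%
5 | 28 | 2.4%
4 | 23 | 2.0%
Other values (22) | 100 | 8.7%

Hangul
Value | Count | Frequency (%)
[not recovered] | 90 | 6.6%
[not recovered] | 83 | 6.0%
[not recovered] | 70 | 5.1%
[not recovered] | 69 | 5.0%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 66 | 4.8%
[not recovered] | 64 | 4.7%
Other values (120) | 667 | 48.6%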

전화번호 (phone number)
Text

MISSING

Distinct: 49
Distinct (%): 98.0%
Missing: 16
Missing (%): 24.2%
Memory size: 660.0 B

Length

Max length: 13
Median length: 12
Mean length: 11.8
Min length: 8
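
The length statistics are plain string lengths taken over the non-missing values. The sketch below shows the computation on three phone numbers that appear elsewhere in this report plus one missing entry, so its summary differs from the full-column figures. It also checks the reported mean against total characters divided by non-missing rows (590 / 50), assuming that is how the mean is derived.

```python
from statistics import mean, median

# Three values from the report and one missing entry (None).
phones = ["051-611-8833", "070-4366-7008", "642-1036", None]

# Missing values are skipped before measuring length.
lengths = [len(p) for p in phones if p is not None]

summary = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(summary)

# Full-column check: 590 characters over 66 - 16 = 50 non-missing rows.
column_mean = 590 / (66 - 16)
print(f"{column_mean:.1f}")
```

The 8-character minimum corresponds to a number recorded without an area code, and the 13-character maximum to a 070 number.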

Characters and Unicode

Total characters: 590
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 48
Unique (%): 96.0%

Sample

1st row: 051-611-8833
2nd row: 051-464-8899
3rd row: 051-925-0051
4th row: 051-529-0700
5th row: 051-714-0804

Words

Value | Count | Frequency (%)
051-465-6817 | 2 | 4.0%
051-611-5040 | 1 | 2.0%
051-611-8833 | 1 | 2.0%
051-611-7667 | 1 | 2.0%
051-635-6555 | 1 | 2.0%
051-632-3000 | 1 | 2.0%
051-634-7715 | 1 | 2.0%
070-4366-7008 | 1 | 2.0%
051-937-2800 | 1 | 2.0%
051-465-0070 | 1 | 2.0%
Other values (39) | 39 | 78.0%
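
The distinct and unique percentages for this variable only add up if they are taken over the 50 non-missing values, while the missing percentage is taken over all 66 rows. The sketch below reproduces the reported figures on a synthetic stand-in column: 48 placeholder values that occur once, the one number the report shows twice, and 16 missing entries. The placeholder strings are invented for illustration and are not real data.

```python
from collections import Counter

# Synthetic column with the same shape as the reported one.
values = [f"placeholder-{i}" for i in range(48)] + ["051-465-6817"] * 2 + [None] * 16

present = [v for v in values if v is not None]
counts = Counter(present)

missing = len(values) - len(present)
distinct = len(counts)                                # different non-missing values
unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once

missing_pct = round(100 * missing / len(values), 1)       # over all rows
distinct_pct = round(100 * distinct / len(present), 1)    # over non-missing rows
unique_pct = round(100 * unique / len(present), 1)

print(missing, missing_pct, distinct, distinct_pct, unique, unique_pct)
```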

Most occurring characters

Value | Count | Frequency (%)
- | 97 | 16.4%
0 | 92 | 15.6%
1 | 91 | 15.4%
5 | 76 | 12.9%
6 | 60 | 10.2%
4 | 38 | 6.4%
3 | 32 | 5.4%
7 | 31 | 5.3%
8 | 29 | 4.9%
2 | 26 | 4.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 493 | 83.6%
Dash Punctuation | 97 | 16.4%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 92 | 18.7%
1 | 91 | 18.5%
5 | 76 | 15.4%
6 | 60 | 12.2%
4 | 38 | 7.7%
3 | 32 | 6.5%
7 | 31 | 6.3%
8 | 29 | 5.9%
2 | 26 | 5.3%
9 | 18 | 3.7%

Dash Punctuation
Value | Count | Frequency (%)
- | 97 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 590 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 97 | 16.4%
0 | 92 | 15.6%
1 | 91 | 15.4%
5 | 76 | 12.9%
6 | 60 | 10.2%
4 | 38 | 6.4%
3 | 32 | 5.4%
7 | 31 | 5.3%
8 | 29 | 4.9%
2 | 26 | 4.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 590 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 97 | 16.4%
0 | 92 | 15.6%
1 | 91 | 15.4%
5 | 76 | 12.9%
6 | 60 | 10.2%
4 | 38 | 6.4%
3 | 32 | 5.4%
7 | 31 | 5.3%
8 | 29 | 4.9%
2 | 26 | 4.4%

Correlations

 | 업종 | 상호 | 소재지(도로명) | 전화번호
업종 | 1.000 | 1.000 | 1.000 | 1.000
상호 | 1.000 | 1.000 | 1.000 | 1.000
소재지(도로명) | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 1.000 | 1.000 | 1.000 | 1.000

Missing values

[Figure: count] A simple visualization of nullity by column.
[Figure: matrix] The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
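
As a rough illustration of what these two displays encode, the sketch below builds a nullity matrix (one boolean per cell, True where a value is present) and per-column counts for four rows from the report's sample, restricted to the 상호 and 전화번호 columns. It uses plain Python and is not how the profiling tool renders its plots.

```python
columns = ["상호", "전화번호"]

# Rows 0 to 3 of the report's sample; None stands for <NA>.
rows = [
    ("(주)경성투어", "051-611-8833"),
    ("(주)라라여행", "051-464-8899"),
    ("(주)로터스투어", None),
    ("(주)스마트투어", "051-925-0051"),
]

# Nullity matrix: True where a cell holds a value, False where it is missing.
matrix = [[cell is not None for cell in row] for row in rows]

# Per-column count of present values (the bar-style "count" view).
present_per_column = {
    name: sum(flags[i] for flags in matrix) for i, name in enumerate(columns)
}

# Text rendering of the matrix: '#' for present, '.' for missing.
for flags in matrix:
    print("".join("#" if present else "." for present in flags))
print(present_per_column)
```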

Sample

First rows

 | 업종 | 상호 | 소재지(도로명) | 전화번호
0 | 국내여행업 | (주)경성투어 | 부산광역시 남구 수영로 312, 21 센츄리시티 오피스텔 B1층 제18-2호 (대연동) | 051-611-8833
1 | 국내여행업 | (주)라라여행 | 부산광역시 남구 분포로 145, 스퀘어동 1층 1089호 (용호동, 더블유) | 051-464-8899
2 | 국내여행업 | (주)로터스투어 | 부산광역시 남구 수영로 312, 16층 1608호 (대연동, 21세기 센츄리시티 ) | <NA>
3 | 국내여행업 | (주)스마트투어 | 부산광역시 남구 수영로 312, 302호 (대연동, 21센츄리시티오피스텔) | 051-925-0051
4 | 국내여행업 | (주)써니투어 | 부산광역시 남구 전포대로91번길 47, 3층 (문현동, 이마트) | 051-529-0700
5 | 국내여행업 | (주)여행나무 | 부산광역시 남구 수영로 209, 102동 302호 (대연동, 일동 지에닌) | <NA>
6 | 국내여행업 | (주)요트북 | 부산광역시 남구 신선로 365, 부경대학교용당캠퍼스,부산창업지원센터 307호 (용당동) | 051-714-0804
7 | 국내여행업 | (주)웰빙투어 | 부산광역시 남구 수영로 312, 21 센츄리시티 오피스텔 12층 1235호 (대연동) | 051-701-4994
8 | 국내여행업 | (주)유림투어 | 부산광역시 남구 분포로 115, B동 3층 312호 (용호동, 힐탑탑플레이스) | 02-1644-6015
9 | 국내여행업 | (주)으뜸문화항공 | 부산광역시 남구 수영로 261, 401동 211호 (대연동, 대연 SK VIEW Hills) | 051-441-4001

Last rows

 | 업종 | 상호 | 소재지(도로명) | 전화번호
56 | 일반여행업 | (주)제일항공여행사 | 부산광역시 남구 석포로 103 (대연동) | 051-441-2811
57 | 일반여행업 | (주)태산 | 부산광역시 남구 전포대로 116, 2층 (문현동) | 051-782-6668
58 | 일반여행업 | (주)투어하이유 | 부산광역시 남구 분포로 115, B동 404호 (용호동, 힐탑탑플레이스) | 051-463-3553
59 | 일반여행업 | AM픽쳐스 | 부산광역시 남구 수영로 234, 부산은행대연동지점 3층 (대연동) | 051-611-3101
60 | 일반여행업 | 감만동새마을금고(남부여행사) | 부산광역시 남구 석포로 29 (감만동, 감만2동새마을금고) | 642-1036
61 | 일반여행업 | 씨케이브릿지 주식회사 | 부산광역시 남구 신선로 365, 부경대학교용당캠퍼스 종합실습관동 112호 (용당동) | 051-715-5155
62 | 일반여행업 | 주식회사 더해피투어 | 부산광역시 남구 고동골로 29, 문현베스티움 아파트 111동 106호 (문현동) | 051-638-5959
63 | 일반여행업 | 주식회사 디와이투어 (DY tour) | 부산광역시 남구 수영로 312, 21 센츄리시티 오피스텔 1221호 (대연동) | 051-611-5040
64 | 일반여행업 | 주식회사 유니언 | 부산광역시 남구 동명로 26, 현대아이파크 108동 202호 (용당동) | <NA>
65 | 일반여행업 | 주식회사 케이티고 | 부산광역시 남구 수영로 312, 21 센츄리시티 오피스텔 925호 (대연동) | 051-753-4818