Overview

Dataset statistics

Number of variables4
Number of observations90
Missing cells28
Missing cells (%)7.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.9 KiB
Average record size in memory33.5 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시남구여행업현황_20220621
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3034664

Alerts

전화번호 has 28 (31.1%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:16:00.456879
Analysis finished2023-12-10 16:16:01.339941
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size852.0 B
국내외여행업
49 
국내여행업
21 
종합여행업
20 

Length

Max length6
Median length6
Mean length5.5444444
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내외여행업 49
54.4%
국내여행업 21
23.3%
종합여행업 20
22.2%

Length

2023-12-11T01:16:01.442882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:16:01.654516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 49
54.4%
국내여행업 21
23.3%
종합여행업 20
22.2%

상호
Text

Distinct75
Distinct (%)83.3%
Missing0
Missing (%)0.0%
Memory size852.0 B
2023-12-11T01:16:02.034023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length11
Mean length8.1666667
Min length3

Characters and Unicode

Total characters735
Distinct characters172
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique60 ?
Unique (%)66.7%

Sample

1st row늘푸른여행사
2nd row(주)써니투어
3rd row(주)유림투어
4th row부산투어 전세버스
5th row성공그린여행사
ValueCountFrequency (%)
주식회사 9
 
8.3%
tour 3
 
2.8%
투어마스터 2
 
1.8%
여행사 2
 
1.8%
주)써니투어 2
 
1.8%
주)웰빙투어 2
 
1.8%
주)도계투어 2
 
1.8%
싱글스투어 2
 
1.8%
주)요트북 2
 
1.8%
늘푸른여행사 2
 
1.8%
Other values (73) 81
74.3%
2023-12-11T01:16:02.692455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
61
 
8.3%
( 54
 
7.3%
) 54
 
7.3%
43
 
5.9%
41
 
5.6%
28
 
3.8%
26
 
3.5%
22
 
3.0%
22
 
3.0%
19
 
2.6%
Other values (162) 365
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 580
78.9%
Open Punctuation 54
 
7.3%
Close Punctuation 54
 
7.3%
Uppercase Letter 24
 
3.3%
Space Separator 19
 
2.6%
Lowercase Letter 4
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
61
 
10.5%
43
 
7.4%
41
 
7.1%
28
 
4.8%
26
 
4.5%
22
 
3.8%
22
 
3.8%
19
 
3.3%
14
 
2.4%
11
 
1.9%
Other values (142) 293
50.5%
Uppercase Letter
ValueCountFrequency (%)
O 4
16.7%
T 2
8.3%
D 2
8.3%
U 2
8.3%
R 2
8.3%
Y 2
8.3%
P 2
8.3%
I 2
8.3%
V 2
8.3%
M 1
 
4.2%
Other values (3) 3
12.5%
Lowercase Letter
ValueCountFrequency (%)
o 1
25.0%
u 1
25.0%
r 1
25.0%
t 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 54
100.0%
Close Punctuation
ValueCountFrequency (%)
) 54
100.0%
Space Separator
ValueCountFrequency (%)
19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 580
78.9%
Common 127
 
17.3%
Latin 28
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
61
 
10.5%
43
 
7.4%
41
 
7.1%
28
 
4.8%
26
 
4.5%
22
 
3.8%
22
 
3.8%
19
 
3.3%
14
 
2.4%
11
 
1.9%
Other values (142) 293
50.5%
Latin
ValueCountFrequency (%)
O 4
14.3%
T 2
 
7.1%
D 2
 
7.1%
U 2
 
7.1%
R 2
 
7.1%
Y 2
 
7.1%
P 2
 
7.1%
I 2
 
7.1%
V 2
 
7.1%
o 1
 
3.6%
Other values (7) 7
25.0%
Common
ValueCountFrequency (%)
( 54
42.5%
) 54
42.5%
19
 
15.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 580
78.9%
ASCII 155
 
21.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
61
 
10.5%
43
 
7.4%
41
 
7.1%
28
 
4.8%
26
 
4.5%
22
 
3.8%
22
 
3.8%
19
 
3.3%
14
 
2.4%
11
 
1.9%
Other values (142) 293
50.5%
ASCII
ValueCountFrequency (%)
( 54
34.8%
) 54
34.8%
19
 
12.3%
O 4
 
2.6%
T 2
 
1.3%
D 2
 
1.3%
U 2
 
1.3%
R 2
 
1.3%
Y 2
 
1.3%
P 2
 
1.3%
Other values (10) 12
 
7.7%
Distinct71
Distinct (%)78.9%
Missing0
Missing (%)0.0%
Memory size852.0 B
2023-12-11T01:16:03.096442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length45
Mean length36.566667
Min length21

Characters and Unicode

Total characters3291
Distinct characters174
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)60.0%

Sample

1st row부산광역시 남구 전포대로 26, 상가동 125호 (문현동, 문현삼성힐타워)
2nd row부산광역시 남구 전포대로91번길 47, 3층 (문현동, 이마트)
3rd row부산광역시 남구 분포로 115, B동 3층 312호 (용호동, 힐탑탑플레이스)
4th row부산광역시 남구 동명로118번길 82 (용호동, 한우한마당)
5th row부산광역시 남구 못골번영로 84, 1층 (대연동)
ValueCountFrequency (%)
부산광역시 90
 
13.7%
남구 90
 
13.7%
대연동 43
 
6.6%
수영로 32
 
4.9%
문현동 22
 
3.4%
312 13
 
2.0%
3층 12
 
1.8%
용호동 11
 
1.7%
21 9
 
1.4%
센츄리시티 9
 
1.4%
Other values (177) 324
49.5%
2023-12-11T01:16:03.674160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
565
 
17.2%
1 142
 
4.3%
115
 
3.5%
, 115
 
3.5%
2 109
 
3.3%
103
 
3.1%
97
 
2.9%
97
 
2.9%
) 92
 
2.8%
( 92
 
2.8%
Other values (164) 1764
53.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1814
55.1%
Decimal Number 570
 
17.3%
Space Separator 565
 
17.2%
Other Punctuation 116
 
3.5%
Close Punctuation 92
 
2.8%
Open Punctuation 92
 
2.8%
Uppercase Letter 26
 
0.8%
Dash Punctuation 11
 
0.3%
Lowercase Letter 4
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
115
 
6.3%
103
 
5.7%
97
 
5.3%
97
 
5.3%
91
 
5.0%
91
 
5.0%
90
 
5.0%
90
 
5.0%
90
 
5.0%
77
 
4.2%
Other values (129) 873
48.1%
Uppercase Letter
ValueCountFrequency (%)
B 7
26.9%
I 3
11.5%
C 3
11.5%
F 2
 
7.7%
H 1
 
3.8%
E 1
 
3.8%
A 1
 
3.8%
W 1
 
3.8%
V 1
 
3.8%
O 1
 
3.8%
Other values (5) 5
19.2%
Decimal Number
ValueCountFrequency (%)
1 142
24.9%
2 109
19.1%
3 81
14.2%
0 55
 
9.6%
5 51
 
8.9%
4 42
 
7.4%
9 26
 
4.6%
7 24
 
4.2%
6 23
 
4.0%
8 17
 
3.0%
Lowercase Letter
ValueCountFrequency (%)
l 2
50.0%
s 1
25.0%
i 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 115
99.1%
/ 1
 
0.9%
Space Separator
ValueCountFrequency (%)
565
100.0%
Close Punctuation
ValueCountFrequency (%)
) 92
100.0%
Open Punctuation
ValueCountFrequency (%)
( 92
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1814
55.1%
Common 1447
44.0%
Latin 30
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
115
 
6.3%
103
 
5.7%
97
 
5.3%
97
 
5.3%
91
 
5.0%
91
 
5.0%
90
 
5.0%
90
 
5.0%
90
 
5.0%
77
 
4.2%
Other values (129) 873
48.1%
Latin
ValueCountFrequency (%)
B 7
23.3%
I 3
 
10.0%
C 3
 
10.0%
l 2
 
6.7%
F 2
 
6.7%
s 1
 
3.3%
i 1
 
3.3%
H 1
 
3.3%
E 1
 
3.3%
A 1
 
3.3%
Other values (8) 8
26.7%
Common
ValueCountFrequency (%)
565
39.0%
1 142
 
9.8%
, 115
 
7.9%
2 109
 
7.5%
) 92
 
6.4%
( 92
 
6.4%
3 81
 
5.6%
0 55
 
3.8%
5 51
 
3.5%
4 42
 
2.9%
Other values (7) 103
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1814
55.1%
ASCII 1477
44.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
565
38.3%
1 142
 
9.6%
, 115
 
7.8%
2 109
 
7.4%
) 92
 
6.2%
( 92
 
6.2%
3 81
 
5.5%
0 55
 
3.7%
5 51
 
3.5%
4 42
 
2.8%
Other values (25) 133
 
9.0%
Hangul
ValueCountFrequency (%)
115
 
6.3%
103
 
5.7%
97
 
5.3%
97
 
5.3%
91
 
5.0%
91
 
5.0%
90
 
5.0%
90
 
5.0%
90
 
5.0%
77
 
4.2%
Other values (129) 873
48.1%

전화번호
Text

MISSING 

Distinct50
Distinct (%)80.6%
Missing28
Missing (%)31.1%
Memory size852.0 B
2023-12-11T01:16:04.014627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.951613
Min length9

Characters and Unicode

Total characters741
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)61.3%

Sample

1st row051-645-1211
2nd row051-529-0700
3rd row02-1644-6015
4th row051-904-6762
5th row051-623-6291
ValueCountFrequency (%)
051-462-2793 2
 
3.2%
051-636-2626 2
 
3.2%
051-645-1211 2
 
3.2%
051-714-0804 2
 
3.2%
051-529-0700 2
 
3.2%
051-621-7575 2
 
3.2%
051-465-6817 2
 
3.2%
051-925-0051 2
 
3.2%
051-701-4994 2
 
3.2%
051-611-3101 2
 
3.2%
Other values (40) 42
67.7%
2023-12-11T01:16:04.597952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 123
16.6%
0 119
16.1%
1 116
15.7%
5 96
13.0%
6 74
10.0%
4 49
 
6.6%
2 44
 
5.9%
3 37
 
5.0%
7 33
 
4.5%
8 29
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 618
83.4%
Dash Punctuation 123
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 119
19.3%
1 116
18.8%
5 96
15.5%
6 74
12.0%
4 49
7.9%
2 44
 
7.1%
3 37
 
6.0%
7 33
 
5.3%
8 29
 
4.7%
9 21
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 123
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 741
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 123
16.6%
0 119
16.1%
1 116
15.7%
5 96
13.0%
6 74
10.0%
4 49
 
6.6%
2 44
 
5.9%
3 37
 
5.0%
7 33
 
4.5%
8 29
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 741
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 123
16.6%
0 119
16.1%
1 116
15.7%
5 96
13.0%
6 74
10.0%
4 49
 
6.6%
2 44
 
5.9%
3 37
 
5.0%
7 33
 
4.5%
8 29
 
3.9%

Correlations

2023-12-11T01:16:04.747551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종상호소재지(도로명)전화번호
업종1.0000.0000.0000.000
상호0.0001.0001.0000.999
소재지(도로명)0.0001.0001.0001.000
전화번호0.0000.9991.0001.000

Missing values

2023-12-11T01:16:01.115094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:16:01.286402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호소재지(도로명)전화번호
0국내여행업늘푸른여행사부산광역시 남구 전포대로 26, 상가동 125호 (문현동, 문현삼성힐타워)051-645-1211
1국내여행업(주)써니투어부산광역시 남구 전포대로91번길 47, 3층 (문현동, 이마트)051-529-0700
2국내여행업(주)유림투어부산광역시 남구 분포로 115, B동 3층 312호 (용호동, 힐탑탑플레이스)02-1644-6015
3국내여행업부산투어 전세버스부산광역시 남구 동명로118번길 82 (용호동, 한우한마당)051-904-6762
4국내여행업성공그린여행사부산광역시 남구 못골번영로 84, 1층 (대연동)051-623-6291
5국내여행업(주)조이투어부산광역시 남구 수영로 233, 401호 (대연동, 민석빌딩)051-442-3999
6국내여행업(주)스마트투어부산광역시 남구 수영로 312, 302호 (대연동, 21센츄리시티오피스텔)051-925-0051
7국내여행업(주)투어드림부산광역시 남구 수영로 295, 세웅빌딩 407호 (대연동)<NA>
8국내여행업(주)찬스투어네트워크부산광역시 남구 분포로 115, 힐탑탑플레이스 B동 3층 319호 (용호동)<NA>
9국내여행업골목여행사부산광역시 남구 용소로19번길 43-1 (대연동)051-635-4316
업종상호소재지(도로명)전화번호
80종합여행업(주)미륭투어부산광역시 남구 신선대산복로 30 (주)미륭레미콘 (용당동)051-611-0800
81종합여행업(주)세라고속관광부산광역시 남구 신선로356번길 65-60, 2층층 (용당동)051-244-8500
82종합여행업(주)서우여행사부산광역시 남구 용소로 45 (대연동)051-784-7070
83종합여행업(주)디에이치오션부산광역시 남구 신선로 365, 산학협력관 101-A호 (용당동)051-323-0329
84종합여행업(주)이에스에스티앤엘부산광역시 남구 전포대로 133, 15층 115호,122호 (문현동)051-923-9100
85종합여행업제이지스포츠부산광역시 남구 못골로 49, 3층 (대연동)<NA>
86종합여행업글로벌엠에스부산광역시 남구 신선로301번길 10, 1층 (용당동)<NA>
87종합여행업(주)애드맥스부산광역시 남구 수영로 234, 부산은행대연동지점 3층 (대연동)051-611-3101
88종합여행업투어피플 여행사부산광역시 남구 수영로 312, 21 센츄리시티 오피스텔 1703호 (대연동)<NA>
89종합여행업주식회사 아이비즈트래블부산광역시 남구 황령대로319번가길 190-6, 상가동 406호 (대연동, 대우그린아파트)<NA>