Overview

Dataset statistics

Number of variables3
Number of observations88
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory25.5 B

Variable types

Categorical1
Text2

Dataset

Description부산광역시남구여행업현황_20230719
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3034664

Reproduction

Analysis started2023-12-10 16:15:55.487092
Analysis finished2023-12-10 16:15:56.152152
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size836.0 B
국내외여행업
54 
종합여행업
24 
국내여행업
10 

Length

Max length6
Median length6
Mean length5.6136364
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내외여행업 54
61.4%
종합여행업 24
27.3%
국내여행업 10
 
11.4%

Length

2023-12-11T01:15:56.283889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:15:56.466978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 54
61.4%
종합여행업 24
27.3%
국내여행업 10
 
11.4%

상호
Text

Distinct85
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size836.0 B
2023-12-11T01:15:56.858620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length14
Mean length8.3977273
Min length3

Characters and Unicode

Total characters739
Distinct characters196
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique82 ?
Unique (%)93.2%

Sample

1st row부산투어 전세버스
2nd row성공그린여행사
3rd row골목여행사
4th row우리고속관광협동조합
5th row(주)오리온투어시스템
ValueCountFrequency (%)
주식회사 10
 
8.8%
tour 5
 
4.4%
골목여행사 2
 
1.8%
주)오리온투어시스템 2
 
1.8%
싱글스투어 2
 
1.8%
여행사 2
 
1.8%
주)에어앤비 1
 
0.9%
하늘투어(ciel 1
 
0.9%
dy 1
 
0.9%
디와이투어 1
 
0.9%
Other values (86) 86
76.1%
2023-12-11T01:15:57.758813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
 
7.3%
( 50
 
6.8%
) 50
 
6.8%
36
 
4.9%
34
 
4.6%
26
 
3.5%
25
 
3.4%
25
 
3.4%
23
 
3.1%
23
 
3.1%
Other values (186) 393
53.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 549
74.3%
Open Punctuation 50
 
6.8%
Close Punctuation 50
 
6.8%
Uppercase Letter 36
 
4.9%
Lowercase Letter 29
 
3.9%
Space Separator 25
 
3.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
9.8%
36
 
6.6%
34
 
6.2%
26
 
4.7%
25
 
4.6%
23
 
4.2%
23
 
4.2%
18
 
3.3%
11
 
2.0%
10
 
1.8%
Other values (152) 289
52.6%
Uppercase Letter
ValueCountFrequency (%)
T 4
11.1%
J 4
11.1%
O 4
11.1%
P 3
 
8.3%
R 2
 
5.6%
V 2
 
5.6%
U 2
 
5.6%
Y 2
 
5.6%
I 2
 
5.6%
D 2
 
5.6%
Other values (8) 9
25.0%
Lowercase Letter
ValueCountFrequency (%)
o 5
17.2%
t 4
13.8%
i 3
10.3%
r 3
10.3%
u 3
10.3%
h 2
 
6.9%
e 2
 
6.9%
l 2
 
6.9%
s 1
 
3.4%
p 1
 
3.4%
Other values (3) 3
10.3%
Open Punctuation
ValueCountFrequency (%)
( 50
100.0%
Close Punctuation
ValueCountFrequency (%)
) 50
100.0%
Space Separator
ValueCountFrequency (%)
25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 549
74.3%
Common 125
 
16.9%
Latin 65
 
8.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
9.8%
36
 
6.6%
34
 
6.2%
26
 
4.7%
25
 
4.6%
23
 
4.2%
23
 
4.2%
18
 
3.3%
11
 
2.0%
10
 
1.8%
Other values (152) 289
52.6%
Latin
ValueCountFrequency (%)
o 5
 
7.7%
t 4
 
6.2%
T 4
 
6.2%
J 4
 
6.2%
O 4
 
6.2%
i 3
 
4.6%
r 3
 
4.6%
u 3
 
4.6%
P 3
 
4.6%
R 2
 
3.1%
Other values (21) 30
46.2%
Common
ValueCountFrequency (%)
( 50
40.0%
) 50
40.0%
25
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 549
74.3%
ASCII 190
 
25.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
 
9.8%
36
 
6.6%
34
 
6.2%
26
 
4.7%
25
 
4.6%
23
 
4.2%
23
 
4.2%
18
 
3.3%
11
 
2.0%
10
 
1.8%
Other values (152) 289
52.6%
ASCII
ValueCountFrequency (%)
( 50
26.3%
) 50
26.3%
25
13.2%
o 5
 
2.6%
t 4
 
2.1%
T 4
 
2.1%
J 4
 
2.1%
O 4
 
2.1%
i 3
 
1.6%
r 3
 
1.6%
Other values (24) 38
20.0%
Distinct82
Distinct (%)93.2%
Missing0
Missing (%)0.0%
Memory size836.0 B
2023-12-11T01:15:58.300201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length44.5
Mean length36.261364
Min length21

Characters and Unicode

Total characters3191
Distinct characters174
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)86.4%

Sample

1st row부산광역시 남구 동명로118번길 82 (용호동, 한우한마당)
2nd row부산광역시 남구 못골번영로 84, 1층 (대연동)
3rd row부산광역시 남구 용소로19번길 43-1 (대연동)
4th row부산광역시 남구 수영로 282, 902호 (대연동, 현대오피스텔)
5th row부산광역시 남구 자성로 152, 한일오피스텔 705호 (문현동)
ValueCountFrequency (%)
부산광역시 88
 
13.7%
남구 88
 
13.7%
대연동 44
 
6.9%
수영로 32
 
5.0%
문현동 26
 
4.0%
312 11
 
1.7%
21 9
 
1.4%
센츄리시티 9
 
1.4%
오피스텔 9
 
1.4%
2층 9
 
1.4%
Other values (201) 317
49.4%
2023-12-11T01:15:59.036760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
554
 
17.4%
1 138
 
4.3%
2 109
 
3.4%
108
 
3.4%
, 104
 
3.3%
101
 
3.2%
97
 
3.0%
93
 
2.9%
90
 
2.8%
90
 
2.8%
Other values (164) 1707
53.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1742
54.6%
Decimal Number 567
 
17.8%
Space Separator 554
 
17.4%
Other Punctuation 105
 
3.3%
Close Punctuation 90
 
2.8%
Open Punctuation 90
 
2.8%
Uppercase Letter 24
 
0.8%
Dash Punctuation 14
 
0.4%
Lowercase Letter 4
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
108
 
6.2%
101
 
5.8%
97
 
5.6%
93
 
5.3%
90
 
5.2%
90
 
5.2%
88
 
5.1%
88
 
5.1%
88
 
5.1%
72
 
4.1%
Other values (128) 827
47.5%
Uppercase Letter
ValueCountFrequency (%)
C 3
12.5%
B 3
12.5%
I 3
12.5%
F 2
 
8.3%
A 2
 
8.3%
G 1
 
4.2%
K 1
 
4.2%
T 1
 
4.2%
O 1
 
4.2%
J 1
 
4.2%
Other values (6) 6
25.0%
Decimal Number
ValueCountFrequency (%)
1 138
24.3%
2 109
19.2%
3 72
12.7%
0 64
11.3%
5 46
 
8.1%
4 42
 
7.4%
6 27
 
4.8%
9 26
 
4.6%
8 25
 
4.4%
7 18
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
l 2
50.0%
s 1
25.0%
i 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 104
99.0%
/ 1
 
1.0%
Space Separator
ValueCountFrequency (%)
554
100.0%
Close Punctuation
ValueCountFrequency (%)
) 90
100.0%
Open Punctuation
ValueCountFrequency (%)
( 90
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1742
54.6%
Common 1421
44.5%
Latin 28
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
108
 
6.2%
101
 
5.8%
97
 
5.6%
93
 
5.3%
90
 
5.2%
90
 
5.2%
88
 
5.1%
88
 
5.1%
88
 
5.1%
72
 
4.1%
Other values (128) 827
47.5%
Latin
ValueCountFrequency (%)
C 3
 
10.7%
B 3
 
10.7%
I 3
 
10.7%
l 2
 
7.1%
F 2
 
7.1%
A 2
 
7.1%
G 1
 
3.6%
K 1
 
3.6%
T 1
 
3.6%
O 1
 
3.6%
Other values (9) 9
32.1%
Common
ValueCountFrequency (%)
554
39.0%
1 138
 
9.7%
2 109
 
7.7%
, 104
 
7.3%
) 90
 
6.3%
( 90
 
6.3%
3 72
 
5.1%
0 64
 
4.5%
5 46
 
3.2%
4 42
 
3.0%
Other values (7) 112
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1742
54.6%
ASCII 1449
45.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
554
38.2%
1 138
 
9.5%
2 109
 
7.5%
, 104
 
7.2%
) 90
 
6.2%
( 90
 
6.2%
3 72
 
5.0%
0 64
 
4.4%
5 46
 
3.2%
4 42
 
2.9%
Other values (26) 140
 
9.7%
Hangul
ValueCountFrequency (%)
108
 
6.2%
101
 
5.8%
97
 
5.6%
93
 
5.3%
90
 
5.2%
90
 
5.2%
88
 
5.1%
88
 
5.1%
88
 
5.1%
72
 
4.1%
Other values (128) 827
47.5%

Correlations

2023-12-11T01:15:59.201823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종상호소재지(도로명)
업종1.0000.0000.000
상호0.0001.0001.000
소재지(도로명)0.0001.0001.000

Missing values

2023-12-11T01:15:55.937782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:15:56.084484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호소재지(도로명)
0국내여행업부산투어 전세버스부산광역시 남구 동명로118번길 82 (용호동, 한우한마당)
1국내여행업성공그린여행사부산광역시 남구 못골번영로 84, 1층 (대연동)
2국내여행업골목여행사부산광역시 남구 용소로19번길 43-1 (대연동)
3국내여행업우리고속관광협동조합부산광역시 남구 수영로 282, 902호 (대연동, 현대오피스텔)
4국내여행업(주)오리온투어시스템부산광역시 남구 자성로 152, 한일오피스텔 705호 (문현동)
5국내여행업싱글스투어부산광역시 남구 수영로 274-23, 우성빌딩 3층 308호 (대연동)
6국내여행업커뮤니케이션 다움부산광역시 남구 진남로29번길 4, JM하우스 201호 (대연동)
7국내여행업주식회사 미식의시대부산광역시 남구 전포대로 133 (문현동,위워크) 오피스동 13층 (문현동)
8국내여행업부산VIP버스투어(서밋투어)부산광역시 남구 석포로20번길 4, 2층 (감만동)
9국내여행업주식회사 플랜온기획부산광역시 남구 지게골로 31, 1층 12호 (문현동, 문현상가)
업종상호소재지(도로명)
78종합여행업제이지스포츠부산광역시 남구 못골로 49, 3층 (대연동)
79종합여행업(주)애드맥스부산광역시 남구 수영로 234, 부산은행대연동지점 3층 (대연동)
80종합여행업투어피플 여행사부산광역시 남구 수영로 312, 21 센츄리시티 오피스텔 1703호 (대연동)
81종합여행업주식회사 아이비즈트래블부산광역시 남구 황령대로319번가길 190-6, 상가동 406호 (대연동, 대우그린아파트)
82종합여행업(주)노매드헐부산광역시 남구 문현금융로 40, 부산국제금융센터 55층 (문현동)
83종합여행업W(월드)투어부산광역시 남구 수영로 312, 21 센츄리시티 오피스텔 1608호 (대연동)
84종합여행업골든나비스부산광역시 남구 황령대로319번길 128, 5층 (대연동)
85종합여행업야나트립부산광역시 남구 자성로 152, 한일오피스텔 503-G06호 (문현동)
86종합여행업주식회사 코푸부산광역시 남구 신선로 259-2, 2층 (용당동)
87종합여행업(주)허니투어부산광역시 남구 문현금융로 40, 부산국제금융센터 2층 에이218호 (문현동)