Overview

Dataset statistics

Number of variables5
Number of observations158
Missing cells30
Missing cells (%)3.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.3 KiB
Average record size in memory40.8 B

Variable types

Categorical2
Text3

Dataset

Description부산광역시 중구 관내 관광 및 여행업 현황에 대한 데이터로 업종, 상호, 소재지(도로명), 전화번호 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15026360/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 30 (19.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 19:28:30.320675
Analysis finished2023-12-12 19:28:30.760211
Duration0.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
국내외여행업
96 
종합여행업
40 
국내여행업
22 

Length

Max length6
Median length6
Mean length5.6075949
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내외여행업 96
60.8%
종합여행업 40
25.3%
국내여행업 22
 
13.9%

Length

2023-12-13T04:28:30.840837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:28:30.970776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 96
60.8%
종합여행업 40
25.3%
국내여행업 22
 
13.9%

상호
Text

Distinct143
Distinct (%)90.5%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-13T04:28:31.289740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length8.0822785
Min length3

Characters and Unicode

Total characters1277
Distinct characters213
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique128 ?
Unique (%)81.0%

Sample

1st row(주)동방여행사
2nd row(주)로얄항공여행사
3rd row(주)리더스투어
4th row(주)부산그린여행사
5th row(주)강남국제고속관광
ValueCountFrequency (%)
주식회사 6
 
3.4%
여행사 4
 
2.2%
주)이투어프랜드 2
 
1.1%
주)세환네트워크 2
 
1.1%
투어 2
 
1.1%
주)팔성국제관광 2
 
1.1%
제일여행개발 2
 
1.1%
주)로얄항공여행사 2
 
1.1%
주)에이스여행사 2
 
1.1%
주)동방여행사 2
 
1.1%
Other values (145) 153
85.5%
2023-12-13T04:28:31.822891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
129
 
10.1%
( 118
 
9.2%
) 118
 
9.2%
73
 
5.7%
72
 
5.6%
47
 
3.7%
47
 
3.7%
36
 
2.8%
28
 
2.2%
28
 
2.2%
Other values (203) 581
45.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1001
78.4%
Open Punctuation 118
 
9.2%
Close Punctuation 118
 
9.2%
Space Separator 21
 
1.6%
Uppercase Letter 7
 
0.5%
Decimal Number 6
 
0.5%
Other Punctuation 3
 
0.2%
Lowercase Letter 2
 
0.2%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
129
 
12.9%
73
 
7.3%
72
 
7.2%
47
 
4.7%
47
 
4.7%
36
 
3.6%
28
 
2.8%
28
 
2.8%
16
 
1.6%
13
 
1.3%
Other values (186) 512
51.1%
Uppercase Letter
ValueCountFrequency (%)
M 2
28.6%
P 2
28.6%
B 1
14.3%
J 1
14.3%
T 1
14.3%
Decimal Number
ValueCountFrequency (%)
8 3
50.0%
1 1
 
16.7%
2 1
 
16.7%
4 1
 
16.7%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
& 1
33.3%
Lowercase Letter
ValueCountFrequency (%)
i 1
50.0%
g 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 118
100.0%
Close Punctuation
ValueCountFrequency (%)
) 118
100.0%
Space Separator
ValueCountFrequency (%)
21
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1002
78.5%
Common 266
 
20.8%
Latin 9
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
129
 
12.9%
73
 
7.3%
72
 
7.2%
47
 
4.7%
47
 
4.7%
36
 
3.6%
28
 
2.8%
28
 
2.8%
16
 
1.6%
13
 
1.3%
Other values (187) 513
51.2%
Common
ValueCountFrequency (%)
( 118
44.4%
) 118
44.4%
21
 
7.9%
8 3
 
1.1%
. 2
 
0.8%
1 1
 
0.4%
2 1
 
0.4%
4 1
 
0.4%
& 1
 
0.4%
Latin
ValueCountFrequency (%)
M 2
22.2%
P 2
22.2%
B 1
11.1%
i 1
11.1%
g 1
11.1%
J 1
11.1%
T 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1001
78.4%
ASCII 275
 
21.5%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
129
 
12.9%
73
 
7.3%
72
 
7.2%
47
 
4.7%
47
 
4.7%
36
 
3.6%
28
 
2.8%
28
 
2.8%
16
 
1.6%
13
 
1.3%
Other values (186) 512
51.1%
ASCII
ValueCountFrequency (%)
( 118
42.9%
) 118
42.9%
21
 
7.6%
8 3
 
1.1%
M 2
 
0.7%
. 2
 
0.7%
P 2
 
0.7%
B 1
 
0.4%
i 1
 
0.4%
g 1
 
0.4%
Other values (6) 6
 
2.2%
None
ValueCountFrequency (%)
1
100.0%
Distinct141
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-13T04:28:32.258738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length40
Mean length33.64557
Min length22

Characters and Unicode

Total characters5316
Distinct characters136
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique125 ?
Unique (%)79.1%

Sample

1st row부산광역시 중구 대영로 247, 준희빌딩 202호 (영주동)
2nd row부산광역시 중구 중구로53번길 1 (부평동1가)
3rd row부산광역시 중구 충장대로9번길 66, 한국선원센터 501호 (중앙동4가)
4th row부산광역시 중구 해관로 89 (대창동1가,중앙빌딩 304)
5th row부산광역시 중구 중앙대로 113-1 (대창동1가,세방빌딩 3층)
ValueCountFrequency (%)
부산광역시 158
 
15.5%
중구 158
 
15.5%
중앙동4가 72
 
7.1%
중앙대로 46
 
4.5%
해관로 30
 
2.9%
3층 17
 
1.7%
2층 13
 
1.3%
충장대로5번길 13
 
1.3%
중앙대로81번길 12
 
1.2%
501호 11
 
1.1%
Other values (240) 489
48.0%
2023-12-13T04:28:32.863971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
861
 
16.2%
334
 
6.3%
1 196
 
3.7%
185
 
3.5%
180
 
3.4%
, 175
 
3.3%
172
 
3.2%
172
 
3.2%
171
 
3.2%
165
 
3.1%
Other values (126) 2705
50.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2974
55.9%
Decimal Number 946
 
17.8%
Space Separator 861
 
16.2%
Other Punctuation 176
 
3.3%
Open Punctuation 163
 
3.1%
Close Punctuation 163
 
3.1%
Dash Punctuation 30
 
0.6%
Uppercase Letter 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
334
 
11.2%
185
 
6.2%
180
 
6.1%
172
 
5.8%
172
 
5.8%
171
 
5.7%
165
 
5.5%
158
 
5.3%
158
 
5.3%
157
 
5.3%
Other values (107) 1122
37.7%
Decimal Number
ValueCountFrequency (%)
1 196
20.7%
4 135
14.3%
2 118
12.5%
0 108
11.4%
5 96
10.1%
3 93
9.8%
7 63
 
6.7%
9 49
 
5.2%
6 46
 
4.9%
8 42
 
4.4%
Uppercase Letter
ValueCountFrequency (%)
A 1
33.3%
B 1
33.3%
D 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 175
99.4%
/ 1
 
0.6%
Space Separator
ValueCountFrequency (%)
861
100.0%
Open Punctuation
ValueCountFrequency (%)
( 163
100.0%
Close Punctuation
ValueCountFrequency (%)
) 163
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2974
55.9%
Common 2339
44.0%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
334
 
11.2%
185
 
6.2%
180
 
6.1%
172
 
5.8%
172
 
5.8%
171
 
5.7%
165
 
5.5%
158
 
5.3%
158
 
5.3%
157
 
5.3%
Other values (107) 1122
37.7%
Common
ValueCountFrequency (%)
861
36.8%
1 196
 
8.4%
, 175
 
7.5%
( 163
 
7.0%
) 163
 
7.0%
4 135
 
5.8%
2 118
 
5.0%
0 108
 
4.6%
5 96
 
4.1%
3 93
 
4.0%
Other values (6) 231
 
9.9%
Latin
ValueCountFrequency (%)
A 1
33.3%
B 1
33.3%
D 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2974
55.9%
ASCII 2342
44.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
861
36.8%
1 196
 
8.4%
, 175
 
7.5%
( 163
 
7.0%
) 163
 
7.0%
4 135
 
5.8%
2 118
 
5.0%
0 108
 
4.6%
5 96
 
4.1%
3 93
 
4.0%
Other values (9) 234
 
10.0%
Hangul
ValueCountFrequency (%)
334
 
11.2%
185
 
6.2%
180
 
6.1%
172
 
5.8%
172
 
5.8%
171
 
5.7%
165
 
5.5%
158
 
5.3%
158
 
5.3%
157
 
5.3%
Other values (107) 1122
37.7%

전화번호
Text

MISSING 

Distinct111
Distinct (%)86.7%
Missing30
Missing (%)19.0%
Memory size1.4 KiB
2023-12-13T04:28:33.193672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.984375
Min length9

Characters and Unicode

Total characters1534
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)74.2%

Sample

1st row051-466-7107
2nd row051-441-8500
3rd row051-255-6511
4th row051-466-1100
5th row051-464-9800
ValueCountFrequency (%)
051-240-8881 3
 
2.3%
051-468-8383 2
 
1.6%
051-442-6300 2
 
1.6%
051-466-7107 2
 
1.6%
051-264-7900 2
 
1.6%
051-462-4020 2
 
1.6%
051-469-7731 2
 
1.6%
051-464-0606 2
 
1.6%
051-468-7747 2
 
1.6%
051-467-9010 2
 
1.6%
Other values (101) 107
83.6%
2023-12-13T04:28:33.924461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 254
16.6%
0 251
16.4%
1 235
15.3%
5 189
12.3%
4 154
10.0%
6 112
7.3%
2 83
 
5.4%
8 79
 
5.1%
7 75
 
4.9%
3 59
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1280
83.4%
Dash Punctuation 254
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 251
19.6%
1 235
18.4%
5 189
14.8%
4 154
12.0%
6 112
8.8%
2 83
 
6.5%
8 79
 
6.2%
7 75
 
5.9%
3 59
 
4.6%
9 43
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 254
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1534
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 254
16.6%
0 251
16.4%
1 235
15.3%
5 189
12.3%
4 154
10.0%
6 112
7.3%
2 83
 
5.4%
8 79
 
5.1%
7 75
 
4.9%
3 59
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1534
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 254
16.6%
0 251
16.4%
1 235
15.3%
5 189
12.3%
4 154
10.0%
6 112
7.3%
2 83
 
5.4%
8 79
 
5.1%
7 75
 
4.9%
3 59
 
3.8%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-07-20
158 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-20
2nd row2023-07-20
3rd row2023-07-20
4th row2023-07-20
5th row2023-07-20

Common Values

ValueCountFrequency (%)
2023-07-20 158
100.0%

Length

2023-12-13T04:28:34.058916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:28:34.156188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-20 158
100.0%

Missing values

2023-12-13T04:28:30.594371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:28:30.712409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호소재지(도로명)전화번호데이터기준일자
0국내여행업(주)동방여행사부산광역시 중구 대영로 247, 준희빌딩 202호 (영주동)051-466-71072023-07-20
1국내여행업(주)로얄항공여행사부산광역시 중구 중구로53번길 1 (부평동1가)051-441-85002023-07-20
2국내여행업(주)리더스투어부산광역시 중구 충장대로9번길 66, 한국선원센터 501호 (중앙동4가)051-255-65112023-07-20
3국내여행업(주)부산그린여행사부산광역시 중구 해관로 89 (대창동1가,중앙빌딩 304)051-466-11002023-07-20
4국내여행업(주)강남국제고속관광부산광역시 중구 중앙대로 113-1 (대창동1가,세방빌딩 3층)051-464-98002023-07-20
5국내여행업(주)두모씨앤씨부산광역시 중구 충장대로9번길 52, 1710호 (중앙동4가, 마린센터빌딩)051-245-10662023-07-20
6국내여행업(주)원샷골프투어부산광역시 중구 중앙대로 57, 10층 (중앙동2가)051-442-18362023-07-20
7국내여행업(주)투어장네트워크부산광역시 중구 중앙대로 131, 1501호 (대창동2가, 센트럴오피스텔)051-467-90102023-07-20
8국내여행업(주)허니스투어부산광역시 중구 광복로 39 (창선동2가,4층)051-246-08192023-07-20
9국내여행업(주)오주쉬핑부산광역시 중구 광복로97번길 25-1, 503호 (동광동2가,삼호빌딩 별관)051-241-81812023-07-20
업종상호소재지(도로명)전화번호데이터기준일자
148종합여행업(주)엔에프투어부산광역시 중구 해관로 89, 704호 (대창동1가)051-716-86002023-07-20
149종합여행업(주)더조이투어부산광역시 중구 대청로 146 (중앙동2가)<NA>2023-07-20
150종합여행업주말엔부산광역시 중구 대청로141번길 3, 2층 (중앙동3가)051-468-83832023-07-20
151종합여행업라쿠투어부산광역시 중구 중구로148번길 3-1, 401호 (동광동5가, 미래원룸)<NA>2023-07-20
152종합여행업(주)팬스타라인닷컴부산광역시 중구 해관로 30, 3,5층 (중앙동2가)051-240-88812023-07-20
153종합여행업아루토라 트래블부산광역시 중구 충장대로5번길 18, 201호 (중앙동4가)<NA>2023-07-20
154종합여행업(주)비오스부산광역시 중구 광복중앙로 28-1, 509호 (대청동2가)<NA>2023-07-20
155종합여행업폼나는 여행사부산광역시 중구 충장대로5번길 56, 202호 (중앙동4가)<NA>2023-07-20
156종합여행업(주)버스투어여행사부산광역시 중구 충장대로9번길 37-1, 403호 (중앙동4가)051-951-99772023-07-20
157종합여행업(주)레오여행사부산광역시 중구 해관로 44, 201호 (중앙동2가)051-710-52112023-07-20