Overview

Dataset statistics

Number of variables5
Number of observations234
Missing cells101
Missing cells (%)8.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.5 KiB
Average record size in memory41.6 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description부산광역시해운대구_여행업현황_20230126
Author부산광역시 해운대구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3075746

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
연번 has 7 (3.0%) missing valuesMissing
연락처 has 92 (39.3%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:17:20.629614
Analysis finished2023-12-10 16:17:21.293723
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct227
Distinct (%)100.0%
Missing7
Missing (%)3.0%
Infinite0
Infinite (%)0.0%
Mean114
Minimum1
Maximum227
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-11T01:17:21.391696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.3
Q157.5
median114
Q3170.5
95-th percentile215.7
Maximum227
Range226
Interquartile range (IQR)113

Descriptive statistics

Standard deviation65.673435
Coefficient of variation (CV)0.57608276
Kurtosis-1.2
Mean114
Median Absolute Deviation (MAD)57
Skewness0
Sum25878
Variance4313
MonotonicityStrictly increasing
2023-12-11T01:17:21.550899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
144 1
 
0.4%
146 1
 
0.4%
147 1
 
0.4%
148 1
 
0.4%
149 1
 
0.4%
150 1
 
0.4%
151 1
 
0.4%
152 1
 
0.4%
153 1
 
0.4%
154 1
 
0.4%
Other values (217) 217
92.7%
(Missing) 7
 
3.0%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
227 1
0.4%
226 1
0.4%
225 1
0.4%
224 1
0.4%
223 1
0.4%
222 1
0.4%
221 1
0.4%
220 1
0.4%
219 1
0.4%
218 1
0.4%

업종
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
국내외여행업
92 
종합여행업
80 
국내여행업
62 

Length

Max length6
Median length5
Mean length5.3931624
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내외여행업 92
39.3%
종합여행업 80
34.2%
국내여행업 62
26.5%

Length

2023-12-11T01:17:21.736747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:17:21.863889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 92
39.3%
종합여행업 80
34.2%
국내여행업 62
26.5%

상호
Text

Distinct207
Distinct (%)88.5%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T01:17:22.100557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length21
Mean length8.6837607
Min length2

Characters and Unicode

Total characters2032
Distinct characters298
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique180 ?
Unique (%)76.9%

Sample

1st row(주)이트립포유
2nd row관광가이드 부산(주)
3rd row(주)토성투어
4th row(주)로마여행사
5th row(주)하나에스엠여행사
ValueCountFrequency (%)
주식회사 42
 
13.0%
여행사 6
 
1.9%
투어 4
 
1.2%
주)스카이블루투어 2
 
0.6%
2
 
0.6%
대어여행 2
 
0.6%
주)투어락네트워크 2
 
0.6%
주)클럽가이아 2
 
0.6%
주)코밴더 2
 
0.6%
프래블 2
 
0.6%
Other values (231) 257
79.6%
2023-12-11T01:17:22.496186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
170
 
8.4%
) 131
 
6.4%
( 131
 
6.4%
89
 
4.4%
87
 
4.3%
69
 
3.4%
66
 
3.2%
62
 
3.1%
61
 
3.0%
51
 
2.5%
Other values (288) 1115
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1566
77.1%
Close Punctuation 131
 
6.4%
Open Punctuation 131
 
6.4%
Space Separator 89
 
4.4%
Uppercase Letter 51
 
2.5%
Lowercase Letter 47
 
2.3%
Other Punctuation 6
 
0.3%
Math Symbol 4
 
0.2%
Dash Punctuation 3
 
0.1%
Decimal Number 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
170
 
10.9%
87
 
5.6%
69
 
4.4%
66
 
4.2%
62
 
4.0%
61
 
3.9%
51
 
3.3%
46
 
2.9%
43
 
2.7%
43
 
2.7%
Other values (237) 868
55.4%
Uppercase Letter
ValueCountFrequency (%)
T 10
19.6%
C 5
9.8%
E 4
 
7.8%
U 4
 
7.8%
H 3
 
5.9%
O 3
 
5.9%
L 3
 
5.9%
R 3
 
5.9%
Y 2
 
3.9%
A 2
 
3.9%
Other values (10) 12
23.5%
Lowercase Letter
ValueCountFrequency (%)
e 6
12.8%
r 6
12.8%
a 5
10.6%
t 4
 
8.5%
v 3
 
6.4%
l 3
 
6.4%
i 3
 
6.4%
u 2
 
4.3%
o 2
 
4.3%
c 2
 
4.3%
Other values (9) 11
23.4%
Other Punctuation
ValueCountFrequency (%)
& 4
66.7%
, 1
 
16.7%
. 1
 
16.7%
Math Symbol
ValueCountFrequency (%)
> 3
75.0%
1
 
25.0%
Decimal Number
ValueCountFrequency (%)
8 2
66.7%
0 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 131
100.0%
Open Punctuation
ValueCountFrequency (%)
( 131
100.0%
Space Separator
ValueCountFrequency (%)
89
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1567
77.1%
Common 367
 
18.1%
Latin 98
 
4.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
170
 
10.8%
87
 
5.6%
69
 
4.4%
66
 
4.2%
62
 
4.0%
61
 
3.9%
51
 
3.3%
46
 
2.9%
43
 
2.7%
43
 
2.7%
Other values (238) 869
55.5%
Latin
ValueCountFrequency (%)
T 10
 
10.2%
e 6
 
6.1%
r 6
 
6.1%
C 5
 
5.1%
a 5
 
5.1%
E 4
 
4.1%
t 4
 
4.1%
U 4
 
4.1%
H 3
 
3.1%
v 3
 
3.1%
Other values (29) 48
49.0%
Common
ValueCountFrequency (%)
) 131
35.7%
( 131
35.7%
89
24.3%
& 4
 
1.1%
> 3
 
0.8%
- 3
 
0.8%
8 2
 
0.5%
0 1
 
0.3%
1
 
0.3%
, 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1566
77.1%
ASCII 464
 
22.8%
None 1
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
170
 
10.9%
87
 
5.6%
69
 
4.4%
66
 
4.2%
62
 
4.0%
61
 
3.9%
51
 
3.3%
46
 
2.9%
43
 
2.7%
43
 
2.7%
Other values (237) 868
55.4%
ASCII
ValueCountFrequency (%)
) 131
28.2%
( 131
28.2%
89
19.2%
T 10
 
2.2%
e 6
 
1.3%
r 6
 
1.3%
C 5
 
1.1%
a 5
 
1.1%
& 4
 
0.9%
E 4
 
0.9%
Other values (39) 73
15.7%
None
ValueCountFrequency (%)
1
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%
Distinct200
Distinct (%)86.2%
Missing2
Missing (%)0.9%
Memory size2.0 KiB
2023-12-11T01:17:22.776348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length47
Mean length39.650862
Min length22

Characters and Unicode

Total characters9199
Distinct characters226
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique171 ?
Unique (%)73.7%

Sample

1st row부산광역시 해운대구 재반로63번길 27, 2층 (재송동)
2nd row부산광역시 해운대구 구남로29번길 21 (중동, 리베라호텔 16층)
3rd row부산광역시 해운대구 해운대해변로 140, 지하1층 (우동, 홈플러스 해운대점)
4th row부산광역시 해운대구 해운대로 813, 1층 133호 (좌동, ZIPOP상가)
5th row부산광역시 해운대구 반여로 131, 215호 (반여동, 아시아선수촌 프레스상가)
ValueCountFrequency (%)
부산광역시 232
 
13.8%
해운대구 232
 
13.8%
우동 109
 
6.5%
재송동 42
 
2.5%
센텀중앙로 37
 
2.2%
중동 34
 
2.0%
좌동 29
 
1.7%
해운대해변로 25
 
1.5%
센텀동로 17
 
1.0%
2층 16
 
0.9%
Other values (423) 914
54.2%
2023-12-11T01:17:23.176836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1455
 
15.8%
351
 
3.8%
332
 
3.6%
332
 
3.6%
330
 
3.6%
, 319
 
3.5%
1 309
 
3.4%
278
 
3.0%
257
 
2.8%
238
 
2.6%
Other values (216) 4998
54.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5437
59.1%
Space Separator 1455
 
15.8%
Decimal Number 1450
 
15.8%
Other Punctuation 319
 
3.5%
Close Punctuation 232
 
2.5%
Open Punctuation 232
 
2.5%
Uppercase Letter 56
 
0.6%
Dash Punctuation 15
 
0.2%
Lowercase Letter 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
351
 
6.5%
332
 
6.1%
332
 
6.1%
330
 
6.1%
278
 
5.1%
257
 
4.7%
238
 
4.4%
233
 
4.3%
232
 
4.3%
232
 
4.3%
Other values (189) 2622
48.2%
Decimal Number
ValueCountFrequency (%)
1 309
21.3%
2 230
15.9%
0 223
15.4%
3 176
12.1%
9 96
 
6.6%
5 95
 
6.6%
4 90
 
6.2%
7 86
 
5.9%
6 78
 
5.4%
8 67
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
A 13
23.2%
P 11
19.6%
E 10
17.9%
C 8
14.3%
B 4
 
7.1%
T 4
 
7.1%
O 2
 
3.6%
I 2
 
3.6%
Z 2
 
3.6%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
e 1
50.0%
Space Separator
ValueCountFrequency (%)
1455
100.0%
Other Punctuation
ValueCountFrequency (%)
, 319
100.0%
Close Punctuation
ValueCountFrequency (%)
) 232
100.0%
Open Punctuation
ValueCountFrequency (%)
( 232
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5437
59.1%
Common 3704
40.3%
Latin 58
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
351
 
6.5%
332
 
6.1%
332
 
6.1%
330
 
6.1%
278
 
5.1%
257
 
4.7%
238
 
4.4%
233
 
4.3%
232
 
4.3%
232
 
4.3%
Other values (189) 2622
48.2%
Common
ValueCountFrequency (%)
1455
39.3%
, 319
 
8.6%
1 309
 
8.3%
) 232
 
6.3%
( 232
 
6.3%
2 230
 
6.2%
0 223
 
6.0%
3 176
 
4.8%
9 96
 
2.6%
5 95
 
2.6%
Other values (6) 337
 
9.1%
Latin
ValueCountFrequency (%)
A 13
22.4%
P 11
19.0%
E 10
17.2%
C 8
13.8%
B 4
 
6.9%
T 4
 
6.9%
O 2
 
3.4%
I 2
 
3.4%
Z 2
 
3.4%
s 1
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5437
59.1%
ASCII 3762
40.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1455
38.7%
, 319
 
8.5%
1 309
 
8.2%
) 232
 
6.2%
( 232
 
6.2%
2 230
 
6.1%
0 223
 
5.9%
3 176
 
4.7%
9 96
 
2.6%
5 95
 
2.5%
Other values (17) 395
 
10.5%
Hangul
ValueCountFrequency (%)
351
 
6.5%
332
 
6.1%
332
 
6.1%
330
 
6.1%
278
 
5.1%
257
 
4.7%
238
 
4.4%
233
 
4.3%
232
 
4.3%
232
 
4.3%
Other values (189) 2622
48.2%

연락처
Text

MISSING 

Distinct125
Distinct (%)88.0%
Missing92
Missing (%)39.3%
Memory size2.0 KiB
2023-12-11T01:17:23.437428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.880282
Min length8

Characters and Unicode

Total characters1687
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)76.1%

Sample

1st row051-744-0027
2nd row051-743-5449
3rd row051-746-1118
4th row051-702-8260
5th row051-997-8789
ValueCountFrequency (%)
051-743-1159 2
 
1.4%
051-747-4666 2
 
1.4%
051-744-0027 2
 
1.4%
051-747-2525 2
 
1.4%
051-741-8390 2
 
1.4%
051-463-7500 2
 
1.4%
051-338-8825 2
 
1.4%
051-817-3060 2
 
1.4%
051-7420707 2
 
1.4%
051-633-1161 2
 
1.4%
Other values (115) 122
85.9%
2023-12-11T01:17:23.837829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 271
16.1%
0 264
15.6%
1 227
13.5%
5 196
11.6%
7 193
11.4%
4 117
6.9%
8 109
6.5%
3 92
 
5.5%
2 84
 
5.0%
9 69
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1416
83.9%
Dash Punctuation 271
 
16.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 264
18.6%
1 227
16.0%
5 196
13.8%
7 193
13.6%
4 117
8.3%
8 109
7.7%
3 92
 
6.5%
2 84
 
5.9%
9 69
 
4.9%
6 65
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 271
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1687
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 271
16.1%
0 264
15.6%
1 227
13.5%
5 196
11.6%
7 193
11.4%
4 117
6.9%
8 109
6.5%
3 92
 
5.5%
2 84
 
5.0%
9 69
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1687
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 271
16.1%
0 264
15.6%
1 227
13.5%
5 196
11.6%
7 193
11.4%
4 117
6.9%
8 109
6.5%
3 92
 
5.5%
2 84
 
5.0%
9 69
 
4.1%

Interactions

2023-12-11T01:17:20.934986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:17:23.922701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.950
업종0.9501.000
2023-12-11T01:17:24.219326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.928
업종0.9281.000

Missing values

2023-12-11T01:17:21.035655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:17:21.122936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:17:21.233711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번업종상호소재지(도로명)연락처
01국내여행업(주)이트립포유부산광역시 해운대구 재반로63번길 27, 2층 (재송동)051-744-0027
12국내여행업관광가이드 부산(주)부산광역시 해운대구 구남로29번길 21 (중동, 리베라호텔 16층)051-743-5449
23국내여행업(주)토성투어부산광역시 해운대구 해운대해변로 140, 지하1층 (우동, 홈플러스 해운대점)051-746-1118
34국내여행업(주)로마여행사부산광역시 해운대구 해운대로 813, 1층 133호 (좌동, ZIPOP상가)051-702-8260
45국내여행업(주)하나에스엠여행사부산광역시 해운대구 반여로 131, 215호 (반여동, 아시아선수촌 프레스상가)051-997-8789
56국내여행업일일관광부산광역시 해운대구 마린시티3로 52 (우동)051-749-5215
67국내여행업(주)파라디아트레블부산광역시 해운대구 해운대해변로 296, 별관동 1층 (중동, 파라다이스호텔부산)051-744-6551
78국내여행업(주)케이비트래블부산광역시 해운대구 좌동순환로 511 (중동, 해운대이마트1층)<NA>
89국내여행업(주)늘봄 여행사부산광역시 해운대구 센텀남대로 35, 9층 (우동, 신세계센텀시티점)<NA>
910국내여행업주식회사 투어파크부산광역시 해운대구 해운대로 1224, 국초빌딩 1층 (송정동)051-704-9936
연번업종상호소재지(도로명)연락처
224225종합여행업(주)마실부산광역시 해운대구 센텀중앙로 97, 센텀스카이비즈 에이동 703호 (재송동)051-741-3388
225226종합여행업더웰(THE WELL)부산광역시 해운대구 수영강변대로 140, 부산문화콘텐츠컴플렉스 612호 (우동)070-8722-6321
226227종합여행업주식회사 옹골찬사람들부산광역시 해운대구 센텀서로 39, 영상산업센터 713호 (우동)070-8015-6374
227<NA>종합여행업스완골프부산광역시 해운대구 선수촌로 175, 202호 (반여동)<NA>
228<NA>종합여행업그랜트리투어부산광역시 해운대구 해운대로 724, 해운대에스에이치타워 1715호 (중동)<NA>
229<NA>종합여행업주식회사 더블유이컴퍼니부산광역시 해운대구 센텀3로 26, 센텀스퀘어 901호 (우동)1877-6995
230<NA>종합여행업롯데관광부산(주)부산광역시 해운대구 대천로 205, 상가동 301호 (좌동, 벽산1차아파트)<NA>
231<NA>종합여행업투어아이존부산광역시 해운대구 APEC로 55, 벡스코 157호 (우동)051-743-7781
232<NA>종합여행업(주)위메이크부산광역시 해운대구 센텀서로 30, 케이엔엔타워 2208호 (우동)<NA>
233<NA>종합여행업더비치타이부산광역시 해운대구 해운대로570번길 23, 더세일링타워 4층 (우동)051-741-9978