Overview

Dataset statistics

Number of variables5
Number of observations227
Missing cells137
Missing cells (%)12.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.2 KiB
Average record size in memory41.6 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description부산광역시해운대구_여행업현황_20220112
Author부산광역시 해운대구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3075746

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
연락처 has 137 (60.4%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:17:25.673531
Analysis finished2023-12-10 16:17:26.166462
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct227
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean114
Minimum1
Maximum227
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-11T01:17:26.236346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.3
Q157.5
median114
Q3170.5
95-th percentile215.7
Maximum227
Range226
Interquartile range (IQR)113

Descriptive statistics

Standard deviation65.673435
Coefficient of variation (CV)0.57608276
Kurtosis-1.2
Mean114
Median Absolute Deviation (MAD)57
Skewness0
Sum25878
Variance4313
MonotonicityStrictly increasing
2023-12-11T01:17:26.356827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
144 1
 
0.4%
146 1
 
0.4%
147 1
 
0.4%
148 1
 
0.4%
149 1
 
0.4%
150 1
 
0.4%
151 1
 
0.4%
152 1
 
0.4%
153 1
 
0.4%
Other values (217) 217
95.6%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
227 1
0.4%
226 1
0.4%
225 1
0.4%
224 1
0.4%
223 1
0.4%
222 1
0.4%
221 1
0.4%
220 1
0.4%
219 1
0.4%
218 1
0.4%

업종
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
국내외여행업
88 
국내여행업
72 
종합여행업
67 

Length

Max length6
Median length5
Mean length5.3876652
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내외여행업 88
38.8%
국내여행업 72
31.7%
종합여행업 67
29.5%

Length

2023-12-11T01:17:26.472100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:17:26.570876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 88
38.8%
국내여행업 72
31.7%
종합여행업 67
29.5%

상호
Text

Distinct187
Distinct (%)82.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T01:17:26.785680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length15
Mean length8.4845815
Min length2

Characters and Unicode

Total characters1926
Distinct characters279
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique147 ?
Unique (%)64.8%

Sample

1st row(주)동아항공여행사
2nd row(주)이트립포유
3rd row관광가이드 부산(주)
4th row(주)토성투어
5th row(주)로마여행사
ValueCountFrequency (%)
주식회사 34
 
11.3%
여행사 6
 
2.0%
투어 4
 
1.3%
주)동아항공여행사 2
 
0.7%
주)클럽가이아 2
 
0.7%
유레카투어 2
 
0.7%
주)여행매니저닷컴 2
 
0.7%
주)코밴더 2
 
0.7%
핸즈투어엔에어 2
 
0.7%
글로벌탑투어 2
 
0.7%
Other values (208) 244
80.8%
2023-12-11T01:17:27.154899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
169
 
8.8%
( 134
 
7.0%
) 134
 
7.0%
81
 
4.2%
75
 
3.9%
69
 
3.6%
65
 
3.4%
65
 
3.4%
63
 
3.3%
48
 
2.5%
Other values (269) 1023
53.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1499
77.8%
Open Punctuation 134
 
7.0%
Close Punctuation 134
 
7.0%
Space Separator 75
 
3.9%
Lowercase Letter 39
 
2.0%
Uppercase Letter 38
 
2.0%
Other Punctuation 4
 
0.2%
Decimal Number 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
169
 
11.3%
81
 
5.4%
69
 
4.6%
65
 
4.3%
65
 
4.3%
63
 
4.2%
48
 
3.2%
45
 
3.0%
35
 
2.3%
35
 
2.3%
Other values (224) 824
55.0%
Uppercase Letter
ValueCountFrequency (%)
T 5
13.2%
U 5
13.2%
C 4
10.5%
O 3
 
7.9%
A 3
 
7.9%
R 3
 
7.9%
N 2
 
5.3%
S 2
 
5.3%
J 1
 
2.6%
B 1
 
2.6%
Other values (9) 9
23.7%
Lowercase Letter
ValueCountFrequency (%)
a 5
12.8%
r 5
12.8%
e 5
12.8%
l 3
 
7.7%
v 3
 
7.7%
t 3
 
7.7%
i 2
 
5.1%
n 2
 
5.1%
s 2
 
5.1%
g 1
 
2.6%
Other values (8) 8
20.5%
Other Punctuation
ValueCountFrequency (%)
& 2
50.0%
. 1
25.0%
, 1
25.0%
Decimal Number
ValueCountFrequency (%)
8 2
66.7%
0 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 134
100.0%
Close Punctuation
ValueCountFrequency (%)
) 134
100.0%
Space Separator
ValueCountFrequency (%)
75
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1499
77.8%
Common 350
 
18.2%
Latin 77
 
4.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
169
 
11.3%
81
 
5.4%
69
 
4.6%
65
 
4.3%
65
 
4.3%
63
 
4.2%
48
 
3.2%
45
 
3.0%
35
 
2.3%
35
 
2.3%
Other values (224) 824
55.0%
Latin
ValueCountFrequency (%)
T 5
 
6.5%
a 5
 
6.5%
r 5
 
6.5%
U 5
 
6.5%
e 5
 
6.5%
C 4
 
5.2%
l 3
 
3.9%
v 3
 
3.9%
O 3
 
3.9%
A 3
 
3.9%
Other values (27) 36
46.8%
Common
ValueCountFrequency (%)
( 134
38.3%
) 134
38.3%
75
21.4%
& 2
 
0.6%
8 2
 
0.6%
0 1
 
0.3%
. 1
 
0.3%
, 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1499
77.8%
ASCII 427
 
22.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
169
 
11.3%
81
 
5.4%
69
 
4.6%
65
 
4.3%
65
 
4.3%
63
 
4.2%
48
 
3.2%
45
 
3.0%
35
 
2.3%
35
 
2.3%
Other values (224) 824
55.0%
ASCII
ValueCountFrequency (%)
( 134
31.4%
) 134
31.4%
75
17.6%
T 5
 
1.2%
a 5
 
1.2%
r 5
 
1.2%
U 5
 
1.2%
e 5
 
1.2%
C 4
 
0.9%
l 3
 
0.7%
Other values (35) 52
 
12.2%
Distinct186
Distinct (%)81.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T01:17:27.407244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length48
Mean length39.770925
Min length22

Characters and Unicode

Total characters9028
Distinct characters215
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique148 ?
Unique (%)65.2%

Sample

1st row부산광역시 해운대구 해운대해변로 154, 마리나센타 3층 302호 (우동)
2nd row부산광역시 해운대구 재반로63번길 27, 2층 (재송동)
3rd row부산광역시 해운대구 구남로29번길 21 (중동, 리베라호텔 16층)
4th row부산광역시 해운대구 해운대해변로 140, 지하1층 (우동, 홈플러스 해운대점)
5th row부산광역시 해운대구 해운대로 813, 1층 133호 (좌동, ZIPOP상가)
ValueCountFrequency (%)
부산광역시 227
 
13.7%
해운대구 227
 
13.7%
우동 106
 
6.4%
재송동 37
 
2.2%
좌동 35
 
2.1%
센텀중앙로 34
 
2.1%
중동 33
 
2.0%
해운대해변로 26
 
1.6%
2층 17
 
1.0%
센텀동로 16
 
1.0%
Other values (397) 896
54.2%
2023-12-11T01:17:27.774190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1427
 
15.8%
349
 
3.9%
334
 
3.7%
332
 
3.7%
332
 
3.7%
, 319
 
3.5%
1 298
 
3.3%
270
 
3.0%
250
 
2.8%
232
 
2.6%
Other values (205) 4885
54.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5348
59.2%
Space Separator 1427
 
15.8%
Decimal Number 1420
 
15.7%
Other Punctuation 319
 
3.5%
Close Punctuation 226
 
2.5%
Open Punctuation 226
 
2.5%
Uppercase Letter 47
 
0.5%
Dash Punctuation 12
 
0.1%
Lowercase Letter 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
349
 
6.5%
334
 
6.2%
332
 
6.2%
332
 
6.2%
270
 
5.0%
250
 
4.7%
232
 
4.3%
229
 
4.3%
227
 
4.2%
227
 
4.2%
Other values (178) 2566
48.0%
Decimal Number
ValueCountFrequency (%)
1 298
21.0%
2 225
15.8%
0 212
14.9%
3 177
12.5%
5 92
 
6.5%
4 89
 
6.3%
9 87
 
6.1%
7 85
 
6.0%
6 80
 
5.6%
8 75
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
A 11
23.4%
P 8
17.0%
E 8
17.0%
B 6
12.8%
C 5
10.6%
T 3
 
6.4%
O 2
 
4.3%
I 2
 
4.3%
Z 2
 
4.3%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
e 1
50.0%
Space Separator
ValueCountFrequency (%)
1427
100.0%
Other Punctuation
ValueCountFrequency (%)
, 319
100.0%
Close Punctuation
ValueCountFrequency (%)
) 226
100.0%
Open Punctuation
ValueCountFrequency (%)
( 226
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5348
59.2%
Common 3631
40.2%
Latin 49
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
349
 
6.5%
334
 
6.2%
332
 
6.2%
332
 
6.2%
270
 
5.0%
250
 
4.7%
232
 
4.3%
229
 
4.3%
227
 
4.2%
227
 
4.2%
Other values (178) 2566
48.0%
Common
ValueCountFrequency (%)
1427
39.3%
, 319
 
8.8%
1 298
 
8.2%
) 226
 
6.2%
( 226
 
6.2%
2 225
 
6.2%
0 212
 
5.8%
3 177
 
4.9%
5 92
 
2.5%
4 89
 
2.5%
Other values (6) 340
 
9.4%
Latin
ValueCountFrequency (%)
A 11
22.4%
P 8
16.3%
E 8
16.3%
B 6
12.2%
C 5
10.2%
T 3
 
6.1%
O 2
 
4.1%
I 2
 
4.1%
Z 2
 
4.1%
s 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5348
59.2%
ASCII 3680
40.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1427
38.8%
, 319
 
8.7%
1 298
 
8.1%
) 226
 
6.1%
( 226
 
6.1%
2 225
 
6.1%
0 212
 
5.8%
3 177
 
4.8%
5 92
 
2.5%
4 89
 
2.4%
Other values (17) 389
 
10.6%
Hangul
ValueCountFrequency (%)
349
 
6.5%
334
 
6.2%
332
 
6.2%
332
 
6.2%
270
 
5.0%
250
 
4.7%
232
 
4.3%
229
 
4.3%
227
 
4.2%
227
 
4.2%
Other values (178) 2566
48.0%

연락처
Text

MISSING 

Distinct82
Distinct (%)91.1%
Missing137
Missing (%)60.4%
Memory size1.9 KiB
2023-12-11T01:17:28.008160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length10.355556
Min length7

Characters and Unicode

Total characters932
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)82.2%

Sample

1st row051-746-5655
2nd row744-0027
3rd row743-5449
4th row746-1118
5th row702-8260
ValueCountFrequency (%)
055-275-5511 2
 
2.2%
055-338-3721 2
 
2.2%
1661-3347 2
 
2.2%
051-742-3732 2
 
2.2%
742-8967 2
 
2.2%
704-9936 2
 
2.2%
051-338-8825 2
 
2.2%
051-257-1250 2
 
2.2%
517574877 1
 
1.1%
051-746-5655 1
 
1.1%
Other values (72) 72
80.0%
2023-12-11T01:17:28.370675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 135
14.5%
0 128
13.7%
7 114
12.2%
5 111
11.9%
1 97
10.4%
3 71
7.6%
4 70
7.5%
2 67
7.2%
8 59
6.3%
6 41
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 797
85.5%
Dash Punctuation 135
 
14.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 128
16.1%
7 114
14.3%
5 111
13.9%
1 97
12.2%
3 71
8.9%
4 70
8.8%
2 67
8.4%
8 59
7.4%
6 41
 
5.1%
9 39
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 135
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 932
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 135
14.5%
0 128
13.7%
7 114
12.2%
5 111
11.9%
1 97
10.4%
3 71
7.6%
4 70
7.5%
2 67
7.2%
8 59
6.3%
6 41
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 932
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 135
14.5%
0 128
13.7%
7 114
12.2%
5 111
11.9%
1 97
10.4%
3 71
7.6%
4 70
7.5%
2 67
7.2%
8 59
6.3%
6 41
 
4.4%

Interactions

2023-12-11T01:17:25.952457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:17:28.482134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종연락처
연번1.0000.9670.000
업종0.9671.0000.000
연락처0.0000.0001.000
2023-12-11T01:17:28.625880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.956
업종0.9561.000

Missing values

2023-12-11T01:17:26.049904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:17:26.130745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종상호소재지(도로명)연락처
01국내여행업(주)동아항공여행사부산광역시 해운대구 해운대해변로 154, 마리나센타 3층 302호 (우동)051-746-5655
12국내여행업(주)이트립포유부산광역시 해운대구 재반로63번길 27, 2층 (재송동)744-0027
23국내여행업관광가이드 부산(주)부산광역시 해운대구 구남로29번길 21 (중동, 리베라호텔 16층)743-5449
34국내여행업(주)토성투어부산광역시 해운대구 해운대해변로 140, 지하1층 (우동, 홈플러스 해운대점)746-1118
45국내여행업(주)로마여행사부산광역시 해운대구 해운대로 813, 1층 133호 (좌동, ZIPOP상가)702-8260
56국내여행업(주)해운관광부산광역시 해운대구 양운로 91 (좌동)<NA>
67국내여행업(주)하나에스엠여행사부산광역시 해운대구 반여로 131, 215호 (반여동, 아시아선수촌 프레스상가)<NA>
78국내여행업(주)토성관광부산광역시 해운대구 해운대해변로 140, 해운대홈플러스 지하1층 (우동)<NA>
89국내여행업(주)비너스여행사부산광역시 해운대구 센텀중앙로 142, 2동 107호 (재송동, 더샵센텀파크2차 상가)<NA>
910국내여행업(주)산수여행부산광역시 해운대구 센텀남대로 59, 6층 롯데제이티비호 (우동, 롯데백화점센텀시티점)<NA>
연번업종상호소재지(도로명)연락처
217218종합여행업(주)환상투어부산광역시 해운대구 센텀중앙로 66, 센텀 T 타워 1005호 (우동)051-469-7030
218219종합여행업(주)에스에이비코퍼레이션부산광역시 해운대구 APEC로 17, 센텀리더스마크 2310호 (우동)051-745-8877
219220종합여행업(주)여우투어부산광역시 해운대구 반송로924번길 4-2, 3층 (반송동)<NA>
220221종합여행업주식회사 캔슬마켓부산광역시 해운대구 센텀동로 45, 613호 (우동)070-4217-7755
221222종합여행업에스에스컴퍼니(해피멤버스)부산광역시 해운대구 센텀동로 200, C동 (재송동)1544-3395
222223종합여행업(주)서프홀릭부산광역시 해운대구 송정해변로 50, 2층 (송정동)701-4851
223224종합여행업씨아이티에스브이서비스(주)부산광역시 해운대구 마린시티2로 38, 씨1동 519~524,526호 (우동, 해운대아이파크)051-920-0800
224225종합여행업AQUA 여행컨설팅부산광역시 해운대구 구남로18번길 47, 블루비치텔 205호 (우동)051-410-6137
225226종합여행업주식회사 에이블부산광역시 해운대구 센텀중앙로 97, 센텀스카이비즈 에이동 2003호 (재송동)070-4710-2990
226227종합여행업위더스콘텐츠부산광역시 해운대구 센텀1로 28, 더블유비씨더팔레스오피스텔 101동 2602호 (우동)<NA>