Overview

Dataset statistics

Number of variables5
Number of observations231
Missing cells75
Missing cells (%)6.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.4 KiB
Average record size in memory41.6 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description부산광역시해운대구_여행업현황_20210119
Author부산광역시 해운대구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3075746

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
연락처 has 75 (32.5%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:17:30.557984
Analysis finished2023-12-10 16:17:31.142120
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct231
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean116
Minimum1
Maximum231
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-11T01:17:31.209143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.5
Q158.5
median116
Q3173.5
95-th percentile219.5
Maximum231
Range230
Interquartile range (IQR)115

Descriptive statistics

Standard deviation66.828138
Coefficient of variation (CV)0.57610464
Kurtosis-1.2
Mean116
Median Absolute Deviation (MAD)58
Skewness0
Sum26796
Variance4466
MonotonicityStrictly increasing
2023-12-11T01:17:31.343869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
160 1
 
0.4%
148 1
 
0.4%
149 1
 
0.4%
150 1
 
0.4%
151 1
 
0.4%
152 1
 
0.4%
153 1
 
0.4%
154 1
 
0.4%
155 1
 
0.4%
Other values (221) 221
95.7%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
231 1
0.4%
230 1
0.4%
229 1
0.4%
228 1
0.4%
227 1
0.4%
226 1
0.4%
225 1
0.4%
224 1
0.4%
223 1
0.4%
222 1
0.4%

업종
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
국외여행업
92 
국내여행업
79 
일반여행업
60 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국외여행업 92
39.8%
국내여행업 79
34.2%
일반여행업 60
26.0%

Length

2023-12-11T01:17:31.469589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:17:31.555737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국외여행업 92
39.8%
국내여행업 79
34.2%
일반여행업 60
26.0%

상호
Text

Distinct187
Distinct (%)81.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T01:17:31.781492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length15
Mean length8.4285714
Min length2

Characters and Unicode

Total characters1947
Distinct characters267
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique143 ?
Unique (%)61.9%

Sample

1st row(주)한창여행사
2nd row(주)동아항공여행사
3rd row(주)이트립포유
4th row관광가이드 부산(주)
5th row(주)토성투어
ValueCountFrequency (%)
주식회사 30
 
9.9%
여행사 6
 
2.0%
투어 5
 
1.6%
트래블 3
 
1.0%
비자 2
 
0.7%
2
 
0.7%
주)해운대여행사 2
 
0.7%
지앤투어 2
 
0.7%
선경투어 2
 
0.7%
주)투어오션여행사 2
 
0.7%
Other values (208) 248
81.6%
2023-12-11T01:17:32.229313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
168
 
8.6%
( 138
 
7.1%
) 138
 
7.1%
84
 
4.3%
78
 
4.0%
74
 
3.8%
73
 
3.7%
69
 
3.5%
69
 
3.5%
45
 
2.3%
Other values (257) 1011
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1506
77.3%
Open Punctuation 138
 
7.1%
Close Punctuation 138
 
7.1%
Space Separator 73
 
3.7%
Lowercase Letter 43
 
2.2%
Uppercase Letter 41
 
2.1%
Other Punctuation 5
 
0.3%
Decimal Number 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
168
 
11.2%
84
 
5.6%
78
 
5.2%
74
 
4.9%
69
 
4.6%
69
 
4.6%
45
 
3.0%
37
 
2.5%
31
 
2.1%
31
 
2.1%
Other values (213) 820
54.4%
Lowercase Letter
ValueCountFrequency (%)
r 5
11.6%
a 5
11.6%
e 5
11.6%
i 4
9.3%
g 3
 
7.0%
l 3
 
7.0%
t 3
 
7.0%
v 3
 
7.0%
n 2
 
4.7%
s 2
 
4.7%
Other values (8) 8
18.6%
Uppercase Letter
ValueCountFrequency (%)
T 5
12.2%
C 5
12.2%
U 4
9.8%
B 4
9.8%
R 3
 
7.3%
O 3
 
7.3%
N 2
 
4.9%
M 2
 
4.9%
J 2
 
4.9%
G 2
 
4.9%
Other values (8) 9
22.0%
Other Punctuation
ValueCountFrequency (%)
& 2
40.0%
. 2
40.0%
, 1
20.0%
Decimal Number
ValueCountFrequency (%)
8 2
66.7%
0 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 138
100.0%
Close Punctuation
ValueCountFrequency (%)
) 138
100.0%
Space Separator
ValueCountFrequency (%)
73
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1506
77.3%
Common 357
 
18.3%
Latin 84
 
4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
168
 
11.2%
84
 
5.6%
78
 
5.2%
74
 
4.9%
69
 
4.6%
69
 
4.6%
45
 
3.0%
37
 
2.5%
31
 
2.1%
31
 
2.1%
Other values (213) 820
54.4%
Latin
ValueCountFrequency (%)
r 5
 
6.0%
a 5
 
6.0%
T 5
 
6.0%
e 5
 
6.0%
C 5
 
6.0%
U 4
 
4.8%
B 4
 
4.8%
i 4
 
4.8%
g 3
 
3.6%
l 3
 
3.6%
Other values (26) 41
48.8%
Common
ValueCountFrequency (%)
( 138
38.7%
) 138
38.7%
73
20.4%
& 2
 
0.6%
8 2
 
0.6%
. 2
 
0.6%
0 1
 
0.3%
, 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1506
77.3%
ASCII 441
 
22.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
168
 
11.2%
84
 
5.6%
78
 
5.2%
74
 
4.9%
69
 
4.6%
69
 
4.6%
45
 
3.0%
37
 
2.5%
31
 
2.1%
31
 
2.1%
Other values (213) 820
54.4%
ASCII
ValueCountFrequency (%)
( 138
31.3%
) 138
31.3%
73
16.6%
r 5
 
1.1%
a 5
 
1.1%
T 5
 
1.1%
e 5
 
1.1%
C 5
 
1.1%
U 4
 
0.9%
B 4
 
0.9%
Other values (34) 59
13.4%
Distinct187
Distinct (%)81.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-11T01:17:32.563637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length47
Mean length39.272727
Min length21

Characters and Unicode

Total characters9072
Distinct characters203
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique144 ?
Unique (%)62.3%

Sample

1st row부산광역시 해운대구 해운대로 658, 2층 (우동)
2nd row부산광역시 해운대구 해운대해변로 154, 마리나센타 3층 302호 (우동)
3rd row부산광역시 해운대구 재반로63번길 27, 2층 (재송동)
4th row부산광역시 해운대구 구남로29번길 21 (중동, 리베라호텔 16층)
5th row부산광역시 해운대구 해운대해변로 140, 지하1층 (우동, 홈플러스 해운대점)
ValueCountFrequency (%)
부산광역시 231
 
13.8%
해운대구 231
 
13.8%
우동 117
 
7.0%
센텀중앙로 37
 
2.2%
재송동 36
 
2.2%
좌동 35
 
2.1%
중동 28
 
1.7%
해운대해변로 24
 
1.4%
해운대로 20
 
1.2%
센텀동로 17
 
1.0%
Other values (387) 895
53.6%
2023-12-11T01:17:33.060601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1440
 
15.9%
351
 
3.9%
341
 
3.8%
337
 
3.7%
333
 
3.7%
, 323
 
3.6%
1 304
 
3.4%
272
 
3.0%
254
 
2.8%
233
 
2.6%
Other values (193) 4884
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5378
59.3%
Space Separator 1440
 
15.9%
Decimal Number 1418
 
15.6%
Other Punctuation 323
 
3.6%
Close Punctuation 229
 
2.5%
Open Punctuation 229
 
2.5%
Uppercase Letter 44
 
0.5%
Dash Punctuation 9
 
0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
351
 
6.5%
341
 
6.3%
337
 
6.3%
333
 
6.2%
272
 
5.1%
254
 
4.7%
233
 
4.3%
232
 
4.3%
231
 
4.3%
231
 
4.3%
Other values (167) 2563
47.7%
Decimal Number
ValueCountFrequency (%)
1 304
21.4%
2 214
15.1%
0 211
14.9%
3 173
12.2%
5 97
 
6.8%
4 87
 
6.1%
9 84
 
5.9%
7 84
 
5.9%
8 82
 
5.8%
6 82
 
5.8%
Uppercase Letter
ValueCountFrequency (%)
A 11
25.0%
E 8
18.2%
P 7
15.9%
B 6
13.6%
T 3
 
6.8%
C 3
 
6.8%
O 2
 
4.5%
Z 2
 
4.5%
I 2
 
4.5%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
e 1
50.0%
Space Separator
ValueCountFrequency (%)
1440
100.0%
Other Punctuation
ValueCountFrequency (%)
, 323
100.0%
Close Punctuation
ValueCountFrequency (%)
) 229
100.0%
Open Punctuation
ValueCountFrequency (%)
( 229
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5378
59.3%
Common 3648
40.2%
Latin 46
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
351
 
6.5%
341
 
6.3%
337
 
6.3%
333
 
6.2%
272
 
5.1%
254
 
4.7%
233
 
4.3%
232
 
4.3%
231
 
4.3%
231
 
4.3%
Other values (167) 2563
47.7%
Common
ValueCountFrequency (%)
1440
39.5%
, 323
 
8.9%
1 304
 
8.3%
) 229
 
6.3%
( 229
 
6.3%
2 214
 
5.9%
0 211
 
5.8%
3 173
 
4.7%
5 97
 
2.7%
4 87
 
2.4%
Other values (5) 341
 
9.3%
Latin
ValueCountFrequency (%)
A 11
23.9%
E 8
17.4%
P 7
15.2%
B 6
13.0%
T 3
 
6.5%
C 3
 
6.5%
O 2
 
4.3%
Z 2
 
4.3%
I 2
 
4.3%
s 1
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5378
59.3%
ASCII 3694
40.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1440
39.0%
, 323
 
8.7%
1 304
 
8.2%
) 229
 
6.2%
( 229
 
6.2%
2 214
 
5.8%
0 211
 
5.7%
3 173
 
4.7%
5 97
 
2.6%
4 87
 
2.4%
Other values (16) 387
 
10.5%
Hangul
ValueCountFrequency (%)
351
 
6.5%
341
 
6.3%
337
 
6.3%
333
 
6.2%
272
 
5.1%
254
 
4.7%
233
 
4.3%
232
 
4.3%
231
 
4.3%
231
 
4.3%
Other values (167) 2563
47.7%

연락처
Text

MISSING 

Distinct123
Distinct (%)78.8%
Missing75
Missing (%)32.5%
Memory size1.9 KiB
2023-12-11T01:17:33.292372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.99359
Min length9

Characters and Unicode

Total characters1871
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)57.7%

Sample

1st row051-731-6666
2nd row051-746-5655
3rd row051-744-0027
4th row051-743-5449
5th row051-746-1118
ValueCountFrequency (%)
051-744-0027 2
 
1.3%
051-746-2999 2
 
1.3%
051-742-0707 2
 
1.3%
051-633-1161 2
 
1.3%
051-747-2525 2
 
1.3%
051-743-1159 2
 
1.3%
051-747-9582 2
 
1.3%
051-747-4666 2
 
1.3%
051-741-8390 2
 
1.3%
051-746-5655 2
 
1.3%
Other values (113) 136
87.2%
2023-12-11T01:17:33.680883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 308
16.5%
0 306
16.4%
1 238
12.7%
7 216
11.5%
5 214
11.4%
4 130
6.9%
8 108
 
5.8%
3 97
 
5.2%
6 95
 
5.1%
2 82
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1563
83.5%
Dash Punctuation 308
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 306
19.6%
1 238
15.2%
7 216
13.8%
5 214
13.7%
4 130
8.3%
8 108
 
6.9%
3 97
 
6.2%
6 95
 
6.1%
2 82
 
5.2%
9 77
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 308
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1871
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 308
16.5%
0 306
16.4%
1 238
12.7%
7 216
11.5%
5 214
11.4%
4 130
6.9%
8 108
 
5.8%
3 97
 
5.2%
6 95
 
5.1%
2 82
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1871
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 308
16.5%
0 306
16.4%
1 238
12.7%
7 216
11.5%
5 214
11.4%
4 130
6.9%
8 108
 
5.8%
3 97
 
5.2%
6 95
 
5.1%
2 82
 
4.4%

Interactions

2023-12-11T01:17:30.882509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:17:33.766038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.939
업종0.9391.000
2023-12-11T01:17:33.863199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.910
업종0.9101.000

Missing values

2023-12-11T01:17:31.001583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:17:31.102713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종상호소재지(도로명)연락처
01국내여행업(주)한창여행사부산광역시 해운대구 해운대로 658, 2층 (우동)051-731-6666
12국내여행업(주)동아항공여행사부산광역시 해운대구 해운대해변로 154, 마리나센타 3층 302호 (우동)051-746-5655
23국내여행업(주)이트립포유부산광역시 해운대구 재반로63번길 27, 2층 (재송동)051-744-0027
34국내여행업관광가이드 부산(주)부산광역시 해운대구 구남로29번길 21 (중동, 리베라호텔 16층)051-743-5449
45국내여행업(주)토성투어부산광역시 해운대구 해운대해변로 140, 지하1층 (우동, 홈플러스 해운대점)051-746-1118
56국내여행업(주)로마여행사부산광역시 해운대구 해운대로 813, 1층 133호 (좌동, ZIPOP상가)051-702-8260
67국내여행업(주)해운관광부산광역시 해운대구 양운로 91 (좌동)<NA>
78국내여행업(주)하나에스엠여행사부산광역시 해운대구 반여로 131, 215호 (반여동, 아시아선수촌 프레스상가)051-997-8789
89국내여행업(주)토성관광부산광역시 해운대구 해운대해변로 140, 해운대홈플러스 지하1층 (우동)051-743-0131
910국내여행업(주)비너스여행사부산광역시 해운대구 센텀중앙로 142, 2동 107호 (재송동, 더샵센텀파크2차 상가)051-465-7500
연번업종상호소재지(도로명)연락처
221222일반여행업제라네이처(주)부산광역시 해운대구 마린시티1로 147, 우신골든스위트 208호 (우동)02-6978-9521
222223일반여행업(주)오래부산광역시 해운대구 센텀중앙로 97, 센텀스카이비즈 A동 602호 (재송동)051-717-0300
223224일반여행업(주)여행보내주는남자부산광역시 해운대구 마린시티3로 1, 지하1층 125호 (우동, 썬프라자)051-807-3007
224225일반여행업에이치엔핀코어(주)부산광역시 해운대구 센텀동로 99, 벽산이센텀클래스원 1408호 (재송동)070-7204-9101
225226일반여행업펀펀투어부산광역시 해운대구 마린시티1로 127, 아라트리움 3303호 (우동)<NA>
226227일반여행업주식회사 코렘투어부산광역시 해운대구 선수촌로 74-17 (반여동)051-782-0193
227228일반여행업(주)신라투어부산광역시 해운대구 센텀북대로 60, 센텀아이에스타워 803호 (재송동)051-553-1133
228229일반여행업(주)신라투어고속부산광역시 해운대구 센텀북대로 60, 센텀아이에스타워 803호 (재송동)051-989-1133
229230일반여행업(주)포유커뮤니케이션즈부산광역시 해운대구 센텀동로 99, 벽산이센텀클래스원 1001호 (재송동)051-552-7978
230231일반여행업(주)보배여행사부산광역시 해운대구 센텀6로 21, 인텔리움센텀 406호 (우동)051-463-9100