Overview

Dataset statistics

Number of variables4
Number of observations245
Missing cells2
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.0 KiB
Average record size in memory33.5 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description2024년 1월 15일 기준 해운대구에 등록된 국내, 국내외, 종합여행업에 대한 데이터로 업종, 상호, 소재지, 연락처의 등록에 대한 현황자료입니다.
Author부산광역시 해운대구
URLhttps://www.data.go.kr/data/3075746/fileData.do

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 16:39:02.870200
Analysis finished2024-03-14 16:39:04.383296
Duration1.51 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct245
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean123
Minimum1
Maximum245
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2024-03-15T01:39:04.724275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.2
Q162
median123
Q3184
95-th percentile232.8
Maximum245
Range244
Interquartile range (IQR)122

Descriptive statistics

Standard deviation70.869599
Coefficient of variation (CV)0.5761756
Kurtosis-1.2
Mean123
Median Absolute Deviation (MAD)61
Skewness0
Sum30135
Variance5022.5
MonotonicityStrictly increasing
2024-03-15T01:39:05.159422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
155 1
 
0.4%
157 1
 
0.4%
158 1
 
0.4%
159 1
 
0.4%
160 1
 
0.4%
161 1
 
0.4%
162 1
 
0.4%
163 1
 
0.4%
164 1
 
0.4%
Other values (235) 235
95.9%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
245 1
0.4%
244 1
0.4%
243 1
0.4%
242 1
0.4%
241 1
0.4%
240 1
0.4%
239 1
0.4%
238 1
0.4%
237 1
0.4%
236 1
0.4%

업종
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
국내외여행업
96 
종합여행업
89 
국내여행업
60 

Length

Max length6
Median length5
Mean length5.3918367
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내외여행업 96
39.2%
종합여행업 89
36.3%
국내여행업 60
24.5%

Length

2024-03-15T01:39:05.595681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T01:39:05.928037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 96
39.2%
종합여행업 89
36.3%
국내여행업 60
24.5%

상호
Text

Distinct219
Distinct (%)89.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2024-03-15T01:39:06.856803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length21
Mean length8.6530612
Min length2

Characters and Unicode

Total characters2120
Distinct characters309
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique193 ?
Unique (%)78.8%

Sample

1st row(주)이트립포유
2nd row관광가이드 부산(주)
3rd row(주)토성투어
4th row(주)로마여행사
5th row(주)하나에스엠여행사
ValueCountFrequency (%)
주식회사 47
 
13.9%
여행사 6
 
1.8%
투어 4
 
1.2%
주)투어오션여행사 2
 
0.6%
주)마실 2
 
0.6%
주)행운컴퍼니 2
 
0.6%
글로벌탑투어 2
 
0.6%
주)해운대투어 2
 
0.6%
주)교실밖교실 2
 
0.6%
대어여행 2
 
0.6%
Other values (242) 267
79.0%
2024-03-15T01:39:08.286592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
180
 
8.5%
( 135
 
6.4%
) 135
 
6.4%
93
 
4.4%
92
 
4.3%
70
 
3.3%
67
 
3.2%
61
 
2.9%
61
 
2.9%
57
 
2.7%
Other values (299) 1169
55.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1650
77.8%
Open Punctuation 135
 
6.4%
Close Punctuation 135
 
6.4%
Space Separator 93
 
4.4%
Lowercase Letter 47
 
2.2%
Uppercase Letter 43
 
2.0%
Other Punctuation 6
 
0.3%
Math Symbol 4
 
0.2%
Dash Punctuation 3
 
0.1%
Decimal Number 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
180
 
10.9%
92
 
5.6%
70
 
4.2%
67
 
4.1%
61
 
3.7%
61
 
3.7%
57
 
3.5%
48
 
2.9%
48
 
2.9%
48
 
2.9%
Other values (248) 918
55.6%
Uppercase Letter
ValueCountFrequency (%)
T 8
18.6%
U 4
 
9.3%
C 4
 
9.3%
O 3
 
7.0%
L 3
 
7.0%
R 3
 
7.0%
E 3
 
7.0%
N 2
 
4.7%
S 2
 
4.7%
B 1
 
2.3%
Other values (10) 10
23.3%
Lowercase Letter
ValueCountFrequency (%)
r 6
12.8%
e 6
12.8%
a 5
10.6%
t 4
 
8.5%
l 3
 
6.4%
v 3
 
6.4%
i 3
 
6.4%
n 2
 
4.3%
s 2
 
4.3%
u 2
 
4.3%
Other values (9) 11
23.4%
Other Punctuation
ValueCountFrequency (%)
& 4
66.7%
, 1
 
16.7%
. 1
 
16.7%
Math Symbol
ValueCountFrequency (%)
> 3
75.0%
1
 
25.0%
Decimal Number
ValueCountFrequency (%)
8 2
66.7%
0 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 135
100.0%
Close Punctuation
ValueCountFrequency (%)
) 135
100.0%
Space Separator
ValueCountFrequency (%)
93
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1651
77.9%
Common 379
 
17.9%
Latin 90
 
4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
180
 
10.9%
92
 
5.6%
70
 
4.2%
67
 
4.1%
61
 
3.7%
61
 
3.7%
57
 
3.5%
48
 
2.9%
48
 
2.9%
48
 
2.9%
Other values (249) 919
55.7%
Latin
ValueCountFrequency (%)
T 8
 
8.9%
r 6
 
6.7%
e 6
 
6.7%
a 5
 
5.6%
U 4
 
4.4%
t 4
 
4.4%
C 4
 
4.4%
O 3
 
3.3%
l 3
 
3.3%
v 3
 
3.3%
Other values (29) 44
48.9%
Common
ValueCountFrequency (%)
( 135
35.6%
) 135
35.6%
93
24.5%
& 4
 
1.1%
> 3
 
0.8%
- 3
 
0.8%
8 2
 
0.5%
0 1
 
0.3%
1
 
0.3%
, 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1650
77.8%
ASCII 468
 
22.1%
None 1
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
180
 
10.9%
92
 
5.6%
70
 
4.2%
67
 
4.1%
61
 
3.7%
61
 
3.7%
57
 
3.5%
48
 
2.9%
48
 
2.9%
48
 
2.9%
Other values (248) 918
55.6%
ASCII
ValueCountFrequency (%)
( 135
28.8%
) 135
28.8%
93
19.9%
T 8
 
1.7%
r 6
 
1.3%
e 6
 
1.3%
a 5
 
1.1%
U 4
 
0.9%
& 4
 
0.9%
t 4
 
0.9%
Other values (39) 68
14.5%
None
ValueCountFrequency (%)
1
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%
Distinct210
Distinct (%)86.4%
Missing2
Missing (%)0.8%
Memory size2.0 KiB
2024-03-15T01:39:09.494582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length47
Mean length39.209877
Min length22

Characters and Unicode

Total characters9528
Distinct characters223
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique183 ?
Unique (%)75.3%

Sample

1st row부산광역시 해운대구 재반로63번길 27,
2nd row부산광역시 해운대구 구남로29번길 21 (중동, 리베라호텔 16층)
3rd row부산광역시 해운대구 해운대해변로 140, 지하1층 (우동, 홈플러스 해운대점)
4th row부산광역시 해운대구 해운대로 813, 1층 133호 (좌동, ZIPOP상가)
5th row부산광역시 해운대구 반여로 131, 215호 (반여동, 아시아선수촌 프레스상가)
ValueCountFrequency (%)
부산광역시 243
 
13.8%
해운대구 243
 
13.8%
우동 114
 
6.5%
재송동 42
 
2.4%
중동 37
 
2.1%
센텀중앙로 36
 
2.1%
좌동 30
 
1.7%
해운대해변로 22
 
1.3%
센텀동로 21
 
1.2%
2층 19
 
1.1%
Other values (428) 948
54.0%
2024-03-15T01:39:11.472189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1513
 
15.9%
355
 
3.7%
342
 
3.6%
339
 
3.6%
339
 
3.6%
, 321
 
3.4%
1 312
 
3.3%
294
 
3.1%
271
 
2.8%
249
 
2.6%
Other values (213) 5193
54.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5616
58.9%
Space Separator 1513
 
15.9%
Decimal Number 1513
 
15.9%
Other Punctuation 321
 
3.4%
Close Punctuation 243
 
2.6%
Open Punctuation 243
 
2.6%
Uppercase Letter 56
 
0.6%
Dash Punctuation 20
 
0.2%
Lowercase Letter 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
355
 
6.3%
342
 
6.1%
339
 
6.0%
339
 
6.0%
294
 
5.2%
271
 
4.8%
249
 
4.4%
244
 
4.3%
244
 
4.3%
243
 
4.3%
Other values (186) 2696
48.0%
Decimal Number
ValueCountFrequency (%)
1 312
20.6%
2 243
16.1%
0 228
15.1%
3 193
12.8%
9 105
 
6.9%
4 99
 
6.5%
5 93
 
6.1%
7 89
 
5.9%
6 76
 
5.0%
8 75
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
A 13
23.2%
P 11
19.6%
E 10
17.9%
C 8
14.3%
B 4
 
7.1%
T 4
 
7.1%
O 2
 
3.6%
I 2
 
3.6%
Z 2
 
3.6%
Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
e 1
50.0%
Space Separator
ValueCountFrequency (%)
1513
100.0%
Other Punctuation
ValueCountFrequency (%)
, 321
100.0%
Close Punctuation
ValueCountFrequency (%)
) 243
100.0%
Open Punctuation
ValueCountFrequency (%)
( 243
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5616
58.9%
Common 3854
40.4%
Latin 58
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
355
 
6.3%
342
 
6.1%
339
 
6.0%
339
 
6.0%
294
 
5.2%
271
 
4.8%
249
 
4.4%
244
 
4.3%
244
 
4.3%
243
 
4.3%
Other values (186) 2696
48.0%
Common
ValueCountFrequency (%)
1513
39.3%
, 321
 
8.3%
1 312
 
8.1%
2 243
 
6.3%
) 243
 
6.3%
( 243
 
6.3%
0 228
 
5.9%
3 193
 
5.0%
9 105
 
2.7%
4 99
 
2.6%
Other values (6) 354
 
9.2%
Latin
ValueCountFrequency (%)
A 13
22.4%
P 11
19.0%
E 10
17.2%
C 8
13.8%
B 4
 
6.9%
T 4
 
6.9%
O 2
 
3.4%
I 2
 
3.4%
Z 2
 
3.4%
s 1
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5616
58.9%
ASCII 3912
41.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1513
38.7%
, 321
 
8.2%
1 312
 
8.0%
2 243
 
6.2%
) 243
 
6.2%
( 243
 
6.2%
0 228
 
5.8%
3 193
 
4.9%
9 105
 
2.7%
4 99
 
2.5%
Other values (17) 412
 
10.5%
Hangul
ValueCountFrequency (%)
355
 
6.3%
342
 
6.1%
339
 
6.0%
339
 
6.0%
294
 
5.2%
271
 
4.8%
249
 
4.4%
244
 
4.3%
244
 
4.3%
243
 
4.3%
Other values (186) 2696
48.0%

Interactions

2024-03-15T01:39:03.353431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T01:39:11.701810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.939
업종0.9391.000
2024-03-15T01:39:11.970437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.909
업종0.9091.000

Missing values

2024-03-15T01:39:03.958668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T01:39:04.264455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종상호소재지(도로명)
01국내여행업(주)이트립포유부산광역시 해운대구 재반로63번길 27,
12국내여행업관광가이드 부산(주)부산광역시 해운대구 구남로29번길 21 (중동, 리베라호텔 16층)
23국내여행업(주)토성투어부산광역시 해운대구 해운대해변로 140, 지하1층 (우동, 홈플러스 해운대점)
34국내여행업(주)로마여행사부산광역시 해운대구 해운대로 813, 1층 133호 (좌동, ZIPOP상가)
45국내여행업(주)하나에스엠여행사부산광역시 해운대구 반여로 131, 215호 (반여동, 아시아선수촌 프레스상가)
56국내여행업일일관광부산광역시 해운대구 마린시티3로 52 (우동)
67국내여행업(주)파라디아트레블부산광역시 해운대구 해운대해변로 296, 별관동 1층 (중동, 파라다이스호텔부산)
78국내여행업(주)케이비트래블부산광역시 해운대구 좌동순환로 511 (중동, 해운대이마트1층)
89국내여행업(주)늘봄 여행사부산광역시 해운대구 센텀남대로 35, 9층 (우동, 신세계센텀시티점)
910국내여행업건건테마여행사부산광역시 해운대구 해운대로 550, 102동 3602호 (우동, 해운대한신휴플러스)
연번업종상호소재지(도로명)
235236종합여행업천홍국제관광부산광역시 해운대구 해운대로81번길 24, 3층 3B-11호 (재송동)
236237종합여행업(주)그라운드케이부산광역시 해운대구 센텀동로 45, 405호 (우동)
237238종합여행업(주)비아이벤처스부산광역시 해운대구 센텀동로 45 (주)웹스 305호 (우동)
238239종합여행업주식회사 부일기획부산광역시 해운대구 센텀7로 6, 201호 (우동)
239240종합여행업머스트플레이부산광역시 해운대구 해운대로108번길 22, 센텀 센트레빌 플래비뉴 101동 1802호 (재송동)
240241종합여행업루시드플랜부산광역시 해운대구 해운대로 794, 엘리움 1403호 (좌동)
241242종합여행업(주)노바투어부산광역시 해운대구 해운대해변로 291, 크리스탈비치오피스텔 402호 (중동)
242243종합여행업데이제로컴퍼니부산광역시 해운대구 재반로256번길 43-10, 101, 103호 (반여동)
243244종합여행업주식회사 월드투어플랫폼부산광역시 해운대구 대천로 38, 402호 (중동)
244245종합여행업주식회사 케이네비부산광역시 해운대구 센텀동로 35, 센텀에스에이치밸리 803호 (우동)