Overview

Dataset statistics

Number of variables5
Number of observations62
Missing cells24
Missing cells (%)7.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory43.1 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description부산광역시_수영구_여행업등록현황_20220425
Author부산광역시 수영구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3042065

Alerts

연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
전화번호 has 24 (38.7%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:00:26.234420
Analysis finished2023-12-10 16:00:27.027095
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct62
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.5
Minimum1
Maximum62
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size690.0 B
2023-12-11T01:00:27.094502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.05
Q116.25
median31.5
Q346.75
95-th percentile58.95
Maximum62
Range61
Interquartile range (IQR)30.5

Descriptive statistics

Standard deviation18.041619
Coefficient of variation (CV)0.5727498
Kurtosis-1.2
Mean31.5
Median Absolute Deviation (MAD)15.5
Skewness0
Sum1953
Variance325.5
MonotonicityStrictly increasing
2023-12-11T01:00:27.218235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.6%
48 1
 
1.6%
35 1
 
1.6%
36 1
 
1.6%
37 1
 
1.6%
38 1
 
1.6%
39 1
 
1.6%
40 1
 
1.6%
41 1
 
1.6%
42 1
 
1.6%
Other values (52) 52
83.9%
ValueCountFrequency (%)
1 1
1.6%
2 1
1.6%
3 1
1.6%
4 1
1.6%
5 1
1.6%
6 1
1.6%
7 1
1.6%
8 1
1.6%
9 1
1.6%
10 1
1.6%
ValueCountFrequency (%)
62 1
1.6%
61 1
1.6%
60 1
1.6%
59 1
1.6%
58 1
1.6%
57 1
1.6%
56 1
1.6%
55 1
1.6%
54 1
1.6%
53 1
1.6%

업종
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size628.0 B
국내외여행업
35 
국내여행업
15 
종합여행업
12 

Length

Max length6
Median length6
Mean length5.5645161
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내외여행업 35
56.5%
국내여행업 15
24.2%
종합여행업 12
 
19.4%

Length

2023-12-11T01:00:27.351051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:00:27.475698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 35
56.5%
국내여행업 15
24.2%
종합여행업 12
 
19.4%

상호
Text

Distinct53
Distinct (%)85.5%
Missing0
Missing (%)0.0%
Memory size628.0 B
2023-12-11T01:00:27.704660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length13.5
Mean length8.2258065
Min length4

Characters and Unicode

Total characters510
Distinct characters138
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)71.0%

Sample

1st row(주)세명항공여행사
2nd row강남고속관광투어(주)
3rd row웰빙투어
4th row(주)월드시티 홀딩스
5th row성원여행사
ValueCountFrequency (%)
주식회사 3
 
4.1%
주)세명항공여행사 2
 
2.7%
트래블엔조이 2
 
2.7%
강남고속관광투어(주 2
 
2.7%
하하투어 2
 
2.7%
주)에버그린투어 2
 
2.7%
성원여행사 2
 
2.7%
주)월드시티 2
 
2.7%
홀딩스 2
 
2.7%
주)썬트레킹여행사 2
 
2.7%
Other values (50) 52
71.2%
2023-12-11T01:00:28.461440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 38
 
7.5%
) 38
 
7.5%
38
 
7.5%
22
 
4.3%
22
 
4.3%
21
 
4.1%
21
 
4.1%
19
 
3.7%
16
 
3.1%
11
 
2.2%
Other values (128) 264
51.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 407
79.8%
Open Punctuation 38
 
7.5%
Close Punctuation 38
 
7.5%
Space Separator 11
 
2.2%
Lowercase Letter 7
 
1.4%
Uppercase Letter 6
 
1.2%
Other Punctuation 3
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
9.3%
22
 
5.4%
22
 
5.4%
21
 
5.2%
21
 
5.2%
19
 
4.7%
16
 
3.9%
11
 
2.7%
8
 
2.0%
7
 
1.7%
Other values (111) 222
54.5%
Lowercase Letter
ValueCountFrequency (%)
b 1
14.3%
c 1
14.3%
o 1
14.3%
m 1
14.3%
r 1
14.3%
u 1
14.3%
t 1
14.3%
Uppercase Letter
ValueCountFrequency (%)
S 2
33.3%
G 1
16.7%
N 1
16.7%
P 1
16.7%
J 1
16.7%
Other Punctuation
ValueCountFrequency (%)
· 2
66.7%
. 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 38
100.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 407
79.8%
Common 90
 
17.6%
Latin 13
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
9.3%
22
 
5.4%
22
 
5.4%
21
 
5.2%
21
 
5.2%
19
 
4.7%
16
 
3.9%
11
 
2.7%
8
 
2.0%
7
 
1.7%
Other values (111) 222
54.5%
Latin
ValueCountFrequency (%)
S 2
15.4%
G 1
7.7%
N 1
7.7%
b 1
7.7%
c 1
7.7%
o 1
7.7%
m 1
7.7%
r 1
7.7%
u 1
7.7%
t 1
7.7%
Other values (2) 2
15.4%
Common
ValueCountFrequency (%)
( 38
42.2%
) 38
42.2%
11
 
12.2%
· 2
 
2.2%
. 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 407
79.8%
ASCII 101
 
19.8%
None 2
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 38
37.6%
) 38
37.6%
11
 
10.9%
S 2
 
2.0%
G 1
 
1.0%
N 1
 
1.0%
b 1
 
1.0%
c 1
 
1.0%
o 1
 
1.0%
m 1
 
1.0%
Other values (6) 6
 
5.9%
Hangul
ValueCountFrequency (%)
38
 
9.3%
22
 
5.4%
22
 
5.4%
21
 
5.2%
21
 
5.2%
19
 
4.7%
16
 
3.9%
11
 
2.7%
8
 
2.0%
7
 
1.7%
Other values (111) 222
54.5%
None
ValueCountFrequency (%)
· 2
100.0%
Distinct53
Distinct (%)85.5%
Missing0
Missing (%)0.0%
Memory size628.0 B
2023-12-11T01:00:28.770153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length38.5
Mean length33.306452
Min length19

Characters and Unicode

Total characters2065
Distinct characters113
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)72.6%

Sample

1st row부산광역시 수영구 수영로 532, 6층 (광안동, 부성빌딩)
2nd row부산광역시 수영구 과정로 55 (망미동, 망미1동 새마을금고)
3rd row부산광역시 수영구 수영로545번길 21, 708호 (광안동, 유원빌딩)
4th row부산광역시 수영구 광남로 22 (남천동)
5th row부산광역시 수영구 수영로 759, 808호 (수영동)
ValueCountFrequency (%)
부산광역시 62
 
15.0%
수영구 62
 
15.0%
수영로 24
 
5.8%
광안동 19
 
4.6%
남천동 14
 
3.4%
수영동 12
 
2.9%
망미동 9
 
2.2%
759 9
 
2.2%
민락동 8
 
1.9%
알파오피스텔 7
 
1.7%
Other values (114) 188
45.4%
2023-12-11T01:00:29.239566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
353
 
17.1%
110
 
5.3%
106
 
5.1%
100
 
4.8%
74
 
3.6%
, 69
 
3.3%
67
 
3.2%
65
 
3.1%
64
 
3.1%
64
 
3.1%
Other values (103) 993
48.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1186
57.4%
Space Separator 353
 
17.1%
Decimal Number 325
 
15.7%
Other Punctuation 69
 
3.3%
Open Punctuation 61
 
3.0%
Close Punctuation 61
 
3.0%
Dash Punctuation 6
 
0.3%
Uppercase Letter 3
 
0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
110
 
9.3%
106
 
8.9%
100
 
8.4%
74
 
6.2%
67
 
5.6%
65
 
5.5%
64
 
5.4%
64
 
5.4%
64
 
5.4%
62
 
5.2%
Other values (84) 410
34.6%
Decimal Number
ValueCountFrequency (%)
1 54
16.6%
2 44
13.5%
5 39
12.0%
3 36
11.1%
0 34
10.5%
6 32
9.8%
4 29
8.9%
7 25
7.7%
9 19
 
5.8%
8 13
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
M 1
33.3%
B 1
33.3%
C 1
33.3%
Space Separator
ValueCountFrequency (%)
353
100.0%
Other Punctuation
ValueCountFrequency (%)
, 69
100.0%
Open Punctuation
ValueCountFrequency (%)
( 61
100.0%
Close Punctuation
ValueCountFrequency (%)
) 61
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1186
57.4%
Common 875
42.4%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
110
 
9.3%
106
 
8.9%
100
 
8.4%
74
 
6.2%
67
 
5.6%
65
 
5.5%
64
 
5.4%
64
 
5.4%
64
 
5.4%
62
 
5.2%
Other values (84) 410
34.6%
Common
ValueCountFrequency (%)
353
40.3%
, 69
 
7.9%
( 61
 
7.0%
) 61
 
7.0%
1 54
 
6.2%
2 44
 
5.0%
5 39
 
4.5%
3 36
 
4.1%
0 34
 
3.9%
6 32
 
3.7%
Other values (5) 92
 
10.5%
Latin
ValueCountFrequency (%)
M 1
25.0%
B 1
25.0%
C 1
25.0%
e 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1186
57.4%
ASCII 879
42.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
353
40.2%
, 69
 
7.8%
( 61
 
6.9%
) 61
 
6.9%
1 54
 
6.1%
2 44
 
5.0%
5 39
 
4.4%
3 36
 
4.1%
0 34
 
3.9%
6 32
 
3.6%
Other values (9) 96
 
10.9%
Hangul
ValueCountFrequency (%)
110
 
9.3%
106
 
8.9%
100
 
8.4%
74
 
6.2%
67
 
5.6%
65
 
5.5%
64
 
5.4%
64
 
5.4%
64
 
5.4%
62
 
5.2%
Other values (84) 410
34.6%

전화번호
Text

MISSING 

Distinct32
Distinct (%)84.2%
Missing24
Missing (%)38.7%
Memory size628.0 B
2023-12-11T01:00:29.486377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.973684
Min length9

Characters and Unicode

Total characters455
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)68.4%

Sample

1st row051-621-7202
2nd row051-529-9898
3rd row051-919-2231
4th row051-747-6677
5th row051-752-4455
ValueCountFrequency (%)
051-756-4117 2
 
5.3%
051-752-4455 2
 
5.3%
051-529-9898 2
 
5.3%
051-502-6888 2
 
5.3%
051-621-7202 2
 
5.3%
051-747-6677 2
 
5.3%
051-760-1046 1
 
2.6%
051-898-4573 1
 
2.6%
051-624-2972 1
 
2.6%
051-903-0505 1
 
2.6%
Other values (22) 22
57.9%
2023-12-11T01:00:29.885981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 75
16.5%
0 68
14.9%
5 67
14.7%
1 59
13.0%
6 38
8.4%
7 35
7.7%
2 33
7.3%
8 25
 
5.5%
9 21
 
4.6%
4 20
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 380
83.5%
Dash Punctuation 75
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 68
17.9%
5 67
17.6%
1 59
15.5%
6 38
10.0%
7 35
9.2%
2 33
8.7%
8 25
 
6.6%
9 21
 
5.5%
4 20
 
5.3%
3 14
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 75
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 455
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 75
16.5%
0 68
14.9%
5 67
14.7%
1 59
13.0%
6 38
8.4%
7 35
7.7%
2 33
7.3%
8 25
 
5.5%
9 21
 
4.6%
4 20
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 455
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 75
16.5%
0 68
14.9%
5 67
14.7%
1 59
13.0%
6 38
8.4%
7 35
7.7%
2 33
7.3%
8 25
 
5.5%
9 21
 
4.6%
4 20
 
4.4%

Interactions

2023-12-11T01:00:26.804541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:00:30.029083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종상호소재지(도로명)전화번호
연번1.0000.9450.3490.6260.691
업종0.9451.0000.0000.4610.000
상호0.3490.0001.0000.9991.000
소재지(도로명)0.6260.4610.9991.0000.997
전화번호0.6910.0001.0000.9971.000
2023-12-11T01:00:30.159530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.876
업종0.8761.000

Missing values

2023-12-11T01:00:26.907879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:00:26.991964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종상호소재지(도로명)전화번호
01국내여행업(주)세명항공여행사부산광역시 수영구 수영로 532, 6층 (광안동, 부성빌딩)051-621-7202
12국내여행업강남고속관광투어(주)부산광역시 수영구 과정로 55 (망미동, 망미1동 새마을금고)051-529-9898
23국내여행업웰빙투어부산광역시 수영구 수영로545번길 21, 708호 (광안동, 유원빌딩)051-919-2231
34국내여행업(주)월드시티 홀딩스부산광역시 수영구 광남로 22 (남천동)<NA>
45국내여행업성원여행사부산광역시 수영구 수영로 759, 808호 (수영동)051-747-6677
56국내여행업(주)에버그린투어부산광역시 수영구 연수로 265-1, 1층 (망미동)<NA>
67국내여행업(주)썬트레킹여행사부산광역시 수영구 광안해변로 263, 파로스오피스텔 803호 (민락동)051-752-4455
78국내여행업동해여행사부산광역시 수영구 수영로 662 (광안동)<NA>
89국내여행업다모여투어부산광역시 수영구 수영로 754, 상가동 201호 (민락동, 센텀비스타동원)051-304-3991
910국내여행업하하투어부산광역시 수영구 수영로 411, 글로리메디컬센터 6층 641호 (남천동)<NA>
연번업종상호소재지(도로명)전화번호
5253종합여행업(주)잠언코리아부산광역시 수영구 수영로 759, 알파오피스텔 812호 (수영동)<NA>
5354종합여행업(주)헬로우에이젼시부산광역시 수영구 황령대로 479, 2층 (남천동)051-441-1955
5455종합여행업타피사코리아(주)부산광역시 수영구 수영로575번길 33, 4층 (광안동)051-624-2972
5556종합여행업(주)스마일코리아부산광역시 수영구 수영로 759, 알파오피스텔 지하1층 1311호 (수영동)1800-8757
5657종합여행업에이스투어부산광역시 수영구 수영로666번길 6, 삼정그린코아 711호 (광안동)<NA>
5758종합여행업제이하우스부산광역시 수영구 광남로 100, 10층 1005호 (광안동, 가비펠리치)<NA>
5859종합여행업주식회사 해암부산광역시 수영구 광일로 63, 102동 4층 402호 (광안동, 광원아파트)<NA>
5960종합여행업여왕의 놀이터부산광역시 수영구 연수로357번길 35 (수영동)<NA>
6061종합여행업(주)부산의아름다운길부산광역시 수영구 광남로 37, 하나은행 4층 (남천동)051-898-4573
6162종합여행업(주)뚱스커뮤니티부산광역시 수영구 수영로528번길 22, 601호 (광안동, 상상가)051-928-7788