Overview

Dataset statistics

Number of variables: 5
Number of observations: 66
Missing cells: 11
Missing cells (%): 3.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 43.0 B
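
The percentages and sizes in this block follow from the counts by simple arithmetic. The sketch below re-derives them from the figures shown above; the variable names are illustrative and are not part of the report.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 5
n_observations = 66
missing_cells = 11
avg_record_bytes = 43.0  # reported average record size in memory

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size times the row count gives the total footprint.
total_bytes = avg_record_bytes * n_observations
total_kib = total_bytes / 1024

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"total size: {total_kib:.1f} KiB")
```

All 11 missing cells belong to a single column, which is why the per-variable missing rate reported later is much higher than the dataset-wide rate.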

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

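Description: 부산광역시_수영구_여행업등록현황_20230711 (Busan Metropolitan City, Suyeong-gu, travel business registration status, 2023-07-11)
Author: 부산광역시 수영구 (Suyeong-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3042065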

Alerts

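연번 (serial number) is highly overall correlated with 업종 (business type) (High correlation)
업종 (business type) is highly overall correlated with 연번 (serial number) (High correlation)
전화번호 (phone number) has 11 (16.7%) missing values (Missing)
연번 (serial number) has unique values (Unique)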

Reproduction

Analysis started: 2023-12-10 16:00:18.621096
Analysis finished: 2023-12-10 16:00:20.721989
Duration: 2.1 seconds
Software version: ydata-profiling v4.5.1
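
A report of this kind can be regenerated with ydata-profiling's `ProfileReport` class. The sketch below is a minimal outline, not the configuration actually used for this run: the helper name `build_profile`, the CSV path, the output file name, and the CP949 encoding are all assumptions.

```python
def build_profile(csv_path, output_html="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so this module loads even where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the portal CSV is CP949-encoded; pass "utf-8" if it is not.
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is taken from the dataset description in this report.
    profile = ProfileReport(df, title="부산광역시_수영구_여행업등록현황_20230711")
    profile.to_file(output_html)
    return profile
```

Calling `build_profile("travel_agencies.csv")` with a local copy of the dataset (the file name is hypothetical) would write `report.html` next to the script.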

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 66
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 33.5
Minimum: 1
Maximum: 66
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 726.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.25
Q1: 17.25
Median: 33.5
Q3: 49.75
95-th percentile: 62.75
Maximum: 66
Range: 65
Interquartile range (IQR): 32.5
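
Because 연번 takes each value from 1 to 66 exactly once, the quantiles above can be checked by hand. The sketch below assumes linear interpolation between neighbouring order statistics, which is the default rule in NumPy and pandas; the helper `quantile` is illustrative.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    position = q * (len(sorted_values) - 1)  # fractional index into the data
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

# 연번 is a serial number running from 1 to 66 with no gaps.
serial_numbers = list(range(1, 67))

p5, q1, median, q3, p95 = (
    quantile(serial_numbers, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95)
)
iqr = q3 - q1
value_range = serial_numbers[-1] - serial_numbers[0]

print(p5, q1, median, q3, p95, iqr, value_range)
```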

Descriptive statistics

Standard deviation: 19.196354
Coefficient of variation (CV): 0.57302549
Kurtosis: -1.2
Mean: 33.5
Median Absolute Deviation (MAD): 16.5
Skewness: 0
Sum: 2211
Variance: 368.5
Monotonicity: Strictly increasing
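
Most of these descriptive statistics can be reproduced with the Python standard library. The sketch below assumes the sample (n - 1) variance, which matches the reported 368.5; kurtosis and skewness are left out because their estimator is not stated in the report.

```python
import statistics

# 연번 is a serial number running from 1 to 66 with no gaps.
serial_numbers = list(range(1, 67))

total = sum(serial_numbers)
mean = statistics.mean(serial_numbers)
variance = statistics.variance(serial_numbers)  # sample variance, divides by n - 1
std_dev = statistics.stdev(serial_numbers)
cv = std_dev / mean                             # coefficient of variation

# Median absolute deviation: median distance from the median.
median = statistics.median(serial_numbers)
mad = statistics.median(abs(x - median) for x in serial_numbers)

# Strictly increasing means every value exceeds its predecessor.
strictly_increasing = all(a < b for a, b in zip(serial_numbers, serial_numbers[1:]))

print(total, mean, variance, round(std_dev, 6), round(cv, 8), mad, strictly_increasing)
```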
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 1.5%
51 | 1 | 1.5%
37 | 1 | 1.5%
38 | 1 | 1.5%
39 | 1 | 1.5%
40 | 1 | 1.5%
41 | 1 | 1.5%
42 | 1 | 1.5%
43 | 1 | 1.5%
44 | 1 | 1.5%
Other values (56) | 56 | 84.8%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.5%
2 | 1 | 1.5%
3 | 1 | 1.5%
4 | 1 | 1.5%
5 | 1 | 1.5%
6 | 1 | 1.5%
7 | 1 | 1.5%
8 | 1 | 1.5%
9 | 1 | 1.5%
10 | 1 | 1.5%

Maximum 10 values

Value | Count | Frequency (%)
66 | 1 | 1.5%
65 | 1 | 1.5%
64 | 1 | 1.5%
63 | 1 | 1.5%
62 | 1 | 1.5%
61 | 1 | 1.5%
60 | 1 | 1.5%
59 | 1 | 1.5%
58 | 1 | 1.5%
57 | 1 | 1.5%

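업종 (business type)
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Memory size: 660.0 B

국내외여행업 (domestic and overseas travel business): 36
종합여행업 (general travel business): 19
국내여행업 (domestic travel business): 11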

Length

Max length: 6
Median length: 6
Mean length: 5.5454545
Min length: 5
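
The length statistics follow directly from the three category labels and their counts. The sketch below recomputes them, assuming length is measured in Unicode code points of the precomposed Hangul strings.

```python
import statistics

# Category counts as listed for 업종 in this report.
category_counts = {"국내외여행업": 36, "종합여행업": 19, "국내여행업": 11}

total_rows = sum(category_counts.values())

# One length entry per row, so repeated categories are weighted by frequency.
lengths = sorted(
    len(name) for name, count in category_counts.items() for _ in range(count)
)

mean_length = sum(lengths) / total_rows
median_length = statistics.median(lengths)

print(total_rows, min(lengths), max(lengths), round(mean_length, 7), median_length)
```

The median is 6 because more than half of the rows (36 of 66) carry the six-character label.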

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국내여행업
2nd row: 국내여행업
3rd row: 국내여행업
4th row: 국내여행업
5th row: 국내여행업

Common Values

Value | Count | Frequency (%)
국내외여행업 | 36 | 54.5%
종합여행업 | 19 | 28.8%
국내여행업 | 11 | 16.7%

Length

Histogram of lengths of the category


업체명 (company name)
Text

Distinct: 62
Distinct (%): 93.9%
Missing: 0
Missing (%): 0.0%
Memory size: 660.0 B

Length

Max length: 18
Median length: 11
Mean length: 8.0151515
Min length: 3

Characters and Unicode

Total characters: 529
Distinct characters: 149
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
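
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module over the five sample names shown for this variable. It is not the profiler's own implementation, and the readable category names are a hand-written mapping limited to the codes that occur here.

```python
import unicodedata
from collections import Counter

# The five sample values listed for 업체명 in this report.
names = ["강남고속관광투어(주)", "웰빙투어", "(주)에버그린투어", "동해여행사", "다모여투어"]

# unicodedata.category returns a two-letter general category code per character.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

# Readable labels for the codes that appear in these samples.
readable = {"Lo": "Other Letter", "Ps": "Open Punctuation", "Pe": "Close Punctuation"}

for code, count in category_counts.most_common():
    print(readable.get(code, code), count)
```

Hangul syllables fall under Other Letter (`Lo`) because the script has no upper or lower case, which is why that category dominates the tables in this section.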

Unique

Unique: 58
Unique (%): 87.9%

Sample

1st row: 강남고속관광투어(주)
2nd row: 웰빙투어
3rd row: (주)에버그린투어
4th row: 동해여행사
5th row: 다모여투어

Value | Count | Frequency (%)
주식회사 | 4 | 5.4%
강남고속관광투어(주 | 2 | 2.7%
주)에버그린투어 | 2 | 2.7%
하하투어 | 2 | 2.7%
주)부산자유여행사 | 2 | 2.7%
주)그리다 | 1 | 1.4%
에이스투어 | 1 | 1.4%
투어올레 | 1 | 1.4%
에이블투어 | 1 | 1.4%
주)월드시티글로벌 | 1 | 1.4%
Other values (57) | 57 | 77.0%

Most occurring characters

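Value | Count | Frequency (%)
(Hangul character) | 44 | 8.3%
) | 38 | 7.2%
( | 38 | 7.2%
(Hangul character) | 27 | 5.1%
(Hangul character) | 26 | 4.9%
(Hangul character) | 20 | 3.8%
(Hangul character) | 19 | 3.6%
(Hangul character) | 18 | 3.4%
(Hangul character) | 17 | 3.2%
(Hangul character) | 12 | 2.3%
Other values (139) | 270 | 51.0%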

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 435 | 82.2%
Close Punctuation | 38 | 7.2%
Open Punctuation | 38 | 7.2%
Space Separator | 8 | 1.5%
Lowercase Letter | 7 | 1.3%
Other Punctuation | 3 | 0.6%

Most frequent character per category

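Other Letter
Value | Count | Frequency (%)
(Hangul character) | 44 | 10.1%
(Hangul character) | 27 | 6.2%
(Hangul character) | 26 | 6.0%
(Hangul character) | 20 | 4.6%
(Hangul character) | 19 | 4.4%
(Hangul character) | 18 | 4.1%
(Hangul character) | 17 | 3.9%
(Hangul character) | 12 | 2.8%
(Hangul character) | 8 | 1.8%
(Hangul character) | 7 | 1.6%
Other values (127) | 237 | 54.5%

Lowercase Letter
Value | Count | Frequency (%)
m | 1 | 14.3%
r | 1 | 14.3%
u | 1 | 14.3%
c | 1 | 14.3%
o | 1 | 14.3%
t | 1 | 14.3%
b | 1 | 14.3%

Other Punctuation
Value | Count | Frequency (%)
· | 2 | 66.7%
. | 1 | 33.3%

Close Punctuation
Value | Count | Frequency (%)
) | 38 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 38 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 8 | 100.0%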

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 435 | 82.2%
Common | 87 | 16.4%
Latin | 7 | 1.3%

Most frequent character per script

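Hangul
Value | Count | Frequency (%)
(Hangul character) | 44 | 10.1%
(Hangul character) | 27 | 6.2%
(Hangul character) | 26 | 6.0%
(Hangul character) | 20 | 4.6%
(Hangul character) | 19 | 4.4%
(Hangul character) | 18 | 4.1%
(Hangul character) | 17 | 3.9%
(Hangul character) | 12 | 2.8%
(Hangul character) | 8 | 1.8%
(Hangul character) | 7 | 1.6%
Other values (127) | 237 | 54.5%

Latin
Value | Count | Frequency (%)
m | 1 | 14.3%
r | 1 | 14.3%
u | 1 | 14.3%
c | 1 | 14.3%
o | 1 | 14.3%
t | 1 | 14.3%
b | 1 | 14.3%

Common
Value | Count | Frequency (%)
) | 38 | 43.7%
( | 38 | 43.7%
(space) | 8 | 9.2%
· | 2 | 2.3%
. | 1 | 1.1%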

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 435 | 82.2%
ASCII | 92 | 17.4%
None | 2 | 0.4%

Most frequent character per block

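Hangul
Value | Count | Frequency (%)
(Hangul character) | 44 | 10.1%
(Hangul character) | 27 | 6.2%
(Hangul character) | 26 | 6.0%
(Hangul character) | 20 | 4.6%
(Hangul character) | 19 | 4.4%
(Hangul character) | 18 | 4.1%
(Hangul character) | 17 | 3.9%
(Hangul character) | 12 | 2.8%
(Hangul character) | 8 | 1.8%
(Hangul character) | 7 | 1.6%
Other values (127) | 237 | 54.5%

ASCII
Value | Count | Frequency (%)
) | 38 | 41.3%
( | 38 | 41.3%
(space) | 8 | 8.7%
m | 1 | 1.1%
r | 1 | 1.1%
u | 1 | 1.1%
c | 1 | 1.1%
o | 1 | 1.1%
t | 1 | 1.1%
b | 1 | 1.1%

None
Value | Count | Frequency (%)
· | 2 | 100.0%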

소재지(도로명) (location, road-name address)
Text

Distinct: 59
Distinct (%): 89.4%
Missing: 0
Missing (%): 0.0%
Memory size: 660.0 B

Length

Max length: 49
Median length: 41.5
Mean length: 34.636364
Min length: 23

Characters and Unicode

Total characters: 2286
Distinct characters: 120
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 53
Unique (%): 80.3%

Sample

1st row: 부산광역시 수영구 과정로 55 (망미동, 망미1동 새마을금고)
2nd row: 부산광역시 수영구 장대골로7번길 45, 1423호 (광안동, 광안유림노르웨이아침)
3rd row: 부산광역시 수영구 연수로 265-1, 1층 (망미동)
4th row: 부산광역시 수영구 수영로 662 (광안동)
5th row: 부산광역시 수영구 수영로 754, 상가동 201호 (민락동, 센텀비스타동원)

Value | Count | Frequency (%)
부산광역시 | 66 | 14.7%
수영구 | 66 | 14.7%
수영로 | 23 | 5.1%
광안동 | 21 | 4.7%
남천동 | 14 | 3.1%
수영동 | 14 | 3.1%
759 | 10 | 2.2%
3층 | 10 | 2.2%
알파오피스텔 | 9 | 2.0%
4층 | 9 | 2.0%
Other values (133) | 208 | 46.2%

Most occurring characters

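Value | Count | Frequency (%)
(space) | 384 | 16.8%
(Hangul character) | 118 | 5.2%
(Hangul character) | 112 | 4.9%
(Hangul character) | 109 | 4.8%
(Hangul character) | 80 | 3.5%
1 | 73 | 3.2%
, | 71 | 3.1%
(Hangul character) | 70 | 3.1%
(Hangul character) | 68 | 3.0%
(Hangul character) | 68 | 3.0%
Other values (110) | 1133 | 49.6%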

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1305 | 57.1%
Space Separator | 384 | 16.8%
Decimal Number | 383 | 16.8%
Other Punctuation | 71 | 3.1%
Open Punctuation | 66 | 2.9%
Close Punctuation | 66 | 2.9%
Dash Punctuation | 7 | 0.3%
Uppercase Letter | 3 | 0.1%
Lowercase Letter | 1 | < 0.1%

Most frequent character per category

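Other Letter
Value | Count | Frequency (%)
(Hangul character) | 118 | 9.0%
(Hangul character) | 112 | 8.6%
(Hangul character) | 109 | 8.4%
(Hangul character) | 80 | 6.1%
(Hangul character) | 70 | 5.4%
(Hangul character) | 68 | 5.2%
(Hangul character) | 68 | 5.2%
(Hangul character) | 67 | 5.1%
(Hangul character) | 67 | 5.1%
(Hangul character) | 66 | 5.1%
Other values (91) | 480 | 36.8%

Decimal Number
Value | Count | Frequency (%)
1 | 73 | 19.1%
2 | 47 | 12.3%
3 | 45 | 11.7%
5 | 41 | 10.7%
4 | 37 | 9.7%
6 | 37 | 9.7%
0 | 36 | 9.4%
7 | 30 | 7.8%
9 | 24 | 6.3%
8 | 13 | 3.4%

Uppercase Letter
Value | Count | Frequency (%)
B | 1 | 33.3%
C | 1 | 33.3%
M | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 384 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 71 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 66 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 66 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 7 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 1 | 100.0%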

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1305 | 57.1%
Common | 977 | 42.7%
Latin | 4 | 0.2%

Most frequent character per script

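Hangul
Value | Count | Frequency (%)
(Hangul character) | 118 | 9.0%
(Hangul character) | 112 | 8.6%
(Hangul character) | 109 | 8.4%
(Hangul character) | 80 | 6.1%
(Hangul character) | 70 | 5.4%
(Hangul character) | 68 | 5.2%
(Hangul character) | 68 | 5.2%
(Hangul character) | 67 | 5.1%
(Hangul character) | 67 | 5.1%
(Hangul character) | 66 | 5.1%
Other values (91) | 480 | 36.8%

Common
Value | Count | Frequency (%)
(space) | 384 | 39.3%
1 | 73 | 7.5%
, | 71 | 7.3%
( | 66 | 6.8%
) | 66 | 6.8%
2 | 47 | 4.8%
3 | 45 | 4.6%
5 | 41 | 4.2%
4 | 37 | 3.8%
6 | 37 | 3.8%
Other values (5) | 110 | 11.3%

Latin
Value | Count | Frequency (%)
B | 1 | 25.0%
C | 1 | 25.0%
M | 1 | 25.0%
e | 1 | 25.0%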

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1305 | 57.1%
ASCII | 981 | 42.9%

Most frequent character per block

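ASCII
Value | Count | Frequency (%)
(space) | 384 | 39.1%
1 | 73 | 7.4%
, | 71 | 7.2%
( | 66 | 6.7%
) | 66 | 6.7%
2 | 47 | 4.8%
3 | 45 | 4.6%
5 | 41 | 4.2%
4 | 37 | 3.8%
6 | 37 | 3.8%
Other values (9) | 114 | 11.6%

Hangul
Value | Count | Frequency (%)
(Hangul character) | 118 | 9.0%
(Hangul character) | 112 | 8.6%
(Hangul character) | 109 | 8.4%
(Hangul character) | 80 | 6.1%
(Hangul character) | 70 | 5.4%
(Hangul character) | 68 | 5.2%
(Hangul character) | 68 | 5.2%
(Hangul character) | 67 | 5.1%
(Hangul character) | 67 | 5.1%
(Hangul character) | 66 | 5.1%
Other values (91) | 480 | 36.8%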

전화번호 (phone number)
Text

MISSING

Distinct: 49
Distinct (%): 89.1%
Missing: 11
Missing (%): 16.7%
Memory size: 660.0 B

Length

Max length: 14
Median length: 12
Mean length: 11.927273
Min length: 9

Characters and Unicode

Total characters: 656
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 43
Unique (%): 78.2%
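
The percentages for this variable do not all share the same denominator. The arithmetic below suggests that the missing rate is taken over all 66 rows, while the distinct rate, unique rate, and mean length are consistent with the 55 non-missing values; that reading is inferred from the numbers, not stated by the report.

```python
# Figures copied from the 전화번호 statistics in this report.
n_rows = 66
n_missing = 11
n_distinct = 49
n_unique = 43
total_characters = 656

n_present = n_rows - n_missing                 # rows that carry a phone number

missing_pct = 100 * n_missing / n_rows         # denominator: all rows
distinct_pct = 100 * n_distinct / n_present    # denominator: non-missing rows
unique_pct = 100 * n_unique / n_present
mean_length = total_characters / n_present

print(n_present, round(missing_pct, 1), round(distinct_pct, 1),
      round(unique_pct, 1), round(mean_length, 6))
```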

Sample

1st row: 051-529-9898
2nd row: 051-919-2231
3rd row: 051-632-9977
4th row: 051-861-2991
5th row: 051-304-3991

Value | Count | Frequency (%)
051-632-9977 | 2 | 3.6%
051-622-7253 | 2 | 3.6%
051-502-6888 | 2 | 3.6%
051-756-4117 | 2 | 3.6%
051-529-9898 | 2 | 3.6%
070-4320-6549 | 2 | 3.6%
051-441-1955 | 1 | 1.8%
051-624-2972 | 1 | 1.8%
1800-8757 | 1 | 1.8%
051-755-0125 | 1 | 1.8%
Other values (39) | 39 | 70.9%

Most occurring characters

Value | Count | Frequency (%)
- | 107 | 16.3%
0 | 96 | 14.6%
5 | 94 | 14.3%
1 | 82 | 12.5%
7 | 54 | 8.2%
2 | 49 | 7.5%
6 | 48 | 7.3%
8 | 41 | 6.2%
9 | 34 | 5.2%
4 | 28 | 4.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 549 | 83.7%
Dash Punctuation | 107 | 16.3%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 96 | 17.5%
5 | 94 | 17.1%
1 | 82 | 14.9%
7 | 54 | 9.8%
2 | 49 | 8.9%
6 | 48 | 8.7%
8 | 41 | 7.5%
9 | 34 | 6.2%
4 | 28 | 5.1%
3 | 23 | 4.2%

Dash Punctuation
Value | Count | Frequency (%)
- | 107 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 656 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 107 | 16.3%
0 | 96 | 14.6%
5 | 94 | 14.3%
1 | 82 | 12.5%
7 | 54 | 8.2%
2 | 49 | 7.5%
6 | 48 | 7.3%
8 | 41 | 6.2%
9 | 34 | 5.2%
4 | 28 | 4.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 656 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 107 | 16.3%
0 | 96 | 14.6%
5 | 94 | 14.3%
1 | 82 | 12.5%
7 | 54 | 8.2%
2 | 49 | 7.5%
6 | 48 | 7.3%
8 | 41 | 6.2%
9 | 34 | 5.2%
4 | 28 | 4.3%

Interactions


Correlations

Variable | 연번 | 업종 | 업체명 | 소재지(도로명) | 전화번호
연번 | 1.000 | 0.944 | 0.789 | 0.787 | 0.041
업종 | 0.944 | 1.000 | 0.000 | 0.000 | 0.000
업체명 | 0.789 | 0.000 | 1.000 | 1.000 | 1.000
소재지(도로명) | 0.787 | 0.000 | 1.000 | 1.000 | 0.999
전화번호 | 0.041 | 0.000 | 1.000 | 0.999 | 1.000

Variable | 연번 | 업종
연번 | 1.000 | 0.865
업종 | 0.865 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 연번 | 업종 | 업체명 | 소재지(도로명) | 전화번호
0 | 1 | 국내여행업 | 강남고속관광투어(주) | 부산광역시 수영구 과정로 55 (망미동, 망미1동 새마을금고) | 051-529-9898
1 | 2 | 국내여행업 | 웰빙투어 | 부산광역시 수영구 장대골로7번길 45, 1423호 (광안동, 광안유림노르웨이아침) | 051-919-2231
2 | 3 | 국내여행업 | (주)에버그린투어 | 부산광역시 수영구 연수로 265-1, 1층 (망미동) | 051-632-9977
3 | 4 | 국내여행업 | 동해여행사 | 부산광역시 수영구 수영로 662 (광안동) | 051-861-2991
4 | 5 | 국내여행업 | 다모여투어 | 부산광역시 수영구 수영로 754, 상가동 201호 (민락동, 센텀비스타동원) | 051-304-3991
5 | 6 | 국내여행업 | 하하투어 | 부산광역시 수영구 수영로 411, 글로리메디컬센터 6층 641호 (남천동) | 051-622-7253
6 | 7 | 국내여행업 | (주)부산자유여행사 | 부산광역시 수영구 광남로223번길 31, 103동 403호 (민락동, 광안현대하이페리온) | 051-502-6888
7 | 8 | 국내여행업 | 모닝투어 | 부산광역시 수영구 수영로 421-1, 로얄오피스텔 907호 (남천동) | 051-626-8066
8 | 9 | 국내여행업 | 모아디자인 | 부산광역시 수영구 무학로 46, 금강빌딩 4층 (광안동) | <NA>
9 | 10 | 국내여행업 | 주식회사 페텔 | 부산광역시 수영구 수영로 759, 알파오피스텔 지하1층 503호 (수영동) | 070-8064-5447

Last rows

Index | 연번 | 업종 | 업체명 | 소재지(도로명) | 전화번호
56 | 57 | 종합여행업 | 여왕의 놀이터 | 부산광역시 수영구 연수로357번길 35 (수영동) | <NA>
57 | 58 | 종합여행업 | (주)부산의아름다운길 | 부산광역시 수영구 광남로 37, 하나은행 4층 (남천동) | 051-898-4573
58 | 59 | 종합여행업 | (주)뚱스커뮤니티 | 부산광역시 수영구 수영로528번길 22, 601호 (광안동, 상상가) | 051-928-7788
59 | 60 | 종합여행업 | (주)아주항공여행사 | 부산광역시 수영구 광안해변로 100, 212동 1011호 (남천동, 비치아파트) | 051-809-7771
60 | 61 | 종합여행업 | (주)금손투어 | 부산광역시 수영구 수영로 668, 화목오피스텔 1009호 (광안동) | 051-756-1009
61 | 62 | 종합여행업 | (주)노바투어 | 부산광역시 수영구 수영로 759, 알파오피스텔 지하1층 1608호 (수영동) | <NA>
62 | 63 | 종합여행업 | 주식회사 투게더 | 부산광역시 수영구 광안해변로 423-1, 3층 (민락동) | 051-711-2228
63 | 64 | 종합여행업 | 지엠피 서비스 시스템 | 부산광역시 수영구 수영로 759, 알파오피스텔 3층 307호 (수영동) | 051-757-0088
64 | 65 | 종합여행업 | 투어뱅크 주식회사 | 부산광역시 수영구 수영로 668, 화목오피스텔 11층 1101호 (광안동) | 051-753-4809
65 | 66 | 종합여행업 | 와이썬코리아(주) | 부산광역시 수영구 수영로575번길 33, 4층 (광안동) | <NA>