Overview

Dataset statistics

Number of variables: 5
Number of observations: 28
Missing cells: 5
Missing cells (%): 3.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.2 KiB
Average record size in memory: 45.7 B
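
The percentage and per-record figures above follow from the raw counts. A minimal sketch of that arithmetic, assuming missing cells are measured against every cell (variables × observations) and that total memory is roughly the average record size times the row count; the variable names are illustrative only:

```python
# Counts taken from the dataset statistics above.
n_variables = 5
n_observations = 28
missing_cells = 5

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
print(f"Missing cells: {missing_pct:.1f}%")

# Assumption: total memory is about average record size times row count.
avg_record_bytes = 45.7
total_kib = avg_record_bytes * n_observations / 1024
print(f"Approximate total size: {total_kib:.1f} KiB")
```

Under these assumptions both results round to the reported 3.6% and 1.2 KiB.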

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

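Description: 부산광역시북구여행업등록현황_20220922 (Busan Metropolitan City, Buk-gu travel business registration status, 2022-09-22)
Author: 부산광역시 북구 (Buk-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3069202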

Alerts

High correlation: 순번 is highly overall correlated with 업종
High correlation: 업종 is highly overall correlated with 순번
Missing: 전화번호 has 5 (17.9%) missing values
Unique: 순번 has unique values
Unique: 상호 has unique values

Reproduction

Analysis started: 2023-12-10 16:13:40.586631
Analysis finished: 2023-12-10 16:13:41.227584
Duration: 0.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
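
The reported duration can be checked against the two timestamps. A small sketch using only the standard library; the format string is an assumption about how the timestamps are laid out:

```python
from datetime import datetime

# Assumed layout of the timestamps shown in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-10 16:13:40.586631", FMT)
finished = datetime.strptime("2023-12-10 16:13:41.227584", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```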

Variables

순번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 28
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14.5
Minimum: 1
Maximum: 28
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 384.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.35
Q1: 7.75
Median: 14.5
Q3: 21.25
95-th percentile: 26.65
Maximum: 28
Range: 27
Interquartile range (IQR): 13.5

Descriptive statistics

Standard deviation: 8.2259751
Coefficient of variation (CV): 0.56730863
Kurtosis: -1.2
Mean: 14.5
Median Absolute Deviation (MAD): 7
Skewness: 0
Sum: 406
Variance: 67.666667
Monotonicity: Strictly increasing
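
Because 순번 is simply the integers 1 to 28, the quantile and descriptive statistics above can be reproduced by hand. The sketch below uses Python's standard `statistics` module. It assumes the sample (n − 1) variance and linear interpolation between data points for quantiles; those choices reproduce the reported figures but are not taken from the profiler's source.

```python
import statistics

values = list(range(1, 29))  # 순번 runs from 1 to 28 with no gaps

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# method="inclusive" interpolates linearly between the sorted data points.
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]
iqr = q3 - q1

print(total, mean, round(variance, 6), round(std_dev, 7), round(cv, 8))
print(p5, q1, median, q3, p95, iqr, mad)
```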
Histogram with fixed size bins (bins=28)

Value | Count | Frequency (%)
1 | 1 | 3.6%
16 | 1 | 3.6%
28 | 1 | 3.6%
27 | 1 | 3.6%
26 | 1 | 3.6%
25 | 1 | 3.6%
24 | 1 | 3.6%
23 | 1 | 3.6%
22 | 1 | 3.6%
21 | 1 | 3.6%
Other values (18) | 18 | 64.3%

Value | Count | Frequency (%)
1 | 1 | 3.6%
2 | 1 | 3.6%
3 | 1 | 3.6%
4 | 1 | 3.6%
5 | 1 | 3.6%
6 | 1 | 3.6%
7 | 1 | 3.6%
8 | 1 | 3.6%
9 | 1 | 3.6%
10 | 1 | 3.6%

Value | Count | Frequency (%)
28 | 1 | 3.6%
27 | 1 | 3.6%
26 | 1 | 3.6%
25 | 1 | 3.6%
24 | 1 | 3.6%
23 | 1 | 3.6%
22 | 1 | 3.6%
21 | 1 | 3.6%
20 | 1 | 3.6%
19 | 1 | 3.6%

업종 (business type)
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 14.3%
Missing: 0
Missing (%): 0.0%
Memory size: 356.0 B

Length

Max length: 8
Median length: 6
Mean length: 6.4285714
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 종합여행업
2nd row: 종합여행업
3rd row: 국내여행업
4th row: 국내여행업
5th row: 국내여행업

Common Values

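Value | Count | Frequency (%)
국내외여행업 (domestic and overseas travel business) | 13 | 46.4%
국내국내외 겸업 (combined domestic and domestic/overseas business) | 9 | 32.1%
국내여행업 (domestic travel business) | 4 | 14.3%
종합여행업 (general travel business) | 2 | 7.1%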
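
The frequency percentages and the mean length reported for 업종 can be recomputed from the four category counts alone. A minimal sketch, assuming each length is the number of characters in the category label, spaces included:

```python
# Counts from the Common Values table for 업종.
counts = {
    "국내외여행업": 13,
    "국내국내외 겸업": 9,
    "국내여행업": 4,
    "종합여행업": 2,
}
total = sum(counts.values())  # 28 observations

# Share of each category among all observations.
shares = {value: 100 * count / total for value, count in counts.items()}
for value, share in shares.items():
    print(f"{value}: {counts[value]} ({share:.1f}%)")

# Mean length weights each label's character count by its frequency.
mean_length = sum(len(value) * count for value, count in counts.items()) / total
print(f"Mean length: {mean_length:.7f}")
```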

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
국내외여행업 | 13 | 35.1%
국내국내외 | 9 | 24.3%
겸업 | 9 | 24.3%
국내여행업 | 4 | 10.8%
종합여행업 | 2 | 5.4%

상호 (business name)
Text

UNIQUE 

Distinct: 28
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 356.0 B

Length

Max length: 11
Median length: 9
Mean length: 7.3928571
Min length: 2

Characters and Unicode

Total characters: 207
Distinct characters: 76
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
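
As an illustration of how such character properties can be read, the sketch below counts Unicode general categories for three 상호 values that appear in this report, using Python's standard `unicodedata` module. The mapping from category codes to readable names covers only the codes that occur here, and the result describes these three names rather than the whole column.

```python
import unicodedata
from collections import Counter

# Three 상호 values from this report; "\u3231" is the ㈜ symbol.
names = ["플라이 투어", "주식회사 행복드림", "\u3231핑크투어(북구지점)"]

# Readable names for the general-category codes that occur in these values.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "So": "Other Symbol",
}

category_counts = Counter()
for name in names:
    # Normalize so each Hangul syllable is a single code point.
    for ch in unicodedata.normalize("NFC", name):
        code = unicodedata.category(ch)
        category_counts[CATEGORY_NAMES.get(code, code)] += 1

for category, count in category_counts.most_common():
    print(category, count)
```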

Unique

Unique: 28
Unique (%): 100.0%

Sample

1st row: 플라이 투어
2nd row: 주식회사 행복드림
3rd row: 뉴부산여행사
4th row: 운모아 여행사
5th row: 주식회사 렌고

Value | Count | Frequency (%)
여행사 | 3 | 8.1%
주식회사 | 2 | 5.4%
플라이 | 1 | 2.7%
풍경 | 1 | 2.7%
이지스트 | 1 | 2.7%
트래블 | 1 | 2.7%
도윤여행사 | 1 | 2.7%
제이케이투어 | 1 | 2.7%
주)화인항공여행사 | 1 | 2.7%
주)장투어 | 1 | 2.7%
Other values (24) | 24 | 64.9%

Most occurring characters

ValueCountFrequency (%)
14
 
6.8%
13
 
6.3%
12
 
5.8%
12
 
5.8%
( 12
 
5.8%
12
 
5.8%
) 12
 
5.8%
11
 
5.3%
9
 
4.3%
8
 
3.9%
Other values (66) 92
44.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 171 | 82.6%
Open Punctuation | 12 | 5.8%
Close Punctuation | 12 | 5.8%
Space Separator | 9 | 4.3%
Other Symbol | 3 | 1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
8.2%
13
 
7.6%
12
 
7.0%
12
 
7.0%
12
 
7.0%
11
 
6.4%
8
 
4.7%
3
 
1.8%
3
 
1.8%
2
 
1.2%
Other values (62) 81
47.4%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 174 | 84.1%
Common | 33 | 15.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
8.0%
13
 
7.5%
12
 
6.9%
12
 
6.9%
12
 
6.9%
11
 
6.3%
8
 
4.6%
3
 
1.7%
3
 
1.7%
3
 
1.7%
Other values (63) 83
47.7%
Common
ValueCountFrequency (%)
( 12
36.4%
) 12
36.4%
9
27.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 171 | 82.6%
ASCII | 33 | 15.9%
None | 3 | 1.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
14
 
8.2%
13
 
7.6%
12
 
7.0%
12
 
7.0%
12
 
7.0%
11
 
6.4%
8
 
4.7%
3
 
1.8%
3
 
1.8%
2
 
1.2%
Other values (62) 81
47.4%
ASCII
ValueCountFrequency (%)
( 12
36.4%
) 12
36.4%
9
27.3%
None
ValueCountFrequency (%)
3
100.0%

소재지(도로명) (location, road-name address)
Text

Distinct: 27
Distinct (%): 96.4%
Missing: 0
Missing (%): 0.0%
Memory size: 356.0 B

Length

Max length: 49
Median length: 38.5
Mean length: 34
Min length: 23

Characters and Unicode

Total characters: 952
Distinct characters: 109
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26
Unique (%): 92.9%

Sample

1st row: 부산광역시 북구 화명신도시로 115, 506호(화명동, 성문타워)
2nd row: 부산광역시 북구 화명신도시로 240, 202호(금곡동, 화명리버빌2차)
3rd row: 부산광역시 북구 만덕대로65번길 3, 1403호 (덕천동, 메디앙오피스텔)
4th row: 부산광역시 북구 모분재로 24, 2층 (구포동)
5th row: 부산광역시 북구 금곡대로303번길 80, 코아프라자 701호 (화명동)

Value | Count | Frequency (%)
부산광역시 | 28 | 15.3%
북구 | 28 | 15.3%
화명동 | 9 | 4.9%
구포동 | 6 | 3.3%
금곡대로 | 5 | 2.7%
2층 | 5 | 2.7%
덕천동 | 4 | 2.2%
만덕동 | 4 | 2.2%
백양대로 | 4 | 2.2%
만덕대로 | 3 | 1.6%
Other values (76) | 87 | 47.5%

Most occurring characters

ValueCountFrequency (%)
155
 
16.3%
39
 
4.1%
35
 
3.7%
, 34
 
3.6%
31
 
3.3%
30
 
3.2%
29
 
3.0%
2 29
 
3.0%
29
 
3.0%
28
 
2.9%
Other values (99) 513
53.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 551 | 57.9%
Space Separator | 155 | 16.3%
Decimal Number | 155 | 16.3%
Other Punctuation | 34 | 3.6%
Close Punctuation | 28 | 2.9%
Open Punctuation | 28 | 2.9%
Dash Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
7.1%
35
 
6.4%
31
 
5.6%
30
 
5.4%
29
 
5.3%
29
 
5.3%
28
 
5.1%
28
 
5.1%
28
 
5.1%
19
 
3.4%
Other values (84) 255
46.3%
Decimal Number
ValueCountFrequency (%)
2 29
18.7%
1 24
15.5%
0 24
15.5%
3 17
11.0%
7 14
9.0%
6 11
 
7.1%
8 11
 
7.1%
5 9
 
5.8%
9 8
 
5.2%
4 8
 
5.2%
Space Separator
ValueCountFrequency (%)
155
100.0%
Other Punctuation
ValueCountFrequency (%)
, 34
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 551 | 57.9%
Common | 401 | 42.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
7.1%
35
 
6.4%
31
 
5.6%
30
 
5.4%
29
 
5.3%
29
 
5.3%
28
 
5.1%
28
 
5.1%
28
 
5.1%
19
 
3.4%
Other values (84) 255
46.3%
Common
ValueCountFrequency (%)
155
38.7%
, 34
 
8.5%
2 29
 
7.2%
) 28
 
7.0%
( 28
 
7.0%
1 24
 
6.0%
0 24
 
6.0%
3 17
 
4.2%
7 14
 
3.5%
6 11
 
2.7%
Other values (5) 37
 
9.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 551 | 57.9%
ASCII | 401 | 42.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
155
38.7%
, 34
 
8.5%
2 29
 
7.2%
) 28
 
7.0%
( 28
 
7.0%
1 24
 
6.0%
0 24
 
6.0%
3 17
 
4.2%
7 14
 
3.5%
6 11
 
2.7%
Other values (5) 37
 
9.2%
Hangul
ValueCountFrequency (%)
39
 
7.1%
35
 
6.4%
31
 
5.6%
30
 
5.4%
29
 
5.3%
29
 
5.3%
28
 
5.1%
28
 
5.1%
28
 
5.1%
19
 
3.4%
Other values (84) 255
46.3%

전화번호 (phone number)
Text

MISSING 

Distinct: 19
Distinct (%): 82.6%
Missing: 5
Missing (%): 17.9%
Memory size: 356.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 276
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 15
Unique (%): 65.2%
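
The percentages for 전화번호 use two different denominators. The figures in this report are consistent with the missing share being taken over all 28 observations and the distinct and unique shares over the 23 non-missing values. A short sketch of that reading, which is an inference from the numbers rather than a statement about the profiler's internals:

```python
n_observations = 28
n_missing = 5
n_present = n_observations - n_missing  # 23 non-missing phone numbers

distinct = 19  # different phone numbers
unique = 15    # phone numbers that occur exactly once

missing_pct = 100 * n_missing / n_observations
distinct_pct = 100 * distinct / n_present
unique_pct = 100 * unique / n_present

print(f"Missing: {missing_pct:.1f}%")
print(f"Distinct: {distinct_pct:.1f}%")
print(f"Unique: {unique_pct:.1f}%")

# Every number is 12 characters long, so the character total follows directly.
total_characters = n_present * 12
print(f"Total characters: {total_characters}")
```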

Sample

1st row: 051-363-6505
2nd row: 051-861-3623
3rd row: 051-581-3002
4th row: 051-938-5070
5th row: 051-335-0626

Value | Count | Frequency (%)
051-701-8087 | 2 | 8.7%
051-333-3652 | 2 | 8.7%
051-938-5070 | 2 | 8.7%
051-342-1711 | 2 | 8.7%
051-363-1700 | 1 | 4.3%
051-363-6505 | 1 | 4.3%
051-804-1313 | 1 | 4.3%
051-361-0069 | 1 | 4.3%
051-635-1234 | 1 | 4.3%
051-331-3488 | 1 | 4.3%
Other values (9) | 9 | 39.1%

Most occurring characters

Value | Count | Frequency (%)
- | 46 | 16.7%
0 | 44 | 15.9%
1 | 42 | 15.2%
5 | 38 | 13.8%
3 | 38 | 13.8%
8 | 15 | 5.4%
6 | 15 | 5.4%
2 | 12 | 4.3%
7 | 11 | 4.0%
4 | 10 | 3.6%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 230 | 83.3%
Dash Punctuation | 46 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 44 | 19.1%
1 | 42 | 18.3%
5 | 38 | 16.5%
3 | 38 | 16.5%
8 | 15 | 6.5%
6 | 15 | 6.5%
2 | 12 | 5.2%
7 | 11 | 4.8%
4 | 10 | 4.3%
9 | 5 | 2.2%

Dash Punctuation
Value | Count | Frequency (%)
- | 46 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 276 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 46 | 16.7%
0 | 44 | 15.9%
1 | 42 | 15.2%
5 | 38 | 13.8%
3 | 38 | 13.8%
8 | 15 | 5.4%
6 | 15 | 5.4%
2 | 12 | 4.3%
7 | 11 | 4.0%
4 | 10 | 3.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 276 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 46 | 16.7%
0 | 44 | 15.9%
1 | 42 | 15.2%
5 | 38 | 13.8%
3 | 38 | 13.8%
8 | 15 | 5.4%
6 | 15 | 5.4%
2 | 12 | 4.3%
7 | 11 | 4.0%
4 | 10 | 3.6%

Interactions


Correlations

 | 순번 | 업종 | 상호 | 소재지(도로명) | 전화번호
순번 | 1.000 | 0.946 | 1.000 | 0.932 | 0.566
업종 | 0.946 | 1.000 | 1.000 | 0.737 | 0.884
상호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소재지(도로명) | 0.932 | 0.737 | 1.000 | 1.000 | 0.964
전화번호 | 0.566 | 0.884 | 1.000 | 0.964 | 1.000

 | 순번 | 업종
순번 | 1.000 | 0.750
업종 | 0.750 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

 | 순번 | 업종 | 상호 | 소재지(도로명) | 전화번호
0 | 1 | 종합여행업 | 플라이 투어 | 부산광역시 북구 화명신도시로 115, 506호(화명동, 성문타워) | 051-363-6505
1 | 2 | 종합여행업 | 주식회사 행복드림 | 부산광역시 북구 화명신도시로 240, 202호(금곡동, 화명리버빌2차) | <NA>
2 | 3 | 국내여행업 | 뉴부산여행사 | 부산광역시 북구 만덕대로65번길 3, 1403호 (덕천동, 메디앙오피스텔) | <NA>
3 | 4 | 국내여행업 | 운모아 여행사 | 부산광역시 북구 모분재로 24, 2층 (구포동) | 051-861-3623
4 | 5 | 국내여행업 | 주식회사 렌고 | 부산광역시 북구 금곡대로303번길 80, 코아프라자 701호 (화명동) | 051-581-3002
5 | 6 | 국내여행업 | ㈜핑크투어(북구지점) | 부산광역시 북구 낙동대로1570번길 32, 4층 (구포동) | 051-938-5070
6 | 7 | 국내외여행업 | 가보자여행사 | 부산광역시 북구 백양대로 1198 (구포동) | 051-335-0626
7 | 8 | 국내외여행업 | 하나여행타운(주) | 부산광역시 북구 금곡대로 469 (금곡동) | 051-557-0041
8 | 9 | 국내외여행업 | (주)하이투어 | 부산광역시 북구 금곡대로 287 (화명동) | 051-361-0094
9 | 10 | 국내외여행업 | (주)토마토여행 | 부산광역시 북구 금곡대로 175, 208동 202호 (화명동, 화명2차동원로얄듀크비스타) | 051-468-3322

 | 순번 | 업종 | 상호 | 소재지(도로명) | 전화번호
18 | 19 | 국내외여행업 | 제이케이투어 | 부산광역시 북구 백양대로 1198, 3층(구포동) | 051-342-1711
19 | 20 | 국내국내외 겸업 | (주)화인항공여행사 | 부산광역시 북구 백양대로 1204, 5층 (덕천동, 동강빌딩) | 051-342-1711
20 | 21 | 국내국내외 겸업 | (주)장투어 | 부산광역시 북구 금곡대로 287, 화명동 (화명동, 삼한골든뷰 201) | 051-701-8087
21 | 22 | 국내국내외 겸업 | 풍경 | 부산광역시 북구 백양대로 1069 (구포동) | 051-331-3488
22 | 23 | 국내국내외 겸업 | 부산화명새마을금고 | 부산광역시 북구 와석장터로 8, 화명동새마을금고 (화명동) | 051-635-1234
23 | 24 | 국내국내외 겸업 | (주)우리여행사 | 부산광역시 북구 화명대로 47, 2층 (화명동, 롯데마트화명점) | 051-361-0069
24 | 25 | 국내국내외 겸업 | 해피투어 (주)씨아이 | 부산광역시 북구 금곡대로285번길 19, 306호 (화명동, 리버사이드빌딩) | 051-634-5251
25 | 26 | 국내국내외 겸업 | (주)해피투어 | 부산광역시 북구 덕천로 306, 가동 2층 201호 (만덕동) | 051-701-8087
26 | 27 | 국내국내외 겸업 | 우리들여행사 | 부산광역시 북구 시랑로170번길 30, 2층 (구포동) | <NA>
27 | 28 | 국내국내외 겸업 | ㈜미소투어 여행사 | 부산광역시 북구 만덕대로 21, 10층(덕천동) | <NA>