Overview

Dataset statistics

Number of variables4
Number of observations27
Missing cells13
Missing cells (%)12.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory996.0 B
Average record size in memory36.9 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시_기장군_여행업등록현황_20230707
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3072006

Alerts

연락처 has 13 (48.1%) missing valuesMissing

Reproduction

Analysis started2023-12-10 17:31:41.981169
Analysis finished2023-12-10 17:31:42.801785
Duration0.82 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Memory size348.0 B
국내외여행업
11 
종합여행업
국내여행업

Length

Max length6
Median length5
Mean length5.4074074
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종합여행업
2nd row종합여행업
3rd row종합여행업
4th row종합여행업
5th row종합여행업

Common Values

ValueCountFrequency (%)
국내외여행업 11
40.7%
종합여행업 8
29.6%
국내여행업 8
29.6%

Length

2023-12-11T02:31:43.016880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:31:43.297440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 11
40.7%
종합여행업 8
29.6%
국내여행업 8
29.6%

상호
Text

Distinct22
Distinct (%)81.5%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-11T02:31:43.722608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length12
Mean length8.962963
Min length2

Characters and Unicode

Total characters242
Distinct characters82
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)63.0%

Sample

1st row대화여행사
2nd row하늘투어
3rd row여행사 우
4th row다함 여행사
5th rowGo East Travel
ValueCountFrequency (%)
주식회사 8
18.2%
여행사 3
 
6.8%
여행이필요할때 2
 
4.5%
주)세명항공여행사 2
 
4.5%
유민여행사 2
 
4.5%
여행나무 2
 
4.5%
투어파크 2
 
4.5%
생각나는남자(여필남 2
 
4.5%
피케이코퍼레이션 1
 
2.3%
투어밴드 1
 
2.3%
Other values (19) 19
43.2%
2023-12-11T02:31:44.482552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
7.0%
17
 
7.0%
15
 
6.2%
13
 
5.4%
13
 
5.4%
9
 
3.7%
9
 
3.7%
8
 
3.3%
( 8
 
3.3%
) 8
 
3.3%
Other values (72) 125
51.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 190
78.5%
Space Separator 17
 
7.0%
Uppercase Letter 10
 
4.1%
Lowercase Letter 9
 
3.7%
Open Punctuation 8
 
3.3%
Close Punctuation 8
 
3.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
8.9%
15
 
7.9%
13
 
6.8%
13
 
6.8%
9
 
4.7%
9
 
4.7%
8
 
4.2%
8
 
4.2%
8
 
4.2%
5
 
2.6%
Other values (54) 85
44.7%
Lowercase Letter
ValueCountFrequency (%)
a 2
22.2%
t 1
11.1%
o 1
11.1%
s 1
11.1%
r 1
11.1%
v 1
11.1%
e 1
11.1%
l 1
11.1%
Uppercase Letter
ValueCountFrequency (%)
T 3
30.0%
O 2
20.0%
G 1
 
10.0%
E 1
 
10.0%
M 1
 
10.0%
U 1
 
10.0%
R 1
 
10.0%
Space Separator
ValueCountFrequency (%)
17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 190
78.5%
Common 33
 
13.6%
Latin 19
 
7.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
8.9%
15
 
7.9%
13
 
6.8%
13
 
6.8%
9
 
4.7%
9
 
4.7%
8
 
4.2%
8
 
4.2%
8
 
4.2%
5
 
2.6%
Other values (54) 85
44.7%
Latin
ValueCountFrequency (%)
T 3
15.8%
O 2
 
10.5%
a 2
 
10.5%
t 1
 
5.3%
G 1
 
5.3%
o 1
 
5.3%
E 1
 
5.3%
s 1
 
5.3%
r 1
 
5.3%
v 1
 
5.3%
Other values (5) 5
26.3%
Common
ValueCountFrequency (%)
17
51.5%
( 8
24.2%
) 8
24.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 190
78.5%
ASCII 52
 
21.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17
32.7%
( 8
15.4%
) 8
15.4%
T 3
 
5.8%
O 2
 
3.8%
a 2
 
3.8%
t 1
 
1.9%
G 1
 
1.9%
o 1
 
1.9%
E 1
 
1.9%
Other values (8) 8
15.4%
Hangul
ValueCountFrequency (%)
17
 
8.9%
15
 
7.9%
13
 
6.8%
13
 
6.8%
9
 
4.7%
9
 
4.7%
8
 
4.2%
8
 
4.2%
8
 
4.2%
5
 
2.6%
Other values (54) 85
44.7%
Distinct23
Distinct (%)85.2%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-11T02:31:45.017241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length41
Mean length31.777778
Min length22

Characters and Unicode

Total characters858
Distinct characters83
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)70.4%

Sample

1st row부산광역시 기장군 기장읍 기장대로 543-2, 2층
2nd row부산광역시 기장군 기장읍 차성로 461, 108동 104호 (이진캐스빌블루 1차아파트)
3rd row부산광역시 기장군 기장읍 기장해안로 136, 103호
4th row부산광역시 기장군 기장읍 차성로190번길 33, 1층
5th row부산광역시 기장군 정관읍 정관로 565, 진우빌딩 4층 404호
ValueCountFrequency (%)
부산광역시 27
 
14.8%
기장군 27
 
14.8%
기장읍 15
 
8.2%
정관읍 10
 
5.5%
2층 4
 
2.2%
정관로 4
 
2.2%
기장대로 4
 
2.2%
1층 3
 
1.6%
504호 3
 
1.6%
3층 3
 
1.6%
Other values (64) 82
45.1%
2023-12-11T02:31:45.958776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
155
 
18.1%
50
 
5.8%
48
 
5.6%
29
 
3.4%
28
 
3.3%
28
 
3.3%
27
 
3.1%
27
 
3.1%
27
 
3.1%
27
 
3.1%
Other values (73) 412
48.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 508
59.2%
Decimal Number 157
 
18.3%
Space Separator 155
 
18.1%
Other Punctuation 25
 
2.9%
Close Punctuation 5
 
0.6%
Open Punctuation 5
 
0.6%
Dash Punctuation 2
 
0.2%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
9.8%
48
 
9.4%
29
 
5.7%
28
 
5.5%
28
 
5.5%
27
 
5.3%
27
 
5.3%
27
 
5.3%
27
 
5.3%
27
 
5.3%
Other values (57) 190
37.4%
Decimal Number
ValueCountFrequency (%)
1 26
16.6%
3 25
15.9%
5 24
15.3%
0 20
12.7%
4 18
11.5%
2 15
9.6%
6 10
 
6.4%
9 8
 
5.1%
8 7
 
4.5%
7 4
 
2.5%
Space Separator
ValueCountFrequency (%)
155
100.0%
Other Punctuation
ValueCountFrequency (%)
, 25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 508
59.2%
Common 349
40.7%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
9.8%
48
 
9.4%
29
 
5.7%
28
 
5.5%
28
 
5.5%
27
 
5.3%
27
 
5.3%
27
 
5.3%
27
 
5.3%
27
 
5.3%
Other values (57) 190
37.4%
Common
ValueCountFrequency (%)
155
44.4%
1 26
 
7.4%
3 25
 
7.2%
, 25
 
7.2%
5 24
 
6.9%
0 20
 
5.7%
4 18
 
5.2%
2 15
 
4.3%
6 10
 
2.9%
9 8
 
2.3%
Other values (5) 23
 
6.6%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 508
59.2%
ASCII 350
40.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
155
44.3%
1 26
 
7.4%
3 25
 
7.1%
, 25
 
7.1%
5 24
 
6.9%
0 20
 
5.7%
4 18
 
5.1%
2 15
 
4.3%
6 10
 
2.9%
9 8
 
2.3%
Other values (6) 24
 
6.9%
Hangul
ValueCountFrequency (%)
50
 
9.8%
48
 
9.4%
29
 
5.7%
28
 
5.5%
28
 
5.5%
27
 
5.3%
27
 
5.3%
27
 
5.3%
27
 
5.3%
27
 
5.3%
Other values (57) 190
37.4%

연락처
Text

MISSING 

Distinct12
Distinct (%)85.7%
Missing13
Missing (%)48.1%
Memory size348.0 B
2023-12-11T02:31:46.360019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.071429
Min length12

Characters and Unicode

Total characters169
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)71.4%

Sample

1st row051-722-2205
2nd row051-724-0057
3rd row051-715-2560
4th row051-741-8390
5th row051-724-5830
ValueCountFrequency (%)
051-728-0903 2
14.3%
051-704-9936 2
14.3%
051-722-2205 1
7.1%
051-724-0057 1
7.1%
051-715-2560 1
7.1%
051-741-8390 1
7.1%
051-724-5830 1
7.1%
070-4110-8599 1
7.1%
051-728-5923 1
7.1%
051-243-4972 1
7.1%
Other values (2) 2
14.3%
2023-12-11T02:31:47.074038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 28
16.6%
- 28
16.6%
1 21
12.4%
5 20
11.8%
7 16
9.5%
2 14
8.3%
9 13
7.7%
3 10
 
5.9%
4 9
 
5.3%
8 6
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 141
83.4%
Dash Punctuation 28
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 28
19.9%
1 21
14.9%
5 20
14.2%
7 16
11.3%
2 14
9.9%
9 13
9.2%
3 10
 
7.1%
4 9
 
6.4%
8 6
 
4.3%
6 4
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 169
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 28
16.6%
- 28
16.6%
1 21
12.4%
5 20
11.8%
7 16
9.5%
2 14
8.3%
9 13
7.7%
3 10
 
5.9%
4 9
 
5.3%
8 6
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 169
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 28
16.6%
- 28
16.6%
1 21
12.4%
5 20
11.8%
7 16
9.5%
2 14
8.3%
9 13
7.7%
3 10
 
5.9%
4 9
 
5.3%
8 6
 
3.6%

Correlations

2023-12-11T02:31:47.288992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종상호소재지연락처
업종1.0000.0000.0000.496
상호0.0001.0001.0001.000
소재지0.0001.0001.0001.000
연락처0.4961.0001.0001.000

Missing values

2023-12-11T02:31:42.465667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:31:42.700309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호소재지연락처
0종합여행업대화여행사부산광역시 기장군 기장읍 기장대로 543-2, 2층051-722-2205
1종합여행업하늘투어부산광역시 기장군 기장읍 차성로 461, 108동 104호 (이진캐스빌블루 1차아파트)051-724-0057
2종합여행업여행사 우부산광역시 기장군 기장읍 기장해안로 136, 103호<NA>
3종합여행업다함 여행사부산광역시 기장군 기장읍 차성로190번길 33, 1층<NA>
4종합여행업Go East Travel부산광역시 기장군 정관읍 정관로 565, 진우빌딩 4층 404호051-715-2560
5종합여행업탐 여행사(TOM TOUR)부산광역시 기장군 정관읍 정관로 560, 마치빌딩<NA>
6종합여행업(주)하이트래블에이전시부산광역시 기장군 기장읍 당사로3길 32, 1층051-741-8390
7종합여행업보담 여행사부산광역시 기장군 기장읍 기장해안로 108, 에이원오션시티 502,503호<NA>
8국내외여행업(주)나이스투어부산광역시 기장군 기장읍 차성동로 116, 2층051-724-5830
9국내외여행업주식회사 유민여행사부산광역시 기장군 정관읍 정관7로 34 (서진프라자)051-728-0903
업종상호소재지연락처
17국내외여행업주식회사 투어파크부산광역시 기장군 장안읍 반룡산단3로 95, 경동오토필드 504호051-704-9936
18국내외여행업써니투어부산광역시 기장군 정관읍 정관8로 28, 504호<NA>
19국내여행업주식회사 유민여행사부산광역시 기장군 정관읍 정관7로 34 (서진프라자)051-728-0903
20국내여행업여행이필요할때 생각나는남자(여필남)부산광역시 기장군 정관읍 정관로 583, 503호051-243-4972
21국내여행업주식회사 여행나무부산광역시 기장군 기장읍 기장대로 495, 2301호051-611-7979
22국내여행업다된다투어부산광역시 기장군 기장읍 반송로 1601051-721-4331
23국내여행업비와이컴퍼니부산광역시 기장군 정관읍 정관중앙로 45, 2층 205호<NA>
24국내여행업(주)세명항공여행사부산광역시 기장군 기장읍 반송로 1576, 아토건설 3층<NA>
25국내여행업대박부산광역시 기장군 기장읍 기장대로 482-5, 303호 (비룡 벨로스텔라)<NA>
26국내여행업주식회사 투어파크부산광역시 기장군 장안읍 반룡산단3로 95, 경동오토필드 504호051-704-9936