Overview

Dataset statistics

Number of variables6
Number of observations117
Missing cells30
Missing cells (%)4.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.6 KiB
Average record size in memory49.1 B

Variable types

Categorical3
Text3

Dataset

Description제주특별자치도 제주시 관내에 있는 국외 여행업 관련 데이터를 제공합니다.
Author제주특별자치도 제주시
URLhttps://www.data.go.kr/data/15056292/fileData.do

Alerts

구분 has constant value ""Constant
데이터기준일자 has constant value ""Constant
비고 is highly imbalanced (82.8%)Imbalance
연락처 has 30 (25.6%) missing valuesMissing
상호명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:43:32.070066
Analysis finished2023-12-12 19:43:32.555225
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
국외여행업
117 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국외여행업
2nd row국외여행업
3rd row국외여행업
4th row국외여행업
5th row국외여행업

Common Values

ValueCountFrequency (%)
국외여행업 117
100.0%

Length

2023-12-13T04:43:32.685521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:32.813383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국외여행업 117
100.0%

상호명
Text

UNIQUE 

Distinct117
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-13T04:43:33.054958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length11
Mean length7.8888889
Min length4

Characters and Unicode

Total characters923
Distinct characters191
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique117 ?
Unique (%)100.0%

Sample

1st row(유)그린삼육오
2nd row(유)대승항공여행사
3rd row(유)미로항공여행
4th row(유)반디투어
5th row(유)스카이관광
ValueCountFrequency (%)
주식회사 32
 
20.9%
유한회사 2
 
1.3%
홀리데이여행사 1
 
0.7%
제주몬딱 1
 
0.7%
㈜쎄븐투어 1
 
0.7%
㈜썬앤문투어 1
 
0.7%
혼디가자 1
 
0.7%
한스글로벌 1
 
0.7%
티엔지 1
 
0.7%
티앤토투어 1
 
0.7%
Other values (111) 111
72.5%
2023-12-13T04:43:33.566016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
70
 
7.6%
53
 
5.7%
50
 
5.4%
47
 
5.1%
47
 
5.1%
36
 
3.9%
36
 
3.9%
35
 
3.8%
34
 
3.7%
32
 
3.5%
Other values (181) 483
52.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 791
85.7%
Other Symbol 50
 
5.4%
Space Separator 36
 
3.9%
Open Punctuation 13
 
1.4%
Close Punctuation 13
 
1.4%
Lowercase Letter 8
 
0.9%
Uppercase Letter 8
 
0.9%
Decimal Number 4
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
70
 
8.8%
53
 
6.7%
47
 
5.9%
47
 
5.9%
36
 
4.6%
35
 
4.4%
34
 
4.3%
32
 
4.0%
26
 
3.3%
18
 
2.3%
Other values (160) 393
49.7%
Uppercase Letter
ValueCountFrequency (%)
O 2
25.0%
G 1
12.5%
F 1
12.5%
R 1
12.5%
U 1
12.5%
T 1
12.5%
C 1
12.5%
Lowercase Letter
ValueCountFrequency (%)
s 3
37.5%
r 1
 
12.5%
i 1
 
12.5%
t 1
 
12.5%
a 1
 
12.5%
l 1
 
12.5%
Decimal Number
ValueCountFrequency (%)
0 1
25.0%
5 1
25.0%
9 1
25.0%
1 1
25.0%
Other Symbol
ValueCountFrequency (%)
50
100.0%
Space Separator
ValueCountFrequency (%)
36
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 841
91.1%
Common 66
 
7.2%
Latin 16
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
70
 
8.3%
53
 
6.3%
50
 
5.9%
47
 
5.6%
47
 
5.6%
36
 
4.3%
35
 
4.2%
34
 
4.0%
32
 
3.8%
26
 
3.1%
Other values (161) 411
48.9%
Latin
ValueCountFrequency (%)
s 3
18.8%
O 2
12.5%
G 1
 
6.2%
F 1
 
6.2%
R 1
 
6.2%
U 1
 
6.2%
T 1
 
6.2%
r 1
 
6.2%
i 1
 
6.2%
t 1
 
6.2%
Other values (3) 3
18.8%
Common
ValueCountFrequency (%)
36
54.5%
( 13
 
19.7%
) 13
 
19.7%
0 1
 
1.5%
5 1
 
1.5%
9 1
 
1.5%
1 1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 791
85.7%
ASCII 82
 
8.9%
None 50
 
5.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
70
 
8.8%
53
 
6.7%
47
 
5.9%
47
 
5.9%
36
 
4.6%
35
 
4.4%
34
 
4.3%
32
 
4.0%
26
 
3.3%
18
 
2.3%
Other values (160) 393
49.7%
None
ValueCountFrequency (%)
50
100.0%
ASCII
ValueCountFrequency (%)
36
43.9%
( 13
 
15.9%
) 13
 
15.9%
s 3
 
3.7%
O 2
 
2.4%
G 1
 
1.2%
F 1
 
1.2%
R 1
 
1.2%
U 1
 
1.2%
T 1
 
1.2%
Other values (10) 10
 
12.2%
Distinct107
Distinct (%)91.5%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-13T04:43:34.073321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23
Mean length19.512821
Min length17

Characters and Unicode

Total characters2283
Distinct characters87
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique97 ?
Unique (%)82.9%

Sample

1st row제주특별자치도 제주시 선덕로8길 26
2nd row제주특별자치도 제주시 고마로13길 10
3rd row제주특별자치도 제주시 전농로 8-1
4th row제주특별자치도 제주시 연화로4길 7
5th row제주특별자치도 제주시 전농로 96
ValueCountFrequency (%)
제주특별자치도 117
24.8%
제주시 117
24.8%
도령로 5
 
1.1%
서광로 5
 
1.1%
중앙로 5
 
1.1%
연삼로 4
 
0.8%
고마로 3
 
0.6%
2 3
 
0.6%
서사로 3
 
0.6%
전농로 3
 
0.6%
Other values (161) 207
43.9%
2023-12-13T04:43:34.835855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
355
15.5%
238
 
10.4%
234
 
10.2%
127
 
5.6%
117
 
5.1%
117
 
5.1%
117
 
5.1%
117
 
5.1%
117
 
5.1%
89
 
3.9%
Other values (77) 655
28.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1577
69.1%
Space Separator 355
 
15.5%
Decimal Number 333
 
14.6%
Dash Punctuation 18
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
238
15.1%
234
14.8%
127
8.1%
117
7.4%
117
7.4%
117
7.4%
117
7.4%
117
7.4%
89
 
5.6%
55
 
3.5%
Other values (65) 249
15.8%
Decimal Number
ValueCountFrequency (%)
1 80
24.0%
2 48
14.4%
3 35
10.5%
4 34
10.2%
6 32
 
9.6%
7 26
 
7.8%
5 25
 
7.5%
8 20
 
6.0%
9 17
 
5.1%
0 16
 
4.8%
Space Separator
ValueCountFrequency (%)
355
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1577
69.1%
Common 706
30.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
238
15.1%
234
14.8%
127
8.1%
117
7.4%
117
7.4%
117
7.4%
117
7.4%
117
7.4%
89
 
5.6%
55
 
3.5%
Other values (65) 249
15.8%
Common
ValueCountFrequency (%)
355
50.3%
1 80
 
11.3%
2 48
 
6.8%
3 35
 
5.0%
4 34
 
4.8%
6 32
 
4.5%
7 26
 
3.7%
5 25
 
3.5%
8 20
 
2.8%
- 18
 
2.5%
Other values (2) 33
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1577
69.1%
ASCII 706
30.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
355
50.3%
1 80
 
11.3%
2 48
 
6.8%
3 35
 
5.0%
4 34
 
4.8%
6 32
 
4.5%
7 26
 
3.7%
5 25
 
3.5%
8 20
 
2.8%
- 18
 
2.5%
Other values (2) 33
 
4.7%
Hangul
ValueCountFrequency (%)
238
15.1%
234
14.8%
127
8.1%
117
7.4%
117
7.4%
117
7.4%
117
7.4%
117
7.4%
89
 
5.6%
55
 
3.5%
Other values (65) 249
15.8%

연락처
Text

MISSING 

Distinct87
Distinct (%)100.0%
Missing30
Missing (%)25.6%
Memory size1.0 KiB
2023-12-13T04:43:35.169293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length12
Mean length12
Min length9

Characters and Unicode

Total characters1044
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique87 ?
Unique (%)100.0%

Sample

1st row064-711-3651
2nd row064-727-5588
3rd row064-724-1206
4th row064-759-7788
5th row064-753-8496
ValueCountFrequency (%)
064-711-3651 1
 
1.1%
064-743-8081 1
 
1.1%
064-742-4983 1
 
1.1%
064-727-7707 1
 
1.1%
064-746-5665 1
 
1.1%
064-725-7747 1
 
1.1%
064-749-4155 1
 
1.1%
064-722-1000 1
 
1.1%
064-805-9889 1
 
1.1%
064-743-0866 1
 
1.1%
Other values (77) 77
88.5%
2023-12-13T04:43:35.687177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 171
16.4%
0 167
16.0%
4 162
15.5%
7 134
12.8%
6 118
11.3%
2 59
 
5.7%
1 54
 
5.2%
5 51
 
4.9%
8 41
 
3.9%
3 40
 
3.8%
Other values (2) 47
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 866
83.0%
Dash Punctuation 171
 
16.4%
Space Separator 7
 
0.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 167
19.3%
4 162
18.7%
7 134
15.5%
6 118
13.6%
2 59
 
6.8%
1 54
 
6.2%
5 51
 
5.9%
8 41
 
4.7%
3 40
 
4.6%
9 40
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 171
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1044
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 171
16.4%
0 167
16.0%
4 162
15.5%
7 134
12.8%
6 118
11.3%
2 59
 
5.7%
1 54
 
5.2%
5 51
 
4.9%
8 41
 
3.9%
3 40
 
3.8%
Other values (2) 47
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1044
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 171
16.4%
0 167
16.0%
4 162
15.5%
7 134
12.8%
6 118
11.3%
2 59
 
5.7%
1 54
 
5.2%
5 51
 
4.9%
8 41
 
3.9%
3 40
 
3.8%
Other values (2) 47
 
4.5%

비고
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
<NA>
114 
휴업
 
3

Length

Max length4
Median length4
Mean length3.9487179
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 114
97.4%
휴업 3
 
2.6%

Length

2023-12-13T04:43:35.896387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:36.091561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 114
97.4%
휴업 3
 
2.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2021-05-31
117 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-05-31
2nd row2021-05-31
3rd row2021-05-31
4th row2021-05-31
5th row2021-05-31

Common Values

ValueCountFrequency (%)
2021-05-31 117
100.0%

Length

2023-12-13T04:43:36.285552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:43:36.429007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-05-31 117
100.0%

Correlations

2023-12-13T04:43:36.535460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연락처
연락처1.000

Missing values

2023-12-13T04:43:32.375437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:43:32.504681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분상호명소재지연락처비고데이터기준일자
0국외여행업(유)그린삼육오제주특별자치도 제주시 선덕로8길 26064-711-3651<NA>2021-05-31
1국외여행업(유)대승항공여행사제주특별자치도 제주시 고마로13길 10064-727-5588<NA>2021-05-31
2국외여행업(유)미로항공여행제주특별자치도 제주시 전농로 8-1064-724-1206<NA>2021-05-31
3국외여행업(유)반디투어제주특별자치도 제주시 연화로4길 7<NA><NA>2021-05-31
4국외여행업(유)스카이관광제주특별자치도 제주시 전농로 96064-759-7788<NA>2021-05-31
5국외여행업(유)승제주특별자치도 제주시 남광로 113-1064-753-8496<NA>2021-05-31
6국외여행업(유)제주로여행제주특별자치도 제주시 대동길 2064-745-4560<NA>2021-05-31
7국외여행업(유)제주이글여행사제주특별자치도 제주시 서사로14길 2064-756-3661<NA>2021-05-31
8국외여행업(유)진성항공여행사제주특별자치도 제주시 월랑로10길 22064-724-5000<NA>2021-05-31
9국외여행업(유)한송여행사제주특별자치도 제주시 신대로12길 51064-711-2373<NA>2021-05-31
구분상호명소재지연락처비고데이터기준일자
107국외여행업㈜허브여행사제주특별자치도 제주시 신대로 124064-711-1012<NA>2021-05-31
108국외여행업케이앤에이치투어제주특별자치도 제주시 인다6길 53<NA><NA>2021-05-31
109국외여행업탑코리아여행사제주특별자치도 제주시 일주서로 7326064-713-0117<NA>2021-05-31
110국외여행업트래블라운지제주특별자치도 제주시 국기로 9064-711-4488<NA>2021-05-31
111국외여행업하루 여행사제주특별자치도 제주시 광양9길 33<NA><NA>2021-05-31
112국외여행업한우리투어제주특별자치도 제주시 중앙로 206064-748-4488<NA>2021-05-31
113국외여행업행복제주제주특별자치도 제주시 신대로 119<NA><NA>2021-05-31
114국외여행업현여행갤러리제주특별자치도 제주시 구남로6길 44<NA><NA>2021-05-31
115국외여행업홀리데이여행사제주특별자치도 제주시 진군4길 10064-746-6000<NA>2021-05-31
116국외여행업황금여행사제주특별자치도 제주시 중앙로 118-1064-723-3295<NA>2021-05-31