Overview

Dataset statistics

Number of variables6
Number of observations90
Missing cells32
Missing cells (%)5.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.3 KiB
Average record size in memory49.5 B

Variable types

Categorical2
Text3
DateTime1

Dataset

Description경상남도 거제시 여행업현황(등록일자, 업종, 상호, 소재지, 위도, 경도, 연락처, 기준일자)등에 대한 정보를 제공합니다.
Author경상남도 거제시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3079556

Alerts

기준일 has constant value ""Constant
전화번호 has 32 (35.6%) missing valuesMissing

Reproduction

Analysis started2024-04-17 19:08:48.394158
Analysis finished2024-04-17 19:08:48.900064
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size852.0 B
국내여행업
43 
국외여행업
34 
일반여행업
13 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반여행업
2nd row국외여행업
3rd row일반여행업
4th row일반여행업
5th row국외여행업

Common Values

ValueCountFrequency (%)
국내여행업 43
47.8%
국외여행업 34
37.8%
일반여행업 13
 
14.4%

Length

2024-04-18T04:08:48.945945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T04:08:49.024627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내여행업 43
47.8%
국외여행업 34
37.8%
일반여행업 13
 
14.4%

상호
Text

Distinct60
Distinct (%)66.7%
Missing0
Missing (%)0.0%
Memory size852.0 B
2024-04-18T04:08:49.204471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length7.5666667
Min length4

Characters and Unicode

Total characters681
Distinct characters121
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)34.4%

Sample

1st row라온제나
2nd row더월드투어
3rd row외국트레블
4th row주식회사 엔젤
5th row(주)거제씨월드
ValueCountFrequency (%)
주식회사 6
 
5.8%
주)동백관광 3
 
2.9%
주)한려관광 2
 
1.9%
대원투어 2
 
1.9%
주)브이아이피항공여행사 2
 
1.9%
주)오션시티 2
 
1.9%
거제도투어 2
 
1.9%
여행백화점 2
 
1.9%
주)여행만들기 2
 
1.9%
주)대성투어 2
 
1.9%
Other values (56) 78
75.7%
2024-04-18T04:08:49.496197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
58
 
8.5%
) 51
 
7.5%
( 50
 
7.3%
33
 
4.8%
33
 
4.8%
32
 
4.7%
31
 
4.6%
31
 
4.6%
14
 
2.1%
13
 
1.9%
Other values (111) 335
49.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 564
82.8%
Close Punctuation 51
 
7.5%
Open Punctuation 50
 
7.3%
Space Separator 13
 
1.9%
Decimal Number 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
58
 
10.3%
33
 
5.9%
33
 
5.9%
32
 
5.7%
31
 
5.5%
31
 
5.5%
14
 
2.5%
13
 
2.3%
12
 
2.1%
12
 
2.1%
Other values (105) 295
52.3%
Decimal Number
ValueCountFrequency (%)
3 1
33.3%
6 1
33.3%
5 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 51
100.0%
Open Punctuation
ValueCountFrequency (%)
( 50
100.0%
Space Separator
ValueCountFrequency (%)
13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 564
82.8%
Common 117
 
17.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
58
 
10.3%
33
 
5.9%
33
 
5.9%
32
 
5.7%
31
 
5.5%
31
 
5.5%
14
 
2.5%
13
 
2.3%
12
 
2.1%
12
 
2.1%
Other values (105) 295
52.3%
Common
ValueCountFrequency (%)
) 51
43.6%
( 50
42.7%
13
 
11.1%
3 1
 
0.9%
6 1
 
0.9%
5 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 564
82.8%
ASCII 117
 
17.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
58
 
10.3%
33
 
5.9%
33
 
5.9%
32
 
5.7%
31
 
5.5%
31
 
5.5%
14
 
2.5%
13
 
2.3%
12
 
2.1%
12
 
2.1%
Other values (105) 295
52.3%
ASCII
ValueCountFrequency (%)
) 51
43.6%
( 50
42.7%
13
 
11.1%
3 1
 
0.9%
6 1
 
0.9%
5 1
 
0.9%
Distinct63
Distinct (%)70.0%
Missing0
Missing (%)0.0%
Memory size852.0 B
2024-04-18T04:08:49.746912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length41
Mean length28.533333
Min length19

Characters and Unicode

Total characters2568
Distinct characters135
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)43.3%

Sample

1st row경상남도 거제시 일운면 거제대로 2631, 라마다호텔 1층
2nd row경상남도 거제시 옥포대첩로 1, 103호 (옥포동, 옥포마리나아파트)
3rd row경상남도 거제시 일운면 지세포해안로 38
4th row경상남도 거제시 거제중앙로5길 9 (상동동)
5th row경상남도 거제시 일운면 지세포해안로 15 (거제씨월드)
ValueCountFrequency (%)
경상남도 90
 
16.5%
거제시 90
 
16.5%
고현동 20
 
3.7%
옥포동 18
 
3.3%
1층 14
 
2.6%
거제대로 13
 
2.4%
장평동 12
 
2.2%
2층 10
 
1.8%
일운면 10
 
1.8%
거제중앙로 8
 
1.5%
Other values (139) 259
47.6%
2024-04-18T04:08:50.101206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
454
 
17.7%
124
 
4.8%
122
 
4.8%
1 108
 
4.2%
105
 
4.1%
92
 
3.6%
92
 
3.6%
91
 
3.5%
90
 
3.5%
90
 
3.5%
Other values (125) 1200
46.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1531
59.6%
Space Separator 454
 
17.7%
Decimal Number 357
 
13.9%
Close Punctuation 72
 
2.8%
Open Punctuation 72
 
2.8%
Other Punctuation 64
 
2.5%
Dash Punctuation 11
 
0.4%
Uppercase Letter 6
 
0.2%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
124
 
8.1%
122
 
8.0%
105
 
6.9%
92
 
6.0%
92
 
6.0%
91
 
5.9%
90
 
5.9%
90
 
5.9%
84
 
5.5%
50
 
3.3%
Other values (105) 591
38.6%
Decimal Number
ValueCountFrequency (%)
1 108
30.3%
2 54
15.1%
3 36
 
10.1%
0 34
 
9.5%
7 26
 
7.3%
4 24
 
6.7%
6 20
 
5.6%
5 20
 
5.6%
9 19
 
5.3%
8 16
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
G 2
33.3%
S 2
33.3%
C 1
16.7%
B 1
16.7%
Space Separator
ValueCountFrequency (%)
454
100.0%
Close Punctuation
ValueCountFrequency (%)
) 72
100.0%
Open Punctuation
ValueCountFrequency (%)
( 72
100.0%
Other Punctuation
ValueCountFrequency (%)
, 64
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1531
59.6%
Common 1030
40.1%
Latin 7
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
124
 
8.1%
122
 
8.0%
105
 
6.9%
92
 
6.0%
92
 
6.0%
91
 
5.9%
90
 
5.9%
90
 
5.9%
84
 
5.5%
50
 
3.3%
Other values (105) 591
38.6%
Common
ValueCountFrequency (%)
454
44.1%
1 108
 
10.5%
) 72
 
7.0%
( 72
 
7.0%
, 64
 
6.2%
2 54
 
5.2%
3 36
 
3.5%
0 34
 
3.3%
7 26
 
2.5%
4 24
 
2.3%
Other values (5) 86
 
8.3%
Latin
ValueCountFrequency (%)
G 2
28.6%
S 2
28.6%
e 1
14.3%
C 1
14.3%
B 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1531
59.6%
ASCII 1037
40.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
454
43.8%
1 108
 
10.4%
) 72
 
6.9%
( 72
 
6.9%
, 64
 
6.2%
2 54
 
5.2%
3 36
 
3.5%
0 34
 
3.3%
7 26
 
2.5%
4 24
 
2.3%
Other values (10) 93
 
9.0%
Hangul
ValueCountFrequency (%)
124
 
8.1%
122
 
8.0%
105
 
6.9%
92
 
6.0%
92
 
6.0%
91
 
5.9%
90
 
5.9%
90
 
5.9%
84
 
5.5%
50
 
3.3%
Other values (105) 591
38.6%

전화번호
Text

MISSING 

Distinct40
Distinct (%)69.0%
Missing32
Missing (%)35.6%
Memory size852.0 B
2024-04-18T04:08:50.278855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.017241
Min length12

Characters and Unicode

Total characters697
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)39.7%

Sample

1st row055-681-7080
2nd row055-688-5441
3rd row070-4924-6123
4th row055-632-1244
5th row055-632-3531
ValueCountFrequency (%)
055-638-1833 3
 
5.2%
055-637-1001 2
 
3.4%
055-636-1312 2
 
3.4%
055-637-2001 2
 
3.4%
055-688-5441 2
 
3.4%
055-682-2351 2
 
3.4%
055-638-2999 2
 
3.4%
055-637-9837 2
 
3.4%
055-637-7997 2
 
3.4%
055-687-1234 2
 
3.4%
Other values (30) 37
63.8%
2024-04-18T04:08:50.550459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 131
18.8%
- 116
16.6%
0 92
13.2%
6 78
11.2%
3 68
9.8%
8 43
 
6.2%
1 42
 
6.0%
2 38
 
5.5%
7 38
 
5.5%
9 27
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 581
83.4%
Dash Punctuation 116
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 131
22.5%
0 92
15.8%
6 78
13.4%
3 68
11.7%
8 43
 
7.4%
1 42
 
7.2%
2 38
 
6.5%
7 38
 
6.5%
9 27
 
4.6%
4 24
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 116
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 697
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 131
18.8%
- 116
16.6%
0 92
13.2%
6 78
11.2%
3 68
9.8%
8 43
 
6.2%
1 42
 
6.0%
2 38
 
5.5%
7 38
 
5.5%
9 27
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 697
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 131
18.8%
- 116
16.6%
0 92
13.2%
6 78
11.2%
3 68
9.8%
8 43
 
6.2%
1 42
 
6.0%
2 38
 
5.5%
7 38
 
5.5%
9 27
 
3.9%
Distinct66
Distinct (%)73.3%
Missing0
Missing (%)0.0%
Memory size852.0 B
Minimum1987-09-25 00:00:00
Maximum2019-06-18 00:00:00
2024-04-18T04:08:50.664397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T04:08:50.783008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size852.0 B
2019-11-25
90 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019-11-25
2nd row2019-11-25
3rd row2019-11-25
4th row2019-11-25
5th row2019-11-25

Common Values

ValueCountFrequency (%)
2019-11-25 90
100.0%

Length

2024-04-18T04:08:50.873191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T04:08:50.943824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-11-25 90
100.0%

Correlations

2024-04-18T04:08:50.992263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종상호소재지(도로명)전화번호등록일자
업종1.0000.0000.0000.0000.000
상호0.0001.0000.9990.9980.999
소재지(도로명)0.0000.9991.0000.9990.996
전화번호0.0000.9980.9991.0000.991
등록일자0.0000.9990.9960.9911.000

Missing values

2024-04-18T04:08:48.791746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-18T04:08:48.869906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호소재지(도로명)전화번호등록일자기준일
0일반여행업라온제나경상남도 거제시 일운면 거제대로 2631, 라마다호텔 1층055-681-70802018-02-092019-11-25
1국외여행업더월드투어경상남도 거제시 옥포대첩로 1, 103호 (옥포동, 옥포마리나아파트)055-688-54412017-03-022019-11-25
2일반여행업외국트레블경상남도 거제시 일운면 지세포해안로 38<NA>2014-07-282019-11-25
3일반여행업주식회사 엔젤경상남도 거제시 거제중앙로5길 9 (상동동)<NA>2018-10-222019-11-25
4국외여행업(주)거제씨월드경상남도 거제시 일운면 지세포해안로 15 (거제씨월드)070-4924-61232017-06-152019-11-25
5국외여행업(주)정운고속관광경상남도 거제시 피솔길 104 (장평동)<NA>2000-02-112019-11-25
6국외여행업이너스투어경상남도 거제시 계룡로2길 69-1, 1층 (고현동)<NA>2016-08-122019-11-25
7국내여행업주식회사 거제밴투어경상남도 거제시 성산로 169, 109호 (덕포동, 거제옥포도뮤토2단지)<NA>2017-09-252019-11-25
8국내여행업스카이투어경상남도 거제시 옥포로 174, 301호 (옥포동)<NA>2017-10-202019-11-25
9국내여행업거제관광전략센터주식회사경상남도 거제시 거제면 거제남서로 3525<NA>2013-01-212019-11-25
업종상호소재지(도로명)전화번호등록일자기준일
80국외여행업칠백리 투어경상남도 거제시 능포로 115, 403호 (능포동, 문화빌딩)055-687-20902015-01-122019-11-25
81국내여행업(주)여행과사람경상남도 거제시 거제대로 3730, 영진상가 2층 (옥포동)055-687-40002000-08-292019-11-25
82국외여행업(주)여행과사람경상남도 거제시 거제대로 3730, 영진상가 2층 (옥포동)055-687-40002000-09-012019-11-25
83일반여행업유)대금투어경상남도 거제시 연초면 죽토로 31055-688-12312019-06-182019-11-25
84일반여행업365투어경상남도 거제시 거제대로 3791 (옥포동, 거제박물관)055-688-32652009-12-282019-11-25
85일반여행업(주)드림투어경상남도 거제시 옥포대첩로 75 (옥포동)055-688-52922016-10-182019-11-25
86국내여행업더월드투어경상남도 거제시 옥포대첩로 1, 옥포마리나아파트 상가동 103호 (옥포동)055-688-54412017-03-022019-11-25
87국내여행업(주)대우투어경상남도 거제시 일운면 거제대로 2799055-688-61882008-11-192019-11-25
88국외여행업(주)여행만들기경상남도 거제시 거제중앙로 1894 (고현동)055-634-00552014-04-212019-11-25
89일반여행업고고트레블주식회사경상남도 거제시 성산로 33, 112동 102호 (옥포동, e편한세상 옥포아파트 1단지)055-687-76692017-07-192019-11-25