Overview

Dataset statistics

Number of variables8
Number of observations60
Missing cells22
Missing cells (%)4.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory68.2 B

Variable types

Categorical2
Text3
Numeric2
DateTime1

Dataset

Description경상남도 거제시 여행업현황(등록일자, 업종, 상호, 소재지, 위도, 경도, 연락처, 기준일자)등에 대한 정보를 제공합니다.
Author경상남도 거제시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3079556

Alerts

기준일자 has constant value ""Constant
전화번호 has 22 (36.7%) missing valuesMissing

Reproduction

Analysis started2024-04-17 19:08:43.548099
Analysis finished2024-04-17 19:08:44.528361
Duration0.98 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size612.0 B
국내외여행업
26 
국내여행업
20 
종합여행업
14 

Length

Max length6
Median length5
Mean length5.4333333
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내외여행업 26
43.3%
국내여행업 20
33.3%
종합여행업 14
23.3%

Length

2024-04-18T04:08:44.579021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T04:08:44.660838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 26
43.3%
국내여행업 20
33.3%
종합여행업 14
23.3%

상호
Text

Distinct53
Distinct (%)88.3%
Missing0
Missing (%)0.0%
Memory size612.0 B
2024-04-18T04:08:44.828769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length7.7833333
Min length4

Characters and Unicode

Total characters467
Distinct characters121
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)76.7%

Sample

1st row(주)대웅관광
2nd row(주)바다로
3rd row여행백화점 거제도투어
4th row거제에코투어
5th row거제시티투어(주)
ValueCountFrequency (%)
주식회사 10
 
12.8%
주)동부여행사 2
 
2.6%
대원투어 2
 
2.6%
여행사랑 2
 
2.6%
솔미여행 2
 
2.6%
마운틴 2
 
2.6%
강남투어 2
 
2.6%
여행백화점 2
 
2.6%
거제도투어 2
 
2.6%
참조은여행사 2
 
2.6%
Other values (50) 50
64.1%
2024-04-18T04:08:45.125896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
36
 
7.7%
) 26
 
5.6%
( 25
 
5.4%
25
 
5.4%
23
 
4.9%
22
 
4.7%
21
 
4.5%
21
 
4.5%
18
 
3.9%
12
 
2.6%
Other values (111) 238
51.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 395
84.6%
Close Punctuation 26
 
5.6%
Open Punctuation 25
 
5.4%
Space Separator 18
 
3.9%
Decimal Number 3
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
9.1%
25
 
6.3%
23
 
5.8%
22
 
5.6%
21
 
5.3%
21
 
5.3%
12
 
3.0%
11
 
2.8%
8
 
2.0%
7
 
1.8%
Other values (105) 209
52.9%
Decimal Number
ValueCountFrequency (%)
6 1
33.3%
3 1
33.3%
5 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Space Separator
ValueCountFrequency (%)
18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 395
84.6%
Common 72
 
15.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
9.1%
25
 
6.3%
23
 
5.8%
22
 
5.6%
21
 
5.3%
21
 
5.3%
12
 
3.0%
11
 
2.8%
8
 
2.0%
7
 
1.8%
Other values (105) 209
52.9%
Common
ValueCountFrequency (%)
) 26
36.1%
( 25
34.7%
18
25.0%
6 1
 
1.4%
3 1
 
1.4%
5 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 395
84.6%
ASCII 72
 
15.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
36
 
9.1%
25
 
6.3%
23
 
5.8%
22
 
5.6%
21
 
5.3%
21
 
5.3%
12
 
3.0%
11
 
2.8%
8
 
2.0%
7
 
1.8%
Other values (105) 209
52.9%
ASCII
ValueCountFrequency (%)
) 26
36.1%
( 25
34.7%
18
25.0%
6 1
 
1.4%
3 1
 
1.4%
5 1
 
1.4%

주소
Text

Distinct51
Distinct (%)85.0%
Missing0
Missing (%)0.0%
Memory size612.0 B
2024-04-18T04:08:45.361752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length39
Mean length27.866667
Min length19

Characters and Unicode

Total characters1672
Distinct characters133
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)70.0%

Sample

1st row경상남도 거제시 일운면 거제대로 2799
2nd row경상남도 거제시 중곡2로 89-1,거제2차덕산베스트타운상가 401호 (고현동)
3rd row경상남도 거제시 서문로 48 (고현동)
4th row경상남도 거제시 연초면 대금산로 20
5th row경상남도 거제시 능포로 241,2층 (능포동)
ValueCountFrequency (%)
경상남도 60
 
18.5%
거제시 60
 
18.5%
장평동 11
 
3.4%
고현동 11
 
3.4%
2층 6
 
1.9%
거제대로 5
 
1.5%
일운면 5
 
1.5%
능포동 4
 
1.2%
능포로 4
 
1.2%
1층 4
 
1.2%
Other values (118) 154
47.5%
2024-04-18T04:08:45.699947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
264
 
15.8%
75
 
4.5%
75
 
4.5%
68
 
4.1%
61
 
3.6%
61
 
3.6%
60
 
3.6%
60
 
3.6%
1 60
 
3.6%
60
 
3.6%
Other values (123) 828
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1015
60.7%
Space Separator 264
 
15.8%
Decimal Number 235
 
14.1%
Close Punctuation 49
 
2.9%
Open Punctuation 49
 
2.9%
Other Punctuation 44
 
2.6%
Dash Punctuation 9
 
0.5%
Uppercase Letter 6
 
0.4%
Lowercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
75
 
7.4%
75
 
7.4%
68
 
6.7%
61
 
6.0%
61
 
6.0%
60
 
5.9%
60
 
5.9%
60
 
5.9%
53
 
5.2%
33
 
3.3%
Other values (103) 409
40.3%
Decimal Number
ValueCountFrequency (%)
1 60
25.5%
2 51
21.7%
3 21
 
8.9%
0 18
 
7.7%
7 17
 
7.2%
5 15
 
6.4%
4 15
 
6.4%
6 14
 
6.0%
8 13
 
5.5%
9 11
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
S 2
33.3%
G 2
33.3%
C 1
16.7%
D 1
16.7%
Space Separator
ValueCountFrequency (%)
264
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Other Punctuation
ValueCountFrequency (%)
44
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1015
60.7%
Common 650
38.9%
Latin 7
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
75
 
7.4%
75
 
7.4%
68
 
6.7%
61
 
6.0%
61
 
6.0%
60
 
5.9%
60
 
5.9%
60
 
5.9%
53
 
5.2%
33
 
3.3%
Other values (103) 409
40.3%
Common
ValueCountFrequency (%)
264
40.6%
1 60
 
9.2%
2 51
 
7.8%
) 49
 
7.5%
( 49
 
7.5%
44
 
6.8%
3 21
 
3.2%
0 18
 
2.8%
7 17
 
2.6%
5 15
 
2.3%
Other values (5) 62
 
9.5%
Latin
ValueCountFrequency (%)
S 2
28.6%
G 2
28.6%
C 1
14.3%
e 1
14.3%
D 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1015
60.7%
ASCII 613
36.7%
None 44
 
2.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
264
43.1%
1 60
 
9.8%
2 51
 
8.3%
) 49
 
8.0%
( 49
 
8.0%
3 21
 
3.4%
0 18
 
2.9%
7 17
 
2.8%
5 15
 
2.4%
4 15
 
2.4%
Other values (9) 54
 
8.8%
Hangul
ValueCountFrequency (%)
75
 
7.4%
75
 
7.4%
68
 
6.7%
61
 
6.0%
61
 
6.0%
60
 
5.9%
60
 
5.9%
60
 
5.9%
53
 
5.2%
33
 
3.3%
Other values (103) 409
40.3%
None
ValueCountFrequency (%)
44
100.0%

위도
Real number (ℝ)

Distinct47
Distinct (%)78.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34.885801
Minimum34.739716
Maximum34.97977
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size672.0 B
2024-04-18T04:08:45.810788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum34.739716
5-th percentile34.840078
Q134.880797
median34.889599
Q334.894461
95-th percentile34.931373
Maximum34.97977
Range0.24005368
Interquartile range (IQR)0.013663575

Descriptive statistics

Standard deviation0.030517432
Coefficient of variation (CV)0.00087478089
Kurtosis9.6305009
Mean34.885801
Median Absolute Deviation (MAD)0.006907345
Skewness-1.6132578
Sum2093.1481
Variance0.00093131367
MonotonicityNot monotonic
2024-04-18T04:08:45.912873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
34.90212736 3
 
5.0%
34.85122922 2
 
3.3%
34.88269174 2
 
3.3%
34.89446106 2
 
3.3%
34.8937332 2
 
3.3%
34.89467188 2
 
3.3%
34.88924723 2
 
3.3%
34.89355941 2
 
3.3%
34.93137327 2
 
3.3%
34.89317772 2
 
3.3%
Other values (37) 39
65.0%
ValueCountFrequency (%)
34.73971604 1
1.7%
34.81352878 1
1.7%
34.83587891 1
1.7%
34.84029905 1
1.7%
34.85122922 2
3.3%
34.86455214 1
1.7%
34.86458893 1
1.7%
34.86532048 1
1.7%
34.87010458 1
1.7%
34.8750801 1
1.7%
ValueCountFrequency (%)
34.97976972 1
 
1.7%
34.93566876 1
 
1.7%
34.93137327 2
3.3%
34.91399552 1
 
1.7%
34.91377721 1
 
1.7%
34.90320443 1
 
1.7%
34.90212736 3
5.0%
34.89812434 1
 
1.7%
34.89631404 1
 
1.7%
34.89467188 2
3.3%

경도
Real number (ℝ)

Distinct47
Distinct (%)78.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean128.65291
Minimum128.5245
Maximum128.73692
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size672.0 B
2024-04-18T04:08:46.013060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum128.5245
5-th percentile128.58414
Q1128.62133
median128.63344
Q3128.69347
95-th percentile128.73136
Maximum128.73692
Range0.2124225
Interquartile range (IQR)0.072139325

Descriptive statistics

Standard deviation0.049067482
Coefficient of variation (CV)0.00038139426
Kurtosis-0.13461534
Mean128.65291
Median Absolute Deviation (MAD)0.0442493
Skewness-0.27857959
Sum7719.1747
Variance0.0024076178
MonotonicityNot monotonic
2024-04-18T04:08:46.110131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
128.5841433 3
 
5.0%
128.7095063 2
 
3.3%
128.7365023 2
 
3.3%
128.6141113 2
 
3.3%
128.6875965 2
 
3.3%
128.6309951 2
 
3.3%
128.6917711 2
 
3.3%
128.6943675 2
 
3.3%
128.524498 2
 
3.3%
128.6093971 2
 
3.3%
Other values (37) 39
65.0%
ValueCountFrequency (%)
128.524498 2
3.3%
128.5841433 3
5.0%
128.6060125 1
 
1.7%
128.6093971 2
3.3%
128.6141113 2
3.3%
128.6159087 2
3.3%
128.6170567 1
 
1.7%
128.6199746 1
 
1.7%
128.6213344 2
3.3%
128.6219549 1
 
1.7%
ValueCountFrequency (%)
128.7369205 1
1.7%
128.7365023 2
3.3%
128.7310853 1
1.7%
128.7305504 1
1.7%
128.7095063 2
3.3%
128.7050288 1
1.7%
128.7014586 1
1.7%
128.699386 1
1.7%
128.6956215 1
1.7%
128.6953251 1
1.7%

전화번호
Text

MISSING 

Distinct34
Distinct (%)89.5%
Missing22
Missing (%)36.7%
Memory size612.0 B
2024-04-18T04:08:46.279893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.026316
Min length12

Characters and Unicode

Total characters457
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)78.9%

Sample

1st row055-681-6167
2nd row055-687-6500
3rd row055-632-6377
4th row055-681-6167
5th row055-634-0060
ValueCountFrequency (%)
055-637-7997 2
 
5.3%
055-681-6167 2
 
5.3%
055-634-2267 2
 
5.3%
055-632-7446 2
 
5.3%
055-687-7669 1
 
2.6%
070-4924-6123 1
 
2.6%
055-688-3265 1
 
2.6%
055-687-0505 1
 
2.6%
055-633-6622 1
 
2.6%
055-639-8229 1
 
2.6%
Other values (24) 24
63.2%
2024-04-18T04:08:46.556586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 87
19.0%
- 76
16.6%
0 60
13.1%
6 60
13.1%
3 35
7.7%
7 35
7.7%
8 28
 
6.1%
2 24
 
5.3%
1 20
 
4.4%
9 16
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 381
83.4%
Dash Punctuation 76
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 87
22.8%
0 60
15.7%
6 60
15.7%
3 35
9.2%
7 35
9.2%
8 28
 
7.3%
2 24
 
6.3%
1 20
 
5.2%
9 16
 
4.2%
4 16
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 76
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 457
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 87
19.0%
- 76
16.6%
0 60
13.1%
6 60
13.1%
3 35
7.7%
7 35
7.7%
8 28
 
6.1%
2 24
 
5.3%
1 20
 
4.4%
9 16
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 457
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 87
19.0%
- 76
16.6%
0 60
13.1%
6 60
13.1%
3 35
7.7%
7 35
7.7%
8 28
 
6.1%
2 24
 
5.3%
1 20
 
4.4%
9 16
 
3.5%
Distinct53
Distinct (%)88.3%
Missing0
Missing (%)0.0%
Memory size612.0 B
Minimum1995-01-07 00:00:00
Maximum2022-07-26 00:00:00
2024-04-18T04:08:46.667173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T04:08:46.790490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size612.0 B
2022-09-05
60 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-09-05
2nd row2022-09-05
3rd row2022-09-05
4th row2022-09-05
5th row2022-09-05

Common Values

ValueCountFrequency (%)
2022-09-05 60
100.0%

Length

2024-04-18T04:08:46.879943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T04:08:46.948875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-09-05 60
100.0%

Interactions

2024-04-18T04:08:44.047164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T04:08:43.930088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T04:08:44.109535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T04:08:43.987287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-18T04:08:46.993949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종상호주소위도경도전화번호등록일자
업종1.0000.0000.0000.0000.2400.3340.000
상호0.0001.0000.9670.9860.5660.9760.987
주소0.0000.9671.0001.0001.0001.0000.993
위도0.0000.9861.0001.0000.8271.0000.989
경도0.2400.5661.0000.8271.0001.0000.970
전화번호0.3340.9761.0001.0001.0001.0000.995
등록일자0.0000.9870.9930.9890.9700.9951.000
2024-04-18T04:08:47.071989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위도경도업종
위도1.000-0.4470.000
경도-0.4471.0000.113
업종0.0000.1131.000

Missing values

2024-04-18T04:08:44.203611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-18T04:08:44.493850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호주소위도경도전화번호등록일자기준일자
0국내여행업(주)대웅관광경상남도 거제시 일운면 거제대로 279934.851229128.709506055-681-61672002-01-152022-09-05
1국내여행업(주)바다로경상남도 거제시 중곡2로 89-1,거제2차덕산베스트타운상가 401호 (고현동)34.896314128.635887<NA>2002-05-272022-09-05
2국내여행업여행백화점 거제도투어경상남도 거제시 서문로 48 (고현동)34.888916128.621334055-687-65002004-02-262022-09-05
3국내여행업거제에코투어경상남도 거제시 연초면 대금산로 2034.935669128.658424<NA>2005-01-282022-09-05
4국내여행업거제시티투어(주)경상남도 거제시 능포로 241,2층 (능포동)34.884195128.73692<NA>2005-07-052022-09-05
5국내여행업(주)바람의언덕경상남도 거제시 남부면 해금강로 13234.739716128.663338055-632-63772007-04-162022-09-05
6국내여행업(주)대우투어경상남도 거제시 일운면 거제대로 279934.851229128.709506055-681-61672008-11-192022-09-05
7국내여행업(주)상상속의여행경상남도 거제시 동문천로 6,101호 (고현동,고현하이츠빌라)34.879656128.623562055-634-00602013-10-182022-09-05
8국내여행업여행사랑경상남도 거제시 장평로 7 (장평동)34.890476128.615909055-637-79972014-02-212022-09-05
9국내여행업(주)동부여행사경상남도 거제시 장평로8길 32,2층 (장평동)34.893178128.609397055-634-22672014-11-192022-09-05
업종상호주소위도경도전화번호등록일자기준일자
50종합여행업(주)와우경상남도 거제시 동문1길 6,에스엠빌 202호 (고현동)34.884143128.62574055-688-52922016-10-182022-09-05
51종합여행업주식회사 엔젤경상남도 거제시 장평1로 57,2층 (장평동)34.894461128.614111<NA>2017-05-312022-09-05
52종합여행업고고트레블주식회사경상남도 거제시 성산로 33,112동 102호 (옥포동,e편한세상 옥포아파트 1단지)34.898124128.695325055-687-76692017-07-192022-09-05
53종합여행업주식회사 포우경상남도 거제시 거제중앙로5길 9 (상동동)34.877967128.627684055-636-33222018-10-222022-09-05
54종합여행업경남 전세버스 협동조합경상남도 거제시 국산로 24-25,C동 1층 (옥포동,씨에스 비버리힐즈1)34.903204128.683364055-687-09982019-05-172022-09-05
55종합여행업(주)드림투어경상남도 거제시 연초면 죽토로 3134.913996128.65882055-688-12312019-06-182022-09-05
56종합여행업외국트레블경상남도 거제시 장평3로 75,삼성사우매장 2층 (장평동)34.894141128.606012055-638-37002020-03-032022-09-05
57종합여행업(주)동백관광경상남도 거제시 일운면 거제대로 2631,라마다호텔 2층34.840299128.699386055-681-70802017-06-092022-09-05
58종합여행업웰리브투어경상남도 거제시 옥포성안로 71 (옥포동,이던플레이스)34.89281128.694536055-688-57072020-10-262022-09-05
59종합여행업365투어경상남도 거제시 계룡로 130 (고현동)34.881269128.621955055-637-10012022-07-262022-09-05