Overview

Dataset statistics

Number of variables: 8
Number of observations: 63
Missing cells: 24
Missing cells (%): 4.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.2 KiB
Average record size in memory: 68.0 B
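
The missing-cell percentage can be reproduced from the raw counts. The sketch below assumes the percentage is the number of missing cells divided by the total number of cells (observations × variables); the variable names are illustrative and do not come from the report.

```python
# Counts copied from the dataset statistics.
n_variables = 8
n_observations = 63
missing_cells = 24

# Total number of cells in the table, then the share that is missing.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

All 24 missing cells belong to a single column (전화번호), which is why the per-column missing rate reported for it is much higher than the dataset-wide rate.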

Variable types

Categorical: 2
Text: 3
Numeric: 2
DateTime: 1

Dataset

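Description: Provides information on travel businesses in Geoje-si, Gyeongsangnam-do, including registration date, business type, business name, location, latitude, longitude, contact number, and reference date.
Author: Geoje-si, Gyeongsangnam-do (경상남도 거제시)
URL: https://www.data.go.kr/data/3079556/fileData.do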

Alerts

기준일자 (reference date) has constant value "2024-02-07" [Constant]
전화번호 (phone number) has 24 (38.1%) missing values [Missing]

Reproduction

Analysis started: 2024-03-14 13:00:08.258224
Analysis finished: 2024-03-14 13:00:10.443847
Duration: 2.19 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
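
The reported duration follows from the two timestamps. A minimal sketch, assuming both timestamps are in the same time zone (the report does not state one):

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-03-14 13:00:08.258224")
finished = datetime.fromisoformat("2024-03-14 13:00:10.443847")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```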

Variables

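업종 (business type)
Categorical

Distinct: 3
Distinct (%): 4.8%
Missing: 0
Missing (%): 0.0%
Memory size: 632.0 B
국내외여행업 (domestic and overseas travel business): 27
국내여행업 (domestic travel business): 18
종합여행업 (general travel business): 18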

Length

Max length: 6
Median length: 5
Mean length: 5.4285714
Min length: 5
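
These length statistics can be checked against the category counts (27, 18, 18). The sketch below assumes length means the number of characters in each category label; the dictionary is built by hand from the counts in this section.

```python
from statistics import mean, median

# Category label -> number of rows, copied from the counts for 업종.
counts = {"국내외여행업": 27, "국내여행업": 18, "종합여행업": 18}

# One length entry per row, so the statistics are weighted by frequency.
lengths = [len(label) for label, n in counts.items() for _ in range(n)]

mean_length = mean(lengths)
median_length = median(lengths)
print(min(lengths), median_length, round(mean_length, 7), max(lengths))
```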

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국내여행업
2nd row: 국내여행업
3rd row: 국내여행업
4th row: 국내여행업
5th row: 국내여행업

Common Values

Value | Count | Frequency (%)
국내외여행업 (domestic and overseas travel business) | 27 | 42.9%
국내여행업 (domestic travel business) | 18 | 28.6%
종합여행업 (general travel business) | 18 | 28.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; it repeats the counts in the Common Values table]

상호 (business name)
Text

Distinct: 57
Distinct (%): 90.5%
Missing: 0
Missing (%): 0.0%
Memory size: 632.0 B

Length

Max length: 15
Median length: 11
Mean length: 7.6984127
Min length: 4

Characters and Unicode

Total characters: 485
Distinct characters: 132
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
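
As an illustration of such character properties, the sketch below counts characters by Unicode general category with Python's standard `unicodedata` module. This is a stand-in and not necessarily what the profiling tool uses internally. The stdlib exposes general categories only (not scripts or blocks). The two names are taken from the sample rows of this report. The two-letter codes map to the category names used in the tables: `Lo` Other Letter, `Ps` Open Punctuation, `Pe` Close Punctuation, `Zs` Space Separator, `Nd` Decimal Number, `So` Other Symbol, `Pd` Dash Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# Two business names that appear in the sample rows of this report.
names = ["(주)대웅관광", "㈜위드투어 앤 골프"]
for name in names:
    print(name, dict(category_counts(name)))
```

The second name contains the single-code-point symbol ㈜, which falls under Other Symbol and not Other Letter.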

Unique

Unique: 51
Unique (%): 81.0%

Sample

1st row: (주)대웅관광
2nd row: (주)바다로
3rd row: 여행백화점 거제도투어
4th row: 거제에코투어
5th row: 거제시티투어(주)

Value | Count | Frequency (%)
주식회사 | 9 | 10.7%
대원투어 | 2 | 2.4%
마운틴 | 2 | 2.4%
주)동백관광 | 2 | 2.4%
강남투어 | 2 | 2.4%
여행백화점 | 2 | 2.4%
솔미여행 | 2 | 2.4%
주)동부여행사 | 2 | 2.4%
거제도투어 | 2 | 2.4%
외국트레블 | 1 | 1.2%
Other values (58) | 58 | 69.0%

Most occurring characters

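Value | Count | Frequency (%)
(Hangul, not preserved) | 34 | 7.0%
(Hangul, not preserved) | 27 | 5.6%
(Hangul, not preserved) | 26 | 5.4%
) | 25 | 5.2%
( | 24 | 4.9%
(Hangul, not preserved) | 22 | 4.5%
(space) | 21 | 4.3%
(Hangul, not preserved) | 20 | 4.1%
(Hangul, not preserved) | 20 | 4.1%
(Hangul, not preserved) | 11 | 2.3%
Other values (122) | 255 | 52.6%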

Most occurring categories

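Value | Count | Frequency (%)
Other Letter | 411 | 84.7%
Close Punctuation | 25 | 5.2%
Open Punctuation | 24 | 4.9%
Space Separator | 21 | 4.3%
Decimal Number | 3 | 0.6%
Other Symbol | 1 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(Hangul, not preserved) | 34 | 8.3%
(Hangul, not preserved) | 27 | 6.6%
(Hangul, not preserved) | 26 | 6.3%
(Hangul, not preserved) | 22 | 5.4%
(Hangul, not preserved) | 20 | 4.9%
(Hangul, not preserved) | 20 | 4.9%
(Hangul, not preserved) | 11 | 2.7%
(Hangul, not preserved) | 10 | 2.4%
(Hangul, not preserved) | 9 | 2.2%
(Hangul, not preserved) | 8 | 1.9%
Other values (115) | 224 | 54.5%

Decimal Number
Value | Count | Frequency (%)
5 | 1 | 33.3%
6 | 1 | 33.3%
3 | 1 | 33.3%

Close Punctuation
Value | Count | Frequency (%)
) | 25 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 24 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 21 | 100.0%

Other Symbol
Value | Count | Frequency (%)
㈜ | 1 | 100.0%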

Most occurring scripts

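Value | Count | Frequency (%)
Hangul | 412 | 84.9%
Common | 73 | 15.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(Hangul, not preserved) | 34 | 8.3%
(Hangul, not preserved) | 27 | 6.6%
(Hangul, not preserved) | 26 | 6.3%
(Hangul, not preserved) | 22 | 5.3%
(Hangul, not preserved) | 20 | 4.9%
(Hangul, not preserved) | 20 | 4.9%
(Hangul, not preserved) | 11 | 2.7%
(Hangul, not preserved) | 10 | 2.4%
(Hangul, not preserved) | 9 | 2.2%
(Hangul, not preserved) | 8 | 1.9%
Other values (116) | 225 | 54.6%

Common
Value | Count | Frequency (%)
) | 25 | 34.2%
( | 24 | 32.9%
(space) | 21 | 28.8%
5 | 1 | 1.4%
6 | 1 | 1.4%
3 | 1 | 1.4%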

Most occurring blocks

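Value | Count | Frequency (%)
Hangul | 411 | 84.7%
ASCII | 73 | 15.1%
None | 1 | 0.2%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(Hangul, not preserved) | 34 | 8.3%
(Hangul, not preserved) | 27 | 6.6%
(Hangul, not preserved) | 26 | 6.3%
(Hangul, not preserved) | 22 | 5.4%
(Hangul, not preserved) | 20 | 4.9%
(Hangul, not preserved) | 20 | 4.9%
(Hangul, not preserved) | 11 | 2.7%
(Hangul, not preserved) | 10 | 2.4%
(Hangul, not preserved) | 9 | 2.2%
(Hangul, not preserved) | 8 | 1.9%
Other values (115) | 224 | 54.5%

ASCII
Value | Count | Frequency (%)
) | 25 | 34.2%
( | 24 | 32.9%
(space) | 21 | 28.8%
5 | 1 | 1.4%
6 | 1 | 1.4%
3 | 1 | 1.4%

None
Value | Count | Frequency (%)
㈜ | 1 | 100.0%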

주소 (address)
Text

Distinct: 52
Distinct (%): 82.5%
Missing: 0
Missing (%): 0.0%
Memory size: 632.0 B

Length

Max length: 23
Median length: 21
Mean length: 17.698413
Min length: 14

Characters and Unicode

Total characters: 1115
Distinct characters: 74
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 43
Unique (%): 68.3%

Sample

1st row: 경상남도 거제시 일운면 거제대로 2799
2nd row: 경상남도 거제시 중곡2로 89-1
3rd row: 경상남도 거제시 서문로 48
4th row: 경상남도 거제시 연초면 대금산로 20
5th row: 경상남도 거제시 능포로 241

Value | Count | Frequency (%)
경상남도 | 63 | 23.9%
거제시 | 63 | 23.9%
능포로 | 5 | 1.9%
거제대로 | 5 | 1.9%
일운면 | 5 | 1.9%
피솔길 | 4 | 1.5%
104 | 4 | 1.5%
거제중앙로 | 4 | 1.5%
장평1로 | 3 | 1.1%
연초면 | 3 | 1.1%
Other values (86) | 105 | 39.8%
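
The token counts above look like the result of splitting each address on whitespace: 63 of the 264 listed tokens is 23.9%. That is an inference from the numbers, not something the report states. A minimal sketch of that kind of count, using only the five sample addresses from this section:

```python
from collections import Counter

# The five sample rows shown for 주소.
addresses = [
    "경상남도 거제시 일운면 거제대로 2799",
    "경상남도 거제시 중곡2로 89-1",
    "경상남도 거제시 서문로 48",
    "경상남도 거제시 연초면 대금산로 20",
    "경상남도 거제시 능포로 241",
]

# Split on whitespace (an assumption) and count every token.
tokens = [token for address in addresses for token in address.split()]
token_counts = Counter(tokens)
total = len(tokens)

for token, count in token_counts.most_common(3):
    print(token, count, f"{100 * count / total:.1f}%")
```

Because every address begins with the same province and city, those two tokens dominate any such count, as they do in the full table.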

Most occurring characters

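Value | Count | Frequency (%)
(space) | 201 | 18.0%
(Hangul, not preserved) | 75 | 6.7%
(Hangul, not preserved) | 75 | 6.7%
(Hangul, not preserved) | 64 | 5.7%
(Hangul, not preserved) | 64 | 5.7%
(Hangul, not preserved) | 63 | 5.7%
(Hangul, not preserved) | 63 | 5.7%
(Hangul, not preserved) | 63 | 5.7%
(Hangul, not preserved) | 55 | 4.9%
1 | 44 | 3.9%
Other values (64) | 348 | 31.2%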

Most occurring categories

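Value | Count | Frequency (%)
Other Letter | 719 | 64.5%
Space Separator | 201 | 18.0%
Decimal Number | 185 | 16.6%
Dash Punctuation | 10 | 0.9%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(Hangul, not preserved) | 75 | 10.4%
(Hangul, not preserved) | 75 | 10.4%
(Hangul, not preserved) | 64 | 8.9%
(Hangul, not preserved) | 64 | 8.9%
(Hangul, not preserved) | 63 | 8.8%
(Hangul, not preserved) | 63 | 8.8%
(Hangul, not preserved) | 63 | 8.8%
(Hangul, not preserved) | 55 | 7.6%
(Hangul, not preserved) | 18 | 2.5%
(Hangul, not preserved) | 16 | 2.2%
Other values (52) | 163 | 22.7%

Decimal Number
Value | Count | Frequency (%)
1 | 44 | 23.8%
2 | 29 | 15.7%
3 | 19 | 10.3%
7 | 17 | 9.2%
6 | 16 | 8.6%
8 | 14 | 7.6%
9 | 13 | 7.0%
4 | 13 | 7.0%
5 | 12 | 6.5%
0 | 8 | 4.3%

Space Separator
Value | Count | Frequency (%)
(space) | 201 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 10 | 100.0%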

Most occurring scripts

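Value | Count | Frequency (%)
Hangul | 719 | 64.5%
Common | 396 | 35.5%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(Hangul, not preserved) | 75 | 10.4%
(Hangul, not preserved) | 75 | 10.4%
(Hangul, not preserved) | 64 | 8.9%
(Hangul, not preserved) | 64 | 8.9%
(Hangul, not preserved) | 63 | 8.8%
(Hangul, not preserved) | 63 | 8.8%
(Hangul, not preserved) | 63 | 8.8%
(Hangul, not preserved) | 55 | 7.6%
(Hangul, not preserved) | 18 | 2.5%
(Hangul, not preserved) | 16 | 2.2%
Other values (52) | 163 | 22.7%

Common
Value | Count | Frequency (%)
(space) | 201 | 50.8%
1 | 44 | 11.1%
2 | 29 | 7.3%
3 | 19 | 4.8%
7 | 17 | 4.3%
6 | 16 | 4.0%
8 | 14 | 3.5%
9 | 13 | 3.3%
4 | 13 | 3.3%
5 | 12 | 3.0%
Other values (2) | 18 | 4.5%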

Most occurring blocks

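Value | Count | Frequency (%)
Hangul | 719 | 64.5%
ASCII | 396 | 35.5%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 201 | 50.8%
1 | 44 | 11.1%
2 | 29 | 7.3%
3 | 19 | 4.8%
7 | 17 | 4.3%
6 | 16 | 4.0%
8 | 14 | 3.5%
9 | 13 | 3.3%
4 | 13 | 3.3%
5 | 12 | 3.0%
Other values (2) | 18 | 4.5%

Hangul
Value | Count | Frequency (%)
(Hangul, not preserved) | 75 | 10.4%
(Hangul, not preserved) | 75 | 10.4%
(Hangul, not preserved) | 64 | 8.9%
(Hangul, not preserved) | 64 | 8.9%
(Hangul, not preserved) | 63 | 8.8%
(Hangul, not preserved) | 63 | 8.8%
(Hangul, not preserved) | 63 | 8.8%
(Hangul, not preserved) | 55 | 7.6%
(Hangul, not preserved) | 18 | 2.5%
(Hangul, not preserved) | 16 | 2.2%
Other values (52) | 163 | 22.7%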

위도 (latitude)
Real number (ℝ)

Distinct: 52
Distinct (%): 82.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 34.886008
Minimum: 34.739754
Maximum: 34.979769
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 695.0 B

Quantile statistics

Minimum: 34.739754
5-th percentile: 34.841366
Q1: 34.8804
Median: 34.889956
Q3: 34.894302
95-th percentile: 34.929635
Maximum: 34.979769
Range: 0.240015
Interquartile range (IQR): 0.013902
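
Range and IQR are simple differences of the quantiles listed above. A minimal sketch using the reported 위도 values; the variable names are illustrative:

```python
# Quantiles copied from the 위도 (latitude) quantile statistics.
minimum = 34.739754
q1 = 34.8804
q3 = 34.894302
maximum = 34.979769

# Range spans the extremes; the IQR spans the middle 50% of the values.
value_range = maximum - minimum
iqr = q3 - q1

print(f"range={value_range:.6f} iqr={iqr:.6f}")
```

The IQR is small relative to the range, which fits the high kurtosis reported for this variable: most locations cluster in a narrow latitude band with a few outliers.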

Descriptive statistics

Standard deviation: 0.029888006
Coefficient of variation (CV): 0.00085673332
Kurtosis: 10.054544
Mean: 34.886008
Median Absolute Deviation (MAD): 0.00728
Skewness: -1.6508233
Sum: 2197.8185
Variance: 0.00089329288
Monotonicity: Not monotonic
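
Several of these descriptive statistics are derived from one another. The sketch below recomputes three of them from the reported mean and standard deviation, assuming the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, sum = mean × number of observations). The inputs are rounded values from the report, so the results agree only to the printed precision.

```python
# Values copied from the 위도 (latitude) statistics.
n_observations = 63
mean_value = 34.886008
std_dev = 0.029888006

coefficient_of_variation = std_dev / mean_value
variance = std_dev ** 2
total = mean_value * n_observations

print(f"CV={coefficient_of_variation:.8g}")
print(f"variance={variance:.8g}")
print(f"sum={total:.4f}")
```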
[Figure: Histogram with fixed size bins (bins=50)]

Common values
Value | Count | Frequency (%)
34.902124 | 4 | 6.3%
34.850966 | 2 | 3.2%
34.89318 | 2 | 3.2%
34.894458 | 2 | 3.2%
34.931372 | 2 | 3.2%
34.882676 | 2 | 3.2%
34.89356 | 2 | 3.2%
34.892829 | 2 | 3.2%
34.888899 | 2 | 3.2%
34.894671 | 1 | 1.6%
Other values (42) | 42 | 66.7%

Minimum 10 values
Value | Count | Frequency (%)
34.739754 | 1 | 1.6%
34.813527 | 1 | 1.6%
34.835882 | 1 | 1.6%
34.840299 | 1 | 1.6%
34.850966 | 2 | 3.2%
34.864549 | 1 | 1.6%
34.864592 | 1 | 1.6%
34.865323 | 1 | 1.6%
34.870178 | 1 | 1.6%
34.874036 | 1 | 1.6%

Maximum 10 values
Value | Count | Frequency (%)
34.979769 | 1 | 1.6%
34.935668 | 1 | 1.6%
34.931372 | 2 | 3.2%
34.914 | 1 | 1.6%
34.913782 | 1 | 1.6%
34.903206 | 1 | 1.6%
34.902124 | 4 | 6.3%
34.898125 | 1 | 1.6%
34.896316 | 1 | 1.6%
34.894671 | 1 | 1.6%

경도 (longitude)
Real number (ℝ)

Distinct: 52
Distinct (%): 82.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 128.65566
Minimum: 128.5245
Maximum: 128.74121
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 695.0 B

Quantile statistics

Minimum: 128.5245
5-th percentile: 128.58414
Q1: 128.62134
Median: 128.65843
Q3: 128.69452
95-th percentile: 128.736
Maximum: 128.74121
Range: 0.216713
Interquartile range (IQR): 0.073174

Descriptive statistics

Standard deviation: 0.051166603
Coefficient of variation (CV): 0.0003977019
Kurtosis: -0.35415867
Mean: 128.65566
Median Absolute Deviation (MAD): 0.036894
Skewness: -0.2808528
Sum: 8105.3068
Variance: 0.0026180212
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=50)]

Common values
Value | Count | Frequency (%)
128.584142 | 4 | 6.3%
128.709546 | 2 | 3.2%
128.609396 | 2 | 3.2%
128.614111 | 2 | 3.2%
128.524496 | 2 | 3.2%
128.736521 | 2 | 3.2%
128.694363 | 2 | 3.2%
128.694516 | 2 | 3.2%
128.621342 | 2 | 3.2%
128.630996 | 1 | 1.6%
Other values (42) | 42 | 66.7%

Minimum 10 values
Value | Count | Frequency (%)
128.524496 | 2 | 3.2%
128.584142 | 4 | 6.3%
128.606017 | 1 | 1.6%
128.609396 | 2 | 3.2%
128.609639 | 1 | 1.6%
128.614111 | 2 | 3.2%
128.615911 | 1 | 1.6%
128.617059 | 1 | 1.6%
128.619978 | 1 | 1.6%
128.621342 | 2 | 3.2%

Maximum 10 values
Value | Count | Frequency (%)
128.741209 | 1 | 1.6%
128.736925 | 1 | 1.6%
128.736521 | 2 | 3.2%
128.731275 | 1 | 1.6%
128.731086 | 1 | 1.6%
128.73055 | 1 | 1.6%
128.709546 | 2 | 3.2%
128.705033 | 1 | 1.6%
128.701462 | 1 | 1.6%
128.699383 | 1 | 1.6%

전화번호 (phone number)
Text

MISSING

Distinct: 35
Distinct (%): 89.7%
Missing: 24
Missing (%): 38.1%
Memory size: 632.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 468
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
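
The character totals for 전화번호 are consistent with every non-missing value having the same 12-character layout. The sketch below checks that arithmetic. It assumes two dashes per value, as in the sample rows (`NNN-NNN-NNNN`), and that Distinct (%) is computed over non-missing values; both are inferences from the figures, not statements in the report.

```python
# Figures copied from the dataset and 전화번호 statistics.
n_observations = 63
n_missing = 24
n_distinct = 35
value_length = 12  # min, median, mean and max length are all 12

n_present = n_observations - n_missing

# Character totals implied by a fixed NNN-NNN-NNNN layout (assumed).
total_characters = n_present * value_length
dashes = n_present * 2
digits = total_characters - dashes

# Distinct share computed over the non-missing values only.
distinct_pct = 100 * n_distinct / n_present

print(n_present, total_characters, dashes, digits, f"{distinct_pct:.1f}%")
```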

Unique

Unique: 31
Unique (%): 79.5%

Sample

1st row: 055-633-7676
2nd row: 055-636-7909
3rd row: 055-681-2112
4th row: 055-632-6377
5th row: 055-632-3531

Value | Count | Frequency (%)
055-638-1833 | 2 | 5.1%
055-633-7676 | 2 | 5.1%
055-634-2267 | 2 | 5.1%
055-632-7446 | 2 | 5.1%
055-688-3265 | 1 | 2.6%
055-637-7997 | 1 | 2.6%
055-634-0055 | 1 | 2.6%
055-637-1734 | 1 | 2.6%
055-687-5707 | 1 | 2.6%
055-688-5441 | 1 | 2.6%
Other values (25) | 25 | 64.1%
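
Since every listed value follows the same three-part layout, a format check is one way to validate this column before use. The sketch below is a hypothetical validator, not part of the report. The first two candidates come from the sample rows; `None` and the malformed string are made-up inputs for illustration.

```python
import re
from typing import Optional

# Three digits, dash, three digits, dash, four digits: 12 characters in total.
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")


def is_valid_phone(value: Optional[str]) -> bool:
    """Return True when `value` is present and matches NNN-NNN-NNNN exactly."""
    return value is not None and PHONE_PATTERN.fullmatch(value) is not None


# Two values from the report, one missing value, one hypothetical malformed value.
candidates = ["055-633-7676", "055-636-7909", None, "055-6337676"]
results = [is_valid_phone(candidate) for candidate in candidates]
print(results)
```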

Most occurring characters

Value | Count | Frequency (%)
5 | 89 | 19.0%
- | 78 | 16.7%
6 | 58 | 12.4%
0 | 57 | 12.2%
3 | 42 | 9.0%
8 | 35 | 7.5%
7 | 31 | 6.6%
2 | 24 | 5.1%
1 | 22 | 4.7%
4 | 16 | 3.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 390 | 83.3%
Dash Punctuation | 78 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 89 | 22.8%
6 | 58 | 14.9%
0 | 57 | 14.6%
3 | 42 | 10.8%
8 | 35 | 9.0%
7 | 31 | 7.9%
2 | 24 | 6.2%
1 | 22 | 5.6%
4 | 16 | 4.1%
9 | 16 | 4.1%

Dash Punctuation
Value | Count | Frequency (%)
- | 78 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 468 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
5 | 89 | 19.0%
- | 78 | 16.7%
6 | 58 | 12.4%
0 | 57 | 12.2%
3 | 42 | 9.0%
8 | 35 | 7.5%
7 | 31 | 6.6%
2 | 24 | 5.1%
1 | 22 | 4.7%
4 | 16 | 3.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 468 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
5 | 89 | 19.0%
- | 78 | 16.7%
6 | 58 | 12.4%
0 | 57 | 12.2%
3 | 42 | 9.0%
8 | 35 | 7.5%
7 | 31 | 6.6%
2 | 24 | 5.1%
1 | 22 | 4.7%
4 | 16 | 3.4%

등록일자 (registration date)
DateTime

Distinct: 56
Distinct (%): 88.9%
Missing: 0
Missing (%): 0.0%
Memory size: 632.0 B
Minimum: 1995-01-07 00:00:00
Maximum: 2023-12-05 00:00:00

[Figure: Histogram with fixed size bins (bins=50)]
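
The minimum and maximum above bound the period covered by the registrations. A minimal sketch of the span between them, using the two dates from this section:

```python
from datetime import date

# Earliest and latest registration dates reported for this variable.
earliest = date.fromisoformat("1995-01-07")
latest = date.fromisoformat("2023-12-05")

# Length of the covered period in days and, approximately, in years.
span = latest - earliest
approx_years = span.days / 365.25

print(span.days, "days, about", round(approx_years, 1), "years")
```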

기준일자 (reference date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 632.0 B
2024-02-07: 63

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024-02-07
2nd row: 2024-02-07
3rd row: 2024-02-07
4th row: 2024-02-07
5th row: 2024-02-07

Common Values

Value | Count | Frequency (%)
2024-02-07 | 63 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; it repeats the single count in the Common Values table]

Interactions

[Figures: four interaction plots; images not preserved]

Correlations

[Figure: correlation heatmap]

 | 업종 | 상호 | 주소 | 위도 | 경도 | 전화번호 | 등록일자
업종 | 1.000 | 0.000 | 0.000 | 0.156 | 0.000 | 0.000 | 0.000
상호 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.999 | 0.997
주소 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.993
위도 | 0.156 | 1.000 | 1.000 | 1.000 | 0.876 | 1.000 | 0.995
경도 | 0.000 | 1.000 | 1.000 | 0.876 | 1.000 | 1.000 | 0.966
전화번호 | 0.000 | 0.999 | 1.000 | 1.000 | 1.000 | 1.000 | 0.994
등록일자 | 0.000 | 0.997 | 0.993 | 0.995 | 0.966 | 0.994 | 1.000

[Figure: correlation heatmap]

 | 위도 | 경도 | 업종
위도 | 1.000 | -0.500 | 0.085
경도 | -0.500 | 1.000 | 0.000
업종 | 0.085 | 0.000 | 1.000
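
The report does not say which coefficient each matrix uses. As a point of reference only, the sketch below computes a Pearson correlation between 위도 and 경도 for the ten rows shown at the top of the Sample section. Pearson is an assumption here, and ten rows will not reproduce the full-dataset figures; the `pearson` helper is illustrative.

```python
import math
from typing import Sequence


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return covariance / (spread_x * spread_y)


# 위도 and 경도 of sample rows 0-9.
latitudes = [34.850966, 34.896316, 34.888899, 34.935668, 34.884198,
             34.739754, 34.879623, 34.89318, 34.882676, 34.89356]
longitudes = [128.709546, 128.635884, 128.621342, 128.658429, 128.736925,
              128.663411, 128.623564, 128.609396, 128.736521, 128.694363]

r = pearson(latitudes, longitudes)
print(f"Pearson r on the ten sample rows: {r:.3f}")
```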

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

First rows

Index | 업종 (business type) | 상호 (business name) | 주소 (address) | 위도 (latitude) | 경도 (longitude) | 전화번호 (phone number) | 등록일자 (registration date) | 기준일자 (reference date)
0 | 국내여행업 | (주)대웅관광 | 경상남도 거제시 일운면 거제대로 2799 | 34.850966 | 128.709546 | <NA> | 2002-01-15 | 2024-02-07
1 | 국내여행업 | (주)바다로 | 경상남도 거제시 중곡2로 89-1 | 34.896316 | 128.635884 | <NA> | 2002-05-27 | 2024-02-07
2 | 국내여행업 | 여행백화점 거제도투어 | 경상남도 거제시 서문로 48 | 34.888899 | 128.621342 | 055-633-7676 | 2004-02-26 | 2024-02-07
3 | 국내여행업 | 거제에코투어 | 경상남도 거제시 연초면 대금산로 20 | 34.935668 | 128.658429 | 055-636-7909 | 2005-01-28 | 2024-02-07
4 | 국내여행업 | 거제시티투어(주) | 경상남도 거제시 능포로 241 | 34.884198 | 128.736925 | 055-681-2112 | 2005-07-05 | 2024-02-07
5 | 국내여행업 | (주)바람의언덕 | 경상남도 거제시 남부면 해금강로 132 | 34.739754 | 128.663411 | 055-632-6377 | 2007-04-16 | 2024-02-07
6 | 국내여행업 | (주)상상속의여행 | 경상남도 거제시 동문천로 6 | 34.879623 | 128.623564 | 055-632-3531 | 2013-10-18 | 2024-02-07
7 | 국내여행업 | (주)동부여행사 | 경상남도 거제시 장평로8길 32 | 34.89318 | 128.609396 | 055-634-2267 | 2014-11-19 | 2024-02-07
8 | 국내여행업 | 솔미여행 마운틴 | 경상남도 거제시 능포로 225 | 34.882676 | 128.736521 | <NA> | 2014-12-10 | 2024-02-07
9 | 국내여행업 | 대원투어 | 경상남도 거제시 옥포대첩로 62 | 34.89356 | 128.694363 | 055-632-7446 | 2015-05-26 | 2024-02-07

Last rows

Index | 업종 (business type) | 상호 (business name) | 주소 (address) | 위도 (latitude) | 경도 (longitude) | 전화번호 (phone number) | 등록일자 (registration date) | 기준일자 (reference date)
53 | 종합여행업 | 주식회사 홈포레스트 | 경상남도 거제시 국산로 24-25 | 34.903206 | 128.683369 | 055-687-0998 | 2019-05-17 | 2024-02-07
54 | 종합여행업 | 유)대금투어 | 경상남도 거제시 연초면 죽토로 31 | 34.914 | 128.658824 | 055-688-1231 | 2019-06-18 | 2024-02-07
55 | 종합여행업 | 주식회사 무학항공여행사 | 경상남도 거제시 장평3로 75 | 34.894146 | 128.606017 | 055-368-3700 | 2020-03-03 | 2024-02-07
56 | 종합여행업 | 주식회사 포우 | 경상남도 거제시 일운면 거제대로 2631 | 34.840299 | 128.699383 | 055-681-7080 | 2017-06-09 | 2024-02-07
57 | 종합여행업 | 이던하우스 | 경상남도 거제시 옥포성안로 71 | 34.892829 | 128.694516 | 055-687-5707 | 2020-10-26 | 2024-02-07
58 | 종합여행업 | ㈜위드투어 앤 골프 | 경상남도 거제시 서간도길 9-9 | 34.88749 | 128.691732 | <NA> | 2023-04-14 | 2024-02-07
59 | 종합여행업 | 프리한 골프투어 | 경상남도 거제시 옥포성안로 71 | 34.892829 | 128.694516 | 055-688-5707 | 2023-05-11 | 2024-02-07
60 | 종합여행업 | 떠나자 투어 | 경상남도 거제시 고현로 138-37 | 34.890639 | 128.630284 | <NA> | 2023-06-23 | 2024-02-07
61 | 종합여행업 | 정운투어 | 경상남도 거제시 피솔길 104 | 34.902124 | 128.584142 | <NA> | 2023-08-22 | 2024-02-07
62 | 종합여행업 | 메종드투어 | 경상남도 거제시 능포로 114 | 34.874036 | 128.731275 | <NA> | 2023-11-03 | 2024-02-07