Overview

Dataset statistics

Number of variables: 7
Number of observations: 33
Missing cells: 2
Missing cells (%): 0.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.0 KiB
Average record size in memory: 61.9 B
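
The missing-cell percentage can be checked by hand from the counts in this section. The sketch below assumes the share is simply missing cells divided by rows times columns; the function name is illustrative and not part of ydata-profiling.

```python
def missing_cell_share(missing_cells: int, n_rows: int, n_columns: int) -> float:
    """Fraction of all cells that are missing (hypothetical helper)."""
    total_cells = n_rows * n_columns
    return missing_cells / total_cells


# Counts taken from the dataset statistics: 2 missing cells, 33 rows, 7 columns.
share = missing_cell_share(missing_cells=2, n_rows=33, n_columns=7)
print(f"{share:.1%}")
```

With 2 missing cells out of 231, the share rounds to the 0.9% reported here.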

Variable types

Numeric: 2
Categorical: 2
Text: 3

Dataset

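Description: Data on travel businesses in Saha-gu, Busan Metropolitan City. It provides the business type, business name, postal code, address, telephone number, and related information.
Author: Saha-gu, Busan Metropolitan City (부산광역시 사하구)
URL: https://www.data.go.kr/data/3045765/fileData.do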

Alerts

데이터기준일자 (data reference date) has constant value "2024-01-01" [Constant]
연번 (serial number) is highly overall correlated with 업종 (business type) [High correlation]
업종 is highly overall correlated with 연번 [High correlation]
전화번호 (telephone number) has 2 (6.1%) missing values [Missing]
연번 has unique values [Unique]

Reproduction

Analysis started: 2024-03-15 01:16:08.178702
Analysis finished: 2024-03-15 01:16:10.155621
Duration: 1.98 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
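
The reported duration follows from the two timestamps above. A minimal check, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction section.
started = datetime.strptime("2024-03-15 01:16:08.178702", TIMESTAMP_FORMAT)
finished = datetime.strptime("2024-03-15 01:16:10.155621", TIMESTAMP_FORMAT)

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```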

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 33
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17
Minimum: 1
Maximum: 33
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 425.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.6
Q1: 9
Median: 17
Q3: 25
95-th percentile: 31.4
Maximum: 33
Range: 32
Interquartile range (IQR): 16

Descriptive statistics

Standard deviation: 9.6695398
Coefficient of variation (CV): 0.56879646
Kurtosis: -1.2
Mean: 17
Median Absolute Deviation (MAD): 8
Skewness: 0
Sum: 561
Variance: 93.5
Monotonicity: Strictly increasing
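
Because 연번 is simply the integers 1 to 33, most of the figures above can be reproduced with the standard library. This is a sketch under two assumptions: the variance is the sample variance (divisor n - 1), and percentiles use linear interpolation between neighbouring ranks. The `percentile` helper is illustrative and not a ydata-profiling function.

```python
import statistics


def percentile(sorted_values, q):
    """Linearly interpolated percentile for q in [0, 1] (hypothetical helper)."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


values = list(range(1, 34))  # 연번 runs from 1 to 33 without gaps

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, divisor n - 1
std = statistics.stdev(values)
cv = std / mean
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])
p05 = percentile(values, 0.05)
p95 = percentile(values, 0.95)
iqr = percentile(values, 0.75) - percentile(values, 0.25)

print(mean, variance, round(std, 7), round(cv, 8), mad, round(p05, 1), round(p95, 1), iqr)
```

Under these assumptions the results agree with the reported mean, variance, standard deviation, CV, MAD, percentiles, and IQR.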
Histogram with fixed size bins (bins=33)
Common values
Value | Count | Frequency (%)
1 | 1 | 3.0%
26 | 1 | 3.0%
20 | 1 | 3.0%
21 | 1 | 3.0%
22 | 1 | 3.0%
23 | 1 | 3.0%
24 | 1 | 3.0%
25 | 1 | 3.0%
27 | 1 | 3.0%
2 | 1 | 3.0%
Other values (23) | 23 | 69.7%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 3.0%
2 | 1 | 3.0%
3 | 1 | 3.0%
4 | 1 | 3.0%
5 | 1 | 3.0%
6 | 1 | 3.0%
7 | 1 | 3.0%
8 | 1 | 3.0%
9 | 1 | 3.0%
10 | 1 | 3.0%

Maximum 10 values
Value | Count | Frequency (%)
33 | 1 | 3.0%
32 | 1 | 3.0%
31 | 1 | 3.0%
30 | 1 | 3.0%
29 | 1 | 3.0%
28 | 1 | 3.0%
27 | 1 | 3.0%
26 | 1 | 3.0%
25 | 1 | 3.0%
24 | 1 | 3.0%

업종
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 9.1%
Missing: 0
Missing (%): 0.0%
Memory size: 392.0 B

Length

Max length: 6
Median length: 6
Mean length: 5.6060606
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국내여행업
2nd row: 국내여행업
3rd row: 국내여행업
4th row: 국내여행업
5th row: 국내여행업

Common Values

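Value | Count | Frequency (%)
국내외여행업 (domestic and overseas travel business) | 20 | 60.6%
종합여행업 (general travel business) | 7 | 21.2%
국내여행업 (domestic travel business) | 6 | 18.2%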
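
The frequency column and the mean category length follow directly from the three counts. A small sketch using `collections.Counter`, with the counts copied from the Common Values table:

```python
from collections import Counter

# Category counts copied from the Common Values table for 업종.
counts = Counter({"국내외여행업": 20, "종합여행업": 7, "국내여행업": 6})

total = sum(counts.values())
shares = {category: count / total for category, count in counts.items()}

# Mean string length, weighting each category by how often it occurs.
mean_length = sum(len(category) * count for category, count in counts.items()) / total

for category, count in counts.most_common():
    print(f"{category}\t{count}\t{shares[category]:.1%}")
print(f"Mean length: {mean_length:.7f}")
```

The weighted mean works out to 185 / 33, which matches the mean length of 5.6060606 reported for this variable.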

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
국내외여행업 | 20 | 60.6%
종합여행업 | 7 | 21.2%
국내여행업 | 6 | 18.2%

상호
Text

Distinct: 31
Distinct (%): 93.9%
Missing: 0
Missing (%): 0.0%
Memory size: 392.0 B

Length

Max length: 15
Median length: 12
Mean length: 8.4848485
Min length: 4

Characters and Unicode

Total characters: 280
Distinct characters: 86
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 29
Unique (%): 87.9%

Sample

1st row: 뉴대림고속관광 주식회사
2nd row: 대림카워시 주식회사
3rd row: 하나고속관광(주)
4th row: 테마부산고속관광투어
5th row: (협)하나고속 여행사
Value | Count | Frequency (%)
주식회사 | 5 | 10.6%
여행사 | 4 | 8.5%
뉴대림고속관광 | 2 | 4.3%
협)하나고속 | 2 | 4.3%
(value missing in source) | 2 | 4.3%
하나여행사 | 2 | 4.3%
tour | 1 | 2.1%
고고랜드 | 1 | 2.1%
펀텍플러스 | 1 | 2.1%
모두 | 1 | 2.1%
Other values (26) | 26 | 55.3%

Most occurring characters

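Value | Count | Frequency (%)
(Hangul, not recovered) | 23 | 8.2%
( | 21 | 7.5%
) | 21 | 7.5%
(space) | 14 | 5.0%
(Hangul, not recovered) | 12 | 4.3%
(Hangul, not recovered) | 12 | 4.3%
(Hangul, not recovered) | 11 | 3.9%
(Hangul, not recovered) | 11 | 3.9%
(Hangul, not recovered) | 11 | 3.9%
(Hangul, not recovered) | 8 | 2.9%
Other values (76) | 136 | 48.6%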

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 209 | 74.6%
Open Punctuation | 21 | 7.5%
Close Punctuation | 21 | 7.5%
Space Separator | 14 | 5.0%
Uppercase Letter | 9 | 3.2%
Lowercase Letter | 6 | 2.1%
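
The category names in this table correspond to Unicode general categories, which Python exposes through `unicodedata.category`. The sketch below classifies one business name from the sample rows; the mapping from two-letter codes to the names used here (Lo = Other Letter, Lu = Uppercase Letter, Ll = Lowercase Letter, Ps = Open Punctuation, Pe = Close Punctuation, Zs = Space Separator) is standard Unicode, while the helper name is illustrative.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters per Unicode general category (hypothetical helper)."""
    return Counter(unicodedata.category(ch) for ch in text)


# One 상호 value from the sample rows of this report.
counts = category_counts("아톰투어(Atom Tour)")
for code, count in counts.most_common():
    print(code, count)
```

Applying the same function to every value in the column and summing the counters would give a table of the same shape as the one above.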

Most frequent character per category

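Other Letter
Value | Count | Frequency (%)
(Hangul, not recovered) | 23 | 11.0%
(Hangul, not recovered) | 12 | 5.7%
(Hangul, not recovered) | 12 | 5.7%
(Hangul, not recovered) | 11 | 5.3%
(Hangul, not recovered) | 11 | 5.3%
(Hangul, not recovered) | 11 | 5.3%
(Hangul, not recovered) | 8 | 3.8%
(Hangul, not recovered) | 7 | 3.3%
(Hangul, not recovered) | 7 | 3.3%
(Hangul, not recovered) | 6 | 2.9%
Other values (61) | 101 | 48.3%

Uppercase Letter
Value | Count | Frequency (%)
T | 2 | 22.2%
O | 2 | 22.2%
I | 1 | 11.1%
A | 1 | 11.1%
H | 1 | 11.1%
L | 1 | 11.1%
Y | 1 | 11.1%

Lowercase Letter
Value | Count | Frequency (%)
o | 2 | 33.3%
r | 1 | 16.7%
t | 1 | 16.7%
m | 1 | 16.7%
u | 1 | 16.7%

Open Punctuation
Value | Count | Frequency (%)
( | 21 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 21 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 14 | 100.0%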

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 209 | 74.6%
Common | 56 | 20.0%
Latin | 15 | 5.4%

Most frequent character per script

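Hangul
Value | Count | Frequency (%)
(Hangul, not recovered) | 23 | 11.0%
(Hangul, not recovered) | 12 | 5.7%
(Hangul, not recovered) | 12 | 5.7%
(Hangul, not recovered) | 11 | 5.3%
(Hangul, not recovered) | 11 | 5.3%
(Hangul, not recovered) | 11 | 5.3%
(Hangul, not recovered) | 8 | 3.8%
(Hangul, not recovered) | 7 | 3.3%
(Hangul, not recovered) | 7 | 3.3%
(Hangul, not recovered) | 6 | 2.9%
Other values (61) | 101 | 48.3%

Latin
Value | Count | Frequency (%)
o | 2 | 13.3%
T | 2 | 13.3%
O | 2 | 13.3%
I | 1 | 6.7%
r | 1 | 6.7%
A | 1 | 6.7%
t | 1 | 6.7%
m | 1 | 6.7%
u | 1 | 6.7%
H | 1 | 6.7%
Other values (2) | 2 | 13.3%

Common
Value | Count | Frequency (%)
( | 21 | 37.5%
) | 21 | 37.5%
(space) | 14 | 25.0%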

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 209 | 74.6%
ASCII | 71 | 25.4%

Most frequent character per block

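Hangul
Value | Count | Frequency (%)
(Hangul, not recovered) | 23 | 11.0%
(Hangul, not recovered) | 12 | 5.7%
(Hangul, not recovered) | 12 | 5.7%
(Hangul, not recovered) | 11 | 5.3%
(Hangul, not recovered) | 11 | 5.3%
(Hangul, not recovered) | 11 | 5.3%
(Hangul, not recovered) | 8 | 3.8%
(Hangul, not recovered) | 7 | 3.3%
(Hangul, not recovered) | 7 | 3.3%
(Hangul, not recovered) | 6 | 2.9%
Other values (61) | 101 | 48.3%

ASCII
Value | Count | Frequency (%)
( | 21 | 29.6%
) | 21 | 29.6%
(space) | 14 | 19.7%
o | 2 | 2.8%
T | 2 | 2.8%
O | 2 | 2.8%
I | 1 | 1.4%
r | 1 | 1.4%
A | 1 | 1.4%
t | 1 | 1.4%
Other values (5) | 5 | 7.0%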

우편번호
Real number (ℝ)

Distinct: 25
Distinct (%): 75.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 49405.152
Minimum: 49308
Maximum: 49519
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 425.0 B

Quantile statistics

Minimum: 49308
5-th percentile: 49320.4
Q1: 49339
Median: 49415
Q3: 49440
95-th percentile: 49492.2
Maximum: 49519
Range: 211
Interquartile range (IQR): 101

Descriptive statistics

Standard deviation: 58.201225
Coefficient of variation (CV): 0.0011780396
Kurtosis: -0.92659258
Mean: 49405.152
Median Absolute Deviation (MAD): 43
Skewness: -0.12049726
Sum: 1630370
Variance: 3387.3826
Monotonicity: Not monotonic
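
Several of the 우편번호 figures are derived from others in the same section, so they can be cross-checked with plain arithmetic. The sketch below only uses numbers printed in this report; it does not recompute anything from the raw data.

```python
import math

# Figures copied from the 우편번호 statistics in this report.
n_observations = 33
total = 1630370
variance = 3387.3826
minimum, maximum = 49308, 49519
q1, q3 = 49339, 49440

mean = total / n_observations
std = math.sqrt(variance)
cv = std / mean                 # coefficient of variation
value_range = maximum - minimum
iqr = q3 - q1

print(round(mean, 3), round(std, 6), round(cv, 10), value_range, iqr)
```

The very small CV reflects that postal codes are identifiers rather than measurements, so the numeric summary is mainly useful as a consistency check.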
Histogram with fixed size bins (bins=25)
Common values
Value | Count | Frequency (%)
49415 | 3 | 9.1%
49431 | 3 | 9.1%
49338 | 2 | 6.1%
49324 | 2 | 6.1%
49339 | 2 | 6.1%
49464 | 2 | 6.1%
49337 | 1 | 3.0%
49491 | 1 | 3.0%
49428 | 1 | 3.0%
49450 | 1 | 3.0%
Other values (15) | 15 | 45.5%

Minimum 10 values
Value | Count | Frequency (%)
49308 | 1 | 3.0%
49315 | 1 | 3.0%
49324 | 2 | 6.1%
49330 | 1 | 3.0%
49337 | 1 | 3.0%
49338 | 2 | 6.1%
49339 | 2 | 6.1%
49398 | 1 | 3.0%
49402 | 1 | 3.0%
49403 | 1 | 3.0%

Maximum 10 values
Value | Count | Frequency (%)
49519 | 1 | 3.0%
49494 | 1 | 3.0%
49491 | 1 | 3.0%
49476 | 1 | 3.0%
49464 | 2 | 6.1%
49458 | 1 | 3.0%
49450 | 1 | 3.0%
49440 | 1 | 3.0%
49431 | 3 | 9.1%
49428 | 1 | 3.0%

소재지(도로명)
Text

Distinct: 30
Distinct (%): 90.9%
Missing: 0
Missing (%): 0.0%
Memory size: 392.0 B

Length

Max length: 44
Median length: 33
Mean length: 30.515152
Min length: 23

Characters and Unicode

Total characters: 1007
Distinct characters: 66
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 28
Unique (%): 84.8%

Sample

1st row: 부산광역시 사하구 하신중앙로 236-1 (하단동)
2nd row: 부산광역시 사하구 하신중앙로 234, 1층 (하단동)
3rd row: 부산광역시 사하구 하신번영로151번길 14, 2층 (신평동)
4th row: 부산광역시 사하구 다대로130번길 166, 3층 (신평동)
5th row: 부산광역시 사하구 하신번영로151번길 14, 2층 (신평동)
Value | Count | Frequency (%)
부산광역시 | 33 | 16.5%
사하구 | 33 | 16.5%
하단동 | 11 | 5.5%
낙동대로 | 9 | 4.5%
2층 | 7 | 3.5%
괴정동 | 7 | 3.5%
하신중앙로 | 6 | 3.0%
장림동 | 5 | 2.5%
14 | 4 | 2.0%
신평동 | 4 | 2.0%
Other values (60) | 81 | 40.5%
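
The word counts for this variable include neighbourhood (dong) names such as 하단동 and 괴정동, which appear in trailing parentheses in the sample addresses. The sketch below pulls that name out with a regular expression. It assumes every address ends with "(dong)" or "(dong, building)", as the sample rows do; the pattern and function name are illustrative and have not been checked against all 33 rows.

```python
import re
from collections import Counter
from typing import Optional

# Assumed layout: the address ends with "(<dong>)" or "(<dong>, <building>)".
TRAILING_PARENS = re.compile(r"\(([^,()]+)(?:,[^()]*)?\)\s*$")


def extract_dong(address: str) -> Optional[str]:
    """Return the dong name from the trailing parentheses, or None if absent."""
    match = TRAILING_PARENS.search(address)
    return match.group(1).strip() if match else None


# The five sample rows shown for this variable.
sample_addresses = [
    "부산광역시 사하구 하신중앙로 236-1 (하단동)",
    "부산광역시 사하구 하신중앙로 234, 1층 (하단동)",
    "부산광역시 사하구 하신번영로151번길 14, 2층 (신평동)",
    "부산광역시 사하구 다대로130번길 166, 3층 (신평동)",
    "부산광역시 사하구 하신번영로151번길 14, 2층 (신평동)",
]
dong_counts = Counter(extract_dong(address) for address in sample_addresses)
print(dong_counts.most_common())
```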

Most occurring characters

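Value | Count | Frequency (%)
(space) | 167 | 16.6%
(Hangul, not recovered) | 55 | 5.5%
(Hangul, not recovered) | 48 | 4.8%
1 | 43 | 4.3%
(Hangul, not recovered) | 33 | 3.3%
) | 33 | 3.3%
( | 33 | 3.3%
(Hangul, not recovered) | 33 | 3.3%
(Hangul, not recovered) | 33 | 3.3%
(Hangul, not recovered) | 33 | 3.3%
Other values (56) | 496 | 49.3%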

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 581 | 57.7%
Space Separator | 167 | 16.6%
Decimal Number | 160 | 15.9%
Close Punctuation | 33 | 3.3%
Open Punctuation | 33 | 3.3%
Other Punctuation | 27 | 2.7%
Dash Punctuation | 6 | 0.6%

Most frequent character per category

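Other Letter
Value | Count | Frequency (%)
(Hangul, not recovered) | 55 | 9.5%
(Hangul, not recovered) | 48 | 8.3%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
Other values (41) | 214 | 36.8%

Decimal Number
Value | Count | Frequency (%)
1 | 43 | 26.9%
2 | 24 | 15.0%
4 | 21 | 13.1%
3 | 20 | 12.5%
5 | 18 | 11.2%
6 | 16 | 10.0%
0 | 12 | 7.5%
7 | 3 | 1.9%
8 | 2 | 1.2%
9 | 1 | 0.6%

Space Separator
Value | Count | Frequency (%)
(space) | 167 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 33 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 33 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 27 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 6 | 100.0%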

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 581 | 57.7%
Common | 426 | 42.3%

Most frequent character per script

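Hangul
Value | Count | Frequency (%)
(Hangul, not recovered) | 55 | 9.5%
(Hangul, not recovered) | 48 | 8.3%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
Other values (41) | 214 | 36.8%

Common
Value | Count | Frequency (%)
(space) | 167 | 39.2%
1 | 43 | 10.1%
) | 33 | 7.7%
( | 33 | 7.7%
, | 27 | 6.3%
2 | 24 | 5.6%
4 | 21 | 4.9%
3 | 20 | 4.7%
5 | 18 | 4.2%
6 | 16 | 3.8%
Other values (5) | 24 | 5.6%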

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 581 | 57.7%
ASCII | 426 | 42.3%

Most frequent character per block

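ASCII
Value | Count | Frequency (%)
(space) | 167 | 39.2%
1 | 43 | 10.1%
) | 33 | 7.7%
( | 33 | 7.7%
, | 27 | 6.3%
2 | 24 | 5.6%
4 | 21 | 4.9%
3 | 20 | 4.7%
5 | 18 | 4.2%
6 | 16 | 3.8%
Other values (5) | 24 | 5.6%

Hangul
Value | Count | Frequency (%)
(Hangul, not recovered) | 55 | 9.5%
(Hangul, not recovered) | 48 | 8.3%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
(Hangul, not recovered) | 33 | 5.7%
Other values (41) | 214 | 36.8%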

전화번호
Text

MISSING 

Distinct: 27
Distinct (%): 87.1%
Missing: 2
Missing (%): 6.1%
Memory size: 392.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.032258
Min length: 12

Characters and Unicode

Total characters: 373
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 25
Unique (%): 80.6%

Sample

1st row: 051-266-5600
2nd row: 051-266-5600
3rd row: 051-207-2211
4th row: 051-291-2379
5th row: 051-207-2211
Value | Count | Frequency (%)
051-266-5600 | 3 | 9.7%
051-207-2211 | 3 | 9.7%
051-266-1000 | 1 | 3.2%
051-711-8226 | 1 | 3.2%
051-294-1379 | 1 | 3.2%
051-462-7258 | 1 | 3.2%
051-902-2626 | 1 | 3.2%
051-747-7669 | 1 | 3.2%
070-5033-3980 | 1 | 3.2%
051-201-0191 | 1 | 3.2%
Other values (17) | 17 | 54.8%
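
The percentages for 전화번호 use two different denominators. The missing percentage is relative to all 33 rows, while the distinct, unique, and per-value percentages are consistent with dividing by the 31 non-missing values. The sketch below reproduces the figures under that reading, which is an inference from the numbers in this report rather than a statement about ydata-profiling internals.

```python
# Counts copied from the 전화번호 section of this report.
n_rows = 33
n_missing = 2
n_distinct = 27
n_unique = 25
top_value_count = 3  # occurrences of 051-266-5600

n_present = n_rows - n_missing

missing_pct = n_missing / n_rows            # relative to all rows
distinct_pct = n_distinct / n_present       # relative to non-missing rows
unique_pct = n_unique / n_present
top_value_pct = top_value_count / n_present

print(f"{missing_pct:.1%} {distinct_pct:.1%} {unique_pct:.1%} {top_value_pct:.1%}")
```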

Most occurring characters

Value | Count | Frequency (%)
0 | 75 | 20.1%
- | 62 | 16.6%
1 | 53 | 14.2%
2 | 47 | 12.6%
5 | 41 | 11.0%
6 | 29 | 7.8%
9 | 20 | 5.4%
7 | 18 | 4.8%
3 | 14 | 3.8%
4 | 10 | 2.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 311 | 83.4%
Dash Punctuation | 62 | 16.6%
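
Since the column contains only digits and hyphens, a simple format check is possible. The pattern below is an assumption derived from the sample values (an area or service prefix starting with 0, then a 3 or 4 digit block, then 4 digits); it is not an official numbering rule and may reject valid numbers elsewhere.

```python
import re

# Assumed shape: 0XX-XXX-XXXX or 0XX-XXXX-XXXX, as seen in the sample values.
PHONE_PATTERN = re.compile(r"0\d{1,2}-\d{3,4}-\d{4}")


def is_valid_phone(value: str) -> bool:
    """True if the whole string matches the assumed phone layout."""
    return PHONE_PATTERN.fullmatch(value) is not None


samples = ["051-266-5600", "051-207-2211", "070-5033-3980"]
results = {value: is_valid_phone(value) for value in samples}
print(results)
```

Values that fail the check, along with the two missing entries, would be natural candidates for manual review.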

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 75 | 24.1%
1 | 53 | 17.0%
2 | 47 | 15.1%
5 | 41 | 13.2%
6 | 29 | 9.3%
9 | 20 | 6.4%
7 | 18 | 5.8%
3 | 14 | 4.5%
4 | 10 | 3.2%
8 | 4 | 1.3%

Dash Punctuation
Value | Count | Frequency (%)
- | 62 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 373 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 75 | 20.1%
- | 62 | 16.6%
1 | 53 | 14.2%
2 | 47 | 12.6%
5 | 41 | 11.0%
6 | 29 | 7.8%
9 | 20 | 5.4%
7 | 18 | 4.8%
3 | 14 | 3.8%
4 | 10 | 2.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 373 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 75 | 20.1%
- | 62 | 16.6%
1 | 53 | 14.2%
2 | 47 | 12.6%
5 | 41 | 11.0%
6 | 29 | 7.8%
9 | 20 | 5.4%
7 | 18 | 4.8%
3 | 14 | 3.8%
4 | 10 | 2.7%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 392.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024-01-01
2nd row: 2024-01-01
3rd row: 2024-01-01
4th row: 2024-01-01
5th row: 2024-01-01

Common Values

Value | Count | Frequency (%)
2024-01-01 | 33 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2024-01-01 | 33 | 100.0%

Interactions


Correlations

Variable | 연번 | 업종 | 상호 | 우편번호 | 소재지(도로명) | 전화번호
연번 | 1.000 | 0.954 | 0.869 | 0.000 | 0.861 | 0.891
업종 | 0.954 | 1.000 | 0.000 | 0.454 | 0.000 | 0.523
상호 | 0.869 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000
우편번호 | 0.000 | 0.454 | 1.000 | 1.000 | 1.000 | 1.000
소재지(도로명) | 0.861 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 0.891 | 0.523 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 연번 | 우편번호 | 업종
연번 | 1.000 | -0.035 | 0.830
우편번호 | -0.035 | 1.000 | 0.309
업종 | 0.830 | 0.309 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
(index) | 연번 | 업종 | 상호 | 우편번호 | 소재지(도로명) | 전화번호 | 데이터기준일자
0 | 1 | 국내여행업 | 뉴대림고속관광 주식회사 | 49415 | 부산광역시 사하구 하신중앙로 236-1 (하단동) | 051-266-5600 | 2024-01-01
1 | 2 | 국내여행업 | 대림카워시 주식회사 | 49415 | 부산광역시 사하구 하신중앙로 234, 1층 (하단동) | 051-266-5600 | 2024-01-01
2 | 3 | 국내여행업 | 하나고속관광(주) | 49431 | 부산광역시 사하구 하신번영로151번길 14, 2층 (신평동) | 051-207-2211 | 2024-01-01
3 | 4 | 국내여행업 | 테마부산고속관광투어 | 49440 | 부산광역시 사하구 다대로130번길 166, 3층 (신평동) | 051-291-2379 | 2024-01-01
4 | 5 | 국내여행업 | (협)하나고속 여행사 | 49431 | 부산광역시 사하구 하신번영로151번길 14, 2층 (신평동) | 051-207-2211 | 2024-01-01
5 | 6 | 국내여행업 | YOLO 여행사 | 49476 | 부산광역시 사하구 하신중앙로 14, 2층 (장림동) | 051-263-2536 | 2024-01-01
6 | 7 | 국내외여행업 | (주) HIT 관광여행사 | 49338 | 부산광역시 사하구 낙동대로 168 (괴정동) | 051-292-3399 | 2024-01-01
7 | 8 | 국내외여행업 | 뉴대림고속관광 주식회사 | 49415 | 부산광역시 사하구 하신중앙로 236-1 (하단동) | 051-266-5600 | 2024-01-01
8 | 9 | 국내외여행업 | (주) 하나여행사 | 49338 | 부산광역시 사하구 낙동대로 204 (괴정동) | 051-294-9900 | 2024-01-01
9 | 10 | 국내외여행업 | (주)로얄드림투어 | 49408 | 부산광역시 사하구 낙동대로 457, 5층 (하단동) | 051-206-5771 | 2024-01-01

Last rows
(index) | 연번 | 업종 | 상호 | 우편번호 | 소재지(도로명) | 전화번호 | 데이터기준일자
23 | 24 | 국내외여행업 | 고고랜드 | 49324 | 부산광역시 사하구 낙동대로 542, 지하1층 112호 (하단동, 대우에덴프라자) | <NA> | 2024-01-01
24 | 25 | 국내외여행업 | 펀텍플러스 | 49339 | 부산광역시 사하구 낙동대로 164, 한라빌딩 606호 (괴정동) | 051-202-1034 | 2024-01-01
25 | 26 | 국내외여행업 | 모두 하나로 여행사 | 49407 | 부산광역시 사하구 하신중앙로 324, 2층 2호 (하단동) | 051-201-0191 | 2024-01-01
26 | 27 | 종합여행업 | 아톰투어(Atom Tour) | 49403 | 부산광역시 사하구 괴정로 222, 105호 (괴정동, 정우맨션) | 070-5033-3980 | 2024-01-01
27 | 28 | 종합여행업 | (주)여명여행 | 49519 | 부산광역시 사하구 다대로 435, 4층 (다대동) | 051-747-7669 | 2024-01-01
28 | 29 | 종합여행업 | (주)야가자투어 | 49411 | 부산광역시 사하구 낙동대로 403, 6층 (당리동, 동양빌딩) | 051-902-2626 | 2024-01-01
29 | 30 | 종합여행업 | (주)씨스카이투어 | 49450 | 부산광역시 사하구 감천로142번길 1 (감천동) | 051-462-7258 | 2024-01-01
30 | 31 | 종합여행업 | (주)똑똑여행 | 49428 | 부산광역시 사하구 하신중앙로 307, 3층 1호 (하단동) | 051-294-1379 | 2024-01-01
31 | 32 | 종합여행업 | (주)서브원 | 49491 | 부산광역시 사하구 장평로15번길 3-1 (장림동) | 051-711-8226 | 2024-01-01
32 | 33 | 종합여행업 | (주)골드브릿지 | 49339 | 부산광역시 사하구 낙동대로 164, 604호 (괴정동) | 051-291-0202 | 2024-01-01
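
The sample rows show why 상호 has fewer distinct values than rows even though no row is a full duplicate: the same business can be registered under more than one 업종. The sketch below groups a few rows copied from the sample by business name; the tuple layout is an illustrative simplification of the full record.

```python
from collections import defaultdict

# (연번, 업종, 상호) triples copied from the sample rows above.
sample_rows = [
    (1, "국내여행업", "뉴대림고속관광 주식회사"),
    (3, "국내여행업", "하나고속관광(주)"),
    (8, "국내외여행업", "뉴대림고속관광 주식회사"),
    (9, "국내외여행업", "(주) 하나여행사"),
]

business_types_by_name = defaultdict(set)
for _serial, business_type, name in sample_rows:
    business_types_by_name[name].add(business_type)

# Keep only names that appear under more than one business type.
multi_registered = {
    name: sorted(types)
    for name, types in business_types_by_name.items()
    if len(types) > 1
}
print(multi_registered)
```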