Overview

Dataset statistics

Number of variables6
Number of observations39
Missing cells53
Missing cells (%)22.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory53.4 B

Variable types

Categorical1
Text3
Numeric1
Unsupported1

Dataset

Description부산광역시_강서구_관광여행업정보_20230209
Author부산광역시 강서구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15026342

Alerts

상호 has 1 (2.6%) missing valuesMissing
우편번호 has 1 (2.6%) missing valuesMissing
소재지(도로명) has 1 (2.6%) missing valuesMissing
전화번호 has 11 (28.2%) missing valuesMissing
Unnamed: 5 has 39 (100.0%) missing valuesMissing
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 16:14:12.664671
Analysis finished2023-12-10 16:14:13.538826
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct8
Distinct (%)20.5%
Missing0
Missing (%)0.0%
Memory size444.0 B
국내외여행업
15 
종합여행업
11 
국내여행업
일반야영장업
 
1
자동차야영장업
 
1
Other values (3)

Length

Max length11
Median length7
Mean length5.6410256
Min length4

Unique

Unique5 ?
Unique (%)12.8%

Sample

1st row국내여행업
2nd row국내여행업
3rd row국내여행업
4th row국내여행업
5th row국내여행업

Common Values

ValueCountFrequency (%)
국내외여행업 15
38.5%
종합여행업 11
28.2%
국내여행업 8
20.5%
일반야영장업 1
 
2.6%
자동차야영장업 1
 
2.6%
외국인관광 도시민박업 1
 
2.6%
국제회의기획업 1
 
2.6%
<NA> 1
 
2.6%

Length

2023-12-11T01:14:13.648903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:14:13.799803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내외여행업 15
37.5%
종합여행업 11
27.5%
국내여행업 8
20.0%
일반야영장업 1
 
2.5%
자동차야영장업 1
 
2.5%
외국인관광 1
 
2.5%
도시민박업 1
 
2.5%
국제회의기획업 1
 
2.5%
na 1
 
2.5%

상호
Text

MISSING 

Distinct37
Distinct (%)97.4%
Missing1
Missing (%)2.6%
Memory size444.0 B
2023-12-11T01:14:14.067953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length7.4736842
Min length2

Characters and Unicode

Total characters284
Distinct characters112
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)94.7%

Sample

1st row(주)부산트랩
2nd row댕큐블랙
3rd row삼부자 여행사
4th row(주)유림이엔티
5th row주식회사 응카
ValueCountFrequency (%)
주식회사 6
 
12.2%
주)부산여행사 2
 
4.1%
여행사 2
 
4.1%
주)플랜씨 1
 
2.0%
두리오 1
 
2.0%
스타엘 1
 
2.0%
주)삼천리투어 1
 
2.0%
영한여행사 1
 
2.0%
samsung 1
 
2.0%
tour 1
 
2.0%
Other values (32) 32
65.3%
2023-12-11T01:14:14.506107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
8.5%
( 17
 
6.0%
) 17
 
6.0%
12
 
4.2%
11
 
3.9%
10
 
3.5%
10
 
3.5%
9
 
3.2%
8
 
2.8%
7
 
2.5%
Other values (102) 159
56.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 222
78.2%
Open Punctuation 17
 
6.0%
Close Punctuation 17
 
6.0%
Uppercase Letter 16
 
5.6%
Space Separator 11
 
3.9%
Other Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
10.8%
12
 
5.4%
10
 
4.5%
10
 
4.5%
9
 
4.1%
8
 
3.6%
7
 
3.2%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (87) 124
55.9%
Uppercase Letter
ValueCountFrequency (%)
S 3
18.8%
U 3
18.8%
A 2
12.5%
O 1
 
6.2%
R 1
 
6.2%
T 1
 
6.2%
G 1
 
6.2%
N 1
 
6.2%
M 1
 
6.2%
C 1
 
6.2%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 222
78.2%
Common 46
 
16.2%
Latin 16
 
5.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
10.8%
12
 
5.4%
10
 
4.5%
10
 
4.5%
9
 
4.1%
8
 
3.6%
7
 
3.2%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (87) 124
55.9%
Latin
ValueCountFrequency (%)
S 3
18.8%
U 3
18.8%
A 2
12.5%
O 1
 
6.2%
R 1
 
6.2%
T 1
 
6.2%
G 1
 
6.2%
N 1
 
6.2%
M 1
 
6.2%
C 1
 
6.2%
Common
ValueCountFrequency (%)
( 17
37.0%
) 17
37.0%
11
23.9%
& 1
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 222
78.2%
ASCII 62
 
21.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
10.8%
12
 
5.4%
10
 
4.5%
10
 
4.5%
9
 
4.1%
8
 
3.6%
7
 
3.2%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (87) 124
55.9%
ASCII
ValueCountFrequency (%)
( 17
27.4%
) 17
27.4%
11
17.7%
S 3
 
4.8%
U 3
 
4.8%
A 2
 
3.2%
O 1
 
1.6%
R 1
 
1.6%
T 1
 
1.6%
G 1
 
1.6%
Other values (5) 5
 
8.1%

우편번호
Real number (ℝ)

MISSING 

Distinct17
Distinct (%)44.7%
Missing1
Missing (%)2.6%
Infinite0
Infinite (%)0.0%
Mean46727.237
Minimum46700
Maximum46773
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size483.0 B
2023-12-11T01:14:14.692433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum46700
5-th percentile46700
Q146719.25
median46721
Q346726
95-th percentile46771.3
Maximum46773
Range73
Interquartile range (IQR)6.75

Descriptive statistics

Standard deviation21.779527
Coefficient of variation (CV)0.00046609918
Kurtosis-0.089018865
Mean46727.237
Median Absolute Deviation (MAD)5
Skewness0.9326789
Sum1775635
Variance474.3478
MonotonicityNot monotonic
2023-12-11T01:14:14.909994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
46721 8
20.5%
46726 7
17.9%
46720 3
 
7.7%
46700 3
 
7.7%
46702 3
 
7.7%
46762 2
 
5.1%
46773 2
 
5.1%
46722 1
 
2.6%
46771 1
 
2.6%
46760 1
 
2.6%
Other values (7) 7
17.9%
ValueCountFrequency (%)
46700 3
 
7.7%
46702 3
 
7.7%
46705 1
 
2.6%
46707 1
 
2.6%
46717 1
 
2.6%
46719 1
 
2.6%
46720 3
 
7.7%
46721 8
20.5%
46722 1
 
2.6%
46726 7
17.9%
ValueCountFrequency (%)
46773 2
 
5.1%
46771 1
 
2.6%
46764 1
 
2.6%
46762 2
 
5.1%
46760 1
 
2.6%
46757 1
 
2.6%
46727 1
 
2.6%
46726 7
17.9%
46722 1
 
2.6%
46721 8
20.5%

소재지(도로명)
Text

MISSING 

Distinct36
Distinct (%)94.7%
Missing1
Missing (%)2.6%
Memory size444.0 B
2023-12-11T01:14:15.276618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length45
Mean length37.026316
Min length21

Characters and Unicode

Total characters1407
Distinct characters129
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)89.5%

Sample

1st row부산광역시 강서구 공항로743번길 2-1 (대저2동)
2nd row부산광역시 강서구 식만로 2 (죽림동)
3rd row부산광역시 강서구 범방3로72번길 14 (범방동)
4th row부산광역시 강서구 낙동북로73번길 15, 1층 (강동동)
5th row부산광역시 강서구 대저중앙로394번길 32 (대저1동)
ValueCountFrequency (%)
부산광역시 38
 
14.3%
강서구 38
 
14.3%
명지동 13
 
4.9%
대저2동 13
 
4.9%
유통단지1로 8
 
3.0%
1층 8
 
3.0%
대저1동 6
 
2.3%
부산티플렉스 5
 
1.9%
41 5
 
1.9%
50 5
 
1.9%
Other values (106) 126
47.5%
2023-12-11T01:14:15.850010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
227
 
16.1%
1 60
 
4.3%
53
 
3.8%
2 52
 
3.7%
48
 
3.4%
44
 
3.1%
43
 
3.1%
39
 
2.8%
39
 
2.8%
39
 
2.8%
Other values (119) 763
54.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 805
57.2%
Decimal Number 261
 
18.6%
Space Separator 227
 
16.1%
Close Punctuation 38
 
2.7%
Open Punctuation 38
 
2.7%
Other Punctuation 30
 
2.1%
Dash Punctuation 5
 
0.4%
Uppercase Letter 2
 
0.1%
Lowercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
6.6%
48
 
6.0%
44
 
5.5%
43
 
5.3%
39
 
4.8%
39
 
4.8%
39
 
4.8%
38
 
4.7%
38
 
4.7%
38
 
4.7%
Other values (101) 386
48.0%
Decimal Number
ValueCountFrequency (%)
1 60
23.0%
2 52
19.9%
0 28
10.7%
3 27
10.3%
4 20
 
7.7%
5 19
 
7.3%
6 16
 
6.1%
9 15
 
5.7%
8 14
 
5.4%
7 10
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
K 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
227
100.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 38
100.0%
Other Punctuation
ValueCountFrequency (%)
, 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 805
57.2%
Common 599
42.6%
Latin 3
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
6.6%
48
 
6.0%
44
 
5.5%
43
 
5.3%
39
 
4.8%
39
 
4.8%
39
 
4.8%
38
 
4.7%
38
 
4.7%
38
 
4.7%
Other values (101) 386
48.0%
Common
ValueCountFrequency (%)
227
37.9%
1 60
 
10.0%
2 52
 
8.7%
) 38
 
6.3%
( 38
 
6.3%
, 30
 
5.0%
0 28
 
4.7%
3 27
 
4.5%
4 20
 
3.3%
5 19
 
3.2%
Other values (5) 60
 
10.0%
Latin
ValueCountFrequency (%)
e 1
33.3%
K 1
33.3%
B 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 805
57.2%
ASCII 602
42.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
227
37.7%
1 60
 
10.0%
2 52
 
8.6%
) 38
 
6.3%
( 38
 
6.3%
, 30
 
5.0%
0 28
 
4.7%
3 27
 
4.5%
4 20
 
3.3%
5 19
 
3.2%
Other values (8) 63
 
10.5%
Hangul
ValueCountFrequency (%)
53
 
6.6%
48
 
6.0%
44
 
5.5%
43
 
5.3%
39
 
4.8%
39
 
4.8%
39
 
4.8%
38
 
4.7%
38
 
4.7%
38
 
4.7%
Other values (101) 386
48.0%

전화번호
Text

MISSING 

Distinct26
Distinct (%)92.9%
Missing11
Missing (%)28.2%
Memory size444.0 B
2023-12-11T01:14:16.133394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.071429
Min length12

Characters and Unicode

Total characters338
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)85.7%

Sample

1st row051-912-3377
2nd row051-941-5802
3rd row051-941-9082
4th row051-941-1801
5th row051-832-0093
ValueCountFrequency (%)
051-808-1011 2
 
7.1%
051-310-6034 2
 
7.1%
051-912-3377 1
 
3.6%
051-714-3133 1
 
3.6%
051-463-4144 1
 
3.6%
051-961-1101 1
 
3.6%
051-201-4960 1
 
3.6%
070-4149-0071 1
 
3.6%
051-442-0755 1
 
3.6%
051-831-5555 1
 
3.6%
Other values (16) 16
57.1%
2023-12-11T01:14:16.555646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 64
18.9%
1 56
16.6%
- 56
16.6%
5 38
11.2%
4 24
 
7.1%
7 23
 
6.8%
3 18
 
5.3%
9 16
 
4.7%
2 16
 
4.7%
8 15
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 282
83.4%
Dash Punctuation 56
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 64
22.7%
1 56
19.9%
5 38
13.5%
4 24
 
8.5%
7 23
 
8.2%
3 18
 
6.4%
9 16
 
5.7%
2 16
 
5.7%
8 15
 
5.3%
6 12
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 56
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 338
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 64
18.9%
1 56
16.6%
- 56
16.6%
5 38
11.2%
4 24
 
7.1%
7 23
 
6.8%
3 18
 
5.3%
9 16
 
4.7%
2 16
 
4.7%
8 15
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 338
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 64
18.9%
1 56
16.6%
- 56
16.6%
5 38
11.2%
4 24
 
7.1%
7 23
 
6.8%
3 18
 
5.3%
9 16
 
4.7%
2 16
 
4.7%
8 15
 
4.4%

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing39
Missing (%)100.0%
Memory size483.0 B

Interactions

2023-12-11T01:14:12.983144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:14:16.692455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종상호우편번호소재지(도로명)전화번호
업종1.0000.9680.1400.9830.000
상호0.9681.0001.0001.0001.000
우편번호0.1401.0001.0001.0001.000
소재지(도로명)0.9831.0001.0001.0001.000
전화번호0.0001.0001.0001.0001.000
2023-12-11T01:14:16.832704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
우편번호업종
우편번호1.0000.000
업종0.0001.000

Missing values

2023-12-11T01:14:13.138751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:14:13.286020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:14:13.438906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업종상호우편번호소재지(도로명)전화번호Unnamed: 5
0국내여행업(주)부산트랩46720부산광역시 강서구 공항로743번길 2-1 (대저2동)051-912-3377<NA>
1국내여행업댕큐블랙46707부산광역시 강서구 식만로 2 (죽림동)051-941-5802<NA>
2국내여행업삼부자 여행사46727부산광역시 강서구 범방3로72번길 14 (범방동)051-941-9082<NA>
3국내여행업(주)유림이엔티46705부산광역시 강서구 낙동북로73번길 15, 1층 (강동동)051-941-1801<NA>
4국내여행업주식회사 응카46702부산광역시 강서구 대저중앙로394번길 32 (대저1동)051-832-0093<NA>
5국내여행업(주)부산여행사46726부산광역시 강서구 명지국제6로 21, 플러스시티 605호 (명지동)051-808-1011<NA>
6국내여행업(주)로메인인터네셔널46721부산광역시 강서구 유통단지1로 50, 부산티플렉스 217동 1층 105호 (대저2동)<NA><NA>
7국내여행업우짜46717부산광역시 강서구 에코델타스마트로 39, 어반테크하우스 산학연계지원공간 3동 1호 (명지동)<NA><NA>
8국내외여행업(주)아쿠아 여행사46726부산광역시 강서구 명지국제8로10번길 16, KB타워 6층 606호 (명지동)051-742-0005<NA>
9국내외여행업(주)하나여행클럽46721부산광역시 강서구 유통단지1로 41, 129동 208호 (대저2동)051-632-2369<NA>
업종상호우편번호소재지(도로명)전화번호Unnamed: 5
29종합여행업(주)플랜씨46700부산광역시 강서구 대저중앙로 89, 1층 (대저1동)070-4149-0071<NA>
30종합여행업두리오 주식회사46726부산광역시 강서구 명지국제7로 37, 5층 503호 (명지동, 더샵 명지퍼스트월드 2단지)051-201-4960<NA>
31종합여행업SUA C&L46721부산광역시 강서구 유통단지1로 41, 부산티플렉스 115동 1층 208호 (대저2동)051-961-1101<NA>
32종합여행업(주)티제이투어46762부산광역시 강서구 명지오션시티9로 50, 삼정그린코아 웨스트 2층 202호 (명지동)<NA><NA>
33종합여행업(주)포커스투어46720부산광역시 강서구 공항앞길163번길 99-2 (대저2동)051-463-4144<NA>
34일반야영장업대저생태공원캠핑장46700부산광역시 강서구 대저로 3, 대저캠핑장 (대저1동)051-310-6034<NA>
35자동차야영장업대저생태공원 캠핑장46700부산광역시 강서구 대저로 3 (대저1동)051-310-6034<NA>
36외국인관광 도시민박업스카이필드46722부산광역시 강서구 금호순서길89번가길 48, 스카이필드 (대저2동)<NA><NA>
37국제회의기획업(주)트윈스라이팅46702부산광역시 강서구 공항로1309번길 95-5 (대저1동)051-314-4607<NA>
38<NA><NA><NA><NA><NA><NA>