Overview

Dataset statistics

Number of variables4
Number of observations78
Missing cells20
Missing cells (%)6.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory33.7 B

Variable types

Text3
DateTime1

Dataset

Description부산광역시_기장군_숙박업소현황_20230616
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3072179

Alerts

소재지전화 has 20 (25.6%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:33:42.621224
Analysis finished2023-12-10 16:33:43.254542
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct77
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size756.0 B
2023-12-11T01:33:43.444350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length21
Mean length7.5641026
Min length3

Characters and Unicode

Total characters590
Distinct characters174
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)97.4%

Sample

1st row조인장모텔
2nd row올리브모텔
3rd row늘봄모텔
4th row팔레스
5th row해변여관
ValueCountFrequency (%)
오시리아 5
 
4.7%
큐모텔 2
 
1.9%
베스트루이스해밀턴호텔 2
 
1.9%
브라운도트호텔 2
 
1.9%
정관점 2
 
1.9%
기장호텔 1
 
0.9%
기장일광점 1
 
0.9%
본레브호텔 1
 
0.9%
세븐브릭스모텔 1
 
0.9%
위더스오션 1
 
0.9%
Other values (89) 89
83.2%
2023-12-11T01:33:43.810809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
44
 
7.5%
30
 
5.1%
29
 
4.9%
16
 
2.7%
16
 
2.7%
15
 
2.5%
14
 
2.4%
14
 
2.4%
13
 
2.2%
13
 
2.2%
Other values (164) 386
65.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 463
78.5%
Uppercase Letter 54
 
9.2%
Space Separator 29
 
4.9%
Lowercase Letter 16
 
2.7%
Open Punctuation 10
 
1.7%
Close Punctuation 10
 
1.7%
Decimal Number 6
 
1.0%
Dash Punctuation 1
 
0.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
9.5%
30
 
6.5%
16
 
3.5%
16
 
3.5%
15
 
3.2%
14
 
3.0%
14
 
3.0%
13
 
2.8%
13
 
2.8%
10
 
2.2%
Other values (129) 278
60.0%
Uppercase Letter
ValueCountFrequency (%)
N 5
 
9.3%
H 5
 
9.3%
A 5
 
9.3%
E 5
 
9.3%
T 4
 
7.4%
L 4
 
7.4%
U 4
 
7.4%
B 3
 
5.6%
G 3
 
5.6%
O 3
 
5.6%
Other values (8) 13
24.1%
Lowercase Letter
ValueCountFrequency (%)
l 4
25.0%
o 4
25.0%
v 1
 
6.2%
b 1
 
6.2%
y 1
 
6.2%
p 1
 
6.2%
a 1
 
6.2%
i 1
 
6.2%
e 1
 
6.2%
t 1
 
6.2%
Decimal Number
ValueCountFrequency (%)
2 4
66.7%
5 2
33.3%
Space Separator
ValueCountFrequency (%)
29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 463
78.5%
Latin 70
 
11.9%
Common 57
 
9.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
9.5%
30
 
6.5%
16
 
3.5%
16
 
3.5%
15
 
3.2%
14
 
3.0%
14
 
3.0%
13
 
2.8%
13
 
2.8%
10
 
2.2%
Other values (129) 278
60.0%
Latin
ValueCountFrequency (%)
N 5
 
7.1%
H 5
 
7.1%
A 5
 
7.1%
E 5
 
7.1%
l 4
 
5.7%
o 4
 
5.7%
T 4
 
5.7%
L 4
 
5.7%
U 4
 
5.7%
B 3
 
4.3%
Other values (18) 27
38.6%
Common
ValueCountFrequency (%)
29
50.9%
( 10
 
17.5%
) 10
 
17.5%
2 4
 
7.0%
5 2
 
3.5%
- 1
 
1.8%
& 1
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 463
78.5%
ASCII 127
 
21.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
44
 
9.5%
30
 
6.5%
16
 
3.5%
16
 
3.5%
15
 
3.2%
14
 
3.0%
14
 
3.0%
13
 
2.8%
13
 
2.8%
10
 
2.2%
Other values (129) 278
60.0%
ASCII
ValueCountFrequency (%)
29
22.8%
( 10
 
7.9%
) 10
 
7.9%
N 5
 
3.9%
H 5
 
3.9%
A 5
 
3.9%
E 5
 
3.9%
l 4
 
3.1%
o 4
 
3.1%
T 4
 
3.1%
Other values (25) 46
36.2%
Distinct72
Distinct (%)92.3%
Missing0
Missing (%)0.0%
Memory size756.0 B
2023-12-11T01:33:44.063540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length38
Mean length25.807692
Min length20

Characters and Unicode

Total characters2013
Distinct characters76
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)91.0%

Sample

1st row부산광역시 기장군 기장읍 대라리 94-4
2nd row부산광역시 기장군 기장읍 청강리 335-3
3rd row부산광역시 기장군 일광읍 삼성리 33-11
4th row부산광역시 기장군 기장읍 청강리 337-2
5th row부산광역시 기장군 일광읍 삼성리 39-4
ValueCountFrequency (%)
부산광역시 78
18.4%
기장군 78
18.4%
기장읍 51
 
12.0%
일광읍 15
 
3.5%
삼성리 12
 
2.8%
정관읍 11
 
2.6%
시랑리 11
 
2.6%
연화리 9
 
2.1%
달산리 9
 
2.1%
오시리아 8
 
1.9%
Other values (94) 143
33.6%
2023-12-11T01:33:44.431509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
423
21.0%
130
 
6.5%
130
 
6.5%
97
 
4.8%
93
 
4.6%
87
 
4.3%
87
 
4.3%
86
 
4.3%
78
 
3.9%
78
 
3.9%
Other values (66) 724
36.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1198
59.5%
Space Separator 423
 
21.0%
Decimal Number 321
 
15.9%
Dash Punctuation 63
 
3.1%
Other Punctuation 4
 
0.2%
Math Symbol 2
 
0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
130
10.9%
130
10.9%
97
 
8.1%
93
 
7.8%
87
 
7.3%
87
 
7.3%
86
 
7.2%
78
 
6.5%
78
 
6.5%
78
 
6.5%
Other values (50) 254
21.2%
Decimal Number
ValueCountFrequency (%)
3 61
19.0%
1 44
13.7%
4 34
10.6%
7 32
10.0%
2 31
9.7%
0 29
9.0%
6 26
8.1%
9 23
 
7.2%
5 22
 
6.9%
8 19
 
5.9%
Space Separator
ValueCountFrequency (%)
423
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 63
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1198
59.5%
Common 815
40.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
130
10.9%
130
10.9%
97
 
8.1%
93
 
7.8%
87
 
7.3%
87
 
7.3%
86
 
7.2%
78
 
6.5%
78
 
6.5%
78
 
6.5%
Other values (50) 254
21.2%
Common
ValueCountFrequency (%)
423
51.9%
- 63
 
7.7%
3 61
 
7.5%
1 44
 
5.4%
4 34
 
4.2%
7 32
 
3.9%
2 31
 
3.8%
0 29
 
3.6%
6 26
 
3.2%
9 23
 
2.8%
Other values (6) 49
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1198
59.5%
ASCII 815
40.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
423
51.9%
- 63
 
7.7%
3 61
 
7.5%
1 44
 
5.4%
4 34
 
4.2%
7 32
 
3.9%
2 31
 
3.8%
0 29
 
3.6%
6 26
 
3.2%
9 23
 
2.8%
Other values (6) 49
 
6.0%
Hangul
ValueCountFrequency (%)
130
10.9%
130
10.9%
97
 
8.1%
93
 
7.8%
87
 
7.3%
87
 
7.3%
86
 
7.2%
78
 
6.5%
78
 
6.5%
78
 
6.5%
Other values (50) 254
21.2%

소재지전화
Text

MISSING 

Distinct58
Distinct (%)100.0%
Missing20
Missing (%)25.6%
Memory size756.0 B
2023-12-11T01:33:44.668759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters696
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)100.0%

Sample

1st row051-721-2212
2nd row051-721-7390
3rd row051-724-3360
4th row051-721-5988
5th row051-721-1882
ValueCountFrequency (%)
051-724-8544 1
 
1.7%
051-724-3100 1
 
1.7%
051-727-1006 1
 
1.7%
051-723-0497 1
 
1.7%
051-724-4301 1
 
1.7%
051-724-5335 1
 
1.7%
051-724-0662 1
 
1.7%
051-727-9991 1
 
1.7%
051-723-2146 1
 
1.7%
051-723-1229 1
 
1.7%
Other values (48) 48
82.8%
2023-12-11T01:33:45.012789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 116
16.7%
0 102
14.7%
2 95
13.6%
1 90
12.9%
5 81
11.6%
7 74
10.6%
8 37
 
5.3%
3 31
 
4.5%
4 26
 
3.7%
9 26
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 580
83.3%
Dash Punctuation 116
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 102
17.6%
2 95
16.4%
1 90
15.5%
5 81
14.0%
7 74
12.8%
8 37
 
6.4%
3 31
 
5.3%
4 26
 
4.5%
9 26
 
4.5%
6 18
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 116
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 696
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 116
16.7%
0 102
14.7%
2 95
13.6%
1 90
12.9%
5 81
11.6%
7 74
10.6%
8 37
 
5.3%
3 31
 
4.5%
4 26
 
3.7%
9 26
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 696
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 116
16.7%
0 102
14.7%
2 95
13.6%
1 90
12.9%
5 81
11.6%
7 74
10.6%
8 37
 
5.3%
3 31
 
4.5%
4 26
 
3.7%
9 26
 
3.7%
Distinct76
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size756.0 B
Minimum1997-01-03 00:00:00
Maximum2023-06-14 00:00:00
2023-12-11T01:33:45.137783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:33:45.249241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Correlations

2023-12-11T01:33:45.334704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명영업소 주소(도로명)소재지전화영업자시작일
업소명1.0000.9941.0000.995
영업소 주소(도로명)0.9941.0001.0000.993
소재지전화1.0001.0001.0001.000
영업자시작일0.9950.9931.0001.000

Missing values

2023-12-11T01:33:43.147856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:33:43.222867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명영업소 주소(도로명)소재지전화영업자시작일
0조인장모텔부산광역시 기장군 기장읍 대라리 94-4051-721-22122010-07-29
1올리브모텔부산광역시 기장군 기장읍 청강리 335-3051-721-73902006-04-26
2늘봄모텔부산광역시 기장군 일광읍 삼성리 33-11051-724-33602017-05-04
3팔레스부산광역시 기장군 기장읍 청강리 337-2051-721-59882019-03-13
4해변여관부산광역시 기장군 일광읍 삼성리 39-4051-721-18822013-09-05
5베스트루이스해밀턴호텔 기장점(BEST LOUIS HAMILTON HOTEL)부산광역시 기장군 기장읍 서부리 422051-721-22032023-06-12
6뷰모텔부산광역시 기장군 기장읍 대변리 277-2051-724-92012023-06-07
7호텔오월로부산광역시 기장군 기장읍 동부리 393-1051-722-45612018-09-28
8호텔오즈부산광역시 기장군 기장읍 대변리 277-1051-722-90302021-08-30
9초콜릿부산광역시 기장군 기장읍 연화리 292-1051-723-29882010-05-03
업소명영업소 주소(도로명)소재지전화영업자시작일
68르컬렉티브 오시리아부산광역시 기장군 기장읍 시랑리 736 오시리아 스위첸 마티에<NA>2022-05-16
69오시리아 위더스오션부산광역시 기장군 기장읍 시랑리 736 오시리아 스위첸 마티에<NA>2022-05-20
70마티에 오시리아부산광역시 기장군 기장읍 시랑리 736 오시리아 스위첸 마티에051-983-55002022-06-13
71스테이오부산광역시 기장군 기장읍 시랑리 736 오시리아 스위첸 마티에<NA>2022-06-23
72머무름N오시리아부산광역시 기장군 기장읍 시랑리 736 오시리아 스위첸 마티에051-783-88022022-10-17
73와이컬렉션 by UH FLAT 오시리아부산광역시 기장군 기장읍 시랑리 736 오시리아 스위첸 마티에<NA>2022-12-06
74씨앤트리부산광역시 기장군 일광읍 문동리 4<NA>2023-01-18
75메르벨르A부산광역시 기장군 일광읍 문중리 180-34<NA>2023-03-16
76메르벨르B부산광역시 기장군 일광읍 문중리 180-28<NA>2023-03-16
77오실라부산광역시 기장군 기장읍 시랑리 736 오시리아 스위첸 마티에 1-4동 30호<NA>2023-04-10