Overview

Dataset statistics

Number of variables4
Number of observations335
Missing cells172
Missing cells (%)12.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.6 KiB
Average record size in memory32.4 B

Variable types

Categorical1
Text3

Dataset

Description부산광역시 기장군 카페 현황에 대한 데이터로 카페의 업종, 업소명, 소재지(도로명), 소재지전화를 공개합니다.
Author부산광역시 기장군
URLhttps://www.data.go.kr/data/15111313/fileData.do

Alerts

업종 has constant value ""Constant
소재지전화 has 172 (51.3%) missing valuesMissing
소재지(도로명) has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:23:02.831375
Analysis finished2023-12-12 10:23:03.718562
Duration0.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
휴게음식점
335 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row휴게음식점
2nd row휴게음식점
3rd row휴게음식점
4th row휴게음식점
5th row휴게음식점

Common Values

ValueCountFrequency (%)
휴게음식점 335
100.0%

Length

2023-12-12T19:23:03.795919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:23:03.895382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
휴게음식점 335
100.0%
Distinct333
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-12T19:23:04.172000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length22
Mean length8.5343284
Min length2

Characters and Unicode

Total characters2859
Distinct characters385
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique331 ?
Unique (%)98.8%

Sample

1st row산마루다방
2nd row고궁다방
3rd row약속다방
4th row원다방
5th row삼거리다방
ValueCountFrequency (%)
컴포즈커피 9
 
1.9%
coffee 8
 
1.6%
정관점 8
 
1.6%
하삼동커피 7
 
1.4%
부산정관점 6
 
1.2%
일광신도시점 6
 
1.2%
더벤티 5
 
1.0%
기장시장점 4
 
0.8%
카페 4
 
0.8%
하이오커피 4
 
0.8%
Other values (373) 424
87.4%
2023-12-12T19:23:04.621331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
150
 
5.2%
112
 
3.9%
104
 
3.6%
102
 
3.6%
67
 
2.3%
67
 
2.3%
) 64
 
2.2%
( 64
 
2.2%
62
 
2.2%
60
 
2.1%
Other values (375) 2007
70.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2090
73.1%
Uppercase Letter 214
 
7.5%
Lowercase Letter 203
 
7.1%
Space Separator 150
 
5.2%
Decimal Number 67
 
2.3%
Close Punctuation 64
 
2.2%
Open Punctuation 64
 
2.2%
Other Punctuation 5
 
0.2%
Dash Punctuation 1
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
112
 
5.4%
104
 
5.0%
102
 
4.9%
67
 
3.2%
67
 
3.2%
62
 
3.0%
60
 
2.9%
47
 
2.2%
43
 
2.1%
43
 
2.1%
Other values (310) 1383
66.2%
Lowercase Letter
ValueCountFrequency (%)
e 37
18.2%
a 28
13.8%
f 18
8.9%
o 15
 
7.4%
i 13
 
6.4%
n 12
 
5.9%
t 10
 
4.9%
r 9
 
4.4%
l 9
 
4.4%
c 8
 
3.9%
Other values (13) 44
21.7%
Uppercase Letter
ValueCountFrequency (%)
E 27
12.6%
O 24
11.2%
A 18
 
8.4%
C 18
 
8.4%
F 16
 
7.5%
S 14
 
6.5%
T 12
 
5.6%
N 9
 
4.2%
B 9
 
4.2%
L 9
 
4.2%
Other values (13) 58
27.1%
Decimal Number
ValueCountFrequency (%)
5 14
20.9%
1 14
20.9%
0 10
14.9%
4 5
 
7.5%
2 5
 
7.5%
6 5
 
7.5%
9 4
 
6.0%
3 4
 
6.0%
8 3
 
4.5%
7 3
 
4.5%
Other Punctuation
ValueCountFrequency (%)
· 2
40.0%
# 1
20.0%
& 1
20.0%
. 1
20.0%
Space Separator
ValueCountFrequency (%)
150
100.0%
Close Punctuation
ValueCountFrequency (%)
) 64
100.0%
Open Punctuation
ValueCountFrequency (%)
( 64
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2090
73.1%
Latin 418
 
14.6%
Common 351
 
12.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
112
 
5.4%
104
 
5.0%
102
 
4.9%
67
 
3.2%
67
 
3.2%
62
 
3.0%
60
 
2.9%
47
 
2.2%
43
 
2.1%
43
 
2.1%
Other values (310) 1383
66.2%
Latin
ValueCountFrequency (%)
e 37
 
8.9%
a 28
 
6.7%
E 27
 
6.5%
O 24
 
5.7%
A 18
 
4.3%
C 18
 
4.3%
f 18
 
4.3%
F 16
 
3.8%
o 15
 
3.6%
S 14
 
3.3%
Other values (37) 203
48.6%
Common
ValueCountFrequency (%)
150
42.7%
) 64
18.2%
( 64
18.2%
5 14
 
4.0%
1 14
 
4.0%
0 10
 
2.8%
4 5
 
1.4%
2 5
 
1.4%
6 5
 
1.4%
9 4
 
1.1%
Other values (8) 16
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2090
73.1%
ASCII 766
 
26.8%
None 2
 
0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
150
19.6%
) 64
 
8.4%
( 64
 
8.4%
e 37
 
4.8%
a 28
 
3.7%
E 27
 
3.5%
O 24
 
3.1%
A 18
 
2.3%
C 18
 
2.3%
f 18
 
2.3%
Other values (53) 318
41.5%
Hangul
ValueCountFrequency (%)
112
 
5.4%
104
 
5.0%
102
 
4.9%
67
 
3.2%
67
 
3.2%
62
 
3.0%
60
 
2.9%
47
 
2.2%
43
 
2.1%
43
 
2.1%
Other values (310) 1383
66.2%
None
ValueCountFrequency (%)
· 2
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct335
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-12T19:23:05.031288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length48
Mean length29.961194
Min length9

Characters and Unicode

Total characters10037
Distinct characters226
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique335 ?
Unique (%)100.0%

Sample

1st row부산광역시 기장군 기장읍 차성로299번길 12
2nd row부산광역시 기장군 기장읍 차성남로 49 (,408-15)
3rd row부산광역시 기장군 기장읍 차성로288번길 24
4th row부산광역시 기장군 일광읍 일광로 106
5th row부산광역시 기장군 기장읍 차성로288번길 67
ValueCountFrequency (%)
부산광역시 334
 
15.4%
기장군 334
 
15.4%
1층 138
 
6.4%
기장읍 121
 
5.6%
정관읍 101
 
4.7%
일광읍 69
 
3.2%
정관로 42
 
1.9%
장안읍 41
 
1.9%
기장해안로 30
 
1.4%
2층 19
 
0.9%
Other values (519) 943
43.4%
2023-12-12T19:23:05.570923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1838
18.3%
1 554
 
5.5%
548
 
5.5%
494
 
4.9%
440
 
4.4%
372
 
3.7%
365
 
3.6%
341
 
3.4%
336
 
3.3%
335
 
3.3%
Other values (216) 4414
44.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5890
58.7%
Space Separator 1838
 
18.3%
Decimal Number 1773
 
17.7%
Other Punctuation 317
 
3.2%
Dash Punctuation 57
 
0.6%
Close Punctuation 51
 
0.5%
Open Punctuation 51
 
0.5%
Uppercase Letter 31
 
0.3%
Math Symbol 28
 
0.3%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
548
 
9.3%
494
 
8.4%
440
 
7.5%
372
 
6.3%
365
 
6.2%
341
 
5.8%
336
 
5.7%
335
 
5.7%
334
 
5.7%
266
 
4.5%
Other values (182) 2059
35.0%
Uppercase Letter
ValueCountFrequency (%)
B 8
25.8%
A 7
22.6%
C 3
 
9.7%
D 3
 
9.7%
E 2
 
6.5%
W 2
 
6.5%
R 2
 
6.5%
T 1
 
3.2%
P 1
 
3.2%
I 1
 
3.2%
Decimal Number
ValueCountFrequency (%)
1 554
31.2%
2 253
14.3%
3 199
 
11.2%
0 181
 
10.2%
5 121
 
6.8%
4 120
 
6.8%
8 100
 
5.6%
6 97
 
5.5%
7 87
 
4.9%
9 61
 
3.4%
Other Punctuation
ValueCountFrequency (%)
, 313
98.7%
* 2
 
0.6%
. 2
 
0.6%
Math Symbol
ValueCountFrequency (%)
~ 26
92.9%
> 1
 
3.6%
< 1
 
3.6%
Close Punctuation
ValueCountFrequency (%)
) 49
96.1%
] 2
 
3.9%
Open Punctuation
ValueCountFrequency (%)
( 49
96.1%
[ 2
 
3.9%
Space Separator
ValueCountFrequency (%)
1838
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 57
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5890
58.7%
Common 4115
41.0%
Latin 32
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
548
 
9.3%
494
 
8.4%
440
 
7.5%
372
 
6.3%
365
 
6.2%
341
 
5.8%
336
 
5.7%
335
 
5.7%
334
 
5.7%
266
 
4.5%
Other values (182) 2059
35.0%
Common
ValueCountFrequency (%)
1838
44.7%
1 554
 
13.5%
, 313
 
7.6%
2 253
 
6.1%
3 199
 
4.8%
0 181
 
4.4%
5 121
 
2.9%
4 120
 
2.9%
8 100
 
2.4%
6 97
 
2.4%
Other values (12) 339
 
8.2%
Latin
ValueCountFrequency (%)
B 8
25.0%
A 7
21.9%
C 3
 
9.4%
D 3
 
9.4%
E 2
 
6.2%
W 2
 
6.2%
R 2
 
6.2%
T 1
 
3.1%
P 1
 
3.1%
I 1
 
3.1%
Other values (2) 2
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5890
58.7%
ASCII 4147
41.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1838
44.3%
1 554
 
13.4%
, 313
 
7.5%
2 253
 
6.1%
3 199
 
4.8%
0 181
 
4.4%
5 121
 
2.9%
4 120
 
2.9%
8 100
 
2.4%
6 97
 
2.3%
Other values (24) 371
 
8.9%
Hangul
ValueCountFrequency (%)
548
 
9.3%
494
 
8.4%
440
 
7.5%
372
 
6.3%
365
 
6.2%
341
 
5.8%
336
 
5.7%
335
 
5.7%
334
 
5.7%
266
 
4.5%
Other values (182) 2059
35.0%

소재지전화
Text

MISSING 

Distinct161
Distinct (%)98.8%
Missing172
Missing (%)51.3%
Memory size2.7 KiB
2023-12-12T19:23:05.860937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.030675
Min length11

Characters and Unicode

Total characters1961
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique159 ?
Unique (%)97.5%

Sample

1st row051-721-1343
2nd row051-721-1155
3rd row051-721-4242
4th row051-721-0818
5th row051-721-2044
ValueCountFrequency (%)
051-724-8556 2
 
1.2%
051-724-3457 2
 
1.2%
02-465-1121 1
 
0.6%
051-727-3422 1
 
0.6%
051-728-2088 1
 
0.6%
051-940-1640 1
 
0.6%
051-721-1343 1
 
0.6%
051-728-7462 1
 
0.6%
051-727-2259 1
 
0.6%
051-817-6800 1
 
0.6%
Other values (151) 151
92.6%
2023-12-12T19:23:06.312892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 326
16.6%
1 273
13.9%
0 271
13.8%
7 253
12.9%
5 232
11.8%
2 205
10.5%
4 98
 
5.0%
8 96
 
4.9%
3 76
 
3.9%
9 68
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1635
83.4%
Dash Punctuation 326
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 273
16.7%
0 271
16.6%
7 253
15.5%
5 232
14.2%
2 205
12.5%
4 98
 
6.0%
8 96
 
5.9%
3 76
 
4.6%
9 68
 
4.2%
6 63
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 326
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1961
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 326
16.6%
1 273
13.9%
0 271
13.8%
7 253
12.9%
5 232
11.8%
2 205
10.5%
4 98
 
5.0%
8 96
 
4.9%
3 76
 
3.9%
9 68
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1961
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 326
16.6%
1 273
13.9%
0 271
13.8%
7 253
12.9%
5 232
11.8%
2 205
10.5%
4 98
 
5.0%
8 96
 
4.9%
3 76
 
3.9%
9 68
 
3.5%

Missing values

2023-12-12T19:23:03.547426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:23:03.668546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종업소명소재지(도로명)소재지전화
0휴게음식점산마루다방부산광역시 기장군 기장읍 차성로299번길 12051-721-1343
1휴게음식점고궁다방부산광역시 기장군 기장읍 차성남로 49 (,408-15)051-721-1155
2휴게음식점약속다방부산광역시 기장군 기장읍 차성로288번길 24051-721-4242
3휴게음식점원다방부산광역시 기장군 일광읍 일광로 106051-721-0818
4휴게음식점삼거리다방부산광역시 기장군 기장읍 차성로288번길 67051-721-2044
5휴게음식점단지다방부산광역시 기장군 기장읍 차성동로 77051-722-6003
6휴게음식점명천하커피숍부산광역시 기장군 기장읍 차성로288번길 6051-723-0068
7휴게음식점에제르부산광역시 기장군 기장읍 대청로72번길 6, 1층<NA>
8휴게음식점마로니에부산광역시 기장군 정관읍 정관로 262051-728-3824
9휴게음식점젠커피숍부산광역시 기장군 장안읍 장안로 38051-727-9991
업종업소명소재지(도로명)소재지전화
325휴게음식점더벤티 기장좌천점부산광역시 기장군 장안읍 좌천로 53, 1층<NA>
326휴게음식점카페 사주부산광역시 기장군 기장읍 죽성로 348, 메밀꽃필무렵 1층<NA>
327휴게음식점바람바람바람부산광역시 기장군 일광읍 달음길 26<NA>
328휴게음식점컴포즈(장안좌천역점)부산광역시 기장군 장안읍 해맞이로 9, 1층<NA>
329휴게음식점할리스 일광신도시점부산광역시 기장군 일광읍 해빛6로 43-20<NA>
330휴게음식점낭만커피(일광)부산광역시 기장군 일광읍 일광로 137, 1층<NA>
331휴게음식점은아 케이크부산광역시 기장군 정관읍 방곡로 29, 402동 106호 (이지더원5차)<NA>
332휴게음식점이디야커피 일광해송점부산광역시 기장군 일광읍 해송1로 16, 1-2일부층<NA>
333휴게음식점댄싱컵 기장한신점부산광역시 기장군 기장읍 차성로338번길 28, 1층<NA>
334휴게음식점탑카페부산광역시 기장군 장안읍 해맞이로 438, 1층 일부호<NA>