Overview

Dataset statistics

Number of variables5
Number of observations3165
Missing cells1559
Missing cells (%)9.9%
Duplicate rows194
Duplicate rows (%)6.1%
Total size in memory123.8 KiB
Average record size in memory40.0 B

Variable types

Categorical2
Text3

Dataset

Description부산광역시 강서구 관내 식품접객업소 현황에 대한 데이터로 업종명, 업소명, 소재지, 업소 전화번호 등을 제공합니다.
Author부산광역시 강서구
URLhttps://www.data.go.kr/data/15006230/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 194 (6.1%) duplicate rowsDuplicates
업종명 is highly imbalanced (52.6%)Imbalance
소재지전화 has 1543 (48.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 03:06:47.766188
Analysis finished2023-12-12 03:06:49.296552
Duration1.53 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size24.9 KiB
일반음식점
2244 
휴게음식점
662 
위탁급식영업
 
181
제과점영업
 
46
유흥주점영업
 
21

Length

Max length6
Median length5
Mean length5.0603476
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 2244
70.9%
휴게음식점 662
 
20.9%
위탁급식영업 181
 
5.7%
제과점영업 46
 
1.5%
유흥주점영업 21
 
0.7%
단란주점 11
 
0.3%

Length

2023-12-12T12:06:49.380886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:06:49.545093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 2244
70.9%
휴게음식점 662
 
20.9%
위탁급식영업 181
 
5.7%
제과점영업 46
 
1.5%
유흥주점영업 21
 
0.7%
단란주점 11
 
0.3%
Distinct2908
Distinct (%)91.9%
Missing0
Missing (%)0.0%
Memory size24.9 KiB
2023-12-12T12:06:49.896117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length28
Mean length7.6274882
Min length1

Characters and Unicode

Total characters24141
Distinct characters823
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2669 ?
Unique (%)84.3%

Sample

1st row명지횟집
2nd row진영집
3rd row강변횟집
4th row풍년횟집
5th row밀양시장횟집
ValueCountFrequency (%)
명지점 166
 
3.7%
부산명지점 65
 
1.5%
명지국제신도시점 57
 
1.3%
씨유 40
 
0.9%
지사점 25
 
0.6%
신호점 23
 
0.5%
세븐일레븐 21
 
0.5%
명지오션시티점 20
 
0.4%
명지국제점 18
 
0.4%
지에스25 18
 
0.4%
Other values (3162) 4011
89.9%
2023-12-12T12:06:50.480511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1299
 
5.4%
967
 
4.0%
836
 
3.5%
622
 
2.6%
413
 
1.7%
384
 
1.6%
352
 
1.5%
351
 
1.5%
327
 
1.4%
( 310
 
1.3%
Other values (813) 18280
75.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20697
85.7%
Space Separator 1299
 
5.4%
Uppercase Letter 517
 
2.1%
Lowercase Letter 506
 
2.1%
Decimal Number 424
 
1.8%
Open Punctuation 310
 
1.3%
Close Punctuation 310
 
1.3%
Other Punctuation 70
 
0.3%
Dash Punctuation 6
 
< 0.1%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
967
 
4.7%
836
 
4.0%
622
 
3.0%
413
 
2.0%
384
 
1.9%
352
 
1.7%
351
 
1.7%
327
 
1.6%
299
 
1.4%
259
 
1.3%
Other values (741) 15887
76.8%
Lowercase Letter
ValueCountFrequency (%)
e 74
14.6%
a 53
 
10.5%
o 50
 
9.9%
r 40
 
7.9%
n 30
 
5.9%
t 25
 
4.9%
s 25
 
4.9%
k 25
 
4.9%
i 23
 
4.5%
c 20
 
4.0%
Other values (14) 141
27.9%
Uppercase Letter
ValueCountFrequency (%)
C 53
 
10.3%
S 40
 
7.7%
B 35
 
6.8%
O 34
 
6.6%
E 33
 
6.4%
A 30
 
5.8%
T 26
 
5.0%
N 26
 
5.0%
G 25
 
4.8%
P 24
 
4.6%
Other values (14) 191
36.9%
Decimal Number
ValueCountFrequency (%)
2 104
24.5%
1 78
18.4%
5 64
15.1%
4 42
9.9%
0 34
 
8.0%
9 32
 
7.5%
3 26
 
6.1%
6 17
 
4.0%
7 16
 
3.8%
8 11
 
2.6%
Other Punctuation
ValueCountFrequency (%)
& 28
40.0%
, 14
20.0%
. 13
18.6%
' 9
 
12.9%
# 2
 
2.9%
/ 1
 
1.4%
: 1
 
1.4%
? 1
 
1.4%
1
 
1.4%
Space Separator
ValueCountFrequency (%)
1299
100.0%
Open Punctuation
ValueCountFrequency (%)
( 310
100.0%
Close Punctuation
ValueCountFrequency (%)
) 310
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20690
85.7%
Common 2421
 
10.0%
Latin 1023
 
4.2%
Han 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
967
 
4.7%
836
 
4.0%
622
 
3.0%
413
 
2.0%
384
 
1.9%
352
 
1.7%
351
 
1.7%
327
 
1.6%
299
 
1.4%
259
 
1.3%
Other values (735) 15880
76.8%
Latin
ValueCountFrequency (%)
e 74
 
7.2%
a 53
 
5.2%
C 53
 
5.2%
o 50
 
4.9%
S 40
 
3.9%
r 40
 
3.9%
B 35
 
3.4%
O 34
 
3.3%
E 33
 
3.2%
n 30
 
2.9%
Other values (38) 581
56.8%
Common
ValueCountFrequency (%)
1299
53.7%
( 310
 
12.8%
) 310
 
12.8%
2 104
 
4.3%
1 78
 
3.2%
5 64
 
2.6%
4 42
 
1.7%
0 34
 
1.4%
9 32
 
1.3%
& 28
 
1.2%
Other values (14) 120
 
5.0%
Han
ValueCountFrequency (%)
2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20690
85.7%
ASCII 3443
 
14.3%
CJK 7
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1299
37.7%
( 310
 
9.0%
) 310
 
9.0%
2 104
 
3.0%
1 78
 
2.3%
e 74
 
2.1%
5 64
 
1.9%
a 53
 
1.5%
C 53
 
1.5%
o 50
 
1.5%
Other values (61) 1048
30.4%
Hangul
ValueCountFrequency (%)
967
 
4.7%
836
 
4.0%
622
 
3.0%
413
 
2.0%
384
 
1.9%
352
 
1.7%
351
 
1.7%
327
 
1.6%
299
 
1.4%
259
 
1.3%
Other values (735) 15880
76.8%
CJK
ValueCountFrequency (%)
2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
None
ValueCountFrequency (%)
1
100.0%
Distinct2846
Distinct (%)90.4%
Missing16
Missing (%)0.5%
Memory size24.9 KiB
2023-12-12T12:06:50.805879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length118
Median length61
Mean length35.51699
Min length21

Characters and Unicode

Total characters111843
Distinct characters353
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2585 ?
Unique (%)82.1%

Sample

1st row부산광역시 강서구 신포길17번길 10-1 (명지동)
2nd row부산광역시 강서구 낙동북로 21 (강동동)
3rd row부산광역시 강서구 명지새동네길2번길 73 (명지동)
4th row부산광역시 강서구 공항로811번길 65 (대저2동)
5th row부산광역시 강서구 신포길17번길 30 (명지동)
ValueCountFrequency (%)
부산광역시 3149
 
15.1%
강서구 3149
 
15.1%
명지동 1565
 
7.5%
1층 1184
 
5.7%
일부호 338
 
1.6%
신호동 288
 
1.4%
대저2동 239
 
1.1%
일부 229
 
1.1%
명지국제8로 193
 
0.9%
대저1동 191
 
0.9%
Other values (1915) 10282
49.4%
2023-12-12T12:06:51.332130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17660
 
15.8%
1 5767
 
5.2%
4090
 
3.7%
3907
 
3.5%
3829
 
3.4%
3770
 
3.4%
3620
 
3.2%
2 3336
 
3.0%
3298
 
2.9%
3265
 
2.9%
Other values (343) 59301
53.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 64032
57.3%
Decimal Number 19818
 
17.7%
Space Separator 17660
 
15.8%
Close Punctuation 3185
 
2.8%
Open Punctuation 3185
 
2.8%
Other Punctuation 2967
 
2.7%
Dash Punctuation 632
 
0.6%
Uppercase Letter 262
 
0.2%
Lowercase Letter 71
 
0.1%
Math Symbol 31
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4090
 
6.4%
3907
 
6.1%
3829
 
6.0%
3770
 
5.9%
3620
 
5.7%
3298
 
5.2%
3265
 
5.1%
3199
 
5.0%
3173
 
5.0%
3156
 
4.9%
Other values (296) 28725
44.9%
Uppercase Letter
ValueCountFrequency (%)
B 63
24.0%
A 39
14.9%
S 38
14.5%
C 25
 
9.5%
R 18
 
6.9%
D 14
 
5.3%
T 12
 
4.6%
O 10
 
3.8%
L 8
 
3.1%
N 7
 
2.7%
Other values (9) 28
10.7%
Decimal Number
ValueCountFrequency (%)
1 5767
29.1%
2 3336
16.8%
3 1879
 
9.5%
0 1853
 
9.4%
4 1540
 
7.8%
6 1405
 
7.1%
8 1312
 
6.6%
5 1088
 
5.5%
7 928
 
4.7%
9 710
 
3.6%
Lowercase Letter
ValueCountFrequency (%)
e 43
60.6%
r 6
 
8.5%
s 3
 
4.2%
a 3
 
4.2%
u 3
 
4.2%
q 3
 
4.2%
n 3
 
4.2%
o 3
 
4.2%
t 3
 
4.2%
b 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
, 2963
99.9%
· 3
 
0.1%
. 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
17660
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3185
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3185
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 632
100.0%
Math Symbol
ValueCountFrequency (%)
~ 31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 64032
57.3%
Common 47478
42.5%
Latin 333
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4090
 
6.4%
3907
 
6.1%
3829
 
6.0%
3770
 
5.9%
3620
 
5.7%
3298
 
5.2%
3265
 
5.1%
3199
 
5.0%
3173
 
5.0%
3156
 
4.9%
Other values (296) 28725
44.9%
Latin
ValueCountFrequency (%)
B 63
18.9%
e 43
12.9%
A 39
11.7%
S 38
11.4%
C 25
 
7.5%
R 18
 
5.4%
D 14
 
4.2%
T 12
 
3.6%
O 10
 
3.0%
L 8
 
2.4%
Other values (19) 63
18.9%
Common
ValueCountFrequency (%)
17660
37.2%
1 5767
 
12.1%
2 3336
 
7.0%
) 3185
 
6.7%
( 3185
 
6.7%
, 2963
 
6.2%
3 1879
 
4.0%
0 1853
 
3.9%
4 1540
 
3.2%
6 1405
 
3.0%
Other values (8) 4705
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 64032
57.3%
ASCII 47808
42.7%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17660
36.9%
1 5767
 
12.1%
2 3336
 
7.0%
) 3185
 
6.7%
( 3185
 
6.7%
, 2963
 
6.2%
3 1879
 
3.9%
0 1853
 
3.9%
4 1540
 
3.2%
6 1405
 
2.9%
Other values (36) 5035
 
10.5%
Hangul
ValueCountFrequency (%)
4090
 
6.4%
3907
 
6.1%
3829
 
6.0%
3770
 
5.9%
3620
 
5.7%
3298
 
5.2%
3265
 
5.1%
3199
 
5.0%
3173
 
5.0%
3156
 
4.9%
Other values (296) 28725
44.9%
None
ValueCountFrequency (%)
· 3
100.0%

소재지전화
Text

MISSING 

Distinct1394
Distinct (%)85.9%
Missing1543
Missing (%)48.8%
Memory size24.9 KiB
2023-12-12T12:06:51.763292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.019112
Min length5

Characters and Unicode

Total characters19495
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1188 ?
Unique (%)73.2%

Sample

1st row051-271-2662
2nd row051-271-1616
3rd row051-271-1905
4th row051-271-2413
5th row051-973-1303
ValueCountFrequency (%)
051-633-0102 8
 
0.5%
051-715-2200 5
 
0.3%
051-941-9288 4
 
0.2%
051-831-3880 4
 
0.2%
051-271-4489 3
 
0.2%
070-4050-6909 3
 
0.2%
051-971-8873 3
 
0.2%
051-941-8585 3
 
0.2%
051-661-9107 3
 
0.2%
051-831-9297 3
 
0.2%
Other values (1384) 1583
97.6%
2023-12-12T12:06:52.295398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 3241
16.6%
1 2963
15.2%
0 2890
14.8%
5 2291
11.8%
2 1752
9.0%
7 1446
7.4%
9 1440
7.4%
3 1199
 
6.2%
8 967
 
5.0%
4 723
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 16254
83.4%
Dash Punctuation 3241
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2963
18.2%
0 2890
17.8%
5 2291
14.1%
2 1752
10.8%
7 1446
8.9%
9 1440
8.9%
3 1199
7.4%
8 967
 
5.9%
4 723
 
4.4%
6 583
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 3241
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 19495
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 3241
16.6%
1 2963
15.2%
0 2890
14.8%
5 2291
11.8%
2 1752
9.0%
7 1446
7.4%
9 1440
7.4%
3 1199
 
6.2%
8 967
 
5.0%
4 723
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 19495
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 3241
16.6%
1 2963
15.2%
0 2890
14.8%
5 2291
11.8%
2 1752
9.0%
7 1446
7.4%
9 1440
7.4%
3 1199
 
6.2%
8 967
 
5.0%
4 723
 
3.7%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size24.9 KiB
2023-09-13
3165 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-09-13
2nd row2023-09-13
3rd row2023-09-13
4th row2023-09-13
5th row2023-09-13

Common Values

ValueCountFrequency (%)
2023-09-13 3165
100.0%

Length

2023-12-12T12:06:52.464632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:06:52.612564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-09-13 3165
100.0%

Missing values

2023-12-12T12:06:48.637250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:06:48.757698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T12:06:49.233389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업종명업소명소재지(도로명)소재지전화데이터기준일자
0일반음식점명지횟집부산광역시 강서구 신포길17번길 10-1 (명지동)051-271-26622023-09-13
1일반음식점진영집부산광역시 강서구 낙동북로 21 (강동동)<NA>2023-09-13
2일반음식점강변횟집부산광역시 강서구 명지새동네길2번길 73 (명지동)051-271-16162023-09-13
3일반음식점풍년횟집<NA>051-271-19052023-09-13
4일반음식점밀양시장횟집<NA>051-271-24132023-09-13
5일반음식점대동반점부산광역시 강서구 공항로811번길 65 (대저2동)051-973-13032023-09-13
6일반음식점사상횟집부산광역시 강서구 신포길17번길 30 (명지동)051-271-19292023-09-13
7일반음식점진목식당부산광역시 강서구 신포길 13-8 (명지동)051-271-18722023-09-13
8일반음식점대성식당부산광역시 강서구 신포길 13-8 (명지동)<NA>2023-09-13
9일반음식점명품달인김밥(녹산점)부산광역시 강서구 생곡로 14-1 (녹산동)051-941-07332023-09-13
업종명업소명소재지(도로명)소재지전화데이터기준일자
3155휴게음식점카페051부산광역시 강서구 낙동북로188번길 25, 1층 일부호 (대저1동)<NA>2023-09-13
3156휴게음식점비와별춘천집 스타필드명지점부산광역시 강서구 명지국제6로 168, 스타필드 시티 명지점 3층 3303호 (명지동)<NA>2023-09-13
3157휴게음식점롯데리아 김해공항국제선점부산광역시 강서구 공항진입로 108, 김해국제공항 국제선청사 3층 (대저2동)02-709-10042023-09-13
3158위탁급식영업만나정식부산광역시 강서구 녹산산단262로50번길 28 (송정동)051-831-41812023-09-13
3159위탁급식영업푸디스트(주) 노바인터내쇼널부산광역시 강서구 범방2로 73, A동 3층 (범방동)<NA>2023-09-13
3160위탁급식영업진아푸드 동화엔텍 화전점부산광역시 강서구 화전산단1로63번길 20, 2동 4층 (화전동)<NA>2023-09-13
3161위탁급식영업본우리집밥 영도산업점부산광역시 강서구 녹산산업중로 22 (송정동)<NA>2023-09-13
3162위탁급식영업(주)호성식품 트렉스타점부산광역시 강서구 녹산산업중로192번길 10 (송정동)<NA>2023-09-13
3163위탁급식영업(주)호성식품 대원식품점부산광역시 강서구 녹산산단407로 27, C동 2층 (송정동)051-831-38802023-09-13
3164제과점영업컴포즈커피 공항덕두점부산광역시 강서구 공항앞길 6 (대저2동)051-973-45942023-09-13

Duplicate rows

Most frequently occurring

업종명업소명소재지(도로명)소재지전화데이터기준일자# duplicates
0일반음식점(주)푸르웰 가덕해양Park 푸드코트2부산광역시 강서구 거가대로 2571 (천성동)051-715-22002023-09-132
1일반음식점가덕도1번지쌈밥부산광역시 강서구 가덕해안로 805 (천성동)051-9719-3272023-09-132
2일반음식점가마솥소머리국밥부산광역시 강서구 대저중앙로 13 (대저1동)051-866-40982023-09-132
3일반음식점가얏골 감자탕 강서구청점부산광역시 강서구 대저로274번길 21 (대저1동)051-971-67202023-09-132
4일반음식점감탄 오션시티점부산광역시 강서구 명지오션시티10로 16 (명지동,퀸덤1차링컨타운상가동 255호)051-260-81552023-09-132
5일반음식점강동어탕전문부산광역시 강서구 낙동북로138번길 91-10 (강동동)051-9739-2882023-09-132
6일반음식점강동회센타참가자미물회부산광역시 강서구 제도로 1004 (강동동)<NA>2023-09-132
7일반음식점강서주문진메밀막국수부산광역시 강서구 금호순서길7번길 48, 2층 (대저2동)051-971-01112023-09-132
8일반음식점강창호물회(대어횟집)부산광역시 강서구 명지오션시티8로 37 (명지동)051-293-37772023-09-132
9일반음식점개경목장부산광역시 강서구 울만로 402 (대저2동)051-271-18592023-09-132