Overview

Dataset statistics

Number of variables: 6
Number of observations: 3931
Missing cells: 49
Missing cells (%): 0.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 188.2 KiB
Average record size in memory: 49.0 B
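
The two derived figures in this block can be cross-checked from the raw counts. The sketch below assumes that the missing-cell percentage is missing cells divided by variables times observations, and that the average record size is total memory divided by observations. These definitions are inferred from the reported numbers, and the variable names are illustrative only.

```python
# Values copied from the dataset statistics in this report.
n_variables = 6
n_observations = 3931
missing_cells = 49
total_size_kib = 188.2

# Assumed definition: share of empty cells among all cells in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumed definition: total in-memory size divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results round to the reported 0.2% and 49.0 B.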

Variable types

Categorical: 2
DateTime: 1
Text: 2
Numeric: 1

Dataset

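Description: General restaurant status data for Seongdong-gu, Seoul (서울특별시 성동구). It includes information such as business type (업종명), license date (인허가일자), business name (업소명), road-name address (도로명주소), business floor area (영업장면적), and business category (업태명).
URL: https://www.data.go.kr/data/15035732/fileData.do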

Alerts

업종명 has constant value "일반음식점" (Constant)
도로명주소 has 43 (1.1%) missing values (Missing)

Reproduction

Analysis started: 2023-12-11 23:56:31.910631
Analysis finished: 2023-12-11 23:56:33.319541
Duration: 1.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 30.8 KiB
일반음식점: 3931

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%
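
The difference between "Distinct" and "Unique" is easy to miss. The sketch below assumes that "Distinct" counts the different values in a column and "Unique" counts the values that occur exactly once, which matches Distinct 1 and Unique 0 for this constant column. The helper name `distinct_and_unique` is hypothetical and not part of any profiling library.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique) counts for a column of values."""
    counts = Counter(values)
    distinct = len(counts)  # number of different values
    unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once
    return distinct, unique

# 업종명 holds the same value in all 3931 rows.
column = ["일반음식점"] * 3931
print(distinct_and_unique(column))
```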

Sample

1st row: 일반음식점
2nd row: 일반음식점
3rd row: 일반음식점
4th row: 일반음식점
5th row: 일반음식점

Common Values

Value | Count | Frequency (%)
일반음식점 | 3931 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
일반음식점 | 3931 | 100.0%

인허가일자
DateTime

Distinct: 2607
Distinct (%): 66.3%
Missing: 0
Missing (%): 0.0%
Memory size: 30.8 KiB
Minimum: 1970-07-11 00:00:00
Maximum: 2023-06-28 00:00:00
Histogram with fixed size bins (bins=50)

업소명
Text

Distinct: 3826
Distinct (%): 97.3%
Missing: 0
Missing (%): 0.0%
Memory size: 30.8 KiB

Length

Max length: 35
Median length: 29
Mean length: 6.8957008
Min length: 1

Characters and Unicode

Total characters: 27107
Distinct characters: 954
Distinct categories: 12
Distinct scripts: 5
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
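
As a rough illustration of these per-character properties, the sketch below uses Python's standard `unicodedata` module to count general categories in a few sample business names from this report. The two-letter codes map onto the labels used here: `Lo` is Other Letter, `Ll` is Lowercase Letter, `Zs` is Space Separator, `Ps` is Open Punctuation and `Pe` is Close Punctuation. The helper name `category_counts` is hypothetical, and this is not necessarily how the profiling tool computes its tables.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories (e.g. 'Lo', 'Ll', 'Zs') in text."""
    return Counter(unicodedata.category(ch) for ch in text)

# Sample values of the 업소명 column as shown in this report.
for name in ["오후五厚", "술있는 식탁", "앳모스피어(atmosphere)"]:
    print(name, dict(category_counts(name)))
```

Hangul syllables and Han ideographs both fall under `Lo`, which is why "Other Letter" dominates the category table for this variable.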

Unique

Unique: 3739
Unique (%): 95.1%

Sample

1st row: 오후五厚
2nd row: 술있는 식탁
3rd row: 앳모스피어(atmosphere)
4th row: 더육회
5th row: 큰집 빈대떡 포차

Value | Count | Frequency (%)
성수점 | 85 | 1.5%
왕십리점 | 76 | 1.4%
한양대점 | 55 | 1.0%
성수 | 45 | 0.8%
서울숲점 | 32 | 0.6%
주식회사 | 28 | 0.5%
카페 | 24 | 0.4%
금호점 | 21 | 0.4%
성수역점 | 20 | 0.4%
왕십리 | 18 | 0.3%
Other values (4457) | 5146 | 92.7%

Most occurring characters

ValueCountFrequency (%)
1622
 
6.0%
721
 
2.7%
574
 
2.1%
525
 
1.9%
511
 
1.9%
) 469
 
1.7%
( 468
 
1.7%
439
 
1.6%
397
 
1.5%
290
 
1.1%
Other values (944) 21091
77.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 21460 | 79.2%
Space Separator | 1622 | 6.0%
Uppercase Letter | 1361 | 5.0%
Lowercase Letter | 1326 | 4.9%
Close Punctuation | 469 | 1.7%
Open Punctuation | 468 | 1.7%
Decimal Number | 283 | 1.0%
Other Punctuation | 105 | 0.4%
Dash Punctuation | 7 | < 0.1%
Connector Punctuation | 3 | < 0.1%
Other values (2) | 3 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
721
 
3.4%
574
 
2.7%
525
 
2.4%
511
 
2.4%
439
 
2.0%
397
 
1.8%
290
 
1.4%
258
 
1.2%
249
 
1.2%
245
 
1.1%
Other values (866) 17251
80.4%
Uppercase Letter
ValueCountFrequency (%)
A 131
 
9.6%
E 125
 
9.2%
O 103
 
7.6%
N 84
 
6.2%
T 79
 
5.8%
S 77
 
5.7%
L 71
 
5.2%
D 62
 
4.6%
B 62
 
4.6%
C 60
 
4.4%
Other values (16) 507
37.3%
Lowercase Letter
ValueCountFrequency (%)
e 190
14.3%
o 149
11.2%
a 121
 
9.1%
r 94
 
7.1%
n 87
 
6.6%
i 86
 
6.5%
t 82
 
6.2%
s 69
 
5.2%
l 64
 
4.8%
c 41
 
3.1%
Other values (15) 343
25.9%
Decimal Number
ValueCountFrequency (%)
2 56
19.8%
1 52
18.4%
0 38
13.4%
3 28
9.9%
9 25
8.8%
8 23
8.1%
5 22
 
7.8%
7 20
 
7.1%
4 10
 
3.5%
6 9
 
3.2%
Other Punctuation
ValueCountFrequency (%)
& 32
30.5%
. 27
25.7%
, 16
15.2%
? 11
 
10.5%
' 10
 
9.5%
! 4
 
3.8%
: 2
 
1.9%
; 1
 
1.0%
/ 1
 
1.0%
# 1
 
1.0%
Space Separator
ValueCountFrequency (%)
1622
100.0%
Close Punctuation
ValueCountFrequency (%)
) 469
100.0%
Open Punctuation
ValueCountFrequency (%)
( 468
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 21437 | 79.1%
Common | 2960 | 10.9%
Latin | 2687 | 9.9%
Han | 21 | 0.1%
Katakana | 2 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
721
 
3.4%
574
 
2.7%
525
 
2.4%
511
 
2.4%
439
 
2.0%
397
 
1.9%
290
 
1.4%
258
 
1.2%
249
 
1.2%
245
 
1.1%
Other values (844) 17228
80.4%
Latin
ValueCountFrequency (%)
e 190
 
7.1%
o 149
 
5.5%
A 131
 
4.9%
E 125
 
4.7%
a 121
 
4.5%
O 103
 
3.8%
r 94
 
3.5%
n 87
 
3.2%
i 86
 
3.2%
N 84
 
3.1%
Other values (41) 1517
56.5%
Common
ValueCountFrequency (%)
1622
54.8%
) 469
 
15.8%
( 468
 
15.8%
2 56
 
1.9%
1 52
 
1.8%
0 38
 
1.3%
& 32
 
1.1%
3 28
 
0.9%
. 27
 
0.9%
9 25
 
0.8%
Other values (17) 143
 
4.8%
Han
ValueCountFrequency (%)
2
 
9.5%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Other values (10) 10
47.6%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 21436 | 79.1%
ASCII | 5647 | 20.8%
CJK | 20 | 0.1%
Katakana | 2 | < 0.1%
Compat Jamo | 1 | < 0.1%
CJK Compat Ideographs | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1622
28.7%
) 469
 
8.3%
( 468
 
8.3%
e 190
 
3.4%
o 149
 
2.6%
A 131
 
2.3%
E 125
 
2.2%
a 121
 
2.1%
O 103
 
1.8%
r 94
 
1.7%
Other values (68) 2175
38.5%
Hangul
ValueCountFrequency (%)
721
 
3.4%
574
 
2.7%
525
 
2.4%
511
 
2.4%
439
 
2.0%
397
 
1.9%
290
 
1.4%
258
 
1.2%
249
 
1.2%
245
 
1.1%
Other values (843) 17227
80.4%
CJK
ValueCountFrequency (%)
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (9) 9
45.0%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

MISSING 

Distinct: 3691
Distinct (%): 94.9%
Missing: 43
Missing (%): 1.1%
Memory size: 30.8 KiB

Length

Max length: 76
Median length: 65
Mean length: 33.855195
Min length: 22

Characters and Unicode

Total characters: 131629
Distinct characters: 360
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3535
Unique (%): 90.9%

Sample

1st row: 서울특별시 성동구 서울숲2길 44-7, 1층 (성수동1가)
2nd row: 서울특별시 성동구 왕십리로10길 18, 지층 (성수동1가)
3rd row: 서울특별시 성동구 아차산로17길 11, 1층 (성수동2가)
4th row: 서울특별시 성동구 마장로 137, 221동 1층 1121호 (상왕십리동, 텐즈힐)
5th row: 서울특별시 성동구 장터길 35-1, 1층 (금호동3가)

Value | Count | Frequency (%)
서울특별시 | 3888 | 15.5%
성동구 | 3888 | 15.5%
1층 | 1818 | 7.2%
성수동2가 | 950 | 3.8%
성수동1가 | 754 | 3.0%
행당동 | 506 | 2.0%
2층 | 363 | 1.4%
왕십리로 | 229 | 0.9%
지상1층 | 203 | 0.8%
하왕십리동 | 175 | 0.7%
Other values (2295) | 12350 | 49.2%

Most occurring characters

ValueCountFrequency (%)
21246
 
16.1%
1 8516
 
6.5%
8379
 
6.4%
6426
 
4.9%
, 4336
 
3.3%
4318
 
3.3%
( 4173
 
3.2%
) 4172
 
3.2%
4155
 
3.2%
2 4123
 
3.1%
Other values (350) 61785
46.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 73140 | 55.6%
Decimal Number | 22713 | 17.3%
Space Separator | 21246 | 16.1%
Other Punctuation | 4343 | 3.3%
Open Punctuation | 4173 | 3.2%
Close Punctuation | 4172 | 3.2%
Dash Punctuation | 1225 | 0.9%
Uppercase Letter | 535 | 0.4%
Lowercase Letter | 58 | < 0.1%
Math Symbol | 23 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8379
 
11.5%
6426
 
8.8%
4318
 
5.9%
4155
 
5.7%
3946
 
5.4%
3903
 
5.3%
3890
 
5.3%
3888
 
5.3%
3380
 
4.6%
2685
 
3.7%
Other values (293) 28170
38.5%
Uppercase Letter
ValueCountFrequency (%)
B 142
26.5%
A 53
 
9.9%
R 50
 
9.3%
T 35
 
6.5%
I 35
 
6.5%
C 33
 
6.2%
E 28
 
5.2%
S 25
 
4.7%
K 23
 
4.3%
L 18
 
3.4%
Other values (14) 93
17.4%
Lowercase Letter
ValueCountFrequency (%)
e 12
20.7%
o 11
19.0%
r 10
17.2%
w 7
12.1%
t 4
 
6.9%
i 2
 
3.4%
l 2
 
3.4%
a 2
 
3.4%
m 2
 
3.4%
p 2
 
3.4%
Other values (3) 4
 
6.9%
Decimal Number
ValueCountFrequency (%)
1 8516
37.5%
2 4123
18.2%
3 1945
 
8.6%
0 1612
 
7.1%
4 1555
 
6.8%
5 1186
 
5.2%
7 1131
 
5.0%
6 972
 
4.3%
8 854
 
3.8%
9 819
 
3.6%
Other Punctuation
ValueCountFrequency (%)
, 4336
99.8%
. 4
 
0.1%
@ 2
 
< 0.1%
? 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
21246
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4173
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4172
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1225
100.0%
Math Symbol
ValueCountFrequency (%)
~ 23
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 73140 | 55.6%
Common | 57895 | 44.0%
Latin | 594 | 0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8379
 
11.5%
6426
 
8.8%
4318
 
5.9%
4155
 
5.7%
3946
 
5.4%
3903
 
5.3%
3890
 
5.3%
3888
 
5.3%
3380
 
4.6%
2685
 
3.7%
Other values (293) 28170
38.5%
Latin
ValueCountFrequency (%)
B 142
23.9%
A 53
 
8.9%
R 50
 
8.4%
T 35
 
5.9%
I 35
 
5.9%
C 33
 
5.6%
E 28
 
4.7%
S 25
 
4.2%
K 23
 
3.9%
L 18
 
3.0%
Other values (28) 152
25.6%
Common
ValueCountFrequency (%)
21246
36.7%
1 8516
14.7%
, 4336
 
7.5%
( 4173
 
7.2%
) 4172
 
7.2%
2 4123
 
7.1%
3 1945
 
3.4%
0 1612
 
2.8%
4 1555
 
2.7%
- 1225
 
2.1%
Other values (9) 4992
 
8.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 73140 | 55.6%
ASCII | 58488 | 44.4%
Number Forms | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21246
36.3%
1 8516
14.6%
, 4336
 
7.4%
( 4173
 
7.1%
) 4172
 
7.1%
2 4123
 
7.0%
3 1945
 
3.3%
0 1612
 
2.8%
4 1555
 
2.7%
- 1225
 
2.1%
Other values (46) 5585
 
9.5%
Hangul
ValueCountFrequency (%)
8379
 
11.5%
6426
 
8.8%
4318
 
5.9%
4155
 
5.7%
3946
 
5.4%
3903
 
5.3%
3890
 
5.3%
3888
 
5.3%
3380
 
4.6%
2685
 
3.7%
Other values (293) 28170
38.5%
Number Forms
ValueCountFrequency (%)
1
100.0%

영업장면적
Real number (ℝ)

Distinct: 2431
Distinct (%): 61.9%
Missing: 6
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 69.891562
Minimum: 4
Maximum: 1744.39
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 34.7 KiB

Quantile statistics

Minimum: 4
5-th percentile: 16.5
Q1: 31.4
Median: 51.87
Q3: 83
95-th percentile: 174.96
Maximum: 1744.39
Range: 1740.39
Interquartile range (IQR): 51.6

Descriptive statistics

Standard deviation: 74.732983
Coefficient of variation (CV): 1.0692705
Kurtosis: 96.9861
Mean: 69.891562
Median Absolute Deviation (MAD): 23.83
Skewness: 7.0145311
Sum: 274324.38
Variance: 5585.0187
Monotonicity: Not monotonic
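
Several of the figures reported for 영업장면적 are simple functions of the others. The sketch below re-derives them as a consistency check, assuming the usual textbook definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared, sum = mean × non-missing count). The variable names are illustrative only.

```python
# Values copied from the statistics reported for 영업장면적.
n_observations = 3931
n_missing = 6
minimum, maximum = 4.0, 1744.39
q1, q3 = 31.4, 83.0
mean, std = 69.891562, 74.732983

value_range = maximum - minimum               # Range
iqr = q3 - q1                                 # Interquartile range
cv = std / mean                               # Coefficient of variation
variance = std ** 2                           # Variance
total = mean * (n_observations - n_missing)   # Sum over non-missing rows

print(f"Range    {value_range:.2f}")
print(f"IQR      {iqr:.1f}")
print(f"CV       {cv:.4f}")
print(f"Variance {variance:.2f}")
print(f"Sum      {total:.2f}")
```

A CV above 1 together with a skewness of about 7 points to a strongly right-skewed distribution: most venues are small, with a handful of very large ones.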
Histogram with fixed size bins (bins=50)
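
To give a feel for what 50 fixed-size bins mean for this variable, the sketch below builds equal-width bin edges between the reported minimum (4) and maximum (1744.39). It assumes the bins span exactly that range and that the last bin includes its right edge, as in common plotting defaults. The helper names are hypothetical and the report's actual binning may differ.

```python
def fixed_width_bins(lo, hi, bins):
    """Return the edges of `bins` equal-width intervals spanning [lo, hi]."""
    width = (hi - lo) / bins
    return [lo + i * width for i in range(bins + 1)]

def bin_index(x, lo, hi, bins):
    """Index of the bin containing x; the last bin also includes hi."""
    width = (hi - lo) / bins
    return min(int((x - lo) / width), bins - 1)

edges = fixed_width_bins(4.0, 1744.39, 50)
print(f"bin width: {edges[1] - edges[0]:.2f}")
print("bin of the median (51.87):", bin_index(51.87, 4.0, 1744.39, 50))
print("bin of the 95th percentile (174.96):", bin_index(174.96, 4.0, 1744.39, 50))
```

Under these assumptions each bin is about 34.8 wide, so the 95th percentile already falls within the first five bins and the rest of the axis is taken up by a few outliers.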
Value | Count | Frequency (%)
33.0 | 59 | 1.5%
30.0 | 41 | 1.0%
50.0 | 36 | 0.9%
40.0 | 30 | 0.8%
20.0 | 29 | 0.7%
24.0 | 27 | 0.7%
66.0 | 27 | 0.7%
60.0 | 22 | 0.6%
26.0 | 21 | 0.5%
15.0 | 21 | 0.5%
Other values (2421) | 3612 | 91.9%

Value | Count | Frequency (%)
4.0 | 1 | < 0.1%
4.96 | 1 | < 0.1%
5.0 | 3 | 0.1%
5.2 | 1 | < 0.1%
5.22 | 1 | < 0.1%
6.08 | 1 | < 0.1%
6.47 | 1 | < 0.1%
6.5 | 1 | < 0.1%
6.6 | 5 | 0.1%
7.0 | 1 | < 0.1%

Value | Count | Frequency (%)
1744.39 | 1 | < 0.1%
1063.55 | 1 | < 0.1%
929.26 | 1 | < 0.1%
891.49 | 1 | < 0.1%
867.0 | 1 | < 0.1%
840.85 | 1 | < 0.1%
768.7 | 1 | < 0.1%
715.06 | 1 | < 0.1%
674.74 | 1 | < 0.1%
646.81 | 1 | < 0.1%

업태명
Categorical

Distinct: 24
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 30.8 KiB
한식: 1571
기타: 1025
경양식: 308
호프/통닭: 247
일식: 180
Other values (19): 600

Length

Max length: 15
Median length: 2
Mean length: 2.7657085
Min length: 2

Unique

Unique: 6
Unique (%): 0.2%

Sample

1st row: 한식
2nd row: 기타
3rd row: 경양식
4th row: 일식
5th row: 한식

Common Values

Value | Count | Frequency (%)
한식 | 1571 | 40.0%
기타 | 1025 | 26.1%
경양식 | 308 | 7.8%
호프/통닭 | 247 | 6.3%
일식 | 180 | 4.6%
분식 | 144 | 3.7%
중국식 | 111 | 2.8%
통닭(치킨) | 77 | 2.0%
외국음식전문점(인도,태국등) | 57 | 1.5%
정종/대포집/소주방 | 50 | 1.3%
Other values (14) | 161 | 4.1%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
한식 | 1571 | 40.0%
기타 | 1025 | 26.1%
경양식 | 308 | 7.8%
호프/통닭 | 247 | 6.3%
일식 | 180 | 4.6%
분식 | 144 | 3.7%
중국식 | 111 | 2.8%
통닭(치킨 | 77 | 2.0%
외국음식전문점(인도,태국등 | 57 | 1.5%
정종/대포집/소주방 | 50 | 1.3%
Other values (14) | 161 | 4.1%

Interactions


Correlations

(variable) | 영업장면적 | 업태명
영업장면적 | 1.000 | 0.473
업태명 | 0.473 | 1.000

(variable) | 영업장면적 | 업태명
영업장면적 | 1.000 | 0.186
업태명 | 0.186 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot visual patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
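
One way to read "nullity correlation" is as the Pearson correlation between the present/absent indicators of two columns. The sketch below implements that reading with the standard library only. The function name and the toy columns are hypothetical and not taken from the dataset, and the profiling tool may compute its heatmap differently.

```python
from math import sqrt

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the present/absent indicators of two columns.

    Undefined (division by zero) if either column is fully present or fully missing.
    """
    a = [0 if v is None else 1 for v in col_a]
    b = [0 if v is None else 1 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)

# Hypothetical toy columns, not taken from the dataset.
address = ["a", None, "b", None]
area = [10.0, None, 20.0, 30.0]
print(round(nullity_correlation(address, area), 3))
```

A value near 1 means the two columns tend to be missing in the same rows, a value near -1 means one tends to be present when the other is missing, and a value near 0 means the gaps are unrelated.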

Sample

(index) | 업종명 | 인허가일자 | 업소명 | 도로명주소 | 영업장면적 | 업태명
0 | 일반음식점 | 2018-12-27 | 오후五厚 | 서울특별시 성동구 서울숲2길 44-7, 1층 (성수동1가) | 83.42 | 한식
1 | 일반음식점 | 2018-11-02 | 술있는 식탁 | 서울특별시 성동구 왕십리로10길 18, 지층 (성수동1가) | 47.52 | 기타
2 | 일반음식점 | 2018-11-29 | 앳모스피어(atmosphere) | 서울특별시 성동구 아차산로17길 11, 1층 (성수동2가) | 50.87 | 경양식
3 | 일반음식점 | 2018-12-04 | 더육회 | 서울특별시 성동구 마장로 137, 221동 1층 1121호 (상왕십리동, 텐즈힐) | 29.25 | 일식
4 | 일반음식점 | 2018-11-02 | 큰집 빈대떡 포차 | 서울특별시 성동구 장터길 35-1, 1층 (금호동3가) | 23.0 | 한식
5 | 일반음식점 | 2018-11-08 | (주) 포스트핀스페이스 | 서울특별시 성동구 성수이로7길 24, 신원지기공업사 2층 (성수동2가) | 18.23 | 분식
6 | 일반음식점 | 2019-01-22 | 처갓집양념치킨 왕십리점 | 서울특별시 성동구 마장로23길 7, 1층 (홍익동) | 36.0 | 호프/통닭
7 | 일반음식점 | 2019-01-29 | 주식회사 식당컴퍼니 텐동식당 | 서울특별시 성동구 연무장5가길 20-1, 1층 (성수동2가) | 49.5 | 일식
8 | 일반음식점 | 2019-01-29 | 제스티 살룬(zesty saloon) | 서울특별시 성동구 서울숲4길 13, 1층, 2층 (성수동1가) | 184.57 | 기타
9 | 일반음식점 | 2019-01-17 | 인덱스 카라멜 (INDEX CARAMEL) | 서울특별시 성동구 성수이로14길 14, 나동 1층 (성수동2가) | 15.6 | 기타

(index) | 업종명 | 인허가일자 | 업소명 | 도로명주소 | 영업장면적 | 업태명
3921 | 일반음식점 | 2022-05-23 | 두남자의 개수작 | 서울특별시 성동구 왕십리로 107, 2층 (성수동1가) | 52.27 | 기타
3922 | 일반음식점 | 2022-05-23 | 래뮤 | 서울특별시 성동구 연무장길 14, 2층 (성수동1가) | 80.66 | 기타
3923 | 일반음식점 | 2022-05-19 | 마음마켓 | 서울특별시 성동구 청계천로10가길 102, 희운빌딩 2층 (마장동) | 30.3 | 기타
3924 | 일반음식점 | 2022-05-19 | 순자네코다리조림 | 서울특별시 성동구 마조로15가길 27, 만성빌딩 1층 (마장동) | 60.0 | 한식
3925 | 일반음식점 | 2022-05-19 | 열기 | 서울특별시 성동구 광나루로 302, 1층 (성수동2가) | 125.6 | 한식
3926 | 일반음식점 | 2022-05-19 | 하우 엠(How M) | 서울특별시 성동구 장터길 32, 2층 (금호동4가) | 61.74 | 기타
3927 | 일반음식점 | 2022-04-28 | 완강정 | 서울특별시 성동구 장터길 14, 1층 (금호동4가) | 6.5 | 기타
3928 | 일반음식점 | 2022-05-17 | 잇프레쉬 | 서울특별시 성동구 성수일로4길 33, 1층 (성수동2가) | 8.0 | 기타
3929 | 일반음식점 | 2022-05-17 | 와인쌤마켓(WINE SSEM MARKET) | 서울특별시 성동구 아차산로17길 48, 성수 SK V1 CENTER I 1층 R105호 (성수동2가) | 139.06 | 기타
3930 | 일반음식점 | 2022-05-17 | 중앙해장포장 금호점 | 서울특별시 성동구 금호로 15, 상가동 1층 102호 (금호동4가, 서울숲푸르지오아파트) | 21.6 | 한식