Overview

Dataset statistics

Number of variables9
Number of observations3567
Missing cells161
Missing cells (%)0.5%
Duplicate rows50
Duplicate rows (%)1.4%
Total size in memory264.9 KiB
Average record size in memory76.0 B

Variable types

Numeric4
Text4
Categorical1

Dataset

Description업소아이디,업소명,분류코드,분류코드명,업소 주소,업소 전화번호,추천수,상품명,상품가격(일반)(원)
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-1174/S/1/datasetView.do

Alerts

Dataset has 50 (1.4%) duplicate rowsDuplicates
업소아이디 is highly overall correlated with 추천수High correlation
분류코드 is highly overall correlated with 분류코드명High correlation
추천수 is highly overall correlated with 업소아이디High correlation
분류코드명 is highly overall correlated with 분류코드High correlation
업소 전화번호 has 151 (4.2%) missing valuesMissing
추천수 has 2252 (63.1%) zerosZeros

Reproduction

Analysis started2024-05-11 00:32:09.701880
Analysis finished2024-05-11 00:32:23.711117
Duration14.01 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소아이디
Real number (ℝ)

HIGH CORRELATION 

Distinct1518
Distinct (%)42.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8898.5887
Minimum263
Maximum10075
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size31.5 KiB
2024-05-11T00:32:24.016704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum263
5-th percentile6759
Q18558
median9276
Q39691
95-th percentile9992.7
Maximum10075
Range9812
Interquartile range (IQR)1133

Descriptive statistics

Standard deviation1359.7519
Coefficient of variation (CV)0.15280535
Kurtosis13.430613
Mean8898.5887
Median Absolute Deviation (MAD)477
Skewness-3.261924
Sum31741266
Variance1848925.3
MonotonicityDecreasing
2024-05-11T00:32:24.586229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8709 11
 
0.3%
8550 11
 
0.3%
9705 9
 
0.3%
9886 9
 
0.3%
9706 9
 
0.3%
9701 9
 
0.3%
9699 8
 
0.2%
8209 8
 
0.2%
7442 8
 
0.2%
8148 7
 
0.2%
Other values (1508) 3478
97.5%
ValueCountFrequency (%)
263 2
0.1%
272 3
0.1%
443 2
0.1%
736 2
0.1%
784 2
0.1%
879 2
0.1%
1113 1
 
< 0.1%
1145 1
 
< 0.1%
1163 2
0.1%
1264 2
0.1%
ValueCountFrequency (%)
10075 1
 
< 0.1%
10074 1
 
< 0.1%
10073 1
 
< 0.1%
10072 1
 
< 0.1%
10071 1
 
< 0.1%
10070 3
0.1%
10069 1
 
< 0.1%
10068 2
0.1%
10067 1
 
< 0.1%
10066 3
0.1%
Distinct1436
Distinct (%)40.3%
Missing0
Missing (%)0.0%
Memory size28.0 KiB
2024-05-11T00:32:25.348025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length5.1906364
Min length2

Characters and Unicode

Total characters18515
Distinct characters617
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique261 ?
Unique (%)7.3%

Sample

1st row지연피부샵
2nd row은평왕돈까스
3rd row신가네 김밥
4th row금송아지 생고기구이
5th row포항물회
ValueCountFrequency (%)
미용실 19
 
0.5%
홍두깨손칼국수 13
 
0.3%
거목식당 12
 
0.3%
탐라도야지 11
 
0.3%
만미정 11
 
0.3%
식당 11
 
0.3%
행복밥상 11
 
0.3%
김밥천국 10
 
0.3%
충정로김밥 10
 
0.3%
왕비집 10
 
0.3%
Other values (1545) 3847
97.0%
2024-05-11T00:32:26.553231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
453
 
2.4%
419
 
2.3%
401
 
2.2%
399
 
2.2%
340
 
1.8%
338
 
1.8%
334
 
1.8%
313
 
1.7%
309
 
1.7%
285
 
1.5%
Other values (607) 14924
80.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17739
95.8%
Space Separator 401
 
2.2%
Decimal Number 109
 
0.6%
Lowercase Letter 73
 
0.4%
Uppercase Letter 71
 
0.4%
Other Punctuation 48
 
0.3%
Open Punctuation 36
 
0.2%
Close Punctuation 36
 
0.2%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
453
 
2.6%
419
 
2.4%
399
 
2.2%
340
 
1.9%
338
 
1.9%
334
 
1.9%
313
 
1.8%
309
 
1.7%
285
 
1.6%
284
 
1.6%
Other values (555) 14265
80.4%
Lowercase Letter
ValueCountFrequency (%)
i 9
12.3%
o 7
9.6%
a 7
9.6%
r 6
8.2%
l 6
8.2%
v 6
8.2%
s 6
8.2%
t 6
8.2%
p 4
 
5.5%
y 4
 
5.5%
Other values (7) 12
16.4%
Uppercase Letter
ValueCountFrequency (%)
S 13
18.3%
G 8
11.3%
O 6
8.5%
K 6
8.5%
M 6
8.5%
J 6
8.5%
R 4
 
5.6%
C 4
 
5.6%
T 4
 
5.6%
E 4
 
5.6%
Other values (4) 10
14.1%
Decimal Number
ValueCountFrequency (%)
1 20
18.3%
2 16
14.7%
0 14
12.8%
5 14
12.8%
4 12
11.0%
3 9
8.3%
7 7
 
6.4%
6 7
 
6.4%
9 6
 
5.5%
8 4
 
3.7%
Other Punctuation
ValueCountFrequency (%)
. 16
33.3%
& 12
25.0%
! 7
14.6%
# 4
 
8.3%
' 4
 
8.3%
, 3
 
6.2%
: 2
 
4.2%
Space Separator
ValueCountFrequency (%)
401
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17739
95.8%
Common 632
 
3.4%
Latin 144
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
453
 
2.6%
419
 
2.4%
399
 
2.2%
340
 
1.9%
338
 
1.9%
334
 
1.9%
313
 
1.8%
309
 
1.7%
285
 
1.6%
284
 
1.6%
Other values (555) 14265
80.4%
Latin
ValueCountFrequency (%)
S 13
 
9.0%
i 9
 
6.2%
G 8
 
5.6%
o 7
 
4.9%
a 7
 
4.9%
r 6
 
4.2%
l 6
 
4.2%
O 6
 
4.2%
v 6
 
4.2%
K 6
 
4.2%
Other values (21) 70
48.6%
Common
ValueCountFrequency (%)
401
63.4%
( 36
 
5.7%
) 36
 
5.7%
1 20
 
3.2%
2 16
 
2.5%
. 16
 
2.5%
0 14
 
2.2%
5 14
 
2.2%
& 12
 
1.9%
4 12
 
1.9%
Other values (11) 55
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17739
95.8%
ASCII 776
 
4.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
453
 
2.6%
419
 
2.4%
399
 
2.2%
340
 
1.9%
338
 
1.9%
334
 
1.9%
313
 
1.8%
309
 
1.7%
285
 
1.6%
284
 
1.6%
Other values (555) 14265
80.4%
ASCII
ValueCountFrequency (%)
401
51.7%
( 36
 
4.6%
) 36
 
4.6%
1 20
 
2.6%
2 16
 
2.1%
. 16
 
2.1%
0 14
 
1.8%
5 14
 
1.8%
S 13
 
1.7%
& 12
 
1.5%
Other values (42) 198
25.5%

분류코드
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.6240538
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size31.5 KiB
2024-05-11T00:32:27.077541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q35
95-th percentile7
Maximum13
Range12
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.4188951
Coefficient of variation (CV)0.9218161
Kurtosis5.1066614
Mean2.6240538
Median Absolute Deviation (MAD)0
Skewness1.9660106
Sum9360
Variance5.8510533
MonotonicityNot monotonic
2024-05-11T00:32:27.567067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
1 2044
57.3%
5 730
 
20.5%
4 257
 
7.2%
2 242
 
6.8%
7 117
 
3.3%
3 93
 
2.6%
13 78
 
2.2%
8 3
 
0.1%
6 3
 
0.1%
ValueCountFrequency (%)
1 2044
57.3%
2 242
 
6.8%
3 93
 
2.6%
4 257
 
7.2%
5 730
 
20.5%
6 3
 
0.1%
7 117
 
3.3%
8 3
 
0.1%
13 78
 
2.2%
ValueCountFrequency (%)
13 78
 
2.2%
8 3
 
0.1%
7 117
 
3.3%
6 3
 
0.1%
5 730
 
20.5%
4 257
 
7.2%
3 93
 
2.6%
2 242
 
6.8%
1 2044
57.3%

분류코드명
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size28.0 KiB
한식
2044 
이 미용업
730 
기타외식업(다방,패스트푸드등)
257 
중식
242 
세탁업
 
117
Other values (4)
 
177

Length

Max length16
Median length2
Mean length3.876647
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row이 미용업
2nd row한식
3rd row한식
4th row한식
5th row한식

Common Values

ValueCountFrequency (%)
한식 2044
57.3%
이 미용업 730
 
20.5%
기타외식업(다방,패스트푸드등) 257
 
7.2%
중식 242
 
6.8%
세탁업 117
 
3.3%
경양식,일식 93
 
2.6%
기타서비스업종 78
 
2.2%
숙박업(호텔,여관) 3
 
0.1%
목욕업 3
 
0.1%

Length

2024-05-11T00:32:28.077060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T00:32:28.610071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한식 2044
47.6%
730
 
17.0%
미용업 730
 
17.0%
기타외식업(다방,패스트푸드등 257
 
6.0%
중식 242
 
5.6%
세탁업 117
 
2.7%
경양식,일식 93
 
2.2%
기타서비스업종 78
 
1.8%
숙박업(호텔,여관 3
 
0.1%
목욕업 3
 
0.1%
Distinct1497
Distinct (%)42.0%
Missing0
Missing (%)0.0%
Memory size28.0 KiB
2024-05-11T00:32:29.638504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length45
Mean length24.978413
Min length9

Characters and Unicode

Total characters89098
Distinct characters341
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique271 ?
Unique (%)7.6%

Sample

1st row서울특별시 서울 은평구 증산로21길 26
2nd row서울특별시 서울 은평구 은평로12길 7-9, 105호
3rd row서울특별시 서울 은평구 진흥로1길 17
4th row서울특별시 서울 은평구 응암로 218
5th row서울특별시 서울 은평구 통일로83길 15-1
ValueCountFrequency (%)
서울특별시 3594
 
19.7%
서울 995
 
5.4%
강남구 371
 
2.0%
구로구 324
 
1.8%
관악구 249
 
1.4%
1층 245
 
1.3%
동작구 218
 
1.2%
광진구 183
 
1.0%
서대문구 176
 
1.0%
마포구 171
 
0.9%
Other values (2006) 11750
64.3%
2024-05-11T00:32:31.132093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14772
 
16.6%
5136
 
5.8%
4626
 
5.2%
4024
 
4.5%
3978
 
4.5%
3733
 
4.2%
3594
 
4.0%
3594
 
4.0%
1 3570
 
4.0%
2936
 
3.3%
Other values (331) 39135
43.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 54770
61.5%
Space Separator 14772
 
16.6%
Decimal Number 14119
 
15.8%
Close Punctuation 2101
 
2.4%
Open Punctuation 2101
 
2.4%
Dash Punctuation 611
 
0.7%
Other Punctuation 565
 
0.6%
Uppercase Letter 58
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5136
 
9.4%
4626
 
8.4%
4024
 
7.3%
3978
 
7.3%
3733
 
6.8%
3594
 
6.6%
3594
 
6.6%
2936
 
5.4%
2359
 
4.3%
791
 
1.4%
Other values (305) 19999
36.5%
Decimal Number
ValueCountFrequency (%)
1 3570
25.3%
2 2092
14.8%
3 1509
10.7%
4 1214
 
8.6%
5 1145
 
8.1%
6 1087
 
7.7%
0 986
 
7.0%
7 890
 
6.3%
8 867
 
6.1%
9 759
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
B 27
46.6%
A 9
 
15.5%
M 6
 
10.3%
T 3
 
5.2%
H 3
 
5.2%
S 3
 
5.2%
I 3
 
5.2%
C 2
 
3.4%
U 2
 
3.4%
Other Punctuation
ValueCountFrequency (%)
, 561
99.3%
. 4
 
0.7%
Space Separator
ValueCountFrequency (%)
14772
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2101
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2101
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 611
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 54770
61.5%
Common 34270
38.5%
Latin 58
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5136
 
9.4%
4626
 
8.4%
4024
 
7.3%
3978
 
7.3%
3733
 
6.8%
3594
 
6.6%
3594
 
6.6%
2936
 
5.4%
2359
 
4.3%
791
 
1.4%
Other values (305) 19999
36.5%
Common
ValueCountFrequency (%)
14772
43.1%
1 3570
 
10.4%
) 2101
 
6.1%
( 2101
 
6.1%
2 2092
 
6.1%
3 1509
 
4.4%
4 1214
 
3.5%
5 1145
 
3.3%
6 1087
 
3.2%
0 986
 
2.9%
Other values (7) 3693
 
10.8%
Latin
ValueCountFrequency (%)
B 27
46.6%
A 9
 
15.5%
M 6
 
10.3%
T 3
 
5.2%
H 3
 
5.2%
S 3
 
5.2%
I 3
 
5.2%
C 2
 
3.4%
U 2
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 54770
61.5%
ASCII 34328
38.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
14772
43.0%
1 3570
 
10.4%
) 2101
 
6.1%
( 2101
 
6.1%
2 2092
 
6.1%
3 1509
 
4.4%
4 1214
 
3.5%
5 1145
 
3.3%
6 1087
 
3.2%
0 986
 
2.9%
Other values (16) 3751
 
10.9%
Hangul
ValueCountFrequency (%)
5136
 
9.4%
4626
 
8.4%
4024
 
7.3%
3978
 
7.3%
3733
 
6.8%
3594
 
6.6%
3594
 
6.6%
2936
 
5.4%
2359
 
4.3%
791
 
1.4%
Other values (305) 19999
36.5%

업소 전화번호
Text

MISSING 

Distinct1371
Distinct (%)40.1%
Missing151
Missing (%)4.2%
Memory size28.0 KiB
2024-05-11T00:32:31.984541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length11
Mean length10.913349
Min length1

Characters and Unicode

Total characters37280
Distinct characters24
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique246 ?
Unique (%)7.2%

Sample

1st row02-302-6440
2nd row02-388-6090
3rd row02-385-3187
4th row02-374-9092
5th row02-389-0988
ValueCountFrequency (%)
75
 
2.2%
없음 44
 
1.3%
02-2263-0010 12
 
0.4%
0507-1399-3337 11
 
0.3%
02-983-8884 11
 
0.3%
02-313-1006 10
 
0.3%
070-7779-5577 9
 
0.3%
02-521-4465 9
 
0.3%
02-535-0310 9
 
0.3%
02-312-8253 9
 
0.3%
Other values (1361) 3216
94.2%
2024-05-11T00:32:33.288536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 6321
17.0%
2 5773
15.5%
0 5183
13.9%
5 2804
7.5%
3 2697
7.2%
8 2587
6.9%
7 2451
 
6.6%
4 2318
 
6.2%
1 2234
 
6.0%
9 2215
 
5.9%
Other values (14) 2697
7.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 30465
81.7%
Dash Punctuation 6321
 
17.0%
Space Separator 383
 
1.0%
Other Letter 97
 
0.3%
Close Punctuation 9
 
< 0.1%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 5773
18.9%
0 5183
17.0%
5 2804
9.2%
3 2697
8.9%
8 2587
8.5%
7 2451
8.0%
4 2318
7.6%
1 2234
 
7.3%
9 2215
 
7.3%
6 2203
 
7.2%
Other Letter
ValueCountFrequency (%)
44
45.4%
44
45.4%
2
 
2.1%
2
 
2.1%
2
 
2.1%
1
 
1.0%
1
 
1.0%
1
 
1.0%
Space Separator
ValueCountFrequency (%)
379
99.0%
  4
 
1.0%
Other Punctuation
ValueCountFrequency (%)
. 3
60.0%
? 2
40.0%
Dash Punctuation
ValueCountFrequency (%)
- 6321
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 37183
99.7%
Hangul 97
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
- 6321
17.0%
2 5773
15.5%
0 5183
13.9%
5 2804
7.5%
3 2697
7.3%
8 2587
7.0%
7 2451
 
6.6%
4 2318
 
6.2%
1 2234
 
6.0%
9 2215
 
6.0%
Other values (6) 2600
7.0%
Hangul
ValueCountFrequency (%)
44
45.4%
44
45.4%
2
 
2.1%
2
 
2.1%
2
 
2.1%
1
 
1.0%
1
 
1.0%
1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 37179
99.7%
Hangul 97
 
0.3%
None 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 6321
17.0%
2 5773
15.5%
0 5183
13.9%
5 2804
7.5%
3 2697
7.3%
8 2587
7.0%
7 2451
 
6.6%
4 2318
 
6.2%
1 2234
 
6.0%
9 2215
 
6.0%
Other values (5) 2596
7.0%
Hangul
ValueCountFrequency (%)
44
45.4%
44
45.4%
2
 
2.1%
2
 
2.1%
2
 
2.1%
1
 
1.0%
1
 
1.0%
1
 
1.0%
None
ValueCountFrequency (%)
  4
100.0%

추천수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct51
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.6781609
Minimum0
Maximum450
Zeros2252
Zeros (%)63.1%
Negative0
Negative (%)0.0%
Memory size31.5 KiB
2024-05-11T00:32:34.283685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q33
95-th percentile8.7
Maximum450
Range450
Interquartile range (IQR)3

Descriptive statistics

Standard deviation28.018615
Coefficient of variation (CV)4.9344524
Kurtosis96.607273
Mean5.6781609
Median Absolute Deviation (MAD)0
Skewness8.9570977
Sum20254
Variance785.04277
MonotonicityNot monotonic
2024-05-11T00:32:35.051939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2252
63.1%
3 555
 
15.6%
2 267
 
7.5%
5 140
 
3.9%
6 61
 
1.7%
4 46
 
1.3%
7 42
 
1.2%
9 25
 
0.7%
62 19
 
0.5%
8 19
 
0.5%
Other values (41) 141
 
4.0%
ValueCountFrequency (%)
0 2252
63.1%
1 6
 
0.2%
2 267
 
7.5%
3 555
 
15.6%
4 46
 
1.3%
5 140
 
3.9%
6 61
 
1.7%
7 42
 
1.2%
8 19
 
0.5%
9 25
 
0.7%
ValueCountFrequency (%)
450 1
 
< 0.1%
432 2
0.1%
336 2
0.1%
318 2
0.1%
264 1
 
< 0.1%
258 2
0.1%
257 4
0.1%
252 3
0.1%
250 1
 
< 0.1%
236 4
0.1%
Distinct1136
Distinct (%)31.9%
Missing5
Missing (%)0.1%
Memory size28.0 KiB
2024-05-11T00:32:36.103159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length18
Mean length4.4387984
Min length1

Characters and Unicode

Total characters15811
Distinct characters472
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique863 ?
Unique (%)24.2%

Sample

1st row기본관리(11회)
2nd row왕돈까스
3rd row야채김밥
4th row삼겹살(200g)
5th row모듬물회
ValueCountFrequency (%)
커트 225
 
6.0%
김치찌개 175
 
4.7%
파마 175
 
4.7%
된장찌개 146
 
3.9%
칼국수 77
 
2.1%
짬뽕 73
 
2.0%
비빔밥 72
 
1.9%
냉면 56
 
1.5%
삼겹살 54
 
1.4%
백반 49
 
1.3%
Other values (1093) 2637
70.5%
2024-05-11T00:32:37.865941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 513
 
3.2%
( 513
 
3.2%
481
 
3.0%
459
 
2.9%
438
 
2.8%
394
 
2.5%
369
 
2.3%
360
 
2.3%
353
 
2.2%
319
 
2.0%
Other values (462) 11612
73.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13465
85.2%
Decimal Number 703
 
4.4%
Close Punctuation 513
 
3.2%
Open Punctuation 513
 
3.2%
Space Separator 256
 
1.6%
Lowercase Letter 246
 
1.6%
Other Punctuation 83
 
0.5%
Math Symbol 22
 
0.1%
Uppercase Letter 10
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
481
 
3.6%
459
 
3.4%
438
 
3.3%
394
 
2.9%
369
 
2.7%
360
 
2.7%
353
 
2.6%
319
 
2.4%
316
 
2.3%
297
 
2.2%
Other values (418) 9679
71.9%
Lowercase Letter
ValueCountFrequency (%)
g 201
81.7%
p 7
 
2.8%
t 6
 
2.4%
e 6
 
2.4%
s 5
 
2.0%
c 5
 
2.0%
o 5
 
2.0%
i 3
 
1.2%
h 3
 
1.2%
l 1
 
0.4%
Other values (4) 4
 
1.6%
Decimal Number
ValueCountFrequency (%)
0 314
44.7%
2 121
 
17.2%
1 104
 
14.8%
5 61
 
8.7%
8 42
 
6.0%
3 25
 
3.6%
4 20
 
2.8%
6 11
 
1.6%
7 4
 
0.6%
9 1
 
0.1%
Other Punctuation
ValueCountFrequency (%)
, 61
73.5%
/ 10
 
12.0%
& 4
 
4.8%
' 3
 
3.6%
; 2
 
2.4%
. 1
 
1.2%
: 1
 
1.2%
# 1
 
1.2%
Uppercase Letter
ValueCountFrequency (%)
L 3
30.0%
S 2
20.0%
X 1
 
10.0%
O 1
 
10.0%
M 1
 
10.0%
K 1
 
10.0%
R 1
 
10.0%
Math Symbol
ValueCountFrequency (%)
+ 21
95.5%
~ 1
 
4.5%
Close Punctuation
ValueCountFrequency (%)
) 513
100.0%
Open Punctuation
ValueCountFrequency (%)
( 513
100.0%
Space Separator
ValueCountFrequency (%)
256
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13463
85.1%
Common 2090
 
13.2%
Latin 256
 
1.6%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
481
 
3.6%
459
 
3.4%
438
 
3.3%
394
 
2.9%
369
 
2.7%
360
 
2.7%
353
 
2.6%
319
 
2.4%
316
 
2.3%
297
 
2.2%
Other values (417) 9677
71.9%
Common
ValueCountFrequency (%)
) 513
24.5%
( 513
24.5%
0 314
15.0%
256
12.2%
2 121
 
5.8%
1 104
 
5.0%
5 61
 
2.9%
, 61
 
2.9%
8 42
 
2.0%
3 25
 
1.2%
Other values (13) 80
 
3.8%
Latin
ValueCountFrequency (%)
g 201
78.5%
p 7
 
2.7%
t 6
 
2.3%
e 6
 
2.3%
s 5
 
2.0%
c 5
 
2.0%
o 5
 
2.0%
i 3
 
1.2%
L 3
 
1.2%
h 3
 
1.2%
Other values (11) 12
 
4.7%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13463
85.1%
ASCII 2346
 
14.8%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 513
21.9%
( 513
21.9%
0 314
13.4%
256
10.9%
g 201
 
8.6%
2 121
 
5.2%
1 104
 
4.4%
5 61
 
2.6%
, 61
 
2.6%
8 42
 
1.8%
Other values (34) 160
 
6.8%
Hangul
ValueCountFrequency (%)
481
 
3.6%
459
 
3.4%
438
 
3.3%
394
 
2.9%
369
 
2.7%
360
 
2.7%
353
 
2.6%
319
 
2.4%
316
 
2.3%
297
 
2.2%
Other values (417) 9677
71.9%
CJK
ValueCountFrequency (%)
2
100.0%

상품가격(일반)(원)
Real number (ℝ)

Distinct120
Distinct (%)3.4%
Missing5
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean9792.8271
Minimum200
Maximum350000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size31.5 KiB
2024-05-11T00:32:38.536168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum200
5-th percentile2805
Q15500
median7000
Q310000
95-th percentile29950
Maximum350000
Range349800
Interquartile range (IQR)4500

Descriptive statistics

Standard deviation10596.725
Coefficient of variation (CV)1.0820905
Kurtosis331.84284
Mean9792.8271
Median Absolute Deviation (MAD)2000
Skewness12.494118
Sum34882050
Variance1.1229057 × 108
MonotonicityNot monotonic
2024-05-11T00:32:39.314342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7000 515
14.4%
6000 415
 
11.6%
8000 372
 
10.4%
5000 280
 
7.8%
10000 206
 
5.8%
9000 136
 
3.8%
4000 136
 
3.8%
3000 119
 
3.3%
15000 114
 
3.2%
12000 96
 
2.7%
Other values (110) 1173
32.9%
ValueCountFrequency (%)
200 1
 
< 0.1%
220 1
 
< 0.1%
250 1
 
< 0.1%
300 1
 
< 0.1%
350 1
 
< 0.1%
400 2
 
0.1%
500 4
0.1%
600 1
 
< 0.1%
700 3
0.1%
1000 6
0.2%
ValueCountFrequency (%)
350000 1
 
< 0.1%
198000 1
 
< 0.1%
100000 2
0.1%
90000 1
 
< 0.1%
80000 1
 
< 0.1%
69000 2
0.1%
60000 1
 
< 0.1%
58000 1
 
< 0.1%
55000 4
0.1%
51000 1
 
< 0.1%

Interactions

2024-05-11T00:32:20.838847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:16.834617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:18.073483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:19.475051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:21.165486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:17.121249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:18.351569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:19.833548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:21.519805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:17.385915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:18.768474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:20.179811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:21.924763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:17.721359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:19.104645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T00:32:20.488726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T00:32:39.677319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소아이디분류코드분류코드명추천수상품가격(일반)(원)
업소아이디1.0000.2300.2340.5960.000
분류코드0.2301.0001.0000.1210.206
분류코드명0.2341.0001.0000.1220.255
추천수0.5960.1210.1221.0000.000
상품가격(일반)(원)0.0000.2060.2550.0001.000
2024-05-11T00:32:40.006536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소아이디분류코드추천수상품가격(일반)(원)분류코드명
업소아이디1.000-0.069-0.8560.0360.107
분류코드-0.0691.0000.0650.1521.000
추천수-0.8560.0651.000-0.0230.060
상품가격(일반)(원)0.0360.152-0.0231.0000.149
분류코드명0.1071.0000.0600.1491.000

Missing values

2024-05-11T00:32:22.456417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T00:32:23.076197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-05-11T00:32:23.498888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업소아이디업소명분류코드분류코드명업소 주소업소 전화번호추천수상품명상품가격(일반)(원)
010075지연피부샵5이 미용업서울특별시 서울 은평구 증산로21길 2602-302-64400기본관리(11회)350000
110074은평왕돈까스1한식서울특별시 서울 은평구 은평로12길 7-9, 105호02-388-60900왕돈까스8000
210073신가네 김밥1한식서울특별시 서울 은평구 진흥로1길 1702-385-31870야채김밥2000
310072금송아지 생고기구이1한식서울특별시 서울 은평구 응암로 21802-374-90920삼겹살(200g)15000
410071포항물회1한식서울특별시 서울 은평구 통일로83길 15-102-389-09880모듬물회40000
510070옷수선하는날7세탁업서울특별시 서울 구로구 신도림로11길 302-824-34650소배기장수선8000
610070옷수선하는날7세탁업서울특별시 서울 구로구 신도림로11길 302-824-34650지퍼수선8000
710070옷수선하는날7세탁업서울특별시 서울 구로구 신도림로11길 302-824-34650바짓단4000
810069우렁찬1한식서울특별시 강동구 풍성로37가길 67, 1층 101호02-474-54250시금치나물2000
910068강동집1한식서울특별시 강동구 천호대로188길 15, 1층02-489-39920백반(점심메뉴)6000
업소아이디업소명분류코드분류코드명업소 주소업소 전화번호추천수상품명상품가격(일반)(원)
3557784강헤어컬렉션5이 미용업서울특별시 중랑구 겸재로49길 2202-2208-4790111미용료(파마)30000
3558736장수설렁탕1한식서울특별시 송파구 백제고분로 148 (잠실동)02-415-147256불고기(200g호주산)12000
3559736장수설렁탕1한식서울특별시 송파구 백제고분로 148 (잠실동)02-415-147256설렁탕6000
3560443무진장3경양식,일식서울특별시 용산구 원효로 212 (원효로2가)02-712-88146알탕10000
3561443무진장3경양식,일식서울특별시 용산구 원효로 212 (원효로2가)02-712-88146생선구이정식10000
3562272홍노래방13기타서비스업종서울특별시 마포구 홍익로 1 (서교동)3142-15517노래방이용료(2인)15000
3563272홍노래방13기타서비스업종서울특별시 마포구 홍익로 1 (서교동)3142-15517노래방이용료15000
3564272홍노래방13기타서비스업종서울특별시 마포구 홍익로 1 (서교동)3142-15517노래방이용료(3인)15000
3565263장미미용실5이 미용업서울특별시 서울 중랑구 용마산로129길 61-4<NA>137커트10000
3566263장미미용실5이 미용업서울특별시 서울 중랑구 용마산로129길 61-4<NA>137미용료(파마)15000

Duplicate rows

Most frequently occurring

업소아이디업소명분류코드분류코드명업소 주소업소 전화번호추천수상품명상품가격(일반)(원)# duplicates
03263스타머리방5이 미용업서울특별시 양천구 신월로9길 22 (신월동)02-2601-784362파마150002
16778흥부농장1한식서울특별시 광진구 뚝섬로59길 81-2 (자양동)02-3437-361164김치찌개70002
27348와우돈까스1한식서울특별시 서울 중랑구 공릉로12가길 602-2209-36075칼국수60002
37939소백산영주한우1한식서울특별시 은평구 은평로 187-3 (녹번동)02-358-11995한우갈비탕150002
48148시골마당1한식서울특별시 서초구 강남대로23길 30 (양재동)02-577-94535부대찌개80002
58148시골마당1한식서울특별시 서초구 강남대로23길 30 (양재동)02-577-94535청국장80002
68209다정이모네1한식서울특별시 관악구 신원로 3 (신림동)865-02183청국장60002
78214빨리셀프크리닝7세탁업서울특별시 관악구 신림로 224 (신림동)02-872-82533세탁료(양복1벌)39002
88217초밥아저씨3경양식,일식서울특별시 관악구 신원로 23 (신림동), 관악종합시장 108 호02-3281-28205모듬초밥(10p)50002
98373송백식당1한식서울특별시 영등포구 당산로38길 15 (당산동4가)02-2632-50803설렁탕90002