Overview

Dataset statistics

Number of variables12
Number of observations10000
Missing cells45528
Missing cells (%)37.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 MiB
Average record size in memory111.0 B

Variable types

Numeric6
Text3
Categorical2
Boolean1

Dataset

Description전라남도에 위치한 식당의 품질정보를 적어놓은 데이터 입니다.식당명, 어워드정보, RTI지수, 온라인화진행여부, 수용태세지수, 인기도 등 식당품질정보를 포함하고 있습니다
Author전라남도
URLhttps://www.data.go.kr/data/15111204/fileData.do

Alerts

지역명 is highly overall correlated with 씨트립 평점High correlation
씨트립 평점 is highly overall correlated with 식당ID and 7 other fieldsHigh correlation
온라인화진행여부 is highly overall correlated with 식당ID and 3 other fieldsHigh correlation
식당ID is highly overall correlated with 온라인화진행여부 and 1 other fieldsHigh correlation
RTI지수 is highly overall correlated with 수용태세지수 and 3 other fieldsHigh correlation
수용태세지수 is highly overall correlated with RTI지수 and 2 other fieldsHigh correlation
인기도 is highly overall correlated with RTI지수 and 1 other fieldsHigh correlation
트립어드바이저 평점 is highly overall correlated with 씨트립 평점High correlation
네이버 평점 is highly overall correlated with 씨트립 평점High correlation
씨트립 평점 is highly imbalanced (99.8%)Imbalance
지점명 has 9099 (91.0%) missing valuesMissing
어워드정보설명 has 9533 (95.3%) missing valuesMissing
RTI지수 has 763 (7.6%) missing valuesMissing
온라인화진행여부 has 3156 (31.6%) missing valuesMissing
수용태세지수 has 1521 (15.2%) missing valuesMissing
인기도 has 6385 (63.8%) missing valuesMissing
트립어드바이저 평점 has 9838 (98.4%) missing valuesMissing
네이버 평점 has 5233 (52.3%) missing valuesMissing
식당ID has unique valuesUnique

Reproduction

Analysis started2023-12-16 15:17:26.089484
Analysis finished2023-12-16 15:17:59.846958
Duration33.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

식당ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean382628.08
Minimum10399
Maximum1074217
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T15:18:00.527774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10399
5-th percentile31635.85
Q1172780.25
median196008
Q3688486.5
95-th percentile1022777.8
Maximum1074217
Range1063818
Interquartile range (IQR)515706.25

Descriptive statistics

Standard deviation325852.35
Coefficient of variation (CV)0.85161639
Kurtosis-0.87704155
Mean382628.08
Median Absolute Deviation (MAD)78310.5
Skewness0.76506244
Sum3.8262808 × 109
Variance1.0617975 × 1011
MonotonicityNot monotonic
2023-12-16T15:18:01.386218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
968651 1
 
< 0.1%
34286 1
 
< 0.1%
196411 1
 
< 0.1%
197915 1
 
< 0.1%
183409 1
 
< 0.1%
199108 1
 
< 0.1%
193677 1
 
< 0.1%
704021 1
 
< 0.1%
179173 1
 
< 0.1%
33560 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
10399 1
< 0.1%
10401 1
< 0.1%
10402 1
< 0.1%
10403 1
< 0.1%
10405 1
< 0.1%
10406 1
< 0.1%
10409 1
< 0.1%
10412 1
< 0.1%
10414 1
< 0.1%
10420 1
< 0.1%
ValueCountFrequency (%)
1074217 1
< 0.1%
1074107 1
< 0.1%
1074104 1
< 0.1%
1074073 1
< 0.1%
1074006 1
< 0.1%
1073993 1
< 0.1%
1073940 1
< 0.1%
1073916 1
< 0.1%
1073903 1
< 0.1%
1073902 1
< 0.1%
Distinct9202
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-16T15:18:02.682064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length5.3245
Min length1

Characters and Unicode

Total characters53245
Distinct characters1055
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8702 ?
Unique (%)87.0%

Sample

1st row드루와
2nd row백팔번
3rd row소나무꼬마김밥
4th row동향국밥
5th row해뜰짱분식
ValueCountFrequency (%)
카페 97
 
0.8%
여수 29
 
0.3%
식당 23
 
0.2%
투모아 22
 
0.2%
cafe 20
 
0.2%
까투리 19
 
0.2%
포차 19
 
0.2%
김밥나라 16
 
0.1%
커피 16
 
0.1%
호프 11
 
0.1%
Other values (9834) 11145
97.6%
2023-12-16T15:18:05.062861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1510
 
2.8%
1420
 
2.7%
1258
 
2.4%
1081
 
2.0%
830
 
1.6%
606
 
1.1%
605
 
1.1%
594
 
1.1%
538
 
1.0%
526
 
1.0%
Other values (1045) 44277
83.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 48529
91.1%
Space Separator 1420
 
2.7%
Uppercase Letter 1022
 
1.9%
Lowercase Letter 938
 
1.8%
Decimal Number 569
 
1.1%
Open Punctuation 273
 
0.5%
Close Punctuation 273
 
0.5%
Other Punctuation 197
 
0.4%
Dash Punctuation 12
 
< 0.1%
Control 4
 
< 0.1%
Other values (3) 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1510
 
3.1%
1258
 
2.6%
1081
 
2.2%
830
 
1.7%
606
 
1.2%
605
 
1.2%
594
 
1.2%
538
 
1.1%
526
 
1.1%
523
 
1.1%
Other values (962) 40458
83.4%
Lowercase Letter
ValueCountFrequency (%)
e 142
15.1%
a 105
11.2%
o 92
 
9.8%
r 63
 
6.7%
n 63
 
6.7%
c 60
 
6.4%
f 59
 
6.3%
i 41
 
4.4%
s 38
 
4.1%
u 37
 
3.9%
Other values (16) 238
25.4%
Uppercase Letter
ValueCountFrequency (%)
C 105
 
10.3%
A 98
 
9.6%
E 97
 
9.5%
O 91
 
8.9%
F 66
 
6.5%
B 66
 
6.5%
L 46
 
4.5%
S 42
 
4.1%
N 42
 
4.1%
T 39
 
3.8%
Other values (16) 330
32.3%
Decimal Number
ValueCountFrequency (%)
1 122
21.4%
2 97
17.0%
3 57
10.0%
0 55
9.7%
9 48
 
8.4%
5 45
 
7.9%
6 39
 
6.9%
4 38
 
6.7%
8 35
 
6.2%
7 33
 
5.8%
Other Punctuation
ValueCountFrequency (%)
& 92
46.7%
, 32
 
16.2%
. 30
 
15.2%
· 12
 
6.1%
: 9
 
4.6%
! 9
 
4.6%
' 6
 
3.0%
# 5
 
2.5%
; 1
 
0.5%
/ 1
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 271
99.3%
[ 2
 
0.7%
Close Punctuation
ValueCountFrequency (%)
) 271
99.3%
] 2
 
0.7%
Math Symbol
ValueCountFrequency (%)
+ 2
50.0%
~ 2
50.0%
Space Separator
ValueCountFrequency (%)
1420
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Control
ValueCountFrequency (%)
4
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 48529
91.1%
Common 2756
 
5.2%
Latin 1960
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1510
 
3.1%
1258
 
2.6%
1081
 
2.2%
830
 
1.7%
606
 
1.2%
605
 
1.2%
594
 
1.2%
538
 
1.1%
526
 
1.1%
523
 
1.1%
Other values (962) 40458
83.4%
Latin
ValueCountFrequency (%)
e 142
 
7.2%
a 105
 
5.4%
C 105
 
5.4%
A 98
 
5.0%
E 97
 
4.9%
o 92
 
4.7%
O 91
 
4.6%
F 66
 
3.4%
B 66
 
3.4%
r 63
 
3.2%
Other values (42) 1035
52.8%
Common
ValueCountFrequency (%)
1420
51.5%
( 271
 
9.8%
) 271
 
9.8%
1 122
 
4.4%
2 97
 
3.5%
& 92
 
3.3%
3 57
 
2.1%
0 55
 
2.0%
9 48
 
1.7%
5 45
 
1.6%
Other values (21) 278
 
10.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 48526
91.1%
ASCII 4701
 
8.8%
None 12
 
< 0.1%
Letterlike Symbols 3
 
< 0.1%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1510
 
3.1%
1258
 
2.6%
1081
 
2.2%
830
 
1.7%
606
 
1.2%
605
 
1.2%
594
 
1.2%
538
 
1.1%
526
 
1.1%
523
 
1.1%
Other values (959) 40455
83.4%
ASCII
ValueCountFrequency (%)
1420
30.2%
( 271
 
5.8%
) 271
 
5.8%
e 142
 
3.0%
1 122
 
2.6%
a 105
 
2.2%
C 105
 
2.2%
A 98
 
2.1%
2 97
 
2.1%
E 97
 
2.1%
Other values (71) 1973
42.0%
None
ValueCountFrequency (%)
· 12
100.0%
Letterlike Symbols
ValueCountFrequency (%)
3
100.0%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

지점명
Text

MISSING 

Distinct399
Distinct (%)44.3%
Missing9099
Missing (%)91.0%
Memory size156.2 KiB
2023-12-16T15:18:06.152075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length3
Mean length3.7746948
Min length2

Characters and Unicode

Total characters3401
Distinct characters254
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique289 ?
Unique (%)32.1%

Sample

1st row여수웅천점
2nd row교동점
3rd row신대점
4th row여수점
5th row다도지점
ValueCountFrequency (%)
본점 40
 
4.4%
남악점 22
 
2.4%
여수점 22
 
2.4%
화순점 21
 
2.3%
광양점 20
 
2.2%
금당점 18
 
2.0%
순천점 16
 
1.7%
나주혁신점 15
 
1.6%
2호점 13
 
1.4%
신대점 12
 
1.3%
Other values (390) 716
78.3%
2023-12-16T15:18:08.644507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
902
26.5%
101
 
3.0%
87
 
2.6%
86
 
2.5%
82
 
2.4%
79
 
2.3%
78
 
2.3%
73
 
2.1%
70
 
2.1%
62
 
1.8%
Other values (244) 1781
52.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3304
97.1%
Decimal Number 67
 
2.0%
Uppercase Letter 16
 
0.5%
Space Separator 14
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
902
27.3%
101
 
3.1%
87
 
2.6%
86
 
2.6%
82
 
2.5%
79
 
2.4%
78
 
2.4%
73
 
2.2%
70
 
2.1%
62
 
1.9%
Other values (226) 1684
51.0%
Decimal Number
ValueCountFrequency (%)
1 26
38.8%
2 20
29.9%
3 7
 
10.4%
8 4
 
6.0%
5 3
 
4.5%
9 2
 
3.0%
0 2
 
3.0%
4 2
 
3.0%
7 1
 
1.5%
Uppercase Letter
ValueCountFrequency (%)
L 4
25.0%
F 4
25.0%
C 3
18.8%
D 1
 
6.2%
S 1
 
6.2%
R 1
 
6.2%
N 1
 
6.2%
G 1
 
6.2%
Space Separator
ValueCountFrequency (%)
14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3304
97.1%
Common 81
 
2.4%
Latin 16
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
902
27.3%
101
 
3.1%
87
 
2.6%
86
 
2.6%
82
 
2.5%
79
 
2.4%
78
 
2.4%
73
 
2.2%
70
 
2.1%
62
 
1.9%
Other values (226) 1684
51.0%
Common
ValueCountFrequency (%)
1 26
32.1%
2 20
24.7%
14
17.3%
3 7
 
8.6%
8 4
 
4.9%
5 3
 
3.7%
9 2
 
2.5%
0 2
 
2.5%
4 2
 
2.5%
7 1
 
1.2%
Latin
ValueCountFrequency (%)
L 4
25.0%
F 4
25.0%
C 3
18.8%
D 1
 
6.2%
S 1
 
6.2%
R 1
 
6.2%
N 1
 
6.2%
G 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3304
97.1%
ASCII 97
 
2.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
902
27.3%
101
 
3.1%
87
 
2.6%
86
 
2.6%
82
 
2.5%
79
 
2.4%
78
 
2.4%
73
 
2.2%
70
 
2.1%
62
 
1.9%
Other values (226) 1684
51.0%
ASCII
ValueCountFrequency (%)
1 26
26.8%
2 20
20.6%
14
14.4%
3 7
 
7.2%
8 4
 
4.1%
L 4
 
4.1%
F 4
 
4.1%
C 3
 
3.1%
5 3
 
3.1%
9 2
 
2.1%
Other values (8) 10
 
10.3%

지역명
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
여수시
1675 
순천시
1513 
목포시
1441 
광양시
821 
나주시
585 
Other values (17)
3965 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강진군
2nd row광양시
3rd row순천시
4th row광양시
5th row무안군

Common Values

ValueCountFrequency (%)
여수시 1675
16.8%
순천시 1513
15.1%
목포시 1441
14.4%
광양시 821
 
8.2%
나주시 585
 
5.9%
무안군 400
 
4.0%
해남군 331
 
3.3%
화순군 328
 
3.3%
담양군 315
 
3.1%
영암군 283
 
2.8%
Other values (12) 2308
23.1%

Length

2023-12-16T15:18:09.773695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
여수시 1675
16.8%
순천시 1513
15.1%
목포시 1441
14.4%
광양시 821
 
8.2%
나주시 585
 
5.9%
무안군 400
 
4.0%
해남군 331
 
3.3%
화순군 328
 
3.3%
담양군 315
 
3.1%
영암군 283
 
2.8%
Other values (12) 2308
23.1%

어워드정보설명
Text

MISSING 

Distinct60
Distinct (%)12.8%
Missing9533
Missing (%)95.3%
Memory size156.2 KiB
2023-12-16T15:18:11.464128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length33
Mean length15.862955
Min length10

Characters and Unicode

Total characters7408
Distinct characters35
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)2.4%

Sample

1st row모범음식점(2001)
2nd row안심식당(2021)
3rd row모범음식점(2016),안심식당(2020)
4th row모범음식점(2011)
5th row안심식당(2021)
ValueCountFrequency (%)
안심식당(2020 80
16.8%
안심식당(2021 54
 
11.4%
모범음식점(2018),안심식당(2020 29
 
6.1%
모범음식점(2016),안심식당(2020 22
 
4.6%
모범음식점(2017),안심식당(2020 22
 
4.6%
모범음식점(2018 16
 
3.4%
모범음식점(2012),안심식당(2020 15
 
3.2%
모범음식점(2019),안심식당(2020 15
 
3.2%
블루리본(2020 14
 
2.9%
모범음식점(2008),안심식당(2020 12
 
2.5%
Other values (51) 196
41.3%
2023-12-16T15:18:13.617035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 1116
15.1%
0 1065
14.4%
( 684
9.2%
) 684
9.2%
644
8.7%
341
 
4.6%
341
 
4.6%
341
 
4.6%
1 325
 
4.4%
303
 
4.1%
Other values (25) 1564
21.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3079
41.6%
Decimal Number 2736
36.9%
Open Punctuation 684
 
9.2%
Close Punctuation 684
 
9.2%
Other Punctuation 217
 
2.9%
Space Separator 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
644
20.9%
341
11.1%
341
11.1%
341
11.1%
303
9.8%
303
9.8%
303
9.8%
303
9.8%
32
 
1.0%
32
 
1.0%
Other values (11) 136
 
4.4%
Decimal Number
ValueCountFrequency (%)
2 1116
40.8%
0 1065
38.9%
1 325
 
11.9%
8 68
 
2.5%
9 39
 
1.4%
6 32
 
1.2%
7 29
 
1.1%
3 22
 
0.8%
4 20
 
0.7%
5 20
 
0.7%
Open Punctuation
ValueCountFrequency (%)
( 684
100.0%
Close Punctuation
ValueCountFrequency (%)
) 684
100.0%
Other Punctuation
ValueCountFrequency (%)
, 217
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4329
58.4%
Hangul 3079
41.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
644
20.9%
341
11.1%
341
11.1%
341
11.1%
303
9.8%
303
9.8%
303
9.8%
303
9.8%
32
 
1.0%
32
 
1.0%
Other values (11) 136
 
4.4%
Common
ValueCountFrequency (%)
2 1116
25.8%
0 1065
24.6%
( 684
15.8%
) 684
15.8%
1 325
 
7.5%
, 217
 
5.0%
8 68
 
1.6%
9 39
 
0.9%
6 32
 
0.7%
7 29
 
0.7%
Other values (4) 70
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4329
58.4%
Hangul 3079
41.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 1116
25.8%
0 1065
24.6%
( 684
15.8%
) 684
15.8%
1 325
 
7.5%
, 217
 
5.0%
8 68
 
1.6%
9 39
 
0.9%
6 32
 
0.7%
7 29
 
0.7%
Other values (4) 70
 
1.6%
Hangul
ValueCountFrequency (%)
644
20.9%
341
11.1%
341
11.1%
341
11.1%
303
9.8%
303
9.8%
303
9.8%
303
9.8%
32
 
1.0%
32
 
1.0%
Other values (11) 136
 
4.4%

RTI지수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct1541
Distinct (%)16.7%
Missing763
Missing (%)7.6%
Infinite0
Infinite (%)0.0%
Mean2.3833588
Minimum0.0002234
Maximum4.9983257
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T15:18:14.693691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.0002234
5-th percentile0.0186212
Q10.910544
median2.4955107
Q34.1162244
95-th percentile4.3883172
Maximum4.9983257
Range4.9981023
Interquartile range (IQR)3.2056804

Descriptive statistics

Standard deviation1.6443796
Coefficient of variation (CV)0.68994211
Kurtosis-1.6510312
Mean2.3833588
Median Absolute Deviation (MAD)1.5849667
Skewness-0.063017512
Sum22015.085
Variance2.7039842
MonotonicityNot monotonic
2023-12-16T15:18:15.978276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.910544 2672
26.7%
0.0186212 1126
 
11.3%
2.4955107 1079
 
10.8%
3.8151542 431
 
4.3%
4.0717341 195
 
1.9%
4.0210963 149
 
1.5%
3.2736601 147
 
1.5%
2.4672079 138
 
1.4%
4.1162244 100
 
1.0%
1.0501897 98
 
1.0%
Other values (1531) 3102
31.0%
(Missing) 763
 
7.6%
ValueCountFrequency (%)
0.0002234 1
 
< 0.1%
0.0186212 1126
11.3%
0.0286097 1
 
< 0.1%
0.1351425 1
 
< 0.1%
0.1421433 1
 
< 0.1%
0.1577147 66
 
0.7%
0.1731637 6
 
0.1%
0.1751647 1
 
< 0.1%
0.2630585 1
 
< 0.1%
0.3154518 1
 
< 0.1%
ValueCountFrequency (%)
4.9983257 1
< 0.1%
4.8865116 1
< 0.1%
4.7668809 1
< 0.1%
4.7506174 1
< 0.1%
4.7167231 1
< 0.1%
4.69429 1
< 0.1%
4.6810765 1
< 0.1%
4.6806264 1
< 0.1%
4.6641802 1
< 0.1%
4.6500527 1
< 0.1%

온라인화진행여부
Boolean

HIGH CORRELATION  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing3156
Missing (%)31.6%
Memory size97.7 KiB
False
5022 
True
1822 
(Missing)
3156 
ValueCountFrequency (%)
False 5022
50.2%
True 1822
 
18.2%
(Missing) 3156
31.6%
2023-12-16T15:18:16.737694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

수용태세지수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct199
Distinct (%)2.3%
Missing1521
Missing (%)15.2%
Infinite0
Infinite (%)0.0%
Mean0.16704859
Minimum0.029
Maximum0.383
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T15:18:17.629843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.029
5-th percentile0.029
Q10.17
median0.199
Q30.216
95-th percentile0.262
Maximum0.383
Range0.354
Interquartile range (IQR)0.046

Descriptive statistics

Standard deviation0.076477191
Coefficient of variation (CV)0.45781405
Kurtosis-0.3771835
Mean0.16704859
Median Absolute Deviation (MAD)0.029
Skewness-0.8564889
Sum1416.405
Variance0.0058487608
MonotonicityNot monotonic
2023-12-16T15:18:18.696960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.029 1657
16.6%
0.17 1149
11.5%
0.201 660
 
6.6%
0.2 620
 
6.2%
0.189 518
 
5.2%
0.199 245
 
2.5%
0.202 236
 
2.4%
0.217 225
 
2.2%
0.216 211
 
2.1%
0.12 198
 
2.0%
Other values (189) 2760
27.6%
(Missing) 1521
15.2%
ValueCountFrequency (%)
0.029 1657
16.6%
0.044 49
 
0.5%
0.048 9
 
0.1%
0.053 2
 
< 0.1%
0.057 1
 
< 0.1%
0.058 2
 
< 0.1%
0.063 46
 
0.5%
0.066 3
 
< 0.1%
0.07 3
 
< 0.1%
0.072 4
 
< 0.1%
ValueCountFrequency (%)
0.383 1
< 0.1%
0.379 2
< 0.1%
0.378 1
< 0.1%
0.371 1
< 0.1%
0.364 1
< 0.1%
0.362 2
< 0.1%
0.357 1
< 0.1%
0.345 1
< 0.1%
0.341 1
< 0.1%
0.34 2
< 0.1%

인기도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct40
Distinct (%)1.1%
Missing6385
Missing (%)63.8%
Infinite0
Infinite (%)0.0%
Mean0.11962102
Minimum0.01
Maximum0.51
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T15:18:19.708591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.01
5-th percentile0.01
Q10.02
median0.06
Q30.21
95-th percentile0.33
Maximum0.51
Range0.5
Interquartile range (IQR)0.19

Descriptive statistics

Standard deviation0.12263183
Coefficient of variation (CV)1.0251696
Kurtosis-0.85179062
Mean0.11962102
Median Absolute Deviation (MAD)0.05
Skewness0.86049337
Sum432.43
Variance0.015038567
MonotonicityNot monotonic
2023-12-16T15:18:20.706308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
0.01 742
 
7.4%
0.33 609
 
6.1%
0.02 397
 
4.0%
0.03 282
 
2.8%
0.04 228
 
2.3%
0.06 164
 
1.6%
0.07 130
 
1.3%
0.08 113
 
1.1%
0.09 105
 
1.1%
0.11 90
 
0.9%
Other values (30) 755
 
7.5%
(Missing) 6385
63.8%
ValueCountFrequency (%)
0.01 742
7.4%
0.02 397
4.0%
0.03 282
 
2.8%
0.04 228
 
2.3%
0.06 164
 
1.6%
0.07 130
 
1.3%
0.08 113
 
1.1%
0.09 105
 
1.1%
0.1 83
 
0.8%
0.11 90
 
0.9%
ValueCountFrequency (%)
0.51 1
 
< 0.1%
0.44 1
 
< 0.1%
0.43 1
 
< 0.1%
0.41 2
 
< 0.1%
0.4 5
 
0.1%
0.39 4
 
< 0.1%
0.38 2
 
< 0.1%
0.37 7
 
0.1%
0.36 12
0.1%
0.34 26
0.3%

트립어드바이저 평점
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct9
Distinct (%)5.6%
Missing9838
Missing (%)98.4%
Infinite0
Infinite (%)0.0%
Mean4.0987654
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T15:18:21.451388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q14
median4.25
Q35
95-th percentile5
Maximum5
Range4
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.95851773
Coefficient of variation (CV)0.23385523
Kurtosis2.9651189
Mean4.0987654
Median Absolute Deviation (MAD)0.25
Skewness-1.6673475
Sum664
Variance0.91875623
MonotonicityNot monotonic
2023-12-16T15:18:22.214209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
4.0 48
 
0.5%
5.0 45
 
0.4%
4.5 36
 
0.4%
3.0 14
 
0.1%
3.5 8
 
0.1%
1.0 7
 
0.1%
2.0 2
 
< 0.1%
1.5 1
 
< 0.1%
2.5 1
 
< 0.1%
(Missing) 9838
98.4%
ValueCountFrequency (%)
1.0 7
 
0.1%
1.5 1
 
< 0.1%
2.0 2
 
< 0.1%
2.5 1
 
< 0.1%
3.0 14
 
0.1%
3.5 8
 
0.1%
4.0 48
0.5%
4.5 36
0.4%
5.0 45
0.4%
ValueCountFrequency (%)
5.0 45
0.4%
4.5 36
0.4%
4.0 48
0.5%
3.5 8
 
0.1%
3.0 14
 
0.1%
2.5 1
 
< 0.1%
2.0 2
 
< 0.1%
1.5 1
 
< 0.1%
1.0 7
 
0.1%

씨트립 평점
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9997 
4.5
 
1
3.7
 
1
5.0
 
1

Length

Max length4
Median length4
Mean length3.9997
Min length3

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9997
> 99.9%
4.5 1
 
< 0.1%
3.7 1
 
< 0.1%
5.0 1
 
< 0.1%

Length

2023-12-16T15:18:23.048718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T15:18:23.943638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9997
> 99.9%
4.5 1
 
< 0.1%
3.7 1
 
< 0.1%
5.0 1
 
< 0.1%

네이버 평점
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct172
Distinct (%)3.6%
Missing5233
Missing (%)52.3%
Infinite0
Infinite (%)0.0%
Mean4.3381771
Minimum0.5
Maximum5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-16T15:18:24.623775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.5
5-th percentile3.75
Q14.2
median4.36
Q34.51
95-th percentile5
Maximum5
Range4.5
Interquartile range (IQR)0.31

Descriptive statistics

Standard deviation0.37270288
Coefficient of variation (CV)0.085912327
Kurtosis11.44212
Mean4.3381771
Median Absolute Deviation (MAD)0.16
Skewness-1.9193052
Sum20680.09
Variance0.13890744
MonotonicityNot monotonic
2023-12-16T15:18:25.647817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5.0 269
 
2.7%
4.5 219
 
2.2%
4.0 172
 
1.7%
4.33 118
 
1.2%
4.38 111
 
1.1%
4.25 109
 
1.1%
4.36 101
 
1.0%
4.3 96
 
1.0%
4.4 91
 
0.9%
4.42 83
 
0.8%
Other values (162) 3398
34.0%
(Missing) 5233
52.3%
ValueCountFrequency (%)
0.5 1
 
< 0.1%
1.0 4
 
< 0.1%
1.5 1
 
< 0.1%
2.0 1
 
< 0.1%
2.25 2
 
< 0.1%
2.5 13
0.1%
2.75 1
 
< 0.1%
2.83 3
 
< 0.1%
2.88 1
 
< 0.1%
2.9 1
 
< 0.1%
ValueCountFrequency (%)
5.0 269
2.7%
4.98 1
 
< 0.1%
4.96 1
 
< 0.1%
4.95 2
 
< 0.1%
4.94 2
 
< 0.1%
4.93 3
 
< 0.1%
4.92 1
 
< 0.1%
4.91 3
 
< 0.1%
4.9 7
 
0.1%
4.89 7
 
0.1%

Interactions

2023-12-16T15:17:53.452439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:35.543371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:38.162775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:41.896133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:45.520000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:49.971572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:54.065098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:35.854200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:38.524595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:42.501360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:46.321057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:50.661687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:54.620196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:36.138407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:38.917331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:43.238298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:46.983231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:51.153793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:55.255804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:36.853774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:39.874992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:43.640533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:47.579783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:51.749097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:55.745654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:37.310134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:40.679764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:44.230237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:48.086710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:52.321313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:56.313666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:37.632864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:41.280184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:44.885114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:49.228988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:17:52.938166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-16T15:18:26.145449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
식당ID지역명어워드정보설명RTI지수온라인화진행여부수용태세지수인기도트립어드바이저 평점씨트립 평점네이버 평점
식당ID1.0000.5340.5010.6910.8570.5720.2080.0751.0000.126
지역명0.5341.0000.8280.1900.2910.2040.1840.7501.0000.128
어워드정보설명0.5010.8281.0000.4370.3330.1200.6500.0001.0000.441
RTI지수0.6910.1900.4371.0000.5820.6850.5040.2091.0000.225
온라인화진행여부0.8570.2910.3330.5821.0000.6660.2840.0001.0000.138
수용태세지수0.5720.2040.1200.6850.6661.0000.3810.0001.0000.138
인기도0.2080.1840.6500.5040.2840.3811.0000.3141.0000.146
트립어드바이저 평점0.0750.7500.0000.2090.0000.0000.3141.0001.0000.000
씨트립 평점1.0001.0001.0001.0001.0001.0001.0001.0001.000NaN
네이버 평점0.1260.1280.4410.2250.1380.1380.1460.000NaN1.000
2023-12-16T15:18:26.895013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역명씨트립 평점온라인화진행여부
지역명1.0001.0000.230
씨트립 평점1.0001.0001.000
온라인화진행여부0.2301.0001.000
2023-12-16T15:18:27.286608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
식당IDRTI지수수용태세지수인기도트립어드바이저 평점네이버 평점지역명온라인화진행여부씨트립 평점
식당ID1.000-0.414-0.391-0.130-0.0660.0660.2500.6561.000
RTI지수-0.4141.0000.7130.714-0.2770.0370.0740.5851.000
수용태세지수-0.3910.7131.0000.3600.0350.1600.0770.5051.000
인기도-0.1300.7140.3601.0000.0090.0510.0660.2011.000
트립어드바이저 평점-0.066-0.2770.0350.0091.0000.1370.3900.0001.000
네이버 평점0.0660.0370.1600.0510.1371.0000.0490.1331.000
지역명0.2500.0740.0770.0660.3900.0491.0000.2301.000
온라인화진행여부0.6560.5850.5050.2010.0000.1330.2301.0001.000
씨트립 평점1.0001.0001.0001.0001.0001.0001.0001.0001.000

Missing values

2023-12-16T15:17:57.050640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-16T15:17:58.260226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-16T15:17:59.235953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

식당ID식당명지점명지역명어워드정보설명RTI지수온라인화진행여부수용태세지수인기도트립어드바이저 평점씨트립 평점네이버 평점
27313968651드루와<NA>강진군<NA>2.495511<NA><NA><NA><NA><NA><NA>
7330172301백팔번<NA>광양시<NA>3.815154N0.2010.01<NA><NA>4.5
10641181353소나무꼬마김밥<NA>순천시<NA>4.300699N0.2180.03<NA><NA>4.02
8626175168동향국밥<NA>광양시<NA><NA><NA>0.029<NA><NA><NA><NA>
7730173154해뜰짱분식<NA>무안군<NA>3.996129Y0.2170.02<NA><NA>4.16
15626197023공룡알 팬션 횟집<NA>완도군<NA>0.910544N0.201<NA><NA><NA>4.67
23667700810우여사포차<NA>순천시<NA>0.018621<NA>0.029<NA><NA><NA>4.38
15744197269길목집<NA>장흥군<NA>0.910544N0.198<NA><NA><NA><NA>
18158241272속풀이 식당<NA>여수시<NA>4.207309N0.2150.2<NA><NA>4.14
16364198646장성댐식당<NA>장성군<NA>4.274447N0.2150.14<NA><NA>4.79
식당ID식당명지점명지역명어워드정보설명RTI지수온라인화진행여부수용태세지수인기도트립어드바이저 평점씨트립 평점네이버 평점
231532476일미정<NA>해남군<NA>4.169592Y0.2160.01<NA><NA>4.06
19567273650유앤라<NA>여수시<NA>4.22166N0.2080.19<NA><NA><NA>
6552169061푸른바다<NA>광양시<NA>4.402488Y0.2170.11<NA><NA>4.26
13620193676새림정<NA>영암군<NA>4.359258N0.2010.08<NA><NA>4.48
67023195문수골<NA>여수시모범음식점(2003),안심식당(2020)4.126017Y0.2730.17<NA><NA>4.28
289181026433말괄량이 식빵고로케<NA>해남군<NA>2.495511<NA><NA><NA><NA><NA><NA>
5463165215카페루트77<NA>강진군<NA>4.409429N0.2310.22<NA><NA>4.41
95823775거빈한정식<NA>해남군모범음식점(2017),안심식당(2020)4.06663N0.2650.1<NA><NA>3.63
26542747712대덕회진장례식장<NA>장흥군<NA>0.910544N0.17<NA><NA><NA><NA>
10657181411오소돈순천시연향점순천시<NA>4.021096N0.20.03<NA><NA>4.35