Overview

Dataset statistics

Number of variables6
Number of observations9570
Missing cells4922
Missing cells (%)8.6%
Duplicate rows2
Duplicate rows (%)< 0.1%
Total size in memory458.1 KiB
Average record size in memory49.0 B

Variable types

Text3
Numeric1
Categorical2

Dataset

Description제주특별자치도 제주시 식품접객업 일반음식점 현황 데이터입니다.
Author제주특별자치도 제주시
URLhttps://www.data.go.kr/data/15055979/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 2 (< 0.1%) duplicate rowsDuplicates
전화번호 has 4922 (51.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 20:34:44.360838
Analysis finished2023-12-12 20:34:46.070070
Duration1.71 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9551
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size74.9 KiB
2023-12-13T05:34:46.308468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length21
Mean length5.4365726
Min length1

Characters and Unicode

Total characters52028
Distinct characters1067
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9532 ?
Unique (%)99.6%

Sample

1st row(주)삼다도횟집
2nd row가영이네횟집
3rd row강강술래
4th row갯바위횟집
5th row거제도횟집
ValueCountFrequency (%)
라온골프클럽 5
 
0.1%
그랑블제주알앤지(주 4
 
< 0.1%
레스토랑 4
 
< 0.1%
주)금자탑 4
 
< 0.1%
스타트하우스 4
 
< 0.1%
주식회사 3
 
< 0.1%
산적 2
 
< 0.1%
수제핫바깡스 2
 
< 0.1%
제주 2
 
< 0.1%
30년할매닭발 2
 
< 0.1%
Other values (9570) 9587
99.7%
2023-12-13T05:34:46.881292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1768
 
3.4%
1283
 
2.5%
1187
 
2.3%
1061
 
2.0%
796
 
1.5%
743
 
1.4%
705
 
1.4%
621
 
1.2%
619
 
1.2%
586
 
1.1%
Other values (1057) 42659
82.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51104
98.2%
Decimal Number 606
 
1.2%
Open Punctuation 104
 
0.2%
Close Punctuation 104
 
0.2%
Space Separator 49
 
0.1%
Uppercase Letter 37
 
0.1%
Lowercase Letter 20
 
< 0.1%
Dash Punctuation 3
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1768
 
3.5%
1283
 
2.5%
1187
 
2.3%
1061
 
2.1%
796
 
1.6%
743
 
1.5%
705
 
1.4%
621
 
1.2%
619
 
1.2%
586
 
1.1%
Other values (1014) 41735
81.7%
Uppercase Letter
ValueCountFrequency (%)
G 7
18.9%
C 6
16.2%
A 4
10.8%
S 4
10.8%
F 3
8.1%
B 3
8.1%
T 2
 
5.4%
E 2
 
5.4%
L 1
 
2.7%
W 1
 
2.7%
Other values (4) 4
10.8%
Lowercase Letter
ValueCountFrequency (%)
e 3
15.0%
o 2
10.0%
y 2
10.0%
n 2
10.0%
r 2
10.0%
a 1
 
5.0%
u 1
 
5.0%
b 1
 
5.0%
f 1
 
5.0%
w 1
 
5.0%
Other values (4) 4
20.0%
Decimal Number
ValueCountFrequency (%)
2 133
21.9%
1 118
19.5%
0 75
12.4%
3 62
10.2%
9 55
9.1%
4 46
 
7.6%
5 33
 
5.4%
7 33
 
5.4%
8 31
 
5.1%
6 20
 
3.3%
Open Punctuation
ValueCountFrequency (%)
( 104
100.0%
Close Punctuation
ValueCountFrequency (%)
) 104
100.0%
Space Separator
ValueCountFrequency (%)
49
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51104
98.2%
Common 867
 
1.7%
Latin 57
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1768
 
3.5%
1283
 
2.5%
1187
 
2.3%
1061
 
2.1%
796
 
1.6%
743
 
1.5%
705
 
1.4%
621
 
1.2%
619
 
1.2%
586
 
1.1%
Other values (1014) 41735
81.7%
Latin
ValueCountFrequency (%)
G 7
 
12.3%
C 6
 
10.5%
A 4
 
7.0%
S 4
 
7.0%
e 3
 
5.3%
F 3
 
5.3%
B 3
 
5.3%
o 2
 
3.5%
y 2
 
3.5%
n 2
 
3.5%
Other values (18) 21
36.8%
Common
ValueCountFrequency (%)
2 133
15.3%
1 118
13.6%
( 104
12.0%
) 104
12.0%
0 75
8.7%
3 62
7.2%
9 55
6.3%
49
 
5.7%
4 46
 
5.3%
5 33
 
3.8%
Other values (5) 88
10.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51104
98.2%
ASCII 924
 
1.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1768
 
3.5%
1283
 
2.5%
1187
 
2.3%
1061
 
2.1%
796
 
1.6%
743
 
1.5%
705
 
1.4%
621
 
1.2%
619
 
1.2%
586
 
1.1%
Other values (1014) 41735
81.7%
ASCII
ValueCountFrequency (%)
2 133
14.4%
1 118
12.8%
( 104
11.3%
) 104
11.3%
0 75
8.1%
3 62
6.7%
9 55
 
6.0%
49
 
5.3%
4 46
 
5.0%
5 33
 
3.6%
Other values (33) 145
15.7%
Distinct8019
Distinct (%)83.8%
Missing0
Missing (%)0.0%
Memory size74.9 KiB
2023-12-13T05:34:47.374130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length29
Mean length21.22163
Min length17

Characters and Unicode

Total characters203091
Distinct characters231
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6924 ?
Unique (%)72.4%

Sample

1st row제주특별자치도 제주시 서해안로 572
2nd row제주특별자치도 제주시 성지로 46-1
3rd row제주특별자치도 제주시 중앙로11길 12
4th row제주특별자치도 제주시 흥운길 99
5th row제주특별자치도 제주시 서부두길 26
ValueCountFrequency (%)
제주시 9572
23.2%
제주특별자치도 9570
23.2%
애월읍 738
 
1.8%
한림읍 648
 
1.6%
구좌읍 559
 
1.4%
조천읍 558
 
1.4%
한경면 216
 
0.5%
중앙로 177
 
0.4%
1 169
 
0.4%
3 167
 
0.4%
Other values (3062) 18834
45.7%
2023-12-13T05:34:48.058517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34525
17.0%
19415
 
9.6%
19209
 
9.5%
10120
 
5.0%
9686
 
4.8%
9574
 
4.7%
9572
 
4.7%
9571
 
4.7%
9570
 
4.7%
6702
 
3.3%
Other values (221) 65147
32.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 138750
68.3%
Space Separator 34525
 
17.0%
Decimal Number 28200
 
13.9%
Dash Punctuation 1615
 
0.8%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19415
14.0%
19209
13.8%
10120
 
7.3%
9686
 
7.0%
9574
 
6.9%
9572
 
6.9%
9571
 
6.9%
9570
 
6.9%
6702
 
4.8%
5056
 
3.6%
Other values (208) 30275
21.8%
Decimal Number
ValueCountFrequency (%)
1 6618
23.5%
2 4011
14.2%
3 3309
11.7%
4 2938
10.4%
5 2336
 
8.3%
6 2334
 
8.3%
7 1737
 
6.2%
0 1692
 
6.0%
8 1676
 
5.9%
9 1549
 
5.5%
Space Separator
ValueCountFrequency (%)
34525
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1615
100.0%
Uppercase Letter
ValueCountFrequency (%)
S 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 138750
68.3%
Common 64340
31.7%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19415
14.0%
19209
13.8%
10120
 
7.3%
9686
 
7.0%
9574
 
6.9%
9572
 
6.9%
9571
 
6.9%
9570
 
6.9%
6702
 
4.8%
5056
 
3.6%
Other values (208) 30275
21.8%
Common
ValueCountFrequency (%)
34525
53.7%
1 6618
 
10.3%
2 4011
 
6.2%
3 3309
 
5.1%
4 2938
 
4.6%
5 2336
 
3.6%
6 2334
 
3.6%
7 1737
 
2.7%
0 1692
 
2.6%
8 1676
 
2.6%
Other values (2) 3164
 
4.9%
Latin
ValueCountFrequency (%)
S 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 138750
68.3%
ASCII 64341
31.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
34525
53.7%
1 6618
 
10.3%
2 4011
 
6.2%
3 3309
 
5.1%
4 2938
 
4.6%
5 2336
 
3.6%
6 2334
 
3.6%
7 1737
 
2.7%
0 1692
 
2.6%
8 1676
 
2.6%
Other values (3) 3165
 
4.9%
Hangul
ValueCountFrequency (%)
19415
14.0%
19209
13.8%
10120
 
7.3%
9686
 
7.0%
9574
 
6.9%
9572
 
6.9%
9571
 
6.9%
9570
 
6.9%
6702
 
4.8%
5056
 
3.6%
Other values (208) 30275
21.8%

영업장면적
Real number (ℝ)

Distinct5923
Distinct (%)61.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean103.71518
Minimum0
Maximum2537.2
Zeros2
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size84.2 KiB
2023-12-13T05:34:48.235639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile24.0815
Q149.5
median78.81
Q3119.13
95-th percentile257.9775
Maximum2537.2
Range2537.2
Interquartile range (IQR)69.63

Descriptive statistics

Standard deviation114.46016
Coefficient of variation (CV)1.1036009
Kurtosis108.43936
Mean103.71518
Median Absolute Deviation (MAD)32.665
Skewness7.8294996
Sum992554.24
Variance13101.129
MonotonicityNot monotonic
2023-12-13T05:34:48.423949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
66.0 102
 
1.1%
33.0 69
 
0.7%
49.5 50
 
0.5%
99.0 42
 
0.4%
82.5 37
 
0.4%
60.0 32
 
0.3%
30.0 31
 
0.3%
40.0 29
 
0.3%
50.0 27
 
0.3%
32.0 24
 
0.3%
Other values (5913) 9127
95.4%
ValueCountFrequency (%)
0.0 2
< 0.1%
6.1 1
< 0.1%
6.6 1
< 0.1%
6.9 1
< 0.1%
7.0 1
< 0.1%
7.26 1
< 0.1%
7.38 1
< 0.1%
7.5 1
< 0.1%
7.92 1
< 0.1%
8.01 1
< 0.1%
ValueCountFrequency (%)
2537.2 1
< 0.1%
2327.93 1
< 0.1%
2316.56 1
< 0.1%
2088.0 1
< 0.1%
2026.0 1
< 0.1%
1980.46 1
< 0.1%
1958.5 1
< 0.1%
1690.0 1
< 0.1%
1621.21 1
< 0.1%
1544.39 1
< 0.1%

전화번호
Text

MISSING 

Distinct4472
Distinct (%)96.2%
Missing4922
Missing (%)51.4%
Memory size74.9 KiB
2023-12-13T05:34:48.690942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.001291
Min length11

Characters and Unicode

Total characters55782
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4336 ?
Unique (%)93.3%

Sample

1st row064-711-7080
2nd row064-757-8581
3rd row064-725-6731
4th row064-712-7600
5th row064-758-3737
ValueCountFrequency (%)
064-795-6336 8
 
0.2%
064-782-3803 7
 
0.2%
064-758-2500 5
 
0.1%
064-724-2001 5
 
0.1%
064-730-7025 4
 
0.1%
064-780-5040 4
 
0.1%
064-796-8400 4
 
0.1%
064-729-8251 4
 
0.1%
064-793-0703 4
 
0.1%
064-784-4811 4
 
0.1%
Other values (4462) 4599
98.9%
2023-12-13T05:34:49.152474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 9296
16.7%
4 7730
13.9%
0 7434
13.3%
7 6816
12.2%
6 6661
11.9%
2 3938
7.1%
5 3368
 
6.0%
9 2791
 
5.0%
1 2633
 
4.7%
8 2616
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 46486
83.3%
Dash Punctuation 9296
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 7730
16.6%
0 7434
16.0%
7 6816
14.7%
6 6661
14.3%
2 3938
8.5%
5 3368
7.2%
9 2791
 
6.0%
1 2633
 
5.7%
8 2616
 
5.6%
3 2499
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 9296
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 55782
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 9296
16.7%
4 7730
13.9%
0 7434
13.3%
7 6816
12.2%
6 6661
11.9%
2 3938
7.1%
5 3368
 
6.0%
9 2791
 
5.0%
1 2633
 
4.7%
8 2616
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 55782
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 9296
16.7%
4 7730
13.9%
0 7434
13.3%
7 6816
12.2%
6 6661
11.9%
2 3938
7.1%
5 3368
 
6.0%
9 2791
 
5.0%
1 2633
 
4.7%
8 2616
 
4.7%

업태명
Categorical

Distinct25
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size74.9 KiB
한식
3832 
기타
2390 
호프/통닭
830 
식육(숯불구이)
595 
경양식
 
358
Other values (20)
1565 

Length

Max length15
Median length2
Mean length2.9210031
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row횟집
2nd row횟집
3rd row횟집
4th row횟집
5th row횟집

Common Values

ValueCountFrequency (%)
한식 3832
40.0%
기타 2390
25.0%
호프/통닭 830
 
8.7%
식육(숯불구이) 595
 
6.2%
경양식 358
 
3.7%
까페 267
 
2.8%
횟집 247
 
2.6%
통닭(치킨) 237
 
2.5%
분식 205
 
2.1%
중국식 194
 
2.0%
Other values (15) 415
 
4.3%

Length

2023-12-13T05:34:49.294760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한식 3832
40.0%
기타 2390
25.0%
호프/통닭 830
 
8.7%
식육(숯불구이 595
 
6.2%
경양식 358
 
3.7%
까페 267
 
2.8%
횟집 247
 
2.6%
통닭(치킨 237
 
2.5%
분식 205
 
2.1%
중국식 194
 
2.0%
Other values (15) 415
 
4.3%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size74.9 KiB
2021-02-23
9570 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-02-23
2nd row2021-02-23
3rd row2021-02-23
4th row2021-02-23
5th row2021-02-23

Common Values

ValueCountFrequency (%)
2021-02-23 9570
100.0%

Length

2023-12-13T05:34:49.425323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:34:49.513198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-02-23 9570
100.0%

Interactions

2023-12-13T05:34:45.640701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:34:49.566173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업장면적업태명
영업장면적1.0000.240
업태명0.2401.000
2023-12-13T05:34:49.646001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업장면적업태명
영업장면적1.0000.087
업태명0.0871.000

Missing values

2023-12-13T05:34:45.822512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:34:45.994016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명소재지영업장면적전화번호업태명데이터기준일자
0(주)삼다도횟집제주특별자치도 제주시 서해안로 572519.96064-711-7080횟집2021-02-23
1가영이네횟집제주특별자치도 제주시 성지로 46-1113.0064-757-8581횟집2021-02-23
2강강술래제주특별자치도 제주시 중앙로11길 1229.48064-725-6731횟집2021-02-23
3갯바위횟집제주특별자치도 제주시 흥운길 99191.08064-712-7600횟집2021-02-23
4거제도횟집제주특별자치도 제주시 서부두길 26110.56064-758-3737횟집2021-02-23
5경락제주특별자치도 제주시 천수로13길 683.33064-726-3380횟집2021-02-23
6계성제주특별자치도 제주시 서사로 16154.1064-757-1161횟집2021-02-23
7고기가게제주특별자치도 제주시 국기중길 12142.06<NA>횟집2021-02-23
8곱스럽닭도남점제주특별자치도 제주시 도남로 25108.89<NA>횟집2021-02-23
9구엄해녀수산제주특별자치도 제주시 애월읍 애월해안로 715-1133.21064-745-1135횟집2021-02-23
업소명소재지영업장면적전화번호업태명데이터기준일자
9560협재화덕도나토스제주특별자치도 제주시 한림읍 협재2길 6130.48064-796-1981경양식2021-02-23
9561호텔리젠트마린더블루라티프제주특별자치도 제주시 서부두2길 20379.88064-717-5040경양식2021-02-23
9562홍대98돈까스제주특별자치도 제주시 아란5길 1871.68<NA>경양식2021-02-23
9563홍스제주특별자치도 제주시 애월읍 애월해안로 881250.38064-742-8848경양식2021-02-23
9564홍익돈까스제주점제주특별자치도 제주시 중앙로 501215.82<NA>경양식2021-02-23
9565회장님댁제주시청점제주특별자치도 제주시 서광로32길 31144.76064-751-1767경양식2021-02-23
9566제주특별자치도 제주시 관덕로15길 15251.01064-726-3398경양식2021-02-23
9567흙먹는고양이제주특별자치도 제주시 구좌읍 세화7길 3773.28<NA>경양식2021-02-23
9568히아담제주특별자치도 제주시 과원로 2893.98064-743-0626경양식2021-02-23
9569와인솔트제주특별자치도 제주시 일주동로 33181.6<NA>감성주점2021-02-23

Duplicate rows

Most frequently occurring

업소명소재지영업장면적전화번호업태명데이터기준일자# duplicates
0슬기제주특별자치도 제주시 사라봉7길 3625.8064-756-4802한식2021-02-232
1터줏대감제주특별자치도 제주시 오일장서길 2632.0<NA>한식2021-02-232