Overview

Dataset statistics

Number of variables6
Number of observations133
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.4 KiB
Average record size in memory49.0 B

Variable types

Text3
Categorical1
DateTime2

Dataset

Description제주특별자치도 서귀포시 관내 음식점 위생등급지정현황에 관한 데이터로 업소명,지정번호,주소,위생등급, 지정일자 등에 대한 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15055970/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
지정등급 is highly imbalanced (60.4%)Imbalance
지정번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:57:28.715926
Analysis finished2023-12-12 08:57:29.314210
Duration0.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct132
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T17:57:29.531794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length22
Mean length10.62406
Min length2

Characters and Unicode

Total characters1413
Distinct characters258
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique131 ?
Unique (%)98.5%

Sample

1st row네네치킨남원점
2nd rowBHC제주남원점
3rd rowBHC치킨신서귀포점
4th row비에이치씨제주강정점
5th row케이에프씨(KFC)서귀포중문DT점
ValueCountFrequency (%)
베스킨라빈스 2
 
1.2%
파리바게뜨 2
 
1.2%
on 2
 
1.2%
중문점 2
 
1.2%
dining 2
 
1.2%
snacks 2
 
1.2%
스타벅스커피 2
 
1.2%
lounge 2
 
1.2%
파리바게뜨서귀포시강정지구 1
 
0.6%
도미노피자제주서귀포점 1
 
0.6%
Other values (149) 149
89.2%
2023-12-12T17:57:29.973702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
82
 
5.8%
56
 
4.0%
39
 
2.8%
38
 
2.7%
38
 
2.7%
37
 
2.6%
37
 
2.6%
34
 
2.4%
33
 
2.3%
23
 
1.6%
Other values (248) 996
70.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1093
77.4%
Lowercase Letter 142
 
10.0%
Uppercase Letter 80
 
5.7%
Space Separator 34
 
2.4%
Open Punctuation 20
 
1.4%
Close Punctuation 20
 
1.4%
Decimal Number 20
 
1.4%
Modifier Symbol 2
 
0.1%
Other Punctuation 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
82
 
7.5%
56
 
5.1%
39
 
3.6%
38
 
3.5%
38
 
3.5%
37
 
3.4%
37
 
3.4%
33
 
3.0%
23
 
2.1%
22
 
2.0%
Other values (194) 688
62.9%
Lowercase Letter
ValueCountFrequency (%)
n 21
14.8%
e 15
10.6%
i 15
10.6%
a 13
9.2%
o 11
 
7.7%
g 9
 
6.3%
s 8
 
5.6%
r 7
 
4.9%
u 7
 
4.9%
c 6
 
4.2%
Other values (13) 30
21.1%
Uppercase Letter
ValueCountFrequency (%)
B 12
15.0%
D 10
12.5%
C 9
11.2%
T 8
10.0%
S 7
8.8%
L 7
8.8%
H 6
7.5%
F 3
 
3.8%
J 3
 
3.8%
W 2
 
2.5%
Other values (10) 13
16.2%
Decimal Number
ValueCountFrequency (%)
5 5
25.0%
2 5
25.0%
6 4
20.0%
0 4
20.0%
1 2
 
10.0%
Space Separator
ValueCountFrequency (%)
34
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Modifier Symbol
ValueCountFrequency (%)
´ 2
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1093
77.4%
Latin 222
 
15.7%
Common 98
 
6.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
82
 
7.5%
56
 
5.1%
39
 
3.6%
38
 
3.5%
38
 
3.5%
37
 
3.4%
37
 
3.4%
33
 
3.0%
23
 
2.1%
22
 
2.0%
Other values (194) 688
62.9%
Latin
ValueCountFrequency (%)
n 21
 
9.5%
e 15
 
6.8%
i 15
 
6.8%
a 13
 
5.9%
B 12
 
5.4%
o 11
 
5.0%
D 10
 
4.5%
g 9
 
4.1%
C 9
 
4.1%
T 8
 
3.6%
Other values (33) 99
44.6%
Common
ValueCountFrequency (%)
34
34.7%
( 20
20.4%
) 20
20.4%
5 5
 
5.1%
2 5
 
5.1%
6 4
 
4.1%
0 4
 
4.1%
´ 2
 
2.0%
1 2
 
2.0%
& 1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1093
77.4%
ASCII 318
 
22.5%
None 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
82
 
7.5%
56
 
5.1%
39
 
3.6%
38
 
3.5%
38
 
3.5%
37
 
3.4%
37
 
3.4%
33
 
3.0%
23
 
2.1%
22
 
2.0%
Other values (194) 688
62.9%
ASCII
ValueCountFrequency (%)
34
 
10.7%
n 21
 
6.6%
( 20
 
6.3%
) 20
 
6.3%
e 15
 
4.7%
i 15
 
4.7%
a 13
 
4.1%
B 12
 
3.8%
o 11
 
3.5%
D 10
 
3.1%
Other values (43) 147
46.2%
None
ValueCountFrequency (%)
´ 2
100.0%
Distinct118
Distinct (%)88.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T17:57:30.289948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length42
Mean length30.887218
Min length24

Characters and Unicode

Total characters4108
Distinct characters149
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)83.5%

Sample

1st row제주특별자치도 서귀포시 남원읍 남원체육관로 342(상가동 1층 103호 서귀포남원엘에이치아파트)
2nd row제주특별자치도 서귀포시 남원읍 남원회관로 55
3rd row제주특별자치도 서귀포시 신서귀로 37(1층 법환동)
4th row제주특별자치도 서귀포시 이어도로 602(1층 강정동)
5th row제주특별자치도 서귀포시 중문관광로 90(1층 색달동)
ValueCountFrequency (%)
제주특별자치도 133
19.8%
서귀포시 133
19.8%
안덕면 22
 
3.3%
신화역사로304번길 21
 
3.1%
대정읍 14
 
2.1%
표선면 12
 
1.8%
색달동 11
 
1.6%
중문관광로72번길 9
 
1.3%
성산읍 8
 
1.2%
남원읍 8
 
1.2%
Other values (204) 302
44.9%
2023-12-12T17:57:30.807630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
540
 
13.1%
164
 
4.0%
147
 
3.6%
1 145
 
3.5%
144
 
3.5%
144
 
3.5%
136
 
3.3%
135
 
3.3%
135
 
3.3%
134
 
3.3%
Other values (139) 2284
55.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2715
66.1%
Decimal Number 585
 
14.2%
Space Separator 540
 
13.1%
Open Punctuation 112
 
2.7%
Close Punctuation 112
 
2.7%
Dash Punctuation 15
 
0.4%
Other Punctuation 11
 
0.3%
Lowercase Letter 11
 
0.3%
Uppercase Letter 5
 
0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
164
 
6.0%
147
 
5.4%
144
 
5.3%
144
 
5.3%
136
 
5.0%
135
 
5.0%
135
 
5.0%
134
 
4.9%
134
 
4.9%
133
 
4.9%
Other values (110) 1309
48.2%
Decimal Number
ValueCountFrequency (%)
1 145
24.8%
3 81
13.8%
2 74
12.6%
4 54
 
9.2%
0 52
 
8.9%
8 47
 
8.0%
6 36
 
6.2%
7 36
 
6.2%
9 30
 
5.1%
5 30
 
5.1%
Lowercase Letter
ValueCountFrequency (%)
e 2
18.2%
a 2
18.2%
u 2
18.2%
j 1
9.1%
n 1
9.1%
t 1
9.1%
q 1
9.1%
l 1
9.1%
Uppercase Letter
ValueCountFrequency (%)
A 2
40.0%
F 1
20.0%
J 1
20.0%
P 1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 10
90.9%
. 1
 
9.1%
Space Separator
ValueCountFrequency (%)
540
100.0%
Open Punctuation
ValueCountFrequency (%)
( 112
100.0%
Close Punctuation
ValueCountFrequency (%)
) 112
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2715
66.1%
Common 1377
33.5%
Latin 16
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
164
 
6.0%
147
 
5.4%
144
 
5.3%
144
 
5.3%
136
 
5.0%
135
 
5.0%
135
 
5.0%
134
 
4.9%
134
 
4.9%
133
 
4.9%
Other values (110) 1309
48.2%
Common
ValueCountFrequency (%)
540
39.2%
1 145
 
10.5%
( 112
 
8.1%
) 112
 
8.1%
3 81
 
5.9%
2 74
 
5.4%
4 54
 
3.9%
0 52
 
3.8%
8 47
 
3.4%
6 36
 
2.6%
Other values (7) 124
 
9.0%
Latin
ValueCountFrequency (%)
e 2
12.5%
a 2
12.5%
A 2
12.5%
u 2
12.5%
F 1
6.2%
j 1
6.2%
n 1
6.2%
J 1
6.2%
t 1
6.2%
q 1
6.2%
Other values (2) 2
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2715
66.1%
ASCII 1393
33.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
540
38.8%
1 145
 
10.4%
( 112
 
8.0%
) 112
 
8.0%
3 81
 
5.8%
2 74
 
5.3%
4 54
 
3.9%
0 52
 
3.7%
8 47
 
3.4%
6 36
 
2.6%
Other values (19) 140
 
10.1%
Hangul
ValueCountFrequency (%)
164
 
6.0%
147
 
5.4%
144
 
5.3%
144
 
5.3%
136
 
5.0%
135
 
5.0%
135
 
5.0%
134
 
4.9%
134
 
4.9%
133
 
4.9%
Other values (110) 1309
48.2%

지정등급
Categorical

IMBALANCE 

Distinct3
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
매우우수
117 
우수
12 
좋음
 
4

Length

Max length4
Median length4
Mean length3.7593985
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row매우우수
2nd row매우우수
3rd row매우우수
4th row매우우수
5th row우수

Common Values

ValueCountFrequency (%)
매우우수 117
88.0%
우수 12
 
9.0%
좋음 4
 
3.0%

Length

2023-12-12T17:57:30.991582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:57:31.111856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
매우우수 117
88.0%
우수 12
 
9.0%
좋음 4
 
3.0%

지정번호
Text

UNIQUE 

Distinct133
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T17:57:31.389968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length11
Mean length11
Min length11

Characters and Unicode

Total characters1463
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique133 ?
Unique (%)100.0%

Sample

1st rowHG210002621
2nd rowHG210002622
3rd rowHG210002635
4th rowHG210002654
5th rowHG210002639
ValueCountFrequency (%)
hg210002621 1
 
0.8%
hg220001132 1
 
0.8%
hg200006292 1
 
0.8%
hg220007351 1
 
0.8%
hg220007365 1
 
0.8%
hg220007382 1
 
0.8%
hg220007009 1
 
0.8%
hg200005583 1
 
0.8%
hg220006198 1
 
0.8%
hg220006041 1
 
0.8%
Other values (123) 123
92.5%
2023-12-12T17:57:31.851841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 478
32.7%
2 214
14.6%
1 143
 
9.8%
H 133
 
9.1%
G 133
 
9.1%
9 74
 
5.1%
3 65
 
4.4%
4 54
 
3.7%
6 46
 
3.1%
5 44
 
3.0%
Other values (2) 79
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1197
81.8%
Uppercase Letter 266
 
18.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 478
39.9%
2 214
17.9%
1 143
 
11.9%
9 74
 
6.2%
3 65
 
5.4%
4 54
 
4.5%
6 46
 
3.8%
5 44
 
3.7%
7 42
 
3.5%
8 37
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
H 133
50.0%
G 133
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1197
81.8%
Latin 266
 
18.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 478
39.9%
2 214
17.9%
1 143
 
11.9%
9 74
 
6.2%
3 65
 
5.4%
4 54
 
4.5%
6 46
 
3.8%
5 44
 
3.7%
7 42
 
3.5%
8 37
 
3.1%
Latin
ValueCountFrequency (%)
H 133
50.0%
G 133
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1463
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 478
32.7%
2 214
14.6%
1 143
 
9.8%
H 133
 
9.1%
G 133
 
9.1%
9 74
 
5.1%
3 65
 
4.4%
4 54
 
3.7%
6 46
 
3.1%
5 44
 
3.0%
Other values (2) 79
 
5.4%
Distinct70
Distinct (%)52.6%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
Minimum2021-04-12 00:00:00
Maximum2023-04-05 00:00:00
2023-12-12T17:57:32.124936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:57:32.287246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
Minimum2023-04-12 00:00:00
Maximum2023-04-12 00:00:00
2023-12-12T17:57:32.440119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:57:32.588933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-12T17:57:32.672043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지정등급지정일자
지정등급1.0000.905
지정일자0.9051.000

Missing values

2023-12-12T17:57:29.125637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:57:29.245089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명소재지지정등급지정번호지정일자데이터기준일자
0네네치킨남원점제주특별자치도 서귀포시 남원읍 남원체육관로 342(상가동 1층 103호 서귀포남원엘에이치아파트)매우우수HG2100026212021-04-122023-04-12
1BHC제주남원점제주특별자치도 서귀포시 남원읍 남원회관로 55매우우수HG2100026222021-04-122023-04-12
2BHC치킨신서귀포점제주특별자치도 서귀포시 신서귀로 37(1층 법환동)매우우수HG2100026352021-04-122023-04-12
3비에이치씨제주강정점제주특별자치도 서귀포시 이어도로 602(1층 강정동)매우우수HG2100026542021-04-122023-04-12
4케이에프씨(KFC)서귀포중문DT점제주특별자치도 서귀포시 중문관광로 90(1층 색달동)우수HG2100026392021-04-122023-04-12
5비에이치시중문점제주특별자치도 서귀포시 중문관광로72번길 29-9(1층 색달동)매우우수HG2100026462021-04-122023-04-12
6비에이치씨제주모슬포점제주특별자치도 서귀포시 대정읍 최남단해안로 1매우우수HG2100027832021-04-152023-04-12
7BHC 성산점제주특별자치도 서귀포시 성산읍 성산중앙로 48매우우수HG2100027722021-04-152023-04-12
8비에이치씨제주표선점제주특별자치도 서귀포시 표선면 표선중앙로 77매우우수HG2100027822021-04-152023-04-12
9제주비에이치씨신화월드제주특별자치도 서귀포시 안덕면 신화역사로304번길 133(1층)매우우수HG2100035642021-04-302023-04-12
업소명소재지지정등급지정번호지정일자데이터기준일자
123카페프랑제리제주특별자치도 서귀포시 중문관광로72번길 29-29(색달동)매우우수HG2300006452023-02-212023-04-12
124수마제주특별자치도 서귀포시 성산읍 일출로 264-6매우우수HG2100010352023-02-262023-04-12
125이디야커피성읍점제주특별자치도 서귀포시 표선면 번영로 2644매우우수HG2100010232023-02-262023-04-12
126비에이치씨제주동홍점제주특별자치도 서귀포시 동홍동로 27(5,6호 동홍동)매우우수HG2300009902023-03-102023-04-12
127(주)올더타임토평제주특별자치도 서귀포시 516로 73(1층 토평동)우수HG2100017372023-03-182023-04-12
128베스킨라빈스제주특별자치도 서귀포시 일주동로 8674-1(동홍동)매우우수HG2100017462023-03-182023-04-12
129베스킨라빈스제주특별자치도 서귀포시 중정로 60(서귀동)매우우수HG2100017452023-03-182023-04-12
130파리바게뜨 중문점제주특별자치도 서귀포시 천제연로 186(중문동)매우우수HG2100019392023-03-232023-04-12
131BHC서귀행복점제주특별자치도 서귀포시 홍중로 64-1(1층 서홍동)매우우수HG2100019402023-03-232023-04-12
132본도시락제주서귀포점제주특별자치도 서귀포시 중앙로47번길 7(1층 서귀동)우수HG2300017422023-04-052023-04-12