Overview

Dataset statistics

Number of variables5
Number of observations197
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.0 KiB
Average record size in memory41.7 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description부산광역시연제구_공중위생서비스평가결과_20221208
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15051417

Alerts

연번 is highly overall correlated with 업종 and 1 other fieldsHigh correlation
업종 is highly overall correlated with 연번High correlation
평가등급 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
상호 has unique valuesUnique
소재지 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:08:49.988856
Analysis finished2023-12-10 17:08:50.961322
Duration0.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct197
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean99
Minimum1
Maximum197
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-11T02:08:51.069060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10.8
Q150
median99
Q3148
95-th percentile187.2
Maximum197
Range196
Interquartile range (IQR)98

Descriptive statistics

Standard deviation57.013156
Coefficient of variation (CV)0.57589047
Kurtosis-1.2
Mean99
Median Absolute Deviation (MAD)49
Skewness0
Sum19503
Variance3250.5
MonotonicityStrictly increasing
2023-12-11T02:08:51.292580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
125 1
 
0.5%
127 1
 
0.5%
128 1
 
0.5%
129 1
 
0.5%
130 1
 
0.5%
131 1
 
0.5%
132 1
 
0.5%
133 1
 
0.5%
134 1
 
0.5%
Other values (187) 187
94.9%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
197 1
0.5%
196 1
0.5%
195 1
0.5%
194 1
0.5%
193 1
0.5%
192 1
0.5%
191 1
0.5%
190 1
0.5%
189 1
0.5%
188 1
0.5%

업종
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
세탁업
87 
숙박업(일반)
62 
목욕장업
41 
숙박업(생활)
 
7

Length

Max length7
Median length4
Mean length4.6091371
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
세탁업 87
44.2%
숙박업(일반) 62
31.5%
목욕장업 41
20.8%
숙박업(생활) 7
 
3.6%

Length

2023-12-11T02:08:51.561750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:08:51.783763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
세탁업 87
44.2%
숙박업(일반 62
31.5%
목욕장업 41
20.8%
숙박업(생활 7
 
3.6%

상호
Text

UNIQUE 

Distinct197
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-11T02:08:52.207668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length15
Mean length5.1675127
Min length2

Characters and Unicode

Total characters1018
Distinct characters242
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique197 ?
Unique (%)100.0%

Sample

1st row에그(egg)모텔
2nd row샤이어호텔
3rd row오모텔
4th row더 제니스 호텔
5th row호텔 로이
ValueCountFrequency (%)
호텔 6
 
2.6%
홈텔 2
 
0.9%
무지개 2
 
0.9%
백성 2
 
0.9%
세탁 2
 
0.9%
에그(egg)모텔 1
 
0.4%
동방빨래방 1
 
0.4%
연제점 1
 
0.4%
부산세탁 1
 
0.4%
스마트명품 1
 
0.4%
Other values (212) 212
91.8%
2023-12-11T02:08:52.840979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45
 
4.4%
44
 
4.3%
44
 
4.3%
35
 
3.4%
29
 
2.8%
27
 
2.7%
22
 
2.2%
22
 
2.2%
19
 
1.9%
17
 
1.7%
Other values (232) 714
70.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 893
87.7%
Uppercase Letter 46
 
4.5%
Space Separator 35
 
3.4%
Close Punctuation 13
 
1.3%
Open Punctuation 13
 
1.3%
Decimal Number 9
 
0.9%
Lowercase Letter 8
 
0.8%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45
 
5.0%
44
 
4.9%
44
 
4.9%
29
 
3.2%
27
 
3.0%
22
 
2.5%
22
 
2.5%
19
 
2.1%
17
 
1.9%
16
 
1.8%
Other values (197) 608
68.1%
Uppercase Letter
ValueCountFrequency (%)
O 7
15.2%
T 4
 
8.7%
B 4
 
8.7%
I 3
 
6.5%
N 3
 
6.5%
Y 3
 
6.5%
H 3
 
6.5%
A 2
 
4.3%
W 2
 
4.3%
L 2
 
4.3%
Other values (9) 13
28.3%
Decimal Number
ValueCountFrequency (%)
1 3
33.3%
2 2
22.2%
7 1
 
11.1%
9 1
 
11.1%
5 1
 
11.1%
4 1
 
11.1%
Lowercase Letter
ValueCountFrequency (%)
g 2
25.0%
e 2
25.0%
h 1
12.5%
t 1
12.5%
u 1
12.5%
o 1
12.5%
Space Separator
ValueCountFrequency (%)
35
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 893
87.7%
Common 71
 
7.0%
Latin 54
 
5.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45
 
5.0%
44
 
4.9%
44
 
4.9%
29
 
3.2%
27
 
3.0%
22
 
2.5%
22
 
2.5%
19
 
2.1%
17
 
1.9%
16
 
1.8%
Other values (197) 608
68.1%
Latin
ValueCountFrequency (%)
O 7
 
13.0%
T 4
 
7.4%
B 4
 
7.4%
I 3
 
5.6%
N 3
 
5.6%
Y 3
 
5.6%
H 3
 
5.6%
A 2
 
3.7%
W 2
 
3.7%
L 2
 
3.7%
Other values (15) 21
38.9%
Common
ValueCountFrequency (%)
35
49.3%
) 13
 
18.3%
( 13
 
18.3%
1 3
 
4.2%
2 2
 
2.8%
- 1
 
1.4%
7 1
 
1.4%
9 1
 
1.4%
5 1
 
1.4%
4 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 893
87.7%
ASCII 125
 
12.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
45
 
5.0%
44
 
4.9%
44
 
4.9%
29
 
3.2%
27
 
3.0%
22
 
2.5%
22
 
2.5%
19
 
2.1%
17
 
1.9%
16
 
1.8%
Other values (197) 608
68.1%
ASCII
ValueCountFrequency (%)
35
28.0%
) 13
 
10.4%
( 13
 
10.4%
O 7
 
5.6%
T 4
 
3.2%
B 4
 
3.2%
I 3
 
2.4%
N 3
 
2.4%
1 3
 
2.4%
Y 3
 
2.4%
Other values (25) 37
29.6%

소재지
Text

UNIQUE 

Distinct197
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-11T02:08:53.332654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length46
Mean length29.22335
Min length21

Characters and Unicode

Total characters5757
Distinct characters143
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique197 ?
Unique (%)100.0%

Sample

1st row부산광역시 연제구 고분로13번길 13 (연산동)
2nd row부산광역시 연제구 반송로 18-6 (연산동)
3rd row부산광역시 연제구 과정로 165-2 (연산동)
4th row부산광역시 연제구 거제천로152번길 66 (연산동)
5th row부산광역시 연제구 과정로191번길 41 (연산동)
ValueCountFrequency (%)
부산광역시 197
18.5%
연제구 197
18.5%
연산동 150
 
14.1%
거제동 27
 
2.5%
1층 18
 
1.7%
과정로191번길 7
 
0.7%
반송로 7
 
0.7%
고분로 6
 
0.6%
30 6
 
0.6%
10 6
 
0.6%
Other values (277) 446
41.8%
2023-12-11T02:08:54.084320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
871
 
15.1%
383
 
6.7%
370
 
6.4%
1 265
 
4.6%
263
 
4.6%
218
 
3.8%
207
 
3.6%
200
 
3.5%
( 199
 
3.5%
) 199
 
3.5%
Other values (133) 2582
44.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3460
60.1%
Decimal Number 922
 
16.0%
Space Separator 871
 
15.1%
Open Punctuation 199
 
3.5%
Close Punctuation 199
 
3.5%
Other Punctuation 69
 
1.2%
Dash Punctuation 22
 
0.4%
Uppercase Letter 8
 
0.1%
Math Symbol 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
383
 
11.1%
370
 
10.7%
263
 
7.6%
218
 
6.3%
207
 
6.0%
200
 
5.8%
197
 
5.7%
197
 
5.7%
197
 
5.7%
196
 
5.7%
Other values (112) 1032
29.8%
Decimal Number
ValueCountFrequency (%)
1 265
28.7%
2 136
14.8%
3 105
 
11.4%
0 88
 
9.5%
4 81
 
8.8%
5 69
 
7.5%
6 53
 
5.7%
8 46
 
5.0%
9 40
 
4.3%
7 39
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
B 3
37.5%
C 2
25.0%
A 1
 
12.5%
T 1
 
12.5%
M 1
 
12.5%
Space Separator
ValueCountFrequency (%)
871
100.0%
Open Punctuation
ValueCountFrequency (%)
( 199
100.0%
Close Punctuation
ValueCountFrequency (%)
) 199
100.0%
Other Punctuation
ValueCountFrequency (%)
, 69
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3460
60.1%
Common 2289
39.8%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
383
 
11.1%
370
 
10.7%
263
 
7.6%
218
 
6.3%
207
 
6.0%
200
 
5.8%
197
 
5.7%
197
 
5.7%
197
 
5.7%
196
 
5.7%
Other values (112) 1032
29.8%
Common
ValueCountFrequency (%)
871
38.1%
1 265
 
11.6%
( 199
 
8.7%
) 199
 
8.7%
2 136
 
5.9%
3 105
 
4.6%
0 88
 
3.8%
4 81
 
3.5%
5 69
 
3.0%
, 69
 
3.0%
Other values (6) 207
 
9.0%
Latin
ValueCountFrequency (%)
B 3
37.5%
C 2
25.0%
A 1
 
12.5%
T 1
 
12.5%
M 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3460
60.1%
ASCII 2297
39.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
871
37.9%
1 265
 
11.5%
( 199
 
8.7%
) 199
 
8.7%
2 136
 
5.9%
3 105
 
4.6%
0 88
 
3.8%
4 81
 
3.5%
5 69
 
3.0%
, 69
 
3.0%
Other values (11) 215
 
9.4%
Hangul
ValueCountFrequency (%)
383
 
11.1%
370
 
10.7%
263
 
7.6%
218
 
6.3%
207
 
6.0%
200
 
5.8%
197
 
5.7%
197
 
5.7%
197
 
5.7%
196
 
5.7%
Other values (112) 1032
29.8%

평가등급
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
백색등급
74 
녹색등급
66 
황색등급
57 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row녹색등급
2nd row녹색등급
3rd row녹색등급
4th row녹색등급
5th row녹색등급

Common Values

ValueCountFrequency (%)
백색등급 74
37.6%
녹색등급 66
33.5%
황색등급 57
28.9%

Length

2023-12-11T02:08:54.268390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:08:54.437514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
백색등급 74
37.6%
녹색등급 66
33.5%
황색등급 57
28.9%

Interactions

2023-12-11T02:08:50.534188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:08:54.560494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종평가등급
연번1.0000.9020.789
업종0.9021.0000.362
평가등급0.7890.3621.000
2023-12-11T02:08:54.705456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
평가등급업종
평가등급1.0000.350
업종0.3501.000
2023-12-11T02:08:54.832785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종평가등급
연번1.0000.7740.665
업종0.7741.0000.350
평가등급0.6650.3501.000

Missing values

2023-12-11T02:08:50.746195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:08:50.905782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종상호소재지평가등급
01숙박업(일반)에그(egg)모텔부산광역시 연제구 고분로13번길 13 (연산동)녹색등급
12숙박업(일반)샤이어호텔부산광역시 연제구 반송로 18-6 (연산동)녹색등급
23숙박업(일반)오모텔부산광역시 연제구 과정로 165-2 (연산동)녹색등급
34숙박업(일반)더 제니스 호텔부산광역시 연제구 거제천로152번길 66 (연산동)녹색등급
45숙박업(일반)호텔 로이부산광역시 연제구 과정로191번길 41 (연산동)녹색등급
56숙박업(일반)센트럴호텔부산광역시 연제구 중앙대로 1122 (연산동)녹색등급
67숙박업(일반)투헤븐호텔부산광역시 연제구 과정로191번길 25 (연산동)녹색등급
78숙박업(일반)휴모텔부산광역시 연제구 반송로 13-1 (연산동)녹색등급
89숙박업(일반)웁스(OOPS)부산광역시 연제구 월드컵대로119번길 10 (연산동)녹색등급
910숙박업(일반)호텔유아인(You I IN)부산광역시 연제구 월드컵대로120번길 14 (연산동)녹색등급
연번업종상호소재지평가등급
187188세탁업현대홈크리닝부산광역시 연제구 법원북로 34, 상가동 205호 (거제동,거제1차홈타운아파트)백색등급
188189세탁업동원사부산광역시 연제구 거제천로269번길 31, 로얄듀크상가동 104호 (거제동)백색등급
189190세탁업성일부산광역시 연제구 과정로287번길 35 (연산동)백색등급
190191세탁업원세탁부산광역시 연제구 고분로 30 (연산동)백색등급
191192세탁업덕원부산광역시 연제구 마곡천로30번길 26, 1층 (연산동)백색등급
192193세탁업경동세탁소부산광역시 연제구 과정로344번길 34, 1층 (연산동)백색등급
193194세탁업하얀기쁨운동화빨래방부산광역시 연제구 거제천로 187, 1층 (거제동)백색등급
194195세탁업제일세탁부산광역시 연제구 월드컵대로99번길 20 (연산동)백색등급
195196세탁업태양세탁부산광역시 연제구 배산북로 4-2, 1층 (연산동)백색등급
196197세탁업하늘채 명품 세탁부산광역시 연제구 여고로14번길 31, 1층 (거제동)백색등급