Overview

Dataset statistics

Number of variables5
Number of observations770
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory31.0 KiB
Average record size in memory41.2 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description부산광역시연제구2019공중위생서비스평가결과(201912)
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15051417

Alerts

연번 is highly overall correlated with 업종명 and 1 other fieldsHigh correlation
업종명 is highly overall correlated with 연번High correlation
등급 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:08:56.731594
Analysis finished2023-12-10 17:08:57.849072
Duration1.12 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct770
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean385.5
Minimum1
Maximum770
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.9 KiB
2023-12-11T02:08:57.972522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile39.45
Q1193.25
median385.5
Q3577.75
95-th percentile731.55
Maximum770
Range769
Interquartile range (IQR)384.5

Descriptive statistics

Standard deviation222.42414
Coefficient of variation (CV)0.57697573
Kurtosis-1.2
Mean385.5
Median Absolute Deviation (MAD)192.5
Skewness0
Sum296835
Variance49472.5
MonotonicityStrictly increasing
2023-12-11T02:08:58.176752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
579 1
 
0.1%
509 1
 
0.1%
510 1
 
0.1%
511 1
 
0.1%
512 1
 
0.1%
513 1
 
0.1%
514 1
 
0.1%
515 1
 
0.1%
516 1
 
0.1%
Other values (760) 760
98.7%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
770 1
0.1%
769 1
0.1%
768 1
0.1%
767 1
0.1%
766 1
0.1%
765 1
0.1%
764 1
0.1%
763 1
0.1%
762 1
0.1%
761 1
0.1%

업종명
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
미용업(일반)
265 
미용업
197 
미용업(피부)
77 
이용업
68 
미용업(손톱ㆍ발톱)
54 
Other values (11)
109 

Length

Max length31
Median length28
Mean length7.1467532
Min length3

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row이용업
2nd row이용업
3rd row이용업
4th row이용업
5th row이용업

Common Values

ValueCountFrequency (%)
미용업(일반) 265
34.4%
미용업 197
25.6%
미용업(피부) 77
 
10.0%
이용업 68
 
8.8%
미용업(손톱ㆍ발톱) 54
 
7.0%
미용업(종합) 32
 
4.2%
미용업(일반), 미용업(화장ㆍ분장) 17
 
2.2%
미용업(손톱ㆍ발톱), 미용업(화장ㆍ분장) 12
 
1.6%
미용업(일반), 미용업(손톱ㆍ발톱) 12
 
1.6%
미용업(피부), 미용업(화장ㆍ분장) 8
 
1.0%
Other values (6) 28
 
3.6%

Length

2023-12-11T02:08:58.423796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
미용업(일반 305
35.8%
미용업 197
23.1%
미용업(피부 101
 
11.9%
미용업(손톱ㆍ발톱 93
 
10.9%
이용업 68
 
8.0%
미용업(화장ㆍ분장 55
 
6.5%
미용업(종합 32
 
3.8%
Distinct751
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
2023-12-11T02:08:58.791295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length28
Mean length6.012987
Min length1

Characters and Unicode

Total characters4630
Distinct characters470
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique733 ?
Unique (%)95.2%

Sample

1st row맨인블랙
2nd row헤모랩 천연염색·두피케어 전문점
3rd row현대이용원
4th row발리캇트실
5th row맨즈헤어
ValueCountFrequency (%)
헤어 16
 
1.6%
이용원 8
 
0.8%
네일 8
 
0.8%
미용실 7
 
0.7%
hair 7
 
0.7%
by 6
 
0.6%
에스테틱 6
 
0.6%
nail 6
 
0.6%
연산점 4
 
0.4%
바이 4
 
0.4%
Other values (861) 909
92.7%
2023-12-11T02:08:59.588984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
299
 
6.5%
298
 
6.4%
211
 
4.6%
127
 
2.7%
104
 
2.2%
97
 
2.1%
88
 
1.9%
84
 
1.8%
( 82
 
1.8%
) 82
 
1.8%
Other values (460) 3158
68.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3672
79.3%
Uppercase Letter 283
 
6.1%
Lowercase Letter 235
 
5.1%
Space Separator 211
 
4.6%
Open Punctuation 82
 
1.8%
Close Punctuation 82
 
1.8%
Other Punctuation 34
 
0.7%
Decimal Number 23
 
0.5%
Dash Punctuation 7
 
0.2%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
299
 
8.1%
298
 
8.1%
127
 
3.5%
104
 
2.8%
97
 
2.6%
88
 
2.4%
84
 
2.3%
78
 
2.1%
74
 
2.0%
48
 
1.3%
Other values (399) 2375
64.7%
Uppercase Letter
ValueCountFrequency (%)
A 28
 
9.9%
N 25
 
8.8%
R 22
 
7.8%
I 22
 
7.8%
E 20
 
7.1%
L 17
 
6.0%
B 17
 
6.0%
H 16
 
5.7%
M 14
 
4.9%
Y 14
 
4.9%
Other values (14) 88
31.1%
Lowercase Letter
ValueCountFrequency (%)
a 35
14.9%
i 34
14.5%
e 22
9.4%
n 18
7.7%
y 17
7.2%
l 16
 
6.8%
r 15
 
6.4%
o 14
 
6.0%
h 14
 
6.0%
b 10
 
4.3%
Other values (9) 40
17.0%
Other Punctuation
ValueCountFrequency (%)
& 13
38.2%
. 8
23.5%
# 5
 
14.7%
, 5
 
14.7%
' 1
 
2.9%
1
 
2.9%
· 1
 
2.9%
Decimal Number
ValueCountFrequency (%)
1 8
34.8%
2 4
17.4%
0 3
 
13.0%
3 3
 
13.0%
5 3
 
13.0%
4 2
 
8.7%
Space Separator
ValueCountFrequency (%)
211
100.0%
Open Punctuation
ValueCountFrequency (%)
( 82
100.0%
Close Punctuation
ValueCountFrequency (%)
) 82
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3663
79.1%
Latin 518
 
11.2%
Common 440
 
9.5%
Han 9
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
299
 
8.2%
298
 
8.1%
127
 
3.5%
104
 
2.8%
97
 
2.6%
88
 
2.4%
84
 
2.3%
78
 
2.1%
74
 
2.0%
48
 
1.3%
Other values (392) 2366
64.6%
Latin
ValueCountFrequency (%)
a 35
 
6.8%
i 34
 
6.6%
A 28
 
5.4%
N 25
 
4.8%
R 22
 
4.2%
I 22
 
4.2%
e 22
 
4.2%
E 20
 
3.9%
n 18
 
3.5%
L 17
 
3.3%
Other values (33) 275
53.1%
Common
ValueCountFrequency (%)
211
48.0%
( 82
 
18.6%
) 82
 
18.6%
& 13
 
3.0%
1 8
 
1.8%
. 8
 
1.8%
- 7
 
1.6%
# 5
 
1.1%
, 5
 
1.1%
2 4
 
0.9%
Other values (8) 15
 
3.4%
Han
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3663
79.1%
ASCII 956
 
20.6%
CJK 9
 
0.2%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
299
 
8.2%
298
 
8.1%
127
 
3.5%
104
 
2.8%
97
 
2.6%
88
 
2.4%
84
 
2.3%
78
 
2.1%
74
 
2.0%
48
 
1.3%
Other values (392) 2366
64.6%
ASCII
ValueCountFrequency (%)
211
22.1%
( 82
 
8.6%
) 82
 
8.6%
a 35
 
3.7%
i 34
 
3.6%
A 28
 
2.9%
N 25
 
2.6%
R 22
 
2.3%
I 22
 
2.3%
e 22
 
2.3%
Other values (49) 393
41.1%
CJK
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
None
ValueCountFrequency (%)
1
50.0%
· 1
50.0%
Distinct754
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
2023-12-11T02:09:00.121389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length51
Mean length30.711688
Min length20

Characters and Unicode

Total characters23648
Distinct characters224
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique739 ?
Unique (%)96.0%

Sample

1st row부산광역시 연제구 과정로191번가길 62, 1층 (연산동)
2nd row부산광역시 연제구 명륜로2번길 7, 상가101동 2층 213호 (거제동, 삼익퓨쳐타워아파트)
3rd row부산광역시 연제구 과정로344번길 19 (연산동)
4th row부산광역시 연제구 월드컵대로 152 (연산동)
5th row부산광역시 연제구 고분로 108-1, 1층 (연산동)
ValueCountFrequency (%)
부산광역시 817
17.8%
연제구 815
17.7%
연산동 610
 
13.3%
1층 183
 
4.0%
거제동 121
 
2.6%
2층 75
 
1.6%
과정로 33
 
0.7%
연수로 32
 
0.7%
고분로 27
 
0.6%
중앙대로 25
 
0.5%
Other values (645) 1863
40.5%
2023-12-11T02:09:00.905546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3899
 
16.5%
1602
 
6.8%
1512
 
6.4%
1048
 
4.4%
1 996
 
4.2%
880
 
3.7%
872
 
3.7%
845
 
3.6%
837
 
3.5%
834
 
3.5%
Other values (214) 10323
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13947
59.0%
Space Separator 3899
 
16.5%
Decimal Number 3594
 
15.2%
Open Punctuation 774
 
3.3%
Close Punctuation 773
 
3.3%
Other Punctuation 490
 
2.1%
Dash Punctuation 86
 
0.4%
Uppercase Letter 82
 
0.3%
Lowercase Letter 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1602
11.5%
1512
 
10.8%
1048
 
7.5%
880
 
6.3%
872
 
6.3%
845
 
6.1%
837
 
6.0%
834
 
6.0%
833
 
6.0%
770
 
5.5%
Other values (178) 3914
28.1%
Uppercase Letter
ValueCountFrequency (%)
B 12
14.6%
A 11
13.4%
S 8
9.8%
I 8
9.8%
K 8
9.8%
E 6
7.3%
W 5
6.1%
G 5
6.1%
V 5
6.1%
C 4
 
4.9%
Other values (5) 10
12.2%
Decimal Number
ValueCountFrequency (%)
1 996
27.7%
2 615
17.1%
3 446
12.4%
0 333
 
9.3%
4 254
 
7.1%
5 233
 
6.5%
8 204
 
5.7%
6 181
 
5.0%
7 171
 
4.8%
9 161
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 484
98.8%
/ 3
 
0.6%
@ 2
 
0.4%
& 1
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
c 1
50.0%
e 1
50.0%
Space Separator
ValueCountFrequency (%)
3899
100.0%
Open Punctuation
ValueCountFrequency (%)
( 774
100.0%
Close Punctuation
ValueCountFrequency (%)
) 773
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 86
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13947
59.0%
Common 9617
40.7%
Latin 84
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1602
11.5%
1512
 
10.8%
1048
 
7.5%
880
 
6.3%
872
 
6.3%
845
 
6.1%
837
 
6.0%
834
 
6.0%
833
 
6.0%
770
 
5.5%
Other values (178) 3914
28.1%
Common
ValueCountFrequency (%)
3899
40.5%
1 996
 
10.4%
( 774
 
8.0%
) 773
 
8.0%
2 615
 
6.4%
, 484
 
5.0%
3 446
 
4.6%
0 333
 
3.5%
4 254
 
2.6%
5 233
 
2.4%
Other values (9) 810
 
8.4%
Latin
ValueCountFrequency (%)
B 12
14.3%
A 11
13.1%
S 8
9.5%
I 8
9.5%
K 8
9.5%
E 6
7.1%
W 5
6.0%
G 5
6.0%
V 5
6.0%
C 4
 
4.8%
Other values (7) 12
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13947
59.0%
ASCII 9701
41.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3899
40.2%
1 996
 
10.3%
( 774
 
8.0%
) 773
 
8.0%
2 615
 
6.3%
, 484
 
5.0%
3 446
 
4.6%
0 333
 
3.4%
4 254
 
2.6%
5 233
 
2.4%
Other values (26) 894
 
9.2%
Hangul
ValueCountFrequency (%)
1602
11.5%
1512
 
10.8%
1048
 
7.5%
880
 
6.3%
872
 
6.3%
845
 
6.1%
837
 
6.0%
834
 
6.0%
833
 
6.0%
770
 
5.5%
Other values (178) 3914
28.1%

등급
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
최우수업소
463 
우수업소
192 
일반관리업소
115 

Length

Max length6
Median length5
Mean length4.9
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row최우수업소
2nd row최우수업소
3rd row최우수업소
4th row최우수업소
5th row최우수업소

Common Values

ValueCountFrequency (%)
최우수업소 463
60.1%
우수업소 192
24.9%
일반관리업소 115
 
14.9%

Length

2023-12-11T02:09:01.175756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T02:09:01.342282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
최우수업소 463
60.1%
우수업소 192
24.9%
일반관리업소 115
 
14.9%

Interactions

2023-12-11T02:08:57.426380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:09:01.461528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명등급
연번1.0000.8580.908
업종명0.8581.0000.434
등급0.9080.4341.000
2023-12-11T02:09:01.963737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등급업종명
등급1.0000.262
업종명0.2621.000
2023-12-11T02:09:02.105004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명등급
연번1.0000.5600.862
업종명0.5601.0000.262
등급0.8620.2621.000

Missing values

2023-12-11T02:08:57.647611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:08:57.788240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종명업소명업소소재지등급
01이용업맨인블랙부산광역시 연제구 과정로191번가길 62, 1층 (연산동)최우수업소
12이용업헤모랩 천연염색·두피케어 전문점부산광역시 연제구 명륜로2번길 7, 상가101동 2층 213호 (거제동, 삼익퓨쳐타워아파트)최우수업소
23이용업현대이용원부산광역시 연제구 과정로344번길 19 (연산동)최우수업소
34이용업발리캇트실부산광역시 연제구 월드컵대로 152 (연산동)최우수업소
45이용업맨즈헤어부산광역시 연제구 고분로 108-1, 1층 (연산동)최우수업소
56이용업맨 컷트샵 이용원부산광역시 연제구 월드컵대로19번길 22, 1층 (연산동)우수업소
67이용업택시부산광역시 연제구 월드컵대로 399 (거제동)우수업소
78이용업시티 이용원부산광역시 연제구 거제시장로 42 (거제동)우수업소
89이용업국제부산광역시 연제구 거제천로255번가길 31-2 (거제동)우수업소
910이용업문화 이용원부산광역시 연제구 거제대로 156-1, 1층 (거제동)우수업소
연번업종명업소명업소소재지등급
760761미용업(일반), 미용업(피부)MW 메이크업 앤왁싱부산광역시 연제구 중앙대로 1130, 2층 206호 (연산동, SK뷰)일반관리업소
761762미용업(일반), 미용업(화장ㆍ분장)연산동 붙임머리부산광역시 연제구 월드컵대로46번길 12, 1층 (연산동)일반관리업소
762763미용업(종합)마술머리 미용실부산광역시 연제구 해맞이로 71, 1층 (거제동)일반관리업소
763764미용업(종합)미담헤어부산광역시 연제구 중앙대로1134번길 51 (연산동)일반관리업소
764765미용업(종합)수아헤어부산광역시 연제구 금련로 4 (연산동)일반관리업소
765766미용업(종합)유리알피부Skin&Body(스킨앤바디)부산광역시 연제구 거제천로 232-1, 1층 (연산동)일반관리업소
766767미용업(종합)이진헤어부산광역시 연제구 신금로 17, 1층 (연산동)일반관리업소
767768미용업(종합)셀루스웰빙 미용실부산광역시 연제구 과정로265번길 28 (연산동)일반관리업소
768769미용업(피부)다영케어부산광역시 연제구 고분로20번가길 7 (연산동)일반관리업소
769770미용업(피부)고운찬테라피부산광역시 연제구 월드컵대로153번길 18 (연산동,4층)일반관리업소