Overview

Dataset statistics

Number of variables6
Number of observations73
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.6 KiB
Average record size in memory50.8 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description강서구 위생관리등급평가정보(업종명, 평가구분(등급), 업소명, 영업소 주소, 영업자), 위생관리등급공표와 관련한 문서로 부산광역시 강서구 내 공중위생 서비스 평가 결과 최우수 및 우수 업소 현황 자료입니다.
URLhttps://www.data.go.kr/data/3045896/fileData.do

Alerts

연번 is highly overall correlated with 업종명 and 1 other fieldsHigh correlation
업종명 is highly overall correlated with 연번High correlation
평가구분(등급) is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
업소명 has unique valuesUnique
영업소 주소(도로명) has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:16:14.826365
Analysis finished2023-12-12 23:16:15.395316
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct73
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37
Minimum1
Maximum73
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size789.0 B
2023-12-13T08:16:15.449612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.6
Q119
median37
Q355
95-th percentile69.4
Maximum73
Range72
Interquartile range (IQR)36

Descriptive statistics

Standard deviation21.217131
Coefficient of variation (CV)0.57343598
Kurtosis-1.2
Mean37
Median Absolute Deviation (MAD)18
Skewness0
Sum2701
Variance450.16667
MonotonicityStrictly increasing
2023-12-13T08:16:15.554710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.4%
56 1
 
1.4%
54 1
 
1.4%
53 1
 
1.4%
52 1
 
1.4%
51 1
 
1.4%
50 1
 
1.4%
49 1
 
1.4%
48 1
 
1.4%
47 1
 
1.4%
Other values (63) 63
86.3%
ValueCountFrequency (%)
1 1
1.4%
2 1
1.4%
3 1
1.4%
4 1
1.4%
5 1
1.4%
6 1
1.4%
7 1
1.4%
8 1
1.4%
9 1
1.4%
10 1
1.4%
ValueCountFrequency (%)
73 1
1.4%
72 1
1.4%
71 1
1.4%
70 1
1.4%
69 1
1.4%
68 1
1.4%
67 1
1.4%
66 1
1.4%
65 1
1.4%
64 1
1.4%

업종명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size716.0 B
숙박업(일반)
42 
세탁업
19 
목욕장업
12 

Length

Max length7
Median length7
Mean length5.4657534
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
숙박업(일반) 42
57.5%
세탁업 19
26.0%
목욕장업 12
 
16.4%

Length

2023-12-13T08:16:15.662377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:16:15.747001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
숙박업(일반 42
57.5%
세탁업 19
26.0%
목욕장업 12
 
16.4%

평가구분(등급)
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size716.0 B
녹색
39 
황색
25 
백색

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row녹색
2nd row녹색
3rd row녹색
4th row녹색
5th row녹색

Common Values

ValueCountFrequency (%)
녹색 39
53.4%
황색 25
34.2%
백색 9
 
12.3%

Length

2023-12-13T08:16:15.835886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:16:15.918808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
녹색 39
53.4%
황색 25
34.2%
백색 9
 
12.3%

업소명
Text

UNIQUE 

Distinct73
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size716.0 B
2023-12-13T08:16:16.118666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length6.4931507
Min length2

Characters and Unicode

Total characters474
Distinct characters155
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)100.0%

Sample

1st row씨엘오션호텔
2nd row신라스테이 서부산
3rd row서부산관광호텔하운드
4th row명지오션시티호텔
5th row호텔프렌치코드
ValueCountFrequency (%)
호텔 3
 
3.4%
브라운도트 2
 
2.2%
명지점 2
 
2.2%
넘버25호텔 2
 
2.2%
2월호텔더스테이 2
 
2.2%
씨엘오션호텔 1
 
1.1%
광천수 1
 
1.1%
명지레포츠센터 1
 
1.1%
명지탕 1
 
1.1%
신호해수온천 1
 
1.1%
Other values (73) 73
82.0%
2023-12-13T08:16:16.428602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
37
 
7.8%
32
 
6.8%
16
 
3.4%
16
 
3.4%
13
 
2.7%
13
 
2.7%
12
 
2.5%
12
 
2.5%
9
 
1.9%
8
 
1.7%
Other values (145) 306
64.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 435
91.8%
Space Separator 16
 
3.4%
Uppercase Letter 10
 
2.1%
Decimal Number 6
 
1.3%
Close Punctuation 3
 
0.6%
Open Punctuation 3
 
0.6%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
 
8.5%
32
 
7.4%
16
 
3.7%
13
 
3.0%
13
 
3.0%
12
 
2.8%
12
 
2.8%
9
 
2.1%
8
 
1.8%
8
 
1.8%
Other values (131) 275
63.2%
Uppercase Letter
ValueCountFrequency (%)
W 2
20.0%
O 2
20.0%
D 1
10.0%
G 1
10.0%
K 1
10.0%
J 1
10.0%
S 1
10.0%
I 1
10.0%
Decimal Number
ValueCountFrequency (%)
2 4
66.7%
5 2
33.3%
Space Separator
ValueCountFrequency (%)
16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 435
91.8%
Common 29
 
6.1%
Latin 10
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
 
8.5%
32
 
7.4%
16
 
3.7%
13
 
3.0%
13
 
3.0%
12
 
2.8%
12
 
2.8%
9
 
2.1%
8
 
1.8%
8
 
1.8%
Other values (131) 275
63.2%
Latin
ValueCountFrequency (%)
W 2
20.0%
O 2
20.0%
D 1
10.0%
G 1
10.0%
K 1
10.0%
J 1
10.0%
S 1
10.0%
I 1
10.0%
Common
ValueCountFrequency (%)
16
55.2%
2 4
 
13.8%
) 3
 
10.3%
( 3
 
10.3%
5 2
 
6.9%
, 1
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 435
91.8%
ASCII 39
 
8.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
37
 
8.5%
32
 
7.4%
16
 
3.7%
13
 
3.0%
13
 
3.0%
12
 
2.8%
12
 
2.8%
9
 
2.1%
8
 
1.8%
8
 
1.8%
Other values (131) 275
63.2%
ASCII
ValueCountFrequency (%)
16
41.0%
2 4
 
10.3%
) 3
 
7.7%
( 3
 
7.7%
W 2
 
5.1%
5 2
 
5.1%
O 2
 
5.1%
D 1
 
2.6%
G 1
 
2.6%
K 1
 
2.6%
Other values (4) 4
 
10.3%
Distinct73
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size716.0 B
2023-12-13T08:16:16.696067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length46
Mean length32.191781
Min length22

Characters and Unicode

Total characters2350
Distinct characters124
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)100.0%

Sample

1st row부산광역시 강서구 명지국제6로34번길 22 (명지동)
2nd row부산광역시 강서구 명지국제7로 38, 신라스테이 서부산점 (명지동)
3rd row부산광역시 강서구 화전산단4로25번길 53 (화전동)
4th row부산광역시 강서구 명지오션시티3로 37 (명지동)
5th row부산광역시 강서구 명지오션시티5로 8 (명지동)
ValueCountFrequency (%)
부산광역시 73
 
17.3%
강서구 73
 
17.3%
명지동 32
 
7.6%
신호동 18
 
4.3%
르노삼성대로 10
 
2.4%
대저1동 8
 
1.9%
신호산단1로217번길 8
 
1.9%
1층 7
 
1.7%
대저2동 4
 
1.0%
8 4
 
1.0%
Other values (140) 184
43.7%
2023-12-13T08:16:17.113730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
348
 
14.8%
1 111
 
4.7%
100
 
4.3%
85
 
3.6%
83
 
3.5%
81
 
3.4%
76
 
3.2%
74
 
3.1%
74
 
3.1%
( 73
 
3.1%
Other values (114) 1245
53.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1409
60.0%
Decimal Number 391
 
16.6%
Space Separator 348
 
14.8%
Open Punctuation 73
 
3.1%
Close Punctuation 73
 
3.1%
Other Punctuation 41
 
1.7%
Dash Punctuation 11
 
0.5%
Uppercase Letter 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
100
 
7.1%
85
 
6.0%
83
 
5.9%
81
 
5.7%
76
 
5.4%
74
 
5.3%
74
 
5.3%
73
 
5.2%
73
 
5.2%
71
 
5.0%
Other values (96) 619
43.9%
Decimal Number
ValueCountFrequency (%)
1 111
28.4%
2 64
16.4%
7 40
 
10.2%
3 35
 
9.0%
5 32
 
8.2%
6 24
 
6.1%
4 23
 
5.9%
8 23
 
5.9%
0 20
 
5.1%
9 19
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
A 2
50.0%
C 1
25.0%
B 1
25.0%
Space Separator
ValueCountFrequency (%)
348
100.0%
Open Punctuation
ValueCountFrequency (%)
( 73
100.0%
Close Punctuation
ValueCountFrequency (%)
) 73
100.0%
Other Punctuation
ValueCountFrequency (%)
, 41
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1409
60.0%
Common 937
39.9%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
100
 
7.1%
85
 
6.0%
83
 
5.9%
81
 
5.7%
76
 
5.4%
74
 
5.3%
74
 
5.3%
73
 
5.2%
73
 
5.2%
71
 
5.0%
Other values (96) 619
43.9%
Common
ValueCountFrequency (%)
348
37.1%
1 111
 
11.8%
( 73
 
7.8%
) 73
 
7.8%
2 64
 
6.8%
, 41
 
4.4%
7 40
 
4.3%
3 35
 
3.7%
5 32
 
3.4%
6 24
 
2.6%
Other values (5) 96
 
10.2%
Latin
ValueCountFrequency (%)
A 2
50.0%
C 1
25.0%
B 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1409
60.0%
ASCII 941
40.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
348
37.0%
1 111
 
11.8%
( 73
 
7.8%
) 73
 
7.8%
2 64
 
6.8%
, 41
 
4.4%
7 40
 
4.3%
3 35
 
3.7%
5 32
 
3.4%
6 24
 
2.6%
Other values (8) 100
 
10.6%
Hangul
ValueCountFrequency (%)
100
 
7.1%
85
 
6.0%
83
 
5.9%
81
 
5.7%
76
 
5.4%
74
 
5.3%
74
 
5.3%
73
 
5.2%
73
 
5.2%
71
 
5.0%
Other values (96) 619
43.9%
Distinct72
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size716.0 B
2023-12-13T08:16:17.373973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length3
Mean length3.6986301
Min length3

Characters and Unicode

Total characters270
Distinct characters107
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)97.3%

Sample

1st row나선석
2nd row박서정
3rd row김성동 외 1명
4th row성춘경
5th row김수연
ValueCountFrequency (%)
9
 
9.8%
1명 8
 
8.7%
김수연 2
 
2.2%
박명찬 1
 
1.1%
강병식 1
 
1.1%
윤수휘 1
 
1.1%
윤서해 1
 
1.1%
장용훈 1
 
1.1%
전일조 1
 
1.1%
김영석 1
 
1.1%
Other values (66) 66
71.7%
2023-12-13T08:16:17.787264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
7.0%
14
 
5.2%
10
 
3.7%
9
 
3.3%
9
 
3.3%
1 8
 
3.0%
8
 
3.0%
6
 
2.2%
6
 
2.2%
6
 
2.2%
Other values (97) 175
64.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 233
86.3%
Space Separator 19
 
7.0%
Decimal Number 9
 
3.3%
Uppercase Letter 9
 
3.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
6.0%
10
 
4.3%
9
 
3.9%
9
 
3.9%
8
 
3.4%
6
 
2.6%
6
 
2.6%
6
 
2.6%
6
 
2.6%
5
 
2.1%
Other values (87) 154
66.1%
Uppercase Letter
ValueCountFrequency (%)
U 2
22.2%
I 2
22.2%
X 1
11.1%
G 1
11.1%
N 1
11.1%
W 1
11.1%
Y 1
11.1%
Decimal Number
ValueCountFrequency (%)
1 8
88.9%
3 1
 
11.1%
Space Separator
ValueCountFrequency (%)
19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 233
86.3%
Common 28
 
10.4%
Latin 9
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
6.0%
10
 
4.3%
9
 
3.9%
9
 
3.9%
8
 
3.4%
6
 
2.6%
6
 
2.6%
6
 
2.6%
6
 
2.6%
5
 
2.1%
Other values (87) 154
66.1%
Latin
ValueCountFrequency (%)
U 2
22.2%
I 2
22.2%
X 1
11.1%
G 1
11.1%
N 1
11.1%
W 1
11.1%
Y 1
11.1%
Common
ValueCountFrequency (%)
19
67.9%
1 8
28.6%
3 1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 233
86.3%
ASCII 37
 
13.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
19
51.4%
1 8
21.6%
U 2
 
5.4%
I 2
 
5.4%
X 1
 
2.7%
G 1
 
2.7%
N 1
 
2.7%
W 1
 
2.7%
Y 1
 
2.7%
3 1
 
2.7%
Hangul
ValueCountFrequency (%)
14
 
6.0%
10
 
4.3%
9
 
3.9%
9
 
3.9%
8
 
3.4%
6
 
2.6%
6
 
2.6%
6
 
2.6%
6
 
2.6%
5
 
2.1%
Other values (87) 154
66.1%

Interactions

2023-12-13T08:16:15.196880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:16:17.904882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명평가구분(등급)업소명영업소 주소(도로명)영업자
연번1.0000.9170.9071.0001.0000.940
업종명0.9171.0000.7211.0001.0000.307
평가구분(등급)0.9070.7211.0001.0001.0001.000
업소명1.0001.0001.0001.0001.0001.000
영업소 주소(도로명)1.0001.0001.0001.0001.0001.000
영업자0.9400.3071.0001.0001.0001.000
2023-12-13T08:16:17.995881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명평가구분(등급)
업종명1.0000.377
평가구분(등급)0.3771.000
2023-12-13T08:16:18.073418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명평가구분(등급)
연번1.0000.8380.821
업종명0.8381.0000.377
평가구분(등급)0.8210.3771.000

Missing values

2023-12-13T08:16:15.288520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:16:15.364470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종명평가구분(등급)업소명영업소 주소(도로명)영업자
01숙박업(일반)녹색씨엘오션호텔부산광역시 강서구 명지국제6로34번길 22 (명지동)나선석
12숙박업(일반)녹색신라스테이 서부산부산광역시 강서구 명지국제7로 38, 신라스테이 서부산점 (명지동)박서정
23숙박업(일반)녹색서부산관광호텔하운드부산광역시 강서구 화전산단4로25번길 53 (화전동)김성동 외 1명
34숙박업(일반)녹색명지오션시티호텔부산광역시 강서구 명지오션시티3로 37 (명지동)성춘경
45숙박업(일반)녹색호텔프렌치코드부산광역시 강서구 명지오션시티5로 8 (명지동)김수연
56숙박업(일반)녹색김해공항 브라운도트 명지점부산광역시 강서구 르노삼성대로 576-6 (명지동)박현숙
67숙박업(일반)녹색브이호텔부산광역시 강서구 신호산단2로27번길 33 (신호동)박재근 외 1명
78숙박업(일반)녹색이그니스호텔부산광역시 강서구 신호산단2로27번길 21 (신호동)최성훈
89숙박업(일반)녹색제이케이(JK)부산광역시 강서구 르노삼성대로 590 (명지동)서미자
910숙박업(일반)녹색비즈니스호텔부산광역시 강서구 신호산단2로27번길 41 (신호동)김일배 외 1명
연번업종명평가구분(등급)업소명영업소 주소(도로명)영업자
6364세탁업황색신영세탁부산광역시 강서구 명지새동네길14번길 21 (명지동)이영복
6465세탁업황색서울세탁소부산광역시 강서구 공항로811번다길 16 (대저2동)김광웅
6566세탁업백색공항세탁소부산광역시 강서구 공항로811번가길 2 (대저2동)이철조
6667세탁업백색세원산업부산광역시 강서구 대저로89번가길 115-11, 1층 (대저1동)신용운
6768세탁업백색명지세탁소부산광역시 강서구 신호산단1로 76, 경희빌딩 101호,102호,103호,104호 일부 1층 (신호동)김정연
6869세탁업백색파트너세탁부산광역시 강서구 대저로89번가길 115-16 (대저1동)노장현
6970세탁업백색경남사부산광역시 강서구 대저중앙로394번가길 74-3, 1층 (대저1동)WU XIUYING
7071세탁업백색나루터세탁부산광역시 강서구 대저중앙로394번길 95-1 (대저1동)우창택
7172세탁업백색빨래리노 명지점부산광역시 강서구 명지국제12로47번길 5, 1층 (명지동)윤도연
7273세탁업백색부경사부산광역시 강서구 대저로135번길 11, A동 (대저1동)김대경