Overview

Dataset statistics

Number of variables: 6
Number of observations: 90
Missing cells: 7
Missing cells (%): 1.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.3 KiB
Average record size in memory: 49.5 B
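
The missing-cell percentage above can be checked by hand: 7 empty cells out of 90 rows times 6 columns. A minimal sketch of that arithmetic follows; the function name missing_cell_pct is illustrative and not part of ydata-profiling.

```python
def missing_cell_pct(n_missing: int, n_rows: int, n_cols: int) -> float:
    """Share of empty cells among all cells, as a percentage."""
    return 100.0 * n_missing / (n_rows * n_cols)


# Figures taken from the dataset statistics: 7 missing cells, 90 rows, 6 columns.
pct = missing_cell_pct(n_missing=7, n_rows=90, n_cols=6)
print(f"{pct:.1f}%")
```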

Variable types

Text: 4
Categorical: 1
Boolean: 1

Dataset

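Description: Status data on bathhouse businesses (목욕장업) in Gimhae-si. The fields are business name, business type, phone number, road-name address, lot-number address, and whether the facility has a sweating room (발한실).
Author: 경상남도 김해시 (Gimhae-si, Gyeongsangnam-do)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15033395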

Alerts

업태명 is highly imbalanced (65.2%) [Imbalance]
소재지전화 has 7 (7.8%) missing values [Missing]
영업소 주소(도로명) has unique values [Unique]
영업소 주소(지번) has unique values [Unique]
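
The 65.2% imbalance figure for 업태명 is consistent with an entropy-based score: one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy for that number of classes. The sketch below assumes that definition (the helper name imbalance_score is illustrative) and applies it to the class counts reported for 업태명 (79, 7, 2, 2).

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


# Class counts of 업태명 as listed in its Common Values table.
score = imbalance_score([79, 7, 2, 2])
print(f"{score:.1%}")
```

Under this assumption the score comes out at roughly 0.652, which matches the alert.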

Reproduction

Analysis started: 2023-12-11 00:12:19.772636
Analysis finished: 2023-12-11 00:12:20.779691
Duration: 1.01 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업소명
Text

Distinct: 89
Distinct (%): 98.9%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

Length

Max length: 15
Median length: 13
Mean length: 5.0777778
Min length: 3

Characters and Unicode

Total characters: 457
Distinct characters: 141
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
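
As an illustration of these character properties, Python's standard unicodedata module exposes the general category of each code point (for example Lo for "Other Letter", Po for "Other Punctuation"). The sketch below counts categories for one business name taken from the sample rows; it is not the report's own implementation, and the helper name category_counts is illustrative.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count Unicode general categories (two-letter codes) in a string."""
    return Counter(unicodedata.category(ch) for ch in text)


# A business name from the sample: Hangul syllables plus one ampersand.
counts = category_counts("가야해수사우나&골프존파크")
print(dict(counts))
```

Hangul syllables fall under Lo and the ampersand under Po, which is how the report arrives at its "Other Letter" and "Other Punctuation" rows.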

Unique

Unique: 88
Unique (%): 97.8%
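
"Distinct" and "Unique" measure different things here: distinct counts the different values present, while unique counts the values that occur exactly once. One name (청수탕) appears twice, which is why 업소명 has 89 distinct but only 88 unique values. A small sketch of the distinction, using a hypothetical four-name subset rather than the full column:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values occurring exactly once)."""
    freq = Counter(values)
    distinct = len(freq)
    unique = sum(1 for count in freq.values() if count == 1)
    return distinct, unique


# Hypothetical subset: one name is repeated, as 청수탕 is in the full data.
names = ["청수탕", "남초탕", "청수탕", "우림탕"]
result = distinct_and_unique(names)
print(result)
```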

Sample

1st row: 남초탕
2nd row: 금양목욕탕
3rd row: 우림탕
4th row: 덕삼탕
5th row: 수정탕

Value | Count | Frequency (%)
청수탕 | 2 | 2.1%
용천스파랜드 | 1 | 1.0%
롯데캐슬사우나 | 1 | 1.0%
우리들사우나 | 1 | 1.0%
청암레포츠 | 1 | 1.0%
대청사우나 | 1 | 1.0%
그랜드사우나 | 1 | 1.0%
장유계곡참숯가마 | 1 | 1.0%
굿타임사우나 | 1 | 1.0%
수정사우나 | 1 | 1.0%
Other values (85) | 85 | 88.5%

Most occurring characters

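Value | Count | Frequency (%)
(not preserved) | 54 | 11.8%
(not preserved) | 26 | 5.7%
(not preserved) | 24 | 5.3%
(not preserved) | 23 | 5.0%
(not preserved) | 11 | 2.4%
(not preserved) | 11 | 2.4%
(not preserved) | 11 | 2.4%
(not preserved) | 9 | 2.0%
(not preserved) | 8 | 1.8%
(not preserved) | 8 | 1.8%
Other values (131) | 272 | 59.5%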

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 444 | 97.2%
Space Separator | 6 | 1.3%
Close Punctuation | 3 | 0.7%
Open Punctuation | 3 | 0.7%
Other Punctuation | 1 | 0.2%

Most frequent character per category

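Other Letter
Value | Count | Frequency (%)
(not preserved) | 54 | 12.2%
(not preserved) | 26 | 5.9%
(not preserved) | 24 | 5.4%
(not preserved) | 23 | 5.2%
(not preserved) | 11 | 2.5%
(not preserved) | 11 | 2.5%
(not preserved) | 11 | 2.5%
(not preserved) | 9 | 2.0%
(not preserved) | 8 | 1.8%
(not preserved) | 8 | 1.8%
Other values (127) | 259 | 58.3%

Space Separator
Value | Count | Frequency (%)
(space) | 6 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%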

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 444 | 97.2%
Common | 13 | 2.8%

Most frequent character per script

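Hangul
Value | Count | Frequency (%)
(not preserved) | 54 | 12.2%
(not preserved) | 26 | 5.9%
(not preserved) | 24 | 5.4%
(not preserved) | 23 | 5.2%
(not preserved) | 11 | 2.5%
(not preserved) | 11 | 2.5%
(not preserved) | 11 | 2.5%
(not preserved) | 9 | 2.0%
(not preserved) | 8 | 1.8%
(not preserved) | 8 | 1.8%
Other values (127) | 259 | 58.3%

Common
Value | Count | Frequency (%)
(space) | 6 | 46.2%
) | 3 | 23.1%
( | 3 | 23.1%
& | 1 | 7.7%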

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 444 | 97.2%
ASCII | 13 | 2.8%

Most frequent character per block

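Hangul
Value | Count | Frequency (%)
(not preserved) | 54 | 12.2%
(not preserved) | 26 | 5.9%
(not preserved) | 24 | 5.4%
(not preserved) | 23 | 5.2%
(not preserved) | 11 | 2.5%
(not preserved) | 11 | 2.5%
(not preserved) | 11 | 2.5%
(not preserved) | 9 | 2.0%
(not preserved) | 8 | 1.8%
(not preserved) | 8 | 1.8%
Other values (127) | 259 | 58.3%

ASCII
Value | Count | Frequency (%)
(space) | 6 | 46.2%
) | 3 | 23.1%
( | 3 | 23.1%
& | 1 | 7.7%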

영업소 주소(도로명)
Text

UNIQUE

Distinct: 90
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

Length

Max length: 50
Median length: 40
Mean length: 28.677778
Min length: 20

Characters and Unicode

Total characters: 2581
Distinct characters: 132
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 90
Unique (%): 100.0%

Sample

1st row: 경상남도 김해시 삼안로132번길 9-5 (삼방동)
2nd row: 경상남도 김해시 진영읍 여래로20번길 7-1, 4층
3rd row: 경상남도 김해시 분성로 116 (외동)
4th row: 경상남도 김해시 김해대로2529번길 70 (어방동)
5th row: 경상남도 김해시 호계로500번길 3 (동상동)

Value | Count | Frequency (%)
경상남도 | 90 | 17.9%
김해시 | 90 | 17.9%
내동 | 8 | 1.6%
어방동 | 7 | 1.4%
삼계동 | 6 | 1.2%
삼방동 | 6 | 1.2%
외동 | 6 | 1.2%
삼정동 | 5 | 1.0%
7 | 5 | 1.0%
진영읍 | 5 | 1.0%
Other values (222) | 274 | 54.6%

Most occurring characters

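Value | Count | Frequency (%)
(space) | 412 | 16.0%
1 | 108 | 4.2%
(not preserved) | 100 | 3.9%
(not preserved) | 97 | 3.8%
(not preserved) | 97 | 3.8%
(not preserved) | 93 | 3.6%
(not preserved) | 92 | 3.6%
(not preserved) | 90 | 3.5%
(not preserved) | 90 | 3.5%
(not preserved) | 89 | 3.4%
Other values (122) | 1313 | 50.9%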

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1429 | 55.4%
Decimal Number | 489 | 18.9%
Space Separator | 412 | 16.0%
Close Punctuation | 80 | 3.1%
Open Punctuation | 80 | 3.1%
Other Punctuation | 66 | 2.6%
Dash Punctuation | 24 | 0.9%
Uppercase Letter | 1 | < 0.1%

Most frequent character per category

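Other Letter
Value | Count | Frequency (%)
(not preserved) | 100 | 7.0%
(not preserved) | 97 | 6.8%
(not preserved) | 97 | 6.8%
(not preserved) | 93 | 6.5%
(not preserved) | 92 | 6.4%
(not preserved) | 90 | 6.3%
(not preserved) | 90 | 6.3%
(not preserved) | 89 | 6.2%
(not preserved) | 89 | 6.2%
(not preserved) | 59 | 4.1%
Other values (105) | 533 | 37.3%

Decimal Number
Value | Count | Frequency (%)
1 | 108 | 22.1%
2 | 77 | 15.7%
5 | 54 | 11.0%
4 | 52 | 10.6%
3 | 52 | 10.6%
0 | 46 | 9.4%
7 | 31 | 6.3%
8 | 26 | 5.3%
6 | 22 | 4.5%
9 | 21 | 4.3%

Other Punctuation
Value | Count | Frequency (%)
, | 63 | 95.5%
. | 3 | 4.5%

Space Separator
Value | Count | Frequency (%)
(space) | 412 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 80 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 80 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 24 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 1 | 100.0%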

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1429 | 55.4%
Common | 1151 | 44.6%
Latin | 1 | < 0.1%

Most frequent character per script

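Hangul
Value | Count | Frequency (%)
(not preserved) | 100 | 7.0%
(not preserved) | 97 | 6.8%
(not preserved) | 97 | 6.8%
(not preserved) | 93 | 6.5%
(not preserved) | 92 | 6.4%
(not preserved) | 90 | 6.3%
(not preserved) | 90 | 6.3%
(not preserved) | 89 | 6.2%
(not preserved) | 89 | 6.2%
(not preserved) | 59 | 4.1%
Other values (105) | 533 | 37.3%

Common
Value | Count | Frequency (%)
(space) | 412 | 35.8%
1 | 108 | 9.4%
) | 80 | 7.0%
( | 80 | 7.0%
2 | 77 | 6.7%
, | 63 | 5.5%
5 | 54 | 4.7%
4 | 52 | 4.5%
3 | 52 | 4.5%
0 | 46 | 4.0%
Other values (6) | 127 | 11.0%

Latin
Value | Count | Frequency (%)
B | 1 | 100.0%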

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1429 | 55.4%
ASCII | 1152 | 44.6%

Most frequent character per block

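ASCII
Value | Count | Frequency (%)
(space) | 412 | 35.8%
1 | 108 | 9.4%
) | 80 | 6.9%
( | 80 | 6.9%
2 | 77 | 6.7%
, | 63 | 5.5%
5 | 54 | 4.7%
4 | 52 | 4.5%
3 | 52 | 4.5%
0 | 46 | 4.0%
Other values (7) | 128 | 11.1%

Hangul
Value | Count | Frequency (%)
(not preserved) | 100 | 7.0%
(not preserved) | 97 | 6.8%
(not preserved) | 97 | 6.8%
(not preserved) | 93 | 6.5%
(not preserved) | 92 | 6.4%
(not preserved) | 90 | 6.3%
(not preserved) | 90 | 6.3%
(not preserved) | 89 | 6.2%
(not preserved) | 89 | 6.2%
(not preserved) | 59 | 4.1%
Other values (105) | 533 | 37.3%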

영업소 주소(지번)
Text

UNIQUE

Distinct: 90
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

Length

Max length: 43
Median length: 35
Mean length: 22.244444
Min length: 15

Characters and Unicode

Total characters: 2002
Distinct characters: 115
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 90
Unique (%): 100.0%

Sample

1st row: 경상남도 김해시 삼방동 27-4
2nd row: 경상남도 김해시 진영읍 여래리 700-164
3rd row: 경상남도 김해시 외동 533-2
4th row: 경상남도 김해시 어방동 1097-16
5th row: 경상남도 김해시 동상동 785-1

Value | Count | Frequency (%)
경상남도 | 90 | 21.5%
김해시 | 90 | 21.5%
내동 | 8 | 1.9%
어방동 | 7 | 1.7%
삼방동 | 6 | 1.4%
외동 | 6 | 1.4%
삼계동 | 6 | 1.4%
진영읍 | 5 | 1.2%
대청동 | 5 | 1.2%
삼정동 | 5 | 1.2%
Other values (163) | 191 | 45.6%

Most occurring characters

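Value | Count | Frequency (%)
(space) | 412 | 20.6%
1 | 113 | 5.6%
(not preserved) | 97 | 4.8%
(not preserved) | 91 | 4.5%
(not preserved) | 91 | 4.5%
(not preserved) | 91 | 4.5%
(not preserved) | 90 | 4.5%
(not preserved) | 90 | 4.5%
(not preserved) | 90 | 4.5%
(not preserved) | 89 | 4.4%
Other values (105) | 748 | 37.4%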

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1052 | 52.5%
Decimal Number | 445 | 22.2%
Space Separator | 412 | 20.6%
Dash Punctuation | 80 | 4.0%
Other Punctuation | 12 | 0.6%
Uppercase Letter | 1 | < 0.1%

Most frequent character per category

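Other Letter
Value | Count | Frequency (%)
(not preserved) | 97 | 9.2%
(not preserved) | 91 | 8.7%
(not preserved) | 91 | 8.7%
(not preserved) | 91 | 8.7%
(not preserved) | 90 | 8.6%
(not preserved) | 90 | 8.6%
(not preserved) | 90 | 8.6%
(not preserved) | 89 | 8.5%
(not preserved) | 21 | 2.0%
(not preserved) | 14 | 1.3%
Other values (90) | 288 | 27.4%

Decimal Number
Value | Count | Frequency (%)
1 | 113 | 25.4%
2 | 62 | 13.9%
3 | 47 | 10.6%
0 | 44 | 9.9%
4 | 42 | 9.4%
5 | 39 | 8.8%
6 | 33 | 7.4%
7 | 25 | 5.6%
9 | 21 | 4.7%
8 | 19 | 4.3%

Other Punctuation
Value | Count | Frequency (%)
, | 11 | 91.7%
. | 1 | 8.3%

Space Separator
Value | Count | Frequency (%)
(space) | 412 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 80 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 1 | 100.0%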

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1052 | 52.5%
Common | 949 | 47.4%
Latin | 1 | < 0.1%

Most frequent character per script

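Hangul
Value | Count | Frequency (%)
(not preserved) | 97 | 9.2%
(not preserved) | 91 | 8.7%
(not preserved) | 91 | 8.7%
(not preserved) | 91 | 8.7%
(not preserved) | 90 | 8.6%
(not preserved) | 90 | 8.6%
(not preserved) | 90 | 8.6%
(not preserved) | 89 | 8.5%
(not preserved) | 21 | 2.0%
(not preserved) | 14 | 1.3%
Other values (90) | 288 | 27.4%

Common
Value | Count | Frequency (%)
(space) | 412 | 43.4%
1 | 113 | 11.9%
- | 80 | 8.4%
2 | 62 | 6.5%
3 | 47 | 5.0%
0 | 44 | 4.6%
4 | 42 | 4.4%
5 | 39 | 4.1%
6 | 33 | 3.5%
7 | 25 | 2.6%
Other values (4) | 52 | 5.5%

Latin
Value | Count | Frequency (%)
B | 1 | 100.0%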

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1052 | 52.5%
ASCII | 950 | 47.5%

Most frequent character per block

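ASCII
Value | Count | Frequency (%)
(space) | 412 | 43.4%
1 | 113 | 11.9%
- | 80 | 8.4%
2 | 62 | 6.5%
3 | 47 | 4.9%
0 | 44 | 4.6%
4 | 42 | 4.4%
5 | 39 | 4.1%
6 | 33 | 3.5%
7 | 25 | 2.6%
Other values (5) | 53 | 5.6%

Hangul
Value | Count | Frequency (%)
(not preserved) | 97 | 9.2%
(not preserved) | 91 | 8.7%
(not preserved) | 91 | 8.7%
(not preserved) | 91 | 8.7%
(not preserved) | 90 | 8.6%
(not preserved) | 90 | 8.6%
(not preserved) | 90 | 8.6%
(not preserved) | 89 | 8.5%
(not preserved) | 21 | 2.0%
(not preserved) | 14 | 1.3%
Other values (90) | 288 | 27.4%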

소재지전화
Text

MISSING 

Distinct: 82
Distinct (%): 98.8%
Missing: 7
Missing (%): 7.8%
Memory size: 852.0 B

Length

Max length: 13
Median length: 13
Mean length: 12.614458
Min length: 12

Characters and Unicode

Total characters: 1047
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 81
Unique (%): 97.6%

Sample

1st row: " 055-334-8123"
2nd row: " 055-345-0781"
3rd row: "055-321-2179"
4th row: "055-326-2231"
5th row: " 055-335-2083"
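
Some of the sample values carry a leading space, which would explain the 12 to 13 character length range and the 51 space characters counted for this column (83 non-missing values of 12 characters plus 51 spaces gives the 1047 total characters). The sketch below shows one way such padding might be detected and stripped before comparing phone numbers; it is an illustration on the five sample rows, not a step the report itself performs.

```python
# The five sample values of 소재지전화, including their leading spaces.
samples = [" 055-334-8123", " 055-345-0781", "055-321-2179", "055-326-2231", " 055-335-2083"]


def strip_padding(values):
    """Remove leading and trailing whitespace from each value."""
    return [v.strip() for v in values]


# Values that change when stripped are the padded ones.
padded = [v for v in samples if v != v.strip()]
cleaned = strip_padding(samples)
print(len(padded), sorted({len(v) for v in cleaned}))
```

After stripping, every sample value has the same length, so padded and unpadded copies of one number would no longer count as distinct values.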

Value | Count | Frequency (%)
055-335-6454 | 2 | 2.4%
055-330-9000 | 2 | 2.4%
055-314-3741 | 1 | 1.2%
055-314-7518 | 1 | 1.2%
055-332-9207 | 1 | 1.2%
055-312-4411 | 1 | 1.2%
055-336-3311 | 1 | 1.2%
055-312-0210 | 1 | 1.2%
055-332-0998 | 1 | 1.2%
055-325-3044 | 1 | 1.2%
Other values (71) | 71 | 85.5%

Most occurring characters

Value | Count | Frequency (%)
5 | 208 | 19.9%
- | 166 | 15.9%
3 | 161 | 15.4%
0 | 131 | 12.5%
2 | 80 | 7.6%
1 | 58 | 5.5%
4 | 51 | 4.9%
(space) | 51 | 4.9%
6 | 46 | 4.4%
7 | 38 | 3.6%
Other values (2) | 57 | 5.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 830 | 79.3%
Dash Punctuation | 166 | 15.9%
Space Separator | 51 | 4.9%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 208 | 25.1%
3 | 161 | 19.4%
0 | 131 | 15.8%
2 | 80 | 9.6%
1 | 58 | 7.0%
4 | 51 | 6.1%
6 | 46 | 5.5%
7 | 38 | 4.6%
8 | 33 | 4.0%
9 | 24 | 2.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 166 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 51 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1047 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
5 | 208 | 19.9%
- | 166 | 15.9%
3 | 161 | 15.4%
0 | 131 | 12.5%
2 | 80 | 7.6%
1 | 58 | 5.5%
4 | 51 | 4.9%
(space) | 51 | 4.9%
6 | 46 | 4.4%
7 | 38 | 3.6%
Other values (2) | 57 | 5.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1047 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
5 | 208 | 19.9%
- | 166 | 15.9%
3 | 161 | 15.4%
0 | 131 | 12.5%
2 | 80 | 7.6%
1 | 58 | 5.5%
4 | 51 | 4.9%
(space) | 51 | 4.9%
6 | 46 | 4.4%
7 | 38 | 3.6%
Other values (2) | 57 | 5.4%

업태명
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): 4.4%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

공동탕업: 79
공동탕업+찜질시설서비스영업: 7
목욕장업 기타: 2
찜질시설서비스영업: 2

Length

Max length: 14
Median length: 4
Mean length: 4.9555556
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공동탕업
2nd row: 공동탕업
3rd row: 공동탕업
4th row: 공동탕업
5th row: 공동탕업

Common Values

Value | Count | Frequency (%)
공동탕업 | 79 | 87.8%
공동탕업+찜질시설서비스영업 | 7 | 7.8%
목욕장업 기타 | 2 | 2.2%
찜질시설서비스영업 | 2 | 2.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
공동탕업 | 79 | 85.9%
공동탕업+찜질시설서비스영업 | 7 | 7.6%
목욕장업 | 2 | 2.2%
기타 | 2 | 2.2%
찜질시설서비스영업 | 2 | 2.2%

발한실
Boolean

Distinct: 2
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 222.0 B

Value | Count | Frequency (%)
False | 50 | 55.6%
True | 40 | 44.4%

Correlations

 | 업소명 | 영업소 주소(도로명) | 영업소 주소(지번) | 소재지전화 | 업태명 | 발한실
업소명 | 1.000 | 1.000 | 1.000 | 0.999 | 1.000 | 0.000
영업소 주소(도로명) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
영업소 주소(지번) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소재지전화 | 0.999 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
업태명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.405
발한실 | 0.000 | 1.000 | 1.000 | 1.000 | 0.405 | 1.000

 | 발한실 | 업태명
발한실 | 1.000 | 0.268
업태명 | 0.268 | 1.000

 | 업태명 | 발한실
업태명 | 1.000 | 0.268
발한실 | 0.268 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
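
The per-column nullity counts behind such a chart amount to counting empty cells column by column. A minimal sketch follows, using three rows taken from the tail of the sample table (indices 81, 88, and 89) and representing <NA> as None; the helper name nullity_by_column is illustrative.

```python
# Three rows from the sample table, reduced to two columns; None stands for <NA>.
rows = [
    {"업소명": "황토나라탕", "소재지전화": None},
    {"업소명": "센텀사우나", "소재지전화": "055-321-2800"},
    {"업소명": "올리아", "소재지전화": None},
]


def nullity_by_column(records):
    """Count missing (None) cells for each column of a list of dict rows."""
    columns = records[0].keys()
    return {col: sum(1 for r in records if r[col] is None) for col in columns}


missing = nullity_by_column(rows)
print(missing)
```

On the full dataset the same count would yield 7 for 소재지전화 and 0 for every other column, in line with the alert above.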

Sample

Index | 업소명 | 영업소 주소(도로명) | 영업소 주소(지번) | 소재지전화 | 업태명 | 발한실
0 | 남초탕 | 경상남도 김해시 삼안로132번길 9-5 (삼방동) | 경상남도 김해시 삼방동 27-4 | 055-334-8123 | 공동탕업 | N
1 | 금양목욕탕 | 경상남도 김해시 진영읍 여래로20번길 7-1, 4층 | 경상남도 김해시 진영읍 여래리 700-164 | 055-345-0781 | 공동탕업 | Y
2 | 우림탕 | 경상남도 김해시 분성로 116 (외동) | 경상남도 김해시 외동 533-2 | 055-321-2179 | 공동탕업 | Y
3 | 덕삼탕 | 경상남도 김해시 김해대로2529번길 70 (어방동) | 경상남도 김해시 어방동 1097-16 | 055-326-2231 | 공동탕업 | N
4 | 수정탕 | 경상남도 김해시 호계로500번길 3 (동상동) | 경상남도 김해시 동상동 785-1 | 055-335-2083 | 공동탕업 | N
5 | 은하탕 | 경상남도 김해시 가락로23번길 18-1 (봉황동) | 경상남도 김해시 봉황동 24-5 | 055-336-2488 | 공동탕업 | N
6 | 성광탕 | 경상남도 김해시 호계로452번길 23-9 (부원동) | 경상남도 김해시 부원동 63-23 | 055-328-2925 | 공동탕업 | N
7 | 봉황사우나 | 경상남도 김해시 가락로15번길 15 (봉황동) | 경상남도 김해시 봉황동 26-4 | 055-322-1577 | 공동탕업 | N
8 | 창성탕 | 경상남도 김해시 가락로 156-8 (대성동) | 경상남도 김해시 대성동 183-7 | 055-337-7729 | 공동탕업 | N
9 | 천호탕 | 경상남도 김해시 해반천로 34-17 (구산동) | 경상남도 김해시 구산동 530-4 | 055-334-9190 | 공동탕업 | Y

Index | 업소명 | 영업소 주소(도로명) | 영업소 주소(지번) | 소재지전화 | 업태명 | 발한실
80 | 맑은샘사우나시스템 | 경상남도 김해시 김해대로1902번길 25 (구산동) | 경상남도 김해시 구산동 833 외2필지 | 055-322-1286 | 공동탕업+찜질시설서비스영업 | Y
81 | 황토나라탕 | 경상남도 김해시 가락로 153-5, 1층 (대성동) | 경상남도 김해시 대성동 179-9 | <NA> | 공동탕업 | N
82 | 금호탕 | 경상남도 김해시 분성로501번길 34, 금호빌딩 1,2층 (어방동) | 경상남도 김해시 어방동 1129-11 금호빌딩 | <NA> | 공동탕업 | N
83 | 워터랜드 맑은샘 찜질방사우나 | 경상남도 김해시 능동로 27, 퓨전스포츠타운 3,4,5층 301,401,501호 (삼문동) | 경상남도 김해시 삼문동 79-3 퓨전스포츠타운 | 055-339-5772 | 공동탕업+찜질시설서비스영업 | Y
84 | 남강스파앤 피트니스(주) | 경상남도 김해시 덕정로204번길 36, 남강관동온천 2,4층 (관동동) | 경상남도 김해시 관동동 462 남강관동온천 2,4층 | 055-322-7202 | 공동탕업 | N
85 | 진영라이프사우나 | 경상남도 김해시 진영읍 진영로 132, 3층 | 경상남도 김해시 진영읍 진영리 265-1 | 055-346-7651 | 공동탕업 | N
86 | 인제 암반수 헬스 사우나 | 경상남도 김해시 활천로255번길 39, 2,3층 (어방동) | 경상남도 김해시 어방동 521-4 | <NA> | 공동탕업 | N
87 | 가야해수사우나&골프존파크 | 경상남도 김해시 김해대로2492번길 38, 2,3층 (삼정동) | 경상남도 김해시 삼정동 0 | <NA> | 공동탕업 | N
88 | 센텀사우나 | 경상남도 김해시 주촌면 선천로 65, 태성에스더블유 레포츠 2,3층 | 경상남도 김해시 주촌면 선지리 750-9 태성에스더블유 레포츠 | 055-321-2800 | 공동탕업 | Y
89 | 올리아 | 경상남도 김해시 가야로 183, 삼계위너스타운 10층 1004호 (삼계동) | 경상남도 김해시 삼계동 1486-1 외 1필지 삼계위너스타운 10층 1004호 | <NA> | 목욕장업 기타 | N