Overview

Dataset statistics

Number of variables5
Number of observations58
Missing cells26
Missing cells (%)9.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory44.3 B

Variable types

Numeric2
Text3

Dataset

Description여수시에서 허가되어 있는 민간체육시설 중 헬스장에 관한 자료들로 헬스장에 대한 상호명과 시설주소, 시설전화번호 현황 자료 등을 제공하고 있는 자료들입니다.
URLhttps://www.data.go.kr/data/15077231/fileData.do

Alerts

시설전화번호 has 26 (44.8%) missing valuesMissing
연번 has unique valuesUnique
상호 has unique valuesUnique
시설주소(도로명) has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:40:16.455577
Analysis finished2023-12-12 04:40:17.204977
Duration0.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct58
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29.5
Minimum1
Maximum58
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size654.0 B
2023-12-12T13:40:17.283276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.85
Q115.25
median29.5
Q343.75
95-th percentile55.15
Maximum58
Range57
Interquartile range (IQR)28.5

Descriptive statistics

Standard deviation16.886879
Coefficient of variation (CV)0.57243656
Kurtosis-1.2
Mean29.5
Median Absolute Deviation (MAD)14.5
Skewness0
Sum1711
Variance285.16667
MonotonicityStrictly increasing
2023-12-12T13:40:17.409773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.7%
45 1
 
1.7%
33 1
 
1.7%
34 1
 
1.7%
35 1
 
1.7%
36 1
 
1.7%
37 1
 
1.7%
38 1
 
1.7%
39 1
 
1.7%
40 1
 
1.7%
Other values (48) 48
82.8%
ValueCountFrequency (%)
1 1
1.7%
2 1
1.7%
3 1
1.7%
4 1
1.7%
5 1
1.7%
6 1
1.7%
7 1
1.7%
8 1
1.7%
9 1
1.7%
10 1
1.7%
ValueCountFrequency (%)
58 1
1.7%
57 1
1.7%
56 1
1.7%
55 1
1.7%
54 1
1.7%
53 1
1.7%
52 1
1.7%
51 1
1.7%
50 1
1.7%
49 1
1.7%

상호
Text

UNIQUE 

Distinct58
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size596.0 B
2023-12-12T13:40:17.623500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length7.0517241
Min length3

Characters and Unicode

Total characters409
Distinct characters153
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)100.0%

Sample

1st row현대헬스클럽
2nd row다도해사우나헬스
3rd row미라클 헬스클럽
4th row헬스카페
5th row제이바디 휘트니스
ValueCountFrequency (%)
휘트니스 4
 
5.0%
gym 3
 
3.8%
바디스미스 3
 
3.8%
더원짐 1
 
1.2%
웅천 1
 
1.2%
블루 1
 
1.2%
고고짐(gogo 1
 
1.2%
주)갤럭시스포츠 1
 
1.2%
디오션피트니스 1
 
1.2%
킹콩짐 1
 
1.2%
Other values (63) 63
78.8%
2023-12-12T13:40:18.026361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
50
 
12.2%
22
 
5.4%
18
 
4.4%
12
 
2.9%
11
 
2.7%
10
 
2.4%
G 9
 
2.2%
9
 
2.2%
9
 
2.2%
Y 8
 
2.0%
Other values (143) 251
61.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 309
75.6%
Uppercase Letter 45
 
11.0%
Space Separator 22
 
5.4%
Lowercase Letter 14
 
3.4%
Close Punctuation 7
 
1.7%
Open Punctuation 7
 
1.7%
Decimal Number 3
 
0.7%
Dash Punctuation 1
 
0.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
16.2%
18
 
5.8%
12
 
3.9%
11
 
3.6%
10
 
3.2%
9
 
2.9%
9
 
2.9%
8
 
2.6%
6
 
1.9%
6
 
1.9%
Other values (110) 170
55.0%
Uppercase Letter
ValueCountFrequency (%)
G 9
20.0%
Y 8
17.8%
M 7
15.6%
C 3
 
6.7%
B 2
 
4.4%
T 2
 
4.4%
J 2
 
4.4%
E 2
 
4.4%
H 2
 
4.4%
O 2
 
4.4%
Other values (6) 6
13.3%
Lowercase Letter
ValueCountFrequency (%)
o 3
21.4%
s 2
14.3%
t 2
14.3%
m 1
 
7.1%
y 1
 
7.1%
g 1
 
7.1%
i 1
 
7.1%
e 1
 
7.1%
d 1
 
7.1%
u 1
 
7.1%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
2 1
33.3%
Space Separator
ValueCountFrequency (%)
22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
: 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 309
75.6%
Latin 59
 
14.4%
Common 41
 
10.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
16.2%
18
 
5.8%
12
 
3.9%
11
 
3.6%
10
 
3.2%
9
 
2.9%
9
 
2.9%
8
 
2.6%
6
 
1.9%
6
 
1.9%
Other values (110) 170
55.0%
Latin
ValueCountFrequency (%)
G 9
15.3%
Y 8
13.6%
M 7
 
11.9%
o 3
 
5.1%
C 3
 
5.1%
B 2
 
3.4%
T 2
 
3.4%
s 2
 
3.4%
J 2
 
3.4%
E 2
 
3.4%
Other values (16) 19
32.2%
Common
ValueCountFrequency (%)
22
53.7%
) 7
 
17.1%
( 7
 
17.1%
1 2
 
4.9%
2 1
 
2.4%
- 1
 
2.4%
: 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 309
75.6%
ASCII 100
 
24.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
50
 
16.2%
18
 
5.8%
12
 
3.9%
11
 
3.6%
10
 
3.2%
9
 
2.9%
9
 
2.9%
8
 
2.6%
6
 
1.9%
6
 
1.9%
Other values (110) 170
55.0%
ASCII
ValueCountFrequency (%)
22
22.0%
G 9
 
9.0%
Y 8
 
8.0%
M 7
 
7.0%
) 7
 
7.0%
( 7
 
7.0%
o 3
 
3.0%
C 3
 
3.0%
1 2
 
2.0%
B 2
 
2.0%
Other values (23) 30
30.0%

우편번호
Real number (ℝ)

Distinct42
Distinct (%)72.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68158.828
Minimum59629
Maximum550817
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size654.0 B
2023-12-12T13:40:18.154009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum59629
5-th percentile59638.8
Q159664.5
median59691
Q359714.5
95-th percentile59763.05
Maximum550817
Range491188
Interquartile range (IQR)50

Descriptive statistics

Standard deviation64487.995
Coefficient of variation (CV)0.94614296
Kurtosis57.99996
Mean68158.828
Median Absolute Deviation (MAD)26
Skewness7.6157693
Sum3953212
Variance4.1587015 × 109
MonotonicityNot monotonic
2023-12-12T13:40:18.268028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
59691 4
 
6.9%
59672 3
 
5.2%
59689 3
 
5.2%
59694 2
 
3.4%
59655 2
 
3.4%
59676 2
 
3.4%
59713 2
 
3.4%
59721 2
 
3.4%
59646 2
 
3.4%
59629 2
 
3.4%
Other values (32) 34
58.6%
ValueCountFrequency (%)
59629 2
3.4%
59632 1
1.7%
59640 2
3.4%
59643 1
1.7%
59645 1
1.7%
59646 2
3.4%
59649 1
1.7%
59655 2
3.4%
59660 1
1.7%
59662 1
1.7%
ValueCountFrequency (%)
550817 1
1.7%
59771 1
1.7%
59769 1
1.7%
59762 1
1.7%
59761 1
1.7%
59753 1
1.7%
59750 1
1.7%
59747 1
1.7%
59732 1
1.7%
59726 1
1.7%
Distinct58
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size596.0 B
2023-12-12T13:40:18.552767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length33
Mean length25.793103
Min length20

Characters and Unicode

Total characters1496
Distinct characters119
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)100.0%

Sample

1st row전라남도 여수시 신월로 668 (국동)
2nd row전라남도 여수시 한재로 182 (광무동)
3rd row전라남도 여수시 충민로 176 (덕충동)
4th row전라남도 여수시 통제영1길 13 (충무동)
5th row전라남도 여수시 봉계대곡길 28, 2층 (봉계동)
ValueCountFrequency (%)
전라남도 58
 
17.6%
여수시 58
 
17.6%
학동 9
 
2.7%
3층 8
 
2.4%
문수동 7
 
2.1%
웅천동 6
 
1.8%
선원동 5
 
1.5%
여서동 4
 
1.2%
2층 4
 
1.2%
예울마루로 4
 
1.2%
Other values (136) 167
50.6%
2023-12-12T13:40:19.048148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
272
18.2%
75
 
5.0%
73
 
4.9%
61
 
4.1%
61
 
4.1%
61
 
4.1%
60
 
4.0%
58
 
3.9%
57
 
3.8%
) 54
 
3.6%
Other values (109) 664
44.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 865
57.8%
Space Separator 272
 
18.2%
Decimal Number 208
 
13.9%
Close Punctuation 54
 
3.6%
Open Punctuation 54
 
3.6%
Other Punctuation 29
 
1.9%
Dash Punctuation 14
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
75
 
8.7%
73
 
8.4%
61
 
7.1%
61
 
7.1%
61
 
7.1%
60
 
6.9%
58
 
6.7%
57
 
6.6%
40
 
4.6%
19
 
2.2%
Other values (94) 300
34.7%
Decimal Number
ValueCountFrequency (%)
1 40
19.2%
3 34
16.3%
2 27
13.0%
5 26
12.5%
4 16
 
7.7%
6 16
 
7.7%
7 15
 
7.2%
0 14
 
6.7%
8 14
 
6.7%
9 6
 
2.9%
Space Separator
ValueCountFrequency (%)
272
100.0%
Close Punctuation
ValueCountFrequency (%)
) 54
100.0%
Open Punctuation
ValueCountFrequency (%)
( 54
100.0%
Other Punctuation
ValueCountFrequency (%)
, 29
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 865
57.8%
Common 631
42.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
75
 
8.7%
73
 
8.4%
61
 
7.1%
61
 
7.1%
61
 
7.1%
60
 
6.9%
58
 
6.7%
57
 
6.6%
40
 
4.6%
19
 
2.2%
Other values (94) 300
34.7%
Common
ValueCountFrequency (%)
272
43.1%
) 54
 
8.6%
( 54
 
8.6%
1 40
 
6.3%
3 34
 
5.4%
, 29
 
4.6%
2 27
 
4.3%
5 26
 
4.1%
4 16
 
2.5%
6 16
 
2.5%
Other values (5) 63
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 865
57.8%
ASCII 631
42.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
272
43.1%
) 54
 
8.6%
( 54
 
8.6%
1 40
 
6.3%
3 34
 
5.4%
, 29
 
4.6%
2 27
 
4.3%
5 26
 
4.1%
4 16
 
2.5%
6 16
 
2.5%
Other values (5) 63
 
10.0%
Hangul
ValueCountFrequency (%)
75
 
8.7%
73
 
8.4%
61
 
7.1%
61
 
7.1%
61
 
7.1%
60
 
6.9%
58
 
6.7%
57
 
6.6%
40
 
4.6%
19
 
2.2%
Other values (94) 300
34.7%

시설전화번호
Text

MISSING 

Distinct32
Distinct (%)100.0%
Missing26
Missing (%)44.8%
Memory size596.0 B
2023-12-12T13:40:19.811227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.0625
Min length12

Characters and Unicode

Total characters386
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)100.0%

Sample

1st row061-641-2572
2nd row061-644-5800
3rd row061-666-5111
4th row061-663-8875
5th row061-686-2200
ValueCountFrequency (%)
061-643-4321 1
 
3.1%
061-666-5111 1
 
3.1%
061-686-8008 1
 
3.1%
061-683-3400 1
 
3.1%
061-660-5800 1
 
3.1%
061-680-3005 1
 
3.1%
061-655-0505 1
 
3.1%
061-686-3288 1
 
3.1%
061-685-2330 1
 
3.1%
061-661-2000 1
 
3.1%
Other values (22) 22
68.8%
2023-12-12T13:40:20.229660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 79
20.5%
0 69
17.9%
- 64
16.6%
1 45
11.7%
5 34
8.8%
8 25
 
6.5%
4 20
 
5.2%
3 18
 
4.7%
2 17
 
4.4%
7 10
 
2.6%
Other values (2) 5
 
1.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 321
83.2%
Dash Punctuation 64
 
16.6%
Math Symbol 1
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 79
24.6%
0 69
21.5%
1 45
14.0%
5 34
10.6%
8 25
 
7.8%
4 20
 
6.2%
3 18
 
5.6%
2 17
 
5.3%
7 10
 
3.1%
9 4
 
1.2%
Dash Punctuation
ValueCountFrequency (%)
- 64
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 386
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
6 79
20.5%
0 69
17.9%
- 64
16.6%
1 45
11.7%
5 34
8.8%
8 25
 
6.5%
4 20
 
5.2%
3 18
 
4.7%
2 17
 
4.4%
7 10
 
2.6%
Other values (2) 5
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 386
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 79
20.5%
0 69
17.9%
- 64
16.6%
1 45
11.7%
5 34
8.8%
8 25
 
6.5%
4 20
 
5.2%
3 18
 
4.7%
2 17
 
4.4%
7 10
 
2.6%
Other values (2) 5
 
1.3%

Interactions

2023-12-12T13:40:16.906160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:40:16.743918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:40:16.978760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:40:16.823695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:40:20.355464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상호우편번호시설주소(도로명)시설전화번호
연번1.0001.0000.2241.0001.000
상호1.0001.0001.0001.0001.000
우편번호0.2241.0001.0001.000NaN
시설주소(도로명)1.0001.0001.0001.0001.000
시설전화번호1.0001.000NaN1.0001.000
2023-12-12T13:40:20.471145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번우편번호
연번1.000-0.122
우편번호-0.1221.000

Missing values

2023-12-12T13:40:17.072345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:40:17.169933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호우편번호시설주소(도로명)시설전화번호
01현대헬스클럽59761전라남도 여수시 신월로 668 (국동)061-641-2572
12다도해사우나헬스59750전라남도 여수시 한재로 182 (광무동)061-644-5800
23미라클 헬스클럽59726전라남도 여수시 충민로 176 (덕충동)061-666-5111
34헬스카페59732전라남도 여수시 통제영1길 13 (충무동)061-663-8875
45제이바디 휘트니스59632전라남도 여수시 봉계대곡길 28, 2층 (봉계동)<NA>
56여천헬스타운59687전라남도 여수시 흥국로 74 (신기동)061-686-2200
67건강스포츠59666전라남도 여수시 소호5길 6-8 (소호동)061-681-1947
78(주)유심천레저산업(헬스)59660전라남도 여수시 소라면 안심산길 155061-681-5657~8
89고궁스포츠클럽59695전라남도 여수시 쌍봉로 353, 2층 (둔덕동)061-652-0014
910훈GYM헬스550817전라남도 여수시 여문2로 110, 201호 (문수동, 캐슬하임아파트상가)<NA>
연번상호우편번호시설주소(도로명)시설전화번호
4849여문휘트니스59700전라남도 여수시 문수2길 9-11 (문수동)061-655-4000
4950호텔JCS여수(헬스)59771전라남도 여수시 돌산읍 돌산로 3169-14061-643-5000
5051자이언트헬스59673전라남도 여수시 학동서4길 58 (학동)<NA>
5152마음가짐59691전라남도 여수시 예울마루로 41, 2층 (웅천동)<NA>
5253에이스핏짐59689전라남도 여수시 망마로 40, 6층 (학동)<NA>
5354여수헬스 스파랜드59753전라남도 여수시 남산로 60-31 (주)여수수산물특화시장 3층 (남산동)<NA>
5455야무짐59646전라남도 여수시 도원로 175, 5층 (선원동)<NA>
5556다GYM59694전라남도 여수시 여서로 58, 상가동 201호 (웅천동, 여수웅천 마린파크 애시앙 2단지)<NA>
5657더존 기구 필라테스59640전라남도 여수시 무선로 187, 롯데마트 여천점 3층 (선원동)<NA>
5758더 멋짐59640전라남도 여수시 무선중앙로 59, 2층 (선원동)<NA>