Overview

Dataset statistics

Number of variables9
Number of observations336
Missing cells631
Missing cells (%)20.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.7 KiB
Average record size in memory75.4 B

Variable types

Categorical2
Text4
Numeric3

Dataset

Description부산광역시 동래구 관내 체육시설업소 현황에 대한 데이터로 업종, 상호명, 우편번호, 지번주소, 도로명주소, 전화번호, 부지면적, 면적, 연면적, 상세분류 등의 항목을 제공합니다.
Author부산광역시 동래구
URLhttps://www.data.go.kr/data/3078915/fileData.do

Alerts

체육도장업상세분류 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 시설부지면적(제곱미터) and 2 other fieldsHigh correlation
시설부지면적(제곱미터) is highly overall correlated with 시설면적(제곱미터) and 2 other fieldsHigh correlation
시설면적(제곱미터) is highly overall correlated with 시설부지면적(제곱미터) and 2 other fieldsHigh correlation
건물연면적(제곱미터) is highly overall correlated with 시설부지면적(제곱미터) and 1 other fieldsHigh correlation
체육도장업상세분류 is highly imbalanced (53.8%)Imbalance
시설전화번호 has 279 (83.0%) missing valuesMissing
시설부지면적(제곱미터) has 190 (56.5%) missing valuesMissing
건물연면적(제곱미터) has 159 (47.3%) missing valuesMissing
건물연면적(제곱미터) has 5 (1.5%) zerosZeros

Reproduction

Analysis started2024-03-14 15:10:49.856394
Analysis finished2024-03-14 15:10:53.699145
Duration3.84 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
체력단련장업
104 
체육도장업
88 
당구장업
63 
골프연습장업
32 
가상체험 체육시설업
22 
Other values (6)
27 

Length

Max length10
Median length7
Mean length5.5446429
Min length4

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row수영장업
2nd row수영장업
3rd row수영장업
4th row수영장업
5th row수영장업

Common Values

ValueCountFrequency (%)
체력단련장업 104
31.0%
체육도장업 88
26.2%
당구장업 63
18.8%
골프연습장업 32
 
9.5%
가상체험 체육시설업 22
 
6.5%
체육교습업 12
 
3.6%
수영장업 6
 
1.8%
무도학원업 3
 
0.9%
인공암벽장업 3
 
0.9%
종합체육시설업 2
 
0.6%

Length

2024-03-15T00:10:53.830678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
체력단련장업 104
29.1%
체육도장업 88
24.6%
당구장업 63
17.6%
골프연습장업 32
 
8.9%
가상체험 22
 
6.1%
체육시설업 22
 
6.1%
체육교습업 12
 
3.4%
수영장업 6
 
1.7%
무도학원업 3
 
0.8%
인공암벽장업 3
 
0.8%
Other values (2) 3
 
0.8%

상호
Text

Distinct331
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2024-03-15T00:10:54.777046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length7.4791667
Min length2

Characters and Unicode

Total characters2513
Distinct characters377
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique326 ?
Unique (%)97.0%

Sample

1st row키즈스플래쉬(부산본점)
2nd row망고키즈수영장(사직점)
3rd row아이올림픽 유소년 체육센터
4th row골드키즈
5th row지니키즈스윔스쿨
ValueCountFrequency (%)
당구클럽 12
 
2.2%
동래점 11
 
2.1%
당구장 11
 
2.1%
휘트니스 9
 
1.7%
태권도 8
 
1.5%
gym 7
 
1.3%
피트니스 6
 
1.1%
사직점 5
 
0.9%
골프 4
 
0.7%
스크린골프 4
 
0.7%
Other values (420) 458
85.6%
2024-03-15T00:10:55.982650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
199
 
7.9%
116
 
4.6%
64
 
2.5%
61
 
2.4%
58
 
2.3%
56
 
2.2%
49
 
1.9%
48
 
1.9%
46
 
1.8%
43
 
1.7%
Other values (367) 1773
70.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2098
83.5%
Space Separator 199
 
7.9%
Uppercase Letter 131
 
5.2%
Lowercase Letter 49
 
1.9%
Decimal Number 14
 
0.6%
Open Punctuation 9
 
0.4%
Close Punctuation 9
 
0.4%
Other Punctuation 3
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
116
 
5.5%
64
 
3.1%
61
 
2.9%
58
 
2.8%
56
 
2.7%
49
 
2.3%
48
 
2.3%
46
 
2.2%
43
 
2.0%
42
 
2.0%
Other values (311) 1515
72.2%
Uppercase Letter
ValueCountFrequency (%)
M 18
13.7%
G 14
 
10.7%
P 10
 
7.6%
T 9
 
6.9%
Y 8
 
6.1%
J 7
 
5.3%
S 6
 
4.6%
I 6
 
4.6%
B 5
 
3.8%
R 5
 
3.8%
Other values (15) 43
32.8%
Lowercase Letter
ValueCountFrequency (%)
i 6
12.2%
o 5
10.2%
n 5
10.2%
r 4
 
8.2%
c 4
 
8.2%
m 4
 
8.2%
e 3
 
6.1%
y 3
 
6.1%
s 3
 
6.1%
f 2
 
4.1%
Other values (6) 10
20.4%
Decimal Number
ValueCountFrequency (%)
1 4
28.6%
2 2
14.3%
3 2
14.3%
9 2
14.3%
0 1
 
7.1%
4 1
 
7.1%
6 1
 
7.1%
5 1
 
7.1%
Other Punctuation
ValueCountFrequency (%)
& 1
33.3%
' 1
33.3%
. 1
33.3%
Space Separator
ValueCountFrequency (%)
199
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2098
83.5%
Common 234
 
9.3%
Latin 181
 
7.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
116
 
5.5%
64
 
3.1%
61
 
2.9%
58
 
2.8%
56
 
2.7%
49
 
2.3%
48
 
2.3%
46
 
2.2%
43
 
2.0%
42
 
2.0%
Other values (311) 1515
72.2%
Latin
ValueCountFrequency (%)
M 18
 
9.9%
G 14
 
7.7%
P 10
 
5.5%
T 9
 
5.0%
Y 8
 
4.4%
J 7
 
3.9%
S 6
 
3.3%
I 6
 
3.3%
i 6
 
3.3%
o 5
 
2.8%
Other values (32) 92
50.8%
Common
ValueCountFrequency (%)
199
85.0%
( 9
 
3.8%
) 9
 
3.8%
1 4
 
1.7%
2 2
 
0.9%
3 2
 
0.9%
9 2
 
0.9%
0 1
 
0.4%
& 1
 
0.4%
4 1
 
0.4%
Other values (4) 4
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2098
83.5%
ASCII 414
 
16.5%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
199
48.1%
M 18
 
4.3%
G 14
 
3.4%
P 10
 
2.4%
( 9
 
2.2%
) 9
 
2.2%
T 9
 
2.2%
Y 8
 
1.9%
J 7
 
1.7%
S 6
 
1.4%
Other values (45) 125
30.2%
Hangul
ValueCountFrequency (%)
116
 
5.5%
64
 
3.1%
61
 
2.9%
58
 
2.8%
56
 
2.7%
49
 
2.3%
48
 
2.3%
46
 
2.2%
43
 
2.0%
42
 
2.0%
Other values (311) 1515
72.2%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct310
Distinct (%)92.5%
Missing1
Missing (%)0.3%
Memory size2.8 KiB
2024-03-15T00:10:57.262596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length42
Mean length23.674627
Min length18

Characters and Unicode

Total characters7931
Distinct characters144
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique292 ?
Unique (%)87.2%

Sample

1st row부산광역시 동래구 낙민동 84-8 보림프라자 지1층 108호
2nd row부산광역시 동래구 사직동 74-18
3rd row부산광역시 동래구 명륜동 96-5
4th row부산광역시 동래구 낙민동 90-4 B동
5th row부산광역시 동래구 온천동 1834 동래 지웰
ValueCountFrequency (%)
부산광역시 335
21.1%
동래구 335
21.1%
온천동 103
 
6.5%
사직동 87
 
5.5%
안락동 57
 
3.6%
명륜동 45
 
2.8%
3층 26
 
1.6%
명장동 24
 
1.5%
2층 17
 
1.1%
수안동 14
 
0.9%
Other values (397) 542
34.2%
2024-03-15T00:10:58.901345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1566
19.7%
703
 
8.9%
343
 
4.3%
339
 
4.3%
337
 
4.2%
336
 
4.2%
335
 
4.2%
335
 
4.2%
335
 
4.2%
1 333
 
4.2%
Other values (134) 2969
37.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4338
54.7%
Decimal Number 1669
 
21.0%
Space Separator 1566
 
19.7%
Dash Punctuation 310
 
3.9%
Uppercase Letter 18
 
0.2%
Other Punctuation 14
 
0.2%
Open Punctuation 6
 
0.1%
Close Punctuation 6
 
0.1%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
703
16.2%
343
 
7.9%
339
 
7.8%
337
 
7.8%
336
 
7.7%
335
 
7.7%
335
 
7.7%
335
 
7.7%
110
 
2.5%
107
 
2.5%
Other values (110) 1058
24.4%
Decimal Number
ValueCountFrequency (%)
1 333
20.0%
4 208
12.5%
2 207
12.4%
3 190
11.4%
5 152
9.1%
7 125
 
7.5%
0 123
 
7.4%
6 113
 
6.8%
9 112
 
6.7%
8 106
 
6.4%
Uppercase Letter
ValueCountFrequency (%)
B 5
27.8%
K 4
22.2%
S 4
22.2%
H 2
 
11.1%
U 1
 
5.6%
J 1
 
5.6%
Y 1
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 13
92.9%
· 1
 
7.1%
Space Separator
ValueCountFrequency (%)
1566
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 310
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4338
54.7%
Common 3575
45.1%
Latin 18
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
703
16.2%
343
 
7.9%
339
 
7.8%
337
 
7.8%
336
 
7.7%
335
 
7.7%
335
 
7.7%
335
 
7.7%
110
 
2.5%
107
 
2.5%
Other values (110) 1058
24.4%
Common
ValueCountFrequency (%)
1566
43.8%
1 333
 
9.3%
- 310
 
8.7%
4 208
 
5.8%
2 207
 
5.8%
3 190
 
5.3%
5 152
 
4.3%
7 125
 
3.5%
0 123
 
3.4%
6 113
 
3.2%
Other values (7) 248
 
6.9%
Latin
ValueCountFrequency (%)
B 5
27.8%
K 4
22.2%
S 4
22.2%
H 2
 
11.1%
U 1
 
5.6%
J 1
 
5.6%
Y 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4338
54.7%
ASCII 3592
45.3%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1566
43.6%
1 333
 
9.3%
- 310
 
8.6%
4 208
 
5.8%
2 207
 
5.8%
3 190
 
5.3%
5 152
 
4.2%
7 125
 
3.5%
0 123
 
3.4%
6 113
 
3.1%
Other values (13) 265
 
7.4%
Hangul
ValueCountFrequency (%)
703
16.2%
343
 
7.9%
339
 
7.8%
337
 
7.8%
336
 
7.7%
335
 
7.7%
335
 
7.7%
335
 
7.7%
110
 
2.5%
107
 
2.5%
Other values (110) 1058
24.4%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct329
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2024-03-15T00:11:00.075765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length70
Median length50
Mean length32.017857
Min length22

Characters and Unicode

Total characters10758
Distinct characters193
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique324 ?
Unique (%)96.4%

Sample

1st row부산광역시 동래구 안남로 23, 지1층 108호 (낙민동, 보림프라자)
2nd row부산광역시 동래구 석사로18번길 41 (사직동)
3rd row부산광역시 동래구 시실로 60 (명륜동)
4th row부산광역시 동래구 안남로 32, B동 (낙민동)
5th row부산광역시 동래구 쇠미로 197, 104동 지하1층 101~103호 (온천동, 동래 지웰)
ValueCountFrequency (%)
동래구 337
 
16.1%
부산광역시 336
 
16.0%
온천동 91
 
4.3%
사직동 83
 
4.0%
안락동 48
 
2.3%
2층 47
 
2.2%
3층 45
 
2.1%
명륜동 42
 
2.0%
지하1층 26
 
1.2%
사직북로 22
 
1.1%
Other values (466) 1018
48.6%
2024-03-15T00:11:01.891094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1777
 
16.5%
727
 
6.8%
371
 
3.4%
354
 
3.3%
, 353
 
3.3%
340
 
3.2%
( 339
 
3.2%
) 338
 
3.1%
337
 
3.1%
337
 
3.1%
Other values (183) 5485
51.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6282
58.4%
Space Separator 1777
 
16.5%
Decimal Number 1603
 
14.9%
Other Punctuation 354
 
3.3%
Open Punctuation 339
 
3.2%
Close Punctuation 338
 
3.1%
Uppercase Letter 29
 
0.3%
Dash Punctuation 27
 
0.3%
Math Symbol 9
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
727
 
11.6%
371
 
5.9%
354
 
5.6%
340
 
5.4%
337
 
5.4%
337
 
5.4%
337
 
5.4%
336
 
5.3%
333
 
5.3%
250
 
4.0%
Other values (158) 2560
40.8%
Decimal Number
ValueCountFrequency (%)
1 327
20.4%
2 269
16.8%
3 211
13.2%
4 163
10.2%
0 146
9.1%
5 121
 
7.5%
6 104
 
6.5%
8 100
 
6.2%
7 88
 
5.5%
9 74
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
B 8
27.6%
S 6
20.7%
K 6
20.7%
H 3
 
10.3%
U 2
 
6.9%
Y 2
 
6.9%
J 1
 
3.4%
A 1
 
3.4%
Other Punctuation
ValueCountFrequency (%)
, 353
99.7%
· 1
 
0.3%
Space Separator
ValueCountFrequency (%)
1777
100.0%
Open Punctuation
ValueCountFrequency (%)
( 339
100.0%
Close Punctuation
ValueCountFrequency (%)
) 338
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6282
58.4%
Common 4447
41.3%
Latin 29
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
727
 
11.6%
371
 
5.9%
354
 
5.6%
340
 
5.4%
337
 
5.4%
337
 
5.4%
337
 
5.4%
336
 
5.3%
333
 
5.3%
250
 
4.0%
Other values (158) 2560
40.8%
Common
ValueCountFrequency (%)
1777
40.0%
, 353
 
7.9%
( 339
 
7.6%
) 338
 
7.6%
1 327
 
7.4%
2 269
 
6.0%
3 211
 
4.7%
4 163
 
3.7%
0 146
 
3.3%
5 121
 
2.7%
Other values (7) 403
 
9.1%
Latin
ValueCountFrequency (%)
B 8
27.6%
S 6
20.7%
K 6
20.7%
H 3
 
10.3%
U 2
 
6.9%
Y 2
 
6.9%
J 1
 
3.4%
A 1
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6282
58.4%
ASCII 4475
41.6%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1777
39.7%
, 353
 
7.9%
( 339
 
7.6%
) 338
 
7.6%
1 327
 
7.3%
2 269
 
6.0%
3 211
 
4.7%
4 163
 
3.6%
0 146
 
3.3%
5 121
 
2.7%
Other values (14) 431
 
9.6%
Hangul
ValueCountFrequency (%)
727
 
11.6%
371
 
5.9%
354
 
5.6%
340
 
5.4%
337
 
5.4%
337
 
5.4%
337
 
5.4%
336
 
5.3%
333
 
5.3%
250
 
4.0%
Other values (158) 2560
40.8%
None
ValueCountFrequency (%)
· 1
100.0%

시설전화번호
Text

MISSING 

Distinct56
Distinct (%)98.2%
Missing279
Missing (%)83.0%
Memory size2.8 KiB
2024-03-15T00:11:02.853046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.017544
Min length12

Characters and Unicode

Total characters685
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)96.5%

Sample

1st row051-503-6888
2nd row051-558-3476
3rd row051-529-9648
4th row051-527-4177
5th row051-553-0224
ValueCountFrequency (%)
051-555-7621 2
 
3.5%
051-553-7709 1
 
1.8%
051-522-5770 1
 
1.8%
051-582-0546 1
 
1.8%
051-552-8409 1
 
1.8%
051-501-0333 1
 
1.8%
051-754-6721 1
 
1.8%
051-803-5069 1
 
1.8%
051-503-3836 1
 
1.8%
051-501-1940 1
 
1.8%
Other values (46) 46
80.7%
2024-03-15T00:11:04.195918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 155
22.6%
- 114
16.6%
0 102
14.9%
1 90
13.1%
2 49
 
7.2%
7 42
 
6.1%
3 33
 
4.8%
8 31
 
4.5%
6 26
 
3.8%
4 22
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 571
83.4%
Dash Punctuation 114
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 155
27.1%
0 102
17.9%
1 90
15.8%
2 49
 
8.6%
7 42
 
7.4%
3 33
 
5.8%
8 31
 
5.4%
6 26
 
4.6%
4 22
 
3.9%
9 21
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 114
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 685
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 155
22.6%
- 114
16.6%
0 102
14.9%
1 90
13.1%
2 49
 
7.2%
7 42
 
6.1%
3 33
 
4.8%
8 31
 
4.5%
6 26
 
3.8%
4 22
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 685
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 155
22.6%
- 114
16.6%
0 102
14.9%
1 90
13.1%
2 49
 
7.2%
7 42
 
6.1%
3 33
 
4.8%
8 31
 
4.5%
6 26
 
3.8%
4 22
 
3.2%

시설부지면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct142
Distinct (%)97.3%
Missing190
Missing (%)56.5%
Infinite0
Infinite (%)0.0%
Mean336.24979
Minimum0
Maximum3580
Zeros2
Zeros (%)0.6%
Negative0
Negative (%)0.0%
Memory size3.1 KiB
2024-03-15T00:11:04.608287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile100.115
Q1153.0025
median213.475
Q3326.79
95-th percentile823.13
Maximum3580
Range3580
Interquartile range (IQR)173.7875

Descriptive statistics

Standard deviation443.38716
Coefficient of variation (CV)1.3186243
Kurtosis28.635818
Mean336.24979
Median Absolute Deviation (MAD)79.335
Skewness4.8779854
Sum49092.47
Variance196592.18
MonotonicityNot monotonic
2024-03-15T00:11:05.052237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
348.36 2
 
0.6%
195.54 2
 
0.6%
0.0 2
 
0.6%
326.79 2
 
0.6%
493.4 1
 
0.3%
489.8 1
 
0.3%
177.54 1
 
0.3%
228.96 1
 
0.3%
135.8 1
 
0.3%
235.52 1
 
0.3%
Other values (132) 132
39.3%
(Missing) 190
56.5%
ValueCountFrequency (%)
0.0 2
0.6%
71.15 1
0.3%
81.38 1
0.3%
95.04 1
0.3%
96.0 1
0.3%
97.2 1
0.3%
98.78 1
0.3%
104.12 1
0.3%
104.76 1
0.3%
105.63 1
0.3%
ValueCountFrequency (%)
3580.0 1
0.3%
2952.0 1
0.3%
1939.86 1
0.3%
1821.6 1
0.3%
1230.0 1
0.3%
1036.16 1
0.3%
955.68 1
0.3%
831.18 1
0.3%
798.98 1
0.3%
776.34 1
0.3%

시설면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct319
Distinct (%)95.5%
Missing2
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean335.29111
Minimum30
Maximum4869.06
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.1 KiB
2024-03-15T00:11:05.475512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum30
5-th percentile96.5135
Q1151.4225
median224.61
Q3344.25
95-th percentile764.307
Maximum4869.06
Range4839.06
Interquartile range (IQR)192.8275

Descriptive statistics

Standard deviation465.05656
Coefficient of variation (CV)1.3870232
Kurtosis47.811293
Mean335.29111
Median Absolute Deviation (MAD)88.83
Skewness6.2387769
Sum111987.23
Variance216277.6
MonotonicityNot monotonic
2024-03-15T00:11:05.923411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
326.79 3
 
0.9%
233.58 3
 
0.9%
174.79 2
 
0.6%
478.0 2
 
0.6%
199.33 2
 
0.6%
212.42 2
 
0.6%
305.07 2
 
0.6%
135.0 2
 
0.6%
195.54 2
 
0.6%
280.8 2
 
0.6%
Other values (309) 312
92.9%
ValueCountFrequency (%)
30.0 1
0.3%
42.93 1
0.3%
48.14 1
0.3%
59.82 1
0.3%
65.5 1
0.3%
71.15 1
0.3%
81.38 1
0.3%
84.91 1
0.3%
86.62 1
0.3%
90.36 1
0.3%
ValueCountFrequency (%)
4869.06 1
0.3%
4216.81 1
0.3%
2952.0 1
0.3%
2764.0 1
0.3%
2710.75 1
0.3%
1939.86 1
0.3%
1707.47 1
0.3%
1596.92 1
0.3%
1237.98 1
0.3%
1086.29 1
0.3%

건물연면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct156
Distinct (%)88.1%
Missing159
Missing (%)47.3%
Infinite0
Infinite (%)0.0%
Mean5526.4212
Minimum0
Maximum222898.94
Zeros5
Zeros (%)1.5%
Negative0
Negative (%)0.0%
Memory size3.1 KiB
2024-03-15T00:11:06.341645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile111.248
Q1658.02
median1148.47
Q32901.93
95-th percentile23195.35
Maximum222898.94
Range222898.94
Interquartile range (IQR)2243.91

Descriptive statistics

Standard deviation19672.154
Coefficient of variation (CV)3.5596552
Kurtosis86.793871
Mean5526.4212
Median Absolute Deviation (MAD)749.63
Skewness8.4630781
Sum978176.55
Variance3.8699363 × 108
MonotonicityNot monotonic
2024-03-15T00:11:06.773402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 5
 
1.5%
23195.35 4
 
1.2%
5241.68 3
 
0.9%
1497.2 2
 
0.6%
1706.27 2
 
0.6%
987.04 2
 
0.6%
4560.44 2
 
0.6%
3772.26 2
 
0.6%
3409.05 2
 
0.6%
2942.95 2
 
0.6%
Other values (146) 151
44.9%
(Missing) 159
47.3%
ValueCountFrequency (%)
0.0 5
1.5%
81.38 1
 
0.3%
87.93 1
 
0.3%
96.0 1
 
0.3%
107.2 1
 
0.3%
112.26 1
 
0.3%
129.0 1
 
0.3%
129.6 1
 
0.3%
142.2 1
 
0.3%
213.84 1
 
0.3%
ValueCountFrequency (%)
222898.94 1
 
0.3%
76389.4 1
 
0.3%
64681.13 2
0.6%
48705.09 1
 
0.3%
42844.95 1
 
0.3%
30146.0 1
 
0.3%
23195.36 1
 
0.3%
23195.35 4
1.2%
22278.93 1
 
0.3%
21338.4 1
 
0.3%

체육도장업상세분류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct13
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
<NA>
222 
태권도
61 
골프종목
 
20
권투
 
7
합기도
 
6
Other values (8)
 
20

Length

Max length4
Median length4
Mean length3.6607143
Min length2

Unique

Unique2 ?
Unique (%)0.6%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 222
66.1%
태권도 61
 
18.2%
골프종목 20
 
6.0%
권투 7
 
2.1%
합기도 6
 
1.8%
검도 5
 
1.5%
유도 5
 
1.5%
우슈 2
 
0.6%
레슬링 2
 
0.6%
야구종목 2
 
0.6%
Other values (3) 4
 
1.2%

Length

2024-03-15T00:11:07.217564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 222
66.1%
태권도 61
 
18.2%
골프종목 20
 
6.0%
권투 7
 
2.1%
합기도 6
 
1.8%
검도 5
 
1.5%
유도 5
 
1.5%
우슈 2
 
0.6%
레슬링 2
 
0.6%
야구종목 2
 
0.6%
Other values (3) 4
 
1.2%

Interactions

2024-03-15T00:10:52.225692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:10:50.692075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:10:51.478749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:10:52.487817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:10:50.969717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:10:51.734670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:10:52.722959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:10:51.218773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:10:51.987558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T00:11:07.477353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종시설전화번호시설부지면적(제곱미터)시설면적(제곱미터)건물연면적(제곱미터)체육도장업상세분류
업종1.0001.0000.7570.7800.6511.000
시설전화번호1.0001.0001.0001.000NaN1.000
시설부지면적(제곱미터)0.7571.0001.0000.9880.0000.786
시설면적(제곱미터)0.7801.0000.9881.0000.6470.124
건물연면적(제곱미터)0.651NaN0.0000.6471.0000.415
체육도장업상세분류1.0001.0000.7860.1240.4151.000
2024-03-15T00:11:07.763609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
체육도장업상세분류업종
체육도장업상세분류1.0000.959
업종0.9591.000
2024-03-15T00:11:08.009100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설부지면적(제곱미터)시설면적(제곱미터)건물연면적(제곱미터)업종체육도장업상세분류
시설부지면적(제곱미터)1.0000.9590.5960.5340.451
시설면적(제곱미터)0.9591.0000.5970.5160.059
건물연면적(제곱미터)0.5960.5971.0000.3210.287
업종0.5340.5160.3211.0000.959
체육도장업상세분류0.4510.0590.2870.9591.000

Missing values

2024-03-15T00:10:52.912925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T00:10:53.344393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-15T00:10:53.562989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업종상호시설주소(지번)시설주소(도로명)시설전화번호시설부지면적(제곱미터)시설면적(제곱미터)건물연면적(제곱미터)체육도장업상세분류
0수영장업키즈스플래쉬(부산본점)부산광역시 동래구 낙민동 84-8 보림프라자 지1층 108호부산광역시 동래구 안남로 23, 지1층 108호 (낙민동, 보림프라자)<NA><NA>369.68993.58<NA>
1수영장업망고키즈수영장(사직점)부산광역시 동래구 사직동 74-18부산광역시 동래구 석사로18번길 41 (사직동)<NA><NA>345.0<NA><NA>
2수영장업아이올림픽 유소년 체육센터부산광역시 동래구 명륜동 96-5부산광역시 동래구 시실로 60 (명륜동)<NA><NA>709.7<NA><NA>
3수영장업골드키즈부산광역시 동래구 낙민동 90-4 B동부산광역시 동래구 안남로 32, B동 (낙민동)<NA><NA>329.7<NA><NA>
4수영장업지니키즈스윔스쿨부산광역시 동래구 온천동 1834 동래 지웰부산광역시 동래구 쇠미로 197, 104동 지하1층 101~103호 (온천동, 동래 지웰)<NA><NA>374.81844.72<NA>
5수영장업(주)케이비스포츠인재교육원부산광역시 동래구 사직동 151-8 여고 플래티넘부산광역시 동래구 미남로 52, 여고 플래티넘 1층 (사직동)051-503-6888<NA>292.98<NA><NA>
6체육도장업복천태권도부산광역시 동래구 복천동 500-1 우성아파트 상가 104호부산광역시 동래구 복천로5번길 34, 상가동 104호 (복천동, 우성아파트)051-558-3476190.9190.9<NA>태권도
7체육도장업수안체육관부산광역시 동래구 수안동 4-11부산광역시 동래구 명륜로63번길 14 (수안동)<NA>129.0129.0129.0태권도
8체육도장업대한태권도부산광역시 동래구 온천동 1592-1 동래삼정그린코아포레스트부산광역시 동래구 쇠미로 153, 107동 2층 (온천동, 동래삼정그린코아포레스트)<NA><NA>118.28<NA>태권도
9체육도장업동아태권도부산광역시 동래구 명장동 506-6부산광역시 동래구 명서로 116 (명장동)051-529-964871.1571.15<NA>태권도
업종상호시설주소(지번)시설주소(도로명)시설전화번호시설부지면적(제곱미터)시설면적(제곱미터)건물연면적(제곱미터)체육도장업상세분류
326체육교습업제이에스 축구클럽부산광역시 동래구 안락동 461-2부산광역시 동래구 화현길 8, 2층 (안락동)<NA><NA>295.2733.96<NA>
327체육교습업백호축구클럽부산광역시 동래구 낙민동 76-10 신라하우징부산광역시 동래구 온천천로337번길 12, 신라하우징 2층 (낙민동)<NA><NA>209.52750.77<NA>
328체육교습업히어로 스포츠부산광역시 동래구 사직동 47-3부산광역시 동래구 석사로 15, 지하1층 (사직동)<NA><NA>415.94985.04<NA>
329체육교습업타고나 스포츠 아카데미 4호점부산광역시 동래구 온천동 502-3 롯데백화점부산광역시 동래구 중앙대로 1393, 롯데백화점 10층 (온천동)<NA><NA>30.0<NA>축구
330체육교습업지니어스 음악줄넘기 명륜중앙점부산광역시 동래구 명륜동 700-148부산광역시 동래구 명륜로207번길 42, 2층 (명륜동)<NA><NA>151.15<NA>줄넘기
331체육교습업맥스FC 락커룸부산광역시 동래구 명륜동 680-3부산광역시 동래구 동래로79번길 19, 2,3층 (명륜동)<NA><NA>429.0<NA>축구
332체육교습업MCP성장체육부산광역시 동래구 온천동 1535-36부산광역시 동래구 아시아드대로195번길 66, 2층 (온천동)<NA><NA>169.69676.14농구
333인공암벽장업죠스클라이밍부산광역시 동래구 사직동 92-8 호영빌딩부산광역시 동래구 사직로 48, 호영빌딩 7층 (사직동)<NA><NA>280.82204.52<NA>
334인공암벽장업패밀리 클라이밍센터부산광역시 동래구 사직동 71-24부산광역시 동래구 석사로 42-1, 지하1층 (사직동)<NA><NA>283.141401.39<NA>
335인공암벽장업락오디세이 동래부산광역시 동래구 낙민동 91-3부산광역시 동래구 안남로31번길 6, 1,2층 (낙민동)<NA><NA>310.0597.17<NA>