Overview

Dataset statistics

Number of variables3
Number of observations247
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.2 KiB
Average record size in memory25.5 B

Variable types

Numeric1
Text2

Dataset

Description부산광역시_기장군_체육시설업현황_20230413
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3072011

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:13:38.270593
Analysis finished2023-12-10 16:13:39.018683
Duration0.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct247
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean124
Minimum1
Maximum247
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-11T01:13:39.135467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.3
Q162.5
median124
Q3185.5
95-th percentile234.7
Maximum247
Range246
Interquartile range (IQR)123

Descriptive statistics

Standard deviation71.44695
Coefficient of variation (CV)0.57618508
Kurtosis-1.2
Mean124
Median Absolute Deviation (MAD)62
Skewness0
Sum30628
Variance5104.6667
MonotonicityStrictly increasing
2023-12-11T01:13:39.400336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
171 1
 
0.4%
158 1
 
0.4%
159 1
 
0.4%
160 1
 
0.4%
161 1
 
0.4%
162 1
 
0.4%
163 1
 
0.4%
164 1
 
0.4%
165 1
 
0.4%
Other values (237) 237
96.0%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
247 1
0.4%
246 1
0.4%
245 1
0.4%
244 1
0.4%
243 1
0.4%
242 1
0.4%
241 1
0.4%
240 1
0.4%
239 1
0.4%
238 1
0.4%
Distinct244
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-11T01:13:39.858007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length18
Mean length8.2348178
Min length3

Characters and Unicode

Total characters2034
Distinct characters305
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique241 ?
Unique (%)97.6%

Sample

1st row힐튼호텔 아웃도어풀
2nd row힐튼호텔 인도어풀
3rd row워터하우스
4th row오너스클럽 야외수영장
5th row망고키즈수영장
ValueCountFrequency (%)
태권도 12
 
2.8%
당구클럽 11
 
2.5%
동아대 7
 
1.6%
골프연습장 7
 
1.6%
일광 6
 
1.4%
합기도 6
 
1.4%
스크린 6
 
1.4%
당구장 6
 
1.4%
태권도장 5
 
1.2%
용인대 4
 
0.9%
Other values (314) 364
83.9%
2023-12-11T01:13:40.522046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
187
 
9.2%
87
 
4.3%
66
 
3.2%
62
 
3.0%
59
 
2.9%
51
 
2.5%
45
 
2.2%
45
 
2.2%
44
 
2.2%
39
 
1.9%
Other values (295) 1349
66.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1643
80.8%
Space Separator 187
 
9.2%
Uppercase Letter 147
 
7.2%
Open Punctuation 17
 
0.8%
Close Punctuation 17
 
0.8%
Lowercase Letter 11
 
0.5%
Other Punctuation 7
 
0.3%
Decimal Number 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
87
 
5.3%
66
 
4.0%
62
 
3.8%
59
 
3.6%
51
 
3.1%
45
 
2.7%
45
 
2.7%
44
 
2.7%
39
 
2.4%
37
 
2.3%
Other values (250) 1108
67.4%
Uppercase Letter
ValueCountFrequency (%)
G 15
 
10.2%
M 14
 
9.5%
P 12
 
8.2%
O 12
 
8.2%
J 11
 
7.5%
T 10
 
6.8%
S 9
 
6.1%
N 8
 
5.4%
B 7
 
4.8%
Y 7
 
4.8%
Other values (14) 42
28.6%
Lowercase Letter
ValueCountFrequency (%)
i 2
18.2%
s 2
18.2%
u 1
9.1%
h 1
9.1%
t 1
9.1%
p 1
9.1%
r 1
9.1%
o 1
9.1%
c 1
9.1%
Other Punctuation
ValueCountFrequency (%)
' 2
28.6%
. 2
28.6%
1
14.3%
: 1
14.3%
& 1
14.3%
Decimal Number
ValueCountFrequency (%)
2 2
40.0%
5 1
20.0%
1 1
20.0%
7 1
20.0%
Space Separator
ValueCountFrequency (%)
187
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1643
80.8%
Common 233
 
11.5%
Latin 158
 
7.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
87
 
5.3%
66
 
4.0%
62
 
3.8%
59
 
3.6%
51
 
3.1%
45
 
2.7%
45
 
2.7%
44
 
2.7%
39
 
2.4%
37
 
2.3%
Other values (250) 1108
67.4%
Latin
ValueCountFrequency (%)
G 15
 
9.5%
M 14
 
8.9%
P 12
 
7.6%
O 12
 
7.6%
J 11
 
7.0%
T 10
 
6.3%
S 9
 
5.7%
N 8
 
5.1%
B 7
 
4.4%
Y 7
 
4.4%
Other values (23) 53
33.5%
Common
ValueCountFrequency (%)
187
80.3%
( 17
 
7.3%
) 17
 
7.3%
2 2
 
0.9%
' 2
 
0.9%
. 2
 
0.9%
1
 
0.4%
: 1
 
0.4%
5 1
 
0.4%
1 1
 
0.4%
Other values (2) 2
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1643
80.8%
ASCII 390
 
19.2%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
187
47.9%
( 17
 
4.4%
) 17
 
4.4%
G 15
 
3.8%
M 14
 
3.6%
P 12
 
3.1%
O 12
 
3.1%
J 11
 
2.8%
T 10
 
2.6%
S 9
 
2.3%
Other values (34) 86
22.1%
Hangul
ValueCountFrequency (%)
87
 
5.3%
66
 
4.0%
62
 
3.8%
59
 
3.6%
51
 
3.1%
45
 
2.7%
45
 
2.7%
44
 
2.7%
39
 
2.4%
37
 
2.3%
Other values (250) 1108
67.4%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct233
Distinct (%)94.3%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-11T01:13:40.966684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length39
Mean length27.668016
Min length19

Characters and Unicode

Total characters6834
Distinct characters188
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique222 ?
Unique (%)89.9%

Sample

1st row부산광역시 기장군 기장읍 기장해안로 268-32, 힐튼부산
2nd row부산광역시 기장군 기장읍 기장해안로 268-32, 힐튼부산
3rd row부산광역시 기장군 기장읍 기장해안로 268-32, 아난티코브
4th row부산광역시 기장군 기장읍 기장해안로 268-32, 아난티코브
5th row부산광역시 기장군 정관읍 정관중앙로 30, 3층
ValueCountFrequency (%)
부산광역시 247
 
16.6%
기장군 247
 
16.6%
기장읍 82
 
5.5%
정관읍 72
 
4.8%
정관로 42
 
2.8%
정관면 35
 
2.3%
2층 28
 
1.9%
일광면 26
 
1.7%
3층 20
 
1.3%
장안읍 18
 
1.2%
Other values (358) 675
45.2%
2023-12-11T01:13:41.503404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1245
18.2%
369
 
5.4%
346
 
5.1%
292
 
4.3%
267
 
3.9%
254
 
3.7%
252
 
3.7%
248
 
3.6%
247
 
3.6%
233
 
3.4%
Other values (178) 3081
45.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4200
61.5%
Space Separator 1245
 
18.2%
Decimal Number 1138
 
16.7%
Other Punctuation 148
 
2.2%
Dash Punctuation 45
 
0.7%
Open Punctuation 17
 
0.2%
Close Punctuation 17
 
0.2%
Math Symbol 11
 
0.2%
Uppercase Letter 7
 
0.1%
Lowercase Letter 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
369
 
8.8%
346
 
8.2%
292
 
7.0%
267
 
6.4%
254
 
6.0%
252
 
6.0%
248
 
5.9%
247
 
5.9%
233
 
5.5%
201
 
4.8%
Other values (154) 1491
35.5%
Decimal Number
ValueCountFrequency (%)
2 169
14.9%
1 169
14.9%
3 152
13.4%
5 137
12.0%
4 133
11.7%
0 95
8.3%
6 84
7.4%
8 73
6.4%
7 70
6.2%
9 56
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
B 4
57.1%
K 1
 
14.3%
P 1
 
14.3%
D 1
 
14.3%
Lowercase Letter
ValueCountFrequency (%)
a 2
50.0%
z 1
25.0%
l 1
25.0%
Space Separator
ValueCountFrequency (%)
1245
100.0%
Other Punctuation
ValueCountFrequency (%)
, 148
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 45
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4200
61.5%
Common 2621
38.4%
Latin 13
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
369
 
8.8%
346
 
8.2%
292
 
7.0%
267
 
6.4%
254
 
6.0%
252
 
6.0%
248
 
5.9%
247
 
5.9%
233
 
5.5%
201
 
4.8%
Other values (154) 1491
35.5%
Common
ValueCountFrequency (%)
1245
47.5%
2 169
 
6.4%
1 169
 
6.4%
3 152
 
5.8%
, 148
 
5.6%
5 137
 
5.2%
4 133
 
5.1%
0 95
 
3.6%
6 84
 
3.2%
8 73
 
2.8%
Other values (6) 216
 
8.2%
Latin
ValueCountFrequency (%)
B 4
30.8%
2
15.4%
a 2
15.4%
K 1
 
7.7%
z 1
 
7.7%
l 1
 
7.7%
P 1
 
7.7%
D 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4200
61.5%
ASCII 2632
38.5%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1245
47.3%
2 169
 
6.4%
1 169
 
6.4%
3 152
 
5.8%
, 148
 
5.6%
5 137
 
5.2%
4 133
 
5.1%
0 95
 
3.6%
6 84
 
3.2%
8 73
 
2.8%
Other values (13) 227
 
8.6%
Hangul
ValueCountFrequency (%)
369
 
8.8%
346
 
8.2%
292
 
7.0%
267
 
6.4%
254
 
6.0%
252
 
6.0%
248
 
5.9%
247
 
5.9%
233
 
5.5%
201
 
4.8%
Other values (154) 1491
35.5%
Number Forms
ValueCountFrequency (%)
2
100.0%

Interactions

2023-12-11T01:13:38.645804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T01:13:38.829362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:13:38.974776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명주소
01힐튼호텔 아웃도어풀부산광역시 기장군 기장읍 기장해안로 268-32, 힐튼부산
12힐튼호텔 인도어풀부산광역시 기장군 기장읍 기장해안로 268-32, 힐튼부산
23워터하우스부산광역시 기장군 기장읍 기장해안로 268-32, 아난티코브
34오너스클럽 야외수영장부산광역시 기장군 기장읍 기장해안로 268-32, 아난티코브
45망고키즈수영장부산광역시 기장군 정관읍 정관중앙로 30, 3층
56송강유도관부산광역시 기장군 기장읍 차성동로87번길 16
67좌천체육관부산광역시 기장군 장안읍 좌천4길 9-43
78기장골든태권도부산광역시 기장군 기장읍 차성로344번길 30
89문창체육관부산광역시 기장군 기장읍 차성동로 180
910현대체육관부산광역시 기장군 기장읍 기장대로 563
연번업체명주소
237238피이씨(PEC) 바스켓볼부산광역시 기장군 정관읍 정관중앙로 45, 탑스퀘어 12층 1207, 1208호
238239점프파이어 줄넘기클럽부산광역시 기장군 일광면 해빛5로 21-3, 지음프라자 4층
239240케이엠 스포츠 아카데미부산광역시 기장군 기장읍 차성동로178번길 10-1, 반석종합학원 2층
240241아이비스포츠(정관점)부산광역시 기장군 정관읍 정관중앙로 45, 탑스퀘어 6층
241242버킷스포츠아카데미부산광역시 기장군 정관읍 정관중앙로 45, 탑스퀘어 12층 1209호
242243드림사커부산광역시 기장군 일광면 장곡길 46
243244점프윙스 줄넘기클럽부산광역시 기장군 정관읍 정관로 704, 2층
244245(주)기장축구센터부산광역시 기장군 정관읍 산단5로 76-142
245246더 그릿 정관(THE GRIT JEONGGWAN)부산광역시 기장군 정관읍 예림1로 75-1
246247리버스 락부산광역시 기장군 정관읍 정관7로 33-8, 4층