Overview

Dataset statistics

Number of variables3
Number of observations262
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.4%
Total size in memory6.3 KiB
Average record size in memory24.5 B

Variable types

Categorical1
Text2

Dataset

Description부산광역시 기장군 관내에 현재 영업 중으로 등록되어 있는 신고 체육시설업에 대한 데이터로 신고 체육시설업의 상호, 주소 등에 대한 자료입니다.
Author부산광역시 기장군
URLhttps://www.data.go.kr/data/3072011/fileData.do

Alerts

Dataset has 1 (0.4%) duplicate rowsDuplicates

Reproduction

Analysis started2024-04-21 02:47:24.972210
Analysis finished2024-04-21 02:47:25.637320
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct10
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
체육도장업
80 
체력단련장업
42 
당구장업
42 
골프연습장업
36 
가상체험 체육시설업
29 
Other values (5)
33 

Length

Max length10
Median length7
Mean length5.6755725
Min length4

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row수영장업
2nd row수영장업
3rd row수영장업
4th row수영장업
5th row수영장업

Common Values

ValueCountFrequency (%)
체육도장업 80
30.5%
체력단련장업 42
16.0%
당구장업 42
16.0%
골프연습장업 36
13.7%
가상체험 체육시설업 29
 
11.1%
체육교습업 21
 
8.0%
수영장업 7
 
2.7%
썰매장업 2
 
0.8%
종합체육시설업 2
 
0.8%
인공암벽장업 1
 
0.4%

Length

2024-04-21T11:47:25.704437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:47:25.811823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
체육도장업 80
27.5%
체력단련장업 42
14.4%
당구장업 42
14.4%
골프연습장업 36
12.4%
가상체험 29
 
10.0%
체육시설업 29
 
10.0%
체육교습업 21
 
7.2%
수영장업 7
 
2.4%
썰매장업 2
 
0.7%
종합체육시설업 2
 
0.7%

상호
Text

Distinct259
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-04-21T11:47:26.079313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length22
Mean length8.6526718
Min length3

Characters and Unicode

Total characters2267
Distinct characters321
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique256 ?
Unique (%)97.7%

Sample

1st row아난티 앳 부산 코브 아웃도어풀
2nd row아난티 앳 부산 코브 인도어풀
3rd row워터하우스
4th row오너스클럽 야외수영장
5th row망고키즈수영장
ValueCountFrequency (%)
태권도 13
 
2.7%
당구클럽 11
 
2.3%
일광 8
 
1.7%
동아대 8
 
1.7%
골프연습장 7
 
1.4%
스크린 6
 
1.2%
당구장 6
 
1.2%
합기도 6
 
1.2%
태권도장 6
 
1.2%
스크린골프 5
 
1.0%
Other values (346) 407
84.3%
2024-04-21T11:47:26.471107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
221
 
9.7%
95
 
4.2%
68
 
3.0%
65
 
2.9%
62
 
2.7%
54
 
2.4%
48
 
2.1%
45
 
2.0%
44
 
1.9%
43
 
1.9%
Other values (311) 1522
67.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1785
78.7%
Space Separator 221
 
9.7%
Uppercase Letter 166
 
7.3%
Lowercase Letter 34
 
1.5%
Open Punctuation 23
 
1.0%
Close Punctuation 23
 
1.0%
Decimal Number 9
 
0.4%
Other Punctuation 6
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
95
 
5.3%
68
 
3.8%
65
 
3.6%
62
 
3.5%
54
 
3.0%
48
 
2.7%
45
 
2.5%
44
 
2.5%
43
 
2.4%
41
 
2.3%
Other values (257) 1220
68.3%
Uppercase Letter
ValueCountFrequency (%)
G 18
 
10.8%
M 15
 
9.0%
P 14
 
8.4%
J 13
 
7.8%
O 12
 
7.2%
S 11
 
6.6%
T 10
 
6.0%
Y 8
 
4.8%
E 8
 
4.8%
B 8
 
4.8%
Other values (14) 49
29.5%
Lowercase Letter
ValueCountFrequency (%)
i 5
14.7%
r 4
11.8%
o 3
8.8%
p 3
8.8%
t 2
 
5.9%
s 2
 
5.9%
u 2
 
5.9%
n 2
 
5.9%
c 2
 
5.9%
a 2
 
5.9%
Other values (6) 7
20.6%
Decimal Number
ValueCountFrequency (%)
2 4
44.4%
5 1
 
11.1%
4 1
 
11.1%
7 1
 
11.1%
3 1
 
11.1%
1 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
' 2
33.3%
. 1
16.7%
1
16.7%
: 1
16.7%
& 1
16.7%
Space Separator
ValueCountFrequency (%)
221
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1785
78.7%
Common 282
 
12.4%
Latin 200
 
8.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
95
 
5.3%
68
 
3.8%
65
 
3.6%
62
 
3.5%
54
 
3.0%
48
 
2.7%
45
 
2.5%
44
 
2.5%
43
 
2.4%
41
 
2.3%
Other values (257) 1220
68.3%
Latin
ValueCountFrequency (%)
G 18
 
9.0%
M 15
 
7.5%
P 14
 
7.0%
J 13
 
6.5%
O 12
 
6.0%
S 11
 
5.5%
T 10
 
5.0%
Y 8
 
4.0%
E 8
 
4.0%
B 8
 
4.0%
Other values (30) 83
41.5%
Common
ValueCountFrequency (%)
221
78.4%
( 23
 
8.2%
) 23
 
8.2%
2 4
 
1.4%
' 2
 
0.7%
5 1
 
0.4%
. 1
 
0.4%
1
 
0.4%
4 1
 
0.4%
: 1
 
0.4%
Other values (4) 4
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1785
78.7%
ASCII 481
 
21.2%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
221
45.9%
( 23
 
4.8%
) 23
 
4.8%
G 18
 
3.7%
M 15
 
3.1%
P 14
 
2.9%
J 13
 
2.7%
O 12
 
2.5%
S 11
 
2.3%
T 10
 
2.1%
Other values (43) 121
25.2%
Hangul
ValueCountFrequency (%)
95
 
5.3%
68
 
3.8%
65
 
3.6%
62
 
3.5%
54
 
3.0%
48
 
2.7%
45
 
2.5%
44
 
2.5%
43
 
2.4%
41
 
2.3%
Other values (257) 1220
68.3%
None
ValueCountFrequency (%)
1
100.0%
Distinct249
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2024-04-21T11:47:26.720819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length38
Mean length27.770992
Min length19

Characters and Unicode

Total characters7276
Distinct characters193
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique238 ?
Unique (%)90.8%

Sample

1st row부산광역시 기장군 기장읍 기장해안로 268-32, 힐튼부산
2nd row부산광역시 기장군 기장읍 기장해안로 268-32, 힐튼부산
3rd row부산광역시 기장군 기장읍 기장해안로 268-32, 아난티코브
4th row부산광역시 기장군 기장읍 기장해안로 268-32, 아난티코브
5th row부산광역시 기장군 정관읍 정관중앙로 30, 3층
ValueCountFrequency (%)
부산광역시 262
 
16.5%
기장군 262
 
16.5%
기장읍 86
 
5.4%
정관읍 83
 
5.2%
정관로 42
 
2.6%
정관면 34
 
2.1%
2층 31
 
2.0%
3층 26
 
1.6%
일광면 23
 
1.4%
장안읍 17
 
1.1%
Other values (377) 721
45.4%
2024-04-21T11:47:27.107906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1325
18.2%
389
 
5.3%
367
 
5.0%
309
 
4.2%
286
 
3.9%
270
 
3.7%
267
 
3.7%
263
 
3.6%
262
 
3.6%
244
 
3.4%
Other values (183) 3294
45.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4465
61.4%
Space Separator 1325
 
18.2%
Decimal Number 1211
 
16.6%
Other Punctuation 164
 
2.3%
Dash Punctuation 51
 
0.7%
Close Punctuation 18
 
0.2%
Open Punctuation 18
 
0.2%
Math Symbol 11
 
0.2%
Uppercase Letter 7
 
0.1%
Lowercase Letter 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
389
 
8.7%
367
 
8.2%
309
 
6.9%
286
 
6.4%
270
 
6.0%
267
 
6.0%
263
 
5.9%
262
 
5.9%
244
 
5.5%
214
 
4.8%
Other values (159) 1594
35.7%
Decimal Number
ValueCountFrequency (%)
1 189
15.6%
2 182
15.0%
3 163
13.5%
5 143
11.8%
4 138
11.4%
0 98
8.1%
6 92
7.6%
8 75
 
6.2%
7 75
 
6.2%
9 56
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
B 4
57.1%
P 1
 
14.3%
K 1
 
14.3%
D 1
 
14.3%
Lowercase Letter
ValueCountFrequency (%)
a 2
50.0%
z 1
25.0%
l 1
25.0%
Space Separator
ValueCountFrequency (%)
1325
100.0%
Other Punctuation
ValueCountFrequency (%)
, 164
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 51
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4465
61.4%
Common 2798
38.5%
Latin 13
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
389
 
8.7%
367
 
8.2%
309
 
6.9%
286
 
6.4%
270
 
6.0%
267
 
6.0%
263
 
5.9%
262
 
5.9%
244
 
5.5%
214
 
4.8%
Other values (159) 1594
35.7%
Common
ValueCountFrequency (%)
1325
47.4%
1 189
 
6.8%
2 182
 
6.5%
, 164
 
5.9%
3 163
 
5.8%
5 143
 
5.1%
4 138
 
4.9%
0 98
 
3.5%
6 92
 
3.3%
8 75
 
2.7%
Other values (6) 229
 
8.2%
Latin
ValueCountFrequency (%)
B 4
30.8%
a 2
15.4%
2
15.4%
z 1
 
7.7%
l 1
 
7.7%
P 1
 
7.7%
K 1
 
7.7%
D 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4465
61.4%
ASCII 2809
38.6%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1325
47.2%
1 189
 
6.7%
2 182
 
6.5%
, 164
 
5.8%
3 163
 
5.8%
5 143
 
5.1%
4 138
 
4.9%
0 98
 
3.5%
6 92
 
3.3%
8 75
 
2.7%
Other values (13) 240
 
8.5%
Hangul
ValueCountFrequency (%)
389
 
8.7%
367
 
8.2%
309
 
6.9%
286
 
6.4%
270
 
6.0%
267
 
6.0%
263
 
5.9%
262
 
5.9%
244
 
5.5%
214
 
4.8%
Other values (159) 1594
35.7%
Number Forms
ValueCountFrequency (%)
2
100.0%

Missing values

2024-04-21T11:47:25.537372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T11:47:25.603132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종상호도로명주소
0수영장업아난티 앳 부산 코브 아웃도어풀부산광역시 기장군 기장읍 기장해안로 268-32, 힐튼부산
1수영장업아난티 앳 부산 코브 인도어풀부산광역시 기장군 기장읍 기장해안로 268-32, 힐튼부산
2수영장업워터하우스부산광역시 기장군 기장읍 기장해안로 268-32, 아난티코브
3수영장업오너스클럽 야외수영장부산광역시 기장군 기장읍 기장해안로 268-32, 아난티코브
4수영장업망고키즈수영장부산광역시 기장군 정관읍 정관중앙로 30, 3층
5수영장업A Spirit of Journey(에이 스피릿 오브 저니)부산광역시 기장군 기장읍 기장해안로 267-7, 아난티 앳 부산
6수영장업Spring Palace(스프링 팰리스)부산광역시 기장군 기장읍 기장해안로 267-17, 엘피크리스탈(메인동)
7체육도장업송강유도관부산광역시 기장군 기장읍 차성동로87번길 16
8체육도장업기장골든태권도부산광역시 기장군 기장읍 차성로344번길 30
9체육도장업문창체육관부산광역시 기장군 기장읍 차성동로 180
업종상호도로명주소
252체육교습업드림사커부산광역시 기장군 일광면 장곡길 46
253체육교습업점프윙스 줄넘기클럽부산광역시 기장군 정관읍 정관로 704, 2층
254체육교습업(주)기장축구센터부산광역시 기장군 정관읍 산단5로 76-142
255체육교습업더 그릿 정관(THE GRIT JEONGGWAN)부산광역시 기장군 정관읍 예림1로 75-1
256체육교습업주식회사 우리컴퍼니 JI 인라인스쿨부산광역시 기장군 정관읍 산단3로 74-6
257체육교습업오름동행협동조합(정관점)부산광역시 기장군 정관읍 정관6로 46, 3층
258체육교습업고에프씨 아이파크 축구교실(GOFC IPARK 축구교실)부산광역시 기장군 일광읍 해빛4로 35, 5층
259체육교습업이종원 풋볼아카데미부산광역시 기장군 정관읍 정관중앙로 45, 4층
260체육교습업그레이스스포츠센터부산광역시 기장군 기장읍 차성서로101번길 12-5
261인공암벽장업리버스 락부산광역시 기장군 정관읍 정관7로 33-8, 4층

Duplicate rows

Most frequently occurring

업종상호도로명주소# duplicates
0골프연습장업J골프부산광역시 기장군 정관면 산단4로 12