Overview

Dataset statistics

Number of variables: 5
Number of observations: 105
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.3 KiB
Average record size in memory: 42.3 B

Variable types

Numeric: 1
Categorical: 1
Text: 2
Boolean: 1

Dataset

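Description: Data on the installation status of smoking rooms in public-use facilities in Yeonje-gu. It provides up-to-date information on whether each facility has a smoking room and on the facility's address.
Author: Yeonje-gu, Busan Metropolitan City (부산광역시 연제구)
URL: https://www.data.go.kr/data/15029124/fileData.do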

Alerts

흡연실여부 has constant value "True" (Constant)
연번 is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
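
The Constant and Unique alerts can be illustrated with a small sketch that only counts distinct values per column. This is a simplified stand-in, not ydata-profiling's own implementation; the function name `simple_alerts` is hypothetical, and the five rows are copied from the Sample section of this report.

```python
# Rows copied from the Sample section (rows 1-3 and 96-97 of the dataset).
rows = [
    (1, "공공기관청사", "부산광역시시청", "연제구 중앙대로 1001(연산동)", "Y"),
    (2, "공공기관청사", "부산광역시의회", "연제구 중앙대로 1001(연산동)", "Y"),
    (3, "공공기관청사", "연제구청", "연제구 연제로 2(연산동)", "Y"),
    (96, "게임제공업소", "코델리아PC방 연산본점", "연제구 고분로236번길 10, 2층 (연산동)", "Y"),
    (97, "게임제공업소", "허니PC방", "연제구 중앙대로1120번길 13-9 (연산동)", "Y"),
]
columns = ["연번", "구분", "시설명", "주소", "흡연실여부"]


def simple_alerts(columns, rows):
    """Return {column: [alert, ...]} based on distinct-value counts only (hypothetical helper)."""
    alerts = {}
    for idx, name in enumerate(columns):
        values = [row[idx] for row in rows]
        distinct = len(set(values))
        found = []
        if distinct == 1:
            found.append("Constant")   # every row holds the same value
        if distinct == len(values):
            found.append("Unique")     # no value repeats
        alerts[name] = found
    return alerts


alerts = simple_alerts(columns, rows)
for name, found in alerts.items():
    print(name, found)
```

On this five-row excerpt 시설명 also looks unique, whereas the full dataset has 103 distinct values in 105 rows, so the alert is only meaningful when computed over all observations.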

Reproduction

Analysis started: 2023-12-12 05:13:26.489309
Analysis finished: 2023-12-12 05:13:27.025138
Duration: 0.54 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
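
As a quick consistency check, the reported duration can be recomputed from the two timestamps. This sketch assumes both timestamps are in the same time zone.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 05:13:26.489309", FMT)
finished = datetime.strptime("2023-12-12 05:13:27.025138", FMT)

# Elapsed wall-clock time between the start and end of the analysis.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```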

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 105
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 53
Minimum: 1
Maximum: 105
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.2
Q1: 27
Median: 53
Q3: 79
95-th percentile: 99.8
Maximum: 105
Range: 104
Interquartile range (IQR): 52

Descriptive statistics

Standard deviation: 30.454885
Coefficient of variation (CV): 0.57462047
Kurtosis: -1.2
Mean: 53
Median Absolute Deviation (MAD): 26
Skewness: 0
Sum: 5565
Variance: 927.5
Monotonicity: Strictly increasing
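
Because 연번 is unique, strictly increasing, and spans 1 to 105, its statistics can be reproduced from the integer sequence alone. The sketch below uses only the Python standard library and assumes the column is exactly 1, 2, ..., 105. It also assumes sample variance (n - 1 denominator) and linearly interpolated percentiles, which is what the reported figures are consistent with.

```python
import math
import statistics

values = list(range(1, 106))  # assumption: 연번 is exactly 1..105 with no gaps

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

# method="inclusive" interpolates linearly between order statistics.
quartiles = statistics.quantiles(values, n=4, method="inclusive")   # Q1, median, Q3
ventiles = statistics.quantiles(values, n=20, method="inclusive")   # 5%, 10%, ..., 95%
p5, p95 = ventiles[0], ventiles[-1]

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
print(quartiles, p5, p95)
```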
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 1.0%
80 | 1 | 1.0%
78 | 1 | 1.0%
77 | 1 | 1.0%
76 | 1 | 1.0%
75 | 1 | 1.0%
74 | 1 | 1.0%
73 | 1 | 1.0%
72 | 1 | 1.0%
71 | 1 | 1.0%
Other values (95) | 95 | 90.5%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.0%
2 | 1 | 1.0%
3 | 1 | 1.0%
4 | 1 | 1.0%
5 | 1 | 1.0%
6 | 1 | 1.0%
7 | 1 | 1.0%
8 | 1 | 1.0%
9 | 1 | 1.0%
10 | 1 | 1.0%

Maximum 10 values
Value | Count | Frequency (%)
105 | 1 | 1.0%
104 | 1 | 1.0%
103 | 1 | 1.0%
102 | 1 | 1.0%
101 | 1 | 1.0%
100 | 1 | 1.0%
99 | 1 | 1.0%
98 | 1 | 1.0%
97 | 1 | 1.0%
96 | 1 | 1.0%

구분
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 4.8%
Missing: 0
Missing (%): 0.0%
Memory size: 972.0 B

Length

Max length: 6
Median length: 6
Mean length: 5.0761905
Min length: 4

Unique

Unique: 1
Unique (%): 1.0%

Sample

1st row: 공공기관청사
2nd row: 공공기관청사
3rd row: 공공기관청사
4th row: 공공기관청사
5th row: 공공기관청사

Common Values

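Value | Count | Frequency (%)
당구장업 (billiard hall) | 46 | 43.8%
게임제공업소 (game-service establishment) | 44 | 41.9%
공공기관청사 (public institution building) | 11 | 10.5%
대형건축물 (large building) | 3 | 2.9%
종합병원 (general hospital) | 1 | 1.0%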
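
The frequency percentages, length statistics, and the Unique count for 구분 all follow from the five category counts. A minimal sketch, assuming the labels are stored as precomposed Hangul so that `len()` counts one character per syllable:

```python
# Category counts copied from the Common Values table.
counts = {
    "당구장업": 46,
    "게임제공업소": 44,
    "공공기관청사": 11,
    "대형건축물": 3,
    "종합병원": 1,
}

total = sum(counts.values())  # number of observations
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

# Repeat each label's character length by its count to recover the length statistics.
lengths = sorted(len(name) for name, n in counts.items() for _ in range(n))
mean_length = sum(lengths) / total
median_length = lengths[total // 2]   # total is odd, so the middle element is the median

# "Unique" counts the categories that occur exactly once.
unique_count = sum(1 for n in counts.values() if n == 1)

print(total, shares)
print(round(mean_length, 7), median_length, min(lengths), max(lengths), unique_count)
```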

Length

Histogram of lengths of the category

Common Values (Plot)


시설명
Text

Distinct: 103
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 972.0 B

Length

Max length: 18
Median length: 15
Mean length: 6.9619048
Min length: 1

Characters and Unicode

Total characters: 731
Distinct characters: 191
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
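
To illustrate how such character properties are derived, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. The helper `category_counts` and the `CATEGORY_NAMES` mapping are illustrative names, and the example string is one facility name taken from the Sample section, not the whole column.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this example.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
}


def category_counts(text):
    """Count the characters of `text` by Unicode general category."""
    counts = Counter()
    for ch in text:
        code = unicodedata.category(ch)              # e.g. "Lo" for a Hangul syllable
        counts[CATEGORY_NAMES.get(code, code)] += 1  # fall back to the raw code
    return counts


counts = category_counts("코델리아PC방 연산본점")
print(counts.most_common())
```

Hangul syllables fall under Other Letter because the script has no upper or lower case, which is why that category dominates the tables for this variable.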

Unique

Unique: 101
Unique (%): 96.2%

Sample

1st row: 부산광역시시청
2nd row: 부산광역시의회
3rd row: 연제구청
4th row: 연제구보건소
5th row: 연제구의회

Value | Count | Frequency (%)
pc | 11 | 7.9%
pc방 | 5 | 3.6%
cafe | 4 | 2.9%
바닐라pc방 | 2 | 1.4%
ares | 2 | 1.4%
ok당구장 | 2 | 1.4%
부산연산본점 | 1 | 0.7%
엔터존 | 1 | 0.7%
부산광역시시청 | 1 | 0.7%
아일랜드pc | 1 | 0.7%
Other values (110) | 110 | 78.6%

Most occurring characters

Value | Count | Frequency (%)
C | 48 | 6.6%
 | 48 | 6.6%
 | 45 | 6.2%
P | 44 | 6.0%
(space) | 35 | 4.8%
 | 28 | 3.8%
 | 26 | 3.6%
 | 25 | 3.4%
 | 22 | 3.0%
 | 17 | 2.3%
Other values (181) | 393 | 53.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 526 | 72.0%
Uppercase Letter | 143 | 19.6%
Space Separator | 35 | 4.8%
Lowercase Letter | 16 | 2.2%
Decimal Number | 5 | 0.7%
Dash Punctuation | 2 | 0.3%
Other Punctuation | 2 | 0.3%
Open Punctuation | 1 | 0.1%
Close Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 48 | 9.1%
 | 45 | 8.6%
 | 28 | 5.3%
 | 26 | 4.9%
 | 25 | 4.8%
 | 22 | 4.2%
 | 17 | 3.2%
 | 13 | 2.5%
 | 12 | 2.3%
 | 12 | 2.3%
Other values (144) | 278 | 52.9%

Uppercase Letter
Value | Count | Frequency (%)
C | 48 | 33.6%
P | 44 | 30.8%
K | 8 | 5.6%
E | 7 | 4.9%
A | 6 | 4.2%
O | 4 | 2.8%
S | 3 | 2.1%
N | 3 | 2.1%
I | 3 | 2.1%
V | 2 | 1.4%
Other values (9) | 15 | 10.5%

Lowercase Letter
Value | Count | Frequency (%)
e | 4 | 25.0%
a | 3 | 18.8%
f | 2 | 12.5%
c | 2 | 12.5%
s | 1 | 6.2%
r | 1 | 6.2%
h | 1 | 6.2%
g | 1 | 6.2%
n | 1 | 6.2%

Decimal Number
Value | Count | Frequency (%)
3 | 2 | 40.0%
0 | 1 | 20.0%
2 | 1 | 20.0%
1 | 1 | 20.0%

Space Separator
Value | Count | Frequency (%)
(space) | 35 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 2 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 526 | 72.0%
Latin | 159 | 21.8%
Common | 46 | 6.3%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 48 | 9.1%
 | 45 | 8.6%
 | 28 | 5.3%
 | 26 | 4.9%
 | 25 | 4.8%
 | 22 | 4.2%
 | 17 | 3.2%
 | 13 | 2.5%
 | 12 | 2.3%
 | 12 | 2.3%
Other values (144) | 278 | 52.9%

Latin
Value | Count | Frequency (%)
C | 48 | 30.2%
P | 44 | 27.7%
K | 8 | 5.0%
E | 7 | 4.4%
A | 6 | 3.8%
e | 4 | 2.5%
O | 4 | 2.5%
S | 3 | 1.9%
N | 3 | 1.9%
I | 3 | 1.9%
Other values (18) | 29 | 18.2%

Common
Value | Count | Frequency (%)
(space) | 35 | 76.1%
- | 2 | 4.3%
. | 2 | 4.3%
3 | 2 | 4.3%
0 | 1 | 2.2%
( | 1 | 2.2%
) | 1 | 2.2%
2 | 1 | 2.2%
1 | 1 | 2.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 526 | 72.0%
ASCII | 205 | 28.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
C | 48 | 23.4%
P | 44 | 21.5%
(space) | 35 | 17.1%
K | 8 | 3.9%
E | 7 | 3.4%
A | 6 | 2.9%
e | 4 | 2.0%
O | 4 | 2.0%
S | 3 | 1.5%
N | 3 | 1.5%
Other values (27) | 43 | 21.0%

Hangul
Value | Count | Frequency (%)
 | 48 | 9.1%
 | 45 | 8.6%
 | 28 | 5.3%
 | 26 | 4.9%
 | 25 | 4.8%
 | 22 | 4.2%
 | 17 | 3.2%
 | 13 | 2.5%
 | 12 | 2.3%
 | 12 | 2.3%
Other values (144) | 278 | 52.9%

주소
Text

Distinct: 95
Distinct (%): 90.5%
Missing: 0
Missing (%): 0.0%
Memory size: 972.0 B

Length

Max length: 43
Median length: 26
Mean length: 20.019048
Min length: 10

Characters and Unicode

Total characters: 2102
Distinct characters: 82
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
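
Besides general categories, the report groups characters by Unicode block. Python's standard library does not expose block names, so the sketch below classifies characters by code point range instead. The helper `block_of` is hypothetical and covers only the two blocks this report lists (Basic Latin, shown as ASCII, and Hangul Syllables, shown as Hangul). The example address is the first row of the Sample for this variable.

```python
from collections import Counter


def block_of(ch):
    """Classify a character into one of the two blocks listed in the report.

    Only the ranges needed here are handled (an assumption of this sketch):
    Basic Latin (ASCII) is U+0000..U+007F and Hangul Syllables is U+AC00..U+D7AF.
    """
    cp = ord(ch)
    if cp <= 0x7F:
        return "ASCII"
    if 0xAC00 <= cp <= 0xD7AF:
        return "Hangul"
    return "Other"


address = "연제구 중앙대로 1001(연산동)"
blocks = Counter(block_of(ch) for ch in address)
print(blocks)
```

Digits, spaces, and parentheses all land in ASCII, which is consistent with ASCII taking up nearly half of the characters in 주소.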

Unique

Unique: 86
Unique (%): 81.9%

Sample

1st row: 연제구 중앙대로 1001(연산동)
2nd row: 연제구 중앙대로 1001(연산동)
3rd row: 연제구 연제로 2(연산동)
4th row: 연제구 연제로 2(연산동)
5th row: 연제구 연제로 2(연산동)

Value | Count | Frequency (%)
연제구 | 105 | 23.5%
연산동 | 74 | 16.6%
거제동 | 18 | 4.0%
과정로 | 17 | 3.8%
중앙대로 | 7 | 1.6%
거제천로 | 7 | 1.6%
고분로13번길 | 6 | 1.3%
반송로 | 6 | 1.3%
2층 | 5 | 1.1%
3층 | 5 | 1.1%
Other values (136) | 196 | 43.9%

Most occurring characters

Value | Count | Frequency (%)
(space) | 341 | 16.2%
 | 197 | 9.4%
 | 138 | 6.6%
1 | 121 | 5.8%
 | 105 | 5.0%
 | 105 | 5.0%
 | 103 | 4.9%
( | 99 | 4.7%
) | 99 | 4.7%
 | 81 | 3.9%
Other values (72) | 713 | 33.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1116 | 53.1%
Decimal Number | 398 | 18.9%
Space Separator | 341 | 16.2%
Open Punctuation | 99 | 4.7%
Close Punctuation | 99 | 4.7%
Other Punctuation | 30 | 1.4%
Dash Punctuation | 18 | 0.9%
Uppercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 197 | 17.7%
 | 138 | 12.4%
 | 105 | 9.4%
 | 105 | 9.4%
 | 103 | 9.2%
 | 81 | 7.3%
 | 31 | 2.8%
 | 31 | 2.8%
 | 30 | 2.7%
 | 29 | 2.6%
Other values (56) | 266 | 23.8%

Decimal Number
Value | Count | Frequency (%)
1 | 121 | 30.4%
2 | 56 | 14.1%
3 | 56 | 14.1%
4 | 32 | 8.0%
0 | 32 | 8.0%
9 | 27 | 6.8%
5 | 23 | 5.8%
8 | 22 | 5.5%
6 | 16 | 4.0%
7 | 13 | 3.3%

Space Separator
Value | Count | Frequency (%)
(space) | 341 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 99 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 99 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 30 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 18 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1116 | 53.1%
Common | 985 | 46.9%
Latin | 1 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 197 | 17.7%
 | 138 | 12.4%
 | 105 | 9.4%
 | 105 | 9.4%
 | 103 | 9.2%
 | 81 | 7.3%
 | 31 | 2.8%
 | 31 | 2.8%
 | 30 | 2.7%
 | 29 | 2.6%
Other values (56) | 266 | 23.8%

Common
Value | Count | Frequency (%)
(space) | 341 | 34.6%
1 | 121 | 12.3%
( | 99 | 10.1%
) | 99 | 10.1%
2 | 56 | 5.7%
3 | 56 | 5.7%
4 | 32 | 3.2%
0 | 32 | 3.2%
, | 30 | 3.0%
9 | 27 | 2.7%
Other values (5) | 92 | 9.3%

Latin
Value | Count | Frequency (%)
B | 1 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1116 | 53.1%
ASCII | 986 | 46.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 341 | 34.6%
1 | 121 | 12.3%
( | 99 | 10.0%
) | 99 | 10.0%
2 | 56 | 5.7%
3 | 56 | 5.7%
4 | 32 | 3.2%
0 | 32 | 3.2%
, | 30 | 3.0%
9 | 27 | 2.7%
Other values (6) | 93 | 9.4%

Hangul
Value | Count | Frequency (%)
 | 197 | 17.7%
 | 138 | 12.4%
 | 105 | 9.4%
 | 105 | 9.4%
 | 103 | 9.2%
 | 81 | 7.3%
 | 31 | 2.8%
 | 31 | 2.8%
 | 30 | 2.7%
 | 29 | 2.6%
Other values (56) | 266 | 23.8%

흡연실여부
Boolean

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 237.0 B

Value | Count | Frequency (%)
True | 105 | 100.0%

Interactions


Correlations

 | 연번 | 구분 | 주소
연번 | 1.000 | 0.961 | 0.952
구분 | 0.961 | 1.000 | 0.990
주소 | 0.952 | 0.990 | 1.000

 | 연번 | 구분
연번 | 1.000 | 0.708
구분 | 0.708 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 연번 | 구분 | 시설명 | 주소 | 흡연실여부
0 | 1 | 공공기관청사 | 부산광역시시청 | 연제구 중앙대로 1001(연산동) | Y
1 | 2 | 공공기관청사 | 부산광역시의회 | 연제구 중앙대로 1001(연산동) | Y
2 | 3 | 공공기관청사 | 연제구청 | 연제구 연제로 2(연산동) | Y
3 | 4 | 공공기관청사 | 연제구보건소 | 연제구 연제로 2(연산동) | Y
4 | 5 | 공공기관청사 | 연제구의회 | 연제구 연제로 2(연산동) | Y
5 | 6 | 공공기관청사 | 부산지방검찰청 | 연제구 법원로 15 | Y
6 | 7 | 공공기관청사 | 부산고등검찰청 | 연제구 법원로 15 | Y
7 | 8 | 공공기관청사 | 부산지방법원 | 연제구 법원로 31(거제동 1500) | Y
8 | 9 | 공공기관청사 | 부산고등법원 | 연제구 법원로 31(거제동 1500) | Y
9 | 10 | 공공기관청사 | 부산지방경찰청 | 연제구 중앙대로 999 | Y

Last rows

Index | 연번 | 구분 | 시설명 | 주소 | 흡연실여부
95 | 96 | 게임제공업소 | 코델리아PC방 연산본점 | 연제구 고분로236번길 10, 2층 (연산동) | Y
96 | 97 | 게임제공업소 | 허니PC방 | 연제구 중앙대로1120번길 13-9 (연산동) | Y
97 | 98 | 게임제공업소 | 코파PC방 | 연제구 중앙대로1124번길 15, 1층 304호 (연산동, 연산동 에스케이뷰) | Y
98 | 99 | 게임제공업소 | 바닐라PC방 | 연제구 중앙대로1050번길 28 (연산동) | Y
99 | 100 | 게임제공업소 | 런던PC방 | 연제구 금련로 9 (연산동) | Y
100 | 101 | 게임제공업소 | 옥스PC방 연산점 | 연제구 반송로 19, 4층 (연산동) | Y
101 | 102 | 게임제공업소 | 피에스타PC방 부산연산본점 | 연제구 과정로 137, 3층 312호 (연산동) | Y
102 | 103 | 게임제공업소 | 스틸시리즈PC방 부산본점 | 연제구 과정로 192, 3층 (연산동) | Y
103 | 104 | 게임제공업소 | 바닐라PC방 | 연제구 월드컵대로91번길 14, 1층 (연산동) | Y
104 | 105 | 게임제공업소 | 샹떼PC방 부산연산점 | 연제구 쌍미천로 11, 2층 (연산동) | Y