Overview

Dataset statistics

Number of variables4
Number of observations478
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.5 KiB
Average record size in memory33.3 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description도봉구에 위치하고 있는 담배소매인 지정현황 데이터(시설명, 위치, 주소등)
Author서울특별시 도봉구
URLhttps://www.data.go.kr/data/15028110/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:54:27.571745
Analysis finished2023-12-12 04:54:28.132246
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct478
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean239.5
Minimum1
Maximum478
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2023-12-12T13:54:28.225981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile24.85
Q1120.25
median239.5
Q3358.75
95-th percentile454.15
Maximum478
Range477
Interquartile range (IQR)238.5

Descriptive statistics

Standard deviation138.13098
Coefficient of variation (CV)0.57674729
Kurtosis-1.2
Mean239.5
Median Absolute Deviation (MAD)119.5
Skewness0
Sum114481
Variance19080.167
MonotonicityStrictly increasing
2023-12-12T13:54:28.413945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
316 1
 
0.2%
328 1
 
0.2%
327 1
 
0.2%
326 1
 
0.2%
325 1
 
0.2%
324 1
 
0.2%
323 1
 
0.2%
322 1
 
0.2%
321 1
 
0.2%
Other values (468) 468
97.9%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
478 1
0.2%
477 1
0.2%
476 1
0.2%
475 1
0.2%
474 1
0.2%
473 1
0.2%
472 1
0.2%
471 1
0.2%
470 1
0.2%
469 1
0.2%
Distinct459
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2023-12-12T13:54:28.735538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length15
Mean length7.6924686
Min length2

Characters and Unicode

Total characters3677
Distinct characters351
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique444 ?
Unique (%)92.9%

Sample

1st row금강마트(주)
2nd row도봉산쉼터
3rd row세븐일레븐 도봉삼환점
4th row다모아슈퍼
5th row지에스25 쌍문신원점
ValueCountFrequency (%)
씨유 35
 
5.1%
세븐일레븐 30
 
4.4%
gs25 26
 
3.8%
이마트24 12
 
1.8%
지에스25 9
 
1.3%
미니스톱 9
 
1.3%
주식회사 6
 
0.9%
cu 6
 
0.9%
주)코리아세븐 5
 
0.7%
방학점 4
 
0.6%
Other values (480) 542
79.2%
2023-12-12T13:54:29.245933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
226
 
6.1%
206
 
5.6%
91
 
2.5%
2 88
 
2.4%
87
 
2.4%
87
 
2.4%
85
 
2.3%
84
 
2.3%
81
 
2.2%
76
 
2.1%
Other values (341) 2566
69.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3026
82.3%
Space Separator 206
 
5.6%
Decimal Number 194
 
5.3%
Uppercase Letter 151
 
4.1%
Close Punctuation 42
 
1.1%
Open Punctuation 41
 
1.1%
Lowercase Letter 12
 
0.3%
Dash Punctuation 3
 
0.1%
Other Punctuation 1
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
226
 
7.5%
91
 
3.0%
87
 
2.9%
87
 
2.9%
85
 
2.8%
84
 
2.8%
81
 
2.7%
76
 
2.5%
72
 
2.4%
71
 
2.3%
Other values (298) 2066
68.3%
Uppercase Letter
ValueCountFrequency (%)
S 53
35.1%
G 51
33.8%
U 14
 
9.3%
C 11
 
7.3%
J 2
 
1.3%
L 2
 
1.3%
P 2
 
1.3%
A 2
 
1.3%
K 2
 
1.3%
R 2
 
1.3%
Other values (8) 10
 
6.6%
Decimal Number
ValueCountFrequency (%)
2 88
45.4%
5 68
35.1%
4 19
 
9.8%
3 6
 
3.1%
1 5
 
2.6%
6 4
 
2.1%
7 2
 
1.0%
0 1
 
0.5%
8 1
 
0.5%
Lowercase Letter
ValueCountFrequency (%)
c 2
16.7%
a 2
16.7%
e 2
16.7%
s 2
16.7%
i 1
8.3%
g 1
8.3%
u 1
8.3%
l 1
8.3%
Close Punctuation
ValueCountFrequency (%)
) 41
97.6%
] 1
 
2.4%
Open Punctuation
ValueCountFrequency (%)
( 40
97.6%
[ 1
 
2.4%
Space Separator
ValueCountFrequency (%)
206
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3027
82.3%
Common 487
 
13.2%
Latin 163
 
4.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
226
 
7.5%
91
 
3.0%
87
 
2.9%
87
 
2.9%
85
 
2.8%
84
 
2.8%
81
 
2.7%
76
 
2.5%
72
 
2.4%
71
 
2.3%
Other values (299) 2067
68.3%
Latin
ValueCountFrequency (%)
S 53
32.5%
G 51
31.3%
U 14
 
8.6%
C 11
 
6.7%
c 2
 
1.2%
J 2
 
1.2%
a 2
 
1.2%
e 2
 
1.2%
L 2
 
1.2%
P 2
 
1.2%
Other values (16) 22
13.5%
Common
ValueCountFrequency (%)
206
42.3%
2 88
18.1%
5 68
 
14.0%
) 41
 
8.4%
( 40
 
8.2%
4 19
 
3.9%
3 6
 
1.2%
1 5
 
1.0%
6 4
 
0.8%
- 3
 
0.6%
Other values (6) 7
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3026
82.3%
ASCII 650
 
17.7%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
226
 
7.5%
91
 
3.0%
87
 
2.9%
87
 
2.9%
85
 
2.8%
84
 
2.8%
81
 
2.7%
76
 
2.5%
72
 
2.4%
71
 
2.3%
Other values (298) 2066
68.3%
ASCII
ValueCountFrequency (%)
206
31.7%
2 88
13.5%
5 68
 
10.5%
S 53
 
8.2%
G 51
 
7.8%
) 41
 
6.3%
( 40
 
6.2%
4 19
 
2.9%
U 14
 
2.2%
C 11
 
1.7%
Other values (32) 59
 
9.1%
None
ValueCountFrequency (%)
1
100.0%

동구분
Categorical

Distinct4
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
창동
147 
방학동
116 
쌍문동
113 
도봉동
102 

Length

Max length3
Median length3
Mean length2.6924686
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row쌍문동
2nd row도봉동
3rd row도봉동
4th row도봉동
5th row쌍문동

Common Values

ValueCountFrequency (%)
창동 147
30.8%
방학동 116
24.3%
쌍문동 113
23.6%
도봉동 102
21.3%

Length

2023-12-12T13:54:29.452313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:54:29.612412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
창동 147
30.8%
방학동 116
24.3%
쌍문동 113
23.6%
도봉동 102
21.3%

주소
Text

Distinct476
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2023-12-12T13:54:29.972671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length50
Mean length30.10251
Min length20

Characters and Unicode

Total characters14389
Distinct characters218
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique474 ?
Unique (%)99.2%

Sample

1st row서울특별시 도봉구 방학로5길 33. 지하층 1호 (쌍문동. 금강아미움파크타운)
2nd row서울특별시 도봉구 도봉로173길 139 (도봉동)
3rd row서울특별시 도봉구 도봉로180길 6-81. 삼환도봉아파트 상가동 106. 107호 (도봉동)
4th row서울특별시 도봉구 시루봉로27길 9 (도봉동)
5th row서울특별시 도봉구 해등로 190. 1층 102호 (쌍문동. 신원주상복합2차아파트)
ValueCountFrequency (%)
서울특별시 478
 
16.7%
도봉구 478
 
16.7%
창동 145
 
5.1%
1층 134
 
4.7%
방학동 116
 
4.1%
쌍문동 115
 
4.0%
도봉동 102
 
3.6%
도봉로 36
 
1.3%
101호 28
 
1.0%
상가동 27
 
0.9%
Other values (626) 1197
41.9%
2023-12-12T13:54:30.572382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2383
 
16.6%
813
 
5.7%
795
 
5.5%
1 705
 
4.9%
560
 
3.9%
510
 
3.5%
486
 
3.4%
483
 
3.4%
482
 
3.3%
478
 
3.3%
Other values (208) 6694
46.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8268
57.5%
Decimal Number 2385
 
16.6%
Space Separator 2383
 
16.6%
Open Punctuation 478
 
3.3%
Close Punctuation 478
 
3.3%
Other Punctuation 340
 
2.4%
Dash Punctuation 33
 
0.2%
Uppercase Letter 18
 
0.1%
Math Symbol 5
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
813
 
9.8%
795
 
9.6%
560
 
6.8%
510
 
6.2%
486
 
5.9%
483
 
5.8%
482
 
5.8%
478
 
5.8%
478
 
5.8%
464
 
5.6%
Other values (182) 2719
32.9%
Decimal Number
ValueCountFrequency (%)
1 705
29.6%
2 253
 
10.6%
0 246
 
10.3%
3 231
 
9.7%
6 197
 
8.3%
4 189
 
7.9%
5 184
 
7.7%
8 139
 
5.8%
7 132
 
5.5%
9 109
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
A 4
22.2%
S 3
16.7%
E 3
16.7%
R 2
11.1%
B 2
11.1%
T 1
 
5.6%
M 1
 
5.6%
G 1
 
5.6%
L 1
 
5.6%
Space Separator
ValueCountFrequency (%)
2383
100.0%
Open Punctuation
ValueCountFrequency (%)
( 478
100.0%
Close Punctuation
ValueCountFrequency (%)
) 478
100.0%
Other Punctuation
ValueCountFrequency (%)
. 340
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8269
57.5%
Common 6102
42.4%
Latin 18
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
813
 
9.8%
795
 
9.6%
560
 
6.8%
510
 
6.2%
486
 
5.9%
483
 
5.8%
482
 
5.8%
478
 
5.8%
478
 
5.8%
464
 
5.6%
Other values (183) 2720
32.9%
Common
ValueCountFrequency (%)
2383
39.1%
1 705
 
11.6%
( 478
 
7.8%
) 478
 
7.8%
. 340
 
5.6%
2 253
 
4.1%
0 246
 
4.0%
3 231
 
3.8%
6 197
 
3.2%
4 189
 
3.1%
Other values (6) 602
 
9.9%
Latin
ValueCountFrequency (%)
A 4
22.2%
S 3
16.7%
E 3
16.7%
R 2
11.1%
B 2
11.1%
T 1
 
5.6%
M 1
 
5.6%
G 1
 
5.6%
L 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8268
57.5%
ASCII 6120
42.5%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2383
38.9%
1 705
 
11.5%
( 478
 
7.8%
) 478
 
7.8%
. 340
 
5.6%
2 253
 
4.1%
0 246
 
4.0%
3 231
 
3.8%
6 197
 
3.2%
4 189
 
3.1%
Other values (15) 620
 
10.1%
Hangul
ValueCountFrequency (%)
813
 
9.8%
795
 
9.6%
560
 
6.8%
510
 
6.2%
486
 
5.9%
483
 
5.8%
482
 
5.8%
478
 
5.8%
478
 
5.8%
464
 
5.6%
Other values (182) 2719
32.9%
None
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-12T13:54:27.881269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:54:30.689120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번동구분
연번1.0000.077
동구분0.0771.000
2023-12-12T13:54:30.772777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번동구분
연번1.0000.045
동구분0.0451.000

Missing values

2023-12-12T13:54:27.995422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:54:28.087363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시설명동구분주소
01금강마트(주)쌍문동서울특별시 도봉구 방학로5길 33. 지하층 1호 (쌍문동. 금강아미움파크타운)
12도봉산쉼터도봉동서울특별시 도봉구 도봉로173길 139 (도봉동)
23세븐일레븐 도봉삼환점도봉동서울특별시 도봉구 도봉로180길 6-81. 삼환도봉아파트 상가동 106. 107호 (도봉동)
34다모아슈퍼도봉동서울특별시 도봉구 시루봉로27길 9 (도봉동)
45지에스25 쌍문신원점쌍문동서울특별시 도봉구 해등로 190. 1층 102호 (쌍문동. 신원주상복합2차아파트)
56미니스톱 창동상아점창동서울특별시 도봉구 해등로16다길 14. 1층 (창동)
67미니스톱 도봉방학점방학동서울특별시 도봉구 도당로13길 38 (방학동)
78지에스25 도봉신동아점방학동서울특별시 도봉구 방학로 203. 신동아상가 104호 (방학동)
89SM전기쌍문동서울특별시 도봉구 노해로37길 120. 1층 (쌍문동)
910픽미픽미아이스 도봉구청점방학동서울특별시 도봉구 도봉로152길 32. 101호 (방학동)
연번시설명동구분주소
468469우리마트쌍문동서울특별시 도봉구 노해로60길 36 (쌍문동)
469470신일사진관쌍문동서울특별시 도봉구 해등로 168 (쌍문동)
470471돼지슈퍼쌍문동서울특별시 도봉구 삼양로 628 (쌍문동)
471472한일문구사쌍문동서울특별시 도봉구 도봉로115길 11 (쌍문동)
472473개미식당창동서울특별시 도봉구 노해로 281 (창동)
473474영훈슈퍼창동서울특별시 도봉구 덕릉로 222 (창동)
474475럭키슈퍼도봉동서울특별시 도봉구 도봉로169나길 24 (도봉동)
475476동진마트쌍문동서울특별시 도봉구 해등로 370 (쌍문동)
476477한아름슈퍼쌍문동서울특별시 도봉구 삼양로 586 (쌍문동)
477478씨앗슈퍼창동서울특별시 도봉구 덕릉로59가길 53 (창동)