Overview

Dataset statistics

Number of variables4
Number of observations1201
Missing cells0
Missing cells (%)0.0%
Duplicate rows3
Duplicate rows (%)0.2%
Total size in memory37.7 KiB
Average record size in memory32.1 B

Variable types

Text3
Categorical1

Dataset

Description송파구 담배소매인 지정현황으로 업소명, 업소지번주소, 업소도로명주소, 데이터 기준일자 등에 정보를 제공합니다.
Author서울특별시 송파구
URLhttps://www.data.go.kr/data/15005407/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 3 (0.2%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 13:24:13.196095
Analysis finished2023-12-12 13:24:13.907535
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1143
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
2023-12-12T22:24:14.138002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length22
Mean length8.6369692
Min length1

Characters and Unicode

Total characters10373
Distinct characters506
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1118 ?
Unique (%)93.1%

Sample

1st row미니스톱 방이2점
2nd row지에스25(GS25) 잠실제일점
3rd row(주)코리아세븐 오금대림점
4th row대한마트
5th row(주)코리아세븐 문정문현점
ValueCountFrequency (%)
gs25 102
 
5.6%
씨유 69
 
3.8%
주)코리아세븐 61
 
3.4%
세븐일레븐 54
 
3.0%
이마트24 36
 
2.0%
cu 26
 
1.4%
미니스톱 22
 
1.2%
지에스25 18
 
1.0%
잠실점 10
 
0.6%
주식회사 7
 
0.4%
Other values (1194) 1403
77.6%
2023-12-12T22:24:14.670217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
648
 
6.2%
611
 
5.9%
2 254
 
2.4%
237
 
2.3%
) 223
 
2.1%
( 222
 
2.1%
221
 
2.1%
5 198
 
1.9%
191
 
1.8%
188
 
1.8%
Other values (496) 7380
71.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8125
78.3%
Space Separator 648
 
6.2%
Decimal Number 592
 
5.7%
Uppercase Letter 506
 
4.9%
Close Punctuation 224
 
2.2%
Open Punctuation 223
 
2.1%
Lowercase Letter 39
 
0.4%
Dash Punctuation 8
 
0.1%
Other Punctuation 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
611
 
7.5%
237
 
2.9%
221
 
2.7%
191
 
2.4%
188
 
2.3%
188
 
2.3%
178
 
2.2%
175
 
2.2%
150
 
1.8%
139
 
1.7%
Other values (436) 5847
72.0%
Uppercase Letter
ValueCountFrequency (%)
S 167
33.0%
G 150
29.6%
C 62
 
12.3%
U 57
 
11.3%
M 10
 
2.0%
B 8
 
1.6%
A 8
 
1.6%
I 5
 
1.0%
E 5
 
1.0%
L 4
 
0.8%
Other values (14) 30
 
5.9%
Lowercase Letter
ValueCountFrequency (%)
a 5
12.8%
e 5
12.8%
s 5
12.8%
o 5
12.8%
r 4
10.3%
t 4
10.3%
c 2
 
5.1%
y 2
 
5.1%
k 2
 
5.1%
f 1
 
2.6%
Other values (4) 4
10.3%
Decimal Number
ValueCountFrequency (%)
2 254
42.9%
5 198
33.4%
4 51
 
8.6%
1 24
 
4.1%
9 24
 
4.1%
3 13
 
2.2%
8 11
 
1.9%
0 8
 
1.4%
6 5
 
0.8%
7 4
 
0.7%
Other Punctuation
ValueCountFrequency (%)
· 2
25.0%
. 2
25.0%
& 1
12.5%
/ 1
12.5%
? 1
12.5%
' 1
12.5%
Close Punctuation
ValueCountFrequency (%)
) 223
99.6%
] 1
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 222
99.6%
[ 1
 
0.4%
Space Separator
ValueCountFrequency (%)
648
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8125
78.3%
Common 1703
 
16.4%
Latin 545
 
5.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
611
 
7.5%
237
 
2.9%
221
 
2.7%
191
 
2.4%
188
 
2.3%
188
 
2.3%
178
 
2.2%
175
 
2.2%
150
 
1.8%
139
 
1.7%
Other values (436) 5847
72.0%
Latin
ValueCountFrequency (%)
S 167
30.6%
G 150
27.5%
C 62
 
11.4%
U 57
 
10.5%
M 10
 
1.8%
B 8
 
1.5%
A 8
 
1.5%
I 5
 
0.9%
a 5
 
0.9%
e 5
 
0.9%
Other values (28) 68
12.5%
Common
ValueCountFrequency (%)
648
38.1%
2 254
 
14.9%
) 223
 
13.1%
( 222
 
13.0%
5 198
 
11.6%
4 51
 
3.0%
1 24
 
1.4%
9 24
 
1.4%
3 13
 
0.8%
8 11
 
0.6%
Other values (12) 35
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8124
78.3%
ASCII 2246
 
21.7%
None 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
648
28.9%
2 254
 
11.3%
) 223
 
9.9%
( 222
 
9.9%
5 198
 
8.8%
S 167
 
7.4%
G 150
 
6.7%
C 62
 
2.8%
U 57
 
2.5%
4 51
 
2.3%
Other values (49) 214
 
9.5%
Hangul
ValueCountFrequency (%)
611
 
7.5%
237
 
2.9%
221
 
2.7%
191
 
2.4%
188
 
2.3%
188
 
2.3%
178
 
2.2%
175
 
2.2%
150
 
1.8%
139
 
1.7%
Other values (435) 5846
72.0%
None
ValueCountFrequency (%)
· 2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct1130
Distinct (%)94.1%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
2023-12-12T22:24:15.116204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length44
Mean length26.992506
Min length1

Characters and Unicode

Total characters32418
Distinct characters356
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1098 ?
Unique (%)91.4%

Sample

1st row서울특별시 송파구 방이동 102-16
2nd row서울특별시 송파구 잠실동 204-10
3rd row서울특별시 송파구 오금동 23-2 한이빌딩
4th row서울특별시 송파구 장지동 883 힐스테이트송파위례
5th row서울특별시 송파구 문정동 78-21
ValueCountFrequency (%)
서울특별시 1200
 
17.4%
송파구 1200
 
17.4%
1층 207
 
3.0%
가락동 152
 
2.2%
문정동 152
 
2.2%
140
 
2.0%
잠실동 138
 
2.0%
방이동 88
 
1.3%
석촌동 78
 
1.1%
마천동 77
 
1.1%
Other values (1189) 3467
50.3%
2023-12-12T22:24:15.721301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6455
19.9%
1 1493
 
4.6%
1407
 
4.3%
1300
 
4.0%
1299
 
4.0%
1252
 
3.9%
1238
 
3.8%
1222
 
3.8%
1222
 
3.8%
1218
 
3.8%
Other values (346) 14312
44.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20236
62.4%
Space Separator 6455
 
19.9%
Decimal Number 5404
 
16.7%
Uppercase Letter 125
 
0.4%
Dash Punctuation 78
 
0.2%
Open Punctuation 40
 
0.1%
Close Punctuation 40
 
0.1%
Other Punctuation 23
 
0.1%
Lowercase Letter 11
 
< 0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1407
 
7.0%
1300
 
6.4%
1299
 
6.4%
1252
 
6.2%
1238
 
6.1%
1222
 
6.0%
1222
 
6.0%
1218
 
6.0%
1201
 
5.9%
1200
 
5.9%
Other values (299) 7677
37.9%
Uppercase Letter
ValueCountFrequency (%)
B 36
28.8%
A 16
12.8%
T 12
 
9.6%
L 8
 
6.4%
I 5
 
4.0%
S 5
 
4.0%
E 5
 
4.0%
F 5
 
4.0%
X 5
 
4.0%
D 4
 
3.2%
Other values (12) 24
19.2%
Decimal Number
ValueCountFrequency (%)
1 1493
27.6%
2 739
13.7%
0 583
 
10.8%
4 444
 
8.2%
3 434
 
8.0%
6 403
 
7.5%
8 341
 
6.3%
5 332
 
6.1%
9 329
 
6.1%
7 306
 
5.7%
Lowercase Letter
ValueCountFrequency (%)
t 3
27.3%
e 2
18.2%
l 2
18.2%
o 1
 
9.1%
r 1
 
9.1%
m 1
 
9.1%
a 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
. 20
87.0%
' 2
 
8.7%
& 1
 
4.3%
Space Separator
ValueCountFrequency (%)
6455
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 78
100.0%
Open Punctuation
ValueCountFrequency (%)
( 40
100.0%
Close Punctuation
ValueCountFrequency (%)
) 40
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20236
62.4%
Common 12046
37.2%
Latin 136
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1407
 
7.0%
1300
 
6.4%
1299
 
6.4%
1252
 
6.2%
1238
 
6.1%
1222
 
6.0%
1222
 
6.0%
1218
 
6.0%
1201
 
5.9%
1200
 
5.9%
Other values (299) 7677
37.9%
Latin
ValueCountFrequency (%)
B 36
26.5%
A 16
11.8%
T 12
 
8.8%
L 8
 
5.9%
I 5
 
3.7%
S 5
 
3.7%
E 5
 
3.7%
F 5
 
3.7%
X 5
 
3.7%
D 4
 
2.9%
Other values (19) 35
25.7%
Common
ValueCountFrequency (%)
6455
53.6%
1 1493
 
12.4%
2 739
 
6.1%
0 583
 
4.8%
4 444
 
3.7%
3 434
 
3.6%
6 403
 
3.3%
8 341
 
2.8%
5 332
 
2.8%
9 329
 
2.7%
Other values (8) 493
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20236
62.4%
ASCII 12182
37.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6455
53.0%
1 1493
 
12.3%
2 739
 
6.1%
0 583
 
4.8%
4 444
 
3.6%
3 434
 
3.6%
6 403
 
3.3%
8 341
 
2.8%
5 332
 
2.7%
9 329
 
2.7%
Other values (37) 629
 
5.2%
Hangul
ValueCountFrequency (%)
1407
 
7.0%
1300
 
6.4%
1299
 
6.4%
1252
 
6.2%
1238
 
6.1%
1222
 
6.0%
1222
 
6.0%
1218
 
6.0%
1201
 
5.9%
1200
 
5.9%
Other values (299) 7677
37.9%
Distinct1117
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
2023-12-12T22:24:16.033847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length53
Mean length32.371357
Min length1

Characters and Unicode

Total characters38878
Distinct characters371
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1112 ?
Unique (%)92.6%

Sample

1st row서울특별시 송파구 올림픽로32길 47. 1층 (방이동)
2nd row서울특별시 송파구 올림픽로12길 22. 1층 (잠실동)
3rd row서울특별시 송파구 마천로 83. 제이에스빌딩 102호 (오금동)
4th row서울특별시 송파구 위례광장로 170. 상가B동 102호 (장지동. 힐스테이트송파위례)
5th row서울특별시 송파구 새말로10길 22. 1층 (문정동)
ValueCountFrequency (%)
서울특별시 1126
 
15.3%
송파구 1126
 
15.3%
1층 429
 
5.8%
문정동 149
 
2.0%
가락동 137
 
1.9%
잠실동 125
 
1.7%
올림픽로 104
 
1.4%
방이동 76
 
1.0%
석촌동 74
 
1.0%
마천동 72
 
1.0%
Other values (1454) 3954
53.6%
2023-12-12T22:24:16.495349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6591
 
17.0%
1 1973
 
5.1%
1456
 
3.7%
1385
 
3.6%
1367
 
3.5%
. 1277
 
3.3%
1187
 
3.1%
( 1156
 
3.0%
) 1156
 
3.0%
1154
 
3.0%
Other values (361) 20176
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22371
57.5%
Space Separator 6591
 
17.0%
Decimal Number 5993
 
15.4%
Other Punctuation 1280
 
3.3%
Open Punctuation 1156
 
3.0%
Close Punctuation 1156
 
3.0%
Uppercase Letter 164
 
0.4%
Dash Punctuation 151
 
0.4%
Lowercase Letter 10
 
< 0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1456
 
6.5%
1385
 
6.2%
1367
 
6.1%
1187
 
5.3%
1154
 
5.2%
1149
 
5.1%
1144
 
5.1%
1134
 
5.1%
1126
 
5.0%
1126
 
5.0%
Other values (317) 10143
45.3%
Uppercase Letter
ValueCountFrequency (%)
B 59
36.0%
A 32
19.5%
Y 9
 
5.5%
C 7
 
4.3%
F 7
 
4.3%
T 6
 
3.7%
G 5
 
3.0%
E 5
 
3.0%
S 5
 
3.0%
X 5
 
3.0%
Other values (9) 24
14.6%
Decimal Number
ValueCountFrequency (%)
1 1973
32.9%
2 855
14.3%
0 621
 
10.4%
3 579
 
9.7%
4 468
 
7.8%
5 366
 
6.1%
6 348
 
5.8%
9 276
 
4.6%
8 275
 
4.6%
7 232
 
3.9%
Lowercase Letter
ValueCountFrequency (%)
l 2
20.0%
e 2
20.0%
t 2
20.0%
o 1
10.0%
a 1
10.0%
m 1
10.0%
r 1
10.0%
Other Punctuation
ValueCountFrequency (%)
. 1277
99.8%
' 2
 
0.2%
& 1
 
0.1%
Space Separator
ValueCountFrequency (%)
6591
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1156
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1156
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 151
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22371
57.5%
Common 16333
42.0%
Latin 174
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1456
 
6.5%
1385
 
6.2%
1367
 
6.1%
1187
 
5.3%
1154
 
5.2%
1149
 
5.1%
1144
 
5.1%
1134
 
5.1%
1126
 
5.0%
1126
 
5.0%
Other values (317) 10143
45.3%
Latin
ValueCountFrequency (%)
B 59
33.9%
A 32
18.4%
Y 9
 
5.2%
C 7
 
4.0%
F 7
 
4.0%
T 6
 
3.4%
G 5
 
2.9%
E 5
 
2.9%
S 5
 
2.9%
X 5
 
2.9%
Other values (16) 34
19.5%
Common
ValueCountFrequency (%)
6591
40.4%
1 1973
 
12.1%
. 1277
 
7.8%
( 1156
 
7.1%
) 1156
 
7.1%
2 855
 
5.2%
0 621
 
3.8%
3 579
 
3.5%
4 468
 
2.9%
5 366
 
2.2%
Other values (8) 1291
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22371
57.5%
ASCII 16507
42.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6591
39.9%
1 1973
 
12.0%
. 1277
 
7.7%
( 1156
 
7.0%
) 1156
 
7.0%
2 855
 
5.2%
0 621
 
3.8%
3 579
 
3.5%
4 468
 
2.8%
5 366
 
2.2%
Other values (34) 1465
 
8.9%
Hangul
ValueCountFrequency (%)
1456
 
6.5%
1385
 
6.2%
1367
 
6.1%
1187
 
5.3%
1154
 
5.2%
1149
 
5.1%
1144
 
5.1%
1134
 
5.1%
1126
 
5.0%
1126
 
5.0%
Other values (317) 10143
45.3%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.5 KiB
2022-12-30
1201 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-12-30
2nd row2022-12-30
3rd row2022-12-30
4th row2022-12-30
5th row2022-12-30

Common Values

ValueCountFrequency (%)
2022-12-30 1201
100.0%

Length

2023-12-12T22:24:16.642415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:24:16.731797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-12-30 1201
100.0%

Missing values

2023-12-12T22:24:13.770995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:24:13.866556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명업소지번주소업소도로명주소데이터기준일자
0미니스톱 방이2점서울특별시 송파구 방이동 102-16서울특별시 송파구 올림픽로32길 47. 1층 (방이동)2022-12-30
1지에스25(GS25) 잠실제일점서울특별시 송파구 잠실동 204-10서울특별시 송파구 올림픽로12길 22. 1층 (잠실동)2022-12-30
2(주)코리아세븐 오금대림점서울특별시 송파구 오금동 23-2 한이빌딩서울특별시 송파구 마천로 83. 제이에스빌딩 102호 (오금동)2022-12-30
3대한마트서울특별시 송파구 장지동 883 힐스테이트송파위례서울특별시 송파구 위례광장로 170. 상가B동 102호 (장지동. 힐스테이트송파위례)2022-12-30
4(주)코리아세븐 문정문현점서울특별시 송파구 문정동 78-21서울특별시 송파구 새말로10길 22. 1층 (문정동)2022-12-30
5씨유 송파파크센트럴점서울특별시 송파구 거여동 234서울특별시 송파구 오금로 551. 송파파크센트럴 209동 101호 (거여동)2022-12-30
6GS25 잠실행운점서울특별시 송파구 잠실동 175-6 J타워서울특별시 송파구 올림픽로 76. J타워 102호 (잠실동)2022-12-30
7GS25 거여우리서울특별시 송파구 거여동 546서울특별시 송파구 오금로64길 3. 1층 101호 (거여동)2022-12-30
8(주)이마트 에브리데이 송파점서울특별시 송파구 가락동 479 헬리오시티서울특별시 송파구 송파대로 345. 1블록 A동 지하1층 149호 (가락동. 헬리오시티)2022-12-30
9트윙클네일앤모어서울특별시 송파구 가락동 165 래미안파크팰리스 상가서울특별시 송파구 동남로 227. 래미안파크팰리스 상가 110호 (가락동)2022-12-30
업소명업소지번주소업소도로명주소데이터기준일자
1191서울특별시 송파구 마천동 187번지 3 호서울특별시 송파구 성내천로 272-1 (마천동)2022-12-30
1192황금수퍼서울특별시 송파구 오금동 121번지 11호서울특별시 송파구 마천로21길 6 (오금동)2022-12-30
1193서울특별시 송파구 마천동 10번지 10 호서울특별시 송파구 마천로35길 15-8 (마천동)2022-12-30
1194일등수퍼서울특별시 송파구 삼전동 100번지 5 호서울특별시 송파구 백제고분로27길 5 (삼전동)2022-12-30
1195서울특별시 송파구 풍납동 404호서울특별시 송파구 풍성로26길 56 (풍납동)2022-12-30
1196서울특별시 송파구 마천동 207번지 3 호서울특별시 송파구 성내천로 269-3 (마천동)2022-12-30
1197서울특별시 송파구 마천동 211번지 30 호서울특별시 송파구 성내천로 290-1 (마천동)2022-12-30
1198대영슈퍼서울특별시 송파구 풍납동 94번지 4호서울특별시 송파구 바람드리9길 21 (풍납동)2022-12-30
1199알파문구 장지점서울특별시 송파구 문정동 82번지 10호서울특별시 송파구 새말로 124 (문정동)2022-12-30
1200서울특별시 송파구 마천동 130호서울특별시 송파구 거마로22길 20-15 (마천동)2022-12-30

Duplicate rows

Most frequently occurring

업소명업소지번주소업소도로명주소데이터기준일자# duplicates
0서울특별시 송파구 석촌동 277번지 0002호2022-12-302
1올림픽파크텔서울특별시 송파구 방이동 66번지 0002호2022-12-302
2이천쌀수퍼서울특별시 송파구 오금동 71번지 0009호2022-12-302