Overview

Dataset statistics

Number of variables6
Number of observations851
Missing cells0
Missing cells (%)0.0%
Duplicate rows5
Duplicate rows (%)0.6%
Total size in memory40.0 KiB
Average record size in memory48.2 B

Variable types

Text3
DateTime1
Categorical2

Dataset

Description서울특별시 동대문구 담배소매인 지정현황 데이터로 업소명, 업소 지번주소, 업소 도로명주소, 지정일자 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15014278/fileData.do

Alerts

관리기관명 has constant value ""Constant
데이터기준일 has constant value ""Constant
Dataset has 5 (0.6%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 00:21:20.673637
Analysis finished2023-12-12 00:21:21.126075
Duration0.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct817
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2023-12-12T09:21:21.302064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length20
Mean length8.0235018
Min length1

Characters and Unicode

Total characters6828
Distinct characters453
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique793 ?
Unique (%)93.2%

Sample

1st row명문손세차
2nd row씨유(CU) 용두허브리츠점
3rd row씨유 용두두산위브점
4th row씨유(CU) 동대문도서관점
5th row(주)코리아세븐 신이문로점
ValueCountFrequency (%)
씨유 70
 
5.9%
주)코리아세븐 37
 
3.1%
지에스25 28
 
2.4%
세븐일레븐 27
 
2.3%
gs25 23
 
1.9%
이마트24 15
 
1.3%
주식회사 8
 
0.7%
씨유(cu 7
 
0.6%
cu 7
 
0.6%
편의점 7
 
0.6%
Other values (861) 956
80.7%
2023-12-12T09:21:21.640439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
358
 
5.2%
343
 
5.0%
2 161
 
2.4%
155
 
2.3%
138
 
2.0%
132
 
1.9%
125
 
1.8%
5 121
 
1.8%
119
 
1.7%
118
 
1.7%
Other values (443) 5058
74.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5672
83.1%
Space Separator 343
 
5.0%
Decimal Number 328
 
4.8%
Uppercase Letter 234
 
3.4%
Close Punctuation 105
 
1.5%
Open Punctuation 105
 
1.5%
Lowercase Letter 37
 
0.5%
Other Punctuation 3
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
358
 
6.3%
155
 
2.7%
138
 
2.4%
132
 
2.3%
125
 
2.2%
119
 
2.1%
118
 
2.1%
117
 
2.1%
111
 
2.0%
108
 
1.9%
Other values (393) 4191
73.9%
Uppercase Letter
ValueCountFrequency (%)
S 79
33.8%
G 74
31.6%
U 21
 
9.0%
C 21
 
9.0%
K 5
 
2.1%
L 5
 
2.1%
J 4
 
1.7%
I 3
 
1.3%
D 3
 
1.3%
E 3
 
1.3%
Other values (13) 16
 
6.8%
Lowercase Letter
ValueCountFrequency (%)
r 6
16.2%
e 5
13.5%
t 5
13.5%
a 5
13.5%
m 4
10.8%
o 2
 
5.4%
k 2
 
5.4%
d 1
 
2.7%
g 1
 
2.7%
s 1
 
2.7%
Other values (5) 5
13.5%
Decimal Number
ValueCountFrequency (%)
2 161
49.1%
5 121
36.9%
4 34
 
10.4%
1 8
 
2.4%
3 3
 
0.9%
7 1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
? 1
33.3%
Space Separator
ValueCountFrequency (%)
343
100.0%
Close Punctuation
ValueCountFrequency (%)
) 105
100.0%
Open Punctuation
ValueCountFrequency (%)
( 105
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5671
83.1%
Common 885
 
13.0%
Latin 271
 
4.0%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
358
 
6.3%
155
 
2.7%
138
 
2.4%
132
 
2.3%
125
 
2.2%
119
 
2.1%
118
 
2.1%
117
 
2.1%
111
 
2.0%
108
 
1.9%
Other values (392) 4190
73.9%
Latin
ValueCountFrequency (%)
S 79
29.2%
G 74
27.3%
U 21
 
7.7%
C 21
 
7.7%
r 6
 
2.2%
e 5
 
1.8%
t 5
 
1.8%
a 5
 
1.8%
K 5
 
1.8%
L 5
 
1.8%
Other values (28) 45
16.6%
Common
ValueCountFrequency (%)
343
38.8%
2 161
18.2%
5 121
 
13.7%
) 105
 
11.9%
( 105
 
11.9%
4 34
 
3.8%
1 8
 
0.9%
3 3
 
0.3%
. 2
 
0.2%
- 1
 
0.1%
Other values (2) 2
 
0.2%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5671
83.1%
ASCII 1156
 
16.9%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
358
 
6.3%
155
 
2.7%
138
 
2.4%
132
 
2.3%
125
 
2.2%
119
 
2.1%
118
 
2.1%
117
 
2.1%
111
 
2.0%
108
 
1.9%
Other values (392) 4190
73.9%
ASCII
ValueCountFrequency (%)
343
29.7%
2 161
13.9%
5 121
 
10.5%
) 105
 
9.1%
( 105
 
9.1%
S 79
 
6.8%
G 74
 
6.4%
4 34
 
2.9%
U 21
 
1.8%
C 21
 
1.8%
Other values (40) 92
 
8.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct777
Distinct (%)91.3%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2023-12-12T09:21:21.929726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length24
Mean length19.417156
Min length15

Characters and Unicode

Total characters16524
Distinct characters82
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique738 ?
Unique (%)86.7%

Sample

1st row서울특별시 동대문구 서울시립대로 66-2
2nd row서울특별시 동대문구 왕산로26길 35
3rd row서울특별시 동대문구 천호대로 124
4th row서울특별시 동대문구 천호대로2길 23-17
5th row서울특별시 동대문구 신이문로9길 5-2
ValueCountFrequency (%)
동대문구 851
25.0%
서울특별시 850
25.0%
왕산로 48
 
1.4%
한천로 37
 
1.1%
천호대로 32
 
0.9%
답십리로 30
 
0.9%
회기로 25
 
0.7%
이문로 23
 
0.7%
장한로 21
 
0.6%
전농로 21
 
0.6%
Other values (612) 1467
43.1%
2023-12-12T09:21:22.320112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2557
15.5%
982
 
5.9%
901
 
5.5%
901
 
5.5%
888
 
5.4%
875
 
5.3%
874
 
5.3%
851
 
5.2%
850
 
5.1%
850
 
5.1%
Other values (72) 5995
36.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11093
67.1%
Decimal Number 2755
 
16.7%
Space Separator 2557
 
15.5%
Dash Punctuation 119
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
982
8.9%
901
 
8.1%
901
 
8.1%
888
 
8.0%
875
 
7.9%
874
 
7.9%
851
 
7.7%
850
 
7.7%
850
 
7.7%
842
 
7.6%
Other values (60) 2279
20.5%
Decimal Number
ValueCountFrequency (%)
1 555
20.1%
2 461
16.7%
3 291
10.6%
4 255
9.3%
6 246
8.9%
7 224
8.1%
5 222
 
8.1%
8 181
 
6.6%
0 163
 
5.9%
9 157
 
5.7%
Space Separator
ValueCountFrequency (%)
2557
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 119
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11093
67.1%
Common 5431
32.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
982
8.9%
901
 
8.1%
901
 
8.1%
888
 
8.0%
875
 
7.9%
874
 
7.9%
851
 
7.7%
850
 
7.7%
850
 
7.7%
842
 
7.6%
Other values (60) 2279
20.5%
Common
ValueCountFrequency (%)
2557
47.1%
1 555
 
10.2%
2 461
 
8.5%
3 291
 
5.4%
4 255
 
4.7%
6 246
 
4.5%
7 224
 
4.1%
5 222
 
4.1%
8 181
 
3.3%
0 163
 
3.0%
Other values (2) 276
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11093
67.1%
ASCII 5431
32.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2557
47.1%
1 555
 
10.2%
2 461
 
8.5%
3 291
 
5.4%
4 255
 
4.7%
6 246
 
4.5%
7 224
 
4.1%
5 222
 
4.1%
8 181
 
3.3%
0 163
 
3.0%
Other values (2) 276
 
5.1%
Hangul
ValueCountFrequency (%)
982
8.9%
901
 
8.1%
901
 
8.1%
888
 
8.0%
875
 
7.9%
874
 
7.9%
851
 
7.7%
850
 
7.7%
850
 
7.7%
842
 
7.6%
Other values (60) 2279
20.5%
Distinct776
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2023-12-12T09:21:22.468665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length22
Mean length20.321974
Min length16

Characters and Unicode

Total characters17294
Distinct characters53
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique736 ?
Unique (%)86.5%

Sample

1st row서울특별시 동대문구 전농동 500-2
2nd row서울특별시 동대문구 용두동 792-3
3rd row서울특별시 동대문구 용두동 791-1
4th row서울특별시 동대문구 신설동 104-30
5th row서울특별시 동대문구 이문동 242-51
ValueCountFrequency (%)
서울특별시 851
24.9%
동대문구 851
24.9%
장안동 182
 
5.3%
제기동 98
 
2.9%
답십리동 98
 
2.9%
전농동 96
 
2.8%
이문동 82
 
2.4%
용두동 78
 
2.3%
휘경동 72
 
2.1%
청량리동 58
 
1.7%
Other values (772) 949
27.8%
2023-12-12T09:21:22.718526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2564
14.8%
1695
 
9.8%
933
 
5.4%
851
 
4.9%
851
 
4.9%
851
 
4.9%
851
 
4.9%
851
 
4.9%
851
 
4.9%
851
 
4.9%
Other values (43) 6145
35.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10386
60.1%
Decimal Number 3615
 
20.9%
Space Separator 2564
 
14.8%
Dash Punctuation 729
 
4.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1695
16.3%
933
9.0%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
183
 
1.8%
Other values (31) 1618
15.6%
Decimal Number
ValueCountFrequency (%)
1 645
17.8%
3 493
13.6%
2 465
12.9%
4 400
11.1%
5 328
9.1%
9 295
8.2%
6 286
7.9%
0 240
 
6.6%
8 240
 
6.6%
7 223
 
6.2%
Space Separator
ValueCountFrequency (%)
2564
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 729
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10386
60.1%
Common 6908
39.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1695
16.3%
933
9.0%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
183
 
1.8%
Other values (31) 1618
15.6%
Common
ValueCountFrequency (%)
2564
37.1%
- 729
 
10.6%
1 645
 
9.3%
3 493
 
7.1%
2 465
 
6.7%
4 400
 
5.8%
5 328
 
4.7%
9 295
 
4.3%
6 286
 
4.1%
0 240
 
3.5%
Other values (2) 463
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10386
60.1%
ASCII 6908
39.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2564
37.1%
- 729
 
10.6%
1 645
 
9.3%
3 493
 
7.1%
2 465
 
6.7%
4 400
 
5.8%
5 328
 
4.7%
9 295
 
4.3%
6 286
 
4.1%
0 240
 
3.5%
Other values (2) 463
 
6.7%
Hangul
ValueCountFrequency (%)
1695
16.3%
933
9.0%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
851
8.2%
183
 
1.8%
Other values (31) 1618
15.6%
Distinct728
Distinct (%)85.5%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
Minimum1983-02-01 00:00:00
Maximum2023-04-13 00:00:00
2023-12-12T09:21:22.828850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:21:22.960448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

관리기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
서울특별시 동대문구청
851 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시 동대문구청
2nd row서울특별시 동대문구청
3rd row서울특별시 동대문구청
4th row서울특별시 동대문구청
5th row서울특별시 동대문구청

Common Values

ValueCountFrequency (%)
서울특별시 동대문구청 851
100.0%

Length

2023-12-12T09:21:23.060556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:21:23.136244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 851
50.0%
동대문구청 851
50.0%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2023-04-18
851 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-04-18
2nd row2023-04-18
3rd row2023-04-18
4th row2023-04-18
5th row2023-04-18

Common Values

ValueCountFrequency (%)
2023-04-18 851
100.0%

Length

2023-12-12T09:21:23.223567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:21:23.305736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-04-18 851
100.0%

Missing values

2023-12-12T09:21:21.016558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:21:21.093573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명도로명주소지번주소지정일자관리기관명데이터기준일
0명문손세차서울특별시 동대문구 서울시립대로 66-2서울특별시 동대문구 전농동 500-22023-04-13서울특별시 동대문구청2023-04-18
1씨유(CU) 용두허브리츠점서울특별시 동대문구 왕산로26길 35서울특별시 동대문구 용두동 792-32023-04-13서울특별시 동대문구청2023-04-18
2씨유 용두두산위브점서울특별시 동대문구 천호대로 124서울특별시 동대문구 용두동 791-12023-04-07서울특별시 동대문구청2023-04-18
3씨유(CU) 동대문도서관점서울특별시 동대문구 천호대로2길 23-17서울특별시 동대문구 신설동 104-302023-03-31서울특별시 동대문구청2023-04-18
4(주)코리아세븐 신이문로점서울특별시 동대문구 신이문로9길 5-2서울특별시 동대문구 이문동 242-512023-03-16서울특별시 동대문구청2023-04-18
5이엔에프하우스서울특별시 동대문구 천호대로 10서울특별시 동대문구 신설동 103-22023-03-14서울특별시 동대문구청2023-04-18
6씨유 청계노벨리아점서울특별시 동대문구 서울시립대로 19서울특별시 동대문구 답십리동 10082023-03-13서울특별시 동대문구청2023-04-18
7(주)코리아세븐 아이클래스청계첨서울특별시 동대문구 청계천로 461서울특별시 동대문구 용두동 255-492023-02-24서울특별시 동대문구청2023-04-18
8춘보물산서울특별시 동대문구 약령중앙로 23-2서울특별시 동대문구 제기동 1141-332023-02-24서울특별시 동대문구청2023-04-18
9지에스25 전농샛별점서울특별시 동대문구 전농로37길 78서울특별시 동대문구 전농동 128-1012023-02-21서울특별시 동대문구청2023-04-18
업소명도로명주소지번주소지정일자관리기관명데이터기준일
841만경상회서울특별시 동대문구 회기로12길 29서울특별시 동대문구 회기동 102-1111986-03-07서울특별시 동대문구청2023-04-18
842담배서울특별시 동대문구 경희대로 15서울특별시 동대문구 회기동 42-11986-12-01서울특별시 동대문구청2023-04-18
843삼오사부동산서울특별시 동대문구 이문로 41서울특별시 동대문구 회기동 346-131985-07-01서울특별시 동대문구청2023-04-18
844문방구서울특별시 동대문구 답십리로48길 45서울특별시 동대문구 답십리동 36-241985-06-30서울특별시 동대문구청2023-04-18
845담배서울특별시 동대문구 외대역동로18길 5서울특별시 동대문구 이문동 86-1171983-02-01서울특별시 동대문구청2023-04-18
846근린식품서울특별시 동대문구 답십리로65길 123서울특별시 동대문구 장안동 102-61983-06-28서울특별시 동대문구청2023-04-18
847동천상회서울특별시 동대문구 답십리로23길 70-2서울특별시 동대문구 전농동 530-101983-06-09서울특별시 동대문구청2023-04-18
848씨유 외대제일점서울특별시 동대문구 이문로30길 23서울특별시 동대문구 이문동 293-232009-03-04서울특별시 동대문구청2023-04-18
849해태수퍼서울특별시 동대문구 한천로46길서울특별시 동대문구 휘경동 49-1811994-02-25서울특별시 동대문구청2023-04-18
850GS25 장안에이스점서울특별시 동대문구 답십리로 252서울특별시 동대문구 장안동 143-402017-08-09서울특별시 동대문구청2023-04-18

Duplicate rows

Most frequently occurring

업소명도로명주소지번주소지정일자관리기관명데이터기준일# duplicates
2경희대학교 소비자생활협동조합서울특별시 동대문구 경희대로 26-6서울특별시 동대문구 회기동 1-52007-02-13서울특별시 동대문구청2023-04-186
0서울특별시 동대문구 천호대로 247서울특별시 동대문구 답십리동 498-52005-05-16서울특별시 동대문구청2023-04-182
1경주사업총괄본부 장안서울특별시 동대문구 장한로2길 33서울특별시 동대문구 장안동 4622022-05-11서울특별시 동대문구청2023-04-182
3경희대학교소비자생활협동조합서울특별시 동대문구 경희대로 26-6서울특별시 동대문구 회기동 1-52021-02-18서울특별시 동대문구청2023-04-182
4숭인상회서울특별시 동대문구 난계로 254서울특별시 동대문구 신설동 117-42022-11-21서울특별시 동대문구청2023-04-182