Overview

Dataset statistics

Number of variables5
Number of observations324
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.1 KiB
Average record size in memory41.4 B

Variable types

Numeric1
Text2
DateTime1
Categorical1

Dataset

Description경기도 구리시 내에서 담배(전자담배 포함) 구입이 가능한 편의점, 슈퍼, 모든 판매처의 현황(업소명, 주소 등)을 제공합니다.
URLhttps://www.data.go.kr/data/3038721/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 has unique valuesUnique
업소도로명주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:12:00.244130
Analysis finished2023-12-12 10:12:00.923268
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct324
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean162.5
Minimum1
Maximum324
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.0 KiB
2023-12-12T19:12:01.026849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile17.15
Q181.75
median162.5
Q3243.25
95-th percentile307.85
Maximum324
Range323
Interquartile range (IQR)161.5

Descriptive statistics

Standard deviation93.67497
Coefficient of variation (CV)0.57646135
Kurtosis-1.2
Mean162.5
Median Absolute Deviation (MAD)81
Skewness0
Sum52650
Variance8775
MonotonicityStrictly increasing
2023-12-12T19:12:01.189177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
205 1
 
0.3%
223 1
 
0.3%
222 1
 
0.3%
221 1
 
0.3%
220 1
 
0.3%
219 1
 
0.3%
218 1
 
0.3%
217 1
 
0.3%
216 1
 
0.3%
Other values (314) 314
96.9%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
324 1
0.3%
323 1
0.3%
322 1
0.3%
321 1
0.3%
320 1
0.3%
319 1
0.3%
318 1
0.3%
317 1
0.3%
316 1
0.3%
315 1
0.3%
Distinct320
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-12T19:12:01.475170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length17
Mean length8.7037037
Min length2

Characters and Unicode

Total characters2820
Distinct characters317
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique316 ?
Unique (%)97.5%

Sample

1st row백문슈퍼
2nd rowGS25 수택제일점
3rd row(주)코리아세븐 구리향군회관점
4th row주식회사 진로마트
5th row롯데씨브이에스711(주) 구리꽃길점
ValueCountFrequency (%)
세븐일레븐 32
 
6.3%
씨유 28
 
5.5%
gs25 28
 
5.5%
지에스(gs)25 15
 
2.9%
이마트24 14
 
2.8%
미니스톱 8
 
1.6%
주)코리아세븐 7
 
1.4%
cu 6
 
1.2%
구리갈매점 4
 
0.8%
구리토평점 4
 
0.8%
Other values (341) 363
71.3%
2023-12-12T19:12:01.946002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
188
 
6.7%
185
 
6.6%
142
 
5.0%
126
 
4.5%
2 76
 
2.7%
72
 
2.6%
60
 
2.1%
5 59
 
2.1%
G 56
 
2.0%
56
 
2.0%
Other values (307) 1800
63.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2215
78.5%
Space Separator 185
 
6.6%
Decimal Number 165
 
5.9%
Uppercase Letter 163
 
5.8%
Close Punctuation 38
 
1.3%
Open Punctuation 37
 
1.3%
Lowercase Letter 12
 
0.4%
Other Punctuation 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
188
 
8.5%
142
 
6.4%
126
 
5.7%
72
 
3.3%
60
 
2.7%
56
 
2.5%
49
 
2.2%
46
 
2.1%
42
 
1.9%
41
 
1.9%
Other values (275) 1393
62.9%
Uppercase Letter
ValueCountFrequency (%)
G 56
34.4%
S 55
33.7%
C 21
 
12.9%
U 18
 
11.0%
R 6
 
3.7%
D 2
 
1.2%
L 1
 
0.6%
Y 1
 
0.6%
T 1
 
0.6%
B 1
 
0.6%
Decimal Number
ValueCountFrequency (%)
2 76
46.1%
5 59
35.8%
4 19
 
11.5%
1 6
 
3.6%
0 2
 
1.2%
3 1
 
0.6%
8 1
 
0.6%
7 1
 
0.6%
Lowercase Letter
ValueCountFrequency (%)
f 3
25.0%
e 3
25.0%
o 2
16.7%
l 1
 
8.3%
s 1
 
8.3%
p 1
 
8.3%
u 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
. 3
60.0%
& 1
 
20.0%
# 1
 
20.0%
Space Separator
ValueCountFrequency (%)
185
100.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2215
78.5%
Common 430
 
15.2%
Latin 175
 
6.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
188
 
8.5%
142
 
6.4%
126
 
5.7%
72
 
3.3%
60
 
2.7%
56
 
2.5%
49
 
2.2%
46
 
2.1%
42
 
1.9%
41
 
1.9%
Other values (275) 1393
62.9%
Latin
ValueCountFrequency (%)
G 56
32.0%
S 55
31.4%
C 21
 
12.0%
U 18
 
10.3%
R 6
 
3.4%
f 3
 
1.7%
e 3
 
1.7%
o 2
 
1.1%
D 2
 
1.1%
L 1
 
0.6%
Other values (8) 8
 
4.6%
Common
ValueCountFrequency (%)
185
43.0%
2 76
17.7%
5 59
 
13.7%
) 38
 
8.8%
( 37
 
8.6%
4 19
 
4.4%
1 6
 
1.4%
. 3
 
0.7%
0 2
 
0.5%
3 1
 
0.2%
Other values (4) 4
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2215
78.5%
ASCII 605
 
21.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
188
 
8.5%
142
 
6.4%
126
 
5.7%
72
 
3.3%
60
 
2.7%
56
 
2.5%
49
 
2.2%
46
 
2.1%
42
 
1.9%
41
 
1.9%
Other values (275) 1393
62.9%
ASCII
ValueCountFrequency (%)
185
30.6%
2 76
12.6%
5 59
 
9.8%
G 56
 
9.3%
S 55
 
9.1%
) 38
 
6.3%
( 37
 
6.1%
C 21
 
3.5%
4 19
 
3.1%
U 18
 
3.0%
Other values (22) 41
 
6.8%
Distinct324
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-12T19:12:02.290733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length46
Mean length32.04321
Min length15

Characters and Unicode

Total characters10382
Distinct characters254
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique324 ?
Unique (%)100.0%

Sample

1st row경기도 구리시 장자대로13번길 5. 대한빌딩 1층 (교문동)
2nd row경기도 구리시 검배로29번길 53. 1층 (수택동)
3rd row경기도 구리시 안골로30번길 16 (교문동)
4th row경기도 구리시 검배로 42. 1층 (수택동)
5th row경기도 구리시 체육관로 147. 두호 1층 (교문동)
ValueCountFrequency (%)
경기도 324
 
15.5%
구리시 324
 
15.5%
1층 135
 
6.5%
수택동 105
 
5.0%
인창동 71
 
3.4%
교문동 69
 
3.3%
갈매동 39
 
1.9%
토평동 26
 
1.2%
101호 25
 
1.2%
상가동 23
 
1.1%
Other values (526) 949
45.4%
2023-12-12T19:12:02.846306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1770
 
17.0%
1 581
 
5.6%
425
 
4.1%
394
 
3.8%
360
 
3.5%
354
 
3.4%
337
 
3.2%
334
 
3.2%
330
 
3.2%
. 330
 
3.2%
Other values (244) 5167
49.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5725
55.1%
Decimal Number 1821
 
17.5%
Space Separator 1770
 
17.0%
Other Punctuation 330
 
3.2%
Close Punctuation 327
 
3.1%
Open Punctuation 327
 
3.1%
Dash Punctuation 52
 
0.5%
Uppercase Letter 24
 
0.2%
Lowercase Letter 4
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
425
 
7.4%
394
 
6.9%
360
 
6.3%
354
 
6.2%
337
 
5.9%
334
 
5.8%
330
 
5.8%
318
 
5.6%
188
 
3.3%
177
 
3.1%
Other values (211) 2508
43.8%
Uppercase Letter
ValueCountFrequency (%)
B 4
16.7%
L 3
12.5%
G 2
8.3%
S 2
8.3%
A 2
8.3%
E 2
8.3%
O 2
8.3%
H 1
 
4.2%
R 1
 
4.2%
W 1
 
4.2%
Other values (4) 4
16.7%
Decimal Number
ValueCountFrequency (%)
1 581
31.9%
2 218
 
12.0%
0 212
 
11.6%
3 156
 
8.6%
4 138
 
7.6%
5 119
 
6.5%
6 119
 
6.5%
7 100
 
5.5%
9 92
 
5.1%
8 86
 
4.7%
Lowercase Letter
ValueCountFrequency (%)
e 2
50.0%
k 1
25.0%
t 1
25.0%
Space Separator
ValueCountFrequency (%)
1770
100.0%
Other Punctuation
ValueCountFrequency (%)
. 330
100.0%
Close Punctuation
ValueCountFrequency (%)
) 327
100.0%
Open Punctuation
ValueCountFrequency (%)
( 327
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5725
55.1%
Common 4629
44.6%
Latin 28
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
425
 
7.4%
394
 
6.9%
360
 
6.3%
354
 
6.2%
337
 
5.9%
334
 
5.8%
330
 
5.8%
318
 
5.6%
188
 
3.3%
177
 
3.1%
Other values (211) 2508
43.8%
Latin
ValueCountFrequency (%)
B 4
14.3%
L 3
10.7%
G 2
 
7.1%
S 2
 
7.1%
A 2
 
7.1%
e 2
 
7.1%
E 2
 
7.1%
O 2
 
7.1%
k 1
 
3.6%
t 1
 
3.6%
Other values (7) 7
25.0%
Common
ValueCountFrequency (%)
1770
38.2%
1 581
 
12.6%
. 330
 
7.1%
) 327
 
7.1%
( 327
 
7.1%
2 218
 
4.7%
0 212
 
4.6%
3 156
 
3.4%
4 138
 
3.0%
5 119
 
2.6%
Other values (6) 451
 
9.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5725
55.1%
ASCII 4657
44.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1770
38.0%
1 581
 
12.5%
. 330
 
7.1%
) 327
 
7.0%
( 327
 
7.0%
2 218
 
4.7%
0 212
 
4.6%
3 156
 
3.3%
4 138
 
3.0%
5 119
 
2.6%
Other values (23) 479
 
10.3%
Hangul
ValueCountFrequency (%)
425
 
7.4%
394
 
6.9%
360
 
6.3%
354
 
6.2%
337
 
5.9%
334
 
5.8%
330
 
5.8%
318
 
5.6%
188
 
3.3%
177
 
3.1%
Other values (211) 2508
43.8%
Distinct302
Distinct (%)93.2%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
Minimum1900-01-01 00:00:00
Maximum2022-05-03 00:00:00
2023-12-12T19:12:03.034213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:12:03.193733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-05-25
324 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-05-25
2nd row2023-05-25
3rd row2023-05-25
4th row2023-05-25
5th row2023-05-25

Common Values

ValueCountFrequency (%)
2023-05-25 324
100.0%

Length

2023-12-12T19:12:03.345230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:12:03.471434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-05-25 324
100.0%

Interactions

2023-12-12T19:12:00.573588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T19:12:00.717912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:12:00.867028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명업소도로명주소지정일자데이터기준일자
01백문슈퍼경기도 구리시 장자대로13번길 5. 대한빌딩 1층 (교문동)2022-05-032023-05-25
12GS25 수택제일점경기도 구리시 검배로29번길 53. 1층 (수택동)2022-04-192023-05-25
23(주)코리아세븐 구리향군회관점경기도 구리시 안골로30번길 16 (교문동)2022-04-192023-05-25
34주식회사 진로마트경기도 구리시 검배로 42. 1층 (수택동)2022-03-302023-05-25
45롯데씨브이에스711(주) 구리꽃길점경기도 구리시 체육관로 147. 두호 1층 (교문동)2022-03-232023-05-25
56미니스톱 구리인창점경기도 구리시 건원대로 42. 삼원골드프라자 111.112.113호 (인창동)2022-03-142023-05-25
67GS25 인창골드점경기도 구리시 동구릉로200번길 29-22. 1층 왼쪽호 (인창동)2022-03-072023-05-25
78CU 인창어반포레가을점경기도 구리시 응달말로52번길 69-5. 1층 (인창동)2022-03-072023-05-25
89CU 구리스타점경기도 구리시 갈매중앙로 201-3. 별내역 메트로망 3차 오피스텔 109.110호 (갈매동)2022-02-252023-05-25
910베이프 챌린지경기도 구리시 안골로63번길 35. 1층 (수택동)2022-02-212023-05-25
연번업소명업소도로명주소지정일자데이터기준일자
314315여주상회경기도 구리시 안골로63번길 42-12 (수택동)1998-11-202023-05-25
315316(주)코리아세븐 수택점경기도 구리시 검배로60번길 4 (수택동)1997-04-262023-05-25
316317왕자문구사경기도 구리시 체육관로 165 (교문동)1996-07-292023-05-25
317318삼육지물건재경기도 구리시 동구릉로 209 (인창동)1995-04-112023-05-25
318319구리농협경기도 구리시 경춘로 145 (교문동)1994-05-112023-05-25
319320대동장식경기도 구리시 검배로72번길 10 (수택동)1984-08-292023-05-25
320321경일슈퍼경기도 구리시 안골로97번길 17-5. 경일슈퍼 (수택동)1987-09-012023-05-25
321322수택슈퍼경기도 구리시 안골로 94 (수택동)1988-05-302023-05-25
322323강원상회경기도 구리시 경춘로162번길 36-6 (교문동)1900-01-012023-05-25
323324나영상회경기도 구리시 경춘로24번길 10 (교문동)1983-12-092023-05-25