Overview

Dataset statistics

Number of variables: 4
Number of observations: 202
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.4 KiB
Average record size in memory: 32.7 B

Variable types

Categorical: 1
Text: 2
DateTime: 1

Dataset

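Description: Status data on tobacco retailers in Yeonggwang-gun, maintained in connection with the Tobacco Business Act and its Enforcement Rules. It covers the category (구분), business name (업소명), business address (업소주소), and designation date (지정일자) of each retailer.
Author: Yeonggwang-gun, Jeollanam-do (전라남도 영광군)
URL: https://www.data.go.kr/data/15021441/fileData.do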

Alerts

구분 is highly imbalanced (67.5%) [Imbalance]
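
The 67.5% figure is consistent with an entropy-based imbalance score: one minus the Shannon entropy of the class distribution, divided by the maximum entropy for that number of classes. The sketch below assumes that definition (the function name `imbalance_score` is illustrative and does not come from the report) and applies it to the two class counts, 190 and 12, listed for 구분.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max for a list of class counts.

    H is the Shannon entropy (base 2) of the class distribution and
    H_max = log2(number of classes). 0.0 means perfectly balanced.
    This definition is an assumption; it reproduces the reported 67.5%.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2 or total == 0:
        return 0.0  # convention chosen for this sketch
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


score = imbalance_score([190, 12])
print(f"{score:.1%}")  # 67.5%
```

A perfectly balanced pair such as `[5, 5]` scores 0.0 under the same definition.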

Reproduction

Analysis started: 2023-12-12 10:55:27.998469
Analysis finished: 2023-12-12 10:55:28.599665
Duration: 0.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

구분 (category)
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value | Count
일반소매인 (general retailer) | 190
구내소매인 (on-premises retailer) | 12

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반소매인
2nd row: 일반소매인
3rd row: 일반소매인
4th row: 일반소매인
5th row: 일반소매인

Common Values

Value | Count | Frequency (%)
일반소매인 | 190 | 94.1%
구내소매인 | 12 | 5.9%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the common values listed above]

업소명 (business name)
Text

Distinct: 200
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 16
Median length: 15
Mean length: 6.4356436
Min length: 2

Characters and Unicode

Total characters: 1300
Distinct characters: 258
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
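
As a minimal sketch of how such properties can be read, Python's standard `unicodedata` module exposes the general category of each character. The mapping of two-letter codes to the long names used in this report and the helper name `category_counts` are assumptions made for illustration; the two input strings are business names taken from the Sample section. Script and block lookups are not available in `unicodedata` and are not shown.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


names = ["씨유(CU)영광대마산단점", "이마트24 R대마산단점"]
for category, count in category_counts(names).most_common():
    print(category, count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables for the text variables.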

Unique

Unique: 198
Unique (%): 98.0%
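
"Distinct" and "Unique" measure different things: distinct counts the different values present, while unique counts the values that occur exactly once. With 202 rows, 200 distinct names, and 198 unique names, two names must each occur twice (198 + 2 × 2 = 202). The sketch below shows the distinction on a small illustrative list; the helper name and the deliberately repeated entry are assumptions, not data from the report.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Illustrative input: "조은마트" is repeated on purpose.
values = ["정상매점", "조은마트", "마트넷", "조은마트"]
print(distinct_and_unique(values))  # (3, 2)
```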

Sample

1st row: 미니스톱 영광도동점
2nd row: 설도젓갈타운매점
3rd row: 지에스25 법성포 터미널점
4th row: 이마트24R 영광중앙점
5th row: 정상매점

Value | Count | Frequency (%)
세븐일레븐 | 8 | 3.2%
이마트24 | 6 | 2.4%
씨유 | 6 | 2.4%
미니스톱 | 5 | 2.0%
일성수퍼 | 2 | 0.8%
대성상회 | 2 | 0.8%
한전kps | 2 | 0.8%
영광농협 | 2 | 0.8%
영광단주점 | 2 | 0.8%
하나로마트 | 2 | 0.8%
Other values (212) | 215 | 85.3%
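
The percentages in this table are shares of all tokens rather than of rows: the listed counts sum to 252, and 8 / 252 ≈ 3.2%. A minimal sketch of such a count follows, assuming whitespace tokenization and lowercasing (the lowercase 한전kps entry is consistent with that, but the exact tokenizer is not stated in the report). The function name is illustrative, and the input is the five sample rows shown for this variable.

```python
from collections import Counter


def word_frequencies(values):
    """Return (word, count, share of all tokens), most frequent first."""
    words = Counter()
    for value in values:
        # Assumed tokenization: lowercase, then split on whitespace.
        words.update(value.lower().split())
    total = sum(words.values())
    return [(word, n, n / total) for word, n in words.most_common()]


names = [
    "미니스톱 영광도동점",
    "설도젓갈타운매점",
    "지에스25 법성포 터미널점",
    "이마트24R 영광중앙점",
    "정상매점",
]
for word, count, share in word_frequencies(names):
    print(f"{word} | {count} | {share:.1%}")
```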

Most occurring characters

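Value | Count | Frequency (%)
(Hangul character) | 59 | 4.5%
(space) | 50 | 3.8%
(Hangul character) | 49 | 3.8%
(Hangul character) | 47 | 3.6%
(Hangul character) | 39 | 3.0%
(Hangul character) | 35 | 2.7%
(Hangul character) | 31 | 2.4%
(Hangul character) | 28 | 2.2%
(Hangul character) | 26 | 2.0%
(Hangul character) | 24 | 1.8%
Other values (248) | 912 | 70.2%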

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1164 | 89.5%
Space Separator | 50 | 3.8%
Decimal Number | 42 | 3.2%
Uppercase Letter | 30 | 2.3%
Open Punctuation | 7 | 0.5%
Close Punctuation | 7 | 0.5%

Most frequent character per category

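Other Letter
Value | Count | Frequency (%)
(Hangul character) | 59 | 5.1%
(Hangul character) | 49 | 4.2%
(Hangul character) | 47 | 4.0%
(Hangul character) | 39 | 3.4%
(Hangul character) | 35 | 3.0%
(Hangul character) | 31 | 2.7%
(Hangul character) | 28 | 2.4%
(Hangul character) | 26 | 2.2%
(Hangul character) | 24 | 2.1%
(Hangul character) | 19 | 1.6%
Other values (228) | 807 | 69.3%

Uppercase Letter
Value | Count | Frequency (%)
S | 7 | 23.3%
C | 6 | 20.0%
G | 5 | 16.7%
P | 3 | 10.0%
K | 2 | 6.7%
U | 2 | 6.7%
R | 2 | 6.7%
X | 1 | 3.3%
V | 1 | 3.3%
T | 1 | 3.3%

Decimal Number
Value | Count | Frequency (%)
2 | 15 | 35.7%
5 | 8 | 19.0%
4 | 8 | 19.0%
3 | 5 | 11.9%
6 | 3 | 7.1%
1 | 2 | 4.8%
9 | 1 | 2.4%

Space Separator
Value | Count | Frequency (%)
(space) | 50 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 7 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 7 | 100.0%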

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1164 | 89.5%
Common | 106 | 8.2%
Latin | 30 | 2.3%

Most frequent character per script

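Hangul
Value | Count | Frequency (%)
(Hangul character) | 59 | 5.1%
(Hangul character) | 49 | 4.2%
(Hangul character) | 47 | 4.0%
(Hangul character) | 39 | 3.4%
(Hangul character) | 35 | 3.0%
(Hangul character) | 31 | 2.7%
(Hangul character) | 28 | 2.4%
(Hangul character) | 26 | 2.2%
(Hangul character) | 24 | 2.1%
(Hangul character) | 19 | 1.6%
Other values (228) | 807 | 69.3%

Common
Value | Count | Frequency (%)
(space) | 50 | 47.2%
2 | 15 | 14.2%
5 | 8 | 7.5%
4 | 8 | 7.5%
( | 7 | 6.6%
) | 7 | 6.6%
3 | 5 | 4.7%
6 | 3 | 2.8%
1 | 2 | 1.9%
9 | 1 | 0.9%

Latin
Value | Count | Frequency (%)
S | 7 | 23.3%
C | 6 | 20.0%
G | 5 | 16.7%
P | 3 | 10.0%
K | 2 | 6.7%
U | 2 | 6.7%
R | 2 | 6.7%
X | 1 | 3.3%
V | 1 | 3.3%
T | 1 | 3.3%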

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1164 | 89.5%
ASCII | 136 | 10.5%

Most frequent character per block

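Hangul
Value | Count | Frequency (%)
(Hangul character) | 59 | 5.1%
(Hangul character) | 49 | 4.2%
(Hangul character) | 47 | 4.0%
(Hangul character) | 39 | 3.4%
(Hangul character) | 35 | 3.0%
(Hangul character) | 31 | 2.7%
(Hangul character) | 28 | 2.4%
(Hangul character) | 26 | 2.2%
(Hangul character) | 24 | 2.1%
(Hangul character) | 19 | 1.6%
Other values (228) | 807 | 69.3%

ASCII
Value | Count | Frequency (%)
(space) | 50 | 36.8%
2 | 15 | 11.0%
5 | 8 | 5.9%
4 | 8 | 5.9%
( | 7 | 5.1%
) | 7 | 5.1%
S | 7 | 5.1%
C | 6 | 4.4%
3 | 5 | 3.7%
G | 5 | 3.7%
Other values (10) | 18 | 13.2%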

업소주소 (business address)
Text

Distinct: 199
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 41
Median length: 32
Mean length: 24.09901
Min length: 18

Characters and Unicode

Total characters: 4868
Distinct characters: 125
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 197
Unique (%): 97.5%

Sample

1st row: 전라남도 영광군 영광읍 도동리 360 도동휴먼시아
2nd row: 전라남도 영광군 염산면 봉남리 695-20
3rd row: 전라남도 영광군 법성면 법성리 1229
4th row: 전라남도 영광군 영광읍 남천리 251
5th row: 전라남도 영광군 백수읍 대신리 162-1

Value | Count | Frequency (%)
전라남도 | 202 | 18.3%
영광군 | 202 | 18.3%
영광읍 | 101 | 9.1%
홍농읍 | 22 | 2.0%
법성면 | 22 | 2.0%
백수읍 | 22 | 2.0%
남천리 | 21 | 1.9%
신하리 | 16 | 1.4%
백학리 | 16 | 1.4%
법성리 | 15 | 1.4%
Other values (297) | 466 | 42.2%

Most occurring characters

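Value | Count | Frequency (%)
(space) | 1007 | 20.7%
(Hangul character) | 307 | 6.3%
(Hangul character) | 305 | 6.3%
(Hangul character) | 233 | 4.8%
(Hangul character) | 216 | 4.4%
(Hangul character) | 209 | 4.3%
(Hangul character) | 206 | 4.2%
(Hangul character) | 204 | 4.2%
(Hangul character) | 178 | 3.7%
1 | 163 | 3.3%
Other values (115) | 1840 | 37.8%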

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3020 | 62.0%
Space Separator | 1007 | 20.7%
Decimal Number | 776 | 15.9%
Dash Punctuation | 60 | 1.2%
Other Punctuation | 3 | 0.1%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%

Most frequent character per category

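Other Letter
Value | Count | Frequency (%)
(Hangul character) | 307 | 10.2%
(Hangul character) | 305 | 10.1%
(Hangul character) | 233 | 7.7%
(Hangul character) | 216 | 7.2%
(Hangul character) | 209 | 6.9%
(Hangul character) | 206 | 6.8%
(Hangul character) | 204 | 6.8%
(Hangul character) | 178 | 5.9%
(Hangul character) | 145 | 4.8%
(Hangul character) | 139 | 4.6%
Other values (100) | 878 | 29.1%

Decimal Number
Value | Count | Frequency (%)
1 | 163 | 21.0%
2 | 106 | 13.7%
3 | 102 | 13.1%
4 | 77 | 9.9%
7 | 61 | 7.9%
6 | 59 | 7.6%
5 | 56 | 7.2%
0 | 53 | 6.8%
8 | 51 | 6.6%
9 | 48 | 6.2%

Space Separator
Value | Count | Frequency (%)
(space) | 1007 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 60 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 3 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%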

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3020 | 62.0%
Common | 1848 | 38.0%

Most frequent character per script

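Hangul
Value | Count | Frequency (%)
(Hangul character) | 307 | 10.2%
(Hangul character) | 305 | 10.1%
(Hangul character) | 233 | 7.7%
(Hangul character) | 216 | 7.2%
(Hangul character) | 209 | 6.9%
(Hangul character) | 206 | 6.8%
(Hangul character) | 204 | 6.8%
(Hangul character) | 178 | 5.9%
(Hangul character) | 145 | 4.8%
(Hangul character) | 139 | 4.6%
Other values (100) | 878 | 29.1%

Common
Value | Count | Frequency (%)
(space) | 1007 | 54.5%
1 | 163 | 8.8%
2 | 106 | 5.7%
3 | 102 | 5.5%
4 | 77 | 4.2%
7 | 61 | 3.3%
- | 60 | 3.2%
6 | 59 | 3.2%
5 | 56 | 3.0%
0 | 53 | 2.9%
Other values (5) | 104 | 5.6%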

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3020 | 62.0%
ASCII | 1848 | 38.0%

Most frequent character per block

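ASCII
Value | Count | Frequency (%)
(space) | 1007 | 54.5%
1 | 163 | 8.8%
2 | 106 | 5.7%
3 | 102 | 5.5%
4 | 77 | 4.2%
7 | 61 | 3.3%
- | 60 | 3.2%
6 | 59 | 3.2%
5 | 56 | 3.0%
0 | 53 | 2.9%
Other values (5) | 104 | 5.6%

Hangul
Value | Count | Frequency (%)
(Hangul character) | 307 | 10.2%
(Hangul character) | 305 | 10.1%
(Hangul character) | 233 | 7.7%
(Hangul character) | 216 | 7.2%
(Hangul character) | 209 | 6.9%
(Hangul character) | 206 | 6.8%
(Hangul character) | 204 | 6.8%
(Hangul character) | 178 | 5.9%
(Hangul character) | 145 | 4.8%
(Hangul character) | 139 | 4.6%
Other values (100) | 878 | 29.1%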

지정일자 (designation date)
DateTime

Distinct: 161
Distinct (%): 79.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB
Minimum: 1992-12-30 00:00:00
Maximum: 2021-05-26 00:00:00

Histogram with fixed size bins (bins=50)
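
The histogram itself is not preserved here, but the idea of fixed-size bins can be sketched. The example below assumes equal-width bins spanning the minimum to the maximum date, with the maximum placed in the last bin; the helper name `fixed_bin_counts` is illustrative. It uses four dates from the Sample section and 5 bins rather than 50 to keep the output small.

```python
from datetime import date


def fixed_bin_counts(dates, bins):
    """Count dates in `bins` equal-width intervals between min and max."""
    lo, hi = min(dates), max(dates)
    span = (hi - lo).days
    counts = [0] * bins
    for d in dates:
        if span == 0:
            index = 0  # all dates identical: everything goes in the first bin
        else:
            # Integer arithmetic; the maximum is clamped into the last bin.
            index = min((d - lo).days * bins // span, bins - 1)
        counts[index] += 1
    return lo, hi, counts


sample = [
    date.fromisoformat(s)
    for s in ("2021-05-26", "2020-11-13", "2015-10-19", "2011-03-17")
]
lo, hi, counts = fixed_bin_counts(sample, bins=5)
print(lo, hi, counts)
```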

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
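
The per-column nullity behind these plots can be computed directly. The sketch below treats `None` and the empty string as missing, which is an assumption about how missing cells are encoded; the helper name is illustrative, and the two rows come from the Sample section.

```python
COLUMNS = ["구분", "업소명", "업소주소", "지정일자"]


def nullity_by_column(rows, columns):
    """Count cells per column that are None, absent, or an empty string."""
    return {
        col: sum(1 for row in rows if row.get(col) in (None, ""))
        for col in columns
    }


rows = [
    {"구분": "일반소매인", "업소명": "정상매점",
     "업소주소": "전라남도 영광군 백수읍 대신리 162-1", "지정일자": "2020-10-12"},
    {"구분": "구내소매인", "업소명": "마트넷",
     "업소주소": "전라남도 영광군 영광읍 도동리 167번지 3호", "지정일자": "2018-08-16"},
]
missing = nullity_by_column(rows, COLUMNS)
total_missing = sum(missing.values())
print(missing, total_missing)
```

For the full dataset the report gives 0 missing cells, so every column count would be zero.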

Sample

Index | 구분 | 업소명 | 업소주소 | 지정일자
0 | 일반소매인 | 미니스톱 영광도동점 | 전라남도 영광군 영광읍 도동리 360 도동휴먼시아 | 2021-05-26
1 | 일반소매인 | 설도젓갈타운매점 | 전라남도 영광군 염산면 봉남리 695-20 | 2021-05-14
2 | 일반소매인 | 지에스25 법성포 터미널점 | 전라남도 영광군 법성면 법성리 1229 | 2020-11-13
3 | 일반소매인 | 이마트24R 영광중앙점 | 전라남도 영광군 영광읍 남천리 251 | 2020-11-04
4 | 일반소매인 | 정상매점 | 전라남도 영광군 백수읍 대신리 162-1 | 2020-10-12
5 | 일반소매인 | 씨유(CU)영광대마산단점 | 전라남도 영광군 대마면 송죽리 1034-1 | 2020-09-23
6 | 일반소매인 | 조은마트 | 전라남도 영광군 영광읍 녹사리 23-7 | 2020-09-08
7 | 일반소매인 | 농업회사법인(유)보리올팜 | 전라남도 영광군 대마면 월산리 411-10 | 2020-08-24
8 | 일반소매인 | 이마트24 R대마산단점 | 전라남도 영광군 대마면 송죽리 1039-6 | 2020-07-27
9 | 일반소매인 | 해빛 법성점 | 전라남도 영광군 법성면 법성리 1225번지 4호 | 2020-03-20

Index | 구분 | 업소명 | 업소주소 | 지정일자
192 | 구내소매인 | 마트넷 | 전라남도 영광군 영광읍 도동리 167번지 3호 | 2018-08-16
193 | 구내소매인 | (주)롯데슈퍼영광홍농가맹점 | 전라남도 영광군 홍농읍 상하리 130번지 | 2018-05-28
194 | 구내소매인 | 코끼리마트 | 전라남도 영광군 백수읍 백수로 826 | 2015-10-19
195 | 구내소매인 | 수협바다마트 | 전라남도 영광군 법성면 굴비로1길 112 | 2014-06-19
196 | 구내소매인 | (주)동부유통 | 전라남도 영광군 영광읍 물무로 214 | 2014-05-08
197 | 구내소매인 | 군내매점 | 전라남도 영광군 영광읍 신하리 6번지 | 2013-08-28
198 | 구내소매인 | 터미널편의점 | 전라남도 영광군 영광읍 신하리 10번지 1호 | 2013-02-08
199 | 구내소매인 | (유)영광파머스마켓 | 전라남도 영광군 영광읍 신하리 829번지 2호 | 2012-11-27
200 | 구내소매인 | 마운틴마트 | 전라남도 영광군 법성면 법성리 602번지 38호 | 2011-08-04
201 | 구내소매인 | 영광농협 하나로마트 | 전라남도 영광군 영광읍 백학리 90번지 1호 | 2011-03-17