Overview

Dataset statistics

Number of variables4
Number of observations206
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.6 KiB
Average record size in memory32.6 B

Variable types

Text3
Categorical1

Dataset

Description담배소매인 지정현황(지정번호, 업소명, 주소 등)
Author경상북도 성주군
URLhttps://www.data.go.kr/data/15031449/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
지정번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:38:12.519527
Analysis finished2023-12-12 20:38:12.978969
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지정번호
Text

UNIQUE 

Distinct206
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T05:38:13.179560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length23
Mean length23
Min length23

Characters and Unicode

Total characters4738
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique206 ?
Unique (%)100.0%

Sample

1st row2018-5210082-05-6-00011
2nd row2018-5210082-05-6-00010
3rd row2018-5210082-05-6-00009
4th row2018-5210082-05-6-00008
5th row2018-5210082-05-6-00007
ValueCountFrequency (%)
2018-5210082-05-6-00011 1
 
0.5%
2007-5210060-05-6-00012 1
 
0.5%
2007-5210060-05-6-00003 1
 
0.5%
2008-5210060-05-6-00019 1
 
0.5%
2008-5210060-05-6-00018 1
 
0.5%
2008-5210060-05-6-00016 1
 
0.5%
2008-5210060-05-6-00015 1
 
0.5%
2008-5210060-05-6-00012 1
 
0.5%
2008-5210060-05-6-00009 1
 
0.5%
2008-5210060-05-6-00008 1
 
0.5%
Other values (196) 196
95.1%
2023-12-13T05:38:13.670913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1743
36.8%
- 824
17.4%
2 553
 
11.7%
5 485
 
10.2%
1 443
 
9.3%
6 324
 
6.8%
8 128
 
2.7%
9 94
 
2.0%
3 56
 
1.2%
7 48
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3914
82.6%
Dash Punctuation 824
 
17.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1743
44.5%
2 553
 
14.1%
5 485
 
12.4%
1 443
 
11.3%
6 324
 
8.3%
8 128
 
3.3%
9 94
 
2.4%
3 56
 
1.4%
7 48
 
1.2%
4 40
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 824
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4738
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1743
36.8%
- 824
17.4%
2 553
 
11.7%
5 485
 
10.2%
1 443
 
9.3%
6 324
 
6.8%
8 128
 
2.7%
9 94
 
2.0%
3 56
 
1.2%
7 48
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4738
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1743
36.8%
- 824
17.4%
2 553
 
11.7%
5 485
 
10.2%
1 443
 
9.3%
6 324
 
6.8%
8 128
 
2.7%
9 94
 
2.0%
3 56
 
1.2%
7 48
 
1.0%
Distinct202
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T05:38:14.015419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length6.5145631
Min length2

Characters and Unicode

Total characters1342
Distinct characters268
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique198 ?
Unique (%)96.1%

Sample

1st rowCU성주선남점
2nd row용신공단식당
3rd row세븐일레븐성주산업단지점
4th row담배
5th row세븐일레븐 성주용암점
ValueCountFrequency (%)
gs25 7
 
2.8%
씨유 6
 
2.4%
성주점 4
 
1.6%
구판장 4
 
1.6%
세븐일레븐 3
 
1.2%
백운슈퍼 2
 
0.8%
이마트24 2
 
0.8%
대성슈퍼 2
 
0.8%
성주용암점 2
 
0.8%
하나로마트 2
 
0.8%
Other values (213) 217
86.5%
2023-12-13T05:38:14.511037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
56
 
4.2%
55
 
4.1%
46
 
3.4%
45
 
3.4%
30
 
2.2%
29
 
2.2%
29
 
2.2%
27
 
2.0%
23
 
1.7%
22
 
1.6%
Other values (258) 980
73.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1207
89.9%
Space Separator 45
 
3.4%
Decimal Number 31
 
2.3%
Uppercase Letter 29
 
2.2%
Open Punctuation 14
 
1.0%
Close Punctuation 14
 
1.0%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
56
 
4.6%
55
 
4.6%
46
 
3.8%
30
 
2.5%
29
 
2.4%
29
 
2.4%
27
 
2.2%
23
 
1.9%
22
 
1.8%
20
 
1.7%
Other values (239) 870
72.1%
Uppercase Letter
ValueCountFrequency (%)
S 9
31.0%
G 9
31.0%
C 5
17.2%
D 2
 
6.9%
U 1
 
3.4%
T 1
 
3.4%
K 1
 
3.4%
O 1
 
3.4%
Decimal Number
ValueCountFrequency (%)
2 13
41.9%
5 10
32.3%
1 4
 
12.9%
4 2
 
6.5%
9 1
 
3.2%
8 1
 
3.2%
Other Punctuation
ValueCountFrequency (%)
/ 1
50.0%
. 1
50.0%
Space Separator
ValueCountFrequency (%)
45
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1207
89.9%
Common 106
 
7.9%
Latin 29
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
56
 
4.6%
55
 
4.6%
46
 
3.8%
30
 
2.5%
29
 
2.4%
29
 
2.4%
27
 
2.2%
23
 
1.9%
22
 
1.8%
20
 
1.7%
Other values (239) 870
72.1%
Common
ValueCountFrequency (%)
45
42.5%
( 14
 
13.2%
) 14
 
13.2%
2 13
 
12.3%
5 10
 
9.4%
1 4
 
3.8%
4 2
 
1.9%
9 1
 
0.9%
8 1
 
0.9%
/ 1
 
0.9%
Latin
ValueCountFrequency (%)
S 9
31.0%
G 9
31.0%
C 5
17.2%
D 2
 
6.9%
U 1
 
3.4%
T 1
 
3.4%
K 1
 
3.4%
O 1
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1207
89.9%
ASCII 135
 
10.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
56
 
4.6%
55
 
4.6%
46
 
3.8%
30
 
2.5%
29
 
2.4%
29
 
2.4%
27
 
2.2%
23
 
1.9%
22
 
1.8%
20
 
1.7%
Other values (239) 870
72.1%
ASCII
ValueCountFrequency (%)
45
33.3%
( 14
 
10.4%
) 14
 
10.4%
2 13
 
9.6%
5 10
 
7.4%
S 9
 
6.7%
G 9
 
6.7%
C 5
 
3.7%
1 4
 
3.0%
D 2
 
1.5%
Other values (9) 10
 
7.4%

주소
Text

Distinct145
Distinct (%)70.4%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T05:38:14.924347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length32
Mean length15.88835
Min length1

Characters and Unicode

Total characters3273
Distinct characters117
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique141 ?
Unique (%)68.4%

Sample

1st row경상북도 성주군 선남면 나선로 1027
2nd row경상북도 성주군 선남면 용신공단길 10-9
3rd row경상북도 성주군 성주읍 성주산업단지로3길 76-23
4th row경상북도 성주군 용암면 문명6길 19-4
5th row경상북도 성주군 용암면 상성로 3
ValueCountFrequency (%)
경상북도 147
19.7%
성주군 147
19.7%
성주읍 45
 
6.0%
선남면 29
 
3.9%
성주로 23
 
3.1%
초전면 17
 
2.3%
가천면 11
 
1.5%
금수면 10
 
1.3%
월항면 10
 
1.3%
수륜면 8
 
1.1%
Other values (202) 300
40.2%
2023-12-13T05:38:15.533320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
660
20.2%
260
 
7.9%
253
 
7.7%
153
 
4.7%
151
 
4.6%
150
 
4.6%
147
 
4.5%
147
 
4.5%
1 108
 
3.3%
102
 
3.1%
Other values (107) 1142
34.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2079
63.5%
Space Separator 660
 
20.2%
Decimal Number 483
 
14.8%
Dash Punctuation 36
 
1.1%
Close Punctuation 6
 
0.2%
Open Punctuation 6
 
0.2%
Other Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
260
12.5%
253
12.2%
153
 
7.4%
151
 
7.3%
150
 
7.2%
147
 
7.1%
147
 
7.1%
102
 
4.9%
91
 
4.4%
67
 
3.2%
Other values (92) 558
26.8%
Decimal Number
ValueCountFrequency (%)
1 108
22.4%
2 72
14.9%
3 69
14.3%
0 42
 
8.7%
9 39
 
8.1%
8 37
 
7.7%
4 35
 
7.2%
7 29
 
6.0%
5 28
 
5.8%
6 24
 
5.0%
Space Separator
ValueCountFrequency (%)
660
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 36
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2079
63.5%
Common 1194
36.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
260
12.5%
253
12.2%
153
 
7.4%
151
 
7.3%
150
 
7.2%
147
 
7.1%
147
 
7.1%
102
 
4.9%
91
 
4.4%
67
 
3.2%
Other values (92) 558
26.8%
Common
ValueCountFrequency (%)
660
55.3%
1 108
 
9.0%
2 72
 
6.0%
3 69
 
5.8%
0 42
 
3.5%
9 39
 
3.3%
8 37
 
3.1%
- 36
 
3.0%
4 35
 
2.9%
7 29
 
2.4%
Other values (5) 67
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2079
63.5%
ASCII 1194
36.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
660
55.3%
1 108
 
9.0%
2 72
 
6.0%
3 69
 
5.8%
0 42
 
3.5%
9 39
 
3.3%
8 37
 
3.1%
- 36
 
3.0%
4 35
 
2.9%
7 29
 
2.4%
Other values (5) 67
 
5.6%
Hangul
ValueCountFrequency (%)
260
12.5%
253
12.2%
153
 
7.4%
151
 
7.3%
150
 
7.2%
147
 
7.1%
147
 
7.1%
102
 
4.9%
91
 
4.4%
67
 
3.2%
Other values (92) 558
26.8%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2018-07-27
206 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018-07-27
2nd row2018-07-27
3rd row2018-07-27
4th row2018-07-27
5th row2018-07-27

Common Values

ValueCountFrequency (%)
2018-07-27 206
100.0%

Length

2023-12-13T05:38:15.671158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:38:15.767761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2018-07-27 206
100.0%

Missing values

2023-12-13T05:38:12.819128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:38:12.932972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지정번호업소명주소데이터기준일자
02018-5210082-05-6-00011CU성주선남점경상북도 성주군 선남면 나선로 10272018-07-27
12018-5210082-05-6-00010용신공단식당경상북도 성주군 선남면 용신공단길 10-92018-07-27
22018-5210082-05-6-00009세븐일레븐성주산업단지점경상북도 성주군 성주읍 성주산업단지로3길 76-232018-07-27
32018-5210082-05-6-00008담배경상북도 성주군 용암면 문명6길 19-42018-07-27
42018-5210082-05-6-00007세븐일레븐 성주용암점경상북도 성주군 용암면 상성로 32018-07-27
52018-5210082-05-6-00006용암부동산컨설팅경상북도 성주군 용암면 운용로 10302018-07-27
62018-5210082-05-6-00005성주공단할인마트경상북도 성주군 성주읍 성주산업단지로2길 164-12018-07-27
72018-5210082-05-6-00004시민슈퍼경상북도 성주군 선남면 성주로 35442018-07-27
82018-5210082-05-6-00003성주(양평)휴게소경상북도 성주군 초전면 중부내륙고속도로 982018-07-27
92018-5210082-05-6-00002성주(창원)휴게소경상북도 성주군 초전면 중부내륙고속도로 972018-07-27
지정번호업소명주소데이터기준일자
1962000-5210009-05-6-00055부강식품2018-07-27
1972000-5210009-05-6-00050이임선 담배소매인2018-07-27
1982000-5210009-05-6-00049정상컴퓨터시스템2018-07-27
1992000-5210009-05-6-00029정류소매점2018-07-27
2002000-5210009-05-6-00021신안상회2018-07-27
2012000-5210009-05-6-00018경북지업사2018-07-27
2022000-5210009-05-6-00002태양슈퍼2018-07-27
2031998-5210060-05-6-00000성산상회2018-07-27
2041996-5210060-05-6-11111수촌농약사경상북도 성주군 벽진면 벽봉로 612018-07-27
2051970-5210060-05-6-00001정상컴퓨터시스템경상북도 성주군 성주읍 성주읍4길 10-12018-07-27