Overview

Dataset statistics

Number of variables: 3
Number of observations: 916
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 21.6 KiB
Average record size in memory: 24.1 B
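
The derived figures in this block follow from the raw counts by simple arithmetic. The sketch below re-derives two of them; the helper names `average_record_size` and `percentage` are illustrative and are not part of ydata-profiling.

```python
def average_record_size(total_kib: float, n_rows: int) -> float:
    """Average bytes per row, given the total size in KiB (1 KiB = 1024 B)."""
    return total_kib * 1024 / n_rows


def percentage(part: int, whole: int) -> float:
    """Share of `part` in `whole`, expressed as a percentage."""
    return 100.0 * part / whole if whole else 0.0


n_variables, n_observations = 3, 916
total_cells = n_variables * n_observations  # every row contributes one cell per variable

print(round(average_record_size(21.6, n_observations), 1))  # average record size in bytes
print(percentage(0, total_cells))                           # missing cells (%)
```

Because the reported total (21.6 KiB) is itself rounded, the recomputed average is only expected to agree with the report to one decimal place.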

Variable types

Text: 2
DateTime: 1

Dataset

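Description: Public data on designated tobacco retailers registered and operating in Yangsan City. It shows current details such as business name, location address, and eup/myeon/dong (town, township, or neighborhood).
Author: Yangsan City, Gyeongsangnam-do (경상남도 양산시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3040404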

Alerts

기준일 (reference date) has constant value "2024-03-08" (alert type: Constant)
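
A Constant alert means every observation in the column carries the same value. The check can be sketched as follows, assuming missing values are represented as `None`; the function name `is_constant` is hypothetical, and the exact rule ydata-profiling applies may differ.

```python
from datetime import date


def is_constant(values) -> bool:
    """True when all non-missing values in the column are identical."""
    non_missing = [v for v in values if v is not None]
    return len(set(non_missing)) == 1


# 916 observations, all dated 2024-03-08 (the column's reported minimum and maximum).
reference_dates = [date(2024, 3, 8)] * 916
print(is_constant(reference_dates))
```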

Reproduction

Analysis started: 2024-04-17 12:43:35.689454
Analysis finished: 2024-04-17 12:43:36.104260
Duration: 0.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업소명 (business name)
Text

Distinct: 902
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 7.3 KiB

Length

Max length: 28
Median length: 19
Mean length: 8.7478166
Min length: 2
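
The length statistics are taken over the character count of each value. A minimal sketch, applied to the five sample values of this variable (so the output describes the sample, not the full column), is shown below; `length_stats` is an illustrative name.

```python
from statistics import mean, median


def length_stats(values):
    """Max, median, mean and min of the character lengths of `values`."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


sample = [
    "씨유 가촌대로점",
    "지에스(GS)25북정동원점",
    "고성상회",
    "세븐일레븐 양산뉴북부로점",
    "서원할인마트",
]
stats = length_stats(sample)
print(stats)

# The reported mean equals total characters divided by observations.
print(round(8013 / 916, 7))
```

Python's `len` counts code points, so each precomposed Hangul syllable counts as one character.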

Characters and Unicode

Total characters: 8013
Distinct characters: 495
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
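
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using one of the sample values. The two-letter codes correspond to the category names used in this report (for example `Lo` = Other Letter, `Lu` = Uppercase Letter, `Nd` = Decimal Number, `Ps`/`Pe` = Open/Close Punctuation, `Zs` = Space Separator). The standard library does not expose script or block properties, so only categories are shown.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


counts = category_counts("지에스(GS)25북정동원점")
for code, n in counts.most_common():
    print(code, n)
```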

Unique

Unique: 892
Unique (%): 97.4%
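
Distinct and Unique measure different things: Distinct counts how many different values occur, while Unique counts the values that occur exactly once. A small sketch with made-up values (the list `names` is purely illustrative):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


names = ["a", "b", "b", "c", "c", "c"]  # hypothetical data
print(distinct_and_unique(names))

# Percentages are taken relative to the 916 observations.
print(round(100 * 902 / 916, 1), round(100 * 892 / 916, 1))
```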

Sample

1st row: 씨유 가촌대로점
2nd row: 지에스(GS)25북정동원점
3rd row: 고성상회
4th row: 세븐일레븐 양산뉴북부로점
5th row: 서원할인마트
Value               Count  Frequency (%)
씨유                64     4.8%
세븐일레븐          58     4.4%
지에스(gs)25        49     3.7%
이마트24            41     3.1%
지에스25            17     1.3%
주식회사            16     1.2%
주)코리아세븐       14     1.1%
gs25                12     0.9%
위드미              8      0.6%
양산점              7      0.5%
Other values (962)  1040   78.4%
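
The entries in the word table (for example the lowercase `gs25` and the truncated `주)코리아세븐`) are consistent with a tokenization that lowercases each value, splits on whitespace, and strips punctuation from both ends of each token. The sketch below works under that assumption and is not necessarily what the profiler does internally. The last two input strings are made up for illustration.

```python
import string
from collections import Counter


def word_counts(values):
    """Count lowercase, whitespace-separated tokens with edge punctuation stripped."""
    strip_chars = string.punctuation + string.whitespace
    words = []
    for value in values:
        for token in value.lower().split():
            token = token.strip(strip_chars)
            if token:  # drop tokens that consisted only of punctuation
                words.append(token)
    return Counter(words)


sample = [
    "씨유 가촌대로점",
    "씨유 서창타운점",
    "(주)코리아세븐 양산점",  # hypothetical value
    "GS25 양산점",            # hypothetical value
]
counts = word_counts(sample)
total = sum(counts.values())
for word, n in counts.most_common(3):
    print(word, n, f"{100 * n / total:.1f}%")
```

Frequencies are relative to the total number of tokens, which is why they do not sum against the 916 observations.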

Most occurring characters

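Value                Count  Frequency (%)
(character missing)  460    5.7%
(space)              410    5.1%
(character missing)  390    4.9%
(character missing)  323    4.0%
(character missing)  184    2.3%
2                    179    2.2%
(character missing)  167    2.1%
(character missing)  164    2.0%
)                    146    1.8%
(                    146    1.8%
Other values (485)   5444   67.9%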

Most occurring categories

Value              Count  Frequency (%)
Other Letter       6584   82.2%
Space Separator    410    5.1%
Decimal Number     383    4.8%
Uppercase Letter   320    4.0%
Close Punctuation  146    1.8%
Open Punctuation   146    1.8%
Lowercase Letter   15     0.2%
Other Punctuation  8      0.1%
Dash Punctuation   1      < 0.1%

Most frequent character per category

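Other Letter
Value                Count  Frequency (%)
(character missing)  460    7.0%
(character missing)  390    5.9%
(character missing)  323    4.9%
(character missing)  184    2.8%
(character missing)  167    2.5%
(character missing)  164    2.5%
(character missing)  142    2.2%
(character missing)  136    2.1%
(character missing)  132    2.0%
(character missing)  121    1.8%
Other values (433)   4365   66.3%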

Uppercase Letter
Value              Count  Frequency (%)
S                  102    31.9%
G                  98     30.6%
C                  16     5.0%
H                  14     4.4%
L                  12     3.8%
U                  11     3.4%
E                  9      2.8%
R                  9      2.8%
T                  6      1.9%
P                  6      1.9%
Other values (12)  37     11.6%

Lowercase Letter
Value  Count  Frequency (%)
o      2      13.3%
e      2      13.3%
i      2      13.3%
n      2      13.3%
m      1      6.7%
s      1      6.7%
h      1      6.7%
a      1      6.7%
p      1      6.7%
d      1      6.7%

Decimal Number
Value  Count  Frequency (%)
2      179    46.7%
5      128    33.4%
4      52     13.6%
1      9      2.3%
3      6      1.6%
7      3      0.8%
6      3      0.8%
0      1      0.3%
9      1      0.3%
8      1      0.3%

Other Punctuation
Value  Count  Frequency (%)
&      3      37.5%
.      2      25.0%
'      1      12.5%
/      1      12.5%
#      1      12.5%

Space Separator
Value    Count  Frequency (%)
(space)  410    100.0%

Close Punctuation
Value  Count  Frequency (%)
)      146    100.0%

Open Punctuation
Value  Count  Frequency (%)
(      146    100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      1      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  6582   82.1%
Common  1094   13.7%
Latin   335    4.2%
Han     2      < 0.1%

Most frequent character per script

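Hangul
Value                Count  Frequency (%)
(character missing)  460    7.0%
(character missing)  390    5.9%
(character missing)  323    4.9%
(character missing)  184    2.8%
(character missing)  167    2.5%
(character missing)  164    2.5%
(character missing)  142    2.2%
(character missing)  136    2.1%
(character missing)  132    2.0%
(character missing)  121    1.8%
Other values (431)   4363   66.3%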

Latin
Value              Count  Frequency (%)
S                  102    30.4%
G                  98     29.3%
C                  16     4.8%
H                  14     4.2%
L                  12     3.6%
U                  11     3.3%
E                  9      2.7%
R                  9      2.7%
T                  6      1.8%
P                  6      1.8%
Other values (23)  52     15.5%

Common
Value             Count  Frequency (%)
(space)           410    37.5%
2                 179    16.4%
)                 146    13.3%
(                 146    13.3%
5                 128    11.7%
4                 52     4.8%
1                 9      0.8%
3                 6      0.5%
7                 3      0.3%
&                 3      0.3%
Other values (9)  12     1.1%

Han
Value                Count  Frequency (%)
(character missing)  1      50.0%
(character missing)  1      50.0%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  6582   82.1%
ASCII   1429   17.8%
CJK     2      < 0.1%

Most frequent character per block

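Hangul
Value                Count  Frequency (%)
(character missing)  460    7.0%
(character missing)  390    5.9%
(character missing)  323    4.9%
(character missing)  184    2.8%
(character missing)  167    2.5%
(character missing)  164    2.5%
(character missing)  142    2.2%
(character missing)  136    2.1%
(character missing)  132    2.0%
(character missing)  121    1.8%
Other values (431)   4363   66.3%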

ASCII
Value              Count  Frequency (%)
(space)            410    28.7%
2                  179    12.5%
)                  146    10.2%
(                  146    10.2%
5                  128    9.0%
S                  102    7.1%
G                  98     6.9%
4                  52     3.6%
C                  16     1.1%
H                  14     1.0%
Other values (42)  138    9.7%

CJK
Value                Count  Frequency (%)
(character missing)  1      50.0%
(character missing)  1      50.0%

업소 소재지 (business address)
Text

Distinct: 913
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Memory size: 7.3 KiB

Length

Max length: 60
Median length: 53
Mean length: 27.983624
Min length: 15

Characters and Unicode

Total characters: 25633
Distinct characters: 334
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 910
Unique (%): 99.3%

Sample

1st row: 경상남도 양산시 물금읍 가촌로 111
2nd row: 경상남도 양산시 북정로 86. 105동 101호 (북정동. 양산북정2차대동아파트)
3rd row: 경상남도 양산시 원동면 원동로 1652
4th row: 경상남도 양산시 중앙우회로 143. 1층 (북부동)
5th row: 경상남도 양산시 교동1길 59-2 (교동)
Value                Count  Frequency (%)
경상남도             916    16.4%
양산시               916    16.4%
물금읍               215    3.9%
1층                  202    3.6%
동면                 79     1.4%
삼호동               75     1.3%
101호                72     1.3%
상북면               60     1.1%
중부동               56     1.0%
평산동               50     0.9%
Other values (1197)  2939   52.7%

Most occurring characters

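Value                Count  Frequency (%)
(space)              4727   18.4%
1                    1377   5.4%
(character missing)  1216   4.7%
(character missing)  1123   4.4%
(character missing)  1060   4.1%
(character missing)  1000   3.9%
(character missing)  958    3.7%
(character missing)  950    3.7%
(character missing)  921    3.6%
(character missing)  837    3.3%
Other values (324)   11464  44.7%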

Most occurring categories

Value              Count  Frequency (%)
Other Letter       15029  58.6%
Space Separator    4727   18.4%
Decimal Number     4038   15.8%
Close Punctuation  552    2.2%
Open Punctuation   552    2.2%
Other Punctuation  544    2.1%
Dash Punctuation   140    0.5%
Uppercase Letter   46     0.2%
Math Symbol        3      < 0.1%
Lowercase Letter   2      < 0.1%

Most frequent character per category

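Other Letter
Value                Count  Frequency (%)
(character missing)  1216   8.1%
(character missing)  1123   7.5%
(character missing)  1060   7.1%
(character missing)  1000   6.7%
(character missing)  958    6.4%
(character missing)  950    6.3%
(character missing)  921    6.1%
(character missing)  837    5.6%
(character missing)  532    3.5%
(character missing)  510    3.4%
Other values (293)   5922   39.4%

Uppercase Letter
Value             Count  Frequency (%)
B                 13     28.3%
A                 9      19.6%
C                 6      13.0%
K                 3      6.5%
L                 3      6.5%
M                 3      6.5%
G                 2      4.3%
D                 2      4.3%
T                 1      2.2%
P                 1      2.2%
Other values (3)  3      6.5%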

Decimal Number
Value  Count  Frequency (%)
1      1377   34.1%
0      479    11.9%
2      448    11.1%
3      364    9.0%
4      281    7.0%
5      264    6.5%
7      241    6.0%
6      233    5.8%
8      184    4.6%
9      167    4.1%

Other Punctuation
Value  Count  Frequency (%)
.      542    99.6%
&      2      0.4%

Space Separator
Value    Count  Frequency (%)
(space)  4727   100.0%

Close Punctuation
Value  Count  Frequency (%)
)      552    100.0%

Open Punctuation
Value  Count  Frequency (%)
(      552    100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      140    100.0%

Math Symbol
Value  Count  Frequency (%)
~      3      100.0%

Lowercase Letter
Value  Count  Frequency (%)
e      2      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  15029  58.6%
Common  10556  41.2%
Latin   48     0.2%

Most frequent character per script

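Hangul
Value                Count  Frequency (%)
(character missing)  1216   8.1%
(character missing)  1123   7.5%
(character missing)  1060   7.1%
(character missing)  1000   6.7%
(character missing)  958    6.4%
(character missing)  950    6.3%
(character missing)  921    6.1%
(character missing)  837    5.6%
(character missing)  532    3.5%
(character missing)  510    3.4%
Other values (293)   5922   39.4%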

Common
Value             Count  Frequency (%)
(space)           4727   44.8%
1                 1377   13.0%
)                 552    5.2%
(                 552    5.2%
.                 542    5.1%
0                 479    4.5%
2                 448    4.2%
3                 364    3.4%
4                 281    2.7%
5                 264    2.5%
Other values (7)  970    9.2%

Latin
Value             Count  Frequency (%)
B                 13     27.1%
A                 9      18.8%
C                 6      12.5%
K                 3      6.2%
L                 3      6.2%
M                 3      6.2%
e                 2      4.2%
G                 2      4.2%
D                 2      4.2%
T                 1      2.1%
Other values (4)  4      8.3%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  15029  58.6%
ASCII   10604  41.4%

Most frequent character per block

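ASCII
Value              Count  Frequency (%)
(space)            4727   44.6%
1                  1377   13.0%
)                  552    5.2%
(                  552    5.2%
.                  542    5.1%
0                  479    4.5%
2                  448    4.2%
3                  364    3.4%
4                  281    2.6%
5                  264    2.5%
Other values (21)  1018   9.6%

Hangul
Value                Count  Frequency (%)
(character missing)  1216   8.1%
(character missing)  1123   7.5%
(character missing)  1060   7.1%
(character missing)  1000   6.7%
(character missing)  958    6.4%
(character missing)  950    6.3%
(character missing)  921    6.1%
(character missing)  837    5.6%
(character missing)  532    3.5%
(character missing)  510    3.4%
Other values (293)   5922   39.4%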

기준일
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 7.3 KiB
Minimum: 2024-03-08 00:00:00
Maximum: 2024-03-08 00:00:00
Histogram with fixed size bins (bins=1)

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
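
The numbers behind a nullity chart are simply per-column counts of non-missing values. A minimal sketch, assuming rows are dictionaries and that `None` or an empty string marks a missing cell (both assumptions, as is the function name `non_missing_counts`):

```python
def non_missing_counts(rows, columns):
    """Count, per column, the rows whose value is neither None nor an empty string."""
    return {
        col: sum(1 for row in rows if row.get(col) not in (None, ""))
        for col in columns
    }


columns = ["업소명", "업소 소재지", "기준일"]
rows = [
    {"업소명": "씨유 가촌대로점", "업소 소재지": "경상남도 양산시 물금읍 가촌로 111", "기준일": "2024-03-08"},
    {"업소명": "고성상회", "업소 소재지": "경상남도 양산시 원동면 원동로 1652", "기준일": "2024-03-08"},
]
counts = non_missing_counts(rows, columns)
missing_cells = len(rows) * len(columns) - sum(counts.values())
print(counts, missing_cells)
```

For this dataset every column is fully populated, so each count equals the number of observations and the missing-cell total is zero.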

Sample

Row  업소명 (business name)  업소 소재지 (business address)  기준일 (reference date)
0    씨유 가촌대로점  경상남도 양산시 물금읍 가촌로 111  2024-03-08
1    지에스(GS)25북정동원점  경상남도 양산시 북정로 86. 105동 101호 (북정동. 양산북정2차대동아파트)  2024-03-08
2    고성상회  경상남도 양산시 원동면 원동로 1652  2024-03-08
3    세븐일레븐 양산뉴북부로점  경상남도 양산시 중앙우회로 143. 1층 (북부동)  2024-03-08
4    서원할인마트  경상남도 양산시 교동1길 59-2 (교동)  2024-03-08
5    양산전자담배  경상남도 양산시 양산역로 103. 골든세븐 1층 103호 (중부동)  2024-03-08
6    지에스25 양산소주공단점  경상남도 양산시 주남로 47 (주남동)  2024-03-08
7    씨유 서창타운점  경상남도 양산시 삼호동부로 162. 1층 (삼호동)  2024-03-08
8    씨유뉴소주공단점  경상남도 양산시 주남로 16. 1층 (주남동)  2024-03-08
9    디에스푸드앤마트  경상남도 양산시 신명로 39 (평산동)  2024-03-08

Row  업소명 (business name)  업소 소재지 (business address)  기준일 (reference date)
906  현대백화점  경상남도 양산시 북부동 437-19호  2024-03-08
907  고려트레필알베드  경상남도 양산시 유산공단7길 15 (유산동)  2024-03-08
908  롯데칠성음료(주)  경상남도 양산시 양산대로 1025 (북정동)  2024-03-08
909  엘지전자부품새마을금고  경상남도 양산시 북정동 191호  2024-03-08
910  재흥슈퍼  경상남도 양산시 북안남3길 39 (북부동)  2024-03-08
911  민마우트 양산대리점  경상남도 양산시 북부동 422-4호  2024-03-08
912  단골상회  경상남도 양산시 삼일로 120 (중부동)  2024-03-08
913  담배  경상남도 양산시 신기6길 7 (신기동)  2024-03-08
914  물금농협연쇄점  경상남도 양산시 물금읍 물금중앙길 4  2024-03-08
915  롯데칠성음료신협양산분점  경상남도 양산시 양산대로 1060 (북정동)  2024-03-08