Overview

Dataset statistics

Number of variables4
Number of observations990
Missing cells517
Missing cells (%)13.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory32.0 KiB
Average record size in memory33.1 B

Variable types

Numeric1
Text3

Dataset

Description양산시에 등록되어 운영중인 담배소매인 지정 공공데이터입니다. 업소명 및, 소재지주소, 읍면동 등 현황을 확인할 수 있습니다.
Author경상남도 양산시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3040404

Alerts

업소전화번호 has 517 (52.2%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2024-04-17 12:43:40.199218
Analysis finished2024-04-17 12:43:40.720313
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct990
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean495.5
Minimum1
Maximum990
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.8 KiB
2024-04-17T21:43:40.779319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile50.45
Q1248.25
median495.5
Q3742.75
95-th percentile940.55
Maximum990
Range989
Interquartile range (IQR)494.5

Descriptive statistics

Standard deviation285.93268
Coefficient of variation (CV)0.5770589
Kurtosis-1.2
Mean495.5
Median Absolute Deviation (MAD)247.5
Skewness0
Sum490545
Variance81757.5
MonotonicityStrictly increasing
2024-04-17T21:43:40.886169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
652 1
 
0.1%
654 1
 
0.1%
655 1
 
0.1%
656 1
 
0.1%
657 1
 
0.1%
658 1
 
0.1%
659 1
 
0.1%
660 1
 
0.1%
661 1
 
0.1%
Other values (980) 980
99.0%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
990 1
0.1%
989 1
0.1%
988 1
0.1%
987 1
0.1%
986 1
0.1%
985 1
0.1%
984 1
0.1%
983 1
0.1%
982 1
0.1%
981 1
0.1%
Distinct911
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2024-04-17T21:43:41.104566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length22
Mean length7.3909091
Min length2

Characters and Unicode

Total characters7317
Distinct characters489
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique888 ?
Unique (%)89.7%

Sample

1st row파이디(P.I.E.D) 뉴턴 전자담배 양산 물금 증산 가촌점
2nd row상생부동산
3rd row핸즈독(HANDSDOG)
4th row세븐공인중개사사무소
5th row지에스25양산청운점
ValueCountFrequency (%)
담배 53
 
4.2%
세븐일레븐 29
 
2.3%
지에스(gs)25 26
 
2.0%
씨유 26
 
2.0%
gs25 22
 
1.7%
위드미 21
 
1.6%
미니스톱 13
 
1.0%
이마트24 12
 
0.9%
양산점 8
 
0.6%
주식회사 7
 
0.5%
Other values (964) 1057
83.0%
2024-04-17T21:43:41.437476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
369
 
5.0%
316
 
4.3%
284
 
3.9%
241
 
3.3%
180
 
2.5%
176
 
2.4%
2 130
 
1.8%
126
 
1.7%
) 107
 
1.5%
( 107
 
1.5%
Other values (479) 5281
72.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6219
85.0%
Space Separator 284
 
3.9%
Decimal Number 279
 
3.8%
Uppercase Letter 264
 
3.6%
Close Punctuation 107
 
1.5%
Open Punctuation 107
 
1.5%
Lowercase Letter 48
 
0.7%
Other Punctuation 7
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
369
 
5.9%
316
 
5.1%
241
 
3.9%
180
 
2.9%
176
 
2.8%
126
 
2.0%
103
 
1.7%
98
 
1.6%
87
 
1.4%
84
 
1.4%
Other values (426) 4439
71.4%
Uppercase Letter
ValueCountFrequency (%)
G 93
35.2%
S 86
32.6%
C 14
 
5.3%
L 11
 
4.2%
I 10
 
3.8%
U 8
 
3.0%
D 7
 
2.7%
O 6
 
2.3%
H 5
 
1.9%
V 4
 
1.5%
Other values (9) 20
 
7.6%
Lowercase Letter
ValueCountFrequency (%)
e 12
25.0%
r 4
 
8.3%
t 4
 
8.3%
o 4
 
8.3%
n 3
 
6.2%
a 3
 
6.2%
f 3
 
6.2%
c 3
 
6.2%
m 2
 
4.2%
h 2
 
4.2%
Other values (8) 8
16.7%
Decimal Number
ValueCountFrequency (%)
2 130
46.6%
5 106
38.0%
4 22
 
7.9%
1 7
 
2.5%
3 5
 
1.8%
6 3
 
1.1%
0 2
 
0.7%
7 2
 
0.7%
9 2
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 3
42.9%
& 3
42.9%
# 1
 
14.3%
Space Separator
ValueCountFrequency (%)
284
100.0%
Close Punctuation
ValueCountFrequency (%)
) 107
100.0%
Open Punctuation
ValueCountFrequency (%)
( 107
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6219
85.0%
Common 786
 
10.7%
Latin 312
 
4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
369
 
5.9%
316
 
5.1%
241
 
3.9%
180
 
2.9%
176
 
2.8%
126
 
2.0%
103
 
1.7%
98
 
1.6%
87
 
1.4%
84
 
1.4%
Other values (426) 4439
71.4%
Latin
ValueCountFrequency (%)
G 93
29.8%
S 86
27.6%
C 14
 
4.5%
e 12
 
3.8%
L 11
 
3.5%
I 10
 
3.2%
U 8
 
2.6%
D 7
 
2.2%
O 6
 
1.9%
H 5
 
1.6%
Other values (27) 60
19.2%
Common
ValueCountFrequency (%)
284
36.1%
2 130
16.5%
) 107
 
13.6%
( 107
 
13.6%
5 106
 
13.5%
4 22
 
2.8%
1 7
 
0.9%
3 5
 
0.6%
. 3
 
0.4%
6 3
 
0.4%
Other values (6) 12
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6219
85.0%
ASCII 1098
 
15.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
369
 
5.9%
316
 
5.1%
241
 
3.9%
180
 
2.9%
176
 
2.8%
126
 
2.0%
103
 
1.7%
98
 
1.6%
87
 
1.4%
84
 
1.4%
Other values (426) 4439
71.4%
ASCII
ValueCountFrequency (%)
284
25.9%
2 130
11.8%
) 107
 
9.7%
( 107
 
9.7%
5 106
 
9.7%
G 93
 
8.5%
S 86
 
7.8%
4 22
 
2.0%
C 14
 
1.3%
e 12
 
1.1%
Other values (43) 137
12.5%
Distinct983
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2024-04-17T21:43:41.716635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length51
Mean length26.343434
Min length16

Characters and Unicode

Total characters26080
Distinct characters320
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique976 ?
Unique (%)98.6%

Sample

1st row경상남도 양산시 물금읍 증산역로 153. 정우프라자 111호
2nd row경상남도 양산시 물금읍 청운로 52-43
3rd row경상남도 양산시 물금읍 증산역로 162. 플러스타워 108호
4th row경상남도 양산시 물금읍 증산역로 135. 퍼스트조양 119호
5th row경상남도 양산시 물금읍 청운로 346. 네오리더스 1층 106호
ValueCountFrequency (%)
경상남도 990
 
17.5%
양산시 990
 
17.5%
물금읍 206
 
3.6%
1층 160
 
2.8%
동면 81
 
1.4%
삼호동 69
 
1.2%
중부동 65
 
1.1%
상북면 64
 
1.1%
101호 60
 
1.1%
평산동 59
 
1.0%
Other values (1211) 2909
51.5%
2024-04-17T21:43:42.126302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4912
18.8%
1280
 
4.9%
1 1274
 
4.9%
1192
 
4.6%
1106
 
4.2%
1088
 
4.2%
1026
 
3.9%
1022
 
3.9%
996
 
3.8%
875
 
3.4%
Other values (310) 11309
43.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15397
59.0%
Space Separator 4912
 
18.8%
Decimal Number 3959
 
15.2%
Close Punctuation 587
 
2.3%
Open Punctuation 587
 
2.3%
Other Punctuation 399
 
1.5%
Dash Punctuation 170
 
0.7%
Uppercase Letter 52
 
0.2%
Math Symbol 11
 
< 0.1%
Lowercase Letter 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1280
 
8.3%
1192
 
7.7%
1106
 
7.2%
1088
 
7.1%
1026
 
6.7%
1022
 
6.6%
996
 
6.5%
875
 
5.7%
527
 
3.4%
449
 
2.9%
Other values (270) 5836
37.9%
Uppercase Letter
ValueCountFrequency (%)
B 14
26.9%
L 7
13.5%
A 6
11.5%
D 3
 
5.8%
T 3
 
5.8%
E 3
 
5.8%
C 3
 
5.8%
N 2
 
3.8%
H 2
 
3.8%
G 2
 
3.8%
Other values (6) 7
13.5%
Decimal Number
ValueCountFrequency (%)
1 1274
32.2%
2 449
 
11.3%
0 417
 
10.5%
3 385
 
9.7%
4 295
 
7.5%
5 274
 
6.9%
6 245
 
6.2%
7 240
 
6.1%
8 193
 
4.9%
9 187
 
4.7%
Lowercase Letter
ValueCountFrequency (%)
e 2
33.3%
s 1
16.7%
u 1
16.7%
l 1
16.7%
h 1
16.7%
Other Punctuation
ValueCountFrequency (%)
. 393
98.5%
@ 4
 
1.0%
/ 1
 
0.3%
& 1
 
0.3%
Space Separator
ValueCountFrequency (%)
4912
100.0%
Close Punctuation
ValueCountFrequency (%)
) 587
100.0%
Open Punctuation
ValueCountFrequency (%)
( 587
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 170
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15397
59.0%
Common 10625
40.7%
Latin 58
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1280
 
8.3%
1192
 
7.7%
1106
 
7.2%
1088
 
7.1%
1026
 
6.7%
1022
 
6.6%
996
 
6.5%
875
 
5.7%
527
 
3.4%
449
 
2.9%
Other values (270) 5836
37.9%
Latin
ValueCountFrequency (%)
B 14
24.1%
L 7
12.1%
A 6
10.3%
D 3
 
5.2%
T 3
 
5.2%
E 3
 
5.2%
C 3
 
5.2%
N 2
 
3.4%
H 2
 
3.4%
G 2
 
3.4%
Other values (11) 13
22.4%
Common
ValueCountFrequency (%)
4912
46.2%
1 1274
 
12.0%
) 587
 
5.5%
( 587
 
5.5%
2 449
 
4.2%
0 417
 
3.9%
. 393
 
3.7%
3 385
 
3.6%
4 295
 
2.8%
5 274
 
2.6%
Other values (9) 1052
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15397
59.0%
ASCII 10683
41.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4912
46.0%
1 1274
 
11.9%
) 587
 
5.5%
( 587
 
5.5%
2 449
 
4.2%
0 417
 
3.9%
. 393
 
3.7%
3 385
 
3.6%
4 295
 
2.8%
5 274
 
2.6%
Other values (30) 1110
 
10.4%
Hangul
ValueCountFrequency (%)
1280
 
8.3%
1192
 
7.7%
1106
 
7.2%
1088
 
7.1%
1026
 
6.7%
1022
 
6.6%
996
 
6.5%
875
 
5.7%
527
 
3.4%
449
 
2.9%
Other values (270) 5836
37.9%

업소전화번호
Text

MISSING 

Distinct376
Distinct (%)79.5%
Missing517
Missing (%)52.2%
Memory size7.9 KiB
2024-04-17T21:43:42.331242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length9.9006342
Min length1

Characters and Unicode

Total characters4683
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique370 ?
Unique (%)78.2%

Sample

1st row055-372-3650
2nd row055-366-9644
3rd row055-382-9505
4th row055-367-0888
5th row02-1577-0711
ValueCountFrequency (%)
051-335-1501 4
 
1.0%
055-363-1500 2
 
0.5%
055-370-0178 2
 
0.5%
051-644-4675 2
 
0.5%
055-382-5123 2
 
0.5%
055-381-5129 1
 
0.3%
055-367-7977 1
 
0.3%
055-363-3999 1
 
0.3%
055-385-2422 1
 
0.3%
055-362-6987 1
 
0.3%
Other values (365) 365
95.5%
2024-04-17T21:43:42.622835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 976
20.8%
- 764
16.3%
0 598
12.8%
3 585
12.5%
8 369
 
7.9%
6 301
 
6.4%
7 268
 
5.7%
2 214
 
4.6%
1 192
 
4.1%
4 178
 
3.8%
Other values (3) 238
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3827
81.7%
Dash Punctuation 764
 
16.3%
Space Separator 91
 
1.9%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 976
25.5%
0 598
15.6%
3 585
15.3%
8 369
 
9.6%
6 301
 
7.9%
7 268
 
7.0%
2 214
 
5.6%
1 192
 
5.0%
4 178
 
4.7%
9 146
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 764
100.0%
Space Separator
ValueCountFrequency (%)
91
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4683
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 976
20.8%
- 764
16.3%
0 598
12.8%
3 585
12.5%
8 369
 
7.9%
6 301
 
6.4%
7 268
 
5.7%
2 214
 
4.6%
1 192
 
4.1%
4 178
 
3.8%
Other values (3) 238
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4683
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 976
20.8%
- 764
16.3%
0 598
12.8%
3 585
12.5%
8 369
 
7.9%
6 301
 
6.4%
7 268
 
5.7%
2 214
 
4.6%
1 192
 
4.1%
4 178
 
3.8%
Other values (3) 238
 
5.1%

Interactions

2024-04-17T21:43:40.540133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-04-17T21:43:40.626994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T21:43:40.689522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호업소명업소주소업소전화번호
01파이디(P.I.E.D) 뉴턴 전자담배 양산 물금 증산 가촌점경상남도 양산시 물금읍 증산역로 153. 정우프라자 111호<NA>
12상생부동산경상남도 양산시 물금읍 청운로 52-43<NA>
23핸즈독(HANDSDOG)경상남도 양산시 물금읍 증산역로 162. 플러스타워 108호<NA>
34세븐공인중개사사무소경상남도 양산시 물금읍 증산역로 135. 퍼스트조양 119호<NA>
45지에스25양산청운점경상남도 양산시 물금읍 청운로 346. 네오리더스 1층 106호055-372-3650
56신창필마트경상남도 양산시 물금읍 신주로 73. 상가동 110.112호<NA>
67지에스(GS)25 양산대방점경상남도 양산시 물금읍 야리로 90. 313동 108호 (양산 대방노블랜드 연리지(3차))055-366-9644
78남락구판장경상남도 양산시 동면 남락1길 2. 남락구판장<NA>
89지에스(GS)25양산이지2차점경상남도 양산시 물금읍 물금로 75. 220동 1층 101호 (이지더원 그랜드파크)<NA>
910석계샷시유리공업사경상남도 양산시 상북면 상북중앙로 405<NA>
번호업소명업소주소업소전화번호
980981엘지전자부품새마을금고경상남도 양산시 북정동 191호055-370-2550
981982재흥슈퍼경상남도 양산시 북안남3길 39 (북부동)
982983양동상회경상남도 양산시 장터1길 5-16 (중부동)055-385-4470
983984민마우트 양산대리점경상남도 양산시 북부동 422-4호
984985단골상회경상남도 양산시 삼일로 120 (중부동)055-386-2195
985986담배경상남도 양산시 신기6길 7 (신기동)
986987샘터슈퍼경상남도 양산시 일동1길 13 (중부동)055-287-2727
987988영주상회경상남도 양산시 소주로 54 (소주동)<NA>
988989물금농협연쇄점경상남도 양산시 물금읍 물금중앙길 4055-382-0330
989990롯데칠성음료신협양산분점경상남도 양산시 양산대로 1060 (북정동)055-385-5585