Overview

Dataset statistics

Number of variables4
Number of observations264
Missing cells130
Missing cells (%)12.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.6 KiB
Average record size in memory33.5 B

Variable types

Numeric1
Text3

Dataset

Description부산광역시해운대구_안전상비의약품판매업현황_20210723
Author부산광역시 해운대구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3075597

Alerts

전화번호 has 130 (49.2%) missing valuesMissing
순번 has unique valuesUnique
판매점포명 has unique valuesUnique
소재지(도로명) has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:00:25.649354
Analysis finished2023-12-10 17:00:26.353373
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct264
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean132.5
Minimum1
Maximum264
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2023-12-11T02:00:26.437334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile14.15
Q166.75
median132.5
Q3198.25
95-th percentile250.85
Maximum264
Range263
Interquartile range (IQR)131.5

Descriptive statistics

Standard deviation76.354437
Coefficient of variation (CV)0.5762599
Kurtosis-1.2
Mean132.5
Median Absolute Deviation (MAD)66
Skewness0
Sum34980
Variance5830
MonotonicityStrictly increasing
2023-12-11T02:00:26.657386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
183 1
 
0.4%
169 1
 
0.4%
170 1
 
0.4%
171 1
 
0.4%
172 1
 
0.4%
173 1
 
0.4%
174 1
 
0.4%
175 1
 
0.4%
176 1
 
0.4%
Other values (254) 254
96.2%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
264 1
0.4%
263 1
0.4%
262 1
0.4%
261 1
0.4%
260 1
0.4%
259 1
0.4%
258 1
0.4%
257 1
0.4%
256 1
0.4%
255 1
0.4%

판매점포명
Text

UNIQUE 

Distinct264
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-11T02:00:26.960250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length10.962121
Min length6

Characters and Unicode

Total characters2894
Distinct characters240
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique264 ?
Unique (%)100.0%

Sample

1st rowGS25해운오션점
2nd row씨유 반여협성점
3rd row씨유 센텀타워메디컬점
4th row세븐일레븐 해운대해피점
5th row미니스톱 부산팔레드시즈점
ValueCountFrequency (%)
씨유 65
 
13.4%
gs25 50
 
10.3%
세븐일레븐 30
 
6.2%
주)코리아세븐 21
 
4.3%
지에스(gs)25 12
 
2.5%
미니스톱 11
 
2.3%
cu 11
 
2.3%
이마트24 7
 
1.4%
지에스25 4
 
0.8%
해운대점 2
 
0.4%
Other values (262) 271
56.0%
2023-12-11T02:00:27.608001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
248
 
8.6%
220
 
7.6%
2 97
 
3.4%
86
 
3.0%
85
 
2.9%
83
 
2.9%
5 83
 
2.9%
83
 
2.9%
S 82
 
2.8%
G 78
 
2.7%
Other values (230) 1749
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2201
76.1%
Space Separator 220
 
7.6%
Uppercase Letter 198
 
6.8%
Decimal Number 191
 
6.6%
Open Punctuation 42
 
1.5%
Close Punctuation 42
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
248
 
11.3%
86
 
3.9%
85
 
3.9%
83
 
3.8%
83
 
3.8%
76
 
3.5%
67
 
3.0%
67
 
3.0%
53
 
2.4%
52
 
2.4%
Other values (213) 1301
59.1%
Uppercase Letter
ValueCountFrequency (%)
S 82
41.4%
G 78
39.4%
C 16
 
8.1%
U 16
 
8.1%
K 2
 
1.0%
W 1
 
0.5%
I 1
 
0.5%
H 1
 
0.5%
R 1
 
0.5%
Decimal Number
ValueCountFrequency (%)
2 97
50.8%
5 83
43.5%
4 9
 
4.7%
9 1
 
0.5%
1 1
 
0.5%
Space Separator
ValueCountFrequency (%)
220
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2201
76.1%
Common 495
 
17.1%
Latin 198
 
6.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
248
 
11.3%
86
 
3.9%
85
 
3.9%
83
 
3.8%
83
 
3.8%
76
 
3.5%
67
 
3.0%
67
 
3.0%
53
 
2.4%
52
 
2.4%
Other values (213) 1301
59.1%
Latin
ValueCountFrequency (%)
S 82
41.4%
G 78
39.4%
C 16
 
8.1%
U 16
 
8.1%
K 2
 
1.0%
W 1
 
0.5%
I 1
 
0.5%
H 1
 
0.5%
R 1
 
0.5%
Common
ValueCountFrequency (%)
220
44.4%
2 97
19.6%
5 83
 
16.8%
( 42
 
8.5%
) 42
 
8.5%
4 9
 
1.8%
9 1
 
0.2%
1 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2201
76.1%
ASCII 693
 
23.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
248
 
11.3%
86
 
3.9%
85
 
3.9%
83
 
3.8%
83
 
3.8%
76
 
3.5%
67
 
3.0%
67
 
3.0%
53
 
2.4%
52
 
2.4%
Other values (213) 1301
59.1%
ASCII
ValueCountFrequency (%)
220
31.7%
2 97
14.0%
5 83
 
12.0%
S 82
 
11.8%
G 78
 
11.3%
( 42
 
6.1%
) 42
 
6.1%
C 16
 
2.3%
U 16
 
2.3%
4 9
 
1.3%
Other values (7) 8
 
1.2%

전화번호
Text

MISSING 

Distinct134
Distinct (%)100.0%
Missing130
Missing (%)49.2%
Memory size2.2 KiB
2023-12-11T02:00:27.985269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.014925
Min length12

Characters and Unicode

Total characters1610
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique134 ?
Unique (%)100.0%

Sample

1st row051-747-3362
2nd row051-781-5083
3rd row051-741-3338
4th row051-531-8332
5th row051-542-5033
ValueCountFrequency (%)
051-545-6636 1
 
0.7%
051-782-0217 1
 
0.7%
051-704-2832 1
 
0.7%
051-522-1681 1
 
0.7%
051-542-2603 1
 
0.7%
051-522-3559 1
 
0.7%
051-747-5508 1
 
0.7%
051-746-8851 1
 
0.7%
051-742-8872 1
 
0.7%
051-703-4595 1
 
0.7%
Other values (124) 124
92.5%
2023-12-11T02:00:28.615977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 268
16.6%
1 239
14.8%
0 216
13.4%
5 212
13.2%
7 180
11.2%
4 122
7.6%
3 98
 
6.1%
2 83
 
5.2%
6 76
 
4.7%
8 70
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1342
83.4%
Dash Punctuation 268
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 239
17.8%
0 216
16.1%
5 212
15.8%
7 180
13.4%
4 122
9.1%
3 98
7.3%
2 83
 
6.2%
6 76
 
5.7%
8 70
 
5.2%
9 46
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 268
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1610
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 268
16.6%
1 239
14.8%
0 216
13.4%
5 212
13.2%
7 180
11.2%
4 122
7.6%
3 98
 
6.1%
2 83
 
5.2%
6 76
 
4.7%
8 70
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1610
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 268
16.6%
1 239
14.8%
0 216
13.4%
5 212
13.2%
7 180
11.2%
4 122
7.6%
3 98
 
6.1%
2 83
 
5.2%
6 76
 
4.7%
8 70
 
4.3%
Distinct264
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-11T02:00:28.966879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length43
Mean length35.863636
Min length22

Characters and Unicode

Total characters9468
Distinct characters245
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique264 ?
Unique (%)100.0%

Sample

1st row부산광역시 해운대구 해운대해변로209번길 8-3(우동)
2nd row부산광역시 해운대구 해운대로38번길 60, 105동 101-1호 (재송동, 센텀협성르네상스타운)
3rd row부산광역시 해운대구 센텀2로 20, 센텀타워메디컬 1층 105호 (우동)
4th row부산광역시 해운대구 해운대로 602, 1층 (우동)
5th row부산광역시 해운대구 해운대해변로298번길 24, 팔레드시즈 (중동)
ValueCountFrequency (%)
부산광역시 264
 
15.6%
해운대구 264
 
15.6%
우동 70
 
4.1%
중동 49
 
2.9%
좌동 48
 
2.8%
1층 36
 
2.1%
반여동 33
 
1.9%
재송동 32
 
1.9%
101호 25
 
1.5%
반송동 19
 
1.1%
Other values (501) 853
50.4%
2023-12-11T02:00:29.514328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1429
 
15.1%
1 492
 
5.2%
383
 
4.0%
380
 
4.0%
377
 
4.0%
344
 
3.6%
, 301
 
3.2%
292
 
3.1%
278
 
2.9%
274
 
2.9%
Other values (235) 4918
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5601
59.2%
Decimal Number 1547
 
16.3%
Space Separator 1429
 
15.1%
Other Punctuation 301
 
3.2%
Open Punctuation 264
 
2.8%
Close Punctuation 264
 
2.8%
Dash Punctuation 40
 
0.4%
Math Symbol 12
 
0.1%
Uppercase Letter 10
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
383
 
6.8%
380
 
6.8%
377
 
6.7%
344
 
6.1%
292
 
5.2%
278
 
5.0%
274
 
4.9%
268
 
4.8%
265
 
4.7%
264
 
4.7%
Other values (212) 2476
44.2%
Decimal Number
ValueCountFrequency (%)
1 492
31.8%
2 214
13.8%
0 214
13.8%
3 130
 
8.4%
4 100
 
6.5%
5 90
 
5.8%
7 85
 
5.5%
6 79
 
5.1%
8 75
 
4.8%
9 68
 
4.4%
Uppercase Letter
ValueCountFrequency (%)
C 3
30.0%
E 2
20.0%
S 1
 
10.0%
H 1
 
10.0%
A 1
 
10.0%
P 1
 
10.0%
B 1
 
10.0%
Space Separator
ValueCountFrequency (%)
1429
100.0%
Other Punctuation
ValueCountFrequency (%)
, 301
100.0%
Open Punctuation
ValueCountFrequency (%)
( 264
100.0%
Close Punctuation
ValueCountFrequency (%)
) 264
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%
Math Symbol
ValueCountFrequency (%)
~ 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5601
59.2%
Common 3857
40.7%
Latin 10
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
383
 
6.8%
380
 
6.8%
377
 
6.7%
344
 
6.1%
292
 
5.2%
278
 
5.0%
274
 
4.9%
268
 
4.8%
265
 
4.7%
264
 
4.7%
Other values (212) 2476
44.2%
Common
ValueCountFrequency (%)
1429
37.0%
1 492
 
12.8%
, 301
 
7.8%
( 264
 
6.8%
) 264
 
6.8%
2 214
 
5.5%
0 214
 
5.5%
3 130
 
3.4%
4 100
 
2.6%
5 90
 
2.3%
Other values (6) 359
 
9.3%
Latin
ValueCountFrequency (%)
C 3
30.0%
E 2
20.0%
S 1
 
10.0%
H 1
 
10.0%
A 1
 
10.0%
P 1
 
10.0%
B 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5601
59.2%
ASCII 3867
40.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1429
37.0%
1 492
 
12.7%
, 301
 
7.8%
( 264
 
6.8%
) 264
 
6.8%
2 214
 
5.5%
0 214
 
5.5%
3 130
 
3.4%
4 100
 
2.6%
5 90
 
2.3%
Other values (13) 369
 
9.5%
Hangul
ValueCountFrequency (%)
383
 
6.8%
380
 
6.8%
377
 
6.7%
344
 
6.1%
292
 
5.2%
278
 
5.0%
274
 
4.9%
268
 
4.8%
265
 
4.7%
264
 
4.7%
Other values (212) 2476
44.2%

Interactions

2023-12-11T02:00:25.995161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T02:00:26.176069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:00:26.312456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번판매점포명전화번호소재지(도로명)
01GS25해운오션점051-747-3362부산광역시 해운대구 해운대해변로209번길 8-3(우동)
12씨유 반여협성점<NA>부산광역시 해운대구 해운대로38번길 60, 105동 101-1호 (재송동, 센텀협성르네상스타운)
23씨유 센텀타워메디컬점<NA>부산광역시 해운대구 센텀2로 20, 센텀타워메디컬 1층 105호 (우동)
34세븐일레븐 해운대해피점<NA>부산광역시 해운대구 해운대로 602, 1층 (우동)
45미니스톱 부산팔레드시즈점<NA>부산광역시 해운대구 해운대해변로298번길 24, 팔레드시즈 (중동)
56지에스(GS)25해운재송점051-781-5083부산광역시 해운대구 해운대로123번길 32 (재송동)
67미니스톱 부산송정해변점<NA>부산광역시 해운대구 송정중앙로6번길 118 (송정동)
78지에스25반여재반로점<NA>부산광역시 해운대구 재반로226번길 15, 1층 (반여동)
89지에스(GS)25 센텀그린점<NA>부산광역시 해운대구 센텀중앙로 78, 센텀그린타워 101호 (우동)
910GS25더샵센텀스타점<NA>부산광역시 해운대구 센텀동로 123, 상가14호 (재송동, 더샵센텀스타아파트)
순번판매점포명전화번호소재지(도로명)
254255씨유 센텀월드마크점<NA>부산광역시 해운대구 센텀동로 25, 131호 (우동, 센텀월드마크상가)
255256GS25 해운대9점051-783-5711부산광역시 해운대구 재반로 208 (반여동)
256257GS25 해운벽산점051-701-2133부산광역시 해운대구 세실로 48, 104동 116호 (좌동, 삼정코아상가)
257258GS2 문탠로드점051-746-2554부산광역시 해운대구 달맞이길62번길 2 (중동)
258259GS25 해운온천점051-746-6925부산광역시 해운대구 중동2로 12 (중동)
259260세븐일레븐 재송센트럴시티점070-4412-8987부산광역시 해운대구 재반로 118, 102호 (재송동)
260261세븐일레븐 부산재송제일점<NA>부산광역시 해운대구 재반로 123 (재송동)
261262GS25 트럼프월드점051-741-3291부산광역시 해운대구 센텀동로 9, 106호 (우동)
262263GS25 해운제니스점051-746-1496부산광역시 해운대구 마린시티2로 33, 104동 121호 (우동, 두산제니스스퀘어)
263264세븐일레븐 재송중앙점051-784-2283부산광역시 해운대구 재송1로 34 (재송동)